Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2022 Sep 29;121(21):4119–4127. doi: 10.1016/j.bpj.2022.09.031

Phase separation in amino acid mixtures is governed by composition

David De Sancho 1,
PMCID: PMC9675019  PMID: 36181270

Abstract

Macromolecular phase separation has recently come to immense prominence as it is central to the formation of membraneless organelles, leading to a new paradigm of cellular organization. This type of phase transition, often termed liquid-liquid phase separation (LLPS), is mediated by molecular interactions between biomolecules, including nucleic acids and both ordered and disordered proteins. In the latter case, the separation between protein-dense and -dilute phases is often interpreted using models adapted from polymer theory. Specifically, the “stickers and spacers” model proposes that the formation of condensate-spanning networks in protein solutions originates from the interplay between two classes of residues and that the main determinants for phase separation are multivalency and sequence patterning. The duality of roles of stickers (aromatics like Phe and Tyr) and spacers (Gly and polar residues) may apply more broadly in protein-like mixtures, and the presence of these two types of components alone may suffice for LLPS to take place. In order to explore this hypothesis, we use atomistic molecular dynamics simulations of capped amino acid residues as a minimal model system. We study the behavior of pure amino acids in water for three types of residues corresponding to the spacer and sticker categories and of their multicomponent mixtures. In agreement with previous observations, we find that the spacer-type amino acids fail to phase separate on their own, while the sticker is prone to aggregation. However, ternary amino acid mixtures involving both types of amino acids phase separate into two phases that retain intermediate degrees of compaction and greater fluidity than sticker-only condensates. Our results suggest that LLPS is an emergent property of amino acid mixtures determined by composition.

Significance

Liquid-liquid phase separation has become a dominant theme in the scientific literature across fields, as it is a type of phase transition that provides a means for organisms to organize their contents. This process is triggered by interactions between intervening molecules, mainly proteins and nucleic acids, and those with the solvent. A recently proposed paradigm is that of the “stickers and spacers” model, whereby components of phase-separating mixtures may adopt either of these roles. Inspired by design principles derived from recent experimental work on low-complexity regions of prion-like domains, here we aim to characterize the simplest possible model system that accommodates the sticker and spacer description, i.e., a three-component solution of amino acids, using atomistic molecular dynamics simulations. Despite its simplicity, the system recapitulates many properties of more complex condensates.

Introduction

Cellular organisms are well known to use lipid membranes to compartmentalize their many functions into organelles. In the last decade, it has become apparent that organelles lacking membranes are also ubiquitous in cells (1). These membraneless organelles include the nucleolus, stress granules, nuclear speckles, and Cajal bodies, among many others. They are rich in proteins and also nucleic acids (2), which form molecular condensates as they undergo a type of phase transition usually called liquid-liquid phase separation (LLPS). This term is used because, as a result, a dilute and a concentrated phase are formed, both of which have liquid-like properties such as fusion into larger droplets, flow in response to stress, or wetting and dropping behaviors typical of liquids (3). In fact, LLPS may be more accurately described as the formation of condensate-spanning networks, due to the interplay between the propensity of the biomolecule to both phase separate and percolate (4,5). When condensates are formed following phase separation coupled to percolation, they are viscoelastic in nature rather than simple liquids (6).

The arrival to prominence of membraneless organelles has sparked considerable interest in the types of molecules undergoing LLPS and the physical principles governing this process (7). Although this type of phase transition has been reported for modular folded domains (8), many proteins that phase separate are intrinsically disordered or have intrinsically disordered regions. These intrinsically disordered proteins or intrinsically disordered regions have strong biases in sequence composition (9, 10, 11), as in the case of the low-complexity regions (LCRs) that are abundant in biomolecular condensates (12).

Because polypeptide chains are often the drivers of LLPS, concepts from polymer physics have been borrowed and adapted to interpret experimental results (3). One such model is the stickers and spacers model, where different types of modules come to play different roles in the polymer solution (7). In the case of intrinsically disordered proteins or intrinsically disordered regions, aromatic amino acid residues like Tyr or Phe behave as stickers that can form intra- and intermolecular contacts, while glycine and polar amino acid residues are spacers that do not have strong interaction patterns. This simple model has been used successfully to interpret experiments on prion-like low-complexity domains, which are frequently involved in the formation of condensates (13). Both experiments and simulations show that specific patterns of stickers and spacers in polypeptide sequences are important determinants of phase separation (10,11,13, 14, 15). Specifically, both randomly and regularly distributed stickers result in the formation of liquid-like condensates, while patchy regions with multiple stickers clustered together result in the formation of amorphous precipitates (13).

The duality from the stickers and spacers model for different amino acid types may apply more broadly, even in the absence of an ordered polypeptide sequence where multivalency and patterning effects become important. In fact, composition alone may be sufficient to drive phase separation in the absence of additional information. Here, we explore this possibility using atomistic molecular dynamics (MD) simulations. Molecular simulations with atomistic resolution of condensate formation have been attempted by others before (16, 17, 18). These simulations often involve multiple protein chains and large simulation boxes, which unfortunately cannot be run for sufficiently long timescales for observing phase separation or computing equilibrium properties. Alternatively, much more computationally cheap coarse-grained simulation models can recapitulate general principles of phase separation of disordered proteins (9,19, 20, 21, 22, 23), although details of chemical interactions are inevitably lost. An intermediate possibility is to use coarse-grained models with explicit solvent that may more faithfully account for hydration properties of condensates (24, 25, 26). Even in this case, protein-solvent interactions may need to be rebalanced and differences in the solvation entropy contribution need careful consideration due to reduction in the number of degrees of freedom from the coarse graining (24).

In this work, we run atomistic MD simulations using amino acid mixtures as a minimal, albeit detailed, model for macromolecular phase separation in protein-like systems. This type of simulation has been carried out in the past to study crowding and peptide solubility (27, 28, 29), although often in single component boxes. Because of the small size of amino acid residues, we can run direct coexistence simulations in solutions of hundreds of molecules. Specifically, we attempt to capture the behavior that emerges from mixing amino acids of the sticker and spacer types, using compositions inspired by those observed in LCRs. Specifically, we choose Ser and Gly, which make up over 50% of the sequence in some LCRs (12), as representative spacers and the aromatic Tyr, which has been shown to be key for phase separation (14), as a sticker. From our simulations, we conclude that LLPS is an emergent property of amino acid mixtures driven by composition.

Materials and methods

System preparation

We have used the Packmol software (30) to introduce multiple copies of acetylated and amidated amino acid residues, usually termed dipeptides, in elongated boxes with longest dimensions (Lx) between 10 and 20 nm, and Ly=Lz=3.5 nm. We generate boxes for pure dipeptide solutions for Gly, Ser, and Tyr residues, as well as stoichiometric binary and ternary mixtures. Target concentrations for these systems were between 1 and 2.5 M (see Tables S1–S4). Amino acids were built using the leap program from the Amber suite (31) and then solvated to fill the simulation boxes. For most of our simulations, the Amber 99SB-ILDN force field (32), together with the TIP3P water model (33), was used for all systems. We have run additional simulations using two other force fields from the Amber family, ff03 (34) and ff99SB-disp (35).

MD simulations

For equilibrium simulations, we followed the same simulation workflow irrespective of molecular system, box size, or force field combination. The amino acid boxes were solvated, energy minimized, and equilibrated in the NVT and NPT ensembles at 300 K. Equilibration involved an initial energy minimization step using a steepest descent algorithm, a short (500 ps) NVT simulation with position restraints on the protein heavy atoms to equilibrate the water molecules at 300 K, and a 1 ns NPT run to converge the density to the corresponding value at 1 Pa using the Berendsen barostat (36). Long (500 ns to 1 μs) production simulations were run for all systems at 300 K in the NVT ensemble. In all cases, a stochastic velocity-rescaling thermostat was used (37) with a time step of 2 fs. Long-range electrostatics were calculated using the particle-mesh Ewald method (38). We used the GROMACS program (39) (v.2021) to run all simulations.

For the boxes with Lx=10 nm as the longest dimension, we started our simulations from a random distribution of blocked amino acids. For longer boxes of Lx=20 nm, where a random mixture may easily result in the independent nucleation of condensates at multiple positions in the box, we ran direct coexistence simulations, where a preformed condensate was first simulated in a shorter (Lx=8 nm long) box. Then, the box was expanded to the final 20 nm length, filled with solvent molecules, and equilibrated in the NPT ensemble before the production run. In order to estimate dynamical properties of the resulting dense and dilute phases for selected systems, we sliced snapshots from the equilibrium runs and simulated them independently. In addition to the equilibrium runs, we have used the temperature replica exchange method to estimate the phase diagram of the Gly/Ser/Tyr equimolar mixture in the simulation box with 20 nm as the longest dimension. Forty replicas with temperatures ranging between 288 and 444 K were initialized from the system equilibrated at 300 K and run in parallel for 300 ns with attempted replica swaps every 1,000 integration time steps. The first 100 ns from each replica were discarded from the analysis.

Analysis of the simulations

We have analyzed the simulation results using a combination of the suite of programs included as part of the GROMACS package (39) and in-house Python scripts that make ample use of the MDtraj library (40). Molecular visualization was performed using the VMD program (41). Specifically, the solvent-accessible surface area was calculated using the gmx sasa program (42). For the spontaneous phase-separating systems, the resulting values were employed to estimate the collapse degree (43), defined as the ratio between the solvent-accessible surface area in the initial configuration and that at a given time t. The number and distribution of molecular clusters were estimated using the gmx clustsize tool using a cutoff of 7 Å. The clustering degree was defined as the size of the largest cluster. For the collapse and clustering degrees, we estimated errors from block averaging, leaving out the initial segment of the simulation where the collapse takes place.

We calculated density profiles across our simulation boxes using the gmx density program. Phase diagrams were determined from the temperature replica exchange data using the methods described by Dignon et al. (9,19). For each temperature where we observed coexistence, we estimated the protein densities for the protein-dense and -dilute phases, ρH and ρL, respectively. The critical temperature was then fitted from the equation

ρHρL=A(TcT)β, (1)

where β=0.325 is the critical exponent (9,19) and the critical density is derived from the law of rectilinear diameters

(ρH+ρL)/2=ρc+C(TcT) (2)

with positive fitting constant C.

Diffusion coefficients were calculated using the Einstein relationship, |r(t+τ)r(t)|2t=6Dτ, where the brackets denote averages over different starting points, t. We limited the fits of the diffusion coefficient to the range of times 1 to 5 ns, where there are no deviations from linearity. Autocorrelation functions for the torsion angles were calculated from the simulation trajectories using the expression by Van der Spoel and Berendsen (44):

C(t)=cosθ(τ)cosθ(τ+t)+sinθ(τ)sinθ(τ+t)τ, (3)

where θ is the angle of interest and τ is the lag time.

Results and discussion

Composition determines phase behavior in amino acid solutions

First, we focus on a set of simulations of pure amino acid solutions, where copies of the capped amino acid molecules were randomly inserted in 10 × 3.5 × 3.5 nm simulation boxes for residues of the spacer (Gly, Ser) and sticker (Tyr) types at concentrations between 1 and 2.5 M (see Fig. 1). In the case of the Gly boxes, we find that the peptide remains soluble even at the largest concentration for the full duration of the MD simulations, resulting in flat density profiles. For Ser, the dipeptide solution is mostly homogeneous up to 2 M, while at 2.5 M, some regions become enriched in amino acid (Fig. 1, second column). Failing to form the condensate could be due to insufficient simulation time. For this reason, we have run an additional simulation of Ser at 2.5 M, but this time starting from a preformed condensate. As monitored by the increase in the solvent-accessible surface area (SASA), the slab rapidly dissolves (see Fig. S1). On the contrary, in the case of Tyr, we find phase separation across all the concentrations explored, resulting in the formation of a slab (Fig. 1, center column). The differences observed between Gly/Ser and Tyr are consistent with the very low solubility reported for experiments on Tyr in its zwitterionic state (45) and suggest that very small amounts of this amino acid may have drastic effects in properties of mixtures, as indeed has been found experimentally (14).

Figure 1.

Figure 1

Condensate formation in single and multicomponent amino acid systems. (A and C) Snapshots from equilibrium simulations at 1 M and 2.5 M. Left to right: Gly (white), Ser (yellow), and Tyr (green) dipeptides and their Gly/Ser and Gly/Ser/Tyr mixtures. (B) Density profiles along the longest axis of the simulation boxes for the pure dipeptide solutions at different concentrations for Gly, Ser, Tyr, and their Gly/Ser and Gly/Ser/Tyr mixtures. Different colors correspond to averages over 50 ns blocks in the 500 ns simulations.

We then repeat the same type of calculations but in this case for amino acid mixtures of spacers and stickers (see Fig. 1, fourth and fifth columns). We first look at equimolecular combinations of Gly and Ser for total amino acid concentrations between 1 and 2.5 M, as before (hence ranging between 0.5 and 1.25 M for each component). We find that these mixtures retain the properties of the single amino acid solutions of their constituents. At all concentrations probed in our calculations, the density profiles remain flat, which is consistent with the mixture exhibiting properties intermediate between their individual components. However, simulations for the same range of total concentration with stoichiometric amounts of Tyr result in the separation of the amino acid solution into dilute and dense phases.

This result may be a consequence of the protein force field and water model combination used in our simulations. Although the Amber ff99SB-ILDN force field with TIP3P water appropriately reproduces experimental solution densities in simulations of crowded amino acids (27), it also induces compact states in proteins (46). We have run simulations with two other optimized force fields of the Amber family at a 2 M amino acid concentration (see Fig. S2). Using the ff03 force field (34) with TIP3P water, the density profiles for the Gly/Ser/Tyr mixture are comparable to those obtained using ff99SB-ILDN, indicating that our result applies beyond our chosen force field and water model combination. With the more recently derived a99SB-disp force field and water (32), the amino acid mixture fails to phase separate. The latter result can be attributed to the same reasons that prevent this specific force field and water model combination from adequately capturing protein-protein binding, which led to its reparametrization (47). In any case, the properties of the simulated amino acid condensates will depend on the choice of the force field.

Clustering degree and aggregation propensity as metrics of condensate formation

In all systems for which we report the formation of condensates (i.e., Tyr and Gly/Ser/Tyr), the initial collapse takes place within the first 50–100 ns of the simulation run. After this initial collapse, the systems sample the equilibrium distribution for the remainder of the 500 ns runs, suggesting good convergence in our simulations (see Fig. S3). In Fig. 2 A, we show the time series for the SASA for all systems at 1.5 M concentration. In the case of the Gly/Ser/Tyr mixture, SASA decreases to a value intermediate between that of the Gly/Ser solutions and that of the Tyr simulation box. This result occurs consistently across the full range of concentrations explored in our simulations (see Fig. S3). This type of intermediate behavior for the Gly/Ser/Tyr mixture is also observed for the maximum cluster size, which has also been shown to be a robust descriptor of the degree of clustering (26) (Fig. 2 A). We note that also the time series data for the maximum cluster size and the average number of clusters seem stable after the initial 100 ns of simulation, which gives further reassurance on the convergence of the results (see Figs. S4 and S5). The only exception is the 2 M solution of Tyr, which nucleates in two different points along the simulation box, forming two slabs that finally merge at ∼250 ns.

Figure 2.

Figure 2

Aggregation and clustering in single and multicomponent systems. (A) Time series data for the solvent-accessible surface area (left) and the maximum cluster size (right) for all systems in the 1.5 M solutions. (B) Projection of all the simulation datasets on the aggregation propensity and clustering degree. Different symbols represent total peptide concentrations (circles: 1 M; squares: 1.5 M; triangles: 2 M; diamonds: 2.5 M). Error bars were derived from block averaging.

Two useful metrics that summarize the information from these simulations are the “aggregation propensity,” defined as the ratio between the initial and instantaneous SASAs (43), and the “clustering degree,” which we obtain by normalizing the maximum cluster size by the total number of residues in the box (26). In the projection of our data sets on these axes (Fig. 2 B), the Gly/Ser/Tyr system stands out as having intermediate values of the aggregation propensity and clustering degree between the systems that do not phase separate (Gly, Ser, and Gly/Ser mixture) and the Tyr boxes. This result suggests a qualitatively different behavior between the sticker-only or spacer-only solutions and the mixture of stickers with spacers.

Densities from coexistence simulations

Despite the interesting differences identified from these simulation runs, the relatively small size of our simulation boxes may lead to finite size effects. This is noticeable, particularly, for the lowest concentration (1 M). As a result, for the Gly/Ser/Tyr mixture, the density peaks at different values depend on the target concentration. In order to address the severity of these problems, we have run additional simulations for longer simulation boxes with the longest dimension of Lx=14 nm. For the 2 M boxes of all single and multicomponent systems, we find consistent results irrespective of box size (see Fig. S6). For the multicomponent Gly/Ser/Tyr system, which we are most interested in, we have explored the dependence with the box length across concentrations. For even longer boxes, we often find simultaneous nucleation of condensates at multiple points along the X axis that fail to merge within the simulation time.

In order to accurately determine the densities of amino acid condensates, we resort to the slab method, where we start from a preequilibrated biomolecule-dense phase that is put in contact with bulk water (see materials and methods). Specifically, we run simulations using the temperature-replica exchange method for long boxes of Lx=20 nm at 2 M total amino acid concentrations, from which we calculate density profiles (see Fig. 3). We observe the expected temperature dependence for both protein and water densities, with gradually disappearing slabs upon increasing temperature. The results are consistent with those presented above for the three component mixtures in the shorter boxes. At room temperature, the densities of the protein-dilute and protein-dense phases are, respectively, ∼50 and ∼1,040 mg/mL, respectively. Conversely, the water density in the condensate is only ∼180 mg/mL. These values for the densities are more extreme than those observed for the FUS low-complexity domain and LAF-1 RGG protein in experiments and previous atomistic simulations, where condensates were typically much more hydrated (e.g., having ∼500 mg/mL protein and ∼600 mg/mL water in the dense phase (17)). This difference may be due to the tighter packing that becomes possible in our systems, due to the reduction in excluded volume effects from a covalently bonded polypeptide chain and may also be influenced by the combination of force field and water model.

Figure 3.

Figure 3

Density profiles and phase diagram for GSY. (A) Density profiles from replica-exchange molecular dynamics simulations of the ternary Gly/Ser/Tyr equimolecular mixture at 2 M. Blue and red lines correspond to the lowest and highest temperatures, respectively. Inset: density profiles for Gly, Ser, and Tyr at room temperature. (B) Density profiles for water molecules for all replicas. Lighter colors correspond to higher temperatures. (C) Phase diagram derived from the replica-exchange molecular dynamics simulations (circles). The fitted critical point is shown as a square. Error bars were derived from block averaging.

Phase behavior in the ternary amino acid mixtures

We have seen that our results from simulations of very simple mixtures of capped sticker and spacer amino acids captured properties observed in many disordered protein segments, which experience a phase transition with an upper critical solution temperature (12). This type of phase behavior is recapitulated in our simulations. We use the densities of the dense and dilute phases (ρH and ρL, respectively) from the temperature-dependent density profiles obtained from the replica-exchange simulations and derive a phase diagram that can be fitted to extract a critical temperature of Tc=414 K (see Fig. 3 C).

So far, we have treated the amino acid mixtures as homogeneous protein solutions. However, lacking the connectivity of covalently bonded polypeptide chains, amino acids could be partitioning independently of each other. We have used the density profiles for Gly, Ser, and Tyr to calculate their individual compositions along the simulation boxes (see Fig. 3 A, inset). Although all three amino acids are enriched in the slab, the condensed phase is richer in Tyr than in Gly/Ser, which is consistent with the greater hydrophobicity of this amino acid. Conversely, the dilute phase is richer in Gly/Ser, while the Tyr density of this phase is almost negligible.

In order to map the composition dependence of the phase behavior, we have run 1 μs equilibrium simulations for amino acid mixtures, varying the ratio between stickers and spacers while retaining the total number of amino acid residues in the mixtures constant. From the densities of sticker and spacer types (ρY and ρGS, respectively) in the dilute and dense phases, we obtain the phase diagram shown in Fig. 4 A (we also show representative snapshots from simulations at different compositions in Fig. 4 B). The emerging picture is that of a “scaffold-client” system (48), where the spacer-type residues, which do not phase separate on their own, are recruited by the stickers. This is due to the greater strength of the homotypic interactions between tyrosine residues relative to every other pairwise interactions. However, heterotypic interactions between tyrosine and glycine/serine also contribute to the stability of the condensate. This is supported by the statistics of pairwise contacts among residues (which we show for the equimolar mixture in Fig. S7). While Tyr is the residue that forms more contacts, these are not exclusively homotypic, as Tyr-Ser and Tyr-Gly contacts are also relevant. We note that the relative weights of different types of contacts are comparable to that observed by Zheng and his co-workers in atomistic simulations of FUS LC and LAF1 RGG (i.e., with Tyr contributing twice as much as other residue types) (17). Our results support that spacers are not inert but are relevant actors in the phase behavior of protein condensates.

Figure 4.

Figure 4

Stickers and spacers phase diagram. (A) Two-component phase diagram from simulations of Gly/Ser/Tyr mixtures for different compositions, represented in terms of sticker and spacer densities (ρY and ρGS, respectively) for the dilute and dense phases. Different colors indicate simulated fraction percentage of Tyr within the mixture. Error bars were derived from block averaging. (B) Representative snapshots of the simulation for different compositions.

Slowdown in the dynamics within the molecular condensates

An important consideration is to what extent the condensates formed are aggregates or retain a degree of fluidity comparable to the amino acids in solution. Even if a neat distinction may be difficult to establish, we can inspect the dynamical properties in our simulations to clarify this point. Calculating these from the coexistence runs would result in averaging their values across different phases. For this reason, we have run separate simulations of the condensed phases derived from pure Tyr and Gly/Ser/Tyr slabs and from the dilute phase of the ternary mixture. First, we use Einstein’s relation to calculate diffusion constants, D, for the amino acid residues in the condensed phases (see Fig. 5 A). We observe that the value of D is considerably slower in the case of the Tyr condensate than in the Gly/Ser/Tyr mixture, with individual amino acid values commensurate with their size and stickiness. This result indicates a greater slowdown in the dynamics in the Tyr condensate than in the ternary mixture. We have also calculated D for the water molecules in the condensates, which exhibit a drastic slowdown with respect to D for water in the dilute phase (4.4 nm2/ns) (see Fig. 5 B). This effect is again more pronounced in the case of the Tyr condensate (3.7 × 10−2 nm2/ns) than in the ternary mixture (6.7 × 10−2 nm2/ns). These values are comparable to the slowest mode identified by Zheng and co-workers and attributed to water molecules interacting with protein atoms (17), which is an accurate description for the water molecules in our protein-rich condensates. From these simulations, we also analyze the tyrosine dynamics by looking at the autocorrelation functions of the Ramachandran angles (φ and ψ) and the χ1 side-chain torsion (see Fig. 5 CE). When compared with the dilute phase, the dynamics in the condensates are again slowed down but to a larger degree in the Tyr condensates than in the ternary mixtures.

Figure 5.

Figure 5

Dynamical properties in the condensates. (A) Mean squared displacement versus lag time for amino acid residues in condensate simulations. The solid line corresponds to a Tyr condensate, while dashed lines correspond to different components in the Gly/Ser/Tyr condensate. (B) Same for water molecules within the Tyr (green) and Gly/Ser/Tyr (purple) condensates. (CE) Autocorrelation functions for the Ramachandran angles (φ, ψ) and first side-chain torsion (χ) for Tyr residues in the Tyr condensate simulations (green) and in the Gly/Ser/Tyr simulations for the dense (pink) and dilute (brown) phases. Thin lines are autocorrelation functions from individual molecules, and thick lines are their averages.

Conclusions

In summary, in this work, we have shown with a minimalistic model system that LLPS is an emergent property of amino acid mixtures that is governed by composition. Considering a mixture of spacer residues (Gly/Ser) as a reference system—which is warranted by the abundance of these amino acids in phase-separating LCRs—we show that small amounts of a sticker (in this case, the rather insoluble Tyr amino acid) make their solutions separate into two phases. This suggests that the duality of amino acid types in the sticker and spacers model holds more generally than in the context of intrinsically disordered proteins and that the stoichiometry of the mixture alone may be predictive of the phase behavior. Provided a small number of aromatics or stickers is present, amino acids, peptides, or proteins may be able to spontaneously phase separate. The critical role of Tyr in our simulations is consistent with previous experiments on folded modules and intrinsically disordered regions (11,13,14).

The formation of aggregates as opposed to liquid-like condensates in the case of patchy sequences with clusters of stickers found experimentally (13) may be understood as being closer to the results from our simulations of Tyr-only boxes. In this case, we observe the formation of condensates with a greater degree of dynamical arrest, as monitored by both diffusion constants and intramolecular dynamics. The number of contacts formed by stickers remains stable after the formation of condensates, suggesting that aging or maturation are not taking place, at least in the timescales that we investigate (25,49).

Our work also confirms in full atomistic detail recent results by Tang et al. (26), who explored dipeptide solutions in simulations using the intermediate-resolution MARTINI model (50). Further evidence for the generality of LLPS is accumulating in the experimental front in a broad range of length scales, from short peptides (51,52) to globular proteins (8). The occurrence of macromolecular phase separation may hence turn out to be a general property of both peptides and proteins, determined by composition and further fine-tuned by multivalency and patterning (13), the presence of charged residues (11), nucleic acids (53,54), ionic strength (22), or crowding agents(55).

Acknowledgments

Financial support to D.D.S. comes from Eusko Jaurlaritza (Basque Government) through the project IT1254-19 and the Spanish Government through grants RYC-2016-19590 and PID2021-127907NB-I00 (MCIN/AEI/10.13039/501100011033/FEDER, UE). The author thanks Xabier López for useful discussions and Athi N. Naganathan and Robert B. Best for their comments on the manuscript. The author also acknowledges the staff at the DIPC Supercomputing Center for technical support.

Declaration of interests

The author declares no competing interests.

Editor: Samrat Mukhopadhyay.

Footnotes

Supporting material can be found online at https://doi.org/10.1016/j.bpj.2022.09.031.

Supporting material

Document S1. Figures S1–S4 and Figures S1–S7
mmc1.pdf (5.7MB, pdf)
Document S2. Article plus supporting material
mmc2.pdf (8MB, pdf)

References

  • 1.Banani S.F., Lee H.O., et al. Rosen M.K. Biomolecular condensates: organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol. 2017;18:285–298. doi: 10.1038/nrm.2017.7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Shin Y., Brangwynne C.P. Liquid phase condensation in cell physiology and disease. Science. 2017;357:eaaf4382. doi: 10.1126/science.aaf4382. [DOI] [PubMed] [Google Scholar]
  • 3.Brangwynne C., Tompa P., Pappu R. Polymer physics of intracellular phase transitions. Nat. Phys. 2015;11:899–904. [Google Scholar]
  • 4.Harmon T.S., Holehouse A.S., et al. Pappu R.V. Intrinsically disordered linkers determine the interplay between phase separation and gelation in multivalent proteins. Elife. 2017;6:e30294. doi: 10.7554/eLife.30294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Seim I., Posey A.E., et al. Gladfelter A.S. Dilute phase oligomerization can oppose phase separation and modulate material properties of a ribonucleoprotein condensate. Proc. Natl. Acad. Sci. USA. 2022;119 doi: 10.1073/pnas.2120799119. e2120799119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mittag T., Pappu R.V. A conceptual framework for understanding phase separation and addressing open questions and challenges. Mol. Cell. 2022;82:2201–2214. doi: 10.1016/j.molcel.2022.05.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Choi J.-M., Holehouse A.S., Pappu R.V. Physical principles underlying the complex biology of intracellular phase transitions. Annu. Rev. Biophys. 2020;49:107–133. doi: 10.1146/annurev-biophys-121219-081629. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Li P., Banjade S., et al. Rosen M.K. Phase transitions in the assembly of multivalent signalling proteins. Nature. 2012;483:336–340. doi: 10.1038/nature10879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Dignon G.L., Zheng W., et al. Mittal J. Sequence determinants of protein phase behavior from a coarse-grained model. PLoS Comput. Biol. 2018;14:e1005941. doi: 10.1371/journal.pcbi.1005941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Lin Y.-H., Forman-Kay J.D., Chan H.S. Theories for sequence-dependent phase behaviors of biomolecular condensates. Biochemistry. 2018;57:2499–2508. doi: 10.1021/acs.biochem.8b00058. [DOI] [PubMed] [Google Scholar]
  • 11.Bremer A., Farag M., et al. Mittag T. Deciphering how naturally occurring sequence features impact the phase behaviours of disordered prion-like domains. Nat. Chem. 2022;14:196–207. doi: 10.1038/s41557-021-00840-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Martin E.W., Mittag T. Relationship of sequence and phase separation in protein low-complexity regions. Biochemistry. 2018;57:2478–2487. doi: 10.1021/acs.biochem.8b00008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Martin E.W., Holehouse A.S., Mittag v.T. Valence and patterning of aromatic residues determine the phase behavior of prion-like domains. Science. 2020;367:694–699. doi: 10.1126/science.aaw8653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lin Y., Currie S.L., Rosen M.K. Intrinsically disordered sequences enable modulation of protein phase separation through distributed tyrosine motifs. J. Biol. Chem. 2017;292:19110–19120. doi: 10.1074/jbc.M117.800466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Holehouse A.S., Ginell G.M., et al. Böke E. Clustering of aromatic residues in prion-like domains can tune the formation, state, and organization of biomolecular condensates. Biochemistry. 2021;60:3566–3581. doi: 10.1021/acs.biochem.1c00465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rauscher S., Pomès R. The liquid structure of elastin. Elife. 2017;6:e26526. doi: 10.7554/eLife.26526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zheng W., Dignon G.L., et al. Mittal J. Molecular details of protein condensates probed by microsecond long atomistic simulations. J. Phys. Chem. B. 2020;124:11671–11679. doi: 10.1021/acs.jpcb.0c10489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Paloni M., Bailly R., et al. Barducci A. Unraveling molecular interactions in liquid–liquid phase separation of disordered proteins by atomistic simulations. J. Phys. Chem. B. 2020;124:9009–9016. doi: 10.1021/acs.jpcb.0c06288. [DOI] [PubMed] [Google Scholar]
  • 19.Dignon G.L., Zheng W., et al. Mittal J. Relation between single-molecule properties and phase behavior of intrinsically disordered proteins. Proc. Natl. Acad. Sci. USA. 2018;115:9929–9934. doi: 10.1073/pnas.1804177115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Espinosa J.R., Joseph J.A., et al. Collepardo-Guevara R. Liquid network connectivity regulates the stability and composition of biomolecular condensates with many components. Proc. Natl. Acad. Sci. USA. 2020;117:13238–13247. doi: 10.1073/pnas.1917569117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Regy R.M., Thompson J., et al. Mittal J. Improved coarse-grained model for studying sequence dependent phase separation of disordered proteins. Protein Sci. 2021;30:1371–1379. doi: 10.1002/pro.4094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Joseph J.A., Reinhardt A., et al. Collepardo-Guevara R. Physics-driven coarse-grained model for biomolecular phase separation with near-quantitative accuracy. Nat. Comput. Sci. 2021;1:732–743. doi: 10.1038/s43588-021-00155-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Tesei G., Schulze T.K., et al. Lindorff-Larsen K. Accurate model of liquid–liquid phase behavior of intrinsically disordered proteins from optimization of single-chain properties. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2111696118. e2111696118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Benayad Z., von Bülow S., et al. Hummer G. Simulation of FUS protein condensates with an adapted coarse-grained model. J. Chem. Theor. Comput. 2021;17:525–537. doi: 10.1021/acs.jctc.0c01064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Garaizar A., Espinosa J.R. Salt dependent phase behavior of intrinsically disordered proteins from a coarse-grained model with explicit water and ions. J. Chem. Phys. 2021;155:125103. doi: 10.1063/5.0062687. [DOI] [PubMed] [Google Scholar]
  • 26.Tang Y., Bera S., et al. Wei G. Prediction and characterization of liquid-liquid phase separation of minimalistic peptides. Cell Rep. Phys. Sci. 2021;2:100579. [Google Scholar]
  • 27.Andrews C.T., Elcock A.H. Molecular dynamics simulations of highly crowded amino acid solutions: comparisons of eight different force field combinations with experiment and with each other. J. Chem. Theor. Comput. 2013;9:4585–4602. doi: 10.1021/ct400371h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Karandur D., Wong K.-Y., Pettitt B.M. Solubility and aggregation of Gly5 in water. J. Phys. Chem. B. 2014;118:9565–9572. doi: 10.1021/jp503358n. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sarma R., Wong K.-Y., et al. Pettitt B.M. Peptide solubility limits: backbone and side-chain interactions. J. Phys. Chem. B. 2018;122:3528–3539. doi: 10.1021/acs.jpcb.7b10734. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Martínez L., Andrade R., et al. Martínez J.M. PACKMOL: a package for building initial configurations for molecular dynamics simulations. J. Comput. Chem. 2009;30:2157–2164. doi: 10.1002/jcc.21224. [DOI] [PubMed] [Google Scholar]
  • 31.Salomon-Ferrer R., Case D.A., Walker R.C. An overview of the Amber biomolecular simulation package. WIREs. Comput. Mol. Sci. 2013;3:198–210. [Google Scholar]
  • 32.Lindorff-Larsen K., Piana S., et al. Shaw D.E. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins. 2010;78:1950–1958. doi: 10.1002/prot.22711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jorgensen W.L., Chandrasekhar J., et al. Klein M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
  • 34.Best R.B., Hummer G. Optimized molecular dynamics force fields applied to the helix-coil transition of polypeptides. J. Phys. Chem. B. 2009;113:9004–9015. doi: 10.1021/jp901540t. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Robustelli P., Piana S., Shaw D.E. Developing a molecular dynamics force field for both folded and disordered protein states. Proc. Natl. Acad. Sci. USA. 2018;115:E4758–E4766. doi: 10.1073/pnas.1800690115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Berendsen H.J.C., Postma J.P.M., et al. Haak J.R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 1984;81:3684–3690. [Google Scholar]
  • 37.Bussi G., Donadio D., Parrinello M. Canonical sampling through velocity rescaling. J. Chem. Phys. 2007;126:014101. doi: 10.1063/1.2408420. [DOI] [PubMed] [Google Scholar]
  • 38.Darden T., York D., Pedersen L. Particle mesh Ewald: an N log(N) method for Ewald sums in large systems. J. Chem. Phys. 1993;98:10089–10092. [Google Scholar]
  • 39.Abraham M.J., Murtola T., et al. Lindahl E. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. Software. 2015;1-2:19–25. [Google Scholar]
  • 40.McGibbon R.T., Beauchamp K.A., et al. Pande V.S. MDTraj: a modern open library for the analysis of molecular dynamics trajectories. Biophys. J. 2015;109:1528–1532. doi: 10.1016/j.bpj.2015.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Humphrey W., Dalke A., Schulten K. VMD: visual molecular dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
  • 42.Eisenhaber F., Lijnzaad P., Argos P., Sander C., Scharf M. The double cubic lattice method: efficient approaches to numerical integration of surface area and volume and to dot surface contouring of molecular assemblies. J. Comput. Chem. 1995;16:273–284. [Google Scholar]
  • 43.Frederix P.W.J.M., Ulijn R.V., et al. Tuttle T. Virtual screening for dipeptide aggregation: toward predictive tools for peptide self-assembly. J. Phys. Chem. Lett. 2011;2:2380–2384. doi: 10.1021/jz2010573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.van der Spoel D., Berendsen H.J. Molecular dynamics simulations of Leu-enkephalin in water and DMSO. Biophys. J. 1997;72:2032–2041. doi: 10.1016/S0006-3495(97)78847-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Nozaki Y., Tanford C. The solubility of amino acids and two Glycine peptides in aqueous ethanol and dioxane solutions: establishment of a hydrophobicity scale. J. Biol. Chem. 1971;246:2211–2217. [PubMed] [Google Scholar]
  • 46.Piana S., Donchev A.G., et al. Shaw D.E. Water dispersion interactions strongly influence simulated structural properties of disordered protein states. J. Phys. Chem. B. 2015;119:5113–5123. doi: 10.1021/jp508971m. [DOI] [PubMed] [Google Scholar]
  • 47.Piana S., Robustelli P., et al. Shaw D.E. Development of a force field for the simulation of single-chain proteins and protein–protein complexes. J. Chem. Theor. Comput. 2020;16:2494–2507. doi: 10.1021/acs.jctc.9b00251. [DOI] [PubMed] [Google Scholar]
  • 48.Dignon G.L., Best R.B., Mittal J. Biomolecular phase separation: from molecular driving forces to macroscopic properties. Annu. Rev. Phys. Chem. 2020;71:53–75. doi: 10.1146/annurev-physchem-071819-113553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Mathieu C., Pappu R.V., Taylor J.P. Beyond aggregation: pathological phase transitions in neurodegenerative disease. Science. 2020;370:56–60. doi: 10.1126/science.abb8032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Marrink S.J., Risselada H.J., et al. de Vries A.H. The MARTINI force field: coarse grained model for biomolecular simulations. J. Phys. Chem. B. 2007;111:7812–7824. doi: 10.1021/jp071097f. [DOI] [PubMed] [Google Scholar]
  • 51.Wang Y., Lomakin A., et al. Benedek G.B. Liquid–liquid phase separation in oligomeric peptide solutions. Langmuir. 2017;33:7715–7721. doi: 10.1021/acs.langmuir.7b01693. [DOI] [PubMed] [Google Scholar]
  • 52.Yuan C., Levin A., et al. Yan X. Nucleation and growth of amino acid and peptide supramolecular polymers through liquid–liquid phase separation. Angew. Chem., Int. Ed. Engl. 2019;58:18116–18123. doi: 10.1002/anie.201911782. [DOI] [PubMed] [Google Scholar]
  • 53.Mann J.R., Donnelly C.J. RNA modulates physiological and neuropathological protein phase transitions. Neuron. 2021;109:2663–2681. doi: 10.1016/j.neuron.2021.06.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Dutagaci B., Nawrocki G., et al. Feig M. Charge-driven condensation of RNA and proteins suggests broad role of phase separation in cytoplasmic environments. Elife. 2021;10:e64004. doi: 10.7554/eLife.64004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Poudyal M., Patel K., et al. Maji S.K. Liquid condensate is a common state of proteins and polypeptides at the regime of high intermolecular interactions. bioRxiv. 2022 doi: 10.1101/2021.12.31.474648. Preprint at. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S4 and Figures S1–S7
mmc1.pdf (5.7MB, pdf)
Document S2. Article plus supporting material
mmc2.pdf (8MB, pdf)

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES