Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2017 May 5;112(10):2043–2046. doi: 10.1016/j.bpj.2017.04.021

Phase Separation and Single-Chain Compactness of Charged Disordered Proteins Are Strongly Correlated

Yi-Hsuan Lin 1,3, Hue Sun Chan 1,2,
PMCID: PMC5448239  PMID: 28483149

Abstract

Liquid-liquid phase separation of intrinsically disordered proteins (IDPs) is a major undergirding factor in the regulated formation of membraneless organelles in the cell. The phase behavior of an IDP is sensitive to its amino acid sequence. Here we apply a recent random-phase-approximation polymer theory to investigate how the tendency for multiple chains of a protein to phase-separate, as characterized by the critical temperature Tcr, is related to the protein’s single-chain average radius of gyration 〈Rg〉. For a set of sequences containing different permutations of an equal number of positively and negatively charged residues, we found a striking correlation Tcr ∼ 〈Rgγ with γ as large as ∼6.0, indicating that electrostatic effects have similarly significant impact on promoting single-chain conformational compactness and phase separation. Moreover, Tcr ∝ −SCD, where SCD is a recently proposed “sequence charge decoration” parameter determined solely by sequence information. Ramifications of our findings for deciphering the sequence dependence of IDP phase separation are discussed.

Main Text

The biological functions and disease-causing malfunctions of proteins are underpinned by their structures, dynamics, and myriad intra- and intermolecular interactions. Many critical cellular functions are carried out by intrinsically disordered proteins or protein regions (collectively abbreviated as “IDPs” here) with sequences that are less hydrophobic than those of globular proteins, but are enriched in charged, polar, and aromatic residues (1, 2, 3, 4, 5, 6). At least 75% of IDPs are polyampholytes (7, 8) in that they contain both positively and negatively charged residues (9, 10). Accordingly, electrostatic effects are important in determining individual IDPs’ conformational dimensions (8, 11, 12) and binding (13, 14). Charge-charge interactions are often significant in the recently discovered phenomenon of functional IDP liquid-liquid phase separation as well (15, 16, 17, 18, 19, 20, 21, 22). IDP phase separation appears to be the physical basis of membraneless organelles, performing many vital tasks. Recent examples include subcompartmentalization within the nucleolus (22) and synaptic plasticity (21). Malfunction of phase separation processes can lead to disease-causing amyloidogenesis (18) and neurological disorders (21). Speculatively, membraneless liquid-liquid phase separation of biomolecules might even have played a role in the origins of life (23).

Electrostatic effects encoded by a sequence of charges depend not only on the total positive and negative charges or net charge (24, 25) but also the charge pattern (8). For IDPs, this was demonstrated by Das and Pappu (8), who conducted explicit-chain, implicit-solvent conformational sampling of 30 different sequences, each composed of 25 lysine (K) and 25 glutamic acid (E) residues (termed “KE sequences” hereafter). They found that the average radius of gyration, 〈Rg〉, is strongly sequence dependent, and is correlated with a charge pattern parameter κ that quantifies local deviations from global charge asymmetry (8). A subsequent analytical treatment of the KE sequences by Sawle and Ghosh (26) rationalized the trend through another charge pattern parameter called “sequence charge decoration” (SCD) that also correlates well with 〈Rg〉. For IDP phase separation, a recent sequence-dependent random-phase-approximation (RPA) approach we put forth (27, 28) accounted for the experimental difference in phase-separation tendency between the wild-type and a charge-scrambled mutant of the 236-residue N-terminal fragment of DEAD-box RNA helicase Ddx4 (16).

These advances suggest that a deeper understanding of the fundamental relationship between single- and multiple-chain IDP properties is in order. It would be helpful, for instance, if experiments on single-chain properties can infer the conditions under which a protein sequence would undergo multiple-chain phase separation. We embark on this endeavor by first focusing on electrostatics, while leaving aromatic and other π-interactions—which can figure prominently in IDP behavior (16, 27, 29)—to future effort. To reach this initial goal, we apply RPA to the 30 KE sequences of length N = 50 to ascertain their phase-separation properties under salt-free conditions. Adopting our previous notation and making the same simplifying assumption that amino acid residues and water molecules are of equal size in the theory (27, 28), the free energy FRPA of the multiple-chain system of a given polyampholytic sequence with charge pattern {σi} = {σ1, σ2,...,σN}, where σi = ±1 is the sign of electronic charge of the ith residue, is given by (see Eqs. 13 and 40 of (28)):

FRPAa3VkBT=ϕmNlnϕm+(1ϕm)ln(1ϕm)+0dkk24π2{ln[1+G(k)]G(k)}, (1)

where a = 3.8 Å is the Cα-Cα distance, V is the system volume, kB is the Boltzmann constant, T is the absolute temperature, ϕm = ρma3 is the volume ratio of amino acid residues wherein ρm/N is protein density, and G(k) is given by:

G(k)=4πϕmk2(1+k2)TNi,j=1Nσiσjexp(k26|ij|). (2)

Here Ta/lB is the reduced temperature. The Bjerrum length is lB = e2/(4πϵ0ϵrkBT), where e is the elementary charge, ε0 is the vacuum permittivity, and ϵr is the relative permittivity (27, 28); ϵr ≈ 80 for water, but can be significantly lower for water-IDP solutions (28). Here ϵr is treated largely as an unspecified constant because our main concern is the relative Tcr values of different sequences.

We determined the phase diagrams of the 30 KE sequences from the free energy expression Eq. 1 using standard procedures (28). For each sequence, the highest temperature on the coexistence curve is the critical temperature Tcr, which is the highest T at which phase separation can occur (Fig. 1 a). The critical temperatures of the KE sequences are highly diverse, ranging from Tcr = 0.089 (sv1) to 8.570 (sv30). The variation of critical volume fraction ϕcr ≡ ϕm(Tcr) from 0.0123 (sv30, sv24) to 0.0398 (sv1) is narrower. The KE sequences were originally labeled as sv1, sv2, …, sv30 in ascending values for Das and Pappu’s charge pattern parameter κ, from the strictly alternating sequence sv1 with κ = 0.0009 (minimum segregation of opposite charges) to the diblock sequence sv30 with κ = 1.0 (maximum charge segregation) (8). Our RPA-predicted Tcr values follow largely, though not exactly, the same order: sv1 and sv30 have the lowest and highest Tcr values, respectively; however, e.g., sv24 rather than sv27 has the fourth largest Tcr and sv5, not sv2, has the second lowest Tcr. If ϵr = 80 is assumed, RPA predicts that 21 KE sequences can, but 9 KE sequences cannot, phase-separate at T300 K (Fig. 1, b and c).

Figure 1.

Figure 1

(a) Shown here are coexistence curves computed by RPA for KE sequences 1–30 in (b), listed in descending order of Tcr (except sequences 23 and 24, which have the same Tcr), with K and E residues in red and blue, respectively; those with Tcr < 0.55 (corresponding to T < 300 K when εr = 80) are shown on a gray background in (b). The “sv” sequence labels are those in Das and Pappu (8). Critical points (T = Tcr) for several high-Tcr sequences are marked by circles in (a). (c) Shown here is a logarithmic correlation between RPA-predicted Tcr and 〈Rg〉 simulated in Das and Pappu (8) (green circles). The fitted line (blue) is −lnTcr = −18.4 + 5.83ln〈Rg〉 with squared Pearson coefficient r2 = 0.92. The dashed horizontal line represents Tcr = 0.55.

Because 〈Rg〉 correlates positively with κ (8), this Tcr trend suggests that multiple-chain Tcr should correlate with single-chain 〈Rg〉. Indeed, a striking correlation (Fig. 1 b) satisfying the approximate power-law:

Tcr9.8×107Rg5.83, (3)

with Rg in units of Å, is observed for the KE sequences. The variation of Tcr with 〈Rg〉 is very sharp: Tcr increases ∼100 times whereas 〈Rg〉 decreases by ≲50%. Qualitatively, the positive (Tcr)–〈Rg〉 correlation may be understood by considering two extreme cases: the diblock, and the strictly alternating sequences (Fig. 2). For the diblock, attractive interactions are absent—cannot be satisfied—within most stretches of several (e.g., <6) residues. However, once a pair of opposite charges is in spatial proximity, chain connectivity brings two oppositely charged blocks together, leading to a strong Coulomb attraction, thus a small 〈Rg〉 and a higher tendency to phase-separate (higher Tcr). In contrast, for the strictly alternating sequence, attractive Coulomb interactions that are already weakened relative to that of the diblock sequence require more conformational restriction, resulting in more open, large-〈Rg〉 single-chain conformations and less tendency to phase-separate (lower Tcr).

Figure 2.

Figure 2

Schematics. Similar electrostatic effects are at play in single-chain compactness (left) and multiple-chain phase separation (right). (Top) Long stretches of like charges entail strong intra- and interchain attractions (gray areas). Favorable intrachain interactions are among residues that are nonlocal, i.e., more than a few residues apart, along the chain sequence. Most local interactions are repulsive because of the charge blocks. (Bottom) Attractions within and among polyampholytes that lack long charge blocks are weaker. Overall attractive interactions now require conformationally restrictive charge pairings and are weaker because of repulsion from neighboring like charges.

It is instructive to compare the predictive power of κ and another charge pattern parameter SCD i<jNσiσjji/N that has emerged from the analysis of Sawle and Ghosh (26). The two parameters are well correlated (r2 = 0.95; see Fig. 7 of Sawle and Ghosh (26)), yet the variation of both Tcr and 〈Rg〉 of the KE sequences is significantly smoother with respect to SCD than κ (Fig. 3). For example, despite the large variation in κ for sv24, sv26, and sv28 (0.45, 0.61, and 0.77, respectively), their 〈Rg〉 = 17.6, 17.5, and 17.9 Å (8), and their Tcr = 5.16, 5.08, and 5.18, are almost identical (Fig. 3 b). This similarity, however, is well reflected by their similar SCD = −17.0, −16.2, and 16.0. Indeed, a near-linear relationship (r2 = 0.997):

Tcr0.314(SCD), (4)

is observed (Fig. 3 b). A likely origin of SCD’s better performance is that it accounts for potential interactions between charges far apart along the sequence, whereas κ relies on averaging over five or six consecutive charges. Accordingly, SCD is less sensitive than κ to isolated charge reversals. The rather smooth SCD–〈Rg〉 dependence is remarkable because the simulated 〈Rg〉 (8) bears no formal relationship with the variational theory from which SCD emerges (26). Future effort should be directed toward further assessment of these and other possible charge pattern parameters (30) as predictors for IDP conformational properties.

Figure 3.

Figure 3

Charge-pattern parameters. (a) Given here is the single-chain 〈Rg〉 in Das and Pappu (8) versus the κ-parameter of Das and Pappu (8) (top horizontal scale) and the SCD parameter of Sawle and Ghosh (26) (bottom scale for −SCD). (b) Given here is the variation of RPA-predicted Tcr with κ (left vertical scale) and −SCD (right vertical scale).

In summary, we have quantified a close relationship between single-chain conformational compactness of polyampholytes and their phase-separation tendency. The above RPA results were derived with a short-range cutoff for Coulomb interactions to account for residue sizes (28, 31). If we had adopted an unphysical interaction scheme without such a cutoff, similar trends would still hold although the scaling relations Eqs. 3 and 4 would be modified, respectively, to Tcr ∼ (Rg)−3.57 and Tcr ≈ −0.490(SCD). Thus, in any event, basic physics dictates a rather sharp positive correlation between Tcr and 〈Rg〉. This connection should be further explored by both theory and simulation (30, 32) to help decipher the sequence determinants of IDP phase separation.

Author Contributions

Y.-H.L. and H.S.C. both designed research, performed research, analyzed data, and wrote the article.

Acknowledgments

We thank Julie Forman-Kay and Robert Vernon for helpful discussions.

This work was supported by Canadian Cancer Society Research Institute grant No. 703477, Canadian Institutes of Health Research grant No. MOP-84281, and computational resources provided by SciNet of Compute Canada.

Editor: Rohit Pappu.

References

  • 1.Uversky V.N., Gillespie J.R., Fink A.L. Why are “natively unfolded” proteins unstructured under physiologic conditions? Proteins. 2000;41:415–427. doi: 10.1002/1097-0134(20001115)41:3<415::aid-prot130>3.0.co;2-7. [DOI] [PubMed] [Google Scholar]
  • 2.Tompa P. Intrinsically disordered proteins: a 10-year recap. Trends Biochem. Sci. 2012;37:509–516. doi: 10.1016/j.tibs.2012.08.004. [DOI] [PubMed] [Google Scholar]
  • 3.Forman-Kay J.D., Mittag T. From sequence and forces to structure, function, and evolution of intrinsically disordered proteins. Structure. 2013;21:1492–1499. doi: 10.1016/j.str.2013.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.van der Lee R., Buljan M., Babu M.M. Classification of intrinsically disordered regions and proteins. Chem. Rev. 2014;114:6589–6631. doi: 10.1021/cr400525m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Chen T., Song J., Chan H.S. Theoretical perspectives on nonnative interactions and intrinsic disorder in protein folding and binding. Curr. Opin. Struct. Biol. 2015;30:32–42. doi: 10.1016/j.sbi.2014.12.002. [DOI] [PubMed] [Google Scholar]
  • 6.Das R.K., Ruff K.M., Pappu R.V. Relating sequence encoded information to form and function of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 2015;32:102–112. doi: 10.1016/j.sbi.2015.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sickmeier M., Hamilton J.A., Dunker A.K. DisProt: the database of disordered proteins. Nucleic Acids Res. 2007;35:D786–D793. doi: 10.1093/nar/gkl893. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Das R.K., Pappu R.V. Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues. Proc. Natl. Acad. Sci. USA. 2013;110:13392–13397. doi: 10.1073/pnas.1304749110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Higgs P.G., Joanny J.-F. Theory of polyampholyte solutions. J. Chem. Phys. 1991;94:1543–1554. [Google Scholar]
  • 10.Dobrynin A.V., Colby R.H., Rubinstein M. Polyampholytes. J. Polym. Sci. Part B Polym. Phys. 2004;42:3513–3538. [Google Scholar]
  • 11.Liu B., Chia D., Gradinaru C.C. The effect of intrachain electrostatic repulsion on conformational disorder and dynamics of the Sic1 protein. J. Phys. Chem. B. 2014;118:4088–4097. doi: 10.1021/jp500776v. [DOI] [PubMed] [Google Scholar]
  • 12.Song J., Gomes G.N., Chan H.S. An adequate account of excluded volume is necessary to infer compactness and asphericity of disordered proteins by Förster resonance energy transfer. J. Phys. Chem. B. 2015;119:15191–15202. doi: 10.1021/acs.jpcb.5b09133. [DOI] [PubMed] [Google Scholar]
  • 13.Borg M., Mittag T., Chan H.S. Polyelectrostatic interactions of disordered ligands suggest a physical basis for ultrasensitivity. Proc. Natl. Acad. Sci. USA. 2007;104:9650–9655. doi: 10.1073/pnas.0702580104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Csizmok V., Orlicky S., Forman-Kay J.D. An allosteric conduit facilitates dynamic multisite substrate recognition by the SCFCdc4 ubiquitin ligase. Nat. Commun. 2017;8:13943. doi: 10.1038/ncomms13943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Toretsky J.A., Wright P.E. Assemblages: functional units formed by cellular phase separation. J. Cell Biol. 2014;206:579–588. doi: 10.1083/jcb.201404124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Nott T.J., Petsalaki E., Baldwin A.J. Phase transition of a disordered nuage protein generates environmentally responsive membraneless organelles. Mol. Cell. 2015;57:936–947. doi: 10.1016/j.molcel.2015.01.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Brangwynne C.P., Tompa P., Pappu R.V. Polymer physics of intracellular phase transitions. Nat. Phys. 2015;11:899–904. [Google Scholar]
  • 18.Molliex A., Temirov J., Taylor J.P. Phase separation by low complexity domains promotes stress granule assembly and drives pathological fibrillization. Cell. 2015;163:123–133. doi: 10.1016/j.cell.2015.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bergeron-Sandoval L.-P., Safaee N., Michnick S.W. Mechanisms and consequences of macromolecular phase separation. Cell. 2016;165:1067–1079. doi: 10.1016/j.cell.2016.05.026. [DOI] [PubMed] [Google Scholar]
  • 20.Pak C.W., Kosno M., Rosen M.K. Sequence determinants of intracellular phase separation by complex coacervation of a disordered protein. Mol. Cell. 2016;63:72–85. doi: 10.1016/j.molcel.2016.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zeng M., Shang Y., Zhang M. Phase transition in postsynaptic densities underlies formation of synaptic complexes and synaptic plasticity. Cell. 2016;166:1163–1175.e12. doi: 10.1016/j.cell.2016.07.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Feric M., Vaidya N., Brangwynne C.P. Coexisting liquid phases underlie nucleolar subcompartments. Cell. 2016;165:1686–1697. doi: 10.1016/j.cell.2016.04.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Keating C.D. Aqueous phase separation as a possible route to compartmentalization of biological molecules. Acc. Chem. Res. 2012;45:2114–2124. doi: 10.1021/ar200294y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Mao A.H., Crick S.L., Pappu R.V. Net charge per residue modulates conformational ensembles of intrinsically disordered proteins. Proc. Natl. Acad. Sci. USA. 2010;107:8183–8188. doi: 10.1073/pnas.0911107107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Müller-Späth S., Soranno A., Schuler B. Charge interactions can dominate the dimensions of intrinsically disordered proteins. Proc. Natl. Acad. Sci. USA. 2010;107:14609–14614. doi: 10.1073/pnas.1001743107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Sawle L., Ghosh K. A theoretical method to compute sequence dependent configurational properties in charged polymers and proteins. J. Chem. Phys. 2015;143:085101. doi: 10.1063/1.4929391. [DOI] [PubMed] [Google Scholar]
  • 27.Lin Y.-H., Forman-Kay J.D., Chan H.S. Sequence-specific polyampholyte phase separation in membraneless organelles. Phys. Rev. Lett. 2016;117:178101. doi: 10.1103/PhysRevLett.117.178101. [DOI] [PubMed] [Google Scholar]
  • 28.Lin Y.-H., Song J., Chan H.S. Random-phase-approximation theory for sequence-dependent, biologically functional liquid-liquid phase separation of intrinsically disordered proteins. J. Mol. Liq. 2017;228:176–193. [Google Scholar]
  • 29.Song J., Ng S.C., Chan H.S. Polycation-π interactions are a driving force for molecular recognition by an intrinsically disordered oncoprotein family. PLOS Comput. Biol. 2013;9:e1003239. doi: 10.1371/journal.pcbi.1003239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Holehouse A.S., Das R.K., Pappu R.V. CIDER: resources to analyze sequence-ensemble relationships of intrinsically disordered proteins. Biophys. J. 2017;112:16–21. doi: 10.1016/j.bpj.2016.11.3200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ermoshkin A.V., Olvera de la Cruz M. Polyelectrolytes in the presence of multivalent ions: gelation versus segregation. Phys. Rev. Lett. 2003;90:125504. doi: 10.1103/PhysRevLett.90.125504. [DOI] [PubMed] [Google Scholar]
  • 32.Ruff K.M., Harmon T.S., Pappu R.V. CAMELOT: a machine learning approach for coarse-grained simulations of aggregation of block-copolymeric protein sequences. J. Chem. Phys. 2015;143:243123. doi: 10.1063/1.4935066. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES