Abstract
Intrinsically Disordered Proteins (IDPs), unlike folded proteins, lack a unique folded structure and rapidly interconvert among ensembles of disordered states. However, they have specific conformational properties when averaged over their ensembles of disordered states. It is critical to develop a theoretical formalism to predict these ensemble average conformational properties that are encoded in the IDP sequence (the specific order in which amino acids/residues are linked). We present a general heteropolymer theory that analytically computes the ensemble average distance profiles () between any two (i, j) monomers (amino acids for IDPs) as a function of the sequence. Information rich distance profiles provide a detailed description of the IDP in contrast to typical metrics such as scaling exponents, radius of gyration, or end-to-end distance. This generalized formalism supersedes homopolymer-like models or models that are built only on the composition of amino acids but ignore sequence details. The prediction of these distance profiles for highly charged polyampholytes and naturally occurring IDPs unmasks salient features that are hidden in the sequence. Moreover, the model reveals strategies to modulate the entire distance map to achieve local or global swelling/compaction by subtle changes/modifications—such as phosphorylation, a biologically relevant process—in specific hotspots in the sequence. Sequence-specific distance profiles and their modulation have been benchmarked against all-atom simulations. Our new formalism also predicts residue-pair specific coil-globule transitions. The analytical nature of the theory will facilitate design of new sequences to achieve specific target distance profiles with broad applications in synthetic biology and polymer science.
I. INTRODUCTION
Intrinsically disordered proteins or disordered protein regions (both termed IDPs in the present paper) participate in many vital biological functions,1 from transcriptional regulation2 and cellular differentiation to DNA condensation.3,4 IDPs do not have a unique native state, unlike folded proteins; instead, they exist in an ensemble of disordered states.4 The disordered ensembles in IDPs confer conformational plasticity/entropy5–7 that facilitate several processes: formation of transient contacts,8 multi-protein fuzzy complexes,2,9 spatial search by transcription factors,10 regulated motion between domains, enhanced specificity of promiscuous interactions,11 allosteric coupling,12 and other activities.10,13 Recent work has also shown the importance of an IDP ensemble in determining its phase separation propensity.14–18 Formation of liquid droplets upon phase separation mediated by IDPs promotes several biological processes such as endocytosis,19 silencing chromatin,20,21 and gene transcription.22–24
The above examples highlight the need to model the disordered ensembles of IDPs. To quantitate the features of the disordered ensemble, we need to compute ensemble average properties. For example, a detailed description of conformational features can be obtained from the ensemble average distance between any two amino acids i and j, where amino acids are treated as coarse-grain particles. However, like folded proteins, IDPs too have a specific sequence. Any attempt to model IDPs must respect the order in which the amino acids (or monomers) are linked. All-atom and coarse-grain simulations of synthetic sequences and IDPs have shown ensemble average properties such as the radius of gyration, or scaling exponents can critically depend on sequence specificity, not just the composition.25–31 Limited experimental data also support this observation.32–34 Thus, the modern day equivalent of the folding puzzle is to decode the link between an IDP’s conformational properties and its sequence. Motivated by this fundamental question, we appreciate that a general theoretical formalism is needed that can compute ensemble average distance profiles between any two residues/amino acids as a function of arbitrary sequence patterning. Building a theoretical model of heteropolymers capable of describing residue pair specific distance profiles will also advance problems in polymer science, for example, modeling self-coacervation in solutions of homopolyelectrolytes and blockpolyampholytes35–39 and other applications in artificial protein polymers.40 Furthermore, theoretical models provide the much needed alternate to simulations that tend to be computationally expensive. All-atom molecular dynamics simulations—successful in describing folded proteins—are limited in describing IDPs due to possible inaccuracies in the force-field41 and sampling issues.42 Less accurate coarse-grain simulations have become popular, but they too remain computationally costly compared to analytical approaches. Moreover, analytical tractability helps gain deep insights—including design principles—that are not always apparent in computer simulations.
Existing analytical theories of heteropolymers typically average over different monomer sequences assuming that the sequence disorder is annealed.43–46However, a given biologically functional protein or its mutated/chemically modified version has a unique sequence. A new set of analytical models have recently been developed that explicitly account for patterning of amino acids to describe the conformation of a single IDP.47–49The first work47 in this series developed a theory to compute ensemble average distances for a given sequence. However, the model was limited to cases where intra-chain interactions are overall repulsive. Consequently, the model could only describe chain dimensions that were greater than the Flory random coil (FRC) limit. The FRC is a reference state where the two-body interaction between two monomers in a chain is effectively zero. Consequently, the earlier model failed to capture globular states and coil-globule transitions. Subsequent work48 extended the model by adding a three-body repulsive term allowing the two-body interaction to be attractive. This model could describe the globule state where chain dimensions could reach below FRC. However, the modified model was only extended to compute the ensemble average end-to-end distance. In this study, we provide a formalism that allows the prediction of inter-residue distance profiles—between any two residues, not just the end-to-end distance—even in the globule state. The general formalism presents a powerful tool to describe the entire conformational ensemble—both above and below FRC—for arbitrary sequence patterning. Consequently, the theory can describe a residue-pair specific coil-globule transition. We reiterate that these predictions are detailed and different from the well-known polymer models that provide scaling laws or end-to-end distance and/or radius of gyration. All existing scaling laws and previous results of flexible homopolymers50,51 and heteropolymers47–49 can be derived as a limiting case of this general theory.
II. METHOD
The Hamiltonian of a polymer chain in the presence of inter-monomer excluded volume, electrostatic interaction, and three body interaction is given by
(1) |
where s is the contour length variable of the backbone, R(s) is the position vector at s, L is the total contour length with l denoting the Kuhn length, T is the temperature, and kb is Boltzmann’s constant with β = 1/(kbT). ω(s, s′) is the residue pair specific excluded volume, q(s) is charge at s, and ω3 is the sequence independent repulsive three body interaction parameter. Next, we will use a variational approach—analogous to early work used to determine the average size of homopolymer chains in the presence of inter-monomer interactions50–54—to compute inter-residue distances. Within this variational scheme,50–54 we map the Hamiltonian (Ht), with all interactions, to a renormalized Hamiltonian (Hr) defined as
(2) |
where lr(s2, s1) is the renormalized Kuhn length and is a function of all the interaction parameters and specific to the residue pairs s1, s2 between which the distance is to be determined. The ensemble average distance between s1, s2 is given by . Thus, lr(s2, s1) as residue pair specific Kuhn lengths have been introduced to facilitate the calculation of the ensemble average distances between the two residues.47,54
A. Coil globule transition for ensemble average distance between an arbitrary pair of amino acid residues
Switching from continuous to discrete notation, we replace s2, s1 by i, j, with i>j. We also define a new dimensionless quantity xij as the ratio of the residue-pair specific renormalized Kuhn length and bare Kuhn length, xij = lr,ij/l. Values of xij greater than unity define the coil state, while xij less than unity denotes the globule state. Following the procedure from our earlier paper,48 we provide the free energy F(xij) as a function of xij as
(3) |
where lb = 7.2(T0/T)Å, T0 = 298, N is the total number of amino acids, and Qij′, Tij, and Ωij are given by
(4) |
(5) |
(6) |
(7) |
where ωm,ns are two-body residue pair (m, n) specific non-electrostatic interaction parameters. These are equivalent to ω(s, s′) used in the continuous notation. For all the subsequent application of this formalism, we will assume ωm,n = ω2. This approximation implies that the sequence effect of non-electrostatic interactions can be represented by a mean-field term. This is a reasonable assumption since we will limit our examples in this paper on sequences that are highly charged.
The value of xij is determined by minimizing F(xij). From the minimized xij, is computed by using
(8) |
Thus, the presented formulation provides a full description of the intra-protein conformational ensemble, including the possibility of observing coil-globule transitions for arbitrary (not just the end-to-end) residue-pairs as well. Equation (3) recovers several earlier results under different limits.50,51,55 See the Appendix for details of the derivation using an equivalent approach similar to the one used earlier.52,54
B. Protocol for all-atom simulation
We test our theoretical prediction of the distance profiles against all-atom simulations using the CAMPARI Monte Carlo scheme with the ABSINTH implicit solvation model.56,57 We used CAMPARI for all of the proteins that we simulated: SV25, SV30, DP00877 WT, P0A8H9 WT, P0A8H9 S2T15, P0A8H9 S54S56, and prothymosin-alpha. In a simulation procedure similar to our previous work,48,49 for each simulation, we used 15 different temperatures for the thermal replica exchange: 280 K, 286 K, 293 K, 298 K, 305 K, 315 K, 325 K, 335 K, 350 K, 365 K, 380 K, 395 K, 410 K, 430 K, and 450 K. Energy considerations, Monte Carlo move sets, and replica swapping procedures are identical to the protocol utilized by Das and Pappu.26 Each of our simulations were run at zero salt, with only explicit neutralizing Na+ and Cl− ions being added in a droplet size of 400 Å radius. We ran 12 simulation trajectories at each temperature. All individual simulations were run for at least 1.4 × 107 steps with the first 1.5 × 106 steps being discarded as equilibration, and an output PDB file was generated every 4.0 × 103 steps. P0A8H9 S2T15 denotes wild type sequence P0A8H9 that has been phosphorylated at S2 and T15 locations. Likewise, P0A8H9 S54S56 has been phosphorylated at S54 and S56. Phosphorylations were mimicked by replacing the serine (S) or threonine (T) residue with glutamic acid (E), a method that has previously been used to represent phosphorylation.58
Convergence tests were done by comparing the fluctuations in the distance between each amino acid i, j pair in the first and second halves of each simulation. The fluctuation of a half simulation δdi,j is defined as
(9) |
where is the variance of the distance, di,j, between any i, j pair of amino acids and is the mean squared of di,j. This test was done assuming a self-consistent simulation must reproduce the same statistics captured during the first and second halves of the simulation.59 Although this does not ensure convergence, it is a rigorous criterion for self-consistency.60 A majority of convergence tests compare cluster distribution probabilities to ensure self-consistency. Our proposed criteria of comparing distance profiles to ensure self-consistency are immune to ad-hoc criteria needed for clustering. Figure 1 in the supplementary material shows this comparison for our simulations. We ensure that R2 between the two statistics (first and second half of the trajectory) is at least 0.9, and the slope is between 0.9 and 1.10 (see Table 1provided in the supplementary material). For all proteins, we ran at least 12 individual trajectories at each temperature as noted above. For the cases where 12 individual thermal replica exchange simulations were not sufficient, additional 12 thermal replica exchange simulations were run until our criteria was satisfied. The strategy of “check-extend-check” can be used in simulations of IDPs to ensure self-consistency criteria.
III. RESULT
A. Theory captures intra-chain distance profile similar to all-atom simulation
1. Application to synthetic polyampholytes
Equation (3) can account for sequence specificity for charge (via term) and non-charge (via Ωij and Tij) monomers. For the present work, we focus only on the charge patterning effects. Motivated by these criteria, we consider two synthetic sequences that contain only charged amino acids. These two sequences were chosen from the list of 30 heteropolymeric sequences originally created by Das and Pappu.26 These 30 sequences have 25 lysine and 25 glutamic acids but shuffled in different patterning. We have chosen SV25 and SV30 from the original list. We expect the role of non-electrostatic effects to be minimal for these two sequences. Below, we show inter-residue distance profiles of these two sequences computed using the all-atom simulation (upper triangle in Fig. 1).
We compare these distance profiles against our theoretical model using the following steps. As noted earlier, non-electrostatic effects are modeled with a mean-field interaction parameter by assuming ωm,n≈ω2 in Eq. (7). The sequence effect of the charge patterning is explicitly calculated using Eq. (4), where glutamic acid and lysine were assigned a charge q = −1 and q = 1, respectively. To determine that ω2 and ω3 best describe the distance profile, we chose a set of two residue pairs, say (m, n) and (p, q). Using the simulated values of and , we solved for ω3, ω2 specific to the pair (m, n) and (p, q). The linear dependence of xij on ω2, ω3 [see Eq. (A8) in the Appendix] allows solving these two parameters from the two values. Different solutions of ω3, ω2 are obtained by varying the set of pairs of residues.
These solutions are then clustered with the K-means algorithm, and the solution of ω3, ω2 was chosen forming the largest cluster. One of the issues in the clustering is the proper choice of the number (K) of clusters. The optimum value of K was determined by employing the “elbow method.” This refers to clustering the set of ω3, ω2 with a variable number of K clusters and plotting the average ω3, ω2 from the top cluster as a function of K. The optimal number of clusters (K) was chosen where the dependence of the average ω3, ω2 on the number of clusters forms an “elbow.” The selected value of ω3, ω2 corresponding to the largest cluster with the optimal K was then used to predict the entire distance profile, i.e., for all i, j. Figure 1 (lower triangle) shows the theoretical prediction (in terms of xij) for the two synthetic sequences (SV25 and SV30). In spite of adopting a mean-field assumption for non-charge patterning, our theory can capture the main features of the distance profile when compared against the all-atom simulation (upper triangle in Fig. 1). To facilitate comparison between simulation and theory, we normalized the xij map between 0 and 1. We conclude that the features of distance profiles primarily induced by electrostatics are reasonably described by our theory. We emphasize that this is a unique feature of the sequence specific model presented here and different from the prediction of the end-to-end distance or scaling exponents typically computed using traditional polymer physics based theories.
2. Application to intrinsically disordered protein
Next, we test our model on naturally occurring proteins. We have chosen three sequences DP00877, prothymosin-alpha, andP0A8H9 from the database of IDPs that have high charge content 40%, 50%, and 29%, respectively. We expect that these proteins will have conformational heterogeneity largely dictated by electrostatics allowing us to adopt the same mean-field approximation for the non-electrostatic patterning. We followed the same procedure, described above, to determine the most likely choice of mean-field values of ω2 and ω3 and generated the distance map using our theoretical formalism (lower triangles in Fig. 2). As before, the agreement between all atom simulation and the theoretical model is reasonable. From Figs. 1 and 2, it is evident that our model is able to capture specific features of distance profiles encoded in each sequence. We also note that there are noticeable differences between theory and simulation. Although there is no particular trend in these differences to highlight specific origins of the sources of deviations, we can identify several factors that may cause these differences. All-atom simulations provide a more detailed description of amino acids compared to point-like coarse-grain approximations in the proposed analytical model. Furthermore, we have neglected the non-electrostatic interaction details by invoking a mean-field approximation ωm,n = ω2. Interactions between polar–polar, charge–polar, and hydrophobic–hydrophobic are expected to be different causing deviation from the mean-field approximation. Careful parameterization—using different scales and schemes31—is needed to disentangle these effects and further improve model predictions.
Black lines in Fig. 2 show noticeable deviation from homopolymer-like behavior. For a homopolymer, we expect similar values of for pairs of residue i, j when |i−j| is fixed. The black lines in Fig. 2 are the locus of all residue pairs (i, j) that have the same |i−j| values. The strong variation in size depicted by changes in the distance profile color indicates that there is a significant heterogeneity at the segmental (inter-residue) level. The all-atom simulation (upper triangle) also exhibits the same variance. Schuler and colleagues noted such translational symmetry breaking in the sm-FRET experiments.61
B. Phosphorylation hotspots can significantly alter distance profiles
Next, we investigate the effect of phosphorylation that can alter charge patterning and consequently influence the distance profiles. To investigate these effects, we consider the wild type protein P0A8H9 as an illustrative example. In our earlier work,48 we identified two different sequence variants of P0A8H9 that yield different end-to-end distances. The two sequences are (i) S2T15, phosphorylated at Serine and Threonine at residue 2 and 15 and (ii) S54S56, phosphorylated at the Serine residue locations 54 and 56. Both the sequences have the same charge content in terms of positively and negatively charged amino acids. We revisited the distance profiles of these two sequences. Figure 3 shows the difference in the distance profiles between thetwo sequences and the wild type sequence. Regions of red indicate that the modified sequence is more expanded compared to the wild type. On the other hand, blue implies the modified sequence is more compact compared to the wild type. The lower triangle shows results from theory compared against the all-atom simulation depicted in the upper triangle. Predominantly, red and blue regions indicate that minor—but subtle—changes in the location of phosphorylation can incur significant changes (expansion or contraction) in the distance profile. Placing the phosphorylation group at positions S2 and T15 (Fig. 3; left panel) causes specific regions to expand (red in the top left and bottom right corners) and a separate region to contract (blue in the bottom left corner). This is in contrast to choosing phosphorylation sites at S54 and S56 (Fig. 3; right panel). For the S54S56 variant, the same red region (expanded) exists in the top left corner. However, the blue region appears in the bottom right corner (lower triangle in Fig. 3; right). We conclude that different phosphorylation sites can be targeted to create compaction in a specific region and expansion in some other region. We also notice there is reasonable agreement between theory (lower triangle in Fig. 3) and the all-atom simulation (upper triangle in Fig. 3).
C. Coil-globule transition is residue-pair specific
In addition to quantifying changes in the conformation profile with changes in charge patterning (discussed above), our model can predict response to non-electrostatic effects as well. We demonstrate this by tuning the mean field interaction parameter (ω2). Changes in this parameter can be envisioned as changes in hydrophobicity31 in the sequence and/or temperature.48 The changes due to the local environment inside cells arising from non-specific soft interactions62–65 can also modify ω2. An intriguing prediction of our model is that different pairs of residues respond differently to these changes (Fig. 4; left panel). As an example, consider globule (x < 1) to coil (x > 1) transitions for two residue pairs: Pair 1 with i = 52, j = 3 (blue curve) and pair 2 with p = 63, q = 48 (red curve). Pair 1 has much more gradual transition compared to pair 2. This example highlights the ability to achieve significant conformational changes at specific locations in the protein by tuning local environment (equivalent to changing ω2). Intriguingly, upon phosphorylation at positions S2 and T15, the difference between the two curves gets suppressed (Fig. 4; middle panel). The sharpness of the transition significantly reduces for pair 2 compared to its wild type version. Phosphorylation at positions S54 and S56 again shows very different outcomes. The red and blue curves are very different, and the ensemble average distance for pair 2 (red curve) changes much more gradually as compared to the wild-type (Fig. 4; right panel). These findings further showcase the ability to achieve selective distance profiles by simultaneously changing charge patterning (using phosphorylation) and ω2 (proxy for solution properties). We note that the prediction of sequence specific coil-globule transition for intra-chain distances will be of interest in polymer physics as well.
IV. CONCLUSION
We present a general formalism that can predict the ensemble average distance between any two monomers in a heteropolymer. The prescription accounts for the sequence effect beyond just the composition. The performance of the model is tested against all-atom simulations for two synthetic sequences and three naturally occurring IDPs that are high in charge content. The model also captures differences in the distance profiles when chemical modifications (such as phosphorylation) alter sequences in even as little as two sites. There are specific hotspots for modification that can cause significant changes in the distance profiles. Our model can predict coil-globule transition for the distance between any two residues. These transitions can significantly depend on the choice of the residue pairs. Although theoretical prediction remarkably captures features similar to all-atom simulation, there are also differences between theory and simulation. The observed discrepancies between all-atom simulation and coarse-grain analytical theory could be due to several reasons, including the mean-field assumption imposed on non-electrostatic two-body interactions. Explicit consideration of these non-electrostatic interactions may further enhance the predictive power of our theoretical formalism. Insights revealed from these studies and the analytical nature of the theory will provide a tool to design heteropolymers (both biological and non-biological) for new applications. At the same time, it will also allow us to investigate the role of mutations and modifications to alter biological functions.
SUPPLEMENTARY MATERIAL
ACKNOWLEDGMENTS
The authors acknowledge support from the NIH (Award No. 1R15GM128162-01A1), KIHA (Knoebel Institute for Health Aging at the University of Denver), PROF (from University of Denver), and NSF (Award No. 1149992). The authors appreciate discussions with Hue Sun Chan, Austin Clark, Taylor Firman, Yi-Hsuan Lin, Murugappan Muthukumar, Benjamin Schuler, and Andrea Sorrano. The authors also acknowledge the computational support from Benjamin Fotovich and the High Performance Computing (HPC) facility at the University of Denver. The simulation data that support the findings of this study are available from the corresponding author upon reasonable request.
Here, we show the equivalence between the free energy based formalism introduced by Muthukumar66 given in Eq. (3) and the alternate approach using Edwards and Singh formalism.52 In the Edwards–Singh approach, the renormalized Kuhn length lr(s2, s1) is determined by demanding [see Eq. (5) in Ref. 47]
(A1) |
where ⟨…⟩r is the average over the trial Hamiltonian Hr. Using the definition of Ht and Hr from Eqs. (1) and (2), the above equation becomes
(A2) |
Using Eqs. (A2), (A6), and (A9) from our earlier work,47 we note the three following equalities:
(A3) |
(A4) |
and
(A5) |
where we further use the identity50
(A6) |
The contribution due to the new term involving the three body interaction is calculated as
(A7) |
Combining Eqs. (A2)–(A7) and using discrete notation by assigning s2 = i, s1 = j, s = l, s′ = m, s″ = n, and xij = lr,ij/l, we get the final equation
(A8) |
where Ωij and Tij are identical to Eqs. (7) and (6). Qij is defined as
(A9) |
with A(m, n, xij, κl) as
(A10) |
Minimization of the free energy Eq. (3) with respect to xij, i.e.,∂[βF(xij)]/∂xij = 0, yields an equation identical to Eq. (A8). In establishing the equivalence, we also used the following relation:
(A11) |
The above derivation shows the validity of our approach. However, we use the free energy minimization equation instead of Eq. (A8) due to the ease of computation when multiple solutions (due to two minima and one maxima in the free energy) may arise.
REFERENCES
- 1.Dunker A. K., Silman I., Uversky V. N., and Sussman J. L., Curr. Opin. Struct. Biol. 18, 756 (2008). 10.1016/j.sbi.2008.10.002 [DOI] [PubMed] [Google Scholar]
- 2.Mittag T., Kay L. E., and Forman-Kay J. D., J. Mol. Recognit. 23, 105 (2010). 10.1002/jmr.961 [DOI] [PubMed] [Google Scholar]
- 3.Xie H., Vucetic S., Iakoucheva L. M., Oldfield C. J., Dunker A. K., Uversky V. N., and Obradovic Z., J. Proteome Res. 6, 1882 (2007). 10.1021/pr060392u [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tompa P., Trends Biochem. Sci. 37, 509 (2012). 10.1016/j.tibs.2012.08.004 [DOI] [PubMed] [Google Scholar]
- 5.Tompa P., Chem. Rev. 114, 6715 (2014). 10.1021/cr4005082 [DOI] [PubMed] [Google Scholar]
- 6.Flock T., Weatheritt R. J., Latysheva N. S., and Babu M. M., Curr. Opin. Struct. Biol. 26, 62 (2014). 10.1016/j.sbi.2014.05.007 [DOI] [PubMed] [Google Scholar]
- 7.Drake J. A. and Pettitt B. M., Biophys. J. 114, 2799 (2018). 10.1016/j.bpj.2018.04.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zhou H.-X., Trends Biochem. Sci. 37, 43 (2012). 10.1016/j.tibs.2011.11.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Mittag T., Marsh J., Grishaev A., Orlicky S., Lin H., Sicheri F., Tyers M., and Forman-Kay J. D., Structure 18, 494 (2010). 10.1016/j.str.2010.01.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Vuzman D., Azia A., and Levy Y., J. Mol. Biol. 396, 674 (2010). 10.1016/j.jmb.2009.11.056 [DOI] [PubMed] [Google Scholar]
- 11.Liu J., Faeder J. R., and Camacho C. J., Proc. Natl. Acad. Sci. U. S. A. 106, 19819 (2009). 10.1073/pnas.0907710106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hilser V. J. and Thompson E. B., Proc. Natl. Acad. Sci. U. S. A. 104, 8311 (2007). 10.1073/pnas.0700329104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Shoemaker B. A., Portman J. J., and Wolynes P. G., Proc. Natl. Acad. Sci. U. S. A. 97, 8868 (2000). 10.1073/pnas.160259697 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lin Y.-H. and Chan H. S., Biophys. J. 112, 2043 (2017). 10.1016/j.bpj.2017.04.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lin Y.-H., Brady J. P., Forman-Kay J. D., and Chan H. S., New J. Phys. 19, 115003 (2017). 10.1088/1367-2630/aa9369 [DOI] [Google Scholar]
- 16.Lin Y. H., Forman-Kay J. D., and Chan H. S., Biochemistry 57, 2499 (2018). 10.1021/acs.biochem.8b00058 [DOI] [PubMed] [Google Scholar]
- 17.Dignon G. L., Zheng W., Best R. B., Kim Y. C., and Mittal J., Proc. Natl. Acad. Sci. U. S. A. 115, 9929 (2018). 10.1073/pnas.1804177115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Lin Y.-H., Brady J. P., Chan H. S., and Ghosh K., J. Chem. Phys. 152, 045102 (2020). 10.1063/1.5139661 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Bergeron-Sandoval L. P., Heris H. K., Chang C., Cornell C. E., Keller S. L., Hendricks A. G., Ehrlicher A. J., Francois P., Pappu R. V., and Michnick S. W., “Endocytosis caused by liquid-liquid phase separation of proteins,” bioRxiv:145664 (2018).
- 20.Strom A. R., Emelyanov A. V., Mir M., Fyodorov D. V., Darzacq X., and Karpen G. H., Nature 547, 241 (2017). 10.1038/nature22989 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Larson A. G., Elnatan D., Keenen M. M., Trnka M. J., Johnston J. B., Burlingame A. L., Agard D. A., Redding S., and Narlikar G. J., Nature 547, 236 (2017). 10.1038/nature22822 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Plys A. J. and Kingston R. E., Science 361, 329 (2018). 10.1126/science.aau4795 [DOI] [PubMed] [Google Scholar]
- 23.Cho W.-K., Spille J.-H., Hecht M., Lee C., Li C., Grube V., and Cisse I. I., Science 361, 412 (2018). 10.1126/science.aar4199 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sabari B. R. et al. , Science 361, eaar3958 (2018). 10.1126/science.aar3958 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Srivastava D. and Muthukumar M., Macromolecules 29, 2324 (1996). 10.1021/ma951204g [DOI] [Google Scholar]
- 26.Das R. K. and Pappu R. V., Proc. Natl. Acad. Sci. U. S. A. 110, 13392 (2013). 10.1073/pnas.1304749110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zerze G. H., Best R. B., and Mittal J., J. Phys. Chem. B 119, 14622 (2015). 10.1021/acs.jpcb.5b08619 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Dignon G. L., Zheng W., Kim Y. C., Best R. B., and Mittal J., PLoS Comput. Biol. 14, e1005941 (2018). 10.1371/journal.pcbi.1005941 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Baul U., Chakraborty D., Mugnai M. L., Straub J. E., and Thirumalai D., J. Phys. Chem. B 123, 3462 (2019). 10.1021/acs.jpcb.9b02575 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.McCarty J., Delaney K. T., Danielsen S. P. O., Fredrickson G. H., and Shea J.-E., J. Phys. Chem. Lett. 10, 1644 (2019). 10.1021/acs.jpclett.9b00099 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Zheng W., Dignon G., Brown M., Kim Y.C., and Mittal J., J. Phys. Chem. Lett. “Hydropathy patterning complements charge patterning to describe conformational preferences of disordered proteins” (published online 2020). 10.1021/acs.jpclett.Oc00288 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Marsh J. A. and Forman-Kay J. D., Biophys. J. 98, 2383 (2010). 10.1016/j.bpj.2010.02.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Muller-Spath S., Soranno A., Hirschfeld V., Hofmann H., Ruegger S., Reymond L., Nettels D., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 107, 14609 (2010). 10.1073/pnas.1001743107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Hofmann H., Soranno A., Borgia A., Gast K., Nettels D., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 109, 16155 (2012). 10.1073/pnas.1207719109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Shen K. and Wang Z.-G., Macromolecules 51, 1706 (2018). 10.1021/acs.macromol.7b02685 [DOI] [Google Scholar]
- 36.Danielson S. P. O., McCarty J., Shea J. E., Delaney K. T., and Fredrickson G. H., Proc. Natl. Acad. Sci. U. S. A. 116, 8224 (2019). 10.1073/pnas.1900435116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Danielsen S. P. O., McCarty J., Shea J.-E., Delaney K. T., and Fredrickson G. H., J. Chem. Phys. 151, 034904 (2019). 10.1063/1.5109045 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Madinya J. J., Chang L. W., Perry S. L., and Sing C. E., Mol. Syst. Des. Eng. 5, 632 (2019). 10.1039/C9ME00074G [DOI] [Google Scholar]
- 39.Lytle T. K., Chang L.-W., Markiewicz N., Perry S. L., and Sing C. E., ACS Cent. Sci. 5, 709 (2019). 10.1021/acscentsci.9b00087 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dzuricky M., Roberts S., and Chilkoti A., Biochemistry 57, 2405 (2018). 10.1021/acs.biochem.8b00056 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Best R. B., Zheng W., and Mittal J., J. Chem. Theory Comput. 10, 5113 (2014). 10.1021/ct500569b [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hicks A. and Zhou H.-X., J. Chem. Phys. 149, 072313 (2018). 10.1063/1.5027409 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Higgs P. G. and Joanny J. F., J. Chem. Phys. 94, 1543 (1991). 10.1063/1.460012 [DOI] [Google Scholar]
- 44.Dobrynin A. V., Colby R. H., and Rubinstein M., J. Polym. Sci. B 42, 3513 (2004). 10.1002/polb.20207 [DOI] [Google Scholar]
- 45.Gutin A. M. and Shakhnovich E. I., Phys. Rev. E 50, R3322 (1994). 10.1103/physreve.50.r3322 [DOI] [PubMed] [Google Scholar]
- 46.Samanta H. S., Chakraborty D., and Thirumalai D., J. Chem. Phys. 149, 163323 (2018). 10.1063/1.5035428 [DOI] [PubMed] [Google Scholar]
- 47.Sawle L. and Ghosh K., J. Chem. Phys. 143, 085101 (2015). 10.1063/1.4929391 [DOI] [PubMed] [Google Scholar]
- 48.Firman T. and Ghosh K., J. Chem. Phys. 148, 123305 (2018). 10.1063/1.5005821 [DOI] [PubMed] [Google Scholar]
- 49.Huihui J., Firman T., and Ghosh K., J. Chem. Phys. 149, 085101 (2018). 10.1063/1.5037727 [DOI] [PubMed] [Google Scholar]
- 50.Muthukumar M., J. Chem. Phys. 86, 7230 (1987). 10.1063/1.452763 [DOI] [Google Scholar]
- 51.Ha B.-Y. and Thirumalai D., Phys. Rev. A 46, R3012 (1992). 10.1103/physreva.46.r3012 [DOI] [PubMed] [Google Scholar]
- 52.Edwards S. F. and Singh P., J. Chem. Soc. Faraday Trans. 2 75, 1020 (1979). 10.1039/f29797501020 [DOI] [Google Scholar]
- 53.Ghosh K., Carri G. A., and Muthukumar M., J. Chem. Phys. 115, 4367 (2001). 10.1063/1.1386924 [DOI] [Google Scholar]
- 54.Ghosh K. and Muthukumar M., J. Polym. Sci., Part B: Polym. Phys. 39, 2644 (2001). 10.1002/polb.10016 [DOI] [Google Scholar]
- 55.Muthukumar M., J. Chem. Phys. 81, 6272 (1984). 10.1063/1.447585 [DOI] [Google Scholar]
- 56.Vitalis A. and Pappu R. V., Annu. Rep. Comput. Chem. 5, 49 (2009). 10.1016/s1574-1400(09)00503-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Vitalis A. and Pappu R. V., J. Comput. Chem. 30, 673 (2009). 10.1002/jcc.21005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Martin E. W., Holehouse A. S., Grace C. R., Hughes A., Pappu R. V., and Mittag T., J. Am. Chem. Soc. 138, 15323 (2016). 10.1021/jacs.6b10272 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lyman E. and Zuckerman D. M., Biophys. J. 91, 164 (2006). 10.1529/biophysj.106.082941 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Sawle L. and Ghosh K., J. Chem. Theory Comput. 12, 861 (2016). 10.1021/acs.jctc.5b00999 [DOI] [PubMed] [Google Scholar]
- 61.Wuttke R., Hofmann H., Nettels D., Borgia M., Mittal J., Best R., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 111, 5213 (2014). 10.1073/pnas.1313006111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Guo M., Xu Y., and Gruebele M., Proc. Natl. Acad. Sci. U. S. A. 109, 17863 (2012). 10.1073/pnas.1201797109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Theillet F.-X., Binolfi A., Frembgen-Kesner T., Hingorani K., Sarkar M., Kyne C., Li C., Crowley P. B., Gierasch L., Pielak G. J. et al. , Chem. Rev. 114, 6661 (2014). 10.1021/cr400695p [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Yu I., Mori T., Ando T., Harada R., Jung J., Sugita Y., and Feig M., eLife 5, e19274 (2016). 10.7554/elife.19274 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Ribeiro S., Ebbinghaus S., and Marcos J. C., FEBS Lett. 592, 3040 (2018). 10.1002/1873-3468.13211 [DOI] [PubMed] [Google Scholar]
- 66.Muthukumar M., J. Chem. Phys. 120, 9343 (2004). 10.1063/1.1701839 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.