An analytical theory to describe sequence-specific inter-residue distance profiles for polyampholytes and intrinsically disordered proteins

Jonathan Huihui; Kingshuk Ghosh

doi:10.1063/5.0004619

. 2020 Apr 22;152(16):161102. doi: 10.1063/5.0004619

An analytical theory to describe sequence-specific inter-residue distance profiles for polyampholytes and intrinsically disordered proteins

Jonathan Huihui ¹, Kingshuk Ghosh ^1,^a)

PMCID: PMC7180065 PMID: 32357776

Abstract

Intrinsically Disordered Proteins (IDPs), unlike folded proteins, lack a unique folded structure and rapidly interconvert among ensembles of disordered states. However, they have specific conformational properties when averaged over their ensembles of disordered states. It is critical to develop a theoretical formalism to predict these ensemble average conformational properties that are encoded in the IDP sequence (the specific order in which amino acids/residues are linked). We present a general heteropolymer theory that analytically computes the ensemble average distance profiles ( $⟨ R_{i j}^{2} ⟩$ ) between any two (i, j) monomers (amino acids for IDPs) as a function of the sequence. Information rich distance profiles provide a detailed description of the IDP in contrast to typical metrics such as scaling exponents, radius of gyration, or end-to-end distance. This generalized formalism supersedes homopolymer-like models or models that are built only on the composition of amino acids but ignore sequence details. The prediction of these distance profiles for highly charged polyampholytes and naturally occurring IDPs unmasks salient features that are hidden in the sequence. Moreover, the model reveals strategies to modulate the entire distance map to achieve local or global swelling/compaction by subtle changes/modifications—such as phosphorylation, a biologically relevant process—in specific hotspots in the sequence. Sequence-specific distance profiles and their modulation have been benchmarked against all-atom simulations. Our new formalism also predicts residue-pair specific coil-globule transitions. The analytical nature of the theory will facilitate design of new sequences to achieve specific target distance profiles with broad applications in synthetic biology and polymer science.

I. INTRODUCTION

Intrinsically disordered proteins or disordered protein regions (both termed IDPs in the present paper) participate in many vital biological functions,¹ from transcriptional regulation² and cellular differentiation to DNA condensation.^3,4 IDPs do not have a unique native state, unlike folded proteins; instead, they exist in an ensemble of disordered states.⁴ The disordered ensembles in IDPs confer conformational plasticity/entropy^5–7 that facilitate several processes: formation of transient contacts,⁸ multi-protein fuzzy complexes,^2,9 spatial search by transcription factors,¹⁰ regulated motion between domains, enhanced specificity of promiscuous interactions,¹¹ allosteric coupling,¹² and other activities.^10,13 Recent work has also shown the importance of an IDP ensemble in determining its phase separation propensity.^14–18 Formation of liquid droplets upon phase separation mediated by IDPs promotes several biological processes such as endocytosis,¹⁹ silencing chromatin,^20,21 and gene transcription.^22–24

The above examples highlight the need to model the disordered ensembles of IDPs. To quantitate the features of the disordered ensemble, we need to compute ensemble average properties. For example, a detailed description of conformational features can be obtained from the ensemble average distance $⟨ R_{i j}^{2} ⟩$ between any two amino acids i and j, where amino acids are treated as coarse-grain particles. However, like folded proteins, IDPs too have a specific sequence. Any attempt to model IDPs must respect the order in which the amino acids (or monomers) are linked. All-atom and coarse-grain simulations of synthetic sequences and IDPs have shown ensemble average properties such as the radius of gyration, or scaling exponents can critically depend on sequence specificity, not just the composition.^25–31 Limited experimental data also support this observation.^32–34 Thus, the modern day equivalent of the folding puzzle is to decode the link between an IDP’s conformational properties and its sequence. Motivated by this fundamental question, we appreciate that a general theoretical formalism is needed that can compute ensemble average distance profiles between any two residues/amino acids as a function of arbitrary sequence patterning. Building a theoretical model of heteropolymers capable of describing residue pair specific distance profiles $⟨ R_{i j}^{2} ⟩$ will also advance problems in polymer science, for example, modeling self-coacervation in solutions of homopolyelectrolytes and blockpolyampholytes^35–39 and other applications in artificial protein polymers.⁴⁰ Furthermore, theoretical models provide the much needed alternate to simulations that tend to be computationally expensive. All-atom molecular dynamics simulations—successful in describing folded proteins—are limited in describing IDPs due to possible inaccuracies in the force-field⁴¹ and sampling issues.⁴² Less accurate coarse-grain simulations have become popular, but they too remain computationally costly compared to analytical approaches. Moreover, analytical tractability helps gain deep insights—including design principles—that are not always apparent in computer simulations.

Existing analytical theories of heteropolymers typically average over different monomer sequences assuming that the sequence disorder is annealed.^43–46However, a given biologically functional protein or its mutated/chemically modified version has a unique sequence. A new set of analytical models have recently been developed that explicitly account for patterning of amino acids to describe the conformation of a single IDP.^47–49The first work⁴⁷ in this series developed a theory to compute ensemble average distances for a given sequence. However, the model was limited to cases where intra-chain interactions are overall repulsive. Consequently, the model could only describe chain dimensions that were greater than the Flory random coil (FRC) limit. The FRC is a reference state where the two-body interaction between two monomers in a chain is effectively zero. Consequently, the earlier model failed to capture globular states and coil-globule transitions. Subsequent work⁴⁸ extended the model by adding a three-body repulsive term allowing the two-body interaction to be attractive. This model could describe the globule state where chain dimensions could reach below FRC. However, the modified model was only extended to compute the ensemble average end-to-end distance. In this study, we provide a formalism that allows the prediction of inter-residue distance profiles—between any two residues, not just the end-to-end distance—even in the globule state. The general formalism presents a powerful tool to describe the entire conformational ensemble—both above and below FRC—for arbitrary sequence patterning. Consequently, the theory can describe a residue-pair specific coil-globule transition. We reiterate that these predictions are detailed and different from the well-known polymer models that provide scaling laws or end-to-end distance and/or radius of gyration. All existing scaling laws and previous results of flexible homopolymers^50,51 and heteropolymers^47–49 can be derived as a limiting case of this general theory.

II. METHOD

The Hamiltonian of a polymer chain in the presence of inter-monomer excluded volume, electrostatic interaction, and three body interaction is given by

\begin{matrix} β H_{t} & = & (\frac{3}{2 l} \int_{0}^{L} d s {(\frac{d R (s)}{d s})}^{2} + \frac{l_{b}}{l^{2}} \int_{0}^{L} d s \int_{0}^{s} d s^{'} q (s) q (s^{'}) \\ \times \frac{\exp (- κ | R (s) - R (s^{'}) |)}{| R (s) - R (s^{'}) |} + l \int_{0}^{L} d s \int_{0}^{s} d s^{'} ω (s, s^{'}) \\ \times δ [R (s) - R (s^{'})] + ω_{3} l^{3} \int_{0}^{L} d s \int_{0}^{s} d s^{'} \int_{0}^{s^{'}} d s^{''} \\ \times δ [R (s) - R (s^{'})] δ [R (s^{'}) - R (s^{''})]), \end{matrix}

(1)

where s is the contour length variable of the backbone, R(s) is the position vector at s, L is the total contour length with l denoting the Kuhn length, T is the temperature, and k_b is Boltzmann’s constant with β = 1/(k_bT). ω(s, s′) is the residue pair specific excluded volume, q(s) is charge at s, and ω₃ is the sequence independent repulsive three body interaction parameter. Next, we will use a variational approach—analogous to early work used to determine the average size of homopolymer chains in the presence of inter-monomer interactions^50–54—to compute inter-residue distances. Within this variational scheme,^50–54 we map the Hamiltonian (H_t), with all interactions, to a renormalized Hamiltonian (H_r) defined as

β H_{r} = (\frac{3}{2 l_{r}} \int_{0}^{L} d s {(\frac{d R (s)}{d s})}^{2}),

(2)

where l_r(s₂, s₁) is the renormalized Kuhn length and is a function of all the interaction parameters and specific to the residue pairs s₁, s₂ between which the distance is to be determined. The ensemble average distance between s₁, s₂ is given by $⟨ {(R (s_{1}) - R (s_{2}))}^{2} ⟩ = | s_{2} - s_{1} | l l_{r} (s_{2}, s_{1})$ . Thus, l_r(s₂, s₁) as residue pair specific Kuhn lengths have been introduced to facilitate the calculation of the ensemble average distances between the two residues.^47,54

A. Coil globule transition for ensemble average distance between an arbitrary pair of amino acid residues

Switching from continuous to discrete notation, we replace s₂, s₁ by i, j, with i>j. We also define a new dimensionless quantity x_ij as the ratio of the residue-pair specific renormalized Kuhn length and bare Kuhn length, x_ij = l_r,ij/l. Values of x_ij greater than unity define the coil state, while x_ij less than unity denotes the globule state. Following the procedure from our earlier paper,⁴⁸ we provide the free energy F(x_ij) as a function of x_ij as

\begin{matrix} β F (x_{i j}) & = & \frac{3}{2} (x_{i j} - \ln x_{i j}) + {(\frac{3}{2 π})}^{3 / 2} \frac{Ω_{i j}}{(i - j) x_{i j}^{3 / 2}} \\ + {(\frac{3}{2 π})}^{3} \frac{ω_{3} T_{i j}}{2 (i - j) x_{i j}^{3}} + \frac{2 l_{b}}{π l} \frac{Q_{i j}^{'}}{(i - j)}, \end{matrix}

(3)

where l_b = 7.2(T₀/T)Å, T₀ = 298, N is the total number of amino acids, and Q_ij′, T_ij, and Ω_ij are given by

\begin{matrix} Q_{i j}^{'} & = & \sum_{m = j}^{i} \sum_{n = 1}^{j - 1} q_{m} q_{n} {(m - j)}^{2} A^{'} (m, n, x_{i j}, κ l) + \sum_{m = j + 1}^{i} \sum_{n = j}^{m - 1} q_{m} q_{n} {(m - n)}^{2} A^{'} (m, n, x_{i j}, κ l) \\ + \sum_{m = i + 1}^{N} \sum_{n = 1}^{j - 1} q_{m} q_{n} {(i - j)}^{2} A^{'} (m, n, x_{i j}, κ l) + \sum_{m = i + 1}^{N} \sum_{n = j}^{i} q_{m} q_{n} {(i - n)}^{2} A^{'} (m, n, x_{i j}, κ l), \end{matrix}

(4)

A^{'} (m, n, x, κ l) = [\frac{1}{2} {(\frac{6 π}{x})}^{1 / 2} \frac{1}{{(m - n)}^{3 / 2}} - (κ l) \frac{π}{2} \frac{1}{(m - n)} \exp (\frac{{(κ l)}^{2} x (m - n)}{6}) e r f c (\sqrt{\frac{{(κ l)}^{2} x (m - n)}{6}})],

(5)

\begin{matrix} T_{i j} & = & \sum_{l = j + 1}^{i} \sum_{m = 2}^{j} \sum_{n = 1}^{m - 1} [\frac{{(l - j)}^{2}}{{(l - m)}^{5 / 2} {(m - n)}^{3 / 2}}] + \sum_{l = j + 2}^{i} \sum_{m = j + 1}^{l - 1} \sum_{n = 1}^{j} [\frac{{(l - m)}^{2}}{{(l - m)}^{5 / 2} {(m - n)}^{3 / 2}} + \frac{{(m - j)}^{2}}{{(l - m)}^{3 / 2} {(m - n)}^{5 / 2}}] \\ + \sum_{l = i + 1}^{N} \sum_{m = 2}^{j} \sum_{n = 1}^{m - 1} [\frac{{(i - j)}^{2}}{{(l - m)}^{5 / 2} {(m - n)}^{3 / 2}}] + \sum_{l = i + 1}^{N} \sum_{m = i}^{l - 1} \sum_{n = 1}^{j} [\frac{{(i - j)}^{2}}{{(l - m)}^{3 / 2} {(m - n)}^{5 / 2}}] \\ + \sum_{l = i + 1}^{N} \sum_{m = j + 1}^{i - 1} \sum_{n = 1}^{j} [\frac{{(i - m)}^{2}}{{(l - m)}^{5 / 2} {(m - n)}^{3 / 2}} + \frac{{(m - j)}^{2}}{{(l - m)}^{3 / 2} {(m - n)}^{5 / 2}}] \\ + \sum_{l = j + 3}^{i} \sum_{m = j + 2}^{l - 1} \sum_{n = j + 1}^{m - 1} [\frac{{(l - m)}^{2}}{{(l - m)}^{5 / 2} {(m - n)}^{3 / 2}} + \frac{{(m - n)}^{2}}{{(l - m)}^{3 / 2} {(m - n)}^{5 / 2}}] \\ + \sum_{l = i + 1}^{N} \sum_{m = j + 2}^{i} \sum_{n = j + 1}^{m - 1} [\frac{{(i - m)}^{2}}{{(l - m)}^{5 / 2} {(m - n)}^{3 / 2}} + \frac{{(m - n)}^{2}}{{(l - m)}^{3 / 2} {(m - n)}^{5 / 2}}] + \sum_{l = i + 2}^{N} \sum_{m = i + 1}^{l - 1} \sum_{n = j + 1}^{i} [\frac{{(i - n)}^{2}}{{(l - m)}^{3 / 2} {(m - n)}^{5 / 2}}], \end{matrix}

(6)

Ω_{i j} = \sum_{m = j}^{i} \sum_{n = 1}^{j - 1} ω_{m, n} \frac{{(m - j)}^{2}}{{(m - n)}^{5 / 2}} + \sum_{m = j + 1}^{i} \sum_{n = j}^{m - 1} ω_{m, n} {(m - n)}^{- 1 / 2} + \sum_{m = i + 1}^{N} \sum_{n = 1}^{j - 1} ω_{m, n} \frac{{(i - j)}^{2}}{{(m - n)}^{5 / 2}} + \sum_{m = i + 1}^{N} \sum_{n = j}^{i} ω_{m, n} \frac{{(i - n)}^{2}}{{(m - n)}^{5 / 2}},

(7)

where ω_m,ns are two-body residue pair (m, n) specific non-electrostatic interaction parameters. These are equivalent to ω(s, s′) used in the continuous notation. For all the subsequent application of this formalism, we will assume ω_m,n = ω₂. This approximation implies that the sequence effect of non-electrostatic interactions can be represented by a mean-field term. This is a reasonable assumption since we will limit our examples in this paper on sequences that are highly charged.

The value of x_ij is determined by minimizing F(x_ij). From the minimized x_ij, $⟨ R_{i j}^{2} ⟩$ is computed by using

⟨ R_{i j}^{2} ⟩ = | i - j | x_{i j} l^{2} .

(8)

Thus, the presented formulation provides a full description of the intra-protein conformational ensemble, including the possibility of observing coil-globule transitions for arbitrary (not just the end-to-end) residue-pairs as well. Equation (3) recovers several earlier results under different limits.^50,51,55 See the Appendix for details of the derivation using an equivalent approach similar to the one used earlier.^52,54

B. Protocol for all-atom simulation

We test our theoretical prediction of the distance profiles against all-atom simulations using the CAMPARI Monte Carlo scheme with the ABSINTH implicit solvation model.^56,57 We used CAMPARI for all of the proteins that we simulated: SV25, SV30, DP00877 WT, P0A8H9 WT, P0A8H9 S2T15, P0A8H9 S54S56, and prothymosin-alpha. In a simulation procedure similar to our previous work,^48,49 for each simulation, we used 15 different temperatures for the thermal replica exchange: 280 K, 286 K, 293 K, 298 K, 305 K, 315 K, 325 K, 335 K, 350 K, 365 K, 380 K, 395 K, 410 K, 430 K, and 450 K. Energy considerations, Monte Carlo move sets, and replica swapping procedures are identical to the protocol utilized by Das and Pappu.²⁶ Each of our simulations were run at zero salt, with only explicit neutralizing Na⁺ and Cl⁻ ions being added in a droplet size of 400 Å radius. We ran 12 simulation trajectories at each temperature. All individual simulations were run for at least 1.4 × 10⁷ steps with the first 1.5 × 10⁶ steps being discarded as equilibration, and an output PDB file was generated every 4.0 × 10³ steps. P0A8H9 S2T15 denotes wild type sequence P0A8H9 that has been phosphorylated at S2 and T15 locations. Likewise, P0A8H9 S54S56 has been phosphorylated at S54 and S56. Phosphorylations were mimicked by replacing the serine (S) or threonine (T) residue with glutamic acid (E), a method that has previously been used to represent phosphorylation.⁵⁸

Convergence tests were done by comparing the fluctuations in the distance between each amino acid i, j pair in the first and second halves of each simulation. The fluctuation of a half simulation δd_i,j is defined as

δ d_{i, j} = \frac{σ_{d_{i, j}}^{2}}{{\bar{d_{i, j}}}^{2}},

(9)

where $σ_{d_{i, j}}^{2}$ is the variance of the distance, d_i,j, between any i, j pair of amino acids and ${\bar{d_{i, j}}}^{2}$ is the mean squared of d_i,j. This test was done assuming a self-consistent simulation must reproduce the same statistics captured during the first and second halves of the simulation.⁵⁹ Although this does not ensure convergence, it is a rigorous criterion for self-consistency.⁶⁰ A majority of convergence tests compare cluster distribution probabilities to ensure self-consistency. Our proposed criteria of comparing distance profiles to ensure self-consistency are immune to ad-hoc criteria needed for clustering. Figure 1 in the supplementary material shows this comparison for our simulations. We ensure that R² between the two statistics (first and second half of the trajectory) is at least 0.9, and the slope is between 0.9 and 1.10 (see Table 1provided in the supplementary material). For all proteins, we ran at least 12 individual trajectories at each temperature as noted above. For the cases where 12 individual thermal replica exchange simulations were not sufficient, additional 12 thermal replica exchange simulations were run until our criteria was satisfied. The strategy of “check-extend-check” can be used in simulations of IDPs to ensure self-consistency criteria.

III. RESULT

A. Theory captures intra-chain distance profile similar to all-atom simulation

1. Application to synthetic polyampholytes

Equation (3) can account for sequence specificity for charge (via $Q_{i j}^{'}$ term) and non-charge (via Ω_ij and T_ij) monomers. For the present work, we focus only on the charge patterning effects. Motivated by these criteria, we consider two synthetic sequences that contain only charged amino acids. These two sequences were chosen from the list of 30 heteropolymeric sequences originally created by Das and Pappu.²⁶ These 30 sequences have 25 lysine and 25 glutamic acids but shuffled in different patterning. We have chosen SV25 and SV30 from the original list. We expect the role of non-electrostatic effects to be minimal for these two sequences. Below, we show inter-residue distance profiles of these two sequences computed using the all-atom simulation (upper triangle in Fig. 1).

FIG. 1. — Predicted distance profiles of two synthetic sequences. Color coding denotes different values of x_ij (normalized between 0 and 1) for residue pairs i, j shown in x and y axis. The lower triangle represents the fitted distance map from theory using mean-field parameters ω₃ and ω₂ compared against the all-atom simulation data shown in the upper triangle. The two distance profiles are for the two synthetic sequences SV25 (left) and SV30 (right) from Das and Pappu.²⁶

We compare these distance profiles against our theoretical model using the following steps. As noted earlier, non-electrostatic effects are modeled with a mean-field interaction parameter by assuming ω_m,n≈ω₂ in Eq. (7). The sequence effect of the charge patterning is explicitly calculated using Eq. (4), where glutamic acid and lysine were assigned a charge q = −1 and q = 1, respectively. To determine that ω₂ and ω₃ best describe the distance profile, we chose a set of two residue pairs, say (m, n) and (p, q). Using the simulated values of $⟨ R_{m n}^{2} ⟩$ and $⟨ R_{p q}^{2} ⟩$ , we solved for ω₃, ω₂ specific to the pair (m, n) and (p, q). The linear dependence of x_ij on ω₂, ω₃ [see Eq. (A8) in the Appendix] allows solving these two parameters from the two values. Different solutions of ω₃, ω₂ are obtained by varying the set of pairs of residues.

These solutions are then clustered with the K-means algorithm, and the solution of ω₃, ω₂ was chosen forming the largest cluster. One of the issues in the clustering is the proper choice of the number (K) of clusters. The optimum value of K was determined by employing the “elbow method.” This refers to clustering the set of ω₃, ω₂ with a variable number of K clusters and plotting the average ω₃, ω₂ from the top cluster as a function of K. The optimal number of clusters (K) was chosen where the dependence of the average ω₃, ω₂ on the number of clusters forms an “elbow.” The selected value of ω₃, ω₂ corresponding to the largest cluster with the optimal K was then used to predict the entire distance profile, i.e., $⟨ R_{i j}^{2} ⟩$ for all i, j. Figure 1 (lower triangle) shows the theoretical prediction (in terms of x_ij) for the two synthetic sequences (SV25 and SV30). In spite of adopting a mean-field assumption for non-charge patterning, our theory can capture the main features of the distance profile when compared against the all-atom simulation (upper triangle in Fig. 1). To facilitate comparison between simulation and theory, we normalized the x_ij map between 0 and 1. We conclude that the features of distance profiles primarily induced by electrostatics are reasonably described by our theory. We emphasize that this is a unique feature of the sequence specific model presented here and different from the prediction of the end-to-end distance or scaling exponents typically computed using traditional polymer physics based theories.

2. Application to intrinsically disordered protein

Next, we test our model on naturally occurring proteins. We have chosen three sequences DP00877, prothymosin-alpha, andP0A8H9 from the database of IDPs that have high charge content 40%, 50%, and 29%, respectively. We expect that these proteins will have conformational heterogeneity largely dictated by electrostatics allowing us to adopt the same mean-field approximation for the non-electrostatic patterning. We followed the same procedure, described above, to determine the most likely choice of mean-field values of ω₂ and ω₃ and generated the distance map using our theoretical formalism (lower triangles in Fig. 2). As before, the agreement between all atom simulation and the theoretical model is reasonable. From Figs. 1 and 2, it is evident that our model is able to capture specific features of distance profiles encoded in each sequence. We also note that there are noticeable differences between theory and simulation. Although there is no particular trend in these differences to highlight specific origins of the sources of deviations, we can identify several factors that may cause these differences. All-atom simulations provide a more detailed description of amino acids compared to point-like coarse-grain approximations in the proposed analytical model. Furthermore, we have neglected the non-electrostatic interaction details by invoking a mean-field approximation ω_m,n = ω₂. Interactions between polar–polar, charge–polar, and hydrophobic–hydrophobic are expected to be different causing deviation from the mean-field approximation. Careful parameterization—using different scales and schemes³¹—is needed to disentangle these effects and further improve model predictions.

Black lines in Fig. 2 show noticeable deviation from homopolymer-like behavior. For a homopolymer, we expect similar values of $⟨ R_{i j}^{2} ⟩$ for pairs of residue i, j when |i−j| is fixed. The black lines in Fig. 2 are the locus of all residue pairs (i, j) that have the same |i−j| values. The strong variation in size depicted by changes in the distance profile color indicates that there is a significant heterogeneity at the segmental (inter-residue) level. The all-atom simulation (upper triangle) also exhibits the same variance. Schuler and colleagues noted such translational symmetry breaking in the sm-FRET experiments.⁶¹

B. Phosphorylation hotspots can significantly alter distance profiles

Next, we investigate the effect of phosphorylation that can alter charge patterning and consequently influence the distance profiles. To investigate these effects, we consider the wild type protein P0A8H9 as an illustrative example. In our earlier work,⁴⁸ we identified two different sequence variants of P0A8H9 that yield different end-to-end distances. The two sequences are (i) S2T15, phosphorylated at Serine and Threonine at residue 2 and 15 and (ii) S54S56, phosphorylated at the Serine residue locations 54 and 56. Both the sequences have the same charge content in terms of positively and negatively charged amino acids. We revisited the distance profiles of these two sequences. Figure 3 shows the difference in the distance profiles between thetwo sequences and the wild type sequence. Regions of red indicate that the modified sequence is more expanded compared to the wild type. On the other hand, blue implies the modified sequence is more compact compared to the wild type. The lower triangle shows results from theory compared against the all-atom simulation depicted in the upper triangle. Predominantly, red and blue regions indicate that minor—but subtle—changes in the location of phosphorylation can incur significant changes (expansion or contraction) in the distance profile. Placing the phosphorylation group at positions S2 and T15 (Fig. 3; left panel) causes specific regions to expand (red in the top left and bottom right corners) and a separate region to contract (blue in the bottom left corner). This is in contrast to choosing phosphorylation sites at S54 and S56 (Fig. 3; right panel). For the S54S56 variant, the same red region (expanded) exists in the top left corner. However, the blue region appears in the bottom right corner (lower triangle in Fig. 3; right). We conclude that different phosphorylation sites can be targeted to create compaction in a specific region and expansion in some other region. We also notice there is reasonable agreement between theory (lower triangle in Fig. 3) and the all-atom simulation (upper triangle in Fig. 3).

FIG. 3. — There are hotspots for phosphorylation. The difference in distance profiles (x_ij) quantified between two different phosphorylated forms [S2T15 (left) and S54S56 (right)] of the wild type protein P0A8H9 shows that there are specific sites for phosphorylation to induce drastic conformation changes at all scales. Positive (red) and negative (blue) differences are evident from the heat map. Theoretical results (lower triangle) agree well with the all-atom simulation (upper triangle).

C. Coil-globule transition is residue-pair specific

In addition to quantifying changes in the conformation profile with changes in charge patterning (discussed above), our model can predict response to non-electrostatic effects as well. We demonstrate this by tuning the mean field interaction parameter (ω₂). Changes in this parameter can be envisioned as changes in hydrophobicity³¹ in the sequence and/or temperature.⁴⁸ The changes due to the local environment inside cells arising from non-specific soft interactions^62–65 can also modify ω₂. An intriguing prediction of our model is that different pairs of residues respond differently to these changes (Fig. 4; left panel). As an example, consider globule (x < 1) to coil (x > 1) transitions for two residue pairs: Pair 1 with i = 52, j = 3 (blue curve) and pair 2 with p = 63, q = 48 (red curve). Pair 1 has much more gradual transition compared to pair 2. This example highlights the ability to achieve significant conformational changes at specific locations in the protein by tuning local environment (equivalent to changing ω₂). Intriguingly, upon phosphorylation at positions S2 and T15, the difference between the two curves gets suppressed (Fig. 4; middle panel). The sharpness of the transition significantly reduces for pair 2 compared to its wild type version. Phosphorylation at positions S54 and S56 again shows very different outcomes. The red and blue curves are very different, and the ensemble average distance for pair 2 (red curve) changes much more gradually as compared to the wild-type (Fig. 4; right panel). These findings further showcase the ability to achieve selective distance profiles by simultaneously changing charge patterning (using phosphorylation) and ω₂ (proxy for solution properties). We note that the prediction of sequence specific coil-globule transition for intra-chain distances will be of interest in polymer physics as well.

FIG. 4. — Coil-globule transition is residue pair specific.x_ij values were plotted as a function of the mean-field parameter ω₂ for two pairs of residues, i = 52, j = 3 (blue) and p = 63, q = 48 (red). Each panel corresponds to P0A8H9 WT, S2T15, and S54S56 from left to right, respectively. The vertical black line represents the wild-type ω₂obtained from the all-atom simulation. All ω₃ values were used specific to the sequence obtained from the all-atom simulation using K-means clustering mentioned above. The changes in the mean-field parameter mimic changes due to hydrophobicity, temperature, or local variations in soft interactions inside cells.

IV. CONCLUSION

We present a general formalism that can predict the ensemble average distance between any two monomers in a heteropolymer. The prescription accounts for the sequence effect beyond just the composition. The performance of the model is tested against all-atom simulations for two synthetic sequences and three naturally occurring IDPs that are high in charge content. The model also captures differences in the distance profiles when chemical modifications (such as phosphorylation) alter sequences in even as little as two sites. There are specific hotspots for modification that can cause significant changes in the distance profiles. Our model can predict coil-globule transition for the distance between any two residues. These transitions can significantly depend on the choice of the residue pairs. Although theoretical prediction remarkably captures features similar to all-atom simulation, there are also differences between theory and simulation. The observed discrepancies between all-atom simulation and coarse-grain analytical theory could be due to several reasons, including the mean-field assumption imposed on non-electrostatic two-body interactions. Explicit consideration of these non-electrostatic interactions may further enhance the predictive power of our theoretical formalism. Insights revealed from these studies and the analytical nature of the theory will provide a tool to design heteropolymers (both biological and non-biological) for new applications. At the same time, it will also allow us to investigate the role of mutations and modifications to alter biological functions.

SUPPLEMENTARY MATERIAL

See the supplementary material for figures and tables related to self-consistency criteria of simulated trajectories.

ACKNOWLEDGMENTS

The authors acknowledge support from the NIH (Award No. 1R15GM128162-01A1), KIHA (Knoebel Institute for Health Aging at the University of Denver), PROF (from University of Denver), and NSF (Award No. 1149992). The authors appreciate discussions with Hue Sun Chan, Austin Clark, Taylor Firman, Yi-Hsuan Lin, Murugappan Muthukumar, Benjamin Schuler, and Andrea Sorrano. The authors also acknowledge the computational support from Benjamin Fotovich and the High Performance Computing (HPC) facility at the University of Denver. The simulation data that support the findings of this study are available from the corresponding author upon reasonable request.

Here, we show the equivalence between the free energy based formalism introduced by Muthukumar⁶⁶ given in Eq. (3) and the alternate approach using Edwards and Singh formalism.⁵² In the Edwards–Singh approach, the renormalized Kuhn length l_r(s₂, s1) is determined by demanding [see Eq. (5) in Ref. 47]

{⟨ {(R (s_{2}) - R (s_{1}))}^{2} (H_{r} - H_{t}) ⟩}_{r} = {⟨ {(R (s_{2}) - R (s_{1}))}^{2} ⟩}_{r} {⟨ (H_{r} - H_{t}) ⟩}_{r},

(A1)

where ⟨…⟩_r is the average over the trial Hamiltonian H_r. Using the definition of H_t and H_r from Eqs. (1) and (2), the above equation becomes

\begin{matrix} {⟨{(R (s_{2}) - R (s_{1}))}^{2} \frac{3}{2} (\frac{1}{l_{r}} - \frac{1}{l}) \int_{0}^{L} d s {(\frac{d R (s)}{d s})}^{2}⟩}_{r} - {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\frac{3}{2} (\frac{1}{l_{r}} - \frac{1}{l}) \int_{0}^{L} d s {(\frac{d R (s)}{d s})}^{2}⟩}_{r} \\ = l {⟨{(R (s_{2}) - R (s_{1}))}^{2} \int_{0}^{L} d s \int_{0}^{s} d s^{'} ω (s, s^{'}) δ [R (s) - R (s^{'})]⟩}_{r} \\ - l {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\int_{0}^{L} d s \int_{0}^{s} d s^{'} ω (s, s^{'}) δ [R (s) - R (s^{'})]⟩}_{r} \\ + {⟨{(R (s_{2}) - R (s_{1}))}^{2} \frac{l_{b}}{l^{2}} \int_{0}^{L} d s \int_{0}^{s} d s^{'} q (s) q (s^{'}) \frac{\exp (- κ | R (s) - R (s^{'}) |)}{| R (s) - R (s^{'}) |}⟩}_{r} \\ - {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\frac{l_{b}}{l^{2}} \int_{0}^{L} d s \int_{0}^{s} d s^{'} q (s) q (s^{'}) \frac{\exp (- κ | R (s) - R (s^{'}) |)}{| R (s) - R (s^{'}) |}⟩}_{r} \\ + ω_{3} l^{3} {⟨{(R (s_{2}) - R (s_{1}))}^{2} \int_{0}^{L} d s \int_{0}^{s} d s^{'} \int_{0}^{s^{'}} d s^{''} δ [R (s) - R (s^{'})] δ [R (s^{'}) - R (s^{''})]⟩}_{r} \\ - ω_{3} l^{3} {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\int_{0}^{L} d s \int_{0}^{s} d s^{'} \int_{0}^{s^{'}} d s^{''} δ [R (s) - R (s^{'})] δ [R (s^{'}) - R (s^{''})]⟩}_{r} . \end{matrix}

(A2)

Using Eqs. (A2), (A6), and (A9) from our earlier work,⁴⁷ we note the three following equalities:

{⟨{(R (s_{2}) - R (s_{1}))}^{2} \frac{3}{2} (\frac{1}{l_{r}} - \frac{1}{l}) \int_{0}^{L} d s {(\frac{d R (s)}{d s})}^{2}⟩}_{r} - {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\frac{3}{2} (\frac{1}{l_{r}} - \frac{1}{l}) \int_{0}^{L} d s {(\frac{d R (s)}{d s})}^{2}⟩}_{r} = l_{r}^{2} (s_{2} - s_{1}) (\frac{1}{l_{r}} - \frac{1}{l}),

(A3)

\begin{matrix} {⟨{(R (s_{2}) - R (s_{1}))}^{2} \int_{0}^{L} d s \int_{0}^{s} d s^{'} ω (s, s^{'}) δ [R (s) - R (s^{'})]⟩}_{r} - {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\int_{0}^{L} d s \int_{0}^{s} d s^{'} ω (s, s^{'}) δ [R (s) - R (s^{'})]⟩}_{r} \\ = - \frac{l_{r}^{2}}{3} {(\frac{1}{2 π})}^{3 / 2} {(\frac{3}{l_{r}})}^{5 / 2} [\int_{s_{1}}^{s_{2}} d s \int_{0}^{s_{1}} d s^{'} ω (s, s^{'}) \frac{{(s - s_{1})}^{2}}{{(s - s^{'})}^{5 / 2}} + \int_{s_{1}}^{s_{2}} d s \int_{s_{1}}^{s} d s^{'} ω (s, s^{'}) \frac{1}{{(s - s^{'})}^{1 / 2}} \\ + \int_{s_{2}}^{L} d s \int_{0}^{s_{1}} d s^{'} ω (s, s^{'}) \frac{{(s_{2} - s_{1})}^{2}}{{(s - s^{'})}^{5 / 2}} + \int_{s_{2}}^{L} d s \int_{s_{1}}^{s_{2}} d s^{'} ω (s, s^{'}) \frac{{(s_{2} - s^{'})}^{2}}{{(s - s^{'})}^{5 / 2}}], \end{matrix}

(A4)

and

\begin{matrix} {⟨{(R (s_{2}) - R (s_{1}))}^{2} \frac{l_{b}}{l^{2}} \int_{0}^{L} d s \int_{0}^{s} d s^{'} q (s) q (s^{'}) \frac{\exp (- κ | R (s) - R (s^{'}) |)}{| R (s) - R (s^{'}) |}⟩}_{r} \\ - {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\frac{l_{b}}{l^{2}} \int_{0}^{L} d s \int_{0}^{s} d s^{'} q (s) q (s^{'}) \frac{\exp (- κ | R (s) - R (s^{'}) |)}{| R (s) - R (s^{'}) |}⟩}_{r} \\ = - \frac{4 π l_{b}}{l^{2}} \frac{l_{r}^{2}}{9} [\int_{s_{1}}^{s_{2}} d s \int_{0}^{s_{1}} d s^{'} q (s) q (s^{'}) {(s - s_{1})}^{2} \int \frac{d^{3} k k^{2}}{{(2 π)}^{3} (k^{2} + κ^{2})} \exp (- \frac{k^{2} l_{r} | s - s^{'} |}{6}) \\ + \int_{s_{1}}^{s_{2}} d s \int_{s_{1}}^{s} d s^{'} q (s) q (s^{'}) {(s - s^{'})}^{2} \int \frac{d^{3} k k^{2}}{{(2 π)}^{3} (k^{2} + κ^{2})} \exp (- \frac{k^{2} l_{r} | s - s^{'} |}{6}) \\ + \int_{s_{2}}^{L} d s \int_{0}^{s_{1}} d s^{'} q (s) q (s^{'}) {(s_{2} - s_{1})}^{2} \int \frac{d^{3} k k^{2}}{{(2 π)}^{3} (k^{2} + κ^{2})} \exp (- \frac{k^{2} l_{r} | s - s^{'} |}{6}) \\ + \int_{s_{2}}^{L} d s \int_{s_{1}}^{s_{2}} d s^{'} q (s) q (s^{'}) {(s_{2} - s^{'})}^{2} \int \frac{d^{3} k k^{2}}{{(2 π)}^{3} (k^{2} + κ^{2})} \exp (- \frac{k^{2} l_{r} | s - s^{'} |}{6})], \end{matrix}

(A5)

where we further use the identity⁵⁰

\begin{matrix} \int \frac{d^{3} k k^{2}}{(k^{2} + κ^{2})} \exp (- \frac{k^{2} l_{r} | s - s^{'} |}{6}) \\ = 4 π [\frac{π^{1 / 2}}{4} {(\frac{6}{l_{r} (s - s^{'})})}^{3 / 2} - \frac{π^{1 / 2}}{2} κ^{2} {(\frac{6}{l_{r} (s - s^{'})})}^{1 / 2} \\ + \frac{π}{2} κ^{3} \exp (\frac{κ^{2} l_{r} (s - s^{'})}{6}) e r f c (\sqrt{\frac{κ^{2} l_{r} (s - s^{'})}{6}})] . \end{matrix}

(A6)

The contribution due to the new term involving the three body interaction is calculated as

\begin{matrix} ω_{3} l^{3} {⟨{(R (s_{2}) - R (s_{1}))}^{2} \int_{0}^{L} d s \int_{0}^{s} d s^{'} \int_{0}^{s^{'}} d s^{''} δ [R (s) - R (s^{'})] δ [R (s^{'}) - R (s^{''})]⟩}_{r} \\ - ω_{3} l^{3} {⟨{(R (s_{2}) - R (s_{1}))}^{2}⟩}_{r} {⟨\int_{0}^{L} d s \int_{0}^{s} d s^{'} \int_{0}^{s^{'}} d s^{''} δ [R (s) - R (s^{'})] δ [R (s^{'}) - R (s^{''})]⟩}_{r} \\ = - ω_{3} l^{3} {(\frac{3}{2 π})}^{3} \frac{1}{l_{r}^{2}} [\int_{s_{1}}^{s_{2}} d s \int_{0}^{s_{1}} d s^{'} \int_{0}^{s^{'}} d s^{''} [\frac{{(s - s_{1})}^{2}}{{(s - s^{'})}^{5 / 2} {(s^{'} - s^{''})}^{3 / 2}}] \\ + \int_{s_{1}}^{s_{2}} d s \int_{s_{1}}^{s} d s^{'} \int_{0}^{s_{1}} d s^{''} [\frac{{(s - s^{'})}^{2}}{{(s - s^{'})}^{5 / 2} {(s^{'} - s^{''})}^{3 / 2}} + \frac{{(s^{'} - s_{1})}^{2}}{{(s - s^{'})}^{3 / 2} {(s^{'} - s^{''})}^{5 / 2}}] \\ + \int_{s_{2}}^{L} d s \int_{0}^{s_{1}} d s^{'} \int_{0}^{s^{'}} d s^{''} [\frac{{(s_{2} - s_{1})}^{2}}{{(s - s^{'})}^{5 / 2} {(s^{'} - s^{''})}^{3 / 2}}] + \int_{s_{2}}^{L} d s \int_{s_{2}}^{s} d s^{'} \int_{0}^{s_{1}} d s^{''} [\frac{{(s_{2} - s_{1})}^{2}}{{(s - s^{'})}^{3 / 2} {(s^{'} - s^{''})}^{5 / 2}}] \\ + \int_{s_{2}}^{L} d s \int_{s_{1}}^{s_{2}} d s^{'} \int_{0}^{s_{1}} d s^{''} [\frac{{(s_{2} - s^{'})}^{2}}{{(s - s^{'})}^{5 / 2} {(s^{'} - s^{''})}^{3 / 2}} + \frac{{(s^{'} - s_{1})}^{2}}{{(s - s^{'})}^{3 / 2} {(s^{'} - s^{''})}^{5 / 2}}] \\ + \int_{s_{1}}^{s_{2}} d s \int_{s_{1}}^{s} d s^{'} \int_{s_{1}}^{s^{'}} d s^{''} [\frac{{(s - s^{'})}^{2}}{{(s - s^{'})}^{5 / 2} {(s^{'} - s^{''})}^{3 / 2}} + \frac{{(s^{'} - s^{''})}^{2}}{{(s - s^{'})}^{3 / 2} {(s^{'} - s^{''})}^{5 / 2}}] \\ \times \int_{s_{2}}^{L} d s \int_{s_{1}}^{s_{2}} d s^{'} \int_{s_{1}}^{s^{'}} d s^{''} [\frac{{(s_{2} - s^{'})}^{2}}{{(s - s^{'})}^{5 / 2} {(s^{'} - s^{''})}^{3 / 2}} + \frac{{(s^{'} - s^{''})}^{2}}{{(s - s^{'})}^{3 / 2} {(s^{'} - s^{''})}^{5 / 2}}] \\ + \int_{s_{2}}^{L} d s \int_{s_{2}}^{s} d s^{'} \int_{s_{1}}^{s_{2}} d s^{''} \frac{{(s_{2} - s^{''})}^{2}}{{(s - s^{'})}^{3 / 2} {(s^{'} - s^{''})}^{5 / 2}}] . \end{matrix}

(A7)

Combining Eqs. (A2)–(A7) and using discrete notation by assigning s₂ = i, s₁ = j, s = l, s′ = m, s″ = n, and x_ij = l_r,ij/l, we get the final equation

\begin{matrix} x_{i j}^{3 / 2} (i - j) (1 - 1 / x_{i j}) & = & {(\frac{3}{2 π})}^{3 / 2} \frac{Ω_{i j}}{x_{i j}} + {(\frac{3}{2 π})}^{3} ω_{3} \frac{T_{i j}}{x_{i j}^{5 / 2}} \\ + \frac{1}{9} \frac{4 π l_{b}}{l} \frac{2}{{(2 π)}^{2}} Q_{i j}, \end{matrix}

(A8)

where Ω_ij and T_ij are identical to Eqs. (7) and (6). Q_ij is defined as

\begin{matrix} Q_{i j} & = & \sum_{m = j}^{i} \sum_{n = 1}^{j - 1} q_{m} q_{n} A (m, n, x_{i j}, κ l) {(m - j)}^{2} \\ + \sum_{m = j + 1}^{i} \sum_{n = j}^{m - 1} q_{m} q_{n} A (m, n, x_{i j}, κ l) {(m - n)}^{2} \\ + \sum_{m = i + 1}^{N} \sum_{n = 1}^{j - 1} q_{m} q_{n} A (m, n, x_{i j}, κ l) {(i - j)}^{2} \\ + \sum_{m = i + 1}^{N} \sum_{n = j}^{i} q_{m} q_{n} A (m, n, x_{i j}, κ l) {(i - n)}^{2}, \end{matrix}

(A9)

with A(m, n, x_ij, κl) as

\begin{matrix} A (m, n, x_{i j}, κ l) & = & x_{i j}^{3 / 2} [\frac{π^{1 / 2}}{4} {(\frac{6}{x_{i j}})}^{3 / 2} \frac{1}{{(m - n)}^{3 / 2}} - \frac{π^{1 / 2}}{2} {(κ l)}^{2} \\ \times {(\frac{6}{x_{i j}})}^{1 / 2} \frac{1}{{(m - n)}^{1 / 2}} + \frac{π}{2} {(κ l)}^{3} \\ \times \exp (\frac{{(κ l)}^{2} x_{i j} (m - n)}{6}) e r f c \\ \times (\sqrt{\frac{{(κ l)}^{2} x_{i j} (m - n)}{6}})] . \end{matrix}

(A10)

Minimization of the free energy Eq. (3) with respect to x_ij, i.e.,∂[βF(x_ij)]/∂x_ij = 0, yields an equation identical to Eq. (A8). In establishing the equivalence, we also used the following relation:

\frac{\partial A^{'}}{\partial x_{i j}} = - \frac{1}{6} \frac{A}{x_{i j}^{3 / 2}} .

(A11)

The above derivation shows the validity of our approach. However, we use the free energy minimization equation instead of Eq. (A8) due to the ease of computation when multiple solutions (due to two minima and one maxima in the free energy) may arise.

REFERENCES

1.Dunker A. K., Silman I., Uversky V. N., and Sussman J. L., Curr. Opin. Struct. Biol. 18, 756 (2008). 10.1016/j.sbi.2008.10.002 [DOI] [PubMed] [Google Scholar]
2.Mittag T., Kay L. E., and Forman-Kay J. D., J. Mol. Recognit. 23, 105 (2010). 10.1002/jmr.961 [DOI] [PubMed] [Google Scholar]
3.Xie H., Vucetic S., Iakoucheva L. M., Oldfield C. J., Dunker A. K., Uversky V. N., and Obradovic Z., J. Proteome Res. 6, 1882 (2007). 10.1021/pr060392u [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Tompa P., Trends Biochem. Sci. 37, 509 (2012). 10.1016/j.tibs.2012.08.004 [DOI] [PubMed] [Google Scholar]
5.Tompa P., Chem. Rev. 114, 6715 (2014). 10.1021/cr4005082 [DOI] [PubMed] [Google Scholar]
6.Flock T., Weatheritt R. J., Latysheva N. S., and Babu M. M., Curr. Opin. Struct. Biol. 26, 62 (2014). 10.1016/j.sbi.2014.05.007 [DOI] [PubMed] [Google Scholar]
7.Drake J. A. and Pettitt B. M., Biophys. J. 114, 2799 (2018). 10.1016/j.bpj.2018.04.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Zhou H.-X., Trends Biochem. Sci. 37, 43 (2012). 10.1016/j.tibs.2011.11.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Mittag T., Marsh J., Grishaev A., Orlicky S., Lin H., Sicheri F., Tyers M., and Forman-Kay J. D., Structure 18, 494 (2010). 10.1016/j.str.2010.01.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Vuzman D., Azia A., and Levy Y., J. Mol. Biol. 396, 674 (2010). 10.1016/j.jmb.2009.11.056 [DOI] [PubMed] [Google Scholar]
11.Liu J., Faeder J. R., and Camacho C. J., Proc. Natl. Acad. Sci. U. S. A. 106, 19819 (2009). 10.1073/pnas.0907710106 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Hilser V. J. and Thompson E. B., Proc. Natl. Acad. Sci. U. S. A. 104, 8311 (2007). 10.1073/pnas.0700329104 [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Shoemaker B. A., Portman J. J., and Wolynes P. G., Proc. Natl. Acad. Sci. U. S. A. 97, 8868 (2000). 10.1073/pnas.160259697 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Lin Y.-H. and Chan H. S., Biophys. J. 112, 2043 (2017). 10.1016/j.bpj.2017.04.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Lin Y.-H., Brady J. P., Forman-Kay J. D., and Chan H. S., New J. Phys. 19, 115003 (2017). 10.1088/1367-2630/aa9369 [DOI] [Google Scholar]
16.Lin Y. H., Forman-Kay J. D., and Chan H. S., Biochemistry 57, 2499 (2018). 10.1021/acs.biochem.8b00058 [DOI] [PubMed] [Google Scholar]
17.Dignon G. L., Zheng W., Best R. B., Kim Y. C., and Mittal J., Proc. Natl. Acad. Sci. U. S. A. 115, 9929 (2018). 10.1073/pnas.1804177115 [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Lin Y.-H., Brady J. P., Chan H. S., and Ghosh K., J. Chem. Phys. 152, 045102 (2020). 10.1063/1.5139661 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Bergeron-Sandoval L. P., Heris H. K., Chang C., Cornell C. E., Keller S. L., Hendricks A. G., Ehrlicher A. J., Francois P., Pappu R. V., and Michnick S. W., “Endocytosis caused by liquid-liquid phase separation of proteins,” bioRxiv:145664 (2018).
20.Strom A. R., Emelyanov A. V., Mir M., Fyodorov D. V., Darzacq X., and Karpen G. H., Nature 547, 241 (2017). 10.1038/nature22989 [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Larson A. G., Elnatan D., Keenen M. M., Trnka M. J., Johnston J. B., Burlingame A. L., Agard D. A., Redding S., and Narlikar G. J., Nature 547, 236 (2017). 10.1038/nature22822 [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Plys A. J. and Kingston R. E., Science 361, 329 (2018). 10.1126/science.aau4795 [DOI] [PubMed] [Google Scholar]
23.Cho W.-K., Spille J.-H., Hecht M., Lee C., Li C., Grube V., and Cisse I. I., Science 361, 412 (2018). 10.1126/science.aar4199 [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Sabari B. R. et al. , Science 361, eaar3958 (2018). 10.1126/science.aar3958 [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Srivastava D. and Muthukumar M., Macromolecules 29, 2324 (1996). 10.1021/ma951204g [DOI] [Google Scholar]
26.Das R. K. and Pappu R. V., Proc. Natl. Acad. Sci. U. S. A. 110, 13392 (2013). 10.1073/pnas.1304749110 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Zerze G. H., Best R. B., and Mittal J., J. Phys. Chem. B 119, 14622 (2015). 10.1021/acs.jpcb.5b08619 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Dignon G. L., Zheng W., Kim Y. C., Best R. B., and Mittal J., PLoS Comput. Biol. 14, e1005941 (2018). 10.1371/journal.pcbi.1005941 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Baul U., Chakraborty D., Mugnai M. L., Straub J. E., and Thirumalai D., J. Phys. Chem. B 123, 3462 (2019). 10.1021/acs.jpcb.9b02575 [DOI] [PMC free article] [PubMed] [Google Scholar]
30.McCarty J., Delaney K. T., Danielsen S. P. O., Fredrickson G. H., and Shea J.-E., J. Phys. Chem. Lett. 10, 1644 (2019). 10.1021/acs.jpclett.9b00099 [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Zheng W., Dignon G., Brown M., Kim Y.C., and Mittal J., J. Phys. Chem. Lett. “Hydropathy patterning complements charge patterning to describe conformational preferences of disordered proteins” (published online 2020). 10.1021/acs.jpclett.Oc00288 [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Marsh J. A. and Forman-Kay J. D., Biophys. J. 98, 2383 (2010). 10.1016/j.bpj.2010.02.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Muller-Spath S., Soranno A., Hirschfeld V., Hofmann H., Ruegger S., Reymond L., Nettels D., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 107, 14609 (2010). 10.1073/pnas.1001743107 [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Hofmann H., Soranno A., Borgia A., Gast K., Nettels D., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 109, 16155 (2012). 10.1073/pnas.1207719109 [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Shen K. and Wang Z.-G., Macromolecules 51, 1706 (2018). 10.1021/acs.macromol.7b02685 [DOI] [Google Scholar]
36.Danielson S. P. O., McCarty J., Shea J. E., Delaney K. T., and Fredrickson G. H., Proc. Natl. Acad. Sci. U. S. A. 116, 8224 (2019). 10.1073/pnas.1900435116 [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Danielsen S. P. O., McCarty J., Shea J.-E., Delaney K. T., and Fredrickson G. H., J. Chem. Phys. 151, 034904 (2019). 10.1063/1.5109045 [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Madinya J. J., Chang L. W., Perry S. L., and Sing C. E., Mol. Syst. Des. Eng. 5, 632 (2019). 10.1039/C9ME00074G [DOI] [Google Scholar]
39.Lytle T. K., Chang L.-W., Markiewicz N., Perry S. L., and Sing C. E., ACS Cent. Sci. 5, 709 (2019). 10.1021/acscentsci.9b00087 [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Dzuricky M., Roberts S., and Chilkoti A., Biochemistry 57, 2405 (2018). 10.1021/acs.biochem.8b00056 [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Best R. B., Zheng W., and Mittal J., J. Chem. Theory Comput. 10, 5113 (2014). 10.1021/ct500569b [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Hicks A. and Zhou H.-X., J. Chem. Phys. 149, 072313 (2018). 10.1063/1.5027409 [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Higgs P. G. and Joanny J. F., J. Chem. Phys. 94, 1543 (1991). 10.1063/1.460012 [DOI] [Google Scholar]
44.Dobrynin A. V., Colby R. H., and Rubinstein M., J. Polym. Sci. B 42, 3513 (2004). 10.1002/polb.20207 [DOI] [Google Scholar]
45.Gutin A. M. and Shakhnovich E. I., Phys. Rev. E 50, R3322 (1994). 10.1103/physreve.50.r3322 [DOI] [PubMed] [Google Scholar]
46.Samanta H. S., Chakraborty D., and Thirumalai D., J. Chem. Phys. 149, 163323 (2018). 10.1063/1.5035428 [DOI] [PubMed] [Google Scholar]
47.Sawle L. and Ghosh K., J. Chem. Phys. 143, 085101 (2015). 10.1063/1.4929391 [DOI] [PubMed] [Google Scholar]
48.Firman T. and Ghosh K., J. Chem. Phys. 148, 123305 (2018). 10.1063/1.5005821 [DOI] [PubMed] [Google Scholar]
49.Huihui J., Firman T., and Ghosh K., J. Chem. Phys. 149, 085101 (2018). 10.1063/1.5037727 [DOI] [PubMed] [Google Scholar]
50.Muthukumar M., J. Chem. Phys. 86, 7230 (1987). 10.1063/1.452763 [DOI] [Google Scholar]
51.Ha B.-Y. and Thirumalai D., Phys. Rev. A 46, R3012 (1992). 10.1103/physreva.46.r3012 [DOI] [PubMed] [Google Scholar]
52.Edwards S. F. and Singh P., J. Chem. Soc. Faraday Trans. 2 75, 1020 (1979). 10.1039/f29797501020 [DOI] [Google Scholar]
53.Ghosh K., Carri G. A., and Muthukumar M., J. Chem. Phys. 115, 4367 (2001). 10.1063/1.1386924 [DOI] [Google Scholar]
54.Ghosh K. and Muthukumar M., J. Polym. Sci., Part B: Polym. Phys. 39, 2644 (2001). 10.1002/polb.10016 [DOI] [Google Scholar]
55.Muthukumar M., J. Chem. Phys. 81, 6272 (1984). 10.1063/1.447585 [DOI] [Google Scholar]
56.Vitalis A. and Pappu R. V., Annu. Rep. Comput. Chem. 5, 49 (2009). 10.1016/s1574-1400(09)00503-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Vitalis A. and Pappu R. V., J. Comput. Chem. 30, 673 (2009). 10.1002/jcc.21005 [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Martin E. W., Holehouse A. S., Grace C. R., Hughes A., Pappu R. V., and Mittag T., J. Am. Chem. Soc. 138, 15323 (2016). 10.1021/jacs.6b10272 [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Lyman E. and Zuckerman D. M., Biophys. J. 91, 164 (2006). 10.1529/biophysj.106.082941 [DOI] [PMC free article] [PubMed] [Google Scholar]
60.Sawle L. and Ghosh K., J. Chem. Theory Comput. 12, 861 (2016). 10.1021/acs.jctc.5b00999 [DOI] [PubMed] [Google Scholar]
61.Wuttke R., Hofmann H., Nettels D., Borgia M., Mittal J., Best R., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 111, 5213 (2014). 10.1073/pnas.1313006111 [DOI] [PMC free article] [PubMed] [Google Scholar]
62.Guo M., Xu Y., and Gruebele M., Proc. Natl. Acad. Sci. U. S. A. 109, 17863 (2012). 10.1073/pnas.1201797109 [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Theillet F.-X., Binolfi A., Frembgen-Kesner T., Hingorani K., Sarkar M., Kyne C., Li C., Crowley P. B., Gierasch L., Pielak G. J. et al. , Chem. Rev. 114, 6661 (2014). 10.1021/cr400695p [DOI] [PMC free article] [PubMed] [Google Scholar]
64.Yu I., Mori T., Ando T., Harada R., Jung J., Sugita Y., and Feig M., eLife 5, e19274 (2016). 10.7554/elife.19274 [DOI] [PMC free article] [PubMed] [Google Scholar]
65.Ribeiro S., Ebbinghaus S., and Marcos J. C., FEBS Lett. 592, 3040 (2018). 10.1002/1873-3468.13211 [DOI] [PubMed] [Google Scholar]
66.Muthukumar M., J. Chem. Phys. 120, 9343 (2004). 10.1063/1.1701839 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

See the supplementary material for figures and tables related to self-consistency criteria of simulated trajectories.

[c1] 1.Dunker A. K., Silman I., Uversky V. N., and Sussman J. L., Curr. Opin. Struct. Biol. 18, 756 (2008). 10.1016/j.sbi.2008.10.002 [DOI] [PubMed] [Google Scholar]

[c2] 2.Mittag T., Kay L. E., and Forman-Kay J. D., J. Mol. Recognit. 23, 105 (2010). 10.1002/jmr.961 [DOI] [PubMed] [Google Scholar]

[c3] 3.Xie H., Vucetic S., Iakoucheva L. M., Oldfield C. J., Dunker A. K., Uversky V. N., and Obradovic Z., J. Proteome Res. 6, 1882 (2007). 10.1021/pr060392u [DOI] [PMC free article] [PubMed] [Google Scholar]

[c4] 4.Tompa P., Trends Biochem. Sci. 37, 509 (2012). 10.1016/j.tibs.2012.08.004 [DOI] [PubMed] [Google Scholar]

[c5] 5.Tompa P., Chem. Rev. 114, 6715 (2014). 10.1021/cr4005082 [DOI] [PubMed] [Google Scholar]

[c6] 6.Flock T., Weatheritt R. J., Latysheva N. S., and Babu M. M., Curr. Opin. Struct. Biol. 26, 62 (2014). 10.1016/j.sbi.2014.05.007 [DOI] [PubMed] [Google Scholar]

[c7] 7.Drake J. A. and Pettitt B. M., Biophys. J. 114, 2799 (2018). 10.1016/j.bpj.2018.04.027 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c8] 8.Zhou H.-X., Trends Biochem. Sci. 37, 43 (2012). 10.1016/j.tibs.2011.11.002 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c9] 9.Mittag T., Marsh J., Grishaev A., Orlicky S., Lin H., Sicheri F., Tyers M., and Forman-Kay J. D., Structure 18, 494 (2010). 10.1016/j.str.2010.01.020 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c10] 10.Vuzman D., Azia A., and Levy Y., J. Mol. Biol. 396, 674 (2010). 10.1016/j.jmb.2009.11.056 [DOI] [PubMed] [Google Scholar]

[c11] 11.Liu J., Faeder J. R., and Camacho C. J., Proc. Natl. Acad. Sci. U. S. A. 106, 19819 (2009). 10.1073/pnas.0907710106 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c12] 12.Hilser V. J. and Thompson E. B., Proc. Natl. Acad. Sci. U. S. A. 104, 8311 (2007). 10.1073/pnas.0700329104 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c13] 13.Shoemaker B. A., Portman J. J., and Wolynes P. G., Proc. Natl. Acad. Sci. U. S. A. 97, 8868 (2000). 10.1073/pnas.160259697 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c14] 14.Lin Y.-H. and Chan H. S., Biophys. J. 112, 2043 (2017). 10.1016/j.bpj.2017.04.021 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c15] 15.Lin Y.-H., Brady J. P., Forman-Kay J. D., and Chan H. S., New J. Phys. 19, 115003 (2017). 10.1088/1367-2630/aa9369 [DOI] [Google Scholar]

[c16] 16.Lin Y. H., Forman-Kay J. D., and Chan H. S., Biochemistry 57, 2499 (2018). 10.1021/acs.biochem.8b00058 [DOI] [PubMed] [Google Scholar]

[c17] 17.Dignon G. L., Zheng W., Best R. B., Kim Y. C., and Mittal J., Proc. Natl. Acad. Sci. U. S. A. 115, 9929 (2018). 10.1073/pnas.1804177115 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c18] 18.Lin Y.-H., Brady J. P., Chan H. S., and Ghosh K., J. Chem. Phys. 152, 045102 (2020). 10.1063/1.5139661 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c19] 19.Bergeron-Sandoval L. P., Heris H. K., Chang C., Cornell C. E., Keller S. L., Hendricks A. G., Ehrlicher A. J., Francois P., Pappu R. V., and Michnick S. W., “Endocytosis caused by liquid-liquid phase separation of proteins,” bioRxiv:145664 (2018).

[c20] 20.Strom A. R., Emelyanov A. V., Mir M., Fyodorov D. V., Darzacq X., and Karpen G. H., Nature 547, 241 (2017). 10.1038/nature22989 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c21] 21.Larson A. G., Elnatan D., Keenen M. M., Trnka M. J., Johnston J. B., Burlingame A. L., Agard D. A., Redding S., and Narlikar G. J., Nature 547, 236 (2017). 10.1038/nature22822 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c22] 22.Plys A. J. and Kingston R. E., Science 361, 329 (2018). 10.1126/science.aau4795 [DOI] [PubMed] [Google Scholar]

[c23] 23.Cho W.-K., Spille J.-H., Hecht M., Lee C., Li C., Grube V., and Cisse I. I., Science 361, 412 (2018). 10.1126/science.aar4199 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c24] 24.Sabari B. R. et al. , Science 361, eaar3958 (2018). 10.1126/science.aar3958 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c25] 25.Srivastava D. and Muthukumar M., Macromolecules 29, 2324 (1996). 10.1021/ma951204g [DOI] [Google Scholar]

[c26] 26.Das R. K. and Pappu R. V., Proc. Natl. Acad. Sci. U. S. A. 110, 13392 (2013). 10.1073/pnas.1304749110 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c27] 27.Zerze G. H., Best R. B., and Mittal J., J. Phys. Chem. B 119, 14622 (2015). 10.1021/acs.jpcb.5b08619 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c28] 28.Dignon G. L., Zheng W., Kim Y. C., Best R. B., and Mittal J., PLoS Comput. Biol. 14, e1005941 (2018). 10.1371/journal.pcbi.1005941 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c29] 29.Baul U., Chakraborty D., Mugnai M. L., Straub J. E., and Thirumalai D., J. Phys. Chem. B 123, 3462 (2019). 10.1021/acs.jpcb.9b02575 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c30] 30.McCarty J., Delaney K. T., Danielsen S. P. O., Fredrickson G. H., and Shea J.-E., J. Phys. Chem. Lett. 10, 1644 (2019). 10.1021/acs.jpclett.9b00099 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c31] 31.Zheng W., Dignon G., Brown M., Kim Y.C., and Mittal J., J. Phys. Chem. Lett. “Hydropathy patterning complements charge patterning to describe conformational preferences of disordered proteins” (published online 2020). 10.1021/acs.jpclett.Oc00288 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c32] 32.Marsh J. A. and Forman-Kay J. D., Biophys. J. 98, 2383 (2010). 10.1016/j.bpj.2010.02.006 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c33] 33.Muller-Spath S., Soranno A., Hirschfeld V., Hofmann H., Ruegger S., Reymond L., Nettels D., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 107, 14609 (2010). 10.1073/pnas.1001743107 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c34] 34.Hofmann H., Soranno A., Borgia A., Gast K., Nettels D., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 109, 16155 (2012). 10.1073/pnas.1207719109 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c35] 35.Shen K. and Wang Z.-G., Macromolecules 51, 1706 (2018). 10.1021/acs.macromol.7b02685 [DOI] [Google Scholar]

[c36] 36.Danielson S. P. O., McCarty J., Shea J. E., Delaney K. T., and Fredrickson G. H., Proc. Natl. Acad. Sci. U. S. A. 116, 8224 (2019). 10.1073/pnas.1900435116 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c37] 37.Danielsen S. P. O., McCarty J., Shea J.-E., Delaney K. T., and Fredrickson G. H., J. Chem. Phys. 151, 034904 (2019). 10.1063/1.5109045 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c38] 38.Madinya J. J., Chang L. W., Perry S. L., and Sing C. E., Mol. Syst. Des. Eng. 5, 632 (2019). 10.1039/C9ME00074G [DOI] [Google Scholar]

[c39] 39.Lytle T. K., Chang L.-W., Markiewicz N., Perry S. L., and Sing C. E., ACS Cent. Sci. 5, 709 (2019). 10.1021/acscentsci.9b00087 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c40] 40.Dzuricky M., Roberts S., and Chilkoti A., Biochemistry 57, 2405 (2018). 10.1021/acs.biochem.8b00056 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c41] 41.Best R. B., Zheng W., and Mittal J., J. Chem. Theory Comput. 10, 5113 (2014). 10.1021/ct500569b [DOI] [PMC free article] [PubMed] [Google Scholar]

[c42] 42.Hicks A. and Zhou H.-X., J. Chem. Phys. 149, 072313 (2018). 10.1063/1.5027409 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c43] 43.Higgs P. G. and Joanny J. F., J. Chem. Phys. 94, 1543 (1991). 10.1063/1.460012 [DOI] [Google Scholar]

[c44] 44.Dobrynin A. V., Colby R. H., and Rubinstein M., J. Polym. Sci. B 42, 3513 (2004). 10.1002/polb.20207 [DOI] [Google Scholar]

[c45] 45.Gutin A. M. and Shakhnovich E. I., Phys. Rev. E 50, R3322 (1994). 10.1103/physreve.50.r3322 [DOI] [PubMed] [Google Scholar]

[c46] 46.Samanta H. S., Chakraborty D., and Thirumalai D., J. Chem. Phys. 149, 163323 (2018). 10.1063/1.5035428 [DOI] [PubMed] [Google Scholar]

[c47] 47.Sawle L. and Ghosh K., J. Chem. Phys. 143, 085101 (2015). 10.1063/1.4929391 [DOI] [PubMed] [Google Scholar]

[c48] 48.Firman T. and Ghosh K., J. Chem. Phys. 148, 123305 (2018). 10.1063/1.5005821 [DOI] [PubMed] [Google Scholar]

[c49] 49.Huihui J., Firman T., and Ghosh K., J. Chem. Phys. 149, 085101 (2018). 10.1063/1.5037727 [DOI] [PubMed] [Google Scholar]

[c50] 50.Muthukumar M., J. Chem. Phys. 86, 7230 (1987). 10.1063/1.452763 [DOI] [Google Scholar]

[c51] 51.Ha B.-Y. and Thirumalai D., Phys. Rev. A 46, R3012 (1992). 10.1103/physreva.46.r3012 [DOI] [PubMed] [Google Scholar]

[c52] 52.Edwards S. F. and Singh P., J. Chem. Soc. Faraday Trans. 2 75, 1020 (1979). 10.1039/f29797501020 [DOI] [Google Scholar]

[c53] 53.Ghosh K., Carri G. A., and Muthukumar M., J. Chem. Phys. 115, 4367 (2001). 10.1063/1.1386924 [DOI] [Google Scholar]

[c54] 54.Ghosh K. and Muthukumar M., J. Polym. Sci., Part B: Polym. Phys. 39, 2644 (2001). 10.1002/polb.10016 [DOI] [Google Scholar]

[c55] 55.Muthukumar M., J. Chem. Phys. 81, 6272 (1984). 10.1063/1.447585 [DOI] [Google Scholar]

[c56] 56.Vitalis A. and Pappu R. V., Annu. Rep. Comput. Chem. 5, 49 (2009). 10.1016/s1574-1400(09)00503-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c57] 57.Vitalis A. and Pappu R. V., J. Comput. Chem. 30, 673 (2009). 10.1002/jcc.21005 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c58] 58.Martin E. W., Holehouse A. S., Grace C. R., Hughes A., Pappu R. V., and Mittag T., J. Am. Chem. Soc. 138, 15323 (2016). 10.1021/jacs.6b10272 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c59] 59.Lyman E. and Zuckerman D. M., Biophys. J. 91, 164 (2006). 10.1529/biophysj.106.082941 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c60] 60.Sawle L. and Ghosh K., J. Chem. Theory Comput. 12, 861 (2016). 10.1021/acs.jctc.5b00999 [DOI] [PubMed] [Google Scholar]

[c61] 61.Wuttke R., Hofmann H., Nettels D., Borgia M., Mittal J., Best R., and Schuler B., Proc. Natl. Acad. Sci. U. S. A. 111, 5213 (2014). 10.1073/pnas.1313006111 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c62] 62.Guo M., Xu Y., and Gruebele M., Proc. Natl. Acad. Sci. U. S. A. 109, 17863 (2012). 10.1073/pnas.1201797109 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c63] 63.Theillet F.-X., Binolfi A., Frembgen-Kesner T., Hingorani K., Sarkar M., Kyne C., Li C., Crowley P. B., Gierasch L., Pielak G. J. et al. , Chem. Rev. 114, 6661 (2014). 10.1021/cr400695p [DOI] [PMC free article] [PubMed] [Google Scholar]

[c64] 64.Yu I., Mori T., Ando T., Harada R., Jung J., Sugita Y., and Feig M., eLife 5, e19274 (2016). 10.7554/elife.19274 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c65] 65.Ribeiro S., Ebbinghaus S., and Marcos J. C., FEBS Lett. 592, 3040 (2018). 10.1002/1873-3468.13211 [DOI] [PubMed] [Google Scholar]

[c66] 66.Muthukumar M., J. Chem. Phys. 120, 9343 (2004). 10.1063/1.1701839 [DOI] [PubMed] [Google Scholar]

PERMALINK

An analytical theory to describe sequence-specific inter-residue distance profiles for polyampholytes and intrinsically disordered proteins

Jonathan Huihui

Kingshuk Ghosh

Abstract

I. INTRODUCTION

II. METHOD

A. Coil globule transition for ensemble average distance between an arbitrary pair of amino acid residues

B. Protocol for all-atom simulation

III. RESULT

A. Theory captures intra-chain distance profile similar to all-atom simulation

1. Application to synthetic polyampholytes

FIG. 1.

2. Application to intrinsically disordered protein

FIG. 2.

B. Phosphorylation hotspots can significantly alter distance profiles

FIG. 3.

C. Coil-globule transition is residue-pair specific

FIG. 4.

IV. CONCLUSION

SUPPLEMENTARY MATERIAL

ACKNOWLEDGMENTS

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

An analytical theory to describe sequence-specific inter-residue distance profiles for polyampholytes and intrinsically disordered proteins

Jonathan Huihui

Kingshuk Ghosh

Abstract

I. INTRODUCTION

II. METHOD

A. Coil globule transition for ensemble average distance between an arbitrary pair of amino acid residues

B. Protocol for all-atom simulation

III. RESULT

A. Theory captures intra-chain distance profile similar to all-atom simulation

1. Application to synthetic polyampholytes

FIG. 1.

2. Application to intrinsically disordered protein

FIG. 2.

B. Phosphorylation hotspots can significantly alter distance profiles

FIG. 3.

C. Coil-globule transition is residue-pair specific

FIG. 4.

IV. CONCLUSION

SUPPLEMENTARY MATERIAL

ACKNOWLEDGMENTS

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases