Abstract
The bacterial colicin-immunity proteins Im7 and Im9 fold by different mechanisms. Experimentally, at pH 7.0 and 10°C, Im7 folds in a three-state manner via an intermediate but Im9 folding is two-state-like. Accordingly, Im7 exhibits a chevron rollover, whereas the chevron arm for Im9 folding is linear. Here we address the biophysical basis of their different behaviors by using native-centric models with and without additional transferrable, sequence-dependent energies. The Im7 chevron rollover is not captured by either a pure native-centric model or a model augmented by nonnative hydrophobic interactions with a uniform strength irrespective of residue type. By contrast, a more realistic nonnative interaction scheme that accounts for the difference in hydrophobicity among residues leads simultaneously to a chevron rollover for Im7 and an essentially linear folding chevron arm for Im9. Hydrophobic residues identified by published experiments to be involved in nonnative interactions during Im7 folding are found to participate in the strongest nonnative contacts in this model. Thus our observations support the experimental perspective that the Im7 folding intermediate is largely underpinned by nonnative interactions involving large hydrophobics. Our simulation suggests further that nonnative effects in Im7 are facilitated by a lower local native contact density relative to that of Im9. In a one-dimensional diffusion picture of Im7 folding with a coordinate- and stability-dependent diffusion coefficient, a significant chevron rollover is consistent with a diffusion coefficient that depends strongly on native stability at the conformational position of the folding intermediate.
Author Summary
In order to fold correctly, a globular protein must avoid being trapped in wrong, i.e., nonnative conformations. Thus a biophysical account of how attractive nonnative interactions are bypassed by some amino acid sequences but not others is key to deciphering protein structure and function. We examine two closely related bacterial immunity proteins, Im7 and Im9, that are experimentally known to fold very differently: Whereas Im9 folds directly, Im7 folds through a mispacked conformational intermediate. A simple model we developed accounts for their intriguingly different folding kinetics in terms of a balance between the density of native-promoting contacts and the hydrophobicity of local amino acid sequences. This emergent principle is extensible to other biomolecular recognition processes.
Introduction
The study of proteins that fold in an apparent two-state-like manner [1] has led to tremendous advances in protein folding biophysics [2, 3]. In line with the consistency [4] and minimal frustration [5] principles, the energy landscapes of these proteins may be pictured as smooth funnels with little ruggedness [6–8]. However, the consistency between local and nonlocal interactions is never perfect [4]. Frustration exists [5] in biomolecules and can sometimes serve important biological functions [9]. It is physically intuitive that energetically favorable nonnative interactions can occur [10]. Through improved experimental techniques, nonnative interactions are now known to be more prevalent than previously appreciated [11, 12]. From a fundamental biophysical standpoint, a better understanding of the presence and absence of nonnative interactions is key to deciphering biomolecular recognition and to assessing our grasp of basic protein energetics [13].
As one of the earliest definitive examples of nonnative effects in single-domain proteins, the folding kinetics of bacterial immunity protein Im7 and its homolog Im9 are well characterized [14, 15]. Despite their very similar native structures (Fig 1A and 1B), a large body of experimental work demonstrates that Im7 folds via an intermediate stabilized by nonnative contacts, whereas Im9 folding is essentially two-state [16–22]. The relative simplicity of the Im7/Im9 system makes it well suited for an informative case study. Unlike some of the larger proteins (number of residues n ≳ 100) such as cytochrome c [23] and ribonuclease A [24] that fold in a more complex manner [25], Im7 and Im9 folding is not complicated by a heme or disulfide bonds. Indeed, in view of many single-domain proteins that can fold with no apparent nonnative effects, the nonnative interactions in Im7 are likely a consequence of functional constraints [26, 27]. It is noteworthy in this connection that the biological functions of Im7 and Im9 are evolutionarily related by promiscuous interactions [28] that are probably underpinned by nonnative excited-state conformations [29].
Theory and computation have provided valuable insights into the Im7/Im9 system. Experimental Φ-values were used as constraints in conformational sampling to derive putative folding transition states of these proteins [27, 30]. The results suggest a functional origin for the nonnative interactions in Im7 [27]. In a separate effort, an equilibrium intermediate state was predicted for Im7 using a Gō-like model that assumes no favorable nonnative interaction [31]. However, although topological frustration and heterogeneity in contact density can, in some cases, lead to kinetic and equilibrium folding intermediates in the absence of favorable nonnative interactions [32–34], a subsequent computational study indicates that Im7 folding cannot be explained by native-centric interactions alone [26]. Instead, nonnative effects arising from “localized frustration” [35] was seen as necessary for rationalizing the peculiar behaviors of Im7 [26]. Consistent with this finding as well as with experiment, a sequential stabilization algorithm for predicting folding pathway was not able to reach the Im7 native structure because of kinetic trapping; but the same algorithm was successful in accessing the Im9 native structure [36].
A clear kinetic difference between Im7 and Im9 is manifested by their chevron plots of logarithmic folding and unfolding rates versus denaturant concentration [11]. The folding arm of the Im7 chevron at pH 7.0 and 10°C exhibits a significant rollover, whereas that of the Im9 does not [16, 18–20]. The present study addresses this basic distinction between Im7 and Im9 by direct simulations of folding/unfolding rates. Because each chevron plot is a summary of kinetic and thermodynamic data from a large set of folding/unfolding trajectories [13], it is not yet practical to employ all-atom molecular dynamics [37, 38] for the extensive computation necessary to produce model chevron plots. Moreover, current molecular dynamics forcefields are probably insufficient to rationalize highly cooperative folding behaviors such as that of Im9 because the forcefields tend to over-predict nonnative effects [38, 39]. Therefore, as an interim method that has been applied elsewhere [40–42], we develop tractable explicit-chain coarse-grained models [43] to tackle the chevron behaviors of Im7 and Im9, as these behaviors have not been addressed by direct simulations to date. We model nonnative effects using “hybrid” formulations that augment structure-based native-centric interactions with physics-based, sequence-dependent transferrable energy terms [44, 45]. Limitations notwithstanding, this approach has been accounting for an increasing number of experiments [13, 43, 46–51]. By comparing nonnative interactions that do [52] and do not [47, 49] reflect the variation of hydrophobicity among nonpolar residues [53], we find that the difference between the Im7 and Im9 chevrons is well rationalized by nonative interactions involving large hydrophobic residues.
The present study addresses also the relationship between conformational diffusion and folding intermediates. Diffusion is a useful concept [54–59] in describing physical effects of solvent and internal friction in folding [60–63]. Whereas mild internal friction likely arises from the particulate nature of the solvent [62] and correlated dihedral rotations along the polypeptide [63], elevated internal friction in compact chains [60] can emerge more generally from topological frustration [32, 33] and favorable nonnative interactions [10, 54]. As discussed below, the Im7 chevron rollover in our model is associated with a coordinate- and stability-dependent coefficient of one-dimensional diffusion, with a strong anticorrelation between native stability and diffusion rate at the position of the transiently trapped intermediate. Notably, the smallest diffusion coefficients at these trapped positions can be orders of magnitude smaller than those encountered in two-state-like folding.
Results/Discussion
We study three classes of coarse-grained chain models for Im7 and Im9. The rationale for the models—termed db (desolvation-barrier), db+hϕ, and db+MJhϕ—are detailed in Methods. The db models are purely native-centric, whereas the other two are hybrid models [13] that allow for sequence-dependent nonnative hydrophobic interactions based on either homogeneous or heterogeneous nonnative energies. The nonnative interaction strength between any pair of hydrophobic residues is taken to be the same in the homogeneous db+hϕ models. We compare this simple approach [49]—which does not account for effects of mutations among hydrophobic residues—to the heterogeneous db+MJhϕ models that utilize a Miyazawa-Jernigan (MJ) statistical potential [52] for the nonnative interactions among different hydrophobic pairs. To compare models on an equal footing, the average hydrophobic interaction strength in the heterogeneous db+MJhϕ models is chosen to be identical to that of the homogeneous db+hϕ models.
The difference between Im7 and Im9 folding is not apparent in the model proteins’ Q-dependent free energy profiles
The equilibrium free energy profiles computed near the models’ transition midpoints (Fig 1C and 1D) show no dramatic difference between Im7 and Im9. The free energy barrier is lower for Im7 than for Im9 in the db models (dotted curves); but this trend is reversed when the nonnative interactions in the db+hϕ and db+MJhϕ models are included (dashed and solid curves). Nonnative interactions in these models slow down folding for Im7 but speed up folding for Im9. Unlike previous Im7 models that exhibit a significantly populated equilibrium intermediate [26, 31] (which is apparently not quite in line with the success of two-state fitting of experimental equilibrium data for wildtype Im7 [22]), folding in our models is thermodynamically two-state as their folding/unfolding barriers under midpoint conditions are quite high (≳ 5k B T, where k B is Boltzmann constant and T is absolute temperature). The only hint of an Im7 folding intermediate is a small dip in the Im7 profiles (Fig 1C) at Q ≈ 0.85 that is absent in the Im9 profiles (Fig 1D). This feature by itself is no definitive evidence for complex folding kinetics, however. Under much stronger folding conditions, folding in our models becomes downhill [64]. Now even less difference is seen in Fig 2A between the equilibrium free energy profiles of Im7 and Im9 under zero-denaturant conditions (ΔG/k B T ≈ −10.5 and −12.0, corresponding to the experimental folding free energy of approximately −24.9 kJ mol−1 for Im7 [19] and −28.2 kJ mol−1 for Im9 [15] at pH 7.0 and 10°C; see Fig 2B).
The main difference between Im7 and Im9 chevron plots is rationalized by heterogeneous nonnative hydrophobic interactions
The approximate linearity of native stability versus interaction strength ϵ/T (Fig 2B) allows ΔG/k B T to be used as a proxy for denaturant concentration [42] in model chevron plots. Fig 3 shows that the folding-arm rollover and lack thereof, respectively, in the experimental chevrons for Im7 and Im9 at pH 7.0 and 10°C [16, 18–20] is captured by the db+MJhϕ but not the db and db+hϕ models, suggesting that the Im7 rollover arises from the strong nonnative interactions among the large hydrophobic residues as modeled by db+MJhϕ (S1 Fig). The difference between Im7 and Im9 folding cannot be explained by native interactions alone (as in db) or the more generic nonnative hydrophobic effects in db+hϕ. The chevron rollover in the db+MJhϕ Im7 model is a consequence of transient yet long-lived trapped conformations at Q ≈ 0.85 (Fig 4A), which do not appear in Im9 folding under similarly strong folding conditions (Fig 4B).
An overview of Im7 and Im9 folding kinetics is afforded by their kinetic profiles, which show a deep minimum at Q ≈ 0.85 for Im7 (Fig 4C) but not for Im9 (Fig 4D). Determined from folding trajectories alone [59], kinetic profiles are more useful than free energy profiles for identifying folding intermediates. The Im7/Im9 difference is not apparent from the free energy profiles because, on one hand, kinetic trapping is minimal when folding is only weakly favored (Fig 1C). On the other hand, when folding is strongly favored (Fig 2A), the contribution from folding trajectories to an equilibrium profile is overwhelmed by that from unfolding trajectories, viz., the resident time in the folded state is much longer than that in the unfolded and intermediate states. Consequently, the deep well at Q ≈ 0.85 in Fig 4C translates to merely a small kink around the same Q value in Fig 2A.
A physical account of the main difference between Im7 and Im9 folding kinetics is thus provided. Many mutational experiments are rationalized below as well. Because of their simplicity, however, db+MJhϕ models are limited in some respects. For instance, the midpoint folding rate of Im7 is ≈ 1/5 that of Im9 in this model (Fig 3C); but the experimental midpoint rate of Im7 (≈ 1.2–3.0 s−1 [19, 65]) is ≳ 40 times that of Im9 (≈ 0.03 s−1 [15, 20]). Moreover, whereas the folding and unfolding arms of the simulated chevron plots are quite symmetric around the transition midpoint, experimental unfolding rate exhibits a much weaker denaturant dependence than folding rate [16, 18–20]. These drawbacks are typical of topology-based models [42], which are more apt for folding than for unfolding kinetics [43, 66]. But this limitation has little bearing on our analysis of folding kinetics. Improved modeling likely requires special stability-enhancing energies that have minimal effects on folding kinetics [67, 68]; but such efforts are outside the scope of the present work.
Contact pattern of the computed Im7 folding intermediate is consistent with experimental inference
Structural properties of our simulated Im7 intermediate (Fig 5) are largely in agreement with mutagenesis experiments, which indicate that the intermediate is stabilized by nonnative interactions between Helix IV and the open end of the Helix I-Helix II hairpin involving residues L3, I7, F15, V16, L18, L19, L34, L37, L38, F41, V42, I68, and I72 [19]. Notably, 12 of these 13 residues are involved in the most populated 20 nonnative hydrophobic contacts (with > 80% probability of occurrence) in the Im7 intermediate simulated using db+MJhϕ (Fig 5A, upper triangle). The only exception is V42, for which the most probable nonnative contact V36–V42 has nonetheless a 73% occurrence probability in the simulated intermediate. Among the 20 most probable nonnative contacts in the simulated Im7 intermediate, three are between the N-terminal segment and Helix II [L3–V33 (94%), L3–L34 (85%), I7–V36 (92%)], eight are between Helices I and II [F15–L34 (92%), F15–V36 (99%), F15–L37 (97%), F15–L38 (85%), V16–L37 (92%), V16–L38 (80%), L19–L38 (80%), L18–L34 (96%)], four are between different residues in Helix II [V33–F41 (98%), L34–F41 (99%), V36–F41 (90%), V36–I44 (82%)], and five are between Helices II and IV [L37–V69 (99%), L37–I72 (96%), L38–I68 (99%), L38–V69 (99%), L38–I72 (91%)]. Helix III hardly contributes to the intermediate-stabilizing nonnative contacts in the model. The most likely nonnative contact in the intermediate ensemble that involves Helix III, L38–L53, has an occurrence probability of only 17%.
Our computed probabilities of contacts are in line with experiments indicating that Helices I and IV are fully formed but Helix II is partly formed in the Im7 intermediate [14]. In Fig 5A, intrahelical contacts between residues i, i + 4 are present but less probable for Helix II (residues 32 to 45) than for Helices I and IV (residues 12 to 26 and 65 to 78). Experiment indicates also that Helix III is absent [14] but it is present in our simulated Im7 intermediate. This limitation of the model is likely related to its simple treatment of native interactions. Nonetheless, in agreement with experiment, amino acid substitutions in Helix III result only in small changes in folding rate in the db+MJhϕ model (see below).
A snapshot of the simulated Im7 intermediate state is shown in Fig 5B wherein each of the highlighted nonnative contacts has ≥ 80% occurrence probability except M1–L18 (56%) in the Im7 intermediate ensemble (Fig 5C). All except one (V42) of the 13 residues identified by mutagenesis experiments (see above) to be stabilizing the Im7 intermediate are represented in the highlighted nonnative contacts. We have verified that structures very similar to the Cα intermediate conformation in Fig 5B are physically realizable by constructing a corresponding atomic structure [69] with added sidechains [70] (S2 Fig).
Our simulated Im7 kinetic intermediate is stabilized by nonnative interactions (S1 Fig). As such, it is diametrically different from the equilibrium intermediates simulated using purely native-centric models [31] with heterogeneous Gō energies [71]. Instead of being a product of nonnative effects, equilibrium intermediates in such Gō-like models arise from their reduced folding cooperativity [72], which can lead to three-state-like free energy profiles for Im7 and Im9 (S3 Fig); but such features are at odds with experiment.
Kinetic effects of Im7 mutations
Effects of select mutations in the db+MJhϕ model for Im7 are examined through their folding kinetic profiles [59] (Fig 6). Some mutations reduce the depth of the kinetic trap at Q ≈ 0.85 relative to that of the wildtype (WT) while others lead only to negligible changes. We compute also the rates of reaching the intermediate position at Q ≈ 0.85 and the folded state at Q = 0.98 from initially unfolded conformations. The former rate (≈ 3.9 × 10−7 for WT, in units of reciprocal number of time steps) varies little, whereas the latter overall folding rate (= 5.0 × 10−8 for WT) is sensitive to mutation. The overall folding rate correlates, albeit imperfectly, with the depth of the Q ≈ 0.85 minimum.
The general trend of variation of the simulated folding rates is largely in line with that of the experimental intermediate-to-native folding rates k in [19] or k IN [65] for the single mutants (= 238 s−1 for WT) in Fig 6A. For both simulation and experiment, folding rate remains essentially unchanged for three mutants (simulated rate in units of 10−8, experimental rate in s−1 [19, 65]): I54A (4.4, 200), I72V (5.0, 250), A77G (4.9, 235) [(5.0, 238) for WT]; and is speeded up for four mutants: F15A (30.7, 550), L34A (40.2, 1850), L37A (49.6, 450), L38A (31.1, 1600). Folding rate remains essentially unchanged experimentally but is speeded up in simulation for five mutants; nevertheless the simulated increase is less than that for mutants that fold faster in experiments: F15Y (19.3, 220), V16A (10.7, 220), V33A (19.7, 238), V36A (22.3, 190), F41Y (9.3, 186).
However, the present model cannot account for the dramatic experimental increase in folding rate and the disappearance of folding-arm rollover for F41L (k in = 5000 s−1 [19], ≈ 21 times of that of WT) because F and L have similar MJ energies [52]. For this mutant, the simulated rate 3.6 × 10−8 is smaller than that of WT. Even mutating F to a non-hydrophobic in the model cannot produce the experimental effect of F41L, viz., the simulated rate for the F41G mutant is 2.68 times that of WT but is far from sufficient. To account for the dramatic impact of F41L, future theoretical studies will need to pursue subtle effects beyond our simple treatment of hydrophobicity, perhaps by considering energetics specific to aromatic residues [13, 73].
Consistent with experiment [14], L53A/I54A has a negligible kinetic effect on Im7 in our model (Fig 6B), which is in line with the small experimental Φ-values of ≈ 0.03–0.16 and k in = 200 s−1 for L53 and I54 in Helix III [19]. In contrast, many double mutants with hydrophobicity-reducing substitutions in Helices I and II can dramatically destabilize the folding intermediate and thus speed up Im7 folding (Fig 6B). These predictions should be testable by future experiments. However, because mutations in our models change only the nonnative but not the native interactions, as it stands our approach cannot address mutations such as L18A/L19A/L37A that prevent Im7 folding [22].
Im7/Im9 difference in native contact density and hydrophobicity of Helix II
The three-state kinetics of Im7 is related to its hydrophobic composition. Im7 has 32 hydrophobic residues (17 with stronger and 15 with weaker hydrophobicities; Fig 1) whereas Im9 has 28 (15 and 13 in the two categories). In Helix II, Im7 has two more hydrophobics (V33, V42) and the stronger L38 instead of the weaker V37 in Im9. In Helix IV, Im7 has I72 instead of Im9’s V71. Notably, V33, L38, and V72 are involved in 10 of the 20 most probable nonnative contacts in the simulated Im7 intermediate listed above.
Im7 and Im9 have nearly equal numbers of native contacts involving Helices I and IV (54 and 50, respectively, for Im7 and 53 and 49 for Im9). But the number of native contacts involving Helix II is 52 for Im7 (residues 32 to 45) and 62 for Im9 (residues 30 to 44). The native contact density of Helix II is thus appreciably lower for Im7 (52/14 = 3.71) than for Im9 (62/15 = 4.13). With lower local native-centricity and higher local hydrophobicity (Fig 7), Im7’s Helix II—which contains two more hydrophobic residues than Im9’s as shown by the sequences at the bottom of Fig 7 (see also discussion above)— is more prone to nonnative contacts than Im9’s Helix II. Indeed, Helix II is involved in all of the 20 most probable nonnative contacts in the simulated Im7 intermediate.
We emphasize that the critical factor here is the local native contact density of Helix II but not necessarily the overall native contact density of the protein. Im7 has fewer native contacts than Im9 (154 versus 164) in our models; yet the simulated Im7 intermediate remains essentially unchanged even if the number of Im7 native contacts is increased to 161 by using Swiss-PdbViewer [69] to construct additional contacts in its less ordered N-terminal region. Moreover, the trend seen here is not limited to our specific definition of native contacts. To assess the robustness of our inference, we have also applied the CSU software, which employs detailed analysis of interatomic contacts and interface complementarity to determine native contacts [74, 75]. Under the CSU criterion, the total number of native contacts is very similar for Im7 and Im9 (177 and 180 respectively; see upper-left map in Fig 7). Nonetheless, similar to the observation above, the local density of CSU-defined native contacts of Helix II is also appreciably lower for Im7 (59/14 = 4.21) than for Im9 (67/15 = 4.47).
Experiments on Im9 have shown that V37L/V71I and V37L/E41V/V71I can lead to three-state folding [15, 21] and folding-arm rollover at pH 7.0 and 10°C [21]. Computationally (S4 Fig), these mutations deepen somewhat the shallow minimum at Q ≈ 0.85 in the Im9 kinetic profile (A and C of S4 Fig). But the effect is insufficient to account for experimental data, indicating that further effort is needed to better model the balance between native and nonnative interactions in Im9. For instance, if the native interaction strength of L33 (which acts as a “gatekeeper” [76]) in Helix II was reduced, much deeper Im9 kinetic traps would develop for V37L/V71I and V37L/E41V/V71I (B and D of S4 Fig). Although our present model does not address mutational effects on native interactions, this result indicates nonetheless that L33 mutations that reduce the native interaction strengths (e.g., by substituting it with a less hydrophobic residue) may lead to less cooperative folding of Im9. This suggested behavior should be testable by future experiments.
The above analysis of the interplay between local native contact density and hydrophobicity suggests that the different folding kinetics of wildtype Im7 and Im9 may also be seen in variants of the homogeneous db+hϕ model (K HP = 1 as defined in Methods) with stronger nonnative hydrophobic interaction strengths (K HP > 1). Consistent with this idea, S5 Fig shows that a signficant folding intermediate population starts to develop at K HP = 1.3 for Im7 but no corresponding folding intermediate is observed for Im9 at the same K HP. Two comments are in order here. On one hand, the result in S5 Fig from an alternate formulation of hydrophobicity reinforces our general notion that local native contact density and hydrophobicity are the main physical underpinnings for the Im7-Im9 kinetic difference. On the other hand, a strength of ≳ 1.3 for the homogeneous nonnative hydrophobic interaction is needed to achieve the desired Im7-Im9 difference, whereas the heterogeneous nonnative hydrophobic interaction strengths in the db+MJhϕ model that produce a similar effect average only to 1.0 (see Methods; note that even at K HP = 1.3, the minimum at Q ≈ 0.85 in (A of S5 Fig) is shallower than that in Fig 4C). Physically, K HP ≳ 1.3 is problematic because it implies that nonnative interaction strength is ≳ 30% higher than native interaction strength. For this reason and considering the obvious limitation of the homogeneous approach that it cannot address effects of mutations among hydrophobic residues, the more refined db+MJhϕ approach is adopted in the present study.
Conformational diffusion in Q is extremely coordinate- and stability-dependent in the presence of a significant kinetic trap
The Im7/Im9 system is instructive in elucidating nonnative effects and kinetic trapping in the diffusion picture of protein folding [54–59]. Conformational diffusion models with a coordinate and stability-dependent diffusion coefficient on a one-dimensional free energy profile were constructed for two-state-like [57] and downhill [58] folding; but corresponding modeling for folding with a significant chevron rollover has not been much explored. In this regard, it is noteworthy that the rollover in our Im7 db+MJhϕ model appears across only ≈ 8% variation in interaction strength (ϵ/k B T = 1.37 and 1.48, respectively, for midpoint and zero denaturant). In contrast, rollover-like features for two-state-like and downhill folders emerge over much wider ranges of interaction strength [58].
The restraining-potential method [56, 58] in Methods is used to compute Q- and ΔG-dependent autocorrelation function C Q(t) (Fig 8) and diffusion coefficient D(Q) (Fig 9). The restraining-potential method directly addresses the escape probability from a given Q. Rather than seeking a good fit by Bayesian analysis [55], we adopt this method to explore possible limits of the diffusion picture by testing the consistency between diffusive accounts of restrained and unrestrained chain kinetics.
The most notable Im7/Im9 difference presents itself around the Im7 kinetic trap at Q ≈ 0.8–0.9. Here a dramatic deepening of a dip in D(Q) with increasing native stability is seen for Im7 but not for Im9, whereas D(Q) for other Q-values is not very sensitive to ΔG (Fig 9). Achieving numerical convergence of the computed D(Q) in the Q ≈ 0.85 region of Im7 is difficult because of kinetic trapping. To delimit theoretical possibilities, we obtain lower and upper bounds of D(Q) for Im7 in this region, respectively, by initializing restrained runs from kinetically trapped and random conformations (Fig 9).
Im7 chevrons may now be computed in the diffusion model; but considerable variation ensues (shaded area in Fig 10A) because of numerical uncertainties. The rollover trend of the simulated Im7 chevron is among the predicted possibilities. However, when matched against explicit-chain kinetics, D(Q) is found to be underestimated by an overall factor of e 2.7 ≈ 15 (Fig 10), indicating that the method for computing D(Q) [56, 58] needs to be improved or that a one-dimensional diffusion perspective is of limited applicability here.
Despite these uncertainties, it is clear that a D(Q, ΔG) that decreases exponentially with ΔG at the trap position Q ≈ 0.85 (Fig 10B) is necessary to reproduce the folding-arm rollover for Im7 (Fig 10A, circles). The required variation of D(Q, ΔG) at this position, which spans two orders of magnitude, is reassuringly consistent with the lower bound estimated by initiating restrained runs from the kinetic trap. In the absence of such a strong dependence of D(Q, ΔG) on ΔG, the predicted folding arm would become essentially linear (top dashed line in Fig 10A). In the same vein and consistent with our explicit-chain results (Fig 3), no folding-arm rollover is produced by the diffusion model for Im9.
Concluding remarks
To recapitulate, our explicit-chain models account physically for the strikingly different folding kinetics of Im7 and Im9 in terms of prevalent nonnative interactions among large hydrophobic residues in Im7 but not in Im9. The proteins’ different experimental chevron behaviors are rationalized by our simulation. The same phenomenon may also be described by a one-dimensional diffusion process with a very small and strongly stability-dependent diffusion coefficient at the position of the Im7 kinetic trapped intermediate.
Our model interaction schemes are tentative [13, 46]. For instance, possible contributions to nonnative interactions from electrostatic [48, 51, 77] and aromatic [73] effects are not taken into account. Nonetheless, by comparing different modeling schemes as controls and contrasting Im7 and Im9 behaviors under the same scheme, we arrive at a physical picture that is largely in agreement with experiment. As observed experimentally [65, 78], Helices I and IV are essentially formed while Helix II is partially formed in our simulated Im7 intermediate. Kinetic effects of many mutations in our model are consistent with experiment, including those involving Helix III (Figs 3–7), demonstrating the versatility of the hydrid modeling approach to nonnative effects [13].
Several limitations of our model are noted. In particular, the short Helix III is present in our simulated Im7 intermediate but experimentally that is apparently not the case [65, 78]. To address this issue, more sophisticated treatments of local conformational propensity [36, 79] and sidechain effects [13, 42] are probably needed. Indeed, the rich repertoire of experiments on the Im7/Im9 system, such as those on pH [18, 21] and temperature [15] effects, offers ample data for testing extensions of our models.
Perhaps the most useful insight from the present effort is that the peculiar folding kinetics of Im7 vis-à-vis that of Im9 is closely related to their difference in the balance between local native contact density and hydrophobicity. This principle embodies a competition between native topology and nonnative interactions [49] and is likely applicable to protein dynamics and biomolecular processes in general. As such, it should be examined in detail and extended to other forms of nonnative interactions in future investigations.
Methods
Explicit-chain models
Three related Cα chain models for Im7 and Im9 are considered, namely the db, db+hϕ, and db+MJhϕ models. Common to these models is a set of native-centric interactions with desolvation barriers for each protein. Folding and unfolding kinetics is simulated by Langevin dynamics [80]. Desolvation barrier (db) is a robust feature in hydrophobic interactions [81] that tends to enhance folding cooperativity [40, 82]. Indeed, for some proteins such as ribosomal protein S6, Cα models with db lead to highly cooperative folding behaviors that are consistent with experiments [49] whereas models without db exhibit only weak folding cooperativity [76]. Here, following Ref. [80], the pairwise db energy is defined by a contact minimum well depth of ϵ = 1.0, a db height of ϵ db = 0.1ϵ, and a solvent-separated minimum well depth of ϵ ssm = 0.2ϵ.
The db model is purely native-centric with the total interaction potential, denoted here as E N, equal to V total in Ref. [80]. The same interaction strength is applied to all native-centric interactions. The native contact sets for Im7 and Im9 are constructed using the same criterion [80]. A pair of residues i, j belongs to the native contact set if at least one pair of their non-hydrogen atoms, one from each residue, are less than 4.5 Å apart in the Protein Data Bank (PDB) structure. The PDB Cα separation between i, j is denoted by . The total number of native contacts in the set, , is equal to 154 and 164, respectively, for Im7 and Im9 (Fig 7). We have explored using alternate “flavored” native-centric interaction strengths [72, 83] in accordance with the residue-dependent contact energies in Ref. [71] but, interestingly, the resultant models for Im7 and Im9 fail to fold cooperatively.
Homogeneous and heterogeneous nonnative interactions
Favorable nonnative interactions are included in db+hϕ and db+MJhϕ. Using a hybrid formulation [13, 43–51, 84–92], the total interaction potentials E T of these models are given by E T = E N + E HP, where is the sum of sequence-dependent nonnative contact energies over i, j that are both hydrophobic (hϕ), defined to be the eight amino acids A, V, L, I, M, W, F, and Y [47]. r ij is the Cα distance between i, j during simulation (1 ≤ i, j ≤ n, where the total number of residues n = 87 for Im7 and n = 86 for Im9); and σ hϕ = 5.0 Å. The nonnative hϕ interactions in the db+hϕ model are homogeneous with κ ij = −1.0 irrespective of hydrophobic residue type and K HP = 1.0 as in Refs. [47, 49], whereas the nonnative hϕ interactions in the db+MJhϕ model are heterogeneous, with κ ij = Δϵ ij where Δϵ ij is the contact energy in Table V of Miyazawa and Jernigan [52] and K HP = 1.8 such that the average hϕ energy K HP⟨κ ij⟩ over all possible 8 × 7/2 + 8 = 36 hϕ pairs is equal to −1.0 (the K HP κ ij values range from −0.216 for A-A to −1.584 for F-F). This average hϕ interaction energy of −1.0 is essentially maintained by the average hϕ energies over all possible nonnative hϕ contact pairs (defined below) for the Im7 and Im9 sequences in the db+MJhϕ models. Those average energies are equal to −0.994 for wildtype Im7 (412 possible nonnative hϕ pairs) and −0.998 for wildtype Im9 (306 possible nonnative hϕ pairs).
MJ-type potentials [52, 71] are derived from the statistics of native contacts in the protein structure database. Because protein native structures do not contain many significantly unfavorable contacts, MJ potentials are not expected to describe repulsive interactions between amino acid residues with accuracy [93]. Nonetheless, they do provide a crude account of the relative strengths of favorable physical interactions between residues. In fact, it has long been known that MJ potentials for nonpolar pairs reflect the combined hydrophobicities of the two contacting residues [94, 95], as is illustrated by the good correlation (Fig 3b of [96]) between a set of MJ energies [71] and the experimental octanol-water transfer free energies of amino acids [53]. In this regard, although there are considerable variations among experimental hydrophobicity scales for all twenty types of amino acids [96, 97], a higher degree of consistency among different experimental scales is seen for the hydrophobic (nonpolar and non-charged) amino acids themselves [98]. Taking these considerations together, we view MJ energies between nonpolar residues as a reasonable coarse-grained model of the underlying physics of hydrophobicity. Thus, they should be applicable to favorable nonnative hydrophobic interactions and represent a more refined model than those with homogeneous hydrophobic interaction strengths.
In our models, two hydrophobic residues i, j that are not in contact in the native PDB structure are considered to be in a nonnative contact if |i − j| > 3 and r ij < 8.0 Å (Fig 5). The total number of nonnative contacts in a conformation is denoted by n HP (S1 Fig). All non-bonded energies in our models are temperature independent and pairwise additive. For simplicity, temperature dependence and nonadditity of interactions [99–102] are not considered here.
Free energy profiles, kinetic profiles, and chevron plots
We consider a residue pair i, j in the native contact set to be in contact during the folding/unfolding simulation when Å; i.e., when r ij is not larger than that of the db and therefore within the attractive well of the contact minimum. We use Q, the number of native contacts divided by , as progress variable of folding [103, 104]. A free energy profile in units of k B T corresponds to −ln P(Q) where P(Q) is the normalized conformational population distribution as a function of Q (Figs 1 and 2). As was introduced before [59], the kinetic folding path (FP) profiles, −ln P FP|s(Q), is the negative logarithm of average fractional resident time P FP as a function of Q along folding trajectories wherein the notation “|s” indicates that equal weight is assigned to every folding trajectory [59] (Figs 4 and 6). Chevron plots are simulated using change in native stability by varying the simulation temperature as a proxy for variation of denaturant concentration [105] (Fig 3). With a low Langevin viscosity, this approach is computationally efficient and is appropriate for our present purpose because the trend (shape) of model chevron rollover is apparently unaffected by variation over a wide range of Langevin viscosities [101]. Recent tests also indicate that the model chevron plots thus obtained are very similar to those simulated using more sophisticated coarse-grained sidechain models that account for denaturant dependence by experimental transfer free energies [13, 41, 42].
Nonexplicit-chain models of one-dimensional conformational diffusion
We use the restraining (bias) potential method [55, 56, 58, 106] to estimate Q-dependent diffusion coefficients at different simulation temperatures (hence different free energies of folding ΔG). Following Ref. [55], a Q-dependent diffusion coefficient is given by
(1) |
for a given ΔG. Here the variance , where ⟨…⟩t0 denotes time averaging over different t 0 values; the correlation time where the autocorrelation function [54, 107]
(2) |
is Q-dependent. The var(Q) and C Q(t) for determining D(Q, ΔG) (Figs 8 and 9) are estimated using bias potentials , where the prescription in Ref. [108] is used to treat Q as a continuum variable. Unless specified otherwise, K Q = 0.1ϵ is used with 64 different Q 0 values for Im7 or Im9. This choice of K Q is similar to that in Ref. [56] and serves to localize conformational fluctuations to Gaussian-like distributions (S6 Fig). D(Q) is quite insensitive to lowering K Q by at least a factor of two (S7 Fig).
This method for determining D(Q) is exact if the diffusion process is truly governed by the Smoluchowski equation and K Q is sufficiently large so that variation of free energy G(Q) within a constrained conformational ensemble is essentially linear in Q. The applicability of this approach to protein folding, however, hinges on whether the dynamics along Q is Markovian to a good approximation [55]. For protein folding, D(Q) estimated by the restraining-potential method does exhibit a weak dependence on K Q [58]. We have checked our restraining-potential methodology against that of Xu et al. [58] by using a K Q value that produces conformational distributions similar to theirs. Our D(Q) for chymotrypsin inhibitor 2 at transition midpoint matches well with theirs (S8 Fig). D(Q) can also be estimated using Bayesian analysis [55]. For one dipeptide system, the Bayesian-estimated D(Q) was verified to be very similar to that from restraining potentials [55]. Here we use only the restraining-potential method.
Once D(Q) is in place for a given native stability (free energy of folding) ΔG, the folding MFPT in our nonexplicit-chain models of one-dimensional conformational diffusion (Fig 10) is computed using the discretized form [59]
(3) |
of the general formula [54, 109]
(4) |
where P eq(Q) is the normalized equilibrium conformational population at Q. The boundary values Q N and Q D for the native (folded) and denatured (unfolded) states are the same as that in our explicit-chain simulations (Fig 2). Alternatively, MFPT can be computed using Kawasaki Monte Carlo (MC) dynamics by generalizing the formulation in Ref. [59] to coordinate-dependent D(Q), viz., the transition probability from Q to Q + δQ is now given by where δG ≡ G(Q + δQ) − G(Q) and A is a constant. The above geometric mean may also be replaced by the arithmetic mean [D(Q) + D(Q + δQ)]/2; the two means are equal in the limit of D(Q + δQ) − D(Q) → 0. In addition to MFPT, Kawasaki MC is useful also for providing distribution of folding times and other properties of individual trajectories.
Supporting Information
Acknowledgments
H.S.C. is grateful to Sheena Radford for several very helpful discussions on the Im7/Im9 system since 2004. We also benefited from interactions with colleagues in the NSF Protein Folding Consortium as well as the assistance and generous allotments of computational resources by the colleagues at SciNet of Compute Canada.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
The authors received the Canadian Institutes of Health Research Grant No. MOP-84281 (http://www.cihr-irsc.gc.ca/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Jackson SE, Fersht AR. Folding of chymotrypsin inhibitor 2. 2. Influence of proline isomerization on the folding kinetics and thermodynamic characterization of the transition state of folding. Biochemistry. 1991; 30:10436–10443. 10.1021/bi00107a011 [DOI] [PubMed] [Google Scholar]
- 2. Baker D. A surprising simplicity to protein folding. Nature. 2000; 405:39–42. 10.1038/35011000 [DOI] [PubMed] [Google Scholar]
- 3. Sosnick TR, Barrick D. The folding of single domain proteins—Have we reached a consensus? Curr Opin Struct Biol. 2011; 21:12–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Gō N. Theoretical studies of protein folding. Annu Rev Biophys Bioeng. 1983; 12:183–210. 10.1146/annurev.bb.12.060183.001151 [DOI] [PubMed] [Google Scholar]
- 5. Bryngelson JD, Wolynes PG. Spin glasses and the statistical mechanics of protein folding. Proc Natl Acad Sci USA. 1987; 84:7524–7528. 10.1073/pnas.84.21.7524 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Leopold PE, Montal M, Onuchic JN. Protein folding funnels: A kinetic approach to the sequence-structure relationship. Proc Natl Acad Sci USA. 1992; 89:8721–8725. 10.1073/pnas.89.18.8721 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Bryngelson JD, Onuchic JN, Socci ND, Wolynes PG. Funnels, pathways, and the energy landscape of protein folding: A synthesis. Proteins. 1995; 21:167–195. 10.1002/prot.340210302 [DOI] [PubMed] [Google Scholar]
- 8. Dill KA, Chan HS. From Levinthal to pathways to funnels. Nature Struct Biol. 1997; 4:10–19. 10.1038/nsb0197-10 [DOI] [PubMed] [Google Scholar]
- 9. Ferreiro DU, Komives EA, Wolynes PG. Frustration in biomolecules. Q Rev Biophys. 2014; 47:285–363. 10.1017/S0033583514000092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Chan HS, Dill KA. Transition states and folding dynamics of proteins and heteropolymers. J Chem Phys. 1994; 100:9238–9257. 10.1063/1.466677 [DOI] [Google Scholar]
- 11. Brockwell DJ, Radford SE. Intermediates: Ubiquitous species on folding energy landscapes? Curr Opin Struct Biol. 2007; 17:30–37. 10.1016/j.sbi.2007.01.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Bartlett AI, Radford SE. An expanding arsenal of experimental methods yields an explosion of insights into protein folding mechanisms. Nature Struct Mol Biol. 2009; 16:582–588. 10.1038/nsmb.1592 [DOI] [PubMed] [Google Scholar]
- 13. Chen T, Song J, Chan HS. Theoretical perspectives on nonnative interactions and intrinsic disorder in protein folding and binding. Curr Opin Struct Biol. 2015; 30:32–42. 10.1016/j.sbi.2014.12.002 [DOI] [PubMed] [Google Scholar]
- 14. Whittaker SB-M, Spence GR, Grossmann JG, Radford SE, Moore GR. NMR analysis of the conformational properties of the trapped on-pathway folding intermediate of the bacterial immunity protein Im7. J Mol Biol. 2007; 366:1001–1015. 10.1016/j.jmb.2006.11.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Morton VL, Friel CL, Allen LR, Paci E, Radford SE. The effect of increasing the stability of non-native interactions on the folding landscape of the bacterial immunity protein Im9. J Mol Biol. 2007; 371:554–568. 10.1016/j.jmb.2007.05.010 [DOI] [PubMed] [Google Scholar]
- 16. Ferguson N, Capaldi AP, James R, Kleanthous C, Radford SE. Rapid folding with and without populated intermediates in the homologous four-helix proteins Im7 and Im9. J Mol Biol. 1999; 286:1597–1608. 10.1006/jmbi.1998.2548 [DOI] [PubMed] [Google Scholar]
- 17. Capaldi AP, Shastry MCR, Kleanthous C, Roder H, Radford SE. Ultrarapid mixing experiments reveal that Im7 folds via an on-pathway intermediate. Nature Struct Biol. 2001; 8:68–72. 10.1038/83074 [DOI] [PubMed] [Google Scholar]
- 18. Gorski SA, Capaldi AP, Kleanthous C, Radford SE. Acidic conditions stabilise intermediates populated during the folding of Im7 and Im9. J Mol Biol. 2001; 312:849–863. 10.1006/jmbi.2001.5001 [DOI] [PubMed] [Google Scholar]
- 19. Capaldi AP, Kleanthous C, Radford SE. Im7 folding mechanism: misfolding on a path to the native state. Nature Struct Biol. 2002; 9:209–216. [DOI] [PubMed] [Google Scholar]
- 20. Friel CT, Capaldi A, Radford SE. Structural analysis of the rate-limiting transition states in the folding of Im7 and Im9: Similarities and differences in the folding of homologous proteins. J Mol Biol. 2003; 326:293–305. 10.1016/S0022-2836(02)01249-4 [DOI] [PubMed] [Google Scholar]
- 21. Friel CT, Beddard GS, Radford SE. Switching two-state to three-state kinetics in the helical protein Im9 via the optimisation of stabilising non-native interactions by design. J Mol Biol. 2004; 342:261–273. 10.1016/j.jmb.2004.06.076 [DOI] [PubMed] [Google Scholar]
- 22. Pashley CL, Morgan GJ, Kalverda AP, Thompson GS, Kleanthous C, Radford SE. Conformational properties of the unfolded state of Im7 in nondenaturing conditions. J Mol Biol. 2012; 416:300–318. 10.1016/j.jmb.2011.12.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Ikai A, Tanford C. Kinetic evidence for incorrectly folded intermediate states in refolding of denatured proteins. Nature. 1971; 230:100–102. 10.1038/230100a0 [DOI] [PubMed] [Google Scholar]
- 24. Tsong TY, Baldwin RL, Elson EL. The sequential unfolding of ribonuclease A: Detection of a fast initial phase in the kinetics of folding. Proc Natl Acad Sci USA. 1971; 68:2712–2715. 10.1073/pnas.68.11.2712 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Kim PS, Baldwin RL. Intermediates in the folding reactions of small proteins. Annu Rev Biochem. 1990; 59:631–660. 10.1146/annurev.bi.59.070190.003215 [DOI] [PubMed] [Google Scholar]
- 26. Sutto L, Lätzer J, Hegler JA, Ferreiro DU, Wolynes PG. Consequences of localized frustration for the folding mechanism of the IM7 protein. Proc Natl Acad Sci USA. 2007; 104:19825–19830. 10.1073/pnas.0709922104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Friel CT, Smith DA, Vendruscolo M, Gsponer J, Radford SE. The mechanism of folding of Im7 reveals competition between functional and kinetic evolutionary constraints. Nature Struct Mol Biol. 2009; 16:318–324. 10.1038/nsmb.1562 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Levin KB, Dym O, Albeck S, Magdassi S, Keeble AH, Kleanthous C, et al. Following evolutionary paths to protein-protein interactions with high affinity and selectivity. Nature Struct Mol Biol. 2009; 16:1049–1055. 10.1038/nsmb.1670 [DOI] [PubMed] [Google Scholar]
- 29. Sikosek T, Chan HS. Biophysics of protein evolution and evolutionary protein biophysics. J R Soc Interface. 2014; 11:20140419 10.1098/rsif.2014.0419 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Paci E, Friel CT, Lindorff-Larsen K, Radford SE, Karplus M, Vendruscolo M. Comparison of the transition state ensembles for folding of Im7 and Im9 determined using all-atom molecular dynamics simulations with ϕ value restraints. Proteins. 2004; 54:513–525. 10.1002/prot.10595 [DOI] [PubMed] [Google Scholar]
- 31. Karanicolas J, Brooks CL. Improved Gō-like models demonstrate the robustness of protein folding mechanisms towards non-native interactions. J Mol Biol. 2003; 334:309–325. 10.1016/j.jmb.2003.09.047 [DOI] [PubMed] [Google Scholar]
- 32. Kaya H, Chan HS. Origins of chevron rollovers in non-two-state protein folding kinetics. Phys Rev Lett. 2003; 90:258104 10.1103/PhysRevLett.90.258104 [DOI] [PubMed] [Google Scholar]
- 33. Gosavi S, Chavez LL, Jennings PA, Onuchic JN. Topological frustration and the folding of interleukin-1β . J Mol Biol. 2006; 357:986–996. 10.1016/j.jmb.2005.11.074 [DOI] [PubMed] [Google Scholar]
- 34. Zhang Z, Chan HS. Native topology of the designed protein Top7 is not conducive to cooperative folding. Biophys J. 2009; 96:L25–L27. 10.1016/j.bpj.2008.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Ferreiro DU, Hegler JA, Komives EA, Wolynes PG. Localizing frustration in native proteins and protein assemblies. Proc Natl Acad Sci USA. 2007; 104:19819–19824. 10.1073/pnas.0709915104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Adhikari AN, Freed KF, Sosnick TR. De novo prediction of protein folding pathways and structure using the principle of sequential stabilization. Proc Natl Acad Sci USA 2012;109:17442–17447. 10.1073/pnas.1209000109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Lindorff-Larsen K, Piana S, Dror RO, Shaw DE. How fast-folding proteins fold. Science. 2011; 334:517–520. 10.1126/science.1208351 [DOI] [PubMed] [Google Scholar]
- 38. Piana S, Klepeis JL, Shaw DE. Assessing the accuracy of physical models used in protein-folding simulations: Quantitative evidence from long molecular dynamics simulations. Curr Opin Struct Biol. 2014; 24:98–105. 10.1016/j.sbi.2013.12.006 [DOI] [PubMed] [Google Scholar]
- 39. Skinner JJ, Yu W, Gichana EK, Baxa MC, Hinshaw J, Freed KF, et al. Benchmarking all-atom simulations using hydrogen exchange. Proc Natl Acad Sci USA. 2014; 111:15975–15980. 10.1073/pnas.1404213111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Kaya H, Chan HS. Solvation effects and driving forces for protein thermodynamic and kinetic cooperativity: How adequate is native-centric topological modeling? J Mol Biol. 2003; 326:911–931 [Corrigendum: J Mol Biol. 2004; 337:1069–1070]. 10.1016/S0022-2836(02)01434-1 [DOI] [PubMed] [Google Scholar]
- 41. Liu Z, Reddy G, O’Brien EP, Thirumalai D. Collapse kinetics and chevron plots from simulations of denaturant-dependent folding of globular proteins. Proc Natl Acad Sci USA. 2011; 108:7787–7792. 10.1073/pnas.1019500108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Chen T, Chan HS. Effects of desolvation barriers and sidechains on local-nonlocal coupling and chevron behaviors in coarse-grained models of protein folding. Phys Chem Chem Phys. 2014; 16:6460–6479. 10.1039/c3cp54866j [DOI] [PubMed] [Google Scholar]
- 43. Chan HS, Zhang Z, Wallin S, Liu Z. Cooperativity, local-nonlocal coupling, and nonnative interactions: Principles of protein folding from coarse-grained models. Annu Rev Phys Chem. 2011; 62:301–326. 10.1146/annurev-physchem-032210-103405 [DOI] [PubMed] [Google Scholar]
- 44. Shea J-E, Nochomovitz YD, Guo Z, Brooks CL. Exploring the space of protein folding Hamiltonians: The balance of forces in a minimalist beta-barrel model. J Chem Phys. 1998; 109:2895–2903. 10.1063/1.476842 [DOI] [Google Scholar]
- 45. Kaya H, Chan HS. Energetic components of cooperative protein folding. Phys Rev Lett. 2000; 85:4823–4826. 10.1103/PhysRevLett.85.4823 [DOI] [PubMed] [Google Scholar]
- 46. Yadahalli S, Rao VVHG, Gosavi S. Modeling non-native interactions in designed proteins. Israel J Chem. 2014; 54:1230–1240. 10.1002/ijch.201400035 [DOI] [Google Scholar]
- 47. Zarrine-Afsar A, Wallin S, Neculai AM, Neudecker P, Howell PL, Davidson AR, et al. Theoretical and experimental demonstration of the importance of specific nonnative interactions in protein folding. Proc Natl Acad Sci USA. 2008; 105:9999–10004. 10.1073/pnas.0801874105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Azia A, Levy Y. Nonnative electrostatic interactions can modulate protein folding: Molecular dynamics with a grain of salt. J Mol Biol. 2009; 393:527–542. 10.1016/j.jmb.2009.08.010 [DOI] [PubMed] [Google Scholar]
- 49. Zhang Z, Chan HS. Competition between native topology and nonnative interactions in simple and complex folding kinetics of natural and designed proteins. Proc Natl Acad Sci USA. 2010; 107:2920–2925. 10.1073/pnas.0911844107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Shental-Bechor D, Smith MTJ, MacKenzie D, Broom A, Marcovitz A, Ghashut F, et al. Nonnative interactions regulate folding and switching of myristoylated protein. Proc Natl Acad Sci USA. 2012; 109:17839–17844. 10.1073/pnas.1201803109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Zarrine-Afsar A, Zhang Z, Schweiker KL, Makhatadze GI, Davidson AR, Chan HS. Kinetic consequences of native state optimization of surface-exposed electrostatic interactions in the Fyn SH3 domain. Proteins. 2012; 80:858–870. 10.1002/prot.23243 [DOI] [PubMed] [Google Scholar]
- 52. Miyazawa S, Jernigan RL. Self-consistent estimation of inter-residue protein contact energies based on an equilibrium mixture approximation of residues. Proteins. 1999; 34:49–68. 10.1002/(SICI)1097-0134(19990101)34:1%3C49::AID-PROT5%3E3.0.CO;2-L [DOI] [PubMed] [Google Scholar]
- 53. Fauchère J-L, Pliška V. Hydrophobic parameters Π of amino-acid side chains from the partitioning of N-acetyl-amino-acid amides. Eur J Med Chem-Chem Therm. 1983; 18:369–375. [Google Scholar]
- 54. Socci ND, Onuchic JN, Wolynes PG. Diffusive dynamics of the reaction coordinate for protein folding funnels. J Chem Phys. 1996; 104:5860–5868. 10.1063/1.471317 [DOI] [Google Scholar]
- 55. Hummer G. Position-dependent diffusion coefficients and free energies from Bayesian analysis of equilibrium and replica molecular dynamics simulation. New J Phys. 2005; 7:34 10.1088/1367-2630/7/1/034 [DOI] [Google Scholar]
- 56. Chahine J, Oliveira RJ, Leite VBP, Wang J. Configuration-dependent diffusion can shift the kinetic transition state and barrier height of protein folding. Proc Natl Acad Sci USA. 2007; 104:14646–14651. 10.1073/pnas.0606506104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Best RB, Hummer G. Coordinate-dependent diffusion in protein folding. Proc Natl Acad Sci USA. 2010; 107:1088–1093. 10.1073/pnas.0910390107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Xu W, Lai Z, Oliveira RJ, Leite VBP, Wang J. Configuration-dependent diffusion dynamics of downhill and two-state protein folding. J Phys Chem B. 2012; 116:5152–5159. 10.1021/jp212132v [DOI] [PubMed] [Google Scholar]
- 59. Zhang Z, Chan HS. Transition paths, diffusive processes, and preequilibria of protein folding. Proc Natl Acad Sci USA. 2012; 109:20919–20924. 10.1073/pnas.1209891109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Waldauer SA, Bakajin O, Lapidus LJ. Extremely slow intramolecular diffusion in unfolded protein L. Proc Natl Acad Sci USA. 2010; 107:13713–13717. 10.1073/pnas.1005415107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Soranno A, Buchli B, Nettels D, Cheng RR, Müller-Späth S, Pfeil SH, et al. Quantifying internal friction in unfolded and intrinsically disordered proteins with single-molecule spectroscopy. Proc Natl Acad Sci USA. 2012; 109:17800–17806. 10.1073/pnas.1117368109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. de Sancho D, Sirur A, Best RB. Molecular origins of internal friction effects on protein-folding rates. Nature Communications. 2014; 5:4307 10.1038/ncomms5307 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Echeverria I, Makarov DE, Papoian GA. Concerted dihedral rotations give rise to internal friction in unfolded proteins. J Am Chem Soc. 2014; 136:8708–8713. 10.1021/ja503069k [DOI] [PubMed] [Google Scholar]
- 64. Knott M, Chan HS. Criteria for downhill protein folding: Calorimetry, chevron plot, kinetic relaxation, and single-molecule radius of gyration in chain models with subdued degrees of cooperativity. Proteins. 2006; 65:373–391. 10.1002/prot.21066 [DOI] [PubMed] [Google Scholar]
- 65. Bartlett AI, Radford SE. Desolvation and development of specific hydrophobic core packing during Im7 folding. J Mol Biol. 2010; 396:1329–1345. 10.1016/j.jmb.2009.12.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Plaxco KW, Simons KT, Baker D. Contact order, transition state placement and the refolding rates of single domain proteins. J Mol Biol. 1998; 277:985–994. 10.1006/jmbi.1998.1645 [DOI] [PubMed] [Google Scholar]
- 67. Kaya H, Chan HS. Simple two-state protein folding kinetics requires near-Levinthal thermodynamic cooperativity. Proteins. 2003; 52:510–523. 10.1002/prot.10506 [DOI] [PubMed] [Google Scholar]
- 68. Kaya H, Liu Z, Chan HS. Chevron behavior and isostable enthalpic barriers in protein folding: Successes and limitations of simple Gō-like modeling. Biophys J. 2005; 89:520–535. 10.1529/biophysj.104.057471 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Guex N, Peitsch MC, Schwede T. Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: A historical perspective. Electrophoresis. 2009; 30:S162–S173. 10.1002/elps.200900140 [DOI] [PubMed] [Google Scholar]
- 70. Holm L, Sander C. Database algorithm for generating protein backbone and side-chain co-ordinates from a Cα trace. Application to model building and detection of co-ordinate errors. J Mol Biol. 1991; 218:183–194. 10.1016/0022-2836(91)90883-8 [DOI] [PubMed] [Google Scholar]
- 71. Miyazawa S, Jernigan RL. Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol. 1996; 256:623–644. 10.1006/jmbi.1996.0114 [DOI] [PubMed] [Google Scholar]
- 72. Cho SS, Levy Y, Wolynes PG. Quantitative criteria for native energetic heterogeneity influences in the prediction of protein folding kinetics. Proc Natl Acad Sci USA. 2009; 106:434–439. 10.1073/pnas.0810218105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Meyer EA, Castellano RK, Diederich F. Interactions with aromatic rings in chemical and biological recognition. Angew Chem Int Ed. 2003; 42:1210–1250. 10.1002/anie.200390319 [DOI] [PubMed] [Google Scholar]
- 74. Sobolev V, Wade RC, Vriend G, Edelman M. Molecular docking using surface complementarity. Proteins. 1996; 25:120–129. 10.1002/(SICI)1097-0134(199605)25:1%3C120::AID-PROT10%3E3.3.CO;2-1 [DOI] [PubMed] [Google Scholar]
- 75. Sobolev V, Sorokine A, Prilusky J, Abola EE, Edelman M. Automated analysis of interatomic contacts in proteins. Bioinformatics. 1999; 15:327–332. 10.1093/bioinformatics/15.4.327 [DOI] [PubMed] [Google Scholar]
- 76. Stoycheva AD, Brooks CL, Onuchic JN. Gatekeepers in the ribosomal protein S6: Thermodynamics, kinetics, and folding pathways revealed by a minimalist protein model. J Mol Biol. 2004; 340:571–585. 10.1016/j.jmb.2004.04.073 [DOI] [PubMed] [Google Scholar]
- 77. Tzul FO, Schweiker KL, Makhatadze GI. Modulation of folding energy landscape by charge-charge interactions: Linking experiments with computational modeling. Proc Natl Acad Sci USA. 2015; 112:E259–E266. 10.1073/pnas.1410424112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Knowling SE, Figueiredo AM, Whittaker SB-M, Moore GR, Radford SE. Amino acid insertion reveals a necessary three-helical intermediate in the folding pathway of the colicin E7 immunity protein Im7. J Mol Biol. 2009; 392:1074–1086. 10.1016/j.jmb.2009.07.085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Muñoz V, Serrano L. Development of the multiple sequence approximation within the AGADIR model of α-helix formation: Comparison with the Zimm-Bragg and Lifson-Roig formalisms. Biopolymers. 1997; 41:495–509. 10.1002/(SICI)1097-0282(19970415)41:5%3C495::AID-BIP2%3E3.0.CO;2-H [DOI] [PubMed] [Google Scholar]
- 80. Liu Z, Chan HS. Solvation and desolvation effects in protein folding: Native flexibility, kinetic cooperativity, and enthalpic barriers under isostability conditions. Phys Biol. 2005; 2:S75–S85. 10.1088/1478-3975/2/4/S01 [DOI] [PubMed] [Google Scholar]
- 81. Pratt LR, Chandler D. Theory of the hydrophobic effect. J Chem Phys. 1997; 67:3683–3704. 10.1063/1.435308 [DOI] [Google Scholar]
- 82. Cheung MS, García AE, Onuchic JN. Protein folding mediated by solvation: Water expulsion and formation of the hydrophobic core occur after the structural collapse. Proc Natl Acad Sci USA. 2002; 99:685–690. 10.1073/pnas.022387699 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Hills RD, Brooks CL. Insights from coarse-grained Gō models for protein folding and dynamics. Int J Mol Sci. 2009; 10:889–905. 10.3390/ijms10030889 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. Treptow WL, Barbosa MAA, Garcia LG, de Araújo AFP. Non-native interactions, effective contact order, and protein folding: A mutational investigation with the energetically frustrated hydrophobic model. Proteins. 2002; 49:167–180. 10.1002/prot.10166 [DOI] [PubMed] [Google Scholar]
- 85. Fan K, Wang J, Wang W. Folding of lattice protein chains with modified Gō potential. Eur Phys J B. 2002; 30:381–391. 10.1140/epjb/e2002-00393-4 [DOI] [Google Scholar]
- 86. Clementi C, Plotkin SS. The effects of nonnative interactions on protein folding rates: Theory and simulation. Protein Sci. 2004; 13:1750–1766. 10.1110/ps.03580104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87. Pogorelov TV, Luthey-Schulten Z. Variations in the fast folding rates of the λ-repressor: A hybrid molecular dynamics study. Biophys J. 2004; 87:207–214. 10.1529/biophysj.104.042861 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88. Faisca PFN, Nunes A, Travasso RDM, Shakhnovich EL Non-native interactions play an effective role in protein folding dynamics. Protein Sci. 2010; 19:2196–2209. 10.1002/pro.498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89. Enciso M, Rey A. Improvement of structure-based potentials for protein folding by native and nonnative hydrogen bonds. Biophys J. 2011; 101:1474–1482. 10.1016/j.bpj.2011.08.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Best RB. How well does a funneled energy landscape capture the folding mechanism of spectrin domains? J Phys Chem B. 2013; 117:13235–13244. 10.1021/jp403305a [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91. Truong HH, Kim BL, Schafer NP, Wolynes PG. Funneling and frustration in the energy landscapes of some designed and simplified proteins. J Chem Phys. 2013; 139:121908 10.1063/1.4813504 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92. Sun Y, Ming D. Energetic frustrations in protein folding at residue resolution: A homologous simulation study of Im9 proteins. PLoS ONE. 2014; 9:e87719 10.1371/journal.pone.0087719 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93. Chan HS, Kaya H, Shimizu S. Computational methods for protein folding: Scaling a hierarchy of complexities In: Jiang T, Xu Y, Zhang MQ, editors. Current Topics in Computational Molecular Biology. Cambridge, Massachusetts: The MIT Press; 2002. pp 403–447. [Google Scholar]
- 94. Godzik A, Kolinski A, Skolnick J. Are proteins ideal mixtures of amino acids—Analysis of energy parameter sets. Protein Sci. 1995; 4:2107–2117. 10.1002/pro.5560041016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95. Chan HS. Folding alphabets. Nature Struct Biol. 1999; 6:994–996. 10.1038/14876 [DOI] [PubMed] [Google Scholar]
- 96.Chan, HS. Amino acid side-chain hydrophobicity. In: Wiley Online Library: eLS—Citable Reviews In The Life Sciences; 2002. DOI: 10.1038/npg.els.0003005 (http://onlinelibrary.wiley.com/doi/10.1038/npg.els.0003005/full).
- 97. DeVido DR, Dorsey JG, Chan HS, Dill KA. Oil/water partitioning has a different thermodynamic signature when the oil solvent chains are aligned than when they are amorphous. J Phys Chem B. 1998; 102:7272–7279. 10.1021/jp9807165 [DOI] [Google Scholar]
- 98. Karplus PA. Hydrophobicity regained. Protein Sci. 1997; 6:1302–1307. 10.1002/pro.5560060618 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99. Liu Z, Chan HS. Desolvation is a likely origin of robust enthalpic barriers to protein folding. J Mol Biol. 2005; 349:872–889. 10.1016/j.jmb.2005.03.084 [DOI] [PubMed] [Google Scholar]
- 100. Moghaddam MS, Shimizu S. Chan HS. Temperature dependence of three-body hydrophobic interactions: Potential of mean force, enthalpy, entropy, heat capacity, and nonadditivity. J Am Chem Soc. 2005; 127:303–316. 10.1021/ja040165y [DOI] [PubMed] [Google Scholar]
- 101. Badasyan A, Liu Z, Chan HS. Probing possible downhill folding: Native contact topology likely places a significant constraint on the folding cooperativity of proteins with ~ 40 residues. J Mol Biol. 2008; 384:512–530. 10.1016/j.jmb.2008.09.023 [DOI] [PubMed] [Google Scholar]
- 102. Dias CL, Chan HS. Pressure-dependent properties of elementary hydrophobic interactions: Ramifications for activation properties of protein folding. J Phys Chem B. 2014; 118:7488–7509. 10.1021/jp501935f [DOI] [PubMed] [Google Scholar]
- 103. Šali A, Shakhnovich E, Karplus M. Kinetics of protein folding—A lattice model study of the requirements for folding to the native state. J Mol Biol. 1994; 235:1614–1636. [DOI] [PubMed] [Google Scholar]
- 104. Cho SS, Levy Y, Wolynes PG. P versus Q: Structural reaction coordinates capture protein folding on smooth landscapes. Proc Natl Acad Sci USA. 2006; 103:586–591. 10.1073/pnas.0509768103 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105. Kaya H, Liu Z, Chan HS. Chevron behavior and isostable enthalpic barriers in protein folding: Successes and limitations of simple Gō-like modeling. Biophys J. 2005; 89:520–535. 10.1529/biophysj.104.057471 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106. Woolf TB, Roux B. Conformational flexibility of o-phosphorylcholine and o-phosphorylethanolamine: A molecular dynamics study of solvation effects. J Am Chem Soc. 1994; 116:5916–5926. 10.1021/ja00092a048 [DOI] [Google Scholar]
- 107. Hummer G, García AE, Garde S. Conformational diffusion and helix formation kinetics. Phys Rev Lett. 2000; 85:2637–2640. 10.1103/PhysRevLett.85.2637 [DOI] [PubMed] [Google Scholar]
- 108. Beharry AA, Chen T, Al-Abdul-Wahid MS, Samanta S, Davidov K, Sadovski O, et al. Quantitative analysis of the effects of photoswitchable distance constraints on the structure of a globular protein. Biochemistry. 2012; 51:6421–6431. 10.1021/bi300685a [DOI] [PubMed] [Google Scholar]
- 109. Pontryagin L, Andronov A, Vitt A. On the statistical treatment of dynamical systems. Zh Eksp Teor Fiz. 1933; 3:165–180. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.