Significance
Frustration is a central concept in the study of protein folding kinetics. While it has been proposed that most proteins are minimally frustrated, recent experiments have found systems with surprisingly slow reconfiguration dynamics, suggesting an underlying landscape that is frustrated or “rough.” Our results suggest that protein size and topology play a role in determining how sensitive a protein is to frustration.
Keywords: frustration, protein folding, misfolding, protein dynamics
Abstract
The presence of conflicting interactions, or frustration, determines how fast biomolecules can explore their configurational landscapes. Recent experiments have provided cases of systems with slow reconfiguration dynamics, perhaps arising from frustration. While it is well known that protein folding speed and mechanism are strongly affected by the protein native structure, it is still unknown how the response to frustration is modulated by the protein topology. We explore the effects of nonnative interactions in the reconfigurational and folding dynamics of proteins with different sizes and topologies. We find that structural correlations related to the folded state size and topology play an important role in determining the folding kinetics of proteins that otherwise have the same amount of nonnative interactions. In particular, we find that the reconfiguration dynamics of α-helical proteins are more susceptible to frustration than β-sheet proteins of the same size. Our results may explain recent experimental findings and suggest that attempts to measure the degree of frustration due to nonnative interactions might be more successful with α-helical proteins.
Protein folding has become an exemplar problem for the study of conformational changes in biomolecules. During the past couple of decades, significant theoretical (1) and computational (2) advances have shown that, even though the folding of a protein involves the complex and coordinated motion of many atoms, it can often be described by relatively simple models (3, 4). In general, the number of collective variables needed to describe large macromolecular rearrangements depends on the separation between fast and slow relaxation timescales, and the complexity of these systems often requires the use of computational methods to extract the relaxation timescales from simulation and understand their corresponding structural processes (5). However, energy landscape theory has shown that, under general physical assumptions (6), protein folding can be described as the slow equilibration between the folded and unfolded free energy basins while neglecting the fast relaxations.
The principle of minimal frustration, a central tenet of energy landscape theory, posits that evolution has crafted protein interactions to stabilize the native state while destabilizing competing misfolded states (6). This picture has inspired “structure-based” coarse-grain models where the energy landscape is funneled toward the native state by design, by including the native structural geometry in the construction of the model and by discarding potentially conflicting “nonnative” interactions (7). These models have had success illuminating how folding rates, folding mechanisms, and functional motions depend on native topology (8, 9). However, proteins with the same native structure can still have starkly different folding behavior (10), suggesting that sequence-specific effects are necessary for a more complete picture.
While the principle of minimal frustration has become a central paradigm for molecular biophysics, recent single-molecule experiments on several aggregation-prone proteins, such as the prion protein (PrP) (11) and the intrinsically disordered protein -synuclein (12), have provided examples where frustration may be significant. In particular, a careful analysis of the extension statistics from force spectroscopy has revealed multiple “off-pathway” misfolded states in monomeric PrP (11) and several marginally stable states in monomeric -synuclein (12). These proteins provide concrete examples where nonnative interactions may be an important source of frustration. While some progress has been made in understanding the role of nonnative interactions in specific cases (e.g., ref. 13), it has been unclear if there are more general conclusions to be made. Experimental evidence has shown that the effect of nonnative interactions varies between different proteins and experimental conditions. For example, studies have found that the unfolded state tends to collapse in a protein-specific manner at low concentrations of denaturant (14) or high concentrations of salt (15), with more collapsed states showing slower rates of folding or reconfiguration.
As an attempt to garner general principles, we investigate the effect of nonnative interactions on the (mis)folding of proteins with different topologies. We show that rates of folding and reconfiguration depend on the strength of nonnative interactions in a protein-specific way. In particular, we find that -helical proteins have more compact misfolded ensembles than -sheet proteins of a comparable size and, consequently, have slower reconfiguration dynamics. We further connect these differences to the underlying energy landscape. Our results provide some general insight into the role of frustration in protein (mis)folding and why some proteins appear more frustrated than others.
Results
Nonnative Heterogeneity and Folding Dynamics.
The role of nonnative interactions can be quantified in terms of a simple diffusional model of folding. If folding is much slower than forming nonnative structure, the dynamics can be modeled as diffusion along a one-dimensional reaction coordinate (16, 17) with the folding rate given by Kramers theory (18):
[1] |
where is the thermal energy, is free energy barrier to folding, and is the rate prefactor. The prefactor describes the elementary rate of conformational change (19), and it is sensitive to the structure of the energy landscape. According to the diffusion model, the prefactor in Eq. 1 is given by ref. 18:
[2] |
where () is the curvature of the unfolded well (transition barrier) along the free energy profile, , and is the diffusion coefficient. Theory predicts that frustration increases energy fluctuations on the folding landscape, , and that averaging over these fluctuations gives the diffusion coefficient a strong temperature dependence, (16). The diffusion model has been fruitfully applied to simulations (20) and to extract from experiments (21, 22). Interestingly, experiments on several aggregation-prone proteins, such as the PrP (11, 23), the intrinsically disordered protein -synuclein (12), and the engineered protein D (21), have found diffusion coefficients much smaller than other globular proteins, indicating that these proteins may have frustrated landscapes. Connecting measurements of to properties of the energy landscape () can help illuminate the degree of frustration for real proteins.
Here we take a statistical approach to modeling the effects of nonnative interactions. Starting with a structure-based model (7), we add nonnative interactions whose strengths are assigned from a zero-mean Gaussian distribution with standard deviation b (24). As such, each nonnative interaction is equally likely to be attractive or repulsive (see Materials and Methods and SI Appendix for details). The parameter increases the nonnative heterogeneity and, as a result, increases the potential for competing misfolded states (i.e., frustration). For example, of nonnative interactions are stronger than native interactions when (see SI Appendix, Fig. S1).
We simulate many independently sampled nonnative parameter sets for each value of and average the resulting observables (e.g., folding time) across the different parameter sets. Data points and error bars (see Figs. 1, 3, and 5) show the average and SD, respectively, over all parameter sets at each (unless otherwise indicated). Averaging over parameter sets reveals that many important structural and kinetic properties depend generically on the level of nonnative heterogeneity, through . In addition, the SD across parameter sets shows that some observables are more sensitive than others to the specific set of nonnative interactions and that the differences parameter sets increases at higher degrees of frustration (see SI Appendix, Fig. S4).
A statistical approach is motivated by the fact that, at the level of our coarse-grained representation (one bead per residue), nonnative interactions could have different physical and chemical origins, and they could be modulated by denaturant, salt, mutations, etc. While this level of coarse-graining does not allow us to comment on proteins with a specific sequence of amino acids, we can investigate the general principles underlying the differences in the level of frustration observed between different protein topologies and between different sequences for the same topology.
To determine how the effects of nonnative interactions depend on native structure, we have selected 10 proteins with different sizes and topologies (Table 1). Several of our proteins were chosen because they have exhibited signs of frustration in other studies, such as the aggregation-prone PrP (23) and the engineered protein D (A3D) (21) [the topology of protein A3D is also similar to one of the Spectrin proteins (25), on which frustration has been experimentally observed]. Our set also includes the ribosomal protein S6 and two of its circular permutants [ and (26)]. S6 and its circular permutants are included because they have the same native structure but different contact order, which we found to be a useful metric in understanding our findings.
Table 1.
Name | PDB | N | M | ACO | , % | , % |
Lambda | 1R69 | 63 | 155 | 17.14 | 54 | 0 |
A3D | 2A3D | 73 | 151 | 18.68 | 75 | 0 |
LysM | 1E0G | 48 | 117 | 18.69 | 38 | 17 |
1imq | 1IMQ | 85 | 219 | 22.39 | 64 | 0 |
SH3 | 1FMK | 58 | 164 | 22.67 | 5 | 41 |
2akk | 2AKK | 74 | 213 | 24.41 | 0 | 46 |
1RIS | 95 | 263 | 27.53 | 30 | 47 | |
PrP | 1QLX | 104 | 266 | 27.61 | 58 | 4 |
1RIS | 95 | 264 | 31.57 | 30 | 47 | |
1RIS | 95 | 263 | 37.37 | 30 | 47 |
ACO, absolute contact order; M, number of native contacts; N, number of residues; PDB codes, Protein Data Bank codes.
Absolute Contact Order (ACO) characterizes differences in native structure (27) and is defined as the average sequence separation between native contacts: , where is the sequence separation between residues that make a native contact and is the number of native contacts. ACO captures aspects of both size and topology; ACO increases with size, and proteins have larger ACO than proteins of the same size.
We seek to characterize the folding and reconfiguration dynamics as a function of nonnative heterogeneity . To do so, we project the dynamics onto the fraction of native contacts . By combining Eqs. 1 and 2, we can recast the folding time in the diffusion model as
[3] |
where remaining factors have been grouped into . Eq. 3 shows that the folding time as a function of nonnative heterogeneity, , normalized by its value in the pure structure-based model at , , depends on the change in free energy barrier and diffusion coefficient.
We compare the terms of Eq. 3 estimated from simulation. The folding time is calculated as the average dwell time in the unfolded state (SI Appendix, Fig. S1, Left), and the free energy barrier is estimated from the free energy profile (SI Appendix, Fig. S1, Right). While Kramers’ theory technically requires the diffusion coefficient at the barrier top in Eq. 2, here we report the diffusion coefficient in the unfolded state: , where and is the reconfigurational timescale in the unfolded state, which can be estimated as the decay time of the autocorrelation function (28), . We report because it may be more relevant for comparing with energy landscape theory (29) and experiments that probe transient misfolding (30). If has a weak dependence on , as some previous studies have found for structure-based models (31), then is an acceptable approximation for in Eq. 2.
We find that folding time decreases for some range of nonnative heterogeneity for all proteins except Lambda and LysM (Fig. 1A), which have two of the three lowest ACO values. Previous work suggests that the impact of nonnative interactions on the free energy barrier and diffusion coefficient can have potentially competing effects on the folding time (24, 32). Consistent with previous studies, we find that, for all proteins studied, there is a range of nonnative heterogeneity that lowers the barrier to fold (Fig. 1B). The protein-specific behavior of indicates that the folding time is not solely determined by the free energy barrier height but depends significantly on the diffusion coefficient , and changes in are strongly protein-dependent.
The unfolded state diffusion coefficient displays two regimes for all proteins (Fig. 1C): a regime where is relatively insensitive to , and a regime where decreases dramatically with . We can empirically define the crossover between these regimes as the amount of nonnative heterogeneity, , needed to decrease the diffusion coefficient such that (see Fig. 1C). We find that correlates strongly with ACO (Fig. 2 red points) but is less correlated with the number of residues and uncorrelated with relative contact order (see SI Appendix, Table S1 and Fig. S3). Interestingly, Fig. 2 shows that the behavior of versus ACO is linear at small values and then appears to saturate at high values of ACO. The saturation can be explained by considering that ACO increases with the protein size, while is an intensive variable and is expected to converge to a finite value in the thermodynamic limit (32).
This result implies that size and topology both play a role in setting the reconfiguration rate in proteins with frustration. In particular, for proteins with the same number of residues, the ones with lower ACO (i.e., -helical proteins) are more sensitive to frustration. This has important implications for how we understand the connection between intermolecular interactions and molecular motion.
Recent single-molecule experiments have been interested in the transition path time (SI Appendix, Fig. S1, Middle) as a way to measure the diffusion coefficient, because only depends weakly on the free energy barrier in the diffusion model (33), , where is Euler’s constant and the equation is exact only in the large barrier limit . We find that calculated from simulation (Fig. 1D) increases at large and qualitatively resembles the behavior of for all proteins, except A3D. It is worth noting that A3D has the smallest folding barrier of all proteins studied (), and the assumptions of the diffusion model underlying the equation above may not apply.
We expect that our findings can be tested by experiments that measure to determine the reconfigurational diffusion coefficient . In particular, we expect that proteins with larger contact order require more frustration (i.e., a higher fraction of strongly attractive nonnative interactions) to slow their diffusion compared with proteins with lower contact order.
Collapse and Nonnative Structural Motifs.
We investigate the structural properties of the unfolded state to understand how nonnative interactions can have a protein-specific effect on the reconfiguration dynamics. We find several interesting differences between how proteins with different topologies respond to frustration. The first difference arises in the collapse of the unfolded state. The second difference arises in the patterns of nonnative contacts, with -helical and -sheet topologies displaying different “motifs” in nonnative contact formation.
The central feature of disordered polymer states, such as the unfolded state in protein folding, is their overall size, which is given by their radius of gyration or their degree of collapse . The radius of gyration of the unfolded state (Fig. 3A) scales with protein size at and decreases monotonically with increasing nonnative heterogeneity. To compare from different proteins on the same scale, we define the degree of collapse as , where is the radius of gyration of the unfrustrated coil and is the radius of gyration of the completely collapsed chain (details in SI Appendix). The degree of collapse (Fig. 3B) depends on frustration in a similar way to the diffusion coefficient discussed above (Fig. 1C). For example, proteins become more compact than proteins of a similar size. The degree of collapse has a sigmoidal dependence on nonnative heterogeneity, and the midpoints of the sigmoid functions for each protein also strongly correlate with ACO (Fig. 2 blue points). The same saturation behavior at high values of ACO observed for above is also present for .
Contact maps show that, even when a significant amount of nonnative heterogeneity is present, the unfolded state is largely unstructured on average (Fig. 4, Upper Triangles), with some partially formed native contacts (blue) but very little nonnative structure (red). However, even if the unfolded state has very little average structure, it is not featureless. Patterns emerge in how structure forms transiently, and these patterns are revealed by the fluctuations in contact formation. Contact fluctuations, given by the mean-squared variation of contacts where , are largest for contacts that form and break most often and, therefore, reveal preferences in how structure forms transiently.
Contact fluctuations in the unfolded state (Fig. 4, Lower Triangles) show that native topology qualitatively changes how proteins form transient nonnative structure. In particular, contact fluctuations show that proteins (Fig. 4, Left) form more nonnative contacts close to each other in sequence than proteins (Fig. 4, Right). In addition, the proteins form more nonnative contacts in between secondary structural elements, while proteins form nonnative contacts that are more localized near native -sheets. This suggests that the coupling of secondary structure formation with nonnative interactions may lead to different motifs in the misfolded ensembles of and proteins.
Probing the Underlying Energy Landscape.
Energy landscape theory links protein folding kinetics to the statistical properties of the energy landscape (6). At the folding temperature, the distribution of energies is bimodal (Fig. 5A orange), with the lower (higher) energy peak corresponding to the folded (unfolded) state. At a constant temperature, these peaks are significantly broadened from vibrations around the minima on the landscape. By performing energy minimization of configurations sampled along the trajectory, we get a clearer picture of the “inherent structure” of the energy landscape (Fig. 5A blue) (34) (details in SI Appendix).
We perform inherent structure analysis on a subset of our proteins and calculate two important properties of their landscapes: the depth of the folded state minimum relative to the unfolded ensemble, , and the breadth of the energy distribution within the unfolded state, (Fig. 5A). The ratio of these energy scales describes the degree of frustration on the landscape: When approaches 1, misfolded minima are just as deep as the native minimum (35).
Interestingly, we find that (Fig. 5B) has protein-specific dependence on the amount of frustration similar to what was noted above for the diffusion coefficient and degree of collapse: The amount of nonnative heterogeneity needed to increase the degree of frustration on the landscape by a given amount also strongly correlates with ACO (Fig. 2, green points). Again, the saturation behavior at high values of ACO noted above is observed. This means that for the same amount of nonnative heterogeneity, size and topology modulate the depth of the unfolded state minima. This result provides an energy landscape basis for interpreting the results of previous sections.
The degree of frustration on the energy landscape can alternatively be quantified by comparing the folding temperature to the glass temperature , where is the temperature where the system would get stuck in a single misfolded state. The glass temperature can be calculated by the distribution of inherent structures , where is the entropy of the unfolded state minima (6, 35). The ratio of temperatures displays qualitative similar dependence on size and topology (SI Appendix, Fig. S5). Theoretical estimates of for proteins span a range, from (36) to (35). Our findings suggest there is a range of nonnative heterogeneity allowable for proteins to be within the theoretical range (gray rectangle in SI Appendix, Fig. S5). Notably, proteins with larger ACO appear to have a wider range of nonnative heterogeneity permissible.
It must be noted that our estimates appear to increase beyond the theoretical bound of . This is most likely an artifact of approximations made in the inherent structure analysis that underestimate the magnitude of .
Discussion
We have found that the impact of nonnative interactions on protein folding and reconfiguration dynamics depends systematically on native state size and topology. In particular, we have found that the amount of nonnative heterogeneity required to produce signs of a frustrated landscape (e.g., slower reconfiguration, collapse, etc.), which we have called , has a simple dependence on the ACO (Fig. 2). In particular, is linear at low ACO and then saturates when ACO 25. This saturation is expected because is an intensive variable that should converge to a finite value in the thermodynamic limit (and ACO increases with size) (32). This means that protein topologies in the finite-size regime (ACO < 25) are more sensitive to nonnative interactions. It interesting to note that the PrP protein, which appears frustrated in the experiment (11), has a large enough ACO () to fall in the plateau region of Fig. 2, which would suggest that this topology is generally less sensitive to nonnative interactions. However, it should be noted that the PrP structure has several long stretches lacking secondary structure. As our model considers the effect of nonnative interactions on the tendency to form native secondary structure, we expect the results may differ for proteins that are partially intrinsically unstructured.
Our model suggests that the coupling between nonnative interactions and protein topology could result in different misfolding behavior for and proteins. In particular, proteins have slower reconfiguration times because nonnative interactions induce the unfolded state to become more collapsed, creating deeper misfolded minima on the folding landscape. Coupling between secondary structure and nonnative interactions might explain why the latter have been posited to be important in -helical proteins, such as Im9 (15), the Spectrin proteins R16/R17 (25), and protein A3D (21). Indeed, A3D is one of the protein topologies that was found to be most sensitive to the effect of frustration in our study, in agreement with experimental observation. A study of the salt-induced collapse of S6 also found native-like secondary structure content (37), indicating a general link between secondary structure and the effect of nonnative interactions.
Interestingly, a recent study using a native-centric polymer model found that, in the absence of nonnative interactions, the native contact map of proteins promotes collapse while folding (38). As our study focuses on the coupling of secondary structure with nonnative interactions, such results do not conflict with our findings.
We also find that the variation between parameter sets tends to increase with the onset of the frustrated regime. For example, some quantities, such as the folding time, show large variations between parameter sets at large levels of nonnative heterogeneity (SI Appendix, Fig. S4). This means that particularly attractive or repulsive nonnative interactions, or clusters of such interactions, could have a large influence on these quantities. This could explain, for example, why a small number of mutations in the Spectrin proteins can result in large differences in folding time (25). Interestingly, the radius of gyration of the unfolded state is relatively insensitive to the parameter set, as shown by the small error bars in Fig. 3A.
Although there is no experimental equivalent to our parameter , experiments can modulate nonnative interactions by varying solution conditions and measure the degree of collapse, secondary structure formation, and reconfiguration rate to connect with our results. Our results could be tested by studying circular permutations of the same protein under solution conditions that modulate collapse (e.g., salt, denaturant). If, for example, salt concentration is analogous to our parameter , then the different circular permutants may show a turnover in their folding rate at different concentrations of salt (e.g., see figure 3 of ref. 39). Alternatively, our findings could be tested by measuring the transition path time under different pH conditions, as was done for the protein D (21).
Materials and Methods
Simulation Model.
We simulate a set of 10 proteins that span a range of sizes and topologies, described in Table 1. Our structure-based simulation model with nonnative interactions is similar to what was used in previous work (24) and is described in SI Appendix.
Inherent Structure Analysis.
Performing energy minimization of configurations sampled in a constant temperature simulation maps each frame to its inherent structure minimum on the landscape (34). Following previous applications, we perform inherent structure analysis on a subsample of frames from trajectories at the folding temperature. Details are provided in SI Appendix.
Supplementary Material
Acknowledgments
We are grateful to members of the C.C. research group for inspiring discussions. This work was supported by National Science Foundation Grants CHE-1265929 and CHE-1738990 and Welch Foundation Grant C-1570. T.A.B. was supported by the Center for Theoretical Biological Physics sponsored by National Science Foundation Grant PHY-1427654. Simulations have been performed on the computer clusters of the Center for Research Computing at Rice University, supported in part by the Big-Data Private-Cloud Research Cyberinfrastructure MRI award (NSF Grant CNS-1338099).
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1801406115/-/DCSupplemental.
References
- 1.Bryngelson JD, Onuchic JN, Socci ND, Wolynes PG. Funnels, pathways, and the energy landscape of protein folding: A synthesis. Proteins Struct Funct Genet. 1995;21:167–195. doi: 10.1002/prot.340210302. [DOI] [PubMed] [Google Scholar]
- 2.Lindorff-Larsen K, Piana S, Dror RO, Shaw DE. How fast-folding proteins fold. Science. 2011;334:517–520. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
- 3.Clementi C. Coarse-grained models of protein folding: Toy models or predictive tools? Curr Opin Struct Biol. 2008;18:10–15. doi: 10.1016/j.sbi.2007.10.005. [DOI] [PubMed] [Google Scholar]
- 4.Best RB, Hummer G, Eaton WA. Native contacts determine protein folding mechanisms in atomistic simulations. Proc Natl Acad Sci USA. 2013;110:17874–17879. doi: 10.1073/pnas.1311599110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Noé F, Clementi C. Collective variables for the study of long-time kinetics from molecular trajectories: Theory and methods. Curr Opin Struct Biol. 2017;43:141–147. doi: 10.1016/j.sbi.2017.02.006. [DOI] [PubMed] [Google Scholar]
- 6.Bryngelson JD, Wolynes PG. Spin glasses and the statistical mechanics of protein folding. Proc Natl Acad Sci USA. 1987;84:7524–7528. doi: 10.1073/pnas.84.21.7524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Clementi C, Nymeyer H, Onuchic JN. Topological and energetic factors: What determines the structural details of the transition state ensemble and “en-route” intermediates for protein folding? An investigation for small globular proteins. J Mol Biol. 2000;298:937–953. doi: 10.1006/jmbi.2000.3693. [DOI] [PubMed] [Google Scholar]
- 8.Koga N, Takada S. Roles of native topology and chain-length scaling in protein folding: A simulation study with a go-like model. J Mol Biol. 2001;313:171–180. doi: 10.1006/jmbi.2001.5037. [DOI] [PubMed] [Google Scholar]
- 9.Gosavi S, Whitford PC, Jennings PA, Onuchic JN. Extracting function from a -trefoil folding motif. Proc Natl Acad Sci USA. 2008;105:10384–10389. doi: 10.1073/pnas.0801343105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Nickson AA, Clarke J. What lessons can be learned from studying the folding of homologous proteins? Methods. 2010;52:38–50. doi: 10.1016/j.ymeth.2010.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Yu H, et al. Direct observation of multiple misfolding pathways in a single prion protein molecule. Proc Natl Acad Sci USA. 2012;109:5283–5288. doi: 10.1073/pnas.1107736109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Solanki A, Neupane K, Woodside MT. Single-molecule force spectroscopy of rapidly fluctuating, marginally stable structures in the intrinsically disordered protein -synuclein. Phys Rev Lett. 2014;112:158103. doi: 10.1103/PhysRevLett.112.158103. [DOI] [PubMed] [Google Scholar]
- 13.Zarrine-Afsar A, et al. Theoretical and experimental demonstration of the importance of specific nonnative interactions in protein folding. Proc Natl Acad Sci USA. 2008;105:9999–10004. doi: 10.1073/pnas.0801874105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hofmann H, et al. Polymer scaling laws of unfolded and intrinsically disordered proteins quantified with single-molecule spectroscopy. Proc Natl Acad Sci USA. 2012;109:16155–16160. doi: 10.1073/pnas.1207719109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Morton VL, Friel CT, Allen LR, Paci E, Radford SE. The effect of increasing the stability of non-native interactions on the folding landscape of the bacterial immunity protein Im9. J Mol Biol. 2007;371:554–568. doi: 10.1016/j.jmb.2007.05.010. [DOI] [PubMed] [Google Scholar]
- 16.Bryngelson JD, Wolynes PG. Intermediates and barrier crossing in a random energy model (with applications to protein folding) J Phys Chem. 1989;93:6902–6915. [Google Scholar]
- 17.Socci ND, Onuchic JN, Wolynes PG. Diffusive dynamics of the reaction coordinate for protein folding funnels. J Chem Phys. 1996;104:5860–5868. [Google Scholar]
- 18.Kramers HA. Brownian motion in a field of force and the diffusion model of chemical reactions. Physica. 1940;7:284–304. [Google Scholar]
- 19.Kubelka J, Hofrichter J, Eaton WA. The protein folding ‘speed limit’. Curr Opin Struct Biol. 2004;14:76–88. doi: 10.1016/j.sbi.2004.01.013. [DOI] [PubMed] [Google Scholar]
- 20.Zheng W, Best RB. Reduction of all-atom protein folding dynamics to one-dimensional diffusion. J Phys Chem B. 2015;119:15247–15255. doi: 10.1021/acs.jpcb.5b09741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Chung HS, Piana-Agostinetti S, Shaw DE, Eaton WA. Structural origin of slow diffusion in protein folding. Science. 2015;349:1504–1510. doi: 10.1126/science.aab1369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Neupane K, Manuel AP, Woodside MT. Protein folding trajectories can be described quantitatively by one-dimensional diffusion over measured energy landscapes. Nat Phys. 2016;12:700–703. [Google Scholar]
- 23.Yu H, et al. Protein misfolding occurs by slow diffusion across multiple barriers in a rough energy landscape. Proc Natl Acad Sci USA. 2015;112:8308–8313. doi: 10.1073/pnas.1419197112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Clementi C, Plotkin SS. The effects of nonnative interactions on protein folding rates: Theory and simulation. Protein Sci. 2004;13:1750–1766. doi: 10.1110/ps.03580104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wensley BGG, Kwa LGG. Separating the effects of internal friction and transition state energy to explain the slow, frustrated folding of spectrin domains. Proc Natl Acad Sci USA. 2012;109:17795–17799. doi: 10.1073/pnas.1201793109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Haglund E, Lindberg MO, Oliveberg M. Changes of protein folding pathways by circular permutation. Overlapping nuclei promote global cooperativity. J Biol Chem. 2008;283:27904–27915. doi: 10.1074/jbc.M801776200. [DOI] [PubMed] [Google Scholar]
- 27.Plaxco KW, Simons KT, Baker D. Contact order, transition state placement and the refolding rates of single domain proteins. J Mol Biol. 1998;277:985–994. doi: 10.1006/jmbi.1998.1645. [DOI] [PubMed] [Google Scholar]
- 28.Hummer G. Position-dependent diffusion coefficients and free energies from Bayesian analysis of equilibrium and replica molecular dynamics simulations. New J Phys. 2005;7:34. [Google Scholar]
- 29.Wang J, Plotkin SS, Wolynes PG. Configurational diffusion on a locally connected correlated energy landscape; application to finite, random heteropolymers. J Phys. 1997;7:395–421. [Google Scholar]
- 30.Neupane K, Solanki A, Sosova I, Belov M, Woodside MT. Diverse metastable structures formed by small oligomers of -synuclein probed by force spectroscopy. PLoS One. 2014;9:e86495. doi: 10.1371/journal.pone.0086495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Best RB, Hummer G. Coordinate-dependent diffusion in protein folding. Proc Natl Acad Sci USA. 2010;107:1088–1093. doi: 10.1073/pnas.0910390107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Plotkin SS. Speeding protein folding beyond the Gō model: How a little frustration sometimes helps. Proteins Struct Funct Genet. 2001;45:337–345. doi: 10.1002/prot.1154. [DOI] [PubMed] [Google Scholar]
- 33.Chung HS, Louis JM, Eaton WA. Experimental determination of upper bound for transition path times in protein folding from single-molecule photon-by-photon trajectories. Proc Natl Acad Sci USA. 2009;106:11837–11844. doi: 10.1073/pnas.0901178106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Stillinger FH, Weber TA. Hidden structure in liquids. Phys Rev A. 1982;25:978–989. [Google Scholar]
- 35.Onuchic JN, Wolynes PG, Luthey-Schulten Z, Socci ND. Toward an outline of the topography of a realistic protein-folding funnel. Proc Natl Acad Sci USA. 1995;92:3626–3630. doi: 10.1073/pnas.92.8.3626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kaya H, Chan HS. Polymer principles of protein calorimetric two state cooperativity. Proteins Struct Funct Genet. 2000;40:637–661. doi: 10.1002/1097-0134(20000901)40:4<637::aid-prot80>3.0.co;2-4. [DOI] [PubMed] [Google Scholar]
- 37.Otzen DE, Oliveberg M. Salt-induced detour through compact regions of the protein folding landscape. Proc Natl Acad Sci USA. 1999;96:11746–11751. doi: 10.1073/pnas.96.21.11746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Samanta HS, et al. Protein collapse is encoded in the folded state architecture. Soft Matter. 2017;13:3622–3638. doi: 10.1039/c7sm00074j. [DOI] [PubMed] [Google Scholar]
- 39.Kurnik M, Hedberg L, Danielsson J, Oliveberg M. Folding without charges. Proc Natl Acad Sci USA. 2012;109:5705–5710. doi: 10.1073/pnas.1118640109. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.