Abstract
Protein aggregation, linked to many of diseases, is initiated when monomers access rogue conformations that are poised to form amyloid fibrils. We show, using simulations of src SH3 domain, that mechanical force enhances the population of the aggregation prone (N*) states, which are rarely populated under force free native conditions, but are encoded in the spectrum of native fluctuations. The folding phase diagrams of SH3 as a function of denaturant concentration ([C]), mechanical force (f), and temperature exhibit an apparent two-state behavior, without revealing the presence of the elusive N* states. Interestingly, the phase boundaries separating the folded and unfolded states at all [C] and f fall on a master curve, which can can be quantitatively described using an analogy to superconductors in a magnetic field. The free energy profiles as a function of the molecular extension (R), which are accessible in pulling experiments, (R), reveal the presence of a native-like N* with a disordered solvent-exposed amino terminal β-strand. The structure of the N* state is identical to that found in Fyn SH3 by NMR dispersion experiments. We show that the time scale for fibril formation can be estimated from the population of the N* state, determined by the free energy gap separating the native structure and the N* state, a finding that can be used to assess fibril forming tendencies of proteins. The structures of the N* state are used to show that oligomer formation and likely route to fibrils occur by a domain-swap mechanism in SH3 domain.
Keywords: phase diagram, protein denaturation, self-organized polymer model, protein aggregation, single molecule force spectroscopy
1. Introduction
The continuing effort to understand how proteins fold [1–5] is amply justified by the irrefutable link between misfolding[6, 7], aggregation, and a growing list of diseases[8, 9]. It is now firmly established that all proteins, regardless of their role in causing diseases, form amyloid fibrils under suitable conditions[9, 10]. In most instances, newly synthesized proteins do not aggregate, but rather fold, carry out the intended functions, and are subsequently degraded. The potential deleterious effects of interactions between misfolded structures leading to fibrils have made it urgent to understand the characteristics of proteins that harbor propensities to aggregate. Based on high resolution crystal structures of fibrils of small peptides, sequence-based methods have been introduced to identify motifs that harbor amyloidogenic tendencies[11–13]. In addition to sequence, it is likely that the structures sampled by the monomer under native conditions encode not only the structures in the fibril state, but also the rate of fibril formation. Thus, in order to decipher the aggregation-prone states, a complete structural characterization of not only the native state, but also of higher free energy excitations, that are amyloidogenic, is needed.
Since the cascade of events driving a monomer to a fibril depends both on the protein sequence and external conditions, there are multiple scenarios within the standard nucleation growth theory of protein aggregation. Regardless of the scenario, we theorize that protein aggregation is initiated if the aggregation prone ensemble of structures (denoted as the N* state) is transiently populated due to thermal fluctuations or denaturation stress[14–16]. Thus, it is likely that both the tendency of a protein to form aggregates, and, more interestingly, the rate of fibril formation are determined by the population of the N* state. It also implies that the excitations in the spectrum of monomer conformations themselves are harbingers of protein aggregation. Consequently, identifying and revealing the stability and structure of the N* state should yield quantitative insights into protein aggregation.
Because N* is typically an excitation around the lowest free energy state (for a vast majority of proteins with a possible exception of mammalian prions[14]), it is only sparsely populated, and thus is hard to detect experimentally. High structural and temporal resolutions are required to characterize structure and the extent of population of the N* state. Techniques like hydrogen exchange[17] and NMR spectroscopy[18] are successful in these endeavors. Here, we show that the elusive N* state can be more readily identified using single molecule force spectroscopy (SMFS)[19, 20], because application of mechanical force (f) can enhance the population of high free energy states and slow down the dynamics. The only accessible experimental observable in the SMFS is the time-dependent change in the end-to-end distance (R), from which the f-dependent free energy profile can be determined. We show that F(R) can reveal the presence of the elusive N* state, the population of which is enhanced by the application of force (R being the variable conjugate to f), thus establishing the potential applicability of the SMFS to characterize excited states in the free energy spectra of monomers.
To reveal the nature of the N* ensemble, we performed simulations of a coarse-grained model of the SH3 domain from the G.Gallus src tyrosine kinase (PDB code 1SRL) at various values of constant force applied to the ends of the protein. We determined the folding phase diagrams in the (f, T ) and (f, [C]) planes ([C] is the concentration of the denaturant guanidinium chloride (GdmCl)). The phase diagrams show that SH3 folding (triggered by changing f, [C], or T ) can be approximately characterized by a two-state model. The phase boundaries in the (f, T) and (f, [C]) planes collapse onto a universal curve, which we quantify using an analogy to superconductors in a magnetic field. The f-dependent F(R)s reveal the presence of a state N*, which becomes prominent as f increases. Structurally, the N* state corresponds to melting of the β-sheet formed by the N- and C-terminal strands. Surprisingly, an identical structure, with very low population, which has the same native fold as src SH3, has been recently identified in Fyn SH3 [18]. Force-unfolding kinetics shows that N* state becomes populated ahead of global unfolding. Most importantly, we establish that the structure of N* allows us to determine the mechanism of oligomer formation, which in turn naturally suggests a route to fibril formation. For the src SH3 protein this occurs by a domain-swap mechanism. Our work also shows that estimates of fibril formation times can be made using the population of N*, thus establishing a direct link between the entire folding landscape and the propensity to aggregate.
2. Results
SH3 stability as a function of perturbations
The native structure of SH3 domains consists of five anti-parallel β-strands packed to form two perpendicular β-sheets. The strands are connected by the RT-loop, n-src loop, and a short distal loop (Fig.1A). The N-terminal β-strand (β4 in Fig.1A) participates in the hydrophobic core along with the strands β1, β2 and β3. In order to quantify the equilibrium response of SH3 to mechanical force (f), we performed multiple replica exchange low friction Langevin dynamics simulations (see Methods) by applying a fixed constant f to the ends of the protein with and without denaturant.
Figure 1.
Force-dependent thermodynamics as a function of temperature and denaturant concentration: (A) Crystal structure and sequence of the src SH3 domain (1SRL). (B) Guanidinium chloride titration curves at f = 7 pN. (C) Protein stabilities as functions of [GdmCl] in the two-state picture, . (D) Reduced melting temperatures ΔTm = (Tm([C]) − Tm([0]))/Tm([0]) at different forces from 0 pN (blue) to 14 pN (orange). The data for all forces fall on a single line as explained in the main text; [C] denotes the concentration of GdmCl, ΔTm does not depend on f. (E) (inset) m(f) (plotted in kcal/mol/M at T = 310 K) depends weakly on f, with all of the dependence coming from the unfolded ensemble.
To discriminate between the folded and unfolded states, that are populated in the equilibrium simulation trajectories, we use the order parameter χ (structural overlap function)
| (1) |
where the sum is over the native contacts (as pairs of beads (i, j)), M is the number of native contacts, Θ(x) is the Heaviside function, Δ = 2Å is the tolerance in the definition of a contact, and ri,j and , respectively, are the coordinates of the beads in a given conformation {r} and the native state. The histogram of χ exhibits a bimodal behavior, implying that SH3 folds in an apparent two state manner (see Fig.S1). The value of χ = χc separating the native basin of attraction (NBA) from the unfolded ensemble, is obtained from analyzing the thermodynamics near the transition point, giving χc=0.65. The fraction of proteins in the NBA fNBA = 〈Θ(χc − χ)〉, with the averaging being over the Hamiltonian, which includes the transfer energy contribution due to the presence of denaturant(see Methods).
The titration curves, plotting fNBA as a function of [C] (Fig.1B) at different temperatures with f = 7 pN, show that the midpoint of the transition, Cm, calculated using fNBA([Cm]; f) = 0.5, decreases sharply as T increases. Interestingly, the results in Fig.1B show that at f ≠ 0, SH3 globally folds and unfolds reversibly in an apparent two state manner, just as in ensemble experiments at a fixed T [21–25]. Using the results in Fig.1B, the computed ΔGNBA([C], T, f) = −kBT ln [fNBA/(1 − fNBA)] at a fixed T with f = 7 pN is shown in Fig.1C. At a fixed T , the dependence on [C] is given by ΔGNBA([C], T, f) = Δ GNBA([0], T, f)+m(f)[C]. Surprisingly, m(f) is only weakly dependent on force because mechanical force does not perturb the intrinsic forces that determine the stability of proteins. Indeed, to the first approximation, m should be proportional to the solvent accessible surface area, or , and Rg does not depend on f while the protein is folded. From this perspective, use of f is a natural way to perturb the protein as opposed to T and [C], which invariably alter the interactions involving proteins, the solvent, and denaturants. All of the m(f) arises from the increase in the solvent accessible surface ares of the unfolded ensemble (see Fig.S2), which grows as [C] approaches Cm from below. Linear response to [C] is also indicated in the dependence of Tm ([C]) = Tm ([0], f) − γ(f)[C] at various f values (Fig.1D). By the reasoning given above we expect the reduced temperature ΔTm = (Tm([C]) − Tm([0]))/Tm([0]) = k[C] to be proportional to [C] at all the forces, with k being independent of f. The melting temperature without the denaturant Tm([0], f) depends on f, because the force changes the protein stability, but since it does not alter the interactions between the residues themselves, k should not depend on f. The thick solid line in Fig.1D, showing superposition of results at different f values, indeed satisfies the expected “scaling” behavior with k = −0.016M−1 for all the f values.
Phase diagrams
From the titration curves at multiple forces and a fixed temperature (T = 310K) we constructed the force-denaturant (f, [C]) phase diagram. Fig.2A shows that the critical (melting) force fm([C]) needed to unfold SH3 increases as [C] decreases, reflecting the [C]-dependent stability of the native state. The boundary separating the NBA and unfolded basin of attraction(UBA) is relatively sharp, implying that SH3 behaves as a two-state folder in the (f, [C]) plane. The phase boundaries at other temperatures are shown in Fig.2B. Similarly, the phase diagram as a function of (f, T) calculated at different [C] (see Fig.2C) also shows a two-state behavior. Not surprisingly, the melting temperature and the critical (melting) force decreases as [C] increases (Fig.2B). For a fixed T, the phase boundary can be quantified using,
| (2) |
where Cm(f, T) is melting concentration, and f0(T) is the melting force at [C] = 0. The parameters f0(T), C0(T) and β should depend on the protein. From fitting we find β = 1.25 ± 0.1, estimating error using jackknife[26]. A similar fit (fm/f0([C]) = 1 − (T/Tm)α) can be used to determine the family of curves at various values of [C][27] (Figs.2C and 2E). The solid lines in Figs.2D and 2E show that all curves (given in Figs.2B and 2C) collapse onto a master plot using the scaling functions above. The collapsed phase boundary in the restricted range of (f, T) values in Fig.2E appears linear, but we expect the master curve to be non-linear (α different from unity), which will be observable if the range of temperature is increased. However, such low T values may not be physically relevant. Nevertheless, the finding that the phase boundary for various values of [C] and f collapse onto a single curve is surprising, and is ACCEPTED MANUSCRIPT amenable to experimental scrutiny.
Figure 2.
Phase diagram of the src SH3 domain: (A) Force-denaturant phase diagram of the src SH3 domain at T=310K. A fraction of conformations inside the native basin of attraction is color-coded from white (fNBA=1) to purple (fNBA=0). (B) Boundaries between the folded and unfolded regions at different temperatures. (C) Boundaries between the folded and unfolded regions at different GdmCl concentrations. (D) (f,C) diagrams collapsed onto a master curve C/C0(T) = 1−(f/f0(T))β, β = 1.25±0.1. (E) (f, T) diagrams collapsed onto a master curve f/f0([C]) = 1 − (T/T0([C]))α.
The behavior of phase boundaries here is analogous to that found in super-conductors, which exhibits a sharp phase boundary in the (H, T) plane described by the identical power law[28]. The analogy to superconductor in a magnetic field is appropriate, because in both cases the transition is likely to be first order. As the magnetic field (H) in the superconductor, in our case, force enters linearly into the Hamiltonian, but is coupled to microscopic coordinates in a complicated fashion, leading to non-trivial response to f here or H in superconductor. Similarly, our Hamiltonian is a linear function of [C], which is coupled to a very complicated function of coordinates.
N* state is visible in the presence of force
The global and experimentally accessible quantities in Fig.1 show that upon various perturbations (T, [C] and f) SH3 folds in an apparent two state manner. A more nuanced picture emerges when the folding landscape is examined using a free energy profile F(R), where R is the extension of SH3 conjugate to f. Although not straightforward, F(R) can be precisely calculated[29] using folding trajectories generated in laser optical trap (LOT) experiments[30, 31]. The calculated free energy profile F(R) = −kBT ln P(R), where kB is the Boltzmann constant, P(R) is the distribution of R, exhibits a minimum at R = 1.15 nm, which is distinct from the one at R = 0.68 nm corresponding to the NBA (Figs.3,4). The fine structure in F(R) is subtle, and is not noticeable in other quantities such as the distribution of the radius of gyration (see Figs.S3–S6). The R = 1.15 nm peak in P(R) does not correspond to the globally unfolded state at R ≈ 10 nm. Barely noticeable at f = 0, the second minimum in F(R) becomes increasingly pronounced as f grows (Fig.4). In other words, mechanical force reveals an elusive state in the F(R) profiles.
Figure 3.
Free energy profiles at f = 7.5 pN. (A) F(R) at various temperatures, showing that globally, SH3 even under force unfolds in an apparent two-state manner. The transition state location nm is insensitive to temperature. (B) Fine structure in F(R) shows the emergence of the N* state, which is prominent at high forces. Population of the N* state is between 3% (at 300 K) and 6% (at 340 K).
Figure 4.
Distributions of the end-to-end distance, R, at different values of the stretching force at a fixed T=310K. The smaller peak corresponds to the aggregation-prone state N*, which becomes more prominent as the magnitude of the force grows.
Structural characteristics of N*
In order to characterize the structure of the N* state corresponding to the peak at R = 1.15 nm (Fig.4), we calculated the individual overlap parameters using Qβ1,β2, Qβ2,β3 and Qβ4,β5[32] for each pair of β-strands that are part of a β-sheet using
| (3) |
where the sum is over the contacts defining the Q({r}) (for instance, for Qβ1,β2 these would be pairs of beads, one from β1 and the other from β2 over all the beads in β1 and β2); M is the number of such contacts, ri,j are the coordinates of the beads in the conformation for which the Q is calculated ({r}), and are the corresponding coordinates in the native state.
Two-dimensional histogram, in terms of the order parameters (R, Qβ4,β5) in Fig.5A, shows that the second peak in the P(R) profiles (Fig.4) corresponds to a smaller value of Qβ4,β5 compared to that found in the NBA. However, Qβ1,β2 and Qβ2,β3 retain their values in the native state (the higher peak) (Fig.5B). Consequently, the second peak in P(R) must reflect a state in which the β4-β5 sheet is melted (or disordered) with the rest of the native structure remaining intact. Because the N* structure is native-like in SH3, we surmise that it is a native substate, which is hidden at f = 0. This assumption is corroborated by kinetic simulations showing that N* is frequently visited during the native dynamics. We identify the ensemble of conformations belonging to the second peak as the N* state that is aggregation prone (see below for comparison to experiments).
Figure 5.
Structural characteristics of the N* state. Histograms of the conformations in the folded state at f = 10 pN and T = 350K. The second, lower peak in P(R) correlates with the melting of the β4-β5 sheet as shown by the fraction of contacts between the residues of those strands Qβ4,β5 (A). The y-axis on the 1D plot shows the probability density, while for the 2D plot it is Qβ4,β5, and the probability density is shown by color from blue (low) to red (high). Qβ1,β2 and Qβ2,β3 (B) remain the same in the N* state as in N.
Force-induced unfolding kinetics
In order to determine if remnants of the N* state arise in the force-unfolding kinetics of SH3 domain we performed Brownian dynamics simulations at high friction as described in Methods at T = 340, 350 and 360K, and f = 10 pN. The observed unfolding times spanned a broad range of timescales (from hundreds of microseconds to over 10 milliseconds) (see Table S1 showing the unfolding times for individual trajectories). The sequence of events during unfolding was always the following. First the β4-β5 ruptures, populating the N* state. Following this event, in about 2–3μs, the β1-β2-β3 sheet rips, taking a few more microseconds (Figs.6B and Figs.6C). In this process, contacts between either β2-β3, or β1-β2 can break first. Both β2-β3 and β1-β2 sheets continue to transiently and partially reform in the unfolded state. Thus, breaking of the β4-β5 sheet is not by itself rate limiting in the global unfolding of SH3 under these conditions. The sheet ruptures and reforms (Fig.6) on the order of hundreds of microseconds (in agreement with the reported accessibility of the N* on the scale of milliseconds in the NMR experiment for Fyn SH3[18]), but only when it stays melted long enough for the other β sheet to follow (about 2μs) does the protein globally unfold. The kinetic simulations show that N* is the same intermediate identified in equilibrium free energy profiles. Thus, in the folding landscape of SH3, the N* state is indeed a folding intermediate forming after the major folding barrier is crossed leading to formation of N (formation of the β1-β2-β3 sheet). Since we observe unfolding rather folding, and N* is visited multiple times before global unfolding, it is natural to think of it as a native substate; N* is a native excitation around the folded state, and is visited during native dynamics even when the protein is thermodynamically in the folded state. After the initial event, there is a bifurcation in the unfolding pathways; in some of the trajectories β1-β2 ruptures first (Fig.6B for example) while in others β2-β3 unfolds first (Fig.6C). Thus, the kinetic trajectories also show a two-state behavior, with two possible unfolding pathways.
Figure 6.
Highlights of unfolding kinetics under stretching (trajectories at f = 10 pN, T = 350 K, [C] = 0 are shown). (A) The β4-β5 sheet melts and reforms on the order of 10−5 s before completely unfolding. (B,C) When β4-β5 stays unfolded long enough so that contacts in the other β-sheet start breaking, the β1-2-3 sheet also melts completing the unfolding. Contacts between β1 and β2, and between β2 and β3 continue to transiently and partially reform even after reaching the unfolded state. β1-β2 breaks before the β2-β3 in trajectory shown in (B). The reverse sequence occurs in the trajectory shown in (C).
Dimerization
Our main hypothesis is that the excited N* state is prone to aggregation. In order to test this hypothesis, we have simulated the process of dimerization, starting from the two monomers in N* conformations, which were generated in the monomer simulations with one pair of β4 and β5 in contact, and the other pair on the opposite sides (Fig.7A). We performed overdamped Brownian dynamics simulations at T = 277K, controlling the concentration of the protein by constraining the distance between the centers of mass to 15 Å (~ Rg of a monomer)[33]. On a relatively short time scale, we observed dimerization, through the stages presented in simulation snapshots in Fig.7B–D: partial unfolding of RT-loops, and the remaining β4 finding the β5 of the other molecule, forming the domain-swapped dimer (see Supplementary Movie 1). Thus, the most probable route to aggregation in this protein is through domain-swap mechanism, which was already established in a previous study[34]. Here, we explicitly show that domain swapped structures form readily by accessing a high energy N* state.
Figure 7.
Stages of dimerization through domain swapping. (A) Starting configuration, corresponding to two monomers in N* conformation. (B) RT-loop of the red molecule unfolded with sticky β4 on its end; the blue RT-loop starts to unfold. (C) RT-loops unfolded. Red β4 continues diffusional motion around the blue monomer. (D) Red β4 finds forms a β-sheet by associating with the blue β5 in a domain-swapped dimer. (E) Trimer of the src SH3 domain formed during simulations of three SH3 molecules.
To investigate whether this process of swapping the β4 can provide a pathway to aggregation of multiple molecules, we also simulated addition of a third monomer to the dimer. We started from a configuration where dimer is not formed with one of the sticky β4 strands being unstructured (Fig.7C). With the same conditions for temperature and concentration as for dimer formation, we observed formation of a trimer (Fig.7E and Supplementary Movie 2).
3. Discussion
N* state as precursor to oligomerization
Theoretical arguments along with molecular dynamics simulations of peptides and proteins[14, 35–37] have shown that aggregation is initiated when the protein (at least transiently) populates a high free energy N* state. In the last few years there have been several experiments on proteins of different lengths, with no sequence or structural similarity, in which the N* states have been experimentally characterized[12, 18, 38]. The closest example that is very similar to the N* state studied here arises in G.Gallus Fyn SH3 domain[18]. Remarkably, Kay and coworkers identified using relaxation dispersion NMR experiments, a state with low population in A39V/N53P/V55L mutant Fyn SH3 under native conditions[18], that is identical to our finding for the structurally similar src SH3 domain. The backbone structure of the folding intermediate determined for Fyn SH3 is similar to the native state everywhere except in regions adjacent to the N- and C- termini. A detailed comparison is provided in Fig.S7. Of particular note is that the Cterminal β5 strand is disordered in the Fyn SH3 intermediate. The melting of the β4-β5 sheet leaves the N-terminal region exposed, which can trigger oligomer formation. They further corroborated the aggregation propensity of the region by preparing a truncation mutant, which is a mimic of the intermediate (or the ACCEPTED MANUSCRIPT N* state), and established that the resulting mutant forms amyloid-like fibrils with high β-strand content.
Interpreting the results of the NMR experiments on Fyn SH3 mutant using our findings, we conclude, that the folding intermediate of Fyn SH3 and the native substate identified in the src SH3 from the tyrosine kinase are the N* states. In other words, they are the ensembles of conformations in the spectrum of native excitations, that each monomer must populate in the course of oligomerization and fibril formation. Although not linked to any known disease, SH3 domains are known to aggregate in in vitro experiments. In many proteins, containing the SH3 domain, for instance, the growth factor receptor-bound pro260 tein 2[39] or phosphatidylinositol 3’-kinase[40], SH3 is located at the N-terminus of the protein. It is likely that the mechanism of aggregation by exposure of the sticky N-terminal β-strand of SH3 might remain operational in vivo as well.
β2-microglobulin and acylphosphatase populate the N* under native conditions
Two other examples are also worth pointing out. In one of the earliest ex265 periments, Radford and coworkers showed that aggregation of β2-microglobulin into amyloid-like fibrils occurs from a native-like (N*) intermediate that has low population at equilibrium[38, 41]. Despite the large barrier created by the proline frozen in the cis conformation, it was suggested that the N* state is part of the native state ensemble separated from the NBA by a large enough free energy gap that its population is low under native conditions. Finally, using NMR experiments and molecular dynamics with H/D exchange data as restraints, a free energy profile was reconstructed using RMSD of the Cα as the reaction coordinate for acylphosphatase was generated in the absence and presence of trifluoroethanol (TFE). These results suggest that a native-like N* state becomes visible at non-zero concentration of TFE. Just as in SH3 domains, interactions between two critical β-strands in acylphosphatase are disrupted exposing them to the solvent, thus making them aggregation prone[42]. Taken together, these results not only show that aggregation by populating N* state is a generic mechanism, but also reinforces earlier prediction that sequences of most natural proteins with predominantly β-sheet secondary structure have evolved so that intermolecular interactions between edge strands (β5 in SH3 for example) are unfavorable[41, 43].
Population of N* and fibril formation rates
In our previous work [44] we showed that the time scale of fibril formation is related to PN* (expressed in terms of percentage), the probability of populating the N* state as
| (4) |
where τF is the folding time, λ ≈ 108 and C ≈ 1 [44]. It is tempting to estimate τfib for SH3 domain using the experimentally measured τF ≈ 1.7 · 10−2 s. The value of PN* (T ) = 2% for Fyn SH3 domain[18], which coincides with the calculated value for src SH3 domain (see Figs. S8 and S9). Using τF and the estimate for PN* (T) the above equation yields τfib is on the order of 2–3 days, which is consistent with the measurements reported for Fyn SH3[18].
A few implications for amyloidogenesis follows from the importance of PN*. (i) The fibril formation time scale should decrease dramatically as PN* increases. (ii) Since PN* (at a fixed temperature) depends on the free energy gap between N* and N states, it follows that only those proteins for which there is a reasonable probability of being the N* state under physiological conditions would aggregate in relevant time scales. (iii) The temperature dependence of τfib (Eq.4) is highly non-trivial because PN*(T) would be a maximum at some optimal temperature and is expected to decrease at low temperatures. (iv) It is the free energy gap between N* and the NBA, rather than the global stability, that determines the propensity of a protein to aggregate.
In our simulations force was applied to the ends of the protein. However, in simulations that we performed with applying force to the residues 9 (N-terminal end) and 59 (before the β5 sheet in sequence), as in reported SMFS experiments with SH3[19], the N* is also accentuated by force, showing some robustness of the SMFS as a tool for highlighting the elusive states. We also note, that although not easily detectable, N* is present as the excitation in the native ensemble even at f = 0, so many of the implications for fibril formation do not depend on force (including the pulling points). Thus, the detectability of N* to some extent does not depend on the pulling direction.
From N* to oligomers to fibrils
Our simulations of the process of oligomer formation starting from N* structures, support the finding that N* is aggregation prone. The ease of assembly of dimer and trimer by domain-swap mechanism allows for a route to aggregation in SH3 domain. Our trimer simulation shows that when the dimer is not fully formed, the sticky β4 dangling on the unfolded RT-loop, associated not with counterpart in the dimer but with a third molecule in N* state, leading to the formation of a a trimer. In the formed dimer (or trimer), the contacts between β4 and β5 break once in a while, allowing for the sticky β4 strand of another molecule to come into proximity, leading to the growth of higher order structures. This domain-swapping (or rather β-strand swapping) mechanism can include an arbitrary number of molecules: β4Aβ5A monomer, β4Aβ5B + β4Bβ5A dimer, β4Aβ5B + β4Bβ5C + β4Cβ5A trimer, β4Aβ5B + β4Bβ5C + β4Cβ5D + β4Dβ5A tetramer etc. (Fig.8). The strand exchange mechanism shows that as the molecular weight of higher order structures increases, the higher is the probability that the β4 strand at the end of the assembly will dangle on the RT-loop rather than form contacts with β5, because it becomes harder to accommodate the RT-loops within the oligomer of growing size.
Figure 8.
The pathway to aggregation through the N* state. First, if two molecules in N* state come close together, and one of the sticky β4 makes contacts with the other molecule, the RT-loops will unfold. Eventually, the other β4 can come into contact with β5 of another molecule forming a domain-swapped dimer. This process can be repeated resulting in fibril formation.
The domain-swapping mechanism of src SH3 aggregation has been reported more than a decade ago[34]. Our main message here is the importance of native fluctuations of a monomer leading to the population of the aggregation-prone state N* and how this state can be gleaned by application of mechanical force. Our oligomerization simulations serve merely as an illustration of the principle that force can be used to access the N* states. Subsequent growth of oligomers can occur by domain swap mechanism as in SH3 or by other complex scenarios.
Rationale for using the coarse-grained models
In all computer simulations it is important, but seldom practiced, to ascertain the extent to which the models capture reality. Our view is that the efficacy of the model can only be assessed by the ability to provide new insights and predict the outcomes of experiments. We have indeed produced a number of novel testable predictions using coarse-grained models. This level of falsifiable predictions cannot be currently achieved using all-atom simulations. We have shown in a number of studies that coarse-grained model used here has been remarkably successful in reproducing and predicting the force-stretching single-molecule experimental data in proteins[29, 45–47] and RNA[46, 48] including predictions that have been later validated quantitatively. The SOP-SC model used in this work is a more refined version of SOP. It has reproduced experimental results for the denaturant induced unfolding[49, 50]. The choice of the SOP-SC model, therefore, is very adequate for studying the questions we have answered in this work.
4. Conclusions
We have established, using in silico force spectroscopy of the src SH3 domain, that response of native proteins to mechanical force provides quantitative insights into protein aggregation. The force-denaturant phase diagram for the SH3 domain shows that the boundary separating the folded and unfolded states is universal (Eq.2) implying that determination of C0, f0 and β is sufficient to predict the phase diagram at any temperature.
Of particular importance is the demonstration that mechanical force can be used to elucidate a fine structure in the organization of the native state. In particular, force highlights the aggregation-prone N* state in the native ensemble of SH3, by increasing its stability. The structure of the N* state in src SH3 domain is identical to an aggregation-prone folding intermediate recently found in the Fyn SH3 domain in a relaxation dispersion NMR experiment[18]. Sim360 ulations of kinetics of unfolding under force shows that the N* state has to be formed and linger for a substantial time period prior to global unfolding. Our work shows that force spectroscopy in combination with simulations, carried out under the same conditions as experiments, is a viable technique to characterize important substates in the spectrum of native excitations even if they have low populations, as is the case in SH3 domain.
A byproduct of our study is that force (or more generally stress) can enhance the probability of oligomer and fibril formation. This implies that proteins that operate against load (motor proteins for example) have evolved to minimize the tendency to sample N* states by ensuring that the free energy gap separating the N* and the native state is large.
It follows from our work that aggregation propensity of folded proteins under native conditions is determined by the accessibility of the N* state, which in SH3 domains involves partial unfolding of the folded state. Thus, it is the free energy gap separating the NBA and N* state (ΔGNBA,N*), rather than the global stability, which plays a critical role in protein aggregation. A corollary of this prediction is that mutations that increase (decrease) ΔGNBA,N* would decrease (increase) aggregation probability. Protein aggregation, therefore, depends not only on sequence, but also on ΔGNBA,N*, which can be manipulated by changing external conditions.
Our work also shows that by knowing the structure of N* state it might be possible to construct structural models for oligomers, as we have done for the src SH3 domain. The generality of this approach requires further work.
5. Methods
Model
Our simulations were performed using the self-organized polymer model with side-chains (SOP-SC)[49] for the protein. In the SOP-SC model each amino-acid is represented using two interaction centers. The energy function in the SOP-SC representation of a polypeptide chain is taken to be,
| (5) |
The potential is based on the native structure (PDB structure in our case), which assigns an attraction between the residues that are in contact in the native state and are at least 2 residues apart along the chain. The form of is,
| (6) |
where the sums are over the residues. We use the Lennard-Jones function for the beads that are in contact in the native state (with the minimum at the native distance):
| (7) |
where Δ = 1 if the native distance in the (we use Rc = 8Å), and Δ = 0 otherwise. We use the Betancourt–Thirumalai statistical potential for εij[51]. Superscripts b and s stand for backbone bead and side chain bead, respectively.
Non-native interactions are modeled by soft sphere interactions:
| (8) |
where σbb = 3.8Å is the average distance between neighboring Cα atoms, and is the sum of van der Waals radii of corresponding beads (backbone or side-chain depending on the amino acid).
Repulsive interactions between neighboring residues stabilize local secondary structure:
| (9) |
The values of all σ and ε are the same as used in [49].
The chain connectivity is described by UFENE, or finite extensible nonlinear elastic potential:
| (10) |
where the summation is over all bonds (whether between two backbone beads or between a backbone and a sidechain bead), l is the distance between the two bonded beads and l0 is the length of the bond in the native state. We used spring constant k = 20 kcal/mol/Å2 and R0 = 2Å.
Model for denaturant
The molecular transfer model[49, 52, 53] is a phenomenological way to account for the effect of denaturants, where the effective interactions involving denaturant represent the free energy of transfer of a residue or backbone from pure water to the aqueous solution of the denaturant with concentration [C]. The transfer free energy term is a sum of terms proportional to the solvent accessible surface areas of individual residues (backbone and sidechain):
| (11) |
where the sum is over all backbone and sidechain interaction centers, δg(k, [C]) = mk [C]+bk is the transfer free energy of the bead k, and mk and bk are different for the backbone and sidechains, and depend on the residue identity. The solvent accesible surface area (SASA) is αk(r) for bead k in the protein conformation r, and αGly−k−Gly is the SASA of the bead k in the tripeptide Gly-k-Gly. We used the same values of mk, bk and αGly−k−Gly as in [49]. We used the procedure described in [54] to calculate the SASA.
Calculating SASA is computationally expensive. It is possible to avoid calculating ΔG(r, [C]) at every simulation step, if only equilibrium properties are of interest. If ℋ[C] is the full Hamiltonian, ℋ0, and −ΔG(r, [C]) is the restriction potential similar to the one used in umbrella sampling simulation, it is possible to perform converged simulations using the Hamiltonian ℋ[0]. The properties of the system under the unperturbed Hamiltonian ℋ0 = ℋ[C] can then be calculated using the weighted histogram analysis method (WHAM) equations[55].
Multichain interactions
The non-bonded interactions between beads on different chains were the same as if they were on the same chain. That is, if two residues, e.g. V61 and V11 form native contact, then V61 and V11 have the same native interaction (with εs and σs for valine-valine) even if they belong to different chains. Two residues that interact non-natively (i.e. via soft sphere repulsion) also behave the same way on different chains. Similar model for multichain interaction has been employed by Ding et al[34] for studying of aggregation of src SH3, that is the native interactions were the same between the residues whether on the same or different chains. The differences were that Ding et al. used pure Go model and added hydrogen bonds between the backbone atoms of different molecules.
Simulations
We used Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) [56] to perform the simulations, enhancing it for the SOPSC model. Langevin dynamics is performed as the numerical solution of the Langevin equation for every protein bead:
| (12) |
where m is the mass of the bead, v(t) = ṙ; Fc are conservative forces acting on the bead (interactions with the other bead), Fc = ∂ℋ/∂r; Fr is the white noise random force with . We integrated the Langevin equations using Verlet algorithm as implemented in LAMMPS[56, 57]. The timescale in Eq.12 is defined by ζ/m. The relevant timescale for low friction limit, where inertial term is significant in SOP-SC model is ps. For low friction limit (when simulating thermodynamics) we use ζ = 0.05m/τL. We used LAMMPS facilities for the replica exchange simulations. We made random exchange attempts at 100τL intervals, with the same intervals between saving the trajectory snapshots for analysis (to calculate the transfer energies). For the overdamped limit (when simulating kinetics), where the inertial term in the Langevin equation is negligible, the relevant timescale is τH = ζσ2/kT . For high friction we use ζ = 50m/τL, yielding τH ≈ 150 ps[49, 58].
For calculating phase diagrams, we have performed replica exchange simulations at low friction and at the following fixed values of f: 0, 0.5, 1, 1.5, 2, 2.25, 2.5, 2.75, 3, 3.25, 3.5, 4, 4.5, 5, 5.5, 5.75, 6, 6.5, 7, 7.25, 7.5, 8, 8.25, 8.5, 9, 9.5, 10, 10.5, 11, 11.5, 12, 12.5, 13, 13.5, 14, 15 pN, at [C] = 0 and used the sampled conformations at various temperatures to infer the distribution of NBA and denatured states at particular values of C and T for every f, using WHAM with the transfer energy in place of biasing potential as detailed in the subsection below. From individual replicas we calculated distributions and free energy profiles as a function of end-to-end distance R for various values of f (at fixed T and [C] = 0) and T (for fixed f and [C] = 0)
For calculating unfolding kinetics, we ran 16 trajectories at 3 different temperatures, as detailed in the Table S1.
To illustrate dimerization and path to fibril formation we started the simulation with two molecules in N* conformation. We fixed the distance between their centers of mass at ~ 15 Å (~ Rg) with a parabolic potential (as a way to simulate high concentration) and ran Brownian dynamics at T=277K until observing the formation of a dimer, just after about 100 μs. The whole process is shown in the Supplementary Movie 1.
For the trimer formation, we started from a snapshot from the dimerization trajectory and added a third molecule in N* state. We constrained the three distances between three centers of mass to the same ~ 15 Å. We observed the formation of trimer (that is, the last β4 finding its spot on another molecule next to its β5) after about 1 ms. Supplementary Movie 2 shows ~ 50μs preceding the formation of a trimer.
WHAM for equilibrium properties
For simulations with the Hamiltonian
| (13) |
where Vrest is the restriction potential with a coupling constant λ, WHAM can be used to calculate equilibrium properties of the system with Hamiltonian ℋ0(r). In case of MTM Eq.13 becomes:
| (14) |
The unnormalized probability density of an observable ξ from M simulations performed using ℋ[0] with i-th simulation carried out at temperature Ti is[55]:
| (15) |
where Nk (ξ) is the histogram of ξ in k-th simulation, nm is the number of snapshots (conformations) analyzed for m-th simulation. The free energy of the i-th simulation, fi is:
| (16) |
which can be found self-consistently using:
| (17) |
In Eq.17, ℋ[0](k, t) is the potential energy of the t-th snapshot from the k-th simulation (in the absence of denaturants). After calculating the fi the averages for the MTM Hamiltionian (11) can be obtained using (15):
| (18) |
with the partition function
| (19) |
where H[Ci](k, t) = H[0](k, t)+ΔG(k, t, [Ci]), and ξ(k, t), H[0](k, t) and ΔG(k, t, [Ci]) are the values of ξ, potential energy and transfer energy to the solution of concentration [Ci] in the t-th snapshot of k-th simulation.
Supplementary Material
Research Highlights.
Low population of aggregation prone N* states encoded in the monomer folding landscape can detected by applying mechanical force.
Phase diagram in the force-denaturant obeys a universal boundary separating the folded and unfolded states.
N* states are hidden in conventional ensemble experiments but are visible in single molecule force spectroscopy.
Population of N* states is related to aggregation rates.
In src SH3 domain collisions between N* states results in amyloid formation by a domain swap growth mechanism.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Schuler B, Eaton WA. Protein folding studied by single-molecule FRET. Current Opinion In Structural Biology. 2008;18(1):16–26. doi: 10.1016/j.sbi.2007.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lindorff-Larsen K, Piana S, Dror RO, Shaw DE. How Fast-Folding Proteins Fold. Science (New York, NY) 2011;334(6055):517–520. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
- 3.Onuchic JN, Wolynes PG. Theory of protein folding. Current Opinion In Structural Biology. 2004;14(1):70–75. doi: 10.1016/j.sbi.2004.01.009. [DOI] [PubMed] [Google Scholar]
- 4.Shakhnovich E. Protein folding thermodynamics and dynamics: where physics, chemistry, and biology meet. Chemical Reviews. 2006;106(5):1559–1588. doi: 10.1021/cr040425u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Thirumalai D, O’Brien EP, Morrison G, Hyeon C. Theoretical perspectives on protein folding. Annual Review Of Biophysics. 2010;39(39):159–183. doi: 10.1146/annurev-biophys-051309-103835. [DOI] [PubMed] [Google Scholar]
- 6.Shea J-E, Urbanc B. Insights into Aβ aggregation: a molecular dynamics perspective. Current topics in medicinal chemistry. 2012;12(22):2596–2610. doi: 10.2174/1568026611212220012. [DOI] [PubMed] [Google Scholar]
- 7.Straub JE, Thirumalai D. Principles governing oligomer formation in amyloidogenic peptides. Current Opinion In Structural Biology. 2010;20(2):187–195. doi: 10.1016/j.sbi.2009.12.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Aguzzi A, O’Connor T. Protein aggregation diseases: pathogenicity and therapeutic perspectives. Nature reviews. Drug discovery. 2010;9(3):237–248. doi: 10.1038/nrd3050. [DOI] [PubMed] [Google Scholar]
- 9.Chiti F, Dobson CM. Protein misfolding, functional amyloid, and human disease. Annual review of biochemistry. 2006;75:333–366. doi: 10.1146/annurev.biochem.75.101304.123901. [DOI] [PubMed] [Google Scholar]
- 10.Bucciantini M, Giannoni E, Chiti F, Baroni F, Formigli L, Zurdo J, Taddei N, Ramponi G, Dobson CM, Stefani M. Inherent toxicity of aggregates implies a common mechanism for protein misfolding diseases. Nature. 2002;416(6880):507–511. doi: 10.1038/416507a. [DOI] [PubMed] [Google Scholar]
- 11.Goldschmidt L, Teng PK, Riek R, Eisenberg D. Identifying the amylome, proteins capable of forming amyloid-like fibrils. Proceedings of the National Academy of Sciences of the United States of America. 2010;107(8):3487–3492. doi: 10.1073/pnas.0915166107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.De Simone A, Dhulesia A, Soldi G, Vendruscolo M, Hsu S-TD, Chiti F, Dobson CM. Experimental free energy surfaces reveal the mechanisms of maintenance of protein solubility. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(52):21057–21062. doi: 10.1073/pnas.1112197108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Tartaglia GG, Pawar AP, Campioni S, Dobson CM, Chiti F, Vendruscolo M. Prediction of aggregation-prone regions in structured proteins. Journal of molecular biology. 2008;380(2):425–436. doi: 10.1016/j.jmb.2008.05.013. [DOI] [PubMed] [Google Scholar]
- 14.Thirumalai D, Klimov DK, Dima RI. Emerging ideas on the molecular basis of protein and peptide aggregation. Current Opinion In Structural Biology. 2003;13(2):146–159. doi: 10.1016/s0959-440x(03)00032-0. [DOI] [PubMed] [Google Scholar]
- 15.Tarus B, Straub JE, Thirumalai D. Dynamics of Asp23-Lys28 saltbridge formation in Abeta10-35 monomers. JACS. 2006;128(50):16159–16168. doi: 10.1021/ja064872y. [DOI] [PubMed] [Google Scholar]
- 16.Massi F, Straub JE. Energy landscape theory for Alzheimer’s amyloid beta-peptide fibril elongation. Proteins. 2001;42(2):217–229. doi: 10.1002/1097-0134(20010201)42:2<217::aid-prot90>3.0.co;2-n. [DOI] [PubMed] [Google Scholar]
- 17.Hu W, Walters BT, Kan Z-Y, Mayne L, Rosen LE, Marqusee S, Englander SW. Stepwise protein folding at near amino acid resolution by hydrogen exchange and mass spectrometry. Proceedings of the National Academy of Sciences of the United States of America. 2013;110(19):7684–7689. doi: 10.1073/pnas.1305887110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Neudecker P, Robustelli P, Cavalli A, Walsh P, Lundstrom P, Zarrine-Afsar A, Sharpe S, Vendruscolo M, Kay LE. Structure of an Intermediate State in Protein Folding and Aggregation. Science (New York, NY) 2012;336(6079):362–366. doi: 10.1126/science.1214203. [DOI] [PubMed] [Google Scholar]
- 19.Jagannathan B, Elms PJ, Bustamante C, Marqusee S. Direct observation of a force-induced switch in the anisotropic mechanical unfolding pathway of a protein. Proceedings of the National Academy of Sciences of the United States of America. 2012;109(44):17820–17825. doi: 10.1073/pnas.1201800109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Jagannathan B, Marqusee S. Protein folding and unfolding under force. Biopolymers. doi: 10.1002/bip.22321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fernádez-Escamilla AM, Cheung MS, Vega MC, Wilmanns M, Onuchic JN, Serrano L. Solvation in protein folding analysis: combination of theoretical and experimental approaches. Proc Natl Acad Sci U S A. 2004;101(9):2834–2839. doi: 10.1073/pnas.0304180101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Grantcharova VP, Baker D. Folding dynamics of the src SH3 domain. Biochemistry. 1997;36(50):15685–15692. doi: 10.1021/bi971786p. [DOI] [PubMed] [Google Scholar]
- 23.Grantcharova VP, Riddle DS, Baker D. Long-range order in the src SH3 folding transition state. Proc Natl Acad Sci U S A. 2000;97(13):7084–7089. doi: 10.1073/pnas.97.13.7084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Riddle DS, Grantcharova VP, Santiago JV, Alm E, Ruczinski I, Baker D. Experiment and theory highlight role of native state topology in SH3 folding. Nat Struct Biol. 1999;6(11):1016–1024. doi: 10.1038/14901. [DOI] [PubMed] [Google Scholar]
- 25.Martínez JC, Serrano L. The folding transition state between SH3 domains is conformationally restricted and evolutionarily conserved. Nat Struct Biol. 1999;6(11):1010–1016. doi: 10.1038/14896. [DOI] [PubMed] [Google Scholar]
- 26.Efron B. The Jackknife, the Bootstrap, and Other Resampling Plans. Philadelphia, PA: Society for Industrial and Applied Mathematics; 1982. [Google Scholar]
- 27.Klimov DK, Thirumalai D. Stretching single-domain proteins: phase diagram and kinetics of force-induced unfolding. Proc Natl Acad Sci U S A. 1999;96(11):6166–6170. doi: 10.1073/pnas.96.11.6166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Keefe PD. Quantum mechanics and the second law of thermodynamics: an insight gleaned from magnetic hysteresis in the first order phase transition of an isolated mesoscopic-size type I superconductor. Physica Scripta. 2012;2012(T151):014029. [Google Scholar]
- 29.Hinczewski M, Gebhardt JCM, Rief M, Thirumalai D. From mechanical folding trajectories to intrinsic energy landscapes of biopolymers. Proceedings of the National Academy of Sciences of the United States of America. 2013;110(12):4500–4505. doi: 10.1073/pnas.1214051110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Greenleaf WJ, Block SM. Single-molecule, motion-based DNA sequencing using RNA polymerase. Science (New York, NY) 2006;313(5788):801. doi: 10.1126/science.1130105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Stigler J, Ziegler F, Gieseke A, Gebhardt JCM, Rief M. The Complex Folding Network of Single Calmodulin Molecules. Science (New York, NY) 2011;334(6055):512–516. doi: 10.1126/science.1207598. [DOI] [PubMed] [Google Scholar]
- 32.Hardin C, Eastwood MP, Luthey-Schulten Z, Wolynes PG. Associative memory hamiltonians for structure prediction without homology: alphahelical proteins. Proc Natl Acad Sci U S A. 2000;97(26):14235–14240. doi: 10.1073/pnas.230432197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dima RI, Thirumalai D. Exploring protein aggregation and selfpropagation using lattice models: Phase diagram and kinetics. Protein science : a publication of the Protein Society. 2002;11(5):1036–1049. doi: 10.1110/ps.4220102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ding F, Dokholyan NV, Buldyrev SV, Stanley HE, Shakhnovich EI. Molecular dynamics simulation of the SH3 domain aggregation suggests a generic amyloidogenesis mechanism. Journal of molecular biology. 2002;324(4):851–857. doi: 10.1016/s0022-2836(02)01112-9. [DOI] [PubMed] [Google Scholar]
- 35.De Simone A, Kitchen C, Kwan AH, Sunde M, Dobson CM, Frenkel D. Intrinsic disorder modulates protein self-assembly and aggregation. Proceedings of the National Academy of Sciences of the United States of America. 2012;109(18):6951–6956. doi: 10.1073/pnas.1118048109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Krobath H, Estacio SG, Faisca PFN, Shakhnovich EI. Identification of a Conserved Aggregation-Prone Intermediate State in the Folding Pathways of Spc-SH3 Amyloidogenic Variants. Journal of molecular biology. 2012;422(5):705–722. doi: 10.1016/j.jmb.2012.06.020. [DOI] [PubMed] [Google Scholar]
- 37.Rosenman DJ, Connors C, Chen W, Wang C, García AE. Aβ Monomers Transiently Sample Oligomer and Fibril-Like Configurations: Ensemble Characterization Using a Combined MD/NMR Approach. Journal of molecular biology. doi: 10.1016/j.jmb.2013.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Eichner T, Radford SE. A generic mechanism of beta2-microglobulin amyloid assembly at neutral pH involving a specific proline switch. Journal of molecular biology. 2009;386(5):1312–1326. doi: 10.1016/j.jmb.2009.01.013. [DOI] [PubMed] [Google Scholar]
- 39.Lowenstein EJ, Daly RJ, Batzer AG, Li W, Margolis B, Lammers R, Ullrich A, Skolnik EY, Bar-Sagi D, Schlessinger J. The SH2 and SH3 domain-containing protein GRB2 links receptor tyrosine kinases to ras signaling. Cell. 1992;70(3):431–442. doi: 10.1016/0092-8674(92)90167-b. [DOI] [PubMed] [Google Scholar]
- 40.Chen JK, Schreiber SL. SH3 domain-mediated dimerization of an nterminal fragment of the phosphatidylinositol 3-kinase p85 subunit. Bioorganic & Medicinal Chemistry Letters. 1994;4(14):1755–1760. [Google Scholar]
- 41.Jahn TR, Parker MJ, Homans SW, Radford SE. Amyloid formation under physiological conditions proceeds via a native-like folding intermediate. Nature structural & molecular biology. 2006;13(3):195–201. doi: 10.1038/nsmb1058. [DOI] [PubMed] [Google Scholar]
- 42.Bemporad F, Chiti F. Native-like aggregation” of the acylphosphatase from Sulfolobus solfataricus and its biological implications. FEBS Lett. 2009;583(16):2630–2638. doi: 10.1016/j.febslet.2009.07.013. [DOI] [PubMed] [Google Scholar]
- 43.Richardson JS, Richardson DC. Natural beta-sheet proteins use negative design to avoid edge-to-edge aggregation. Proc Natl Acad Sci U S A. 2002;99(5):2754–2759. doi: 10.1073/pnas.052706099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Li MS, Co NT, Reddy G, Hu C-K, Straub JE, Thirumalai D. Factors governing fibrillogenesis of polypeptide chains revealed by lattice models. Phys Rev Lett. 2010;105(21):218101. doi: 10.1103/PhysRevLett.105.218101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Mickler M, Dima RI, Dietz H, Hyeon C, Thirumalai D, Rief M. Revealing the bifurcation in the unfolding pathways of GFP by using single-molecule experiments and simulations. Proceedings of the National Academy of Sciences of the United States of America. 2007;104(51):20268–20273. doi: 10.1073/pnas.0705458104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hyeon C, Dima RI, Thirumalai D. Pathways and kinetic barriers in mechanical unfolding and refolding of RNA and proteins. Structure (London, England :1993) 2006;14(11):1633–1645. doi: 10.1016/j.str.2006.09.002. [DOI] [PubMed] [Google Scholar]
- 47.Hyeon C, Thirumalai D. Capturing the essence of folding and functions of biomolecules using coarse-grained models. Nature communications. 2011;2:487. doi: 10.1038/ncomms1481. [DOI] [PubMed] [Google Scholar]
- 48.Lin J-C, Thirumalai D. Relative Stability of Helices Determines the Folding Landscape of Adenine Riboswitch Aptamers. Journal of the American Chemical Society. 2008;130(43):14080. doi: 10.1021/ja8063638. [DOI] [PubMed] [Google Scholar]
- 49.Liu Z, Reddy G, O’Brien EP, Thirumalai D. Collapse kinetics and chevron plots from simulations of denaturant-dependent folding of globular proteins. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(19):7787–7792. doi: 10.1073/pnas.1019500108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Reddy G, Liu Z, Thirumalai D. Denaturant-dependent folding of GFP. Proceedings of the National Academy of Sciences of the United States of America [Google Scholar]
- 51.Betancourt MR, Thirumalai D. Pair potentials for protein folding: choice of reference states and sensitivity of predicted native states to variations in the interaction schemes. Protein science : a publication of the Protein Society. 1999;8(2):361–369. doi: 10.1110/ps.8.2.361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.O’Brien EP, Ziv G, Haran G, Brooks BR, Thirumalai D. Effects of denaturants and osmolytes on proteins are accurately predicted by the molecular transfer model. Proceedings of the National Academy of Sciences of the United States of America. 2008;105(36):13403–13408. doi: 10.1073/pnas.0802113105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Liu Z, Reddy G, Thirumalai D. Theory of the Molecular Transfer Model for Proteins with Applications to the Folding of the src-SH3 Domain. J Phys Chem B. doi: 10.1021/jp211941b. [DOI] [PubMed] [Google Scholar]
- 54.Hayryan S, Hu C-K, Skrivánek J, Hayryan E, Pokorný I. A new analytical method for computing solvent-accessible surface area of macromolecules and its gradients. Journal of Computational Chemistry. 2005;26(4):334–343. doi: 10.1002/jcc.20125. [DOI] [PubMed] [Google Scholar]
- 55.Kumar S, Rosenberg JM, Bouzida D, Swendsen RH, Kollman PA. The weighted histogram analysis method for free-energy calculations on biomolecules. I. The method. Journal of Computational Chemistry. 1992;13(8):1011–1021. [Google Scholar]
- 56.Plimpton S. Fast Parallel Algorithms for Short-Range Molecular Dynamics. Journal of Computational Physics. 1995;117:1–19. [Google Scholar]
- 57.Verlet L. Computer “Experiments” on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules. Phys. Rev. 1967;159:98–103. [Google Scholar]
- 58.Veitshans T, Klimov D, Thirumalai D. Protein folding kinetics: timescales, pathways and energy landscapes in terms of sequence-dependent properties. Fold Des. 1997;2(1):1–22. doi: 10.1016/S1359-0278(97)00002-3. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








