Abstract
One of the fundamental events in protein folding is α-helix formation, which involves sequential development of a series of helical hydrogen bonds between the backbone C=O group of residues i and the –NH group of residues i + 4. While we now know a great deal about α-helix folding dynamics, a key question that remains to be answered is where the productive helical nucleation event occurs. Statistically, a helical nucleus (or the first helical hydrogen-bond) can form anywhere within the peptide sequence in question; however, the one that leads to productive folding may only form at a preferred location. This consideration is based on the fact that the α-helical structure is inherently asymmetric, due to the specific alignment of the helical hydrogen bonds. While this hypothesis is plausible, validating it is challenging because there is not an experimental observable that can be used to directly pinpoint the location of the productive nucleation process. Therefore, in this study we combine several techniques, including peptide cross-linking, laser-induced temperature-jump infrared spectroscopy, and molecular dynamics simulations, to tackle this challenge. Taken together, our experimental and simulation results support an α-helix folding mechanism wherein the productive nucleus is formed at the N-terminus, which propagates toward the C-terminal end of the peptide to yield the folded structure. In addition, our results show that incorporation of a cross-linker can lead to formation of differently folded conformations, underscoring the need for all-atom simulations to quantitatively assess the proposed cross-linking design.
Graphical abstract
1. INTRODUCTION
Protein folding involves precise packing and organization of various secondary structural elements. This hierarchical structural assembly at the molecular level thus necessitates studying how and on what time scales protein secondary structures form in order to understand how proteins achieve their tertiary folds. In this regard, much effort has been devoted to elucidating the folding dynamics and mechanism of α-helix,1–46 the most common structural motif found in proteins. The interest in studying the mechanism of α-helix formation is also motivated by the effort to design and develop individually folded, stable α-helices with novel biological functions and/or therapeutic utilities.19,47–54
Within the framework of the one-dimensional Ising model,1,2,10,11,26–29 each residue in an α-helical peptide can be assumed to adopt either a helical (H) or a coil (C) state and, as a result, the microscopic conformational states of the peptide can be distinguished by the number and distribution of the H states within the peptide sequence of interest. By considering the statistical weights of these states, Zimm and Bragg,55 as well as Lifson and Roig,1 were able to calculate the partition function and hence the thermodynamic properties of the system. A key outcome of this theoretical treatment is that the formation of the first turn of the helix, stabilized by a hydrogen-bond (H-bond) formed between the amide units of residue i and i+4 is thermodynamically unfavorable.11,55–59 However, the subsequent propagation steps that elongate this helical nucleus via sequential addition of helical H-bonds act to decrease the free energy of the system.2,55,57,58 In other words, the α-helix folding kinetics can be described by a series of discrete steps where the nucleation event is the slowest and occurs first, followed by a faster elongation process. In this regard, many experimental studies4,5,7,8,12–14,16,19,20,23,25–28,31,34,35,38,41,42,59–61 have focused on the kinetics of α-helix formation, attempting to resolve those key conformational events. While taken together, these studies provided many microscopic details into the folding dynamics of α-helix, we still lack a complete understanding of the folding mechanism of this simple structural motif. One key remaining question is whether the productive nucleation event (i.e., the one leading to α-helix formation) occurs randomly or at a specific location within the peptide sequence.
Even for a homopolypeptide, the corresponding α-helix structure is inherently asymmetric, due to the specific alignment of the helical H-bonds. Therefore, it has been hypothesized,62 and observed in simulations,3,15,21,39 that the dominant α-helix folding flux arises from an N-to-C pathway, namely, the α-helical conformation is produced by elongating an α-helical nucleus formed at the N-terminus. Consequently, the reverse and unfolding process would run in a C-to-N manner. However, experimentally it has been difficult to directly validate this hypothesis due to the fact that there is not a distinct signal that only reports on the nucleation event.37 Herein, we aim to provide new insights into the α-helix folding mechanism by studying the folding kinetics of three cross-linked, alanine-based peptides using laser-induced temper-ature-jump (T-jump) infrared (IR) spectroscopy63 and molecular dynamics (MD) simulations.
Side-chain to side-chain cross-linking is a commonly used strategy to stabilize the α-helical conformation of short peptides by reducing the entropic cost associated with folding.45,47,52 However, traditional cross-linking methods tend to leave flexible helical ends. We recently developed a strategy to constrain helix caps with bis-thioether macrocycles and have achieved high helical contents in short peptides (Figure 1).54,64 Here, we take advantage of this approach and use it to “expose” the productive nucleation site in an alanine-based peptide.52 The name and sequence of the cross-linked peptides studied are given in Table 1. Our working hypothesis is that if the α-helix folding flux proceeds in a preferred direction, the folding rate of these cross-linked peptides would depend on the position of the cross-linker. Indeed, our spectroscopic measurements show that the cross-linker position has a profound effect on the conformational relaxation kinetics of the peptides. This effect is also observed in the MD simulations, which provide an atomistic view of the folding dynamics of those cross-linked peptides. Taken together, our experimental and simulation results support the N-to-C α-helix folding hypothesis. In addition, an unexpected finding arising from our study is that cross-linking may promote the peptide in question to populate differently folded helical conformations, highlighting the need for careful consideration of the peptide sequence and the cross-linker in designing capped or cross-linked α-helices.
Table 1.
peptide | sequence |
---|---|
C-Cap | Ac-AAARAARACAAAaC-NH2 |
N-Cap1 | Ac-CAAAACRAARAAAA-NH2 |
N-Cap2 | Ac-CTPAQCRAARAAAA-NH2 |
2. EXPERIMENTAL SECTION
2.1. Peptide Synthesis.
The details of the peptide synthesis and purification are given in the Supporting Information. Briefly, the linear precursors of cross-linked peptides were synthesized using standard 9-fluorenylmethoxy-carbonyl (Fmoc) chemistry, and peptide cross-linking was achieved via cysteine alkylation reaction following previously published procedures.47,52
2.2. Circular Dichroism (CD) and Fourier Transform Infrared (FTIR) Measurements.
CD spectra were measured on an Aviv 62A DS circular dichroism spectrometer (Aviv Associate, NJ) using a 1 mm quartz cuvette. FTIR spectra were collected on a Magna-IR 860 spectrometer (Nicolet, WI) using a homemade sample holder65 composed of two CaF2 windows and a 50 μm spacer. All peptide samples were prepared in deuterated 10 mM phosphate buffer solution (pH 7), and the final peptide concentrations were 4 mM for static and time-resolved IR experiments and 100 μM for CD measurements.
2.3. Infrared Kinetics Measurements.
The T-jump coupled transient IR setup has been described in detail elsewhere.66 The only difference is that in the current setup the 1.9 μm T-jump pump pulse was generated from a Continuum Surelite OPO, which was pumped by the 1064 nm output of a Continuum Surelite-EX.
2.4. Molecule Dynamics Simulation.
Simulation Setup.
MD simulations of a total of ~668 μs were carried out using GROMACS 4.5.467 on the Folding@home distributed computing platform.68 The AMBER ff99sb-ildn-NMR69 force field was used in combination with the TIP3P explicit solvent model. NVT molecular dynamics was performed at 300 K, starting from 10 conformations taken from implicit-solvent replica-exchange molecular dynamics (REMD) simulations (Table S1 in the Supporting Information).
Markov State Model Construction.
Markov state model (MSM) approaches have been used with great success to model the conformational dynamics of protein folding as a kinetic network of metastable conformational states.70–75 A key advantage of MSMs is the ability to infer complete information about folding thermodynamics and kinetics from ensembles of short, nonequilibirum trajectories. Briefly, constructing an MSM involves first identifying the metastable states, usually through geometry-based conformational clustering. Then, once the states have been defined, the probabilities Tij(τ) of transitioning between metastable states i and j over some lag time τ can be estimated from the number of observed transitions in the trajectory data. The matrix T(τ) (with elements Tij) thus contains full information about the thermodynamics and kinetics. The equilibrium populations π are given by the stationary eigenvector Tπ = π, and the implied time scales τn of each relaxation eigenmode are obtained from the eigenvalues μn of T as τn = –τ/ln(μn). The rate matrix K of continuous-time rates between metastable states is related to the transition matrix by T = exp(Kτ), and the implied time scales τn are related to the eigenvalues of the rate matrix λn through τn = (–λn)–1. The longest implied time scale corresponds to the folding relaxation rate of the system.
The MSMBuilder3 software package76 was used to construct MSMs from the simulation data. The first 100 ns data were excluded to alleviate potential bias from nonequilibrated distribution of initial seeding for simulations (see Figure S1 in the Supporting Information for trajectory distribution). Time-structure-based independent component analysis77,78 (tICA) was used to find an optimal low-dimensional subspace onto which the high-dimensional trajectory coordinate data could be projected for the purpose of conformational clustering. As inputs to tICA, we used 378 pairwise distances between Cα and Cβ atoms as input for tICA projection. The outputs of tICA are the set of time-lagged independent components (tICs) that best capture a low-rank approximation of the time-lagged correlation matrix. Thus, the tICs represent the degrees of freedom along which the longest-time scale conformational dynamics occurs. The MSM transition matrix can similarly be thought of as a time correlation matrix for orthogonal indicator basis functions;79 and therefore, provided that the MSM decomposition is sufficiently fine-grained, the slowest MSM relaxation eigenmodes should correspond to the same slowest motions identified by tICA.
The optimal number of metastable states for MSM construction was determined using the generalized matrix Rayleigh quotient (GMRQ) variational cross-validation method79 (Figure S2 in the Supporting Information). We used a tICA lag time of 5 ns and performed k-center clustering over four tICs to identify 500 metastable states for each crosslinked peptide. The maximum-likelihood estimator (MLE) in MSMBuilder (default setting) was used to estimate the MSM transition matrix from sliding-window counts. An MSM lag time of 20 ns was chosen for construction of the MSM model. Plots of implied time scales versus MSM lag time showed convergence near 20 ns, validating that dynamics was sufficiently Markovian.
3. RESULTS AND DISCUSSION
3.1. Stability Characterization.
The α-helix stabilizing role of the bis-benzylic cross-linker has been demonstrated previously by Wu et al.54 Consistent with this notion, the CD spectra of both C-Cap and N-Cap1 peptides at 5 °C are characteristic of an α-helical conformation (Figure 2A). Interestingly, however, the CD spectrum of the N-Cap2 peptide shows a distinct difference, suggesting that this peptide can also sample other conformation(s) besides the expected α-helical and disordered structural ensembles (see below for further discussion). As expected (Figure 2B), the CD thermal-melting curves (T-melts) of these peptides, measured at 222 nm, indicate that the position of the cross-linker and peptide sequence has a profound effect on the thermodynamic stability of the α-helical structure. In particular, the results obtained for C-Cap and N-Cap1 peptides, which only differ in the cross-linker position, reveal that C-terminal cross-linking is more effective in stabilizing the α-helical structure than N-terminal cross-linking. To further assess this difference, we estimated the folding–unfolding thermodynamics of these two peptides by globally fitting their CD T-melts to a two-state model, which assumes that the folded CD baselines of these peptides, or the mean residue ellipticities of their folded states are identical. As shown (Figure S4 and Table S2 in the Supporting Information), this analysis yielded a thermal-melting temperature (Tm) of 8.3 °C for C-Cap, which agrees with that determined by Wu et al.,54 and a Tm of –6.4 °C for N-Cap1. Because the thermal unfolding transition of N-Cap2 involves more than two conformational assembles, we did not attempt to fit its CD T-melt. In addition, it is worth noting that these alanine-based peptides contain a RAAR segment in the middle of the peptide sequence, which destabilizes the α-helical conformation due to charge repulsion. Therefore, unlike other α-helical peptides studied previously,4 which contain Lys residues that are four residues apart and hence lack such an effect, these short peptides do not show any appreciable folding without the cross-linker.54
3.2. Conformational Relaxation Kinetics.
As shown (Figure S5 in the Supporting Information), the amide Iʹ band of C-Cap loses (gains) intensity at ~1630 cm–1 (~1665 cm–1) with increasing temperature, which also manifests, as observed with other α-helical peptides,6 in the thermal unfolding of the underlying α-helical structure. Therefore, to assess the kinetics of this conformational transition, we measured the transient IR absorption kinetics of the peptide in question in response to a nanosecond T-jump pulse. As shown (Figure 3), the T-jump induced conformational relaxation kinetic traces of both C-Cap and N-Cap1 peptides, measured at 1640 cm–1, can be satisfactorily described by a single-exponential decay function, whereas that of N-Cap2 requires a biexponential function to fit. Interestingly, as indicated (Figure 4), the relaxation rate constants of C-Cap and N-Cap1 peptides show quite different dependence on the final temperature. Moreover, around room temperature the T-jump induced conformational relaxation of N-Cap1 is significantly faster than that of C-Cap. For example, the relaxation rate constant of N-Cap1 is ~(162 ns)–1 at 19.8 °C, compared to ~(472)–1 for C-Cap at 20.3 °C.
The conformational relaxation rate constant (kR) of a two-state system is the sum of the respective folding (kf) and unfolding (ku) rate constants. Therefore, to determine kf and ku from kR measured at a specific temperature, one would also need to know the corresponding folding equilibrium constant (Keq), which equals the ratio of kf and ku. For both C-Cap and N-Cap1 peptides, we were able to further decompose the experimentally determined kR into kf and ku using the respective folding-unfolding thermodynamics estimated from their CD T-melts (Table S2 in the Supporting Information). As shown (Figure 5), the folding and unfolding rate constants of these two peptides not only differ in values but also exhibit different dependence on temperature. More specifically, those results indicate, in the temperature range of the experiment, that (1) the unfolding rate constant of C-Cap is not only slower but also more sensitively dependent on temperature than that of N-Cap1, suggesting that the unfolding process of the C-Cap α-helix involves a larger enthalpic barrier; (2) the folding rate constant of N-Cap1 is essentially temperature independent, suggesting that its folding process does not encounter a significant free energy barrier; and (3) around room temperature the α-helix folding rate of N-Cap1 is much faster than that of C-Cap. Taken together, these differences indicate that the role of an N-terminal cross-linker is to increase the α-helix folding rate, whereas that of a C-terminal cross-linker is to decrease its unfolding rate or to prevent it from unfolding. Moreover, it is worthy of mentioning that the role of the D-alanine residue in C-Cap is to provide further stabilization of the C-terminal end of the α-helix and that a previous study80 has shown that an individual C-terminal d-alanine only marginally affects the folding rate of helical peptides. Considering the fact that the C-Cap α-helix is more stable than the N-Cap1 α-helix, these kinetic results also offer practical insight into the design of stable α-helices, namely, that decreasing the unfolding rate is a more effective strategy than increasing the folding rate to increase the α-helix stability.
The kinetic results obtained with these two cross-linked peptides are also useful in understanding how linear α-helices fold. A previous study by Serrano et al. showed that the T-jump induced conformation relaxation rate of an alanine-based cyclic peptide (i.e., cyc-RKAAAD where the Lys and Asp side chains are connected via a cyclization reaction), which folds into a single-turn α-helix, is about (600 ns)–1 at room temperature. Interestingly, this relaxation rate is similar to that of C-Cap but is significantly different from that of N-Cap1 (Figure 4). In fact, the conformational relaxation kinetics of C- Cap are quite similar to those of cyc-RKAAAD in the entire temperature range of the experiment (Figure S6 in the Supporting Information). Therefore, given the structural similarity between cyc-RKAAAD and the cross-linked region in C-Cap and N-Cap1 peptides, these results suggest that the T-jump induced conformational relaxation of C-Cap involves folding and unfolding of the capped region and hence is slower, whereas that of N-Cap1 does not and is therefore faster. In other words, the conformational transition of C-Cap involves a helical and a nonhelical state and, as shown above, its unfolding process encounters a larger enthalpic barrier. However, the conformational transition of N-Cap1 takes places between a helical and a partial helical state where the capped region is still folded and acts as a pre-existing helical nucleus. As such, the folding process of N-Cap1 is expected to be dominated by helical propagation events, leading to a faster and temperature-insensitive folding rate, as observed (Figure 5). Together, these findings support an α-helix unfolding mechanism wherein unfolding begins from the C-terminus, which in turn suggests that the most productive folding pathway begins with formation of an α-helical nucleus at the N-terminus. However, given the intrinsic complexity of the α-helix folding dynamics, there is always a possibility that the presence of a cross-linker will lead to a fundamental change in the underlying folding mechanism, hence making our interpretation incorrect.
In comparison to that of N-Cap1, the N-terminal TPAQ motif of N-Cap2 is designed to further promote helix initiation and also provide additional helix capping interactions.54 Therefore, it is expected that the α-helical conformation of N-Cap2 is more stable than that of N-Cap1 (Figure 2B). However, given the similarity between these two peptides, it is unexpected that the T-jump induced conformation relaxation process of N-Cap2 occurs in a biexponential manner (Figure 3B). Theoretically, such kinetic behavior indicates that the folding process of N-Cap2 could proceed via one of the following mechanisms: (1) a single folding pathway involving an on-pathway intermediate; (2) a single folding pathway involving an off-pathway intermediate; (3) two connected folding pathways involving one folded state and two unfolded states; (4) two separate folding pathways involving two folded states and two unfolded states; (5) two connected folding pathways involving two folded states and one unfolded state. While distinguishing among these possibilities is difficult, the unique CD spectrum of N-Cap2 (Figure 2A) prompts us to suggest that the last mechanism is the most probable. In other words, N-Cap2 can fold into two different conformations, with one being an α-helix that is similar to that formed by N-Cap1. In support of this notion, as indicated (Figure 4), the rate constant of the fast component of the conformation relaxation kinetics of N-Cap2 is identical to that of N-Cap1 within experimental uncertainty. In addition, the fact that the slow relaxation component of N-Cap2 has a smaller rate constant than that of C-Cap (Figure 4) suggests that this kinetic phase is unlikely to arise from a conformation relaxation process involving unfolding of the capped region.
To better elucidate these mechanistic hypotheses, we next performed MD simulations to gain microscopic insight into the conformational relaxation kinetics of C-Cap, N-Cap1, and N- Cap2.
3.3. MD Simulations.
Comparison with experimental folding relaxation rates provides a stringent test of the ability of all-atom simulations to accurately predict folding kinetics. Using the Folding@home distributed computing platform, we obtained more than 200 μs of aggregate trajectory data for each of the three cross-linked peptides and analyzed the results by constructing MSMs of the folding kinetics, each comprising 500 metastable conformational states (see Experimental Section).
Predicted Relaxation Time Scales from MSMs Agree Well with Experiment.
An MSM is described by a matrix of transition rates Tij(τ) corresponding to the probability of transitioning from state i to state j in some lag time τ. Since the accuracy of this coarse-grained description of kinetics depends on whether the lag time τ is long enough to approximate the dynamics as Markovian, it is typical to build a series of MSMs using a range of lag times to find the best choice. The implied time scales for each cross-linked peptide, plotted as a function of the MSM lag time, plateau after the lag time becomes sufficiently long, indicating an appropriate choice of lag time to be 20 ns for all three MSM models (Figure 6).
The slowest implied time scale of the MSM corresponds directly to the experimentally observed relaxation rate kR = kf + ku. A comparison of experimentally measured relaxation times (i.e., τR = kR–1) and the slowest implied time scales predicted by the MSMs show remarkably good agreement (Figure 7). Absolute comparison of measured and predicted relaxation times at 300 K are, on average, within 0.6 orders of magnitude (i.e., rms deviations of log10(τR). The absolute agreement improves to within 0.4 orders of magnitude if we compare the simulated relaxation times to experimental times measured at ~293 K. The larger uncertainty estimates for the slow kinetic phase of N-Cap2 are due to finite sampling error, as simulated transition times begin to approach the average trajectory length. Overall, the level of agreement between simulated and experimental folding times is comparable to the current state- of-the-art method33 with the MSMs being able to correctly predict the rank order of folding times.
MSMs Reveal Non-native Interactions That Contribute to Misfolded States.
Remarkably, the MSMs were able to correctly predict the existence of slow and fast kinetic phases for N-Cap2 and single kinetic phases for C-Cap and N-Cap1. A hallmark of two-state folding is a gap between the slowest implied time scale and the next-slowest time scale, such that time-resolved experiments may only be able to detect single-exponential relaxation kinetics. Whereas C-Cap and N-Cap1 exhibit such a time scale gap, N-Cap2 does not (Figure 6), instead showing two slow relaxation time scales in the range detectable by the T-jump experiment.
To better understand the microscopic interactions that might contribute to the folding mechanism, we examined the structural features of long-lived kinetic intermediates contributing to population flux in the MSM. A powerful feature of the tICA approach is the ability to project high-dimensional simulation trajectory data to a low-dimensional subspace representing the slowest motions. Provided that the MSM is sufficiently fine-grained (in our case, 500 states), the most significant tICs should correspond to the slowest relaxation eigenmodes of the MSM, a fact that has been exploited in much previous work.70–75 Projections onto the two largest tICA components show the broad contours of the folding landscape and reveal long-lived kinetic intermediates.
For the C-Cap peptide, partially folded metastable states are separated from the native state along largest tICA component (tIC1), indicating that relaxation from these non-native states to the native folded structure correspond to the slowest folding relaxation (Figure 8). These non-native states are characterized by non-native backbone hydrogen-bonding interaction between Ala3/Ala4/Ala7 and Cys15/Ala14/Ala13 (upper left panel in Figure 8). The next-slowest motion, as seen from projections of the dynamics along tIC2, is coupled to relaxations from another partially folded state stabilized by a π-cation interaction between the Arg8/Arg5 side chain and the aromatic ring on the capping group. Similar non-native interactions were observed in misfolded peptides simulated in previous work.68 As evidenced by the smaller kinetic barriers along tIC2, the MSM predicts a much faster relaxation time scale likely to be undiscernible by a T-jump experiment.
For the N-Cap1 peptide, the two slowest folding relaxations are similar in time scale and are both predicted to be faster than the slowest relaxation of the C-Cap peptide, consistent with the experimental results. Compared to the C-Cap peptide, the folding landscape is predicted to be more “downhill”, with the tICA projection showing multiple partially folded states contributing to the relaxation kinetics (Figure 9). The states most separated from the native helix structure on the tICA landscape include partially folded structures with either a folded N-terminus or a folded C-terminus. Each of these states has a low kinetic barrier for forming the native helical state, while remaining kinetically separated from each other. Both the fast relaxation kinetics and the shape of the tICA landscape are highly consistent with the hypothesis that the N-Cap1 cross-linker is able to lower the nucleation barrier to folding.
For the N-Cap2 peptide, the tICA landscape yields structural insight into the fast and slow kinetic relaxations observed in experiment. As hypothesized from the experimental kinetics and the unique CD spectrum, the slowest relaxation process (along tIC1) appears to arise from the interconversion between two competing folded states: one state is the regularly folded α-helix, while the other is a misfolded state with a hairpin structure stabilized by the cross-linker (Figure 10). Unlike N-Cap1, which has polyalanine between the N-terminal cysteines, N-Cap2 contains the sequence TPAQ which is designed to serve as an effective N-terminal cap of the α-helix. The formation of this hairpin structure, which we do not observe for the other two peptides, may be encouraged by the presence of both threonine, an amino acid with β-sheet propensity, and proline, which can promote tight turns. The next-slowest relaxation (along tIC2) is predicted to result from the transitions of unfolded structures to these two folded states. The time scale for this process is faster than, and more similar to, the folding relaxation seen for the C-Cap peptide, consistent with similar barrier-separated two-state folding to each competing folded state.
4. CONCLUSION
While it has long been recognized that a-helices fold through a nucleation–propagation mechanism, it remains unclear whether the (productive) nucleation event occurs randomly or at a specific location within the peptide sequence. To provide insight into this question, herein we studied the folding kinetics and mechanism of three cross-linked, alanine-based peptides (i.e., C-Cap, N-Cap1, and N-Cap2) using T-jump IR spectroscopy and MD simulations. Our T-jump IR measurements revealed that N-Cap1 folds and unfolds faster than C- Cap. Since these two peptides differ only in their cross-linker position and both are more stable than their uncross-linked counterpart, these results demonstrate that a bis-benzylic cross-linker can promote α-helix formation by either increasing the folding flux from the N- to C-terminus or deceasing the unfolding flux from the C- to N-terminus. Our MD simulations not only validated this picture but also provided an atomistic description of the folding landscape of these peptides. Taken together, these findings support a folding–unfolding mechanism of linear peptides wherein the folding process begins by forming a productive helical nucleus at the N-terminus, whereas the reverse process or unwinding of the helical structure begins from the C-terminal end of the α-helix.
Interestingly, the T-jump induced relaxation kinetics of N- Cap2, whose sequence is only different from that of N-Cap1 in the N-terminal region, are more complex than those of N-Capl and C-Cap. Our MSM analysis indicated that this is because this peptide can populate two folded conformations, with one being a regular α-helix and the other being a hairpin structure. It is somewhat surprising that N-Cap2 may not fold uniquely to the intended α-helical state, as expected from the rational design of this peptide. Therefore, our results show the nontrivial challenge of designing cross-linked or stapled α-helices, and the need for all-atom simulations to quantitatively assess proposed designs. Another lesson underscored by this work is the synergistic nature of combing simulation methods with experiments to achieve an atomic-level understanding of folding. MSM-based approaches are a powerful tool for modeling long-time scale folding behavior and providing a mechanistically interpretable picture of these processes. The work presented here is further evidence of the level of quantitative predictive power such models can achieve: not only do the MSM models well predict absolute folding relaxation times, but they also correctly predict the rank order of folding times of the three different designed peptides, as well as the existence of slow and fast kinetic phases for N-Cap2. This quantitative accuracy, combined with the ability to rationalize the mechanisms, suggests a much larger role for all-atom simulation and MSM methods for peptide design, and the need for developing more robust and efficient MSM-based algorithms for this purpose.
Supplementary Material
ACKNOWLEDGMENTS
This research was supported in part by funding from the National Institutes of Health (P41GM104605 to F.G., R01GM123296 and 1S100D020095 to V.A.V.) and by XSEDE allocation TG-MCB140270 (to V.A.V.). Research is supported by R35GM122603 to W.F.D. H.W. was supported by a CTSI TL1 Postdoctoral Fellowship (TL1TR001871). This research includes calculations carried out on Temple University’s HPC resources and thus was supported in part by the National Science Foundation through major research instrumentation grant number 1625061 and by the US Army Research Laboratory under contract number W911NF-16-2-0189. We thank the participants of Folding@home, without whom this work would not be possible. We also thank the Protein Folding Consortium supported by the National Science Foundation (award number 1516959) for providing us the platform to establish the collaborations.
Footnotes
ASSOCIATED CONTENT
Supporting Information
This material is available free of charge at The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acs.jpcb.8b12220.
Details about peptide synthesis, MD simulation set up, and global fitting of CD T-melts; tables of number of particles, perodic box sizes, simulation times, and folding thermodynamic parameters; figures of trajectory length distributions, generalized matrix Rayleigh quotient, global fitting results, FTIR difference spectrum, temperature dependence of the T-jump, HPLC traces (PDF)
Notes
The authors declare no competing financial interest.
REFERENCES
- (1).Lifson S; Roig A On the theory of helix-coil transition in polypeptides. J. Chem. Phys. 1961, 34 (6), 1963–1974. [Google Scholar]
- (2).Schwarz G On the kinetics of the helix-coil transition of polypeptides in solution. J. Mol. Biol. 1965, 11 (1), 64–77. [DOI] [PubMed] [Google Scholar]
- (3).Voegler Smith A.; Hall CK α-helix formation: discontinuous molecular dynamics on an intermediate-resolution protein model. Proteins: Struct., Funct., Genet. 2001, 44 (3), 344–360. [DOI] [PubMed] [Google Scholar]
- (4).Huang CY; Klemke JW; Getahun Z; DeGrado WF; Gai F Temperature-dependent helix-coil transition of an alanine based peptide. J. Am. Chem. Soc. 2001, 123 (38), 9235–9238. [DOI] [PubMed] [Google Scholar]
- (5).Kimura T; Takahashi S; Akiyama S; Uzawa T; Ishimori K; Morishima I Direct observation of the multistep helix formation of poly-L-glutamic acids. J. Am. Chem. Soc. 2002, 124 (39), 11596–11597. [DOI] [PubMed] [Google Scholar]
- (6).Levy Y; Jortner J; Becker OM Solvent Effects on the energy landscapes and folding kinetics of polyalanine. Proc. Natl. Acad. Sci. U. S. A. 2001, 98 (5), 2188–2193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Chen E; Kumita JR; Woolley GA; Kliger DS The kinetics of helix unfolding of an azobenzene cross-linked peptide probed by nanosecond time-resolved optical rotatory dispersion. J. Am. Chem. Soc. 2003, 125 (5), 12443–12449. [DOI] [PubMed] [Google Scholar]
- (8).Wang T; Du D; Gai F Helix-coil kinetics of two 14-residue peptides. Chem. Phys. Lett. 2003, 370 (5–6), 842–848. [Google Scholar]
- (9).Krieger F; Fierz B; Bieri O; Drewello M; Kiefhaber T Dynamics of unfolded polypeptide chains as model for the earliest steps in protein folding. J. Mol. Biol. 2003, 332 (1), 265–274. [DOI] [PubMed] [Google Scholar]
- (10).Roder H Stepwise helix formation and chain compaction during protein folding. Proc. Natl. Acad. Sci. U. S. A. 2004, 101 (7), 1793–1794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Qian H; Schellman JA Helix-coil theories: a comparative study for finite length polypeptides. J. Phys. Chem. 1992, 96 (10), 3987–3994. [Google Scholar]
- (12).Wang T; Zhu Y; Getahun Z; Du D; Huang CY; DeGrado WF; Gai F Length dependent helix-coil transition kinetics of nine alanine-based peptides. J. Phys. Chem. B 2004, 108 (39), 15301–15310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (13).Thompson PA; Eaton WA; Hofrichter J Laser temperature jump study of the helix-coil kinetics of an alanine peptide interpreted with a “kinetic zipper” model. Biochemistry 1997, 36 (30), 9200–9210. [DOI] [PubMed] [Google Scholar]
- (14).Meisner WK; Sosnick TR Fast folding of a helical protein initiated by the collision of unstructured chains. Proc. Natl. Acad. Sci. U. S. A. 2004, 101 (37), 13478–13482. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (15).Monticelli L; Tieleman DP; Colombo G Mechanism of helix nucleation and propagation: microscopic view from microsecond time scale MD simulations. J. Phys. Chem. B 2005, 109 (43), 20064–20067. [DOI] [PubMed] [Google Scholar]
- (16).Pozo Ramajo A.; Petty SA; Starzyk A; Decatur SM; Volk M The α-helix folds more rapidly at the C-terminus than at the N- terminus. J. Am. Chem. Soc. 2005, 127 (40), 13784–13785. [DOI] [PubMed] [Google Scholar]
- (17).Buchete NV; Straub JE Mean first-passage time calculations for the coil-to-helix transition: the active helix Ising model. J. Phys. Chem. B 2001, 105 (28), 6684–6697. [Google Scholar]
- (18).Ghosh T; Garde S; García AE Role of backbone hydration and salt-bridge formation in stability of α-helix in solution. Biophys. J. 2003, 85 (5), 3187–3193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Bredenbeck J; Helbing J; Kumita JR; Woolley GA; Hamm P α-helix formation in a photoswitchable peptide tracked from picoseconds to microseconds by time-resolved IR spectroscopy. Proc. Natl. Acad. Sci. U. S. A. 2005, 102 (7), 2379–2384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Du D; Gai F Understanding the folding mechanism of an α-helical hairpin. Biochemistry 2006, 45 (44), 13131–13139. [DOI] [PubMed] [Google Scholar]
- (21).Miick SM; Casteel KM; Millhauser GL Experimental molecular dynamics of an alanine-based helical peptide determined by spin label electron spin resonance. Biochemistry 1993, 32 (31), 8014–8021. [DOI] [PubMed] [Google Scholar]
- (22).Du D; Bunagan MR; Gai F The effect of charge-charge interactions on the kinetics of α-helix formation. Biophys. J. 2007, 93 (11), 4076–4082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (23).Balakrishnan G; Hu Y; Bender GM; Getahun Z; DeGrado WF; Spiro TG Enthalpic and entropic stages in α-helical peptide unfolding, from laser T-jump/UV raman spectroscopy. J.Am. Chem. Soc. 2007, 129 (42), 12801–12808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (24).Bertsch RA; Vaidehi N; Chan SI; Goddard WA Kinetic steps for α-helix formation. Proteins: Struct, Funct., Genet. 1998, 33 (3), 343–357. [DOI] [PubMed] [Google Scholar]
- (25).Ye M; Zhang Q-L; Li H; Weng Y-X; Wang W-C; Qiu X-G Infrared spectroscopic discrimination between the loop and α-helices and determination of the loop diffusion kinetics by temperature-jump time-resolved infrared spectroscopy for cyto-chrome C. Biophys. J. 2007, 93, 2756–2766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (26).Chen Y; Zhou Y; Ding J The helix–coil transition revisited. Proteins: Struct., Funct., Genet. 2007, 69, 58–68. [DOI] [PubMed] [Google Scholar]
- (27).Ihalainen JA; Paoli B; Muff S; Backus EHG; Bredenbeck J; Woolley GA; Caflisch A; Hamm P α-helix folding in the presence of structural constraints. Proc. Natl. Acad. Sci. U. S. A. 2008, 105 (28), 9588–9593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).Mohammed OF; Jas GS; Lin MM; Zewail AH Primary peptide folding dynamics observed with ultrafast temperature jump. Angew. Chem, Int. Ed. 2009, 48 (31), 5628–5632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (29).Zwanzig R Simple model of protein folding kinetics. Proc. Natl. Acad. Sci. U. S. A. 1995, 92 (21), 9801–9804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (30).Serrano AL; Tucker MJ; Gai F Direct assessment of the α-helix nucleation Time. J. Phys. Chem. B 2011, 115 (22), 7472–7478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (31).De Sancho D; Best RB What is the time scale for α-helix nucleation? J. Am. Chem. Soc. 2011, 133 (17), 6809–6816. [DOI] [PubMed] [Google Scholar]
- (32).Leader DP; Milner-White EJ The structure of the ends of α-helices in globular proteins: effect of additional hydrogen bonds and implications for helix formation. Proteins: Struct., Funct., Genet. 2011, 79 (3), 1010–1019. [DOI] [PubMed] [Google Scholar]
- (33).Piana S; Lindorff-Larsen K; Shaw DE Protein folding kinetics and thermodynamics from atomistic simulations. Proc. Natl. Acad. Sci. U. S. A. 2012, 109 (44), 17845–17850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (34).Tucker MJ; Abdo M; Courter JR; Chen J; Brown SP; Smith AB; Hochstrasser RM Nonequilibrium dynamics of helix reorganization observed by transient 2D IR spectroscopy. Proc. Natl. Acad. Sci. U. S. A. 2013, 110 (43), 17314–17319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (35).Lednev IK; Karnoup AS; Sparrow MC; Asher SA α-helix peptide folding and unfolding activation barriers: a nanosecond UV resonance raman study. J. Am. Chem. Soc. 1999, 121 (35), 8074–8086. [Google Scholar]
- (36).Sung SS Folding simulations of alanine-based peptides with lysine residues. Biophys. J. 1995, 68 (3), 826–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (37).Lin CW; Gai F Microscopic nucleation and propagation rates of an alanine-based α-helix. Phys. Chem. Chem. Phys. 2017, 19 (7), 5028–5036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (38).Jas GS; Kuczera K Helix-coil transition courses through multiple pathways and intermediates: fast kinetic measurements and dimensionality reduction. J. Phys. Chem. B 2018, 122 (48), 10806–10816. [DOI] [PubMed] [Google Scholar]
- (39).Young WS; Brooks CL A Microscopic view of helix propagation: N and C-terminal helix growth in alanine helices. J. Mol. Biol. 1996, 259 (3), 560–572. [DOI] [PubMed] [Google Scholar]
- (40).Williams S; Causgrove TP; Gilmanshin R; Fang KS; Callender RH; Woodruff WH; Dyer RB Fast events in protein folding: helix melting and formation in a small peptide. Biochemistry 1996, 35 (3), 691–697. [DOI] [PubMed] [Google Scholar]
- (41).Clarke DT; Doig AJ; Stapley BJ; Jones GR The alpha-helix folds on the millisecond time scale. Proc. Natl. Acad. Sci. U. S. A. 1999, 96 (13), 7232–7237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (42).Thompson PA; Munoz V; Jas GS; Henry ER; Eaton WA; Hofrichter J The helix-coil kinetics of a heteropeptide. J. Phys. Chem. B 2000, 104 (2), 378–389. [Google Scholar]
- (43).Ferrara P; Apostolakis J; Caflisch A Thermodynamics and kinetics of folding of two model peptides investigated by molecular dynamics simulations. J. Phys. Chem. B 2000, 104 (20), 5000–5010. [Google Scholar]
- (44).Engel J; Schwarz G Cooperative conformational transitions of linear biopolymers. Angew. Chem., Int. Ed. Engl. 1970, 9 (6), 389–400. [DOI] [PubMed] [Google Scholar]
- (45).Hill RB; Raleigh DP; Lombardi A; DeGrado WF De Novo design of helical bundles as models for understanding protein folding and function. Acc. Chem. Res. 2000, 33 (11), 745–754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (46).Dyer RB; Gai F; Woodruff WH; Gilmanshin R; Callender RH Infrared studies of fast events in protein folding. Acc. Chem. Res. 1998, 31 (11), 709–716. [Google Scholar]
- (47).Jackson DY; King DS; Chmielewski J; Singh S; Schultz PG General approach to the synthesis of short α-helical peptides. J. Am. Chem. Soc. 1991, 113 (24), 9391–9392. [Google Scholar]
- (48).Kutchukian PS; Yang JS; Verdine GL; Shakhnovich EI All-atom model for stabilization of α-helical structure in peptides by hydrocarbon staples. J. Am. Chem. Soc. 2009, 131 (13), 4622–4627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (49).Phillips C; Roberts LR; Schade M; Bazin R; Bent A; Davies NL; Moore R; Pannifer AD; Pickford AR; Prior SH; et al. Design and structure of stapled peptides binding to estrogen receptors. J. Am. Chem. Soc. 2011, 133 (25), 9696–9699. [DOI] [PubMed] [Google Scholar]
- (50).Stewart ML; Fire E; Keating Amy E.; Walensky LD The MCL-1 BH3 helix is an exclusive MCL-1 inhibitor and apoptosis sensitizer. Nat. Chem. Biol. 2010, 6 (8), 595–601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (51).Bernal F; Wade M; Godes M; Davis TN; Whitehead DG; Kung AL; Wahl GM; Walensky LD A stapled P53 helix overcomes HDMX-mediated suppression of P53. Cancer Cell 2010, 18, 411–422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (52).Jo H; Meinhardt N; Wu Y; Kulkarni S; Hu X; Low KE; Davies PL; DeGrado WF; Greenbaum DC Development of α- helical calpain probes by mimicking a natural protein-protein interaction. J. Am. Chem. Soc. 2012, 134 (42), 17704–17713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (53).Yakimov AP; Afanaseva AS; Khodorkovskiy MA; Petukhov MG Design of stable α-helical peptides and thermostable proteins in biotechnology and biomedicine. Acta Naturae 2016, 8 (4), 70–81. [PMC free article] [PubMed] [Google Scholar]
- (54).Wu H; Acharyya A; Wu Y; Liu L; Jo H; Gai F; DeGrado WF Design of a short thermally stable α-helix embedded in a macrocycle. ChemBioChem 2018, 19 (9), 902–906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (55).Zimm BH; Bragg JK Theory of the phase transition between helix and random coil in polypeptide chains. J. Chem. Phys. 1959, 31 (2), 526–535. [Google Scholar]
- (56).Luo P; Baldwin RL Interaction between water and polar groups of the helix backbone: an important determinant of helix propensities. Proc. Natl. Acad. Sci. U. S. A. 1999, 96 (9), 4930–4935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (57).Muñoz V; Serrano L Development of the multiple sequence approximation within the AGADIR model of α-helix formation: comparison with Zimm–Bragg and Lifson–Roig formalisms. Biopolymers 1997, 41, 495–509. [DOI] [PubMed] [Google Scholar]
- (58).Sorin EJ; Pande VS Exploring the helix-coil transition via all-atom equilibrium ensemble simulations. Biophys. J. 2005, 88 (4), 2472–2493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (59).Miller SE; Watkins AM; Kallenbach NR; Arora PS Effects of side chains in helix nucleation differ from helix propagation. Proc. Natl. Acad. Sci. U. S. A. 2014, 111 (18), 6636–6641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (60).Eaton WA; Munoz V; Thompson PA; Henry ER; Hofrichter J Kinetics and Dynamics of loops, α-helices, β-hairpins, and fast-folding proteins. Acc. Chem. Res. 1998, 31 (11), 745–753. [Google Scholar]
- (61).Huang CY; Balakrishnan G; Spiro TG Early events in apomyoglobin unfolding probed by laser T-Jump/UV resonance raman spectroscopy. Biochemistry 2005, 44 (48), 15734–15742. [DOI] [PubMed] [Google Scholar]
- (62).Alexandrov N Structural arguments for N-terminal initiation of protein folding. Protein Sci. 1993, 2, 1989–1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (63).Serrano AL; Waegele MM; Gai F Spectroscopic studies of protein folding: linear and nonlinear methods. Protein Sci. 2012, 21 (2), 157–170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (64).Findeisen F; Campiglio M; Jo H; Abderemane-Ali F; Rumpf CH; Pope L; Rossen ND; Flucher BE; DeGrado WF; Minor DL Stapled voltage-gated calcium channel (CaV) α-interaction domain (AID) peptides act as selective protein-protein interaction inhibitors of CaVfunction. ACS Chem. Neurosci. 2017, 8 (6), 1313–1326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (65).Ding B; Hilaire MR; Gai F Infrared and fluorescence assessment of protein dynamics: from folding to function. J. Phys. Chem. B 2016, 120 (23), 5103–5113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (66).Huang C-Y; Getahun Z; Zhu Y; Klemke JW; DeGrado WF; Gai F Helix formation via conformation diffusion search. Proc. Natl. Acad. Sci. U. S. A. 2002, 99, 2788–2793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (67).Pronk S; Apostolov R; Shirts MR; Smith JC; Kasson PM; van der Spoel D; Hess B; Lindahl E GROMACS 4.5: A high- throughput and highly parallel open source molecular simulation toolkit. Bioinformatics 2013, 29 (7), 845–854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (68).Shirts M COMPUTING: Screen savers of the world unite! Science (Washington, DC, U. S.) 2000, 290, 1903–1904. [DOI] [PubMed] [Google Scholar]
- (69).Li D-W; Bruschweiler R NMR-based protein potentials. Angew. Chem., Int. Ed. 2010, 49, 6778–6780. [DOI] [PubMed] [Google Scholar]
- (70).Ge Y; Borne E; Stewart S; Hansen MR; Arturo EC; Jaffe EK; Voelz VA Simulations of the regulatory ACT domain of human phenylalanine hydroxylase (PAH) unveil its mechanism of phenylalanine binding. J. Biol. Chem. 2018, 293, 19532–19543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (71).Razavi AM; Voelz VA Kinetic network models of tryptophan mutations in fi - hairpins reveal the importance of nonnative interactions. J. Chem. Theory Comput. 2015, 11, 2801–2812. [DOI] [PubMed] [Google Scholar]
- (72).Razavi AM; Wuest WM; Voelz VA Computational screening and selection of cyclic peptide hairpin mimetics by molecular simulation and kinetic network models. J. Chem. Inf. Model. 2014, 54, 1425–1432. [DOI] [PubMed] [Google Scholar]
- (73).Ge Y; Kier BL; Andersen NH; Voelz VA Computational and experimental evaluation of designed β- cap hairpins using molecular simulations and kinetic network models. J. Chem. Inf. Model. 2017, 57, 1609–1620. [DOI] [PubMed] [Google Scholar]
- (74).Zhou G; Voelz VA Using kinetic network models to probe non-native salt-bridge effects on α-helix folding. J. Phys. Chem. B 2016, 120, 926–935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (75).Zhou G; Pantelopulos GA; Mukherjee S; Voelz VA Bridging microscopic and macroscopic mechanisms of P53-MDM2 binding with kinetic network models. Biophys. J. 2017, 113 (4), 785–793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (76).Husic BE; Eastman P; Harrigan MP; Sultan MM; Herna CX MSMBuilder: Statistical models for biomolecular dynamics. Biophys. J. 2017, 112, 10–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (77).Pérez-hernandez G; Paul F; Giorgino T; Fabritiis G De; Noe, F. Identification of slow molecular order parameters for Markov model construction. J. Chem. Phys. 2013, 139, 015102. [DOI] [PubMed] [Google Scholar]
- (78).Schwantes CR; Pande VS Improvements in Markov state model construction reveal many non-native interactions in the folding of NTL9. J. Chem. Theory Comput. 2013, 9, 2000–2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (79).McGibbon RT; Pande VS Variational cross-validation of slow dynamical modes in molecular kinetics. J. Chem. Phys. 2015, 142, 124105–124113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (80).Culik RM; Annavarapu S; Nanda V; Gai F Using D -amino acids to delineate the mechanism of protein folding: application to trp-cage. Chem. Phys. 2013, 422, 131–134. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.