Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2000 Mar 14;97(6):2544–2549. doi: 10.1073/pnas.97.6.2544

Mechanisms and kinetics of β-hairpin formation

D K Klimov 1,*, D Thirumalai 1
PMCID: PMC15965  PMID: 10716988

Abstract

Thermodynamics and kinetics of off-lattice models with side chains for the β-hairpin fragment of immunoglobulin-binding protein and its variants are reported. For all properties (except refolding time τF) there are no qualitative differences between the full model and the Gō version. The validity of the models is established by comparison of the calculated native structure with the Protein Data Bank coordinates and by reproducing the experimental results for the degree of cooperativity and τF. For the full model τF ≈ 2 μs at the folding temperature (experimental value is 6 μs); the Gō model folds 50 times faster. Upon refolding, structural changes take place over three time scales. On the collapse time scale compact structures with intact hydrophobic cluster form. Subsequently, hydrogen bonds form, predominantly originating from the turn by a kinetic zipping mechanism. The assembly of the hairpin is complete when most of the interstrand contacts (the rate-limiting step) is formed. The dominant transition state structure (located by using cluster analysis) is compact and structured. We predict that when hydrophobic cluster is moved to the loop τF marginally increases, whereas moving the hydrophobic cluster closer to the termini results in significant decrease in τF relative to wild type. The mechanism of hairpin formation is predicted to depend on turn stiffness.


Fast folding experiments on proteins and their building blocks (α-helices, β-hairpins, and loops) are providing glimpses into the time scales of the early events in the assembly of biomolecules (1, 2). In a recent study, Munoz et al. (3) showed that the 16-residue C-terminal peptide from the protein GB1 forms a β-hairpin in about 6 μs at the folding transition temperature TF (≈300 K), which is in accord with our earlier theoretical predictions (4, 5). They also proposed a statistical mechanical model to explain the observed thermodynamics and kinetics (6).

The key experimental findings of Munoz et al. (3) are (i) Thermodynamically, the formation of β-hairpin can be described by a two-state process. The transition to the folded state is broad, which is expected for small finite systems. (ii) In the temperature range 15–55°C the folding kinetics (monitored by tryptophan fluorescence) is exponential. This behavior suggests two-state folding kinetics. (iii) β-Hairpins form at rates considerably slower than helices. (iv) The temperature dependence of the measured relaxation rates shows slight curvature and an Arrhenius fit to the data gives negative activation energy.

Before these experiments theoretical and computational studies had demonstrated that generically β-hairpins can form in microseconds (4, 5). There are large variations in the time scales for β-turn formation depending on the underlying sequence-dependent characteristic temperatures (4). All-atom simulations in water have shown that many of the characteristics of folding kinetics in proteins (kinetic partitioning, “mini” hydrophobic core packing, etc.) are found in β-turn forming peptides (7). The experiments (3) raise the need for more realistic models, which are needed to understand the precise mechanisms and structural aspects of the transition states.

The purpose of this paper is to provide a complete picture of the folding of the 16-residue peptide and its variants by using Langevin simulations of minimal off-lattice models. We reproduce, qualitatively, all the major experimental observations. Our studies, based on coarse-grained off-lattice models with side chains, suggest that the zipping of the hydrogen bonds starting from the turn is essential for the assembly of the wild-type (WT) 16-mer peptide. The basic mechanisms are determined by the intrinsic rigidity of the turn which, in this class of peptides, is determined by the location of the hydrophobic cluster.

Methods

Model.

The off-lattice model is a coarse-grained representation of a polypeptide chain, in which only the positions of the Cα carbons and the centers of mass of amino acid residues are retained. The amino acid residues are represented as united side chains around their centers of mass. The hydrogen bonds (HBs) between the backbone carbonyl oxygen CO and amide hydrogen NH groups are mimicked as virtual moieties located between backbone α-carbons (8).

The sets of vectors representing the α-carbon positions {r⃗b,i} and the side-chain centers of mass {r⃗s,i}, where i = 1, 2, … , N, specify the conformation of the polypeptide chain. The WT sequence, which corresponds to the C-terminal fragment of the GB1 protein, is GEWTYDDATKTFTVTE.

The potential energy of a conformation is given by Ep({r⃗b,i}, {r⃗s,i}) = VBL + VSBC + VBA + VDIH + VHB + VNON, where VBL, VSBC, VBA, VDIH, VHB, and VNON are bond-length potential, side-chain–backbone connectivity potential, bond-angle potential, dihedral angle potential, hydrogen bond, and nonbonded long-range potentials, respectively. The interaction potentials for VBL, VBA, and VHB are given elsewhere (4, 8). Here we give a brief description of the additional terms.

Side-Chain–Backbone Connectivity Potential (VSBC).

We use VSBC = ∑i=1N (ks/2)(|r⃗s,i − r⃗b,i| − as)2 with ks = 200ɛh/a2 (= kb, the spring constant used in VBL) and as = a, where a (= 3.8 Å) is the average distance between Cα carbons and ɛh is an energy of hydrophobic interactions.

Dihedral Angle Potential and Chirality.

The inclusion of side chains introduces chirality in the native state topology—i.e., the native structure and its mirror image become topologically distinctive. The dihedral angle potential VDIH is

graphic file with name M1.gif 1

where Ai = Bi = 1.2ɛh and Ci = ±0.6ɛh are the values used for the dihedral angles outside the turn region—i.e., for i ranging from 1 to 5 and from 10 to 13. Within the loop (for i = 6 to 9) we choose Ai = 0, Bi = 0.2ɛh, and Ci = ±0.3ɛh. In Eq. 1 the states g and g+ have different energies and the symmetry in VDIH with respect to the substitution φ → −φ is broken. This modification is sufficient to distinguish between chiral β-hairpin conformations. We selected the signs of Ci by examining the dihedral angles φiPDB from α-carbon representation of the Protein Data Bank (PDB) native structure (ID code 1gb1) of 16-mer β-hairpin. If φiPDB < 0 (>0), then we chose Ci > 0 (<0). The signs of Ci are altered with i to trace the native fold of the β-hairpin. Other ways of mimicking chiral effects can be found elsewhere (9, 10).

Nonbonded Potential.

The nonbonded potential has three terms VNON = VS + VB + VBS, where VS is the interaction between side chains, VB is the interaction between backbone α-carbons, and VBS is the potential between backbone α-carbons and side chains (4).

The interactions between side-chain atoms are

graphic file with name M2.gif 2

The residue-dependent term bij is the contact interaction matrix taken from table III of Kolinski et al. (KGS) (11). The pairwise contact distances rij0 (expressed in the units of a = 3.8 Å) are from table I of ref. 11. The parameter sij = bij/|bij| in Eq. 2 controls the nature of the VS potential. If sij < 0, then the interaction is attractive at rrij0. If sij is positive, VS is purely repulsive and we replace ɛh with ɛl = Inline graphicɛh.

The general scheme to develop force fields described above for polypeptide chains is used for the 16-mer fragment. We consider two models. The first, the Klimov and Thirumalai (KT) model, uses the potentials described above. A Gō version of the KT model is constructed, in which the interactions between side chains that are in contact in the PDB native conformation are assumed to be given by Eq. 2. The nonbonded interactions between all other side chains (as well as VB and VBS in both models) account only for excluded volume and do not depend on the residue type. The general functional form for these interactions is

graphic file with name M4.gif 3

In our Gō-like model the dihedral angle potentials and HB interactions are the same as in the KT model.

Equilibrium quantities are computed by using the multiple histogram technique (12), and dynamics is obtained from Langevin simulations. The energy scale ɛh is typically between 1 and 1.5 kcal/mol. For the purposes of converting to real units we take ɛh = 1.25 kcal/mol and time scale τL = (ma2h)1/2 ≃ 3 ps (4), where m (= 5·10−22 g) is the average residue mass.

Results

Native State Topologies.

In Fig. 1A the native state, identified as the lowest-energy conformation at T = 0.0 using multiple slow cooling simulations, is shown for the KT model. Fig. 1B shows structural aspects of the hairpin—the HBs and the interaction between the residues constituting the hydrophobic cluster (3W, 5Y, 12F, 14V). The structure is stabilized by 26 (21 in the case of the Gō model) native contacts between side chains (defined with 7.6-Å cut-off distance) and 14 HBs. The angle γ between the planes defined by the two strands (residues 1–6 and 10–16) is 12.1°, and the radius of gyration of the native conformation is Rg,N = 11.9 Å. The corresponding values for the Gō model are 13.4° and 12.2 Å, respectively. The radius of gyration of the PDB native conformation (ID code 1gb1) RgPDB is 12.2 Å, and the angle γ is 12.6°. The backbone rms (compared to the PDB structure) of the Gō model is 1.8 Å, whereas the full rms deviation is 2.5 Å. For the KT model the backbone and full rms deviations are 2.6 Å and 3.1 Å, respectively. The contributions to the energy of the native state (per interaction) arising from side-chain contacts (−1.3 kcal/mol) and HB interactions (−0.6 kcal/mol) for the Gō model are in agreement with those derived from the analysis of the energetics stabilizing the WT β-hairpin (stabilization energies due to side-chain interactions ΔGsc = −2.1 kcal/mol and HB formation ΔHHB = −1.1 kcal/mol) (3). The results for the KT model are −1.6 and −0.5 kcal/mol.

Figure 1.

Figure 1

(A) The native conformation of the β-hairpin for the KT model. For clarity CO and NH groups are not shown. The figure is created with the program rasmol 2.6 (13). (B) Schematic representation of interactions in the native conformation of a β-hairpin. The 14 HBs are given by dashed lines connecting COi (in magenta) and NH17−i (in green) groups; HB numbering starts with Gly-1 (1GLY). HB labels indicate bond number (superscript) and the pair of residues (subscript). Thick dashed lines show interactions between hydrophobic cluster side chains.

Because only generic functional forms for the interaction potentials with physically reasonable values are chosen and no attempt has been made to optimize them, the agreement between the predicted structures and the PDB data should be considered excellent. We expect that the strategy developed here should provide a good low resolution of the folds of proteins as well.

Thermodynamics of Folding.

The collapse transition temperature, Tθ, obtained from the peak of specific heat (data not shown) is 346 K for the Gō model. Because this transition corresponds to chain compaction it can also be determined from the temperature dependence of the radius of gyration 〈Rg(T)〉 (Fig. 2A). The derivative dRg(T)〉/dT has a maximum at T = 365 K ≈ Tθ (data not shown). The value of 〈Rg(T)〉 in the range TF < T < Tθ changes by less than 10%. Moreover, ΔT = TθTF (= 0.12TF) is smaller than the width of folding transition (see Inset to Fig. 2A). Thus, the measurements of Tθ would be difficult, if not impossible, for such a small peptide. The folding transition temperature computed from the fluctuations of the overlap function Δχ (ref. 8) is TF = 308 K (for the Gō model). For the KT model the peak in Δχ occurs at TF ≈ 333 K, which is higher than the experimental estimate.

Figure 2.

Figure 2

(A) Temperature dependence of the fraction of native state fN measured by the fraction of native contacts 〈Q(T/TF)〉 for the Gō model (dashed curve) and the native hairpin population (solid curve, temperature scale is given at the top) calculated by using the experimental thermodynamic parameters ΔH = −11.6 kcal/mol and ΔS = −39 cal/(mol⋅K) (ref. 3). By fitting 〈Q(T/TF)〉 to a two-state model we obtained ΔH = −7.0 kcal/mol and ΔS = −22 cal/(mol⋅K). The thermal average of the radius of gyration 〈Rg〉 (in Å, the right-hand vertical axis) vs. T/TF is plotted (dotted curve). The location of collapse temperature is marked by an arrow. In the temperature range corresponding to that scanned in the experiments (3) the change in 〈Rg〉 is less than 10%. (Inset) Derivatives of 〈Q〉 (dashed curve) and experimental hairpin population (solid curve) with respect to temperature. The temperature scales are the same as in the main figure. It is seen that folding transition in both the experiment and the simulations is very broad. (B) Probabilities of HB formation PHBi (i gives the HB number) as a function of T/TF for the Gō model. PHBi for the HBs proximal to the hydrophobic cluster (HB3–143, HB4–134, HB12–511, HB13–412) are shown by solid curves, other HBs are represented by dotted curves. Terminal HBs and those near the turn are labeled.

The temperature dependence of the probabilities of formation of the HBs PHBi (i = 1, 2, … , 14) (Fig. 2B) for the Gō model shows that they form over a broad temperature range. Even at relatively high temperatures (T > TF) HB9–88 (i.e., a HB labeled 8 formed between CO9 and NH8) and HB7–107 are established with high probability, which is a consequence of their proximity to the turn. We should emphasize that the formation of these HBs does not in any way assist the assembly of the hairpin (Fig. 3B and discussions below). The first HB outside the loop region (namely, HB10–79) becomes stable at T = 359 K (≈Tθ). As the temperature is lowered the HBs are progressively “zipped” in accordance to their distance from the loop region—i.e., the ones closer to the loop become stable at higher temperatures. Generally, the process of formation of HBs along the hairpin resembles the thermodynamic analog of the recently proposed “kinetic zipper” model for hairpins (3, 14). In both models a hierarchy of structural changes occurs as the temperature is lowered (collapse, folding of the backbone, and ordering of the side chains).

Figure 3.

Figure 3

(A) Plots of the time dependence of the radius of gyration 〈Rg(t)〉 (in blue), the fraction of HBs 〈NHB(t)〉 (in red), the fraction of unfolded molecules Pu(t) (in green), and the fraction of hydrophobic cluster contacts 〈QHC(t)〉 (in yellow) for the Gō model at T = 0.82TF. The data are shown by dotted lines, and the biexponential fits are shown as solid curves. The collapse time τc (coinciding with the time scale for hydrophobic cluster formation), the HB formation time τHB, and the folding time τF are indicated by arrows. The data are normalized to vary between 1 and 0 and obtained by averaging over 200 trajectories. (Inset) Folding time τF vs. temperature. An exponential increase in τF is observed at low T. (B) Time evolution of color-coded kinetic probabilities of HB formation 〈PHBi〉 (the superscript i indicates the HB number as in Fig. 1B) at T = 0.82TF for the Gō model. The data are averaged over 200 trajectories. This figure shows kinetic zipping of HBs (only for t > τc do we observe HB formation away from the turn). The heterogeneity in HB stability after relaxation to equilibrium is also evident: the HBs near the turn are very stable (red color), whereas terminal ones are significantly frayed (light blue). (C) Snapshots from a generic folding trajectory at the three time scales indicated in A. At t = 0 two turn HBs and two intrastrand native contacts are preformed. At t ≃ τc the hydrophobic cluster is fully formed (shown by dashed lines) and the size of the hairpin is reduced to its equilibrium collapsed value. There are also four turn HBs. At t ≃ τHB all 14 HBs are already formed, but only about 60% of native side-chain contacts (including all in the hydrophobic cluster) is established. At t ≃ τF the native structure is reached. (D) Superposition of transition state (TS) conformations for two main TS clusters TS1 and TS2 in the Gō model. The superposition is accomplished by minimizing the rms deviation of structures constituting a given TS cluster. The structural characteristics of TSs are given in the text.

At first glance it may appear that the structural changes that seem to take place over nearly 40°C (308–346 K) is at variance with the apparent two-state transition seen experimentally. However, it should be emphasized that the thermodynamic transition is broad (see below), which is the reflection of the finite size of the chain. To quantitatively assess the degree of cooperativity we compare the theoretical results with experimental data. A measure of the degree of cooperativity of folding is Ωc = (Tmax2T)(dPN/dT) (15), where TmaxTF, ΔT is the full width at half maximum of dPN/dT, and PN(T) is a measure of formation of the native state. We find Ωc ≃ 0.82 when 〈Q(T)〉 = PN is used. The experimental value of Ωc, calculated from the temperature dependence of the fraction of the native state (see inset to figure 2 of ref. 3), is 0.14. Typical values of Ωc for proteins exceed 5 (15). Hence, we find that the degree of cooperativity in the hairpin models and that computed using experimental data are low, as expected.

In the Inset to Fig. 2A, we plot dfN/dT (where fN is taken to be the fraction of native hairpin for the experiment and 〈Q〉 for simulations). The transition width of the experimental curve is about 40°, and it is also large for our model. It is this large width of the transition that leads to an apparent hierarchy of structural changes. Such changes would be impossible to detect experimentally because they take place within the width of the transition. The low cooperativity and broad transition is consistent with low stability of the hairpin (see legend to Fig. 2A).

Time Scales of Folding.

We have computed the folding time scales at several temperatures TTF at the friction coefficient ζ = 50τL−1 [corresponding to water viscosity (5)] for the Gō model. Since the foldability index σ [= (TθTF)/Tθ] is an indicator of the rate of folding (4, 16), we expect σ = 0.11 for the Gō model to be a lower bound. The folding time for the Gō model at TF is 0.13 μs, which is about a factor of 50 less than the experimental value. Because the computations are considerably more demanding for the KT model, detailed kinetic studies were done only at TTF, at which τF ≈ 2.2 μs. The complete elimination of nonnative tertiary interactions drastically increases the folding rates.

The folding kinetics is monitored by the time dependence of Pu(t), the fraction of unfolded molecules at time t (i.e., those that have not yet reached the native state). We compute Pu(t), which decays exponentially at TF, from the distribution of the first passage times. At T = 0.82TF the folding time τF is minimal and the relaxation of Pu(t) has a dominant fast phase (Φ = 0.96) and a very small slow phase (0.04). In the inset to Fig. 3A we display the temperature dependence of τF, which is Arrhenius-like with a slight curvature. This trend compares favorably with experiments (see figure 6a of ref. 2).

Folding Mechanism: Collapse, HB Formation, and the Acquisition of Native State.

The mechanism of assembly of the β-hairpin structure is monitored by using several progress variables. It is important to define what is meant by the native structure in our kinetic simulations. Operationally, we assume that the native state is reached when all the HBs and the native contacts between the side chains are established simultaneously, and the instance this occurs for the first time is the first passage time, τ1i. Steepest descent quenches starting with these structures map onto the low-energy native-like conformations (Fig. 1A), and hence they belong to the native basin of attraction.

A note concerning the definition of the native conformation used in our simulations is in order. In our models and in other studies (3, 17) it is found that the ends are frayed, which is a reflection of the low stability of the folded state. If the contacts corresponding to these terminal residues are excluded from the definition of the first passage time, then τF reduces by about 30%. More importantly, we find no change in the mechanism of hairpin formation.

Different scenarios for the relation between collapse and folding emerge depending on the value of foldability index σ (16): Polypeptide chains with small σ collapse and fold almost synchronously. In other words, τcFO(1), where τc is the time scale for collapse. We have computed 〈Rg(t)〉, Pu(t), and the dynamics of the fraction of the native contacts 〈Q(t)〉 by averaging over 200 independent trajectories at temperatures slightly below TF (Ts = 0.82TF) for the Gō model. On the time scale τc the radius of the gyration decreases sharply (blue curve in Fig. 3A). The initial collapse time τc is approximately 0.2τF, where the folding time (= 0.04 μs) is computed by using Pu(t) (green curve in Fig. 3A). Thus, the initial event in the refolding process is the collapse of the polypeptide chain.

Additional insight into the mechanism of β-hairpin formation may be obtained by examining the kinetics of formation of the HBs 〈NHB(t)〉 and the dynamics of contacts between the residues in the hydrophobic cluster (HC) 〈QHC(t)〉. It is suspected that much of the inherent stability of this β-hairpin arises from contacts between the residues in the hydrophobic cluster. The time dependence of 〈QHC(t)〉 (yellow curve in Fig. 3A) shows that on an average hydrophobic cluster contacts are formed on time scale slightly less than the collapse time τc. Only subsequently do the majority of the HBs (on the time scale τHB) and other contacts form. A comparison of Fig. 3 A and B (in which the kinetics of formation of all HBs is displayed) shows that the acquisition of contacts in the HC does not lead to the rapid formation of HBs close to these residues. We should also emphasize that although HB9–88 is preformed it does not lead to rapid assembly of the hairpin (Fig. 3B), i.e., it does not act as a nucleation site. The formation of HBs (to a large extent) starts near the turn and proceeds by a zipping mechanism described by Munoz et al. (3). Formation of all native contacts, leading to the assembly of the hairpin, takes place of the time scale ≈τF. Conformations illustrating the structural changes that take place on the time scales τc (collapse and formation of HC contacts), τHB (formation of HBs by kinetic zipping mechanism), and τF (folding time) are shown in Fig. 3C. Animations are available at www.glue.umd.edu/∼klimov.

Identification of Transition State (TS) Structures by Cluster Analysis.

We have used a pattern recognition method based on the cluster analysis technique to identify TS structures in the two models (18). We anticipate (this is explicitly shown in our analysis) that the formation of a stable hairpin should be preceded by the formation of at least a few contacts between the two strands (19), which ensures a rapid access to the native conformation. This kinetic pattern is very reminiscent of a nucleation process. Our method, described in detail elsewhere (18), is readily suited to identify the TS structures that are responsible for interstrand contacts. We should note that although the TS structures are, in general, not unique (20) their number is small (21, 22), especially in a system as simple as the β-hairpin.

Gō model.

The TS structures were searched at Ts = 0.82TF, where the folding time is minimal. We generated 100 trajectories so that our results are statistically reliable. We searched for TS structures in every folding trajectory within the time interval (1 − δTS1,i, τ1,i. We chose δTS (= 0.08) from the condition PIITS) = ξ (= 0.9), where PIITS) is the probability of forming at least one interstrand (type II) contact. The relatively sharp increase in PIITS) for δTS ≲ 0.5 (see Fig. 4A) ensures us that no qualitative changes are expected upon changing ξ within certain limits. Our identification of TS structures also requires that the contacts be stable (with some tolerance) within the time interval given above. We also varied δTS to get a physically reasonable picture of the properties of TS structures which is important because the underlying reaction coordinate is unknown.

Figure 4.

Figure 4

(A) Kinetic probability of forming at least one interstrand contact PII(t) for the Gō (solid curve) and the KT (dashed line) models as a function of δTS, which reflects the progress to the native state. Data are obtained at T = 0.82TF and TTF, respectively, by analyzing 100 trajectories. We assume that the passage of TS occurs at the level of PII = 0.9 (dotted horizontal line). (B and C) Color-coded kinetic probabilities PTS(qi) for the contacts labeled qi to be formed in the TS for the Gō (B) and KT (C) models. PTS(qi) are represented by a color profile at given value of δTS. PTS(qi) for most intrastrand contacts (type I) rapidly increase, while PTS(qi) for interstrand contacts (type II) remain low until the late stages of folding. Then the probabilities PTS(qi) for interstrand contacts explosively grow, underscoring the dynamic cooperativity of β-hairpin assembly. For the Gō model the contact labels qi = 1, … , 21 represent the following native contacts (the numbers in parentheses are residue numbers): 1 (89), 2 (1516), 3 (13), 4 (24), 5 (35), 6 (46), 7 (57), 8 (911), 9 (1113), 10 (1214), 11 (1315), 12 (69), 13 (611), 14 (512), 15 (613), 16 (312), 17 (413), 18 (314), 19 (415), 20 (215), and 21 (216). For the KT model (qi = 1, … , 26) we have 1 (45), 2 (56), 3 (89), 4 (1516), 5 (13), 6 (24), 7 (35), 8 (46), 9 (57), 10 (68), 11 (911), 12 (1113), 13 (1214), 14 (1315), 15 (69), 16 (710), 17 (611), 18 (511), 19 (411), 20 (512), 21 (513), 22 (312), 23 (413), 24 (213), 25 (314), and 26 (215).

By analyzing each trajectory we identified structures in the TS region (δTS = 0.08), using the conditions given above. In each trajectory only a subset of native contacts is formed in the TS region. Their formation is preceded by the repeated collisions and break-up of various native contacts. Rather than analyze these trajectories in detail we present in Fig. 4B the color-coded kinetic probabilities PTS(qi) (the residues involved in the contacts qi are given in the legend to Fig. 4 B and C) averaged over 100 trajectories as a function of δTS. Even in the early stages of folding (δTS ≈ 1.0, t ≪ τc or τF) certain native contacts form with high probability. These are local intrastrand (type I) contacts, which occur only along the first or the second β-strands. At these early stages of folding PTS(qi) for interstrand contacts (type II contacts in Fig. 4B) are negligible. The TS structure cannot involve only local intrastrand contacts, since their formation alone does not lead to a stable native state. The profiles PTS(qi) change dramatically as δTS decreases. Starting with δTS = 0.08 (Gō model), which we identify with a TS region, PTS(qi) of certain interstrand contacts (contacts labeled qi = 13, 14, 16, 18 between residues 6–11, 5–12, 3–12, 3–14) exceed 0.5. Strikingly, three of these contacts belong to the HC (Fig. 1B). Hence, we conclude that formation of the contacts between strands is necessary for subsequent rapid formation of the native state in a dynamically cooperative manner (Fig. 4B).

We computed a number of overall characteristics of the TS conformations, such as the fraction of native contacts 〈QTS〉, the radius of gyration 〈Rg,TS〉, and the contact order 〈STS〉 (23). These quantities are averaged over the conformations constituting the clusters at the time (1 − δTS1i. On an average the TS structures are compact (〈Rg,TS〉/Rg,N〉 = 1.06) and native-like (〈QTS〉 = 0.64). The fluctuations in the distributions are small (unpublished results), implying that there is only a small number of TS structures (see below). The distribution of the contact order 〈STS/SN〉 with respect to the native value is found to be relatively broad with a slight shift toward local interactions. This finding emphasizes the role of both local and nonlocal contacts in the TS.

The diversity of the TS structures and their characteristics can be inferred by applying the cluster analysis to the ensemble of TS conformations. There are two main TS clusters, which are composed of rather compact but structurally different conformations. Approximately two-thirds of folding trajectories pass through the dominant TS (TS1), which is native-like (Fig. 3D). On an average, TS1 contains 15.1 native contacts (of which 7.9 are interstrand) and 10.7 HBs. The hydrophobic cluster is fully formed in TS1. The activation energy for the passage from the unfolded state U to the native state through this TS is negative (Ea,1 ≈ −1.9 kcal/mol). (Note the energy of U is calculated as a thermal weighted average over all conformations with less than 4 HBs at the simulation temperature.) The remaining one-third of unfolded peptides reach the native state through the second TS (TS2), which is less structured than TS1 and has significantly smaller native content. There are, on average, 12.3 native contacts (of which 5.5 are interstrand) and 5.9 HBs, which implies that roughly only half of the native interactions are formed. Interestingly, the hydrophobic cluster contains two of the three contacts formed. The activation energy for TS2 is higher (Ea,2 = 0.18 kcal/mol). The presence of a dominant ordered native-like TS with negative activation energy is consistent with the experiment and statistical mechanical model developed by Munoz et al. However, our results suggest that the dominant hairpin TS1 is more ordered than that predicted by Munoz et al. (3, 6).

KT model.

The major challenge in the analysis of the TS structures in the KT model is the dramatically increased folding time scales. To deal with a manageable number of structures we recorded the instantaneous conformations for cluster analysis at the intervals Δt = 0.6(1 + [τ1i/6000]) ns. For this analysis we generated 100 trajectories at TF. The value of δTS for the KT model using PIITS) = ξ (= 0.9) is 0.01 (Fig. 4A). A comparison of the kinetic probabilities PTS(qi) for the Gō (Fig. 4B) and KT (Fig. 4C) models shows that qualitatively these figures are similar. In particular, at times less than τc (large δTS) only intrastrand contacts are formed. However, rapid assembly of the β-hairpin occurs only when interstrand contacts begin to form. This is seen in the dramatic increase of PIITS) at δTS ≳ 0.01 (Fig. 4A). Fig. 4C shows that the dynamic cooperativity is much greater in the KT model than in the Gō model. This is seen in the explosive growth of PTS(qi) for interstrand contacts and rapid formation of β-hairpin at δTS < 0.01. Thus, in complete agreement with the results obtained for the Gō model, we conclude the β-hairpin reaches the TS when the contacts between the strands start to form. Consistent with the results for the Gō model, PTS(qi) are not determined by the strength of native contacts (data not shown), which emphasizes the role of topology in selecting the folding pathways.

The structural characteristics of the TS conformations (calculated with δTS = 0.01) show that on an average they are highly structured. The TS conformations have on average about 50% of native contacts. The average contact order (more precisely, the ratio of the contact order of the TS ensemble to the native value) 〈STS/SN〉 is equal to 0.80, which implies that local contacts are weakly dominant in the TS.

The comparison of the results for the Gō and KT models gives a consistent picture for TS structures. The TS analysis suggests the following folding scenario for a β-hairpin. The local contacts are formed extremely fast at early stages of folding, but their formation by themselves is not sufficient for reaching the native fold. The β-hairpin passes the TS region at later stages of folding when interstrand contacts form in a dynamically cooperative manner that guarantees quick native state assembly.

Folding Properties of β-Hairpin Mutants: Folding Speed and Mechanism Depend on the Stiffness of the Turn.

We used the Gō model and studied two mutants, M1 and M2, in which the hydrophobic cluster (residues 3W, 5Y, 12F, 14V) is moved either to the loop region (M1) or to sequence termini (M2). The resulting sequences are GEDTWDYATFTVTKTE (M1) and WEYTGDDATKTETFTV (M2). Moving the HC should affect the stiffness of the turn as measured by the bending rigidity κ. We expect that κM1 > κWT > κM2. The relative differences between the values of κ are expected to be small. If a continuum description is used, then the free energy of the polypeptide chain upon forming a hairpin can be written as F ≃ (κ/2)(L/R2) − (lH/a)Inline graphic where Inline graphic is the average gain per residue due to interstrand interactions, lH is the length of the strand, R is the curvature of the hairpin, and L = 2lH + πR. Minimizing this with respect to R gives R ≈ (alpL)1/3[kBT/Inline graphic]1/3, where the persistence length lp = κ/kB, where kB is the Boltzmann constant. This suggests that sequences with stiff turns have large turn radius, and hence their formation may require overcoming larger free energy barriers. On the basis of this argument we predict that τFM1 > τFWT > τFM2.

To test this prediction we performed simulations to probe the thermodynamics and kinetics of the two mutant sequences. The collapse temperatures Tθ are 315 K and 340 K for M1 and M2, respectively. The corresponding values for the folding transition temperatures are 264 K and 327 K. The most significant and meaningful changes in folding characteristics are seen in the degree of cooperativity Ωc and the foldability index σ. Mutant M1 has a lower folding cooperativity (Ωc = 0.62) and, consequently, stability as indicated by a noticeable decrease in Ωc derived from 〈Q(T)〉. Recall for the WT Ωc = 0.82. In contrast, there is an enhancement in cooperativity (and, therefore, stability) in the mutant M2. Indeed, Ωc = 1.42 for M2 is almost twice as large as for WT. Other studies have reached different conclusion on cooperativity (24).

Changes in cooperativity match those observed for the foldability index (σ = 0.16 for M1, 0.11 for the WT, and 0.04 for M2). Since σ is coupled with the folding rates (4, 16), we expect that M2 sequence should fold faster than WT. Because σ values for WT and M1 are rather close, we expect M1 to fold on similar or slightly larger time scales. These expectations are confirmed by kinetic simulations, in which we calculated folding times τF at the constant TsTF. Similar results are obtained at two different temperatures. We found that τFM2 ≃ 0.6τFWT, i.e., M2 folds faster than the WT (τFWT = 0.04 μs). The folding time for M1 is τFM1 ≃ 1.1τFWT. Thus, thermodynamic analysis of WT and both mutants suggests that moving the hydrophobic cluster to the loop compromises the folding cooperativity and stability and leads to somewhat longer folding times. In contrast, placing strong hydrophobic interactions closer to the hairpin ends enhances cooperativity (and stability) and folding rates. We expect these effects to be more dramatic in the real hairpins, where nonnative interactions participate in slowing the folding kinetics. Our findings also suggest that the folding mechanism depends on the rigidity of the hairpin.

Conclusions

Simple off-lattice models (with side chains represented as united atoms) of 16-residue β-hairpin-forming peptide are used to provide description of the kinetic and thermodynamic data. The simulations for the WT qualitatively reproduce all the important experimental observations, including a broad two-state thermodynamic transition, two-state folding kinetics, and the temperature dependence of the folding rates with an apparent negative activation energy. With this calibration we have made a number of theoretical predictions for the WT and two mutants.

(i) The mechanism of β-hairpin formation involves several structural stages. On the collapse time scale τc the contacts between the residues in the hydrophobic cluster form. Their formation is followed by zipping of HBs on the time scale τHB, predominantly starting from those near the turn. This is in accord with our earlier studies, which showed that nucleation of β-sheet formation starts from the turns (3, 21). Finally, rapid assembly of the hairpin takes place in a dynamically cooperative manner on the folding time scale τF. The rate-limiting step is the formation of sufficient number of interstrand contacts, and this is the only kinetic barrier. Estimate for τFc for the Gō model is between 3 and 5 and is expected to be the same for more realistic models as well. The value of τHBc ≈ 1.3. This implies that τc ≈ 2 μs and τHB ≈ 2.6 μs, assuming that τF ≈ 6 μs. Thus, in the experiment it would be hard to draw a distinction between the collapse and folding dynamics in this 16-mer peptide, especially given the low stability of the folded state and the small changes in the radius of gyration. Even in our simulations these changes can be detected only by using averages over hundreds of trajectories, and hence cannot be easily established in all-atom simulations.

(ii) On an average the dominant transition state structures are found to be highly compact, with the radius of gyration being only 6% larger than that for the native conformation. Fluctuations in these quantities are relatively small. The transition states, the number of which is very small, are highly ordered, with a large fraction of native contacts already formed.

(iii) We predict that the folding kinetics of mutants, which were generated by moving the hydrophobic cluster either close to the turn (M1) or near the termini (M2), is determined by the intrinsic stiffness of the hairpin. Both theoretical arguments and explicit simulations suggest that folding times are arranged as τFM1 ≳ τFWT > τFM2. The folding mechanism is determined by the rigidity of the turn. This finding implies that the complex mechanism described for WT may not hold for the mutants M1 and M2.

The models used here are not without limitations. In addition to the obvious limitation of the force fields used (this is also shared in all-atom simulations that rely on transferability of potentials) the neglect of explicit water molecules may be serious. Nevertheless, the combination of simple arguments and detailed simulations using semirealistic models has allowed us to not only reproduce experimental measurements but also make several testable predictions.

Acknowledgments

This work has been supported in part by Grant CHE99-75150 from the National Science Foundation.

Abbreviations

WT

wild type

HB

hydrogen bond

PDB

Protein Data Bank

KT

Klimov and Thirumalai

TS

transition state

Note

After this paper was completed we became aware of two papers on hairpin formation (17, 25). Our results for the folding mechanism, including the importance of hydrophobic cluster formation and folding (for WT) by passing through certain (short-lived) well-defined structures, are in agreement with molecular dynamics simulations (17). The finding that β-hairpins share many features in common with proteins (17) is in agreement with the present study as well as earlier reports (4, 7).

References

  • 1.Winkler J R, Gray H B, editors. Acc Chem Res (Protein Folding Special Issue) 1998;31:697–780. [Google Scholar]
  • 2.Eaton W A, Munoz V, Thompson P A, Henry E R, Hofrichter J. Acc Chem Res. 1998;31:745–753. [Google Scholar]
  • 3.Munoz V, Thompson P A, Hofrichter J, Eaton W A. Nature (London) 1997;390:196–199. doi: 10.1038/36626. [DOI] [PubMed] [Google Scholar]
  • 4.Veitshans T, Klimov D K, Thirumalai D. Folding Des. 1997;2:1–22. doi: 10.1016/S1359-0278(97)00002-3. [DOI] [PubMed] [Google Scholar]
  • 5.Klimov D K, Thirumalai D. Phys Rev Lett. 1997;79:317–320. [Google Scholar]
  • 6.Munoz V, Henry E R, Hofrichter J, Eaton W A. Proc Natl Acad Sci USA. 1998;95:5872–5879. doi: 10.1073/pnas.95.11.5872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Mohanty D, Elber R, Thirumalai D, Beglov D, Roux B. J Mol Biol. 1997;272:423–442. doi: 10.1006/jmbi.1997.1246. [DOI] [PubMed] [Google Scholar]
  • 8.Klimov D K, Betancourt M R, Thirumalai D. Folding Des. 1998;3:481–496. doi: 10.1016/s1359-0278(98)00065-0. [DOI] [PubMed] [Google Scholar]
  • 9.Takada S, Luthey-Schulten Z, Wolynes P G. J Chem Phys. 1999;110:11616–11629. [Google Scholar]
  • 10.Hardin C, Luthey-Schulten Z, Wolynes P G. Proteins Struct Funct Genet. 1999;31:281–294. [PubMed] [Google Scholar]
  • 11.Kolinski A, Godzik A, Skolnick J. J Chem Phys. 1993;98:7420–7433. [Google Scholar]
  • 12.Ferrenberg A M, Swendsen R H. Phys Rev Lett. 1989;63:1195–1198. doi: 10.1103/PhysRevLett.63.1195. [DOI] [PubMed] [Google Scholar]
  • 13.Sayle R, Milner-White E J. Trends Biochem Sci. 1995;20:374–376. doi: 10.1016/s0968-0004(00)89080-5. [DOI] [PubMed] [Google Scholar]
  • 14.Dill K A, Fiebig K M, Chan H S. Proc Natl Acad Sci USA. 1993;90:1942–1946. doi: 10.1073/pnas.90.5.1942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Klimov D K, Thirumalai D. Folding Des. 1998;3:127–139. doi: 10.1016/s1359-0278(98)00018-2. [DOI] [PubMed] [Google Scholar]
  • 16.Thirumalai D. J Phys I. 1995;5:1457–1467. [Google Scholar]
  • 17.Pande V S, Rokhsar D S. Proc Natl Acad Sci USA. 1999;96:9062–9067. doi: 10.1073/pnas.96.16.9062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Klimov D K, Thirumalai D. J Mol Biol. 1998;282:471–492. doi: 10.1006/jmbi.1998.1997. [DOI] [PubMed] [Google Scholar]
  • 19.Ramirez-Alvarado M, Blanco F J, Serrano L. Nat Struct Biol. 1996;3:604–611. doi: 10.1038/nsb0796-604. [DOI] [PubMed] [Google Scholar]
  • 20.Onuchic J N, Socci N D, Luthey-Schulten Z, Wolynes P G. Folding Des. 1996;1:441–450. doi: 10.1016/S1359-0278(96)00060-0. [DOI] [PubMed] [Google Scholar]
  • 21.Guo Z, Thirumalai D. Folding Des. 1997;2:377–391. doi: 10.1016/S1359-0278(97)00052-7. [DOI] [PubMed] [Google Scholar]
  • 22.Pande V S, Grosberg A Y, Tanaka T, Rokhsar D S. Curr Opin Struct Biol. 1998;8:68–79. doi: 10.1016/s0959-440x(98)80012-2. [DOI] [PubMed] [Google Scholar]
  • 23.Plaxco K W, Simons K T, Baker D. J Mol Biol. 1998;277:985–994. doi: 10.1006/jmbi.1998.1645. [DOI] [PubMed] [Google Scholar]
  • 24.Kolinski A, Ilkowski B, Skolnick J. Biophys J. 1999;77:2942–2952. doi: 10.1016/S0006-3495(99)77127-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Dinner A R, Lazarisis T, Karplus M. Proc Natl Acad Sci USA. 1999;96:9068–9073. doi: 10.1073/pnas.96.16.9068. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES