Abstract
Site-specific information about structural heterogeneities of the protein-folding transition-state ensemble is commonly derived from the scatter of the Brønsted plot through the individual values of φ = Δlogkf/ΔlogKD-N. Here, we provide a second level of site-specific detail in the transition-state analysis by demonstrating that the scatter of the Hammond plot is related to heterogeneities in the φ-value growth. That is, the extent of transition-state movement (Δβ‡) is proportional to the free-energy gradient of the mutational perturbation across the top of the activation barrier, φ′(β‡) ∝ ΔlogKD-N. The analysis is applied to the two-state protein L23 where the site-specific free-energy gradients are used to identify the interactions that show the highest degree of consolidation after crossing the barrier top. These interactions are distributed as a shell around the high-φ initiation point and denote the side-chain contacts that add criticality to the folding nucleus.
Keywords: transition state, protein engineering, φ-value analysis, Brønsted plot, Hammond behavior
Amajor challenge in experimental studies of protein folding is to deduce more detailed, high-dimensional information about the diffuse transition-state ensemble to aid the microscopic characterization of folding nucleation and the folding free-energy landscape (1, 2). One strategy has been to see how the structural features of the transition-state ensemble change upon perturbation by denaturants and mutation. A frequently encountered example of such transition-state changes is the Hammond postulate behavior (3). The phenomenon was first reported for barnase and CI2, whose transition-state structures became more native-like as the activation free energy for unfolding was decreased by protein engineering (4, 5). Detection of the transition-state shift was by changes of the kinetic m values, i.e., by small tilts of the v-shaped chevron plots. The Hammond behavior, which has subsequently appeared as a general feature of small proteins (6–16), indicates that the top of the activation barrier is smoothly curved and displays some breadth. Upon mutational perturbation, any slanting of this curvature will cause a movement of the barrier maximum along the folding trajectory (13). From second-order polynomial curvatures of the chevron plots (4), it is further apparent that the barrier tops display an approximately quadratic curvature (13). However, the magnitude of the Hammond shifts for proteins with v-shaped chevron plots is relatively subtle and covers typically <15% of the total folding distance as measured by β‡ = mf/(mu - mf). Deviation from this minimal behavior is sometimes seen for proteins with pronounced distortions of the v-shaped chevron plot, indicating much larger changes of the transition state position (13, 17). The phenomenon, however, can still be accounted for by low-dimensional movements along the barrier profile, but this time in the form of discrete shifts between consecutive pointed maxima along the folding free-energy profile (7, 14, 18–20). More complex transition-state alterations along secondary order parameters are not normally discerned by the gross parameter β‡ (6), and seem also to require extensive mutational alterations (7, 21–23) or changes of the sequence connectivity through circular permutation (24–27).
In this study, we present a formalism that quantitatively relates mutational perturbations to the extent of Hammond shifts in protein folding, revealing another aspect of the structural events taking place in the transition-state ensemble: the site-specific growth of contact free energy along the experimental progress coordinate β. The analysis is applied to the ribosomal protein L23 (PDB ID code 1N88; ref. 28), which has not previously been subjected to folding studies. L23 shows v-shaped chevron plots and a classical nucleation-condensation distribution of fractional φ values in the transition state. The mutant-induced m-value changes show further that L23 describes moderate Hammond shifts, but no significant plasticity of the denatured or native ground states. The critical feature of the L23 Hammond plot is, however, that the individual m values are scattered beyond the experimental errors, allowing a detailed description of the contact growth within the critical nucleus as the protein traverses the barrier top.
Theory
General Features of Two-State Folding. In two-state folding, the equilibrium constant (KD-N = [D]/[N]) has been shown to equal the ratio of the unfolding and refolding rate constants (29), KD-N = ku/kf, where
[1] |
[2] |
[3] |
according to the empirical observation of linear free-energy changes after the addition of chemical denaturants like GdmCl or urea (30). The parameters are derived experimentally from chevron plots; i.e., plots of log kf and log ku versus [GdmCl], fitting the equation
[4] |
The slopes mu - mf = mD-N are commonly taken as measures of the change in solvent exposure in the activation process of unfolding and refolding, respectively. The m values are used to position the folding transition-state ensemble on the experimental progress coordinate β according to refs. 29 and 31:
[5] |
Additional, site-specific information about the contacts present in the transition-state structure is obtained from the φ values (30)
[6] |
that measure the effect of point mutation on the transition-state stability (ΔΔG‡) relative to that of the native state (ΔΔGD-N). A φ value of 1 indicates that the structure around the mutated side chain has grown fully native-like at the position of the transition state (β = β‡), whereas a φ value of 0 indicates that it is still disordered.
Basic Rationale for Hammond Postulate Behavior in Protein Folding. The relation between mutational perturbation and transition-state movements; i.e., changes of β‡, was deduced as follows. We have previously demonstrated, by monitoring the effect of individual mutations along the broad barrier region of U1A, that the stability loss on mutation increases progressively with β (15). That is, mutations have generally larger energetic effect on the late part of the reaction profile where the interactions are better consolidated. A general expression for this mutant-induced perturbation is given by
[7] |
where ΔΔG(β) is the stability loss as a function of β and φ(β) describes the φ-value growth along β (Fig. 1). Because ΔΔG(β) increases along the experimental progress coordinate, the free-energy profile is shifted up more near the native state and, in combination with any width of the barrier top, this biased perturbation will cause the transition state to shift closer to the native state. For quadratic curvatures of the barrier top, as is suggested experimentally from second-order polynomial dependence of logku on [GdmCl] (13), the folding free-energy profile near the transition state may be approximated with the function
[8] |
where c < 0. The transition-state shift following the mutational perturbation ΔGmut(β) = ΔG(β) + φ(β)ΔΔGD-N is then given by 0 = ΔG′(β) + φ′(β)ΔΔGD-N ⇒
[9] |
and from Eq. 5
[10] |
assuming that the perturbation φ(β)ΔΔGD-N is approximately linear in the narrow interval Δβ‡. The corresponding barrier height at β‡mut is
[11] |
In other words, the extent of transition-state movement (Δmu) is proportional to the gradient of the mutational perturbation across the free-energy maximum. Hammond shifts are thus not expected for mutations with constant values of φ across the barrier top, because these mutations will just contribute to offset the barrier without affecting its tilt. β‡ shifts arise only from mutation of interactions that undergo changes upon traversing the barrier top, i.e., interactions that are in the process of being formed (5). Preformed or fully unfolded parts of the protein are just carried over silently. Notably, it can be shown that the proportionality between Δmu and φ′(β‡ wt)ΔΔGD-N is qualitatively the same whether the barrier top is not perfectly smooth as assumed in Eq. 8, but riddled with fine structure. The values of Δmu would then tend to change discretely with ΔΔGD-N as the transition state shifts from one local maximum to the next. However, the approximately smooth distribution of the experimentally observed mu values provides no evidence for the existence of such fine structure (compare Fig. 5). To this end, it may further be argued that, for the small β-windows studied here, the fine structure should be smoothened out by conformational heterogeneities of the transition-state ensemble. Another notable feature of the simplistic formalism in Fig. 1 is that it captures the experimental finding that, in protein folding, the Hammond shift (Δβ‡ or Δmu) is linearly related to ΔΔGD-N (13) and not to changes in the barrier height (Eqs. 8, 9, 10, 11).
Experiment
Materials. Mutagenesis was performed by standard procedures using the QuikChange kit (Stratagene). The protein was expressed in Escherichia coli (BL21 DE3) and purified with cation exchange (SP-Sepharose, Amersham Pharmacia) and gel filtration (S100, Amersham Pharmacia). The stopped-flow measurements were carried out on a SX-17MV instrument (Applied Photophysics, Surrey, U.K.) at 25°. Buffer was 50 mM Mes at pH 6.3 (Sigma) and 0.23 M Na2SO4 (Merck). The guanidinium chloride (GdmCl) was ultra pure from ICN Biomedicals.
Na2SO4 Allows Analysis of Marginally Stable Proteins. Because the stability of wild-type L23 is relatively low, the experiments were performed under mildly stabilizing conditions in the presence of 0.23 M Na2SO4 to assure accurate determination of the refolding rate constant and the refolding m value (Eqs. 2 and 3). Under these conditions, L23 displays a transition midpoint of 3.73 M GdmCl and ΔGD-N = 7.4 kcal/mol, corresponding to a stabilizing power of Na2SO4 of ≈3 (kcal/mol) M. In the absence of Na2SO4, midpoint = 2.77 M and ΔGD-N = 4.6 kcal/mol.
A side effect of subjecting proteins to Na2SO4, or other stabilizing agents, is that the increased contact strength may lead to premature collapse of the coil, revealed by rollovers in the refolding limb of the chevron plot (32, 33). Although such rollovers may be discerned >0.5 M Na2SO4, the effect is negligible at 0.23 M. As a control, the φ values for several mutants were determined in both 0 and 0.23 M Na2SO4 and found to be the same within the experimental errors. Moreover, the exclusion of chevron data <1 M GdmCl does not significantly affect the m value changes upon mutation.
V-Shaped Chevrons and Diffuse Nucleus Suggest “Classical” Two-State Behavior. L23 consists of a central four-stranded β-sheet that is clamped by three helices forming a saddle-shaped hydrophobic core (28). A notable feature of this core is that the aromatic side chains are polarized toward the N-terminal side, whereas the C-terminal side consists mainly of aliphatic moieties. For the analysis, 17 conservative mutations were performed throughout the interior of the L23 structure, all of which showed approximately v-shaped chevron plots with overall similar, but not identical, m values (Fig. 2). By this criterion, L23 classifies as a classical two-state folder (34). Even so, the slightly different tilts of the chevron plots, i.e., changes in mf and mu (Table 1), will produce a slight GdmCl dependence of the φ value according to Eq. 6. The phenomenon is apparent to various extent in all two-state proteins and seems to reflect structural reorganizations of the transition state upon mutation (35). To minimize the consequences of these m-value changes, we have estimated the φ values near the transition midpoint at 1.5 M GdmCl, where the refolding rate constant (kf) can be measured directly and the extrapolation of unfolding data (ku) is minimal. All L23 mutations display fractional φ values, ranging from a highest value of 0.44 for V81A to nearly zero for V12A and I89A (Table 1). This diffuse nature of the L23 transition state is further apparent from the Brønsted plot (i.e., Δlogku versus ΔlogKD-N), which yields <φ> = 0.18 (Fig. 3). When calculated at other GdmCl concentrations, the internal graduation of the φ values remains largely the same, showing that the deviation from “ideal” two-state behavior of L23 is too small to have any significant bearing on the structural analysis of the transition-state ensemble. The stability loss of the mutants in Table 1 varies from 1.1 kcal/mol (L57A) to >4 kcal/mol (Y26A), with the exception of the excluded mutation Y18A that shows too slight a stability change to allow accurate determination of its φ-value. Taken together, these straightforward experimental characteristics make L23 an ideal model system for tests of the formalism in Fig. 1.
Table 1. Parameters from kinetic measurements.
Mutant | Midpoint (M) | mf | mu | mD-N (mol-1·M-1) | ΔGD-N(kcal/mol-1) | ΔΔGD-N (kcal/mol-1) | ø1.5M | β‡ | Δmu | Δmu/ΔlogKD-N | ||
---|---|---|---|---|---|---|---|---|---|---|---|---|
WT | 1.31 ± 0.03 | -4.13 ± 0.02 | 3.73 ± 0.05 | -0.75 ± 0.02 | 0.70 ± 0.04 | 1.46 ± 0.05 | 7.39 ± 0.28 | 0.52 | ||||
18V | 1.22 ± 0.04 | -3.10 ± 0.16 | 3.17 ± 0.07 | -0.74 ± 0.03 | 0.62 ± 0.03 | 1.36 ± 0.04 | 5.87 ± 0.22 | 1.52 ± 0.36 | 0.08 | 0.55 | 0.09 | 0.11 |
V12A | 1.25 ± 0.04 | -2.76 ± 0.16 | 3.06 ± 0.07 | -0.73 ± 0.03 | 0.58 ± 0.04 | 1.31 ± 0.04 | 5.45 ± 0.22 | 1.94 ± 0.36 | 0.02 | 0.56 | 0.12 | 0.14 |
L13A | 1.21 ± 0.06 | -3.10 ± 0.21 | 3.15 ± 0.09 | -0.74 ± 0.04 | 0.62 ± 0.05 | 1.37 ± 0.06 | 5.87 ± 0.29 | 1.52 ± 0.41 | 0.08 | 0.54 | 0.08 | 0.10 |
Y18A | 1.20 ± 0.03 | -4.27 ± 0.23 | 3.91 ± 0.05 | -0.70 ± 0.02 | 0.70 ± 0.05 | 1.40 ± 0.05 | 7.45 ± 0.31 | -0.06 ± 0.42 | 0.50 | 0.01 | -0.02 | |
F21A | 1.24 ± 0.03 | -3.17 ± 0.14 | 3.39 ± 0.05 | -0.72 ± 0.01 | 0.58 ± 0.03 | 1.30 ± 0.04 | 6.00 ± 0.20 | 1.39 ± 0.34 | 0.03 | 0.55 | 0.12 | 0.26 |
Y26A | 0.84 ± 0.02 | -1.20 ± 0.06 | 1.85 ± 0.05 | -0.65 ± 0.03 | 0.45 ± 0.02 | 1.10 ± 0.03 | 2.78 ± 0.09 | 4.61 ± 0.29 | 0.11 | 0.59 | 0.25 | 0.10 |
V30A | 0.76 ± 0.05 | -2.58 ± 0.09 | 2.30 ± 0.06 | -0.80 ± 0.04 | 0.66 ± 0.02 | 1.46 ± 0.05 | 4.55 ± 0.14 | 2.84 ± 0.31 | 0.29 | 0.55 | 0.04 | 0.02 |
I39V | 1.02 ± 0.04 | -3.40 ± 0.12 | 3.03 ± 0.05 | -0.79 ± 0.02 | 0.67 ± 0.03 | 1.46 ± 0.04 | 6.00 ± 0.18 | 1.39 ± 0.33 | 0.33 | 0.54 | 0.03 | 0.03 |
V43A | 0.83 ± 0.09 | -2.53 ± 0.17 | 2.24 ± 0.10 | -0.91 ± 0.07 | 0.59 ± 0.04 | 1.50 ± 0.08 | 4.58 ± 0.26 | 2.82 ± 0.38 | 0.34 | 0.61 | 0.11 | 0.06 |
V49A | 1.15 ± 0.05 | -2.20 ± 0.11 | 2.29 ± 0.06 | -0.84 ± 0.04 | 0.62 ± 0.03 | 1.46 ± 0.05 | 4.56 ± 0.16 | 2.83 ± 0.32 | 0.14 | 0.57 | 0.08 | 0.04 |
V51A | 1.09 ± 0.08 | -1.97 ± 0.06 | 2.41 ± 0.06 | -0.76 ± 0.05 | 0.50 ± 0.01 | 1.27 ± 0.05 | 4.15 ± 0.14 | 3.24 ± 0.31 | 0.12 | 0.60 | 0.20 | 0.11 |
V54A | 1.24 ± 0.03 | -1.46 ± 0.06 | 2.03 ± 0.05 | -0.88 ± 0.03 | 0.45 ± 0.01 | 1.33 ± 0.04 | 3.66 ± 0.09 | 3.73 ± 0.29 | 0.10 | 0.66 | 0.25 | 0.11 |
L57A | 1.19 ± 0.04 | -3.42 ± 0.31 | 3.42 ± 0.09 | -0.71 ± 0.03 | 0.64 ± 0.07 | 1.35 ± 0.07 | 6.28 ± 0.43 | 1.11 ± 0.51 | 0.08 | 0.53 | 0.07 | 0.15 |
I80A | 0.97 ± 0.03 | -2.42 ± 0.07 | 2.49 ± 0.05 | -0.79 ± 0.03 | 0.57 ± 0.02 | 1.36 ± 0.03 | 4.60 ± 0.11 | 2.79 ± 0.30 | 0.21 | 0.58 | 0.13 | 0.08 |
V81A | 0.54 ± 0.04 | -2.82 ± 0.09 | 2.44 ± 0.05 | -0.81 ± 0.03 | 0.57 ± 0.02 | 1.38 ± 0.04 | 4.57 ± 0.13 | 2.82 ± 0.31 | 0.44 | 0.59 | 0.14 | 0.08 |
V83A | 0.83 ± 0.03 | -2.32 ± 0.06 | 2.30 ± 0.04 | -0.73 ± 0.03 | 0.65 ± 0.01 | 1.37 ± 0.03 | 4.29 ± 0.09 | 3.10 ± 0.29 | 0.21 | 0.53 | 0.06 | 0.03 |
I89A | 1.22 ± 0.03 | -2.49 ± 0.10 | 2.88 ± 0.05 | -0.71 ± 0.02 | 0.57 ± 0.02 | 1.29 ± 0.03 | 5.04 ± 0.15 | 2.35 ± 0.32 | 0.02 | 0.55 | 0.13 | 0.11 |
ø1.5M calculated from refolding and unfolding data at 1.5 M GdmCl (Eq. 6) High and intermediate values are bold and underlined, respectively.
φ Value Analysis of the L23 Transition State. The positions of the mutated residues in the L23 structure are shown in Fig. 4, where the individual φ values are coded yellow (0<φ < 0.10), orange (0.10<φ < 0.25), and red (φ > 0.25). The thresholds have been chosen for clarity. Apparently, the L23 φ values describe an approximately radial graduation from the center of the hydrophobic core, which is in good agreement with the nucleation condensation mechanism for two-state folding (36, 37). The highest φ values (red) are observed for the triad Val-81 (φ = 0.44), Ile-39 (φ = 0.33) and Val-43 (φ = 0.34), linking the central strand 4 and helix 2 by contacts through the aliphatic side of the core, flanked by Val-30 (φ = 0.29) in strand 3 closing the loop to helix 2. In direct connection to this high-φ cluster is a contiguous group of residues with intermediate φ values (orange), closing up the central part of the β-sheet by connecting strands 2, 4, and 3; i.e., Tyr-26 (φ = 0.11), Val-49 (φ = 0.14), Val-51 (φ = 0.12), Val-54 (φ = 0.10), Ile-80 (φ = 0.21), and Val-83 (φ = 0.21). The lowest φ values (yellow) are found for the more peripheral residues Ile-8 (φ = 0.08), Val-12 (φ = 0.02), Leu-13 (φ = 0.08), Phe-21 (φ = 0.03), and Ile-89 (φ = 0.02), anchoring the N and C termini, and Leu-57 (φ = 0.08) near the disordered loop.
Matched m Value Changes Indicative of Transition-State Movements. Transition-state movements in protein folding are commonly inferred from changes in β‡ (Eq. 5), following stability perturbations by either mutation or addition of denaturant (5, 13). However, the interpretation of these β‡ changes is not always straightforward because they may also result from alterations of the ground states D or N (8, 19). To test for such ground-state alterations in the case of L23, we first replaced β‡ with mu that measures directly the “distance” between the native starting material and the transition state for unfolding (Eq. 2). The plot of mu versus change in unfolding activation free energy (∝ Δlogku) for the 17 mutations in Table 1 shows a significant linear correlation (R = 0.82), which is in apparent agreement with the Hammond postulate (Fig. 5). Because the correlation is largely retained in a plot of β‡ versus ΔlogKD-Nmut (R = 0.74), we conclude that contributions from denatured-state plasticity is after all very small with L23 (Fig. 5). The increased values of mf are matched by a decrease in mu, as would be expected for transition-state movements (Table 1). As an alternative test, the average φ value (<φ >) derived from the slope of the Brønsted plot was used as a measure of transition-state position and plotted against ΔlogKD-NGdmCl = <mD-N>[GdmCl], where <mD-N> is the average value of mD-N in Table 1. The result shows a linear correlation with a slope that is within the experimental error of that for β‡ versus ΔlogKD-Nmut (Fig. 5). It is thus apparent that <φ> and β‡ produce coherent Hammond shifts although their overall values for the transition-state placement are very different at ≈0.18 and ≈0.5, respectively.
Scatter in Hammond Plot Exceeds Experimental Errors. In view of the smooth transition-state shifts observed upon addition of denaturant (13), the plot of mu versus Δlogku reveals considerable scatter (Fig. 5). Notably, this scatter does not arise from experimental errors but shows mutant-specific deviations from the mean kinetic pattern. Compare, for example, I80A and V83A in Fig. 2 that have similar Δlogku but distinctly different mu. It can further be noted that the outliers in the L23 Hammond plot are not correlated with their extent of interaction in the transition-state ensemble, i.e., anomalously small mu changes are observed for both high and low φ values (Table 1). Scattered Hammond plots are not unique for L23 but have also been observed for barnase and CI2, where the susceptibility to undergo transition-state movements was found to be different in different parts of the structure (4, 5). Even so, these deviations from uniform Hammond behavior cannot a priori be attributed to transition state effects alone. One possibility is that they arise from partial unfolding of the native protein. Such ground-state fraying has been presented as an alternative explanation for kinked logku limbs of certain S6 mutations, based on their location at the late-forming interface between the N and C termini (19). However, this interpretation seems unlikely in the case of L23 because the largest mu changes involve core mutations where native-state fraying seems incompatible with microscopic reversibility (Fig. 4). Moreover, the mutations at the seemingly late folding N- and C-terminal regions of L23 show the smallest mu changes (Fig. 4). Another possibility is that the Hammond shifts describe redistributions between parallel folding pathways that extend beyond the minimal two-dimensional barrier model (4, 6). In accord with the prediction in Eq. 10, however, we opt for a more rational explanation: the scattered Hammond plots in protein folding emerge as an intrinsic consequence of nonuniform values of φ′ (β‡ wt) ΔΔGD-N (Fig. 1).
Discussion
The Diffuse Transition-State Ensemble. The φ-value data of L23 yields an overall diffuse impression of the transition state with a graduated distribution of fractional φ values that encompasses the majority of the native contacts (37). This delocalized nucleation pattern is a typical feature of two-state proteins (34, 36, 37) and is generally taken to indicate that the folding transition state is an ensemble of states whose average topology is native-like (38, 39). Consistently, the logarithm of the refolding rate constant across different two-state proteins is found to correlate with the topological parameter “contact order” of the native state (40), with some intrinsic modulation by protein stability and chain length (40–44). The native-like appearance of the transition-state ensemble suggests that the various parts of a protein fold with similar probability (38). At the extreme, such equalization would lead to a situation where all folding and unfolding trajectories are equally favorable, producing ideal Brønsted plots with uniform φ values and maximal cooperativity; local unfolding of the native structure is entropically prevented because all native interactions are fractionally present in the transition state (25). In practice, however, the graduated φ-value distribution indicates often a dominant initiation point that can be qualitatively predicted by simplistic topology-based folding models (45–48). Simulations suggest further that this initiation point, although it looks diffuse by φ values, could still be consistent with a subset of fully established contacts that are shared by the majority of species in the transition-state ensemble (49). With L23, the initiation point is the docking of the helix 2 and strand 4 regions by long-range contacts through the aliphatic side of the core (Fig. 4). Concomitant with this long-range hydrophobic association is probably the local trapping of hydrogen bonds in helix 2 that are “flickeringly” present in the denatured ensemble (37, 50). Structural rationalization of the subsequent layers of contacts becomes increasingly degenerate as the configurational dimensionality grows rapidly with decreasing values of φ (39). Even so, additional information about the folding nucleation events can be deduced from the mu value changes in Fig. 5.
Variable Hammond Response Linked to Heterogeneous φ Value Growth. In contrast to the low-dimensional case of covalent bonds, it is evident that the Hammond behavior in protein folding is governed by multiple order parameters: the global progress coordinate β by which the transition-state shift is measured, and a large set of site-specific coordinates ΔΔG(β) = φ (β)ΔΔGD-N that determine the energetic perturbations upon mutation. It follows that the individual data points in the Hammond plot (Δmu versus ΔΔGD-N), which are derived from site-specific free-energy perturbations, reflect local properties ∝φ′(β‡). The common Hammond plot, based on a superposition of data from structurally dispersed mutations, is therefore expected to contain scatter unless the local values of φ′(β‡) are the same throughout the transition-state structure. However, such uniform growth of the local progress coordinates seems incompatible with the way real proteins fold. The φ values are different throughout the protein structure and are bound to show nonlinear dependencies on β. Thus, the individual values of φ′(β‡) are also likely to be different at any given point along the reaction profile, which is consistent with experimental data on U1A (15).
On this basis, we conclude that the scatter of the Hammond plot (Fig. 5) is related to site-specific variations in φ′(β‡), similar to the way the scatter of the Brønsted plot is related to the site-specific values of φ(β‡) (Fig. 3).
An Additional Facet of the Folding Nucleus: The Critical Contact Layer. To examine the mechanistic implications of the Hammond data, we have tentatively defined Δmu ∝ φ′(β‡) as the site-specific free-energy gradient, and the normalized ditto Δmu/ΔΔGD-Nφ′(β‡) as the site-specific φ value gradient. Absolute estimates of these parameters require precise knowledge about the barrier curvature, i.e., the value of c in Eq. 8. Accordingly, Δmu can be seen as a directional parameter indicating in which part of the transition-state structure the growth of interaction free energy is most downhill, i.e., the interactions that show the largest degree of consolidation after traversing the barrier top. Δmu/ΔΔGD-N, on the other hand, represents a local order parameter indicating heterogeneities in the structural growth as defined by Eq. 6. The graduation of these parameters throughout the L23 structure is shown in Fig. 4, where the individual values are coded: magenta (Δmu > 0.21), green (0.1 < Δmu< 0.20), white (Δmu < 0.1), and pale blue (Δmu/ΔΔGD-N > 0.079). The result shows that the distribution of Δmu values is different from that of the φ values. Whereas the center of the folding nucleus is in the aliphatic side of the core, the high values of Δmu are, on the whole, shifted toward the aromatic side (Fig. 4). The implication of this spatial separation of the Δmu and φ value distributions is that, at the barrier top, the local free-energy gradients are steeper at the interface to the folding nucleus than in its center. Looking at a more detailed level the pattern emerges even clearer. The two highest Δmu values (> 0.21) are found for the sheet mutations Y26A (φ = 0.11, strand 2) and V54A (φ = 0.10, strand 3), flanking the initiation site V81 (φ = 0.44) in the central strand 4, whereas the next layer of residues with intermediate Δmu values covers the aromatic part of the core. The lowest Δmu values are observed at the interface to the N and C termini, and at positions peripheral to the sheet. Notably, this low-Δmu set includes also the two residues in the high-φ cluster facing the C terminus, i.e., V30A (φ = 0.29) and I39V (φ = 0.33), emphasizing the polarization of high Δmu values toward side chains closing up the β-sheet.
The accompanying values of Δmu/ΔΔGD-N yield a very similar picture. The φ value growth is higher for the residues distributed around the initiation point than in its center (Fig. 4), in apparent analogy with the “hydrophobic cloud” wetting the central nucleus in the capillarity description of protein folding (51).
Accordingly, the results allow us to make a crude distinction between a primary set of interactions that provides the gross topology for nucleation; i.e., those that form on the uphill side of the free-energy barrier, and a secondary set that brings the process downhill. In somewhat simplified terms, the latter, deterministic, interactions are those that make the embryonic nucleus critical: the interactions that undergo the highest free-energy change after passing the barrier top. For comparison, the critical nucleus of water condensation is solely defined by size, because the individual molecules within the emerging droplet are indistinguishable. In protein folding, the interactions that add criticality to the nucleus have identity: the critical contact layer.
Acknowledgments
We thank Håkan Wennerström and Anders Öhman for stimulating discussions and the Swedish Research Council for financial support.
This paper was submitted directly (Track II) to the PNAS office.
References
- 1.Bryngelson, J. D., Onuchic, J. N., Socci, N. D. & Wolynes, P. G. (1995) Proteins 21, 167-195. [DOI] [PubMed] [Google Scholar]
- 2.Onuchic, J. N., Nymeyer, H., Garcia, A. E., Chahine, J. & Socci, N. D. (2000) Adv. Protein Chem. 53, 87-152. [DOI] [PubMed] [Google Scholar]
- 3.Hammond, G. S. (1955) J. Am. Chem. Soc. 77, 334-338. [Google Scholar]
- 4.Matouschek, A. & Fersht, A. R. (1993) Proc. Natl. Acad. Sci. USA 90, 7814-7818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Matouschek, A., Otzen, D. E., Itzhaki, L. S., Jackson, S. E. & Fersht, A. R. (1995) Biochemistry 34, 13656-13662. [DOI] [PubMed] [Google Scholar]
- 6.Matthews, J. M. & Fersht, A. R. (1995) Biochemistry 34, 6805-6814. [DOI] [PubMed] [Google Scholar]
- 7.Crane, J. C., Koepf, E. K., Kelly, J. W. & Gruebele, M. (2000) J. Mol. Biol. 298, 283-292. [DOI] [PubMed] [Google Scholar]
- 8.Sanchez, I. E. & Kiefhaber, T. (2003) J. Mol. Biol. 327, 867-884. [DOI] [PubMed] [Google Scholar]
- 9.Sanchez, I. E. & Kiefhaber, T. (2003) Biophys. Chem. 100, 397-407. [DOI] [PubMed] [Google Scholar]
- 10.Yiu, C. P., Mateu, M. G. & Fersht, A. R. (2000) Chembiochem. 1, 49-55. [DOI] [PubMed] [Google Scholar]
- 11.Isaacson, R. L., Weeds, A. G. & Fersht, A. R. (1999) Proc. Natl. Acad. Sci. USA 96, 11247-11252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Dalby, P. A., Oliveberg, M. & Fersht, A. R. (1998) Biochemistry 37, 4674-4679. [DOI] [PubMed] [Google Scholar]
- 13.Silow, M. & Oliveberg, M. (1997) Biochemistry 36, 7633-7637. [DOI] [PubMed] [Google Scholar]
- 14.Otzen, D. E., Kristensen, O., Proctor, M. & Oliveberg, M. (1999) Biochemistry 38, 6499-6511. [DOI] [PubMed] [Google Scholar]
- 15.Ternstrom, T., Mayor, U., Akke, M. & Oliveberg, M. (1999) Proc. Natl. Acad. Sci. USA 96, 14854-14859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Portman, J. J., Takada, S. & Wolynes, P. G. (1998) Phys. Rev. Lett. 81, 5237-5240. [Google Scholar]
- 17.Oliveberg, M. (1998) Acc. Chem. Res. 31, 765-772. [Google Scholar]
- 18.Oliveberg, M., Tan, Y. J., Silow, M. & Fersht, A. R. (1998) J. Mol. Biol. 277, 933-943. [DOI] [PubMed] [Google Scholar]
- 19.Otzen, D. E. & Oliveberg, M. (2002) J. Mol. Biol. 317, 639-653. [DOI] [PubMed] [Google Scholar]
- 20.Sanchez, I. E. & Kiefhaber, T. (2003) J. Mol. Biol. 325, 367-376. [DOI] [PubMed] [Google Scholar]
- 21.Nauli, S., Kuhlman, B. & Baker, D. (2001) Nat. Struct. Biol. 8, 602-605. [DOI] [PubMed] [Google Scholar]
- 22.Daggett, V. & Fersht, A. R. (2003) Trends Biochem. Sci. 28, 18-25. [DOI] [PubMed] [Google Scholar]
- 23.Gianni, S., Guydosh, N. R., Khan, F., Caldas, T. D., Mayor, U., White, G. W., DeMarco, M. L., Daggett, V. & Fersht, A. R. (2003) Proc. Natl. Acad. Sci. USA 100, 13286-13291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Viguera, A. R., Blanco, F. J. & Serrano, L. (1995) J. Mol. Biol. 247, 670-681. [DOI] [PubMed] [Google Scholar]
- 25.Lindberg, M., Tangrot, J. & Oliveberg, M. (2002) Nat. Struct. Biol. 9, 818-822. [DOI] [PubMed] [Google Scholar]
- 26.Li, L. & Shakhnovich, E. I. (2001) J. Mol. Biol. 306, 121-132. [DOI] [PubMed] [Google Scholar]
- 27.Weikl, T. R. & Dill, K. A. (2003) J. Mol. Biol. 332, 953-963. [DOI] [PubMed] [Google Scholar]
- 28.Ohman, A., Rak, A., Dontsova, M., Garber, M. B. & Hard, T. (2003) J. Biomol. NMR 26, 131-137. [DOI] [PubMed] [Google Scholar]
- 29.Jackson, S. E. & Fersht, A. R. (1991) Biochemistry 30, 10428-10435. [DOI] [PubMed] [Google Scholar]
- 30.Fersht, A. R. (1999) Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding (Freeman, New York).
- 31.Tanford, C. (1970) Adv. Protein Chem. 24, 1-95. [PubMed] [Google Scholar]
- 32.Otzen, D. E. & Oliveberg, M. (1999) Proc. Natl. Acad. Sci. USA 96, 11746-11751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Silow, M. & Oliveberg, M. (2003) J. Mol. Biol. 326, 263-271. [DOI] [PubMed] [Google Scholar]
- 34.Jackson, S. E. (1998) Folding Des. 3, R81-R91. [DOI] [PubMed] [Google Scholar]
- 35.Oliveberg, M. (2001) Curr. Opin. Struct. Biol. 11, 94-100. [DOI] [PubMed] [Google Scholar]
- 36.Abkevich, V. I., Gutin, A. M. & Shakhnovich, E. I. (1994) Biochemistry 33, 10026-10036. [DOI] [PubMed] [Google Scholar]
- 37.Fersht, A. R. (1995) Proc. Natl. Acad. Sci. USA 92, 10869-10873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Plotkin, S. S. & Onuchic, J. N. (2000) Proc. Natl. Acad. Sci. USA 97, 6509-6514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Vendruscolo, M., Paci, E., Dobson, C. M. & Karplus, M. (2001) Nature 409, 641-645. [DOI] [PubMed] [Google Scholar]
- 40.Plaxco, K. W., Simons, K. T. & Baker, D. (1998) J. Mol. Biol. 277, 985-994. [DOI] [PubMed] [Google Scholar]
- 41.Ivankov, D. N., Garbuzynskiy, S. O., Alm, E., Plaxco, K. W., Baker, D. & Finkelstein, A. V. (2003) Protein Sci. 12, 2057-2062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Koga, N. & Takada, S. (2001) J. Mol. Biol. 313, 171-180. [DOI] [PubMed] [Google Scholar]
- 43.Lindberg, M. O., Tangrot, J., Otzen, D. E., Dolgikh, D. A., Finkelstein, A. V. & Oliveberg, M. (2001) J. Mol. Biol. 314, 891-900. [DOI] [PubMed] [Google Scholar]
- 44.Fersht, A. R. (2000) Proc. Natl. Acad. Sci. USA 97, 1525-1529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Shoemaker, B. A., Wang, J. & Wolynes, P. G. (1997) Proc. Natl. Acad. Sci. USA 94, 777-782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Munoz, V. & Eaton, W. A. (1999) Proc. Natl. Acad. Sci. USA 96, 11311-11316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Alm, E. & Baker, D. (1999) Curr. Opin. Struct. Biol. 9, 189-196. [DOI] [PubMed] [Google Scholar]
- 48.Galzitskaya, O. V. & Finkelstein, A. V. (1999) Proc. Natl. Acad. Sci. USA 96, 11299-11304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Hubner, I. A., Shimada, J. & Shakhnovich, E. I. (2004) J. Mol. Biol. 366, 745-761. [DOI] [PubMed] [Google Scholar]
- 50.Krantz, B. A., Srivastava, A. K., Nauli, S., Baker, D., Sauer, R. T. & Sosnick, T. R. (2002) Nat. Struct. Biol. 9, 458-463. [DOI] [PubMed] [Google Scholar]
- 51.Wolynes, P. G. (1997) Proc. Natl. Acad. Sci. USA 94, 6170-6175. [DOI] [PMC free article] [PubMed] [Google Scholar]