Abstract
The elongated three-helix‐bundle spectrin domains R16 and R17 fold and unfold unusually slowly over a rough energy landscape, in contrast to the homologue R15, which folds fast over a much smoother, more typical landscape. R15 folds via a nucleation–condensation mechanism that guides the docking of the A and C-helices. However, in R16 and R17, the secondary structure forms first and the two helices must then dock in the correct register. Here, we use variants of R16 and R17 to demonstrate that substitution of just five key residues is sufficient to alter the folding mechanism and reduce the landscape roughness. We suggest that, by providing access to an alternative, faster, folding route over their landscape, R16 and R17 can circumvent their slow, frustrated wild-type folding mechanism.
Keywords: protein folding, Φ-value analysis, energy landscape, helix bundle, minimal frustration
Graphical Abstract
Research Highlights
► Homologous spectrin domains have very different folding behavior. ► Spectrin domain R16 folds slowly over an atypically rough energy landscape. ► We have substituted just five residues from R15 to R16. ► The mutated protein folds via a different folding mechanism (more like R15). ► This results in a faster folding across a smoother (more R15 like) landscape.
Introduction
Comparative folding studies combined with energy landscape theory have been applied successfully to the 15th, 16th and 17th repeats of chicken brain α-spectrin (R15, R16 and R17).1–7 These domains are elongated three-helix bundles with a 106‐residue repeat length.8–11 All three have similar structures, stabilities and Tanford β-values, but R16 and R17 fold and unfold some 3 orders of magnitude more slowly than R15.3,12 The folding landscapes of spectrin domains are complex. We have previously shown that R16 and R17 are best described as folding on a landscape with a high-energy intermediate and that there are two consecutive transitions states, one early (TS1, rate limiting at low denaturant concentrations) and one late (TS2) (Fig. 1).3–5,13 These slow-folding domains have been shown to have a rough energy landscape at TS1, which is responsible in part for the reduction in both folding and unfolding rate constants.6 In fact, landscape roughness acts to reduce the folding and unfolding rate constants around 5-fold.14 Recent characterization of a single point mutant (E18F) with a landscape that remains as rough as that of its R16 parent protein and folds via the same mechanism but with vastly increased rate constants has identified the remainder (majority) of the “slowing” to traditional effects of burial of charge on the transition state.14 For both R16 and R17, although TS1 is rough, the landscape at TS2 is smooth (B. G. Wensley and J. Clarke, unpublished data). R15 probably also has a complex folding landscape, but due to the speed with which it folds, only events in the early part of this landscape can be probed. In contrast to R16 and R17, the early, rate determining TS (equivalent to TS1 in R16 and R17) over which R15 folds and unfolds appears to lack roughness.7
Energy landscape theory (introduced in the late 1980s) proposes that evolution has resulted in energy landscapes that are smooth, or unfrustrated (the principle of minimal frustration).2 In particular, nonnative interactions are disfavored so that folding can proceed rapidly on a funnel-shaped free energy landscape. However, theory predicts that some frustration, or energetic roughness, may still exist in natural proteins, manifested as small local energetic traps, which will slow folding.2,15 These small local kinetic traps are not traditional folding intermediates that accumulate but are manifested as “internal friction”. To date, landscape roughness effects on folding dynamics have not been seen for any other domains of comparable size and folding timescale, although roughness has been predicted in theoretical studies.2,15–21
The similarities between R15, R16 and R17 suggested a sequence-specific origin for this atypical roughness. The behavior of two fast-folding core-swapped spectrin chimeric domains, R16o15c (outside of R16 with the core of R15) and R17o15c, shows this to be the case as both have ~ 80% sequence identity with their slow-folding parental domain but have reduced transition state roughness6 and an altered folding mechanism: from a framework, diffusion–collision mechanism in the slow‐folding wild-type parent domains to a more nucleation–condensation-like mechanism as is seen in R15.22
Here, we take the core-swapping strategy further, in order to probe the mechanistic basis for the landscape roughness observed in R16 and R17 folding. We investigate eight more core-swapped domains and identify a subset of just five residues on the A‐helix that are sufficient to induce a decrease in landscape roughness. We use Φ-value analysis to probe the folding mechanism of one of these fast‐folding versions of R16 and show that the roughness can be diminished by providing an alternative folding pathway via a stable folding nucleus.
Results
The minimal fast‐folding core
Previously, we inserted the entire core of R15 into R16 and R17 to produce faster‐folding core-swapped domains with reduced landscape roughness. We noticed that there are 10 core residues that are identical in R16 and R17 but are different in R15 and used these to narrow down the residues responsible for this reduced roughness. Of these, seven are in close proximity to one another (Fig. 2). In R16, these seven residues were mutated to the side chain found at this position in R15 to produce the minimal core-swapped domain R16m7 (m, a minimal core-swapped variant of the named parental domain; 7, seven residues are different to the parent domain). These mutations comprise five in helix A (E18F, E19D, I22L, K25V and V29L), one in helix B (V65L) and one in helix C (L97I). R16m7 was fully folded, as judged by size-exclusion chromatography, circular dichroism spectroscopy and cooperative unfolding in urea. It has a comparable thermodynamic stability to R16, despite showing a slight reduction in mD–N (Table 1 and Table S1). The folding kinetics of R16m7 are shown in Fig. 3a, Table 1 and Table S1. The effect of thermodynamic stability on both folding and unfolding rate constants can complicate the interpretation of how fast a domain folds. To correct for differing stabilities, we have used the rate constant at an equilibrium free energy, ∆GD–N, of 0.0 kcal mol− 1 (k∆G = 0) to judge the folding kinetics of these spectrin domains. (k∆G = 0 also avoids the need for long extrapolations to reach rate constants at zero M denaturant.) For R16m7, k∆G = 0 = 14 (± 4) s− 1, about 2 orders of magnitude faster than the R16 value [k∆G = 0 = 0.19 (± 0.01) s− 1] despite being only seven residues different (Table 1).
Table 1.
Domain | mD − Neqb (kcal mol− 1 M− 1) | ΔGD − NH2O (kcal mol− 1) | kΔG = 0 (s− 1) |
---|---|---|---|
R15a | 1.8 (± 0.1) | 6.8 (± 0.2) | 50 (± 20) |
R16a | 1.9 (± 0.1) | 6.3 (± 0.2) | 0.19 (± 0.01) |
R17a | 2.0 (± 0.1) | 6.1 (± 0.2) | 3.0 × 10− 2 (± 3 × 10− 3) |
R16o15ca | 1.9 (± 0.1) | 5.5 (± 0.2) | 12 (± 2) |
R17o15ca | 1.7 (± 0.1) | 3.3 (± 0.2) | 10 (± 1) |
R16m7 | 1.5 (± 0.1) | 6.2 (± 0.1) | 14 (± 4) |
R16m6(AC) | 1.7 (± 0.1) | 5.7 (± 0.1) | 28 (± 9) |
R16m6(AB) | 1.5 (± 0.1) | 5.9 (± 0.1) | 20.9 (± 0.8) |
R16m5 | 1.5 (± 0.1) | 4.6 (± 0.1) | 50 (± 10) |
R17m7 | 1.5 (± 0.1) | 3.9 (± 0.1) | 13 (± 4) |
R17m6(AC) | 1.4 (± 0.1) | 3.1 (± 0.1) | 20 (± 10) |
R17m6(AB) | 1.4 (± 0.1) | 3.6 (± 0.1) | 21 (± 5) |
R17m5 | 1.4 (± 0.1) | 2.9 (± 0.1) | 22 (± 5) |
See Table S1 for other thermodynamic and kinetic parameters and Materials and Methods for fitting details.
See Table S1 for other thermodynamic and kinetic parameters and Materials and Methods for fitting details.
Two strategies were adopted to dissect the effect of individual residues. First, of the seven residues mutated in R16m7, we investigated the two conservative substitutions that are not in helix A, V65L (B-helix) and L97I (C-helix) (Fig. 2). These two residues were returned to those seen in R16 either alone or in concert. Residue 97 was returned to Leu to produce a protein with mutations only in helices A and B [R16m6(AB)], residue 65 was returned to Val to produce R16m6(AC) and both were returned to create R16m5, where only the five residues on the A‐helix are different to R16. These variants all behave in a way similar to that of R16m7; thus, R16m6(AC), R16m6(AB) and R16m5 all still fold and unfold considerably faster than R16 (Fig. 3a, Table 1 and Table S1). The return of V65 and L97 either alone or in concert did not significantly slow the folding of these minimal core-swapped R16 variants. Second, we changed each of the residues in the A-helix, individually, in the background of the wild-type protein.14 Mutation of three of the residues has little effect on the folding kinetics (E19D, I22L and V29L), whereas mutation of two (E18F and K25V) speeds folding; in the case of E18F, this speeding of folding and unfolding is significant. However, and most importantly, this speeding of folding is not accompanied by either a reduction in landscape roughness or a change in pattern of Φ-values (and thus folding mechanism).14
Since the residues exchanged in R16m7 are identical in R16 and R17, the equivalent R17 minimal core-swapped domains [i.e., R17m7, R17m6(AB), R17m6(AC) and R17m5] were produced (Fig. 3b, Table 1 and Table S1). These R17 minimal core swaps are considerably destabilized, with reduced mD–N values and folding m-values, mkf, relative to R17. These results are consistent with the behavior observed for the previously studied full core-swapped domain R17o15c. The k∆G = 0 for all these minimal core swaps is increased by ~ 3 orders of magnitude compared to the parental R17 (Table 1). The R16 minimal core variants are more attractive for further study because they are considerably more stable than those of R17.
The aim of the study reported here was to investigate the link between a rough energy landscape and the mechanism for folding. We investigate roughness using solvent viscosity dependence and folding mechanism using a comparative Φ-value analysis. We have previously shown that comparisons of the Φ-values of the C-helix are the clearest indicator for different folding mechanisms in spectrin domains4,6,7; thus, we wished to use a protein with an entirely wild type C-helix for our comparative study. Since our results showed that the conservative mutation L97I in the C-helix does not slow the folding of the minimal core‐swapped domain, we chose to use the core‐swapped domain with the changes in the A‐helix and in the B‐helix only, R16m6(AB) for further study. This is the most stable of the minimal core-swapped variants.
The effect of solvent viscosity on the folding of R16m6(AB)
The effect of solvent viscosity on the folding of R16m6(AB) was determined as a measure of folding landscape roughness. The approach used is based on that previously applied to R15, R16, R17, R16o15c and R17o15c, which uses an empirically derived formulation of Kramers' theory24 describing folding as a diffusive process over an energy surface, where the folding or unfolding rate constant (k) is dependent on η (the solvent viscosity), σ (the internal friction of the protein), ∆GTS (the height of the energy barrier) and C (a temperature‐ and solvent‐independent term, comprising all components of the pre-exponential factor except the friction terms)6,24–26:
(1) |
This formulation assumes that solvent friction and internal friction are additive. At a constant ∆GTS when internal friction is negligible (i.e., σ ≪ η), k is inversely proportional to solvent viscosity and the slope of the plot of relative solvent viscosity versus relative rate constant should be close to unity. This should be the case for reactions with a smooth energy landscape and was seen for R15 (and other similar small proteins17–22). In contrast, for R16 and R17, k shows little dependence on solvent viscosity. They have substantial values of internal friction (σ) significantly larger than solvent viscosity (η); thus, the effect of altering the solvent viscosity on the pre-exponential factor is small. R16o15c shows an intermediate dependence, and R17o15c shows a strong R15-like solvent viscosity dependence (Fig. 4 and Table S2). Small‐molecule viscogens such as glucose tend to stabilize proteins and thus alter ∆GD–N and ∆GTS. To offset this, we apply the widely used isostability approach.17–22,28,29 The stabilizing effect of the glucose is counteracted using a chemical denaturant, and the stronger denaturant guanidinium chloride (GdmCl) is generally required, rather than urea. k is determined when ∆GD–N = 0.0 and 1.5 kcal mol− 1, for each glucose concentration. A value for the magnitude of σ can also be determined by rearranging Eq. (1) (see Materials and Methods). Although this approach has been criticized,30 the strength of our comparative studies is that any caveats apply equally to the three parent proteins and to their core-swapped derivatives. (For a full discussion of this approach in determining the magnitude of internal friction, see Supplementary Information for Wensley et al.6 and Borgia et al.15).
Folding and unfolding kinetics as a function of GdmCl and glucose concentrations are shown in Fig. S1a (see Materials and Methods for fitting details). The effect of solvent viscosity on the folding of R16m6(AB) is shown as relative solvent viscosity versus relative rate constant plots (Fig. 4 and Table S2). The average slope of the two viscosity plots for R16m6(AB) is 0.39 (± 0.01), compared with 0.20 (± 0.07) for R16, 0.75 (± 0.10) for R15 and 0.38 (± 0.06) for R16o15c. The average value of σ for R16m6(AB) is 1.6 (± 0.2) cP (Fig. S1b and Table S2), again, similar to that of R16o15c [2.1 (± 0.2) cP]. The internal friction seen for both R16o15c and R16m6(AB) is considerably lower than that for R16 [3.9 (± 0.8) cP] but is greater than that for R15 [0.26 (± 0.09)]. The six residues mutated in R16m6(AB) have reduced the landscape roughness of R16 to levels comparable with the full core-swapped R16o15c, despite the 95% sequence identity between R16 and R16m6(AB).
The transition‐state structure of R16m6(AB)
When the Φ-values of R15 are compared with those of R16 and R17, there are clear differences in the pattern of Φ-values in the A‐helix and in the C‐helix. In particular, the pattern of Φ-values in the C-helix suggests very different folding mechanisms (Fig. S2). Thus, to determine whether the reduced roughness in the minimal core-swapped domain is related to a shift in folding mechanism, we performed a Φ-value analysis of the C‐helix. Identical conditions and mutations were used to those in the Φ-value analysis of R16.4 Two types of substitution were made: (i) core mutations using a nondisruptive deletion mutation to probe the formation of tertiary structure at the transition state and (ii) surface Ala–Gly helix scanning mutations to probe helix formation.31,32
Equilibrium denaturation curves were used to determine [urea]50%, mD–N value and ΔGD − NH2O of all mutants (Table S3). Although we have not determined the structure of R16m6(AB), the ∆∆GD–N for most of these mutants is similar to that for the same mutation made in R16. This suggests only small structural changes as a result of the mutations. The exceptions to this are F90A, W94F and L97A, which are less destabilizing in R16m6(AB) than in R16 (Fig. S3). These three are at the center of the C‐helix and pack against the mutations made in the A‐helix to create R16m6(AB) where we might expect the packing to be significantly different.
Φ-Values for TS1 were calculated as described in Materials and Methods (Fig. S4 and Table S3). There are two core mutations that give rise to nonstandard values of Φ (Φ > 1 or Φ < 0), F90A and W94F (Fig. S4b). Both show a reduction in both folding and unfolding rate constants upon mutation. They have been excluded from the following Φ-value analysis but will be discussed below. The R16m6(AB) Φ-values are shown in Fig. 5 along with the TS1 Φ-values of R15, R16, R16o15c and R16 E18F.4,6,7,14 Despite the C-helix of R16m6(AB) having an identical sequence with that of R16, qualitatively, the pattern of Φ-values is clearly different. Notably, there is no longer a clear distinction between the surface and core Φ-values, with the former being consistently larger, as is the case for R16 (p = 0.007). In fact, in R16m6(AB), the magnitude of both types of Φ-values increases substantially in the center of the helix and decreases considerably at the C-terminal end of the helix.
This comparison can be made more quantitative. The C-helix Φ-values for R15, R16o15c and R16m6(AB) can all be fitted to a single Gaussian peak with similar peak positions (around residues 99, 96 and 93, respectively) and widths (around 2, 4 and 5 residues, respectively). In addition, the peak height and basal value agree within fitting error (Table S4). In contrast, the C-helix Φ-values for R16 cannot be well fit by a Gaussian. There is a larger fitting error in the peak position (91 ± 4), and the fitting errors for the other variables are significantly larger than the estimated values themselves. The pattern of Φ-values for the R16m6 is therefore much more similar to that of R15 than R16. Indeed, the individual Φ-values for R16m6 are better correlated with R15 values (p = 0.039, n = 9) than with R16 values (p = 0.13, n = 12) despite the absolute sequence identity for the C-helix with the latter. Critically, changing just six residues, none of which are present within the C-helix itself, has changed the apparent pattern of Φ-values and, by implication, the folding mechanism. Note that stabilization of TS by the mutation E18F, which speeds both folding and unfolding significantly, does NOT change the Φ-value pattern in the manner seen in the core-swapped proteins.14 Instead, the Φ-values are very well correlated (pairwise) with the parent protein R16 (p = 0.001, n = 11).14
Discussion
In R16 (and R17) secondary structure, Φ-values (probed by surface Ala–Gly mutations) are significantly higher than core Φ-values, and the Φ-values are fairly uniform along the entire length of the A‐helix and of the C‐helix.4,5 This pattern of Φ-values suggests a framework, diffusion–collision-like folding mechanism where partly preformed helices dock. R15 folds by a nucleation–condensation mechanism: the regions with high Φ-values in the A‐helix and in the C-helix pack together in the native structure.7 The minimal core residues in the A‐helix of R16m6(AB) co-localize with those residues that constitute the nucleation site on the A‐helix of R15 (shaded area in Fig. S2a). Both qualitatively and quantitatively, it can be seen that the pattern of Φ-values in R16m6(AB) is very different to its parent R16 but is remarkably similar to that of R15 (and R16o15c) (Fig. 5). In helix C of R16m6(AB), the region of high Φ-values (residues 92–99) are those that pack onto the minimal core residues in the A‐helix (Fig. 6). Given the 100% identity between the C‐helix of R16 and the C‐helix of R16m6(AB), the clear Φ-value differences in this helix are strong evidence for a change in transition‐state structure and folding mechanism in R16m6(AB). Since we know that the B-helix is essentially unfolded in the early transition state of all parent spectrin domains, we infer that the minimal core B-helix residue V65L (Φ-value of 0.1 in both R15 and R16) is not involved in the early TS in this core‐swapped protein.4,5,7 Thus, we propose that the minimal core residues engineered into the A‐helix of R16m6(AB) provide a folding nucleus against which the potential nucleating region in helix C can pack (Fig. 6). In this light, the nonclassical Φ-values of F90A and W94F can perhaps be understood. These large hydrophobic residues, which are at the heart of the putative nucleating region in the C‐helix (Fig. 6), slow both folding and unfolding (Fig. S4b). Perhaps, without these residues, the new nucleation site cannot be formed and these mutants are reverting to a more framework-like mechanism, associated with slower folding and unfolding.
The relationship between frustration and folding mechanism and folding speed
Our results on the minimal core-swapped proteins described here clearly support the relationship between the shift toward a nucleation–condensation folding mechanism and the reduced landscape frustration (roughness) for these spectrin domains.6 R16m6(AB) behaves in a manner similar to that of R16o15c, the full R16-based core swap, including an intermediate dependence of the folding and unfolding rate constants on solvent viscosity. We have shown that the origin of these behavioral shifts in R16m6(AB) and R16o15c has been narrowed down to five mutations in the A‐helix, E18F, E19D, I22L, K25V and V29L.
Just these five key substitutions are sufficient to shift the folding mechanism of R16 toward one much more similar to R15. We propose that it is this change in mechanism that is responsible for the reduced roughness of R16m6(AB), R16o15c and R17o15c relative to R16 and R17.6 In the folding of R16 and R17, the long spectrin helices partially preform and must then dock as the domain crosses the transition state. We hypothesize that it is the frustrated search for the correct register (repeated cycles of misdocking and undocking) that is manifested as internal friction. Once a good nucleation site is engineered into the A‐helix of R16 or R17, this nucleus allows the correct docking of the nucleation site in helix C and, thus, sets up the correct register for A‐helix and C-helix docking, enabling more rapid folding across a less frustrated energy landscape. The internal friction slows the folding of R16 and R17 by about 5-fold.14 Added to this is the effect of the individual mutations E18F and K25V that speed folding significantly (by ~ 40‐fold when combined) but without reducing the internal friction, and the Φ-value analysis of E18F shows that the folding mechanism of R16 is unaltered in this fast-folding variant (Ref. 14; see Table S5).
The same five mutations are also sufficient to significantly increase the folding and unfolding rate constants of R17 (Fig. 3b) and are a subset of the residues mutated to produce R17o15c that displays faster‐folding kinetics and a smooth, R15-like folding landscape.6
R15, R16o15c and now R16m6(AB) all fold and unfold by a nucleation–condensation mechanism on energy landscapes that are significantly less frustrated at TS1 than is R16. Interestingly, in the three-helix‐bundle homeodomain family, studied by Fersht et al., the faster‐folding members fold by a diffusion–collision mechanism, with docking of very well formed helices. Nucleation condensation is significantly slower,22,32–36 perhaps due to the necessity to form entropically more unfavorable long-range interactions. We suggest that the difference between the two systems is due to the length of the helices. In the homeodomains, the helices are very short (2–4 turns/helix), whereas in spectrin domains, the helices are 6–10 turns long, making the search for the correct docking register more difficult in the absence of a strong nucleation site.
There is other evidence to suggest that such misdocking events may indeed occur, from all-atom unfolding simulations performed in the Daggett laboratory.5 In these simulations, short-lived contiguous helical segments were seen in the denatured state and “a number of helical docking events are observed”. These docking events were only seen to occur between helices A and C, and they were “always out of register”. Importantly, these events never lead to rearrangement and correct folding, although their typical lifetime was “100s of ps”.
An important requirement of any hypothesis for the origin of the internal friction seen for R16 and R17 is that it must provide an explanation for the localization of the roughness at TS1 that has been observed along the reaction coordinate (B. G. Wensley and J. Clarke, unpublished data). Since spectrin domains fold via two consecutive transition states, we have been able to determine the roughness in both TS1 (described here), where the helices dock and topology is established, and TS2, where the protein becomes more structured.4,5 The denaturant dependence of the rate constants of R16 and R17 for folding and unfolding over TS2 is very strongly dependent on solvent viscosity (i.e., early internal friction is lost at TS2) (B. G. Wensley and J. Clarke, unpublished data). This is consistent with our proposal that the frustration at TS1 is due to helix misdocking as the TS is traversed and the helix register is established. Once the helices are correctly docked, folding proceeds rapidly along a smooth, unfrustrated landscape as TS2 is traversed.
While the importance of this nucleus in the A‐helix to the unfrustrated folding of these spectrin domains is clear, the nucleation mechanism is not known. Does the A‐helix nucleate first, capturing the C‐helix by the transition state, or do the two form concomitantly? Although the five key mutations are in the region of R15 with the greatest helical propensity, as determined by AGADIR, they do not significantly alter the low helix propensity shown by R16 and R17 in this region, and merely increasing the helical propensity of R16 in this region does not alter the folding kinetics significantly.3,7 The exact residue set involved in inducing the formation of the nucleus is unknown. An all-by-all search would take 30 variants and viscosity analyses and be prohibitively time consuming. A more productive approach must be one where simulations guide experiment, although simulations of these spectrin domains are not easy.5
Conclusions: the folding landscape of R15, R16 and R17
The relative ease with which we have manipulated the folding route taken by R16 and R17 suggests that, in addition to the unusual roughness seen at TS1, the energy landscapes must be complex with at least two potential routes across them. In the absence of clear nucleating signals, the wild-type proteins access a framework-like mechanism pathway involving a frustrated search for the correct docking. However, just a few substitutions allow access to faster folding by a non-frustrated nucleation–condensation pathway, which is very similar to the one traversed by R15. In wild-type R16 and R17, this latter path is not preferred, presumably because the putative nucleation site is unstable. In producing the two full core swaps and all eight minimal core swaps, we have altered this landscape, stabilized the A-helix to C-helix folding nucleus and allowed access to an alternate pathway. The intermediate roughness observed for R16o15c and R16m6(AB) contrasts with the more complete loss of internal friction seen for R17o15c and may indicate that, in the R16-derived core swaps, the switch to the nucleation–condensation route is not complete.22 It is not, however, possible to determine if this is due to the use of some intermediate route or to persistent nonnative interactions that are not removed in any of our R16-based core-swapped proteins.
The idea that nonnative interactions might introduce kinetic traps in an energy landscape is not new; such kinetic traps have been observed as intermediates with some nonnative contacts that need to unfold before folding can be completed (in the immunity proteins, for example, see Ref. 37, and even in simulations using simple Gõ models38). However, in our case, the kinetic traps are associated with small energy barriers—the roughness we observe is not associated with accumulation of intermediates. Our results pose a further conundrum: what is the nature of the proposed rearrangement events that result in escape from the misdocked kinetic traps? Escape is apparently viscosity independent, thus unlikely to require large movements of the polypeptide chain. We hope that our experiments will stimulate simulations to investigate this question.
It is worth noting that viscosity analyses have been undertaken for several small domains with a number of folds. One of these, GCN4-p2′, has elongated helices reminiscent of the spectrin domains, but it does not show any evidence for internal friction.20 To date, no other domains that fold on a comparable millisecond‐to‐second timescale have been shown to have a frustrated landscape.16–21 However, given that this frustration may result in only relatively small (here, about 5-fold) changes in folding and unfolding rates constants,14 few experimental studies have been performed; thus, we cannot know how common this phenomenon may be across fold space.
Materials and Methods
Synthetic genes for R16m7 and R17m7 were purchased from GenScript and inserted into the modified pRSETA vector used to express all of our spectrin domains. All mutagenesis, protein expression and purification methods have been described elsewhere as have details of how biophysical data are collected for these spectrin domains.3 The minimal core-swapped domains were treated in a manner analogous to that taken with their respective parental domains.
All equilibrium denaturation curves were fitted well to a two-state transition; fitting the kinetic data, however, was more complex.39 The chevron plots for R16m6(AC), R16m5, R16o15c, R17m7, R17m6(AB), R17m6(AC) and R17m5 had linear arms and, thus, were fitted to the two-state model
(2) |
where kobs is the observed rate constant, kfH2O is the folding rate constant in water, mkf is the folding m-value, kuH2O is the unfolding rate constant in water and mku is the unfolding m-value. R16m7 and R16m6(AB) displayed observable downward curvature in both chevron arms. In all cases, curvature in the refolding arm was removed by eye prior to fitting. R16m7 and R16m6(AB) could not be fitted to the sequential transition‐state model usually applied to R16 as they exhibit reduced mD–N values.3,40,41 Consequently, a broad transition‐state model, which has also been successfully used to fit the R16 Φ-value data set, was employed for all chevrons collected in urea.13,42–46 This model incorporates a second-order polynomial into the two-state model to account for the curvature. This term, m′, was only added to the unfolding arm as the refolding arms were limited to the linear region only.
k∆G = 0, the rate constant at ∆GD–N = 0.0 kcal mol− 1, that is, kf = ku, was determined using both thermodynamic and kinetic data. The [urea] at which ∆GD–N = 0.0 kcal mol− 1 was determined using stability measurements. Due to small deviations between kinetic and equilibrium ∆GD–N and mD–N values, at this concentration of urea, kf was similar but not always identical to ku. Consequently, refolding data only (kfH2O and mkf) were used to determine k∆G = 0. However, if unfolding data only (kuH2O and mkf) or ∆GD–N = 0.0 kcal mol− 1 is determined kinetically, the same results, within error, are seen.
The methodology used for the viscosity analysis of R16m6(AB) was based on that previously optimized with R15, R16, R17 and R16o15c, adding solvent viscosity using 0.0 M, 0.5 M, 1.0 M and 1.5 M glucose and making chevron plots using GdmCl.6,16–18,20,21,25,26,28 The chevron arms collected were very short due to a combination of the limit of our stopped-flow apparatus (kmax ~ 600 s− 1) and the use of the alternative denaturant GdmCl. Consequently, all curvature was removed from the data set and each chevron was fitted individually to a two-state model. Equilibrium and kinetic data did not agree due to inaccuracies in fitting such short chevron arms; thus, chevron plots only were used to determine kf and ku at ∆GD–N = 1.5 kcal mol− 1 and kf = ku at ∆GD–N = 0.0 kcal mol− 1. A consequence of this is that both the slope and σ-values for kf and ku at ∆GD–N = 1.5 kcal mol− 1 are identical; thus, only the kf data have been used. Solvent viscosities at the relevant isostability, as well as the slope of the viscosity plots and magnitude of σ, were measured and calculated as previously described.6
Briefly, since, from Eq. (1), at isostability (i.e., constant ∆GTS), , a plot of 1/k versus solvent viscosity (η) allows internal friction (σ) in centipoises to be determined and the effect of σ on k to be evaluated.
The R16m6(AB) Φ-value chevrons were globally fitted, and the wild-type value quoted in Table 1 and Table S1 comes from this global fit. In the global fit, mkf and m′ were shared. The equilibrium free energy in water, ΔGD − NH2O, was calculated for each mutant from the equilibrium data using
(3) |
where [urea]50% is the [urea] where [D] = [N] and 〈mD–N〉 is the mean mD–N of 39 mutants and was 1.49 kcal mol− 1 M− 1. From these, ΔΔGD − NH2O was calculated for each Φ-value pair. Φ-Values were calculated from folding data using
[4] |
where kf,WTH2O and kf,mutH2O are the folding rate constants in water for the wild type and mutant, respectively.48 Φ was only calculated where ΔΔGD − NH2O ≥ 0.5 kcal mol− 1. As mkf was shared, the value of Φ is invariant with [urea].
Acknowledgements
This work was supported by the Wellcome Trust (grant number GR064417MA). J.C. is a Wellcome Trust Senior Research Fellow. J.M.R. is supported by a Biotechnology and Biological Sciences Research Council studentship. We thank Ben Schuler and Alessandro Borgia (University of Zurich) for helpful discussions and advice on the manuscript and Peter Wolynes for his interest and stimulating discussions.
Edited by C. R. Matthews
Footnotes
Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.jmb.2012.08.003
Appendix A. Supplementary data
References
- 1.Nickson A.A., Clarke J. What lessons can be learned from studying the folding of homologous proteins? Methods. 2010;52:38–50. doi: 10.1016/j.ymeth.2010.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bryngleson J.D., Onuchic J.N., Socci N.D., Wolynes P.G. Funnels, pathways, and the energy landscape of protein folding: a synthesis. Proteins. 1995;21:167–195. doi: 10.1002/prot.340210302. [DOI] [PubMed] [Google Scholar]
- 3.Scott K.A., Batey S., Hooton K.A., Clarke J. The folding of spectrin domains I: wild-type domains have the same stability but very different kinetic properties. J. Mol. Biol. 2004;344:195–205. doi: 10.1016/j.jmb.2004.09.037. [DOI] [PubMed] [Google Scholar]
- 4.Scott K.A., Randles L.G., Clarke J. The folding of spectrin domains II: Φ-value analysis of R16. J. Mol. Biol. 2004;344:207–221. doi: 10.1016/j.jmb.2004.09.023. [DOI] [PubMed] [Google Scholar]
- 5.Scott K.A., Randles L.G., Moran S.J., Daggett V., Clarke J. The folding pathway of spectrin R17 from experiment and simulation: using experimentally validated MD simulations to characterize states hinted at by experiment. J. Mol. Biol. 2006;359:159–173. doi: 10.1016/j.jmb.2006.03.011. [DOI] [PubMed] [Google Scholar]
- 6.Wensley B.G., Batey S., Bone F.A.C., Chan Z.M., Tumelty N.R., Steward A. Experimental evidence for a frustrated energy landscape in a three-helix-bundle protein family. Nature. 2010;463:685–688. doi: 10.1038/nature08743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wensley B.G., Gärtner M., Choo W., Batey S., Clarke J. Different members of a simple three-helix bundle protein family have very different folding rate constants and fold by different mechanisms. J. Mol. Biol. 2009;390:1074–1085. doi: 10.1016/j.jmb.2009.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Yan Y., Winograd E., Viel A., Cronin T., Harrison S.C., Branton D. Crystal structure of the repetitive segments of spectrin. Science. 1993;262:2027–2030. doi: 10.1126/science.8266097. [DOI] [PubMed] [Google Scholar]
- 9.Winograd E., Hume D., Branton D. Phasing the conformational unit of spectrin. Proc. Natl Acad. Sci. USA. 1991;88:10788–10791. doi: 10.1073/pnas.88.23.10788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Pascual J., Pfuhl M., Walther D., Saraste M., Nilges M. Solution structure of the spectrin repeat: a left-handed antiparallel triple-helical coiled-coil. J. Mol. Biol. 1997;273:740–751. doi: 10.1006/jmbi.1997.1344. [DOI] [PubMed] [Google Scholar]
- 11.Pascual J., Pfuhl M., Rivas G., Pastore A., Saraste M. The spectrin repeat folds into a three-helix bundle in solution. FEBS Lett. 1996;383:201–207. doi: 10.1016/0014-5793(96)00251-7. [DOI] [PubMed] [Google Scholar]
- 12.Kusunoki H., Minasov G., Macdonald R.I., Mondragon A. Independent movement, dimerization and stability of tandem repeats of chicken brain α-spectrin. J. Mol. Biol. 2004;344:495–511. doi: 10.1016/j.jmb.2004.09.019. [DOI] [PubMed] [Google Scholar]
- 13.Scott K.A., Clarke J. Spectrin R16: broad energy barrier or sequential transition states? Protein Sci. 2005;14:1617–1629. doi: 10.1110/ps.051377105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wensley B.G., Kwa L., Shammas S.L., Rogers J.M., Browning S., Yang Z., Clarke J. Separating the effects of internal friction and transition state energy to explain the slow, frustrated folding of spectrin domains. Proc. Natl Acad. Sci. USA. 2012;18 doi: 10.1073/pnas.1201793109. published ahead of print June 18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bryngleson J.D., Wolynes P.G. Intermediates and barrier crossing in a random energy model (with applications to protein folding) J. Phys. Chem. 1989;93:6902–6915. [Google Scholar]
- 16.Chrunyk B.A., Mathews C.R. Role of diffusion in the folding of the α subunit of tryptophan synthase from Escherischia coli. Biochemistry. 1990;29:2149–2154. doi: 10.1021/bi00460a027. [DOI] [PubMed] [Google Scholar]
- 17.Jacob M., Schindler T., Balbach J., Schmid F.X. Diffusion control in an elementary protein folding reaction. Proc. Natl Acad. Sci. USA. 1997;94:5622–5627. doi: 10.1073/pnas.94.11.5622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Plaxco K.W., Baker D. Limited internal friction in the rate-limiting step of a two-state protein folding reaction. Proc. Natl Acad. Sci. USA. 1998;95:13591–13596. doi: 10.1073/pnas.95.23.13591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Bhattacharyya R.P., Sosnick T.R. Viscosity dependence of the folding kinetics of a dimeric and monomeric coiled coil. Biochemistry. 1999;38:2601–2609. doi: 10.1021/bi982209j. [DOI] [PubMed] [Google Scholar]
- 20.Pradeep L., Udgaonkar J.B. Diffusional barrier in the unfolding of a small protein. J. Mol. Biol. 2007;366:1016–1028. doi: 10.1016/j.jmb.2006.11.064. [DOI] [PubMed] [Google Scholar]
- 21.Qiu L.L., Hagen S.J. Internal friction in the ultrafast folding of the trpyophan cage. Chem. Phys. 2004;312:327–333. [Google Scholar]
- 22.Gianni S., Guydosh N.R., Khan F., Caldas T.D., Mayor U., White G.W. Unifying features in protein-folding mechanisms. Proc. Natl Acad. Sci. USA. 2003;100:13286–13291. doi: 10.1073/pnas.1835776100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kusunoki H., MacDonald R.I., Mondragon A. Structural insights into the stability and flexibility of unusual erythroid spectrin repeats. Structure. 2004;12:645–656. doi: 10.1016/j.str.2004.02.022. [DOI] [PubMed] [Google Scholar]
- 24.Kramers H.A. Brownian motion in a field of force and the diffusion model of chemical reactions. Physica. 1940;7:284–304. [Google Scholar]
- 25.Ansari A., Jones C.M., Henry E.R., Hofrichter J., Eaton W.A. The role of solvent viscosity in the dynamics of protein conformational changes. Science. 1992;256:1796–1798. doi: 10.1126/science.1615323. [DOI] [PubMed] [Google Scholar]
- 26.Cellmer T., Henry E.R., Hofrichter J., Eaton W.A. Measuring internal friction of an ultrafast-folding protein. Proc. Natl Acad. Sci. USA. 2008;105:18320–18325. doi: 10.1073/pnas.0806154105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Timasheff S.N. The control of protein stability and association by weak interactions with water: how do solvents affect these processes? Annu. Rev. Biophys. Biomol. Struct. 1993;22:67–97. doi: 10.1146/annurev.bb.22.060193.000435. [DOI] [PubMed] [Google Scholar]
- 28.Jas G.S., Eaton W.A., Hofrichter J. Effect of viscosity on the kinetics of α-helix and β-hairpin formation. J. Phys. Chem. B. 2001;105:261–272. [Google Scholar]
- 29.Ladurner A.G., Fersht A.R. Glutamine, alanine or glycine repeats inserted into the loop of a protein have minimal effects on stability and folding rates. J. Mol. Biol. 1997;273:330–337. doi: 10.1006/jmbi.1997.1304. [DOI] [PubMed] [Google Scholar]
- 30.Matthews J.M., Fersht A.R. Exploring the energy surface of protein folding by structure–reactivity relationships and engineered proteins: observation of Hammond behavior for the gross structure of the transition state and anti-Hammond behavior for structural elements for unfolding/folding of barnase. Biochemistry. 1995;34:6805–6814. doi: 10.1021/bi00020a027. [DOI] [PubMed] [Google Scholar]
- 31.Scott K.A., Alonso D.O., Sato S., Fersht A.R., Daggett V. Conformational entropy of alanine versus glycine in protein denatured states. Proc. Natl Acad. Sci. USA. 2007;104:2661–2666. doi: 10.1073/pnas.0611182104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Mayor U., Johnson C.M., Daggett V., Fersht A.R. Protein folding and unfolding in microseconds to nanoseconds by experiment and simulation. Proc. Natl Acad. Sci. USA. 2000;97:13518–13522. doi: 10.1073/pnas.250473497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Mayor U., Grossmann J.G., Foster N.W., Freund S.M., Fersht A.R. The denatured state of Engrailed Homeodomain under denaturing and native conditions. J. Mol. Biol. 2003;333:977–991. doi: 10.1016/j.jmb.2003.08.062. [DOI] [PubMed] [Google Scholar]
- 34.Religa T.L., Markson J.S., Mayor U., Freund S.M., Fersht A.R. Solution structure of a protein denatured state and folding intermediate. Nature. 2005;437:1053–1056. doi: 10.1038/nature04054. [DOI] [PubMed] [Google Scholar]
- 35.White G.W.N., Gianni S., Grossmann J.G., Jemth P., Fersht A.R., Daggett V. Simulation and experiment conspire to reveal cryptic intermediates and a slide from nucleation–condensation to framework mechanism of folding. J. Mol. Biol. 2005;350:757–775. doi: 10.1016/j.jmb.2005.05.005. [DOI] [PubMed] [Google Scholar]
- 36.Religa T.L., Johnson C.M., Vu D.M., Brewer S.H., Dyer R.B., Fersht A.R. The helix–turn–helix motif as an ultrafast independently folding domain: the pathway of folding of Engrailed homeodomain. Proc. Natl Acad. Sci. USA. 2007;104:9272–9277. doi: 10.1073/pnas.0703434104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Capaldi A.P., Kleanthous C., Radford S.E. Im7 folding mechanism: misfolding on a path to the native state. Nat. Struct. Biol. 2002;9:209–216. doi: 10.1038/nsb757. [DOI] [PubMed] [Google Scholar]
- 38.Kaya H., Chan H.S. Origins of chevron rollovers in non-two-state protein folding kinetics. Phys. Rev. Lett. 2003;90:258104. doi: 10.1103/PhysRevLett.90.258104. [DOI] [PubMed] [Google Scholar]
- 39.Pace C.N. Determination and analysis of urea and guanidine hydrochloride denaturation curves. Methods Enzymol. 1986;131:266–280. doi: 10.1016/0076-6879(86)31045-0. [DOI] [PubMed] [Google Scholar]
- 40.Bachmann A., Kiefhaber T. Apparent two-state tendamistat folding is a sequential process along a defined route. J. Mol. Biol. 2001;306:375–386. doi: 10.1006/jmbi.2000.4399. [DOI] [PubMed] [Google Scholar]
- 41.Szabo Z.G. Kinetic characterisation of complex reaction systems. In: Bamford C.H., Tipper C.F.H., editors. Vol. 2. Elsevier; Amsterdam, The Netherlands: 1969. pp. 1–81. (Comprehensive Chemical Kinetics). [Google Scholar]
- 42.Ternstrom T., Mayor U., Akke M., Oliveberg M. From snapshot to movie: Φ analysis of protein folding transition states taken one step further. Proc. Natl Acad. Sci. USA. 1999;96:14854–14859. doi: 10.1073/pnas.96.26.14854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Oliveberg M., Tan Y.J., Silow M., Fersht A.R. The changing nature of the protein folding transition state: implications for the shape of the free-energy profile for folding. J. Mol. Biol. 1998;277:933–943. doi: 10.1006/jmbi.1997.1612. [DOI] [PubMed] [Google Scholar]
- 44.Otzen D.E., Kristensen O., Proctor M., Oliveberg M. Structural changes in the transition state of protein folding: alternative interpretations of curved chevron plots. Biochemistry. 1999;38:6499–6511. doi: 10.1021/bi982819j. [DOI] [PubMed] [Google Scholar]
- 45.Oliveberg M. Characterisation of the transition states for protein folding: towards a new level of mechanistic detail in protein engineering analysis. Curr. Opin. Struct. Biol. 2001;11:94–100. doi: 10.1016/s0959-440x(00)00171-8. [DOI] [PubMed] [Google Scholar]
- 46.Silow M., Oliveberg M. High-energy channeling in protein folding. Biochemistry. 1997;36:7633–7637. doi: 10.1021/bi970210x. [DOI] [PubMed] [Google Scholar]
- 47.Fersht A.R., Matouschek A., Serrano L. The folding of an enzyme. I. Theory of protein engineering analysis of stability and pathway of protein folding. J. Mol. Biol. 1992;224:771–782. doi: 10.1016/0022-2836(92)90561-w. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.