Abstract
Proteins can sample a variety of partially folded conformations during the transition between the unfolded and native states. Some proteins never significantly populate these high-energy states and fold by an apparently two-state process. Many proteins, however, populate detectable, partially folded forms during the folding process. The role of such intermediates is a matter of considerable debate. A single amino acid change can convert E. coli ribonuclease H from a three-state folder that populates a kinetic intermediate to one that folds in an apparent two-state fashion. We have compared the folding trajectories of the three-state and two-state RNases H, proteins with the same native state topology but altered regional stability, using a protein engineering approach. Our data suggest that that both versions of RNase H fold through a similar trajectory with similar high-energy conformations. Mutations in the core and the periphery of the protein affect similar aspects of folding for both variants, suggesting a common trajectory with folding of the core region followed by the folding of the periphery. Our results suggest that formation of specific partially folded conformations may be a general feature of protein folding that can promote, rather than hinder, efficient folding.
INTRODUCTION
Elucidating the high-energy conformations a protein samples as it traverses its folding landscape is the key to understanding which factors determine its folding trajectory. Many proteins are known to populate intermediates, and the role of these partially formed species has been the subject of much debate.1 Intermediates may aid the protein in its search for the native state by greatly reducing the number of accessible conformations.2 Alternatively, intermediates containing non-native interactions can result in a kinetic trap, inhibiting the folding progress and in some cases leading to the formation of disease-associated amyloid fibrils.3 What characteristics of these intermediates contribute to their helpful or harmful nature? Furthermore, understanding the nature of the rate-limiting transition state can lend insight into what restrictions are placed on an efficient folding process.4 How strictly encoded are these species? Can perturbations in these regions of the landscape uncover alternative folding routes to the same native structure?
Several factors are known to guide a protein’s folding trajectory. The native state structure, or topology, is clearly important. Contact order, the average sequence separation of native contacts, was the first topological parameter observed to correlate with the free-energy barrier for simple two-state folding proteins.5 Contact order itself, however, does not appear to be the defining factor,6 and other structural parameters such as long-range order,7 secondary structure content,8 and the total number of long-range contacts9 have also been observed to correlate with folding rates. Circular permutants of the ribosomal protein S610 and α-spectrin SH311 fold through different transition states, demonstrating the malleability of the pathway in response to changes in topology.
Conversely, comparing the folding trajectories within families of proteins shows that, in many cases, the general properties of the folding pathway are conserved while the details vary widely, highlighting the importance of sequence-dependent effects12; 13; 14; 15; 16; 17. While topology clearly plays a role in determining folding pathways, the detailed physical basis of its role is unclear.
Local or regional stability has also been implicated in guiding the folding process and intermediate formation. For the proteins cytochrome c18 and RNase H,19 the stability of partially folded forms identified by native-state hydrogen exchange has been shown to correspond with the order of structure formation along the kinetic pathway; the most stable regions appear to fold first. Folding pathways have also been re-routed based on thermodynamic considerations. Tripp and Barrick successfully moved the transition state of an ankyrin repeat protein by introducing stabilized consensus repeats at various locations in the Notch ankyrin domain.20 These designed repeats were observed to fold first, showing that local energetics guide the folding mechanism of this protein. A similar plasticity was demonstrated with the structurally similar proteins G and L. In the transition state of protein L, the first β hairpin is structured while the second is not, and the opposite is true for the transition state of protein G.21 The first β hairpin of protein G was redesigned to increase its stability, and the folding of the resulting protein proceeded through the stabilized first turn so that it resembled the folding pathway of protein L.22
Here, we set out to explore the relative contributions of topology and regional stability by comparing variants of the same protein that fold via distinctly different kinetics. The energy landscape of E. coli RNase H has been well-characterized using native-state hydrogen exchange,23; 24; 25 stopped-flow kinetics, quenched-flow hydrogen exchange,19 and protein engineering.26 The protein populates a kinetic refolding intermediate whose structure is mirrored by a partially folded form detected by the equilibrium native-state hydrogen exchange experiment. Recently, single-molecule experiments confirmed that the intermediate is on-pathway and obligatory.27 This partially folded form is characterized by a high degree of secondary structure formation in the protein core and a relatively unfolded periphery (Figure 1).
In a point mutant that introduces a potentially-charged residue into this core, I53D, the intermediate is no longer detected, and the protein folds with an apparently two-state mechanism.28 Using this mutant, we compare the folding mechanisms of a three-state and a two-state version of the protein that differ by only one amino acid. The I53D mutation destabilizes the protein, and the addition of subsequent mutations was expected to result in further destabilization. To counter this effect and ensure sufficient stability of all constructs, we used a stabilized mutant, D10A, as our reference protein. 24; 26 This mutation relieves charge repulsion in the periphery of the protein and stabilizes the protein by more than 3 kcal/mol. Introducing this mutation also allowed us to ask if destabilizing the core and stabilizing the periphery simultaneously (D10A/I53D) results in an altered kinetic trajectory in which folding proceeds through a more diffuse transition state with looser interactions over a greater portion of the protein. A phi-value (ϕ-value) analysis on these variant proteins that share the same RNase H fold, but differ dramatically in their distribution of regional stability, allow us to evaluate the nature of the transition state for both two-state and three-state folders.
RESULTS
Our reference protein for three-state folding is the variant D10A RNase H, referred to herein as RNH3. 24; 26 The variant D10A/I53D, abbreviated as RNH2, serves as our two-state reference protein. These two proteins differ only by a single mutation: the insertion of a potentially charged residue in the core of the protein (Figure 1).
Design of Mutants
Significant changes in stability without notable alterations in structure are required to measure ϕ-values.29 Since we sought to examine the effect of core mutations on both the three-state and two-state variants, we did not want to destabilize the core of the protein so much that the intermediate would no longer be apparent, as is the case with I53D. This requirement places an upper limit on ΔΔGUN of approximately 3.5 kcal/mol (the stability of the intermediate) for the effect of mutations in this region. A variant containing a mutation in helix D of the core, Q105G (Figure 1a), was previously characterized in the wild type background and is known to destabilize the molecule by just over 2 kcal/mol, making this a good target.26 The Trp at position 85 in helix C (Figure 1a) was also targeted as a likely residue to destabilize the core. Ile 25 and Phe 8 were chosen as the structural probes for the periphery. Both residues are part of a hydrophobic cluster in this region (Figure 1b).
Equilibrium Characterization of the Mutants
The circular dichroism (CD) spectra of all of the proteins used in this study are shown in Figure 2 (insets). All variants exhibit similar profiles characteristic of the RNase H α/β fold, indicating that the mutations do not significantly disturb the native structure. The stabilities of the variants, as measured by the free energy of unfolding, were determined by urea-induced denaturation profiles assuming a two-state transition and linear extrapolation.30 A representative normalized denaturation melt of each protein is shown in Figure 2.
Treatment of the equilibrium data
When each denaturation profile was fit independently, variation in the m-value (the denaturant-dependence of the free energy) never exceeded 10 %. Fixing the m-value for all the proteins to that obtained for both the wild type and RNH3 variant (m = 2.1 kcal mol−1 M−1)26 fit the data equally well. Since the magnitude of the stability change between the reference proteins and the variants is especially crucial for the calculation of ϕ-values and is highly dependent on the m-value, we calculated the ϕ-values three different ways: 1) using ΔΔGUN’s obtained from urea melts with no constraint on the m-value, 2) using ΔΔGUN’s from the fits with fixed m-values, and 3) using ΔΔGUN’s calculated from fits to the kinetic data (see Tables 1 and 2 for a complete summary). Although absolute values vary somewhat, the general trends and conclusions remain the same. For this reason, we describe the effect of the mutations in terms of the data obtained by fixing the m-values.
Table 1.
Periphery | Core | Periphery | Core | |||||
---|---|---|---|---|---|---|---|---|
RNH3* | I25A RNH3 | F8A RNH3 | Q105G RNH3 | W85A RNH3 | F8A | W85A | ||
Δ G urea melt (kcal mol −1 ) | 13 | 8.2 (0.4) | 10.0 (0.2) | 10.6 (0.5) | 11.3 (0.5) | 6.5 (0.5) | 6.8 (0.5) | |
meq urea melt (kcal mol −1 M−1 ) | 2.1 | 2.2 (0.1) | 2.14 (0.03) | 2.11 (0.05) | 2.24 (0.1) | 1.9 (0.2) | 1.9 (0.1) | |
Δ G fixed m (kcal mol −1 ) | 7.86 (0.07) | 9.78 (0.07) | 10.5 (0.2) | 10.56 (0.06) | 7.05 (0.08) | 7.5 (0.1) | ||
Δ G kinetic (kcal mol −1 ) | 13.4 | 8.1 | 11.02 | 11.9 | 11.67 | 9.22 | 8.6 | |
m kinetic (kcal mol −1 M−1 ) | 2.25 | 2.13 | 2.35 | 2.43 | 2.37 | 2.7 | 2.48 | |
kin (H 2O) (s −1 ) | 1.89 | 0.37 | 0.55 | 0.59 | 0.48 | 0.227 | 0.1514 | |
min (kcal mol −1 M−1 ) | 0.28 | 0.49 | 0.50 | 0.15 | 0.19 | 0.647 | 0.415 | |
kni (H 2O) (s −1 ) | 1.5E-07 | 4.4E-05 | 2.6E-06 | 5.5E-08 | 1.6E-07 | 9.50E-05 | 2.50E-05 | |
mni (kcal mol −1 M−1 ) | −0.48 | −0.49 | −0.48 | −0.60 | −0.51 | −0.529 | −0.472 | |
Kui | 585 | 110 | 611 | 54 | 133 | 2580 | 350 | |
mui (kcal mol −1 M−1 ) | 1.50 | 1.15 | 1.37 | 1.68 | 1.66 | 1.505 | 1.596 | |
Δ Gui (kcal mol −1 ) | 3.76 | 2.77 | 3.79 | 2.35 | 2.89 | 4.64 | 3.46 | |
ϕ I | 1 | 0.19 | −0.01 | 0.56 | 0.36 | −0.42 | 0.04 | |
2 | 0.2 | 0.0 | 0.6 | 0.5 | −0.1 | 0.0 | ||
3 | 0.19 | −0.01 | 0.94 | 0.51 | −1.64 | 0.07 | ||
ϕ t.s. | 1 | 0.35 | 0.47 | 1.23 | 0.97 | 0.51 | 0.78 | |
2 | 0.3 | 0.4 | 1.2 | 1.0 | 0.4 | 0.5 | ||
3 | 0.37 | 0.29 | 1.39 | 0.96 | −0.93 | 0.62 | ||
β T | 1 | 0.77 | 0.77 | 0.77 | 0.71 | 0.76 | 0.75 | 0.78 |
2 | 0.8 | 0.8 | 0.8 | 0.7 | 0.8 | 0.7 | 0.8 | |
3 | 0.79 | 0.77 | 0.80 | 0.75 | 0.78 | 0.80 | 0.81 |
These values were previously determined by Raschke, Kho, and Marqusee.26
1. calculated using a fixed m-value of 2.1
2. calculated using the m-value from urea melts
3. calculated using the m-value from the kinetic data; mkinetic = mui + min − mni
Table 2.
Periphery | Core | |||||
---|---|---|---|---|---|---|
RNH2 | I25A RNH2 | F8A RNH2 | Q105G RNH2 | W85A RNH2 | ||
Δ G urea melt (kcal mol −1 ) | 8.5 (0.4) | 4.8 (0.1) | 6.32 (0.4) | 7.1 (0.5) | 7.2 (0.3) | |
meq urea melt (kcal mol −1 M−1 ) | 2.0 (0.1) | 2.24 (0.01) | 2.0 (0.1) | 2.1 (0.1) | 2.08 | |
Δ G fixed m (kcal mol −1 ) | 8.9 (0.1) | 4.5 (0.1) | 6.62 (0.08) | 7.05 (0.08) | 7.219 (0.006) | |
Δ G kinetic (kcal mol −1 ) | 9.09 | 4.60 | 6.56 | 7.24 | 7.76 | |
m kinetic (kcal mol −1 M−1 ) | 2.07 | 2.05 | 2.13 | 2.31 | 2.34 | |
kun (H 2O) (s −1 ) | 0.40 | 0.058 | 0.126 | 0.020 | 0.037 | |
mun (kcal mol −1 M−1 ) | 1.51 | 1.58 | 1.64 | 1.62 | 1.67 | |
knu (H 2O) (s −1 ) | 8.10E-08 | 2.40E-05 | 2.10E-06 | 9.54E-08 | 7.61E-08 | |
mnu (kcal mol−1 M−1) | −0.56 | −0.47 | −0.49 | −0.69 | −0.67 | |
ϕt.s. | 1 | 0.26 | 0.30 | 0.95 | 0.84 | |
2 | 0.31 | 0.31 | 1.26 | 1.05 | ||
3 | 0.25 | 0.27 | 0.95 | 1.06 | ||
βT | 1 | 0.72 | 0.75 | 0.78 | 0.77 | 0.80 |
2 | 0.76 | 0.71 | 0.80 | 0.77 | 0.80 | |
3 | 0.73 | 0.77 | 0.77 | 0.70 | 0.71 |
1. calculated using a fixed m-value of 2.1
2. calculated using the m-value from urea melts
3. calculated using the m-value from the kinetic data
Core mutations
The core mutation Q105G was previously shown to destabilize the wild type protein by 2.2 kcal/mol.26 In the background of RNH3, this mutation has a similar effect, bringing the global stability to 10.5 kcal/mol (compare to 13 kcal/mol for RNH326). In RNH2, this mutation appears to be slightly less destabilizing (ΔΔGUN ~1.9 kcal/mol). The core mutation W85A destabilizes the wild type protein and RNH3 to a similar extent, 2.2 and 2.4 kcal/mol respectively, and the magnitude of this effect also decreases in the background of RNH2, with a change in stability of 1.7 kcal/mol.
Periphery mutations
I25A has an unexpectedly dramatic effect on the stability of RNase H; this variant destabilizes the periphery enough to populate the intermediate under native conditions in the wild type background, prohibiting a full kinetic analysis. In the stabilized background of RNH3, this mutation destabilizes the protein by approximately 5.1 kcal/mol. In the two-state background, RNH2, I25A destabilizes the protein by 4.4 kcal/mol. The F8A mutation destabilizes the protein by 2.6 kcal/mol in the context of the wild type protein and 3.2 kcal/mol in the background of RNH3. The effect of this mutation decreases to 2.3 kcal/mol in the two-state background of RNH2.
Kinetic phi-value analysis in the two-state and three-state RNases H
Core mutations
The mutations introduced into the core, while destabilizing, were selected to avoid being so disruptive that they abrogate detection of the burst-phase kinetic intermediate. Indeed, a large percentage of the CD signal was acquired in the dead time of the stopped-flow instrument for both Q105G RNH3 and W85A RNH3; these burst-phase signals melted out cooperatively with urea (Figure 3a), as expected for the formation of an early intermediate. Furthermore, the chevron plots display the significant rollover in the folding limbs characteristic of the formation of an intermediate. These data were therefore fit using a three-state mechanism with an on-pathway intermediate in pre-equilibrium with the unfolded state:31
Scheme 1 |
The stability of the intermediate was inferred from an analysis of the burst phase amplitudes as a function of urea (Figure 3a)28. Both Q105G and W85A shift the Cm of the intermediate to lower urea concentrations. The ratio of the change in stability to the intermediate and the native state, ϕI, for Q105G is 0.9 (Table 1), indicating that native-like interactions are formed at this position in the intermediate. W85A, in helix C, has a ϕI-value of 0.5, suggesting that there may be some native-like interactions in the intermediate, but that they are only partially formed or are fully formed in only some portion of the molecules in the ensemble.
If these two residues are also structured in the transition state, as is expected, the change in stability should manifest itself in a shift in the folding limb of the chevron while the unfolding limb remains unchanged. This is indeed observed for both Q105G RNH3 and W85A RNH3 (Figure 3a). The quantitative effects of these core mutations corroborate the qualitative observations. The ϕt.s.-values, the ratio of the change in stability of the transition state to that of the native state (Equation 1 in Materials and Methods), for both of these mutants are high, 1.3 for Q105G and 0.96 for W85A (Table 1). The ϕt.s.-value over 1 for Q105G arises from an unfolding rate that is actually slower than that of the reference protein. Given the errors inherent in the calculation of ϕ-values, we feel it is best to interpret them qualitatively as low, medium, and high rather than quantitatively.32; 33 ϕt.s.˜-values for these core residues are classified as high, confirming that, as expected, the core region is very important for the energetics of the transition state.
In the background of RNH2, no burst phase intermediate was observed, and there was no apparent rollover in the folding limb of any of the variants (Figure 3b). The chevrons for these proteins were fit to a simple two-state mechanism:
Scheme 2 |
These mutations show qualitatively similar shifts in the folding limb of the RNH2 variant as they do in RNH3. Q105G RNH2 and W85A RNH2 have very similar unfolding rates in water as RNH2 but have a steeper unfolding limb, the implications of which are discussed below. The ϕt.s.-values (Equation 2 in Materials and Methods) are high (0.95 for Q105G and 1.06 for W85A (Table 2)), similar to those calculated for these mutations in the background of RNH3. Therefore, in both the three-state and two-state proteins, it appears that the core residues are in a native-like conformation in the rate-limiting transition state.
Periphery Mutations
I25A RNH3 and F8A RNH3 both show a notable burst phase amplitude and rollover in the folding limb of the chevron plots (Figure 3c), and the data were therefore fit to the three-state mechanism shown in Scheme 1. Mutations in the periphery of RNH3 are not expected to affect the energetics of the intermediate or the transition state of the protein, and indeed, in contrast to the core mutations, the Cm for the burst-phase amplitudes in these periphery mutants is similar to that of RNH3 (Figure 3c). The ϕI-value for I25A is 0.2 and 0 for F8A (Table 1), confirming that these residues are relatively unstructured in the intermediate. The unfolding limbs of I25A RNH3 and F8A RNH3 are clearly shifted from that of RNH3; these variants unfold more rapidly than the reference protein. The folding limbs also exhibit some shift, however, suggesting that these mutations do not leave the stability of the transition state totally unaffected. The ϕt.s.-value (Equation 1) is 0.35 for I25A RNH3 and 0.47 for F8A RNH3 (Table 1). These are intermediate ϕ-values, indicating that there may be some limited native-like structure in this region of the transition state.
In the two-state background, there is no evidence of rollover and no apparent burst phase intermediate (Figure 3d); hence these data were fit to Scheme 2. The trends observed for these mutations in the background of RNH2 are similar to those in RNH3. Dramatic shifts are evident in the unfolding limbs of I25A RNH2 and F8A RNH2 while the folding limbs are slightly shifted. The ϕt.s.-values (Equation 2) are also similar to those in the background of RNH3, at 0.25 and 0.27 for I25A RNH2 and F8A RNH2, respectively (Table 2).
Destabilized core/stabilized periphery
We also examined the kinetic trajectory of W85A and F8A in the wild type background (the kinetics of Q105G were reported previously26, and I25A cannot be examined in the wild type background). This allows a direct comparison between the wild type mechanism and that of RNH2, in which the unfolded part of the intermediate and transition state is stabilized (D10A) and the folded part is destabilized (I53D), a more drastic alteration of the relative stability of the core and periphery.
W85A WT (W85A in the wild type background) shows a rollover in the folding limb along with cooperative melting of burst phase amplitudes (Figure 4a). Although the rollover is less apparent for F8A WT (F8A in the wild type background), this variant also has a large burst-phase amplitude (Figure 4b). Scheme 1 was therefore used to fit the chevron plots of both variants. Qualitatively it is clear from the burst phase signals that the core mutation compromises the stability of the intermediate while the periphery mutation leaves it unaffected, although the ϕI-values are not well defined (Table 1). Because the intermediate is only present in very limited range of urea concentrations, the value for mUI, defined by the curvature in the chevron, is ill defined for F8A, and the intermediate appears stabilized, resulting in negative ϕI-values. Similarly, the mUI value for W85A is larger than that of wild type, resulting in a calculated value for ΔΔGUI very close to that of wild type. However, it is clear from the burst phase amplitudes that the intermediate is indeed destabilized.
The ϕt.s.-values, calculated from the unfolding limb data (Equation 1), appear more reliable. W85A, with a value of 0.8, clearly affects the energetics of the transition state. F8A shows a moderate ϕt.s.-value of 0.5. These are very similar to those calculated in the background of RNH3, although slightly less polarized. It is clear that the transition state of RNH2 is no more diffuse than that of the wild type protein; if anything, structure is more localized to the core.
Burial of solvent accessible surface area in the transition state
The ratio of solvent accessible surface area buried in the transition state compared to the native state, mt.s./meq, where mt.s. is the change in solvent accessible surface area upon folding and meq is the equilibrium m-value, is quantified as the Tanford β-value (βT). βT was calculated as 1 – (mni/meq) for the three-state proteins, where mni represents the denaturant dependence of the unfolding rate constant. mt.s. was taken directly as mun for the two-state proteins, where mun is the denaturant dependence of the folding rate constant. For all variants studies, βT is 0.7 – 0.8, indicating that 70 to 80 % of the total surface area buried upon folding is buried in the transition state (Tables 1 and 2). This speci24es is similar in compactness across all variants.
DISCUSSION
Both native state topology and regional stability have been shown to play a role in determining protein-folding pathways. To further investigate the relative contributions of these factors, we manipulated the distribution of regional stability in E. coli RNase H through mutation while maintaining native state topology. One of these mutations, in the core of the molecule, transforms the three-state folder into one that folds without the accumulation of an intermediate. We used phi-value analysis to compare the folding trajectories of the three-state and two-state reference proteins and found that RNase H folds through a structured core and comparatively unstructured periphery regardless.
Coupling of distal regions of RNH2
Each of the four mutations chosen for this investigation destabilized the two-state protein less than the three-state protein. Given that the only difference between these proteins is a single mutation at position 53, this non-additivity suggests that the charge introduced at residue 53 couples to other regions of the protein. Applying a thermodynamic cycle suggests that the magnitude of this coupling effect ranges from 0.6 kcal/mol (residue 105) to 0.9 kcal/mol (residue 8); the value is 0.7 kcal/mol for positions 85 and 25 (Figure 5). Since coupling between the core and the periphery has not been detected in previous experiments,25 this suggests that the coupling is due to the insertion of a charge at position 53 in the background of D10A.
The folding mechanism of RNase H is robust; two-state and three-state RNases H fold through similar pathways
With the introduction of a single mutation into the folding core of RNase H (I53D), the protein is converted from a three-state folder with an observable intermediate to one that folds in an apparently two-state mechanism. A comparison of the ϕt.s.–values for each site in each of these backgrounds is presented in Table 3. RNH3 shows high values for the core mutations and intermediate values for the periphery mutations. These intermediate values suggest that although it is less involved, some regions of the periphery may be more important for the structure of the transition state than originally proposed during the initial discovery of this trajectory.19; 26 It is clear, however, that the transition state of RNH3 is highly polarized with a native-like core and a comparatively unstructured periphery. The periphery of the two-state protein appears to be no more structured in the transition state than in the three-state protein, with very similar ϕ-values for each mutation. Furthermore, all variants bury a similar proportion of solvent accessible surface area upon folding.
Table 3.
ϕt.s. | ||||
---|---|---|---|---|
Periphery | Core | |||
I25A | F8A | Q105G | W85A | |
Three-State | 0.35 | 0.47 | 1.23 | 0.97 |
Two-State | 0.31 | 0.30 | 0.95 | 0.84 |
wt | 0.51 | 1.19 | 0.78 |
The wild type trajectory can also be compared to that of RNH2, which has two residue changes resulting in a stabilized periphery as well as a destabilized core. Even these drastic changes in the distribution of stability do not appear to re-route folding. This is somewhat surprising considering that folding of the wild type enzyme appears to follow a hierarchical mechanism in which the most stable regions fold first. When the core is destabilized by more than 6 kcal/mol, in both the I53D/W85A and I53D /Q105G mutations, this region is still capable of initiating folding without apparently incorporating additional elements from the periphery. The stability of the core of the molecule only determines whether the protein folds through an observable intermediate in a three-state fashion or through an apparent two-state mechanism. When this region is destabilized to such an extent that this intermediate is no longer detectable, this region still dominates the folding trajectory and presumably the intermediate species still exists, albeit at a higher energy on the folding trajectory.28 Apparently, the topology of RNase H places strict constraints on the order of folding events and gives rise to the apparent robustness of this mechanism.
What makes RNase H different from proteins such as protein G and the Notch ankyrin domain, whose pathways were successfully re-routed based on thermodynamic considerations? One possible explanation is the simplified architecture of these two proteins. The notch protein is made up of repeats of the ankyrin domain, consisting entirely of sequence-local interactions. The modular arrangement of structurally identical units may account for its plasticity in response to changes in the distribution of stability. Proteins G and L are small (~60 residues) and symmetric in structure.34 Though globular, these proteins can be thought of as modular as well, with two structurally identical halves, in which folding proceeds through the half with the more stable β-hairpin.
Changes in mnu for RNH2 mutants
It is interesting to note that in the background of RNH2, the values for mnu of the core mutants increases quite drastically while they decrease very slightly for the periphery mutants. The slight decrease for I25A and F8A RNH2 results in kinetic m-values that agree nicely with the equilibrium m-values. This is not the case for the core mutants Q105G and W85A RNH2, whose kinetic m-values are greater than those calculated from the fit to the equilibrium denaturant melts. The increase in slope of the unfolding limb for the core mutants can also be detected in the background of RNH3, though to a much lesser extent.
Increased values for mnu upon mutation can arise from a movement of the transition state, a complete change in the barrier traversed upon folding, or ground state effects in the native state.35 The equilibrium m-values for all proteins studied are the same within error, and ground state effects are much less common for the native state than the unfolded state. Whether this increase in mnu reflects a complete change in mechanism under high urea or movement36 of the transition state is unclear and in a sense describes two extremes of the same phenomenon. This debate is beyond the scope of this paper. Nonetheless, ϕ-values calculated from the unfolding rates in 8M urea are ~0.5 as opposed to ~1, suggesting that in high urea, the protein may indeed not incorporate as much of the core into the transition state. Under conditions in which the native state is destabilized, movement of the structure of the transition state away from that of the native is indicative of anti-Hammond behavior. Similar increases in the m-value for unfolding upon mutation have been observed and interpreted as anti-Hammond behavior for TI I27.37 One explanation for the observed shift in mnu in our two state proteins is that in high levels of denaturant and with the folding core severely compromised, folding can proceed through a more loosely structured core.
Ruggedness in the energy landscape can be productive for folding
The results presented here support the argument that formation of specific ensembles of partially-folded species are a universal feature of folding that play an important role even for proteins that appear to proceed in a two-state manner. The development of new tools to study the progression of folding may uncover these important high-energy ensembles. For instance, novel NMR studies have recently uncovered a high energy intermediate for the FynSH3 domain, which was until recently considered to be a two-state system.36 Trapping partially folded intermediates at equilibrium to gain experimental access is another novel approach. This can be accomplished by introducing destabilizing mutations in the unfolded regions of the intermediate38; 39; 40 (also see Connell, Horner, and Marqusee, accompanying manuscript) or by constructing a fragment representing the folded region.41
Folding studies between different members of a protein family also suggest that two-state behavior masks some of the inherent complexities of folding. Both two-state and multi-state folding has been observed in the immunity proteins,13 immunoglobulin-like proteins,37 and members of the homeodomain family.42; 43 Im7 was observed to populate a kinetic intermediate while Im9, sharing 60% sequence identity, appeared two-state;13 however, under different conditions both proteins folded through an intermediate, albeit with different structural properties.17 Changing just a few key residues of Im9 led to the population of an intermediate closely resembling that of Im7 under the original conditions in which Im9 appeared two-state.14 Changes in sequence or in folding conditions have actually led to the switching from one kinetic mechanism to another for members of all of the above families, suggesting that high-energy structures are only detectable with increases in the ruggedness of the landscape large enough to bring the stability of the species below that of other states.
It is important to note that this increased ruggedness does not necessarily slow folding. In the case of RNase H, populating the intermediate actually promotes efficient folding; the folding rate of RNH3 is an order of magnitude higher than that of RNH2. Wagner and Kiefhaber44 explored the effect of intermediates on the rate of protein folding applying Kramer’s theory of diffusive barrier and found that introducing additional local barriers and valleys (roughness) can actually enhance the folding rate without altering the rate-limiting barrier. In light of the possibility that such partially folded ensembles are ubiquitous and that transient population of such species can promote rather than hinder folding, their characterization remains a crucial step in enhancing our understanding of folding.
Materials and Methods
Construction and purification of RNase H variants
The RNase H variants described in this manuscript were generated using the Quikchange mutagenesis protocol. A plasmid containing the gene encoding D10A RNase H26 was used as a template to create the double mutants. These variants were expressed and purified as described.19 The triple mutants were created using a plasmid containing D10A/I53D RNase H, pGH101, as a template. These variants expressed insolubly as inclusion bodies and were purified accordingly.28
CD Spectra
CD spectra were obtained on an Aviv 410 circular dichroism spectropolarimeter. Protein concentrations were determined based on the extinction coefficient, calculated according to the number of Trp and Tyr residues.45 The buffer conditions for all experiments were 20 mM sodium acetate pH 5.5 and 50 mM potassium chloride. Spectra were collected with a 1 mm pathlength cuvette at a protein concentration of 500 µg/mL.
Equilibrium urea denaturation
For each variant described, samples containing varying amounts of urea and 50 µg/mL protein were equilibrated for the required time and the CD signal was measured at 222 nm at 25°C in a 1 cm cuvette. The signal over 60 seconds was averaged for each sample. The slow folding and unfolding of many of the variants used in this study necessitates long equilibration times, over a week for the slowest folding proteins. All equilibrium denaturation melts were repeated at least three times. The data were fit to a two-state model with a linear free energy extrapolation.30 Even though the kinetics of the three-state proteins reveal a transient intermediate, at equilibrium this species never populates to more than a few percent due to its substantially lower stability compared to the native state. The equilibrium curves of the three-state proteins can be well-represented by a two-state model, as the unfolded and native forms are the dominant species.
Kinetics of folding and unfolding
Refolding of the mutants was initiated in one of two ways. For the double mutant in low denaturant conditions, refolding was initiated by a 1:11 dilution from a urea concentration in the unfolded baseline and monitored using an Aviv 202-SF stopped-flow device (1 mm pathlength). Final protein concentration in the stopped-flow device was at least 500 µg/mL. For the double mutants in higher urea conditions and for all triple mutants, refolding was induced by a 1:30 dilution via manual mixing and monitored using an Aviv 410 spectrometer. Unfolding for all variants was initiated by manual mixing and a 1:30 dilution from a urea concentration in the folded baseline. The final protein concentration for all manual mixing experiments was 50 µg/mL. Progression of folding and unfolding was monitored at 222 nm, and the resulting curves were fit to a single exponential. Kinetic data were fit as previously described.28
Calculation of ϕ-values
ϕ-values in the folding direction for the three-state variants were calculated as
Equation 1 |
For the two-state proteins, ϕ-values were calculated directly from the folding rates:
Equation 2 |
Acknowledgements
We thank the members of the Marqusee laboratory for helpful discussions and Rachel Bernstein for thoughtful comments on the manuscript. This work was supported by NIH grant GM50945 to S.M. and an NSF graduate research fellowship to K.B.C.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Roder H, Maki K, Cheng H. Early events in protein folding explored by rapid mixing methods. Chem Rev. 2006;106:1836–1861. doi: 10.1021/cr040430y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Brockwell DJ, Radford SE. Intermediates: ubiquitous species on folding energy landscapes? Curr Opin Struct Biol. 2007;17:30–37. doi: 10.1016/j.sbi.2007.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Jahn TR, Radford SE. The Yin and Yang of protein folding. FEBS J. 2005;272:5962–5970. doi: 10.1111/j.1742-4658.2005.05021.x. [DOI] [PubMed] [Google Scholar]
- 4.Matouschek A, Kellis JT, Jr, Serrano L, Fersht AR. Mapping the transition state and pathway of protein folding by protein engineering. Nature. 1989;340:122–126. doi: 10.1038/340122a0. [DOI] [PubMed] [Google Scholar]
- 5.Plaxco KW, Simons KT, Baker D. Contact order, transition state placement and the refolding rates of single domain proteins. J Mol Biol. 1998;277:985–994. doi: 10.1006/jmbi.1998.1645. [DOI] [PubMed] [Google Scholar]
- 6.Miller EJ, Fischer KF, Marqusee S. Experimental evaluation of topological parameters determining protein-folding rates. Proc Natl Acad Sci U S A. 2002;99:10359–10363. doi: 10.1073/pnas.162219099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gromiha MM, Selvaraj S. Comparison between long-range interactions and contact order in determining the folding rate of two-state proteins: application of long-range order to folding rate prediction. J Mol Biol. 2001;310:27–32. doi: 10.1006/jmbi.2001.4775. [DOI] [PubMed] [Google Scholar]
- 8.Gong H, Isom DG, Srinivasan R, Rose GD. Local secondary structure content predicts folding rates for simple, two-state proteins. J Mol Biol. 2003;327:1149–1154. doi: 10.1016/s0022-2836(03)00211-0. [DOI] [PubMed] [Google Scholar]
- 9.Makarov DE, Plaxco KW. The topomer search model: A simple, quantitative theory of two-state protein folding kinetics. Protein Sci. 2003;12:17–26. doi: 10.1110/ps.0220003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lindberg M, Tangrot J, Oliveberg M. Complete change of the protein folding transition state upon circular permutation. Nat Struct Biol. 2002;9:818–822. doi: 10.1038/nsb847. [DOI] [PubMed] [Google Scholar]
- 11.Viguera AR, Serrano L, Wilmanns M. Different folding transition states may result in the same native structure. Nat Struct Biol. 1996;3:874–880. doi: 10.1038/nsb1096-874. [DOI] [PubMed] [Google Scholar]
- 12.Dalessio PM, Ropson IJ. Beta-sheet proteins with nearly identical structures have different folding intermediates. Biochemistry. 2000;39:860–871. doi: 10.1021/bi991937j. [DOI] [PubMed] [Google Scholar]
- 13.Ferguson N, Capaldi AP, James R, Kleanthous C, Radford SE. Rapid folding with and without populated intermediates in the homologous four-helix proteins Im7 and Im9. J Mol Biol. 1999;286:1597–1608. doi: 10.1006/jmbi.1998.2548. [DOI] [PubMed] [Google Scholar]
- 14.Friel CT, Beddard GS, Radford SE. Switching two-state to three-state kinetics in the helical protein Im9 via the optimisation of stabilising non-native interactions by design. J Mol Biol. 2004;342:261–273. doi: 10.1016/j.jmb.2004.06.076. [DOI] [PubMed] [Google Scholar]
- 15.Geierhaas CD, Best RB, Paci E, Vendruscolo M, Clarke J. Structural comparison of the two alternative transition states for folding of TI I27. Biophys J. 2006;91:263–275. doi: 10.1529/biophysj.105.077057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Geierhaas CD, Paci E, Vendruscolo M, Clarke J. Comparison of the transition states for folding of two Ig-like proteins from different superfamilies. J Mol Biol. 2004;343:1111–1123. doi: 10.1016/j.jmb.2004.08.100. [DOI] [PubMed] [Google Scholar]
- 17.Gorski SA, Capaldi AP, Kleanthous C, Radford SE. Acidic conditions stabilise intermediates populated during the folding of Im7 and Im9. J Mol Biol. 2001;312:849–863. doi: 10.1006/jmbi.2001.5001. [DOI] [PubMed] [Google Scholar]
- 18.Bai Y, Sosnick TR, Mayne L, Englander SW. Protein folding intermediates: native-state hydrogen exchange. Science. 1995;269:192–197. doi: 10.1126/science.7618079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Raschke TM, Marqusee S. The kinetic folding intermediate of ribonuclease H resembles the acid molten globule and partially unfolded molecules detected under native conditions. Nat Struct Biol. 1997;4:298–304. doi: 10.1038/nsb0497-298. [DOI] [PubMed] [Google Scholar]
- 20.Tripp KW, Barrick D. Rerouting the folding pathway of the Notch ankyrin domain by reshaping the energy landscape. J Am Chem Soc. 2008;130:5681–5688. doi: 10.1021/ja0763201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.McCallister EL, Alm E, Baker D. Critical role of beta-hairpin formation in protein G folding. Nat Struct Biol. 2000;7:669–673. doi: 10.1038/77971. [DOI] [PubMed] [Google Scholar]
- 22.Nauli S, Kuhlman B, Baker D. Computer-based redesign of a protein folding pathway. Nat Struct Biol. 2001;8:602–605. doi: 10.1038/89638. [DOI] [PubMed] [Google Scholar]
- 23.Chamberlain AK, Handel TM, Marqusee S. Detection of rare partially folded molecules in equilibrium with the native conformation of RNaseH. Nat Struct Biol. 1996;3:782–787. doi: 10.1038/nsb0996-782. [DOI] [PubMed] [Google Scholar]
- 24.Goedken ER, Marqusee S. Native-state energetics of a thermostabilized variant of ribonuclease HI. J Mol Biol. 2001;314:863–871. doi: 10.1006/jmbi.2001.5184. [DOI] [PubMed] [Google Scholar]
- 25.Spudich G, Lorenz S, Marqusee S. Propagation of a single destabilizing mutation throughout the Escherichia coli ribonuclease HI native state. Protein Sci. 2002;11:522–528. doi: 10.1110/ps.37202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Raschke TM, Kho J, Marqusee S. Confirmation of the hierarchical folding of RNase H: a protein engineering study. Nat Struct Biol. 1999;6:825–831. doi: 10.1038/12277. [DOI] [PubMed] [Google Scholar]
- 27.Cecconi C, Shank EA, Bustamante C, Marqusee S. Direct observation of the three-state folding of a single protein molecule. Science. 2005;309:2057–2060. doi: 10.1126/science.1116702. [DOI] [PubMed] [Google Scholar]
- 28.Spudich GM, Miller EJ, Marqusee S. Destabilization of the Escherichia coli RNase H kinetic intermediate: switching between a two-state and three-state folding mechanism. J Mol Biol. 2004;335:609–618. doi: 10.1016/j.jmb.2003.10.052. [DOI] [PubMed] [Google Scholar]
- 29.Sanchez IE, Kiefhaber T. Origin of unusual phi-values in protein folding: evidence against specific nucleation sites. J Mol Biol. 2003;334:1077–1085. doi: 10.1016/j.jmb.2003.10.016. [DOI] [PubMed] [Google Scholar]
- 30.Pace CN, Shaw KL. Linear extrapolation method of analyzing solvent denaturation curves. Proteins. 2000;Suppl 4:1–7. doi: 10.1002/1097-0134(2000)41:4+<1::aid-prot10>3.3.co;2-u. [DOI] [PubMed] [Google Scholar]
- 31.Baldwin RL. On-pathway versus off-pathway folding intermediates. Fold Des. 1996;1:R1–R8. doi: 10.1016/S1359-0278(96)00003-X. [DOI] [PubMed] [Google Scholar]
- 32.Fersht AR, Sato S. Phi-value analysis and the nature of protein-folding transition states. Proc Natl Acad Sci U S A. 2004;101:7976–7981. doi: 10.1073/pnas.0402684101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Geierhaas CD, Salvatella X, Clarke J, Vendruscolo M. Characterisation of transition state structures for protein folding using 'high', 'medium' and 'low' {Phi}-values. Protein Eng Des Sel. 2008;21:215–222. doi: 10.1093/protein/gzm092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kim DE, Fisher C, Baker D. A breakdown of symmetry in the folding transition state of protein L. J Mol Biol. 2000;298:971–984. doi: 10.1006/jmbi.2000.3701. [DOI] [PubMed] [Google Scholar]
- 35.Sanchez IE, Kiefhaber T. Hammond behavior versus ground state effects in protein folding: evidence for narrow free energy barriers and residual structure in unfolded states. J Mol Biol. 2003;327:867–884. doi: 10.1016/s0022-2836(03)00171-2. [DOI] [PubMed] [Google Scholar]
- 36.Korzhnev DM, Salvatella X, Vendruscolo M, Di Nardo AA, Davidson AR, Dobson CM, Kay LE. Low-populated folding intermediates of Fyn SH3 characterized by relaxation dispersion NMR. Nature. 2004;430:586–590. [Google Scholar]
- 37.Fowler SB, Clarke J. Mapping the folding pathway of an immunoglobulin domain: structural detail from Phi value analysis and movement of the transition state. Structure. 2001;9:355–366. doi: 10.1016/s0969-2126(01)00596-2. [DOI] [PubMed] [Google Scholar]
- 38.Feng H, Vu ND, Bai Y. Detection and structure determination of an equilibrium unfolding intermediate of Rd-apocytochrome b562: native fold with non-native hydrophobic interactions. J Mol Biol. 2004;343:1477–1485. doi: 10.1016/j.jmb.2004.08.099. [DOI] [PubMed] [Google Scholar]
- 39.Feng H, Zhou Z, Bai Y. A protein folding pathway with multiple folding intermediates at atomic resolution. Proc Natl Acad Sci U S A. 2005;102:5026–5031. doi: 10.1073/pnas.0501372102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Spence GR, Capaldi AP, Radford SE. Trapping the on-pathway folding intermediate of Im7 at equilibrium. J Mol Biol. 2004;341:215–226. doi: 10.1016/j.jmb.2004.05.049. [DOI] [PubMed] [Google Scholar]
- 41.Zhou Z, Feng H, Ghirlando R, Bai Y. The high-resolution NMR structure of the early folding intermediate of the Thermus thermophilus ribonuclease H. J Mol Biol. 2008;384:531–539. doi: 10.1016/j.jmb.2008.09.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gianni S, Guydosh NR, Khan F, Caldas TD, Mayor U, White GW, DeMarco ML, Daggett V, Fersht AR. Unifying features in protein-folding mechanisms. Proc Natl Acad Sci U S A. 2003;100:13286–13291. doi: 10.1073/pnas.1835776100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.White GW, Gianni S, Grossmann JG, Jemth P, Fersht AR, Daggett V. Simulation and experiment conspire to reveal cryptic intermediates and a slide from the nucleation-condensation to framework mechanism of folding. J Mol Biol. 2005;350:757–775. doi: 10.1016/j.jmb.2005.05.005. [DOI] [PubMed] [Google Scholar]
- 44.Wagner C, Kiefhaber T. Intermediates can accelerate protein folding. Proc Natl Acad Sci U S A. 1999;96:6716–6721. doi: 10.1073/pnas.96.12.6716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Edelhoch H. Spectroscopic determination of tryptophan and tyrosine in proteins. Biochemistry. 1967;6:1948–1954. doi: 10.1021/bi00859a010. [DOI] [PubMed] [Google Scholar]