Abstract
Characterization of the mechanisms by which proteins fold into their native conformations is important not only for protein structure prediction and design but also because protein misfolding intermediates may play critical roles in fibril formation that are commonplace in neurodegenerative disorders. In practice, the study of folding pathways is complicated by the fact that for the most part intermediates are low-populated and short-lived so that biophysical studies are difficult. Due to recent methodological advances, relaxation dispersion NMR spectroscopy has emerged as a particularly powerful tool to obtain high-resolution structural information about protein folding events on the millisecond timescale. Applications of the methodology to study the folding of SH3 domains have shown that folding proceeds via previously undetected on-pathway intermediates, sometimes stabilized by nonnative long-range interactions. The relaxation dispersion approach provides a detailed kinetic and thermodynamic description of the folding process as well as the promise of obtaining an atomic level structural description of intermediate states. We review the concerted application of a variety of recently developed NMR relaxation dispersion experiments to obtain a “high-resolution” picture of the folding pathway of the A39V/N53P/V55L Fyn SH3 domain.
Introduction
An understanding of the processes by which an unfolded protein acquires its native state conformation remains a very significant and challenging biophysical problem, even 50 years after the seminal investigations of Anfinsen showed that proteins such as ribonuclease fold spontaneously in a matter of several seconds or faster (1). The timescale of protein folding is remarkable considering the vast number of conformations that potentially are accessible to a polypeptide chain. Moreover, globular proteins are only marginally stable, with free energies of folding on the order of ΔGU→F = GF − GU ≈ −50 kJ/mol, which corresponds to the energy contribution from only a handful of favorable noncovalent interactions out of the thousands found in a globular protein (2). Yet, protein folding appears to be a highly cooperative event. In fact, most experimental kinetic studies of independently folding protein units or “domains” are consistent with a surprisingly simple two-state transition U↔F connecting the unfolded, U, and native, F, states (3–5). In this approximation an understanding of the folding pathway at a structural level involves the characterization of only a single transition state (TS), TS(UF), which can be probed via the effects of point mutations on the folding kinetics in an approach called Φ-value analysis (4,6), or via a related method termed Ψ-value analysis (7). However, experimental detection of additional states along the folding pathway such as intermediates I is often very difficult because they are both low and transiently populated (4,8). The use of denaturants in stopped-flow mixing and equilibrium unfolding experiments can further lead to destabilization of compact intermediates, resulting in their rapid pre-equilibration in kinetic experiments during the instrument dead-time of typically a few milliseconds (9). Accordingly, many recent studies, often invoking additional methods such as ultra-fast mixing (10) or temperature jump (11) to resolve the rapid pre-equilibration stage, have identified more complex folding pathways with intermediate states, suggesting that folding energy landscapes tend to be more rugged than previously anticipated (8). Even with these methods to resolve fast kinetics, however, there remain rather few studies where Φ-values have been measured for the two transition state ensembles and the intermediate of even relatively “simple” three-state folding reactions (12,13). In this regard, native-state hydrogen exchange experiments are particularly powerful in providing detailed structural information about one or more I states on a per-residue basis, even in cases when the intermediates are populated at extremely low levels that would be invisible to the kinetic experiments mentioned above (14–16).
The development of modern multidimensional, multinuclear NMR spectroscopy has proven extremely valuable to the study of protein folding, and has played an important role in many biophysical studies including the native-state hydrogen exchange method mentioned above (14–16). NMR has emerged as a particularly powerful technique to identify and quantify folding intermediates because an abundance of site-specific probes can be studied simultaneously in a single one- or multidimensional NMR spectrum. Experiments can be recorded as kinetic measurements that monitor the return to equilibrium after an initial perturbation (such as a change in denaturant concentration, pH, temperature, or solvent protonation). For example, very slow folding events can be monitored directly in real-time (17), and recently developed fast acquisition methods allow the recording of two-dimensional NMR experiments for real-time folding studies with a time resolution of a few seconds (18). For folding events on the more common millisecond to second timescale, proton/deuterium amide hydrogen exchange NMR spectroscopy studies have proven particularly successful in identifying intermediates (14–16). In favorable cases the site-specific resolution afforded by hydrogen exchange NMR spectroscopy provides a rational basis for stabilizing an excited intermediate state through protein engineering so that it becomes the ground state and hence accessible to conventional high-resolution structure determination, as demonstrated in several elegant studies on redesigned apo-cytochrome b562 (19), En-HD (20), T4 lysozyme (21) and RNase H (22).
It has also long been recognized that NMR spectroscopy is a very powerful method to quantify chemical exchange kinetics of systems in equilibrium (23). Although the basic experimental ideas were formulated over a half-century ago (24) and applications to very simple biological systems have appeared for many years, detailed studies of protein conformational exchange had to await the development of new experiments and labeling schemes (25–28). In recent years our laboratory has studied equilibrium protein folding reactions to elucidate the folding pathways of a number of small single domain proteins. One such system, which has been studied by a large number of laboratories, is the SH3 domain, a module that folds into a five-stranded β-sandwich structure (29). Notably, studies that include calorimetric, equilibrium and kinetic folding/unfolding experiments are consistent with a two-state folding mechanism, with no evidence for the formation of partially folded intermediates (5,30). By contrast, equilibrium NMR experiments that provide kinetic, thermodynamic and structural information on the folding reactions of a number of mutants of the Abp1p and Fyn SH3 domains can only be interpreted in terms of the formation of a low-populated intermediate, which must be on the folding pathway (9,31,32). In the following review, we briefly highlight some of the qualitative features of the NMR methodology that led to the identification of intermediates of Fyn SH3 domain folding that were “invisible” to other more established techniques, and then present a synopsis of a number of NMR studies that have provided detailed atomic level insight into the folding mechanism of the Fyn SH3 domain.
NMR relaxation dispersion experiments
Before describing the underlying physical basis of the NMR methodology that is used to study equilibrium conformational exchange processes, we first discuss how chemical exchange events can affect NMR spectra in general. Consider the exchange between the predominantly populated (ground) state, A, and an excited state, B, , with exchange rate kex,AB = kAB + kBA. The states are populated according to the Boltzmann distribution
(1) |
where pA and pB are the fractional populations of states A and B, respectively, R is the gas constant, and T the absolute temperature. The temperature dependence of the free energy difference between states is given by
(2) |
where ΔHA→B(Tm), ΔSA→B(Tm), and ΔCp(A→B) are the enthalpy, entropy, and heat capacity differences, respectively, between states B and A at an arbitrary reference temperature, Tm (4). Assuming transition-state theory, the temperature dependence of the rate of transition from state A to state B, separated by the rate-limiting transition state TS(AB), is given by the Eyring equation:
(3) |
where κ is a transmission coefficient, and kB and h are the Boltzmann and Planck constants, respectively (33). Suppose that an NMR-active probe on the protein (such as a 15N or 1HN spin, for example) resonates at distinct frequencies (chemical shifts) in the ground and excited states, denoted by ωA and ωB (in units of rad/s), respectively, or alternatively by ϖA and ϖB (in ppm). Chemical shift differences ΔωAB = ωB−ωA normally range from 0 to ≈6000 rad/s, depending, of course, on the NMR probe, the nature of the exchange process, and the spectrometer frequency at which the experiment is recorded. If the exchange rate, kex,AB, is very slow compared to ΔωAB (kex,AB < ≈0.05 × ΔωAB; slow exchange regime) then the NMR spectrum contains separate, well-defined resonances for each of the two states. In cases where kex,AB is on the order of 1/s–10/s the exchange kinetics is most easily quantified via magnetization exchange experiments (25,26). Conversely, if the exchange rate kex,AB is similar to the chemical shift difference, ≈100/s to 2000/s (intermediate exchange regime), then the exchange causes significant line-broadening of signals from both states. The low-populated excited state resonance usually becomes too broad to be observed but the line-broadening of the ground state resonance still reports on the exchange with the invisible excited state conformer and such line-broadening can be quantified to extract out the kinetics and thermodynamics of the exchange event as well as the excited state chemical shifts, which can be interpreted in terms of structure.
Many NMR experiments can be understood without recourse to a rigorous quantum-mechanical description, including those to study equilibrium chemical exchange processes. Upon insertion of the NMR sample into a large external magnetic field the sum over all magnetic dipoles in the sample produces net magnetization, which aligns with the field (taken to be the z axis). Subsequent application of a radio-frequency pulse at the appropriate frequency places the magnetization into the x-y plane, as indicated in Fig. 1 A, where a 90°y pulse rotates magnetization to the x axis. In the absence of further pulses the magnetization subsequently rotates (precesses) around the z axis with a characteristic angular frequency (chemical shift) ω, so that over time it will accrue a phase φ = ωt with respect to the x axis. Let us first consider the case of a protein that exists in two discrete conformations which do not interconvert, and follow the course of a single NMR-active nucleus with different chemical shifts, ωA and ωB, in the two states. Magnetization at sites with different chemical shifts will rotate with different frequencies and thus become increasingly defocused with time. Now, consider the case where after an evolution period of time T/2 an additional pulse is applied, one that inverts the accrued phase of the magnetization at each site (a so-called 180°x pulse), regardless of precession frequency. During the successive delay T/2 the phase differences between magnetization from sites A and B decrease so that at time T the phase difference is zero and both magnetization components are returned to the x axis (Fig. 1 B). The NMR spectrum for such a nonexchanging system is shown in the inset to Fig. 1 B, where the intensities of each of the resonance lines provide a faithful readout of the relative populations of the two states. Next, consider the case where exchange between ground (A) and excited (B) states, A↔B, causes the precession frequency of an NMR spin to vary stochastically between ωA and ωB. The stochastic nature means that the accrued phases for magnetization originating on different molecules in general are not the same and hence the signal, which is the vector sum of the magnetization from all of the molecules in the sample, broadens in the frequency domain and decreases in intensity (inset to Fig. 1 C). For the same reason, application of a 180°x pulse in the middle of the evolution period does not, in general, lead to a complete refocusing of magnetization (Fig. 1 C). However, as more and more refocusing pulses are applied during T, refocusing improves progressively as long as the rate of application of pulses is on the order of or faster than the exchange rate (Fig. 1 D). The idea behind this so-called Carr-Purcell-Meiboom-Gill (CPMG) (34,35) relaxation dispersion (RD) experiment is to monitor the signal intensity I(νCPMG) from spin probes of the observable ground state, A, as a function of the repetition rate of the refocusing pulses, defined as νCPMG = 1/(2τ), where τ is the separation between the pulses. I(νCPMG) is then converted into an effective transverse relaxation rate that quantifies the rate of decay of signal, Reff(νCPMG) = −ln(I(νCPMG)/I0)/T, where I0 is a reference signal intensity. These relaxation dispersion profiles Reff(νCPMG) (Fig. 2) depend in a nonlinear fashion on the rates of the exchange process (kAB and kBA, or equivalently kex,AB and pB) and on the absolute chemical shift differences |ΔϖAB| according to the Bloch-McConnell equations (24). The complete set of exchange parameters can be obtained by least-squares fitting of the RD profiles, provided that the exchange is in the intermediate regime. The sign of ΔϖAB can also be obtained experimentally (28), and hence the chemical shifts of the invisible excited state according to ϖB = ϖA + ΔϖAB. New isotope-labeling schemes have been developed along with improved CPMG RD pulse sequences so that accurate excited state chemical shifts can be extracted for backbone 1HN, 15N, carbonyl 13CO, 1Hα, 13Cα and methyl group resonances (25–28,36).
Figure 1.
NMR spectroscopy of a single probe attached to a protein molecule that exists in two discrete conformations highlighted in red and green. The probe resonates with different characteristic NMR frequencies, ωA and ωB, in each of the two states. (A) A pulse is applied to equilibrium magnetization (along z) that places it along the x axis. The magnetization subsequently rotates around the z axis at its characteristic angular frequency, ω, so that after a time t the accrued phase with respect to the x axis is φ = ωt. (B) In the absence of chemical exchange the evolution of magnetization with different precession frequencies is refocused when a 180°x pulse is applied in the middle of a fixed delay T. The NMR spectrum of the system is shown as an inset. (C) In the presence of chemical exchange the precession frequency stochastically jumps between ωA (red) and ωB (green) with a different trajectory for each molecule. Ten such trajectories are shown with two highlighted in color to indicate the timecourse of the jumps. The exchange process leads to line-broadening, as can be seen in the spectrum shown in the inset. (D) The application of successively increasing numbers of 180°x pulses during the same delay improves refocusing and hence minimizes the effects of chemical exchange.
Figure 2.
15N RD profiles for Ala-6 (left) and Asp-16 (right) of the A39V/N53P/V55L Fyn SH3 domain at 25°C (top) and 45°C (bottom). Continuous lines are the predictions from fitting a global three-state folding model U↔I↔F to all data recorded in 5°C increments between 10°C and 45°C. Adapted from Neudecker et al. (9).
RD experiments are very sensitive to excited states populated to as little as ≈0.5%, corresponding to a free energy difference between exchanging states of <≈13 kJ/mol, so long as these states exchange on the millisecond timescale and the exchange process is accompanied by sizable chemical shift changes |ΔϖAB| (37). A significant number of protein folding equilibria satisfy these criteria. It is worth noting that one of the major advantages of the RD NMR method over approaches that monitor folding/unfolding kinetics as a function of denaturant concentration is that the NMR studies can be performed in the absence of denaturant or other perturbations that might affect intermediate states.
Choice of the A39V/N53P/V55L Fyn SH3 domain to probe protein folding
Over the past decade a large number of folding studies of SH3 domains have emerged that have made use of a wide variety of biophysical techniques, concluding that the folding reaction could be described by a two-state U↔F mechanism with no detectable accumulation of intermediates (5,30). NMR RD experiments cannot be performed on the wild-type Fyn SH3 domain because the folding exchange rate kex,FU is too small and the stability of the domain too high; however, experiments performed on a variety of G48 mutants of the Fyn and Abp1p SH3 domains designed to have folding rates and stabilities suitable for RD analysis while preserving the native fold all showed very strong evidence of an on-pathway intermediate I (31,32). Is this on-pathway intermediate merely a consequence of the choice of the site of mutation (position 48) or does it correspond to a state sampled during folding of wild-type SH3 domains in general? A number of arguments suggest that it is the latter. First, the structural features of the intermediate state for Fyn and Abp1p folding, as established by chemical shifts measured from RD, agree well with the picture of the transition state ensemble of SH3 domain folding emerging from Φ-value analysis using a variety of wild-type SH3 domains (31,32,38–40). Second, computational studies on the src SH3 domain show that folding proceeds through an intermediate state with similar properties to those of the Fyn SH3 domain mutants deduced by RD (41). Finally, we have examined an additional pair of mutants, N53P/V55L and A39V/N53P/V55L Fyn SH3, which preserve Gly-48 and are amenable to RD experiments. Both of these mutants were shown to fold via a low-populated on-pathway intermediate with many of the structural properties of the intermediates of G48 mutants characterized previously, but with some differences as well, as described in detail below (9,32). Of these two mutants, the A39V/N53P/V55L Fyn SH3 domain was of particular interest since i), at temperatures below 30°C the three-state folding process U↔I↔F established by RD measurements was effectively reduced to a two-state exchange reaction I↔F, thus making the intermediate particularly accessible experimentally and ii), a thorough analysis of the 15N chemical shifts of the intermediate, ϖI, revealed the presence of pronounced nonnative long-range interactions (9). The tremendous success of protein folding studies by methods that are interpreted primarily in the context of the formation of native contacts such as Φ-value analysis (4,6) and native-centric computer simulations (42) might lead one to suggest that folding is predominantly driven by native interactions and that evolution has largely selected against nonnative contacts that could cause kinetic traps. However, many recent experimental and theoretical studies have provided increasing evidence that nonnative interactions have a significant role in protein folding (8,15) and can even speed up the process; in fact, the A39V/N53P/V55L Fyn SH3 domain is a case where nonnative interactions involving residue 53 lead to an increase in the folding rate (43). The exact role of nonnative interactions in folding intermediates is still poorly understood and of considerable interest, especially because folding intermediates are suspected to be the precursors for the amyloid fibrils associated with many neurodegenerative diseases (44).
An on-pathway folding intermediate
RD experiments are very sensitive to the presence of intermediate states, partly because a large number of sites in the protein can be studied simultaneously. In principle, each site monitors the exchange event and for the case of a three-state folding reaction, U↔I↔F, site-specific chemical shift differences |ΔϖFI| and |ΔϖFU| are obtained. Just by coincidence, some sites will have large |ΔϖFI| but |ΔϖFU| close to 0 and these predominantly probe the I↔F exchange event, whereas others will have |ΔϖFI| close to 0 but large |ΔϖFU| so that the U↔F process dominates. As a consequence, fitting the RD profiles of every resonance individually assuming a two-state model will result in a very inhomogeneous distribution of exchange rates and excited state populations, and when all residues are fit simultaneously to a global two-state folding model a very elevated fitting residual χ2 is obtained. Such findings should be taken as an indication that the exchange process is more complex than two-state (31,32,37). In the case of the A39V/N53P/V55L Fyn SH3 domain identification of the intermediate was particularly clear. Some residues, like Ala-6 (Fig. 2), exhibit large 15N relaxation dispersions at room temperature but not at elevated temperatures, whereas others such as Asp-16 (Fig. 2) show exactly the opposite behavior (9). In the context of a two-site exchange model this suggests that the excited state in exchange with the native (ground) state, F, is different at low and high temperatures. As expected, the 15N chemical shifts reconstructed from the RD profiles for the most prominent excited state at elevated temperatures are very close to random coil values (45), so this state was readily identified as the unfolded state U, with a fractional population, pU = 3.0% at 45°C (9). At 30°C its population drops to pU = 0.4%, and at 25°C and below pU becomes negligible. At these temperatures the RD profiles are well fit by a simple two-state exchange model I↔F with a different excited state I than at higher temperatures, with amide 1HN and 15N chemical shifts that are not random coil like, but rather well-dispersed, suggesting that this state is folded, although not natively so (see below). The population of I, pI, is 2.7% at 10°C, and decreases to 1.9% at 45°C. Exchange between F and I becomes fast at elevated temperatures (kex,FI = 11081/s at 45°C), but the value of pI remains significant and a three-state model is necessary to fit the data at 30°C and above. It is worth noting that the information content of the RD profiles allows a discrimination between on- (U↔I↔F) and off- (I↔F↔U or F↔U↔I) pathway intermediates since exchange rates and chemical shift values are extracted simultaneously in fits of the data; the best fit was obtained for the model with I on-pathway (9). Thus, the equilibrium RD NMR experiments establish that the A39V/N53P/V55L Fyn SH3 domain folds under native conditions according to a three-state mechanism, U↔I↔F, which effectively reduces to I↔F at 25°C and below.
Comparison between NMR relaxation dispersion and stopped-flow kinetics data
The NMR data argue very strongly that the A39V/N53P/V55L Fyn SH3 domain folds via an on-pathway folding intermediate, yet no such intermediates have been observed to date in extensive studies of SH3 domains using kinetic methods (5,30). To examine this issue further we have measured the folding/unfolding kinetics of the A39V/N53P/V55L Fyn SH3 domain as a function of denaturant concentration by stopped-flow, and the resulting chevron plot was indeed fully consistent with the two-state folding model, U↔F (9). Prediction of the chevron plot on the basis of parameters obtained from the three-state folding model determined from the RD experiments established that the kinetics from both methods are quantitatively consistent. Although the presence of the intermediate state is predicted to slow down folding under native conditions by a very small amount compared to strictly two-state folding, the resulting “roll-over” in the chevron plot is too small for reliable detection even on stopped-flow devices with very short dead-times, especially since even small amounts of denaturant speed up the F↔I pre-equilibrium to the point where the transition from U to I becomes exclusively rate-limiting for folding (9). Thus, the stopped-flow fluorescence measurements report only on the U↔I exchange event. All of the SH3 domains that we have studied to this point by RD NMR fold via similar on-pathway intermediates (9,31,32,37), which most likely cannot be detected by conventional stopped-flow measurements. This suggests that intermediates may be a more general feature of SH3 domain folding than anticipated based on experiment, in agreement with theoretical predictions (41,46). Under typical stopped-flow conditions the early transition state TS(UI) becomes rate-limiting. Thus, in the context of the three-state folding pathway U↔I↔F, the abundance of Φ-values that are available in the literature for SH3 domains obtained from stopped-flow kinetics analyzed assuming a two-state folding pathway should be interpreted as probing the early transition state TS(UI) (9).
The thermodynamics of the folding pathway
Fits of the three-state folding model U↔I↔F to RD profiles of the A39V/N53P/V55L Fyn SH3 domain generated well-defined values for all four kinetic rates (kFI, kIF, kIU, kUI) as well as populations of all three states (pI = kFI/kIF × pF, pU = kIU/kUI × pI, pF + pI + pU = 100%). The corresponding energy level diagram (Fig. 3), derived from Eqs. 1–3, shows the enthalpy/entropy compensation typical for protein folding (4). Both rate-limiting transition states TS(UI) and TS(IF) are entropically destabilized. Interestingly, the I state is more enthalpically favored and entropically disfavored than F, suggesting that I has a fully collapsed, rigid structure. This result is corroborated by studies over a wide temperature range (10–45°C), which allowed quantification of the heat capacity differences ΔCp between I, TS(IF) and F from the curvature of the Eyring plots according to Eqs. 2 and 3. The heat capacity of I is surprisingly low and is identical to that of F within experimental error (9). Heat capacities are considered to reflect solvent-accessible surface areas (47) so that the similar values obtained for I and F argue that I likely has a fully collapsed, albeit nonnative structure (see below). Along these lines, one would expect that the nonnative hydrophobic core of I would be broken and partially rehydrated to allow formation of the native core, and indeed the transition state TS(IF) has a significantly higher heat capacity than either of the two states that it connects (9).
Figure 3.
Profiles of free energy, ΔG (black), and the associated enthalpic ΔH (blue) and entropic −TΔS (red) contributions to ΔG along the folding pathway for the A39V/N53P/V55L Fyn SH3 domain at 45°C. Values of the thermodynamic parameters are referenced with respect to the unfolded state U, which is arbitrarily assigned values of 0. TS(UI) and TS(IF) denote the rate-limiting transition states between states U, I and I, F respectively. Heat capacity values of ΔCp(U→I) = ΔCp(U→F) − (0.15 ± 0.17) kJ/mol/K and ΔCp(U→TS(IF)) = ΔCp(U→F) + (1.46 ± 0.11) kJ/mol/K are extracted from the temperature dependence of kFI and kIF according to Eqs. 2 and 3. Adapted from Neudecker et al. (9).
High-resolution structural information about the folding intermediate
As described above, the excited state spectral parameters that are directly accessible from RD experiments are the NMR chemical shifts, which in turn are well-known to reflect protein primary, secondary and, to a lesser extent, tertiary structure (45). These chemical shifts are routinely recast in terms of dihedral angle restraints in high-resolution structural studies of ground state structures of proteins (45,48,49) but they also provide valuable structural information about invisible excited states (28). Residual anisotropic interactions such as residual dipolar couplings also manifest in shifts of spectral lines and RD experiments have been developed for their measurement as well (28). This is a particularly exciting development because such interactions provide orientational restraints that have been shown to also be very useful for structural studies of excited conformers (50).
The chemical shifts of the folding intermediate of the A39V/N53P/V55L Fyn SH3 domain can be determined very accurately from fitting the two-state model I↔F to RD data recorded at low temperatures (9), bypassing the complexity of a three-state fitting procedure (31,51) required at higher temperatures. As seen in Fig. 4, changes in backbone amide 15N chemical shifts between F and I states are highly localized to the terminal β-sheet (strands β1 and β5) and the 310-helical turn immediately preceding strand β5, whereas the 15N chemical shifts of the central β-sheet (strands β2, β3, β4) differ much less between the two states. Backbone amide 1HN and especially carbonyl 13C shifts, the latter being less sensitive to side-chain effects (45), show very similar profiles (P. Neudecker, P. Lundström, and L. E. Kay, unpublished). The chemical shifts of the intermediate in the NH2- and COOH-terminal regions deviate significantly from random coil values (45). For example, the amide group of Glu-5 shows strongly upfield 1HN and downfield 15N shift values in the intermediate for the A39V/N53P/V55L Fyn SH3 domain (9), but not for the G48M and G48V Fyn SH3 mutants that had been studied previously (31,52). Since the primary sequence of strand β1 in all these mutants is identical the differences in shifts observed for the A39V/N53P/V55L Fyn SH3 domain must reflect nonlocal interactions. One explanation is that the additional hydrophobicity introduced by the mutation N53P (note that position 53 is a solvent-exposed side chain in the native state, Fig. 4) stabilizes a nonnative hydrophobic cluster, which involves residues from the NH2- and COOH-termini, in the intermediate state. This interpretation is corroborated by computer simulations of the Fyn SH3 domain, where increasing the hydrophobicity at position 53 leads to a higher contact probability with the NH2-terminal residues in the transition state ensemble (43). Favorable nonnative interactions in the transition state between a hydrophobic side chain introduced at position 53 and the side chains of Leu-3 and Phe-4 could indeed be demonstrated experimentally using double mutant thermodynamic cycles based on stopped-flow kinetics interpreted on the basis of a two-state model (43). Interestingly, these favorable nonnative interactions were found to speed up folding ≈2-fold in both the simulations and in the stopped-flow experiments (43). This nonnative hydrophobic cluster is not present in the G48M or G48V Fyn SH3 domains, whose termini appear to be much more flexible and partially hydrated (31). We are currently in the process of producing atomic models of the nonnative I state using a combination of extensive chemical shift (1HN, 15N, 13CO, 1Hα, 13Cα) and residual anisotropic interactions that manifest upon weak alignment, measured through a variety of RD experiments (25–28,36). We have established that the cosolvents used for alignment preserve the kinetics and thermodynamics of the three-state folding pathway.
Figure 4.
Schematic representation of the secondary structure of a homology model of the A39V/N53P/V55L Gallus gallus Fyn SH3 domain colored according to the backbone amide 15N chemical shift differences between resonances in intermediate and folded states, |ΔϖN,FI|, from 0.0 ppm (yellow) to 6.0 ppm (blue), or gray if ΔϖN,FI could not be determined due to missing resonance assignments, resonance overlap, or if the exchange contributions to transverse relaxation were of insufficient magnitude to quantify shift differences. The mutated residues Val-39, Pro-53, and Leu-55 are shown in ball-and-stick representation. Values of |ΔϖN,FI| > 1.5 ppm (red to blue) are localized primarily to a region including the proximal NH2- and COOH-termini comprising strands β1 and β5, indicating that the rest of the domain essentially retains a native-like backbone conformation in the intermediate state. Adapted from Neudecker et al. (9).
NMR-based Φ-value analysis of a three-state folding pathway
As described above analysis of RD data can provide detailed information about intermediate states, so long as they are populated at a level of ≈0.5% or more. Insight into the nature of TS ensembles is more difficult to obtain since they are of much higher energy and hence much lower population. Such information is available, however, from Φ-value analysis. Here the effects of mutations on folding/unfolding rates are monitored and recast in terms of changes in free energies that can, with care, be interpreted in terms of formation of structure in the TS at the site of mutation (4,6). Since RD experiments yield kinetic rates, and hence via Eqs. 1–3 the free energy profile that describes the folding event, they can be used as the kinetic method for Φ-value analysis in much the same way as conventional approaches (53). The principle of Φ-value analysis is illustrated in Fig. 5 A. A point mutation at a particular site in the protein will change the stability of the protein by an amount ΔΔGU→F = ΔGU→F(mutant) – ΔGU→F(wt) and in a similar manner modulate the stability of the intermediate and transition states (A) relative to the unfolded state by ΔΔGU→A = ΔGU→A(mutant) – ΔGU→A(wt). The Φ-value of state A is defined as
(4) |
If the mutated side chain is in its native environment in state A the mutation will affect the stability of A in the same way as F, resulting in ΦA ≈ 1. By contrast, if the mutated side chain is disordered it will affect A in the same way as U, so ΔΔGU→A ≈ 0 and ΦA ≈ 0. Φ-values therefore reflect the formation of native contacts along the folding pathway.
Figure 5.
Changes in free energy for state A, ΔGA = GA(mutant) − GA(wt), used to calculate ΦA (A), along with changes in free energy upon mutation, ΔΔG, along the folding pathway for several mutants of the A39V/N53P/V55L Fyn SH3 domain (B–D). Values of G are referenced with respect to the unfolded state U, which is arbitrarily assigned a value of 0. TS(UI) and TS(IF) denote the rate-limiting transition states between states U, I, and I, F respectively. Side chains mutated for Φ-value analysis are indicated in boldface and highlighted in ball-and-stick representation on the right-hand side (same homology model and view as in Fig. 4). (B) Inset shows the pair of ΔG profiles (“pseudo-wt” and mutant) at 40°C from which ΔΔG values are obtained, with the free energies of U states both assigned arbitrarily to 0 (strictly true only if ΔGU = 0). Note that, unlike T47S (B) and F20L (D), E5V (C) is a stabilizing mutation (ΔΔGU→F < 0), but this is irrelevant for Φ-value analysis. Adapted from Neudecker et al. (53).
In agreement with a large body of Φ-values for SH3 domains that have been interpreted on the basis of a two-state folding transition the Φ-values for the early transition state TS(UI) of the A39V/N53P/V55L Fyn SH3 domain derived from NMR RD experiments (53) are highly polarized (Fig. 5). The central β-sheet, probed by the mutations R40T and T47S (for clarity, mutations for Φ-value analysis are highlighted in boldface), is already formed in TS(UI) and is maintained throughout the folding pathway, as indicated by high Φ-values (Fig. 5 B). By contrast, the terminal β-sheet, probed by the mutations L3A and E5V, does not fully adopt its native conformation until after the late transition state TS(IF), as indicated by low Φ-values (Fig. 5 C).
Calculated ΦI values also corroborate the picture of the intermediate state that emerges from analysis of chemical shifts. The largest changes in chemical shifts that accompany the hydrophobic core mutation F20L are in strand β1, which does not make contact with Phe-20 in the folded state. Conversely, the E5V mutation perturbs the peak positions of Leu-18 and Ser-19 in the intermediate. These chemical shift perturbations provide strong evidence of nonnative hydrophobic contacts in the intermediate. The high ΦI associated with F20L but significantly lower ΦTS(IF) value likely reflects these nonnative contacts in the intermediate, which must be broken in the late transition state before formation of the native fold (Fig. 5 D).
A distinct advantage of RD NMR over other kinetic methods for Φ-value analysis is the availability of the chemical shifts of the ground and excited states as sensitive probes to monitor any (undesired) structural changes introduced by the mutation. In the case of the A39V/N53P/V55L Fyn SH3 domain the mutations produced only very small changes in chemical shifts, so that the Φ-values could be interpreted with confidence (53).
Comparison of relaxation dispersion NMR spectroscopy with other biophysical methods
The above discussion has focused on the use of RD NMR spectroscopy to probe the folding pathway of the A39V/N53P/V55L Fyn SH3 domain. The methodology employed is particularly sensitive to exchange events so long as they involve “excited” states populated at a level of 0.5% or higher that interconvert with a “visible” ground state on the millisecond timescale and there is a difference in chemical shifts between the exchanging states. Detailed kinetic, thermodynamic and structural data can be obtained and structures of intermediate states can in principle be determined irrespective of whether such states are native-like or not. A powerful aspect of the NMR technique is that experiments can be performed in the absence of denaturants that can affect structures of U or I states, and that it is often possible to establish whether the intermediate is on- or off-pathway, as has been the case in folding studies of a number of small domains (9,31,32,54).
RD methodology is complementary to other kinetic techniques such as stopped-flow mixing (4,10,11), temperature jump (11), as well as hydrogen exchange experiments (14–16). With these other methods it is possible to detect and characterize excited states with free energies that far exceed those of the “low-lying” states that can be probed by RD NMR. The picture of protein folding that emerges from all of these techniques is similar. For example, stopped-flow derived Φ-values for SH3 domain folding (38–40) paint a picture of a TS ensemble with native-like structure in strands β2 to β4 and the n-Src loop that is similar to what emerges from analysis of chemical shifts of the intermediate state measured via RD NMR (9,31,32,37). RD NMR studies of an FF domain, a small four-helix bundle module, establish the presence of an intermediate state where helices 1 to 3 are at least partially formed, whereas helix 4 is not (54). Hydrogen exchange experiments on the folded state corroborate the fact that helix 4 is not formed in the intermediate state and the relative stabilities of helices 1–3 vs. 4 based on hydrogen exchange rates agrees with expectations from RD NMR (54). The NMR results are consistent with a Φ-value analysis of the rate-limiting transition state between I and F (55). Native state hydrogen exchange experiments have provided a wealth of data on protein folding intermediates and have established that for many proteins folding proceeds through cooperatively folding units serving as templates for further structure formation, sometimes called foldons (16). The folding intermediates of SH3 and FF domains that have been characterized by RD NMR studies (see above) also establish that formation of foldons is an important step in the folding process for these modules as well. In this regard the conclusions from RD NMR experiments support a growing list of experimental and theoretical results that many proteins, even relatively small ones, do not seem to fold along a multitude of equivalent pathways on a smooth funnel-like energy landscape, but rather by preferred step-wise pathways with specific, highly polarized transition states and intermediates, where parts of the protein have already adopted native-like conformations (4,8–11,14–16,37,53,54) and where missing favorable interactions are frequently substituted by nonnative contacts (8,9,15,16,43,53).
Acknowledgments
This work was supported by a grant from the Canadian Institutes of Health Research (CIHR) to L.E.K. P.N. and P.L. hold postdoctoral fellowships from the CIHR and the CIHR Training Grant on Protein Folding in Health and Disease, respectively.
References
- 1.Anfinsen B.C. Principles that govern the folding of protein chains. Science. 1973;181:223–230. doi: 10.1126/science.181.4096.223. [DOI] [PubMed] [Google Scholar]
- 2.Jaenicke R. Protein folding: local structures, domains, subunits, and assemblies. Biochemistry. 1991;30:3147–3161. doi: 10.1021/bi00227a001. [DOI] [PubMed] [Google Scholar]
- 3.Jackson S.E., Fersht A.R. Folding of Chymotrypsin inhibitor 2. 1. Evidence for a two-state transition. Biochemistry. 1991;30:10428–10435. doi: 10.1021/bi00107a010. [DOI] [PubMed] [Google Scholar]
- 4.Fersht A.R. Freeman; New York: 1998. Structure and Mechanism in Protein Science. [Google Scholar]
- 5.Capaldi A.P., Radford S.E. Kinetic studies of β-sheet protein folding. Curr. Opin. Struct. Biol. 1998;8:86–92. doi: 10.1016/s0959-440x(98)80014-6. [DOI] [PubMed] [Google Scholar]
- 6.Fersht A.R., Matouschek A., Serrano L. The folding of an enzyme I. Theory of protein engineering analysis of stability and pathway of protein folding. J. Mol. Biol. 1992;224:771–782. doi: 10.1016/0022-2836(92)90561-w. [DOI] [PubMed] [Google Scholar]
- 7.Sosnick T.R., Krantz B.A., Dothager R.S., Baxa M. Characterizing the protein folding transition state using Ψ analysis. Chem. Rev. 2006;106:1862–1876. doi: 10.1021/cr040431q. [DOI] [PubMed] [Google Scholar]
- 8.Brockwell D.J., Radford S.E. Intermediates: ubiquitous species on folding energy landscapes? Curr. Opin. Struct. Biol. 2007;17:30–37. doi: 10.1016/j.sbi.2007.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Neudecker P., Zarrine-Afsar A., Choy W.-Y., Muhandiram D.R., Davidson A.R. Identification of a collapsed intermediate with non-native long-range interactions on the folding pathway of a pair of Fyn SH3 domain mutants by NMR relaxation dispersion spectroscopy. J. Mol. Biol. 2006;363:958–976. doi: 10.1016/j.jmb.2006.08.047. [DOI] [PubMed] [Google Scholar]
- 10.Roder H., Maki K., Cheng H. Early events in protein folding explored by rapid mixing methods. Chem. Rev. 2006;106:1836–1861. doi: 10.1021/cr040430y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Nölting B. 2nd ed. Springer; Berlin: 2006. Protein Folding Kinetics. [Google Scholar]
- 12.Khan F., Chuang J.I., Gianni S., Fersht A.R. The kinetic pathway of folding of barnase. J. Mol. Biol. 2003;333:169–186. doi: 10.1016/j.jmb.2003.08.024. [DOI] [PubMed] [Google Scholar]
- 13.Nölting B., Golbik R., Neira J.L., Soler-Gonzalez A.S., Schreiber G. The folding pathway of a protein at high resolution from microseconds to seconds. Proc. Natl. Acad. Sci. USA. 1997;94:826–830. doi: 10.1073/pnas.94.3.826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Krishna M.M.G., Hoang L., Lin Y., Englander S.W. Hydrogen exchange methods to study protein folding. Methods. 2004;34:51–64. doi: 10.1016/j.ymeth.2004.03.005. [DOI] [PubMed] [Google Scholar]
- 15.Bai Y. Protein folding pathways studied by pulsed- and native-state hydrogen exchange. Chem. Rev. 2006;106:1757–1768. doi: 10.1021/cr040432i. [DOI] [PubMed] [Google Scholar]
- 16.Englander S.W., Mayne L., Krishna M.M.G. Protein folding and misfolding: mechanism and principles. Q. Rev. Biophys. 2007;40:287–326. doi: 10.1017/S0033583508004654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Zeeb M., Balbach J. Protein folding studied by real-time NMR spectroscopy. Methods. 2004;34:65–74. doi: 10.1016/j.ymeth.2004.03.014. [DOI] [PubMed] [Google Scholar]
- 18.Schanda P., Forge V., Brutscher B. Protein folding and unfolding studied at atomic resolution by fast two-dimensional NMR spectroscopy. Proc. Natl. Acad. Sci. USA. 2007;104:11257–11262. doi: 10.1073/pnas.0702069104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Feng H., Takei J., Lipsitz R., Tjandra N., Bai Y. Specific non-native hydrophobic interactions in a hidden folding intermediate: implications for protein folding. Biochemistry. 2003;42:12461–12465. doi: 10.1021/bi035561s. [DOI] [PubMed] [Google Scholar]
- 20.Religa T.L., Markson J.S., Mayor U., Freund S.M.V., Fersht A.R. Solution structure of a protein denatured state and folding intermediate. Nature. 2005;437:1053–1056. doi: 10.1038/nature04054. [DOI] [PubMed] [Google Scholar]
- 21.Kato H., Feng H., Bai Y. The folding pathway of T4 lysozyme: The high-resolution structure and folding of a hidden intermediate. J. Mol. Biol. 2007;365:870–880. doi: 10.1016/j.jmb.2006.10.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhou Z., Feng H., Ghirlando R., Bai Y. The high-resolution NMR structure of the early folding intermediate of the Thermus thermophilus ribonuclease H. J. Mol. Biol. 2008;384:531–539. doi: 10.1016/j.jmb.2008.09.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Deverell C., Morgan R.E., Strange J.H. Studies of chemical exchange by nuclear magnetic relaxation in the rotating frame. Mol. Phys. 1970;18:553–559. [Google Scholar]
- 24.McConnell H.M. Reaction rates by nuclear magnetic resonance. J. Chem. Phys. 1958;28:430–431. [Google Scholar]
- 25.Palmer A.G., Kroenke C.D., Loria J.P. Nuclear magnetic resonance methods for quantifying microsecond-to-millisecond motions in biological macromolecules. Methods Enzymol. 2001;339:204–238. doi: 10.1016/s0076-6879(01)39315-1. [DOI] [PubMed] [Google Scholar]
- 26.Palmer A.G. NMR characterization of the dynamics of biomacromolecules. Chem. Rev. 2004;104:3623–3640. doi: 10.1021/cr030413t. [DOI] [PubMed] [Google Scholar]
- 27.Palmer A.G., Grey M.J., Wang C. Solution NMR spin relaxation methods for characterizing chemical exchange in high-molecular-weight systems. Methods Enzymol. 2005;394:430–465. doi: 10.1016/S0076-6879(05)94018-4. [DOI] [PubMed] [Google Scholar]
- 28.Hansen D.F., Vallurupalli P., Kay L.E. Using relaxation dispersion NMR spectroscopy to determine structures of excited, invisible protein states. J. Biomol. NMR. 2008;41:113–120. doi: 10.1007/s10858-008-9251-5. [DOI] [PubMed] [Google Scholar]
- 29.Noble M.E.M., Musacchio A., Saraste M., Courtneidge S.A., Wierenga R.K. Crystal structure of the SH3 domain in human Fyn; comparison of the three-dimensional structures of SH3 domains in tyrosine kinases and spectrin. EMBO J. 1993;12:2617–2624. doi: 10.2210/pdb1shf/pdb. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Plaxco K.W., Guijarro J.I., Morton C.J., Pitkeathly M., Campbell I.D., Dobson C.M. The folding kinetics and thermodynamics of the Fyn-SH3 domain. Biochemistry. 1998;37:2529–2537. doi: 10.1021/bi972075u. [DOI] [PubMed] [Google Scholar]
- 31.Korzhnev D.M., Salvatella X., Vendruscolo M., Di Nardo A.A., Davidson A.R. Low-populated folding intermediates of Fyn SH3 characterized by relaxation dispersion NMR. Nature. 2004;430:586–590. doi: 10.1038/nature02655. [DOI] [PubMed] [Google Scholar]
- 32.Korzhnev D.M., Neudecker P., Zarrine-Afsar A., Davidson A.R., Kay L.E. Abp1p and Fyn SH3 domains fold through similar low-populated intermediate states. Biochemistry. 2006;45:10175–10183. doi: 10.1021/bi0611560. [DOI] [PubMed] [Google Scholar]
- 33.Eyring H. The activated complex and the absolute rate of chemical reactions. Chem. Rev. 1935;17:65–77. [Google Scholar]
- 34.Carr H.Y., Purcell E.M. Effects of diffusion on free precession in nuclear magnetic resonance experiments. Phys. Rev. 1954;94:630–638. [Google Scholar]
- 35.Meiboom S., Gill D. Modified spin-echo method for measuring nuclear relaxation times. Rev. Sci. Instrum. 1958;29:688–691. [Google Scholar]
- 36.Lundström P., Hansen D.F., Vallurupalli P., Kay L.E. Accurate measurement of Alpha proton chemical shifts of excited protein states by relaxation dispersion NMR spectroscopy. J. Am. Chem. Soc. 2009;131:1915–1926. doi: 10.1021/ja807796a. [DOI] [PubMed] [Google Scholar]
- 37.Korzhnev D.M., Kay L.E. Probing invisible, low-populated states of protein molecules by relaxation dispersion NMR spectroscopy: An application to protein folding. Acc. Chem. Res. 2008;41:442–451. doi: 10.1021/ar700189y. [DOI] [PubMed] [Google Scholar]
- 38.Martinez J.C., Serrano L. The folding transition state between SH3 domains is conformationally restricted and evolutionarily conserved. Nat. Struct. Biol. 1999;6:1010–1016. doi: 10.1038/14896. [DOI] [PubMed] [Google Scholar]
- 39.Riddle D.S., Grantcharova V.P., Santiago J.V., Alm E., Ruczinski I. Experiment and theory highlight role of native state topology in SH3 domain folding. Nat. Struct. Biol. 1999;6:1016–1024. doi: 10.1038/14901. [DOI] [PubMed] [Google Scholar]
- 40.Northey J.G.B., Di Nardo A.A., Davidson A.R. Hydrophobic core packing in the SH3 domain folding transition state. Nat. Struct. Biol. 2002;9:126–130. doi: 10.1038/nsb748. [DOI] [PubMed] [Google Scholar]
- 41.Guo W., Lampoudi S., Shea J.-E. Posttransition state desolvation of the hydrophobic core of the src-SH3 protein domain. Biophys. J. 2003;85:61–69. doi: 10.1016/S0006-3495(03)74454-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Taketomi H., Ueda Y., Gō N. Studies on protein folding, unfolding and fluctuations by computer simulation. I. The effect of specific amino acid sequence represented by specific inter-unit interactions. Int. J. Pept. Protein Res. 1975;7:445–459. [PubMed] [Google Scholar]
- 43.Zarrine-Afsar A., Wallin S., Neculai A.M., Neudecker P., Howell P.L. Theoretical and experimental demonstration of the importance of specific nonnative interactions in protein folding. Proc. Natl. Acad. Sci. USA. 2008;105:9999–10004. doi: 10.1073/pnas.0801874105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Canet D., Last A.M., Tito P., Sunde M., Spencer A. Local cooperativity in the unfolding of an amyloidogenic variant of human lysozyme. Nat. Struct. Biol. 2002;9:308–315. doi: 10.1038/nsb768. [DOI] [PubMed] [Google Scholar]
- 45.Wishart D.S., Case D.A. Use of chemical shifts in macromolecular structure determination. Methods Enzymol. 2001;338:3–34. doi: 10.1016/s0076-6879(02)38214-4. [DOI] [PubMed] [Google Scholar]
- 46.Ollerenshaw J.E., Kaya H., Chan H.S., Kay L.E. Sparsely populated folding intermediates of the Fyn SH3 domain: Matching native-centric essential dynamics and experiment. Proc. Natl. Acad. Sci. USA. 2004;101:14748–14753. doi: 10.1073/pnas.0404436101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Myers J.K., Pace C.N., Scholtz J.M. Denaturant m values and heat capacity changes: Relation to changes in accessible surface areas of protein unfolding. Protein Sci. 1995;4:2138–2148. doi: 10.1002/pro.5560041020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cavalli A., Salvatella X., Dobson C.M., Vendruscolo M. Protein structure determination from NMR chemical shifts. Proc. Natl. Acad. Sci. USA. 2007;104:9615–9620. doi: 10.1073/pnas.0610313104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Shen Y., Lange O., Delaglio F., Rossi P., Aramini J.M. Consistent blind protein structure generation from NMR chemical shifts data. Proc. Natl. Acad. Sci. USA. 2008;105:4685–4690. doi: 10.1073/pnas.0800256105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Vallurupalli P., Hansen D.F., Kay L.E. Structures of invisible, excited protein states by relaxation dispersion NMR spectroscopy. Proc. Natl. Acad. Sci. USA. 2008;105:11766–11771. doi: 10.1073/pnas.0804221105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Neudecker P., Korzhnev D.M., Kay L.E. Assessment of the effects of increased relaxation dispersion data on the extraction of 3-site exchange parameters characterizing the unfolding of an SH3 domain. J. Biomol. NMR. 2006;34:129–135. doi: 10.1007/s10858-006-0001-2. [DOI] [PubMed] [Google Scholar]
- 52.Korzhnev D.M., Neudecker P., Mittermaier A., Orekhov V.Y., Kay L.E. Multiple-site exchange in proteins studied with a suite of six NMR relaxation dispersion experiments: An application to the folding of a Fyn SH3 domain mutant. J. Am. Chem. Soc. 2005;127:15602–15611. doi: 10.1021/ja054550e. [DOI] [PubMed] [Google Scholar]
- 53.Neudecker P., Zarrine-Afsar A., Davidson A.R., Kay L.E. Φ-value analysis of a three-state protein folding pathway by NMR relaxation dispersion spectroscopy. Proc. Natl. Acad. Sci. USA. 2007;104:15717–15722. doi: 10.1073/pnas.0705097104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Korzhnev D.M., Religa T.L., Lundström P., Fersht A.R., Kay L.E. The folding pathway of an FF domain: Characterization of an on-pathway intermediate state under folding conditions by 15N, 13Cα and 13C-methyl relaxation dispersion and 1H/2H-exchange NMR spectroscopy. J. Mol. Biol. 2007;372:497–512. doi: 10.1016/j.jmb.2007.06.012. [DOI] [PubMed] [Google Scholar]
- 55.Jemth P., Day R., Gianni S., Khan F., Allen M. The structure of the major transition state for folding of an FF domain from experiment and simulation. J. Mol. Biol. 2005;350:363–378. doi: 10.1016/j.jmb.2005.04.067. [DOI] [PubMed] [Google Scholar]