Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2021 Nov 10;118(46):e2115113118. doi: 10.1073/pnas.2115113118

The A39G FF domain folds on a volcano-shaped free energy surface via separate pathways

Ved P Tiwari a,1, Yuki Toyama b,c,d,1, Debajyoti De a, Lewis E Kay b,c,d,e,2, Pramodh Vallurupalli a,2
PMCID: PMC8609552  PMID: 34764225

Significance

Protein molecules interconvert between multiple states, and this exchange process plays an important role in function and misfunction. Here, we use powerful NMR experiments that probe sparsely populated, transiently formed conformational states, which exchange with a highly populated, stable, folded conformer in order to elucidate the folding mechanism of a small protein domain. Notably, by analyzing the exchange profiles in detail, it becomes possible to obtain structural data for states that are about 10-fold less populated and less long lived than has previously been possible. Our results establish that the small FF domain folds via two partially structured intermediates and involves a pair of discrete pathways in which the unfolded state becomes increasingly structured along a volcano-shaped free energy surface.

Keywords: protein folding intermediates, chemical exchange saturation transfer NMR, FF domain, multiple folding pathways, multistate exchange

Abstract

Conformational dynamics play critical roles in protein folding, misfolding, function, misfunction, and aggregation. While detecting and studying the different conformational states populated by protein molecules on their free energy surfaces (FESs) remain a challenge, NMR spectroscopy has emerged as an invaluable experimental tool to explore the FES of a protein, as conformational dynamics can be probed at atomic resolution over a wide range of timescales. Here, we use chemical exchange saturation transfer (CEST) to detect “invisible” minor states on the energy landscape of the A39G mutant FF domain that exhibited “two-state” folding kinetics in traditional experiments. Although CEST has mostly been limited to studies of processes with rates between ∼5 to 300 s−1 involving sparse states with populations as low as ∼1%, we show that the line broadening that is often associated with minor state dips in CEST profiles can be exploited to inform on additional conformers, with lifetimes an order of magnitude shorter and populations close to 10-fold smaller than what typically is characterized. Our analysis of CEST profiles that exploits the minor state linewidths of the 71-residue A39G FF domain establishes a folding mechanism that can be described in terms of a four-state exchange process between interconverting states spanning over two orders of magnitude in timescale from ∼100 to ∼15,000 μs. A similar folding scheme is established for the wild-type domain as well. The study shows that the folding of this small domain proceeds through a pair of sparse, partially structured intermediates via two discrete pathways on a volcano-shaped FES.


The free energy surface (FES) of a molecule dictates the relative energies as well as the interconversion kinetics of the conformers that populate it. A quantitative description of molecular structure and dynamics is predicated, therefore, on a detailed understanding of the FES. In the case of biomolecules, such as proteins, the conformational dynamics that result in the exchange between different molecular states are critical for proper function and can lead to misfunction in some cases (15). In principle, computational methods can be used to determine the FESs for model systems, although more accurate force fields and faster simulation methods are needed for quantitative descriptions. While it remains intractable to experimentally map the FES for complex biomolecules, such as proteins, insights into the shape of the FES can be obtained by detecting the different conformational states that are sampled by the protein under equilibrium and nonequilibrium conditions and by measuring the rates at which these states interconvert with one another (610). For example, determining folding pathways by detecting intermediates provides insights into how a protein folds rapidly by sampling only a subset of all possible conformations. These experimentally detected states and rate constants can be combined with computational techniques to obtain a detailed picture of the FES (11, 12). However, this strategy remains challenging because traditional biophysical techniques cannot detect minor states that are sparsely and fleetingly populated and while modern fluorescence-based methods are indeed sensitive to the presence of minor states (13, 14), they provide little direct atomic resolution structural information. NMR spectroscopy can be used to study the dynamics at virtually all backbone and sidechain positions in a protein, and relaxation-based NMR experiments are sensitive to minor conformational states with lifetimes on the order of ∼0.01 to ∼100 ms and populations as low as ∼1% (1518). Thus, NMR has emerged as a powerful tool to study conformational dynamics of proteins at atomic resolution (7, 15, 1923). The transiently populated minor states that have become amenable to detailed study by NMR are termed “invisible” states, as peaks arising from these conformers are not directly observed in conventional NMR spectra. Yet, as has been established in the past several decades, they can be detected by manipulating the NMR signals associated with the visible major state that they are interconverting with. A number of different NMR experiments have emerged for studies of invisible states. These include Carr–Purcell–Meiboom–Gill (CPMG) relaxation dispersion, chemical exchange and dark-state exchange saturation transfer (CEST and DEST, respectively), and R experiments, with each technique sensitive to a different kinetic exchange regime (15, 16, 20, 2427). These approaches have been used to obtain kinetic and thermodynamic parameters governing the exchange process(es) as well as structural information on the interconverting conformers (2833) and, in some cases, it has been possible to obtain the structures of the invisible minor states (21, 3437). Notably, the methodology is equally applicable to studies of nucleic acids, in which strong correlations between excited conformational states and function have been clearly established (38, 39). In the context of CEST that will be used in the present study and the related DEST technique, the methodology has been exploited to elucidate reaction mechanisms in small molecules (40) and to study the folding (41, 42), aggregation (43, 44), and other conformational exchange processes in large biomolecules, such as proteins (36, 4547) and nucleic acids (16, 38, 48). CEST has also been used to study exchange occurring between a major state and multiple invisible minor states that interconvert with the major state directly or indirectly (36, 41, 42).

One area of focus in our laboratories has involved NMR relaxation–based studies of protein folding intermediates and compact alternate protein conformers that differ from the major state conformation (12, 37, 41, 4951). Studies of the folding mechanisms of a variety of small proteins using different biophysical techniques have provided an understanding of the general principles that govern protein folding (1, 7, 9, 10, 52, 53). One such system is the 71-residue FF domain from human HYPA/FBP11, whose folding mechanism has been studied extensively by using a variety of techniques so that it has emerged as a model system to understand protein folding (8, 37, 5456). Under native conditions, the protein adopts a four-helix bundle structure that consists of helices H1, H2, H3, H4, and a disordered N-terminal tail (57). Initial folding studies suggested that the wild-type FF domain (WT FF) folds via a compact, transiently populated intermediate (54, 55), and subsequent CPMG relaxation dispersion NMR experiments were used to derive the structure of this invisible state (37). The structure showed that the folding intermediate is stabilized by nonnative interactions which slow down the folding processes (37). In a previous study of folding using burst phase analysis, a folding intermediate was detected for 32 out of 33 FF mutants (56). Notably, an intermediate could not be detected for the folding of A39G FF (56), although a folding rate significantly slower (∼135 s−1, 10 °C) than WT FF (∼2,200 s−1, 10 oC) was measured (55), and little insight into the folding pathway of this variant is currently available.

Here, we use 15N-based CEST experiments (58) to elucidate the folding mechanism of A39G FF by taking advantage of the information contained in the minor state linewidths in CEST spectra that provides increased sensitivity to invisible states with populations and lifetimes an order of magnitude smaller than those typically analyzed using this methodology. We show that the A39G FF domain folds through two intermediates via a branched pathway on a volcano-shaped energy landscape, in which the energies of minor, partially folded states increase prior to collapse to the folded conformation and that other FF variants, including the WT FF domain, fold via similar intermediates. Our study further highlights the folding complexities of even single-domain proteins and the unique role that CEST can play in describing increasingly complicated FESs of biomolecules.

Results

Using Sparse States to Detect Other Sparse States via CEST.

As mentioned in the introduction, we have made use of the linewidths of the minor state dips to increase the information content of CEST profiles. (See the SI Appendix, A Brief Description of CEST for an overview of the CEST experiment.) In order to illustrate the approach schematically consider the CEST profiles in Fig. 1. Fig. 1A shows a calculated CEST profile of an 15N spin that results from a two-site exchange process, A ↔ B, with kex,AB = kAB + kBA = 25 s−1 (kij is the rate constant for the reaction from state i to state j) and the fractional population of the minor state pB = 2% (pB = kAB/kex,AB). As states A and B differ in structure, the 15N spin probe in question will have distinct chemical shifts in each state, here taken to be 0 and 7 ppm for states A and B, respectively. In this case, the CEST profile comprises a pair of dips (or peaks) centered at the resonance frequencies of the 15N spin in each state and indicated by the black and green vertical lines in Fig. 1. Notably, the peak from the excited (B) state would not be observed or would be very weak in conventional spectra. In Fig. 1B, a CEST profile for a three-state exchange process, A ↔ B ↔ C, kex,AB = 25 s−1, kex,BC = 200 s−1, pB = 2%, and pC = 0.2% is shown. The chemical shift of state C (ϖC) is set to 3 ppm. Three dips are observed at the chemical shifts of the 15N nucleus in question in each of the three states A, B, and C (blue vertical line). When kex,BC is increased to 4,950 s−1, corresponding to the onset of intermediate-fast exchange (|kex,BC/ΔωBC| ∼1.95 at 23.4 T field and ΔωBC = ωC − ωB, where ωi is the angular resonance frequency [radians/second] of the 15N spin in state i), the dip from state C broadens out and disappears, while the dip derived from B broadens and moves toward the resonance frequency of the 15N spin in state C (Fig. 1C), as expected (5963). Thus, minor state dip positions and widths in 15N CEST profiles are sensitive to the presence of other even more sparsely populated states (C in this example), suggesting that dips from state B, that are themselves only visible in CEST profiles, can be used to “spy” on other even more sparsely populated states that are invisible to CEST. Such exchange among minor states has been challenging to study previously (20, 64). As the position and linewidth of the B state dip that is affected by exchange with C will now depend on the static magnetic field, an analysis of CEST data recorded at different spectrometer field strengths is useful in studying multistate exchange (5963, 65). Note that in all the experiments described here only magnetization from the “visible” major state (A in this case) is detected.

Fig. 1.

Fig. 1.

15N CEST profiles are extremely sensitive to the underlying exchange process(es) that interconvert states. 15N CEST profiles have been calculated for two- (A) and three-state (B and C) exchange using the parameters listed with B1 = 20 Hz, an exchange time TEX = 0.5 s, a static magnetic field of 23.4 T, and chemical shifts of an 15N spin in states A to C of ϖA = 0 ppm, ϖB = 7 ppm, and ϖC = 3 ppm. I and I0 are the intensities of peaks simulated with and without the TEX periods, respectively (58). Longitudinal and transverse relaxation rates of 1 and 10 s−1 were assumed for all sites. Black, green, and blue vertical lines denote ϖA, ϖB, and ϖC, respectively. Because of rapid exchange between states B and C, the minor (“B”) state dip indicated by the gray arrow in C is shifted away from ϖB (green line) toward ϖC (blue line). The ↔ symbol is used to denote equilibrium in all figures, and the number above (or below) it with units of s−1 (second1) is the exchange rate (kex), that is, the sum of the forward and reverse reaction rate constants. See SI Appendix, A Brief Description of CEST for an overview of the CEST experiment.

A39G FF Does Not Fold in a “Two-State” Manner.

As described earlier, WT FF and several of its mutants have all been shown to fold via intermediates, while the slow folding A39G FF variant appears to fold in a “two-state” manner based on conventional stopped-flow experiments (56). Interestingly, the minor unfolded (U) state dip of A39G FF is broadened in 15N CEST profiles (58, 66), much like the state B peak in Fig. 1C, suggesting that the U state is exchanging with other minor states, including possible folding intermediates. To establish whether this is the case, we recorded four 15N CEST datasets with a number of different weak B1 fields to search for invisible conformers: B1 = 15.9 and 29.1 Hz at 600 MHz and B1 = 8.3 and 27.7 Hz at 1 GHz (SI Appendix, Table S1). The amide 1HN15N correlation map (1 °C and 1 GHz; Fig. 2A) of A39G FF is well resolved so that conformational exchange can be studied at almost all backbone amide 15N sites in the domain. 15N CEST profiles recorded with B1 = 27.7 Hz at 1 GHz (Fig. 2B) clearly show a pair of dips, with the main dip derived from the visible, major conformational state of the domain (red numbers in Fig. 2A). Notably, the chemical shifts of the minor state correspond to those of an unfolded protein (referred to in what follows as state U), as a strong correlation is observed between the difference of the U and folded (F) state chemical shifts ΔϖFU = ϖU − ϖF values (parts per million), obtained from two-state fits of the CEST profiles and shift differences calculated using 15N random coil values predicted with the program POTENCI (Fig. 2C) (67). Although we expect the U state intrinsic R2 values (R2U) to be approximately half of the F state intrinsic R2 values (R2F) (68), an exchange model in which R2U was constrained to be R2F/2 did not fit the 15N CEST data well (χred25.3, SI Appendix, Table S2). The fits improved considerably (χred21.43, SI Appendix, Table S2) when R2U was not constrained, with fitted rates that varied significantly from site to site. This large site-to-site variation in R2U rates can be seen in the 15N CEST profiles, in which it is clear that the minor state dip of Ile-43 is broader than dips from Lys-22, Ser-50, and Ser-56 (Fig. 2B). Notably, the exchange-induced broadening of the U state, RexU, approximated by R2UR2F/2, increases with static magnetic field (Fig. 2D) with RexU rates as large as ∼600 s−1 at 1 GHz, suggesting an exchange mechanism that is more complex than two state. It is interesting to note that although the predicted and CEST-derived ΔϖFU values are in good agreement (1.2 ppm RMSD; Fig. 2C), the predicted U state shifts are larger on average than those measured (offset) by ∼0.7 ppm (Fig. 2C).

Fig. 2.

Fig. 2.

The sparse, unfolded state of A39G FF exchanges with other even more rare conformers. (A) A39G FF 15N–1H Heteronuclear Single Quantum Coherence (HSQC) spectrum (1 GHz and 1 °C) with resonances labeled according to the residues from which they derive. (B) 15N CEST profiles recorded at 1 GHz and 1 °C with B1 = 27.7 Hz clearly show a minor state dip in addition to the major state dip. Residues for which CEST profiles are shown in B are labeled in red in A. Experimental data points are shown as magenta circles, and the brown line is calculated from the global two-state best fit parameters, with R2U allowed to vary from site to site (SI Appendix, Table S2). (C) Comparison between 15N ΔϖFU values from global two-state fits of CEST profiles with chemical shift differences obtained using the unfolded state chemical shifts of A39G FF predicted by the program POTENCI (67) and the measured shifts from the folded state. Offset refers to the average difference (QyQx) between the quantities plotted on the y (Qy) and x (Qx) axis. (D) RexU (R2UR2F/2) values for the (minor) unfolded state obtained from fits of CEST dips in profiles recorded at 600 MHz (magenta) and 1 GHz (green) for residues with |ΔϖFU| > 2 ppm show that many dips from the minor state are significantly broadened. A kex value of 76 s−1 (sum of forward and reverse rates) is obtained from the best fit of the CEST data to a two-state exchange model, as indicated above panels C and D.

The 15N CEST data were subsequently analyzed using a linear three-state folding model F↔I↔U, in which folding proceeds via an intermediate I (SI Appendix, Fig. S1). Here, we included three additional 15N CEST datasets recorded with higher B1 values at 1 GHz (64.9, 159.6, and 224.5 Hz; SI Appendix, Table S1). The linear three-state model with constraints R2U = R2F/2 and R2I = R2F did not fit the 15N CEST data well (χred2 = 2.44; SI Appendix, Table S2), but the fits improved considerably (χred2 = 1.27; SI Appendix, Fig. S1A and Table S2) when R2U was unconstrained and R2I = R2F. As in the case of the two-state analysis, the unfolded state chemical shifts obtained from these fits are in good agreement with the predicted values (1.2 ppm RMSD; SI Appendix, Fig. S1B), yet there still remains a 0.6 ppm offset.

Previous studies using CPMG experiments recorded at 30 °C (37) established that WT FF folds via an intermediate; however, further states could not be observed via the CPMG class of experiments. The 15N chemical shifts of the WT FF intermediate are in reasonable agreement with the I state chemical shifts obtained here for A39G FF (RMSD 1.8 ppm; SI Appendix, Fig. S1C), suggesting that WT and A39G FF domains fold via similar intermediates. Yet the fitted R2U rates are still large (SI Appendix, Fig. S1D), with some values exceeding ∼300 s−1 (at 1 GHz), indicating that there is likely an additional exchange mechanism at play that is not captured by the three-state model. Notably, similar poor-quality fits, as those obtained using a linear folding scheme, were also obtained (SI Appendix, Table S2) with a more complex triangular model and the quality of the fits improved (χred21) only when R2U was not constrained. The large, fitted R2U values are consistent with more complex dynamics.

A39G FF Folds via Two Intermediates along Two Pathways.

Having established that neither two- nor three-state folding models could reproduce the experimental CEST data using reasonable R2U values, we next considered different linear four-state exchange schemes with a pair of intermediates, referred to as I1 and I2 in what follows, subject to the constraint that R2I1 = R2I2 = R2F and R2U = R2F/2. Two of these linear exchange schemes are shown in Fig. 3, and although the F ↔ I2 ↔ I1 ↔ U model fit the data better (Fig. 3A) than the I2 ↔ F ↔ I1 ↔ U scheme (Fig. 3B) (χred2 1.1 versus 1.63; SI Appendix, Table S2) and was superior to both two- and three-state models where R2U = R2F/2, the linear four-state fits were still not satisfactory, as illustrated in Fig. 3 A and B for profiles derived from both Ile-43 and Ser-56. However, a four-state model in which all states were allowed to exchange with each other and where R2I1 = R2I2 = R2F and R2U = R2F/2 fit the data well with values of kex,FU and kex,I2U ∼0 s−1 (SI Appendix, Fig. S2A), resulting in the simplified four-state model of Fig. 3C. This model reproduced the CEST data better than previous constrained models described earlier in this section (Fig. 3C, χred20.9; SI Appendix, Table S2) and, importantly, allowing R2U to float during the fits did not improve their quality (χred20.86; SI Appendix, Table S2), in contrast to what was observed for the two- and three-state models (SI Appendix, Table S2). More complex folding pathways were, therefore, not considered. In the four-state exchange model of Fig. 3C, the U state rapidly interconverts with I1 (kex,I1U ∼8,453 s−1) and then folds to F either directly from I1 or via I2 with exchange rates kex,FI1 = 158 ± 20 s−1, kex,FI2 = 344 ± 20 s−1, kex,I1I2 = 1,573 ± 50 s−1, and kex,I1U = 8,453 ± 400 s−1 and fractional populations pI1= 0.34 ± 0.05%, pI2 = 0.19 ± 0.01%, and pU = 1.03 ± 0.05%. Thus, despite the small size of the FF domain, there appears to be more than one folding pathway.

Fig. 3.

Fig. 3.

A two-pathway, four-state folding model reproduces A39G FF CEST data. (AC) 15N CEST profiles of Ile-43 and Ser-56 (B1 = 64.9 Hz and 1 GHz) fitted to different four-state models as shown, along with extracted kex values and populations. Experimental data are represented using magenta points, and the brown line was calculated using the appropriate best fit parameters for models shown on top of the panels. The best fit parameters were obtained from a global analysis of 600 MHz and 1 GHz 15N CEST profiles from 14 residues of A39G FF (SI Appendix, Table S2). The CEST data cannot be satisfied by the linear models (A and B) but is well reproduced using a model in which both I1 and I2 exchange with F (C). Above each of the CEST profiles is plotted the difference between experimental and fitted I/I0 values [Δ(I/I0)]. (D) Comparison between ΔϖFU values obtained using the four-state model shown in C with those calculated from predicted chemical shifts of amides in U by POTENCI (67).

Validation of the Four-State Folding Pathway.

A simple test of the four-state model can be made by comparing the four-state ΔϖFU values with those predicted, and indeed, the agreement is significantly improved relative to what was obtained for both the two- and three-state schemes considered (RMSD 0.7 ppm, Fig. 3D, versus 1.2 ppm, Fig. 2C and SI Appendix, Fig. S1B). Furthermore, the offset between the CEST-derived and -predicted ΔϖFU values has essentially vanished (0.2 ppm, Fig. 3D, versus 0.7 ppm, Fig. 2C and 0.6 ppm SI Appendix, Fig. S1B). In order to establish that the I1 and I2 state chemical shifts are meaningful, we attempted to “simplify” the four-state exchange equilibrium by manipulating the populations of the different minor states through the addition of chemicals or by point mutations (Fig. 4 AE). As a point of reference, Fig. 4A shows four-state fits of 15N CEST data recorded at 1 GHz with B1 = 64.9 Hz for Lys-22, Ile-43, Ser-50, and Ser-56 along with the 15N chemical shifts for these residues in F, I1, I2, and U, indicated using black, blue, green, and red vertical lines, respectively. Note that distinct dips for the I1 state were not observed in any of the 15N CEST profiles, and only Ser-56 showed an I2 state dip (Figs. 3C and 4A). The addition of 1 M urea, that is known to stabilize unfolded structure (1), increased pU (Fig. 4B) so that 15N CEST data (18.3 and 33.9 Hz, 700 MHz) could be well fit to a two-state F↔U exchange model (kex,FU = 17.9 ± 2 s−1 and pU = 7.4 ± 0.4%). The resulting ΔϖFU values are in very good agreement with F, U shift differences obtained both from the four-state fits (RMSD 0.4 ppm; Fig. 4F), and from predicted unfolded state shifts (RMSD 0.6 ppm; SI Appendix, Fig. S3A). Next, we added 10% trifluro ethanol (TFE) to the buffer. TFE can stabilize or destabilize native protein states depending on the system and, additionally, may stabilize folding intermediates (69). Notably, the addition of TFE increased the population of the I1 state (Fig. 4C) so that 15N CEST data (10.4 and 18.2 Hz, 700 MHz) could be fit to a two-state exchange model (kex,FI1 = 95.5 ± 4 s−1 and pI1 = 8.6 ± 0.1%), with the resulting minor state chemical shifts in agreement with those obtained for the I1 state from the four-state model (RMSD 0.9 ppm; Fig. 4G) and with those previously obtained for the folding intermediate of WT FF, based on the analysis of CPMG experiments (37) recorded at 30 °C (RMSD 1 ppm; SI Appendix, Fig. S3B). Thus, A39G FF folds via an intermediate similar to the I state of WT FF, supporting the four-state analysis. The underlying mechanism for the relative stabilization of I1 due to TFE is currently not understood. We also recorded 15N CEST profiles (B1 = 10.3 and 25.8 Hz, 700 MHz, 5 °C; Fig. 4D) of A17G FF (dissolved in buffer containing 20% glycerol), in which the population of the I2 state was elevated. Fits of the data to a two-site exchange model (kex,FI2 = 104.6 ± 10 s−1 and pI2 = 1.0 ± 0.1%) produced intermediate chemical shifts in good agreement with 1) those from the four-state model (RMSD 1 ppm; Fig. 4H) and 2) from analysis of CPMG experiments recorded on A17G FF at 25 °C (8) (RMSD 0.7 ppm; SI Appendix, Fig. S3C). It is worth emphasizing that none of the three systems of Fig. 4 BD are strictly two-state since significant exchange contributions to the minor state dips were observed. Nonetheless, these systems provide a means to test the predictions of the four-state model (Fig. 4 FH).

Fig. 4.

Fig. 4.

Fits of the CEST data to the four-state exchange model are meaningful. (A) 15N CEST profiles recorded for Lys-22, Ile-43, Ser-50, and Ser-56 with B1 = 64.9 Hz, 1 GHz, and 1 °C. The four-state model is shown on the right, along with extracted exchange kex rates and populations. In AE, the chemical shifts of the minor states obtained from the four-state analysis are shown using blue, green, and red lines for I1, I2, and U, respectively, while the position of the major native state is indicated with a black line in A. (B) A39G FF 15N CEST profiles (B1 = 18.3 Hz, 700 MHz, and 1 °C) recorded in the presence of 1 M urea and analyzed using the two-state exchange model shown on the right. Minor state dips appear close to chemical shifts predicted for U by the four-state analysis (red line). (C) A39G FF 15N CEST profiles (B1 = 10.4 Hz, 700 MHz, and 1 °C) recorded with 10% TFE and analyzed using the two-state model shown on the right. Minor state dips close to the I1 state chemical shift predicted by the four-state analysis are observed (blue line). (D) 15N CEST profiles (B1 = 10.3 Hz, 700 MHz, and 5 °C) for A17G FF + 20% glycerol analyzed using a two-state model. Minor state dips proximal to I2 state chemical shifts predicted by the four-state analysis (green line) are observed. (E) A39G FF 15N CEST profiles recorded at 15 °C in the presence of 25% glycerol, analyzed using the three-state model, shown on the right. A dip at I1 is now visible close to the I1 state chemical shift predicted by the four-state analysis (blue line). In AE, the experimental data points are shown in magenta, with the brown lines calculated using the appropriate global best fit parameters. (FH) Comparison of A39G FF ΔϖFU (F), ΔϖFI1 (G), or ΔϖFI2 (H) values obtained using the four-state analysis (A) with the corresponding ΔϖFU (F), ΔϖFI1 (G), or ΔϖFI2 (H) values fitted from the two-state analysis of CEST profiles recorded using A39G FF in the presence of 1 M urea (B), 10% TFE (C), or A17G FF in the presence of 20% glycerol (D). (I and J) Comparison of the ΔϖFI1 (I) or ΔϖFI2 (J) values obtained using the four-state analysis (A) with ΔϖFI1 or ΔϖFI2 values from three-state fits of A39G FF CEST data recorded in presence of 25% glycerol (E). (K) Correlation plot of ΔϖFU[1 M urea] − ΔϖFU[two-state] versus ΔϖFU[1 M urea] − ΔϖFI1[10% TFE]; all Δϖ values used in (K) were obtained from two-state fits of CEST data (see the section titled Validation of the Four-State Folding Pathway for details).

Fits using the four-state exchange model established that pU ∼3pI1, with U and I1 in rapid exchange (Fig. 4A). Thus, states I1 and U should give rise to a single dip in 15N CEST profiles at a position close to ϖU and shifted slightly toward ϖI1 (6063). This is clearly apparent for Ile-43 and Ser-50 in Fig. 4A, in which the minor dip is in between the blue and red vertical lines denoting ϖI1 and ϖU, respectively. As pU is reduced upon addition of 10% TFE, the averaged I1-U dip should appear closer to ϖI1 (blue line), as observed for Ile-43 and Ser-50 (compare Fig. 4 A and C). Note that the fact that the minor dip does not superimpose with ϖI1 for Ile-43 (Fig. 4C) suggests that pU ≠ 0, even with 10% TFE, and that a two-site model of exchange is an oversimplification for A39G FF + 10% TFE, as discussed earlier in this section.

Glycerol is known to stabilize proteins (1), and indeed, upon addition of 25% glycerol to the A39G FF sample (Fig. 4E), pU was reduced significantly, and dips at ϖU were not observed in the 15N CEST profiles (compare Fig. 4 B and E), while dips at ϖI1 were observed (compare Fig. 4 C and E). Notably, however, the broad I1 state dip for Ile-43 is shifted toward U (Fig. 4E), suggesting that a small residual amount of the U state is present, even with 25% glycerol. The ϖI1 and ϖI2 chemical shifts obtained from fits to a three-state exchange process (kex,FI1 = 145.5 ± 10 s−1, kex,FI2 = 330.3 ± 15 s−1, kex,I1I2 = 1,237.2 ± 50 s−1, pI1 = 0.82 ± 0.02%, and pI2 = 0.66 ± 0.02%; SI Appendix, Table S3) are in good agreement with those obtained from the four-state analysis (Fig. 4 I and J).

As a final test of the four-state analysis, we compared ΔϖFU[1 M urea] − ΔϖFU[two-state] and ΔϖFU[1 M urea] − ΔϖFI1[10% TFE], where ΔϖFU[1 M urea] (Fig. 4B), ΔϖFU[two-state] (Fig. 2C), and ΔϖFI1[10% TFE] (Fig. 4C) are obtained from two-state fits of A39G FF CEST profiles recorded on samples with and without urea, and with 10% TFE, respectively (Fig. 4K). For fast exchange on the NMR chemical shift timescale between U and I1 with pU > pI1, the averaged U-I1 dip position, ϖavg, will be shifted by an amount pI1pI1+pUϖI1-ϖU from the pure U state position towards I1 so that ϖU − ϖavg = pI1pI1+pU(ϖUϖI1). As ϖU[two-state] = ϖavg, the difference, ΔϖFU[1 M urea] − ΔϖFU[two-state], is, to an excellent approximation, given by pI1pI1+pU(ϖUϖI1). Similarly, ΔϖFU[1 M urea] − ΔϖFI1[10% TFE] ∼ (ϖU − ϖI1), assuming that only states F and I1 are populated when 10% TFE is added. Thus, the slope of the linear correlation in Fig. 4K should be pI1pI1+pU, which is calculated to be 0.25 ± 0.02 based on the parameters extracted from fits of the four-state model to the CEST data. This agrees reasonably well with a slope of 0.32 ± 0.01 that is measured from the correlation plot, with the modest difference likely due to a residual small population of U in the 10% TFE sample. Notably, if pI1pI1+pU is constrained to 0.32 in the four-state fits, the extracted ΔϖFI1, ΔϖFI2, and ΔϖFU values remain essentially the same, and the fit quality is unaltered (χred20.91) (SI Appendix, Fig. S4). SI Appendix, How Robust Is the Four-State Model? also discusses some more aspects of the validity and robustness of the four-state model.

Having established that robust and meaningful measures of chemical shift differences can be obtained from the four-state analysis and that I1 and U interconvert rapidly, we can now understand why ΔϖFU values estimated on the basis of two-state fits (Fig. 2C) are, on average, less than the predicted differences. Since most of the ΔϖFU and ΔϖI1U values are positive, with ΔϖFU > ΔϖI1U, the resulting shift in the dip position from ϖU toward ϖI1 due to exchange will reduce the fitted ΔϖFU values. In contrast, when the exchange kinetics are properly taken into account, the fitted ΔϖFU values provide an accurate measure of the actual differences between chemical shifts in the F and U states (Fig. 3D). As rapid exchange between the minor states can shift the dips in CEST profiles considerably (Figs. 1C and 4 A and B), it may be prudent to test if the CEST- (or CPMG-, for that matter) derived minor state shifts are physically meaningful when interpreting them in terms of structure. If the shifts obtained are not physical, this implies that the exchange model used for data fitting is incomplete.

Folding Intermediates I1 and I2 Are Largely Structured.

Prior to describing the structural features of I1 and I2, it is important to establish that indeed these two intermediates are unique and that they can be distinguished on the basis of their 15N chemical shifts. That this is the case is made clear by the good agreement between ΔϖFI1 values obtained from the four-state fit and ΔϖFI1[10% TFE] (RMSD 0.9 ppm; SI Appendix, Fig. S5A) and by the poor agreement with ΔϖFI2 [A17G FF + 20% glycerol] (RMSD 2.8 ppm; SI Appendix, Fig. S5B). Conversely, ΔϖFI2 values from the four-state fit agree well with ΔϖFI2 obtained from the analysis of CEST profiles of A17G FF + 20% glycerol (RMSD 1.0 ppm; SI Appendix, Fig. S5C) but are in poor agreement with ΔϖFI1[10% TFE] (RMSD 2.3 ppm; SI Appendix, Fig. S5D); see SI Appendix, Tables S4–S7 for compilation of chemical shift values.

Having established that the 15N chemical shifts are meaningful, we can then use them with confidence to obtain insights into the conformational preferences of the I1 and I2 states. Figs. 5 AC show |ΔϖFU| (Fig. 5A), |ΔϖFI1| (Fig. 5B), and |ΔϖFI2| (Fig. 5C) values for A39G FF, along with the structure of WT FF in Fig. 5D [Protein Data Bank (PDB) ID: 1UZC (57)]. As expected, the U state chemical shifts differ significantly from those of the native state throughout the molecule (Fig. 5A). In the case of the I1 state, the 15N chemical shifts of the first ∼50 residues are similar to those of the native state, while amide shifts of the last ∼20 residues that encompass helices H3 and H4 differ significantly from shifts in the folded conformation (Fig. 5 B and E). Given that the I1 15N chemical shifts of A39G FF are well correlated with the I shifts of WT FF determined by a three-state analysis of CPMG data (37) (SI Appendix, Fig. S3B), the structure of the FF WT I state derived previously (37) (Fig. 5F) is likely to be an excellent model for the I1 conformer of A39G FF. In WT FF, both helices H1 and H2 are present in I, while helix H3 is longer and helix H4 shorter than in the native state (compare Fig. 5 D and F). An A39G FF I1 conformation with these structural features would give rise to these observed ΔϖFI1 values.

Fig. 5.

Fig. 5.

Chemical shifts report on the progression of structure from U to I1 to I2. (AC) Values of |ΔϖFU|, |ΔϖFI1|, and |ΔϖFI2| (from the four-state analysis) are plotted along the sequence. (D) The native structure (57) of the FF domain (PDB: 1UZC) consists of four helices: H1 (Thr13 to Lys28; dark green), H2 (Ser-35 to Asn-45; cyan), H3 (Asp-46 to Leu-52; magenta), and H4 (Lys-54 to Thr-69; orange). (E) Residues with large |ΔϖFI1| values (>3 ppm) are largely localized (B) to the sequence between Ser-50 and Gln-68 that is colored in blue on the structure of the WT FF domain. (F) The structure of the folding intermediate of FF WT (PDB: 2KZG) (37), colored as in D. The last five disordered residues (Val-67 to Lys-71) are not shown. (G) Highlighted on the structure of WT FF in green are residues from the Ser-50 to Lys-59 region, where the large |ΔϖFI2| values (>3 ppm) are localized, as in C. The sidechain of Ser-56 that has the largest |ΔϖFI2| value (>10 ppm) is shown. The olive line in each panel is drawn at 3 ppm. The first 10 (Gly-1 to Thr-10) disordered residues are not shown in DG.

The 15N chemical shifts of I2 are more native like than those for I1 (compare Fig. 5 B and C), with large ΔϖFI2 values observed only for residues in the stretch between Ser50 to Lys59 (Fig. 5 C and G) that includes the end of helix H3 and beginning of helix H4. Thus, the conformation of I2 is closer to that of the native state than I1, with the C-terminal half of helix H4 that was unstable and partially disordered in the I state of WT FF adopting a helical, native-like conformation in I2 (note the small ΔϖFI2 values for residues Gln-60 to Lys-71; Fig. 5C). Detailed structural studies of I2 are currently underway using a variety of CEST experiments to probe the exchange dynamics at backbone and sidechain positions in the domain (58, 66, 7073).

The Four-State Model Explains Why A39G FF Appears To Fold without Intermediates in Stopped-Flow Experiments.

The kinetic matrix for the four-state exchange model (Fig. 3C) can be propagated (SI Appendix) to predict the folding trajectory of A39G FF. Fig. 6A shows the populations of states U, I1, I2, and F, starting from U (pU = 1 at t = 0 s), as a function of time obtained by solving kinetic rate equations using the rate constants from the four-state fits of our CEST measurements. Fig. 6A shows a gradual increase in pI1 (blue), followed by a subsequent buildup of I2 (green) as the U state (red) population decreases. During this folding process, both pI1 and pI2 increase to values higher than at equilibrium and then decrease as pF (black) grows to its equilibrium value. The kinetic matrix describing the evolution of an N state system has N-1 nonzero eigenvalues that are related to the relaxation rates of “transition modes” connecting the different states in the system that in turn are given by the eigenvectors of the matrix (1, 74). The final (Nth) eigenvalue is zero, and the corresponding eigenvector lists the equilibrium populations. The relaxation times of the kinetic rate matrix (Fig. 6B) indicate a slow process (14.52 ms, maroon) and two significantly faster events (0.71 ms, cyan and 0.11 ms, orange). The eigenvectors corresponding to these relaxation times show that the fastest process (Fig. 6C Top, orange) is predominantly one involving the I1-U interconversion; the next fastest can be mainly described in terms of an exchange between U and I2 (Fig. 6C, Middle, cyan), while the slowest process (Fig. 6C Bottom, maroon) corresponds to interconversion between F and the other three states, although mainly between F and U. Thus, a picture emerges whereby F interconverts slowly with I1, I2, and U that, in turn, rapidly interconvert with each other. The folding transition state, denoted by the pink dashed line in Fig. 6D, separates F from the other three states. The large separation of timescales, which effectively sequesters F from the remaining conformers, leads to an apparent two-state folding mechanism via stopped-flow experiments (56), with the folding rate determined by the slowest nonzero eigenvalue.

Fig. 6.

Fig. 6.

A volcano-shaped FES for the A39G FF domain. (A) Dynamics of the folding reaction with all the molecules in state U at t = 0 s, computed by numerically propagating the kinetic equations (SI Appendix), describing the four-state folding model using measured rate constants from 15N CEST. (B) Relaxation times obtained from diagonalization of the kinetic matrix (SI Appendix) show one slow timescale process at ∼14.52 ms (maroon) and two faster processes occurring with time constants of ∼0.71 ms (cyan) and ∼0.11 ms (orange). (C) Eigenvectors corresponding to the three relaxation rates in B highlight the principal dynamic modes as A39G FF folds. The heights of the bars correspond to the relative contribution of a given state to the eigenvector. (D) The folding transition state (TS) separates F from the other three states, as indicated on the four-state folding model. (E) Energy level diagram for A39G FF at 1 °C. The structures above the horizontal energy levels for F and I1 are those determined previously (PDB ID: 1UZC and 2KZG for F and I1, respectively), while the structure for I2 is currently not known (indicated with “?”). Below each of the U, I1 and I2 energy levels is the native structure colored according to Fig. 5 AC to highlight the regions where chemical shifts differ significantly from those of the F state. (F) A schematic illustration of the volcano-shaped FES of A39G FF derived from the CEST experiments showing the increase in free energy from U to I1 to I2, as the structures become more native like, before the energy dramatically decreases because of the formation of tertiary interactions that stabilize the dominant F state.

Discussion

In this study, we have used 15N CEST to elucidate the folding mechanism of the A39G FF domain by focusing on the linewidths of the minor state dips in the CEST profiles. Contributions to the linewidths of minor state dips are a powerful, and often overlooked feature, of CEST profiles that can be used to inform on the existence of additional sparse states that would otherwise escape detection, even by relaxation-based NMR experiments. Moreover, based on the present study, including linewidths in the analysis can, in some cases, extend the applicability of CEST to studies of states with lifetimes an order of magnitude shorter and populations close to 10-fold smaller than what is typically characterized using CEST. In this manner, we were able to show that A39G FF, for which folding intermediates were not detected using more traditional approaches, such as stopped-flow fluorescence (56), clearly folds via two distinct intermediates along at least two pathways and that the transition state separates the folded state from the sparely populated intermediates characterized here. As CEST “reconstructs” the spectra of these minor states, it is possible to obtain detailed structural information that in some cases can lead to structural models (51, 7577). 15N chemical shifts from the present study make it clear that I1 is structurally similar to the folding intermediate of the WT FF domain (37) that was detected using CPMG-based relaxation methods (37). I1 is thus largely structured, except for the C-terminal half of H4. Notably, the second intermediate, I2, is more natively structured, with the C-terminal half of H4 folded. The CEST-derived four-state exchange matrix establishes that F interconverts slowly with U, I1, and I2, while these three states rapidly interconvert with each other (Fig. 6), explaining why A39G FF appears to fold without intermediates when studied using other spectroscopic techniques.

The increase in structure from U to I1 to I2 that is based on a comparison of 15N Δϖ values from CEST is consistent with previously measured m-values using CPMG-based experiments recorded on a number of different FF variants (8). Thermodynamic m-values measure how the free energy difference between states depends on urea concentration and are defined relative to the folded state, such that mKF=mKmF = −dΔGKFd[urea] and ΔGKF=GKGF. Positive values of mKF indicate less compaction of state K relative to F, with mK increasing as state K becomes progressively less folded (78, 79). The values, mUF = 0.74 (obtained from A17G FF), mI1F = 0.31 (obtained from WT FF), and mI2F = 0.16 (obtained from A17G FF), further establish that compaction increases as the folding reaction progresses and also show that the native state is more compact than the other minor states. Furthermore, the relatively high value of mUF compared to mI1F and mI2F explains why the addition of 1 M urea to A39G FF (Fig. 4B) results in a relative increase of the U state population compared to the I1 and I2 states. As pU is significantly larger than either of pI1 and pI2 (Fig. 4A, Right), even in the absence of urea, the addition of urea effectively simplifies the exchange to two state. Interestingly, almost every FF variant studied folds via an intermediate (8, 54, 56). The results from the present study establish that the I2 state is relatively compact (compared to I1 and U), with a conformation similar to that of the native state (Fig. 5). The fact that U interconverts with the less compact and less native-like I1 conformer but not I2 suggests, rather intuitively, that folding proceeds along pathways whereby intermediates accumulate structure on route to the formation of the folded state.

The relative energies of minor state conformers identified from the four-state analysis of A39G FF domain folding reported here increase from U to I1 to I2 (i.e., pI2 < pI1 < pU; Fig. 6E), as the amount of secondary structure in the domain grows and the tertiary structures becomes more compact. The transition to the native structure then occurs with a dramatic decrease in free energy. These observations have broad implications for the nature of the FF domain energy landscape. The foldon–funnel model proposed by Rollins and Dill (80) that builds on the “foldon” concept pioneered by Englander and coworkers (81, 82) assumes that a protein folds sequentially along its folding pathway(s) in units of secondary structure (foldons) that are stabilized by tertiary interactions. However, when the secondary structural elements are unstable, this model leads to a volcano-shaped FES (Fig. 6F) with the transition state located between F and the remaining states, resulting in an apparent two-state folding mechanism (80). The folding intermediates are expected to be loosely packed according to this model so that the entropy lost as the (unstable) structure is progressively formed is not yet fully compensated by interactions between secondary structural elements, giving rise to increases in free energies of states that are formed as the folding reaction progresses. Such a scenario is observed for A39G FF domain folding, where I1 has two native secondary structural elements (helices H1 and H2), while I2 has three (helices H1, H2 and the C-terminal half of helix H4), yet these helices interact with each other only loosely, giving rise to a volcano-shaped landscape (Fig. 6 E and F).

Although the present study has focused on the folding of A39G FF, showing at least a four-state process, our results have broader implications, informing on folding principles of FF domains in general. We have reexamined the folding behavior of the WT FF domain that was studied previously via CPMG approaches (8, 37), in which the data were well fit by a model containing only a single folding intermediate that is similar to the I1 state reported here. Notably, 15N CEST profiles of WT FF recorded at 1 °C show clearly that the I2 state is populated as well (SI Appendix, Fig. S6). The folding of A17G FF, also studied by CPMG approaches, was analyzed previously in terms of a single intermediate (8), corresponding to I2 for A39G FF characterized in this work (Fig. 4D). Yet 15N CEST experiments indicate that state I1 is also populated (SI Appendix, Fig. S7). Thus, the A39G variant is not unique, and a four-state folding mechanism is operative for the WT and A17G FF domains as well. It is likely that the low populations of these additional states preclude observation via CPMG. Indeed, even with excellent CPMG data, it remains unclear as to whether processes more complex than (global) three state can be properly analyzed. We have found that CEST profiles can be extremely sensitive to the underlying exchange kinetics, even for low populated states, as illustrated in Fig. 1 and in SI Appendix, Fig. S8 for three- and four-state processes, respectively, increasing its utility for studying the conformational exchange between multiple states with disparate populations and lifetimes. We expect that the extraction of the full information content inherent in CEST datasets will significantly benefit from the development of machine learning–based approaches, in which algorithms are trained on vast amounts of synthetic data (8385) covering a wide range of different potential exchange models so as to provide reasonable starting exchange parameters and minor state chemical shifts. The continued development of NMR relaxation experiments and the computational tools to best analyze them will open up the possibility for obtaining a detailed understanding of the underlying energy landscapes governing a plethora of different biological processes, including, for example, protein folding/misfolding, molecular recognition, and enzyme function.

Materials and Methods

All the FF variants were expressed in Escherichia coli and purified by ion exchange chromatography followed by size exclusion chromatography. Each NMR sample contained ∼2 mM [U–15N] protein dissolved in the desired buffer. NMR experiments were performed at fields ranging from 11.7 to 23.5 Tesla. Details of protein expression and purification, NMR experiments, and CEST profile analysis are provided in SI Appendix.

Acknowledgments

This work was supported by funding from the Canadian Institutes of Health Research to L.E.K. and intramural funding from the Tata Institute of Fundamental Research (TIFR) (Department of Atomic Energy, Government of India) and the TIFR Hyderabad NMR center to P.V. Dr. K. Rao (TIFR Hyderabad) is acknowledged for maintenance of the TIFR Hyderabad spectrometer facilities, and Dr. G Bouvignies (Ecole Normale Superieure Paris) is acknowledged for providing the program ChemEx. Y.T. was supported by a Japan Society for the Promotion of Science Overseas Research Fellowship and a Uehara Memorial Foundation postdoctoral fellowship. L.E.K. acknowledges a Canada Research Chair in Biochemistry.

Footnotes

Author contributions: V.P.T., Y.T., D.D., L.E.K., and P.V. performed research; V.P.T., Y.T., D.D., L.E.K., and P.V. analyzed data; and L.E.K and P.V. wrote the paper.

The authors declare no competing interest.

This article is a PNAS Direct Submission. H.J.D. is a guest editor invited by the Editorial Board.

This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2115113118/-/DCSupplemental.

Data Availability

All relevant data are included in the article and/or SI Appendix.

References

  • 1.Bahar I., Jernigan R. L., Dill K. A., Protein Actions: Principles and Modeling (Garland Science, Taylor & Francis Group, New York, 2017). [Google Scholar]
  • 2.Wolynes P. G., Eaton W. A., Fersht A. R., Chemical physics of protein folding. Proc. Natl. Acad. Sci. U.S.A. 109, 17770–17771 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Frauenfelder H., Sligar S. G., Wolynes P. G., The energy landscapes and motions of proteins. Science 254, 1598–1603 (1991). [DOI] [PubMed] [Google Scholar]
  • 4.Karplus M., Aspects of protein reaction dynamics: Deviations from simple behavior. J. Phys. Chem. B 104, 11–27 (2000). [Google Scholar]
  • 5.Dobson C. M., Protein folding and misfolding. Nature 426, 884–890 (2003). [DOI] [PubMed] [Google Scholar]
  • 6.Gelman H., Gruebele M., Fast protein folding kinetics. Q. Rev. Biophys. 47, 95–142 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zhuravleva A., Korzhnev D. M., Protein folding by NMR. Prog. Nucl. Magn. Reson. Spectrosc. 100, 52–77 (2017). [DOI] [PubMed] [Google Scholar]
  • 8.Korzhnev D. M., Religa T. L., Lundström P., Fersht A. R., Kay L. E., The folding pathway of an FF domain: Characterization of an on-pathway intermediate state under folding conditions by (15)N, (13)C(alpha) and (13)C-methyl relaxation dispersion and (1)H/(2)H-exchange NMR spectroscopy. J. Mol. Biol. 372, 497–512 (2007). [DOI] [PubMed] [Google Scholar]
  • 9.Charlier C., et al. , Study of protein folding under native conditions by rapidly switching the hydrostatic pressure inside an NMR sample cell. Proc. Natl. Acad. Sci. U.S.A. 115, E4169–E4178 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Eaton W. A., Modern kinetics and mechanism of protein folding: A retrospective. J. Phys. Chem. B 125, 3452–3467 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chung H. S., Piana-Agostinetti S., Shaw D. E., Eaton W. A., Structural origin of slow diffusion in protein folding. Science 349, 1504–1510 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Vallurupalli P., Chakrabarti N., Pomès R., Kay L. E., Atomistic picture of conformational exchange in a T4 lysozyme cavity mutant: An experiment-guided molecular dynamics study. Chem. Sci. 7, 3602–3613 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Xie X. S., Single-molecule approach to dispersed kinetics and dynamic disorder: Probing conformational fluctuation and enzymatic dynamics. J. Chem. Phys. 117, 11024–11032 (2002). [Google Scholar]
  • 14.Kim J.-Y., Chung H. S., Disordered proteins follow diverse transition paths as they fold and bind to a partner. Science 368, 1253–1257 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Vallurupalli P., Sekhar A., Yuwen T., Kay L. E., Probing conformational dynamics in biomolecules via chemical exchange saturation transfer: A primer. J. Biomol. NMR 67, 243–271 (2017). [DOI] [PubMed] [Google Scholar]
  • 16.Rangadurai A., Szymaski E. S., Kimsey I. J., Shi H., Al-Hashimi H. M., Characterizing micro-to-millisecond chemical exchange in nucleic acids using off-resonance R relaxation dispersion. Prog. Nucl. Magn. Reson. Spectrosc. 112-113, 55–102 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lundström P., Akke M., Microsecond protein dynamics measured by 13Calpha rotating-frame spin relaxation. ChemBioChem 6, 1685–1692 (2005). [DOI] [PubMed] [Google Scholar]
  • 18.Reddy J. G., et al. , Simultaneous determination of fast and slow dynamics in molecules using extreme CPMG relaxation dispersion experiments. J. Biomol. NMR 70, 1–9 (2018). [DOI] [PubMed] [Google Scholar]
  • 19.Sekhar A., Kay L. E., An NMR view of protein dynamics in health and disease. Annu. Rev. Biophys. 48, 297–319 (2019). [DOI] [PubMed] [Google Scholar]
  • 20.Palmer A. G. III, Koss H., Chemical exchange. Methods Enzymol. 615, 177–236 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hansen D. F., Vallurupalli P., Kay L. E., Using relaxation dispersion NMR spectroscopy to determine structures of excited, invisible protein states. J. Biomol. NMR 41, 113–120 (2008). [DOI] [PubMed] [Google Scholar]
  • 22.Torchia D. A., NMR studies of dynamic biomolecular conformational ensembles. Prog. Nucl. Magn. Reson. Spectrosc. 84-85, 14–32 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Tugarinov V., Clore G. M., Exchange saturation transfer and associated NMR techniques for studies of protein interactions involving high-molecular-weight systems. J. Biomol. NMR 73, 461–469 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Anthis N. J., Clore G. M., Visualizing transient dark states by NMR spectroscopy. Q. Rev. Biophys. 48, 35–116 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sekhar A., Kay L. E., NMR paves the way for atomic level descriptions of sparsely populated, transiently formed biomolecular conformers. Proc. Natl. Acad. Sci. U.S.A. 110, 12867–12874 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Forsen S., Hoffman R. A., Study of moderately rapid chemical exchange reactions by means of nuclear magnetic double resonance. J. Chem. Phys. 39, 2892–2901 (1963). [Google Scholar]
  • 27.Ward K. M., Aletras A. H., Balaban R. S., A new class of contrast agents for MRI based on proton chemical exchange dependent saturation transfer (CEST). J. Magn. Reson. 143, 79–87 (2000). [DOI] [PubMed] [Google Scholar]
  • 28.Vallurupalli P., Hansen D. F., Stollar E., Meirovitch E., Kay L. E., Measurement of bond vector orientations in invisible excited states of proteins. Proc. Natl. Acad. Sci. U.S.A. 104, 18473–18477 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Vallurupalli P., Hansen D. F., Kay L. E., Probing structure in invisible protein states with anisotropic NMR chemical shifts. J. Am. Chem. Soc. 130, 2734–2735 (2008). [DOI] [PubMed] [Google Scholar]
  • 30.Hansen D. F., Vallurupalli P., Kay L. E., Quantifying two-bond 1HN-13CO and one-bond 1H(alpha)-13C(alpha) dipolar couplings of invisible protein states by spin-state selective relaxation dispersion NMR spectroscopy. J. Am. Chem. Soc. 130, 8397–8405 (2008). [DOI] [PubMed] [Google Scholar]
  • 31.Igumenova T. I., Brath U., Akke M., Palmer A. G. III, Characterization of chemical exchange using residual dipolar coupling. J. Am. Chem. Soc. 129, 13396–13397 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Zhao B., Zhang Q., Measuring residual dipolar couplings in excited conformational states of nucleic acids by CEST NMR spectroscopy. J. Am. Chem. Soc. 137, 13480–13483 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Ma R. S., et al. , Determination of pseudocontact shifts of low-populated excited states by NMR chemical exchange saturation transfer. Phys. Chem. Chem. Phys. 18, 13794–13798 (2016). [DOI] [PubMed] [Google Scholar]
  • 34.Vallurupalli P., Hansen D. F., Kay L. E., Structures of invisible, excited protein states by relaxation dispersion NMR spectroscopy. Proc. Natl. Acad. Sci. U.S.A. 105, 11766–11771 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Grey M. J., Wang C., Palmer A. G. III, Disulfide bond isomerization in basic pancreatic trypsin inhibitor: Multisite chemical exchange quantified by CPMG relaxation dispersion and chemical shift modeling. J. Am. Chem. Soc. 125, 14324–14335 (2003). [DOI] [PubMed] [Google Scholar]
  • 36.Xie T., Saleh T., Rossi P., Kalodimos C. G., Conformational states dynamically populated by a kinase determine its function. Science 370, eabc2754 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Korzhnev D. M., Religa T. L., Banachewicz W., Fersht A. R., Kay L. E., A transient and low-populated protein-folding intermediate at atomic resolution. Science 329, 1312–1316 (2010). [DOI] [PubMed] [Google Scholar]
  • 38.Zhao B., Guffy S. L., Williams B., Zhang Q., An excited state underlies gene regulation of a transcriptional riboswitch. Nat. Chem. Biol. 13, 968–974 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kimsey I. J., et al. , Dynamic basis for dG•dT misincorporation via tautomerization and ionization. Nature 554, 195–201 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Ramanujam V., Charlier C., Bax A., Observation and kinetic characterization of transient Schiff base intermediates by CEST NMR spectroscopy. Angew. Chem. Int. Ed. Engl. 58, 15309–15312 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Vallurupalli P., Tiwari V. P., Ghosh S., A double-resonance CEST experiment to study multistate protein conformational exchange: An application to protein folding. J. Phys. Chem. Lett. 10, 3051–3056 (2019). [DOI] [PubMed] [Google Scholar]
  • 42.Lim J., Xiao T., Fan J., Yang D., An off-pathway folding intermediate of an acyl carrier protein domain coexists with the folded and unfolded states under native conditions. Angew. Chem. Int. Ed. Engl. 53, 2358–2361 (2014). [DOI] [PubMed] [Google Scholar]
  • 43.Milojevic J., Esposito V., Das R., Melacini G., Understanding the molecular basis for the inhibition of the Alzheimer’s Abeta-peptide oligomerization by human serum albumin using saturation transfer difference and off-resonance relaxation NMR spectroscopy. J. Am. Chem. Soc. 129, 4282–4290 (2007). [DOI] [PubMed] [Google Scholar]
  • 44.Fawzi N. L., Ying J., Ghirlando R., Torchia D. A., Clore G. M., Atomic-resolution dynamics on the surface of amyloid-β protofibrils probed by solution NMR. Nature 480, 268–272 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Sekhar A., et al. , Thermal fluctuations of immature SOD1 lead to separate folding and misfolding pathways. eLife 4, e07296 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Gladkova C., et al. , An invisible ubiquitin conformation is required for efficient phosphorylation by PINK1. EMBO J. 36, 3555–3572 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Deshmukh L., Tugarinov V., Appella D. H., Clore G. M., Targeting a dark excited state of HIV-1 nucleocapsid by antiretroviral thioesters revealed by NMR spectroscopy. Angew. Chem. Int. Ed. Engl. 57, 2687–2691 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Rangadurai A., Shi H., Al-Hashimi H. M., Extending the sensitivity of CEST NMR spectroscopy to micro-to-millisecond dynamics in nucleic acids using high-power radio-frequency fields. Angew. Chem. Int. Ed. Engl. 59, 11262–11266 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Korzhnev D. M., et al. , Nonnative interactions in the FF domain folding pathway from an atomic resolution structure of a sparsely populated intermediate: An NMR relaxation dispersion study. J. Am. Chem. Soc. 133, 10974–10982 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Neudecker P., et al. , Structure of an intermediate state in protein folding and aggregation. Science 336, 362–366 (2012). [DOI] [PubMed] [Google Scholar]
  • 51.Bouvignies G., et al. , Solution structure of a minor and transiently formed state of a T4 lysozyme mutant. Nature 477, 111–114 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Neudecker P., Lundström P., Kay L. E., Relaxation dispersion NMR spectroscopy as a tool for detailed studies of protein folding. Biophys. J. 96, 2045–2054 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Fersht A., Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding (W.H. Freeman, New York, 1999). [Google Scholar]
  • 54.Jemth P., et al. , Demonstration of a low-energy on-pathway intermediate in a fast-folding protein by kinetics, protein engineering, and simulation. Proc. Natl. Acad. Sci. U.S.A. 101, 6450–6455 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Jemth P., et al. , The structure of the major transition state for folding of an FF domain from experiment and simulation. J. Mol. Biol. 350, 363–378 (2005). [DOI] [PubMed] [Google Scholar]
  • 56.Jemth P., Johnson C. M., Gianni S., Fersht A. R., Demonstration by burst-phase analysis of a robust folding intermediate in the FF domain. Protein Eng. Des. Sel. 21, 207–214 (2008). [DOI] [PubMed] [Google Scholar]
  • 57.Allen M., Friedler A., Schon O., Bycroft M., The structure of an FF domain from human HYPA/FBP11. J. Mol. Biol. 323, 411–416 (2002). [DOI] [PubMed] [Google Scholar]
  • 58.Vallurupalli P., Bouvignies G., Kay L. E., Studying “invisible” excited protein states in slow exchange with a major state conformation. J. Am. Chem. Soc. 134, 8148–8161 (2012). [DOI] [PubMed] [Google Scholar]
  • 59.Millet O., Loria J. P., Kroenke C. D., Pons M., Palmer A. G., The static magnetic field dependence of chemical exchange linebroadening defines the NMR chemical shift time scale. J. Am. Chem. Soc. 122, 2867–2877 (2000). [Google Scholar]
  • 60.Vallurupalli P., Bouvignies G., Kay L. E., Increasing the exchange time-scale that can be probed by CPMG relaxation dispersion NMR. J. Phys. Chem. B 115, 14891–14900 (2011). [DOI] [PubMed] [Google Scholar]
  • 61.Skrynnikov N. R., Dahlquist F. W., Kay L. E., Reconstructing NMR spectra of “invisible” excited protein states using HSQC and HMQC experiments. J. Am. Chem. Soc. 124, 12352–12360 (2002). [DOI] [PubMed] [Google Scholar]
  • 62.Gopalan A. B., Vallurupalli P., Measuring the signs of the methyl 1H chemical shift differences between major and ‘invisible’ minor protein conformational states using methyl 1H multi-quantum spectroscopy. J. Biomol. NMR 70, 187–202 (2018). [DOI] [PubMed] [Google Scholar]
  • 63.Cavanagh J., Fairbrother W. J., Palmer A. G. III, Rance M., Skelton N. J., Protein NMR Spectroscopy, Principles and Practice (Academic Press, ed. 2, 2007). [Google Scholar]
  • 64.Koss H., Rance M., Palmer A. G. III, General expressions for R relaxation for N-site chemical exchange and the special case of linear chains. J. Magn. Reson. 274, 36–45 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Karunanithy G., Reinstein J., Hansen D. F., Multiquantum chemical exchange saturation transfer NMR to quantify symmetrical exchange: Application to rotational dynamics of the guanidinium group in arginine side chains. J. Phys. Chem. Lett. 11, 5649–5654 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Vallurupalli P., Kay L. E., Probing slow chemical exchange at carbonyl sites in proteins by chemical exchange saturation transfer NMR spectroscopy. Angew. Chem. Int. Ed. Engl. 52, 4156–4159 (2013). [DOI] [PubMed] [Google Scholar]
  • 67.Nielsen J. T., Mulder F. A. A., POTENCI: Prediction of temperature, neighbor and pH-corrected chemical shifts for intrinsically disordered proteins. J. Biomol. NMR 70, 141–165 (2018). [DOI] [PubMed] [Google Scholar]
  • 68.Farrow N. A., Zhang O., Forman-Kay J. D., Kay L. E., Characterization of the backbone dynamics of folded and denatured states of an SH3 domain. Biochemistry 36, 2390–2402 (1997). [DOI] [PubMed] [Google Scholar]
  • 69.Luo Y., Baldwin R. L., Trifluoroethanol stabilizes the pH 4 folding intermediate of sperm whale apomyoglobin. J. Mol. Biol. 279, 49–57 (1998). [DOI] [PubMed] [Google Scholar]
  • 70.Bouvignies G., Vallurupalli P., Kay L. E., Visualizing side chains of invisible protein conformers by solution NMR. J. Mol. Biol. 426, 763–774 (2014). [DOI] [PubMed] [Google Scholar]
  • 71.Yuwen T., Sekhar A., Kay L. E., Separating dipolar and chemical exchange magnetization transfer processes in 1H-CEST. Angew. Chem. Int. Ed. Engl. 56, 6122–6125 (2017). [DOI] [PubMed] [Google Scholar]
  • 72.Long D., Sekhar A., Kay L. E., Triple resonance-based 13C(α) and 13C(β) CEST experiments for studies of ms timescale dynamics in proteins. J. Biomol. NMR 60, 203–208 (2014). [DOI] [PubMed] [Google Scholar]
  • 73.Tiwari V. P., Vallurupalli P., A CEST NMR experiment to obtain glycine 1Hα chemical shifts in ‘invisible’ minor states of proteins. J. Biomol. NMR 74, 443–455 (2020). [DOI] [PubMed] [Google Scholar]
  • 74.Noé F., Fischer S., Transition networks for modeling the kinetics of conformational change in macromolecules. Curr. Opin. Struct. Biol. 18, 154–162 (2008). [DOI] [PubMed] [Google Scholar]
  • 75.Cavalli A., Salvatella X., Dobson C. M., Vendruscolo M., Protein structure determination from NMR chemical shifts. Proc. Natl. Acad. Sci. U.S.A. 104, 9615–9620 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Shen Y., et al. , Consistent blind protein structure generation from NMR chemical shift data. Proc. Natl. Acad. Sci. U.S.A. 105, 4685–4690 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Wishart D. S., et al. , CS23D: A web server for rapid protein structure generation using NMR chemical shifts and sequence data. Nucleic Acids Res. 36, W496–W502 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Myers J. K., Pace C. N., Scholtz J. M., Denaturant m values and heat capacity changes: Relation to changes in accessible surface areas of protein unfolding. Protein Sci. 4, 2138–2148 (1995). Correction in: Protein Sci. 5, 981 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Courtenay E. S., Capp M. W., Saecker R. M., Record M. T. Jr., Thermodynamic analysis of interactions between denaturants and protein surface exposed on unfolding: Interpretation of urea and guanidinium chloride m-values and their correlation with changes in accessible surface area (ASA) using preferential interaction coefficients and the local-bulk domain model. Proteins 4 (suppl. 4), 72–85 (2000). [DOI] [PubMed] [Google Scholar]
  • 80.Rollins G. C., Dill K. A., General mechanism of two-state protein folding kinetics. J. Am. Chem. Soc. 136, 11420–11427 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Maity H., Maity M., Krishna M. M. G., Mayne L., Englander S. W., Protein folding: The stepwise assembly of foldon units. Proc. Natl. Acad. Sci. U.S.A. 102, 4741–4746 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Englander S. W., Mayne L., Krishna M. M. G., Protein folding and misfolding: Mechanism and principles. Q. Rev. Biophys. 40, 287–326 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Beckwith M. A., Erazo-Colon T., Johnson B. A., RING NMR dynamics: Software for analysis of multiple NMR relaxation experiments. J. Biomol. NMR 75, 9–23 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Chao F.-A., Zhang Y., Byrd R. A., Theoretical classification of exchange geometries from the perspective of NMR relaxation dispersion. J. Magn. Reson. 328, 107003 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Karunanithy G., Hansen D. F., FID-Net: A versatile deep neural network architecture for NMR spectral reconstruction and virtual decoupling. J. Biomol. NMR 75, 179–191 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All relevant data are included in the article and/or SI Appendix.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES