Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2012 May 30;109(44):17777-17782. doi: 10.1073/pnas.1201799109

Transiently populated intermediate functions as a branching point of the FF domain folding pathway

Dmitry M Korzhnev a,1, Tomasz L Religa b,2, Lewis E Kay b,c,1
PMCID: PMC3497754  PMID: 22647611

Abstract

Studies of protein folding and the intermediates that are formed along the folding pathway provide valuable insights into the process by which an unfolded ensemble forms a functional native conformation. However, because intermediates on folding pathways can serve as initiation points of aggregation (implicated in a number of diseases), their characterization assumes an even greater importance. Establishing the role of such intermediates in folding, misfolding, and aggregation remains a major challenge due to their often low populations and short lifetimes. We recently used NMR relaxation dispersion methods and computational techniques to determine an atomic resolution structure of the folding intermediate of a small protein module—the FF domain—with an equilibrium population of 2–3% and a millisecond lifetime, 25 °C. Based on this structure a variant FF domain has been designed in which the native state is selectively destabilized by removing the carboxyl-terminal helix in the native structure to produce a highly populated structural mimic of the intermediate state. Here, we show via solution NMR studies of the designed mimic that the mimic forms distinct conformers corresponding to monomeric and dimeric (Kd = 0.2 mM) forms of the protein. The conformers exchange on the seconds timescale with a monomer association rate of 1.1·104 M-1 s-1 and with a region responsible for dimerization localized to the amino-terminal residues of the FF domain. This study establishes the FF domain intermediate as a central player in both folding and misfolding pathways and illustrates how incomplete folding can lead to the formation of higher-order structures.

Keywords: excited protein states, protein folding intermediate


It is increasingly clear that protein aggregation can be initiated from locally unfolded and/or misfolded conformations that become accessible due to thermal fluctuations from the native state (14). Such conformers, therefore, may play critical roles in the formation of molecular assemblies implicated in aggregation-related diseases (13). However, these species are often sparsely and transiently populated (4, 5), challenging their detailed characterization using conventional tools of structural biology. Their role in aggregate formation and in protein misfolding in general is, therefore, not well understood.

With the development of nuclear magnetic resonance (NMR) relaxation dispersion (RD) methods (5, 6) it has become possible to study transiently formed conformers populated at a level of 0.5% or higher (excited states), provided that they interconvert on the millisecond timescale with a populated conformation that can be observed in NMR spectra (ground state). The thermodynamics and kinetics of the exchange reaction can be accessed by RD methodology. In addition, backbone 1H, 15N, 13C chemical shifts (57), residual dipolar couplings (RDCs) (8), and residual chemical shift anisotropies (RCSAs) (9) of the excited state can be measured, providing a rich source of structural information. By combining the RD measurements (5, 6) with computational approaches for protein structure determination from limited NMR data (1012), it has become possible to generate atomic resolution models of excited protein states (1315).

RD NMR studies of protein folding have been particularly fruitful (5). One example is the wild-type FF domain from human HYPA/FBP11, referred to as FF1–71 in what follows (13, 14, 1618). This domain was shown by Fersht and coworkers to fold via an on-pathway intermediate that forms rapidly on the microsecond timescale and rearranges into the native state within a few milliseconds (16, 17). Because the folding intermediate has a fractional equilibrium population of ≈3% and a millisecond lifetime (25 °C), it cannot be directly observed in NMR spectra. Yet, its structure was recently elucidated using RD NMR (13, 14). The intermediate is remarkably well structured and can be thought of as representing an alternative, less favorable fold of the FF domain (13). Although native-like topology is preserved in this conformation, there are significant nonnative interactions that prevent formation of the C-terminal α-helix of the native four-helix bundle structure. These contacts must be broken prior to forming the native conformer, explaining why folding from the intermediate state is rate-limiting.

RD NMR-derived structures of folding intermediates open the way for designing rational strategies for their isolation (13). In this context, structural differences between intermediate and native states can be used to identify amino acids and/or regions of structure that contribute differently to the stabilities of these states. Through mutations and/or truncations the native state can be selectively destabilized so that the intermediate becomes the predominant conformation in solution. Such an approach led to the isolation of the FF domain folding intermediate (13). Here, truncation of 11 C-terminal residues that are partially disordered in the intermediate state, but form a stabilizing α-helix in the native protein, resulted in a stable variant, FF1–60, with NMR spectra and folding kinetic profiles closely resembling those of the folding intermediate of FF1–71. A similar approach has been employed in the isolation of folding intermediates of apo-cytochrome b562 (19), engrailed homeo domain (20), and the Im7 protein (21), albeit using different strategies for mutant design.

Once isolated, folding intermediates can be further investigated by conventional biophysical techniques. In one such application they can be used to cross-validate RD NMR-derived structural models of the intermediate state, a topic of particular interest at present because RD-derived structures are only beginning to emerge. Notably, high-resolution NMR studies of a truncated variant, FF11–60, that mimics the folding intermediate of the full-length domain have established the accuracy of the RD-based model (22). However, beyond cross-validation of RD NMR-derived structures, “trapped” folding intermediates provide the means to explore transitions to other states on the folding energy landscape that are “hidden” in NMR studies of natively folded proteins by the presence of the ground state. We provide one such example using FF1–60, showing that the FF domain folding intermediate interconverts on a seconds timescale between monomeric and dimeric forms. As such, the intermediate serves as a branch point in the folding pathway, with a conformation that facilitates both folding to the native state and formation of nonnative dimeric structures.

Results and Discussion

The FF Domain Folding Pathway.

The N-terminal FF domain of human HYPA/FBP11 is a 71-residue, four-helix module (23) whose folding mechanism has been extensively studied by equilibrium and kinetic unfolding/refolding experiments, protein engineering methods, and NMR spectroscopy (13, 14, 1618). In their original studies, Fersht and coworkers established that FF1–71 folds to the native state, N, via an on-pathway intermediate, I, in two kinetic phases (16, 17), as shown in Fig. 1A. We have subsequently examined the folding behavior of the wild-type FF domain and a number of mutational variants by RD NMR spectroscopy (13, 14, 18), confirming the multistate nature of FF domain folding. A high-resolution structure of the I state (from restraints that include RD NMR-derived 15N, Inline graphic, Inline graphic, Inline graphic, and Inline graphic chemical shifts and Inline graphic RDCs) was obtained using CS-ROSETTA (13) (Fig. 1A). It is noteworthy that all of the RD dispersion data, recorded in the temperature range 20–35 °C, could be well fit to a two-state model of exchange, NI, because the population of the unfolded state, U, under the conditions of our experiments is approximately an order of magnitude smaller than I and the IU interconversion is on the microsecond timescale, which could not be detected in our RD experiments. Any additional conformational states, if present, were not observed in RD studies of the wild-type domain.

Fig. 1.

Fig. 1.

(A) The folding pathway of the FF domain consists of a fast microsecond transition from U to I and a slower millisecond conversion from I to N. Structural ensembles of I [Protein Data Bank (PDB) ID code 2KZG] and N (PDB ID code 1UZC) were calculated as described elsewhere (13, 23). (B) Selected region of the Inline graphic HSQC spectrum of the truncated variant, FF1–60, a mimic of the folding intermediate of the full-length domain (11.7 T, 25 °C). Two sets of signals are observed, derived from two separate conformations of the protein, denoted as M and D. (C) Backbone Inline graphic, 15N, Inline graphic, Inline graphic, and Inline graphic chemical shift differences between I and N states of the full-length FF domain (FF1–71) obtained from RD NMR data (13) (Inline graphic), plotted vs. chemical differences between FF1–60 (M form) and the native state FF1–71 (Inline graphic). ϖstd,i is a nucleus- and residue-specific normalization value that corresponds to the range of shift values (1 SD) that are observed in a database of protein chemical shifts (www.bmrb.wisc.edu) for the nucleus/residue in question. A similar correlation for the D state of FF1–60 is shown (Inset).

In an effort to probe the energy landscape still further we have designed a truncated variant of the wild-type protein, FF1–60, that closely mimics the folding intermediate of the full-length domain (13). This variant lacks the C-terminal region containing native α-helix H4 that is critical for the structural integrity of the N state. Thus, FF1–60 cannot adopt the native FF domain structure. Notably, FF1–60 folds in a single kinetic phase with a rate of approximately 105/s, corresponding to that of the U to I transition in the full-length domain, FF1–71 (16). Spectra of the truncated variant are of reasonable quality and have been assigned using standard triple-resonance NMR experiments (13). Of interest, two sets of signals were observed for many residues corresponding to a pair of conformations in solution (Fig. 1B). Backbone resonance assignments for one of the two states, “state M,” are very similar to those of the folding intermediate of FF1–71 (Fig. 1C), establishing that the two are structurally very similar. By contrast, the chemical shifts of the second form of FF1–60, “state D,” are distinct from those of the I and N states of FF1–71 (Fig. 1C, Inset). Moreover, backbone resonance assignments for state D extend only to Met42, with resonances from the C-terminal portion of the domain, Ile43-Gln60, missing in NMR spectra, presumably due to microsecond–millisecond timescale exchange with additional conformational states. The chemical shifts for the flexible N-terminal residues extending up to Lys7 for both M and D states are indistinguishable. The presence of an alternative form of FF1–60 in solution, distinct from the I state, suggests that the folding intermediate can access nonnative conformational states off the folding pathway of the FF domain (Fig. 1A).

Mimic of the Folding Intermediate Undergoes Slow Dimerization.

We have carried out further NMR studies of FF1–60 to understand the origin of the second set of correlations that have been observed in spectra in the hope that this would provide an avenue for further exploration of the FF domain energy landscape. The fact that separate sets of signals are observed for the M and D states of FF1–60 indicates that the interconversion of these forms is slow on the NMR chemical shift timescale. Further quantification can be obtained by 15N longitudinal exchange measurements (24, 25), provided that the rate of interconversion is on the order of (or somewhat faster than) the spin-lattice relaxation rates of the backbone 15N probes that are exploited in the experiment (i.e., 1–2 per s). By recording the 15N chemical shift prior to a mixing period of duration T, the resulting correlation spectrum will contain four peaks per amide group: two autopeaks, MM and DD, at chemical shifts (ϖNM, ϖHM) and (ϖND, ϖHD), originating from magnetization that is not transferred between states during T; and two exchange cross-peaks, MD and DM, at chemical shifts (ϖNM, ϖHD) and (ϖND, ϖHM), derived from magnetization transferred between states during the mixing period (24, 25). Fig. 2A shows a region of the Inline graphic exchange spectrum recorded on a 0.34 mM FF1–60 sample (T = 0.34 s, 25 °C) with exchange cross-peaks connecting autopeaks of amide groups in states M and D, indicating that the two forms of FF1–60 interconvert on the seconds timescale.

Fig. 2.

Fig. 2.

(A) Selected region of a Inline graphic magnetization exchange spectrum (24, 25) of 0.34 mM FF1–60 recorded with a mixing time T = 0.34 s (11.7 T, 25 °C), showing two sets of autopeaks (MM and DD) connected by exchange cross-peaks (MD and DM) for Ala34. (B) Mixing time, T, dependencies of autopeak (circles) and exchange cross-peak (boxes) volumes for Ala34 from magnetization exchange experiments. The solid lines are generated from a least-squares fit of the exchange data for Ala34 to Eq. S2 in SI Materials and Methods, resulting in kMD = 3.34 ± 0.17 s-1 and kDM = 1.97 ± 0.09 s-1; averaging over five residues with a full set of four non-overlapped auto- and cross-peaks gives kMD = 3.05 ± 0.95 s-1 and kDM = 2.16 ± 0.76 s-1. (C) Apparent backbone 15N R1 and R2 relaxation rates as a function of residue for correlations derived from M (red circles) and D (green circles) states of FF1–60 obtained by single-exponential fits of peak volumes in a series of two-dimensional Inline graphic correlation spectra measured in conventional 15N R1 and R1ρ experiments (28, 29). As discussed in the text, the apparent 15N R1 and R2 rates are affected by slow exchange between M and D states; intrinsic relaxation rates (solid lines) are calculated using the procedure described in SI Materials and Methods. Secondary structural elements are indicated at the top of the panel. Helix H2 of state D may extend further than Met42 (denoted by dashed line after residue 42); however, correlations for Ile43 and beyond were missing in spectra of the dimeric form. (D) Cross-peak volumes for Ala34 measured from Inline graphic HSQC spectra of FF1–60 recorded as a function of protein concentration. The volumes are normalized by total protein concentration.

Fig. 2B shows the mixing time dependencies of auto- and cross-peak volumes for Ala34, with corresponding traces for Ser35 illustrated in Fig. S1. These were fit to a model of exchange,

graphic file with name pnas.1201799109eq39.jpg

where fM and fD are the fractional populations of states M and D, respectively, as described in detail by Tollinger et al. (25) and summarized in SI Materials and Methods. Although well-resolved autopeaks for over twenty residues of the M and D states are observed in Inline graphic correlation spectra of FF1–60, significantly fewer amide groups produce a set of four non-overlapped auto- and cross-peaks that are required to obtain robust estimates of kMD, kDM. These were obtained from fits of five complete sets of time profiles, with kMD = 3.05 ± 0.95 s-1 and kDM = 2.16 ± 0.76 s-1. In addition to the rates, per residue 15N R1 values were also obtained (see below), with rates for the D state systematically smaller than for the M conformer that is a mimic of the folding intermediate.

Values of 15N R1 and R2 relaxation rates were measured (11.7 T, 25 °C) as described in Materials and Methods and plotted in Fig. 2C (circles). The primary source of motion leading to 15N relaxation is rotational diffusion of the molecule that can be described by a single rotational correlation time, τR, in the limit of isotropic rotation (τR is proportional to the molecular volume) (26). Relaxation rates for 15N are also sensitive to very rapid picosecond–nanosecond intramolecular motions of the HN-N vector, while conformational exchange on the microsecond–millisecond timescale can lead to enhancement of 15N R2 values (26). Systematic differences in 15N R1 and R2 rates of M (Fig. 2C, red circles) and D (Fig. 2C, green circles) establish that molecules in these states tumble with different rates. The slower R2 and faster R1 rates for amide 15N spins of the M state are in keeping with expectations for a smaller protein that tumbles with a faster overall rate. Note that several regions of the M state, including residues Thr13 (N-cap proceeding helix H1), Lys26-Arg29 (end of H1, H1-H2 loop), and Ile44,Asn45 (end of H2, H2-H3 loop), have enhanced 15N R2 values, indicative of microsecond–millisecond conformational dynamics.

Values of τR can be calculated from 15N R2/R1 ratios provided that (i) internal motions of the backbone HN-N vectors are in the extreme narrowing limit (typically less than tens of picoseconds) and/or significantly restricted, and (ii) there is no contribution to 15N R2 from conformational exchange (27). In principle, measured R1 and R2 decay curves are biexponential because they are contaminated by the interconversion process, with contributions from intrinsic relaxation rates in both of the exchanging states as well as from kMD and kDM (SI Materials and Methods). Intrinsic 15N R1 rates can be obtained from fits of the time dependencies of auto- and cross-peaks in the longitudinal exchange experiment (24, 25) (Fig. 2B); however, as described above, complete time profiles are available for only a small fraction of the residues. We have therefore extracted the intrinsic 15N R1 and R2 rates (Fig. 2C, solid lines) from the apparent rates obtained from monoexponential fits of decay curves in conventional 15N R1 and R1ρ experiments (28, 29) using the correction procedure described in SI Materials and Methods. After applying the correction, 15N R2/R1 ratios for residues 14 to 42 that belong to well-ordered regions of M and D were used to calculate τR for each of the two states, excluding residues with 15N R2/R1 ratios outside ± 1 standard deviation from the mean. Values of 5.2 ± 1.2 ns and 10.8 ± 1.1 ns were obtained for states M and D, respectively, consistent with a slow interconversion between monomeric and dimeric forms of the FF1–60 domain.

To obtain additional confirmation that the exchange equilibrium involves a monomer-dimer interconversion, we have carried out a dilution series whereby peak volumes in 1H-15N HSQC spectra from the M and D states were measured as a function of protein concentration. As expected, peak volumes normalized to protein concentration, decrease for the monomer, and increase for the dimer as a function of concentration, as shown in Fig. 2D for Ala34 and Fig. S1 for Ser35. With the “players” involved in exchange now established, the pseudo-first-order rate constants kMD and kDM measured from fits of the magnetization exchange profiles described above can be recast in terms of the rates, kon and koff, corresponding to the dimerization, Inline graphic. Values of kon = 1.10 ± 0.69·104 M-1 s-1; koff = 2.16 ± 1.52 s-1 (see Materials and Methods), and a dissociation constant Kd = 0.20 ± 0.05 mM are obtained. Note that the measured kon rate is several orders of magnitude slower than that expected for diffusion-controlled protein association (30), approximately 106 M-1 s-1, suggesting a rate-limiting step for dimerization that involves at least some rearrangement of structure.

Structural Differences Between the Monomer and Dimer.

Backbone NMR chemical shifts are very sensitive probes of local protein conformation (31). A comparison of such shifts between M and D states can therefore inform on the structural changes accompanying dimerization. Fig. 3A plots the cumulative change in the backbone Inline graphic, 15N, Inline graphic, Inline graphic, and Inline graphic chemical shifts (32), Inline graphic, as a function of residue, where Δϖi is chemical shift difference between the monomeric and dimeric forms of FF1–60, ϖstd,i is a nucleus- and residue-specific normalization constant corresponding to one standard deviation of the chemical shift range listed in the BMRB database, N is the number of backbone chemical shifts available for a given residue, and index i runs from 1 to N. Close to zero Inline graphic values are expected in regions of the protein with little conformational change between the states. In contrast, a stretch of Inline graphic values on the order of 1 or higher points to a structural rearrangement. It is clear from Fig. 3A that the largest chemical shift differences upon dimer formation are localized to a region that includes residues Glu15-Lys18 at the beginning of α-helix H1 of the monomeric form. Elevated Inline graphic values are also observed for residues Leu25-Arg29 at the end of helix H1, with the H1-H2 loop and α-helix H2 displaying smaller chemical shift changes. We are unable to assess the extent of chemical shift changes at the end of α-helix H2, the H2-H3 loop, and for α-helix H3 of FF1–60 because resonances from this region of the dimer are missing in NMR spectra.

Fig. 3.

Fig. 3.

(A) Backbone chemical shift differences between monomeric (M) and dimeric (D) forms of FF1–60, Inline graphic, as described in the text. Values of Inline graphic are shown for residues Thr8-Met42; the chemical shifts of the first seven residues are indistinguishable in the two forms, while those from the C terminus (residues Ile43-Gln60) are missing in NMR spectra of the dimer. (B) TALOS-plus predicted (33) α-helix probability, P (helix), plotted vs. residue for states M (red) and D (green, residues Thr8-Met42 only). (C) RCI-predicted (34, 35) order parameters, S2, for the backbone amide groups of M (red) and D (green).

Fig. 3B compares the secondary structures of the monomeric (red) and dimeric (green) (residues 8–42 only) forms of FF1–60 predicted from measured backbone chemical shifts using the TALOS+ program (33). As expected, the monomer comprises three α-helices encompassing residues Lys14-Glu27 (H1), Trp36-Ile44 (H2), and Arg48-Lys54 (H3), closely matching the corresponding helix positions in the folding intermediate of the full-length domain. Similar secondary structure is predicted for Thr8-Met42 of the dimer, including helices H1 (Glu16-Glu27) and H2 (Trp36-Met42). It is noteworthy that α-helix H1 is shorter by two residues in the dimer, starting at Glu16 as opposed to Lys14. This is in line with the observation that Glu15, Ala17, and Lys18 have the largest backbone chemical shift changes upon dimer formation (Fig. 3A).

The backbone chemical shifts were also used to predict conformational flexibility of the two forms of FF1–60 using the random coil index (RCI) approach (34, 35). RCI-based order parameters, S2, for the backbone amide groups of M (red) and D (green) are plotted vs. residue in Fig. 3C. Note that extreme S2 values of 1 and 0 correspond to fully restricted and unconstrained internal motion of the backbone HN-N group, respectively. Values of S2 for the monomeric form are nearly identical to those of I, obtained in a previous study (13). Interestingly, the dimeric form of FF1–60 is even less flexible than the monomer. Although helix H1 in the dimer is shorter by two residues (at the N terminus), the region from Thr8-Glu16 proceeding H1 is more ordered in the dimer than in the monomer. Similarly, elevated S2 values are observed at the end of helix H1 of the dimer (residues Leu25-Arg29), pointing to restricted mobility in this region as well. The RCI method is sensitive to a timescale ranging from picosecond–nanosecond to approximately hundreds of microseconds (35). Restricted mobility in this time window does not exclude the possibility of slower conformational dynamics on a microsecond–millisecond timescale; although reasonably high S2 values are noted for Lys26-Arg29 and Ile44,Asn45 (> 0.7), these residues do undergo slow conformational exchange as established by elevated 15N R2 rates (Fig. 2C).

Chemical shift changes between M and D states (Fig. 3A) and the S2 vs. residue profile (Fig. 3C) provide a strong indication as to which regions of FF1–60 might participate in dimer formation. The large Inline graphic values and higher-order parameters for residues N-terminal of helix H1 in the dimer relative to those in the monomer potentially point to this region as a site for dimerization that increases in order upon dimer formation. To test this hypothesis we have analyzed FF domain variants prepared with truncated C termini (residues Ala61-Lys71) and with varying truncations at the N terminus: FF1–60, FF7–60, FF9–60, and FF11–60 (the first residue in each of the constructs is mutated to Gly). For the first three variants (FF1–60, FF7–60, and FF9–60), Inline graphic HSQC spectra are very similar, with two sets of signals corresponding to states M and D of the protein, while only a single set of signals is observed in NMR spectra of FF11–60 that belong to the monomeric form. We have previously determined the structure of FF11–60 using a standard NOE-based approach, establishing that it is a very good structural mimic of the folding intermediate of the full-length domain (22). The finding that truncation of the N-terminal residues of the FF domain, and specifically elimination of Thr10 and Trp11, leads to a shift in the monomer-dimer equilibrium toward the monomer form suggests that these residues are responsible for stabilization of the dimer. It is likely that residues N-terminal to helix H1 also contribute to stability of the monomer because FF11–60 is significantly more flexible than FF1–60 based on lower RCI-derived S2 values (22).

Concluding Remarks.

The development of relaxation dispersion NMR spectroscopy (5, 6) opens the possibility for detailed structural studies of sparsely populated, transiently formed intermediates that have been recalcitrant to study using standard biophysical approaches. Key to the success of this method is that the invisible, excited state is probed via the visible, ground state that gives rise to high-quality spectra. The methodology is limited, however, to the study of states that interconvert on the millisecond timescale and to exchange reactions involving one or two excited conformers. Thus, only a small portion of the energy landscape can be explored. One approach to increasing the number of states accessible to study would be to perturb the ground state so that it is no longer populated. In so doing, a new ground state is formed (formerly a higher-energy state) that can then be used to probe further exchange events involving additional conformers using a variety of different NMR experiments that are sensitive to dynamics spanning a window of over 12 orders of magnitude (36). This is the strategy taken here. Atomic resolution structures of both the native state and a folding intermediate of FF1–71, the latter determined by Carr–Purcell–Meiboom–Gill relaxation dispersion, have guided the design of a new ground state, FF1–60, that corresponds to the intermediate structure. This new variant is used to further probe regions of the folding/unfolding landscape that would otherwise be hidden by the native state.

Here, we have demonstrated that FF1–60 undergoes slow dimerization on the seconds timescale with a Kd of 0.2 mM, and with kinetics that are significantly slower than diffusion-limited. Further truncation mutations of FF1–60 involving the amino terminus establish that dimerization proceeds through interactions involving the first 11 residues of the domain, in particular Thr10 and Trp11. Both monomer and dimer are relatively rigid on the picosecond–nanosecond to microsecond timescale and are significantly more dynamic on the slower microsecond–millisecond timescale, leading to exchange line broadening in NMR spectra. Notably, several regions of the monomer have enhanced 15N R2 rates, while resonances from the C-terminal region of the dimer are missing in NMR spectra, presumably due to microsecond–millisecond exchange with additional conformational states.

The observed monomer-dimer equilibrium and microsecond–millisecond dynamics of FF1–60 are consistent with the FF domain folding intermediate serving as a branch point leading to either folding to the native conformation or formation of further misfolded states. Although we have no direct evidence that the dimeric form of FF11–60 further aggregates to form higher oligomeric species, the fact that the C-terminal third of the dimer cannot be observed in NMR spectra strongly suggests that this region of the protein samples additional conformational states. In this regard it is of interest to note that in many of the systems studied to date for which models of aggregation/fibrillation have been obtained, the first step is dimerization, followed by rearrangements to accommodate higher-order oligomerization (3). One such example involves β2-microglobulin that forms fibrils in dialysis-related amyloidosis (37). The mechanism for the assembly of such fibrils involves the interconversion of the native conformation of the protein to one that is native-like, N, by Cu2+ that is present at high concentrations in hemodialysate. N then forms dimers that subsequently assemble into fibrils. Partially unfolded or misfolded conformers that form via thermal fluctuations from the ground state are thought to be implicated in a number of aggregation-related diseases (13). The approach presented here provides a way of characterizing such states. The finding that a sparsely populated intermediate of the FF domain folding pathway can initiate dimerization and the characterization of the resultant nonnative dimeric state is an important step toward unraveling the role of folding intermediates in both protein folding and misfolding pathways.

Materials and Methods

Protein Samples.

Uniformly 15N- and 15N/13C-labeled samples of truncated variants of the first FF domain from HYPA/FBP11 were produced as described previously (13, 17). The plasmid-encoding FF1–60 (residues 1–60) was obtained using the QuikChange protocol (Stratagene) by introducing a stop codon after position 60 in the gene encoding the full-length domain cloned into a modified pRSET vector (13). The plasmids for FF domain variants with truncated N termini, FF7–60, FF9–60, and FF11–60, were produced from a plasmid encoding FF1–60 as described elsewhere (22). All NMR experiments were performed on samples with protein concentrations of approximately 0.3 mM, 50 mM sodium acetate, 100 mM NaCl, pH = 4.9, 25 °C.

Backbone Resonance Assignment.

The backbone 15N, Inline graphic, Inline graphic, Inline graphic, and Inline graphic resonance assignments for the M and D states of 15N/13C-labeled FF1–60 were obtained using a standard set of triple-resonance NMR experiments (38) as described previously (13). All data were recorded on a Varian Inova spectrometer, 11.7 T, 25 °C.

Magnetization-Exchange Measurements.

Nitrogen-15 longitudinal exchange experiments (24, 25) were recorded on a 0.34 mM FF1–60 sample, 14.0 T, 25 °C. A series of 10 two-dimensional Inline graphic correlation spectra was measured with mixing times T for relaxation and exchange of 15N longitudinal magnetization ranging from 0.014 to 0.84 s. Volumes of auto- and exchange cross-peaks were quantified for five residues (Glu15, Ala17, Phe21, Ala34, and Ser35) for which well-resolved correlations (corresponding to MM, DD, MD, and DM peaks; see text) were obtained. The extracted first-order rate constants, kMD and kDM, were used to calculate kon and koff according to kon = [kMD(kMD + kDM)]/[2kDMP0], koff = kDM, where P0 denotes the total protein concentration. The dimer dissociation constant Kd was calculated as koff/kon. Analysis of data is as described in SI Materials and Methods.

Nitrogen-15 Relaxation Measurements.

Measurements for 15N R1 and R1ρ were performed on a 0.32 mM FF1–60 sample, 11.7 T, 25 °C, as described previously (28, 29). The apparent rotating-frame R1ρ and transverse R2 relaxation rates were obtained by single exponential fits of peak volumes in a series of two-dimensional Inline graphic correlation spectra measured as a function of relaxation delay T. Transverse relaxation rates R2 were calculated from R1 and R1ρ using the equation R1ρ = R1 cos2θ + R2 sin2θ, where θ = arctan(ωSLω), Δω is the resonance offset from the spin-lock carrier, and ωSL is the spin-lock field strength in the 15N R1ρ experiments (ωSL = 1.82 KHz). Intrinsic relaxation rates were calculated from the apparent R1 and R2 values as described in SI Materials and Methods.

Dilution Series.

The Inline graphic HSQC spectra of FF1–60 were recorded as a function of protein concentration ranging from 0.68 to 0.034 mM. Volumes of non-overlapped peaks corresponding to M and D forms of FF1–60 were quantified and normalized to the total protein concentration.

Supplementary Material

Supporting Information

ACKNOWLEDGMENTS.

This work was supported by National Institutes of Health Grant P30GM092369 and University of Connecticut Health Center startup funds (to D.K.) and a Canadian Institutes of Health Research (CIHR) Grant (L.E.K.). T.L.R. was funded through a postdoctoral fellowship from CIHR. L.E.K. holds a Canada Research Chair in Biochemistry.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1201799109/-/DCSupplemental.

References

  • 1.Dobson CM. Protein folding and misfolding. Nature. 2003;426:884–890. doi: 10.1038/nature02261. [DOI] [PubMed] [Google Scholar]
  • 2.Chiti F, Dobson CM. Protein misfolding, functional amyloid, and human disease. Annu Rev Biochem. 2006;75:333–366. doi: 10.1146/annurev.biochem.75.101304.123901. [DOI] [PubMed] [Google Scholar]
  • 3.Chiti F, Dobson CM. Amyloid formation by globular proteins under native conditions. Nat Chem Biol. 2009;5:15–22. doi: 10.1038/nchembio.131. [DOI] [PubMed] [Google Scholar]
  • 4.Brockwell DJ, Radford SE. Intermediates: Ubiquitous species on folding energy landscapes? Curr Opin Struct Biol. 2007;17:30–37. doi: 10.1016/j.sbi.2007.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Korzhnev DM, Kay LE. Probing invisible, low-populated states of protein molecules by relaxation dispersion NMR spectroscopy: An application to protein folding. Acc Chem Res. 2008;41:442–451. doi: 10.1021/ar700189y. [DOI] [PubMed] [Google Scholar]
  • 6.Palmer AG, Kroenke CD, Loria JP. Nuclear magnetic resonance methods for quantifying microsecond-to-millisecond motions in biological macromolecules. Methods Enzymol. 2001;339:204–238. doi: 10.1016/s0076-6879(01)39315-1. [DOI] [PubMed] [Google Scholar]
  • 7.Hansen DF, Vallurupalli P, Lundström P, Neudecker P, Kay LE. Probing chemical shifts of invisible states of proteins with relaxation dispersion NMR spectroscopy: How well can we do? J Am Chem Soc. 2008;130:2667–2675. doi: 10.1021/ja078337p. [DOI] [PubMed] [Google Scholar]
  • 8.Vallurupalli P, Hansen DF, Stollar E, Meirovitch E, Kay LE. Measurement of bond vector orientations in invisible excited states of proteins. Proc Natl Acad Sci USA. 2007;104:18473–18477. doi: 10.1073/pnas.0708296104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Vallurupalli P, Hansen DF, Kay LE. Probing structure in invisible protein states with anisotropic NMR chemical shifts. J Am Chem Soc. 2008;130:2734–2735. doi: 10.1021/ja710817g. [DOI] [PubMed] [Google Scholar]
  • 10.Cavalli A, Salvatella X, Dobson CM, Vendruscolo M. Protein structure determination from NMR chemical shifts. Proc Natl Acad Sci USA. 2007;104:9615–9620. doi: 10.1073/pnas.0610313104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Shen Y, et al. Consistent blind protein structure generation from NMR chemical shift data. Proc Natl Acad Sci USA. 2008;105:4685–4690. doi: 10.1073/pnas.0800256105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wishart DS, et al. CS23D: A web server for rapid protein structure generation using NMR chemical shifts and sequence data. Nucleic Acids Res. 2008;36:W496–W502. doi: 10.1093/nar/gkn305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Korzhnev DM, Religa TL, Banachewicz W, Fersht AR, Kay LE. A transient and low-populated protein folding intermediate at atomic resolution. Science. 2010;329:1312–1316. doi: 10.1126/science.1191723. [DOI] [PubMed] [Google Scholar]
  • 14.Korzhnev DM, et al. Nonnative interactions in the FF domain folding pathway from an atomic resolution structure of a sparsely populated intermediate: An NMR relaxation dispersion study. J Am Chem Soc. 2011;133:10974–10982. doi: 10.1021/ja203686t. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bouvignies G, et al. Solution structure of a minor and transiently formed state of a T4 lysozyme mutant. Nature. 2011;477:111–114. doi: 10.1038/nature10349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jemth P, et al. Demonstration of a low-energy on-pathway intermediate in a fast-folding protein by kinetics, protein engineering, and simulation. Proc Natl Acad Sci USA. 2004;101:6450–6455. doi: 10.1073/pnas.0401732101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Jemth P, et al. The structure of the major transition state for folding of an FF domain from experiment and simulation. J Mol Biol. 2005;350:363–378. doi: 10.1016/j.jmb.2005.04.067. [DOI] [PubMed] [Google Scholar]
  • 18.Korzhnev DM, Religa TL, Lundström P, Fersht AR, Kay LE. The folding pathway of an FF domain: Characterization of an on-pathway intermediate state under folding conditions by 15N, Inline graphic and 13C-methyl relaxation dispersion and 1H/2H-exchange NMR spectroscopy. J Mol Biol. 2007;37:497–512. doi: 10.1016/j.jmb.2007.06.012. [DOI] [PubMed] [Google Scholar]
  • 19.Feng HQ, Zhou Z, Bai YW. A protein folding pathway with multiple folding intermediates at atomic resolution. Proc Natl Acad Sci USA. 2005;102:5026–5031. doi: 10.1073/pnas.0501372102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Religa TL, Markson JS, Mayor U, Freund SMV, Fersht AR. Solution structure of a protein denatured state and folding intermediate. Nature. 2005;437:1053–1056. doi: 10.1038/nature04054. [DOI] [PubMed] [Google Scholar]
  • 21.Whittaker SBM, Spence GR, Grossmann JG, Radford SE, Moore GR. NMR analysis of the conformational properties of the trapped on-pathway folding intermediate of the bacterial immunity protein Im7. J Mol Biol. 2007;366:1001–1015. doi: 10.1016/j.jmb.2006.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Barette J, Velyvis A, Religa TL, Korzhnev DM, Kay LE. Cross-validation of the structure of a transiently formed and low populated FF domain folding intermediate determined by relaxation dispersion NMR and CS-Rosetta. J Phys Chem B. 2012 doi: 10.1021/jp209974f. in press. [DOI] [PubMed] [Google Scholar]
  • 23.Allen M, Friedler A, Schon O, Bycroft M. The structure of an FF domain from human HYPA/FBP11. J Mol Biol. 2002;323:411–416. doi: 10.1016/s0022-2836(02)00968-3. [DOI] [PubMed] [Google Scholar]
  • 24.Farrow NA, Zhang OW, Forman-Kay JD, Kay LE. A heteronuclear correlation experiment for simultaneous determination of 15N longitudinal decay and chemical-exchange rates of systems in slow equilibrium. J Biomol NMR. 1994;4:727–734. doi: 10.1007/BF00404280. [DOI] [PubMed] [Google Scholar]
  • 25.Tollinger M, Skrynnikov NR, Mulder FAA, Forman-Kay JD, Kay LE. Slow dynamics in folded and unfolded states of an SH3 domain. J Am Chem Soc. 2001;123:11341–11352. doi: 10.1021/ja011300z. [DOI] [PubMed] [Google Scholar]
  • 26.Korzhnev DM, Billeter M, Arseniev AS, Orekhov VY. NMR studies of Brownian tumbling and internal motions in proteins. Prog Nucl Magn Reson Spectrosc. 2001;38:197–266. [Google Scholar]
  • 27.Kay LE, Torchia DA, Bax A. Backbone dynamics of proteins as studied by 15N inverse detected heteronuclear NMR spectroscopy: Application to staphylococcal nuclease. Biochemistry. 1989;28:8972–8979. doi: 10.1021/bi00449a003. [DOI] [PubMed] [Google Scholar]
  • 28.Farrow NA, et al. Backbone dynamics of a free and a phosphopeptide-complexed Src homology-2 domain studied by 15N NMR relaxation. Biochemistry. 1994;33:5984–6003. doi: 10.1021/bi00185a040. [DOI] [PubMed] [Google Scholar]
  • 29.Korzhnev DM, Skrynnikov NR, Millet O, Torchia DA, Kay LE. An NMR experiment for the accurate measurement of heteronuclear spin-lock relaxation rates. J Am Chem Soc. 2002;124:10743–10753. doi: 10.1021/ja0204776. [DOI] [PubMed] [Google Scholar]
  • 30.Schreiber G. Kinetic studies of protein-protein interactions. Curr Opin Struct Biol. 2002;12:41–47. doi: 10.1016/s0959-440x(02)00287-7. [DOI] [PubMed] [Google Scholar]
  • 31.Wishart DS. Interpreting protein chemical shift data. Prog Nucl Magn Reson Spectrosc. 2011;58:62–87. doi: 10.1016/j.pnmrs.2010.07.004. [DOI] [PubMed] [Google Scholar]
  • 32.Vallurupalli P, Hansen DF, Kay LE. Structures of invisible, excited protein states by relaxation dispersion NMR spectroscopy. Proc Natl Acad Sci USA. 2008;105:11766–11771. doi: 10.1073/pnas.0804221105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Shen Y, Delaglio F, Cornilescu G, Bax A. TALOS plus: A hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. J Biomol NMR. 2009;44:213–223. doi: 10.1007/s10858-009-9333-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Berjanskii MV, Wishart DS. A simple method to predict protein flexibility using secondary chemical shifts. J Am Chem Soc. 2005;127:14970–14971. doi: 10.1021/ja054842f. [DOI] [PubMed] [Google Scholar]
  • 35.Berjanskii MV, Wishart DS. Application of the random coil index to studying protein flexibility. J Biomol NMR. 2008;40:31–48. doi: 10.1007/s10858-007-9208-0. [DOI] [PubMed] [Google Scholar]
  • 36.Mittermaier AK, Kay LE. Observing biological dynamics at atomic resolution using NMR. Trends Biochem Sci. 2009;34:601–611. doi: 10.1016/j.tibs.2009.07.004. [DOI] [PubMed] [Google Scholar]
  • 37.Eichner T, Kalverda AP, Thompson GS, Homans SW, Radford SE. Conformational conversion during amyloid formation at atomic resolution. Mol Cell. 2011;41:161–172. doi: 10.1016/j.molcel.2010.11.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Sattler M, Schleucher J, Griesinger C. Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients. Prog Nucl Magn Reson Spectrosc. 1999;34:93–158. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES