Abstract
Intrinsically disordered proteins and intrinsically disordered regions (IDRs) are ubiquitous in the eukaryotic proteome. The description and understanding of their conformational properties require the development of new experimental, computational, and theoretical approaches. Here, we use nuclear spin relaxation to investigate the distribution of timescales of motions in an IDR from picoseconds to nanoseconds. Nitrogen-15 relaxation rates have been measured at five magnetic fields, ranging from 9.4 to 23.5 T (400–1000 MHz for protons). This exceptional wealth of data allowed us to map the spectral density function for the motions of backbone NH pairs in the partially disordered transcription factor Engrailed at 11 different frequencies. We introduce an approach called interpretation of motions by a projection onto an array of correlation times (IMPACT), which focuses on an array of six correlation times with intervals that are equidistant on a logarithmic scale between 21 ps and 21 ns. The distribution of motions in Engrailed varies smoothly along the protein sequence and is multimodal for most residues, with a prevalence of motions around 1 ns in the IDR. We show that IMPACT often provides better quantitative agreement with experimental data than conventional model-free or extended model-free analyses with two or three correlation times. We introduce a graphical representation that offers a convenient platform for a qualitative discussion of dynamics. Even when relaxation data are only acquired at three magnetic fields that are readily accessible, the IMPACT analysis gives a satisfactory characterization of spectral density functions, thus opening the way to a broad use of this approach.
Introduction
Intrinsically disordered proteins (IDPs) and regions (IDRs) lack a stable three-dimensional structure organized around a hydrophobic core (1). Such proteins nevertheless play crucial roles in many cellular processes (2). The discovery of IDPs and IDRs is a challenge for the structure-function paradigm (3) and has opened the way to new biophysical contributions to modern proteomics (4). The characterization of the conformational space of IDPs and IDRs can provide insight into the ensemble representation of their three-dimensional organization (5–8). A detailed and quantitative description of the time dependence of the exploration of the conformational space of IDPs and IDRs is required to predict (9) and understand the molecular mechanisms underlying their biological function at the atomic scale.
NMR spectroscopy is a powerful tool for probing molecular motions at atomic resolution on a broad range of timescales in both ordered and disordered proteins (6,10,11). In particular, nuclear spin relaxation can be used to probe a diversity of motions from fast (picoseconds to nanoseconds) reorientation to slow (microseconds to milliseconds) chemical exchange (11,12). Pico- and nanosecond motions of protein backbones are most often characterized by analyzing nitrogen-15 relaxation rates, primarily the longitudinal, R1, and transverse, R2, relaxation rates, usually supplemented by 15N-{1H} nuclear Overhauser effects (NOEs). The most general level of analysis provides a map of the spectral density for reorientational motions of the internuclear 15N-1H vectors of the protein backbone (13). In folded proteins, a further step consists in the deconvolution of overall motion (rotational diffusion) and internal dynamics, which is possible when these two types of motions are statistically independent (14,15). The most popular framework for such an analysis is the model-free approach (14), for which the motions of each NH vector are described by a correlation time for overall motion and a correlation time and an order parameter for local motions. The so-called extended model-free approach was later introduced to account for internal motions characterized by two correlation times (16).
The structural flexibility of IDPs and IDRs on nanosecond timescales casts serious doubt on the separation of overall and internal motions. The very notion of a single overall motion can be challenged for IDPs. At best, an overall diffusion tensor would correspond to an average over a set of time-dependent diffusion tensors. In addition, local reorientations of bond vectors due to conformational changes that may occur on timescales similar to the instantaneous overall diffusion would make the statistical independence of internal and overall motions less plausible. New methods therefore have to be developed to describe and rationalize the dynamic properties of IDPs and IDRs (17–19).
Several approaches have been developed in the last 15 years to extract quantitative information about pico- and nanosecond dynamics in IDPs and IDRs from nuclear spin relaxation rates. Often based on spectral density mapping (20,21), most of these approaches rely on the model-free formalism, with residue-specific correlation times (22,23) (i.e., without an overall diffusion tensor), a distribution of picosecond and nanosecond correlation times (24,25), or a statistical analysis of extended model-free parameters (26). An analysis based on a distribution of correlation times necessarily introduces a physical bias, since one must choose a mathematical function to describe the distribution. In the case of the model-independent correlation (MIC) time distribution (26), the statistical independence of three types of motions is not required, since the extended model-free results are considered as a simplified representation of a continuous distribution of correlation times. However, the significance of such a statistical treatment is necessarily limited, since it provides little information about the number of modes of the actual distribution of correlation times. Neither approach seems suited to describe a distribution of correlation times that is a priori unknown. However, simply increasing the number of correlation times or distributions in either approach would be questionable, since the empirical information available from nuclear spin relaxation is limited.
Here, we introduce an approach we call interpretation of motions by a projection onto an array of correlation times (IMPACT) to analyze multiple-field relaxation data in disordered proteins. This method relies as little as possible on any particular physical model of protein motions but constitutes a mathematical reconstruction of the distribution of correlation times. We define an array of n correlation times, τi (or, equivalently, of reciprocal frequencies, ωi = 1/τi) in a range that is effectively sampled by nitrogen-15 relaxation. The experimental spectral density function is then reproduced by a sum of n Lorentzian functions, Ji(ω), one for each correlation time τi. The result of this process, similar to a projection onto a basis of Lorentzian functions, is a discrete distribution of correlation times spanning a range that is relevant to rationalize relaxation. This approach is analogous to the discretization step encountered in regularization methods (27,28), but the volume of experimental data exploited in this study is too limited to use a full regularization approach. Nevertheless, the multimodal character of the distribution of correlation times can be nicely revealed, and the most relevant correlation times for backbone motions can clearly be identified.
IMPACT was originally conceived for a set of relaxation rates obtained at five magnetic fields ranging from 9.4 to 23.5 T (i.e., with proton Larmor frequencies of 400, 500, 600, 800, and 1000 MHz) and later applied to a more limited set recorded at 500, 600, and 800 MHz. Relaxation rates were recorded for a uniformly nitrogen-15-labeled sample of the protein Engrailed 2. Engrailed 2 is a transcription factor that possesses a well-folded DNA-binding homeodomain and a long, 200-residue, mostly disordered N-terminal region. The disordered region plays a crucial role in the regulation of the activity of the protein and, in particular, in binding to transcriptional regulators (29,30). We decided to study an Engrailed 2 fragment (residues 146–259) encompassing the folded homeodomain (residues 200–259) and an N-terminal 54-residue disordered region (residues 146–199) (31). The results of IMPACT show that motions with correlation times close to 1 ns dominate reorientational dynamics in the most disordered regions of the protein, which is believed to be a general property of IDPs and IDRs (32). Yet, the broad variability of correlation times of backbone motions throughout the disordered region of Engrailed stands in stark contrast with the homogeneous dynamic properties of the folded homeodomain. This study reveals a surprising richness of backbone dynamics in IDPs and IDRs on pico- and nanosecond timescales, not found in folded proteins that have been widely studied over the past three decades.
Materials and Methods
Sample
All experiments were performed on a sample of uniformly nitrogen-15-labeled chicken Engrailed 2 (residues 146–259) at a concentration of 0.6 mM in 40 mM sodium succinate buffer at pH 6 supplemented with 1 μg/mL of each of the three protease inhibitors leupeptin, pepstatin, and AEBSF, as well as 10 μM EDTA, which allow one to increase the lifetime of the protein (33). The protein was prepared as described elsewhere (31). Note that the protein construct comprises the residues Gly-Pro-Met at the N-terminus before residue Glu146, which remain after cleavage of the GST-tag by PreScission protease (GE Healthcare, Little Chalfont, UK). All experiments were carried out at 303 K, which was adjusted in each spectrometer to have a chemical shift difference of 1.462 ppm between the signals of the methyl and hydroxyl protons of pure methanol (4% protonated and 96% deuterated).
NMR spectroscopy
The relaxation rates were measured at five different static fields of 9.4, 11.7, 14.1, 18.8, and 23.5 T, with corresponding proton Larmor frequencies of 400, 500, 600, 800, and 1000 MHz. Three aliquots of the same sample were used for all experiments, except at 11.7 T, which was performed on a separate, but identical, sample.
At each field, a full set of 15N relaxation measurements was obtained. The longitudinal relaxation rates, R1(15N), were obtained in the traditional way (34–36), with saturation of the water signal for each scan, whereas the transverse relaxation rates, R2(15N), were recorded with a train of 15N π-pulses (Carr-Purcell-Meiboom-Gill pulse train), interleaved with 1H π-pulses to suppress cross-correlated relaxation effects. 15N-{1H} NOEs were obtained by detecting the 15N steady-state polarization while saturating the protons with a train of π-pulses, with suitable interpulse delays and rf amplitudes (37,38). Finally, experiments to measure the transverse and longitudinal cross-relaxation rates due to correlated fluctuations of the nitrogen-15 chemical shift anisotropy (CSA) and the dipolar coupling between the 15N nucleus and the amide proton were recorded using the so-called symmetrical reconversion principle (39,40). All experiments were recorded on Bruker Avance spectrometers (Billerica, MA). Experiments at 500 MHz, 800 MHz, and 1 GHz, and the NOE at 600 MHz, have been recorded using triple-resonance indirect-detection cryogenic probes (41) equipped with z-axis pulsed-field gradients. Other experiments at 600 MHz were recorded on an indirect-detection triple-resonance probe with triple-axis gradients with detection coils at room temperature. Experiments at 400 MHz were recorded on a liquid-nitrogen-cooled cryogenic probe (Prodigy BBO, Bruker) equipped with a z-axis gradient.
Spectral density analysis
The full analysis was carried out at 11 points obtained with the reduced spectral mapping, J(0.87ωH) and J(ωN) at five fields and J(ω = 0) calculated from relaxation rates measured at 23.5 T. Analyses with two sets of three magnetic fields used seven points on the spectral density function; J(ω = 0) was derived from the relaxation rates measured at the highest magnetic field, i.e., 18.8 T or 23.5 T. A Monte Carlo simulation with 510 steps was performed to evaluate the error of each parameter, Ai. All simulations were carried out with Mathematica (42).
Supporting Material
The Supporting Material includes tables of all relaxation rates used in the analysis and tables of all parameters resulting from conventional analysis with two and three correlation times as well as from our IMPACT analysis; equations relevant for reduced spectral density mapping; a plot of transverse relaxation rates, R2, measured at 18.8 T; a comparison of Akaike’s Information Criteria (AIC) for IMPACT and conventional analyses with two or three correlation times; one-dimensional plots of AIC for five- and six-correlation-time analyses; a correlation of consecutive IMPACT coefficients; plots of IMPACT coefficients and the IMPACT barcode representation of the analysis of relaxation rates based on data recorded at a set of three fields, which cover a broad range (9.4, 14.1, and 23.5 T) and at a set of three more widely accessible fields (11.7, 14.1, and 18.8 T); and IMPACT coefficients for an analysis with five correlation times and τmax = 38 ns.
Results and Discussion
Secondary structure
Fig. 1e displays the secondary structure propensity (SSP) (43) based on the assignment of the protein (31). The three α-helices of the homeodomain (residues 200–259) are well identified by SSP scores close to 1. Another region, including the so-called hexapeptide (residues 169–174, WPAWVY) and surrounding residues, displays SSP scores close to 0.3, thus highlighting the presence of some (residual) structure. The region connecting the hexapeptide and the homeodomain features negative SSP scores, which suggests a trend toward extended conformations.
Figure 1.
Backbone 15N relaxation rates and NOEs measured in Engrailed 2 at five magnetic fields: 400 MHz (red), 500 MHz (burgundy), 600 MHz (purple), 800 MHz (blue), and 1000 MHz (black). (a) Longitudinal relaxation rates, R1, of 15N. (b) 15N-{1H} NOE ratios. (c) Longitudinal cross-relaxation rates, ηz, due to correlated fluctuations of the 15N CSA and the 15N-1H dipolar couplings. (d) Transverse cross-relaxation rates, ηxy, due to the same correlated fluctuations. (e) SSP calculated from the chemical shifts of carbonyl and α and β carbon-13 nuclei. To see this figure in color, go online.
Relaxation experiments were carried out at 400, 500, 600, 800, and 1000 MHz (Fig. 1, a–d) to determine longitudinal R1 nitrogen-15 relaxation rates, the steady-state 15N-{1H} NOE, as well as the longitudinal ηz and transverse ηxy cross-relaxation rates due to correlated fluctuations of the nitrogen-15 CSA and the dipolar coupling with the amide proton. Transverse relaxation rates, R2, were measured using Carr-Purcell-Meiboom-Gill echo trains at 800 MHz (Fig. S1 in the Supporting Material). All experiments were analyzed with NMRPipe and the intensities were obtained from a fit of peaks with the routine nlinLS (44). In some exceptional cases, the limited resolution of spectra measured at 9.4 and 11.7 T may have led to inaccuracies in the intensities of a few poorly resolved peaks.
The uniform decrease of the longitudinal relaxation rates, R1, with increasing magnetic field B0 in the 200–259 homeodomain (Fig. 1a) indicates motions in the nanosecond range, resulting from overall rotational diffusion. The variations R1(B0) are much less pronounced in the disordered region, except in the 169–174 hexapeptide region. Longitudinal cross-relaxation rates, ηz (Fig. 1c), increase with B0 in the IDR. This reflects the very slow decay, slower than 1/ω, of the spectral density function in the range 40–100 MHz (i.e., the range of 15N Larmor frequencies between 9.4 and 23.5 T), as the increase of the amplitude of the CSA interaction counterbalances the decay of the spectral density function with increasing frequency. The profile of transverse relaxation rates, R2, is marked by variations along the sequence of the protein of both the distribution of picosecond-nanosecond motions and contributions of chemical exchange (Fig. S1). NOEs are sensitive markers of local order in IDPs and IDRs and have been used as such for many years (45). Indeed, the variations of NOEs along the sequence are pronounced at moderate fields (9.4–14.1 T); however, the profile of NOEs is much flatter at high fields, in particular at 23.5 T. On the other hand, transverse cross-correlation rates, ηxy, which depend primarily on J(ω = 0), exhibit sharp variations at all fields that are strongly correlated with SSP scores. This suggests that transverse cross-correlation rates, ηxy, should become the method of choice to characterize order in IDPs and IDRs.
Spectral density mapping
Most current software packages designed to characterize protein dynamics based on relaxation rates (46–50) offer a direct derivation of the parameters of local dynamics (order parameters and correlation times for local motions). This approach is efficient and reliable when the theoretical framework of the analysis has been validated. Since a general understanding of motions in intrinsically disordered proteins is still lacking, the derivation of the spectral densities from relaxation rates provides a representation of experimental data that is more amenable to physical analysis than relaxation rates (20).
Spectral density mapping (13) can be achieved without resorting to any proton auto-relaxation rate (51,52). The effective spectral density at high frequency, J(0.87ωH) (see Fig. 2b), can be derived from {1H}-15N NOE and longitudinal nitrogen-15 relaxation rates for different magnetic fields according to (51)
| (1) |
with . μ0 is the permittivity of free space; γH and γN are the gyromagnetic ratios of the proton and nitrogen-15 nuclei; is Planck’s constant divided by 2π and rNH = 1.02 Å is the distance between the amide proton and the nitrogen-15 nucleus.
Figure 2.
Spectral density functions for backbone NH vectors in Engrailed 2. (a) Effective spectral density near the proton Larmor frequency, J(0.87ωH) (ns). (b) Spectral density at the Larmor frequency of nitrogen-15, J(ωN) (ns). (c) Spectral density at zero frequency, J(0) (ns). All data are color-coded as a function of the magnetic field at which the relaxation rates were recorded, with the same code as in Fig. 1. To see this figure in color, go online.
Our data recorded at five magnetic fields allowed us to fit the spectral density function at high frequency, J(0.87ωH), to the expression
| (2) |
in analogy to an earlier study of carbon-13 relaxation (53). The parameters λ and μ are real positive numbers. This functional form is expected to be a good approximation of the spectral density at high frequency in a folded protein, but not necessarily for a protein with significant motions with correlation times in the hundreds of picoseconds. Nevertheless, we obtain satisfactory fits for all residues in the IDR as well as in the homeodomain. This validates the self-consistency of the use of a single effective frequency, ωeff = 0.87ωH, in Eqs. 8–10 of the article by Farrow et al. (51) (see Eqs. S1–S6), where the spectral density function at high frequency was assumed to be of the form of Eq. 2 in both the folded homeodomain and the IDR. Thus, most results of spectral density mapping pertaining to disordered proteins that have been published in recent years are validated.
The results of the fit of the spectral density were used to evaluate contributions to the spectral density at higher frequencies (at ωH ± ωN) in the derivation of J(ωN) from the rate R1 according to the equation
| (3) |
with and Δσ = 160 ppm is the axially symmetric CSA of the nitrogen-15 nucleus.
Overall, contributions of high-frequency terms to R1 are small (54), so that the deviations between the values of J(ωN) obtained from a series of approximations (51) and the current method are limited to ∼2%, which is commensurate with the estimated precision (Fig. S2). Again, this validates, a posteriori, many spectral density mapping studies performed on IDPs. In addition, the low sensitivity of J(ωN) upon the model used to describe the spectral density at high frequency shows that the enhanced accuracy expected from more sophisticated approaches, for instance, following Kadeřávek et al. (20), would be smaller than the typical precision of our measurements. The values of J(ωN) derived at five magnetic fields are shown in Fig. 2b.
To avoid contributions from line-broadening due to chemical exchange, we did not consider transverse relaxation rates, R2(15N), and only used longitudinal and transverse CSA/DD cross-correlated relaxation rates, ηz and ηxy, to derive the spectral density J(0) from J(ωN) using (55)
| (4) |
As can be seen in Fig. S3, measurements of ηz and ηxy are not precise enough at lower fields to provide reliable estimates of J(0) by lack of sensitivity (in particular for ηz). However, the data recorded at 18.8 T and 23.5 T (Fig. 2c) are very similar and do not show any of the outliers observed at lower fields. Significant chemical exchange contributions to R2(15N) can be observed in the hexapeptide region of the disordered region and in the homeodomain (see Fig. S1). Such contributions preclude the proper derivation of J(0) from R2(15N) rates, in particular at high magnetic fields.
Principles and optimization of IMPACT
The limitations of conventional approaches to the analysis of relaxation rates in IDPs and IDRs result from the complexity of their dynamics. These span at least three orders of magnitude, so that it appears unlikely that they can be accurately described by a single distribution of correlation times or by a small number of correlation times. However, the scarcity of relaxation rates limits the number of adjustable parameters that can be determined and thus the sophistication of spectral density functions that can be postulated. Here, we significantly increase the number of correlation times by defining an array of n fixed correlation times. Only the relative coefficient of each correlation time in the distribution is fitted to experimental data, so that the number of adjustable parameters is reduced. Thus, our only assumption is that the correlation function can be approximated by a sum of exponentials. The physical content of the IMPACT model is thus limited to a minimum. IMPACT can be described as a mathematical approach that converts experimental relaxation rates (or, equivalently, spectral density mapping results) into a distribution of correlation times that is more amenable to physical interpretation than the raw experimental data. The array of n correlation times is defined as a geometric series, so that correlation times are equally spaced on a logarithmic scale (Fig. 3, a and b):
| (5) |
Thus, J(ω) is a sum of Lorentzian functions:
| (6) |
where Ai is the coefficient of correlation time τi in the spectral density function. The coefficients Ai must be positive and fulfill the normalization constraint
| (7) |
Thus, the number of free parameters is n−1.
Figure 3.
Principle and optimization of the parameters of our IMPACT analysis. (a) In the 3CT analysis, both the value and the relative weight of each correlation time must be adjusted. (b) In IMPACT, the values of the correlation times are fixed and equally spaced on a logarithmic scale, so that only their relative weights need adjusting. (c) Optimization of IMPACT by considering AIC. The range (τmin, τmax) of correlation times characterized by IMPACT was varied from (1 ps, 1 ns) to (100 ps, 100 ns) and the number of correlation times was varied in the range n = 4–9. Despite the solid lines shown in the contour plot (c), the reader should be aware that the number of correlation times is an integer. To see this figure in color, go online.
A preliminary step of the IMPACT analysis is the optimization of the three parameters τmin, τmax, and n. The first step is to define the range of correlation times that are probed by relaxation rates. A series of correlation times could be chosen as the inverse of the Larmor frequencies at which the spectral density is mapped, in analogy to the study by LeMaster (56). Considering that the range of frequencies where a Lorentzian function varies extends beyond the inflection point, we chose a slightly different approach. We first define the range of correlation times that are sampled by various 15N relaxation rates. We consider that the lowest magnetic field adapted to protein studies is 400 MHz, whereas the highest accessible field currently is 1 GHz. Thus, the lowest nonzero frequency at which the spectral density is sampled is ωN/2π = 40 MHz, and the highest is 0.87ωH/2π = 870 MHz. A Lorentzian function with a correlation time of τc = 40 ns drops to 1% of J(0) at ωN/2π = 40 MHz. A Lorentzian with τc = 18 ps merely decreases to 99% of J(0) at 0.87ωH/2π = 870 MHz. The resulting range spans slightly more than three orders of magnitude. Therefore, we have decided to limit the range to three orders of magnitude
| (8) |
To define the optimal values of τmin and τmax, we carried out a series of IMPACT analyses for [τmin, τmax] = [1 ps, 1 ns] to [100 ps, 100 ns], as well as for 4 < n < 9. In contrast to the approach of LeMaster (56), the number of correlation times is adjustable. The statistical relevance of each combination of parameters was evaluated from the resulting Akaike’s information criteria (AIC) (57–60):
| (9) |
nexp = nJ × nres is the total number of experimental data, with nJ = 11 points at which the spectral density function J(ω) is sampled when relaxation data at five magnetic fields are used; nres = 108 is the number of residues included in the analysis; and nmodel = (n − 1) × nres is the number of free parameters in each model. Here, the constant is C = 0. AIC are shown in Fig. 3c. Two local minima were found for [τmin, τmax] = [34 ps, 34 ns] with n = 5 and for [τmin, τmax] = [21 ps, 21 ns] with n = 6. The likelihood of the latter array of correlation times is 103 times higher than that of the former. Here, we will thus present the IMPACT analysis with [τmin, τmax] = [21 ps, 21 ns] and n = 6; the analysis with [τmin, τmax] = [34 ps, 34 ns] and n = 5 can be found in the Supporting Material. This result is dominated by the diverse dynamic properties of the IDR of Engrailed. Indeed, if we exclude the rigid residues of the homeodomain, the optimal parameters change slightly (residues 146–207) to [τmin, τmax] = [42 ps, 42 ns] with n = 5, whereas the optimal set of parameters for the rigid part of the homeodomain alone (residues 208–259) is significantly different, [τmin, τmax] = [10 ps, 10 ns] and n = 4.
Application of IMPACT to Engrailed
The optimal parameters n = 6 and [τmin, τmax] = [21 ps, 21 ns] were employed to analyze the spectral density function in Engrailed. Note that the coefficients Ai were fitted to spectral density mapping results. In principle, relaxation rates could also be used directly as input for the IMPACT analysis. Fig. 4 illustrates the remarkable variety of dynamics found in Engrailed. In the homeodomain, the second correlation time, τ2, lies just below the correlation time for overall rotational diffusion, which is close to 7 ns, as can be seen from the analysis based on only two correlation times (vide infra). Thus, the second coefficient is by far the most important in the homeodomain. A small amplitude A1 of τ1 corrects for the fact that τ2 is shorter than the actual correlation time for the motion of the whole domain. Note that the correlation time for overall diffusion, τm, is well approximated by:
| (10) |
The average value over the helices of the homeodomain is <τm> = 7.19 ns, which is in good agreement with an estimate of the correlation time for overall diffusion (see below). Small but significant and mostly uniform contributions A3 of the correlation time τ3 in the three α-helices, which are also obtained in a conventional model-free analysis (vide infra), may be attributed to fluctuations of the overall diffusion tensor (61), likely due to conformational fluctuations of the IDR (residues 146–199). Enhanced values of A3 in the loops may reflect the flexibility of these regions (41). The very small coefficients A4 and A5 demonstrate the presence of a gap in the distribution of correlation times, as was also observed in ubiquitin (62). Finally, the coefficients A6 for the shortest correlation time τ6 indicate the presence of fast motions in the tens of picoseconds range. Note that the Lorentzian function J6(ω) drops by ∼1% of J6(0) at the highest frequency explored in this analysis (i.e., ω/2π = 870 MHz). Thus, this last term in the spectral density function can be approximated to a constant that effectively represents all fast motions:
| (11) |
where p(τ) is the probability function of correlation times, containing little information on the complexity of such motions (63).
Figure 4.
Plots of the six coefficients, Ai (i = 1, 2… 6) of the n = 6 correlation times, τi, in the range [τmin, τmax] = [21 ps, 21 ns] determined by the IMPACT analysis of Engrailed: (a) τ1 = 21 ns; (b) τ2 = 5.27 ns; (c) τ3 = 1.33 ns; (d) τ4 = 333 ps; (e) τ5 = 83.6 ps; (f) τ6 = 21 ps. To see this figure in color, go online.
Results obtained in the disordered region of Engrailed will be discussed with the help of Fig. 4 but also with the IMPACT barcode shown in Fig. 5. In the latter, for each residue, the width of each histogram represents the coefficient Ai associated with the correlation time τi that can be read on the y axis. This graph appears to be a convenient way to display the results of the IMPACT analysis in a single figure.
Figure 5.
Graphical representations of (a) IMPACT, (c) 2CT, and (d) 3CT analyses of the spectral density function in Engrailed 2. Histograms are drawn for all residues and represent the contributions of (a) each of the six correlation times, τi (i = 1, 2, …6), considered in IMPACT, (c) each of the two correlation times, τa,b, determined by the 2CT analysis, (d) each of the three correlation times, τa,b,c, determined by the 3CT analysis. The width of each rectangle is proportional to the corresponding weights Ai in IMPACT (a), Ba,b in 2CT (c); and Ba,b,c in 3CT (d). In (c) and (d), the light blue horizontal bars represent the ranges of correlation times, τ, for which reciprocal frequencies lie in the constrained regions between 40 < 1/(2π τ) < 100 MHz or between 348 < 1/(2πτ) < 870 MHz. Gray rectangles in (a), (c), and (d) indicate rigid α-helices, and a green rectangle shows the rigid hydrophobic hexapeptide sequence. (b) As in Fig. 1e, the SSP is shown to guide the comparison between structural and dynamic features. To see this figure in color, go online.
For the first residues at the N-terminus and the last residues at the C-terminus, significant coefficients A3–6 are found for the four shortest correlation times. This seems to indicate the presence of motions that are broadly distributed over all timescales up to 1 ns. On the other hand, the two disordered regions just at the N-terminus and the C-terminus of the hexapeptide display a high density of motions around τ3. The coefficients for the correlation time τ4 decrease almost linearly with the distance to the N- or C-termini of the polypeptide chain in disordered regions and reach different plateaus in each disordered segment. A notable difference between the disordered region at the N-terminus and the one between the hexapeptide and the homeodomain is the slight decrease of the coefficient A3 and a significant increase of A2. It is difficult to assign this change to a particular process without a better characterization of the conformational space of the protein (such characterization is beyond the scope of this article and is a work in progress). Nevertheless, two effects may contribute to the presence of some orientational order beyond 1 ns. First, this IDR is short and located between a folded domain and a small hydrophobic cluster, so that its dynamics is likely influenced by the overall diffusion of both structured elements. Second, this IDR contains a majority of residues that favor extended conformations, as confirmed by SSP scores: three proline residues, nine positively charged, and only one negatively charged residue between positions 177 and 198, which should restrict the conformational space and possibly slow down reorientational dynamics.
The barcode representation of IMPACT coefficients nicely illustrates variations of the ensemble of correlation times between successive structural elements. Thus, for instance, the decrease of motions in the subnanosecond range in the hexapeptide is accompanied by an increase in the suprananosecond range. Similarly, variations of the coefficients for correlation times at the N- and C-termini of the homeodomain (residues 200–210 and 254–260) illustrate the smooth transition of motional properties along the sequence of the polypeptide. Finally, even in the homeodomain, where a classical analysis of relaxation data should be most appropriate, the dynamic transitions between helices and the two loops are clearly visible and quantitatively characterized by the IMPACT approach. Loop α1-α2 features enhanced dynamics in both the 1 ns and tens of picoseconds ranges, as expected from the motions demonstrated by paramagnetic relaxation enhancement studies (41), whereas loop α2-α3 shows a significant but more moderate enhancement of motions. Overall, the IMPACT representation offers an elegant view of the correlation of structural and dynamic features, as can be seen from the SSP scores.
Comparison with conventional analyses based on two or three correlation times
For the sake of comparison, we also fit two simple models where the spectral density function is assumed to consist of a sum of two and three Lorentzians in the manner of the familiar model-free and extended model-free approaches. However, as discussed in the Introduction, the core hypotheses of the model-free formalism cannot be fulfilled a priori, since the longest correlation time is probably an effective correlation time rather than the correlation time of overall rotational diffusion. The spectral density, J2CT, assuming two correlation times (2CT) can be written as
| (12) |
where , τa is the long correlation time, τb is the short effective correlation time, and S2 is similar to the model-free order parameter. The spectral density function J3CT, assuming three correlation times (3CT), can be defined as
| (13) |
where , , and is equivalent to the extended model-free order parameter for fast processes with a correlation time . The two functions were fitted to the experimental spectral density, and a simple model selection was based on the comparison of the second order variant of AIC (see the Supporting Material), with nJ = 11 and nmodel = 3 for the 2CT analysis and 5 for the 3CT analysis.
Results of this analysis are shown in Fig. 6. From a statistical point of view, the 2CT analysis is found to be sufficient to describe the motions in the mostly rigid homeodomain (residues 200–259), except at the flexible N- and C termini. With few exceptions, the longest correlation time yields a reliable measure of overall rotational diffusion, and the average value over the helices of the homeodomain is <τa>hom = 7.08 ns, in good agreement with the IMPACT analysis, with <τm> = 7.19 ns. Interestingly, the second correlation time, τb, found for most residues in the rigid homeodomain lies in the range 0.9 < τb < 1.8 ns. This is in agreement with the IMPACT analysis and is possibly due to fluctuations of the overall diffusion tensor resulting from conformational transitions in the disordered N-terminal region on timescales between 1 and 100 ns (61,64). In the disordered region (residues 146–199), the 2CT and 3CT analyses seem equally probable, with no particular pattern along the sequence, except in the hydrophobic hexapeptide cluster (residues 169–174), where the 2CT analysis is more satisfactory. The random patterns of 2CT versus 3CT selection seems to point to some instability of the model-selection step in the fit procedure. A 2CT or 3CT analysis can be performed with no model selection (see Fig. 5). The built-in absence of site-specific model selection in IMPACT shields this analysis from such a drawback. Low order parameters S2 are found throughout the IDR, with a significant increase in order in the hydrophobic cluster. The long correlation times in the disordered regions have a broad distribution (standard deviation of 2.5 ns), but the average value, <τa>IDR = 5.9 ns, is similar to what was found in unfolded proteins (32) and IDPs (65) and very close to the correlation time for overall tumbling of the homeodomain. The intermediate correlation time, which corresponds to the dominant term in the spectral density function, lies in the range 0.1 < τb < 1.4 ns, in agreement with the IMPACT analysis. The shortest correlation time is rather uniform and lies in the range 40 < τc < 120 ps.
Figure 6.
Results obtained for a conventional analysis with 2CT (blue) or 3CT (red). (a) Order parameter S2. (b) Order parameter S2f for the fastest motion in the 3CT analysis. (c) Longest correlation time, τa. (d and e) Intermediate correlation time, τb (d), and shortest correlation time, τc (e). Either the 2CT or the 3CT model was selected based on the lowest AIC. To see this figure in color, go online.
One should be careful with the physical interpretation of these observations in the IDR of Engrailed. The three correlation times obtained are clearly separated in the time domain, which indicates a broad range of dynamic processes. The results should not be considered a priori as actual correlation times of particular motions, but rather as the best rendition of experimental results using two or three correlation times. This is illustrated by the jumps of order parameters and correlation times observed between the 2CT and 3CT models in the IDR, which illustrates the effective character of the fitted correlation times in this region, at least in the 2CT analysis. For instance, it is difficult to assign the longest correlation time to any particular dynamical process in the absence of complementary experimental or computational information. Such a process could be a single well-defined type of motion, such as the rotational diffusion of an IDR segment. Alternatively, the longest correlation time might account for the tail of a continuous distribution of correlation times and reflect slower motions in parts of the conformational space of the IDR. Interestingly, the correlation times obtained in a 3CT analysis often correspond to reciprocal frequencies (ω = 1/τ) that lie outside regions where the spectral density function can be adequately sampled (i.e., below 40 MHz, between 100 MHz and 348 MHz, and above 870 MHz). This is particularly true in the flexible region between the rigid hexapeptide and the rigid homeodomain and at the C-terminus of the protein. The regions where the spectral density function is most sensitive to the choice of correlation times correspond to ranges where we lack experimental constraints. This would be expected in the presence of a broad distribution of correlation times that would lead to a smooth decay of the spectral density function.
A direct comparison between the results of the 3CT analysis and our IMPACT approach is illustrated in Fig. 5. For the sake of comparison, we define in Fig. 5a the coefficients Ba,b,c associated with the correlation times τa,b,c, as
| (14) |
The statistical significance of the fit resulting from our IMPACT analysis is often better than with either the 2CT or the 3CT analysis in the disordered regions, although it is somehow comparable to that of the 2CT model in the rigid homeodomain (Fig. S4). In particular, the almost complete absence of abnormally elevated χ2 values in the IDR of Engrailed shows that a faithful set of fitted Ai parameters can be obtained with diverse dynamical features. Interestingly, as can be seen from the schematic representation of correlation times that correspond to reciprocal frequencies that can be determined by spectral density mapping, the inflection points of most of the Lorentzian functions often lie in frequency regions where spectral density mapping does not yield any results. This is particularly true in the IDR between the hexapeptide and the homeodomain and at the C-terminus of the protein. This seems to indicate that the decay of the spectral density function is smoother than can be described by a sum of three Lorentzian functions. The fit pushes the inflection points of individual contributions to the spectral density function beyond the areas that benefit from rich experimental constraints. In addition, the absence of a residue-specific model selection in IMPACT provides results that are directly comparable, residue by residue, which allows for a better qualitative description of dynamic properties along the protein sequence. Admittedly, model selection can be omitted in the 2CT or 3CT analysis, as in Fig. 5, a and b.
A potential concern of the IMPACT analysis lies in the fact that the correlation functions Ji(ω) are not independent, since they suffer from significant overlap. Hence, it is possible that different ensembles of coefficients Ai can describe the same experimental data. To test the sensitivity of our analysis to this potential flaw, we have plotted correlations of consecutive coefficients Ai (i.e., Ai as a function of Ai+1) for all 510 steps of the Monte Carlo procedure employed in the fit. Typical results are shown in Fig. S6. There is a small anticorrelation between consecutive coefficients in several instances. This will give rise to a broader distribution of individual coefficients and thus lead to a decrease of the precision of these coefficients. In the worst case, a potential decrease of accuracy due to the interdependence of consecutive coefficients will be accompanied with a decrease in precision.
A potential concern of the IMPACT analysis is the risk of overinterpretation of the results. Here, we should be clear and provide a set of rules that should be followed by users of this approach.
-
1)
The correlation times, τi, are not physical correlation times of the system a priori. The range of correlation times is defined by the experimental observables, but the individual values τi are derived from a statistical analysis, not a physical analysis.
-
2)
A nonzero coefficient A1, with τ1 the longest correlation time, means not that some motions with a correlation time τ1 were detected but that the distribution of correlation times is larger than zero for some correlation times larger than τ2.
-
3)
Similarly, as mentioned in Eq. 11, the coefficient An of the shortest correlation time τn is an effective representation of all fastest motions.
-
4)
If the coefficient Ai is larger than zero, the distribution of correlation times is larger than zero somewhere between τi+1 and τi-1.
-
5)
If the coefficients Ai and Ai+1 are both zero, the distribution of correlation times is expected to be zero at least between τi+1 and τi.
Finally, very few relaxation studies have compared rates at five or more magnetic fields (66,67). We have tested whether an IMPACT analysis of relaxation rates recorded at only three fields could give meaningful results, using either a broad range of magnetic fields (9.4, 14.1, and 23.5 T) or a narrow range of readily accessible magnetic fields (11.7, 14.1, and 18.8 T). In either case, this requires about two weeks of experimental time. The results of the analysis of relaxation rates at 9.4, 14.1, and 23.5 T shown in Figs. S7 and S8 are remarkably similar to those presented in Figs. 4 and 5. When the range of magnetic fields is restricted, with relaxation rates measured at 11.7, 14.1 and 18.8 T (Figs S9 and S10), some significant changes of IMPACT coefficients can be observed, but the overall description of the distribution of correlation times is very similar to what is obtained with relaxation rates at five magnetic fields. Hence, IMPACT can be applied to many proteins at a moderate cost in experimental time, and does not necessarily require exceptionally large data sets or magnetic fields as high as 23.5 T.
Conclusions
We have presented a set of nitrogen-15 relaxation rates in the 114-residue, partially disordered protein Engrailed 2 recorded at five different magnetic fields. The transverse cross-correlated rate, ηxy, was shown to be the most sensitive to the extent of order and disorder at all magnetic fields. The analysis validates the reduced spectral density mapping approach originally developed for folded proteins and already extensively applied to IDPs and IDRs. The spectral density functions can be fitted reasonably well with two or three correlation times, although such results may be difficult to interpret. We have introduced an approach to the analysis of spectral density functions, which we call IMPACT. This provides a better quantitative description of spectral density functions in IDRs as found in Engrailed than an analysis with three correlation times with the same number of adjustable parameters. We also introduce a barcode representation of IMPACT, which provides a condensed graphical representation of large amounts of data in a single figure. This representation lends itself to a qualitative discussion of order and disorder in proteins. IMPACT can also be useful for analyzing a smaller set of relaxation rates recorded at only three magnetic fields. IMPACT provides a unique framework for the description of the timescales of motions in IDPs and IDRs. Our approach is complementary to the determination of conformational ensembles (7,68). Insight into the dynamics of IDPs and IDRs should greatly benefit from a combined analysis.
Author Contributions
F.F., O.L., and P.P. designed the research; S.N.K., C.C., R.A., N.S., O.L., P.P., and F.F. performed the research; C.C., N.S., V.D., G.B., and P.P. contributed analytical tools; S.N.K., C.C., R.A., and V.D. analyzed the data; and S.N.K., C.C., G.B., O.L., P.P., and F.F. wrote the manuscript.
Acknowledgments
We thank Bénédicte Elena-Herrmann (École Normale Supérieure de Lyon), Nelly Morellet (International Cancer Screening Network, Gif-sur-Yvette), and Martial Piotto (Bruker, Wissembourg, France) for assistance in recording experiments, and Arthur G. Palmer (Columbia University, New York) and Daniel Abergel (École Normale Supérieure) for many fruitful discussions.
This research was funded by the European Research Council (ERC) under the European Community’s Seventh Framework Programme (FP7/2007-2013), ERC grant agreement 279519 (2F4BIODYN), and the Agence Nationale de la Recherche (ANR-11-BS07-031-01). Financial support from the IR-RMN-THC FR3050, Centre Nationale de la Recherche Scientifique, is gratefully acknowledged.
Editor: Nathan Baker
Footnotes
This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Shahid N. Khan and Cyril Charlier contributed equally to this work.
Rafal Augustyniak’s present address is Departments of Biochemistry, Chemistry and Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada.
Nicola Salvi’s present address is Institut de Biologie Structurale Jean-Pierre Ebel, CNRS-CEA-UJF UMR 5075, 41 rue Jules Horowitz, 38027 Grenoble Cedex, France.
Supporting Materials and Methods, twelve figures, and thirteen tables are available at http://www.biophysj.org/biophysj/supplemental/S0006-3495(15)00731-6.
Supporting Material
References
- 1.van der Lee R., Buljan M., Babu M.M. Classification of intrinsically disordered regions and proteins. Chem. Rev. 2014;114:6589–6631. doi: 10.1021/cr400525m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Dyson H.J., Wright P.E. Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell Biol. 2005;6:197–208. doi: 10.1038/nrm1589. [DOI] [PubMed] [Google Scholar]
- 3.Uversky V.N., Dunker A.K. Understanding protein non-folding. Biochim. Biophys. Acta. 2010;1804:1231–1264. doi: 10.1016/j.bbapap.2010.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Dunker A.K., Lawson J.D., Obradovic Z. Intrinsically disordered protein. J. Mol. Graph. Model. 2001;19:26–59. doi: 10.1016/s1093-3263(00)00138-8. [DOI] [PubMed] [Google Scholar]
- 5.Jensen M.R., Markwick P.R.L., Blackledge M. Quantitative determination of the conformational properties of partially folded and intrinsically disordered proteins using NMR dipolar couplings. Structure. 2009;17:1169–1185. doi: 10.1016/j.str.2009.08.001. [DOI] [PubMed] [Google Scholar]
- 6.Jensen M.R., Zweckstetter M., Blackledge M. Exploring free-energy landscapes of intrinsically disordered proteins at atomic resolution using NMR spectroscopy. Chem. Rev. 2014;114:6632–6660. doi: 10.1021/cr400688u. [DOI] [PubMed] [Google Scholar]
- 7.Marsh J.A., Forman-Kay J.D. Structure and disorder in an unfolded state under nondenaturing conditions from ensemble models consistent with a large number of experimental restraints. J. Mol. Biol. 2009;391:359–374. doi: 10.1016/j.jmb.2009.06.001. [DOI] [PubMed] [Google Scholar]
- 8.Tompa P. On the supertertiary structure of proteins. Nat. Chem. Biol. 2012;8:597–600. doi: 10.1038/nchembio.1009. [DOI] [PubMed] [Google Scholar]
- 9.Lindorff-Larsen K., Trbovic N., Shaw D.E. Structure and dynamics of an unfolded protein examined by molecular dynamics simulation. J. Am. Chem. Soc. 2012;134:3787–3791. doi: 10.1021/ja209931w. [DOI] [PubMed] [Google Scholar]
- 10.Mittermaier A., Kay L.E. New tools provide new insights in NMR studies of protein dynamics. Science. 2006;312:224–228. doi: 10.1126/science.1124964. [DOI] [PubMed] [Google Scholar]
- 11.Palmer A.G., 3rd NMR characterization of the dynamics of biomacromolecules. Chem. Rev. 2004;104:3623–3640. doi: 10.1021/cr030413t. [DOI] [PubMed] [Google Scholar]
- 12.Palmer A.G., 3rd Chemical exchange in biomacromolecules: past, present, and future. J. Magn. Reson. 2014;241:3–17. doi: 10.1016/j.jmr.2014.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Peng J.W., Wagner G. Mapping of spectral density-functions using heteronuclear NMR relaxation measurements. J. Magn. Reson. 1992;98:308–332. [Google Scholar]
- 14.Lipari G., Szabo A. Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 1. Theory and range of validity. J. Am. Chem. Soc. 1982;104:4546–4559. [Google Scholar]
- 15.Halle B. The physical basis of model-free analysis of NMR relaxation data from proteins and complex fluids. J. Chem. Phys. 2009;131:224507. doi: 10.1063/1.3269991. [DOI] [PubMed] [Google Scholar]
- 16.Clore G.M., Szabo A., Gronenborn A.M. Deviations from the simple two-parameter model-free approach to the interpretation of nitrogen-15 nuclear magnetic relaxation of proteins. J. Am. Chem. Soc. 1990;112:4989–4991. [Google Scholar]
- 17.Eliezer D. Biophysical characterization of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 2009;19:23–30. doi: 10.1016/j.sbi.2008.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Jensen M.R., Ruigrok R.W.H., Blackledge M. Describing intrinsically disordered proteins at atomic resolution by NMR. Curr. Opin. Struct. Biol. 2013;23:426–435. doi: 10.1016/j.sbi.2013.02.007. [DOI] [PubMed] [Google Scholar]
- 19.Konrat R. NMR contributions to structural dynamics studies of intrinsically disordered proteins. J. Magn. Reson. 2014;241:74–85. doi: 10.1016/j.jmr.2013.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kadeřávek P., Zapletal V., Žídek L. Spectral density mapping protocols for analysis of molecular motions in disordered proteins. J. Biomol. NMR. 2014;58:193–207. doi: 10.1007/s10858-014-9816-4. [DOI] [PubMed] [Google Scholar]
- 21.Bussell R., Jr., Eliezer D. Residual structure and dynamics in Parkinson’s disease-associated mutants of α-synuclein. J. Biol. Chem. 2001;276:45996–46003. doi: 10.1074/jbc.M106777200. [DOI] [PubMed] [Google Scholar]
- 22.Houben K., Blanchard L., Marion D. Intrinsic dynamics of the partly unstructured PX domain from the Sendai virus RNA polymerase cofactor P. Biophys. J. 2007;93:2830–2844. doi: 10.1529/biophysj.107.108829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Brutscher B., Brüschweiler R., Ernst R.R. Backbone dynamics and structural characterization of the partially folded A state of ubiquitin by 1H, 13C, and 15N nuclear magnetic resonance spectroscopy. Biochemistry. 1997;36:13043–13053. doi: 10.1021/bi971538t. [DOI] [PubMed] [Google Scholar]
- 24.Buevich A.V., Baum J. Dynamics of unfolded proteins: incorporation of distributions of correlation times in the model free analysis of NMR relaxation data. J. Am. Chem. Soc. 1999;121:8671–8672. [Google Scholar]
- 25.Buevich A.V., Shinde U.P., Baum J. Backbone dynamics of the natively unfolded pro-peptide of subtilisin by heteronuclear NMR relaxation studies. J. Biomol. NMR. 2001;20:233–249. doi: 10.1023/a:1011243116136. [DOI] [PubMed] [Google Scholar]
- 26.Modig K., Poulsen F.M. Model-independent interpretation of NMR relaxation data for unfolded proteins: the acid-denatured state of ACBP. J. Biomol. NMR. 2008;42:163–177. doi: 10.1007/s10858-008-9280-0. [DOI] [PubMed] [Google Scholar]
- 27.Sternin E. Use of inverse theory algorithms in the analysis of biomembrane NMR data. Methods Mol. Biol. 2007;400:103–125. doi: 10.1007/978-1-59745-519-0_8. [DOI] [PubMed] [Google Scholar]
- 28.Aster R.C., Borchers B., Thurber C.H. 2nd ed. Academic Press; New York: 2012. Parameter Estimation and Inverse Problems. [Google Scholar]
- 29.Foucher I., Montesinos M.L., Trembleau A. Joint regulation of the MAP1B promoter by HNF3β/Foxa2 and Engrailed is the result of a highly conserved mechanism for direct interaction of homeoproteins and Fox transcription factors. Development. 2003;130:1867–1876. doi: 10.1242/dev.00414. [DOI] [PubMed] [Google Scholar]
- 30.Peltenburg L.T.C., Murre C. Engrailed and Hox homeodomain proteins contain a related Pbx interaction motif that recognizes a common structure present in Pbx. EMBO J. 1996;15:3385–3393. [PMC free article] [PubMed] [Google Scholar]
- 31.Augustyniak R., Balayssac S., Lequin O. 1H, 13C and 15N resonance assignment of a 114-residue fragment of Engrailed 2 homeoprotein, a partially disordered protein. Biomol. NMR Assign. 2011;5:229–231. doi: 10.1007/s12104-011-9306-5. [DOI] [PubMed] [Google Scholar]
- 32.Xue Y., Skrynnikov N.R. Motion of a disordered polypeptide chain as studied by paramagnetic relaxation enhancements, 15N relaxation, and molecular dynamics simulations: how fast is segmental diffusion in denatured ubiquitin? J. Am. Chem. Soc. 2011;133:14614–14628. doi: 10.1021/ja201605c. [DOI] [PubMed] [Google Scholar]
- 33.Augustyniak R., Ferrage F., Bodenhausen G. Methods to determine slow diffusion coefficients of biomolecules: applications to Engrailed 2, a partially disordered protein. J. Biomol. NMR. 2011;50:209–218. doi: 10.1007/s10858-011-9510-8. [DOI] [PubMed] [Google Scholar]
- 34.Kay L.E., Nicholson L.K., Torchia D.A. Pulse sequences for removal of the effects of cross-correlation between dipolar and chemical-shift anisotropy relaxation mechanism on the measurement of heteronuclear T1 and T2 values in proteins. J. Magn. Reson. 1992;97:359–375. [Google Scholar]
- 35.Palmer A.G., III, Skelton N.J., Rance M. Suppression of the effects of cross-correlation between dipolar and anisotropic chemical-shift relaxation mechanisms in the measurement of spin spin relaxations rates. Mol. Phys. 1992;75:699–711. [Google Scholar]
- 36.Ferrage F. Protein dynamics by 15N nuclear magnetic relaxation. Methods Mol. Biol. 2012;831:141–163. doi: 10.1007/978-1-61779-480-3_9. [DOI] [PubMed] [Google Scholar]
- 37.Ferrage F., Cowburn D., Ghose R. Accurate sampling of high-frequency motions in proteins by steady-state 15N-1H nuclear Overhauser effect measurements in the presence of cross-correlated relaxation. J. Am. Chem. Soc. 2009;131:6048–6049. doi: 10.1021/ja809526q. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ferrage F., Reichel A., Ghose R. On the measurement of 15N-1H nuclear Overhauser effects. 2. Effects of the saturation scheme and water signal suppression. J. Magn. Reson. 2010;207:294–303. doi: 10.1016/j.jmr.2010.09.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Pelupessy P., Espallargas G.M., Bodenhausen G. Symmetrical reconversion: measuring cross-correlation rates with enhanced accuracy. J. Magn. Reson. 2003;161:258–264. doi: 10.1016/s1090-7807(02)00190-8. [DOI] [PubMed] [Google Scholar]
- 40.Pelupessy P., Ferrage F., Bodenhausen G. Accurate measurement of longitudinal cross-relaxation rates in nuclear magnetic resonance. J. Chem. Phys. 2007;126:134508. doi: 10.1063/1.2715583. [DOI] [PubMed] [Google Scholar]
- 41.Carlier L., Balayssac S., Lequin O. Investigation of homeodomain membrane translocation properties: insights from the structure determination of engrailed-2 homeodomain in aqueous and membrane-mimetic environments. Biophys. J. 2013;105:667–678. doi: 10.1016/j.bpj.2013.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wolfram Research . Wolfram Research; Champaign, IL: 2012. Wolfram Mathematica. [Google Scholar]
- 43.Marsh J.A., Singh V.K., Forman-Kay J.D. Sensitivity of secondary structure propensities to sequence differences between α- and γ-synuclein: implications for fibrillation. Protein Sci. 2006;15:2795–2804. doi: 10.1110/ps.062465306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Delaglio F., Grzesiek S., Bax A. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995;6:277–293. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
- 45.Dyson H.J., Wright P.E. Unfolded proteins and protein folding studied by NMR. Chem. Rev. 2004;104:3607–3622. doi: 10.1021/cr030403s. [DOI] [PubMed] [Google Scholar]
- 46.Mandel A.M., Akke M., Palmer A.G., 3rd Backbone dynamics of Escherichia coli ribonuclease HI: correlations with structure and function in an active enzyme. J. Mol. Biol. 1995;246:144–163. doi: 10.1006/jmbi.1994.0073. [DOI] [PubMed] [Google Scholar]
- 47.Cole R., Loria J.P. FAST-Modelfree: a program for rapid automated analysis of solution NMR spin-relaxation data. J. Biomol. NMR. 2003;26:203–213. doi: 10.1023/a:1023808801134. [DOI] [PubMed] [Google Scholar]
- 48.Fushman D., Cahill S., Cowburn D. The main-chain dynamics of the dynamin pleckstrin homology (PH) domain in solution: analysis of 15N relaxation with monomer/dimer equilibration. J. Mol. Biol. 1997;266:173–194. doi: 10.1006/jmbi.1996.0771. [DOI] [PubMed] [Google Scholar]
- 49.Dosset P., Hus J.C., Marion D. Efficient analysis of macromolecular rotational diffusion from heteronuclear relaxation data. J. Biomol. NMR. 2000;16:23–28. doi: 10.1023/a:1008305808620. [DOI] [PubMed] [Google Scholar]
- 50.Bieri M., d’Auvergne E.J., Gooley P.R. relaxGUI: a new software for fast and simple NMR relaxation data analysis and calculation of ps-ns and μs motion of proteins. J. Biomol. NMR. 2011;50:147–155. doi: 10.1007/s10858-011-9509-1. [DOI] [PubMed] [Google Scholar]
- 51.Farrow N.A., Zhang O., Kay L.E. Spectral density function mapping using 15N relaxation data exclusively. J. Biomol. NMR. 1995;6:153–162. doi: 10.1007/BF00211779. [DOI] [PubMed] [Google Scholar]
- 52.Ishima R., Nagayama K. Protein backbone dynamics revealed by quasi spectral density function analysis of amide N-15 nuclei. Biochemistry. 1995;34:3162–3171. doi: 10.1021/bi00010a005. [DOI] [PubMed] [Google Scholar]
- 53.Hill R.B., Bracken C., Palmer A.G. Molecular motions and protein folding: Characterization of the backbone dynamics and folding equilibrium of α2D using 13C NMR spin relaxation. J. Am. Chem. Soc. 2000;122:11610–11619. [Google Scholar]
- 54.Ropars V., Bouguet-Bonnet S., Roumestand C. Unraveling protein dynamics through fast spectral density mapping. J. Biomol. NMR. 2007;37:159–177. doi: 10.1007/s10858-006-9091-0. [DOI] [PubMed] [Google Scholar]
- 55.Kroenke C.D., Loria J.P., Palmer A.G., III Longitudinal and transverse 1H-15N dipolar/15N chemical shift anisotropy relaxation interference: unambiguous determination of rotational diffusion tensors and chemical exchange effects in biological macromolecules. J. Am. Chem. Soc. 1998;120:7905–7915. [Google Scholar]
- 56.LeMaster D.M. Larmor frequency selective model free analysis of protein NMR relaxation. J. Biomol. NMR. 1995;6:366–374. doi: 10.1007/BF00197636. [DOI] [PubMed] [Google Scholar]
- 57.Akaike, H. 1973. Information theory and an extension of the maximum likelihood principle. Proc. 2nd Int. Symp. Information Theory. B. N. Petrov, and F. Csáki, editors. Akadémiai Kiadó, Budapest, Hungary. 267–281.
- 58.Sugiura N. Further analysis of data by Akaike’s information criterion and finite corrections. Commun. Stat. Theory Methods. 1978;7:13–26. [Google Scholar]
- 59.Hurvich C.M., Tsai C.L. Regression and time-series model selection in small samples. Biometrika. 1989;76:297–307. [Google Scholar]
- 60.Burnham K.P., Anderson D.R. Springer; New York: 2002. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach. [Google Scholar]
- 61.Wong V., Case D.A., Szabo A. Influence of the coupling of interdomain and overall motions on NMR relaxation. Proc. Natl. Acad. Sci. USA. 2009;106:11016–11021. doi: 10.1073/pnas.0809994106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Prompers J.J., Brüschweiler R. General framework for studying the dynamics of folded and nonfolded proteins by NMR relaxation spectroscopy and MD simulation. J. Am. Chem. Soc. 2002;124:4522–4534. doi: 10.1021/ja012750u. [DOI] [PubMed] [Google Scholar]
- 63.Calandrini V., Abergel D., Kneller G.R. Fractional protein dynamics seen by nuclear magnetic resonance spectroscopy: relating molecular dynamics simulation and experiment. J. Chem. Phys. 2010;133:145101. doi: 10.1063/1.3486195. [DOI] [PubMed] [Google Scholar]
- 64.Ryabov Y.E., Fushman D. A model of interdomain mobility in a multidomain protein. J. Am. Chem. Soc. 2007;129:3315–3327. doi: 10.1021/ja067667r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Parigi G., Rezaei-Ghaleh N., Luchinat C. Long-range correlated dynamics in intrinsically disordered proteins. J. Am. Chem. Soc. 2014;136:16201–16209. doi: 10.1021/ja506820r. [DOI] [PubMed] [Google Scholar]
- 66.Hall J.B., Fushman D. Variability of the 15N chemical shielding tensors in the B3 domain of protein G from 15N relaxation measurements at several fields. Implications for backbone order parameters. J. Am. Chem. Soc. 2006;128:7855–7870. doi: 10.1021/ja060406x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Charlier C., Khan S.N., Ferrage F. Nanosecond time scale motions in proteins revealed by high-resolution NMR relaxometry. J. Am. Chem. Soc. 2013;135:18665–18672. doi: 10.1021/ja409820g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ozenne V., Bauer F., Blackledge M. Flexible-meccano: a tool for the generation of explicit ensemble descriptions of intrinsically disordered proteins and their associated experimental observables. Bioinformatics. 2012;28:1463–1470. doi: 10.1093/bioinformatics/bts172. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






