Abstract
Spin relaxation in solution-state NMR spectroscopy is a powerful approach to explore the conformational dynamics of biological macromolecules. Probability distribution functions for overall or internal correlation times have been used previously to model spectral density functions central to spin-relaxation theory. Applications to biological macromolecules rely on transverse relaxation rate constants, and when studying nanosecond timescale motions, sampling at ultralow frequencies is often necessary. Consequently, appropriate distribution functions necessitate spectral density functions that are accurate and convergent as frequencies approach zero. In this work, the inverse Gaussian probability distribution function is derived from general properties of spectral density functions at low and high frequencies for macromolecules in solution, using the principle of maximal entropy. This normalized distribution function is first used to calculate the correlation function, followed by the spectral density function. The resulting model-free spectral density functions are finite at a frequency of zero and can be used to describe distributions of either overall or internal correlation times using the model-free ansatz. To validate the approach, 15N spin-relaxation data for the bZip transcription factor domain of the Saccharomyces cerevisiae protein GCN4, in the absence of cognate DNA, were analyzed using the inverse Gaussian probability distribution for intramolecular correlation times. The results extend previous models for the conformational dynamics of the intrinsically disordered, DNA-binding region of the bZip transcription factor domain.
Introduction
Conformational dynamics of proteins and other biological macromolecules play substantial roles in functions such as allostery, catalysis, molecular recognition, regulation, and signaling (1, 2, 3). NMR spectroscopy is often employed to experimentally explore the dynamic and kinetic properties of biological macromolecules. Spin-relaxation experiments probe motions with atomic resolution over a broad range of timescales (4, 5). In addition, multiple methods and applications for 1H, 2H, 13C, and 15N nuclei in proteins and nucleic acids currently exist that make such explorations possible (5, 6, 7). As systems with increasingly complex dynamical properties and as experimental methods become more powerful, correspondingly, more sophisticated approaches are necessary for analysis of data and linkage to theoretical or computational motional models (7, 8, 9).
Probability distributions for correlation times have long been used to model autocorrelation functions, C(t), and spectral density functions, J(ω), for NMR spin relaxation experiments in solution, particularly in flexible polymers (10, 11, 12). As examples of applications to biological macromolecules, the Cole-Cole distribution was employed by Buevich and c-oworkers in an analysis of relaxation data for the highly disordered S-peptide (13), and the Mittag-Leffler distribution, arising from a fractional diffusion equation, was employed by Abergel and co-workers to model intramolecular motions in proteins (14, 15, 16, 17). However, many of the aforementioned distribution functions, such as the Fuoss-Kirkwood, Cole-Cole, and Mittag-Leffler models, yield spectral density functions that diverge as the frequency ω approaches zero (12). Such functions are not straightforward to apply to experimental data sets that include transverse (spin-spin) relaxation or other measurements, such as relaxometry (18), that depend on J(ω) at frequencies equal or near zero, unless another exponentially decaying process is assumed that reduces the associated autocorrelation function to zero sufficiently rapidly. For example, Abergel and co-workers were able to model intramolecular dynamics for folded proteins using the Mittag-Leffler distribution function because the truncation from overall rotational diffusion ensured convergence at J(0). In contrast, Buevich and co-workers, in a more disordered system, could not rely on overall rotational diffusion for truncation, and thus J(0) ≈ J(ω = 1 s−1) was chosen arbitrarily to avoid divergence.
Alternatively, autocorrelation functions can be modeled as sums of exponential decays, essentially modeling the probability distributions as sums of δ functions and yielding spectral density functions that are sums of Lorentzian terms. In this vein, Modig and Poulsen estimated the moments of the underlying distribution of correlation times after fitting a sum of exponential functions to experimental data (19), whereas Ferrage and co-workers fitted a sum of exponential functions, fixing the spacing of time constants to reduce the number of optimizable parameters (20). Increasingly accurate molecular dynamics simulations have also been used to develop and validate models for both distributions and sums of exponential processes (8, 15, 21, 22, 23, 24).
This work derives the inverse Gaussian probability distribution from general properties of spectral density functions at low and high frequencies for macromolecules, in conjunction with the principle of maximal entropy (25). Importantly, the maximal entropy approach constrains the spectral density function to be convergent at ω = 0, facilitating the application to biological macromolecular systems. Schrödinger originally derived the inverse Gaussian distribution to characterize first-passage times in Brownian motion (26). The theory and application of the inverse Gaussian distribution have been reviewed by Folks and Chhikara (27).
The inverse Gaussian spectral density formalism is applied to a set of backbone 15N relaxation data acquired for the bZip (DNA-binding) domain of the Saccharomyces cerevisiae protein GCN4 under the assumption that the inverse Gaussian probability distribution represents the distribution of intramolecular correlation times for the reorientation of the backbone amide N-H bond vector in a local molecular frame of reference. The yeast transcription factor GCN4 (28), which has homologs in mammals, is an example of a protein with extensive intrinsically disordered regions (IDRs) under certain conditions (29, 30, 31). The bZip domain, when attached to DNA substrate, contains a stable, highly basic, N-terminal helical region that inserts into the DNA major groove and a C-terminal region that forms a leucine zipper dimer (32, 33). In the absence of DNA, the N-terminal region becomes partially disordered with significant residual helicity, whereas the C-terminal leucine zipper remains ordered (34, 35, 36, 37). Results obtained by the application of the inverse Gaussian probability distribution provide additional insights into the flexibility of the IDR in GCN4.
Methods
The autocorrelation function for a continuous distribution of exponential terms is given by (38, 39)
| (1) |
in which p(τ) is the probability density function for correlation times τ. The spectral density function important in NMR spin relaxation theory is given by (38, 39)
| (2) |
The second step is obtained by exchanging the order of integration. When ωτ ≪ 1,
| (3) |
and when ωτ ≫ 1,
| (4) |
These two limits are encountered most frequently in spin-relaxation measurements of biological macromolecules performed at high static magnetic fields. For example, investigations of 15N relaxation in amide moieties in proteins or imino moieties in nucleic acids include measurements of the 15N R1 and R2 autorelaxation rate constants and the 1H-15N heteronuclear cross-relaxation rate constant σNH. These relaxation rate constants depend on the 1H-15N dipole-dipole interaction with directly attached 1H spins and on the 15N chemical shift anisotropy. They are also dependent on the values of the spectral density function at frequencies ω = 0, ωN, ωH, and ωH ± ωN (40, 41, 42). Consequently, ωNτ > 1 and ωHτ > 1 for τ > 0.26 ns at a static magnetic field of 14.1 T (600 MHz).
The original model-free formalism of Lipari and Szabo parameterizes the model spectral density function using the mean correlation time <τ> (38, 39). This choice gives a one-parameter exponential Padé approximation of the correlation function (43, 44). The above considerations suggest that a two-parameter approximation should be based on both <τ> and <1/τ>. This idea can be made more rigorous by defining p(τ) using the maximal entropy principle with the constraints <τ> and <1/τ> codified as Lagrange multipliers, the result of which gives (25)
| (5) |
c, λ1, and λ2 are constants determined by the following requirements:
| (6) |
The family of generalized inverse Gaussian probability distributions satisfies the functional normalized form of Eq. 5:
| (7) |
in which λ1 = −λ/(2μ2), λ2 = −λ/2, which can be identified as the coefficients of τ and 1/τ in the exponential function, and Kn(x) is the modified Bessel function of the second kind (Basset or Macdonald function) (45, 46). The prefactor to the exponential arises from the constraint of a normalized distribution. These distributions have similarly skewed shapes in which μ is a location parameter and λ is a scale parameter. The choice of n = −1/2 is made herein for three reasons: 1) in this case, μ = <τ> and is thus independent of the scale parameter, 2) the distribution decays more rapidly at large values of τ rather than other simple choices such as n = 0 or 1, and 3) the resulting distribution function is more tractable mathematically because K−1/2(x) = K1/2(x) = (π/(2x))1/2e−x. The normalized probability distribution resulting from evaluating Eq. 7 for when n = −1/2 is called the inverse Gaussian probability density function:
| (8) |
in which, once again, μ = <τ> is the mean correlation time and the variance of the distribution is σ2 = μ3/λ. Note that <1/τ> <τ> = <μ/τ> = 1 + μ/λ = 1 + σ2/μ2 ≥ 1 with equality being approached as σ → 0, and the distribution becomes a δ function centered around μ (yielding a single exponential correlation function). Thus, <1/τ> <τ> > 1 indicates that the correlation function is multiexponential and is used in some cases herein as a convenient representation of the breadth of the distribution (rather than using σ itself). The mode of the distribution is given by
| (9) |
and approaches μ as <1/τ> <τ> → 1 (or σ/μ → 0) or 0 as <1/τ> <τ> → ∞ (or σ/μ → ∞). Examples of the inverse Gaussian probability distribution are shown in Fig. 1 a.
Figure 1.
Inverse Gaussian. For all figures, the lines are all dependent on the parameter <1/τ><τ>, which is equal to 5 (dashed, purple), 2 (dashed-dotted, green), and 1.1 (dotted, orange). (a) Normalized inverse Gaussian probability distribution functions p(τ). (b) Normalized correlation functions CIG(τ). The black line gives the single exponential correlation function (the limiting case as <1/τ> <τ> → 1). (c) Spectral density functions JIG(ω). The black line gives the single Lorentzian spectral density function (the limiting case as <1/τ><τ> → 1). To see this figure in color, go online.
The autocorrelation function is obtained by integrating over the probability distribution to yield
| (10) |
in which X = (1 + 2t/λ)1/2 − 1 = (1 + 2tσ2/μ3)1/2 − 1. Additionally, the integrated area of CIG(t) is μ, and the first derivative of CIG(t) at t = 0 is <1/τ>. The spectral density function is given by the real part of the Fourier transform of the autocorrelation function:
| (11) |
in which
| (12) |
The Faddeeva function is w(Z) = exp(−Z2)erfc(−iZ) (47), and erfc(x) is the complementary error function. Examples of the autocorrelation and spectral density functions obtained from Eqs. 10 to 11 are shown in Fig. 1, b and c, respectively.
The model-free ansatz gives C(t) = CO(t){S2 + (1 − S2)CI(t)} as the total autocorrelation function for a macromolecule, in which S2 is the square of the generalized order parameter from Lipari and Szabo and Co(t) and CI(t) are the autocorrelation functions for overall and intramolecular motions, respectively (38, 39). If the inverse Gaussian correlation and spectral density functions of Eqs. 10 and 11 represent the overall rotational dynamics of a nuclear spin in a molecule and the intramolecular motions, assumed to be independent of overall motions, are described by a single exponential decay with an effective correlation time τe, then the spectral density function becomes
| (13) |
The second term of Eq. 13 is derived by extending the Fourier transform into the complex domain and is an efficient way of representing the Fourier transform of CIG(t)exp(−t/τe). Alternatively, if the correlation and spectral density functions of Eqs. 10 and 11 represent the internal dynamics of a nuclear spin in a biomolecule with an exponential overall rotational diffusion correlation function of correlation time τm, then the spectral density function analogously becomes
| (14) |
In this case, the second term is the Fourier transform of CIG(t)exp(−t/τm). In summary, Eqs. 10, 11, 13, and 14 are the main theoretical results of this work.
For comparison to Eq. 14, the original Lipari-Szabo model-free formalism is given by (38, 39)
| (15) |
in which 1/τ = 1/τm + 1/τe and τe is an effective internal correlation time. τm is defined as above. The extended model-free formalism is given by (38, 39)
| (16) |
in which 1/τ = 1/τm + 1/τs, 1/τ′ = 1/τm + 1/τf, τf and τs are internal correlation times for fast and slow timescales, respectively, and Sf2 is the square of the generalized order parameter for motions on the fast timescale.
Results
The above formalism was applied to a set of backbone 15N relaxation data acquired for the bZip domain of the yeast transcription factor GCN4. These data were acquired at four static magnetic fields, corresponding to 1H Larmor frequencies 600, 700, 800, and 900 MHz (29). The data were processed by standard methods to give reduced spectral density values. Fig. 2 a shows data for J(0), and Fig. 2 b shows data for J(0.87ωH) with 0.87ωH/2π = 522 and 783 MHz, referring to spectral densities at the lowest and highest frequencies sampled; these illustrate the regions of the bZip domain affected by slower and faster motions, respectively. In the original report by Gill and co-workers (29), the resulting spectral density data were analyzed using the model-free (two-exponential, Eq. 15) and extended model-free (three-exponential, Eq. 16) formalisms. The fitted parameters for the residues in the coiled coil already incorporate the effects of rotational diffusion anisotropy through the assumption of a local isotropic τm. The previous analysis found the diffusion tensor anisotropy to be modest and thus unlikely to affect the results in this scenario (48).
Figure 2.
Spectral density values at (a) ω = 0 and (b) ω = 0.87ωH/(2π) = 522 (open circles) and 783 (solid circles) MHz frequencies, obtained from results reported by Gill et al. (29). Error bars represent the standard deviation in the experimental data values.
The spectral density data for residues 5–28 in the N-terminal basic region and the first turn of the coiled-coil α-helix and residues 56–58 in the C-terminus of the bZip domain of GCN4 were analyzed using Eq. 14. Representative sample fits of Eq. 14 to the experimental data are illustrated in Fig. 3 for amino acid residues 11 and 22. The resulting probability distributions p(τ) are also shown in Fig. 3. For these mobile residues, the simple model-free formalism of Eq. 15 fits the data poorly, and thus more complex motional models such as Eq. 14 are necessary. The qualities of fits to Eq. 14, as measured by the reduced χ2 or Akaike information criterion statistics (49), have a median improvement of 6.9% when compared to the extended three-exponential model-free formalism, even though Eq. 14 has four adjustable parameters and the extended model-free formalism has five.
Figure 3.
Relaxation data for residues 11 (a–c) and 22 (d–f) of the bZip domain of GCN4. (a, b, d, and e) The circles are the experimental data from Gill and co-workers (29). The green solid line is the fitted result using Eq. 14, the orange dotted line is the result when fitted with the extended model-free (three-exponential) formalism (Eq. 16), and the purple dashed line is the result when fitted with the model-free formalism (Eq. 15). (b) and (e) show the zoomed-in views of the low field regime of the data shown in (a) and (d). (c and f) The probability distributions resulting from the fitted data in (a) and (c) are shown. The vertical green dashed lines indicate the values of μ. For residue 11, fitted values using Eq. 14 are S2 = 0.46 ± 0.04, μ = 0.89 ± 0.01 ns, <τ><1/τ> = 1.9 ± 0.5, and τm = 8.9 ± 0.9 ns. For residue 22, fitted values are S2 = 0.75 ± 0.02, μ = 0.92 ± 0.02 ns, <τ><1/τ> = 2.3 ± 0.6, and τm = 14.5 ± 0.6 ns. Data analyses were performed using an in-house software written in Python; optimizations of model parameters were performed using the “curve_fit” algorithm and implemented using the trust region reflective method (63) for minimization of χ2 and a three-point numerical method for the calculation of the Jacobian matrix. Uncertainties in fitted parameters were obtained from Monte Carlo simulations. To see this figure in color, go online.
Fitted values of S2, τm, μ, and <τ><1/τ> for the bZip DNA-binding domain of GCN4 are shown in Fig. 4. As noted above, data for only residues 5–28 and 56–58 were fitted with the inverse Gaussian spectral density function, Eq. 14. Data for the highly ordered coiled-coil residues 29–55 were fitted with the Lipari-Szabo model-free formalism, Eq. 15.
Figure 4.
Distribution function values of (a) μ, (b) <τ><1/τ>, (c) S2, and (d) local τm for the bZip domain of GCN4 fitted to Eq. 14 or the model-free formalism (Eq. 15), depending on the particular region. Regions of the protein are colored identically to Gill et al. (29): region 1 of bZip (residues 3–12), pink diamonds; region 2 of bZip (residues 13–25), green squares; coiled coil (residues 26–55), black circles; disordered C-terminus (residues 56–58), orange triangles. Eq. 15 was used to fit data for residues 29–55 and is the limit of Eq. 14 as <τ><1/τ> → 1, meaning that μ = τe. In (c), the solid line drawn between residues 5 and 15 is calculated using Eq. 18 with <S2> = 0.93, S02 = 0.665, and N0 = 15, and the solid line between residues 56 and 58 is calculated using Eq. 18 with <S2> = 0.76, S02 = 0.86, and N0 = 56. Uncertainties in fitted parameters were obtained from Monte Carlo simulations. To see this figure in color, go online.
Discussion
NMR spin-relaxation methods are powerful approaches for characterizing intramolecular dynamics because relaxation rate constants are functions of the spectral density function, J(ω), for stochastic processes that modulate nuclear spin Hamiltonians. Model spectral density functions based on finite sums of terms or on continuous distribution functions have been applied to various investigations of biological macromolecules. However, historically, the most commonly used distribution functions diverge as ω approaches zero, a disadvantage because J(0) is critical for interpreting various relaxation measurements performed on biomolecules. This work derives a functional form of the spectral density function based on the inverse Gaussian probability distribution. The resulting spectral density function is convergent as ω approaches zero and can represent the overall rotational dynamics or intramolecular stochastic dynamics of a nuclear spin interaction using the model-free ansatz.
To demonstrate the application of this theoretical approach, backbone 15N relaxation data for the bZip domain of GCN4 were fitted using the latter assumption, Eq. 14. The results were compared to the two-exponential (Eq. 15) and three-exponential (Eq. 16) model-free formalisms. Specifically, the inverse Gaussian model-free formalism proved advantageous in fitting the partially disordered, N-terminal basic-region domain of GCN4. The qualities of fits, as measured by the reduced χ2 or Akaike information criterion statistics (49), were comparable to or better than for the extended model-free formalism. The smaller number of free parameters in the inverse Gaussian spectral density function (four) compared to the three-exponential model-free formalism (five) is advantageous particularly because the number of relaxation rate constants measured per spin is frequently small, limiting statistical power. In addition, the inverse Gaussian spectral density function is based on fundamental aspects of spectral density functions for overdamped processes as distinguished from other distributions, such as the Mittag-Leffler function, derived for specific physical models for molecular motions.
As discussed earlier, the extended three-exponential model-free formalism can be regarded as a δ function representation of a continuous probability distribution. In support of this interpretation, the values of τs and τf obtained using Eq. 16 are often similar in magnitude to the mean and mode, respectively, of the fitted inverse Gaussian probability distribution (data not shown). The optimal choice of a given continuous probability distribution or a multiexponential model in the interpretation of NMR spin-relaxation data cannot be established from fitting the available data alone. The model independence and the small number of adjustable parameters of the inverse Gaussian spectral density function are attractive features, but the approach or combination of approaches that will ultimately prove optimal is currently unknown (19). Additional experimental data over a broader frequency range (from relaxometry) and advances in computer simulations are likely to be helpful in addressing these issues.
The intramolecular dynamics of the bZip domain and other IDRs have been discussed extensively elsewhere (9, 29, 34, 35, 50, 51, 52, 53, 54, 55, 56). The following discussion focuses on the results for the residues analyzed with the inverse Gaussian spectral density function: residues 5–28 and 56–58. The values of the parameters in common with the extended model-free spectral density function, S2 and τm, are similar, with correlation coefficients R = 0.96 and 0.95, respectively. The inverse Gaussian spectral density function additionally characterizes the timescales of intramolecular motions by the parameters μ and <τ><1/τ>. The average values of these parameters are relatively constant for residues 5–23 in the basic region, where μ = 0.916 ± 0.010 ns and <τ><1/τ> = 1.99 ± 0.33. The average most probable value is τmode = 0.25 ± 0.02 ns.
The transformational properties of probability distributions can be used to convert p(τ) (Eq. 8) into a distribution for the parameters of a physical model via τ. For example, if τ = τ0exp[Ea/(kBT)], then
| (17) |
in which kB is the Boltzmann constant and T is the temperature. The resulting distribution for the average value of <τ><1/τ> = 1.99 is shown in Fig. 5. The mean, mode, and SD of the distribution are −0.46 kBT, −0.64 kBT, and +0.94 kBT, respectively. The distribution is slightly skewed to the right when compared to a Gaussian distribution with the same mean and SD.
Figure 5.
Distribution of activation energies (solid) for <τ><1/τ> = 1.99 compared to a (dashed) Gaussian distribution with mean = −0.46 kBT and SD of 0.94 kBT.
As discussed previously, residues ∼13–19 in the basic region substantially populate transient helical conformations, resulting in a break in the general decrease in S2 toward the N-terminus from the well-ordered, coiled-coil, leucine zipper region (9). In Fig. 4, the most N-terminal region (pink) and the C-terminal region (yellow) display relatively smooth profiles for S2. As shown by the solid black lines in Fig. 4, the order parameters for these residues can be fitted with a simple model:
| (18) |
in which N is a given residue number and <S2> is a mean multiplicative decrease in S2 per residue relative to a reference residue N0 with order parameter S02. A similar approach has been used by Schneider, Dellwo, and Wand to characterize the C-terminal region of ubiquitin (57).
The above results provide an enhanced description of the intramolecular dynamics of the bZip domain in the absence of DNA, as cartooned in Fig. 6. The coiled-coil region is characterized as a relatively rigid rod, with an average S2 = 0.92 ± 0.01, indicating restricted local motions. The most N-terminal residues of the coiled coil exhibit reduced order parameters, which appears to suggest fraying, followed by a significant drop in the order parameters occurring between residues 28 and 20. The order parameters for residues 14–19 are relatively uniform, consistent with previously identified, transient, helical conformations. Another sharp drop in S2 at residue 13 leads to the next set of residues in the most N-terminal region, residues 5–13. In this portion of the molecule, the fit to Eq. 18 suggests that individual peptide units have limited local motions on timescales smaller than 5 ns, but the cumulative effect from residue to residue results in a decrease in S2 toward the N-terminus. Local interactions must still restrict these motions because the mean reduction in S2 per residue is smaller than observed for the more disordered C-terminal residues. This model suggests that each peptide unit constitutes a local persistence length, like the Kuhn length in polymers, with restricted multidirectional joints in between each unit (58, 59, 60). In this region, compared to the values observed in the coiled-coil domain, the reduced values of the local, effective, overall correlation time, τm, may reflect segmental motions on timescales similar to overall rotation. Thus, in these data, the effects of segmental motions are not distinguishable from the effects of overall tumbling. The spatial extents and timescales of intramolecular and overall motions in IDRs have recently been discussed by Blackledge and co-workers based on extensive spin-relaxation (61) and molecular dynamics simulations (8, 62). Our results for the IDR of the bZip domain of GCN4 share qualitative features with this earlier work. In addition, they also agree with previous investigations suggesting that partially ordered bZip conformations of GCN4 form initial-encounter complexes with DNA and then rapidly rearrange to the high-affinity state with fully formed, basic-region recognition helices (9, 29). The restriction of conformational space in the basic region for the apo bZip domain then reduces the entropic penalty of binding to target DNA molecules (34).
Figure 6.
A cartoon of the bZip domain of GCN4 in which the ribbons are color coded as in Fig. 4. This cartoon illustrates a model of the bZip domain of GCN4 with the DNA substrate removed after some time (simulated). The coiled-coil region is modeled as a pair of rigid rods, and the bZip region is modeled by multiple rigid rods of some persistence length unit (in this cartoon, the persistence length is one residue), all connected by multidirectional joints with restricted flexibility.
Conclusion
In summary, using the principle of maximal entropy, we have obtained the inverse Gaussian probability density as a distribution function for correlation times that mimic the low and high frequency behavior of the spectral density function, central to the theory of NMR spin relaxation. The resulting spectral density function is well behaved as the frequency approaches zero and can be used in analyzing the relaxation rate constants that depend on J(0), such as the transverse relaxation rate R2, as well as relaxation at very low magnetic fields. The inverse Gaussian probability distribution function and its associated spectral density function are particularly suited for the analysis of highly flexible loops or IDRs of proteins, such as the basic region of the bZip transcription factor domain of GCN4.
Author Contributions
A.H., F.F., and A.G.P. performed theoretical calculations and wrote the manuscript. A.H. and A.G.P performed numerical calculations and data fitting. A.G.P. and F.F. designed the research project.
Acknowledgments
We would like to thank Jeffrey Skolnick for helpful discussions and Mark Rance for his insights in the preparation of this manuscript.
This research was funded by the National Institutes of Health through grants R01 GM050291 (A.G.P.) and T32 GM008281-28 (A.H.) and by the European Research Council, ERC 2F4BIODYN (Grant 279519) (F.F.). Some of the work presented here was conducted at the Center on Macromolecular Dynamics by NMR Spectroscopy located at the New York Structural Biology Center, supported by a grant from the National Institutes of Health National Institute of General Medical Sciences (P41 GM118302). A.G.P. is a member of the New York Structural Biology Center. A.G.P. is grateful for a Visiting Professorship with F.F. at ENS.
Editor: Michael Sattler.
References
- 1.Nussinov R., Jang H., Zhang J. Intrinsic protein disorder in oncogenic KRAS signaling. Cell. Mol. Life Sci. 2017;74:3245–3261. doi: 10.1007/s00018-017-2564-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Csizmok V., Follis A.V., Forman-Kay J.D. Dynamic protein interaction networks and new structural paradigms in signaling. Chem. Rev. 2016;116:6424–6462. doi: 10.1021/acs.chemrev.5b00548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Dethoff E.A., Chugh J., Al-Hashimi H.M. Functional complexity and regulation through RNA dynamics. Nature. 2012;482:322–330. doi: 10.1038/nature10885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Palmer A.G., III NMR characterization of the dynamics of biomacromolecules. Chem. Rev. 2004;104:3623–3640. doi: 10.1021/cr030413t. [DOI] [PubMed] [Google Scholar]
- 5.Kay L.E. New views of functionally dynamic proteins by solution NMR spectroscopy. J. Mol. Biol. 2016;428:323–331. doi: 10.1016/j.jmb.2015.11.028. [DOI] [PubMed] [Google Scholar]
- 6.Narayanan C., Bafna K., Doucet N. Applications of NMR and computational methodologies to study protein dynamics. Arch. Biochem. Biophys. 2017;628:71–80. doi: 10.1016/j.abb.2017.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Charlier C., Cousin S.F., Ferrage F. Protein dynamics from nuclear magnetic relaxation. Chem. Soc. Rev. 2016;45:2410–2422. doi: 10.1039/c5cs00832h. [DOI] [PubMed] [Google Scholar]
- 8.Salvi N., Abyzov A., Blackledge M. Multi-timescale dynamics in intrinsically disordered proteins from NMR relaxation and molecular simulation. J. Phys. Chem. Lett. 2016;7:2483–2489. doi: 10.1021/acs.jpclett.6b00885. [DOI] [PubMed] [Google Scholar]
- 9.Robustelli P., Trbovic N., Palmer A.G., III Conformational dynamics of the partially disordered yeast transcription factor GCN4. J. Chem. Theory Comput. 2013;9:5190–5200. doi: 10.1021/ct400654r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Dais P., Spyros A. C-13 nuclear magnetic relaxation and local dynamics of synthetic polymers in dilute solution and in the bulk state. Prog. Nucl. Magn. Reson. Spectrosc. 1995;27:555–633. [Google Scholar]
- 11.Konrat R. NMR contributions to structural dynamics studies of intrinsically disordered proteins. J. Magn. Reson. 2014;241:74–85. doi: 10.1016/j.jmr.2013.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Beckmann P.A. Spectral densities and nuclear-spin relaxation in solids. Phys. Rep. 1988;171:85–128. [Google Scholar]
- 13.Buevich A.V., Shinde U.P., Baum J. Backbone dynamics of the natively unfolded pro-peptide of subtilisin by heteronuclear NMR relaxation studies. J. Biomol. NMR. 2001;20:233–249. doi: 10.1023/a:1011243116136. [DOI] [PubMed] [Google Scholar]
- 14.Abergel D., Volpato A., Polimeno A. On the reliability of NMR relaxation data analyses: a Markov chain Monte Carlo approach. J. Magn. Reson. 2014;246:94–103. doi: 10.1016/j.jmr.2014.07.007. [DOI] [PubMed] [Google Scholar]
- 15.Calandrini V., Abergel D., Kneller G.R. Fractional protein dynamics seen by nuclear magnetic resonance spectroscopy: relating molecular dynamics simulation and experiment. J. Chem. Phys. 2010;133:145101. doi: 10.1063/1.3486195. [DOI] [PubMed] [Google Scholar]
- 16.Calligari P., Abergel D. Multiple scale dynamics in proteins probed at multiple time scales through fluctuations of NMR chemical shifts. J. Phys. Chem. B. 2014;118:3823–3831. doi: 10.1021/jp412125d. [DOI] [PubMed] [Google Scholar]
- 17.Calandrini V., Abergel D., Kneller G.R. Protein dynamics from a NMR perspective: networks of coupled rotators and fractional Brownian dynamics. J. Chem. Phys. 2008;128:145102. doi: 10.1063/1.2894844. [DOI] [PubMed] [Google Scholar]
- 18.Charlier C., Khan S.N., Ferrage F. Nanosecond time scale motions in proteins revealed by high-resolution NMR relaxometry. J. Am. Chem. Soc. 2013;135:18665–18672. doi: 10.1021/ja409820g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Modig K., Poulsen F.M. Model-independent interpretation of NMR relaxation data for unfolded proteins: the acid-denatured state of ACBP. J. Biomol. NMR. 2008;42:163–177. doi: 10.1007/s10858-008-9280-0. [DOI] [PubMed] [Google Scholar]
- 20.Khan S.N., Charlier C., Ferrage F. Distribution of pico- and nanosecond motions in disordered proteins from nuclear spin relaxation. Biophys. J. 2015;109:988–999. doi: 10.1016/j.bpj.2015.06.069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Calligari P., Calandrini V., Abergel D. From NMR relaxation to fractional Brownian dynamics in proteins: results from a virtual experiment. J. Phys. Chem. B. 2011;115:12370–12379. doi: 10.1021/jp205380f. [DOI] [PubMed] [Google Scholar]
- 22.Case D.A., Scheurer C., Bruschweiler R. Static and dynamic effects on vicinal scalar J couplings in proteins and peptides: a MD/DFT analysis. J. Am. Chem. Soc. 2000;122:10390–10397. [Google Scholar]
- 23.Bernetti M., Masetti M., Cavalli A. Structural and kinetic characterization of the intrinsically disordered protein SeV NTAIL through enhanced sampling simulations. J. Phys. Chem. B. 2017;121:9572–9582. doi: 10.1021/acs.jpcb.7b08925. [DOI] [PubMed] [Google Scholar]
- 24.Mollica L., Bessa L.M., Schneider R. Binding mechanisms of intrinsically disordered proteins: theory, simulation, and experiment. Front. Mol. Biosci. 2016;3:52. doi: 10.3389/fmolb.2016.00052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cover T.M., Thomas J.A. Wiley; New York: 1991. Elements of Information Theory. [Google Scholar]
- 26.Schrodinger E. The theory of drop and rise tests on Brownian motion particles - by Erwin Schrodinger - I Introduction. Phys. Z. 1915;16:289–295. [Google Scholar]
- 27.Folks J.L., Chhikara R.S. Inverse Gaussian distribution and its statistical application - review. J. Roy. Stat. Soc. B Met. 1978;40:263–289. [Google Scholar]
- 28.Miller M. The importance of being flexible: the case of basic region leucine zipper transcriptional regulators. Curr. Protein Pept. Sci. 2009;10:244–269. doi: 10.2174/138920309788452164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Gill M.L., Byrd R.A., Palmer A.G., III Dynamics of GCN4 facilitate DNA interaction: a model-free analysis of an intrinsically disordered region. Phys. Chem. Chem. Phys. 2016;18:5839–5849. doi: 10.1039/c5cp06197k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Dunker A.K., Brown C.J., Obradović Z. Intrinsic disorder and protein function. Biochemistry. 2002;41:6573–6582. doi: 10.1021/bi012159+. [DOI] [PubMed] [Google Scholar]
- 31.Wright P.E., Dyson H.J. Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J. Mol. Biol. 1999;293:321–331. doi: 10.1006/jmbi.1999.3110. [DOI] [PubMed] [Google Scholar]
- 32.Ellenberger T.E., Brandl C.J., Harrison S.C. The GCN4 basic region leucine zipper binds DNA as a dimer of uninterrupted alpha helices: crystal structure of the protein-DNA complex. Cell. 1992;71:1223–1237. doi: 10.1016/s0092-8674(05)80070-4. [DOI] [PubMed] [Google Scholar]
- 33.Saudek V., Pastore A., Gibson T. The solution structure of a leucine-zipper motif peptide. Protein Eng. 1991;4:519–529. doi: 10.1093/protein/4.5.519. [DOI] [PubMed] [Google Scholar]
- 34.Bracken C., Carr P.A., Palmer A.G., III Temperature dependence of intramolecular dynamics of the basic leucine zipper of GCN4: implications for the entropy of association with DNA. J. Mol. Biol. 1999;285:2133–2146. doi: 10.1006/jmbi.1998.2429. [DOI] [PubMed] [Google Scholar]
- 35.Weiss M.A., Ellenberger T., Struhl K. Folding transition in the DNA-binding domain of GCN4 on specific binding to DNA. Nature. 1990;347:575–578. doi: 10.1038/347575a0. [DOI] [PubMed] [Google Scholar]
- 36.Saudek V., Pastore A., Roesch P. Solution structure of the DNA-binding domain of the yeast transcriptional activator protein GCN4. Protein Eng. 1990;4:3–10. doi: 10.1093/protein/4.1.3. [DOI] [PubMed] [Google Scholar]
- 37.Saudek V., Pasley H.S., Pastore A. Solution structure of the basic region from the transcriptional activator GCN4. Biochemistry. 1991;30:1310–1317. doi: 10.1021/bi00219a022. [DOI] [PubMed] [Google Scholar]
- 38.Lipari G., Szabo A. Model-free approach to the interpretation of nuclear magnetic-resonance relaxation in macromolecules.1. Theory and range of validity. J. Am. Chem. Soc. 1982;104:4546–4559. [Google Scholar]
- 39.Lipari G., Szabo A. Model-free approach to the interpretation of nuclear magnetic-resonance relaxation in macromolecules.2. Analysis of experimental results. J. Am. Chem. Soc. 1982;104:4559–4570. [Google Scholar]
- 40.Grishaev A., Yao L., Bax A. Chemical shift anisotropy of imino 15N nuclei in Watson-Crick base pairs from magic angle spinning liquid crystal NMR and nuclear spin relaxation. J. Am. Chem. Soc. 2009;131:9490–9491. doi: 10.1021/ja903244s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Esadze A., Li D.W., Iwahara J. Dynamics of lysine side-chain amino groups in a protein studied by heteronuclear 1H−15N NMR spectroscopy. J. Am. Chem. Soc. 2011;133:909–919. doi: 10.1021/ja107847d. [DOI] [PubMed] [Google Scholar]
- 42.Ishima R., Wingfield P.T., Torchia D.A. Using amide H-1 and N-15 transverse relaxation to detect millisecond time-scale motions in perdeuterated proteins: application to HIV-1 protease. J. Am. Chem. Soc. 1998;120:10534–10542. [Google Scholar]
- 43.Butcher J.C., Chipman F.H. Generalized pade approximations to the exponential function. BIT. 1992;32:118–130. [Google Scholar]
- 44.Iserles A. Generalized pade approximations to the exponential function. SIAM J. Numer. Anal. 1979;16:631–636. [Google Scholar]
- 45.Spanier J., Oldham K.B. Hemisphere Publishing Corp.; Washington, D.C: 1987. An Atlas of Functions. [Google Scholar]
- 46.Samko S.G., Kilbas A.A., Marichev O.I. Gordon and Breach Science Publishers; Philadelphia, PA: 1993. Fractional Integrals and Derivatives: Theory and Applications. [Google Scholar]
- 47.Poppe G.P.M., Wijers C.M.J. More efficient computation of the complex error function. Acm. Trans. Math. Software. 1990;16:38–46. [Google Scholar]
- 48.Gill M.L., Palmer A.G., III Local isotropic diffusion approximation for coupled internal and overall molecular motions in NMR spin relaxation. J. Phys. Chem. B. 2014;118:11120–11128. doi: 10.1021/jp506580c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Akaike H. A new look at the statistical-model identification. IEEE Trans. Automat. Contr. 1974;19:716–723. [Google Scholar]
- 50.Hollenbeck J.J., McClain D.L., Oakley M.G. The role of helix stabilizing residues in GCN4 basic region folding and DNA binding. Protein Sci. 2002;11:2740–2747. doi: 10.1110/ps.0211102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Xue Y., Skrynnikov N.R. Motion of a disordered polypeptide chain as studied by paramagnetic relaxation enhancements, 15N relaxation, and molecular dynamics simulations: how fast is segmental diffusion in denatured ubiquitin? J. Am. Chem. Soc. 2011;133:14614–14628. doi: 10.1021/ja201605c. [DOI] [PubMed] [Google Scholar]
- 52.Alberty R.A., Hammes G.G. Application of the theory of diffusion-controlled reactions to enzyme kinetics. J. Phys. Chem. 1958;62:154–159. [Google Scholar]
- 53.Kohler J.J., Metallo S.J., Schepartz A. DNA specificity enhanced by sequential binding of protein monomers. Proc. Natl. Acad. Sci. USA. 1999;96:11735–11739. doi: 10.1073/pnas.96.21.11735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Halford S.E., Marko J.F. How do site-specific DNA-binding proteins find their targets? Nucleic Acids Res. 2004;32:3040–3052. doi: 10.1093/nar/gkh624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Okahata Y., Niikura K., Morii T. Kinetic studies of sequence-specific binding of GCN4-bZIP peptides to DNA strands immobilized on a 27-MHz quartz-crystal microbalance. Biochemistry. 1998;37:5666–5672. doi: 10.1021/bi980037k. [DOI] [PubMed] [Google Scholar]
- 56.Kohler J.J., Schepartz A. Kinetic studies of Fos.Jun.DNA complex formation: DNA binding prior to dimerization. Biochemistry. 2001;40:130–142. doi: 10.1021/bi001881p. [DOI] [PubMed] [Google Scholar]
- 57.Schneider D.M., Dellwo M.J., Wand A.J. Fast internal main-chain dynamics of human ubiquitin. Biochemistry. 1992;31:3645–3652. doi: 10.1021/bi00129a013. [DOI] [PubMed] [Google Scholar]
- 58.Flory P.J. Cornell University Press; Ithaca, NY: 1953. Principles of Polymer Chemistry. [Google Scholar]
- 59.Flory P.J. Interscience Publishers; New York: 1969. Statistical Mechanics of Chain Molecules. [Google Scholar]
- 60.Rubinstein M., Colby R.H. Oxford University Press; New York: 2003. Polymer Physics. [Google Scholar]
- 61.Abyzov A., Salvi N., Blackledge M. Identification of dynamic modes in an intrinsically disordered protein using temperature-dependent NMR relaxation. J. Am. Chem. Soc. 2016;138:6240–6251. doi: 10.1021/jacs.6b02424. [DOI] [PubMed] [Google Scholar]
- 62.Salvi N., Abyzov A., Blackledge M. Analytical description of NMR relaxation highlights correlated dynamics in intrinsically disordered proteins. Angew. Chem. Int. Ed. Engl. 2017;56:14020–14024. doi: 10.1002/anie.201706740. [DOI] [PubMed] [Google Scholar]
- 63.Coleman T.F., Li Y.Y. An interior trust region approach for nonlinear minimization subject to bounds. Siam. J. Optimiz. 1996;6:418–445. [Google Scholar]






