Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2010 Jan 12.
Published in final edited form as: J Phys Chem B. 2008 Mar 1;112(19):6155–6158. doi: 10.1021/jp077018h

Microsecond molecular dynamics simulation shows effect of slow loop dynamics on backbone amide order parameters of proteins

Paul Maragakis , Kresten Lindorff-Larsen †,, Michael P Eastwood , Ron O Dror , John L Klepeis , Isaiah T Arkin †,§, Morten Ø Jensen , Huafeng Xu , Nikola Trbovic **, Richard A Friesner ††, Arthur G Palmer III **, David E Shaw †,‡‡,*
PMCID: PMC2805408  NIHMSID: NIHMS145982  PMID: 18311962

Abstract

A molecular-level understanding of the function of a protein requires knowledge of both its structural and dynamical properties. NMR spectroscopy allows the measurement of generalized order parameters that provide an atomistic description of picosecond and nanosecond fluctuations in protein structure. Molecular dynamics (MD) simulation provides a complementary approach to the study of protein dynamics on similar timescales. Comparisons between NMR spectroscopy and MD simulations can be used to interpret experimental results and to improve the quality of simulation-related force fields and integration methods. However, apparent systematic discrepancies between order parameters extracted from simulations and experiments are common, particularly for elements of non-canonical secondary structure. In this paper, results from a 1.2 µs explicit solvent MD simulation of the protein ubiquitin are compared with previously determined backbone order parameters derived from NMR relaxation experiments [Tjandra, N.; Feller, S. E.; Pastor, R. W.; Bax, A. J. Am. Chem. Soc. 1995, 117, 12562–12566]. The simulation reveals fluctuations in three loop regions that occur on timescales comparable to or longer than that of the overall rotational diffusion of ubiquitin and whose effects would not be apparent in experimentally derived order parameters. A coupled analysis of internal and overall motion yields simulated order parameters substantially closer to the experimentally determined values than is the case for a conventional analysis of internal motion alone. Improved agreement between simulation and experiment also is encouraging from the viewpoint of assessing the accuracy of long MD simulations.

Introduction

Biological function often emerges from intermolecular interactions that are modulated by dynamic changes in protein conformation.1 NMR spectroscopy can quantify protein motions on a wide range of timescales with atomic resolution.2,3 Nuclear spin relaxation rate constants for backbone amide moieties in proteins in solution depend on the autocorrelation functions C(t) = 〈P2[μ(0)·μ(t)]〉 of the N-H bond unit vector orientations μ(t), where P2[x] = (3x2 −1)/2 and the angle brackets denote ensemble averaging.4 The Lipari-Szabo (LS) model5 interprets experimental relaxation measurements, and hence parameterizes C(t), in terms of the overall rotational diffusion time of the protein (τM), as well as the generalized order parameter (S) and the correlation time (τe) for intramolecular motions of each N-H bond vector. The degree of spatial restriction of an N-H bond vector is quantified by 0 ≤S2 ≤ 1, with high S2 values corresponding to more rigid sites. Backbone amide order parameters have been measured for more than 200 proteins and used to quantify changes in conformational flexibility associated with protein folding, molecular recognition, and catalysis.4

Values of S 2 obtained from protein molecular dynamics (MD) simulations have been compared to those obtained from experiments to validate6,7 and improve8,9 MD simulations, and to aid in the interpretation of experiments.10,11 Ubiquitin is a 76-residue protein that has been used as a model system for experimental and computational studies of protein structure and dynamics.9,11,12,13,14,15,16 Nederveen and Bonvin14 analyzed a 0.2 µs MD simulation of ubiquitin and calculated S2 values as ensemble averages over internal bond vector fluctuations (see method (2) below). In that work, calculated S2 values varied extensively in the loop regions of ubiquitin depending on the timescale over which dynamics were averaged; values of S2 obtained as averages over the full trajectory were much lower than found by experiments. The correlation functions C(t) also were calculated from the simulation data to assess the effect of slow internal motions on NMR spin relaxation rate constants. These analyses were hindered, however, by the limited length of the simulation, and the calculated τM value, 0.74 ns, was surprisingly short.

Here we compare backbone S2 values calculated from a 1.2 µs MD simulation with values determined from NMR relaxation experiments12 for ubiquitin. Simultaneous consideration of all (rather than solely internal) motions in the MD simulation is found to reduce the discrepancies between calculated and experimental S2 values. In particular, the simulation reveals several loop residues that are conformationally mobile at timescales comparable to or longer than overall rotational diffusion (“tumbling”). The calculated S2 values for these residues are in good agreement with the experimentally determined values only if overall motion is included in the analysis of the MD simulation (see method (3) below).

Methods

Our MD simulation of ubiquitin was based on PDB 1D3Z17, with pressure and temperature consistent with the experimental conditions used by Tjandra et al.12 We used the OPLS-AA/SPC force field,18 with the Desmond program for MD simulations.19 Conformations were saved every 2 ps during the 1.2 µs simulation to obtain an ensemble of 6×105 conformations, which represent both the overall and internal motions of ubiquitin.

We calculated S2 from the MD trajectory in three ways. In methods (1) and (2), overall rotational motion of the protein is first removed by superposing all structures to a molecular reference frame. In method (1), S2 is calculated as CI (100 ns), which serves as an approximation to the long-time limit of the internal autocorrelation function CI(t), calculated from μ(t) within the molecular reference frame of the superposed structures. In method (2), S2 is calculated as P2[(Σiji(tj(t) >2)1/2] where μi(t) is the i-th Cartesian component of μ(t) within the molecular reference frame.20 In method (3), C(t) was calculated directly from the MD trajectory21 and S2 was obtained by a weighted least-squares fit (as described further in the Supporting Information) of the “extended” LS model for the correlation function introduced by Clore, et al.,22 which is defined as

C(t)=exp(t/τM)(S2+(Sf2S2)exp(t/τe))

The parameter Sf2 in this model accounts for a fast initial decay of the correlation function on timescales shorter than the first sampled point in the correlation function at 2 ps. The parameter τM was globally optimized to a value of 1.98 ns. (This simulated value of τM is about a factor of 2 smaller than the experimental value of τM,12 consistent with the observation that the self-diffusion constant of the SPC model for water is 1.8 times larger than the value determined experimentally.23) In all three S2 calculation methods, ensemble averages were approximated by time averages over the trajectory. Sample deviations in the correlation functions were estimated using a blocking method.24 Additional details on the MD simulation and the fitting analysis are provided in the Supporting Information.

Results and Discussion

We carried out a 1.2 µs MD simulation of ubiquitin, and calculated backbone S2 values from this simulation using three different methods. All analysis methods yield S2 values for the β-strands and most of the N-terminal helix that are consistent with experiment (Figure 1). In the case of residues 8–11, 30–36, and 46–48, however, the S2 values obtained using method (3) agree better with experimental values than those determined using methods (1) and (2). For these residues, which span three loops and part of the N-terminal helix, the average differences between experimental and simulated values of S2 are 0.16, 0.23, and 0.04, and the root-mean-squared differences are 0.18, 0.26, and 0.07 for methods (1)–(3), respectively. In addition to fast ps–ns motion, these loops sample alternative conformations over periods of tens to hundreds of ns in the MD simulation.

Figure 1.

Figure 1

Experimental12 and simulated backbone amide order parameters for ubiquitin. Calculated values were obtained from MD simulation using methods (1)–(3) as described in the text. The left panel shows S2 as a function of amino acid residue number. Secondary structural elements are shown above the graph (H = helix; S = β strand). The right panel shows the correlation between simulated and experimental S2 values for loop residues 8–11, 30–36, and 46–48. The solid line has an intercept of zero and a slope of one.

In simulation, as well as in experiment, slow internal motion will reduce CI(t) for lag times t > τM. Such motion has little effect, however, on C(t) or the model parameters obtained by fitting to it, because bond-vector directions are already significantly decorrelated by tumbling for t > τM, explaining the improved agreement with the experimental S2 values when using method (3). Like the experimental analysis, method (3) incorporates tumbling and is therefore only weakly sensitive to slow internal motion. In contrast, methods (1) and (2) give long-time fluctuations the same weight as motions faster than tumbling, leading to lower predicted S2 values.

Several attempts to overcome this limitation of methods (1) and (2) have been described8,14,25. For example, Buck et al. used a variant of method (1) in which they calculated S2 as the value of the internal correlation function at 6 ns, a time comparable to the rotational correlation time of the protein lysozyme.8 Markwick et al. carried out several short MD simulations, each of a length comparable to the timescale of rotational motion, for the B3 domain of protein G.25 In that work, S2 values were calculated by applying method (2) to each individual simulation, and then averaging the resulting values over all simulations. These modifications of methods (1) and (2) yielded S2 values in good agreement with experimental values, although whether these modifications could be used more generally to remove the effects of long-timescale dynamics on S2 values calculated from MD simulations is unclear.

Our interpretation of the results of method (3) is exemplified by the CI(t) and C(t) values calculated from the MD simulation for Gly47. As shown in Figure 2, the divergence between CI(t) and C(t) beyond 50 ps reflects the decorrelation of the bond-vector direction due to molecular tumbling; the subsequent decay of CI(t) on the ns timescale results from long-time dynamical processes that only weakly affect the calculation of S2 from C(t) with method (3). For this residue, the experimental value of S2 is 0.82, whereas simulated values for methods (1)–(3) are 0.45, 0.48, and 0.66, respectively. The deviation of 0.16 for method (3) is smaller by more than a factor of two compared to the other methods.

Figure 2.

Figure 2

Internal and full autocorrelation functions (CI(t) and C(t), respectively) calculated from the MD simulation for the amide bond vector of Gly47. Error bars represent one standard deviation.

Any LS analysis of the full autocorrelation function, C(t), is based on the assumption (the “separability assumption”) that this function can be written as the product of two other autocorrelation functions—one depending only on internal motion, the other only on overall rotational motion. Such a decomposition of C(t) is always possible if internal motion occurs only on a timescale much faster than overall rotational diffusion.26 Since several residues in our simulation display motion on longer timescales, the validity of the separability assumption might seem to be called into question. A recent analysis by Wong and Case27 of a 100 ns MD simulation of ubiquitin and of a 200 ns MD simulation of the B3 domain of protein G, however, suggests that such a decomposition may be justifiable in practice even for residues that display internal motion on the same timescale as tumbling. We repeated this analysis for the present 1.2 µs trajectory, and similarly found no evidence of any substantial inconsistency with the separability assumption.

Conclusions

Protein dynamics slower than molecular tumbling can now be probed computationally using long MD simulations. The present work demonstrates that high S2 values, derived from NMR relaxation experiments, are compatible with mobility on the 10–100 ns timescale, observed in long MD simulations, in the loop regions of ubiquitin. Such motions cannot currently be detected with NMR relaxation techniques.3 Residual dipolar couplings are sensitive to motions on these timescales;16,25,28 however, at present, S2 values obtained from such data depend substantially on the method of analysis (see Figure S2 in the Supporting Information) and a consensus approach awaits ongoing developments.

Our observations also highlight some of the complications that can arise in the interpretation of S2 values determined by NMR relaxation experiments. In many applications, including the extraction of configurational entropies and other thermodynamic parameters from relaxation measurements,29 the quantities of interest are the equilibrium bond vector fluctuations. Our results illustrate that the amplitudes of such fluctuations—quantified, for example, by the S2 values calculated using methods (1) and (2)—are not always in agreement with the amplitudes extracted from NMR relaxation experiments, which instead give S2 values that correspond more closely to those calculated using method (3). Because of rotational tumbling, the S2 values obtained from standard relaxation experiments are not affected by long-timescale internal motion, increasing the difficulty of extracting thermodynamic information directly from the experiments.29 In contrast, thermodynamic information obtained from MD simulations is limited only by the timescale of sampling and the accuracy of the force fields employed. To the extent that the results of MD simulations and NMR relaxation experiments can be reconciled within an analytical framework like that described in this paper, we expect that MD simulations will prove to be a valuable tool in the extraction of thermodynamic quantities from NMR experiments.

Finally, our results reiterate15,30 the need for caution in assessing simulation quality from comparisons with experimental values of S2 . Furthermore, the improved agreement between experiment and simulation obtained in the present work by joint fitting of internal and overall motions to the simulation trajectory justifies detailed interpretation of long MD simulations of protein function.

Supplementary Material

si

ACKNOWLEDGMENTS

Thanks are due to Jeno L. Sokoloski and Richard W. Pastor for valuable discussions, to Charles D. Schwieters and Joel R. Tolman for providing order parameters from residual dipolar coupling experiments, and to David A. Case for discussions and for sharing results prior to publication. This work was supported in part by the NIH grants GM52018 to R.A.F. and GM50291 to A.G.P.

Footnotes

Supporting Information Available

Simulation and analysis details; Figure S1 and S2; Table S1. This information is available free of charge via the Internet at http://pubs.acs.org.

References

  • 1.Karplus M, Kuriyan J. Proc. Natl. Acad. Sci. USA. 2005;102:6679–6685. doi: 10.1073/pnas.0408930102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Palmer AG. Chem. Rev. 2004;104:3623–3640. doi: 10.1021/cr030413t. [DOI] [PubMed] [Google Scholar]
  • 3.Mittermaier A, Kay LE. Science. 2006;213:224–228. doi: 10.1126/science.1124964. [DOI] [PubMed] [Google Scholar]
  • 4.Jarymowycz VA, Stone MJ. Chem. Rev. 2006;106:1624–1671. doi: 10.1021/cr040421p. [DOI] [PubMed] [Google Scholar]
  • 5.Lipari G, Szabo A. J. Am. Chem. Soc. 1982;104:4546–4559. [Google Scholar]
  • 6.Lipari G, Szabo A, Levy RM. Nature. 1982;300:197–198. [Google Scholar]
  • 7.Trbovic N, Kim B, Friesner RA, Palmer AG. Proteins. 2007 doi: 10.1002/prot.21750. DOI 10.1002/prot.21750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Buck M, Bouguet-Bonnet S, Pastor RW, MacKerell AD. Biophys. J. 2006;90:L36–L38. doi: 10.1529/biophysj.105.078154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hornak V, Abel R, Okur A, Strockbine B, Roitberg A, Simmerling C. Proteins. 2006;65:712–725. doi: 10.1002/prot.21123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Best RB, Vendruscolo M. J. Am. Chem. Soc. 2004;126:8090–8091. doi: 10.1021/ja0396955. [DOI] [PubMed] [Google Scholar]
  • 11.Lindorff-Larsen K, Best RB, DePristo MA, Dobson CM, Vendruscolo M. Nature. 2005;433:128–132. doi: 10.1038/nature03199. [DOI] [PubMed] [Google Scholar]
  • 12.Tjandra N, Feller SE, Pastor RW, Bax A. J. Am. Chem. Soc. 1995;117:12562–12566. [Google Scholar]
  • 13.Lienin SF, Bremi T, Brutscher B, Brüschweiler R, Ernst RR. J. Am. Chem. Soc. 1998;120:9870–9879. [Google Scholar]
  • 14.Nederveen AJ, Bonvin AMJJ. J. Chem. Theory Comp. 2005;1:363–374. doi: 10.1021/ct0498829. [DOI] [PubMed] [Google Scholar]
  • 15.Showalter SA, Brüschweiler R. J. Chem. Theory Comp. 2007;3:961–975. doi: 10.1021/ct7000045. [DOI] [PubMed] [Google Scholar]
  • 16.Showalter SA, Brüschweiler R. J. Am. Chem. Soc. 2007;129:4158–4159. doi: 10.1021/ja070658d. [DOI] [PubMed] [Google Scholar]
  • 17.Cornilescu G, Marquardt JL, Ottiger M, Bax A. J. Am. Chem. Soc. 1998;120:6836–6837. [Google Scholar]
  • 18.(a) Kaminski GA, Friesner RA, Tirado-Rives J, Jorgensen WL. J. Phys. Chem. B. 2001;105:6474–6487. [Google Scholar]; (b) Berendsen HJC, Postma JPM, van Gunsteren WF, Hermans J. In: Intermolecular Forces. Pullman B, editor. Dordrecht: D. Reidel Publishing Company; 1981. pp. 331–342. [Google Scholar]
  • 19.Bowers KJ, Chow E, Xu H, Dror RO, Eastwood MP, Gregersen BA, Klepeis JL, Kolossvary I, Moraes MA, Sacerdoti FD, Salmon JK, Shan Y, Shaw DE. Scalable Algorithms for Molecular Dynamics Simulations on Commodity Clusters. Proceedings of the 2006 ACM/IEEE Conference on Supercomputing (SC06); Tampa, Florida. 2006. [Google Scholar]
  • 20.Henry ER, Szabo A. J. Chem. Phys. 1985;82:4753–4761. [Google Scholar]
  • 21.Peter C, Daura X, van Gunsteren WF. J Biomol. NMR. 2001;20:297–310. doi: 10.1023/a:1011241030461. [DOI] [PubMed] [Google Scholar]
  • 22.Clore GM, Szabo A, Bax A, Kay LE, Driscoll PC, Gronenborn AM. J. Am. Chem. Soc. 1990;112:4989–4991. [Google Scholar]
  • 23.Smith PE, van Gunsteren WF. Chem. Phys. Lett. 1993;215:315–318. [Google Scholar]
  • 24.Flyvbjerg H, Petersen HG. J. Chem. Phys. 1989;91:461–466. [Google Scholar]
  • 25.Markwick PRL, Bouvignies G, Blackledge M. J. Am. Chem. Soc. 2007;129:4724–4730. doi: 10.1021/ja0687668. [DOI] [PubMed] [Google Scholar]
  • 26.Halle B, Wennerström H. J. Chem. Phys. 1981;75:1928–1943. [Google Scholar]
  • 27.Wong V, Case DA. J. Phys. Chem. 2007 (in press) [Google Scholar]
  • 28.Lakomek NA, Fares C, Becker S, Carlomagno T, Meiler J, Griesinger C. Angew. Chem. Int. Ed. 2005;44:7776–7778. doi: 10.1002/anie.200502573. [DOI] [PubMed] [Google Scholar]
  • 29.Akke M, Brüschweiler R, Palmer AG. J. Am. Chem. Soc. 1993;115:9832–9833. [Google Scholar]
  • 30.Dobson CM, Karplus M. Methods Enzymol. 1986;131:362–389. doi: 10.1016/0076-6879(86)31049-8. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

si

RESOURCES