Abstract
Conformational entropy is a potentially important thermodynamic parameter contributing to protein function. Quantitative measures of conformational entropy are necessary for an understanding of its role but have been difficult to obtain. An empirical method that utilizes changes in conformational dynamics as a proxy for changes in conformational entropy has recently been introduced. Here we probe the microscopic origins of the link between conformational dynamics and conformational entropy using molecular dynamics simulations. Simulation of seven pro! teins gave an excellent correlation with measures of side-chain motion derived from NMR relaxation. The simulations show that the motion of methyl-bearing side-chains are sufficiently coupled to that of other side chains to serve as excellent reporters of the overall side-chain conformational entropy. These results tend to validate the use of experimentally accessible measures of methyl motion - the NMR-derived generalized order parameters - as a proxy from which to derive changes in protein conformational entropy.
Introduction
The thermodynamic nature of the folded state of proteins governs their biological function in innumerable ways. Indeed, the many forces that govern the stability of the native state are complex and continue to be difficult to unravel and dissect through experiment, simulation or theory.1,2 This is particularly true for the residual conformational entropy of proteins, which is potentially a central component of the thermodynamics governing protein function.3-5 Historically, it has been impossible to experimentally determine the contribution of residual protein entropy to fundamental protein activities such as the binding of ligands, vital for protein function. Recent progress has illuminated the possibility of employing NMR relaxation methods to quantitatively determine the role of changes in conformational entropy in molecular recognition by proteins.6 The approach rests on using fast internal protein dynamics as a proxy for conformational entropy.7 The initial approach was to effectively take an inventory of the change in motion at a limited number of sites and interpret this within the context of a simple physical model such as the harmonic oscillator8 or diffusion within an infinite square well.9 This raises several obvious issues such as the effects of correlated motion, the operation of a more complex potential energy function, the completeness of the oscillator count, and so on.7 More recently this issue was sidestepped by using an empirical calibration of the dynamical proxy for conformational entropy. Rather than attempt a model-dependent interpretation of an inventory of changes in local dynamics, an empirical scaling between the experimental measures of local dynamics and local disorder (entropy) was attempted.10 The idea rests on the assumption that the experimentally accessible methyl-bearing side chains are numerous enough to provide good coverage of internal motion and are sufficiently coupled to neighboring side chains to report on the entropy.6 With an appropriate experimental system, this leads to a rather simple relationship between what can be measured (protein motion and total binding entropy) or confidently calculated (solvent entropy) and what is desired (conformational entropy).10 Thus by comprehensively measuring the change in motion of methyl-bearing side chains on the nanosecond time scale using classical NMR relaxation methods11-15 one could obtain a quantitative measure of the underlying change in conformational dynamics.10 This approach was first introduced using the binding of calcium-saturated calmodulin to a series of calmodulin-binding domains.10 The results demonstrated that changes in conformational entropy were important to ligand binding and also confirmed an earlier suggestion16 that conformational entropy contributes to the tuning of the free energy of binding in this system.10 Recently, a second example has appeared where Tzeng & Kalodimos used an impressive collection of mutants of the catabolite activator protein (CAP) to undertake a similar analysis of the thermodynamic origins of high affinity binding of DNA to the cAMP-activated protein.17 The advantage of the CAP system is that the same binding ligand (DNA) was used, the structure does not change appreciably upon binding the DNA and an array of structurally benign mutations remote from the binding interface were available. This eliminated most of the uncertainties that may have hampered the analysis of the calmodulin complexes described above. The resulting empirical calibration for CAP was most impressive and led to a convincing analysis of the role of conformational entropy in the binding of ligands to the CAP protein and its potential role in its allosteric regulation.17
Here we attempt to understand the microscopic origins of the apparent success of the empirical “entropy meter” based on a dynamical proxy measured through NMR relaxation methods. We employ extensive molecular dynamics simulations in explicit water to examine the ability of measures of fast motion of methyl groups in proteins to adequately represent the conformational entropy of the protein. In addition, we explore several issues as such as the degree to which correlated or conditional motion affects the analysis. In order to rely on such an analysis, it is required that the molecular dynamics simulations faithfully represent those motions contributing to experimentally observables such as NMR relaxation. Although there have recently been significant advances in computational approaches, particularly with respect to long-time scale dynamics,18,19 there has been very little bench marking of molecular dynamics simulations with experimental measures of methyl-bearing side chain motion derived from NMR relaxation.20 With one exception,21 the correspondence between Lipari-Szabo (L-S) methyl group squared generalized order parameters derived from molecular dynamics and experiment in the few cases examined has been mixed and insufficient to promote detailed analysis in this context. In a preamble to the main thrust of this paper, we compare L-S squared generalized order parameters of methyl group symmetry axes derived from deuterium or carbon relaxation methods. Using NAMD22,23 simulations of seven proteins with the CHARMM27 force field,24,25 we find excellent agreement with experiment, which provides a solid foundation for addressing the microscopic origins of the “entropy meter” established using NMR relaxation methods. Analysis of the simulations then reveals that the experimentally accessible motions of methyl-bearing side chains are sufficiently coupled to the motion of other side chains to serve as excellent reporters of the protein conformational entropy: The total side chain conformational entropy can be accurately recapitulated using only measures of side-chain methyl motions. The conformational entropy varies mostly through changes in populations of rotameric states rather than by variation in the effective potential defining each rotameric well, although some well narrowing is seen at very rigid sites. Importantly, a statistical analysis reveals that correlated motion is sufficiently limited to have little impact on the relationship between the NMR measure of methyl dynamics and the derived entropy. These results validate the use of conformational dynamics derived experimentally from NMR relaxation as a proxy from which to obtain quantitative estimates of changes in protein conformational entropy.
Methods
Molecular dynamics simulations
Molecular dynamics simulations of the seven proteins listed in Table 1 were carried out with NAMD223, using the CHARMM2724 all-atom parameter set and the TIP3P26 water potential. Hydrogen atoms were added to the PDB structures with VMD.27 Using VMD27 individual proteins were centered in a TIP3P water box such that protein atoms were at least 8 Å from the boundary except in the case of the two small proteins, ubiquitin and α3D, where the solvent layer was at least 5 Å and 6 Å, respectively.
Table 1.
Proteinsa | Residues | PDBb | T (°C)c | Watersd | Lengthe (ns) |
---|---|---|---|---|---|
ALBP | 131 | 1LIB | 20 | 4008 | 112 |
a3D | 73 | 2A3D | 30 | 2532 | 160 |
Cyt c2 | 116 | 1C2R | 30 | 5116 | 120 |
CaM-smMLCKp | 167 | 1CDL | 35 | 5228 | 1280f |
CaM-nNOSp | 168 | 2O60 | 35 | 4957 | 1120f |
HEWL | 129 | 1LZA | 35 | 4133 | 240 |
Ubiquitin | 76 | 1UBQ | 25 | 2238 | 260f |
Abbreviations: ALBP, adipocyte lipid binding protein; Cyt c2, cytochrome c2; CaM-smMLCKp, calcium-saturated calmodulin (CaM) in complex with a peptide corresponding to the smooth muscle myosin light chain kinase calmodulin-binding domain (smMLCKp); CaM-nNOSp, calcium-saturated calmodulin in complex with a peptide corresponding to the neuronal nitric oxide synthase calmodulin-binding domain (nNOSp); HEWL, hen egg white lysozyme
PDB code of starting structure
Temperature of simulation and NMR experiments
Number of waters in the simulation
Length of the simulation
Utilized Anton (see Methods)
Simulations were performed using a time step of 2 fs. Bonds to hydrogen atoms were constrained using the SHAKE algorithm.28 A switching distance of 10 Å and cut-off of 12 Å were used for the non-bonded interaction interactions, combined with a Particle Mesh Ewald summation with 1 Å grid spacing for long-range electrostatics. Simulations were run at constant temperature and pressure of 1 Atm, controlled using the extended Langevin method. Simulation temperatures corresponded to those at which the NMR relaxation experiments were performed (Table 1). Following equilibration runs of at least 1 ns, several 60 ns data production runs were performed with every subsequent 60 ns simulation starting from the final coordinates of the earlier run but with different initial velocities. For three protein systems, ubiquitin, calmodulin-smMLCKp complex and calmodulin-nNOSp complex, longer simulations were also run on the Anton supercomputer at the Pittsburgh Supercomputer Center run using the same force field and simulation conditions except for a non-bond cutoff of 14 Å.
Analysis of internal motion
The Lipari-Szabo29 squared generalized order parameters (O2) were calculated from MD simulations by overlaying snapshots of the protein from the trajectories using a standard rigid-body alignment (Cα) procedure. For each snapshot, the unit vector along the methyl symmetry axis was obtained in terms of its vector components in Cartesian axes, x, y, and z. The O2 parameter for a given methyl is then calculated using:30
(1) |
where <> indicates the average over the trajectory. For comparison with experiment, the order parameter was derived from the measured order parameter for the methyl C-H bond by assuming free rotation around the symmetry axis and ideal tetrahedral geometry (θ = 109.5°) by: 31
(2) |
Order parameters from independent simulations on the same protein were first computed using equation (6), averaged and then compared to those obtained experimentally using NMR relaxation through equation (7).
Calculation of rotameric entropy
Side chain χ angles for each amino acid were calculated from the MD trajectories and binned into one of three conformational states based on the dihedral angle: gauche+ - [0°, 120°], trans - [120°, 240°]; and gauche- - [240°, 360°]. The rotamer probability distribution function (pdf) of all 3Nχ rotamers of each side chain was constructed from the χ angle histograms, where Nχ is the number of side chain χ angles. Only unique side chain torsion angles were considered (e.g. the isopropyl group contributes to one giving Leu 2 and Val 1 side chain torsion angles). The side chain rotamer entropy is:
(3) |
where kB is the Boltzmann constant, pi is the probability of the ith rotamer. Sb=S/kb is the entropy given in dimensionless units, and the normalized entropy is then defined as Sb/Nχ,.
Calculation of the influence of correlated motions on side-chain entropy
To quantify the effect of correlated motions of side-chains on their entropy, one-dimensional, two-dimensional and three-dimensional pdfs for every side-chain torsion angle, doublet or triplet combination, respectively, were constructed from the trajectories. The torsion angles were again binned into gauche+, gauche- and trans conformations. The one-dimensional (i.e. uncorrelated) entropy, S1D was obtained by summing –P(χ)ln P(χ) over each one-dimensional pdf, P(χ), and then summing over all residues in the protein. The total side-chain entropy, including the effect of correlated motions of side-chains was then estimated using the maximum information spanning tree (MIST) method.32 A second order MIST approximation was obtained by computing pair-wise second order mutual information terms (I2) from the one-dimensional and two-dimensional pdfs as:
(4) |
where the two-dimensional pdf expresses the joint probability that side-chain torsion angles j, k are in the I’th and m’th conformations respectively. and are the corresponding one-dimensional pdfs. Given all the I2's, the second order MIST expansion is then obtained by selecting a spanning set of I2's to maximize:
(5) |
where n is the total number of side-chain torsion angles in the protein. To construct this spanning set, for each i, the index j is chosen from the set {1,2..,i-1} which gives the largest I2(i,j). Similarly, a third order mutual information term (I3):
(6) |
expresses the change in mutual information between i and j by including the k’th torsion angle. A third order MIST approximation is built up analogously by choosing a spanning set of I3's to maximize:
(7) |
where for each k, the indices i and j are chosen from the set {1,2..,k-1} which gives the largest I3(i,j,k). Then an upper bound estimate for the total entropy including correlations at order O is: 32
(8) |
Results
Seven proteins ranging in size from 73 to 168 residues were selected for study (Table 1). Some have bound metals (calmodulin) or prosthetic group (cytochrome c2). Two are high-affinity complexes of small peptides with calmodulin. All have an extensive experimental reference sets for dynamics derived from site-resolved solution NMR relaxation studies. The methyl group Lipari-Szabo squared generalized order parameters (O2axis) were taken from published deuterium methyl relaxation studies: α3D,33 adipocyte lipid binding protein,34 cytochrome c2,35 calcium-saturated calmodulin (CaM) complexes with peptides corresponding to the calmodulin-binding domain of the smooth muscle myosin light chain kinase (smMLCKp) and the neuronal nitric oxide synthase (nNOSp),16 hen egg white lysozyme36 and ubiquitin.37 The average dynamical properties of the proteins range from quite rigid (e.g. cytochrome c2 and lysozyme) to quite internally dynamic (e.g. α3D). The molecular coordinates for the proteins used in the simulations were derived from structures deposited in the Protein Data Bank: ubiquitin,38 α3D,39 (Ca2+)4-calmodulin-smMLCK,40 (Ca2+)4-calmodulin-nNOS (unpublished), hen egg white lysozyme,41 adipocyte lipid binding protein,42 and cytochrome c243 (Table 1).
Comparison of simulation and experimental methyl side-chain dynamics
The primary goal of this work is to discover the microscopic origins of the apparent success in using a dynamical proxy employing motion of methyl-bearing side chains for the determination of conformational entropy of proteins. As a first step towards this goal, it is important to determine the reliability and accuracy of molecular dynamics simulations in capturing the experimentally determined side chain motion. Here we employ the CHARMM27 potential24 in implemented in the context of the NAMD molecular dynamics environment.23 Trajectories were carried out in explicit water at the temperature at which the experimental measurements were made (Table 1). Methyl group symmetry axis L-S squared generalized order parameters (O2axis) were calculated from the trajectories using Eq. 6. Convergence of the simulations with regard to O2axis values was confirmed by examination of the full time correlation function C(t)=P2(<μ(t+T)·μ(t)>) for selected methyl groups spanning the range of observed O2 values, and by the variation in O2axis calculated from different batches of individual trajectories (Supplementary Figure S1)
In detail, a methyl-by-methyl comparison reveals considerable variation between the calculated and simulated L-S O2axis parameters with both over- and underestimation being evident (Table 2 and Supplementary Figure S2). Table 2 summarizes the statistical comparison of O2axis parameters obtained from simulation and by experiment. With the exception of cytochrome c2, the correlation coefficients (R2) are all above 0.5, with the highest value of 0.85 being achieved for ubiquitin. Importantly, there is a good correlation between the calculated and observed average order parameters for each protein (Figure 1). This provides a solid foundation for exploring the microscopic basis for fast methyl-bearing side motion as a proxy for conformational entropy.
Table 2.
Proteins | Expt'l <O2axis> | Mol. Dyn. <O2axis> | R2 | Slope |
---|---|---|---|---|
ALBP | 0.633 | 0.619 | 0.75 | 0.88 |
a3D | 0.451 | 0.571 | 0.76 | 1.25 |
Cyt c2 | 0.767 | 0.670 | 0.47 | 0.68 |
CaM-smMLCKp | 0.583 | 0.562 | 0.62 | 0.83 |
CaM-nNOSp | 0.534 | 0.560 | 0.50 | 0.77 |
HEWL | 0.713 | 0.699 | 0.64 | 0.67 |
Ubiquitin | 0.664 | 0.629 | 0.85 | 0.86 |
Average | 0.620 ± 0.11 | 0.616 ± 0.05 | 0.65 ± 0.14 | 0.85 ± 0.13 |
The correlation of methyl-bearing side chain motion and rotamer entropy
A fundamental aspect of the empirical “entropy meter” approach outlined by Marlow and coworkers is the need for a quantitative linkage between the motional averaging of the methyl group symmetry axis and the underlying conformational entropy.10
To examine this question, the rotamer population distributions of all the methyl containing amino acids were calculated for each of the protein simulations. The corresponding side chain rotamer entropies were calculated as described by Equation 3. The resulting entropies are plotted against the corresponding calculated methyl order parameters in Figure 2. There is an excellent linear correlation between the rotamer entropy (Sb) of each methyl containing amino acid when normalized by the number of side-chain torsion angles (Nχ) in that residue and the respective methyl side chain order parameters calculated from the MD simulations. Moreover, the distributions from the different proteins overlay each other, strongly suggesting that the empirical scaling of motion to the corresponding entropy should be universal. Correlations of rotamer entropy and O2axis parameters for the individual proteins are provided in the Supplementary Figure S3 and summarized in Supplementary Table S1.
In principle, the entropic content represented by the O2axis parameter could arise not only from the distribution between rotamer wells but also from the distribution within each rotamer well and the associated vibrational entropy.5,44,45 With this in mind, for each methyl group we examined the fine distribution of torsion angles using a resolution of 3°. The width of a rotameric well is characterized by the root mean square deviation (rmsd) of its χ-angles. Weighting the width of each well of a given torsion by its overall occupancy gives the mean width. This width is shown as a function of the calculated O2axis parameter for the corresponding residue in Figure 3. The within-well distributions of non-Methionine rotamer angles is largely insensitive to values of O2axis below ~0.8. At higher O2axis values some narrowing of the within-well distribution is indicated. Met methyl groups behave somewhat differently: Within-well widths are modestly correlated with O2axis (R2=0.30), suggesting that the energy well governing the directly connected torsion angle of the Met methyl group is somewhat plastic.
Given the rarity of Met residues the generality and origin of this remains to be established. Overall, the high correlation between the normalized rotameric entropy of methyl containing amino acids and the O2axis parameters of the attached methyl groups indicates that methyl probes serve as excellent reporters of the conformational entropy of methyl containing amino acids in proteins. Two conclusions can be drawn. First, since the residues of seven quite different proteins have indistinguishable correlations between rotamer entropy Sb and the O2axis parameter, there appears to be a universal relationship between these two quantities. Second, the O2axis parameter appears to be determined primarily by between-well rotamer transitions.
Though there is significant variation in the normalized rotamer entropy associated with a specific value of O2axis on a per-residue basis, particularly at lower order parameters (Figure 2 and Supplementary Figure S3), this variation is much smaller when considering the average behavior over the entire protein. The average normalized rotamer entropy shows an excellent correlation (R2 = 0.91) with the average O2axis parameter (Figure 4). This suggests that the NMR O2axis parameter and the normalized rotamer entropy are effectively transforms of each other and follow a simple protein-independent linear relationship. Thus, though our current ability to accurately simulate individual experimental order parameters is limited (Table 2 and Supplementary Figure S2), the accuracy of the corresponding derived entropy greatly improves when one uses averages over the entire molecule (Figure 4 and Supplementary Figure S4) and one is not trying to simulate order parameters but has known values from NMR.
The essence of the “entropy meter”
A critical assumption in the use of the dynamical proxy of methyl group motion for conformational entropy of the entire protein is that the motion of the methyl group is sufficiently coupled to its surroundings to faithfully report on local disorder.10 To examine this central issue, we calculated the total side chain rotamer entropy from the probability distributions for all side-chain torsion angles. The linear correlation between the dynamical proxy (methyl group dynamics) and total conformational entropy is excellent (Figure 4). Thus averaged over a protein, the methyl order parameters report well on the total side chain entropy. This can be expressed using the remarkably simple equation:
(9) |
which, as noted, is uncompensated for correlated states. Thus far the rotamer entropy has been calculated from entirely intra-side chain rotamer pdfs. Correlations between χ angles within each residue are accounted for by the exhaustive enumeration of individual side chain torsion angle combinations (see Methods). However, the effect of inter-residue correlations between torsional motions is absent. Such correlations will result in lowering of the derived protein conformational entropy. In order to quantify the effect of such correlations, doublet and triplet torsion angle distributions of side-chains in all the proteins were accumulated and analyzed. The contributions of inter-residue correlations to the total entropy were calculated using the MIST formulation of Tidor and coworkers.32,46 The proportional degree of reduction in total conformational entropy is very similar across the set of proteins studied. Both second and third order corrections for inter-residue correlated motion gave resulting entropies that showed excellent linear correlations with the uncorrected entropy (Inset, Figure 4). Moreover, the correction from third-order (17%) is not much higher than for second order (11%) indicating that the bulk of the correction due to all orders of correlation is captured by these lower order, calculable effects. This leads to a simple correction to Equation 1 so that one can obtain an estimate of the total side chain entropy of a protein, including both methyl and non-methyl containing residues, intra- and inter residue correlation effects from the following equation
(10) |
which is corrected for inter-residue correlations.
In summary, the side-chain rotamer entropy and methyl order parameters are closely related measures of the dynamics of methyl bearing side chains. In addition, the major determinants of side chain entropy changes are shifts in rotamer populations, which are visible to NMR. Moreover, when averaged over the whole protein, methyl order parameters report with identical fidelity on all side-chain motions. Thus NMR measurements of average methyl order parameter values can be leveraged into estimates of total side-chain entropy.
Reformulation of the “entropy meter”
The original construction of the conformational “entropy meter” based on the dynamical proxy of fast methyl group motion was admittedly crude.6,10 This was especially true on how to project the measured motion of methyl groups across the protein. A simple residue-weighting scheme was used. Briefly, the approach rested on relating what could be easily measured (total binding entropy via isothermal titration calorimetry) or calculated (solvent entropy via changes in accessible surface area) to the dynamical proxy. The approach assumed that additional sources of entropy, such as rotational-translational entropy or undocumented entropy were constant across a family of protein-ligand interactions. It was further assumed that measured changes in methyl-group motion were linearly related to changes in changes in local motion (disorder) that not only reflected the methyl-bearing side chain but also that of the surrounding side-chains. Linear correlation between side-chain motion and entropy was suggested by simple models,8,9,47 and by the molecular dynamics simulations presented here. These considerations led to the following formulation of the “entropy meter:”10
(11) |
where ΔStot, ΔSsolv, ΔSconf, ΔSRT and ΔSother are the changes in total system entropy, solvent entropy, conformational entropy, rotational-translational entropy and otherwise undocumented entropy, respectively. As noted in detail elsewhere,6 the two protein systems where this approach was taken give decidedly different conversion constants (m). Though it may indeed be possible that different proteins have qualitatively different dynamics it does seem puzzling.6 We resolve this below.
The ability to recapitulate semi-quantitatively experimentally derived order parameters via molecular dynamics trajectories allows for the rapid exploration of the generality of slight reformulations of the experimental “entropy meter.” The computational results described here suggest a modest refinement. Due to the excellent correlation between measures of angular disorder (O2axis) and the conformational entropy normalized by the number of involved torsion angles (Figure 4), we reanalyzed the empirical calibration of conformational entropy from the study on calmodulin and its binding partners10 and cAMP activated CAP and its mutants binding to DNA17. Now the NMR order parameter derived proxy is given by:
(12) |
where Nχprot and Nχlig are the total number of side-chain torsion angles in the protein and ligand, respectively. Using this formulation of the dynamical proxy, a single linear calibration line applies to both the CaM and CAP protein complexes (Figure 5).
Discussion
Here we have employed molecular dynamics simulations to investigate the validity of several assumptions that are fundamental to the idea of using dynamical information gathered from NMR relaxation in methyl groups of amino acid side chains as a quantitative proxy for conformational entropy. Since the pioneering introduction of molecular dynamics simulations for comprehensive studies of internal protein motion several decades ago,48 the capabilities and accuracy of such simulations have expanded greatly with increases in computational power and significant improvements in various aspects of the force fields employed.19 Perhaps surprisingly, the accuracy of molecular dynamics simulations have not been extensively tested in the ps-ns time regime against the sizable database of side chain order parameters obtained by solution NMR relaxation methods.18-20 Using the NAMD implementation of the CHARMM27 potential in a largely “out of the box” fashion, reasonable “site-to-site” agreement with experiment. These results compare favorably with evaluations of various AMBER potentials employing calbindin and ubiquitin as test proteins.49,50 More importantly, we find excellent on average agreement between experimental Lipari-Szabo squared generalized order parameters and those extracted from molecular dynamics trajectories. There exists a large site-to-site variance between experiment and simulation that remains to be explained and corrected. However, the overall agreement with experiment is sufficiently good to promote use of simulations to illuminate critical aspects of the use of the dynamical proxy as a means to quantify conformational entropy.
A central feature of the dynamical proxy to act as a measure of the distribution of states is the notion that motion between states can be linearly related to the underlying conformational entropy.7 This was initially promoted by examination of a number of simple motional models.8,9,47 The molecular dynamics studies of the seven protein systems described here fully reinforce this expectation. There is a robust linear relationship observed between the calculated rotamer entropy of each methyl bearing side chain - normalized by the number of associated χ torsion angles - and its respective methyl order parameter for each protein. This is perhaps not unexpected given that the motion of terminal methyl groups arises from motion about all torsions connecting it to the backbone. Indeed, this type of motion is mirrored in the product of the corresponding generalized order parameters.29,51 This indicates that methyl groups are excellent reporters of their own side-chain conformational entropy. Importantly, this relationship is uniform across all the proteins studied here suggesting that it is general (Figure 2). Further evidence for this generality comes from a recent and parallel analysis of apo HIV protease simulations.52 Glass et al52 also find a linear dependence of methyl group entropy on O2 over the range 0.1-0.8, with a slope, when expressed in dimensionless units, very similar to that of Figure 2. It is important to note in this regard that the motions detected by the NMR relaxation discussed here are restricted to timescales faster than overall tumbling of the macromolecule,29 which is on the order of ten nanoseconds or less for the proteins examined in this study. Interestingly, the molecular dynamics simulations exceed this by an order of magnitude. Thus, the high correlation suggests that the ensemble of states that is experimentally accessible accurately reflects that sampled on longer timescales. It is important to note that all methy-bearing side chains, except for Thr, show extensive rotamer interconversion during the trajectories. In principle, it is possible that longer time scale motion may interconvert states that are not similarly sampled in the time regime relevant to the generalized order parameter obtained by classical NMR relaxation.53 Fortunately, the high empirical correlation found (Fig. 5) suggests that the possible distortion of the relationship between local entropy and the generalized order parameter due to slow rotamer interconversion is small.
In the “entropy meter” treatment of Marlow and coworkers10 it is assumed that the motion of the experimentally accessible methyl group not only reflects the motional disorder of the entire methyl-bearing side chain but that it also linearly reflects motion (disorder) of the surrounding non-methyl bearing side chains. This coupling of motion is clearly evident in the molecular dynamics simulations where a remarkably robust linear relationship exists between the total normalized rotamer entropy of a protein and the appropriately weighted change in effective amplitude (i.e. the angular disorder represented by the L-S O2axis parameter) of fast methyl group motion. The normalization by Nχ provides an unbiased view of the relationship between rotamer entropy, expressed now as an intensive quantity, and methyl dynamics for different length side-chains e.g. methionine versus valine etc. Furthermore, this linear relationship is valid for the entire range of order parameter i.e. from 0 to 1 irrespective of the type of methyl side-chain, i.e. the relative contribution to entropy from a residue with a given order parameter depends only on <O2axis> and Nχ not the type of side chain.
The microscopic origins of this rotamer entropy and its subsequent linear relationship to methyl side-chain order parameters are intrinsically related to the population distribution between the different energy states of the side-chain i.e. different rotamer wells. The relationship is largely insensitive to the distributions within a rotamer well except for perhaps methionine (Figure 3). In principle, the total side-chain conformational entropy arises from the combined effect of both transitions between rotamer wells (‘conformational entropy’) and distributions within a rotamer well (‘vibrational entropy’).5 However, differences in entropy that distinguish more dynamic or less dynamic residues, or that distinguish more dynamic proteins such as α3D from less dynamic ones such as HEWL arise almost entirely from the conformational component. This insensitivity of the torsional distributions within rotamer wells is attributable to the fact that the energies involved in stabilization of protein structure and in binding ligands are rather modest. They are apparently sufficient to modulate softer modes of motion such as transitions between rotamers, but not enough to greatly affect vibrational contributions such as motion within torsion wells nor indeed the much stiffer vibrational contributions from bond angle and bond stretch motions.
A logical consequence of the coupling of motion is the need to consider how correlation might affect the quantification of conformational entropy via a dynamical proxy. Although exhaustive enumeration of rotamers within a side chain is computational feasible and was carried out explicitly, complete inter-residue correlations are not computationally tractable but were estimated here using the MIST algorithm.32 These calculations indicate that only a small linear scaling of the uncorrected entropy is required. This picture of rather restricted, local effects of correlation is consistent with other recent studies of conformational entropy in proteins.54-56 Given the generally linear relationship revealed by simulations, the experimental “entropy meter” should represent the true conformational entropy change as the effects of correlated motion are encapsulated within the empirical calibration, assuming the absence of significant solvent-protein coupling (see equation 3). As emphasized above, the conformational entropy calculated from molecular dynamics derived <O2> values alone would be insufficiently accurate. In contrast, the empirical entropy meter based on the measured methyl-group dynamical proxy would more accurately relate side chain motion to the underlying conformational entropy, and does so through a remarkably simple relationship:
(13) |
Conclusions
Using molecular dynamics simulations as a guide, we have resolved the apparent discrepancy between the applications of a dynamical proxy for conformational entropy to the binding of ligands by calmodulin and the catabolite activation protein. A simple normalization based on the number of degrees of freedom (torsion angles) is sufficient. The molecular dynamics simulations also suggest that the “entropy meter” derived from this approach may be generally applicable. Further experiments are required to confirm this. Nevertheless, the apparent robustness of the dynamical proxy for conformational entropy from the perspective of both simulation and experiment reinforces the conclusion that conformational entropy can indeed play a significant and potentially determining role in associations involving proteins.6 Looking forward, these results promote a more exhaustive examination of the quantitative role of conformational entropy in the free energy governing protein-ligand associations. This may be particularly important in the context the interaction of proteins with unnatural man-made pharmaceuticals where the influence of conformational entropy remains largely unknown.57
Supplementary Material
ACKNOWLEDGMENT
Supported by NIH grant GM102447 a grant from the Mathers Foundation and the National Resource for Biomedical Supercomputing and the Pittsburgh Supercomputing Center through NIH Award RC2GM093307 to CMU through the NRBSC.
Footnotes
Supporting Information
Convergence of simulation trajectories. Site-by-site comparison of simulated methyl side-chain order parameter to both experimental order parameters and methyl side-chain entropy. Summary of statistics for the effect of correlated motion on side-chain entropy calculations. This material is available free of charge via the internet at http://pubs.acs.org.
REFERENCES
- 1.Dill KA. Biochemistry. 1990;29:7133–7155. doi: 10.1021/bi00483a001. [DOI] [PubMed] [Google Scholar]
- 2.Hilser VJ, Wrabl JO, Motlagh HN. Annu. Rev. Biophys. 2012;41:585–609. doi: 10.1146/annurev-biophys-050511-102319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Cooper A. Proc. Natl. Acad. Sci. U S A. 1976;73:2740–2741. doi: 10.1073/pnas.73.8.2740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Cooper A, Dryden DTF. Eur. Biophys. J. Biophys. Lett. 1984;11:103–109. doi: 10.1007/BF00276625. [DOI] [PubMed] [Google Scholar]
- 5.Karplus M, Ichiye T, Pettitt BM. Biophys. J. 1987;52:1083–1085. doi: 10.1016/S0006-3495(87)83303-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wand AJ. Cur. Opin. Struct. Biol. 2013;23:75–81. doi: 10.1016/j.sbi.2012.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Igumenova TI, Frederick KK, Wand AJ. Chem. Rev. 2006;106:1672–1699. doi: 10.1021/cr040422h. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Li Z, Raychaudhuri S, Wand AJ. Protein Sci. 1996;5:2647–2650. doi: 10.1002/pro.5560051228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Yang D, Kay LE. J. Mol. Biol. 1996;263:369–382. doi: 10.1006/jmbi.1996.0581. [DOI] [PubMed] [Google Scholar]
- 10.Marlow MS, Dogan J, Frederick KK, Valentine KG, Wand AJ. Nat. Chem. Biol. 2010;6:352–358. doi: 10.1038/nchembio.347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Farrow NA, Muhandiram R, Singer AU, Pascal SM, Kay CM, Gish G, Shoelson SE, Pawson T, Formankay JD, Kay LE. Biochemistry. 1994;33:5984–6003. doi: 10.1021/bi00185a040. [DOI] [PubMed] [Google Scholar]
- 12.Muhandiram DR, Yamazaki T, Sykes BD, Kay LE. J. Am. Chem. Soc. 1995;117:11536–11544. [Google Scholar]
- 13.Ishima R, Petkova AP, Louis JM, Torchia DA. J. Am. Chem. Soc. 2001;123:6164–6171. doi: 10.1021/ja0104711. [DOI] [PubMed] [Google Scholar]
- 14.Millet O, Muhandiram DR, Skrynnikov NR, Kay LE. J. Am. Chem. Soc. 2002;124:6439–6448. doi: 10.1021/ja012497y. [DOI] [PubMed] [Google Scholar]
- 15.Tugarinov V, Kay LE. Biochemistry. 2005;44:15970–15977. doi: 10.1021/bi0519809. [DOI] [PubMed] [Google Scholar]
- 16.Frederick KK, Marlow MS, Valentine KG, Wand AJ. Nature. 2007;448:325–329. doi: 10.1038/nature05959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Tzeng S-R, Kalodimos C. Nature. 2012;488:236–240. doi: 10.1038/nature11271. [DOI] [PubMed] [Google Scholar]
- 18.Shaw DE, Maragakis P, Lindorff-Larsen K, Piana S, Dror RO, Eastwood MP, Bank JA, Jumper JM, Salmon JK, Shan YB, Wriggers W. Science. 2010;330:341–346. doi: 10.1126/science.1187409. [DOI] [PubMed] [Google Scholar]
- 19.Dror RO, Dirks RM, Grossman JP, Xu HF, Shaw DE. Annu. Rev. Biophys. 2012;41:429–452. doi: 10.1146/annurev-biophys-042910-155245. [DOI] [PubMed] [Google Scholar]
- 20.Beauchamp KA, Lin YS, Das R, Pande VS. J. Chem. Theor. Comp. 2012;8:1409–1414. doi: 10.1021/ct2007814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Scouras AD, Daggett V. Protein Sci. 2011;20:341–352. doi: 10.1002/pro.565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nelson MT, Humphrey W, Gursoy A, Dalke A, Kale LV, Skeel RD, Schulten K. Intl. J. Supercomputer Appl. High Perform. Comp. 1996;10:251–268. [Google Scholar]
- 23.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel RD, Kale L, Schulten KJ. Comp. Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Brooks BR, Brooks CL, Mackerell AD, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner AR, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor RW, Post CB, Pu JZ, Schaefer M, Tidor B, Venable RM, Woodcock HL, Wu X, Yang W, York DM, Karplus M. J. Comp. Chem. 2009;30:1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. J. Comp. Chem. 1983;4:187–217. [Google Scholar]
- 26.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
- 27.Humphrey W, Dalke A, Schulten K. J. Mol. Graph. Model. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- 28.Ryckaert JP, Ciccotti G, Berendsen HJC. J. Chem. Phys. 1977;23:327–341. [Google Scholar]
- 29.Lipari G, Szabo A. J. Am. Chem. Soc. 1982;104:4546–4559. [Google Scholar]
- 30.Chatfield DC, Szabo A, Brooks BR. J. Am. Chem. Soc. 1998;120:5301–5311. [Google Scholar]
- 31.Nicholson LK, Kay LE, Baldisseri DM, Arango J, Young PE, Bax A, Torchia DA. Biochemistry. 1992;31:5253–5263. doi: 10.1021/bi00138a003. [DOI] [PubMed] [Google Scholar]
- 32.King BM, Tidor B. Bioinformatics. 2009;25:1165–1172. doi: 10.1093/bioinformatics/btp109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Walsh STR, Lee AL, DeGrado WF, Wand AJ. Biochemistry. 2001;40:9560–9569. doi: 10.1021/bi0105274. [DOI] [PubMed] [Google Scholar]
- 34.Constantine KL, Friedrichs MS, Wittekind M, Jamil H, Chu CH, Parker RA, Goldfarb V, Mueller L, Farmer BT. Biochemistry. 1998;37:7965–7980. doi: 10.1021/bi980203o. [DOI] [PubMed] [Google Scholar]
- 35.Flynn PF, Urbauer RJB, Zhang H, Lee AL, Wand AJ. Biochemistry. 2001;40:6559–6569. doi: 10.1021/bi0102252. [DOI] [PubMed] [Google Scholar]
- 36.Moorman VR, Valentine KG, Wand AJ. Protein Sci. 2012;21:1066–1073. doi: 10.1002/pro.2092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Lee AL, Flynn PF, Wand AJ. J. Am. Chem. Soc. 1999;121:2891–2902. [Google Scholar]
- 38.Vijaykumar S, Bugg CE, Cook WJ. J. Mol. Biol. 1987;194:531–544. doi: 10.1016/0022-2836(87)90679-6. [DOI] [PubMed] [Google Scholar]
- 39.Walsh STR, Cheng H, Bryson JW, Roder H, DeGrado WF. Proc. Natl. Acad. Sci. U S A. 1999;96:5486–5491. doi: 10.1073/pnas.96.10.5486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Meador WE, Means AR, Quiocho FA. Science. 1992;257:1251–1255. doi: 10.1126/science.1519061. [DOI] [PubMed] [Google Scholar]
- 41.Maenaka K, Matsushima M, Song H, Sunada F, Watanabe K, Kumagai I. J. Mol. Biol. 1995;247:281–293. doi: 10.1006/jmbi.1994.0139. [DOI] [PubMed] [Google Scholar]
- 42.Xu ZH, Bernlohr DA, Banaszak LJ. J. Biol. Chem. 1993;268:7874–7884. [PubMed] [Google Scholar]
- 43.Benning MM, Wesenberg G, Caffrey MS, Bartsch RG, Meyer TE, Cusanovich MA, Rayment I, Holden HM. J. Mol. Biol. 1991;220:673–685. doi: 10.1016/0022-2836(91)90109-j. [DOI] [PubMed] [Google Scholar]
- 44.Chen W, Gilson MK, Webb SP, Potter MJ. J. Chem. Theory Comput. 2010;6:3540–3557. doi: 10.1021/ct100245n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Head MS, Given JA, Gilson MK. J. Phys. Chem. A. 1997;101:1609–1618. [Google Scholar]
- 46.King BM, Silver NW, Tidor B. J. Phys. Chem. B. 2012;116:2891–2904. doi: 10.1021/jp2068123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Lee AL, Sharp KA, Kranz JK, Song XJ, Wand AJ. Biochemistry. 2002;41:13814–13825. doi: 10.1021/bi026380d. [DOI] [PubMed] [Google Scholar]
- 48.McCammon JA, Gelin BR, Karplus M. Nature. 1977;267:585–590. doi: 10.1038/267585a0. [DOI] [PubMed] [Google Scholar]
- 49.Long D, Li DW, Walter KF, Griesinger C, Bruschweiler R. Biophys. J. 2011;101:910–915. doi: 10.1016/j.bpj.2011.06.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Showalter SA, Johnson E, Rance M, Bruschweiler R. J. Am. Chem. Soc. 2007;129:14146–14147. doi: 10.1021/ja075976r. [DOI] [PubMed] [Google Scholar]
- 51.Wittebort RJ, Szabo A. J. Chem. Phys. 1978;69:1722–1736. [Google Scholar]
- 52.Glass DC, Krishnan M, Smith JC, Baudry J. J. Phys. Chem. B. 2013;117:3127–3134. doi: 10.1021/jp400564q. [DOI] [PubMed] [Google Scholar]
- 53.Chou JJ, Case DA, Bax A. J. Am. Chem. Soc. 2003;125:8959–8966. doi: 10.1021/ja029972s. [DOI] [PubMed] [Google Scholar]
- 54.Li DW, Showalter SA, Brüschweiler R. J. Phys. Chem. B. 2010;114:16036–16044. doi: 10.1021/jp109908u. [DOI] [PubMed] [Google Scholar]
- 55.DuBay KH, Geissler PL. J. Mol. Biol. 2009;391:484–497. doi: 10.1016/j.jmb.2009.05.068. [DOI] [PubMed] [Google Scholar]
- 56.Das A, Chakrabarti J, Ghosh M. Biophys. J. 2013;104:1274–1284. doi: 10.1016/j.bpj.2012.12.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Marshall GR. J. Comput. Aided Mol. Des. 2012;26:3–8. doi: 10.1007/s10822-011-9494-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.