Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 May 1.
Published in final edited form as: Biochim Biophys Acta. 2014 Sep 16;1850(5):932–943. doi: 10.1016/j.bbagen.2014.09.001

Correcting for the free energy costs of bond or angle constraints in molecular dynamics simulations

Gerhard König 1, Bernard R Brooks 2
PMCID: PMC4339525  NIHMSID: NIHMS632030  PMID: 25218695

Abstract

Background

Free energy simulations are an important tool in the arsenal of computational biophysics, allowing the calculation of thermodynamic properties of binding or enzymatic reactions. This paper introduces methods to increase the accuracy and precision of free energy calculations by calculating the free energy costs of constraints during post-processing. The primary purpose of employing constraints for these free energy methods is to increase the phase space overlap between ensembles, which is required for accuracy and convergence.

Methods

The free energy costs of applying or removing constraints are calculated as additional explicit steps in the free energy cycle. The new techniques focus on hard degrees of freedom and use both gradients and Hessian estimation. Enthalpy, vibrational entropy, and Jacobian free energy terms are considered.

Results

We demonstrate the utility of this method with simple classical systems involving harmonic and anharmonic oscillators, four-atomic benchmark systems, an alchemical mutation of ethane to methanol, and free energy simulations between alanine and serine. The errors for the analytical test cases are all below 0.0007 kcal/mol, and the accuracy of the free energy results of ethane to methanol is improved from 0.15 to 0.04 kcal/mol. For the alanine to serine case, the phase space overlaps of the unconstrained simulations range between 0.15 and 0.9%. The introduction of constraints increases the overlap up to 2.05%. On average, the overlap increases by 94% relative to the unconstrained value and precision is doubled.

Conclusions

The approach reduces errors arising from constraints by about an order of magnitude. Free energy simulations benefit from the use of constraints through enhanced convergence and higher precision.

General Significance

The primary utility of this approach is to calculate free energies for systems with disparate energy surfaces and bonded terms, especially in multi-scale molecular mechanics/quantum mechanics simulations.

Keywords: free energy calculations, constraint correction, molecular dynamics simulations, normal mode analysis, Bennett’s acceptance ratio

1. Introduction

Constraints are used in most molecular dynamics simulations, since the maximum length of the time step for integrating the Newtonian equations of motion is restricted by the frequency of the fastest motions in the system. Imposing constraints that remove the associated rapid vibrational modes makes those degrees of freedom rigid. Thus, it is possible to use longer time steps without losing conservation of energy or significantly distorting the desired ensemble. Thus, bond constraints can reduce the required computer time by a factor of three.[1] Furthermore, constraints can be a very valuable tool to improve the convergence and efficiency of free energy simulations, e.g., when using the Simplified Confinement[2] or Confine-and-Release methods.[3]

Several algorithms are available to impose constraints in molecular dynamics simulations.[4] Among the most widely used methods is the simple two body constraint SHAKE.[5] SHAKE constrains bonds and angles by finding forces that maintain the right geometry with the iterative Gauss-Seidel method. However, constraining too many bond distances that are coupled to each other through SHAKE is not practical due to the recursive nature of this method (e.g., ring structures). Prominent alternative approaches include the pairwise Rattle algorithm,[6] which also includes the velocities of the atoms, or the LINCS algorithm.[7] Recently, the Shape rigid body integrator[8] has been developed that combines the accuracy of SHAKE with the possibility to use higher numbers and more types of constraints (i.e., rigid bodies of three or more centers, thus including bonds, angles and dihedrals).

Notably, imposing constraints can affect the outcome of the simulation by restricting the phase space of specified degrees of freedom to a choice selected by the user. Van Gunsteren reported quenching of dihedral angle transitions if both bonds and angles are constrained.[9, 10] Furthermore, the efficiency of constrained simulations is lowered by angle constraints.[1] Toxvaerd pointed out that angle constraints change the trans-gauche transition rate of decane drastically.[11] Tobias and Brooks[12] determined that constraints shift the frequencies of the normal modes in biomolecules, but only in the region between 100 and 1400 cm−1. Similarly, Hinsen and Kneller found that the dynamics of polypeptides are slightly perturbed by constraints.[13] Recently, Echenique et al.[14] carefully analyzed and quantified the difference between unconstrained models and several types of constraints. Moreover, deviations between 0.2 and 0.5 kcal/mol were found in solvation free energy calculations when bond length constraints were employed.[15] Deviations of such magnitude highlight the need to properly account for free energy changes due to constraints in free energy simulations.

A number of publications have focused on methods for calculating constraint corrections[16, 17, 18, 19]; good overviews are provided by Boresch and Karplus[20] or Wang and Hermans.[21] The most prominent approach for such corrections is the use of the average constraint force.[22] However, this approach only seems to be valid for bond length constraints.[20] The potential of mean force corrections of Pearlman and Kollman[23, 24] compute the contribution to the free energy for systems with constrained bond terms. However, this requires additional simulations. Furthermore, most of the currently existing constraint corrections have been developed for free energy calculations with Thermodynamic Integration,[25] and, therefore, cannot be used straightforwardly in the increasingly popular Bennett’s acceptance ratio (BAR),[26] multistate-BAR (m-BAR),[27] or Non-Boltzmann Bennett (NBB)[28, 29] methods.

For free energy simulations with BAR, the primary utility of including constraints is to increase the phase space overlap between two distinct energy surfaces. Since the variance of the free energy estimate is directly linked to the phase space overlap, constraints allow free energy simulations to be more accurate and to converge more quickly (thus saving computational time). This is of particular utility for coupling disparate energy surfaces, such as QM/MM with MM, where small changes in bond lengths and angles can significantly decrease the necessary overlap needed for an accurate result. This increased accuracy and faster convergence rate off-sets the costs associated with post-processing constraints. Here, we attempt to provide a simple framework to correct for the effect of constraints in free energy calculations with BAR by postprocessing the involved trajectories. In particular, we achieve this by explicitly calculating the free energy costs of adding and removing constraints as additional steps in the free energy cycle. We achieve this by employing a gradient calculation combined with normal mode analysis[30] to approximate the contributions to the partition function associated with the constrained degrees of freedom. Notably, this approach is also compatible with most quantum-packages and QM/MM.[31, 32, 33, 34, 35, 36, 37]

Our approach is inspired by Gō and Scheraga,[38] who showed that bond lengths and angles can be treated as functions of the dihedral angles. This implies that the optimal position of a bond or angle at each frame of a trajectory can be determined after the simulation has been conducted with an suboptimal configuration of that particular bond or angle. In particular, they regarded proteins to be a collection of independent normal modes. Some of the normal modes can be considered hard if their force constant is high enough, while all other normal modes are considered soft. The hard normal modes tend to be insensitive to the changes in conformation, so they can be regarded as functions of the soft variables (changing the bond length and bond angles after the conformation changes). This approximation is of course only valid close to the equilibrium conformation. Using this procedure, the hard modes can be considered to be frozen (constrained) during the dynamics, while the soft modes are treated classically to approximate the partition function. Thus, to account for the changes of the hard degrees of freedom, it is possible to use a post-processing step that changes the bond lengths and angles according to their (soft) environment, and the free energy decrease associated with that change.

The remainder of this paper is organized as follows. First, we outline how to calculate the free energy costs of bond and angle constraints (referred to as “constraint correction”), starting with a simple harmonic oscillator, and progressing to the analysis of trajectories in free energy simulations with multiple constraints (Sect. 2). Methodological details of the benchmark systems and simulations are presented in Section 3. We then present the results for harmonic and anharmonic oscillators, four-atomic benchmark systems, water boxes of varying size, an alchemical mutation of ethane to methanol, and free energy simulations between alanine and serine (Sect. 4, see Figure 1). We conclude with a short discussion concerning the practical advantages of using constraints in free energy calculations in Section 5. A description of the implementation of the constraint correction in CHARMM, as well as a discussion of alternative ways to approximate the Hessian can be found in the Supplementary Material.

Figure 1.

Figure 1

Benchmark systems considered in this paper: Anharmonic oscillators, four-atomic benchmark systems, water boxes of varying size, an alchemical mutation of ethane to methanol, and free energy simulations between alanine and serine

2. Theory

To illustrate the problem of calculating the free energy of releasing a constraint, we start with a simple classical harmonic oscillator with an internal coordinate q. For this system, the potential energy U is given by

U(Δq)=U0+K2Δq2, (1)

where Δq is the deviation from the energy minimum, U0 is the zero point energy that depends on the reference point and K is the force constant of the bond or angle. Given this potential energy function, the partition function Z of the harmonic oscillator (Zh.o.) can be obtained by integrating over the single degree of freedom, q,

Zh.o.=e-βU(Δq)dΔq=e-βU0e-βK2Δq2dΔq. (2)

The absolute free energy of the harmonic oscillator is then

Gh.o.=-β-1lnZh.o.=U0-β-1lne-βK2Δq2dΔq (3)
=U0-β-1ln2πβK, (4)

where we have made use of the Gaussian integral.

If we introduce a constraint in the harmonic oscillator, we completely remove the degree of freedom q, so the integral in the partition function becomes a single point, corresponding to a Dirac delta δ(x). Making use of the integral of the Dirac delta, ∫ δ(x) dx = 1, the partition function of the constrained system becomes

Zconsh.o.=e-βU(Δqcons), (5)

and, correspondingly, the free energy is reduced to

Gconsh.o.=-β-1lnZconsh.o.=U(Δqcons). (6)

Accordingly the free energy of imposing a constraint corresponds to

ΔGconsh.o.=Gconsh.o.-Gh.o.=U(Δqcons)-U0+β-1ln2πβK. (7)

For convenience, it is possible to divide the total energy into two contributions. First, the enthalpic contribution ΔH,

ΔHh.o.=U(Δqcons)-U0, (8)

which is independent of the temperature. Notably, Uqcons) inherently contains U0.

Second, the entropic contribution ΔGharm from the vibrations of the harmonic potential,

ΔGharmh.o.=β-1ln2πβK. (9)

The total free energy of imposing the constraint is just

ΔGconsh.o.=ΔHh.o.+ΔGharmh.o.. (10)

2.1. Implementation using normal mode analysis

The above analysis is only valid if the harmonic oscillator is one-dimensional and isolated (i.e., without non-bonded interactions). In the presence of external forces (e.g., by adding Coulomb charges and Lennard Jones parameters to the oscillator and immersing it into a water box), the system becomes an anharmonic oscillator, with an unknown potential energy function. Starting from an arbitrary constrained structure (Δqcons, where the Δ refers to the deviation from the energy minimum) of the anharmonic oscillator, we can use a Taylor expansion to describe the potential energy function of the unconstrained system.

U(Δq)=U(Δqcons)+U(Δqcons)(Δq-Δqcons)+U(Δqcons)2(Δq-Δqcons)2+ (11)

where U′(Δqcons) and U″(Δqcons) denote the first and second derivative of the potential energy function evaluated at the constrained conformation. Since most classical molecular simulations of biomolecules are conducted at room temperature in a stable chemical environment, the Taylor series is dominated by the harmonic term. This implies that the chemical bond is not going to break and non-bonded interactions are mostly going to shift the energy minimum, but not the frequency. We, therefore, approximate the potential energy function by ignoring the terms with powers of three or higher, i.e.

U(Δq)U(Δqcons)+U(Δqcons)(Δq-Δqcons)+U(Δqcons)2(Δq-Δqcons)2. (12)

Notably, this approximation ignores all anharmonic contributions to the potential energy.

To identify the energy minimum of the unconstrained system, we can use U′(Δqcons) and U″(Δqcons) of the constrained structure in a one-step Newton-Raphson minimization. Since the first derivative is zero at the energy minimum,

ΔqconsU(Δqcons)U(Δqcons). (13)

The Newton Raphson step leads to an approximation of the enthalpic contribution in anharmonic oscillators, making use of equation 1,

ΔHa.o.U(Δqcons)22U(Δqcons). (14)

Correspondingly, if the constrained structure is close to the energy minimum, the harmonic entropy contribution can be approximated by

ΔGharma.o.β-1ln2πβU(Δqcons). (15)

However, if Δqcons is relatively large, it might be advisable to reevaluate U″ at the energy minimum to make sure that the frequencies are correct.

In biomolecules, multiple constraints are used simultaneously. It is, therefore, convenient to reformulate the above analysis in terms of a reduced basis harmonic analysis. This approach relies on partitioning the full Hessian matrix of the whole system (3n × 3n, where n is the number of atoms) into relevant and irrelevant parts, approximately block diagonalizing the Hessian. This assumes that the parts can be separated without incurring too large errors from the missing coupling. However, the influence of soft degrees of freedom on hard degrees of freedom is considered to be small and most hard terms are not coupled to each other (except for the Urey-Bradley term in bond angles).

The “relevant” part in this application are the constrained degrees of freedom. All other degrees of freedom need to be fixed during this analysis to make sure that the ensemble does not get perturbed. If there are m constrained degrees of freedom that need to be corrected, they form an orthonormal (sub-)basis set C that consists of m vectors c. Each vector must be normalized in root-mass-weighted Cartesian bases and must be defined in terms of Cartesian displacements. For a constrained bond ri, the associated basis vector is defined by ci=xri, where x are the 3n Cartesian coordinates of all atoms. The basis vector corresponds to a vector from the Cartesian coordinates of the first atom to the second atom involved in the bond and vice versa.

For a constrained angle θi between three atoms j, k, l (the constraint being imposed on the coordinates of atom l), the vector is ci=xθi, corresponding to the tangential vector (i.e. normal to the bond rkl) on the plane defined by the three atoms j, k, l. Notably, a displacement along such a vector changes the bond length rkl. However, this can be accounted for by including the curvature of the angle in the form of the vector along rkl according to Δx = sin(Δθi)cirkl + (1 − cos(Δθi))rlk, where rlk is the Cartesian vector between atoms l and k (this approach is implemented in the new CANG modes, which are discussed in the next section).

The corresponding reduced Hessian H for the constrained degrees of freedom (an m × m matrix) is generated by projecting the matrix of the mass weighted energy second derivatives F into the 3n × m reduced basis set C

H=CFC. (16)

Similarly, the force vector for the reduced basis set, g (for gradient), is generated from the force vector of the full system f by projection into the basis set,

g=Cf. (17)

Due to the coupling between the constrained degrees of freedom (in the form of off-diagonal elements in H), the reduced Hessian cannot be used directly in equations 14 and 15. However, using an eigendecomposition with an unitary matrix U, we obtain

Λ=UHU, (18)

which is a diagonal matrix that contains the eigenvalues, and, correspondingly, the force constants (and frequencies) of the constrained degrees of freedom. In CHARMM, the above analysis can be conducted in two different ways. a) using the REDUce command of VIBRAN, which corresponds to using the full 3N × m matrix C to evaluate the reduced Hessian and eigendecomposition. b) employing the RAYLeigh command, which conducts the above analysis separately for each vector c of the basis set (thus neglecting the coupling within the constrained degrees of freedom).

From here, it is straightforward to rewrite the equivalent correction terms for enthalpy and entropy of equations 14 and 15 with

ΔH12gΛ-1g1, (19)

and

ΔGharm12βln(2πβΛ-1)1, (20)

where || … ||1 denotes a sum over all elements. Since Λ is a diagonal matrix, applying the logarithm is straightforward and just involves taking the logarithm of each element.

2.2. Jacobian factors

The above constraint correction, and thus equation 10, only applies to internal coordinates. However, molecular dynamics simulations are performed in Cartesian coordinates. The conversion from internal coordinates to Cartesian coordinates introduces another contribution to the free energy, the so-called Jacobian. It is entropic in nature and arises because longer bond lengths and obtuse angles have more phase space available than short bonds or straight angles. Thus, they are required for both bonds and angles, but only have to be considered for degrees of freedom that change (i.e., the degrees of freedom that are affected by the constraints).[39, 20] The free energy change related to the release of constraints in Cartesian space, therefore, becomes

ΔGrls=-ΔGcons=-ΔH-ΔGharm-ΔGJacobian. (21)

For the general case, the change of the Jacobian contribution to the free energy as constraints are released, GJacobian, can be calculated by

ΔGJacobian=-β-1ln(iJiafterJibefore), (22)

where Ji is the Jacobian factor for the transformation between internal and Cartesian coordinates of the degree of freedom corresponding to the ith constraint. The superscripts “before” and “after” refer to the coordinates before and after applying the correction. The necessary factors J can be obtained analytically by using the rigid rotor analysis developed by Herschbach et al.[40], where the Jacobian term for a bond r between two atoms j and k is

Jr=rjk2, (23)

and the Jacobian for an angle θjkl between atoms j, k, and l is

Jθ=sinθjkl, (24)

if j, k and l form a linear chain. If a terminal atom k branches off from a chain of atoms i, j, at atom j, the Jacobian factor of the angle θ′ between the bond rjk and the plane defined by the two bonds rij and rjℓ is defined by

Jθ=sin-1θιk. (25)

Thus, special care has to be taken when dealing with angle constraints of branching atoms, as the current version of the code only considers angle constraints of linear chains. (Notably, this is not a major restriction since the SHAKE algorithm is usually not applied to angles of branched structures, since they cause poor convergence).

2.3. Application to free energy simulations

Running a free energy simulation with constraints corresponds to exploring phase space for a confined subspace, only considering an infinitesimally slim slice of the phase space available to the degrees of freedom that have been constrained. Accounting for the energetic costs of releasing a constraint thus corresponds to the effect of adding extra dimensions to the constrained free energy result. This problem is similar to the treatment of separate parts of phase space in rotational isomers, developed by Straatsma and McCammon.[41]

For this purpose, we consider that the partition function of the unconstrained system Z to consist of integrals for the constrained degrees of freedom (Zcons) and the additional degrees of freedom that arise from releasing the constraints (Zadd). This leads to

Z=Zcons.Zadd=consadde-βU(X)=conse-βU0(ΔXcons)adde-βU(ΔX) (26)
=conse-βU0(ΔXcons)e-βΔGcons(Xcons), (27)

where U0 is the part of the potential energy that only depends on the unconstrained degrees of freedom, and the free energy of releasing the constraint (ΔGcons(xcons)) is calculated for each point in the integral over the constrained degrees of freedom.

The free energy change due to releasing all constraints in a free energy simulation is

ΔGrls=-ΔGcons=β-1lnZZcons=β-1lnconse-βU0(ΔXcons)e-βΔGcons(Xcons)conse-βU0(ΔXcons) (28)
=β-1lne-βΔGcons(Xcons)cons, (29)

where 〈〉cons denotes an ensemble average using constraints. Notably, this result corresponds to the Zwanzig equation[42] (also known as Thermodynamic Perturbation or the exponential formula), which is implemented in most major simulation packages.

Thus, the harmonic analysis corresponds to performing an a posteriori sampling of the constrained degrees of freedom. If the target potential is harmonic, it actually performs perfect sampling of that degree of freedom, given a particular set of “soft” coordinates. Therefore, the only errors arise from anharmonicity and from incomplete sampling of the soft degrees of freedom. Those issues are discussed in Sections 4.1 and 4.2.

3. Methods

3.1. (An-)harmonic oscillators

Our first model systems are (an-)harmonic oscillators that mimic a chemical bond between two atoms. While the first atom is fixed, the second atom is free to change its bond length and corresponds to a hydrogen atom. For such systems, quasi-analytical results can be computed very easily. We consider three cases a) an isolated harmonic oscillator in gas phase (without interaction partners) b) the harmonic oscillator in the presence of a sodium ion at a fixed distance of 2.5 Å from the initial position of the hydrogen, projected along the bond c) the harmonic oscillator in the presence of a chlorine ion at a fixed distance of 2.5 Å from the initial position of the hydrogen, projected along the bond. Cases b) and c) correspond to a transformation into anharmonic oscillators. Notably, the only degree of freedom is the bond length of the hydrogen atom.

Each of the three cases was evaluated with parameters from five different types of hydrogens from the CHARMM general force field[43]: HGA3 (corresponding to a hydrogen in ethane), HGA5 (corresponding to a hydrogen in ethene), HGR61 (corresponding to a hydrogen in benzene), HGP1 (corresponding to the hydrogen of the hydroxyl group in, e.g., methanol), and HGP3 (corresponding to the hydrogen of the thiol group in, e.g., ethanethiol).

Using equation 3, it is possible to compute the absolute free energies of each type of harmonic oscillator by numerical integration of the partition function. All integrations were carried out with the NIntegrate function of Mathematica 9.0 with AccuracyGoal set to Infinity. The singularities of the potential energy function that arise when the positions of the hydrogens and the chlorine atom overlap were avoided by integrating only within ±1.9 Å of the equilibrium bond length (we tested integration ranges between 1 and 2 Å, but the free energy results were invariant to the choice of this cutoff since the probability of states far away from the equilibrium bond length is smaller than the finite machine precision of the computer representation). The gradients and Hessians for equations 13, 14, and 15 were determined by numerical differentiation with the ND function at the equilibrium bond length specified in the CGenFF parameter set. We used a scale of variation of 0.0001 Å. Hessians were computed with the ND function at the global energy minimum (as determined by the FindMinimum function).

3.2. Four-atomic benchmark systems

The calculations on the four-atomic benchmark system test whether the correction step is capable to account for changes of bond lengths and angles. This benchmark system is very sensitive to deviations from the right ensemble (e.g. by using the wrong thermostat) and has been used extensively in the literature. [9, 39, 20, 44, 45] Accurate results can be calculated both analytically and with appropriate molecular dynamics simulations.

The initial state for all corrections had equilibrium bond lengths of 2 Å for all bonds. The force constant for all bonds was 200 kcal/(mol Å2). Bond angles were set to 110°, with angle force constants of 50 kcal/(mol rad2). The dihedral force constant was 1 kcal/mol with a multiplicity 3.

We computed the free energy differences to end states with the following changes relative to this initial state: a) Changing the first bond length to r12= 2.1, 2.25, 2.5, 3 Å b) Changing the first bond angle to θ123=111, 115, 120, 135, 160° c) Changing the bond lengths and angles of the first and last atom to r12 = 3 Å, r34 = 1 Å, θ123 = 160°, θ234 = 50°. The corrections were performed with the new OPTI command in the VIBRAN code of CHARMM.

3.3. Water boxes

Five different boxes with TIP3P[46, 47] water were prepared. The first consisted only of a water pentamer in gas phase without periodic boundary conditions. For this system, the structure of the global energy minimum[48] was used as the initial state. Water boxes with 395, 787, 1636, and 3290 molecules were generated with CHARMM-GUI[49], using cubic boxes with side lengths of 24.01, 30.25, 38.11, and 48.02 Å, respectively. Since the VIBRAN code does not support Ewald summation, the nonbonded interactions were calculated with a cutoff between 10 and 12 Å, using force-shifting for electrostatic interactions and shifting for Van-der-Waals interactions.

In the first step, SHAKE constraints were imposed according to the equilibrium bond length in the parameter file, and each water box was minimized to the local energy minimum using 100 steps of steepest descent and 2000 steps of Adopted Basis Newton-Raphson minimization. This corresponds to the “uncorrected” structure with constraints and a potential energy Uuncorrected. Next, the SHAKE constraints were released and the constraint correction was calculated, using all bond lengths and bond angles as the basis set for the harmonic analysis with the CBND and CANG mode types. This leads to the “corrected” structure with a potential energy Ucorrected. To determine the true local energy minimum, the corrected structure was further minimized with 100 steps of steepest descent and 2000 steps Adopted Basis Newton-Raphson minimization, leading to the potential energy of the minimum Umin.

Ethane-Methanol

Solvation free energy differences between ethane and methanol were calculated as described in Reference 15. All simulations were conducted with CHARMM[50, 51], using the CHARMM22[47] force field. Following the recommendations by Boresch and Karplus,[20, 52] a dual topology hybrid scheme was implemented using the MSCALE module[53] of CHARMM. Bond and angle terms were calculated in the main MSCALE process, while the dihedral and non-bonded energy terms were evaluated in two subprocesses: one corresponding to the ethane end state and one corresponding to methanol. The potential energies from the subprocesses were mixed according to λ.

Gas phase simulations were conducted with Langevin dynamics, using a friction coefficient of 5 ps−1 on all atoms and a temperature of 300 K. The cut-off radius was set to 998 Å. In solution, 862 water molecules and an octahedral box of 32.168 Å were used. The temperature was maintained at about 300 K by a Nosé-Hoover thermostat[54]. Lennard-Jones interactions were switched off between 10 and 12 Å, while electrostatic interactions were computed with the PME method.[55] The time step was 1 fs and simulations with and without SHAKE on all hydrogen atoms were employed.

Free energy differences were calculated with Bennett’s acceptance ratio method[26] based on simulations of 5 ns in gas phase and 1 ns in solution. Trajectories were written every 100 steps in gas phase and every 20 steps in solution. For the free energy calculations, five λ points were employed in gas phase (0.00, 0.25, 0.50, 0.75 and 1.00) and eleven in solution (0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0). The standard deviations of the free energy results were determined by repeating each simulation four times, starting with different initial random velocities.

Alanine-Serine

Free energy differences between N-acetyl-methylamide alanine and serine were calculated in aqueous solution. Both amino acids were simulated using the AMBER Cornell et al.[56] and the CHARMM22[47] force fields once with SHAKE constraints and once without SHAKE constraints. For better comparability, the same setup was used as in Reference 57. 243 TIP3P water molecules were present, and the temperature was maintained at about 300 K by a Nosé-Hoover thermostat[54]. Lennard-Jones interactions were switched off between 9–10 Å, while electrostatic interactions were computed with the Particle Mesh Ewald method [55]. The simulation box was a truncated octahedron, cut out of a cube with side length 21.4 Å. Trajectories were written every 10 steps. The standard deviations of the free energy results were determined by repeating each simulation four times, starting from different initial random velocities. The energies required for BAR were extracted from the trajectories using the EAVG command of the BLOCK module of CHARMM. The constraint corrections were calculated using the VIBRAN module of CHARMM and the analyzed with the Zwanzig equation[42].

4. Results and Discussion

4.1. Analytical results for (an-)harmonic oscillators

The magnitude of ΔGcons depends on multiple factors, including the force constant of the bond or angle, but also non-bonded interactions. To illustrate the dependency on those factors quantitatively, we calculated ΔGcons for three different kinds of oscillators. In each case, a hydrogen atom is attached to a fixed, non-interacting atom at a distance that corresponds to the equilibrium bond length in the parameter set. In the first instance (gas phase), there are no non-bonded interactions that influence the bond. In the second and third data set, the harmonic oscillator is interacting with a fixed sodium or chloride ion at a distance of 2.5 Å. The two cases illustrate how ΔGcons can be affected by interactions with the environment. To test the influence of the bonded terms on ΔGcons, the benchmark systems were evaluated using parameters from several hydrogen atom types in the CHARMM Generalized Force Field (HGA3, HGA5, HGR61, HGP1, HGP3). The hydrogen types correspond to hydrogens found in ethane, ethene, benzene, methanol and ethanethiol, thus covering both apolar, aromatic and polar groups that are encountered in biomolecular simulation. Since the only degree of freedom is the bond length of the hydrogen, the free energy can be calculated analytically for those systems. Table 1 lists their ΔGcons, as well as their enthalpic (ΔH) and entropic components (ΔGharm, ΔGJacobian).

Table 1.

Examples of free energy differences from imposing an constraint for multiple hydrogen types in (an)harmonic oscillators. The individual contributions from enthalpy, vibrational entropy, the Jacobian factor and anharmonic effects are listed in the four rightmost columns. The free energies are evaluated in gas phase and in the presence of a sodium and a chlorine ion, setting the constraint to the equilibrium bond length from the parameter file. All data are in kcal/mol.

Hydrogen type a ΔGconsb ΔHc ΔGharmd ΔGJacobiane Anharm. Errorf
In gas phase at equilibrium bond length
HGA3 (ethane) −1.528 0 −1.528 0 0
HGA5 (ethene) −1.565 0 −1.565 0 0
HGR61 (benzene) −1.544 0 −1.544 0 0
HGP1 (methanol) −1.684 0 −1.684 0 0
HGP3 (ethanethiol) −1.481 0 −1.481 0 0

Interacting with a sodium ion at 2.5 Å
HGA3 (ethane) −1.512 0.017 −1.530 −0.002 0.0001
HGA5 (ethene) −1.488 0.080 −1.568 −0.004 0.0001
HGR61 (benzene) −1.519 0.027 −1.546 −0.002 0.0001
HGP1 (methanol) −1.477 0.211 −1.687 −0.005 0.0003
HGP3 (ethanethiol) −1.423 0.060 −1.483 −0.004 0.0005

Steric clash with a chlorine ion at 2.5 Å
HGA3 (ethane) −1.390 0.179 −1.570 −0.005 0.0000
HGA5 (ethene) −1.576 0.019 −1.595 −0.001 0.0002
HGR61 (benzene) −1.362 0.227 −1.589 −0.006 0.0007
HGP1 (methanol) −1.461 0.218 −1.678 −0.005 0.0004
HGP3 (ethanethiol) −1.428 0.052 −1.480 +0.003 −0.0001
a

Atom type in the CHARMM General Force Field

b

Free energy difference of releasing the constraint

c

Enthalpic contribution from potential energy difference to the energy minimum

d

Vibrational entropy determined by force constant

e

Jacobian factor that accounts for transformation from internal to Cartesian coordinates

f

Anharmonic error that was not accounted for in the harmonic analysis

In gas phase (top third of Table 1), there are no interactions that might influence the free energy except for the bond stretching term of the harmonic oscillator itself. Since the constraint was introduced at the equilibrium bond length, the system resides at the energy minimum. Therefore there are neither enthalpic contributions nor contributions from the Jacobian, as the bond length does not change when the constraint is released. The only contributions to ΔGcons arise from the vibrational entropy (fourth column), which is directly related to the force constant of the bond. The weakest force constant is found for the thiol-hydrogen type HGP3 (last row) which, correspondingly, exhibits the lowest entropic penalty of introducing a free bond with 1.481 kcal/mol. The system with the highest vibrational contribution is HGP1, which corresponds to a hydroxyl group. Due to its strong force constant, the potential energy rises faster as the bond length changes, thus limiting the available phase space for the bond more than all the other atom types. Correspondingly, ΔGharm is also an indicator whether external forces are limiting the available phase space of the bond (a larger absolute value of ΔGharm indicating more confinement).

By adding a positively charged sodium ion at a distance of 2.5 Å (middle third of Table 1), the hydrogens are affected by repulsive electrostatic forces, and, depending on the Van-der-Waals radius of they hydrogen type, slightly attractive or repulsive nonpolar interactions. Due to the external forces, the energy minimum of the bond is changed, which results in a change of bond length if the constraint is released. Correspondingly, the enthalpy changes according to the potential energy difference relative to the energy minimum, and a Jacobian term arises due to the change of the bond length. The vibrational entropy ΔGharm is the dominant contribution to ΔGcons. While the results for ΔGcons lie between −1.423 and −1.519 kcal/mol, ΔGharm ranges between −1.48 and −1.68 kcal/mol. The enthalpic contributions from ΔH amount to up to 0.211 kcal/mol and the Jacobian factors are below −0.005 kcal/mol.

Based on the values for ΔGcons and its contributions, it might appear tempting to neglect the small contributions from ΔH and ΔGJacobian, since they only represent a few percent of the total free energy change. However, if the ΔGcons in gas phase and in the presence of sodium were used in a thermodynamic cycle to calculate the binding affinity of sodium to the harmonic oscillator, the importance of ΔH and ΔGJacobian becomes clearer. In such a situation, the contributions of ΔGharm mostly cancel between gas phase and in the presence of sodium, leaving differences below 0.003 kcal/mol. This is orders of magnitude lower than the contributions from ΔH (0.017–0.211 kcal/mol), and roughly in the same range as the Jacobian factor (−0.002 to −0.005 kcal/mol). Therefore, all three factors are relevant for most applications involving free energy simulations, especially when considering that their contributions are going to add up if multiple constraints are present in the simulation.

Placing a chlorine at a distance of 2.5 Å from the hydrogen (bottom third of Table 1) causes a steric clash for most hydrogen types. This leads to larger values of ΔH (between 0.052 and 0.383 kcal/mol) since changing the bond length allows the system to relax. Compared to gas phase, ΔGharm increases by about 0.03 to 0.05 kcal/mol, since the repulsion from the chlorine limits the available phase space of the bonds. Markedly, the polar hydrogen types HGP1 and HGP3 are an exception to this trend as they have very small Van-der-Waals radii due to polarization. Since they also have relatively large positive charges, they are attracted by the negatively charged chlorine. This leads to a stretching of the bond, which results in positive Jacobian factors. Notably, the Jacobian contribution ΔGJacobian can be positive or negative, while ΔH and is greater or equal to zero (the potential energy of the constrained conformation cannot be lower than the energy minimum) and ΔGharm is always negative as no entropy can be gained by introducing a bond.

One of the main assumptions of the method described here is that bond and angle terms can be sufficiently approximated by a harmonic potential. Any anharmonicity will lead directly to errors in the estimate of ΔGcons. For the analytic case studies in this section, it is straightforward to evaluate the error arising from anharmonicity by comparing the results for ΔGcons with the analytical free energies of the constrained and unconstrained systems. The deviations of the harmonic approximation from the analytical results are shown in the rightmost column of Table 1 for the systems in gas phase and in presence of sodium or chlorine. In gas phase, there is no anharmonicity, since the potential energy is fully determined by the harmonic potential of the bond of the harmonic oscillator. In the presence of non-bonded forces, some anharmonicity arises since the Lennard Jones and Coulomb potentials are not harmonic. However, its effect on ΔGcons is negligible, since the changes of the bond length are small. Even in the worst case encountered here (HGR61 with chlorine) the anharmonic contribution is 0.0007 kcal/mol, which is three orders of magnitude smaller than the effect of ΔH, and also one order of magnitude smaller than the Jacobian factor. Notably, the anharmonic contribution can be both positive or negative, as demonstrated by the results for HGP3 with chlorine. This indicates the possibility of cancellation of errors if multiple constraints are corrected at the same time. Moreover, since SHAKE constraints are usually applied to hydrogens in biomolecules at room temperature, the errors from the harmonic approximation in the proposed correction will, therefore, not be a limiting factor for the accuracy of free energy calculations.

Four atomic-benchmark systems

In addition to corrections of the bond length, as illustrated in the previous section, the suggested algorithm also allows the correction of bond angles. In order to test the performance of both bond and angle corrections, we employ changes of the two terminal atoms of a linear four-atomic benchmark system. Since no non-bonded interactions are calculated for this benchmark system, it is possible to calculate the free energy analytically using the rigid-rotor assumption. Furthermore, precise reference results are also available from computer simulations reported in the literature.[39, 45] The four-atomic benchmark system has proven itself very valuable in the past, as it is very sensitive to even small errors of the thermostat.

Table 2 lists results for three kinds of mutations. In the first section, mutations of the bond length between atoms 1 and 2 are considered. This serves to test the implementation of the correction in CHARMM and also illustrates the dependency on the magnitude of the bond length change. Similarly, the second section tests the accuracy of the bond angle correction for the angle between atoms 1, 2 and 3. It highlights the dependence of the accuracy on the bond angle change. In the last row, we evaluate the performance of the correction if two bond lengths (between atoms 1-2 and 3-4) and two bond angles (between atoms 1-2-3 and 2-3-4) are changed simultaneously to check for possible errors when multiple coupled degrees of freedom are changed.

Table 2.

Mutations of a four atomic benchmark system based on a single structure. The first four columns indicate the magnitude of the change and which bond or angle was perturbed (first bond, last bond, first angle, last angle). The three rightmost columns present the deviation from analytical reference free energies for the uncorrected system, and two different implementations of the correction: one with the standard normal modes in CHARMM (sixth column) and the other the new CBND and CANG normal modes (rightmost column).

Deviation from equilibrium Deviation from analytical free energy (kcal/mol)
Δr12 a Δr34b Δϕ123c Δϕ234d Uncorrected e NMD f C-NMD g
Changes of bond length (Å)
0.10 2.00E+00 6.39E-05 6.91E-19
0.25 1.25E+01 2.38E-03 1.66E-18
0.50 5.00E+01 3.50E-02 5.38E-19
1.00 2.00E+02 4.71E-01 6.04E-18
Changes of bond angle (°)
1 1.52E-02 6.74E-06 9.86E-19
5 3.81E-01 3.98E-03 3.34E-16
10 1.52E+00 5.95E-02 3.34E-16
25 9.52E+00 1.93E+00 3.37E-16
50 3.81E+01 2.40E+01 4.05E-19

Changing both bond lengths and angles of terminal atoms
1.00 −1.00 50 −50 4.76E+02 8.53E+01 7.37E-18
a

Deviation of bond between atoms 1 and 2 from equilibrium bond length in Å

b

Deviation of bond between atoms 3 and 4 (Å)

c

Deviation of bond angle between atoms 1, 2, and 3 from equilibrium in degrees (°)

d

Deviation of bond angle between atoms 2, 3, and 4 (°)

e

Original deviation from analytical free energy result

f

Deviation from analytical free energy result after a correction step based on normal modes and vibrational entropy

g

Deviation from analytical free energy result after a correction step based on CBND and CANG normal modes that account for enthalpy, vibrational entropy, Jacobian contributions and include curvature of bond angles.

For each mutation, we consider three cases: a) not using a correction (Uncorrected, fifth column) b) applying a correction with the OPTI command, but using the standard implementation of normal modes (NMD, sixth column). In this case, the Jacobian term and the curvature of bond angles are not accounted for c) employing the dedicated CBND and CANG normal modes, which save the data required for the correction of the Jacobian factor and for bond angles (C-NMD, rightmost column).

Turning the attention to the first section of Table 2 (changes of bond length), it is possible to see that four different bond length changes are considered: 0.10, 0.25, 0.50, and 1.00 Å. All of these changes are larger than the expected bond length change when releasing SHAKE constraints. They primarily serve to detect even small errors in the implementation. If uncorrected, the resulting errors range between 2 and 200 kcal/mol. By using the OPTI command with conventional normal modes in CHARMM, this error is reduced significantly to between 6.39E-05 and 0.47 kcal/mol. The residual error arises from the missing correction of the Jacobian term. With the dedicated CBND normal modes, the required data for the computation of the Jacobian term are saved. Since there are no non-bonded terms in this benchmark system, there is no anharmonicity, so the accuracy is only limited by the finite machine precision of the computer, leading to errors below 6.04E-18 kcal/mol.

In the second section of Table 2, multiple mutations of the bond angle between the first three atoms of the benchmark system are considered. The angle is changed by between 1 and 50 degrees, leading to errors of up to 38 kcal/mol. When using the standard type of normal modes, only the tangential vector of the bond angle is used for the correction. In case of large displacements of the angle, the use of only the tangential vector will change the bond length of the terminal atom. Thus, errors of up to 24 kcal/mol can remain after the correction step with conventional normal modes. By using the dedicated CANG normal modes, the curvature of the bond angle is considered when conducting the correction. Thus the bond length is maintained during the correction step, leading to errors below 3.34E –16 kcal/mol. However, in order to be able to obtain such high accuracy it is necessary to correct the bond lengths with CBND before the bond angles are corrected with CANG, otherwise the vectors of the bond lengths are not up to date. Moreover, the correction of bond angles employs trigonometric functions, which can lead to errors if the angle has to be moved to a different quadrant, but such drastic changes are not very likely.

To test if the simultaneous correction of multiple constraints has a negative impact on accuracy, all bond lengths and angles of the terminal atoms were changed by ±1Å and ±50°. This leads to an error of 476 kcal/mol. With the conventional normal modes without Jacobian and curvature corrections, the error is reduced to 85 kcal/mol. With the CBND and CANG normal mode types, the error is just 7.37E – 18, which arises from finite machine precision. This demonstrates that constraints on bonds and angles can be fully corrected if there is no coupling by non-bonded interactions.

4.2. Impact of soft degrees of freedom

The described correction is only applicable to hard degrees of freedom (bond and angle terms). However, to reach the local energy minimum, usually also a rearrangement of soft degrees of freedom (dihedral angles, translations, rotations) is necessary after the bond and angles terms have been corrected for. Unfortunately, the constraint correction cannot be applied to soft degrees of freedom, since they are not harmonic and strongly depend on external forces. There is no straightforward way to connect them to the original constrained ensemble. Moreover, changing the soft degrees of freedom would introduce a bias, since the resulting conformations would not be generated according to their correct Boltzmann weight. However, since relatively low energies are required to change soft degrees of freedom, the relevant conformations can be sampled even in the presence of constraints, given enough time.

Since the soft degrees cannot be corrected for, it is not possible to fully connect the constrained ensemble with the unconstrained ensemble in one step. There will be a residual error if the correction is applied to systems where a rearrangement is required. Given this fundamental restriction, a correction step can only aim to minimize this residual error. To evaluate the magnitude of the residual error, we apply the constraint correction to water boxes of varying size and compute the difference between the potential energy of the conformation after the correction from the true local energy minimum. Water is a very polar molecule, so its dipole moment is significantly affected by the bond lengths and angles. In turn, the dipole moment affects the orientation of water relative to the other water molecules around it. Since water clusters are arguably among the most flexible complexes imaginable in biochemistry, this benchmark system can be considered very strongly dependent on soft degrees of freedom.

Table 3 lists the errors introduced by stretching the bonds by 0.05 Å and angles by 1 degree for all TIP3P water molecules in the box. Of course, this benchmark is of no of practical relevance since TIP3P waters are built to be completely rigid, but it is a valuable test since its electrostatic interactions are extremely sensitive to the geometry. The number of water molecules in the box is shown in the first column (NH2O), ranging between 5 and 3290. The errors are presented for the uncorrected structures (second column) and after the correction (third column). The fourth column (% Reduction) indicates the percentage of the error that was corrected for in a single step. The errors before the correction range between 11.2 kcal/mol for a water pentamer and 2256.4 kcal/mol for a box of 3290 water molecules. Given the relatively small perturbations that were employed, this is a rather large change of the potential energy, highlighting the sensitivity of this benchmark. If divided by the number of water molecules, the average error per water molecule is about 0.6 kcal/mol for all box sizes. After one step of correction, those errors are reduced to between 0.2 and 205.9 kcal/mol. As indicated in the fourth column, the percentage of error reduction is consistently around 90% for all cases. This corresponds to a residual error of approximately 0.05 kcal/mol per water molecule, which is an order of magnitude smaller than the uncorrected case, demonstrating that the correction is not perfect, but allows a considerable reduction of the error.

Table 3.

Deviations from the local potential energy minimum of unconstrained water structures with increasing size. All SHAKE constraints were released and only the enthalpy correction (ΔH) was applied. The first column indicates the number of water molecules in the box (NH2O). The second and third column highlight the deviation from the minimal potential energy before and after the optimization. The error reduction in percent and the required computer time are shown in the two rightmost columns.

NH2Oa Uncorrected b Corrected c % Reduction d Time e
5 11.2 0.2 98.2 <0.01

395 263.6 20.0 92.4 29.0
787 538.7 53.9 90.0 227.4
1636 1111.0 105.0 90.5 2038.7
3290 2256.4 205.9 90.9 16494.8
a

Number of water molecules

b

Deviation of the potential energy of the perturbed structure from the potential energy minimum (kcal/mol)

c

Deviation of the potential energy of the corrected structure from the potential energy minimum after OPTI correction of bonds and angles. (kcal/mol)

d

Error reduction of correction in percent

e

Required CPU time for the calculation of Hessians and OPTI command in seconds on a single Q9300 2.50GHz processor

One drawback of the correction approach is the time-consuming computation of the Hessians. The required CPU time for one correction step is indicated in the rightmost column of Table 3 (Time). For a small number of constrained atoms, the computational costs are quite acceptable, but, unfortunately, the costs rise cubically with the number of atoms. Thus, the costs quickly become sizable. While a computation for 395 water molecules (1185 degrees of freedom that have to be corrected) requires 29 seconds on a single Q9300 2.50 GHz processor, doubling the number of molecules (787 molecules, 2361 degrees of freedom) takes almost 4 minutes (227 seconds). Doubling the number of atoms again (1636 molecules, 4908 degrees of freedom) takes half an hour. For more than 3239 molecules, the computation quickly becomes intractable (more than 4.5 hours for a single computation, high memory demand).

In the Supplementary Material we discuss several alternative approaches to approximate the Hessians. If the Hessian is approximated by the force constant in the parameter set, the correction of 3290 water molecules can be performed in 0.56 seconds on a single core (instead of 16494.8 seconds with the calculation of all Hessians). To put this number into perspective, a single energy call on a single core for the same system requires 1.87 seconds, so increasing the accuracy by employing the correction during a post-processing analysis of a trajectory only takes 30% longer. Furthermore, this process can be easily parallelized, since each frame of a trajectory can be treated independently. If the RAYLeigh command is used the compute the Hessians, also the modes are independent of each other. Thus the proposed correction can make optimal use of modern hardware architectures.

4.3. Application to free energy calculations

We use an alchemical transformation from ethane to methanol to evaluate the performance of the constraint correction in a simple free energy calculation. Very high accuracy can be achieved for this system, and proper conformational sampling can be performed within nanoseconds of simulation time. Thus it has become an informal standard benchmark for free energy methods[58, 59, 52, 60, 61, 57, 15]. The mutation is performed both in gas phase and in explicit solvent, using the standard thermodynamic cycle to calculate solvation free energy differences. The corresponding relative free energies in gas phase (ΔAgas), aqueous solution (ΔAH2O) and the solvation free energy difference (ΔΔAsolv) are shown in Table 4. The alchemical mutation was carried out once without SHAKE constraints, using a time step of 1 fs (1.0fs, first column) and once with SHAKE constraints on all hydrogens in the dual topology setup (also including dummy-hydrogens), using a time step of 1 fs (1.0fs/SHAKE, second column). The corresponding free energy result that incorporates the constraint correction for all hydrogens in the dual topology setup is shown in the third column (1.0fs/corr). While the unconstrained simulations yield a deviation of only 0.04 kcal/mol from experiment (−6.93 kcal/mol)[62], the introduction of SHAKE constraints increases the error to about 0.19 kcal/mol. Most of this error (0.18 kcal/mol) arises on the methanol side of the mutation ( ΔAconsMethanol in rightmost column of Table 4) and can probably be attributed to the missing stretching of the hydroxyl bond to accommodate the electrostatic interactions and hydrogen bonds with water. After the constraint correction is applied, the deviation from experiment is reduced, and the deviation from the unconstrained results is only 0.04 kcal/mol. Notably, the difference between unconstrained and corrected results is not statistically significant. This demonstrates that the correction can be a useful tool in free energy calculations, especially if strong non-bonded interactions that might change the bond length during the simulation are present.

Table 4.

Relative solvation free energy for the mutation of ethane to methanol. Free energy simulations are conducted once without SHAKE, using a time step of 1 fs (1.0fs, first column) and once with SHAKE, using a time step of 1 fs. The errors from using constraints are corrected for in the third column. All energies are reported in kcal/mol.

1.0fs a 1.0fs/SHAKE b 1.0fs/corr c
ΔAconsEthane
ΔAconsMethanol
ΔAH2Od −0.86 ± 0.02 −0.70 ± 0.03 −0.90 ± 0.03 10.972 10.772
ΔAgase 6.024 ± 0.006 6.039 ± 0.002 6.027 ± 0.002 10.96404 10.95180
ΔΔAsolvf −6.89 ± 0.02 −6.74 ± 0.03 −6.93 ± 0.03 0.008 −0.180
Deviationg 0.15 0.04
a

Unconstrained simulations with δt = 1.0 fs

b

Constrained simulations with SHAKE applied to all hydrogens and δt = 1.0 fs

c

Results from column b after applying the correction for constraints on solute at both end points

d

Free energy difference in aqueous solution

e

Free energy difference in gas phase

f

Solvation free energy difference (ΔAH2O − ΔAgas), Experimental reference −6.93 kcal/mol - Ref. 62

g

Deviation from the unconstrained simulation results

4.4. Free energy difference between alanine and serine

Previous work[57] has shown that the transformation of alanine to serine is among the largest alchemical mutations that can be conducted in a single step (i.e., without intermediate states). However, the phase space overlap between the end states is only 0.15%, which is critically low. Since the phase space overlap is directly connected to the variance of the free energy estimate in BAR (c.f., equation 11 in Ref. 26), low overlap can lead to poor convergence and imprecise results. As a rule of thumb, it is recommended to ensure a phase space overlap of at least 1%.[63, 64] Interestingly, when conducing the same mutation of alanine to serine using the AMBER Cornell et al. force field,[56] one obtains a slightly higher phase space overlap of 0.55%. Using BAR, it is possible to compute the free energy differences between CHARMM and AMBER, yielding a phase space overlap of 0.90% for alanine and 0.55% for serine. Notably, all overlaps mentioned so far are below the threshold of 1%, indicating that the convergence of the calculations can be poor. To evaluate the performance of free energy simulations with SHAKE constraints on all hydrogens in the dual topology setup, we repeat the free energy simulations of Ref. 57 in aqueous solution once without constraints and once with constraints.

Figure 2 presents the free energy differences between alanine and serine, using both CHARMM and AMBER, with and without constraints. Results for the CHARMM force field are shown in the top cycle, the AMBER results are listed in the cycle at the bottom. In addition, we used BAR to compute the free energy differences resulting from switching the force fields (vertical arrows). Notably, the large free energy differences of the vertical arrows reflect the different arbitrary zero point references of the two force fields, and are not physically meaningful. The thick arrows indicate constraint corrections. Notably, the unconstrained horizontal free energy differences between alanine and serine are in excellent agreement with previously published results (4.94 and −11.62 kcal/mol versus 4.93 and −11.69 kcal/mol).[57, 65]

Figure 2.

Figure 2

Free energy differences in aqueous solution between alanine (left) and serine (right), using both the CHARMM (top) and the AMBER (bottom) force fields, and both unconstrained (front) and constrained trajectories with SHAKE on all hydrogens (back). Bold arrows and numbers indicate the use of the constraint correction based on equation 29. All other free energy differences were calculated with BAR. All data reported in kcal/mol.

Focusing on the constraint corrections between back and front plane, it is possible to see that the standard deviations are very low (less than 0.003 kcal/mol). This indicates the fluctuations of ΔGcons are low, and, therefore, the correction converges very quickly even though exponential averaging is used to obtain the free energy costs of the constraints. Due to the low fluctuations, it is probably possible to use fewer data points to correct for the constraints, thus saving computational costs.

The six sides of the cube shown in Fig. 2 represent thermodynamic cycles, and can also be used to gauge the accuracy and precision of our calculations. Since the free energy is a state function, the free energy change associated with closing each thermodynamic cycle should be zero. Any deviations are called cycle closing error and represent a lower bound for non-canceling errors or indicate lack of convergence. Summing all free energy differences around the whole cube leads to a total cycle closing error of 0.22 kcal/mol, which is smaller than the standard deviation obtained by error propagation, 0.32 kcal/mol. This indicates that there is no statistically significant non-canceling error in our calculations. Notably, the cycle closing error of the constrained trajectories is only 0.02 kcal/mol, while the cycle closing error of the unconstrained trajectories is 0.22 kcal/mol. This suggests that the use of constraints can reduce the error and improve convergence. Similarly, the cycle closing error of the alanine trajectories is 0.04 kcal/mol, while the serine trajectories exhibit a cycle closing error of 0.24 kcal/mol. This indicates that the unconstrained serine trajectories are not fully converged, which is also shown by the high standard deviations of the free energy difference between CHARMM and AMBER for serine (0.28 kcal/mol). Contrariwise, the standard deviation of the corresponding constrained free energy simulations is only 0.08 kcal/mol.

With one exception (the mutation of alanine to serine in CHARMM), all constrained free energy results exhibit lower standard deviations than the unconstrained simulations. On average, the standard deviations of the constrained simulations are about 33% lower than the unconstrained simulations. This can probably be attributed to the increased phase space overlap of the constrained simulations. While the overlaps of the unconstrained simulations range between 0.15 and 0.9%, the introduction of constraints increases the overlap up to 2.05%. On average, the overlap increases by 94% relative to the unconstrained value - effectively doubling the magnitude. In free energy simulations with very low overlap, the introduction of constraints, and employing the constraint correction afterwards, can, therefore, be an efficient way to improve convergence and increase precision.

The improvements in precision incur additional computational costs. Evaluating the Hessians at each point of the trajectory is expensive, so the analysis takes about 14 times longer than a standard evaluation of the energies (e.g., 3 hours instead of 12.8 minutes for 60,000 frames of serine in water). However, since only every 100th or 1000th frame of a simulation is saved for analysis, the post-processing costs are dwarfed by the computational expenses of actually performing the simulation itself. Moreover, in contrast to running the simulation without constraints, where each step has to be computed sequentially, these costs of the correction step are trivially parallelized, since each frame is independent of each other. Thus, using constraints actually allows us to transfer some of the computational burden from a sequential scheme (the simulation), to an embarrassingly parallel process (the post-processing). Given modern multi-core architectures, it is more efficient to run the simulation with constraints (achieving a speedup by a factor of three compared to unconstrained simulations) and then use many cores for the post-processing. Furthermore, if computer time is limited, the Hessians can be approximated by simply using the force constants from the parameter file (see Supplementary Material). This approach neglects the change of vibrational entropy (which is about one or two orders of magnitude smaller than the enthalpic effect), but is only 40% slower than a normal energy evaluation (17.7 instead of 12.8 minutes for 60,000 frames of serine in water). These additional costs are negligible considering that four times longer trajectories are required to achieve an equivalent 50% reduction of the standard deviations with unconstrained simulations, since the precision is related to the square root of the number of frames of the trajectory. If the constraints are also used to perform the simulation at a larger time step (e.g., 2 fs instead of 1 fs), the computational costs relative to an equivalent unconstrained free energy simulation are reduced by a factor of eight. Thus, using constraints in free energy simulations, followed by an additional correction step, can lead to a substantial increase of efficiency.

5. Conclusions

The present work provides the means to calculate the free energy difference of releasing constraints during post-processing by using harmonic analysis. The approach relies on three assumptions a) that the constrained degrees of freedom can be approximated by a harmonic potential (which is true for bond and angle terms in molecular mechanics) b) the constrained structure is close to the energy minimum (i.e., the Hessian of the constrained structure is very similar to the Hessian at the energy minimum). This aspect requires the user to select constraints that are close to the equilibrium bond length in the parameter set. For polar groups, such as hydroxyls, it might be necessary to account for bond stretching due to nonbonded interactions with the solvent. c) the coupling of the constrained degrees of freedom with the unconstrained degrees of freedom is small. As shown by van Gunsteren,[9, 10] angle constraints can change the transition rates of connected dihedral angles, so the angle constraints have to be selected obtuse enough to ensure sufficient sampling of the associated dihedral angles. Furthermore, it is necessary to ensure that the frequencies of the constrained angles are sufficiently separated from the dihedral frequencies.

To demonstrate the usefulness of the algorithm, we tested it with benchmark systems where analytical reference results are available. The maximum free energy error arising from anhar-monicity in the anharmonic oscillators was 0.0007 kcal/mol, while accuracy up to machine precision (i.e., errors between 3.4 10−16 and 4.1 10−19 kcal/mol) was achieved for the purely harmonic four-atomic benchmarks.

To test the applicability to larger systems, we used simulation boxes with up to 3290 water molecules (9870 constrained degrees of freedom). While the algorithm is able to reduce the deviation from the potential energy minimum induced by constrains by more than 90% in all cases, the computation of the Hessians can become very expensive in terms of CPU time and memory. This problem can be alleviated by directly using the force constant from the parameter set (thus neglecting changes of the vibrational entropy, which decreases the error reduction only by about 0.1%). This subject is further discussed in the Supplementary Material.

The functionality was also evaluated in free energy simulations of the alchemical transformation from ethane to methanol. For the calculation of the solvation free energy difference between ethane and methanol, the correction reduced the deviation from the unconstrained free energy results from 0.15 kcal/mol (when using SHAKE constraints on all hydrogens in the free energy simulations) to 0.04 kcal/mol by accounting for the stretching of the hydroxyl group.

The mutation of alanine to serine highlights that it is possible to use the correction step to calculate free energy differences between different force fields. Notably, the phase space overlaps of the unconstrained simulations ranged between 0.15 and 0.9%, which is critically low and introduces imprecision and systematic errors. The introduction of bond constraints alleviates this problem by practically doubling the overlap to up to 2.05%. As a result, the standard deviations of the constrained simulations are, on average, 50% lower than the unconstrained simulations.

The proposed corrections are particularly useful for increasing the accuracy and efficiency of free energy calculations. By using constraints, the simulations can use larger time steps and the phase space overlap between the states involved in the free energy calculation is increased by the confinement. This allows the user to use fewer λ intermediate steps and shorter simulations. In the context of multiscale free energy calculations[15, 66, 67, 68, 69, 70], the use of constraints also allows to include more atoms in the quantum region or dealing with large differences between the QM and the MM energy surface. Since the phase space overlap is related to the variance of the free energy estimate,[26] the efficiency of free energy simulations can be significantly increased by using constraints.

The proposed procedure corrects for about 90% of the free energy change associated with the introduction of constraints. The main limitation lies in the computational costs of the Hessians. Even though their computation is easily parallelizable, it can become intractable if too many constraints are used in the free energy calculation. However, most applications of free energy simulations are limited to ligands with molecular weights less than 500 Dalton, so this limitation should not pose a problem in most cases. Furthermore, the expenses can be mitigated by approximating the Hessians by the bond or angle force constants or by calculating the Hessians only for a small subset of the frames. The standard deviations of the correction factors are one or two orders of magnitude smaller than the standard deviations of the free energy differences of interest for all test cases. Thus, it is highly unlikely that the correction for the free energy costs of constraints will be the computational bottleneck when it comes to applications to larger systems. In fact, for large systems, free energy calculations are limited by a lack of overlap and large fluctuations of the potential energy difference in the exponential averaging. By increasing the overlap and lowering the energy fluctuations, the introduction of constraints allows the use of free energy calculations for significantly larger systems. This further highlights how constraints, if properly used, can be an extremely valuable tool in computational biophysics.

Supplementary Material

supplement

Highlights.

  • We present a method to compute the free energy costs of constraints.

  • The method reduces the free energy errors of constrained simulations by 90%.

  • Precision of free energy calculations is doubled by using constraints.

  • Computational costs are reduced by up to a factor of eight.

Acknowledgments

The authors would like to thank Tim Miller, Stefan Boresch, Yihan Shao, Alex Sodt and Lee Woodcock for insightful discussions, as well as Frank Pickard and Noreen Gervasi for critically reading the manuscript. This work was supported by the intramural research program of the National Heart, Lung and Blood Institute of the National Institutes of Health and utilized the high-performance computational capabilities of the LoBoS and Biowulf Linux clusters at the National Institutes of Health. (http://www.lobos.nih.gov/ and http://biowulf.nih.gov).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Gerhard König, Email: gerhard.koenig@nih.gov, Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States.

Bernard R. Brooks, Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States

References

  • 1.Van Gunsteren WF, Berendsen HJC. Algorithms for macromolecular dynamics and costraint dynamics. Mol Phys. 1977;34:1311–1327. [Google Scholar]
  • 2.Ovchinnikov V, Cecchini M, Karplus M. A Simplified Confinement Method for Calculating Absolute Free Energies and Free Energy and Entropy Differences. J Phys Chem B. 2013;117(3):750–762. doi: 10.1021/jp3080578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Mobley DL, Chodera JD, Dill KA. Confine-and-release method: Obtaining correct binding free energies in the presence of protein conformational change. J Chem Theory Comp. 2007;3(4):1231–1235. doi: 10.1021/ct700032n. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Barth E, Kuczera K, Leimkuhler B, Skeel R. Algorithms for Constrained Molecular Dynamics. J Comput Chem. 1995;16(10):1192–1209. doi: 10.1002/jcc.540161003. [DOI] [Google Scholar]
  • 5.Ryckaert JP, Ciccotti G, Berendsen HJC. Numerical integration of the Cartesian equations of motion of a system with constraints: Molecular dynamics of n-alkanes. J Comput Phys. 1977;23:327–341. [Google Scholar]
  • 6.Andersen H. Rattle: A “velocity” Version of the Shake Algorithm for Molecular Dynamics Calculations. J Comput Phys. 1983;52(1):24–34. doi: 10.1016/0021-9991(83)90014-1. [DOI] [Google Scholar]
  • 7.Hess B, Bekker H, Berendsen H, Fraaije J. LINCS: A linear constraint solver for molecular simulations. J Comput Chem. 1997;18(12):1463–1472. doi: 10.1002/(SICI)1096-987X(199709)18:12&#x000a1;1463::AID-JCC4&#x000bf;3.0.CO;2-H. [DOI] [Google Scholar]
  • 8.Tao P, Wu X, Brooks BR. Maintain rigid structures in Verlet based Cartesian molecular dynamics simulations. J Chem Phys. 137(13) doi: 10.1063/1.4756796. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Van Gunsteren W. Constrained dynamics of flexible molecules. Mol Phys. 1980;40(4):1015–1019. doi: 10.1080/00268978000102101. [DOI] [Google Scholar]
  • 10.van Gunsteren W, Karplus M. Effect of Constraints on the Dynamics of Macromolecules. Macromolecules. 1982;15(6):1528–1544. doi: 10.1021/ma00234a015. [DOI] [Google Scholar]
  • 11.Toxvaerd S. Comment on constrained molecular dynamics of macromolecules. J Chem Phys. 1987;87(10):6140–6143. doi: 10.1063/1.453488. [DOI] [Google Scholar]
  • 12.Tobias D, Brooks C. Molecular dynamics with internal coordinate constraints. J Chem Phys. 1988;89(8):5115–5127. doi: 10.1063/1.455654. [DOI] [Google Scholar]
  • 13.Hinsen K, Kneller G. Influence of constraints on the dynamics of polypeptide chains. Phys Rev E. 1995;52(6, B):6868–6874. doi: 10.1103/PhysRevE.52.6868. [DOI] [PubMed] [Google Scholar]
  • 14.Echenique P, Cavasotto CN, Garcia-Risueno P. The canonical equilibrium of constrained molecular models. Eur Phys J-Spec Top. 2011;200(1):5–54. doi: 10.1140/epjst/e2011-01517-9. [DOI] [Google Scholar]
  • 15.König G, Hudson PS, Boresch S, Woodcock HL. Multiscale Free Energy Simulations: An Efficient Method for Connecting Classical MD Simulations to QM or QM/MM Free Energies Using Non-Boltzmann Bennett Reweighting Schemes. J Chem Theory Comput. 2014;10(4):1406–1419. doi: 10.1021/ct401118k. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sprik M, Ciccotti G. Free energy from constrained molecular dynamics. J Chem Phys. 1998;109(18):7737–7744. doi: 10.1063/1.477419. [DOI] [Google Scholar]
  • 17.den Otter WK, Briels WJ. The calculation of free-energy differences by constrained molecular-dynamics simulations. J Chem Phys. 1998;109:4139–4146. [Google Scholar]
  • 18.den Otter W, Briels W. Free energy from molecular dynamics with multiple constraints. Mol Phys. 2000;98(12):773–781. [Google Scholar]
  • 19.Okuyama-Yoshida N, Kataoka K, Nagaoka M, Yamabe T. Structure optimization via free energy gradient method: Application to glycine zwitterion in aqueous solution. J Chem Phys. 2000;113(9):3519–3524. doi: 10.1063/1.1287785. [DOI] [Google Scholar]
  • 20.Boresch S, Karplus M. The role of bonded terms in free energy simulations: 1. theoretical analysis. J Phys Chem A. 1999;103:103–118. [Google Scholar]
  • 21.Wang L, Hermans J. Change of bond length in free-energy simulations: Algorithmic improvements, but when is it necessary. J Chem Phys. 1994;100:9129–9139. [Google Scholar]
  • 22.van Gunsteren WF, Beutler TC, Fraternali F, King PM, Mark A, Smith P. Computation of free energy in practice: choice of approximations and accuracy limiting factors. In: van Gunsteren WF, Weiner PK, Wilkinson AJ, editors. Computer Simulation of Biomolecular Systems, Theoretical and Experimental Applications. Vol. 2. ESCOM Science; Leiden: 1993. pp. 315–348. [Google Scholar]
  • 23.Pearlman DA, Kollman PA. The overlooked bond-stretching contribution in free energy perturbation calculations. J Chem Phys. 1991;94:4532–4545. [Google Scholar]
  • 24.Pearlman DA. Determining the contributions of constraints in free energy calculations: Development, characterization, and recommendations. J Chem Phys. 1993;98:8946–8957. [Google Scholar]
  • 25.Kirkwood JG. Statistical mechanics of fluid mixtures. J Chem Phys. 1935;3:300–313. [Google Scholar]
  • 26.Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. J Comp Phys. 1976;22:245–268. [Google Scholar]
  • 27.Shirts MR, Chodera JD. Statistically optimal analysis of samples from multiple equilibrium states. J Chem Phys. 129(12) doi: 10.1063/1.2978177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.König G, Boresch S. Non-Boltzmann Sampling and Bennett’s Acceptance Ratio Method: How to Profit from Bending the Rules. J Comput Chem. 2011;32(6):1082–1090. doi: 10.1002/jcc.21687. [DOI] [PubMed] [Google Scholar]
  • 29.Wereszczynski J, McCammon JA. Using Selectively Applied Accelerated Molecular Dynamics to Enhance Free Energy Calculations. J Chem Theory Comput. 2010;6(11, 3285):3285–3292. doi: 10.1021/ct100322t. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Brooks BR, Janežič D, Karplus cM. Harmonic analysis of large systems. I. methodology. J Comput Chem. 1995;16:1522–1542. [Google Scholar]
  • 31.Ghysels A, Van Neck D, Van Speybroeck V, Verstraelen T, Waroquier M. Vibrational modes in partially optimized molecular systems. J Chem Phys. 126(22) doi: 10.1063/1.2737444. [DOI] [PubMed] [Google Scholar]
  • 32.Woodcock HL, Zheng W, Ghysels A, Shao Y, Kong J, Brooks BR. Vibrational subsystem analysis: A method for probing free energies and correlations in the harmonic limit. J Chem Phys. 129(21) doi: 10.1063/1.3013558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Ghysels A, Van Neck D, Brooks BR, Van Speybroeck V, Waroquier M. Normal modes for large molecules with arbitrary link constraints in the mobile block Hessian approach. J Chem Phys. 130(8) doi: 10.1063/1.3071261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Ghysels A, Van Speybroeck V, Pauwels E, Van Neck D, Brooks BR, Waroquier M. Mobile Block Hessian Approach with Adjoined Blocks: An Efficient Approach for the Calculation of Frequencies in Macromolecules. J Chem Theory Comput. 2009;5(5):1203–1215. doi: 10.1021/ct800489r. [DOI] [PubMed] [Google Scholar]
  • 35.Ghysels A, Van Speybroeck V, Pauwels E, Catak S, Brooks BR, Van Neck D, Waroquier M. Comparative Study of Various Normal Mode Analysis Techniques Based on Partial Hessians. J Comput Chem. 2010;31(5):994–1007. doi: 10.1002/jcc.21386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ghysels A, Woodcock HL, III, Larkin JD, Miller BT, Shao Y, Kong J, Van Neck D, Van Speybroeck V, Waroquier M, Brooks BR. Efficient Calculation of QM/MM Frequencies with the Mobile Block Hessian. J Chem Theory Comput. 2011;7(2):496–514. doi: 10.1021/ct100473f. [DOI] [PubMed] [Google Scholar]
  • 37.Ghysels A, Miller BT, Pickard FC, Brooks BR. Comparing normal modes across different models and scales: Hessian reduction versus coarse-graining. J Comput Chem. 2012;33(28):2250–2275. doi: 10.1002/jcc.23076. [DOI] [PubMed] [Google Scholar]
  • 38.Gō N, Scheraga HA. On the use of classical statistical mechanics in the treatment of polymer chain conformation. Macromolecules. 1976;9:535–542. [Google Scholar]
  • 39.Boresch S, Karplus M. The Jacobian factor in free energy simulations. J Chem Phys. 1996;105:5145–5154. [Google Scholar]
  • 40.Herschbach DR, Johnston HS, Rapp D. Molecular partition functions in terms of local properties. J Chem Phys. 1959;31:1652–1661. [Google Scholar]
  • 41.Straatsma TP, McCammon JA. Treatment of rotational isomers in free energy calculations. ii. molecular dynamics simulation study of 18-crown-6 in aqueous solution as an example of systems with large numbers of rotational isomeric states. J Chem Phys. 1989;91:3631–3304. [Google Scholar]
  • 42.Zwanzig RW. High-temperature equation of state by a perturbation method. I. Nonpolar gases. J Chem Phys. 1954;22:1420. [Google Scholar]
  • 43.Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, Mackerell AD. Charmm general force field: A force field for drug-like molecules compatible with the charmm all-atom additive biological force fields. J Comput Chem. 2010;31(4):671–690. doi: 10.1002/jcc.21367. URL http://dx.doi.org/10.1002/jcc.21367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Boresch S. The role of bonded energy terms in free energy simulations — insights from analytical results. Mol Simulation. 2002;28:13–37. [Google Scholar]
  • 45.König G, Miller BT, Boresch S, Wu X, Brooks BR. Enhanced Sampling in Free Energy Calculations: Combining SGLD with the Bennett’s Acceptance Ratio and Enveloping Distribution Sampling Methods. J Chem Theory Comput. 2012;8(10):3650–3662. doi: 10.1021/ct300116r. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Jorgensen WL, Chandrasekhar H, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79:926. [Google Scholar]
  • 47.MacKerell AD, Jr, Bashford D, Bellott M, Dunbrack RL, Jr, Evanseck JD, Field MJ, Fischer S, Gao J, Guo H, Ha S, Joseph-McCarthy D, Kuchnir L, Kuczera K, Lau FTK, Mattos C, Michnick S, Ngo T, Nguyen DT, Prodhom B, Reiher WE, III, Roux B, Schlenkrich M, Smith J, Stote R, Straub J, Watanabe M, Wiorkiewicz-Kuczera J, Yin D, Karplus M. All-atom empirical potential for molecular modeling and dynamics studies of protein. J Phys Chem B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
  • 48.Wales D, Hodges M. Global minima of water clusters (H2O)(n), n <= 21, described by an empirical potential. Chem Phys Lett. 1998;286(1–2):65–72. doi: 10.1016/S0009-2614(98)00065-7. [DOI] [Google Scholar]
  • 49.Jo S, Kim T, Iyer VG, Im W. Charmm-gui: A web-based graphical user interface for charmm. J Comput Chem. 2008;29:1959–1865. doi: 10.1002/jcc.20945. [DOI] [PubMed] [Google Scholar]
  • 50.Brooks B, Brooks C, III, Mackerell A, Jr, Nilsson L, Petrella R, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner A, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor R, Post C, Pu J, Schaefer M, Tidor B, Venable R, Woodcock H, Wu X, Yang W, York D, Karplus M. CHARMM: The Biomolecular Simulation Program. J Comput Chem. 2009;30(10 Sp Iss SI):1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. CHARMM: A program for macromolecular energy, minimization and dynamics calculations. J Comput Chem. 1983;4:187–217. [Google Scholar]
  • 52.Boresch S, Karplus M. The role of bonded terms in free energy simulations. 2. calculation of their influence on free energy differences of solvation. J Phys Chem A. 1999;103:119–136. [Google Scholar]
  • 53.Woodcock HL, Miller BT, Hodoscek M, Okur A, Larkin JD, Ponder JW, Brooks BR. MSCALE: A General Utility for Multiscale Modeling. J Chem Theory Comput. 2011;7(4):1208–1219. doi: 10.1021/ct00738h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hoover WG. Canonical dynamics: Equilibrium phase-space distributions. Phys Rev A. 1985;31:1695–1697. doi: 10.1103/physreva.31.1695. [DOI] [PubMed] [Google Scholar]
  • 55.Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG. A smooth particle mesh Ewald method. J Chem Phys. 1995;103:8577–8593. [Google Scholar]
  • 56.Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM, Jr, Ferguson DM, Spellmeyer DC, Fox T, Caldwell JW, Kollman PA. A second generation force field for the simulation of proteins and nucleic acids. J Am Chem Soc. 1995;117:5179–5197. [Google Scholar]
  • 57.König G, Bruckner S, Boresch S. Unorthodox Uses of Bennett’s Acceptance Ratio Method. J Comput Chem. 2009;30(11):1712–1718. doi: 10.1002/jcc.21255. [DOI] [PubMed] [Google Scholar]
  • 58.Jorgensen W, Ravimohan C. Monte-Carlo simulation of differences in free-energies of hydration. J Chem Phys. 1985;83(6):3050–3054. doi: 10.1063/1.449208. [DOI] [Google Scholar]
  • 59.Singh UC, Brown FK, Bash PA, Kollman PA. An approach to the application of free energy perturbation methods using molecular dynamics: Applications to the transformations of CH3OH → CH3CH3, H3O+NH4+, Glycine → Alanine, and Alanine → Phenylalanine in aqueous solution and to H3O+(H2O)3NH4+(H2O)3 in the gas phase. J Am Chem Soc. 109(1987):1607–1614. [Google Scholar]
  • 60.Ytreberg F, Zuckerman D. Efficient use of nonequilibrium measurement to estimate free energy differences for molecular systems. J Comput Chem. 2004;25(14):1749–1759. doi: 10.1002/jcc.20103. [DOI] [PubMed] [Google Scholar]
  • 61.Min D, Yang W. Energy difference space random walk to achieve fast free energy calculations. J Chem Phys. 128(19) doi: 10.1063/1.2927744. [DOI] [PubMed] [Google Scholar]
  • 62.Ben-Naim A, Marcus YJ. Solvation thermodynamics of nonionic solutes. J Chem Phys. 1984;81:2016–2027. [Google Scholar]
  • 63.Bruckner S, Boresch S. Efficiency of alchemical free energy simulations I: Practical comparison of the exponential formula, thermodynamic integration and bennett’s acceptance ratio method. J Comput Chem. 2011;32:1303–1319. doi: 10.1002/jcc.21713. [DOI] [PubMed] [Google Scholar]
  • 64.Bruckner S, Boresch S. Efficiency of alchemical free energy simulations II: Improvements for thermodynamic integration. J Comput Chem. 2011;32:1320–1333. doi: 10.1002/jcc.21712. [DOI] [PubMed] [Google Scholar]
  • 65.König G, Boresch S. Hydration Free Energies of Amino Acids: Why Side Chain Analog Data Are Not Enough. J Phys Chem B. 2009;113(26):8967–8974. doi: 10.1021/jp902638y. [DOI] [PubMed] [Google Scholar]
  • 66.König G, Pickard FC, Mei Y, Brooks BR. Predicting hydration free energies with a hybrid QM/MM approach: an evaluation of implicit and explicit solvation models in SAMPL4. J Comput-Aid Mol Design. 2014;28(3, SI):245–257. doi: 10.1007/s10822-014-9708-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Rod T, Ryde U. Quantum Mechanical Free Energy Barrier for an Enzymatic Reaction. Phys Rev Lett. 94(13) doi: 10.1103/PhysRevLett.94.138302. URL http://link.aps.org/doi/10.1103/PhysRevLett.94.138302. [DOI] [PubMed] [Google Scholar]
  • 68.Rod TH, Ryde U. Accurate QM/MM Free Energy Calculations of Enzyme Reactions: Methylation by Catechol O -Methyltransferase. J Chem Theory Comput. 2005;1(6):1240–1251. doi: 10.1021/ct0501102. URL http://pubs.acs.org/doi/abs/10.1021/ct0501102. [DOI] [PubMed] [Google Scholar]
  • 69.Heimdal J, Ryde U. Convergence of QM/MM free-energy perturbations based on molecular-mechanics or semiempirical simulations. Phys Chem Chem Phys. 2012;14:12592–12604. doi: 10.1039/c2cp41005b. [DOI] [PubMed] [Google Scholar]
  • 70.Fox SJ, Pittock C, Tautermann CS, Fox T, Christ C, Malcolm NOJ, Essex JW, Skylaris CK. Free energies of binding from large-scale first-principles quantum mechanical calculations: application to ligand hydration energies. J Phys Chem B. 2013;117(32):9478–9485. doi: 10.1021/jp404518r. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplement

RESOURCES