Abstract
Improving the capability of atomistic computer models to predict the thermodynamics of noncovalent binding is critical for successful structure-based drug design, and the accuracy of such calculations remains limited by non-optimal force field parameters. Ideally, one would incorporate protein-ligand affinity data into force field parametrization, but this would be inefficient and costly. We now demonstrate that sensitivity analysis can be used to efficiently tune Lennard-Jones parameters of aqueous host-guest systems for increasingly accurate calculations of binding enthalpy. These results highlight the promise of a comprehensive use of calorimetric host-guest binding data, along with existing validation data sets, to improve force field parameters for the simulation of noncovalent binding, with the ultimate goal of making protein-ligand modeling more accurate and hence speeding drug discovery.
1. INTRODUCTION
The ability to reliably predict protein-ligand binding thermodynamics by means of molecular simulations would have enormous practical impact, such as the acceleration of drug discovery and enzyme engineering. Improving reliability is likely to require advances in two areas. One is efficient sampling, so as to obtain well-converged simulation results which correctly reflect the contributions of all thermodynamically relevant sectors of configuration space. Recent progress on this front includes advances in both algorithms1–4 and computer hardware5–7. For example, microsecond-scale molecular dynamics (MD) simulations of biomolecular systems are now routinely achievable with commodity hardware. However, a molecular simulation is only as accurate as the force field it uses, and, despite pioneering contributions8–15 and important advances16–20, further improvement in force field accuracy are needed for reliable modeling of protein-ligand binding to become a reality21–24.
Every force field includes a large set of adjustable parameters. These are typically set based on quantum chemistry data, such as gas-phase electrostatic potentials and the energetics of gas-phase clusters14,24–28, combined with selected experimental data. Enormous progress in force field development has already been made by using comparatively accessible experimental quantities, such as densities and heats of vaporization of neat liquids29,30, hydration free energies of small molecules31, and, more recently, conformational preferences of peptides and proteins32. However, these data sets are quite limited in size, and are scarcely expanding. For example, although a recently compiled set of ~500 small molecule hydration free energies33 is a powerful aid to testing and adjusting force fields, there is little prospect for increasing the number of such data. In addition, the commonly used data probe only a modest collection of interaction types, and this limitation risks compromising the generality of the resulting force fields. For example, a force field adjusted to replicate the properties of neat acetone and neat benzene may not accurately account for interactions between acetone and benzene; and hydration free energies only probe interactions of small organic molecules with one other molecule, water. Thus, it is perhaps not surprising that force fields do not routinely perform well when used to compute the properties of various chemical mixtures34,35. Nonetheless, such data are relevant to protein-ligand binding, because of the large variety of interactions that occur at the binding interface.
In addition, commonly used experimental observables may not strongly test the performance of force fields when they are applied to binding calculations. For example, although the TIP3P36 and TIP4P-Ew37 water models yield generally similar small molecule hydration free energies and enthalpies38,39, they yield strikingly different results for host-guest binding enthalpies40 with mean signed errors (MSE), relative to experiment, of −3.0 kcal/mol for TIP3P and −6.8 kcal/mol for TIP4P-Ew. The magnitude and systematic character of these deviations could result from errors in the force field’s representation of specific interactions present in the host-guest systems, perhaps amplified by the greater size of these host-guest systems relative to the small molecules in the hydration study. It is also worth noting, however, that neither small molecule hydration data nor the properties of neat liquids probe how accurately water models treat confined water, which is present in the binding sites of host molecules and proteins, and is thought to significantly influence binding thermodynamics41–44.
Ideally, perhaps, one would use actual protein-ligand data to test and adjust force fields. Unfortunately, the calculation of rigorously converged absolute protein-ligand binding affinities by simulation is still too time-consuming to be incorporated into force-field optimization procedures. Moreover, protein simulations pose the challenge of establishing the protonation states of ionizable groups, such as histidine, aspartic acid, and glutamic acid, in complex, partially hydrated actives sites that can generate substantial pKa shifts. The protonation states of such groups influence ligand affinities, but are not easily determined.
Host-guest systems hold great promise as a simple but informative alternative for testing and improving force fields for use in binding calculations. These miniature models of molecular recognition have affinities which span the same range as protein-ligand systems, and their small sizes, chemical simplicity, and often predictable protonation states make it far easier to carry out rigorously converged simulations which can be directly compared with experiment. Accordingly, host-guest systems have already been used to test binding calculations45–48, most recently through the SAMPL series of blind prediction challenges49–51, and also to optimize interaction potentials52. Both binding free energies and enthalpies are often available for host-guest systems47,53,54, and it has been shown that both of these quantities can be computed to high numerical precision40,50,51,55. Given that binding free energies and binding enthalpies are largely independent quantities46,56, it should be useful to tune force field parameters against both of these observables, thus taking maximal advantage of the available thermodynamic data. It is also worth noting that enthalpy changes, in the form of heats of vaporization of neat liquids, have long played a central role in force field parameterization57–59, so it is likely binding enthalpies will be similarly useful. For these reasons, we aim to add host-guest binding free energies and enthalpies to the types of experimental and quantum chemical data already used to optimize force field parameters.
In order to make efficient use of an experimental observable to adjust force field parameters, one needs a way of predicting how a prospective parameter change will affect the computed value of the observable. A natural approach to this is problem is sensitivity analysis60, i.e., the evaluation of the partial derivatives of a simulation average with respect to simulation parameters. These derivatives provide the gradient of the computed quantity in parameter space, and hence can be used to choose parameter changes which will improve the agreement of the calculation experiment. Accordingly, partial derivatives of experimental observables or other target quantities have been incorporated into various parameter-optimization schemes9,37,61–65. However, we are not aware of prior efforts to use the gradients of host-guest binding thermodynamics with respect to force field parameters as a basis for force field optimization. Indeed, only recently has it been demonstrated that host-guest binding enthalpies can be computed with simulations to the level of numerical precision required for such an application.
The present study aims to prove the feasibility of using sensitivity analysis of host-guest binding enthalpies to help guide force field optimization. In particular, we use sensitivity analysis to adjust selected force field parameters to bring computed binding enthalpies for a series of cucurbit[7]uril(CB7)-guest systems47,66–69 into line with experiment. We focus on tuning Lennard-Jones (LJ) parameters, in part because prior energy decompositions of the computed binding enthalpies for these systems indicate that the LJ component of the force field makes dominant, favorable contributions to the computed binding enthalpies40, so small changes in LJ parameters should be effective force-field modifications to improve agreement with experiment. It should be noted that our main goal here is not to arrive at a specific set of recommended parameters for general use, but rather to implement and demonstrate a set of calculations which set the stage for incorporation of binding data into the broader process of force field optimization. We envision ultimately using binding data in concert with the more standard experimental and quantum chemical data sets discussed above, in order to generate force fields which work well not only for pure liquids and small molecules, but also for the noncovalent association of complex molecules in solution.
The paper is organized as follows. The Methods section first describes the calculation of binding free energies and enthalpies, provides expressions for the partial derivatives of these quantities with respect to force field parameters, and explains how these were used to adjust parameters. It then provides details of the simulations and explains how statistical uncertainties were evaluated; additional methodological details are provided in the Supporting Information (SI). The Results section compares binding enthalpies calculated with the unmodified generalized Amber force field (GAFF)70 and TIP3P water against experiment, then presents the sensitivity analysis and demonstrates its use to carry out two cycles of parameter optimization for two atoms associated with the CB7 host, based on a training set of four aliphatic guests. The consequences for the computed binding enthalpies and free energies of the training set and of a test set of seven additional guests are then presented. Finally, the Discussion section considers these particular results and looks more broadly at the implications of adding host-guest binding data to the set of data used to parameterize simulation force fields.
2. METHODS
We computed binding enthalpies and free energies for a training set of host-guest systems comprising the host CB7 and four aliphatic guests bearing either hydroxyl or ammonium groups (Figure 1, shaded in blue); applied sensitivity analysis to obtain the thermodynamic derivatives of the binding enthalpies with respect to the parameters of the Lennard-Jones force field terms; used these derivatives to guide parameter adjustments that would improve the agreement of the calculations with experiment; and reran the binding calculations to test the performance of the parameter adjustments. Two cycles of this process were carried out on the same training set, and the resulting set of parameters then was tested on a separate test set of seven guests (Figure 1). Four of the test set guests are aliphatics with ammonium groups, and hence resemble the training set, while three of the test guest guests are cationic heteroaromatic compounds with little resemblance to the training set. These were included to probe the transferability of the parameter adjustments. The following subsections describe these procedures, and further details are available in the SI.
Figure 1.
Cucurbit[7]uril (CB7) host and eleven guest molecules studied here. The guest molecules in the training set are shaded in blue and those in the test set are shaded in red.
2.1. Calculation of Binding Free Energy and Enthalpy
As previously described40, and detailed in the SI, binding enthalpies were computed as differences between the mean potential energy of a simulation of the solvated host-guest complex and the potential energies of two separate simulations of the solvated isolated host and the solvated isolated guest. As needed, the mean potential energy of a simulation of pure solvent was used to exactly balance the constituents of the bond versus free states; see Eq 1.6 in the SI.
Host-guest binding free energies were computed via the attach-pull-release (APR) approach71. In brief, the APR approach involves computing a series of potentials of mean force along a path which begins with the guest unrestrained in the host’s binding site, and ends with the guest held far enough from the host that their interactions are negligible. The pathway joining the initial and final states comprises an initial “attach” step, in which a system of restraints is imposed on the host-guest complex; a “pull” step, in which the restraint defining the host-guest distance is elongated to move the guest from the host; and a “release” step, in which all restraints on the host and guest that affect their internal coordinates are turned off. For the present calculations, the exit of the guest from the host is facilitated by restraints that hold open the otherwise tightly restrictive portal of the CB7 host. These restraints are turned on and off during the attachment and release steps, respectively. A final analytic correction adjusts the effective concentration of the guest to the standard concentration of 1 M. All PMFs were computed via thermodynamic integration72,73, using umbrella sampling74 in a series of windows along the pathway. The SI provides further methodological details.
2.2. Sensitivity Analysis of Binding Thermodynamics
Analyzing the sensitivity of binding thermodynamics to force field parameters requires expressions for the derivatives of host-guest binding enthalpies and free energies with respect to the chosen force field parameter. We focused on the LJ energy term, given by
| (1) |
where rij is the interatomic distance, and εi and σi, respectively, relate to the well depth at the energy minimum and the distance at which the potential crosses zero. Note that the parameters are listed in GAFF as εi and , where, σi = 2−1/6Rmin and Rmin is the distance at the Lennard-Jones minimum. As noted above, the binding enthalpy of a host-guest system may be computed as the difference in the mean potential energy, 〈U〉, between the host-guest complex, hg; the free host, h; and the free guest, g; each in aqueous solution (The mean energy of a pure solvent simulation may also be required to balance the number of solvent molecules in the complex versus free simulations).40 The sensitivity of the binding enthalpy to a force field parameter c may be obtained by recognizing that the partial derivative of 〈U〉 is given by (see SI)
| (2) |
where R is the gas constant and T is absolute temperature, and c here will represent either εi or σi in equation (1). The sensitivity of the binding enthalpy is computed as the difference of between the bound and unbound states:
| (3) |
Here Nhg, Nh and Ng represent the number of solvent molecules in the hg, h and g simulations respectively, and ΔN = Nhg − Nh − Ng. As shown in the SI, the sensitivity of the binding free energy, ΔG, to parameter c is given by
| (4) |
The mean partial derivatives in these expressions were evaluated with custom-developed code which used analytic expressions for the derivatives of the LJ expression in Eq (1) to post-analyze stored simulation trajectories.
2.3. Parameter Optimization
After the enthalpy derivatives of the training set binding enthalpies had been computed, several selected LJ parameters for atom types associated with the CB7 host were optimized simultaneously. As detailed in Results, we chose to optimize parameters with particularly large values of , in order to minimize the need for large parameter adjustments. Optimization involved minimizing the squared deviation between the computed and experimental ΔH values for all CB7-guest pairs in the training set, using a nonlinear minimizer within the ‘Solver’ tool of Microsoft Excel. A weak L2 regularization term, restraining the parameters toward their original GAFF values (see below), was included, to avoid implausibly large parameter modifications. The regularization term was weighted so that a 50% change in any parameter would generate a penalty as large as that associated with the deviation between the full set of initially computed binding enthalpies and their corresponding experimental values. In order to avoid overfitting, and to test for transferability of the revised host parameters, guest molecules A1, A2, B2 and B5 (Figure 1) were used as a training set for the parameter adjustments, while the remaining guests formed a test set.
2.4. Simulation Setup and Execution
All the MD simulations were performed with the AMBER1475 suite of programs. Topology files for the systems were generated using tleap from AmberTools75. All amines were treated as fully protonated, because such simple aliphatic amines have pKas larger than 9 in water, and these groups remain well-hydrated when bound to CB7, so their pKas are not expected to fall significantly in the bound state; and the experiments were done at neutral pH. The bonded and initial Lennard-Jones parameters were obtained from GAFF. Partial atomic charges were generated with the AM1-BCC method76,77 as implemented in the program Antechamber78. The edge lengths of the cubic TIP3P water boxes ranged between 32 and 34 Å. Counterions were added to neutralize the total charge of each given system. The molecular dynamics simulations followed the same protocol as our previous study40, except that, instead of running the production simulations for 1 microsecond and saving snapshots every 2 ps, we ran the production simulations for 750 ns and save snapshots every 1 ps. This change was implemented because we discovered, via statistical inefficiency analysis79, that the two approaches provide effectively the same statistical power for the standard error of the mean of the total potential energy of a simulation. Each system was initially energy minimized with 500 steps of steepest descent, followed by up to 10,000 steps of conjugate gradient with the host and/or guests restrained by a 100 kcal/mol/A2 harmonic restraint. The harmonic restraints were then removed, and the system was again minimized for 500 steps of steepest descent, followed by up to 10,000 steps of conjugate gradient. The system was equilibrated with a 2ns NVT simulation followed by a 20 ns NPT simulation, and then the production simulation was begun. The temperature of the simulations was set to 300K, using the Langevin thermostat80 with a collision frequency of 1.0 ps−1. The pressure was set to 1 bar for both the NPT stage of equilibration and the production simulation, using the Berendsen barostat81 with the relaxation time set to 2 ps. The particle mesh Ewald (PME) method82 was used for electrostatic interactions. The cutoff distance of Lennard-Jones interactions was set to 10 Å during the minimization and equilibration phases, and then set to 9 Å for the production simulations. (A sample calculation with a distance cutoff of 14 Å led to an unchanged computed binding enthalpy, to within the numerical uncertainty of the simulations.)
All binding free energy simulations were performed using the same settings as for the enthalpy calculations, above, except that larger simulation boxes were used to accommodate the extended geometries required by the pulling calculations. The edge lengths are approximately 36, 36 and 53 Å for all host-guest systems except for CB7-guest B11 which is 36, 36, and 60 Å. Thus, each system was solvated in a rectangular box with 2210 TIP3P water molecules, except for guest B11 which required 2500 water molecules, due to its larger size. A detailed description of the attach-pull-release framework for calculating the binding free energies is provided in the SI. With the latest generation of GPUs, a performance of over 180 ns per day can be easily achieved.
2.5. Error Analysis
Numerical uncertainties of the binding calculations, representing estimated standard deviations of the mean, were obtained by conducting blocking analysis79; see SI for details. For a fair comparison, the experimental uncertainties listed here were computed by dividing their reported experimental standard deviations by the square root of the number of replicated experiments, in order to estimate the standard deviation of the mean. The uncertainties of the error statistics were calculated via bootstrap resampling (100,000 times) of the computed and experimental binding data, by randomly sampling from a Gaussian distribution centered on each data point, with its standard deviation set to the standard deviation of the mean.
3. RESULTS
3.1. Baseline Calculations and Selection of Parameters to be Adjusted
Binding free energies and enthalpies computed with the initial GAFF parameters correlate well with experiment (R=0.93), but systematically overestimate the affinity and exothermicity. In particular, the MSE and RMSD of the binding enthalpies are −3.0 kcal/mol and 4.2 kcal/mol, respectively. The binding enthalpies of all four molecules in the training set are markedly overestimated, and, in the test set, the only exceptions to this pattern are G8 and MVN/Tris, for which the calculations underestimate the exothermicity by 2.3 and 1.3 kcal/mol, respectively. Given the high precision of the calculations, these substantial deviations from experiment must be attributed to errors in the force field. In addition, the nearly uniform tendency to overestimate suggests the existence of a problem in the force field which affects most or all of the results, and whose correction might remove the systematic errors from the calculations. Note, however, that a parameter change which makes all binding enthalpies less favorable could exacerbate the underestimations seen for G8 and MVN/Tris, so one may anticipate that the parameters for these guests will ultimately require special adjustment.
Because they are common to all of the binding calculations, the water model and the CB7 host are obvious places to look for parameters whose adjustment would weaken computed binding enthalpies across many or all of the guest molecules, as required to improve most of the results. Regarding water, we know that this can have a profound effect; as noted in the Introduction, TIP4P-Ew leads to even greater overestimation of exothermicity for these systems. However, adjusting the water model risks generating a water model with unrealistic bulk properties, and we also found that converges slowly when c pertains to the water model. Therefore, although it will be important in the long term to consider adjusting the water model to fit the properties of pure water along with binding thermodynamics, we chose here to focus on parameters associated with the CB7 host. We focused in particular on its LJ parameters, for several reasons. First, bonded parameters are not expected to strongly influence binding affinities for this rather rigid host. Second, LJ forces were previously found to contribute strongly, and always favorably, to the computed binding enthalpies of these systems40, so modest adjustments to these parameters might yield substantial shifts in computed binding enthalpy. Third, the existing GAFF force field assigns the same LJ parameters to many, varied atom types33,83; thus, all 13 nitrogen atom types are assigned identical LJ parameters, and there are only three sets of LJ parameters for all small molecule oxygen atoms. Thus, it is plausible that achieving high accuracy calculations with a general force field like GAFF will require more differentiation of LJ parameters across atom types.
The partial derivatives of the binding enthalpies and free energies for the atom types present in the host (Table 2) converged well (Figure S3), and the values of and are nearly all negative. This is consistent with the intuition that deepening the LJ energy well should strengthen binding, and with the prior observation that the LJ term is a major contributor to the computed binding enthalpies of these systems40. The sensitivities for σ are less consistent in sign, however. Interestingly, only modest correlations (R~0.6) are observed between and and between and , indicating that the values of ΔH and ΔG can be separately manipulated by suitable adjustment of LJ parameters; in the present study, we focus on correcting the binding enthalpies.
Table 2.
Derivatives of computed binding enthalpies and binding free energies with respect to the GAFF parameters of all atom types in CB7, for the four training-set guests. The units of and are kcal/(mol·Å); and are unitless.
| Atom type | A1 | A2 | B2 | B5 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|||||||||
|
|
|
|
|
|
||||||||||||
| n | −13.20 | −23.52 | −12.92 | −23.75 | −15.92 | −20.54 | −15.36 | −21.99 | ||||||||
| o | 5.36 | −1.80 | 3.30 | −4.24 | 5.02 | −1.49 | 3.13 | −5.13 | ||||||||
| c | −2.32 | −18.80 | −1.48 | −16.57 | −4.25 | −18.40 | −4.24 | −20.73 | ||||||||
| c3a | −3.27 | −9.51 | −3.28 | −10.44 | −2.56 | −3.91 | −2.39 | −5.67 | ||||||||
| h2 | 0.02 | 2.27 | −0.98 | −10.80 | −0.52 | −4.94 | −0.93 | −12.36 | ||||||||
| n | −4.08 | −26.17 | −2.87 | −25.91 | −7.97 | −25.36 | −6.13 | −25.37 | ||||||||
|
|
|
|
|
|
|
|
|
|||||||||
|
|
|
|
|
|||||||||||||
| o | 0.27 | −8.63 | −2.92 | −13.07 | 1.18 | −8.18 | −1.58 | −13.74 | ||||||||
| c | 2.13 | −17.14 | 2.83 | −16.47 | 0.38 | −17.53 | 1.03 | −18.50 | ||||||||
| c3a | −5.09 | −25.40 | −4.93 | −25.21 | −4.17 | −19.56 | −3.34 | −20.01 | ||||||||
| h2 | −1.10 | −20.51 | −1.15 | −22.05 | −0.51 | −13.05 | −0.82 | −17.99 | ||||||||
The c3 atom type is present in both CB7 and all the guest molecules, but the values reported here were computed based on changes only in c3 atoms in CB7.
Given the limited number of experimental data used to guide parameterization in this initial proof-of-concept study, we adjusted parameters for only two atom types, n and o in GAFF nomenclature (Figure 1), which occur only in the host molecule. The binding enthalpies are quite sensitive to the LJ parameters of these two atom types (Table 2), which means that conservative adjustments should allow significant improvement in the computed enthalpies; and their variation across the training-set guests provides an opportunity to generate different adjustments to the computed binding enthalpies from one guest to another.
3.2. Parameter Adjustment for Training-Set Host-Guest Binding Enthalpies
Sensitivity analysis predicted that changing σn and εn by −0.1425 Å and −0.0200 kcal/mol, respectively, and σo and εo by 0.1782 Å and 0.0500 kcal/mol, respectively, would significantly improve the agreement of the training set calculations with experiment, with R increasing from 0.77 to 0.85 and the RMSD decreasing from 4.8 to 1.9 kcal/mol. We made these parameter changes and then used a new round of simulations to recalculate the binding enthalpies of the training set guests. The actual R value for the training set improved to 0.87, and the RMSD decreased to 1.4 kcal/mol, in good agreement with the predictions. Because sensitivity analysis uses a linear approximation to parameter dependencies that are not necessarily linear, we then reran the parameter optimization based on derivatives from the new MD simulations. Sensitivity analysis based on these simulations (Table S1) predicted that further changing σn and εn by 0.0437 Å and −0.0095 kcal/mol, respectively, and σo and εo by 0.1614 Å and 0.0508 kcal/mol, respectively, would improve R to 0.89 and RMSD to 1.2 kcal/mol, for the training set. In good agreement with these predictions, the results from a third set of MD calculations of binding enthalpy with these new LJ parameters (Table 1), yielded values of R and RMSD of 0.90 and 1.0 kcal/mol, respectively. Given the marked improvement and diminishing returns after these two optimization steps, no further iterations were done. The final adjusted parameters differ by an average of 20% from the initial GAFF parameters, and the net effect of the parameter adjustment is to improve agreement with experiment by making the computed binding enthalpies less favorable. It is worth noting that the conformation of free CB7 is not discernibly altered by the new parameters, as illustrated in Figure S4 (SI).
Table 1.
Experimental and calculated binding enthalpies and free energies (kcal/mol) for host CB7 with 11 different guest molecules; training set data are highlighted with bold font. Computational results are reported for both unmodified GAFF parameters (Initial) and for the adjusted parameters from the second round of optimization (Final). Uncertainties are reported as standard deviations of the mean. For methylviologen (MVN) both the measurements and calculations included Tris buffer, which influences the binding enthalpy (see ref 40 and Kimoon Kim, personal communication). Initial parameters: σn, εn: 3.2500, 0.1700; σo, εo: 2.9599, 0.2100. Final parameters: σn, εn: 3.1511, 0.1405; σo, εo: 3.2995, 0.3108.
| Binding Enthalpy
|
Binding Free Energy
|
|||||
|---|---|---|---|---|---|---|
| Exp.a | Initial | Final | Exp.a | Initial | Final | |
|
|
|
|||||
| A1 | −19.0±0.2 | −24.9±0.2 | −20.2±0.2 | −14.1±0.1 | −23.7±0.1 | −22.6±0.2 |
| A2 | −19.3±0.2 | −22.7±0.2 | −19.0±0.2 | −19.4±0.1 | −27.4±0.1 | −27.8±0.2 |
| B2 | −15.8±0.1 | −21.9±0.2 | −16.9±0.2 | −13.4±0.1 | −21.3±0.1 | −19.8±0.2 |
| B5 | −15.6±0.2 | −18.3±0.2 | −14.4±0.2 | −19.5±0.1 | −27.1±0.2 | −26.6±0.2 |
| A3 | −21.9±0.2 | −24.8±0.2 | −19.6±0.2 | −20.3b | −28.0±0.1 | −26.6±0.2 |
| A4 | −20.4±0.2 | −23.6±0.2 | −18.5±0.3 | −21.5b | −30.0±0.2 | −28.5±0.2 |
| A5 | −19.5±0.2 | −23.2±0.3 | −18.8±0.2 | −19.1±0.1 | −27.3±0.2 | −27.1±0.2 |
| B11 | −16.3±0.2 | −17.8±0.2 | −16.3±0.2 | −20.6±0.2 | −30.4±0.4 | −31.5±0.6 |
| G8 | −8.5±0.3 | −6.2±0.2 | −0.8±0.2 | −6.1±0.1 | −13.9±0.1 | −13.4±0.2 |
| G9 | −3.8±0.1 | −11.6±0.2 | −7.9±0.2 | −7.4±0.1 | −18.5±0.3 | −19.6±0.3 |
| MVN/Trisc | −3.4±0.2 | −2.1±0.2 | −2.6±0.2 | −7.1±0.1 | −11.1±0.3 | −9.7±0.3 |
3.3. Application of Optimization Parameters to Test-Set Binding Enthalpies
We used the final modified parameters to compute the binding enthalpies of the test-set guest molecules, A3, A4, A5, B11, G8, G9 and MVN/Tris (Table 1, Final; Figure 2), and compared with the corresponding results based on the initial, unmodified GAFF LJ parameters. As anticipated, the present parameter adjustments weaken the binding enthalpies, and this leads to improved agreement with experiment for the aliphatic guests, A3, A4, A5 and B11, which are similar to the training set: the MSE and RMSD for these guests improve from −2.8 and 3.0 kcal/mol, respectively, for the initial GAFF parameters, to 1.2 and 1.6 kcal/mol, respectively, for the final optimized parameters. Thus, the parameter adjustment shows promising transferability among aliphatic guest molecules.
Figure 2.
Calculated versus measured binding enthalpies (kcal/mol) for eleven CB7-guest systems. Orange squares: results with initial GAFF parameters. Blue circles: results after two rounds of adjustment of LJ parameters for atom types n and o in GAFF nomenclature. Colored lines: linear regression fits for each set of calculations. Solid black: line of identity, ΔHcalc = ΔHexpt.
The pattern of changes is more complex and less favorable for aromatic guests, which is perhaps not surprising, given that there are no aromatic guests in the training set. As in the case of the aliphatic guests, the new parameters make the computed binding enthalpies of both G8 and G9 less negative (Table 1, Final vs Initial). This change improves the agreement with experiment for G9, because its calculated binding enthalpy was too negative with the initial GAFF parameters, but it worsens the accuracy for G8, because the initial parameters GAFF made the binding enthalpy insufficiently favorable. In contrast, however, the new parameters make the binding more, rather than less, exothermic in the case of MVN/Tris (Table 1). This difference traces to the fact that the host in the absence of MVN is predicted to be occupied by a molecule of Tris, so that binding of MVN is actually a competitive process, as previously observed40. Further analysis of the binding enthalpy calculations indicates that changing from the initial to final LJ parameters changes the computed binding enthalpy of Tris itself to CB7 from - 2.5 to 0.6 kcal/mol, while changing the binding enthalpy of MVN to CB7 in the absence of Tris from −4.7 to −1.9 kcal/mol. Thus, each individual binding event becomes less exothermic, consistent with the results for all of the other guest molecules, but the change for Tris outweighs that for MVN, leading to a more exothermic net result for the displacement of Tris by MVN. Experimental calorimetry data provided subsequent to these calculations support the view that MVN and Tris bind competitively, as the measured binding enthalpy of MVN with CB7 is −6.4 kcal/mol in the absence of Tris and −3.1 kcal/mol in its presence (Prof. Kimoon Kim, personal communication). Overall, the accuracy across all aromatic compounds in the test set does not improve on going from the initial GAFF parameters to the final adjusted ones; the MSE changes from −1.5 to 1.4 kcal/mol, while the RMSD changes from 4.7 to 5.1 kcal/mol. We conjecture that improving the accuracy for these guests would require adjusting parameters specifically associated with the two benzimidazole derivatives, G8 and G9.
The scatter plots in Figure 2 provide an overview of how the parameter adjustment affects all eleven host-guest binding enthalpies. (Additional statistical breakdowns are provided in Table S2.) The main effect of the parameter adjustment is to shift the computed enthalpies in the positive direction, bringing the scatter plot closer to the line of identity. Thus, the MSE improves from −3.1 to 0.8 kcal/mol, indicating removal of the systematic tendency to overestimate the exothermicity of these binding events, and the RMSD across all training and test set compound falls from 4.2 to 2.9 kcal/mol.
3.4. Effect of Optimized Parameters on Computed Binding Free Energies
Sensitivity analysis of the binding free energies (Eq (4)), based on calculations with the initial GAFF parameters, suggested that the LJ parameter adjustments made here would have little effect on the computed binding free energies, with a predicted RMSD of only 0.9 kcal/mol between the initial and final results. This prediction was borne out by a fresh set of binding free energy calculations with the new parameters (“Final” in Table 1), as the RMSD of the new binding free energies, relative to the initial calculations, is 1.1 kcal/mol. In addition, the mean change in the binding free energy across all eleven cases is only 0.5 kcal/mol, much smaller than the mean change of 3.8 kcal/mol in the binding enthalpies (above). Because ΔG = ΔH − TΔS, these results imply that the change in parameters also changes the entropic contribution to the binding free energy by an average of −3.3 kcal/mol. Thus, these computational data provide a clear case of entropy-enthalpy compensation induced by changes in the LJ parameters.
4. DISCUSSION
The present study demonstrates that molecular dynamics calculations of host-guest binding thermodynamics may readily be used to optimize force field parameters. The optimization procedure is quite efficient, because it uses sensitivity analysis, in which the partial derivatives of the binding enthalpy and free energy with respect to the force field parameters are extracted from the binding calculations, at little extra computational cost. Using this approach, we obtained substantial improvement in training-set accuracy with a single round of optimization of host LJ parameters, and a second round led to additional improvement. Encouragingly, similar improvements in accuracy were observed when the new parameters were used to compute the binding thermodynamics of four test-set guest molecules chemically similar to the guests in the training set. On the other hand, application of the new parameters to chemically distinct test-set guest molecules led to mixed results. Overall, these results provide an initial look at the transferability of the parameter adjustments one may expect from the present approach. It is not unexpected that adjustments to the host molecule alone, as in the present initial study, might not suffice to generate accurate results across all guests. In some cases, such as the benzimidazole guests, G8 and G9, adjustments to guest parameters may also be required. More broadly, although the specific adjusted parameters developed here should be useful for modeling the binding of CB7 to other aliphatic guest molecules, we do not expect them to perform better than the original GAFF parameters for other experimental observables, such as the properties of neat liquids.
In the future, we envision a program of force field optimization in which parameters are adjusted against experimental datasets containing not only host-guest binding thermodynamics but also other measurements of proven value for force field optimization, such as the properties of neat liquids and small molecule hydration free energies. Quantum chemistry results, such as the interaction energies of gas-phase dimers or clusters, may also be useful. Given the large amount of available host-guest binding data66,67,69,84–96, we anticipate that this strategy will allow generation of force fields with well-constrained parameters that provide good agreement with experiment for a broadened set of experimental observables. Importantly, the accessibility of partial derivatives of host-guest binding enthalpies and entropies with respect to force field parameters opens the door to including these experimental observables in a variety of systematic parameter optimization schemes9,37,61–65. It is also conceivable that this effort will reveal weaknesses in the functional form of commonly used force fields. For example, perhaps an explicit treatment of electronic polarization will be needed to handle the transfer of guest molecules from water to the less polar interior of a host. If so, including host-guest binding data may help guide the selection and development of efficient force field models with more advanced functional forms.
The computational accessibility of binding enthalpies, binding free energies, and the gradients of these observables with respect to force field parameters, means that such a program of force field optimization can benefit from adjusting parameters against both of these thermodynamic quantities. In particular, if one parameterizes based solely on free energy, one risks getting the right free energies for one’s training set, but with an incorrect enthalpy-entropy balance, and hence reducing the likelihood that the results will be transferable to other systems. It is also worth noting that including both free energies and enthalpies in the training dataset is consistent with existing practice, in which force fields are adjusted based on, for example, enthalpies of vaporization as well as free energies of solvation. More generally, the weak correlation between measured binding enthalpies and free energies for many types of systems46,56 means that these two quantities provide largely independent, and hence complementary, information to the process of parameter optimization. Indeed, we found that adjusting parameters to better replicate measured binding enthalpies can have a remarkably small effect on the computed binding free energies. This means that our adjustment of LJ parameters was associated with entropy-enthalpy compensation, as it made binding less enthalpically favored, but more entropically favored, in almost equal measure. Although these changes resulted from an artificial adjustment of force field parameters, the overall pattern of entropy-enthalpy compensation97–101 is one that has been seen experimentally and discussed for many years. The fact that it appears spontaneously in the present simulations raises the broader possibility that analysis of simulations of experimentally interesting systems can not only help with force field optimization, but will also advance our understanding of the physical chemistry of molecular recognition.
Supplementary Material
Acknowledgments
This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1053575. MKG acknowledges funding from National Institute of General Medical Sciences (GM61300), and from the AFOSR through a Basic Research Initiative (BRI) grant (FA9550-12-1-0414). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the NIGMS, NIH, NSF or USAF. M.K.G. has an equity interest in, and is a cofounder and scientific advisor of VeraChem LLC. We thank Professor Kimoon Kim for providing information regarding the binding thermodynamics of methylviologen in the absence of Tris, and Dr. David Slochower for useful comments.
Footnotes
The authors declare no competing financial interest.
Supporting Information Available: Equations deriving partial derivatives of binding thermodynamics with respect to force field parameters; protocols of calculating binding thermodynamics and numerical uncertainties; derivatives computed from the second round of binding enthalpy calculations with TIP3P water; error analysis of computed binding enthalpies in TIP3P water before and after perturbations; details of the convergence of binding enthalpy derivatives; and comparison of CB7 conformational variables with initial and final parameters. This material is available free of charge via the internet at http://pubs.acs.org.
References
- 1.Hamelberg D, Mongan J, McCammon JA. Accelerated Molecular Dynamics: A Promising and Efficient Simulation Method for Biomolecules. J Chem Phys. 2004;120:11919–11929. doi: 10.1063/1.1755656. [DOI] [PubMed] [Google Scholar]
- 2.Fukunishi H, Watanabe O, Takada S. On the Hamiltonian Replica Exchange Method for Efficient Sampling of Biomolecular Systems: Application to Protein Structure Prediction. J Chem Phys. 2002;116:9058–9067. [Google Scholar]
- 3.Chen J, Brooks CL, Khandogin J. Recent Advances in Implicit Solvent-Based Methods for Biomolecular Simulations. Curr Opin Struct Biol. 2008;18:140–148. doi: 10.1016/j.sbi.2008.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Monticelli L, Kandasamy SK, Periole X, Larson RG, Tieleman DP, Marrink SJ. The MARTINI Coarse-Grained Force Field: Extension to Proteins. J Chem Theory Comput. 2008;4:819–834. doi: 10.1021/ct700324x. [DOI] [PubMed] [Google Scholar]
- 5.Ohmura I, Morimoto G, Ohno Y, Hasegawa A, Taiji M. MDGRAPE-4: A Special-Purpose Computer System for Molecular Dynamics Simulations. Phil Trans R Soc A. 2014;372:20130387. doi: 10.1098/rsta.2013.0387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shaw DE, Grossman JP, Bank JA, Batson B, Butts JA, Chao JC, Deneroff MM, Dror RO, Even A, Fenton CH, et al. SC14: International Conference for High Performance Computing, Networking, Storage and Analysis. 2014. Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer; pp. 41–53. [Google Scholar]
- 7.Anderson JA, Lorenz CD, Travesset A. General Purpose Molecular Dynamics Simulations Fully Implemented on Graphics Processing Units. J Comput Phys. 2008;227:5342–5359. [Google Scholar]
- 8.Hagler AT, Huler E, Lifson S. Energy Functions for Peptides and Proteins. I. Derivation of a Consistent Force Field Including the Hydrogen Bond from Amide Crystals. J Am Chem Soc. 1974;96:5319–5327. doi: 10.1021/ja00824a004. [DOI] [PubMed] [Google Scholar]
- 9.Lifson S, Warshel A. Consistent Force Field for Calculations of Conformations, Vibrational Spectra, and Enthalpies of Cycloalkane and N-Alkane Molecules. J Chem Phys. 1968;49:5116–5129. [Google Scholar]
- 10.Allinger NL. Calculation of Molecular Structure and Energy by Force-Field Methods. Adv Phys Org Chem. 1976;13:1–82. [Google Scholar]
- 11.Scott WRP, Hünenberger PH, Tironi IG, Mark AE, Billeter SR, Fennen J, Torda AE, Huber T, Krüger P, van Gunsteren WF. The GROMOS Biomolecular Simulation Program Package. J Phys Chem A. 1999;103:3596–3607. [Google Scholar]
- 12.Jorgensen WL, Tirado-Rives J. The OPLS Potential Functions for Proteins. Energy Minimizations for Crystals of Cyclic Peptides and Crambin. J Am Chem Soc. 1988;110:1657–1666. doi: 10.1021/ja00214a001. [DOI] [PubMed] [Google Scholar]
- 13.Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. CHARMM: A Program for Macromolecular Energy, Minimization, and Dynamics Calculations. J Comput Chem. 1983;4:187–217. [Google Scholar]
- 14.Bayly CCI, Cieplak P, Cornell WD, Kollman PA. A Well-Behaved Electrostatic Potential Based Method Using Charge Restraints for Deriving Atomic Charges: The RESP Model. J Phys chem. 1993;97:10269–10280. [Google Scholar]
- 15.Weiner SJ, Kollman PA, Case DA, Singh UC, Ghio C, Alagona G, Profeta S, Weinerl P. A New Force Field for Molecular Mechanical Simulation of Nucleic Acids and Proteins. J Am Chem Soc. 1984;106:765–784. [Google Scholar]
- 16.Ponder JW, Wu C, Ren P, Pande VS, Chodera JD, Schnieders MJ, Haque I, Mobley DL, Lambrecht DS, Distasio RA, et al. Current Status of the AMOEBA Polarizable Force Field. J Phys Chem B. 2010;114:2549–2564. doi: 10.1021/jp910674d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Anisimov VM, Lamoureux G, Vorobyov IV, Huang N, Roux B, MacKerell AD. Determination of Electrostatic Parameters for a Polarizable Force Field Based on the Classical Drude Oscillator. J Chem Theory Comput. 2005;1:153–168. doi: 10.1021/ct049930p. [DOI] [PubMed] [Google Scholar]
- 18.Rick SW, Stuart SJ, Berne BJ. Dynamical Fluctuating Charge Force Fields: Application to Liquid Water. J Chem Phys. 1994;101:6141–6156. [Google Scholar]
- 19.Kaminski GA, Stern HA, Berne BJ, Friesner RA. Development of an Accurate and Robust Polarizable Molecular Mechanics Force Field from Ab Initio Quantum Chemistry. J Phys Chem A. 2004;108:621–627. [Google Scholar]
- 20.Wang ZX, Zhang W, Wu C, Lei H, Cieplak P, Duan Y. Strike a Balance: Optimization of Backbone Torsion Parameters of AMBER Polarizable Force Field for Simulations of Proteins and Peptides. J Comput Chem. 2006;27:781–790. doi: 10.1002/jcc.20386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang L, Wu Y, Deng Y, Kim B, Pierce L, Krilov G, Lupyan D, Robinson S, Dahlgren MK, Greenwood J, et al. Accurate and Reliable Prediction of Relative Ligand Binding Potency in Prospective Drug Discovery by Way of a Modern Free-Energy Calculation Protocol and Force Field. J Am Chem Soc. 2015;137:2695–2703. doi: 10.1021/ja512751q. [DOI] [PubMed] [Google Scholar]
- 22.Durrant JD, McCammon JA. Molecular Dynamics Simulations and Drug Discovery. BMC Biology. 2011;9:71. doi: 10.1186/1741-7007-9-71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lopes PEM, Huang J, Shim J, Luo Y, Li H, Roux B, Mackerell AD. Polarizable Force Field for Peptides and Proteins Based on the Classical Drude Oscillator. J Chem Theory Comput. 2013;9:5430–5449. doi: 10.1021/ct400781b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Shi Y, Xia Z, Zhang J, Best R, Wu C, Ponder JW, Ren P. Polarizable Atomic Multipole-Based AMOEBA Force Field for Proteins. J Chem Theory Comput. 2013;9:4046–4063. doi: 10.1021/ct4003702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Momany FA. Determination of Partial Atomic Charges from Ab Initio Molecular Electrostatic Potentials. Application to Formamide, Methanol, and Formic Acid. J Phys Chem A. 1978;82:592–601. [Google Scholar]
- 26.Paesani F, Iuchi S, Voth GA. Quantum Effects in Liquid Water from an Ab Initio-Based Polarizable Force Field. J Chem Phys. 2007;127:074506. doi: 10.1063/1.2759484. [DOI] [PubMed] [Google Scholar]
- 27.Paesani F, Zhang W, Case DA, Cheatham TE, Voth GA. An Accurate and Simple Quantum Model for Liquid Water. J Chem Phys. 2006;125:184507. doi: 10.1063/1.2386157. [DOI] [PubMed] [Google Scholar]
- 28.Ewig CS, Berry R, Dinur U, Hill JR, Hwang MJ, Li H, Liang C, Maple J, Peng Z, Stockfisch TP, et al. Derivation of Class II Force Fields. VIII. Derivation of a General Quantum Mechanical Force Field for Organic Compounds. J Comput Chem. 2001;22:1782–1800. doi: 10.1002/jcc.1131. [DOI] [PubMed] [Google Scholar]
- 29.Jorgensen WL, Maxwell DS, Tirado-Rives J. Development and Testing of the OLPS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids. J Am Chem Soc. 1996;118:11225–11236. [Google Scholar]
- 30.Chapman DE, Steck JK, Nerenberg PS. Optimizing Protein–Protein van Der Waals Interactions for the AMBER ff9x/ff12 Force Field. J Chem Theory Comput. 2013;10:273–281. doi: 10.1021/ct400610x. [DOI] [PubMed] [Google Scholar]
- 31.Oostenbrink C, Villa A, Mark AE, Van Gunsteren WF. A Biomolecular Force Field Based on the Free Enthalpy of Hydration and Solvation: The GROMOS Force-Field Parameter Sets 53A5 and 53A6. J Comput Chem. 2004;25:1656–1676. doi: 10.1002/jcc.20090. [DOI] [PubMed] [Google Scholar]
- 32.Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO, Shaw DE. Improved Side-Chain Torsion Potentials for the Amber ff99SB Protein Force Field. Proteins Struct Funct Bioinforma. 2010;78:1950–1958. doi: 10.1002/prot.22711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Mobley DL, Bayly CI, Cooper MD, Shirts MR, Dill KA. Small Molecule Hydration Free Energies in Explicit Solvent: An Extensive Test of Fixed-Charge Atomistic Simulations. J Chem Theory Comput. 2009;5:350–358. doi: 10.1021/ct800409d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kang M, Smith PE. A Kirkwood-Buff Derived Force Field for Amides. J Comput Chem. 2006;27:1477–1485. doi: 10.1002/jcc.20441. [DOI] [PubMed] [Google Scholar]
- 35.Weerasinghe S, Smith PE. Kirkwood–Buff Derived Force Field for Mixtures of Acetone and Water. J Chem Phys. 2003;118:10663. [Google Scholar]
- 36.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of Simple Potential Functions for Simulating Liquid Water. J Chem Phys. 1983;79:926–935. [Google Scholar]
- 37.Horn HW, Swope WC, Pitera JW, Madura JD, Dick TJ, Hura GL, Head-Gordon T. Development of an Improved Four-Site Water Model for Biomolecular Simulations: TIP4P-Ew. J Chem Phys. 2004;120:9665–9678. doi: 10.1063/1.1683075. [DOI] [PubMed] [Google Scholar]
- 38.Shirts MR, Pande VS. Solvation Free Energies of Amino Acid Side Chain Analogs for Common Molecular Mechanics Water Models. J Chem Phys. 2005;122:134508. doi: 10.1063/1.1877132. [DOI] [PubMed] [Google Scholar]
- 39.Mobley DL, Dumont É, Chodera JD, Dill KA. Comparison of Charge Models for Fixed-Charge Force Fields: Small-Molecule Hydration Free Energies in Explicit Solvent. J Phys Chem B. 2007;111:2242–2254. doi: 10.1021/jp0667442. [DOI] [PubMed] [Google Scholar]
- 40.Fenley AT, Henriksen NM, Muddana HS, Gilson MK. Bridging Calorimetry and Simulation through Precise Calculations of Cucurbituril – Guest Binding Enthalpies. J Chem Theory Comput. 2014;10:4069–4078. doi: 10.1021/ct5004109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Fornabaio M, Spyrakis F, Mozzarelli A, Cozzini P, Abraham DJ, Kellogg GE. Simple, Intuitive Calculations of Free Energy of Binding for Protein-Ligand Complexes. 3. The Free Energy Contribution of Structural Water Molecules in HIV-1 Protease Complexes. J Med Chem. 2004;47:4507–4516. doi: 10.1021/jm030596b. [DOI] [PubMed] [Google Scholar]
- 42.Rarey M, Kramer B, Lengauer T. The Particle Concept: Placing Discrete Water Molecules during Protein- Ligand Docking Predictions. Proteins Struct Funct Genet. 1999;34:17–28. [PubMed] [Google Scholar]
- 43.Hamelberg D, McCammon JA. Standard Free Energy of Releasing a Localized Water Molecule from the Binding Pockets of Proteins: Double-Decoupling Method. J Am Chem Soc. 2004;126:7683–7689. doi: 10.1021/ja0377908. [DOI] [PubMed] [Google Scholar]
- 44.Lu Y, Yang CY, Wang S. Binding Free Energy Contributions of Interfacial Waters in HIV-1 Protease/inhibitor Complexes. J Am Chem Soc. 2006;128:11830–11839. doi: 10.1021/ja058042g. [DOI] [PubMed] [Google Scholar]
- 45.Rekharsky MV, Mori T, Yang C, Ko YH, Selvapalam N, Kim H, Sobransingh D, Kaifer AE, Liu S, Isaacs L, et al. A Synthetic Host-Guest System Achieves Avidin-Biotin Affinity by Overcoming Enthalpy-Entropy Compensation. Proc Natl Acad Sci U S A. 2007;104:20737–20742. doi: 10.1073/pnas.0706407105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Chang C, Gilson MK. Free Energy, Entropy, and Induced Fit in Host - Guest Recognition: Calculations with the Second-Generation Mining Minima Algorithm. 2004:13156–13164. doi: 10.1021/ja047115d. [DOI] [PubMed] [Google Scholar]
- 47.Moghaddam S, Yang C, Rekharsky M, Ko YH, Kim K, Inoue Y, Gilson MK. New Ultrahigh Affinity Host - Guest Complexes of Cucurbit [ 7 ] Uril with Bicyclo [ 2 . 2 . 2 ] Octane and Adamantane Guests : Thermodynamic Analysis and Evaluation of M2 Affinity Calculations. J Am Chem Soc. 2011;133:3570–3581. doi: 10.1021/ja109904u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Moghaddam S, Inoue Y, Gilson MK. Host–Guest Complexes with Protein–Ligand-like Affinities: Computational Analysis and Design. J Am Chem Soc. 2009;131:4012–4021. doi: 10.1021/ja808175m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Geballe MT, Skillman AG, Nicholls A, Guthrie JP, Taylor PJ. The SAMPL2 Blind Prediction Challenge: Introduction and Overview. Journal of Computer-Aided Molecular Design. 2010;24:259–279. doi: 10.1007/s10822-010-9350-8. [DOI] [PubMed] [Google Scholar]
- 50.Muddana HS, Varnado CD, Bielawski CW, Urbach AR, Isaacs L, Geballe MT, Gilson MK. Blind Prediction of Host-Guest Binding Affinities: A New SAMPL3 Challenge. J Comput Aided Mol Des. 2012;26:475–487. doi: 10.1007/s10822-012-9554-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Muddana HS, Fenley AT, Mobley DL, Gilson MK. The SAMPL4 Host-Guest Blind Prediction Challenge: An Overview. J Comput Aided Mol Des. 2014;28:305–317. doi: 10.1007/s10822-014-9735-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wickstrom L, He P, Gallicchio E, Levy RM. Large Scale Affinity Calculations of Cyclodextrin Host-Guest Complexes: Understanding the Role of Reorganization in the Molecular Recognition Process. J Chem Theory Comput. 2013;9:3136–3150. doi: 10.1021/ct400003r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Inoue Y, Hakushi T, Liu Y, Tong LH, Shen BJ, Jin DS. Thermodynamics of Molecular Recognition by Cyclodextrins 1. Calorimetric Titration of Inclusion Complexation of Naphthalenesulfonates with a-, B-, and G-Cyclodextrins: Enthalpy-Entropy Compensation. J Am Chem Soc. 1993;115:475–481. [Google Scholar]
- 54.Rekharsky M, Inoue Y. Chiral Recognition Thermodynamics of Beta-Cyclodextrin: The Thermodynamic Origin of Enantioselectivity and the Enthalpa-Entropy Compensation Effect. Am Chem Soc. 2000;122:4418–4435. [Google Scholar]
- 55.Lai B, Oostenbrink C. Binding Free Energy, Energy and Entropy Calculations Using Simple Model Systems. Theor Chem Acc. 2012;131:1–13. [Google Scholar]
- 56.Bohm H, Klebe G, Böhm HJ. What Can We Learn from Molecular Recognition in Protein–Ligand Complexes for the Design of New Drugs? Angew Chem Int Ed Engl. 1996;35:2588–2614. [Google Scholar]
- 57.Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, et al. CHARMM General Force Field: A Force Field for Drug-like Molecules Compatible with the CHARMM All-Atom Additive Biological Force Fields. J Comput Chem. 2010;31:671–690. doi: 10.1002/jcc.21367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Jorgensen WL, Maxwell DS, Tirado-Rives J. Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids. J Am Chem Soc. 1996;118:11225–11236. [Google Scholar]
- 59.Caleman C, Van Maaren PJ, Hong M, Hub JS, Costa LT, Van Der Spoel D. Force Field Benchmark of Organic Liquids: Density, Enthalpy of Vaporization, Heat Capacities, Surface Tension, Isothermal Compressibility, Volumetric Expansion Coefficient, and Dielectric Constant. J Chem Theory Comput. 2012;8:61–74. doi: 10.1021/ct200731v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Wong CF, Thacher T, Rabitz H. Rev Comput Chem. Wiley-VCH; New York: 1998. Sensitivity Analysis in Biomolecular Simulation. [Google Scholar]
- 61.Di Pierro M, Mugnai ML, Elber R. Optimizing Potentials for a Liquid Mixture: A New Force Field for a Tert -Butanol and Water Solution. J Phys Chem B. 2014 doi: 10.1021/jp505401m. Article ASAP. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Pierro M, Di Elber R. Automated Optimization of Potential Parameters. J Chem Theory Comput. 2013;9:3311–3320. doi: 10.1021/ct400313n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Wang LP, Martinez TJ, Pande VS. Building Force Fields: An Automatic, Systematic, and Reproducible Approach. J Phys Chem Lett. 2014;5:1885–1891. doi: 10.1021/jz500737m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Wang LP, Head-Gordon T, Ponder JW, Ren P, Chodera JD, Eastman PK, Martinez TJ, Pande VS. Systematic Improvement of a Classical Molecular Model of Water. J Phys Chem B. 2013;117:9956–9972. doi: 10.1021/jp403802c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Shell MS. The Relative Entropy Is Fundamental to Multiscale and Inverse Thermodynamic Problems. J Chem Phys. 2008;129:144108. doi: 10.1063/1.2992060. [DOI] [PubMed] [Google Scholar]
- 66.Freeman WA, Mock WL, Shih NY. Cucurbituril. J Am Chem Soc. 1981;103:7367–7368. [Google Scholar]
- 67.Lee JW, Samal S, Selvapalam N, Kim HJ, Kim K. Cucurbituril Homologues and Derivatives: New Opportunities in Supramolecular Chemistry. Acc Chem Res. 2003;36:621–630. doi: 10.1021/ar020254k. [DOI] [PubMed] [Google Scholar]
- 68.Kim HJ, Jeon WS, Ko YH, Kim K. Inclusion of Methylviologen in cucurbit[7]uril. Proc Natl Acad Sci U S A. 2002;99:5007–5011. doi: 10.1073/pnas.062656699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Jeon YJ, Kim SY, Ko YH, Sakamoto S, Yamaguchi K, Kim K. Novel Molecular Drug Carrier: Encapsulation of Oxaliplatin in cucurbit[7]uril and Its Effects on Stability and Reactivity of the Drug. Org Biomol Chem. 2005;3:2122–2125. doi: 10.1039/b504487a. [DOI] [PubMed] [Google Scholar]
- 70.Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and Testing of a General Amber Force Field. 2004;25:1157–1174. doi: 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
- 71.Velez-Vega C, Gilson MK. Overcoming Dissipation in the Calculation of Standard Binding Free Energies by Ligand Extraction. J Comput Chem. 2013;34:2360–2371. doi: 10.1002/jcc.23398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Zwanzig R. High-Temperature Equation of State by a Perturbation Method. I. Nonpolar Gases. J Chem Phys. 1954;22:1420–1426. [Google Scholar]
- 73.Kirkwood JG. Statistical Mechanics of Fluid Mixtures. J Chem Phys. 1935;3:300–313. [Google Scholar]
- 74.Torrie GM, Valleau JP. Nonphysical Sampling Distributions in Monte Carlo Free-Energy Estimation: Umbrella Sampling. J Comput Phys. 1977;23:187–199. [Google Scholar]
- 75.Pearlman DA, Case DA, Caldwell JW, Ross WS, Cheatham TE, DeBolt S, Ferguson D, Seibel G, Kollman P. AMBER, a Package of Computer Programs for Applying Molecular Mechanics, Normal Mode Analysis, Molecular Dynamics and Free Energy Calculations to Simulate the Structural and Energetic Properties of Molecules. Comput Phys Commun. 1995;91:1–41. [Google Scholar]
- 76.Jakalian A, Bush BL, Jack DB, Bayly CI. Fast, Efficient Generation of High-Quality Atomic Charges. AM1-BCC Model: I. Method. J Comput Chem. 2000;21:132–146. doi: 10.1002/jcc.10128. [DOI] [PubMed] [Google Scholar]
- 77.Jakalian A, Jack DB, Bayly CI. Fast, Efficient Generation of High-Quality Atomic Charges. AM1- BCC Model: II. Parameterization and Validation. J Comput Chem. 2002;23:1623–1641. doi: 10.1002/jcc.10128. [DOI] [PubMed] [Google Scholar]
- 78.Wang J, Wang W, Kollman PA, Case DA. Automatic Atom Type and Bond Type Perception in Molecular Mechanical Calculations. J Mol Graph Model. 2006;25:247–260. doi: 10.1016/j.jmgm.2005.12.005. [DOI] [PubMed] [Google Scholar]
- 79.Flyvbjerg H, Petersen HG. Error Estimates on Averages of Correlated Data. J Chem Phys. 1989;91:461–466. [Google Scholar]
- 80.Zwanzig R. Nonlinear Generalized Langevin Equations. J Stat Phys. 1973;9:215–220. [Google Scholar]
- 81.Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR. Molecular Dynamics with Coupling to an External Bath. J Chem Phys. 1984;81:3684–3690. [Google Scholar]
- 82.Darden T, York D, Pedersen L. Particle Mesh Ewald: An N log(N) Method for Ewald Sums in Large Systems. J Chem Phys. 1993;98:10089. [Google Scholar]
- 83.Mobley DL, Bayly CI, Cooper MD, Dill KA. Predictions of Hydration Free Energies from All-Atom Molecular Dynamics Simulations. J Phys Chem B. 2009;113:4533–4537. doi: 10.1021/jp806838b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Lagona J, Mukhopadhyay P, Chakrabarti S, Isaacs L. The Cucurbit[n]uril Family. Angew Chemie. 2005;44:4844–4870. doi: 10.1002/anie.200460675. [DOI] [PubMed] [Google Scholar]
- 85.Rekharsky MV, Inoue Y. Complexation Thermodynamics of Cyclodextrins. Chem Rev. 1998;98:1875–1918. doi: 10.1021/cr970015o. [DOI] [PubMed] [Google Scholar]
- 86.Zimmerman SC, Vanzyl CM. Rigid Molecular Tweezers: Synthesis, Characterization, and Complexation Chemistry of a Diacridine. J Am Chem Soc. 1987;109:7894–7896. [Google Scholar]
- 87.Zimmerman SC. Rigid Molecular Tweezers as Hosts for the Complexation of Neutral Guests. Top Curr Chem. 1993;165:71–102. [Google Scholar]
- 88.Klärner FG, Kahlert B. Molecular Tweezers and Clips as Synthetic Receptors. Molecular Recognition and Dynamics in Receptor-Substrate Complexes. Acc Chem Res. 2003;36:919–932. doi: 10.1021/ar0200448. [DOI] [PubMed] [Google Scholar]
- 89.Sinha S, Lopes DHJ, Du Z, Pang ES, Shanmugam A, Lomakin A, Talbiersky P, Tennstaedt A, McDaniel K, Bakshi R, et al. Lysine-Specific Molecular Tweezers Are Broad-Spectrum Inhibitors of Assembly and Toxicity of Amyloid Proteins. J Am Chem Soc. 2011;133:16958– 16969. doi: 10.1021/ja206279b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Linton B, Hamilton AD. Host-Guest Chemistry: Combinatorial Receptors. Curr Opin Chem Biol. 1999;3:307–312. doi: 10.1016/S1367-5931(99)80047-7. [DOI] [PubMed] [Google Scholar]
- 91.Dotsevi G, Sogah Y, Cram DJ. Total Chromatographic Optical Resolutions Of. Alpha.-Amino Acid and Ester Salts through Chiral Recognition by a Host Covalently Bound to Polystyrene Resin. J Am Chem Soc. 1976;98:3038–3041. doi: 10.1021/ja00426a073. [DOI] [PubMed] [Google Scholar]
- 92.Cram D, Helgeson R. Host-Guest Complexation. 8. Macrocyclic Polyethers Shaped by Two Rigid Substituted Dinaphthyl or Ditetralyl Units. J Org Chem. 1978;43:1930–1946. [Google Scholar]
- 93.Rebek J., Jr Host–guest Chemistry of Calixarene Capsules. Chem Commun. 2000:637–643. [Google Scholar]
- 94.Rebek J, Jr, Nemeth D. Molecular Recognition: Ionic and Aromatic Stacking Interactions Bind Complementary Functional Groups in a Molecular Cleft. J Am Chem Soc. 1986;108:5637–5638. [Google Scholar]
- 95.Rebek J., Jr . Progress in Molecular Recognition. In: Liebman JF, Greenberg A, editors. Environmental Influences and Recognition in Enzyme Chemistry. VCH Publishers; New York: 1988. pp. 291–250. [Google Scholar]
- 96.Chang SK, van Engen D, Fan E, Hamilton AD. Hydrogen Bonding and Molecular Recognition: Synthetic, Complexation, Amd Structural Studies on Barbiturate Binding to an Artificial Receptor. J Am Chem Soc. 1991;113:7640–7645. [Google Scholar]
- 97.Chodera JD, Mobley DL. Entropy-Enthalpy Compensation: Role and Ramifications in Biomolecular Ligand Recognition and Design. Annu Rev Biophys. 2013;42:121–142. doi: 10.1146/annurev-biophys-083012-130318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Sharp K. Entropy-Enthalpy Compensation: Fact or Artifact? Protein Sci. 2001;10:661–667. doi: 10.1110/ps.37801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Fenley AT, Muddana HS, Gilson MK. Entropy-Enthalpy Transduction Caused by Conformational Shifts Can Obscure the Forces Driving Protein-Ligand Binding. Proc Natl Acad Sci U S A. 2012;109:20006–20011. doi: 10.1073/pnas.1213180109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Lumry R, Rajender S. Enthalpy-Entropy Compensation Phenomena in Water Solutions of Proteins and Small Molecules: A Ubiquitous Property of Water. Biopolymers. 1970;9:1125–1227. doi: 10.1002/bip.1970.360091002. [DOI] [PubMed] [Google Scholar]
- 101.Dunitz JD. Win Some, Lose Some: Enthalpy-Entropy Compensation in Weak Intermolecular Interactions. Chem Biol. 1995;2:709–712. doi: 10.1016/1074-5521(95)90097-7. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


