Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2006 Jul 14;91(8):2798–2814. doi: 10.1529/biophysj.106.084301

Absolute Binding Free Energy Calculations Using Molecular Dynamics Simulations with Restraining Potentials

Jiyao Wang *, Yuqing Deng , Benoît Roux *,†
PMCID: PMC1578458  PMID: 16844742

Abstract

The absolute (standard) binding free energy of eight FK506-related ligands to FKBP12 is calculated using free energy perturbation molecular dynamics (FEP/MD) simulations with explicit solvent. A number of features are implemented to improve the accuracy and enhance the convergence of the calculations. First, the absolute binding free energy is decomposed into sequential steps during which the ligand-surrounding interactions as well as various biasing potentials restraining the translation, orientation, and conformation of the ligand are turned “on” and “off.” Second, sampling of the ligand conformation is enforced by a restraining potential based on the root mean-square deviation relative to the bound state conformation. The effect of all the restraining potentials is rigorously unbiased, and it is shown explicitly that the final results are independent of all artificial restraints. Third, the repulsive and dispersive free energy contribution arising from the Lennard-Jones interactions of the ligand with its surrounding (protein and solvent) is calculated using the Weeks-Chandler-Andersen separation. This separation also improves convergence of the FEP/MD calculations. Fourth, to decrease the computational cost, only a small number of atoms in the vicinity of the binding site are simulated explicitly, while all the influence of the remaining atoms is incorporated implicitly using the generalized solvent boundary potential (GSBP) method. With GSBP, the size of the simulated FKBP12/ligand systems is significantly reduced, from ∼25,000 to 2500. The computations are very efficient and the statistical error is small (∼1 kcal/mol). The calculated binding free energies are generally in good agreement with available experimental data and previous calculations (within ∼2 kcal/mol). The present results indicate that a strategy based on FEP/MD simulations of a reduced GSBP atomic model sampled with conformational, translational, and orientational restraining potentials can be computationally inexpensive and accurate.

INTRODUCTION

Molecular recognition phenomena involving the association of ligands to macromolecules with high affinity and specificity play a key role in biology (13). Although the fundamental microscopic interactions giving rise to bimolecular association are relatively well understood, designing computational schemes to accurately calculate absolute binding free energies remains very challenging. Computational approaches currently used for screening large databases of compounds to identify potential lead drug molecules must rely on very simplified approximations to achieve the needed computational efficiency (4). Nonetheless, the calculated free energies ought to be very accurate to have any predictive value. Furthermore, the importance of solvation in scoring ligands in molecular docking has been stressed previously (5).

In principle, free energy perturbation molecular dynamics (FEP/MD) simulations based on atomic models are the most powerful and promising approaches to estimate binding free energies of ligands to macromolecules (611). Indeed, test calculations have shown that FEP/MD simulations can be more reliable than simpler scoring schemes to compute relative binding affinities in important biological systems (12,13), and that it can naturally handle the influence of solvent and dynamic flexibility (14). There is a hope that calculations based on FEP/MD simulations for protein-ligand interactions could become a useful tool in drug discovery and optimization (1522). Nonetheless, despite outstanding developments in simulation methodologies (23), carrying out FEP/MD calculations of large macromolecular assemblies surrounded by explicit solvent molecules often remains computationally prohibitive. For this reason, it is necessary to seek ways to decrease the computational cost of FEP/MD calculations while keeping them accurate.

To simulate accurately the behavior of molecules, one must be able to account for the thermal fluctuations and the environment-mediated interactions arising in diverse and complex systems (e.g., a protein binding site or bulk solution). In FEP/MD simulations, the computational cost is generally dominated by the treatment of solvent molecules. Computational approaches at different level of complexity and sophistication have been used to describe the influence of solvent on biomolecular systems (24). Those range from MD simulations based on all-atom models in which the solvent is treated explicitly (10,25), to Poisson-Boltzmann (PB) continuum electrostatic models in which the influence of the solvent is incorporated implicitly (24,26). There are also semianalytical approximations to continuum electrostatics, such as generalized Born (2731), as well as empirical treatments based on solvent-exposed surface area (3240). However, even though such approximations are computationally convenient, they are often of unknown validity when they are applied to a new situation.

An intermediate approach, which combines some aspects of both explicit and implicit solvent treatments (4143), consists in simulating a small number of explicit solvent molecules in the vicinity of a region of interest, while representing the influence of the surrounding solvent with an effective solvent-boundary potential (4150). Such an approximation is an attractive strategy to decrease the computational cost of MD/FEP computations because binding specificity is often dominated by local interactions in the vicinity of the ligand while the remote regions of the receptor contribute only in an average manner. The method used in this study is called the generalized solvent boundary potential (GSBP) (43). GSBP includes both the solvent-shielded static field from the distant atoms of the macromolecule and the reaction field from the dielectric response of the solvent acting on the atoms of the simulation region. GSBP is a generalization of spherical solvent boundary potential, which was designed to simulate a solute in bulk water (41). In the GSBP method, all atoms in the inner region belonging to ligand, macromolecule, or solvent can undergo explicit dynamics, whereas the influence of the macromolecular and solvent atoms outside the inner region are included implicitly.

It is also possible to reduce the computational cost of FEP/MD simulations and even improve their accuracy by using a number of additional features. For example, the Weeks Chandler Andersen (WCA) separation of the Lennard-Jones potential can be used to efficiently calculate the free energy contribution arising from the repulsive and dispersive interactions (51,52). Furthermore, biasing potentials restraining the translation, orientation, and conformation of the ligand can help enhance the convergence of the calculations (17,21,22,53,54). Such a procedure can provide correct results as long as the effect of all the restraining potentials is rigorously taken into account and unbiased. Combining these elements yields the present computational strategy, which consists in FEP/MD simulations of a reduced GSBP atomic model with enhanced sampling using conformational, translational, and orientational restraining potentials.

In this study, the absolute (standard) binding free energies of eight FK506-related ligands to FKBP12 (FK506 Binding Protein) are calculated using FEP/MD simulations with GSBP to explore the practical feasibility of such a computational strategy. FKBP12 is a rotamase catalyzing the cis-trans isomerization of peptidyl-prolyl bonds (55). FK506 is a key drug used for immunosuppression in organ transplant. It binds strongly to FKBP12 (56) and the FKBP12/FK506 complex, in turn, binds and inhibits calcineurin, thus blocking the signal transduction pathway for the activation of T-cells (57,58). In addition to its obvious importance as a pharmacological target, FKBP12 was chosen in this study for three main reasons. First, crystal structures of FKBP12 in complex with several ligands are available (5961). Second, the binding constants of those FK506-related ligands with FKBP12 have been experimentally determined (60). Third, this system serves as a rich platform to test and validate different computational strategies to estimate binding free energies (6265). This study is part of an ongoing collaborative effort involving two other groups (Pande (63) and J. A. McCammon, personal communication, 2005) with the goal of comparing the results of calculations based on different treatments and approximations but using the same force field (AMBER). Pande and co-workers (63) and Shirts (64) carried out extensive all-atom free energy perturbation (FEP) molecular dynamics (MD) simulations. With the same system, J. M. Swanson and J. A. McCammon (personal communication, 2005) used molecular mechanics/Poisson-Boltzmann-and-surface-area (MM-PBSA), a popular approach that relies on a mixed scheme combining configurations sampled from molecular dynamics (MD) simulations with explicit solvent, with free energy estimators based on an implicit continuum solvent model (66).

In the next two section, the theoretical formulation and the computational details are given. Then, all the results of the computations are presented and discussed in the following section. The article ends with a brief conclusion summarizing the main points.

METHODS

Theoretical formulation

The theoretical formulation for the equilibrium binding constant used here was previously elaborated in Deng and Roux (52). Briefly, the equilibrium binding constant Kb for the process corresponding to the association of a ligand L to a protein P, L + P Inline graphic LP, can be expressed as

graphic file with name M2.gif (1)

where L represents the coordinates of the ligand (only a single ligand needs to be considered at low concentration), X represents the coordinate of the solvent and the protein, β ≡ 1/kBT, U is the total potential energy of the system, rL is the position of the center-of-mass of the ligand, and r* is some arbitrary position (far away) in the bulk solution. The subscripts site and bulk indicate that the integrals include only configurations in which the ligand is in the binding site or in the bulk solution, respectively. Eq. 1 can be related to the double decoupling method (17,21), though the derivation in Deng and Roux (52) proceeds from population configurational ensemble averages rather than the traditional treatment that consists in equating the chemical potentials of the three species, L, R, and LR. In particular, it should be noted that, Kb has dimension of volume because of the δ-function δ(rLr*) in the denominator. This δ-function arises from the translational invariance of the ligand in the bulk volume (see (52)).

For computational convenience, the reversible work for the entire association/dissociation process is decomposed into eight sequential steps during which the interaction of the ligand with its surrounding (protein and solvent) as well as various restraining potentials are turned “on” and “off” (see Appendix A). Various potentials restraining the conformation, position, and orientation of the ligand are used throughout the step-by-step process. Those are designed to reduce the conformational sampling workload of the free energy simulations by biasing the ligand to be near its bound configuration (conformation, position, and orientation) as it becomes completely decoupled from its surrounding. This approach has the advantage of focusing the sampling on the most relevant conformations, though it is essential that the biasing effect of the restraining potentials be rigorously handled and that the final result from the computation be independent of the restraints. The usage of biasing restraints in computations of binding free energies goes back to early work by Hermans and Subramaniam (67), with a number of recent variants (21,22,5254).

The translational and orientational restraining potentials are constructed from three point-positions defined in the protein (Pc, P1, and P2) and three point-positions defined in the ligand (Lc, L1, and L2) (Fig. 3). Specifically, Pc is the center-of-mass of the protein residues forming the binding site, and Lc is the center-of-mass of the ligand. P1 and P2 are the center-of-mass of two groups of atoms in the protein, while L1 and L2 are the center-of-mass of two groups of atoms in the ligand. The choice of the six reference point-positions is more or less arbitrary, as long as they are not co-linear and allow us to define the orientation of the ligand relative to the protein. The translational restraint is defined as ut = 1/2[kt(rLr0)2 + ka(θLθ0)2 + ka(φLφ0)2], where rL is the distance PcLc, θL is the angle P1PcLc, and φL is the dihedral angle P2P1PcLc; kt and ka are the force constants, and r0, θ0, and φ0 are the average values of the fully interacting ligand in the binding site taken as a reference. Similarly, the orientational restraining potential is defined as ur = 1/2[ka(αLα0)2 + ka(βLβ0)2 + ka(γLγ0)2], where the angle αL (PcLcL1), the dihedral angle βL (P1PcLcL1), and the dihedral angle γL (PcLcL1L2) are three angles defining the rigid body rotation; ka is the force constant, and α0, β0, and γ0 are the reference values taken from the fully interacting ligand in the binding site. Generally, the reference values and the force constants are taken from an average based on an unbiased simulation of the fully interacting ligand in the binding site. The magnitude of the force constants is estimated from the fluctuations of its associated coordinates as kxkBT/〈Δx2〉. This has been shown to yield the optimal biasing in free energy perturbations (53). The conformational restraining potential uc is also constructed as a quadratic function, uc = kc(ζ[L;Lref])2, where kc is a force constant, and ζ is the root mean-square deviation (RMSD) of the ligand coordinates L relative to the average structure of the fully interacting ligand in the binding site Lref, taken as a reference structure.

FIGURE 3.

FIGURE 3

Translational and rotational restraints on ligand 8. Three random point-positions in the protein (Pc, P1, P2) and three random point-positions in the ligand (Lc, L1, and L2) were chosen to set up the translational and rotational restraints. They are the center-of-mass of the protein (green) in the sphere, residue Val-101, residue Tyr-26, the ligand (red), atoms labeled in red in Fig. 1, and atoms labeled in blue in Fig. 1, respectively. These positions are shown in yellow spheres with dashed blue lines connecting them. The translational restraint is defined by r (PcLc), θ (P2PcLc), and φ (P1P2PcLc). The rotational restraint is defined by α (PcLcL1), β (P2PcLcL1), and γ (PcLcL1L2).

With these definitions, the sequential steps corresponding to the dissociation process with the fully interacting ligand in the protein binding site as initial state are (see also Table A1 in Appendix A):

  1. A potential uc is applied to the fully interacting ligand (U1) in the binding site to maintain its conformation near the average bound state.

  2. A potential ut is applied to the center-of-mass of the fully interacting ligand (U1) restrained by uc to maintain its relative position in the binding site.

  3. A potential ur is applied to the fully interacting ligand (U1), restrained by uc and ut, to maintain its relative orientation in the binding site.

  4. The interactions of the ligand, restrained by uc, ut, and ur, with the binding site are turned off (decoupling: U1U0).

  5. The potential ur applied to the decoupled ligand (U0), restrained by uc and ut, is released.

  6. The restraining potential ut applied to the decoupled ligand (U0), restrained by uc, is released.

  7. The interaction of the ligand, restrained by uc, with the surrounding bulk solution is turned on (coupling: U0U1).

  8. The potential uc applied to the fully interacting ligand in the bulk solution (U1) is finally released.

As shown in Appendix A, the standard binding free energy ΔG°bind is given by

graphic file with name M3.gif (2)

where Inline graphic, Inline graphic, Inline graphic, Inline graphic, kBT ln Fr, kBT ln(FtC°), Inline graphic, and Inline graphic correspond to the reversible work done in Steps 1–8, respectively. Since the ligand is decoupled from its environment in Steps 5 and 6, the factor Fr can be evaluated as a numerical integral over three rotation angles, and the factor Ft can be evaluated as a numerical integral over the translation of the ligand center-of-mass in three-dimensional space. The constant C° insures conversion to the standard state concentration (= 1 M or 1/1661 Å−3). All the remaining ΔG contributions must be calculated using FEP/MD simulations. It is useful to combine the corresponding contributions in Eq. 2 and express the standard binding free energy as

graphic file with name M10.gif (3)

where Inline graphic corresponds to the free energy contribution arising from the interactions of the ligand with its surrounding (bulk and/or protein), while Inline graphic, Inline graphic, and Inline graphic correspond to the conformational, translational, and orientational restriction of the ligand upon binding, respectively. Equation 3 makes the interpretation of each contribution intuitively clear (see below). Lastly, if the ligand has symmetry and can bind in a number of equivalent ways, it is necessary to include the effect of the symmetry factor n as − kBT ln(n).

PRACTICALITIES

Translational and orientational contributions

It is customary to describe bimolecular binding as a process in which a ligand free in solution loses translational and orientational degrees of freedom, as it associates with the protein. The unfavorable contribution to the standard binding free energy caused by the loss of freedom is compensated for, as the ligand gains favorable interactions with proteins. In this regard, it is informative to consider Inline graphic, the free energy contribution associated with the translation of the ligand, obtained by combining Inline graphic and the factor Ft,

graphic file with name M17.gif (4)

where Inline graphic is the probability distribution of ligand position in the binding site. If the translational restraining potential ut(rL) is strong and centered on rm—the most probable position of the ligand center-of-mass in the binding site (the maximum of Inline graphic)—the probability distribution with the restraint is sharply peaked at rm,

graphic file with name M20.gif (5)

and the translational contribution is

graphic file with name M21.gif (6)

where ΔV is an effective accessible volume for the center-of-mass of the ligand in the binding site. This volume, which is evaluated naturally in units of Å3 with MD simulations, can be converted to the standard state volume by the constant C°. One may note that the effective volume ΔV is typically on the order of ∼1 Å3. Therefore, for all practical purposes, it is always much smaller than the standard state volume of 1661 Å3, e.g., a ΔV equal to 1 Å3 (a typical value) yields the well-known standard state offset factor −kBT ln(C°) of 4.4 kcal/mol. For this reason, the reduction in translational freedom of the ligand makes an unfavorable contribution to binding free energy.

Similarly, it is informative to consider the total free energy contribution associated with the rotation of the ligand ΔΔGr obtained by combining Inline graphic and Fr,

graphic file with name M23.gif (7)

where Inline graphic is the distribution of the orientation angles (this Inline graphic depends on ut). In the limit of strong rotational restraint potential ur(Ω), the bias potential acts essentially as a δ-function,

graphic file with name M26.gif (8)

which is sharply peaked at Ωm, the maximum of Inline graphic, i.e., the most probable orientation of the ligand in the binding site. For a nonlinear ligand, it follows that

graphic file with name M28.gif (9)

It may be noted that the factor ΔΩ/8π2 is necessarily smaller than (or equal to) 1. For this reason, the reduction in rotational freedom of the ligand always makes an unfavorable contribution to binding free energy.

The above analysis shows that reduction in both translational and orientational freedom yield unfavorable contributions to the binding free energy. To clarify the significance of this result further, it is useful to relate ΔV and ΔΩ to the properties of the bound ligand. Assuming that the thermal fluctuations of the (fully interacting) ligand in the binding site are Gaussian, ΔV has the closed-form expressions

graphic file with name M29.gif (10)

and ΔΩ,

graphic file with name M30.gif (11)

where Inline graphic represent the thermal fluctuations of each variable. Such Gaussian approximation may be advantageous if one is attempting to estimate the translational and orientational contributions to the standard binding free energy using only the information extracted from an unbiased simulation of the fully interacting ligand, i.e., without actually performing FEP/MD simulations. One may note also some similarity with the MM-PBSA scheme (68), in which the translational and orientational contributions are estimated using a quasi-harmonic approximation (69,70).

Solvation free energy of the ligands

Step 7 provides the solvation free energy of a ligand that is restrained by uc to remain near its bound conformation. This does not correspond to the true solvation free energy of a flexible ligand (e.g., the process ligand in vacuum ⇌ ligand in solvent). The latter may be expressed as

graphic file with name M32.gif (12)

where Inline graphic is the free energy corresponding to applying the conformational restraint on the ligand decoupled from its surrounding (≡ vacuum). The values Inline graphic and Inline graphic are the same as defined above. Therefore, one additional quantity (Inline graphic) must be computed if one is interested in evaluating the solvation free energy of the ligand. For the sake of comparison with the results of Pande, Shirts and co-workers (63,64), we also computed the solvation free energy of the ligands, though in practice, this quantity is not required to compute the standard binding free energy.

Atomic models and computational details

The eight FK506-related ligands (ligands 2, 3, 5, 6, 8, 9, 12, and 20) are shown in Fig. 1. These ligands are numbered according to previous experimental (60) and computational work (63). Ligand 20 is the molecule FK506 (56). Three types of starting structures were considered for the computations. The first set comprises the crystal structures with ligands 8, 9, and 20 (PDB code 1FKG, 1FKH, and 1FKJ, respectively). The second set corresponds to models for ligands 3 and 5 obtained by construction from the crystal structure of FKBP12 in complex with ligand 9. Replacing the cyclohexyl group of ligand 9 with a hydrogen gives ligand 5, while replacing the phenylmethyl group of ligand 5 with a hydrogen gives ligand 3. Ligands 3 and 5 are highly similar to ligand 9, and the direct modeling is justifiable. The third set was provided by M. R. Shirts and V. S. Pande (personal communication, 2005); it corresponds to atomic coordinates of docking models of ligands 2, 3, 5, 6, and 12 and crystal structures for ligands 8, 9, and 20, followed by 200 ps of MD simulations with explicit solvent. In all the tables, the three sets are referred to as x-ray, mod, and MD, respectively. The CHARMM biomolecular simulation program was used for all the simulations. To compare with previous calculations by Pande, Shirts and co-workers (63,64) and J. M. Swanson and J. A. McCammon (personal communication, 2005), the same atomic force field was used in this study. The force field for the protein is AMBER99, and that for the ligands is from the 2002 version of general AMBER force field (71) as provided by M. R. Shirts (personal communication, 2005)). The charges of the ligands are from AM1/BCC (72). The conversion of the AMBER force field to CHARMM format is given in Appendix B.

FIGURE 1.

FIGURE 1

Structural formulae of the eight ligands used in the calculation. Ligands 2, 5, 6, 8, and 9 have one or two physically symmetric units (phenyl or cyclohexyl group). Flat-bottom dihedral restraints were applied on these symmetric units to prevent exchange between physically equivalent conformers. Ligand 20 is also referred to as FK506 in the literature (60). The atoms labeled in red and blue are the atom used to define the point-positions L1 and L2, respectively in Fig. 3.

The GSBP method (73,74), implemented in the biomolecular simulation program CHARMM (75), was used to solvate a spherical region centered on the FKBP12 binding site. In GSBP, the system is divided into an outer and an inner region. In the inner region, the ligand, the solvent molecules, and part of the macromolecule are simulated explicitly with MD. In the outer region, the remaining protein atoms are included explicitly while the solvent is represented as a continuum dielectric medium. The influence of the surrounding outer region on the atoms of the inner region is represented in terms of a solvent-shielded static field and a solvent-induced reaction field. The reaction field due to changes in charge distribution in the dynamic inner region is expressed in terms of a basis set expansion of the inner simulation region charge density. The basis set coefficients correspond to generalized electrostatic multipoles. The solvent-shielded static field from outer macromolecular atoms and the reaction field matrix, representing the couplings between the generalized multipoles, are both invariant with respect to the configuration of the explicit atoms in the inner simulation region. They are calculated only once for macromolecules of arbitrary geometry using the finite-difference PB equation, leading to an accurate and computationally efficient hybrid MD/continuum method for simulating a small region of a large biological macromolecular system. A spherical inner region of 15 Å radius was used for all the ligands. The size of the GSBP simulated systems is typically ∼2500 atoms. The systems were hydrated with a fixed number of water molecules, though this could be generated dynamically using grand canonical Monte Carlo (76). Dielectric constants of 80 and 4 were assumed for the solvent and the protein in the outer region, respectively. The static field arising from the protein charges in the outer region and the generalized reaction field matrix including five electric multipoles were calculated using the PBEQ module (77,78) of CHARMM (75) and stored for efficient simulations. A spherical restraining potential was applied to keep the water molecules from escaping the inner region using the MMFP GEO command. The spherical GSBP simulation system is illustrated in Fig. 2 in the case of ligand 8. During the simulation, protein atoms near the edge of the boundary are fixed while a nonpolar potential keeps the water molecules inside the sphere. Each system of ligand/FKBP12 solvated with GSBP was equilibrated for 2 ns at 300 K using Langevin dynamics. A friction coefficient of 5 ps−1 was assigned to all nonhydrogen atoms. A time-step of 2 fs was used. The average structure of the ligand was calculated from the equilibration trajectory (typically from 0.4 ns to 2 ns), which was then used as a reference structure Lref in the conformational restraining potential uc. The fluctuations of the six internal variables (rL, θL, φL, αL, βL, and γL) used in the translational and rotational restraining potentials were monitored to estimate the force constants for the biasing restraining potentials.

FIGURE 2.

FIGURE 2

A sphere containing FKBP12, ligand 8, and water molecules in the GSBP method. The crystal structure (pdb code: 1FKG) of the complex of FKBP12 (green, cartoon) and ligand 8 (red, ball-and-stick) is solvated in water molecules (blue, small spheres). In the GSBP method, only atoms in a sphere (15 Å radius) centered on the ligand are represented explicitly. All atoms outside the sphere were removed and were represented implicitly using the continuum method. The dielectric constants of the solvent and the protein in the outer region are 80 and 4, respectively.

Protocol for binding free energy (steps 1–8)

Conformational restraints (steps 1 and 8)

For better accuracy, the free energies associated with the conformational restriction of the ligand near the reference conformation, Inline graphic and Inline graphic (Steps 1 and 8), was not obtained directly by FEP/MD simulations, but was calculated by integration of the Boltzmann factor of the RMSD potential of mean force (PMF) obtained from umbrella sampling simulations. For the ligand in the bulk, Inline graphic is given by

graphic file with name M40.gif (13)

where Inline graphic is the ligand PMF as a function of the RMSD relative to Lref in bulk solution. Similar expressions hold for Inline graphic and Inline graphic. The umbrella sampling method (79) was used to evaluate the PMF as a function of RMSD. To insure a uniform sampling of the RMSD, the simulations were generated using a quadratic biasing potential of the form kc(ζ[L;Lref] − ζi)2 centered on successive values of ζi. Specifically, 21 biasing windows were used with the RMSD offset value increasing from 0.0 to 4.0 Å in steps of 0.2 Å for the ligand in the binding site, and 21 windows were used with the RMSD offset value increasing from 0.0 to 5.0 Å in steps of 0.25 Å for the ligand in the bulk solution. The initial configurations for the 21 umbrella sampling windows were generated using a short initial run with a strong force constant kc (500 kcal/mol/Å2). Then, each window was equilibrated using a force constant of 10 kcal/mol/Å2, and after sufficient equilibration; a 1-ns simulation was used for sampling. No translational or orientational restraining potential is present during those simulations. The weighted histogram analysis method (8082) was used to unbias the results and compute the PMF as a function of RMSD. An optimal force constant for the conformational restraining potential uc = kcζ2 was determined from the RMSD-PMF of the ligands in solution and in the binding site. Specifically, a value of 10 kcal/mol/Å2 was chosen for kc so that the normalized Boltzmann probability of the conformationally restrained ligand in the bulk or bound to the protein both have their most probable values around the same small RMSD (∼0.5–1.0 Å).

Some ligands have symmetric structural elements (e.g., the two phenyl groups of ligand 8 as shown in Fig. 1), which can undergo isomerization and exchange between physically equivalent conformations. Sampling all these (physically indistinguishable) conformations may become prohibitively slow when the ligand is in the binding site, but less so in the solvent. With finite-length trajectories, isomerizations could take place frequently during the FEP/MD simulation of the ligand in solvent, but not in those of the bound complex. A proper accounting of the relative conformational entropy cost upon ligand binding will be compromised by such nonequivalence in sampling one state of the system, but not the other. This problem can be avoided by limiting the conformational space to a single one of the physically equivalent rotamers of the ligand in all the FEP/MD simulations. In the present calculations, a steep flat-bottom dihedral restraining potential was applied to all the symmetric units of the ligand to prevent exchange between identical rotameric states during the simulations. It should be noted that such flat-bottom restraining potential does not affect the physical properties of the ligand and the final binding free energy as long as it is present during all the computations, with the ligand in the solvent and in the binding site. The restriction was applied to the ligand during all the free energy calculations involving the conformational restraining potential uc (the ligand in the binding site, in solution, and in vacuum for the solvation free energy calculations). The force constant for the flat-bottom restraint is 500 kcal/mol/rad2.

Translational and rotational restraints (Steps 2 and 3)

The free energies corresponding to the translational and rotational restraints with the ligand in the binding site, Inline graphic and Inline graphic, were calculated using FEP/MD simulations. The translational (ut) and rotational (ur) restraints were gradually turned on via the linear coupling parameters κt and κr (with values of 0.0, 0.025, 0.05, 0.075, 0.1, 0.2, 0.4, 0.6, 0.8, and 1.0). The six point-positions in the protein (Pc, P1, and P2) and the ligand (Lc, L1, and L2) used to define the relative position and orientation of the ligand with respect to the protein are illustrated in Fig. 3 in the case of ligand 8.

Interaction energy (Steps 4 and 7)

The contribution corresponding to the interaction energy of the ligand with its surrounding, Inline graphic and Inline graphic, were calculated with FEP/MD simulations. For this purpose, the Lennard-Jones (LJ) potential was separated into repulsive and dispersive free energy using the Weeks Chandler Andersen (WCA) method (51,83). A nonlinear coupling parameter s was introduced to control the repulsive part, and a linear coupling parameter ξ was introduced to control the dispersive part. Furthermore, a linear coupling parameter λ was introduced to control the electrostatic interactions. Such separation of the potential with these coupling parameters permits an accurate evaluation of the free energy and a clear step-by-step decomposition of the nonbond interactions into repulsive, dispersive, and electrostatic contributions (52). Though the free energy decomposition of the nonbonded interactions formally depends on the order in which each of these contributions are activated, introducing the repulsive core in the first step is an unavoidable physical necessity. A molecular entity with dispersive attraction and charges, but no core repulsion, makes no physical sense (the energy does not have a lower bound). The dispersion and electrostatics could be interchangeably introduced as in second or third steps, with little impact on the free energy decomposition, because those do not greatly affect the structure of the solvent. Therefore, the resulting free energy decomposition can lead to useful observations because the step-by-step FEP/MD procedure goes through physically meaningful intermediates. In the context of a particle insertion process, the repulsive interaction is first introduced gradually (s = 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and 1), while the dispersive and electrostatic interactions are turned off (ξ = 0 and λ = 0). Then, in the presence of the repulsive interaction (s = 1), the dispersive interaction is turned on gradually (ξ = 0.0, 0.25, 0.5, 0.75, and 1.0) while the electrostatic interaction is turned off (λ = 0). Finally, the electrostatic interaction are gradually turned on (λ = 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and 1.0) in the presence of both the repulsive and dispersive interactions. For each set of coupling parameters, 120-ps Langevin dynamics was generated and averages were calculated using the last 100 ps (additional simulations of 1 ns were generated to check the convergence). The results were unbiased using the weighted histogram-analysis method facility of CHARMM. The free energy components (repulsive, dispersive, and electrostatic free energies) were calculated for the ligand in the binding site or in the bulk solution. For the GSBP simulations, the atoms of the protein in the outer region are not kept explicitly for the sake of computational efficiency. All the FEP window simulations were generated concurrently starting from the structure of the equilibrated system. The contribution from the long-range van der Waals dispersive interaction with the (missing) atoms of the outer region (solvent and protein) was evaluated from a snapshot of a large all-atom model of the solvated protein with its ligand, and was included in the free energy. The large system was built with 8000 water molecules (length ∼ 60 Å) and the full protein for the ligand in the binding site. For the ligand in the bulk solution, 1000 water molecules (length of ∼30 Å) were used. The difference between the bulk and the binding site nearly cancel out, with the magnitude of each component being on the order of −1.0 kcal/mol. For example, the long-range corrections of ligand 8 in the binding site and in the bulk are −0.9 and −1.5 kcal/mol, respectively. Though the magnitude of the long-range correction depends on the cutoff used for the nonbonded interactions, its net impact on the total binding free energy is similar to the values of Pande, Shirts and co-workers (63,64) for similar cutoff (see Table 5.2 in (64)).

All the results are given in Table 1.

TABLE 1.

Nonbond free energy components for the binding of ligands to FKBP12

Lig Struct Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
2 MD 30.8 −47.3 −17.5 30.5 −28.8 −13.6
3 mod 34.9 −47.7 −18.1 29.5 −26.2 −13.5
3 MD 35.7 −45.5 −23.8 25.4 −21.3 −19.8
5 mod 35.9 −51.3 −21.2 30.6 −27.3 −15.2
5 MD 36.4 −53.1 −21.0 34.5 −32.1 −16.3
6 MD 40.4 −57.4 −25.9 38.9 −38.2 −19.6
8 X ray 41.2 −62.3 −22.1 42.3 −41.2 −18.4
8 MD 45.4 −63.1 −22.2 44.4 −42.0 −18.8
9 X ray 44.5 −61.6 −21.4 45.3 −43.2 −15.9
9 MD 51.0 −66.0 −21.0 45.1 −42.3 −15.9
12 MD 41.6 −58.1 −28.8 34.2 −31.4 −24.6
20 X ray 58.8 −84.4 −31.1 59.4 −61.9 −32.5
20 MD 56.5 −84.0 −30.9 57.0 −60.2 −32.2

Three sets of structures were used. X-ray means x-ray crystal structure; mod means the structure is modified from the crystal structure of ligand 9 by replacing some groups with hydrogen. MD means the structure is modeled from crystal structures and equilibrated for ∼200 ps by Shirts (64). The nonbond free energy components include repulsive (ΔGrep), dispersive (ΔGdis), and electrostatic (ΔGelec) free energy. The long-range van der Waals correction is included in ΔGdis. The superscripts site and bulk indicate the ligand is in the binding site or in the bulk solution, respectively.

Translational and orientational factors (Steps 5 and 6)

The translational factor Ft was calculated numerically from the expression

graphic file with name M48.gif (14)

where ut = 1/2[kt(rLr0)2 + ka(θLθ0)2 + ka(φLφ0)2] is a quadratic translational restraining potential. The value kt is the force constant for the distance restraint, and ka is the force constant for the angle and dihedral restraints; r0, θ0, and φ0 are the reference values of the distance, angle, and dihedral determined from an average of the equilibration trajectory and subsequently used to define the position of the ligand.

Similarly, the orientational factor Fr was calculated numerically from the expression

graphic file with name M49.gif (15)

where ur = 1/2[ka(αLα0)2 + ka(βLβ0)2 + ka(γLγ0)2] is a quadratic orientational restraining potential. The value ka is the force constant for the angle and dihedral restraints; α0, β0, and γ0 are the reference values of the angle and dihedrals determined from an average of the equilibration trajectory and subsequently used to define the orientation of the ligand.

RESULTS AND DISCUSSION

The calculated standard binding free energy with the various components are given in Table 2 for the eight FK506-related ligands shown in Fig. 1. In the following, the various steps of the FEP/MD methodology for computing the standard binding free energy are illustrated in the case of ligand 8. Then, general observations are made about the results for the eight ligands.

TABLE 2.

Free energy components for the binding of ligands to FKBP12

Lig Struct Inline graphic Inline graphic + ΔΔGc + ΔΔGt + ΔΔGr Inline graphic
2 MD −34.0 11.8 2.2 3.2 4.4 −12.3
3 mod −30.9 10.2 3.1 3.7 5.2 −8.7
3 MD −33.6 15.7 2.0 3.3 3.5 −9.0
5 mod −36.7 12.0 5.5 3.5 5.4 −10.2
5 MD −37.8 13.9 3.5 3.5 5.2 −11.6
6 MD −42.9 19.0 5.5 3.3 5.4 −9.7
8 X ray −43.3 17.2 6.9 3.4 5.4 −10.3
8 MD −39.9 16.3 5.6 3.3 4.2 −10.3
9 X ray −38.6 13.8 4.5 3.5 5.0 −11.7
9 MD −35.9 13.0 3.4 3.4 4.6 −11.6
12 MD −45.2 21.8 1.0 3.4 5.3 −13.7
20 X ray −56.7 34.9 2.7 3.3 5.6 −10.1
20 MD −58.4 35.5 3.3 3.4 5.4 −10.8

Note each free energy component was corrected with the sign already. The ligands are labeled as in Table 1. The value Inline graphic and Inline graphic are the total nonbond free energies, which are equal to the sum of nonbond free energy components in Table 1. The values ΔΔGc, ΔΔGt°, and ΔΔGr are the net free energies corresponding to the conformational restraints, the translational restraint, and the rotational restraint, respectively (Eq. 3). The value ΔGbind° is the standard binding free energy.

Illustrating the FEP/MD method with ligand 8

The reduced FKBP12/ligand GSBP system is shown in Fig. 2. Only the atoms in the inner region (defined as a sphere of 15 Å radius) are simulated explicitly. The inner region includes the ligand (red, ball-and-stick), the water molecules (blue, small spheres), and a part of the protein (green, cartoon). The influence of the remaining atoms in the outer region (protein and solvent) is included implicitly. The static and reaction field arising from the protein and solvent in the outer region is evaluated using a continuum electrostatic approximation. The static field represents the electrostatic influence from the protein charges in the outer region, shielded by the complex geometry of the protein-solvent interface. The reaction field represents the polarization of the dielectric solvent in the outer region in response to the explicit charges in the inner region.

First, the FKBP12/ligand GSBP system was equilibrated (for 2 ns) with Langevin dynamics without any biasing restraint. The final snapshot of the equilibration (rather than some average) was used as the input structure for FEP/MD simulations. Fig. 4 A shows the RMSD fluctuations of the nonhydrogen atoms of the protein (black), ligand (red), and the fluctuation of the center-of-mass of the ligand (green). The simulated system appears to be very stable. Fig. 4 B shows the fluctuation of the six relative coordinates (rL, θL, φL, αL, βL, and γL), which are used subsequently to implement translational and rotational restraining potential. Those internal coordinates are defined from six point-positions constructed from the Cartesian coordinates of the protein-ligand complex (see Fig. 3). The RMS fluctuation in the distance rL (black) is ∼0.5 Å, while those of the angles are ∼10°. As shown previously, an optimal value of the force constant for a restraining potential is to choose kxkBT/〈Δx2〉 (53). Thus, for the FEP/MD simulations, the force constants for the distance restraint and the angle/dihedral restraint were chosen as 1 kcal/mol/Å2 and 200 kcal/mol/rad2, respectively.

FIGURE 4.

FIGURE 4

Equilibration of the GSBP system containing ligand 8 in complex with FKBP12. (A) RMSDs of the heavy atoms in the protein (black) and in the ligand (red), and the fluctuation of the center-of-mass of the ligand (green). (B) The fluctuation of the six parameters used to define the translational and rotational restraints. The curves are labeled with r (black), θ (red), φ (green), α (blue), β (yellow), and γ (brown), respectively. (C) The fluctuation of the two dihedrals next to the two symmetric phenyl groups shown in Fig. 1. The fluctuation of φ1 is shown in black and the fluctuation of φ2 is shown in red.

Nonbond interaction free energy

The nonbond free energy calculation bears the major part of the computational cost to calculate the standard binding free energy. The conformational, translational, and orientational restraining potentials are applied on the ligand during the FEP/MD simulations (Steps 4 and 7). (The translational and orientational potentials have no influence on the FEP/MD simulations of the ligand in the isotropic bulk.) As a result, the uncoupled ligand retains its bound conformation (on average), and remains in place even after its interactions with the protein and the solvent are turned off. This is clearly advantageous because a floppy and uncoupled ligand wandering freely in the simulation system dramatically increases the size of the configurational space that needs to be explored during the FEP/MD simulations, which can give rise to significant sampling problems. A faster convergence can be achieved using restraining potentials (see also the discussion in Boresch et al. (21) and Woo and Roux (22)). However, one should keep in mind that the specific values for the nonbonded contribution depend on the applied translational, orientational and conformational restraints; variations on the order of 1.0 kcal/mol between different runs with slightly different reference values for the restraints are common.

To further improve the convergence, the LJ interaction is separated into dispersive and repulsive free energy using the WCA decomposition (51). This separation of the LJ potential into pure repulsive and dispersive parts is somewhat arbitrary (others would be possible), though it has the advantage of clearly identifying positive- and negative-definite contributions to the free energy, respectively. The nonbond free energy components (dispersive, repulsive, and electrostatic free energies) are calculated with FEP/MD simulations by gradually varying the coupling parameters ξ, s, and λ, respectively (see Methods). Fig. 6, AC, shows the nonbond free energy components as a function of the corresponding coupling parameters for ligand 8 in the binding site (solid square) and in the bulk solution (dashed triangle), respectively. The repulsive free energy (Fig. 6 A) both in the binding site (solid) and in the bulk solution (dashed) is ∼40 kcal/mol. Thus, the repulsive part of the LJ potential makes only a small net contribution to the binding free energy. In contrast, the dispersive free energy (Fig. 6 B) in the binding site (solid) is much larger than that in the bulk solution (dashed). The difference is ∼−20 kcal/mol, strongly favoring ligand binding. The electrostatic free energy (Fig. 6 C) in the binding site (solid) is ∼4 kcal/mol more negative than that in the bulk solution (dashed), and thus contributes favorably to binding. Therefore, the dispersive van der Waals attraction strongly favors binding. In other words, an isosteric nonpolar analog to ligand 8 would still bind to FKBP, even without any electrostatic contributions from ligand partial-charges. This is a general observation for all the ligands considered here (see below).

FIGURE 6.

FIGURE 6

The free energy components are gradually turned on as the parameters increase from 0 to 1 for ligand 8 in complex with FKBP12. The nonbond free energy components (AC) for the ligand in the binding site (solid line, square) and the ligand in the bulk solution (dashed line, triangle) are shown as solid and dashed lines, respectively. (A) Repulsive free energy. (B) Dispersive free energy. (C) Electrostatic free energy. (D) The free energies corresponding to the translational (shown with circle symbols and κt) and rotational (shown with ♦ and κr) restraints applied to the ligand in the binding site.

RMSD potential of mean force

A key feature of the present strategy is the RMSD restraining potential uc, designed to keep the ligand near the bound conformation. Such restraining potential is an effective device to control the global conformation of a molecule (84,85). Though usefulness of such RMSD restraint becomes more limited in the case of very large structures, drug-like ligands are generally small molecules and their conformation can be accurately controlled without problem. To clarify the significance of the contribution from the RMSD restraining potential, we examine the PMF of the ligand as a function of the RMSD relative to its conformation when bound to the protein. Fig. 5 A shows the PMF as a function of the RMSD values calculated for ligand 8 in the binding site, in the bulk solution, and in vacuum using umbrella sampling simulations. The PMF for the binding site (black) has an absolute minimum at ∼0.4 Å, with a secondary minimum at ∼2 Å. The PMF shows that in the binding site, ligand 8 is stable around its average bound structure calculated from the equilibration trajectory. It also indicates that ligand 8 can adopt a different conformation, which is slightly less favorable by ∼1.3 kcal/mol, while it remains bound to FKBP12. The main (i) and secondary (ii) ligand configurations, illustrated in Fig. 5 B, differ mostly by the rotation of one aromatic ring that interacts weakly with the protein. The PMF in the bulk solution (green) shows a broad minimum at ∼2 Å. The most stable conformation of ligand 8 in the solvent (iii), illustrated in Fig. 5 B, differs mostly by the rotation of two of the aromatic rings. The PMF shows that any conformations differing from the bound state by 1.5 to 3.5 Å RMSD should be accessible through thermal fluctuations. Obviously, there is a free energy penalty to bring the conformation of the ligand, moving freely in the bulk solution, to the conformation it must adopt in the binding site. The free energy required to restrict the conformation of the ligand is determined numerically using Eq. 13. The value of 6.9 kcal/mol given in Table 2 can be easily understood from a direct comparison of the two PMFs in Fig. 5 A. The PMF of the ligand in the solvent (green) goes up to 7 kcal/mol at an RMSD corresponding to the minimum of the PMF in the binding site (black, ∼0.4 Å). In other words, ∼7 kcal/mol of conformational free energy is required to transform the ligand from its most probable conformation in the bulk (iii), to its most probable conformation in the binding site (i). The PMF in vacuum (red) has many similarities with the PMF in the bulk solution, though the minimum at 2.5 Å is not as broad.

FIGURE 5.

FIGURE 5

PMF calculations on the conformational restraints for ligand 8 in complex with FKBP12. (A) PMF curves for ligand 8 in the binding site (black), in the bulk solution (green), and in vacuum (red). The callouts i and ii label the minima of the PMF for the ligand in the binding site. The callouts iii and iv label the minima of the PMFs for the ligand in the bulk solution and in vacuum, respectively. (B) The average structures of the ligand around the minima are shown together with the average structure of the ligand in the equilibration (shown in yellow). The average structures of the ligand around the minima i, ii, iii, and iv are colored in black, gray, green, and red, respectively. The structures are aligned along the piperidine ring in ligand 8. (C) The normalized Boltzmann factors exp[−β(wc(ξ) + kcξ2)] with a force constant kc = 10 kcal/mol/Å2 for the ligand in the binding site (black), in the bulk solution (green), and in vacuum (red).

The optimal force constant kc to restrict the ligand around its average bound conformation was determined from the calculated PMF. Fig. 5 C shows the normalized (biased) Boltzmann distribution Inline graphic, Inline graphic, and Inline graphic as a function of the RMSD ζ for the ligand in the binding site (black), in the bulk solution (green), and in vacuum (red), respectively. With a force constant of 10 kcal/mol/Å2, the conformational distribution functions of the ligand in all the systems (binding site, bulk solution, and vacuum) have a high overlap at ∼0.5 Å RMSD. Such force constant insures that the ligand is kept near the reference conformation Lref during the calculations for Steps 2, 3, 4 and 7.

Using restraints of different strength

The present computational strategy attempts to enhance the configurational sampling of the molecular systems using biasing restraints. Optimal values for the calculations use a force constant kc for the conformational restriction of 10 kcal/mol/Å2, a distance force constant kt of 1 kcal/mol/Å2, and an angle/dihedral force constant ka of 200 kcal/mol/rad2. Fig. 6 D shows the progression of the free energies corresponding to the translational (circles) and rotational (diamonds) restraints applied to the ligand in the binding site. The resulting values are typically on the order of 1–2 kcal/mol, and the FEP/MD simulations converge without problems. However, a legitimate concern might be that the final results for the binding free energy remain tainted by the choice of force constants for the restraints.

To address this issue, the absolute binding free energy was recalculated for ligand 8 using different force constants kc, kt, and ka. The results are given in Table 4. It is observed that the final binding free energy is nearly independent of the biasing restraints, even when the force constants are varied by a large amount. All the calculations in Table 3 used the same set of force constants, with kc = 10 kcal/mol/Å2, kt = 1 kcal/mol/Å2, and ka = 200 kcal/mol/rad2. When the force constants kt and ka are varied, only the following free energy components change: the net free energy contribution associated with the translational and rotational freedom (ΔΔGt and ΔΔGr), and the nonbond interaction free energy contribution of the ligand in the binding site (Inline graphic). In the case of the conformational restraint, it may be noted that the accuracy appears to be somewhat compromised if the force constant is <∼1.0 kcal/mol/Å2. The main reason is that the uncoupled ligand in the binding site begins to adopt conformations different from the average bound conformation, which makes the calculation slower to converge. It is therefore advantageous to use restraints that are sufficiently strong to avoid this problem.

TABLE 4.

Binding free energy of the FKBP12/ligand 8 complex (started from the x-ray structure) at different force constants for the RMSD potentials, the translational restraint, and the rotational restraint

kc kt ka Inline graphic Inline graphic + ΔΔGc + ΔΔGt + ΔΔGr ΔG°bind
1 1 200 −38.6 16.0 2.5 3.4 5.1 −11.6
10 1 200 −43.3 17.2 6.9 3.4 5.4 −10.3
10 0.2 40 −42.1 17.2 6.9 2.4 4.4 −11.1
10 10 1000 −44.4 17.2 6.9 4.5 5.5 −10.2
10 100 4000 −44.8 17.2 6.9 5.0 5.8 −9.8
30 1 200 −44.5 17.2 7.6 3.5 5.4 −10.8
100 1 200 −47.5 19.3 8.5 3.5 5.6 −10.6

The value kc (in kcal/mol/Å2) is the force constant for the RMSD potentials. The value kt (in kcal/mol/Å2) is the distance force constant for the translational restraint. The value ka (in kcal/mol/rad2) is the angle/dihedral force constant for the translational and rotational restraints.

TABLE 3.

Binding free energy and solvation energy compared with other results

Inline graphic
ΔGsolv
Lig Struct This work Experiments Shirts (64) This work Shirts (64)
2 MD −12.3 ± 1.4 −7.8 ± 0.1 −5.42 −11.4 ± 1.1 −13.56
3 mod −8.7 ± 0.7 −8.4 ± 0.1 NA −9.7 ± 0.5 NA
3 MD −9.0 ± 1.1 −8.4 ± 0.1 −8.22 −13.8 ± 0.7 −16.98
5 mod −10.2 ± 1.1 −9.5 ± 0.1 NA −12.9 ± 0.1 NA
5 MD −11.6 ± 2.1 −9.5 ± 0.1 −6.92 −12.7 ± 0.5 −15.02
6 MD −9.7 ± 2.8 −10.8 ± 0.3 −8.29 −18.2 ± 1.7 −17.36
8 X ray −10.3 ± 0.4 −10.9 ± 0.1 NA −17.3 ± 0.9 NA
8 MD −10.3 ± 1.2 −10.9 ± 0.1 −10.42 −15.4 ± 1.9 −15.21
9 X ray −11.7 ± 1.0 −11.1 ± 0.2 NA −12.6 ± 0.8 NA
9 MD −11.6 ± 0.1 −11.1 ± 0.2 −9.11 −11.8 ± 0.8 −13.95
12 MD −13.7 ± 2.8 −10.3 ± 0.2 −7.51 −22.1 ± 2.9 −23.49
20 X ray −10.1 ± 1.2 −12.7 ± 0.2 NA −35.1 ± 0.7 NA
20 MD −10.8 ± 3.0 −12.7 ± 0.2 −13.34 −36.4 ± 1.4 −35.15

The calculated binding free energies are compared with the experimental results (60) and the previous calculations by Shirts (64). The force constants used are: kc = 10 kcal/mol/Å2, kt = 1 kcal/mol/Å2, and ka = 200 kcal/mol/rad2. The error bars in the calculations are the standard deviations of three or more independent runs. The solvation free energies by M. R. Shirts and co-workers are from personal communication (2005) . NA means data are not available.

Standard binding free energies

In Table 3, the results of the calculations for the eight ligands are compared with the experimental values (60), and the results from extensive FEP/MD calculations by Pande, Shirts and co-workers (64). The results are in good agreement with the experimental results, especially in the case of ligands for which it is possible to have a good starting structure of the complex either from x-ray crystallography (ligands 8, 9, and 20) or by simple direct modeling (ligands 3 and 5). The differences are within 1 kcal/mol for most ligands, which is roughly the order of magnitude of the statistical errors of calculations. The calculated binding free energies for a given ligand obtained from different starting structures, e.g., ligand-8 x-ray and MD (64), are very similar, though the errors appear to be smaller for crystal structures in general. The solvation free energy of the ligands offers one additional (simpler) quantity to directly assess the accuracy of the present computations by comparing with the previous results of V. S. Pande and co-workers (personal communication, 2005). As shown in Table 3, the solvation free energies calculated according to Eq. 12 are very consistent with those results, except for ligands 3 and 5. Overall, we conclude that the present strategy, based on FEP/MD simulations of a reduced GSBP atomic model sampled with conformational, translational, and orientational restraining potentials, is accurate and efficient.

Nonbond contribution

Table 1 shows the nonbond free energy components (dispersive, repulsive, and electrostatic free energy) for all ligands in the binding site and in the bulk solution. The nonbond free energy of the ligand in the binding site Inline graphic is typically ∼20 kcal/mol more negative than that of the ligand in the bulk solution Inline graphic. Generally, Inline graphic and Inline graphic increase with the size of the ligand (in the order 2, 3, 5, 6, 8, 9, 12, and 20). As expected for nonpolar ligands, there is only a minor contribution from the electrostatic free energy. The net contribution from the repulsive free energy appears to be generally small for most ligands. The reason for this may be that the binding pocket is at the surface of FKBP12 and that the bound ligand remains solvent-exposed. The trend was somewhat different for the binding of aromatic ligands to a mutant of T4 lysozyme engineered to have a buried nonpolar cavity. In that case, the calculated contribution from the repulsive interaction was more important (52). For the FK506-related ligands, the dispersive interactions make the dominant contribution to the binding free energy (∼20–30 kcal/mol).

The importance of dispersive interactions in driving the binding process can be understood from the relative density of the bulk relative to the protein. There is a larger number of atomic (nonhydrogen) van der Waals interaction centers per unit volume in the protein compared with liquid water. The transfer of a nonpolar ligand from the bulk to the binding site almost invariably yields a change of van der Waals dispersive interactions that favors association. Obviously, the favorable van der Waals interactions increase with the size of the ligand in general. These observations suggest that, to describe ligand binding with quantitative accuracy, one needs to account for the variations in the repulsive and attractive free energy contributions in the different environment of the bulk or the binding site. A similar observation has been made by Levy et al. (86).

Loss of conformational, translational, and orientational freedom

The net contribution from the translational free energy ΔΔGt is ∼3 kcal/mol, though it goes up to 4.5 kcal/mol in the case of ligand 8 with a stronger restraint (Table 4). Since ΔΔGt° ≈ −kBT ln(ΔVC°), this would indicate that the center-of-mass of the bound ligand is fluctuating inside a microscopic volume ΔV on the order of ∼1 Å3. For example, ΔΔGt is estimated to be 4.6 kcal/mol using the fluctuations of the bound ligand in unbiased simulations in Eq. 10 in the case of ligand 8. Similarly, the loss of orientational freedom gives rise to a significantly unfavorable contribution. The net contribution from the rotational ΔΔGr is ∼5 kcal/mol, which corresponds to a significant reduction of orientational freedom (accessible solid angle of rotation) ΔΩ from 8π2. This was also previously noted by Swanson and McCammon (70). In the case of ligand 8 again, ΔΔGr is estimated to be 5.0 kcal/mol using the fluctuations of the bound ligand in unbiased simulations in Eq. 11. Perhaps more surprising is the magnitude of the net contribution corresponding to the conformational restriction of the ligand upon binding, ΔΔGc, which varies from 2 up to 6–7 kcal/mol depending on the ligand (Table 2). This is a very significant fraction of the standard binding free energy for those ligands.

The magnitude of the free energy contributions caused by the loss of conformational, translational, and orientational freedom has consequences for empirical implicit solvent scoring schemes attempting to estimate the ligand-protein binding free energy on the basis of fixed ligand-protein configurations. For example, MM/PBSA consists in averaging the results of PB calculations based on an ensemble of fixed snapshots generated by MD simulations with explicit solvent (66). What can be effectively generated by this procedure is an approximation to ΔΔGint, the ligand-protein interaction free energy. This contribution most obviously favors binding. However, the remaining contributions, which are necessarily unfavorable to binding, must also be taken into account. To some extent, this may be achieved with the use of approximate expressions. For example, reliable estimates of the free energy contributions arising from the loss of translational and orientational degrees of freedom of the ligand may be obtained on the basis of fluctuation averages using Eqs. 10 and 11. Upon inspection, it seems likely that such approximation may be valid only for a fairly rigid ligand, or if a strong conformational restraining potential is applied to a flexible ligand. However, there is no straightforward estimator of ΔΔGc. In the case of ligand 8, for example, the free energy required to conform the flexible ligand in the bulk to its bound state in FKBP12 (∼7 kcal/mol) is not correlated with the potential energy of the ligand molecule corresponding to the bound (83.27 kcal/mol) and free (84.73 kcal/mol) conformations (both relaxed with energy minimization). Designing an approximation to ΔΔGc is very challenging because it is essentially dominated by a loss of conformational freedom of the ligand and not by a change of internal potential energy.

Accuracy and sensitivity to starting structure

Three sets of starting structures were used for the calculations. A first set of structures are x-ray crystallographic structures of FKBP12 with ligands 8, 9, and 20. A second set of structures are FKBP12 in complex with ligands 3 and 5, which were constructed from the crystal structure of ligand 9 (see Methods). A third set of structures are FKBP12 with all eight ligands provided by V. S. Pande and co-workers (personal communication, 2005); these docking models were equilibrated using MD for ∼200 ps with explicit solvent. For each ligand, the complete FEP/MD calculation was repeated at least three times (some up to five) starting from the beginning of the equilibration. The statistical precision of the calculated free energy is on the order of ∼1.0 kcal/mol, though the accuracy of the calculated value differs depending on the starting structure. The results summarized in Table 3 highlight the importance of the starting structure. For example, the results for ligand 8 are different if initiated from the crystallographic x-ray structure or from the MD equilibrated system of Pande, Shirts and co-workers (63,64). In contrast, ligand 5 appears to be particularly problematic. The ligand-protein complex was somewhat unstable during the equilibration, giving rise to more significant variations in the resulting binding free energy.

The present strategy based on biased sampling with restraining potentials does not require very long simulations as long as the starting structure is accurate and can remain stable in this neighborhood (within ∼1 Å). Bad or unstable initial structures start to drift during equilibration, which makes it difficult to design an optimal set of restraining potentials aimed at helping the sampling. In the familiar language of computational drug design, one might say that the present strategy is well suited to assign a value to a given ligand configuration (defined within ∼1 Å), i.e., for “scoring” a given ligand “pose”. The task of searching and finding the ligand pose, i.e., “docking”, is quite different and normally precedes scoring. Although docking by MD can be done, this is clearly not the most efficient approach to accomplish this task. Extensive FEP/MD calculations with very long trajectories amount essentially to accomplishing both the docking and the scoring tasks simultaneously. Ligand docking by extensive MD is certainly feasible, though accurate docking models may be generated at a lesser computational cost with heuristic methods. The concept of scoring a particular ligand pose from FEP/MD makes sense only if rapid interconversion to neighboring poses (assuming they exist) is prevented by energy barriers of several kBT (i.e., they are essential different binding sites). If such condition is not met, then the true binding free energy must be obtained via a sampling over all the substate with proper weighting. While the present step-by-step formulation could be used advantageously with extensively long FEP/MD simulations, we believe that it is most useful when used in conjunction with accurate starting configurations, obtained either from x-ray crystallography or high-quality docking models.

CONCLUSION

The standard binding free energy of eight FK506-related ligands of FKBP12 was calculated using a FEP/MD simulation protocol incorporating the effect of full atomic flexibility and thermal fluctuations. The results are generally in good accord with experiments and the extensive calculations of Pande and co-workers (63) and Shirts (64). When the starting structure is taken from x-ray crystallography, the calculated binding free energy is usually in good agreement with the experimental results (within 2 kcal/mol) and the statistical error is small (∼1 kcal/mol). Given that the program and method used are significantly different from those used by Pande and co-workers (63), we consider the agreement satisfactory. On the other hand, if the initial complex structure is a modeled structure, the calculated binding free energy may be less precise. This highlights the importance of accurate docking to enable a successful rapid evaluation of the binding free energy with FEP/MD.

A number of features were implemented to enhance the accuracy of the results while reducing the overall computational cost of the calculations. First, the reversible work for the entire association/dissociation process was decomposed into eight sequential steps during which the ligand-surrounding interactions as well as various biasing potentials restraining the ligand are turned “on” and “off”. The usage of those biasing potentials decreases the size of the configurational space that needs to be explored, therefore reducing the sampling workload of the FEP/MD simulations. The advantage is noise reduction, which leads to a more rapid convergence of the computations. Second, a RMSD umbrella sampling PMF calculations was used to accurately sample the conformation of the ligand in the binding and in the bulk. Effectively, the RMSD restraining potential enables us to transform a flexible ligand into a relatively rigid molecule, which simplifies all the subsequent steps considerably. One may note that a similar RMSD potential could be easily extended to restrain a flexible protein receptor as well. Third, the convergence of the calculation of the nonbond interaction free energy contribution was improved by separating the LJ potential into a repulsive and dispersive free energy using WCA. This separation also allows a clear identification of the molecular components contributing favorably to the binding. Fourth, only a small number of atoms in the vicinity of the binding site were explicitly simulated with GSBP while the influence of the remaining atoms was incorporated implicitly. This allowed a significant reduction in the size of the simulated systems, from ∼25,000 to 2500 atoms.

Throughout the discussion, a number of observations were made concerning the physical meaning of various free energy contributions upon ligand binding, such as the loss of translational, orientational, and conformational freedom, as well as the changes in repulsive, dispersive, and electrostatic nonbonded interactions. Although the mathematical definition of all these contributions is unambiguous, it is important to recall that their precise numerical values depend on the choices of applied restraint potentials. Similarly, the free energy components arising from the separation of the nonbonded interactions (WCA-LJ and partial charge electrostatics) are well-defined, but nonetheless reflect a series of choices about hypothetical intermediate states. Ultimately, only the total absolute free energy is invariant and independent of all those choices. In trying to attribute a physical meaning to the various contributions, it is therefore important to keep these limitations in mind. Only a careful and judicious analysis of the various components can yield physically meaningful insight. The best is to utilize the free energy decomposition to highlight qualitative concepts, such as, for example, the general importance of the loss of translational and orientation freedom on the total binding free energy, the magnitude of the free energy cost to restrict the ligand into its bound conformation, and the relative change in the nonbonded interactions upon binding. For example, an important qualitative observation from the series of computations is that the dispersive interaction is an important driving force for complexation. In other words, isosteric nonpolar analogs of these ligands (without any partial charges) would still bind (assuming that their internal conformation would be unchanged). This observation arises naturally from the free energy decomposition used here to separate the various contributions to the nonbonded interactions (repulsion, dispersion, and electrostatics).

The step-by-step computational FEP/MD simulation protocol with restraining potential breaks down the complete calculation into many short MD simulations that are easy to distribute over independent compute nodes. This makes it possible to use trivially distributed computing resources for direct calculations of standard binding free energy. For each ligand system, ∼250 MD trajectories systems were generated, each taking ∼2–4 h using a single 1-GHz Pentium CPU. Thus, the total computational time to obtain the standard binding free energy of one ligand can be as little as 2–4 h, if 250 CPUs are available simultaneously. The various biasing restraints help reduce the noise and improve the convergence of the final results. At this point, it seems clear the ultimate limitations of calculated standard binding free energies is not the statistical precision that can be achieved. Whenever necessary, a higher precision could be obtained at reasonable computational cost simply by increasing the length and the number of simulations. Furthermore, it should be noted that while simulating a reduced GSBP atomic model may often be advantageous, such solvent boundary potentials remain approximate. If a better representation of the entire molecular system is warranted, the same step-by-step binding free energy formulation with restraint potentials could be used in extensive all-atom FEP/MD simulations with periodic boundary conditions. One area where there remains ample room for improvement concerns the atomic force fields. Without being overly pessimistic, it seems likely that no miraculous accuracy should be expected from simple nonpolarizable fixed-charges potential function (52). Quantitative accuracy (i.e., >0.5 kcal/mol) might only be possible by including the influence of induced polarization explicitly. With these advances in FEP/MD methodologies, it is clear that improved force fields could enable rapid predictive binding estimates of quantitative accuracy at a very reasonable computational cost for rational drug design.

Acknowledgments

Michael R. Shirts and Vijay S. Pande provided the AMBER force field and the modeled docking structures with equilibration MD. We are grateful to Jessica M. J. Swanson and J. Andrew McCammon for helpful comments.

J.W. was supported by a postdoctoral fellowship from the Charles H. Revson Foundation. This work was funded by grant No. 0415784 from the National Science Foundation.

APPENDIX A

In the expressions in Table A1, U1 is the total potential energy of the system, and U0 is the total potential energy of an intermediate state in which the ligand does not interact with its surrounding (protein and/or solvent). The conformational, translational, and orientational biasing potentials are uc, ut, and ur respectively. Because the ligand molecule does not interact with its surrounding with potential U0, the orientational factor Fr in Step 5 is given by

graphic file with name M72.gif (16)

where ΩL are the three angles defining rigid body rotation of the ligand molecule (relative to the receptor); the volume element implicitly includes a Jacobian for the rotation angles. Similarly, the translational factor Ft in Step 6 is given by

graphic file with name M73.gif (17)

(all other terms cancel out). The translational factor Ft, hence the binding constant Kb, has the dimension of volume (with a natural unit of Å3 in atomistic simulations). The standard equilibrium binding constant is Kb° and the standard binding free energy is ΔG°bind. C° is the standard state concentration (equal to 1 M or 1/1661 Å−3).

TABLE A1.

Decomposition of the binding process in eight steps

Step Process Configurational integral Contribution
1 Inline graphic Inline graphic Inline graphic
2 Inline graphic Inline graphic Inline graphic
3 Inline graphic Inline graphic Inline graphic
4 Inline graphic Inline graphic Inline graphic
5 Inline graphic Inline graphic Fr
6 Inline graphic Inline graphic Ft
7 Inline graphic Inline graphic Inline graphic
8 Inline graphic Inline graphic Inline graphic
Total Inline graphic

APPENDIX B

As used in Table B1, the units of energy and distance used in GROMACS are kJ/mol and nm, while they are kcal/mol and Å in CHARMM. The subscripts c and g (e.g., as in kc and kg) mean CHARMM and GROMACS, respectively. F(= 1/4.184kcal/kJ) is a conversion factor. In GROMACS format, Ryckaert-Bellemans (RB) dihedral energy can be expressed using the parameters kn and φn (n = 0–5), which are converted from the parameters Cn as

graphic file with name M97.gif (18)

where φn = 0 or 180°.

TABLE B1.

Conversion of AMBER force field from GROMACS to CHARMM format

Conversion
GROMACS Energy CHARMM Energy para1 para2
Bond: Inline graphic k(bb0)2 kc = (0.5/100)Fkg bc = 10 bg
Angle: Inline graphic k(θθ0)2 kc = 0.5 Fkg θc = θg
NB: Inline graphic Inline graphic εc = g Inline graphic
Dihe: k + kcos(φ0) k + kcos(φ0) kc = Fkg φc = φg
RB Dihe: Inline graphic Inline graphic kn, c = Fkn, g φn, c = φn, g

References

  • 1.Vindigni, A. 1999. Energetic dissection of specificity in serine proteases. Comb. Chem. High Throughput Screen. 2:139–153. [PubMed] [Google Scholar]
  • 2.Cheng, A. C., V. Calabro, and A. D. Frankel. 2001. Design of RNA-binding proteins and ligands. Curr. Opin. Struct. Biol. 11:478–484. [DOI] [PubMed] [Google Scholar]
  • 3.Garvie, C. W., and C. Wolberger. 2001. Recognition of specific DNA sequences. Mol. Cell. 8:937–946. [DOI] [PubMed] [Google Scholar]
  • 4.Schneider, G., and H. J. Bohm. 2002. Virtual screening and fast automated docking methods. Drug Discov. Today. 7:64–70. [DOI] [PubMed] [Google Scholar]
  • 5.Shoichet, B. K., A. R. Leach, and I. D. Kuntz. 1999. Ligand solvation in molecular docking. Proteins. 34:4–16. [DOI] [PubMed] [Google Scholar]
  • 6.Lu, N. D., and D. A. Kofke. 2001. Accuracy of free energy perturbation calculations in molecular simulation. I. Modeling. J. Chem. Phys. 114:7303–7311. [Google Scholar]
  • 7.Lu, N. D., and D. A. Kofke. 2001. Accuracy of free energy perturbation calculations in molecular simulation. II. Heuristics. J. Chem. Phys. 115:6866–6875. [Google Scholar]
  • 8.Florian, J., M. F. Goodman, and A. Warshel. 2000. Free energy perturbation calculations of DNA destabilization by base substitutions: the effect of neutral guanine thymine, adenine cytosine and adenine difluorotoluene mismatches. J. Phys. Chem. B. 104:10092–10099. [Google Scholar]
  • 9.Brandsdal, B. O., and A. O. Smalas. 2000. Evaluation of protein-protein association energies by free energy perturbation calculations. Protein Eng. 13:239–245. [DOI] [PubMed] [Google Scholar]
  • 10.Straatsma, T. P., and J. A. McCammon. 1992. Computational alchemy. Annu. Rev. Phys. Chem. 43:407–435. [Google Scholar]
  • 11.Beveridge, D. L., and F. M. Dicapua. 1989. Free energy via molecular simulation—applications to chemical and biomolecular systems. Annu. Rev. Biophys. Biol. 18:431–492. [DOI] [PubMed] [Google Scholar]
  • 12.Price, D. J., and W. L. Jorgensen. 2000. Computational binding studies of human pp60c-src SH2 domain with a series of nonpeptide, phosphophenyl-containing ligands. Bioorg. Med. Chem. Lett. 10:2067–2070. [DOI] [PubMed] [Google Scholar]
  • 13.Wesolowski, S. S., and W. L. Jorgensen. 2002. Estimation of binding affinities for celecoxib analogues with COX-2 via Monte Carlo-extended linear response. Bioorg. Med. Chem. Lett. 12:267–270. [DOI] [PubMed] [Google Scholar]
  • 14.Carlson, H. A., K. M. Masukawa, K. Rubins, F. D. Bushman, W. L. Jorgensen, R. D. Lins, J. M. Briggs, and J. A. McCammon. 2000. Developing a dynamic pharmacophore model for HIV-1 integrase. J. Med. Chem. 43:2100–2114. [DOI] [PubMed] [Google Scholar]
  • 15.Kollman, P. 1993. Free energy calculations: applications to chemical and biochemical phenomena. Chem. Rev. 93:2395–2417. [Google Scholar]
  • 16.Ajay, A., and M. A. Murcko. 1995. Computational methods to predict binding free energy in ligand-receptor complexes. J. Med. Chem. 38:4953–4967. [DOI] [PubMed] [Google Scholar]
  • 17.Gilson, M. K., J. A. Given, B. L. Bush, and J. A. McCammon. 1997. The statistical-thermodynamic basis for computation of binding affinities: a critical review. Biophys. J. 72:1047–1069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.McCammon, J. A. 1998. Theory of biomolecular recognition. Curr. Opin. Struct. Biol. 8:245–249. [DOI] [PubMed] [Google Scholar]
  • 19.Simonson, T., G. Archontis, and M. Karplus. 2002. Free energy simulations come of age: protein-ligand recognition. Acc. Chem. Res. 35:430–437. [DOI] [PubMed] [Google Scholar]
  • 20.Lazaridis, T., A. Masunov, and F. Gandolfo. 2002. Contributions to the binding free energy of ligands to avidin and streptavidin. Proteins. 47:194–208. [DOI] [PubMed] [Google Scholar]
  • 21.Boresch, S., F. Tettinger, M. Leitgeb, and M. Karplus. 2003. Absolute binding free energies: a quantitative approach for their calculation. J. Phys. Chem. B. 107:9535–9551. [Google Scholar]
  • 22.Woo, H. J., and B. Roux. 2005. Chemical theory and computation special feature: calculation of absolute protein-ligand binding free energy from computer simulations. Proc. Natl. Acad. Sci. USA. 102:6825–6830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sagui, C., and T. A. Darden. 1999. Molecular dynamics simulations of biomolecules: long-range electrostatic effects. Annu. Rev. Biophys. Biomol. Struct. 28:155–179. [DOI] [PubMed] [Google Scholar]
  • 24.Roux, B., and T. Simonson. 1999. Implicit solvent models. Biophys. Chem. 78:1–20. [DOI] [PubMed] [Google Scholar]
  • 25.Brooks III, C. L., M. Karplus, and B. M. Pettitt. 1988. Proteins. A theoretical perspective of dynamics, structure and thermodynamics. In Advances in Chemical Physics, Vol. LXXI. I. Prigogine and S. A. Rice, editors. John Wiley & Sons, New York.
  • 26.Honig, B., and A. Nicholls. 1995. Classical electrostatics in biology and chemistry. Science. 268:1144–1149. [DOI] [PubMed] [Google Scholar]
  • 27.Edinger, S. R., C. Cortis, P. S. Shenkin, and R. A. Friesner. 1997. Solvation free energies of peptides: comparison of approximate continuum solvation models with accurate solution of the Poisson-Boltzmann equation. J. Phys. Chem. B. 101:1190–1197. [Google Scholar]
  • 28.Jayaram, B., D. Sprous, and D. L. Beveridge. 1998. Solvation free energy of biomacromolecules: parameters for a modified generalized Born model consistent with the AMBER force field. J. Phys. Chem. B. 102:9571–9576. [Google Scholar]
  • 29.Jayaram, B., Y. Liu, and D. L. Beveridge. 1998. A modification of the generalized Born theory for improved estimates of solvation energies and pK shifts. J. Chem. Phys. 109:1465–1471. [Google Scholar]
  • 30.Ghosh, A., C. S. Rapp, and R. A. Friesner. 1998. Generalized Born model based on a surface integral formulation. J. Phys. Chem. B. 102:10983–10990. [Google Scholar]
  • 31.Dominy, B. N., and C. L. Brooks III. 1999. Development of a generalized Born model parametrization for proteins and nucleic acids. J. Phys. Chem. B. 103:3765–3773. [Google Scholar]
  • 32.Eisenberg, D., and A. McClachlan. 1986. Solvation energy in protein folding and binding. Nature. 319:199–203. [DOI] [PubMed] [Google Scholar]
  • 33.Wesson, L., and D. Eisenberg. 1992. Atomic solvation parameters applied to molecular dynamics of proteins in solution. Protein Sci. 1:227–235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Fraternali, F., and W. F. V. Gunsteren. 1996. An efficient mean solvation force model for use in molecular dynamics simulations of proteins in aqueous solution. J. Mol. Biol. 256:939–948. [DOI] [PubMed] [Google Scholar]
  • 35.Scheraga, H. A. 1979. Interactions in aqueous solution. Acc. Chem. Res. 12:7–14. [Google Scholar]
  • 36.Kang, Y. K., K. D. Gibson, G. Némethy, and H. Scheraga. 1988. Free energies of hydration of solute molecules. 4. Revised treatment of HE hydration shell model. J. Phys. Chem. 92:4739–4742. [Google Scholar]
  • 37.Colonna-Cesari, F., and C. Sander. 1990. Excluded volume approximation to protein-solvent interaction. Biophys. J. 57:1103–1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Stouten, P., C. Frömmel, H. Nakamura, and C. Sander. 1993. An effective solvation term based on atomic occupancies for use in protein simulations. Mol. Sim. 10:97–120. [Google Scholar]
  • 39.Lazaridis, T., and M. Karplus. 1997. New view of protein folding reconciled with the old through multiple unfolding simulations. Science. 278:1928–1931. [DOI] [PubMed] [Google Scholar]
  • 40.Cummings, M. D., T. N. Hart, and R. J. Read. 1995. Atomic solvation parameters in the analysis of protein-protein docking results. Protein Sci. 4:2087–2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Beglov, D., and B. Roux. 1994. Finite representation of an infinite bulk system: solvent boundary potential for computer simulations. J. Chem. Phys. 100:9050–9063. [Google Scholar]
  • 42.Roux, B., D. Beglov, and W. Im. 1999. Generalized solvent boundary potentials for computer simulations. In Proceedings of the Santa Fe Workshop on Treatment of Electrostatic Interactions in Computer Simulations of Condensed Media, 23–25 June 1999, AIP Conference Proceedings, Vol. 492. L. R. Pratt and G. Hummer, editors. AIP, Melville, NY. 473–491.
  • 43.Roux, B., D. Beglov, and W. Im. 1999. Generalized boundary potentials for computer simulations. In Simulation and Theory of Electrostatic Interactions in Solutions. L. R. Pratt and G. Hummer, editors. American Institute of Physics Conference Proceedings 492. 473–494. New York.
  • 44.Berkowitz, M., and J. A. McCammon. 1982. Molecular dynamics with stochastic boundary conditions. Chem. Phys. Lett. 90:215–217. [Google Scholar]
  • 45.Brunger, A., C. L. Brooks III, and M. Karplus. 1984. Stochastic boundary conditions for molecular dynamics simulations of ST2 water. Chem. Phys. Lett. 105:495–500. [Google Scholar]
  • 46.Brooks III, C. L., and M. Karplus. 1983. Deformable stochastic boundaries in molecular dynamics. J. Chem. Phys. 79:6312–6325. [Google Scholar]
  • 47.Essex, J. W., and W. L. Jorgensen. 1995. An empirical boundary potential for water droplet simulations. J. Comput. Chem. 16:951–972. [Google Scholar]
  • 48.Warshel, A., and G. King. 1985. Polarization constraints in molecular dynamics simulation of aqueous solutions: the surface constraint all-atom solvent (SCAAS) model. Chem. Phys. Lett. 121:124–129. [Google Scholar]
  • 49.King, G., and A. Warshel. 1989. A surface constrained all-atom solvent model for effective simulations of polar solutions. J. Chem. Phys. 91:3647–3661. [Google Scholar]
  • 50.Rullmann, J. A. C., and P. T. van Duijnen. 1987. Analysis of discrete and continuum dielectric models; application to the calculation of protonation energies in solution. Mol. Phys. 61:293–311. [Google Scholar]
  • 51.Deng, Y., and B. Roux. 2004. Hydration of amino acid side chains: nonpolar and electrostatic contributions calculated from staged molecular dynamics free energy simulations with explicit water molecules. J. Phys. Chem. B. 108:16567–16576. [Google Scholar]
  • 52.Deng, Y., and B. Roux. 2006. Nonpolar binding free energy: aromatic molecules bind T4 lysozyme mutant. J. Chem. Theory Comput. In press. [DOI] [PubMed]
  • 53.Roux, B., M. Nina, R. Pomes, and J. C. Smith. 1996. Thermodynamic stability of water molecules in the bacteriorhodopsin proton channel: a molecular dynamics free energy perturbation study. Biophys. J. 71:670–681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hermans, J., and L. Wang. 1997. Inclusion of loss of translational and rotational freedom in theoretical estimates of free energies of binding. application to a complex of benzene and mutant t4 lysozyme. J. Am. Chem. Soc. 119:2707–2714. [Google Scholar]
  • 55.Harrison, R. K., and R. L. Stein. 1992. Mechanistic studies of enzymic and nonenzymic prolyl cis-trans isomerization. J. Am. Chem. Soc. 114:3464–3471. [Google Scholar]
  • 56.Schreiber, S. L. 1991. Chemistry and biology of the immunophilins and their immunosuppressive ligands. Science. 251:283–287. [DOI] [PubMed] [Google Scholar]
  • 57.Kissinger, C. R., H. E. Parge, D. R. Knighton, C. T. Lewis, L. A. Pelletier, A. Tempczyk, V. J. Kalish, K. D. Tucker, R. E. Showalter, E. W. Moomaw, L. N. Gastinel, N. Habuka, X. H. Chen, F. Maldonado, J. E. Barker, R. Bacquet, and J. E. Villafranca. 1995. Crystal-structures of human calcineurin and the human FKBP12-FK506-calcineurin complex. Nature. 378:641–644. [DOI] [PubMed] [Google Scholar]
  • 58.Griffith, J. P., J. L. Kim, E. E. Kim, M. D. Sintchak, J. A. Thomson, M. J. Fitzgibbon, M. A. Fleming, P. R. Caron, K. Hsiao, and M. A. Navia. 1995. X-ray structure of calcineurin inhibited by the immunophilin immunosuppressant FKBP12-FK506 complex. Cell. 82:507–522. [DOI] [PubMed] [Google Scholar]
  • 59.Van Duyne, G. D., R. F. Standaert, P. A. Karplus, S. L. Schreiber, and J. Clardy. 1991. Atomic structure of FKBP-FK506, an immunophilin-immunosuppressant complex. Science. 252:839–842. [DOI] [PubMed] [Google Scholar]
  • 60.Holt, D. A., J. I. Luengo, D. S. Yamashita, H. Oh, A. L. Konialian, H. Yen, L. W. Rozamus, M. Brandt, M. J. Bossard, M. A. Levy, D. S. Eggleston, J. Liang, L. W. Schultz, T. J. Stout, and J. Clardy. 1993. Design, synthesis, and kinetic evaluation of high-affinity FKBP ligands and the x-ray crystal structures of their complexes with FKBP12. J. Am. Chem. Soc. 115:9925–9938. [Google Scholar]
  • 61.Wilson, K. P., M. M. Yamashita, M. D. Sintchak, S. H. Rotstein, M. A. Murcko, J. Boger, J. A. Thomson, M. J. Fitzgibbon, J. R. Black, and M. A. Navia. 1995. Comparative x-ray structures of the major binding protein for the immunosuppressant FK506 (tacrolimus) in unliganded form and in complex with FK506 and rapamycin. Acta Crystallogr. D Biol. Crystallogr. 51:511–521. [DOI] [PubMed] [Google Scholar]
  • 62.Lamb, M. L., J. Tirado-Rives, and W. L. Jorgensen. 1999. Estimation of the binding affinities of FKBP12 inhibitors using a linear response method. Bioorg. Med. Chem. 7:851–860. [DOI] [PubMed] [Google Scholar]
  • 63.Fujitani, H., Y. Tanida, M. Ito, G. Jayachandran, C. D. Snow, M. R. Shirts, E. J. Sorin, and V. S. Pande. 2005. Direct calculation of the binding free energies of FKBP ligands. J. Chem. Phys. 123:084108. [DOI] [PubMed] [Google Scholar]
  • 64.Shirts, M. R. 2005. Calculating precise and accurate free energies in biomolecular systems. Ph.D. Thesis. Stanford University, Stanford, CA.
  • 65.Lee, M., and M. A. Olson. 2006. Calculation of absolute protein-ligand binding affinity using path and endpoint approaches. Biophys. J. 90:864–877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Massova, I., and P. A. Kollman. 2000. Combined molecular mechanical and continuum solvent approach (MM-PBSA/GBSA) to predict ligand binding. Perspect. Drug Discov. 18:113–135. [Google Scholar]
  • 67.Hermans, J., and S. Subramaniam. 1986. The free energy of xenon binding to myoglobin from molecular dynamics simulation. Isr. J. Chem. 27:225–227. [Google Scholar]
  • 68.Kollman, P. A., I. Massova, C. Reyes, B. Kuhn, S. Huo, L. Chong, M. Lee, T. Lee, Y. Duan, W. Wang, O. Donini, P. Cieplak, J. Srinivasan, D. A. Case, and T. E. Cheatham. 2000. Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. Acc. Chem. Res. 33:889–897. [DOI] [PubMed] [Google Scholar]
  • 69.Luo, H., and K. Sharp. 2002. On the calculation of absolute macromolecular binding free energies. Proc. Natl. Acad. Sci. USA. 99:10399–10404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Swanson, J. M., R. H. Henchman, and J. A. McCammon. 2004. Revisiting free energy calculations: a theoretical connection to MM/PBSA and direct calculation of the association free energy. Biophys. J. 86:67–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Wang, J. M., R. M. Wolf, J. W. Caldwell, P. A. Kollman, and D. A. Case. 2004. Development and testing of a general AMBER force field. J. Comput. Chem. 25:1157–1174. [DOI] [PubMed] [Google Scholar]
  • 72.Jakalian, A., D. B. Jack, and C. I. Bayly. 2002. Fast, efficient generation of high-quality atomic charges. AM1-BCC model. II. Parameterization and validation. J. Comput. Chem. 23:1623–1641. [DOI] [PubMed] [Google Scholar]
  • 73.Im, W., S. Berneche, and B. Roux. 2001. Generalized solvent boundary potential for computer simulations. J. Chem. Phys. 114:2924–2937. [Google Scholar]
  • 74.Banavali, N. K., W. Im, and B. Roux. 2002. Electrostatic free energy calculations using the generalized boundary potential. J. Chem. Phys. 117:7381–7388. [Google Scholar]
  • 75.Brooks, B. R., R. E. Bruccoleri, B. D. Olafson, D. J. States, S. Swaminathan, and M. Karplus. 1983. CHARMM: a program for macromolecular energy minimization and dynamics calculations. J. Comput. Chem. 4:187–217. [Google Scholar]
  • 76.Woo, H. J., A. R. Dinner, and B. Roux. 2004. Grand canonical Monte Carlo simulations of water in protein environments. J. Chem. Phys. 121:6392–6400. [DOI] [PubMed] [Google Scholar]
  • 77.Nina, M., D. Beglov, and B. Roux. 1997. Atomic radii for continuum electrostatics calculations based on molecular dynamics free energy simulations. J. Phys. Chem. B. 101:5239–5248. [Google Scholar]
  • 78.Im, W., D. Beglov, and B. Roux. 1998. Continuum solvation model: Electrostatic forces from numerical solutions to the Poisson-Boltzmann equation. Comput. Phys. Comm. 111:59–75. [Google Scholar]
  • 79.Torrie, G. M., and J. P. Valleau. 1977. Nonphysical sampling distributions in Monte Carlo free energy estimation: umbrella sampling. J. Comput. Phys. 23:187–199. [Google Scholar]
  • 80.Ferrenberg, A. M., and R. H. Swendsen. 1989. Optimized Monte Carlo data analysis. Phys. Rev. Lett. 63:1195–1198. [DOI] [PubMed] [Google Scholar]
  • 81.Kumar, S., D. Bouzida, R. H. Swendsen, P. A. Kollman, and J. M. Rosenberg. 1992. The weighted histogram analysis method for free energy calculations on biomolecules. I. The method. J. Comput. Chem. 13:1011–1021. [Google Scholar]
  • 82.Roux, B. 1995. The calculation of the potential of mean force using computer simulations. Comput. Phys. Comm. 91:275–282. [Google Scholar]
  • 83.Weeks, J. D., D. Chandler, and H. C. Andersen. 1971. Role of repulsive forces in determining equilibrium structure of simple liquids. J. Chem. Phys. 54:5237–5247. [Google Scholar]
  • 84.Banavali, N. K., and B. Roux. 2005. Free energy landscape of A-DNA to B-DNA conversion in aqueous solution. J. Am. Chem. Soc. 127:6866–6876. [DOI] [PubMed] [Google Scholar]
  • 85.Banavali, N. K., and B. Roux. 2005. The N-terminal end of the catalytic domain of SRC kinase Hck is a conformational switch implicated in long-range allosteric regulation. Structure. 11:1715–1723. [DOI] [PubMed] [Google Scholar]
  • 86.Levy, R. M., L. Y. Zhang, E. Gallicchio, and A. K. Felts. 2003. On the nonpolar hydration free energy of proteins: surface area and continuum solvent models for the solute-solvent interaction energy. J. Am. Chem. Soc. 125:9523–9530. [DOI] [PubMed] [Google Scholar]

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES