Abstract
An important unsolved problem in materials science is prediction of the thermodynamic stability of organic crystals and their solubility from first principles. Solubility can be defined as the saturating concentration of a molecule within a liquid solvent, where the physical picture is of solvated molecules in equilibrium with their solid phase. Despite the importance of solubility in determining the oral bioavailability of pharmaceuticals, prediction tools are currently limited to quantitative structure–property relationships that are fit to experimental solubility measurements. For the first time, we describe a consistent procedure for the prediction of the structure, thermodynamic stability and solubility of organic crystals from molecular dynamics simulations using the polarizable multipole AMOEBA force field. Our approach is based on a thermodynamic cycle that decomposes standard state solubility into the sum of solid-vapor sublimation and vapor-liquid solvation free energies , which are computed via the orthogonal space random walk (OSRW) sampling strategy. Application to the n-alkylamides series from aeetamide through octanamide was selected due to the dependence of their solubility on both amide hydrogen bonding and the hydrophobic effect, which are each fundamental to protein structure and solubility. On average, the calculated absolute standard state solubility free energies are accurate to within 1.1 kcal/mol. The experimental trend of decreasing solubility as a function of n-alkylamide chain length is recapitulated by the increasing stability of the crystalline state and to a lesser degree by decreasing favorability of solvation (i.e. the hydrophobic effect). Our results suggest that coupling the polarizable AMOEBA force field with an orthogonal space based free energy algorithm, as implemented in the program Force Field X, is a consistent procedure for predicting the structure, thermodynamic stability and solubility of organic crystals.
INTRODUCTION
Organic crystals are a critical class of materials for the pharmaceutical industry1 and represent an emerging approach for the low-pressure storage of gases within crystalline metal-organic frameworks (MOFs).2,3 Specific examples of organic crystals include the polymorphs of ice4, pharmaceutical tablets such as paracetamol5,6 (acetaminophen) and acetylsalicylic acid7,8 (aspirin), and those responsible for diseases such as gout9 (monosodium urate monohydrate) and kidney stones10 (calcium oxalate). Key properties of organic crystals that are valuable to predict computationally include their atomic structure, thermodynamic stability and their solubility (i.e. pharmaceutical tablets within the digestive tract). Although significant progress has been made toward in silico crystal structure prediction (CSP),11–14 computational prediction of the thermodynamic stability and solubility of organic crystals from first principles has remained a challenging unsolved problem.11,15
For CSP, the underlying algorithms are characterized by application of a search protocol over space groups, unit cell dimensions and atomic coordinates using intermolecular potentials defined by fixed partial charge 16–20 or fixed atomic multipole force fields.21–24 Although a set of fixed atomic multipoles can model the quantum mechanical electrostatic potential outside the molecular surface of a polar organic molecule in vacuum to high accuracy, changes in electron distribution due to alternate conformations or transfer into a condensed phase environment are neglected unless polarization is accounted for.25,26 Therefore, candidate low energy crystal structures are usually rescored using electronic structure calculations based on density functional theory (DFT) with dispersion corrections.27,28 When these algorithms were applied during the 4th blind test of CSP organized by the Cambridge Crystallographic Data Center (CCDC) in 2007, which featured 4 target organic crystals, 7 of 14 participating groups predicted at least one structure correctly and 4 groups had multiple successful predictions.12 Moreover, in the 5th blind test of CSP in 2010, each of the 6 target organic crystals was predicted correctly by at least one of the 14 participating groups.28
However, despite its importance to the rational design of bioavailable pharmaceutical tablets, quantitative prediction of the thermodynamic stability and solubility of organic crystals from first principles has lagged behind both CSP and advancements in the computation of liquid state thermodynamics.11,15,28 In the context of predicting solubility, the solvation free energy15,29 and stability of amorphous solids30 have been computed using Monte Carlo statistical mechanics simulations with the OPLS-AA force field,18 however, a procedure to predict the thermodynamic stability of crystalline solutes has not been presented. An alternative to predicting solubility from first principles (i.e. from an atomic resolution model of molecular energetics) is the revised General Solubility Equation31,32 (GSE), which predicts the molar aqueous solubility S (i.e. the saturating concentration of the solute within solvent in mol/L) based on knowledge of the Celsius melting point (MP) of the crystal and the octonal-water partition coefficient (Kow) of the molecule
Equation 1 |
The usefulness of the revised GSE in drug design settings where MP, Kow or both are unknown is reduced. Therefore, practical tools for solubility prediction are currently limited to quantitative structure–property relationships (QSPR) and trained on experimental data33–45 as discussed in recent reviews.15,46–48
To address the challenge of computing the structure, thermodynamics and solubility of organic crystals from first principles, this work brings together recent advances from three complementary fields including polarizable atomic multipole force field development, free energy simulation methods and algorithms for efficient treatment of long-range electrostatics inherent to lattice summation. First, the importance of a polarizable atomic multipole force field to accurate modeling of crystal energy landscapes11 arises from aspherical features of the electron density inherent to lone pairs and π-electrons,25 and from perturbations such as conformational flexibility (i.e. intra-molecular polarization)26 or transfer between vapor, liquid and crystalline environments (i.e. inter-molecular polarization). These critical features of the crystal energy landscape are neglected by fixed atomic charge force fields such as Amber49, CHARMM50 or OPLS-AA.18 In this work, the AMOEBA force field captures aspherical atomic electron density using permanent atomic multipoles through quadrupole order and transferability between confirmations and phases using explicit treatment of electronic polarization via induced dipoles.51–53 Second, large energy barriers11 that necessitate advanced statistical mechanics theories to efficiently converge sampling of molecular configurations characterize crystal potential energy surfaces.54–56 Here we overcome these large energy barriers using the orthogonal space random walk (OSRW) strategy.56–58 On the other hand, traditional methods such as free energy perturbation59 or thermodynamic integration60 require knowledge of the crystal structure coordinates, while use of OSRW allows coordinate prediction. Finally, algorithms for long-range van der Waals and electrostatics forces such as particle-mesh Ewald (PME) summation61–63 must leverage the space group symmetry of crystalline solids to achieve computational efficiently, which has recently been implemented within the program Force Field X64 On the other hand, widely used molecular dynamics programs such as Amber65, CHARMM66 and TINKER67 require explicit sampling of the entire unit cell, which both increases the cost per energy evaluation while also slowing the kinetics of crystal nucleation.
In the next section, background on the polarizable AMOEBA force field, solubility thermodynamic cycle and space group symmetry extension of PME are presented. This is followed by theory describing the OSRW free energy algorithm, including details of the derivatives of the AMOEBA potential required by OSRW and are a superset of the derivatives required for thermodynamic integration, which are presented here for the first time. The overall implementation of AMOEBA and OSRW in Force Field X is validated by comparison to monovalent ion solvation free energies computed using standard free energy perturbation. Prediction of the structure, thermodynamics and solubility of the n-alkylamides from acetamide to octanamide is then presented. The n-alkylamides series was chosen due to the dependence of its solubility trend on both amide hydrogen bonding and the hydrophobic effect, which are both of broad importance to biomolecular structure and function.
BACKGROUND
A. The Potential Energy Surface of Organic Crystals
The non-covalent interactions between molecules of an organic crystal can be decomposed into van der Waals and electrostatic forces. Until recently, the atomic resolution force fields used to study crystal energetics and liquid state properties were based on the approximation that the electrostatic charge of each atom is invariant, or fixed, with respect to its local environment.68 In reality, the electron density deforms in response to the electric field of its environment. For example, negatively charged electron density moves toward the positive potential of a cation such as Na+ or K+ and away from the negative potential of an anion such as Cl−. The average electric field within a crystal is different from that experienced by a molecule when solvated or in vapor. In the past, this necessitated choosing between fixed atomic charges that were optimal for either the crystalline phase or the liquid phase, but not both. Fortunately, increasing computational power has facilitated explicit inclusion of electronic polarization into force fields for organic molecules and thereby improved their transferability between vacuum, liquid and crystalline phases.52,69–75 In this work, the crystal potential energy landscape is defined by the polarizable Atomic Mulitpole Optimized Energetics for Biomolecular Applications (AMOEBA) force field52 for organic molecules.53,76 The AMOEBA potential is then sampled using the OSRW algorithm described below in order to compute the thermodynamic stability of crystals while also predicting their atomic coordinates.
B. Solubility Thermodynamic Cycle for Organic Crystals
Solubility defines the equilibrium concentration of a solute in contact with an excess of its solid state. The physical process reaches equilibrium when the rate of molecules joining the solid state (assumed to be crystalline in this work) from solution equals the rate of molecules leaving the solid to be solvated. Although the required system size and time scale for explicit atomic resolution simulation of the crystalline-solution phase equilibrium is prohibitive, the standard state solubility free energy ( ) can be decomposed into the sum of standard state sublimation ( ) and solvation ( ) free energy steps
Equation 2 |
as shown in the thermodynamic cycle of Figure 1. Although solvation free energies are routinely computed to high precision,77,78 calculation of the sublimation free energy by annihilation of the entire crystalline solid is novel to this work and is depicted in Figure 2. A correction is applied to the simulation result to account for the change in volume between a crystalline solid containing 1 mole of molecules and the standard state concentration of 1 M chosen for both the vapor and solvated states.
C. Long-Range Forces in Organic Crystals
Evaluation of the polarizable AMOEBA electrostatic energy for a crystalline lattice builds on the theory for lattice summation proposed by Ewald.79 The real space portion of the theory was described by Ren and Ponder in the context of the AMOEBA water model80 and was based on earlier work by Smith.81 The reciprocal space portion is accelerated by fast Fourier transforms (FFT) as described by Darden and colleagues for fixed charge61,62 and multipolar63 atomic electrostatics. Further acceleration of PME for crystals has been achieved via explicit support for space group symmetry,64 which has been applied to biomolecular X-ray refinement.64,82 Without explicit support for space group symmetry, the alchemical transformation shown in Figure 2 would need to consider nsymm molecules, where nsymm is the number of symmetry operators of the space group. For an acetamide crystal, which has 18 molecules in its unit cell (nsymm=18), the kinetics of nucleation for the alchemical transformation may be computationally prohibitive. In particular, the approximate 18x increase in the cost per energy evaluation is exacerbated by the need to convergence an 18x larger free energy change, which is accompanied by crystal nucleation kinetics.
METHODS
A. Orthogonal Space Random Walk
In order to effectively overcome large barriers in the crystalline free energy landscape, a second-order generalized ensemble method called the orthogonal-space random walk (OSRW) is applied.56,57 The central idea of generalized ensemble (GE) free energy simulation methods is to employ a modified ensemble, which allows free energy barriers in the target ensemble to be crossed more frequently in order to efficiently sample the distributions required for free energy estimations. A first-order GE modification is based on the order parameter λ and the biasing energy term fm(λ) in the following target potential:
Equation 3 |
where λ=1 corresponds to the solute interacting with the condensed phase environment and λ=0 to the solute uncoupled from the condensed phase environment. In the first-order GE regime, the biasing term fm(λ) is commonly updated to approach − GAMOEBA(λ), which is the negative of the λ-dependent free energy profile corresponding to the canonical ensemble with the AMOEBA potential energy function. In this way, an order parameter space random walk can be achieved to uniformly sample all the target states. To adaptively estimate GAMOEBA(λ), three major recursion schemes have been developed. The first is adaptive umbrella sampling,83 in which free energy estimations are based on order parameter probability distributions. The second is the adaptive biasing force (ABF) method,84–86 in which free energy estimations are based on the thermodynamic integration (TI) formula.60,87 Finally, multiplicative approaches, such as the metadynamics method for molecular dynamics simulations55 and the Wang-Landau method for Monte-Carlo simulations,88 are realized through dynamic force-balancing.
In first-order GE simulations the free energy surface along λ is flattened, however, “hidden” free energy barriers may exist in the space perpendicular to λ. These “hidden” free energy barriers create a sampling challenge due to slow environmental relaxation. To address this, the second-order GE simulation method, the orthogonal space random walk (OSRW) algorithm, was introduced based on the following modified energy function:
Equation 4 |
where gm(λ, Fλ) is targeted towards −GAMOEBA(λ, Fλ) the negative of the free energy profile along (λ, Fλ) in the ensemble corresponding to the energy function UAMOEBA(λ) − GAMOEBA(λ), and Fλ = ∂UAMOEBA(λ) ∂λ In contrast to first-order GE methods, OSRW requires two recursion components to be adaptively updated. The recursion component responsible for gm(λ, Fλ) is called the “recursion kernel” and the recursion component responsible for fm(λ) is called the “recursion slave” because its target − GAMOEBA(λ) can be determined by the target of gm(λ, Fλ). In our implementation, the recursion slave is determined using the TI relationship and metadynamics is used for the recursion kernel. However, the recursion kernel can be based on any of the three recursion methods discussed above.
The free energy biasing potential gm(λ, Fλ) can be obtained by repetitively adding a relatively small Gaussian-shaped repulsive potential:
Equation 5 |
that is centered at the state [λ(ti), Fλ(ti)] being visited at time ti and thereby discourages the system from often visited configurations. Repeating this procedure builds up the biasing potential
Equation 6 |
and eventually flattens the underlying curvature of the free energy surface along the (λ, Fλ) space. Then, as in traditional metadynamics simulations, the free energy profile along the reaction coordinate (λ, Fλ) will eventually converge to −GAMOEBA(λ, Fλ) and is estimated as −gm(λ, Fλ). For any state λ́, the generalized force distribution should be proportional to e{βgm[λ́Fλ(λ́)]} and the free energy derivative at each state is
Equation 7 |
Following the thermodynamic integration formula, the free energy change between the initial state λi (for example λi=0) and the target state λ (for example λ=1) is given by
Equation 8 |
In our OSRW implementation, the metadynamics strategy described in Eq. 6 serves as the recursion kernel and the TI based formula (Eqs. 7 and 8) serves as the recursion slave with fm(λ) set recursively as the instantaneous estimation of − GAMOEBA(λ)
Our first goal was to derive, implement and parallelize derivatives with respect to λ of the polarizable AMOEBA potential energy function U(λ, X) where λ defines a continuous, softcore transition between the asymmetric unit (solute) fully interacting with its symmetry mates (condensed phase environment) to decoupled in order to compute the sublimation free energy. Analogous decoupling schemes are used for computing solvation free energy (i.e. a solute decoupled from solvent) and binding free energy (i.e. ligand decoupled from the binding pocket of a solvated protein). The terms needed for the various λ-based sampling methods may include ∂U(λ, X)/∂λ, ∂2U(λ, X)/∂λ2 and/or ∂2U(λ, X)/∂X ∂λ. We briefly present key aspects of these derivatives with special emphasis on unique advancements required for the AMOEBA buffered 14-7 van der Waals softcore term, the permanent multipole softcore term and the polarization energy. Parallelization depends on Parallel Java, which is a unified API for shared memory and cluster parallel programming.89
B. AMOEBA Intermolecular Interactions and Their λ Derivatives
1. Softcore Buffered 14-7 van der Waals
The AMOEBA model for water,80,90 ions,91 organic molecules53 and proteins52,92 uses a buffered 14-7 van der Waals term proposed by Halgren.93 A softcore form that allows the potential to smoothly disappear as λ goes to zero has been described previously92 and is given by
Equation 9 |
where for convenience the expression has been factored into the following two terms
Equation 10 |
Equation 11 |
where εij is the well-depth, ρ = r/ŕ is the atomic separation distance r normalized by the minimum energy distance ŕ, the parameters (n=14, m=7, δ=0.07, γ=0.12) specify a buffered 14-7 form (a 6-12 Lennard-Jones potential is recovered by n=12, m=6, δ=0 and γ=0) and the parameters α and β tune the softcore transition.92 In the original work, the buffered 14-7 potential was completely turned on before polarizable atomic multipole electrostatics was considered,92 however, it is more convenient to modify both the van der Waals and electrostatics potentials concomitantly over the λ path. Adoption of the latter approach necessitated different α and β parameters, which are discussed further in the Results section below.
The various derivatives of the softcore buffered 14-7 interaction required for OSRW are now described. First, the partial derivative of Uvdw with respect to an atomic coordinate is given by
Equation 12 |
where
Equation 13 |
and
Equation 14 |
Next, the partial derivative of Uvdw with respect to λ is shown
Equation 15 |
where
Equation 16 |
and
Equation 17 |
For the 2nd derivative of the Uvdw with respect to λ it is convenient to separate the previous first derivative of Equation 15 into a sum of three terms as
Equation 18 |
such that ∂2UvdW(λ, r)/∂λ2 is then given by
Equation 19 |
where
Equation 20 |
Equation 21 |
and
Equation 22 |
In the above expressions, only ∂2t1/∂λ2 and ∂2t2/∂λ2 remain to be defined and are given by
Equation 23 |
and
Equation 24 |
Finally, the 2nd partial derivative with respect to both an atomic coordinate and λ is given by
Equation 25 |
where
Equation 26 |
Equation 27 |
and
Equation 28 |
Only the terms ∂2t1/∂λ∂ri,γ and ∂2t2/∂λ∂ri,γ remain to be defined and are given by
Equation 29 |
and
Equation 30 |
2. Softcore for Multipolar Real Space Ewald Summation
For real space Ewald electrostatic interactions between permanent multipoles, it is essential to formulate inclusion of softcore behavior in a manner that facilitates concise higher order tensors and partial derivatives. Our approach replaces the atomic separation distance r = |rj − ri| by and the parameter α tunes the softcore transition. The softcore electrostatic interaction energy between permanent multipoles through quadrupole order at sites i and j is given by
Equation 31 |
where the coefficients for each order l depend on the multipole coefficients of each site and were given by Smith.81 The function Bl(u) is defined by
Equation 32 |
for the 0th order term based on the complementary error function
Equation 33 |
and higher order Bl(u) terms are obtained from the recursion relationship
Equation 34 |
The derivative of the permanent real space electrostatic energy with respect to an atomic coordinate ri,γ of atom i is given by
Equation 35 |
where the partial derivatives for were given by Smith81 and
Equation 36 |
The 1st derivative with respect to λ is
Equation 37 |
where it can be shown that
Equation 38 |
and the 2nd derivative is given by
Equation 39 |
where
Equation 40 |
Finally, the 2nd derivative of the softcore permanent multipole real space electrostatic energy with respect to an atomic coordinate ri,γ of atom i and λ is given by
Equation 41 |
where
Equation 42 |
3. Permanent Reciprocal Space Ewald Summation
The reciprocal space portion of particle-mesh Ewald summation due to permanent multipoles is not pairwise and is therefore calculated by interpolating between the potential due to the total system (i.e. infinite crystal, solute plus solvent, or ligand plus protein/solvent) and that due to only the solute (i.e. asymmetric unit, solute or ligand). The latter calculation is significantly faster than the computation of the reciprocal potential due to the entire condensed phase system minus the solute (i.e. symmetry mates without the asymmetric unit, solvent without the solute or protein/solvent without the ligand), which can be determined from the relationship
Equation 43 |
The permanent reciprocal space potential is then given by
Equation 44 |
and its 1st derivative with respect to λ is
Equation 45 |
while the 2nd derivative vanishes. The dependence of the permanent multipole reciprocal space energy Urecip(λ) on λ can be formulated as a function of the reciprocal space electrostatic potential φrecip(λ) to give Urecip (φrecip(λ)), which makes it clear that the energy and coordinate gradient can be computed as usual. The first derivative of the permanent multipole reciprocal space energy with respect to λ is given by , the 2nd derivative with respect to λ is zero and terms of the form ∂2Urecip(φrecip(λ))/∂λ∂ri,γ are analogous to the atomic coordinate gradient.
4. Polarization Energy
The AMOEBA polarization model is based on induced dipoles at each atomic center, which are polarized by the field of the permanent multipoles as well as the field of the induced dipoles themselves.53 Pauli repulsion balances favorable deformation of the electron cloud, which is captured by smearing the induced dipole as described by Thole94 via
Equation 46 |
where a is a dimensionless constant that controls the degree of smearing and u = rij/(αiαj)(1/6) is an effective distance that depends on the polarizability α of each atom.
Calculation of the induction energy along the alchemical path is similar to the calculation of the reciprocal space potential due to permanent multipoles because it is not pairwise and interpolation is used. For small values of λ, solute (i.e. asymmetric unit) atoms frequently overlap with solvent (i.e. symmetry mate) atoms because of the softcore modification to solute-solvent van der Waals interactions smoothly and completely eliminates Pauli repulsion by λ = 0. Although the Thole model damps polarization at short range, the self-consistent field (SCF) calculation can require a larger number of iterations than is usual at unphysical, small atomic separations. Therefore, the contribution of the solute to condensed phase polarization is not considered until λ achieves a value greater than . An additional benefit of this approach is that approximately ¾ of the alchemical path requires a single condensed phase SCF calculation, rather than two. As the SCF is the most computationally expensive aspect of evaluating the AMOEBA potential energy and its derivatives, this saves approximately a factor of 2 with regards to the overall efficiency of the OSRW algorithm.
For the interval , the vapor polarization energy for the solute is calculated and summed with the condensed phase energy in the absence of the solute . Over the interval , this sum is smoothly turned off based on shifting and scaling λ to give a state variable for polarization . Concomitantly, the total polarization energy of the condensed phase system is turned on
Equation 47 |
The cubic dependence on λ ensures that the 1st derivative of Upol(λ) with respect to λ is zero at
Equation 48 |
as are the 2nd derivatives
Equation 49 |
and
Equation 50 |
C. Parameterization of Amide Functional Groups for AMOEBA
A 1981 study of acetamide structures from a neutron diffraction experiment at 23 K and ab initio molecular orbital studies in vacuum demonstrated that there are significant equilibrium bond length differences for both C=O and C-N as a function of environment due to hydrogen bonding.95 One consequence is that the total dipole moment in vacuum of the equilibrium vacuum and crystalline structures differ by approximately 10%, as shown in Table 1. This difference is large enough that the solvation free energy for members of the n-alkylamide series is sensitive to the bond lengths used during parameterization of AMOEBA electrostatics. The sensitivity can be rationalized from the quadratic dependence of the solvation free energy on the total molecular dipole moment. For AMOEBA parameterization, the C-N and C=O bond lengths of the n-alkylamides were fixed at 1.34 Å and 1.23 Å, respectively. This protocol yielded a dipole moment for acetamide of 3.98 Debye, which is intermediate between that of unrestrained vacuum optimization (3.69 Debye) and that from fixing the C=O and C-N bond lengths at condensed phase values (4.07 to 4.17 Debye).
Table 1.
Source | Environment | Opt. | Temp. (K) | C-N | C=O | Dipole |
---|---|---|---|---|---|---|
6-31G* | Vacuum | All | 1.359 | 1.197 | 3.69 | |
Electron Diffraction98 | Vacuum | C, H | 1.380 | 1.220 | 3.71 | |
B3LYP/6311++G** | Vacuum | All | 1.369 | 1.215 | 3.78 | |
AMOEBA Parameterization | Vacuum | C, H | 1.340 | 1.230 | 3.98 | |
X-ray (ACEMID02)99 | R3c | C, H | 108 | 1.336 | 1.243 | 4.07 |
Neutron (ACEMID03)95 | R3c | C, H | 23 | 1.337 | 1.250 | 4.10 |
X-ray (ACEMID05)100 | R3c | C, H | 23 | 1.335 | 1.252 | 4.12 |
X-ray (ACEMID01)101 | R3c | C, H | 283–303 | 1.338 | 1.258 | 4.14 |
X-ray (ACEMID)102 | Pccn | C, H | 283–303 | 1.334 | 1.260 | 4.17 |
B3LYP/6311G** | R3c | ALL | 1.336 | 1.249 | 4.17 |
The lattice energy (Elattice) for the AMOEBA acetamide model will be compared to periodic solid-state ab initio calculations based on the program suite CRYSTAL’09,96,97 which uses functions localized at atoms as the basis for expansion of the crystalline orbitals via a linear combination of atomic orbitals (LCAO) technique. All-electron Gaussian type basis sets and the hybrid B3LYP103,104 Hamiltonian was used. The DFT exchange–correlation contribution is evaluated by numerical integration over the unit cell volume. Radial and angular points of the grid were generated through Gauss–Legendre radial quadrature and Lebedev two-dimensional angular point distributions with a pruned grid of 75 radial and 974 angular points. The level of accuracy in evaluating the Coulomb and Hartree–Fock exchange series was controlled by five parameters,97 and values of 7, 7, 7, 7, 16 were used. The reciprocal space integration was performed by sampling the Brillouin zone with the 6 × 6 × 6 Pack-Monkhorst net.105 Structure optimizations were performed using analytical energy gradients with respect to atomic coordinates with cell parameters fixed,106–108 within a quasi-Newton scheme combined with the Broyden–Fletcher–Goldfarb–Shanno scheme for Hessian updating.109–112 Convergence was checked on both gradient components and nuclear displacements and was signaled when the RMS gradient was 0.00015 Hartree/Bohr and RMS displacement was 0.0006 Bohr. The 6-31G*113,114 and 6-311G**115 basis sets were used and condensed phase energies were corrected for BSSE via the counterpoise method.116
The lattice energy was computed as
Equation 51 |
where Eunit is the total energy of the unit cell, Z is the number of the molecules in the unit cell and Emol is the total energy of acetamide optimized in vacuum. Observing the results in Table 2, the AMOEBA lattice energy for acetamide compares well to the ab initio values before the zero point energy (ZPE) was applied. After the ZPE is taken into account, AMOEBA overestimates the lattice energy by a little more than 2 kcal/mol at 0°K compared to DFT. The effect of this limitation on crystal thermodynamics near room temperature is difficult to project.
Table 2.
Potential | Elattice | Elattice with ZPE |
---|---|---|
AMOEBA | −20.78 | |
B3LYP/6-31G* | −20.54 | −18.28 |
B3LYP/6-311G** | −20.06 | −17.84 |
D. Previous AMOEBA Free Energy Perturbation Protocol
The first applications of the AMOEBA force field to the computation of free energy were limited to procedures that did not require the derivative of the potential energy with respect to λ, such as thermodynamic integration, which was not been completed until the present work. The original protocol, based on free energy perturbation (FEP), is described here for ion solvation and will be compared to OSRW in the Results section that follows.91 For FEP, particle growth was completed with the charge and polarizability of each ion set to zero. During particle growth, the van der Waals radius (r) and well depth (ε) parameters were defined by
Equation 52 |
and
Equation 53 |
respectively, for λ = (0.01,0.05,0.1,0.2,0.3,0.4,0.5,0.6,0.7,0.8,0.9,1.0). During the subsequent charging stage, the charge (q) and polarizability (α) of each ion were defined by
Equation 54 |
and
Equation 55 |
respectively, for λ = (0.0,0.1,0.2,0.3,0.4,0.5,0.6,0.7,0.8,0.9,1.0) where the first charging trajectory is equivalent to the last growth trajectory. For each λ value, 200 psec of constant volume molecular dynamics was performed using an 18.643 Å cubic box containing 216 AMOEBA water molecules. The Helmholtz free energy between adjacent λ values ΔA(λi → λi+1) was calculated via half-steps as
Equation 56 |
plus
Equation 57 |
using the last 150 psec of each trajectory.91
The standard state solvation free energy (a relative free energy due to transferring the solute from vapor to solution phase) is independent of the solution concentration (Cs) as long as the standard state of the vapor phase is chosen to be the same as that of solution (e.g. 1 M), and the solution is dilute enough that solute molecules do not interact (no excess free energy of solvation). The standard state concentration C° then cancels as shown below
Equation 58 |
In this case, solvation free energy simulation results are essentially independent of the number of water molecules beyond a lower threshold. For example, the volume per molecule at 1M is 1660 Å3, while for the ion solvation simulations the volume per molecule was approximately 6500 Å3 and for the n-alkylamides discussed below it was 15000 Å3. In both cases, there are negligible interactions between solute periodic images.
RESULTS
A. Validation on Ion Solvation Free Energy
The first free energy calculations using the AMOEBA water model80,90 focused on the solvation free energy of K+, Na+ and CI− ions and demonstrated very good agreement with experimental results for the solvation energy of KCl and NaCl salts.91 Here we compare the free energy methodology used in the initial ion solvation work, described above, with our OSRW approach in terms of accuracy and efficiency. The current OSRW protocol follows Grossfield et al. in terms of simulation box size (18.643 Å), number of water molecules (216), temperature (300 K), use of a Berendsen weak coupling thermostat (0.1 psec time constant),117 time step (1.0 fsec), SCF convergence criteria (0.01 RMS Debye), van der Waals cutoff (12 Å) and ensemble (NVT), but differs in a few notable ways that are now described.91 First, the current work uses of a softcore version of the buffered 14-7 van der Waals potential92 as described above, which avoids the difficulties encountered in the original particle growth protocol as λ → 0. Secondly, the original work used conventional Ewald summation80 whereas the more efficient PME approach is used for the current AMOEBA/OSRW protocol as implemented in Force Field X63,64 The PME parameters used throughout this work include a real space cutoff of 7.0 Å, an Ewald convergence parameter of 0.545, 5th order B-spline interpolation and a reciprocal grid spacing of 1.2 per Å. Finally, whereas the original protocol divided particle growth and charging into separate steps, our approach simultaneously performs 1.) growth of the van der Waals potential, 2.) charging of the condensed phase permanent electrostatic interactions and 3.) interpolation between vapor and condensed phase ligand polarization. Coupling ligand van der Waals growth with scaling up of electrostatic interactions between the ligand and condensed phase environment may be more efficient than separation into two stages. In addition, the latter approach does not require a vapor-discharging step for polyatomic ligands because intramolecular solute electrostatics are used throughout the protocol.
The ion solvation free energies under the AMOEBA/OSRW protocol are converged after only 0.5 nsec as shown in Figure 3. Furthermore, a plot of the cumulative free energy as a function of λ demonstrates smooth behavior, although small fluctuations in the ensemble average of ∂U/∂λ remain as shown in Figure 4. The original results of Grossfield et al were referenced to a vapor standard state of 1 atm. Therefore, the AMOEBA/OSRW single ion solvation results were corrected to account for the cost to compress an ideal gas to 1M, which is given by
Equation 59 |
where kB is the Boltzmann constant, T is absolute temperature and the resulting correction is 1.9 kcal/mol.
In Table 3, we compare the AMOEBA/OSRW ion solvation energies for K+, Na+ and Cl− to the original values for AMOEBA, OPLS-AA and CHARMM27 that were computed using FEP by Grossfield et al.91 In each case, the AMOEBA/OSRW result is approximately 1 kcal/mol more positive than the original calculation using FEP. This is partially explained by the lack of a softcore definition of ion-solvent van der Waals interactions in the original work, which is consistent with underestimating the cost to form a cavity. This effect cancels for the relative solvation free energy between K+ and Na+ such that OSRW and FEP agree to within 0.1 kcal/mol. For the whole salts, the original FEP solvation energies were too negative by approximately 0.5 kcal/mol, while our results with OSRW are too positive by 1.2–1.5 kcal/mol. However, the original conclusions of Grossfield et al are unchanged.
Table 3.
Method | Potassium | Sodium | K+ -> Na+ | Chloride | KCl Salt | NaCl Salt |
---|---|---|---|---|---|---|
AMOEBA/OSRW | −71.7 | −88.9 | −17.2 | −83.5 | −155.3 | −172.4 |
AMOEBA/FEP | −72.6 | −89.9 | −17.3 | −84.6 | −157.2 | −174.5 |
OPLS-AA | −65.7 | −83.9 | −18.2 | −85.3 | −151.0 | −169.2 |
CHARMM27 | −70.3 | −91.0 | −20.7 | −91.2 | −161.5 | −182.2 |
Schmid118 | −69.3 | −86.8 | −17.5 | −87.2 | −156.5 | −174.0 |
Friedman119 | −80.8 | −98.3 | −17.5 | −75.8 | −156.6 | −174.1 |
Tissandier120 | −84.1 | −101.3 | −17.2 | −72.7 | −156.8 | −174.0 |
B. Structure, Thermodynamics and Solubility of n-Alkylamides
1. Experimental n-Alkylamide Unit Cell Parameters, Solvation Free Energy and Standard State Solubility
In this work we limit ourselves to canonical ensemble crystal thermodynamics based on the experimentally determined space group and unit cell parameters for the n-alkylamides as shown in Table 4.101,12–24 The space group for each n-alkylamide is P21/c,121 except for the rhombohedral structure of acetamide.101 For propanamide through octanamide, the length of a increases monotonically with chain length, while b and c are nearly constant.121 Atomic coordinates for acetamide,101 propanamide,122 butanamide,124 and pentanamide123 have been solved and deposited in the Cambridge Crystallography Database permitting comparisons with the lowest energy structures found during this work. Although experimental structures for hexanamide, heptanamide and octanamide are currently unknown, we present predictions below as an illustration that the current approach yields structural information in addition to thermodynamics.
Table 4.
n-alkylamide | Temp. | SG | a | b | C | α | β | γ | Vol. | Z | Vol./Z |
---|---|---|---|---|---|---|---|---|---|---|---|
Acetamide | 298a | R3c | 11.53 | 11.53 | 13.59 | 90 | 90.0 | 120 | 1563.4 | 18 | 86.9 |
Propanamide | 298b | P21/c | 8.97 | 5.78 | 9.76 | 90 | 113.5 | 90 | 464.1 | 4 | 116.0 |
203c | P21/c | 8.67 | 5.76 | 9.81 | 90 | 113.8 | 90 | 448.2 | 4 | 112.1 | |
Butanamide | 298b | P21/c | 10.02 | 5.79 | 9.94 | 90 | 100.9 | 90 | 566.3 | 4 | 141.6 |
150d | P21/c | 9.81 | 5.92 | 9.70 | 90 | 112.1 | 90 | 522.6 | 4 | 130.7 | |
Pentanamide | 298b | P21/c | 11.08 | 6.13 | 9.84 | 90 | 100.7 | 90 | 656.7 | 4 | 164.2 |
150e | P21/c | 11.06 | 5.83 | 9.79 | 90 | 103.5 | 90 | 613.8 | 4 | 153.5 | |
Hexanamide | 298b | P21/c | 14.11 | 5.65 | 9.78 | 90 | 101.6 | 90 | 763.8 | 4 | 190.9 |
Heptanamide | 298b | P21/c | 15.29 | 5.65 | 9.90 | 90 | 105.0 | 90 | 826.1 | 4 | 206.5 |
Octanamide | 298b | P21/c | 17.39 | 5.69 | 9.78 | 90 | 91.9 | 90 | 967.2 | 4 | 241.8 |
In addition to structural information, experimental values for the melting point121 and solubility125 for n-alkylamides have been reported and are summarized in Table 5. Furthermore, the solvation free energy of acetamide (−9.68 kcal/mol) and propanamide (−9.38 kcal/mol) have been determined in the context of studying amino acid side-chain analogues.126 We note that the melting points of acetamide and propanamide are lower by 20°K than those for the longer chain length n-alkylamides. Experimental solubility values for acetamide, butanamide, pentanamide and hexanamide are available from the Handbook of Aqueous Solubility Data,125 although uncertainty estimates were not provided. Discrepancies between the saturating concentrations of acetamide from three sources suggest errors on the order of ~0.5 kcal/mol.125
Table 5.
n-Alkylamide | Melting Point (K) | Solubility (mg/ml) | ΔGvol | ΔGexcessa | ||
---|---|---|---|---|---|---|
Acetamide | 352–354 | 705, 2000 | −1.46, −2.08 | 1.9 | 0.44, −0.18 | |
Propanamide | 350–351 | |||||
Butanamide | 386–388 | 230 | −0.57 | 0.3 | −0.27 | |
Pentanamide | 378–379 | 80 | 0.14 | 0.14 | ||
Hexanamide | 372–374 | 30 | 0.80 | 0.80 | ||
Heptanamide | 368–369 | |||||
Octanamide | 378–379 |
2. n-Alkylamide Solubility from Sublimation and Solvation Free Energies
The sublimation free energy for each n-alkylamide was calculated based on sampling the AMOEBA potential for organic molecules53 using OSRW56–58 by alchemically transforming a single molecule in vapor (λ = 0) into an infinite, perfectly symmetric crystal (λ = 1) via strict enforcement of space group symmetry64 within the Force Field X program.64,82,129 Each n-alkylamide molecule repeatedly sampled the minimum crystalline free energy basin (λ = 1) over the course of the simulation, as well as alternative conformations relatively higher in free energy. Each initial n-alkylamide conformation was a vacuum minima used for parameterization of the AMOEBA model in combination with an initial λ value of 0. Any reasonable starting confirmation produces identical thermodynamics once convergence has been reached. The sublimation simulations employed the parameters described above for ion solvation, with one notable difference. Unlike the ion and n-alkylamide solvation free energy calculations that used molecular dynamics, sublimation simulations used stochastic dynamics due to the small number of atoms being simulated.
Each n-alkylamide was fully flexible at all points in the simulation such that conformations or orientations that were not appropriate for the crystalline phase create a large force on the metadynamics particle favoring a return to vapor. Once in vapor, the molecule can easily change conformation and/or orientation. When a conformation is sampled in vapor that is consistent with the crystalline phase, a term in the force applied to the metadynamics particle (−∂U/∂λ) favors changing into condensed phase. Once in an energetically favorable crystalline state, the time dependent biasing potential builds up to allow the simulation to return to the vapor state. Before convergence is reached, each n-alkylamide sampled the correct crystalline conformation tens of times, in addition to exhaustively sampling torsional and rigid body degrees of freedom in vapor.
Based on the results shown in Table 6, the standard deviation for the sublimation free energy of each n-alkylamide averages only 0.25 kcal/mol after 100 nsec simulations for acetamide through hexanamide and 200 nsec simulations for heptanamide and octanamide. The sublimation free energies increase monotonically for propanamide through octanamide, which each share the P 21/c space group. The rhombohedral acetamide crystal is not expected to follow the trend of the larger n-alkylamides, since its crystal packing is substantially different from the others.
Table 6.
n-Alkylamide | Simulation
|
ΔGsim | ΔGvol | σ | ||||||
---|---|---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | ||||||
Acetamide | 12.7 | 12.8 | 12.7 | 12.6 | 12.5 | 12.7 | −1.8 | 10.9 | 0.1 | |
Propanamide | 10.1 | 10.1 | 10.0 | 9.8 | 10.2 | 10.1 | −1.6 | 8.5 | 0.2 | |
Butanamide | 11.8 | 12.2 | 11.8 | 12.1 | 11.9 | 12.0 | −1.5 | 10.5 | 0.2 | |
Pentanamide | 12.5 | 12.6 | 12.8 | 12.3 | 12.8 | 12.6 | −1.4 | 11.2 | 0.2 | |
Hexanamide | 13.5 | 13.1 | 13.1 | 13.4 | 13.4 | 13.3 | −1.3 | 12.0 | 0.2 | |
Heptanamide | 14.9 | 13.6 | 14.6 | 14.5 | 13.6 | 14.2 | −1.3 | 13.0 | 0.6 | |
Octanamide | 15.3 | 15.5 | 16.1 | 15.4 | 15.2 | 15.5 | −1.2 | 14.4 | 0.4 |
The standard state solvation free energies for each n-alkylamides are given in Table 7 and were calculated based on 5 nsec simulations with the AMOEBA model for water80,90 and organic molecules.53 The OSRW56–58 sampling method was used to alchemically solvate a single molecule in vapor (λ = 0) into a cubic box of water (λ = 1) using the Force Field X program. The simulation system was equilibrated with 500 waters within a cubic box of 24.662 Å. All other simulation parameters were changed from those reported for ion solvation. As for the sublimation free energy simulations, the initial conformation of each n-alkylamides was a vacuum energy minimum used for parameterization in tandem with an initial λ value of 0. The calculated standard state solvation free energies of acetamide and propanamide of −10.3 and −8.3 kcal/mol, respectively, agree with experiment to within 0.8 kcal/mol on average. We note that mean standard deviation for the solvation free energies of 0.3 kcal/mol is comparable to that for the sublimation free energies.
Table 7.
n-Alkylamide | Simulation
|
σ | ||||||
---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | ||||
Acetamide | −10.2 | −10.2 | −10.5 | −10.4 | −10.3 | −10.3 | 0.1 | |
Propanamide | −8.6 | −8.4 | −8.2 | −8.0 | −8.4 | −8.3 | 0.2 | |
Butanamide | −10.8 | −10.9 | −10.1 | −10.3 | −10.4 | −10.5 | 0.3 | |
Pentanamide | −9.8 | −9.6 | −9.3 | −9.9 | −9.9 | −9.7 | 0.2 | |
Hexanamide | −9.4 | −9.1 | −9.4 | −9.6 | −9.5 | −9.4 | 0.2 | |
Heptanamide | −9.6 | −9.3 | −9.6 | −10.2 | −8.9 | −9.5 | 0.5 | |
Octanamide | −10.1 | −8.7 | −9.4 | −9.2 | −9.6 | −9.4 | 0.5 |
From the thermodynamic cycle depicted in Figure 1, summing the sublimation free energies given in Table 6 with the solvation free energies given in Table 7 yields the standard state solubility free energies as reported in Table 8. For acetamide through butanamide, the standard state sublimation and solvation free energy values nearly cancel, which yields absolute standard state solubility free energy values near zero. For the n-alkylamides from pentanamide through octanamide, the combined trends of increasingly stable crystals coupled with decreasingly favorable solvation due to the hydrophobic effect leads to increasingly lower solubility, as shown in Figure 5. The reason for the relatively lower solubility predicted for hexanamide by our method remains unclear. Until further studies can be completed, possible explanations include experimental uncertainty, force field accuracy or undetected sampling difficulties for the larger molecules.
Table 8.
n-Alkylamide |
|
Log (S)a |
|||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Calc. | Expt. | Error | Calc. | Expt. | Error | ||||||
Acetamide | 10.9 | −10.3 | 0.6 | −0.18 | 0.8 | 1.0 | 1.2 | 0.3 | |||
Propanamide | 8.5 | −8.3 | 0.1 | 0.9 | |||||||
Butanamide | 10.5 | −10.5 | 0.0 | −0.27 | 0.3 | 0.2 | 0.4 | 0.2 | |||
Pentanamide | 11.2 | −9.7 | 1.5 | 0.14 | 1.4 | −1.1 | −0.1 | 1.0 | |||
Hexanamide | 12.0 | −9.4 | 2.6 | 0.80 | 1.8 | −1.9 | −0.6 | 1.3 | |||
Heptanamide | 13.0 | −9.5 | 3.5 | −2.6 | |||||||
Octanamide | 14.4 | −9.4 | 5.0 | −3.6 | |||||||
Mean | 1.1 | 0.7 |
Solubility (S) in units of mol/L is calculated from where ΔGexcess was given in Table 5, kB is the Boltzmann constant and T is temperature (298°K).
3. n-Alkylamide Crystal Structures
As the OSRW procedure initially approaches the crystalline state (i.e. as λ approaches 1), conformations that are sampled will likely be in the free energy minimum region. However, there may exist hidden free energy barriers that promote sampling of conformations whose ∂U/∂λ values are inconsistent with the free energy minimum. In either case, the OSRW procedure will build up repulsive Gaussian potentials that drive ∂U/∂λ toward a random walk and thereby accelerate convergence of crystalline conformations. As a validation of the OSRW sampling procedure and the AMOEBA force field, samples were saved every 0.1 nsec that were subsequently completely optimized. The lowest potential energy structures for acetamide through pentanamide are shown in Figures 6–9, respectively. Except in the case of butanamide, the structure of the AMOEBA potential energy minimum agrees with the experimental X-ray structure. However, the AMOEBA models are more accurate with regards to the placement of hydrogen atoms, which results from limitations in the isolated atom scattering model (IAM) used in many X-ray structure refinement programs.130
DISCUSSION
A. AMOEBA Transferability Between Vacuum, Liquid and Solid Phases
An important motivation for developing polarizable force fields is to improve transferability between vacuum, liquid and solid phases and between environments with vastly different dielectric constants. 52,68,72 The inherent dependence of solubility on the equilibrium between a molecule in a solid crystalline phase and solvated in a liquid phase provides a test of the transferability of the AMOEBA force field.53,80 The mean signed error between the experimental and calculated absolute standard state solubility values for the n-alkylamides of 1.1 kcal/mol (Table 8) is a promising first step toward the prediction of solubility from first principles to chemical accuracy.
B. Efficient Computation of Free Energies via OSRW
Advancements in potential energy functions require further work to integrate them with existing alchemical free energy methods such as thermodynamic integration, meta-dynamics55,131 and OSRW.56–58 In this work, we have presented derivatives of the AMOEBA potential that are required for OSRW, which include ∂U(λ, X)/∂λ, ∂U2(λ, X)/∂λ2 and ∂U2(λ, X)/∂X∂λ. In particular, our approach includes softcore modifications to the buffered 14-7 van der Waals term and to the real space portion of Ewald summation in order to facilitate a smooth transition between full strength intermolecular interactions and non-interacting. As demonstrated, an approach that allows simultaneous sampling of the λ and ∂U(λ, X)/∂λ spaces is very effective for sampling the rough free energy landscape of the deposition/sublimation phase transition.
C. Deposition/Sublimation Phase Transitions in Force Field X
To the best of our knowledge, Force Field X64,82,129 is the first and only implementation of an algorithm that permits calculation of the free energy of the deposition/sublimation phase transition from first principles. This requires two layers of complexity beyond the implementation of AMOEBA in other widely used computer codes such as TINKER,67 OpenMM132 and Amber.65 First, explicit support for space group symmetry and replicates of the unit cell is necessary,64 otherwise the complete annihilation of the crystalline phase becomes untenable. Consider the case of acetamide, which includes 18 copies of the molecule (i.e. the asymmetric unit) within the unit cell and 144 copies within a replicated super cell large enough to allow application of the minimum image convention. The 1 or 2 orders of magnitude increase in the degrees of freedom not only increases the cost of a single energy and gradient evaluation, but also exponentially increases the complexity of the crystalline free energy surface. Second, the crystalline annihilation algorithm in Force Field X depends on a meta-dynamics sampling algorithm in order to escape the large crystalline free energy barriers. Therefore, derivatives of the polarizable multipole AMOEBA potential energy function such as ∂U(λ, X)/∂λ, ∂2U(λ, X)/∂λ2 and ∂2U(λ, X)/∂X∂λ have been implemented not only for P1 unit cells, but also for the asymmetric unit of each of the 230 space groups.
Acknowledgments
We thank Drs. Jeff Blaney and JW Feng for helpful discussions regarding the importance of crystal thermodynamics and solubility to the pharmaceutical industry, which provided the original motivation for this work. Assistance from Dr. Johnny Wu to parameterize the n-alkylamides using the Poltype automated parameterization tool and from Dr. Bartolomeo Civalleri to use the CRYSTAL09 software is gratefully acknowledged. We appreciate a Burroughs Wellcome Fund 2012 Collaborative Research Travel Grant to M.S. to support the collaboration between the Ren and Yang labs. The National Institute of General Medical Sciences (R01GM079686) and the Robert A. Welch Foundation (F-1691) for support (P.R.). This contribution was also supported by grants MCB 0919983 (W.Y.) and AGS-0927944 (J.B.) from the National Science Foundation (NSF) and by grant UL1RR024979 (J.B.) from the National Center for Research Resources, a part of the National Institutes of Health. TeraGrid and the Texas Advanced Computing Center provided high performance computing resources supported by NSF grant number TG-MCB100057.
References
- 1.Blagden N, de Matas M, Gavan PT, York P. Advanced Drug Delivery Reviews. 2007;59:617. doi: 10.1016/j.addr.2007.05.011. [DOI] [PubMed] [Google Scholar]
- 2.James SL. Chemical Society Reviews. 2003;32:276. doi: 10.1039/b200393g. [DOI] [PubMed] [Google Scholar]
- 3.Furukawa H, Ko N, Go YB, Aratani N, Choi SB, Choi E, Yazaydin AO, Snurr RQ, O’Keeffe M, Kim J, Yaghi OM. Science. 2010;329:424. doi: 10.1126/science.1192160. [DOI] [PubMed] [Google Scholar]
- 4.Matsumoto M, Saito S, Ohmine I. Nature. 2002;416:409. doi: 10.1038/416409a. [DOI] [PubMed] [Google Scholar]
- 5.Haisa M, Kashino S, Maeda H. Acta Crystallographica Section B. 1974;30:2510. [Google Scholar]
- 6.Haisa M, Kashino S, Kawai R, Maeda H. Acta Crystallographica Section B. 1976;32:1283. [Google Scholar]
- 7.Wheatley PJ. Journal of the Chemical Society. 1964:6036. [Google Scholar]
- 8.Vishweshwar P, McMahon JA, Oliveira M, Peterson ML, Zaworotko MJ. Journal of the American Chemical Society. 2005;127:16802. doi: 10.1021/ja056455b. [DOI] [PubMed] [Google Scholar]
- 9.Terkeltaub R. Nature Reviews Rheumatology. 2010;6:30. doi: 10.1038/nrrheum.2009.236. [DOI] [PubMed] [Google Scholar]
- 10.Orson WM. The Lancet. 2006;367:333. [Google Scholar]
- 11.Price SL. Physical Chemistry Chemical Physics. 2008;10:1996. doi: 10.1039/b719351c. [DOI] [PubMed] [Google Scholar]
- 12.Day GM, Cooper TG, Cruz-Cabeza AJ, Hejczyk KE, Ammon HL, Boerrigter SXM, Tan JS, Della Valle RG, Venuti E, Jose J, Gadre SR, Desiraju GR, Thakur TS, van Eijck BP, Facelli JC, Bazterra VE, Ferraro MB, Hofmann DWM, Neumann MA, Leusen FJJ, Kendrick J, Price SL, Misquitta AJ, Karamertzanis PG, Welch GWA, Scheraga HA, Arnautova YA, Schmidt MU, van de Streek J, Wolf AK, Schweizer B. Acta Crystallographica Section B. 2009;65:107. doi: 10.1107/S0108768109004066. [DOI] [PubMed] [Google Scholar]
- 13.Karamertzanis PG, Kazantsev AV, Issa N, Welch GWA, Adjiman CS, Pantelides CC, Price SL. Journal of Chemical Theory and Computation. 2009;5:1432. doi: 10.1021/ct8004326. [DOI] [PubMed] [Google Scholar]
- 14.Price SL, Price LS. Solid State Characterization of Pharmaceuticals. John Wiley & Sons, Ltd; 2011. Computational Polymorph Prediction; p. 427. [Google Scholar]
- 15.Jorgensen WL, Duffy EM. Advanced Drug Delivery Reviews. 2002;54:355. doi: 10.1016/s0169-409x(02)00008-x. [DOI] [PubMed] [Google Scholar]
- 16.Lii JH, Allinger NL. Journal of the American Chemical Society. 1989;111:8576. [Google Scholar]
- 17.Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM, Ferguson DM, Spellmeyer DC, Fox T, Caldwell JW, Kollman PA. Journal of the American Chemical Society. 1996;118:2309. [Google Scholar]
- 18.Jorgensen WL, Maxwell DS, Tirado-Rives J. Journal of the American Chemical Society. 1996;118:11225. [Google Scholar]
- 19.MacKerell AD, Bashford D, Bellott M, Dunbrack RL, Evanseck JD, Field MJ, Fischer S, Gao J, Guo H, Ha S, Joseph-McCarthy D, Kuchnir L, Kuczera K, Lau FTK, Mattos C, Michnick S, Ngo T, Nguyen DT, Prodhom B, Reiher WE, Roux B, Schlenkrich M, Smith JC, Stote R, Straub J, Watanabe M, Wiorkiewicz-Kuczera J, Yin D, Karplus M. The Journal of Physical Chemistry B. 1998;102:3586. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
- 20.Jorgensen WL, Tirado-Rives J. Proceedings of the National Academy of Sciences. 2005;102:6665. doi: 10.1073/pnas.0408037102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Stone AJ, Alderton M. Molecular Physics. 1985;56:1047. [Google Scholar]
- 22.Stone AJ. The Theory of Intermoleular Forces. Vol. 32 Clarendon Press; Oxford: 1996. [Google Scholar]
- 23.Stone AJ. Journal of Chemical Theory and Computation. 2005;1:1128. doi: 10.1021/ct050190+. [DOI] [PubMed] [Google Scholar]
- 24.Stone AJ. Science. 2008;321:787. doi: 10.1126/science.1158006. [DOI] [PubMed] [Google Scholar]
- 25.Price SL. Journal of the Chemical Society, Faraday Transactions. 1996;92:2997. [Google Scholar]
- 26.Ouvrard C, Price SL. Crystal Growth & Design. 2004;4:1119. [Google Scholar]
- 27.Civalleri B, Zicovich-Wilson CM, Valenzano L, Ugliengo P. CrystEngComm. 2008;10:405. [Google Scholar]
- 28.Bardwell DA, Adjiman CS, Arnautova YA, Bartashevich E, Boerrigter SXM, Braun DE, Cruz-Cabeza AJ, Day GM, Della Valle RG, Desiraju GR, van Eijck BP, Facelli JC, Ferraro MB, Grillo D, Habgood M, Hofmann DWM, Hofmann F, Jose KVJ, Karamertzanis PG, Kazantsev AV, Kendrick J, Kuleshova LN, Leusen FJJ, Maleev AV, Misquitta AJ, Mohamed S, Needs RJ, Neumann MA, Nikylov D, Orendt AM, Pal R, Pantelides CC, Pickard CJ, Price LS, Price SL, Scheraga HA, van de Streek J, Thakur TS, Tiwari S, Venuti E, Zhitkov IK. Acta Crystallographica Section B. 2011;67:535. doi: 10.1107/S0108768111042868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jorgensen WL, Duffy EM. Bioorganic & Medicinal Chemistry Letters. 2000;10:1155. doi: 10.1016/s0960-894x(00)00172-4. [DOI] [PubMed] [Google Scholar]
- 30.Lüder K, Lindfors L, Westergren J, Nordholm S, Persson R, Pedersen M. Journal of Computational Chemistry. 2009;30:1859. doi: 10.1002/jcc.21173. [DOI] [PubMed] [Google Scholar]
- 31.Yalkowsky SH, Valvani SC. Journal of Pharmaceutical Sciences. 1980;69:912. doi: 10.1002/jps.2600690814. [DOI] [PubMed] [Google Scholar]
- 32.Jain N, Yalkowsky SH. Journal of Pharmaceutical Sciences. 2001;90:234. doi: 10.1002/1520-6017(200102)90:2<234::aid-jps14>3.0.co;2-v. [DOI] [PubMed] [Google Scholar]
- 33.Kühne R, Ebert RU, Kleint F, Schmidt G, Schüürmann G. Chemosphere. 1995;30:2061. [Google Scholar]
- 34.Lipinski CA, Lombardo F, Dominy BW, Feeney PJ. Advanced Drug Delivery Reviews. 1997;23:3. doi: 10.1016/s0169-409x(00)00129-0. [DOI] [PubMed] [Google Scholar]
- 35.Mitchell BE, Jurs PC. Journal of Chemical Information and Computer Sciences. 1998;38:489. doi: 10.1021/ci980029a. [DOI] [PubMed] [Google Scholar]
- 36.Abraham MH, Le J. Journal of Pharmaceutical Sciences. 1999;88:868. doi: 10.1021/js9901007. [DOI] [PubMed] [Google Scholar]
- 37.Huuskonen J. Journal of Chemical Information and Computer Sciences. 2000;40:773. doi: 10.1021/ci9901338. [DOI] [PubMed] [Google Scholar]
- 38.Meylan W, Howard P. Perspectives in Drug Discovery and Design. 2000;19:67. [Google Scholar]
- 39.Klopman G, Zhu H. Journal of Chemical Information and Computer Sciences. 2001;41:439. doi: 10.1021/ci000152d. [DOI] [PubMed] [Google Scholar]
- 40.Liu R, So SS. Journal of Chemical Information and Computer Sciences. 2001;41:1633. doi: 10.1021/ci010289j. [DOI] [PubMed] [Google Scholar]
- 41.McElroy NR, Jurs PC. Journal of Chemical Information and Computer Sciences. 2001;41:1237. doi: 10.1021/ci010035y. [DOI] [PubMed] [Google Scholar]
- 42.Tetko IV, Tanchuk VY, Kasheva TN, Villa AEP. Journal of Chemical Information and Computer Sciences. 2001;41:1488. doi: 10.1021/ci000392t. [DOI] [PubMed] [Google Scholar]
- 43.Yan A, Gasteiger J. Journal of Chemical Information and Computer Sciences. 2002;43:429. doi: 10.1021/ci025590u. [DOI] [PubMed] [Google Scholar]
- 44.Duchowicz PR, Talevi A, Bruno-Blanch LE, Castro EA. Bioorganic & Medicinal Chemistry. 2008;16:7944. doi: 10.1016/j.bmc.2008.07.067. [DOI] [PubMed] [Google Scholar]
- 45.Huuskonen J, Livingstone DJ, Manallack DT. SAR and QSAR in Environmental Research. 2008;19:191. doi: 10.1080/10629360802083855. [DOI] [PubMed] [Google Scholar]
- 46.John SD. Drug Discovery Today. 2005;10:289. doi: 10.1016/S1359-6446(04)03365-3. [DOI] [PubMed] [Google Scholar]
- 47.Wang J, Hou T. Combinatorial Chemistry & High Throughput Screening. 2011;14:328. doi: 10.2174/138620711795508331. [DOI] [PubMed] [Google Scholar]
- 48.Mitchell JBO. Future Medicinal Chemistry. 2011;3:451. doi: 10.4155/fmc.11.11. [DOI] [PubMed] [Google Scholar]
- 49.Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM, Ferguson DM, Spellmeyer DC, Fox T, Caldwell JW, Kollman PA. Journal of the American Chemical Society. 1996;118:2309. [Google Scholar]
- 50.MacKerell AD, Bashford D, Bellott M, Dunbrack RL, Evanseck JD, Field MJ, Fischer S, Gao J, Guo H, Ha S, Joseph-McCarthy D, Kuchnir L, Kuczera K, Lau FTK, Mattos C, Michnick S, Ngo T, Nguyen DT, Prodhom B, Reiher WE, Roux B, Schlenkrich M, Smith JC, Stote R, Straub J, Watanabe M, Wiorkiewicz-Kuczera J, Yin D, Karplus M. Journal of Physical Chemistry B. 1998;102:3586. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
- 51.Ren P, Ponder JW. Journal of Computational Chemistry. 2002;23:1497. doi: 10.1002/jcc.10127. [DOI] [PubMed] [Google Scholar]
- 52.Ponder JW, Wu C, Ren P, Pande VS, Chodera JD, Schnieders MJ, Haque I, Mobley DL, Lambrecht DS, DiStasio RA, Head-Gordon M, Clark GNI, Johnson ME, Head-Gordon T. The Journal of Physical Chemistry B. 2010;114:2549. doi: 10.1021/jp910674d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Ren P, Wu C, Ponder JW. Journal of Chemical Theory and Computation. 2011;7:3143. doi: 10.1021/ct200304d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Kong X, Brooks CL., III The Journal of Chemical Physics. 1996;105:2414. [Google Scholar]
- 55.Laio A, Parrinello M. Proceedings of the National Academy of Sciences. 2002;99:12562. doi: 10.1073/pnas.202427399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zheng L, Chen M, Yang W. Proceedings of the National Academy of Sciences. 2008;105:20227. doi: 10.1073/pnas.0810631106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Zheng L, Chen M, Yang W. The Journal of Chemical Physics. 2009;130:234105. doi: 10.1063/1.3153841. [DOI] [PubMed] [Google Scholar]
- 58.Lee S, Chen M, Yang W, Richards NGJ. Journal of the American Chemical Society. 2010;132:7252. doi: 10.1021/ja101446u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Zwanzig RW. The Journal of Chemical Physics. 1954;22:1420. [Google Scholar]
- 60.Kirkwood JG. The Journal of Chemical Physics. 1935;3:300. [Google Scholar]
- 61.Darden T, York D, Pedersen L. The Journal of Chemical Physics. 1993;98:10089. [Google Scholar]
- 62.Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG. The Journal of Chemical Physics. 1995;103:8577. [Google Scholar]
- 63.Sagui C, Pedersen LG, Darden TA. The Journal of Chemical Physics. 2004;120:73. doi: 10.1063/1.1630791. [DOI] [PubMed] [Google Scholar]
- 64.Schnieders MJ, Fenn TD, Pande VS. Journal of Chemical Theory and Computation. 2011;7:1141. doi: 10.1021/ct100506d. [DOI] [PubMed] [Google Scholar]
- 65.Case DA, Cheatham TE, Darden T, Gohlke H, Luo R, Merz KM, Onufriev A, Simmerling C, Wang B, Woods RJ. Journal of Computational Chemistry. 2005;26:1668. doi: 10.1002/jcc.20290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Brooks BR, Brooks CL, Mackerell AD, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner AR, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor RW, Post CB, Pu JZ, Schaefer M, Tidor B, Venable RM, Woodcock HL, Wu X, Yang W, York DM, Karplus M. Journal of Computational Chemistry. 2009;30:1545. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Ponder JW. TINKER: Software Tools for Molecular Design. 5.0. Saint Louis, MO: p. 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ponder JW, Case DA. Advances in Protein Chemistry; Vol. 66. Academic Press; 2003. Force fields for protein simulations; p. 27. [DOI] [PubMed] [Google Scholar]
- 69.Maple JR, Cao YX, Damm WG, Halgren TA, Kaminski GA, Zhang LY, Friesner RA. Journal of Chemical Theory and Computation. 2005;1:694. doi: 10.1021/ct049855i. [DOI] [PubMed] [Google Scholar]
- 70.Patel S, Brooks CL. Molecular Simulation. 2006;32:231. [Google Scholar]
- 71.Piquemal JP, Chevreau H, Gresh N. Journal of Chemical Theory and Computation. 2007;3:824. doi: 10.1021/ct7000182. [DOI] [PubMed] [Google Scholar]
- 72.Lopes PEM, Roux B, MacKerell AD. Theoretical Chemistry Accounts. 2009;124:11. doi: 10.1007/s00214-009-0617-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Holt A, Karlström G. International Journal of Quantum Chemistry. 2009;109:1255. [Google Scholar]
- 74.Wang J, Cieplak P, Li J, Hou T, Luo R, Duan Y. The Journal of Physical Chemistry B. 2011;115:3091. doi: 10.1021/jp112133g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Wang J, Cieplak P, Li J, Wang J, Cai Q, Hsieh M, Lei H, Luo R, Duan Y. The Journal of Physical Chemistry B. 2011;115:3100. doi: 10.1021/jp1121382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Wu JC, Chattre G, Ren P. Theoretical Chemistry Accounts. 2012 doi: 10.1007/s00214-012-1138-6. (inpress) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Shirts MR, Pitera JW, Swope WC, Pande VS. The Journal of Chemical Physics. 2003;119:5740. [Google Scholar]
- 78.Klimovich P, Mobley D. Journal of Computer-Aided Molecular Design. 2010;24:307. doi: 10.1007/s10822-010-9343-7. [DOI] [PubMed] [Google Scholar]
- 79.Ewald PP. Annalen der Physik. 1921;369:253. [Google Scholar]
- 80.Ren P, Ponder JW. The Journal of Physical Chemistry B. 2003;107:5933. [Google Scholar]
- 81.Smith W. CCP5 Information Quaterly. 1982;4 [Google Scholar]
- 82.Fenn TD, Schnieders MJ. Acta Crystallographica Section D. 2011;67:957. doi: 10.1107/S0907444911039060. [DOI] [PubMed] [Google Scholar]
- 83.Bartels C, Karplus M. Journal of Computational Chemistry. 1997;18:1450. doi: 10.1002/jcc.1137. [DOI] [PubMed] [Google Scholar]
- 84.Darve E, Pohorille A. The Journal of Chemical Physics. 2001;115:9169. [Google Scholar]
- 85.Bitetti-Putzer R, Yang W, Karplus M. Chemical Physics Letters. 2003;377:633. [Google Scholar]
- 86.Swendsen RH, Fasnacht M, Rosenberg JM. Computer Physics Communications. 2005;169:274. [Google Scholar]
- 87.Carter EA, Ciccotti G, Hynes JT, Kapral R. Chemical Physics Letters. 1989;156:472. [Google Scholar]
- 88.Wang F, Landau DP. Physical Review Letters. 2001;86:2050. doi: 10.1103/PhysRevLett.86.2050. [DOI] [PubMed] [Google Scholar]
- 89.Kaminsky A. Parallel Java: A unified API for shared memory and cluster parallel programming in 100% Java “. Parallel and Distributed Processing Symposium; 2007; Long Beach, CA. [Google Scholar]
- 90.Ren P, Ponder JW. The Journal of Physical Chemistry B. 2004;108:13427. [Google Scholar]
- 91.Grossfield A, Ren PY, Ponder JW. Journal of the American Chemical Society. 2003;125:15671. doi: 10.1021/ja037005r. [DOI] [PubMed] [Google Scholar]
- 92.Jiao D, Golubkov PA, Darden TA, Ren P. Proceedings of the National Academy of Sciences. 2008;105:6290. doi: 10.1073/pnas.0711686105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Halgren TA. Journal of the American Chemical Society. 1992;114:7827. [Google Scholar]
- 94.Thole BT. Chemical Physics. 1981;59:341. [Google Scholar]
- 95.Jeffrey GA, Ruble JR, McMullan RK, DeFrees DJ, Pople JA. Acta Crystallographica Section B. 1981;37:1885. [Google Scholar]
- 96.Dovesi R, Orlando R, Civalleri B, Roetti C, Saunders VR, Zicovich-Wilson CM. Zeitschrift für Kristallographie. 2005;220:571. [Google Scholar]
- 97.Dovesi R, Saunders VR, Roetti R, Orlando R, Zicovich-Wilson CM, Pascale F, Civalleri B, Doll K, Harrison NM, Bush IJ, D’Arco P, Llunell M. CRYSTAL09 User’s Manual. University of Torino; Torino: 2009. CRYSTAL09 User’s Manual. [Google Scholar]
- 98.Kitano M, Kuchitsu K. Bulletin of the Chemical Society of Japan. 1973;46:3048. [Google Scholar]
- 99.Ottersen T. Acta Chemica Scandinavica. 1975;A29:939. [PubMed] [Google Scholar]
- 100.Zobel D, Luger P, Dreissig W, Koritsanszky T. Acta Crystallographica Section B. 1992;48:837. [Google Scholar]
- 101.Denne WA, Small RWH. Acta Crystallographica Section B. 1971;27:1094. [Google Scholar]
- 102.Hamilton W. Acta Crystallographica. 1965;18:866. [Google Scholar]
- 103.Lee C, Yang W, Parr RG. Physical Review B. 1988;37:785. doi: 10.1103/physrevb.37.785. [DOI] [PubMed] [Google Scholar]
- 104.Becke AD. The Journal of Chemical Physics. 1993;98:5648. [Google Scholar]
- 105.Monkhorst HJ, Pack JD. Physical Review B. 1976;13:5188. [Google Scholar]
- 106.Civalleri B, D’Arco P, Orlando R, Saunders VR, Dovesi R. Chemical Physics Letters. 2001;348:131. [Google Scholar]
- 107.Doll K. Computer Physics Communications. 2001;137:74. [Google Scholar]
- 108.Doll K, Saunders VR, Harrison NM. International Journal of Quantum Chemistry. 2001;82:1. [Google Scholar]
- 109.Shanno DF. Mathematics of Computation. 1970;24:647. [Google Scholar]
- 110.Goldfarb D. Mathematics of Computation. 1970;24:23. [Google Scholar]
- 111.Fletcher R. The Computer Journal. 1970;13:317. [Google Scholar]
- 112.Broyden CG. IMA Journal of Applied Mathematics. 1970;6:76. [Google Scholar]
- 113.Holst M, Saied F. Journal of Computational Chemistry. 1993;14:105. [Google Scholar]
- 114.Gatti C, Saunders VR, Roetti C. The Journal of Chemical Physics. 1994;101:10686. [Google Scholar]
- 115.Krishnan R, Binkley JS, Seeger R, Pople JA. The Journal of Chemical Physics. 1980;72:650. [Google Scholar]
- 116.Boys SF, Bernardi F. Molecular Physics. 1970;19:553. [Google Scholar]
- 117.Berendsen HJC, Postma JPM, Gunsteren WFv, DiNola A, Haak JR. Journal of Chemical Physics. 1984;81:3684. [Google Scholar]
- 118.Schmid R, Miah AM, Sapunov VN. Physical Chemistry Chemical Physics. 2000;2:97. [Google Scholar]
- 119.Friedman HL, Krishnan CV. Thermodynamics of Ion Hydration. In: Franks F, editor. Water: A Comprehensive Treatise; Vol. 3 Plenum Press; New York: 1973. [Google Scholar]
- 120.Tissandier MD, Cowen KA, Feng WY, Gundlach E, Cohen MH, Earhart AD, Coe JV, Turtle TR. The Journal of Physical Chemistry A. 1998;102:7787. [Google Scholar]
- 121.Turner JD, Lingafelter EC. Acta Crystallographica. 1955;8:549. [Google Scholar]
- 122.Usanmaz A, Adler G. Acta Crystallographica Section B. 1982;38:660. [Google Scholar]
- 123.Kopylovich MN, Kukushkin VY, Haukka M, Fraústo da Silva JJR, Pombeiro AJL. Inorganic Chemistry. 2002;41:4798. doi: 10.1021/ic0256720. [DOI] [PubMed] [Google Scholar]
- 124.Lewis TC, Tocher DA. Acta Crystallographica Section E. 2005;61:1985. [Google Scholar]
- 125.Yalkowsky SH, He Y. Handbook of Aqueous Solubility Data; CRC Press; 2003. [Google Scholar]
- 126.Wolfenden R, Andersson L, Cullis PM, Southgate CCB. Biochemistry. 1981;20:849. doi: 10.1021/bi00507a030. [DOI] [PubMed] [Google Scholar]
- 127.Romero C, González M. Journal of Thermal Analysis and Calorimetry. 2008;92:705. [Google Scholar]
- 128.Romero CM, Gonzalez ME. Journal of Chemical & Engineering Data. 2009;55:2326. [Google Scholar]
- 129.Fenn TD, Schnieders MJ, Brunger AT. Acta Crystallographica Section D. 2010;66:1024. doi: 10.1107/S0907444910031045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Schnieders MJ, Fenn TD, Pande VS, Brunger AT. Acta Crystallographica Section D. 2009;65:952. doi: 10.1107/S0907444909022707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Barducci A, Bonomi M, Parrinello M. Wiley Interdisciplinary Reviews: Computational Molecular Science. 2011;1:826. [Google Scholar]
- 132.Eastman P, Pande VS. Journal of Computational Chemistry. 2010;31:1268. doi: 10.1002/jcc.21413. [DOI] [PMC free article] [PubMed] [Google Scholar]