Abstract
We incorporate nuclear quantum effects (NQE) in condensed matter simulations by introducing short-range neural network () corrections to the ab initio fitted molecular force field ARROW. Force field corrections are fitted to average interaction energies and forces of molecular dimers, which are simulated using the Path Integral Molecular Dynamics (PIMD) technique with restrained centroid positions. The -corrected force field allows reproduction of the NQE for computed liquid water and methane properties such as density, radial distribution function (RDF), heat of evaporation (HVAP), and solvation free energy. Accounting for NQE through molecular force field corrections circumvents the need for explicit computationally expensive PIMD simulations in accurate calculations of the properties of chemical and biological systems. The accuracy and locality of pairwise NQE corrections indicate that this approach could be applicable to complex heterogeneous systems, such as proteins.
Graphical Abstract

INTRODUCTION
All-atom molecular simulations are readily accessible and provide a very detailed description of chemical and biological systems, allowing computation of quantities of interest such as ligand binding free energies and rates of chemical reactions. The practical usefulness of molecular simulations is often limited by the insufficient accuracy of model Hamiltonians describing complex atomic interactions. One of the key ingredients toward achieving this accuracy is including nuclear quantum effects (NQE).1–4 The traditional path integral approach has a significant computational cost compared with standard molecular dynamics. As a result, various methods have been developed, for example, in refs 5–9, to reduce the overhead of modeling NQEs to classical molecular dynamics (MD). In this work, we present a method to encapsulate some of the NQE in an effective intermolecular Hamiltonian term, encoded by a neural network, thus enabling more accurate predictions without the Path Integral Molecular Dynamics (PIMD) overhead. The required local corrections to the intermolecular interaction energies can be learned from PIMD calculations of dimers only. Combined with an accurate ab initio force field,10,11 the effective hybrid Hamiltonian produces accurate predictions of condensed matter properties from a classical MD simulation.
Recently, there has been a resurgence of models that are derived from ab initio quantum computations, using analytical expressions,10 neural networks,12,13 or both.11 Molecular force fields (FF) derived from ab initio quantum chemical data describe molecular system potential energy surfaces (PES) with very high fidelity.10,14–18 However, to reliably compute condensed matter properties in statistical simulations with accurate FFs, NQE must be taken into account1–3 explicitly. (Most popular molecular force fields account for NQE implicitly as their parameters are fitted to reproduce experimental observables such as liquid density, heat of evaporation (HVAP), and solvation free energies.) For example, the errors of computed hydration free energy values for common neutral solutes are reduced from 0.8 to 0.22 kcal/mol when the PIMD technique was used in calculations with the ARROW FF.3,10
NQE are usually modeled via the Path Integral Molecular Dynamics (PIMD) technique, which is significantly computationally more expensive than classical MD. The PIMD method models the molecular system via multiple replicas, with each atom represented by a necklace of beads connected by harmonic springs (ring-polymer). While up to 128 replicas are needed for convergence of NQE for some thermodynamic properties such as heat capacity, 8 replicas are usually sufficient to capture most of NQE for properties dependent mostly on intermolecular energies, such as solvation and binding free energies.3,10,19,20 At room temperature, the dynamics of molecular systems are close to the classical limit, and beads representing atoms in PIMD simulations are localized in small volumes of across (corresponding to quantum uncertainty in atom positions). We expect that approximating the statistics of the full bead necklaces by those of the centroid may provide an accurate description of the statistical properties of ring-polymers and NQE effects.
Many methods have been developed to encompass NQE while preserving the classical description of molecular dynamics. One of these is the centroid molecular dynamics (CMD) method devised by Voth and his co-workers.21,22 In this approach, the dynamics of a molecular system are represented by the motion of atom centroids in the effective mean-field potential created by ring-polymer beads.5 Computing the effective centroid forces in each point of a CMD trajectory is expensive, and thus several approximate techniques have been developed to speed up the calculations.23,24 Another approach to account for NQE in classical molecular dynamics simulations involves the Generalized Langevin Equation (GLE)-based frequency-dependent thermostats,25,26 perturbed path-integral approximations,27 ring-polymer contraction,5 and multiple time stepping.28
Machine learning techniques are increasingly used to describe molecular potential surfaces.29–35 Neural networks are able to accurately fit very complex multidimensional functions and are thus uniquely suited to describe the intricate dependence of molecular energy on atomic coordinates. potentials were used successfully to derive coarse-grained potentials36,37 that described the free energy of molecular systems in the space of reduced coordinates. A derivation of an effective centroid energy functional can also be considered as a coarse-graining of ring-polymer PIMD simulations. Coarse-graining theory of PIMD simulations has been recently reported in refs 38,39. Derivations of effective centroid potentials for chemical systems using analytical functions and neural network based on atom-centered descriptors have been reported previously in8,9,24,39,40
Here, we focus on an accurate description of intermolecular interactions that play a dominant role in determining properties of interest in biochemistry such as ligand binding free energies and solvation free energies. We expect the PIMD intermolecular interactions to be numerically close to intermolecular energies evaluated in centroid coordinates, with corrections rapidly decaying with increasing atom–atom distances. To fit NQE corrections to intermolecular energies, we used a neural network () potential correction defined with descriptors that describe the relative geometries of intermolecular atomic contacts.
Because of the localized nature of NQE, we hypothesize that corrections to intermolecular energies and forces, trained on PIMD simulations of molecular dimers, can accurately model NQE. Atom-centered machine learning methods achieved a good transferability of quantum corrections to centroid forces across phases9 or learning them using very short distance cutoffs.40 Intermolecular interactions are relatively small perturbations of total energies of molecular systems; therefore, we expect that NQE in intermolecular interactions can be adequately described with even more local descriptors than those used for total potential energies and forces.
Below we describe a derivation of intermolecular NQE corrections to the ARROW FF for liquid water and methane. Water was chosen because of its obvious importance and the large amount of literature on its NQE and their modeling. However, in water, many of the NQE offset and partially cancel each other.1,41 We therefore also model methane because its NQE are fairly straightforward and also because many of its NQE come from the change of behavior of bonded interactions, which our technique does not explicitly embrace. We were curious if an effective intermolecular Hamiltonian can reproduce the energetic effects of an intramolecular perturbation. As NQE in both systems are significant as was shown in the previous studies,3,15 the two liquids provide a good proof of principle.
The parameters of ARROW FF are fitted exclusively to high-level ab initio quantum chemical data for monomers, dimers, and multimers of small molecules. We previously reported3 that the ARROW FF allowed accurate calculations of solvation free energies of neutral molecules, provided NQE are taken into account by PIMD calculations using at least 8 replicas. We now show that corrections to the ARROW FF accurately reproduce the NQE in calculations of the structural and thermodynamic properties of liquid water and methane. corrections allow the construction of a molecular FF from high-level quantum chemical data, which is inclusive of NQE and does not require PIMD calculations for accurate predictions of properties of interest. While the ARROW FF is polarizable and a multipolar force field, we note that it may not capture small effects related to electronic degrees of freedom.42–44
METHODS
An encapsulation of PIMD corrections in a neural network has been discussed recently,39 and so it is worthwhile to explain why we choose to do this in an alternative manner. First and foremost, we recently published11 a hybrid -analytical model of intermolecular interactions, parametrized from first-principles, which achieves chemical accuracy for prediction of properties of diverse molecular systems. The structure of the term and the choices made in designing it are described fully there. In short, the construction is computationally economical, naturally aligns with the pairwise force formulation of our FF model, and can be enabled in small subsets of the chemical interaction space. The current work is an application of the same techniques to encapsulate NQE and its specific purpose is to be consistent with our overall methodology. Focusing on intermolecular interactions was a necessary and conscious choice for Illarionov et al., and in this work, we want to see if the same choice of simulation mechanisms can be applied to encoding PIMD in a classical effective Hamiltonian. A diagram of the interaction model is shown in Figure 1, along with a schematic of the neural network that computes the NQE correction.
Figure 1.

(a) Diagram of the intermolecular interaction fingerprint. and are the interacting atoms, is located at the midpoint between and , and the atom pair symmetry functions (APSF) are atomic symmetry functions centered at . The location of automatically symmetrizes the construction with respect to the permutation. The APSF summations (blue dashed lines) are over neighborhoods of both and . The fingerprint is fed into an () specific neural network to produce an energy correction to the ARROW interaction energy between and . (b) Diagram of the neural network term for pair interactions between two water molecules. Each pair interaction fingerprint is fed into its corresponding trained neural network (e.g., HH, HO, and OO in this case). The output is then smoothly truncated by a TensorFlow -layer to zero beyond a cutoff distance. All of the individual pair energy contributions are then summed to a final dimer energy correction , which is then added to the energy output by the analytical force field to produce the total energy.
Spherical Bessel () descriptors were assigned to intermolecular atom pairs. Here, a pair between the oxygen atoms of water () and acetaldehyde () is shown. is the midpoint of the atom pair, which is the reference point for computed descriptors. descriptors involve a sum over atoms A and B and their immediate surroundings (atoms covalently bound to and and if or has only one bond, the next layer of covalent neighbors is included). Feedforward having descriptors as the input layer compute the contribution of atom pairs to the intermolecular energy NQE corrections.
The structure of the neural network is illustrated in Figure 1b. We constructed the using the Keras TensorFlow functional API that predicts changes in the intermolecular energies and forces of the molecular dimers due to ( and ). The component of the intermolecular interaction energy is decomposed into a sum of interactions between pairs of atoms () with each atom of the pair residing on a different monomer and having a separation distance of less than .
| (1) |
The interaction center of the pair is chosen to be at the geometric midpoint between and . This symmetrizes the construction to the permutation and permits the use of atomic pair symmetry functions (APSF) describing atom pair geometry centered at . Figure 1 shows a diagram of the APSF function for the intermolecular atom pair . The specific basis we chose is Spherical Bessel () descriptors.45,46
| (2) |
where , .
The atomic set for the convolution (, indices in (2)) includes atoms covalently bonded to atoms A and B and the next covalent neighbors if or is a terminal atom. We employ 15 descriptors (reverse distance descriptor) for each interatomic contact. This number of descriptors was chosen to create a balance between accurately describing pair geometries and avoiding overtraining for the systems considered (the MAE of the training set is similar to that of the validation set).
The model for the atom pair intramolecular energy contribution was composed of 3 layers: 2 dense layers with 20 nodes each and the tanh activation function and a -layer that smoothly scales the correction to zero at interatomic contacts greater than .
| (3) |
where is a vector of atom pair descriptors eq 2; , , and are the matrix of weights, biasing vector, and activation functions (tanh) for the layer of the ; is the atom–atom distance of the atom pair; and is the scaling layer cutoff distance () and the distance decay parameter .
Atomic forces due to corrections were computed using an automatic differentiation utility provided by the Tensor-Flow library and analytical derivatives of descriptors
| (4) |
where is a Cartesian coordinate of atom that is involved in descriptor described by eq 2.
Computing Changes of Dimer Intermolecular Energies and Forces due to NQE.
Ten thousand dimers were sampled from PIMD R8 (8 replicas) trajectories of liquid water in the ARROW FF at 298 K. For liquid methane, 2400 dimers were sampled from PIMD R8 trajectories at 120 K. The dimers were chosen to have an intermolecular separation of less than .
Figure S1 shows the distribution of the closest intermolecular distances for water and methane dimers used in calculations. For each molecular dimer, we performed PIMD R8 calculations where the initial dimer atom coordinates were used as positional restraints for atom centroids. A rigid force constant of 1000 kcal/mol/ was used for centroid restraints to ensure that the average centroid coordinates were close to the atom coordinates of the reference dimers. We ran 1 ns PIMD simulations for each water dimer and corresponding monomers. 5 ns PIMD simulations were run for methane dimers and monomers. Averaging interaction energies and forces along PIMD trajectories while keeping atom centroids restrained allows us to obtain effective changes of intermolecular interactions due to NQE for the given dimer coordinates.
For each dimer, we computed changes of intermolecular energies and forces due to and with the following procedure:
- PIMD intermolecular energy and forces of the dimer are computed by subtracting averaged PIMD potential energies and forces (including the bead spring term which arises from the quantum kinetic energy operator) of the dimer and averaged PIMD potential energies and forces of the monomers with restrained centroid coordinates
(5) (6) - Reference ARROW FF intermolecular energy and forces of the dimer are computed by subtracting FF energies and forces of the dimer and those of monomers at averaged centroid coordinates in PIMD calculations
(7) (8) - Changes of intermolecular energies and forces due to NQE were then computed as
(9) (10)
We thus isolated the NQE corrections to intermolecular dimer energies and forces, which were then fitted by a .
The dimer data set was split into training and validation sets (80/20%). We did not use a separate “set aside” test set, as the final quality of the model was evaluated on the computed bulk properties of liquid water and methane. training used the weighted mean square error (MSE) loss function for predicted NQE corrections to intermolecular energies and atomic forces.
| (11) |
Here, is the relative weight of intermolecular forces in the loss function. In this work, , see Figure S7 in the Supporting Information (SI) for more details on choice.
The Adam stochastic optimizer was used with a batch size of 256 and a patience value of 300, in the optimization of the loss function via the back-propagation algorithm.
MD Simulations Details and Liquid Properties Calculations.
To perform condensed matter simulations, we implemented calculations of NQE corrections to the ARROW FF in our ARBALEST program.3,10,18,47 corrections to energies and forces are computed for each water dimer within a cutoff. descriptor computations were found to be the most expensive step of calculations of corrections and were implemented to run on NVIDIA GPUs using CUDA libraries. The intermolecular energies and forces were also computed on the GPUs, while the intramolecular terms and integration of the equations of motion were processed on the CPUs.
Boxes of water or methane of sizes were simulated. The Particle-Mesh Ewald (PME) algorithm48 was used to compute the electrostatic interactions. A grid size mesh of and a fifth power spline interpolation order was used to compute the inverse PME sum. Nine cutoffs were used to compute the direct PME sum as well as the exchange and dispersion interactions. Bulk corrections were applied to account for long-distance dispersion interactions. Solvation free energies were computed by decoupling the interactions between the solute and the solvent molecules with the Bennett acceptance ratio (BAR)49 method. Electrostatic, exchange-repulsion, and dispersion interactions were switched off simultaneously using a lambda-dependent scaling (a scaling factor power of 2 was used) and a soft-coring algorithm (maximal soft-coring radius , soft-coring radius scaling factor power = 1). Eleven equally spaced points were used to decouple the solute–solvent intermolecular interactions.
Energy minimization (1000 steps) using the steepest descent algorithm was initially performed on the simulated systems. This was followed by a 50 ps equilibration and a 1 ns production run in the isothermal–isobaric ensemble (NPT). The temperature was maintained at 298 K (for water) and 120 K (for methane) using a Langevin thermostat50 (relaxation time = 0.1 ps). The pressure was maintained at 1 atm with a Berendsen barostat51 (relaxation time = 0.5 ps). A Multiple Time Step (MTS) algorithm52 was used to integrate the equations of motion both for the classical and path-integral runs. Bonded and PIMD53,54 bead interactions were integrated with a time step of 0.125 fs, while intermolecular interactions were integrated with a time step of 2 fs. The simulation protocol was the same for the ARROW and ARROW calculations.
RESULTS AND DISCUSSION
First, we found that NQE induce significant alterations in intermolecular interactions within water dimers, especially when the dimers are in close proximity. These alterations manifest as substantial changes in energies, reaching up to 1 kcal/mol, and forces, with deviations of up to 7 kcal/mol/ (see Figure 2a,b). As the distance between the molecules increases, the influence of the NQE rapidly diminish.
Figure 2.

NQE (PIMD-MD) in intermolecular energies eq 9 (a) and forces eq 10 (b) of water dimers computed by PIMD R8 (green) and corrected by (red) as a function of the closest distance between water monomers (the validation set). corrections to dimer intermolecular energies (c) and forces (d) vs NQE changes. NQE corrections of intermolecular interactions of water dimers rapidly decay with intermolecular distance and are accurately reproduced by based on atom pair descriptors.
corrections are detailed in the Methods section are able to accurately fit changes of intermolecular interactions due to NQE, as evidenced by the data presented in Figure 2a–d. The MAE of the fitted to NQE corrections to intermolecular energies was around 0.03 kcal/mol for the training set and 0.04 kcal/mol for the validation set. The convergence of the MAE values for the intermolecular energy fit for one of the optimization runs is depicted in Figure S2 in the Supporting Information.
For the fit to forces, the MAE was consistent at 0.11 kcal/mol/ for both the training and the validation sets. Notably, when the fitting process exclusively utilized energies , the MAE for the -corrected intermolecular forces increased by approximately 3-fold, reaching 0.3 kcal/mol/ (see Figure S7 in the SI).
The outcomes of computations of bulk properties of water with the ARROW FF, enhanced by corrections that account for NQE in intermolecular interactions, are displayed in Figure 3. PIMD calculations of water with the ARROW FF reveal minor variations of water density that are non-monotonic with the number of PIMD replicas utilized (Figure 3a). NQE corrections to the FF, derived from dimer PIMD calculations utilizing 8 and 32 replicas, yield results that closely align with the water density derived from bulk PIMD simulations with the corresponding number of replicas (1.029 g/mL compared to 1.026 g/mL for R8, and 1.023 vs 1.020 g/mL for R32).
Figure 3.

Water properties at , computed with ARROW FF (MD (blue), PIMD with 4, 8, 16, and 32 replicas (lime to dark green)) and with ARROW FF + NN PIMD corrections fit to PIMD R8 (salmon) and PIMD R32 (red) dimer calculations (a) Density. (b) O–O RDF. (c) Evaporation enthalpy (HVAP). (d) Solvation free energy. Computed O–O RDF, HVAP, and solvation changes due to NQE are accurately reproduced by NQE corrections to the ARROW FF.
It is worth emphasizing that the ARROW FF was fitted to quantum chemical data and its parameters were not fine-tuned to replicate water density, unlike the typical approach in widely adopted force fields. In this particular study, we focus on the description of NQE in intermolecular interactions.
In a recent study,55 we effectively utilized to improve the fitting of quantum chemical data. Consequently, the resultant ARROW force field demonstrates a heightened ability to accurately reproduce the experimental water density and various other properties of liquids, surpassing the capabilities of the original ARROW FF.
In Figure 3b, the water oxygen–oxygen RDF is depicted. The RDF is calculated using the ARROW force field (FF) with MD and PIMD simulations employing 8 replicas and 32 Replicas. Additionally, the RDF was obtained using the ARROW FF with NQE corrections. NQE corrections were fitted to a combination of energies and forces . The dependence of the MAE of NQE corrections of energies and forces on the parameter is shown in Figure S7. The corrections successfully capture the alterations in the oxygen–oxygen RDF stemming from NQE. Additional RDF’s for the O–H and H–H pairs in water are shown in Figures S3–S4 in the SI. O–H and H–H RDFs are not reproduced as well as the O–O RDF, with the RDF obtained in direct PIMD calculations showing much lower and wider peaks than in MD simulations with NQE corrections. Here, the limitations of our approach are apparent, as quantum delocalization of hydrogen atoms does not appear directly in MD simulations with NQE corrections, but is only reflected indirectly in the statistics of atom centroids.
In Figure 3c,d, the computed results for NQE in the water enthalpy of evaporation (HVAP) and hydration free energy under standard conditions ( and ) are presented. The computations were carried out using various setups: the ARROW FF in conjunction with MD, PIMD simulations employing 4, 8, 16, and 32 replicas, and the ARROW FF augmented with NQE corrections fitted to dimer PIMD calculations using 8 and 32 replicas. Fairly large NQE changes in HVAP ∼ 1.05 kcal/mol and in solvation free energy 0.7 kcal/mol of this water model are correctly reproduced by MD -corrected FF. The RMS deviations of the computed HVAP and solvation free energy values computed with NQE corrections obtained in multiple independent fits were about 0.03 and 0.06 kcal/mol, respectively.
Remarkably, the calculations using the augmented ARROW FF with corrections (fitted using PIMD R32 calculations of water dimers) closely reproduce the PIMD R32 ARROW FF results with an accuracy of approximately 0.03 kcal/mol. This deviation is notably smaller than the difference of 0.08 kcal/mol between HVAP values computed by using PIMD R8 and R32 (10.88 and 10.96 kcal/mol, respectively). Therefore, it can be inferred that simulations utilizing corrections derived from water dimer PIMD calculations with a higher number of replicas (such as 32) are expected to yield greater accuracy compared with direct PIMD calculations involving a lower number of replicas (such as 8).
To explore the performance of NQE interactions for other water FFs, we fittted corrections running restrained centroid PIMD calculations for the same 10 000 set of water dimers using a simple flexible TIP3P-like water model. Some computational results using fitted corrections for this water model are presented in Figure S8 in the SI. The accuracy with which NQE corrections for the TIP3P-like water model reproduce results of direct bulk PIMD calculations is similar to the results obtained for ARROW FF water. We can conclude from this fact that the accuracy of NQE corrections is likely to be weakly dependent on the underlying FF model.
In order to validate the applicability of our method across various systems, we extended our computations to liquid methane at a temperature of 120 K. Our earlier study3 demonstrated pronounced NQE in this particular system, leading to approximately a 5% alteration in equilibrium density, HVAP, and solvation free energies.
As shown in Figure 4a–d, the results underscore that the NQE corrections to methane dimer intermolecular energies and forces are approximately 5 times smaller in magnitude compared with their counterparts in water dimers. The corrections amount to a maximum of 0.15 kcal/mol for energies and 1.5 kcal/mol/ for forces. Furthermore, achieving convergence in the average intermolecular PIMD energies and forces required extended simulation times for methane dimers. Specifically, 5 ns long trajectories were necessary for methane, whereas only 1 ns was sufficient for water dimers to attain convergence.
Figure 4.

NQE (PIMD-MD) changes in intermolecular energies (a) and forces (b) of methane dimers computed by PIMD R8 (green) and corrected by (red) as a function of the closest distance between methane monomers (validation set). NN NQE corrections to intermolecular energies (c) and forces (d) of methane dimers vs NQE changes.
An illustration of the computational outcomes of the NQE on the properties of liquid methane at 120 K is shown in Figure 5. As seen in Figure 5a, using PIMD R8, there is a noticeable decrease in the density of liquid methane due to NQE, going from 0.437 to 0.416 g/mL. As the number of PIMD replicas increases to 16 and 32, the density further reduces to 0.413 and 0.411 g/mL, respectively, though with a diminishing rate of change. Methane density computed with the ARROW FF, enhanced with NQE corrections based on PIMD R8 dimer data, aligns well with the densities deduced from PIMD for matching number of replicas (0.417 vs 0.416 g/mL for R8 and 0.413 vs 0.410 for R32). Figure 5b demonstrates the effects of NQE on methane’s C–C RDF. Additional RDFs for the C–H and H–H pairs in methane are shown in Figures S5–S6 in the SI. As in the case of the water model, NQE effects on RDF involving hydrogen atoms are not fully reproduced in MD calculations with NQE corrections, while heavy atom RDFs are fairly accurate.
Figure 5.

Liquid methane properties at and computed with ARROW FF (MD) (blue), PIMD 4, 8, 16, and 32 replicas (lime-dark green), and with ARROW FF + NN NQE corrections fitted to dimer PIMD calculations with 8 (salmon) and 32 (red) replicas. (a) Density. (b) C–C RDF NN. (c) evaporation enthalpy (HVAP). (d) Solvation free energy. Ca. 5–6% changes in density, HVAP, and solvation due to NQE are correctly reproduced by MD simulations with ARROW FF + NN correction.
As can be seen in Figure 5c,d for liquid methane, HVAP and solvation free energy changes due to NQE are reproduced by calculations with ARROW FF augmented by NQE corrections. Average HVAP and solvation free energy values computed over multiple independently fitted corrections give a better accuracy of reproducing results of direct PIMD calculations, while results given by individual differ with RMS of ∼0.02 kcal/mol for HVAP and ∼0.03 kcal/mol for solvation free energies (shown by error bars in Figure 5a,c,d).
Calculations of NQE based on atom pair descriptors have been implemented in the ARBALEST program10,18,47 to run on GPUs using CUDA and TensorFlow56 libraries. With the chosen parameters of descriptors, calculations with the ARROW FF augmented by NQE corrections were about 3 times slower than calculations with the ARROW FF without . Specifically, the computation speed for of a water or methane box using a single GTX 1080-TI GPU was ∼1 ns/h for the ARROW FF and ∼0.35 ns/h for the ARROW FF with NQE. This makes the method more computationally efficient than doing direct PIMD calculations with this FF.
CONCLUSIONS
We showed that adjustments to the intermolecular energies and forces to the QM parametrized force field ARROW can accurately replicate NQE when calculating bulk properties of liquid water and methane in classical (1-bead) MD simulations. These corrections employ Spherical Bessel descriptors linked to atomic pairs between interacting molecules and are trained on averaged intermolecular energies and centroid forces obtained in PIMD (RPMD) calculations of molecular dimers.
The method we have discussed shares several similarities with the techniques presented in refs 9,38–40, which employed the atom-centered descriptors and different types of neural network models to fit effective forces acting on centroids in CMD and RPMD calculations. It was shown that effective potentials reproduced NQE effects for properties such as RDF, diffusion constants, pressure, and vibrational spectra. The authors of ref 40 emphasized the local nature of NQE corrections that allowed them to reduce the size of water clusters used in training to 7 water molecules, while ∼30 water molecule clusters were used in ref 39. In contrast to the cited works, we fitted corrections only to intermolecular energies and forces of molecular dimers, resulting in a more localized description of NQE effects. We also focus on NQE for properties that depend mostly on intermolecular interactions such as solvation and binding free energies that are focal points in drug development and material design. Given that these corrections provide a faithful representation of NQE for the analyzed properties, the technique could be suitably extended to complex heterogeneous systems like proteins. Here, corrections can be individually fitted for the different interacting chemical species.
The procedure we described has several significant limitations. First, it is only applicable at or around a certain temperature, and the correction Hamiltonian will need to be retrained if a different temperature is desired. Second, as we employ intermolecular corrections only, the NQE inherently produced by intramolecular degrees of freedom, such as heat capacity, will not be properly reproduced. We note, however, that some intramolecular effects, such as increased fluctuation of bond lengths, are picked up by the effective intermolecular correction Hamiltonian. Third, centroid approximations are known to have difficulties at low temperatures,57 and this correction inherits the deficiency. Fourth, as the statistics of centroid fluctuations are an approximation of the fluctuations of full path-integral replicas, the method is subject to the inaccuracies of this approximation. Fifth, the momentum distribution in the model will still remain Maxwell–Boltzmann, so properties dependent on proper quantum momentum could not be accurately computed. Nonetheless, given the limitations stated above, the method does offer a way to get, after some training cost, very accurate NQE corrections for observables of interest (Figures 3 and 5) at temperatures of importance to many biological and chemical applications at MD computational expense. Additionally, corrections can be trained on dimer conformations only, which enables the tractability of training for diverse and heterogeneous systems.
Recently, we extended the ARROW FF with corrections targeting intermolecular interactions and employing the same atom pair-centered Spherical Bessel descriptors described above.55 Corrections to the ARROW FF provided a significant improvement in the description of strong intermolecular interactions involving charged species and allowed very accurate calculations of ion solvation energetics and ligand binding free energies. By systematic refining of the ARROW to incorporate NQE with the approach described in this work, we anticipate a quantitative description of complex chemical phenomena, such as protein–ligand binding and protonation equilibrium in classical MD simulations, that so far were not sufficiently reliable due to the deficiencies of the force fields used.
Supplementary Material
ACKNOWLEDGMENTS
The authors thank Asher Nathan, Robert Harrow, Jeffrey Kletzel, Paul T. Marinelli, and InterX and NeoTX teams for their support. They also thank Dr. Oleg Khoruzhii for his key role in the InterX research program.
Funding
Funding for the research of the manuscript has been provided by InterX, Inc. (a fully owned subsidiary of NeoTX Holdings Ltd.)
ABBREVIATIONS
- MD
molecular dynamics
- PIMD
path integral molecular dynamics
- NQE
nuclear quantum effects
- FF
force field
- NN
neural network
- RDF
radial distribution function
- HVAP
enthalpy of vaporization
Footnotes
ASSOCIATED CONTENT
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.3c00921.
Force field description and quantum chemical method (Figures S1–S8) (PDF)
Complete contact information is available at: https://pubs.acs.org/10.1021/acs.jctc.3c00921
The authors declare no competing financial interest.
InterX, Inc. is a fully owned subsidiary of NeoTX Holdings Ltd., Rehovot 7670202, Israel.
Contributor Information
Igor V. Kurnikov, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States
Leonid Pereyaslavets, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Ganesh Kamath, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Serzhan N. Sakipov, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States
Ekaterina Voronina, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Oleg Butin, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Alexey Illarionov, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Igor Leontyev, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Grzegorz Nawrocki, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Mikhail Darkhovskiy, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Michael Olevanov, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Ilya Ivahnenko, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
YuChun Chen, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
Christopher B. Lock, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States; Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California 94304, United States
Michael Levitt, Department of Structural Biology, Stanford University School of Medicine, Stanford, California 94305, United States.
Roger D. Kornberg, Department of Structural Biology, Stanford University School of Medicine, Stanford, California 94305, United States
Boris Fain, InterX Inc., (a Subsidiary of NeoTX Therapeutics Ltd.), Berkeley, California 94710, United States.
REFERENCES
- (1).Paesani F; Voth GA The Properties of Water: Insights from Quantum Simulations. J. Phys. Chem. B 2009, 113 (17), 5702–5719. [DOI] [PubMed] [Google Scholar]
- (2).Ceriotti M; Cuny J; Parrinello M; Manolopoulos DE Nuclear Quantum Effects and Hydrogen Bond Fluctuations in Water. Proc. Natl. Acad. Sci. U.S.A 2013, 110 (39), 15591–15596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (3).Pereyaslavets L; Kurnikov I; Kamath G; Butin O; Illarionov A; Leontyev I; Olevanov M; Levitt M; Kornberg RD; Fain B On the Importance of Accounting for Nuclear Quantum Effects in Ab Initio Calibrated Force Fields in Biological Simulations. Proc. Natl. Acad. Sci. U.S.A 2018, 115 (36), 8878–8882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (4).Markland TE; Ceriotti M Nuclear Quantum Effects Enter the Mainstream. Nat. Rev. Chem 2018, 2 (3), No. 0109. [Google Scholar]
- (5).Markland TE; Manolopoulos DE An Efficient Ring Polymer Contraction Scheme for Imaginary Time Path Integral Simulations. J. Chem. Phys 2008, 129 (2), No. 024105. [DOI] [PubMed] [Google Scholar]
- (6).Ceriotti M; Bussi G; Parrinello M Nuclear Quantum Effects in Solids Using a Colored-Noise Thermostat. Phys. Rev. Lett 2009, 103 (3), No. 030603. [DOI] [PubMed] [Google Scholar]
- (7).Dammak H; Chalopin Y; Laroche M; Hayoun M; Greffet J-J Quantum Thermal Bath for Molecular Dynamics Simulation. Phys. Rev. Lett 2009, 103 (19), No. 190601. [DOI] [PubMed] [Google Scholar]
- (8).Fletcher T; Zhu A; Lawrence JE; Manolopoulos DE Fast Quasi-Centroid Molecular Dynamics. J. Chem. Phys 2021, 155 (23), No. 231101. [DOI] [PubMed] [Google Scholar]
- (9).Musil F; Zaporozhets I; Noé F; Clementi C; Kapil V Quantum Dynamics Using Path Integral Coarse-Graining. J. Chem. Phys 2022, 157 (18), No. 181102. [DOI] [PubMed] [Google Scholar]
- (10).Pereyaslavets L; Kamath G; Butin O; Illarionov A; Olevanov M; Kurnikov I; Sakipov S; Leontyev I; Voronina E; Gannon T; Nawrocki G; Darkhovskiy M; Ivahnenko I; Kostikov A; Scaranto J; Kurnikova MG; Banik S; Chan H; Sternberg MG; Sankaranarayanan SKRS; Crawford B; Potoff J; Levitt M; Kornberg RD; Fain B Accurate Determination of Solvation Free Energies of Neutral Organic Compounds from First Principles. Nat. Commun 2022, 13 (1), No. 414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Illarionov A; Sakipov S; Pereyaslavets L; Kurnikov IV; Kamath G; Butin O; Voronina E; Ivahnenko I; Leontyev I; Nawrocki G; Darkhovskiy M; Olevanov M; Cherniavskyi YK; Lock C; Greenslade S; Sankaranarayanan SK; Kurnikova MG; Potoff J; Kornberg RD; Levitt M; Fain B Combining Force Fields and Neural Networks for an Accurate Representation of Chemically Diverse Molecular Interactions. J. Am. Chem. Soc 2023, 145 (43), 23620–23629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Devereux C; Smith JS; Huddleston KK; Barros K; Zubatyuk R; Isayev O; Roitberg AE Extending the Applicability of the ANI Deep Learning Molecular Potential to Sulfur and Halogens. J. Chem. Theory Comput 2020, 16 (7), 4192–4202. [DOI] [PubMed] [Google Scholar]
- (13).von Lilienfeld OA; Müller K-R; Tkatchenko A Exploring Chemical Compound Space with Quantum-Based Machine Learning. Nat. Rev. Chem 2020, 4 (7), 347–358. [DOI] [PubMed] [Google Scholar]
- (14).Zhu X; Riera M; Bull-Vulpe EF; Paesani F MB-pol(2023): Sub-Chemical Accuracy for Water Simulations from the Gas to the Liquid Phase. J. Chem. Theory Comput 2023, 19 (12), 3551–3566. [DOI] [PubMed] [Google Scholar]
- (15).Reddy SK; Straight SC; Bajaj P; Huy Pham C; Riera M; Moberg DR; Morales MA; Knight C; Götz AW; Paesani F On the Accuracy of the MB-Pol Many-Body Potential for Water: Interaction Energies, Vibrational Frequencies, and Classical Thermodynamic and Dynamical Properties from Clusters to Liquid Water and Ice. J. Chem. Phys 2016, 145 (19), No. 194504. [DOI] [PubMed] [Google Scholar]
- (16).Yu Q; Qu C; Houston PL; Conte R; Nandi A; Bowman JM Q-AQUA: A Many-Body CCSD(T) Water Potential, Including Four-Body Interactions, Demonstrates the Quantum Nature of Water from Clusters to the Liquid Phase. J. Phys. Chem. Lett 2022, 13 (22), 5068–5074. [DOI] [PubMed] [Google Scholar]
- (17).Hagler AT Quantum Derivative Fitting and Biomolecular Force Fields: Functional Form, Coupling Terms, Charge Flux, Nonbond Anharmonicity, and Individual Dihedral Potentials. J. Chem. Theory Comput 2015, 11 (12), 5555–5572. [DOI] [PubMed] [Google Scholar]
- (18).Nawrocki G; Leontyev I; Sakipov S; Darkhovskiy M; Kurnikov I; Pereyaslavets L; Kamath G; Voronina E; Butin O; Illarionov A; Olevanov M; Kostikov A; Ivahnenko I; Patel DS; Sankaranarayanan SKRS; Kurnikova MG; Lock C; Crooks GE; Levitt M; Kornberg RD; Fain B Protein-Ligand Binding Free-Energy Calculations with ARROW—A Purely First-Principles Parameterized Polarizable Force Field. J. Chem. Theory Comput 2022, 18 (12), 7751–7763. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Balog E; Hughes AL; Martyna GJ Constant Pressure Path Integral Molecular Dynamics Studies of Quantum Effects in the Liquid State Properties of N-Alkanes. J. Chem. Phys 2000, 112 (2), 870–880. [Google Scholar]
- (20).Stern HA; Berne BJ Quantum Effects in Liquid Water: Path-Integral Simulations of a Flexible and Polarizable Ab Initio Model. J. Chem. Phys 2001, 115 (16), 7622–7628. [Google Scholar]
- (21).Cao J; Voth GA The Formulation of Quantum Statistical Mechanics Based on the Feynman Path Centroid Density. I. Equilibrium Properties. J. Chem. Phys 1994, 100 (7), 5093–5105. [Google Scholar]
- (22).Jang S; Voth GA Path Integral Centroid Variables and the Formulation of Their Exact Real Time Dynamics. J. Chem. Phys 1999, 111 (6), 2357–2370. [Google Scholar]
- (23).Cao J; Voth GA The Formulation of Quantum Statistical Mechanics Based on the Feynman Path Centroid Density. IV. Algorithms for Centroid Molecular Dynamics. J. Chem. Phys 1994, 101 (7), 6168–6183. [Google Scholar]
- (24).Hone TD; Izvekov S; Voth GA Fast Centroid Molecular Dynamics: A Force-Matching Approach for the Predetermination of the Effective Centroid Forces. J. Chem. Phys 2005, 122 (5), No. 054105. [DOI] [PubMed] [Google Scholar]
- (25).Ceriotti M; Manolopoulos DE Efficient First-Principles Calculation of the Quantum Kinetic Energy and Momentum Distribution of Nuclei. Phys. Rev. Lett 2012, 109 (10), No. 100604. [DOI] [PubMed] [Google Scholar]
- (26).Ceriotti M; Manolopoulos DE; Parrinello M Accelerating the Convergence of Path Integral Dynamics with a Generalized Langevin Equation. J. Chem. Phys 2011, 134 (8), No. 084104. [DOI] [PubMed] [Google Scholar]
- (27).Poltavsky I; Tkatchenko A Modeling Quantum Nuclei with Perturbed Path Integral Molecular Dynamics. Chem. Sci 2016, 7 (2), 1368–1372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).Kapil V; VandeVondele J; Ceriotti M Accurate Molecular Dynamics and Nuclear Quantum Effects at Low Cost by Multiple Steps in Real and Imaginary Time: Using Density Functional Theory to Accelerate Wavefunction Methods. J. Chem. Phys 2016, 144 (5), No. 054111. [DOI] [PubMed] [Google Scholar]
- (29).Behler J Perspective: Machine Learning Potentials for Atomistic Simulations. J. Chem. Phys 2016, 145 (17), No. 170901. [DOI] [PubMed] [Google Scholar]
- (30).Gokcan H; Isayev O Learning Molecular Potentials with Neural Networks. WIREs Comput. Mol. Sci 2022, 12 (2), No. e1564, DOI: 10.1002/wcms.1564. [DOI] [Google Scholar]
- (31).Zhang Y; Hu C; Jiang B Accelerating Atomistic Simulations with Piecewise Machine-Learned Ab Initio Potentials at a Classical Force Field-like Cost. Phys. Chem. Chem. Phys 2021, 23 (3), 1815–1821. [DOI] [PubMed] [Google Scholar]
- (32).Sauceda HE; Gálvez-González LE; Chmiela S; Paz-Borbón LO; Müller K-R; Tkatchenko A BIGDML-Towards Accurate Quantum Machine Learning Force Fields for Materials. Nat. Commun 2022, 13 (1), No. 3733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (33).Unke OT; Meuwly M PhysNet: A Neural Network for Predicting Energies, Forces, Dipole Moments, and Partial Charges. J. Chem. Theory Comput 2019, 15 (6), 3678–3693. [DOI] [PubMed] [Google Scholar]
- (34).Schriber JB; Nascimento DR; Koutsoukas A; Spronk SA; Cheney DL; Sherrill CD CLIFF: A Component-Based, Machine-Learned, Intermolecular Force Field. J. Chem. Phys 2021, 154 (18), No. 184110. [DOI] [PubMed] [Google Scholar]
- (35).Kocer E; Ko TW; Behler J Neural Network Potentials: A Concise Overview of Methods. Annu. Rev. Phys. Chem 2022, 73, 163–186. [DOI] [PubMed] [Google Scholar]
- (36).Wang J; Olsson S; Wehmeyer C; Pérez A; Charron NE; de Fabritiis G; Noé F; Clementi C Machine Learning of Coarse-Grained Molecular Dynamics Force Fields. ACS Cent. Sci 2019, 5 (5), 755–767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (37).Jin J; Pak AJ; Durumeric AEP; Loose TD; Voth GA Bottom-up Coarse-Graining: Principles and Perspectives. J. Chem. Theory Comput 2022, 18 (10), 5759–5791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (38).Ryu WH; Voth GA Coarse-Graining of Imaginary Time Feynman Path Integrals: Inclusion of Intramolecular Interactions and Bottom-up Force-Matching. J. Phys. Chem. A 2022, 126 (35), 6004–6019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (39).Loose TD; Sahrmann PG; Voth GA Centroid Molecular Dynamics Can Be Greatly Accelerated through Neural Network Learned Centroid Forces Derived from Path Integral Molecular Dynamics. J. Chem. Theory Comput 2022, 18 (10), 5856–5863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (40).Wu C; Li R; Yu K Learning the Quantum Centroid Force Correction in Molecular Systems: A Localized Approach. Front. Mol. Biosci 2022, 9, No. 851311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (41).Habershon S; Markland TE; Manolopoulos DE Competing Quantum Effects in the Dynamics of a Flexible Water Model. J. Chem. Phys 2009, 131 (2), No. 024501. [DOI] [PubMed] [Google Scholar]
- (42).Schran C; Marsalek O; Markland TE Unravelling the Influence of Quantum Proton Delocalization on Electronic Charge Transfer through the Hydrogen Bond. Chem. Phys. Lett 2017, 678, 289–295. [Google Scholar]
- (43).Law YK; Hassanali AA The Importance of Nuclear Quantum Effects in Spectral Line Broadening of Optical Spectra and Electrostatic Properties in Aromatic Chromophores. J. Chem. Phys 2018, 148 (10), No. 102331. [DOI] [PubMed] [Google Scholar]
- (44).Sauceda HE; Vassilev-Galindo V; Chmiela S; Müller, K.-R.; Tkatchenko, A. Dynamical Strengthening of Covalent and Non-Covalent Molecular Interactions by Nuclear Quantum Effects at Finite Temperature. Nat. Commun 2021, 12 (1), No. 442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (45).Kocer E; Mason JK; Erturk H A Novel Approach to Describe Chemical Environments in High-Dimensional Neural Network Potentials. J. Chem. Phys 2019, 150 (15), No. 154102. [DOI] [PubMed] [Google Scholar]
- (46).Kocer E; Mason JK; Erturk H Continuous and Optimally Complete Description of Chemical Environments Using Spherical Bessel Descriptors. AIP Adv 2020, 10, No. 015021. [Google Scholar]
- (47).Kamath G; Kurnikov I; Fain B; Leontyev I; Illarionov A; Butin O; Olevanov M; Pereyaslavets L Prediction of Cyclohexane-Water Distribution Coefficient for SAMPL5 Drug-like Compounds with the QMPFF3 and ARROW Polarizable Force Fields. J. Comput. Aided Mol. Des 2016, 30 (11), 977–988. [DOI] [PubMed] [Google Scholar]
- (48).Giese TJ; Panteva MT; Chen H; York DM Multipolar Ewald Methods, 1: Theory, Accuracy, and Performance. J. Chem. Theory Comput 2015, 11 (2), 436–450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (49).Bennett CH Efficient Estimation of Free Energy Differences from Monte Carlo Data. J. Comput. Phys 1976, 22 (2), 245–268. [Google Scholar]
- (50).Pastor RW; Brooks BR; Szabo A An Analysis of the Accuracy of Langevin and Molecular Dynamics Algorithms. Mol. Phys 1988, 65 (6), 1409–1419. [Google Scholar]
- (51).Berendsen HJC; van Postma J; Van Gunsteren WF; DiNola A; Haak JR Molecular Dynamics with Coupling to an External Bath. J. Chem. Phys 1984, 81 (8), 3684–3690. [Google Scholar]
- (52).Tuckerman M; Berne BJ; Martyna GJ Reversible Multiple Time Scale Molecular Dynamics. J. Chem. Phys 1992, 97 (3), 1990–2001. [Google Scholar]
- (53).Tuckerman ME; Berne BJ; Martyna GJ; Klein ML Efficient Molecular Dynamics and Hybrid Monte Carlo Algorithms for Path Integrals. J. Chem. Phys 1993, 99 (4), 2796–2808. [Google Scholar]
- (54).Martyna GJ; Hughes A; Tuckerman ME Molecular Dynamics Algorithms for Path Integrals at Constant Pressure. J. Chem. Phys 1999, 110 (7), 3275–3290. [Google Scholar]
- (55).Illarionov A; Sakipov S; Pereyaslavets L; Kurnikov I; Kamath G; Butin O; Voronina E; Ivahnenko I; Leontyev I; Nawrocki G; Darkhovskiy M; Olevanov M; Cherniavkyi YK; Lock C; Greenslade S; Sankaranarayanan S; Kurnikova MG; Potoff J; Kornberg RD; Levitt M; Fain B Fast and Accurate Representation of Diverse Molecular Interactions. J. Am. Chem. Soc 2023, 145, 23620–23629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (56).Abadi M; Agarwal A; Barham P; Brevdo E; Chen Z; Citro C; Corrado GS; Davis A; Dean J; Devin M; Ghemawat S; Goodfellow I; Harp A; Irving G; Isard M; Jia Y; Jozefowicz R; Kaiser L; Kudlur M; Levenberg J; Mané D; Monga R; Moore S; Murray D; Olah C; Schuster M; Shlens J; Steiner B; Sutskever I; Talwar K; Tucker P; Vanhoucke V; Vasudevan V; Viégas F; Vinyals O; Warden P; Wattenberg M; Wicke M; Yu Y; Zheng X TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems, 2015. https://www.tensorflow.org/.
- (57).Witt A; Ivanov SD; Shiga M; Forbert H; Marx D On the Applicability of Centroid and Ring Polymer Path Integral Molecular Dynamics for Vibrational Spectroscopy. J. Chem. Phys 2009, 130 (19), No. 194510. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
