Nature Communications. 2018 Sep 24;9:3887. doi: 10.1038/s41467-018-06169-2

Towards exact molecular dynamics simulations with machine-learned force fields

Stefan Chmiela 1, Huziel E Sauceda 2, Klaus-Robert Müller 1,3,4, Alexandre Tkatchenko 5
PMCID: PMC6155327  PMID: 30250077

Abstract

Molecular dynamics (MD) simulations employing classical force fields constitute the cornerstone of contemporary atomistic modeling in chemistry, biology, and materials science. However, the predictive power of these simulations is only as good as the underlying interatomic potential. Classical potentials often fail to faithfully capture key quantum effects in molecules and materials. Here we enable the direct construction of flexible molecular force fields from high-level ab initio calculations by incorporating spatial and temporal physical symmetries into a gradient-domain machine learning (sGDML) model in an automatic data-driven way. The developed sGDML approach faithfully reproduces global force fields at quantum-chemical CCSD(T) level of accuracy and allows converged molecular dynamics simulations with fully quantized electrons and nuclei. We present MD simulations for flexible molecules with up to a few dozen atoms and provide insights into the dynamical behavior of these molecules. Our approach provides the key missing ingredient for achieving spectroscopic accuracy in molecular simulations.


Simultaneous accurate and efficient prediction of molecular properties relies on combined quantum mechanics and machine learning approaches. Here the authors develop a flexible machine-learning force-field with high-level accuracy for molecular dynamics simulations.

Introduction

Molecular dynamics (MD) simulations within the Born-Oppenheimer (BO) approximation constitute the cornerstone of contemporary atomistic modeling. In fact, the 2013 Nobel Prize in Chemistry clearly highlighted the remarkable advances made by MD simulations in offering unprecedented insights into complex chemical and biological systems. However, one of the widely recognized and increasingly pressing issues in MD simulations is the lack of accuracy of the underlying classical interatomic potentials, which hinders truly predictive modeling of the dynamics and function of (bio)molecular systems. One possible solution to the accuracy problem is provided by direct ab initio molecular dynamics (AIMD) simulations, where the quantum-mechanical forces are computed on the fly for atomic configurations at every time step1. The majority of AIMD simulations employ the current workhorse method of electronic-structure theory, namely density-functional approximations (DFA) to the exact solution of the Schrödinger equation for a system of nuclei and electrons. Unfortunately, different DFAs yield contrasting results2 for the structure, dynamics, and properties of molecular systems. Furthermore, DFA calculations are not systematically improvable. Alternatively, explicitly correlated methods beyond DFA could also be used in AIMD simulations; unfortunately, this leads to a steep increase in the required computational resources. For example, a nanosecond-long MD trajectory for a single ethanol molecule executed with the CCSD(T) method would take roughly a million CPU years on modern hardware. An alternative is a direct fit of the potential-energy surface (PES) from a large number of CCSD(T) calculations; however, this is only practically achievable for rather small and rigid molecules3–5.

To solve this accuracy and molecular size dilemma, and furthermore to enable converged AIMD simulations close to the exact solution of the Schrödinger equation, here we develop an alternative approach using symmetrized gradient-domain machine learning (sGDML) to construct force fields with the accuracy of high-level ab initio calculations. Recently, a wide range of sophisticated machine learning (ML) models for small molecules and elemental materials6–46 have been proposed for constructing PESs from DFA calculations. While these results are encouraging, direct ML fitting of molecular PESs relies on the availability of large reference datasets to obtain an accurate model. Frequently, those ML models are trained on thousands or even millions of atomic configurations. This prevents the construction of ML models using high-level ab initio methods, for which energies and forces can be practically computed for only a few hundred conformations.

Instead, we propose a solution that allows converged MD simulations with fully quantized electrons and nuclei for molecules with up to a few dozen atoms. This is enabled by two novel aspects: a reduction of the problem complexity through a data-driven discovery of relevant spatial and temporal physical symmetries, and enhancing the information content of data samples by exercising these identified static and dynamic symmetries, hence implicitly increasing the amount of training data. Using the proposed sGDML approach, we carry out MD simulations at the ab initio coupled cluster level of electronic-structure theory and provide insights into their dynamical behavior. Our approach contributes the key missing ingredient for achieving spectroscopic accuracy and rigorous dynamical insights in molecular simulations.

Results

Symmetrized gradient-domain machine learning

The sGDML model is built on the previously introduced gradient-domain machine learning (GDML) model47, but now incorporates all relevant physical symmetries, hence enabling MD simulations with high-level ab initio force field accuracy. One can classify physical symmetries of molecular systems into symmetries of space and time and specific static and dynamic symmetries of a given molecule (see Fig. 1). Global spatial symmetries include rotational and translational invariance of the energy, while homogeneity of time implies energy conservation. These global symmetries were already successfully incorporated into the GDML model47. Additionally, molecules possess well-defined rigid space group symmetries (e.g., reflections), as well as dynamic non-rigid symmetries (e.g., methyl group rotations). For example, the benzene molecule with only six carbon and six hydrogen atoms can already be indexed in 6!·6! = 518,400 different, but physically equivalent, ways. However, not all of these symmetric variants are accessible without crossing impassable energy barriers; only the 24 symmetry elements in the D6h point group of this molecule are relevant. While methods for identifying molecular point groups for polyatomic rigid molecules are readily available48, Longuet-Higgins49 has pointed out that non-rigid molecules have extra symmetries. These dynamical symmetries arise upon functional-group rotations or torsional displacements, and they are usually not incorporated in traditional force fields and electronic-structure calculations. Typically, extracting non-rigid symmetries requires chemical and physical intuition about the system at hand. Here we develop a physically motivated algorithm for data-driven discovery of all relevant molecular symmetries from MD trajectories.

Fig. 1

Fully data-driven symmetry discovery. a, b Our multipartite matching algorithm recovers a globally consistent atom-atom assignment across the whole training set of molecular conformations, which directly enables the identification and reconstructive exploitation of relevant spatial and temporal physical symmetries of the molecular dynamics. c The global solution is obtained via synchronization of approximate pairwise matchings based on the assignment of adjacency-matrix eigenvectors, which correspond across nearly isomorphic molecular graphs. We take advantage of the fact that the minimal spanning set of best bipartite assignments fully describes the multipartite matching, which is recovered via its transitive closure. Symmetries that are not relevant within the scope of the training dataset are successfully ignored. d This enables the efficient construction of individual kernel functions for each training molecule, reflecting the joint similarity of all its symmetric variants with another molecule. The kernel exercises the symmetries by consolidating all training examples in an arbitrary reference configuration, from which they are distributed across all symmetric subdomains. This approach effectively trains on the fully symmetrized dataset without incurring the additional computational cost

MD trajectories consist of smooth consecutive changes in nearly isomorphic molecular graphs. When sampling from these trajectories, the combinatorial challenge is to correctly identify the same atoms across the examples, such that the learning method can use consistent information when comparing two molecular conformations in its kernel function. While so-called bi-partite matching allows the local assignment of atoms $\mathbf{R} = \{\mathbf{r}_1, \dots, \mathbf{r}_N\}$ for each pair of molecules in the training set, this strategy alone is not sufficient, as it needs to be made globally consistent by multipartite matching in a second step50–52.

We start with adjacency matrices as the representation for the molecular graph9,13,47,53,54. To solve the pairwise matching problem, we therefore seek the assignment τ that minimizes the squared Euclidean distance between the adjacency matrices $\mathbf{A}$ of two isomorphic graphs G and H with entries $(\mathbf{A})_{ij} = \lVert \mathbf{r}_i - \mathbf{r}_j \rVert$, where $\mathbf{P}(\tau)$ is the permutation matrix that realizes the assignment:

$$\arg\min_{\tau} L(\tau) = \left\lVert \mathbf{P}(\tau)\,\mathbf{A}_G\,\mathbf{P}(\tau)^{\top} - \mathbf{A}_H \right\rVert^2. \qquad (1)$$

Adjacency matrices of isomorphic graphs have identical eigenvalues and eigenvectors; only their assignment differs. Following the approach of Umeyama55, we identify the correspondence of the eigenvectors by projecting both sets $\mathbf{U}_G$ and $\mathbf{U}_H$ onto each other to find the best overlap. After sorting the eigenvalues and resolving the sign ambiguity, we use the overlap matrix

$$\mathbf{M} = \left|\mathbf{U}_G\right|\left|\mathbf{U}_H\right|^{\top}, \qquad (2)$$

where $|\cdot|$ is taken element-wise. Then $-\mathbf{M}$ is provided as the cost matrix for the Hungarian algorithm56, which maximizes the overall overlap and finally returns the approximate assignment $\tilde{\tau}$ that minimizes Eq. (1), thus providing the result of step one of the procedure. As indicated, global inconsistencies may arise, e.g., violations of the transitivity property $\tau_{jk} \circ \tau_{ij} = \tau_{ik}$ of the assignments; a second step is therefore necessary, based on the composite matrix $\tilde{\mathbf{P}}$ of all pairwise assignment matrices $\tilde{\mathbf{P}}_{ij} \equiv \mathbf{P}(\tilde{\tau}_{ij})$ within the training set.
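
As an illustration of this first, pairwise step, the sketch below builds the eigenvector-overlap matrix of Eq. (2) from two geometries and feeds its negative to the Hungarian algorithm (the nuclear-charge penalty described in the Methods is omitted here); the function and variable names are illustrative and not part of the published sGDML code.

```python
# Minimal sketch of the pairwise (bipartite) matching step of Eqs. (1)-(2),
# assuming Cartesian geometries r_G, r_H of the same molecule (shape N x 3).
import numpy as np
from scipy.optimize import linear_sum_assignment  # Hungarian algorithm
from scipy.spatial.distance import pdist, squareform


def bipartite_match(r_G, r_H):
    """Approximate atom assignment between two nearly isomorphic geometries."""
    A_G = squareform(pdist(r_G))      # adjacency matrix with entries ||r_i - r_j||
    A_H = squareform(pdist(r_H))

    _, U_G = np.linalg.eigh(A_G)      # eigh returns eigenvalues in sorted order
    _, U_H = np.linalg.eigh(A_H)

    # Overlap of the eigenvector sets; abs() removes the sign ambiguity (Eq. (2)).
    M = np.abs(U_G) @ np.abs(U_H).T

    # The Hungarian algorithm minimizes cost, so -M maximizes the total overlap.
    row, col = linear_sum_assignment(-M)
    return col                        # col[a] is the atom of H matched to atom a of G
```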

We propose to reconstruct a rank-limited $\mathbf{P}$ via the transitive closure of the minimum spanning tree (MST) that minimizes the bi-partite matching cost (see Eq. (1) and Fig. 1) over the training set. The MST is constructed from the most confident bi-partite assignments and represents the rank-$N$ skeleton of $\tilde{\mathbf{P}}$, which also defines $\mathbf{P}$.
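
A possible realization of this synchronization step is sketched below: the MST over the pairwise matching costs is computed with SciPy, and the most confident bipartite assignments are composed along the tree to map every training geometry onto a common reference ordering. The data layout (a cost matrix plus a table of pairwise assignments) and all names are assumptions for illustration.

```python
# Sketch of the globally consistent (multipartite) matching via the transitive
# closure of the minimum spanning tree over the pairwise matching costs.
import numpy as np
from collections import deque
from scipy.sparse.csgraph import minimum_spanning_tree


def synchronize_assignments(cost, tau, n_atoms):
    """cost: symmetric (M x M) matrix of strictly positive pairwise matching
    costs L(tau_ij); tau[i][j]: index array that reorders geometry j into the
    atom order of geometry i. Returns, for each geometry, the permutation that
    maps it onto the ordering of reference geometry 0."""
    M = cost.shape[0]
    mst = minimum_spanning_tree(cost).toarray()
    adj = (mst + mst.T) > 0                      # undirected edges of the MST

    ref = [None] * M
    ref[0] = np.arange(n_atoms)                  # geometry 0 is the reference
    queue = deque([0])
    while queue:                                 # breadth-first traversal of the tree
        i = queue.popleft()
        for j in np.flatnonzero(adj[i]):
            if ref[j] is None:
                # Transitive closure: compose the confident assignment between
                # i and j with the known mapping of i onto the reference.
                ref[j] = tau[i][j][ref[i]]
                queue.append(j)
    return ref
```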

The resulting consistent multipartite matching P enables us to construct symmetric kernel-based ML models of the form

$$\hat{f}(\mathbf{x}) = \sum_{ij}^{M} \alpha_{ij}\, \kappa\!\left(\mathbf{x}, \mathbf{P}_{ij}\mathbf{x}_i\right), \qquad (3)$$

by augmenting the training set with the symmetric variations of each molecule (see Supplementary Note 1 for a comparison with alternative symmetry-adapted kernel functions). A particular advantage of our solution is that it can fully populate all recovered permutational configurations even if they do not form a symmetric group, severely reducing the computational effort in evaluating the model. Even if we limit the range of $j$ to the $S$ unique assignments only, the major downside of this approach is that the multiplied training set size leads to a drastic increase in the complexity of the cubically scaling kernel ridge regression learning algorithm. We overcome this drawback by exploiting the fact that the set of coefficients $\boldsymbol{\alpha}$ for the symmetrized training set exhibits the same symmetries as the data; hence the linear system can be contracted to its original size, while still defining the full set of coefficients exactly.

For notational convenience we transform all training geometries into a canonical permutation $\mathbf{x}_i \leftarrow \mathbf{P}_{i1}\mathbf{x}_i$, enabling the use of uniform symmetry transformations $\mathbf{P}_j \equiv \mathbf{P}_{1j}$ (see Supplementary Note 2). Simplifying Eq. (3) accordingly gives rise to the symmetric kernel that we originally set out to construct,

$$\hat{f}(\mathbf{x}) = \sum_{i}^{M} \alpha_i \sum_{q}^{S} \kappa\!\left(\mathbf{x}, \mathbf{P}_q\mathbf{x}_i\right) = \sum_{i} \alpha_i\, \kappa_{\mathrm{sym}}\!\left(\mathbf{x}, \mathbf{x}_i\right), \qquad (4)$$

and yields a model with the exact same number of parameters as the original, non-symmetric one.
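
The following minimal sketch illustrates the contracted symmetric kernel of Eq. (4), with a plain Gaussian kernel on Cartesian coordinates standing in for the descriptor-based Matérn kernel used in practice; perms holds the S recovered atom permutations, and all names are illustrative.

```python
# Sketch of the symmetrized kernel of Eq. (4): the similarity between x and a
# training point x_i is accumulated over all S recovered permutations P_q.
import numpy as np


def kernel(x, y, sigma=1.0):
    """Gaussian kernel on flattened Cartesian coordinates (illustrative choice)."""
    return np.exp(-np.sum((x - y) ** 2) / (2.0 * sigma ** 2))


def kernel_sym(x, x_i, perms, sigma=1.0):
    """perms: list of index arrays P_q, each a permutation of the atom indices."""
    R_i = x_i.reshape(-1, 3)
    return sum(kernel(x, R_i[P].ravel(), sigma) for P in perms)
```

A prediction then reads $\hat{f}(\mathbf{x}) = \sum_i \alpha_i\, \kappa_{\mathrm{sym}}(\mathbf{x}, \mathbf{x}_i)$ with the contracted coefficients $\alpha_i$.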

Our symmetric kernel is an extension to regular kernels and can be applied universally, in particular to kernel-based force fields. Here we construct a symmetric variant of the GDML model, sGDML. This symmetrized GDML force field kernel takes the form:

$$\mathrm{Hess}\!\left(\kappa_{\mathrm{sym}}\right)\!\left(\mathbf{x}, \mathbf{x}'\right) = \sum_{q}^{S} \mathrm{Hess}\!\left(\kappa\right)\!\left(\mathbf{x}, \mathbf{P}_q\mathbf{x}'\right)\mathbf{P}_q. \qquad (5)$$

Accordingly, the trained force field estimator collects the contributions of all $3N$ partial derivatives of each of the $M$ training points and of the $S$ symmetry transformations to compile the prediction for a new input $\mathbf{x}$. It takes the form

$$\hat{\mathbf{f}}_{\mathbf{F}}(\mathbf{x}) = \sum_{i}^{M}\sum_{l}^{3N}\sum_{q}^{S} \left(\mathbf{P}_q\boldsymbol{\alpha}_i\right)_l \frac{\partial}{\partial x_l}\, \nabla \kappa\!\left(\mathbf{x}, \mathbf{P}_q\mathbf{x}_i\right) \qquad (6)$$

and a corresponding energy predictor is obtained by integrating $\hat{\mathbf{f}}_{\mathbf{F}}$ with respect to the Cartesian geometry. Due to the linearity of integration, the expression for the energy predictor is identical up to the second-derivative operator on the kernel function.
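
To make the structure of Eq. (6) concrete, the sketch below evaluates the force predictor for a Gaussian kernel acting directly on Cartesian coordinates; the actual sGDML model uses a Matérn kernel on a distance-based descriptor, so this is an illustration of the triple sum rather than the production model, and all names are assumptions.

```python
# Sketch of the force predictor of Eq. (6): mixed kernel second derivatives
# contracted with the permuted coefficient vectors of every training point.
import numpy as np


def gauss_hessian(x, y, sigma=1.0):
    """Mixed second derivatives d^2 k / (dx dy) of a Gaussian kernel."""
    d = x - y
    k = np.exp(-np.dot(d, d) / (2.0 * sigma ** 2))
    return (np.eye(x.size) / sigma ** 2 - np.outer(d, d) / sigma ** 4) * k


def predict_forces(x, X_train, alphas, perms, sigma=1.0):
    """x: flattened geometry (3N,); X_train, alphas: arrays of shape (M, 3N);
    perms: list of S atom-index permutations."""
    n_atoms = x.size // 3
    f = np.zeros_like(x)
    for x_i, a_i in zip(X_train, alphas):
        R_i, A_i = x_i.reshape(n_atoms, 3), a_i.reshape(n_atoms, 3)
        for P in perms:
            # Contribution of training point i under symmetry transformation P_q.
            f += gauss_hessian(x, R_i[P].ravel(), sigma) @ A_i[P].ravel()
    return f
```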

Every (s)GDML model is trained on a set of reference examples that reflects the population of energy states a particular molecule visits during an MD simulation at a certain temperature. For our purposes, the corresponding set of geometries is subsampled from a 200 picosecond DFT MD trajectory at 500 K following the Boltzmann distribution. Subsequently, a globally consistent permutation graph is constructed that jointly assigns all geometries in the training set, providing a small selection of physically feasible transformations that define the training set specific symmetric kernel function. In the interest of computational tractability, we shortcut this sampling process to construct sGDML@CCSD(T) and only recompute energy and force labels at this higher level of theory.

The sGDML model can be trained in closed form, which is both quicker and more accurate than numerical solutions. Model selection is performed through a grid search on a suitable subset of the hyper-parameter space. Throughout, cross-validation with dedicated datasets for training, testing, and validation is used to estimate the generalization performance of the model.
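
A schematic of this training and model-selection loop is given below. It assumes two callables, build_K and predict, standing in for the assembly of the symmetrized kernel matrix (Eq. (7)) and the force predictor (Eq. (6)); these names and signatures are illustrative, not the published sGDML API.

```python
# Sketch of closed-form training and grid-search model selection.
import numpy as np
from scipy.linalg import cho_factor, cho_solve


def fit_closed_form(K, y, lam=1e-12):
    """Solve the regularized linear system (K + lam*I) alpha = y exactly.
    K is symmetric positive semi-definite, so Cholesky applies after shifting."""
    return cho_solve(cho_factor(K + lam * np.eye(K.shape[0])), y)


def select_model(build_K, predict, X_tr, y_tr, X_va, F_va, sigmas, lams):
    """Grid search over kernel width sigma and regularization lam; the model
    with the lowest force MAE on a held-out validation split is kept."""
    best, best_mae = None, np.inf
    for sigma in sigmas:
        K = build_K(X_tr, sigma)                 # symmetrized kernel matrix
        for lam in lams:
            alpha = fit_closed_form(K, y_tr, lam)
            mae = np.mean([np.abs(predict(x, X_tr, alpha, sigma) - f).mean()
                           for x, f in zip(X_va, F_va)])
            if mae < best_mae:
                best, best_mae = (sigma, lam, alpha), mae
    return best, best_mae
```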

Forces and energies from GDML to sGDML@DFT to sGDML@CCSD(T)

Our goal is to demonstrate that it is possible to construct compact sGDML models that faithfully recover CCSD(T) force fields for flexible molecules with up to 20 atoms, using only a small set of a few hundred molecular conformations. As a first step, we investigate the gain in efficiency and accuracy of the sGDML model vs. the GDML model employing MD trajectories of ten molecules from benzene to azobenzene computed with DFT (see Fig. 2 and Supplementary Table 1). The benefit of a symmetric model is directly linked to the number of symmetries in the system. For toluene, naphthalene, aspirin, malonaldehyde, ethanol, paracetamol, and azobenzene, sGDML improves the force prediction by 31.3–67.4% using the same training sets in all cases (see Table 1). As expected, uracil and salicylic acid have no exploitable symmetries, hence the performance of sGDML is unchanged with respect to GDML. The inclusion of symmetries leads to a stronger improvement in force prediction performance compared to energy predictions. This is most clearly visible for the naphthalene dataset, where only the force predictions improve while the energy error remains unchanged. We attribute this to the difference in complexity of the two quantities and the fact that an energy penalty is intentionally omitted in the cost function to avoid a tradeoff.

Fig. 2

Data efficiency gains using sGDML vs. GDML. Energy and force prediction accuracy (in terms of the mean absolute error (MAE)) as a function of training set size of both models trained on DFT forces: the gain in efficiency and accuracy is directly linked to the number of symmetries in the system

Table 1.

Relative increase in accuracy of the sGDML@DFT vs. the non-symmetric GDML model: the benefit of a symmetric model is directly linked to the number of permutational symmetries in the system

Molecule        #Sym. in κsym   ΔMAE energy (%)   ΔMAE forces (%)
Benzene         12              −1.6              −62.3
Uracil          1               0.0               0.0
Naphthalene     4               0.0               −52.2
Aspirin         6               −29.6             −31.3
Salicylic acid  1               0.0               0.0
Malonaldehyde   4               −37.5             −48.8
Ethanol         6               −53.4             −58.2
Toluene         12              −16.7             −67.4
Paracetamol     12              −40.7             −52.9
Azobenzene      8               −74.3             −47.4

All symmetry counts include the identity transformation

A minimal force accuracy required for reliable MD simulations is MAE = 1 kcal mol−1 Å−1. While the GDML model can achieve this accuracy at around 800 training examples for all molecules except aspirin, sGDML only needs 200 training examples to reach the same quality. Note that energy-based ML approaches typically require two to three orders of magnitude more data47.

Given that the novel sGDML model is data efficient and highly accurate, we are now in a position to tackle CCSD(T) level of accuracy with modest computational resources. We have trained sGDML models on CCSD(T) forces for benzene, toluene, ethanol, and malonaldehyde. For the larger aspirin molecule, we used CCSD forces (see Supplementary Table 2). The sGDML@CCSD(T) model achieves a high accuracy for energies, reducing the prediction error of sGDML@DFT by a factor of 1.4 (for ethanol) to 3.4 (for toluene). This finding leads to an interesting hypothesis that sophisticated quantum-mechanical force fields are smoother and, as a convenient side effect, easier to learn. Note that the accuracy of the force prediction in both sGDML@CCSD(T) and sGDML@DFT is comparable, with the benzene molecule as the only exception. We attribute this to slight shifts in the locations of the minima on the PES between DFT and CCSD(T), which means that the data sampling process for CCSD(T) can be further improved. In principle, we envision a corrected resampling procedure for CCSD(T), using the sGDML@CCSD(T) model, as future work.

MD with ab initio accuracy

The predictive power of a force field can only be truly assessed by computing dynamical and thermodynamical observables, which require sufficient sampling of the configuration space, for example by employing MD or Monte Carlo simulations. We remark that global error measures, such as the mean absolute error (MAE) and the root mean squared error, are typically prone to overestimate the reconstruction quality of the force field, as they average out local topological properties. However, these local properties can become highly relevant when the model is used for an actual analysis of MD trajectories. As a demonstration, we use the ethanol molecule; this molecule has three minima, gauche± (Mg±) and trans (Mt), shown in Fig. 3a, where experimentally it has been confirmed that Mt is the ground state and Mg is a local minimum57. The energy difference between these two minima is only 0.12 kcal mol−1 and they are separated by an energy barrier of 1.15 kcal mol−1. Obviously, the widely discussed ML target accuracy of 1 kcal mol−1 is not sufficient to describe the dynamics of ethanol and other molecules.

Fig. 3

Molecular dynamics simulations for ethanol. a Potential energy profile of the dihedral angle describing the rotation of the hydroxyl group for CCSD(T) (red) vs. DFT (blue). The energetic barriers predicted by sGDML@CCSD(T) are: Mt → Mg: 1.18 kcal mol−1, Mg− → Mg+: 1.19 kcal mol−1, and Mg → Mt: 1.07 kcal mol−1. The dashed lines show the probability distributions obtained from PIMD at 300 K. b Joint probability distribution function for the two dihedral angles of the methyl and hydroxyl functional groups. Each minimum is annotated with the occupation probability obtained from classical and path-integral MD in comparison with experimental values. c Analysis of vibrational spectra (velocity–velocity autocorrelation function). (top) Comparison between the vibrational spectrum obtained from PIMD simulations at 300 K for sGDML@CCSD(T) and its sGDML@DFT counterpart; (middle) comparison between the sGDML@CCSD(T) PIMD spectrum and the harmonic approximation based on CCSD(T) frequencies; (bottom) comparison of sGDML@CCSD(T) PIMD spectra at 300 and 100 K. The rightmost panel shows several characteristic normal modes of ethanol, where atomic displacements are illustrated by green arrows

This brings us to another crucial issue for predictive models: the reference data accuracy. Computing the energy difference between Mt and Mg using DFT(PBE-TS) we observe that Mg is 0.08 kcal mol−1 more stable than Mt, contradicting the experimental measurements. Repeating the same calculation using CCSD(T)/cc-pVTZ we find that Mt is more stable than Mg by 0.08 kcal mol−1, in excellent agreement with experiment. From this analysis and subsequent MD simulations we conclude that CCSD(T) or sometimes even higher accuracy is necessary for truly predictive insights.

In addition to requiring highly accurate quantum chemical approximations, the ethanol molecule also belongs to a category of fluxional molecules sensitive to nuclear quantum effects (NQE). This is because the internal rotational barriers of the ethanol molecule (Mg ↔ Mt) are on the order of ~1.2 kcal mol−1 (see Fig. 3), which is neither low enough to generate frequent transitions nor high enough to avoid them. In classical MD at room temperature, the thermal fluctuations lead to inadequate sampling of the PES. By correctly including NQE via path-integral MD (PIMD), the ethanol molecule is able to transition between the Mg and Mt configurations, radically increasing the transition frequency (see Supplementary Figure 1) and generating statistical weights in excellent agreement with experiment. Figure 3b shows the statistical occupations of the different minima for ethanol using classical MD and PIMD for the sGDML@CCSD(T) and sGDML@DFT models in comparison with the experimental results. Overall, our MD results for ethanol highlight the necessity of using a highly accurate force field with an equally accurate treatment of NQE for achieving a reliable and quantitative understanding of molecular systems.

Having established the accuracy of the statistical occupations of the different states of ethanol, we are now in a position to discuss for the first time the CCSD(T) vibrational spectrum of ethanol, computed using the velocity–velocity autocorrelation function based on centroid PIMD (see Fig. 3c). As a reference, in Fig. 3c-top we compare the vibrational spectra from the DFT and CCSD(T) sGDML models in the fingerprint region; as expected, the sGDML@CCSD(T) model generates higher frequencies, while the two spectra share similar shapes with slightly different peak intensities. Molecular vibrational spectra at finite temperature include anharmonic effects, hence anharmonicities can be studied by comparing the sGDML@CCSD(T) spectrum with the harmonic approximation. Figure 3c-middle shows such a comparison and demonstrates that low-frequency and non-symmetric vibrations are most affected by finite-temperature contributions. The thermal frequency shift can be better seen in Fig. 3c-bottom, where the sGDML@CCSD(T) spectrum is compared at two different temperatures. We observe that each normal mode is shifted in a specific manner and not by a simple scaling factor, as typically assumed. The most striking finding from our simulations is the resolution of the apparent mismatch between theory and experiment, explaining the origin of the torsional frequency of the hydroxyl group. Experimentally, the low-frequency region of ethanol, around ~210 cm−1, is not fully understood, but there are frequency measurements for the hydroxyl rotor ranging between ~202 cm−1 58,59 and ~207 cm−1 60 for gas-phase ethanol, while theoretically we found 243.7 cm−1 at the sGDML@CCSD(T) level of theory in the harmonic approximation. From the middle and bottom panels in Fig. 3c, we observe that with increasing temperature the lowest peak shifts to substantially lower frequencies compared to the rest of the spectrum. The origin of this phenomenon is the strong anharmonic behavior of the lowest normal mode 1, shown in Fig. 3c-middle, which mainly corresponds to hydroxyl group rotations. At room temperature the frequency of this mode drops to ~215 cm−1, corresponding to a red-shift of 12% and moving closer to the experimental results, demonstrating the importance of dynamical anharmonicities.
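
For reference, a minimal version of this analysis is sketched below: the velocity autocorrelation function is computed via FFT and Fourier transformed to a power spectrum on a wavenumber axis. It assumes plain (e.g., centroid) velocities on a regular time grid and omits mass weighting and other refinements that a production analysis would include; the function name is an assumption.

```python
# Sketch of a vibrational spectrum from the velocity-velocity autocorrelation
# function; 'velocities' has shape (n_steps, n_atoms, 3), dt_fs in femtoseconds.
import numpy as np


def vacf_spectrum(velocities, dt_fs):
    v = velocities.reshape(velocities.shape[0], -1)
    n = v.shape[0]
    # Autocorrelation via FFT (Wiener-Khinchin), summed over atoms/components.
    fv = np.fft.rfft(v, n=2 * n, axis=0)
    acf = np.fft.irfft(fv * np.conj(fv), axis=0)[:n].sum(axis=1)
    acf /= acf[0]
    # Power spectrum of the windowed autocorrelation function.
    spectrum = np.abs(np.fft.rfft(acf * np.hanning(n)))
    # Frequency axis: cycles/fs -> Hz -> wavenumbers (cm^-1).
    freq_cm = np.fft.rfftfreq(n, d=dt_fs) * 1e15 / 2.99792458e10
    return freq_cm, spectrum
```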

Finally, we illustrate the wider applicability of the sGDML model to more complex molecules than ethanol by performing a detailed analysis of MD simulations for malonaldehyde and aspirin. In Fig. 4a, we show the joint probability distributions of the dihedral angles (PDDA) for the malonaldehyde molecule. This molecule has a peculiar PES with two local minima featuring a symmetric O⋯H⋯O interaction (structure (1)), and a shallow region where the molecule fluctuates between two symmetric global minima (structure (2)). The dynamical behavior represented in structure (2) is due to the interplay of two molecular states dominated by an intramolecular O–H interaction and a low crossing barrier of ~0.2 kcal mol−1. An interesting result is that structure (1) is nearly unvisited by sGDML@DFT in comparison with the sGDML@CCSD(T) model, despite the great similarity of their PESs, which illustrates the observable consequences of subtle energy differences in the PES of molecules with several degrees of freedom. In terms of spectroscopy, the two approximations generate spectra with very few differences (Fig. 4a-right), the most prominent being the one between the two peaks around 500 cm−1. This difference can be traced back to the enhanced sampling of structure (1), and it could additionally be associated with the different ways in which the two methods describe the intramolecular O–H coupling.
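
A joint dihedral distribution of this kind can be estimated directly from a trajectory, e.g., as sketched below; the atom quadruples defining the two dihedrals are placeholders that must be chosen for the molecule at hand, and the function names are not part of any published code.

```python
# Sketch of the joint probability distribution of two dihedral angles (PDDA)
# from an MD trajectory of shape (n_frames, n_atoms, 3).
import numpy as np


def dihedral(r, i, j, k, l):
    """Signed dihedral angle (degrees) defined by atoms i-j-k-l."""
    b1, b2, b3 = r[j] - r[i], r[k] - r[j], r[l] - r[k]
    n1, n2 = np.cross(b1, b2), np.cross(b2, b3)
    m1 = np.cross(n1, b2 / np.linalg.norm(b2))
    return np.degrees(np.arctan2(np.dot(m1, n2), np.dot(n1, n2)))


def joint_pdda(traj, dih_a, dih_b, bins=72):
    """dih_a, dih_b: 4-tuples of atom indices (placeholders for the molecule)."""
    a = np.array([dihedral(r, *dih_a) for r in traj])
    b = np.array([dihedral(r, *dih_b) for r in traj])
    hist, xe, ye = np.histogram2d(a, b, bins=bins,
                                  range=[[-180, 180], [-180, 180]], density=True)
    return hist, xe, ye
```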

Fig. 4

Analysis of MD simulations with sGDML for malonaldehyde and aspirin. The MD simulations at 300 K were carried out for 500 ps. a Joint probability distributions of the dihedral angles in malonaldehyde, describing the rotation of both aldehyde groups based on classical MD simulations for sGDML@CCSD(T) and sGDML@DFT. The configurations (1) and (2) are representative structures of the most sampled regions of the PES. b Joint probability distributions of the dihedral angles in aspirin, describing the rotation of the ester and carboxylic acid groups based on PIMD simulations for sGDML@CCSD and sGDML@DFT using 16 beads at 300 K. The potential energy profile for the ester angle in kcal mol−1 is shown for sGDML@CCSD (red), sGDML@DFT (blue) and compared with the CCSD reference (black, dashed). Contour lines show the differences of both distributions on a log scale. Both panels also show a comparison of the vibrational spectra generated via the velocity-velocity autocorrelation function obtained with sGDML@CCSD(T)/CCSD (red) and sGDML@DFT (blue)

For aspirin, the consequences of a proper inclusion of electron correlation are even more significant. Figure 4b shows the PIMD-generated PDDA for the DFT- and CCSD-based models. By comparing the two distributions we find that sGDML@CCSD generates localized dynamics in the global energy minimum, whereas the DFT model yields a rather delocalized sampling of the PES. These two contrasting results are explained by the difference in the energetic barriers along the ester dihedral angle. The incorporation of electron correlation in CCSD increases the internal barriers by ~1 kcal mol−1. This prediction was corroborated with explicit CCSD(T) calculations along the dihedral-angle coordinate (black dashed line in Fig. 4b-PES). Furthermore, the difference in the sampling is also due to the fact that the DFT model generates consistently softer interatomic interactions compared to CCSD, which leads to large and visible differences in the vibrational spectra between DFT and CCSD (Fig. 4b-right).

Discussion

The present work enables MD simulations of flexible molecules with up to a few dozen atoms with the accuracy of high-level ab initio quantum mechanics. Such simulations pave the way to computations of dynamical and thermodynamical properties of molecules with an essentially exact description of the underlying PES. On the one hand, this is a required step towards molecular simulations with spectroscopic accuracy. On the other hand, our accurate and efficient sGDML model leads to unprecedented insights when interpreting the experimental vibrational spectra and dynamical behavior of molecules. The contrasting demands of accuracy and efficiency are satisfied by the sGDML model through a rigorous incorporation of physical symmetries (spatial, temporal, and local symmetries) into a gradient-domain ML approach. This is a significant improvement over symmetry adaptation in traditional force fields and electronic-structure calculations, where usually only (global) point groups are considered. Global symmetries are increasingly less likely to occur with growing molecule size, providing diminishing returns. Local symmetries, on the other hand, are independent of system size and are preserved even when the molecule is fragmented for large-scale modeling.

In many applications of machine-learned force fields, the target error is chemical or thermochemical accuracy (1 kcal mol−1), but this value was conceived for thermochemical experimental measurements, such as heats of formation or ionization potentials. Consequently, the accuracy of ML models for predicting the molecular PES should not be tied to this value. Here we propose that force fields should instead satisfy the stringent demands of molecular spectroscopy, which are typically in the range of wavenumbers (≈0.03 kcal mol−1). Reaching this accuracy will be one of the greatest challenges for ML-based force fields. We remark that energy differences between molecular conformers are often on the order of 0.1–0.2 kcal mol−1, hence reaching spectroscopic accuracy in molecular simulations is needed to generate predictive results.

A comparable accuracy is not obtainable with traditional force fields (see Fig. 5). In general, they miss most of the crucial quantum effects due to their rigid, handcrafted analytical form. For example, the absence of a term for electron lone pairs in AMBER leads to uncoupled rotors in ethanol. Furthermore, the oversimplified harmonic description of bonded interactions generates an unphysical harmonic sampling at room temperature (see Fig. 5a). In the case of malonaldehyde (Fig. 5b), both distributions misleadingly resemble each other; however, they emerge from different types of interactions. For AMBER, the dynamics are purely driven by Coulomb interactions, while the sampling with sGDML@CCSD(T) (structure (2) in Fig. 4a) is mostly guided by electron correlation effects. Lastly, a complete mismatch between the regular force field and sGDML is evident for aspirin (see Fig. 5c), where the interactions dominated by Coulomb forces generate a completely different PES with spurious global and local minima. It is worth mentioning that the observed shortcomings of the AMBER force field can be addressed for a particular molecule, although only at the cost of losing generality and computational efficiency.

Fig. 5

Accuracy of the sGDML model in comparison to a traditional force field. We contrast the dihedral angle probability distributions of ethanol, malonaldehyde, and aspirin obtained from classical MD simulations at 300 K with sGDML (left column) vs. the AMBER70 (right column) force field. The ethanol simulations were carried out at constant energy (NVE), whereas constant temperature (NVT) was used for malonaldehyde and aspirin. a Ethanol: the coupling between the hydroxyl and methyl rotors is absent in AMBER. Moreover, the probability distribution shows an unphysical harmonic sampling at room temperature, revealing the oversimplified harmonic description of bonded interactions in that force field. b, c Malonaldehyde and aspirin: the formulation of the AMBER force field is dominated by Coulomb interactions, which can lead to an incomplete description of the PES and even spurious global minima in the case of aspirin. The length of the simulations was 0.5 ns

In the context of ML, our work connects to recent studies on the use of invariance constraints for learning and representations in vision. In the human visual system, and also in computer vision algorithms, incorporating invariances such as translation, scaling, and rotation of objects can in principle permit higher performance with greater data efficiency61; learning-theoretical bounds furthermore show that the amount of data required is reduced by a factor given by the number of parameters of the invariance transformation62. Interestingly, our study goes empirically beyond this factor, i.e., our gain in data efficiency is often more than two orders of magnitude when combining the invariances (physical symmetries). We speculate that this finding may indicate that the learning problem itself becomes less complex, i.e., that the underlying problem structure becomes significantly easier to represent.

A number of challenges remain to be solved to extend the applicability of the sGDML model and its scaling to larger molecular systems. Given an extensive set of individually trained sGDML models, an unseen molecule could be represented as a non-linear combination of those models. This would allow scaling up and transferable predictions for molecules of similar size. Advanced sampling strategies could be employed to combine forces from different levels of theory to minimize the need for computationally intensive ab initio calculations. Our focus in this work was on intramolecular forces in small- and medium-sized molecules. Looking ahead, it is sensible to integrate the sGDML model with an accurate intermolecular force field to enable predictive simulations of condensed molecular systems (ref. 63 presents an intermolecular model that would be particularly suited for coupling with sGDML). Many other avenues for further development exist64, including incorporating additional physical priors, reducing the dimensionality of complex PESs, computing reaction pathways, and modeling infrared, Raman, and other spectroscopic measurements.

Methods

Reference data generation

The data used for training the DFT models were created by running ab initio MD in the NVT ensemble using the Nosé-Hoover thermostat at 500 K for a 200 ps simulation with a resolution of 0.5 fs. We computed forces and energies using all-electron calculations at the generalized gradient approximation level of theory with the Perdew-Burke-Ernzerhof (PBE)65 exchange-correlation functional, treating van der Waals interactions with the Tkatchenko-Scheffler (TS) method66. All calculations were performed with FHI-aims67. The final training data were generated by subsampling the full trajectory while preserving the Maxwell-Boltzmann distribution of the energies.
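
One simple way to realize such distribution-preserving subsampling is sketched below: geometries are drawn from each bin of the trajectory's energy histogram in proportion to that bin's population. This stratified scheme is an assumption for illustration; the authors' exact protocol is not specified beyond preserving the Maxwell-Boltzmann statistics.

```python
# Sketch of subsampling reference geometries from a long MD trajectory while
# preserving the distribution of potential energies.
import numpy as np


def subsample(energies, n_samples, n_bins=50, seed=None):
    rng = np.random.default_rng(seed)
    counts, edges = np.histogram(energies, bins=n_bins)
    bin_of = np.clip(np.digitize(energies, edges) - 1, 0, n_bins - 1)
    picks = []
    for b in range(n_bins):
        idx = np.flatnonzero(bin_of == b)
        # Draw from each energy bin proportionally to its population, so the
        # Boltzmann statistics of the 500 K trajectory are retained.
        k = int(round(n_samples * counts[b] / len(energies)))
        if k and len(idx):
            picks.append(rng.choice(idx, size=min(k, len(idx)), replace=False))
    return np.sort(np.concatenate(picks)) if picks else np.array([], dtype=int)
```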

To create the coupled cluster datasets, we reused the same geometries as for the DFT models and recomputed energies and forces using all-electron coupled cluster with single, double, and perturbative triple excitations (CCSD(T)). Dunning's correlation-consistent cc-pVTZ basis set was used for ethanol, cc-pVDZ for toluene and malonaldehyde, and CCSD/cc-pVDZ for aspirin. All calculations were performed with the Psi468 software suite.
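
As a rough illustration of this relabeling step, the snippet below computes a CCSD(T)/cc-pVTZ energy and gradient for a single placeholder ethanol geometry with Psi4's Python API; the geometry, memory setting, and options shown are illustrative and not the authors' production inputs.

```python
# Sketch: recompute energy and force labels for one geometry with Psi4.
import numpy as np
import psi4

psi4.set_memory('8 GB')
psi4.set_options({'freeze_core': False})   # all-electron, as in the reference data

# Placeholder ethanol geometry in Angstrom; in practice every DFT-sampled
# geometry of the training set is relabeled this way.
mol = psi4.geometry("""
0 1
C  0.000  0.000  0.000
C  1.512  0.000  0.000
O  2.040  1.330  0.000
H  2.990  1.280  0.000
H -0.380  1.010  0.150
H -0.400 -0.620  0.800
H -0.360 -0.410 -0.950
H  1.900 -0.520  0.880
H  1.890 -0.510 -0.890
""")

grad, wfn = psi4.gradient('ccsd(t)/cc-pvtz', return_wfn=True)
energy = wfn.energy()                      # Hartree
forces = -grad.to_array()                  # Hartree/Bohr; force labels for sGDML
```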

Molecular dynamics

In order to incorporate the crucial effects induced by quantum nuclear delocalization, we used PIMD, which incorporates quantum-mechanical effects into MD simulations via Feynman's path-integral formalism. The PIMD simulations were performed with the sGDML model interfaced to the i-PI code69. The integration timestep was set to 0.2 fs to ensure energy conservation along the MD in the NVE and NVT ensembles. The total simulation time for ethanol (Fig. 3) was 1 ns, using 16 beads in the PIMD, to obtain a converged sampling of the PES.
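
For illustration, the sketch below shows a bare NVE velocity-Verlet integrator with the stated 0.2 fs timestep around a generic force callback; it stands in for, and is not, the i-PI machinery actually used for the (PI)MD simulations, and all names are hypothetical.

```python
# Minimal NVE velocity-Verlet loop around a generic force provider (e.g., a
# trained sGDML model wrapped to return forces in N for positions in m).
import numpy as np

FS = 1e-15  # one femtosecond in seconds


def velocity_verlet(r, v, masses, force_fn, dt=0.2 * FS, n_steps=1000):
    """r, v: (n_atoms, 3) arrays in SI units; masses: (n_atoms,) in kg."""
    f = force_fn(r)
    traj = [r.copy()]
    for _ in range(n_steps):
        v += 0.5 * dt * f / masses[:, None]   # half kick
        r += dt * v                           # drift
        f = force_fn(r)
        v += 0.5 * dt * f / masses[:, None]   # second half kick
        traj.append(r.copy())
    return np.array(traj)
```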

Bipartite matching cost matrix

For the bipartite matching of a pair of molecular graphs, we solve the optimal assignment problem for the eigenvectors of their adjacency matrices using the Hungarian algorithm56. As input, this algorithm expects a matrix with all pairwise assignment costs $\mathbf{C}_M = -\mathbf{M}$, which is constructed as the negative overlap matrix from Eq. (2). We add a penalty matrix with entries $(\mathbf{C}_z)_{ij} = |z_i - z_j|\,\varepsilon$ that prevents the matching of non-identical nuclei for sufficiently large ε > 0. The final cost matrix is then $\mathbf{C} = \mathbf{C}_M + \mathbf{C}_z$.
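
A minimal sketch of this cost construction is given below, assuming the overlap matrix M of Eq. (2) and the vector of nuclear charges z are already available as NumPy arrays; the function name and the default value of ε are illustrative.

```python
# Sketch of the final assignment cost matrix: the negative eigenvector overlap
# from Eq. (2) plus a penalty that forbids matching atoms of different species.
import numpy as np


def assignment_cost(M, z, eps=1e6):
    """M: (N x N) overlap matrix; z: nuclear charges of the N atoms."""
    C_z = np.abs(z[:, None] - z[None, :]) * eps   # large cost for unequal nuclei
    return -M + C_z
```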

Training sGDML

The symmetric kernel formulation approximates the similarities in the kernel matrix between different permutational configurations of the inputs, as they would appear with a fully symmetrized training set. We construct this object as the sum over all relevant atom assignments for each training geometry, such that the kernel matrix retains its original size. This procedure is used to symmetrize the GDML model47, where the symmetric kernel function takes the form

$$\mathrm{Hess}\!\left(\kappa_{\mathrm{sym}}\right)\!\left(\mathbf{x}, \mathbf{x}'\right) = \frac{1}{S}\sum_{p,q}^{S} \mathbf{P}_p\, \mathrm{Hess}\!\left(\kappa\right)\!\left(\mathbf{P}_p\mathbf{x}, \mathbf{P}_q\mathbf{x}'\right)\mathbf{P}_q. \qquad (7)$$

Note that the rows and columns of the Hessian in the summand are permuted (using $\mathbf{P}_p$ and $\mathbf{P}_q$) such that the corresponding partial derivatives align. When evaluating the model, the free variable $\mathbf{x}$ (first argument of the kernel function) is not permuted and the normalization factor is dropped (see Eq. (5)). See Supplementary Note 3 for information on how to use the sGDML model when the input is represented by a descriptor.
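
A direct transcription of Eq. (7) is sketched below for a Gaussian kernel on Cartesian coordinates (the production model applies the same construction to a Matérn kernel on a descriptor, see Supplementary Note 3); perm_matrix expands an atom permutation into its 3N × 3N representation, and all names are illustrative.

```python
# Sketch of the symmetrized kernel Hessian block of Eq. (7) between two
# training geometries x and x_p (flattened 3N vectors).
import numpy as np


def gauss_hessian(x, y, sigma=1.0):
    """Mixed second derivatives of a Gaussian kernel (illustrative stand-in)."""
    d = x - y
    k = np.exp(-np.dot(d, d) / (2.0 * sigma ** 2))
    return (np.eye(x.size) / sigma ** 2 - np.outer(d, d) / sigma ** 4) * k


def perm_matrix(P, n_atoms):
    """Expand an atom permutation P into the corresponding 3N x 3N matrix."""
    Pm = np.zeros((3 * n_atoms, 3 * n_atoms))
    for a, b in enumerate(P):
        Pm[3 * a:3 * a + 3, 3 * b:3 * b + 3] = np.eye(3)
    return Pm


def hess_kernel_sym(x, x_p, perms, sigma=1.0):
    """Training kernel block of Eq. (7), averaged over all S^2 permutation pairs."""
    n_atoms = x.size // 3
    S = len(perms)
    block = np.zeros((3 * n_atoms, 3 * n_atoms))
    for P in perms:
        Pp = perm_matrix(P, n_atoms)
        for Q in perms:
            Pq = perm_matrix(Q, n_atoms)
            block += Pp @ gauss_hessian(Pp @ x, Pq @ x_p, sigma) @ Pq
    return block / S
```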

Acknowledgements

We thank Michael Gastegger for providing the AMBER force fields. S.C., A.T., and K.-R.M. thank the Deutsche Forschungsgemeinschaft (project MU 987/20-1) for funding this work. A.T. is funded by the European Research Council with ERC-CoG grant BeStMo. K.-R.M. gratefully acknowledges the BK21 program funded by the Korean National Research Foundation grant (no. 2012-005741). Part of this research was performed while the authors were visiting the Institute for Pure and Applied Mathematics, which is supported by the NSF.

Author contributions

S.C. conceived and constructed the sGDML models. S.C., H.E.S., A.T., and K.-R.M. developed the theory. H.E.S. and A.T. designed the analyses. H.E.S. performed the DFT and CCSD(T) calculations and MD simulations. S.C. and H.E.S. created the figures, with help from other authors. All authors wrote the paper, discussed the results and commented on the manuscript.

Data availability

All datasets used in this work are available at http://quantum-machine.org/datasets/. Additional data related to this paper may be requested from the authors.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Klaus-Robert Müller, Email: klaus-robert.mueller@tu-berlin.de.

Alexandre Tkatchenko, Email: alexandre.tkatchenko@uni.lu.

Electronic supplementary material

Supplementary Information accompanies this paper at 10.1038/s41467-018-06169-2.

References

  • 1.Tuckerman, M. Statistical Mechanics: Theory and Molecular Simulation (Oxford University Press, Oxford, UK, 2010).
  • 2.Koch, W. & Holthausen, M. C. A Chemist's Guide to Density Functional Theory (John Wiley & Sons, Hoboken, New Jersey, USA, 2015).
  • 3.Partridge H, Schwenke DW. The determination of an accurate isotope dependent potential energy surface for water from extensive ab initio calculations and experimental data. J. Chem. Phys. 1997;106:4618–4639. doi: 10.1063/1.473987. [DOI] [Google Scholar]
  • 4.Mizukami W, Habershon S, Tew DP. A compact and accurate semi-global potential energy surface for malonaldehyde from constrained least squares regression. J. Chem. Phys. 2014;141:144310. doi: 10.1063/1.4897486. [DOI] [PubMed] [Google Scholar]
  • 5.Schran C, Uhl F, Behler J, Marx D. High-dimensional neural network potentials for solvation: the case of protonated water clusters in helium. J. Chem. Phys. 2018;148:102310. doi: 10.1063/1.4996819. [DOI] [PubMed] [Google Scholar]
  • 6.Behler J, Parrinello M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 2007;98:146401. doi: 10.1103/PhysRevLett.98.146401. [DOI] [PubMed] [Google Scholar]
  • 7.Bartók AP, Payne MC, Kondor R, Csányi G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 2010;104:136403. doi: 10.1103/PhysRevLett.104.136403. [DOI] [PubMed] [Google Scholar]
  • 8.Jose KVJ, Artrith N, Behler J. Construction of high-dimensional neural network potentials using environment-dependent atom pairs. J. Chem. Phys. 2012;136:194111. doi: 10.1063/1.4712397. [DOI] [PubMed] [Google Scholar]
  • 9.Rupp M, Tkatchenko A, Müller KR, von Lilienfeld OA. Fast and accurate modeling of molecular atomization energies with machine learning. Phys. Rev. Lett. 2012;108:058301. doi: 10.1103/PhysRevLett.108.058301. [DOI] [PubMed] [Google Scholar]
  • 10.Montavon G, et al. Machine learning of molecular electronic properties in chemical compound space. New J. Phys. 2013;15:095003. doi: 10.1088/1367-2630/15/9/095003. [DOI] [Google Scholar]
  • 11.Bartók AP, Kondor R, Csányi G. On representing chemical environments. Phys. Rev. B. 2013;87:184115. doi: 10.1103/PhysRevB.87.184115. [DOI] [Google Scholar]
  • 12.Hansen K, et al. Assessment and validation of machine learning methods for predicting molecular atomization energies. J. Chem. Theory Comput. 2013;9:3404–3419. doi: 10.1021/ct400195d. [DOI] [PubMed] [Google Scholar]
  • 13.Hansen K, et al. Machine learning predictions of molecular properties: accurate many-body potentials and nonlocality in chemical space. J. Phys. Chem. Lett. 2015;6:2326–2331. doi: 10.1021/acs.jpclett.5b00831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rupp M, Ramakrishnan R, von Lilienfeld OA. Machine learning for quantum mechanical properties of atoms in molecules. J. Phys. Chem. Lett. 2015;6:3309–3313. doi: 10.1021/acs.jpclett.5b01456. [DOI] [Google Scholar]
  • 15.Bartók AP, Csányi G. Gaussian approximation potentials: a brief tutorial introduction. Int. J. Quantum Chem. 2015;115:1051–1057. doi: 10.1002/qua.24927. [DOI] [Google Scholar]
  • 16.Botu V, Ramprasad R. Learning scheme to predict atomic forces and accelerate materials simulations. Phys. Rev. B. 2015;92:094306. doi: 10.1103/PhysRevB.92.094306. [DOI] [Google Scholar]
  • 17.Li Z, Kermode JR, De Vita A. Molecular dynamics with on-the-fly machine learning of quantum-mechanical forces. Phys. Rev. Lett. 2015;114:096405. doi: 10.1103/PhysRevLett.114.096405. [DOI] [PubMed] [Google Scholar]
  • 18.Eickenberg M, Exarchakis G, Hirn M, Mallat S, Thiry L. Solid harmonic wavelet scattering for predictions of molecule properties. J. Chem. Phys. 2018;148:241732. doi: 10.1063/1.5023798. [DOI] [PubMed] [Google Scholar]
  • 19.Behler J. Perspective: machine learning potentials for atomistic simulations. J. Chem. Phys. 2016;145:170901. doi: 10.1063/1.4966192. [DOI] [PubMed] [Google Scholar]
  • 20.De S, Bartok AP, Csányi G, Ceriotti M. Comparing molecules and solids across structural and alchemical space. Phys. Chem. Chem. Phys. 2016;18:13754–13769. doi: 10.1039/C6CP00415F. [DOI] [PubMed] [Google Scholar]
  • 21.Brockherde F, et al. Bypassing the Kohn-Sham equations with machine learning. Nat. Commun. 2017;8:872. doi: 10.1038/s41467-017-00839-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Artrith N, Urban A, Ceder G. Efficient and accurate machine-learning interpolation of atomic energies in compositions with many species. Phys. Rev. B. 2017;96:014112. doi: 10.1103/PhysRevB.96.014112. [DOI] [Google Scholar]
  • 23.Podryabinkin EV, Shapeev AV. Active learning of linearly parametrized interatomic potentials. Comput. Mater. Sci. 2017;140:171–180. doi: 10.1016/j.commatsci.2017.08.031. [DOI] [Google Scholar]
  • 24.Bartók AP, et al. Machine learning unifies the modeling of materials and molecules. Sci. Adv. 2017;3:e1701816. doi: 10.1126/sciadv.1701816. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Glielmo A, Sollich P, De Vita A. Accurate interatomic force fields via machine learning with covariant kernels. Phys. Rev. B. 2017;95:214302. doi: 10.1103/PhysRevB.95.214302. [DOI] [Google Scholar]
  • 26.Gastegger M, Behler J, Marquetand P. Machine learning molecular dynamics for the simulation of infrared spectra. Chem. Sci. 2017;8:6924–6935. doi: 10.1039/C7SC02267K. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Schütt KT, Arbabzadah F, Chmiela S, Müller KR, Tkatchenko A. Quantum-chemical insights from deep tensor neural networks. Nat. Commun. 2017;8:13890. doi: 10.1038/ncomms13890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Yao K, Herr JE, Parkhill J. The many-body expansion combined with neural networks. J. Chem. Phys. 2017;146:014106. doi: 10.1063/1.4973380. [DOI] [PubMed] [Google Scholar]
  • 29.Dral PO, Owens A, Yurchenko SN, Thiel W. Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels. J. Chem. Phys. 2017;146:244108. doi: 10.1063/1.4989536. [DOI] [PubMed] [Google Scholar]
  • 30.John S, Csányi G. Many-body coarse-grained interactions using gaussian approximation potentials. J. Phys. Chem. B. 2017;121:10934–10949. doi: 10.1021/acs.jpcb.7b09636. [DOI] [PubMed] [Google Scholar]
  • 31.Huang, B. & von Lilienfeld, O. The “DNA” of chemistry: scalable quantum machine learning with “amons”. Preprint at https://arxiv.org/abs/1707.04146 (2017).
  • 32.Faber FA, et al. Prediction errors of molecular machine learning models lower than hybrid DFT error. J. Chem. Theory Comput. 2017;13:5255–5264. doi: 10.1021/acs.jctc.7b00577. [DOI] [PubMed] [Google Scholar]
  • 33.Huan TD, et al. A universal strategy for the creation of machine learning-based atomistic force fields. npj Comput. Mater. 2017;3:37. [Google Scholar]
  • 34.Schütt K, et al. SchNet: a continuous-filter convolutional neural network for modeling quantum interactions. Adv. Neural Inf. Process. Syst. 2017;31:991–1001. [Google Scholar]
  • 35.Mardt A, Pasquali L, Wu H, Noé F. VAMPnets for deep learning of molecular kinetics. Nat. Commun. 2018;9:5. doi: 10.1038/s41467-017-02388-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Glielmo A, Zeni C, De Vita A. Efficient nonparametric n-body force fields from machine learning. Phys. Rev. B. 2018;97:184307. doi: 10.1103/PhysRevB.97.184307. [DOI] [Google Scholar]
  • 37.Zhang L, Han J, Wang H, Car R, Weinan E. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 2018;120:143001. doi: 10.1103/PhysRevLett.120.143001. [DOI] [PubMed] [Google Scholar]
  • 38.Lubbers N, Smith JS, Barros K. Hierarchical modeling of molecular energies using a deep neural network. J. Chem. Phys. 2018;148:241715. doi: 10.1063/1.5011181. [DOI] [PubMed] [Google Scholar]
  • 39.Tang YH, Zhang D, Karniadakis GE. An atomistic fingerprint algorithm for learning ab initio molecular force fields. J. Chem. Phys. 2018;148:034101. doi: 10.1063/1.5008630. [DOI] [PubMed] [Google Scholar]
  • 40.Grisafi A, Wilkins DM, Csányi G, Ceriotti M. Symmetry-adapted machine learning for tensorial properties of atomistic systems. Phys. Rev. Lett. 2018;120:036002. doi: 10.1103/PhysRevLett.120.036002. [DOI] [PubMed] [Google Scholar]
  • 41.Ryczko K, Mills K, Luchak I, Homenick C, Tamblyn I. Convolutional neural networks for atomistic systems. Comput. Mater. Sci. 2018;149:134–142. doi: 10.1016/j.commatsci.2018.03.005. [DOI] [Google Scholar]
  • 42.Kanamori K, et al. Exploring a potential energy surface by machine learning for characterizing atomic transport. Phys. Rev. B. 2018;97:125124. doi: 10.1103/PhysRevB.97.125124. [DOI] [Google Scholar]
  • 43.Pronobis W, Tkatchenko A, Müller KR. Many-body descriptors for predicting molecular properties with machine learning: analysis of pairwise and three-body interactions in molecules. J. Chem. Theory Comput. 2018;14:2991–3003. doi: 10.1021/acs.jctc.8b00110. [DOI] [PubMed] [Google Scholar]
  • 44.Hy TS, Trivedi S, Pan H, Anderson BM, Kondor R. Predicting molecular properties with covariant compositional networks. J. Chem. Phys. 2018;148:241745. doi: 10.1063/1.5024797. [DOI] [PubMed] [Google Scholar]
  • 45.Smith, J. S. et al. Outsmarting quantum chemistry through transfer learning. Preprint at https://chemrxiv.org/articles/Outsmarting_Quantum_Chemistry_Through_Transfer_Learning/6744440 (2018).
  • 46.Yao K, Herr JE, Toth DW, Mckintyre R, Parkhill J. The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics. Chem. Sci. 2018;9:2261–2269. doi: 10.1039/C7SC04934J. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Chmiela S, et al. Machine learning of accurate energy-conserving molecular force fields. Sci. Adv. 2017;3:e1603015. doi: 10.1126/sciadv.1603015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Wilson, E. B., Decius, J. C. & Cross, P. C. Molecular Vibrations: The Theory of Infrared and Raman Vibrational Spectra (McGraw-Hill, New York, 1955). [Google Scholar]
  • 49.Longuet-Higgins H. The symmetry groups of non-rigid molecules. Mol. Phys. 1963;6:445–460. doi: 10.1080/00268976300100501. [DOI] [Google Scholar]
  • 50.Pachauri, D., Kondor, R. & Singh, V. Solving the multi-way matching problem by permutation synchronization. Adv. Neural Inf. Process. Syst. 26, 1860–1868 (2013).
  • 51.Schiavinato M, Gasparetto A, Torsello A. Transitive Assignment Kernels for Structural Classification. Cham: Springer International Publishing; 2015. pp. 146–159. [Google Scholar]
  • 52.Kriege NM, Giscard PL, Wilson RC. On valid optimal assignment kernels and applications to graph classification. Adv. Neural Inf. Process. Syst. 2016;30:1623–1631. [Google Scholar]
  • 53.Vishwanathan SVN, Schraudolph NN, Kondor R, Borgwardt KM. Graph kernels. J. Mach. Learn. Res. 2010;11:1201–1242. [Google Scholar]
  • 54.Ferré G, Haut T, Barros K. Learning potential energy landscapes using graph kernels. J. Chem. Phys. 2017;146:114107. doi: 10.1063/1.4978623. [DOI] [PubMed] [Google Scholar]
  • 55.Umeyama S. An eigendecomposition approach to weighted graph matching problems. IEEE Trans. Pattern Anal. Mach. Intell. 1988;10:695–703. doi: 10.1109/34.6778. [DOI] [Google Scholar]
  • 56.Kuhn HW. The Hungarian method for the assignment problem. Nav. Res. Logist. 1955;2:83–97. doi: 10.1002/nav.3800020109. [DOI] [Google Scholar]
  • 57.González L, Mó O, Yáñez M. Density functional theory study on ethanol dimers and cyclic ethanol trimers. J. Chem. Phys. 1999;111:3855–3861. doi: 10.1063/1.479689. [DOI] [Google Scholar]
  • 58.Durig J, Larsen R. Torsional vibrations and barriers to internal rotation for ethanol and 2,2,2-trifluoroethanol. J. Mol. Struct. 1990;238:195–222. doi: 10.1016/0022-2860(90)85015-B. [DOI] [Google Scholar]
  • 59.Wassermann TN, Suhm MA. Ethanol monomers and dimers revisited: a Raman study of conformational preferences and argon nanocoating effects. J. Phys. Chem. A. 2010;114:8223–8233. doi: 10.1021/jp104861q. [DOI] [PubMed] [Google Scholar]
  • 60.Durig J, Bucy W, Wurrey C, Carreira L. Raman spectra of gases. XVI. Torsional transitions in ethanol and ethanethiol. J. Phys. Chem. A. 1975;79:988–993. doi: 10.1021/j100577a009. [DOI] [Google Scholar]
  • 61.Poggio T, Anselmi F. Visual Cortex and Deep Networks: Learning Invariant Representations. Cambridge, MA: MIT Press,; 2016. [Google Scholar]
  • 62.Anselmi F, Rosasco L, Poggio T. On invariance and selectivity in representation learning. Inf. Inference. 2016;5:134–158. doi: 10.1093/imaiai/iaw009. [DOI] [Google Scholar]
  • 63.Bereau T, DiStasio RA, Jr, Tkatchenko A, von Lilienfeld OA. Non-covalent interactions across organic and biological subsets of chemical space: physics-based potentials parametrized from machine learning. J. Chem. Phys. 2018;148:241706. doi: 10.1063/1.5009502. [DOI] [PubMed] [Google Scholar]
  • 64.De Luna P, Wei J, Bengio Y, Aspuru-Guzik A, Sargent E. Use machine learning to find energy materials. Nature. 2017;552:23. doi: 10.1038/d41586-017-07820-6. [DOI] [PubMed] [Google Scholar]
  • 65.Perdew JP, Burke K, Ernzerhof M. Generalized gradient approximation made simple. Phys. Rev. Lett. 1996;77:3865–3868. doi: 10.1103/PhysRevLett.77.3865. [DOI] [PubMed] [Google Scholar]
  • 66.Tkatchenko A, Scheffler M. Accurate molecular van der waals interactions from ground-state electron density and free-atom reference data. Phys. Rev. Lett. 2009;102:073005. doi: 10.1103/PhysRevLett.102.073005. [DOI] [PubMed] [Google Scholar]
  • 67.Blum V, et al. Ab initio molecular simulations with numeric atom-centered orbitals. Comput. Phys. Commun. 2009;180:2175–2196. doi: 10.1016/j.cpc.2009.06.022. [DOI] [Google Scholar]
  • 68.Parrish RM, et al. Psi4 1.1: an open-source electronic structure program emphasizing automation, advanced libraries, and interoperability. J. Chem. Theory Comput. 2017;13:3185–3197. doi: 10.1021/acs.jctc.7b00174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Ceriotti M, More J, Manolopoulos DE. i-PI: a python interface for ab initio path integral molecular dynamics simulations. Comput. Phys. Commun. 2014;185:1019–1026. doi: 10.1016/j.cpc.2013.10.027. [DOI] [Google Scholar]
  • 70.Case, D. et al. Amber 2018 (The Amber Project, 2018).
