Abstract
Machine learned interatomic interaction potentials have enabled efficient and accurate molecular simulations of closed systems. However, external fields, which can greatly change the chemical structure and/or reactivity, have been seldom included in current machine learning models. This work proposes a universal field-induced recursively embedded atom neural network (FIREANN) model, which integrates a pseudo field vector-dependent feature into atomic descriptors to represent system-field interactions with rigorous rotational equivariance. This “all-in-one” approach correlates various response properties like dipole moment and polarizability with the field-dependent potential energy in a single model, very suitable for spectroscopic and dynamics simulations in molecular and periodic systems in the presence of electric fields. Especially for periodic systems, we find that FIREANN can overcome the intrinsic multiple-value issue of the polarization by training atomic forces only. These results validate the universality and capability of the FIREANN method for efficient first-principles modeling of complicated systems in strong external fields.
Subject terms: Molecular dynamics, Chemical physics, Chemical physics, Reaction kinetics and dynamics
External fields, despite their significant influence on chemical processes, have been largely ignored in current machine learning potentials. Here, the authors introduce a field-induced model that captures system-field interactions rigorously.
Introduction
The interplay between external fields and chemical systems is of fundamental importance in a range of physical, chemical, and biological processes1,2. By interacting with atoms, molecules, or condensed matter, external (mainly electric) fields can induce electronic/spin polarization and spatial orientation of the system, which have offered a particular means to alter chemical structures3, promote electron transfer4, control phase transitions of materials5 or conformational transformations of biomolecules6, subtly manipulate chemical reactivity and selectivity in catalysis7–10 and quantum dynamics in cold chemical reactions11–13.
Exact field-dependent quantum scattering calculations were feasible only for very small systems14. Density functional theory (DFT) and ab initio molecular dynamics (AIMD) simulations based on the modern theory of polarization15 have been more commonly applied to study more complex aperiodic and periodic systems in the presence of external electric fields16–20. However, the AIMD approach remains very demanding, especially when nuclear quantum effects (NQEs) are important19. Although empirical force fields can be instead highly efficient21,22, their accuracy is limited by empirical functions and approximate expressions for the interaction Hamiltonian. For example, the commonly used dipole-field approximation truncates the perturbation of the system by an electric field to the first order (i.e. only the interaction with the permanent dipole is included) and omits higher-order interactions associated with polarizability, hyperpolarizability, and so on. Moreover, except these reactive force fields23–25, most of them fall short of describing bond breakage/formation.
Recent years have witnessed revolutionary successes of machine learning (ML) methods in solving high-dimensional problems in chemistry26–32. Various ML models for accurately representing potential energy surfaces (PESs) have been developed33–48. Some of them have been extended to learn tensorial properties such as the dipole moment and polarizability tensor with correct rotational equivariance48–60, enabling efficient field-free simulations of electronic and vibrational spectra. However, most ML models treat the potential energy and its response properties to electric fields separately, without capturing the field dependence. The influence of electric fields was first taken into account by Christensen et al. in a kernel-based regression method by constructing a fictitious dipole arising from fictitious partial charges and coupling it with the field vector to yield a scalar dipole-field interaction analog61. Müller and coworkers incorporated the dipole-field interaction similarly in their FieldSchNet neural network (NN) model to describe the solvent effect in the form of an effective field62. Gao and Remsing separated the long- and short-range interactions by introducing self-consistent effective electric fields, which are used as input for subsequent NNs along with local atomic coordinates to describe perturbations to the short-range system from the effective electric field. This strategy however does not rigorously fulfill the rotational equivariance with respect to the field direction63.
In this work, by introducing a simple field-dependent feature into the description of the atomic environment, we develop a field-induced recursively embedded atom neural network (FIREANN) model with correct rotational equivariance of a system interacting with an external field. Without any truncation of field-induced interactions, FIREANN describes not only the energy variation with applied field strength and direction, but also the associated response properties simultaneously up to (in principle) any order. Path-integral-based molecular dynamics (MD) simulations with well-trained FIREANN models yield reliable ab initio spectroscopy of representative molecular and condensed phase systems in the presence of electric fields. A remarkable characteristic of this model is that it can detour the multivalued issue of polarization in periodic systems, a well-known but largely neglected fact in existing ML models, by learning atomic forces only.
Results
FIREANN framework
The FIREANN model is built on top of a physics-inspired recursively embedded atom neural network (REANN) framework64 that uses embedded atom densities (EADs) as descriptors to atomic environment (Detailed in the Methods section). In the absence of electric fields, EADs are constructed in a quantum chemical spirit by the linear combination of Gaussian-type orbitals (GTOs) of surrounding atoms, preserving the overall rotational, translational, and permutational invariance of the system. However, an applied field can certainly redistribute the electron density and break down the rotational invariance of the system. The corresponding field-system interaction depends on the direction and strength of the electric field. To characterize this influence in a physically meaningful way, for an applied field (), we include a virtual field vector-dependent function, namely,
1 |
Following the procedure of feature construction in Methods, the field-dependent orbital was combined into the GTO to form a field-induced EAD (FI-EAD) vector that comprises various density values determined by different sets of contracted coefficients,
2 |
Here, the applied field felt by each atom is represented by a position vector of a pseudo-atom relative to that atom (, as illustrated in Fig. 1). The FI-EAD feature can be rewritten in terms of interatomic distances and enclosed angles39,
3 |
where we have combined the radial Gaussian and switching functions in fm for simplicity. From Eq. (3), one immediately realizes that the FI-EAD feature depends not only on atomic coordinates, but also on the field strength and the closed angle θijε between and each vector. In practice, rotating the field or the system separately will lead to different FI-EAD values, while the synchronous rotation of the field and the system without altering the relative direction of with respect to each coordinate will not. This FI-EAD feature captures the nature of the interaction between the system and the applied electric field without changing the physical form of the EAD feature. The resulting rotational equivariance is conserved in any subsequent message passing of the environment- and field-dependent orbital coefficients (cj and cε) and thus in the final potential energy. When = 0, this model naturally reduces to the original REANN model. The only extra cost for evaluating FIREANN compared to the standard REANN is that of a field-induced orbital, which is almost negligible as evident from Eqs. (1) and (2).
As an extra benefit, the FIREANN framework intrinsically describes the response of the potential energy to an external field up to an arbitrary order by taking the analytical gradients of the potential energy with respect to the field vector. For example, the electric dipole moment () is the first and the polarizability tensor (α) the second-order response to electric fields,
4 |
These properties can be simultaneously learned in a FIREANN model by adopting the following loss function,
5 |
where Nb is the size of the batch dataset, the superscripts NN and Ref refer to the NN-predicted and reference quantities, and λV, λF, λμ, and λα represent the weights of the energy, force, dipole moment, and polarizability, respectively, in the loss function. Note that these response properties by construction offer correlated information on the field-dependence of the PES rather than being simply accumulated in the loss function40. As a result, they have a similar effect as that of forces and hessians, which can help improve the fitting quality. We also note that FIREANN not only applies to electric fields as demonstrated by numerical examples below, but also equally to magnetic fields in the same spirit, by which the magnetic dipole and/or magnetic polarizability can be obtained.
A toy system
We first take the H2O molecule as a toy system to verify the symmetry adaption of the FIREANN method subject to an external electric field. A FIREANN model was constructed with just a single equilibrium geometry lying in the yz plane and the electric field being 0.1 V Å−1 along the x direction. As displayed in Fig. 2a, when the molecule rotates about the x axis, its potential energy does not change at all as the field is always orthogonal to the molecular plane. This is exactly encoded in FIREANN. On the other hand, the potential energy varies with the molecular rotation about the y axis, as shown in Fig. 2b. The energy variation representing the interaction between the molecular dipole and the electric field is again well predicted by the FIREANN model. Importantly, FIREANN further exhibits excellent extrapolatability in Fig. 2c, where the field intensity along the x axis varies from −0.2 to 0.2 V Å−1, resulting in a symmetrical energy dependence on the field direction. The FIREANN model trained with a single data successfully reproduces the energy profile generated by DFT. In comparison, FieldSchNet fails to predict the correct field-induced energy dependence in the same condition and give a constant energy as shown in Fig. 2c. More detailed comparisons between FIREANN and FieldSchNet will be discussed below.
Molecular spectroscopy
A distinct feature of the proposed FIREANN model is its all-in-one predictions for energies (atomic forces) and response properties with and without an electric field. We first demonstrate this feature for the N-methylacetamide (NMA) molecule, which has been widely used as a model system of the amide group to construct spectroscopic maps and simulate the spectra of the peptide backbone52,65–68. Specifically, we constructed an FIREANN model by learning a mix set of ab initio energies, forces, dipole moments, and polarizabilities for the NMA molecule in an electric field varying from 0.0 to 0.4 V Å−1 along x direction. Figure 3 clearly shows that the universal FIREANN model achieves an excellent accuracy for energy, dipole moment, and polarizability, with corresponding root mean square errors (RMSEs) of 0.0053 eV, 0.028 Debye, and 0.51 a.u., respectively. Given the synchronous prediction of these quantities, the FIREANN model enables efficient MD simulations of IR and Raman spectra in comparison with experimental data.
Figure 4a compares the calculated and experimental field-free infrared (IR) spectra69 for NMA at 300 K. In general, the classical MD-based result agrees reasonably well with the experimental spectrum, even reproducing double peaks for the C-O stretching vibration (~1710 cm−1) corresponding to the well-known P/R rotational structure-induced splitting69. However, the calculated bands of Amide II, Amide III, Amide A (the N-H stretch band, ~3507 cm−1), and the band including C-H stretching mode and other bending overtones of the methyl group (~2950 cm−1) are apparently blue-shifted compared to experiment. This discrepancy is likely due to the neglect of NQEs in the classical treatment of these vibrational bands relevant to hydrogen atoms. To solve this problem, path-integral based thermostated ring polymer molecular dynamics (TRPMD)70 simulations were performed. The TRPMD result significantly improves the agreement with the experiment and reproduces most bands in not only their positions but also their shapes and intensities. Figure 4b compares the calculated and experimental resonance Raman spectra of NMA for zero field. Due to the lack of the experimental spectrum for a single NMA molecule, the measured liquid spectrum71 is taken form qualitative comparison. Encouragingly, while there is apparent mismatch regarding the relative peak intensities of low-frequency modes, the TRPMD spectrum reproduces most of the observed bands reasonably well.
FIREANN also predicts the in-field molecular spectra, as shown in Fig. 5, where the field strength increases from 0 to 0.4 V Å−1 every 0.1 V Å−1 along x direction. In these in-field IR spectra, the C-O stretching band seems most influenced by the applied field. As mentioned in the field-free spectrum, this band has an intrinsic P/R rotational double-peak structure. Interestingly, with increasing field strength, this P/R branch splitting gradually vanishes and the absorption peak gets narrower and higher. This phenomenon implies the interplay between the electric field and the molecular rotation. Indeed, the dipole moment of NMA, which is almost parallel to the C-O bond69,72, tends to reorient to the opposite direction of the electric field to minimize the energy. Increasing the field intensity increases the dipole-field interaction and more strongly confines the NMA molecule in the preferable orientation. In addition, a significant red shift of the CO stretching vibration is found roughly proportional to the field intensity. This redshift is likely a natural consequence of the weakening of the chemical bond by the applied electric field. A similar but smaller redshift is also found for the N-H stretching, consistent with the fact that the electron cloud of the C-O group is more polarizable than the N-H group, due apparently to the higher electron density there. Furthermore, we decompose the molecular polarizability into isotropic () and anisotropic () terms and exhibit corresponding Raman spectra in Fig. 5, respectively. The anisotropic Raman spectra show a similar field-dependence of the C-O stretching band for the same reason. However, the rotational splitting is absent in isotropic Raman spectra as the isotropic polarizability is rotation invariant. As a result, the increasing field results in only a pure red shift of the C-O stretching vibration.
Liquid water
The FIREANN model is by its atom-wise form capable of describing the response of periodic systems to external electric fields. We test this capability in liquid water. However, unlike molecular systems, the polarization (dipole moment per unit volume) of a periodic system is a multivalued quantity according to the modern theory of polarization15, resulting in multiple parallel branches that differ by a polarization quantum represented by the product of any lattice vector and the electronic charge and divided by the volume of the lattice15. This ill-defined multiplicity may lead to sudden jumps in the dipole moment. Figure 6a shows clearly the abrupt discontinuities in the evolution of the x component of DFT calculated dipole moment along an AIMD trajectory without an electric field. This accidental change in the dipole moment poses challenges for conventional atomistic ML models that learn field-free dipoles, which typically decompose the global dipole moment vector into local atomic dipoles and represent atomic dipoles by the product of atomic charges and position vectors49,50,52. Importantly, this discontinuity issue occurs more frequently under a high field strength, as seen in Fig. 6b. Likewise, the in-field total energy is supposed to be discontinuous at these configurations as the dipole-field interaction jumps. Schienbein also recognized the multiple-valued problem of the dipole moment and proposed to learn the atomic polar tensor which is the spatial derivative of dipole instead of learning the dipole itself. These smooth spatial derivatives can be transformed into time derivatives of dipole in MD to calculate autocorrelation functions, ultimately yielding the IR spectrum60. Similarly, learning the offsets of the Wannier centers relative to the corresponding O atoms can overcome this problem73, however, localizing Wannier centers themselves in more complex systems with strong non-local features, is not a trial task, which easily gets stuck in local minima and has severe convergence difficulties74. Furthermore, unlike FIREANN, these previous ML models are designed for field-free systems only, which does not describe the general response of a system to external fields and the field-dependent potential energy surface.
In the FIREANN framework, alternatively, this issue can be easily bypassed by training atomic forces only in the presence of electric field, because the gradient of the energy is actually unaffected. Although the dipole moment and polarizability are not explicitly involved, the interaction between the electric field and the system can be learned implicitly in the force-only training. The dipole moment can then be retrieved by the first-order gradient of the energy output with respect to the field vector. It should be noted that training atomic forces only will introduce an undetermined field-dependent constant to the total energy in our model. This will lead to certain undetermined constants to absolute thermodynamic properties. However, atomic forces (or any properties’ gradients with respect to atomic coordinates) are well represented by our model so that the changes of thermodynamic properties under a given electric field can still be correctly described, which are physically more meaningful in practical calculations. In addition, by including the polarizability in the loss function during the training process, one can eliminate the field dependence of the undetermined constant on the total energy. This allows us to compare energies and the changes in thermodynamic properties under different field strengths, as will be discussed later in the comparison with FieldSchNet.
To validate this strategy, we have constructed a FIREANN model of bulk water including 64 water molecules under an electric field up to 0.6 V Å−1 along x direction, using atomic forces as targets only (named FIREANN-wF hereafter). Our model yields accurate predictions for atomic forces, with an overall RMSE of 39.4 meV Å−1. In addition, the TRPMD-calculated field-free radial distribution functions (RDFs) of liquid water agree very well with previous on-the-fly results at the same DFT level75 and experimental data76, as shown in Fig. 7, further validating the accuracy of the FIREANN-wF potential. It is also beneficial to compare the dipole moments predicted by FIREANN-wF with DFT data. Since dipole moments should smoothly change as the configuration evolves, it is reasonable to correct any abrupt changes in the dipole moment calculated by DFT along an AIMD trajectory. This correction involves shifting the dipole moment by an integer multiplied by the product of the corresponding lattice vector and electronic charge. By applying this correction, one ensures that the adjusted dipole moment remains closest to its value in the previous step, allowing for a continuous variation of dipole moments along the trajectory. Impressively, as shown in Fig. 6, the corrected DFT dipole moments perfectly match the FIREANN-wF predictions, without any prior knowledge of these positions of sudden jumps. Interestingly, the FIREANN-wF model captures the drastic increase in the total polarization of the system under an intensive electric field, as shown in Fig. 6b. This is because of additivity of the molecular dipole moment as each water molecule tends to align its dipole moment to the field vector. Figure 6 also shows the poor performance of a flawed model training with multi-branched dipole moments in a brute-force way (named FIREANN-wD hereafter), which obviously fails to follow the correct evolution of the dipole moment. Note that dipole moments in both two models are obtained by calculating the energy gradient with respect to the electric field. It is worth stating that although it is viable to correct the dipole moment along a single AIMD trajectory because the configuration change is minor in adjacent steps, it is difficult to do so in practice for uncorrelated trajectories or for independent single-point calculations. In latter cases, the training dataset will likely include dipole moments of unpredictable branches and give large noises to conventional ML dipole models relying on the multiplication of atomic charges and position vectors.
Misrepresenting the dipole moment surface could have a significant influence on the resultant IR spectrum. Figure 8a compares the calculated and experimental IR spectra of liquid water at room temperature without an electric field. Thanks to the inclusion of NQEs by TRPMD, the FIREANN-wF model that offers both the correct PES and dipole moment surface, does capture well all experimental vibrational features77 including the O-H stretching (~3600 cm−1), H-O-H bending (~1690 cm−1), librational (~700 cm−1)78 and H-bonding stretching bands (~170 cm−1)78. Our results also agree well with previous theoretical ones obtained by on-the-fly TRPMD at the same DFT level75, likely with their DFT dipole moments corrected. By contrast, the IR spectrum predicted by the FIREANN-wF model using the same trajectories deviates significantly from the experimental counterpart. This comparison clearly highlights the necessity of using an ML model fulfilling the physical requirement of the dipole moment in predicting IR spectra. Finally, we show in Fig. 8b the predictions of the FIREANN-wF model for the IR spectrum with an electric field up to 0.4 V Å−1 along x direction. Interestingly, the electric field influences mostly the O-H stretching band, resulting in a progressive red shift upon with the increasing field intensity. Unlike the NMA molecule, the red shift here is not solely because of the softening of the O-H bond by the electric field, but also the more ordered structure as a result of the field-induced reorientation of water molecules to be parallel to the field vector20. This effect will render the liquid water structure more ice-like20, in which the O-H vibrational band is lower in frequency. In contrast, this ordering effect hinders the librational rotation of water molecules and naturally results in an increased frequency of the librational mode. In comparison, the H-O-H bending mode is barely affected by the electric field, since the bending motion leads to little change in the direction of the dipole moment.
Comparison with previous models
Although similar force-only training can be done using previous models proposed in refs. 61,62, their ways of incorporating the external field are completely different from the present FIREANN model, rendering the absence of important high-order field-system interactions. Specifically, these models consider the response of the system to the external field by adding the dot product of a virtual atomic dipole and the electric field vector (namely , a scalar value) to the standard atomic descriptor. By construction, high-order field-system interactions are missing in their descriptors. In contrast, by introducing a virtual field-dependent atomic orbital in our field-induced EAD descriptor, we capture the response of the electron density to the external field through orbital-orbital interactions (Eq. (2) in this work). In such a way, all interactions between the field and the atomic environment are included in our FIREANN model. This is a fundamental improvement over all previous models, which can be easily adapted to other equivariant features based ML models44,48,53 without altering their fundamental architectures. Indeed, Christensen et al.61 have clearly admitted that their kernel-based model cannot predict polarizability and other high-order response properties. Similar deficiencies arise in the FieldSchNet model as well. Although the nonlinear message passing NNs imposed in that model may learn part of high-order interactions, this incompleteness will cause qualitative failures of FieldSchNet in some cases.
To show this explicitly, we have applied the FieldSchNet package and compared its predictions with present ones using exactly the same dataset. As already presented in Fig. 2c, the FIREANN model perfectly captures the nonlinear energy variation of a water molecule as a function of the field strength. In contrast, the FieldSchNet model predicts no dependence of the energy on the field strength at all. This is because the water molecule lies in the yz plane so all atomic dipoles derived from FieldSchNet always lie in plane, resulting in zero coupling with the electric field applied along the x direction and a constant energy. In practice, all x relevant components in any response quantities (dipole moment, polarizability, etc.) predicted by FieldSchNet are zero.
This phenomenon is not limited to molecules but also applies to periodic systems. To show this, we construct an exemplary dataset consisting of 200 liquid water configurations exposed to an electric field along the x direction ranging from 0 to 0.6 V Å−1. Water molecules in these configurations are aligned in four evenly spaced layers (16 molecules in each) perpendicular to the x axis with the first and third layer equal to the second and fourth layers respectively, as shown in the inset of Fig. 9. To enable the comparison of field-dependent energies, we trained a FIREANN model and a FieldSchNet model respectively with both forces and polarizabilities (to eliminate the field dependence of the undetermined constant of the total energy), and aligned their field-free energies to the same zero point. The difference between FIREANN and FieldSchNet models is amplified in this system, where the RMSEs predicted on the test set by FIREANN and FieldSchNet are 54.5 meV Å−1 (2.1 a.u.) and 245.4 meV Å−1 (165.1 a.u.) for forces (polarizability), respectively. Again, the FIREANN model precisely captures the large energy variance up to an applied electric field of ±2 V Å−1, while the FieldSchNet energy remains constant and deviates from the correct DFT result by several eV, as displayed in Fig. 9. This result also validates the generalizability of the FIREANN model towards representing high-intensity external fields.
We note that this deficiency will generally appear in any configuration if all atomic dipoles along the applied field direction are zero, which would lead to an unphysical behavior near the corresponding configuration space and inevitable large fitting errors. For example, using the same full dataset of liquid water in this work and training with atomic forces and polarizabilities, the FIREANN and FieldSchNet models yield test RMSEs of 45.5 meV Å−1 (2.5 a.u.) and 184.7 meV Å−1 (12.9 a.u.), respectively. The much worse performance of FieldSchNet represents an indicator of its incomplete description of the field-system interaction. It is worth noting that our FIREANN implementation is more efficient than FieldSchNet, with a training time of 2.4 versus 7.6 minutes per epoch, when running on a single A100 GPU with a memory capacity of 80 GB.
Discussion
In this work, we have proposed a simple, accurate, and universal FIREANN model to learn the external field-dependent PES and response properties with the proper rotational equivariance. This model allows us to obtain all ingredients from one single training for modeling spectroscopy and dynamics of chemical systems with and without external electric fields. The validity of this model is supported by the good agreement between the predicted vibrational spectra of the NMA molecule and liquid water and field-free experimental data. Moreover, the field-induced alignment of the dipole moment and the softening of the covalent bond are clearly predicted in the in-field IR or Raman spectra. For periodic systems like liquid water, in particular, the intrinsic multi-valued polarization of the system results in the discontinuous dipole moments in the training data and makes it difficult to being represented by conventional machine learning models based on atomic charges. This issue is nicely bypassed in the FIREANN model by learning atomic forces only, which can yield both field-dependent potentials and dipole moments, and thus IR spectra of liquid water. Our results not only clearly validate the high accuracy of the all-in-one FIREANN model, but also elucidate the interplay between chemical systems and electric fields.
In the current implementation built on the original PyTorch79 framework, training a FIREANN model in the most complete scenario (including energy, forces, dipole, and polarizability tensor) will take 4 times longer than force-only training, as the former process requires sample-to-sample (high-order) gradients. This issue can be largely alleviated by an improved implementation based on the more recently released functorch80 module in a new version of PyTorch, which allows efficient computation of sample-to-sample (high-order) gradients. Although all results presented in this work are relevant to the system exposed to an electric field, the FIREANN framework can be extended to describe the response of the system to a magnetic field or even to an electromagnetic field by introducing another field vector-dependent virtual function in Eq. (2). This will allow a more complete description of magnetic fields interacting with the system than in ref. 62. Note that the current version of the FIREANN model is limited to describing the influence of a homogeneous external field. In the case of a non-uniform external field, the response of the electron density to the field is spatially dependent and must be explicitly considered. A feasible way is to discretize the non-uniform field to each atomic center and introduce a nonequivalent field-dependent function to each FI-EAD feature (as implicitly implied in Fig. 1) to approximate the response of each atomic density to the local field experienced by the central atom. Note that this adjustment is intended to introduce an external inhomogeneous field interacting with the entire system. This differs from an inhomogeneous electric field approximately generated by solvent environments, which acts only onto the embedding molecular center as described in ref. 62. These desirable features make the FIREANN approach very promising to efficiently modeling strong field-induced phenomena such as electrochemistry81,82, plasmonic chemistry83, and tip-induced catalytic reactions8.
Methods
REANN
The regular REANN model was proposed for representing field-free PESs64. Like all atomistic NN models, the total potential energy (E) is expressed in the sum of atom-wise contributions, and each atomic energy (Ei) is learned by feeding a vector of atomic features for describing the atom-centered environment to an atom-wise NN. In the REANN model, EAD atomic features are specified to include many-body correlations between the central and neighbor atoms, which are simply evaluated by the square of the linear combination of a set of contracted GTOs located at neighbor atoms,
6 |
where the primitive GTO takes the following form,
7 |
and the contraction combines different shapes of primitive GTOs together,
8 |
In practical implementation, we reorder the summation over cj and in Eq. (6) to obtain better efficiency. It should be noted that an EAD feature vector (ρi) consists of a number of density values generated from different sets of contracted GTOs. Although GTOs in our model are expanded in Cartesian coordinates, they can also be expressed in terms of spherical harmonics, resembling in spirit those equivariant features based on spherical harmonics44,48,53. Specifically, is the position vector (three components) of the central atom i relative to the jth neighbor atom with rij (xij,yij,zij) being its norm (Cartesian component), l = lx+ ly + lz specifies the orbital angular momentum (e.g., l = 0 for the s orbital, l = 1 for the p orbital, etc.), αm and rm are hyperparameters that determine the center and the width of the radial Gaussian function. In the combination to form an EAD feature, L is the maximum orbital angular momentum of GTOs, Nφ is the number of primitive GTOs for each l and is the contraction coefficient of the mth primitive GTO for the nth component of the EAD vector, Nc is the number of neighbor atoms and cj is the jth atomic orbital coefficient within a cutoff radius (rc), is a cosine-type switching function continuously decaying interatomic interactions to zero at rc up to second-order derivatives. In particular, realizing that cj itself necessarily depends on its atomic environment, we express it as the output of an atomic NN based on EAD features centered at atom j. Apparently, REANN is essentially a message-passing NN by recursively expanding the environment-dependent orbital coefficients like this, which has proven an efficient way to incorporate high-order many-body correlations in the local environment64.
Training details
All FIREANN models in this study utilize NN with two hidden layers, each containing 64 neurons in each iteration of message-passing. Eight radial functions and L up to 2 were used to construct EAD features with sufficient representability. The initial learning rate was set to 0.002 and decays by a factor of 0.5 whenever the validation loss does not decrease for 100 consecutive epochs. Training stops when the learning rate drops below 1 × 10−5. The number of message-passing iterations for H2O, NMA, and liquid water were set to 0, 4, and 3, respectively. The cutoff distances for the three systems were 3.0 Å, 6.0 Å, and 5.0 Å. Other parameters were automatically optimized during the training process. These weights for individual properties in the loss function were dynamically adjusted during the training process. For the NMA molecule, λV, λF, λμ, and λα decay linearly from 0.1, 50, 10, and 10 to 0.1, 0.5, 0.5, and 0.5 as the learning rate decays. The same set of weights were used for the H2O molecule, except that there is no force weight included. As for the liquid water, only atomic forces were trained and weighting was unnecessary.
Computational details and datasets
Three systems are used to validate the FIREANN model, including a toy system (H2O monomer), NMA, and liquid water.
A toy system
The training set of the H2O molecule contains merely a single equilibrium geometry lying in the yz plane with an electric field of 0.1 V Å−1 along the x direction. The potential energy, dipole moment, and polarizability of H2O were calculated by Gaussian 0984 at the B3LYP/cc-pVDZ level85 and used as targets in the loss function defined in Eq. (7).
NMA
Over 13000 configurations were sampled from canonical ensemble (NVT) classical & path-integral MD simulations at 300 K in the presence of an electric field ranging from 0.0 to 0.4 V Å−1 along the x direction and calculated using Gaussian 0984 at the B3LYP/aug-cc-pVDZ level85 with D3 correction of disperson86. The dataset was divided into training set, validation set, and test set with a ratio of 8:1:1. Again, the potential energy, atomic force, dipole moment, and polarizability were trained simultaneously.
Liquid water
A cubic box of 64 water molecules was used in the data sampling. The dataset consists of ~33000 configurations sampled from NVT classical and path-integral AIMD simulations at 300 K with the external electric field ranging from 0.0 to 0.6 V Å−1 along x direction. Electronic structures and properties were calculated by CP2K87 with a hybrid density functional revPBE088,89 including D3 correction of dispersion86. Goedecker–Tetter–Hutter pseudopotentials90 with a cutoff of 1200 Ry and a TZV2P basis set were used. Only atomic forces were used to construct the PES, and the dipole moment was excluded due to its discontinuity caused by the multiple-value nature of the periodic systems. To illustrate the deficiency of FieldSchNet, a special dataset was collected consisting of 200 liquid water configurations exposed to an electric field along the x direction ranging from 0 to 0.6 V Å−1. Water molecules in these configurations were averagely placed in four evenly spaced layers perpendicular to the x axis (16 molecules in each). In addition, the first (second) layer was made identical to the third (fourth) layer. In this way, the sum of dipole moments in each layer was kept in plane and any interlayer dipole moment canceled out, leaving a zero x component of the total dipole moment. In the data sampling, water molecules were centered evenly spaced grids (4 × 4) in the plane with some random shifts within 0.3 Å and random in-plane orientation. The intramolecular O-H bond lengths and H-O-H angle of each molecule are randomly displaced from their equilibrium values within ~0.1 Å and ~2°.
MD simulations with FIREANN models
NMA
To compare the calculated IR and Raman spectra of NMA with experimental data, classical MD simulations were performed at 300 K in the absence of an electric field. The NMA molecule was first equilibrated with 20 ps using the Andersen thermostat91, after which two-hundred snapshots with corresponding momentum were randomly chosen for initializing subsequent NVE MD simulations of 25 ps. The time correlation functions (TCFs) were computed by the average of 200 such trajectories. IR and Raman spectra were obtained by the Fourier transform of the TCFs of the time derivatives of dipole and polarizability, respectively92. In addition, to include NQEs, path-integral based TRPMD simulations70,93 were performed with Langevin thermostats attached to all non-centroid normal modes, with and without adding an electric field. Other computational details are similar to those in the classical MD simulations. The resulting field-dependent IR and Raman spectra were obtained by the Fourier transform of the centroid-based TCFs on an average of 200 TRPMD trajectories. In all simulations, the time step was kept at 0.1 fs.
Liquid water
The system consists of 64 H2O molecules with a side length 12.4185 Å. The NVT classical MD simulations of liquid water were performed at 300 K, using Andersen thermostat91. To obtain convergent IR spectra, we extracted 128 positions and momentum from an equilibrium NVT trajectory as initial states for NVE MD simulations, with a total time of 20 ps per NVE simulation and a time step of 0.1 fs. The same setup was used for a 24-bead TRPMD simulations to include NQEs, which was found to converge the spectroscopic results.
Supplementary information
Source data
Acknowledgements
This work is supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB0450101), Innovation Program for Quantum Science and Technology (2021ZD0303301), CAS Project for Young Scientists in Basic Research (YSBR-005), National Natural Science Foundation of China (22325304, 22221003 and 22033007), and the Fundamental Research Funds for Central Universities (WK2060000017). We acknowledge the Supercomputing Center of USTC, Hefei Advanced Computing Center, Beijing PARATERA Tech CO., Ltd for providing high-performance computing service. We also thank Dr. Wei Hu, Dr. Xinming Qin, Dr. Mouyi Weng, and Junfeng Qiao for the very helpful discussion.
Author contributions
B.J. designed the project, and Y.Z. and B.J. discussed the neural network architecture. Y.Z. wrote the code and performed all calculations. Y.Z. and B.J. wrote the manuscript.
Peer review
Peer review information
Nature Communications thanks Rodrigo Alejandro Vargas-Hernández, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
The dataset of NMA molecules and the exemplary dataset of liquid water (200 structures) generated in this study have been deposited in the github [https://github.com/zhangylch/FIREANN/tree/main/data]. The initial and final structures of MD/TRPMD simulations generated in this study have been deposited in the github [https://github.com/zhangylch/FIREANN/tree/main/data/md_stru]. The complete liquid water data are available upon reasonable request from the corresponding author. The data generated in this study are provided in the Source Data file. Source data are provided with this paper.
Code availability
The FIREANN package are available from https://github.com/zhangylch/FIREANN94.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-42148-y.
References
- 1.Shaik, S. S. & Stuyver T. Effects of electric fields on structure and reactivity: new horizons in chemistry. The Royal Society of Chemistry, xviii, 428 pages (2021).
- 2.Lemeshko M, Krems RV, Doyle JM, Kais S. Manipulation of molecules with electromagnetic fields. Mol. Phys. 2013;111:1648–1682. [Google Scholar]
- 3.Alemani M, et al. Electric field-induced isomerization of azobenzene by STM. J. Am. Chem. Soc. 2006;128:14446–14447. doi: 10.1021/ja065449s. [DOI] [PubMed] [Google Scholar]
- 4.Murgida DH, Hildebrandt P. Electron-transfer processes of cytochrome c at interfaces. new insights by surface-enhanced resonance raman spectroscopy. Acc. Chem. Res. 2004;37:854–861. doi: 10.1021/ar0400443. [DOI] [PubMed] [Google Scholar]
- 5.Velpula G, Teyssandier J, De Feyter S, Mali KS. Nanoscale control over the mixing behavior of surface-confined bicomponent supramolecular networks using an oriented external electric field. ACS Nano. 2017;11:10903–10913. doi: 10.1021/acsnano.7b04610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.English NJ, Mooney DA. Denaturation of hen egg white lysozyme in electromagnetic fields: a molecular dynamics study. J. Chem. Phys. 2007;126:091105. doi: 10.1063/1.2515315. [DOI] [PubMed] [Google Scholar]
- 7.Shaik S, Mandal D, Ramanan R. Oriented electric fields as future smart reagents in chemistry. Nat. Chem. 2016;8:1091–1098. doi: 10.1038/nchem.2651. [DOI] [PubMed] [Google Scholar]
- 8.Ciampi S, Darwish N, Aitken HM, Díez-Pérez I, Coote ML. Harnessing electrostatic catalysis in single molecule, electrochemical and chemical systems: a rapidly growing experimental tool box. Chem. Soc. Rev. 2018;47:5146–5164. doi: 10.1039/c8cs00352a. [DOI] [PubMed] [Google Scholar]
- 9.Shaik S, Ramanan R, Danovich D, Mandal D. Structure and reactivity/selectivity control by oriented-external electric fields. Chem. Soc. Rev. 2018;47:5125–5145. doi: 10.1039/c8cs00354h. [DOI] [PubMed] [Google Scholar]
- 10.Aragonès AC, et al. Electrostatic catalysis of a Diels–Alder reaction. Nature. 2016;531:88–91. doi: 10.1038/nature16989. [DOI] [PubMed] [Google Scholar]
- 11.Friedrich B, Herschbach DR. Spatial orientation of molecules in strong electric fields and evidence of pendular states. Nature. 1991;353:412. [Google Scholar]
- 12.Sussman BJ, Townsend D, Ivanov MY, Stolow A. Dynamic stark control of photochemical processes. Science. 2006;314:278–281. doi: 10.1126/science.1132289. [DOI] [PubMed] [Google Scholar]
- 13.de Miranda MHG, et al. Controlling the quantum stereodynamics of ultracold bimolecular reactions. Nat. Phys. 2011;7:502–507. [Google Scholar]
- 14.Tscherbul TV, Krems RV. Tuning bimolecular chemical reactions by electric fields. Phys. Rev. Lett. 2015;115:023201. doi: 10.1103/PhysRevLett.115.023201. [DOI] [PubMed] [Google Scholar]
- 15.King-Smith RD, Vanderbilt D. Theory of polarization of crystalline solids. Phys. Rev. B. 1993;47:1651–1654. doi: 10.1103/physrevb.47.1651. [DOI] [PubMed] [Google Scholar]
- 16.Meir R, Chen H, Lai W, Shaik S. Oriented electric fields accelerate diels–alder reactions and control the endo/exo selectivity. ChemPhysChem. 2010;11:301–310. doi: 10.1002/cphc.200900848. [DOI] [PubMed] [Google Scholar]
- 17.Zhang C, Sprik M. Finite field methods for the supercell modeling of charged insulator/electrolyte interfaces. Phys. Rev. B. 2016;94:245309. [Google Scholar]
- 18.Saitta AM, Saija F, Giaquinta PV. Ab initio molecular dynamics study of dissociation of water under an electric field. Phys. Rev. Lett. 2012;108:207801. doi: 10.1103/PhysRevLett.108.207801. [DOI] [PubMed] [Google Scholar]
- 19.Cassone G. Nuclear quantum effects largely influence molecular dissociation and proton transfer in liquid water under an electric field. J. Phys. Chem. Lett. 2020;11:8983–8988. doi: 10.1021/acs.jpclett.0c02581. [DOI] [PubMed] [Google Scholar]
- 20.Cassone G, Sponer J, Trusso S, Saija F. Ab initio spectroscopy of water under electric fields. Phys. Chem. Chem. Phys. 2019;21:21205–21212. doi: 10.1039/c9cp03101d. [DOI] [PubMed] [Google Scholar]
- 21.Vegiri A, Schevkunov SV. A molecular dynamics study of structural transitions in small water clusters in the presence of an external electric field. J. Chem. Phys. 2001;115:4175–4185. [Google Scholar]
- 22.Acosta-Gutiérrez S, Hernández-Rojas J, Bretón J, Llorente JMG, Wales DJ. Physical properties of small water clusters in low and moderate electric fields. J. Chem. Phys. 2011;135:124303. doi: 10.1063/1.3640804. [DOI] [PubMed] [Google Scholar]
- 23.Senftle TP, et al. The ReaxFF reactive force-field: development, applications and future directions. NPJ Comput. Mat. 2016;2:15011. [Google Scholar]
- 24.Sobrino Fernández M, Peeters FM, Neek-Amal M. Electric-field-induced structural changes in water confined between two graphene layers. Phys. Rev. B. 2016;94:045436. [Google Scholar]
- 25.Tan S, et al. Enhancing the oxidation of toluene with external electric fields: a reactive molecular dynamics study. Sci. Rep. 2017;7:1710. doi: 10.1038/s41598-017-01945-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Musil F, et al. Physics-inspired structural representations for molecules and materials. Chem. Rev. 2021;121:9759–9815. doi: 10.1021/acs.chemrev.1c00021. [DOI] [PubMed] [Google Scholar]
- 27.Unke OT, et al. Machine learning force fields. Chem. Rev. 2021;121:10142–10186. doi: 10.1021/acs.chemrev.0c01111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Behler J. Four generations of high-dimensional neural network potentials. Chem. Rev. 2021;121:10037–10072. doi: 10.1021/acs.chemrev.0c00868. [DOI] [PubMed] [Google Scholar]
- 29.Kang P-L, Shang C, Liu Z-P. Large-scale atomic simulation via machine learning potentials constructed by global potential energy surface exploration. Acc. Chem. Res. 2020;53:2119–2129. doi: 10.1021/acs.accounts.0c00472. [DOI] [PubMed] [Google Scholar]
- 30.Huang B, von Lilienfeld OA. Ab initio machine learning in chemical compound space. Chem. Rev. 2021;121:10001–10036. doi: 10.1021/acs.chemrev.0c01303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Deringer VL, et al. Gaussian process regression for materials and molecules. Chem. Rev. 2021;121:10073–10141. doi: 10.1021/acs.chemrev.1c00022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zhang Y, Lin Q, Jiang B. Atomistic neural network representations for chemical dynamics simulations of molecular, condensed phase, and interfacial systems: efficiency, representability, and generalization. WIREs Comput. Mol. Sci. 2023;13:e1645. [Google Scholar]
- 33.Behler J, Parrinello M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 2007;98:146401. doi: 10.1103/PhysRevLett.98.146401. [DOI] [PubMed] [Google Scholar]
- 34.Bartók AP, Payne MC, Kondor R, Csányi G. Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett. 2010;104:136403. doi: 10.1103/PhysRevLett.104.136403. [DOI] [PubMed] [Google Scholar]
- 35.Jiang B, Guo H. Permutation invariant polynomial neural network approach to fitting potential energy surfaces. J. Chem. Phys. 2013;139:054112. doi: 10.1063/1.4817187. [DOI] [PubMed] [Google Scholar]
- 36.Shao K, Chen J, Zhao Z, Zhang DH. Communication: fitting potential energy surfaces with fundamental invariant neural network. J. Chem. Phys. 2016;145:071101. doi: 10.1063/1.4961454. [DOI] [PubMed] [Google Scholar]
- 37.Schütt KT, Sauceda HE, Kindermans P-J, Tkatchenko A, Müller K-R. SchNet – a deep learning architecture for molecules and materials. J. Chem. Phys. 2018;148:241722. doi: 10.1063/1.5019779. [DOI] [PubMed] [Google Scholar]
- 38.Zhang L, Han J, Wang H, Car R, E W. Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. Phys. Rev. Lett. 2018;120:143001. doi: 10.1103/PhysRevLett.120.143001. [DOI] [PubMed] [Google Scholar]
- 39.Zhang Y, Hu C, Jiang B. Embedded atom neural network potentials: efficient and accurate machine learning with a physically inspired representation. J. Phys. Chem. Lett. 2019;10:4962–4967. doi: 10.1021/acs.jpclett.9b02037. [DOI] [PubMed] [Google Scholar]
- 40.Unke OT, Meuwly M. PhysNet: a neural network for predicting energies, forces, dipole moments, and partial charges. J. Chem. Theory Comput. 2019;15:3678–3693. doi: 10.1021/acs.jctc.9b00181. [DOI] [PubMed] [Google Scholar]
- 41.Zaverkin V, Kästner J. Gaussian moments as physically inspired molecular descriptors for accurate and scalable machine learning potentials. J. Chem. Theory Comput. 2020;16:5410–5421. doi: 10.1021/acs.jctc.0c00347. [DOI] [PubMed] [Google Scholar]
- 42.Shapeev AV. Moment tensor potentials: a class of systematically improvable interatomic potentials. Multiscale Model. Simul. 2016;14:1153–1173. [Google Scholar]
- 43.Zubatyuk R, Smith Justin S, Leszczynski J, Isayev O. Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network. Sci. Adv. 2021;5:eaav6490. doi: 10.1126/sciadv.aav6490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Unke OT, et al. SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects. Nat. Commun. 2021;12:7273. doi: 10.1038/s41467-021-27504-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Sauceda HE, et al. BIGDML—towards accurate quantum machine learning force fields for materials. Nat. Commun. 2022;13:3733. doi: 10.1038/s41467-022-31093-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Braams BJ, Bowman JM. Permutationally invariant potential energy surfaces in high dimensionality. Int. Rev. Phys. Chem. 2009;28:577–606. doi: 10.1146/annurev-physchem-050317-021139. [DOI] [PubMed] [Google Scholar]
- 47.Dral PO, et al. MLatom 2: an integrative platform for atomistic machine learning. Top. Curr. Chem. 2021;379:27. doi: 10.1007/s41061-021-00339-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Batzner S, et al. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun. 2022;13:2453. doi: 10.1038/s41467-022-29939-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gastegger M, Behler J, Marquetand P. Machine learning molecular dynamics for the simulation of infrared spectra. Chem. Sci. 2017;8:6924–6935. doi: 10.1039/c7sc02267k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Grisafi A, Wilkins DM, Csányi G, Ceriotti M. Symmetry-adapted machine learning for tensorial properties of atomistic systems. Phys. Rev. Lett. 2018;120:036002. doi: 10.1103/PhysRevLett.120.036002. [DOI] [PubMed] [Google Scholar]
- 51.Zhang Y, Maurer RJ, Jiang B. Symmetry-adapted high dimensional neural network representation of electronic friction tensor of adsorbates on metals. J. Phys. Chem. C. 2020;124:186–195. [Google Scholar]
- 52.Zhang Y, et al. Efficient and accurate simulations of vibrational and electronic spectra with symmetry-preserving neural network models for tensorial properties. J. Phys. Chem. B. 2020;124:7284–7290. doi: 10.1021/acs.jpcb.0c06926. [DOI] [PubMed] [Google Scholar]
- 53.Schütt, K. & Unke O. Gastegger M. Equivariant message passing for the prediction of tensorial properties and molecular spectra. In: Proceedings of the 38th International Conference on Machine Learning (eds. Marina M., Tong Z.). PMLR (2021).
- 54.Nigam J, Willatt MJ, Ceriotti M. Equivariant representations for molecular Hamiltonians and N-center atomic-scale properties. J. Chem. Phys. 2021;156:014115. doi: 10.1063/5.0072784. [DOI] [PubMed] [Google Scholar]
- 55.Westermayr J, Gastegger M, Marquetand P. Combining SchNet and SHARC: the SchNarc machine learning approach for excited-state dynamics. J. Phys. Chem. Lett. 2020;11:3828–3834. doi: 10.1021/acs.jpclett.0c00527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Sommers GM, Calegari Andrade MF, Zhang L, Wang H, Car R. Raman spectrum and polarizability of liquid water from deep neural networks. Phys. Chem. Chem. Phys. 2020;22:10592–10602. doi: 10.1039/d0cp01893g. [DOI] [PubMed] [Google Scholar]
- 57.Huang X, Braams BJ, Bowman JM. Ab initio potential energy and dipole moment surfaces for H5O2+ J. Chem. Phys. 2005;122:044308. doi: 10.1063/1.1834500. [DOI] [PubMed] [Google Scholar]
- 58.Medders GR, Paesani F. Infrared and raman spectroscopy of liquid water through “first-principles” many-body molecular dynamics. J. Chem. Theory Comput. 2015;11:1145–1154. doi: 10.1021/ct501131j. [DOI] [PubMed] [Google Scholar]
- 59.Beckmann R, Brieuc F, Schran C, Marx D. Infrared spectra at coupled cluster accuracy from neural network representations. J. Chem. Theory Comput. 2022;18:5492–5501. doi: 10.1021/acs.jctc.2c00511. [DOI] [PubMed] [Google Scholar]
- 60.Schienbein P. Spectroscopy from machine learning by accurately representing the atomic polar tensor. J. Chem. Theory Comput. 2023;19:705–712. doi: 10.1021/acs.jctc.2c00788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Christensen AS, Faber FA, Lilienfeld OAV. Operators in quantum machine learning: Response properties in chemical space. J. Chem. Phys. 2019;150:064105. doi: 10.1063/1.5053562. [DOI] [PubMed] [Google Scholar]
- 62.Gastegger M, Schütt KT, Müller K-R. Machine learning of solvent effects on molecular spectra and reactions. Chem. Sci. 2021;12:11473–11483. doi: 10.1039/d1sc02742e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Gao A, Remsing RC. Self-consistent determination of long-range electrostatics in neural network potentials. Nat. Commun. 2022;13:1572. doi: 10.1038/s41467-022-29243-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Zhang Y, Xia J, Jiang B. Physically motivated recursively embedded atom neural networks: incorporating local completeness and nonlocality. Phys. Rev. Lett. 2021;127:156002. doi: 10.1103/PhysRevLett.127.156002. [DOI] [PubMed] [Google Scholar]
- 65.Wang L, Middleton CT, Zanni MT, Skinner JL. Development and validation of transferable amide I vibrational frequency maps for peptides. J. Phys. Chem. B. 2011;115:3713–3724. doi: 10.1021/jp200745r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Ye S, et al. A neural network protocol for electronic excitations of N-methylacetamide. Proc. Natl Acad. Sci. USA. 2019;116:11612–11617. doi: 10.1073/pnas.1821044116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Zhang J, et al. A machine-learning protocol for ultraviolet protein-backbone absorption spectroscopy under environmental fluctuations. J. Phys. Chem. B. 2021;125:6171–6178. doi: 10.1021/acs.jpcb.1c03296. [DOI] [PubMed] [Google Scholar]
- 68.Zhao L, et al. Accurate machine learning prediction of protein circular dichroism spectra with embedded density descriptors. JACS Au. 2021;1:2377–2384. doi: 10.1021/jacsau.1c00449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Forsting T, Gottschalk HC, Hartwig B, Mons M, Suhm MA. Correcting the record: the dimers and trimers of trans-N-methylacetamide. Phys. Chem. Chem. Phys. 2017;19:10727–10737. doi: 10.1039/c6cp07989j. [DOI] [PubMed] [Google Scholar]
- 70.Rossi M, Ceriotti M, Manolopoulos DE. How to remove the spurious resonances from ring polymer molecular dynamics. J. Chem. Phys. 2014;140:234116. doi: 10.1063/1.4883861. [DOI] [PubMed] [Google Scholar]
- 71.Herrebout WA, Clou K, Desseyn HO. Vibrational spectroscopy of N-methylacetamide revisited. J. Phys. Chem. A. 2001;105:4865–4881. [Google Scholar]
- 72.Zhang, Y., Jiang, J. & Jiang B. Chapter 19 - Learning dipole moments and polarizabilities. In: Quantum Chemistry in the Age of Machine Learning (ed. Dral P. O.). Elsevier (2023).
- 73.Zhang L, et al. Deep neural network for the dielectric response of insulators. Phys. Rev. B. 2020;102:041121. [Google Scholar]
- 74.Damle A, Lin L, Ying L. SCDM-k: localized orbitals for solids via selected columns of the density matrix. J. Comput. Phys. 2017;334:1–15. [Google Scholar]
- 75.Marsalek O, Markland TE. Quantum dynamics and spectroscopy of ab initio liquid water: the interplay of nuclear and electronic quantum effects. J. Phys. Chem. Lett. 2017;8:1545–1551. doi: 10.1021/acs.jpclett.7b00391. [DOI] [PubMed] [Google Scholar]
- 76.Soper AK. The radial distribution functions of water and ice from 220 to 673 K and at pressures up to 400 MPa. Chem. Phys. 2000;258:121–137. [Google Scholar]
- 77.Bertie JE, Lan Z. Infrared intensities of liquids XX: The intensity of the OH stretching band of liquid water revisited, and the best current values of the optical constants of H2O(l) at 25 °C between 15,000 and 1 cm−1. Appl. Spectrosc. 1996;50:1047–1057. [Google Scholar]
- 78.Silvestrelli PL, Bernasconi M, Parrinello M. Ab initio infrared spectrum of liquid water. Chem. Phys. Lett. 1997;277:478–482. [Google Scholar]
- 79.Paszke A. et al. PyTorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems 32 (NeurIPS 2019) (eds. Wallach H., Larochelle H., Beygelzimer A., d’Alché-Buc F., Garnett EFaR). Curran Associates Inc. (2019).
- 80.Frostig, R., Johnson, M. J. & Leary C. Compiling machine learning programs via high-level tracing. Syst. Mach. Learn.2–24 (2018).
- 81.Groß A. Challenges in the modeling of elementary steps in electrocatalysis. Curr. Opin. Electrochem. 2023;37:101170. [Google Scholar]
- 82.Rossmeisl J, Nørskov JK, Taylor CD, Janik MJ, Neurock M. Calculated phase diagrams for the electrochemical oxidation and reduction of water over Pt(111) J. Phys. Chem. B. 2006;110:21833–21839. doi: 10.1021/jp0631735. [DOI] [PubMed] [Google Scholar]
- 83.Seemala B, et al. Plasmon-mediated catalytic O2 dissociation on Ag nanostructures: hot electrons or near fields? ACS Energy Lett. 2019;4:1803–1809. [Google Scholar]
- 84.Frisch M. J. et al. Gaussian 09 Revision B.01. Gaussian Inc. (2009).
- 85.Becke AD. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A. 1988;38:3098. doi: 10.1103/physreva.38.3098. [DOI] [PubMed] [Google Scholar]
- 86.Grimme S, Antony J, Ehrlich S, Krieg H. A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu. J. Chem. Phys. 2010;132:154104. doi: 10.1063/1.3382344. [DOI] [PubMed] [Google Scholar]
- 87.Kühne TD, et al. CP2K: An electronic structure and molecular dynamics software package - Quickstep: eficient and accurate electronic structure calculations. J. Chem. Phys. 2020;152:194103. doi: 10.1063/5.0007045. [DOI] [PubMed] [Google Scholar]
- 88.Adamo C, Barone V. Toward reliable density functional methods without adjustable parameters: the PBE0 model. J. Chem. Phys. 1999;110:6158–6170. [Google Scholar]
- 89.Goerigk L, Grimme S. A thorough benchmark of density functional methods for general main group thermochemistry, kinetics, and noncovalent interactions. Phys. Chem. Chem. Phys. 2011;13:6670–6688. doi: 10.1039/c0cp02984j. [DOI] [PubMed] [Google Scholar]
- 90.Goedecker S, Teter M, Hutter J. Separable dual-space Gaussian pseudopotentials. Phys. Rev. B. 1996;54:1703–1710. doi: 10.1103/physrevb.54.1703. [DOI] [PubMed] [Google Scholar]
- 91.Andersen HC. Molecular dynamics simulations at constant pressure and/or temperature. J. Chem. Phys. 1980;72:2384–2393. [Google Scholar]
- 92.Thomas M, Brehm M, Fligg R, Vohringer P, Kirchner B. Computing vibrational spectra from ab initio molecular dynamics. Phys. Chem. Chem. Phys. 2013;15:6608–6622. doi: 10.1039/c3cp44302g. [DOI] [PubMed] [Google Scholar]
- 93.Craig IR, Manolopoulos DE. Quantum statistics and classical mechanics: real time correlation frunction from ring polymer molecular dynamics. J. Chem. Phys. 2004;121:3368–3373. doi: 10.1063/1.1777575. [DOI] [PubMed] [Google Scholar]
- 94.Zhang, Y. & Jiang B. Universal machine learning for the response of atomistic systems to external fields. FIREANN10.5281/zenodo8363726 (2023). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The dataset of NMA molecules and the exemplary dataset of liquid water (200 structures) generated in this study have been deposited in the github [https://github.com/zhangylch/FIREANN/tree/main/data]. The initial and final structures of MD/TRPMD simulations generated in this study have been deposited in the github [https://github.com/zhangylch/FIREANN/tree/main/data/md_stru]. The complete liquid water data are available upon reasonable request from the corresponding author. The data generated in this study are provided in the Source Data file. Source data are provided with this paper.
The FIREANN package are available from https://github.com/zhangylch/FIREANN94.