Abstract
The lack of long-range electrostatics is a key limitation of modern machine learning interatomic potentials (MLIPs), hindering reliable applications to interfaces, charge-transfer reactions, polar and ionic materials, and biomolecules. In this Perspective, we distill two design principles behind the Latent Ewald Summation (LES) framework, which can capture long-range interactions, charges, and electrical response just by learning from standard energy and force training data: (i) use a Coulomb functional form with environment-dependent charges to capture electrostatic interactions, and (ii) avoid explicit training on ambiguous density functional theory (DFT) partial charges. When both principles are satisfied, substantial flexibility remains: essentially any short-range MLIP can be augmented; charge equilibration schemes can be added when desired; dipoles and Born effective charges can be inferred or finetuned; and charge/spin-state embeddings or tensorial targets can be further incorporated. We also discuss current limitations and open challenges. Together, these minimal, physics-guided design rules suggest that incorporating long-range electrostatics into MLIPs is simpler and perhaps more broadly applicable than is commonly assumed.
There is no denying that accurate modeling of the 1/r-decaying long-range electrostatics is crucial in atomistic simulations, particularly for interfaces, charge-transfer processes, polar and ionic materials, and biomolecular systems [1–3]. The open question has always been how “easy” it is to incorporate such long-range interactions into the machine learning interatomic potential (MLIP) framework.
Most MLIPs are built on the locality assumption [4]: the energy and forces associated with a central atom are largely determined by its local neighbors [5, 6]. The total short-range energy of a system is thus approximated as a sum of local atomic energy contributions Ei from each atom i, . Early MLIPs [7, 8] define the local environment using a fixed spatial cutoff radius, typically on the order of 6 Å. Later messagepassing neural networks (MPNNs) [9–14] propagate information across multiple interaction shells, effectively increasing the receptive field. However, this is no guarantee that long-range interactions are faithfully captured, as deep MPNNs can suffer from information degradation and over-smoothing [15, 16].
In recent years, various strategies [6, 17–52] have been proposed to combine electrostatics with MLIPs. In Fig. 1, we illustrate the representative developments and the conceptual connections between them. This is by no means exhaustive and is skewed towards the earlier methods in each category, particularly those that have inspired our own work.
Figure 1.

A map of representative strategies for learning long-range interactions. Black solid arrows indicate an extension or variant of a method. Black dashed arrows show conceptual connections. Red arrows indicate the learning of physical quantities. The blue shaded region indicates methods that use additional DFT training labels, the green denotes methods with charge equilibration, the purple marks the methods using high-dimensional long-range representations, the yellow represents the classical force fields with fixed charges, and the red shows the methods that can learn electrical response properties. Abbreviations: LES, latent Ewald summation [21]; HDNNP, high-dimensional neural network potential [23, 31]; BAMBOO, ByteDance AI molecular simulation booster [25]; AIMNET, atoms-in-molecules neural network potential [32]; Ewald MP, Ewald message passing; LODE, long-distance equivariant [39]; RANGE, relaying attention nodes for global encoding [45]; DPLR, deep potential long-range [35]; SCFNN, self-consistent field neural network [36]; Qeq, charge equilibration scheme proposed in Ref. [47, 53]; BEC, Born effective charge tensors; Equivariant features, direct prediction of the tensorial quantities [62, 66, 67]; Differentiable learning, learn corresponding response properties by taking derivatives [54, 55]. U is electric enthalpy, U0 is the energy in the absence of the electric field, P is polarization, and ℰ is a uniform electric field [54].
The methods in the purple shaded region of Fig. 1 capture long-range effects by embedding global geometrical information directly into the model representation [37–42, 44, 45]. This includes approaches based on global or reciprocal-space message passing [38, 44], the use of virtual global nodes for communicating global information [45], and those constructing explicit long-range descriptors [39–41]. In these frameworks, long-range interactions are not introduced as a physical electrostatic energy term, but rather as part of the learned global feature space from which the neural network can infer nonlocal interactions.
The methods inside the blue shaded region of Fig. 1 encode physically-motivated quantities such as atomic charges [22–25, 27–29, 31–34, 48, 49], typically trained on electronic information from density functional theory (DFT). Notably, 3G-HDNNP [23, 31] learns DFT partial charges [31], and this was later refined in 4G-HDNNP [23] that further include charge equilibration (Qeq) scheme [47, 53]. PhysNet [22], BAMBOO [25], and AIMNET2 [32] include DFT dipoles in the loss function and predict atomic partial charges. Deep potential long-range (DPLR) model [35] and the self-consistent field neural network (SCFNN) [36] learn from the maximally localized Wannier centers (MLWCs) for insulating systems.
In a separate vein, the methods highlighted in the red shaded region specifically target electrical response properties such as polarization (P) [54–61], polarizability [54, 57, 58], electron density [62–64], and Born effective charge (BEC) tensors (Z*) [54–56, 59–61, 65]. These quantities may be learned directly as tensorial outputs using equivariant features [55, 56, 62, 63, 65–68], or through differentiable learning frameworks in which response properties are obtained as derivatives of learned quantities [54, 60], for examples, BECs as derivatives of polarization with respect to atomic displacements [54, 56, 60], and polarizations P as derivatives of energy with respect to the external electric field ℰ [54, 57, 58]. Related developments by Grisafi et al. [62, 64] introduced long-range symmetry-adapted models for predicting electron densities and the electronic density response under applied fields.
Latent Ewald Summation (LES) [21, 59–61] is at the intersection of the three aforementioned approaches: LES explicitly employs atomic charges to construct a physical long-range Coulomb electrostatic energy term; however, these charges are not learned from DFT electronic labels. Instead, they are inferred as latent, environment-dependent features solely from training on standard energy and force data. Meanwhile, electrical response properties such as P and Z* emerge naturally [59–61].
The LES algorithm is simple: the local invariant features on each atom i (same as the ones for predicting local energies Ei) predict the latent atomic charge qles via a neural network. These qles are then used to compute long-range electrostatic energy Elr: Pairwise summation is used for finite systems,
| (1) |
where ε0 is the vacuum permittivity, is the complementary error function, σ is a smearing factor typically ≈ 1 Å. Ewald summation is used for periodic systems,
| (2) |
with
| (3) |
where k is the reciprocal wave vector, and V is the cell volume. Finally, the electrostatic energy is added to the short-range energy to form the total potential energy of the system, .
For a finite system, the polarization is , and the BEC for an atom i is
| (4) |
with α and β labeling Cartesian directions. For a homogeneous periodic system,
| (5) |
where , and ε∞ is high-frequency (electronic) relative permittivity.
Besides the simple algorithm, the implementation of LES into any short-range MLIP is also easy. Thus far, LES has been integrated with MACE [12], NequIP [11], Allegro [69], CACE [70], CHGNet [71], and these implementations are discussed and benchmarked in Ref. [61]. In addition, LES has been independently implemented in TACE [50]. In Refs. [21, 59–61, 72], LES has been extensively benchmarked on diverse systems, including bulk liquids [21, 59–61], molecular crystals [61], polar molecules [21, 59, 61], ferroelectrics [60], and electrochemical interfaces [59, 72]. Overall, the LES augmentation was shown to universally improve the accuracy and ensure the reliable extraction of electrical response properties, including BECs [59–61], dipoles for finite systems [59, 61], infrared spectra [60, 61], and ionic electrical conductivities [60]. Meanwhile, adding LES only introduces a marginal computational overhead [59, 61], making it effectively a “free lunch”.
Throughout this Perspective, for illustrative purposes, we highlight representative results on the RPBE-D3 bulk water dataset [55], which consists of 604 training and 50 test configurations of 64 water molecules. The experimental value of ε∞ = 1.78 for bulk water was assumed in the BEC analysis using Eqn. (5).
Figure 2a compares the test force RMSEs for the short-range baseline models (hollow bars) and their LES-augmented counterparts (solid bars) across several MLIP architectures. LES systematically reduces the force errors for all models, although the magnitude of the improvement depends on the underlying short-range architecture.
Figure 2.

Benchmark results of short-range and LES-augmented MLIPs for bulk RPBE-D3 water. a: The test force root mean square errors (RMSEs) for baseline MLIPs (hollow bars), and the corresponding LES-augmented models (solid bars) for different architectures (MACE [12], NequIP [11], CHGNet [71], CACE [70], and Allegro [69]). The cutoff r, the number of layers nl, the order of irreducible representations (rotation order) ℓ, and body order ν are indicated for each MLIP. Results are reproduced from Ref. [61]. b: Parity plots comparing the diagonal components of the BEC tensor () of CACE LES with RPBE-D3 reference values for 100 bulk water configurations [55]; insets show the off-diagonal components ( with α ≠ β). E+F denotes a CACE LES model trained on energy and forces, E+F+BEC is reproduced from Ref. [60] and was trained on a smaller set of 100 configurations with BEC labels, and E+F+Qeq incorporates a charge equilibration (Qeq) scheme [47, 53]. The test RMSE values for forces in meV/Å are 21.0 (E+F), 25.3 (E+F+BEC), and 21.1 (E+F+Qeq). All three models use the same CACE settings (r = 4.5 Å, nl = 1, ν = 3). c: The test force RMSEs and BEC R2 values of MACE LES models (r = 4.5 Å, nl = 2, ℓ = 1) as a function of training set size. d: Computational performance benchmarks of MD simulations of bulk liquid water for varying system sizes (N) using LES-augmented MLIPs in single-precision (float32) with ASE implementations [73] performed on an NVIDIA L40S GPU (48 GB memory). The left panel shows the timing of MD simulations using the MACE models (r = 4.5 Å, nl = 2) with different ℓ, both without (empty symbols and dashed lines) and with LES (filled symbols and solid lines). The right panel shows the timing of the CACE models (r = 4.5 Å, nl = 1, ν = 3), without LES (blue line), with LES (red line), and with Qeq scheme implementation (orange line). A reference line for N3 scaling is included to illustrate the performance. The CACE LES model with the Qeq scheme shows cubic scaling with respect to the number of atoms. e: The test RMSEs for energy and force of CACE LES models (r = 4.5 Å, nl = 1, ν = 2) with varying charge output dimensions. f: Infrared (IR) absorption spectra of bulk liquid water obtained with the E+F, E+F+BEC, and E+F+Qeq CACE LES models. The experimental IR spectrum [74] is included for comparison. E+F and E+F+BEC results are reproduced from Ref. [60].
Beyond improved energies and forces, LES-enabled models recover physically meaningful electrical response properties. As shown in Ref. [61], all LES-augmented MLIPs accurately predict BECs, indicating that the long-range electrostatics are captured in a physically consistent manner. As a representative example, Fig. 2b shows the close agreement between the DFT BECs and those predicted by the CACE LES model trained only on energy and forces (E+F). Importantly, the inference of both long-range electrostatics and BECs within LES is also highly data efficient. As shown in Fig. 2c, the learning curves for the MACE LES model trained only on the energy and forces of different numbers of configurations demonstrate that accurate BEC predictions emerge rapidly and converge even faster than the forces. The BECs are not only sensitive indicators of the correct electrostatics, but are also useful for computing electrical response properties, such as the water IR spectra shown in Fig. 2f.
Finally, Fig. 2d compares the inference speed of the short-range baselines and the LES-augmented models for MACE and CACE. The results show that the additional computational cost associated with the long-range correction is small relative to the baseline evaluation cost, confirming that LES provides an efficient route to incorporating long-range electrostatics without sacrificing performance or system size scalability.
The performance of LES, achieved with a simple algorithm and implementation, is surprising to us: long-range electrostatics for MLIPs turns out to be easier than we thought. In this Perspective, we distill the underlying design principles that make LES work, discuss other possible variants and extensions that also satisfy these principles, highlight the limitations and open questions, and outline our outlook for the future employment of long-range MLIPs.
I. TWO SIMPLE PRINCIPLES
We argue that two simple design principles, outlined below, are sufficient for the easy and reliable incorporation of long-range electrostatics into MLIPs under the LES framework.
Principle 1: Coulomb functional form with environment-dependent charges.
This principle ensures that the learned long-range interaction is physically grounded: the one-dimensional environment-dependent charge on each atom is physically interpretable, and the Coulomb form (Eqns. 1 and 2) guarantees the correct asymptotic behavior and may help extrapolate beyond trained charge separation distances.
Several long-range models shown in the blue region of Fig. 1, including 4G-HDNNP [23], PhysNet [22], BAMBOO [25], and AIMNET2 [32], also satisfy this principle. In contrast, models in the purple region of Fig. 1 do not explicitly rely on physical charges. For example, although LODE [39, 40] encodes Coulomb and other asymptotically decaying potentials (1/rp) in the atomic environment, its high-dimensional features cannot be directly interpreted as physical charges. Similarly, Ewald MP [38] learns long-range descriptors in reciprocal space using trainable frequency filters, without explicitly enforcing a 1/r interaction form.
It is difficult to assert which strategy will yield better training accuracy: physically constrained electrostatics or fully learned long-range features. In principle, more flexible learned formulations can capture a broader class of long-range interactions, including dispersion [38, 40], and may therefore achieve lower fitting errors. However, Coulomb forces represent the dominant contribution among long-range interactions, while most others are much weaker. At the same time, all long-range interactions are intrinsically weak relative to short-range forces yet involve many more atoms, making them fundamentally harder to learn from finite training datasets without inductive bias. The empirical comparisons of these models on specific datasets are scarce in the literature. Fuchs et al. [48] reported that an Allegro [69] model with a charge equilibration layer achieves performance comparable to a PaiNN model [75] with Ewald MP [38] on the OE62 dataset [76]. Several models, including 4G-HDNNP [23], charge-constrained ACE [33], LES-augmented models [59, 61], LOREM [44], CELLI [48] that uses a charge equilibration layer, charge-equilibrated TensorNet (QET) [77], and NequIP-LR [49], have been benchmarked on some of the datasets ( , and Au2 on MgO(001) compiled by Ko et al. [23]. Among these, LES and LOREM have some of the lowest test energy and force errors. However, it has been questioned whether these datasets cover very challenging long-range interactions [44]. Moreover, these existing comparisons often use different short-range MLIP baselines, making it unclear whether the difference in accuracy stems from the baseline itself or the long-range correction.
Here, we test how the dimensionality of the long-range features affects learning accuracy. In the original formulation of LES [21], the latent charge qles can, in principle, be multi-dimensional, with the total long-range energy obtained by aggregating the contributions from different dimensions after the Ewald summation. Conceptually, such multi-dimensional latent charges are analogous to the high-dimensional long-range features used in Ewald MP [38] and LODE [39, 40]. As shown in Fig. 2e, CACE LES models trained with more latent charge output dimensions exhibit essentially negligible and inconsistent improvements in accuracy. While there are alternative ways to exploit high-dimensional long-range information, such as equivariant features [44], this example demonstrates that increasing the dimensionality of long-range features does not necessarily lead to higher training accuracy.
In our view, the greatest advantage of adhering to an explicit Coulomb interaction form is interpretability. This interpretability extends beyond the conceptual meaning of “charges” and directly enables the computation of physically measurable quantities. As discussed in Refs. [59–61], the learned latent charges are predictive of molecular dipoles and higher-order multipoles and also enable the modeling of the electrical response properties such as IR spectra, ionic electrical conductivities, and ferroelectricity. Moreover, the charges can be useful in hybrid schemes such as coupling quantum mechanical calculations with MLIPs (QM/ML) [78] or combining MLIPs with classical force fields with fixed charges (ML/MM) [79].
Principle 2: Not explicitly train on DFT charges.
There is no unique way to map the continuous DFT electron density onto discrete partial charges: different partitioning schemes [24, 80, 81], such as Mulliken [82], Hirshfeld [83], and MBIS [84], often yield substantially different results. Consequently, training on one specific definition of DFT charges would imply a different long-range electrostatic description than training on another. Besides the uniqueness issue, a more subtle point is that the Coulomb interactions between atomic charges are screened by the rapidly responding background electrons with the relative permittivity ε∞ [85, 86]. This screening is naturally absorbed in the LES formulation through learning from long-range electrostatic forces [60], but would generally be absent if atomic charges were learned directly. Consistent with this view, explicit training on DFT charges has also been shown to induce pathological behavior, leading to energy and force errors significantly larger than those of the short-range baseline [23], such that an additional charge equilibration scheme was required to mitigate such issues [23].
Avoiding training on DFT charges also offers a practical advantage. Most MLIP training datasets, especially those originally designed for short-range models, provide only atomic positions, energies, forces, and sometimes stress tensors. By not requiring charge labels, long-range augmentation becomes immediately applicable to a much broader range of existing datasets and systems. It is worth mentioning that the LES architecture allows training on dipoles and/or BECs, which are physical quantities that can help improve the training.
Moreover, explicit training on DFT charges does not appear to improve energy and force accuracy compared to LES. For the dataset of a diatomic gold cluster Au2 supported on the MgO(001) surface [23], which includes both wetting and non-wetting adsorption geometries and where aluminum dopants deep beneath the surface strongly affect the electronic structure and relative stability, a CACE [70] model augmented with LES and trained only on energies and forces [59] achieves nearly an order-of-magnitude lower errors than both the charge-constrained ACE model [33, 87] and the 4G-HDNNP [23].
More importantly, we argue that the LES charges learned just from energies and forces are physically meaningful: they correspond to effective partial charges governing electrostatic interactions and are predictive of measurable physical observables. As shown in Refs. [59, 61], the learned latent charges yield accurate molecular dipoles and multipoles, and can further derive BECs using Eqns. (4) and (5) [60].
For the water example, Fig. 2b demonstrates that the LES BECs trained on energy and forces (E+F) are in excellent agreement with the DFT reference. These BECs enable the calculation of electrical response properties such as IR spectra and electrical conductivities [60]. Fig. 2f shows the corresponding IR for bulk water: both the spectral shape and intensities are in good agreement with experiment, capturing distinct high-frequency intramolecular and low-frequency intermolecular modes.
Finally, with the predicted BECs, one can compute the electrostatic forces on atoms under an applied external electric field and perform molecular dynamics simulations under finite fields using MLIPs. Using this capability, we have shown that LES-augmented MLIPs can reproduce electric-field-dependent phenomena, including the IR spectral shift of bulk water under static fields, the ionic conductivity of a superionic material, and the polarization–electric field hysteresis loop of a ferroelectric material [60].
II. OTHER POSSIBLE EXTENSIONS
Once the two design principles are met, LES can be combined with various additional strategies to suit specific applications. Here we present several examples.
Universal compatibility with any short-range MLIPs.
LES can serve as a universal augmentation to any short-range MLIP [61], and we have demonstrated its integration with methods such as MACE [12], NequIP [11], Allegro [69], CACE [70], CHGNet [71], and UMA [14]. Among these, MACE, NequIP, CACE, CHGNet, and UMA use node features to predict local atomic energy contributions, and the same features are straightforwardly used to predict the latent charges. Instead, Allegro uses edge features to predict pairwise energies, which are then summed to obtain per-atom energies; in the same spirit, the LES augmentation with Allegro predicts the pairwise charge contributions first using the same edge features and then aggregates them to form the per-atom latent charges.
Optional fine-tuning using dipoles or BECs.
Although energies and forces alone are sufficient to recover the correct electrostatics, additional training labels, such as dipoles or BEC tensors, can be incorporated for fine-tuning. For example, we show that fine-tuning CACE LES models using DFT-computed BECs further improves BEC prediction accuracy while preserving energy and force accuracy for both bulk water and PbTiO3 datasets [60]. The water example shown in the middle panel (E+F+BEC) of Fig. 2b illustrates this modest improvement in BEC predictions. As expected, the corresponding IR spectrum in Fig. 2f remains in good agreement with experiment.
Integrating optional charge equilibration (Qeq).
Empirically, we have found charge equilibration schemes to be unnecessary within the LES framework [21, 59]. For neutral systems, the sum of the learned charges is typically very close to zero; any small residual is treated as a uniform background charge, corresponding to the tinfoil boundary condition in the reciprocal-space evaluation of the electrostatic interactions in Eqn. (2).
Nevertheless, LES can be straightforwardly combined with charge equilibration using the Qeq approach [47, 53], which determines the atomic charges qi by minimizing the energy
| (6) |
where Eelec is the Coulomb interaction between charges, χi are the atomic electronegativities, and Ji are element-specific hardness parameters. The χi are predicted by a neural network from local invariant features, while the Ji are assigned based on element type. Consistent with the LES design principles, no DFT charges are used for training.
We implemented LES combined with Qeq (LES-Qeq in Fig. 1), with trainable χi and Ji. For the water example, the right panel (E+F+Qeq) of Fig. 2b shows that the addition of Qeq preserves the accurate predictions of forces and LES BECs. The corresponding predicted IR spectrum in Fig. 2f also exhibits good agreement with the reference. However, the computational cost of Qeq is substantial: the CACE LES-Qeq model in the right panel of Fig. 2d shows that incorporating Qeq leads to cubic scaling due to the matrix inversion operation involved. As such, methods have recently been proposed to reduce this cost, including using the iterative solution of Qeq instead of direct matrix inversion [88], and developing a particle mesh iterative solver [89].
Coupling with the Siepmann–Sprik metal model.
As illustrated by the LES-MetalWall in Fig. 1, the LES can be coupled with the Siepmann–Sprik metal model [90]: LES determines the charges on the electrolyte, and the Siepmann–Sprik method describes the response charges on metal electrodes. This hybrid long-range framework enables nanosecond finite-field MD simulations of electrode–electrolyte interfaces under periodic boundary conditions [72].
Charge and spin embedding.
For datasets containing multiple charge or spin states, a degeneracy problem can arise: configurations that are geometrically identical but electronically distinct share the same representations in standard MLIPs, which can lead to poor model performance [23, 33]. To lift the degeneracy, one can embed the charge and/or spin states directly in the local atomic environment features. This can be achieved either by encoding global charge or spin states to inform the local features of all atoms [59, 91], or by utilizing atomwise states such as the oxidation state or the magnetic moment for each atom [91]. As such, not only will the predicted atomic energies depend on these states, but also the latent charges.
Learning atomic dipoles or dispersion coefficients.
The expression for the long-range energy in Eqns. (1) and (2) can be straightforwardly generalized to other 1/rp interactions, such as charge–dipole (p = 2), dipole–dipole (p = 3), or dispersion-like (p = 6) terms. In these cases, the learned latent variables would no longer be scalar charges but higher-order quantities: for example, equivariant atomic dipole vectors for dipolar interactions, or per-atom dispersion coefficients for van der Waals–type contributions.
III. LIMITATIONS AND OPEN QUESTIONS
There are known limitations of the LES algorithm: First, the latent atomic charges qles are determined from local features, and thus do not provide an explicit mechanism for truly long-range charge transfer mediated by mobile carriers. In materials with metallic or highly delocalized electronic character, an applied external field can induce charge rearrangements that are, in principle, global. An example here is the induced surface charge on a macroscopic metal electrode. In such cases, a purely local LES scheme is not sufficient to describe global charge equilibration. A possible remedy is to couple LES to an explicit metallic boundary model, as in LES–MetalWall, where LES controls the electrolyte charges and the Siepmann–Sprik model performs charge equilibration on the metal electrodes [72].
Second, extracting BEC tensors for periodic systems via Eqn. (5) requires specifying a constant high-frequency dielectric permittivity ε∞ for a homogeneous bulk material [60]. For heterogeneous systems, such as interfaces between materials with different ε∞, it is not obvious how to choose or generalize this parameter. Developing consistent strategies to extend LES-based BEC extraction and response-property calculations to inhomogeneous or interfacial systems remains an open challenge.
Third, scalability to very large system sizes remains an engineering challenge. The nominal scaling of Ewald summation itself is not the main bottleneck, as this operation is typically inexpensive relative to other components of modern MLIPs, and more favorable scaling variants such as particle–particle particle–mesh (P3M) or particle–mesh Ewald (PME) are readily available [43]. Rather, the main difficulty arises because inter atomic forces are obtained via the automatic differentiation of the total energy with respect to atomic positions, where the total energy is a global quantity that includes the long-range electrostatic contribution, which makes it nontrivial to distribute force evaluation across multiple GPUs.
Finally, there is currently no “foundation model” [92] that spans broad regions of chemical and materials space while incorporating LES. Encouragingly, MACELES-OFF [61], trained on the SPICE dataset [93] of molecules and small clusters, is more accurate than its short-range counterpart MACE-OFF [94], predicts dipoles and BECs reliably, and provides improved descriptions of bulk liquids. However, it remains an open question whether LES augmentation will offer similar advantages for substantially larger, more diverse molecular datasets or bulk materials datasets [95, 96]. Exploring this direction is key to determining whether long-range electrostatics can be made standard in next-generation foundation MLIPs.
IV. CONCLUSION AND OUTLOOK
In this Perspective, we examine the connections and distinctions between LES and other representative methods for learning long-range interactions (Fig. 1). While many of the aforementioned approaches show great promise, we argue that a minimal and physically grounded design is sufficient for incorporating long-range electrostatics, based on two simple principles: (i) use a physical functional form with flexible, environment-dependent atomic charges to capture electrostatic interactions, and (ii) avoid explicit training on DFT partial charges. Besides, there are many avenues for other optional extensions.
Despite the known limitations, many aspects of the LES method have turned out to be easier than we initially anticipated: the design principles and algorithm are straightforward; only standard energy and force labels are required, training is data-efficient (Fig. 2c); electrical response properties can be extracted reliably (Figs. 2b and f); and the associated computational overhead is low (Fig. 2d).
This naturally raises a broader question: why do short-range MLIPs remain the dominant paradigm, while long-range electrostatic MLIPs have been adopted only sparingly in practical simulations? Equivalently, if long-range electrostatics can in fact be incorporated so simply and efficiently, why was this not widely recognized earlier? One plausible reason is that some of the early approaches shown in Fig. 1 do not fully satisfy the two design principles outlined above, or may have higher computational expense. Another explanation may be that long-range methods have not yet been pushed and benchmarked extensively enough to convincingly demonstrate their broad applicability and computational efficiency.
Another possibility is simply that the atomistic modeling community has long been used to short-range MLIPs and has not yet made the transition to long-range models. Speaking from our own experience, when one of us (B.C.) began using MLIPs around 2018, it was already striking that atomic interactions could be learned and converged simulations performed with near–quantum mechanical accuracy. At that time, the systems we studied were predominantly homogeneous bulk systems, such as bulk water [97], superionic water [98], and high-pressure liquid hydrogen [99]. For these systems, long-range electrostatics is conceptually important, but was not the main concern: long-range interactions are largely isotropic and tend to average out, allowing short-range models to effectively learn the mean-field forces. Many observables of interest, such as radial distribution functions, transport coefficients, and free energies, are not sensitive to long-range electrostatics [100]. At the same time, the training errors of the first-generation MLIPs [7, 8] were substantially larger than those of atomic cluster expansion (ACE) [87] or MPNNs [101], due to the lack of higher-body-order terms, making long-range errors relatively less important. It is only more recently, as our work has shifted toward heterogeneous and interfacial systems, that long-range electrostatics has become essential. Interfaces introduce inhomogeneity, as well as intrinsic polarization and electric fields, where the short-range MLIPs struggle. As the community increasingly studies complex, multi-component, and field-driven systems, especially electrochemical interfaces, polar and ionic materials, and biomolecules, we expect long-range MLIPs to become not just beneficial but necessary.
Acknowledgements
B.C. gratefully thanks Christoph Dellago for his mentorship and influence. Besides his seminal contributions to statistical mechanics, Christoph Dellago is an early developer and adopter of machine learning interatomic potentials. B.C. did two exchanges in the groups of Christoph Dellago and Jörg Behler in 2018, with transformative impact on her research directions.
We thank Peichen Zhong and Daniel S. King for useful feedback on the manuscript, and for the collaborations on the LES method.
Funding acknowledgement:
Research reported in this publication was supported by the National Institute Of General Medical Sciences of the National Institutes of Health under Award Number R35GM159986. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Footnotes
Competing Interests
B.C. has an equity stake in AIMATX Inc. The University of California, Berkeley has filed a provisional patent for the Latent Ewald Summation algorithm.
Data availability
The RPBE-D3 bulk water dataset, training scripts, evaluation scripts, the trained CACE E+F+Qeq model, and CACE LES and MACE LES models used to produce results in Figs. 2c, d, and e are available at https://github.com/ChengUCB/les_fit.
Code availability
The LES library is publicly available at https://github.com/ChengUCB/les. The CACE package with the LES implementation is available at https://github.com/BingqingCheng/cace. The MACE package with the LES implementation is available at https://github.com/ACEsuit/mace. The NequIP and Allegro LES extension package is available at https://github.com/ChengUCB/NequIP-LES. The MatGL package with the LES implementation is available at https://github.com/ChengUCB/matgl. The UMA package with the LES implementation is available at https://github.com/santi921/fairchem/tree/les_branch.
References
- [1].French Roger H, Adrian Parsegian V, Podgornik Rudolf, Rajter Rick F, Jagota Anand, Luo Jian, Asthagiri Dilip, Chaudhury Manoj K, Chiang Yet-ming, Granick Steve, et al. , “Long range interactions in nanoscale science,” Reviews of Modern Physics 82, 1887–1944 (2010). [Google Scholar]
- [2].York Darrin M., Yang Weitao, Lee Hsing, Darden Tom, and Pedersen Lee G., “Toward the accurate modeling of dna: The importance of long-range electrostatics,” Journal of the American Chemical Society 117, 5001–5002 (1995). [Google Scholar]
- [3].Rodgers Jocelyn M and Weeks John D, “Interplay of local hydrogen-bonding and long-ranged dipolar forces in simulations of confined water,” Proceedings of the National Academy of Sciences 105, 19136–19141 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Prodan Emil and Kohn Walter, “Nearsightedness of electronic matter,” Proceedings of the National Academy of Sciences 102, 11635–11638 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Keith John A, Vassilev-Galindo Valentin, Cheng Bingqing, Chmiela Stefan, Gastegger Michael, Müller Klaus-Robert, and Tkatchenko Alexandre, “Combining machine learning and computational chemistry for predictive insights into chemical systems,” Chemical reviews 121, 9816–9872 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Unke Oliver T, Chmiela Stefan, Sauceda Huziel E, Gastegger Michael, Poltavsky Igor, Schütt Kristof T, Tkatchenko Alexandre, and Müller Klaus-Robert, “Machine learning force fields,” Chemical Reviews 121, 10142–10186 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Behler Jörg and Parrinello Michele, “Generalized neural-network representation of high-dimensional potential-energy surfaces,” Physical review letters 98, 146401 (2007). [DOI] [PubMed] [Google Scholar]
- [8].Bartók Albert P, Payne Mike C, Kondor Risi, and Csányi Gábor, “Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons,” Physical review letters 104, 136403 (2010). [DOI] [PubMed] [Google Scholar]
- [9].Schütt Kristof, Kindermans Pieter-Jan, Sauceda Felix Huziel Enoc, Chmiela Stefan, Tkatchenko Alexandre, and Müller Klaus-Robert, “Schnet: A continuous-filter convolutional neural network for modeling quantum interactions,” Advances in neural information processing systems 30 (2017). [Google Scholar]
- [10].Haghighatlari Mojtaba, Li Jie, Guan Xingyi, Zhang Oufan, Das Akshaya, Stein Christopher J., Heidar-Zadeh Farnaz, Liu Meili, Head-Gordon Martin, Bertels Luke, Hao Hongxia, Leven Itai, and Head-Gordon Teresa, “Newtonnet: A newtonian message passing network for deep learning of interatomic potentials and forces,” Digital Discovery 1, 333–343 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Batzner Simon, Musaelian Albert, Sun Lixin, Geiger Mario, Mailoa Jonathan P, Kornbluth Mordechai, Molinari Nicola, Smidt Tess E, and Kozinsky Boris, “E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials,” Nature communications 13, 2453 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Batatia Ilyes, Kovacs David P, Simm Gregor, Ortner Christoph, and Csányi Gábor, “Mace: Higher order equivariant message passing neural networks for fast and accurate force fields,” Advances in Neural Information Processing Systems 35, 11423–11436 (2022). [Google Scholar]
- [13].Fu Xiang, Wood Brandon M., Barroso-Luque Luis, Levine Daniel S., Gao Meng, Dzamba Misko, and Zitnick C. Lawrence, “Learning smooth and expressive interatomic potentials for physical property prediction,” arXiv preprint arXiv:2502.12147 (2025). [Google Scholar]
- [14].Wood Brandon M., Dzamba Misko, Fu Xiang, Gao Meng, Shuaibi Muhammed, Barroso-Luque Luis, Abdelmaqsoud Kareem, Gharakhanyan Vahe, Kitchin John R., Levine Daniel S., Michel Kyle, Sriram Anuroop, Cohen Taco, Das Abhishek, Rizvi Ammar, Sahoo Sushree Jagriti, Ulissi Zachary W., and Zitnick C. Lawrence, “Uma: A family of universal models for atoms,” arXiv preprint arXiv:2506.23971 (2025). [Google Scholar]
- [15].Li Qimai, Han Zhichao, and Wu Xiao-Ming, “Deeper insights into graph convolutional networks for semi-supervised learning,” arXiv preprintarXiv:1801.07606 (2018). [Google Scholar]
- [16].Alon Uri and Yahav Eran, “On the bottleneck of graph neural networks and its practical implications,” (2021). [Google Scholar]
- [17].Wai Ko Tsz, Finkler Jonas A., Goedecker Stefan, and Behler Jörg, “General-purpose machine learning potentials capturing nonlocal charge transfer,” Accounts of Chemical Research 54, 808–817 (2021). [DOI] [PubMed] [Google Scholar]
- [18].Kocer Emir, Ko Tsz Wai, and Behler Jörg, “Neural network potentials: A concise overview of methods,” Annual Review of Physical Chemistry 73, 163–186 (2022). [DOI] [PubMed] [Google Scholar]
- [19].Behler Jörg, “Four generations of high-dimensional neural network potentials,” Chemical Reviews 121, 10037–10072 (2021). [DOI] [PubMed] [Google Scholar]
- [20].Anstine Dylan M. and Isayev Olexandr, “Machine learning interatomic potentials and long-range physics,” The Journal of Physical Chemistry A 127, 2417–2431 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Cheng Bingqing, “Latent ewald summation for machine learning of long-range interactions,” npj Computational Materials 11, 80 (2025). [Google Scholar]
- [22].Unke Oliver T and Meuwly Markus, “Physnet: A neural network for predicting energies, forces, dipole moments, and partial charges,” Journal of chemical theory and computation 15, 3678–3693 (2019). [DOI] [PubMed] [Google Scholar]
- [23].Ko Tsz Wai, Finkler Jonas A, Goedecker Stefan, and Behler Jörg, “A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer,” Nature communications 12, 398 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Sifain Andrew E, Lubbers Nicholas, Nebgen Benjamin T, Smith Justin S, Lokhov Andrey Y, Isayev Olexandr, Roitberg Adrian E, Barros Kipton, and Tretiak Sergei, “Discovering a transferable charge assignment model using machine learning,” The journal of physical chemistry letters 9, 4495–4501 (2018). [DOI] [PubMed] [Google Scholar]
- [25].Gong Sheng, Zhang Yumin, Mu Zhenliang, Pu Zhichen, Wang Hongyi, Han Xu, Yu Zhiao, Chen Mengyi, Zheng Tianze, Wang Zhi, Chen Lifei, Yang Zhenze, Wu Xiaojie, Shi Shaochen, Gao Weihao, Yan Wen, and Xiang Liang, “A predictive machine learning force-field framework for liquid electrolyte development,” Nature Machine Intelligence 7, 543–552 (2025). [Google Scholar]
- [26].Shaidu Yusuf, Pellegrini Franco, Küçükbenli Emine, Lot Ruggero, and de Gironcoli Stefano, “Incorporating long-range electrostatics in neural network potentials via variational charge equilibration from short-sighted ingredients,” npj Computational Materials 10, 47 (2024). [Google Scholar]
- [27].Song Zichen, Han Jian, Henkelman Graeme, and Li Lei, “Charge-optimized electrostatic interaction atomcentered neural network algorithm,” Journal of Chemical Theory and Computation 20, 2088–2097 (2024). [DOI] [PubMed] [Google Scholar]
- [28].Kabylda Adil, Frank J. Thorben, Dou ergio Suarez, Khabibrakhmanov Almaz, Sandonas Leonardo Medrano, Unke Oliver T., Chmiela Stefan, Müller Klaus-Robert, and Tkatchenko Alexandre, “Molecular simulations with a pretrained neural network and universal pairwise force fields,” ChemRxiv (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Jacobson Leif D., Stevenson James M., Ramezanghorbani Farhad, Ghoreishi Delaram, Leswing Karl, Harder Edward D., and Abel Robert, “Transferable neural network potential energy surfaces for closed-shell organic molecules: Extension to ions,” Journal of Chemical Theory and Computation 18, 2354–2366 (2022). [DOI] [PubMed] [Google Scholar]
- [30].Thürlemann Moritz, Böselt Lennard, and Riniker Sereina, “Learning atomic multipoles: Prediction of the electrostatic potential with equivariant graph neural networks,” Journal of Chemical Theory and Computation 18, 1701–1710 (2022). [DOI] [PubMed] [Google Scholar]
- [31].Morawietz Tobias, Sharma Vikas, and Behler Jörg, “A neural network potential-energy surface for the water dimer based on environment-dependent atomic energies and charges,” The Journal of Chemical Physics 136, 064103 (2012). [DOI] [PubMed] [Google Scholar]
- [32].Anstine Dylan M., Zubatyuk Roman, and Isayev Olexandr, “Aimnet2: a neural network potential to meet your neutral, charged, organic, and elemental-organic needs,” Chemical Science 16, 10228–10244 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Rinaldi Matteo, Bochkarev Anton, Lysogorskiy Yury, and Drautz Ralf, “Charge-constrained atomic cluster expansion,” Physical Review Materials 9, 033802 (2025). [Google Scholar]
- [34].Unke Oliver T, Chmiela Stefan, Gastegger Michael, Schütt Kristof T, Sauceda Huziel E, and Müller Klaus-Robert, “Spookynet: Learning force fields with electronic degrees of freedom and nonlocal effects,” Nature communications 12, 7273 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Zhang Linfeng, Wang Han, Muniz Maria Carolina, Panagiotopoulos Athanassios Z., Car Roberto, and Weinan E, “A deep potential model with long-range electrostatic interactions,” The Journal of Chemical Physics 156 (2022). [DOI] [PubMed] [Google Scholar]
- [36].Gao Ang and Remsing Richard C, “Self-consistent determination of long-range electrostatics in neural network potentials,” Nature communications 13, 1572 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Yu Hongyu, Hong Liangliang, Chen Shiyou, Gong Xingao, and Xiang Hongjun, “Capturing long-range interaction with reciprocal space neural network,” arXiv preprint arXiv:2211.16684 (2022). [Google Scholar]
- [38].Kosmala Arthur, Gasteiger Johannes, Gao Nicholas, and Günnemann Stephan, “Ewald-based long-range message passing for molecular graphs,” in International Conference on Machine Learning (PMLR, 2023) pp. 17544–17563. [Google Scholar]
- [39].Grisafi Andrea and Ceriotti Michele, “Incorporating long-range physics in atomic-scale machine learning,” The Journal of chemical physics 151 (2019). [DOI] [PubMed] [Google Scholar]
- [40].Huguenin-Dumittan Kevin K, Loche Philip, Haoran Ni, and Ceriotti Michele, “Physics-inspired equivariant descriptors of nonbonded interactions,” The Journal of Physical Chemistry Letters 14, 9612–9618 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Faller Carolin, Kaltak Merzuk, and Kresse Georg, “Density-based long-range electrostatic descriptors for machine learning force fields,” The Journal of Chemical Physics 161, 214701 (2024). [DOI] [PubMed] [Google Scholar]
- [42].Monacelli Lorenzo and Marzari Nicola, “Electrostatic interactions in atomistic and machine-learned potentials for polar materials,” arXiv preprint arXiv:2412.01642 (2024). [Google Scholar]
- [43].Loche Philip, Huguenin-Dumittan Kevin K., Honarmand Melika, Xu Qianjun, Rumiantsev Egor, How Wei Bin, Langer Marcel F., and Ceriotti Michele, “Fast and flexible long-range models for atomistic machine learning,” The Journal of Chemical Physics 162, 142501 (2025). [DOI] [PubMed] [Google Scholar]
- [44].Rumiantsev Egor, Langer Marcel F., Sod-jargal Tulga-Erdene, Ceriotti Michele, and Loche Philip, “Learning long-range representations with equivariant messages,” 2507.19382. [Google Scholar]
- [45].Caruso Alessandro, Venturin Jacopo, Giambagli Lorenzo, Rolando Edoardo, Noé Frank, and Clementi Cecilia, “Extending the range of graph neural networks: Relaying attention nodes for global encoding,” arXiv preprint arXiv:2502.13797 (2025). [Google Scholar]
- [46].Weber John L., Guha Rishabh D., Agarwal Garvit, Wei Yu-jing, Fike Aidan A., Xie Xiaowei, Stevenson James, Leswing Karl, Halls Mathew D., Abel Robert, and Jacobson Leif D., “Efficient long-range machine learning force fields for liquid and materials properties,” arXiv preprint arXiv:2505.06462 (2025). [Google Scholar]
- [47].Alireza Ghasemi S, Hofstetter Albert, Saha Santanu, and Goedecker Stefan, “Interatomic potentials for ionic systems with density functional accuracy based on charge densities obtained by a neural network,” Physical Review B 92, 045131 (2015). [Google Scholar]
- [48].Fuchs Paul, Sanocki Michal, and Zavadlav Julija, “Learning non-local molecular interactions via equivariant local representations and charge equilibration,” npj Computational Materials 11, 287 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [49].Uddin Maruf Moin, Kim Sungmin, and Ahmad Zeeshan, “Learning long-range interactions in equivariant machine learning interatomic potentials via electronic degrees of freedom,” The Journal of Physical Chemistry Letters 16, 9078–9087 (2025). [DOI] [PubMed] [Google Scholar]
- [50].Xu Zemin, Xie Wenbo, Xie Daiqian, and Hu P, “Towards universal property prediction in cartesian space: Tace is all you need,” arXiv preprint arXiv:2509.14961 (2025). [Google Scholar]
- [51].Plé Thomas, Adjoua Olivier, Benali Anouar, Posenitskiy Evgeny, Villot Corentin, Lagardère Louis, and Piquemal Jean-Philip, “A foundation model for accurate atomistic simulations in drug design,” ChemRxiv (2025). [Google Scholar]
- [52].Gao Rongzhi, Yam ChiYung, Mao Jianjun, Chen Shuguang, Chen GuanHua, and Hu Ziyang, “A foundation machine learning potential with polarizable long-range interactions for materials modelling,” Nature Communications 16, 10484 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [53].Rappe Anthony K and Goddard William A III, “Charge equilibration for molecular dynamics simulations,” The Journal of Physical Chemistry 95, 3358–3363 (1991). [Google Scholar]
- [54].Falletta Stefano, Cepellotti Andrea, Johansson Anders, Tan Chuin Wei, Descoteaux Marc L, Musaelian Albert, Owen Cameron J, and Kozinsky Boris, “Unified differentiable learning of electric response,” Nature communications 16, 4031 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [55].Schmiedmayer Bernhard and Kresse Georg, “Derivative learning of tensorial quantities—predicting finite temperature infrared spectra from first principles,” The Journal of Chemical Physics 161 (2024). [DOI] [PubMed] [Google Scholar]
- [56].Stocco Elia, Carbogno Christian, and Rossi Mariana, “Electric-field driven nuclear dynamics of liquids and solids from a multi-valued machine-learned dipolar model,” npj Computational Materials 11, 304 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [57].Zhang Yaolong and Jiang Bin, “Universal machine learning for the response of atomistic systems to external fields,” Nature Communications 14 (2023), 10.1038/s41467-023-42148-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [58].Gastegger Michael, Schütt Kristof T., and Müller Klaus-Robert, “Machine learning of solvent effects on molecular spectra and reactions,” Chemical Science 12, 11473–11483 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].King Daniel S., Kim Dongjin, Zhong Peichen, and Cheng Bingqing, “Machine learning of charges and long-range interactions from energies and forces,” Nature Communications 16, 8763 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Zhong Peichen, Kim Dongjin, King Daniel S, and Cheng Bingqing, “Machine learning interatomic potential can infer electrical response,” arXiv preprint arXiv:2504.05169 (2025). [Google Scholar]
- [61].Kim Dongjin, Wang Xiaoyu, Vargas Santiago, Zhong Peichen, King Daniel S., Inizan Theo Jaffrelot, and Cheng Bingqing, “A universal augmentation framework for long-range electrostatics in machine learning interatomic potentials,” Journal of Chemical Theory and Computation (2025), 10.1021/acs.jctc.5c01400. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [62].Grisafi Andrea, Fabrizio Alberto, Meyer Benjamin, Wilkins David M., Corminboeuf Clemence, and Ceriotti Michele, “Transferable machine-learning model of the electron density,” ACS Central Science 5, 57–64 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [63].Lewis Alan M., Grisafi Andrea, Ceriotti Michele, and Rossi Mariana, “Learning electron densities in the condensed phase,” Journal of Chemical Theory and Computation 17, 7203–7214 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [64].Grisafi Andrea, Bussy Augustin, Salanne Mathieu, and Vuilleumier Rodolphe, “Predicting the charge density response in metal electrodes,” Physical Review Materials 7, 125403 (2023). [Google Scholar]
- [65].Yu Hongyu, Deng Shihan, Zhu Haiyan, Xie Muting, Zhang Yuwen, Shi Xizhi, Zhong Jianxin, He Chaoyu, and Xiang Hongjun, “Prediction of Room Temperature Ferroelectricity in Subnano Silicon Thin Films with an Antiferroelectric Ground State,” Physical Review Letters 135, 156801 (2025). [DOI] [PubMed] [Google Scholar]
- [66].Grisafi Andrea, Wilkins David M, Csányi Gábor, and Ceriotti Michele, “Symmetry-adapted machine learning for tensorial properties of atomistic systems,” Physical review letters 120, 036002 (2018). [DOI] [PubMed] [Google Scholar]
- [67].Schienbein Philipp, “Spectroscopy from machine learning by accurately representing the atomic polar tensor,” Journal of Chemical Theory and Computation 19, 705–712 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [68].Joll Kit, Schienbein Philipp, Rosso Kevin M., and Blumberger Jochen, “Machine learning the electric field response of condensed phase systems using perturbed neural network potentials,” Nature Communications 15 (2024), 10.1038/s41467-024-52491-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [69].Musaelian Albert, Batzner Simon, Johansson Anders, Sun Lixin, Owen Cameron J, Kornbluth Mordechai, and Kozinsky Boris, “Learning local equivariant representations for large-scale atomistic dynamics,” Nature Communications 14, 579 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [70].Cheng Bingqing, “Cartesian atomic cluster expansion for machine learning interatomic potentials,” npj Computational Materials 10, 157 (2024). [Google Scholar]
- [71].Deng Bowen, Zhong Peichen, Jun KyuJung, Riebesell Janosh, Han Kevin, Bartel Christopher J, and Ceder Gerbrand, “Chgnet as a pretrained universal neural network potential for charge-informed atomistic modelling,” Nature Machine Intelligence 5, 1031–1041 (2023). [Google Scholar]
- [72].Wang Xiaoyu, Chen Junmin, Zeng Zezhu, Stein Frederick, Lim Junho, and Cheng Bingqing, “Ion-modulated structure, proton transfer, and capacitance in the pt(111)/water electric double layer,” arXiv preprint arXiv:2509.13727 (2025). [Google Scholar]
- [73].Larsen Ask Hjorth, Mortensen Jens Jørgen, Blomqvist Jakob, Castelli Ivano E, Christensen Rune, Dułak Marcin, Friis Jesper, Groves Michael N, Hammer Bjørk, Hargus Cory, Hermes Eric D, Jennings Paul C, Jensen Peter Bjerre, Kermode James, Kitchin John R, Kolsbjerg Esben Leonhard, Kubal Joseph, Kaasbjerg Kristen, Lysgaard Steen, Maronsson Jón Bergmann, Maxson Tristan, Olsen Thomas, Pastewka Lars, Peterson Andrew, Rostgaard Carsten, Schiøtz Jakob, Schütt Ole, Strange Mikkel, Thygesen Kristian S, Vegge Tejs, Vilhelmsen Lasse, Walter Michael, Zeng Zhenhua, and Jacobsen Karsten W, “The atomic simulation environment—a python library for working with atoms,” Journal of Physics: Condensed Matter 29, 273002 (2017). [DOI] [PubMed] [Google Scholar]
- [74].Bertie John E. and Lan Zhida, “Infrared intensities of liquids xx: The intensity of the oh stretching band of liquid water revisited, and the best current values of the optical constants of h2o(l) at 25°c between 15,000 and 1 cm-1,” Applied Spectroscopy 50, 1047–1057 (1996). [Google Scholar]
- [75].Schütt Kristof, Unke Oliver, and Gastegger Michael, “Equivariant message passing for the prediction of tensorial properties and molecular spectra,” in Proceedings of the 38th International Conference on Machine Learning , Proceedings of Machine Learning Research, Vol. 139, edited by Meila Marina and Zhang Tong (PMLR, 2021) pp. 9377–9388. [Google Scholar]
- [76].Stuke Annika, Kunkel Christian, Golze Dorothea, Todorovíc Milica, Margraf Johannes T., Reuter Karsten, Rinke Patrick, and Oberhofer Harald, “Atomic structures and orbital energies of 61,489 crystal-forming organic molecules,” Scientific Data 7, 58 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [77].Ko Tsz Wai, Liu Runze, Mishra Adesh Rohan, Yu Zihan, Qi Ji, and Ong Shyue Ping, “A fast, accurate, and reactive equivariant foundation potential,” (2025). [Google Scholar]
- [78].Semelak Jonathan A., Pickering Ignacio, Huddleston Kate, Olmos Justo, Grassano Juan Santiago, Clemente Camila Mara, Drusin Salvador I., Marti Marcelo, Gonzalez Lebrero Mariano Camilo, Roitberg Adrian E., and Estrin Dario A., “Advancing multiscale molecular modeling with machine learning-derived electrostatics,” Journal of Chemical Theory and Computation 21, 5194–5207 (2025). [DOI] [PubMed] [Google Scholar]
- [79].Morado João, Zinovjev Kirill, Hedges Lester O, Cole Daniel J, and Michel Julien, “Enhancing electrostatic embedding for ml/mm free energy calculations,” Journal of Chemical Theory and Computation (2025). [DOI] [PubMed] [Google Scholar]
- [80].Marenich Aleksandr V, Jerome Steven V, Cramer Christopher J, and Truhlar Donald G, “Charge model 5: An extension of hirshfeld population analysis for the accurate description of molecular interactions in gaseous and condensed phases,” Journal of chemical theory and computation 8, 527–541 (2012). [DOI] [PubMed] [Google Scholar]
- [81].Wiberg Kenneth B and Rablen Paul R, “Comparison of atomic charges derived via different procedures,” Journal of Computational Chemistry 14, 1504–1518 (1993). [Google Scholar]
- [82].Mulliken Robert S, “Electronic population analysis on lcao–mo molecular wave functions. i,” The Journal of chemical physics 23, 1833–1840 (1955). [Google Scholar]
- [83].Hirshfeld Fred L, “Bonded-atom fragments for describing molecular charge densities,” Theoretica chimica acta 44, 129–138 (1977). [Google Scholar]
- [84].Verstraelen Toon, Vandenbrande Steven, Heidar-Zadeh Farnaz, Vanduyfhuys Louis, Van Speybroeck Veronique, Waroquier Michel, and Ayers Paul W, “Minimal basis iterative stockholder: atoms in molecules for force-field development,” Journal of Chemical Theory and Computation 12, 3894–3912 (2016). [DOI] [PubMed] [Google Scholar]
- [85].Leontyev IV, Vener MV, Rostov IV, Basilevsky MV, and Newton Marshall D, “Continuum level treatment of electronic polarization in the framework of molecular simulations of solvation effects,” The Journal of chemical physics 119, 8024–8037 (2003). [Google Scholar]
- [86].Jorge Miguel, “Theoretically grounded approaches to account for polarization effects in fixed-charge force fields,” The Journal of Chemical Physics 161 (2024). [DOI] [PubMed] [Google Scholar]
- [87].Drautz Ralf, “Atomic cluster expansion for accurate and transferable interatomic potentials,” Physical Review B 99, 014104 (2019). [Google Scholar]
- [88].Kocer Emir, Singraber Andreas, Finkler Jonas A., Misof Philipp, Ko Tsz Wai, Dellago Christoph, and Behler Jörg, “Iterative charge equilibration for fourth-generation high-dimensional neural network potentials,” The Journal of Chemical Physics 162, 124106 (2025). [DOI] [PubMed] [Google Scholar]
- [89].Gubler Moritz, Finkler Jonas A., Schäfer Moritz R., Behler Jörg, and Goedecker Stefan, “Accelerating fourth-generation machine learning potentials using quasi-linear scaling particle mesh charge equilibration,” Journal of Chemical Theory and Computation 20, 7264–7271 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [90].Siepmann J Ilja and Sprik Michiel, “Influence of surface topology and electrostatic potential on water/electrode systems,” The Journal of chemical physics 102, 511–524 (1995). [Google Scholar]
- [91].Simeon Guillem, Mirarchi Antonio, Pelaez Raul P., Galvelis Raimondas, and De Fabritiis Gianni, “Broadening the scope of neural network potentials through direct inclusion of additional molecular attributes,” Journal of Chemical Theory and Computation 21, 1831–1837 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [92].Yuan Eric C.-Y., Liu Yunsheng, Chen Junmin, Zhong Peichen, Raja Sanjeev, Kreiman Tobias, Vargas Santiago, Xu Wenbin, Head-Gordon Martin, Yang Chao, Blau Samuel M., Cheng Bingqing, Krishnapriyan Aditi, and Head-Gordon Teresa, “Foundation models for atomistic simulation of chemistry and materials,” arXiv preprint arXiv:2503.10538 (2025). [DOI] [PubMed] [Google Scholar]
- [93].Eastman Peter, Behara Pavan Kumar, Dotson David L., Galvelis Raimondas, Herr John E., Horton Josh T., Mao Yuezhi, Chodera John D., Pritchard Benjamin P., Wang Yuanqing, De Fabritiis Gianni, and Markland Thomas E, “Spice, a dataset of drug-like molecules and peptides for training machine learning potentials,” Scientific Data 10, 11 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [94].Kovács Dávid Péter, Moore J. Harry, Browning Nicholas J., Batatia Ilyes, Horton Joshua T., Pu Yixuan, Kapil Venkat, Witt William C., Magdău Ioan-Bogdan, Cole Daniel J., and Csányi Gábor, “Mace-off: Short-range transferable machine learning force fields for organic molecules,” Journal of the American Chemical Society (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [95].Barroso-Luque Luis, Shuaibi Muhammed, Fu Xiang, Wood Brandon M., Dzamba Misko, Gao Meng, Rizvi Ammar, Lawrence Zitnick C, and Ulissi Zachary W., “Open materials 2024 (omat24) inorganic materials dataset and models,” arXiv prerpint arXiv:2410.12771 (2024). [Google Scholar]
- [96].Levine Daniel S., Shuaibi Muhammed, Spotte-Smith Evan Walter Clark, Taylor Michael G., Hasyim Muhammad R., Michel Kyle, Batatia Ilyes, Csányi Gábor, Dzamba Misko, Eastman Peter, Frey Nathan C., Fu Xiang, Gharakhanyan Vahe, Krishnapriyan Aditi S., Rackers Joshua A., Raja Sanjeev, Rizvi Ammar, Rosen Andrew S., Ulissi Zachary, Vargas Santiago, Lawrence Zitnick C, Blau Samuel M., and Wood Brandon M., “The open molecules 2025 (omol25) dataset, evaluations, and models,” arXiv preprint arXiv:2505.08762 (2025). [Google Scholar]
- [97].Cheng Bingqing, Engel Edgar A, Behler Jörg, Dellago Christoph, and Ceriotti Michele, “Ab initio thermodynamics of liquid and solid water,” Proceedings of the National Academy of Sciences 116, 1110–1115 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [98].Cheng Bingqing, Bethkenhagen Mandy, Pickard Chris J, and Hamel Sebastien, “Phase behaviours of superionic water at planetary conditions,” Nature physics 17, 1228–1232 (2021). [Google Scholar]
- [99].Cheng Bingqing, Mazzola Guglielmo, Pickard Chris J, and Ceriotti Michele, “Evidence for supercritical behaviour of high-pressure liquid hydrogen,” Nature 585, 217–220 (2020). [DOI] [PubMed] [Google Scholar]
- [100].Yue Shuwen, Muniz Maria Carolina, Calegari Andrade Marcos F, Zhang Linfeng, Car Roberto, and Panagiotopoulos Athanassios Z, “When do short-range atomistic machine-learning models fall short?” J. Chem. Phys 154, 034111 (2021). [DOI] [PubMed] [Google Scholar]
- [101].Batatia Ilyes, Batzner Simon, Kovács Dávid Péter, Musaelian Albert, Simm Gregor N. C., Drautz Ralf, Ortner Christoph, Kozinsky Boris, and Csányi Gébor, “The design space of e(3)-equivariant atom-centred interatomic potentials,” Nature Machine Intelligence 7, 56–67 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The RPBE-D3 bulk water dataset, training scripts, evaluation scripts, the trained CACE E+F+Qeq model, and CACE LES and MACE LES models used to produce results in Figs. 2c, d, and e are available at https://github.com/ChengUCB/les_fit.
The LES library is publicly available at https://github.com/ChengUCB/les. The CACE package with the LES implementation is available at https://github.com/BingqingCheng/cace. The MACE package with the LES implementation is available at https://github.com/ACEsuit/mace. The NequIP and Allegro LES extension package is available at https://github.com/ChengUCB/NequIP-LES. The MatGL package with the LES implementation is available at https://github.com/ChengUCB/matgl. The UMA package with the LES implementation is available at https://github.com/santi921/fairchem/tree/les_branch.
