Abstract
We present a reformulation of QM/MM as a fully quantum mechanical theory of interacting subsystems, all treated at the level of density functional theory (DFT). For the MM subsystem, which lacks orbitals, we assign an ad hoc electron density and apply orbital-free DFT functionals to describe its quantum properties. The interaction between the QM and MM subsystems is also treated using orbital-free density functionals, accounting for Coulomb interactions, exchange, correlation, and Pauli repulsion. Consistency across QM and MM subsystems is ensured by employing data-driven, many-body MM force fields that faithfully represent DFT functionals. Applications to water-solvated systems demonstrate that this approach achieves unprecedented, very rapid convergence to chemical accuracy as the size of the QM subsystem increases. We validate the method with several pilot studies, including water bulk, water clusters (prism hexamer and pentamers), solvated glucose, a palladium aqua ion, and a wet monolayer of MoS2.
1. Introduction
QM/MM (standing for quantum mechanics/molecular mechanics) has revolutionized computational biochemistry. Since the pioneering work of Honig and Karplus, the combination of a quantum mechanical (QM) description for a subsystem with a classical point-charge description of its environment has led to major breakthroughs in fields such as enzymatics, , drug development, , and materials design. Since its conception, methods handling the QM and MM subsystems have dramatically evolved. Today’s MM force fields can integrate data-driven potentials, polarizable models, − and even machine learning techniques. , QM methods have also evolved dramatically, ranging from DFT to wave function theory methods routinely used in conjunction with QM/MM.
The nature of the QM-MM interaction has also evolved. Initially, these were handled mechanically, with Coulomb interactions calculated a posteriori, influencing only the forces and total energy, but without affecting the QM wave function or density. The advent of electrostatic embedding improved accuracy by incorporating MM partial charges directly into the QM Hamiltonian. Ultimately, mutual QM-MM polarization was achieved using polarizable force fields, , a concept anticipated in early QM/MM work.
The computational cost of QM/MM simulations is greatly reduced compared to fully quantum mechanical treatments. However, incorporating QM/polarizable-MM interactions in an efficient manner remains challenging. Methods based on judicious partitioning of the induction response have demonstrated excellent scalability. , Additionally, algorithmic advances have been supported by steady progress in software development. ,− To extend QM/MM simulations to condensed phases, periodic boundary conditions (PBC) have been implemented. Ewald summation techniques are commonly used, and adaptations for molecular condensed phases − and material systems ,,, are now widely available.
How accurate are QM/MM models? A common way to address this question is to evaluate the convergence of the results with respect to the size of the QM subsystem or to simply compare against full QM calculations for selected model systems. Protein environments, for example, are exceptionally complex, and the search for effective ways to include relevant protein regions in QM/MM simulations continues. − Particularly challenging has been capturing charge transfer interactions on larger scales, , or finding appropriate cluster model systems of enzymatic active sites.
A slow convergence of the QM/MM setup has also affected those systems where partitioning in QM and MM subsystems involves no bond breaking. For example, water solvation. Ironically, independent QM-only or MM-only treatments of liquid water can provide accurate results, but their combination in QM/MM workflows results in an overall reduced accuracy. , Accurately modeling aqueous environments with QM/MM is essential due to the need to consider large water environments to properly account for the static and dynamic responses at water-material interfaces. , Therefore, representing these polarization effects in water bulk, which can extend for several nanometers, is crucial for capturing significant effects on the energetics of solvated species. ,
The culprit is the difficulty to accurately capture QM-MM interactions with a computationally efficient method. The MM subsystem is typically described using methods that largely (or completely) neglect its electronic structure. Point charges or, at times, point polarizable dipoles do not faithfully represent any electronic structure! The polarizable density embedding method tackles this problem by dividing the MM subsystem into two regions: one near the QM subsystem is assigned a QM density derived from isolated fragment calculations, while the remaining MM atoms are treated using conventional point charge or dipole models. This embedding approach improves the accuracy of the QM Hamiltonian by capturing both electrostatic and nonelectrostatic interactions, leading to better results than traditional QM/MM setups. Several related approaches build on the above ideas, such as QM/ESP, QM/GEM, QXD, density embedding methods, and many-body expansions. There have also been efforts to include the quantum mechanical Pauli repulsion in QM/MM, but with mixed success. Some strategies address this issue by parametrizing the mechanical embedding interaction energy without adding new terms to the QM Hamiltonian, − while others also include the effects of Pauli repulsion on the QM system by placing pseudopotentials in the MM region.
Thus, our approach in this work is to treat QM and MM subsystems on a more similar footing, aiming to reduce the impact of an imbalanced QM-MM interface and an imbalanced treatment of the internal energy of QM and MM subsystems. We propose “density-functionalizing” the MM subsystem, assigning it an electron density such that it can be handled like an electronic subsystem within the rigorous framework of subsystem DFT (sDFT). − This standardization of QM and MM subsystems allows for the use of first-principles density functionals for evaluating the QM-MM interaction, inherently capturing all relevant physical effects, such as exchange, correlation, Pauli repulsion, electrostatics, and charge penetration. Accounting for Pauli repulsion is a crucial aspect of sDFT. This is achieved through nonadditive functionals as demonstrated early on in the subsystem DFT literature. In particular, the nonadditive kinetic energy functional formally encodes Pauli repulsion and prevents charge “spill-out” across subsystems. , The next section details the theoretical framework for the density-functionalized QM/MM approach, with additional and less critical details provided in the Supporting Information.
2. Density-Functionalization of QM/MM
The central idea is to assign an electron density to both the QM subsystem, ρQM(r), and the MM subsystem, ρMM(r), with the total electron density given by their sum and the energy functional borrowed from rigorous sDFT −
| 1 |
| 2 |
Although formally the electronic energy is strictly a density functional, in practice, the external potential (electron–nuclear attraction) is known ahead of time and is thus specified for the QM subsystem, v QM(r), and for the MM subsystem, v MM(r), such that the additive part of the energy is given by (disregarding for the time being the nuclear–nuclear repulsion)
| 3 |
| 4 |
where T s, E H and E xc are the noninteracting kinetic energy, the classical electron–electron repulsion (Hartree) and the exchange-correlation functionals, respectively. The nonadditive energy is thus given by
| 5 |
where T s [ρQM, ρMM] = T s[ρQM + ρMM] – T s[ρQM] – T s[ρMM] and equivalently for E xc .
The equations above can be exploited for a variational minimization of the energy functional with respect to variations in both ρQM and ρMM, provided that suitable approximations for the relevant density functionals are available.
When both QM and MM subsystems are described at the Kohn–Sham DFT level, the resulting sDFT framework is highly accurate, especially in regimes of weak intersubsystem interactions. Notably, sDFT with semilocal functionals for E nad has consistently demonstrated excellent agreement with experimental structure factors of condensed-phase systems, such as liquid water, fluid CO2, and solvated ionic species. Sub-1 kcal/mol accuracy is now routinely attained in sDFT calculations, whether using Kohn–Sham subsystems, − orbital-free subsystems, or advanced multiscale schemes. ,− The treatment of hydrogen-bonding interactions, which are central to this work, is particularly reliable when employing the nonadditive PBE exchange-correlation in conjunction with the revAPBEk nonadditive kinetic energy, as well as other GGA nonadditive functionals. Thus, we have confidence that our approach can accurately capture QM-MM interactions for solvated species.
In a QM/MM framework, eq is typically replaced by a classical force field expression. This classical term includes electrostatic interactions between atom-centered point charges and, in some cases, atom-centered polarizable dipoles, as well as empirical expressions for short-range dispersion-repulsion interactions and bonded terms. In most QM/MM implementations, the classical component is handled by a separate tool or software package responsible for evaluating this energy contribution. However, the electrostatic multipoles defined in the MM force field also contribute to the electrostatic terms in the QM-MM interaction energy, as given in eq . To account for short-range dispersion-repulsion interactions and to prevent unphysical behavior when the QM and MM subsystems are close, ad hoc corrections are commonly introduced to replace the first two terms of eq . Once the dependence of eq on the atomic positions of the MM subsystem is established, taking the functional derivative of eq with respect to ρQM(r) enables the optimization of the QM subsystem using a self-consistent field (SCF) approach.
The point charges and point dipoles used to represent the MM subsystem can be parametrized in different ways. They may be adjusted to fit empirical data for the MM system, or optimized to reproduce properties related to the electronic density of the MM components as computed from first-principles, such as binding energies or electrostatic potentials. Since the permanent charges in most MM force fields do not depend on the molecular geometry or environment, atomic polarizabilities are often introduced into the force field to capture the system’s electrostatic response to external fields. In these polarizable force fields, the nonadditive term in eq induces dipoles in the MM subsystem, leading to a situation where the QM and MM subsystems mutually polarize each other.
The standard approximations in conventional QM/MM approaches often result in notable inaccuracies, especially when the QM/MM boundary is not fixed but systematically changes. Motivated by the progress of methods that incorporate explicit electron densities for the MM subsystem, , as well as the robust performance of the sDFT framework, we propose that a density-functional-based QM/MM approach grounded in eqs – can yield highly accurate models of solute–solvent systems. In particular, this framework is expected to be reliable for systems where the QM and MM subsystems interact weakly, provided that well-defined and consistent mappings exist between first-principles electronic densities and the corresponding force-field multipoles. Establishing such mappings enables a seamless and physically consistent description of both forward (from electronic density to multipoles) and backward (from multipoles to electronic density) interactions, which is essential for capturing mutual polarization and accurate QM-MM coupling in complex environments.
For the forward mapping from electronic-structure calculations to classical electrostatics, the many body polarizable approach developed by Paesani and collaborators demonstrates that it is possible to accurately describe statistical properties of bulk systems by carefully parametrizing a classical force field using a large database of high-level few-body first-principles simulations. , The early work on MB-pol focused on coupled cluster calculations for water dimers and trimers, but the same methodology can be applied to DFT-based calculations, as in the case of MB-PBE. The resulting force fields describe electrostatic interactions in terms of permanent charges and polarizabilities. As in similar approaches, short-range corrections are included to prevent self-polarization and to avoid the unphysical divergence of induced dipoles when polarizable sites are very close. In this work, we propose that using an MM subsystem with electrostatic interactions that are fully consistent with the quantum mechanical treatment of the QM region will allow for accurate and seamless convergence of QM/MM calculations as the boundary between the two regions changes.
For the backward mapping, which goes from the classical force field in the MM region to an effective electronic density used in the density functional QM/MM interaction, we adopt the following strategy. We represent the atomic nuclei (of QM and MM subsystems) with pseudopotentials. To keep calculations efficient for large MM subsystems, we use the local part of the ultrasoft pseudopotentials introduced by Garrity and Vanderbilt (GBRV), which are designed for simulations that require low plane wave cutoffs and high throughput. This assignment is carried out efficiently using the particle mesh Ewald method, which scales linearly with system size. For the valence electrons, we convert the point multipoles calculated by the classical force field into a smooth electronic charge density.
For each MM site i, we represent the valence electron density by a Gaussian function centered at the site position, R i , with an adjustable width σ i . Specifically, the density is given by ρ q i (r) = (N i – q i ) g σ i (r – R i ), where g σ i is a normalized Gaussian. The prefactor ensures an accurate description of the ion’s permanent charge q i , while correctly accounting for the total number of valence electrons N i associated with the isolated atom. If the force field includes higher multipoles in its electrostatic interactions, we incorporate these into the reconstructed electron density by using derivatives of the normalized Gaussian. For example, a point dipole can be mapped onto a smooth charge density by taking the gradient of a Gaussian, , where is the induced dipole at site j. Higher-order multipoles can be included in an analogous manner using higher derivatives. The total valence electron density is then given by the sum of these contributions
| 6 |
We stress that the proposed formulation relies on a set of element-specific widths (the σ i/j ) that can significantly affect the final shape of the reconstructed electronic density (both permanent charges and polarization density due to the induced dipoles).
For the forward mapping, we hypothesize that selecting the Gaussian widths to best reproduce the interaction energies between QM/MM and QM/QM (sDFT) calculations will enable an accurate description of mutual polarization effects. This approach should also ensure that QM/MM calculations converge smoothly as the size of the QM subsystem increases.
The functional derivatives of the nonadditive interaction energy with respect to the QM or MM subsystem densities yields the embedding potentials
| 7 |
| 8 |
that can be used to optimize the QM or MM degrees of freedom (densities). In particular, eq enters the Hamiltonian of the QM subsystem at each SCF step and it is used for the calculation of the ground state QM electronic densities. The sDFT framework, together with its implementation in the eDFTpy software, make it possible to couple the QM/MM embedding potential to multiple QM subsystems.
eq can be used by the MM engine to account for the influence of the QM region on induced dipoles and interatomic forces. To include the effect on induced dipoles, the gradient of the MM embedding potential is added to the classical electric field that determines the induced dipoles in the force field. Although all terms in eq should ideally contribute to the embedding field that polarizes the MM subsystem, the current treatment of atomic polarizabilities in polarizable force fields is only well-defined for describing long-range polarization. As a result, in our present implementation, we retain only the first two terms of eq , which correspond to classical electrostatic interactions, and neglect the contributions from the nonadditive kinetic and exchange-correlation terms. These omitted terms mainly affect short-range interactions, where the classical force field approach is less reliable.
The evaluation of the QM additive energy in can utilize the sDFT implementation of eDFTpy with either a single QM subsystem or multiple QM subsystems. When N S QM subsystems are involved, the QM electron density is represented as the sum of contributions from each subsystem, and the QM energy is decomposed into additive and nonadditive terms, following the structure of . Namely
| 9 |
| 10 |
3. Details of the Implementation
We now describe the implementation of the QM/MM algorithm in the eDFTpy software package. Figure shows the overall workflow for the QM/MM SCF cycle when a MM subsystem is present. The process begins with initial guesses for both the MM and QM electron densities and wave functions. In eDFTpy, the QM calculations are performed using QEpy, a Python interface to Quantum ESPRESSO 7.2, while the MM subsystem is treated using a Python interface to MBX. The initial guess for the QM electron density is typically constructed from a sum of atomic densities provided in the pseudopotential files. For the MM subsystem, we use a valence electron density obtained through the backward mapping procedure described above, ensuring that it is consistent with the permanent charges assigned in the force field, which remain fixed and independent of geometry.
1.
Workflow of the “density functionalized” QM/MM method with emphasis on software implementation. See details in section .
eDFTpy then instructs QEpy, the QM solver, to calculate the electronic structure of the QM subsystem. This is done using a single QEpy instance when only one QM subsystem is present, or multiple QEpy instances when several QM subsystems are included. For systems with more than one QM region, eDFTpy is responsible for computing the QM-in-QM embedding potentials, as indicated by the blue arrows connecting the QEpy solvers to the eDFTpy module in Figure . The QM electrostatic field, formally defined as the negative gradient of the MM embedding potential in eq , −∇⃗vemb (r), is evaluated at the sites of the MM polarizable dipoles.
For large systems, representing the QM electric field at the MM sites can be computationally demanding. ,,, To address this, we represent the QM electric field at the real-space grid point nearest to each MM dipole site. This approach is sufficiently accurate due to the smoothness of the QM field in the MM region, and it avoids the need to interpolate the field to the exact positions of the ions, which would require a costly MPI_GATHER operation. In eDFTpy, the MM cell is covered by a real-space grid that is fine enough to accurately represent the ionic pseudopotentials at each QM ion and MM site. In our simulations, we use a grid with a 100 Ha cutoff, which corresponds to a real-space grid spacing of approximately 0.23 a 0.
Once the electric field is represented on the MM sites, the MM solver is tasked with solving the Coulomb problem in the MM cell to yield the MM polarizable dipoles. The MM density is then built using eq . Having the MM density and the density of all QM subsystems allows eDFTpy to compute the required embedding potentials for each subsystem, including the MM subsystem. The SCF cycles then continue until convergence is achieved, which is calibrated on the convergence of the QM density. At convergence, QM density and MM dipoles are fully mutually polarized.
In the eDFTpy implementation we take full advantage of plane wave reduction techniques that were developed for sDFT. ,,− Specifically, MM and QM subsystems are represented by different simulation cells, grids and thus plane wave basis sets. The QM simulation cell is smaller compared to the physical cell which coincides with the MM cell. Coulomb fields and other long-ranged energy functionals are evaluated on the large, physical cell. Employing such a multicell/grid approach is crucial in the context of QM/MM simulations as the MM subsystem is usually dramatically more extended than any of the QM subsystems. In the Supporting Information we further discuss our parallelization strategy.
The current implementation also features analytic energy gradients for the atoms in the QM region for the “charge only” implementation (omitting the MM dipoles) and approximately also for the full polarizable QM/MM implementation. In the Supporting Information we devote a section to the implementation of the forces and results are shown in Tables S3 and S4.
4. Computational Details
4.1. Pseudopotentials, Density Functionals and Plane Wave Cutoffs
GBRV pseudopotentials are used for all elements considered of both the QM and MM subsystems. The PBE exchange-correlation functional and the revAPBEk kinetic energy functional are employed for approximating additive E xc and nonadditive E xc and T s functionals. More nuanced nonadditive functionals could be considered, and they would likely require a functional-specific reparametrization of the MM Gaussian widths. A good design principle for the QM-MM interaction functional is to employ the most accurate nonadditive functionals available for the systems considered. As in this work we consider water-solvated systems, and as revAPBEk was shown to be exceptionally well-suited for this type of systems, − we chose revAPBEk for all examples featured here. All calculations include Grimme’s D3 correction. We choose plane wave basis sets for the QM subsystems with a cutoff energy of 20 Ha for wave functions and 200 Ha for charge density and potential unless otherwise stated (see supporting Table S1). The energy convergence threshold for the SCF was set to 10–8 Ha/atom. The Brillouin zone was sampled by a 5 × 5 × 1 k-point mesh for MoS2 and the Γ point for all other calculations. MM calculations were conducted using the MBX package using the MB-PBE and MB-Pol water models, which were developed to quantitatively reproduce PBE or CCSD(T) water, , respectively.
All inputs/output files, Jupyter notebooks needed to analyze the data and reproduce tables and figures in this work, as well as links to tagged versions of the software (MBX, eDFTpy, QEpy, and DFTpy) used for the simulations, are available as reported in the Supporting Information.
4.2. Parameters Defining the MM Density
As described in the sections above, the proposed framework relies on the conversion of classical permanent charges and polarizable dipole sites into a smooth electronic density. For each permanent charge and polarizable dipole, this involves the fit of the width, σ, of the corresponding normalized Gaussian functions that contribute to the expansion in eq . In our applications to solvated systems, MM sites only involve oxygen and hydrogen atoms, for a total of 4 parameters that need to be fitted.
We also considered an additional MM-induced dipole self-energy correction. Although the induced point-dipole self-energy is given by , where α is the isotropic dipole polarizability, we recognize that the dipoles considered in our work are not point dipoles, e.g., their charge density overlaps with the QM density at the QM/MM interface. Thus, we added an additional term to the self-energy for each site equal to k i |μ⃗ – μ⃗′|2, where μ⃗′ is the induced dipole at the same site when only the MM subsystem is considered, and k i are element-dependent proportionality constants. Additional details about this correction and about how the permanent charge density was generated for the MB-PBE and MB-Pol force fields (which use the off-atom center M-site) are available in the Supporting Information document.
The parameters defining the MM density were fitted so as to reproduce the QM-QM interaction energies for a single water molecule in bulk water. Ten snapshots of 64-water molecule cubic systems were taken from ref . These provided 640 water-bulk interaction energies. More details about this system will be given in the Section . The final values of the parameters for both MB-PBE and MB-Pol force fields are listed in Table S1.
The use of bulk simulations as a reference for the parametrization of the density functionalization approach provides a robust and general strategy that reduces the need for application-specific benchmarks. Results on small water clusters and solvation effects on molecules and materials (reported in the following) highlight the transferability of the obtained parameters.
5. Results and Discussion
5.1. Water Dimer
We start by comparing the QM/MM potential energy curves shown in Figure and the polarization density depicted in Figure for water dimers. The dimer structures, sourced from ref , were placed in a 20 Å cubic simulation cell and evaluated against benchmark QM/QM simulations performed using sDFT.
2.
Water dimer energy curve (structure shown) for O–O distances ranging from 2.3 to 7.7 Å. The black line represents the QM/QM result, where both the donor and acceptor monomers are treated at the QM level. The shaded area represents the deviation of the QM/MM result from QM/QM. Green and blue lines correspond to the QM hydrogen bond donor (QM/MM) and acceptor (MM/QM), respectively. The inset shows a correlation plot of QM/MM and MM/QM interaction energies. Yellow markers represent geometries in the repulsive region of the curve (R O–O < 2.9 Å), while red markers correspond to those in the attractive region (R O–O > 2.9 Å).
3.
Polarization density, defined as ρ(r) – ρiso(r), where ρiso is the sum of the electron densities of the isolated water monomers, for the water dimer at the equilibrium O–O distance. In each panel, we present isosurfaces (top) and contour plots (bottom) generated with a cutoff of ±0.0007 e·a 0 . (a): reference QM/QM calculation; (b): MM/MM calculation carried out with MB-PBE, the dipoles have been represented by smooth Gaussians; (c) QM/MM calculation where the MM water molecule is the hydrogen bond acceptor; (d) MM/QM calculation where the MM water molecule is the hydrogen bond donor.
The key aspect of this system lies in its asymmetry, as one monomer is a hydrogen bond donor, and the other is an acceptor. Even though both QM and MM monomers are modeled by PBE (the MM subsystem is described by the MB-PBE model), the nature of the QM-MM interface is dramatically different whether one considers a QM hydrogen bond donor (QM/MM) or acceptor (MM/QM). Despite this, the curves are remarkably similar. The QM/MM and MM/QM minima (at 2.90 Å O–O distance) are less than 0.15 kcal/mol away from each other, and both deviate from the reference by a similar measure. These results show that the QM/MM interface is extremely well characterized by the nonadditive functionals. Crucially, the repulsive part of the curve is also well reproduced.
Our error of approximately 0.2 kcal/mol compares favorably to the 0.6 kcal/mol error reported in ref . for the dimer using polarizable QM/MM (MM treated with AMOEBA). We also observe about a 1 kcal/mol deviation when using another QM/MM (AMOEBA) implementation in our own calculations, as shown in Figure S1. These results demonstrate that a density functional treatment of the QM/MM interaction provides consistent interaction energies for both QM/MM and MM/QM partitions, whether the donor is treated at the QM level and the acceptor at the MM level, or vice versa. Previous studies have addressed inconsistencies between these two cases by introducing corrective short-range polynomials. In contrast, our formalism achieves this consistency directly, without requiring additional corrections. For completeness, we also compared a charge-only version of our density functional approach. In this implementation, the MM polarizable dipoles are omitted entirely, and only the permanent charges are retained. Specifically, the MM subsystem is represented solely by its electron density and the corresponding pseudopotentials, with no contribution from induced dipoles or polarization effects. We compared this charge-only scheme with the TIP4P model implemented in GPAW (see Figure S1). Although GPAW correctly yields consistent QM/MM and MM/QM results, our method achieves better agreement with the QM/QM reference, consistent with the observations reported by the GPAW authors in their paper.
The inset of Figure presents a correlation plot between the interaction energies obtained for the QM/MM and MM/QM systems, demonstrating that the two are in very close agreement. In the attractive region of the potential energy curve, the interaction energies follow the expected trend, with the data points lying along the diagonal. The repulsive region (highlighted by yellow markers) exhibits a slight deviation from this ideal behavior. This suggests there is some asymmetry in how charge penetration effects are captured for HQM-OMM compared to HMM-OQM. To confirm that our findings are not influenced by artifacts from the use of periodic boundary conditions, we repeated the calculation of the interaction energy minimum in a larger simulation cell with a lattice constant of a = 50 Å. The result changed by less than 10–3 kcal/mol, indicating that finite-cell-size effects are negligible.
While agreement in interaction energies is necessary, it does not guarantee that a model accurately captures the underlying electronic structure. For a more stringent assessment, it is also important to compare the electron densities, and in particular, the polarization density of the system.
Figure presents the polarization density (see caption for the definition) for the water dimer system. The MM polarization reproduces the overall features of the QM polarization only at a qualitative level. A direct comparison of the QM/QM case (panel a) with the MM/MM case (panel b) reveals that the MM approach misses several important details near the ion cores and along the O–O axis. These differences are well documented and reflect known limitations in the electrostatic response of dipole-only polarizable force fields. , Notably, the QM/MM treatment leads to an improved description of the polarization in both monomers. As shown in panels (c) and (d), the MM molecule in the QM/MM setup exhibits polarization features that are prominent in the QM response but absent or only weakly present in the MM/MM case. This demonstrates that our approach handles the QM-MM interface accurately not only in terms of interaction energies (as seen in Figure ) but also by the more stringent test of mutual polarization between subsystems.
5.2. Water Hexamer
Hexamer water structures play a crucial role in quantum chemistry due to their complex hydrogen-bond topologies. Accurately predicting their relative energies is often seen as a benchmark for a model’s ability to represent water across its various stable phases. Here, following ref , we focus on the most stable hexamer, the prism hexamer, to examine the effect of the QM-MM boundary. In the prism hexamer all water monomers act as both hydrogen bond donors and acceptors, with either 1/2 or 2/1 accepted/donated hydrogen bonds.
The root-mean-square errors (RMSEs) of the computed interaction energies defined as the energy of the hexamer minus the energy of the 6 isolated water molecules are collected in Figures S2 and S3 for the MM treated with MB-PBE and MB-Pol, respectively. When multiple water molecules are treated at the QM level, the sDFT framework allows flexibility in how the QM waters are grouped: they can be combined into a single subsystem (Frag. 1 in the figure) or divided into separate subsystems (Frag. 2). For each partition type with k QM or MM water molecules, there are possible members of the partition (i.e., ways to split the hexamer into QM and MM subsystems).
Ideally, all members of each partition should yield the same interaction energy. Therefore, a larger RMSE for the predicted interaction energy indicates a lower accuracy of the method. Recognizing that no practical fragmentation method is perfect, we find a relation between the RMSE for the k-th partition (σQM/MM(k)) and the square root of the number of members in the partition . This relationship is confirmed by R 2 values of 0.99 for all QM/MM and fragmentation methods considered (see Figure S4). The slope of this linear relationship is proportional to the average error per QM/MM boundary in the partitions ( ), yielding a predicted error per boundary ranging between 0.2 and 0.3 kcal/mol for the methods considered. This error is consistent with the results for water dimers and, as we will see in the next section, also aligns with the RMSEs for the pentamer clusters extracted from bulk liquid structures.
5.3. Bulk and First Solvation Shell Water Environment
To further assess our method, we examine the dipole moment of individual water molecules and the interaction energy between them and their environment in bulk liquid water. As a benchmark, we use sDFT, which provides accurate subsystem electron densities and has been validated for various embedded molecular species. ,,, The analysis is based on 10 snapshots from an ab initio molecular dynamics trajectory of 64 water molecules in a cubic cell with a lattice constant of 12.42 Å, , with all configurations provided in the Supporting Information.
We adopt the notation n/m to indicate simulations with n molecules at the QM level and m molecules at the MM level, always totaling 64 water molecules. In each case, we report the interaction of a single water molecule with its environment. For example, in the 1/63 system, one water molecule is described quantum mechanically and the remaining 63 are described by the MM model. The sDFT benchmarks are generated both by treating all 63 environment molecules as a single subsystem (Figure ) and by dividing them into individual subsystems (Figure S5); both approaches yield similar trends.
4.

Panels (a)–(c): correlation plots of the interaction energy (in kcal/mol) of a single water molecule with its environment in a model of liquid water. (a) Bulk: QM/MM with 1 QM water molecule and 63 MM water molecules. (b) First shell: 1 QM water molecule and 4 MM water molecules (only the first solvation shell). (c) Minimal solvation: 5 QM water molecules and 59 MM water molecules. Panels (d)–(f): correlation plots of the dipole moment length of the embedded molecule (in Debye) for the same systems as for panels (a–c).
In addition to the bulk water configurations analyzed using the 1/63 partitioning, we examine two further cases inspired by ref . The first is the “first shell” configuration (1/4), in which each water molecule is surrounded only by its four nearest neighbors, resulting in a collection of isolated water pentamers rather than a bulk environment. The second is the “minimal solvation” setup (5/59), which is still representative of bulk water. Here, the target molecule and its four nearest neighbors are all treated at the QM level, with each water molecule modeled as a separate sDFT subsystem, while the remaining 59 molecules are treated at the MM level. For each scenario, we consider all water molecules in each of the 10 snapshots, resulting in 640 structures per case.
Panels (a) and (b) of Figure show that the interaction energy between a water molecule and its environment in liquid water ranges from −35 to −7 kcal/mol in the bulk calculations, and from −20 to −1 kcal/mol in the first shell calculations. These energy ranges are consistent with values reported in the literature. As indicated in the figure, the root-mean-square errors (RMSEs) for the QM/MM interaction energies relative to the QM/QM benchmarks are 1.32 kcal/mol for the bulk system and 1.05 kcal/mol for the first shell. The minimal solvation setup shows an intermediate RMSE of 1.11 kcal/mol, improving on the bulk result. The RMSE for the first shell simulations is also comparable to the 1.4 kcal/mol reported for similar systems using the AMOEBA force field in ref . To our knowledge, there are no directly comparable results for the bulk or minimal solvation cases in the existing literature.
These RMSE values are also consistent with the accuracy achieved for the hexamer structure discussed earlier. Achieving an accuracy of about 1 kcal/mol for bulk systems is particularly notable, given that the average interaction energy per water molecule is significantly higher in bulk water (about 22.42 kcal/mol for QM–QM interactions) than in the hexamer (about 8.00 kcal/mol per water, using mbpol). This level of accuracy indicates that the mutual polarization between the QM and MM regions in both the bulk and minimal solvation setups is well captured by our QM/MM approach.
As illustrated in Figure , the polarization density of both the embedded water molecule (top panels) and the surrounding environment (bottom panels) is reproduced with reasonable fidelity in the QM/MM simulations. A noteworthy feature in the MM polarization of the environment is the relatively weak contribution from the second solvation shell compared to the QM/QM reference. This effect can be attributed to the use of slightly damped atomic dipole polarizabilities in typical polarizable force fields, as discussed in refs , Damping these polarizabilities is essential to avoid overpolarization, which in our context would otherwise arise from neglecting the nonadditive components of the embedding potential (see eq ). For completeness, we show the polarization density from QM/MM with MB-pol in supporting figure S6. In future work, it will be valuable to examine how including these additional terms affects MM polarization and the choice of atomic dipole polarizabilities.
5.
Polarization density (defined as the difference of the embedded and isolated molecular electron density) of an embedded water molecule employing the methods indicated in the figure. Top panels: single water molecule polarization. Bottom panels: environment polarization. The isosurface value is set to ±0.0025 e·a 0 .
Panels (d)–(f) of Figure show the magnitude of the dipole moment for the central, embedded water molecule. The values range from 1.6 to 4.5 D in both the 1/63 and 5/59 bulk simulations, and from 1.3 to 3.5 D in the 1/4 first-shell calculations. Overall, the dipole moments are somewhat underestimated in the QM/MM simulations compared to the QM/QM reference. Despite this, the QM/MM and QM/QM (sDFT) dipole moments remain strongly correlated, with Pearson’s correlation coefficients above 0.9. For the 1/4 calculations, we find a root-mean-square error (RMSE) of 0.29 D, which compares favorably to the 0.62 D reported in ref . Together with the interaction energy results discussed above, these findings support our claim that our method achieves the highest accuracy for QM/MM simulations of liquid water reported to date.
Figure also includes results for the charge-only implementation (red diamonds), in which the polarization of the MM subsystem is neglected by setting the polarizable dipoles to zero. For the first shell systems (1/4), the charge-only approach still yields reasonable results, with a dipole correlation of 0.91, a dipole RMSE of 0.42 D, and an interaction energy RMSE of 1.73 kcal/mol. However, for the bulk system, this approach performs poorly, as indicated by a significant increase in the interaction energy RMSE to 7.33 kcal/mol. This level of error is similar to what is typically observed in standard electrostatic embedding QM/MM simulations of condensed phases. ,
Additionally, since MB-Pol provides a highly accurate benchmark for interaction energies, we include a comparison of our QM/MM interaction energies for the 1/63 system with MB-Pol reference values in supporting Figure S7.
5.4. Convergence with Respect to the QM Size
An important test for QM/MM simulations of solvated species is the convergence with respect to the number of water molecules included in the QM subsystem. As mentioned in the Section , having an accurate model for the QM-MM interface and employing an MM force field that is consistent with the QM method (we use the MB-PBE force field) should allow us to showcase strong QM/MM convergence. The typical target is a ±2 kcal/mol from the reference QM calculation. In Figure , we present the interaction energies of glucose (left panel) and the [Pd(H2O)4]2+ aqua ion, counterbalanced by 2 Cl– ions (right panel), with the water bulk environment as a function of the number of water molecules included in the QM subsystem. For glucose (a neutral molecule), the figure clearly shows that the ±2 kcal/mol target is reached (in fact, a ±1 kcal/mol is achieved) already when only 14 water molecules are included in the QM subsystem. This amounts to including less than the first solvation shell. Interestingly, the convergence for this system is much worse when a charge-only MM model is used (i.e., as done before, we simply neglect the polarization of the MM subsystem) for which 26 QM water molecules are needed to reach convergence. For PdCl2, despite the presence of a doubly charged cation, we find an essentially identical behavior with convergence being reached already with 13 QM water molecules. Conversely, the charge only QM/MM method does not reach the ±2 kcal/mol goal even when 46 QM water molecules are included. Therefore, we conclude that similar to the glucose system, including polarization in the MM subsystem dramatically improves the convergence of the interaction energy with the size of the QM subsystem.
6.

Convergence of the glucose (left panel) and the [Pd(H2O)4]2+ aqua ion (counterbalanced by 2 Cl– ions, right panel) solute-water interaction energy with respect to the number of water molecules included in the QM subsystem. An orange bar representing the sDFT reference is labeled as QM/QM in each plot. The dot-dashed line marks ±2 kcal/mol, and the shade marks ±1 kcal/mol window from the sDFT reference.
Our results on the need to include polarization in the MM subsystem find justification in the fact that, for a condensed phase system, including the inductive response of the environment, is crucial to obtain a physical picture, particularly for charged solutes.
In supporting Figure S8, we also present the polarization density of glucose and its water environment, showing once again that the polarization of the QM/MM model qualitatively reproduces the polarization of the reference QM/QM simulation.
5.5. Charge Spill-Out Effects
As discussed in the Section , properly accounting for Pauli repulsion between the QM and MM subsystems is essential to prevent the well-known charge spill-out effect in the QM region. Our method addresses this by including nonadditive functionals, particularly the nonadditive kinetic energy functional, which introduces a repulsive term in both the energy and the potential felt by the QM electrons (see the term in eq ).
To illustrate the impact of this repulsive potential, we performed tests on the solvated PdCl2 system, comparing simulations with and without the inclusion of nonadditive functionals and their corresponding potentials (see Figure S9). The difference is clear: when the nonadditive terms are omitted, significant charge spill-out occurs from the Cl anions. This effect is mainly due to the absence of Pauli repulsion and is further exacerbated by the use of the PBE exchange-correlation functional, which, because of self-interaction error, causes Cl anions in vacuum to have partially unbound electrons and by the spatially delocalized nature of the basis set (plane waves). In the condensed phase, this problem is largely mitigated.
We also show an example of wet MoS2 surfaces in the Supporting Information document.
6. Conclusion
We introduced a novel QM/MM framework that incorporates density-functional theory (DFT) for the QM and the MM subsystems that leverages an orbital-free treatment for the MM region and the QM-MM interaction. By assigning an electron density to the MM subsystem and accurately capturing nonadditive interactions (such as exchange, correlation, Coulomb, and Pauli repulsion effects) our density-functionalized QM/MM approach achieves chemical accuracy in modeling the interactions between complex solute–solvent systems. We validated the approach against a variety of water-based systems, including water clusters, bulk water, solvated ions, and a wet monolayer of MoS2, demonstrating consistent accuracy and achieving unprecedented fast convergence to chemical accuracy with respect to increasing size of the QM subsystem. Reaching chemical accuracy with the proposed method requires only the inclusion of the first solvation shell in the QM region–a significant advancement over traditional QM/MM schemes.
Particularly striking is the performance of our density functionalized QM/MM method for H2O in water. There we find that the excellent performance of the method for clusters (dimers, pentamers and hexamers) seamlessly translates to accurate models of bulk liquid water. A prime example is given by the dipole of the solute molecule which we predict an RMSE with QM/MM of 0.29 D, or 12%, for both bulk and pentamers compared to 0.624 D for pentamers from ref .
Our results highlight the critical role of (1) employing ab initio density functionals for the QM-MM interactions instead of ad-hoc parametrizations; (2) properly accounting for mutual polarization at the QM-MM interface; and (3) employing MM force fields that are consistent with the QM method employed (here we use MB-PBE for MM and DFT with the PBE exchange-correlation functional for QM). We found that following these three principles significantly improves convergence with respect to the size of the QM region compared to standard QM/MM methods. Our pilot simulations have showed that for both neutral and charged solutes, the interaction energies reached the target accuracy of within ±2 kcal/mol using a minimal QM region, with further refinement yielding sub-1 kcal/mol errors when merely the first shell of solvent molecules is included in the QM subsystem. This level of accuracy, achieved even in bulk water systems, underscores the robustness of our method for simulating condensed-phase environments.
Overall, this work presents a significant step forward in extending the applicability of QM/MM methods to treat larger, more complex systems than typically approached by standard QM methods while maintaining chemical accuracy. Future work will explore the impact of including beyond-Coulomb, ab initio terms in the MM embedding potential, so that the MM dipole response can more closely resemble the true electronic response of the MM subsystem. We also plan to apply the density-functionalized QM/MM framework to chemical environments other than water, for example those provided by biomolecules and materials interfaces as well as nonaqueous solvents.
Supplementary Material
Acknowledgments
This research was partially funded by the U.S. National Science Foundation grants No. CHE-2154760 (M.P., J.M.B. and X.C.), OAC-2321103 (M.P., J.M.B. and X.C.), OAC-2311260 (M.R. and F.P.) and CHE-2306929 and OAC-2321102 (O.A.). All computations were carried out on the Price supercomputer of Rutgers University-Newark acquired through an NSF MRI grant No. OAC-2117429 (M.P.) and managed by the Office of Advanced Research Computing at Rutgers.
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.5c01440.
Additional tables and figures and additional information related to (1) the definition of the QM-MM interaction functional; (2) the evaluation of atomic forces; (3) eDFTpy’s parallelization strategy, and (4) a calculation of wet MoS2 surfaces (PDF)
#.
X.C., J.A.M.B., and X.S. contributed equally to this work.
The authors declare no competing financial interest.
References
- Karplus, M. Development of Multiscale Models for Complex Chemical Systems: From H+H2 to Biomolecules (Nobel Lecture), 2013. https://www.nobelprize.org/uploads/2018/06/karplus-lecture.pdf. [DOI] [PubMed]
- Honig B., Karplus M.. Implications of torsional potential of retinal isomers for visual excitation. Nature. 1971;229:558–560. doi: 10.1038/229558a0. [DOI] [PubMed] [Google Scholar]
- Senn H. M., Thiel W.. QM/MM studies of enzymes. Curr. Opin. Chem. Biol. 2007;11:182–187. doi: 10.1016/j.cbpa.2007.01.684. [DOI] [PubMed] [Google Scholar]
- Kulik H. J.. Large-scale QM/MM free energy simulations of enzyme catalysis reveal the influence of charge transfer. Phys. Chem. Chem. Phys. 2018;20:20650–20660. doi: 10.1039/C8CP03871F. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raghavan B., Paulikat M., Ahmad K., Callea L., Rizzi A., Ippoliti E., Mandelli D., Bonati L., De Vivo M., Carloni P.. Drug Design in the Exascale Era: A Perspective from Massively Parallel QM/MM Simulations. J. Chem. Inf. Model. 2023;63:3647–3658. doi: 10.1021/acs.jcim.3c00557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin H., Zhang Y., Pezeshki S., Duster A. W., Wang B., Wu X.-P., Zheng S.-W., Gagliardi L., Truhlar D. G.. QMMM 2023: A program for combined quantum mechanical and molecular mechanical modeling and simulations. Comput. Phys. Commun. 2024;295:108987. doi: 10.1016/j.cpc.2023.108987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ma C., Martin-Samos L., Fabris S., Laio A., Piccinin S.. QMMMW: A wrapper for QM/MM simulations with Quantum ESPRESSO and LAMMPS. Comput. Phys. Commun. 2015;195:191–198. doi: 10.1016/j.cpc.2015.04.024. [DOI] [Google Scholar]
- Lambros E., Lipparini F., Cisneros G. A., Paesani F.. A Many-Body, Fully Polarizable Approach to QM/MM Simulations. J. Chem. Theory Comput. 2020;16:7462–7472. doi: 10.1021/acs.jctc.0c00932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gresh N., Cisneros G. A., Darden T. A., Piquemal J.-P.. Anisotropic, Polarizable Molecular Mechanics Studies of Inter- and Intramolecular Interactions and Ligand-Macromolecule Complexes. A Bottom-Up Strategy. J. Chem. Theory Comput. 2007;3:1960–1986. doi: 10.1021/ct700134r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu Z., Zhang Y.. Interfacing ab Initio Quantum Mechanical Method with Classical Drude Osillator Polarizable Model for Molecular Dynamics Simulation of Chemical Reactions. J. Chem. Theory Comput. 2008;4:1237–1248. doi: 10.1021/ct800116e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ganguly A., Boulanger E., Thiel W.. Importance of MM Polarization in QM/MM Studies of Enzymatic Reactions: Assessment of the QM/MM Drude Oscillator Model. J. Chem. Theory Comput. 2017;13:2954–2961. doi: 10.1021/acs.jctc.7b00016. [DOI] [PubMed] [Google Scholar]
- Loco D., Lagardère L., Caprasecca S., Lipparini F., Mennucci B., Piquemal J.-P.. Hybrid QM/MM Molecular Dynamics with AMOEBA Polarizable Embedding. J. Chem. Theory Comput. 2017;13:4025–4033. doi: 10.1021/acs.jctc.7b00572. [DOI] [PubMed] [Google Scholar]
- Bondanza M., Nottoli M., Cupellini L., Lipparini F., Mennucci B.. Polarizable embedding QM/MM: the future gold standard for complex (bio)systems? Phys. Chem. Chem. Phys. 2020;22:14433–14448. doi: 10.1039/D0CP02119A. [DOI] [PubMed] [Google Scholar]
- Reinholdt P., Kongsted J., Lipparini F.. Fast approximate but accurate QM/MM interactions for polarizable embedding. J. Chem. Theory Comput. 2022;18:344–356. doi: 10.1021/acs.jctc.1c01037. [DOI] [PubMed] [Google Scholar]
- Giovannini T., Puglisi A., Ambrosetti M., Cappelli C.. Polarizable QM/MM Approach with Fluctuating Charges and Fluctuating Dipoles: The QM/FQFμ Model. J. Chem. Theory Comput. 2019;15:2233–2245. doi: 10.1021/acs.jctc.8b01149. [DOI] [PubMed] [Google Scholar]
- Zinovjev K.. Electrostatic embedding of machine learning potentials. J. Chem. Theory Comput. 2023;19:1888–1897. doi: 10.1021/acs.jctc.2c00914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giese T. J., Zeng J., Lerew L., McCarthy E., Tao Y., Ekesan S., York D. M.. Software Infrastructure for Next-Generation QM/MM- ΔMLP Force Fields. J. Phys. Chem. B. 2024;128:6257–6271. doi: 10.1021/acs.jpcb.4c01466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ratcliff L. E., Mohr S., Huhs G., Deutsch T., Masella M., Genovese L.. Challenges in large scale quantum mechanical calculations. WIREs Comput. Mol. Sci. 2017;7:e1290. doi: 10.1002/wcms.1290. [DOI] [Google Scholar]
- Groenhof, G. Introduction to QM/MM simulations. In Methods in Molecular Biology; Springer, 2013; Vol. 924, pp 43–66. [DOI] [PubMed] [Google Scholar]
- Warshel A., Levitt M.. Theoretical studies of enzymic reactions: dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme. J. Mol. Biol. 1976;103:227–249. doi: 10.1016/0022-2836(76)90311-9. [DOI] [PubMed] [Google Scholar]
- Lipparini F.. General linear scaling implementation of polarizable embedding schemes. J. Chem. Theory Comput. 2019;15:4312–4317. doi: 10.1021/acs.jctc.9b00585. [DOI] [PubMed] [Google Scholar]
- Bondanza M., Nottoli T., Nottoli M., Cupellini L., Lipparini F., Mennucci B.. The OpenMMPol library for polarizable QM/MM calculations of properties and dynamics. J. Chem. Phys. 2024;160:134106. doi: 10.1063/5.0198251. [DOI] [PubMed] [Google Scholar]
- Olsen J. M. H., Bolnykh V., Meloni S., Ippoliti E., Bircher M. P., Carloni P., Rothlisberger U.. MiMiC: a novel framework for multiscale modeling in computational chemistry. J. Chem. Theory Comput. 2019;15:3810–3823. doi: 10.1021/acs.jctc.9b00093. [DOI] [PubMed] [Google Scholar]
- Cruzeiro V. W. D., Manathunga M., Merz K. M. Jr, Götz A. W.. Open-source multi-GPU-accelerated QM/MM simulations with AMBER and QUICK. J. Chem. Inf. Model. 2021;61:2109–2115. doi: 10.1021/acs.jcim.1c00169. [DOI] [PubMed] [Google Scholar]
- Pederson J. P., McDaniel J. G.. PyDFT-QMMM: A modular, extensible software framework for DFT-based QM/MM molecular dynamics. J. Chem. Phys. 2024;161:034103. doi: 10.1063/5.0219851. [DOI] [PubMed] [Google Scholar]
- Dziedzic J., Mao Y., Shao Y., Ponder J., Head-Gordon T., Head-Gordon M., Skylaris C.-K.. TINKTEP: A fully self-consistent, mutually polarizable QM/MM approach based on the AMOEBA force field. J. Chem. Phys. 2016;145:124106. doi: 10.1063/1.4962909. [DOI] [PubMed] [Google Scholar]
- Lu Y., Sen K., Yong C., Gunn D. S., Purton J. A., Guan J., Desmoutier A., Nasir J. A., Zhang X., Zhu L.. et al. Multiscale QM/MM modelling of catalytic systems with ChemShell. Phys. Chem. Chem. Phys. 2023;25:21816–21835. doi: 10.1039/D3CP00648D. [DOI] [PubMed] [Google Scholar]
- Dohn A. O., Jónsson E. O., Levi G., Mortensen J. J., Lopez-Acevedo O., Thygesen K. S., Jacobsen K. W., Ulstrup J., Henriksen N. E., Møller K., Jónsson H.. Grid-based projector augmented wave (GPAW) implementation of quantum mechanics/molecular mechanics (QM/MM) electrostatic embedding and application to a solvated diplatinum complex. J. Chem. Theory Comput. 2017;13:6010–6022. doi: 10.1021/acs.jctc.7b00621. [DOI] [PubMed] [Google Scholar]
- Nam K., Gao J., York D. M.. An Efficient Linear-Scaling Ewald Method for Long-Range Electrostatic Interactions in Combined QM/MM Calculations. J. Chem. Theory Comput. 2005;1:2–13. doi: 10.1021/ct049941i. [DOI] [PubMed] [Google Scholar]
- Holden Z. C., Richard R. M., Herbert J. M.. Periodic boundary conditions for QM/MM calculations: Ewald summation for extended Gaussian basis sets. J. Chem. Phys. 2013;139:244108. doi: 10.1063/1.4850655. [DOI] [PubMed] [Google Scholar]
- Bonfrate S., Ferré N., Huix-Rotllant M.. Analytic Gradients for the Electrostatic Embedding QM/MM Model in Periodic Boundary Conditions Using Particle-Mesh Ewald Sums and Electrostatic Potential Fitted Charge Operators. J. Chem. Theory Comput. 2024;20:4338–4349. doi: 10.1021/acs.jctc.4c00201. [DOI] [PubMed] [Google Scholar]
- Pederson J. P., McDaniel J. G.. DFT-based QM/MM with particle-mesh Ewald for direct, long-range electrostatic embedding. J. Chem. Phys. 2022;156:174105. doi: 10.1063/5.0087386. [DOI] [PubMed] [Google Scholar]
- Laino T., Mohamed F., Laio A., Parrinello M.. An efficient linear-scaling electrostatic coupling for treating periodic boundary conditions in QM/MM simulations. J. Chem. Theory Comput. 2006;2:1370–1378. doi: 10.1021/ct6001169. [DOI] [PubMed] [Google Scholar]
- Laino T., Mohamed F., Laio A., Parrinello M.. An efficient real space multigrid QM/MM electrostatic coupling. J. Chem. Theory Comput. 2005;1:1176–1184. doi: 10.1021/ct050123f. [DOI] [PubMed] [Google Scholar]
- Ho J., Yu H., Shao Y., Taylor M., Chen J.. How Accurate Are QM/MM Models? J. Phys. Chem. A. 2025;129:1517–1528. doi: 10.1021/acs.jpca.4c06521. [DOI] [PubMed] [Google Scholar]
- Brandt F., Jacob C. R.. Systematic QM region construction in QM/MM calculations based on uncertainty quantification. J. Chem. Theory Comput. 2022;18:2584–2596. doi: 10.1021/acs.jctc.1c01093. [DOI] [PubMed] [Google Scholar]
- Pedraza-González L., Marín M. D. C., Jorge A. N., Ruck T. D., Yang X., Valentini A., Olivucci M., De Vico L.. Web-ARM: a web-based interface for the automatic construction of QM/MM models of rhodopsins. J. Chem. Inf. Model. 2020;60:1481–1493. doi: 10.1021/acs.jcim.9b00615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kulik H. J., Zhang J., Klinman J. P., Martínez T. J.. How large should the QM region be in QM/MM calculations? The case of catechol O-methyltransferase. J. Phys. Chem. B. 2016;120:11381–11394. doi: 10.1021/acs.jpcb.6b07814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karelina M., Kulik H. J.. Systematic quantum mechanical region determination in QM/MM simulation. J. Chem. Theory Comput. 2017;13:563–576. doi: 10.1021/acs.jctc.6b01049. [DOI] [PubMed] [Google Scholar]
- Summers T. J., Cheng Q., Palma M. A., Pham D.-T., Kelso D. K., Webster C. E., DeYonker N. J.. Cheminformatic quantum mechanical enzyme model design: A catechol-O-methyltransferase case study. Biophys. J. 2021;120:3577–3587. doi: 10.1016/j.bpj.2021.07.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pérez-Barcia Á., Cárdenas G., Nogueira J. J., Mandado M.. Effect of the QM size, basis set, and polarization on QM/MM interaction energy decomposition analysis. J. Chem. Inf. Model. 2023;63:882–897. doi: 10.1021/acs.jcim.2c01184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dziedzic J., Head-Gordon T., Head-Gordon M., Skylaris C.-K.. Mutually polarizable QM/MM model with in situ optimized localized basis functions. J. Chem. Phys. 2019;150:074103. doi: 10.1063/1.5080384. [DOI] [PubMed] [Google Scholar]
- Moscato D., Mandelli G., Bondanza M., Lipparini F., Conte R., Mennucci B., Ceotto M.. Unraveling Water Solvation Effects with Quantum Mechanics/Molecular Mechanics Semiclassical Vibrational Spectroscopy: The Case of Thymidine. J. Am. Chem. Soc. 2024;146:8179–8188. doi: 10.1021/jacs.3c12700. [DOI] [PubMed] [Google Scholar]
- Gómez S., Giovannini T., Cappelli C.. Multiple facets of modeling electronic absorption spectra of systems in solution. ACS Phys. Chem. Au. 2023;3:1–16. doi: 10.1021/acsphyschemau.2c00050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coons M. P., Herbert J. M.. Quantum chemistry in arbitrary dielectric environments: Theory and implementation of nonequilibrium Poisson boundary conditions and application to compute vertical ionization energies at the air/water interface. J. Chem. Phys. 2018;148:222834. doi: 10.1063/1.5023916. [DOI] [PubMed] [Google Scholar]
- Martinez B J. A., Paetow L., Tölle J., Shao X., Ramos P., Neugebauer J., Pavanello M.. Which Physical Phenomena Determine the Ionization Potential of Liquid Water? J. Phys. Chem. B. 2023;127:5470–5480. doi: 10.1021/acs.jpcb.2c07639. [DOI] [PubMed] [Google Scholar]
- Olsen J. M. H., Steinmann C., Ruud K., Kongsted J.. Polarizable density embedding: A new QM/QM/MM-based computational strategy. J. Phys. Chem. A. 2015;119:5344–5355. doi: 10.1021/jp510138k. [DOI] [PubMed] [Google Scholar]
- Kvedaraviciute S., Carrasco-Busturia D., Møller K. B., Olsen J. M. H.. Polarizable Embedding without Artificial Boundary Polarization. J. Chem. Theory Comput. 2023;19:5122–5141. doi: 10.1021/acs.jctc.3c00434. [DOI] [PubMed] [Google Scholar]
- Viquez Rojas C. I., Slipchenko L. V.. Exchange repulsion in quantum mechanical/effective fragment potential excitation energies: Beyond polarizable embedding. J. Chem. Theory Comput. 2020;16:6408–6417. doi: 10.1021/acs.jctc.9b01156. [DOI] [PubMed] [Google Scholar]
- Gökcan H., Kratz E., Darden T. A., Piquemal J.-P., Cisneros G. A.. QM/MM simulations with the Gaussian electrostatic model: A density-based polarizable potential. J. Phys. Chem. Lett. 2018;9:3062–3067. doi: 10.1021/acs.jpclett.8b01412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuechler E. R., Giese T. J., York D. M.. Charge-dependent many-body exchange and dispersion interactions in combined QM/MM simulations. J. Chem. Phys. 2015;143:234111. doi: 10.1063/1.4937166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Treß R. S., Hattig C., Hofener S.. Employing pseudopotentials to tackle excited-state electron spill-out in frozen density embedding calculations. J. Chem. Theory Comput. 2022;18:1737–1747. doi: 10.1021/acs.jctc.1c00732. [DOI] [PubMed] [Google Scholar]
- Gillan M. J., Alfè D., Bygrave P., Taylor C., Manby F.. Energy benchmarks for water clusters and ice structures from an embedded many-body expansion. J. Chem. Phys. 2013;139:114101. doi: 10.1063/1.4820906. [DOI] [PubMed] [Google Scholar]
- Ben-Nun M., Martínez T. J.. Direct evaluation of the Pauli repulsion energy using classical wavefunctions in hybrid quantum/classical potential energy surfaces. Chem. Phys. Lett. 1998;290:289–295. doi: 10.1016/S0009-2614(98)00486-2. [DOI] [Google Scholar]
- Giovannini T., Lafiosca P., Cappelli C.. A General Route to Include Pauli Repulsion and Quantum Dispersion Effects in QM/MM Approaches. J. Chem. Theory Comput. 2017;13:4854–4870. doi: 10.1021/acs.jctc.7b00776. [DOI] [PubMed] [Google Scholar]
- Bedrov D., Piquemal J.-P., Borodin O., MacKerell A. D. J., Roux B., Schröder C.. Molecular Dynamics Simulations of Ionic Liquids and Electrolytes Using Polarizable Force Fields. Chem. Rev. 2019;119:7940–7995. doi: 10.1021/acs.chemrev.8b00763. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hršak D., Olsen J. M. H., Kongsted J.. Optimization and transferability of non-electrostatic repulsion in the polarizable density embedding model. J. Comput. Chem. 2017;38:2108–2117. doi: 10.1002/jcc.24859. [DOI] [PubMed] [Google Scholar]
- Khah A. M., Reinholdt P., Olsen J. M. H., Kongsted J., Hattig C.. Avoiding electron spill-out in QM/MM calculations on excited states with simple pseudopotentials. J. Chem. Theory Comput. 2020;16:1373–1381. doi: 10.1021/acs.jctc.9b01162. [DOI] [PubMed] [Google Scholar]
- Krishtal A., Sinha D., Genova A., Pavanello M.. Subsystem density-functional theory as an effective tool for modeling ground and excited states, their dynamics and many-body interactions. J. Phys.:Condens. Matter. 2015;27:183202. doi: 10.1088/0953-8984/27/18/183202. [DOI] [PubMed] [Google Scholar]
- Jacob C. R., Neugebauer J.. Subsystem density-functional theory. WIREs Comput. Mol. Sci. 2014;4:325–362. doi: 10.1002/wcms.1175. [DOI] [Google Scholar]
- Wesolowski T. A., Shedge S., Zhou X.. Frozen-density embedding strategy for multilevel simulations of electronic structure. Chem. Rev. 2015;115:5891–5928. doi: 10.1021/cr500502v. [DOI] [PubMed] [Google Scholar]
- Mi W., Luo K., Trickey S., Pavanello M.. Orbital-free density functional theory: An attractive electronic structure method for large-scale first-principles simulations. Chem. Rev. 2023;123:12039–12104. doi: 10.1021/acs.chemrev.2c00758. [DOI] [PubMed] [Google Scholar]
- Jacob C. R., Neugebauer J.. Subsystem density-functional theory (update) WIREs Comput. Mol. Sci. 2024;14:e1700. doi: 10.1002/wcms.1700. [DOI] [Google Scholar]
- Dułak M., Wesołowski T. A.. On the electron leak problem in orbital-free embedding calculations. J. Chem. Phys. 2006;124:164101. doi: 10.1063/1.2189228. [DOI] [PubMed] [Google Scholar]
- Pavanello M., Neugebauer J.. Modelling charge transfer reactions with the frozen density embedding formalism. J. Chem. Phys. 2011;135:234103. doi: 10.1063/1.3666005. [DOI] [PubMed] [Google Scholar]
- Solovyeva A., Pavanello M., Neugebauer J.. Spin densities from subsystem density-functional theory: Assessment and application to a photosynthetic reaction center complex model. J. Chem. Phys. 2012;136:194104. doi: 10.1063/1.4709771. [DOI] [PubMed] [Google Scholar]
- Supporting Information. 2025; See Supporting Information document.
- Genova A., Ceresoli D., Pavanello M.. Avoiding fractional electrons in subsystem DFT based ab-initio molecular dynamics yields accurate models for liquid water and solvated OH radical. J. Chem. Phys. 2016;144:234105. doi: 10.1063/1.4953363. [DOI] [PubMed] [Google Scholar]
- Mi W., Ramos P., Maranhao J., Pavanello M.. Ab initio structure and dynamics of CO2 at supercritical conditions. J. Phys. Chem. Lett. 2019;10:7554–7559. doi: 10.1021/acs.jpclett.9b03054. [DOI] [PubMed] [Google Scholar]
- Chen X., Cifuentes-Lopez A., Shao X., Lin L., Prokopchuk D., Pavanello M.. Unraveling the Hydration Shell Structure and Dynamics of Group 10 Aqua Ions. J. Phys. Chem. Lett. 2024;15:5517–5528. doi: 10.1021/acs.jpclett.4c00464. [DOI] [PubMed] [Google Scholar]
- Shao X., Mi W., Pavanello M.. GGA-level subsystem DFT achieves Sub-kcal/mol accuracy intermolecular interactions by mimicking nonlocal functionals. J. Chem. Theory Comput. 2021;17:3455–3461. doi: 10.1021/acs.jctc.1c00283. [DOI] [PubMed] [Google Scholar]
- Mi W., Pavanello M.. Nonlocal subsystem density functional theory. J. Phys. Chem. Lett. 2020;11:272–279. doi: 10.1021/acs.jpclett.9b03281. [DOI] [PubMed] [Google Scholar]
- Schlüns D., Klahr K., Mück-Lichtenfeld C., Visscher L., Neugebauer J.. Subsystem-DFT potential-energy curves for weakly interacting systems. Phys. Chem. Chem. Phys. 2015;17:14323–14341. doi: 10.1039/C4CP04936E. [DOI] [PubMed] [Google Scholar]
- Shao X., Mi W., Pavanello M.. Density embedding method for nanoscale molecule-metal interfaces. J. Phys. Chem. Lett. 2022;13:7147–7154. doi: 10.1021/acs.jpclett.2c01424. [DOI] [PubMed] [Google Scholar]
- Shao X., Lopez A. C., Khan Musa M. R., Nouri M. R., Pavanello M.. Adaptive subsystem density functional theory. J. Chem. Theory Comput. 2022;18:6646–6655. doi: 10.1021/acs.jctc.2c00698. [DOI] [PubMed] [Google Scholar]
- Schmitt-Monreal D., Jacob C. R.. Density-based many-body expansion as an efficient and accurate quantum-chemical fragmentation method: Application to water clusters. J. Chem. Theory Comput. 2021;17:4144–4156. doi: 10.1021/acs.jctc.1c00340. [DOI] [PubMed] [Google Scholar]
- Focke K., Jacob C. R.. Coupled-cluster density-based many-body expansion. J. Phys. Chem. A. 2023;127:9139–9148. doi: 10.1021/acs.jpca.3c04591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kevorkyants R., Dulak M., Wesolowski T. A.. Interaction energies in hydrogen-bonded systems: A testing ground for subsystem formulation of density-functional theory. J. Chem. Phys. 2006;124:024104. doi: 10.1063/1.2150820. [DOI] [PubMed] [Google Scholar]
- Cisneros G. A.. Application of Gaussian Electrostatic Model (GEM) Distributed Multipoles in the AMOEBA Force Field. J. Chem. Theory Comput. 2012;8:5072–5080. doi: 10.1021/ct300630u. [DOI] [PubMed] [Google Scholar]
- Babin V., Leforestier C., Paesani F.. Development of a “First Principles” Water Potential with Flexible Monomers: Dimer Potential Energy Surface, VRT Spectrum, and Second Virial Coefficient. J. Chem. Theory Comput. 2013;9:5395–5403. doi: 10.1021/ct400863t. [DOI] [PubMed] [Google Scholar]
- Babin V., Medders G. R., Paesani F.. Development of a “First Principles” Water Potential with Flexible Monomers. II: Trimer Potential Energy Surface, Third Virial Coefficient, and Small Clusters. J. Chem. Theory Comput. 2014;10:1599–1607. doi: 10.1021/ct500079y. [DOI] [PubMed] [Google Scholar]
- Riera M., Lambros E., Nguyen T. T., Götz A. W., Paesani F.. Low-order many-body interactions determine the local structure of liquid water. Chem. Sci. 2019;10:8211–8218. doi: 10.1039/C9SC03291F. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garrity K. F., Bennett J. W., Rabe K. M., Vanderbilt D.. Pseudopotentials for high-throughput DFT calculations. Comput. Mater. Sci. 2014;81:446–452. doi: 10.1016/j.commatsci.2013.08.053. [DOI] [Google Scholar]
- Shao X., Mi W., Xu Q., Wang Y., Ma Y.. O(N log N scaling method to evaluate the ion–electron potential of crystalline solids. J. Chem. Phys. 2016;145:184110. doi: 10.1063/1.4967319. [DOI] [PubMed] [Google Scholar]
- Shao, X. ; Mi, W. ; Pavanello, M. . eDFTpy: An Object-Oriented Platform for Density Embedding Simulations 2024. http://edftpy.rutgers.edu. (Last accessed on: September 7, 2025).
- Shao, X. ; Andreussi, O. ; Ceresoli, D. ; Truscott, M. ; Baczewski, A. ; Campbell, Q. ; Pavanello, M. . QEpy: Quantum ESPRESSO in Python; Rutgers, 2024. https://qepy.rutgers.edu. (September 7, 2025). [Google Scholar]
- Giannozzi P., Andreussi O., Brumme T., Bunau O., Nardelli M. B., Calandra M., Car R., Cavazzoni C., Ceresoli D., Cococcioni M., Colonna N., Carnimeo I., Corso A. D., de Gironcoli S., Delugas P. Jr, DiStasio R. A., Ferretti A., Floris A., Fratesi G., Fugallo G., Gebauer R., Gerstmann U., Giustino F., Gorni T., Jia J., Kawamura M., Ko H.-Y., Kokalj A., Küçükbenli E., Lazzeri M., Marsili M., Marzari N., Mauri F., Nguyen N. L., Nguyen H.-V., de-la Roza A. O., Paulatto L., Poncé S., Rocca D., Sabatini R., Santra B., Schlipf M., Seitsonen A. P., Smogunov A., Timrov I., Thonhauser T., Umari P., Vast N., Wu X., Baroni S.. Advanced capabilities for materials modelling with Quantum ESPRESSO. J. Phys.:Condens. Matter. 2017;29:465901. doi: 10.1088/1361-648X/aa8f79. [DOI] [PubMed] [Google Scholar]
- Riera M., Knight C., Bull-Vulpe E. F., Zhu X., Agnew H., Smith D. G. A., Simmonett A. C., Paesani F.. MBX: A many-body energy and force calculator for data-driven many-body simulations. J. Chem. Phys. 2023;159:054802. doi: 10.1063/5.0156036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nørby M. S., Olsen J. M. H., Kongsted J., Jensen H. J. A.. Multipole moments for embedding potentials: Exploring different atomic allocation algorithms. J. Comput. Chem. 2016;37:1887–1896. doi: 10.1002/jcc.24403. [DOI] [PubMed] [Google Scholar]
- Ferré N., Ángyán J. G.. Approximate electrostatic interaction operator for QM/MM calculations. Chem. Phys. Lett. 2002;356:331–339. doi: 10.1016/S0009-2614(02)00343-3. [DOI] [Google Scholar]
- Genova A., Ceresoli D., Krishtal A., Andreussi O., DiStasio R. A. Jr, Pavanello M.. eQE: An open-source density functional embedding theory code for the condensed phase. Int. J. Quantum Chem. 2017;117:e25401. doi: 10.1002/qua.25401. [DOI] [Google Scholar]
- Mi W., Shao X., Genova A., Ceresoli D., Pavanello M.. eQE 2.0: Subsystem DFT Beyond GGA Functionals. Comput. Phys. Commun. 2021;269:108122. doi: 10.1016/j.cpc.2021.108122. [DOI] [Google Scholar]
- Genova A., Pavanello M.. Exploiting the locality of periodic subsystem density-functional theory: efficient sampling of the Brillouin zone. J. Phys.:Condens. Matter. 2015;27:495501. doi: 10.1088/0953-8984/27/49/495501. [DOI] [PubMed] [Google Scholar]
- Perdew J. P., Burke K., Ernzerhof M.. Generalized Gradient Approximation Made Simple [Phys. Rev. Lett. 77, 3865 (1996)] Phys. Rev. Lett. 1997;78:1396. doi: 10.1103/PhysRevLett.78.1396. [DOI] [PubMed] [Google Scholar]
- Laricchia S., Fabiano E., Constantin L., Della Sala F.. Generalized gradient approximations of the noninteracting kinetic energy from the semiclassical atom theory: Rationalization of the accuracy of the frozen density embedding theory for nonbonded interactions. J. Chem. Theory Comput. 2011;7:2439–2451. doi: 10.1021/ct200382w. [DOI] [PubMed] [Google Scholar]
- Grimme S.. Supramolecular binding thermodynamics by dispersion-corrected density functional theory. Chem. - Eur. J. 2012;18:9955–9964. doi: 10.1002/chem.201200497. [DOI] [PubMed] [Google Scholar]
- Babin V., Leforestier C., Paesani F.. Development of a “first principles” water potential with flexible monomers: Dimer potential energy surface, VRT spectrum, and second virial coefficient. J. Chem. Theory Comput. 2013;9:5395–5403. doi: 10.1021/ct400863t. [DOI] [PubMed] [Google Scholar]
- Theory of Electric Polarization, 2nd ed.; Böttcher, C. , Ed.; Elsevier: Amsterdam, 1973; Vol. 1, pp 109–110. [Google Scholar]
- P S. K., Genova A., Pavanello M.. Cooperation and Environment Characterize the Low-Lying Optical Spectrum of Liquid Water. J. Phys. Chem. Lett. 2017;8:5077–5083. doi: 10.1021/acs.jpclett.7b02212. [DOI] [PubMed] [Google Scholar]
- Tschumper G. S., Leininger M. L., Hoffman B. C., Valeev E. F., Schaefer I., Henry F., Quack M.. Anchoring the water dimer potential energy surface with explicitly correlated computations and focal point analyses. J. Chem. Phys. 2002;116:690–701. doi: 10.1063/1.1408302. [DOI] [Google Scholar]
- Holt A., Boström J., Karlström G., Lindh R.. A NEMO potential that includes the dipole-quadrupole and quadrupole-quadrupole polarizability. J. Comput. Chem. 2010;31:1583–1591. doi: 10.1002/jcc.21502. [DOI] [PubMed] [Google Scholar]
- Holt A., Karlström G.. Inclusion of the quadrupole moment when describing polarization. J. Comput. Chem. 2008;29:2033–2038. doi: 10.1002/jcc.20976. [DOI] [PubMed] [Google Scholar]
- Hincapié G., Acelas N., Castaño M., David J., Restrepo A.. Structural Studies of the Water Hexamer. J. Phys. Chem. A. 2010;114:7809–7814. doi: 10.1021/jp103683m. [DOI] [PubMed] [Google Scholar]
- Rayleigh L.. The problem of the random walk. Nature. 1905;72:318. doi: 10.1038/072318a0. [DOI] [Google Scholar]
- Krishtal A., Pavanello M.. Revealing electronic open quantum systems with subsystem TDDFT. J. Chem. Phys. 2016;144:124118. doi: 10.1063/1.4944526. [DOI] [PubMed] [Google Scholar]
- Luber S.. Local electric dipole moments for periodic systems via density functional theory embedding. J. Chem. Phys. 2014;141:234110. doi: 10.1063/1.4903828. [DOI] [PubMed] [Google Scholar]
- Tang K. T., Toennies J. P.. An improved simple model for the van der Waals potential based on universal damping functions for the dispersion coefficients. J. Chem. Phys. 1984;80:3726–3741. doi: 10.1063/1.447150. [DOI] [Google Scholar]
- Tkatchenko A., Ambrosetti A., DiStasio R. A.. Interatomic methods for the dispersion energy derived from the adiabatic connection fluctuation-dissipation theorem. J. Chem. Phys. 2013;138:074106. doi: 10.1063/1.4789814. [DOI] [PubMed] [Google Scholar]
- Boereboom J. M., Potestio R., Donadio D., Bulo R. E.. Toward Hamiltonian adaptive QM/MM: accurate solvent structures using many-body potentials. J. Chem. Theory Comput. 2016;12:3441–3448. doi: 10.1021/acs.jctc.6b00205. [DOI] [PubMed] [Google Scholar]
- Chen J., Harper J. B., Ho J.. Improving the accuracy of quantum mechanics/molecular mechanics (QM/MM) models with polarized fragment charges. J. Chem. Theory Comput. 2022;18:5607–5617. doi: 10.1021/acs.jctc.2c00491. [DOI] [PubMed] [Google Scholar]
- Nattino F., Dupont C., Marzari N., Andreussi O.. Functional extrapolations to tame unbound anions in density-functional theory calculations. J. Chem. Theory Comput. 2019;15:6313–6322. doi: 10.1021/acs.jctc.9b00552. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






