Abstract
We propose to generalize the previously developed two-layer permuted adaptive-partitioning QM/MM, which reclassifies atoms as QM or MM on-the-fly in dynamics simulations, to multi-layer adaptive-partitioning algorithms that enable multiple levels of theory. In this work, we formulate two new algorithms that smoothly interpolate the energy between two QM (Q1 and Q2) levels of theory. The first “permuted adaptive-partitioning” scheme is based on the weighted many-body expansion of the potential, as in the adaptive-partitioning QM/MM. Unconventional and potentially more efficient, the second “interpolated adaptive-partitioning” method employs alchemical QM calculations with Q1/Q2-mixed basis sets, Fock matrices, and overlap matrices. To our knowledge, this is the first time that such alchemical calculations are performed in QM, although they are routinely done in MM. Test calculations on water-cluster models show that both new algorithms indeed yield smooth energy curves when water molecules shift between Q1 and Q2.
Keywords: Multiscale Modeling, Many-body Expansion, Alchemical Modeling, Potential Interpolation
Graphical Abstract

1. Introduction
Adaptive quantum-mechanics/molecular-mechanics (QM/MM)1–34 allows the boundary between the QM and MM regions to be relocated on-the-fly as needed. This is especially suitable for modeling diffusive systems such as a substrate moving into and binding at the active site of an enzyme, or an ion migrating through the membrane via a channel. Our focus is usually the given substrate or ion and its immediate surroundings. This is often achieved by introducing a buffer zone (typically of 1–2 Å thickness) between the QM and MM regions and then smoothing the energy and/or forces when molecules or functional groups travel in the buffer zone. Adaptive QM/MM enables the use of a small, mobile QM zone centered at the substrate or ion and follows it wherever it goes, with the contents of the QM zone updated on-the-fly as the trajectory propagates. This treatment removes the limits caused by a static QM/MM boundary, achieving better integration of the QM and MM descriptions. Adaptive QM/MM has been demonstrated to yield accurate structural and kinetic properties of simulated model systems when compared with conventional QM/MM treatments.17, 27, 29
To further improve computational efficiency and accuracy, we have undertaken an endeavor to systematically generalize the adaptive QM/MM schemes for two layers (QM and MM) to multi-layer algorithms that permit two or more QM layers to be used. This will allow us to treat the solute’s 1st solvation shell at a high-level QM (denoted Q1, e.g., density functional theory (DFT) with a large basis set) and 2nd solvation shell at a low-level QM (denoted Q2, e.g., DFT with a small basis set, or simply semi-empirical QM). Consequently, an atom may be reclassified on-the-fly as Q1, Q2, or MM. Doing so leads to an adaptive-partitioning multi-layer (APML) Q1/Q2/MM model (Fig. 1). Because the 2nd shell, while important, is generally less influential than the 1st shell on a solute, treating it with a more approximate method lowers the computational cost without notably compromising accuracy. Moreover, when a semi-empirical QM or density functional based tight binding (DFTB) method is used for Q2, it may be feasible to extend the Q2 layer to include atoms beyond the 2nd shell, allowing for a more accurate description of critical interactions over longer ranges when compared to the two-layer QM/MM model. The APML models will be beneficial to many applications, e.g., the study of ion channels where electron delocalization is prominent along the transport path.35, 36
Figure 1.

Although not required, here all except the MM (white) zones are centered at the solute O (purple) in APML. Buffer1 (light green) is between Q1 (brown) and Q2 (yellow), and Buffer2 (light blue) between Q2 and MM zones. Groups are purple/red in Q1, orange in Buffer1, magenta in Q2, green in Buffer2, and blue in MM. Distance from O to a Buffer1 group R1 satisfies Rmin1 ≤ R1 ≤ Rmax1, and from O to a Buffer2 group R2 satisfies Rmin2 ≤ R2 ≤ Rmax2.
A critical component in the APML method is the adaptive treatments for the Q1/Q2 layer. Here, we report the preliminary results of two milestones in our new adventure: (1) generalizing a two-layer adaptive potential for QM/MM to Q1/Q2, where the potential is expressed in the form of many-body expansion with individual terms smoothly interpolated between the Q1 and Q2 levels, and (2) proposing a more efficient approach, the interpolated adaptive-partitioning (IAP) treatment, that smoothly switches between levels of theory through alchemical QM calculations. The complete formulation and demonstration of the APML algorithms for a Q1/Q2/MM three-layer model will be reported in a future paper.
2. Method
2.1. General Considerations
The new algorithms in this work are based on the permuted-AP (PAP) QM/MM method, which was first proposed by Heyden et al.3 and further developed by Lin and coworkers.6, 9, 15, 27, 29, 32 In PAP-QM/MM, the potential V of the system is defined through a many-body expansion where the many-body terms are scaled by smoothing functions that vary smoothly between 0 and 1 when the positions of involved buffer groups move from the MM into the QM zones. The gradients of these employed smoothing functions (called transition forces) are associated with the differences in chemical potentials for the buffer groups between the QM and MM levels of theory. Various schemes3, 4, 9, 17, 27 have been successfully developed to properly treat these gradients and avoid artifacts in simulations.
It is straightforward to generalize the PAP algorithm to two QM levels of theory. For simplicity, we present the algorithms for the cases where each buffer group is a whole molecule (e.g., a water molecule). In principle, extensions are straightforward when the boundary passes through covalent bonds: For the Q1/Q2 boundary, the fragments can be capped by H atoms with scaled bond-lengths as in the integrated molecular orbital and molecular orbital (IMOMO) method by Morokuma and coworkers.37 Alternatively, one can use the hybrid orbital projection technique in the fragment molecular orbital method.38 The methods are general and can in principle be applied to both mechanical- and electrostatic-embedding schemes; here we will only discuss the simpler situation, mechanical embedding. Furthermore, the subtractive formula for QM/QM energy37 is adopted.
Without loss of generality, we assume that there are sequentially one Q1 group (the 1st), N buffer groups (the 2nd to (N+1)-th), and one Q2 group (the (N+2)-th), leading to a total of (N + 2) groups. The Q1 and Q2 characteristics of group i are Pi and (1–Pi), respectively, where Pi is set to the value of a smoothing function2 for group i that varies from 0 at Ri = Rmax to 1 at Ri = Rmin, and Ri is the distance between the group and the Q1-zone center. The Pi is equivalent to the quantum weight λi in Ref.22 and fractional QM character σi in Ref.17 For the 1st (pure-Q1) and (N + 2)-th (pure-Q2) groups, P1 = 1 and PM = 0, respectively.
2.2. PAP Q1/Q2
Following the subtractive formula, which has been used in both QM/QM37 and QM/MM,39 we define the PAP Q1/Q2 potential as
| (1) |
In this work, unless otherwise noted, the superscripts indicate the levels of theory at which the groups listed in the subscripts are treated. The last two terms in the right side indicate that, for the cohort of Q1 and buffer groups, the single-level Q2 descriptions are replaced by the mixed-Q1/Q2 descriptions . The interactions between this Q1-buffer cohort and the Q2 group as well as the interactions within the Q2 group remain at the Q2 level. The mixed-Q1/Q2 descriptions for the Q1-buffer cohort are a sum of many-body interaction terms,
| (2) |
where the many-body interactions terms are
| (3) |
| (4) |
| (5) |
In eq. 4 and eq. 5, ΔVij and ΔVijk are given by
| (6) |
| (7) |
It is obvious that, for groups i and j, their two-body interactions will be if both are Q1. If one group is Q1 and the other is buffer or if both groups are buffer, the interactions will be a mixture of and , weighted by the respective scaling functions Pi and Pj. Similarly, for three groups i, j, and k, their three-body interactions will be if all groups are Q1 and a mixture of and if any of the groups is buffer. The zero of energy in the one-body term for a given group is arbitrary and often set to the energy at its equilibrium geometry at the designated Q1 or Q2 level for convenience.
We note that our scheme differs from ONIOM-extension solvation (ONIOM-XS)2, 40, 41 in that PAP yields a smooth potential energy surface when the number of buffer groups changes, while ONIOM-XS exhibits abrupt energy changes when the number of buffer groups varies.3
As previously found, truncating the many-body expansion in PAP QM/MM increases the efficiency without significant loss of accuracy.3, 6, 9, 15, 29, 32 It is likely that this is also true for PAP Q1/Q2 calculations; as demonstrated in the fragment molecular orbitals (FMO) studies by Kitaura and colleagues42, 43 and in the electrostatically embedded many-body expansions by Dahlke et al.,44, 45 many-body potentials truncated after two-body interactions can reach accuracy of ~2 kcal/mol against reference calculations.
2.3. IAP Q1/Q2
Another way to achieve a smooth Q1-Q2 transition is IAP, which uses interpolated parameters in special alchemical QM calculations. Similar to PAP, the IAP potential is given by:
| (8) |
where the potential for the Q1-buffer cohort is computed alchemically. To our knowledge, this is the first time such alchemical QM calculations are proposed for two different QM levels of theory, although a different ansatz based on scaling electron integrals had been used by Field22 for single-level QM calculations in adaptive QM/MM. To illustrate this concept, we examine the case of combining two different basis sets. If the basis sets are {ϕQ1} in Q1 and {ϕQ2} in Q2, one assigns an interpolated basis set to group i, in a similar way to a contracted basis set:
| (10) |
We mandate that an atomic orbital centered at a given atoms inherits the smoothing function value of the group that contains the atom. For instance, if atomic orbital μ is centered at an atom belonging to group i, then Pμ = Pi. The interpolation indicated by eq. 10 may be accomplished in different ways, and here we combine the respective shells from each basis set before the SCF iterations. For instance, the 1s shell from Q1 is combined with the 1s shell from Q2, and so on. Whenever a basis function exists in Q1 but not in Q2 (assuming Q1 is the larger basis), the coefficient is simply set to zero. (A piece of pseudo code is given in Chart S1 in the Supporting Information.) This ensures that the Fock matrix has a consistent size throughout the buffer region. Apart from the above-mentioned interpolation, no change was made to the computation of integrals or the Fock matrix. (Although we only present here the IAP equations for the Hartree-Fock method, the extension to DFT is straightforward, with the exchange-correlation numerical integration computed using the interpolated basis.)
With these interpolated basis sets, one can form the alchemical Fock matrices and alchemical overlap matrices at the Q1/Q2 level through interpolation. For example, the alchemical Fock matrix is a hybrid of the Fock matrices at the and levels and a new Fock matrix constructed using the interpolated basis set:
| (11) |
| (12) |
| (13) |
Here, μ and ν denote two atomic orbitals (AOs), Pμ is the scaling factor for the function ϕμ, ℎμν and gμν are the one- and two-electron terms, respectively, and Dλσ the charge density matrix element for atomic orbitals λ and σ. Here, Dλσ is modified by weights PλPσ in and (1 − Pλ)(1 – Pσ) in , respectively. Note that and are Fock matrices constructed only with the Q1 and Q2 basis sets, respectively, whereas is the new Fock matrix that contains cross terms from the mixed basis set defined in eq. 10. This cross-term Fock matrix ensures that the energy is correct at the limits of Pi = 0 and Pi = 1.
A special case is that Q1 and Q2 use different methods but the same basis set, which should result in simply interpolated matrices. Another special case is that Q1 and Q2 use the same method and the same basis set, leading to the regular Fock matrix for the system.
The alchemical overlap matrix interpolation is straightforward
| (14) |
This IAP scheme is in principle very general but leads to larger matrices with interpolated basis sets for the buffer groups; however, unless the number of the buffer groups is very large, the increase in computational cost is expected to be modest. Moreover, when two different basis sets are employed, the Q2 basis is often smaller in practical applications, again, leading to only modest increase in computation cost for the buffer region. Some matrix elements may become near 0 when Pi is approaching 0 or 1, leading to near linear dependence in the basis sets, and care must be taken to eliminate numerical instabilities when handling these matrices.
For alchemical QM calculations, just like in conventional QM calculations, only the interaction energy is of real interest. The interaction energy of each term in the right side of eq. 8 is computed in the usual way by subtracting the energies of isolated individual groups from the total energy of at the corresponding level of theory. For example, for , one will compute the energy of the individual i-th (i = 1, 2, … N + 1) group in the gas phase at the mixed Q1/Q2 level of theory with the associated Pi value; because P1 = 1, the 1st groups is computed simply at the Q1 level. Because the size of a group is usually rather small, the additional costs are insignificant. One can also tabulate pre-computed zeros of energy for given groups at equilibrium geometries on a fine grid of Pi (e.g., with a step size of 0.01 or 0.001 in Pi) for quick access (or for interpolations that approximate the zero of energy) when needed.
Although alchemical QM calculations may sound radical, the idea of alchemical calculations are widely used in MM calculations, e.g., in the alchemical free-energy calculations, where the parameters change on-the-fly depending on an extent parameter that describes the changes.46 The idea has been extended to QM/MM free-energy calculations, examples including the quantum mechanical free energy (QM-FE) approach,47 the QM/MM free energy (QM/MM-FE) method,48 the reference-potential49 algorithm, and the quantum-mechanical thermodynamic cycle perturbation (QTCP)50 scheme. In this work, the alchemical changes in IAP are in accord with the value of the smoothing function and are taken to be at the more fundamental level of electronic structures. We note that alchemical QM has been employed in a different context (exploration of chemical compound space) by von Lilienfeld and others.51, 52 Additionally, the IAP method may be considered in similar spirit to the dual-basis QM53–56 methods with the following major difference: in the dual-basis QM methods, the small and large basis sets are successively applied to all atoms in different stages of a calculation, whereas IAP introduces a special (buffer) region where both basis sets coexist throughout the entire calculation. In general, PAP and IAP yield different energies. However, both PAP and IAP produce smooth energy changes for groups moving across the buffer zone.
3. Computation
As proof of concept, we calculate both the PAP and IAP energy curves for 6 simple models: 1 two-water cluster (Model a), 3 three-water clusters (Models b to d) and 2 four-water clusters (Models e and f), which have distinct arrangements of the water molecules, respectively (Fig. 2). In the two-water model, W1 is fixed in the Q1 zone, with its O atom (O1) serving as the center of the Q1 zone, and W2 is moving along the O1-O2 line between the Q1 and Q2 zones. The three-water models are similar to the two-water model, except that additionally W3 is fixed in the Q2 zone, ~5–6 Å away from W1. In the four-water models, W1, W2, and W4 are fixed in the Q1, buffer, and Q2 zones, respectively, while W3 travels along the O1-O3 line between Q1 and Q2.
Figure 2.

Water-cluster Models a to f for proof-of-concept calculations. Water O atom is colored in red/yellow/green if in the Q1/buffer/Q2 zone. W2 in Models a to d and W3 in Models e and f move between Q1 and Q2 crossing the buffer zone, while all other water molecules are held fixed.
Generally, the buffer-zone setup is system specific, but buffer zones of 1–2 Å thickness are commonly adopted for the compromise between accuracy and computational efficiency. We have tested various buffer zone thicknesses and found that the overall results are similar. Thus, we present in detail the data with the spherical-shell buffer zones of 2.0-Å thickness (3.0 Å ≤ R ≤ 5.0 Å) and discuss the results for other buffer zones briefly. To situate the buffer zones, the distance R is measured between the O1 and O atom (O2 or O3) of the migrating water molecule, as appropriate. To simplify data analysis and interpretation, we have selected the popular DFT functional B3LYP,57–59 and we have tested two basis set combinations: MIDI/MINI60 and 6–31G(d)61–64/STO-3G,65 with the larger basis set for Q1 and the smaller basis set for Q2. We note that the MINI and STO-3G basis sets are likely too small for real applications, but they serve well for the proof-of-concept calculations targeted here. Because the two basis set combinations essentially reveal the same picture, we will focus on the MIDI/MINI results in the main text, with the 6–31G(d)/STO-3G results given in the Supporting Information. As all water molecules have fixed internal geometries in these models, the zeros of energy in AP treatments are computed using the fixed geometry. Full single-level Q1 and Q2 energy curves are also computed for comparison. The calculations were performed using a local version of the GAMESS66–68 program, where the IAP algorithms are implemented.
4. Results
Fig. 3 shows the scan of the total intermolecular interaction energy E as one of the water molecules travels from the Q1 to the Q2 zone through the buffer. As can be seen, both PAP and IAP generate smooth potential energy surfaces as the moving water travels across the buffer zone. First, let us look at Models a to d. As expected, the IAP and PAP curves do not match each other exactly within the buffer region, but the small (up to 1 kcal/mol) differences diminish when approaching the boundaries of the buffer zones. The PAP and IAP curves superimpose with each other outside the buffer region. Moreover, both curves match the Q2 curve when W2 moves into the Q2 zone, where all interactions between the fixed-geometry water molecules are described at the Q2 level in both PAP and IAP. Notably, for Model a, both IAP and PAP curves display smooth transitions between Q1 and Q2. As to Models e and f, the IAP and PAP curves are similar but do not match each other exactly even when the migrating water molecule W3 is not in the buffer zone, because another water molecule W2 remains in the buffer all the time. (The IAP curve in Model f is incidentally close to the Q1 curve but does not exactly superimpose with it.)
Figure 3:

Potential energy curves of Models a to f computed at the Q1 (orange squares), Q2 (red circles), PAP (green “×”), and IAP (blue “+”) levels of theory, respectively, as the moving water molecule migrates between the Q1 and Q2 zones through the buffer zone (3.0 Å ≤ R ≤ 5.0 Å, highlighted in yellow). The calculations are performed at the B3LYP level with basis set combination MIDI/MINI.
In the IAP calculations, we sometimes observe quick changes in the energy curves near the buffer-Q2 boundary. These changes are small (<0.5 kcal/mol) and are prominent for Models b and d in our test calculations. Intriguingly, the changes seem to be more noticeable for larger buffer zones. In Fig. 4, for Models b and d, the 2.0-Å buffer-zone energy curves are compared with those with smaller buffer zones of 3.0 Å ≤ R ≤ 4.5 Å and 3.0 Å ≤ R ≤ 4.0 Å. Although we have not yet fully understood the cause of these changes, we suspect that they are due to the presence of both the Q1 and Q2 basis sets (through interpolation) for the water molecule in the buffer zone, which together provide a very “large” basis set for the self-consistent field (SCF) calculations. However, when the buffer group moves near the buffer-Q2 boundary, GAMESS will discard some Q1 basis functions with very small contributions during the canonical orthogonalization, effectively leading to an increasingly smaller basis set in the SCF calculations. This process continues until the buffer group reaches the buffer-Q2 boundary, when all Q1 basis functions are gone and only the Q2 basis set is available. Such changes are more noticeable in our tests when a larger buffer zone is adopted, as the smoothing functions take more smaller values near the buffer-Q2 boundary. Similarly, the Q2 basis functions are gradually removed when the buffer group is approaching the Q1-buffer boundary; but because the basis functions in the larger Q1 basis set usually dominates over those in the smaller Q2 basis set, the effects due to such changes are negligible. Note that the energy plotted here is the interaction energy, not the absolute energy, and therefore in general these changes may be seen going down or up, or undetectable, depending on the model system (e.g., they are insignificant in other test models) and the employed levels of theory (e.g., they are overall less notable when using the 6–31G(d)/STO-3G combination, as revealed in Fig. S1 in the Supporting Information). Currently, we are exploring algorithms to eliminate/minimize these quick changes, such as projections between Q1 and Q2 basis sets.
Figure 4:

Comparisons of IAP potential for Models b (left panel) and d (right panel) with different buffer sizes: 3.0 Å ≤ R ≤ 4.0 Å (blue “+”), 3.0 Å ≤ R ≤ 4.5 Å (orange squares), and 3.0 Å ≤ R ≤ 5.0 Å (red circles). The calculations are performed at the B3LYP level with basis set combination MIDI/MINI.
5. Discussion
Although the energy curves display smooth transitions, one may ask if the alchemical QM process produces an utterly unphysical electronic structure. To address this question, we verify the integrity of the electronic structure in the IAP method. Two key features are examined, the canonical molecular orbitals (MO) and the Mulliken atomic charges.69 We understand that the meanings of the MO in DFT calculations are debatable and that the Mulliken charge model can sometimes lead to unreasonable charges; nevertheless, when used with care, they can provide useful insights toward the validity of the electronic structure obtained by the IAP method against the Q1 and Q2 calculations. For example, the Mulliken charges are well-known to depend strongly on the basis set, making them sensitive tests on the alchemical mixing of basis sets in this study.
Taking Model a as an example, we show here the results computed by the buffer zone of 3.0 Å ≤ R ≤ 5.0 Å. First, we look at the orbital energies. Table 1 lists the energies of two MOs with prominent contributions from the atomic orbitals at the buffer water molecule W2 in this dimer model: One MO (MOa) describes the σ bonds between the O and H atoms, and the other MO (MOb) involves the lone pair at the O atom. It is apparent that, except for near the buffer-Q2 boundary (R ≥ 4.8 Å), these MOs are dominated by contributions of the Q1 basis functions in the interpolated basis set, as mentioned above. As W2 approaches the buffer-Q2 boundary, some Q1 basis functions are gradually removed, and the Q2 basis functions become increasingly important. Upon reaching the buffer-Q2 boundary, the Q1 basis functions are no longer available, and only Q2 basis functions contribute to the MO. The data for the 6–31G(d)/STO-3G basis set combination (Table S1 in Supporting Information) yielded the same qualitative picture, although the deviations away from Q1 towards Q2 are seen much earlier in the journey of W2 moving toward the Q2 zone.
Table 1.
Orbital energies (a.u.) of two selected molecular orbitals for Model a with buffer zone (3.0 Å ≤ R ≤ 5.0 Å) calculated at the B3LYP level with basis set combination MIDI/MINI.
| MOa |
|
MOb |
|
|||
|---|---|---|---|---|---|---|
| R (Å) | Q1 | Q2 | IAP | Q1 | Q2 | IAP |
| 3.0 | −0.525 | −0.505 | −0.525 | −0.360 | −0.322 | −0.360 |
| 3.1 | −0.524 | −0.504 | −0.524 | −0.358 | −0.321 | −0.358 |
| 3.2 | −0.523 | −0.503 | −0.523 | −0.356 | −0.320 | −0.356 |
| 3.3 | −0.522 | −0.502 | −0.521 | −0.355 | −0.319 | −0.355 |
| 3.4 | −0.521 | −0.502 | −0.520 | −0.353 | −0.318 | −0.353 |
| 3.5 | −0.519 | −0.501 | −0.519 | −0.351 | −0.317 | −0.351 |
| 3.6 | −0.518 | −0.501 | −0.518 | −0.350 | −0.316 | −0.350 |
| 3.7 | −0.517 | −0.500 | −0.517 | −0.349 | −0.316 | −0.349 |
| 3.8 | −0.516 | −0.500 | −0.516 | −0.347 | −0.315 | −0.348 |
| 3.9 | −0.515 | −0.499 | −0.515 | −0.347 | −0.315 | −0.347 |
| 4.0 | −0.514 | −0.499 | −0.514 | −0.346 | −0.314 | −0.346 |
| 4.1 | −0.514 | −0.499 | −0.514 | −0.345 | −0.314 | −0.345 |
| 4.2 | −0.513 | −0.498 | −0.513 | −0.344 | −0.314 | −0.345 |
| 4.3 | −0.513 | −0.498 | −0.513 | −0.344 | −0.313 | −0.344 |
| 4.4 | −0.513 | −0.498 | −0.512 | −0.343 | −0.313 | −0.343 |
| 4.5 | −0.512 | −0.497 | −0.512 | −0.343 | −0.313 | −0.343 |
| 4.6 | −0.512 | −0.497 | −0.512 | −0.343 | −0.313 | −0.342 |
| 4.7 | −0.511 | −0.497 | −0.511 | −0.342 | −0.312 | −0.342 |
| 4.8 | −0.511 | −0.497 | −0.508 | −0.342 | −0.312 | −0.339 |
| 4.9 | −0.511 | −0.496 | −0.496 | −0.342 | −0.311 | −0.311 |
| 5.0 | −0.511 | −0.496 | −0.496 | −0.342 | −0.311 | −0.311 |
A similar story is told in Fig. 5 by the Mulliken charges of the atoms of the mobile water calculated at the Q1, Q2, and IAP levels of theory. The IAP charges of the O and H atoms (panels a and b) gradually deviate from the Q1 charges as the mobile water move into the buffer zone, reflecting the effects due to mixing with the Q2 charge density through the matrix interpolations, but they do not simply approach the Q2 charges monotonously. It should be emphasized that, however, all changes in these charges are rather small (< 0.03 e), and that overall, the charge values seem very reasonable, at least qualitatively. Moreover, the IAP total charge of the mobile water molecule is almost the same as the Q1 total charge (difference ~0.0001 e) along the way when the water molecule travels across the buffer zone (panel c). The results are qualitatively similar for the 6–31G(d)/STO-3G basis set combination (Fig. S2 in Supporting Information), although this time the atomic charges monotonously approach Q2. We note that the difference between the 6–31G(d) and STO-3G basis sets in the atomic Mulliken charges are 0.2 e-0.4 e. In contrast, this difference is <0.01 e for the MINI and MIDI basis sets. Again, the IAP total charge of W2 agree with Q1 very well (difference ≤ 0.004 e). The results imply that, although the Q1/Q2 mixing does perturb the electronic structure of the buffer groups, the impacts are relatively small and are likely acceptable in most applications.
Figure 5:

Mulliken charges of the (a) O and (b) H atoms of the mobile water W2 calculated for Model a. The charges of the two H atoms are essentially the same. Panel (c) plots the total charge of W2. The calculations are performed at the B3LYP level with basis set combination MIDI/MINI.
Although both PAP and IAP yield smooth potential energy surfaces, these two energy surfaces do not necessarily match each other, and neither is “more correct” in theory than the other, as they are just two different ways of potential interpolation. However, potentially, the IAP can be computationally much more efficient for large systems, although this depends on the actual setup of the calculations such as the number of buffer groups. In general, the computational costs of adaptive Q1/Q2 will be dominated by the number of high-level Q1 calculations and the number of atoms in these Q1 calculations. In PAP, a set of Q1 calculations are performed for the Q1 group and various buffer groups combined, and the number of such Q1 calculations scales factorially with the number of buffer groups, unless the many-body expansion of the potential is significantly truncated. In contrast, IAP requires only one large alchemical calculation for the Q1 and all buffer groups combined, which is modestly more expensive than the pure-Q1 calculations for the same system. For example, for three water molecules, when using the MIDI/MINI basis set combination, on average one SCF cycle in the alchemical calculations took 0.7 second, as opposed to those in the Q1 calculations (0.5 second) and Q2 calculations (0.3 second). For the 6–31G(d)/STO-3G basis set combination, where the difference in basis set sizes is more prominent, the average times of SCF/cycle were 1.1 second for the alchemical, 1.1 second for the Q1, and 0.5 second for the Q2 calculations, respectively. Admittedly, to get the zero of energy, IAP also requires alchemical calculations for each buffer group, but the number of such calculations scale linearly with the number of buffer groups. Moreover, these calculations on single buffer groups are inexpensive owing to the small size of individual buffer groups. For instance, on average the SCF/cycle times were about 0.07 second (for MIDI/MINI) and 0.14 second (for 6–31G(d)/STO-3G) for an alchemical water molecule.
Of course, the number of SCF cycles may vary between calculations, affecting the actual time needed for the calculations. This can be a concern for the unconventional alchemical calculations in IAP. For the model systems tested here, the SCF convergence turned out to be reasonable, and we have not encountered issues. To give readers a feeling on the timing, we benchmarked the total “wall o’clock times” for representative single-point PAP and IAP calculations on Model e with both W2 and W3 in the buffer zone: The times for PAP were 45 seconds for the MIDI/MINI and 76 seconds for the 6–31G(d)/STO-3G basis set combinations, respectively. The counterparts in IAP reduced to 34 and 48 seconds, respectively. The data demonstrate that IAP is computationally more efficient than PAP for this model, albeit the difference is less than a factor of 2. However, for model systems with larger numbers of Q1 and buffer groups, IAP can be more competitive.
Although only variational DFT methods with different-sized basis sets are tested in the present work, the IAP approach is general and can be expanded (such work is underway) to post-Hartree-Fock algorithms, such as IAP MP270/HF.71 For example, an IAP MP2 correction following the IAP SCF convergence could be implemented through the following tentative ansatz:
| (15) |
| (16) |
The equations take the form of the canonical MP2 expression, with modified two-electron integrals where weights are applied according to the groups that the atomic orbitals belong to, and the MO energies being those obtained in the IAP SCF calculations. Such a treatment is a natural extension of the variational IAP implementation and is also in line with the ansatz by Field.22
6. Conclusions
In this contribution, we report the extension of the adaptive QM/MM algorithm to multilayer models. A key ingredient to this development is a new adaptive scheme that can produce smooth potential energy surfaces between two QM (Q1 and Q2) levels of theory. To this end, we formulate two new adaptive Q1/Q2 schemes. The first scheme, PAP Q1/Q2, is a straightforward extension of the PAP QM/MM potential based on smoothly interpolating the many-body contributions to the potential energy. The second and more radical method, IAP Q1/Q2 utilizes an unconventional alchemical QM approach, where the basis sets, Fock matrices, and overlap matrices of Q1 and Q2 are interpolated. Alchemical treatments are routinely used in MM simulations, but it is, to our knowledge, the first time that they are applied to QM calculations. Both approaches take existing partitioning schemes3, 6, 22 previously developed for QM/MM methods and adapt them to an entirely quantum system. Test calculations on water-cluster models show overall smooth transitions in the energy curves by both PAP and IAP treatments when water molecules move across the buffer zones. Although the IAP energy curves do not seem to be always perfectly smooth near the Q2-buffer boundary (Fig. 4) because GAMESS removes basis functions with very small contributions during the canonical orthogonalization, the observed changes between the adjacent data points of concern are rather small (<0.5 kcal/mol), and the smoothness may be improved in the future, e.g., by projections between the Q1 and Q2 basis sets and/or by using different smoothing techniques. The alchemical QM calculations maintain a reasonably high fidelity in the descriptions of the electronic structure of the buffer groups, which appears to largely resemble the Q1 calculations. Currently, implementations and test calculations are underway for three-layer Q1/Q2/MM treatments, which integrate the Q1/Q2 description with an additional MM layer to tackle much larger systems. In short, the two schemes presented here are simple but nevertheless offer a promising starting point for future development.
Supplementary Material
ACKNOWLEDGMENT
This work is supported by NIH (1R15GM141728-01), the Camille & Henry Dreyfus Foundation (TH-14-028), Research Corporation for Advancement (25793), and NVIDIA Corporation. This work used XSEDE under grant CHE-140070, supported by NSF grant number ACI-1548562. This work is also supported by the University of Colorado Denver.
Footnotes
Supporting Information. A PDF file is available free of charge, which contains a piece of pseudo code for basis set mixing in Chart S1, the results computed with the 6–31G(d)/STO-3G basis set combination, including (i) the plots of the energy curves for Models a to f in Fig. S1, (ii) the Mulliken charges of W2 in Model a in Fig. S2, and (iii) the orbital energies for the two selected MO in Model a in Table S1, as well as the Cartesian coordinates of Models a to f at R = 3.0 Å in Table S2.
The authors declare no competing financial interests.
REFERENCES
- 1.Kerdcharoen T; Liedl KR; Rode BM, A QM/MM simulation method applied to the solution of Li+ in liquid ammonia. Chem. Phys 1996, 211, 313–323. [Google Scholar]
- 2.Kerdcharoen T; Morokuma K, ONIOM-XS: an extension of the ONIOM method for molecular simulation in condensed phase. Chem. Phys. Lett 2002, 355, 257–262. [Google Scholar]
- 3.Heyden A; Lin H; Truhlar DG, Adaptive partitioning in combined quantum mechanical and molecular mechanical calculations of potential energy functions for multiscale simulations. J. Phys. Chem. B 2007, 111, 2231–2241. [DOI] [PubMed] [Google Scholar]
- 4.Bulo RE; Ensing B; Sikkema J; Visscher L, Toward a practical method for adaptive QM/MM simulations. J. Chem. Theory Comput 2009, 5, 2212–2221. [DOI] [PubMed] [Google Scholar]
- 5.Nielsen SO; Bulo RE; Moore PB; Ensing B, Recent progress in adaptive multiscale molecular dynamics simulations of soft matter. Phys. Chem. Chem. Phys 2010, 12, 12401–12414. [DOI] [PubMed] [Google Scholar]
- 6.Pezeshki S; Lin H, Adaptive-partitioning redistributed charge and dipole schemes for QM/MM dynamics simulations: On-the-fly relocation of boundaries that pass through covalent bonds. J. Chem. Theory Comput 2011, 7, 3625–3634. [DOI] [PubMed] [Google Scholar]
- 7.Takenaka N; Kitamura Y; Koyano Y; Nagaoka M, The number-adaptive multiscale QM/MM molecular dynamics simulation: Application to liquid water. Chem. Phys. Lett 2012, 524, 56–61. [DOI] [PubMed] [Google Scholar]
- 8.Bulo RE; Michel C; Fleurat-Lessard P; Sautet P, Multiscale modeling of chemistry in water: Are we there yet? J. Chem. Theory Comput 2013, 9, 5567–5577. [DOI] [PubMed] [Google Scholar]
- 9.Pezeshki S; Davis C; Heyden A; Lin H, Adaptive-partitioning QM/MM dynamics simulations: 3. Solvent molecules entering and leaving protein binding sites J. Chem. Theory Comput 2014, 10, 4765–4776. [DOI] [PubMed] [Google Scholar]
- 10.Pezeshki S; Lin H, Recent developments in QM/MM methods towards open-boundary multi-scale simulations. Mol. Simul 2015, 41, 168–189. [Google Scholar]
- 11.Waller MP; Kumbhar S; Yang J, A density-based adaptive quantum mechanical/molecular mechanical method. ChemPhysChem 2014, 15, 3218–3225. [DOI] [PubMed] [Google Scholar]
- 12.Watanabe HC; Kubař T; Elstner M, Size-consistent multipartitioning QM/MM: A stable and efficient adaptive QM/MM method. J. Chem. Theory Comput 2014, 10, 4242–4252. [DOI] [PubMed] [Google Scholar]
- 13.Böckmann M; Doltsinis NL; Marx D, Adaptive switching of interaction potentials in the time domain: An extended lagrangian approach tailored to transmute force field to QM/MM simulations and back. J. Chem. Theory Comput 2015, 11, 2429–2439. [DOI] [PubMed] [Google Scholar]
- 14.Jiang T; Boereboom JM; Michel C; Fleurat-Lessard P; Bulo RE Proton transfer in aqueous solution: Exploring the boundaries of adaptive QM/MM. In Quantum Modeling of Complex Molecular Systems, Rivail J-L; Ruiz-Lopez M; Assfeld X, Eds.; Springer International Publishing: Cham, 2015, pp 51–91. [Google Scholar]
- 15.Pezeshki S; Lin H, Adaptive-partitioning QM/MM for molecular dynamics simulations: 4. Proton hopping in bulk water. J. Chem. Theory Comput 2015, 11, 2398–2411. [DOI] [PubMed] [Google Scholar]
- 16.Pezeshki S; Lin H Recent developments in adaptive QM/MM. In Quantum Modeling of Complex Molecular Systems, Rivail J-L; Ruiz-Lopez M; Assfeld X, Eds.; Springer: New York, NY, 2015, pp 93–113. [Google Scholar]
- 17.Boereboom JM; Potestio R; Donadio D; Bulo RE, Toward Hamiltonian adaptive QM/MM: Accurate solvent structures using many-body potentials. J. Chem. Theory Comput 2016, 12, 3441–3448. [DOI] [PubMed] [Google Scholar]
- 18.Duster A; Garza C; Lin H Adaptive partitioning QM/MM dynamics simulations for substrate uptake, product release, and solvent exchange. In Computational Approaches for Studying Enzyme Mechanism, Voth GA, Ed.; Elsevier: Cambridge, MA, 2016, pp 342–358 [Google Scholar]
- 19.Zheng M; Waller MP, Adaptive quantum mechanics/molecular mechanics methods. Wiley Interdiscip. Rev. Comput. Mol. Sci 2016, 6, 369–385. [Google Scholar]
- 20.Dohm S; Spohr E; Korth M, Developing adaptive QM/MM computer simulations for electrochemistry. J. Comput. Chem 2017, 38, 51–58. [DOI] [PubMed] [Google Scholar]
- 21.Duster AW; Wang C-H; Garza CM; Miller DE; Lin H, Adaptive quantum/molecular mechanics: what have we learned, where are we, and where do we go from here? Wiley Interdiscip. Rev. Comput. Mol. Sci 2017, 7, e1310–n/a. [Google Scholar]
- 22.Field MJ, An algorithm for adaptive QC/MM simulations. J. Chem. Theory Comput 2017, 13, 2342–2351. [DOI] [PubMed] [Google Scholar]
- 23.Zheng M; Kuriappan JA; Waller MP, Toward more efficient density-based adaptive QM/MM methods. Int. J. Quantum Chem 2017, 117, 1–8. [Google Scholar]
- 24.Boereboom JM; Fleurat-Lessard P; Bulo RE, Explicit solvation matters: Performance of QM/MM solvation models in nucleophilic addition. J. Chem. Theory Comput 2018, 14, 1841–1852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hofer TS; Hünenberger PH, Absolute proton hydration free-energy, surface potential of water, and redox potential of the hydrogen electrode from first principles: QM/MM MD free-energy simulations of sodium and potassium hydration. J. Chem. Phys 2018, 148, 222814/1–28. [DOI] [PubMed] [Google Scholar]
- 26.Zheng M; Waller MP, Yoink: An interaction-based partitioning API. J. Comput. Chem 2018, 39, 799–806. [DOI] [PubMed] [Google Scholar]
- 27.Duster A; Wang C-H; Lin H, Adaptive QM/MM for molecular dynamics simulations: 5. On the energy-conserved permuted adaptive-partitioning schemes. Molecules 2018, 23, 2170/1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Watanabe H, Improvement of performance, stability and continuity by modified size-consistent multipartitioning quantum mechanical/molecular mechanical method. Molecules 2018, 23, 1882/1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Duster AW; Garza CM; Aydintug BO; Negussie MB; Lin H, Adaptive partitioning QM/MM for molecular dynamics simulations: 6. Proton transport through a biological channel. J. Chem. Theory Comput 2019, 15, 892–905. [DOI] [PubMed] [Google Scholar]
- 30.Watanabe HC; Cui Q, Quantitative analysis of QM/MM boundary artifacts and correction in adaptive QM/MM simulations. J. Chem. Theory Comput 2019, 15, 3917–3928. [DOI] [PubMed] [Google Scholar]
- 31.Zhang B; Altarawy D; Barnes T; Turney JM; Schaefer HF, Janus: An extensible open-source software package for adaptive QM/MM methods. J. Chem. Theory Comput 2019, 15, 4362–4373. [DOI] [PubMed] [Google Scholar]
- 32.Duster AW; Lin H, Tracking proton transfer through titratable amino acid side chains in adaptive QM/MM simulations. J. Chem. Theory Comput 2019, 15, 5794–5809. [DOI] [PubMed] [Google Scholar]
- 33.Yang Z, Speed-dependent adaptive partitioning in QM/MM MD simulations of displacement damage in solid-state systems. Phys. Chem. Chem. Phys 2021, 23, 3417–3428. [DOI] [PubMed] [Google Scholar]
- 34.Watanabe HC; Yamada M; Suzuki Y, Proton transfer in bulk water using the full adaptive QM/MM method: integration of solute- and solvent-adaptive approaches. Phys. Chem. Chem. Phys 2021, 23, 8344–8360. [DOI] [PubMed] [Google Scholar]
- 35.Fukuda M; Takeda H; Kato HE; Doki S; Ito K; Maturana AD; Ishitani R; Nureki O, Structural basis for dynamic mechanism of nitrate/nitrite antiport by NarK. Nat. Commun 2015, 6, 7097/1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zheng H; Wisedchaisri G; Gonen T, Crystal structure of a nitrate/nitrite exchanger. Nature 2013, 497, 647–651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Humbel S; Sieber S; Morokuma K, The IMOMO method: Integration of different levels of molecular orbital approximations for geometry optimization of large systems: Test for n-butane conformation and SN2 reaction: RCl + Cl−. J. Chem. Phys 1996, 105, 1959–1967. [Google Scholar]
- 38.Nakano T; Kaminuma T; Sato T; Akiyama Y; Uebayasi M; Kitaura K, Fragment molecular orbital method: application to polypeptides. Chem. Phys. Lett 2000, 318, 614–618. [Google Scholar]
- 39.Lin H; Truhlar DG, QM/MM: What have we learned, where are we, and where do we go from here ? Theor. Chem. Acc 2007, 117, 185–199. [Google Scholar]
- 40.Kabbalee P; Sripa P; Tongraar A; Kerdcharoen T, Solvation structure and dynamics of K+ in aqueous ammonia solution: Insights from an ONIOM-XS MD simulation. Chem. Phys. Lett 2015, 633, 152–157. [Google Scholar]
- 41.Kerdcharoen T; Morokuma K, Combined quantum mechanics and molecular mechanics simulation of Ca2+/ammonia solution based on the ONIOM-XS method: Octahedral coordination and implication to biology. J. Chem. Phys 2003, 118, 8856–8862. [Google Scholar]
- 42.Fedorov DG; Kitaura K, Extending the power of quantum chemistry to large systems with the fragment molecular orbital method. J. Phys. Chem. A 2007, 111, 6904–6914. [DOI] [PubMed] [Google Scholar]
- 43.Fedorov DG; Ishida T; Kitaura K, Multilayer formulation of the fragment molecular orbital method (FMO). J. Phys. Chem. A 2005, 109, 2638–2646. [DOI] [PubMed] [Google Scholar]
- 44.Dahlke EE; Truhlar DG, Electrostatically Embedded Many-Body Expansion for Large Systems, with Applications to Water Clusters. J. Chem. Theory Comput 2007, 3, 46–53. [DOI] [PubMed] [Google Scholar]
- 45.Speetzen ED; Leverentz HR; Lin H; Donald GT Electrostatically embedded many-body expansion for large systems. In Accurate condensed-phase quantum chemistry, Manby FR, Ed.; CRC Press, Taylor & Francis Boca Raton, FL, 2011, pp 105–127. [Google Scholar]
- 46.Zwanzig RW, High-temperature equation of state by a perturbation method. I. Nonpolar gases. J. Chem. Phys 1954, 22, 1420–1426. [Google Scholar]
- 47.Chandrasekhar J; Jorgensen WL, Energy profile for a nonconcerted SN2 reaction in solution. J. Am. Chem. Soc 1985, 107, 2974–2975. [Google Scholar]
- 48.Zhang Y; Liu H; Yang W, Free energy calculation on enzyme reactions with an efficient iterative procedure to determine minimum energy paths on a combined ab initio QM/MM potential energy surface. J. Chem. Phys 2000, 112, 3483–3492. [Google Scholar]
- 49.Muller RP; Warshel A, Ab initio calculations of free energy barriers for chemical reactions in solution. J. Phys. Chem 1995, 99, 17516–17524. [PubMed] [Google Scholar]
- 50.Rod TH; Ryde U, Accurate QM/MM free energy calculations of enzyme reactions: Methylation by Catechol O-Methyltransferase. J. Chem. Theory Comput 2005, 1, 1240–1251. [DOI] [PubMed] [Google Scholar]
- 51.Anatole von Lilienfeld O, Accurate ab initio energy gradients in chemical compound space. J. Chem. Phys 2009, 131, 164102. [DOI] [PubMed] [Google Scholar]
- 52.von Rudorff GF; von Lilienfeld OA, Alchemical perturbation density functional theory. Phys. Rev. Res 2020, 2, 023220. [DOI] [PubMed] [Google Scholar]
- 53.Wolinski K; Pulay P, Second-order Møller–Plesset calculations with dual basis sets. J. Chem. Phys 2003, 118, 9497–9503. [Google Scholar]
- 54.Liang W; Head-Gordon M, Approaching the basis set limit in density functional theory calculations using dual basis sets without diagonalization. J. Phys. Chem. A 2004, 108, 3206–3210. [Google Scholar]
- 55.Steele RP; DiStasio RA; Shao Y; Kong J; Head-Gordon M, Dual-basis second-order Møller-Plesset perturbation theory: A reduced-cost reference for correlation calculations. J. Chem. Phys 2006, 125, 074108. [DOI] [PubMed] [Google Scholar]
- 56.Steele RP; Shao Y; DiStasio RA; Head-Gordon M, Dual-basis analytic gradients. 1. Self-consistent field theory. J. Phys. Chem. A 2006, 110, 13915–13922. [DOI] [PubMed] [Google Scholar]
- 57.Becke AD, Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 1988, 38, 3098–3100. [DOI] [PubMed] [Google Scholar]
- 58.Becke AD, Density-functional thermochemistry. III. The role of exact exchange. J. Chem. Phys 1993, 98, 5648–5652. [Google Scholar]
- 59.Lee C; Yang W; Parr RG, Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B: Condens. Matter 1988, 37, 785–789. [DOI] [PubMed] [Google Scholar]
- 60.Huzinaga S; Andzelm J; Radzio-Andzelm E; Sakai Y; Tatewaki H; Klobukowski M, Gaussian Basis Sets for Molecular Calculations 1st ed.; Elsevier: Amsterdam, 1984; Vol. 16. [Google Scholar]
- 61.Ditchfield R; Hehre WJ; Pople JA, Self-consistent molecular-orbital methods. IX. An extended Gaussian-type basis for molecular-orbital studies of organic molecules. J. Chem. Phys 1971, 54, 724–728. [Google Scholar]
- 62.Hehre WJ; Ditchfield R; Pople JA, Self-consistent molecular orbital methods. XII. Further extensions of Gaussian-type basis sets for use in molecular orbital studies of organic molecules. J. Chem. Phys 1972, 56, 2257–2261. [Google Scholar]
- 63.Francl MM; Pietro WJ; Hehre WJ; Binkley JS; DeFrees DJ; Pople JA; Gordon MS, Self-consistent molecular orbital methods. XXIII. A polarization-type basis set for second-row elements. J. Chem. Phys 1982, 77, 3654–3665. [Google Scholar]
- 64.Frisch MJ; Pople JA; Binkley JS, Quadratic configuration interaction. A general technique for determining electron correlation energies. J. Chem. Phys 1984, 80, 3265–3269. [Google Scholar]
- 65.Hehre WJ; Stewart RF; Pople JA, Self-consistent molecular-orbital methods. I. Use of Gaussian expansions of Slater-type atomic orbitals. J. Chem. Phys 1969, 51, 2657–2664. [Google Scholar]
- 66.Schmidt MW; Baldridge KK; Boatz JA; Elbert ST; Gordon MS; Jensen JH; Koseki S; Matsunaga N; Nguyen KA; Su S, et al. , General atomic and molecular electronic structure system. J. Comput. Chem 1993, 14, 1347–1363. [Google Scholar]
- 67.Gordon MS; Schmidt MW Chapter 41 - Advances in electronic structure theory: GAMESS a decade later. In Theory and Applications of Computational Chemistry, Dykstra CE; Frenking G; Kim KS; Scuseria GE, Eds.; Elsevier: Amsterdam, 2005, pp 1167–1189. [Google Scholar]
- 68.Barca GMJ; Bertoni C; Carrington L; Datta D; Silva ND; Deustua JE; Fedorov DG; Gour JR; Gunina AO; Guidez E, et al. , Recent developments in the general atomic and molecular electronic structure system. J. Chem. Phys 2020, 152, 154102. [DOI] [PubMed] [Google Scholar]
- 69.Mulliken RS, Electronic population analysis on LCAO-MO molecular wave functions. I. J. Chem. Phys 1955, 23, 1833–1840. [Google Scholar]
- 70.Møller CMS; Plesset MS, Note on an approximation treatment for many-electron systems. Phys. Rev 1934, 46, 618–622. [Google Scholar]
- 71.Roothaan CCJ, New developments in molecular orbital theory. Rev. Mod. Phys 1951, 23, 69–89. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
