Abstract
Calculating free-energy differences using molecular dynamics (MD) simulations is an important task in computational chemistry. In practice, the accuracy of the results is limited by model approximations and insufficient phase-space sampling due to limited computational resources. In the present work, we address these challenges by integrating the quantum-mechanical/molecular-mechanical (QM/MM) scheme with replica-exchange enveloping distribution sampling (RE-EDS) to obtain a multistate and multiscale free-energy method with high computational efficiency. The performance of QM/MM RE-EDS is showcased by calculating hydration free energies for three data sets using semiempirical methods for the QM zone. We highlight the importance of the choice of QM Hamiltonian and the effect of the compatibility between the QM and MM models. Especially the choice of semiempirical method has a substantial effect on the accuracy compared to experiment, but also the choice of MM water model is non-negligible. Our findings indicate that RE-EDS is an efficient approach for calculating free-energy differences with a QM/MM scheme, and lays the foundation for future developments and applications.


Introduction
Accurate and reliable estimation of free-energy differences is one of the central objectives in computational chemistry with direct applications in various fields including drug discovery, physics, and materials science. For instance, a drug candidate should possess favorable physicochemical and pharmacokinetic properties as well as high potency and selectivity for its target. − For the prediction of the latter properties, free-energy methods based on molecular dynamics (MD) simulations have been established as the state-of-the-art tool. − Despite much progress in recent decades, some long-standing challenges remain unsolved. While MD simulations represent arguably the most rigorous way to estimate, for example, solvation and binding free energies, all-atom MD is associated with significant computational cost. Furthermore, the widely used classical fixed-charge force fields have known deficiencies. − Therefore, a balance needs to be struck between the severity of approximations made in the model and the extent of phase-space sampling. , Much ongoing research is focused on maximizing the achieved accuracy given the available computational resources by leveraging this trade-off.
While classical force fields have proven accurate enough in many applications (for examples see refs – ), there are cases where such approaches cannot reach sufficient accuracy. For instance, the fixed point-charge approximation employed in classical force fields does not hold for highly polarizable systems and/or environments. Quantum-mechanical (QM) methods would resolve this issue but even semiempirical approaches are typically too expensive to treat the full system for the simulation lengths needed in free-energy calculations (i.e., several nanoseconds). To reduce the computational costs while retaining the higher level of theory for the region of interest, a multiscale approach like QM/MM , can be employed. Thereby, the studied system is divided into a QM zone, for which a QM description is used, while the surrounding environment is treated classically (MM zone). , Multiscale approaches aim to combine the best of both worlds, see refs – for successful QM/MM applications.
However, even QM/MM when using density functional theory (DFT) (or more accurate methods) is still (too) expensive for free-energy calculations. Thus, the common approach for QM/MM free-energy calculations, which goes back to Gao and Warshel, is to use a thermodynamic cycle with sampling at a lower level of theory (classical or using a semiempirical method) and employing free-energy perturbation (FEP) to estimate the higher-level QM/MM free-energy differences. ,, The accuracy of this dual-resolution approach depends crucially on the extent of similarity between the potential-energy surfaces at the low and high levels of theory. If the surfaces are significantly different, the reweighting procedure fails to deliver converged values as the relevant regions of phase space for the high-level Hamiltonian are not sufficiently sampled. In the past decades, the main effort in QM/MM free-energy method development has been focused on this issue of insufficient overlap, and different strategies have been proposed to address it. ,,− Another alternative is to substitute the expensive QM calculations with a machine-learned interatomic potential (MLIP). −
Independent of the strategy used to improve the overlap in the dual-resolution framework, the free-energy methods commonly used are pairwise, i.e., a separate calculation for each pair of ligands needs to be carried out. A more efficient approach would be to use a multistate free-energy method, where multiple ligands are treated in a single simulation.
In this study, we combine the QM/MM scheme with the multistate method replica-exchange enveloping distribution sampling (RE-EDS) − to perform free-energy calculations. The computational efficiency of (RE-)EDS in a classical setting comes largely from the fact that the interactions between unperturbed particles in the system (i.e., the environment) have to be calculated only onceindependent of the number of ligands (if a pairwise decomposable method like reaction field is used for the long-range electrostatic interactions). This reduction is less relevant in a QM/MM setting as the costs of the QM calculations dominate, but it becomes more important again if the expensive QM Hamiltonian is replaced by a cheaper MLIP − in future applications. Furthermore, as with all multistate methods, there is no need for designing a perturbation map with a subset of the possible pairwise transformations. Compared to classical RE-EDS, the introduction of QM/MM allows for more accurate modeling of systems where polarization is important or which are not parametrized well by force fields. Related methods merging QM/MM schemes with enhanced sampling exist, , however, this work is the first time RE-EDS is utilized with a multiscale approach and a framework for inclusion of any Hamiltonian for the perturbed part of the system is presented.
The developed QM/MM RE-EDS methodology is tested by calculating hydration free-energies for three different sets of molecules, for which experimental data is available, and the results are compared to those obtained using classical force fields. In addition, we investigate the effect of the choice of the semiempirical method and the MM water model on the agreement with experiment. Note that we employ semiempirical methods for the QM zone for speed reasons, although this limits the possible gain in accuracy over classical force fields. Nevertheless, the developed method is QM model agnostic and more accurate alternatives can be used. The results could of course be further processed in a dual-resolution framework, although this is not the focus here. Rather, we aim to describe the main theoretical and implementation aspects of QM/MM RE-EDS and demonstrate its practical applications.
The present work is structured as follows: in the Theory section a short introduction to QM/MM, and RE-EDS is given, followed by a detailed description of how the two methods were integrated into QM/MM RE-EDS. The developed methodology has been tested on three different systems, which are described in the Methods section, and the performance is evaluated in Results and Discussion. Finally, main findings and prospective work are summarized in the Conclusions.
Theory
QM/MM
Partitioning a system into a QM zone and an MM zone introduces a boundary between the zones. Here, we focus on the most relevant aspects and refer interested readers to excellent introductory texts for detailed descriptions. , The potential energy of the entire system is typically constructed through the so-called additive scheme
| 1 |
where r QM are the coordinates of the QM particles and r MM the coordinates of the MM particles. Within the QM/MM formalism, the interactions between the zones (V QM–MM) can be treated at varying levels of complexity. , Electrostatic embedding is employed in this work as it provides a superior trade-off between accuracy and computational cost in comparison to the simpler mechanical embedding and the more complex polarizable embedding schemes. In electrostatic embedding, MM particles within a predefined cutoff radius R QM–MM enter the QM Hamiltonian as one-electron operators and can thus polarize the QM zone. van der Waals interactions between the QM and MM particles are calculated classically.
Replica-Exchange Enveloping Distribution Sampling (RE-EDS)
Most widely used alchemical free-energy methods such as thermodynamic integration (TI) and free-energy perturbation (FEP) are pathway dependent. For a fully connected graph consisting of N end-states, N(N − 1)/2 separate simulations would need to be carried out in theory. Such an approach would, however, be redundant and there are estimates that a perturbation map with scaling is sufficient. Even so, multistate methods offer a better computational efficiency as the interactions between unperturbed particles need to be calculated only once, independent of N (if the potential energy can be decomposed pairwise, e.g., using a reaction-field approach for the long-range electrostatic interactions).
Enveloping distribution sampling (EDS) − is a multistate method, which uses a reference state, V R , that encompasses all end-states of interest and requires no definition of pathways between end-states. The reference state V R is constructed in the following way
| 2 |
where with k B being the Boltzmann constant and T the absolute temperature, and V i (r) is the potential energy of end-state i. The smoothness parameter s and the energy offsets E R are used to tune the reference state and achieve adequate sampling of all end-states. A smoothness parameter s = 1.0 results in a reference state that represents the “physically correct” description of the system. Reducing s (i.e., s < 1.0) smooths the energy barriers between end-states, thus facilitating transitions between them. At very small s-values, the potential-energy landscape is significantly disturbed and unphysical configurations are sampled (termed “undersampling”). Energy offsets E R are used to give equal weights to all end-states in the reference state.
In EDS, the force acting on a particle k is calculated as follows:
| 3 |
The prefactor w i determines the contributions of each end-state to the force acting on a particle. The larger the s-value, the more dominant is the end-state with the most favorable potential energy compared to all other end-states. Conversely, at very small s-values, all end-states contribute significantly to the forces.
From the simulation of the EDS reference state, all N(N − 1)/2 free-energy differences can be obtained through reweighting − (i.e., using the Zwanzig equation). For two end-states A and B, ΔG BA is calculated as
| 4 |
Obtaining an optimal set of EDS reference-state parameters for more than two end-states proved to be a difficult task due to their mutual dependence. This issue can be partially circumvented by combining EDS with the replica-exchange technique to form replica-exchange enveloping distribution sampling (RE-EDS). − ,− RE-EDS is a form of Hamiltonian replica exchange, wherein replicas differ in the smoothness parameter s. Transitions between end-states are easier in replicas with small s-values, while physically relevant configurations are sampled in the top replica with s = 1.0. Exchanges between neighboring replicas k and l are accepted or rejected based on the Metropolis–Hastings criterion,
| 5 |
An automatic pipeline has been developed and refined to estimate the energy offsets and optimize the distribution of the replicas in s-space. , The RE-EDS pipeline consists of three phases, namely parameter exploration, parameter optimization, and production. During parameter exploration, a relevant configuration is generated for each end-state, and a lower bound for the s-value as well as initial energy offsets E R are determined. Subsequently, the s-value distribution and E R are optimized iteratively to achieve adequate sampling of all end-states at s = 1.0. Finally, a production run is performed to obtain relative free-energy differences between all end-states from a single simulation. For more details see refs and .
Integrating QM/MM with RE-EDS
Definition of the EDS Reference State
Classical RE-EDS simulations can be conducted with either a hybrid or a dual-topology approach. , For the combination with QM/MM, we follow the dual-topology approach with each end-state consisting of an entire molecule as this allows us to design the EDS reference state independent of the QM Hamiltonian. Let us expand eq as
| 6 |
where V QM(r QM, r MM) is the potential energy of the QM zone and V MM(r MM) is the potential energy of the MM zone. V QM–MM,EE(r QM, r MM) is the coupling between the QM and MM zones using electrostatic embedding (EE), which is defined as
| 7 |
where ρ(r) is the electron density in the QM subsystem, n ∈ QM and runs over all QM-treated nuclei, and j ∈ MM and runs over all point charges within the cutoff R QM–MM. Z n is the nuclear charge of the QM atom and Z j is the point charge of the MM particle j. V QM–MM,vdW(r QM, r MM) is the van der Waals (vdW) interaction between the two zones, which is calculated classically with a Lennard-Jones (LJ) potential-energy function,
| 8 |
where i ∈ QM and j ∈ MM. C 12 and C 6 are parametrized according to the force field of choice. In principle, point charges and LJ parameters should be reparameterized at each step of a QM/MM simulation, but it is rarely done in practice due to the associated computational cost and instead default force-field parameters are commonly used. In most QM/MM set-ups, there is a mismatch between the list of point charges included in the electrostatically embedded QM calculation (i.e., union of MM particles in the cutoff spheres of all QM particles) and the list of MM particles interacting with a given QM particle in classical LJ interactions (i.e., spherical cutoff around a QM single particle). If the LJ cutoff is large enough, the neglected interactions should be negligible. Note that V QM(r QM) and V QM–MM,EE(r QM, r MM) cannot be separated in practice for technical reasons. For an EDS end-state i, we will therefore use their combination, V i (r), in the following.
When constructing the EDS reference state (eq ), a decision needs to be made on which interactions to perturb. While it may seem straightforward to define an end-state as V i (r) = V i (r) + V i (r), a technical subtlety in EDS prevents that. Namely, only the lowest-energy end-state for a configuration contributes significantly to the forces (eq ) at high s-values. Therefore, intramolecular potential-energy terms should not be perturbed, because otherwise the particles of the other end-states may drift apart and through each other, preventing further transitions between end-states. To circumvent this problem, we introduce an additional vacuum QM calculation to evaluate the intramolecular potential in the absence of the MM environment.
We construct the end-state potential-energy function as
| 9 |
where V i (r) is the QM-calculated potential energy of the end-state embedded in MM point charges, V i (r) is the QM-calculated potential energy of the end-state in vacuum, and V i (r) is the classical van der Waals interaction of the end-state with the MM zone.
Definition of the Forces
The QM/MM RE-EDS system is propagated in time as follows:
-
1.
Forces on the particles in the MM zone are calculated and applied according to the force field
| 10 |
-
2.
QM-calculated forces in vacuum are applied to the QM zone for all end-states
| 11 |
-
3.
Contribution to the forces of each EDS end-state is calculated according to eq using the reference-state potential-energy function V R (r; s, E R ) (eq ) constructed from end-states V i (r) according to eq . Forces are then added to any particle included in the QM-MM coupling
| 12 |
Methods
Test Systems
Three different test systems with experimentally known hydration free energies (taken from the FreeSolv database) were selected to assess the performance of the developed methodology and compare to classical results (Figure ). Set A consists of benzene derivatives, which have already been studied using classical RE-EDS. We do not expect any sampling issues for this set. Sets B and C were selected to study the difference between force fields and semiempirical methods for sulfides, alcohols, and ethers. In addition, molecules in these two data sets are more flexible than in set A.
1.

Data sets used to validate the developed QM/MM RE-EDS methodology. (A) Set A with benzene derivatives A1–A6. (B) Set B with sulfides B1–B6. (C) Set C with molecules C1–C8 containing hydroxy and ether groups.
Simulation Details
All simulations were performed with a modified version of the GROMOS simulation package built upon an earlier GROMOS QM/MM implementation and the open-source Python3 reeds module. QM/MM simulations were interfaced to xtb 6.5.1 using GFN1-xTB and GFN2-xTB as well as to DFTB+ 24.1, using DFTB3 with the hydrogen and the DFTD4 (using the D4 model , ) dispersion corrections. Unless mentioned otherwise, default parameters were used. The water models TIP3P, SPC/E, and OPC3 were used. A cutoff R QM–MM = 1.2 nm was employed for the electrostatic embedding scheme. For the nonbonded interactions, a single 1.2 nm cutoff radius was used. Long-range nonbonded interactions were calculated using the reaction-field correction with ϵRF = 1 for vacuum simulations and ϵRF = 78.5 for simulations in water. In the water simulations, a reference temperature of 298.15 K and pressure of 0.06102 kJ mol–1 nm–3 were maintained using the Berendsen thermostat and barostat. The stochastic dynamics integrator was used for the vacuum simulations. The SHAKE algorithm with a relative tolerance of 10–4 was used to constrain all bonds in the MM zone. The pairlist was updated every five steps. A time step of 0.5 and 2 fs was used for the QM/MM and MM simulations, respectively.
Set A
Generalized AMBER force field (GAFF) 1.7 topologies and coordinates were taken from ref . To ensure that the molecules remain well-aligned during the simulations, distance restraints with a force constant of 5000 kJ mol–2 nm–2 were applied on four atoms per molecule. For each end-state, a relevant end-state configuration was obtained by running a 1 ns long EDS simulations where E R was biased toward the given end-state by setting E i = 500 kJ mol–1 for end-state i and E j = −500 kJ mol–1 for all other end-states j. These optimized configurations were used in a starting-state mixing approach as described by Ries et al. The lower bound for s was determined using 21 parallel 0.2 ns EDS simulations with logarithmically distributed s-values between 1 and 10–5 and E R set to zero for all end-states. Next, initial energy offsets were estimated from a 0.5 ns RE-EDS simulation, followed by s-distribution optimization step to achieve round trips between the replicas and a subsequent energy-offset rebalancing step. With the optimal set of parameters, a 5 ns production run was performed (ten repeats with different random number seeds for the initial velocities).
The described procedure was carried out for classical RE-EDS simulations in water and in vacuum as well as for the newly developed QM/MM RE-EDS in water.
Sets B and C
Molecules were parametrized with the OpenFF 2.0.0 force field. Topologies in the AMBER format were generated using the OpenFF Toolkit and subsequently converted to the GROMOS topology format using the GROMOS++ program amber2gromos. GROMOS++ programs pdb2g96 and sim_box were used to generate solvated boxes, and the programs red_top and prep_eds to prepare single molecule and perturbed topologies. To ensure that the molecules remain aligned during the simulation, distance restraints with a force constant of 5000 kJ mol–2 nm–2 were applied on two atoms per molecule (no restriction of any dihedral angle sampling). The same RE-EDS procedure as described for set A was carried out for these sets with only a small modification: separate s-optimization and energy-offset rebalancing steps were substituted by a single mixed optimization step as this was shown to be more effective. Ten iterations of 0.5 ns RE-EDS simulations were performed in which the s-value distribution and E R were optimized according to the N-GRTO algorithm. With the optimal set of parameters, a 5 ns production run was performed (ten repeats with different random number seeds for the initial velocities).
The described procedure was carried out for classical RE-EDS simulations in water and in vacuum as well as for the newly developed QM/MM RE-EDS in water.
Analysis
All simulations were analyzed using GROMOS++ programs and PyGromosTools. For further analysis and visualization, the following Python packages were used: Matplotlib 3.7.1, mpmath 1.3.0, NumPy 1.24.3, Pandas 2.0.1, SciPy 1.10.1, and pymbar 4.0.3.
Relative hydration free-energies between two end-states i and j were obtained based on a classical RE-EDS simulation in water and one in vacuum using the thermodynamic cycle
| 13 |
Note that in QM/MM RE-EDS, we define the end-states to be exclusively characterized by their nonbonded interactions with the environment (see Theory section). Therefore, the relative free energy of two end-states in vacuum is by definition zero, i.e., ΔG ji = 0. ΔΔG ji can thus be obtained from a single QM/MM RE-EDS simulation in water
| 14 |
Absolute hydration free energies (ΔG hyd) were retrieved in the following way
| 15 |
where i is the molecule of interest, j runs over all molecules, R is the reference state, and N is the number of molecules in a set.
Performance of the different methods was quantified using mean absolute error (MAE) and Kendall’s τ coefficient. Additional metrics, namely root-mean-square error (RMSE), Spearman’s ρ coefficient, the coefficient of determination (R 2) and Pearson correlation coefficient are given in the Supporting Information.
Section S5 in the Supporting Information provides RE-EDS-related plots, tracking the parameter optimization procedure, trajectory paths, sampling distribution and convergence of production runs (Figures S7–S16).
Results and Discussion
We tested the developed QM/MM RE-EDS methodology by calculating hydration free energies of three data sets with small molecules (Figure ). First, the QM/MM RE-EDS results are compared to computed results using classical RE-EDS and multistate Bennett acceptance ratio estimator (MBAR) , as well as to experimental values (both taken from the FreeSolv database). Subsequently, we showcase the effect of the choice of semiempirical method and water model on the QM/MM RE-EDS results.
QM/MM RE-EDS Validation
For set A, we compared QM/MM RE-EDS (semiempirical method GFN2-xTB and the TIP3P water model) with the classical (MM) approaches RE-EDS and MBAR (both GAFF 1.7) against the experimental values (Figure B and Table ). As shown previously, RE-EDS gives equivalent results to other methods like MBAR when the same force field is used. The deviation from experiment is shown in Figure A. While all three approaches have MAE values below chemical accuracy and the two classical methods perform equivalently, QM/MM with GFN2-xTB yields a higher MAE than the classical force field. However, it provides a better ranking for this set of compounds.
2.
Results on set A for QM/MM RE-EDS (GFN2-xTB and TIP3P), classical (MM) RE-EDS and MBAR (both GAFF 1.7). (A) Comparison against experimental values taken from the FreeSolv database. (B) Correlation with experiment. Error bars represent the standard deviation over ten repeats and experimental uncertainty, respectively. Shaded gray area depicts the range that falls within ±4.184 kJ/mol (±1 kcal/mol) from perfect correlation and the dark gray area corresponds to an error margin of ±8.368 kJ/mol (±2 kcal/mol). Hydration free energies as a function of the molecule identifier are shown in Figure S1 in the Supporting Information.
1. Mean Absolute Error (MAE) and Kendall’s τ for All Sets .
| set | method | MAE [kJ/mol] | Kendall’s τ |
|---|---|---|---|
| A | MM MBAR (GAFF) | 1.4 | 0.87 |
| MM RE-EDS (GAFF) | 1.4 | 0.87 | |
| QM/MM RE-EDS (GFN2-xTB) | 2.2 | 1.00 | |
| B | MM MBAR (GAFF) | 1.4 | –0.60 |
| MM RE-EDS (OpenFF) | 1.4 | –0.60 | |
| QM/MM RE-EDS (GFN2-xTB) | 2.9 | –0.33 | |
| C | MM MBAR (GAFF) | 2.4 | 0.86 |
| MM RE-EDS (OpenFF) | 2.3 | 0.79 | |
| QM/MM RE-EDS (GFN2-xTB) | 5.5 | 0.29 |
Additional metrics are provided in Table S1 in the Supporting Information.
The sulfides in set B have very similar hydration free energies between −5 and −7 kJ/mol, thus challenging the accuracy limit of computational methods. As can be seen in Figure , the classical force fields GAFF and OpenFF give similar results and reproduce the experimental values well (within ±2 kJ/mol), except for compound B1 with a deviation of 3.2 kJ/mol. The error metrics are listed in Table . Due to the nearly identical hydration free energies of the compounds, the ranking (Kendall’s τ) does not contain much information. As for set A, all MAE values are below chemical accuracy and the QM/MM results with GFN2-xTB are worse than the classical force fields. There even appears to be systematic deviation as a function of the molecule size.
3.
Results on set B for QM/MM RE-EDS (GFN2-xTB and TIP3P), classical (MM) RE-EDS (OpenFF 2.0.0) and MBAR (GAFF 1.7). (A) Comparison against experimental values taken from the FreeSolv database. (B) Correlation with experiment. Error bars represent the standard deviation over ten repeats and experimental uncertainty, respectively. Shaded gray area depicts the range that falls within ±4.184 kJ/mol (±1 kcal/mol) from perfect correlation and the dark gray area corresponds to an error margin of ±8.368 kJ/mol (±2 kcal/mol). Hydration free energies as a function of the molecule identifier are shown in Figure S2 in the Supporting Information.
Similar to set B, the compounds in set C have a higher flexibility compared to set A, but in this case the hydration free energies span a much larger range of 50 kJ/mol. Figure shows more variation between the force fields, although both give an MAE value below chemical accuracy and both again outperform QM/MM with GFN2-xTB (Table ). In the case of set C, also the ranking as measured with Kendall’s τ is better with the classical force fields.
4.
Results on set C for QM/MM RE-EDS (GFN2-xTB and TIP3P), classical (MM) RE-EDS (OpenFF 2.0.0) and MBAR (GAFF 1.7). (A) Comparison against experimental values taken from the FreeSolv database. (B) Correlation with experiment. Error bars represent the standard deviation over ten repeats and experimental uncertainty, respectively. Shaded gray area depicts the range that falls within ±4.184 kJ/mol (±1 kcal/mol) from perfect correlation and the dark gray area corresponds to an error margin of ±8.368 kJ/mol (±2 kcal/mol). Hydration free energies as a function of the molecule identifier are shown in Figure S3 in the Supporting Information.
While the results show that RE-EDS can be used in a QM/MM setup to provide converged free-energy differences, we do not observe the expected improvement in accuracy when going from a fully classical description to QM/MMin contrary. This may be due to the choice of QM Hamiltonian and the compatibility between the QM and MM models (see below). However, it is also worth pointing out that classical force fields are parametrized and/or validated using hydration free energies (as evidenced by the small deviations from experiment for the classical force fields).
Choice of QM Hamiltonian
The choice of QM Hamiltonian can have a significant effect on the accuracy of results. To demonstrate this, we investigated the effect of the choice of QM Hamiltonian by performing QM/MM RE-EDS simulations with different semiempirical methods. Figure shows the results for GFN1-xTB and GFN2-xTB for sets A and B. For set C, three methods GFN1-xTB, GFN2-xTB, and DFTB3 were compared and the results are provided in Figure . Table lists the corresponding MAE and Kendall’s τ values.
5.
Comparison of QM/MM RE-EDS results with GFN1-xTB (yellow) and GFN2-xTB (blue) in TIP3P for sets A and B. (A) Deviation from experiment for set A. (B) Correlation to experiment for set A. (C) Deviation from experiment for set B. (D) Main differences between GFN1-xTB and GFN2-xTB methods. Error bars represent the standard deviation over repeats and experimental uncertainty for calculated values and experiment, respectively. Shaded light gray area depicts the range that falls within ±4.184 kJ/mol (±1 kcal/mol) from perfect correlation and the dark gray area corresponds to an error margin of ±8.368 kJ/mol (±2 kcal/mol). Hydration free energies as a function of the molecule identifier are shown in Figure S4 in the Supporting Information.
6.
Comparison of QM/MM RE-EDS results with GFN1-xTB (yellow), GFN2-xTB (blue), and DFTB3 (purple) in TIP3P for set C. (A) Deviation from experiment for set C. (B) Correlation to experiment for set C. Error bars represent the standard deviation over repeats and experimental uncertainty for calculated values and experiment, respectively. Shaded light gray area depicts the range that falls within ±4.184 kJ/mol (±1 kcal/mol) from perfect correlation and the dark gray area corresponds to an error margin of ±8.368 kJ/mol (±2 kcal/mol). Hydration free energies as a function of the molecule identifier are shown in Figure S5 in the Supporting Information.
2. Comparison of MAE and Kendall’s τ for All Sets When Varying the QM Hamiltonian in the QM/MM RE-EDS Simulations .
| set | QM Hamiltonian | MAE [kJ/mol] | Kendall’s τ |
|---|---|---|---|
| A | GFN1-xTB | 3.2 | 0.87 |
| GFN2-xTB | 2.2 | 1.00 | |
| B | GFN1-xTB | 3.0 | 0.20 |
| GFN2-xTB | 2.9 | 0.33 | |
| C | GFN1-xTB | 3.0 | 0.93 |
| GFN2-xTB | 5.5 | 0.29 | |
| DFTB3 | 1.9 | 0.93 |
Additional metrics are provided in Table S2 in the Supporting Information.
From the results, it is evident that the choice of QM Hamiltonian (even when staying within the semiempirical methods) significantly affects the accuracy. In addition, the effect appears to be system dependent. For sets A and B, GFN1-xTB gives slightly worse results than GFN2-xTB, while for set C it clearly outperforms GFN2-xTB. In case of set B, the molecule size dependent deviation from experiment is opposite between GFN1-xTB and GFN2-xTB, suggesting a parametrization-related source of error. For set C, DFTB3 shows best agreement with experiment, well within chemical accuracy, while the relative ranking is the same as for GFN1-xTB.
Compatibility of QM and MM Models
In addition to the choice of QM Hamiltonian, also the compatibility of models used in QM/MM simulations effects the accuracy of the results. − To assess this influence, we studied the effect of the choice of MM water model for set C, comparing the compatibility of TIP3P, SPC/E, and OPC3 with GFN1-xTB and GFN2-xTB. Parameters and selected properties of these models are shown in Table S3 in the Supporting Information.
As can be seen in Figure and Table , the effect of the choice of MM water model is not as significant as the difference between the QM Hamiltonians, but it affects the results nevertheless. For both GFN-xTB variants, the SPC/E water model appears to be most compatible, yielding the lowest MAE values.
7.
Comparison of QM/MM RE-EDS results with GFN1-xTB (yellow lines) and GFN2-xTB (blue lines) in TIP3P, SPC/E, and OPC3 (shades of colors) for set C. (A) Deviation from experiment for set C. (B) Correlation to experiment for set C. Error bars represent the standard deviation over repeats and experimental uncertainty for calculated values and experiment, respectively. Shaded light gray area depicts the range that falls within ±4.184 kJ/mol (±1 kcal/mol) from perfect correlation and the dark gray area corresponds to an error margin of ±8.368 kJ/mol (±2 kcal/mol). Hydration free energies as a function of the molecule identifier are shown in Figure S6 in the Supporting Information.
3. Comparison of MAE and Kendall’s τ for Set C When Varying MM Water Model and the QM Hamiltonian in the QM/MM RE-EDS Simulations .
| QM Hamiltonian | water model | MAE [kJ/mol] | Kendall’s τ |
|---|---|---|---|
| GFN1-xTB | TIP3P | 3.0 | 0.93 |
| SPC/E | 2.6 | 0.93 | |
| OPC3 | 2.9 | 0.93 | |
| GFN2-xTB | TIP3P | 5.5 | 0.29 |
| SPC/E | 4.6 | 0.36 | |
| OPC3 | 5.4 | 0.36 |
Additional metrics are provided in Table S3 in the Supporting Information.
Conclusions
In this work, we introduced the combination of the QM/MM scheme with the multistate free-energy method RE-EDS. By doing this, we may increase the accuracy for highly polarizable systems and extend the applicability domain of RE-EDS to systems not parametrized by classical force fields. As a proof-of-concept, we tested the performance of QM/MM RE-EDS by calculating hydration free energies for three data sets using semiempirical methods for the QM Hamiltonian. For all systems, we obtained converged results, which contained up to eight end-states simultaneously. Classical force fields outperformed the QM/MM treatment for two out of three data sets, which has been seen before and is not necessarily surprising because hydration free energies are used within the parametrization pipelines of most force fields. The performance of QM/MM RE-EDS depended strongly on the choice of semiempirical method, with DFTB3 giving the best results of all tested methods (including classical force fields) for set C. Furthermore, we investigated the importance of the compatibility of the QM and MM models in the QM/MM scheme by comparing three different MM water models. The variability is smaller in this case, albeit still non-negligible with up to 3.4 kJ/mol.
While we used semiempirical methods for the QM zone for speed reasons, there is no restriction on the QM Hamiltonian used in QM/MM RE-EDS. Possible future directions include further optimization of the compatibility between the QM and MM models, substituting the QM model by a MLIP for ML/MM calculations, and applying the methodology to study problems with a larger polarization effect.
Supplementary Material
Acknowledgments
The authors gratefully acknowledge financial support by the Swiss National Science Foundation (grant number 200021_212732). Domen Pregeljc thanks the Hans H. Günthard Foundation for his doctoral excellence fellowship. The authors thank Candide Champion for numerous helpful discussions regarding the RE-EDS pipeline, Felix Pultar for many useful discussions regarding the details of the GROMOS implementation and Gregory A. Landrum for best practices on data and code management. Jakob Teetz and Igor Gordiy are acknowledged for reviewing the data and code.
The developed QM/MM RE-EDS methodology has been implemented in the GROMOS software package and the source code is available on GitHub: https://github.com/rinikerlab/gromosXX/tree/qmmm_reeds. All the input files required to reproduce this work are available on GitHub: https://github.com/rinikerlab/qmmm_reeds.
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jpcb.5c02086.
Extended set of error metrics, and additional figures and tables referenced in the text (PDF)
The authors declare no competing financial interest.
Published as part of The Journal of Physical Chemistry B special issue “At the Cutting Edge of Theoretical and Computational Biophysics”.
References
- Kerns, E. H. ; Di, L. . Drug-like Properties: Concepts, Structure Design and Methods; Academic Press, 2008. [Google Scholar]
- Kawasaki Y., Freire E.. Finding a Better Path to Drug Selectivity. Drug Discovery Today. 2011;16:985–990. doi: 10.1016/j.drudis.2011.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kairys V., Baranauskiene L., Kazlauskiene M., Matulis D., Kazlauskas E.. Binding Affinity in Drug Design: Experimental and Computational Techniques. Expet Opin. Drug Discov. 2019;14:755–768. doi: 10.1080/17460441.2019.1623202. [DOI] [PubMed] [Google Scholar]
- Michel J., Foloppe N., Essex J. W.. Rigorous Free Energy Calculations in Structure-Based Drug Design. Mol. Inform. 2010;29:570–578. doi: 10.1002/minf.201000051. [DOI] [PubMed] [Google Scholar]
- De Vivo M., Masetti M., Bottegoni G., Cavalli A.. Role of Molecular Dynamics and Related Methods in Drug Discovery. J. Med. Chem. 2016;59:4035–4061. doi: 10.1021/acs.jmedchem.5b01684. [DOI] [PubMed] [Google Scholar]
- Cournia Z., Allen B., Sherman W.. Relative Binding Free Energy Calculations in Drug Discovery: Recent Advances and Practical Considerations. J. Chem. Inf. Model. 2017;57:2911–2937. doi: 10.1021/acs.jcim.7b00564. [DOI] [PubMed] [Google Scholar]
- Williams-Noonan B. J., Yuriev E., Chalmers D. K.. Free Energy Methods in Drug Design: Prospects of Alchemical Perturbation in Medicinal Chemistry. J. Med. Chem. 2018;61:638–649. doi: 10.1021/acs.jmedchem.7b00681. [DOI] [PubMed] [Google Scholar]
- Cournia Z., Allen B. K., Beuming T., Pearlman D. A., Radak B. K., Sherman W.. Rigorous Free Energy Simulations in Virtual Screening. J. Chem. Inf. Model. 2020;60:4153–4169. doi: 10.1021/acs.jcim.0c00116. [DOI] [PubMed] [Google Scholar]
- Song L. F., Merz K. M.. Evolution of Alchemical Free Energy Methods in Drug Discovery. J. Chem. Inf. Model. 2020;60:5308–5318. doi: 10.1021/acs.jcim.0c00547. [DOI] [PubMed] [Google Scholar]
- Borhani D. W., Shaw D. E.. The Future of Molecular Dynamics Simulations in Drug Discovery. J. Comput.-Aided Mol. Des. 2012;26:15–26. doi: 10.1007/s10822-011-9517-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riniker S.. Fixed-Charge Atomistic Force Fields for Molecular Dynamics Simulations in the Condensed Phase: An Overview. J. Chem. Inf. Model. 2018;58:565–578. doi: 10.1021/acs.jcim.8b00042. [DOI] [PubMed] [Google Scholar]
- Vega C., Abascal J. L. F.. Simulating Water with Rigid Non-Polarizable Models: A General Perspective. Phys. Chem. Chem. Phys. 2011;13:19663–19688. doi: 10.1039/c1cp22168j. [DOI] [PubMed] [Google Scholar]
- Caleman C., van Maaren P. J., Hong M., Hub J. S., Costa L. T., van der Spoel D.. Force Field Benchmark of Organic Liquids: Density, Enthalpy of Vaporization, Heat Capacities, Surface Tension, Isothermal Compressibility, Volumetric Expansion Coefficient, and Dielectric Constant. J. Chem. Theory Comput. 2012;8:61–74. doi: 10.1021/ct200731v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beauchamp K. A., Behr J. M., Rustenburg A. S., Bayly C. I., Kroenlein K., Chodera J. D.. Toward Automated Benchmarking of Atomistic Force Fields: Neat Liquid Densities and Static Dielectric Constants from the ThermoML Data Archive. J. Phys. Chem. B. 2015;119:12912–12920. doi: 10.1021/acs.jpcb.5b06703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Inakollu V. S., Geerke D. P., Rowley C. N., Yu H.. Polarisable Force Fields: What Do They Add in Biomolecular Simulations? Curr. Opin. Struct. Biol. 2020;61:182–190. doi: 10.1016/j.sbi.2019.12.012. [DOI] [PubMed] [Google Scholar]
- van Gunsteren W. F., Daura X., Fuchs P. F. J., Hansen N., Horta B. A. C., Hünenberger P. H., Mark A. E., Pechlaner M., Riniker S., Oostenbrink C.. On the Effect of the Various Assumptions and Approximations used in Molecular Simulations on the Properties of Bio-Molecular Systems: Overview and Perspective on Issues. ChemPhysChem. 2021;22:264–282. doi: 10.1002/cphc.202000968. [DOI] [PubMed] [Google Scholar]
- van Gunsteren W. F., Oostenbrink C.. Methods for Classical-Mechanical Molecular Simulation in Chemistry: Achievements, Limitations, Perspectives. J. Chem. Inf. Model. 2024;64:6281–6304. doi: 10.1021/acs.jcim.4c00823. [DOI] [PubMed] [Google Scholar]
- Lindorff-Larsen K., Piana S., Dror R. O., Shaw D. E.. How Fast-Folding Proteins Fold. Science. 2011;334:517–520. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
- Dror R. O., Dirks R. M., Grossman J. P., Xu H., Shaw D. E.. Biomolecular Simulation: A Computational Microscope for Molecular Biology. Annu. Rev. Biophys. 2012;41:429–452. doi: 10.1146/annurev-biophys-042910-155245. [DOI] [PubMed] [Google Scholar]
- Fiala T., Barros E. P., Heeb R., Riniker S., Wennemers H.. Predicting Collagen Triple Helix Stability through Additive Effects of Terminal Residues and Caps. Angew. Chem., Int. Ed. 2023;62:e202214728. doi: 10.1002/anie.202214728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christensen A. S., Kubar T., Cui Q., Elstner M.. Semiempirical Quantum Mechanical Methods for Noncovalent Interactions for Chemical and Biochemical Applications. Chem. Rev. 2016;116:5301–5337. doi: 10.1021/acs.chemrev.5b00584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Warshel A., Karplus M.. Calculation of Ground and Excited State Potential Surfaces of Conjugated Molecules. I. Formulation and Parametrization. J. Am. Chem. Soc. 1972;94:5612–5625. doi: 10.1021/ja00771a014. [DOI] [Google Scholar]
- Warshel A., Levitt M.. Theoretical Studies of Enzymic Reactions: Dielectric, Electrostatic and Steric Stabilization of the Carbonium Ion in the Reaction of Lysozyme. J. Mol. Biol. 1976;103:227–249. doi: 10.1016/0022-2836(76)90311-9. [DOI] [PubMed] [Google Scholar]
- Senn H. M., Thiel W.. QM/MM Methods for Biomolecular Systems. Angew. Chem., Int. Ed. 2009;48:1198–1229. doi: 10.1002/anie.200802019. [DOI] [PubMed] [Google Scholar]
- Groenhof, G. In Introduction to QM/MM Simulations; Monticelli, L. , Salonen, E. , Eds.; Humana Press: Totowa, NJ, 2103; pp 43–66. [Google Scholar]
- Gao J.. Absolute Free Energy of Solvation from Monte Carlo Simulations Using Combined Quantum and Molecular Mechanical Potentials. J. Phys. Chem. 1992;96:537–540. doi: 10.1021/j100181a009. [DOI] [Google Scholar]
- Muller R. P., Warshel A.. Ab Initio Calculations of Free Energy Barriers for Chemical Reactions in Solution. J. Phys. Chem. 1995;99:17516–17524. doi: 10.1021/j100049a009. [DOI] [PubMed] [Google Scholar]
- Stanton R. V., Peräkylä M., Bakowies D., Kollman P. A.. Combined Ab Initio and Free Energy Calculations To Study Reactions in Enzymes and Solution: Amide Hydrolysis in Trypsin and Aqueous Solution. J. Am. Chem. Soc. 1998;120:3448–3457. doi: 10.1021/ja972723x. [DOI] [Google Scholar]
- Mulholland A. J., Lyne P. D., Karplus M.. Ab Initio QM/MM Study of the Citrate Synthase Mechanism. A Low-Barrier Hydrogen Bond Is not Involved. J. Am. Chem. Soc. 2000;122:534–535. doi: 10.1021/ja992874v. [DOI] [Google Scholar]
- Olsson M. A., Ryde U.. Comparison of QM/MM Methods To Obtain Ligand-Binding Free Energies. J. Chem. Theory Comput. 2017;13:2245–2253. doi: 10.1021/acs.jctc.6b01217. [DOI] [PubMed] [Google Scholar]
- Jug U., Pregeljc D., Mavri J., Vianello R., Stare J.. Elementary SN2 Reaction Revisited. Effects of Solvent and Alkyl Chain Length on Kinetics of Halogen Exchange in Haloalkanes Elucidated by Empirical Valence Bond Simulation. Comput. Theor. Chem. 2017;1116:96–101. doi: 10.1016/j.comptc.2017.02.003. [DOI] [Google Scholar]
- Pregeljc D., Jug U., Mavri J., Stare J.. Why Does the Y326I Mutant of Monoamine Oxidase B Decompose an Endogenous Amphetamine at a Slower Rate Than the Wild Type Enzyme? Reaction Step Elucidated by Multiscale Molecular Simulations. Phys. Chem. Chem. Phys. 2018;20:4181–4188. doi: 10.1039/C7CP07069A. [DOI] [PubMed] [Google Scholar]
- Wang H., Gao Z., Song P., Hu B., Wang J., Cheng M.. Molecular Dynamics Simulation and QM/MM Calculation Reveal the Selectivity Mechanism of Type I 1/2 Kinase Inhibitors: the Effect of Intramolecular H-bonds and Conformational Restriction for Improved Selectivity. Phys. Chem. Chem. Phys. 2019;21:24147–24164. doi: 10.1039/C9CP04353E. [DOI] [PubMed] [Google Scholar]
- Lawan N., Tinikul R., Surawatanawong P., Mulholland A. J., Chaiyen P.. QM/MM Molecular Modeling Reveals Mechanism Insights into Flavin Peroxide Formation in Bacterial Luciferase. J. Chem. Inf. Model. 2022;62:399–411. doi: 10.1021/acs.jcim.1c01187. [DOI] [PubMed] [Google Scholar]
- Prah A., Pregeljc D., Stare J., Mavri J.. Brunner Syndrome Caused by Point Mutation Explained by Multiscale Simulation of Enzyme Reaction. Sci. Rep. 2022;12:21889. doi: 10.1038/s41598-022-26296-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prah A., Mavri J.. L-DOPA Autoxidation: An Empirical Valence Bond Simulation of the Reactive Step. J. Phys. Chem. B. 2024;128:8355–8361. doi: 10.1021/acs.jpcb.4c03002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones R. O.. Density Functional Theory: Its Origins, Rise to Prominence, and Future. Rev. Mol. Phys. 2015;87:897–923. doi: 10.1103/RevModPhys.87.897. [DOI] [Google Scholar]
- Warshel, A. Computer Modeling of Chemical Reactions in Enzymes and in Solutions; Wiley: New York, 1991. [Google Scholar]
- Zwanzig R. W.. High-Temperature Equation of State by a Perturbation Method. I. Nonpolar Gases. J. Chem. Phys. 1954;22:1420–1426. doi: 10.1063/1.1740409. [DOI] [Google Scholar]
- Rod T. H., Ryde U.. Quantum Mechanical Free Energy Barrier for an Enzymatic Reaction. Phys. Rev. Lett. 2005;94:138302. doi: 10.1103/PhysRevLett.94.138302. [DOI] [PubMed] [Google Scholar]
- Cave-Ayland C., Skylaris C.-K., Essex J. W.. Direct Validation of the Single Step Classical to Quantum Free Energy Perturbation. J. Phys. Chem. B. 2015;119:1017–1025. doi: 10.1021/jp506459v. [DOI] [PubMed] [Google Scholar]
- Zhang Y., Liu H., Yang W.. Free Energy Calculation on Enzyme Reactions with an Efficient Iterative Procedure to Determine Minimum Enery Paths on a Combined Ab Initio QM/MM Potential Energy Surface. J. Chem. Phys. 2000;112:3483. doi: 10.1063/1.480503. [DOI] [Google Scholar]
- Rod T. H., Ryde U.. Accurate QM/MM Free Energy Calculations of Enzyme Reactions: Methylation by Catechol O-Methyltransferase. J. Chem. Theory Comput. 2005;1:1240–1251. doi: 10.1021/ct0501102. [DOI] [PubMed] [Google Scholar]
- König G., Boresch S.. Non-Boltzmann Sampling and Bennetts Acceptance Ratio Method: How to Profit from Bending the Rules. J. Comput. Chem. 2011;32:1082–1090. doi: 10.1002/jcc.21687. [DOI] [PubMed] [Google Scholar]
- König G., Hudson P. S., Boresch S., Woodcock H. L.. Multiscale Free Energy Simulations: An Efficient Method for Connecting Classical MD Simulations to QM or QM/MM Free Energies Using Non-Boltzmann Bennett Reweighting Schemes. J. Chem. Theory Comput. 2014;10:1406–1419. doi: 10.1021/ct401118k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- König G., Mei Y., Pickard F. C., Simmonett A. C., Miller B. T., Herbert J. M., Woodcock H. L., Brooks B. R., Shao Y.. Computation of Hydration Free Energies Using the Multiple Environment Single System Quantum Mechanical/Molecular Mechanical Method. J. Chem. Theory Comput. 2016;12:332–344. doi: 10.1021/acs.jctc.5b00874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Plotnikov N. V., Kamerlin S. C. L., Warshel A.. Paradynamics: An Effective and Reliable Model for Ab Initio QM/MM Free-Energy Calculations and Related Tasks. J. Phys. Chem. B. 2011;115:7950–7962. doi: 10.1021/jp201217b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Y., Pu J.. Reaction Path Force Matching: A New Strategy of Fitting Specific Reaction Parameters for Semiempirical Methods in Combined QM/MM Simulations. J. Chem. Theory Comput. 2014;10:3038–3054. doi: 10.1021/ct4009624. [DOI] [PubMed] [Google Scholar]
- Steinmann C., Olsson M. A., Ryde U.. Relative Ligand-Binding Free Energies Calculated from Multiple Short QM/MM MD Simulations. J. Chem. Theory Comput. 2018;14:3228–3237. doi: 10.1021/acs.jctc.8b00081. [DOI] [PubMed] [Google Scholar]
- Rufa, D. A. ; Macdonald, H. E. B. ; Fass, J. ; Wieder, M. ; Grinaway, P. B. ; Roitberg, A. E. ; Isayev, O. ; Chodera, J. D. . Towards Chemical Accuracy for Alchemical Free Energy Calculations with Hybrid Physics-Based Machine Learning/Molecular Mechanics Potentials. 2020, bioRxiv 2020.07.29.227959. [Google Scholar]
- Wieder M., Fass J., Chodera J. D.. Fitting Quantum Machine Learning Potentials to Experimental Free Energy Data: Predicting Tautomer Ratios in Solution. Chem. Sci. 2021;12:11364–11381. doi: 10.1039/D1SC01185E. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wieder, M. ; Fass, J. ; Chodera, J. D. . Teaching Free Energy Calculations to Learn from Experimental Data. 2021, bioRxiv 2021.08.24.457513. [Google Scholar]
- Karwounopoulos J., Bieniek M., Wu Z., Baskerville A. L., König G., Cossins B. P., Wood G. P. F.. Evaluation of Machine Learning/Molecular Mechanics End-State Corrections with Mechanical Embedding to Calculate Relative Protein–Ligand Binding Free Energies. J. Chem. Theory Comput. 2025;21:967–977. doi: 10.1021/acs.jctc.4c01427. [DOI] [PubMed] [Google Scholar]
- Crha R., Poliak P., Gillhofer M., Oostenbrink C.. Alchemical Free-Energy Calculations at Quantum-Chemical Precision. J. Phys. Chem. Lett. 2025;16:863–869. doi: 10.1021/acs.jpclett.4c03213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang, X. ; Wu, X. ; Brooks, B. ; Wang, J. . Accurate Free Energy Calculation via Multiscale Simulations Driven by Hybrid Machine Learning and Molecular Mechanics Potentials. 2025, chemrxiv-2024-zq975-v2. [Google Scholar]
- Pultar F., Thürlemann M., Gordiy I., Doloszeski E., Riniker S.. Neural Network Potential with Multi-Resolution Approach Enables Accurate Prediction of Reaction Free Energies in Solution. J. Am. Chem. Soc. 2025;147:6835–6856. doi: 10.1021/jacs.4c17015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sidler D., Schwaninger A., Riniker S.. Replica Exchange Enveloping Distribution Sampling (RE-EDS): A Robust Method to Estimate Multiple Free-Energy Differences from a Single Simulation. J. Chem. Phys. 2016;145:154114. doi: 10.1063/1.4964781. [DOI] [PubMed] [Google Scholar]
- Sidler D., Cristòfol-Clough M., Riniker S.. Efficient Round-Trip Time Optimization for Replica-Exchange Enveloping Distribution Sampling (RE-EDS) J. Chem. Theory Comput. 2017;13:3020–3030. doi: 10.1021/acs.jctc.7b00286. [DOI] [PubMed] [Google Scholar]
- Rieder S., Ries B., Champion C., Barros E. P., Hünenberger P. H., Riniker S.. Replica-Exchange Enveloping Distribution Sampling: Calculation of Relative Free Energies in GROMOS. Chimia. 2022;76:327–330. doi: 10.2533/chimia.2022.327. [DOI] [PubMed] [Google Scholar]
- Champion C., Gall R., Ries B., Rieder S., Barros E. P., Riniker S.. Accelerating Alchemical Free Energy Prediction Using a Multistate Method: Application to Multiple Kinases. J. Chem. Inf. Model. 2023;63:7133–7147. doi: 10.1021/acs.jcim.3c01469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tironi I. G., Sperb R., Smith P. E., van Gunsteren W. F. A.. Generalized Reaction Field Method for Molecular Dynamics Simulations. J. Chem. Phys. 1995;102:5451–5459. doi: 10.1063/1.469273. [DOI] [Google Scholar]
- Mori T., Hamers R. J., Pedersen J. A., Cui Q.. Integrated Hamiltonian Sampling: A Simple and Versatile Method for Free Energy Simulations and Conformational Sampling. J. Phys. Chem. B. 2014;118:8210–8220. doi: 10.1021/jp501339t. [DOI] [PubMed] [Google Scholar]
- Li, C. , Zhang, X. , Chan, G. K.-L. . General Quantum Alchemical Free Energy Simulations via Hamiltonian Interpolation. 2024, arXiv.2408.17002 [DOI] [PubMed] [Google Scholar]
- Mobley D. L., Guthrie J. P.. FreeSolv: a Database of Experimental and Calculated Hydration Free Energies, with Input Files. J. Comput.-Aided Mol. Des. 2014;28:711–720. doi: 10.1007/s10822-014-9747-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu W., Jia X., Wang M., Li P., Wang X., Hu W., Zheng J., Mei Y.. Calculations of the Absolute Binding Free Energies for Ralstonia solanacearum Lectins Bound with Methyl-α-l-fucoside at Molecular Mechanical and Quantum Mechanical/Molecular Mechanical Levels. RSC Adv. 2017;7:38570–38580. doi: 10.1039/c7ra06215j. [DOI] [Google Scholar]
- Bakowies D., Thiel W.. Hybrid Models for Combined Quantum Mechanical and Molecular Mechanical Approaches. J. Phys. Chem. 1996;100:10580–10594. doi: 10.1021/jp9536514. [DOI] [Google Scholar]
- Lin H., Truhlar D. G.. QM/MM: What Have We Learned, Where Are We, and Where Do We Go From Here? Theor. Chem. Acc. 2007;117:185–199. doi: 10.1007/s00214-006-0143-z. [DOI] [Google Scholar]
- Kirkwood J. G.. Statistical Mechanics of Fluid Mixtures. J. Chem. Phys. 1935;3:300–313. doi: 10.1063/1.1749657. [DOI] [Google Scholar]
- Pitman M., Hahn D. F., Tresadern G., Mobley D. L.. To Design Scalable Free Energy Perturbation Networks, Optimal Is Not Enough. J. Chem. Inf. Model. 2023;63:1776–1793. doi: 10.1021/acs.jcim.2c01579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christ C. D., van Gunsteren W. F.. Enveloping Distribution Sampling: A Method to Calculate Free Energy Differences from a Single Simulation. J. Chem. Phys. 2007;126:184110. doi: 10.1063/1.2730508. [DOI] [PubMed] [Google Scholar]
- Christ C. D., van Gunsteren W. F.. Multiple Free Energies from a Single Simulation: Extending Enveloping Distribution Sampling to Non-Overlapping Phase-Space Distributions. J. Chem. Phys. 2008;128:174112. doi: 10.1063/1.2913050. [DOI] [PubMed] [Google Scholar]
- Christ C. D., van Gunsteren W. F.. Simple, Efficient, and Reliable Computation of Multiple Free Energy Differences from a Single Simulation: A Reference Hamiltonian Parameter Update Scheme for Enveloping Distribution Sampling (EDS) J. Chem. Theory Comput. 2009;5:276–286. doi: 10.1021/ct800424v. [DOI] [PubMed] [Google Scholar]
- Christ C. D., van Gunsteren W. F.. Comparison of Three Enveloping Distribution Sampling Hamiltonians for the Estimation of Multiple Free Energy Differences from a Single Simulation. J. Comput. Chem. 2009;30:1664–1679. doi: 10.1002/jcc.21325. [DOI] [PubMed] [Google Scholar]
- Riniker S., Kunz A.-P. E., van Gunsteren W. F.. On the Calculation of the Dielectric Permittivity and Relaxation of Molecular Models in the Liquid Phase. J. Chem. Theory Comput. 2011;7:1469–1475. doi: 10.1021/ct100610v. [DOI] [PubMed] [Google Scholar]
- Ries B., Normak K., Weiss R. G., Rieder S., Barros E. P., Champion C., König G., Riniker S.. Relative Free-Energy Calculations for Scaffold Hopping-Type Transformations with an Automated RE-EDS Sampling Procedure. J. Comput.-Aided Mol. Des. 2022;36:117–130. doi: 10.1007/s10822-021-00436-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rieder S., Ries B., Schaller K., Champion C., Barros E. P., Hünenberger P. H., Riniker S.. Replica-Exchange Enveloping Distribution Sampling Using Generalized AMBER Force-Field Topologies: Application to Relative Hydration Free-Energy Calculations for Large Sets of Molecules. J. Chem. Inf. Model. 2022;62:3043–3056. doi: 10.1021/acs.jcim.2c00383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rieder S., Ries B., Kubincová A., Champion C., Barros E. P., Hünenberger P. H., Riniker S.. Leveraging the Sampling Efficiency of RE-EDS in OpenMM Using a Shifted Reaction-Field with an Atom-Based Cutoff. J. Chem. Phys. 2022;157:104117. doi: 10.1063/5.0107935. [DOI] [PubMed] [Google Scholar]
- Champion C., Hünenberger P. H., Riniker S.. Multistate Method to Efficiently Account for Tautomerism and Protonation in Alchemical Free-Energy Calculations. J. Chem. Theory Comput. 2024;20:4350–4362. doi: 10.1021/acs.jctc.4c00370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ries B., Rieder S., Rhiner C., Hünenberger P. H., Riniker S.. RestraintMaker: A Graph-Based Approach to Select Distance Restraints in Free-Energy Calculations With Dual Topology. J. Comput.-Aided Mol. Des. 2022;36:175–192. doi: 10.1007/s10822-022-00445-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmid N., Christ C. D., Christen M., Eichenberger A. P., van Gunsteren W. F.. Architecture, Implementation and Parallelisation of the GROMOS Software for Biomolecular Simulation. Comput. Phys. Commun. 2012;183:890–903. doi: 10.1016/j.cpc.2011.12.014. [DOI] [Google Scholar]
- Meier K., Schmid N., van Gunsteren W. F.. Interfacing the GROMOS (Bio)molecular Simulation Software to Quantum-Chemical Program Packages. J. Comput. Chem. 2012;33:2108–2117. doi: 10.1002/jcc.23047. [DOI] [PubMed] [Google Scholar]
- van Rossum, G. ; Drake, F. L. . Python 3 Reference Manual; CreateSpace: Scotts Valley, CA, 2009. [Google Scholar]
- Ries, B. ; Rieder, S. R. ; Champion, C. ; Barros, E. P. ; Riniker, S. . rinikerlab/reeds: An Automatized RE-EDS Sampling Procedure (v1.0). 2021, https://github.com/rinikerlab/reeds (accessed Sep 10, 2024).
- Bannwarth C., Caldeweyher E., Ehlert S., Hansen A., Pracht P., Seibert J., Spicher S., Grimme S.. Extended Tight-Binding Quantum Chemistry Methods. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2021;11:e1493. doi: 10.1002/wcms.1493. [DOI] [Google Scholar]
- Grimme S., Bannwarth C., Shushkov P.. A Robust and Accurate Tight-Binding Quantum Chemical Method for Structures, Vibrational Frequencies, and Noncovalent Interactions of Large Molecular Systems Parametrized for All spd-Block Elements (Z = 1–86) J. Chem. Theory Comput. 2017;13:1989–2009. doi: 10.1021/acs.jctc.7b00118. [DOI] [PubMed] [Google Scholar]
- Bannwarth C., Ehlert S., Grimme S.. GFN2-xTB – An Accurate and Broadly Parametrized Self-Consistent Tight-Binding Quantum Chemical Method with Multipole Electrostatics and Density-Dependent Dispersion Contributions. J. Chem. Theory Comput. 2019;15:1652–1671. doi: 10.1021/acs.jctc.8b01176. [DOI] [PubMed] [Google Scholar]
- Hourahine B., Aradi B., Blum V., Bonafé F., Buccheri A., Camacho C., Cevallos C., Deshaye M. Y., Dumitrică T., Dominguez A.. et al. DFTB+, a Software Package for Efficient Approximate Density Functional Theory Based Atomistic Simulations. J. Chem. Phys. 2020;152:124101. doi: 10.1063/1.5143190. [DOI] [PubMed] [Google Scholar]
- Gaus M., Cui Q., Elstner M.. DFTB3: Extension of the Self-Consistent-Charge Density-Functional Tight-Binding Method (SCC-DFTB) J. Chem. Theory Comput. 2011;7:931–948. doi: 10.1021/ct100684s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caldeweyher E., Bannwarth C., Grimme S.. Extension of the D3 Dispersion Coefficient Model. J. Chem. Phys. 2017;147:034112. doi: 10.1063/1.4993215. [DOI] [PubMed] [Google Scholar]
- Caldeweyher E., Ehlert S., Hansen A., Neugebauer H., Spicher S., Bannwarth C., Grimme S.. A Generally Applicable Atomic-Charge Dependent London Dispersion Correction. J. Chem. Phys. 2019;150:154112. doi: 10.1063/1.5090222. [DOI] [PubMed] [Google Scholar]
- Jorgensen W. L., Chandraekhar J., Madura J. D., Impey R. W., Klein M. L.. Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983;79:926–935. doi: 10.1063/1.445869. [DOI] [Google Scholar]
- Berendsen H. J. C., Grigera J. R., Straatsma T. P.. The Missing Term in Effective Pair Potentials. J. Phys. Chem. 1987;91:6269–6271. doi: 10.1021/j100308a038. [DOI] [Google Scholar]
- Izadi S., Onufriev A. V.. Accuracy Limit of Rigid 3-Point Water Models. J. Chem. Phys. 2016;145:074501. doi: 10.1063/1.4960175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lide, D. R. Handbook of Chemistry and Physics, 88th ed.; CRC Press/Taylor and Francis: Boca Raton, FL, 2007. [Google Scholar]
- Berendsen H. J. C., Postma J. P. M., van Gunsteren W. F., DiNola A., Haak J. R.. Molecular Dynamics with Coupling to an External Bath. J. Chem. Phys. 1984;81:3684–3690. doi: 10.1063/1.448118. [DOI] [Google Scholar]
- Ryckaert J. P., Ciccotti G., Berendsen H. J. C.. Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-alkanes. J. Comput. Phys. 1977;23:327–341. doi: 10.1016/0021-9991(77)90098-5. [DOI] [Google Scholar]
- Wang J., Wolf R. M., Caldwell J. W., Kollman P. A., Case D. A.. Development and Testing of a General Amber Force Field. J. Comput. Chem. 2004;25:1157–1174. doi: 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
- Wagner, J. ; Thompson, M. ; Dotson, D. ; Hyejang; Boothroyd, S. ; Rodríguez-Guerra, J. . openforcefield/openff-forcefields: Version 2.0.0 Sage. 2021, https://zenodo.org/records/5214478 (accessed Sep 10, 2024)..
- Wagner, J. ; Thompson, M. ; Mobley, D. L. ; Chodera, J. ; Bannan, C. ; Rizzi, A. ; Trevorgokey; Dotson, D. L. ; Mitchell, J. A. ; Jaimergp; et al. openforcefield/openff-toolkit: 0.15.2 minor Feature Release. 2024, https://zenodo.org/records/10613658 (accessed Sep 10, 2024).
- Eichenberger A. P., Allison J. R., Dolenc J., Geerke D. P., Horta B. A. C., Meier K., Oostenbrink C., Schmid N., Steiner D., Wang D.. et al. GROMOS++ Software for the Analysis of Biomolecular Simulation Trajectories. J. Chem. Theory Comput. 2011;7:3379–3390. doi: 10.1021/ct2003622. [DOI] [PubMed] [Google Scholar]
- Lehner, M. T. ; Ries, B. ; Rieder, S. R. ; Riniker, S. . srinikerlab/PyGromosTools: PyGromosTools (v2.0). 2021, https://zenodo.org/record/4849409#.Yp0gTKjMLIU (accessed Sep 10, 2024).
- Hunter J. D.. Matplotlib: A 2D Graphics Environments. Comput. Sci. Eng. 2007;9:90–95. doi: 10.1109/MCSE.2007.55. [DOI] [Google Scholar]
- Hunter, J. D. mpmath: A Python Library for Arbitrary-Precision Floating-Point Arithmetic, Version 0.18. 2013, http://mpmath.org/ (accessed Sep 10, 2024).
- van der Walt S., Colbert S. C., Varoquaux G.. The NumPy Array: A Structure for Efficient Numerical Computation. Comput. Sci. Eng. 2011;13:22–30. doi: 10.1109/mcse.2011.37. [DOI] [Google Scholar]
- McKinney, W. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference, 2010; Vol. 445, pp 51–56. [Google Scholar]
- Virtanen P., Gommers R., Oliphant T. E., Haberland M., Reddy T., Cournapeau D., Burovski E., Peterson P., Weckesser W., Bright J.. et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beauchamp, K. A. ; Chodera, J. D. ; Naden, L. N. ; Shirts, M. R. . Python Implementation of the Multistate Bennett Acceptance Ratio (MBAR). 2008, http://github.com/choderalab/pymbar (accessed Sep 10, 2024).
- Kendall M. G.. A New Measure of Rank Correlation. Biometrika. 1938;30:81–93. doi: 10.2307/2332226. [DOI] [Google Scholar]
- Spearman C.. The Proof and Measurement of Association Between Two Things. Am. J. Psychol. 1904;15:72–101. doi: 10.2307/1412159. [DOI] [PubMed] [Google Scholar]
- Pearson K.. Note on Regression and Inheritance in the Case of Two Parents. Proc. R. Soc. London. 1895;58:240–242. [Google Scholar]
- Bennett C. H.. Efficient Estimation of Free Energy Differences from Monte Carlo Data. J. Comput. Phys. 1976;22:245–268. doi: 10.1016/0021-9991(76)90078-4. [DOI] [Google Scholar]
- Shirts M. R., Chodera J. D.. Statistically Optimal Analysis of Samples from Multiple Equilibrium States. J. Chem. Phys. 2008;129:124105. doi: 10.1063/1.2978177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- König G., Pickard F. C., Huang J., Thiel W., MacKerell A. D., Brooks B. R., York D. M.. A Comparison of QM/MM Simulations With and Without the Drude Oscillator Model Based on Hydration Free Energies of Simple Solutes. Molecules. 2018;23:2695. doi: 10.3390/molecules23102695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cisneros G. A., Tholander S. N.-I., Parisel O., Darden T. A., Elking D., Perera L., Piquemal J.-P.. Simple Formulas for Improved Point-charge Electrostatics in Classical Force Fields and Hybrid Quantum Mechanical/Molecular Mechanical Embedding. Int. J. Quantum Chem. 2008;108:1905–1912. doi: 10.1002/qua.21675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shaw K. E., Woods C. J., Mulholland A. J.. Compatibility of Quantum Chemical Methods and Empirical (MM) Water Models in Quantum Mechanics/Molecular Mechanics Liquid Water Simulations. J. Phys. Chem. Lett. 2010;1:219–223. doi: 10.1021/jz900096p. [DOI] [Google Scholar]
- Wang B., Truhlar D. G.. Geometry Optimization Using Tuned and Balanced Redistributed Charge Schemes for Combined Quantum Mechanical and Molecular Mechanical Calculations. Phys. Chem. Chem. Phys. 2011;13:10556–10564. doi: 10.1039/c0cp02850a. [DOI] [PubMed] [Google Scholar]
- Goyal P., Qian H.-J., Irle S., Lu X., Roston D., Mori T., Elstner M., Cui Q.. Molecular Simulation of Water and Hydration Effects in Different Environments: Challenges and Developments for DFTB Based Models. J. Phys. Chem. B. 2014;118:11007–11027. doi: 10.1021/jp503372v. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The developed QM/MM RE-EDS methodology has been implemented in the GROMOS software package and the source code is available on GitHub: https://github.com/rinikerlab/gromosXX/tree/qmmm_reeds. All the input files required to reproduce this work are available on GitHub: https://github.com/rinikerlab/qmmm_reeds.






