Abstract
The conformational ensemble of a molecule is strongly influenced by the surrounding environment. Correctly modeling the effect of any given environment is, hence, of pivotal importance in computational studies. Machine learning (ML) has been shown to be able to model these interactions probabilistically, with successful applications demonstrated for classical molecular dynamics. While first instances of ML implicit solvents for quantum-mechanical (QM) calculations exist, the high computational cost of QM reference calculations hinders the development of a generally applicable ML implicit solvent model for QM calculations. Here, we present a novel way of developing such a general machine-learned QM implicit solvent model by transferring knowledge obtained from classical interactions to QM, emulating a QM/MM setup with electrostatic embedding and a nonpolarizable MM solvent. This has the profound advantages that neither QM/MM reference calculations nor experimental data are required for training and that the obtained graph neural network (GNN)-based implicit solvent model (termed QM-GNNIS) is compatible with any functional and basis set. QM-GNNIS is currently applicable to small organic molecules and describes 39 different organic solvents. The performance of QM-GNNIS is validated on NMR and IR experiments, demonstrating that the approach can reproduce experimentally observed trends unattainable by state-of-the-art implicit-solvent models paired with static QM calculations.


Introduction
The physical and chemical properties of molecules are the result of their three-dimensional (3D) structure. Importantly, flexible molecules do not have a single 3D structure but rather adopt a Boltzmann-weighted ensemble of different conformations. Therefore, computational approaches in disciplines like spectroscopy, − medicinal chemistry, , and stereoselective synthesis need to take the conformational flexibility into account to provide accurate predictions. This, in turn, means that the accurate but fast prediction of conformational ensembles has long been appreciated as one of the prime objectives and challenges of computational chemistry.
The conformational ensemble of a molecule can be obtained from its free-energy landscape. Given an accurate Hamiltonian and within the limit of infinite sampling, molecular dynamics (MD) − is an established method to generate free-energy landscapes. However, for all but the smallest molecules, this method requires extensive sampling to reach convergence, even when more modern enhanced sampling methods are used. In silico conformer generators − forego the limitations posed by sampling methods and use geometric relationships between atoms to swiftly enumerate 3D-structure estimates from the topological graph. When conformer generators are paired with a method to estimate the free energy of the resulting conformers, a Boltzmann-weighted ensemble of molecular structures is obtained.
In the condensed phase, the calculation of molecular ensembles is complicated by the presence of not only intramolecular interactions but also interactions with the surrounding solvent. Small differences in the nature of the solvents may lead to substantially different ensembles. Approaches that include explicit solvent molecules in the calculation are considered most rigorous to describe interactions of solute and solvent. However, the large number of explicit molecules comes with a steep increase in computational cost, especially for calculations that involve more expensive quantum-mechanical (QM) Hamiltonians, which scale poorly with increasing system size. Furthermore, explicit-solvent models were shown to have a lower sampling efficiency due to the additional degrees of freedom introduced by the solvent, which requires extensive sampling, and also the increase in viscosity.
Multiresolution approaches like QM/MM − or, more recently, ML/MM − aim to maintain a high level of accuracy for the description of the solute while solvent molecules are approximated with classical force fields. These approaches are often fruitful when an explicit description of the electronic structure of the solute is desired, for example to study chemical reactions. However, these methods show the same limitations as classical MD simulations and often require vast computational resources.
Implicit solvent models aim to represent the effect a solvent exerts on a solute by an electrostatic continuum. This approximation not only reduces the computational burden due to a much smaller number of explicit particles in the system but also significantly accelerates sampling efficiency due to the instantaneous averaging of solvent configurations and reduced viscosity. , Many implicit solvent models, especially those that are used in conjunction with QM methods, consider the solute’s electron density when solving the Poisson–Boltzmann equation. , Examples include the conductor-like screening (COSMO), the conductor-like polarizable continuum (CPCM), and the solvation model based on density (SMD) models. A more complex solvent model that aims to include explicit-solvent effects was coined COSMO-RS. , Starting from perfectly screened molecules in the conductor-like approach, the COSMO-RS model uses methods from statistical thermodynamics to pair charged surface segments. Due to the relatively high computational burden associated with solving the Poisson–Boltzmann equation, force-field methods are typically paired with implicit solvent models that are based on the generalized Born (GB) model, which does not consider the solute’s electron density but rather atomic monopoles. Examples of these models are GB-HCT, GB-OBC, GB-Neck, and GB-Neck2. Examples of GB models employed in QM calculations include SM8 and SM12.
In previous studies, we have developed a graph neural network (GNN) based implicit solvent model (GNNIS), which was trained on forces extracted from classical MD simulations with explicit solvent. − While GNNIS yielded excellent results compared to the reference explicit-solvent simulations as well as experimental observables, replicating this approach directly for QM calculations is made prohibitively expensive by the necessary reference QM/MM (or ab initio MD) calculations for the training data. While transfer learning (i.e., taking a ML model trained on one task as a starting point for another) may allow for a reduction in training data, acquiring a diverse training set would still require a substantial computational effort. In addition, a specific functional and basis set would need to be chosen, potentially limiting the approach’s applicability to these specific choices. For this reason, we aimed to develop an approach that extracts knowledge from GNNIS in classical simulations, which we found to reproduce solute–solvent interactions well, , and transfers it to QM calculations as a correction with respect to traditional implicit solvation. This procedure has the crucial benefit that no further training data is required, and the approach should be applicable to any functional and basis set combination.
For this approach, we rely on the similarity between classical and QM-based implicit solvent models. In both cases, the solvent is modeled as a dielectric continuum interacting with the solute. While the way this continuum interacts with a given solute is modeled differently, deviations from a more accurate explicit-solvent description mostly stem from the lack of correct modeling of the solute–solvent interface. While at a larger distance, modeling a solvent as a continuum is a good approximation, the binary nature of actual solvent molecules is no longer accurately described at short distances. We denote this as the explicit-solvent effect, which we hypothesize to be of similar magnitude for classical and QM/MM (with a nonpolarizable MM solvent) simulations. When studying the behavior of classical MD simulations, we realized that it is mainly this effect that leads to the largest deviations between a traditional implicit solvent and the explicit-solvent ground truth. − For classical force fields, we could already show that the explicit-solvent effect is well reproduced by the GNNIS model, which was trained on a large set of reference forces of ∼370,000 molecules in 39 organic solvents.
In order to assign a free-energy contribution to the explicit-solvent effect, ΔΔG corr, we define it as the difference between the true solvation free energy of a compound in solution and the solvation free energy estimated based on a continuum model that neglects the explicit-solvent effect. Taking the classical GNNIS model as a good representation of the true solvation free energy of a classically described molecule and the GB-Neck2 model as a continuum-based estimate, ΔΔG corr can then be approximated for the identically described molecule as,
| 1 |
where ΔG GNNIS is the free-energy contribution calculated with the classical GNNIS model and ΔG GB‑Neck2 is the free-energy contribution calculated with the traditional implicit solvent model GB-Neck2. Under the assumption that this difference is approximately equal for MM solutes and for QM solutes, we combined it with the QM-based CPCM solvent to develop a machine-learned QM implicit solvent, which we denote QM-GNNIS. We note that a formal connection between GB models like GB-Neck2 and apparent surface-charge models like CPCM exists, which motivates the combination of these two models. As a separability of the two contributions is assumed, the benefit of the proposed approach lies in the direct accessibility of energies, gradients, and Hessians necessary to optimize structures and calculate relevant experimental properties. A schematic representation of the approach is shown in Figure . The force contribution of the solvent on the hydroxy oxygen of 2-methoxy-ethanol is indicated. This compound is known to feature a strong explicit-solvent effect with water, as water can form a pseudo seven-membered ring conformation with the molecule. In these instances, our hypothesis is that the correction force F corr leads to more meaningful solute–solvent interactions compared to continuum-based methods on their own.
1.

Schematic representation of the proposed approach. Theoretical force vectors acting on the hydroxy oxygen are indicated. (A) Forces predicted based on a classical force field. The force of the classical GNNIS model is indicated by the solid orange vector. The forces of the traditional GB-Neck2 implicit solvent based on the same force field, as well as the difference to the GNNIS force vector (i.e., the correction term), are indicated in dashed purple and blue arrows, respectively. (B) Forces predicted for the QM-based implicit solvent model. The force of the QM-based CPCM implicit solvent model is shown by the solid purple arrow. The classical correction force is indicated by the solid blue arrow. The resulting QM-GNNIS force (i.e., the sum of the two aforementioned contributions) is represented by the dashed orange arrow. Note that the correction term F corr is identical for panels A and B.
We emphasize that the proposed QM-GNNIS approach does not constitute an implicit solvent model that targets the accuracy of ab initio QM calculations, but rather the approach emulates QM/MM simulations with electrostatic embedding and a nonpolarizable MM solvent. Note that in our setting, the solute is only polarized based on a continuum approach (i.e., the CPCM solvent), with F corr added to the gradients of the QM atoms. Including explicit-solvent effects in this manner is not exhaustive but provides an improvement over traditional implicit solvent models routinely used in QM calculations.
Results and Discussion
ML models are typically evaluated based on their ability to reproduce reference calculations and are then, if applicable, compared against experimental results. While this validation strategy has been used successfully in many recent studies, − including our own, − the comparison of QM-GNNIS against the computational ground truth (i.e., explicit-solvent ab initio MD or QM/MM MD) is too expensive for the system sizes, simulation lengths, and level of theory we are using. For this reason, we rely on experimental data to evaluate the performance of the approach. To rigorously study the proposed methodology, a large number of ∼200 experimental measurements of 24 different test systems are studied. As a point of reference, QM calculations with the state-of-the-art implicit solvent models SMD and openCOSMO-RS are compared to the proposed approach. Note that we chose SMD solvent (and not CPCM) for the comparison as it was fitted to better reproduce solvation free energies of solutes in different solvents and can thus be thought of as an improvement over CPCM. This fitting is achieved by using solvent-specific parameters that should better describe solvent characteristics (e.g., hydrogen bonding) not captured by the dielectric permittivity of the solvent. As this procedure may already capture some of the explicit solvent effects, we anticipated that the combination of SMD and the correction from QM-GNNIS may overestimate some interactions. Therefore, we decided to combine QM-GNNIS with the simpler CPCM model. To ensure that this is indeed the case, we have performed reference calculations for the set of molecular balances that support this hypothesis (see Supporting Information Section S1).
Conformational Preference of Molecular Balances
To understand how the different implicit-solvent models behave, a set of 22 molecular balances for which experimental free-energy differences between conformers of the central amide group have been determined in ref were studied using the BP86 functional and the def2-SVP basis set with D4 correction (Figure A). First, classically optimized structures were taken as a starting point for QM optimizations with the SMD implicit-solvent model. The free energies of the optimized structures were evaluated with SMD and openCOSMO-RS. They also served as starting structures for the evaluation with QM-GNNIS, which was used to refine the conformers further before the free energies were estimated. The comparison between the predicted and experimentally observed free-energy differences for molecular balances A1, B1, C1, and D1 is shown in Figure B, and the results for all molecular balances are provided in Figures S2–S4.
2.

(A) Schematic representation of the molecular balances. The studied rotation of the amide group is indicated in red. (B) Comparison of the predicted versus observed free-energy differences, ΔΔG, for the QM calculations with SMD (top), openCOSMO-RS (middle), and QM-GNNIS (bottom). The experimental values were taken from ref . The solid and dashed black lines indicate identity and deviations of half k bT, respectively. The color indicates the dielectric permittivity of the solvent (red: apolar, blue: polar).
Overall, all implicit-solvent models provide decent agreement between the predicted ΔΔG and the experimental values. To compare the models quantitatively, we calculated the Pearson correlation coefficient (PCC) for all balance/solvent combinations (top panel of Figure ). While the SMD and QM-GNNIS models show a similar correlation, the openCOSMO-RS model performs significantly worse.
3.
(Top) Pearson correlation coefficient (PCC) between the experimentally observed ΔΔG values and the predictions based on QM calculations with SMD, openCOSMO-RS, and QM-GNNIS. The black star indicates a significant (p < 0.05) difference with respect to the other models. (Bottom) Slopes between the experimentally observed ΔΔG values and the predictions based on QM calculations with SMD, openCOSMO-RS, and QM-GNNIS. Black stars indicate a significant (p < 0.05) difference between the distributions and the perfect slope of 1.
While the correlation between the experimental and predicted ΔΔG values is of significance, the strength of the solvent interaction is of equal importance. Molecular balance D1, for instance, shows an excellent correlation with PCC = 0.92 for the SMD solvent model, even though the differences between the solvents are too small, leading to a slope of 0.34. Therefore, we also evaluated the capability of an implicit-solvent model to reproduce the correct relative interaction strength by calculating the slope between experiment and prediction (bottom panel of Figure ). This analysis revealed that while the SMD model achieves good correlations, it systematically underestimates the difference in ΔG between the different solvents, manifesting in slopes that are significantly below one. While the same trend was observed for the openCOSMO-RS model, the QM-GNNIS model captures the differences well, resulting in slopes centered around one.
In this context, it is noteworthy that predictions obtained with a classical force field and the GNNIS model also show excellent agreement with experiment as demonstrated in our previous study. However, the largest outlier in this study was molecular balance A3, which features a nitrile group in close proximity to a key hydrogen bond interacting with the ketone. Interactions involving higher multipoles that may be relevant in this case are not captured by classical fixed-charge force fields, potentially leading to the large deviations. Using QM calculations with QM-GNNIS, in contrast, reproduced the experimental trends much better (Figure ), highlighting the benefits of QM-based methods for the description of more complex molecular interactions.
4.
Comparison of predicted versus observed free-energy differences, ΔΔG, for the classical approach (OpenFF with GNNIS) and for QM calculations with QM-GNNIS for molecular balance A3. The experimental values were taken from ref classical data from ref . The solid and dashed black lines indicate identity and deviations of half k bT, respectively. The color indicates the dielectric permittivity of the solvent (red: apolar, blue: polar).
Solvent Effects in NMR and IR Spectroscopy
In case of the molecular balances above, the experimentally observed trend correlates with the dielectric permittivity of the solvent, i.e., apolar solvents such as chloroform, ethyl acetate, THF, and DCM showed a stronger preference for the intramolecular hydrogen bond while polar solvents such as methanol, ethanol, and DMSO showed a stronger preference for the “open” state. To further investigate the behavior of the different implicit-solvent models, we focused on examples where more complex solute–solvent interactions are at play.
NMR Measurements
The conformational preference of the two small molecules 2-methoxy-ethanol and 1,2-dimethoxyethane (Figure A) does not simply follow the dielectric permittivity of the solvent but is rather governed by explicit-solvent effects such as hydrogen bonding, making them an ideal test case. For both compounds, the conformational ensemble can be described based on the rotational state of the central torsion angle (Figure A). In refs and , 1H NMR measurements were conducted that allowed the determination of a total scalar coupling J tot of the α proton of the methoxy group, whose magnitude is directly related to the population of the conformations of the central torsion in different solvents given two commonly made assumptions: First, the experimental J tot is a population average over two distinct conformers (i.e., the gauche- and trans-conformer), and second, the underlying J HH coupling constants of a given conformer are solvent independent. The conformers of the two compounds were evaluated using a higher level of theory as the molecular balances with the B3LYP functional − and def2-TZVP basis set. Again, the three different implicit-solvent models were used in the QM calculations to optimize the two compounds in the ten solvents for which J tot value had been determined experimentally. The correlation between the predicted population of the trans-conformer and the experimentally observed J tot is shown in Figure .
5.
(A) Structure of 2-methoxy-ethanol and 1,2-dimethoxyethane and their shared Newman projection of the gauche- and trans-conformer. (B) Comparison of the predicted population of the trans-conformer and the experimental J tot coupling constant of 2-methoxy-ethanol (top) and 1,2-dimethoxyethane (bottom) for QM calculations with SMD (left), openCOSMO-RS (middle), and QM-GNNIS (right). The color indicates the dielectric permittivity of the solvent (red: apolar, blue: polar).
For both compounds, the QM calculations with openCOSMO-RS and QM-GNNIS show a good correlation between their predicted populations and the experimental J tot values. In contrast, QM calculations with SMD show no correlation for 2-methoxy-ethanol and a weaker one for compound 1,2-dimethoxyethane. This finding is especially interesting as the SMD implicit-solvent model showed the best correlations for the molecular balances (Figure ), indicating that this good performance may likely be only because of the strong correlation with the dielectric permittivity of the solvent in case of the molecular balances. As alluded to in the Introduction (Figure ), the relatively low population of the trans-conformer in water may be attributed to the formation of a pseudoseven-membered ring via a hydrogen-bond network of a water molecule with 2-methoxy-ethanol. The lower trans-conformer population in water predicted by QM-GNNIS and openCOSMO-RS with respect to other polar solvents (i.e., methanol and DMSO) demonstrates that the approach can capture more complex solute–solvent interactions.
Benchmarking of Functionals and Basis Sets
A major advantage of the proposed approach is that because no QM data is required for training, the QM-GNNIS model should by definition be compatible with any functional and basis set combination. To demonstrate that this is true in practice and to analyze the influence of different functionals and basis sets, we have chosen five functionals (BP86, , TPSS B3LYP, − M06-2X, and ω-B97X and four basis sets (def2-SVP, 6-311G, def2-TZVP, and ma-def2-TZVP , ) and tested all combinations for 2-methoxy-ethanol in three solvents: chloroform (lowest J tot), methanol (middle J tot), and DMSO (highest J tot). The results are summarized in Figure .
6.

Comparison of the predicted population of the trans-conformer and the experimental J tot coupling constant of 2-methoxy-ethanol for QM calculations with the QM-GNNIS model with different functionals and basis sets. In total, 20 combinations are shown for chloroform, methanol, and DMSO. For clarity, the same data is shown in three plots. (A) The color indicates all data points acquired with the same functional. (B) The color indicates all data points acquired with the same basis set. (C) Each point shows the average and standard deviation for all functionals used with the same basis set (1: def2-SVP, 2: 6-311G, 3: def2-TZVP, and 4: ma-def2-TZVP).
The data indicates that the approach is indeed compatible with all tested functional and basis set combinations, which is supported by the fact that the variation between different functionals using the same basis set is consistent. Further, all combinations reproduce the experimental trend well, which is encouraging. Interestingly, the larger basis sets appear to favor the trans-conformer more than the smaller basis sets. While it is not clear whether this behavior is more physical, we note that the trans-conformer is more favored in the QM calculations compared to the fully classical results in ref . Taken together, these findings suggest that a more accurate description of the physical interactions correlates with a stabilization of the trans-conformer.
Infrared (IR) Measurements
Solvent effectsespecially by polar solventscan also be observed in IR spectra. To the best of our knowledge, there is currently no implementation of analytical gradient or Hessian calculation for any COSMO-RS model, including openCOSMO-RS. It is our understanding that the derivation of a closed-form expression for gradients and higher-order derivatives for COSMO-RS is complicated by the presence of not continuously differentiable functions in the energy-interaction operator and the iterative solution procedure of the COSMO-RS equations. Therefore, (open)COSMO-RS can only provide free-energy weights for frequencies obtained for structures optimized in the gas phase or with other implicit-solvent models like SMD. In contrast, the (QM-)GNNIS model is fully differentiable and allows analytical evaluation of gradients as used for geometry optimization and Hessians, which is a requirement for predicting vibrational frequencies along with all other physical properties that can be expressed as a derivative of the energy. This feature is for instance used here for the prediction of the vibrational frequencies observed in IR measurements.
To further investigate the performance of the implicit-solvent models, the IR spectra of 2-methoxy-ethanol in five different solvents were predicted, and the OH vibrational frequencies were compared against experimental data from ref . Note that only the frequencies were predicted, which were analyzed using a Boltzmann weighting according to the predicted free energies of the conformers. The intensities resulting from a changed dipole derivative were assumed constant. The predicted OH vibrational spectra and the comparison of the frequencies against experiment are shown in Figure .
7.
Predicted IR spectrum of the OH vibration based on QM calculations with SMD (upper left), openCOSMO-RS (upper middle), and QM-GNNIS (upper right). The comparison between the predicted (x-axis) and experimentally observed frequencies ν (y-axis) for each solvent is shown below the predicted spectra. The dashed black line indicates the line y = x + k where k is an offset.
While the predicted frequencies of the QM calculations with SMD and openCOSMO-RS are very similar to each other independent of the solvent, the calculations with QM-GNNIS show larger solvent effects, which is in agreement with experiment. Note that the absolute values of the computed versus experimental wavenumbers are shifted by approximately ∼200 cm–1 for all methods. We hypothesize that this shift is a result of the level of theory used, as it is well-known that frequencies computed with DFT methods show a constant offset.
From Figure , we conclude that the QM-GNNIS model seems to capture key effects from the interaction between the solvent and the OH group of 2-methoxy-ethanol that give rise to a difference in the vibrational frequency unattainable by the other implicit-solvent models. This finding is especially striking as new conclusions can be drawn based on these results that provide an alternative explanation for the solvent-specific frequencies. In the original publication ref , the authors surmised that the difference in frequency resulted from an opening of the intramolecular hydrogen bond and concluded that the frequency shift was directly linked to the “open” vs “closed” pseudo five-membered intramolecular ring. Our results, however, indicate that it may rather be the subtle differences of the closed ring conformers in conjunction with the direct effect of the solvent interacting with the OH group that shift the vibrational frequency. A more detailed analysis indicates that (i) the closed-ring form dominates in all solvents, and that (ii) the opened-ring conformers feature higher rather than lower frequencies (see Figure S5), the opposite of what was suggested in ref . This observation highlights the proposed QM-GNNIS model as a versatile approach that can accurately predict the solvent-dependent conformational ensemble of molecules while also providing access to experimental properties such as IR frequencies.
Conclusions
Here, we have presented a novel way of developing a machine-learned QM implicit-solvent approach. The method separates the task of modeling the solvent into evaluating a continuum-based QM implicit-solvent method and adding a machine-learned explicit-solvent correction term. This term can be parametrized from classical explicit-solvent MD simulations, which are known to capture solute–solvent effects well, thus requiring no additional QM/MM training data. The resulting QM-GNNIS model has been demonstrated to outperform state-of-the-art QM-based implicit-solvent methods and provides access to novel explanations for experiments. The gradients and Hessians, in addition to energies, enable broad applications, and the additive nature of the method allows easy implementation with existing QM software packages.
Methods
For QM calculations, ORCA 6.0.1 , was used throughout this work. If not stated otherwise, ORCA’s default settings are applied. For COSMO-RS, the open-source implementation of COSMO-RS by Gerlach et al. was used and denoted as openCOSMO-RS. The ASE software was used to interface with ORCA and allow the integration of the QM-GNNIS model.
Structure Minimizations
An overview of the proposed workflow for the different implicit-solvent models is provided in Figure . For all compounds, the starting structure (details given in subsections Molecular Balances and NMR Measurements below) was optimized with SMD using ORCA with the OPT keyword. The free energies of the optimized conformer were calculated with SMD using the Freq keyword. The free energies for openCOSMO-RS were calculated by adding the free energy of the given conformer in vacuum (obtained with the Freq keyword) to the solvation free energy calculated with openCOSMO-RS.
8.
Schematic workflow of the different geometry minimizations and free-energy estimations performed in this study. The traditionally applied workflow using SMD and openCOSMO-RS is highlighted in purple. The approach with QM-GNNIS is shown in orange.
For GNNIS, the molecule-dependent parameters (i.e., atomic partial charges and van der Waals radii) of the GNN were set to the same values as for the classical GB-Neck2 model as described in ref . The preoptimized structure was further optimized using the BFGS algorithm with a tolerance of 0.025 eV angstrom–1 as implemented in the ASE package by combining the forces obtained with GNNIS and a QM calculation with CPCM. The dielectric constant used in the CPCM method was set to the same experimental value as in the GNNIS model (see ref for exact values). The free energy of the further optimized structure G(r⃗) was evaluated according to,
| 2 |
where is the single-point QM energy with CPCM, is the free energy calculated with GNNIS, is the free energy calculated with GB-Neck2, r⃗ are the atomic coordinates, T is the temperature, and is the vibrational entropy estimated using Grimme’s quasi-RRHO approach based on the combined Hessian of .
Molecular Balances
The structures of the studied molecular balances are shown in Figure and the experimentally determined free-energy differences between the indicated ketone rotational states using 19F{1H} NMR measurements were taken from ref . The starting structures for the molecular balances were taken from ref . There, a representative ensemble based on 5120 KDG conformers was generated using the minimization approach with the classical GNNIS model for each balance in each of the nine solvents. These structures were then optimized using the BP86 functional with the SVP basis set and a D4 correction. Next, their free energies were assessed in the nine different solvents for all implicit-solvent models according to the above description. Note that for QM-GNNIS, some conformers showed imaginary frequencies after the ASE minimization. As these do not represent true minima, these conformers were excluded from further analysis. The free energies of the minimized conformers were then used to calculate the populations of the two rotamers based on the ketone rotation shown in Figure . In order to focus the analysis on the differences in these free-energy differences, ΔΔG, with respect to different solvents, the means of the experimental and predicted free energies were subtracted from the experimental and predicted free energies, respectively, for each balance. Note that data points, which only represent upper boundaries (i.e., populations lower than the detection limit of the NMR procedure), were not included in the comparison. SciPy (version 1.10.1) was used to calculate the Pearson correlation coefficient (PCC) and the linear least-squares regression between the experimental and predicted ΔΔG. The comparison based on the regression slopes was performed on the log scale. Anticorrelated examples with negative slopes were assigned log values of minus infinity. This was the case for three balances (D2, D5, and C6) when using the openCOSMO-RS model. The significance of comparisons was assessed using the Wilcoxon signed-rank test. p-values below 0.05 were considered significant.
NMR Measurements
Experimentally obtained J tot couplings of 2-methoxy-ethanol at 25 °C and 1,2-dimethoxyethane at 40 °C were taken from refs and , respectively. For each compound, 255 starting structures were obtained with the KDG conformer generator and optimized for all solvent and implicit-solvent model combinations using the B3LYP functional − with the TZVP basis set and a D4 correction. The minimized structures were subsequently sorted based on their ascending free energies, and the ensemble was pruned using MDTraj (version 1.9.7) based on the all-atom RMSD with a threshold of 0.05 nm. The resulting conformers in the ensemble were assigned to the gauche-conformer if the central dihedral angle (see Figure ) was between −2.1 and 2.1 rad, and to the trans-conformer otherwise. The population of the trans-conformer was then calculated according to the assigned free energies.
Benchmarking of Functionals and Basis Sets
For the benchmarking of the functional and basis sets, the same procedure as described above was applied with five different functionals (BP86, , TPSS B3LYP, − M06-2X, and ω-B97X and four basis sets (def2-SVP, 6-311G, def2-TZVP, and ma-def2-TZVP , ) for chloroform, methanol, and DMSO.
IR Measurements
Experimental measurements of the OH IR frequencies of 2-methoxy-ethanol were taken from ref . The same optimization process as used for the NMR comparison was applied. The vibrational frequencies in the QM calculations with SMD and QM-GNNIS were calculated based on the Hessians obtained during the free-energy calculations. The frequencies were broadened using Gaussians with a σ of 5 Hz, and the total spectrum was obtained as the population-weighted sum over all conformers. As openCOSMO-RS does not allow the calculation of Hessians, the frequencies of the SMD model were used with the population weights obtained using openCOSMO-RS.
Supplementary Material
Acknowledgments
The authors thank ETH Zürich for financial support (ETH research grant no. ETH-50 21-1) and Jakob Teetz for reviewing the provided data and code.
The code used in this study is freely available on GitHub (https://github.com/rinikerlab/QM-GNNIS). The data set used to train the GNNIS model from ref is freely available in the ETH Research Collection (DOI: 10.3929/ethz-b-000710355).
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jctc.5c00728.
Comparison of SMD versus CPCM and additional figures (PDF)
#.
P.K. and F.P. contributed equally to this work.
The authors declare no competing financial interest.
References
- Braun J., Katzberger P., Landrum G. A., Riniker S.. Understanding and Quantifying Molecular Flexibility: Torsion Angular Bin Strings. J. Chem. Inf. Model. 2024;64:7917–7924. doi: 10.1021/acs.jcim.4c01513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pultar F., Hansen M. E., Wolfrum S., Böselt L., Fróis-Martins R., Bloch S., Kravina A. G., Pehlivanoglu D., Schäffer C., LeibundGut-Landmann S., Riniker S., Carreira E. M.. Mutanobactin D from the Human Microbiome: Total Synthesis, Configurational Assignment, and Biological Evaluation. J. Am. Chem. Soc. 2021;143:10389–10402. doi: 10.1021/jacs.1c04825. [DOI] [PubMed] [Google Scholar]
- Metternich J. B., Katzberger P., Kamenik A. S., Tiwari P., Wu R., Riniker S., Zenobi R.. Influence of the Fluorophore Mobility on Distance Measurements by Gas-Phase FRET. J. Phys. Chem. A. 2023;127:5620–5628. doi: 10.1021/acs.jpca.3c01509. [DOI] [PubMed] [Google Scholar]
- Cotter E., Pultar F., Riniker S., Altmann K.. Experimental and Theoretical Studies on the Reactions of Aliphatic Imines with Isocyanates. Chem. Eur. J. 2024;30:e202304272. doi: 10.1002/chem.202304272. [DOI] [PubMed] [Google Scholar]
- Lovering F., Bikker J., Humblet C.. Escape from Flatland: Increasing Saturation as an Approach to Improving Clinical Success. J. Med. Chem. 2009;52:6752–6756. doi: 10.1021/jm901241e. [DOI] [PubMed] [Google Scholar]
- Nicholls A., McGaughey G. B., Sheridan R. P., Good A. C., Warren G., Mathieu M., Muchmore S. W., Brown S. P., Grant J. A., Haigh J. A., Nevins N., Jain A. N., Kelley B.. Molecular Shape and Medicinal Chemistry: A Perspective. J. Med. Chem. 2010;53:3862–3886. doi: 10.1021/jm900818s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carreira, E. M. ; Kvaerno, L. . Classics in Stereoselective Synthesis; Wiley-VCH Verlag: Weinheim, Germany, 2007. [Google Scholar]
- Gelpi J., Hospital A., Goñi R., Orozco M.. Molecular Dynamics Simulations: Advances and Applications. Adv. Appl. Bioinform. Chem. 2015;8:37–47. doi: 10.2147/AABC.S70333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Gunsteren W. F., Berendsen H. J. C.. Computer Simulation of Molecular Dynamics: Methodology, Applications, and Perspectives in Chemistry. Angew. Chem., Int. Ed. 1990;29:992–1023. doi: 10.1002/anie.199009921. [DOI] [Google Scholar]
- van Gunsteren W. F., Oostenbrink C.. Methods for Classical-Mechanical Molecular Simulation in Chemistry: Achievements, Limitations, Perspectives. J. Chem. Inf. Model. 2024;64:6281–6304. doi: 10.1021/acs.jcim.4c00823. [DOI] [PubMed] [Google Scholar]
- Ebejer J.-P., Morris G. M., Deane C. M.. Freely Available Conformer Generation Methods: How Good Are They? J. Chem. Inf. Model. 2012;52:1146–1158. doi: 10.1021/ci2004658. [DOI] [PubMed] [Google Scholar]
- Riniker S., Landrum G. A.. Better Informed Distance Geometry: Using What We Know To Improve Conformation Generation. J. Chem. Inf. Model. 2015;55:2562–2574. doi: 10.1021/acs.jcim.5b00654. [DOI] [PubMed] [Google Scholar]
- Hawkins P. C. D.. Conformation Generation: The State of the Art. J. Chem. Inf. Model. 2017;57:1747–1756. doi: 10.1021/acs.jcim.7b00221. [DOI] [PubMed] [Google Scholar]
- McNutt A. T., Bisiriyu F., Song S., Vyas A., Hutchison G. R., Koes D. R.. Conformer Generation for Structure-Based Drug Design: How Many and How Good? J. Chem. Inf. Model. 2023;63:6598–6607. doi: 10.1021/acs.jcim.3c01245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anandakrishnan R., Drozdetski A., Walker R. C., Onufriev A. V.. Speed of Conformational Change: Comparing Explicit and Implicit Solvent Molecular Dynamics Simulations. Biophys. J. 2015;108:1153–1164. doi: 10.1016/j.bpj.2014.12.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Warshel A., Levitt M.. Theoretical Studies of Enzymic Teactions: Dielectric, Electrostatic and Steric Stabilization of the Carbonium Ion in the Reaction of Lysozyme. J. Mol. Biol. 1976;103:227–249. doi: 10.1016/0022-2836(76)90311-9. [DOI] [PubMed] [Google Scholar]
- Senn H. M., Thiel W.. QM/MM Methods for Biomolecular Systems. Angew. Chem., Int. Ed. 2009;48:1198–1229. doi: 10.1002/anie.200802019. [DOI] [PubMed] [Google Scholar]
- Brunk E., Rothlisberger U.. Mixed Quantum Mechanical/Molecular Mechanical Molecular Dynamics Simulations of Biological Systems in Ground and Electronically Excited States. Chem. Rev. 2015;115:6217–6263. doi: 10.1021/cr500628b. [DOI] [PubMed] [Google Scholar]
- Böselt L., Thürlemann M., Riniker S.. Machine Learning in QM/MM Molecular Dynamics Simulations of Condensed-Phase Systems. J. Chem. Theory Comput. 2021;17:2641–2658. doi: 10.1021/acs.jctc.0c01112. [DOI] [PubMed] [Google Scholar]
- Gastegger M., Schütt K. T., Müller K.-R.. Machine Learning of Solvent Effects on Molecular Spectra and Reactions. Chem. Sci. 2021;12:11473–11483. doi: 10.1039/D1SC02742E. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hofstetter A., Böselt L., Riniker S.. Graph-Convolutional Neural Networks for (QM)ML/MM Molecular Dynamics Simulations. Phys. Chem. Chem. Phys. 2022;24:22497–22512. doi: 10.1039/D2CP02931F. [DOI] [PubMed] [Google Scholar]
- Galvelis R., Varela-Rial A., Doerr S., Fino R., Eastman P., Markland T. E., Chodera J. D., Fabritiis G. D.. NNP/MM: Accelerating Molecular Dynamics Simulations with Machine Learning Potentials and Molecular Mechanics. J. Chem. Inf. Model. 2023;63:5701–5708. doi: 10.1021/acs.jcim.3c00773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pultar F., Thürlemann M., Gordiy I., Doloszeski E., Riniker S.. Neural Network Potential with Multiresolution Approach Enables Accurate Prediction of Reaction Free Energies in Solution. J. Am. Chem. Soc. 2025;147:6835–6856. doi: 10.1021/jacs.4c17015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roux B., Simonson T.. Implicit Solvent Models. Biophys. Chem. 1999;78:1–20. doi: 10.1016/S0301-4622(98)00226-9. [DOI] [PubMed] [Google Scholar]
- Tomasi J., Mennucci B., Cammi R.. Quantum Mechanical Continuum Solvation Models. Chem. Rev. 2005;105:2999–3094. doi: 10.1021/cr9904009. [DOI] [PubMed] [Google Scholar]
- Herbert J. M.. Dielectric Continuum Methods for Quantum Chemistry. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2021;11:e1519. doi: 10.1002/wcms.1519. [DOI] [Google Scholar]
- Klamt A., Schüürmann G.. COSMO: A New Approach to Dielectric Screening in Solvents with Explicit Expressions for the Screening Energy and its Gradient. J. Chem. Soc., Perkin Trans. 1. 1993;2:799–805. doi: 10.1039/P29930000799. [DOI] [Google Scholar]
- Cossi M., Rega N., Scalmani G., Barone V.. Energies Structures, and Electronic Properties of Molecules in Solution with the C-PCM Solvation Model. J. Comput. Chem. 2003;24:669–681. doi: 10.1002/jcc.10189. [DOI] [PubMed] [Google Scholar]
- Marenich A. V., Cramer C. J., Truhlar D. G.. Universal Solvation Model Based on Solute Electron Density and on a Continuum Model of the Solvent Defined by the Bulk Dielectric Constant and Atomic Surface Tensions. J. Phys. Chem. B. 2009;113:6378–6396. doi: 10.1021/jp810292n. [DOI] [PubMed] [Google Scholar]
- Klamt A.. Conductor-like Screening Model for Real Solvents: A New Approach to the Quantitative Calculation of Solvation Phenomena. J. Phys. Chem. 1995;99:2224–2235. doi: 10.1021/j100007a062. [DOI] [Google Scholar]
- Klamt A., Jonas V., Bürger T., Lohrenz J. C. W.. Refinement and Parametrization of COSMO-RS. J. Phys. Chem. A. 1998;102:5074–5085. doi: 10.1021/jp980017s. [DOI] [Google Scholar]
- Hawkins G. D., Cramer C. J., Truhlar D. G.. Pairwise Solute Descreening of Solute Charges from a Dielectric Medium. Chem. Phys. Lett. 1995;246:122–129. doi: 10.1016/0009-2614(95)01082-K. [DOI] [Google Scholar]
- Onufriev A., Bashford D., Case D. A.. Exploring Protein Native States and Large-Scale Conformational Changes with a Modified Generalized Born Model. Proteins: Struct., Funct., Bioinf. 2004;55:383–394. doi: 10.1002/prot.20033. [DOI] [PubMed] [Google Scholar]
- Mongan J., Simmerling C., McCammon J. A., Case D. A., Onufriev A.. Generalized Born Model with a Simple, Robust Molecular Volume Correction. J. Chem. Theory Comput. 2007;3:156–169. doi: 10.1021/ct600085e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen H., Roe D. R., Simmerling C.. Improved Generalized Born Solvent Model Parameters for Protein Simulations. J. Chem. Theory Comput. 2013;9:2020–2034. doi: 10.1021/ct3010485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marenich A. V., Olson R. M., Kelly C. P., Cramer C. J., Truhlar D. G.. Self-Consistent Reaction Field Model for Aqueous and Nonaqueous Solutions Based on Accurate Polarized Partial Charges. J. Chem. Theory Comput. 2007;3:2011–2033. doi: 10.1021/ct7001418. [DOI] [PubMed] [Google Scholar]
- Marenich A. V., Cramer C. J., Truhlar D. G.. Generalized Born Solvation Model SM12. J. Chem. Theory Comput. 2013;9:609–620. doi: 10.1021/ct300900e. [DOI] [PubMed] [Google Scholar]
- Katzberger P., Riniker S.. Implicit Solvent Approach Based on Generalized Born and Transferable Graph Neural Networks for Molecular Dynamics Simulations. J. Chem. Phys. 2023;158(20):204101. doi: 10.1063/5.0147027. [DOI] [PubMed] [Google Scholar]
- Katzberger P., Riniker S.. A General Graph Neural Network Based Implicit Solvation Model for Organic Molecules in Water. Chem. Sci. 2024;15:10794–10802. doi: 10.1039/D4SC02432J. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katzberger P., Hauswirth L. M., Kuhn A. S., Landrum G. A., Riniker S.. Rapid Access to Small Molecule Conformational Ensembles in Organic Solvents Enabled by Graph Neural Network-Based Implicit Solvent Model. J. Am. Chem. Soc. 2025;147:13264–13275. doi: 10.1021/jacs.4c17622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lange A. W., Herbert J. M.. Improving Generalized Born Models by Exploiting Connections to Polarizable Continuum Models. I. An Improved Effective Coulomb Operator. J. Chem. Theory Comput. 2012;8:1999–2011. doi: 10.1021/ct300111m. [DOI] [PubMed] [Google Scholar]
- Mohamed Mahmoud S. S., Esposito G., Serra G., Fogolari F.. Generalized Born Radii Computation using Linear Models and Neural Networks. Bioinformatics. 2020;36:1757–1764. doi: 10.1093/bioinformatics/btz818. [DOI] [PubMed] [Google Scholar]
- Horvath D., Marcou G., Varnek A.. “Big Data” Fast Chemoinformatics Model to Predict Generalized Born Radius and Solvent Accessibility as a Function of Geometry. J. Chem. Inf. Model. 2020;60:2951–2965. doi: 10.1021/acs.jcim.9b01172. [DOI] [PubMed] [Google Scholar]
- Chen Y., Krämer A., Charron N. E., Husic B. E., Clementi C., Noé F.. Machine Learning Implicit Solvation for Molecular Dynamics. J. Chem. Phys. 2021;155:084101. doi: 10.1063/5.0059915. [DOI] [PubMed] [Google Scholar]
- Airas J., Ding X., Zhang B.. Transferable Implicit Solvation via Contrastive Learning of Graph Neural Networks. ACS Cent. Sci. 2023;9:2286–2297. doi: 10.1021/acscentsci.3c01160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yao S., Van R., Pan X., Park J. H., Mao Y., Pu J., Mei Y., Shao Y.. Machine Learning Based Implicit Solvent Model for Aqueous-Solution Alanine Dipeptide Molecular Dynamics Simulations. RSC Adv. 2023;13:4565–4577. doi: 10.1039/D2RA08180F. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coste A., Slejko E., Zavadlav J., Praprotnik M.. Developing an Implicit Solvation Machine Learning Model for Molecular Simulations of Ionic Media. J. Chem. Theory Comput. 2024;20:411–420. doi: 10.1021/acs.jctc.3c00984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerlach T., Müller S., de Castilla A. G., Smirnova I.. An Open Source COSMO-RS Implementation and Parameterization Supporting the Efficient Implementation of Multiple Segment Descriptors. Fluid Phase Equilib. 2022;560:113472. doi: 10.1016/j.fluid.2022.113472. [DOI] [Google Scholar]
- Meredith N. Y., Borsley S., Smolyar I. V., Nichol G. S., Baker C. M., Ling K. B., Cockroft S. L.. Dissecting Solvent Effects on Hydrogen Bonding. Angew. Chem., Int. Ed. 2022;61:e202206604. doi: 10.1002/anie.202206604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perdew J. P.. Density-Functional Approximation for the Correlation Energy of the Inhomogeneous Electron Gas. Phys. Rev. B. 1986;33:8822–8824. doi: 10.1103/PhysRevB.33.8822. [DOI] [PubMed] [Google Scholar]
- Weigend F., Ahlrichs R.. Balanced Basis Sets of Split Valence, Triple Zeta Valence and Quadruple Zeta Valence Quality for H to Rn: Design and Assessment of Accuracy. Phys. Chem. Chem. Phys. 2005;7:3297–3305. doi: 10.1039/b508541a. [DOI] [PubMed] [Google Scholar]
- Caldeweyher E., Bannwarth C., Grimme S.. Extension of the D3 Dispersion Coefficient Model. J. Chem. Phys. 2017;147:034112. doi: 10.1063/1.4993215. [DOI] [PubMed] [Google Scholar]
- Boothroyd S., Behara P. K., Madin O. C., Hahn D. F., Jang H., Gapsys V., Wagner J. R., Horton J. T., Dotson D. L., Thompson M. W., Maat J., Gokey T., Wang L.-P., Cole D. J., Gilson M. K., Chodera J. D., Bayly C. I., Shirts M. R., Mobley D. L.. Development and Benchmarking of Open Force Field 2.0.0: The Sage Small Molecule Force Field. J. Chem. Theory Comput. 2023;19:3251–3275. doi: 10.1021/acs.jctc.3c00039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirsh J. M., Weaver J. B., Boxer S. G., Kozuch J.. Critical Evaluation of Polarizable and Nonpolarizable Force Fields for Proteins Using Experimentally Derived Nitrile Electric Fields. J. Am. Chem. Soc. 2024;146:6983–6991. doi: 10.1021/jacs.3c14775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tasaki K., Abe A.. NMR Studies and Conformational Energy Calculations of 1,2-Dimethoxyethane and Poly(oxyethylene) Polym. J. 1985;17:641–655. doi: 10.1295/polymj.17.641. [DOI] [Google Scholar]
- Vosko S. H., Wilk L., Nusair M.. Accurate Spin-Dependent Electron Liquid Correlation Energies for Local Spin Density Calculations: A Critical Analysis. Can. J. Phys. 1980;58:1200–1211. doi: 10.1139/p80-159. [DOI] [Google Scholar]
- Lee C., Yang W., Parr R. G.. Development of the Colle-Salvetti Correlation-Energy Formula into a Functional of the Electron Density. Phys. Rev. B. 1988;37:785–789. doi: 10.1103/PhysRevB.37.785. [DOI] [PubMed] [Google Scholar]
- Becke A. D.. Density-Functional Thermochemistry III. The Role of Exact Exchange. J. Chem. Phys. 1993;98:5648–5652. doi: 10.1063/1.464913. [DOI] [Google Scholar]
- Stephens P. J., Devlin F. J., Chabalowski C. F., Frisch M. J.. Ab Initio Calculation of Vibrational Absorption and Circular Dichroism Spectra Using Density Functional Force Fields. J. Phys. Chem. 1994;98:11623–11627. doi: 10.1021/j100096a001. [DOI] [Google Scholar]
- Becke A. D.. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A. 1988;38:3098–3100. doi: 10.1103/PhysRevA.38.3098. [DOI] [PubMed] [Google Scholar]
- Tao J., Perdew J. P., Staroverov V. N., Scuseria G. E.. Climbing the Density Functional Ladder: Nonempirical Meta–Generalized Gradient Approximation Designed for Molecules and Solids. Phys. Rev. Lett. 2003;91:146401. doi: 10.1103/PhysRevLett.91.146401. [DOI] [PubMed] [Google Scholar]
- Zhao Y., Truhlar D. G.. The M06 Suite of Density Functionals for Main Group Thermochemistry, Thermochemical Kinetics, Noncovalent Interactions, Excited States, and Transition Elements: Two New Functionals and Systematic Testing of Four M06-Class Functionals and 12 other Functionals. Theor. Chem. Acc. 2008;120:215–241. doi: 10.1007/s00214-007-0310-x. [DOI] [Google Scholar]
- Chai J.-D., Head-Gordon M.. Systematic optimization of long-range corrected hybrid density functionals. J. Chem. Phys. 2008;128:084106. doi: 10.1063/1.2834918. [DOI] [PubMed] [Google Scholar]
- Krishnan R., Binkley J. S., Seeger R., Pople J. A.. Self-consistent molecular orbital methods. XX. A basis set for correlated wave functions. J. Chem. Phys. 1980;72:650–654. doi: 10.1063/1.438955. [DOI] [Google Scholar]
- Zheng J., Xu X., Truhlar D. G.. Minimally augmented Karlsruhe basis sets. Theor. Chem. Acc. 2011;128:295–305. doi: 10.1007/s00214-010-0846-z. [DOI] [Google Scholar]
- Suryanarayana I., Someswar G. P., Subrahmanyam B.. IR Spectroscopic Studies of Solvent Interactions with Hydrogen Bonds of Methoxy Ethanol. Z. Phys. Chem. 1990;271O:621–628. doi: 10.1515/zpch-1990-27179. [DOI] [Google Scholar]
- Kesharwani M. K., Brauer B., Martin J. M. L.. Frequency and Zero-Point Vibrational Energy Scale Factors for Double-Hybrid Density Functionals (and Other Selected Methods): Can Anharmonic Force Fields Be Avoided? J. Phys. Chem. A. 2015;119:1701–1714. doi: 10.1021/jp508422u. [DOI] [PubMed] [Google Scholar]
- Neese F., Wennmohs F., Becker U., Riplinger C.. The ORCA Quantum Chemistry Program Package. J. Chem. Phys. 2020;152:224108. doi: 10.1063/5.0004608. [DOI] [PubMed] [Google Scholar]
- Neese F.. Software update: The ORCA program systemVersion 5.0. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2022;12:e1606. doi: 10.1002/wcms.1606. [DOI] [Google Scholar]
- Larsen A. H., Mortensen J. J., Blomqvist J., Castelli I. E., Christensen R., Dulak M., Friis J., Groves M. N., Hammer B., Hargus C.. et al. The Atomic Simulation Environment-a Python Library for Working with Atoms. J. Phys.: Condens. Matter. 2017;29:273002. doi: 10.1088/1361-648X/aa680e. [DOI] [PubMed] [Google Scholar]
- Fletscher, R. Practical Methods of Optimization, 2 nd ed.; John Wiley & Sons, Inc., 2000. [Google Scholar]
- Barone V., Cossi M.. Quantum Calculation of Molecular Energies and Energy Gradients in Solution by a Conductor Solvent Model. J. Phys. Chem. A. 1998;102:1995–2001. doi: 10.1021/jp9716997. [DOI] [Google Scholar]
- Grimme S.. Supramolecular Binding Thermodynamics by Dispersion-Corrected Density Functional Theory. Chem. -Eur. J. 2012;18:9955–9964. doi: 10.1002/chem.201200497. [DOI] [PubMed] [Google Scholar]
- Virtanen P., Gommers R., Oliphant T. E., Haberland M., Reddy T., Cournapeau D., Burovski E., Peterson P., Weckesser W., Bright J.. et al. SciPy 1.0 Contributors SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McGibbon R. T., Beauchamp K. A., Harrigan M. P., Klein C., Swails J. M., Hernández C. X., Schwantes C. R., Wang L.-P., Lane T. J., Pande V. S.. MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys. J. 2015;109:1528–1532. doi: 10.1016/j.bpj.2015.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The code used in this study is freely available on GitHub (https://github.com/rinikerlab/QM-GNNIS). The data set used to train the GNNIS model from ref is freely available in the ETH Research Collection (DOI: 10.3929/ethz-b-000710355).





