Skip to main content
The Journal of Chemical Physics logoLink to The Journal of Chemical Physics
. 2016 Apr 28;144(16):164115. doi: 10.1063/1.4946779

VR-SCOSMO: A smooth conductor-like screening model with charge-dependent radii for modeling chemical reactions

Erich R Kuechler 1,2,1,2, Timothy J Giese 1, Darrin M York 1
PMCID: PMC4851621  PMID: 27131539

Abstract

To better represent the solvation effects observed along reaction pathways, and of ionic species in general, a charge-dependent variable-radii smooth conductor-like screening model (VR-SCOSMO) is developed. This model is implemented and parameterized with a third order density-functional tight binding quantum model, DFTB3/3OB-OPhyd, a quantum method which was developed for organic and biological compounds, utilizing a specific parameterization for phosphate hydrolysis reactions. Unlike most other applications with the DFTB3/3OB model, an auxiliary set of atomic multipoles is constructed from the underlying DFTB3 density matrix which is used to interact the solute with the solvent response surface. The resulting method is variational, produces smooth energies, and has analytic gradients. As a baseline, a conventional SCOSMO model with fixed radii is also parameterized. The SCOSMO and VR-SCOSMO models shown have comparable accuracy in reproducing neutral-molecule absolute solvation free energies; however, the VR-SCOSMO model is shown to reduce the mean unsigned errors (MUEs) of ionic compounds by half (about 2-3 kcal/mol). The VR-SCOSMO model presents similar accuracy as a charge-dependent Poisson-Boltzmann model introduced by Hou et al. [J. Chem. Theory Comput. 6, 2303 (2010)]. VR-SCOSMO is then used to examine the hydrolysis of trimethylphosphate and seven other phosphoryl transesterification reactions with different leaving groups. Two-dimensional energy landscapes are constructed for these reactions and calculated barriers are compared to those obtained from ab initio polarizable continuum calculations and experiment. Results of the VR-SCOSMO model are in good agreement in both cases, capturing the rate-limiting reaction barrier and the nature of the transition state.

I. INTRODUCTION

Computational modeling of biological reactions is of great interest to those seeking molecular-level insight into reaction mechanisms. Such studies can help interpret experimental results which are pursuing a deeper understanding of the fundamental principles governing chemical activity and encompass a wide range of archetypes including, but not limited to, reactions occurring within protein environments,1–3 model phosphoryl transfer reactions,4–8 and studies of RNA catalysis.9–11 The end goal of these studies is frequently to aid in the design of new drugs and therapeutic treatments.12–14 Multiscale modeling approaches including long timescale molecular dynamics simulations to explore fluctuations and conformational changes of biomolecules, combined quantum mechanical/molecular mechanical (QM/MM) simulations to examine deeply embedded reactive chemical events, and implicit solvent calculations of small model reactions are used for this task.

Biological reactions almost exclusively take place in solution, which has drastic ramifications on the pathways and barriers of the reaction mechanism.4,6,15–22 These phenomena are especially apparent when studying highly charged systems such as RNA11 as shown in Figure 1. To model these effects, simulations can be performed that include a vast buffer of explicit solvent molecules; however, such approaches are often extremely computationally demanding. Frequently, this cost leads the simulation into becoming impractical, especially when using expensive quantum mechanical models. Implicit solvation techniques forego the need to perform extensive sampling of explicit solvent degrees of freedom by replacing the solvent response of an effective, empirical model.23–26 In addition to enabling stand-alone applications to chemical reactivity, implicit solvation models can also be used to guide sampling in more expensive QM/MM free energy simulations.27–30 Implicit solvent models have come in a variety of forms, including Poisson-Boltzmann (PB),31 the Minnesota solvation (SMx),32–34 the conductor-like screening (COSMO),35–40 generalized Born (GB),41,42 and polarizable continuum (PCM) models,43–45 among many others.

FIG. 1.

FIG. 1.

Comparison between gas phase and solution reaction energy profiles for two model RNA-like reactions. Special note should be made of the difference in color scale between the gas and solution phase surfaces. All distances and energies are reported in Å and kcal/mol respectively.

One particularly useful class of solvation models is based on boundary element methods. The original boundary element solvation models suffered from discontinuities in the solvation energy as atoms became buried or solvent exposed as a function of conformation (thus exposing boundary elements discontinuously), making them difficult, if not impossible to use effectively in geometry optimizations or explorations of reaction pathways. York and Karplus introduced the first model to fully address this issue46 within the COSMO35 framework. Other similar smooth boundary element models have followed this formalism and have proven useful to the larger quantum chemistry community.45,47

However, another limitation to traditional boundary element solvation methods exists which involves the use of fixed radii to define the solute cavity. In some instances, to obtain more accurate solvation free energies, different radii are assigned to the same element depending on its chemical environment. This procedure becomes ill-defined when studying a chemical reaction where the chemical environment can change along the reaction coordinate. A mechanism whereby the radii themselves can change smoothly in this process is needed. Furthermore, while this fixed-radii strategy often works reasonably well for neutral solutes near local minima, it often failed when considering charged solutes, or chemical reactions where localized charge varied along the reaction coordinate.

The present work introduces the variable-radii smooth conductor-like screening model (VR-SCOSMO) whereby the solvation radii themselves vary smoothly as a function of the underlying electronic structure, as represented by atomic charge. Influenced by the pioneering work of Hou et al.20 and others,48–50 the present work applies a similar strategy of introducing solvation radii that have a charge dependence into the SCOSMO model.46 In this work, the DFTB3/3OB semiempirical method is used,51–53 and parameterization of the SCOSMO and VR-SCOSMO models is created to reproduce absolute solvation energies. The parameterized SCOSMO and VR-SCOSMO models are compared to each other and to the SCC-DFTBPR model.20 The VR-SCOSMO model is then used to construct eight two-dimensional relaxed potential energy surfaces of biologically motivated reactions, which are then compared to available experimental results. One of these surfaces is for the trimethylphosphate hydrolysis reaction. The other surfaces are transesterification phosphoryl transfer reactions involving seven different leaving groups.

II. COMPUTATIONAL METHODS

This manuscript uses the semiempirical DFTB3/3OB Hamiltonian52 supplemented with a specific reaction parameterization, OPhyd,53 which modulates the O and P parameters for improving performance on phosphate hydrolysis reactions. The DFTB3/3OB-OPhyd parameters nor the functional form for the solute-solute self-interaction energy has been altered in this manuscript. Therefore, for further discussion of technical details of the DFTB3 model, the reader is referred to Refs. 51–53. It can be stated that standard gas-phase DFTB3 calculations depend on the nuclear positions and the atomic orbital density matrix

EgasEDFTB3(R,P), (1)

where R is a vector of atomic positions,

P=Pα+Pβ, (2)
Pμνσ=iniσCμiσCνiσ, (3)

where nσ and Cσ are spin-resolved orbital occupation numbers and molecular orbital coefficients, respectively, in the atomic orbital basis of μ and ν. For all results presented, DFTB3/3OB and VR-SCOSMO calculations were performed using an in-house program which will be integrated into the Sander module AMBER and into the CHARMM molecular simulation software suites and made available in future full releases.

A. Solute charge density interaction correction

The DFTB3/3OB energy happens to use Mulliken charges to interact the solute atoms with the other solute atoms;52 however, previous work has found that the interaction of DFTB3 molecules with an “external environment” can be improved by choosing a second charge representation.54 This approach has been adopted in the present work. In essence, a second set of atomic charges and higher-order atomic multipoles are chosen to interact with the implicit solvent response, while leaving the DFTB3 solute-solute interactions unchanged from their original form. The solute charge density can be approximated by a sum of atomic point multipoles

q(r)=aZaδ(rRa)μνPμνχμ(r)χν(r)a,lmaqa,lmClm(a)(2l1)!!δ(rRa), (4)

where Za is the nuclear core charge of atom a, qa,lm is a multipole moment, Ra is an atomic position, ∇a is the Cartesian gradient operator acting on the coordinates of a, and Clm(∇) is a spherical tensor gradient operator which is constructed by replacing the Cartesian coordinate arguments of the real-valued regular solid harmonic with their corresponding derivative operators. In this notation,

χμ(r)χμ(r)Ylμmμ(Ω), (5)

where Ylm(Ω) is a real-valued spherical harmonic. Following Refs. 54 and 55, the atomic charges are a biased Mulliken partitioning of the density matrix, and the higher-order multipole moments are constructed from the one-center blocks of the density matrix,

qa,lm={Zabaa2bafab(bab)babifl=0μνaPμνMμν(l)4π2l+1ifl>0×Ylm(Ω)Ylμmμ(Ω)Ylνmν(Ω)dΩ, (6)

where bab is a Mulliken bond order

bab=2μaνbPμνSμν, (7)

where S is the atomic orbital overlap matrix

Sμν=χμ(r)χν(r)d3r (8)

and fab is a “charge bias.” This bias is constructed such that at fab = 1/2, a Mulliken charge decomposition is produced, but it is generalized to switch from fabs to fabd as the Mulliken bond order changes from single bond character, babs, to double bond character, babd,

fab(bab)=fabs+Son(bab,babs,babd)(fabdfabs), (9)

where Son(x, xlo, xhi) is a smooth switching function

Son(x,xlo,xhi)={0ifx<xlo1ifx>xhi110u315u4+6u5otherwiseu(xhix)/(xhixlo) (10)

and Mμν(l) are the one-center radial integrals

Mμν(l)=0χμ(r)χν(r)r2+ldr (11)

which are treated as parameters to empirically improve molecular electrostatic potentials. The parameters of this model, fabs, fabd, babs, babd, and Mμν(l), are listed in the supplementary material of Ref. 55 and are not modified in this work. The effects of the inclusion of this correction can be seen in Figure 2 for butanoic acid, showing how the use of multipoles can improve the description of sp3 oxygen lone pairs. Previous work has noted improvements to the description of sp2 carbon bonds, and sp3 sulfur and sp2 nitrogen lone pairs.

FIG. 2.

FIG. 2.

The electrostatic potential difference between B3LYP/6-311++G** and DFTB3/3OB (left) and DFTB3/3OB using the auxiliary atomic multipoles (right) evaluated on the solvent accessible surface of butanoic acid. Blue and red indicate that the semiempirical model requires more negative charge and more positive charge relative to B3LYP reference, respectively. Colors are bounded with a range ±0.003 a.u.

B. The SCOSMO implicit solvation model

The interface between the SCOSMO model56,57 and the DFTB3 semiempirical method closely follows the formulation provided by Khandogin et al.58 Briefly, the SCOSMO model treats the solvent environment as if it were a block of conducting metal whose dielectric constant is chosen to mimic the solvent on a macroscopic scale. Then, a cavity with a dielectric constant of unity, ϵ0 = 1, is carved out of the metal and the solute is placed within it. Given the discontinuous nature between the cavity and the bulk solvent dielectric constants at the cavity’s boundary, the electrostatic response of the metal occurs solely at the cavity surface. The cavity is constructed from a union of solute spheres whose radii, Rrad,a, are parameters of the model. The cavity surface is discretized by Gaussian functions,

gt(rRt;ζt)=(ζt/π)3/2eζt|rRt|2 (12)

such that the solvent’s charge density response at the surface is

σ(r)tstgt(rRt;ζt), (13)

where s is the vector of solvent response coefficients, st, that remain to be determined. In our notation, t indexes a discretized point on the surface and

Rt=Ra+Rrad,aRˆq,t (14)

is the location of that point. In other words, there are Nq,a discretized points, each point t is tethered to one atom ta separated by Rrad,a, in the direction Rˆq,t, where Rˆq,t is a Lebedev quadrature point that is associated with corresponding quadrature weight wq,t.59,60

The solvated energy of a compound can be modeled as an adiabatic energy of that compound in the presence of SCOSMO implicit

EaqEDFTB3(R,P)+Es(s,q), (15)

where

Es(s,q)=12f(ϵ)sTAs+a,taγwq,tptRrad,a2+sTBq (16)

and

f(ϵ)=ϵ1ϵ (17)

is the scaled deviation of the dielectric medium of the implicit solvent from an ideal conductor. Throughout this work, the dielectric ϵ = 78.4 (unitless) is used for water. The scaled self-interaction of the polarized surface charges of residing upon the cavitation barrier, A, takes the form

Att=gt(rRt;ζt)gt(rRt;ζt)|rr|d3rd3r×{pt1,ift=t1,otherwise. (18)

The switching function pt smoothly transitions between 0 and 1, preventing surface elements from interacting with other surface elements which have been partially or fully removed by the intersection of the cavitation sphere from another particle,

pta=baSon(Rbt2,Rin,b2,Rout,b2). (19)

When pt = 0 dimensionality of the arrays can be decreased safely without causing discontinuous changes in the energy or gradients. In this case, Rin,a and Rout,a are the limits of the switching region, whose explicit definitions are

Rin,a=(1oh)Rrad,a (20)

and

Rout,a=(1+(1o)h)Rrad,a, (21)

where

o=12+1h1h2128 (22)

such that h is some constant, which was set to h = 0.15 (unitless) for this work. Continuing, γ is the “surface tension” of the cavity which accounts for the free energy cost associated with the formation of the cavity. This term is treated as a parameter for optimization in the current protocols. B is the interaction between the previously defined surface charges and the solute charge distribution, q,

Bta=Clama(a)(2la1)!!gt(rRt;ζt)δ(rRa)|rr|d3rd3r=Clama(Rat)(2la1)!!ddRat2laerf(ζtRat)Rat, (23)

where

ζta=1wq,tζ(Nq,a)Rrad,a2. (24)

Values of ζ(Nq,a) constants are tabulated in Ref. 61 and are completely determined upon defining the discretization level of each atom, Nq,a. Special note should be taken that in the present work the interaction of the electrostatic potential with the surface response elements has been modified. The standard monopole representation is replaced with an auxiliary set of atomic multipole moments, constructed from the underlying DFTB3 density matrix.

The response coefficients, st, are determined by minimizing

δEs(s,q)λtst+f(ϵ)Q=0, (25)

where

Q=aqa,00 (26)

is the solute charge, qa is the atomic charge of atom a, and the Lagrange multiplier, λ, is used to enforce the constraint

σ(r)d3r=f(ϵ)Q. (27)

Following this constraint, one finds

s=f(ϵ)λA1vA1Bq, (28)
λ=vTA1BqQvTA1v, (29)

where vt = 1 ∀t. The quantities treated as adjustable parameters in the present work are the surface tension term, γ, and the cavitation radii Rrad,a.

The SCOSMO model enters the solution for the DFTB3 molecular orbital coefficients variationally through the Fock matrix

Fμνσ=EaqPμνσR=EDFTB3PμνσR+a,lmaEsqa,lmR,sqa,lmPμνR. (30)

Upon reaching self-consistent convergence, the molecular orbitals satisfy the generalized eigenvalue equation

FσCσ=SCσEσ, (31)

where S is the atomic orbital overlap matrix

Sμν=χμ(r)χν(r)d3r (32)

and the Cartesian gradient Xa of atom a becomes

dEaqdXa=EDFTB3XaPμνQμνdSμνdXa+EsXas,q+bEsqb,lmR,sqb,lmXaP, (33)

where

Qμν=σ(α,β)iniσEiiσCμiσCνiσ (34)

and Eiiσ is a spin-resolved molecular orbital eigenvalue.

C. The VR-SCOSMO implicit solvation model

The VR-SCOSMO model modifies the SCOSMO framework by incorporating charge-dependent behavior into the definition of the solvation radii

Rrad,a(qa,00)=Rrad,a+α(qa,00qa(0))+12β(qa,00qa(0))2, (35)

where Rrad,a, α, β, and qa(0) are parameters. With this change, the surface response Gaussian positions Rt and exponents ζt [Eq. (24)] also become charge-dependent. Upon replacing Rrad,a with Rrad,a(qa,00), the multipole moment-derivatives appearing in Eqs. (30)-(33) require additional chain-rules; that is,

Esqa,lmR,s=Esqa,lmR,s,ζ,Rt+δl0δm0taEsζtR,s,RtdζtdRrad,adRrad,aqa,00+δl0δm0taEsXtR,s,ζdXtdRrad,adRrad,aqa,00. (36)

D. Parameterization of the SCOSMO and VR-SCOSMO models

Parameters for the SCOSMO and VR-SCOSMO models, both utilizing the multipole correction, were optimized for elements H, C, N, O, and P to reproduce the absolute solvation free energy of a subset of the SM6 database32 consisting of 169 molecules (139 neutral, 16 anionic, 14 cationic). A table listing these molecules can be found in the supplementary material, Table S1.81 The absolute solvation free energy is approximated by the difference in the adiabatic electronic energy of the molecule in the implicit solvent and gas phase environments upon geometry optimization in its respective state,

ΔGsolvEaq(Raq,Paq)Egas(Rgas,Pgas). (37)

The parameterizations of the SCOSMO and VR-SCOSMO models were treated independently. Parameters obtained for both models are not constrained to have the same values. Parameter optimization was performed by direction set chi-squared minimization and geometry optimization was performed using the DL-Find software package.62 Initial parameterization steps were performed with the optimized gas phase geometry, as simultaneous optimization of solvation model parameters and molecular geometry can be computationally prohibitive. However, after initial parameterization, geometries and parameters were allowed to relax interchangeably until convergence was reached. In the final step of optimization, both geometries and parameters were relaxed simultaneously in the neighborhood of the previously obtained values. The optimized parameters are listed in Table I. The training set mean signed error (MSE), mean unsigned error (MUE), and root mean squared error (RMSE) for the SCOSMO and VR-SCOSMO models are listed in Table II and model performance, as compared to experiment, is shown in Figure 3.

TABLE I.

Optimized COSMO and VRSCOSMO parameters. All values are in atomic units. The surface tension parameters γ for SCOSMO and VRSCOSMO are 5 × 10−6 a.u. and 7.5 × 10−6 a.u., respectively.

SCOSMO VR-SCOSMO
Element Rrad Rrad q(0) α β
C 4.35 3.75 −0.0339 −0.2493 0.0000
H 1.90 2.20 0.0954 −0.0502 −0.1794
O 2.85 3.19 −0.7056 0.0179 −0.4350
N 2.65 3.17 −0.2450 0.0128 −0.0000
P 5.30 5.30 −0.2000 0.0000 0.0000

TABLE II.

Absolute solvation energy training set error statistics (kcal/mol).

VR-SCOSMO SCOSMO
Type MSE MUE RMSE MSE MUE RMSE
Neutral −0.0 1.4 1.9 0.4 1.2 1.7
Anion 0.8 2.5 3.0 3.9 5.6 6.3
Cation 1.4 2.8 4.3 0.7 4.6 5.7
Cationa −0.1 1.5 1.9 −1.0 3.5 3.9
Total 0.2 1.6 2.4 0.8 2.0 3.0
a

12 molecules. Excludes the compounds where the sum of bond orders around any oxygen is ≥3. Not included in the “total” statistics.

FIG. 3.

FIG. 3.

Comparison between the SCOSMO and VR-SCOSMO absolute solvation free energy values with the reference values used in the training set.

After parameter optimization, the SCOSMO and VR-SCOSMO models were evaluated on a separate test set of molecules, consisting of a total of 65 molecules (34 neutral, 17 anionic, 14 cationic) taken from Ref. 20. A table of these molecules can be found in the supplementary material, Table S2.81 These molecules were used in the parameterization of the SCC-DFTBPR implicit solvation model developed by Hou et al., which utilized a charge-dependent non-linear PB model implemented with an earlier variant of the DFTB3 semiempirical method.63,64 Comparisons of SCOSMO, VR-SCOSMO, and this reference model’s performance are tabulated in Table III. Similarly, performance of these models as compared to experimental values can be found in Figure 4.

TABLE III.

Absolute solvation energy test set error statistics (kcal/mol).

VR-SCOSMO SCOSMO SCC-DFTBPRa
Type MSE MUE MSE MUE MSE MUE
Neutral 0.4 0.9 0.6 1.0 0.2 1.6
Anion 0.2 3.2 5.6 6.6 0.7 3.1
Cation 3.5 4.6 3.2 6.1 1.1 3.4
Cationb 0.0 1.6 −0.2 3.9 −1.4 1.8
Total 1.1 2.3 2.4 3.6 0.5 2.4
a

Reference 20.

b

10 molecules. Excludes the compounds where the sum of bond orders around any oxygen is ≥3. This row is not included in the “total” statistics.

FIG. 4.

FIG. 4.

Comparison between SCOSMO and VR-SCOSMO modeled absolute solvation free energies with the reference values used in the test set. SCC-DFTBPR is taken from Ref. 20.

E. Phosphate hydrolysis and phosphoryl transfer reactions

Using the parameters obtained from absolute solvation free energy optimization, the VR-SCOSMO model was used to examine various biologically motivated reactions. A series of phosphoryl transfer reactions were performed which serve as informative models for reactive RNA systems. The cleavage of the phosphodiester bond in these hydrolysis/transfer reactions can be seen in many catalytic motifs: in the protein RNase A65,66 and in the hairpin,67,68 hammerhead,69–71 glmS,72,73 hepatitis delta virus74,75 ribozymes. An understanding of the fundamental chemical pathway in these reactions can glean further insight into these complex biochemical processes.

Two-dimensional reaction Potential Energy Surfaces (PESs) were generated, with reaction coordinates defined as the distance between the attacking and leaving group oxygen to phosphorus center, respectively. Distances were constrained to the value of each reaction coordinate and the system was allowed to relax using the DL-Find geometry optimization software suite.62 After which, energy landscapes were constructed using two-dimensional spline interpolation. Stationary point calculations were performed with no restraints to obtain reaction barriers and product state calculations were completed at infinite separation. No additional, higher-level corrections were made to the resulting energy surfaces.

Two different types of phosphoryl reactions were modeled: the phosphate hydrolysis reaction of trimethylphosphate (TMP) and a series of phosphoryl transesterification transfer reactions corresponding to the self-attack of the O2’ oxygen on the phosphate center in methyloxyribose (MOR) with seven different leaving groups [HO, CH3O, CH3CH2O, CH3CH2CH2O, (CH3)2CHO, CH3COO, and PhO]. The TMP PES is shown in Figure 5 and the MOR PESs are shown in Figure 6. Table IV compares the modeled free energy barriers of the MOR reactions to previously calculated and experimental values and Figure 7 displays geometric information for the MOR transition states. For these reactions, a steady-state approximation was made in which the O2’ oxygen of MOR is assumed to be deprotonated, and that the reaction is following a pseudo-first-order relate law with a transmission coefficient equal to one. According to the reaction scheme

MOROHMOROproducts (38)

it is assumed that, as these reactions were carried out under basic conditions, modeling the reaction starting from the deprotonated state is sufficient.

FIG. 5.

FIG. 5.

DFTB3+VR-SCOSMO modeled potential energy surface of the trimethylphosphate hydrolysis reaction. Energies and distances are in kcal/mol and Å, respectively.

FIG. 6.

FIG. 6.

DFTB3+VR-SCOSMO modeled potential energy surfaces for the phosphoryl transesterification reactions with a series of leaving groups. Energies and distances are in kcal/mol and Å, respectively. The leaving groups are (a) HO, (b) CH3O, (c) CH3CH2O, (d) CH3CH2CH2O, (e) (CH3)2CHO, (f) CH3COO, and (g) PhO. Note that the color scale used in (a) is different from the other plots.

TABLE IV.

Reaction barriers for the self-attack of MOR with different leaving groups. All values are kcal/mol. The experimental barriers for the leaving groups with 2-hydroxypropyl phosphate (Expt. I) and uridine 3′-phosphate (Expt. II) are shown due to the lack of available experimental data with MOR. Calculations performed with DFTB3+VR-SCOSMO with no additional, higher-level corrections.

Calculated Experiment
Model/Reference VR-SCOSMO PCMa Expt. Ib Expt. IIc
HO 24.90
CH3O 28.10 24.43 27.09
CH3CH2O 26.60 25.84 28.55 23.37
CH3CH2CH2O 27.38 24.26
(CH3)2CHO 28.87 25.96 30.76 26.02
CH3COO 8.19 11.91
PhO 22.72 14.76 22.62
a

M06-2X/6-311++G(3df,2p)//6-311++G(d,p) with PCM implicit solvation. Ref. 19.

b

2-hydroxypropyl phosphate. Ref. 76.

c

Uridine 3′-phosphate. Ref. 77.

FIG. 7.

FIG. 7.

The model phosphoryl transfer reaction with different leaving groups. Shown is the methyloxyribose ring, which undergoes an internal nucleophilic attack to model the backbone cleavage in RNA. Also shown are the attacking and leaving oxygen bond lengths, R1 and R2, respectively, computed with DFTB3+VR-SCOSMO for each leaving group, X–O. TS 1 and TS 2 are the early and late transition states. Experimental pKa values are taken from Ref. 78.

III. RESULTS AND DISCUSSION

A. VR-SCOSMO model parameterization

Initial tests performed during the preparation of this manuscript found that the standard monopole representation of the DFTB3/3OB electrostatic potential encountered difficulties in distinguishing different functional groups. Large, systematic errors were observed in the absolute solvation free energy between molecules containing oxygen atoms with different numbers of lone pairs, such as when comparing alcohol and acid functional groups. Additionally, conjugated rings were frequently found to produce an electrostatic potential which, at the ring center, was too positive when compared to a B3LYP reference. The implicit solvent model response is caused by the electrostatic potential of the solute; therefore, an auxiliary set of atomic multipole moments from the DFTB3 density matrix was constructed using the prescription developed in Ref. 54. The inclusion of these multipole moments leads to the alleviation or complete elimination of these systematic errors, thus informing the decision to abandon parameterization efforts using the conventional DFTB3 monopole approximation.

Overall, the VR-SCOSMO model performs well for a vast majority of the molecules in the training set, shown in Table II and Figure 3. The SCOSMO non-charge-dependent and VR-SCOSMO charge-dependent parameterizations show similar performance for neutral molecules, having MUEs of approximately 1 kcal/mol. However, the VR-SCOSMO model presents MUE 2-3 kcal/mol lower for the charged species. For both models, the largest errors occur in cationic molecules containing oxygen atoms whose sum of formal charge is +1, that is, molecules containing the “additionally protonated” alcohol compounds or molecules containing a protonated oxygen which also contains a double bond. These compounds are systematically undersolvated by approximately 10 kcal/mol, which is likely due to the over-simplification of proton chemical bonding in the absence of explicit solvent. Better estimations of solvation free energies could likely be obtained if explicit water molecules were included as part of the solute. Due to this logic, these types of compounds were assigned a smaller weight during chi-squared minimization as to not bias the parameterization into over-correcting these errors. For comparison purposes, additional solvation free energy data for the more stable cations, bereft of these compounds, have been added to Tables II and III labeled with a and b, respectively. Additional issues were present in the nitrogen containing compounds which, as a whole, tended to contribute the largest difference from experiment in the solvation free energies tested. Detailed information of individual molecule absolute solvation free energy values can be found in the supplementary material.81

After parameter optimization the SCOSMO and VR-SCOSMO models were evaluated against a separate test set of molecules. Absolute solvation data from these calculations can be seen in Table III and Figure 4. These results are also compared to those generated by Hou et al.,20 who developed a similar charge-dependent nonlinear PB model, SCC-DFTBPR. The molecules within this test set were used as part of the SCC-DFTBPR training set and do not appear within this manuscript’s training set. Again, the largest errors occur for the cationic compounds containing unusual oxygen protonation states. As expected, the SCC-DFTBPR model performs better for these unusual cases, as a significant number of these compounds appeared in the training set for this model. However, if these compounds where removed, the MSE and MUE of the test set for VR-SCOSMO would be 0.3 kcal/mol and 1.6 kcal/mol, respectively, while the SCC-DFTBPR model would report MSEs and MUEs of 0.1 kcal/mol and 2.1 kcal/mol, respectively. Both VR-SCOSMO and SCC-DFTBPR perform significantly better than the SCOSMO parameterization.

The improved accuracy of the VR-SCOSMO model does come at an additional computational cost. Energy calculations with VR-SCOSMO require reconstruction of the surface element mesh at each iteration of the SCF cycle as well as re-evaluation of the electrostatic potential at the surface elements and solution of the solvation polarization response density on the surface. Unlike a fixed surface mesh that can be inverted once and formulated as a Green’s function term added directly to the Fock matrix, the VR-SCOSMO method requires inversion of the surface element interaction matrix each time the surface elements change during the SCF. Alternatively, the solution of the solvation polarization response density can be achieved by linear algebraic methods (e.g., conjugate gradient minimization), which is more practical for large systems, in which case the overhead of the VR-SCOSMO method is much less. Relative timings for the small systems studied here varied greatly, but the VR-SCOSMO/SCOSMO timing ratio was generally around 4:1. It is observed that the number of SCF cycle iterations generally does not change when using either the VR-SCOSMO or SCOSMO models, supporting the thought that the increase in computational cost is believed to largely emerge from the necessary increase in calculation complexity.

B. Phosphoryl transfer reactions

Phosphoryl transfer reactions are ubiquitous in biology7,79 and have been studied extensively with quantum chemistry calculations and implicit solvation models.4,6,15–20 Two different types of phosphoryl reactions were used to test the phosphate hydrolysis applicability of the DFTB3+VR-SCOSMO framework to RNA-like systems, a phosphate hydrolysis reaction, and a series of phosphoryl transesterification transfer reactions with different leaving groups. These reactions have a large amount of local charge transfer, totaling a net 2e charge, which makes them ideal candidates for use with a charge-dependent model with an aim of accurately modeling biological reactions. These reactions in particular serve as analogs for a key step in the self-cleavage of the RNA backbone seen in many ribozymes.

The two-dimensional reaction PESs for hydrolysis of TMP can be seen in Figure 5. The energy barrier is found to be 23.19 kcal/mol which is in good agreement with the experimentally known value of 24.6 kcal/mol.80,81 The model predicts an “early” transition state; that is, it forms the attacking bond P–O in advance of the leaving group bond cleavage. This observation is consistent with previous computational studies of this system.20

Similarly, DFTB3+VR-SCOSMO reaction surfaces for the phosphoryl transesterification reactions of MOR with different leaving groups can be found in Figure 6. A summary of the reaction barriers as compared to previously calculated ab initio barriers using the PCM with UFF radii and experimentally known barriers can be found in Table IV. Additionally, Figure 7 details various transition state geometries for the modeled reactions as well as listing non-rate-limiting reaction barriers for those reactions where two separate transition states were observed. VR-SCOSMO attack barriers compare favorably to those calculated with the M06-2X/6-311++G(3df,2p)//M06-2X/6-311++G(d,p) PCM model19 as well as to the known experiments.76,77 It should be noted, however, that the experiments presented here examine reactions with sightly different chemical structures than those which have been computed in this manuscript. Specifically, Expt. I76 performed experiments with 2-hydroxypropyl phosphate while Expt. II77 examined various esters of uridine 3′-phosphate. However, despite these structural differences, it is thought that the reaction barriers should not be significantly different and general trends of the data should remain fairly consistent.

IV. CONCLUSION

A charge-dependent, variable radii variant of the SCOSMO implicit solvation model is presented. The model, VR-SCOSMO, is parameterized using the DFTB3/3OB-OPhyd semiempirical Hamiltonian, while using an auxiliary set of atomic multipoles to interact the DFTB3 solute with the implicit solvent model. As a baseline for comparison, a traditional SCOSMO solvation is similarly parameterized. The two methods produce similar neutral molecule solvation energies; however, VR-SCOSMO is shown to provide a significant 2-3 kcal/mol improvement in MUE for charged species. Further comparisons were made to a charge-dependent nonlinear PB implicit solvation model,20 with VR-SCOSMO showing similar accuracy. Example DFTB3+VR-SCOSMO applications are performed on phosphoryl hydrolysis and phosphoryl transesterification reactions. The reaction barriers were shown to agree well with available experiment and previous ab initio calculations. The VR-SCOSMO implicit solvation model serves as a efficient technique to probe biochemical reaction landscapes to understand their mechanistic pathways. Its advantages are in producing smooth potential energy surfaces while being able to dynamically adjust itself to changes in the local charge state of atoms over the course of a reaction.

Acknowledgments

The authors are grateful for financial support provided by the National Institutes of Health (No. GM107485 to DY). Computational resources were provided by the Minnesota Supercomputing Institute for Advanced Computational Research (MSI) and Rutgers, the State University of New Jersey.

REFERENCES

  • 1.Kamerlin S. C. L., Haranczyk M., and Warshel A., J. Phys. Chem. B 113, 1253 (2009). 10.1021/jp8071712 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Teixeira V. H., Soares C. M., and Baptista A. M., Proteins 70, 1010 (2008). 10.1002/prot.21588 [DOI] [PubMed] [Google Scholar]
  • 3.Pabis A., Geronimo I., York D. M., and Paneth P., J. Chem. Theory Comput. 10, 2246 (2014). 10.1021/ct500205z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Liu Y., Gregersen B. A., Hengge A., and York D. M., Biochemistry 45, 10043 (2006). 10.1021/bi060869f [DOI] [PubMed] [Google Scholar]
  • 5.Range K., McGrath M. J., Lopez X., and York D. M., J. Am. Chem. Soc. 126, 1654 (2004). 10.1021/ja0356277 [DOI] [PubMed] [Google Scholar]
  • 6.Chen H., Giese T. J., Huang M., Wong K.-Y., Harris M. E., and York D. M., Chem. - Eur. J. 20, 14336 (2014). 10.1002/chem.201403862 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kamerlin S. C. L., Sharma P. K., Prasad R. B., and Warshel A., Q. Rev. Biophys. 46, 1 (2013). 10.1017/S0033583512000157 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhang X., Harrison D. H. T., and Cui Q., J. Am. Chem. Soc. 124, 14871 (2002). 10.1021/ja027063x [DOI] [PubMed] [Google Scholar]
  • 9.Lee T.-S., Wong K.-Y., Giambasu G. M., and York D. M., “Bridging the gap between theory and experiment to derive a detailed understanding of hammerhead ribozyme catalysis,” Prog. Mol. Biol. Transl. Sci. 120, 25–91 (2013). 10.1016/B978-0-12-381286-5.00002-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Radak B. K., Lee T.-S., Harris M. E., and York D. M., RNA 21, 1566 (2015). 10.1261/rna.051466.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Panteva M. T., Dissanayake T., Chen H., Radak B. K., Kuechler E. R., Giambaşu G. M., Lee T.-S., and York D. M., in Multiscale Methods for Computational RNA Enzymology (Elsevier, 2015), Chap. 14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Klimovich P. V., Shirts M. R., and Mobley D. L., J. Comput.-Aided Mol. Des. 29(5), 397–411 (2015). 10.1007/s10822-015-9840-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mortier J., Rakers C., Bermudez M., Murgueitio M. S., Riniker S., and Wolber G., Drug Discovery Today 20(6), 686–702 (2015). 10.1016/j.drudis.2015.01.003 [DOI] [PubMed] [Google Scholar]
  • 14.Wang L., Wu Y., Deng Y., Kim B., Pierce L., Krilov G., Lupyan D., Robinson S., Dahlgren M. K., Greenwood J. et al. , J. Am. Chem. Soc. 137, 2695 (2015). 10.1021/ja512751q [DOI] [PubMed] [Google Scholar]
  • 15.Florián J. and Warshel A., J. Phys. Chem. B 102, 719 (1998). 10.1021/jp972182y [DOI] [Google Scholar]
  • 16.Florián J., Åqvist J., and Warshel A., J. Am. Chem. Soc. 120, 11524 (1998). 10.1021/ja982698a [DOI] [Google Scholar]
  • 17.Lopez X., York D. M., Dejaegere A., and Karplus M., Int. J. Quantum Chem. 86, 10 (2002). 10.1002/qua.1601 [DOI] [Google Scholar]
  • 18.Liu Y., Gregersen B. A., Lopez X., and York D. M., J. Phys. Chem. B 109, 19987 (2005). 10.1021/jp053146z [DOI] [PubMed] [Google Scholar]
  • 19.Huang M. and York D. M., Phys. Chem. Chem. Phys. 16, 15846 (2014). 10.1039/C4CP01050G [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hou G., Zhu X., and Cui Q., J. Chem. Theory Comput. 6, 2303 (2010). 10.1021/ct1001818 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.van der Kamp M. W. and Mulholland A. J., “Computational enzymology: Insight into biological catalysts from modelling,” Nat. Prod. Rep. 25(6), 1001–1014 (2008). 10.1039/b600517a [DOI] [PubMed] [Google Scholar]
  • 22.Kuechler E. R. and York D. M., J. Chem. Phys. 140, 054109 (2014). 10.1063/1.4863344 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Cramer C. J. and Truhlar D. G., Chem. Rev. 99, 2161 (1999). 10.1021/cr960149m [DOI] [PubMed] [Google Scholar]
  • 24.Roux B. and Simonson T., Biophys. Chem. 78, 1 (1999). 10.1016/S0301-4622(98)00226-9 [DOI] [PubMed] [Google Scholar]
  • 25.Truong T. N., Int. Rev. Phys. Chem. 17, 525 (1998). 10.1080/014423598230045 [DOI] [Google Scholar]
  • 26.Im W., Chen J., and Brooks C. L. III, Adv. Protein Chem. 72, 173 (2006). 10.1016/S0065-3233(05)72007-6 [DOI] [PubMed] [Google Scholar]
  • 27.Senn H. M. and Thiel W., Angew. Chem., Int. Ed. 48, 1198 (2009). 10.1002/anie.200802019 [DOI] [PubMed] [Google Scholar]
  • 28.Senn H. M. and Thiel W., Top. Curr. Chem. 268, 173 (2007). 10.1007/128_2006_084 [DOI] [Google Scholar]
  • 29.Gao J. and Truhlar D. G., Annu. Rev. Phys. Chem. 53, 467 (2002). 10.1146/annurev.physchem.53.091301.150114 [DOI] [PubMed] [Google Scholar]
  • 30.Klähn M., Rosta E., and Warshel A., J. Am. Chem. Soc. 128, 15310 (2006). 10.1021/ja065470t [DOI] [PubMed] [Google Scholar]
  • 31.Sharp K. and Honig B., J. Phys. Chem. 94, 7684 (1990). 10.1021/j100382a068 [DOI] [Google Scholar]
  • 32.Kelly C. P., Cramer C. J., and Truhlar D. G., J. Chem. Theory Comput. 1, 1177 (2005). 10.1021/ct050164b [DOI] [PubMed] [Google Scholar]
  • 33.Geisen D. J., Hawkins G. D., Liotard D. A., Cramer C. J., and Truhlar D. G., Theor. Chem. Acc. 98, 85 (1997). 10.1007/s002140050283 [DOI] [Google Scholar]
  • 34.Cramer C. J. and Truhlar D. G., Science 256, 213 (1992). 10.1126/science.256.5054.213 [DOI] [PubMed] [Google Scholar]
  • 35.Klamt A. and Schüürmann G., J. Chem. Soc., Perkin Trans. 2 2, 799 (1993). 10.1039/P29930000799 [DOI] [Google Scholar]
  • 36.Klamt A., J. Phys. Chem. 99, 2224 (1995). 10.1021/j100007a062 [DOI] [Google Scholar]
  • 37.Klamt A., Eckert F., Diedenhofen M., and Beck M. E., J. Phys. Chem. A 107, 9380 (2003). 10.1021/jp034688o [DOI] [PubMed] [Google Scholar]
  • 38.Eckert F. and Klamt A., J. Comput. Chem. 27, 11 (2006). 10.1002/jcc.20309 [DOI] [PubMed] [Google Scholar]
  • 39.Eckert F. and Klamt A., AIChE J. 48, 369 (2002). 10.1002/aic.690480220 [DOI] [Google Scholar]
  • 40.Schäfer A., Andreas Klamt J. C. W. L., Sattel D., and Eckert F., Phys. Chem. Chem. Phys. 2, 2187 (2000). 10.1039/b000184h [DOI] [Google Scholar]
  • 41.Liotard D. A., Hawkins G. D., Lynch G. C., Cramer C. J., and Truhlar D. G., J. Comput. Chem. 16, 422 (1995). 10.1002/jcc.540160405 [DOI] [Google Scholar]
  • 42.Qiu D., Shenkin P. S., Hollinger F. P., and Still W. C., J. Phys. Chem. A 101, 3005 (1997). 10.1021/jp961992r [DOI] [Google Scholar]
  • 43.Pierotti R. A., Chem. Rev. 76, 717 (1976). 10.1021/cr60304a002 [DOI] [Google Scholar]
  • 44.Miertuš S., Scrocco E., and Tomasi J., Chem. Phys. 55, 117 (1981). 10.1016/0301-0104(81)85090-2 [DOI] [Google Scholar]
  • 45.Lange A. W. and Herbert J. M., J. Chem. Phys. 133, 244111 (2010). 10.1063/1.3511297 [DOI] [PubMed] [Google Scholar]
  • 46.York D. M. and Karplus M., J. Phys. Chem. A 103, 11060 (1999). 10.1021/jp992097l [DOI] [Google Scholar]
  • 47.Scalmani G. and Frisch M. J., J. Chem. Phys. 132, 114110 (2010). 10.1063/1.3359469 [DOI] [PubMed] [Google Scholar]
  • 48.Ginovska B., Camaioni D. M., Dupuis M., Schwerdtfeger C. A., and Gil Q., J. Phys. Chem. A 112, 10604 (2008). 10.1021/jp804092v [DOI] [PubMed] [Google Scholar]
  • 49.Aguilar M., Martin M., Tolosa S., and Olivares Del Valle F., J. Mol. Struct.: THEOCHEM 166, 313 (1988). 10.1016/0166-1280(88)80455-X [DOI] [Google Scholar]
  • 50.Aguilar M. and Olivares Del Valle F., Chem. Phys. 129, 439 (1989). 10.1016/0301-0104(89)85013-X [DOI] [Google Scholar]
  • 51.Gaus M., Cui Q., and Elstner M., J. Chem. Theory Comput. 7, 931 (2011). 10.1021/ct100684s [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Gaus M., Goez A., and Elstner M., J. Chem. Theory Comput. 9, 338 (2013). 10.1021/ct300849w [DOI] [PubMed] [Google Scholar]
  • 53.Gaus M., Lu X., Elstner M., and Cui Q., J. Chem. Theory Comput. 10, 1518 (2014). 10.1021/ct401002w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Giese T. J., Chen H., Huang M., and York D. M., J. Chem. Theory Comput. 10, 1086 (2014). 10.1021/ct401035t [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Giese T. J., Panteva M. T., Chen H., and York D. M., J. Chem. Theory Comput. 11, 451 (2015). 10.1021/ct500799g [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.York D. M., Lee T.-S., and Yang W., Chem. Phys. Lett. 263, 297 (1996). 10.1016/S0009-2614(96)01198-0 [DOI] [Google Scholar]
  • 57.Gregersen B. A., Khandogin J., Thiel W., and York D. M., J. Phys. Chem. B 109, 9810 (2005). 10.1021/jp044061l [DOI] [PubMed] [Google Scholar]
  • 58.Khandogin J., Gregersen B. A., Thiel W., and York D. M., J. Phys. Chem. B 109, 9799 (2005). 10.1021/jp044062d [DOI] [PubMed] [Google Scholar]
  • 59.Lebedev V. I., Ž. Vyčisl. Mat. i Mat. Fiz. 16, 293 (1976). 10.1016/0041-5553(76)90100-2 [DOI] [Google Scholar]
  • 60.Lebedev V. I., Sibirsk. Mat. Ž. 18, 99 (1977). 10.1007/BF00966954 [DOI] [Google Scholar]
  • 61.Gregersen B. A. and York D. M., J. Chem. Phys. 122, 194110 (2005). 10.1063/1.1899146 [DOI] [PubMed] [Google Scholar]
  • 62.Kästner J., Carr J. M., Keal T. W., Thiel W., Wander A., and Sherwood P., J. Phys. Chem. A 113, 11856 (2009). 10.1021/jp9028968 [DOI] [PubMed] [Google Scholar]
  • 63.Elstner M., J. Phys. Chem. A 111, 5614 (2007). 10.1021/jp071338j [DOI] [PubMed] [Google Scholar]
  • 64.Yang Y., Yu H., York D. M., Cui Q., and Elstner M., J. Phys. Chem. A 111, 10861 (2007). 10.1021/jp074167r [DOI] [PubMed] [Google Scholar]
  • 65.Formoso E., Matxain J. M., Lopez X., and York D. M., J. Phys. Chem. B 114, 7371 (2010). 10.1021/jp909004y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Gu H., Zhang S., Wong K.-Y., Radak B. K., Dissanayake T., Kellerman D. L., Dai Q., Miyagi M., Anderson V. E., York D. M. et al. , Proc. Natl. Acad. Sci. U. S. A. 110, 13002 (2013). 10.1073/pnas.1215086110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Giambaşu G. M., York D. M., and Case D. A., RNA 21, 963 (2015). 10.1261/rna.047357.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Heldenbrand H., Janowski P. A., Giambaşu G., Giese T. J., Wedekind J. E., and York D. M., J. Am. Chem. Soc. 136, 7789 (2014). 10.1021/ja500180q [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Scott W. G., What Can the New Hammerhead Ribozyme Structures Teach us About Design? (Springer-Verlag, Berlin Heidelberg, 2010), pp. 305–323. [Google Scholar]
  • 70.Scott W. G., Biol. Chem. 388, 727 (2007). 10.1515/BC.2007.087 [DOI] [PubMed] [Google Scholar]
  • 71.Scott W. G., Murray J. B., Arnold J. R. P., Stoddard B. L., and Klug A., Science 274, 2065 (1996). 10.1126/science.274.5295.2065 [DOI] [PubMed] [Google Scholar]
  • 72.Zhang S., Ganguly A., Goyal P., Bingaman J. L., Bevilacqua P. C., and Hammes-Schiffer S., J. Am. Chem. Soc. 137, 784 (2015). 10.1021/ja510387y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Viladoms J. and Fedor M. J., J. Am. Chem. Soc. 134, 19043 (2012). 10.1021/ja307021f [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Thaplyal P., Ganguly A., Hammes-Schiffer S., and Bevilacqua P. C., Biochemistry 54, 2160 (2015). 10.1021/acs.biochem.5b00190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ferré-D’Amaré A. R. and Scott W. G., Cold Spring Harbor Perspect. Biol. 2, a003574 (2010). 10.1101/cshperspect.a003574 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Brown D. M. and Usher D. A., J. Chem. Soc. 87, 6558 (1965). 10.1039/jr9650006558 [DOI] [Google Scholar]
  • 77.Kosonen M., Youseti-Salakdeh E., Strömberg R., and Lönnberg H., J. Chem. Soc., Perkin Trans. 2 1997, 2661–2666. 10.1039/a704636g [DOI] [Google Scholar]
  • 78.Serjeant E. P. and Dempsey B., Ionisation Constants of Organic Acids in Aqueous Solution (Pergamon Press, New York, 1979). [Google Scholar]
  • 79.Lassila J. K., Zalatan J. G., and Herschlag D., Annu. Rev. Biochem. 80, 669 (2011). 10.1146/annurev-biochem-060409-092741 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Barnard P. W., Bunton C. A., Llewellyn D. R., Vernon C. A., and Welch V. A., J. Chem. Soc. 1961, 2670–2676. 10.1039/jr9610002670 [DOI] [Google Scholar]
  • 81.See supplementary material at http://dx.doi.org/10.1063/1.4946779 E-JCPSA6-144-002616 for all materials stated to be in the supplementary material.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. See supplementary material at http://dx.doi.org/10.1063/1.4946779 E-JCPSA6-144-002616 for all materials stated to be in the supplementary material.

Articles from The Journal of Chemical Physics are provided here courtesy of American Institute of Physics

RESOURCES