Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Jun 1.
Published in final edited form as: J Comput Biophys Chem. 2021 Dec 29;21(4):431–447. doi: 10.1142/s2737416521420047

High Order Ab Initio Valence Force Field with Chemical Pattern Based Parameter Assignment

Xudong Yang 1, Chengwen Liu 1, Pengyu Ren 1,*
PMCID: PMC9248749  NIHMSID: NIHMS1766431  PMID: 35784097

Abstract

Bonded (or valence) interactions, which directly determine the local structures of the molecules, are fundamental parts of molecular mechanics force fields (FFs). Most popular classical FFs adopt the simple harmonic models for bond stretching and angle bending and ignore cross-coupling effects among the valence terms. This may lead to less accurate vibrational properties and configurations in molecular dynamics (MD) simulations. AMOEBA models utilize an MM3(MM4)-style bonded interaction model, in which the vibrational anharmonicity, the coupling effects among different energy terms, and the out-of-plane bending for sp2-hybridized atoms are considered. In this work, we report the development of bonded interaction parameters for a wide range of chemistry based on quantum mechanics (QM). About 270 atomic types defined by SMARTS strings were used to model the valence interactions. Our results indicate that the resulting valence parameters produce accurate vibrational frequencies (RMSD from QM is less than ~36.6 cm−1) over a large set of molecules with diverse functional groups (445 molecules). By contrast, the harmonic models usually give an RMS error greater than 60 cm−1. Meanwhile, this model accurately reflects the potential energy surface of the out-of-plane bending. Our model can generally be applied to the AMOEBA family and any MM3(MM4)-based molecular mechanics FFs.

Keywords: AMOEBA potential, Force field, Bonded interactions, Normal mode frequencies

1. Introduction

Classical molecular dynamics (MD) simulations are widely applied by researchers to investigate various biomolecular systems. One of the essential elements determining the reliability of MD simulations is the underlying classical potentials, or force fields (FFs)13, which describe the intra- and intermolecular interactions using empirical functional forms. Abundant studies focus on improving modeling the intermolecular (non-bonded) interactions as they are responsible for thermodynamic properties in protein-ligand binding47, protein folding8, enzymatic catalysis9, and other critical biomolecular processes. Nonetheless, bonded interactions are equally important as they determine the local structural ensemble sampled during the MD simulations.10

Most popular FFs, such as AMBER1114, OPLS1517, and CHARMM1820, adopt simple harmonic functions to describe bond stretching and angle bending without considering the anharmonic nature of bonds and angles. AMBER omits cross-coupling effects between bond and angle terms, and it uses the Fourier style to describe out-of-plane effects. OPLS uses a similar functional form to AMBER. CHARMM employs a unique improper potential to represent the out-of-plane bending and a Urey-Bradley potential to account for the cross-coupling. Those methods are computationally efficient, but their accuracy is limited in complex molecular structures, especially in vibrational analysis. By contrast, AMOEBA-based models2123 use the sophisticated MM3(MM4)-style2425 functional forms to describe these bonded interactions, in which the anharmonicity, cross-coupling effects, and out-of-plane bending (opbend) have been included. First, higher-order polynomial terms were added to the bond stretching and angle bending terms to capture the anharmonicity while avoiding abnormal effects on strained bonds or angles due to the negative contributions from odd polynomial terms. Second, the cross-coupling terms were proven to be essential for capturing the coupling effects and improving the vibrational frequencies. For example, it is evidenced that the combination of a high-order polynomial model and cross-terms can greatly enhance the description of equilibrium structure and vibrational frequencies of formate ion.2627 Specifically, the average absolute deviation of vibrational frequencies (|Δv|ave) plunges from 155.6 cm−1 by the simple harmonic model without any cross terms to 7.2 cm−1 by a quartic model with third-order cross terms. Similar improvements have been found on systems including amides, peptides, and others.2829 Finally, the out-of-plane bending (opbend)30 is a necessary and important energy term to maintain planar structure around trigonal centers as a significant pyramidalization can form without seriously distorting any of the three angles of bonds involved.

MM3/MM4 parameters were derived almost three decades ago. Quantum mechanics (QM) calculations have significantly improved over the years, with energy, structures, vibrational frequencies closely matching experimental measurements. In addition, a more comprehensive range of coverage in chemical space is needed, and better ways for parameter assignment and typing (e.g., chemical pattern based) are readily available. This work aims to provide an up-to-date database of valence parameters for common organic molecules using atomic types determined by SMARTS strings. These parameters will be used for AMOEBA21 and AMOEBA+22 FFs but are also compatible with other classical FFs that use MM3/MM4 style valence terms. We have developed an automated Python program that implements the Modified Seminario method31 to generate bond and angle parameters from ab initio Hessian matrices. In addition, a reversed searching with the ranking tree was implemented inside this program to assign valence parameters for new molecules based on SMARTS patterns. Moreover, the anharmonicity, coupling effects, and out-of-plane bending have been analyzed on a large set of diverse molecules. The database of parameters will allow us to parameterize and simulate new biological and synthetic molecules, such as ligands and drug compounds by using AMOEBA and AMOEBA+ FFs.

2. Methodologies

2.1. Theoretical background

The bonded interactions of AMOEBA21 and AMOEBA+2223 include five components: bond stretching, angle bending, out-of-plane bending (opbend), torsion, and cross-coupling terms for bond/angle/torsion, as shown in Equation 1.

Ebonded=Estretch+Ebend+Etorsion+Eopbend+Ecross #(1)

This study focuses on bond stretching, angle bending, stretch-bend coupling, and opbend terms, which are the standard contributing elements to the total potential energy shared by almost all the molecules. The torsional energy involving four atoms in connection is usually coupled with other interactions such as electrostatics, van der Waals and polarization. The torsion term and related cross-coupling terms are treated as a residual correction to match the ab initio quantum mechanics reference energy after all the other terms of AMOEBA(+), including non-bonded terms are determined.

As mentioned above, the AMOEBA valence terms have been mostly inherited from MM3/MM4 model.2425 The advantage of using the higher-order polynomials is that the energy variation within a wider range of geometry changes can be accurately captured. Bond stretching term adopts the quartic form (Equation 2), which can generate reliable energy for bond variation in the range of ±0.3 Å from the equilibrium structure. Also, the sextet bending model is utilized, which can reliably reproduce ab initio energy for the angle variation in the range of ±70° and beyond.32 The out-of-plane bending (opbend) term is used to keep the planar structures for sp2 atoms at and around the trigonal centers such as the amide group. Without the out-of-plane or improper torsion term used by some force fields, large pyramidalization may occur during the MD simulations without seriously distorting the bond lengths and angles. In AMOEBA, the opbend term (Equation (4)) shares a similar functional form to the angle bending term. The χ is defined as the bending angle between the plane and the central atom (Figure 1), which means each out-of-plane structure has three components contributed by each peripheral atom.

Estretch=kbΔr2[12.55Δr+712×2.552Δr2] #(2)
Ebend=kaΔθ2[10.014Δθ+5.6(10)5Δθ27.0(10)7Δθ3+2.2(10)8Δθ4] #(3)
Eoop=kopbendΔχ2[10.014Δχ+5.6(10)5Δχ27.0(10)7Δχ3+2.2(10)8Δχ4] #(4)

Figure 1.

Figure 1.

An illustration of a trigonal center D pyramidalizing from plane ABC. The out-of-plane angle χ has been defined as illustrated.

Here Δr = rr0 is the difference between the real bond length and the reference r0, also, we have Δθ = θθ0 and Δχ = χχ0. The cross-terms (Equation 5) describe the coupling effects among bond stretching, angle bending, and rotational torsion. In this study, we focus on the stretch-bend cross term only. Torsion and related cross-terms are included in AMOEBA functional forms and will be considered later. These torsion-related cross-terms are needed to describe the hyperconjugation effects for conjugated molecules.33 The stretch-bend cross-term is expressed in Equation (6): for any angle A-B-C, two parameters kb,AB and kb,BC denotes the coupling of the bond AB, BC respectively with angle ABC.

Ecross=Estrbnd+Estrstr+Ebndbnd+Estrtors+Ebndtors+Etorstors #(5)
Estrbnd=[kba,ABΔrAB+kba,BCΔrBC]ΔθABC #(6)

In this work, the distinct bond types are defined by the atomic types of 2 atoms forming the bond (A-B). Also, the different angle types have been classified by the atomic types of 3 atoms forming the angle (A-B-C).23 Slightly simple types are used for the stretch-bend energy term, where any angles with the same central atom use the same parameters. Besides, the opbend terms have been classified by the type of central sp2 atom with the type of one peripheral atom.

2.2. Initial bond/angle parameters from QM Hessian matrix

In this work, we adopt the modified Seminario method31, 34 to generate our initial bond stretching and angle bending parameters directly from the QM Hessian matrix. The modified Seminario method is a convenient and efficient way to derive accurate stretching and bending parameters with all other contributions ruled out (as opposed to fitting).

The Seminario method was initially designed for the generation of OPLS-AA35 style parameters and written in MATLAB. We have re-written the code in Python based on the atom types we defined here for organic molecules (see Data availability).23 The OPLS-AA FF uses the classical harmonic model for stretching and bending, which means the modified Seminario cannot be directly applied to the parameterization based on our model. The resulting parameters from this method serve as excellent initial values for further optimization against experimental data, which is significantly improved over the previous simple estimates used by Tinker36 or Poltype37 tools.

The mathematical details and principles about the modified Seminario are described here. The Hessian matrix is written based on the Cartesian coordinates of each atom, which is further divided into many small 3×3 matrices belonging to different bonds, such as HAB for the bond AB in Equation (7). Stretching parameters can be obtained straightforwardly by using Equation (7), where λiAB(i=1,2,3) are the eigenvalues of the small Hessian for the bond AB and υiAB(i=1,2,3) are the corresponding eigenvectors, uAB is the bond vector (All the vectors mentioned here and in the next paragraph are normalized).

HAB=[2ExAxB2ExAyB2ExAzB2EyAxB2EyAyB2EyAzB2EzAxB2EyAzB2EzAzB],kAB=i=13λiAB|uABυiAB| #(7)

As to deriving angle bending parameters, additional work is involved. Inside the equations (8) below, we need to first derive the intermediate force constants in the direction perpendicular to the bonds forming the angle (vectors uPA and uPC with the orientation pointing to the interior in Figure 2(a)). Then, by considering the relation of those normal vectors and their contributions to the angle bending, the equations (9) and (10) are used to calculate the final bending force constants. More mathematical and analytical details about this method can be found elsewhere.31 The N is the number of angles in the FF with a central atom B and involves the bond AB. Take oxalic acid as an example in Figure 2(b); atom A is the peripheral atom for two coupled angles: ABC and ABE. To obtain the force constant for angle ABC kθ(ABC), we can see the vector uPA,1 perpendicular to bond AB points to ABC and uPA,2 perpendicular to bond AB points to ABE. However, this method is not appropriate to deal with angles involving sp-hybridized atoms, because it cannot detect the orientation. Sp atoms are not common, so some more rough estimates are used here for initial values in the subsequent fitting to QM vibrational frequencies (VF).

kPA=i=13λiAB|uPAυiAB|,  kPC=i=13λiCB|uPCυiCB| #(8)
sPA={1+j=1N|uPA,1uPA,j|21N1(N>1)1(N=1) #(9)
1kθ(ABC)=sPARAB2kPA+sPCRCB2kPC #(10)

Figure 2.

Figure 2.

An illustration of the vectors used in the Seminario method. (a) The 2 vectors uPA and uPC contributing to the force constant of angle ABC in water molecule; (b). The 2 vectors pointing to 2 coupling angles with A as a side atom in oxalic acid.

It is convenient to derive the initial bond/angle parameters using the Modified Seminario method from the QM Hessian Matrix of a given molecule. However, since this model is appropriate mainly for the harmonic model, the parameters need to be further fitted to the QM VF. The whole procedure has been illustrated in Figure 5 and implemented in an automated program which will be described in Section 2.4.

Figure 5.

Figure 5.

The procedure to generate the final valence parameter set (green arrows) and the methods to obtain and assign parameters for any new molecule (yellow arrows): assignment by Modified Seminario is shown inside the red dash box, and assignment by reversed searching is shown inside the blue dash box.

2.3. Computational details

(1). Selection of training set molecules

The training set consists of 445 molecules, with the distribution of the number of heavy atoms in each molecule shown in Figure 3. The whole range is 1 to 18 heavy atoms with most molecules having 5 to 9 heavy atoms. All the molecules have been classified into categories that span from alkane to bio-fragments, as illustrated in Figure 4 and Table 1. In total there are grouped into seven categories: (1). alkanes and aliphatic rings; (2). molecules comprised of only single bonds; (3). molecules with double bonds or triple bonds; (4). benzene derivatives; (5). heterocyclics; (6). bio-fragments; (7). combinations of multiple aromatic cycles. The detailed list of molecules with their SMARTS and specific categories is in Table S1. The whole procedure of parametrization has been illustrated by the workflow in Figure 5.

Figure 3.

Figure 3.

Distribution of the number of heavy atoms in different molecules. (Red: training set; Blue: validation set; Green: total.)

Figure 4.

Figure 4.

Representatives of the organic molecules covered in this study.

Table 1.

A general illustration of the functional groups and distinct chemical structures covered in the current study.

graphic file with name nihms-1766431-t0001.jpg
graphic file with name nihms-1766431-t0002.jpg
graphic file with name nihms-1766431-t0003.jpg
graphic file with name nihms-1766431-t0004.jpg

(2). Initial bond/angle parameters from Hessian matrix

The frequencies of the normal modes have been calculated by using Gaussian0938 with MP2/6–31g** (6–311g** for iodine atom) after the tight optimization with the same method and basis set. The bond/angle equilibrium values at MP2/6–31g** are close to the experiment for the organic molecules.39 In this work, the reference bond lengths and angles were extracted from the final optimized structures and averaged based on the pre-defined bond and angle types. Then, the initial force constant parameters for bonds and angles were derived from the Hessian matrix by using the modified Seminario method.31 These parameters were further refined in step (3), targeting the QM frequencies.

(3). Bond/angle parameters refinement using QM frequency

As suggested previously4041, the frequencies calculated by MP2/6–31g** (6–311g** for iodine) are scaled by a factor of 0.943 for better consistency with the experimental values. The vibrational analysis program (vibrate.x) in Tinker was used to generate normal mode frequencies for our models. Since we only focus on the interactions related to bonds and angles here and the frequencies of high range (usually large than 1000 cm−1) are mainly contributed by those terms while the low range depends mainly on torsional terms, the reference data lower than 1000 cm−1 are not used in the refinement. The least-square method from scipy42 is used to optimize the parameters for bond, angle, and stretch-bend cross-terms, with the cost, root-mean-square-error (RMSE in VF), as given by Equation 11.

RMSE(frequency)=1ni(νMMνQM,scaled)2 #(11)

(4). Opbend parameters derivation from potential energy surface fitting

The potential energy surfaces of the training set molecules were utilized to derive the out-of-plane bending (opbend) parameters (further details in Table S2). We focus on those molecules containing the sp2 atoms that form trigonal planar structures. The central sp2 atom was pulled out of the plane by 0.05, 0.1, 0.15 Å (Opbend-small subset hereafter), and 0.2, 0.25, 0.3 Å (Opbend-large subset hereafter), respectively, to create deformed configurations and corresponding energies. We obtained 4318 configurations for the training set, and the single point energy was calculated using Psi443 at MP2/aug-cc-pvtz (aug-cc-pvtz-pp for iodine) level of theory.

The least-square method was used to fit the opbend parameters by optimizing the weighted RMSE of the energy difference (shown by Equation 12). To ensure that our parameters are not biased towards the high-energy (rare) structures far from equilibrium, we used a smaller weight (0.2) for configurations with energy 20 kcal/mol higher than that of equilibrium.

wRMSE(energy)=1ni[(EMM,smallEQM,small)2+0.2×(EMM,largeEQM,large)2] #(12)

(5). Validation

After obtaining the parameters by fitting frequencies and potential energy surface, we chose 120 new molecules as the validation set (see Table S3 & S4 for details). The distribution of the number of heavy atoms in molecules of the validation set has been illustrated in Figure 3. Overall, the molecules in the validation set contain 4~14 heavy atoms. Those molecules are made of functional groups included in the training set molecules (Table 1). We mainly evaluated the bond/angle and coupling parameters by comparing the high-range frequencies of normal mode for these molecules. The opbend parameters were validated by comparing the energy differences between QM and our model for 829 configurations stemming from the 120 molecules in the validation set. This validation is intended to show the transferability of the current valence parameters.

2.4. Automated generation and assignment of valence parameters for new molecules

Although we have covered a range of chemistry (see Table 1) widely encountered in the practical simulations, there remains the possibility of failure to find the parameters for a certain molecule. For the molecules that the exact SMARTS patterns are missing in our database, we attempt to assign parameters based on the highest similarity to the existing chemical fragments.

Here we provide an automated program to determine the best matching atomic type for any new molecule and assign both valence parameters and our previously published charge-flux parameters23 to each atom (see Data availability). It will attempt to find an exact matching from our database at first. When the exact matching fails, this program has implemented 2 methods to estimate those missing parameters: (1). Modified Seminario (in the red dash-line rectangle of Figure 5); (2). Reversed searching by ranking tree (in cyan dash-line rectangle of Figure 5). In this work, we used the second method to generate the new parameters for the validation set.

The second method depends on the current parameter set to generate new parameters suitable for our model. We established a ranking tree where different types were placed in some positions in different levels determined by the complexity of the structure (Figure 6). For example, given a bond A-B which has no direct equivalent in the database, the program will check the position of types A and B in the ranking tree. It undergoes a reversed searching starting from the bottom (most complex), during which the types A and B are sequentially replaced by other types in the same level under the same root. If there is still no matching item, the searching will move to the upper level and continue this operation. The final parameter will be given by averaging the first 5 parameters found during the reversed searching.

Figure 6.

Figure 6.

The basic structure of the ranking tree (simplified). The green arrows show the sequence of searching in the same level and reversely to the upper levels.

For the common sp3 carbon, we have introduced canonical types (nsp3 C) based on the number of H atoms attached. They are designated as C30, C31, C32, C33, or C34 (See Table S5). Given a molecule, the detailed atom types of any sp3 C atoms will be identified by comparing their local environment (SMARTS) with the database. If there is no complete match, these canonical types will be used to generate parameters. For instance, if we have a molecule CH3-CH(CH3)-CH2-CHO and find no angle parameter of CH-CH2-CHO, then we will collect all the known angle parameters for the structure as nsp3 C – nsp3 C – CHO in the database to compute the average value and assign its value to the above angle. If still no appropriate parameter is found, we will continue reversed searching on the carbon atoms which are not sp3. If high accuracy is required under this method, we can use the frequency fitting to improve all the valence terms except the opbend terms.

3. Results and discussion

3.1. Overview of the atomic types and valence parameters

Currently, we have determined the valence parameters based on about 270 atomic types. All the parameters were derived by both fitting to QM data or the ranking tree. To be specific, we generated 768 bond terms (of which 42 terms from the ranking tree, with each term including kb and b0), 1946 angle terms (of which 186 terms from the ranking tree, with each term involving ka and θ0), 109 stretch-bend terms (one term involves kba,AB and kba,BC), and 565 opbend terms (of which 34 from the ranking tree, one term involves kopbend).

The force constants have been determined for different structures by using our version of the modified Seminario program for bond and angle parameters. All the reference bond (b0) and angle (θ0) are averaged over the same atomic types. The current atomic types can keep the overall RMSD of the reference bond length less than 0.05Å and angle degree less than 5° for our training set comparing to the QM optimized structures (see Data availability), which is acceptable in MD simulations23. As to cross terms, our current setting only turns on the stretching-bending coupling effects. The stretch-bend parameters are only distinguished by the atomic type of the central atom, i.e., all the angle structures with the same central atom share the same stretch-bend parameters. Next, the classification of out-of-plane bending terms (opbend) only considers the atomic type of the central atom and the side atom, which possibly pop out of the triangle plane. We will analyze this in detail in the following sections.

3.2. Analysis on parameters of the valence terms

We examined the force constants (kb) and equilibrium bond lengths (b0) between different sp3 C and H. It is observed from Table 2 that the sp3 C with different numbers of the bonded hydrogens does not show a remarkable change in the bond length of C-H and corresponding force constant, especially the negligible difference among C(H3)-H, C(H2)-H, C(H1)-H. So, the bond type for terminal C and H can be equated with that for middle C and H. For most cases, the local functional groups have minimal effects on C-H because the RMSD of the lengths of all the matching bonds in the database is below 0.03 Å.

Table 2.

The comparison of equilibrium bond lengths and corresponding force constants among sp3 C and H. a)

bond b) Equilibrium length (Database) c) Force constant (Database) c)
C[H4]-H 1.0897 (4) 0.0000 383.57
C[H3]-H 1.0940 (1297) 0.0297 345.98
C[H2]-H 1.0957 (448) 0.0028 340.53
C[H1]-H 1.0980 (52) 0.0034 338.72
C[H3]-C[H3] 1.5255 (1) 0.0000 266.49
C[H3]-C[H2] 1.5225 (153) 0.0065 203.31
C[H3]-C[H1] 1.5240 (82) 0.0052 280.35
C[H3]-C[H0] 1.5306 (16) 0.0044 210.15
C[H2]-C[H2] 1.5236 (30) 0.0046 205.76
C[H2]-C[H1] 1.5231 (9) 0.0112 214.73
C[H2]-C[H0] 1.5352 (3) 0.0094 190.84
C[H1]-C[H1] 1.5420 (1) 0.0000 172.75
C[H0]-C[H0] 1.5697 (1) 0.0000 133.12
a)

Statistics are reported from all the configurations in our database.

b)

Here, C[Hn] means the sp3 C with n hydrogens bonded.

c)

The numbers in the parenthesis represent the number of matching structures in the database, and the underlined number means the RMSD: root mean square derivation. (Unit: length – Å, force constant – kcal∙mol−1∙Å−1))

Next, we listed the possible pairs of the bonds between the canonical sp3 C in Table 2. The bond lengths between sp3 C and sp3 C with the different number of connected hydrogens are mostly within the range of 1.52 Å~1.53 Å, but it is observed that less number of bonded hydrogens lead to longer bond lengths and smaller force constants. The C(H3)-C(H1) has the largest force constant, but it involves a terminal carbon with 3 H atoms. For those carbons of the interior, this trend is mostly correct, suggesting a strong coupling between bonds and the chemical environment (other bonds and angles) inside the molecule.

As for other kinds of bonds and angles, the analysis is becoming complicated, and no clear trend is observed.

3.3. Normal mode frequency

The frequencies of the normal modes for all 445 molecules in the training set have been utilized for determining force constants after a tight structure optimization. The correlation between QM reference and our model is illustrated in Figure 7(a). The RMSD of the frequency of the normal modes reaches 36.6 cm−1 for the whole training set; at the same time, the relative error percentage is around 1.7% (Table 3). By comparison, it was observed that OPLS combined with the modified Seminario method could generate frequencies with the relative errors of 7.4% (RMSD: 59.4 cm−1) in general.31 AMBER-type valence terms combined with a strategy of partial hessian fitting gives out RMSD ~81.4 cm−1 from a much smaller database,44 and GAFF would show frequency error mostly larger than 100 cm−1.1 Meanwhile, these previous works on vibration analysis used small databases limited in the number and the variety of molecules, which in turn demonstrates the reliability of the valence model and parameters derived in this work using a large and diverse set of molecules.

Figure 7.

Figure 7.

The correlation between the QM and MM normal mode frequencies(cm−1) for the molecules in (a) the training set and (b) the validation set.

Table 3.

Statistical comparison of QM scaled frequencies of normal modes and the values calculated by MM model.

Statistics a) Training set b) Validation set b)
RMSE MRE Num. RMSE MRE Num.
Group A 38.4279 1.81% 4846 37.9629 1.82% 1181
Group B 34.2120 1.72% 354 35.2745 1.59% 146
Benzene derivatives 28.5765 1.45% 1433 32.1555 1.48% 982
Heterocyclics 41.0277 1.91% 1122 33.3864 1.71% 541
Bio-fragments 39.0039 1.84% 831 38.6137 1.94% 81
Combination of multiple aromatic cycles 28.8252 1.41% 752 26.0445 1.30% 449
Total 36.6139 1.73% 9338 34.0284 1.62% 3410
R2=0.9981 R2=0.9983
a)

Group A includes those molecules with less than 10 heavy atoms (non-hydrogen) and Group B means those molecules with ≥ 10 heavy atoms. Both Group A and Group B contain those molecules which are not involved in other categories listed in the table.

b)

RMSE: root mean square error (unit: cm−1); MRE: mean relative error (1Ni=1N|ν(QM)ν(MM)|ν(QM)×100%). The number of respective unique data points is marked with Num. R2: correlation coefficient.

As seen in Figure 7(a), most valence related vibration frequencies concentrate on two areas (1000~1800 cm−1 and 2800~3600 cm−1). The valence terms can capture those modes in the range 2800~3600 cm−1, which are contributed mainly by the fast stretching between heavy atoms and hydrogen such as C-H, N-H, or O-H. These motions occur at the terminal of the molecule, indicating limited coupling effects with the local angle bending. It is a common practice to restrain the H related high frequency stretching during molecular simulation to allow larger time steps4546, while we have found reversible reference system propagator algorithms (RESPA) is a more effective approach – a small time step (e.g., 0.5 fs) can be used for valence forces and a large time step (e.g., 2 fs) is applied to the other contributions. The situation becomes more complicated when it comes to the range of 1000~1800 cm−1. These involve double bond stretching, angle bending, and the mixture of several other motions. Because these motions directly affect the properties of interest in MD, the use of cross-terms and high-order polynomial models (for anharmonicity) becomes important. The detailed frequency comparisons for several typical molecules have been displayed in Figure S1, proving the reliability of our current model and parameters.

Some data points in Figure 7(a) with relatively large deviations arise from carbamate and guanine derivatives (several molecules having such functional groups). In carbamic acid and carbamate ion, we found that the normal mode contributed by the stretching of C=O is mixed with certain local angles bending (O-C-O, N-C-O, and more) lower frequency. The QM gives 1752 cm−1 in carbamate ion and 1893 cm−1 in carbamic acid for this coupled motion, while our model gives the frequencies inside 2000~2100 cm−1. For guanine, the deviation also happens on the C6=O stretching. It involves the angle bending N1-C6-C5 and both two ring structures show large deformation. Because we only utilized stretch-bend cross-terms in this work, there might be part of coupling effects that cannot be fully described. More coupling terms between different structures (such as those involving torsion) can be included to improve the performance for these complex situations further.

Here we assessed how much the cross-terms and the opbend terms contribute to the overall performance in Table 4. We re-classify Group (1)~(3) into 2 groups: Group A and Group B by the number of heavy atoms in each molecule (Group A: <10; Group B: >=10). The cross-terms impact the overall performance of vibration analysis, but opbend terms show minimal influence since the previous works about the simple quadratic model for stretching and bending reported the best RMSD up to nearly 60 cm−1. It reveals that anharmonicity plays the most significant role in generating accurate frequencies. Besides, the results are improved consistently for all the groups except Group B when stretch-bend cross terms have been used in this work, indicating that cross terms can capture more details in structural vibrations. The molecules in Group B mostly include long carbon chain or large carbon ring, which demand more coupling effects considered. Notably, the improvement for those molecules with largely conjugated structures is remarkable: the RMSD for benzene derivatives is decreased by 27.3%, 16.1% for heterocyclics, 16.5% for bio-fragments, and 31.6% for the combinations of multiple aromatic cycles. These molecules possess complex structures, so the motion of interior atoms can easily impact the surrounding bonds and angles. In general, the RMSD of frequencies for the training set goes down by 13.6% by adding cross-terms. Opbend terms have very little effect on the frequencies and made them slightly worse for certain groups. These are likely vibration modes involving the motion in which some sp2 atoms popping out of the plane and adding opbend helps keep the correct structure and energy while introducing noise to vibrational frequencies.

Table 4.

Statistical comparison of QM scaled frequency of normal modes and the values calculated by MM model with or without coupling terms and opbend terms in the training set.

Group name a) w. coupling
w. opbend b)
w. coupling
w.o. opbend b)
w.o. coupling
w. opbend b)
RMSE MRE RMSE MRE RMSE MRE
Group A 38.4279 1.81% 39.0842 1.83% 41.9415 1.95%
Group B 34.2120 1.72% 33.4050 1.69% 33.2874 1.73%
Benzene derivatives 28.5765 1.45% 28.5561 1.45% 39.3057 1.93%
Heterocyclics 41.0277 1.91% 39.8838 1.89% 48.8816 2.38%
Bio-fragments 39.0039 1.84% 38.7749 1.88% 46.6939 2.33%
Combination of multiple cycles 28.8252 1.41% 28.8945 1.42% 42.4678 2.11%
Total 36.6139 1.73% 36.7788 1.75% 42.6315 2.04%
a)

Group A means those molecules with less than 10 heavy atoms (non-hydrogen) and Group B means those molecules with ≥ 10 heavy atoms. Both Group A and Group B contain those molecules which are not involved in other categories listed in the table.

b)

RMSE: root mean square error (unit: cm−1); MRE: mean relative error (1Ni=1N|ν(QM)ν(MM)|ν(QM)×100%) has been included in the parenthesis. R2: correlation coefficient.

3.4. Potential energy surface for opbend parameters

Out-of-plane parameters are used to avoid the abnormal pyramidalization of the planar structures with sp2 atoms in the center. The results of the current polynomial model with the corresponding parameters of the training set for opbend fitting have been illustrated in Figure 8(a). In Figure 8(a). Most of the data points are well matched between the model and QM energy within a wide range up to ~90 kcal/mol, while the model collectively underestimates the QM reference values. This suggests that our model can reflect the shape of potential energy surface during pyramidalization with high accuracy, even when the deformation of the structure is very large. The energy RMSD for the training set is 0.8375 kcal/mol for the Opbend-small set (maximum QM energy 22.9956 kcal/mol with average 2.3185 kcal/mol), 3.5226 kcal/mol for the Opbend-large set (maximum QM energy 91.0101 kcal/mol with average 12.9017 kcal/mol) and 2.5582 kcal/mol for the whole set (Table 5). Although Allinger’s model can improve the description of potential energy surface in the out-of-plane bending, the simple model cannot capture the effect fully. As a result, the model energy tends to be smaller than expected from overall observation. Nonetheless, this model separates the out-of-plane motion into three components from each side atoms, which makes the parameters more transferable and flexible in applying in many distinct planar structures.

Figure 8.

Figure 8.

The correlation between the energy difference calculated by QM and MM for all the configurations in (a) the training set and (b) the validation set. Opbend-small refers to small deformation and opbend-large set refers to large deformation of the specified structures, as described in Section 2.3(4).

Table 5.

Statistical comparison of QM energy differences and the values calculated by the MM model.

Group name b) Training set c) Validation set d)
Opbend-small a) RMSE 0.8375 1.3453
QM_Max 22.9956 17.6173
QM_Ave 2.3185 3.4736
Opbend-large a) RMSE 3.5226 4.1265
QM_Max 91.0101 53.3861
QM_Ave 12.9017 15.4029
Total RMSE 2.5582 2.8082
R2 0.9743 0.9471
QM_Ave 7.6003 8.2367
a)

Opbend-small and Opbend-large groups rely on the scale of deformation of the specified structures (see section 2.2).

b)

RMSE: root mean square error; QM_Max: maximum of the absolute QM ΔE; QM_Ave: average of the absolute QM ΔE. (Unit: kcal∙mol−1).

c)

R2: correlation coefficient. The total number of configurations for the training set is 4318.

d)

R2: correlation coefficient. The total number of configurations for the validation set is 829.

3.5. Validation

To test the accuracy and transferability of the valence parameters derived in this work, we applied these parameters to all the 120 molecules in the validation set. All the parameters for the validation set molecules are automatically assigned or generated via the ranking tree method described in Methodologies. Frequency validation results are shown in Figure 7(b) and Table 3. The RMSD in frequencies for each group of validation set is mostly consistent with the results from the training set, ca. ~34 cm−1. In addition, we compared the equilibrium b0 and θ0 parameters assigned based on atom type with QM-optimized geometry for the 120 molecules. The overall RMSD is less than 0.05Å for b0 and less than 5° for θ0 (see Data Availability). For the out-of-plane effects, the performance of the opbend parameters is satisfying as shown in Figure 8(b) and Table 5. The RMSD between QM and MM energy for all the 829 configurations from 120 molecules is ~2.8 kcal/mol. Overall the validation results demonstrated high accuracy and transferability of our parameters for a wide range of small organic molecules.

4. Conclusion

The classical harmonic equations have been used for decades to calculate the potential energy of bonded interactions by many FFs. While computationally attractive, it shows deficiencies in describing the potential energy surface and vibrational frequencies of molecules compared to quantum calculations. AMOEBA-family FFs apply the MM3(MM4)-style, anharmonic valence interaction model to improve the accuracy of structure and spectroscopic properties. The model employs high-order polynomial functionals in bond stretching and angle bending, the cross-terms that couple different valence motions, and out-of-plane bending terms. The benefits of these high-order and coupling terms are evidenced in this work. They noticeably improve the accuracy of normal mode frequencies, indicating a strong anharmonicity in bonds and angles. The cross-terms capture more details of the intramolecular motions and significantly improve normal mode frequencies with consistently high accuracy, especially for large structures and conjugation molecules. In addition, our results indicate that opbend terms generate correct potential energy surface around the planar structure for those sp2 atoms.

The current parameter set can be applied to a wide range of distinct functional groups, and it is expandable to tackle additional chemistry in the future. The modified Seminario method has been combined with the current atom types of the AMOEBA(+) model, providing reliable initial force constant parameters of bond stretching and angle bending as reference values from QM Hessian. Refinement of bond, angle, and bond-angle coupling explicitly targeting the QM frequency data further increases the quality of these force constant parameters. The opbend parameters are derived by fitting to the potential energy surface of distorted structures composing sp2 atoms. It is interesting to examine how good our model can describe the overall geometry of simple compounds comparing to high-level QM or experimental structures. However, this will not be practical until all the other energy components are parameterized, as both the valence and non-boned interactions are responsible for the accurate prediction of the molecular geometry. An automated program to assign the parameters has also been presented in this work, which is accessible to the public (see Data availability). This program as well as the parameters, will be further implemented in the canonical Tinker 36 and Poltype software packages 37.

Supplementary Material

Supplementary information

Acknowledgments

This work was supported by the National Institutes of Health (R01GM106137 and R01GM114237), and National Science Foundation (CHE-1856173).

Data availability

1. The molecules in tinker format and final bonded parameters can be accessed at: https://github.com/prenlab/amoebaplus_data/tree/master/valence2021

2. The programs developed in this work are accessible through the GitHub repository at: https://github.com/prenlab/amoebaplus_parameter

References

  • 1.Dauber-Osguthorpe P; Hagler AT, Biomolecular force fields: where have we been, where are we now, where do we need to go and how do we get there? J Comput Aided Mol Des 2019, 33 (2), 133–203. [DOI] [PubMed] [Google Scholar]
  • 2.Hagler AT, Force field development phase II: Relaxation of physics-based criteria… or inclusion of more rigorous physics into the representation of molecular energetics. J Comput Aided Mol Des 2019, 33 (2), 205–264. [DOI] [PubMed] [Google Scholar]
  • 3.Jing Z; Liu C; Cheng SY; Qi R; Walker BD; Piquemal JP; Ren P, Polarizable Force Fields for Biomolecular Simulations: Recent Advances and Applications. Annu Rev Biophys 2019, 48, 371–394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gilson MK; Zhou HX, Calculation of protein-ligand binding affinities. Annu Rev Biophys Biomol Struct 2007, 36, 21–42. [DOI] [PubMed] [Google Scholar]
  • 5.Gallicchio E; Levy RM, Recent theoretical and computational advances for modeling protein-ligand binding affinities. Adv Protein Chem Struct Biol 2011, 85, 27–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Harger M; Lee JH; Walker B; Taliaferro JM; Edupuganti R; Dalby KN; Ren P, Computational insights into the binding of IN17 inhibitors to MELK. J Mol Model 2019, 25 (6), 151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kaoud TS; Johnson WH; Ebelt ND; Piserchio A; Zamora-Olivares D; Van Ravenstein SX; Pridgen JR; Edupuganti R; Sammons R; Cano M; Warthaka M; Harger M; Tavares CDJ; Park J; Radwan MF; Ren P; Anslyn EV; Tsai KY; Ghose R; Dalby KN, Modulating multi-functional ERK complexes by covalent targeting of a recruitment site in vivo. Nat Commun 2019, 10 (1), 5232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Neupane K; Manuel AP; Woodside MT, Protein folding trajectories can be described quantitatively by one-dimensional diffusion over measured energy landscapes. Nature Physics 2016, 12 (7), 700–703. [Google Scholar]
  • 9.Lonsdale R; Ranaghan KE; Mulholland AJ, Computational enzymology. Chemical Communications 2010, 46 (14), 2354–2372. [DOI] [PubMed] [Google Scholar]
  • 10.Qiu Y; Smith D; Boothroyd S; Jang H; Wagner J; Bannan CC; Gokey T; Lim VT; Stern C; Rizzi A; Lucas X; Tjanaka B; Shirts MR; Gilson M; Chodera J; Bayly CI; Mobley D; Wang L-P, Development and benchmarking of open force field v1. 0.0, the parsley small molecule force field. ChemRxiv 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Maier JA; Martinez C; Kasavajhala K; Wickstrom L; Hauser KE; Simmerling C, ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. Journal of Chemical Theory and Computation 2015, 11 (8), 3696–3713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Debiec KT; Cerutti DS; Baker LR; Gronenborn AM; Case DA; Chong LT, Further along the Road Less Traveled: AMBER ff15ipq, an Original Protein Force Field Built on a Self-Consistent Physical Model. Journal of Chemical Theory and Computation 2016, 12 (8), 3926–3947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wang L-P; McKiernan KA; Gomes J; Beauchamp KA; Head-Gordon T; Rice JE; Swope WC; Martínez TJ; Pande VS, Building a More Predictive Protein Force Field: A Systematic and Reproducible Route to AMBER-FB15. The Journal of Physical Chemistry B 2017, 121 (16), 4023–4039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Salomon-Ferrer R; Case DA; Walker RC, An overview of the Amber biomolecular simulation package. WIREs Computational Molecular Science 2013, 3 (2), 198–210. [Google Scholar]
  • 15.Jorgensen WL; Maxwell DS; Tirado-Rives J, Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids. Journal of the American Chemical Society 1996, 118 (45), 11225–11236. [Google Scholar]
  • 16.Kaminski GA; Friesner RA; Tirado-Rives J; Jorgensen WL, Evaluation and Reparametrization of the OPLS-AA Force Field for Proteins via Comparison with Accurate Quantum Chemical Calculations on Peptides. The Journal of Physical Chemistry B 2001, 105 (28), 6474–6487. [Google Scholar]
  • 17.Jorgensen WL; Tirado-Rives J, The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. Journal of the American Chemical Society 1988, 110 (6), 1657–1666. [DOI] [PubMed] [Google Scholar]
  • 18.MacKerell AD; Bashford D; Bellott M; Dunbrack RL; Evanseck JD; Field MJ; Fischer S; Gao J; Guo H; Ha S; Joseph-McCarthy D; Kuchnir L; Kuczera K; Lau FTK; Mattos C; Michnick S; Ngo T; Nguyen DT; Prodhom B; Reiher WE; Roux B; Schlenkrich M; Smith JC; Stote R; Straub J; Watanabe M; Wiórkiewicz-Kuczera J; Yin D; Karplus M, All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins. J. Phys. Chem. B 1998, 102 (18), 3586–3616. [DOI] [PubMed] [Google Scholar]
  • 19.Vanommeslaeghe K; Hatcher E; Acharya C; Kundu S; Zhong S; Shim J; Darian E; Guvench O; Lopes P; Vorobyov I; Mackerell AD Jr., CHARMM general force field: A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. Journal of computational chemistry 2010, 31 (4), 671–690. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Lopes PE; Guvench O; MacKerell AD Jr., Current status of protein force fields for molecular dynamics simulations. Methods Mol Biol 2015, 1215, 47–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ren P; Ponder JW, Polarizable Atomic Multipole Water Model for Molecular Mechanics Simulation. The Journal of Physical Chemistry B 2003, 107 (24), 5933–5947. [Google Scholar]
  • 22.Liu C; Piquemal J-P; Ren P, AMOEBA+ Classical Potential for Modeling Molecular Interactions. J. Chem. Theory Comput 2019, 15 (7), 4122–4139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Yang X; Liu C; Walker BD; Ren P, Accurate description of molecular dipole surface with charge flux implemented for molecular mechanics. The Journal of Chemical Physics 2020, 153 (6), 064103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Allinger NL; Yuh YH; Lii JH, Molecular mechanics. The MM3 force field for hydrocarbons. 1. Journal of the American Chemical Society 1989, 111 (23), 8551–8566. [Google Scholar]
  • 25.Allinger NL; Chen K; Lii J-H, An improved force field (MM4) for saturated hydrocarbons. Journal of Computational Chemistry 1996, 17 (5‐6), 642–668. [Google Scholar]
  • 26.Maple JR; Dinur U; Hagler AT, Derivation of force fields for molecular mechanics and dynamics from ab initio energy surfaces. Proc Natl Acad Sci U S A 1988, 85 (15), 5350–5354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Lii JH; Allinger NL, Molecular mechanics. The MM3 force field for hydrocarbons. 2. Vibrational frequencies and thermodynamics. Journal of the American Chemical Society 1989, 111 (23), 8566–8575. [Google Scholar]
  • 28.Maple JR; Hwang MJ; Jalkanen KJ; Stockfisch TP; Hagler AT, Derivation of class II force fields: V. Quantum force field for amides, peptides, and related compounds. Journal of Computational Chemistry 1998, 19 (4), 430–458. [Google Scholar]
  • 29.Palmo K; Mirkin NG; Pietila LO; Krimm S, Spectroscopically determined force fields for macromolecules. 1. n-Alkane chains. Macromolecules 1993, 26 (25), 6831–6840. [Google Scholar]
  • 30.Hagler AT, Quantum Derivative Fitting and Biomolecular Force Fields: Functional Form, Coupling Terms, Charge Flux, Nonbond Anharmonicity, and Individual Dihedral Potentials. Journal of Chemical Theory and Computation 2015, 11 (12), 5555–5572. [DOI] [PubMed] [Google Scholar]
  • 31.Allen AEA; Payne MC; Cole DJ, Harmonic Force Constants for Molecular Mechanics Force Fields via Hessian Matrix Projection. Journal of Chemical Theory and Computation 2018, 14 (1), 274–281. [DOI] [PubMed] [Google Scholar]
  • 32.Jensen F, Introduction to Computational Chemistry. Wiley 2017, 28. [Google Scholar]
  • 33.Nevins N; Lii J-H; Allinger NL, Molecular mechanics (MM4) calculations on conjugated hydrocarbons. Journal of Computational Chemistry 1996, 17 (5‐6), 695–729. [Google Scholar]
  • 34.Seminario JM, Calculation of intramolecular force fields from second-derivative tensors. International Journal of Quantum Chemistry 1996, 60 (7), 1271–1277. [Google Scholar]
  • 35.Robertson MJ; Tirado-Rives J; Jorgensen WL, Improved Peptide and Protein Torsional Energetics with the OPLS-AA Force Field. Journal of Chemical Theory and Computation 2015, 11 (7), 3499–3509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rackers JA; Wang Z; Lu C; Laury ML; Lagardère L; Schnieders MJ; Piquemal J-P; Ren P; Ponder JW, Tinker 8: Software Tools for Molecular Design. Journal of Chemical Theory and Computation 2018, 14 (10), 5273–5289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wu JC; Chattree G; Ren P, Automation of AMOEBA polarizable force field parameterization for small molecules. Theor. Chem. Acc 2012, 131 (3), 1138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Frisch MJT,GW; Schlegel HB; Scuseria GE; Robb MAC,JR; Scalmani G; Barone V; Mennucci B; Petersson GAN,H; Caricato M; Li X; Hratchian H;P; Izmaylov AFB,J; Zheng G; Sonnenberg JL; Hada M; Ehara MT,K; Fukuda R; Hasegawa J; Ishida M; Nakajima T; Honda YK,O; Nakai H; Vreven T; Montgomery JA Jr.; Peralta JEO,F; Bearpark M; Heyd JJ; Brothers E; Kudin KN; Staroverov VNK,R; Normand J; Raghavachari K; Rendell AB,JC; Iyengar SS; Tomasi J; Cossi M; Rega N; Millam JMK,M; Knox JE; Cross JB; Bakken V; Adamo CJ,J; Gomperts R; Stratmann RE; Yazyev O; Austin AJC,R; Pomelli C; Ochterski JW; Martin RL; Morokuma KZ,VG; Voth GA; Salvador P; Dannenberg JJD,S; Daniels AD; Farkas Ö; Foresman JBO,JV; Cioslowski J; Fox DJ, Gaussian 09. Gaussian, Inc., Wallingford CT, 2009 2009. [Google Scholar]
  • 39.Hehre WJ, A guide to molecular mechanics and quantum chemical calculations. Wavefunction Irvine, CA: 2003; Vol. 2. [Google Scholar]
  • 40.Johnson RD. NIST Computational Chemistry Comparison and Benchmark Database, accessed 2020. http://cccbdb.nist.gov/.
  • 41.Irikura KK; Johnson RD; Kacker RN, Uncertainties in Scaling Factors for ab Initio Vibrational Frequencies. The Journal of Physical Chemistry A 2005, 109 (37), 8430–8437. [DOI] [PubMed] [Google Scholar]
  • 42.Virtanen P; Gommers R; Oliphant TE; Haberland M; Reddy T; Cournapeau D; Burovski E; Peterson P; Weckesser W; Bright J; van der Walt SJ; Brett M; Wilson J; Millman KJ; Mayorov N; Nelson ARJ; Jones E; Kern R; Larson E; Carey CJ; Polat İ; Feng Y; Moore EW; VanderPlas J; Laxalde D; Perktold J; Cimrman R; Henriksen I; Quintero EA; Harris CR; Archibald AM; Ribeiro AH; Pedregosa F; van Mulbregt P; Vijaykumar A; Bardelli AP; Rothberg A; Hilboll A; Kloeckner A; Scopatz A; Lee A; Rokem A; Woods CN; Fulton C; Masson C; Häggström C; Fitzgerald C; Nicholson DA; Hagen DR; Pasechnik DV; Olivetti E; Martin E; Wieser E; Silva F; Lenders F; Wilhelm F; Young G; Price GA; Ingold G-L; Allen GE; Lee GR; Audren H; Probst I; Dietrich JP; Silterra J; Webber JT; Slavič J; Nothman J; Buchner J; Kulick J; Schönberger JL; de Miranda Cardoso JV; Reimer J; Harrington J; Rodríguez JLC; Nunez-Iglesias J; Kuczynski J; Tritz K; Thoma M; Newville M; Kümmerer M; Bolingbroke M; Tartre M; Pak M; Smith NJ; Nowaczyk N; Shebanov N; Pavlyk O; Brodtkorb PA; Lee P; McGibbon RT; Feldbauer R; Lewis S; Tygier S; Sievert S; Vigna S; Peterson S; More S; Pudlik T; Oshima T; Pingel TJ; Robitaille TP; Spura T; Jones TR; Cera T; Leslie T; Zito T; Krauss T; Upadhyay U; Halchenko YO; Vázquez-Baeza Y; SciPy C, SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods 2020, 17 (3), 261–272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Parrish RM; Burns LA; Smith DGA; Simmonett AC; DePrince AE; Hohenstein EG; Bozkaya U; Sokolov AY; Di Remigio R; Richard RM; Gonthier JF; James AM; McAlexander HR; Kumar A; Saitow M; Wang X; Pritchard BP; Verma P; Schaefer HF; Patkowski K; King RA; Valeev EF; Evangelista FA; Turney JM; Crawford TD; Sherrill CD, Psi4 1.1: An Open-Source Electronic Structure Program Emphasizing Automation, Advanced Libraries, and Interoperability. J. Chem. Theory Comput 2017, 13 (7), 3185–3197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Wang R; Ozhgibesov M; Hirao H, Partial hessian fitting for determining force constant parameters in molecular mechanics. J Comput Chem 2016, 37 (26), 2349–59. [DOI] [PubMed] [Google Scholar]
  • 45.Ryckaert J-P; Ciccotti G; Berendsen HJC, Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. Journal of Computational Physics 1977, 23 (3), 327–341. [Google Scholar]
  • 46.Ciccotti G; Ryckaert JP, Molecular dynamics simulation of rigid molecules. Computer Physics Reports 1986, 4 (6), 346–392. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary information

Data Availability Statement

1. The molecules in tinker format and final bonded parameters can be accessed at: https://github.com/prenlab/amoebaplus_data/tree/master/valence2021

2. The programs developed in this work are accessible through the GitHub repository at: https://github.com/prenlab/amoebaplus_parameter

RESOURCES