Abstract
In this study several commonly used implicit solvent models are compared with respect to their accuracy of estimating solvation energies of small molecules and proteins, as well as desolvation penalty in protein-ligand binding. The test set consists of 19 small proteins, 104 small molecules, and 15 protein-ligand complexes. We compared predicted hydration energies of small molecules with their experimental values; the results of the solvation and desolvation energy calculations for small molecules, proteins and protein-ligand complexes in water were also compared with Thermodynamic Integration calculations based on TIP3P water model and Amber12 force field. The following implicit solvent (water) models considered here are: PCM (Polarized Continuum Model implemented in DISOLV and MCBHSOLV programs), GB (Generalized Born method implemented in DISOLV program, S-GB, and GBNSR6 stand-alone version), COSMO (COnductor-like Screening Model implemented in the DISOLV program and the MOPAC package) and the Poisson-Boltzmann model (implemented in the APBS program). Different parameterizations of the molecules were examined: we compared MMFF94 force field, Amber12 force field and the quantum-chemical semi-empirical PM7 method implemented in the MOPAC package. For small molecules, all of the implicit solvent models tested here yield high correlation coefficients (0.87–0.93) between the calculated solvation energies and the experimental values of hydration energies. For small molecules high correlation (0.82–0.97) with the explicit solvent energies is seen as well. On the other hand, estimated protein solvation energies and protein-ligand binding desolvation energies show substantial discrepancy (up to 10 kcal/mol) with the explicit solvent reference. The correlation of polar protein solvation energies and protein-ligand desolvation energies with the corresponding explicit solvent results is 0.65–0.99 and 0.76–0.96 respectively, though this difference in correlations is caused more by different parameterization and less by methods and indicates the need for further improvement of implicit solvent models parameterization. Within the same parameterization, various implicit methods give practically the same correlation with results obtained in explicit solvent model for ligands and proteins: e.g. correlation values of polar ligand solvation energies and the corresponding energies in the frame of explicit solvent were 0.953–0.966 for the APBS program, the GBNSR6 program and all models used in the DISOLV program. The DISOLV program proved to be on a par with the other used programs in the case of proteins and ligands solvation energy calculation. However, the solution of the Poisson-Boltzmann equation (APBS program) and Generalized Born method (implemented in the GBNSR6 program) proved to be the most accurate in calculating the desolvation energies of complexes.
Keywords: solvation, implicit solvent models, explicit water, protein-ligand binding
Graphical Abstract
1. Introduction
1.1 The role of solvent in drug development efforts
Small-molecule inhibitors represent the basis for new drugs: their effect usually is to block the active site of the target protein associated with a disease. The initial stage of new drug development is to find such molecules which are inhibitors of the given target-protein; this task may involve analysis of many thousands of small molecule candidates. Quick and effective solution of this problem decreases considerably material costs and duration of subsequent stages of the drug development. Nowadays, this task can be addressed effectively with the help of computer simulations [1,2]. High accuracy of the protein-ligand binding energy calculations is the key problem to be solved to increase considerably the effectiveness of computer simulation in the new inhibitors development. Sufficiently reliable prediction of the inhibition activity can be achieved if the error of the calculation of the protein–inhibitor binding energy does not exceed 1 kcal/mol [2]. So, the accuracy of the calculation of various contributions to the protein–ligand binding energy should be better 1 kcal/mol.
The desolvation energy (the difference between the complex solvation energy and the sum of the protein and ligand solvation energies) gives a significant contribution to the free energy of protein-ligand binding in the protein-ligand binding process, and its calculation accuracy directly determines the calculation accuracy of the ligand binding constant [3–10]. In this paper we focus on calculations of the contribution due to the interaction of molecules with solvent (solvation and desolvation energies). In human or animal organisms as well as in experiments in vitro and in vivo the protein-ligand binding occurs in solvent (in the aqueous solution). So, the presence of solvent (water) must be taken into account in the calculation of protein-ligand binding energy. Upon protein-ligand binding, solvent is displaced partly from the active site of the target protein, and some of ligand and protein atoms cease to interact with solvent.
There are two commonly used approaches to calculate solvation energy: those based on the explicit solvent model, and those that utilize the implicit (or continuum) one. Of the two models the former is considered be more accurate but at the same time much more expensive computationally – the solvent is described as an ensemble of larger number of discrete water molecules. In contrast, the orders of magnitude less time-consuming implicit solvent model [11–24] is represented by the homogeneous continuum with the dielectric constant ε (for water ε = 80 at 300 K) filling the space around the solute molecule. In this model the dominant contribution to the solvation energy is its electrostatic part: Coulomb interaction of solute atoms charges with the polarization charges induced on the dielectric boundary. Within the basic continuum solvent framework, this interaction can be estimated through numerical solution of the three-dimensional Poisson–Boltzmann (PB) equation by using freely available software such as APBS [25]. In addition, there are several algorithms (models) for the calculation of the polar component of the solvation energy of molecules focused on solving the relevant equations on the dielectric boundary.
Since numerical solutions of the PB equation are also relatively time-consuming, a variety of fast approximations to these solutions for biomolecules has been developed. Three different algorithms of the solvation energy polar component calculation are implemented in the DISOLV program [26, 27]: PCM (Polarized Continuum Model), S-GB (Surface Generalized Born method proposed in [28]) and COSMO (COnductor-like Screening Model) [19]. All these three implementations demonstrate high numerical accuracy for the sufficiently small triangulation network step size on the same solvent boundary, and the PCM method demonstrates highest accuracy, but it needs more computing time. The faster algorithm of the same PCM method has been recently implemented in the MCBHSOLV program [27, 29] using a novel multicharge approximation for large dense matrices.
The performance of this algorithm can be up to two orders of acceleration (for the triangulation step size of 0.1 Å and for solute molecules of 2000 – 4000 atoms) as compared to the PCM implementation in the DISOLV program without loss of accuracy [27, 29]. Smaller acceleration is obtained for the greater step size of the triangulation network and for smaller molecules.
Methods implemented in DISOLV and MCBHSOLV for the same smooth solvent boundary surface [26, 27, 30] have been parameterized [26] in the frame of the MMFF94 force field [31] and they demonstrate high numerical accuracy as well as good parameterization, i.e. good correspondence of the calculated hydration energies to experimentally measured ones for a set of more than 400 small molecules [26]. Taking into account that DISOLV and MCBHSOLV programs are used successfully for new inhibitors development [32–34] at the post-processing stage [10] and in the gridless docking procedure [35] in the frame of the MMFF94 force field [31] there is a need to compare results of DISOLV and MCBHSOLV calculations with ones performed with other implicit solvent programs and parameterization methods [14, 20–22, 25] for same sets of molecules. Here we will also examine a relatively recent addition to the family of GB models, GBNSR6, which has already demonstrated high accuracy in estimates of hydration free energies of small molecules [14].
Accuracy of solvent representation is just one variable that affects the over-all accuracy of solvation energy predictions: the underlying gas-phase force-filed, including the choice of partial charges, affect the accuracy significantly [36]. Thus in D. Mobley’s recent study [36] various approaches for obtaining partial charges for computing hydration free energies of small molecules have been tested to examine the influence of charge model on agreement with experiment. Also, other studies [37, 38] demonstrated that calculation of small molecules solvation energies allows one to test whether the given parameterization is adequate, and to improve the underlying methods and force fields. In this respect, the novel quantum-chemical semi-empirical PM7 method [39] included in the MOPAC package [40] is becoming popular recently for new inhibitors development at the postprocessing stage [32, 33, 41]. Due to the unprecedentedly wide range of molecules used for the parameterization of the PM7 method and due to including corrections on dispersion interactions as well as ones for hydrogen and halogen bonds PM7 is significantly superior to previous semi-empirical methods in respect of calculation accuracy, especially for intermolecular interactions [39], including those semi-empirical methods used for the atomic charge calculations in some force fields, e.g. the Amber force field.
In this work we compare not only the different methods of solvation energy calculation, such as in [21, 23], but also several different implicit solvent models in their various software implementations. Among these implementations DISOLV and MCBHSOLV programs are interesting due to high and controlled numerical accuracy of realization of three implicit models PCM, COSMO and SGB for the one and the same smooth SES surface constructed by primary and secondary rolling procedures [26, 27, 30]: it was shown that for sufficiently small triangulation network step size (0.1 – 0.3 Å) variations of the solvation energy due to variations of the triangulation network position on SES are less than 1 kcal/mol [26, 27]. However, there is no detailed comparison of the DISOLV program with other independent programs, which realize the solvation energy calculations. The other reason is that the algorithms used in DISOLV and MCBHSOLV programs are also implemented in the FLM docking program [35] and they directly influence the docking accuracy.
Thus, we compare the above methods and their implementations with one another, with experimental data for small molecules, and with the results of the much more computationally expensive reference calculations using the explicit water model for the same sets of small and large molecules including protein-ligand complexes. To the best of our knowledge, no comprehensive accuracy comparison of all of the above methods on the same footing is available. In addition, we perform the COSMO calculations implemented in MOPAC using parameterizations (atoms charges and radii) of the new semiempirical quantum-chemical PM7 method.
2. Materials and methods
2.1 Solvent models. Non-polar component of the solvation energy
In the process of dissolution the molecule passes from vacuum into the solvent. The corresponding change of the Gibbs free energy ΔGs is mainly due to change of the solute-solvent interaction (e.g. air vs. water), and, secondly, due to formation of the cavity in the solvent where the solute molecule is located:
where ΔGpol is the polar component of the solute-solvent interaction, ΔGnp is the van der Waals solute-solvent interaction, and ΔGcav is the cavitation component of solvation free energy due to the cavity formation in the solvent. The latter two terms are the non-polar part of the solvation free energy ΔGnp + ΔGcav. This part can be calculated in the explicit and implicit water models, however, its contribution to the total solvation energy is relatively small compared with the polar component for most systems. The simplest empirical formula for calculation of the non-polar component of the solvation energy is as follows [42]:
where SSAS is the SAS surface area (Solvent Accessible Surface), b and σ are fitting parameters whose values may depend on the solvent and the types of atoms composing the solute molecule, e.g. σ = 0.00387 kcal/(mol*Å2), b = 0.698 kcal/mol [42] for water in the non-polar calculations included in DISOLV [26] and MCBHSOLV [27] programs. A more accurate approach described in [43], represents the non-polar part of the solvation energy depending not only on SSAS, but also on the energy of van der Waals interactions of individual atoms of the solute molecule with the solvent. This approach is implemented in the GBNSR6 [14] program. In the MOPAC [40] program non-polar components were not calculated.
2.2 Solvation models. Polar component of the solvation energy
2.2.1 Explicit solvent model
In the framework of the explicit solvent model, the aqueous environment of the solute molecule is represented in the form of a set of individual water molecules constituting the hydration shell. This model allows an atomic level description of the interactions between the solute and water. One of the most popular models of explicit water molecules is the rigid three-point TIP3P model developed by Jorgensen et al. [44]. In this model the charge of the oxygen atom equals to q(O) = −0.834e and each of the two hydrogen atoms has the charge of q(H) = 0.417e. Length of O-H chemical bonds equals to r = 0.957 Å, the value of the valence angle is θ = 104.52°. While some modern water models [45] surpass TIP3P with respect to accuracy of hydration energy estimates, these estimates are, in general, computationally very demanding. Therefore, here we use results from a previous comprehensive estimates of hydration free energy in the explicit water for small molecules from Ref. [38]. In the case of proteins and protein-ligand complexes the explicit water model was used as described in [22] to perform calculations with the Amber package [46] by the thermodynamic integration (TI) method implemented in the sander program (the Amber package). Obtained values of the solvation energy are used as reference data to evaluate the quality of implicit solvent models.
Convergence of the TI protocol and its sensitivity to the initial conditions were checked for two randomly selected complexes as it is described in [22]. For these complexes the calculations were repeated using the different random seeds, and it was notices that the obtained received differences didn't exceed a standard deviation for the calculated solvation energy values, which is of ±0.7 kcal/mol [22] for complexes and proteins. To confirm the convergence of the TI results the simulation time was extended from 2 to 5 nanoseconds, and it was observed that the resulting values of the solvation energies didn't exceed a standard deviation.
2.2.2 Implicit solvent models
Implicit (or continuum) models consider solvent as a continuous homogeneous medium describing electrostatic interactions of a solute molecule with solvent. This medium has predetermined electrostatic properties including a specified dielectric constant; the solute is separated from the solvent by a dielectric boundary (DB), the results of practical calculations are very sensitive to the choice of DB [47], with several possible, generally non-equivalent, choices [20]. In this study the PCM, S-GB and COSMO models implemented in the DISOLV program and the PCM model in the MCBHSOLV program employ the same DB (the Solvent Excluded Surface or SES) constructed as follows [10, 26, 27, 30]. The molecule is represented as an ensemble of hard spheres centered at the nucleus of atoms; radii of these spheres are different for different atom types and they are parameters of the continuum solvent model; the construction of SES involves two main steps: the primary and the secondary probe rolling [48, 49] steps. The primary rolling step is the construction of the molecular surface by rolling of the probe sphere over the solute molecule, which simulates the solvent molecule. All possible points of contact of primary rolling sphere and atoms' spheres determine points of SES. The primary rolling procedure may sometimes result in undesirable self-intersections of the surface and fractures; the secondary rolling procedure is applied for SES smoothing [49]. The essence of the secondary rolling method is to replace the surface fragments close to self-penetrations and fractures by other smooth fragments defined by the rolling of SES near to fractures with the small sphere (called the secondary rolling sphere). The TAGSS program [26, 27, 30] is the program building the surface with application of primary and secondary rolling. It should be noted that the GBNSR6 program [14] uses other method relies on MSMS-based solvent excluded surface [50] to represent the dielectric boundary.
Within the framework of the continuum (implicit) solvent model the polar part of the solvation energy is the energy of electrostatic interaction between the atoms charges of the molecule located in the cavity of the dielectric continuum and the polarization charges induced on SES. Implicit solvent models are currently included in many software packages for molecular modeling: quantum chemical packages Gaussian [51], Gamess [52], MolPro [53], MOPAC [40], and molecular dynamics packages such as Charmm [54] and Amber [46]. In addition there are independent software implementations for finding the polar part of the solvation energy, for example, DelPhi [15], APBS [25], DISOLV [26, 29], MCBHSOLV [27, 29].
The popular approach to estimate the solvation free energy is the Generalized Born model (GB) [16], in which the Green function of the Poisson equation is approximated by a simple closed form expression, see e.g. Ref. [13] for a review. The GB implementation in DISOLV program calculates the electrostatic part of the solvation free energy as follows [26, 28]:
(1) |
where R⃗i,j = R⃗i − R⃗j, R⃗i is the radius-vector that defines the position of the charge Qi of i-th atom of the solute molecule, c is an empirical constant (c=8 in the DISOLV program), summation is over all atoms of the solute molecule, and the Born radii ai can be calculated by different ways: by the volumetric method [16] as the integrals over the cavity volume and the surface method (S-GB) [28], in which the integrals are calculated over SES. The Born radii in SGB are calculated as follows [28]:
(2) |
Here An are empirical constants, are the integrals over SES:
(3) |
where n⃗s is the normal to SES, r⃗s is the radius-vector of the point at SES.
The GB polar component of the solvation energy calculated by the GBNSR6 program is realized as follows [14]:
(4) |
where εin and εout are the dielectric constants of the solute and the solvent respectively, β= εin/εout, α = 0.571412, and A is the electrostatic size [55] of the molecule. The extra terms in the Green function, Eq 4, compared to the original one due to Still et al (Eq. 1) ensure physically correct dependence on the dielectric constants [55, 56].
Here the effective Born radii ai is calculated by the following equation [14]:
(5) |
where ∂V represents the molecular surface of the molecule, and dS is the infinitesimal surface vector.
Further, in the work we will perform the calculations using the GB method implemented in the GBNSR6 program described in [14], as well as the GB-calculations using the S-GB method implemented in the DISOLV program.
The PCM (Polarized Continuum Model) [12, 17, 18, 26, 27] method is the reduction of the three-dimension Poisson equation to the integral equation at SES (see below). It is assumed in the DISOLV program that the dielectric constant inside the surface equals to 1 and the outside dielectric constant equals to ε. The density of the polarization charges induced on SES can be determined from the integral equation:
(6) |
where σ(r) is the surface density of charge induced on SES at point with radius-vector r⃗, r⃗' is the radius-vector of the surface element, R⃗i is the point that defines the position of the atomic charge Qi of each atom of the solute molecule, n⃗ is the normal to the cavity surface directed into the solvent; ε is the dielectric constant of the solvent. The electrostatic component of the interaction energy with the solvent equals to:
(7) |
To solve the resulting integral equations the DB surface SES is divided into the small elements, i.e. it is triangulated, and the equation is converted to the matrix representation:
(8) |
where q is the column vector of the polarization charges σi of discrete surface elements Si, Q – is the column vector of the solute molecule atoms' charges.
This matrix equation is solved by the one-step iterative method procedure proposed in [18] with a carefully chosen initial solution and a set of specially chosen parameters [26, 27, 29]. This method is implemented in the DISOLV program [26, 27, 29]. However, the solution of this matrix equation using the iteration procedure requires O(N2) operations (where N is the matrix size) since a standard matrix-by-vector multiplication has this complexity. Since the matrix size N is large (N ≈ 105) for the calculations with high accuracy [26, 27, 29] the time of getting the solution of the matrix equation becomes very large.
The recently developed multicharge method [57] works if the matrix A in equation (8) has a certain structure – the so-called H2-structure [58]. This structure implies that the matrix has a mosaic partitioning, i.e. it can be represented in the form of a set of sub-matrices so that each element of the matrix belongs to exactly one sub-matrix from this set. So in this problem the matrix A can be approximated by this H2-matrix with a high degree of reliability. It requires O(N) operations to build the H2-decomposition of matrix A and to get the matrix-by-vector product in the format of H2-decomposition, so it can accelerate significantly the solution of equation (8). The PCM method for finding the polar component of solvation energy using multicharge approximation was implemented in the MCBHSOLV program [27, 29], and it was shown that this program can run faster than the DISOLV program [27, 29], based on the classical algorithm for solving the PCM equations without loss of high numerical accuracy of the calculations – better than 1 kcal/mol [27, 29]. In this work we used both methods of solving the PCM equations: the classical method implemented in the DISOLV program [26, 27], and the method using multicharge decomposition implemented in the MCBHSOLV program [27, 29]. Both cases use the same program TAGSS [26, 27, 30] for constructing the triangulation grid on SES.
Another method COSMO (COnductor-like Screening MOdel) [19] is applied to solvents with large dielectric constants (e.g. for water, ε = 80). In this model dielectric continuum surrounding the solute molecule is replaced by metal (ε = ∞) continuum. The polar part of the solvation energy in this case is calculated as follows:
(9) |
Where Cf is the corrective coefficient due to the finite dielectric constant:
(10) |
Based on the vanishing potential on the surface of a conductor, the COSMO integral equation for the polarization charges is written as follows:
(11) |
Where r⃗ is the radius-vector of any point on the surface or out of the surface, R⃗i is the point that defines the position of the atomic charge Qi of each atom of the solute molecule.
In the present work calculations have been made also by the COSMO method implemented in the DISOLV program, as well as COSMO implemented in the quantum-chemical package MOPAC [40]. All methods listed above (PCM, S-GB, COSMO – program DISOLV; PCM - program MCBHSOLV) use the TAGSS program [26, 27, 30] to construct the triangulated SES surface. When the non-polar component is calculated with the DISOLV program, SAS is built from SES by the similarity transformation: each triangular element on SAS is obtained by shifting of the three vertices of the corresponding triangle on SES by the value of the radius of the primary rolling sphere along normal to the SES surface in these vertices [26, 27, 30].
In addition to the methods listed above, in this work we carried out calculations using the APBS program [25] solving the Poisson-Boltzmann equation for calculation of the polar part of solvation energy.
For the methods mentioned above and implemented in the DISOLV program (S-GB, PCM, COSMO), MCBHSOLV program (PCM), GBNSR6 program, and the APBS program the following parameters were selected: the radius of the primary rolling sphere was selected Rpr = 1.4 Å, the dielectric constant of solvent (water at room temperature) was ε = 80, and the step of triangulation grid on the SES surface was set to 0.3 Å. (except for the GBNSR6 program where the default value of 6 triangles per 1 Å2 was used). For the MOPAC program the dielectric constant of water was the same and the effective radius of the solvent molecule equals to 1 Å.
2.2.3 Polar component of the complex desolvation energy
Figure 1 illustrates the thermodynamic cycle of binding of the protein and the ligand in the solvent and in vacuum. The solvation energies of the protein, the ligand and the complex are designated as ΔGpol(protein), ΔGpol(ligand) and ΔGpol(complex) respectively, the polar component of binding energy in vacuum designated as ΔE(electrostatic). Here we consider only the polar components of the Gibbs free energy, and do not consider the components associated with the cavity formation and non-polar components of solvation energy. The change of the polar component of the free energy upon binding of the protein and the ligand in the solvent will be the same independently of the path in the thermodynamic cycle, therefore we can calculate the energy ΔΔGpol as follows: the protein and the ligand are transferred from solvent into vacuum, then form the complex in vacuum, and then the complex is transferred from vacuum into the solvent:
(12) |
Since in vacuum the polar component of the binding energy equals
(13) |
we compute the desolvation energy (influence of the solvent on the protein-binding energy in the solvent) as:
(14) |
This formula will be used for calculating the desolvation energy of protein-ligand complexes.
2.3 Test structures
Three test sets were collected for this work: a set of small molecules, a set of proteins and a set of protein-ligand complexes. 104 structures of small molecules were previously used by Aguilar et al. in [14]; these structures were randomly selected from the original David Mobley database [38]. The experimental hydration energies were also known for these molecules and the values are given in [38] and in the respective Supplementary materials. The second set consists of 19 small charge neutral proteins with no more than 500 atoms per structure: [20] (PDB IDs: 1az6, 1bh4, 1bku, 1brv, 1byy, 1cmr, 1dfs, 1dmc, 1eds, 1fct, 1fmh, 1fwo, 1g26, 1ha9, 1hzn, 1paa, 1qfd, 1qk7, 1scy), which were selected from the larger set [21]. The third set used here for the desolvation energy calculations, consists of 15 protein-ligand complexes (PDB IDs: 1b11, 1bkf, 1f40, 1fb7, 1fkb, 1fkf, 1fkg, 1fkh, 1fkj, 1fkl, 1pbk, 1zp8, 2fke, 2hah, 3kfp) [22]. The following criteria were used for choosing them from the PDB [59]: the total number of atoms was no more than 2000, there were no missed atoms in the structures other than hydrogens, and the ligands are neutral at pH = 7 and have known binding constants.
Configurations of molecules for calculations in the implicit solvent models were the same, which were used in the explicit solvent calculations.
Different parameterizations are used in some cases due to following reasons. The parameters included the atom charges, for which we calculate the interaction with the charges induced on the solvent boundary surface, and the atom radii, on which the solvent boundary surface (SES) is built. These parameters are external parameters for the implicit solvent model and may influent the accuracy of the solvation energy calculations. The main parameters set (parameter set 1) uses the radii ZAP9 [60, 61] and the charges am1-bcc [62, 63] for small molecules, and also the atomic radii mbondi2 and the charges of the ff14SB force field [64] on atoms for proteins and complexes (in this case proteins and complexes are prepared with the help of H++ webserver (http://biophysics.cs.vt.edu/) [65]), so far as this parameterization is used in the TI calculations in the explicit model (using Amber package for protein and complexes and taking results from [38] for small molecules). Therefore, this parameter set 1 is used in all implicit models and programs which allow it: GBNSR6, DISOLV and APBS programs. The choice of the ZAP9 [60, 61] radii set for small molecules is based on the results presented in [14], where it has been shown that radii set ZAP9 more precisely reproduce experimental values of solvation energies for small molecules. In case of proteins and complexes we have taken the radii set mbondi2 because it has also provided good results [20].
As it was mentioned in the Introduction, DISOLV and MCBHSOLV programs initially have been parameterized [26] in the frame of the MMFF94 force field. Therefore, we performed the solvation energy calculations for all methods of these two programs with the parameter set 2 using the atomic charges of MMFF94 force field [31] and the corresponding atomic radii for building SES [26].
Finally, the quantum-chemical program MOPAC uses its own parameterization in the PM7 method (parameter set 3).
Table 1 contains all calculation methods and defined parameters (charges and radii of atoms) used for each of these methods.
Table 1.
Method | Charges (proteins / small molecules) |
Radii (proteins / small molecules) |
|
---|---|---|---|
1 | GB (GBNSR6) (parameter set 1) | ff14SB / am1-bcc | mbondi2 /ZAP9 |
2 | GB (DISOLV) (parameter set 1) | ff14SB / am1-bcc | mbondi2 /ZAP9 |
3 | GB (DISOLV) (parameter set 2) | Mmff94 | Disolv |
4 | COSMO (DISOLV) (parameter set 1) | ff14SB / am1-bcc | mbondi2 /ZAP9 |
5 | COSMO (DISOLV) (parameter set 2) | Mmff94 | Disolv |
6 | COSMO (MOPAC) (parameter set 3) | Mopac | Mopac |
7 | PCM (DISOLV) (parameter set 1) | ff14SB / am1-bcc | mbondi2 /ZAP9 |
8 | PCM (DISOLV) (parameter set 2) | Mmff94 | Disolv |
9 | PCM (MCBHSOLV) (parameter set 2) | Mmff94 | Disolv |
10 | PB (APBS) (parameter set 1) | ff14SB / am1-bcc | mbondi2 /ZAP9 |
11 | Explicit solvent model (Amber) | ff14SB / am1-bcc |
The calculations of the non-polar component of the solvation energy using the DISOLV program (see paragraph 2) have been performed for two sets of parameters: DISOLV with parameter set 1 (atomic charges and radii are ff14SB / am1-bcc and mbondi2 /ZAP9) and DISOLV with parameter set 2 (atomic charges and radii are mmff94 and Disolv).
3. Results
In order to evaluate the quality of different calculation methods the following criteria were used: the root mean square deviation (RMSD) and the Pearson correlation coefficient (R2) between the calculated and reference values with averaging over the corresponding set of molecules. As the reference data we used either experimentally determined solvation energies (only for small molecules set) or calculated using explicit water model (for all test sets).
Experimental hydration energies for small molecules are considered as the total solvation energies including the polar and non-polar part. Both polar and non-polar components of the solvation energy are also calculated for the small molecules using the explicit solvent. Similarly, polar and non-polar parts of solvation energies (as well as total solvation energies) are obtained for small molecules using GBNSR6, DISOLV and MCBHSOLV. The programs MOPAC and APBS carry out the calculations of only the electrostatic (polar) component of solvation energies, so for these programs we do not compare the calculated total solvation energies with the experimental data. The correlation coefficients and values of RMSD for small molecules are given in tables 2–5. Table 2 shows that root mean square deviations of the polar components calculated by different implicit models from the results of calculations in explicit solvent are 0.9 – 1.9 kcal/mol; the range of absolute values of the polar component is about 16 kcal/mol (from −14 kcal/mol to 2 kcal/mol), consequently errors of polar component of solvation energy calculations (compared with the calculations in explicit solvent) by implicit solvent models are 6 – 12% for low molecular weight molecules. The corresponding correlation coefficients are quite high and are 0.82 – 0.97, and the correlation is slightly higher if the implicit models (models 1, 2, 4, 7 and 10 – see table 1) use the same radii and charges as the explicit solvent model uses (method 11 – see table 1).
Table 2.
Method | PCM (DISOLV) param. set 1 |
COSMO (DISOLV) param. set 1 |
S-GB (DISOLV) param. set 1 |
GB (GBNSR6) param. set 1 |
APBS param. set 1 |
---|---|---|---|---|---|
RMSD, kcal/mol |
1.634 |
1.642 |
1.782 |
0.930 |
1.911 |
R2 | 0.966 | 0.965 | 0.953 | 0.954 | 0.966 |
Method | PCM (DISOLV) param. set 2 |
COSMO (DISOLV) param. set 2 |
S-GB (DISOLV) param. set 2 |
PCM (MCBHSOLV) param. set 2 |
COSMO (MOPAC) param. set 3 |
RMSD, kcal/mol |
1.464 |
1.473 |
1.475 |
1.664 |
3.577 |
R2 | 0.877 | 0.876 | 0.854 | 0.822 | 0.843 |
Table 5.
Method | PCM (DISOLV) param. set 1 |
COSMO (DISOLV) param. set 1 |
S-GB (DISOLV) param. set 1 |
GB (GBNSR6) param. set 1 |
---|---|---|---|---|
RMSD, kcal/mol |
1.431 | 1.436 | 1.667 | 1.277 |
R2 | 0.929 | 0.929 | 0.906 | 0.923 |
Method | PCM (DISOLV) param. set 2 |
COSMO (DISOLV) param. set 2 |
GB (DISOLV) param. set 2 |
PCM (MCBHSOLV) param. set 2 |
RMSD, kcal/mol |
1.195 | 1.196 | 1.290 | 1.493 |
R2 | 0.920 | 0.919 | 0.916 | 0.874 |
An exception is the COSMO method implemented in the MOPAC package where the deviation from the explicit solvent reference is substantially larger (RMSD = 3.577 kcal/mol), possibly due to the insufficient selection of atomic radii used for the construction of the surface separating solvent from the solute molecule. We should note also that MOPAC uses the SAS surface instead of the SES surface, as it is usually accepted – see, e.g., [26, 27, 29, 30].
Two methods were used for the calculations of the non-polar component of the solvation energy (table 3): the method based on a linear dependence of non-polar component of the solvation energy on SAS area (DISOLV – param. set 1 or DISOLV – param. set 2 in table 3, depending on the selected options: am1-bcc - ZAP9 or mmff94 - disolv) and the method [43] taking into account the individual contributions of separate atoms and implemented in the GBNSR6 program (see above, section 2). Values of the non-polar part of the solvation energy were in the range from −0.18 kcal/mol to 3.31 kcal/mol for small molecules. Non-polar component of solvation energies demonstrates (see table 3) a poor correlation with explicit model results for the calculation method used in the DISOLV program (non-polar component is a linear form of SAS area), and it shows significantly better correlation with explicit model results for the GBNSR6 program, which takes into account Van der Waals contributions from individual solute atoms.
Table 3.
Method | DISOLV param. set 1 |
DISOLV param. set 2 |
GB (GBNSR6) param. set 1 |
---|---|---|---|
RMSD, kcal/mol |
0.661 | 0.696 | 0.696 |
R2 | 0.184 | 0.043 | 0.745 |
However, the total solvation energy (which is the sum of polar and non-polar parts) continue to have a high correlation with the total solvation energies obtained by calculations in explicit solvent (table 4) and with the experimental hydration energies (table 5).
Table 4.
Method | PCM (DISOLV) param. set 1 |
COSMO (DISOLV) param. set 1 |
S-GB (DISOLV) param. set 1 |
GB (GBNSR6) param. set 1 |
---|---|---|---|---|
RMSD, kcal/mol |
1.560 | 1.567 | 1.669 | 1.292 |
R2 | 0.966 | 0.9655 | 0.955 | 0.957 |
Method | PCM (DISOLV) param. set 2 |
COSMO (DISOLV) param. set 2 |
S-GB (DISOLV) param. set 2 |
PCM (MCBHSOLV) param. set 2 |
RMSD, kcal/mol |
1.604 | 1.611 | 1.639 | 1.787 |
R2 | 0.872 | 0.872 | 0.860 | 0.820 |
The total solvation energy of small molecules for different methods includes the polar solvation energy, calculated by different implicit methods, and the non-polar solvation energy, calculated by the DISOLV program (for methods 2,3,4,5,7,8,9 – see table 1) or the GBNSR6 program (for method 1 - see table 1). Importantly, when comparing the solvation energies (polar part) with the results obtained using explicit water model (Amber12 force field) influence of the radii and atoms charges is observed (for example, the correlation is 0.97 for the PCM calculations in the DISOLV program with the same parameters as calculations in explicit solvent, and it is 0.88 for the calculations with the parameters of MMFF94 force fields), but when comparing the total solvation energies of small molecules with the experimental hydration energies, the correlation coefficients between the calculated solvation energies and experimental hydration energies for different choices of radii and charges of atoms were close (for different methods implemented in DISOLV and GBNSR6 correlation coefficients equal to 0.90–0.93). RMSD also decreases when the comparison with the results in explicit solvent is changed to the comparison with the experimental hydration energies, and in this case the standard deviation is smaller for sets of parameters DISOLV (param. set 2), but not DISOLV (param. set 1).
Comparison of polar parts of the protein solvation energy was made between the explicit solvent reference and the following methods: PCM (implemented in the DISOLV program), COSMO (implemented in the DISOLV program), S-GB (implemented in the DISOLV program), Poisson-Boltzmann method (implemented in the APBS program) and COSMO (implemented in the MOPAC program) – see table 6. The polar component of proteins solvation energies are in the range from −250 kcal/mol to −750 kcal/mol. The corresponding root mean square deviations between the polar components of the solvation energies calculated using the respective implicit solvent models and the polar components of the solvation energies calculated in explicit solvent are shown in table 6, and the errors for the different methods range from 2% (GBNSR6) to 30% (S-GB method implemented in the DISOLV program and using the atomic radii and charges, different from that used in the calculations in explicit solvent).
Table 6.
Method | PCM (DISOLV) param. set 1 |
COSMO (DISOLV) param. set 1 |
S-GB (DISOLV) param. set 1 |
GB (GBNSR6) param. set 1 |
APBS param. set 1 |
---|---|---|---|---|---|
RMSD, kcal/mol |
12.303 | 12.452 | 28.994 | 11.699 | 13.037 |
R2 | 0.998 | 0.998 | 0.997 | 0.998 | 0.998 |
Method | PCM (DISOLV) param. set 2 |
COSMO (DISOLV) param. set 2 |
S-GB (DISOLV) param. set 2 |
PCM (MCBHSOLV) param. set 2 |
COSMO) (MOPAC) param. set 3 |
RMSD, kcal/mol |
130.781 | 137.216 | 148.417 | 131.202 | 99.155 |
R2 | 0.683 | 0.652 | 0.668 | 0.683 | 0.769 |
As in the case of small molecules there is significant dependence of the results on the selected parameters (radii and charges of atoms), and weak dependence on a specific method of the calculation when comparing the solvation energies calculated using implicit solvent models and ones calculated in explicit solvent. Here all the methods using the same parameters also give approximately equal accuracy of the calculations.
A similar situation is observed for the polar parts of the desolvation energy of protein-ligand complexes (table 7). Different models calculating the polar component of the solvation energy give close results to those in explicit solvent if the parameters are the same as ones used in the explicit solvent calculations. For the same parameters set two methods (the GB (GBNSR6) method and the Poisson-Boltzmann (APBS) method) show the better values of correlation coefficients and the lower RMSD with the results of calculations in explicit solvent. If another parameters are used the correlation coefficient noticeably decreases (except is the COSMO method implemented in the MOPAC package). The absolute values of the polar parts of the desolvation energies for complexes ranged from 25 kcal/mol to 85 kcal/mol (the range is 60 kcal/mol), and thus for the methods that use the same parameterization as in the calculations in explicit solvent the error is 8% – 21%, and for methods that use the different parameterization (3, 5, 8 methods in table 1) this error is significantly larger – 32 – 38%. The exception here is also the COSMO method implemented in MOPAC which gives small root mean square deviation from the results of the explicit solvent reference (RMSD equals 8.2 kcal/mol).
Table 7.
Method | PCM (DISOLV) param. set 1 |
COSMO (DISOLV) param. set 1 |
S-GB (DISOLV) param. set 1 |
GB(GBNSR6) param. set 1 |
APBS param. set 1 |
---|---|---|---|---|---|
RMSD, kcal/mol |
12.915 | 12.465 | 11.439 | 7.042 | 5.138 |
R2 | 0.886 | 0.886 | 0.909 | 0.956 | 0.963 |
Method | PCM (DISOLV) param. set 2 |
COSMO (DISOLV) param. set 2 |
S-GB (DISOLV) param. set 2 |
PCM (MCBHSOLV) param. set 2 |
COSMO (MOPAC) param. set 3 |
RMSD, kcal/mol |
20.240 | 20.477 | 23.163 | 19.697 | 8.251 |
R2 | 0.794 | 0.790 | 0.828 | 0.758 | 0.899 |
It was shown in [27, 29] that the numerical accuracy of the PCM method implemented in DISOLV and MCBHSOLV programs are the same when the same model parameters (atoms charges and radii for the dielectric boundary construction) were used. So, we should expect the same RMSD and correlation values between results of PCM MCBHSOLV program solvation energy calculations and ones obtained by the explicit solvent model as it is demonstrated for PCM DISOLV program calculations for the parameters set 1 (see respective cells in tables 6 and 7). The possibility of higher computation speed for large solute structures is the main advantage of PCM MCBHSOLV program over PCM DISOLV one, and the advantage is growing with decrease of the triangulation network step size and with increase of the solute molecule size [27, 29].
Calculations of solvation energy (polar component) of the small molecules take a few seconds for all methods. Computational times of solvation energy calculations benchmarked for the different methods for the proteins and complexes are shown in table 8.
Table 8.
Method | PCM (DISOLV) |
COSMO (DISOLV) |
S-GB (DISOLV) |
GBNSR6 | APBS | TI in explicit solvent |
PCM (MCBHSOLV) |
---|---|---|---|---|---|---|---|
Proteins calculatio n time |
2 – 15 seconds |
1 – 5 seconds |
0.5 – 1 seconds |
~ 5 seconds |
2 – 5 minutes |
12 hours on 12 processores |
15 – 40 seconds |
Complex es calculatio n time |
~ 35 – 125 seconds |
20 – 45 seconds |
3 – 6 seconds |
~ 7 – 10 seconds |
~ 15 minutes |
12 hours on 12 processores |
50 – 120 seconds |
As expected, the different implementations of the Generalized Born method are the fastest among all methods under consideration. The MCBHSOLV realization of PCM slower than the DISOLV one for small proteins (no more than 500 atoms), but demonstrates the same calculation speed for complexes with no more than 2000 atoms.
4. Conclusions
Need to increase considerably effectiveness of computer assisted drug design demands high accuracy of the protein-ligand binding energy calculations – better than 1 kcal/mol. This demand results in necessity to shed light on the accuracy of desolvation energy calculations with different implicit solvent models and their different realizations due to the desolvation energy is the important contribution to the protein-ligand binding energy. In this paper we have considered different implementations of several popular implicit solvation models: PCM (Polarized Continuum Model), GBNSR6 (a recent “R6” version of the generalized Born model), S-GB (Surface version of the generalized Born model), COSMO (COnductor-like Screening MOdel) and the standard numerical solution of the Poisson-Boltzmann equation (APBS program). For models without recommended default parameters we use either MMFF94 or Amber12 force fields for the charges and radii. For the COSMO model implemented in the MOPAC package we use PM7 quantum-chemical semiempirical method. The models are tested for small molecules, proteins and protein-ligand complexes. For small molecules the comparison between calculated total solvation energies and experimental hydration energies is performed, as well as between polar parts of solvation energies calculated with implicit solvation models and with explicit solvent. For proteins and complexes polar parts of solvation energies calculated with implicit solvation models are compared with the corresponding explicit solvent results, which are obtained by TI using rigid three-point TIP3P water model.
In terms of correlation coefficients between the results obtained using implicit models and the results of calculations in explicit solvent model (or experimental data for small molecules set), most of the tested models show similar results when the same parameters (radii and charges of atoms) are chosen. As expected [36] the choice of these parameters influences on the solvation energies: this can be seen from the comparison of the calculated solvation energies with the explicit solvent reference. However, in the case of small molecules it is also possible to notice that different model parameter sets lead to similar values of the correlation coefficients when comparing calculated total energies of solvation with the experimental hydration energies.
At the same time, despite the high correlation coefficients with results obtained in explicit solvent model (and experimental data), the desired “chemical accuracy” of solvation/desolvation energies (not exceeding 1 kcal/mol from the experimental values) has not yet been achieved by the methods and programs tested here. Even for small molecules the standard deviation (RMSD) of the solvation energies of small molecules computed by implicit solvent models from the experimental hydration energies have a range 1.2 – 1.7 kcal/mol for different implicit models. Similar standard deviation (1.2 kcal/mol) can be obtained comparing total solvation energies computed in the explicit solvent with experimental hydration energies of small molecules. The standard deviation of the desolvation energies calculated using the implicit models from the desolvation energies obtained using explicit solvent model are higher and equals to 5–20 kcal/mol with the range of desolvation energies of the complexes of about 60 kcal/mol (the difference between the maximum and minimum values of the desolvation energies, selected from the whole set of molecules). Since in this case (when comparing with the results of the calculations in explicit solvent model) the choice of atomic radii and charges has strong influence on the results, perhaps a more detail parameterization could reduce RMSD values.
It is difficult to anticipate that more accurate parameterization of implicit models is able to decrease significantly these discrepancies because the desolvation energy is a small difference between two large values and is subjected to large errors. However, for accuracy of docking positioning [35] we do not need to calculate the desolvation energy. The target function of the global minimization in the docking procedure is the energy of the protein-ligand complex [35] and only the complex interaction with water solvent has to be considered and the desolvation energy calculation is avoided.
In the present work, DISOLV and MCBHSOLV programs proved to be on par with the other used programs in the case of proteins and ligands solvation energy calculation. Also the numerical Poisson-Boltzmann (APBS program) and GBNSR6 methods proved to be the most accurate in calculating the desolvation energies, with GBNSR6 being much faster. The methods implemented in the DISOLV program allow getting almost the same results as the GBNSR6 and Poisson-Boltzmann methods in the case of the same parameterization. The S-GB method demonstrates the high speed of calculations as the GBNSR6 method. However it is also shown that the parameterization which uses mmff94 charges and DISOLV radii is need to be improved.
All models listed above have already been used in molecular modeling packages, for example: in quantum-chemical packages [51–53], in molecular dynamics packages [46, 54], in the MOPAC package (COSMO) [40]. Regarding docking programs, implicit solvent models are generally not used at all in most docking programs. Sometimes implicit models are used after the docking procedure calculating the scoring function for a given ligand pose, as it is implemented in the DOCK program [66]. The only exception is the SOL program [10] where the simplified GB model is used for the generation of the grid of potentials for probe ligand atoms (module SOL_GRID). The S-GB method and the PCM method using the multicharge approximations were recently used in the direct (gridless) docking program FLM [35], and it has been shown that the inclusion of the implicit solvent models into the calculation of the protein-ligand binding energies in the docking procedure, firstly, allowed to improve the quality of docking positioning and, secondly, brought the calculated protein-ligand binding energies much closer to the binding energies derived from the experimental data. The application of implicit models to estimation of the free energy of protein-ligand complexes formation, e.g. in the frame of the docking procedure, imposes certain restrictions on the calculation times, and in this case the methods using the GB model have a significant advantage.
It should be noted that the more accurate binding energy calculations should take into account polar and non-polar components of solvation energy both. In the present study the non-polar component calculation is discussed only briefly, and in calculations of protein solvation and protein-ligand desolvation energies only the polar component is considered. Though the non-polar component gives rather small contribution into the total solvation energy of large molecules, e.g. protein-ligand complexes, the lack of the non-polar component can lead to noticeable errors in accurate binding energy calculations. It was shown that the more sophisticated method of the non-polar component calculation considering individual contributions of solute atoms, demonstrates better correlation with the explicit solvent calculations: R2 values of DISOLV and GBNSR6 in Table 3 have to be compared. But this sophisticated method uses some model parameters (in GBNSR6), which have been optimized only for small molecules [14]. Therefore it is impossible to tell definitely that these parameters will be suitable for calculations of the proteins and complexes. In the present work non-polar components have been calculated only for small molecules. Further improving of implicit solvent models should be directed to self-consistent optimization of non-polar individual solute atoms contributions and polar model parameters for proteins as well as for small molecules.
Due to the vital necessity to improve accuracy of protein-ligand binding energy in docking calculations the fast and accurate implicit solvent models are urgently needed and the conclusions of the present work are extremely demanded. The main conclusion is that implicit models: GB (realized in GBNSR6 program [22]) and S-GB (realized in DISOLV program [26, 28]) have sufficient accuracy and computing speed to be used in docking programs. PCM and COSMO models implemented in DISOLV and MCBHSOLV programs are one order of magnitude slower than GB but they also can be used in docking [35] for higher computational accuracy.
On the other hand, it is revealed that further improvement of the implicit models parameterization polar and non-polar components both is needed to increase the accuracy of solvation energy calculations and as a result to improve docking accuracy.
Supplementary Material
Highlights.
-
Implicit solvent models are compared with each other and with the explicit solvent
Choice of solvation model parameters has strong influence on the results
For small molecules solvent models yield high correlation with experimental values
Generalized Born model demonstrates best combination of accuracy and computing speed
Acknowledgments
The reported work was financially supported by the Russian Science Foundation, Agreement # 15-11-00025. The calculations in explicit solvent by A.V. Onufriev were partially supported from the NIH GM076121.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Sliwoski G, Kothiwale S, Meiler J, Lowe EW., Jr Computational Methods in Drug Discovery. Pharmacol. Rev. 2014;66:334–395. doi: 10.1124/pr.112.007336. http://dx.doi.org/10.1124/pr.112.007336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sadovnichii VA, Sulimov VB. Supercomputing technologies in medicine. In: Voevodin VV, Sadovnichii VA, Savin GI, editors. Supercomputing technologies in science, education, and industry. Moscow: Moscow University Publishing; 2009. pp. 16–23. (in Russian) [Google Scholar]
- 3.Gilson MK, Given JA, Head MS. A new class of models for computing receptor–ligand binding affinities. Chemistry & Biology. 1997;4:87–92. doi: 10.1016/s1074-5521(97)90251-9. http://dx.doi.org/10.1016/S1074-5521(97)90251-9. [DOI] [PubMed] [Google Scholar]
- 4.Chen W, Chang CE, Gilson MK. Calculation of Cyclodextrin Binding Affinities: Energy, Entropy, and Implications for Drug Design. Biophysical Journal. 2004;87:3035–3049. doi: 10.1529/biophysj.104.049494. http://dx.doi.org/10.1529/biophysj.104.049494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Barreiro G, Guimaraes CRW, Tubert-Brohman I, Lyons TM, Tirado-Rives J, Jorgensen WL. Search for Non-Nucleoside Inhibitors of HIV-1 Reverse Transcriptase Using Chemical Similarity, Molecular Docking, and MM-GB/SA Scoring. J. Chem. Inf. Model. 2007;47:2416–2428. doi: 10.1021/ci700271z. http://dx.doi.org/10.1021/ci700271z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zhou HX, Gilson MK. Theory of Free Energy and Entropy in Noncovalent Binding. Chem. Rev. 2009;109:4092–4107. doi: 10.1021/cr800551w. http://dx.doi.org/10.1021/cr800551w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Moghaddam S, Inoue Y, Gilson MK. Host-Guest Complexes with Protein-Ligand-like Affinities: Computational Analysis and Design. J. Am. Chem. Soc. 2009;131:4012–4021. doi: 10.1021/ja808175m. http://dx.doi.org/10.1021/ja808175m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Guthrie JP. A blind challenge for computational solvation free energies: Introduction and overview. J. Phys. Chem. B. 2009;113:4501–4507. doi: 10.1021/jp806724u. http://dx.doi.org/10.1021/jp806724u. [DOI] [PubMed] [Google Scholar]
- 9.Chen W, Gilson MK, Webb SP, Potter MJ. Modeling Protein-Ligand Binding by Mining Minima. J. Chem. Theory Comput. 2010;6:3540–3557. doi: 10.1021/ct100245n. http://dx.doi.org/10.1021/ct100245n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sulimov AV, Kutov DC, Oferkin IV, Katkova EV, Sulimov VB. Application of the docking program SOL for CSAR benchmark. J. Chem. Inf. Model. 2013;53:1946–1956. doi: 10.1021/ci400094h. http://dx.doi.org/10.1021/ci400094h. [DOI] [PubMed] [Google Scholar]
- 11.Tomasi J, Persico M. Molecular interactions in solution: an overview of method based on continuous distributions of the solvent. Chem. Rev. 1994;94:2027–2094. http://dx.doi.org/10.1021/cr00031a013. [Google Scholar]
- 12.Cramer C, Truhlar D. Implicit solvation models: equilibria, structure, spectra, and dynamics. Chem. Rev. 1999;99:2161–2200. doi: 10.1021/cr960149m. http://dx.doi.org/10.1021/cr960149m. [DOI] [PubMed] [Google Scholar]
- 13.Onufriev A. Modeling solvent environments: applications to simulations of Biomolecules. Weinheim: WILEY-VCH Verlag GmbH & Co. KGaA; 2010. Continuum electrostatics solvent modeling with the generalized Born model; pp. 127–166. http://dx.doi.org/10.1002/9783527629251.ch6. [Google Scholar]
- 14.Aguilar B, Onufriev AV. Efficient Computation of the Total Solvation Energy of Small Molecules via the R6 Generalized Born Model. J. Chem. Theory and Comput. 2012;(8):2404–2411. doi: 10.1021/ct200786m. http://dx.doi.org/10.1021/ct200786m. [DOI] [PubMed] [Google Scholar]
- 15.Rocchia W, Sridharan S, Nicholls A, Alexov E, Chiabrera A, Honig B. Rapid grid-based construction of the molecular surface and the use of induced surface charge to calculate reaction field energies: applications to the molecular systems and geometric objects. J. Comp. Chem. 2002;23:128–137. doi: 10.1002/jcc.1161. http://dx.doi.org/10.1002/jcc.1161. [DOI] [PubMed] [Google Scholar]
- 16.Still WC, Tempczyk A, Hawley RC, Hendrickson T. Semianalytical treatment of solvation for molecular mechanics and dynamics. J. Am. Chem. Soc. 1990;112:6127–6129. http://dx.doi.org/10.1021/ja00172a038. [Google Scholar]
- 17.Totrov M, Abagyan R. Rapid boundary element solvation electrostatics calculations in folding simulations: successful folding of a 23-residue peptide. Biopolymers (Peptide Science) 2001;60:124–133. doi: 10.1002/1097-0282(2001)60:2<124::AID-BIP1008>3.0.CO;2-S. http://dx.doi.org/10.1002/1097-0282(2001)60:2<124::AIDBIP1008>3.0.CO;2-S. [DOI] [PubMed] [Google Scholar]
- 18.Pomelli S, Tomasi J. A new formulation of the PCM solvation method: PCMQINTn. Theor. Chem. Acc. 1997;96:39–43. http://dx.doi.org/10.1007/s002140050201. [Google Scholar]
- 19.Klamt A, Schuurmann G. COSMO: a new approach to dielectric screening in solvent with explicit expressions for the screening energy and its gradient. J. Chem. Soc. Perkin Trans. 1993;2:799–805. http://dx.doi.org/10.1039/P29930000799. [Google Scholar]
- 20.Onufriev AV, Aguilar B. Accuracy of continuum electrostatic calculations based on three common dielectric boundary definitions. J. Theor. Comput. Chem. 2014;13:1440006. doi: 10.1142/S0219633614400069. http://dx.doi.org/10.1142/S0219633614400069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Feig M, Onufriev A, Lee MS, Im W, Case DA, Brooks CL. Performance comparison of generalized born and Poisson methods in the calculation of electrostatic solvation energies for protein structures. J. Comput. Chem. 2004;25:265–284. doi: 10.1002/jcc.10378. http://dx.doi.org/10.1002/jcc.10378. [DOI] [PubMed] [Google Scholar]
- 22.Izadi S, Aguilar B, Onufriev AV. Protein–Ligand Electrostatic Binding Free Energies from Explicit and Implicit Solvation. J. Chem. Theory and Comput. 2015;11:4450–4459. doi: 10.1021/acs.jctc.5b00483. http://dx.doi.org/10.1021/acs.jctc.5b00483. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Knight JL, Brooks CL. Surveying implicit solvent models for estimating small molecule absolute hydration free energies. J. Comput. Chem. 2011;32:2909–2923. doi: 10.1002/jcc.21876. http://dx.doi.org/10.1002/jcc.21876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Arthur EJ, Brooks CL., III Parallelization and improvements of the generalized born model with a simple sWitching function for modern graphics processors. J. Comput. Chem. 2016;37:927–939. doi: 10.1002/jcc.24280. http://dx.doi.org/10.1002/jcc.24280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Baker NA, Sept D, Joseph S, Holst MJ, McCammon JA. Electrostatics of nanosystems: application to microtubules and the ribosome. Proc. Natl. Acad. Sci. USA. 2001;98:10037–10041. doi: 10.1073/pnas.181342398. http://dx.doi.org/10.1073/pnas.181342398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kupervasser OY, Zhabin SN, Martynov YB, Fedulov KM, Oferkin IV, Sulimov AV, Sulimov VB. A continuum solvent model: the DISOLV program - algorithms, implementation, and validation. Numer. Methods Program. 2011;12:246–261. (in Russian) [Google Scholar]
- 27.Sulimov VB, Mikhalev AYu, Oferkin IV, Oseledets IV, Sulimov AV, Kutov DC, Katkova EV, Tyrtyshnikov EE. Polarized continuum solvent model: considerable acceleration with the multicharge matrix approximation. International Journal of Applied Engineering Research. 2015;10:44815–44830. [Google Scholar]
- 28.Romanov AN, Jabin SN, Martynov YB, Sulimov AV, Grigoriev FV, et al. Surface generalized Born method: a simple, fast and precise implicit solvent model beyond the Coulomb approximation. J. Phys. Chem. Sect A. 2004;108:9323–9327. http://dx.doi.org/10.1021/jp046721s. [Google Scholar]
- 29.Mikhalev AY, Oferkin IV, Oseledets IV, Sulimov AV, Tyrtyshnikov EE, Sulimov VB. Application of the multicharge approximation for large dense matrices in the framework of the polarized continuum solvent model. Numer. Methods Program. 2014;15:9–21. (in Russian) [Google Scholar]
- 30.Zhabin SN, Sulimov VB. Construction of the adaptively triangulated smooth molecular surface: program TAGSS. Scientific Visualization. 2011;3:27–53. (in Russian) [Google Scholar]
- 31.Halgren TA. Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94. J. Comput. Chem. 1996;17:490–519. http://dx.doi.org/10.1002/(SICI)1096-987X. [Google Scholar]
- 32.Sulimov VB, Katkova EV, Oferkin IV, Sulimov AV, Romanov AN, Roschin AI, Beloglazova IB, Plekhanova OS, Tkachuk VA, Sadovnichiy VA. Application of Molecular Modeling to Urokinase Inhibitors Development. BioMed Research International. 2014;2014 doi: 10.1155/2014/625176. Article ID 625176, http://dx.doi.org/10.1155/2014/625176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Beloglazova IB, Plekhanova OS, Katkova EV, Rysenkova KD, Stambol’skii DV, Sulimov VB, Tkachuk VA. Molecular Modeling as a New Approach to the Development of Urokinase Inhibitors. Bulletin of experimental biology and medicine. 2015;158:700–704. doi: 10.1007/s10517-015-2839-3. http://dx.doi.org/10.1007/s10517-015-2839-3. [DOI] [PubMed] [Google Scholar]
- 34.Sulimov VB, Gribkova IV, Kochugaeva MP, Katkova EV, Sulimov AV, Kutov DK, Shikhaliev KS, Medvedeva SM, Krysin MYu, Sinauridze EI, Ataullakhanov FI. Application of molecular modeling to development of new factor Xa inhibitors. BioMed Research International. 2015;2015 doi: 10.1155/2015/120802. Article ID 120802, http://dx.doi.org/10.1155/2015/120802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Oferkin IV, Katkova EV, Sulimov AV, Kutov DC, Sobolev SI, Voevodin VV, Sulimov VB. Evaluation of docking target functions by the comprehensive investigation of protein-ligand energy minima. Advances in Bioinformatics. 2015;2015 doi: 10.1155/2015/126858. Article ID 126858, http://dx.doi.org/10.1155/2015/126858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Mobley DL, Dumont E, Chodera JD, Dill KA. Comparison of Charge Models for Fixed-Charge Force Fields: Small-Molecule Hydration Free Energies in Explicit Solvent. J. Phys. Chem. B. 2007;111:2242–2254. doi: 10.1021/jp0667442. http://dx.doi.org/10.1021/jp0667442. [DOI] [PubMed] [Google Scholar]
- 37.Nicholls A, Mobley DL, Guthrie JP, Chodera JD, Bayly CI, Cooper MD, Pande VS. Predicting Small-Molecule Solvation Free Energies: An Informal Blind Test for Computational Chemistry. J. Med. Chem. 2008;51:769–779. doi: 10.1021/jm070549+. http://dx.doi.org/10.1021/jm070549+ [DOI] [PubMed] [Google Scholar]
- 38.Mobley DL, Bayly ChI, Cooper MD, Shirts MR, Dill KA. Small molecule hydration free energies in explicit solvent: An extensive test of fixed-charge atomistic simulations. J. Chem. Theory Comput. 2009;5:350–358. doi: 10.1021/ct800409d. http://dx.doi.org/10.1021/ct800409d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Stewart JJP. Optimization of parameters for semiempirical methods VI: more modifications to the NDDO approximations and re-optimization of parameters. J. Mol. Modeling. 2013;19:1–32. doi: 10.1007/s00894-012-1667-x. http://dx.doi.org/10.1007/s00894-012-1667-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.MOPAC. 2012 http://openmopac.net/MOPAC2012.html. [Google Scholar]
- 41.Katkova EV, Oferkin IV, Sulimov VB. Application of the PM7 Quantum Chemical Semi-Empirical Method to the Development of New Urokinase Inhibitors. Numer. Methods Program. 2014;15:258–273. (in Russian) [Google Scholar]
- 42.Bordner AJ, Cavasotto CN, Abagyan RA. Accurate transferable model for water, n-Octanol, and n-Hexadecane solvation free energies. J. Phys. Chem. B. 2002;106:11009–11015. http://dx.doi.org/10.1021/jp0264477. [Google Scholar]
- 43.Gallicchio E, Levy RM. AGBNP: an analytic implicit solvent model suitable for MD simulations and high-resolution modeling. J. Comput. Chem. 2004;25:479–499. doi: 10.1002/jcc.10400. http://dx.doi.org/10.1002/jcc.10400. [DOI] [PubMed] [Google Scholar]
- 44.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. http://dx.doi.org/10.1063/1.445869. [Google Scholar]
- 45.Izadi S, Anandakrishnan R, Onufriev AV. Building Water Models, a Different Approach. J. Chem. Phys. Lett. 2014;5:3863–3871. doi: 10.1021/jz501780a. http://dx.doi.org/10.1021/jz501780a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Salomon-Ferrer R, Case DA, Walker RC. An overview of the Amber biomolecular simulation package. WIREs Comput. Mol. Sci. 2013;3:198–210. http://dx.doi.org/10.1002/wcms.1121. [Google Scholar]
- 47.Tjong H, Zhou HX. On the dielectric boundary in Poisson-Boltzmann calculations. J. Chem. Theory Comput. 2008;4:507–514. doi: 10.1021/ct700319x. http://dx.doi.org/10.1021/ct700319x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Connolly ML. Solvent-accessible surfaces of proteins and nucleic acids. Science. 1983;221:709–713. doi: 10.1126/science.6879170. http://dx.doi.org/10.1126/science.6879170. [DOI] [PubMed] [Google Scholar]
- 49.Vorobjev YN, Hermans J. SIMS: computation of a smooth invariant molecular surface. Biophys J. 1997;73:722–732. doi: 10.1016/S0006-3495(97)78105-0. http://dx.doi.org/10.1016/S0006-3495(97)78105-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Sanner MF, Olson AJ, Spehner JC. Reduced surface: an efficient way to compute molecular surfaces. Biopolymers. 1996;38:305–320. doi: 10.1002/(SICI)1097-0282(199603)38:3%3C305::AID-BIP4%3E3.0.CO;2-Y. http://dx.doi.org/10.1002/(SICI)1097-0282(199603)38:3<305::AID-BIP4& gt;3.0.CO;2-Y. [DOI] [PubMed] [Google Scholar]
- 51.Frisch MJ, et al. Gaussian 03. Pittsburgh, Wallingford, CT: Gaussian Inc.; 2003. [Google Scholar]
- 52.Schmidt MW, et al. General Atomic and Molecular Electronic Structure System. J. Comput. Chem. 1993;14:1347–1363. http://dx.doi.org/10.1002/jcc.540141112. [Google Scholar]
- 53.Werner HJ, et al. Molpro: a general-purpose quantum chemistry program package. Wiley interdisciplinary reviews: Computational molecular science. 2012;2:242–253. http://dx.doi.org/10.1002/wcms.82. [Google Scholar]
- 54.Im W, Beglov D, Roux B. Continuum solvation model: computation of electrostatic forces from numerical solutions to the Poisson–Boltzmann equation. Comp. Phys. Commun. 1998;111:59–75. http://dx.doi.org/10.1016/S0010-4655(98)00016-2. [Google Scholar]
- 55.Sigalov G, Fenley A, Onufriev A. Analytical Linearized Poisson--Boltzmann Approach: Beyond the Generalized Born Approximation. J. Chem. Phys. 2006;124:124902. doi: 10.1063/1.2177251. http://dx.doi.org/10.1063/1.2177251. [DOI] [PubMed] [Google Scholar]
- 56.Sigalov G, Scheffel P, Onufriev A. Incorporating variable dielectric environments into the generalized Born model. J. Chem. Phys. 2005;122:094511. doi: 10.1063/1.1857811. http://dx.doi.org/10.1063/1.1857811. [DOI] [PubMed] [Google Scholar]
- 57.Mikhalev AYu, Oseledets IV. Adaptive nested cross approximation of non-local operators. arXiv preprint: 1309.1773. 2013 (URL: http://arxiv.org/abs/1309.1773) [Google Scholar]
- 58.Börm S. Efficient numerical methods for non-local operators: H2-matrix compression, algorithms and analysis. Tracts in Mathematics 14, European Mathematical Society. 2010 http://dx.doi.org/10.4171/091. [Google Scholar]
- 59.Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Research. 2000;28:235–242. doi: 10.1093/nar/28.1.235. http://dx.doi.org/10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Nicholls A, Wlodek S, Grant JA. SAMPL2 and continuum modeling. J. Comput. Aided Mol. Des. 2010;24:293–306. doi: 10.1007/s10822-010-9334-8. http://dx.doi.org/10.1007/s10822-010-9334-8. [DOI] [PubMed] [Google Scholar]
- 61.Nicholls A, Wlodek S, Grant JA. The SAMP1 solvation challenge: further lessons regarding the pitfalls of parameterization. J. Phys. Chem. B. 2009;113:4521–4532. doi: 10.1021/jp806855q. http://dx.doi.org/10.1021/jp806855q. [DOI] [PubMed] [Google Scholar]
- 62.Jakalian A, Bush BL, Jack DB, Bayly CI. Fast, efficient generation of highquality atomic charges. AM1-BCC model: I. Method. J. Comput. Chem. 2000;21:132–146. doi: 10.1002/jcc.10128. http://dx.doi.org/10.1002/(SICI)1096-987X(20000130)21:2<132::AIDJCC5>3.0.CO;2-P. [DOI] [PubMed] [Google Scholar]
- 63.Jakalian A, Jack DB, Bayly CI. Fast, efficient generation of high-quality atomic charges. AM1-BCC model: II. Parameterization and validation. J. Comput. Chem. 2002;23:1623–1641. doi: 10.1002/jcc.10128. http://dx.doi.org/10.1002/jcc.10128. [DOI] [PubMed] [Google Scholar]
- 64.Wang J, Kollman PA. Automatic parameterization of force field by systematic search and genetic algorithms. Journal of Computational Chemistry. 2001;22:1219–1228. http://dx.doi.org/10.1002/jcc.1079. [Google Scholar]
- 65.Anandakrishnan R, Aguilar B, Onufriev AV. H++ 3.0: automating pK prediction and the preparation of biomolecular structures for atomistic molecular modeling and simulation. Nucleic Acids Res. 2012;40:W537–W541. doi: 10.1093/nar/gks375. http://dx.doi.org/10.1093/nar/gks375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Oshiro CM, Kuntz ID. Flexible ligand docking using a genetic algorithm J. Comput-Aided Mol. Design. 1995;9:113–130. doi: 10.1007/BF00124402. http://dx.doi.org/10.1007/BF00124402. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.