Abstract
The need for accurate yet efficient representation of the aqueous environment in biomolecular modelling has led to the development of a variety of generalized Born (GB) implicit solvent models. While many studies have focused on the accuracy of available GB models in predicting solvation free energies, a systematic assessment of the quality of these models in binding free energy calculations, crucial for rational drug design, has not been undertaken. Here, we evaluate the accuracies of eight common GB flavors (GB-HCT, GB-OBC, GB-neck2, GBNSR6, GBSW, GBMV1, GBMV2 and GBMV3), available in major molecular dynamics packages, in predicting the electrostatic binding free energies () for a diverse set of 60 biomolecular complexes belonging to four main classes: protein-protein, protein-drug, RNA-peptide and small complexes. The GB flavors are examined in terms of their ability to reproduce the results from the Poisson-Boltzmann (PB) model, commonly used as accuracy reference in this context. We show that the agreement with the PB of estimates varies widely between different GB models and also across different types of biomolecular complexes, with correlations ranging from 0.3772 to 0.9986. A surface-based “R6” GB model recently implemented in AMBER shows closest over-all agreement with reference PB (, RMSD=8.75 kcal/mol). The RNA-peptide and protein-drug complex sets appear to be most challenging for all but one model, as indicated by the large deviations from the PB in . Small neutral complexes present the least challenge for most of the GB models tested. The quantitative demonstration of the strengths and weaknesses of the GB models across the diverse complex types provided here can be used as a guide for practical computations and future development efforts.
Graphical Abstract

INTRODUCTION
Accurate determination of the binding affinities of small molecules with a biological target is crucial in molecular design and drug discovery but remains challenging.1-9 Experimental approaches for assessing the free energy of binding are difficult and prohibitively expensive on large scale.10 Therefore, considerable effort has been devoted to develop in silico approaches with the aim of reducing time and cost in modern drug design processes, namely free energy perturbation (FEP),11 thermodynamic integration (TI),12 linear response (LR)13 and molecular mechanics-Poisson-Boltzmann-surface area (MM-PBSA)14,15 methods. Among all, molecular simulations have gained significant popularity in estimating binding affinities due to substantial methodological improvements and advanced computational power.14,16,17
In essence, the free energy of binding is largely determined from the combined effect of the delicate balance between receptor-ligand, solvent-ligand, solventreceptor and solvent-solvent interactions. The quality of the underlying solvent environment can therefore have a profound effect to the outcomes of receptor-ligand binding calculations.18-20 Traditionally, the effects of aqueous solvation have been accounted for by placing a sufficiently large number of individual water molecules around the solute, and simulating their motion on an equal footing with the molecule of interest.21-23 While arguably the most realistic of the current theoretical approaches, it suffers from considerable computational costs, which often becomes prohibitive, for example for the receptor-ligand docking problem in which one searches for the best position of the ligand molecule (drug candidate) on the target receptor.24,25 The cost of such calculations increase dramatically with the system size. Moreover, it is not a-priori clear at this point which, if any, of the numerous available explict water models can be used as an accuracy standard for estimates of receptor-ligand binding affinities. For example, the popular TIP3P model can lead to 40 % (6 kcal/mol) error relative to experiment in binding enthalpy estimates of even small hostguest systems;26 differences between binding free energy estimates based on the widely used TIP3P and TIP4P-Ew models can be as high as 9 kcal/mol.27
The implicit solvation methodology28,29,29-50 provides significant computational advantages by representing the discrete solvent molecules by a uniform continuous medium with the average properties of the solvent. Currently, the practical “engine” of the implicit solvent models – responsible for the estimation of the key electrostatic interactions – is either the Poisson–Boltzmann (PB) model,14,29,38,45,51-57 or, in case where computational efficiency is paramount, the generalized Born (GB) approximation.36,58-63 The GB, of which a great variety of flavors and implementations are available,63-92 has become popular due to a reasonable balance between accuracy and speed and the availability of a diverse class of GB implementations in major molecular modeling packages.93-97
While arguably most widely used in molecular dynamics simulations, the relatively high efficiency of the GB model makes it also very promising in large-scale estimates of solvation free energies and, hence, in estimates of receptor-ligand binding interactions,16,30,34,61,98,99 especially in conjunction with approaches, such as MM-PB(GB)SA,14-16 where large numbers of snapshots need to be processed. In the past, systematic studies were performed to examine the accuracy and efficiency of computing the electrostatic solvation free energies () with various implicit solvent models, including the GB and PB.18,19,64,100-103 However, even a reasonably good accuracy of a GB (or the PB, for that matter) model in reproducing the electrostatic solvation free energy alone may not necessarily translate into the same level of accuracy for ligand binding calculations.56,63,104 One reason is that the values of binding free energies are relatively much smaller than the free energy terms corresponding to the bound and unbound states (e.g. tens of kcal/mol for binding affinities vs. thousands of kcal/mol for solvation free energies), which leads to high sensitivity of the method to parameters used in the calculations of binding affinities.56,105-107
To the best of our knowledge, no systematic assessment of the GB accuracy in receptor-ligand binding was ever performed on a large and diverse set of complexes, simultaneously for most GB models available in major modeling packages. This study is intended to fill this critical gap.
Here, we study the accuracies of the electrostatic binding free energies () computed for a set of 60 biomolecular complexes using eight commonly used GB models available in the major molecular modeling packages AMBER93 and CHARMM.94 Several of these GB models originally developed for these packages have since been adopted by other major modeling packages.95-97 The sets of biomolecular complexes tested here are diverse with respect to the biomolecule type (e.g. nucleic acids, proteins and drugs), structure sizes, net charges, as well as values. The objective is to provide a quantitative assessment of the accuracies of common GB models across diverse complex types.
All of the GB models (flavors) considered here have been developed in the context of simulating biological macromolecules; the PB model has been used as their natural accuracy reference, including previous large-scale estimates of the GB performance across multiple models64,100 focused on solvation free energy. We adopt the same strategy here in terms of the PB reference, and compare the predictions of and by different GB models to those given by the numerical PB. That is we focus on the electrostatic component of the binding free energy, since it is the electrostatics that the GB approximation is designed to estimate. We fully acknowledge the approximate nature of the PB;108 further justification for the use of the numerical PB reference for the GB models considered here is provided in “Methods”. Implications for practical calculations of binding affinity are discussed in “Conclusions”. We expressly avoid comparing computational efficiencies of different models from multiple packages – interested readers are referred to previous comprehensive studies,64,100 and , for the newest models, to the original manuscripts.109,110
The remainder of the paper is organized as follows. First, we present the specifics of the receptor-ligand complexes used for the comparative studies, along with a brief overview of the GB flavors and the reference PB model used in this study. Second, we present the quality of the calculated and with respect to reproducing the values predicted by the reference PB models. Our findings, including implications of the GB model accuracy relative to the PB in the context of practical calculations are summarized and discussed in the Conclusions.
METHODS
Test Structures
Four test sets containing 60 biomolecular complexes were collected for this study: a set of small complexes (data set 1), a set of protein-drug complexes (data set 2), a set of protein-protein complexes (data set 3), and a set of RNA-peptide complexes (data set 4). The structures of these molecules were taken from the RCSB protein databank111 (PDB), and the PDB identification codes of these structures can be found in Table 1. These sets are diverse with respect to the biomolecule type (e.g. nucleic acids, proteins and drugs), structure sizes and net charges (see Table 2), as well as their calculated (see the Results section). The structures in Set 1 (small neutral complexes) were originally selected27,112 to contain no more than 2000 heavy atoms and no missing heavy atoms. They were protonated as described previously.27
Table 1:
PDB ID of the complexes in the four data sets.
| data set 1 (small complexes) |
data set 2 (protein-drug) |
data set 3 (protein-protein) |
data set 4 (RNA-peptide) |
|---|---|---|---|
| 1B11 | 1JTX | 1B6C | 1A1T |
| 1BKF | 1JTY | 1BEB | 1A4T |
| 1F40 | 1JUM | 1BVN | 1BIV |
| 1FB7 | 1JUP | 1E96 | 1EXY |
| 1FKB | 1QVT | 1EMV | 1HJI |
| 1FKF | 1QVU | 1FC2 | 1I9F |
| 1FKG | 3BQZ | 1GRN | 1MNB |
| 1FKH | 3BR3 | 1HE1 | 1NYB |
| 1FKJ | 3BTC | 1KXQ | 1QFQ |
| 1FKL | 3BTI | 1SBB | 1ULL |
| 1PBK | 3BTL | 1FMQ | 1ZBN |
| 1ZP8 | 3PM1 | 1UDI | 2A9X |
| 1ZPA | 2BTF | 484D | |
| 2FKE | 2PCC | ||
| 2HAH | 2SIC | ||
| 3FKP | 2SNI | ||
| 7CEI |
Table 2:
The average sizes (in atoms) and average absolute total charges of the complexes and ligands for the four test sets.
| Average Size (atoms) | Average ∣Charge∣ (e) | |||
|---|---|---|---|---|
| Complex | Ligand | Complex | Ligand | |
| small complexes | 1769.9 | 101.4 | 0.00 | 0.00 |
| protein-drug | 6203.4 | 44.9 | 3.615 | 1.154 |
| protein-protein | 6147 | 2475.1 | 6.00 | 4.667 |
| RNA-peptide | 1209.5 | 393.5 | 16.615 | 7.769 |
For the proteins and RNA molecules (except for data set (1)), the atomic charges and hydrogens were assigned using the PDB2PQR utility.113,114 The ff99bsc0 parameters and the GAFF force field, both part of AMBER,115 were used for preparing the topology and coordinate files, which includes partial charges. The atomic charges for the drugs in data set (4) were assigned with the methods outlined elsewhere.63 For the sake of consistency, we used the Bondi radii set for all of the implicit solvent models studied here, as these radii are recommended for specific AMBER GB models.94 However, to test the sensitivities of the calculated and to the choice of atomic radii, some of the calculations were repeated with the atomic radii other than Bondi. The PQR files as well as the corresponding values for the test sets are available in the Supporting Information.
The Thermodynamic Cycle for Electrostatic Binding Free Energies
The thermodynamic cycle shown in Figure 1 is used to calculate : first, the receptor and the ligand are transferred from solvent into vacuum, second, the complex is formed in vacuum, and then the complex is transferred from vacuum into the solvent. The of the receptor, the ligand and the complex are designated as , , and , respectively. The electrostatic component of the binding free energy in vacuum is , which is calculated similarly for all models. Using this cycle,
| (1) |
Figure 1:

The thermodynamic cycle used to compute the electrostatic component of the binding free energy. The surrounding dielectric medium is shaded for water and is white for vaccum.
In all the calculations presented here we assume no conformational changes to the structures and do not consider the nonpolar component of the solvation free energy or the entropic contributions to binding. While estimating these quantities would be necessary for a direct comparison with experiment, the associated uncertainties in their theoretical estimates can be large,9,14,15,104,116-119 and may obfuscate the errors in , which is the only quantity that the GB model predicts.
Reference Poisson-Boltzmann Calculations
The finite-difference solution of the 3D linear PB equation was solved using the Adaptive Poisson-Boltzmann Solver (APBS).120 To obtain the of a molecule the linear PB equation was solved twice, first in salt solution and then in vacuum, then the difference between the two electrostatic free energies was taken. Each molecule was treated as a low dielectric region with a dielectric constant () of 1. The solvent (here a salt solution) region was treated as a high dielectric region with a dielectric constant () of 80 and NaCl concentration of 0.145 M, with the latter representing a physiological environment. The solvent temperature was set to 298.15 K. The dielectric boundary that separates the low dielectric molecule from the solvent region was defined using the solvent excluded molecular surface obtained with a 1.4 Å spherical water probe radius. No ion exclusion region was considered for consistency with the GB models here examined. We used an autofocusing protocol with a fine grid 20 Å larger than the molecule in each dimension, and a coarse grid 1.7 times the size of the molecule in each dimension, and a fine grid spacing of 0.3 or 0.5 Å (to check for convergence with respect to grid spacing105,107). The default values of all other APBS parameters were used.
The Generalized Born Models
The “canonical” generalized Born model approximates the Green function of the Poisson equation via a simple closed form expression, first introduced by Still et al.,60 and later augmented to account for the presence of monovalent salt:121
| (2) |
where is the distance between atoms and , and are the respective (partial) charges, the and are the so-called effective Born radii, and is the Debye-Hückel screening parameter used to incorporate121 the electrostatic screening effects of (monovalent) salt. While alternatives have been proposed,71,89,122,123 the most widely used functional form of is
| (3) |
Apart from a “dielectric” correction to Eq. (2) used by one of the models (see below), the GB Green function Eq. (2) is the same for all of the GB models considered in this work. That Green function can be rigorously derived from the Poisson equation under a certain set of assumptions,36 which further supports124 the use of the numerical PB for testing of the GB accuracy, at least in the context of the GB flavors tested here. The latter were developed primarily for modeling of biological macromolecules. At the same time, we acknowledge the existence of a view108 that the GB approximation in general is not a direct approximation to the PB, see e.g. Ref.124 for a discussion.
The way the effective Born radii () are calculated differ significantly between implementations of the GB models; in that aspect, the GB models (flavors) that are investigated here represent a variety of classes. Full details of these GB models are provided in the original papers; here, we briefly review the main differences between them.
While the GB does not employ the dielectric boundary in exactly the same manner as the PB model does, the solute/solvent boundary plays the same conceptual role in the GB; technically, the dielectric boundary enters125 the GB model through the effective Born radii. In all of the models tested here the effective Born radii are computed as approximate or exact integrals over the suitably chosen solute volume or surface. The two-dielectric model considered here is defined by specifying the values of the internal (solute) and external (solvent) dielectrics, and the postion of the sharp boundary between them. Thus, within this model it is natural to assume the same dielectric constants for the GB when testing it directly against the PB. Within this limited scope testing, the precise value of the dielectric constant used to describe the low dielectric interior of the macromolecule is not very relevant. However, we argue that using the lowest physically meaningful value of is best if the goal is to highlight the GB versus PB error – the error is maximized when the solute and solvent dielectrics differ the most, and disappears when the two are equal.126
For all the GB models, we used interior and exterior dielectric constants of () and () respectively, a NaCl concentration of 0.145 M, the temperature was set to 298.15 K and a solvent probe radius of 1.4 Å where appropriate. To make correspondence to the reference PB calculations, we used the same intrinsic atomic radii for the GB and PB calculations, see below. Most of the reported results are based on Bondi radii, but we have explored the sensitivity of the conclusions to several other radii sets as described below. Appropriate radii were incorporated into the prmtop files using a variation of the AMBER2CHARMM tool.
AMBER GB models
Four variants of the GB model were used: GB-HCT,127-129 GB-OBC,130,131 GB-neck288,132 and GBNSR6.109,133 Except for GB-NSR6, all of these GB models are based on the Coulomb field approximation and differ mainly in how the effective Born radii are computed. The analytical linearized PB (ALPB) correction134 was used with the GBNSR6 model model only.
GB-HCT uses the pairwise descreening approach of Hawkins, Cramer and Truhlar,127,128 with parameters described by Tsui and Case,129 which leads to a GB model implemented in Amber with igb=1. The 3D integral used in the estimation of the effective radii is performed over the van der Waals spheres of solute atoms, which implies a definition of the solute volume in terms of a set of spheres, rather than the more complex molecular surface.
In GB-OBC,131 the effective Born radii are re-scaled to account for the interstitial spaces between atom spheres missed by the GB-HCT approximation. In that sense, GB-OBC is intended to be a closer approximation to the true molecular volume, albeit in an average sense. GB-OBC can be used in AMBER by setting igb=5 or 2, here we used the igb=5 version.
GB-neck2 is the latest, most successful version of an earlier GBn model,135 with different additional parameters.88,132 GB-neck2 uses a pairwise correction term to GB-OBC to approximate135 a molecular surface dielectric boundary; that is to eliminate interstitial regions of high dielectric smaller than a solvent molecule. This correction affects all atoms and is geometry-specific, going beyond the geometry-free, “average” re-scaling approach of GB-OBC, which mostly affects buried atoms. GB-neck2 can be used in AMBER by setting igb=8.
GBNSR6109,133,136 (“Numerical Surface R6 GB") is one of the latest additions to the AMBER family of GB models, now also implemented in MM_PBSA137,138 module and Free Energy Workflow (FEW)139 tool of AmberTools17136 In GBNSR6 model, the effective Born radii are computed via the “R6” numerical integration over the molecular surface of the solute, defined by the solute intrinsic radii and the solvent probe. GBNSR6 includes a single additional (to the PB) parameter: a constant offset to the inverse effective Born radii. The value of the offset is fixed for a given probe radius; here we used Å−1 as proposed by Mongan et al135 for the probe radius of 1.4 Å. That is, as long as the solvent probe radius is set to the default 1.4 Å, and agreement with the PB is the goal, GBNSR6 model becomes parameter-free in the same sense as the PB. Note that in addition to input parameters that are generally used by the PB, such as the intrinsic atomic radii, other GB models rely on multiple additional parameters to approximate the PB energies, the optimum values of these parameters generally depend on the atomic radii.
By default, in GBNSR6 is calculated by the ALPB model,134 which introduces a physically correct dependence on the dielectric constants into the original GB model (Eq. (2)), while maintaining the efficiency of the original:
| (4) |
where , , and is the electrostatic size of the molecule, which is essentially the overall size of the structure; is estimated analytically.134
CHARMM GB models
Four GB models available in the CHARMM software package94 were evaluated in this study: GBMV1 and GBMV2 are analytical methods in which the molecular volume is constructed from a superposition of atomic functions.73,140 GBMV1 is the oldest and least accurate of these two analytical GB models. GBMV373,100 is a grid-based method that uses a Cartesian grid representation of the molecular volume similar to that used in traditional Poisson-Boltzmann calculations, and GBSW87,141 uses a van der Waals-based surface with a simple polynomial switching function to smooth the dielectric boundary. A more detailed description of these models is available in a recent review.100 To compute these energies, CHARMM topology and coordinate files were created with a script. The notation used here for naming the different CHARMM GB parameters follows the CHARMM manual (see the documentation for the gbmv and gbsw modules). For GBSW, the molsurf option was used to mimic the molecular surface, conc=0.145 M, sw=0.2, and the parameters aa0 and aa1 were set to 1.2045 and 0.1866, respectively. For GBMV1 beta was set to −100, the shift parameter was set to −0.5, lambda1 was set to 0.1, p1 to 0.44, tt to 2.92, corr to 0, bufr to 0.5, mem to 20, cuta to 20, p3 to 0, wtyp to 0, and eshift to 0. For GBMV2, tol was set to 10−10, mem to 20, cuta to 20, dn to 1, bufr to 0.2, beta to −12, shift to −0.1, slope to 0.9, lambda1 to 0.5, p1 to 0.45, p2 to 1.25, p3 to 0.65, p6 to 8, onx to 1.9, offx to 2.1, corr to 1, son to 1.2, soff to 1.5, nphi to 5, sa to 0, sb to 0, and kappa to 0.1239950497. For GBMV3, dn was set to 0.2, p6 to 4.0, nphi to 38, shift to −0.007998, slope to 0.9026, wtyp to 2 and kappa to 0.1239950497. All other parameters used their default values.
Choice of the Input Atomic Radii Set
Within the implicit solvation framework, the solvation energies are extremely sensitive to the intrinsic atomic radii values,107,125,142 because these determine the position of the dielectric boundary between the low dielectric interior and the high dielectric solvent. Both the GB and the PB models take the intrinsic atomic radii as input – many different radii sets exist.143-147 Thus, one has to carefully consider the choice of the input radii set for any accuracy analysis of these models, including the one presented here. As we shall see below, the appropriate choice is not obvious, and depends on the kind of comparison one wants to make, and the specifics of the models used. In principle, one could consider using different radii sets for different models; such strategy would make sense if the accuracy metric were an experimental observable such as the binding free energy. In this case, one could choose a set of parameters for each model that gives best agreement with experiment, and then compare the outcomes to determine the most accurate model, GB or PB, with respect to the chosen metric. However, for reasons outlined above, our goal here is more limited and focused: to evalute only the electrostatic component of solvation energy relative to the PB reference. We argue that in this case using, for all of the GB models, one and the same radii set employed to obtain the PB reference energies makes sense on the grounds of physical consistency. Indeed, all of the GB models reduce to the Born formula for a single spherical ion, which also corresponds to the exact solution of the Poisson equation within the two-dielectric formalism shared by all of the models considered here. Thus, it makes sense that all the models, GB or PB, are required to give identical result for a single isolated spherical ion in water, which implies the same radii set. Here we propose to use the well established Bondi radii set: it is physically well-grounded and has only 5 main atom types, all of which bodes well for low probability of an unwanted bias towards a specific training set and type of reference used in radii set optimizations. Note that more complex radii sets optimized against hydration free energies of small molecules or amino-acids are more likely to be poorly transferable to computation of protein-ligand binding energies.148 Unfortunately, even the seemingly straightforward choice of Bondi radii needs to be carefully justfied for the diverse set of practical GB models tested here. This is because most of these GB models contain multiple fitting parameters used to optimize the agreement with the respective accuracy reference, which most often includes the numerical PB solvation energies. These GB models are intended to be used with specific atomic radii set employed during the parameter optimization, e.g. modifications of Bondi set mbondi, mbondi2 and mbondi3 for GB-HCT, GB-OBC and GB-neck2, respectively (AMBER), and CHARMM radii for many CHARMM GB models. Among all of the GB models tested here, GBNSR6 is the only model with no recommended radii set, because the model is essentially agnostic to the input intrinsic radii in the same sense as the PB reference is. For all the other GB models, changing the input atomic radii away from the recommended set without an extensive re-parametrization effort – which is out of scope here – may result in deterioration of the accuracy of the model relative to the PB reference values based on the same radii set; a possible appearance of accuracy improvement for some structural classes would likely be due to a fortuitous cancellation of error for this class of structures only. To address this subtle issue, we have confirmed that the computed values are not sensitive to the use of Bondi set instead of the recommended set for the GB models where Bondi set was not listed as one of the recommended options, see "Results" section for details.
Perfect effective Born radii
To compute the perfect122 effective Born radii , a Poisson problem is set up and solved separately for each atom using the numerical PB solver PEP,30 which is especially well suited for the task. In the process, the dielectric boundary of the full molecule is present, but only the charge of that particular atom is non-zero (all other charges are set to zero). The process yields the self-energy for each atom , from which the perfect effective Born radius is obtained by inverting the Born equation . We set solute dielectric , and use for the solvent to mimic the conductor limit as discussed previously.126 Here, the perfect radii are used only to further explore the accuracy of GBNSR6; for consistency with the GBNSR6 results without the perfect radii, the corresponding (perfect radii) values in Table 6 are computed using (Eq. (4)). The corresponding PB reference values of reported in Table 6 are computed using APBS (rather than PEP) for consistency with the rest of the paper; we verified that these values change insignificantly if computed with PEP instead.
Table 6:
PB values (kcal/mol) compared to GB-NSR6 (Bondi radii) and the GB using perfect effective Born radii. When the latter are used in the GB formalism, the remaining error relative to the PB comes from the GB Green function itself.
| PDB ID | PB | GBNSR6(Bondi radii) | GB(Perfect radii) |
|---|---|---|---|
| 1BKF | −0.76 | −9.19 | −10.72 |
| 1FKL | 12.28 | 6.23 | 3.98 |
Results
Poisson-Boltzmann Convergence
To determine whether our PB results were converged with respect to grid spacing, we computed 20 estimates of each of , , and at grid spacings of 0.5 Å with the centers of the solution grid shifted by a random fraction of a grid spacing in each direction and averaged these results. We then compared these numbers to similar numbers obtained at a grid spacing of 0.3 Å. By averaging over these solution grids we attempted to eliminate the error due to grid placement. The root mean square deviation (RMSD) between the APBS results with grid spacings of 0.3 Å and those with grid spacings of 0.5 Å are 12.6, 7.9, 2.7 and 2.8 kcal/mol for the RNA-peptide, protein-protein, protein-drug and small complexes data sets, respectively. These RMSD’s between the APBS calculations with grid spacings of 0.3 Å and those with grid spacings of 0.5 Å may at first appear to be large compared to those between the GB models and the APBS calculations at 0.3 Å, but the energies with grid spacings of 0.3 Å are always systematically smaller than those with a grid spacing of 0.5 Å because of the bias present in such calculations.105,107 The correlations between the two sets of APBS calculations are therefore much higher than would be expected from the RMSD alone. The square () of the Pearson’s correlation coefficients between given by a grid spacing of 0.3 Å and those given by a grid spacing of 0.5 Å were greater than 0.999 for all four data sets. For was greater than 0.992 for all four data sets. For any of the GB models, weaker correlations can be attributed to errors in the GB model.
Electrostatic Solvation Free Energies ()
In this section, we briefly review the accuracies of the different GB models in predicting for the four sets of complexes (including the complex and receptor and ligand components). Table 3 and Figure 2(a) summarize the deviations of the given by the different GB models from those given by the PB equation. The root mean square deviation (RMSD), average signed error (< error >), square of the Pearson correlation coefficient (), and RMSD of the 20% of complexes with the worst agreement with the reference were calculated for each of the four data sets. Overall, the values from all the GB models highly correlate with the values from the PB equation. The lowest correlation is for GBMV1 for data set 3 (), but even this correlation is good.
Table 3:
Deviation of GB values (kcal/mol) from the PB reference for the four data sets. Error metrics calculated for each set of predictions include root mean square deviation (RMSD), average signed error (< error >), square of the Pearson correlation coefficient (), and RMSD of the 20% of complexes with the worst agreement with the reference.
| Data Set 1 (small complexes) | Data Set 2 (protein-drug) | |||||||
|---|---|---|---|---|---|---|---|---|
| RMSD | < error > | RMS of worst 20% | RMSD | < error > | RMS of worst 20% | |||
| GBNSR6 | 21.8 | 0.999987 | 18.6 | 31.0 | 21.4 | 0.999974 | 9.3 | 42.7 |
| GB-HCT | 33.6 | 0.999703 | −25.3 | 52.5 | 260.6 | 0.999938 | −211.1 | 358.3 |
| GB-OBC | 24.7 | 0.999904 | 20.8 | 36.8 | 172.8 | 0.999764 | 134.7 | 271.8 |
| GB-neck2 | 6.7 | 0.999919 | −0.2 | 12.5 | 36.1 | 0.999902 | 1.5 | 70.7 |
| GBSW | 11.5 | 0.999799 | 5.4 | 20.0 | 148.0 | 0.999937 | −119.1 | 216.8 |
| GBMV1 | 135.4 | 0.999537 | −109.2 | 201.2 | 895.7 | 0.999854 | −731.9 | 1164.2 |
| GBMV2 | 14.0 | 0.999964 | 12.0 | 23.3 | 19.9 | 0.999971 | 2.5 | 37.1 |
| GBMV3 | 25.0 | 0.999945 | 20.7 | 37.3 | 80.7 | 0.999954 | 62.6 | 131.0 |
| Data Set 3 (protein-protein) | Data Set 4 (RNA-peptide) | |||||||
| RMSD | < error > | RMS of worst 20% | RMSD | < error > | RMS of worst 20% | |||
| GBNSR6 | 39.5 | 0.999944 | 33.8 | 66.9 | 14.5 | 0.999997 | 11.1 | 25.3 |
| GB-HCT | 358.2 | 0.996962 | −275.7 | 674.3 | 141.7 | 0.999835 | 95.0 | 256.8 |
| GB-OBC | 93.0 | 0.999094 | −26.5 | 193.4 | 351.7 | 0.999343 | 276.1 | 578.0 |
| GB-neck2 | 101.4 | 0.999391 | −62.2 | 205.6 | 153.5 | 0.999701 | −111.06 | 292.8 |
| GBSW | 112.8 | 0.999709 | −89.8 | 211.8 | 67.5 | 0.999902 | −46.5 | 126.3 |
| GBMV1 | 701.0 | 0.995759 | −610.7 | 1167.1 | 176.6 | 0.999571 | −144.1 | 301.5 |
| GBMV2 | 39.0 | 0.999786 | 13.0 | 71.5 | 59.8 | 0.999947 | 34.0 | 111.4 |
| GBMV3 | 82.1 | 0.999759 | 64.3 | 148.5 | 44.2 | 0.99996 | −28.0 | 85.9 |
Figure 2:

a) The root-mean-square deviations (RMSD) between the electrostatic solvation free energies () predicted by the different generalized Born (GB) models and those predicted by the reference PB for different protein-ligand complex sets. b) The RMSD between the electrostatic binding free energies () predicted by the different GB models and those predicted by the reference PB for different complex sets.
Although the between given by the PB equation and that given by the different GB models was good for each GB model, the RMSD and average deviations are significant, and significantly different, between different GB models (Table 3 and Figure 2(a)). For instance, while the RMSD of GBNSR6 from the PB is smaller than 39.5 kcal/mol over all four sets, the RMSD of GBMV1 from the PB can be as large as 895.5 kcal/mol (data set 2). That GBMV1 is less accurate than GBMV2 agrees with similar GB calculations done elsewhere.73 The RMSD of GB-OBC varies significantly over the four sets, from 24.7 kcal/mol for data set 1 to 351.7 kcal/mol for data set 4. Among the AMBER GB models GBNSR6 has the smallest RMSD over-all (all four sets), and among the CHARMM GB models GBMV2 had the smallest over-all RMSD from the PB; the largest RMSD in for these models are 39.5 kcal/mol (data set 3) and 59.8 kcal/mol (data set 4), respectively. However, for one structure set, namely set 1 (small neutral complexes), it is GB-neck2 that shows the best agreement with the PB.
The non-zero average deviations (Table 3) point to the existence of noticeable systematic shifts between the estimates of obtained with the PB equation and those obtained with the GB models, which can be due to a number of factors, including the following: (i) Differences in the dielectric boundary definitions used by each model. The PB model typically employs a grid-resolution dependent numerical approximation to the molecular surface, while the analytical GB models mimic this surface to varying degrees of accuracy. The exception is GBNSR6 – the version used here employs essentially the same molecular surface, but still subtle details, including surface discretization, differ between it and what is used by the PB solver used here. (ii) The GB Green function is only an approximation to the correct Green function of the Poisson problem for a given dielectric boundary. The errors introduced by the approximate Green function can be tested via perfect (PB-based) effective Born radii, as discussed below. One should notice that since binding free energies depend on the “difference” in values of the receptor and ligand components, large systematic shifts in can cancel out in the binding free energy calculations. In general, we find that GB models that show lower RMSD with respect to the PB values are also in better agreement with the PB values, as discussed below.
Electrostatic Binding Free Energies ()
Figure 2(b), and Table 5 (bottom) summarize the deviation of different GB models from the PB reference in estimating , while the breakdown by individual structure test sets is given in Figure 3 and Table 4. We calculate RMSD, average signed error < error >, , and the RMSD of the 20% of complexes with the largest deviations between given by the PB equation and those given by the GB model.
Table 5:
Overall deviation of the GB and values (kcal/mol) from those computed with the numerical PB equation for 60 complexes. The error metrics are described in Table 3.
| RMSD | < error > | RMS of worst 20% | ||
|---|---|---|---|---|
| GBNSR6 | 27.2 | 0.9999 | 19.5 | 50.2 |
| GB-HCT | 240.5 | 0.9945 | −114.6 | 471.8 |
| GB-OBC | 189.8 | 0.9974 | 86.6 | 395.9 |
| GB-neck2 | 92.1 | 0.9994 | −42.4 | 198.1 |
| GBSW | 97.9 | 0.9995 | −61.4 | 194.2 |
| GBMV1 | 577.0 | 0.9862 | −402.1 | 1123.3 |
| GBMV2 | 37.0 | 0.9999 | 15.0 | 75.2 |
| GBMV3 | 63.4 | 0.9996 | 32.3 | 123.4 |
| RMSD | < error > | RMS of worst 20% | ||
| GBNSR6 | 8.7 | 0.994889 | −2.4 | 17.3 |
| GB-HCT | 56.2 | 0.89914 | −40.2 | 113.2 |
| GB-OBC | 32.3 | 0.955511 | −19.4 | 66.0 |
| GB-neck2 | 34.0 | 0.949725 | −19.4 | 70.8 |
| GBSW | 34.6 | 0.91599 | −9.4 | 73.6 |
| GBMV1 | 61.0 | 0.804674 | −35.9 | 191.2 |
| GBMV2 | 40.4 | 0.913058 | −22.1 | 69.7 |
| GBMV3 | 26.8 | 0.953638 | −2.8 | 54.4 |
Figure 3:


Correlation between electrostatic binding free energies () computed by different GB models relative to the PB. The diagonal lines are .
Table 4:
Deviation of GB values (kcal/mol) from the PB reference for the four data sets. The error metrics are described in Table 3.
| Data Set 1 (small complexes) | Data Set 2 (protein-drug) | |||||||
|---|---|---|---|---|---|---|---|---|
| RMSD | < error > | RMS of worst 20% | RMSD | < error > | RMS of worst 20% | |||
| GBNSR6 | 3.9 | 0.979509 | −3.6 | 6.71 | 4.2 | 0.949986 | 1.2 | 5.9 |
| GB-HCT | 10.7 | 0.876606 | −10.2 | 15.4 | 27.7 | 0.764202 | −26.0 | 40.3 |
| GB-OBC | 7.3 | 0.946172 | −6.7 | 10.6 | 23.0 | 0.377215 | −19.6 | 37.0 |
| GB-neck2 | 6.2 | 0.910121 | −5.5 | 10.6 | 13.5 | 0.776368 | −10.8 | 24.3 |
| GBSW | 4.0 | 0.924397 | 3.0 | 6.61 | 12.8 | 0.774716 | −9.53 | 22.2 |
| GBMV1 | 9.6 | 0.463363 | 6.9 | 15.6 | 9.5 | 0.822145 | −5.81 | 16.5 |
| GBMV2 | 5.0 | 0.935667 | −4.2 | 7.7 | 7.0 | 0.88332 | 1.8 | 11.4 |
| GBMV3 | 3.0 | 0.913558 | 0.2 | 6.0 | 24.4 | 0.638866 | −20.0 | 39.2 |
| Data Set 3 (protein-protein) | Data Set 4 (RNA-peptide) | |||||||
| RMSD | < error > | RMS of worst 20% | RMSD | < error > | RMS of worst 20% | |||
| GBNSR6 | 13.6 | 0.985424 | −9.2 | 24.7 | 7.7 | 0.998602 | 4.7 | 12.4 |
| GB-HCT | 50.8 | 0.921176 | −41.8 | 87.1 | 100.5 | 0.91455 | −89.2 | 148.1 |
| GB-OBC | 26.2 | 0.927651 | −11.9 | 46.8 | 57.3 | 0.950862 | −45.0 | 87.8 |
| GB-neck2 | 24.0 | 0.952807 | −11.1 | 43.9 | 65.7 | 0.95664 | −56.5 | 99.1 |
| GBSW | 19.0 | 0.948775 | 4.3 | 32.2 | 69.6 | 0.892278 | −43.8 | 95.4 |
| GBMV1 | 49.1 | 0.94559 | −44.2 | 71.4 | 116.9 | 0.920653 | −107.0 | 152.6 |
| GBMV2 | 25.9 | 0.99063 | −24.6 | 36.4 | 80.9 | 0.921114 | −64.5 | 123.3 |
| GBMV3 | 16.8 | 0.981712 | 12.4 | 30.0 | 48.2 | 0.924991 | −10.5 | 86.3 |
Over-all, a wide range of accuracies is seen across the GB models tested, as judged by the different accuracy metrics. Some models (GBNSR6 and GBMV3) have very little over-all systematic deviation from the PB, Table 5 (bottom), while others deviate substantially, e.g. the older GB-HCT and GBMV1 models. By far the lowest RMS error and the highest correlation are seen for GBNSR6 model.
With regards to specific structures, the GB models also demonstrate a wide range of accuracies. For data set 1 (small, net neutral complexes), all of the GB models show good correlation with the reference PB values, except for GBMV1 for which is 0.46. The low correlation of GBMV1 with the PB values in set 1 is consistent with the large RMSD and average error of the values (RMSD=135.35 kcal/mol, Table 3). The average error in for all of AMBER GB models (GBNSR6, GB-HCT, GB-OBC and GB-neck2) is negative (also see Figure 3), suggesting that the deviation of AMBER GB models from the used PB reference contains a systematic component for this test set. For data set 1, the best correlations with the PB reference is seen for the GBNSR6 () and GBMV3 () predictions.
Overall, the lowest correlation with the PB across different GB models is seen for data set 2 (protein-drug). Perhaps not surprisingly, GB-OBC, which is among the oldest commonly used generalpurpose GB models, yields a very low correlation with the PB (). The correlation () is only 0.63 for GBMV3, which is surprising for a model shown to be superior to other CHARMM GB models.73,140 Only GBNSR6 maintains a high correlation with the PB for this data set, () (Table 4 and Figure 3), suggesting that challenging structures are less forgiving to the approximations made by multi-parameter GB models; among the tested models, GBNSR6 makes the fewest approximations relative to the PB.
On average, the best agreement with the PB across all of the GB models is achieved for data set 3 (protein-protein). The lowest correlation with the PB is again for the oldest models, GB-HCT and GB-OBC, with (), while is as good as ≈0.99 for GBMV2 and GBNSR6. GBNSR6 also yields the smallest RMSD (13.62 kcal/mol). The good agreement with the PB for this set is not unexpected given that for most of the GB models protein structures were always a large part of the training set against which parameters of the models were fit.
Data set 4 (RNA-peptide) appears to be the most challenging for all the GB models, with the exception of GBNSR6, with respect to RMSD from the PB (Table 4 and Figure 2). The average is the second worst among all of the data sets, after data set 2. Again, GB-NSR6 is an exception for this set: it shows a relatively small deviation from the PB (RMSD=7.64 kcal/mol, ).
Overall, the large average deviation (and RMSD) from the PB reference in set 4 and the lowest deviation for set 1 correlate well with the average net charge of these two data sets, which carry the largest and smallest average net charges, respectively (Table 2). Our explanation for the correlation is as follows: errors in the GB Green function are scaled by the partial atomic charges, Eq. (2). Note that all structures in set 4 contain fragments of nucleic acids (RNA), which carry high partial charges on the phosphate groups, absent from all other sets.
The correlation with the PB is worst for set 2 (not set 4, with the highest average charge), which suggests a more nuanced story. As discussed above, we conjecture that the GB Green function performs the worst for set 2, leading to the poorest correlation with the PB, but the resulting error values (RMSD and average) are not as high because the structures are not as highly charged (in the limit of zero partial charges, even very low correlation with the PB can still result in small average and RMS errors).
Correlation between accuracy in and calculations
In general, we find a moderate correlation between the accuracies of and those of , Figure 4. Namely, when comparing a group of GB models that agree most with the PB to the group that agrees the least, there is a clear correlation between the accuracy of and between these broad groups (but not within each group), whether the accuracy is measured by RMSD or (Figure 4 (a) and (b)). Thus, a very poor agreement of with the PB suggests that a good agreement of is unlikely. Even though the range of in GB is very narrow (all values are fairly close to 1 for all the GB models), Figure 4 (b) , there is still a correlation between of and of . This correlation suggests that, when interpreting for , decimal numbers up to the fourth or even fifth place can be important for distinguishing between the accuracies of different GB models in terms of their ability to predict . In addition, Figure 4 (c) shows that the accuracy of and measured by RMSD and , respectively, also correlate to some extent (Figure 4 (c)). Given that RMSD in from the PB can significantly differ across GB models (Table 3), the data shown in Figure 4 (c) suggests that, for GB models on the “opposite ends" of the accuracy spectrum, RMSD can be used instead of to distinguish between the models.
Figure 4:

Correlation between the accuracy of the GB models in predicting electrostatic binding affinities () and their accuracy in predicting electrostatic solvation free energy components () over all of the four complex sets. a) RMSD in versus RMSD in b) correlations in versus correlations in c) correlations in versus RMSD in .
One can not reliably predict the accuracy in based on that of for models where the difference is not very large. For example, while the RMSD in for GBMV2 (37 kcal/mol) is smaller than that of GBMV3 (63.4 kcal/mol); the RMSD in for GBMV2 is larger (40.4 kcal/mol) than that of GBMV3 (26.8 kcal/mol) (Table 5). Another example is the GB-OBC model which is the third least accurate model in values (RMSD=189.8 kcal/mol), yet it is the third most accurate model in values (RMSD=32.3 kcal/mol, ) (Table 5).
Source of the remaining error in GBNSR6
The error of the GB relative to the exact solution of the corresponding Poisson problem can be decomposed into two independent components: the error in approximating the effect of the same (in our case, molecular surface) dielectric boundary, and the error due to the approximate GB Green function.125 The first error can be eliminated by the use of the perfect, that is PB-based effective Born radii – while computationally ineffective, these effective radii ensure that the self terms in Eq. (2) and the corresponding PB values are identical, so that the remaining error is only due to the cross terms, i.e. the GB Green function itself. As the errors of GBNSR6 on our test sets are the lowest, we chose this model to explore the origin of the remaining error. Due to computational constraints, we focused on set 1 – the set of smallest structures among the four sets used here. We calculated perfect effective Born radii63,122 for the two complexes from set 1 that showed the largest deviation of GBNSR6 values from the PB reference. The results, Figure 5, show that, for the two complexes tested here, most of the effective Born radii from GBNSR6 correlate well with the perfect (PB) radii, with the discrepancies seen mostly for “buried” atoms. We attribute most of the discrepancy to subtle differences between the dielectric boundary employed by this version of GB-NSR6 (the dielectric boundary is the molecular surface calculated by MSMS149) and the approximation of the boundary employed by typical PB solvers, see Ref.136 for further details. One may excpect that reducing/eliminating the difference between the dielectric boundaries used by the GB and PB should further reduce the discrepancy between the respective values. This appears to be the case: an algorithm that produces a molecular surface very similar to that used by the PB solvers was recently implemented136 for GBNSR6 in AMBER, which reduced the RMSD error with respect to PB by more than a factor of two, from 3.9 to 1.43 kcal/mol for data set 1 for which the evaluation was performed.136
Figure 5:

Comparing the inverse effective Born radii calculated using GBNSR6 and PB (perfect radii) for two complexes from data set 1: a) 1bkf , b) 1fkl. The diagonal lines are .
One might conjecture that direct use of the perfect effective Born in the GB equation would bring the GB accuracy into an even closer accord with the PB, but that may not be the case, at least based on the two structures we tested here. Namely, when the perfect radii were used instead of the R6 radii in the GB Eq. (2), we found no reduction of the GB. vs. PB error in , Table 6. The fact that for GBNSR6 the agreement with the PB does not necessarily improve with the use of the perfect Born radii, suggests that for this GB model a significant component of the remaining error in likely comes from the approximate GB Green function itself, and not from the way the effective Born radii are approximated109,133 within this model. It remains to be seen whether more sophisticated GB Green functions123 can further reduce the GB vs. PB gap in ligand binding calculations.
Insensitivity to the input radii set
We have explicitly confirmed that our key conclusions are not affected by the use of Bondi intrinsic atomic radii instead of recommended defaults for the models where this choice may matter. Here we tested CHARMM GB models (i. e. GBSW, GBMV1, GBMV2, GBMV3) on our data set 2, using CHARMM atomic radii instead of Bondi radii set. We chose this set of structures for the sensitivity test because this set shows the poorest correlation with the reference, so that if a “native” radii set were to lead to an improvement, we would most likely see it on this set. We used the same logic to also evaluate the sensitivity of GB-neck2 to input radii. Here we used the other “worst performing” data set – set 4. We chose to test GB-neck2 on set 4 because GB-neck2 uses a different set of parameters for nucleic-acids,150 and our data set 4 is the only one that contains nucleic acid fragments. For the test we employed mbondi3 radii (the suggested default for GB-neck2) instead of Bondi radii. As before, we used the matching set of atomic radii for the reference PB calculations. Our results show that the RMS deviations and correlations change only slightly when using these different atomic radii other than Bondi (Table 7). Thus, all of our conclusions based on the Bondi radii set remain valid.
Table 7:
Sensitivity of some GB models to the choice of atomic radii set. Deviations of values (kcal/mol) relative to the PB values using the same set of radii for both GB and PB. Included here are root mean square deviation (RMSD) and square of the Pearson correlation coefficient ().
| Data Set 3 (protein-protein) | ||||
|---|---|---|---|---|
| (RMSD) | () | |||
| Bondi radii | CHARMM radii | Bondi radii | CHARMM radii | |
| GBSW | 19.0 | 29.6 | 0.949 | 0.972 |
| GBMV1 | 49.1 | 43.7 | 0.946 | 0.937 |
| GBMV2 | 25.9 | 18.4 | 0.991 | 0.971 |
| GBMV3 | 16.8 | 20.5 | 0.982 | 0.960 |
| Data Set 4 (RNA-peptide) | ||||
| (RMSD) | () | |||
| Bondi radii | mbondi3 radii | Bondi radii | mbondi3 radii | |
| GB-neck2 | 65.7 | 64.7 | 0.957 | 0.953 |
Conclusions
High computational efficiency of the methods used in calculation of binding free energies is important, since these calculations involve large structural ensembles. The GB models offer a good compromise: they offer efficient methods to account for solvent without explicitly modeling individual water molecules. Of course, the accuracy of the models are absolutely critical. This study is an attempt to systematically investigate the accuracy of several commonly used GB models implemented in major molecular simulation packages in estimating the electrostatic component of the binding free energy, . The accuracy was evaluated in terms of agreement with the numerical PB reference across a diverse set of 60 biomolecular complexes.
Our findings demonstrate that different GB models lead to very different binding affinity predictions, with correlations varying largely among different complex sets, as well as among different GB models. An assessment of the literature reveals a similar behavior of correlations for other biomolecular complexes with varying energy range and spacing (e.g.,151-155). Future studies comparing the GB-predicted solvation free energies directly to experiment will need to evaluate to what extend performance of the promising GB models holds in realistic problems where binding free energies differ by only a few kcal/mol. Among all different complex sets tested in this study, the best agreement with the PB was achieved for the protein-protein set, most likely because protein structures have been conventionally used as a large part of the training sets used in the parametrization of most GB models. The correlation with PB is also good for the small complexes, which are relatively small and not highly charged. However, the RNA-peptide and protein-drug complex sets appeared to be most challenging. The RNA-peptide complexes are highly charged, which can directly scale up the errors in GB Green function. The force field parameters for “non-standard” drug components seem to be least compatible with the GB parameters, leading to a relatively low correlation with PB for the protein-drug test.
In general, we found that newer GB models are in better agreement with PB. Among the CHARMM GB models and considering all biomolecular complexes together, GBMV3 shows the highest correlation with the PB results for , whereas GBMV2 is the most accurate for . Moreover, GBMV1, which is an older analytical model, is less accurate than the other CHARMM GB models when all biomolecular complexes are combined into one data set. Among the GB models available in AMBER, GB-neck2, which has the largest number of fitting parameters, shows a good performance (second to GBNSR6) cross different test cases.
In contrast to the other GB models tested here, GB-NSR6 uses only a single additional (to the PB) fitting parameter, yet it shows the best agreement with reference PB among all of the models tested. The closer agreement of GBNSR6 with PB consistently holds over all the four complex sets. Moreover, the accuracy of GBNSR6 estimates does not change significantly if the perfect (PB-based) Born radii are used instead of the R6 radii calculated within GBNSR6. Therefore, we conjecture that the majority of the remaining error in GBNSR6 estimates of are due to the approximate GB Green function itself, rather than the way the effective Born radii is calculated. Future work will assess if this superior accuracy of the GBNSR6 model in Amber translates into better binding affinity ranking when performing MM(GB)SA studies of biomolecular systems for which solid experimental data is available.
We have also explored how the accuracy of the GB models in predicting correlates with their ability to predict the binding affinity . To summarize our finding with respect to the PB accuracy metric: (1) GB models that tend to be considerably more accurate in electrostatic solvation free energies generally perform better in binding affinity calculations. (2) The GB models that lead to poor correlations with PB in electrostatic solvation free energies are least accurate in terms of electrostatic binding free energies. (3) Nevertheless, the same accuracy rank order of GB models in the electrostatic solvation free energies does not necessarily hold in the corresponding electrostatic binding affinities predictions;56,63 especially when the differences in accuracies are relatively small.
We stress that a close agreement of a GB model with the numerical PB, while being an important metric of the GB accuracy,124 does not, by itself, guarantee an equally good performance of the model compared to experiment. There are several reasons for the lack of a simple direct correlation here. First, the PB itself is a serious approximation to reality – a mean field, linear response treatment of atomistic electrostatic interactions in the presence of discrete water molecules and mobile ions replaced by a structureless continuum within the PB. Equally importantly, the PB accounts for the electrostatics only, while the total binding free energy includes other important contributions, most notably non-polar and entropic, the latter being particularly difficult to estimate computationally – these contributions have not been considered here. At the same time, a significant disagreement of a GB model with the PB on a certain class of structures is a serious indication of potential problems in practical calculations.
Indeed, many publications based on MM(GB)SA and MM(PB)SA have shown a large variability in their predictive power when compared against experimental binding data.156-160 Based on the very different accuracies of the GB models tested here and elsewhere (e.g.,151,161-163), we think this variability may stem largely from the very different electrostatic binding free energies predicted by the different GB models. The results presented here can therefore be used as a guide for the MM(GB)SA/MM(PB)SA community in deciding which GB models should be avoided, and which are more likely than others to lead to more accurate results. The much improved level of accuracy in more recent GB methods provides an encouraging basis for more reliable GB simulations and opens a whole new range of possibilities for reaching larger system sizes in ligand binding calculations. Future studies will need to evaluate the most promising GB models against appropriate experimental data. In this sense, “electrostatic" mutants in which a charged amino acid is mutated into a neutral one may be particularly useful, as the electrostatic effects should dominate the corresponding free energy changes to the binding free energy. Finally, the detailed description of parameters of many practical GB flavors provided here may serve as a useful reference guide for those wishing to utilize these models in ligand binding computations.
Supplementary Material
Acknowledgement
A.V.O. thanks the NIH (GM076121) for partial support. All authors thank Michael Feig for many helpful discussions of the GBMV family of models and the associated best practices; Carlos Simmerling for suggestions regarding best radii set for GB-neck2; Wonpil Im for a comment on GBSW model. M.O.F thanks Dr. Ilisa Lee for doing a literature search on MMPB(GB)SA studies and Dr. Michael E. Zawrotny for maintaining and helping with computer resources at the Institute of Molecular Biophysics in Florida State University.
Footnotes
Supporting Information Available: A README file, one folder for each of the four data sets and *.dat files describing the names of the files in the folder are provided. The Amber topology (*.top) and coordinate (*.crd) are included in each of the four folders. The *.pqr files are also included in each of the four folders. This material is available free of charge via the Internet at http://pubs.acs.org/.
References
- (1).Chodera JD; Mobley DL; Shirts MR; Dixon RW; Branson K; Pande VS Curr. Opin. Struct. Biol 2011, 21, 150–160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (2).Wang L; Wu Y; Deng Y; Kim B; Pierce L; Krilov G; Lupyan D; Robinson S; Dahlgren MK; Greenwood J; Romero DL; Masse C; Knight JL; Steinbrecher T; Beuming T et al. J. Am. Chem. Soc 2015, 137, 2695–2703. [DOI] [PubMed] [Google Scholar]
- (3).Jorgensen WL Science 2004, 303, 1813–1818. [DOI] [PubMed] [Google Scholar]
- (4).Mobley DL; Dill KA Structure 2009, 17, 489–498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (5).Shirts MR; Mobley DL; Brown SP In Structure Based Drug Design, 1st ed.; Merz KM, Ringe D, Reynolds CH, Eds.; Lecture Notes in Computer Science; Cambridge University Press: Cambridge, New York USA, 2010; pp 61–85. [Google Scholar]
- (6).Kastritis PL; Bonvin AMJJ J. R. Soc. Interface 2012, 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Lau AY; Roux BA Nat. Struct. Mol. Biol 2011, 18, 283–287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Lin Y-L; Meng Y; Jiang W; Roux B Proc. Natl. Acad. Sci. U. S.A 2013, 110, 1664–1669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (9).Gilson MK; Zhou H-X Annu. Rev. Biophys. Biomol. Struct 2007, 36, 21–42. [DOI] [PubMed] [Google Scholar]
- (10).Pollard TD Mol. Biol. Cell 2010, 21, 4061–4067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Kollman P. Chem. Rev 1993, 93, 2395–2417. [Google Scholar]
- (12).Beveridge DL; ; DiCapua FM Annu. Rev. Biophys. Biophys. Chem 1989, 18, 431–492. [DOI] [PubMed] [Google Scholar]
- (13).Åqvist J; Medina C; Samuelsson J-E Protein Eng. 1994, 7, 385–391. [DOI] [PubMed] [Google Scholar]
- (14).Srinivasan J; Cheatham TE III.; Cieplak P; Kollman PA; Case DA J. Am. Chem. Soc 1998, 120, 9401–9409. [Google Scholar]
- (15).Kollman PA; Massova I; Reyes C; Kuhn B; Huo S; Chong L; Lee M; Lee T; Duan Y; Wang W; Donini O; Cieplak P; Srinivasan J; Case DA; Cheatham TE III. Acc. Chem. Res 2000, 33, 889–897. [DOI] [PubMed] [Google Scholar]
- (16).Gohlke H; Kiel C; Case DA J. Mol. Biol 2003, 330, 891–913. [DOI] [PubMed] [Google Scholar]
- (17).Hu Y; Sherborne B; Lee T-S; Case DA; York DM; Guo Z J. Comput. Aided Mol. Des 2016, 30, 533–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (18).Mobley DL; Dill KA; Chodera JD J. Phys. Chem. B 2008, 112, 938–946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Mobley DL; Bayly CI; Cooper MD; Shirts MR; Dill KA J. Chem. Theory Comput 2009, 5, 350–358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Shivakumar D; Deng Y; Roux B J. Chem. Theory Comput 2009, 5, 919–930. [DOI] [PubMed] [Google Scholar]
- (21).Jorgensen WL; Chandrasekhar J; Madura JD; Impey RW; Klein ML J. Chem. Phys 1983, 79, 926–935. [Google Scholar]
- (22).Horn HW; Swope WC; Pitera JW; Madura JD; Dick TJ; Hura GL; Head-Gordon T J. Chem. Phys 2004, 120, 9665–9678. [DOI] [PubMed] [Google Scholar]
- (23).Izadi S; Anandakrishnan R; Onufriev AV 2014, 5, 3863–3871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (24).Söderhjelm P; Tribello GA; Parrinello M Proc. Natl. Acad. Sci. USA 2012, 109, 5170–5175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (25).Proctor EA; Yin S; Tropsha A; Dokholyan NV Biophys. J 2012, 102, 144 – 151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (26).Gao K; Yin J; Henriksen NM; Fenley AT; Gilson MK J. Chem. Theory Comput 2015, 11, 4555–âĂŞ4564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (27).Izadi S; Aguilar B; Onufriev AV J. Chem. Theory Comput 2015, 11, 4450–4459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).Cramer CJ; Truhlar DG Chem. Rev 1999, 99, 2161–2200. [DOI] [PubMed] [Google Scholar]
- (29).Honig B; Nicholls A Science 1995, 268, 1144–1149. [DOI] [PubMed] [Google Scholar]
- (30).Beroza P; Case DA Methods Enzymol. 1998, 295, 170–189. [DOI] [PubMed] [Google Scholar]
- (31).Madura JD; Davis ME; Gilson MK; Wade RC; Luty BA; McCammon JA Rev. Comp. Chem 1994, 5, 229–267. [Google Scholar]
- (32).Gilson MK Curr. Opin. Struct. Biol 1995, 5, 216–223. [DOI] [PubMed] [Google Scholar]
- (33).Scarsi M; Apostolakis J; Caflisch A J. Phys. Chem. A 1997, 101, 8098–8106. [Google Scholar]
- (34).Luo R; David L; Gilson MK J. Comput. Chem 2002, 23, 1244–1253. [DOI] [PubMed] [Google Scholar]
- (35).Simonson T. Rep. Prog. Phys 2003, 66, 737–787. [Google Scholar]
- (36).Onufriev A. In Modeling Solvent Environments, 1st ed.; Feig M, Ed.; Wiley: USA, 2010; pp 127–165. [Google Scholar]
- (37).Labute PJ Comput. Chem 2008, 29, 1693–1698. [DOI] [PubMed] [Google Scholar]
- (38).Nicholls A; Honig BJ Comput. Chem 1991, 12, 435–445. [Google Scholar]
- (39).Baker NA; Sept D; Joseph S; Holst MJ; McCammon JA Proc. Natl. Acad. Sci. U. S.A 2001, 98, 10037–10041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (40).Bashford D; Karplus M Biochemistry 1990, 29, 10219–10225. [DOI] [PubMed] [Google Scholar]
- (41).Im W; Beglov D; Roux B Comput. Phys. Commun 1998, 111, 59–75. [Google Scholar]
- (42).Madura JD; Davist ME; Gilson MK; Wades RC; Luty BA; McCammon JA Reviews in Computational Chemistry; John Wiley and Sons, Inc., 2007; pp 229–267. [Google Scholar]
- (43).Altman MD; Bardhan JP; White JK; Tidor B J. Comput. Chem 2009, 30, 132–153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (44).Li B; Cheng X; Zhang Z SIAM J. Appl. Math 2011, 71, 2093–2111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (45).Simonov NA; Mascagni M; Fenley MO J. Chem. Phys 2007, 127, 185105. [DOI] [PubMed] [Google Scholar]
- (46).Boschitsch AH; Fenley MO; Zhou H-X J. Phys. Chem. B 2002, 106, 2741–2754. [Google Scholar]
- (47).Boschitsch AH; Fenley MO J. Comput. Chem 2004, 25, 935–955. [DOI] [PubMed] [Google Scholar]
- (48).Mackoy T; Harris RC; Johnson J; Mascagni M; Fenley MO Commun. Comput. Phys 2013, 13, 195–206. [Google Scholar]
- (49).Boschitsch AH; Fenley MO In Computational Electrostatics for Biological Applications: Geometric and Numerical Approaches to the Description of Electrostatic Interaction Between Macromolecules; Rocchia W, Spagnuolo M, Eds.; Springer International Publishing: Switzerland, 2015; pp 73–110. [Google Scholar]
- (50).Fenley MO; Mascagni M; McClain J; Silalahi ARJ; Simonov NA J. Chem. Theory Comput 2009, 6, 300–314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (51).Baker NA Curr. Opin. Struct. Biol 2005, 15, 137–143. [DOI] [PubMed] [Google Scholar]
- (52).Boschitsch AH; Fenley MO J. Chem. Theory Comput 2011, 7, 1524–1540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (53).Chen D; Chen Z; Chen C; Geng W; Wei G-W J. Comput. Chem 2011, 32, 756–770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (54).Wang J; Tan C; Chanco E; Luo R Physical chemistry chemical physics : PCCP 2010, 12, 1194–1202. [DOI] [PubMed] [Google Scholar]
- (55).Sarkar S; Witham S; Zhang J; Zhenirovskyy M; Rocchia W; Alexov E Communications in computational physics 2013, 13, 269–284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (56).Harris RC; Mackoy T; Fenley MO J. Chem. Theory Comput 2015, 11, 705–712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (57).Fenley MO; Harris RC; Mackoy T; Boschitsch AH J. Comput. Chem 2015, 36, 235–243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (58).Hoijtink G; De Boer E; Van der Meij P; Weijland W Recl. Trav. Chim. Pays-Bas 1956, 75, 487–503. [Google Scholar]
- (59).Tucker SC; Truhlar DG Chem. Phys. Lett 1989, 157, 164–170. [Google Scholar]
- (60).Still WC; Tempczyk A; Hawley RC; Hendrickson T J. Am. Chem. Soc 1990, 112, 6127–6129. [Google Scholar]
- (61).Bashford D; Case DA Annu. Rev. Phys. Chem 2000, 51, 129–152. [DOI] [PubMed] [Google Scholar]
- (62).Chen J; Brooks III CL; Khandogin J Curr. Opin. Struct. Biol 2008, 18, 140–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (63).Harris RC; Mackoy T; Fenley MO Mol. Based Math. Biol 2013, 1, 63–74. [Google Scholar]
- (64).Feig M; Brooks III CL Curr. Opin. Struct. Biol 2004, 14, 217–224. [DOI] [PubMed] [Google Scholar]
- (65).Hawkins GD; Cramer CJ; Truhlar DG Chem. Phys. Lett 1995, 246, 122–129. [Google Scholar]
- (66).Haberthür U; Caflisch A J. Comput. Chem 2007, 29, 701–715. [DOI] [PubMed] [Google Scholar]
- (67).Hawkins GD; Cramer CJ; Truhlar DG J. Phys. Chem 1996, 100, 19824–19839. [Google Scholar]
- (68).Schaefer M; Karplus M J. Phys. Chem 1996, 100, 1578–1599. [Google Scholar]
- (69).Qiu D; Shenkin PS; Hollinger FP; Still WC J. Phys. Chem. A 1997, 101, 3005–3014. [Google Scholar]
- (70).Edinger SR; Cortis C; Shenkin PS; Friesner RA J. Phys. Chem. B 1997, 101, 1190–1197. [Google Scholar]
- (71).Jayaram B; Liu Y; Beveridge DL J. Chem. Phys 1998, 109, 1465–1471. [Google Scholar]
- (72).Ghosh A; Rapp CS; Friesner RA J. Phys. Chem. B 1998, 102, 10983–10990. [Google Scholar]
- (73).Lee MS; Salsbury FR; Brooks III CL J. Chem. Phys 2002, 116, 10606–10614. [Google Scholar]
- (74).Felts AK; Harano Y; Gallicchio E; Levy RM Proteins: Struct., Funct., Bioinf 2004, 56, 310–321. [DOI] [PubMed] [Google Scholar]
- (75).Romanov AN; Jabin SN; Martynov YB; Sulimov AV; Grigoriev FV; Sulimov VB J. Phys. Chem. A 2004, 108, 9323–9327. [Google Scholar]
- (76).Dominy BN; Brooks CL III. J. Phys. Chem. B 1999, 103, 3765–3773. [Google Scholar]
- (77).David L; Luo R; Gilson MK J. Comput. Chem 2000, 21, 295–309. [DOI] [PubMed] [Google Scholar]
- (78).Tsui V; Case DA J. Am. Chem. Soc 2000, 122, 2489–2498. [Google Scholar]
- (79).Calimet N; Schaefer M; Simonson T Proteins: Struct., Funct., Bioinf 2001, 45, 144–158. [DOI] [PubMed] [Google Scholar]
- (80).Spassov VZ; Yan L; Szalma S J. Phys. Chem. B 2002, 106, 8726–8738. [Google Scholar]
- (81).Simmerling C; Strockbine B; Roitberg AE J. Am. Chem. Soc 2002, 124, 11258–11259. [DOI] [PubMed] [Google Scholar]
- (82).Wang T; Wade RC Proteins: Struct., Funct., Bioinf 2003, 50, 158–169. [DOI] [PubMed] [Google Scholar]
- (83).Nymeyer H; García AE Proc. Natl. Acad. Sci. U. S.A 2003, 100, 13934–13949. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (84).Onufriev A; Bashford D; Case DA Proteins: Struct., Funct., Bioinf 2004, 55, 383–394. [DOI] [PubMed] [Google Scholar]
- (85).Gallicchio E; Levy RM J. Comput. Chem 2004, 25, 479–499. [DOI] [PubMed] [Google Scholar]
- (86).Lee MC; Duan Y Proteins: Struct., Funct., Bioinf 2004, 55, 620–634. [DOI] [PubMed] [Google Scholar]
- (87).Im W; Lee MS; Brooks CL III J. Comput. Chem 2003, 24, 1691–1702. [DOI] [PubMed] [Google Scholar]
- (88).Nguyen H; Perez A; Bermeo S; Simmerling C J. Chem. Theory Comput 2015, 11, 3714–3728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (89).Lange AW; Herbert JM J. Chem. Theory Comput 2012, 8, 1999–2011. [DOI] [PubMed] [Google Scholar]
- (90).Tjong H; Zhou H-X J. Phys. Chem. B 2007, 111, 3055–3061. [DOI] [PubMed] [Google Scholar]
- (91).Xu Z; Cheng X; Yang H J. Chem. Phys 2011, 134, 064107. [DOI] [PubMed] [Google Scholar]
- (92).Izadi S; Anandakrishnan R; Onufriev AV J. Chem. Theory Comput 2016, 12, 5946–5959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (93).Case DA; Cheatham III TE; Darden T; Gohlke H; Luo R; Merz KM Jr; Onufriev A; Simmerling C; Wang B; Woods RJ J. Comput. Chem 2005, 26, 1668–1688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (94).Brooks BR; Brooks III CL; MacKerell AD Jr.; Nilsson L; Petrella RJ; Roux B; Won Y; Archontis G; Bartels C; Boresch S; Caflisch A; Caves L; Cui Q; Dinner AR; Feig M et al. J. Comput. Chem 2009, 30, 1545–1614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (95).Pronk S; Páll S; Schulz R; Larsson P; Bjelkmar P; Apostolov R; Shirts MR; Smith JC; Kasson PM; van der Spoel D; Hess B; Lindahl E Bioinformatics 2013, 29, 845–854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (96).Phillips JC; Braun R; Wang W; Gumbart J; Tajkhorshid E; Villa E; Chipot C; Skeel RD; Kalé L; Schulten K J. Comput. Chem 2005, 26, 1781–1802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (97).Eastman P; Pande V Comput. Sci. Eng 2010, 12, 34–39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (98).Qin S; Zhou H-X Biopolymers 2007, 86, 112–118. [DOI] [PubMed] [Google Scholar]
- (99).Dong F; Zhou H-X Proteins: Struct., Funct., Bioinf 2006, 65, 87–102. [DOI] [PubMed] [Google Scholar]
- (100).Knight JL; Brooks CL III J. Comput. Chem 2011, 32, 2909–2923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (101).Bartlett RJ; Musiał M Rev. Mod. Phys 2007, 79, 291–352. [Google Scholar]
- (102).Helgaker T; Klopper W; Tew DP Molec. Phys 2008, 106, 2107–2143. [Google Scholar]
- (103).Fennell CJ; Kehoe CW; Dill KA Proc. Natl. Acad. Sci. U. S.A 2011, 108, 3234–3239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (104).Merz KM J. Chem. Theory Comput 2010, 6, 1769–1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (105).Harris RC; Boschitsch AH; Fenley MO J. Chem. Theory Comput 2013, 9, 3677–3685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (106).Harris RC; Boschitsch AH; Fenley MO J. Chem. Phys 2014, 140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (107).Sørensen J; Fenley MO; Amaro RE In Computational Electrostatics for Biological Applications: Geometric and Numerical Approaches to the Description of Electrostatic Interaction Between Macromolecules; Rocchia W, Spagnuolo M, Eds.; Springer International Publishing: Switzerland, 2015; pp 39–71. [Google Scholar]
- (108).Cramer CJ; Truhlar DG Acc. Chem. Res 2008, 41, 760–768. [DOI] [PubMed] [Google Scholar]
- (109).Aguilar B; Onufriev AV J. Chem. Theory Comput 2012, 8, 2404–2411. [DOI] [PubMed] [Google Scholar]
- (110).Nguyen H; Maier J; Huang H; Perrone V; Simmerling C J. Am. Chem. Soc 2014, 136, 13959–13962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (111).Berman HM; Westbrook J; Feng Z; Gilliland G; Bhat TN; Weissig H; Shindyalov IN; Bourne PE Nucleic Acids Res. 2000, 28, 235–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (112).Katkova EV; Onufriev AV; Aguilar B; Sulimov VB J. Mol. Graphics Mod 2017, 72, 70 – 80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (113).Dolinsky TJ; Nielsen JE; McCammon JA; Baker NA Nucleic Acids Res. 2004, 32, W665–W667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (114).Dolinsky TJ; Czodrowski P; Li H; Nielsen JE; Jensen JH; Klebe G; Baker NA Nucleic Acids Res. 2007, 35, W522–W525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (115).Case DA; Berryman JT; Betz RM; Cerutti DS; Cheatham III TE; Darden TA; Duke RE; Giese TJ; Gohlke H; Goetz AW; Homeyer N; Izadi S; Janowski P; Kaus J; Kovalenko A et al. AMBER 2015; University of California, San Francisco, 2015. [Google Scholar]
- (116).Hu CY; Kokubo H; Lynch GC; Bolen DW; Pettitt BM Prot. Sci 2010, 19, 1011–1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (117).Kokubo H; Hu CY; Pettitt BM J. Am. Chem. Soc 2011, 133, 1849–1858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (118).Kokubo H; Harris RC; Asthagiri D; Pettitt BM J. Phys. Chem. B 2013, 117, 16428–16435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (119).Harris RC; Pettitt BM Proc. Natl. Acad. Sci. U. S.A 2014, 111, 14681–14686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (120).Baker NA; Sept D; Joseph S; Holst MJ; McCammon JA Proc. Natl. Acad. Sci. U. S.A 2001, 98, 10037–10041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (121).Srinivasan J; Trevathan MW; Beroza P; Case DA Theor. Chem. Acc 1999, 101, 426–434. [Google Scholar]
- (122).Onufriev A; Case DA; Bashford D J. Comput. Chem 2002, 23, 1297–1304. [DOI] [PubMed] [Google Scholar]
- (123).Onufriev AV; Sigalov G J. Chem. Phys 2011, 134, 164104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (124).Onufriev AV; Izadi S Wiley Interdisciplinary Reviews: Computational Molecular Science e1347–n/a, e1347. [Google Scholar]
- (125).Onufriev AV; Aguilar B J. Theor. Comput. Chem 2014, 13, 1440006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (126).Sigalov G; Scheffel P; Onufriev A J. Chem. Phys 2005, 122, 094511. [DOI] [PubMed] [Google Scholar]
- (127).Hawkins G; Cramer C; Truhlar D J. Phys. Chem 1996, 100, 19824–19839. [Google Scholar]
- (128).Schaefer M; Van Vlijmen H; Karplus M Adv. Protein Chem 1998, 51, 1–57. [DOI] [PubMed] [Google Scholar]
- (129).Tsui V; Case D Biopolymers (Nucl. Acid. Sci.) 2001, 56, 275–291. [DOI] [PubMed] [Google Scholar]
- (130).Onufriev A; Bashford D; Case D J. Phys. Chem. B 2000, 104, 3712–3720. [Google Scholar]
- (131).Onufriev A; Bashford D; Case D Proteins: Struct., Funct., Bioinf 2004, 55, 383–394. [DOI] [PubMed] [Google Scholar]
- (132).Nguyen H; Roe DR; Simmerling C J. Chem. Theory Comput 2013, 9, 2020–2034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (133).Aguilar B; Shadrach R; Onufriev AV J. Chem. Theory Comput 2010, 6, 3613–3630. [Google Scholar]
- (134).Sigalov G; Fenley A; Onufriev A J. Chem. Phys 2006, 124, 124902. [DOI] [PubMed] [Google Scholar]
- (135).Mongan J; Simmerling C; McCammon JA; Case DAA; Onufriev A J. Chem. Theory Comput 2007, 3, 156–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (136).Forouzesh N; Izadi S; Onufriev AV J. Chem. Inf. Model 2017, 57, 2505–2513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (137).Homeyer N; Gohlke H Mol. Inf 2012, 31, 114–122. [DOI] [PubMed] [Google Scholar]
- (138).Miller BR III; McGee TD Jr; Swails JM; Homeyer N; Gohlke H; Roitberg AE J. Chem. Theory Comput 2012, 8, 3314–3321. [DOI] [PubMed] [Google Scholar]
- (139).Homeyer N; Gohlke H J. Comput. Chem 2013, 34, 965–973. [DOI] [PubMed] [Google Scholar]
- (140).Lee MS; Feig M; Salsbury FR Jr.; Brooks III CL J. Comput. Chem 2003, 24, 1348–1356. [DOI] [PubMed] [Google Scholar]
- (141).Im W; Feig M; Brooks CL III Biophys. J 2003, 85, 2900–2918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (142).Tjong H; Zhou H-X J. Chem. Theory Comput 2008, 4, 507–514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (143).Nina M; Beglov D; Roux B J. Phys. Chem. B 1997, 101, 5239–5248. [Google Scholar]
- (144).Nina M; Im W; Roux B Biophysical Chemistry 1999, 78, 89 – 96. [DOI] [PubMed] [Google Scholar]
- (145).Swanson JMJ; Adcock SA; McCammon JA J. Chem. Th. Comp 2005, 1, 484–493. [DOI] [PubMed] [Google Scholar]
- (146).Nicholls A; Mobley DL; Guthrie JP; Chodera JD; Bayly CI; Cooper MD; Pande VS J. Med. Chem 2008, 51, 769–779. [DOI] [PubMed] [Google Scholar]
- (147).Chen J. Journal of Chemical Theory and Computation 2010, 6, 2790–2803. [DOI] [PubMed] [Google Scholar]
- (148).Rankin KN; Sulea T; Purisima EO J. Comput. Chem 2003, 24, 954–962. [DOI] [PubMed] [Google Scholar]
- (149).Sanner MF; Olson AJ; Spehner JC Biopolymers 1996, 38, 305–320. [DOI] [PubMed] [Google Scholar]
- (150).Nguyen H; Pérez A; Bermeo S; Simmerling C J. Chem. Theory Comput 2015, 11, 3714–3728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (151).Vorontsov II; Miyashita O J. Comput. Chem 2011, 32, 1043–1053. [DOI] [PubMed] [Google Scholar]
- (152).Carra C; Saha J; Cucinotta FA Journal of Molecular Modeling 2012, 18, 3035–3049. [DOI] [PubMed] [Google Scholar]
- (153).Genheden S; Luchko T; Gusarov S; Kovalenko A; Ryde U J. Phys. Chem. B 2010, 114, 8505–8516. [DOI] [PubMed] [Google Scholar]
- (154).Delgado-Soler L; Pinto M; Tanaka-Gil K; Rubio-Martinez J J. Chem. Inf. Model 2012, 52, 2107–2118. [DOI] [PubMed] [Google Scholar]
- (155).Rastelli G; Rio AD; Degliesposti G; Sgobba M J. Comput. Chem 2010, 31, 797–810. [DOI] [PubMed] [Google Scholar]
- (156).Wright DW; Hall BA; Kenway OA; Jha S; Coveney PV J. Chem. Theory Comput 2014, 10, 1228–1241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (157).Mikulskis P; Genheden S; Rydberg P; Sandberg L; Olsen L; Ryde U J. Comput. Aided Mol. Des 2012, 26, 527–541. [DOI] [PubMed] [Google Scholar]
- (158).Suenaga A; Okimoto N; Hirano Y; Fukui K PLOS ONE 2012, 7, 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (159).Panel N; Sun YJ; Fuentes EJ; Simonson T Frontiers in Molecular Biosciences 2017, 4, 65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (160).Genheden S. J. Computer-Aided Mol. Des 2011, 25, 1085–1093. [DOI] [PubMed] [Google Scholar]
- (161).Ren X; Tortorella M; Wang J; Wang C Sci.Rep 2017, 7, 14934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (162).Ylilauri M; PentikÃd’inen OT J. Chem. Inf. Model 2013, 53, 2626–2633. [DOI] [PubMed] [Google Scholar]
- (163).Wittayanarakul K; Hannongbua S; Feig M J. Comput. Chem 2008, 29, 673–685. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
