Abstract
Introduction: The molecular mechanics energies combined with the Poisson–Boltzmann or generalized Born and surface area continuum solvation (MM/PBSA and MM/GBSA) methods are popular approaches to estimate the free energy of the binding of small ligands to biological macromolecules. They are typically based on molecular dynamics simulations of the receptor–ligand complex and are therefore intermediate in both accuracy and computational effort between empirical scoring and strict alchemical perturbation methods. They have been applied to a large number of systems with varying success.
Areas covered: The authors review the use of MM/PBSA and MM/GBSA methods to calculate ligand-binding affinities, with an emphasis on calibration, testing and validation, as well as attempts to improve the methods, rather than on specific applications.
Expert opinion: MM/PBSA and MM/GBSA are attractive approaches owing to their modular nature and that they do not require calculations on a training set. They have been used successfully to reproduce and rationalize experimental findings and to improve the results of virtual screening and docking. However, they contain several crude and questionable approximations, for example, the lack of conformational entropy and information about the number and free energy of water molecules in the binding site. Moreover, there are many variants of the method and their performance varies strongly with the tested system. Likewise, most attempts to ameliorate the methods with more accurate approaches, for example, quantum-mechanical calculations, polarizable force fields or improved solvation have deteriorated the results.
Keywords: drug design, electrostatics, entropy, free energy perturbation, linear interaction energy, non-polar solvation, solvation
1. Introduction
A goal of structure-based drug design is to find a new pharmaceutical compound that binds to a macromolecular receptor. In essence, the binding can be described with the chemical reaction:
1 |
where L denotes the ligand and R the receptor (which typically is a protein or another biomacromolecule). The binding strength is determined by the binding free energy, ΔG bind. Traditionally, design campaigns are carried out experimentally, a process that is both time-consuming and costly. Therefore, computational methods have been developed with the aim of reducing the time and cost to design new drugs. The choice of method depends on at which stage in the campaign it will be employed and there is typically a trade-off between accuracy and efficiency.
The most widely used computational methods in drug design are docking and scoring [1], whereby the binding mode of the drug is predicted, followed by an estimate of the binding affinity. These methods are efficient but not particularly accurate; they can be used to predict binding modes and discriminate between binders and non-binders, but they can typically not discriminate between drugs that differ by less than one order of magnitude in affinity, that is, by < 6 kJ/mol in ΔG bind.
At the other end of the spectrum there are the alchemical perturbation (AP) methods that are derived from statistical mechanics and thus are in principle very accurate [2]. However, they are based on Monte Carlo or molecular dynamics (MD) simulations and require extensive sampling of the complex and the free ligand in solution, as well as of unphysical intermediate states, which render them computationally intensive. Therefore, they have seldom been used in drug design [3].
Between these extremes there is a group of methods with intermediate performance. They are also based on sampling but only of the end states, that is, the complex and possibly the free receptor and ligand. These methods are therefore called end point methods. They were devised to be more inexpensive than AP, but more accurate than the scoring functions.
The most basic approximate method is the linear-response approximation (LRA) [4], in which the electrostatic free energy change is estimated from
2 |
where is the electrostatic interaction energy between the ligand and the surroundings (the receptor or water), the brackets indicate ensemble averages and their subscripts indicate from which simulation the average is computed. PL and L are standard simulations of the complex and the ligand, respectively, whereas PL′ and L′ indicate simulations in which the charges of the ligand have been zeroed. This method has been used to estimate solvation free energies, but not binding free energies because it lacks a non-polar part. However, it forms the basis of the linear interaction energy (LIE) method developed by Åqvist et al. [5,6]. In this method, only the complex and the free ligand is simulated. In addition, a non-polar term is added, formed in an analogous way:
3 |
where is the van der Waals interaction between the ligand and the surroundings, and α and β are empirical parameters. Warshel et al. have devised a related method called PDLD/s-LRA/β (semi-macroscopic protein-dipoles Langevin-dipoles method within a LRA) [4], in which the polar part is LRA and the non-polar part is borrowed from LIE.
The arguably most popular end point method is the topic of this review: MM/PBSA (molecular mechanics [MM] with Poisson–Boltzmann [PB] and surface area solvation) [7]. In this method, ΔG bind is estimated from the free energies of the reactants and product of the reaction in Equation 1:
4 |
This method was developed by Kollman et al. in the late 90s [8] and has enjoyed a high popularity ever since, with 100 – 200 publications each of the latest 5 years, as can be seen in Figure 1. The method has been used in a range of settings, including protein design [9], protein–protein interactions [10,11], conformer stability [12,13] and re-scoring [14,15]. In this paper, we present a critical review of the method and its ability to predict ligand-binding affinities. In particular, we discuss the precision and accuracy of the results and whether it is possible to improve the method by using more accurate approaches for each of the terms in the method, whereas specific applications are better covered in previous reviews [7,16-18]. Several methods with similarity to MM/PBSA have been suggested both earlier and later [19-21], but none of them has found any wide application.
2. The MM/PBSA approach
Here, we will describe the MM/PBSA approach as it was originally defined by Kollman et al. [7,8]. Since then, the method has been developed and modified [16-18] and we will discuss many variants in the coming sections. Unfortunately, there is still no consensus on the details of the method, probably because the performance varies depending on what system it is applied to.
In MM/PBSA, the free energy of a state, that is, P, L or PL in Equation 4, is estimated from the following sum [7,8]:
5 |
where the first three terms are standard MM energy terms from bonded (bond, angle and dihedral), electrostatic and van der Waals interactions. G pol and G np are the polar and non-polar contributions to the solvation free energies. G pol is typically obtained by solving the PB equation or by using the generalized Born (GB) model (giving the MM/GBSA approach), whereas the non-polar term is estimated from a linear relation to the solvent accessible surface area (SASA). The last term in Equation 5 is the absolute temperature, T, multiplied by the entropy, S, estimated by a normal-mode analysis of the vibrational frequencies. Gohlke and Case have pointed out that a correction owing to the translational and rotational enthalpy (3 RT) was missing in the original formulation and that the translational entropy should be calculated with a 1 M concentration standard state to be comparable to experimental estimates [11].
Strictly, the averages in Equation 4 should be estimated from three separate simulations [22], that is, from simulations of the complex, the free ligand and the unbound receptor, thereby obtaining
6 |
where the subscripts indicate from which simulation the average is computed. We will denote this approach three-average MM/PBSA (3A-MM/PBSA).
However, it is much more common to only simulate the complex and create the ensemble average of the free receptor and ligand by simply removing the appropriate atoms, thereby obtaining
7 |
which we will denote one-average MM/PBSA (1A-MM/PBSA). This requires fewer simulations, improves precision and leads to a cancellation of E bnd in Equation 5. On the other hand, it ignores the change in structure of the ligand and the receptor upon ligand binding, which may be important factors for the affinity [22]. We compared the 3A- and 1A-MM/PBSA approaches for the avidin and factor Xa (fXa) proteins, but unfortunately the performance depended on the test system and solvation model [23]. However, the standard error was four to five times larger with the 3A-MM/PBSA approach. In another study of the ferritin protein [24], the large uncertainty of the 3A approach (19 – 21 kJ/mol) rendered the results practically useless. Pearlman drew a similar conclusion for p38 MAP kinase [25]. In practice, the 1A approach often gives more accurate results than the 3A approach [26,27]. Swanson et al. suggested a 2A approach with sampling also of the free ligand, which would include the ligand reorganization energy [22]. Yang et al. have made a similar argument and have shown that it can improve the results [28].
Typically, the simulations used to estimate the ensemble averages in Equations 6 and 7 employ explicit solvent models. All solvent molecules are then removed from each snapshot, because the implicit PBSA or GBSA solvent models are used to estimate the solvation energies. However, this leads to an inconsistency in the method because the simulations and energy calculations do not use the same energy function, requiring reweighting of the energies. This could be circumvented by simulating with the same implicit solvent model used when evaluating the free energy. However, the accuracy of the implicit solvent model is uncertain and sometimes leads to dissociation of the ligand or protein subunits [29]. We have illustrated this problem by performing both implicit and explicit solvent simulations of five galectin-3 complexes [30]: Ensembles generated by the different solvent models are rather different with root-mean-square deviations of 1.2 – 1.4 Å and the quality of the predicted affinities changes significantly. It was possible to convert one ensemble into the other by performing simulations with the new method, but the convergence was slow.
It has often been suggested that the MM/PBSA calculation can be based on single minimized structures, rather than on a large number of MD snapshots. Naturally, this will save much computational effort, but it also ignores dynamical effects, making the results strongly dependent on the starting structure, and all information about the statistical precision of the method is lost. Yet in practice, minimized structures often give as good or better results as MD simulations [15,31-33], although some studies have emphasized the importance of MD sampling [14]. We have tested this approach by starting the minimization from different MD snapshots and shown that it often gives results similar to those obtained with MD, but sometimes unrealistic structures need to be filtered away [29]. It has been argued that more time can be saved by performing the minimizations in a GB continuum solvent [32]. Hou et al. found that the MM/GBSA results varied with the length of the MD simulation, but that there is no gain of using simulations longer than 4 ns [33]. Finally, it has been suggested that the results can be improved by using NMR ensembles, rather than MD simulations [34].
The MM/PBSA approach was originally developed for the AMBER software [7,8] and several automated versions have been presented [35,36]. Recently, automatic scripts have also been presented for the freely available GROMACS, NAMD and APBS software [37].
3. The precision
Early applications of MM/PBSA used relatively few snapshots (∼ 20) from the MD simulations. A typical result is shown in Table 1. It can be seen that for charged ligands (Btn1 – Btn3), the MM/PBSA energy is dominated by the E el and G pol terms, but these two terms often nearly cancel, illustrating the screening effect of the solvent. Consequently, the E vdW term often dominate the net binding free energy, although the entropy term is also sizeable. On the other hand, the G np term is essentially always small and similar for all ligands.
Table 1.
Term | Btn1 | Btn2 | Btn3 | Btn4 | Btn5 | Btn6 | Btn7 | SD1 | SD7 |
---|---|---|---|---|---|---|---|---|---|
Eel | –1224 | –1295 | –1287 | –174 | –83 | –50 | –109 | 46 | 21 |
EvdW | –148 | –149 | –132 | –200 | –128 | –128 | –49 | 16 | 11 |
Gpol | 1224 | 1321 | 1259 | 266 | 146 | 123 | 124 | 30 | 13 |
Gnp | –17 | –17 | –17 | –21 | –16 | –16 | –11 | 0 | 0 |
TS | –81 | –96 | –70 | –82 | –67 | –66 | –28 | 46 | 57 |
Eel + Gpol | 0 | 26 | –27 | 92 | 63 | 72 | 15 | 22 | 20 |
ΔGbind | –187 | –145 | –222 | –114 | –49 | –34 | –53 | 47 | 62 |
The two last columns (SD1 and SD7) show the standard deviation of the various terms for Btn1 and Btn7. Ligands Btn1–Btn3 have a net charge of –1, whereas the other four ligands are neutral.
Adapted with permission from [29] (the 03oh/03 calculation with PB solvation). Copyright (2006) American Chemical Society.
A major problem with MM/PBSA is the poor precision. For example, the standard deviation of ΔG bind over the 20 snapshots in Table 1 (SD1 and SD7 columns) is 47 and 62 kJ/mol, giving a standard error of the mean of 11 and 14 kJ/mol (i.e., the standard deviation divided by the square root of the number of snapshots). Such a poor precision makes the method useless when comparing ligands with similar affinities or when comparing results obtained with different approaches or by different groups. Therefore, we made a thorough investigation how statistically converged MM/GBSA results can be obtained [38]. We showed that it is more effective to run many short independent simulations than a single long, which has also been observed in other studies [39-41] (a single long simulation will underestimate the uncertainty in the result). Moreover, we showed that snapshots taken after 1 – 10 ps are statistically independent and that an equilibration of 100 ps and a production simulation of 100 – 200 ps are appropriate, although later studies have indicated that occasionally longer equilibrations are needed [42]. Once these measures have been determined, any desired precision can be obtained by running a proper number of independent simulations, because the standard error decreases with the square root of the number of independent samples. For avidin, 20 – 50 simulations were needed to give a standard error of 1 kJ/mol for the seven biotin analogues (6 – 15 ns simulations for each ligand in total).
We have also discussed how the independent simulations should be generated [42]. The standard way is to use different starting velocities for the MD simulations. However, the variation can be enhanced by taking advantage of the arbitrary choices made when setting up the MM/PBSA calculations, in particular the solvation of the receptor and the selection of alternative conformations in the starting crystal structure. The uncertainty in the protonation and conformation of various groups can also be employed with some care. The good message of our results is that the MM/GBSA results are insensitive to all these choices, unless the affected residue is very close to the ligand. This shows that MM/GBSA is stable and reproducible and that calculations set up by different groups and procedures are likely to give similar results, in spite of the many more or less arbitrary choices made during the setup.
4. The polar solvation term
The polar solvation term (G pol in Equation 5) was originally obtained by solving the PB equation numerically [7,8]. However, it can in principle be calculated with any other continuum-solvation method and it is now more common that it is calculated by the GB approach (giving a MM/GBSA method), because such calculations are faster and pair-wise decomposable. Many studies have compared the performance of MM/PBSA and MM/GBSA, indicating that the MM/PBSA results are better [29,33], worse [14,43-45] or of a similar quality [15,32,46,47], depending on the studied system.
Gohlke and Case have made a detailed study on how the results depend on the polar solvation energy [11]. They find that the PB results strongly depend on the radii employed and that different variants of GB give widely different results. We have also tested several different methods [48]. As can be seen in Figure 2, there was a major variation in the absolute ΔG bind obtained with the various methods, up to 210 kJ/mol, and even the relative affinities varied by up to 85 kJ/mol. The mean absolute deviation (MAD) between the calculated and experimental ΔG bind varied from 10 to 43 kJ/mol (after removal of the systematic error) and the correlation coefficient (r 2) varied from 0.59 to 0.93. We also tested two variants of the three-dimensional reference site interaction model (3D-RISM), which is expected to give more accurate solvation energies, but it gave intermediate results (r 2 = 0.80 – 0.90 and MAD = 16 – 17 kJ/mol).
It has also been suggested that special methods can be used for the ligand, for example, a LIE-like solvation term [49]. We have compared the performance of 24 different continuum-solvation methods [50]. For small organic molecules, the quantum mechanics (QM)-based polarized continuum model (PCM) gave the most accurate results, but the performance deteriorates as the size and polarity of the molecule increase and if we compare only relative solvation energies of homologous ligands with the same net charge, essentially all methods (PCM, PB and GB) give solvation energies that agree within 2 – 5 kJ/mol. This indicates that there is not much to gain from using more accurate solvation methods for the ligand.
5. The non-polar solvation term
The polar solvation energy represents the electrostatic interaction between the solute and the continuum solvent. However, three additional terms are needed to reproduce experimental solvation energies, viz. cavitation, dispersion and repulsion energies, representing the cost of making a cavity in the solvent, as well as the attractive and repulsive parts of the van der Waals interactions between the solute and the solvent. Strictly, all these three non-polar solvation terms are free energies and in particular the cavitation energy should have important entropic components, representing the reorganization of the solvent around the solute [16]. However, in standard MM/PBSA, the non-polar solvation energy is represented by a linear relation to the SASA [7].
This term has been little discussed, although the parameters vary quite extensively, 0.01 – 0.24 kJ/mol/Å2 for the surface tension and 0 – 4.2 kJ/mol for the offset [8,9,16,17,29,44]. The reason for this is that the term is typically small and shows only a minor variation among similar ligands (Table 1), thereby having insignificant influence on the results. This is alarming, as this term should model the hydrophobic effect, which normally is assumed to be among the most important terms for ligand binding [1]. It has sometimes been argued that the SASA model should be completed with the solvent–solute van der Waals interaction [11,16] or that only the attractive part of the van der Waals interaction should be included [51]. Solvent-accessible volume terms have also been tested, but without changing the results significantly [37].
We noted that the non-polar solvation energy calculated by SASA is three to eight times smaller and of the opposite sign to the same energy calculated by the more accurate PCM (which includes separate terms for cavitation, dispersion and repulsion energies) or 3D-RISM methods [48,52]. By calibration against AP calculations with explicit water for the binding of benzene to a hidden cavity in T4 lysozyme, we showed that actually both methods give the wrong results, because they assume that the cavity is filled with water before the ligand binds, contrary to experimental observations [53]. If this problem is corrected by filling the un-bound cavity with a non-interacting ligand, PCM actually gives a more accurate result than SASA. Unfortunately, for more water-exposed binding sites, both SASA and PCM fail strongly (by up to 32 and 78 kJ/mol, respectively) to reproduce the AP results and it seems to be hard to find any correction that works for binding sites of different solvent accessibility [54]. The reason for this is that the continuum methods do not contain any information about how many water molecules there are in the binding site and their entropy before and after the ligand binding. This may lead to differences in relative ΔG bind of up to 75 kJ/mol when calculated with different approaches.
Many attempts have been made to explicitly include the effect of binding-site water molecules in MM/PBSA. Several groups have treated a number of water molecules as part of the receptor, often giving improved results [24,26,27,55], although the performance may depend on the number of explicit water molecules [56]. However, in a large test of 855 ligand complexes, inclusion of water molecules within 3.5 Å of the ligand made the results worse [57]. Attempts have also been made to combine MM/GBSA with estimates of the free-energy gain of displacing binding-site water molecules from the WaterMap approach with varying results [58-60]. We have tried to estimate both these effects in a consistent way within the MM/GBSA approach (by estimating the binding energy also of water molecules with MM/GBSA) with reasonable results [24].
6. The electrostatics term
E el in Equation 5 is normally calculated using Coulomb’s law with atomic charges taken from the MM force field. Consequently, the results depend on the charges used for the receptor and the ligand. Several groups have studied how the results depend on the force field. Hou et al. tested different Amber force fields and obtained the best results with the 03 or 99 force fields, depending on the length of the MD simulations [33]. They also found that HF/6-31G* restrained electrostatic potential (RESP) charges for the ligand gave the best results, although AM1-BCC and ESP charges also gave satisfactory predictions. On the other hand, Oehme et al. obtained the best results with the simple Gasteiger and HF/STO-3G charges (which are small in magnitude), as well as the more sophisticated B3LYP/cc-pVTZ charges [44].
We have compared the results obtained with the Amber 94, 99 and 03 force fields, but did not find any significant differences [29]. We also calculated QM charges (HF/6-31G*) for all atoms in the protein and the ligand for the conformation observed in each snapshot of the MD simulations, but this did not improve the results. However, interaction energies calculated with such conformation-dependent charges strongly differ from those calculated by standard charges (by 40 kJ/mol on average for charged ligands), although the effect is partly cancelled by the solvation energy [61]. Moreover, if these charges were used for MD simulations, the calculations crashed with numerical problems. These problems could be avoided by averaging the QM charges over the snapshots or over all occurrences of the same amino acid in different proteins [61,62]. Such extensively averaged electrostatic potential (xAvESP) charges provide an alternative to standard RESP charges and actually gave slightly improved calculated ΔG bind for two proteins. However, in another study, no significant differences between ΔG bind calculated with AM1-BCC, RESP and xAvESP charges were observed [47].
In most force fields for proteins, polarization between different atoms is ignored, although it often constitutes 10 – 30% of the total electrostatic interaction energy [63]. We have tried to use a polarizable force field in MM/PBSA calculations, which increased the computational effort by a factor of ∼ 3, but no improvement in the results was observed [29]. We have also tested to use a very accurate force field consisting of multipoles up to quadrupoles and anisotropic polarizabilities, calculated for all atoms and bond centers in the actual conformation in all snapshots, but still with no improvement in ΔG bind compared with experiments [52]. Ren et al. have combined the polarizable Amoeba force field with MM/PBSA, obtaining reasonable results [64].
In the original MM/PBSA approach, E el was calculated with a dielectric constant of unity (ϵ = 1) [7,8]. However, it has frequently been suggested that the results may be improved by using a larger ϵ [16-18,65], which would also implicitly model some of the dielectric relaxation and dynamics of the ligand in the protein [66]. We have estimated the optimum value of the dielectric constant, obtaining varying results (ϵ = 1 – 25) depending on the solvation model and the protein [23]. Such varying results have also been observed in other studies and it has been suggested that the optimum value of ϵ depends on the characteristics of the binding site (a highly charged binding site requires a higher ϵ than a hydrophobic site) [17,67], but often the results are best for ϵ = 2 – 4 [14,15,52], especially in larger studies of several proteins [43,46]. Alternatively, it has been suggested that ϵ should vary with the protein residues [68].
7. The entropy term
In the original MM/PBSA approach [7,8], the entropy is calculated by removing all water molecules and residues > 8 Å from the ligand and then minimizing the remainder using a distance-dependent dielectric function. Vibrational frequencies are then calculated, which are used to obtain entropies by a normal-mode analysis using a rigid-rotor harmonic-oscillator ideal-gas approximation. A problem with this approach is that a third energy function is used (in addition to those used for the MD simulations and the calculations of the other energy terms) and that the receptor may relax in an uncontrolled way after the truncation of the protein. As a consequence, the entropy term typically gives the largest statistical uncertainty, as can be seen in Table 1.
Therefore, we suggested a simple modification of the protocol [69]: We included all residues and water molecules within 12 Å of the ligand in the minimization, but fixed the water molecules and residues between 8 and 12 Å in the minimization and ignored their contributions to the entropy. Thereby, we could use ϵ = 1 and avoid any large change in the structure. This led to a reduction of the uncertainty of the entropy term by a factor of 3, so that it no longer limited the precision. We have also shown that the calculated entropy depends little on the truncation radius and other parameters of the method and that 8 Å is a proper radius [70]. Another possibility to improve the precision of the entropy is to start the minimization of the free receptor and the free ligand from the minimized structure of the complex [71] or to perform the minimization in a GB solvent [72].
It is clear that the normal-mode entropies employed in MM/PBSA omit important contributions to the true ligand-binding entropy. In particular, it considers only the local stiffness of the actual binding conformation and does not include any information whether the receptor or the ligand may have many different conformations or whether the number of conformations changes upon binding. Likewise, the single-average approach omits possible changes in the conformation of the ligand or the receptor upon binding. In principle, it is possible to study different conformations of the receptor and the ligand in MD simulations and to calculate entropies from these, for example, from the dihedral distributions. Unfortunately, it is virtually impossible to obtain converged entropies from such simulations [73].
Several attempts have been made to replace the normal-model entropy in MM/PBSA with entropies calculated with other methods. For example, an entropic penalty has been assigned to rotable bonds in the ligand and in protein [74]. Other investigators have tested to calculate the entropy by a quasi-harmonic analysis of the MD trajectories [11,75], which includes the conformational part of the entropy, although not in a fully correct way [76], but also this approach has severe convergence problems [11]. A simpler approach based on the solvent-accessible and buried surface area has also been suggested and has been shown to perform as good as the normal-mode approach [45].
The entropy calculations typically dominate the computational cost of the MM/PBSA estimates. Therefore, it is sometimes calculated only for a subset of the snapshots [7,8,11]. It has also repeatedly been suggested that this term can be omitted and that it does not improve the results in large tests [26,43]. Consequently, this term is ignored in a large portion of the published MM/PBSA studies [16-18]. However, it is clear that for the absolute affinities, the entropy term is needed, owing to the loss of translational and rotational freedom when the ligand binds. Some studies have also indicated that the entropy term improves the results [19,44] or that it is enough to calculate the entropy of the ligand [77].
8. Replacing MM with QM
In contrast to the other terms in Equation 7, we are not aware of any attempts to modify only the E vdW term in MM/PBSA. However, it is well known that the MM energy model has several shortcomings and that a new force field (at least the charges) needs to be created for each new ligand studied [16]. Therefore, there has been a great interest to instead employ a quantum mechanical energy function [78].
Merz et al. pioneered this field by replacing the MM energies with a solvated semi-empirical QM (SQM) estimate, supplemented by a MM dispersion term and a simple entropy estimate [74]. Initially, they studied single snapshots, obtaining encouraging results for 165 protein–ligand complexes (r 2 = 0.55). This approach was subsequently extended to post-process MD snapshots of β-lactamase complexes, but the large standard deviation made the results of little use [79]. Recently, a similar approach was proposed, which gave better results than standard MM/PBSA on the Lck SH2 protein [80]. We performed a more systematic investigation of three SQM methods [81]. The results varied with the protein–ligand system, but it was clear that a dispersion correction is necessary to reproduce experimental affinities. On average, AM1 gave the best results, comparable to standard MM/GBSA, at a cost that was 1.3 – 1.6 times larger.
We also were the first to estimate binding affinities with a high-level QM method with large basis set for which dispersion and polarization are treated properly [52]. The calculations were done within an MM/PBSA framework, but the MM part was replaced with fragment MP2/cc-pVTZ QM calculations for all chemical groups within 4.5 Å of the ligand, treating many-body and long-range effects by a highly accurate force field that includes a multipole expansion up to quadrupoles and anisotropic polarizabilities at all atoms and bond centers [63], including PCM solvation. Unfortunately, the results were quite poor with r 2 = 0.20 – 0.52 depending on the G np term. The results also showed that three-body effects (mainly the coupling between polarization and repulsion) absent in any standard MM representation (also polarizable) affect receptor–ligand interactions by 2 – 10 kJ/mol [63].
Furthermore, various QM/MM-PBSA approaches have been developed, in which the ligand is treated with QM and the rest with MM [82-84]. We used this approach to estimate binding of ligands to cathepsin B, but found that QM/MM-PBSA energies (r 2 = 0.59) were worse than gas-phase QM energies (r 2 = 0.80) [85]. However more recently, accurate QM/MM-PBSA results have been obtained for cytochrome P450 [86], showing that performance depends on the system.
Skylaris et al. have recently applied linear-scaling density functional theory (DFT) calculations to ligand binding, treating the whole protein–ligand complex [87,88]. They obtained a MAD of 9 kJ/mol, which is slightly better than the MM/PBSA result, but the ranking was poor. Other groups have used similar methods on rather large host–guest systems [89,90].
The use of QM calculations exaggerates the inconsistency between the energy functions used for sampling and post-processing. Some studies try to solve this by performing a QM minimization before the energy evaluation [79,80,91], but our study indicates this hardly corrects the ensemble mismatch [30]. A better solution would be to do the MD simulation also at the QM level as has been done at the SQM level [92].
9. Comparison with other methods
It is of great interest to compare the performance of MM/PBSA and other alternative methods. LIE and MM/PBSA have been compared several times, but the results depend on the tested system [47,56,93-98]. MM/PBSA often overestimates differences in binding affinities, giving a favorable r 2, but a poor MAD [99]. We have instead compared the efficiency and precision of LIE and MM/PBSA, showing that LIE is two to seven times more efficient than MM/PBSA, owing to the time-consuming entropy estimate [100].
Schutz and Warshel compared the PDLD/s-LRA/β, LIE and MM/PBSA methods, concluding that the former was most accurate [66]. However, they only compared with selected literature values for MM/PBSA, which may introduce biases from differences in the force field, simulation set-up and continuum model. Therefore, we performed a study where all such differences were removed for two proteins [23]. Although LRA is theoretically the more strict approach, our results indicated that standard MM/PBSA is more accurate, more precise and also less computationally demanding. For avidin, LIE outperformed MM/PBSA for the non-polar contributions, but this was not the case for fXa.
It is also of interest to compare MM/PBSA with the more strict AP methods. We performed such a comparison for nine inhibitors of fXa [99]. It was found that the two methods gave similar MAD, but AP gave a better r 2 compared with experiments. Interestingly, we also found that an optimized AP approach actually was more efficient than MM/PBSA contrary to common belief [17]. This was mainly due to that MM/PBSA is inherently imprecise, so that many independent simulations are needed to give a precision similar to that of AP. On the other hand, the optimization of the AP approach has been shown to be system-dependent [101]. Other studies have mainly focused on the accuracy and have found that MM/PBSA is comparable [102,103], inferior [25,104,105] and better [106] than AP, depending on the system.
10. Conclusions
We have presented an overview of the MM/PBSA method to predict ligand-binding affinities. The method contains six energy terms that can be individually tested and improved. In separate sections, we have discussed most of these terms.
The electrostatic term depends on the charges used for the receptor and the ligand, and several attempts have been made to improve these by polarizable potentials, multipole expansions or QM calculations, but with little improvement in ΔG bind. It also depends on the dielectric constant used for the protein and there are some indications that the results can be improved by using ϵ = 2 – 4, especially for polar and charged binding sites.
The polar solvation term often partly cancels E el. It was originally calculated by PB, but today most studies are performed with GB solvation, which is faster and often gives a better accuracy. However, the results are very sensitive to details in the calculations and different GB methods give widely different results. The non-polar energy has been less investigated, but recent studies indicate that any continuum estimate may be inaccurate owing to lack of information about water molecules in the binding site.
The normal-mode entropy is also problematic, because it is expensive to calculate and it lacks information of the conformational entropy. Unfortunately, alternative methods do not give converged results. Therefore, the term is often omitted.
Many approaches have been suggested to replace the MM force field with QM calculations for the ligand or for the whole complex, for example, by using SQM, DFT or ab initio methods. So far, the results have been varying and the inconsistency between energy function used for simulations and energy calculations is a potential problem.
Traditionally, MM/PBSA energies are calculated for snapshots obtained by MD simulations. Unfortunately, there is a very large variation in these energies, giving major problems with the precision. These are normally solved by calculating only interaction energies (Equation 7), studying many snapshots and using several independent simulations. It has frequently suggested that the calculations can be sped up by using only minimized structure, but such results may depend strongly on the starting structures.
Finally, MM/PBSA has been compared with other ligand-binding methods, with varying results. Typically, the accuracy is better than for docking and scoring methods, worse than for AP methods and comparable to other end point methods, such as LIE and LRA.
11. Expert opinion
MM/PBSA is a popular method to calculate absolute binding affinities with a modest computational effort. Originally, it was presented as a method that does not contain any adjustable parameters and consists of six well-defined terms. However, since then many variants have been suggested and the user now have to make many choices, for example, concerning the dielectric constant, parameters for the non-polar energy, the radii used for the PB or GB calculations, whether to include the entropy term and whether to perform MD simulations or minimizations. In practice, it typically gives results of intermediate quality, often better than docking and scoring, but worse than FEP, for example, r 2 = 0.3 for the whole PDB bind database, but r 2 = 0.0 – 0.8 for individual proteins [46]. The ranking of ligands is often unsatisfactory if their affinities differ by < 12 kJ/mol [18]. As for most ligand-binding approaches, the results depend critically on the tested receptor–ligand system, but it allows for larger variation in the ligand scaffold than AP methods and is less sensitive to changes in the net charge.
Unfortunately, the results strongly depend on the continuum solvation employed, making the absolute affinities unreliable or the method empirical (the solvation method can be selected to give accurate ΔG bind). The dielectric constant and whether to include the entropy term has also become parameters that can be varied to improve the results.
Undoubtedly, the method involves severe thermodynamic approximations [11,22,54], for example, ignoring structural changes in the receptor and ligand upon binding and conformational entropy, and lacking information about the number and entropy of water molecules in the binding site before and after ligand binding. Moreover, no attempt to improve the approach by using more accurate methods, for example, involving QM calculations, 3D-RISM solvation, polarizable force fields or more detailed non-polar energies, has given any consistent improvement. Instead, the best results are often obtained with methods that give numerically small energies.
MM/PBSA has severe convergence problems, requiring many independent simulations to yield a useful precision (thereby reducing its advantage compared with computer intensive methods like AP). All results (including quality measures like r 2 or MAD) should be accompanied by an estimate of the statistical precision and any comparison of ligands or methods must take this uncertainty into account in a statistically valid way. MM/PBSA typically gives too large energies, thereby often giving a good r 2 but a poor MAD (often worse than the null hypothesis of the same affinity for all ligands).
In conclusion, MM/PBSA may be useful for post-processing of docked structures or to rationalize observed differences. However, it is not accurate enough for predictive drug design or as a basis to develop more accurate methods.
Article highlights.
The MM/PBSA and MM/GBSA (molecular mechanics energies combined with the Poisson–Boltzmann or generalized Born and surface area continuum solvation) methods have been used to estimate ligand-binding affinities in many systems, giving correlation coefficients compared with experiments of r 2 = 0.0 – 0.9, depending on the protein.
The results strongly depend on details in the method, especially the continuum-solvation method, the charges, the dielectric constant, the sampling method and the entropies. The methods often overestimate differences between sets of ligands.
Many attempts have been made to improve the results, for example, by using various tastes of quantum mechanics (QM) methods, QM charges, QM solvation, three-dimensional reference site interaction model solvation, polarizable force field, multipole expansions or more detailed estimates of the non-polar energies. However, this has not led to any consistently improved results.
The methods involve several severe approximations, for example, a questionable entropy, lacking the conformational contribution and missing effects from binding-site water molecules.
Consequently, the methods may be useful to improve the results of docking and virtual screening or to understand observed affinities and trends. However, they are not accurate enough for later states of predictive drug design.
This box summarizes key points contained in the article.
Declaration of interest
U Ryde is supported by a grant from the Swedish research council (project 2010-5025) and the Swedish National Infrastructure for Computing (SNIC), while S Genheden is supported by a grant from the Wenner-Gren Foundations. The computations were performed on computer resources provided by SNIC at Lunarc Lund University, HPC2N at Umeå University, and C3S3 at Chalmers Technical University. The authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.
Bibliography
Papers of special note have been highlighted as either of interest (•) or of considerable interest (••) to readers.
- • .Gohlke H, Klebe G. Approaches to the description and prediction of the binding affinity of small-molecule ligands to macromolecular receptors. Ang Chem Int Ed. 2002;41:2644–76. doi: 10.1002/1521-3773(20020802)41:15<2644::AID-ANIE2644>3.0.CO;2-O. [DOI] [PubMed] [Google Scholar]; • Good review on computational approaches to ligand binding.
- Shirts MR, Mobley DL, Chodera JD. Alchemical free energy calculations: ready for prime time? Ann Rep Comput Chem. 2007;3:41–59. [Google Scholar]
- Homeyer N, Stoll F, Hillisch A, et al. Binding free energy calculations for lead optimization: assessment of their accuracy in an industrial drug design context. J Chem Theory Comput. 2014;10:3331–44. doi: 10.1021/ct5000296. [DOI] [PubMed] [Google Scholar]
- Sham YY, Chu ZT, Tao H, et al. Examining methods for calculations of binding free energies: LRA, LIE, PDLD-LRA, and PDLD/S-LRA calculations of ligands binding to an HIV protease. Proteins. 2000;39:393–407. [PubMed] [Google Scholar]
- Åqvist J, Medina C, Samuelsson JE. A new method for predicting binding affinity in computer-aided drug design. Proteins Eng. 1994;7:385–91. doi: 10.1093/protein/7.3.385. [DOI] [PubMed] [Google Scholar]
- Hansson T, Marelius J, Åqvist J. Ligand binding affinity prediction by linear interaction energy methods. J Comput-Aided Mol Des. 1998;12:27–35. doi: 10.1023/a:1007930623000. [DOI] [PubMed] [Google Scholar]
- Kollman PA, Massova I, Reyes C, et al. Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. Acc Chem Res. 2000;33:889–97. doi: 10.1021/ar000033j. [DOI] [PubMed] [Google Scholar]
- • .Srinivasan J, Cheatham TE, Cieplak P, et al. Continuum solvent studies of the stability of DNA, RNA, and phosphoramidate–DNA helices. J Am Chem Soc. 1998;120:9401–9. [Google Scholar]; • The first MM/PBSA study.
- Sirin S, Kumar R, Martinez C, et al. A computational approach to enzyme design: predicting ω-aminotransferase catalytic activity using docking and MM-GBSA scoring. J Chem Inf Model. 2014;54:2334–46. doi: 10.1021/ci5002185. [DOI] [PubMed] [Google Scholar]
- Moreira IS, Fernandes PA, Ramos MJ. Unravelling Hot Spots: a comprehensive computational mutagenesis study. Theor Chern Ace. 2006;117:99–113. [Google Scholar]
- • .Gohlke H, Case DA. Converging free energy estimates: MM-PB(GB)SA studies on the protein–protein complex Ras–Raf. J Comput Chem. 2004;25:238–50. doi: 10.1002/jcc.10379. [DOI] [PubMed] [Google Scholar]; • Thorough study of MM/PBSA and MM/GBSA for protein–protein interactions, including a comparison of several different variants.
- Reblova K, Strelcova Z, Kulhanek P, et al. An RNA molecular switch: intrinsic flexibility of 23S rRNA helices 40 and 68 5’-UAA/5’-GAN internal loops studied by molecular dynamics methods. J Chem Theory Comput. 2010;6:910–29. [PMC free article] [PubMed] [Google Scholar]
- Homeyer N, Gohlke H. Free energy calculations by the molecular mechanics Poisson−Boltzmann surface area method. Mol Inf. 2012;31:114–22. doi: 10.1002/minf.201100135. [DOI] [PubMed] [Google Scholar]
- Hou T, Wang J, Li YY, et al. Assessing the performance of the molecular mechanics/Poisson Boltzmann surface area and molecular mechanics/generalized Born surface area methods. II. The accuracy of ranking poses generated from docking. J Comp Chem. 2011;32:866–77. doi: 10.1002/jcc.21666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun H, Li Y, Shen M, et al. Assessing the performance of the MM/PBSA and MM/GBSA methods. 5. Improved docking performance using high solute dielectric constant MM/GBSA and MM/PBSA rescoring. Phys Chem Chem Phys. 2014;16:22035–45. doi: 10.1039/c4cp03179b. [DOI] [PubMed] [Google Scholar]
- • .Foloppe N, Hubbard R. Towards predictive ligand design with free-energy based computational methods. Curr Med Chem. 2006;13:3583–608. doi: 10.2174/092986706779026165. [DOI] [PubMed] [Google Scholar]; • Good review of the MM/PBSA method with a thorough discussion of the various energy terms.
- • .Wang J, Hou T, Xu X. Recent advances in free energy calculations with a combination of molecular mechanics and continuum models. Curr Comput-Aided Drug Design. 2006;2:95–103. [Google Scholar]; • Good review of the MM/PBSA method with a thorough discussion of applications 2002 – 2006.
- • .Homeyer N, Gohlke H. Free energy calculations by the molecular mechanics Poisson–Boltzmann surface area method. Mol Inf. 2012;31:114–22. doi: 10.1002/minf.201100135. [DOI] [PubMed] [Google Scholar]; • A thorough review of the MM/PBSA method, including recent applications.
- Checa A, Ortiz AR, Pascual-Teresa B, et al. Assessment of solvation effects on calculated binding affinity differences: trypsin inhibition by flavonoids as a model system for congeneric series. J Med Chem. 1997;40:4136–45. doi: 10.1021/jm970245v. [DOI] [PubMed] [Google Scholar]
- Schwarzl SM, Tschopp TB, Smith JC, et al. Can the calculation of ligand binding free energy be improved with continuum solvent electrostatics and an ideal-gas entropy correction. J Comput Chem. 2002;23:1143–9. doi: 10.1002/jcc.10112. [DOI] [PubMed] [Google Scholar]
- Zoete V, Michielin O, Karplus M. Protein–ligand binding free energy estimation using molecular mechanics and continuum electrostatics. Application to HIV-1 protease inhibitors. J Comput-Aided Mol Design. 2003;17:861–80. doi: 10.1023/b:jcam.0000021882.99270.4c. [DOI] [PubMed] [Google Scholar]
- •• .Swanson JM, Henchman RH, McCammon JA. Revisiting free energy calculations: a theoretical connection to MM/PBSA and direct calculation of the association free energy. Biophys J. 2004;86:67–74. doi: 10.1016/S0006-3495(04)74084-9. [DOI] [PMC free article] [PubMed] [Google Scholar]; •• Discusses the theoretical basis for the MM/PBSA method.
- • .Genheden S, Ryde U. Comparison of end-point continuum-solvation methods for the calculation of protein–ligand binding free energies. Protein. 2012;80:1326–42. doi: 10.1002/prot.24029. [DOI] [PubMed] [Google Scholar]; • Thorough comparison of several end point methods, such as MM/PBSA, LIE and PDLD/s-LRA.
- Mikulskis P, Genheden S, Ryde U. Effect of explicit water molecules on ligand-binding affinities calculated with the MM/GBSA approach. J Mol Model. 2014;20:2273. doi: 10.1007/s00894-014-2273-x. [DOI] [PubMed] [Google Scholar]
- Pearlman DA. Evaluating the molecular mechanics Poisson-Boltzmann surface area free energy method using a congeneric series of ligands to p38 MAP kinase. J Med Chem. 2005;48:7796–807. doi: 10.1021/jm050306m. [DOI] [PubMed] [Google Scholar]
- Spackova N Cheatham TE, Ryjacek F, et al. Molecular dynamics simulations and thermodynamics analysis of DNA–drug complexes. Minor groove binding between 4’,6-diamidino-2-phenylidole and DNA duplexes in solution. J Am Chem Soc. 2003;125:1759–69. doi: 10.1021/ja025660d. [DOI] [PubMed] [Google Scholar]
- Lepsik M, Kriz Z, Havlas Z. Efficiency of a second-generation HIV-1 protease inhibitor studied by molecular dynamics and absolute binding free energy calculations. Proteins. 2004;57:279–93. doi: 10.1002/prot.20192. [DOI] [PubMed] [Google Scholar]
- Yang CY, Sun H, Chen J, et al. Importance of ligand reorganization free energy in protein-ligand binding-affinity prediction. J Am Chem Soc. 2009;131:13709–21. doi: 10.1021/ja9039373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- • .Weis A, Katebzadeh K, Söderhjelm P, et al. Ligand affinities predicted with the MM/PBSA method: Dependence on the simulation method and the force field. J Med Chem. 2006;49:6596–606. doi: 10.1021/jm0608210. [DOI] [PubMed] [Google Scholar]; • A thorough comparison of the performance of different force fields, simulation protocols and charge sets for MM/PBSA, including QM charges for all atoms and polarizable force fields.
- Godschalk F, Genheden S, Söderhjelm P, et al. Comparison of MM/GBSA calculations based on explicit and implicit solvent simulations. Phys Chem Chem Phys. 2013;15:7731–9. doi: 10.1039/c3cp00116d. [DOI] [PubMed] [Google Scholar]
- Kuhn B, Gerber P, Schulz-Gasch T, et al. Validation and use of the MM-PBSA approach for drug discovery. J Med Chem. 2005;48:4040–8. doi: 10.1021/jm049081q. [DOI] [PubMed] [Google Scholar]
- Rastelli G, Del Rio A, Degliesposti G, et al. Fast and accurate predictions of binding free energies using MM-PBSA and MM-GBSA. J Comput Chem. 2010;31:797–810. doi: 10.1002/jcc.21372. [DOI] [PubMed] [Google Scholar]
- Xu L, Sun H, Li Y, et al. Assessing the Performance of MM/PBSA and MM/GBSA methods. 3. Impact of force fields and ligand charge models. J Phys Chem B. 2013;117:8408–21. doi: 10.1021/jp404160y. [DOI] [PubMed] [Google Scholar]
- Li Y, Liu Z, Wang R. Test MM-PB/SA on true conformational ensembles of protein–ligand complexes. J Chem Inf Model. 2010;50:1682–92. doi: 10.1021/ci100036a. [DOI] [PubMed] [Google Scholar]
- Miller BR, McGee TD, Swails JM, et al. MMPBSA.py: an efficient program for end-state free energy calculations. J Chem Theory Comput. 2012;8:3314–21. doi: 10.1021/ct300418h. [DOI] [PubMed] [Google Scholar]
- Homeyer N, Gohlke H. FEW: A workflow tool for free energy calculations of ligand binding. J Comput Chem. 2013;34:965–73. doi: 10.1002/jcc.23218. [DOI] [PubMed] [Google Scholar]
- Kumari R, Kumar R, Lynn A. G_mmpbsa – a GROMACS tool for high-throughput MM-PBSA calculations. J Chem Inf Model. 2014;54:1951–62. doi: 10.1021/ci500020m. [DOI] [PubMed] [Google Scholar]
- •• .Genheden S, Ryde U. How to obtain statistically converged MM/GBSA results. J Comput Chem. 2010;31:837–46. doi: 10.1002/jcc.21366. [DOI] [PubMed] [Google Scholar]; •• Discusses how to obtain high, controllable precision MM/GBSA.
- Sadiq SK, Wright DW, Kenway OA, et al. Accurate ensemble molecular dynamics binding free energy ranking of multidrug-resistant HIV-1 proteases. J Chem Inf Model. 2010;50:890–905. doi: 10.1021/ci100007w. [DOI] [PubMed] [Google Scholar]
- Adler M, Beroza P. Improved ligand binding energies derived from molecular dynamics: replicate sampling enhances the search of conformational space. J Chem Inf Model. 2013;53:2065–72. doi: 10.1021/ci400285z. [DOI] [PubMed] [Google Scholar]
- Wright DW, Hall BA, Kenway OA, et al. Computing clinically relevant binding free energies of HIV-1 protease inhibitors. J Chem Theory Comput. 2014;10:1228–41. doi: 10.1021/ct4007037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Genheden S, Ryde U. A comparison of different initialization protocols to obtain statistically independent molecular dynamics simulations. J Comput Chem. 2011;32:187–95. doi: 10.1002/jcc.21546. [DOI] [PubMed] [Google Scholar]
- Yang T, Wu JC, Yan C, et al. Virtual screening using molecular simulations. Proteins. 2011;79:1940–51. doi: 10.1002/prot.23018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oehme DP, Brownlee RT, Wilson DJ. Effect of atomic charge, solvation, entropy and ligand protonation state on MM-PB(GB)SA binding energies for HIV protease. J Comput Chem. 2012;33:2566–80. doi: 10.1002/jcc.23095. [DOI] [PubMed] [Google Scholar]
- Wang J, Hou T. Develop and test a solvent accessible surface area-based model in conformational entropy calculations. J Chem Inf Model. 2012;52:1199–212. doi: 10.1021/ci300064d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- • .Sun H, Li Y, Tian S, et al. Assessing the performance of MM/PBSA and MM/GBSA methods. 4. Accuracies of MM/PBSA and MM/GBSA methodologies evaluated by various simulation protocols using PDBbind data set. Phys Chem Chem Phys. 2014;16:16719–1672. doi: 10.1039/c4cp01388c. [DOI] [PubMed] [Google Scholar]; • A large-scale evaluation of MM/PBSA and MM/GBSA.
- Genheden S. MM/GBSA and LIE estimates of host–guest affinities: dependence on charges and solvation model. J Comput Aided Mol Des. 2011;25:1085–93. doi: 10.1007/s10822-011-9486-1. [DOI] [PubMed] [Google Scholar]
- Genheden S, Luchko T, Gusarov S, et al. An MM/3D-RISM approach for ligand-binding affinities. J Phys Chem B. 2010;114:8505–16. doi: 10.1021/jp101461s. [DOI] [PubMed] [Google Scholar]
- Freedman H, Huynh LP, Le L, et al. Explicitly solvated ligand contribution to continuum solvation models for binding free energies: selectivity of theophylline binding to and RNA aptamer. J Phys Chem B. 2010;114:2227–37. doi: 10.1021/jp9059664. [DOI] [PubMed] [Google Scholar]
- Kongsted J, Söderhjelm P, Ryde U. How accurate are continuum solvation models for drug-like molecules? J Comput-Aided Mol Des. 2009;23:395–409. doi: 10.1007/s10822-009-9271-6. [DOI] [PubMed] [Google Scholar]
- Tan C, Tan YH, Luo R. Implicit nonpolar models. J Phys Chem B. 2007;111:12263–74. doi: 10.1021/jp073399n. [DOI] [PubMed] [Google Scholar]
- • .Söderhjelm P, Kongsted J, Ryde U. Ligand affinities estimated by quantum chemical calculations. J Chem Theory Comput. 2010;6:1726–37. doi: 10.1021/ct9006986. [DOI] [PubMed] [Google Scholar]; • First study to use a high-level QM method (MP2/cc-pVTZ) in a MM/PBSA-like approach to estimate binding affinity.
- Genheden S, Kongsted J, Söderhjelm P, et al. Nonpolar solvation free energies of protein–ligand complexes. J Chem Theory Comput. 2010;6:3558–68. doi: 10.1021/ct100272s. [DOI] [PubMed] [Google Scholar]
- • .Genheden S, Mikulskis P, Hu L, et al. Accurate predictions of non-polar solvation free energies require explicit consideration of binding site hydration. J Am Chem Soc. 2011;133:13081–92. doi: 10.1021/ja202972m. [DOI] [PubMed] [Google Scholar]; • Discusses the problem of the continuum non-polar solvation approximation in MM/PBSA.
- Wallnoefer HG, Liedl KR, Fox T. A challenging system: free energy prediction for factor Xa. J Comput Chem. 2011;32:1743–52. doi: 10.1002/jcc.21758. [DOI] [PubMed] [Google Scholar]
- Wong S, Amaro RE, McCammon JA. MM-PBSA computes key role of intercalating water molecules at a protein–protein interface. J Chem Theory Comput. 2009;5:422–9. doi: 10.1021/ct8003707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenidge PA, Kramer C, Mozziconacci JC, et al. MM/GBSA binding energy prediction on the PDBbind data set: successes, failures, and directions for further improvement. J Chem Inf Model. 2012;53:201–9. doi: 10.1021/ci300425v. [DOI] [PubMed] [Google Scholar]
- • .Guimarães CR, Mathiowetz AM. Addressing limitations with the MM-GB/SA scoring procedure using the WaterMap method and free energy perturbation calculations. J Chem Inf Model. 2010;50:547–59. doi: 10.1021/ci900497d. [DOI] [PubMed] [Google Scholar]; • Interesting combination of MM/PBSA and the popular WaterMap method.
- Kohlmann A, Zhu X, Dalgarno D. Application of MM-GB/SA and WaterMap to SRC kinase inhibitors potency prediction. ACS Med Chem Letters. 2012;3:94–9. doi: 10.1021/ml200222u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abel R, Salam NK, Shelley JK, et al. Contribution of explicit solvent effects to the binding affinity of small-molecule inhibitors in blood coagulation factor serine proteases. Chem Med Chem. 2011;6:1049–66. doi: 10.1002/cmdc.201000533. [DOI] [PubMed] [Google Scholar]
- Genheden S, Söderhjelm P, Ryde U. Transferability of conformational dependent charges from protein simulations. Int J Quant Chem. 2012;112:1768–85. [Google Scholar]
- Söderhjelm P, Ryde U. Conformational dependence of charges in protein simulations. J Comput Chem. 2009;30:750–60. doi: 10.1002/jcc.21097. [DOI] [PubMed] [Google Scholar]
- • .Söderhjelm P, Ryde U. How accurate can a force field become? A polarizable multipole model combined with fragment-wise quantum-mechanical calculations. J Phys Chem A. 2009;113:617–27. doi: 10.1021/jp8073514. [DOI] [PubMed] [Google Scholar]; • Develops a high-level QM method to study protein–ligand interactions and discusses the accuracy limit of force fields.
- Jiao D, Zhang J, Duke RE, et al. Trypsin–ligand binding free energies from explicit and implicit solvent simulations with polarizable potential. J Comput Chem. 2009;30:1701–11. doi: 10.1002/jcc.21268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Singh N, Warshel A. Absolute binding free energy calculations: on the accuracy of computational scoring of protein-ligand interactions. Proteins. 2010;78:1705–23. doi: 10.1002/prot.22687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schutz CN, Warshel A. What are the dielectric ”constants” of proteins and how to validate electrostatic models. Proteins. 2001;44:400–17. doi: 10.1002/prot.1106. [DOI] [PubMed] [Google Scholar]
- • .Hou T, Wang J, Li Y, et al. Assessing the performance of the MM/PBSA and MM/GBSA methods. 1. The accuracy of binding free energy calculations based on molecular dynamics simulations. J Chem Inf Model. 2011;51:69–82. doi: 10.1021/ci100275a. [DOI] [PMC free article] [PubMed] [Google Scholar]; • Evaluation of several of the terms in MM/PBSA.
- Ravindranathan K, Tirado-Rives J, Jorgensen WL, et al. Improving MM-GB/SA scoring through the application of the variable dielectric model. J Chem Theory Comput. 2011;7:3859–65. doi: 10.1021/ct200565u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kongsted J, Ryde U. An improved method to predict the entropy term with the MM/PBSA approach. J Comput-Aided Mol Design. 2009;23:63–71. doi: 10.1007/s10822-008-9238-z. [DOI] [PubMed] [Google Scholar]
- Genheden S, Kuhn O, Mikulskis P, et al. The normal-mode entropy in the MM/GBSA method: effect of system truncation, buffer region, and dielectric constant. J Chem Inf Model. 2012;52:2079–88. doi: 10.1021/ci3001919. [DOI] [PubMed] [Google Scholar]
- Page CS, Bates PA. Can MM-PBSA calculations predict the specificity of protein kinase inhibitors? J Comput Chem. 2006;27:1990–2007. doi: 10.1002/jcc.20534. [DOI] [PubMed] [Google Scholar]
- Kopitz H, Cashman DA, Pfeiffer-Marek S, et al. Influence of the solvent representation on vibrational entropy calculations: generalized Born versus distance-dependent dielectric model. J Comput Chem. 2012;12:1004–13. doi: 10.1002/jcc.22933. [DOI] [PubMed] [Google Scholar]
- • .Genheden S, Ryde U. Will molecular dynamics simulations of proteins ever reach equilibrium. Phys Chem Chem Phys. 2012;14:8662–77. doi: 10.1039/c2cp23961b. [DOI] [PubMed] [Google Scholar]; • Discusses the problem of estimating entropy from molecular dynamic simulations.
- • .Raha K, Merz KM. Large-scale validation of a quantum mechanics based scoring function: Predicting the binding affinity and the binding mode of a diverse set of protein−ligand complexes. J Med Chem. 2005;48:4558–75. doi: 10.1021/jm048973n. [DOI] [PubMed] [Google Scholar]; • First attempt to a SQM-based scoring on a large scale.
- Gao C, Park MS, Stern HA. Accounting for ligand conformational restrictions in calculations of protein–ligand binding affinities. Biophys J. 2010;98:901–10. doi: 10.1016/j.bpj.2009.11.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang CE, Chen W, Gilson MK. Evaluating the accuracy of the quasiharmonic approximation. J Chem Theory Comput. 2005;1:1017–28. doi: 10.1021/ct0500904. [DOI] [PubMed] [Google Scholar]
- Guimaraes CR, Cardozo M. MM-GB/SA rescoring of docking poses in structure-based lead optimization. J Chem Inf Model. 2008;48:958–70. doi: 10.1021/ci800004w. [DOI] [PubMed] [Google Scholar]
- Söderhjelm P, Genheden S, Ryde U. Quantum mechanics in structure-based ligand design. In: Gohlke H, editor. Protein–ligand interactions; Methods and principles in medicinal chemistry. Vol. 53. Wiley-VCH Verlag; Weinheim: 2012. pp. 121–43. editor. p. [Google Scholar]
- Diaz N, Suarez D, Merz KM, et al. Molecular dynamics simulations of the TEM-1,beta-lactamase complexed with cephalothin. J Med Chem. 2005;48:780–91. doi: 10.1021/jm0493663. [DOI] [PubMed] [Google Scholar]
- Anisimov VM, Cavasotto CN. Quantum mechanical binding free energy calculation for phosphopeptide inhibitors of the Lck SH2 domain. J Comput Chem. 2011;32:2254–63. doi: 10.1002/jcc.21808. [DOI] [PubMed] [Google Scholar]
- Mikulskis P, Genheden S, Wichmann K, et al. A semiempirical approach to ligand binding affinities: Dependence on the Hamiltonian and corrections. J Comput Chem. 2012;33:1179–89. doi: 10.1002/jcc.22949. [DOI] [PubMed] [Google Scholar]
- Khandelwal A, Lukacova, Comez D, et al. A combination of docking, QM/MM methods, and MD simulation for binding affinity estimation of metalloprotein ligands. J Med Chem. 2005;48:5437–47. doi: 10.1021/jm049050v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gräter F, Schwarzl SM, Dejaegere A, et al. Protein/ligand binding free energies calculated with quantum mechanics/molecular mechanics. J Phys Chem B. 2005;109:10474–83. doi: 10.1021/jp044185y. [DOI] [PubMed] [Google Scholar]
- Kaukonen M, Söderhjelm P, Heimdal J, et al. QM/MM-PBSA method to estimate free energies for reactions in proteins. J Phys Chem B. 2008;112:12537–48. doi: 10.1021/jp802648k. [DOI] [PubMed] [Google Scholar]
- Ciancetta A, Genheden S, Ryde U. A QM/MM study of the binding of RAPTA ligands to cathepsin B. J Comput-Aided Mol Des. 2011;25:729–42. doi: 10.1007/s10822-011-9448-7. [DOI] [PubMed] [Google Scholar]
- Lu H, Huang X, Abdulhameed MD, et al. Binding free energies for nicotine analogs inhibiting cytochrome P450 2A6 by a combined use of molecular dynamics simulations and QM/MM-PBSA calculations. Bioorg Med Chem. 2014;22:2149–56. doi: 10.1016/j.bmc.2014.02.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fox SJ, Dziedzic J, Fox T, et al. Density functional theory calculations on entire proteins for free energies of binding: application to a model polar binding site. Proteins. 2014;82:3335–46. doi: 10.1002/prot.24686. [DOI] [PubMed] [Google Scholar]
- Dziedzic J, Fox SJ, Fox T, et al. Large-scale DFT calculations in implicit solvent: a case study on the T4 lysozyme L99A/M102Q protein. Int J Quant Chem. 2013;113:771–85. [Google Scholar]
- Sure R, Antony J, Grimme S. Blind prediction of binding affinities for charged supramolecular host-guest systems: achievements and shortcomings of DFT-D3. J Phys Chem B. 2014;118:3431–40. doi: 10.1021/jp411616b. [DOI] [PubMed] [Google Scholar]
- Mikulskis P, Cioloboc D, Andrejic M, et al. Free-energy perturbation and quantum mechanical study of SAMPL4 octa-acid host-guest binding energies. J Comput Aided Mol Design. 2014;28:375–400. doi: 10.1007/s10822-014-9739-x. [DOI] [PubMed] [Google Scholar]
- Kolar M, Fanfrlik J, Hobza P. Ligand conformational and solvation/desolvation free energy in protein-ligand complex formation. J Phys Chem B. 2011;115:4718–24. doi: 10.1021/jp2010265. [DOI] [PubMed] [Google Scholar]
- Retegan M, Milet A, Jame H. Exploring the binding of inhibitors derived from tetrabromobenzimidazole to the CK2 protein using a QM/MM-PB/SA approach. J Chem Inf Model. 2009;49:963–71. doi: 10.1021/ci8004435. [DOI] [PubMed] [Google Scholar]
- Barril X, Gelpi JL, Lopez JM, et al. How accurate can molecular dynamics/linear response and Poisson-Boltzmann/solvent accessible surface calculations be for predicting relative binding affinities? Acetylcholinesterase Huprine inhibitors as a test case. Theor Chem Acc. 2001;106:2–9. [Google Scholar]
- Kollman PA, Kuhn B. Binding of a diverse set of ligands to avidin and streptavidin: an accurate quantitative prediction of their relative affinities by a combination of molecular mechanics and continuum solvent models. J Med Chem. 2002;43:3786–91. doi: 10.1021/jm000241h. [DOI] [PubMed] [Google Scholar]
- Hou T, Guo S, Xu X. Predictions of binding of a diverse set of ligands to gelatinase-A by a combination of molecular dynamics and continuum solvent models. J Phys Chem B. 2002;106:5527–35. [Google Scholar]
- Salvalaglio M, Zamolo L, Busini V, et al. Molecular modeling of Protein A affinity chromatography. J Chrom A. 2009;1216:8678–86. doi: 10.1016/j.chroma.2009.04.035. [DOI] [PubMed] [Google Scholar]
- Muddana HS, Varnado D, Bielawski CW, et al. Blind prediction of host–guest binding affinities: a new SAMPL3 challenge. J Comput-Aided Mol Des. 2012;26:475–87. doi: 10.1007/s10822-012-9554-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mikulskis P, Genheden S, Rydberg P, et al. Binding affinities in the SAMPL3 trypsin and host–guest blind tests estimated with the MM/PBSA and LIE methods. J Comput-Aided Mol Des. 2012;26:527–41. doi: 10.1007/s10822-011-9524-z. [DOI] [PubMed] [Google Scholar]
- Genheden S, Nilsson I, Ryde U. Binding affinities of factor Xa inhibitors estimated by thermodynamic integration and MM/GBSA. J Chem Inf Model. 2011;51:947–58. doi: 10.1021/ci100458f. [DOI] [PubMed] [Google Scholar]
- Genheden S, Ryde U. Comparison of the efficiency of the LIE and MM/GBSA methods to calculate ligand-binding energies. J Chem Theory Comput. 2011;7:3768–78. doi: 10.1021/ct200163c. [DOI] [PubMed] [Google Scholar]
- • .Mikulskis P, Genheden S, Ryde U. A Large-scale test of free-energy simulation estimates of protein–ligand binding affinities. J Chem Inf Model. 2014;54:2794–806. doi: 10.1021/ci5004027. [DOI] [PubMed] [Google Scholar]; • Large-scale evaluation of alchemical perturbation methods.
- Laitinen T, Kankare JA, Perakyla M. Free energy simulations and MM-PBSA analyses on the affinity and specificity of steroid binding to antiestradiol antibody. Proteins. 2004;55:34–43. doi: 10.1002/prot.10399. [DOI] [PubMed] [Google Scholar]
- Guimaraes CR. A direct comparison of the MM-GB/SA scoring procedure and free-energy perturbation calculations using carbonic anhydrase as a test case: strengths and pitfalls of each approach. J Chem Theory Comput. 2011;7:2296–306. doi: 10.1021/ct200244p. [DOI] [PubMed] [Google Scholar]
- Gouda H, Kuntz ID, Case DA, et al. Free energy calculations for theophylline binding to an RNA aptamer: comparison of MM-PBSA and thermodynamic integration methods. Biopol. 2003;68:16–34. doi: 10.1002/bip.10270. [DOI] [PubMed] [Google Scholar]
- Wang L, Dent Y, Knight JL, et al. Modeling local structural rearrangements using FEP/REST: application to relative binding affinity predictions of CDK2 inhibitors. J Chem Theory Comput. 2012;9:1282–93. doi: 10.1021/ct300911a. [DOI] [PubMed] [Google Scholar]
- Bea I, Cervello E, Kollman PA, et al. Molecular recognition by beta-cyclodextrin derivatives: FEP vs MM/PBSA study. Comb Chem High Through Screen. 2001;4:605–11. doi: 10.2174/1386207013330689. [DOI] [PubMed] [Google Scholar]