Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Jan 1.
Published in final edited form as: J Comput Aided Mol Des. 2016 Sep 22;31(1):1–19. doi: 10.1007/s10822-016-9974-4

Overview of the SAMPL5 host–guest challenge: Are we doing better?

Jian Yin 1, Niel M Henriksen 1, David R Slochower 1, Michael R Shirts 2, Michael W Chiu 3, David L Mobley 4, Michael K Gilson 1
PMCID: PMC5241188  NIHMSID: NIHMS818646  PMID: 27658802

Abstract

The ability to computationally predict protein-small molecule binding affinities with high accuracy would accelerate drug discovery and reduce its cost by eliminating rounds of trial-and-error synthesis and experimental evaluation of candidate ligands. As academic and industrial groups work toward this capability, there is an ongoing need for datasets that can be used to rigorously test new computational methods. Although protein–ligand data are clearly important for this purpose, their size and complexity make it difficult to obtain well-converged results and to troubleshoot computational methods. Host–guest systems offer a valuable alternative class of test cases, as they exemplify noncovalent molecular recognition but are far smaller and simpler. As a consequence, host–guest systems have been part of the prior two rounds of SAMPL prediction exercises, and they also figure in the present SAMPL5 round. In addition to being blinded, and thus avoiding biases that may arise in retrospective studies, the SAMPL challenges have the merit of focusing multiple researchers on a common set of molecular systems, so that methods may be compared and ideas exchanged. The present paper provides an overview of the host–guest component of SAMPL5, which centers on three different hosts, two octa-acids and a glycoluril-based molecular clip, and two different sets of guest molecules, in aqueous solution. A range of methods were applied, including electronic structure calculations with implicit solvent models; methods that combine empirical force fields with implicit solvent models; and explicit solvent free energy simulations. The most reliable methods tend to fall in the latter class, consistent with results in prior SAMPL rounds, but the level of accuracy is still below that sought for reliable computer-aided drug design. Advances in force field accuracy, modeling of protonation equilibria, electronic structure methods, and solvent models, hold promise for future improvements.

Keywords: Host–guest, Molecular recognition, Computer-aided drug design, Blind challenge, Binding affinity

Introduction

Structure-based computer-aided drug design (CADD) methodologies are widely used to assist in the discovery of small molecule ligands for proteins of known three-dimensional structure [13]. Docking and scoring methods can assist with qualitative hit identification and optimization [46], and explicit solvent free energy methods [710] are beginning to show promise as an at least semi-quantitative tool to identify promising variants on a defined chemical scaffold [1114]. However, despite numerous efforts to improve the reliability of CADD by going beyond docking and scoring methods, ligand design still includes a large component of experimental trial and error, and the reasons why CADD methods are often not predictive are unclear. Although likely sources of substantial systematic error are well known—such as inaccuracy in the energy models used and uncertainty in protonation and tautomer states—it is difficult, and perhaps impossible, to analyze systematic errors in any detail, because incomplete conformational sampling of proteins adds large, ill-characterized random error.

As a consequence, host–guest systems [1525] are finding increasing application as substitutes for protein– ligand systems in the evaluation of computational methods of predicting binding affinities [2628]. A host is a compound much smaller than a protein but still large enough to have a cavity or cleft into which a guest molecule can bind by non-covalent forces. Host–guest systems can be identified that highlight various issues in protein–ligand binding, including receptor flexibility, solvation, hydrogen bonding, the hydrophobic effect, tautomerization and ionization. Because host molecules tend to be more rigid and always have far fewer degrees of freedom than proteins, random error due to inadequate or uncertain conformational sampling can be dramatically reduced, allowing a tight focus on other sources of error. Additionally, host–guest systems arguably represent a minimalist threshold test for methods of estimating binding affinities, as it is improbable that a method which does not work for such simple systems could succeed for more complex proteins.

Accordingly, host–guest systems have been included in rounds 3, 4 and now 5, of the Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) project, a community-wide prediction challenge to evaluate computational methods related to CADD [2932]. The SAMPL project has traditionally posed challenges involving not only binding affinities but also simpler physical properties, such as hydration free energies of small molecules, and, in the present SAMPL5, distribution coefficients of drug-like molecules between water and cyclohexane. Importantly, SAMPL is a blinded challenge, which means that the unpublished experimental measurements are withheld from participants until the predictions have been made and submitted. This approach avoids the risk, in retrospective computational studies, of adjusting parameters or protocols to yield agreement with the known data, leading to results which appear promising but are not in fact reflective of how the method will perform on new data. In addition, SAMPL challenges facilitate comparisons among methods, because all participants address the same problems, and the consistency of the procedures offers the possibility of comparing results from one challenge to the next, in order to at least begin to track the state of the art.

The most recent challenge, SAMPL5, included 22 host–guest systems (Fig. 1), which attracted 54 sets of predictions from seven research groups. Here, we provide an overview of this challenge and the results. (Note that many participants also have provided individual papers on their host–guest predictions, most in this same special issue, and that additional papers address the distribution coefficient challenge that also was part of SAMPL5.) The present paper is organized as follows. We first introduce the design of the current SAMPL challenge, including descriptions of the host–guest systems and measurements, information on how the challenge was organized, and the nature of the submissions. We then analyze the performance of the various computational methods, using a number of different error metrics, and compare the results with each other and with those from prior SAMPL host–guest challenges.

Fig. 1.

Fig. 1

Structures of host OAH, OAMe, CBClip and their guest molecules. OA and OAMe are also known as OA and TEMOA, respectively. All host molecules are shown in two perspectives. Silver carbon, Blue nitrogen, Red oxygen, Yellow sulfur. Non-polar hydrogen atoms were omitted for clarity. OA-G1–OA-G6 are the common guest molecules for OAH and OAMe, and CBC-G1–CBC-G10 are guests for CBClip. Protonation states of all host and guest molecules shown in the figure were suggested by the organizers based on the expected pKas and the experimental pH values

Methods

Structures of Host–Guest Systems and Experimental Measurements

The SAMPL5 host–guest challenge involves three host molecules, which were synthesized and studied in the laboratories of Prof. Bruce Gibb and Prof. Lyle Isaacs, who kindly allowed the experimental data to be included in the SAMPL5 challenge before being published. The first two hosts, OAH [33] and OAMe, from the Gibb laboratory, are also known as octa-acid (OA) and tetra-endo-methyl octa-acid (TEMOA) [34, 35]. The third, CBClip [36], was developed in the Isaacs laboratory. Representative 3D structures along with the 2D drawings of their respective SAMPL5 guest molecules, are shown in Fig. 1. Host OAH was used in the SAMPL4 challenge [31], but with a different set of guests. One end of it has a wide opening to a bowl-shaped binding site, while the other end has a narrow opening that is too small to admit most guests. The bowl's opening is rimmed by four carboxylic acids, and another four carboxylic groups extend into solution from the closed end. The carboxylic groups were added to promote solubility and are not thought to interact closely with any of the guests. Host OAMe is identical to OAH, except for the addition of four methyl groups to the aromatic rings at the rim of the portal. The common guest molecules of OAH and OAMe, OA-G1–OA-G6 were chosen based on chemical diversity, solubility, and an expectation that they would exhibit significant binding to these hosts. Host CBClip is an acyclic molecular clip that is chemically related to the cucurbiturils used in previous SAMPL projects [30, 31]. It consists of two glycoluril units, each with an aromatic sidewall, and four sulfonate solubilizing groups. Ten molecules, CBC-G1–CBC-G10, were chosen as guests of CBClip, with the aim of attaining a wide range of affinities.

The experimental binding data for all three sets of host–guest systems are listed in Table 1. A 1:1 binding stoichiometry was confirmed experimentally in all cases. The binding affinities of most OAH/OAMe complexes were measured using two different techniques, NMR and ITC, and binding enthalpies are also available for the ones studied by ITC. The NMR experiments were carried out in 10 mM sodium phosphate buffer at a pH of 11.3, while the ITC experiments were performed in 50 mM sodium phosphate buffer at pH 11.5. Both sets of experiments were conducted at 298 K, except that the NMR results for OAMe-G4 were obtained at 278 K. In the SAMPL5 instruction file (see Supplementary Material), we provided expected buffer conditions for OAH/OAMe systems as “aqueous 10 mM sodium phosphate buffer at pH 11.5, at 298 K, except for OA-G6, for which the buffer was 50 mM sodium phosphate”, based on information from Dr. Gibb. Therefore, the binding affinities measured under these conditions were used for the present error analysis when-ever they are available; i.e., the ITC values for OA-G6 and NMR values for the rest. Note that OA-G4 with OAH was measured only by ITC, so this value was used in the present analysis. For the CBClip systems, the experimental studies were carried out in 20 mM sodium phosphate buffer at pH 7.4, at a temperature of 298 K. Most of the CBClip binding affinities were measured by either NMR or UV/Vis spectroscopy. However, CBC-G6 and CBC-G10 were measured by both techniques; for these, the present analysis uses the results with the highest confidence level indicated by the experimentalists: UV/Vis measurement for CBC-G6 and fluorescence for CBC-G10. Detailed experimental data for OAH and OAMe system are provided in the SAMPL5 special issue [37], and data for CBClip systems are provided elsewhere [38]. Note that a different set of numbering was used for both hosts and guests in the octa acid experimental paper.

Table 1.

Experimental standard binding affinities (ΔG°) of OAH, OAMe and CBClip (1 M standard concentration) used as references for SAMPL5 host-guest affinity predictions. All binding affinities discussed in the present work denote standard binding affinities

Compd ID Technique Buffer Conc (mM) Ka (M−1)a ΔG° (kcal/mol)b Uncertainty (kcal/mol)
OAHc
    OA-G1 NMR 10 (5.00 ± 0.07) × 103 −5.04 0.01
    OA-G2 NMR 10 (1.31 ± 0.04) × 103 −4.25 0.01
    OA-G3 NMR 10 (5.16 ± 0.09) × 103 −5.06 0.01
    OA-G4 ITC 50 (7.43 ± 0.04) × 106 −9.37 0.00
    OA-G5 NMR 10 (1.996 ± 0.005) × 103 −4.50 0.00
    OA-G6 ITC 50 (8.15 ± 0.07) × 103 −5.33 0.00
OAMec
    OA-G1 NMR 10 (6.94 ± 0.76) × 103 −5.24 0.05
    OA-G2 NMR 10 (4.96 ± 0.37) × 103 −5.04 0.03
    OA-G3 NMR 10 (2.31 ± 0.66) × 104 −5.94 0.12
    OA-G4d NMR 10 (5.58 ± 0.28) × 101 −2.38 0.02
    OA-G5 NMR 10 (7.29 ± 0.33) × 102 −3.90 0.02
    OA-G6 ITC 50 (2.05 ± 0.10) × 103 −4.52 0.02
CBClip
    CBC-G1 NMR 20 (1.9 ± 0.1) × 104 −5.83 0.03
    CBC-G2 NMR 20 70 ± 8 −2.51 0.07
    CBC-G3 NMR 20 (8.8 ± 0.5) × 102 −4.02 0.03
    CBC-G4 UV/VIS 20 (2.0 ± 0.1) × 105 −7.24 0.03
    CBC-G5 UV/VIS 20 (1.8 ± 0.2) × 106 −8.53 0.07
    CBC-G6e UV/VIS 20 (2.2 ± 0.2) × 106 −8.64 0.05
    CBC-G7 UV/VIS 20 (6.2 ± 0.2) × 103 −5.17 0.02
    CBC-G8 UV/VIS 20 (3.3 ± 0.2) × 104 −6.17 0.04
    CBC-G9 UV/VIS 20 (2.6 ± 0.1) × 105 −7.39 0.02
    CBC-G10f Fluorescence 20 (3.9 ± 0.2) × 107 −10.35 0.03
a

All Ka values are reported as mean ± standard deviation (SD), where the SD was computed from the experimental replicates

b

The uncertainties of experimental ΔG° values are reported as standard error of the mean (SEM), obtained from the experimental replicates of Ka (see Table S1)

c

For the complete set of NMR and ITC data for OAH/OAMe systems, please see ref [37]

d

Ka values were measured at 298 K except OAMe-G4 was measured at 278 K

e

The binding constant of CBClip-G6 measured by fluorescence titration is (2.3 ± 0.2) × 106 M−1

f

The binding constant of CBClip-G10 is (2.4 ± 0.4) × 107 M−1 measured by UV/Vis titration and (6.6 ± 0.7) × 107 M−1 measured by UV/Vis competition assay

Design of the SAMPL5 host–guest challenge

The SAMPL5 challenge was organized in collaboration with the Drug Design Data Resource (D3R). The general information, detailed instructions, and input files for SAMPL5 were posted on the D3R website (https://drugdesigndata.org/about/sampl5) mostly before September 15, 2015; the information for three guest molecules in the CBClip series was added in mid-October. Submissions were accepted from registered participants until the February 2 deadline. Multiple sets of predictions were allowed for any or all of the host–guest series. Experimental measurements and error analyses of all predictions were released shortly after the submission deadline, and many participants discussed their results and the challenge at the D3R workshop held March 9–11, 2016, at University of California San Diego. All participants were invited to submit a manuscript about their calculations and results before a June 20, 2016 deadline, and the resulting papers accompany this overview in the special issue of the Journal of Computer-Aided Molecular Design.

The SAMPL5 host–guest instruction files provided the expected experimental conditions for each set of host–guest systems, including pH, buffer composition and temperature, though these were subject to adjustment because some experiments were still being done when the instructions were distributed. The instructions noted that all acidic groups of the host molecules seemed likely to be ionized at the experimental pH values (above), leading to net charges of net charges of −8, −8 and −4 for OAH, OAMe and CBClip, respectively; but also noted that this assumption was open to modification by each participant. Plausible three-dimensional coordinates of host CBClip were provided by Prof. Lyle Isaacs, while the starting 3D structures of OAH and OAMe were built and energy-minimized with the program MOE [39]. The protonation states of all guest molecules in their unbound state were also suggested, based on their expected pKas and the experimental pH values (Fig. 1), but, again, it was made clear that each participant had to make his or her own judgment regarding the ionization states and whether they remained unchanged on binding their respective hosts. The initial structures of the free guest molecules were constructed by conformational search with MOE. The resulting structures of free hosts and guests were provided in the download as PDB, mol2 and SD files. (A bond order issue in a few SD files of the free CBClip guests were reported by SAMPL users at the workshop; two submissions using the Movable Type method were adversely affected [40].) When submitting their predictions, participants were required to provide not only estimated binding free energies, but also computational uncertainties, in the form of standard errors of the mean (SEM). New in SAMPL5, participants were also invited to provide predictions of the binding enthalpies for the octa-acid host–guest systems, OA and OAMe, but this aspect of the challenge is not discussed in the present overview paper because only one group predicted enthal-pies [41].

In prior rounds of SAMPL, it was observed that participants using ostensibly equivalent force fields and simulation procedures to compute binding free energies sometimes reported rather different predictions. To help resolve such situations in case they arose in SAMPL5, participants using explicit solvent free energy methods were encouraged to submit additional “standard” runs with a prescribed set of force field and simulation parameters. The systems selected for these standard runs were OAH-G3 and OAH-G4. The input files for plausibly docked host–guest complexes solvated in TIP3P water with counterions were provided to participants in Amber [42], Gromacs [43], Desmond [44], and LAMMPS [45] formats, along with the other SAMPL5 starting data. The procedures used to ascertain that all four standard setups were equivalent across all four software packages are detailed in another paper in this issue [46].

Error analysis

Details of the experimental measurements for the host–guest complexes in SAMPL5 are available elsewhere [37, 38]; all available data were provided to the participants after the close of the challenge. For most of the OAH and OAMe cases, affinities were measured by more than one technique [37]. Some SAMPL5 participants used the averaged affinities for their error analysis, while others selected affinities measured by either NMR or ITC. It is up to the participants to determine which set of experimental affinities to use for their own analyses, as long as consistent criteria are used, rather than choosing ones that generate the best agreement with the computational estimates. In any case, given that the different affinities measured for the same host–guest pairs vary only slightly, this factor should not influence the judgement of the performance of any submissions to a significant extent. In the present paper, the error analysis for OAH and OAMe is based on comparisons with the selected NMR/ITC affinities listed in Table 1, which were chosen to best match the experimental conditions that participants were told to expect when the challenge was set. However, our statistical analyses change little on comparing instead with, for example, the average of all available affinities (ITC and NMR) for each host–guest pair. For example, the RMSE values change by at most 0.1 kcal/mol; see the error metric spreadsheet in SI. For completeness, detailed error metrics based on both sets of experimental affinities (those listed in Table 1, and averaged) are provided in the SI. The SI also provides the experimental replicates (Prof. Bruce Gibb, personal communication), which we used to estimate the experimental uncertainties in Table 1.

All binding affinity prediction sets were compared with the corresponding experimental data by four measures: root mean-squared error (RMSE), Pearson coefficient of determination (R2), linear regression slope (m), and the Kendall rank correlation coefficient (τ). Evaluating these statistics was straightforward for predictions of absolute (also known as standard) binding free energies, and the results are presented here as “absolute error metrics”. For OAH and OAMe, some submissions included only relative binding free energies, and comparing these with experiment is more complicated. One approach for handling relative free energies would be to reference all of the relative binding free energies to a single guest molecule, but then the apparent accuracy can become particularly sensitive to the quality of the calculations for the reference guest. Another approach would be to consider all pairwise free energy differences, but this becomes cumbersome and redundant. Additionally, a method is needed to compare the accuracy of relative and absolute free energy calculations on a uniform footing.

Here, we adopted an approach used in analyzing the SAMPL4 challenge [31], in which the mean signed error (MSE) of each submission set, whether relative or absolute, is subtracted from each prediction leading to “offset” binding affinity estimates. The error metrics for comparisons to experiment are less sensitive with this approach than using any particular host–guest system as a reference. We compute the offset binding free energies for each method as follows

ΔGi,ocalc=ΔGi,rcalcMSE (1)
MSE=1ni=1n(ΔGiexpΔGi,rcalc) (2)

where ΔGi,rcalc is the reported (absolute or relative) binding affinity for each prediction i, ΔGiexp is the corresponding absolute experimental binding affinity, ΔGi,ocalc is the offset binding affinity, and n is the total number of guests considered. By offsetting both relative and absolute predictions, we can make a fair comparison of their agreement with experiment. We term the error metrics computed with this approach “offset error metrics” and they are named RMSEo, Ro2, mo, and τo.

Given the similarity of the OAH and OAMe hosts, the fact that the same guests were studied for both, and the fact that most submissions included results for both subsets, we provide error statistics for the combined OAH/OAMe datasets. Note that, in the submissions that reported relative affinities, the binding estimates of OAH-G1 and OAMe-G1 were both arbitrarily set as zero, even though the experimental binding affinities of OAH-G1 and OAMe-G1 are not identical. We addressed this problem by applying a separate MSE offset to the data for these two hosts; that is, by subtracting the MSE of the OAH subset from the OAH estimates, and the MSE of the OAMe subset from the OAMe estimates. For instance, in a combined set of OAH/OAMe predictions which contains six relative binding affinity estimates for host OAH and six for OAMe, the offset RMSE error metric, termed RMSEo, is given by

RMSEo=112(i=16[ΔGiexp(ΔGi,rcalcMSE)]OAH2+j=16[ΔGjexp(ΔGj,rcalcMSE)]OAMe2) (3)

We also tested how well the computational predictions performed by comparison with two simple null models, Null1 and Null2. In Null1, all binding free energies were set to 0.0 kcal/mol and the statistical uncertainty for each data point was set to 0.0 kcal/mol. In the Null2 model, the binding affinity estimate for each guest was computed via a linear regression of the experimental binding free energies versus the number of heavy atoms in the corresponding guest molecule, for identical or similar host molecules used in the SAMPL3 [30] and SAMPL4 [31] exercises; the resulting expression is ΔG = −1.11 × number of heavy atoms +5.06 (kcal/mol) for OAH and OAMe systems and ΔG = −0.25 × number of heavy atoms −1.81 (kcal/mol) for the CBClip systems. In order to simulate a real submission, we assigned a statistical uncertainty of 1.0 kcal/-mol to each data point in Null2.

In addition to evaluating how each calculation method performed in this specific challenge (i.e., the two types of error metrics described above), we wanted to provide an estimate of how well each method would perform in general. In other words, we wanted to compute error metric uncertainties which accounted for how the reported statistical error and composition of the data set influences the error metric results. The uncertainty in the error metrics was determined via bootstrap resampling with replacement. Conceptually, this involves creating thousands of hypothetical “experiment versus calculation” data sets which are consistent with the reported uncertainties, and then recording the distribution of the error metrics across all of the hypothetical sets. More specifically, we considered each data point, whether experiment or calculated, as a normal distribution centered on the reported mean value with the width determined by the reported SEM. For each bootstrap cycle, we selected a single random value from that distribution for each data point, which we term “re-sampling”. Additionally, while each bootstrap cycle always had the same total number of host–guest systems (12 for OAH/OAMe, 10 for CBClip), the composition of the data set was selected “with replacement”, meaning that in some cycles there were multiple copies of a host–guest pair, while other pairs were absent. The error metric distributions were generated with a sufficient number of bootstrap cycles, 100,000 in this case, such that the mean and standard deviation of the distributions were reproducible to the second decimal place. For submissions which did not report an uncertainty (see Tables 3, 4 footnotes), the resampling step was omitted. The code for generating the error metrics and plotting the distributions is available both in the SI and on Github.

Table 3.

Absolute and offset error metrics of binding affinity predictions for the combined OAH/OAMe datasets

Method RMSE Absolute error metrics
RMSEo Offset error metrics
R2 m τ Ro2 mo τ o
Null1a 5.3 (5.3 ± 0.5) 1.5 (1.4 ± 0.4) 0.1 (0.2 ± 0.1) 0.1 (0.1 ± 0.1)
Null2b 3.5 (3.5 ± 0.8) 0.0 (0.2 ± 0.2) 0.1 (−0.4 ± 1.0) −0.3 (−0.2 ± 0.3) 2.5 (2.6 ± 0.5) 0.0 (0.2 ± 0.2) 0.2 (−0.3 ± 1.0) −0.2 (−0.2 ± 0.3)
APR-OPC 2.1 (2.1 ± 0.2) 0.8 (0.8 ± 0.2) 1.4 (1.5 ± 0.5) 0.6 (0.6 ± 0.2) 1.2 (1.2 ± 0.4) 0.8 (0.8 ± 0.2) 1.4 (1.5 ± 0.5) 0.6 (0.6 ± 0.2)
APR-TIP3P 1.6 (1.6 ± 0.2) 0.9 (0.9 ± 0.1) 1.4 (1.4 ± 0.2) 0.7 (0.7 ± 0.1) 0.8 (0.9 ± 0.2) 0.9 (0.9 ± 0.1) 1.4 (1.4 ± 0.2) 0.7 (0.7 ± 0.1)
BEDAM 6.1 (6.0 ± 1.0) 0.0 (0.1 ± 0.2) 0.2 (0.6 ± 1.5) 0.0 (0.1 ± 0.3) 4.4 (4.3 ± 0.6) 0.0 (0.1 ± 0.2) 0.5 (0.8 ± 1.4) 0.1 (0.1 ± 0.2)
DFT/TPSS-cc 5.3 (5.2 ± 0.8) 0.3 (0.4 ± 0.2) 2.2 (2.8 ± 1.5) 0.5 (0.5 ± 0.2)
DFT/TPSS-nc 5.5 (5.5 ± 0.8) 0.4 (0.4 ± 0.2) 2.5 (2.4 ± 1.5) 0.3 (0.3 ± 0.3)
DLPNO-CCSD(T)c 7.3 (7.2 ± 1.1) 0.4 (0.4 ± 0.2) 3.1 (2.6 ± 2.0) 0.3 (0.3 ± 0.2)
Metadynamics 3.1 (3.2 ± 0.5) 0.7 (0.5 ± 0.3) 1.2 (1.1 ± 0.6) 0.4 (0.4 ± 0.2) 1.0 (1.4 ± 0.4) 0.7 (0.5 ± 0.3) 1.0 (1.0 ± 0.5) 0.5 (0.4 ± 0.2)
MMPBSA-GAFF 3.6 (3.6 ± 0.5) 0.0 (0.2 ± 0.2) 0.3 (0.1 ± 1.1) 0.1 (0.1 ± 0.3) 3.0 (2.9 ± 0.8) 0.0 (0.2 ± 0.2) 0.3 (0.1 ± 1.1) 0.1 (0.0 ± 0.3)
MovTyp-1d 3.0 (2.9 ± 1.1) 0.0 (0.3 ± 0.3) 0.2 (−0.1 ± 0.9) 0.0 (0.0 ± 0.3) 2.2 (2.1 ± 0.7) 0.1 (0.3 ± 0.3) 0.3 (0.0 ± 0.9) 0.0 (0.0 ± 0.3)
MovTyp-2d 3.1 (2.9 ± 1.1) 0.0 (0.3 ± 0.3) 0.2 (−0.1 ± 1.1) 0.1 (0.1 ± 0.3) 2.6 (2.5 ± 0.7) 0.1 (0.3 ± 0.2) 0.4 (0.1 ± 1.0) 0.1 (0.1 ± 0.3)
TI-ps 2.7 (2.7 ± 0.9) 0.0 (0.2 ± 0.2) 0.2 (0.2 ± 1.0) 0.2 (0.2 ± 0.2) 2.2 (2.4 ± 0.8) 0.1 (0.3 ± 0.2) 0.5 (0.5 ± 0.9) 0.3 (0.2 ± 0.3)
TI-raw 2.7 (2.8 ± 0.9) 0.0 (0.2 ± 0.2) 0.2 (0.3 ± 1.1) 0.2 (0.2 ± 0.2) 2.3 (2.5 ± 0.7) 0.1 (0.2 ± 0.2) 0.5 (0.6 ± 1.0) 0.4 (0.2 ± 0.2)
HBAR 3.0 (3.0 ± 0.9) 0.0 (0.2 ± 0.2) −0.3 (−0.2 ± 1.0) 0.1(0.1 ± 0.2) 2.4 (2.4 ± 0.7) 0.0 (0.2 ± 0.2) 0.1 (0.1 ± 0.8) 0.2 (0.2 ± 0.3)
HBAR-ps 3.0 (2.9 ± 0.9) 0.0 (0.2 ± 0.2) −0.3 (−0.2 ± 1.0) 0.1 (0.1 ± 0.2) 2.3 (2.3 ± 0.7) 0.0 (0.2 ± 0.2) 0.1 (0.1 ± 0.8) 0.2 (0.2 ± 0.3)
HBAR-ps1 3.0 (3.0 ± 0.9) 0.0 (0.2 ± 0.2) −0.3 (−0.2 ± 1.0) 0.1 (0.1 ± 0.3) 2.4 (2.4 ± 0.7) 0.0 (0.2 ± 0.2) 0.1 (0.1 ± 0.8) 0.2 (0.2 ± 0.3)
HBAR-ps2 2.2 (2.3 ± 0.5) 0.0 (0.1 ± 0.2) 0.0 (0.1 ± 0.7) 0.1 (0.1 ± 0.3) 1.7 (1.8 ± 0.4) 0.1 (0.2 ± 0.2) 0.3 (0.4 ± 0.5) 0.3 (0.2 ± 0.2)
SOMD-1 3.6 (3.6 ± 0.3) 0.9 (0.8 ± 0.2) 1.3 (1.2 ± 0.3) 0.5 (0.4 ± 0.2) 0.8 (0.9 ± 0.2) 0.9 (0.8 ± 0.2) 1.2 (1.1 ± 0.3) 0.5 (0.5 ± 0.2)
SOMD-2 3.6 (3.6 ± 0.3) 0.9 (0.8 ± 0.2) 1.3 (1.2 ± 0.3) 0.5 (0.4 ± 0.2) 0.8 (0.9 ± 0.2) 0.9 (0.8 ± 0.2) 1.2 (1.2 ± 0.3) 0.5 (0.5 ± 0.2)
SOMD-3 2.1 (2.1 ± 0.3) 0.9 (0.7 ± 0.2) 1.3 (1.2 ± 0.3) 0.4 (0.4 ± 0.2) 0.9 (1.0 ± 0.2) 0.9 (0.7 ± 0.2) 1.2 (1.1 ± 0.3) 0.5 (0.4 ± 0.2)
SOMD-4 10.0 (10.0 ± 0.5) 0.1 (0.2 ± 0.2) 2.0 (1.7 ± 3.1) 0.3 (0.3 ± 0.2) 9.8 (9.7 ± 1.0) 0.1 (0.2 ± 0.1) 1.9 (1.6 ± 3.1) 0.3 (0.3 ± 0.2)

The values before the parentheses are error metrics computed from the original data in the left column and the offset data in the right column. The mean values and uncertainties inside the parentheses were obtained through resampling the original data with replacement. RMSE, R2, m and τ are the root mean-squared error, Pearson coefficient of determination, linear regression slope and Kendall rank correlation coefficient of the absolute error metrics, as opposed to their respective counterparts, the offset error metrics RMSEo, Ro2, mo and τo

a

Null1 model with a constant value of 0.0 kcal/mol and no computational uncertainties

b

Null2 model based on the linear regression equation ΔG = −1.11 × number of heavy atoms +5.06 kcal/mol (based on SAMPL4 OA data); SEM was set to 1.0 kcal/mol

c

Relative binding affinities were reported; computational uncertainties were not provided

d

Computational uncertainties were not reported

Table 4.

Offset error metrics of binding affinity predictions for the separate OAH and OAMe datasets

Method OAH
OAMe
RMSEo Ro2 mo τ o RMSEo Ro2 mo τ o
Null1a 1.7 (1.6 ± 0.6) 1.1 (1.1 ± 0.3)
Null2b 1.7 (1.9 ± 0.5) 0.4 (0.4 ± 0.3) 0.8 (0.4 ± 4.5) 0.4 (0.3 ± 0.3) 3.1 (3.1 ± 0.8) 0.7 (0.6 ± 0.2) −1.5 (−1.6 ± 1.0) −0.8 (−0.6 ± 0.2)
APR-OPC 0.7 (0.7 ± 0.2) 0.9 (0.7 ± 0.3) 1.1 (1.2 ± 2.7) 0.6 (0.5 ± 0.3) 1.6 (1.6 ± 0.6) 0.9 (0.9 ± 0.2) 2.2 (1.9 ± 0.7) 0.9 (0.7 ± 0.2)
APR-TIP3P 0.8 (0.8 ± 0.2) 1.0 (0.9 ± 0.2) 1.4 (1.5 ± 0.9) 0.6 (0.5 ± 0.3) 0.9 (0.9 ± 0.3) 0.9 (0.9 ± 0.1) 1.6 (1.5 ± 0.4) 0.9 (0.7 ± 0.2)
BEDAM 4.8 (4.7 ± 0.9) 0.0 (0.1 ± 0.2) −0.5 (0.2 ± 18.5) −0.1 (0.0 ± 0.3) 3.9 (3.8 ± 0.7) 0.4 (0.5 ± 0.3) 2.7 (2.1 ± 2.4) 0.3 (0.3 ± 0.4)
DFT/TPSS-cc 5.7 (5.6 ± 1.3) 0.2 (0.4 ± 0.3) 1.7 (3.7 ± 9.1) 0.5 (0.4 ± 0.3) 4.8 (4.7 ± 0.8) 0.7 (0.7 ± 0.2) 4.1 (3.9 ± 2.4) 0.3 (0.3 ± 0.4)
DFT/TPSS-nc 4.4 (4.3 ± 0.7) 0.5 (0.6 ± 0.3) 2.2 (2.0 ± 5.8) 0.3 (0.3 ± 0.4) 6.5 (6.4 ± 1.3) 0.4 (0.5 ± 0.3) 3.9 (2.7 ± 4.3) 0.3 (0.3 ± 0.4)
DLPNO-CCSD(T)c 7.0 (6.9 ± 1.3) 0.5 (0.5 ± 0.3) 3.3 (1.8 ± 9.9) 0.3 (0.3 ± 0.4) 7.6 (7.4 ± 1.7) 0.3 (0.4 ± 0.3) 3.7 (2.2 ± 5.4) 0.2 (0.2 ± 0.4)
Metadynamics 0.7 (0.9 ± 0.3) 0.9 (0.7 ± 0.3) 1.2 (1.1 ± 1.9) 0.6 (0.4 ± 0.3) 1.2 (1.6 ± 0.5) 0.3 (0.3 ± 0.3) 0.6 (0.8 ± 1.1) 0.3 (0.3 ± 0.3)
MMPBSA-GAFF 1.8 (1.8 ± 0.3) 0.5 (0.5 ± 0.3) 1.0 (1.3 ± 5.4) 0.5 (0.4 ± 0.4) 3.9 (3.7 ± 1.2) 0.4 (0.5 ± 0.3) −1.7 (−1.2 ± 1.9) −0.3 (−0.3 ± 0.4)
MovTyp-1d 0.8 (0.7 ± 0.2) 0.8 (0.7 ± 0.3) 1.0 (0.9 ± 2.4) 0.5 (0.4 ± 0.4) 3.1 (2.9 ± 1.0) 0.7 (0.6 ± 0.3) −1.5 (−1.3 ± 0.9) −0.5 (−0.4 ± 0.4)
MovTyp-2d 1.2 (1.2 ± 0.2) 0.8 (0.7 ± 0.3) 1.2 (1.4 ± 4.2) 0.9 (0.8 ± 0.2) 3.5 (3.4 ± 1.0) 0.5 (0.5 ± 0.3) −1.7 (−1.5 ± 1.3) −0.5 (−0.4 ± 0.4)
TI-ps 0.8 (1.3 ± 0.4) 0.8 (0.7 ± 0.2) 1.0 (1.6 ± 3.6) 0.7 (0.5 ± 0.3) 3.0 (3.1 ± 1.1) 0.2 (0.3 ± 0.3) −0.8 (−0.4 ± 1.8) −0.2 (−0.1 ± 0.4)
TI-raw 1.7 (1.9 ± 0.5) 0.5 (0.5 ± 0.3) 1.0 (1.7 ± 4.6) 0.6 (0.4 ± 0.3) 2.7 (2.8 ± 1.1) 0.1(0.4 ± 0.3) −0.6 (−0.2 ± 1.7) −0.2 (−0.1 ± 0.4)
HBAR 1.7 (1.7 ± 0.5) 0.2 (0.4 ± 0.3) 0.3 (0.7 ± 4.8) 0.5 (0.4 ± 0.3) 3.0 (2.9 ± 1.0) 0.2 (0.4 ± 0.3) −0.9 (−0.4 ± 1.5) −0.1 (−0.1 ± 0.4)
HBAR-ps 1.3 (1.3 ± 0.3) 0.4 (0.6 ± 0.2) 0.4 (0.8 ± 2.2) 0.6 (0.5 ± 0.3) 3.0 (2.9 ± 1.0) 0.2 (0.4 ± 0.3) −0.9 (−0.4 ± 1.5) −0.1 (−0.1 ± 0.4)
HBAR-ps1 1.2 (1.2 ± 0.4) 0.5 (0.6 ± 0.2) 0.5 (1.0 ± 2.0) 0.6 (0.6 ± 0.3) 3.2 (3.2 ± 1.0) 0.2 (0.4 ± 0.3) −1.1 (−0.6 ± 1.6) −0.2 (−0.2 ± 0.4)
HBAR-ps2 1.5 (1.5 ± 0.4) 0.3 (0.4 ± 0.2) 0.3 (0.7 ± 1.8) 0.5 (0.4 ± 0.3) 1.9 (2.0 ± 0.5) 0.0 (0.3 ± 0.3) −0.1 (0.2 ± 1.1) −0.1 (0.0 ± 0.4)
SOMD-1 0.8 (1.0 ± 0.4) 0.9 (0.7 ± 0.3) 1.3 (1.1 ± 1.6) 0.5 (0.4 ± 0.4) 0.7 (0.8 ± 0.3) 0.8 (0.7 ± 0.3) 1.1 (1.0 ± 0.5) 0.5 (0.5 ± 0.3)
SOMD-2 0.9 (1.0 ± 0.3) 0.9 (0.8 ± 0.3) 1.4 (1.1 ± 1.6) 0.5 (0.4 ± 0.4) 0.7 (0.8 ± 0.3) 0.8 (0.7 ± 0.3) 1.1 (1.0 ± 0.5) 0.6 (0.5 ± 0.3)
SOMD-3 1.0 (1.1 ± 0.3) 0.9 (0.7 ± 0.3) 1.3 (1.0 ± 1.7) 0.3 (0.3 ± 0.4) 0.7 (0.8 ± 0.3) 0.8 (0.7 ± 0.3) 1.1 (1.0 ± 0.5) 0.6 (0.5 ± 0.3)
SOMD-4 9.5 (9.5 ± 1.5) 0.4 (0.4 ± 0.3) 3.6 (3.5 ± 30.8) 0.5 (0.4 ± 0.3) 10.0 (9.9 ± 1.5) 0.0 (0.3 ± 0.3) −1.4 (−0.5 ± 7.1) 0.1 (0.1 ± 0.4)
TI/BAR 1.0 (1.2 ± 0.4) 0.6 (0.6 ± 0.2) 0.6 (1.1 ± 2.5) 0.9 (0.6 ± 0.3)
MMPBSA-OPLS 3.4 (3.2 ± 1.1) 0.7 (0.6 ± 0.3) −1.7 (−1.4 ± 1.0) −0.6 (−0.4 ± 0.3)

The values before the parentheses are error metrics computed from the original data offset by the MSE. The mean values and uncertainties inside the parentheses were obtained through resampling the original data using replacement. The offset error metrics RMSEo, Ro2, mo and τo are provided

a

Null1 model with a constant value 0.0 kcal/mol and no computational uncertainties

b

Null2 model based on the linear regression equation ΔG = −1.11 × number of heavy atoms + 5.06 kcal/mol (based on SAMPL4 OA data); SEM was set to 1.0 kcal/mol

c

Computational uncertainties were not reported

d

Computational uncertainties were not reported

Results

The SAMPL5 host–guest challenge received a total of 54 submissions from 7 research groups, comprising 12 submissions for host CBClip, 21 submissions for OAH and 21 submissions for OAMe. Key aspects of all prediction methods—the conformational sampling method, the force field used for the host and guest, and the water model—are summarized in Table 2. After merging submissions that used identical methods for both OAH and OAMe, the 42 OAH/OAMe submissions reduce to 20 sets of combined predictions, along with two subset predictions: TI-BAR (Table 2) for only OAH, and MMPBSA-OPLS (Table 2) for only OAMe. The conformational sampling techniques used include docking, molecular dynamics (MD) simulations with explicit or implicit solvent, and Monte Carlo methods. Compared with SAMPL3 and SAMPL4 exercises, docking was less frequently used as the sole sampling technique, but it was commonly used for obtaining the starting structures for more detailed computational approaches. Extensive use was made of generalized classical force fields with fixed charges and no explicit treatment of electronic polarizability, and methods using explicit solvent models employed chiefly the TIP3P water model [47]. However, a few methods focused less on conformational sampling and more on the quality of the energy calculations, through the use of various quantum methods. For the quantum methods to obtain configurational entropy, low-lying vibrational modes were treated by the free-rotor approximation, using the interpolation model implemented by Grimme [48]. The methods to derive affinities or relative affinities range from relatively established approaches, such as thermodynamic integration (TI) [49], Bennett acceptance ratio (BAR) [50], metadynamics [51], and MM/PBSA [52], to the more recently developed Movable Type method [53].

Table 2.

Summary of computational methods in all SAMPL5 host-guest submissions

Method Conformational sampling Energy model Solvent model SAMPL5 Refs.
OAH/OAMe
    APR-OPC (C, E) MD; docking GAFF/RESP OPC [41]
    APR-TIP3P (C, E) MD; docking GAFF/RESP TIP3P [41]
    BEDAM (C, I) MD OPLS-2005 AGBNP2 [54]
    DFT/TPSS-ca (Q, I) Manual DFT-D3 (TPSS functional) COSMO-RS [55]
    DFT/TPSS-na (Q, I) Manual DFT-D3 (TPSS functional) COSMO-RS [55]
    DLPNO-CCSD(T)a (Q, I) Manual DLPNO-CCSD(T) COSMO-RS [55]
    Metadynamics (C, E) Funnel metadynamics GAFF/RESP TIP3P [56]
    MMPBSA-GAFF (C, I) MD GAFF/RESP PBSA; TIP3P [56]
    MMPBSA-OPLSb (C, I) MD OPLS/RESP PBSA; TIP3P [56]
    MovTyp-1 (C, I) Mixed torsion/low mode KECSA 1 Implicit [40]
    MovTyp-2 (C, I) Mixed torsion/low mode KECSA 2 Implicit [40]
    TI/BARc (C, E) MD; docking CGenFF TIP3P [57]
    TI-psc (C, E) MD; docking CGenFF TIP3P [57]
    TI-rawc (C, E) MD; docking CGenFF TIP3P [57]
    HBARd (C, E) MD; docking CGenFF TIP3P [57]
    HBAR-psd (C, E) MD; docking CGenFF TIP3P [57]
    HBAR-ps1d (C, E) MD; docking CGenFF TIP3P [57]
    HBAR-ps2d (C, E) MD; docking CGenFF TIP3P [57]
    SOMD-1e (C, E) MD GAFF/RESP TIP3P [58]
    SOMD-2 (C, E) MD GAFF/RESP TIP3P [58]
    SOMD-3 (C, E) MD GAFF/RESP TIP3P [58]
    SOMD-4 (C, E) MD GAFF/RESP TIP3P [58]
CBClip
    BAR-ab-initiof (C, E) MD CGenFF TIP3P [59]
    BAR-dockf (C, E) MD; docking CGenFF TIP3P [59]
    TI-ab-initiof (C, E) MD CGenFF TIP3P [59]
    TI-BARf (C, E) MD; docking CGenFF TIP3P [59]
    TI-dockf (C, E) MD; docking CGenFF TIP3P [59]
    BEDAM (C, I) MD OPLS-2005 AGBNP2 [54]
    MovTyp-1 (C, I) Mixed torsion/low mode KECSA 1 Implicit [40]
    MovTyp-2 (C, I) Mixed torsion/low mode KECSA 2 Implicit [40]
    SOMD-1e (C, E) MD GAFF/RESP TIP3P [58]
    SOMD-2 (C, E) MD GAFF/RESP TIP3P [58]
    SOMD-3 (C, E) MD GAFF/RESP TIP3P [58]
    SOMD-4 (C, E) MD GAFF/RESP TIP3P [58]

APR attach-pull-release approach [60], OPC “optimal” 3-charge, 4-point rigid water model [61]; TIP3P transferable interaction potential three-point [47]; BEDAM binding energy distribution analysis method [62]; DLPNO-CCSD(T) domain based, local pair natural orbital-coupled-cluster single double and perturbative triple excitations [63]; DFT-D3 density functional theory with the latest dispersion corrections [64]; MovTyp Movable Type method [53]; SOMD single topology relative free energy calculations performed with Sire/OpenMM6.3 software [65, 66]; BAR Bennett acceptance ratio [50]; TI thermodynamic integration [49, 67]; GAFF generalized AMBER force field [68]; CGenFF CHARMM generalized force-field [69]; RESP restrained electrostatic potential [70]; OPLS-2005 optimized potentials for liquid simulations 2005 force field [71, 72]; KECSA knowledge-based and empirical combined scoring algorithm [73]; AGBNP2 analytical generalized born plus non-polar 2 [74]; COSMO-RS conductor-like screening model for real solvents [75]; MMPBSA molecular mechanics Poisson Boltzmann/solvent accessible surface area [52]. The classifications of the energy model: quantum (Q) or classical (C), as well as the solvent model: implicit (I) or explicit (E) are listed in parentheses following the name of each method

a

DLPNO-CCSD(T) and DFT/TPSS-n (n indicates neutralized) used neutralized hosts, yet fully charged guests. DFT/TPSS-c (c indicates charged) used both fully charged hosts and guests

b

The MMPBSA-OPLS approach was only used to generate predictions for the OAMe subset

c

TI was used to compute binding affinities in both TI-raw and TI-ps. Lowest values of the computed binding free energies were reported in TI-raw. In TI-ps, possible corrections were added through calculating the relative pKa/pKb of the ligands to known analogs

d

Binding free energies in HBAR submission were computed by Hamiltonian replica exchange method (HREM) combined with the BAR method. The protonation state correction was used in HBAR-ps, HBAR-ps1 and HBAR-ps2. Results from the neutralized-only systems were reported as HBAR-ps1 and those from systems at the experimental ionic strength were reported as HBAR-ps2. TI/BAR prediction only reported binding affinities for the OAH subset based on the averaged results computed by TI and HREM/BAR for each guest

e

All SOMD predictions were produced based on single topology relative free energy calculations combined with multistate Bennet acceptance ratio (MBAR) method, but with different protocols. SOMD-1: No corrections; SOMD-2: includes a correction term for long-range dispersion interactions; SOMD-3: same as SOMD-2, but a correction term for the use of the flat-bottom distance restraints was also applied to bring the decoupled guest to a standard concentration; SOMD-4: same as SOMD-3 but with an additional correction term for electrostatic energies

f

Predictions were generated by either TI (labeled with TI-) or HREM/BAR (label by BAR-). Also the starting structures were obtained by quantum calculations (labeled with “ab initio”) or docking (labeled with “dock”). TI/BAR reported the lowest binding affinity from either method

Error statistics

Error statistics for all 17 sets of absolute binding free energy predictions for the combined OAH/OAMe dataset are summarized in Table 3 (left-hand side) and Fig. 2. These absolute binding free energy predictions, in addition to three sets of relative binding free energy predictions: DFT/TPSS-n, DFT/TPSS-c, and DLPNO-CCSD(T), were then converted to offset binding free energies, using Eq 1, and the error statistics are presented in Table 3 (right-hand side) and Fig. 3. The offset free energy statistics for all methods for the separate OAH and OAMe sets are presented in Table 4. All 12 sets of CBClip predictions are absolute binding free energies, and error statistics for these are presented in Table 5 and Fig. 4. Scatter plots of original data and offset predictions versus experimental binding free energies for all methods and systems are provided in Figure S1 and S2 respectively.

Fig. 2.

Fig. 2

OAH/OAMe submissions ranked based on the original values of absolute error metrics (white circles), which were computed from reported binding affinities without resampling or considering any uncertainty sources. The violin plot describes the shape of the sampling distribution for each set of predictions when bootstrapping 100,000 samples with replacement, and the vertical bar represents the mean of the distribution. The computational uncertainties are absent in the Null1, MovTyp-1, and MoveTyp-2 predictions. Two null models are shown in red. The violin plot area, here and below, are normalized not to unity, but instead to give the same maximum thickness

Fig. 3.

Fig. 3

OAH/OAMe submissions ranked based on the original values of offset error metrics (white circles), which were computed from reported binding affinities without resampling or considering any uncertainty sources. The violin plot describes the shape of the sampling distribution for each set of predictions when bootstrapping 100,000 samples with replacement, and the vertical bar represents the mean of the distribution. The computational uncertainties are absent in Null1 model, MovTyp-1, MoveTyp-2 DFT/TPSS-n, DFT/TPSS-C and DLPNO-CCSD(T) predictions. Two null models are shown in red

Table 5.

Absolute error metrics of binding affinity predictions for the CBClip datasets

Method RMSE R2 m τ
Null1a 6.9 (6.9 ± 0.7)
Null2b 2.2 (2.4 ± 0.5) 0.2 (0.2 ± 0.2) 0.4 (0.4 ± 0.3) 0.4 (0.3 ± 0.2)
BAR-ab initio 4.0 (4.0 ± 0.8) 0.0 (0.1 ± 0.1) −0.1 (−0.1 ± 0.4) −0.2 (−0.1 ± 0.2)
BAR-dock 4.7 (4.9 ± 1.0) 0.1 (0.1 ± 0.1) −0.4 (−0.4 ± 0.6) −0.2 (−0.2 ± 0.2)
TI-ab initio 4.7 (4.8 ± 0.8) 0.2 (0.2 ± 0.2) −0.6 (−0.6 ± 0.5) −0.2 (−0.2 ± 0.3)
TI-dock 3.4 (3.6 ± 0.7) 0.0 (0.1 ± 0.1) −0.1 (−0.1 ± 0.5) −0.2 (−0.1 ± 0.2)
TI/BAR 4.0 (4.0 ± 0.9) 0.0 (0.1 ± 0.1) −0.1 (−0.1 ± 0.4) −0.2 (−0.1 ± 0.2)
BEDAM 4.8 (4.6 ± 1.3) 0.4 (0.5 ± 0.2) 1.7 (1.8 ± 0.8) 0.4 (0.4 ± 0.2)
MovTyp-1c 3.5 (3.5 ± 0.7) 0.0 (0.1 ± 0.1) 0.0 (0.0 ± 0.3) 0.0 (0.0 ± 0.3)
MovTyp-2c 4.2 (4.1 ± 0.8) 0.0 (0.1 ± 0.1) 0.0 (0.0 ± 0.5) 0.0 (0.0 ± 0.3)
SOMD-1 6.4 (6.4 ± 0.9) 0.8 (0.7 ± 0.2) 2.7 (2.7 ± 0.5) 0.7 (0.6 ± 0.1)
SOMD-2 6.3 (6.3 ± 0.9) 0.8 (0.7 ± 0.2) 2.7 (2.7 ± 0.5) 0.7 (0.6 ± 0.1)
SOMD-3 5.7 (5.7 ± 0.7) 0.8 (0.7 ± 0.2) 2.7 (2.7 ± 0.5) 0.7 (0.6 ± 0.2)
SOMD-4 18.4 (17.7 ± 5.3) 0.5 (0.5 ± 0.3) 5.6 (5.6 ± 2.4) 0.5 (0.5 ± 0.3)

The values before the parentheses are raw error metrics computed from the original data. The mean values and uncertainties inside the parentheses were obtained through resampling the original data using replacement. The absolute error metrics RMSE, R2, m and τ are provided

a

Null1 model with a constant value 0.0 and no computational uncertainties were assigned to guests

b

Null2 model based on the linear regression equation ΔG = −0.25 × number of heavy atoms −1.81 kcal/mol (based on SAMPL3 Host1 data); SEM was set to 1.0 kcal/mol

c

Computational uncertainties were not reported. Results were adversely affected by bond order issues in the provided SD files of free guests [40]

Fig. 4.

Fig. 4

CBClip submissions ranked based on the original values of absolute error metrics (white circles), which were computed from reported binding affinities without resampling or considering any uncertainty sources. The violin plot describes the shape of the sampling distribution for each set of predictions when bootstrapping 100,000 samples with replacement, and the vertical bar represents the mean of the distribution. Two null models are shown in red. The computational uncertainties are absent in Null1 model, MovTyp-1 and MoveTyp-2 predictions

Inspection of the absolute binding free energy results for OAH and OAMe (Fig. 2; Table 3) reveals that most prediction sets outperformed both Null models for this dataset, and that comparatively favorable results were provided by several explicit solvent free energy methods with fixed charge models and the GAFF parameters [68]. The attach-pull-release (APR) method [60] with either the TIP3P or the OPC water model, performed well, as did the SOMD-3 method (Fig. 5a, b), followed closely by the SOMD-1 and SOMD-2 methods. The APR method obtains the binding free energy in terms of the reversible work to pull the guest from the host along a physical pathway [60, 76], while the SOMD calculations use the double-decoupling approach [77]. The APR-TIP3P, APR-OPC and SOMD-3 methods all yielded R2 ≥ 0.8, linear regression slopes 1.3 < m < 1.4, and 1.6 ≤ RMSE ≤ 2.1 kcal/mol. The other two SOMD predictions, SOMD-1 and SOMD-2, which closely resemble SOMD-3 but use different correction protocols, provide similar correlations with experiment, but larger RMSE values, 3.6 kcal/mol. The Metadynamics method uses the funnel metadynamics approach [78] to obtain the binding free energy via the potential of mean force along a physical binding pathway, again using molecular dynamics with GAFF and TIP3P; this method also performed relatively well, with R2 of 0.7, slope near 1, and RMSE of 3.1 kcal/mol. It is not immediately clear why the Metadynamics and APR-TIP3P differ, as the force fields used appear to match, but it is worth noting that the Metadynamics calculations actually provided relative binding free energies, which were converted into absolute binding free energies for submission by referring to a known octa-acid guest result from SAMPL4. The accurate absolute binding free energies cannot be obtained by Metadynamics due to the special treatment of the unbound state as a “dry state”, in which all water molecules were restrained from entering the host cavity [56].

Fig. 5.

Fig. 5

Combined OAH/OAMe predictions with MSE offsets using a APR-TIP3P, b SOMD-3, and c DFT/TPSS-n method. CBClip predictions without MSE offset using d the Null2 model, e SOMD-3, and f BEDAM method. Purple dots OAH, red dots OAMe, cyan dots CBClip, solid black line of identity

The analysis of relative binding free energies (Fig. 3; Tables 2, 3) provides a similar overall picture, but allows the three sets of relative predictions—DLPNO-CCSD(T), DFT/TPSS-n and DFT/TPSS-c to be compared with the other predictions on an equal footing. The DFT/TPSS-n and DFT/TPSS-c predictions were generated with dispersion-corrected density functional theory calculations, in conjunction with the COSMO-RS continuum solvation model [75], while DLPNO-CCSD(T) approach used the DLPNO-CCSD(T) level of theory, again combined with COSMO-RS. Both DLOPNO-CCSD(T) and DFT/TPSS-n treated the host as neutral and the guest as fully charged, while the DFT/TPSS-c assumed charges appropriate to the experimental pH for both host and guest molecules. According to the participant, the correlations shown in DFT/TPSS-n (Ro2=0.5; Fig. 5c) and other two quantum submissions actually resulted from including the OAMe-G4 data point based on a faulty binding configuration. When the proper configuration was used in later calculations, no correlations were found with experimental data [54].

The offset error analysis also provides separate statistics for OAH and OAMe, and it is noteworthy that, despite their relatively low correlations for the OAH/OAMe combined set, the MovTyp-1 and MovTyp-2 methods yield good error statistics for the OAH subset (Table 4), with Ro2 of 0.8 and regression slopes near 1. However, the two Movable Type methods yield anti-correlations for the OAMe subset, and this degrades the overall performance of these methods for the combined OAH/OAMe set. Similar performance deterioration by including estimates from the OAMe subset was also observed for several other methods, including Metadynamics, TI-raw and TI-ps, and to some degree for MMPBSA-GAFF predictions. Interestingly, although the Null2 model showed a large RMSEo value of 3.1 kcal/mol and anti-correlation for the OAMe subset, it seems to be able to generate reasonable predictions for the OAH subset, with the RMSEo value of 1.7 kcal/mol, Ro2 value of 0.4 and mo value of 0.8. Null2 model resembles what was observed in about one third of the predictions for OAH and OAMe systems: a method that performed well on OAH systems could totally fail on OAMe systems. It is also worth noting that methods that showed much weaker correlation for the OAMe set also yielded larger RMSEo values for OAMe, suggesting that the narrower spread of experimental binding energies in the OAMe dataset, relative to OAH, cannot fully account for the weak correlations.

Fewer methods were applied to the CBClip set (Fig. 4; Table 5), and results are in general less favorable than those for OAH and OAMe. Indeed, the Null-2 model, which estimates affinity based on the number of guest heavy atoms, outperformed all methods in terms of RMSE and regression slope, and turned in a mid-range performance for the measures of correlation (Fig. 5d). The SOMD methods again provided high correlations with experiment, yet large regression slopes of 2.7 and RMSE values on the order of 6 kcal/mol (Table 5; Fig. 5e). The BEDAM method provided a balanced performance, with an R2 value of 0.4, a RMSE value of 4.8 kcal/mol, and a regression slope of 1.7 (Table 5; Fig. 5f). MovTyp-1 and MovTyp-2 submissions showed near-zero correlations. However, according to the participant, moderate correlations and lower RMSEo values were obtained when structures with corrected bond orders were used [40]. The remaining five sets of predictions generated by either TI or HREM/BAR approach, yielded either zero or negative correlations with experiment. One possible explanation for the worse performance of multiple methods for CBClip, versus the octa-acids, is that CBClip is acyclic and hence may be more flexible and slower to converge. However, this would presumably lead to greater scatter of the binding estimates and thus lower correlation, yet the SOMD method still showed good correlations for the CBClip set (R2 ~ 0.8). Instead, the large errors in this case seem to derive from the fact that the slopes (m) are as high as 2.7 for the CBClip cases. This would suggest some systematic error, such as finite-size effects or problems in the treatment of short-range electrostatics, since the four sulfonate groups are positioned where they can interact strongly with the guests.

Comparison with SAMPL3 and SAMPL4 host–guest challenges

Host–guest systems were first introduced to SAMPL for the SAMPL3 challenge, and all SAMPL hosts to date have been drawn from the cucurbituril and octa-acid families of hosts, a trend which reflects the continuing data contributions of Professors Lyle Isaacs and Bruce Gibb. Although some hosts are new chemical variants, others have recurred across challenges. Thus, the current OAH host is identical to the OA host in SAMPL4; and the present CBClip resembles the glycoluril-based molecular clip Host H1 in SAMPL3 and the glycoluril host CB7 in SAMPL4. The structures of H1 and CB7 are shown in Fig. 6. In addition, some SAMPL5 participants used closely related methods to generate predictions for prior rounds of SAMPL. One may thus begin to look for trends in computational performance over time.

Fig. 6.

Fig. 6

Structures of host H1 and cucurbit[7]uril (CB7) tested in prior SAMPL host–guest challenges. Silver carbon, Blue nitrogen, Red oxygen. Hydrogen atoms were omitted for clarity

Two methods, BEDAM and TI/BAR, applied to the present CBClip case, were also used to predict the binding affinities for the chemically related H1 in SAMPL3 [30, 79]. Both methods yielded larger RMSE values in the present study: 4.8 kcal/mol (Table 5) versus 2.5 kcal/mol in SAMPL3 for BEDAM, and 4.0 kcal/mol versus 2.6 kcal/mol for TI/BAR. However, the correlations were similar: R2 values between 0.4 and 0.5 for BEDAM, and R2 values near zero for TI/BAR in both SAMPL exercises.

Binding data for the octa-acid host OAH were also used in the SAMPL4 challenge [31], where this host was termed OA instead of OAH, and several identical or similar computational methods were applied to this host in both SAMPL challenges. Note that, since the error analysis in SAMPL4 was based on relative binding affinity predictions, we compared the SAMPL4 error metrics of OA with the offset error metrics of OAH in the current challenge. In particular, RMSE_o in SAMPL4 was obtained in a similar manner to RMSEo here by using offset binding affinity estimates. The BEDAM method yielded substantially more accurate predictions for this host in SAMPL4, with R2 of 0.9 then versus 0.04 now, and the offset RMSE 0.9 kcal/mol then and 4.8 kcal/mol now. It is important to note that, although the methods, energy models, solvent models and sampling techniques appear mostly the same between SAMPL3 and SAMPL5 for this approach, the more diverse guest set in SAMPL5 may pose a challenge to BEDAM's implicit solvent model. An in-depth discussion on the performance of BEDAM can be found in the SAMPL5 special issue [54].

It also seemed appropriate to compare the present DFT/TPSS-n predictions with RRHO-551 (SAMPL4 ID:551), which used DFT-D, an early version of dispersion-corrected DFT with COSMO-RS; and the DLPNO CCSD(T) predictions with RRHO-552 (SAMPL4 ID:552), which used LCCSD(T), a local coupled-cluster method with COSMO-RS [80]. Comparable performance was observed in both cases on going from SAMPL4 to SAMPL5 for the OAH set only. For the DFT methods, the prior offset RMSE, R2 and regression slopes were, respectively, 5.8 kcal/mol, 0.5 and 3.9, while the current values are 4.4 kcal/mol, 0.5, and 2.2. For the coupled-cluster methods, the prior offset RMSE, R2 and regression slopes were, respectively, 6.1 kcal/mol, 0.4 and 3.3, while the current values are 7.0 kcal/mol, 0.5, and 3.3. However, as mentioned above, the quantum submissions showed essentially zero correlation on the mixed OAH/OAMe set after the faulty configuration of OAMe-G4 was replaced with a more proper one [55]. Given this adjustment, the quantum methods performed worse in SAMPL5 compared with SAMPL4.

The Metadynamics approach yielded more accurate predictions in SAMPL5 than in SAMPL4 (ID:579), though it is important to note that, for this method, the hosts studied are largely distinct, and a different force field was used previously [81]. The SAMPL4 predictions with this method showed near-zero or anti-correlations for the CB7 host, whereas the SAMPL5 predictions showed moderate correlations in the OAH/OAMe combined set and fairly good agreement with experiments in the OAH subset.

Although two top-ranked SAMPL5 methods, SOMD and APR, were not tested in prior SAMPL challenges, it is of interest to compare each with one of the free energy perturbation (FEP) methods that also employed GAFF parameters, RESP charges and TIP3P water model in SAMPL4. APR-TIP3P and SOMD-1 were thus compared with FEP-526 (SAMPL4 ID: 526) [80] for the octa acid predictions. In spite of the increased chemical diversity of the SAMPL5 set of guests, all three methods performed equally well: the R2 values in all three methods are no less than 0.9; the offset RMSE measures ranged from 0.8 to 0.9 kcal/mol, and slopes ranged from 1.3 to 1.5. APR-TIP3P and SOMD-1 even showed slightly better performance for the OAH/OAMe combined datasets than FEP-526 for OAH alone.

Discussion

The SAMPL5 host–guest blinded prediction challenge has provided a fresh opportunity to rigorously test the reliability of computational tools for predicting binding affinities, and the fact that host–guest systems were also used in two prior rounds of SAMPL makes it possible to look for consistencies and trends over time. A full analysis of the varied prediction methods used is beyond the scope of this overview, and readers desiring greater detail are referred to the more focused articles provided by SAMPL5 participants. However, some general observations may be made.

Overall, the reliability of methods based on explicit solvent free energy simulations and of those based on electronic structure calculations appear to be fairly consistent across SAMPL challenges, with the simulation-based methods generally providing greater reliability. However, it should be emphasized that the number of observations is still modest, even across three SAMPL rounds and that each class of methods includes multiple variants with different levels of performance. Moreover, there is significant variation in performance across different host–guest series, even within SAMPL5. Thus, predictions for the octa-acid hosts tend to be more accurate than those for CBClip, and accuracy is somewhat greater for the OAH systems than for OAMe, although OAMe differs from OAH only by the addition of four methyl groups. Based on informal discussions at the D3R/SAMPL5 workshop, it appears that the methyl groups, which are disposed around the opening of the binding site, increased difficulties in sampling guest poses in the bound state.

Even the best performing simulation-based methods in SAMPL5 yield absolute RMSE values on the order of 2 kcal/mol and tend to overestimate binding affinities (Figure S1). Previous binding calculations have shown that extensive sampling and small statistical uncertainties of binding estimates can be feasibly achieved on host–guest systems nowadays, with the aid of high-performance computing capabilities [60, 82]. Given that adequate conformational sampling can be achieved for such moderatesized systems, and that the ionization states of these systems are relatively straightforward to ascertain at the experimental pH, the errors in predictions from carefully executed calculations presumably trace to limitations in the potential functions, or force fields, used in the simulations. It should be emphasized that, if current force fields yield errors of this magnitude on host–guest systems, one should not expect to achieve any better in blinded predictions of protein-small molecule binding free energies, even with greater simulation times and a correct treatment of protonation states. Although a recent report describes encouraging results for alchemical calculations of relative protein–ligand binding free energies [11], the statistics come from a retrospective analysis, rather than from blinded prediction along the lines of those described in the present paper. The present results thus underscore the need for improvements in force field parameters and perhaps functional forms.

Electronic structure methods, such as the DFT and coupled cluster methods discussed above, offer an alternative route to improved accuracy in the potential function, since they largely avoid the need for empirical force fields. However, such methods still are, arguably, restricted by the challenge of achieving adequate conformational sampling, due to the high computational cost of evaluating the energy for each conformation. In addition, their accuracy may be limited by the fact that it is difficult to couple them to an explicit solvent model. Although implicit solvent models have predictive power for molecular systems that are essentially convex in shape (see, e.g., SAMPL5 papers regarding the calculation of distribution coefficients for drug-like molecules), it is unknown whether they can capture the properties of water in confined spaces, such as the binding sites of host molecules or proteins, well enough to provide binding free energies with kcal/mol accuracy. It seems probable that continued improvements in computer power and algorithms will make quantum methods, perhaps hybridized with classical methods, increasingly competitive with classical free energy methods. More computationally efficient methods, such as BEDAM and Movable Type, also generated some encouraging results and are amenable to continued refinement, such as through the development of improved solvent models, and the incorporation of more accurate force fields as they become available.

In the current host–guest challenge, a number of groups submitted multiple predictions, and the results often provided clear signals as to the relative merits of the various approaches tested. Indeed, the simplicity of host–guest model systems makes it relatively easy to evaluate errors and isolate their sources, and the blinded nature of the SAMPL challenges eliminates the risk of even unintentionally adjusting one's method to agree with known data. Thus, submission of multiple predictions is encouraged for future rounds of SAMPL. It is also hoped that more groups will participate, so that an even wider range of methods may be tested; additional methods may also be evaluated by participants using software developed outside their own research groups, including commercial packages.

SAMPL is a community effort. It depends on the generosity of experimentalists who make their data available on a prepublication basis, which is not always convenient, and it requires courage on the part of the computational chemists, who are making truly blinded predictions in a public setting. It is indeed encouraging that so many groups contributed to and participated in the SAMPL5 host–guest challenge, and thus to the continuing improvement of the entire field.

Supplementary Material

suppl1

Acknowledgments

We thank Dr. Pär Söderhjelm for helpful discussion on the error analysis and Dr. Ulf Ryde for helpful comments on the manuscript. M.K.G. thanks the National Institutes of Health (NIH) for Grants GM061300 and U01GM111528 and the Air Force Office of Scientific Research (AFOSR) for Basic Research Initiative (BRI) Grant (FA9550-12-1-6440414). D.L.M appreciates financial support from the National institutes of Health (1R01GM108889-01) and the National Science Foundation (CHE 1352608). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the NIH, NSF or AFOSR. M.K.G. has an equity interest in, and is a cofounder and scientific advisor of VeraChem LLC.

Footnotes

Electronic supplementary material The online version of this article (doi:10.1007/s10822-016-9974-4) contains supplementary material, which is available to authorized users.

References

  • 1.Borhani DW, Shaw DE. The future of molecular dynamics simulations in drug discovery. J Comput Aided Mol Des. 2012;26:15–26. doi: 10.1007/s10822-011-9517-y. doi:10.1007/s10822-011-9517-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Martin E, Ertl P, Hunt P, Duca J, Lewis R. Gazing into the crystal ball; The future of computer-aided drug design. J Comput Aided Mol Des. 2012;26:77–79. doi: 10.1007/s10822-011-9487-0. doi:10.1007/s10822-011-9487-0. [DOI] [PubMed] [Google Scholar]
  • 3.Chen L, Morrow JK, Tran HT, Phatak SS, Du-Cuny L, Zhang S. From laptop to benchtop to bedside: structure-based drug design on protein targets. Curr Pharm Des. 2012;18:1217–1239. doi: 10.2174/138161212799436386. doi:10.2174/138920012799362837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kitchen DB, Decornez H, Furr JR, Bajorath J. Docking and scoring in virtual screening for drug discovery: methods and applications. Nat Rev Drug Discov. 2004;3:935–949. doi: 10.1038/nrd1549. doi:10.1038/nrd1549. [DOI] [PubMed] [Google Scholar]
  • 5.Klebe G. Virtual ligand screening: strategies, perspectives and limitations. Drug Discov Today. 2006;11:580–594. doi: 10.1016/j.drudis.2006.05.012. doi:10.1016/j.drudis.2006.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lavecchia A, Di Giovanni C. Virtual screening strategies in drug discovery: a critical review. Curr Med Chem. 2013;20:2839–2860. doi: 10.2174/09298673113209990001. doi:10.2174/09298673113209990001. [DOI] [PubMed] [Google Scholar]
  • 7.Mortier J, Rakers C, Bermudez M, Murgueitio MS, Riniker S. The impact of molecular dynamics on drug design: applications for the characterization of ligand-macromolecule complexes. Drug Discov Today. 2015;20:686–702. doi: 10.1016/j.drudis.2015.01.003. doi:10.1016/j.dru dis.2015.01.003. [DOI] [PubMed] [Google Scholar]
  • 8.Gilson MK, Zhou H-X. Calculation of protein-ligand binding affinities. Annu Rev Biophys Biomol Struct. 2007;36:21–42. doi: 10.1146/annurev.biophys.36.040306.132550. doi:10.1146/annurev.biophys.36.040306.132550. [DOI] [PubMed] [Google Scholar]
  • 9.Adcock SA, Mccammon JA. Molecular dynamics : survey of methods for simulating the activity of proteins. Proteins. 2006;106:1589–1615. doi: 10.1021/cr040426m. doi:10.1021/cr040426m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gumbart JC, Roux B, Chipot C. Standard binding free energies from computer simulations: what is the best strategy? J Chem Theory Comput. 2013;9:794–802. doi: 10.1021/ct3008099. doi:10.1021/ct3008099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wang L, Wu Y, Deng Y, Kim B, Pierce L, Krilov G, Lupyan D, Robinson S, Dahlgren MK, Greenwood J, Romero DL. Accurate and reliable prediction of relative ligand binding potency in prospective drug discovery by way of a modern free-energy calculation protocol and force field. J Am Chem Soc. 2015;137:2695–2703. doi: 10.1021/ja512751q. doi:10.1021/ja512751q. [DOI] [PubMed] [Google Scholar]
  • 12.Buch I, Giorgino T, De Fabritiis G. Complete reconstruction of an enzyme-inhibitor binding process by molecular dynamics simulations. Proc Natl Acad Sci USA. 2011;108:10184–10189. doi: 10.1073/pnas.1103547108. doi:10.1073/pnas.1103547108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chodera JD, Mobley DL, Shirts MR, Dixon RW, Branson K, Pande VS. Alchemical free energy methods for drug discovery: progress and challenges. Curr Opin Struct Biol. 2011;21:150–160. doi: 10.1016/j.sbi.2011.01.011. doi:10.1016/j.sbi.2011.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lu Y, Yang CY, Wang S. Binding free energy contributions of interfacial waters in HIV-1 protease/inhibitor complexes. J Am Chem Soc. 2006;128:11830–11839. doi: 10.1021/ja058042g. doi:10.1021/ja058042g. [DOI] [PubMed] [Google Scholar]
  • 15.Cram DJ, Cram JM. Host-guest chemistry. Science. 1974;183(80):803–809. doi: 10.1126/science.183.4127.803. [DOI] [PubMed] [Google Scholar]
  • 16.Freeman WA, Mock WL, Shih N-Y. Cucurbituril. J Am Chem Soc. 1981;103:7367–7368. doi:10.1021/ja00414a070. [Google Scholar]
  • 17.Lee JW, Samal S, Selvapalam N, Kim HJ, Kim K. Cucurbituril homologues and derivatives: new opportunities in supramolecular chemistry. Acc Chem Res. 2003;36:621–630. doi: 10.1021/ar020254k. doi:10.1021/ar020254k. [DOI] [PubMed] [Google Scholar]
  • 18.Jeon YJ, Kim S-Y, Ko YH, Sakamoto S, Yamaguchi K, Kim K. Novel molecular drug carrier: encapsulation of oxaliplatin in cucurbit[7]uril and its effects on stability and reactivity of the drug. Org Biomol Chem. 2005;3:2122–2125. doi: 10.1039/b504487a. doi:10.1039/b504487a. [DOI] [PubMed] [Google Scholar]
  • 19.Lagona J, Mukhopadhyay P, Chakrabarti S, Isaacs L. The cucurbit[n]uril family. Angew Chem Int Ed Engl. 2005;44:4844–4870. doi: 10.1002/anie.200460675. doi:10.1002/anie.200460675. [DOI] [PubMed] [Google Scholar]
  • 20.Rekharsky MV, Inoue Y. Complexation thermodynamics of cyclodextrins. Chem Rev. 1998;98:1875–1918. doi: 10.1021/cr970015o. doi:10.1021/cr970015o. [DOI] [PubMed] [Google Scholar]
  • 21.Zimmerman SC, Vanzyl CM. Rigid molecular tweezers: synthesis, characterization, and complexation chemistry of a diacridine. J Am Chem Soc. 1987;109:7894–7896. doi:10.1021/ja00259a055. [Google Scholar]
  • 22.Zimmerman SC. Rigid molecular tweezers as hosts for the complexation of neutral guests. Top Curr Chem. 1993;165:71–102. doi:10.1007/BFb0111281. [Google Scholar]
  • 23.Klärner FG, Kahlert B. Molecular tweezers and clips as synthetic receptors. Molecular recognition and dynamics in receptor-substrate complexes. Acc Chem Res. 2003;36:919–932. doi: 10.1021/ar0200448. doi:10.1021/ar0200448. [DOI] [PubMed] [Google Scholar]
  • 24.Sinha S, Lopes DHJ, Du Z, Pang ES, Shanmugam A, Lomakin A, Talbiersky P, Tennstaedt A, McDaniel K, Bakshi R, Kuo PY. Lysine-specific molecular tweezers are broad-spectrum inhibitors of assembly and toxicity of amyloid proteins. J Am Chem Soc. 2011;133:16958–16969. doi: 10.1021/ja206279b. doi:10.1021/ja206279b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Linton B, Hamilton AD. Host-guest chemistry: combinatorial receptors. Curr Opin Chem Biol. 1999;3:307–312. doi: 10.1016/S1367-5931(99)80047-7. [DOI] [PubMed] [Google Scholar]
  • 26.Baron R, McCammon JA. Molecular recognition and ligand association. Annu Rev Phys Chem. 2013;64:151–175. doi: 10.1146/annurev-physchem-040412-110047. doi:10.1146/annurev-physchem-040412-110047. [DOI] [PubMed] [Google Scholar]
  • 27.Yin J, Fenley AT, Henriksen NM, Gilson MK. Toward improved force-field accuracy through sensitivity analysis of host-guest binding thermodynamics. J Phys Chem B. 2015 doi: 10.1021/acs.jpcb.5b04262. doi:10.1021/acs.jpcb.5b04262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Wickstrom L, Deng N, He P, Mentes A, Nguyen C, Gilson MK, Kurtzman T, Gallicchio E, Levy RM. Parameterization of an effective potential for protein-ligand binding from host-guest affinity data. J Mol Recognit. 2016;29:10–21. doi: 10.1002/jmr.2489. doi:10.1002/jmr.2489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Geballe MT, Skillman AG, Nicholls A, Guthrie JP, Taylor PJ. The SAMPL2 blind prediction challenge: introduction and overview. J Comput Aided Mol Des. 2010;24:259–279. doi: 10.1007/s10822-010-9350-8. doi:10.1007/s10822-010-9350-8. [DOI] [PubMed] [Google Scholar]
  • 30.Muddana HS, Varnado CD, Bielawski CW, Urbach AR, Isaacs L, Geballe MT, Gilson MK. Blind prediction of host-guest binding affinities: a new SAMPL3 challenge. J Comput Aided Mol Des. 2012;26:475–487. doi: 10.1007/s10822-012-9554-1. doi:10.1007/s10822-012-9554-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Muddana HS, Fenley AT, Mobley DL, Gilson MK. The SAMPL4 host-guest blind prediction challenge: an overview. J Comput Aided Mol Des. 2014;28:305–317. doi: 10.1007/s10822-014-9735-1. doi:10.1007/s10822-014-9735-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mobley DL, Wymer KL, Lim NM, Guthrie JP. Blind prediction of solvation free energies from the SAMPL4 challenge. J Comput Aided Mol Des. 2014;28:135–150. doi: 10.1007/s10822-014-9718-2. doi:10.1007/s10822-014-9718-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Gibb CLD, Gibb BC. Binding of cyclic carboxylates to octa-acid deep-cavity cavitand. J Comput Aided Mol Des. 2014;28:319–325. doi: 10.1007/s10822-013-9690-2. doi:10.1007/s10822-013-9690-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gan H, Gibb BC. Guest-mediated switching of the assembly state of a water-soluble deep-cavity cavitand. Chem Commun. 2013;49:1395–1397. doi: 10.1039/c2cc38227j. doi:10.1039/c2cc38227j. [DOI] [PubMed] [Google Scholar]
  • 35.Jordan JH, Gibb BC. Molecular containers assembled through the hydrophobic effect. Chem Soc Rev. 2014;44:547–585. doi: 10.1039/c4cs00191e. doi:10.1039/c4cs00191e. [DOI] [PubMed] [Google Scholar]
  • 36.Zhang B, Isaacs L. Acyclic cucurbit[n]uril-type molecular containers: influence of aromatic walls on their function as solubilizing excipients for insoluble drugs. J Med Chem. 2014;57:9554–9563. doi: 10.1021/jm501276u. doi:10.1021/jm501276u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Sullivan MR, Sokkalingam P, Nguyen T, Donahue JP, Gibb BC. Binding of carboxylate and trimethylammonium salts to octa-acid and TEMOA deep-cavity cavitands. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9925-0. doi:10.1007/s10822-016-9925-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.She N, Moncelet D, Gilberg L, Lu X, Sindelar V, Briken V, Isaacs L. Glycoluril-derived molecular clips are potent and selective receptors for cationic dyes in water. Chem A Eur J. 2016;22:1–11. doi: 10.1002/chem.201601796. doi:10.1002/chem.201601796. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Molecular Operating Environment (MOE) 2013.08 . Chemical Computing Group Inc.; 1010 Sherbooke St. West, Suite #910, Montreal, QC, Canada: 2016. p. H3A 2R7. [Google Scholar]
  • 40.Bansal N, Zheng Z, Cerutti DS, Merz KM. On the fly estimation of host-guest binding free energies using the movable type method: participation in the SAMPL5 blind challenge. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9980-6. in press. [DOI] [PubMed] [Google Scholar]
  • 41.Yin J, Henriksen NM, Slochower DR, Gilson MK. The SAMPL5 host-guest challenge: computing binding free energies and enthalpies from explicit solvent simulations using attach-pull-release (APR) approach. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9970-8. doi:10.1007/s10822-016-9970-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Case DA, Babin V, Berryman JT, Betz RM, Cai Q, Cerutti DS, Cheatham DS, III, Darden TA, Duke TA, Gohlke H, Goetz AW, Gusarov S, Homeyer N, Janowski P, Kaus J, Kolossvary I, Kovalenko A, Lee TS, LeGrand S, Luchko T, Luo R, Madej B, Merz KM, Paesani F, Roe DR, Roitberg A, Sagui C, SalomonFerrer R, Seabra G, Simmerling CL, Smith W, Swails J, Walker RC, Wang J, Wolf RM, Wu X, Kollman PA. AMBER 14. University of California; San Francisco: 2014. [Google Scholar]
  • 43.Abraham MJ, Murtola T, Schulz R, Páll S, Smith JC, Hess B, Lindahl E. Gromacs: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1–2:19–25. doi:10.1016/j.softx.2015.06. 001. [Google Scholar]
  • 44.Desmond Molecular Dynamics System, D. E. Shaw Research . Maestro-Desmond Interoperability Tools. Schrödinger; New York, NY: New York, NY: 2016. p. 2016. [Google Scholar]
  • 45.Plimpton S. Fast parallel algorithms for short-range molecular dynamics. J Comput Phys. 1995;117:1–19. doi:10.1006/jcph.1995.1039. [Google Scholar]
  • 46.Shirts MR, Klein C, Swails JM, Yin J, Gilson MK, Mobley DL, Case DA, Zhong ED. Lessons learned from comparing molecular dynamics engines on the SAMPL5 dataset. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9977-1. doi:10.1007/s10822-016-9977-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79:926–935. doi:10.1063/1.445869. [Google Scholar]
  • 48.Grimme S. Supramolecular binding thermodynamics by dispersion-corrected density functional theory. Chem A Eur J. 2012;18:9955–9964. doi: 10.1002/chem.201200497. doi:10.1002/chem.201200497. [DOI] [PubMed] [Google Scholar]
  • 49.Kirkwood JG. Statistical mechanics of fluid mixtures. J Chem Phys. 1935;3:300–313. doi:10.1063/1.1749657. [Google Scholar]
  • 50.Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. J Comput Phys. 1976;22:245–268. doi:10.1016/0021-9991(76)90078-4. [Google Scholar]
  • 51.Laio A, Parrinello M. Escaping free-energy minima. Proc Natl Acad Sci USA. 2002;99:12562–12566. doi: 10.1073/pnas.202427399. doi:10.1073/pnas.202427399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Srinivasan J, Cheatham TE, Cieplak P, Kollman PA, Case DA. Continuum solvent studies of the stability of DNA, RNA, and phosphoramidate-DNA helices. J Am Chem Soc. 1998;120:9401–9409. doi:10.1021/ja981844? [Google Scholar]
  • 53.Zheng Z, Ucisik MN, Merz KM. The movable type method applied to protein-ligand binding. J Chem Theory Comput. 2013;9:5526–5538. doi: 10.1021/ct4005992. doi:10.1021/ct4005992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Pal RK, Haider K, Kaur D, Flynn W, Xia J, Levy RM, Taran T, Wickstrom L, Kurtzman T, Gallicchio E. A combined treatment of hydration and dynamical effects for the modeling of host-guest binding thermodynamics: the SAMPL5 blinded challenge. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9956-6. doi:10.1007/s10822-016-9956-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Caldararu O, Olsson MA, Riplinger C, Neese F, Ryde U. Binding free energies in the SAMPL5 octa-acid host-guest challenge calculated with DFT-D3 and CCSD(T). J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9957-5. doi:10.1007/s10822-016-9957-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Bhakat S, Söderhjelm P. Resolving the problem of trapped water in binding cavities: prediction of host-guest binding free energies in the SAMPL5 challenge by funnel metadynamics. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9948-6. doi:10.1007/s10822-016-9948-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Tofoleanu F, Lee J, Pickard FC, IV, König G, Huang J, Baek M, Seok C, Brooks BR. Absolute binding free energies for octa-acids and guests in SAMPL5. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9965-5. doi:10.1007/s10822-016-9965-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Bosisio S, Mey ASJS, Michel J. Blinded predictions of host-guest standard free energies of binding in the SAMPL5 challenge. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9933-0. doi:10.1007/s10822-016-9933-0. [DOI] [PubMed] [Google Scholar]
  • 59.Lee J, Tofoleanu F, Pickard FC, IV, König G, Huang J, Damjanović A, Baek M, Seok C, Brooks B. Absolute binding free energy calculations of CBClip host–guest systems in the SAMPL5 blind challenge. J Comput Aided Mol Des. 2016 doi: 10.1007/s10822-016-9968-2. doi:10.1007/s10822-016-9968-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Henriksen NM, Fenley AT, Gilson MK. Computational calorimetry: high-Precision calculation of host-guest binding thermodynamics. J Chem Theory Comput. 2015;11:4377–4394. doi: 10.1021/acs.jctc.5b00405. doi:10.1021/acs.jctc.5b00405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Izadi S, Anandakrishnan R, Onufriev AV. Building water models: a different approach. JPhysChemLett. 2014;5:3863–3871. doi: 10.1021/jz501780a. doi:10.1021/jz501780a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Gallicchio E, Lapelosa M, Levy RM. The binding energy distribution analysis method (BEDAM) for the estimation of protein-ligand binding affinities. J Chem Theory Comput. 2010;6:2961–2977. doi: 10.1021/ct1002913. doi:10.1021/ct1002913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Riplinger C, Sandhoefer B, Hansen A, Neese F. Natural triple excitations in local coupled cluster calculations with pair natural orbitals. J Chem Phys. 2013;139(13):134101. doi: 10.1063/1.4821834. [DOI] [PubMed] [Google Scholar]
  • 64.Grimme S, Antony J, Ehrlich S, Krieg H. A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu. J Chem Phys. 2010 doi: 10.1063/1.3382344. doi:10.1063/1.3382344. [DOI] [PubMed] [Google Scholar]
  • 65.Woods CJW, Mey A, Calabro G, Michel J. [28 July];Sire molecular simulations framework. 2016 http://siremol.org.
  • 66.Eastman P, Friedrichs MS, Chodera JD, Radmer RJ, Bruns CM, Ku JP, Beauchamp KA, Lane TJ, Wang LP, Shukla D, Tye T. OpenMM 4: a reusable, extensible, hardware independent library for high performance molecular simulation. J Chem Theory Comput. 2013;9:461–469. doi: 10.1021/ct300857j. doi:10.1021/ct300857j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Straatsma TP, McCammon JA. Multiconfiguration thermodynamic integration. J Chem Phys. 1991;95:1175. doi:10.1063/1. 461148. [Google Scholar]
  • 68.Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general Amber Force Field. J Comput Chem. 2004;25:1157–1174. doi: 10.1002/jcc.20035. doi:10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
  • 69.Vanommeslaeghe K, Hatcher E, Acharya C, Kundu S, Zhong S, Shim J, Darian E, Guvench O, Lopes P, Vorobyov I, Mackerell AD. CHARMM general force field: a force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. J Comput Chem. 2010;31:671–690. doi: 10.1002/jcc.21367. doi:10.1002/jcc.21367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Bayly CCI, Cieplak P, Cornell WD, Kollman PA. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: the RESP model. J Phys Chem. 1993;97:10269–10280. doi:10.1021/j100142a004. [Google Scholar]
  • 71.Kaminski GA, Friesner RA, Tirado-Rives J, Jorgensen WL. Evaluation and reparametrization of the OPLS-AA force field for proteins via comparison with accurate quantum chemical calculations on peptides. J Phys Chem B. 2001;105:6474–6487. doi:10.1021/jp003919d. [Google Scholar]
  • 72.Banks JL, Beard HS, Cao Y, Cho AE, Damm W, Farid R, Felts AK, Halgren TA, Mainz DT, Maple JR, Murphy R. Integrated modeling program, applied chemical theory (IMPACT). J Comput Chem. 2005;26:1752–1780. doi: 10.1002/jcc.20292. doi:10.1002/jcc.20292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Zheng Z, Wang T, Li P, Merz KM. KECSA-movable type implicit solvation model (KMTISM). J Chem Theory Comput. 2014;11:667–682. doi: 10.1021/ct5007828. doi:10.1021/ct5007828. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Gallicchio E, Paris K, Levy RM. The AGBNP2 implicit solvation model. J Chem Theory Comput. 2009;5:2544–2564. doi: 10.1021/ct900234u. doi:10.1021/ct900234u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Klamt A. Conductor-like screening model for real solvents: a new approach to the quantitative calculation of solvation phenomena. J Phys Chem. 1995;99:2224–2235. doi:10.1021/j100007a062. [Google Scholar]
  • 76.Velez-Vega C, Gilson MK. Overcoming dissipation in the calculation of standard binding free energies by ligand extraction. J Comput Chem. 2013;34:2360–2371. doi: 10.1002/jcc.23398. doi:10.1002/jcc.23398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Hamelberg D, McCammon JA. Standard free energy of releasing a localized water molecule from the binding pockets of proteins: double-decoupling method. J Am Chem Soc. 2004;126:7683–7689. doi: 10.1021/ja0377908. doi:10.1021/ja0377908. [DOI] [PubMed] [Google Scholar]
  • 78.Limongelli V, Bonomi M, Parrinello M. Funnel metadynamics as accurate binding free-energy method. Proc Natl Acad Sci. 2013;110:6358–6363. doi: 10.1073/pnas.1303186110. doi:10.1073/pnas.1303186110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Gallicchio E, Levy RM. Prediction of SAMPL3 host-guest affinities with the binding energy distribution analysis method (BEDAM). J Comput Aided Mol Des. 2012;26:505–516. doi: 10.1007/s10822-012-9552-3. doi:10.1007/s10822-012-9552-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Mikulskis P, Cioloboc D, Andrejić M, Khare S, Brorsson J, Genheden S, Mata RA, Söderhjelm P, Ryde U. Free-energy perturbation and quantum mechanical study of SAMPL4 octa-acid host-guest binding energies. J Comput Aided Mol Des. 2014;28:375–400. doi: 10.1007/s10822-014-9739-x. doi:10.1007/s10822-014-9739-x. [DOI] [PubMed] [Google Scholar]
  • 81.Hsiao YW, Söderhjelm P. Prediction of SAMPL4 host-guest binding affinities using funnel metadynamics. J Comput Aided Mol Des. 2014;28:443–454. doi: 10.1007/s10822-014-9724-4. doi:10.1007/s10822-014-9724-4. [DOI] [PubMed] [Google Scholar]
  • 82.Fenley AT, Henriksen NM, Muddana HS, Gilson MK. Bridging calorimetry and simulation through precise calculations of cucurbituril-guest binding enthalpies. J Chem Theory Comput. 2014;10:4069–4078. doi: 10.1021/ct5004109. doi:10.1021/ct5004109. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

suppl1

RESOURCES