Abstract
Protein flexibility plays a major role in biomolecular recognition. In many cases it is not obvious how molecular structure will change upon association with other molecules. In proteins these changes can be major, with large deviations in overall backbone structure, or they can be more subtle as in a side chain rotation. Either way the algorithms that predict the favorability of biomolecular association require relatively accurate predictions of the bound structure to give an accurate assessment of the energy involved in association. Here we review a number of techniques that have been proposed to accommodate receptor flexibility in the simulation of small molecules binding to protein receptors. We investigate modifications to standard rigid receptor docking algorithms, and also explore enhanced sampling techniques, and the combination of free energy calculations and enhanced sampling techniques. The understanding and allowance for receptor flexibility are helping to make computer simulations of ligand protein binding more accurate. These developments may help improve the efficiency of drug discovery and development. Efficiency will be essential as we begin to see personalized medicine tailored to individual patients, which means specific drugs are needed for each patient’s genetic makeup.
Keywords: Computer aided drug design, structure based drug design, receptor flexibility, ensemble docking, relaxed complex scheme, molecular dynamics, accelerated molecular dynamics, free energy calculation
Introduction
The advent of pharmacogenomics has opened up many new and exciting avenues to optimize drug treatment for a variety of diseases(1). While often used to correlate drug efficiency or toxicity with specific mutations, it has great potential in combination with structure based drug discovery as well. Particularly in the realm of computer aided drug design, it is very conceivable that studies on individual mutant receptors can yield “personalized drugs” tailored to target specific mutant protein forms only. These studies ideally will include full receptor and ligand flexibility to find the most optimally suited inhibitors. This review will focus on techniques that are used to account for receptor flexibility in computer aided drug discovery studies.
Introduction to receptor flexibility
Often it is of great interest to know the manner in which different molecules interact. Gibbs free energy can be related to the favorability of one state over another such as state A: a drug bound to its protein target, or state B: unbound. The free energy can be related to the concentration bound and unbound, and affinity can be quantitatively expressed in this way. A large number of methods have been proposed to calculate the free energy of binding in biomolecular recognition, because this property is of great interest to those that develop drugs for biological targets. If one only needs to know the structure of a compound to calculate its binding affinity to a target of interest, it would save a great effort in screening compounds and synthesizing them. In addition, the vast majority of the chemical space possible for drug like molecules has not been probed by chemists, but could in theory be more efficiently explored via computation if rapid and accurate methods are developed. In fact, in terms of drug-sized molecules it is estimated that only a very small percentage of the chemical space available has been synthesized(2). Of great relevance and application have been docking and scoring functions which provide rapid ranking of compounds, sometimes estimating binding energies, by generating and scoring poses, which are 3 dimensional orientations of ligands interacting with receptors. There are numerous examples of docking programs; the first widely used method was DOCK(3) and more recent versions are still in use today (4). Other commonly used programs are AutoDock(5) which is freely available, and GLIDE(6) distributed with the Schrödinger suite of tools(7). These methods have proved of great use(8) because of their speed and simplicity in setup, but suffer from numerous errors because they don’t account well for entropy, water effects, or protein structural changes upon ligand binding(9).
Often proteins adopt different conformations upon ligand binding than they do in their unbound form. Significant changes can be seen in the protein structure of some proteins, while minor changes like side chain rotations or subtle rearrangements are often important in terms of binding affinity as well. Some Cytochrome P450s (CYP) including CYP2B4 undergo alpha-helical rearrangements to allow different substrates to bind(10). The infamous CYP3A4 isoform adopts major changes in conformation during binding which is believed to be necessary for metabolism of many substrates, which vary greatly in size (1000 daltons variation), geometry, and composition. These changes have been verified by ligand co-crystallography(11-13). With all the degrees of freedom and flexibility inherent in protein structure, the changes in receptor conformation are very important in calculating the binding free energy. If the basic geometric shape of the binding site is not described correctly, then it will be quite difficult to predict compound binding correctly using methods like docking and scoring. Two theories have been proposed to describe receptor configurational change upon ligand interaction. The first is conformational selection(14, 15) in which all conformations are present in the unbound receptor, but the populations of each configuration change in the bound form, meaning that the average structure is different from bound to unbound. The second theory is induced fit(16), in which one configuration is forced into another configuration by the ligand-binding event. See figure 1 for a schematic of these two theories of biomolecular recognition and many methods to account for these conformational changes in protein-ligand interactions.
Figure 1.
Schematic representation of the possible pathways taken in ligand binding. The induced fit pathways and conformational selection pathways are shown via dashed and solid lines. Various methods for binding energy calculations, which account for receptor flexibility are listed in the grey box. Simulation methods for sampling receptor flexibility are listed in the purple box.
Docking and Scoring Functions which Account for Receptor Flexibility
Docking combined with a scoring function is a fast method for ranking compound binding or complementarity to a target of interest. First the poses are generated by performing a 3-dimensional conformational search of the binding pocket with a molecule of interest (docking), then poses are evaluated based on a scoring function (scoring). Docking functions may perform 3-dimensional searches, randomly, systematically, or by performing MD simulation of ligand conformations and then mapping to the 3-dimensional structure of the receptor. Many scoring functions have been developed and they may score using a force field-based, empirically-based, knowledge-based, or consensus-based algorithm. A fine review of docking and scoring functions is given by Kitchen et al.(9). Often only a single rigid receptor conformation is used to dock and score each pose for computational efficiency, however there are ways in which docking programs try to account for receptor flexibility. Soft docking, or the softening of van der Waals potentials, can allow for small overlaps between the ligand and receptor without large steric penalties(17, 18). However, this may increase the rate of false positives because more diverse structures are allowed to bind. It also does not allow for larger conformational changes like side chain rotations or protein backbone motions. Allowing for certain side chains near the active site to rotate, can also account partially for receptor flexibility(19).
Induced Fit Docking
Additionally, one can use induced fit docking where the receptor is modeled with flexibility to accommodate the induced fit associated with ligand binding. One example of induced fit docking is the procedure outlined by Sherman et al. and implemented in the popular Schrödinger suite of tools as an option in the GLIDE docking program(20, 21). Here residue side chains are changed to alanine residues to prevent steric clashes in the initial docking. Then side chain predictions are used to generate possible conformations and the binding site and ligand are energy minimized. This protocol allows for local movements around the protein upon ligand binding, and significant reductions of the root mean square deviation (RMSD) of some flexible protein binding poses as compared to crystal structures(20). Nonetheless, it remains a challenge to predict larger scale motions that may lead to different binding conformations.
Rosetta Ligand
An alternate method that accounts for receptor flexibility is RosettaLigand. RosettaLigand is based on the popular modeling software Rosetta(22). In contrast to the aforementioned methods, RosettaLigand, does not split the receptor and ligand ensemble generation into separate steps that are combined during docking. Rather it allows for both receptor and ligand structural changes during the docking stage. Also in contrast to the previously mentioned methods, Rosetta uses knowledge-based scoring functions to assess protein conformations while the protein-ligand interactions are modeled employing first-principles. In its original implementation RosettaLigand allowed for ligand flexibility as well as receptor side chain flexibility(23). In a subsequent release the developers went beyond just accounting for side chain flexibility in the active site to incorporating full protein backbone and side chain flexibility(24). An extensive blind docking benchmark revealed that the performance of RosettaLigand is comparable to that of other commercial software (25). In its latest release, RosettaLigand can dock multiple ligands simultaneously, allowing for the redesign of the binding interface during docking, and it has a more user-friendly xml script interface (26).
Ensemble Based Screening Methods
Ensemble based screening methods rely on using varied receptor conformations in a docking protocol. The conformations can be determined from crystallographic or NMR structures, Monte Carlo simulation, molecular dynamics simulation, or enhanced sampling methods. See figure 2 for a schematic of ensemble based docking methods. The relaxed complex scheme (RCS)(27, 28) is a type of ensemble docking, which relies on previously determined conformations from molecular dynamics simulation to perform docking studies against. This ensemble of structures is then used in conjunction with a docking scoring function rather than a single structure, with the idea that an ensemble of low energy structures will bind a larger variety of compounds, and thus more hits will be obtained from a compound library(27-29). While one could just extract structures at equidistant time points, it seems more logical to perform a clustering type analysis, and use a single representative of related structures(29) to cover a large conformational space without redundant structures. In the final scoring of compounds, it may be useful to weight the score from each structure based on how often that structure was present in the original simulation, or one can simply take the highest score given. The problem remains though that perhaps very rare structures will be the most important in compound binding. The protein structure can be simulated in the apo state with the hope that without any compounds bound more states will be sampled since the structure will not be biased by ligand-protein interactions. However, in some structures the binding pocket may adopt a structure that does not bind ligands or occlude itself completely without a ligand bound. In this case, it is best to simulate with a ligand bound, however the structures will be biased towards the ligand bound state and may fail to explore a larger variety of potential ligand binding structures.
Figure 2.
Schematic representation of ensemble based docking. Examples of methods or data used at each step are given inside the dashed grey lined squares.
This method has been successfully used in a number of studies to find compounds for various targets. McCammon and colleagues described a novel binding trench in HIV integrase using RCS methods and docking, which helped inspire the discovery of the FDA approved drug raltegravir(30). Other diketo acid HIV integrase inhibitors have been modeled and designed using RCS(31). In addition, inhibitors of essential enzymes for Trypanosoma brucei have been found using the RCS method (32). The selection of appropriate configurations based on compound class and active site volume have been described for the proposed antibacterial target undecaprenyl pyrophosphate synthase(33). New compounds have been described which inhibit neuraminidase in avian influenza(34). Although molecular dynamics (MD) structures can enrich active compound predictions, some structures may perform worse than X-ray structures and a broadly applicable protocol for a priori determining the most predictive structures from the simulations has not been determined(35). Indeed, ensemble based screening can generate more hits with more chemical diversity than a single structure, but if many crystal structures are available with varied binding site geometries the enrichment may be better for ensembles of crystal structures than for ensembles of simulation structures(36). This leads us to the conclusion that ensemble based screening is superior to a single structure screening, but simulation based ensemble screening may be more applicable when extensive crystallographic data is not available. Generally the configurational changes observed with these RCS methods and induced fit methods are smaller changes. The short timescale upon which most molecular simulations are run does not fully map out bio-molecular phase space. Greater sampling of biomolecular phase space as well as a broadly applicable method for determining the most predictive structures a priori would both be big steps forward in ensemble based docking.
Enhanced Sampling: Hardware improvements
Over the course of MD simulations if there is a high energy barrier between two low energy states it is unlikely that the simulation will cross this barrier and describe the second low energy state, unless the simulation is very long. In addition, the free energy surface of proteins is vast and rugged and exploration is slow. The timescales often simulated with MD using today’s computers are nanoseconds to microseconds and sometimes even milliseconds(37). However, many interesting events occur on the timescales of milliseconds to seconds. In addition, if one wishes to recover a Boltzmann ensemble of structures then the crossing of energy barriers must be performed many times to generate converged statistics. It is of interest then to speed the crossing of high energy barriers and numerous methods have been proposed to perform this.
Conceptually the simplest improvements come from increased speed and efficiency of computer hardware and software. Two recent hardware advances have been graphics processing unit (GPU) accelerated computing and the construction of a special purpose machine Anton for running long MD simulations. GPU computing has shown impressive benchmarks for MD(38). The common test case DHFR run with AMBER11 was benchmarked at 30.79 ns/day on 48 Intel X5670 2.93 GHz processors, while a single NVIDIA GTX580 GPU ran 40.74 ns/day, and the GPU performance could be improved by running in parallel(39). Also GPUs may provide a more reasonably priced solution compared to traditional CPU based high performance computing (HPC). Popular simulation packages like NAMD(40), and AMBER(39), GROMACS, and ACEMD(38) have released code which runs on GPUs. A fine review of GPU-computing in the context of molecular modeling is given by Stone et al.(41). In addition, special purpose machines like Anton have been built for running fast MD simulations and extending brute force MD simulation into the millisecond timescale(37). These extremely long simulations have been used to show drug binding events to their protein target(42). They have also been used to show in full atomic resolution how small proteins fold, and that in many cases a very accurate representation of the folded state can be obtained from MD simulation(43). The accuracy of force fields on these millisecond timescales has never truly been tested before and these new hardware developments have been an exciting development in molecular modeling. Protein flexibility on longer timescales is now being accessed much more routinely. In addition to the hardware changes, simulation techniques that help improve sampling have been developed and continue to show promise and efficiency over brute force descriptions of conformational flexibility.
Enhanced Sampling: Methodologies
One can speed sampling in a number of ways by introducing artificial biases into the model upon which the simulation is based. The simplest of artificial biases is raising the temperature, which causes more rapid fluctuations in structure by increasing the average velocity of all atoms. Temperature accelerated replica exchange uses many replicas at varying temperatures to increase sampling, and these replicas exchange with each other based on a Metropolis criterion, finally recovering the canonical ensemble (44). In addition, one may use a Hamiltonian modifying technique like accelerated molecular dynamics (aMD) in a replica exchange framework to modify only the potential energy surface at a given temperature (45). Other methods like umbrella sampling(46-48), and metadynamics(49, 50) have also been used to enhance sampling. However many of these methods suffer from the need to define a reaction coordinate a priori to simulating the system. This is not advantageous if one is looking for new configurations to bind a drug-like compound to a protein active site without knowledge of what that configuration is, but can be quite useful in determining the free energy change between known conformations.
Umbrella sampling
The calculation of free energy differences from molecular dynamics simulations requires enhanced sampling techniques to probe the energy landscape effectively. Free energy difference calculations are facilitated by the use of a reaction coordinate, a parameter that measures the degree to which the system is near each of the two or more thermodynamic states of interest. One major challenge is the finite simulation time that results in regions close to energy minima being sampled well, whereas regions of higher energy are rarely or never sampled(51). Sufficient sampling of the entire space along the reaction coordinate is necessary to derive the free energy difference. Umbrella sampling is one such technique that enforces adequate sampling of high energy regions. In umbrella sampling (or biased MD) (46-48), the underlying energy potential is modified in order to allow for an easier transition over the energy barrier. For this an additional energy term, often referred to as bias or bias potential, is applied to the system to ensure efficient sampling along the entire reaction coordinate (51). This is done in separate simulations or windows, which overlap. Bias potentials should be chosen to ensure that sampling along the reaction coordinate is as uniform as possible. Umbrella sampling frequently employs harmonic biasing potentials in a series of windows along the reaction coordinate. An alternative to this is the use of a single window in combination with an adaptive biasing potential aimed at matching the free energy profile along the reaction coordinate as well as possible. This method is referred to as adaptive bias umbrella sampling. The sampling in each individual window can be performed using conventional molecular dynamics (cMD) or employing enhanced sampling techniques such as Hamiltonian replica exchange (44, 52). Careful choice of the reaction coordinate is crucial for correct umbrella sampling results(53). The free energy curves are combined using techniques such as the weighted histogram analysis method (WHAM, (54, 55)).
Metadynamics
Metadynamics is another technique designed to accelerate rare events and reconstruct the free energy profile(50). Metadynamics (49) relies on the introduction of a set of collective variables (CVs) which best describe the process of interest. Within the space of these CVs, a history-dependent potential is build up by dropping Gaussians along the sampled trajectory effectively discouraging the system from revisiting configurations that have already been sampled(56). The overall sum of these Gaussians is then used to compute a free energy profile along the CVs. For an excellent example of a metadynamics simulation see Fig. 1 in (50). The main difference to umbrella sampling is the non-systematic sampling along the collective variable(s). Important user-defined parameters are the height, width and dropping-frequency of Gaussians. Choosing the right set of CVs is crucial to the success of the simulation and criteria for picking good CVs have been outlined in the literature (50, 56). An instructive example of what happens if the correct CVs are not chosen is given in (57). Unfortunately owing to the complexity of large biological systems, determining a set of appropriate collective variables is challenging. However, this has not deterred a large number of applications using the method. Most notably to mention for this review is work that used metadynamics to characterize small molecule – protein interactions accounting for full receptor flexibility (58-62).
Accelerated Molecular Dynamics
Accelerated molecular dynamics (63), an extension of hyperdynamics(64), is a simulation method that does not require the selection of a reaction coordinate or CVs and is ideally suited for efficiently exploring configurational space. Accelerated MD modifies the potential energy surface based on the difference between a user defined reference energy E and the normal potential energy of the underlying MD force field at each point. Accelerated MD speeds sampling by decreasing the size of energy barriers and smoothing the energetic ruggedness of phase space exploration according to a parameter α. See figure 3 for the equations used to modify the potential energy surface and a hypothetical 2-dimensional representation of the effect of aMD on a potential energy landscape. Accelerated MD has been used to simulate small benchmark systems like alanine dipeptide and reproduce the Ramachandran plot from MD(63). Originally aMD was only applied to torsional angles(63) but was subsequently extended to all force field terms including explicit solvent(65). These two forms were combined in the dual-boost approach(66). Accelerated MD has been used for complex systems and validated with experimental results such as the improved prediction of experimental NMR observables in large protein systems (67). Grant et al. suggested that conformational selection is the dominant mechanism for nucleotide binding dependent conformational changes in G-proteins, while induced fit plays a smaller role based on data from aMD simulations(68).
Figure 3.
Accelerated Molecular Dynamics. Equations used to calculate the boost energy and modified potential energy surface in aMD(63). A two dimensional representation of the modified potential, V*(r) (dashed lines), and the unmodified potential energy surface, V (thick black line). α was varied as indicated E was always fixed at 60 and is indicated by a thin solid black line.
It is however important to note that recovery of Boltzmann statistics in large simulations remains a challenge because of sampling limitations in simulating large biomolecules. A non-Boltzmann simulation like aMD requires that each configuration be reweighted in a way that recovers the canonical ensemble(63). Often when longer timescales are accessed with aMD the reweighting procedure is subject to statistical error in the estimate of the weighting factor for each point. An excellent review of the statistical issues with reweighting is given by Shen and Hamelberg(69). However, since the underlying shape of the energy surface is preserved, although flattened, one can consider the most common configurations from aMD to be likely structures although the calculation of the exact ratios of their populations is still challenging. In simulations with the goal of conformational exploration where the exact free energy of the conformation is not desired, simulations are often described without reweighting, with the risk that a high energy state may have been over-represented. Recently, Wereszczynski et al. explored the conformational space of GET3 (a protein involved in the guided entry of tail anchored proteins into the membrane) with aMD and then performed rigorous potential of mean force calculations to determine the energetics of conformational change in the presence of various bound nucleotides. GET3 simulation results agreed with experiments and were able to capture the conformational biases between open, semi-open, and closed states associated with nucleotide binding. Additionally simulations with aMD showed much greater conformational exploration than classical MD simulations and helped suggest that the apo GET3 may explore a semi-open state when free in soloution (70). Accelerated MD was also recently used to study the dynamics of the important cardiomyocyte calcium binding protein troponin C (71). Indeed, enhanced sampling methods can play a critical role in the generation of structural ensembles with larger conformational changes and these methods are also seeing increased use in combination with rigorous free energy calculations.
Accelerated MD in free energy calculations
Free energy calculations based on computer simulations have been pursued for a few decades now, although they are not as ubiquitous in pharmaceutical settings as quicker methods of structure based compound binding prediction like docking combined with scoring functions. Part of the problem is that they are not well automated like many docking and scoring algorithms. Another major issue is that the computational power required for these calculations is great and a large investment in computing resources is needed before one can predict the affinity of a single ligand let alone a large compound library. We now have the computational power to make predictions of free energy, for pharmaceutically relevant ligand-receptor interactions, using methods like free energy perturbations (FEP) and thermodynamic integration (TI). These methods rely on defining a thermodynamic cycle where alchemical transitions can be used to change between states and then one can calculate a free energy difference between two states. Often the states will be the bound and unbound form of the ligand, but variations allow calculation of other properties such as the free energy of solvation, or the difference in free energy of binding between two ligands(72). One of the reasons behind the slow speed of these calculations is that the alchemical change must be made so that there is appropriate phase space overlap between the successive states along the reaction coordinate between the different end states. In thermodynamic integration calculations similar convergence challenges arise. This requires considerable sampling of any conformational changes around the modified portion of the system, as well as accurate convergence of the energy of those conformations, which can lead to large amounts of simulation. Since free energy methods rely on sampling, each simulation will be different and one can generate a simple estimate of error by running simulations in replicate, further increasing the need for computation.
Because rapid generation of low energy configurations is necessary for accurate free energy estimations, it is logical that enhanced sampling methods may be beneficially be applied to these calculations. Accelerated MD has been applied in a number of ways to free energy calculations. Fajer et al. used a replica exchange framework where the difference between replicas was the level of acceleration applied via varied boost parameters(45). Since the ground state replica was run on an unaltered potential energy surface there were no issues with reweighting it as long as the replicas were spaced closely enough that they would exchange rapidly. There are however many ways to incorporate data from replicas at different levels of acceleration and multistate Bennett acceptance ratio (MBAR) was determined to be roughly 4 times more efficient at recovering data than the ground state alone(73).
Oliveira created upside down aMD to overcome some of the issues of reweighting and calculate free energies with a single replica. The simple test system of a butane to butane symmetric transformation was used to show that one can increase the accuracy and speed of convergence by enhancing sampling with aMD in free energy calculations(74). This method allows the simulation to populate statistics by running at low to no acceleration in low energy regions, and then jumping energy barriers to rapidly move between different configurations. While quite promising in simple systems, this method proved challenging to parameterize so that high energy barriers were not flattened more than low energy barriers in simulations with many degrees of freedom. To ameliorate this issue, we proposed a boost limiting factor and created windowed aMD. This method efficiently reproduced the free energy surface of alanine dipeptide. In addition the new boost equation was used to calculate the free energy difference between the antibiotic vancomycin and two of its glycopeptide binding partners using TI. The overall results demonstrated more rapid convergance of free energy calculations and easily reweighted statistics, however more parameters were required (75).
Since the reweighting statistical error is directly related to the amount of acceleration and the complexity of the system, it seems reasonable to limit the portion of the simulation to which boost is applied. Acceleration of just the dihedral angles was the original way aMD was applied and in biomolecular systems these dihedral terms provide much of the restraint in conformational exploration(63). This also provided a convenient limitation of the energetic terms to which acceleration was applied since water molecules, which make up the majority of the atoms in explicit solvent simulations, have no dihedral angles. Selectively applied aMD took this idea one step further and limited acceleration only to predefined dihedral angles in alanine dipeptide, and in a free energy simulation of decoupling oseltamivir’s binding to neuraminidase. This work demonstrated that reweighting a small subset of dihedral angles helped overcome reweighting issues and that the free energy results converged to the same answer as cMD simulations, but in less time(76). Although this provides for better statistical recovery, it requires the prediction of which dihedral angles are important in ligand recognition or protein flexibility, which may be non-trivial.
Concluding remarks
There has been tremendous progress in the field of personalized sequencing of genetic code within the past 20 years. The “$100 genome” is within reach in our generation. The possibility of personalized genetic knowledge of specific diseases has unprecedented potential for targeted drug treatment. This exciting development goes hand in hand with advances in computer aided drug design that can be used to discover novel leads targeting specific mutant receptors. Here we presented recent developments in computational algorithms and hardware used in drug discovery with a focus on using enhanced protein dynamics sampling techniques to aid in the incorporation of full receptor flexibility in structure based drug discovery. In the near future, it should be possible to include the effects of genetic variation in models of drug targets, and speed the choice of therapies appropriate for individual patients. The effects of genetic variation not only lead to sequence changes, but structural and dynamical changes too. Thus we anticipate that computer-aided drug discovery which accommodates receptor flexibility will be an important component of pharmacogenomics in the near future opening many new and exciting opportunities for combining the two techniques.
Acknowledgements
This work was supported by the Molecular Biophysics Training Grant GM08326 (WS), the National Science Foundation Grant MCB-1020765, NBCR, CTBP, Howard Hughes Medical Institute, and National Institutes of Health Grant GM31749 (JAM).
Abbreviations
- (CYP)
cytochrome P450s
- (RMSD)
root mean square deviation
- (MD)
molecular dynamics
- (MC)
Monte Carlo
- (IFD)
induced fit docking
- (RCS)
relaxed complex scheme
- (GPU)
graphics processing unit
- (HPC)
high performance computing
- (CV)
collective variable
- (aMD)
accelerated molecular dynamics
- (GET)
guided entry of tail anchored proteins
- (FEP)
free energy perturbation
- (TI)
thermodynamic integration
- (MBAR)
multistate bennett acceptance ratio
- (MM/GBSA)
molecular mechanics/ generalized born surface area
Footnotes
The authors declare no conflicts of interests.
References
- 1.Wang L. Pharmacogenomics: a systems approach. Wiley Interdiscip Rev Syst Biol Med. 2010;2:3–22. doi: 10.1002/wsbm.42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.New horizons in chemical space. Nat Rev Drug Discov. 2004;3:375. doi: 10.1038/nrd1401. [DOI] [PubMed] [Google Scholar]
- 3.Kuntz ID, Blaney JM, Oatley SJ, Langridge R, Ferrin TE. A geometric approach to macromolecule-ligand interactions. J Mol Biol. 1982;161:269–288. doi: 10.1016/0022-2836(82)90153-x. [DOI] [PubMed] [Google Scholar]
- 4.Lang PT, Brozell SR, Mukherjee S, Pettersen EF, Meng EC, Thomas V, Rizzo RC, Case DA, James TL, Kuntz ID. DOCK 6: combining techniques to model RNA-small molecule complexes. RNA. 2009;15:1219–1230. doi: 10.1261/rna.1563609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Morris GM, Huey R, Lindstrom W, Sanner MF, Belew RK, Goodsell DS, Olson AJ. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J Comput Chem. 2009;30:2785–2791. doi: 10.1002/jcc.21256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, Repasky MP, Knoll EH, Shelley M, Perry JK, Shaw DE, Francis P, Shenkin PS. Glide: A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy. J. Med. Chem. 2004;47:1739–1749. doi: 10.1021/jm0306430. [DOI] [PubMed] [Google Scholar]
- 7.GLIDE 4.5. Schrödinger; New York, NY: [Google Scholar]
- 8.McInnes C. Virtual screening strategies in drug discovery. Curr Opin Chem Biol. 2007;11:494–502. doi: 10.1016/j.cbpa.2007.08.033. [DOI] [PubMed] [Google Scholar]
- 9.Kitchen DB, Decornez H, Furr JR, Bajorath J. Docking and scoring in virtual screening for drug discovery: methods and applications. Nat Rev Drug Discov. 2004;3:935–949. doi: 10.1038/nrd1549. [DOI] [PubMed] [Google Scholar]
- 10.Gay SC, Sun L, Maekawa K, Halpert JR, Stout CD. Crystal structures of cytochrome P450 2B4 in complex with the inhibitor 1-biphenyl-4-methyl-1H-imidazole: ligand-induced structural response through alpha-helical repositioning. Biochemistry. 2009;48:4762–4771. doi: 10.1021/bi9003765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Yano JK, Wester MR, Schoch GA, Griffin KJ, Stout CD, Johnson EF. The structure of human microsomal cytochrome P450 3A4 determined by X-ray crystallography to 2.05-A resolution. J. Biol. Chem. 2004;279:38091–38094. doi: 10.1074/jbc.C400293200. [DOI] [PubMed] [Google Scholar]
- 12.Ekroos M, Sjögren T. Structural basis for ligand promiscuity in cytochrome P450 3A4. P Natl Acad Sci. 2006;103:13682–13687. doi: 10.1073/pnas.0603236103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sevrioukova IF, Poulos TL. Structure and mechanism of the complex between cytochrome P4503A4 and ritonavir. P Natl Acad Sci. 2010;107:18422–18427. doi: 10.1073/pnas.1010693107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Monod J, Wyman J, Changeux JP. On the nature of allosteric transitions: a plausible model. J Mol Biol. 1965;12:88–118. doi: 10.1016/s0022-2836(65)80285-6. [DOI] [PubMed] [Google Scholar]
- 15.Ma B, Kumar S, Tsai CJ, Nussinov R. Folding funnels and binding mechanisms. Protein Eng. 1999;12:713–720. doi: 10.1093/protein/12.9.713. [DOI] [PubMed] [Google Scholar]
- 16.Koshland DE. Application of a Theory of Enzyme Specificity to Protein Synthesis. P Natl Acad Sci. 1958;44:98–104. doi: 10.1073/pnas.44.2.98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Jiang F, Kim SH. Soft docking”: matching of molecular surface cubes. J Mol Biol. 1991;219:79–102. doi: 10.1016/0022-2836(91)90859-5. [DOI] [PubMed] [Google Scholar]
- 18.Ferrari AM, Wei BQ, Costantino L, Shoichet BK. Soft docking and multiple receptor conformations in virtual screening. J. Med. Chem. 2004;47:5076–5084. doi: 10.1021/jm049756p. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Leach AR. Ligand docking to proteins with discrete side-chain flexibility. J Mol Biol. 1994;235:345–356. doi: 10.1016/s0022-2836(05)80038-5. [DOI] [PubMed] [Google Scholar]
- 20.Sherman W, Day T, Jacobson MP, Friesner RA, Farid R. Novel procedure for modeling ligand/receptor induced fit effects. J. Med. Chem. 2006;49:534–553. doi: 10.1021/jm050540c. [DOI] [PubMed] [Google Scholar]
- 21.Sherman W, Beard HS, Farid R. Use of an induced fit receptor structure in virtual screening. Chem Biol Drug Des. 2006;67:83–84. doi: 10.1111/j.1747-0285.2005.00327.x. [DOI] [PubMed] [Google Scholar]
- 22.Simons KT, Kooperberg C, Huang E, Baker D. Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J Mol Biol. 1997;268:209–225. doi: 10.1006/jmbi.1997.0959. [DOI] [PubMed] [Google Scholar]
- 23.Meiler J, Baker D. ROSETTALIGAND: protein-small molecule docking with full side-chain flexibility. Proteins. 2006;65:538–548. doi: 10.1002/prot.21086. [DOI] [PubMed] [Google Scholar]
- 24.Davis IW, Baker D. RosettaLigand docking with full ligand and receptor flexibility. J Mol Biol. 2009;385:381–392. doi: 10.1016/j.jmb.2008.11.010. [DOI] [PubMed] [Google Scholar]
- 25.Davis IW, Raha K, Head MS, Baker D. Blind docking of pharmaceutically relevant compounds using RosettaLigand. Protein Sci. 2009;18:1998–2002. doi: 10.1002/pro.192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lemmon G, Meiler J. Rosetta Ligand docking with flexible XML protocols. Methods Mol. Biol. 2012;819:143–155. doi: 10.1007/978-1-61779-465-0_10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Lin J-H, Perryman AL, Schames JR, McCammon JA. Computational drug design accommodating receptor flexibility: the relaxed complex scheme. J Am Chem Soc. 2002;124:5632–5633. doi: 10.1021/ja0260162. [DOI] [PubMed] [Google Scholar]
- 28.Lin J-H, Perryman AL, Schames JR, McCammon JA. The relaxed complex method: Accommodating receptor flexibility for drug design with an improved scoring scheme. Biopolymers. 2003;68:47–62. doi: 10.1002/bip.10218. [DOI] [PubMed] [Google Scholar]
- 29.Amaro RE, Baron R, McCammon JA. An improved relaxed complex scheme for receptor flexibility in computer-aided drug design. J. Comput. Aided Mol. Des. 2008;22:693–705. doi: 10.1007/s10822-007-9159-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Schames JR, Henchman RH, Siegel JS, Sotriffer CA, Ni H, McCammon JA. Discovery of a novel binding trench in HIV integrase. J. Med. Chem. 2004;47:1879–1881. doi: 10.1021/jm0341913. [DOI] [PubMed] [Google Scholar]
- 31.Di Santo R, Costi R, Roux A, Artico M, Lavecchia A, Marinelli L, Novellino E, Palmisano L, Andreotti M, Amici R, Galluzzo CM, Nencioni L, Palamara AT, Pommier Y, Marchand C. Novel bifunctional quinolonyl diketo acid derivatives as HIV-1 integrase inhibitors: design, synthesis, biological activities, and mechanism of action. J. Med. Chem. 2006;49:1939–1945. doi: 10.1021/jm0511583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Durrant JD, Urbaniak MD, Ferguson MAJ, McCammon JA. Computer-aided identification of Trypanosoma brucei uridine diphosphate galactose 4′-epimerase inhibitors: toward the development of novel therapies for African sleeping sickness. J. Med. Chem. 2010;53:5025–5032. doi: 10.1021/jm100456a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Sinko W, de Oliveira C, Williams S, Van Wynsberghe A, Durrant JD, Cao R, Oldfield E, McCammon JA. Applying molecular dynamics simulations to identify rarely sampled ligand-bound conformational states of undecaprenyl pyrophosphate synthase, an antibacterial target. Chem Biol Drug Des. 2011;77:412–420. doi: 10.1111/j.1747-0285.2011.01101.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cheng LS, Amaro RE, Xu D, Li WW, Arzberger PW, McCammon JA. Ensemble-based virtual screening reveals potential novel antiviral compounds for avian influenza neuraminidase. J. Med. Chem. 2008;51:3878–3894. doi: 10.1021/jm8001197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Nichols SE, Baron R, Ivetac A, McCammon JA. Predictive power of molecular dynamics receptor structures in virtual screening. J Chem Inf Model. 2011;51:1439–1446. doi: 10.1021/ci200117n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Osguthorpe DJ, Sherman W, Hagler AT. Exploring protein flexibility: incorporating structural ensembles from crystal structures and simulation into virtual screening protocols. J Phys Chem B. 2012;116:6952–6959. doi: 10.1021/jp3003992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Shaw DE, Deneroff MM, Dror RO, Kuskin JS, Larson RH, Salmon JK, Young C, Batson B, Bowers KJ, Chao JC, Eastwood MP, Gagliardo J, Grossman JP, Ho CR, Ierardi DJ, Kolossvary I, Klepeis JL, Layman T, Mcleavey C, Moraes MA, Mueller R, Priest EC, Shan Y, Spengler J, Theobald M, Towles B, Wang SC. Anton, a special-purpose machine for molecular dynamics simulation. Commun ACM. 2008;51:91–97. [Google Scholar]
- 38. http://www.nvidia.com/object/molecular_dynamics.html.
- 39. http://ambermd.org/gpus/benchmarks.htm.
- 40.Stone JE, Phillips JC, Freddolino PL, Hardy DJ, Trabuco LG, Schulten K. Accelerating molecular modeling applications with graphics processors. J Comput Chem. 2007;28:2618–2640. doi: 10.1002/jcc.20829. [DOI] [PubMed] [Google Scholar]
- 41.Stone JE, Hardy DJ, Ufimtsev IS, Schulten K. GPU-accelerated molecular modeling coming of age. J. Mol. Graph. Model. 2010;29:116–125. doi: 10.1016/j.jmgm.2010.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Shan Y, Kim ET, Eastwood MP, Dror RO, Seeliger MA, Shaw DE. How does a drug molecule find its target binding site? J Am Chem Soc. 2011;133:9181–9183. doi: 10.1021/ja202726y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lindorff-Larsen K, Piana S, Dror RO, Shaw DE. How fast-folding proteins fold. Science. 2011;334:517–520. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
- 44.Sugita Y, Okamoto Y. Replica-exchange molecular dynamics method for protein folding. Chem Phys Lett. 1999;314:141–151. [Google Scholar]
- 45.Fajer M, Hamelberg D, McCammon JA. Replica-Exchange Accelerated Molecular Dynamics (REXAMD) Applied to Thermodynamic Integration. J Chem Theory Comput. 2008;4:1565–1569. doi: 10.1021/ct800250m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Torrie GM, Valleau JP. Monte-Carlo Free-Energy Estimates Using Non-Boltzmann Sampling - Application to Subcritical Lennard-Jones Fluid. Chem Phys Lett. 1974;28:578–581. [Google Scholar]
- 47.Torrie GM, Valleau JP. Non-Physical Sampling Distributions in Monte-Carlo Free-Energy Estimation - Umbrella Sampling. J Comput Phys. 1977;23:187–199. [Google Scholar]
- 48.Patey GN, Valleau JP. Monte-Carlo Method for Obtaining Interionic Potential of Mean Force in Ionic Solution. J Chem Phys. 1975;63:2334–2339. [Google Scholar]
- 49.Laio A, Parrinello M. Escaping free-energy minima. P Natl Acad Sci. 2002;99:12562–12566. doi: 10.1073/pnas.202427399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Laio A, Gervasio FL. Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science. Rep Prog Phys. 2008;71 [Google Scholar]
- 51.Kastner J. Umbrella sampling. WIRES Comput Mol Sci. 2011;1:932–942. [Google Scholar]
- 52.Sugita Y, Kitao A, Okamoto Y. Multidimensional replica-exchange method for free-energy calculations. J Chem Phys. 2000;113:6042–6051. [Google Scholar]
- 53.Rosta E, Woodcock HL, Brooks BR, Hummer G. Artificial Reaction Coordinate “Tunneling” in Free-Energy Calculations: The Catalytic Reaction of RNase H. J Comput Chem. 2009;30:1634–1641. doi: 10.1002/jcc.21312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Kumar S, Bouzida D, Swendsen RH, Kollman PA, Rosenberg JM. The Weighted Histogram Analysis Method for Free-Energy Calculations on Biomolecules .1. The Method. J Comput Chem. 1992;13:1011–1021. [Google Scholar]
- 55.Souaille M, Roux B. Extension to the weighted histogram analysis method: combining umbrella sampling with free energy calculations. Comput Phys Commun. 2001;135:40–57. [Google Scholar]
- 56.Barducci A, Bonomi M, Parrinello M. Metadynamics. WIRES Comput Mol Sci. 2011;1:826–843. [Google Scholar]
- 57.Branduardi D, Gervasio FL, Cavalli A, Recanatini M, Parrinello M. The role of the peripheral anionic site and cation-pi interactions in the ligand penetration of the human AChE gorge. J Am Chem Soc. 2005;127:9147–9155. doi: 10.1021/ja0512780. [DOI] [PubMed] [Google Scholar]
- 58.Gervasio FL, Laio A, Parrinello M. Flexible docking in solution using metadynamics. J Am Chem Soc. 2005;127:2600–2607. doi: 10.1021/ja0445950. [DOI] [PubMed] [Google Scholar]
- 59.Limongelli V, Bonomi M, Marinelli L, Gervasio FL, Cavalli A, Novellino E, Parrinello M. Molecular basis of cyclooxygenase enzymes (COXs) selective inhibition. P Natl Acad Sci. 2010;107:5411–5416. doi: 10.1073/pnas.0913377107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Provasi D, Bortolato A, Filizola M. Exploring molecular mechanisms of ligand recognition by opioid receptors with metadynamics. Biochemistry. 2009;48:10020–10029. doi: 10.1021/bi901494n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Masetti M, Cavalli A, Recanatini M, Gervasio FL. Exploring complex protein-ligand recognition mechanisms with coarse metadynamics. J Phys Chem B. 2009;113:4807–4816. doi: 10.1021/jp803936q. [DOI] [PubMed] [Google Scholar]
- 62.Pietrucci F, Marinelli F, Carloni P, Laio A. Substrate binding mechanism of HIV-1 protease from explicit-solvent atomistic simulations. J Am Chem Soc. 2009;131:11811–11818. doi: 10.1021/ja903045y. [DOI] [PubMed] [Google Scholar]
- 63.Hamelberg D, Mongan J, McCammon JA. Accelerated molecular dynamics: a promising and efficient simulation method for biomolecules. J Chem Phys, American Institute of Physics. 2004;120:11919–11929. doi: 10.1063/1.1755656. [DOI] [PubMed] [Google Scholar]
- 64.Voter AF. Hyperdynamics: Accelerated molecular dynamics of infrequent events. Phys Rev Lett. 1997;78:3908–3911. [Google Scholar]
- 65.de Oliveira CA, Hamelberg D, McCammon JA. On the application of accelerated molecular dynamics to liquid water simulations. J Phys Chem B. 2006;110:22695–22701. doi: 10.1021/jp062845o. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Hamelberg D, de Oliveira CAF, McCammon JA. Sampling of slow diffusive conformational transitions with accelerated molecular dynamics. J Chem Phys. 2007;127:155102. doi: 10.1063/1.2789432. [DOI] [PubMed] [Google Scholar]
- 67.Markwick PR, Cervantes CF, Abel BL, Komives EA, Blackledge M, McCammon JA. Enhanced conformational space sampling improves the prediction of chemical shifts in proteins. J Am Chem Soc. 2010;132:1220–1221. doi: 10.1021/ja9093692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Grant BJ, McCammon JA, Gorfe AA. Conformational selection in G-proteins: lessons from Ras and Rho. Biophys J. 2010;99:L87–9. doi: 10.1016/j.bpj.2010.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Shen T, Hamelberg D. A statistical analysis of the precision of reweighting-based simulations. J Chem Phys. 2008;129:034103. doi: 10.1063/1.2944250. [DOI] [PubMed] [Google Scholar]
- 70.Wereszczynski J, McCammon JA. Nucleotide-dependent mechanism of Get3 as elucidated from free energy calculations. Proc Natl Acad Sci U S A. 2012 doi: 10.1073/pnas.1117441109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Lindert S, Kekenes-Huskey PM, Huber G, Pierce L, McCammon JA. Dynamics and Calcium Association to the N-Terminal Regulatory Domain of Human Cardiac Troponin C: A Multiscale Computational Study. J Phys Chem B. 2012 doi: 10.1021/jp212173f. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Simonson T, Archontis G, Karplus M. Free energy simulations come of age: protein-ligand recognition. Accounts of Chemical Research. 2002;35:430–437. doi: 10.1021/ar010030m. [DOI] [PubMed] [Google Scholar]
- 73.Fajer M, Swift RV, McCammon JA. Using multistate free energy techniques to improve the efficiency of replica exchange accelerated molecular dynamics. J Comput Chem. 2009;30:1719–1725. doi: 10.1002/jcc.21285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.de Oliveira CA, Hamelberg D, McCammon JA. Coupling Accelerated Molecular Dynamics Methods with Thermodynamic Integration Simulations. J Chem Theory Comput. 2008;4:1516–1525. doi: 10.1021/ct800160q. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Sinko W, de Oliveira CAF, Pierce LCT, McCammon JA. Protecting High Energy Barriers: A New Equation to Regulate Boost Energy in Accelerated Molecular Dynamics Simulations. J Chem Theory Comput. 2012;8:17–23. doi: 10.1021/ct200615k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Wereszczynski J, McCammon JA. Using Selectively Applied Accelerated Molecular Dynamics to Enhance Free Energy Calculations. J Chem Theory Comput. 2012;6:3285–3292. doi: 10.1021/ct100322t. [DOI] [PMC free article] [PubMed] [Google Scholar]



