Abstract
Traditional molecular dynamics (MD) simulations of proteins, which rely on integration of Newton’s equations of motion, cannot efficiently equilibrate water occupancy for buried cavities in proteins. This leads to slow convergence of thermodynamic averages for such systems. We have addressed this challenge by efficiently integrating standard Metropolis Monte Carlo (MC) translational water moves with MD in the AMBER simulation package. The translational moves allow water to easily enter or exit buried sites in a thermodynamically correct way during a simulation. To maximize efficiency, the algorithm avoids moves that only interchange waters within the bulk around the protein, instead focusing on moves that can transfer water between bulk and the protein interior. In addition, a steric grid allows avoidance of moves that would lead to obvious steric clashes, and a fast grid-based energy evaluation is used to reduce the number of expensive full energy calculations. The potential energy distribution produced using MC/MD was found to be statistically indistinguishable from that of control simulations using only MD, and the algorithm effectively equilibrated water across steric barriers and into binding pockets that are not accessible with pure MD. The MC/MD method introduced here should be of increasing utility for applications spanning protein folding, the elucidation of protein mechanisms, and free energy calculations for computer-aided drug design. It is available in version 18 release of the widely disseminated AMBER simulation package.
Graphical Abstract:
2. INTRODUCTION
Many proteins contain buried cavities large enough to accommodate a buried water molecule.1 Such cavities may open and allow water exchange only rarely, particularly if they are deeply buried.2 As a consequence, standard molecular dynamics (MD) simulations, that integrate Newton’s equations of motion, do not efficiently sample water occupancy states in buried cavities3. Because water occupancy of a buried site influences the conformational distribution of the protein, standard MD simulations may be inefficient at sampling protein conformations in a manner that accounts for both occupied and unoccupied states of buried cavities. Thus, the conformational distribution provided by even a long MD simulation may not accurately follow the Boltzmann distribution associated with the force field and the conditions (e.g. solvent composition and temperature) of the simulation. The challenge of equilibrating water in buried sites also has important implications for the calculation of protein-ligand binding free energies and hence for computer-aided drug design.3,4 For example, it may be difficult to converge an alchemical relative binding free energy calculation5 that modifies a ligand in a buried site in a manner that requires entry or exit of a buried water molecule. Similarly, binding affinity calculations by the double-decoupling method,6 in which the ligand is gradually decoupled from the binding site, require that water be free to enter and occupy the site. This may pose serious convergence issues if the site is deeply buried.7 The kinetics of water penetration may be a bottleneck to convergence of other types of molecular systems as well, such as membranes and porous materials.
Several prior works have addressed this important problem by integrating grand canonical Monte Carlo (MC) steps with MD simulations,3,4,8 so that water molecules can be added to or removed from random locations in the system in a manner that preserves the Boltzmann distribution of states for a given chemical potential of water. Thus, water molecules can enter or leave a buried cavity even if the cavity remains closed to the outside solvent. This elegant method solves the problem, but it adds algorithmic complexity, and variations in the total numbers of water molecules during a simulation can require substantial changes in the tools used to analyze the results.
Here, we describe a simpler yet rigorous solution, where standard MD steps are supplemented with translational MC steps executed within a rectangular region that overlaps with the protein interior and part or all of the bulk solvent around the protein. The total number of water molecules in the system remains constant, and application of the Metropolis accept/reject criterion means the method yields a valid Boltzmann distribution, as required for the calculation of protein properties and binding thermodynamics. Detailed balance is preserved without the need for reweighting because the rectangular region is both the origin and destination of all water moves, only the waters within this region are selected for moves, their candidate destination sites are also within this region, and the waters and sites are sampled with uniform probability. Hence the algorithm does not lead to preferential sampling.9 It is worth noting that the present method has a different purpose than algorithms whose aim is to efficiently identify stable water-binding sites in protein crystal structures.10–15 Such methods are valuable tools to initialize a simulation in a likely hydration state, but they do not yield a Boltzmann distribution and are not integrated into simulations. It is also important to distinguish the present work from methods like WaterMap,16,17 GIST,18,19, and STOW,20 which are ways of analyzing water distributions from MD, not of generating the water distribution. In fact, our development of the present method is motivated in part by the aim of generating valid water distributions for analysis by such methods. The following subsections detail the method and its implementation in the widely used AMBER simulation package,21,22 and present a series of tests that validate the resulting conformational ensemble and the ability of the method to equilibrate water across barriers and into buried sites.
3. METHODS
We introduce a method to integrate standard Metropolis Monte Carlo (MC) translational water moves with MD, allowing water molecules to enter and exit buried sites during the course of a simulation and thus to generate an equilibrium distribution.
3.1. WORKFLOW
The basic approach is to iteratively alternate a block of NMC move attempts with a block of NMD time steps (Figure 1). Each block of MD starts from the configuration present at the end of the prior MC block, and vice versa. The velocities of all the atoms in the system at the start of each MD block are the same as those at the end of the prior MD block; i.e., a moved water molecule carries its velocities. In addition, the orientation of a moved water is not modified, as the orientation of waters in the bulk is randomized by the MD blocks. Trajectory snapshots (atomic coordinates and velocities and system properties such as energy) are saved during the MD block, but not during the MC block. The MD and the full Particle Mesh Ewald (PME) energy calculation are performed on the GPU when using AMBER18’s GPU-accelerated PMEMD simulation code.23–26 The MC part is performed on the CPU.
Although this procedure is valid, in the sense of generating a Boltzmann distribution of configurations for whatever force field is used, implementing it in a naïve manner would be inefficient, for two main reasons. First, for the condensed phase systems, which are of greatest interest here, most move attempts would lead to steric overlaps and hence to Metropolis rejections that do not advance the goal of sampling new water positions. Nonetheless, each failed move attempt would still require a costly energy calculation. Second, for simulations of a protein in water, many of the water moves would merely move water molecules from place to place in the bulk around the protein and thus would not serve the purpose of equilibrating water between the bulk and the protein interior; and MD is already highly effective at equilibrating water in the bulk. We have addressed these inefficiencies by a combination of methods, as now described.
3.1.1. FOCUSING MOVE ATTEMPTS WITH A STERIC GRID
An MC move attempt involves tentatively relocating a water molecule from its initial position within the rectangular sampling region to a target position also in the rectangular sampling region. If the target position would lead to gross steric overlap of the moved water with nearby atoms, it would generate a very high energy change, so the move attempt would be rejected with near certainty. Hence making such move attempts, computing the resulting high energies, and then rejecting them, is essentially equivalent to not trying them at all. The present method takes advantage of this equivalence by only attempting moves to target positions that will not generate a gross steric overlap, thus avoiding many needless energy calculations. This procedure does not require any adjustment in configuration weights because configurations are not saved during the MC procedure. The only role of the MC block is to deliver a new starting configuration for the following block of MD moves.
To accomplish this, and to reduce the number of uninteresting move attempts where both the initial and target locations are in the bulk, we set up a “steric grid” within the rectangular sampling region that includes at least part of the protein interior and that extends into the surrounding bulk water (Figure 2b). This region is defined by the input variable shift (default 10 Å). The origin of the grid is therefore positioned at (shift, shift, shift), where the origin of the simulation box is at (0,0,0). The maximal coordinates of the grid along the x and y axes are the size of the periodic box minus shift. The maximum z-coordinate of the grid is the maximum z-coordinate of the simulation box minus 3 Å. Thus, the grid can be sized to just encompass the protein along the x- and y-axes while extending into bulk solvent along the z-axis (see Figure 2).
At the start of each MC block, each grid point is marked to indicate whether it is sterically blocked, as detailed below. A move attempt then involves choosing a water whose oxygen atom is within the steric grid and moving it to an unblocked target position anywhere on the grid. A water molecule in bulk can thus jump to an unoccupied site in the protein interior and vice versa. After each accepted move, the steric grid is updated to account for the move.
The 0.2 Å resolution steric grid is constructed as follows. It is initialized by assigning a zero overlap count to each grid point. The algorithm (Figure 1) then loops over all atoms in the system and increments the overlap count at each grid point by 1 for every atom that overlaps it (Figure 2B). Each atom is considered to overlap a spherical region of radius rvdw +rwat, where rvdw is the atom’s van der Waals radius, based on elements,27 and rwat an effective steric radius for a water molecule. We found empirically that using rwat = 0.9 Å provided good efficiency and negligible risk of treating an accessible site as occupied. Thus, if a grid point is covered by 3 atoms, (van der Waals radius plus added water radius; see Figure 2B), the value of this grid point will be 3. Grid points with the value zero then lie within cavities in which a water molecule can be placed without causing major steric clashes with the existing atoms. We make a list of all these empty grid points for use in choosing target positions for water moves.
To execute a move, the algorithm randomly selects a water molecule that lies within the rectangular region covered by the grid and moves it to a target position in the immediate neighborhood of a grid point randomly selected from the list of empty grid points. The target position is generated by allowing random shifts of up to a half grid unit (0.1 Å) from the selected grid point along all three axes. The move is then either accepted or rejected, as detailed in Section 3.1.2. If the move is accepted, the water’s coordinates are shifted to the target position, and the steric grid is updated by: a) decrementing by 1 the overlap count of the grid points that the atom previously overlapped; and b) incrementing by 1 the overlap count of the grid points it overlaps after the move. Grid points that had been covered by only this water molecule now have an overlap count of 0 and thus are accessible for the next move attempt, while grid points that had an overlap count of zero but now are overlapped by the moved water are inaccessible for the next move attempt. This approach is similar but not identical to the cavity-bias algorithm used in a number of grand canonical MC implementations.3,8,28–31
3.1.2. FILTERING MOVES BASED ON HYDROGEN CLASHES AND A FAST ENERGY ESTIMATOR
We observed that even moves generated using the steric grid often have very high energies. This results partly from the requirement of using a small value of rwat to avoid excluding close hydrogen-bonded configurations, and partly from the fact that the steric grid does not effectively deal with polar hydrogen atoms, due to their small Lennard-Jones radii. We therefore developed filters that efficiently reject trial moves with very high energies and thus reduce the number of costly full Particle Mesh Ewald (PME) energy calculations. First, a move is immediately rejected if it leads to a hydrogen-hydrogen distance <0.5 Å. Second, the algorithm estimates the energy change due to the trial move by computing the interaction energy of the water with only its neighboring atoms before and after the move and examining the difference between these two energies. If the difference is highly unfavorable (>15 kcal mol−1), the move is rejected. Otherwise, a full energy calculation is performed and the Metropolis criterion32 is used to decide whether to accept or reject the move. The validity of this procedure is demonstrated in Results.
The interaction energy calculations are accelerated by using a grid-based neighbor list. Thus, at the start of each MC block, a second grid, which covers the entire simulation box, is set up, with a coarse spacing defined as half the user-defined AMBER variable cutoff, which is the distance cutoff used for nonbonded interactions. Each coarse grid point is then assigned a list of the atoms in its corresponding voxel. To evaluate a trial move, the algorithm calculates the interactions of the water molecule with only the atoms that are in the neighboring two coarse voxels in each direction.
3.2. EVALUATION
We verified that the present method yields a proper Boltzmann distribution and then evaluated its ability to sample hydration changes in buried cavities. These tests are detailed in the following subsections.
3.2.1. MAINTAINING THE BOLTZMANN DISTRIBUTION
We tested the method on a system that is less dense than liquid water and so has a high MC acceptance rate, so that any error introduced by the MC procedure would easily become apparent. We chose a system with 612 gas phase water molecules at 500 K in a constant volume box about 113 Å along each axis. The rectangular sampling region, with its steric grid, covered the entire simulation box. The MC acceptance rate was 45% for this system. The SHAKE algorithm33,34 was used with the default tolerance of 0.00001. The integration time step was set to 0.2 fs in both the pure MD and the MCMD calculations to minimize deviations from the Boltzmann ensemble due to the use of a finite time-step. This maximized compatibility with the MC steps, for which there are no finite time-step errors. The reference MD simulation was run for 20 ns and the MD energy was recorded every 10 steps, resulting in 107 full energy calculations. For the MCMD calculation, we ran 2.5×106 cycles, with NMC=NMD=10, and the energy was recorded immediately following each MC component of the cycle.
3.2.2. HYDRATING AND DEHYDRATING INACCESSIBLE SITES
We used several systems to test the ability of the MC/MD method to allow water exchange between otherwise inaccessible regions.
The first test system was a ~30×30×85 Å rectangular box partitioned into two regions by planar graphene sheets at Z ≈ 42 Å and 70 Å (Figure 5A). The outer sides of the system were initially populated with water molecules at normal liquid density, whereas the middle region was initially populated with a single water molecule. The simulations were performed at 500 K in boxes of constant volume. The expectation was that pure MD would not allow water to exchange between the two regions, while MC/MD should allow the densities to equalize. In one calculation, the rectangular sampling and steric grid covered the entire simulation box. In a second calculation the sampling region and grid spanned only a thin elongated rectangle (10<X<20;10<Y<20;10<Z<82 Å) which crossed the graphene walls. (Figure 5A).
The second test system started with a water molecule trapped inside a C60 buckyball (Figure 6A). As this water molecule forms no hydrogen bonds with any of the surrounding carbons, it is energetically unstable relative to a water in bulk. The buckyball was solvated in 11 Å of water molecules in all directions, forming a cubic simulation box (Figure 6A). The simulation was performed at constant volume at 300 K.
The third and fourth test systems were proteins with buried water sites. Major Urinary Protein (MUP)35 contains a buried crystallographic water molecule which bridges between the ligand (nonan-1-ol) and Tyr120 (Figure 6B), while Cytochrome c Peroxidase (CcP)36 has a trapped crystallographic water molecule which binds to the Glu229 (Figure 6C). Here we removed the buried water molecules and compared the ability of pure MD and MC/MD to reoccupy these cavities. To focus on the water exchanges, the Cα atoms and the MUP ligand atoms were restrained during the simulation with positional restraints of 5 kcal/mol-Å2. For the MUP and CcP, the simulations included bulk water to a thickness of 11 and 10 Å respectively in all directions, in a cubic simulation box, and the shift variable (Section 3.1.1) was set to 11 Å.
3.3. USING THE MC/MD METHOD IN AMBER
The MC/MD method is invoked and controlled with several keywords in the AMBER input file: mcwat invokes the MC/MD method when set to 1 (default=0); NMC sets NMC (default=1,000); NMD sets NMD (default=1,000); and mcboxshift sets the value of shift, the gap between the sides of the grid box and the sides of the simulation box (Section 3.1.1 and Figure 1; default=10Å). If the system is prepared with cubic periodic boundary conditions with an equal amount of solvent padding along all three axes, it is recommended that this value be set to the thickness of the padding. The input variable mcresstr is the name of the water residue e.g. WAT or HOH.
3.4. SIMULATION DETAILS
Following equilibration at constant pressure and temperature, production MD was performed at constant volume using the TIP3P water model37, the ff14SB38,39 force field for the protein, and GAFF40 for the ligand of MUP. The CcP was prepared as described by Balius et al.41 The particle mesh Ewald (PME) method42 was used for periodic boundary conditions. The SHAKE algorithm33,34 was used to constrain hydrogen bond lengths. The cutoff used for non-bonded interactions was 12Å. The time step was 2 fs for the 300 K simulations, and 0.2 fs for the gas phase (500 K) simulations. The MC blocks were run at a matching temperature and constant volume.
4. RESULTS
4.1. ACCURACY OF THE FAST ENERGY ESTIMATOR
We evaluated the accuracy of the fast energy estimator (Section 3.1.2) by comparing the energy differences it provides for 107 MC trial moves in the CcP system (Section 3.2.2) with energy differences computed with the full PME method. The agreement and correlation are excellent (Figure 3), with a root-mean-squared deviation of 0.91 kcal mol−1, coefficient of determination (R2) of 1.00, and linear regression y = 1.00x + 0.68. Using the fast energy estimator resulted in a 2-fold speed up compared to the full energy. These results support the utility of the fast estimator as a filter to reduce the number of full energy calculations that must be performed during the MC procedure.
4.2. MAINTAINING THE BOLTZMANN DISTRIBUTION
We checked whether the MC/MD procedure maintains a proper Boltzmann distribution for a gas phase test system where ~45% of MC attempted moves are accepted, so that the MC steps contribute heavily to the distribution of states. As shown in Figure 4, the potential energy distribution provided by the MC/MD method agrees closely with that provided by a pure MD simulation of the same system. For MC/MD, the mean potential energy over 2.5×106 energy evaluations was −696.53 kcal mol−1, with a standard error of the mean (SEM) by blocking analysis 43 of 1.55 kcal mol−1, while the mean and SEM for 107 pure MD steps was −695.85 kcal mol−1 with SEM 1.52 kcal mol−1. The good agreement between MC/MD and pure MD supports the validity of the MC/MD method and its correct maintenance of detailed balance.
4.3. HYDRATING AND DEHYDRATING INACCESSIBLE SITES
Further tests of the MC/MD procedure started with two relatively artificial test cases. The first tests the ability of the MC/MD procedure to equilibrate water density across impermeable graphene barriers (Figure 5A), so that an initially nonuniform distribution of water in two regions of the simulation box may become uniform. As expected, based on the construction of the system, standard MD does not allow water to populate the empty region, due to the barriers (data not shown). In contrast, the translational MC moves in the MC/MD method smoothly equalizes the densities in the two regions, as intended, both with a rectangular sampling region covering the entire simulation box and with a much smaller sampling region that spans the graphene walls asymmetrically (Figure 5B & 5C). The fact that the asymmetric grid equalizes the densities faster probably results from the fact that a smaller fraction of its volume is in the region with high initial density, so, during the equilibration phase, it generates more moves into the region with initial low-density. The second test system starts with a buckyball containing one water molecule (Figure 6A). With MD, the water is trapped, while with MC/MD, the water returns to the surrounding bulk water and the buckyball remains unoccupied thereafter. This outcome is not unexpected, given that the trapped water is unable to form stabilizing polar interactions with the nonpolar carbon atoms of the buckyball.
We then examined buried water sites in two proteins, the Major Urinary Protein (Figure 6B) and Cytochrome c Peroxidase (Figure 6C). In both cases, we artificially removed the crystallographic water from the site and simulated the proteins immersed in bulk water, using pure MD and the MC/MD procedure. In both cases, 10 μs of pure MD simulation failed to transfer a water into the empty sites, while MC/MD simulations successfully hydrated them. Following hydration of the sites, no MC moves were accepted that emptied them again. For these two test cases, the CPU time required to carry out 1,000 MC steps followed by 1,000 MD steps results is currently ~5 times greater than that to carry out just the 1,000 MD steps, but this computational cost is accompanied by a clear improvement in conformational sampling.
We also examined how the number of MC steps in each block, NMC, affects the rate of water transfer from bulk to each site, by carrying out runs with NMC = 1,000, 5,000 and 10,000; NMD = 1,000 in all cases. Ten replicates, with different random number seeds, were run for each value of NMC. As shown in Figure 7, increasing NMC consistently shortens the number of MC/MD cycles required for a given fraction of the replicates to become hydrated. Over all runs, the mean number of MC steps required to reach a 50% probability of occupation is 7 × 105 for MUP and 5 × 106 for CcP. Note that these statistics depend, in general, on the dimensions of the steric grid.
5. DISCUSSION
The hybrid MC/MD approach described here accelerates configurational sampling during simulations by allowing large translational water moves during the MC component of each MC/MD cycle. These moves allow water molecules to jump in and out of buried cavities without any need to wait for the protein to sample conformations in which the cavities are connected by a channel to the bulk. The method yields the same Boltzmann distribution of states expected from a sufficiently long MD simulation, so it can be used to compute thermodynamic averages and properties, such as conformational distributions and free energies. This characteristic differentiates the method from more targeted algorithms aimed only at finding high-occupancy water sites, rather than at sampling both occupied and unoccupied states according to the potential function (force field) being used in the simulation. The present method is more closely related to ones that combine MD with grand canonical Monte Carlo steps, in which water molecules may be introduced to or removed from the simulation in a manner that preserves a Boltzmann distribution.3,44 However, the full grand canonical machinery is not needed to achieve our goal of equilibrating between buried sites and bulk, and it may be simpler to interpret and analyze simulation results in which the number of water molecules remains constant and integer, as in the present approach.
The present method, which is available in the AMBER simulation package as of version 18,21 should be useful to accelerate calculations of thermodynamic averages in any setting where water equilibration is rate-limiting for pure MD simulations. We anticipate that it will be particularly valuable in calculating protein-ligand binding free energies where the binding site is isolated from bulk solvent, as it will allow water occupancy to change when a ligand is alchemically modified45–47 or fully decoupled from the binding site.6 It may also speed equilibration of protein conformation ensembles for proteins, such as BPTI,48,49 which form cavities large enough to bind water in their folded state. By the same token, it should be noted that the present method is not designed to generate the correct kinetics for the Hamiltonian being simulated, as the MC steps that speed equilibration are nonphysical.
Key parameters of the method are the dimensions and position of the steric grid and the numbers of MC and MD steps in each cycle. For the protein cases studied here, we have used a steric grid which covers the entire protein, but in many applications, a smaller grid will be advantageous. For example, if one wishes to speed the equilibration of waters between a specific ligand-binding site and the bulk, then a small, focused steric grid just large enough to cover the site and extend into bulk may suffice and should greatly increase efficiency by focusing the MC move attempts on the site of interest. It may also be appropriate to adjust NMC and NMD to suit the application. For example, if the goal is to determine the hydration state of a cavity for a fairly well-defined protein structure, one might set NMC>>NMD. This scenario may be particularly relevant for analyses of water structure with methods like WaterMap16,17 and GIST.18,19 On the other hand, if the goal is to explore the conformational space of the protein, a 1:1 ratio may be more useful. In all cases, we suggest setting NMC ≥1,000, because the steric grid is rebuilt for every cycle, and it is favorable to amortize this computational cost across a large number of MC steps.
We foresee opportunities to further accelerate these calculations in the future. For example, porting the MC cycles to the GPU would reduce the amount of costly communication between GPU and CPU. It should also be possible to evaluate MC energy changes not by recomputing the full energy but instead by computing the change in the interactions of the moved water with all other atoms, while properly accounting for the Ewald summation of Coulombic interactions. Thus, the MC/MD paradigm introduced here should be of increasing utility for applications spanning protein folding, the elucidation of protein mechanisms, and free energy calculations for computer-aided drug design, as well as for a broader range of molecular systems with cavities that are not directly accessible to bulk water.
6 ACKNOWLEDGEMENTS
MKG acknowledges funding from National Institute of General Medical Sciences (GM061300 and GM100946). RCW and CL thanks Intel and their Parallel Computing Center for support. These findings are solely of the authors and do not necessarily represent the views of the NIH or Intel. MKG has an equity interest in and is a cofounder and scientific advisor of VeraChem LLC. GSK did not fund this work and the authors declare no competing interest. We thank Dr. Trent Balius for providing the prepared CcP-gateless structures.
7 REFERENCES
- (1).Bissantz C; Kuhn B; Stahl M A Medicinal Chemist’s Guide to Molecular Interactions. J. Med. Chem. 2010, 53 (14), 5061–5084. 10.1021/jm100112j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (2).Poornima CS; Dean PM Hydration in Drug Design. 2. Influence of Local Site Surface Shape on Water Binding. J. Comput. Aided Mol. Des. 1995, 9 (6), 513–520. [DOI] [PubMed] [Google Scholar]
- (3).Deng Y; Roux B Computation of Binding Free Energy with Molecular Dynamics and Grand Canonical Monte Carlo Simulations. J. Chem. Phys. 2008, 128 (11), 115103 10.1063/1.2842080. [DOI] [PubMed] [Google Scholar]
- (4).Ross GA; Bruce Macdonald HE; Cave-Ayland C; Cabedo Martinez AI; Essex JW Replica-Exchange and Standard State Binding Free Energies with Grand Canonical Monte Carlo. J. Chem. Theory Comput. 2017, 13 (12), 6373–6381. 10.1021/acs.jctc.7b00738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (5).Tembre BL; Mc Cammon JA Ligand-Receptor Interactions. Comput. Chem. 1984, 8 (4), 281–283. 10.1016/0097-8485(84)85020-2. [DOI] [Google Scholar]
- (6).Gilson MK; Given JA; Bush BL; McCammon JA The Statistical-Thermodynamic Basis for Computation of Binding Affinities: A Critical Review. Biophys. J. 1997, 72 (3), 1047–1069. 10.1016/S0006-3495(97)78756-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Mobley DL; Gilson MK Predicting Binding Free Energies: Frontiers and Benchmarks. Annu. Rev. Biophys. 2017, 46, 531–558. 10.1146/annurev-biophys-070816-033654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Woo H-J; Dinner AR; Roux B Grand Canonical Monte Carlo Simulations of Water in Protein Environments. J Chem Phys 2004, 121 (13), 6392–6400. 10.1063/1.1784436. [DOI] [PubMed] [Google Scholar]
- (9).Allen Michael P. and Tildesley Dominic J.. Computer Simulation of Liquids; Clarendon Press: Oxford, 1987. [Google Scholar]
- (10).Hermans J; Xia X; Zhang L; Cavanaugh D. DOWSER Program; Department of Biochemistry and Biophysics, School of Medicine, University of North Carolina, Chapel Hill, NC: 27599–7260, 2003. [Google Scholar]
- (11).Michel J; Tirado-Rives J; Jorgensen WL Prediction of the Water Content in Protein Binding Sites. J. Phys. Chem. B 2009, 113 (40), 13337–13346. 10.1021/jp9047456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Huggins DJ; Tidor B Systematic Placement of Structural Water Molecules for Improved Scoring of Protein–Ligand Interactions. Protein Eng. Des. Sel. 2011, 24 (10), 777–789. 10.1093/protein/gzr036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (13).Morozenko A; Stuchebrukhov AA Dowser++, a New Method of Hydrating Protein Structures. Proteins 2016, 84 (10), 1347–1357. 10.1002/prot.25081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (14).Sridhar A; Ross GA; Biggin PC Waterdock 2.0: Water Placement Prediction for Holo-Structures with a Pymol Plugin. PLoS ONE 2017, 12 (2), e0172743 10.1371/journal.pone.0172743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (15).Nittinger E; Flachsenberg F; Bietz S; Lange G; Klein R; Rarey M Placement of Water Molecules in Protein Structures: From Large-Scale Evaluations to Single-Case Examples. J. Chem. Inf. Model. 2018, 58 (8), 1625–1637. 10.1021/acs.jcim.8b00271. [DOI] [PubMed] [Google Scholar]
- (16).Young T; Abel R; Kim B; Berne BJ; Friesner RA Motifs for Molecular Recognition Exploiting Hydrophobic Enclosure in Protein–Ligand Binding. Proc. Natl. Acad. Sci. U.S.A. 2007, 104 (3), 808–813. 10.1073/pnas.0610202104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (17).Abel R; Young T; Farid R; Berne BJ; Friesner RA Role of the Active-Site Solvent in the Thermodynamics of Factor Xa Ligand Binding. J. Am. Chem. Soc. 2008, 130 (9), 2817–2831. 10.1021/ja0771033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (18).Nguyen CN; Young TK; Gilson MK Grid Inhomogeneous Solvation Theory: Hydration Structure and Thermodynamics of the Miniature Receptor Cucurbit[7]Uril. J. Chem. Phys. 2012, 137 (4), 044101 10.1063/1.4733951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Nguyen CN; Cruz A; Gilson MK; Kurtzman T Thermodynamics of Water in an Enzyme Active Site: Grid-Based Hydration Analysis of Coagulation Factor Xa. J. Chem. Theory Comput. 2014, 10 (7), 2769–2780. 10.1021/ct401110x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Li Z; Lazaridis T Computing the Thermodynamic Contributions of Interfacial Water. Methods Mol. Biol. 2012, 819, 393–404. 10.1007/978-1-61779-465-0_24. [DOI] [PubMed] [Google Scholar]
- (21).Case DA, Ben-Shalom IY, Brozell SR, Cerutti DS, Cheatham TE III, Cruzeiro VWD, Darden TA, Duke RE, Ghoreishi D, Gilson MK, Gohlke H, Goetz AW, Greene D, Harris R, Homeyer N, Izadi S, Kovalenko A, Kurtzman T, Lee TS, LeGrand S, Li P, Lin C, Liu J, Luchko T, Luo R, Mermelstein DJ, Merz KM, Miao Y, Monard G, Nguyen C, Nguyen H, Omelyan I, Onufriev A, Pan F, Qi R, Roe DR, Roitberg A, Sagui C, Schott-Verdugo S, Shen J, Simmerling CL, Smith J, Salomon-Ferrer R, Swails J, Walker RC, Wang J, Wei H, Wolf RM, Wu X, Xiao L, York DM and Kollman PA. Amber18; University of California, San Francisco, 2018. [Google Scholar]
- (22).Case DA, Ben-Shalom IY, Brozell SR, Cerutti DS, Cheatham TE III, Cruzeiro VWD, Darden TA, Duke RE, Ghoreishi D, Gilson MK, Gohlke H, Goetz AW, Greene D, Harris R, Homeyer N, Izadi S, Kovalenko A, Kurtzman T, Lee TS, LeGrand S, Li P, Lin C, Liu J, Luchko T, Luo R, Mermelstein DJ, Merz KM, Miao Y, Monard G, Nguyen C, Nguyen H, Omelyan I, Onufriev A, Pan F, Qi R, Roe DR, Roitberg A, Sagui C, Schott-Verdugo S, Shen J, Simmerling CL, Smith J, Salomon-Ferrer R, Swails J, Walker RC, Wang J, Wei H, Wolf RM, Wu X, Xiao L, York DM and Kollman PA. AmberTools18; University of California, San Francisco, 2018. [Google Scholar]
- (23).Götz AW; Williamson MJ; Xu D; Poole D; Le Grand S; Walker RC Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born. J. Chem. Theory Comput. 2012, 8 (5), 1542–1555. 10.1021/ct200909j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (24).Salomon-Ferrer R; Götz AW; Poole D; Le Grand S; Walker RC Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald. J. Chem. Theory Comput. 2013, 9 (9), 3878–3888. 10.1021/ct400314y. [DOI] [PubMed] [Google Scholar]
- (25).Le Grand S; Götz AW; Walker RC SPFP: Speed without Compromise—A Mixed Precision Model for GPU Accelerated Molecular Dynamics Simulations. Comput. Phys. Commun. 2013, 184 (2), 374–380. 10.1016/j.cpc.2012.09.022. [DOI] [Google Scholar]
- (26).Mermelstein DJ; Lin C; Nelson G; Kretsch R; McCammon JA; Walker RC Fast and Flexible Gpu Accelerated Binding Free Energy Calculations within the Amber Molecular Dynamics Package. J. Comput. Chem. 2018, 39 (19), 1354–1358. 10.1002/jcc.25187. [DOI] [PubMed] [Google Scholar]
- (27).Bondi A Van Der Waals Volumes and Radii. J. Phys. Chem. 1964, 68 (3), 441–451. 10.1021/j100785a001. [DOI] [Google Scholar]
- (28).Mezei M A Cavity-Biased (T, V, μ) Monte Carlo Method for the Computer Simulation of Fluids. Mol. Phys. 1980, 40 (4), 901–906. 10.1080/00268978000101971. [DOI] [Google Scholar]
- (29).Mezei M Excess Free Energy of Different Water Models Computed by Monte Carlo Methods. Mol. Phys. 1982, 47 (6), 1307–1315. 10.1080/00268978200100992. [DOI] [Google Scholar]
- (30).Deitrick GL; Scriven LE; Davis HT Efficient Molecular Simulation of Chemical Potentials. J. Chem. Phys. 1989, 90 (4), 2370–2385. 10.1063/1.455979. [DOI] [Google Scholar]
- (31).Stapleton MR; Panagiotopoulos AZ Application of Excluded Volume Map Sampling to Phase Equilibrium Calculations in the Gibbs Ensemble. J. Chem. Phys. 1990, 92 (2), 1285–1293. 10.1063/1.458138. [DOI] [Google Scholar]
- (32).Metropolis N; Rosenbluth AW; Rosenbluth MN; Teller AH; Teller E Equation of State Calculations by Fast Computing Machines. J. Chem. Phys. 1953, 21 (6), 1087–1092. 10.1063/1.1699114. [DOI] [Google Scholar]
- (33).Ryckaert J-P; Ciccotti G; Berendsen HJC Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes. J. Comput. Phys. 1977, 23 (3), 327–341. 10.1016/0021-9991(77)90098-5. [DOI] [Google Scholar]
- (34).Miyamoto S; Kollman PA Settle: An Analytical Version of the SHAKE and RATTLE Algorithm for Rigid Water Models. J. Comput. Chem. 1992, 13 (8), 952–962. 10.1002/jcc.540130805. [DOI] [Google Scholar]
- (35).Malham R; Johnstone S; Bingham RJ; Barratt E; Phillips SEV; Laughton CA; Homans SW Strong Solute−Solute Dispersive Interactions in a Protein−Ligand Complex. J. Am. Chem. Soc. 2005, 127 (48), 17061–17067. 10.1021/ja055454g. [DOI] [PubMed] [Google Scholar]
- (36).Fischer M; Coleman RG; Fraser JS; Shoichet BK Incorporation of Protein Flexibility and Conformational Energy Penalties in Docking Screens to Improve Ligand Discovery. Nat. Chem. 2014, 6 (7), 575–583. 10.1038/nchem.1954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (37).Jorgensen WL; Chandrasekhar J; Madura JD; Impey RW; Klein ML Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983, 79 (2), 926–935. 10.1063/1.445869. [DOI] [Google Scholar]
- (38).Hornak V; Abel R; Okur A; Strockbine B; Roitberg A; Simmerling C Comparison of Multiple Amber Force Fields and Development of Improved Protein Backbone Parameters. Proteins 2006, 65 (3), 712–725. 10.1002/prot.21123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (39).Maier JA; Martinez C; Kasavajhala K; Wickstrom L; Hauser KE; Simmerling C Ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from Ff99SB. J. Chem. Theory Comput. 2015, 11 (8), 3696–3713. 10.1021/acs.jctc.5b00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (40).Wang J; Wolf RM; Caldwell JW; Kollman PA; Case DA Development and Testing of a General Amber Force Field. J. Comput. Chem. 2004, 25 (9), 1157–1174. 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
- (41).Balius TE; Fischer M; Stein RM; Adler TB; Nguyen CN; Cruz A; Gilson MK; Kurtzman T; Shoichet BK Testing Inhomogeneous Solvation Theory in Structure-Based Ligand Discovery. Proc. Natl. Acad. Sci. U.S.A. 2017, 114 (33), E6839–E6846. 10.1073/pnas.1703287114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (42).Darden T; York D; Pedersen L Particle Mesh Ewald: An N log(N) Method for Ewald Sums in Large Systems. J. Chem. Phys. 1993, 98 (12), 10089–10092. 10.1063/1.464397. [DOI] [Google Scholar]
- (43).Flyvbjerg H; Petersen HG Error Estimates on Averages of Correlated Data. J. Chem. Phys. 1989, 91 (1), 461–466. 10.1063/1.457480. [DOI] [Google Scholar]
- (44).Ross GA; Bodnarchuk MS; Essex JW Water Sites, Networks, And Free Energies with Grand Canonical Monte Carlo. J. Am. Chem. Soc. 2015, 137 (47), 14930–14943. 10.1021/jacs.5b07940. [DOI] [PubMed] [Google Scholar]
- (45).Boyce SE; Mobley DL; Rocklin GJ; Graves AP; Dill KA; Shoichet BK Predicting Ligand Binding Affinity with Alchemical Free Energy Methods in a Polar Model Binding Site. J. Mol. Biol. 2009, 394 (4), 747–763. 10.1016/j.jmb.2009.09.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (46).Chodera JD; Mobley DL; Shirts MR; Dixon RW; Branson K; Pande VS Alchemical Free Energy Methods for Drug Discovery: Progress and Challenges. Curr. Opin. Struct. Biol. 2011, 21 (2), 150–160. 10.1016/j.sbi.2011.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (47).Gill SC; Lim NM; Grinaway PB; Rustenburg AS; Fass J; Ross GA; Chodera JD; Mobley DL Binding Modes of Ligands Using Enhanced Sampling (BLUES): Rapid Decorrelation of Ligand Binding Modes via Nonequilibrium Candidate Monte Carlo. J. Phys. Chem. B 2018, 122 (21), 5579–5598. 10.1021/acs.jpcb.7b11820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (48).Denisov VP; Peters J; Hörlein HD; Halle B Using Buried Water Molecules to Explore the Energy Landscape of Proteins. Nat. Struct. Mol. Biol. 1996, 3 (6), 505–509. 10.1038/nsb0696-505. [DOI] [PubMed] [Google Scholar]
- (49).Fischer S; Verma CS Binding of Buried Structural Water Increases the Flexibility of Proteins. Proc. Natl. Acad. Sci. U.S.A. 1999, 96 (17), 9613–9615. 10.1073/pnas.96.17.9613. [DOI] [PMC free article] [PubMed] [Google Scholar]