Skip to main content
Biophysical Reviews logoLink to Biophysical Reviews
. 2022 Nov 19;14(6):1349–1358. doi: 10.1007/s12551-022-01010-z

Advancing the field of computational drug design using multicanonical molecular dynamics-based dynamic docking

Gert-Jan Bekker 1,, Narutoshi Kamiya 2
PMCID: PMC9842809  PMID: 36659995

Abstract

Multicanonical molecular dynamics (McMD)-based dynamic docking is a powerful tool to not only predict the native binding configuration between two flexible molecules, but it can also be used to accurately simulate the binding/unbinding pathway. Furthermore, it can also predict alternative binding sites, including allosteric ones, by employing an exhaustive sampling approach. Since McMD-based dynamic docking accurately samples binding/unbinding events, it can thus be used to determine the molecular mechanism of binding between two molecules. We developed the McMD-based dynamic docking methodology based on the powerful, but woefully underutilized McMD algorithm, combined with a toolset to perform the docking and to analyze the results. Here, we showcase three of our recent works, where we have applied McMD-based dynamic docking to advance the field of computational drug design. In the first case, we applied our method to perform an exhaustive search between Hsp90 and one of its inhibitors to successfully predict the native binding configuration in its binding site, as we refined our analysis methods. For our second case, we performed an exhaustive search of two medium-sized ligands and Bcl-xL, which has a cryptic binding site that differs greatly between the apo and holo structures. Finally, we performed a dynamic docking simulation between a membrane-embedded GPCR molecule and a high affinity ligand that binds deep within its receptor’s pocket. These advanced simulations showcase the power that the McMD-based dynamic docking method has, and provide a glimpse of the potential our methodology has to unravel and solve the medical and biophysical issues in the modern world.

Supplementary Information

The online version contains supplementary material available at 10.1007/s12551-022-01010-z.

Keywords: Dynamic docking, Protein receptors and their ligands, Multicanonical molecular dynamics, Principal component analysis, Cluster free energy, Binding configurations

Evolution of molecular docking as a tool for drug design: from classical to dynamic approaches

Drug design is the process of developing new chemical compounds and using them to correct the defective molecular mechanism of a biological target. The first step to understanding the molecular mechanism of a compound is to determine how it binds to its bio-molecular target, more specifically, to its protein receptor. Molecular docking between a receptor and a compound, or a library of compounds, has been used for decades in the drug discovery process. Classical docking with programs such as DOCK (Kuntz et al. 1982), AutoDock (Goodsell and Olson 1990), FlexX (Rarey et al. 1996), GOLD (Jones et al. 1997), and myPresto/sievgene (Fukunishi et al. 2005) inherently treats the receptor as a rigid molecule, with only the ligand as fully flexible. However, the docking results strongly depend on the structure of the receptor. One early attempt to include receptor flexibility was to use ensemble docking, where either multiple experimental conformations of a protein were used (Craig et al. 2010; Wada et al. 2011), or by computationally generating the alternative conformations, e.g., via a rotamer library or by molecular dynamics (MD) simulations (Oda et al. 2018), after which the classical docking methodology would be applied to the ensemble to generate the docking poses. While such approaches take protein flexibility into account to some degree, the conformational space searched by such ensemble docking methods is still quite limited. In addition, the scoring functions used by the classical docking approaches lack accuracy due to many approximations and the usage of empirical parameters. Furthermore, to properly understand the molecular mechanism of the drug, it is important to not only understand the final bound state, but also the binding process, including intermediary states. Therefore, a method that can accurately predict protein–ligand complex formation is required.

One solution is to exhaustively sample the conformational space of the receptor and the configurational space of protein–ligand complexes simultaneously using conventional (i.e., to brute force it) MD simulations. Although this would correctly model the binding/unbinding between the receptor and the ligand, the time required to gather enough statistics from the binding/unbinding events to identify the native binding configuration and binding mechanism makes this approach effectively useless, even on expensive, hardware accelerated MD supercomputers such as ANTON (Shaw et al. 2008). In the past decade, an alternative to these expensive computers has become the usage of general-purpose graphics processing units (GPGPU) accelerated PC clusters, enabling larger and longer calculations at much lower costs than before. This has also given rise to a new class of docking methodologies, which have been coined dynamic docking (Gioia et al. 2017). One of the key advantages of dynamic docking techniques is the high accuracy of the binding configuration prediction because it treats both the receptor and the ligand as flexible. However, this still takes an unsurmountable amount of time if performed with conventional MD simulations, due to the existence of high energy barriers that are commonly found in bio-molecular systems. Therefore, using enhanced sampling MD methods such as multicanonical MD (McMD) (Nakajima et al. 1997), replica exchange MD (REMD) (Sugita and Okamoto 1999) , Filling Potential (Fukunishi et al. 2003), MetaDynamics (Laio and Parrinello 2002), Accelerated MD (Hamelberg et al. 2004), mD-VcMD (Hayami et al. 2019), and hypersound accelerated MD (Araki et al. 2021) should enable effective dynamic docking. Each of these methods adds a bias to the system, either by increasing the temperature of the whole or part of the system (e.g., McMD, REMD, AMD), by applying a bias to the center-of-mass (or landmark atom) of a molecule based on a pre-determined reaction coordinate (e.g., FP, MetaD, mD-VcMD), or via non-equilibrium approaches (e.g., hypersound accelerated MD).

One powerful, but woefully underutilized enhanced sampling methodology is McMD, which has a long history at Prof. Nakamura’s lab (Nakajima et al. 1997; Kamiya et al. 2002, 2007, 2008; Ikebe et al. 2011; Higo et al. 2012; Bekker et al. 2017), with a theoretical description of the methodology described in Section S1. Here, a bias, determined via a pre-run, is applied to the system to enable uniform sampling over a specific potential energy range, facilitating a random walk within this energy range, where each energetic state is equally probable to be sampled. This enables exploring a wide conformational space without conformational trapping at local minima, as the method can randomly transition between the higher and lower temperature regions. One of the merits of the McMD method is that the physically accepted ensemble, i.e., the canonical ensemble, can be obtained by a reweighting procedure (Nakajima et al. 1997), which determines the probability that a structure could be sampled at a given temperature given its potential energy. The McMD method first forms the multicanonical ensemble, covering a wide temperature range. Then, the conformational ensemble at 300 K is obtained from the multicanonical ensemble by the reweighting method, enabling structural analysis of the reweighted multicanonical ensemble, to extract representative structures and rank them. REMD is another popular generalized ensemble method (Sugita and Okamoto 1999) and has also been applied to find native binding configurations (Kokubo et al. 2011). For both the McMD method and the REMD method (as well as its derivates REST (Liu et al. 2005) and REST2 (Wang et al. 2011)), the canonical ensemble at 300 K can be reobtained, although with McMD data from the entire energy range can be integrated. With REST and REST2, only part of the system is accelerated, while the other part remains in the canonical ensemble. Nakajima et al. (Nakajima 1998) also developed a two-component McMD version to only accelerate the solute. Unlike McMD, REMD and REST do not require a pre-run, but require manual setting and tuning of parameters such as the temperature of each of the replicas, which increase as the system size increases, as overlap between potential energy distributions of the neighboring replicas is required for the method to work. In addition, REMD requires its replica simulations to be executed simultaneously and synchronously, increasing the computational requirements. With McMD, multiple parallel trajectories are only used to increase statistics and improve sampling efficiency, with each parallel trajectory covering the entire potential energy range. Furthermore, parallel trajectories are executed independently and can thus be executed serially, so McMD could even be used on small clusters or even a single machine, thus making McMD more practical for smaller research groups.

We have taken the powerful McMD-enhanced sampling method and developed our own dynamic docking methodology and toolset (Bekker and Kamiya 2021a), with a schematic overview of our dynamic docking protocol shown in Fig. 1. Using our methodology, we successfully predicted the native binding configuration between lysozyme to an oligosaccharide (Kamiya et al. 2008), cyclin-dependent kinase 2 to a small compound inhibitor (Bekker et al. 2017), β-secretase (BACE) to a medium-sized compound inhibitor (Bekker et al. 2019a), an antibody to its peptide antigen (Bekker et al. 2020b), heat shock protein 90 (Hsp90) to a small compound inhibitor (Bekker et al. 2020a), the binding of medium-sized compounds to the cryptic binding site of B-cell lymphoma-extra large (Bcl-xL) (Bekker et al. 2021b), the binding of a small compound inhibitor to the β2-adrenergic G-protein Coupled Receptor (β2AR GPCR) (Bekker et al. 2021a), and the binding of an antigen peptide to Human Leukocyte Antigen-A*2402 (Bekker and Kamiya 2021b). Here, we will describe our latest efforts in advancing the field of computational drug design by describing the first application of our methodology to exhaustively sample the configurational space between an inhibitor and Hsp90, the dynamic docking of two ligands to the cryptic binding site of Bcl-xL and the docking of a ligand to a membrane-embedded GPCR molecule.

Fig. 1.

Fig. 1

Schematic overview of our McMD-based dynamic docking protocol. The McMD-based dynamic docking protocol can be thought of to consist of eight steps, which are described in Section S2. This image uses data from our WEHI – Bcl-xL docking simulations (Bekker et al. 2021b). The figures were generated using Molmil (Bekker et al. 2016), a molecular viewer developed by Protein Data Bank Japan (Kinjo et al. 2017, 2018; Bekker et al. 2022)

Methodology and application of exhaustive McMD-based dynamic docking to Hsp90 and a high-affinity ligand

In our early works, we focused only on predicting the binding configuration with the assumption that the binding site was known. In those studies (Bekker et al. 2017, 2019a, 2020b), we used a cylinder to restrain the ligand to a binding site and the nearby bulk region, which leads to improved prediction accuracy and simultaneously suppresses the need for extremely long simulations, while still attaining sufficient binding/unbinding event statistics due to the reduced search space. In our Hsp90 docking work, we docked a modified ligand (VER49008) (Schmidtke et al. 2011) with a known high affinity to Hsp90 (Bekker et al. 2020a). However, unlike our previous works, we did not use a cylinder to restrain the ligand, but instead performed an exhaustive search of the configurational space of the entire N-terminal domain of human Hsp90, where its natural ligand, ATP, normally binds. Thus, we intended to not only predict the native binding configuration, but simultaneously the native binding site with this exhaustive simulation. To prevent unfolding at high temperatures during the McMD simulations, the Cα atoms corresponding to the residues belonging to either an α-helix or a β-sheet as calculated by DSSP (Kabsch and Sander 1983) were position restrained during the McMD simulations. For the McMD simulation, we first executed a pre-run to estimate the density of states for a total duration of 10.81 µs and an additional 10.24 µs to equilibrate the system. Finally, a production run lasting 32 µs was executed to sample the binding configurations.

As part of this work, we also introduced a more refined and robust analysis method of the multicanonical ensemble (Fig. 1). In our preceding works, we used simple K-means clustering with k = 10 clusters on the first three principal components (PCs) obtained by principal component analysis (PCA) to obtain a set of representative structures. Here, we made several changes to this methodology. First, instead of just using three PCs, we dynamically calculated the number of PCs so that the cumulative explained variance of the selected components is at least 90%, so that most of the dynamics are described in the coordinates used for the clustering, which amounted to 5 PCs for our Hsp90 simulation data. Subsequently, we performed K-means clustering on this five-dimensional space (PC1–PC5), using a total of k = 1000 clusters and selected a single representative structure for each cluster. Finally, we compared the intermolecular contact matrices of all representative structures with each other using our R-value analysis (Bekker et al. 2019a) to merge similar clusters (R-value > 0.75) together and then ranked the clusters and their corresponding structure based on the cluster free energy (CFE), which is the sum of the McMD-reweighted probabilities of all the structures assigned to each cluster. Here, the R-value is a measure of the similarity rate between the short-range contact matrices of two arbitrary configurations and/or conformations, where a value of 1.0 corresponds to matching contact matrices. The new methodology has several advantages compared to the one we used in our preceding works (Bekker et al. 2019a, 2020b). Specifically, first, the automatic PC selection makes for a robust and scalable method, while it coincidentally also leads to an increase in the number of selected PCs, increasing the descriptiveness of the coordinate space used during the clustering step. Secondly, the significant increase in the number of clusters during the K-means clustering step means that outlier structures are less likely to be assigned to an inappropriate cluster, making the clusters more homogenous and the picked representative structures similarly more representative for their respective cluster. Finally, we obtain a set of the uniquely sampled representative structures along with their relative probabilities (i.e., their CFE) after merging clusters with similar structures (as the initial number of clusters is generally larger than required for the configurational space within the chosen structural similarity cutoff), which we can then use to rank and filter the structures.

For our Hsp90 docking, 12 representative structures were selected within a CFE cutoff of 2.5 kcal/mol (Fig. 2a), where the most top-ranking structure (r1) had an R(native)-value of 0.89 and a root-mean-square-deviation (RMSD) of 0.76 Å, indicating that both the intermolecular interactions between the ligand and Hsp90, as well as the absolute positioning of the ligand, were well reproduced by our simulations. Subsequent canonical MD simulations at 300 K and 400 K, which can be used to validate the results if no experimental data is available (Bekker et al. 2019a, b), validated our results, showing that the most top-ranking structure (r1) is also the most stable one. The second-ranking structure, which was also bound to the same pocket but in a different orientation, also showed a high score during the subsequent validation step, suggesting that this binding configuration would most likely also contribute to the inhibiting effect of the ligand. Finally, our results showed that a loop structure near the binding site of Hsp90, which we referred to by “FHL” (Fig. 2b), had attained a helical conformation in the most top-ranking representative structure r1 (Fig. 2c, e), while a loop-in structure was observed for the experimental structure of our ligand’s analog (Fig. 2b, d), which was postulated by Amaral et al. to be caused by the size of the ligand, where only large ligands would result in a helical state (Amaral et al. 2017). Analysis of our multicanonical ensemble showed however that more than 50% of the near-native binding configurations were in a helical state, compared to only 6% of the unbound structures, which suggested to us that in explicit solvent, some of the interactions between the ligand and Hsp90 in the residues in the FHL as observed in the experimental structure become unstable, causing the FHL to predominantly form a helical structure. Thus, with our first attempt at using McMD-based dynamic docking to perform an exhaustive search of the ligand’s binding configurations did not only accurately reproduce the native binding configuration, it also reproduced the native binding site. In addition, we also predicted an alternative binding configuration in the same site and showed that the FHL region is likely to attain a helical configuration upon binding a relatively small ligand. Our work with Hsp90 has not only provided insight into the molecular mechanism of ligand binding to the protein, but has also enabled us to benchmark exhaustive McMD-based dynamic docking and pioneer a new, more refined, and robust analysis method that we have since used in all our subsequent research.

Fig. 2.

Fig. 2

Docking results obtained from our Hsp90-VER49008 ligand dynamic docking simulations, data available from BSM-00010. (a) Overview of the top-ranking 12 structures, colored by their cluster free energy from blue to red, with the experimental structure in white and the helices and sheets labeled. (b) Cartoon representation of the experimental structure from the front, with the “FHL” loop in pink. This flexible lid region has a sequence of INNLGTIA, spanning from Ile104 to Ala111. (c) Cartoon representation of the top-ranking structure r1 from the front, with the “FHL” loop in pink. (d) Stick representation of the binding site of the experimental structure from the side, with nearby crystal waters shown (note: hydrogens were generated). (e) Stick representation of the binding site of the top-ranking structure r1 from the side, with nearby water molecules shown

Dynamic docking of two medium-sized compounds to the cryptic binding site of Bcl-xL

In our subsequent work, we studied two medium-sized compounds that are known to bind to the cryptic binding site of Bcl-xL (Bekker et al. 2021b), an anti-apoptotic protein that in various cancers becomes overexpressed, with cells losing their ability to inhibit apoptosis, thus making Bcl-xL an important drug target. For this protein, the conformation of the binding site differs greatly between the apo and the holo states, thus requiring large conformational changes to the protein to occur to enable ligand binding. We selected two medium-sized ligands, ABT-737, which was the first BH3-mimic inhibitor developed and mimics the Bcl-2 homology 3 only (BH3-only) motif found in BCL-xL’s natural ligands such as Bim, Puma, and Bad (Petros et al. 2000; Liu et al. 2003), and the second ligand WEHI-539, which was the first Bcl-xL-selective inhibitor developed (hereafter referred to by as ABT and WEHI, respectively).

To give ourselves the most difficult challenge for this project, we had decided to perform an exhaustive binding simulation starting from the apo state of Bcl-xL, thus requiring our McMD-based dynamic docking simulations to not only sample all the binding configurations, but to also make it find the correct binding site first. Unlike our above-described Hsp90 simulation where the apo and holo structures do not differ significantly, here, we have considerable differences between the structures. Furthermore, we were doing two independent simulations, one for each ligand. One of the biggest hurdles was that large conformational changes of Bcl-xL were required, but on the other hand we also had to restrain our protein structure to prevent unfolding during the McMD simulations. To allow for sufficiently large conformational changes to occur, but to prevent complete unfolding, we had decided to go with a weak distance restraint protocol, as the position restraints used during our Hsp90 docking would have been far too stringent. We had previously used distance restraints with our lysozyme and BACE dynamic docking works (Kamiya et al. 2008; Bekker et al. 2019a), but in those cases a very large number of restraints was used, leading to both a more rigid structure and a higher computational load. Instead, we used distance restraints between the backbone oxygen and nitrogen atoms for the residues that form hydrogen bonds to stabilize the secondary structure determined from the apo state. However, as these restraints would only prevent the secondary structure from breaking at high temperature, we also added restraints between a set of manually picked Cα atom pairs to restrain the tertiary structure. For both systems, the pre-run lasted for 24 µs, which was then followed by a production run, each lasting 32 µs. For the analysis of the data, we used the analysis protocol as described above in the Hsp90 section (see also Fig. 1 and Section S2).

Although no cylinder was used, the top four ranking structures were all located in the native binding site for both WEHI (Fig. 3a) and ABT (Fig. 3b). For WEHI, the docking resulted in a top-ranking structure (r1, Fig. 3c) that only had an R(native)-value of 0.447, as the ligand had only bound at the experimental binding site partially, while the other part was in a more easily attainable state pointing outwards, instead of binding inside the cryptic pocket. Notably, this is similar to the configuration of W1191542, an analog of WEHI, as found in the PDB structure 3INQ (Lee et al. 2009). On the other hand, the third-ranking structure (r3, Fig. 3d) had an R(native)-value of 0.752 and an RMSD of 3.18 Å, showing that the experimental configuration can be reproduced, but is difficult to attain due to the required conformational changes of Bcl-xL. The fourth-ranking structure only had an R(native)-value of 0.491, but had an RMSD of only 1.51 Å, showing that the ligand had positioned itself similar to the experimental structure, but that Bcl-xL was in a different conformational state. For the ABT docking, better results were observed, with the top-ranking structure having an R(native)-value of 0.674, while the third-ranking structure had an R(native)-value of 0.844. Thus, for WEHI, we obtained a number of stable binding configurations, presumably due to WEHI’s difficulty of binding deeply inside the pocket and the large conformational changes required of Bcl-xL for this to occur. We subsequently performed our usual canonical MD simulations for validation, showing the best results for the third-ranking structure of WEHI (which is also the most buried configuration), as well as for the top-ranking structure of ABT.

Fig. 3.

Fig. 3

Docking results obtained from our Bcl-xL dynamic docking simulations, data available from BSM-00021. (a) Tube representation of the receptor and stick model for the ligand for the top four ranking structures from our WEHI-ligand docking simulation in green, cyan, magenta and black, respectively. (b) Tube representation of the receptor and stick model for the ligand for the top four ranking structures from our ABT-ligand docking simulation in green, cyan, magenta, and black, respectively. (c) Cartoon representation of the receptor and stick model for the ligand for the r1 structure of our WEHI-ligand docking. Also shown are the nearby residues as stick models. (d) Cartoon representation of the receptor and stick model for the ligand for the r3 structure of our WEHI-ligand docking. Also shown are the nearby residues as stick models

Analyzing the binding/unbinding pathway (step 7 in Fig. 1) indicated that the configurational ensemble had low probability regions along the binding pathway, which suggested that the binding event is sudden and thus follows a population shift between the receptor’s open and closed structures, where Bcl-xL would first need to attain a favorable structure, before the ligand can bind. It also showed that the structure r1 of WEHI (Fig. 3c) was actually an intermediary structure to the final bound state (r3, Fig. 3d). Although both structures started from the same apo state, the different ligands had a profound effect on the conformational ensemble of Bcl-xL, with large conformational changes required to enable the ligand to bind, especially for WEHI, where the ligand binds following a two-step model: first to our top-ranking structure (unbound/intermediary state to r1, Fig. 3c), then after sufficient conformational changes of Bcl-xL so that the cryptic binding site appears, to the near-native third-ranking (Fig. 3d) state (i.e., r1 to r3). This also gives a clue to the low ranking of r3; r1 is a much more easily attainable state, while r3 requires considerable conformational changes of Bcl-xL, followed by a partial flip of WEHI to enable this binding configuration to form.

To validate the pathway and to better analyze it, we also performed path sampling simulations starting from these structures (step 8 in Fig. 1). The calculated affinities showed slightly higher values compared to the experimental ones, but more peculiar was the difference between the two ligands when comparing the experimental to the computational results. Experimentally, ABT was shown to bind a bit stronger to Bcl-xL than WEHI, and WEHI furthermore has faster association and dissociation rates than ABT (Lessene et al. 2013). However, our dynamic docking simulations showed that binding of WEHI (in the r3 configuration, Fig. 3d) to be much more difficult, following a two-step binding mechanism, indicating that binding of WEHI would be slower than ABT; thus, WEHI would have slower association and dissociation rates than ABT to the final native binding configuration. However, since the experimental binding kinetics consider all configurations, it is quite likely that our top-ranking configuration of WEHI (r1, Fig. 3c), which is only the partially bound structure, is the major contributor to the binding kinetics, suggesting that in explicit solvent, the binding populations might be more similar to our dynamic docking results, rather than just the experimental crystal structure, which might have been stabilized due to crystallization effects. This is also corroborated by another PDB structure, 3INQ (Lee et al. 2009), where an analogous ligand of WEHI is bound to Bcl-xL in a configuration similar to WEHI’s top-ranking structure r1. Thus, given this evidence, we can assume that r1 is likely the native binding configuration of WEHI, with r3 a rare, deeper bound state that might sometimes be attained, and that we had thus accurately predicted the native binding configuration for both ligands and their binding free energy.

Dynamic docking of a high-affinity antagonist to a membrane-embedded GPCR

One large difficulty has long been the enhanced sampling of membrane-embedded proteins such as GPCRs, due to their complicated design, which also includes a lipid bilayer, and due to the difficulty of sampling the configurational space of a binding site that is deeply buried inside the receptor. However, we have successfully applied our McMD-based dynamic docking method between a membrane-embedded β2AR GPCR molecule and its high-affinity antagonist, alprenolol (Bekker et al. 2021a). To overcome the difficulty of enhanced sampling with a membrane, we employed a clever application of simple position restraints to only restrain the movement of a subset of membrane atoms along the axis perpendicular to the membrane, thus allowing the lipids to freely move within the membrane without rupturing it during the enhanced MD. The ligand was placed on the extracellular side of the GPCR in the bulk region and a cylinder (perpendicular to the membrane) was used to prevent the ligand from “teleporting” to the intracellular side through the bulk (due to the presence of periodic boundary conditions), and to prevent the ligand from getting trapped near the charged membrane surface. The GPCR was restrained using distance restraints to protect the secondary (backbone N–O restraints) and tertiary (Cα-Cα restraints) structure. First, a 68.16 µs pre-run was performed, followed by a 48 µs production run, which was then analyzed using our analysis protocol as described above (Fig. 1).

Analysis of the ensemble gave us 11 representative structures within a CFE of 1.5 kcal/mol (Fig. 4a), with the top-ranking one (r1) matching that of the experimental structure with an R(native)-value of 0.999 and an RMSD of 0.72 Å (Fig. 4b). The subsequent canonical MD simulations corroborated these results, with r1 also being the most stable one. Analysis of the binding pathway from the multicanonical ensemble enabled us to determine the molecular mechanism of ligand binding. We found that a gatekeeper residue Phe193, along with its neighboring residues Asp192 and Phe194 and their interacting residues, is central in guiding the ligand from the outer pocket (r261, Fig. 4c) at the extracellular side of the receptor, into the inner pocket (r264, Fig. 4d), where the ligand binds in its native configuration to the orthosteric site. As the ligand approaches the pocket entrance from the bulk region, it hits the gateway residue Phe193. The ligand then pushes Phe194 inside, which in turn pushes the gateway residue to the back of the outer pocket (Fig. 4c). Simultaneously, the positively charged nitrogen atom of the ligand starts interacting with Asp192, which in turn weakens Asp192’s charge-charge interaction with Lys305, leading their salt bridge to weaken and eventually break (Fig. 4c). This causes the pocket to widen and the loop that is formed by the gateway forming residues Asp192, Phe193, and Phe194, to destabilize, opening the path to the inner pocket, allowing the ligand to finally pass through, after which the gateway can close up again after the ligand has migrated to the inner pocket to bind to the orthosteric site (Fig. 4d).

Fig. 4.

Fig. 4

Docking results obtained from our GPCR dynamic docking simulations, data available from BSM-00024. (a) Overview of our top 11 structures, with the ligand, alprenolol, structures colored based on their cluster free energy from blue to red, with the experimental structure in white. (b) Close-up of the orthosteric binding site, with the top-ranking structure (blue), the experimental structure (green), and the sidechains of nearby residues shown. (c) Structures r1 and r261 shown in black and magenta, respectively, where ligand of r261 is positioned in the outer pocket and the receptor is in a closed state, similar to the native structure, except for the gateway loop, which is in a non-native state. (d) Structures r1 and r264 shown in black and magenta, respectively, where the ligand of r264 has started to enter the inner binding site (as it passes through where Phe194 is positioned in the natively bound configuration), but the receptor’s gateway loop still in an open state

We also found that the second-ranking binding configuration might also contribute to the inhibitory effects of the ligand, as, although it does not enter the inner pocket, it binds to the GPCR in the outer pocket, before the gateway residue Phe193. It is likely that the ligand in this binding configuration can actually prevent the GPCR’s natural ligand epinephrine from passing through, thus providing an inhibitory effect. Since alprenolol would have to perform some acrobatics to get through the GPCR’s gateway, while to epinephrine, which is a molecule that is much smaller than alprenolol, and therefore would have much less trouble passing the gateway, the inhibitory effect of alprenolol in the second-ranking binding site might actually be considerable, as it would be able to inhibit the GPCR without having to enter the inner pocket. Thus, we found that alprenolol might not only function as an orthosteric inhibitor, but also as an allosteric one, where improvements to the chemical structure of the molecule could potentially be made to make it a better allosteric inhibitor, with improved affinity and specificity to the outer binding site.

Conclusion

Over the past decade, we have pushed the boundaries of the computational drug design field with our McMD-based dynamic docking approach. As part of Prof. Nakamura’s lab, we have developed the McMD-based dynamic docking approach, and since then, we have further refined it as shown in Fig. 1, and looked for new boundaries to surpass. We have shown that our methodology can not only be used to predict the correct binding configuration, but also to predict it in conjunction with it having to predict the correct binding site with our exhaustive simulations. Even for the difficult Bcl-xL docking, where its cryptic pocket differs greatly between the apo and holo states, we were able to accurately predict the binding configuration of the two large ligands. We also showed that our methodology can be used for membrane-embedded systems by clever usage of position restraints that prevent the membrane from rupturing, without significantly inhibiting membrane dynamics. One of the next stages in pushing the boundaries would be to apply our dynamic docking approach to protein–protein docking, with both molecules of similar size, unlike the protein-peptide dockings we have done thus far (Bekker et al. 2020b; Bekker and Kamiya 2021b).

One of the great benefits of McMD-based dynamic docking is that it not only accurately predicts the native binding configuration, but it can also be used to determine the binding/unbinding pathway, and thus provides insight into the molecular mechanism of binding. By understanding the molecular mechanism of binding, one can start to tune the compound by modifying its functional groups. Thus, our McMD-based dynamic docking approach would be a useful tool for lead optimization, where dynamic docking would first be performed using an initial version of the lead. Then, analysis of the molecular mechanism of binding can provide insight into how the compound binds to the native site, intermediary sites, and non-native sites, to improve the binding profile, e.g., by improving specificity to the native site or early intermediary binding sites to improve the association constant, with subsequent McMD-based dynamic docking simulations for validation and additional stages of optimization. With further increases of computational performance, as well as by tuning the computational parameters of the McMD-based dynamic docking simulation (simulation length, timestep, restraints, etc.), the McMD-based dynamic docking approach will become an attractive tool to model the binding mechanism not only in an academic environment, but also an industrial one, where McMD-based dynamic docking could be used to optimize a lead compound into a high-affinity and high-specificity drug.

Supplementary Information

Below is the link to the electronic supplementary material.

Acknowledgements

We are especially grateful to Prof. Haruki Nakamura for his advice and ideas regarding the development of our dynamic docking methodology.

Funding

This work was supported by Japan Agency for Medical Research and Development (AMED) to N.K., and by the Grand-in-Aid for Scientific Research from the Japan Society for the Promotion of Science (JP20H03229). It was performed in part under the Cooperative Research Program of the Institute for Protein Research, Osaka University, CR-21–05 and CR-22–05. Computational resources of the TSUBAME3.0 system, Tokyo Institute of Technology, were provided by the HPCI Research Project (hp190018, hp190021, hp190027, hp200011, hp200025, hp200063, hp210002, hp210005, hp210048, hp220002, hp220015, and hp220026).

Data availability

The representative structures of our McMD-based dynamic docking simulations have been submitted to the Biological Structure Model Archive (https://bsma.pdbj.org) under BSMIDs BSM-00002, BSM-00007, BSM-00008, BSM-00010, BSM-00021, BSM-00024, and BSM-00029 (Bekker et al. 2020c).

The McMD-enhanced version of GROMACS is available from https://gitlab.com/gjbekker/gromacs, along with analysis tools.

Declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Conflict of interest

Not applicable.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Gert-Jan Bekker, Email: gertjan.bekker@protein.osaka-u.ac.jp.

Narutoshi Kamiya, Email: n.kamiya@sim.u-hyogo.ac.jp.

References

  1. Amaral M, Kokh DB, Bomke J, et al. Protein conformational flexibility modulates kinetics and thermodynamics of drug binding. Nat Commun. 2017;8:2276. doi: 10.1038/s41467-017-02258-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Araki M, Matsumoto S, Bekker G-J, et al. Exploring ligand binding pathways on proteins using hypersound-accelerated molecular dynamics. Nat Commun. 2021;12:2793. doi: 10.1038/s41467-021-23157-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bekker G-J, Kamiya N. Dynamic docking using multicanonical molecular Dynamics: simulating complex formation at the atomistic level. In: Ballante F, editor. Protein-ligand interactions and drug design. Heidelberg: Springer; 2021. pp. 187–202. [DOI] [PubMed] [Google Scholar]
  4. Bekker G-J, Kamiya N. N-Terminal-driven binding mechanism of an antigen Peptide to human leukocyte antigen-A*2402 elucidated by multicanonical molecular dynamic-based dynamic docking and path sampling simulations. J Phys Chem B. 2021;125:13376–13384. doi: 10.1021/acs.jpcb.1c07230. [DOI] [PubMed] [Google Scholar]
  5. Bekker G-J, Nakamura H, Kinjo AR. Molmil: a molecular viewer for the PDB and beyond. Journal of Cheminformatics. 2016;8:42. doi: 10.1186/s13321-016-0155-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bekker G-J, Kamiya N, Araki M, et al. Accurate prediction of complex structure and affinity for a flexible protein receptor and its inhibitor. J Chem Theory Comput. 2017;13:2389–2399. doi: 10.1021/acs.jctc.6b01127. [DOI] [PubMed] [Google Scholar]
  7. Bekker G-J, Araki M, Oshima K, et al. Dynamic Docking of a Medium-Sized Molecule to Its Receptor by Multicanonical MD Simulations. J Phys Chem B. 2019;123:2479–2490. doi: 10.1021/acs.jpcb.8b12419. [DOI] [PubMed] [Google Scholar]
  8. Bekker G-J, Ma B, Kamiya N. Thermal stability of single-domain antibodies estimated by molecular dynamics simulations. Protein Sci. 2019;28:429–438. doi: 10.1002/pro.3546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bekker G-J, Araki M, Oshima K, et al. Exhaustive search of the configurational space of heat-shock protein 90 with its inhibitor by multicanonical molecular dynamics based dynamic docking. J Comput Chem. 2020;41:1606–1615. doi: 10.1002/jcc.26203. [DOI] [PubMed] [Google Scholar]
  10. Bekker G-J, Fukuda I, Higo J, Kamiya N. Mutual population-shift driven antibody-peptide binding elucidated by molecular dynamics simulations. Sci Rep. 2020;10:1406. doi: 10.1038/s41598-020-58320-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bekker G-J, Kawabata T, Kurisu G. The Biological Structure Model Archive (BSM-Arc): an archive for in silico models and simulations. Biophys Rev. 2020;12:371–375. doi: 10.1007/s12551-020-00632-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bekker G-J, Araki M, Oshima K, et al. Accurate binding configuration prediction of a G-protein-coupled receptor to its antagonist using multicanonical molecular dynamics-based dynamic docking. J Chem Inf Model. 2021;61:5161–5171. doi: 10.1021/acs.jcim.1c00712. [DOI] [PubMed] [Google Scholar]
  13. Bekker G-J, Fukuda I, Higo J, et al. Cryptic-site binding mechanism of medium-sized Bcl-xL inhibiting compounds elucidated by McMD-based dynamic docking simulations. Sci Rep. 2021;11:5046. doi: 10.1038/s41598-021-84488-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bekker G-J, Yokochi M, Suzuki H, et al. Protein Data Bank Japan: celebrating our 20th anniversary during a global pandemic as the Asian hub of three dimensional macromolecular structural data. Protein Sci. 2022;31:173–186. doi: 10.1002/pro.4211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Craig IR, Essex JW, Spiegel K. Ensemble docking into multiple crystallographically derived protein structures: an evaluation based on the statistical analysis of enrichments. J Chem Inf Model. 2010;50:511–524. doi: 10.1021/ci900407c. [DOI] [PubMed] [Google Scholar]
  16. Fukunishi Y, Mikami Y, Nakamura H. The filling potential method: a method for estimating the free energy surface for protein-ligand docking. J Phys Chem B. 2003;107:13201–13210. doi: 10.1021/jp035478e. [DOI] [Google Scholar]
  17. Fukunishi Y, Mikami Y, Nakamura H. Similarities among receptor pockets and among compounds: analysis and application to in silico ligand screening. J Mol Graph Model. 2005;24:34–45. doi: 10.1016/j.jmgm.2005.04.004. [DOI] [PubMed] [Google Scholar]
  18. Gioia D, Bertazzo M, Recanatini M, et al. Dynamic docking: a paradigm shift in computational drug discovery. Molecules. 2017;22:2029. doi: 10.3390/molecules22112029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Goodsell DS, Olson AJ. Automated docking of substrates to proteins by simulated annealing. Proteins. 1990;8:195–202. doi: 10.1002/prot.340080302. [DOI] [PubMed] [Google Scholar]
  20. Hamelberg D, Mongan J, McCammon JA. Accelerated molecular dynamics: a promising and efficient simulation method for biomolecules. J Chem Phys. 2004;120:11919–11929. doi: 10.1063/1.1755656. [DOI] [PubMed] [Google Scholar]
  21. Hayami T, Higo J, Nakamura H, Kasahara K. Multidimensional virtual-system coupled canonical molecular dynamics to compute free-energy landscapes of peptide multimer assembly. J Comput Chem. 2019;40:2453–2463. doi: 10.1002/jcc.26020. [DOI] [PubMed] [Google Scholar]
  22. Higo J, Ikebe J, Kamiya N, Nakamura H. Enhanced and effective conformational sampling of protein molecular systems for their free energy landscapes. Biophys Rev. 2012;4:27–44. doi: 10.1007/s12551-011-0063-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ikebe J, Umezawa K, Kamiya N, et al. Theory for trivial trajectory parallelization of multicanonical molecular dynamics and application to a polypeptide in water. J Comput Chem. 2011;32:1286–1297. doi: 10.1002/jcc.21710. [DOI] [PubMed] [Google Scholar]
  24. Jones G, Willett P, Glen RC, et al. Development and validation of a genetic algorithm for flexible docking 1 1Edited by F. E Cohen J Mole Biol. 1997;267:727–748. doi: 10.1006/jmbi.1996.0897. [DOI] [PubMed] [Google Scholar]
  25. Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22:2577–2637. doi: 10.1002/bip.360221211. [DOI] [PubMed] [Google Scholar]
  26. Kamiya N, Higo J, Nakamura H. Conformational transition states of a β-hairpin peptide between the ordered and disordered conformations in explicit water. Protein Sci. 2002;11:2297–2307. doi: 10.1110/ps.0213102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kamiya N, Mitomo D, Shea JE, Higo J. Folding of the 25 Residue Aβ(12–36) peptide in TFE/water: temperature-dependent transition from a funneled free-energy landscape to a rugged one. J Phys Chem B. 2007;111:5351–5356. doi: 10.1021/jp067075v. [DOI] [PubMed] [Google Scholar]
  28. Kamiya N, Yonezawa Y, Nakamura H, Higo J. Protein-inhibitor flexible docking by a multicanonical sampling: native complex structure with the lowest free energy and a free-energy barrier distinguishing the native complex from the others. Proteins. 2008;70:41–53. doi: 10.1002/prot.21409. [DOI] [PubMed] [Google Scholar]
  29. Kinjo AR, Bekker G-J, Suzuki H, et al. Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures. Nucleic Acids Res. 2017;45:D282–D288. doi: 10.1093/nar/gkw962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Kinjo AR, Bekker G-J, Wako H, et al. New tools and functions in data-out activities at Protein Data Bank Japan (PDBj) Protein Sci. 2018;27:95–102. doi: 10.1002/pro.3273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kokubo H, Tanaka T, Okamoto Y. Ab initio prediction of protein–ligand binding structures by replica-exchange umbrella sampling simulations. J Comput Chem. 2011;32:2810–2821. doi: 10.1002/jcc.21860. [DOI] [PubMed] [Google Scholar]
  32. Kuntz ID, Blaney JM, Oatley SJ, et al. A geometric approach to macromolecule-ligand interactions. J Mol Biol. 1982;161:269–288. doi: 10.1016/0022-2836(82)90153-X. [DOI] [PubMed] [Google Scholar]
  33. Laio A, Parrinello M. Escaping free-energy minima. Proc Natl Acad Sci USA. 2002;99:12562–12566. doi: 10.1073/pnas.202427399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lee EF, Czabotar PE, Yang H, et al. Conformational changes in Bcl-2 pro-survival proteins determine their capacity to bind ligands. J Biol Chem. 2009;284:30508–30517. doi: 10.1074/jbc.M109.040725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Lessene G, Czabotar PE, Sleebs BE, et al. Structure-guided design of a selective BCL-XL inhibitor. Nat Chem Biol. 2013;9:390–397. doi: 10.1038/nchembio.1246. [DOI] [PubMed] [Google Scholar]
  36. Liu X, Dai S, Zhu Y, et al. The structure of a Bcl-xL/Bim fragment complex. Immunity. 2003;19:341–352. doi: 10.1016/S1074-7613(03)00234-6. [DOI] [PubMed] [Google Scholar]
  37. Liu P, Kim B, Friesner RA, Berne BJ. Replica exchange with solute tempering: a method for sampling biological systems in explicit water. Proc Natl Acad Sci USA. 2005;102:13749–13754. doi: 10.1073/pnas.0506346102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Nakajima N. A selectively enhanced multicanonical molecular dynamics method for conformational sampling of peptides in realistic water molecules. Chem Phys Lett. 1998;288:319–326. doi: 10.1016/S0009-2614(98)00271-1. [DOI] [Google Scholar]
  39. Nakajima N, Nakamura H, Kidera A. Multicanonical ensemble generated by molecular dynamics simulation for enhanced conformational sampling of peptides. J Phys Chem B. 1997;101:817–824. doi: 10.1021/jp962142e. [DOI] [Google Scholar]
  40. Oda M, Inaba S, Kamiya N, et al. Structural and thermodynamic characterization of endo-1,3-β-glucanase: insights into the substrate recognition mechanism. Biochimica Et Biophysica Acta (BBA) - Proteins Proteomics. 2018;1866:415–425. doi: 10.1016/j.bbapap.2017.12.004. [DOI] [PubMed] [Google Scholar]
  41. Petros AM, Nettesheim DG, Wang Y, et al. Rationale for Bcl-X L /Bad peptide complex formation from structure, mutagenesis, and biophysical studies. Protein Sci. 2000;9:2528–2534. doi: 10.1110/ps.9.12.2528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Rarey M, Kramer B, Lengauer T, Klebe G. A fast flexible docking method using an incremental construction algorithm. J Mol Biol. 1996;261:470–489. doi: 10.1006/jmbi.1996.0477. [DOI] [PubMed] [Google Scholar]
  43. Schmidtke P, Luque FJ, Murray JB, Barril X. Shielded hydrogen bonds as structural determinants of binding kinetics: application in drug design. J Am Chem Soc. 2011;133:18903–18910. doi: 10.1021/ja207494u. [DOI] [PubMed] [Google Scholar]
  44. Shaw DE, Deneroff MM, Dror RO, et al. Anton, a special-purpose machine for molecular dynamics simulation. Commun ACM. 2008;51:91–97. doi: 10.1145/1364782.1364802. [DOI] [Google Scholar]
  45. Sugita Y, Okamoto Y. Replica-exchange molecular dynamics method for protein folding. Chem Phys Lett. 1999;314:141–151. doi: 10.1016/S0009-2614(99)01123-9. [DOI] [Google Scholar]
  46. Wada M, Kanamori E, Nakamura H, Fukunishi Y. Selection of in silico drug screening results for G-protein-coupled receptors by using universal active probes. J Chem Inf Model. 2011;51:2398–2407. doi: 10.1021/ci200236x. [DOI] [PubMed] [Google Scholar]
  47. Wang L, Friesner RA, Berne BJ. Replica exchange with solute scaling: a more efficient version of replica exchange with solute tempering (REST2) J Phys Chem B. 2011;115:9431–9438. doi: 10.1021/jp204407d. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The representative structures of our McMD-based dynamic docking simulations have been submitted to the Biological Structure Model Archive (https://bsma.pdbj.org) under BSMIDs BSM-00002, BSM-00007, BSM-00008, BSM-00010, BSM-00021, BSM-00024, and BSM-00029 (Bekker et al. 2020c).

The McMD-enhanced version of GROMACS is available from https://gitlab.com/gjbekker/gromacs, along with analysis tools.


Articles from Biophysical Reviews are provided here courtesy of Springer

RESOURCES