Abstract
The anomalous binding modes of five highly similar fragments of TIE2 inhibitors, showing three distinct binding poses, are investigated. We report a quantitative rationalization for the changes in binding pose based on molecular dynamics simulations. We investigated five fragments in complex with the transforming growth factor β receptor type 1 kinase domain. Analyses of these simulations using Grid Inhomogeneous Solvation Theory (GIST), pKA calculations, and a tool to investigate enthalpic differences upon binding unraveled the various thermodynamic contributions to the different binding modes. While one binding mode flip can be rationalized by steric repulsion, the second binding pose flip revealed a different protonation state for one of the ligands, leading to different enthalpic and entropic contributions to the binding free energy. One binding pose is stabilized by the displacement of entropically unfavored water molecules (binding pose determined by solvation entropy), ligands in the other binding pose are stabilized by strong enthalpic interactions, overcompensating the unfavorable water entropy in this pose (binding pose determined by enthalpic interactions). This analysis elucidates unprecedented details determining the flipping of the binding modes, which can elegantly explain the experimental findings for this system.
Introduction
Structure-based drug design is used to guide drug discovery with the help of a known three-dimensional structure of a potential drug target. The correct prediction of small molecule binding to a target is essential for the success of structure-based drug design projects. The most common way to experimentally determine binding modes is by X-ray crystallography. As obtaining a crystal structure for every ligand modification is rather time-consuming, it is usually assumed that minor ligand modifications do not alter the binding pose and only a few ligands with major alterations are crystallized. However, it is rather intricate to predict which modification preserves the binding mode and which one leads to an alternate binding mode and may result in incorrect interpretations in structure-based design. Therefore, it is essential to know how often a structural revalidation of the binding modes by X-ray crystallography is required. Unfortunately, there is no simple answer to this question.1 Small molecules, e.g. fragments, may readily change their binding modes upon larger modifications (for fragments already an absolute small modification can be large relative to the fragment size) but on the other hand may preserve key binding interactions when bound to binding site hot-spots.2 Nonadditive behavior of substituents in structure–activity relationships can provide crucial hints on changes in binding modes even in absence of structural information.3 Additionally, binding site dynamics may give rise to an ensemble of receptor states that in turn allow different binding poses for small molecules.4 A detailed understanding of how and why different binding modes originate is crucial to improve methods to correctly predict binding poses5 and to rationalize structure–activity relationships.3
One of the studied potential drug targets is TIE2, an angiopoietin receptor, involved in the formation of new blood vessels. Fragments of a potential TIE2 inhibitor6,7 were studied by Czodrowski et al. to determine the contributions of individual functional groups to ligand binding free energy.8 Five related compounds (Figure 1) were crystallized in complex with the transforming growth factor β receptor type 1 kinase domain (TGFBR1). Surprisingly, these five fragments differ in their binding modes. Figure 1A shows the core fragment with the hinge binding motif 4-amino-8H-pyrido[2,3-d]pyrimidin-5-one of the initial inhibitor, while Figures 1B–E show four molecules with different substitutions on the core fragment. Interestingly, the five compounds show three different hinge binding modes.
Compounds A and B (Figure 1) show the same binding mode (Pose A). As compounds C, D, and E each have substituents at the amino group in position 4, they must have an alternate binding mode due to steric repulsion. Unexpectedly, these three similar compounds show different binding poses. Compound C with a p-anilino group binds such that the substituent is solvent exposed (binding pose C, Figure 2: green ligand). Compound D with the m-anilino substitution and E with the p-pyridine moiety bind in an orientation, where the substituents point to the buried, less solvent exposed pocket (binding pose D, Figure 2: blue ligand). If only one of these five crystal structures were obtained, these changes in binding conformation would not have been predicted correctly, resulting in misinterpretation of structure–activity relationships.
Czodrowski et al. attempted to rationalize the observed binding mode flips.8 They captured the thermodynamic behavior of the different ligands and binding poses by performing docking studies and analyzing solvent molecules around the protein using SZMAP9 and WaterMap.10 The thermodynamic interpretation originally presented is not unambiguous. It was claimed that binding pose D can be explained by the replacement of a “happy” (Watermap)/“hydrophilic” (SZMAP) water. However, replacement of “happy”/“hydrophilic” water molecules has usually a negative contribution to the free energy of binding because they have a lower free energy than bulk waters. Hence, the replacement of these water molecules is unfavorable in terms of binding free energy,10,11 and therefore that interpretation is doubtful. In this study, we present a theoretical paper of the thermodynamics of these binding poses to clarify the unexpected behavior of the five ligands and to shed light on the question why compound C is not found in the same binding pose as compound D and vice versa.
To correctly describe the thermodynamics of ligand binding, it is necessary to study not only the direct enthalpic interactions of the ligand and the protein but also to account for differences in the solvation of the formed complexes. The (de)solvation of the protein binding site and the ligand can be an important driving force for the biomolecular recognition of a ligand and can be as important as the direct contact between protein and ligand.12−14 This versatile behavior of water in binding pockets attracted the attention of the scientific field, which resulted in an increasing number of publications concerning the analysis of water molecules in ligand-binding with a vast variety of different techniques,9,10,15−17 making it one of the “hot topics” in medicinal chemistry.
Furthermore, it is also necessary to calculate both enthalpic and entropic contributions, because the free energy is generally not correlated to the enthalpy or the entropy alone.18 In this contribution, we analyze the different binding modes in terms of thermodynamics and decompose the (binding) free energy into entropic and enthalpic contributions of the complex and the solvent molecules.
We use Molecular Dynamics (MD) simulations in combination with Grid Inhomogeneous Solvation Theory17,19 (GIST) and the Liner Interaction Energy (LIE)20,21 module of AmberTools15 to analyze the thermodynamic properties of the ligand binding poses. To estimate the protonation of all ligands we perform pKA calculations because results are expected to depend on the protonation state of the ligands. Only the combination of these computational methods allows us to explain the experimental observed binding pose changes in great detail.
Materials & Methods
To capture the complete thermodynamics of the ligand binding process—including the flip in the binding pose from compound C to D (Figure 1)—we have to use an amalgam of analyses tools.
Molecular Dynamics Simulations
The two ligand fragments (compound C and D) in their neutral and positively charged form were parametrized using the antechamber module of the Amber package22 with the AM1-BCC charge model23,24 and the general Amber force field25 (partial charges for all ligands are shown in Figures S1–S4). In addition to the crystal structures, structures of compounds C and D in the other possible binding pose were modeled. This was achieved by “in silico” modification of the ligands. The two ligands were simulated in both protonation states and in both binding poses in complex with the TGFBR1 kinase domain using experimental crystal structures (PDB: 4X2K, 4X2G) and as unbound ligands in solution. In total, eight simulations of the ligands in complex with the protein were performed plus four simulations of the unbound ligands. ff14SB26 was chosen as protein force field. All ligand protein complexes were solvated with TIP4P27 water molecules in an octahedral box. Periodic boundary conditions were applied and a minimum distance of 15 Å between the solute and the edge of the box was chosen. Equilibration of the systems was performed according to an established protocol previously developed in our group.28 A Langevin thermostat was used to keep the temperature at 300 K. The pressure of 1 bar was kept by using an isotropic implementation of the Berendsen barostat. The time step was set to 2 fs and coordinates were saved every 10 ps for a 200 ns trajectory without restraints.
Five representative conformations were obtained by clustering a 200 ns trajectory of each ligand using a hierarchical agglomerative (bottom-up) approach implemented in cpptraj.29 Five clusters were used as we had to find a compromise between calculation effort and accuracy. The 2000 water molecules closest to the complex of every representative conformer were retained, they were again solvated in an octahedral box and equilibrated. During equilibration and simulation of the five clusters we applied coordinate restraints to the whole protein and the ligand, as it is required for the GIST analysis.17 Every cluster was simulated for 100 ns and coordinates were saved every 100 ps.
Grid Inhomogeneous Solvation Theory
GIST allows for a thermodynamic analysis of water molecules around a solute. The method calculates the free energy of water molecules on a grid based approach for a single conformation of the solute. Recently published studies highlighted that GIST is a valid and useful approach to study biomolecular systems in combination with molecular dynamics simulations,19,30 allowing one to differentiate between entropic and enthalpic contributions to the total free energy. For all calculated values, the references state, to which all values refer, is pure water. A detailed description of how entropic and enthalpic terms are calculated is given by Nguyen et al.17 In all our calculations, a temperature of 300 K was chosen.
In an approximated manner, the free energy of solvation of a solute ΔGSolv can be written according to eq 1.
1 |
It is defined as the integral over the solvation free energies ΔGSolv(q) of a solute determined for each of its conformations q times the probability p(q) to find the solute in this conformation. To calculate ΔGSolv(q) for a particular conformation q, its coordinates are restrained to retain this conformation. In this paper we consider the occurrence of different relevant conformations by simulating an unrestraint complex system. The trajectory is subsequently clustered into five representative conformations. Those are simulated with coordinate restraints and analyzed using GIST. Equation 1 is used to calculate the solvation free energy ΔGSolv from the solvation free energies of the five restrained solute conformers ΔGSolv(q) (the integral is approximated with a discrete sum). The probabilities p(q) are the relative cluster sizes, i.e., the number of structures per cluster.
GIST analyses various thermodynamic properties of water with a grid based approach. We want to emphasize that GIST covers the thermodynamic properties of the water only and it does not cover the entropic and enthalpic differences of the ligand and the protein upon binding. The results of a GIST calculation are multiple grids. As continuous grids are difficult to visualize, we tried to enhance the visualization of our results by extracting the most abundant water positions from the grid values.
To find sensible water positions, a script was written to search for the grid point with the highest water density. From this grid point and the surrounding grid points the density of one water molecule is subtracted. Afterward, the next most abundant grid point is found by the program and again the density of one water molecule is subtracted. This is done for 99% of the water density. A 1% residual density is necessary due to boundary effects. To assign thermodynamic values to the refined water positions, the thermodynamic value (e.g., entropy, enthalpy, or free energy) of the analyzed grid points are averaged (density weighted). Figure 3 shows the results for the total entropy of the water molecules in a grid representation (A) and with representative water molecules (B). Each depicted water molecule has an entropy value larger than 3.5 kcal/mol. An analogous procedure can be applied to all other thermodynamic values of the GIST output; see ref (17). Aside from an easier visualization an additional advantage of this representation is that less abundant (less populated) grid points are weighted to a lesser extent and therefore points with high entropy due to low occupancy are not overinterpreted.
In contrast to the visualization of single water molecules and their corresponding properties, we also aimed for a quantitative description of the binding thermodynamics. Therefore, we summed the thermodynamic value of interest (density-weighted) over all grid points of the ligand binding region to capture differences in the overall thermodynamics. To ensure that roughly the same volume is used to estimate the water properties of the pocket for every simulation, all grid points within 5 Å of the ligand, the ASP-290, or the GLU-245 residue (shown in Figure 3) are used to calculate the thermodynamic properties of the pocket.
Binding Enthalpies
As the GIST analysis omits the enthalpic interactions between the ligand and the protein, we choose a method explicitly including this interaction. Therefore, we used the LIE implementation of the AmberTools15 package to estimate the enthalpy of ligand binding.20,21 In LIE, eq 2 is applied to estimate the free energy of solvation:
2 |
ΔUl–s describes the difference in interaction energy between the unbound solvated ligand and the ligand in complex with the protein plus solvation. If we set the parameters α = β = 1 and γ = 0, we obtain the difference in interaction energy ΔU for the ligand in the bound and unbound state. In LIE usually the parameters α and β are fitted to obtain values for ΔG. In contrast, the method with the suggested parameters (α = β = 1 and γ = 0) is a measure for the change in interaction enthalpy between the ligand in the bound and in the unbound state. Therefore, this method includes the interaction of the ligand with the protein, which is not captured by the GIST analysis. This method was further used to analyze the difference in the binding enthalpy between a protonated and the neutral form of the ligands C and D.
pKA Calculations
Implicit solvent calculations31,32 can be employed for the calculation of pKA values. We make use of the OpenEye software protein_pka which is based on their Poisson–Boltzmann solver ZAP.33 The actual pKA value is derived from a thermodynamic cycle: the solvation energy of the protonated and deprotonated form of the residue (amino acid or ligand) is computed by solving the Poisson–Boltzmann equation whereas the pKA value of the residue in aqueous solution is taken from literature or experiment. Based on these contributions, the pKA value can finally be computed. However, sampling of the different possibilities of the charge states (residue 1 protonated and residue 2 protonated, residue 1 protonated and residue 2 deprotonated, etc.) of the residues is necessary: protein_pka makes use of Monte Carlo sampling to simulate the effect of different charge states. The protein structures were prepared by protein_pka (addition of hydrogen atoms and rebuilding missing heavy atoms). AM1-BCC charges23,24 were used for the protein and the ligand. The protonation states can be computed using the Henderson–Hasselbach equation. Several publications highlight a protonation effect upon ligand binding to a protein.34−37
Results and Discussion
As discussed in a previously published study by Czodrowski et al. the role of water molecules in the proximity of the binding pocket can play a significant role in explaining the two different binding modes.8 To simplify the complex process of solvation, we distinguish in a first approximation between water molecules with a negative and a positive free energy value compared with water molecules in bulk. If the free energy of the water molecule is more positive than in bulk water, we can gain free energy of binding when the water is removed from the protein and released into bulk. Water molecules with a positive free energy value in comparison to bulk water are either entropically constrained, i.e., restricted in their movement with no or too low enthalpic compensation (entropically unfavored water position, e.g., formation of a water wire or water polygons in an apolar environment), or have in general too low enthalpic interactions in comparison with bulk water (enthalpically unfavored water position, e.g., hydrophobic pocket).38 Water molecules with a negative free energy value compared to bulk water often show strong enthalpic interactions. These strong enthalpic interactions lead to a highly favorable enthalpy of solvation. However, the phase space that these enthalpically strong bound water molecules can occupy (e.g., water near a charged residue) is usually restricted, resulting in a low entropy contribution. Therefore, the very favorable enthalpy of solvation is partially compensated by entropic restrictions. This phenomenon is called entropy-enthalpy compensation.39−41 As neither enthalpy nor entropy is able to describe the differences in free energy completely,18 it is necessary to investigate both enthalpy and entropy to identify water molecules with a positive free energy contribution.
To explain why compound C is not found in pose D and vice versa, analyses of all possible binding poses and different protonation states were performed (summarized in Figure 4). Once more we would like to point out that the two ligands only differ in the position of the amino group, which is in para-position in compound C and in meta-position in D.
We start the thermodynamic analysis by investigating the free energy contributions of water molecules for the four uncharged states (red box of Figure 4). As already pointed out by Czodrowski et al.8 the differences between compound C and D in terms of physicochemical properties are very subtle at first glance. Redocking for both compounds was successful but when compound D was docked into the crystal structure of compound C binding pose C was obtained. The same result was obtained for the cross-docking of compound C into the crystal structure of compound D.
To rationalize the binding behavior we started with analyzing the solvent structure of the neutral ligands in the binding pose from the crystal structure and in the docked binding pose. The visualization of the GIST water entropies can be seen in Figure 5.
Entropically unfavorable water sites in respect to bulk water (−TΔS > 3.5 kcal/mol) are shown in Figure 5 as blue spheres. For compound D (Figure 5: bottom) we find that the binding pose D (left) has significantly fewer entropically unfavorable water molecules than binding pose C (right). Thus, for compound D the binding pose D is entropically favored over pose C. Some of these entropically unfavorable water molecules do not show strong enthalpic interactions with the ligand or the protein or other water molecules. The free energy of these water molecule is high in comparison to bulk water molecules. In the buried pocket (red oval in Figure 5) such water molecules with a positive contribution to the free energy are found, which can be replaced by a ligand, as found for compound D in pose D.
However, also for compound C binding pose D shows significantly fewer ordered water molecules (Figure 5: top), indicating that our analysis is missing important details for this ligand. To shed light on this behavior, enthalpic and entropic contributions to solvation as well as the resulting free energy of water molecules within the previously mentioned 5 Å radius to the binding pocket are studied and results listed in Table 1.
Table 1. Thermodynamic Values of Pocket Water Molecules from the GIST Calculations (kcal/mol).
compound | pose | charge | ΔH | –TΔS | ΔG |
---|---|---|---|---|---|
C | C | neutral | –156.0 | 74.1 | –82.4 |
D | –162.0 | 70.3 | –91.7 | ||
D | C | –155.9 | 79.7 | –76.0 | |
D | –151.0 | 69.5 | –81.5 | ||
C | C | positive | –175.0 | 88.6 | –86.4 |
D | –155.1 | 72.4 | –82.7 | ||
D | C | –166.1 | 82.9 | –83.2 | |
D | –154.9 | 68.2 | –86.7 |
The first four rows list the results of the GIST calculations for the uncharged ligands. For compound D we find that pose D is energetically favorable (ΔGD(pose D) < ΔGD(pose C)) and that this difference in free energy has its origin in the water entropy. This is in good agreement with the visualization and the results obtained earlier by Czodrowski et al.8 The solvent calculations with WaterMap and SZMAP also identified two to three “unhappy”/“hydrophobic” water molecules which are replaced by compound D in the buried pocket. As a reminder: “unhappy”/“hydrophilic water molecules correspond to water molecules with a positive free energy value compared with bulk water. This explains the binding mode of compound D but would also mean that compound C should be found in the crystal structure with binding pose D. The free energy for compound C in pose D is also lower than in pose C (ΔGC(pose D) < ΔGC(pose C). The previously published explanation that the displacement of an enthalpically strong bound water molecule is responsible for this change in the binding is doubtful, as this should lead to a decrease in binding affinity.
These results indicate that the binding pose of compound C cannot be explained by solvation analysis alone. Therefore, further investigation of the binding of the ligand C is required. A more detailed investigation of the binding site and the residues in close proximity to the ligand revealed that the binding site shows two negatively charged residues (ASP-351, GLU-245) in the buried pocket together with a positively charged lysine (LYS-232), forming salt bridges with each other. Furthermore, a negatively charged aspartate (ASP-290) is close to the amino group of the solvent exposed ligand. Hence, protonation of the aniline group and formation of a salt bridge with the deprotonated aspartate residue (shown in Figure 6) could occur as no other positively charged group is in close proximity.
We therefore performed pKA calculations for the compounds C and D for their native X-ray pose as well as their docking pose. Table 2 summarizes the results from the pKA calculations.
Table 2. Results from the pKA Calculationsa.
The reported values correspond to the pKA values of the amino group in compounds C and D.
Amino group is definitely not protonated at physiological pH: this can be concluded due to the pKA value which is computed to be below 0.
The results of the pKA calculations showed that indeed compound C in binding pose C is likely protonated, whereas for all other structures it is unlikely that the ligands are protonated at neutral pH.
Therefore, we performed the GIST calculations for the protonated form of the ligands C and D (Table 1; row 5–8). Again the entropy for binding pose D is higher than for pose C. In terms of free energy the compounds show the correct trend, but the difference between the different compounds and the different binding poses are smaller than for the neutral compounds. As the method’s error (max standard error of the mean for the GIST calculation: 3.1 kcal/mol) is in the same range, we have to further investigate the system to ensure the trend is correct. The enthalpic interactions between two charged residues (only present with a positively charged ligand) are significantly stronger than interactions between a charge and a dipole or two dipoles. Since GIST does not cover the interaction energy between ligand and protein this could further help to explain the binding mode. Therefore, we hypothesize that the strong enthalpic interaction of the salt bridge in the solvent exposed binding mode overcompensates the entropic unfavorable contributions of the ordered water molecules in the buried pocket.
To ensure that the pKA calculations are correct, we present further evidence that the ligand is indeed protonated and this is responsible for the binding mode flip. Unfortunately, protonation states usually cannot be determined by X-ray crystallography for most biomolecular systems and techniques sensitive to proton location, like neutron scattering, are rarely used within drug design projects. However, by looking at pKA values for related molecules it becomes evident that p-phenylenediamine is significantly more basic (pKA = 6.2) than the m-phenylenediamine with a pKA of 4.98,42 indicating that compound C is much easier to protonate than compound D at physiological pH. This is corroborated by the pKA calculations of the protein-complex. In terms of biomolecular recognition the position of the amino group in para-position (N–O distance 2.8 Å) is closer to the carboxyl group of the aspartate than the amino group in the meta-position (N–O distance 3.5 Å). The difference in the pKA values might also influence the solvation thermodynamics as the amount of the respective protonated species differ between the two compounds. However, the absolute number of protonated species is considered small and therefore these contributions are likely to be minor without a stabilizing functional group in vicinity. While at the first glance the change of the amino group from ortho- to meta-position does not seem to be significant, it fundamentally changed the properties of the molecule.
To further analyze the idea that the compound can be protonated we used the LIE tool of the AmberTools package.29 We wanted to choose a method which explicitly includes the interaction of the protein and the ligand, as this kind of interaction is omitted by the GIST analysis.
To obtain the difference between the interaction energy of the ligand in complex and as free ligand two simulations were necessary. The enthalpic difference between the two states here is the value of interest. Further, we are interested in the change of binding energy between protonated and unprotonated state (shown in the last two columns of Table 3). As expected, the results show that the interaction between the ligand and the surrounding (protein and water) is enhanced when the ligand is protonated for both binding poses. Still, the gain in (more favorable) interaction energy for position C (−15.8 kcal/mol) is higher compared to the gain in pose D (−10.8 kcal/mol) for compound C. It is likely that this gain in enthalpy is responsible for the binding pose change. For binding pose C this enthalpic gain helps to compensate the entropic penalty of the water molecules in the buried pocket and therefore stabilizing the binding pose of compound C. For both binding poses of compound D, we see a gain in interaction energy similarly to compound C in position C (16.2 and 16.0 kcal/mol). In contrast to compound C, compound D is much more difficult to protonate and therefore the energy gain may not be enough to compensate for the energy necessary for the protonation.
Table 3. Results of the Linear Interaction Energy Calculations of Protein–Ligand Interactions ΔU (kcal/mol) for Compounds C and D.
ligand |
|||||||
---|---|---|---|---|---|---|---|
neutral |
positive |
difference |
|||||
ΔU | C | D | C | D | C | D | |
pose | C | –5.3 | –6.7 | –21.2 | –22.9 | –15.8 | –16.2 |
D | –3.4 | –11.8 | –14.2 | –27.8 | –10.8 | –16.0 |
Looking at Table 3 from a different angle: Although we already see that in the neutral form compound C slightly prefers pose C (−1.9 kcal/mol), this could be in the range of the method’s error. The preference enhances for the positive form of compound C and is significantly larger (−7.0 kcal/mol) than the error of the method. For compound D, we found a preference for pose D over pose C (≈5.0 kcal/mol) for both the positive and neutral form.
Further evidence that the preference of the neutral compound C for pose C is not significant brought the application of the MM/PBSA method to this problem.43−45 With this method we found that compound C prefers pose D in the unprotonated state by 6 kcal/mol, whereas for the charged compound we could not find any preference (Δ < 1 kcal/mol) for one of the two binding poses. Similar results were obtained with MM/GBSA. This finding further supports our hypothesis that compound C has to be protonated, when it is found in pose C.
Enthalpy (LIE) calculations without solvent and further information about the MM/GB(PB)SA calculations are given in the SI.
To summarize: Binding pose D is favorable for compound D, not only but also due to entropic contributions from the water molecules being (the phase space water molecules can occupy) more favorable than in pose C. Hence, the binding pose of compound D is to a large extend determined by the entropic contribution of the water molecules. This explains how water can influence the affinity and specificity of ligand binding sites.46 The importance of water for ligand binding is already well established.47 In the presented case the replacement of entropically hindered water with respect to bulk results in a gain in free energy. In contrast, there are cases reported in literature where the enthalpy gain by formation of a water mediated hydrogen bond is larger than the entropic cost of restricting the phase space of the water molecules.13
For binding pose C, the entropy of the water molecules in the binding pocket is unfavorable in comparison to the water molecules in pose D, but compound C is protonated, leading to strong enthalpic interactions. These contributions overcompensate the entropic energy penalty. Therefore, pose C is dominated by the enthalpic contributions to the free energy. This highlights that entropy and enthalpy are both important for ligand binding. Our findings confirm once more that ligand binding depends on the balance between entropy and enthalpy. The relative importance of the enthalpic and entropic contribution might change dramatically upon subtle modifications in ligand chemistry. Our study also highlights the already known fact that enthalpy alone is not a good measurement for free energy,18 a fact often forgotten in medicinal chemistry when focusing solely on direct protein–ligand contacts.
Furthermore, the pH effect on binding affinities and binding modes is often neglected. Indeed, the protonation state and the binding affinity are linked together. The pKA of a ligand influences the binding affinity and selectivity, but also the binding of a ligand affects the pKA of the ligand and the protein,48 as shown in this study.
Conclusion
Our analysis shows that the combination of in silico analysis tools can unravel the origin of different binding poses. The present study provides an explanation of binding modes and rationalizes, based on thermodynamics, the observed binding mode flips. We show how a minor change in the structure of a ligand leads to major structural changes, which could result in a misinterpretation of structure–activity relationships. The binding pose of compound D is by a large extend determined by releasing entropically unfavored (with respect to bulk water) water molecules from the binding pocket into bulk upon binding. For a second compound C, very similar to the first one, the binding mode is determined by the enthalpic contributions of ligand binding. The enthalpic interactions that play a key role in one of the binding poses are strongly related to the protonation of the amino group in this ligand. We encourage to carefully investigate protonation states, if functional groups are present which may or may not be protonated in the environment of another charged group with opposite sign. Seemingly subtle changes in the chemical structure can significantly alter the physicochemical properties of certain groups. As the obtained results go beyond the limit of chemical intuition, the computational methodology presented can be of value for medicinal chemistry programs.49 The investigated TGFBR1 kinase domain binding mode flip is not an isolated case. Similar observations are reported in the literature but are usually discovered by serendipity.1
The enhanced thermodynamic understanding of such binding flips can help to improve binding pose prediction of unknown ligands. While we have to admit that the extensive study presented here is most likely beyond the scope of typical drug design programs, the ligand binding poses analyzed here would most likely not have been correctly predicted by any state-of-the-art structure-based drug design methodology. Nevertheless, such challenging cases can be used to strengthen our ability to accurately predict binding modes and binding mode flips.
Acknowledgments
The authors thank the Austrian Science Fund (FWF) for funding of project P 23051. We thank Anthony Nicholls and Jose Batista (OpenEye Scientific Software) for the access to and help with protein_pka.
Glossary
Abbreviations
- TGFBR1
Transforming Growth Factor Beta Receptor Type 1
- LIE
Linear Interaction Energy
- GIST
Grid Inhomogeneous Solvation Theory
Supporting Information Available
The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acs.jcim.6b00483.
Partial atomic charges of all parametrized ligands, additional enthalpy calculations without solvent, and further information about the MM/GB(PB)SA calculations (PDF)
Author Present Address
⊥ C.K.: F. Hoffmann–La Roche AG, Pharma Research and Early Development, Therapeutic Modalities, Roche Innovation Center Basel, Grenzacherstrasse 124, 4070 Basel, Switzerland.
Author Present Address
∥ J.E.F.: Boehringer Ingelheim RCV GmbH & Co KG, Doktor-Boehringer-Gasse 5-11, 1120 Vienna, Austria
The authors declare no competing financial interest.
Supplementary Material
References
- Kuhnert M.; Koster H.; Bartholomaus R.; Park A. Y.; Shahim A.; Heine A.; Steuber H.; Klebe G.; Diederich W. E. Tracing Binding Modes in Hit-To-Lead Optimization: Chameleon-Like Poses of Aspartic Protease Inhibitors. Angew. Chem., Int. Ed. 2015, 54, 2849–53. 10.1002/anie.201411206. [DOI] [PubMed] [Google Scholar]
- Kozakov D.; Hall D. R.; Jehle S.; Luo L.; Ochiana S. O.; Jones E. V.; Pollastri M.; Allen K. N.; Whitty A.; Vajda S. Ligand deconstruction: Why some fragment binding positions are conserved and others are not. Proc. Natl. Acad. Sci. U. S. A. 2015, 112, E2585–94. 10.1073/pnas.1501567112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kramer C.; Fuchs J. E.; Liedl K. R. Strong Nonadditivity as a Key Structure–Activity Relationship Feature: Distinguishing Structural Changes from Assay Artifacts. J. Chem. Inf. Model. 2015, 55, 483–494. 10.1021/acs.jcim.5b00018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hritz J.; de Ruiter A.; Oostenbrink C. Impact of plasticity and flexibility on docking results for cytochrome P450 2D6: a combined approach of molecular dynamics and ligand docking. J. Med. Chem. 2008, 51, 7469–77. 10.1021/jm801005m. [DOI] [PubMed] [Google Scholar]
- Gao C.; Thorsteinson N.; Watson I.; Wang J.; Vieth M. Knowledge-Based Strategy to Improve Ligand Pose Prediction Accuracy for Lead Optimization. J. Chem. Inf. Model. 2015, 55, 1460–1468. 10.1021/acs.jcim.5b00186. [DOI] [PubMed] [Google Scholar]
- Ackermann K. A.; Adan J.; Cases C. D.; Crassier H.; Hölzemann G. D.; Jonczyk A. D.; Rautenberg W. D.; Rosell-Vives E.; Tarrason G. D.. Pyridopyrimidinonderivate. DE Patent 102004054215, May 11, 2006.
- Hoelzemann G.; Ackermann K. A.; Crassier H.; Jonczyk A.; Rautenberg W.; Tarrason G.; Rosell-Vives E.; Adan J.; Cases C.. 4-Amino-5-Oxo-8-Phenyl-5H-Pyrido-[2,3-d]-Pyrimidine Derivatives as Tyrosine Kinase and RAF Kinase Inhibitors for the Treatment of Tumours. U.S. Patent WO2006050800, May 18, 2006.
- Czodrowski P.; Hölzemann G.; Barnickel G.; Greiner H.; Musil D. Selection of Fragments for Kinase Inhibitor Design: Decoration Is Key. J. Med. Chem. 2015, 58, 457–465. 10.1021/jm501597j. [DOI] [PubMed] [Google Scholar]
- SZMAP 1.2.1.4; OpenEye Scientific Software, Santa Fe, NM, 2015; http://www.eyesopen.com.
- Abel R.; Young T.; Farid R.; Berne B. J.; Friesner R. A. Role of the Active-Site Solvent in the Thermodynamics of Factor Xa Ligand Binding. J. Am. Chem. Soc. 2008, 130, 2817–2831. 10.1021/ja0771033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Michel J.; Tirado-Rives J.; Jorgensen W. L. Energetics of Displacing Water Molecules from Protein Binding Sites: Consequences for Ligand Optimization. J. Am. Chem. Soc. 2009, 131, 15403–15411. 10.1021/ja906058w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Breiten B.; Lockett M. R.; Sherman W.; Fujita S.; Al-Sayah M.; Lange H.; Bowers C. M.; Heroux A.; Krilov G.; Whitesides G. M. Water Networks Contribute to Enthalpy/Entropy Compensation in Protein–Ligand Binding. J. Am. Chem. Soc. 2013, 135, 15579–15584. 10.1021/ja4075776. [DOI] [PubMed] [Google Scholar]
- Ladbury J. E. Just Add Water! The Effect of Water on the Specificity of Protein-Ligand Binding Sites and its Potential Application to Drug Design. Chem. Biol. 1996, 3, 973–980. 10.1016/S1074-5521(96)90164-7. [DOI] [PubMed] [Google Scholar]
- Levy Y.; Onuchic J. N. Water Mediation in Protein Folding and Molecular Recognition. Annu. Rev. Biophys. Biomol. Struct. 2006, 35, 389–415. 10.1146/annurev.biophys.35.040405.102134. [DOI] [PubMed] [Google Scholar]
- Kovalenko A.; Hirata F. Three-Dimensional Density Profiles of Water in Contact with a Solute of Arbitrary Shape: a RISM Approach. Chem. Phys. Lett. 1998, 290, 237–244. 10.1016/S0009-2614(98)00471-0. [DOI] [Google Scholar]
- Sindhikara D. J.; Hirata F. Analysis of Biomolecular Solvation Sites by 3D-RISM Theory. J. Phys. Chem. B 2013, 117, 6718–6723. 10.1021/jp4046116. [DOI] [PubMed] [Google Scholar]
- Nguyen C. N.; Kurtzman Young T.; Gilson M. K. Grid Inhomogeneous Solvation Theory: Hydration Structure and Thermodynamics of the Miniature Receptor Cucurbit[7]uril. J. Chem. Phys. 2012, 137, 044101. 10.1063/1.4733951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reynolds C. H.; Holloway M. K. Thermodynamics of Ligand Binding and Efficiency. ACS Med. Chem. Lett. 2011, 2, 433–437. 10.1021/ml200010k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen C. N.; Cruz A.; Gilson M. K.; Kurtzman T. Thermodynamics of Water in an Enzyme Active Site: Grid-Based Hydration Analysis of Coagulation Factor Xa. J. Chem. Theory Comput. 2014, 10, 2769–2780. 10.1021/ct401110x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aqvist J.; Marelius J. The Linear Interaction Energy Method for Predicting Ligand Binding Free Energies. Comb. Chem. High Throughput Screening 2001, 4, 613–26. 10.2174/1386207013330661. [DOI] [PubMed] [Google Scholar]
- Gutierrez-de-Teran H.; Aqvist J. Linear Interaction Energy: Method and Applications in Drug Design. Methods Mol. Biol. 2012, 819, 305–23. 10.1007/978-1-61779-465-0_20. [DOI] [PubMed] [Google Scholar]
- Case D. A.; Berryman J. T.; Betz R. M.; Cerutti D. S.; Cheatham T. E. I.; Darden T. A.; Duke R. E.; Giese T. J.; Gohlke H.; Goetz A. W.; Homeyer N.; Izadi S.; Janowski P.; Kaus J.; Kovalenko A.; Lee T. S.; LeGrand S.; Li P.; Luchko T.; Luo R.; Madej B.; Merz K. M.; Monard G.; Needham P.; Nguyen H.; Nguyen H. T.; Omelyan I.; Onufriev A.; Roe D. R.; Roitberg A.; Salomon-Ferrer R.; Simmerling C. L.; Smith W.; Swails J.; Walker R. C.; Wang J.; Wolf R. M.; Wu X.; York D. M.; Kollman P. A.. AMBER 2015; University of California, San Francisco, 2015. [Google Scholar]
- Jakalian A.; Bush B. L.; Jack D. B.; Bayly C. I. Fast, Efficient Generation of High-Quality Atomic Charges. AM1-BCC Model: I. Method. J. Comput. Chem. 2000, 21, 132–146. . [DOI] [PubMed] [Google Scholar]
- Jakalian A.; Jack D. B.; Bayly C. I. Fast, Efficient Generation of High-Quality Atomic Charges. AM1-BCC Model: II. Parameterization and Validation. J. Comput. Chem. 2002, 23, 1623–41. 10.1002/jcc.10128. [DOI] [PubMed] [Google Scholar]
- Wang J.; Wolf R. M.; Caldwell J. W.; Kollman P. A.; Case D. A. Development and Testing of a General Amber Force Field. J. Comput. Chem. 2004, 25, 1157–74. 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
- Maier J. A.; Martinez C.; Kasavajhala K.; Wickstrom L.; Hauser K. E.; Simmerling C. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. J. Chem. Theory Comput. 2015, 11, 3696–3713. 10.1021/acs.jctc.5b00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jorgensen W. L.; Chandrasekhar J.; Madura J. D.; Impey R. W.; Klein M. L. Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983, 79, 926–935. 10.1063/1.445869. [DOI] [Google Scholar]
- Wallnoefer H. G.; Handschuh S.; Liedl K. R.; Fox T. Stabilizing of a Globular Protein by a Highly Complex Water Network: A Molecular Dynamics Simulation Study on Factor Xa. J. Phys. Chem. B 2010, 114, 7405–7412. 10.1021/jp101654g. [DOI] [PubMed] [Google Scholar]
- Roe D. R.; Cheatham T. E. PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J. Chem. Theory Comput. 2013, 9, 3084–3095. 10.1021/ct400341p. [DOI] [PubMed] [Google Scholar]
- Schauperl M.; Podewitz M.; Waldner B. J.; Liedl K. R. Enthalpic and Entropic Contributions to Hydrophobicity. J. Chem. Theory Comput. 2016, 12, 4600–4610. 10.1021/acs.jctc.6b00422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bashford D.; Karplus M. pKa’s of Ionizable Groups in Proteins: Atomic Detail from a Continuum Electrostatic Model. Biochemistry 1990, 29, 10219–25. 10.1021/bi00496a010. [DOI] [PubMed] [Google Scholar]
- Schutz C. N.; Warshel A. What are the Dielectric “Constants” of Proteins and how to Validate Electrostatic Models?. Proteins: Struct., Funct., Genet. 2001, 44, 400–417. 10.1002/prot.1106. [DOI] [PubMed] [Google Scholar]
- Word J. M.; Nicholls A. Application of the Gaussian Dielectric Boundary in Zap to the Prediction of Protein pKa Values. Proteins: Struct., Funct., Bioinf. 2011, 79, 3400–9. 10.1002/prot.23079. [DOI] [PubMed] [Google Scholar]
- Czodrowski P.; Dramburg I.; Sotriffer C. A.; Klebe G. Development, Validation, and Application of Adapted PEOE Charges to Estimate pKa Values of Functional Groups in Protein–Ligand Complexes. Proteins: Struct., Funct., Bioinf. 2006, 65, 424–437. 10.1002/prot.21110. [DOI] [PubMed] [Google Scholar]
- Czodrowski P.; Sotriffer C. A.; Klebe G. Atypical Protonation States in the Active Site of HIV-1 Protease: A Computational Study. J. Chem. Inf. Model. 2007, 47, 1590–1598. 10.1021/ci600522c. [DOI] [PubMed] [Google Scholar]
- Czodrowski P.; Sotriffer C. A.; Klebe G. Protonation Changes upon Ligand Binding to Trypsin and Thrombin: Structural Interpretation Based on pKa Calculations and ITC Experiments. J. Mol. Biol. 2007, 367, 1347–1356. 10.1016/j.jmb.2007.01.022. [DOI] [PubMed] [Google Scholar]
- Steuber H.; Czodrowski P.; Sotriffer C. A.; Klebe G. Tracing Changes in Protonation: A Prerequisite to Factorize Thermodynamic Data of Inhibitor Binding to Aldose Reductase. J. Mol. Biol. 2007, 373, 1305–1320. 10.1016/j.jmb.2007.08.063. [DOI] [PubMed] [Google Scholar]
- Young T.; Abel R.; Kim B.; Berne B. J.; Friesner R. A. Motifs for Molecular Recognition Exploiting Hydrophobic Enclosure in Protein–Ligand Binding. Proc. Natl. Acad. Sci. U. S. A. 2007, 104, 808–13. 10.1073/pnas.0610202104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee B. Enthalpy-Entropy Compensation in the Thermodynamics of Hydrophobicity. Biophys. Chem. 1994, 51, 271–278. 10.1016/0301-4622(94)00048-4. [DOI] [PubMed] [Google Scholar]
- Lumry R.; Rajender S. Enthalpy-Entropy Compensation Phenomena in Water Solutions of Proteins and Small Molecules: A Ubiquitous Property of Water. Biopolymers 1970, 9, 1125–1227. 10.1002/bip.1970.360091002. [DOI] [PubMed] [Google Scholar]
- Chodera J. D.; Mobley D. L. Entropy-Enthalpy Compensation: Role and Ramifications in Biomolecular Ligand Recognition and Design. Annu. Rev. Biophys. 2013, 42, 121–142. 10.1146/annurev-biophys-083012-130318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rappoport Z.; Frankel M.. CRC Handbook of Tables for Organic Compound Identification; CRC Press. Inc.: Boca Raton, United States, 1967. [Google Scholar]
- Kollman P. A.; Massova I.; Reyes C.; Kuhn B.; Huo S.; Chong L.; Lee M.; Lee T.; Duan Y.; Wang W.; Donini O.; Cieplak P.; Srinivasan J.; Case D. A.; Cheatham T. E. 3rd Calculating Structures and Free Energies of Complex Molecules: Combining Molecular Mechanics and Continuum Models. Acc. Chem. Res. 2000, 33, 889–97. 10.1021/ar000033j. [DOI] [PubMed] [Google Scholar]
- Srinivasan J.; Cheatham T. E.; Cieplak P.; Kollman P. A.; Case D. A. Continuum Solvent Studies of the Stability of DNA, RNA, and Phosphoramidate–DNA Helices. J. Am. Chem. Soc. 1998, 120, 9401–9409. 10.1021/ja981844+. [DOI] [Google Scholar]
- Massova I.; Kollman P. A. Combined Molecular Mechanical and Continuum Solvent Approach (MM-PBSA/GBSA) to Predict Ligand Binding. Perspect. Drug Discovery Des. 2000, 18, 113–135. 10.1023/A:1008763014207. [DOI] [Google Scholar]
- Fuchs J. E.; Huber R. G.; Waldner B. J.; Kahler U.; von Grafenstein S.; Kramer C.; Liedl K. R. Dynamics Govern Specificity of a Protein-Protein Interface: Substrate Recognition by Thrombin. PLoS One 2015, 10, e0140713. 10.1371/journal.pone.0140713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Betz M.; Wulsdorf T.; Krimmer S. G.; Klebe G. Impact of Surface Water Layers on Protein--Ligand Binding: How Well Are Experimental Data Reproduced by Molecular Dynamics Simulations in a Thermolysin Test Case?. J. Chem. Inf. Model. 2016, 56, 223–33. 10.1021/acs.jcim.5b00621. [DOI] [PubMed] [Google Scholar]
- Gilson M. K.; Zhou H. X. Calculation of Protein-Ligand Binding Affinities. Annu. Rev. Biophys. Biomol. Struct. 2007, 36, 21–42. 10.1146/annurev.biophys.36.040306.132550. [DOI] [PubMed] [Google Scholar]
- Bodnarchuk M. S. Water, Water, Everywhere... It’s Time to Stop and Think. Drug Discovery Today 2016, 21, 1139–1146. 10.1016/j.drudis.2016.05.009. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.