Abstract
Cysteine protease enzymes are important for human physiology and catalyze key protein degradation pathways. These enzymes react via a nucleophilic reaction mechanism that involves a cysteine residue and the proton of a proximal histidine. Particularly efficient inhibitors of these enzymes are nitrile-based, however, the details of the catalytic reaction mechanism currently are poorly understood. To gain further insight into the inhibition of these molecules, we have performed a combined density functional theory and quantum mechanics/molecular mechanics study on the reaction of a nitrile-based inhibitor with the enzyme active site amino acids. We show here that small perturbations to the inhibitor structure can have dramatic effects on the catalysis and inhibition processes. Thus, we investigated a range of inhibitor templates and show that specific structural changes reduce the inhibitory efficiency by several orders of magnitude. Moreover, as the reaction takes place on a polar surface, we find strong differences between the DFT and QM/MM calculated energetics. In particular, the DFT model led to dramatic distortions from the starting structure and the convergence to a structure that would not fit the enzyme active site. In the subsequent QM/MM study we investigated the use of mechanical vs. electronic embedding on the kinetics, thermodynamics and geometries along the reaction mechanism. We find minor effects on the kinetics of the reaction but large geometric and thermodynamics differences as a result of inclusion of electronic embedding corrections. The work here highlights the importance of model choice in the investigation of this biochemical reaction mechanism.
Keywords: enzyme mechanism, enzyme inhibition, DFT, QM/MM
Introduction
Proteases are important enzymes in human physiology and involved in terminal protein degradation via a nucleophilic acid-base reaction (Chapman et al., 1997; Otto and Schirmeister, 1997; Madala et al., 2010). They are classified within four different classes that are based on the Lewis base in the catalytic reaction mechanism, namely serine-, cysteine-, aspartate-, and metalloproteases. The former two uses the terminal side chain of a serine or cysteine amino acid as nucleophilic center in the reaction mechanism and are the most abundant proteases in the body. Members of the cysteine protease family include intracellular proteases called cathepsins that take part in hydrolytic cleavage of peptide bonds, apoptosis as well as immune responses. As such, the pathogenesis of microbial infections, arthritis, osteoporosis and cancer require the involvement of cysteine proteases, which makes them an important target of drug development and chemotherapy (Selzer et al., 1999; Palermo and Joyce, 2007; Robinson et al., 2008). In addition, they have been implicated in atherosclerosis-based vascular diseases (Lutgens et al., 2007; Cheng et al., 2011; Ratovitski et al., 2011).
Due to their pharmaceutical interest considerable research has been devoted to establish the mechanism and function of cysteine protease enzymes (Kato et al., 2005; Hang et al., 2006; Shokhen et al., 2011). Furthermore, research into finding suitable inhibitors of the cysteine protease family is paramount (Arkin and Wells, 2004; Rzychon et al., 2004; Drahl et al., 2005; Tyndall et al., 2005; Ehmke et al., 2011). Inhibition can take place via several different methods, including covalent inhibition, blockage or distortion of the catalytic active site via competition with non-covalent inhibitors. One particular class of covalent inhibitors that have been particularly useful for cysteine proteases are nitrile-based compounds (Oballa et al., 2007; MacFaul et al., 2009; Morley et al., 2009; Ehmke et al., 2012). Currently, however, the precise mechanism of the interaction between these inhibitors and active site cysteine is not clear. In order to understand the inhibition process by cysteine proteases, and, especially, its mechanism with inhibitors we decided to do a computational study. The calculations use density functional theory (DFT) on model complexes and quantum mechanics/molecular mechanics (QM/MM) studies on a complete cysteine protease enzyme.
Cysteine proteases bind their substrates in a groove located on the surface of the enzyme, therefore the reaction takes place on a relatively polar region with many interacting hydrogen bonding donor and acceptor groups (Sajid and McKerrow, 2002). A protein-ligand structure of cathepsin K (Altmann et al., 2004) is shown in Figure 1, whereby the nitrile-based inhibitor is covalently linked to a cysteine residue (Cys25) in the active site of the protease and within hydrogen bonding distance to a histidine residue (His162). It has been hypothesized that this histidine residue donates a proton during the covalent linkage formation between inhibitor and the cysteine residue, but the intricate details of this mechanism remain elusive. The computational study described in this work will address this mechanism and establish the key factors that drive the reaction mechanism as well as the influence of the surrounding protein.
Several groups have studied the catalytic mechanism of cysteine protease with computational techniques, including quantum mechanics/molecular mechanics (QM/MM) (Helten et al., 2004; Ma et al., 2007; Mladenovic et al., 2008a,b,c) and density functional theory (DFT) model complexes and focused on cathepsin B proteases (Shokhen et al., 2009; Shankar et al., 2010). These studies implicated strong influences of the hydrogen bonding network in the substrate/inhibitor binding pocket and led to increased stability of the HisH+—Cys− ion pair as compared to its neutral form, although this may change after substrate binding. QM/MM studies on the catalytic mechanism of these cysteine proteases revealed a rate determining acylation step with a barrier of 19.8 kcal mol−1 (Ma et al., 2007). These values compare well with other nucleophilic addition reactions in related enzymes, such as sortase A that has a similar active site as cysteine protease and reacts with a similar catalytic machinery (Tian and Eriksson, 2011). So far, few studies have addressed inhibitor binding and the effect of compound reactivity and in particular nitrile-based inhibitors have not been thoroughly studied computationally in this way (Mladenovic et al., 2008c). We therefore initiated this study to understand the covalent mechanism of this chemotype through its binding to cathepsin K.
The specific mechanism of how nitrile-based inhibitors interact with cathepsin K is not known, it is however clear that these inhibitors bind tightly within the active site and inhibit via the reversible formation of a covalent bond (Helten et al., 2004). To find out, how structural changes to these inhibitors affect their inhibition, we modeled the reaction of cathepsin K with four inhibitor scaffolds (Scheme 1): S1 and S2 are the protonated and neutral purine-based nitriles, whereas in S3 and S4 the nitrogen atoms of the six-membered ring were replaced by carbon. The purpose of investigating S3 and S4 is because studies have shown such a change results in a significant reduction in the reactivity of the nitrile functionality (Altmann et al., 2004). As the reaction takes place on the surface of the enzyme, and the interactions between the inhibitors and protein affect the reaction we have opted for two sets of calculations. In the first set we take a model of the substrate binding-site and reaction position and calculate the mechanism with density functional theory. In a subsequent set of calculations we use quantum mechanics/molecular mechanics (QM/MM) on the complete enzyme. As we show here, there are dramatic differences in substrate activation between the two models that highlight the importance of the full QM/MM model in the calculations.
Methods
The studies presented in this work use density functional theory (DFT) as well as quantum mechanics/molecular mechanics (QM/MM) methods as implemented in the Jaguar and Gaussian-03 program packages (Frisch et al., 2004; Jaguar, 2009). We used two approaches to investigate the mechanism of cysteine protease inhibition: (i) DFT calculations on a model complex that represents the active species and key hydrogen bonding interactions, so-called QM-cluster calculations, and (ii) Full QM/MM on the complete enzyme. In the past we used the QM-cluster methodology extensively to describe the mechanistic features of important enzymes for human health, for example of the cytochromes P450, (Shaik et al., 2005; de Visser, 2010; Kumar et al., 2010) cysteine dioxygenase (Aluri and de Visser, 2007) and taurine/α-ketoglutarate dependent dioxygenase (de Visser, 2006). Although these QM-cluster calculations in many cases were able to reproduce experimentally determined product distributions, kinetic isotope effects and rate constants as well as spectroscopic features of key stable intermediates (Kumar et al., 2005; Vardhaman et al., 2011, 2013a,b), we cannot be certain that the methodology will also work on an enzyme such as a cysteine protease. In particular, QM/MM studies on these enzymatic systems showed that in many cases the active features were sufficient to describe the enzyme accurately (Godfrey et al., 2008; Porro et al., 2009; Kumar et al., 2011b), but in cases where the active species has close lying electronic states environmental perturbations were shown to change the electronic properties of the reactant and consequently the reactivity patterns (Ogliaro et al., 2000; de Visser et al., 2002; Leeladee et al., 2012). We anticipated similar problems in the current system, where strong polar interactions influenced the reaction kinetics, therefore, a combined DFT and QM/MM approach was applied.
QM-cluster calculations
The DFT model complex is based on the key features of the active site of cysteine protease with inhibitor bound as displayed in Scheme 2. Since, the inhibitor is bound on the surface of the enzyme via a range of hydrogen bonding interactions, we included a large proportion of the active site including these key interacting residues. Important hydrogen bonding interactions of the protein with these inhibitor templates, include the interaction with the cysteine residue, the alcohol group of Ser183, the peptide chain carbonyl group of Ala161, Ala163 and Asn182, and a bridging water molecule (W402). To prevent unnatural changes in the geometry during the optimization procedure, we fixed several carbon atoms of the peptide chains as identified with a star in Scheme 2. However, as geometric constraints often lead to geometry convergence problems and small imaginary frequencies, we kept the number of fixed atoms to a minimum here (Pratter et al., 2013; de Visser et al., 2014). This model has overall charge of +1 and was calculated in the singlet spin state only. Note that the structural constraint had little effect on the geometry optimization and all local minima had real frequencies only. The inhibition of nitriles by cysteine protease was investigated with four different inhibitor molecules as identified with S1, S2, S3 and S4 in Scheme 1 above. S1 and S2 are based on the inhibitor bound in the 1U9V pdb file and differ by the protonation state of the nitrogen atom of the five-membered ring. As a consequence the model with substrate S1 has overall charge +1, whereas the model with substrate S2 is neutral. In addition, we investigated two substrates S3 and S4, where one of the nitrogen atoms of the pyridine ring was replaced by a CH group. These structures are expected to have a lower intrinsic reactivity and differ in their hydrogen bond acceptor patterns. It has been shown (Ehmke et al., 2012) that molecules with high electrophilicity inhibit cysteine protease better than those with lower electrophilicity. In particular, compounds, such as S2, which contain a pyrimidine, were found to inhibit more effectively than compounds based on the S3 and S4 templates that use a pyridine instead.
Following previous experience with DFT on nucleophilic addition reactions (de Visser, 2012), we use the hybrid density functional method B3LYP (Becke, 1993; Lee et al., 1998) in combination with a 6-31G* basis set, BS1, for geometry optimizations and frequency calculations. We performed a full geometry optimization followed by an analytical frequency. To test the effect of the basis set on the energetics we ran single point B3LYP/6-311++G** calculations, basis set BS2. For a selection of reactions we also tested the effect of solvent using the polarized continuum model as implemented in Jaguar with a dielectric constant of ϵ = 5.7 and a probe radius of 2.66 Å.
QM/MM set-up
In a second set of calculations we carried out QM/MM calculations starting from the inhibitor bound structure of cathepsin K: 1U9V pdb structure (Altmann et al., 2004). We used well tested QM/MM procedures, which we applied to the catalytic reaction mechanisms of heme and non-heme iron enzymes previously (Godfrey et al., 2008; Porro et al., 2009; Kumar et al., 2011b; Quesne et al., 2014). The work started out from the inhibitor-bound enzyme monomer as deposited as the 1U9V crystal structure, which we initially updated to include all missing heavy atoms of the amino acids using the MOE program package (MOE, 2008). Subsequently, we used these starting coordinates to add hydrogen atoms to the heavy atoms with the PDB2PQR program package (Dolinsky et al., 2007). All amino acids were then protonated according to the usual pKa conventions (Dolinsky et al., 2007) at pH = 7 and analyzed with the Propka program package; this resulted in a structure with all aspartate and glutamate amino acids in their deprotonated forms and all arginine and lysine amino acids as protonated. The peptide chain contains two histidine amino acids and upon visual inspection it was decided to take His162 as doubly protonated, whereas His177 as singly protonated on atom Nϵ only. This gave us a system that is overall charge neutral. Subsequently, our chemical system was solvated in a sphere where the protein was extended with a water layer with a diameter of 10 Å using periodic boundary conditions. We repeated the solvation procedure in several steps where we attempted to add further water molecules to the system after each step and kept repeating this process until few water molecules could be added. This resulted in a chemical system with a total of 9580 atoms. Note, that the structure is energy minimized after each water addition step until a threshold of minimum number of water molecules added was reached. This system was then equilibrated and subjected to a molecular mechanics minimization using the FF94 force field (Wang et al., 2004) and then gradually heated to above room temperature (298 K) conditions using a heating procedure. During the equilibration the water accessible areas of the protein were comprehensibly solvated and the protein unfolded into a more relaxed structure using a dynamical protocol built into the MOE software package (MOE, 2008). The equilibration was performed in a consecutive set of minimizations starting with restrained protein and ligand and followed by an fully unrestrained calculation. Subsequently, the chemical system was gradually heated from 0 to 300 K with restrained ligand and protein. Finally, an NVT molecular dynamics simulation was done for 500 ps in steps of 0.001 ps until a temperature of 300 K was reached. In order to parameterize and obtain a topology for our purine-based nitrile inhibitor the antechamber component of the AMBER 10 package was used (Cornell et al., 1995; Tao and Schlegel, 2010). In this particular case, for each atom and bond type specific topology and parameters were identified based on known residue values.
QM/MM calculations
We then selected a QM region that contained the same model as displayed in Scheme 2 for the DFT model calculations. The QM region was overall charge neutral and was calculated in the closed-shell singlet spin state only. The ONIOM approach (Vreven et al., 2006). was used for the QM/MM calculations with the B3LYP/6-311G method describing the QM region. We tested several force fields for the description of the MM region including the UFF/ZDO and Amber methods (Rappé et al., 1992; Cornell et al., 1995). The link-atom approach (Singh and Kollman, 1986) describes the division between the QM and MM regions as hydrogen link-atoms. All structures and energies reported here were fully optimized (without constraints) using DFT/AMBER in Gaussian-03, whereby all atoms within a radius of 8Å from the reaction center were free to move during the optimization and the rest was fixed in its position throughout the QM/MM calculation. As frequency calculations were impossible on the full QM/MM structure in Gaussian, we extracted the QM region from the optimized QM/MM geometries, added hydrogen atoms and ran an analytical frequency calculation on the QM region only. To prevent excessive small imaginary frequencies in structures, we reoptimized the hydrogen atoms before running the frequency calculation, however, none of the local minima discussed here had any imaginary frequencies. As will be shown in the results section the differences in ΔE and ΔE+ZPE energies are small, and do not change the trends reported in this work. The QM/MM energies (EQM/MM) were calculated using Equation 1 as the separate energies of a QM calculation on the QM region (Emodel, QM) and an MM calculation on the complete system (Ereal, MM) minus an MM calculation on the QM region (Emodel,MM). QM/MM optimizations were initially done using DFT/AMBER with mechanical embedding included and were followed by DFT/AMBER single point calculations with electronic embedding included. We later also did full geometry optimizations with electronic embedding included at the DFT/AMBER level of theory.
(1) |
Mechanistic studies
The covalent inhibition of the inhibitor molecules was investigated via two possible scenarios, namely a concerted or a stepwise mechanism, Scheme 3. The first possibility is sequential nucleophilic addition and proton transfer in a stepwise mechanism, where the thiolate group of Cys25 attacks the carbon atom of the nitrile group first to form a covalent bond. Subsequently, the negatively charged nitrogen atom of this bond abstracts a proton from His162. Alternatively the nucleophilic addition and proton transfer happen in a single concerted reaction step. To ascertain the reaction mechanism of these inhibitors against cysteine protease enzymes we investigated both possible reaction mechanisms using two computational methods: Firstly, we did extensive density functional theory (DFT) studies on an active site model complex. Secondly, full quantum mechanics/molecular mechanics (QM/MM) studies on a substrate bound crystal structure were performed.
Results and discussion
Thus, in the proposed reaction mechanism, the thiolate group of Cys25 reacts via a nucleophilic addition reaction with substrate to form a covalent linkage. The, thereby formed intermediate abstracts a proton from a nearby histidine residue (His162) to give the product complex, Scheme 2. Currently, it is unclear whether this mechanism is concerted or stepwise and what the effect of the protein is on the reaction mechanism.
We started the work with a DFT investigation into the catalytic reaction mechanism of cysteine proteases, using QM-cluster model as described in Scheme 2 above and the results obtained with templates S1, S2, S3 and S4 are given in Figure 2. We define the reaction as proceeding from a reactant complex (R) that is initiated by an electrophilic attack of the thiolate group on the nitrile via a transition state TSE leading to an intermediate state IE. In a subsequent proton transfer reaction via transition state TSH we form the product complex P. Thus, using a substrate with the imidazolium group doubly protonated we find a stepwise mechanism, but if it is deprotonated on the Nd position a concerted pathway is found via a single transition state (TSE), where the imaginary mode gives simultaneous S–C bond formation and proton transfer from His162 to the nitrile nitrogen atom to form the product complex P. By contrast, for substrate S1 the reaction is stepwise via an intermediate complex and two barriers TSE and TSH separating the intermediate were characterized. As a consequence the processes for S2, S3 and S4 are concerted via a single transition state with an imaginary frequency that shows a mixture of both electrophilic attack and proton transfer. For inhibitor scaffolds S1, S2, and S3 this barrier is characterized by a small imaginary frequency, which is typical for sulfur-involved transition states (Kumar et al., 2011a). Using S4, the proton transfer is more dominating in the transition state, likely due to the later transition state and hence the imaginary frequency is larger (i270.7 cm−1).
Optimized geometries of the transition states (TSE and TSH) are displayed alongside the mechanism of the reaction in Figure 2. The concerted TSE transition states have the proton originating from His162 at a relatively short distance of 1.456–1.542 Å from the nitrile nitrogen atom, where it is 1.746 Å in the stepwise TS for substrate S1. Interestingly, the nitrile C–N distance shows little variation between the four different transition states and ranges from 1.196 to 1.207 Å. The C–S distance is slightly longer in the stepwise transition state as compared to the concerted transition states.
Energetically, the nucleophilic mechanisms are characterized by low energy barriers of between 5 and 6 kcal mol−1 for templates S1, S2, and S4, whereas S3 gives a somewhat enhanced barrier of 12.7 kcal mol−1 in the gas-phase. Therefore, the inhibition of cathepsin K by these nitrile-based inhibitors would be expected to be a fast and efficient process. Although technically the mechanism using substrate S1 is stepwise with well-characterized transition states for nucleophilic addition and proton transfer, the proton transfer barrier is very small (of the order of 0.1 kcal mol−1). This implies that the intermediate complex will have an ultrashort lifetime and proton transfer will happen instantaneously prior after the nucleophilic addition. Consequently, the mechanism perceived will be a concerted nucleophilic addition and proton transfer. Calculations with a dielectric continuum added that mimics a solvent gave little changes in the relative energies.
Replacement of one of the nitrogen atoms of the pyridine ring in S2 by a C–H group as in S3 leads to a raise in barriers for the nucleophilic reaction mechanism. Thus, using substrates S1, S2, and S4 the peptide chain N–H group of Cys25 interacts with the pyridine nitrogen atom in a weak hydrogen bonding interaction. However, replacement of this nitrogen atom by a C–H group leads to a repulsive interaction between the N–H and C–H groups, which leads to some tilting of the substrate group and weakening of the hydrogen bonding interactions between substrate and protein. Because of this the barriers are raised and unfavorable substrate inhibition is observed. This result is in good agreement with experimental inhibition studies that found higher activity for S2 type inhibitors than for pyridine containing inhibitors (Ehmke et al., 2012). The effect of substitution of the other pyridine nitrogen atom by C–H has little influence on the reaction mechanism and thermodynamics since it is not involved in direct hydrogen bonding interactions. Moreover, this group is located on the edge of the protein and points toward the solvent, where its thermodynamic perturbation will be limited. As such it may be difficult to establish the intricate details of the reactivity differences between S3 and S4 experimentally.
To confirm the suggested mechanism and ascertain the effect of the protein, we decided to do a full QM/MM calculation on the mechanism using substrate S2. The cysteine protease active site is located on the surface of the protein, Figure 3, and the inhibitor forms a number of hydrogen bonds with the protein to maintain its binding mode. Inside the active site of the protein, the nitrile functionality reacts with Cys25, assisted by His162. Visualization of the protein structure identifies a covalent linkage between Cys25 and the substrate, hence is in a product-like P conformation.
To understand the effect of protein and solvent on the mechanism of these inhibitors we decided to do a full QM/MM study starting from the described crystal structure. We initially optimized the reactant (R) and product (P) geometries using our QM/MM model and extracts of these structures are given in Figure 4. We used two procedures for the geometry optimizations: (i) with mechanical embedding included and (ii) with electronic embedding included. Thus, it occasionally happens that structures and consequently relative energies are dependent on the full environmental perturbations of the whole protein on the active site and with electronic embedding these interactions are included in full. The reactant complex has the inhibitor in a conformation where the nitrogen atom is in hydrogen bonding distance to the proton of the imidazole group of His162 at a relatively short distance of 1.519 Å. Hence the inhibitor molecule will be strongly bound in the substrate binding pocket in a specific orientation. The sulfur atom of the cysteine residue is located at the relatively large distance of 2.166 Å to the carbon atom of the nitrile. This geometry changes somewhat when electronic embedding is included and the histidine moves to a position in hydrogen bonding distance to the thiolate of Cys25, which results in somewhat larger distance away from the substrate. In the mechanically embedded structure the thiolate also forms a hydrogen bonding interaction with the peptide backbone N–H of Ala161 (2.488 Å) as well as a weak hydrogen bond with the hydrogen atom of the H–Cα atoms of His162 at a distance of 2.636 Å. In the product complexes these hydrogen bonding interactions are lost and the peptide chain containing the Ala161–His162 residues has moved away from the Cys-inhibitor linkage. Thus the nitrogen atom of the imidazole group of His162 has formed a weak hydrogen bonding interaction with one of the hydrogen atoms of the CH2 group of Cys25 instead (distance of 2.227 Å). By contrast, the thioester group of Cys25 interacts with the H–Cα group of His162 (distance of 2.830 Å) in the product complexes, and consequently considerable changes in the protein have occurred during the reaction process. As a result of this the relative energies for the potential energy profile has undergone some drastic changes.
The calculated reaction energy for the process from RQM/MM to PQM/MM using mechanical embedding only is −20.0 kcal mol−1, which in comparison to the QM-cluster calculations described above, is indicative of an irreversible process. However, it is known from experiment that the reaction is reversible, which means that the energeties calculated with mechanical embedding are not realistic. In particular, enzymatic studies on this system have shown that covalent binding of the inhibitor is a reversible process. The calculations with mechanical embedding included, therefore, are a poor representation of the actual inhibition process. To find a more realistic pathway, we subsequently recalculated the QM/MM calculations with electronic embedding included. As will be shown in the next paragraphs these calculations predict an almost thermoneutral reaction mechanism and consequently, a reversible reaction mechanism in agreement with experiment. Most probably the polarity of the inhibitor binding site and the enzyme makes it essential to include full electronic embedding in the calculations.
In a subsequent set of calculations we explored the potential energy profile between reactants and products in detail using QM/MM with both mechanical and electronic embedding models. To this end we initially did a full geometry scan, whereby the C–S bond formation was kept fixed at specific distances while we optimized all other degrees of freedom starting from RQM/MM. In addition, we did geometry scans in the reverse direction starting from PQM/MM for the proton transfer back from the CN–H group to His162. These geometry scans gave insight onto the potential energy profile of the reaction and the maxima from the scans were used as starting points for the transition state searches. Our optimized transition state geometries and potential energy landscapes are shown in Figure 5. The barrier is characterized by a large imaginary frequency of i589.6 cm−1 that implicates the proton transfer from His162 to the nitrile group. It is also substantially higher in energy than the one calculated for the DFT model above and we find it 23.1 kcal mol−1 in energy above the reactants using the mechanical embedding procedures. Geometrically, the C–S distance is in between that found for reactants and products. Moreover, an attempt to optimize a local minimum and a subsequent C–S bond formation transition state failed and led to reactant structures in all cases. Therefore, the reaction proceeds in a concerted mechanism via a transition state that is dominated by the proton transfer motion. Using electronic embedding corrections the barrier height drops slightly to 19.9 kcal mol−1. Therefore, electronic embedding has a small barrier-lowering effect on the calculations. The nucleophilic addition barrier calculated with QM/MM is in perfect agreement with that found for native cysteine proteases, for which a value of 19.8 kcal mol−1 was calculated (Ma et al., 2007). Therefore, the electronic embedding corrections appear to have a large effect on the overall thermodynamics of the reaction but only a minor effect on the kinetics. The electronically embedded reaction energy is close to thermoneutral and very similar to that found for the QM-cluster results described above. Experimentally, it has been shown that the reaction is reversible, which implicates an almost thermoneutral reaction mechanism, in agreement with our QM-cluster and QM/MM with electronic embedding results. Clearly, the mechanical embedding QM/MM calculations overestimate the overall thermodynamics of the reaction, although the kinetics is reasonable.
In order to understand the large differences and reaction energetics between the DFT model study and the QM/MM results, we investigated structural differences between the optimized geometries first, see Figure 6. Panel Figure 6A displays an overlay of the three QM/MM optimized geometries. Despite the fact that these structures have resulted from a full geometry optimization without constraints, actually the peptide chain is virtually in the same position for all geometries. In the active site, there are some obvious structural differences due to formation of a linkage between Cys25 and the inhibitor, which is absent in the reactant. This bond formation changes the position of the histidine side-chain as well as other proton donors/hydrogen bonding groups to the active site, including the Trp184 and Gln19 residues.
The overlay of the two transition state structures for substrate S1 activation as obtained by DFT and QM/MM is given in Figure 6B. Even though the DFT model complex had four fixed carbon atoms in positions of the original protein structure, the full geometry optimization of the reaction mechanism led to a structure that significantly deviated from the actual enzyme conformation. Thus, the overlay of TSQM/MM and TSE, S1 shows a twist of the substrate by almost 90° as well as reorientation of the protein backbone. In the model complex, there is sufficient geometric flexibility so that the system will relax to its most ideal configuration. In the enzyme, by contrast, steric interactions of the protein structure prevent reorientation of the geometry and the most ideal configurations as those found for the DFT model complexes are not stable. Because of that the energetics of the potential energy profile has shifted dramatically and the nucleophilic addition step is raised in energy. This is indeed what was found on native cysteine proteases, where it was shown that disruption of the hydrogen-bonding network led to proton affinity differences of the histidine and cysteine groups (Mladenovic et al., 2008b). Based on these studies Engels et al. predicted enhanced nucleophilic addition reactivity upon removal of the hydrogen-bonding network. Indeed that is what we find when we compare the studies of the DFT model complex (without the water layer) and the QM/MM result. Recent work of ours on the reactivity of iron(IV)-oxo and manganese(IV)-oxo intermediates in aliphatic reaction mechanisms also showed reduced reactivity for complexes whereby the active oxidant underwent hydrogen bonding (accepting) interactions so that this may be a general feature in enzymatic reaction mechanisms (Latifi et al., 2013).
Conclusions
We report a combined DFT and QM/MM study into the mechanism of inhibition of nitrile-based inhibitors to cathepsin K. We show here that a DFT model complex gives dramatic differences in optimized geometries and relative energies as compared to QM/MM. We show that despite geometric constraints the DFT structure has undergone dramatic changes with respect to its original conformation and does not fit the enzymatic pocket anymore when an attempt is made to reinsert the structure into the protein. These changes affect the relative energies of the reaction. All calculations predict a concerted nucleophilic addition reaction with simultaneous C–S bond formation and proton transfer. We also did QM/MM calculations and show that a procedure with full electronic embedding included is necessary to describe an irreversible reaction process. This again highlights the importance of model and method choice in quantum chemical calculations.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
Matthew G. Quesne thanks the BBSRC for a studentship. The National Service of Computational Chemistry Software (NSCCS) is thanked for cpu time provided.
References
- Altmann E., Cowan-Jacob S. W., Missbach M. (2004). Novel purine nitrile derived inhibitors of the cysteine protease cathepsin K. J. Med. Chem. 47, 5833–5836 10.1021/jm0493111 [DOI] [PubMed] [Google Scholar]
- Aluri S., de Visser S. P. (2007). The mechanism of cysteine oxygenation by cysteine dioxygenase enzymes. J. Am. Chem. Soc. 129, 14846–14847 10.1021/ja0758178 [DOI] [PubMed] [Google Scholar]
- Arkin M. R., Wells J. A. (2004). Small-molecule inhibitors of protein-protein interactions: progressing towards a dream. Nat. Rev. Drugs. Disc. 3, 301–317 10.1038/nrd1343 [DOI] [PubMed] [Google Scholar]
- Becke A. D. (1993). Density−functional thermochemistry. III. The role of exact exchange. J. Chem. Phys. 98, 5648–5652 10.1063/1.464913 [DOI] [Google Scholar]
- Chapman H. A., Riese R. J., Shi G.-P. (1997). Emerging roles for cysteine proteases in human biology. Annu. Rev. Physiol. 59, 63–88 10.1146/annurev.physiol.59.1.63 [DOI] [PubMed] [Google Scholar]
- Cheng X. W., Huang Z., Kuzuya M., Okumura K., Murohara T. (2011). Cysteine protease cathepsins in atherosclerosis-based vascular disease and its complications. Hypertension 58, 978–986 10.1161/HYPERTENSIONAHA.111.180935 [DOI] [PubMed] [Google Scholar]
- Cornell W. D., Cieplak P., Bayly C. I., Gould I. R., Merz K. M., Jr., Ferguson D. M., et al. (1995). A second generation force-field for the simulation of proteins, nucleic-acids, and organic-molecules. J. Am. Chem. Soc. 117, 5179–5197 10.1021/ja00124a00215702930 [DOI] [Google Scholar]
- de Visser S. P. (2006). Differences in and comparison of the catalytic properties of heme and non-heme enzymes with a central oxo-iron group. Angew. Chem. Int. Ed. 45, 1790–1793 10.1002/anie.200503841 [DOI] [PubMed] [Google Scholar]
- de Visser S. P. (2010). Trends in substrate hydroxylation reactions by heme and nonheme iron(IV)-oxo oxidants give correlations between intrinsic properties of the oxidant with barrier height. J. Am. Chem. Soc. 132, 1087–1097 10.1021/ja908340j [DOI] [PubMed] [Google Scholar]
- de Visser S. P. (2012). Predictive studies of oxygen atom transfer reactions by Compound I of cytochrome P450: aliphatic and aromatic hydroxylation, epoxidation and sulfoxidation. Adv. Inorg. Chem. 64, 1–31 10.1016/B978-0-12-396462-5.00001-5 [DOI] [Google Scholar]
- de Visser S. P., Ogliaro F., Sharma P. K., Shaik S. (2002). What factors affect the regioselectivity of oxidation by cytochrome P450? A DFT study of allylic hydroxylation and double bond epoxidation in a model reaction. J. Am. Chem. Soc. 124, 11809–11826 10.1021/ja026872d [DOI] [PubMed] [Google Scholar]
- de Visser S. P., Quesne M. G., Martin B., Comba P., Ryde U. (2014). Computational modelling of oxygenation processes in enzymes and biomimetic model complexes. Chem. Commun. 50, 262–282 10.1039/c3cc47148a [DOI] [PubMed] [Google Scholar]
- Dolinsky T. J., Czodrowski P., Li H., Nielsen J. E., Jensen J. H., Klebe G., et al. (2007). PDB2PQR: expanding and upgrading automated preparation of biomolecular structures for molecular simulations. Nucleic Acids Res. 35, W522–W525 10.1093/nar/gkm276 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drahl C., Cravatt B. F., Sorensen E. J. (2005). Protein-reactive natural products. Angew. Chem. Int. Ed. 44, 5788–5809 10.1002/anie.200500900 [DOI] [PubMed] [Google Scholar]
- Ehmke V., Heindl C., Rottmann M., Freymond C., Schweizer W. B., Brun R., et al. (2011). Potent and selective inhibition of cysteine proteases from Plasmodium falciparum and Trypanosoma brucei. ChemMedChem. 6, 273–278 10.1002/cmdc.201000449 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ehmke V., Quinsaat J. E. Q., Rivera-Fuentes P., Heindl C., Freymond C., Rottmann M., et al. (2012). Tuning and predicting biological affinity: aryl nitriles as cysteine protease inhibitors. Org. Biomol. Chem. 10, 5764–5768 10.1039/c2ob00034b [DOI] [PubMed] [Google Scholar]
- Frisch M. J., Trucks G. W., Schlegel H. B., Scuseria G. E., Robb M. A., Cheeseman J. R., et al. (2004). Gaussian-03, Revision C.02. Wallington, CT: Gaussian, Inc [Google Scholar]
- Godfrey E., Porro C. S., de Visser S. P. (2008). Comparative quantum mechanics / molecular mechanics (QM/MM) and density functional theory calculations on the oxo-iron species of taurine/α-ketoglutarate dioxygenase. J. Phys. Chem. A 112, 2464–2468 10.1021/jp710999v [DOI] [PubMed] [Google Scholar]
- Hang H. C., Loureiro J., Spooner E., van der Velden A. W. M., Kim Y.-M., Pollington A. M., et al. (2006). Mechanism-based probe for the analysis of cathepsin cysteine proteases in living cells. ACS Chem. Biol. 1, 713–723 10.1021/cb600431a [DOI] [PubMed] [Google Scholar]
- Helten H., Schirmeister T., Engels B. (2004). Model calculations about the influence of protic environments on the alkylation step of epoxide, aziridine, and thiirane based cysteine protease inhibitors. J. Phys. Chem. A 108, 7691–7701 10.1021/jp048784g [DOI] [Google Scholar]
- Jaguar (2009). Jaguar, Version 7.6. New York, NY: Schrodinger, LLC [Google Scholar]
- Kato D., Boatright K. M., Berger A. B., Nazif T., Blum G., Ryan C., et al. (2005). Activity-based probes that target diverse cysteine protease families. Nat. Chem. Biol. 1, 33–38 10.1038/nchembio707 [DOI] [PubMed] [Google Scholar]
- Kumar D., de Visser S. P., Shaik S. (2005). Multistate reactivity in styrene epoxidation by Compound I of cytochrome P450: mechanisms of products and side products formation. Chem. Eur. J. 11, 2825–2835 10.1002/chem.200401044 [DOI] [PubMed] [Google Scholar]
- Kumar D., Karamzadeh B., Sastry G. N., de Visser S. P. (2010). What factors influence the rate constant of substrate epoxidation by Compound I of cytochrome P450 and analogous iron(IV)-oxo oxidants. J. Am. Chem. Soc. 132, 7656–7667 10.1021/ja9106176 [DOI] [PubMed] [Google Scholar]
- Kumar D., Sastry G. N., de Visser S. P. (2011a). Effect of the axial ligand on substrate sulfoxidation mediated by iron(IV)–oxo porphyrin cation radical oxidants. Chem. Eur. J. 17, 6196–6205 10.1002/chem.201003187 [DOI] [PubMed] [Google Scholar]
- Kumar D., Thiel W., de Visser S. P. (2011b). Theoretical study on the mechanism of the oxygen activation process in cysteine dioxygenase enzymes. J. Am. Chem. Soc. 133, 3869–3882 10.1021/ja107514f [DOI] [PubMed] [Google Scholar]
- Latifi R., Sainna M. A., Rybak-Akimova E. V., de Visser S. P. (2013). Does hydrogen-bonding donation to manganese(IV)–oxo and iron(IV)–oxo oxidants affect the oxygen-atom transfer ability? A computational study. Chem. Eur. J. 19, 4058–4068 10.1002/chem.201202811 [DOI] [PubMed] [Google Scholar]
- Lee C., Yang W., Parr R. G. (1998). Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B 37, 785–789 10.1103/PhysRevB.37.785 [DOI] [PubMed] [Google Scholar]
- Leeladee P., Baglia R. A., Prokop K. A., Latifi R., de Visser S. P., Goldberg D. P. (2012). Valence tautomerism in a high-valent manganese-oxo porphyrinoid complex induced by a Lewis acid. J. Am. Chem. Soc. 134, 10397–10400 10.1021/ja304609n [DOI] [PubMed] [Google Scholar]
- Lutgens S. P. M., Cleutjens K. B. J. M., Daemen M. J. A. P., Heeneman S. (2007). Cathepsin cysteine proteases in cardiovascular disease. Faseb. J. 21, 3029–3041 10.1096/fj.06-7924com [DOI] [PubMed] [Google Scholar]
- Ma S., Devi-Kesavan L., Gao J. (2007). Molecular dynamics simulations of the catalytic pathway of a cysteine protease: a combined QM/MM study of human cathepsin K. J. Am. Chem. Soc. 129, 13633–13645 10.1021/ja074222+ [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacFaul P. A., Morley A. D., Crawford J. J. (2009). A simple in vitro assay for assessing the reactivity of nitrile containing compounds. Bioorg. Med. Chem. Lett. 19, 1136–1138 10.1016/j.bmcl.2008.12.105 [DOI] [PubMed] [Google Scholar]
- Madala P. K., Tyndall J. D. A., Nall T., Fairlie D. P. (2010). Update 1 of: proteases universally recognize beta strands in their active sites. Chem. Rev. 110, PR1–PR31 10.1021/cr900368a [DOI] [PubMed] [Google Scholar]
- Mladenovic M., Ansorg K., Fink R. F., Thiel W., Schirmeister T., Engels B. (2008a). Atomistic insights into the inhibition of cysteine proteases: first QM/MM calculations clarifying the stereoselectivity of epoxide-based inhibitors. J. Phys. Chem. B 112, 11798–11808 10.1021/jp803895f [DOI] [PubMed] [Google Scholar]
- Mladenovic M., Fink R. F., Thiel W., Schirmeister T., Engels B. (2008b). On the origin of the stabilization of the zwitterionic resting state of cysteine proteases: a theoretical study. J. Am. Chem. Soc. 130, 8696–8705 10.1021/ja711043x [DOI] [PubMed] [Google Scholar]
- Mladenovic M., Junold K., Fink R. F., Thiel W., Schirmeister T., Engels B. (2008c). Atomistic insights into the inhibition of cysteine proteases: first QM/MM calculations clarifying the regiospecificity and the inhibition potency of epoxide- and aziridine-based inhibitors. J. Phys. Chem. B 112, 5458–5469 10.1021/jp711287c [DOI] [PubMed] [Google Scholar]
- MOE (2008). MOE Version 2007.09. Montreal, QC: Chemical Computing Group Inc [Google Scholar]
- Morley A. D., Kenny P. W., Burton B., Heald R. A., MacFaul P. A., Mullett J., et al. (2009). 5-Aminopyrimidin-2-ylnitriles as cathepsin K inhibitors. Bioorg. Med. Chem. Lett. 19, 1658–1661 10.1016/j.bmcl.2009.01.110 [DOI] [PubMed] [Google Scholar]
- Oballa R. M., Truchon J. F., Bayly C. I., Chauret N., Day S., Crane S., et al. (2007). A generally applicable method for assessing the electrophilicity and reactivity of diverse nitrile-containing compounds. Bioorg. Med. Chem. Lett. 17, 998–1002 10.1016/j.bmcl.2006.11.044 [DOI] [PubMed] [Google Scholar]
- Ogliaro F., Cohen S., de Visser S. P., Shaik S. (2000). Medium polarization and hydrogen bonding effects on Compound I of cytochrome P450: what kind of a radical is it really? J. Am. Chem. Soc. 122, 12892–12893 10.1021/ja005619f [DOI] [Google Scholar]
- Otto H.-H., Schirmeister T. (1997). Cysteine proteases and their inhibitors. Chem. Rev. 97, 133–171 10.1021/cr950025u [DOI] [PubMed] [Google Scholar]
- Palermo C., Joyce J. A. (2007). Cysteine cathepsin proteases as pharmacological targets in cancer. Trends Pharmacol. Sci. 29, 22–28 10.1016/j.tips.2007.10.011 [DOI] [PubMed] [Google Scholar]
- Porro C. S., Sutcliffe M. J., de Visser S. P. (2009). Quantum mechanics/molecular mechanics studies on the sulfoxidation of dimethyl sulfide by Compound I and Compound 0 of Cytochrome P450: which is the better oxidant? J. Phys. Chem. A 113, 11635–11642 10.1021/jp9023926 [DOI] [PubMed] [Google Scholar]
- Pratter S. M., Konstantinovics C., DiGiuro C. L. M., Leitner E., Kumar D., de Visser S. P., et al. (2013). Inversion of enantio-selectivity of a mononuclear non-heme iron(II)-dependent hydroxylase by tuning the interplay of metal-center geometry and protein structure. Angew. Chem. Int. Ed. 52, 9677–9681 10.1002/anie.201304633 [DOI] [PubMed] [Google Scholar]
- Quesne M. G., Latifi R., Gonzalez-Ovalle L. E., Kumar D., de Visser S. P. (2014). Quantum mechanics/molecular mechanics study on the oxygen binding and substrate hydroxylation step in AlkB repair enzymes. Chem. Eur. J. 20 [Epub ahead of print]. 10.1002/chem.201303282 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rappé A. K., Casewit C. J., Colwell K. S., Goddard W. A., III, Skiff W. M. (1992). UFF, a full periodic table force field for molecular mechanics and molecular dynamics simulations. J. Am. Chem. Soc. 114, 10024–10035 10.1021/ja00051a040 [DOI] [Google Scholar]
- Ratovitski T., Chighladze E., Waldron E., Hirschhorn R. R., Ross C. A. (2011). Cysteine proteases bleomycin hydrolase and cathepsin Z mediate N-terminal proteolysis and toxicity of mutant Huntingtin. J. Biol. Chem. 286, 12578–12589 10.1074/jbc.M110.185348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson M. W., Dalton J. P., Donnelly S. (2008). Helminth pathogen cathepsin proteases: it's a family affair. Trends Biochem. Sci. 33, 601–608 10.1016/j.tibs.2008.09.001 [DOI] [PubMed] [Google Scholar]
- Rzychon M., Chmiel D., Stec-Niemczyk J. (2004). Modes of inhibition of cysteine proteases. Acta Biochim. Pol. 51, 861–873 [PubMed] [Google Scholar]
- Sajid M., McKerrow J. H. (2002). Cysteine proteases of parasitic organisms. Mol. Biochem. Parasitol. 120, 1–21 10.1016/S0166-6851(01)00438-8 [DOI] [PubMed] [Google Scholar]
- Selzer P. M., Pingel S., Hsieh I., Ugele B., Chan V. J., Engel J. C., et al. (1999). Cysteine protease inhibitors as chemotherapy: lessons from a parasite target. Proc. Natl. Acad. Sci. U.S.A. 96, 11015–11022 10.1073/pnas.96.20.11015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shaik S., Kumar D., de Visser S. P., Altun A., Thiel W. (2005). Theoretical perspective on the structure and mechanism of cytochrome P450 enzymes. Chem. Rev. 105, 2279–2328 10.1021/cr030722j [DOI] [PubMed] [Google Scholar]
- Shankar R., Kolandaivel P., Senthilkumar K. (2010). Reaction mechanism of cysteine proteases model compound HSH with diketone inhibitor PhCOCOCH3–nXn, (X = F, Cl, n = 0, 1, 2). Int. J. Quantum Chem. 110, 1660–1674 10.1002/qua.22332 [DOI] [Google Scholar]
- Shokhen M., Khazanov N., Albeck A. (2009). Challenging a paradigm: theoretical calculations of the protonation state of the Cys25-His159 catalytic diad in free papain. Proteins 77, 916–926 10.1002/prot.22516 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shokhen M., Traube T., Vijayakumar S., Hirsch M., Uritsky N., Albeck A. (2011). Differentiating serine and cysteine protease mechanisms by new covalent QSAR descriptors. ChemBioChem. 12, 1023–1026 10.1002/cbic.201000459 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Singh U. C., Kollman P. A. (1986). A combined ab initio quantum mechanical and molecular mechanical method for carrying out simulations on complex molecular systems: applications to the CH3Cl + Cl− exchange reaction and gas phase protonation of polyethers. J. Comput. Chem. 7, 718–730 10.1002/jcc.540070604 [DOI] [Google Scholar]
- Tao P., Schlegel H. B. (2010). A toolkit to assist ONIOM calculations. J. Comp. Chem. 31, 2363–2369 10.1002/jcc.21524 [DOI] [PubMed] [Google Scholar]
- Tian B.-X., Eriksson L. A. (2011). Catalytic mechanism and roles of Arg197 and Thr183 in the Staphylococcus aureus Sortase A enzyme. J. Phys. Chem. B 115, 13003–13011 10.1021/jp2058113 [DOI] [PubMed] [Google Scholar]
- Tyndall J. D. A., Nall T., Fairlie D. P. (2005). Proteases universally recognize beta strands in their active sites. Chem. Rev. 105, 973–999 10.1021/cr040669e [DOI] [PubMed] [Google Scholar]
- Vardhaman A. K., Barman P., Kumar S., Sastri C. V., Kumar D., de Visser S. P. (2013a). Comparison of the reactivity of nonheme iron(IV)-oxo versus iron(IV)-imido complexes: which is the better oxidant? Angew. Chem. Int. Ed. 52, 12288–12292 10.1002/anie.201305370 [DOI] [PubMed] [Google Scholar]
- Vardhaman A. K., Barman P., Kumar S., Sastri C. V., Kumar D., de Visser S. P. (2013b). Mechanistic insight into halide oxidation by non-heme iron complexes. Haloperoxidase versus halogenase activity. Chem. Commun. 49, 10926–10928 10.1039/c3cc46792a [DOI] [PubMed] [Google Scholar]
- Vardhaman A. K., Sastri C. V., Kumar D., de Visser S. P. (2011). Nonheme ferric hydroperoxo intermediates are efficient oxidants of bromide oxidation. Chem. Commun. 47, 11044–11046 10.1039/c1cc13775a [DOI] [PubMed] [Google Scholar]
- Vreven T., Frisch M. J., Kudin K. N., Schlegel H. B., Morokuma K. (2006). Geometry optimization with QM/MM methods II: explicit quadratic coupling. Mol. Phys. 104, 701–714 10.1080/00268970500417846 [DOI] [Google Scholar]
- Wang J. M., Wolf R. M., Caldwell J. W., Kollman P. A., Case D. A. (2004). Development and testing of a general amber force field. J. Comput. Chem. 25, 1157–1174 10.1002/jcc.20035 [DOI] [PubMed] [Google Scholar]