Abstract
Biomolecular recognition is crucial in cellular signal transduction. Signaling is mediated through molecular interactions at protein-protein interfaces. Still, specificity and promiscuity of protein-protein interfaces cannot be explained using simplistic static binding models. Our study rationalizes specificity of the prototypic protein-protein interface between thrombin and its peptide substrates relying solely on binding site dynamics derived from molecular dynamics simulations. We find conformational selection and thus dynamic contributions to be a key player in biomolecular recognition. Arising entropic contributions complement chemical intuition primarily reflecting enthalpic interaction patterns. The paradigm “dynamics govern specificity” might provide direct guidance for the identification of specific anchor points in biomolecular recognition processes and structure-based drug design.
Introduction
Cellular signaling critically depends on biomolecular recognition processes [1]. Understanding these processes on the molecular level is key for a comprehensive picture of living organisms. Models of biomolecular interactions evolved from first mechanistic explanation through Fischer's lock-and-key model that presumes static steric complementarity between the binding partners [2] and neglects any dynamic processes in the interacting entities. Koshland introduced dynamic aspects in the induced fit model which assumes that binding partners adapt their respective conformations to a state of maximum complementarity [3]. However, proteins undergo conformational transitions even in absence of binding partners existing as an equilibrium of conformations [4]. The conformational selection paradigm proposes that binding partners select the most appropriate conformation from this pre-existing ensemble of conformations [5]. Upon complex formation, equilibrium populations are shifted and a weakly populated state might become dominant [6]. Recently, conformational selection has become apparent in most biomolecular recognition processes [7].
Proteases provide prototypic protein-protein interfaces [8], binding and proteolytically cleaving peptides and proteins at a catalytic cleft [9]. The sub-pocket interactions of cleaved substrates (“degradome”) [10] are classified following the convention of Schechter and Berger [11]. Protease sub-pockets are numbered according to the corresponding substrate binding site over all sub-pockets Sn-Sn', with the peptide's scissile bond being the bond between N-terminal P1 and C-terminal P1'. Owing to a multitude of experimental techniques [12], substrate data is available for a wide range of proteases, e.g., via the MEROPS database [13]. Substrate information can be utilized for direct comparison of substrate recognition [14,15] and quantification of specificity [16]. Using these techniques, specificity within a protease binding site can be identified and visualized. In the well-characterized family of serine proteases substrate specificity originates primarily from interactions N-terminal to the cleavage site (non-prime side) [17], but also via remote exosite interactions [18,19]. Several studies aim at identifying a suitable binding paradigm and suggest conformational selection as most likely model [20,21]. Thrombin is a trypsin-like serine protease and key enzyme in the blood-clotting cascade [22,23]. On a structural level, active thrombin consists of a heavy and a light chain that is formed by proteolytic cleavage from a single precursor chain [24]. Thrombin includes several highly dynamic segments such as the autolysis loop (γ-loop) that is frequently missing in X-ray structures. The dynamic rearrangement of the active site of thrombin plays a role during zymogen activation via prethrombin-1 and prethrombin-2 as well as upon substrate binding [25]. As thrombin exists in two different states, exhibiting different biological roles, allosteric communication mediating the transition between the two forms plays an important role [26]. Thereby, binding of a Na+ ion switches the enzyme from the slow to the fast form which includes reordering of bound water molecules [27, 28]. Trypsin-like serine proteases are generally regulated via conformational plasticity around the substrate binding site, thus leading to the E*/E equilibrium [29]. The E* form is basically inactive towards substrate and Na+ binding and shows a collapse of the 215–217 ß-strand into the active site. In the active E form the S1 pocket is accessible and presents a negatively charged aspartate side-chain [30]. Direct P1-S1 interactions of the substrate with this amino acid explain the strong preference of thrombin for positively charged substrate residues (especially arginine residues) at P1 (C-terminal to the scissile bond). Further requirements have also been described for flanking amino acids [31,32].
However, differences between the other sub-pockets are smaller and less obvious from an enthalpic point of view. Broad literature highlights complex interplays between dynamics, solvation and ligand binding in thrombin [33,34,35]. We decipher molecular origins for the different degrees of specificity within sub-pockets of thrombin based on flexibility. Our analyses are based on two central concepts: We apply information theoretic cleavage entropy values [16] based on experimental substrate data to quantify local protease specificity and compare emerging specificity patterns to local binding site flexibility that is quantified via dihedral entropies [36] over backbone torsions derived from microsecond scale molecular dynamics simulations. In contrast to flexible sub-pockets, we expect rigid regions in interfaces to provide excellent anchor points for drug design as they allow for specific drug-target interactions. Locating appropriate interaction centers will support structure-based drug design targeting classical active sites [37] as well as allosteric pockets [38]. We expect our analysis of dynamic binding site features to be especially helpful in rational drug design on challenging protein-protein interactions [39].
Results and Discussion
We quantified thrombin sub-pocket specificity as cleavage entropy [16] based on substrate data from MEROPS [13] (see Fig 1). A value of zero represents absolute specificity for a single amino acid whereas absence of substrate preference in a sub-pocket is indicated via a cleavage entropy of one.
To investigate the effect of conformational dynamics and thus conformational selection on substrate recognition, we simulated protein dynamics of unbound and fibrinogen-complexed thrombin for 1μs. To probe intrinsic local dynamics of the binding site region of thrombin without ligand, we calculated residue-wise backbone flexibility in holo state using Cα B-factors after a global alignment. We refer to holo systems for proteins that were created by artificially removing a ligand from a complex structure in contrast to a native apo structure. We grouped the global backbone flexibility into respective sub-pockets S6-S1, yielding average flexibilities of thrombin sub-pockets (see Table 1). Mapping to the binding site region gives a visual impression of intrinsic dynamics of the thrombin substrate binding cleft (see Fig 2).
Table 1. Sub-pocket-wise global backbone B-factors for holo simulations (Global Cα) and dihedral entropy values Sφ and Sψ.
S6 | S5 | S4 | S3 | S2 | S1 | |
---|---|---|---|---|---|---|
Global Cα [Å2] | 44.6 | 56.5 | 47.7 | 50.8 | 24.8 | 15.6 |
Sφ [J/(mol*K)] | 34.6 | 35.9 | 35.6 | 36.6 | 32.5 | 33.3 |
Sψ [J/(mol*K)] | 32.3 | 34.2 | 34.1 | 35.8 | 33.5 | 33.0 |
As expected from its buried location in the center of the protease, the S1 pocket was found to be the most rigid part of the binding site on a global scale. The S2 pocket was found to be the second most rigid pocket in thrombin upon ranking by global backbone flexibility. Pockets S3-S6 showed elevated global backbone flexibilities. The correlation between active site dynamics and specificity was quantified as Spearman rank correlation coefficient ρ. We found that global backbone dynamics correlate strongly to cleavage entropies (specificity) with ρ = 0.886.
Additionally, we quantified sub-pocket flexibility as backbone dihedral entropies, Sφ and Sψ from the distribution of dihedral angles φ and ψ during 1μs molecular dynamics simulation. It should be noted that cleavage entropy and backbone dihedral entropy are different concepts which should not be confused even though both rely on entropy. Whereas cleavage entropy measures specificity relying on substrate data via information entropy, backbone dihedral entropies capture local flexibility based on molecular dynamics data reflecting thermodynamic entropy.
Backbone dihedral entropy yields Spearman rank correlation coefficients of ρφ = 0.657 and ρψ = 0.886 to cleavage entropy. Mapping of dihedral entropies capturing local backbone flexibility to the binding site of thrombin are shown in Fig 2. Both employed metrics for local flexibility clearly demonstrate the interplay of local flexibility and specificity. Identification of P1-S1 interactions as main carrier of substrate specificity, followed by P2-S2 interactions, is in good agreement with experimental findings from literature [40].
In addition to protein dynamics, water is known to contribute to protein recognition [41]. Thus, we quantified water thermodynamics in thrombin binding pockets using Grid Inhomogeneous Solvation Theory (GIST) [42]. We found strong correlations of water ordering and binding specificity similar to the correlations described above between specificity and protein flexibility (see Fig 3). Seemingly protein rigidity extends in the form of water ordering into the first solvation shell and thus further contributes to specificity.
The rigid S1 pocket of thrombin contains highly ordered water molecules with entropies considerably lower than the bulk reference (see Table 2). Translational entropy increases for outer and more solvent-exposed sub-pockets. Orientational entropy contributions are all negative, thus water molecules are overall more ordered than in bulk. Correlations of water entropies to binding specificity are strong with ρ = 0.886 for orientational entropy and ρ = 0.600 for translational entropy.
Table 2. Sub-pocket-wise water thermodynamics for holo simulations: dTStrans, dTSorient, Hsolv per water.
S6 | S5 | S4 | S3 | S2 | S1 | |
---|---|---|---|---|---|---|
dTStrans [kcal/mol] | 0.11 | 0.36 | 0.12 | -0.001 | -0.052 | -0.51 |
dTSorient [kcal/mol] | -0.45 | -0.53 | -0.91 | -0.22 | -0.61 | -1.38 |
Hsolv [kcal/mol] | -10.0 | -10.4 | -9.69 | -3.56 | -5.30 | -10.0 |
Explicit water molecules in molecular dynamics simulations are an excellent probe for enthalpic contributions to binding, as they do not only capture hydrogen bonding but also electrostatics and even non-polar interactions. In our analysis water enthalpy does not correlate to specificity to an extent comparable to water ordering measured by solvation entropy. Absolute enthalpic interactions between water and the protein identify the S1 pocket as most important anchor point but fail to rank other pockets in terms of specificity. A mean water-water interaction of -9.533 kcal/mol per water is expected for the TIP3P water model [43]. Despite, S2 and S3 pockets in fact repel waters binding from bulk. However, the S2 pocket is a major contribution to thrombin specificity. In consequence, Spearman correlation is only ρ = 0.200 between water enthalpies and cleavage entropy.
Several studies had hinted at qualitative correlations between protease dynamics and substrate specificity [44–48]. Our quantitative correlation of structural and dynamic data allows us to directly relate local flexibility with sub-pocket specificity of the protease thrombin. Correlations are derived from a ligand-free simulation and thereby support the theory that binding preferences are influenced by the intrinsic conformational ensemble of the receptor. Thus, specificity is following a paradigm of conformational selection.
Conformational selection is also confirmed when comparing simulations of holo and substrate-bound thrombin. Sampled conformational space for the substrate-bound form is a subset of the holo conformational ensemble. This is in agreement with studies on enzyme kinetics in literature [49,50,20]. The comparison of holo and complexed simulations revealed differences in active site dynamics upon substrate binding (see Fig 4). Global flexibility is reduced for all binding site residues in the simulated complex. Still, some residues oppose the overall trend to rigidification in Sφ: Leu-99 is mobilized by substrate binding. This is due to the hydrophobic contact of Leu-99 with Phe at P9 of the fibrinogen-derived peptide that mobilizes the backbone. As substrate peptides are bound to the catalytic cleft of proteases via distinct hydrogen bonding patterns, we investigated the hydrogen bonding characteristics of the thrombin-fibrinogen complex. In our simulations P1-S1 interactions are characterized by strong hydrogen bonding, forming on average more than three hydrogen bonds. Also S6-P6 interactions are mainly driven by hydrogen bonds, S3-P3 partially relies on hydrogen bonding of the backbone. Pockets S2, S4 and S5 show very little or no hydrogen bonding with the substrate peptide at all.
The most specific binding site region of thrombin, the S1 pocket, shows tightest hydrogen bonding (see Table 3). Nevertheless, the Spearman rank correlation coefficient ρ between hydrogen bond occupancy and cleavage entropy (specificity) is very low. Accordingly, ρ = -0.058 indicates an inverse correlation of hydrogen bond occupancy and cleavage entropy, where low values indicate specific substrate recognition. Still, the absolute value of ρ is close to zero and thus lower than correlation coefficients found for global backbone flexibility and dihedral entropy shown beforehand.
Table 3. Hydrogen bonding between thrombin and the bound peptide.
S6 | S5 | S4 | S3 | S2 | S1 | |
---|---|---|---|---|---|---|
Occupancy | 1.04 | <0.01 | 0 | 0.59 | 0 | 3.37 |
Hence, local dynamics of the holo system provide more information about substrate specificity than hydrogen bonding in the complex. This shows that capturing enthalpic information only is not sufficient to explain the sub-pocket specificity of thrombin. Regions like the rigid S2 pocket in thrombin can be specific interaction partners even in absence of direct chemical interactions.
Additionally, we show a propagation of receptor dynamics into local water entropy. Rigid pockets show highly ordered water molecules. These highly ordered water molecules might be interpreted as probes of molecular interactions. Ordered water molecules imply well-defined exit vectors or particular pocket shapes and thus high specificity. Thereby, these ordered water molecules by specific binding reflect conformational selection from the hydration point of view. Similar correlations have already been described for PDZ domains [51], where the interplay of binding promiscuity and conformational selection is well accepted [52].
We observe very similar trends when analyzing a second thrombin complex with a factor XIII-derived peptide as well as a third complex formed by thrombin and a fragment of the protease-activated receptor 1 (see Table A and Table B in S1 File). Again, entropic contributions from binding site dynamics derived from microsecond scale simulations are more predictive for local sub-pocket specificity than enthalpic measures. The consistency of these trends observed for all three simulated systems strengthens our confidence that the correlations are caused by actual dynamic processes rather than sampling artifacts.
We demonstrate a tight interplay of flexibility and specificity in the substrate recognition of thrombin relying on conformational selection. Rigid binding site regions provide less diverse conformations, thus matching fewer binding partners. This in turn leads to specific substrate recognition. The described effect is independent of enthalpic considerations which we found insufficient to explain sub-pocket specificity of thrombin. In summary we find that active site flexibility plays a central role in substrate readout, turning local dynamics into a key player in the specificity of the protease thrombin. The paradigm “dynamics govern specificity” should be transferable to any biomolecular recognition process. Thus, it should provide guidance for finding accurate specific anchor points in structure-based drug design.
Materials and Methods
Data mining and cleavage entropy calculation
We used the protease substrate collection from MEROPS [13] to quantify protease specificity via calculation of cleavage entropy calculation [16]. Based on 185 substrates (database identifier: S01.217, accession date: 14.6.2013) we calculated sub-pocket-wise cleavage entropies for all substrate positions P6-P1 that are in direct contact with the thrombin binding site. Sub-pocket-wise cleavage entropies depict a metric for substrate specificity ranging from 0 for stringent substrate readout to 1 for random substrate binding.
Preparation of the X-ray structures
Structures of protease-substrate complexes allow mapping of substrate specificity to the binding site of proteases. We based our study on the X-ray structure of thrombin in complex with fibrinogen (PDB: 1FPH [53]) occupying the non-prime side of the binding cleft. The structure contains a chloromethylketone-inhibited form of thrombin, where the substrate peptide is covalently linked to the active site residues Ser-195 and His-57. We removed the covalent bonds and added an N-methyl group (NME) to the C-terminal of the peptides to neutralize the free charge. Analogously, we capped the N-terminal of the substrate peptides with an acetyl group (ACE). This terminal group was already present in case of the complex with fibrinogen-derived decapeptide. Thus, the simulated complex represents a non-covalent protein-ligand complex.
We removed the light chains of thrombin as well as the hirudin chain. Water molecules assigned to the heavy chain were preserved, as buried water molecules are known be crucial for protease simulations [54,55]. Structures were protonated using the protonate3D protocol from MOE [56] and manually checked. This resulted in a simulation topology not containing any protonated histidines, while automatically assigned histidine tautomers were preserved.
To study conformational selection effects, we generated a holo system in addition to the complex structure. Therefore, we removed the bound fibrinogen from the binding site. This resulted in two topologies for molecular dynamics simulations: the original thrombin complex Thrcom,Fib as well as the respective holo system Thrholo,Fib. The constructed holo system is highly similar to a published apo structure of thrombin (PDB: 1HXF [57]) with a Cα RMSD as low as 0.4Å. We did not reprotonate the system after substrate removal to ensure comparability between both systems. Finally, we added solvent boxes of explicit TIP3P water [58] with a minimum wall distance of 12Å around the proteins using tleap from AmberTools [59].
Four additional systems were prepared for comparison: We created another simulation setup based on the crystal structure of thrombin with a bound factor XIII-derived peptide (PDB: 1DE7 [60]) resulting in a system for the complex as well as an apo system after deleting the peptide ligand. A third independent starting structure was built based on the co-crystal structure of thrombin and a bound protease-activated receptor 1 fragment (PDB: 3LU9 [61]). Again, we performed simulations for both complex and apo state modeled via deletion of the peptide ligand. Those additional systems represent the fast form of thrombin with bound Na+ ions. These ions were held in place over simulation time by applying flat bottom harmonic restraints to the coordinating protein atoms.
Binding site definition
For the definition of the thrombin binding site, the complex structures of thrombin with bound fibrinogen and thrombin with bound factor XIII were used. All residues of thrombin with at least one atom within a radius of 3.5Å around the peptide substrates were considered as binding pockets S6-S1. Based on this definition a direct comparison between substrate data (P6-P1) and structure and dynamics data for the pockets S6-S1 is possible.
To obtain a more balanced binding site definition between both systems Thrcom,Fib and Thrcom,F13, we unified both sets of residues leading to a common definition for the sub-pockets (see Table C in S1 File for a list of residues following the numbering scheme for chymotrypsinogen [62]).
Molecular dynamics simulations
All atom simulations were performed using the GPU implementation of pmemd [63] in AMBER12 [59] using the AMBER forcefield 99SB-ILDN [64]. A uniform neutralizing plasma was applied to neutralize the box net charge [65]. We used a Van der Waals cutoff of 8Å, SHAKE algorithm [66] on hydrogen atoms allowed a time step of 2.0fs. Simulations were performed at 300K maintained by the Langevin thermostat [67].
After minimization with harmonic constraints on protein heavy atoms, the system was gradually heated to 300K in NVT ensemble. As last equilibration step an unrestrained density equilibration over 1ns was performed. For more details on the extensive equilibration protocol including restraint settings and details on heating and cooling steps see Wallnoefer et al. [54].
After equilibration 500 million unrestrained sampling steps were run, equivalent to a total simulation time of 1μs for each system. We saved 50,000 equally spaced snapshots to trajectory for later analysis. Real simulation time per system was approximately four weeks on single GeForce GTX 680 GPUs.
Analysis of molecular dynamics simulations
Analyses of trajectories were carried out using ptraj and cpptraj [68] from AmberTools [59]. To check stability and convergence of our simulations we generated two-dimensional RMSD plots of Cα-atom positions in the binding site regions (see Table A in S1 File for a list of included residues).
The molecular dynamics simulation of unbound thrombin sampled a conformational space near the native X-ray structure with plateau RMSD values below 2.5Å for Cα positions of the binding site. The simulation of thrombin in complex with fibrinogen remained even closer to the input structure with RMSD values below 1.5Å. Two-dimensional RMSD plots between the holo and the complex simulation show that overlapping portions of conformational space were sampled. Removal of the substrates from the active site of thrombin did not significantly alter sampled configurations within the simulation time scale of 1μs, as Cα RMSD values between snapshots from both simulations remained below 3Å (see Figure C in S1 File for two-dimensional RMSD-plots).
Global flexibility in the protein was captured as residue-wise B-factors for Cα-atoms after a single global alignment. Apart from global alignments, flexibility metrics can be defined based on internal coordinates. The backbone dihedral angles φ and ψ define the local backbone conformation of a protein [69]. The distribution of these angles during the course of a molecular dynamics trajectory allows to infer a state probability function and subsequently calculate a local backbone entropy. Nonparametric kernel density estimation was applied in order to obtain a state probability distribution function as demonstrated previously [70]. To avoid a decrease of the kernel at boundaries when estimating the density, data around -180°/180° is periodically duplicated.
Integration according to Sα = —R ∫ p(α) ln p(α) dα yields thermodynamic entropy arising from conformational flexibility in the degree of freedom α [36]. This metric gives an upper bond for total backbone entropy, as correlation between the backbone angles is not accounted for due to the fact that a one-dimensional probability density function is obtained. Total thermodynamic entropy is lower as correlation of individual degrees of freedom restricts the overall conformational space. We calculated independent entropies for both dihedral angles φ and ψ as Sφ and Sψ. Ordered states correspond to low entropies. Therefore, a single dihedral peak with a width of 1° yields an entropy of zero (rigid region), whereas disordered states yield positive values (flexible region).
All residue-wise flexibility metrics were grouped into sub-pockets and arithmetic averages were calculated. Pocket-wise flexibilities were mapped to the starting structures and visualized using Pymol [71].
To support emerging correlations between binding site rigidity and substrate specificity, we performed a statistical analysis of the molecular dynamics trajectories by splitting them in ten parts (100ns each). The calculated global backbone flexibility (see Figure D in S1 File) and dihedral entropy (see Figures E and F in S1 File) all presented S1 and S2 as most rigid parts of the thrombin binding site. Pockets S3-S6 are more flexible and show varying rankings of these four sub-pockets.
We used Grid Inhomogeneous Solvation Theory to calculate hydration thermodynamics from simulation trajectories [72,42,73]. We extracted 5,000 equal-spaced snapshots from our trajectory for analysis on a grid with 6Å edge length and default 0.5Å spacing centered on the center of mass of respective sub-pocket residues.
To capture interactions between the substrate peptide and the protease, we extracted ensemble-averaged hydrogen bond occupancies from the complex simulations and grouped them into the respective protease sub-pockets. We applied default definitions for hydrogen bonds. Thus, the maximum heavy atom distance was set to 3.0Å, the maximum angle between donor, hydrogen and acceptor to 45°.
Supporting Information
Rank-based binding site mappings, two-dimensional RMSD plots of simulations, trajectory splitting analyses for different flexibility metrics, a definition of the binding pockets, as well as detailed results for the simulation of the fibrinogen-derived thrombin complex and the PAR1-derived complex are available as Supporting Information.
Supporting Information
Acknowledgments
Julian E. Fuchs and Roland G. Huber thank the Austrian Academy of Sciences for a DOC-fellowship at the Institute of General, Inorganic and Theoretical Chemistry at University of Innsbruck. Klaus R. Liedl gratefully acknowledges the Austrian Science Fund (FWF) for the grant: Austrian Science Fund (FWF): P23051.
Data Availability
All relevant data are within the paper and its Supporting Information file.
Funding Statement
The work was supported by the following: Austrian Science Fund, P23051, www.fwf.ac.at and Austrian Academy of Sciences, DOC-grant to JEF, RGH, BJW, www.oeaw.ac.at. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Nooren I, Thornton JM. (2003) Structural characterisation and functional significance of transient protein–protein interactions. J Mol Biol 325: 991–1018. [DOI] [PubMed] [Google Scholar]
- 2. Fischer E. (1894) Einfluss der Configuration auf die Wirkung der Enzyme. Berichte Dtsch Chem Ges 27: 2985–2993. [Google Scholar]
- 3. Koshland D.(1959) Enzyme flexibility and enzyme action. J Cell Comp Physiol 54: 245–258. [DOI] [PubMed] [Google Scholar]
- 4. Boehr DD, Nussinov R, Wright PE. (2009) The role of dynamic conformational ensembles in biomolecular recognition. Nat Chem Biol 5: 789–796. 10.1038/nchembio.232 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Ma B, Nussinov R. (2010) Enzyme dynamics point to stepwise conformational selection in catalysis. Curr Opin Chem Biol 14: 652–659. 10.1016/j.cbpa.2010.08.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Kumar S, Ma B, Tsai C, Sinha N, Nussinov R. (2009) Folding and binding cascades: dynamic landscapes and population shifts. Protein Sci 9: 10–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Changeux J-P, Edelstein S. (2011) Conformational selection or induced fit? 50 years of debate resolved. F1000 Biol Rep 3: 19 10.3410/B3-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Otlewski J, Jelen F, Zakrzewska M, Oleksy A. (2005) The many faces of protease–protein inhibitor interaction. EMBO J 24: 1303–1310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Perona JJ, Craik CS. (1995) Structural basis of substrate specificity in the serine proteases. Protein Sci 4: 337–360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. López-Otín C, Overall CM. (2002) Protease degradomics: a new challenge for proteomics. Nat. Rev Mol Cell Biol 3: 509–519. [DOI] [PubMed] [Google Scholar]
- 11. Schechter I, Berger A. (1967) Protease subsite nomenclature. Biochem Biophys Res Commun 27: 157–162. [DOI] [PubMed] [Google Scholar]
- 12. Poreba M, Drag M. (2010) Current strategies for probing substrate specificity of proteases. Curr Med Chem 17: 3968–3995. [DOI] [PubMed] [Google Scholar]
- 13. Rawlings ND, Waller M, Barrett AJ, Bateman A. (2014) MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res 42: D503–D509. 10.1093/nar/gkt953 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Fuchs JE, von Grafenstein S, Huber RG, Kramer C, Liedl KR. (2013) Substrate-Driven Mapping of the Degradome by Comparison of Sequence Logos. PLOS Comput Biol 9: e1003353 10.1371/journal.pcbi.1003353 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Fuchs JE, Liedl KR. (2013) Substrate sequences tell similar stories as binding cavities: commentary. J Chem Inf Model 53: 3115–3116. 10.1021/ci4005783 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Fuchs JE, von Grafenstein S, Huber RG, Margreiter MA, Spitzer GM, Wallnoefer HG et al. (2013) Cleavage Entropy as Quantitative Measure of Protease Specificity. PLoS Comput Biol 9: e1003007 10.1371/journal.pcbi.1003007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Hedstrom L. (2002) Serine protease mechanism and specificity. Chem Rev 102: 4501–4524. [DOI] [PubMed] [Google Scholar]
- 18. Page M, MacGillivray R, Di Cera E. (2005) Determinants of specificity in coagulation proteases. J Thromb Haemost 3: 2401–2408. [DOI] [PubMed] [Google Scholar]
- 19. Kromann-Hansen T, Lund IK, Liu Z, Andreasen PA, Hoyer-Hansen G, Sorensen HP. (2013) Allosteric Inactivation of a Trypsin-Like Serine Protease by An Antibody Binding to the 37-and 70-Loops. Biochemistry 52: 7114–7126. 10.1021/bi400491k [DOI] [PubMed] [Google Scholar]
- 20. Vogt AD, Di Cera E. (2013) Conformational Selection Is a Dominant Mechanism of Ligand Binding. Biochemistry 52: 5723–5729. 10.1021/bi400929b [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Pozzi N, Chen Z, Zapata F, Niu W, Barranco-Medina S, Pelc LA et al. (2013) Autoactivation of thrombin precursors. J Biol Chem 288: 11601–11610. 10.1074/jbc.M113.451542 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Huntington J. (2005) Molecular recognition mechanisms of thrombin. J Thromb Haemost 3: 1861–1872. [DOI] [PubMed] [Google Scholar]
- 23. Di Cera E. (2008) Thrombin. Mol Aspects Med 29: 203–254. 10.1016/j.mam.2008.01.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Bode W, Mayr I, Baumann U, Huber R, Stone SR, Hofsteenge J. (1989) The refined 1.9 A crystal structure of human alpha-thrombin: interaction with D-Phe-Pro-Arg chloromethylketone and significance of the Tyr-Pro-Pro-Trp insertion segment. EMBO J 8: 3467–3475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Lechtenberg BC, Johnson DJ, Freund SM, Huntington JA. (2010) NMR resonance assignments of thrombin reveal the conformational and dynamic effects of ligation. Proc Natl Acad Sci 107: 14087–14092. 10.1073/pnas.1005255107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Dang O, Vindigni A, Di Cera E. (1995) An allosteric switch controls the procoagulant and anticoagulant activities of thrombin. Proc Natl Acad Sci 92: 5977–5981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Vindigni A., Di Cera E. (1996) Release of fibrinopeptides by the slow and fast forms of thrombin. Biochemistry 35: 4417–4426. [DOI] [PubMed] [Google Scholar]
- 28. Di Cera E, Guinto ER, Vindigni A, Dang QD, Ayala YM, Wuyi M, et al. (1995) The Na+ binding site of thrombin. J Biol Chem 270: 22089–22092. [DOI] [PubMed] [Google Scholar]
- 29. Di Cera E. (2009) Serine Proteases. IUBUMB Life 61: 510–515. 10.1002/iub.186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Pozsgay M, Szabo G, Bajusz S, Simonsson R, Gaspar R, Elodi P. (1981) Study of the Specificity of Thrombin with Tripeptidyl-p-nitroanilide Substrates. Eur J Biochem 115: 491–495. [DOI] [PubMed] [Google Scholar]
- 31. Chang J-Y. (1985) Thrombin specificity. Eur J Biochem 151: 217–224. [DOI] [PubMed] [Google Scholar]
- 32. Newell-Caito JL, Griffiths AE, Fay PJ. (2012) P3–P3′ Residues Flanking Scissile Bonds in Factor VIII Modulate Rates of Substrate Cleavage and Procofactor Activation by Thrombin. Biochemistry 51: 3451–3459. 10.1021/bi300070z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Engh RA, Brandstetter H, Sucher G, Eichinger A, Baumann U, Bode W et al. (1996) Enzyme flexibility, solvent and “weak”interactions characterize thrombin–ligand interactions: implications for drug design. Structure 4: 1353–1362. [DOI] [PubMed] [Google Scholar]
- 34. Baum B, Mohamed M, Zayed M, Gerlach C, Heine A, Hangauer D et al. (2009) More than a simple lipophilic contact: a detailed thermodynamic analysis of nonbasic residues in the S1 pocket of thrombin. J Mol Biol 390: 56–69. 10.1016/j.jmb.2009.04.051 [DOI] [PubMed] [Google Scholar]
- 35. Biela A, Khayat M, Tan H, Kong J, Heine A, Hangauer D et al. (2012) Impact of ligand and protein desolvation on ligand binding to the S1 pocket of thrombin. J Mol Biol 418: 350–366. 10.1016/j.jmb.2012.01.054 [DOI] [PubMed] [Google Scholar]
- 36. Huber RG, Eibl C, Fuchs JE. (2015) Intrinsic Flexibility of NLRP Pyrin Domains is a Key Factor in their Conformational Dynamics, Fold Stability and Dimerization. Protein Sci 24: 174–181. 10.1002/pro.2601 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Rosenkilde MM, Schwartz TW. (2006) GluVII:06—a highly conserved and selective anchor point for non-peptide ligands in chemokine receptors. Curr Top Med Chem 6: 1319–1333. [DOI] [PubMed] [Google Scholar]
- 38. Nussinov R, Tsai CJ. (2014) Unraveling structural mechanisms of allosteric drug action. Trends Pharmacol Sci 35: 256–264. 10.1016/j.tips.2014.03.006 [DOI] [PubMed] [Google Scholar]
- 39. Wells JA, McClendon CL. (2007) Reaching for high-hanging fruit in drug discovery at protein–protein interfaces. Nature 450: 1001–1009. [DOI] [PubMed] [Google Scholar]
- 40. Schilling O, Overall CM. (2008) Proteome-derived, database-searchable peptide libraries for identifying protease cleavage sites. Nat Biotechnol 26: 685–694. 10.1038/nbt1408 [DOI] [PubMed] [Google Scholar]
- 41. Papoian GA, Ulander J, Wolynes PG. (2003) Role of water mediated interactions in protein-protein recognition landscapes. J Am Chem Soc 125: 9170–9178. [DOI] [PubMed] [Google Scholar]
- 42. Nguyen CN, Kurtzman Young T, Gilson MK. (2012) Grid inhomogeneous solvation theory: hydration structure and thermodynamics of the miniature receptor cucurbit[7]uril. J Chem Phys 137: 044101 10.1063/1.4733951 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Case DA, Babin V, Berryman JT, Betz RM, Cai Q, Cerutti DS et al. (2014) Reference Manual Amber 14: Molecular dynamics package (UCSF). [Google Scholar]
- 44. Mace JE, Wilk BJ, Agard DA. (1995) Functional linkage between the active site of α-lytic protease and distant regions of structure: Scanning alanine mutagenesis of a surface loop affects activity and substrate specificity. J Mol Biol 251: 116–134. [DOI] [PubMed] [Google Scholar]
- 45. Mace JE, Agard DA. (1995) Kinetic and structural characterization of mutations of glycine 216 in α-lytic protease: a new target for engineering substrate specificity. J Mol Biol 254: 720–736. [DOI] [PubMed] [Google Scholar]
- 46. Mulder FA, Schipper D, Bott R, Boelens R. (1999) Altered flexibility in the substrate-binding site of related native and engineered high-alkaline Bacillus subtilisins. J Mol Biol 292: 111–123. [DOI] [PubMed] [Google Scholar]
- 47. Wallnoefer HG, Lingott T, Gutiérrez JM, Merfort I, Liedl KR. (2010) Backbone Flexibility Controls the Activity and Specificity of a Protein− Protein Interface: Specificity in Snake Venom Metalloproteases. J Am Chem Soc 132: 10330–10337. 10.1021/ja909908y [DOI] [PubMed] [Google Scholar]
- 48. Fuchs JE, von Grafenstein S, Huber RG, Wallnoefer HG, Liedl KR. (2014) Specificity of a protein–protein interface: Local dynamics direct substrate recognition of effector caspases. Proteins 82: 546–555. 10.1002/prot.24417 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Huntington JA. (2012) Thrombin plasticity. Biochim. Biophys Acta BBA-Proteins Proteomics 1824: 246–252. 10.1016/j.bbapap.2011.07.005 [DOI] [PubMed] [Google Scholar]
- 50. Vogt AD, Di Cera E. (2012) Conformational selection or induced fit? A critical appraisal of the kinetic mechanism. Biochemistry 51: 5894–5902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Beauming T, Farid R, Sherman W. (2009) High-energy water sites determine peptide binding affinity and specificity of PDZ domains. Protein Sci 18: 1609–1619. 10.1002/pro.177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Munz M, Hein J, Biggin PC. (2012) The role of flexibility and conformational selection in the binding promiscuity of PDZ domains. PLOS Comput Biol 8: e1002749 10.1371/journal.pcbi.1002749 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Stubbs MT, Oschkinat H, Mayr I, Huber R, Angliker H, Stone SR et al. (1992) The interaction of thrombin with fibrinogen. Eur J Biochem 206: 187–195. [DOI] [PubMed] [Google Scholar]
- 54. Wallnoefer HG, Handschuh S, Liedl KR, Fox T. (2010) Stabilizing of a globular protein by a highly complex water network: a molecular dynamics simulation study on factor Xa. J Phys Chem B 114: 7405–7412. 10.1021/jp101654g [DOI] [PubMed] [Google Scholar]
- 55. Wallnoefer HG, Liedl KR, Fox T. (2011) A challenging system: Free energy prediction for factor Xa. J Comput Chem 32: 1743–1752. 10.1002/jcc.21758 [DOI] [PubMed] [Google Scholar]
- 56. Labute P. (2009) Protonate3D: assignment of ionization states and hydrogen coordinates to macromolecular structures. Proteins 75: 187–205. 10.1002/prot.22234 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Zhang E, Tulinsky A. (1997) The molecular environment of the Na+ binding site of thrombin. Biophys Chem 63, 185–200. [DOI] [PubMed] [Google Scholar]
- 58. Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. (1983) Comparison of simple potential functions for simulating liquid water. J Chem Phys 79: 926. [Google Scholar]
- 59. Case DA, Darden TA, Cheatham TE III, Simmerling CL, Wang J, Duke RR et al. (2012) Amber 12: Molecular dynamics package (UCSF). [Google Scholar]
- 60. Sadasivan C, Yee VC. (2000) Interaction of the Factor XIII activation peptide with α-thrombin crystal structure of its enzyme-substrate analog complex. J Biol Chem 275: 36942–36948. [DOI] [PubMed] [Google Scholar]
- 61. Gandhi PS, Chen Z, Di Cera E. (2010) Crystal structure of human thrombin mutant S195A in complex with the extracellular fragment of human PAR1. J. Biol. Chem. 285: 15393–15398. 10.1074/jbc.M110.115337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Hartley BS. (1964) Amino acid sequence of bovine chymotrypsinogen-A. Nature 201: 1284–1287. [DOI] [PubMed] [Google Scholar]
- 63. Salomon-Ferrer R, Götz AW, Poole D, Le Grand S, Walker RC. (2013) Routine microsecond molecular dynamics simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald. J Chem Theory Comput 9: 3878–3888. [DOI] [PubMed] [Google Scholar]
- 64. Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO et al. (2010) Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins 78: 1950–1958. 10.1002/prot.22711 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Darden T, York D, Pedersen L. (1993) Particle mesh Ewald: An N⋅ log (N) method for Ewald sums in large systems. J Chem Phys 98: 10089–10092. [Google Scholar]
- 66. Ciccotti G, Ryckaert J-P. (1986) Molecular dynamics simulation of rigid molecules. Comput Phys Rep 4: 346–392. [Google Scholar]
- 67. Adelman S. (1976) Quantum generalized Langevin equation approach to gas/solid collisions. Chem Phys Lett 40: 495–499. [Google Scholar]
- 68. Roe DR, Cheatham TE III. (2013) PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J Chem Theory Comput 9: 3084–4095. [DOI] [PubMed] [Google Scholar]
- 69. Ramachandran G, Ramakrishnan C, Sasisekharan V. (1963) Stereochemistry of polypeptide chain configurations. J Mol Biol 7: 95–99. [DOI] [PubMed] [Google Scholar]
- 70. Huber RG, Fuchs JE, von Grafenstein S, Laner M, Wallnoefer HG, Abdelkader N et al. (2013) Entropy from State Probabilities: Hydration Entropy of Cations. J Phys Chem B 117: 6466–6672. 10.1021/jp311418q [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. DeLano W. (2008) The PyMOL Molecular Graphics System (Palo Alto, CA: DeLano Scientific LLC; ). [Google Scholar]
- 72. Lazaridis T.(1998) Inhomogeneous fluid approach to solvation thermodynamics. I. Theory. J Phys Chem B 102: 3531–3541. [Google Scholar]
- 73. Nguyen CN, Cruz A, Gilson MK, Kurtzman T. (2014) Thermodynamics of Water in an Enzyme Active Site: Grid-Based Hydration Analysis of Coagulation Factor Xa. J Chem Theory Comput 10: 2769–2780. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information file.