Abstract
Catalytic loop motions facilitate substrate recognition and binding in many enzymes. While these motions appear to be highly flexible, their functional significance suggests that structure-encoded preferences may play a role in selecting particular mechanisms of motions. We performed an extensive study on a set of enzymes to assess whether the collective/global dynamics, as predicted by elastic network models (ENMs), facilitates or even defines the local motions undergone by functional loops. Our dataset includes a total of 117 crystal structures for ten enzymes of different sizes and oligomerization states. Each enzyme contains a specific functional/catalytic loop (10–21 residues long) that closes over the active site during catalysis. Principal component analysis (PCA) of the available crystal structures (including apo and ligand-bound forms) for each enzyme revealed the dominant conformational changes taking place in these loops upon substrate binding. These experimentally observed loop reconfigurations are shown to be predominantly driven by energetically favored modes of motion intrinsically accessible to the enzyme in the absence of its substrate. The analysis suggests that robust global modes cooperatively defined by the overall enzyme architecture also entail local components that assist in suitable opening/closure of the catalytic loop over the active site.
Author Summary
Protein loops have critical roles in ligand binding and catalysis. An unresolved issue in this context is the extent to which the intrinsic dynamics of proteins predispose loops to perform their molecular function. In this work, we (i) critically examine the structural changes undergone by functional/catalytic loops based on a set of enzyme crystal structures in the presence/absence of a ligand, and (ii) examine to what extent those motions are correlated with, or driven by, the global modes that are predictable using simplified, physics-based models. Using a dataset of 117 structures for ten enzymes of different sizes and oligomerization states, we show that the collective modes defined by the protein topology favor loop rearrangements in reasonable agreement with those experimentally observed upon activation. These results suggest that simple but robust motions encoded by the entire architecture, not the local binding site only, assist in binding of the ligand, positioning of the catalytic loop, and/or sequestration of the catalytic site, which in turn, enable efficient catalysis.
Introduction
An issue yet to be resolved is the extent to which the intrinsic dynamics of proteins predispose them to ligand binding. Is there any correlation between local functional events such as loop rearrangements involved in ligand binding and the collective motions intrinsically accessible to the protein prior to ligand binding? To what extent do the structure-encoded global modes of motions (e.g. domain opening/closing, exposure or burial of active sites, cooperative conformational switches in allosteric proteins) simultaneously engage loop motions that facilitate functional interactions? Or, are loop reconfigurations mainly induced on a local scale by the ligand?
Notably, two different views have been advanced in recent years in linking protein dynamics and function: (i) enzyme structural flexibility affects its catalytic reactivity [1]–[4], (ii) catalysis is independent of collective dynamics [5]–[7]. The second view is supported by the limited mobility of catalytic residues in the collective motions of the protein (due to the requirement of precise positioning for chemical reactivity). Recent studies show that the preorganization of the active site is a rate-limiting factor in catalysis, while conformational dynamics help reorganize structural elements near the catalytic site [8].
The global motions of enzymes, also called slowest or softest due to their low frequency or small effective force constants, have been shown in numerous studies [9], [10] to be robustly defined by the evolutionarily selected fold. It is conceivable that these structure-encoded modes play a role in facilitating the enzymatic activity, for example, by favoring structural changes that enable efficient recognition and binding of the substrate/ligand [11]. There is experimental evidence that the loss of conformational motion affects the enzymatic mechanism, even though the structure and electrostatics are preserved [4], while recent work showed that electrostatic preorganization, not conformational motions, makes the largest contribution to catalysis [7]. Our examination of triosephosphate isomerase (TIM) collective dynamics suggests that there is a coupling between the global dynamics of the molecule and the local motions of the catalytically active loop 6 [12], [13]. As illustrated in the Supporting Information (SI) Figure S1, the experimentally observed closure of loop 6 over the ligand is in accord with the essential/principal mode of motion observed in molecular dynamics (MD) simulations of TIM; furthermore this first principal mode extracted from MD by essential dynamics analysis (EDA) [14] is in agreement with the global (softest, lowest frequency) mode predicted for the dimer using the anisotropic network model (ANM) [15], [16]. Collective monomeric counter-rotations, which are not evident in experimental data, appear to be coupled to the functional loop's opening/closure over the active site. Moreover, experiments for TIM indicate that loop closure is not ligand-gated and emerges as an intrinsic motion of the apo enzyme [17]. While these observations signal a role of global dynamics in facilitating functional loop motions, there has been no systematic study of enzyme dynamics in relation to loop motions to establish the generality of these observations, apart from a recent study by Jernigan and coworkers where attention has been invited to the importance of slow modes for functional loop motions [18].
With the rapid accumulation of both apo and liganded structures (usually open and closed forms, respectively) for a given protein in the Protein Data Bank (PDB) [19], and with the development of analytical models and tools for rapid estimation of intrinsic dynamics, we are now in a position to (i) critically examine the structural changes undergone in recognition loops and/or catalytic sites based on structurally resolved proteins in the presence/absence of a ligand and (ii) examine to what extent those motions are correlated with, or driven by, the global modes that are predictable using simplified, physics-based models.
To this aim, we focus on a series of enzymes, where loop motions relevant to function have been experimentally detected (Table 1). In each case, we have a set of structures containing the apo and ligand-bound forms, which differ particularly in their loop regions. As listed in Table 1, the root-mean-square deviation (RMSD) between the open and closed forms varies in the range 0.9–3.9 Å (after optimal superposition of the open and closed structures to eliminate rigid-body translations and rotations), while the loop RMSD varies between 3.5 and 14.5 Å; and the tip residues of the loops are displaced by 6.7 to 25.0 Å between the open and closed conformations. On the other hand, the internal RMSDs of the loops, obtained after structural alignment of the isolated loops, are lower than 5.5 Å (Table 1), suggesting that the large displacements of the loops on the proteins are to a large extent due to the rigid-body displacements, which may be coupled to the collective motions of the enzymes. Notably, four out of ten enzymes (TIM, protein tyrosine phosphatase (PTP), L-lactate dehydrogenase and 3-dehydroquinase) exhibit almost a rigid-lid type closure with a loop internal RMSD less than 2 Å (Table 1).
Table 1. Dataset of enzymes with functional loops that close over the active site.
RMSD (Å) | ||||||
Enzyme | Number of residuesa | Loop residues | Overall proteinb | Loopc | Isolated loopd | Loop tip motion (Å) |
Protein Tyr phosphatase (PTP) | 305 | 352–361 | 0.9 | 3.5 | 1.0 | 7.0 |
HhaI methyltransferase | 327 | 80–100 | 3.9 | 14.5 | 5.3 | 25.0 |
OMP decarboxylase | 267 (x2) | 203–218 | 2.7 | 9.6 | 4.0 | 14.7 |
β 1,4-galactosyltransferase | 288 | 345–365 | 3.4 | 11.6 | 5.5 | 21.5 |
L-lactate dehydrogenase | 317 (x4) | 81–91 | 0.9 | 4.3 | 1.0 | 6.7 |
3-dehydroquinase | 252 (x2) | 227–239 | 1.2 | 5.1 | 1.6 | 10.1 |
Biphosphate aldoase | 307 (x2) | 176–191 | 1.8 | 7.2 | 3.3 | 16.5 |
Triosephosphate isomerase (TIM) | 248 (x2) | 166–176 | 1.0 | 4.5 | 1.1 | 7.9 |
Enolase | 436 (x2) | 34–50 | 0.9 | 3.7 | 2.8 | 8.8 |
Pyruvate mutase | 295 (x4) | 118–134 | 2.4 | 9.7 | 4.2 | 18.0 |
Oligomerization state (number of monomers) is specified in parentheses.
RMSD between the representative open (apo) and closed (liganded) structures listed in Table S1 in Text S1, based on Cα atoms.
RMSD of the loop region calculated after aligning the same apo and bound structures using the Cα atoms. The value is reported for the loop in chain A for multimers.
RMSD obtained after superimposition of the loop region only.
The approach we undertake is the following: (i) to determine the dominant conformational changes of the functional loops by performing a principal component analysis (PCA) of the available crystal structures for each enzyme as well as by direct examination of two structures representatives of the open and closed forms (see Table S1 in Text S1), (ii) to determine collective modes of motion of a representative unliganded member using the ANM, and (iii) to examine the overlap between functional loop reconfiguration derived from experimental data and structure-based motions predicted by the ANM, as explained in previous work [11]. Additionally, we will extract essential motions from MD simulations for PTP and TIM as two case studies to further establish the correlation, if any, between computationally predicted loop motions, and those experimentally observed. We will show that a few well-defined, energetically accessible collective modes of motions encoded by the entire architecture, not by the local binding site only, favor suitable repositioning of the catalytic loop, which in turn, enable the predisposition of the active site to catalytic activity.
Results
Overview of dataset, method of approach, and results
Calculations were performed for a dataset of 117 structures from the PDB corresponding to 10 enzymes (Tables 1 and S1), with 2 to 28 structures resolved in different forms per enzyme. Among them, HhaI methyltranferase (M. HhaI) is a DNA-binding enzyme; and all others bind ligands of various sizes. They contain s = 10–21 residue long loops that close over the active site during reaction. By this means, a catalytic residue located on the loop is correctly positioned in the active site and the site is protected from solvent during catalysis.
We compare two sets of data generated for each enzyme: experimental, derived from the structures known for the enzyme; and computational, predicted for a representative unliganded structure (indicated as open structure in Table S1 in Text S1). Of interest is to assess the correspondence, if any, between the experimentally observed (local) loop motions, and the predicted loop motion as driven by the soft (global) ANM modes. As a metric, we use the overlap O1j≡|p (1). u (j)| between the dominant motion inferred from experiments (expressed by 3N-dimensional unit directional vector, p (1), also called PC1 if obtained by PCA or deformation vector if calculated from the difference between open and closed forms; see Methods) and the jth eigenmode u (j) predicted by the ANM. O1j varies by definition in the range [0, 1]. An overlap close to 1 means that the experimentally observed structural change is essentially driven by the mode j. Another metric is the cumulative overlap, a summation over a subset of p modes (see Methods), describing the fractional contribution of p modes to the (experimentally) observed deformation.
Figure S2 displays the O1j values for the slowest 40 modes (bars) and their cumulative overlap (curve) for each enzyme. In six out of ten enzymes, there is at least one mode with an O1j>0.4, and a cumulative overlap of 0.7 or higher is attained in 7/10 cases, suggesting that the soft modes facilitate, if not enable, functional loop motions.
We further made a direct assessment of the orientational correlation between the loop motions observed in experiments and those predicted by computations. To this aim, we evaluated the correlation cosine , between the 3s-dimensional subvectors p s (1) and u s (j) corresponding to the loop regions of p (1)and u (j). O1jloop will be shortly called loop overlap. Table 2 shows that a loop overlap of 0.57≤O1jloop≤0.86 is achieved by at least one mode (among the softest 10; written in parentheses) in each examined enzyme (column 2). Column 3 lists the softest mode that yields a loop overlap higher than 0.5; and column 4, the modes, among the softest 10, that yield a loop overlap of 0.5 or higher.
Table 2. Comparison of experimentally observed and theoretically predicted loop motions.
Normalized loop overlap (orientational) | Enhancement factor for weighted-average loop overlapd | |||
Enzyme | Highest overlapa | Slowest mode withb O≥0.5 | Slow modes withc O≥0.5 | <O|1–10> |
PTP | 0.72 (6) | 0.51 (2) | 2, 6 | 1.8 |
HhaI methyltransferase | 0.82 (1) | 0.82 (1) | 1, 4, 7, 9–10 | 5.2 |
OMP decarboxylase | 0.79 (7) | 0.72 (1) | 1, 6–9 | 11.6 |
β 1,4-galactosyltransferase | 0.63 (4) | 0.55 (1) | 1, 4 | 7.1 |
L-lactate dehydrogenase | 0.74 (1) | 0.74 (1) | 1–4, 7–9 | 16.1 |
3-dehydroquinase | 0.76 (10) | 0.65 (5) | 5–7, 10 | 2.5 |
Biphosphate aldoase | 0.61 (2) | 0.61 (2) | 2, 6, 8–9 | 3.7 |
TIM | 0.78 (10) | 0.64 (4) | 4, 7, 10 | 3.8 |
Enolase | 0.57 (5) | 0.54 (3) | 3–5 | 2.6 |
Pyruvate mutase | 0.69 (8) | 0.69 (8) | 8 | 5.2 |
Overlap between loop reconfiguration observed in experimental structures and the structural change predicted for the loop by the ANM.
Highest overlap achieved by a single mode and corresponding mode number in parentheses.
Slowest ANM mode (in parentheses) that shows an overlap greater than 0.5, and the corresponding overlap.
Slow modes that show overlap greater or equal to 0.5.
The ratio <O|loop>pANM/<O|loop>prandom (see Eq. 1 and main text).
We also calculated the weighted-average overlaps, <O|s>p, averaged over p = 10 modes (see Eq. 1 in Methods) evaluated for segments of s consecutive residues. Figure S3 displays <O|s>p for the catalytic loop (s-residue long), calculated for successive sets of 10 modes (shifting windows along the abscissa of 3N-6 ANM modes). A general trend of decreasing loop overlap with increasing mode number is observed for all enzymes.
As a further test, we compared weighted-average loop overlaps, calculated for the softest 10 modes, to those accomplished by randomly generated modes. To that aim, we evaluated the difference Δ<O|s>p = <O|s>pANM−<O|s>prandom at the loop region of s residues, and repeated the calculations for all successive windows of s residues along the protein sequence. The goal was to test whether the resulting ‘difference profiles’ as a function of residue (sliding window) index would distinguish the loop regions as regions of high overlap with ANM softest modes (e.g. p = 10 of them). The difference profiles presented in Figure S4 clearly indicate that for the most part the catalytic loop regions (the positions of which along the sequences are indicated by red stars and dashed vertical lines) are distinguished by their high overlap with slow modes, in support of the correlation between structure-encoded soft modes and functional loop reconfigurations. The last column in Table 2 shows that the enhancement factor calculated as the ratio <O|s = loop>pANM/<O|s = loop>prandom. Notably, the enhancement factor varies between 1.8–16.1, with PTP exhibiting the smallest enhancement, and L-lactate dehydrogenase, the largest.
In summary, in each studied protein, at least one of the top-ranking (energetically favorable) 10 collective modes predicted by the ANM yields a high loop overlap, and the weighted-average overlap achieved by these soft modes at the loop region is enhanced by a factor of 6.0 on average (over 10 proteins) compared to randomly generated modes. These data further support the view that the seemingly ‘local’ loop reconfigurations inferred from experimental data are not decoupled from the global modes intrinsically encoded by the overall structure. On the contrary, global modes generally exhibit higher overlaps with the functional loop motions than local (high frequency) modes (Figure S3) or random modes (Figure S4).
Below we describe in more details the results for four enzymes.
Protein Tyrosine Phosphatase (PTP)
Protein tyrosine phosphatases form a superfamily of enzymes that regulate the tyrosine phosphorylation levels in signal transduction pathways together with the action of protein tyrosine kinases. Specifically, PTPs catalyze the hydrolysis of phosphate moiety in phosphotyrosine-containing proteins. Class I cytoplasmic PTPs include human PTP (PTP1B) and Yersinia PTP (YopH), which show low sequence identity (∼20%) [20]–[22], but share a structurally conserved catalytic domain of ∼280 residues [21], namely an eight-stranded mixed β-sheet wrapped by seven α-helices (Figure 1A). One important feature of PTPs is the WPD loop, which carries the conserved catalytic residue Asp356 (in YopH) and closes over the active site upon binding of the substrate (open and closed conformations of the loop are shown in Figure 1A). Similar to TIM, loop closure correctly positions the functional residues around the ligand and shields the site from bulk solvent during catalysis [21]. Standard and targeted MD simulations [23], [24] on PTP1B have identified important regions (S-loop, R-loop) that are possibly related to ligand binding and closure of the WPD loop, respectively (Figure 1A).
PCA results from experiments
Our ensemble consists of 16 PTP crystal structures (Table S1 in Text S1). The first PCA mode, which corresponds to 53% of the total variance (Table S2 in Text S1), describes mainly the variation of the WPD loop between open and closed conformations. Interestingly, the other solvent-exposed loops and helices do not exhibit any motion in the principal modes, consistent with the conformations shown in Figure 1A. Experiments for PTP have indicated that loop closure is not ligand-gated and emerges as an intrinsic motion of the apo enzyme [25]. So our question here is whether the WPD loop closure described by PC1 is coupled to the collective motions of PTP.
Overlap with ANM predictions
The overlap between experimental data (PC1) and the ANM modes is calculated to detect any correlation between loop closure and the slow modes. We performed ANM calculations for open structure with PDB identifier 1YPT [22]. ANM mode 2 (ANM2) yielded highest overall overlap (0.41, Figure S2) with PC1. We depict fluctuations along ANM2 using arrows (green) pointing in the loop closure direction in Figure 1B. In the close-up view, ANM2 arrows are compared to the loop closure observed in crystal structures (dark orange arrows). As to the WPD loop overlap, two out of 10 slowest modes exhibit correlation cosines above 0.5 (0.72 for mode 6, and 0.51 for mode 2; see Table 2). Further, slowest 10 ANM modes yield an overlap enhancement of 1.8 relative to random modes, which is actually the lowest enhancement factor in the dataset. Finally, cumulative overlap (see Methods) plots show that 20 modes (2.4% of all modes) explain 60% of the structural variance. (Figures 2A and S2). Note that slow modes 2 and 6 alone account for 45% of the variance.
MD results
We also performed a 50 ns MD run of the apo enzyme. Similar to ANM results, the highest correlation with PCA1 achieved by individual EDA modes is 0.44. However, the loop overlap values achieved by the essential modes at the low frequency region of the spectrum are considerably higher than those at the intermediate and high frequency ranges, as illustrated in Figure 1D. These results are in accord with previous 10 ns MD simulations on YopH [26]. Essential modes consistently show WPD loop's half-closure in apo form. In Figure 1C, the time evolution of the distance between active site residue Val407 and loop residue Gln357 is plotted, showing that the loop has a tendency to occasionally move towards the closed conformation (indicated by the blue horizontal bar), although it never reaches the closed form. Note that these runs are performed for the unliganded PTP so as to assess the conformational dynamics of the enzyme in the absence of ligand binding. The ‘incomplete’ closure of the loop in simulations may thus result from the lack of interactions, (mainly electrostatic, between the charged residues on the loop and the ligand) which would otherwise stabilize the closed state. In Figure 1E, snapshots from every 2.5 ns are shown to display halfway closed loop configurations from the simulation. Overall, an intrinsic feature of PTP functional dynamics is the flexibility of WPD loop to fluctuate in the functional direction, confirmed by both ANM and MD results.
HhaI methyltransferase (M.HhaI)
M.HhaI catalyzes the methylation of cytosine residues located in specific DNA sequences with the aid of a cofactor (S-adenosyl-L-homocysteine). M.HhaI is a monomeric enzyme, which positions the DNA between its large (Rossmann fold) and small domains [27] (Figure 3A). The catalytic nucleophile Cys81 is located on a long, flexible loop, with tip residue displaced by 25 Å when it binds the DNA [27]. This is accompanied by a flip of the target cytosine out of the DNA helix into the active site. The collective motions of the M.HhaI have been proposed to facilitate the base flipping observed in the ternary complex in a study using elastic network model [28].
PCA results
We analyzed an ensemble of 29 HhaI structures - two unbound, and the rest, ligand-bound. The closure of the loop (movement along PC1) explained 90% of the structural variability in the dataset and 8% of the remaining variability was attributed to the structural changes observed in the bound form of the enzyme.
ANM results
We performed ANM calculations for the holoenzyme structure [29] of HhaI after removing in silico the cofactor. The slowest ANM mode (ANM1) was found to couple the loop motion with small and large domain movements of HhaI (Figure 3B–C). The collectivity of this mode is 0.55 [30]. The residue fluctuations driven by ANM1 are compared to the experimental temperature factors in Figure 3D, which indicate close agreement specifically for the loop region, supporting the dominance of this soft mode in the observed fluctuations. This mode exhibits a loop overlap of 0.82 (Table 2). In addition, four other modes among the first ten present a loop overlap above 0.5. The cumulative overlap plot shows that 20 modes (2.3% of all modes) explain 80% of the structural variance (Figure 2A). When we consider 5 modes with high loop overlap, 69% of the variance is explained.
The weighted-average overlap of HhaI loop motion based on p = 10 softest modes indicates an enhancement factor of 5.2 over random modes (Table 2). The weighted-average overlap as a function of mode index is shown in Figure 3E, obtained by evaluating <O|s>p values (Eq. 1) for successive subsets (sliding windows) of p = 10 modes along the abscissa. The red arrow indicates the result for the first window composed of ten lowest frequency modes. The bars clearly display the distinctively high loop overlap achieved by the softest modes. The inset displays the overlap difference with respect to random, Δ<O|s>p, evaluated for successive segments of length s along the backbone sequence. The curve clearly captures the strong correlation at the functional loop (indicated by dashed vertical line).
Orotidine 5′-phosphate Decarboxylase (OMP decarboxylase)
Orotidine 5′-phosphate decarboxylase is a homodimeric enzyme with the classic TIM-barrel fold [31]. It catalyzes the conversion of orotidine 5-monophosphate (OMP) to uridine 5-monophosphate (UMP) in the biosynthesis of primidine nucleotides. The active site is located at the dimer interface. A flexible loop located at the C-terminal end is associated with substrate binding and release of the product in the last step of the reaction. The loop (Figure 4A) is in open conformation when it is ordered and in closed conformation at the active site contacting the ligand [31]. Hur and Bruice performed MD simulations and found that the loop changes conformation during the catalytic reaction [32].
PCA results
We performed PCA using the seven resolved crystal structures of the enzyme. PC1 and PC2 explain 71% and 26% of the variance, respectively.
ANM results
The slowest mode (Figure 4B–C) clearly drives the closure of the loop, exhibiting an overlap of 0.72 with the closure direction (Table 2). The seventh and eighth modes also have high overlap values of 0.79 and 0.7 (not shown). The functional loop is highly mobile in the first mode (Figure 4D). The comparison of residue fluctuations with the experimental temperature factors yields a correlation of 0.68. The cumulative overlap calculations show that 20 modes (1.2% of all modes) explain close to 90% of the structural variance (Figures 2A and S2) and when only those with high loop overlaps (listed in Table 2) are considered 81% of the variance is accounted for.
The weighted-average loop overlap for the 10 slowest modes exhibits a remarkable enhancement (of 11.6) over random modes (Table 2). The weighted-average loop overlap as a function of mode index clearly demonstrates the distinctive overlap achieved by the slowest 10 modes (see the peak indicated in Figure 4E). The overlap difference profile in the inset of Figure 4E also demonstrates how the experimental deformations at the loop regions of both subunits (two curves) correlate with those along soft modes.
Triosephosphate isomerase (TIM)
The homodimeric enzyme TIM plays a crucial role in the glycolytic pathway by catalyzing the interconversion of dihydroxyacetone phosphate and glyceraldehyde 3-phosphate. Each subunit adopts the TIM-barrel fold as in OMP decarboxylase. Loop 6 that carries a catalytic glutamate closes over the active site and protects it from solvent during catalysis. However, this loop closure is not ligand-gated, i.e. it also takes place in the apo state [17]. Aligned apo and ligand-bound structures of chicken TIM in Figure 5A indicate the conformational change in loop 6 (colored blue and magenta).
PCA results
We performed PCA using 15 resolved crystal structures of the enzyme. PC1 and PC2 explain 52% and 24% of the structural variance, respectively.
ANM results
The overall overlap values between PC1 and ANM modes (generated for the apo structure, 8TIM) are lower than 0.4 in general (see Figure S2). In this respect, TIM is one of the four cases (together with biphosphate aldoase, enolase and pyruvate mutase) where the overall structural change between the apo and liganded forms exhibits a relatively weak overlap with ANM soft modes. However, several soft modes contribute to loop motion. Figure 5B displays the loop reorientation driven by mode 1 (with loop overlap of 0.45). ANM calculations repeated with another apo structure, Trypanosoma cruzi TIM (TcTIM; PDB identifier 1TCD), corroborated those performed for chicken TIM (Figure S1D).
MD results
Our previous 60 ns MD simulation performed for chicken TIM [13] exhibited loop 6 closure, consistent with experimental data. Global deformations that were not apparent in crystal structures have been observed therein to accompany the loop closure. Specifically, the first mode from EDA of the trajectory (with 34% contribution to overall motion) revealed a counter-rotation of the two subunits accompanied by the proper closure of loop 6, again consistent with ANM results. Here we performed an independent MD simulation (100 ns long) on TcTIM apo structure and focused on the distance between the loop and a relatively immobile residue. The results plotted as a function of time (Figure 5C), show multiple opening/closure events in both subunits (only subunit A shown). Snapshots taken every 10 ns indicate various loop conformations between open and closed states (Figure 5D). EDA analysis further confirms that the counter-clockwise rotation of the two subunits in the first mode (with 36% contribution) is coupled to the loop closure event (Figure S1A). The high correlation between the first three modes from EDA and ANM of TcTIM further support the robustness of the results (Figure S1B–C). These findings consistently highlight the coupling between the global counter-rotation of the subunits in the first mode and the loop closure.
Other enzymes
The results for other proteins are displayed in Figures 6 and S5. Additional data provided in Tables 1 and 2, Figure 2, Tables S1, S2 in Text S1 and Figures S2, S3, S4 essentially consolidate the results described in detail for the four cases.
Discussion
Proteins undergo a broad range of motions under physiological conditions, spanning from local to global changes in conformations. Among them, the most probable motions, also known as the softest modes, are usually highly collective, i.e., they drive the cooperative motions of entire domains/subunits [10], [33]. Many activities of proteins are achieved, on the other hand, by relatively localized motions, such as loop reconfigurations that accompany ligand binding. A common behavior in all enzymes studied here was the occurrence of the catalytic loop reconfiguration based on the available apo and bound structures. This observation has commonly leaded to the hypothesis that loop motions are triggered by ligand binding.
Given that loop motions are not collective in nature, but seemingly confined to short segments on the backbone, they might be attributed to local, rather than global dynamics. Many studies focused on such ‘regions of interest’ implicitly assuming that the loop reconfiguration observed is predominantly determined by local interactions. Our analysis demonstrates, however, that the local conformational changes observed in experiments at functional loops are not independent of the soft modes of motions intrinsically favored by the architecture. On the contrary, at least in the examined dataset, the soft modes do contribute (more than local high frequency modes) to the reconfiguration of the loops along directions stabilized upon ligand binding.
The top-ranking ANM modes are by definition collective modes of motions known to be highly robust against sequence and structure variations. The correlation between experimentally observed structural changes at the catalytic loops and these modes suggests the evolution of the enzymatic architecture to facilitate the predisposition of the catalytic loop to enzymatic activity. Our previous and current MD simulations on TIM from two different species consistently indicate high mobility and almost full opening/closure of loop 6 in both subunits of the homodimer. In contrast, only half-closure and restricted mobility is observed for the WPD loop during PTP simulations. Evidently, there are other factors that also affect catalytic loop dynamics in terms of reaching the closed state, or the state ‘pre-disposed’ to catalytic activity. One factor may be the favorable electrostatic interactions provided by the substrate (not included in our simulations). Another factor proposed to facilitate loop closure is the presence of the conserved, glycine-rich loops interacting with the active-site loop in previous MD simulations on enolase, β 1,4-galactosyltransferase and lipase [34], [35].
It is important to note that our study does not contradict the critical role of electrostatic interactions in catalysis pointed out earlier [5]–[7], and in fact, our earlier work [36] showed that the catalytic site, once assuming the ‘active’ conformation, is mechanically constrained to maintain its precise geometry required for chemical reactivity. On the other hand, conformational flexibility comes into play, and plays an important role to our view, in facilitating the binding of the substrate, and in favoring the reconfiguration of the active site into its form prone to catalysis, hence the significant role of conformational flexibility in accomplishing catalysis observed in previous work [1]–[4]. In a sense, the structure-encoded flexibility, or the suitable reorientation of the catalytic loop (as shown here to be favored by intrinsic collective motions) is a prerequisite for the ensuing catalytic activity which requires the appropriate chemical (and, in particular, electrostatic) organization.
It is worth noting that the ANM modes are purely based on native contact topology, or geometry. No residue-specific interactions are taken into consideration. The collective dynamics is essentially controlled by uniform spring-like potentials; and these potentials in turn account for the Gaussian fluctuations/distributions of inter-residue distances- the underlying assumption of the theory of elastic networks, as originally set forth for polymer networks [37]. As such, the directions of motions predicted by the ANM are those favored by elastic entropic effects (for a recent review see ref [10]), and the structural changes initiated/favored by these entropic effects are likely to be complemented by enthalpic effects, including in particular electrostatic interactions with the bound ligand to shape and stabilize the final closed conformer. Yet, the he correlation with experimentally observed deformations suggests that these entropic effects play a significant role in defining the accessible mechanisms of ligand binding.
Methods
PCA of experimental structural data
The experimental data for each protein composed of N residues are generated as follows: (i) the ensemble of structures is superimposed using an iterative Kabsch algorithm (see SI), (ii) mean positions <R i> = [<xi > <yi> <zi>]T are determined for α-carbons 1≤i≤N (or those residues with known coordinates), (iii) deviations from mean position, ΔR i s = [Δx i s Δyi s Δzi s]T (where Δx i s = x i s−<xi>) are organized in a 3N-dimensional deformation vector ΔR s for each structure s in the ensemble; (iv) the cross-correlations between these deviations, averaged over the entire set are written in a 3N×3N covariance matrix C (exp), and (v) C (exp) is diagonalized to determine the principal modes of structural variations, where p (i) p (i) T. Here p (i) and σi, are the respective i th eigenvector and eigenvalue, and m is the number of structures resolved for the studied protein. The principal modes are rank-ordered: PC1 (p (1)) refers to the direction of maximal variance, succeeded by PC2, etc. The fractional contribution of p (i) to structural variance is given by fi = σi/Σ j σj where the summation is performed over all modes.
ANM analysis
The Hessian matrix, H, forms the basis of ANM approach. H can be written in terms of N×N submatrices, H (ij), each of size 3×3, given by
for , and . Here is the magnitude of the distance vector R ij 0 between α-carbons i and j (observed in the PDB), and , , and are the components. is the ijth element of the Kirchhoff matrix equal to 1 if i and j are connected (within a cutoff distance of rcut) in the network, 0 otherwise. A uniform force constant, γ, is used for all pairwise interactions. H decomposed into 3N-6 nonzero eigenvalues λi, and corresponding eigenvectors u (i), as u (i) u (i) T. ANM covariance is C ANM = H −1, where H −1 is pseudo inverse, such that 1/λ1 is the counterpart of the PCA σ1, and u (i) is the counterpart of p (i).
Overlap definitions
The overlap between PCA and ANM modes is given by the absolute value of the correlation cosine Oij = |p (i). u (j)| [9]. For enzymes with two available structures only, p (1) is equivalent to the 3N-dimensional deformation vector (normalized) between open and closed crystal structures (Table S1 in Text S1). Cumulative overlap is defined as [38]. Note that CO1J = 1 for J = 3N-6, i.e., the 3N-6 ANM eigenvectors form a complete set of orthonormal basis vectors.
The orientational correlation between the s-residue long loop motions experimentally observed and computationally predicted is measured by the overlap between the loop elements (3s-dimensional subvectors) of p (1)and u (j). We further define the weighted-average overlap for any segment of length s based on p modes.
(1) |
This definition takes account of the magnitudes of loop motions, in addition to their orientations. Calculations were performed using the software package ProDy [39].
MD and EDA
PTP was simulated for 50 ns in explicit TIP3 [40] water using NAMD [41] with CHARMM force field [42] (see SI for details). Langevin dynamics and Langevin piston Nose-Hoover [43], [44] methods were used to keep the temperature and pressure constant at 300 K and 1 atm. EDA [14] was performed after iterative superposition of the MD trajectory onto the crystal structure. TcTIM simulations were performed using AMBER [45], [46] with the ff03 force field parameters [47], and the protocol described in previous work [13].
Supporting Information
Acknowledgments
PD thanks OT Turgut for helpful discussions on the subject.
Funding Statement
Support from NIH grants # 1R01GM099738-01 and # 5R01GM086238-04 is gratefully acknowledged by IB. PD and ZK acknowledge partial support from DPT grant # 2009k120520 and the Betil fund. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Henzler-Wildman KA, Thai V, Lei M, Ott M, Wolf-Watz M, et al. (2007) Intrinsic motions along an enzymatic reaction trajectory. Nature 450: 838–844. [DOI] [PubMed] [Google Scholar]
- 2. Agarwal PK, Billeter SR, Rajagopalan PT, Benkovic SJ, Hammes-Schiffer S (2002) Network of coupled promoting motions in enzyme catalysis. Proc Natl Acad Sci U S A 99: 2794–2799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Benkovic SJ, Hammes-Schiffer S (2003) A perspective on enzyme catalysis. Science 301: 1196–1202. [DOI] [PubMed] [Google Scholar]
- 4. Bhabha G, Lee J, Ekiert DC, Gam J, Wilson IA, et al. (2011) A dynamic knockout reveals that conformational fluctuations influence the chemical step of enzyme catalysis. Science 332: 234–238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Olsson MH, Parson WW, Warshel A (2006) Dynamical contributions to enzyme catalysis: critical tests of a popular hypothesis. Chem Rev 106: 1737–1756. [DOI] [PubMed] [Google Scholar]
- 6. Warshel A, Sharma PK, Kato M, Xiang Y, Liu H, et al. (2006) Electrostatic basis for enzyme catalysis. Chem Rev 106: 3210–3235. [DOI] [PubMed] [Google Scholar]
- 7. Adamczyk AJ, Cao J, Kamerlin SCL, Warshel A (2011) Catalysis by dihydrofolate reductase and other enzymes arises from electrostatic preorganization, not conformational motions. Proc Natl Acad Sci U S A 108: 14115–14120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Torbeev VY, Raghuraman H, Hamelberg D, Tonelli M, Westler WM, et al. (2011) Protein conformational dynamics in the mechanism of HIV-1 protease catalysis. Proc Natl Acad Sci U S A 108: 20982–20987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Tama F, Sanejouand YH (2001) Conformational change of proteins arising from normal mode calculations. Protein Eng 14: 1–6. [DOI] [PubMed] [Google Scholar]
- 10. Bahar I, Lezon TR, Yang LW, Eyal E (2010) Global dynamics of proteins: bridging between structure and function. Annual Rev Biophys 39: 23–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Bakan A, Bahar I (2009) The intrinsic dynamics of enzymes plays a dominant role in determining the structural changes induced upon inhibitor binding. Proc Natl Acad Sci U S A 106: 14349–14354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Kurkcuoglu O, Jernigan RL, Doruker P (2006) Loop motions of triosephosphate isomerase observed with elastic networks. Biochemistry 45: 1173–1182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Cansu S, Doruker P (2008) Dimerization affects collective dynamics of triosephosphate isomerase. Biochemistry 47: 1358–1368. [DOI] [PubMed] [Google Scholar]
- 14. Amadei A, Linssen AB, Berendsen HJ (1993) Essential dynamics of proteins. Proteins 17: 412–425. [DOI] [PubMed] [Google Scholar]
- 15. Doruker P, Atilgan AR, Bahar I (2000) Dynamics of proteins predicted by molecular dynamics simulations and analytical approaches: application to a-amylase inhibitor. Proteins 40: 512–524. [PubMed] [Google Scholar]
- 16. Atilgan AR, Durell SR, Jernigan RL, Demirel MC, Keskin O, et al. (2001) Anisotropy of fluctuation dynamics of proteins with an elastic network model. Biophys J 80: 505–515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Williams JC, McDermott AE (1995) Dynamics of the flexible loop of triosephosphate isomerase: the loop motion is not ligand gated. Biochemistry 34: 8309–8319. [DOI] [PubMed] [Google Scholar]
- 18. Skliros A, Zimmermann MT, Chakraborty D, Saraswathi S, Katebi AR, et al. (2012) The importance of slow motions for protein functional loops. Phys Biol 9: 014001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, et al. (2000) The Protein Data Bank. Nucleic Acids Res 28: 235–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Tabernero L, Aricescu AR, Jones EY, Szedlacsek SE (2008) Protein tyrosine phosphatases: structure-function relationships. FEBS J 275: 867–882. [DOI] [PubMed] [Google Scholar]
- 21. Schubert HL, Fauman EB, Stuckey JA, Dixon JE, Saper MA (1995) A ligand-induced conformational change in the Yersinia protein tyrosine phosphatase. Protein Sci 4: 1904–1913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Stuckey JA, Schubert HL, Fauman EB, Zhang ZY, Dixon JE, et al. (1994) Crystal structure of Yersinia protein tyrosine phosphatase at 2.5 A and the complex with tungstate. Nature 370: 571–575. [DOI] [PubMed] [Google Scholar]
- 23. Kamerlin SC, Rucker R, Boresch S (2007) A molecular dynamics study of WPD-loop flexibility in PTP1B. Biochem Biophys Res Commun 356: 1011–1016. [DOI] [PubMed] [Google Scholar]
- 24. Kamerlin SC, Rucker R, Boresch S (2006) A targeted molecular dynamics study of WPD loop movement in PTP1B. Biochem Biophys Res Commun 345: 1161–1166. [DOI] [PubMed] [Google Scholar]
- 25. Juszczak LJ, Zhang ZY, Wu L, Gottfried DS, Eads DD (1997) Rapid loop dynamics of Yersinia protein tyrosine phosphatases. Biochemistry 36: 2227–2236. [DOI] [PubMed] [Google Scholar]
- 26. Hu X, Stebbins CE (2006) Dynamics of the WPD loop of the Yersinia protein tyrosine phosphatase. Biophys J 91: 948–956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Klimasauskas S, Kumar S, Roberts RJ, Cheng X (1994) HhaI methyltransferase flips its target base out of the DNA helix. Cell 76: 357–369. [DOI] [PubMed] [Google Scholar]
- 28. Luo J, Bruice TC (2005) Low-frequency normal mode in DNA HhaI methyltransferase and motions of residues involved in the base flipping. Proc Natl Acad Sci U S A 102: 16194–16198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. O'Gara M, Zhang X, Roberts RJ, Cheng X (1999) Structure of a binary complex of HhaI methyltransferase with S-adenosyl-L-methionine formed in the presence of a short non-specific DNA oligonucleotide. J Mol Biol 287: 201–209. [DOI] [PubMed] [Google Scholar]
- 30. Bruschweiler R (1995) Collective protein dynamics and nuclear spin relaxation. J Chem Phys 102: 3396–3403. [Google Scholar]
- 31. Miller BG, Hassell AM, Wolfenden R, Milburn MV, Short SA (2000) Anatomy of a proficient enzyme: the structure of orotidine 5′-monophosphate decarboxylase in the presence and absence of a potential transition state analog. Proc Natl Acad Sci U S A 97: 2011–2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Hur S, Bruice TC (2002) Molecular dynamic study of orotidine-5′-monophosphate decarboxylase in ground state and in intermediate state: a role of the 203–218 loop dynamics. Proc Natl Acad Sci U S A 99: 9668–9673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Bahar I, Lezon TR, Bakan A, Shrivastava IH (2010) Normal mode analysis of biomolecular structures: functional mechanisms of membrane proteins. Chem Rev 110: 1463–1497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Gunasekaran K, Ma B, Nussinov R (2003) Triggering loops and enzyme function: identification of loops that trigger and modulate movements. J Mol Biol 332: 143–159. [DOI] [PubMed] [Google Scholar]
- 36. Yang LW, Bahar I (2005) Coupling between catalytic site and collective dynamics: A requirement for mechanochemical activity of enzymes. Structure 13: 893–904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Flory PJ (1976) Statistical Thermodynamics of Random Networks. Proc R Soc London 351: 351–380. [Google Scholar]
- 38. Yang L, Song G, Carriquiry A, Jernigan RL (2008) Close correspondence between the motions from principal component analysis of multiple HIV-1 protease structures and elastic network modes. Structure 16: 321–330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Bakan A, Meireles LM, Bahar I (2011) ProDy: Protein Dynamics Inferred from Theory and Experiments. Bioinformatics 27: 1575–1577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Jorgensen WL, Chandrasekhar J, Madura JD, Imprey RW, Klein ML (1985) Comparison of simple potential functions for simulating liquid water. J Chem Phys 79: 926. [Google Scholar]
- 41. Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, et al. (2005) Scalable molecular dynamics with NAMD. J Comput Chem 26: 1781–1802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, et al. (1983) CHARMM: A Program for Macromolecular Energy, Minimization, and Dynamics Calculations. J Comput Chem 4: 187–217. [Google Scholar]
- 43. Feller SE, Zhang Y, Pastor RW, Brooks BR (1995) Constant pressure molecular dynamics simulation: The Langevin piston method. J Chem Phys 103: 4613. [Google Scholar]
- 44. Martyna GJ, Tobias DJ, Klein ML (1994) Constant pressure molecular dynamics algorithms. J Chem Phys 101: 4177. [Google Scholar]
- 45. Case DA, Cheatham TE, Darden T, Gohlke H, Luo R, et al. (2005) The Amber biomolecular simulation programs. J Comput Chem 26: 1668–1688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Case DA, Darden T, Cheatham TE, Simmerling C, Wang J et al. (2004) AMBER, version 8 [computer program]. San Francisco: University of California.
- 47. Duan Y, Wu C, Chowdhury S, Lee MC, Xiong GM, et al. (2003) A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations. J Comput Chem 24: 1999–2012. [DOI] [PubMed] [Google Scholar]
- 48. O'Gara M, Klimasauskas S, Roberts RJ, Cheng X (1996) Enzymatic C5-cytosine methylation of DNA: mechanistic implications of new crystal structures for HhaL methyltransferase-DNA-AdoHcy complexes. J Mol Biol 261: 634–645. [DOI] [PubMed] [Google Scholar]
- 49. Chan KK, Wood BM, Fedorov AA, Fedorov EV, Imker HJ, et al. (2009) Mechanism of the orotidine 5′-monophosphate decarboxylase-catalyzed reaction: evidence for substrate destabilization. Biochemistry 48: 5518–5531. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.