Abstract
We compare various predicted mechanical and thermodynamic properties of nine oxidized thioredoxins (TRX) using a Distance Constraint Model (DCM). The DCM is based on a nonadditive free energy decomposition scheme, where entropic contributions are determined from rigidity and flexibility of structure based on distance constraints. We perform averages over an ensemble of constraint topologies to calculate several thermodynamic and mechanical response functions that together yield quantitative stability/flexibility relationships (QSFR). Applied to the TRX protein family, QSFR metrics display a rich variety of similarities and differences. In particular, backbone flexibility is well conserved across the family, whereas cooperativity correlation describing mechanical and thermodynamic couplings between residue pairs exhibit distinctive features that readily standout. The diversity in predicted QSFR metrics that describe cooperativity correlation between pairs of residues is largely explained by a global flexibility order parameter describing the amount of intrinsic flexibility within the protein. A free energy landscape is calculated as a function of the flexibility order parameter, and key values are determined where the native-state, transition-state and unfolded-state are located. Another key value identifies a mechanical transition where the global nature of the protein changes from flexible to rigid. The key values of the flexibility order parameter help characterize how mechanical and thermodynamic response is linked. Variation in QSFR metrics, and key characteristics of global flexibility are related to the native state x-ray crystal structure primarily through the hydrogen bond network. Furthermore, comparison of three TRX redox pairs reveals differences in thermodynamic response (i.e., relative melting point) and mechanical properties (i.e., backbone flexibility and cooperativity correlation) that are consistent with experimental data on thermal stabilities and NMR dynamical profiles. The results taken together demonstrate that small-scale structural variations are amplified into discernible global differences by propagating mechanical couplings through the H-bond network.
Keywords: thioredoxin, quantified stability/flexibility relationships, hydrogen bond network, molecular cooperativity, free energy landscape, rigidity
INTRODUCTION
Thioredoxin (TRX) is a small, ~110 residue α/β protein found in all organisms [1]. Its intracellular function is to regulate the oxidative state of target proteins by maintaining them in reduced form, while itself becoming oxidized through formation of a disulfide bond. The cysteine residues of the disulfide occur in a conserved C-X-X-C motif found in thioredoxins and other thioredoxin fold members. The greater stability of oxidized TRX emphasizes its intracellular role in maintaining substrate proteins in their reduced form [2]. Extensively studied for decades (see [3, 4] for two recent reviews), many TRX sequences and structures are available. TRX influences a wide variety of physiological functions due to its pivotal role in regulating cellular redox balance [5]. As a consequence, TRX is associated with a number of human diseases, including: cancer, cardiac disease, viral disease, etc. In recent years, TRX is receiving increasing attention for its therapeutic potential as a regulator of processes such as cell growth, apoptosis, and inflammation [6].
Here, we compare relationships between stability and flexibility metrics across the TRX family using a Distance Constraint Model (DCM). Various interactions, such as covalent bonds, hydrogen bonds (H-bonds) and local residue conformational states are modeled as a network of distance constraints, where each distance constraint is assigned an energy and entropy contribution. Statistical mechanics is employed to calculate thermodynamic properties over an ensemble of distance constraint networks (constraint topologies). The ensemble of constraint topologies represents all possible structural conformations ranging from native state fluctuations, partial unfolded states, all the way through to the unfolded state. For each constraint topology: Total enthalpy is given as the sum over enthalpy contributions from each distance constraint present, and, a good estimate for total entropy is given as a sum over entropy contributions from independent distance constraints that are identified using graph-rigidity algorithms [7–15]. In addition to thermodynamic properties, mechanical properties (i.e., local flexibility/rigidity profiles, correlated motions, etc.) are readily calculated. Due to the extraordinarily large number of accessible constraint topologies, a hybrid method combining Monte Carlo sampling with a mean field approximation is employed [7, 13].
The results presented here are based on a minimal distance constraint model (mDCM) that we describe briefly (see [13, 14] for a complete description). Accessible constraint topologies are limited to perturbations away from the native protein structure. In other words, non-native contacts are not considered, as is commonly done in Go-like models [17–19], COREX [20, 21], and other Ising-like models [22, 23]. Covalent bonds are modeled as distance constraints, but since they do not break and form, their enthalpy and entropy contributions factor out and play no role in thermodynamic response. In contrast, H-bonds and salt-bridges can break and form, and local conformational states of residues defined by internal dihedral angles fluctuate. These fluctuating noncovalent interactions govern the interesting thermodynamic and mechanical response. The backbone and side-chain dihedral angles that define the conformational state of a residue are modeled to be either native-like or disordered. The distance constraints that are used to enforce native-like or disordered dihedral angle basins are called torsion constraints. The characteristics of all native torsion constraints are considered to be the same, independent of residue type. Likewise, all disordered torsion constraints share the same characteristics. The mDCM has three adjustable parameters determined by fitting to experimental data (heretofore, heat capacity, Cp, curves). The mDCM was previously employed to identify conserved stability/flexibility relationships across a mesophilic/thermophilic RNase H ortholog pair [16], and its predictions were markedly consistent with earlier experimental conclusions [24]. The mDCM was also used to understand fragment stability in E. coli TRX, and it reconciled contradictory experimental descriptions of TRX folding [15]. More recently, the mDCM has been used to highlight the importance of the hydrogen bond network in four homologous periplasmic binding proteins [25]. Despite its simplicity, the mDCM has provided key insight into the interplay between stability and flexibility for a diverse set of proteins.
In this report, we expand upon our earlier work on E. coli TRX by comparing various QSFR metrics across a family of nine different oxidized TRX homologs spanning from prokaryota to human. In addition, three reduced TRX structures are also considered. Concentrating on the group of nine well-conserved structures of the same function, we determine, for the first time, how conserved (or diverse) the various QSFR properties are across the TRX family. To circumvent finding three adjustable parameters per protein, we transfer the same three parameters obtained from the E. coli TRX to all other members. With this fixed parameterization, we verified the mDCM correctly predicts all TRX homologs to have a two-state folding/unfolding transition, and the single peak in Cp is used to define the melting temperature, Tm. We find the native state backbone flexibility at respective Tm for each protein is well conserved. Surprisingly, QSFR metrics involving residue-residue correlations within the native state at respective Tm show significant distinguishing features across the family. Several model predictions are supported by experimental conclusions; however, the scarcity of biophysical characterizations across the family limits our ability to corroborate the full set of predictions. Intriguingly, most of the QSFR diversity revealed here is explained by the relative locations of key points in the flexibility order parameter that characterize the free energy landscape, and the mechanical transition between globally rigid and flexible. Remarkably, the observed QSFR variations, including these key points, do not correlate to global structural similarity measures, such as the root mean squared deviation (RMSD) in atomic positions. Moreover, the mDCM predictions of mechanical and thermodynamic response cannot be explained from small differences in global properties of the H-bond network. In fact, even the most similar TRX pair in regards to the H-bond network reveals substantial QSFR variance due to just a few critical H-bonds. This result highlights the sensitivity of QSFR analysis to the detailed properties of the underlying H-bond network.
RESULTS AND DISCUSSION
Thermodynamic properties
Published Cp curves for two (E. coli and S. aureus) oxidized TRX homologs are currently available [26, 27]. Using our previously described simulated annealing procedure [15], best-fit parameters, {u, v, δnat}, have been determined for each. The two best-fit parameter sets, which are qualitatively similar, are provided in Table 2. Crossing the best-fit parameters, meaning applying the E. coli parameters to S. aureus and vice versa, causes the melting temperature, Tm, to be downshifted a small amount; however, a two-state transition and single Cp peak is maintained (see Figure 1a). Conservation of a cooperative transition indicates that the mDCM provides reasonable thermodynamic descriptions; even when less than perfect parameters are applied. Consequently, to eliminate arbitrariness, we subsequently apply the E. coli best-fit parameters to the other TRXs considered here.
Table 2.
E. coli | S. aureus | |
---|---|---|
u | −2.24 | −2.44 |
v | −0.89 | −0.95 |
δnat | 0.97 | 1.48 |
The units of u and v are kcal/mol, whereas δnat is a dimensionless pure entropy.
Application of the E. coli best-fit parameters to all TRXs results in two-state transitions and a single peak in Cp (Figure 1b). The assumption of parameter transferability is reasonable given that DCM parameters are constructed to have physical meaning, and thus such values should be more or less constant across the relatively homogeneous family. Sequence and structure conservation across the TRX family has been appreciated for some time [28]. Previously, we have used the same strategy on four bacterial periplasmic binding proteins (bPBPs). In the bPBP work, we demonstrated that the height of predicted Cp curves, called Cpmax, are linearly related to the number of H-bonds within the structure (R = 0.90). Interestingly, this is not the case here (R = 0.12). This result is naively counterintuitive. One might expect greater correlation between a global H-bond characteristic and thermodynamic descriptions in more conserved families. However, just the opposite is found. When H-bond networks are similar, there is not enough diversity present to correlate to differences in Cpmax. In contrast, the large variance found in the H-bond network within the bPBP family did manifest as the dominant factor governing Cpmax. For example, the number of H-bonds varies between 293 and 504 within the bPBP dataset, whereas the variance in H-bond number within the TRX family is only 18. A similar decrease in correlation between number of H-bonds and Tm is found. The number of H-bonds was more correlated to Tm in our previous investigation across the bPBP family (R = 0.69) than that across the TRX family (R = 0.41). In fact, no single structural parameter identified here is strongly correlated to TRX Cpmax. These results highlight how subtle variations within the H-bond network and the strength of individual H-bonds can result in pronounced and unexpected changes in thermodynamic and/or mechanical response. This high level of sensitivity is rooted in the nonadditive properties of the free energy decomposition [13–16].
The two-state nature of the transition is identified using the free energy landscape. While the free energy landscape is initially computed as a function of the number of native torsion constraints and number of H-bonds, as seen in Eq. (1), it is convenient to express the free energy as a function of the global flexibility order parameter, θ. The global flexibility order parameter is defined by: θ(Nnt, N hb) ≡ Idis(Nnt, Nhb)/n, where Idis(Nnt, Nhb) is the average number of independent disordered constraints and n is the number of residues within the protein. Figure 2a plots a typical free energy landscape, G(T,θ). At T=Tm, all free energy landscapes include two minima, which highlight the native and unfolded metastable states, and a straddling free energy barrier, Gbar. Table 3 underscores that there is much variability within Gbar,, whereas the location of key points along the one-dimensional free energy landscape are much more conserved. The height of the free energy barrier is not correlated with any structural input; however, it is strongly correlated (R = 0.89) with Cpmax. A similar result was observed in the bPBP family.
Table 3.
Source | Tm1 | Cpmax2 | Hyst.3 | Gbar4 | θnat5 | θTS | θ unf | θRP |
---|---|---|---|---|---|---|---|---|
E. coli | 359 | 13.9 | 10.0 | 0.6 | 0.9 | 1.8 | 2.2 | 1.2 |
S. aureus | 343 | 11.3 | 9.9 | 1.2 | 1.2 | 1.6 | 2.1 | 1.2 |
Spinach-m cholorplast | 349 | 11.1 | 18.8 | 1.0 | 1.3 | 1.7 | 2.2 | 1.3 |
Anabaena sp. | 366 | 25.3 | 62.8 | 5.4 | 1.0 | 1.6 | 2.4 | 1.4 |
Human mitochondria | 376 | 21.3 | 61.0 | 2.5 | 1.1 | 1.7 | 2.2 | 1.3 |
C. reinhardtii cytosol | 385 | 20.4 | 67.9 | 5.1 | 1.1 | 1.7 | 2.3 | 1.3 |
Spinach-f cholorplast | 366 | 7.9 | 9.0 | 0.4 | 1.5 | 1.8 | 2.3 | 1.4 |
Fruit fly cytosol | 379 | 15.6 | 30.0 | 1.9 | 1.3 | 1.9 | 2.5 | 1.4 |
Human cytosol | 369 | 19.9 | 52.9 | 3.6 | 1.2 | 1.7 | 2.4 | 1.3 |
Average | 365.8 | 16.3 | 35.8 | 2.4 | 1.2 | 1.7 | 2.3 | 1.3 |
Standard deviation | 13.7 | 5.8 | 25.2 | 1.9 | 0.2 | 0.1 | 0.1 | 0.1 |
Melting temperature (K).
Peak height of the heat capacity curve in units of kcal/mol·K.
Hysteresis temperature range (K).
The height of the straddling free energy barrier (units are kcal/mol) separating the native and unfolded basins within the one-dimensional free energy landscapes.
Key points along the one-dimensional free energy landscape at the respective Tm are provided: θnat ≡ minimum of the native free energy basin, θTS ≡ location of the transition state barrier, θunf ≡ the minimum of the unfolded free energy basin, and θRP ≡ rigid cluster percolation threshold.
Hysteresis is identified when both the native and unfolded states are present within the ensemble. Previously [16], we have used the hysteresis temperature range to help explain the varying mechanisms of thermal denaturation between E. coli and T. thermophilus RNase H orthologs. The sizes of the TRX hysteresis temperature ranges are provided in Table 3, which are strongly correlated to Gbar and Cpmax (R = 0.93 and 0.81, respectively). These relationships are not surprising because they are related to ensemble populations in similar way. Specifically, Cp is a direct measure of energetic fluctuations across the ensemble, Gbar characterizes the kinetics of the native → unfolded transition, and hysteresis highlights the temperature regime in which the transitions occur. Our previous comparison of the orthologous RNase H pair revealed that native state energy, at Tm, is quantitatively conserved, whereas native state entropy was much more variable. We find the same trend here. The standard deviation within H(Tm, θnat) is 24.2 kcal/mol, whereas the standard deviation within -TmS(Tm, θnat) is nearly twice that (48.1 kcal/mol). This result is due to the nonadditive nature of component entropies within the DCM and highlights how global properties nontrivially emerge from local variations with the network topology. For example, while H(Tm, θnat) is strongly correlated to total H-bond energy (R = 0.89), -TS(Tm, θnat) is not (R = 0.28). (Note that H(Tm, θnat) is not perfectly correlated to total H-bond energy due to the presence of native torsion energies and the fact that competing H-bonds to solvent vary as a percentage of the parent intramolecular H-bond.)
Characterizing the mechanical transition
As the flexibility order parameter increases, the protein transitions from a native to unfolded state. The one-dimensional free energy landscapes characterize the thermodynamic aspects of the transition (Figure 2a). Similar to how heat capacity is be used to locate the thermodynamic melting temperature, the rigid cluster susceptibility, RCS, characterizes a mechanical transition that describes the protein changing between being globally rigid to flexible (Figure 2b). Specifically, RCS quantifies the amount of rigid cluster size fluctuations occurring across the ensemble (see [14] for details). At low θ, the protein is principally exploring the native state. The native state is predominately composed of a single large rigid cluster, plus many attached small rigid clusters forming non-cooperative flexible sidechains. At large θ, the protein is primarily exploring the unfolded state, which is composed of many disjoint small clusters. The largest rigid clusters at large θ generally come from α-helices that remain intact due to their intrinsic stability. At both extremes, RCS is low due to lack of fluctuations. However, at intermediate values of θ, the RCS peak signifies the point, called the rigidity percolation threshold (θRP), where there is maximum rigid cluster size fluctuation (see Figure 2b). The rigidity threshold corresponds to a point where the protein transitions mechanically from a globally rigid to flexible structure, or vice versa. Figure 3 compares RCS for all nine oxidized TRXs. While the location of θRP is somewhat conserved, there are significant differences within the amount of fluctuations (characterized by RCS peak height and width) occurring across the ensemble. No correlation to structural input is found for either RCS peak height or width. However, RCS peak height is correlated to Cpmax (R = 0.71). Again, this result is not surprising as both metrics quantify fluctuations, albeit from fundamentally different points of view, across the ensemble. This result is extremely important because it highlights how the underlying ensemble of constraint topologies fundamentally couples the mechanical and thermodynamic descriptions.
The θRP values are provided in Table 3. As with key points along the free energy landscape, no global correlation to structure input is observed. However, interesting properties emerge when comparing θRP to key points along the one-dimensional free energy landscape. Previously [14], we have compared θRP to θTS in order to assess the compactness of the transition state. Across the nine oxidized TRXs, θRP < θTS, meaning the TRX transition states are voluminous since the mechanical transition precedes the thermodynamic transition. In fact, the mechanical transition actually precedes the location of the native state basin in two cases. Consequently, the native states, in addition to the transition states, of these two TRX homologs are also rather voluminous. Figure 4 highlights the relationship between the mechanical and thermodynamic transitions in each of the nine oxidized homologs. The native state of TRX-2 from Anabaena sp. is predicted to be extremely compact (θRP ≫ θnat), which results in noteworthy mechanical linkage properties. At the other extreme, the native state of TRX-f from spinach chloroplast is predicted to be voluminous (θRP < θnat), which, in turn, results in clearly distinct mechanical linkage properties. The key points that characterize certain types of mechanical and thermodynamic response, when taken together lend themselves for defining a global flexibility signature for a protein, discussed further below.
Backbone flexibility
A QSFR flexibility metric, Findex, characterizes backbone flexibility. Positive Findex values reflect the amount of excess degrees of freedom in flexible regions, and negative values reflect the amount of excess constraints in rigid regions (see [25] for an exact definition). In our earlier investigation of bPBPs, we found that Findex is largely conserved across that family. Plotting Findex against the aligned backbone position for the nine oxidized TRXs reveals Findex conservation is even greater across the TRX family. The general conservation of backbone flexibility is highlighted in Figure 5, where the multiple alignment has been color-coded by Findex (red = flexible; blue = rigid) using TEXshade [31]. Across the alignment, secondary structure elements appear more rigid, whereas intervening loop regions are more flexible. In all cases, the N- and C-terminus have high flexibility, and the flexibility of the N-terminus extends through strand β1. All other B-strands are marginally rigid, while there is considerable variation in the α-helices. Helices α1 and α4 are among the most rigid portions of the protein, whereas the short helix α3 is somewhat flexible. The long helix α2, containing the conserved active site sequence WCGPC, displays the most intriguing behavior. Tracing from the N-terminus, the tryptophan is flexible, yet the redox-active cysteines are rigid. Immediately after the active site, there is a curious stretch of flexibility within α2 from all but two of the TRX homologs. For the most part, the remainder of the helix is very rigid. Ribbon diagrams (Figure 6) highlight the active site differences across all nine homologs, while also displaying an overall conservation of flexibility/rigidity profiles.
Exceptions to the above canonical rigidity/flexibility profiles are: (i.) the lack of the small flexible region within TRX-2 from Anabaena sp. and human mitochondria, (ii.) the lack of flexibility within the coil region connecting the 3–10 helix and strand β4 in the Anabaena sp. homolog, and (iii.) decreased rigidity of the c-terminal end of helix α2 within TRX-f from spinach chloroplast, TRX from E. coli, and TRX from S. aureus. The increased rigidity of TRX-2 from Anabaena sp. and the diminished rigidity within the spinach chloroplast and S. aureus homologs follow along with naïve expectations based on the above θRP versus θnat comparisons. On the other hand, the diminished rigidity of TRX from E. coli is puzzling as this trend is actually counter the θRP versus θnat comparisons. Finally, the θRP versus θnat comparisons do not suggest anything unique about human mitochondria TRX-2. Nevertheless, like TRX-2 from Anabaena sp., it too lacks the flexible region within α2 at the active site. It is worth noting that these are the only two TRX-2 orthologs within the dataset, which may explain their similarity in this regard. Another interesting variation occurs within the active site flexibility of the two TRX homologs from spinach chloroplast. The active site tryptophan of TRX-f is predicted to be more flexible than that of TRX-m, with Findex values of 0.39 vs. −0.12. Encouragingly, the same result had been seen in previous crystallographic analysis, where the flexibility of the active site tryptophan in TRX-f was seen as a possible factor in explaining the different substrate specificities of the two spinach TRXs [32].
Cooperativity correlation
Cooperativity correlation plots describe correlations between a pair of residues in the native state at the respective Tm that are either rigidity correlated, flexibly correlated, or not correlated. Blue regions correspond to rigidity correlation, giving the extent that two residues simultaneously fall within the same rigid cluster. Red regions correspond to flexibility correlation, giving the extent that a pair of residues is simultaneously flexible within a path that flexibility can propagate. White regions indicate no discernable mechanical coupling between the two residues. In our investigation of bPBPs, we observed a large amount of variation within cooperativity correlation. While bPBP backbone flexibility was generally conserved, bPBP cooperativity correlation was not. The variability within cooperativity correlation was ultimately explained by key differences within the H-bond network, even after normalizing for the different sizes of the proteins. Because the TRX H-bond network is much more conserved, we initially expected QSFR metrics to exhibit conserved properties. Surprisingly, we observe considerable diversity and richness in cooperativity correlation, as shown in Figure 7, for all nine oxidized TRX proteins. To quantify the degree of similarity and differences, all 36 pairwise correlations coefficients were calculated. The average Pearson pairwise correlation coefficient over all 36 pairs is 0.68 (standard deviation = 0.10). The all-to-all pairwise statistical analysis reveals that while some local patterns are well conserved, as was also the case in the bPBP family, certain features of each protein are quite distinct. Some interesting specific details on the similarities and differences are now discussed.
The large amount of blue color indicates that the native states of all TRX homologs are primarily composed of one large rigid cluster. However, the size and details of the rigid cluster are quite variable. For example, the largest rigid cluster in TRX-2 from Anabaena sp. covers, on average, residues 5–91. Across the other seven homologs, the largest rigid cluster is not contiguous in sequence. The large swaths of lighter color (and, at the extreme, red color) demarcate regions not included within the rigid cluster. Most often, it is the region spanning from helix α2 to strand β3 (~25–55) that is not included within the rigid core. Additionally, in all of the TRXs, the c-terminal helix α4 is disjoint from the rest of the network. While α4 is, in and of itself, mostly rigid (see Figure 5), it is generally not part of the core of the protein. However, we stress that the cooperativity correlation plots represent averages of mechanical couplings between two residues based on the ensemble of constraint topologies. As such, the extent of α4 separation from the core rigid cluster varies across the family. There is virtually no interaction of α4 and the core (as highlighted by the predominantly white band at the c-terminal end of Figure 7) within the human cytosol, human mitochondria, and spinach chloroplast (TRX-m) homologs. Conversely, there is noticeable interaction within Anabaena sp. TRX-2. Note that we have previously reported this separation of α4 from the core in our description of the E. coli homolog [15]. In fact, we demonstrated that cleaving the backbone just prior to the start of α4 was actually stabilizing due to conformational entropy effects.
The mostly rigidly correlated behavior of Anabaena sp. TRX-2 is consistent with the above θRP versus θnat analysis, indicating the native state is mainly a single large rigid cluster. Expanding this analysis to all nine homologs reveals a clear relationship between the relative positions of θRP and θnat, and the extent of cooperativity correlation. The Anabaena sp., E. coli, C. reinhardtii, human mitochondria, and human cytosol homologs are predicted to have the most compact native states, which is consistent with their structures being mostly rigidly correlated, θRP ≫ θnat. The native states of TRX from fruit fly cytosol and TRX-m from spinach chloroplast are also expected to be compact, θRP > θnat, but not as much as the previous five. A concomitant reduction of rigidity correlation is observed in these two. Finally, TRX from S. aureus and TRX-f from spinach chloroplast, which are identified as having voluminous native states (θRP < θnat), are the most flexibly correlated (an thus, the least rigidly correlated).
Similarity and differences within the cooperativity correlations are quantified by constructing a dendragram (see Figure 7) from a distance matrix based on the 36 pairwise comparisons introduced above (see Supplementary Table 1). Amazingly, with only a single exception, the clustering in Figure 7 is exactly consistent with the differences between θRP and θnat plotted in Figure 4. Specifically, the largest cluster of four TRX homologs, which includes Anabaena sp., C. reinhardtii, human cytosol, and human mitochondria, represent four (of the five) homologs with the largest difference between the thermodynamic and rigidity transitions, defined as θRP ≫ θnat. The two TRXs with the smallest positive difference, θRP > θnat, (namely, TRX-m from spinach chloroplast and TRX from fruit fly cytosol) cluster together, whereas the two TRXs with θRP < θnat (namely, TRX-f from spinach chloroplast and TRX from S. aureus) form a cluster distinct from the rest of the family. The sole outlier is the E. coli TRX, which clusters near the fruit fly TRX/spinach TRX-m cluster. By eye, it is clear that the large swath of reduced rigidity approximately centered on residue 40 that is present in three of the four TRXs in the largest cluster is absent within the E. coli homolog. Moreover, traces of the decreased rigidity about α4 within the E. coli homolog can be observed within the homologs it clusters with.
Identifying allostery using cooperativity correlation
It is evident that identifying relationships between the mechanical and thermodynamic transitions may have far-reaching consequences, including, connecting cooperativity correlation to allostery. Successful identification of allostery from cooperativity correlation data rest on the assumption that the intrinsic equilibrium fluctuations of the native state appropriately reflects the response of a protein to perturbation. This critical assumption is in the spirit of linear response theory, for which there is mounting evidence from computational [33] and experimental [34] studies that this assumption works well. Along this same line, we have demonstrated that the distribution of rigidity throughout a protein is a statistically significant determinant for long-range intramolecular couplings that exhibit nonadditivity in free energy changes within double mutant cycles [35]. Meaning, the intrinsic properties of mechanical response of a protein correlates to free energy couplings between two point mutations. Because a cooperativity correlation plot is based on an ensemble average of mechanical couplings, the “old” view of allostery [36] is captured as a series of concerted mechanical events propagating through the protein structure. Specifically, regions within the cooperativity correlation plots colored blue are mechanically coupled through rigidity, whereas mechanical pathways in which flexibility can propagate are colored red. The high density of constraints that defines a rigid cluster provides a network of interactions that physically couple all residue pairs within the cluster. However, the rigid regions are not necessarily static, as they themselves are allowed to fluctuate as constraints break and form.
The “new” [36] view of allostery (commonly called the population shift model [37–41]) is also captured because the appropriate Boltzmann factors are accounted for during ensemble averaging. In other words, the free energy basin (a local minimum) controls the most probable constraint topologies, and the fluctuations of constraints around the most probable topology unify mechanical and thermal response. For example, as the temperature is lowered or raised, constraints are more easily formed or broken, and the mechanical properties change. More interestingly, the temperature dependence for a constraint to break (or form) strongly depends on all other constraints (i.e. their strength and where they are distributed) because network rigidity is a long-range interaction. Consequently, cooperativity correlations depend on solvent and thermodynamic conditions, and are sensitive to mutations that induce changes in constraint topology, so that the “old” mechanical view and the “new” thermodynamic view are united. Despite overall similarity between the TRX proteins within the family, the diversity found in cooperativity correlation suggests that allosteric response will also vary significantly within the TRX family, which is consistent with observations across other protein families [42–47]. To understand allostery further with a direct connection to experimental data, the next step is to calculate redistributions in backbone flexibility and cooperativity correlation due to an external perturbation. Recording the response of a protein to an applied external perturbation that increases or decreases a local energy term has proved to be a successful approach using Ising-like models [48–51] that stress thermodynamic aspects, and with elastic network models [52, 53] that stress mechanical aspects. This perturbation approach is in progress using the mDCM, where both the thermodynamic and mechanical nature of the response is tracked.
Pairwise comparisons
The chief problem of analyzing such large amounts of QSFR data is being able to concisely explain all mechanistic variations. As such, we chose to primarily focus on two exemplar pairwise comparisons, specifically: Anabaena sp. TRX-2 to E. coli TRX and Anabaena sp. TRX-2 to TRX-f from spinach chloroplast. The Anabaena sp. and E. coli homologs represent the case with the largest gap between θRP and θnat, whereas the TRX-f from spinach chloroplast homolog represents the opposite trend. As such, these two pairs bookend the full range of observed variations in QSFR metrics across the nine TRX proteins. Figure 8 highlights that the pairs are near the two extremes of pairwise H-bond conservation across the dataset. As expected, H-bond network conservation is closely related to pairwise structural (Figure 8a) and sequence similarity. After adding additional protein families to the analysis (unpublished results), a quadratic relationship between pairwise H-bond conservation and pairwise structure or sequence similarity is revealed that resembles the famous Chotia and Lesk plot [54] showing a similar relationship between pairwise sequence and structure similarity. The H-bond networks within the Anabaena sp./E. coli pair are 70% conserved (averaging over Figure 8b), whereas they are only 39% conserved across the Anabaena sp./spinach chloroplast pair. Across the former pair, it is remarkable that such drastic QSFR variations can occur in spite of such global similarity within their underlying H-bond networks (see Figure 9a–c). This result highlights the long-range nature of rigidity and how a small number of well-placed constraint differences will drastically affect the mechanical linkage properties. As expected, the latter pair exhibits even greater diversity within the QSFR metrics (Figure 9d–f). Curiously, despite only 23% pairwise sequence identity, TRX-2 from Anabaena sp. displays some known functional relationships to TRX-f from spinach chloroplast. Differing with respect to all other TRXs, it has been shown that TRX-2 and TRX-f easily reduce spinach chloroplast fructose bisphosphatase [55]. This functional similarity is not suggested by the QSFR analysis herein. While functional relationships might be inferred by consideration of QSFR measures, this type of approach would require application of more sophisticated pattern recognition methods, and doing so is beyond the scope of the current work. However, it is known that TRX-f has other distinguishing properties that differentiate it from all other TRXs such as the ability to be reduced by glutathione and the lack of many otherwise conserved TRX residues. Therefore, the large QSFR differences predicted between TRX-f and other TRXs is intriguing.
The QSFR analysis described above shows TRX-2 from Anabaena sp. is one of the most atypical across our dataset. The Anabaena sp. homolog shows much more extensive regions of correlated rigidity compared to the others. The striking difference between it and the E. coli homolog (Figure 9c) is especially surprising considering the large amount of H-bond network similarity. Moreover, global descriptions of the H-bond network (i.e., number of H-bonds, average H-bond energy, and total H-bond energy) are virtually identical. Despite great global similarities in the H-bond network, it is known that TRX-2 from Anabaena sp., like TRX-f from spinach chloroplast, actually possesses quite different functional properties (i.e., unusual enzymatic activities and substrate specificities) from other common TRXs [56]. It is encouraging that the mDCM is sensitive to such subtle differences in H-bond topology and strength.
The effect of redox state
Crystal structures for three reduced/oxidized TRX pairs are currently available. As such, we attempt to elucidate the differences within QSFR that arise from the loss of the active site disulfide. Human cytosolic TRX was the first redox pair solved, with the oxidized form obtained from reduced form crystals by air exposure [57]. The pair showed only minute overall differences (pairwise RMSD = 0.2Å), with the greatest changes, as expected, localized at the active site. Structural changes were also observed near the dimer interface. The first redox pair of crystals grown independently was from fruit fly cytosol TRX [58]. Once again, few global differences were observed (pairwise RMSD = 0.5Å), and the largest localized differences occurred near the active site. Finally, a third redox pair has been crystallized from human mitochondria [59]. Like the previous two pairs, the redox pair is nearly equivalent (pairwise RMSD = 0.2Å). After averaging over the six molecules in the asymmetric unit, there is less deviation between redox pair than there is after averaging only over the oxidized (or, similarly, reduced) structures. Meaning, TRX exhibits minimal global conformational changes during its functional cycling between oxidized and reduced forms.
While the static structures of oxidized and reduced TRXs appear mostly similar, putative dynamic differences are impossible to fully quantify from the static comparisons. The DCM is well suited to detect differences in these structures based on their underlying H-bond networks. Using the same parameter set as above, the reduced structures exhibit a two-state folding transition, albeit the heat capacity curves have slightly smaller Cpmax values (Figure 10a) with lower melting temperatures. The lowered Tm values of the reduced structures are consistent with the diminished stability of the reduced form, as demonstrated by the thermal stability analysis of Hiraoki et al. [60]. This consistency between our predicted relative stabilities and those determined experimentally gives confidence that the mDCM is qualitatively identifying proper two-state transitions. As can be seen in Figure 10b, the subtle structure rearrangements have a marked effect on backbone flexibility. Overall, the two human structure pairs, where the reduced form was obtained from oxidized crystals, display more similar flexibility differences. Some of the differences seen for the fruit fly TRX may be due to the differing crystal forms used. In all three examples, there is considerable flexibility increase at the active site. This is expected because the loss of the constraint specified by the disulfide bond should lead to increased mechanical flexibility in the immediate surrounding region. While most differences, few that they are, between TRX redox pairs are localized to the active site [57–59], a general “loosening” of the entire protein structure is revealed by NMR order parameters [61, 62]. In line with these NMR results, we also observe that several structurally remote regions exhibit increased flexibility.
Intriguingly, our results also indicate that some regions actually become more rigid upon reduction. This effect occurs due to the release of strain energy contained within the oxidized active site region [15]. Removal of the disulfide allows other regions to become better packed, resulting in a more optimized H-bond network. The NMR work described above is for E. coli TRX (to the best of our knowledge, NMR order parameters comparing oxidized and reduced human or fruit fly TRX pairs do not exist). It is interesting to note that the protection factors reported in [61] indicate that there is a small region of increased rigidity just prior to the active site that is consistent with the rigidity increase we observe in the fruit fly TRX (see Figure 10b). However, an increase in rigidity in this specific region is not predicted by the mDCM for the two human TRX pairs. For the same reason of better packing, the two reduced human TRXs are predicted to have an increase in rigidity at the c-terminal ends. While the predictions form the mDCM are subject to being an artifact of an oversimplified model or the uncertainties within the X-ray crystal structure, these results point to a plausible mechanism of structural rearrangement. There is also the general problem that the time scale probed by NMR is very fast (ps-ns), whereas the DCM flexibility predictions describe quasi-stationary motions. It will be interesting to know if future experiments will be able to confirm the nuanced details of where increased rigidity is predicted in the different TRX pairs, such as the c-terminal ends that we predict for the two reduced human TRXs.
Figure 10c compares cooperativity correlation between the reduced and oxidized pairs. In two of the three cases (TRX-2 from human mitochondria and TRC from human cytosol), there is a general decrease in rigidity correlation, whereas the human cytosol pair is virtually unchanged. Consistent with the above results, the relative changes within rigidity correlation are consistent with the relative locations of θRP and θnat. To highlight this point, we define χ as the change in the relative positions of θRP and θnat of the oxidized and reduced structures (see Table 4). In the fruit fly cytosol pair, χ = 0.0, which is consistent with the pair’s strongly conserved cooperativity correlation plots. Conversely, the other two pairs show diminished rigidity correlation, which is predicted by χ = 0.3 and 0.4. Likewise, the increased flexibility within the reduced human homologs parallels the reduced rigidity correlation, even though the exact details and extent of the change are not conserved. Taken as a whole, the general trends in QSFR upon TRX reduction is consistent with the observations across the nine TRX homologs. It is important to note that attempts to understand how QSFR is manifest in terms of simple global metrics all failed, and rather, the differences are attributed to subtle differences in the H-bond network.
Table 4.
Source | Tm | Cpmax | Hyst. | Gbar | θnat | θTS | θunf | θRP | χ2 |
---|---|---|---|---|---|---|---|---|---|
Human mitoch. (ox) | 376 | 21.3 | 61.0 | 2.5 | 1.1 | 1.7 | 2.2 | 1.3 | -- |
Human mitoch. (red) | 360 | 11.4 | 36.9 | 2.3 | 1.4 | 1.9 | 2.4 | 1.3 | -- |
Difference | 16 | 9.9 | 24.1 | 0.2 | −0.3 | −0.2 | −0.2 | 0.0 | 0.3 |
Fruit fly cytosol (ox) | 379 | 15.6 | 30.0 | 1.9 | 1.3 | 1.9 | 2.5 | 1.4 | -- |
Fruit fly cytosol (red) | 354 | 6.4 | 7.5 | 0.4 | 1.5 | 1.8 | 2.2 | 1.2 | -- |
Difference | 25 | 9.2 | 22.5 | 1.5 | −0.2 | 0.1 | 0.3 | 0.2 | 0.0 |
Human cytosol (ox) | 369 | 19.9 | 52.9 | 3.6 | 1.2 | 1.7 | 2.4 | 1.3 | -- |
Human cytosol (red) | 357 | 16.1 | 43.0 | 3.1 | 1.2 | 1.8 | 2.4 | 1.3 | -- |
Difference | 12 | 3.8 | 9.9 | 0.5 | 0.0 | −0.1 | 0.0 | 0.0 | 0.4 |
Thermodynamic descriptions of oxidized and reduced pairs. The first nine columns are the same as in Table 3.
The change in the relative positions of the mechanical and thermodynamic transitions is described by: .
CONCLUSIONS
Using the mDCM, we analyze QSFR descriptions of nine oxidized and three reduced members of the TRX family. While backbone flexibility is well conserved in the family, other QSFR metrics reveal rich amounts of diversity in mechanical and thermodynamic response. Intriguingly, QSFR diversity can be mostly explained by comparing the relative locations of the mechanical and thermodynamic transitions, described by θRP and θnat. However, none of the predicted QSFR properties, including θRP and θnat, are correlated to global structural properties (i.e., number of H-bonds, average H-bond strength, total H-bond energy, etc.). This result demonstrates how small-scale structural variations are amplified into discernible global differences by propagating mechanical couplings through the H-bond network. Despite the simplicity of the mDCM, its predicted QSFR differences are largely consistent with known functional differences and variations within allosteric response. Further detailed comparisons will be possible upon completion of a more sophisticated DCM that is currently underway, to predict thermodynamic and mechanical response upon perturbation of the protein by mutation or ligand binding.
METHODS
The minimal distance constraint model
All results were obtained by employing the mDCM, which has been extensively defined and explained in previously published work [7, 9, 13–15, 25]. The critical elements are highlighted here. Interactions are modeled as distance constraints that are assigned enthalpy and entropy values. The three-dimensional structure is mapped onto a graph where nodes and edges represent atoms and distance constraints respectively. This graph defines a constraint topology. A fixed constraint topology consists of all accessible molecular conformations consistent with a given set of distance constraints between pairs of atoms. Starting with the input graph, an ensemble of diverse constraint topologies is generated by perturbing away from the input constraint topology. Since each internal dihedral angle can be locked with either a native or disordered distance constraint, and each H-bond or salt-bridge can be present or not, there are ~2635 accessible constraint topologies for thioredoxin. Due to the immense number of possible constraint topologies (graphs), a mean field approximation is invoked by binning states together that have the same total numbers of native torsion constraints and H-bonds, which defines the macrostate of a protein. The number of native torsion constraints and number of H-bonds are used as order parameters. The non-trivial feature of the DCM is that nonadditivity of component entropies is explicitly modeled. Rigidity and flexibility properties of the structure are used to determine if a distance constraint as independent or redundant [10]. The total enthalpy is simply a sum of all enthalpic components; however, the total entropy, which is a nonadditive property [29, 30], is only evaluated over the set of independent constraints.
Using the nonadditivity principles described above, the free energy of each macrostate is calculated by Monte Carlo sampling over topological frameworks that are constrained to satisfy the two order parameters. From this involved, yet computationally efficient procedure, the free energy landscape, partition function, and all resultant thermodynamic quantities, are calculated for TRX in less than a minute on a single CPU. Once the free energy landscape is constructed, it is used to appropriately weight mechanical properties to obtain proper thermodynamic averages. Meaning, the essence of the DCM is to use the mechanical properties to make more accurate thermodynamic calculations based on macromolecular structure, and then, in turn, to use the thermodynamic properties to predict emergent mechanical response.
The free energy function of each macrostate is given by:
Eq. (1) |
where Uhb is the intramolecular H-bond energy, uNhb describes H-bonding to solvent, vNnt describes native torsion force energy, Sc(Nnt,Nhb|δnat) is the conformational entropy, and Sm(Nnt,Nhb) is the mixing entropy associated with sampling the specified macrostate. The values of the three phenomenological parameters u, v, and δnat are determined by fitting to experimental Cp curves from differential scanning calorimetry (DSC). While not specified here, Sc(Nnt,Nhb|δnat) is calculated over the set of independent constraints, and this set is sampled using Monte Carlo. Previous work has demonstrated that the predicted thermodynamic and, even more so, mechanical properties are robust to slight parameterization differences. Therefore, a priori assuming parameter transferability is reasonable [14].
Structure preparation
Nine different oxidized and three reduced TRX homologs are investigated here in order to provide a large evolutionary cross-section for analysis, while still being a tractable number for data handling and visual comparison. The dataset (see Table 1) includes homologs from prokaryotic bacteria (E. coli, S. aureus, and Anabaena sp.), eukaryotic microbes (C. reinhardtii), plants (two distinct spinach chloroplast paralogs), and animals (fruit flies and humans) [27, 32, 56–59, 63, 64]. All TRXs investigated here have x-ray crystal structures solved at high resolution (greater than 2.3 Å resolution). Moreover, the structures are remarkably similar; the pairwise α-carbon root mean square deviation (RMSD), computed by combinatorial extension [65], ranges from 0.7 to 1.6 Å. NMR structures of TRXs were also attempted, but gave poorly defined heat capacity transitions. Thus, they were excluded from further analysis.
Table 1.
Protein | Source | PDBID | Resol.1 | # Res. | # HB | Avg HB E2 | RMSD3 | Seq Id4 |
---|---|---|---|---|---|---|---|---|
Oxidized structures | ||||||||
TRX | E. coli | 2TRX | 1.7 Å | 108 | 141 | −2.6 | 0.0 Å | 100 % |
TRX | S. aureus | 2O7K | 2.2 Å | 103 | 142 | −2.3 | 1.3 Å | 51 % |
TRX-m | Spinach cholorplast | 1FB6 | 2.1 Å | 104 | 130 | −2.4 | 1.0 Å | 48 % |
TRX-2 | Anabaena sp. | 1THX | 1.6 Å | 108 | 142 | −2.6 | 1.0 Å | 43 % |
TRX-2 | Human mitochondria | 1W4V | 1.8 Å | 107 | 142 | −2.5 | 1.2 Å | 38 % |
TRX-h | C. reinhardtii cytosol | 1EP7 | 2.1 Å | 112 | 142 | −2.5 | 1.4 Å | 36 % |
TRX-f | Spinach cholorplast | 1F9M | 1.9 Å | 112 | 148 | −2.5 | 1.4 Å | 32 % |
TRX | Fruit fly cytosol | 1XWA | 2.2 Å | 106 | 145 | −2.4 | 1.2 Å | 31 % |
TRX | Human cytosol | 1ERU | 2.1 Å | 105 | 146 | −2.4 | 1.3 Å | 27 % |
Reduced structures | ||||||||
TRX-2 | Human mitochondria | 1UVZ | 2.0 Å | 107 | 140 | −2.3 | 0.2 Å | 100 % |
TRX | Fruit fly cytosol | 1XWC | 2.3 Å | 106 | 141 | −2.3 | 0.5 Å | 100 % |
TRX | Human cytosol | 1ERT | 1.7 Å | 105 | 155 | −2.2 | 0.2 Å | 100 % |
The x-ray crystallographic resolution.
The average hydrogen bond energy (kcal/mol).
The α-carbon root mean square deviation comparing each structure to the E. coli homolog. In the case of the reduced TRXs, the comparison is to the oxidized state of the same homolog.
The pairwise percent residue identity comparing each sequence to the E. coli homolog. In the case of the reduced TRXs, the comparison is to its oxidized state.
Hydrogen atoms are added to initial structures using H++ web server [66]. In addition to adding missing hydrogen atoms, H++ uses Poisson-Boltzmann continuum electrostatic theory to calculate the appropriate ionization state of the protein and performs an optimization of hydrogen positions. H++ is a convenient method to generate input structures to the mDCM, involving minimal perturbation away from the experimental structure. Employed electrostatic parameters include a salinity of 0.15 M and external/internal dielectrics of 80 and 6, respectively. Output structures were protonated assuming a pH of 7.0, which is consistent with the conditions used in the original DSC experiments.
Hydrogen bond network comparisons
H-bond network similarity is computed on a pairwise basis. Starting with the pairwise sequence alignment of two TRX homologs, equivalent residue positions are identified. Equivalent H-bonds are simply defined by identical donor and acceptor residue pairs. While this residue-level approach does not identify variations that can occur at the atomic-level, a cursory overview of the results suggests that the variation between the two approaches is insignificant. As such, overall H-bond network similarity is simply computed as the ratio of equivalent H-bonds to the average number within each being considered. However, it does not necessarily make sense to consider equivalence within a feeble (Ehb ~ −0.1 kcal/mol) and a strong (Ehb < −6.0 kcal/mol) H-bond. As such, we invoke a series of H-bond energy cut-offs, such that only H-bonds stronger than the cut-off are considered present.
Hierarchical comparison of cooperativity correlation
Using R (http://www.r-project.org/), each cooperativity correlation plot pair is compared using the Pearson correlation coefficients, which are hierarchically clustered using Ward’s method with Euclidean distances. Note that the results are generally conserved when using the Spearman rank correlation instead of Pearson.
Supplementary Material
Acknowledgments
Grant sponsor: This work is supported by NIH grant R01 GM073082-01A1 to DJJ and DRL.
We thank Andrei Y. Istomin for proofreading a draft of this manuscript and providing a number of valuable comments. Anthony A. Fodor is also thanked for insightful discussions related to the presented work. Key to the DCM is the use of graph-rigidity algorithms. This algorithm is claimed in U.S. Patent Number 6,014,449, which has been assigned to the Board of Trustees Michigan State University. Used with permission.
References
- 1.Holmgren A. Thioredoxin. Annu Rev Biochem. 1985;54:237–271. doi: 10.1146/annurev.bi.54.070185.001321. [DOI] [PubMed] [Google Scholar]
- 2.Gane PJ, Freedman RB, Warwicker J. A molecular model for the redox potential difference between thioredoxin and DsbA, based on electrostatics calculations. J Mol Biol. 1995;249:376–387. doi: 10.1006/jmbi.1995.0303. [DOI] [PubMed] [Google Scholar]
- 3.Stefankova P, Kollarova M, Barak I. Thioredoxin - structural and functional complexity. Gen Physiol Biophys. 2005;24:3–11. [PubMed] [Google Scholar]
- 4.Carvalho AP, Fernandes PA, Ramos MJ. Similarities and differences in the thioredoxin superfamily. Prog Biophys Mol Biol. 2006;91:229–248. doi: 10.1016/j.pbiomolbio.2005.06.012. [DOI] [PubMed] [Google Scholar]
- 5.Burke-Gaffney A, Callister ME, Nakamura H. Thioredoxin: friend or foe in human disease? Trends Pharmacol Sci. 2005;26:398–404. doi: 10.1016/j.tips.2005.06.005. [DOI] [PubMed] [Google Scholar]
- 6.Lillig CH, Holmgren A. Thioredoxin and related molecules--from biology to health and disease. Antioxid Redox Signal. 2007;9:25–47. doi: 10.1089/ars.2007.9.25. [DOI] [PubMed] [Google Scholar]
- 7.Jacobs DJ. Recent Research Developments in Biophysics. Vol. 5. Trivandrum, India: Transworld Research Network; 2006. Predicting protein flexibility and stability using network rigidity: a new modeling paradigm; pp. 71–13. [Google Scholar]
- 8.Jacobs DJ, Dallakyan S, Wood GG, Heckathorne A. Network rigidity at finite temperature: relationships between thermodynamic stability, the nonadditivity of entropy, and cooperativity in molecular systems. Phys Rev E Stat Nonlin Soft Matter Phys. 2003;68:061109. doi: 10.1103/PhysRevE.68.061109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Jacobs DJ, Mottonen JM, Vorov OK, Istomin AY, Livesay DR. Progress in Biopolymer Res. Hauppauge NY: Nova Science Publishers; 2008. Protein thermodynamcis modeled using network rigidity. To appear. [Google Scholar]
- 10.Jacobs DJ, Rader AJ, Kuhn LA, Thorpe MF. Protein flexibility predictions using graph theory. Proteins. 2001;44:150–165. doi: 10.1002/prot.1081. [DOI] [PubMed] [Google Scholar]
- 11.Jacobs DJ, Thorpe MF. Generic rigidity percolation: The pebble game. Phys Rev Lett. 1995;75:4051–4054. doi: 10.1103/PhysRevLett.75.4051. [DOI] [PubMed] [Google Scholar]
- 12.Jacobs DJ, Thorpe MF. Computer-implemented system for analyzing rigidity of substructures within a macromolecule. 6014449. US Patent. 1998
- 13.Jacobs DJ, Dallakyan S. Elucidating protein thermodynamics from the three-dimensional structure of the native state using network rigidity. Biophys J. 2005;88:903–915. doi: 10.1529/biophysj.104.048496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Livesay DR, Dallakyan S, Wood GG, Jacobs DJ. A flexible approach for understanding protein stability. FEBS Lett. 2004;576:468–476. doi: 10.1016/j.febslet.2004.09.057. [DOI] [PubMed] [Google Scholar]
- 15.Jacobs DJ, Livesay DR, Hules J, Tasayco ML. Elucidating quantitative stability/flexibility relationships within thioredoxin and its fragments using a distance constraint model. J Mol Biol. 2006;358:882–904. doi: 10.1016/j.jmb.2006.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Livesay DR, Jacobs DJ. Conserved quantitative stability/flexibility relationships (QSFR) in an orthologous RNase H pair. Proteins. 2006;62:130–143. doi: 10.1002/prot.20745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Go N. Theoretical studies of protein folding. Annu Rev Biophys Bioeng. 1983;12:183–210. doi: 10.1146/annurev.bb.12.060183.001151. [DOI] [PubMed] [Google Scholar]
- 18.Head-Gordon T, Brown S. Minimalist models for protein folding and design. Curr Opin Struct Biol. 2003;13:160–167. doi: 10.1016/s0959-440x(03)00030-7. [DOI] [PubMed] [Google Scholar]
- 19.Kallias A, Bachmann M, Janke W. Thermodynamics and kinetics of a Go protein-like heteropolymer model with two-state folding characteristics. J Chem Phys. 2008;128:055102. doi: 10.1063/1.2822287. [DOI] [PubMed] [Google Scholar]
- 20.Hilser VJ, Freire E. Structure-based calculation of the equilibrium folding pathway of proteins. Correlation with hydrogen exchange protection factors. J Mol Biol. 1996;262:756–772. doi: 10.1006/jmbi.1996.0550. [DOI] [PubMed] [Google Scholar]
- 21.Hilser VJ, Garcia-Moreno EB, Oas TG, Kapp G, Whitten ST. A statistical thermodynamic model of the protein ensemble. Chem Rev. 2006;106:1545–1558. doi: 10.1021/cr040423+. [DOI] [PubMed] [Google Scholar]
- 22.Godoy-Ruiz R, Henry ER, Kubelka J, Hofrichter J, Munoz V, Sanchez-Ruiz JM, Eaton WA. Estimating Free-Energy Barrier Heights for an Ultrafast Folding Protein from Calorimetric and Kinetic Data. J Phys Chem B. 2008;112:5938–5949. doi: 10.1021/jp0757715. [DOI] [PubMed] [Google Scholar]
- 23.Munoz V. What can we learn about protein folding from Ising-like models? Curr Opin Struct Biol. 2001;11:212–216. doi: 10.1016/s0959-440x(00)00192-5. [DOI] [PubMed] [Google Scholar]
- 24.Hollien J, Marqusee S. A thermodynamic comparison of mesophilic and thermophilic ribonucleases H. Biochemistry. 1999;38:3831–3836. doi: 10.1021/bi982684h. [DOI] [PubMed] [Google Scholar]
- 25.Livesay DR, Huynh DH, Dallakyan S, Jacobs DJ. Hydrogen bond networks determine emergent mechanical and thermodynamic properties across a protein family. Chem Central J. 2008;2:17. doi: 10.1186/1752-153X-2-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Georgescu RE, Garcia-Mira MM, Tasayco ML, Sanchez-Ruiz JM. Heat capacity analysis of oxidized Escherichia coli thioredoxin fragments (1--73, 74--108) and their noncovalent complex. Evidence for the burial of apolar surface in protein unfolded states. Eur J Biochem. 2001;268:1477–1485. doi: 10.1046/j.1432-1327.2001.02014.x. [DOI] [PubMed] [Google Scholar]
- 27.Roos G, Garcia-Pino A, Van Belle K, Brosens E, Wahni K, Vandenbussche G, Wyns L, Loris R, Messens J. The conserved active site proline determines the reducing power of Staphylococcus aureus thioredoxin. J Mol Biol. 2007;368:800–811. doi: 10.1016/j.jmb.2007.02.045. [DOI] [PubMed] [Google Scholar]
- 28.Eklund H, Gleason FK, Holmgren A. Structural and functional relations among thioredoxins of different species. Proteins. 1991;11:13–28. doi: 10.1002/prot.340110103. [DOI] [PubMed] [Google Scholar]
- 29.Dill KA. Additivity principles in biochemistry. J Biol Chem. 1997;272:701–704. doi: 10.1074/jbc.272.2.701. [DOI] [PubMed] [Google Scholar]
- 30.Mark AE, van Gunsteren WF. Decomposition of the free energy of a system in terms of specific interactions. Implications for theoretical and experimental studies. J Mol Biol. 1994;240:167–176. doi: 10.1006/jmbi.1994.1430. [DOI] [PubMed] [Google Scholar]
- 31.Beitz E. TEXshade: shading and labeling of multiple sequence alignments using LATEX2 epsilon. Bioinformatics. 2000;16:135–139. doi: 10.1093/bioinformatics/16.2.135. [DOI] [PubMed] [Google Scholar]
- 32.Capitani G, Markovic-Housley Z, DelVal G, Morris M, Jansonius JN, Schurmann P. Crystal structures of two functionally different thioredoxins in spinach chloroplasts. J Mol Biol. 2000;302:135–154. doi: 10.1006/jmbi.2000.4006. [DOI] [PubMed] [Google Scholar]
- 33.Bradley MJ, Chivers PT, Baker NA. Molecular dynamics simulation of the Escherichia coli NikR protein: equilibrium conformational fluctuations reveal interdomain allosteric communication pathways. J Mol Biol. 2008;378:1155–1173. doi: 10.1016/j.jmb.2008.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Lange OF, Lakomek NA, Fares C, Schroder GF, Walter KF, Becker S, Meiler J, Grubmuller H, Griesinger C, de Groot BL. Recognition dynamics up to microseconds revealed from an RDC-derived ubiquitin ensemble in solution. Science. 2008;320:1471–1475. doi: 10.1126/science.1157092. [DOI] [PubMed] [Google Scholar]
- 35.Istomin AY, Gromiha MM, Vorov OK, Jacobs DJ, Livesay DR. New insight into long-range nonadditivity within protein double-mutant cycles. Proteins. 2008;70:915–924. doi: 10.1002/prot.21620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Formaneck MS, Ma L, Cui Q. Reconciling the “old” and “new” views of protein allostery: a molecular simulation study of chemotaxis Y protein (CheY) Proteins. 2006;63:846–867. doi: 10.1002/prot.20893. [DOI] [PubMed] [Google Scholar]
- 37.Gunasekaran K, Ma B, Nussinov R. Is allostery an intrinsic property of all dynamic proteins? Proteins. 2004;57:433–443. doi: 10.1002/prot.20232. [DOI] [PubMed] [Google Scholar]
- 38.Kern D, Zuiderweg ER. The role of dynamics in allosteric regulation. Curr Opin Struct Biol. 2003;13:748–757. doi: 10.1016/j.sbi.2003.10.008. [DOI] [PubMed] [Google Scholar]
- 39.Ma B, Shatsky M, Wolfson HJ, Nussinov R. Multiple diverse ligands binding at a single protein site: a matter of pre-existing populations. Protein Sci. 2002;11:184–197. doi: 10.1110/ps.21302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Swain JF, Gierasch LM. The changing landscape of protein allostery. Curr Opin Struct Biol. 2006;16:102–108. doi: 10.1016/j.sbi.2006.01.003. [DOI] [PubMed] [Google Scholar]
- 41.Tsai CJ, del Sol A, Nussinov R. Allostery: absence of a change in shape does not imply that allostery is not at play. J Mol Biol. 2008;378:1–11. doi: 10.1016/j.jmb.2008.02.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Jensen AA, Spalding TA. Allosteric modulation of G-protein coupled receptors. Eur J Pharm Sci. 2004;21:407–420. doi: 10.1016/j.ejps.2003.11.007. [DOI] [PubMed] [Google Scholar]
- 43.Jensen RA, Stenmark SL. Comparative allostery of 3-deoxy-D-arabino-heptulosonate-7-phosphate synthetase as a molecular basis for classification. J Bacteriol. 1970;101:763–769. doi: 10.1128/jb.101.3.763-769.1970. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Jensen RA, Twarog R. Allostery of 3-deoxy-D-arabino-heptulosonate 7-phosphate synthetase in Clostridium: another conserved generic characteristic. J Bacteriol. 1972;111:641–648. doi: 10.1128/jb.111.3.641-648.1972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.May LT, Avlani VA, Sexton PM, Christopoulos A. Allosteric modulation of G protein-coupled receptors. Curr Pharm Des. 2004;10:2003–2013. doi: 10.2174/1381612043384303. [DOI] [PubMed] [Google Scholar]
- 46.Royer WE, Jr, Knapp JE, Strand K, Heaslet HA. Cooperative hemoglobins: conserved fold, diverse quaternary assemblies and allosteric mechanisms. Trends Biochem Sci. 2001;26:297–304. doi: 10.1016/s0968-0004(01)01811-4. [DOI] [PubMed] [Google Scholar]
- 47.Whitaker RJ, Byng GS, Gherna RL, Jensen RA. Comparative allostery of 3-deoxy-D-arabino-heptulosonate 7-phosphate synthetase as an indicator of taxonomic relatedness in pseudomonad genera. J Bacteriol. 1981;145:752–759. doi: 10.1128/jb.145.2.752-759.1981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Freire E. Can allosteric regulation be predicted from structure? Proc Natl Acad Sci U S A. 2000;97:11680–11682. doi: 10.1073/pnas.97.22.11680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Liu T, Whitten ST, Hilser VJ. Ensemble-based signatures of energy propagation in proteins: a new view of an old phenomenon. Proteins. 2006;62:728–738. doi: 10.1002/prot.20749. [DOI] [PubMed] [Google Scholar]
- 50.Pan H, Lee JC, Hilser VJ. Binding sites in Escherichia coli dihydrofolate reductase communicate by modulating the conformational ensemble. Proc Natl Acad Sci U S A. 2000;97:12020–12025. doi: 10.1073/pnas.220240297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Whitten ST, Garcia-Moreno BE, Hilser VJ. Ligand effects on the protein ensemble: unifying the descriptions of ligand binding, local conformational fluctuations, and protein stability. Methods Cell Biol. 2008;84:871–891. doi: 10.1016/S0091-679X(07)84027-1. [DOI] [PubMed] [Google Scholar]
- 52.Ming D, Wall ME. Allostery in a coarse-grained model of protein dynamics. Phys Rev Lett. 2005;95:198103. doi: 10.1103/PhysRevLett.95.198103. [DOI] [PubMed] [Google Scholar]
- 53.Ming D, Wall ME. Interactions in native binding sites cause a large change in protein dynamics. J Mol Biol. 2006;358:213–223. doi: 10.1016/j.jmb.2006.01.097. [DOI] [PubMed] [Google Scholar]
- 54.Chothia C, Lesk AM. The relation between the divergence of sequence and structure in proteins. Embo J. 1986;5:823–826. doi: 10.1002/j.1460-2075.1986.tb04288.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Gleason FK. Activities of two dissimilar thioredoxins from the cyanobacterium Anabaena sp. strain PCC 7120. J Bacteriol. 1992;174:2592–2598. doi: 10.1128/jb.174.8.2592-2598.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Saarinen M, Gleason FK, Eklund H. Crystal structure of thioredoxin-2 from Anabaena. Structure. 1995;3:1097–1108. doi: 10.1016/s0969-2126(01)00245-3. [DOI] [PubMed] [Google Scholar]
- 57.Weichsel A, Gasdaska JR, Powis G, Montfort WR. Crystal structures of reduced, oxidized, and mutated human thioredoxins: evidence for a regulatory homodimer. Structure. 1996;4:735–751. doi: 10.1016/s0969-2126(96)00079-2. [DOI] [PubMed] [Google Scholar]
- 58.Wahl MC, Irmler A, Hecker B, Schirmer RH, Becker K. Comparative structural analysis of oxidized and reduced thioredoxin from Drosophila melanogaster. J Mol Biol. 2005;345:1119–1130. doi: 10.1016/j.jmb.2004.11.004. [DOI] [PubMed] [Google Scholar]
- 59.Smeets A, Evrard C, Landtmeters M, Marchand C, Knoops B, Declercq JP. Crystal structures of oxidized and reduced forms of human mitochondrial thioredoxin 2. Protein Sci. 2005;14:2610–2621. doi: 10.1110/ps.051632905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Hiraoki T, Brown SB, Stevenson KJ, Vogel HJ. Structural comparison between oxidized and reduced Escherichia coli thioredoxin. Proton NMR and CD studies. Biochemistry. 1988;27:5000–5008. doi: 10.1021/bi00414a008. [DOI] [PubMed] [Google Scholar]
- 61.Jeng MF, Dyson HJ. Comparison of the hydrogen-exchange behavior of reduced and oxidized Escherichia coli thioredoxin. Biochemistry. 1995;34:611–619. doi: 10.1021/bi00002a028. [DOI] [PubMed] [Google Scholar]
- 62.Stone MJ, Chandrasekhar K, Holmgren A, Wright PE, Dyson HJ. Comparison of backbone and tryptophan side-chain dynamics of reduced and oxidized Escherichia coli thioredoxin using 15N NMR relaxation measurements. Biochemistry. 1993;32:426–435. doi: 10.1021/bi00053a007. [DOI] [PubMed] [Google Scholar]
- 63.Katti SK, LeMaster DM, Eklund H. Crystal structure of thioredoxin from Escherichia coli at 1.68 A resolution. J Mol Biol. 1990;212:167–184. doi: 10.1016/0022-2836(90)90313-B. [DOI] [PubMed] [Google Scholar]
- 64.Menchise V, Corbier C, Didierjean C, Saviano M, Benedetti E, Jacquot JP, Aubry A. Crystal structure of the wild-type and D30A mutant thioredoxin h of Chlamydomonas reinhardtii and implications for the catalytic mechanism. Biochem J. 2001;359:65–75. doi: 10.1042/0264-6021:3590065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Shindyalov IN, Bourne PE. Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 1998;11:739–747. doi: 10.1093/protein/11.9.739. [DOI] [PubMed] [Google Scholar]
- 66.Gordon JC, Myers JB, Folta T, Shoja V, Heath LS, Onufriev A. H++: a server for estimating pKas and adding missing hydrogens to macromolecules. Nucleic Acids Res. 2005;33:W368–371. doi: 10.1093/nar/gki464. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.