Graphical abstract

Keywords: de novo protein design, protein folding and stability, stability landscape, non-additive effects, (β/α)8-barrel
Abbreviations: A, area under the stability curve; Tm, midpoint of thermal unfolding; CD, Circular Dichroism; DSC, Differential Scanning Calorimetry; Eact, activation energy; GdnHCl, guanidinium chloride; IF, Intrinsic Fluorescence; ΔΔGint, Gibbs free energy of coupling; ΔH85°C, change in enthalpy extrapolated at 85 °C
Highlights
-
•
The TIM barrel is a versatile fold to understand structure-stability relationships.
-
•
A collection of de novo TIM barrels with improved hydrophobic cores was designed.
-
•
DeNovoTIMs are reversible in chemical and thermal unfolding, which is uncommon in TIM barrels.
-
•
Epistatic effects play a central role in DeNovoTIMs stabilization.
-
•
DeNovoTIMs navigate a previously uncharted region of the stability landscape.
Abstract
The ability to design stable proteins with custom-made functions is a major goal in biochemistry with practical relevance for our environment and society. Understanding and manipulating protein stability provide crucial information on the molecular determinants that modulate structure and stability, and expand the applications of de novo proteins. Since the (β/⍺)8-barrel or TIM-barrel fold is one of the most common functional scaffolds, in this work we designed a collection of stable de novo TIM barrels (DeNovoTIMs), using a computational fixed-backbone and modular approach based on improved hydrophobic packing of sTIM11, the first validated de novo TIM barrel, and subjected them to a thorough folding analysis. DeNovoTIMs navigate a region of the stability landscape previously uncharted by natural TIM barrels, with variations spanning 60 degrees in melting temperature and 22 kcal per mol in conformational stability throughout the designs. Significant non-additive or epistatic effects were observed when stabilizing mutations from different regions of the barrel were combined. The molecular basis of epistasis in DeNovoTIMs appears to be related to the extension of the hydrophobic cores. This study is an important step towards the fine-tuned modulation of protein stability by design.
Introduction
Proteins are essential macromolecules capable of performing diverse and exquisite biological functions such as immune protection, cellular communication, or enzymatic reactions. To guarantee such activities, the functional states must act under specific environmental conditions in a relevant time scale, that is, proteins must be “stable”. Protein stability is required to maintain functional structures and it enhances the ability of proteins to evolve new properties.1, 2 The central role of proteins in the chemistry of life, as well as their increasing application in basic and applied research, implies that the understanding and manipulation of protein stability are highly relevant.
There are two main indicators of protein conformational stability at equilibrium. One is the difference of free energy between the native and unfolded states at a given temperature (ΔG), which is often obtained by chemical unfolding experiments carried out at 25 °C. In addition, stability is also assessed in the context of thermal unfolding, where the unfolding temperature (Tm), the temperature at the midpoint of the transition from native to the unfolded state, is the most common parameter employed to quantify stability. Both the ΔG and Tm parameters, usually determined as criteria for a “stable” protein, are related with the enthalpy (ΔH) and heat capacity (ΔCP) changes through the Gibbs-Helmholtz equation, which describes the variation of ΔG with temperature, the so-called “stability curve” of proteins.3 Different mechanisms have been proposed to modify the stability curve of proteins,4 and numerous studies on natural proteins and their site-directed mutants have been used to rationalize the stability of thermophilic proteins and moreover to engineer thermostability.5
Historically, the design of stable proteins has been one of the main objectives of computational protein design.6 Several strategies, such as increasing the hydrophobic area in internal cores, improvement of water-protein interactions, the introduction of disulfide bridges as well as the addition of salt bridges, have been proposed.7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 The design of de novo proteins can further enhance our understanding of the physicochemical properties that modulate stability. For example, although folding behavior has been only addressed for very few cases, the kinetic analysis of the folding mechanism of two de novo β⍺ proteins has revealed complex free energy surfaces.19, 20 The fine-tuning of conformational stability, that is, the manipulation of the protein stability curve, is an open challenge for protein design and engineering. Such a goal requires a comprehensive characterization of de novo proteins, describing the combination of thermodynamic parameters that can be reached in a particular fold.
Within the different topologies that a protein can adopt, the TIM-barrel or (β/⍺)8-barrel fold is one of the most abundant superfolds in nature.21 Based on proteomic analysis, the TIM-barrel domain is also close to the average size of proteins present in Escherichia coli.22 Besides, the TIM-barrel fold is one of the most successful topologies used in nature to host catalytic activities. Due to its large variety of functions and its ubiquity in different types of enzymes, the TIM barrel represents a suitable scaffold for protein design and engineering.23 For these reasons, its construction has been an important objective over the years.24, 25, 26, 27, 28 Recently, the successful design of a de novo four-fold symmetric TIM barrel was described: the sTIM11 protein.29 Considering that the sTIM11 sequence is significantly different from the ones found in naturally occurring TIM barrels, the potential of this scaffold to explore new thermodynamic properties and functions is highly interesting. sTIM11 shows a high melting temperature (Tm = 80 °C) but low conformational stability (ΔG25°C= ~4 kcal mol−1) when compared to natural TIM barrels.29, 30, 31, 32 Since low conformational stability often results in high sensitivity to mutations and changes in the environment, this can limit the design of novel proteins with new functions.8 Thus, fine-tuning the stability of the sTIM11 scaffold is a prerequisite to functionalize and generate tailor-made barrels for applications in biochemistry, biotechnology, and medicine. In this work, a fixed-backbone design with a modular approach was used to generate a collection of de novo TIM barrels. Their thermodynamic and structural properties were characterized in detail, increasing our knowledge on how stability can be fine-tuned by design.
Results and discussion
DeNovoTIM collection designed by modular repacking of a de novo TIM barrel
The de novo protein sTIM11 is an idealized four-fold symmetric TIM barrel of 184 residues, which was designed to include two cysteines that, however, did not form the intended disulfide bond (Figure 1). To avoid reactive free thiols, both residues were reverted to the residues in the original four-fold design (C8Q and C181V), resulting in sTIM11noCys. The base design DeNovoTIM0, which is the starting point for all further constructs in this work, additionally contains the changes W34V and A38G in all symmetry-related quarters. These residues are situated in every second ⍺/β-loop, and in sTIM11, these tryptophan residues are the most highly solvent exposed. While different strategies have been explored to increase protein stability,8, 18 here we focused on hydrophobic repacking. The structural analysis suggested three regions to be amenable to improvements in sTIM11, one in the internal and two in the peripheral hydrophobic cores. The inner face of the circular sheet forms the internal core, whereas the outer face of the strands and the internal face of the helices constitute the peripheral core. In this latter, we identified two regions, henceforth named bottom and top cores as shown in Figure 1. The residues lining the three regions were subjected to fixed-backbone Rosetta design according to the flow diagram in Appendix A, 89, 90, 91.
Figure 1.
Modular design approach to obtain the DeNovoTIM collection. Cartoon representation of the regions and the corresponding residues modified in each design round. The two cysteine residues present in sTIM11 that were reverted to the corresponding symmetry-related residues in sTIM11noCys are shown in magenta (C8Q and C181V). Mutations W34V and A38G (as well as their 4-fold-symmetry related residues) introduced in DeNovoTIM0 are shown in black. The internal core, formed by the β-barrel residues A21, R23, I40, I42, A67, R69, I86, and I88 (as well as their 2 fold-symmetry related residues) is shown in orange. The bottom core, formed by the N-terminal region of even β-strands and the C-region of the flanking ⍺-helices, that is, residues Q11, E15, T18, K31, and V34 (as well as their 4-fold-symmetry related residues) is colored green. The top core situated at the C-terminal region of the odd β-strands and the N-terminal region of the flanking ⍺-helices formed by residues K2, A5, W6, Y22, S24, and D29 (as well as their 4-fold-symmetry related residues) is shown in purple. All the sequences analyzed in this work are reported in Figure S2 and Tables S1 and S2.
Ten designs were selected for characterization in the first round: four with modifications in the internal core (DeNovoTIM1-4) as well as three designs each for the bottom (DeNovoTIM5-7) and the top core (DeNovoTIM8-10) (Figure S2). For the internal core, no improved designs could be identified when four-fold symmetry was preserved. Therefore, in DeNovoTIM1-4 only a two-fold symmetry was enforced. An exploratory characterization by circular dichroism (CD) and differential scanning calorimetry (DSC) of DeNovoTIMs 0–10 showed that designs 1, 6, and 8 were the best for each region (Figure S3 and supporting text).
To test for additivity effects on stability and structure, mutations contained in the best design of each group were combined to generate the following double-region designs: DeNovoTIM11-13 as shown in Figure 1. Finally, in the third round the mutations of all three regions were combined resulting in DeNovoTIM14. All these proteins as well as sTIM11, sTIM11noCys, and DeNovoTIM0 were characterized in detail. Information on sequences, and mutations in each design are reported in the supporting information (Figure S2 and Tables S1–S4).
Folding thermodynamics of DeNovoTIMs
All variants presented the characteristic far-UV CD spectra observed for ⍺/β proteins (Figure 2(A) and Figure S4). The near-UV CD and intrinsic fluorescence (IF) spectra showed that the aromatic residues are buried from solvent and structured in the folded state (Figure 2(B)-(C) and Figures S5-S6; see supporting text for details). All DeNovoTIMs are monomeric and compact as revealed by analytical size exclusion chromatography (Table S5).33
Figure 2.
Conformational properties and equilibrium unfolding of DeNovoTIMs. (A) Far-UV CD spectra. (B) Near-UV CD spectra. (C) Intrinsic Fluorescence (IF) spectra (λexc = 295 nm). (D) Thermal unfolding followed by CD222nm (scan rate: 1.5 K min−1). (E) Differential Scanning Calorimetry (DSC) endotherms (scan rate: 1.5 K min−1; for easy comparison, the physical and chemical baselines have been subtracted). F) DSC endotherms of DeNovoTIM14 in the presence of increasing concentrations of urea (2.0 to 6.0 M) from bottom to top (scan rate: 1.5 K min−1). For clarity, in panels E and F only a small part of the pre- and post-transition baselines are shown. (G) Chemical unfolding using urea and followed by CD (notice that DeNovoTIM14 does not unfold with urea). (H) Chemical unfolding induced by guanidinium hydrochloride for DeNovoTIM14 (squares: CD, circles: IF).
Thermal unfolding was then studied by CD and DSC (Figure 2(D)–(E)). All DeNovoTIMs showed cooperative transitions with a remarkably broad range of Tm values, from 47 °C (DeNovoTIM0) to 109 °C (DeNovoTIM12) (Table 1); indeed at 90 °C many of the proteins still showed secondary and tertiary structure (Figure S4(B) and Figure S5(B)). All DeNovoTIMs, except 13 and 14, showed thermal unfolding reversibility (Figure S7) and were well fitted to the two-state model (N ⇋ U) (Figure S8 and Table 1). This is remarkable because the temperature-induced unfolding of natural proteins of this size, particularly TIM barrels, is usually not reversible.22, 30 DeNovoTIM14 showed two endotherms, suggesting the presence of an unfolding intermediate (Figure S8(I)). For DeNovoTIM 13 and 14, endotherms were well-fitted to an irreversible two-state mechanism (N → F) giving activation energies (Eact) of 120 and 37 kcal mol−1 (Table 1), respectively, resulting in very different kinetic stabilities (Figures S9–S10 and supporting text).
Table 1.
Thermodynamic properties of DeNovoTIMs.
| de novo TIM barrel |
Thermal unfolding (by DSC)a |
Chemical unfolding (by CD and IF)e |
|||||||
|---|---|---|---|---|---|---|---|---|---|
| Tm (°C) | ΔH (kcal mol−1) | ΔH85°C (kcal mol−1)b | ΔCP (kcal mol−1 K−1) | ΔHvH / ΔH | Globalthermodynamic stability (kcal K mol−1)c | ΔG25 °C (kcal mol−1) | m (kcal mol−1 M−1) | D[1/2] (M) | |
| sTIM11 | 80.0 ± 0.2 | 93 ± 1 | 104 ± 2 | 2.19 ± 0.09 | 0.99 ± 0.05 | 280 | 4.8 ± 0.3 | 1.34 ± 0.06 | 3.1 |
| sTIM11noCys | 65.6 ± 0.1 | 82 ± 1 | 128 ± 2 | 2.36 ± 0.08 | 0.99 ± 0.03 | 176 | 3.2 ± 0.2 | 2.03 ± 0.10 | 1.9 |
| DeNovoTIM0 | 47.0 ± 0.2 | 25 ± 1 | 41 ± 3 | 0.44 ± 0.06 | 1.09 ± 0.08 | 61 | 1.5 ± 0.1 | 0.76 ± 0.02 | 2.1 |
| DeNovoTIM1 | 71.0 ± 0.4 | 48 ± 1 | 58 ± 2 | 0.69 ± 0.05 | 0.98 ± 0.05 | 226 | 3.8 ± 0.1 | 1.87 ± 0.10 | 2.0 |
| DeNovoTIM6 | 92.3 ± 0.1 | 125 ± 2 | 108 ± 1 | 2.38 ± 0.06 | 1.03 ± 0.02 | 542 | 7.9 ± 0.2 | 1.51 ± 0.08 | 5.6 |
| DeNovoTIM8 | 77.3 ± 0.3 | 52 ± 1 | 61 ± 1 | 1.19 ± 0.07 | 0.95 ± 0.09 | 165 | 2.5 ± 0.2 | 0.85 ± 0.09 | 2.9 |
| DeNovoTIM11 | 103.5 ± 0.2 | 62 ± 2 | 49 ± 1 | 0.67 ± 0.14 | 1.02 ± 0.09 | 523 | 8.4 ± 0.4 | 1.75 ± 0.08 | 6.0 |
| DeNovoTIM12 | 108.8 ± 0.3 | 79 ± 2 | 62 ± 1 | 0.72 ± 0.08 | 1.01 ± 0.08 | 792 | 10.9 ± 0.2 | 1.77 ± 0.03 | 6.2 |
| DeNovoTIM13 | 92.8 ± 0.4 | 47 ± 5 | ND Eact: 120 ± 3 kcal mol−1d | ND | 9.5 ± 0.2 | 1.54 ± 0.03 | 6.6 | ||
| DeNovoTIM14 | 91.5 ± 0.1 | 5.4 ± 0.2 | ND Eact: 37 ± 1 kcal mol−1d | ND | Tot: 23.6 ± 2.0 N-I:12.7 ± 1.9 I-U:10.9 ± 0.6 |
N-I: 4.3 ± 0.9 I-U: 2.5 ± 0.1 |
N-I: 3.2 I-U: 4.4 |
||
| 105.3 ± 0.1 | 8.6 ± 0.4 | ||||||||
ND: Not determined due to irreversibility in the thermal unfolding. Instead, activation energy (Eact) was calculated from an irreversible two-state mechanism.
aThe thermodynamic parameters reported are the average of ten experiments carried out at different protein concentrations (0.25–2.5 mg mL−1; Figure S8 and Figure S11), ± indicate the standard deviation calculated from these 10 experiments.
bΔH at 85 °C (ΔH85°C) was calculated using the experimental ΔH and ΔCP values as indicated in material and methods.
cThe global thermodynamic stability was calculated from the area of the stability curve evaluated between 0 °C and Tm (Figure 3(B)).
dEact value and ± are the average and standard deviation, respectively, from three different calculation methods (Figure S9 and Figure S10).
e±indicates the standard error from global fitting (Figure S13).
For DeNovoTIMs with a reversible thermal unfolding, the observed unfolding ΔH and ΔCP also vary greatly (Table 1). For some DeNovoTIMs these values are smaller than expected for a protein of 184 residues (ΔH = 128 ± 4 kcal mol−1 and ΔCP = 2.6 ± 0.04 kcal mol−1 K−1, according to parametric equations reported in,34 while the ΔH values observed for the first and second design rounds (0.24 to 0.64 kcal mol−1 residue−1) are similar to those reported for natural monomeric TIM barrels (0.25 to 0.67 kcal mol−1 residue−1). Obtained ΔCP values were independent of protein concentration and showed a small standard deviation (Figure S8, Figure S11, and Table 1). A decrease in ΔCP has been shown to result from residual structure in the unfolded state.35 This is observed in the far-UV CD spectra of those DeNovoTIMs that are unfolded at 90 °C. In addition, the low ΔH of DeNovoTIM14 increases in the presence of urea (Figure 2(F) and Figure S10). These results suggest that for some DeNovoTIMs, the reason for the low ΔH and ΔCP is likely the high content of residual structure in the unfolded state (supporting text).
Stability at 25 °C was studied by chemical unfolding with urea or GdnHCl. Except for DeNovoTIM14, all designs were completely unfolded in 9.0 M urea (Figures S4–S6). For all designs, except for DeNovoTIM14, CD and IF curves were monophasic, cooperative, coincident, and globally-fitted well to a two-state N ⇋ U model, indicating the absence of populated intermediates (Figure 2(G) and Figures S12–S13). DeNovoTIM14 showed no changes in CD or IF signal up to 9.0 M of urea, even after incubation for 5 days (Figures S4–S6). In contrast, in the presence of GdnHCl DeNovoTIM14 showed a three-state unfolding process with a populated intermediate: N ⇋ I ⇋ U (Figure 2(H) and Figure S13).
All selected first- and second-round designs had a ΔG25 °C higher than DeNovoTIM0, whereas the triple-design, DeNovoTIM14, showed a pronounced increase in stability (ΔGTot = 23.6 kcal mol−1; Table 1). The stability change related to the loss of the native state in DeNovoTIM14 (ΔGN-I = 12.7 kcal mol−1), the so called “relevant stability”,36 is higher than the ΔG of the second-round designs; whereas the stability of the intermediate, also referred as the “residual stability”, is similar (ΔGI-U = 10.9 kcal mol−1). For all DeNovoTIMs, the m value, a parameter proportional to the surface area exposed to the solvent upon unfolding (ΔASA),37 is similar to those observed for natural proteins with the same size, except for DeNovoTIM 0 and 8 where m decreases significantly (Table 1). Low m values have been related to low packing in the native state, residual structure in the unfolded state, or the presence of folding intermediates.38 Residual structure in the unfolded state is not clearly observed in the CD spectra of DeNovoTIMs at 9.0 M urea (Figures S4–S5), therefore, other techniques and kinetic studies should help to detect the persistence of native-like structure in the unfolded state and/or the presence of intermediates.38, 39
The modular approach used in this work improved both ΔG and Tm substantially and hence produced significantly more stable proteins. In this context, it is worth mentioning that over the years the combination of stabilizing mutations has been considered an effective strategy to enhance the stability of small proteins.36, 40, 41, 42, 43 Previous work on small globular proteins with optimized hydrophobic cores and interactions on the surface exhibited increased thermal stability by up to 30 degrees.9, 11, 15 Extending these strategies from point mutants to regions appears to be useful for bigger folds such as the TIM barrel. In what follows, using the thermal and chemical unfolding data described above, the thermodynamic properties underlying the stability of DeNovoTIMs are analyzed.
Global thermodynamic stability and non-additive effects of DeNovoTIMs
As observed in natural proteins, the m values obtained from the chemical unfolding of sTIM11, sTIM11noCys, DeNovoTIM 0, 6, and 8 correlate with their ΔCP values (Figure 3(A)), likely because both depend on the ΔASA upon unfolding. In contrast, ΔCP values obtained for DeNovoTIM 1, 11, and 12 are much lower than expected (Figure 3(A)). According to the Rosetta models and the native state structures (see below), these differences are not exclusively due to properties of the native state since the calculated ΔASA is close to the expected value for the size of DeNovoTIMs (17135 A2).37 This suggests that the unfolded state reached at high temperatures is more structured than the one obtained by chemical unfolding.
Figure 3.
Stability and energetic coupling in DeNovoTIMs. (A) Correlation between two parameters which are proportional to the exposed surface area: m value from chemical unfolding and ΔCP from temperature-induced unfolding (solid line: linear regression excluding DeNovoTIM1, DeNovoTIM11, and DeNovoTIM12 data; R2: 0.76. Dotted line: correlation reported by 37). (B) Stability curves calculated from DSC data (lines) using the Gibbs-Helmholtz equation (open symbols show ΔG values determined by chemical unfolding at 25 °C. Grey dashed line indicates 25 °C). (C) Correlation between the relative global thermodynamic stability (Area/AreaDeNovoTIM0) and thermostability (Tm) (R2: 0.93). Inset: correlation between ΔG at 25 °C determined by chemical unfolding and Tm (R2: 0.87). For DeNovoTIM14, where two transitions were found, it was assumed that the one observed at lower [GdnHCl] corresponds to the lower Tm. (D) Thermodynamic cube showing the coupling energy (ΔΔGint) between different regions of DeNovoTIMs. ΔΔGint values were calculated from the double-mutant cycles shown in Figure S14. ΔΔGint values between single-region mutants are depicted as colored arrows from the top face to the bottom face. ΔΔGint values calculated for the addition of a single-region design to a double-region design are shown as colored arrows in the bottom face.
Thermal unfolding reversibility allowed the assessment of DeNovoTIM stability curves (Figure 3(B)). The ΔG25 °C values are in excellent agreement with those obtained from chemical unfolding experiments. According to the Gibbs-Helmholtz equation, conformational stability is modulated by changes in Tm, ΔH, and ΔCP. For natural TIM barrels, it has been observed that changes in the stability curve are influenced mainly by modifying one or two of those parameters.30, 31 In contrast, the DeNovoTIMs differ in all three parameters. Increasing ΔH is the most commonly found mechanism for stabilization of thermophilic proteins5 and is also the most often exploited mechanism for engineering protein stability.7, 40 In DeNovoTIMs, enthalpy-driven stabilization is found in all proteins but is especially important in DeNovoTIM6 (Figure 3(B)). ΔCP determines the magnitude of the curvature of the stability curve so that changes in this parameter trigger a more or less flattened curve. A decrease in ΔCP has been postulated as a mechanism for thermostabilization.35, 44 For DeNovoTIMs, the reduction in ΔCP combined with an increase in ΔH is the reason for the increase in both Tm and ΔG25 °C. The results presented here indicate that, as observed for natural proteins, the unfolded ensemble plays an important role in shaping the stability curve and should be considered in protein design.
DeNovoTIMs show a non-linear correlation between ΔG25 °C and Tm (inset in Figure 3(C)). A similar trend between ΔG at the temperature where it is a maximum (ΔGTmax) and Tm has also been reported for natural and engineered proteins with different sizes and topologies.34, 45, 46 Additionally; the global thermodynamic stability can be conveniently described by the area (from 0 °C to Tm) under the stability curve (A). Instead of using a single reference temperature, A integrates the conformational stability in a temperature range.47 The relative global stability of DeNovoTIMs (A/ADeNovoTIM0) is also correlated with Tm (Figure 3(C)). Notably, for DeNovoTIM 6, 11, and 12, A/ADeNovoTIM0 is nearly ten-fold higher than the initial design (Figure 3(C) and Table 1).
The modular design strategy allowed us to calculate the contribution of each region to global stability, and to evaluate the presence of non-additive effects between regions of the barrel. Non-additive effects were evaluated as ΔΔGint through an approach based on thermodynamic double mutant cycles (see material and methods). ΔΔGint is also referred to as coupling energy, non-additive effect, interaction energy, and more recently epistatic effect.48
Thermodynamic cycles showed that the stabilization is non-additive and depends on the structural context (Figure S14). All the ΔΔGint values calculated in Figure S14 are summarized in the single cube shown in Figure 3(D). ΔΔGint for double designs are much smaller than those involving the triple-region design. The regions that are most energetically coupled in double-region designs (ΔΔGint = 6.1 kcal mol−1) are the internal core (DeNovoTIM1) and the top core (DeNovoTIM8). Coupling increases considerably when a third region is incorporated on top of two already mutated regions (ΔΔGint > 6 kcal mol−1). The largest ΔΔGint was observed when the DeNovoTIM8 mutations were added to DeNovoTIM11 (ΔΔGint = 14.2 kcal mol−1) (Figure 3(D) and Figure S14). Clearly, mutations in one place affected other regions of the barrel. The latter indicates that the TIM-barrel fold is suitable for studying modularity and, in general, cooperative effects of proteins. Also, the results presented here suggest that the modular design strategy could be used in the future for the rational stability improvement in other protein topologies.
Structural features of DeNovoTIMs
The structural properties of DeNovoTIMs were examined by X-ray crystallography (Table S6). High-resolution data were collected for sTIM11noCys and DeNovoTIM13, whereas a low-resolution structure was obtained for DeNovoTIM6. All showed the designed globular compact TIM-barrel topology (Figure 4). Structural comparison with the Rosetta models showed the lowest RMSD located in the second quarter of the barrel. As previously observed in sTIM11,29 the main structural differences are found in the α-helices located at the N- and C-terminal ends. In agreement, for all the barrel structures, the RMSD among quarters of the barrel is higher in the first and fourth ones (plot in Figure 4(A)). Since the TIM barrel is a closed-repeat protein, contacts between the first and last helices depend on the precise curvature generated by each ⍺/β unit, therefore geometrical strain may interfere with the proper closure of the barrel.
Figure 4.
Three-dimensional structures of DeNovoTIMs. (A) Structural alignment of X-ray structures of sTIM11 (PDB ID: 5BVL), sTIM11noCys (PDB ID: 6YQY), DeNovoTIM6 (PDB ID: 6Z2I), and DeNovoTIM13 (PDB ID: 6YQX). The RMSD Cα between the structure and the Rosetta model among the quarters in each protein is shown in the lower part of the panel. (B) Comparison of sTIM11noCys and sTIM11 structures (RMSD: 1.07 Å −174 Cα-). The mutated residues 8 and 181 in sTIM11noCys are zoomed in the bottom part. (C) Comparison of the DeNovoTIM6 structure with the Rosetta model (RMSD: 2.28 Å −168 Cα-). The quarters with the highest and lowest structural similarity are highlighted (bottom left and bottom right, respectively). (D) Comparison of the DeNovoTIM13 structure with the Rosetta model (RMSD: 1.43 Å −181 Cα-). The quarters with the highest and lowest structural similarity are highlighted (right and left, respectively). Sidechains of the mutated residues are shown in sticks.
A comparison of the sTIM11noCys and sTIM11 structures showed that removal of the two cysteines causes some structural changes mainly localized in the first and last quarters; the most significant deviations are observed at the N-terminal region where the first two helices are not well-formed. So even without forming the disulfide bridge, both cysteines in sTIM11 increase the stability and promote a proper closure of the barrel (Figure 4(B) and Table 1); nevertheless, sTIM11noCys maintains the general expected TIM-barrel architecture.
The thermodynamic properties of DeNovoTIM6 are very similar to those expected for a natural protein (Table 1). Unfortunately, due to the low quality of the crystals and therefore the low resolution obtained (2.9 Å), details such as side-chain conformations are not well resolved in its structure. Nevertheless, it could be verified that the protein is a well folded TIM-barrel (Figure 4(C)). In DeNovoTIM6, almost all α/β loops of the barrel are well defined and correspond to the model. However, for some residues within 5 of the 7 β/α loops no electron density was observed. In general, the DeNovoTIM6 structure has high B factors which may reflect higher disorder in the protein crystal or increased flexibility, similar to observations in some regions of sTIM11, namely the amino- and carboxyl-terminal α-helices.
In the DeNovoTIM13 structure, the second and third quarters display only minor differences to the Rosetta model, with the secondary structure elements and side chains superposing very well (Figure 4(D)). In going from sTIM11 to DeNovoTIM13, a 60 % increase in the total area of hydrophobic clusters was found (3765 vs. 6148 Å2); most of this change comes from a three-fold increase in the area of the major hydrophobic cluster (1116 vs. 4351 Å2, Table S7).
As a consequence of the DeNovoTIMs design protocol, polar interactions were replaced by hydrophobic ones, therefore, it is not surprising that the number of H-bonds and salt bridges is lower in DeNovoTIMs than in sTIM11 and sTIM11noCys (Table S7). Some of the designs that contained the highest number of polar stabilizing interactions (such as DeNovoTIM 1 and 8) were not the most stable ones, whereas some of the most stable designs (such as DeNovoTIM 6 and 12–14) showed a reduction in this type of interaction (Table S7). In agreement with the design strategy, the stability of DeNovoTIMs increases with the number of hydrophobic interactions. The total area, as well as the number of residues and contacts in hydrophobic clusters, are substantially increased in the best first-round designs along with the more stable second- and third-round designs (Figure S15 and Table S7). This suggests that the strategy of increasing hydrophobic contacts was successful in the stabilization of DeNovoTIMs.
Epistasis on the stability landscape of de novo TIM barrels
To correlate the most common and informative parameters obtained from both temperature and chemical unfolding, Tm, ΔH, and ΔG were plotted in a two-dimensional bubble plot thereby representing the thermodynamic combinations of the designed proteins (Figure 5). The Tm values found in DeNovoTIMs are widely distributed ranging from 47 °C to 109 °C, corresponding to a 62 °C increase in thermostability, a range higher than those previously reported for engineered proteins, but in the observed range found in natural proteins (Figure 5). The stability measured from chemical-unfolding experiments (ΔG25°C) also shows the variety found in natural TIM barrels and other natural folds (Figure 5(B) and (C)). Natural proteins populate some thermodynamic regions more than others, exploring ample space due to the diversity in size, topology, oligomeric state, function, and evolutionary history. Interestingly, Figure 5 shows that several DeNovoTIMs are located in a region of the plot corresponding to low ΔH and high ΔG25°C values, which has not been reported for natural proteins.
Figure 5.
Thermodynamic 2D bubble plots of de novo TIM barrels in comparison with natural proteins. (A) 2D plot of ΔH85°C versus Tm of all DeNovoTIMs. Colored arrows indicate the design flow. (B) 2D plot of ΔH versus Tm of natural TIM barrels (in blue) and DeNovoTIMs (in red). (C) 2D plot as shown in B including data for other natural folded proteins (open circles). The diameter in the bubbles correspond to the ΔG25°C magnitude. Tm and ΔH data were obtained from thermal unfolding, whereas ΔG values derive from chemical unfolding. Data for all non-redundant proteins presented here were obtained from ProThermDB database.56, 57
Assuming additivity, the expected change in stability calculated for DeNovoTIM14 would be the sum of the individual stabilizations provided by all the single-region designs giving a value of 11.2 kcal mol−1. However, the stability of DeNovoTIM14 is 23.6 kcal mol−1, indicating that more than half of the stabilization comes from positive non-additive effects. Non-additive effects or interaction energies may be referred to as epistasis, a concept traditionally used in genetics to describe the phenotype dependency of a mutation on the genetic state at other sites.48, 49, 50 Previous studies have explored and analyzed the mechanisms of epistasis within proteins, especially regarding their implications for protein function, evolution, and stability.51, 52, 53, 54, 55
Rearrangements in the TIM barrel can influence local changes in other parts of the protein, and these epistatic effects are quantified in the ΔΔGint values whose magnitude for DeNovoTIMs is considerable. The structural analyses suggest that the epistatic effect observed in DeNovoTIMs is likely related to the extension of the hydrophobic cores, particularly to the increase of the major hydrophobic cluster located in the interface between the inner β-barrel and the outer ⍺-helices (Figure S15 and Table S7). From the first- to the second-round designs, the highest area in hydrophobic clusters was found for DeNovoTIM12, and this corresponds to the highest positive epistatic effect in this round (ΔΔGint = 6.1 kcal mol−1), whereas the decrease of the hydrophobic cluster area in DeNovoTIM11 (compared to DeNovoTIM1 and DeNovoTIM6) correlates with a negative ΔΔGint = −1.8 kcal mol−1. From the second- to the third-round designs, the most notable change in hydrophobic area is observed in going from DeNovoTIM11 to DeNovoTIM14, resulting in the highest positive epistatic effect (ΔΔGint = 14.2 kcal mol−1). The relevance and magnitude of the epistatic or non-additive effects found in DeNovoTIMs, as well as those observed in other reports, suggest that modeling such interactions can improve the success in protein design and engineering.
Conclusions
Design requires a deep understanding of the relationship between sequence, structure, and stability, and therefore, the combination of thermodynamic and structural data is fundamental to achieve this goal. Here, we designed a family of stable TIM barrels and comprehensively explored their thermodynamic and structural properties. The TIM-barrel collection reported in this work exhibits a considerable range in thermostability (more than 60 degrees in Tm) and conformational stability at 25 °C (more than 22 kcal mol−1 in ΔG). These data can now be used to accelerate the development of future custom design protein stability curves which, in turn, will expand the biomedical and biotechnological applications of de novo proteins. For example, by fusion to another de novo protein, one of the stabilized scaffolds reported here (DeNovoTIM13) has been successfully used to create a reaction chamber on the top of the barrel,58 confirming the convenience of working with robust and stable TIM barrels in the path towards functional de novo proteins.
In the same way that one explores the sequence space by studying homologous proteins from different organisms, de novo design with a fixed backbone follows a similar strategy generating new sequences within the same topology. It is well known that highly stable proteins can be generated by computational design. However, one of the unexpected findings resulting from the thermodynamic characterization of this family of DeNovoTIMs is that very stable proteins presenting unexplored combinations of thermodynamic parameters can be designed. The stability of DeNovoTIMs is severely influenced by epistatic effects that appear to arise from the design strategy involving an increase in hydrophobic clusters. The design and characterization of stable de novo proteins, such as those described in this work, is an essential step on the route to the next generation of new protein functions charting novel sequence space.
Material and methods
Enzymes and chemicals
All reagents were of analytical grade from Merck KGaA®. Genes were ordered from GenScript Biotech. Water was distilled and deionized.
Design protocol
De novo TIM barrels were designed using the Rosetta software suite v.3.259, 60 (https://www.rosettacommons.org/). All DeNovoTIMs were designed using DeNovoTIM0 as a template. The script used for the DeNovoTIM collection follows and executes the steps indicated by the algorithm as indicated in Figure S1. In general, the algorithm first selects the symmetry with which it designs the proteins. Once the two-fold or four-fold symmetry was chosen, it selects the number of residues that mutate (depending on whether it is a quarter or half of the protein). Then, an energy minimization step is performed by simulated annealing considering and evaluating the packing and RMSD. Subsequently, it performs a Monte Carlo (MC) fast layer design to improve the packing of the protein’s hydrophobic cores (in one or several cavities selected according to the regions described in Figure 1), minimizes the constraints of the main and side chains, and compares each of them with the starting design. For each step, it verifies the RMSD value between both proteins (in the case of DeNovoTIMs, the design was done with a fixed backbone and a cut-off pointed out < 0.7 Å). Then, the algorithm filters the results to keep those designs that, with the suggested mutations, were able to increase the packing and preserve the reference topology (ScoreRes: ≤ −1.9, Talaris: ≤ −3.5 BetaNov, Sspred: ≥ 0.85, Packstat: ≥ 0.65). To evaluate if the suggested protein folds as expected, selected designs were computationally validated by a later step of forward folding to predict the ab initio three-dimensional structure. The selection was done by an energy score, choosing the designs with the lowest energy value and smallest possible RMSD (located at the bottom left when the energy score against RMSD is plotted). In all selected DeNovoTIMs, funnel plots were observed. Finally, the designs were analyzed and the candidates for experimental characterization were selected based on energy criteria, a fewer number of mutations, and physicochemical properties of the suggested mutations.
Cloning, overexpression, and protein purification
The nucleotide sequence of all DeNovoTIMs was optimized for expression in Escherichia coli. The coding genes were synthesized and cloned into the pET29b(+) vector by GeneScript (New Jersey, USA), except sTIM11noCys, which was cloned into pET21b(+). Proteins were overexpressed in E. coli strain BL21(DE3) (Invitrogen®) in 1 L of Terrific Broth (TB) medium supplemented with 30 µg mL−1 kanamycin or 100 µg mL−1 ampicillin, inoculated with 5 mL preculture and incubated at 37 °C and 200 rpm. After an OD600 of 0.6–0.8 was reached, overexpression was induced by adding 1 mM isopropyl-D-1-thiogalactopyranoside (IPTG); growth was continued for 16 h at 30 °C. After incubation, cells were harvested by centrifugation (Thermo/SLA-3000®, 15 min, 8000 rpm, 4 °C), pellets resuspended in buffer A: 35 mM sodium phosphate, 300 mM NaCl and 35 mM Imidazole pH 8 (supplemented with 0.2 mM of protease inhibitor Phenylmethylsulfonyl fluoride), lysed by sonication (Cole Parmer Ultrasonic Processor®, 10 cycles in 45 s intervals, 30% pulse, 4 °C), and centrifuged again (Sorvall/SS-34®, 40 min, 16000 rpm, 4 °C). In some cases, to increase the efficiency of lysis, the resuspended cells were incubated with lysozyme (250 μg mL−1) at 37 °C for 1 h before sonication. The purification was performed loading the supernatant onto a HisTrap HP column (5 mL; GE Healthcare Life Sciences®) coupled to an ÄKTA system (GE Healthcare Life Sciences®). The unbound fraction was washed out with 20 column volumes (CV) of buffer A. Bound protein was eluted with a linear gradient of 35–500 mM Imidazole using buffer B: 35 mM sodium phosphate, 300 mM NaCl and 500 mM Imidazole pH 8. The pooled fractions were loaded onto a HiLoad 16/600 Superdex 75 preparative grade column (GE Healthcare Life Sciences®). The proteins were purified using isocratic elution with 1.5 CV of buffer C: 150 mM NaCl, 35 mM sodium phosphate pH 8. The fractions corresponding to the monomeric population were pooled and stored at 4 °C for use in subsequent experiments. It should be noted that at the end of protein purification, all designs contain a polyhistidine-tag in the carboxyl-terminal region. For DeNovoTIM11 and DeNovoTIM14, the following purification variables were modified to increase the yield: 0.1 mM of IPTG for induction at OD600 of 0.2–0.3, 30 °C and 6 h for overexpression, buffer A and B containing 1 M NaCl, and buffer C with 300 mM NaCl. At each step of the purification process, aliquots were taken to quantify the amount of protein and to calculate the corresponding purification tables. The final yields are indicated in Table S5.
Far- and near-UV circular dichroism
Circular Dichroism (CD) spectra were collected in buffer D: 10 mM sodium phosphate pH 8 in a Chirascan Spectropolarimeter using a Peltier device to control the temperature (Applied Photophysics®). For Far-UV spectra, 0.4 mg mL−1 of DeNovoTIM was used for all measurements (1 nm bandwidth, 185–260 nm wavelength range, 1 mm cuvette). For Near-UV spectra, 1 mg mL−1 of DeNovoTIM was used for all measurements (1 nm bandwidth, 250–350 nm wavelength range, 10 mm cuvette). The spectra for thermally-unfolded states were collected at 90 °C. Spectra for chemically-unfolded states were collected at 9 M urea for all DeNovoTIMs, except for DeNovoTIM14, which was collected at 7 M GdnHCl. Raw data were converted to mean residue molar ellipticity ([θ]) using: [θ] = θ/(l C Nr), where θ is ellipticity collected in millidegrees, l is the cell path length in mm, C is the DeNovoTIM molar concentration, and Nr the number of residues per protein. Far-UV spectra were deconvoluted with CDNN.61
Intrinsic fluorescence
Intrinsic Fluorescence (IF) spectra were collected on a PC1 ISS Spectrofluorometer (Champaign IL-USA®) equipped with a Peltier device controlling the temperature. In all measurements, protein concentration was 0.4 mg mL−1 in buffer D: 10 mM sodium phosphate pH 8 (1 nm bandwidth slits, 295 nm excitation wavelength, 310–450 nm emission wavelength range). Spectra for chemically-unfolded states were collected at 9 M urea for all DeNovoTIMs, except for DeNovoTIM14, which was collected at 7 M GdnHCl. Fluorescence spectral center of mass (SCM) was calculated from intensity data (Iλ) obtained at different wavelengths (λ): SCM= ∑λIλ/∑Iλ.
Three-dimensional structure determination
DeNovoTIMs were concentrated with Amicon Ultra centrifugal filter units (Millipore®) and dialyzed in buffer C: 10 mM sodium phosphate pH 8, 150 mM NaCl. Sitting-drop vapor-diffusion method, and JCSG Core I-IV, JCSG +, Classics I-II, PACT, PEGs I-II, and AmSO4 screening suites (Qiagen®) were used to screen crystallization conditions in 96 well Intelli plates (Art Robbins Instruments®) stored at 20 °C in the hotel-based Rock Imager RI 182 (Formulatrix®). 0.8 µL drops were prepared in a 1:1 ratio with mother liquid using a nanodispensing crystallization robot Phoenix (Art Robbins Instruments®) and then optimized by multiple crystallization rounds using a sitting-drop vapor-diffusion method. To improve the diffraction quality of DeNovoTIM crystals, different pre- and post-crystallization methods were used: reductive methylation (JBS Methylation Kit, Jena Biosciences®), seeding, additive screening, controlled dehydration, cryoprotection screening, crystal annealing, and room-temperature diffraction. In total, more than 300 different crystals in various conditions were tested.
Suitable crystals for X-ray diffraction were found in the following conditions: sTIM11noCys: 0.2 M Ammonium Sulfate, 0.1 M Trisodium Citrate pH 5.6, 25% w/v Polyethylene glycol (PEG) 4000, with a protein concentration of 15 mg mL−1; DeNovoTIM6: 0.095 M Sodium Citrate pH 5.0, 19% v/v Isopropanol, 25% w/v PEG 4000, 5% v/v Glycerol, with a protein concentration of 8.6 mg mL−1; DeNovoTIM13: 0.17 M Sodium Acetate, 0.085 M Tris pH 8.5, 25.5% w/v PEG 4000, 15% v/v Glycerol, with a protein concentration of 10 mg mL−1.
For sTIM11noCys and DeNovoTIM13, diffraction data were collected at 100 K at the Swiss Light Source at the Paul Scherrer Institute in Villigen (Switzerland) (X10SA-PXII beamline for sTIM11noCys and X06DA-PXIII beamline for DeNovoTIM13) using a wavelength of 1 Å and a PILATUS 6M detector for sTIM11noCys and a PILATUS 2M-F detector for DeNovoTIM13.62, 63 For DeNovoTIM6, diffraction data were collected at 100 K at the Berlin Electron Storage Ring Society for Synchrotron Radiation beamline 14.2 (BESSY BL14.2) operated by Helmholtz‐Zentrum Berlin using a wavelength of 0.91 Å and a PILATUS3S 2 M detector.64
Diffraction data were processed with the X-ray Detector Software (XDS) using XDSAPP v.2.065, 66 for sTIM11noCys and DeNovoTIM13; and DIALS67 for DeNovoTIM6. For data reduction; criteria used to cut off the data were the resolution shell with a meanI/sigma(I) between 1–2 and the best CC1/2 according to redundancy and completeness. The structures were solved by molecular replacement with PHASER in the PHENIX software suite v.1.1768 using sTIM11 (PDB ID: 5BVL) as a starting model for sTIM11noCys and the own Rosetta model for DeNovoTIM6 and DeNovoTIM13. Refinement was done with phenix.refine.68 The model was improved by map inspection and iteratively manual rebuilding performed in COOT v.0.9.69 The final coordinates were validated with PDB_REDO,70 MolProbity v.4.2,71 and the Protein Data Bank validation service72; in all servers; the 3D-structure satisfied all quality criteria. The coordinates and structure factors were deposited in the PDB with accession codes: 6YQY (sTIM11noCys), 6Z2I (DeNovoTIM6), and 6YQX (DeNovoTIM13). The figures were created using PyMOL Molecular Graphics System v.4.5.0 (Schrodinger, LLC).
Analytical size exclusion chromatography
Hydrodynamic measurements were performed on a Superdex 75 10/300 GL analytical column coupled to an ÄKTA System (GE Healthcare Life Sciences®). All experiments were performed in buffer C: 10 mM sodium phosphate pH 8, 150 mM NaCl at 25 °C and a protein concentration range from 0.01 to 2.0 mg mL−1. Experimental molecular weight, Stokes-radii, and oligomeric state were calculated from elution volumes and a calibration curve derived from 7 different known proteins.
Thermal unfolding followed by circular dichroism
Temperature-induced unfolding was monitored by CD at 222 nm as a function of temperature using 0.4 mg mL−1 in buffer D: 10 mM sodium phosphate pH 8, a heating rate of 1.0 and 1.5 K min−1, and a 1 mm path-length cell. The changes in the CD signal were normalized to the fraction of unfolded molecules (fU) by:
| (1) |
where yobs is the experimentally observed CD signal at a given temperature, and (yN + mNT) and (yU + mUT) are the linear fitting equations corresponding to the native and unfolded regions, respectively. Tm values were estimated from normalized data fitted with a Boltzmann-type function:
| (2) |
where a is related to the slope of the transition.
Thermal unfolding followed by differential scanning calorimetry
Differential Scanning Calorimetry (DSC) scans were carried out in a VP-Capillary DSC system (MicroCal®, Malvern Panalytical). Samples were prepared by exhaustive dialysis in buffer D: 10 mM sodium phosphate pH 8 and then degassed at room temperature. To ascertain proper instrument equilibration, two buffer–buffer scans were performed before each protein-buffer scan (Figure S7). Corresponding buffer–buffer traces were subtracted from each endotherm. For all proteins a reheating scan was performed to determine the reversibility or irreversibility of the process (Figure S7). Reversibility percentage was calculated by comparing the calorimetric ΔH (area under the curve) recovered in the second scan and that obtained in the first one (ΔHsecondscan/ΔHfirstscan)*100. To verify that irreversibility was not the result of a too high final scanning temperature, the first scans were also performed heating near the Tm. For DeNovoTIMs with a reversible thermal unfolding, protein concentration varied from 0.25 to 2.5 mg mL−1 and scan rates from 1 to 3 K min−1, except for DeNovoTIM0 where protein concentration varied from 1 to 5 mg mL−1. For DeNovoTIMs with an irreversible thermal unfolding, protein concentration was 1 mg mL−1 and scan rates from 1 to 3 K min−1. For DeNovoTIM14 in native conditions, protein concentration was increased to 2.5 and 4.5 mg mL−1 to accurately determine the transition. For DeNovoTIM14 in the presence of urea, all the scans were done at 1 mg mL−1 from 2.0 to 6.0 M urea with samples incubated for 6 h at 10 °C. Origin v.9.0 (OriginLab Corporation, Northampton, MA, USA.) with MicroCal software was used for data analysis.
Thermodynamic parameters from reversible DSC transitions
DSC endotherms were fitted to equilibrium two-state model (N ⇋ U):
| (3) |
where B0 and B1 define the slope and intercept of the low-temperature baseline segment, n is the number of subunits in the native protein (1 for all DeNovoTIMs) and f(T) is the protein fraction in the folded monomeric state, producing ΔH (at Tm), ΔCP, and Tm. The thermodynamic parameters reported are the average of ten experiments carried out in the 0.25 to 2.5 mg mL−1 range. The van’t Hoff enthalpy (ΔHvH) was evaluated by73:
| (4) |
where R is the universal gas constant, Tm is the temperature at which CP is maximal, CP,Tm is the heat capacity value at Tm, and ΔH is the total calorimetric enthalpy of the endotherm.
Thermodynamic parameters from irreversible DSC transitions
Calorimetric transitions were adequately described by the two-state irreversible model (N → F) where N is the native protein and F is the final state.74, 75 The kinetic conversion from N to F is described by a first-order rate constant (k) changing with temperature according to the Arrhenius equation:
| (5) |
where T″ is the temperature at which the k = 1 min−1 and Eact is the activation energy between the native and the transition states that describes the unfolding process. The apparent heat capacity is given by:
| (6) |
where T is the temperature and ΔH is the unfolding enthalpy. The Eact was also obtained following these two procedures: from the slope of Arrhenius plots, i.e. ln k vs. 1/T; and derived from a data consistency test, evaluating the effect of scanning rate (ν) on Tm.76
Chemical-induced unfolding
All experiments were carried out at a protein concentration of 0.1 mg mL−1 in buffer D: 10 mM sodium phosphate pH 8 at 25 °C. To determine whether urea induced unfolding was reversible, unfolding and refolding experiments were assayed. For unfolding experiments, native DeNovoTIM was the initial state, whereas for refolding, the starting state was the unfolded DeNovoTIM incubated overnight in 9.0 M urea. Thereafter samples were incubated at different concentrations of urea (0–9.0 M), either increasing or decreasing the initial concentration (for unfolding and refolding experiments, respectively). Intrinsic fluorescence of both, unfolding and refolding samples, was measured at different times to determine the equilibrium time. Unfolding and refolding transitions are coincident and the signal does not change after incubation for 12 h, i.e. chemical unfolding is reversible and at equilibrium under the experimental conditions. Once the equilibrium time was found, unfolding experiments with samples incubated for 12 h and followed by CD and IF were performed as aforementioned. IF data at fixed emission wavelength and CD data at 222 nm were both collected over 2 minutes at each urea concentration. The changes in IF and CD were normalized to the fraction of unfolded molecules (fU) by:
| (7) |
where yobs is the experimentally observed IF and CD signal at a given temperature, and (yN + mN[urea]) and (yU + mU[urea]) are the linear fitting equations corresponding to the native and unfolded regions, respectively. All two-state transitions were fitted to Santoro and Bolen equation77 which assumes a two-state model (N ⇌ D):
| (8) |
where ΔGH2O is the unfolding free energy in absence of denaturant, m is ΔG/[urea], T is the temperature of the experiment (25 °C), and (yN + mN [urea]) and (yU + mU [urea]) are the linear fitting equations for the pre- and post-transition states. The chemical unfolding transitions for DeNovoTIM14 in GdnHCl were fitted to a three-state model with an intermediate:
| (9) |
where K1=, K2=, ΔGNtoI and ΔGItoU is the unfolding free energy from native state to intermediate and from intermediate to unfolded state, mNtoI and mItoU is ΔG/[GdnHCl] of each step, T is the temperature of the experiment (25 °C), and (yN + mN[GdnHCl]), (yI + mI[GdnHCl]), and (yU + mU[GdnHCl]) are the linear fitting equations for native, intermediate, and unfolded states, respectively. Similar ΔG values were obtained when experimental protein concentration was increased five fold, ruling out the possibility of a bimolecular association/folding step.
Stability curve and global thermodynamic stability
Global stability curves, ΔG(T), were calculated using the thermodynamic parameters obtained from DSC experiments and the Gibbs-Helmholtz equation78:
| (10) |
The area under the stability curve is a measure of the global stability of the protein.47 It was calculated integrating Eq. (10) from the lowest temperature at which the protein is in the liquid state i.e. 0 °C (273.15 K) to Tm:
| (11) |
Thermodynamic 2D bubble plots
The 2D bubble plots were constructed by plotting Tm and ΔH85°C obtained from thermal unfolding experiments, and ΔG25°C obtained from chemical unfolding data. Since DeNovoTIMs have different Tm values, experimental ΔH from DSC experiments cannot be directly compared. To put the thermodynamic parameters on a similar ground for comparison in Figure 5(A), ΔH at 85 °C (ΔH85°C), which is the average Tm of the DeNovoTIM collection, was calculated using ΔH and ΔCP from DSC experiments as follows79:
| (12) |
where Tm, ΔH and ΔCP are the experimental values obtained from DSC experiments for each protein, and 85 °C is the reference temperature. ΔH85°C was not calculated for DeNovoTIM13 and DeNovoTIM14, because their irreversible thermal unfolding hampered the determination of ΔCP. The plots were constructed with Origin v.9.0 (OriginLab Corporation, Northampton, MA, USA.) and the diameter in the bubbles correspond to the ΔG25°C magnitude.
Thermodynamic double-mutant cycles
To calculate non-additive effects between different DeNovoTIM barrel regions, an approximation based on double mutant cycles was used.48, 80, 81 The thermodynamic cycles were constructed using the experimental ΔG25°C values obtained from chemical unfolding experiments and linking single-region/double-region designs and double-region/triple-region designs as indicated in Figure S14.
Each corner of the square represents a different DeNovoTIM where the mutations are located in a specific region of the barrel or in a combination of them. For double-region cycles (upper panel), from the first to the second design round, ΔG1 and ΔG2 are the changes in stability produced when a single region of the barrel was mutated, ΔG3 and ΔG4 are the changes in stability generated when the same mutations are evaluated in the background of another first-round design. In the triple-region cycles (lower panel), from the second to the third design round, ΔG1 and ΔG3 are the changes in stability produced when the mutations of a single region are introduced in the background of DeNovoTIM0 or in a double-region design, whereas ΔG2 and ΔG4 are the changes in stability generated when a double-region design was incorporated in the background of DeNovoTIM0 or in a single region design, respectively.
Considering that ΔG is a state property, if two regions of the barrel are energetically independent, their effects will be additive and not coupled. Therefore, stability changes linked to a particular region will result in the same values on parallel sides of the square, i.e., ΔG1 = ΔG3 and ΔG2 = ΔG4. Any difference the values on the parallel sides of the squares indicates a deviation from additivity and measures the coupling energy between different regions of the barrel, given by ΔΔGint = ΔG4-ΔG2 = ΔG3-ΔG1, where ΔΔGint values have been referred as coupling energy, non-additive effects, interaction energies, and more recently epistatic effects.48 A positive ΔGint indicates that the introduction of favorable interactions has a higher stabilizing effect when a nearby region is already mutated.
Sequence and structural analysis
Sequence alignment was performed with MAFFT v.7.45082 using the secondary structure information from the sTIM11 structure (PDB ID: 5BVL). Sequence identity was calculated with the SIAS server (Universidad Complutense de Madrid, 2013). Structural alignments and RMSD calculations were performed using PyMOL Molecular Graphics System v.4.5.0 (Schrodinger, LLC). Cavity volumes were calculated with MOLE v.2.583 using a standard probe radius of 5 Å and an interior threshold of 1.1 Å with a non-directed exploration path. The accessible surface area (ASA) was calculated with VADAR v.1.8.84 In these analyses; changes in ASA for the unfolded state were calculated with an extended Gly-X-Gly peptide. Hydrogen bonds, as well as salt bridges, were calculated using HBPLUS v.3.0685 and ESBRI86 with default parameters for distances and angles. A salt bridge was assigned when two atoms of opposite charge were observed within 4 Å. Hydrophobic clusters (formed by ILV residues) were calculated following an algorithm previously reported by Sobolev87 and available in the ProteinTools toolkit developed by Dr. Noelia Ferruz-Capapey from the Höcker Lab.88 The analysis considers ILE; VAL; and LEU residues and then recaptures the coordinates of their neighboring atoms. Then, the buried solvent-accessible hydrophobic surface area is calculated, and the cluster’s total area is computed by the sum of the individual residue areas that comprise it.
Accession numbers
Coordinates and structure files have been deposited to the Protein Data Bank (PDB) with accession numbers: 6YQY (sTIM11noCys), 6Z2I (DeNovoTIM6), and 6YQX (DeNovoTIM13).
Data and materials availability
All data to support the conclusions of this manuscript are included in the main text and supporting information.
CRediT authorship contribution statement
Sergio Romero-Romero: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Data curation, Writing - original draft, Writing - review & editing, Visualization, Supervision. Miguel Costas: Conceptualization, Formal analysis, Investigation, Resources, Writing - original draft, Writing - review & editing, Funding acquisition. Daniel-Adriano Silva Manzano: Software, Investigation, Writing - review & editing. Sina Kordes: Methodology, Investigation, Writing - review & editing. Erendira Rojas-Ortega: Methodology, Investigation, Writing - review & editing. Cinthya Tapia: Methodology, Investigation, Writing - review & editing. Yasel Guerra: Methodology, Investigation, Writing - review & editing. Sooruban Shanmugaratnam: Methodology, Investigation, Writing - review & editing. Adela Rodríguez-Romero: Methodology, Investigation, Resources, Writing - review & editing, Supervision, Funding acquisition. David Baker: Conceptualization, Resources, Writing - review & editing, Supervision, Project administration, Funding acquisition. Birte Höcker: Conceptualization, Resources, Writing - original draft, Writing - review & editing, Supervision, Project administration, Funding acquisition. D. Alejandro Fernández-Velasco: Conceptualization, Formal analysis, Investigation, Resources, Writing - original draft, Writing - review & editing, Supervision, Project administration, Funding acquisition.
Acknowledgments
Acknowledgments
We acknowledge financial support and allocation of beamtime by PSI and HZB. We thank the beamline staff at the SLS and at BESSY for assistance, and LANEM-IQ-UNAM for the support in crystal characterization. We thank María Isabel Velázquez López, Laura Iliana Alvarez Añorve, Alma Jessica Díaz Salazar, and Georgina Espinosa Pérez for their competent technical support, Gregor Wiese for generating and crystallizing sTIM11noCys, Noelia Ferruz-Capapey for her help in the structural analyses, as well as Po-Ssu Huang for his comments on the manuscript. We kindly thank all the members of the Fernandez-Velasco, Höcker, and Baker Labs for their constructive suggestions to improve the research.
Funding
This work was supported by scholarships from CONACYT (749489 to C.T., 387653, 291062, 14401, and 27897 to S.R.R), UNAM-DGAPA-PAPIIT (IN220516 to S.R.R.), UNAM-DGAPA (postdoctoral fellowship to Y.G.), CONACYT and UNAM-DGAPA (sabbatical stay fellowship to D.A.F.V.), and Foundations Alexander von Humboldt and Bayer Science & Education (Humboldt-Bayer Research Fellowship for Postdoctoral Researchers to S.R.R.). This research was also financed by grants from CONACYT (221169 to A.R.R., 254514 to D.A.F.V.), UNAM-DGAPA-PAPIIT (IN220519 to M.C., IN208418 to A.R.R., IN219519 and IN220516 to D.A.F.V.), and Programa de Apoyo a la Investigación y el Posgrado FQ-UNAM (5000-9018 to M.C.). B.H. gratefully acknowledges financial support by the European Research Council (ERC Consolidator Grant 647548 ‘Protein Lego’) and by HZB to visit the beamlines at BESSY.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Edited by Amy Keating
Footnotes
This article contains supporting information that includes: Supporting text, Supporting Figures S1 to S15, and Supporting Tables S1 to S7. Supplementary data to this article can be found online at https://doi.org/10.1016/j.jmb.2021.167153.
Contributor Information
David Baker, Email: dabaker@uw.edu.
Birte Höcker, Email: birte.hoecker@uni-bayreuth.de.
D. Alejandro Fernández-Velasco, Email: fdaniel@unam.mx.
Appendix A. Supplementary material
The following are the Supplementary data to this article:
References
- 1.Tokuriki N., Tawfik D.S. Stability effects of mutations and protein evolvability. Curr. Opin. Struct. Biol. 2009;19(5):596–604. doi: 10.1016/j.sbi.2009.08.003. [DOI] [PubMed] [Google Scholar]
- 2.Bloom J.D., Labthavikul S.T., Otey C.R., Arnold F.H. Protein stability promotes evolvability. Proc. Natl. Acad. Sci. U.S.A. 2006;103(15):5869–5874. doi: 10.1073/pnas.0510098103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Becktel W.J., Schellman J.A. Protein stability curves. Biopolymers. 1987;26(11):1859–1877. doi: 10.1002/bip.360261104. [DOI] [PubMed] [Google Scholar]
- 4.Nojima H., Ikai A., Oshima T., Noda H. Reversible thermal unfolding of thermostable phosphoglycerate kinase. Thermostability associated with mean zero enthalpy change. J. Mol. Biol. 1977;116(3):429–442. doi: 10.1016/0022-2836(77)90078-X. [DOI] [PubMed] [Google Scholar]
- 5.Razvi A., Scholtz J.M. Lessons in stability from thermophilic proteins. Protein Sci. 2006;15(7):1569–1578. doi: 10.1110/ps.062130306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Huang P.S., Boyken S.E., Baker D. The coming of age of de novo protein design. Nature. 2016;537(7620):320–327. doi: 10.1038/nature19946. [DOI] [PubMed] [Google Scholar]
- 7.Islam M.M., Kobayashi K., Kidokoro S.I., Kuroda Y. Hydrophobic surface residues can stabilize a protein through improved water-protein interactions. FEBS J. 2019;286(20):4122–4134. doi: 10.1111/febs.14941. [DOI] [PubMed] [Google Scholar]
- 8.Goldenzweig A., Fleishman S.J. Principles of protein stability and their application in computational design. Annu. Rev. Biochem. 2018;87(1):105–129. doi: 10.1146/annurev-biochem-062917-012102. [DOI] [PubMed] [Google Scholar]
- 9.Kim D.N., Jacobs T.M., Kuhlman B. Boosting protein stability with the computational design of β-sheet surfaces. Protein Sci. 2016;25(3):702–710. doi: 10.1002/pro.2869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Huang P.S., Oberdorfer G., Xu C., Pei X.Y., Nannenga B.L., Rogers J.M., DiMaio F., Gonen T., Luisi B., Baker D. High thermodynamic stability of parametrically designed helical bundles. Science. 2014;346(6208):481–485. doi: 10.1126/science.1257481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Borgo B., Havranek J.J. Automated selection of stabilizing mutations in designed and natural proteins. Proc. Natl. Acad. Sci. U.S.A. 2012;109(5):1494–1499. doi: 10.1073/pnas.1115172109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kuhlman B., Baker D. Exploring folding free energy landscapes using computational protein design. Curr. Opin. Struct. Biol. 2004;14(1):89–95. doi: 10.1016/j.sbi.2004.01.002. [DOI] [PubMed] [Google Scholar]
- 13.Kwok S.C., Hodges R.S. Clustering of large hydrophobes in the hydrophobic core of two-stranded α-helical coiled-coils controls protein folding and stability. J. Biol. Chem. 2003;278(37):35248–35254. doi: 10.1074/jbc.M305306200. [DOI] [PubMed] [Google Scholar]
- 14.Selvaraj S., Gromiha M.M. Role of hydrophobic clusters and long-range contact networks in the folding of (α/β)8 barrel proteins. Biophys. J. 2003;84(3):1919–1925. doi: 10.1016/S0006-3495(03)75000-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Malakauskas S.M., Mayo S.L. Design, structure and stability of a hyperthermophilic protein variant. Nat. Struct. Biol. 1998;5(6):470–475. doi: 10.1038/nsb0698-470. [DOI] [PubMed] [Google Scholar]
- 16.Dahiyat B.I., Mayo S.L. Probing the role of packing specificity in protein design. Proc. Natl. Acad. Sci. U.S.A. 1997;94(19):10172–10177. doi: 10.1073/pnas.94.19.10172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Colón W., Elöve G.A., Wakem L.P., Sherman F., Roder H. Side chain packing of the N- and C-terminal helices plays a critical role in the kinetics of cytochrome c folding. Biochemistry. 1996;35(17):5538–5549. doi: 10.1021/bi960052u. [DOI] [PubMed] [Google Scholar]
- 18.Fersht A.R., Serrano L. Principles of protein stability derived from protein engineering experiments. Curr. Opin. Struct. Biol. 1993;3(1):75–83. doi: 10.1016/0959-440X(93)90205-Y. [DOI] [Google Scholar]
- 19.Basak S., Nobrega P.R., Tavella D., Deveau L.M., Koga N., Tatsumi-Koga R., Baker D., Massi F., Matthews C.R. Networks of electrostatic and hydrophobic interactions modulate the complex folding free energy surface of a designed Βα protein. Proc. Natl. Acad. Sci. U.S.A. 2019;116(14):6806–6811. doi: 10.1073/pnas.1818744116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Watters A.L., Deka P., Corrent C., Callender D., Varani G., Sosnick T., Baker D. The highly cooperative folding of small naturally occurring proteins is likely the result of natural selection. Cell. 2007;128(3):613–624. doi: 10.1016/j.cell.2006.12.042. [DOI] [PubMed] [Google Scholar]
- 21.Sterner R., Höcker B. Catalytic versatility, stability, and evolution of the (betaalpha)8-barrel enzyme fold. Chem. Rev. 2005;105(11):4038–4055. doi: 10.1021/cr030191z. [DOI] [PubMed] [Google Scholar]
- 22.Braselmann E., Chaney J.L., Clark P.L. Folding the proteome. Trends Biochem. Sci. 2013;38(7):337–344. doi: 10.1016/j.tibs.2013.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Romero-Romero S., Kordes S., Michel F., Höcker B. Evolution, folding, and design of TIM barrels and related proteins. Curr. Opin. Struct. Biol. 2021;68:94–104. doi: 10.1016/j.sbi.2020.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nagarajan D., Deka G., Rao M. Design of symmetric TIM barrel proteins from first principles. BMC Biochem. 2015;16(1):1–22. doi: 10.1186/s12858-015-0047-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Figueroa M., Oliveira N., Lejeune A., Kaufmann K.W., Dorr B.M., Matagne A., Martial J.A., Meiler J., Van de Weerdt C. Octarellin VI: using rosetta to design a putative artificial (β/α)8 protein. PLoS ONE. 2013;8(8) doi: 10.1371/journal.pone.0071858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Höcker B., Lochner A., Seitz T., Claren J., Sterner R. High-resolution crystal structure of an artificial (Βα) 8-barrel protein designed from identical half-barrels. Biochemistry. 2009;48(6):1145–1147. doi: 10.1021/bi802125b. [DOI] [PubMed] [Google Scholar]
- 27.Höcker B., Claren J., Sterner R. Mimicking enzyme evolution by generating new (betaalpha)8-barrels from (betaalpha)4-half-barrels. Proc. Natl. Acad. Sci. U.S.A. 2004;101(47):16448–16453. doi: 10.1073/pnas.0405832101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Offredi F., Dubail F., Kischel P., Sarinski K., Stern a.S., van de Weerdt C., Hoch J.C., Prosperi C., Francois J.M., Mayo S.L., Martial J.A. De novo backbone and sequence design of an idealized alpha/ protein: evidence of stable tertiary structure. J. Mol. Biol. 2004;325(1):163–174. doi: 10.1061/S022-2836(02)01206-8. [DOI] [PubMed] [Google Scholar]
- 29.Huang P.S., Feldmeier K., Parmeggiani F., Fernandez-Velasco D.A., Hocker B., Baker D. De novo design of a four-fold symmetric TIM-barrel protein with atomic-level accuracy. Nat. Chem. Biol. 2016;12(1):29–34. doi: 10.1038/nchembio.1966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Romero-Romero S., Costas M., Rodríguez-Romero A., Fernández-Velasco D.A. Reversibility and two state behaviour in the thermal unfolding of oligomeric TIM barrel proteins. Phys. Chem. Chem. Phys. 2015;17(32):20699–20714. doi: 10.1039/c5cp01599e. [DOI] [PubMed] [Google Scholar]
- 31.Cipolla A., Delbrassine F., Da Lage J.L., Feller G. Temperature adaptations in psychrophilic, mesophilic and thermophilic chloride-dependent alpha-amylases. Biochimie. 2012;94(9):1943–1950. doi: 10.1016/j.biochi.2012.05.013. [DOI] [PubMed] [Google Scholar]
- 32.Rudolph R., Siebendritt R., Kiefhaber T. Reversible unfolding and refolding behavior of a monomeric aldolase from staphylococcus aureus. Protein Sci. 1992;1(5):654–666. doi: 10.1002/pro.5560010511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Uversky V.N. Use of fast protein size-exclusion liquid chromatography to study the unfolding of proteins which denature through the molten globule. Biochemistry. 1993;32(48):13288–13298. doi: 10.1021/bi00211a042. [DOI] [PubMed] [Google Scholar]
- 34.Robertson A.D., Murphy K.P. Protein structure and the energetics of protein stability. Chem. Rev. 1997;97(5):1251–1267. doi: 10.1021/cr960383c. [DOI] [PubMed] [Google Scholar]
- 35.Robic S., Guzman-Casado M., Sanchez-Ruiz J.M., Marqusee S. Role of residual structure in the unfolded state of a thermophilic protein. Proc. Natl. Acad. Sci. U.S.A. 2003;100(20):11345–11349. doi: 10.1073/pnas.1635051100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lamazares E., Clemente I., Bueno M., Velázquez-Campoy A., Sancho J. Rational stabilization of complex proteins: a divide and combine approach. Sci. Rep. 2015;5(9129):1–11. doi: 10.1038/srep09129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Myers J.K., Nick Pace C., Martin Scholtz J. Denaturant m values and heat capacity changes: relation to changes in accessible surface areas of protein unfolding. Protein Sci. 1995;4(10):2138–2148. doi: 10.1002/pro.5560041020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Spudich G., Marqusee S. A change in the apparent m value reveals a populated intermediate under equilibrium conditions in Escherichia coli ribonuclease HI. Biochemistry. 2000;39(38):11677–11683. doi: 10.1021/bi000466u. [DOI] [PubMed] [Google Scholar]
- 39.Shortle D., Ackerman M.S. Persistence of native-like topology in a denatured protein in 8 M urea. Science. 2001;293(5529):487–489. doi: 10.1126/science.1060438. [DOI] [PubMed] [Google Scholar]
- 40.Haruki M., Tanaka M., Motegi T., Tadokoro T., Koga Y., Takano K., Kanaya S. Structural and thermodynamic analyses of Escherichia Coli RNase HI variant with quintuple thermostabilizing mutations. FEBS J. 2007;274(22):5815–5825. doi: 10.1111/j.1742-4658.2007.06104.x. [DOI] [PubMed] [Google Scholar]
- 41.Serrano L., Day A.G., Fersht A.R. Step-wise mutation of barnase to binase. A procedure for engineering increased stability of proteins and an experimental analysis of the evolution of protein stability. J. Mol. Biol. 1993;233:305–312. doi: 10.1006/jmbi.1993.1508. [DOI] [PubMed] [Google Scholar]
- 42.Stearman R.S., Frankel A.D., Freire E., Liu B., Pabo C.O. Combining thermostable mutations increases the stability of λ repressor. Biochemistry. 1988;27(19):7571–7574. doi: 10.1021/bi00419a059. [DOI] [PubMed] [Google Scholar]
- 43.Matsumura M., Signor G., Matthews B.W. Substantial increase of protein stability by multiple disulphide bonds. Nature. 1989;342(6247):291–293. doi: 10.1038/342291a0. [DOI] [PubMed] [Google Scholar]
- 44.Hart K.M., Harms M.J., Schmidt B.H., Elya C., Thornton J.W., Marqusee S. Thermodynamic system drift in protein evolution. PLoS Biol. 2014;12(11) doi: 10.1371/journal.pbio.1001994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Howell S.C., Inampudi K.K., Bean D.P., Wilson C.J. Understanding thermal adaptation of enzymes through the multistate rational design and stability prediction of 100 adenylate kinases. Structure. 2014;22(2):218–229. doi: 10.1016/j.str.2013.10.019. [DOI] [PubMed] [Google Scholar]
- 46.Rees D.C., Robertson A.D. Some thermodynamic implications for the thermostability of proteins. Protein Sci. 2001;10(6):1187–1194. doi: 10.1110/ps.180101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Alfano C., Sanfelice D., Martin S.R., Pastore A., Temussi P.A. An optimized strategy to measure protein stability highlights differences between cold and hot unfolded states. Nat. Commun. 2017;8(May):1–9. doi: 10.1038/ncomms15428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Horovitz A., Fleisher R.C., Mondal T. Double-mutant cycles: new directions and applications. Curr. Opin. Struct. Biol. 2019;58(Oct):10–17. doi: 10.1016/j.sbi.2019.03.025. [DOI] [PubMed] [Google Scholar]
- 49.Miton C.M., Tokuriki N. How mutational epistasis impairs predictability in protein evolution and design. Protein Sci. 2016;25(7):1260–1272. doi: 10.1002/pro.2876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Starr T.N., Thornton J.W. Epistasis in protein evolution. Protein Sci. 2016;25(7):1204–1218. doi: 10.1002/pro.2897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Risso V.A., Manssour-Triedo F., Delgado-Delgado A., Arco R., Barroso-delJesus A., Ingles-Prieto A., Godoy-Ruiz R., Gavira J.A., Gaucher E.A., Ibarra-Molero B., Sanchez-Ruiz J.M. Mutational studies on resurrected ancestral proteins reveal conservation of site-specific amino acid preferences throughout evolutionary history. Mol. Biol. Evol. 2015;32(2):440–455. doi: 10.1093/molbev/msu312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ashenberg O., Gong L.I., Bloom J.D. Mutational Effects on stability are largely conserved during protein evolution. Proc. Natl. Acad. Sci. U.S.A. 2013;110(52):21071–21076. doi: 10.1073/pnas.1314781111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Natarajan C., Inoguchi N., Weber R.E., Fago A., Moriyama H., Storz J.F. Epistasis among adaptive mutations in deer mouse hemoglobin. Science. 2013;340(6138):1324–1327. doi: 10.1126/science.1236862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Lynch V.J., May G., Wagner G.P. Regulatory evolution through divergence of a phosphoswitch in the transcription factor CEBPB. Nature. 2011;480(7377):383–386. doi: 10.1038/nature10595. [DOI] [PubMed] [Google Scholar]
- 55.Ortlund E.A., Bridgham J.T., Redinbo M.R., Thornton J.W. Crystal structure of an ancient protein: evolution by conformational epistasis. Science. 2007;317(5844):1544–1548. doi: 10.1126/science.1142819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kumar M.D.S. ProTherm and ProNIT: thermodynamic databases for proteins and protein-nucleic acid interactions. Nucleic Acids Res. 2006;34(90001):D204–D206. doi: 10.1093/nar/gkj103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Nikam R., Kulandaisamy A., Harini K., Sharma D., Michael Gromiha M. ProThermDB: thermodynamic database for proteins and mutants revisited after 15 years. Nucleic Acids Res. 2021;49(D1):D420–D424. doi: 10.1093/nar/gkaa1035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Caldwell S.J., Haydon I.C., Piperidou N., Huang P.S., Bick M.J., Sebastian Sjöström H., Hilvert D., Baker D., Zeymer C. Tight and specific lanthanide binding in a de novo TIM barrel with a large internal cavity designed by symmetric domain fusion. Proc. Natl. Acad. Sci. U.S.A. 2020;117(48):30362–30369. doi: 10.1073/pnas.2008535117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Koga N., Tatsumi-Koga R., Liu G., Xiao R., Acton T.B., Montelione G.T., Baker D. Principles for designing ideal protein structures. Nature. 2012;491(7423):222–227. doi: 10.1038/nature11600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Rohl C.A., Strauss C.E.M., Misura K.M.S., Baker D. Protein structure prediction using rosetta. Methods Enzymol. 2004;383:66–93. doi: 10.1007/978-0-387-92738-1_11. [DOI] [PubMed] [Google Scholar]
- 61.Böhm G., Muhr R., Jaenicke R. Quantitative analysis of protein far UV circular dichroism spectra by neural networks. Protein Eng. Des. Sel. 1992;5(3):191–195. doi: 10.1093/protein/5.3.191. [DOI] [PubMed] [Google Scholar]
- 62.Fuchs M.R., Pradervand C., Thominet V., Schneider R., Panepucci E., Grunder M., Gabadinho J., Dworkowski F.S.N., Tomizaki T., Schneider J., Mayer A., Curtin A., Olieric V., Frommherz U., Kotrle G., Welte J., Wang X., Maag S., Schulze-Briese C., Wang M. D3, the new diffractometer for the macromolecular crystallography beamlines of the swiss light source. J. Synchrotron Radiat. 2014;21(2):340–351. doi: 10.1107/S160057751400006X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Bingel-Erlenmeyer R., Olieric V., Grimshaw J.P.A., Gabadinho J., Wang X., Ebner S.G., Isenegger A., Schneider R., Schneider J., Glettig W., Pradervand C., Panepucci E.H., Tomizaki T., Wang M., Schulze-Briese C. SLS crystallization platform at beamline X06DA-A fully automated pipeline enabling in situ x-ray diffraction screening. Cryst. Growth Des. 2011;11(4):916–923. doi: 10.1021/cg101375j. [DOI] [Google Scholar]
- 64.Helmholtz-Zentrum Berlin für Materialien und Energie. (2016). The MX Beamlines BL14 .1-3 at BESSY II. J. Large-Scale Res. Facil. 47 (2), 1–6. http://dx.doi.org/10.17815/jlsrf-2-64
- 65.Kabsch W. XDS. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66(2):125–132. doi: 10.1107/S0907444909047337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Sparta K.M., Krug M., Heinemann U., Mueller U., Weiss M.S. XDSAPP2.0. J. Appl. Crystallogr. 2016;49:1085–1092. doi: 10.1107/S1600576716004416. [DOI] [Google Scholar]
- 67.Winter G., Waterman D.G., Parkhurst J.M., Brewster A.S., Gildea R.J., Gerstel M., Fuentes-Montero L., Vollmar M., Michels-Clark T., Young I.D., Sauter N.K., Evans G. DIALS: implementation and evaluation of a new integration package. Acta Crystallogr. Sect. D Struct. Biol. 2018;74:85–97. doi: 10.1107/S2059798317017235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Adams P.D., Afonine P.V., Bunkóczi G., Chen V.B., Davis I.W., Echols N., Headd J.J., Hung L.W., Kapral G.J., Grosse-Kunstleve R.W., McCoy A.J., Moriarty N.W., Oeffner R., Read R.J., Richardson D.C., Richardson J.S., Terwilliger T.C., Zwart P.H. PHENIX: A comprehensive python-based system for macromolecular structure solution. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66(2):213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Emsley P., Lohkamp B., Scott W.G., Cowtan K. Features and development of coot. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66(4):486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Joosten R.P., Long F., Murshudov G.N., Perrakis A. The PDB-REDO server for macromolecular structure model optimization. IUCrJ. 2014;1:213–220. doi: 10.1107/S2052252514009324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Chen V.B., Arendall W.B., Headd J.J., Keedy D.A., Immormino R.M., Kapral G.J., Murray L.W., Richardson J.S., Richardson D.C. MolProbity: All-atom structure validation for macromolecular crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010;66(1):12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Berman H.M., Battistuz T., Bhat T.N., Bluhm W.F., Bourne P.E., Burkhardt K., Feng Z., Gilliland G.L., Iype L., Jain S., Fagan P., Marvin J., Padilla D., Ravichandran V., Schneider B., Thanki N., Weissig H., Westbrook J.D., Zardecki C. The protein data bank. Acta Crystallogr. Sect. D Biol. Crystallogr. 2002;58(6 I):899–907. doi: 10.1107/S0907444902003451. [DOI] [PubMed] [Google Scholar]
- 73.Freire E. Statistical thermodynamic analysis of differential scanning calorimetry data: structural deconvolution of heat capacity function of proteins. Methods Enzymol. 1994;240(C):502–530. doi: 10.1016/S0076-6879(94)40062-8. [DOI] [PubMed] [Google Scholar]
- 74.Lyubarev A.E., Kurganov B.I. Modeling of irreversible thermal protein denaturation at varying temperature. I. The model involving two consecutive irreversible steps. Biochem. 1998;63(4):434–440. [PubMed] [Google Scholar]
- 75.Sanchez-Ruiz J.M. Theoretical analysis of lumry-eyring models in differential scanning calorimetry. Biophys. J. 1992;61(4):921–935. doi: 10.1016/S0006-3495(92)81899-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Costas M., Rodríguez-Larrea D., De Maria L., Borchert T.V., Gómez-Puyou A., Sanchez-Ruiz J.M. Between-species variation in the kinetic stability of TIM proteins linked to solvation-barrier free energies. J. Mol. Biol. 2009;385(3):924–937. doi: 10.1016/j.jmb.2008.10.056. [DOI] [PubMed] [Google Scholar]
- 77.Santoro M.M., Bolen D.W. Unfolding free energy changes determined by the linear extrapolation method. 1. Unfolding of phenylmethanesulfonyl a-chymotrypsin using different denaturants. Biochemistry. 1988;27(21):8063–8068. doi: 10.1021/bi00421a014. [DOI] [PubMed] [Google Scholar]
- 78.Kumar S., Tsai C.J., Nussinov R. Maximal stabilities of reversible two-state proteins. Biochemistry. 2002;41(17):5359–5374. doi: 10.1021/bi012154c. [DOI] [PubMed] [Google Scholar]
- 79.Privalov P.L., Gill S.J. Stability of protein structure and hydrophobic interaction. Adv. Protein Chem. 1988;39(C):191–234. doi: 10.1016/S0065-3233(08)60377-0. [DOI] [PubMed] [Google Scholar]
- 80.Horovitz A., Fersht A.R. Strategy for analysing the co-operativity of intramolecular interactions in peptides and proteins. J. Mol. Biol. 1990;214(3):613–617. doi: 10.1016/0022-2836(90)90275-Q. [DOI] [PubMed] [Google Scholar]
- 81.Carter P.J., Winter G., Wilkinson A.J., Fersht A.R. The use of double mutants to detect structural changes in the active site of the tyrosyl-TRNA synthetase (Bacillus Stearothermophilus) Cell. 1984;38(3):835–840. doi: 10.1016/0092-8674(84)90278-2. [DOI] [PubMed] [Google Scholar]
- 82.Katoh K., Standley D.M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 2013;30(4):772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Pravda L., Sehnal D., Toušek D., Navrátilová V., Bazgier V., Berka K., Vařeková R.S., Koča J., Otyepka M. MOLEonline: A web-based tool for analyzing channels, tunnels and pores (2018 update) Nucleic Acids Res. 2018;46(W1):W368–W373. doi: 10.1093/nar/gky309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Willard L., Ranjan A., Zhang H., Monzavi H., Boyko R.F., Sykes B.D., Wishart D.S. VADAR: A web server for quantitative evaluation of protein structure quality. Nucleic Acids Res. 2003;31(13):3316–3319. doi: 10.1093/nar/gkg565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.McDonald I.K., Thornton J.M. Satisfying hydrogen bonding potential in proteins. J. Mol. Biol. 1994;238(5):777–793. doi: 10.1006/jmbi.1994.1334. [DOI] [PubMed] [Google Scholar]
- 86.Costantini S., Colonna G., Facchiano A.M. ESBRI: A web server for evaluating salt bridges in proteins. Bioinformation. 2008;3(3):137–138. doi: 10.6026/97320630003137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Sobolev V., Sorokine A., Prilusky J., Abola E.E., Edelman M. Automated analysis of interatomic contacts in proteins. Bioinformatics. 1999;15(4):327–332. doi: 10.1093/bioinformatics/15.4.327. [DOI] [PubMed] [Google Scholar]
- 88.Ferruz N., Schmidt S., Höcker B. ProteinTools: a toolkit to analyze protein structures. Nucleic Acids Res. 2021;gkab375 doi: 10.1093/nar/gkab375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Woody R.W. Aromatic side-chain contributions to the far ultraviolet circular dichroism of peptides and proteins. Biopolymers. 1978;17(6):1451–1467. doi: 10.1002/bip.1978.360170606. [DOI] [Google Scholar]
- 90.Hilser V.J., Gómez J., Freire E. The enthalpy change in protein folding and binding: refinement of parameters for structure-based calculations. Proteins Struct. Funct. Genet. 1996;26(2):123–133. doi: 10.1002/(SICI)1097-0134(199610)26:2<123::AID-PROT2>3.0.CO;2-H. [DOI] [PubMed] [Google Scholar]
- 91.Makhatadze G., Privalov P.L. Protein interactions with urea and guanidinium chloride: a calorimetric study. J. Mol. Biol. 1992;226(2):491–505. doi: 10.1016/0022-2836(92)90963-K. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





