Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2017 Nov 6;114(47):12472–12477. doi: 10.1073/pnas.1708907114

Enzyme stabilization via computationally guided protein stapling

Eric J Moore a, Dmitri Zorine b,c, William A Hansen b,c,d, Sagar D Khare b,c,d,1, Rudi Fasan a,1
PMCID: PMC5703291  PMID: 29109284

Significance

The marginal stability of most natural proteins presents a challenge for the exploitation of natural and engineered enzymes in biotechnology and industrial biocatalysis. Protein stabilization can be time- and labor-intensive and often involves extensive amino acid substitutions, which may impair the activity and/or selectivity of the enzyme. Here, we describe a computational design method for enzyme stabilization that uses structure-based modeling to introduce covalent ‟staples” in a protein scaffold via a genetically encodable noncanonical amino acid. This method was applied to obtain stapled variants of a stereoselective cyclopropanation biocatalyst featuring greatly increased thermostability and robustness to high concentrations of organic cosolvents. This minimally invasive strategy for protein stabilization should be applicable to a variety of enzymes and proteins.

Keywords: protein thermostabilization, computational protein design, myoglobin, noncanonical amino acids, Rosetta macromolecular modeling

Abstract

Thermostabilization represents a critical and often obligatory step toward enhancing the robustness of enzymes for organic synthesis and other applications. While directed evolution methods have provided valuable tools for this purpose, these protocols are laborious and time-consuming and typically require the accumulation of several mutations, potentially at the expense of catalytic function. Here, we report a minimally invasive strategy for enzyme stabilization that relies on the installation of genetically encoded, nonreducible covalent staples in a target protein scaffold using computational design. This methodology enables the rapid development of myoglobin-based cyclopropanation biocatalysts featuring dramatically enhanced thermostability (ΔTm = +18.0 °C and ΔT50 = +16.0 °C) as well as increased stability against chemical denaturation [ΔCm (GndHCl) = 0.53 M], without altering their catalytic efficiency and stereoselectivity properties. In addition, the stabilized variants offer superior performance and selectivity compared with the parent enzyme in the presence of a high concentration of organic cosolvents, enabling the more efficient cyclopropanation of a water-insoluble substrate. This work introduces and validates an approach for protein stabilization which should be applicable to a variety of other proteins and enzymes.


Enzymes play a major role in biotechnology and constitute attractive catalysts for the implementation of efficient, selective, and sustainable processes for the production of pharmaceuticals, fine chemicals, and biofuels (1, 2). The marginal stability of proteins, however, poses a fundamental challenge for the exploitation of enzymes for practical-scale syntheses and chemical manufacturing, which often require harsh reaction conditions such as elevated temperature and exposure to organic solvents (1). Due to these limitations, protein stabilization against thermal and chemical denaturation has represented a long-standing goal in enzyme design and engineering. In addition to higher robustness to operational conditions, increasing the thermostability of a protein can enhance its evolvability (i.e., its tolerance to mutagenesis for the acquisition of new or improved functions) (3).

Well-known experimental strategies for enzyme stabilization include directed evolution and consensus mutagenesis (46). While effective in many instances, these procedures remain laborious and time-consuming, typically requiring several rounds of mutagenesis and screening to obtain variants with significantly increased stability (i.e., ΔTm > 5–10 °C) (7). Notable examples of enzyme stabilization have been reported using computational design (8). In this area, methods have focused on optimizing native state interactions, for example through improving core packing (911), fragment contacts (12), combined structure- and phylogeny-guided energy optimization (13, 14), surface charge optimization (15, 16), and rigidification (17, 18). A complementary but comparatively less explored approach has focused on decreasing the configurational entropy of the unfolded state ensemble (19), primarily via the installation of nonnative disulfide linkages (20). Indeed, the introduction of nonnative disulfide bridges (20) has proven useful for increasing the thermostability of enzymes, although stability–activity trade-offs were also observed (21, 22). Disulfide cross-linking can additionally provide kinetic stabilization (23). Unfortunately, the redox instability of disulfide bridges make this approach unsuitable for many proteins and enzymes that are meant to function in reducing environments, such as the intracellular milieu and/or in the presence of reductants (2426). Moreover, the reversibility and homotypic nature of disulfide bridges can complicate folding of variants with multiple cross-links due to disulfide scrambling (27). Thus, the development of computational strategies for protein stabilization by means of chemically stable covalent bonds remains an unmet challenge.

Here, we report the development of Rosetta-guided protein stapling (R-GPS), a method for enzyme/protein stabilization based on the computational design of genetically encodable, covalent “staples” in a protein of interest. This method utilizes the Rosetta enzyme design framework to identify optimal sites for intramolecular cross-linking of the protein scaffold via redox-stable and irreversible thioether bonds generated upon the reaction between a cysteine and a genetically encoded noncanonical amino acid containing a cysteine-reactive side-chain group (O-2-bromoethyl tyrosine, or O2beY) (28). The R-GPS approach was implemented and validated in the context of a myoglobin (Mb)-based cyclopropanation biocatalyst, resulting in the development of “cyclopropanases” with significantly increased stability against thermal and chemical denaturation as well as improved catalytic performance under harsh reaction conditions.

Results

Choice of Target Enzyme and Protein Stapling Chemistry.

A recently reported Mb-derived cyclopropanation biocatalyst, Mb(H64V,V68A) (29), was selected as the test bed for the development of R-GPS. Compared with wild-type Mb, Mb(H64V,V68A) bears two active site mutations that confer enhanced catalytic activity as well as high diastereo- and enantioselectivity (>90–99% de and ee) in the cyclopropanation of styrenes and vinylarenes with ethyl α-diazoacetate as the carbene donor (29). On one hand, stabilization of Mb(H64V,V68A) was desirable for enhancing its robustness for synthetic applications, which include the synthesis of cyclopropane-containing drugs and other carbene transfer reactions (30, 31). On the other hand, the high stereoselectivity of Mb(H64V,V68A) was envisioned to provide a sensitive probe for evaluating the impact of the computationally designed covalent staples on subtle functional properties such as stereoinduction. Indeed, previous studies indicated that the stereoselectivity of Mb-based cyclopropanation catalysts is highly sensitive to small structural variations in their active sites (30).

The thioether bond-forming reaction between cysteine and the genetically encodable O-2-bromoethyl-tyrosine (O2beY) (28, 32) was selected as the protein cross-linking strategy (Fig. 1A). Based on these previous studies, the O2beY/Cys reaction was anticipated to offer several promising features for the purpose of protein stapling, namely (i) the ability to mediate the spontaneous formation of the thioether cross-link at the posttranslational level, (ii) high chemoselectivity of O2beY toward cysteine-mediated alkylation in the presence of other nucleophilic residues (e.g., Lys and His), and (iii) “spatially controlled” reactivity, whereby the O2beY and Cys undergo the nucleophilic substitution reaction only when located in close proximity.

Fig. 1.

Fig. 1.

Computational design approach. (A) Stapling reaction between the noncanonical amino acid O-2-bromoethyl tyrosine and cysteine resulting in a chemically stable thioether bond. (B) Structure of Mb (Protein Data Bank ID code 1JP9) highlighting helices A–H (Left). The active-site heme group and metal-coordinating histidine residue are shown as sticks. Conformational ensemble of the modeled thioether linkage used to find compatible locations for stapling (Right). (C) Models showing locations of covalent staples between A–H and B–G helices in designs sMb1 through sMb9.

Computational Design of Stapled Mb(H64V,V68A) Variants Using R-GPS.

As a first design principle we reasoned that cross-linking residues that are distal in the primary sequence but proximal in space in the folded state would maximize thermostabilization by decreasing the chain entropic cost of folding (3336). As a second criterion we envisioned the need to identify compatible backbone positions for placing the haloalkane (O2beY) and thiol (Cys) side-chain groups so that the cross-link is accommodated in a strain-free configuration while making energetically favorable interactions with its surrounding residues (33). Following these guiding principles, the N-terminal helices A and B and the C-terminal helices G and H of the target enzyme, Mb(H64V,V68A), were chosen as the target regions for the installation of the thioether bridges (Fig. 1B), as cross-links between these structural elements would bear the highest contact order. The RosettaMatch algorithm (37) was then applied to identify positions where the O2beY and Cys residues can be accommodated for formation of the thioether staple. Once geometrically feasible backbone positions of the cross-linking residues were obtained (Fig. 1B), RosettaDesign (38) was performed to identify additional sequence changes to minimize steric clashes with the modeled staple. To assess the method, we selected for experimental characterization a diverse set of nine designs based on (i) the solvent accessibility of the staple in the protein structure, (ii) total number of amino acid substitutions, and (iii) Rosetta energies (Table 1). These designs were termed sMb1 through sMb9 and had differences in scores relative to the parent protein Mb(H64V,V68A) ranging from −6.7 (sMb1) to +16.6 (sMb9) Rosetta energy units (Reu), and intervening segment lengths ranging from 73 (sMb2) to 121 (sMb5) residues. Seven of designed variants feature a surface exposed thioether linkage, whereas for the remaining two (sMb6 and sMb8) the cross-link is buried (Fig. 1C). The number of designed amino acid substitutions, excluding the Cys and O2beY residues, ranged from zero (sMb2 and sMb5) to two (sMb3, sMb6, sMb7, sMb8, and sMb9).

Table 1.

Computational and experimental values for Mb(H64V,V64A) and its variants

Variant Mutations ΔGcomp, Reu Location staple* Attack angle, deg Staple (MS) Tm °C (ΔTm) T50, °C (ΔT50)§ Cm (M) (ΔCm)
Mb(H64V,V68A) 0 None n.a. n.a. 66.0 ± 1.0 (0.0) 62.3 ± 0.6 (0.0) 1.55 ± 0.02 (0.0)
sMb1 R31(O2beY), S35K, E109C −6.73 B–G 102 No 67.5 ± 1.3 (+1.5) 59.1 ± 0.6 (−3.2) n.d.
sMb2 H36(O2beY), E109C −5.05 B/C–G 142 Yes 73.2 ± 0.2 (+7.2) 63.8 ± 0.3 (+1.5) n.d.
sMb3 L9R, H12C, D122T, A127(O2beY) −2.66 A–H 126 Yes 72.2 ± 1.4 (+6.2) 60 ± 3 (−2.3) n.d.
sMb4 D27(O2beY), H113C, V114G −0.76 B–G 98 Yes (partial) 56.1 ± 1.1 (−9.9) 50 ± 2 (−12.3) n.d.
sMb5 G5(O2beY), D126C −0.68 A–H 169 Yes 76.0 ± 2.0 (+10.0) 71.5 ± 0.9 (+9.2) 1.79 ± 0.02 (+0.24)
sMb6 G25C, I28A, L69A, I111(O2beY) +1.70 B–G (core) 132 No 53.0 ± 1.0 (−13.0) 51.2 ± 0.3 (−11.1) n.d.
sMb7 D20S, G23C, D27A, R118(O2beY) +2.34 B-G 156 Yes 69.9 ± 1.4 (+3.9) 54 ± 1 (−8.3) n.d.
sMb8 H24(O2beY), I28S, L69S, I111C +3.05 B–G (core) 108 No 49.2 ± 1.0 (−16.8) 43.3 ± 0.6 (−19.0) n.d.
sMb9 K16C, H119A, G121(O2beY), D122S +16.6 A-G/H 112 No 64.3 ± 1.3 (−1.7) 52 ± 2 (−10.3) n.d.
sMb10 G5(O2beY), H36(O2beY), E109C, D126C n.a. B/C-G + A–H n.a. Yes (double) 82.8 ± 0.8 (+16.8) 73 ± 2 (+10.7) 2.01 ± 0.05 (+0.46)
sMb11 G5(O2beY), H36(O2beY), F106A, E109C, D126C n.a. B/C–G + A–H n.a. Yes (double) n.d. 61 ± 1 (−1.3) n.d.
sMb12 G5(O2beY), H36(O2beY), E109C, D126C, G129E n.a. B/C–G + A–H n.a. n.d. n.d. 66.6 ± 0.8 (+4.3) n.d.
sMb13 G5(O2beY), H36(O2beY), E109C, H113E, D126C n.a. B/C–G + A–H n.a. Yes (double) 84.0 ± 0.2 (+18.0) 78.3 ± 0.8 (+16.0) 2.08 ± 0.03 (+0.53)

deg, degrees; n.a., not applicable; n.d., not determined.

*

Corresponding to alpha helices A through H as labeled in Fig. 1.

Angle for nucleophilic attack of Cys thiol group to O2beY side-chain alkyl bromide group (θattack) as described in SI Appendix, Fig. S1A.

Apparent melting temperatures as determined via circular dichroism (SI Appendix, Fig. S7).

§

Half-maximal denaturation temperature of holoprotein (Soret band) after 10-min incubation.

Midpoint guanidinium chloride concentration from chemical denaturation curves measured by CD.

Expression and Characterization of the sMb Designs.

All of the designed Mb(H64V,V68A)-derived variants could be expressed in soluble form from Escherichia coli cells containing an orthogonal aminoacyl-tRNA synthetase/tRNACUA pair for O2beY incorporation (28) via amber stop codon suppression (39). In addition, the constructs were all able to bind heme and fold properly, as evinced from the characteristic Soret band (∼410 nm) in the corresponding UV-visible spectra (SI Appendix, Fig. S4).

SDS/PAGE analysis revealed an increase in electrophoretic mobility for five out of the nine protein constructs, namely sMb2, sMb3, sMb4, sMb5, and sMb7, compared with Mb(H64V,V68A) (Fig. 2A and SI Appendix, Fig. S5). This behavior is indicative of a more compact structure of the protein under denaturing and reducing conditions (SDS + DTT), which is consistent with the presence of the nonreducible thioether (O2beY/Cys) cross-link. These conclusions were corroborated by MALDI-TOF MS (Fig. 2B and SI Appendix, Fig. S6), which showed a single signal corresponding to expected mass of these proteins minus 82 Da, deriving from the loss of HBr as a result of the O2beY/Cys cross-linking reaction. For sMb4, two m/z signals consistent with the stapled and unstapled form of the protein indicated the O2beY/cysteine cross-linking reaction has occurred only partially. For the remaining designs, the only observable species in the MS spectra corresponded to the O2beY-containing protein lacking the thioether cross-link. Notably, the integrity of the O2beY residue in these unstapled constructs demonstrates the lack of undesirable reactivity toward hydrolysis or abundant thiol-containing intracellular metabolites such as glutathione. These results also show how placing the O2beY/Cys pair in close spatial proximity in the context of a folded protein is a necessary but not sufficient condition for productive stapling. The positional dependence of the stapling reaction could be rationalized based on analysis of the rotamers of Cys and O2beY residues (Fig. 1B) in models of the sMb constructs in their unstapled forms. In particular, a good agreement was found between the experimental results and the accessibility of near-attack conformations (NACs) compatible with a bimolecular nucleophilic substitution reaction between the thiol group of the cysteine and the alkyl bromide group in O2beY (see SI Appendix for further discussion).

Fig. 2.

Fig. 2.

Characterization of stapled Mb(H64V,V68A) variants (sMb variants). (A and B) SDS/PAGE gel (A) and MALDI-TOF MS spectrum (B) of Mb(H64V,V68A) and representative sMb variants. c, cross-linked; c(2×), doubly cross-linked; nc, not cross-linked. Calculated masses: Mb(H64V,V68A): 18,474 Da; sMb2 (c): 18,500 Da; sMb4 (c): 18,472 Da; sMb5 (c): 18,594 Da; sMb10 [c(2×)]: 18,621 Da; sMb13 [c(2×)]: 18,612 Da. (C) Thermal denaturation curves for Mb(H64V,V68A) and selected stapled variants as measured via CD at 220 nm (Tm determination). (D) Heat-induced inactivation curves (heme loss) for the same proteins as determined by decrease of Soret band signal (408 nm) after incubation (10 min) at variable temperatures (T50 determination). See SI Appendix, Figs. S5–S7 for additional data. (E and F) Catalytic activity (E) and enantioselectivity (F) of Mb(H64V,V68A) and stapled variants in styrene cyclopropanation reactions with EDA in buffer only and in the presence of 30% vol/vol ethanol. Relative activities refer to normalized catalytic turnovers (TON) relative to TON measured with Mb(H64V,V68A) in buffer only reactions. See SI Appendix, Figs. S10–S12 for related data with other organic cosolvents. Rel Int, relative intensity.

Thermostability of the sMb Designs.

The thermostability of the Mb variants was examined by measuring their melting temperatures (Tm) using CD (SI Appendix, Fig. S7). Mb(H64,V68A) was found to exhibit an apparent Tm of 66.0 °C (Fig. 2C), which is about 14 °C lower than that of wild-type Mb. Notably, all of the sMb constructs containing the thioether staple showed an increase in thermostability compared with the parent protein, with ΔTm values ranging from 3.9 °C to 10 °C (Table 1). The largest thermostabilization effect was observed for sMb5 (ΔTm: +10.0 °C), followed by sMb2 (ΔTm: +7.2 °C). In contrast, the sMb variants lacking the cross-link show comparable (sMb1 and sMb9) or lower Tm values (sMb4, sMb6, and sMb8) compared with Mb(H64V,V68A) (ΔTm: +1.5 to –16.8 °C; Table 1). To further examine the impact of the thioether staple, analogs of the two most stable sMb variants, sMb2 and sMb5, were prepared by substituting O2beY for O-propargyl-tyrosine (OpgY) (40). OpgY is an isostere of O2beY but is unable to react with cysteine to form the thioether cross-link, as confirmed by SDS/PAGE (Fig. 2A) and MALDI-TOF MS analyses (SI Appendix, Fig. S6). The resulting OpgY-containing variants, sMb2(OpgY) and sMb5(OpgY), displayed significantly lower Tm values than their stapled counterparts (ΔTm: –12.2 °C and –11.9 °C, respectively), confirming the stabilizing effect of the thioether cross-link, as encoded in computational design.

As a second measure of thermostability, half-maximal denaturation temperatures (T50) were determined by monitoring heme loss (λmax: 408 nm) upon incubation of the hemoproteins (10 min) at variable temperatures (Fig. 2D). This is a more stringent assay of thermal stability since it monitors the ability of the sMb variants to remain associated with the heme cofactor, which is essential for their activity as cyclopropanation biocatalysts. As thermal denaturation of holomyoglobin is irreversible, an increase in T50 is also a measure of kinetic stabilization upon cross-linking. In this assay, Mb(H64V,V68A) showed a T50 value of 62.3 °C (Table 1). Whereas sMb3 and sMb7 showed reduced T50 values compared with Mb(H64V,V64A) (Table 1), both sMb5 and sMb2 exhibited improved thermal stability compared with the parent protein (ΔT50: +9.2 °C and +1.5 °C, respectively; Fig. 2C). These results demonstrate that the thermostabilization effect induced by the O2beY/Cys cross-links in both sMb2 and sMb5 is not restricted to the protein secondary structure, as determined by the CD melting curves, but extends to the heme-bound (“holo”) forms of these proteins.

Design and Characterization of Doubly Stapled Mb Variants.

Further stabilization of Mb(H64V,V68A) was then pursued by combining the covalent staples from the two most promising variants, namely sMb2 and sMb5, resulting in the sMb10 design. Upon production in E. coli, MS analysis confirmed the successful formation of both cross-links within this protein (Fig. 2B). Further characterization of sMb10 showed a nearly additive effect of the two thioether staples toward increasing thermostability of the hemoprotein in terms of both Tm and T50Tm: +16.8 °C; ΔT50: +10.7 °C; Table 1). Comparison of the CD spectra corresponding to sMb5, sMb10, and Mb(H64V,V68A) revealed no major changes in the protein secondary structure as a result of the presence of a single (sMb5) and double staple (sMb10) (SI Appendix, Fig. S8).

Evaluation and further stabilization of sMb10 was then pursued via structure-guided point mutagenesis. Investigation of the F106A substitution in sMb11 confirmed the energetic importance of the computed π–π stacking interaction between the O2beY residue in position 36 and the phenyl ring of Phe106 (see SI Appendix for further discussion). Next, we noticed that installation of the 36–109 cross-link (from sMb2) and 5–126 cross-link (from sMb5) replaces in each case a solvent-exposed negatively charged residue (i.e., Glu109 and Asp126, respectively) with the less-polar O2beY/Cys staple. Accordingly, two additional constructs (sMb12 and sMb13) were designed to incorporate “compensating” neutral-to-negatively charged amino acid substitutions (i.e., G129E and H113E, respectively) at a compatible position in proximity of each staple. Whereas sMb12 showed reduced thermostability (T50) compared with sMb10, sMb13 was found to exhibit higher TmTm: +1.2 °C) as well as significantly increased stability against temperature-induced heme loss (ΔT50: +5.3 °C) compared with sMb10. The presence of the two O2beY/Cys cross-links in sMb13 was confirmed by MS analysis, which also revealed the presence of a small fraction of singly stapled protein (Fig. 2B). The CD spectra of sMb13 and sMb10 were found to be superimposable, indicating that the “charge-compensating” mutation H113E had minimal impact on the protein structure (SI Appendix, Fig. S8). Thus, as a result of merely five solvent-exposed mutations in sMb13, the thermostability (Tm) of the original Mb(H64V,V64A) biocatalyst could be augmented by +18 °C. These modifications also translated into a significantly increased thermal stability of the heme-bound form of the protein (ΔT50: +16 °C).

Catalytic Activity and Selectivity of Stapled Mb(H64V,V68A) Variants in Cyclopropanation Reactions.

In the absence of selective pressure to maintain function, thermostabilization via protein engineering is often accompanied by a reduction in catalytic efficiency and/or selectivity of the target enzyme (4143). To assess the impact of the thioether staples on the catalytic properties and stereoselectivity of the thermostabilized sMb variants these biocatalysts were tested in a model cyclopropanation reaction with styrene (1) and α-diazoacetate (EDA, 2) (Table 2). Under the applied conditions [10 mM styrene, 20 mM EDA, 2 μM Mb variant (0.02 mol%)] Mb(H64V,V68A) was found to produce (1S,2S)-ethyl 2-phenylcyclopropanecarboxylate (3a) in 94% yield (4,710 turnovers or TON) and high diastereomeric (99.4% de) and enantiomeric excess (99.1% ee) (entry 1, Table 2). Under identical conditions, both sMb10 and sMb13 were able to catalyze the formation of 3a with equally high catalytic activity (92–94% yields; 4,710–4,615 TON) and stereoselectivity (>99% de and ee; entries 4–5, Table 2). These data clearly indicated that thermostabilization induced by the stapling procedure had no negative impact on the carbene transfer reactivity of these hemoproteins, nor did it perturb the asymmetric environment provided by the distal heme pocket, which is critical for inducing the (1S,2S)-enantioselectivity during the cyclopropanation reaction (29).

Table 2.

Catalytic activity and selectivity of Mb(H64V,V68A) and its stapled variants for cyclopropanation of styrene (1a) and pentafluorostyrene (1b) with ethyl 2-diazo-acetate (2) in the absence and in the presence of organic cosolvents

graphic file with name pnas.1708907114fx01.jpg

Entry Variant Product Solvent Yield, % TON % de % ee
1 Mb(H64V,V68A) 3a 94 4,710 ± 180 99.4 99.1
2 sMb2 3a 94 4,740 ± 100 99.3 99.2
3 sMb5 3a 94 4,690 ± 25 99.1 99.1
4 sMb10 3a 94 4,710 ± 250 99.4 99.3
5 sMb13 3a 92 4,610 ± 140 99.2 99.2
6 Mb(H64V,V68A) 3a 30% EtOH 48 2,420 ± 280 98 92
7 sMb10 3a 30% EtOH 82 4,120 ± 270 99.3 97
8* sMb10 3a 30% DMF 84 (64) 4,220 ± 140 (3,480) 99.8 (99) 99.6 (99)
9* sMb13 3a 30% THF 81 (92) 4,360 ± 90 (4,660) 97 (99) 95 (94)
10* sMb10 3a 30% DMSO 64 (60) 3,220 ± 280 (3,000) 99.8 (99.8) 99.6 (99.4)
11* sMb10 3a 45% DMSO 39 (20) 1,990 ± 190 (1,020) 99.9 (99.9) 98.5 (98.5)
12 Mb(H64V,V68A) 3b 45% DMSO 54 270 ± 5 99.8 99.9
13 sMb10 3b 45% DMSO 80 395 ± 5 99.8 99.9
14 sMb13 3b 45% DMSO 77 380 ± 5 99.8 99.9

Reaction conditions for 3a: 2 μM protein (0.02 mol%), 10 mM 1a, 20 mM EDA (2), 10 mM sodium dithionite in 50 mM potassium phosphate (pH 7), room temperature. Reaction conditions for 3b: 10 μM protein (0.2 mol%), 5 mM 1b, 10 mM EDA (2), 10 mM sodium dithionite in 50 mM potassium phosphate (pH 7), room temperature.

*

Values in brackets correspond to reactions performed in the presence of Mb(H64V,V68A) as the catalyst.

Increased Stability Against Chemical Denaturation.

Enhanced robustness to chemical denaturation and organic solvents is a highly desirable trait of enzymes to be employed for synthetic applications (44, 45). To examine this aspect, the most promising sMb constructs (sMb5, sMb10, and sMb13) were subjected to denaturation experiments in the presence of guanidinium chloride (Gnd·HCl) (SI Appendix, Fig. S9). Reflecting the trend emerging from the thermostability assays, a progressive stability increase in the presence of the chaotropic agent was observed going from the singly stapled sMb5 (Cm: 1.79 M) to the doubly stapled sMb10 and sMb13 (Cm: 2.01 M and 2.08 M, respectively), compared with the parent enzyme (Cm: 1.55 M).

Next, we investigated the effect of the staples toward improving the performance of the sMb variants in organic solvents. In the presence of 30% (vol/vol) ethanol, Mb(H64V,V68A) catalyzes the cyclopropanation of styrene with reduced activity (48% yield; 2,420 TON; 51% relative activity) and lower enantioselectivity (92% ee) compared to the same reaction in buffer (Fig. 2 E and F; entry 6 vs. 1, Table 2). Compared with Mb(H64V,V68A), both sMb10 and sMb13 better tolerate the presence of the organic cosolvent, affording the cyclopropanation product 3a in higher yields (78–82%; 3,900–4,100 TON; entry 7, Table 2) and with higher diastereo- (1:400 vs. 1:130 diastereomeric ratio for trans:cis) and enantioselectivity (97% vs. 92% ee) (entry 7 vs. 6, Fig. 2F and Table 2). As stereoselectivity is highly sensitive to structural perturbation within the active site of these biocatalysts (30), these results suggest that stabilization of the Mb scaffold by covalent stapling minimizes the disruptive effects induced by the organic solvent. A similar trend was observed upon comparing the activity and selectivity of sMb5, sMb10, and sMb13 with those of Mb(H64V,V68A) in cyclopropanation reactions in the presence of high concentrations (30% vol/vol) of other organic solvents such as methanol, dimethylformamide (DMF), and DMSO (SI Appendix, Figs. S10–S12). In comparison, tetrahydrofuran (THF) was tolerated equally well by both the parent protein and the stapled variants (entry 9, Table 2). In the presence of these organic cosolvents, both sMb10 and sMb13 are able to produce 3a in high yields (64–88%; 3,200–4,440 TON) and high stereoselectivity (99.5–99.8% de; 99.2–99.6% ee) (entries 8–10, Table 2). Furthermore, sMb10 and sMb13 maintain good catalytic activity (1,080–1,990 TON) as well as excellent stereoselectivity (>99% de; 97–98.5% ee) even in the presence of a nearly 1:1 mixture (45% vol/vol) of buffer with DMF or DMSO (entry 11, Table 2 and SI Appendix, Figs. S11 and S12). These results are remarkable considering that most heme-dependent enzymes are readily inactivated by low concentrations (>5–10% vol/vol) of these organic solvents (46, 47). Given their enhanced performance at high DMSO concentrations, sMb10 and sMb13 could be applied to afford the cyclopropanation of a water-insoluble substrate, that is, pentafluorostyrene (1b), with higher efficiency than possible using the parent Mb variant (Table 2, entries 13 and 14 vs. 12). Altogether, these results demonstrate the value of R-GPS toward enhancing the robustness and performance of a biocatalyst in the presence of chemical denaturants and organic cosolvents.

Discussion

As demonstrated through the design and characterization of highly stabilized variants of an Mb-based cyclopropanase, R-GPS enables the rapid identification of optimal sites within a target protein scaffold for structural stabilization via “stapling” through nonnative covalent bridges. In its current implementation, R-GPS utilizes a bioorthogonal reaction between cysteine and the genetically encodable noncanonical amino acid O2beY to produce chemically stable and nonreducible thioether cross-links. Key advantages of this chemistry include its chemoselectivity, spatially controlled reactivity, and efficient and spontaneous formation of the cross-link at the posttranslational level and in the intracellular space. The latter feature allows for the production of the stapled proteins inside cells and eliminates the need for protein manipulation during or after purification (48). Whereas R-GPS has been validated using the O2beY/Cys cross-linking method, we expect this approach can be extended to other noncanonical amino acids and/or other chemistries useful for protein cross-linking (4952).

Two computationally designed staples could be effectively combined in the final designs (sMb10 and sMb13) to achieve additive effects for stabilization against thermal and chemical denaturation. While the introduction of computationally designed nonnative disulfide bridges may result in misfolded variants due to disulfide scrambling (27), no evidence of incorrect cross-linking was noted for the sMb variants containing the two thioether staples. We attribute this result to the heterotypic nature of the O2beY/Cys thioether linkage, which reduces the number of wrong cross-links that can be formed compared with the homotypic disulfide bridge. In addition, the presence of specific geometric requirements for successful formation of the O2beY/Cys cross-link, as revealed by the present studies (SI Appendix, Fig. S1), likely contributes to further disfavor the formation of incorrect linkages. At the same time, a noticeable reduction in the protein yields was observed upon incorporation of multiple copies of O2beY via amber stop codon suppression [2 mg/L culture for sMb10 compared with 10–15 mg/L for sMb2/sMb5 and 30 mg/L for Mb(H64V,V68A)]. While this result is not surprising given the less-efficient incorporation of noncanonical amino acids (ncAAs) compared with natural amino acids, various strategies have recently enabled the expression of proteins containing multiple ncAAs (5355) and these protocols should increase the accessibility of proteins containing multiple staples designed with the present methodology.

R-GPS was able to identify a small set of structural modifications (i.e., four and five mutations for sMb10 and sMb13, respectively) capable of conferring these carbene transferase enzymes significantly increased stability against thermal and chemical denaturation without impairing their catalytic activity and subtle functional properties such as stereoselectivity. The improved stability properties of these biocatalysts translated into improved performance and stereoselectivity in reactions conducted in high concentrations of organic solvents, also enabling the more efficient transformation of a water-insoluble olefin substrate. Since R-GPS involves a small set of amino acid substitutions, most of the protein residues remain available for stabilization based on optimization of interactions in the native state. Thus, it should be possible to combine R-GPS with other computational design (13, 14, 17) and/or experimental methods (46) to achieve additive or synergistic effects in protein/enzyme stabilization. The minimally invasive approach to protein stabilization described here is expected to provide a valuable strategy for greatly enhancing the stability of enzymes and proteins without impacting their functional properties.

Materials and Methods

See SI Appendix for detailed descriptions of computational methods and experimental procedures and for additional data: NAC analyses (SI Appendix, Figs. S1 and S2 and Table S1); UV-visible and CD spectra (SI Appendix, Figs. S4 and S8), protein gels (SI Appendix, Fig. S5), MS spectra (SI Appendix, Fig. S6), thermal and chemical denaturation curves (SI Appendix, Figs. S7 and S9), and reaction characterization data (SI Appendix, Figs. S11–S13).

Supplementary Material

Supplementary File

Acknowledgments

We thank David Vargas for help with the characterization of 3b, Daniel Greenfield for technical assistance, and Dr. Jermaine Jenkins (Structure Biology & Biophysics Facility, University of Rochester) for assistance with the CD instrumentation. This work was supported by National Institute of Health Grant GM098628 (to R.F.) and National Science Foundation Grants MCB1716623 and MCB1330760 (to S.D.K.). MS instrumentation at the University of Rochester was supported by National Science Foundation Grants CHE-0840410 and CHE-0946653. E.J.M. acknowledges support from NIH Graduate Training Grant T32GM118283.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1708907114/-/DCSupplemental.

References

  • 1.Bornscheuer UT, et al. Engineering the third wave of biocatalysis. Nature. 2012;485:185–194. doi: 10.1038/nature11117. [DOI] [PubMed] [Google Scholar]
  • 2.Clouthier CM, Pelletier JN. Expanding the organic toolbox: A guide to integrating biocatalysis in synthesis. Chem Soc Rev. 2012;41:1585–1605. doi: 10.1039/c2cs15286j. [DOI] [PubMed] [Google Scholar]
  • 3.Bloom JD, Labthavikul ST, Otey CR, Arnold FH. Protein stability promotes evolvability. Proc Natl Acad Sci USA. 2006;103:5869–5874. doi: 10.1073/pnas.0510098103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gershenson A, Arnold FH. Enzyme stabilization by directed evolution. Genet Eng (N Y) 2000;22:55–76. doi: 10.1007/978-1-4615-4199-8_5. [DOI] [PubMed] [Google Scholar]
  • 5.Lehmann M, Wyss M. Engineering proteins for thermostability: The use of sequence alignments versus rational design and directed evolution. Curr Opin Biotechnol. 2001;12:371–375. doi: 10.1016/s0958-1669(00)00229-9. [DOI] [PubMed] [Google Scholar]
  • 6.Bommarius AS, Paye MF. Stabilizing biocatalysts. Chem Soc Rev. 2013;42:6534–6565. doi: 10.1039/c3cs60137d. [DOI] [PubMed] [Google Scholar]
  • 7.Tracewell CA, Arnold FH. Directed enzyme evolution: Climbing fitness peaks one amino acid at a time. Curr Opin Chem Biol. 2009;13:3–9. doi: 10.1016/j.cbpa.2009.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Magliery TJ. Protein stability: Computation, sequence statistics, and new experimental methods. Curr Opin Struct Biol. 2015;33:161–168. doi: 10.1016/j.sbi.2015.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Malakauskas SM, Mayo SL. Design, structure and stability of a hyperthermophilic protein variant. Nat Struct Biol. 1998;5:470–475. doi: 10.1038/nsb0698-470. [DOI] [PubMed] [Google Scholar]
  • 10.Korkegian A, Black ME, Baker D, Stoddard BL. Computational thermostabilization of an enzyme. Science. 2005;308:857–860. doi: 10.1126/science.1107387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Borgo B, Havranek JJ. Automated selection of stabilizing mutations in designed and natural proteins. Proc Natl Acad Sci USA. 2012;109:1494–1499. doi: 10.1073/pnas.1115172109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Li Y, et al. A diverse family of thermostable cytochrome P450s created by recombination of stabilizing fragments. Nat Biotechnol. 2007;25:1051–1056. doi: 10.1038/nbt1333. [DOI] [PubMed] [Google Scholar]
  • 13.Goldenzweig A, et al. Automated structure- and sequence-based design of proteins for high bacterial expression and stability. Mol Cell. 2016;63:337–346. doi: 10.1016/j.molcel.2016.06.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bednar D, et al. FireProt: Energy- and evolution-based computational design of thermostable multiple-point mutants. PLOS Comput Biol. 2015;11:e1004556. doi: 10.1371/journal.pcbi.1004556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Gribenko AV, et al. Rational stabilization of enzymes by computational redesign of surface charge-charge interactions. Proc Natl Acad Sci USA. 2009;106:2601–2606. doi: 10.1073/pnas.0808220106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bjørk A, Dalhus B, Mantzilas D, Sirevåg R, Eijsink VG. Large improvement in the thermal stability of a tetrameric malate dehydrogenase by single point mutations at the dimer-dimer interface. J Mol Biol. 2004;341:1215–1226. doi: 10.1016/j.jmb.2004.06.079. [DOI] [PubMed] [Google Scholar]
  • 17.Wijma HJ, et al. Computationally designed libraries for rapid enzyme stabilization. Protein Eng Des Sel. 2014;27:49–58. doi: 10.1093/protein/gzt061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Van den Burg B, Vriend G, Veltman OR, Venema G, Eijsink VG. Engineering an enzyme to resist boiling. Proc Natl Acad Sci USA. 1998;95:2056–2060. doi: 10.1073/pnas.95.5.2056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zou J, Song B, Simmerling C, Raleigh D. Experimental and computational analysis of protein stabilization by Gly-to-d-Ala substitution: A convolution of native state and unfolded state effects. J Am Chem Soc. 2016;138:15682–15689. doi: 10.1021/jacs.6b09511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Dombkowski AA, Sultana KZ, Craig DB. Protein disulfide engineering. FEBS Lett. 2014;588:206–212. doi: 10.1016/j.febslet.2013.11.024. [DOI] [PubMed] [Google Scholar]
  • 21.Matsumura M, Signor G, Matthews BW. Substantial increase of protein stability by multiple disulphide bonds. Nature. 1989;342:291–293. doi: 10.1038/342291a0. [DOI] [PubMed] [Google Scholar]
  • 22.Liu T, et al. Enhancing protein stability with extended disulfide bonds. Proc Natl Acad Sci USA. 2016;113:5910–5915. doi: 10.1073/pnas.1605363113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sanchez-Romero I, et al. Mechanism of protein kinetic stabilization by engineered disulfide crosslinks. PLoS One. 2013;8:e70013. doi: 10.1371/journal.pone.0070013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Tyagi V, Bonn RB, Fasan R. Intermolecular carbene S-H insertion catalysed by engineered myoglobin-based catalysts. Chem Sci (Camb) 2015;6:2488–2494. doi: 10.1039/c5sc00080g. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Carninci P, et al. Thermostabilization and thermoactivation of thermolabile enzymes by trehalose and its application for the synthesis of full length cDNA. Proc Natl Acad Sci USA. 1998;95:520–524. doi: 10.1073/pnas.95.2.520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tyagi V, Fasan R. Myoglobin-catalyzed olefination of aldehydes. Angew Chem Int Ed Engl. 2016;55:2512–2516. doi: 10.1002/anie.201508817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Tanghe M, et al. Disulfide bridges as essential elements for the thermostability of lytic polysaccharide monooxygenase LPMO10C from Streptomyces coelicolor. Protein Eng Des Sel. 2017;30:401–408. doi: 10.1093/protein/gzx014. [DOI] [PubMed] [Google Scholar]
  • 28.Bionda N, Cryan AL, Fasan R. Bioinspired strategy for the ribosomal synthesis of thioether-bridged macrocyclic peptides in bacteria. ACS Chem Biol. 2014;9:2008–2013. doi: 10.1021/cb500311k. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Bordeaux M, Tyagi V, Fasan R. Highly diastereoselective and enantioselective olefin cyclopropanation using engineered myoglobin-based catalysts. Angew Chem Int Ed Engl. 2015;54:1744–1748. doi: 10.1002/anie.201409928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Bajaj P, Sreenilayam G, Tyagi V, Fasan R. Gram-scale synthesis of chiral cyclopropane-containing drugs and drug precursors with engineered myoglobin catalysts featuring complementary stereoselectivity. Angew Chem Int Ed Engl. 2016;55:16110–16114. doi: 10.1002/anie.201608680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Tyagi V, Sreenilayam G, Bajaj P, Tinoco A, Fasan R. Biocatalytic synthesis of allylic and allenyl sulfides through a myoglobin-catalyzed Doyle-Kirmse reaction. Angew Chem Int Ed Engl. 2016;55:13562–13566. doi: 10.1002/anie.201607278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Bionda N, Fasan R. Ribosomal synthesis of natural-product-like bicyclic peptides in Escherichia coli. ChemBioChem. 2015;16:2011–2016. doi: 10.1002/cbic.201500179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Betz SF. Disulfide bonds and the stability of globular-proteins. Protein Sci. 1993;2:1551–1558. doi: 10.1002/pro.5560021002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Zhang T, Bertelsen E, Alber T. Entropic effects of disulphide bonds on protein stability. Nat Struct Biol. 1994;1:434–438. doi: 10.1038/nsb0794-434. [DOI] [PubMed] [Google Scholar]
  • 35.Flory PJ. Theory of elastic mechanisms in fibrous proteins. J Am Chem Soc. 1956;78:5222–5234. [Google Scholar]
  • 36.Chan HS, Dill KA. Intrachain loops in polymers–Effects of excluded volume. J Chem Phys. 1989;90:492–509. [Google Scholar]
  • 37.Zanghellini A, et al. New algorithms and an in silico benchmark for computational enzyme design. Protein Sci. 2006;15:2785–2794. doi: 10.1110/ps.062353106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kuhlman B, Baker D. Native protein sequences are close to optimal for their structures. Proc Natl Acad Sci USA. 2000;97:10383–10388, and erratum (2000) 97:13460. doi: 10.1073/pnas.97.19.10383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Liu CC, Schultz PG. Adding new chemistries to the genetic code. Annu Rev Biochem. 2010;79:413–444. doi: 10.1146/annurev.biochem.052308.105824. [DOI] [PubMed] [Google Scholar]
  • 40.Deiters A, Schultz PG. In vivo incorporation of an alkyne into proteins in Escherichia coli. Bioorg Med Chem Lett. 2005;15:1521–1524. doi: 10.1016/j.bmcl.2004.12.065. [DOI] [PubMed] [Google Scholar]
  • 41.Nagatani RA, Gonzalez A, Shoichet BK, Brinen LS, Babbitt PC. Stability for function trade-offs in the enolase superfamily “catalytic module”. Biochemistry. 2007;46:6688–6695. doi: 10.1021/bi700507d. [DOI] [PubMed] [Google Scholar]
  • 42.Hamamatsu N, et al. Directed evolution by accumulating tailored mutations: Thermostabilization of lactate oxidase with less trade-off with catalytic activity. Protein Eng Des Sel. 2006;19:483–489. doi: 10.1093/protein/gzl034. [DOI] [PubMed] [Google Scholar]
  • 43.Zhong CQ, et al. Improvement of low-temperature caseinolytic activity of a thermophilic subtilase by directed evolution and site-directed mutagenesis. Biotechnol Bioeng. 2009;104:862–870. doi: 10.1002/bit.22473. [DOI] [PubMed] [Google Scholar]
  • 44.Savile CK, et al. Biocatalytic asymmetric synthesis of chiral amines from ketones applied to sitagliptin manufacture. Science. 2010;329:305–309. doi: 10.1126/science.1188934. [DOI] [PubMed] [Google Scholar]
  • 45.Fox RJ, et al. Improving catalytic function by ProSAR-driven enzyme evolution. Nat Biotechnol. 2007;25:338–344. doi: 10.1038/nbt1286. [DOI] [PubMed] [Google Scholar]
  • 46.Wong TS, Arnold FH, Schwaneberg U. Laboratory evolution of cytochrome p450 BM-3 monooxygenase for organic cosolvents. Biotechnol Bioeng. 2004;85:351–358. doi: 10.1002/bit.10896. [DOI] [PubMed] [Google Scholar]
  • 47.Chauret N, Gauthier A, Nicoll-Griffith DA. Effect of common organic solvents on in vitro cytochrome P450-mediated metabolic activities in human liver microsomes. Drug Metab Dispos. 1998;26:1–4. [PubMed] [Google Scholar]
  • 48.Abdeljabbar DM, Piscotta FJ, Zhang S, James Link A. Protein stapling via azide-alkyne ligation. Chem Commun (Camb) 2014;50:14900–14903. doi: 10.1039/c4cc06528j. [DOI] [PubMed] [Google Scholar]
  • 49.Furman JL, et al. A genetically encoded aza-Michael acceptor for covalent cross-linking of protein-receptor complexes. J Am Chem Soc. 2014;136:8411–8417. doi: 10.1021/ja502851h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Xiang Z, et al. Proximity-enabled protein crosslinking through genetically encoding haloalkane unnatural amino acids. Angew Chem Int Ed Engl. 2014;53:2190–2193. doi: 10.1002/anie.201308794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Chen XH, et al. Genetically encoding an electrophilic amino acid for protein stapling and covalent binding to native receptors. ACS Chem Biol. 2014;9:1956–1961. doi: 10.1021/cb500453a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Xuan W, Shao S, Schultz PG. Protein crosslinking by genetically encoded noncanonical amino acids with reactive aryl carbamate side chains. Angew Chem Int Ed Engl. 2017;56:5096–5100. doi: 10.1002/anie.201611841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Guo J, Melançon CE, 3rd, Lee HS, Groff D, Schultz PG. Evolution of amber suppressor tRNAs for efficient bacterial production of proteins containing nonnatural amino acids. Angew Chem Int Ed Engl. 2009;48:9148–9151. doi: 10.1002/anie.200904035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Johnson DBF, et al. RF1 knockout allows ribosomal incorporation of unnatural amino acids at multiple sites. Nat Chem Biol. 2011;7:779–786. doi: 10.1038/nchembio.657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Lajoie MJ, et al. Genomically recoded organisms expand biological functions. Science. 2013;342:357–360. doi: 10.1126/science.1241459. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES