Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2023 Mar 22;145(13):7159–7165. doi: 10.1021/jacs.2c12004

MicroED Structure of a Protoglobin Reactive Carbene Intermediate

Emma Danelius †,, Nicholas J Porter §, Johan Unge , Frances H Arnold §, Tamir Gonen †,‡,∥,*
PMCID: PMC10080679  PMID: 36948184

Abstract

graphic file with name ja2c12004_0006.jpg

Microcrystal electron diffraction (MicroED) is an emerging technique that has shown great potential for describing new chemical and biological molecular structures. Several important structures of small molecules, natural products, and peptides have been determined using ab initio methods. However, only a couple of novel protein structures have thus far been derived by MicroED. Taking advantage of recent technological advances, including higher acceleration voltage and using a low-noise detector in counting mode, we have determined the first structure of an Aeropyrum pernix protoglobin (ApePgb) variant by MicroED using an AlphaFold2 model for phasing. The structure revealed that mutations introduced during directed evolution enhance carbene transfer activity by reorienting an α helix of ApePgb into a dynamic loop, making the catalytic active site more readily accessible. After exposing the tiny crystals to the substrate, we also trapped the reactive iron-carbenoid intermediate involved in this engineered ApePgb’s new-to-nature activity, a challenging carbene transfer from a diazirine via a putative metallo-carbene. The bound structure discloses how an enlarged active site pocket stabilizes the carbene bound to the heme iron and, presumably, the transition state for the formation of this key intermediate. This work demonstrates that improved MicroED technology and the advancement in protein structure prediction now enable investigation of structures that was previously beyond reach.

Introduction

The identification of novel enzymes through protein engineering and directed evolution has made biocatalysis a competitive tool in modern organic synthesis.1 Heme enzymes are particularly interesting due to their ability to form and transfer reactive carbene and nitrene intermediates to effect transformations not known in biology and sometimes not even known in chemical catalysis.24 Although many new-to-nature heme enzymes have been described, with a wide diversity in their synthetic products, the structural rationale behind these advancements is still missing. Describing the short-lived reactive intermediates of these reactions is of great interest for the development of future biocatalysts, but this has proven very challenging. Using X-ray crystallography, only two carbene-bound intermediates have been reported: a carbene-bound Rhodothermus marinus cytochrome c variant5 and a myoglobin heme-iron-carbenoid complex, which was observed in a nonreactive configuration.6 In many cases, large, well-ordered crystals of the enzymes and their complexes are not accessible, so other methods able to handle much smaller crystals are needed.

MicroED is a cryo-electron microscopy (cryo-EM) method, which has been developed during the last decade79 and has contributed many structures ranging from small molecules10 and peptides11 to both soluble12 and membrane proteins.13 In MicroED, continuous rotation electron diffraction data is collected from three-dimensional crystals using a transmission electron microscope (TEM) under cryogenic conditions. The crystals are typically a billionth the size of crystals used for X-ray diffraction; hence, structures of new and important targets which have been out of reach due to challenges in crystal growth can be determined.14 As in X-ray diffraction experiments, the intensities of the diffracted beams are directly recorded, while the phases also used to model the crystal content need to be derived by other means. At atomic resolution, phases can be estimated directly from the intensities computationally by ab initio methods. Several novel small molecules, peptides, and natural products have been solved by MicroED using ab initio phasing, including the sub-ångström structure of the prion proto-PrPSc peptide,15 the antibiotic macrocycle thiostrepton1010 and the chemotherapeutic teniposide.16 Further, radiation damage-induced phasing has been shown previously for MicroED data,17 and isomorphous replacement, while theoretically possible, has yet to be demonstrated effectively. Recently, even for macromolecular structures, ab initio phasing was demonstrated with the sub-ångström resolution structure of triclinic lysozyme.18 However, like with X-ray crystallography, the most common method to derive initial phases for macromolecular MicroED structure determination is molecular replacement (MR), which relies on a starting homologous model. The model is typically a similar protein with a known structure, and the phases can be calculated after its position and orientation are found within the crystal. Due to the growing number of known structures deposited to the PDB as well as computational improvements, MR usage has increased from 50% in 2000 to 80% in 2022, and as such, MR is the first choice in most cases for both X-ray crystallography and macromolecular MicroED. A couple of novel protein structures have been solved by MicroED with MR using the known structure of the wild-type homologue, including a novel mutant of the murine voltage-dependent anion channel at 3.1 Å resolution,19 and the 3.0 Å structure R2lox,20 but MR remains challenging in cases where structures of closely related homologues are not available.

Recent advances in protein structure prediction can enable MR where experimentally determined structures fail or otherwise are unavailable. The possibility to generate ab initio models without closely related homologues took a leap in the 14th Critical Assessment of Structure Prediction (CASP14) with the emergence of the deep learning method implemented in AlphaFold2. On the provided test sets, the peptide backbone atom positions could be predicted accurately to within 1 Å. This accuracy meets the requirement for MR when the diffracting resolution is better than 3 Å.21 AlphaFold2 or RosettaMR has already been used to generate a starting model for successfully phasing X-ray data, where no experimental structure was available.2224 However, this approach has not been successfully applied to MicroED data.

Here, we present the previously unknown structure of Aeropyrum pernix protoglobin (ApePgb) determined by MicroED in two different states: resting state and with the reactive intermediate carbene bound following chemical activation of the reaction. The ApePgb structure described herein is an engineered variant for which no wild-type structure has been experimentally determined. The crystals formed as long and thin plates that were brittle and challenging to isolate; despite significant efforts, the structure could not be obtained by synchrotron X-ray crystallography as very weak, or no diffraction was observed. The structure was obtained using the latest MicroED technology, including a cryo-TEM operating at 300 kV acceleration with parallel illumination, data collection on a direct electron detector operating in counting mode, and cryogenic preservation. The resting state structure was solved by molecular replacement against a computationally generated model from AlphaFold2. As compared to our previously described protoglobin (7UTE), we have added additional data and, after further refinement, could resolve loops B and C. Following exposure of the crystals to reaction-like conditions, the same methodology was used to capture and determine the structure of the carbene-bound reactive intermediate of ApePgb by MicroED. This is, to the best of our knowledge, the first example of a protein structure bound to an aryl-carbene intermediate. This demonstrates the feasibility of using ab initio generated protein models for MR in MicroED at the resolution most well represented by protein structures in the PDB and that MicroED can now contribute novel protein structures, including those of short-lived reactive intermediates that were previously beyond technological reach.

Results and Discussion

Protoglobins are small dimeric heme proteins found in Archaea that are presumed to naturally function as gas binders/sensors.25 These proteins have recently gained attention as engineered carbene transfer biocatalysts that can use either diazo compounds26 or diazirines27 as carbene precursors. Notably, the recent report of diazirine activation (Figure 1) represents the first example of catalytic activation and subsequent carbene transfer from these species. Characterizing the structural details underlying these laboratory-evolved functions can provide deeper insights to guide the future engineering of such biocatalysts.

Figure 1.

Figure 1

Directed evolution of A. pernix protoglobin (ApePgb) converted this gas-binding protein into an enzyme catalyzing cyclopropanation of benzyl acrylate 1 using phenyldiazirine 2 as a carbene source to generate cyclopropane 3. The installation of the 4 mutations shown here, introduced during directed evolution, resulted in a >40-fold increase in activity (right; ApePgb LVQ = ApePgb W59L Y60V F145Q and ApePgb GLVRSQL = ApePgb C45G W59L Y60V V63R C102S F145Q I149L). The reaction scope was further extended to include N–H and Si–H insertion reactions.27

The ApePgb variant GLVRSQL described here was expressed and purified as reported previously.27 Over 500 conditions were screened for crystal formation identifying only one condition that yielded crystals. To interrogate the structural basis for the gain of cyclopropanation activity (Figure 1), we attempted to determine the crystal structure by X-ray diffraction (XRD). However, the crystals were extremely thin and brittle plates that formed in large clusters (Figure 2a), making it difficult to isolate a single and intact crystal for XRD. While screening, isolated crystals diffracted weakly to around 10–12 Å resolution (Figure 2b), proving insufficient for any structural determination. Crystal optimization assays failed to yield better crystals for XRD despite significant efforts. Instead, the plate-like crystals were prepared for MicroED as described below.

Figure 2.

Figure 2

(a) Crystal drop of ApePgb GLVRSQL in 0.4 M sodium phosphate monobasic/1.6 M potassium phosphate dibasic, 0.1 imidazole (pH 8.0), 0.2 M NaCl. (b) XRD, single exposure. (c) MicroED, single exposure. The green circle indicates levels of resolution.

MicroED Grid Preparation and Diffraction Screening

To examine the crystals in the cryo-TEM, the crystalline clusters were broken into smaller crystallite fragments by perturbation using a pipette, and the remaining crystal slurry was transferred to TEM grids inside of a vitrification robot at 4 °C and 90% humidity. The grids were blotted from the back, vitrified by plunging into liquid ethane, and loaded into a Thermo Fisher Talos Arctica under cryogenic conditions for screening. The crystals appeared as thin sheets on the grids under low magnification (Figure S1). Initial diffraction data were collected on a Ceta-D detector as a movie and processed according to standard MicroED procedures.28 However, these plate-like crystals adopted a preferred orientation on the grid, and they had the low symmetry P1 space group that resulted in low completeness, and the overall data quality was insufficient for structural determination.

MicroED Data Collection

The data quality was dramatically improved by turning to higher acceleration voltage (300 kV) and parallel illumination at the Thermo Fisher Titan Krios TEM, and by collecting the data on the Falcon-4 direct electron detector operating in counting mode, which provides a significantly lower background and higher signal-to-noise ratio.18 Whereas scintillator-based cameras, such as the Ceta-D used initially, record the data using integrating mode where the number of electrons is determined by the charge accumulated in a pixel during a readout cycle, direct electron detectors such as the Falcon-4 can be used in counting mode where they detect individual electrons, leading to increased accuracy and higher data quality. The higher acceleration voltage also allowed us to interrogate slightly thicker crystals, further enhancing the signal. Compared with the Ceta-D, the increased sensitivity of the Falcon-4 detector allows more information to be recorded for an identical exposure. In addition, with a faster readout, more fine-sliced data can be collected, further reducing the background for high-resolution reflections. In this case, 840 frames were collected from each crystal on the Falcon-4, as compared with only 160 frames on the Ceta-D for the same exposure. The crystals were continuously rotated (0.15° s–1) in the electron beam during the exposure, covering the complete angular range of the stage. The merged data from these experiments yielded about 75% completeness with reasonable merging statistics, even with P1 symmetry (Table S1). The continuous-rotation MicroED data were converted to SMV format using an in-house software that is freely available.29

MicroED Data Processing

Data were indexed and integrated in XDS, as described previously for MicroED.18 The integrated data were indexed in space group P1 with unit cell dimensions (a, b, c) = 46.2, 58.3, 80.7 Å, and angles (α, β, γ) = 104.1, 98.6, 90.1°. Scaling and merging in AIMLESS30 yielded a data set to 2.1 Å resolution with an overall completeness of ∼75% at a CC1/2/Rmerge of 96.5/0.19. Initially, phasing was attempted by MR using the structure of a homologue of ApePgb GLVRSQL, Y61A Methanosarcina acetivorans protoglobin (MaPgb Y61A; 56% identity, PDB 3ZJI).31 However, when no reasonable solution was obtained with this starting model, we redirected our efforts toward predicted models; the sequence of ApePgb GLVRSQL was subjected to structure prediction with AlphaFold2 using the ColabFold environment.32 The generated model was then used as a search model in Phaser to provide a preliminary phase solution for the MicroED data. The best solution with an LLG value of >1300 found 4 monomers in the unit cell. Atomic models were refined with electron scattering factors in Phenix Refine33 using automated solvent modeling. Several rounds of refinement resulted in a Rwork/Rfree of 0.19/0.22. In addition to the protein with the expected heme groups, the final model includes an imidazole molecule bound to the Fe of the heme in each chain, as well as about 170 water molecules.

Structure Analysis ApePgb GLVRSQL

The structure of ApePgb GLVRSQL (Figure 3) has 7 mutations installed during directed evolution as compared to the wild-type sequence: C45G, W59L, Y60V, V63R, C102S, F145Q, and I149L. Most of the mutations are near the active site and affect the internal surface. The structure adopts an expanded version of the 3/3 helical sandwich typical of “classical” globins, with an additional N-terminal extension followed by the Z-helix, helping the formation of the homodimer.31 The dimer is built by the G- and H-helices, creating a four-helix bundle for the two subunits. The alignment of the AlphaFold2 model and ApePgb GLVRSQL is presented in Figure 3a, and the alignment of the closest homologue MaPgb Y61A with the sequence alignment is presented in Figure S2. The ab initio model from AlphaFold2 resulted in slightly smaller overall differences to ApePgb (r.m.s.d. 1.57 Å versus 1.90 Å for MaPgb Y61A, for details, see Table S3 and Figure 2). When compared to the standard protoglobin fold observed in MaPgb Y61A, the major difference is the disruption of the B helix between residues 60 and 70. These residues adopt a rigid helical conformation in AlphaFold2 and MaPgb but are found to be restructured as a loop in ApePgb GLVRSQL (Figure 3b). Given that the Y61A mutation in MaPgb does not alter the helical conformation of this region and there is no substantial deviation in the wild-type sequences before the B helix terminus, it is reasonable that this helix would still be present in wild-type ApePgb. For the ApePgb GLVRSQL structure with four chains (A–D), the backbone of the disrupted B helix 60–70 is observed for chains A–C, but the density is substantially weaker than surrounding regions (Figure S3). In chain D, residues of the disrupted B helix 60–70 could not be modeled. The weaker electrostatic potential map, the slight differences between chains (Figure S3), and the B-factors of the loop (Figure S4) suggest that this region is flexible and thus might be able to adopt different conformations in solution. It is likely that the observed structural change stems from the mutation V63R, which resulted in a 14-fold boost in product yield for the cyclopropanation reaction (Figure 1), the largest improvement from any single mutation during enzyme engineering.27 In MaPgb, V63 is pointing toward the active site, where the natural substrate is two atoms only. The inclusion of the much bulkier arginine residue in this position is difficult to model within the structure of ApePgb GLVRSQL without major rearrangements. The AlphaFold2-predicted model of ApePgb GLVRSQL incorrectly orients R63 into the enzyme active site (Figure 3b), similar to the MaPgb structure. The limited space at the active site, together with the repulsion effect between the positively charged iron and arginine, could be the reason for breaking up the helical conformation in this region to produce a conformation, where R63 is instead positioned at the surface pointing outward (Figure 3b). Thus, the effects of unfavorable steric and electrostatic interactions between the heme and positively charged arginine side chain are presumed to drive the rearrangement of residues 60–70, truncating the B helix in this variant and expanding the active site cavity (Figure 3c,d).

Figure 3.

Figure 3

MicroED structure of ApePgb GLVRSQL: (a) Superposition of the structures of ApePgb GLVRSQL (blue) with the AlphaFold2 model (white), with helices labeled according to convention for the protoglobin fold. (b) Close-up of the structures of ApePgb GLVRSQL (blue) with the AlphaFold2 model (white) showing the unwinding of the B helix into a dynamic loop, creating a larger cavity around the active site with increased access to the heme. (c, d) Clipped surface of the AlphaFold2 model (white) and ApePgb GLVRSQL (blue) showing the effects of the unwinding of the helix and the rearrangement of the B/G interface leading to easier substrate access from outside in addition to the increased space available at the heme.

The solvent-inaccessible heme is buried in the protein matrix (Figure 3c,d). This feature is in contrast to most members of the globin family structures, where about 30% of the heme would be surface-accessible.25 In natural protoglobins, the diatomic substrates access the active site through two small orthogonal apolar tunnels defined at the interfaces of the B/E and B/G helices. In ApePgb GLVRSQL, however, the rearrangement of the final turns of helix B obstructs the B/E tunnel through interactions with the main chain and the W62 side chain, resulting in a broadening of the B/G tunnel (Figure 3d). This larger tunnel is presumed to increase diffusion in the active site and allow the entry of larger ligands than the natural diatomic substrates, such as the diazirine and acrylate substrates, targeted in directed evolution (Figure 1). In fact, the bulky benzene moiety is too large to fit into the tunnels present in the AlphaFold2 model, and it seems likely that the drastic expansion of the access pathway is necessary for the passage of the substrate. Molecular dynamics simulations suggest that F145 controls the accessibility of B/G tunnel, though this has yet to be validated experimentally.25 The corresponding mutated amino acid Q145, as well as L149, in the engineered ApePgb lines the expanded tunnel and could reasonably affect the affinity or orientation of the substrate. The F145Q mutation remained throughout the directed evolution of the enzyme despite screening of mutations at this position, and I149L doubled the biosynthetic yield of cyclopropane 3, underscoring the importance of these mutations to the new activity. The mutations G45 and S102 are both located at the surface of the protein. These mutations remove the cysteine residues in the wild-type sequence. Since these cysteines are located close to one another in space in MaPgb (5.6 Å Cα–Cα separation), they are potentially capable of forming a disulfide between the A and E helices in wild-type ApePgb.

Data Collection of Substrate-Bound ApePgb GLVRSQL

Nanocrystals allow efficient and homogeneous diffusion of small molecules, giving a fast and convenient way for the determination of ligand-bound complexes, in contrast to a time-consuming and often inaccessible co-crystallization approach. This is especially essential for ligands or intermediates with a limited half-life in solution. Hence, MicroED shows potential for structural determination of reactive intermediates in enzyme-catalyzed reactions. To investigate the reactive intermediate of the reaction shown in Figure 1, the crystal fragments were soaked with carbene precursor 2 (phenyldiazirine, Figure 1) according to previously described protocols.5 Following 15 min of soaking and the addition of sodium dithionite to mirror reaction-like conditions, the grids were prepared, and data were collected, as described above. The integrated data in this case were indexed in space group P121 with unit cell dimensions (a, b, c) = 58.15, 45.89, 71.71 Å, and angles (α, β, γ) = 90.00, 105.42, 90.00°. Scaling and merging in AIMLESS30 to 2.5 Å resolution gave a data set with overall completeness of 72% and a CC1/2/Rmerge of 0.97/0.23. The data were phased by molecular replacement in Phaser using chain A of the ApePgb GLVRSQL described here. The solution found was top-ranked with an LLG of 4002, containing 2 monomers in the asymmetric unit. The structure was further refined in Phenix Refine33 to a Rwork/Rfree of 0.23/0.28. The final model was derived by altering the angle and distance describing the carbene interaction until the lowest Rfree value was obtained.

Structure Analysis of the Metallo-Carbene Structure

Carbene transfer from a diazirine is thought to involve the formation of a putative reactive iron heme-carbene intermediate, which transfers the carbene to a second substrate, followed by product release and regeneration of the catalyst.27 In the metallo-carbene structure described here, the observed overall fold for the carbene-bound ApePgb GLVRSQL is the same as for the unbound, with the only small differences observed in the 60–70 loop region (Figure 4a). Interestingly, the MicroED density of this loop is a little more defined than in unbound ApePgb, which might indicate that the loop rigidifies upon substrate binding (Figure S5). In particular, residue W62 is better described by density. ApePgb GLVRSQL was engineered for activation of a benzene-substituted diazirine (Figure 1), a much larger molecule than any natural protoglobin substrate. As discussed above, increased diffusion into and out of the active site and accommodation of larger substrates near the heme cofactor likely play a significant role in the improved activity of ApePgb GLVRSQL. For example, amino acids L59 and V60 are both located in the active site, with their side chains pointing toward the binding area on the distal side of the heme group (Figure 4b). Both the selected mutations W59L and Y60V introduce substantially smaller side chains, forming a larger cavity between the heme and the B helix. The main chain conformations for residues 59 and 60 in ApePgb GLVRSQL and the AlphaFold2 model are similar, and modeling of the original residues, W59 and Y60, discloses significant steric clashes of such residues with the aryl ring of the carbene, suggesting that there would not be sufficient room for this intermediate in the wild-type protoglobin. Further, the side chain of F93 adjusts slightly to form a pi-stacking interaction with the phenyl group of the carbene (Figure 4b). These intermolecular interactions likely stabilize the binding and orientation of the phenyl carbene, each contributing to the improved reactivity gained through evolution. The observed MicroED density and the occupancy of the carbene suggest a single carbene species as the dominant form, where the rate of carbene formation is greater than the rate of carbene decay in the absence of the second substrate.

Figure 4.

Figure 4

MicroED structure of carbene-bound ApePgb GLVRSQL. (a) Superposition of the structures of ApePgb GLVRSQL (blue) and the carbene-bound intermediate (orange). (b) Polder omit map (2.5σ) for the metallo-carbene complex (orange), also including the heme from unbound ApePgb GLVRSQL (blue) for comparison. (c) Clipped surface of the carbene-bound ApePgb GLVRSQL, showing a decrease in the size of the B/G channel as compared to the unbound ApePgb GLVRSQL.

While it is clear that mutations in ApePgb GLVRSQL have reshaped the active site, the heme exhibits a similar ruffled distortion to that observed in MaPgb and other protoglobins (Figure S2).25 Out-of-plane distortions to the porphyrin ring are known to alter the electrostatic and ligand-binding properties of the bound iron,34 but the specific changes associated with any specific distortion are challenging to measure and remain unclear. When comparing the ApePgb GLVRSQL with the carbene-bound intermediate, the heme in the ligand-bound variant is further distorted, as illustrated by measuring the angle deviation from the coordination plane. This comparison of heme ring conformations is given in Figure S7. The position of the carbene that resulted in the lowest Rfree in similar refinement rounds is found at a distance of 1.74 Å (Fe–C1) and at an angle of about 128° (Fe–C1–C2). These values are comparable to the previously determined protein structure describing a heme-carbene complex,5 as well as an iron porphyrin X-ray structure in which a diaryl-carbene is bound to the Fe atom.35 When comparing the B/G helix interfaces of the unbound and carbene-bound states, it seems that binding the substrate has closed the passage slightly (Figure 4c), which coincides with the observation that the residues around the active site and at the solvent tunnel are less dynamic when the substrate is bound. This change is also observable in the B-factor gradient (Figure S4). The efficiency of enzymes in accelerating chemical reactions is explained by both their ability to preorganize the active site for transition state stabilization36 as well as sample the conformational ensemble required for substrate binding, reaction, and product release.37 For this, some inherent flexibility of the enzyme structure is required. The increase in flexibility observed for ApePgb GLVRSQL can enable the enzyme to adopt the conformations important for the different processes. The following observed rigidification upon binding the substrate might function to preorganize the active site for transition state stabilization. Notably, donor-substituted carbenes are known to be short-lived and highly reactive.38 That such a sensitive intermediate can be trapped and observed by MicroED underscores the value of this technique and the insights it can provide into such systems. The homogeneity of the bound intermediate within the crystal is likely enhanced by the improved diffusion and smaller sample size inherent to microcrystalline samples, providing a better context for the atomic details underlying the enzyme chemistry. The atomic details underlying the engineered carbene transfer chemistry developed in these protoglobins will serve to guide future enzyme engineering, leading to the further development of future biocatalysts.

Conclusions

In conclusion, comparisons to both the experimental structure of the related MaPgb as well as the predicted AlphaFold2 model show good overall agreement. It highlights the significance of the disruption introduced into the B helix region of the protoglobin fold and implicates the V63R mutation as a factor in this structural change. The broadening of the active site access tunnel relates well to the increased reaction rates observed for this variant. In modern crystallography, most protein structures are phased by molecular replacement using a related model from the protein structure data bank. To date, structure determination using MicroED in the absence of a reasonable search model has been set back due to the lack of experimental phasing techniques analogous to anomalous scattering in X-ray crystallography. We present the determination of a novel structure that could be solved by molecular replacement made possible by an ab initio generated model from AlphaFold2 in concert with higher-quality data accessible due to advanced detector development and a high-voltage electron microscope. We further used this technology to investigate the formation of the reactive metallo-carbene and describe the first structure of an aryl-carbene intermediate in a protein structure. As the crystals used in this study were not amenable to X-ray diffraction, this example adds an important tool for the determination of highly sought protein structures.

Acknowledgments

E.D. thanks the Wenner-Gren Foundations for their support through the Wenner-Gren Postdoctoral Fellowship. This study was supported by the National Institutes of Health P41GM136508. The Gonen laboratory is supported by funds from the Howard Hughes Medical Institute. N.J.P. thanks Merck and the Helen Hay Whitney Foundation for their support through the Merck-Helen Hay Whitney Foundation Postdoctoral Fellowship. This publication is based on work supported by the United States Army Research Office under Contract W911NF-19-0026 for the Institute for Collaborative Biotechnologies and the G. Harold and Leila Y. Mathers Charitable Foundation.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/jacs.2c12004.

  • Methods, MicroED crystallographic tables, and supplementary figures (PDF)

The authors declare no competing financial interest.

Supplementary Material

ja2c12004_si_001.pdf (1.4MB, pdf)

References

  1. Bell E. L.; Finnigan W.; France S. P.; et al. Biocatalysis. Nat. Rev. Methods Primers 2021, 1, 46. 10.1038/s43586-021-00044-z. [DOI] [Google Scholar]
  2. Yang Y.; Arnold F. H. Navigating the unnatural reaction space: Directed evolution of heme proteins for selective carbene and nitrene transfer. Acc. Chem. Res. 2021, 54, 1209–1225. 10.1021/acs.accounts.0c00591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Miller D. C.; Lal R. G.; Marchetti L. A.; Arnold F. H. Biocatalytic one-carbon ring expansion of aziridines to azetidines via a highly enantioselective [1,2]-stevens rearrangement. J. Am. Chem. Soc. 2022, 144, 4739–4745. 10.1021/jacs.2c00251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Athavale S. V.; Gao S.; Das A.; et al. Enzymatic nitrogen insertion into unactivated C–H bonds. J. Am. Chem. Soc. 2022, 144, 19097–19105. 10.1021/jacs.2c08285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Lewis R. D.; Garcia-Borràs M.; Chalkley M. J.; et al. Catalytic iron-carbene intermediate revealed in a cytochrome c carbene transferase. Proc. Natl. Acad. Sci. U.S.A. 2018, 115, 7308–7313. 10.1073/pnas.1807027115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Hayashi T.; Tinzl M.; Mori T.; et al. Capture and characterization of a reactive haem–carbenoid complex in an artificial metalloenzyme. Nat. Catal. 2018, 1, 578–584. 10.1038/s41929-018-0105-6. [DOI] [Google Scholar]
  7. Shi D.; Nannenga B. L.; Iadanza M. G.; et al. Three-dimensional electron crystallography of protein microcrystals. eLife 2013, 2, e01345 10.7554/eLife.01345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Nannenga B. L.; Shi D.; Leslie A. G. W.; et al. High-resolution structure determination by continuous-rotation data collection in MicroED. Nat. Methods 2014, 11, 927–930. 10.1038/nmeth.3043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Rodriguez J. A.; Ivanova M. I.; Sawaya M. R.; et al. Structure of the toxic core of α-synuclein from invisible crystals. Nature 2015, 525, 486–490. 10.1038/nature15368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Jones C. G.; Martynowycz M. W.; Hattne J.; et al. The CryoEM method MicroED as a powerful tool for small molecule structure determination. ACS Cent. Sci. 2018, 4, 1587–1592. 10.1021/acscentsci.8b00760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Ting C. P.; Funk M. A.; Halaby S. L.; et al. Use of a scaffold peptide in the biosynthesis of amino acid derived natural products. Science 2019, 365, 280–284. 10.1126/science.aau6232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Nannenga B. L.; Gonen T. The cryo-EM method microcrystal electron diffraction (MicroED). Nat. Methods 2019, 16, 369–379. 10.1038/s41592-019-0395-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gallenito M. J.; Gonen T. Studying membrane proteins with MicroED. Biochem. Soc. Trans. 2022, 50, 231–239. 10.1042/BST20210911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Clabbers M. T. B.; Shiriaeva A.; Gonen T. MicroED: conception, practice and future opportunities. IUCrJ 2022, 9, 169–179. 10.1107/S2052252521013063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gallagher-Jones M.; Glynn C.; Boyer D. R.; et al. Sub-ångström cryo-EM structure of a prion protofibril reveals a polar clasp. Nat. Struct. Mol. Biol. 2018, 25, 131–134. 10.1038/s41594-017-0018-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Bruhn J. F.; Scapin G.; Cheng A.; et al. Small molecule microcrystal electron diffraction for the pharmaceutical industry–lessons learned from examining over fifty samples. Front. Mol. Biosci. 2021, 8, 648603 10.3389/fmolb.2021.648603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Martynowycz M. W.; Hattne J.; Gonen T. Experimental phasing of microed data using radiation damage. Structure 2020, 28, 458–464. 10.1016/j.str.2020.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Martynowycz M. W.; Clabbers M. T. B.; Hattne J.; Gonen T. Ab initio phasing macromolecular structures using electron-counted MicroED data. Nat. Methods 2022, 19, 724–729. 10.1038/s41592-022-01485-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Martynowycz M. W.; Khan F.; Hattne J.; Abramson J.; Gonen T. MicroED structure of lipid-embedded mammalian mitochondrial voltage-dependent anion channel. Proc. Natl. Acad. Sci. U.S.A. 2020, 117, 32380–32385. 10.1073/pnas.2020010117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Xu H.; Lebrette H.; Clabbers M. T. B.; et al. Solving a new R2lox protein structure by microcrystal electron diffraction. Sci. Adv. 2019, 5, eaax4621 10.1126/sciadv.aax4621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. McCoy A. J.; Sammito M. D.; Read R. J. Implications of AlphaFold2 for crystallographic phasing by molecular replacement. Acta Crystallogr., Sect. D 2022, 78, 1–13. 10.1107/S2059798321012122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hotta K.; Keegan R. M.; Ranganathan S.; et al. Conversion of a disulfide bond into a thioacetal group during echinomycin biosynthesis. Angew. Chem., Int. Ed. 2014, 53, 824–828. 10.1002/anie.201307404. [DOI] [PubMed] [Google Scholar]
  23. Sjodt M.; Brock K.; Dobihal G.; et al. Structure of the peptidoglycan polymerase RodA resolved by evolutionary coupling analysis. Nature 2018, 556, 118–121. 10.1038/nature25985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Barbarin-Bocahu I.; Graille M.. Artificial intelligence to solve the X-ray crystallography phase problem: a case study report 2021 10.1101/2021.12.14.472726. [DOI] [Google Scholar]
  25. Pesce A.; Bolognesi M.; Nardini M.. Protoglobin: Structure and Ligand-Binding Properties. In Advances in Microbial Physiology; Academic Press, 2013; Chapter 3, pp 79–96. [DOI] [PubMed] [Google Scholar]
  26. Knight A. M.; Kan S. B. J.; Lewis R. D.; et al. Diverse engineered heme proteins enable stereodivergent cyclopropanation of unactivated alkenes. ACS Cent. Sci. 2018, 4, 372–377. 10.1021/acscentsci.7b00548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Porter N. J.; Danelius E.; Gonen T.; Arnold F. H. Biocatalytic carbene transfer using diazirines. J. Am. Chem. Soc. 2022, 144, 8892–8896. 10.1021/jacs.2c02723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Danelius E.; Gonen T.. Protein and Small Molecule Structure Determination by the Cryo-EM Method MicroED. In Structural Proteomics: High-Throughput Methods; Springer: US, 2021; pp 323–342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hattne J.; Reyes F. E.; Nannenga B. L.; et al. MicroED data collection and processing. Acta Crystallogr., Sect. A 2015, 71, 353–360. 10.1107/S2053273315010669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Evans P. R.; Murshudov G. N. How good are my data and what is the resolution?. Acta Crystallogr., Sect. D 2013, 69, 1204–1214. 10.1107/S0907444913000061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Pesce A.; Tilleman L.; Donné J.; et al. Structure and haem-distal site plasticity in methanosarcina acetivorans protoglobin. PLoS One 2013, 8, e66144 10.1371/journal.pone.0066144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Mirdita M.; Schütze K.; Moriwaki Y.; et al. ColabFold: making protein folding accessible to all. Nat. Methods 2022, 19, 679–682. 10.1038/s41592-022-01488-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Afonine P. V.; Grosse-Kunstleve R. W.; Echols N.; et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr., Sect. D 2012, 68, 352–367. 10.1107/S0907444912001308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Bikiel D. E.; Forti F.; Boechi L.; et al. Role of heme distortion on oxygen affinity in heme proteins: the protoglobin case. J. Phys. Chem. B 2010, 114, 8536–8543. 10.1021/jp102135p. [DOI] [PubMed] [Google Scholar]
  35. Li Y.; Huang J.; Zhou Z.; Che C.; You X. Remarkably stable iron porphyrins bearing nonheteroatom-stabilized carbene or (alkoxycarbonyl)carbenes: Isolation, X-ray crystal structures, and carbon atom transfer reactions with hydrocarbons. J. Am. Chem. Soc. 2002, 124, 13185–13193. 10.1021/ja020391c. [DOI] [PubMed] [Google Scholar]
  36. Warshel A.; Sharma P. K.; Kato M.; et al. Electrostatic basis for enzyme catalysis. Chem. Rev. 2006, 106, 3210–3235. 10.1021/cr0503106. [DOI] [PubMed] [Google Scholar]
  37. Boehr D. D.; Nussinov R.; Wright P. E. The role of dynamic conformational ensembles in biomolecular recognition. Nat. Chem. Biol. 2009, 5, 789–796. 10.1038/nchembio.232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Zhu D.; Chen L.; Fan H.; Yao Q.; Zhu S. Recent progress on donor and donor–donor carbenes. Chem. Soc. Rev. 2020, 49, 908–950. 10.1039/C9CS00542K. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ja2c12004_si_001.pdf (1.4MB, pdf)

Articles from Journal of the American Chemical Society are provided here courtesy of American Chemical Society

RESOURCES