Abstract
Introduction:
Macromolecular X-ray crystallography (XRC), nuclear magnetic resonance (NMR), and cryo-electron microscopy (cryoEM) are the primary techniques for determining atomic-level, three-dimensional structures of macromolecules essential for drug discovery. With advancements in artificial intelligence (AI) and cryoEM, the Protein Data Bank (PDB) is solidifying its role as a key resource for 3D macromolecular structures. These developments underscore the growing need for enhanced quality metrics and robust validation standards for experimental structures.
Areas covered:
This review examines recent advancements in cryoEM for drug discovery, analyzing structure quality metrics, resolution improvements, metal-ligand and water molecule identification, and refinement software. It compares cryoEM with other techniques like XRC and NMR, emphasizing the global expansion of cryoEM facilities and its increasing significance in drug discovery.
Expert opinion:
CryoEM is revolutionizing structural biology and drug discovery, particularly for large, complex structures in induced proximity and antibody-antigen interactions. It supports vaccine design, CAR T-cell optimization, gene editing, and gene therapy. Combined with AI, cryoEM enhances particle identification and 3D structure determination. With recent breakthroughs, cryoEM is emerging as a crucial tool in drug discovery, driving the development of new, effective therapies.
Keywords: cryo-electron microscopy, drug discovery, artificial intelligence, machine learning, structure validation, quality metri
1. Introduction
Structural biology, a cornerstone of drug discovery, offers profound insights into the three-dimensional structures of biological macromolecules, including proteins, nucleic acids, and even viruses. By unveiling the precise shape, active sites, and interaction surfaces of target molecules at the atomic level, scientists understand how small molecule agents interact with macromolecules of interest, leading to the design of more potent drugs. Techniques such as X-ray crystallography (XRC), nuclear magnetic resonance spectroscopy (NMR), and cryo-electron microscopy (cryoEM) are instrumental in visualizing these structures, enabling the identification of binding sites for potential drug molecules and the study of the competition of various small molecule agents for active sites. Such structural information guides the refinement of lead compounds enhancing their affinity, specificity, and efficacy while minimizing off-target effects, thereby expediting the drug development process.
XRC, NMR, and now cryoEM are the most widely used experimental techniques for drug design. Table I summarizes their advantages and limitations. XRC still plays a crucial role in drug discovery by providing high-resolution, three-dimensional structures of biological molecules. Structural information from diffraction experiments used to be the main approach for understanding how these molecules function and interact with potential drug candidates. Recently, cryoEM became a method of choice for studies of large and very large macromolecules and their complexes. It may be counterintuitive for non-structural biologists that the high-quality analysis of cryoEM data for large macromolecules is fast and relatively more straightforward than for small molecules. An example is provided by investigators working on structural studies of ribosomes (Ada Yonath, Venki Ramakrishnan, and Nenand Ban, among others) who abandoned XRC and moved entirely to cryoEM. While many cryoEM structures today still fall within the lower resolution range and smaller molecules remain technically challenging, the technology is advancing rapidly. With ongoing improvements in hardware, software, and innovative methods such as using larger protein scaffolds (1,2), even smaller proteins are becoming more accessible for high-resolution studies.
Table 1.
CryoEM structures used in drug design (full list is available in Table S1) with unusually high differences between the structure quality measures: CCMASK and Q-score.
| PDB ID | Release Date | Length | Resolution [Å] | CCMASK | Average Q-score |
|---|---|---|---|---|---|
| 8WLL | 2024-06-12 | 389 | 3.74 | 0.76 | 0.353 |
| 8BVT | 2023-07-19 | 623 | 3.94 | 0.78 | 0.364 |
| 7Y1K | 2023-06-14 | 1164 | 3.80 | 0.82 | 0.413 |
| 7Y1L | 2023-06-14 | 1189 | 3.73 | 0.85 | 0.424 |
| 7EJA | 2022-04-13 | 1097 | 3.60 | 0.80 | 0.378 |
CryoEM has experienced significant advances in recent years due to the development of high-speed direct electron detectors, new sample preparation methods and, equally important, improved algorithms for image processing and reconstruction. Machine learning approaches have been revolutionary for enhancing the speed and accuracy of automated high-throughput grid screening (Smart Leginon: (3)), micrograph processing and particle picking (TOPAZ: (4), CrYOLO: (5)), 3D reconstruction (CryoDRGN: (6), Blush: (7)), automatic post-processing of cryoEM maps (deepEMhancer: (8)), model building (ModelAngelo: (9)), and recently addressing continuous conformational heterogeneity (3DFlex: (10)). New sample preparation and handling techniques, such as lipid nanodiscs that preserve the native structure of biomolecules in solution and reduce damage from ice formation during freezing, as well as the use of various sample freezing apparatus that improves reproducibility of grid preparation (11-13).
The advancements mentioned above have significantly increased the number of annual depositions of cryoEM structures to the Protein Data Bank (PDB) (14-16) and Electron Microscopy Data Bank (EMDB) (17-19). This growth is particularly significant in the context of the increasing demand for PDB downloads, which is growing even faster (20), far exceeding the rate at which new structures are being experimentally determined. Figure 1 presents the number of structures determined by XRC, NMR, and cryoEM and deposited each year in the PDB. The number of structures determined by cryoEM is growing rapidly, while the deposits of structures determined by XRC have stabilized, and by NMR is decreasing. However, what grows extremely fast are not depositions but the number of downloads from the PDB, which is probably the world's most frequently accessed scientific resource. The vertical scale represents the number of deposits per year or, alternatively, the average number of downloads every two minutes. As shown by the rapid growth of the number of such deposits, the importance of cryoEM in meeting this demand and its potential impact on many areas of research, including drug discovery, cannot be overstated.
Figure 1.

The annual number of new deposits in the PDB, categorized by the method of structure determination: XRC (blue), NMR (green), and cryoEM (red). The solid line represents the average number of deposits downloads every two minutes.
The trends in PDB download data reveal a steady increase from 2009 to 2019, followed by a significant surge from 2019 onward, suggesting an underlying shift in usage patterns. The initial period likely reflects a time when downloads were more targeted, with researchers manually selecting relevant structures, resulting in lower volumes. The sharp increase post-2019 aligns with the emergence of the "AI era" in structural biology, marked by widespread adoption of computational tools like AlphaFold2 (21), which encouraged many research groups to download the entire PDB for local use to facilitate large-scale analysis and prediction. Regular updates to maintain current data further contribute to this growth. This period also highlights the interplay between AI and cryoEM; while AI-driven models benefit from structural diversity to improve predictive capabilities, cryoEM supplies unique structures to enhance the training datasets. The continuous development of AI tools, such as AlphaFold3 (22) with ligand docking features, supports hypothesis generation in drug discovery, where cryoEM offers a means for experimental validation. This synergy underscores the co-evolution of AI and cryoEM, reinforcing their collective impact on advancing structural biology and drug discovery.
In drug discovery, examining macromolecules (typically proteins) in complex with small-molecule ligands, such as inhibitors or active pharmaceutical compounds, is essential for understanding ligand-macromolecule interactions. This approach enables rational modifications to ligand structures to improve binding strength and specificity. Additionally, water molecules and metal ions often play significant supporting roles, as they can reveal alternative binding sites crucial for drug design. The success of these studies depends on access to high-resolution, high-quality structural models; however, such models represent only a subset of all entries in the PDB. While quality assessment metrics for atomic models derived from X-ray crystallography (XRC) and nuclear magnetic resonance (NMR) are well-established, the evaluation methods for cryo-electron microscopy (cryoEM) structures are still maturing. The following section explores the ongoing development and application of quality metrics for cryoEM structures in greater detail.
2. Quality of cryoEM structures
Every structural model requires refinement and specific metrics to assess its alignment with experimental data, such as the electron density map in XRC and Coulomb potential map in cryoEM. Non-specialists may not always recognize that the map represents the experimental outcome, while the structural model presents the experiment's interpretation. In 2020 a new quality metric for cryoEM structures, the Q-score, has been proposed by Pintilie (23). The Q-score measures how similar the map profile of an atom is to a Gaussian-like function that would be seen if the atom were well resolved. The validation reports available in PDB contain Q-scores averaged for each chain of a given structure. This score represents an "average resolvability measure" (23). Since September 2022 all validation reports for cryoEM structures include Q-scores (24), as well as another metric called Atom Inclusions. The latter measures the fraction of atoms in the model that reside inside the EM volume (25). The dependence of Q-score on resolution for all cryoEM structures is shown in Figure 2. It should be noted that there are structures with negative Q-scores, even some of those with claimed resolution better than 3.0 Å. A negative Q-score of structures determined with cryoEM suggests that the atomic density in the Coulomb potential map deviates significantly from a well-resolved Gaussian-like peak. This typically means that the atom or region being measured is poorly resolved or does not fit the map well. Possible reasons for a negative Q-score include low map resolution, inaccurate atomic placement, noise or artifacts in the data, and regions of the structure that are flexible or disordered.
Figure 2.

Q-score and CCMASK values plotted against resolution for all cryoEM structures in the PDB with resolution better than 5 Å. Q-scores for several high-resolution structures (represented as blue dots), not originally reported, were calculated by the authors of this manuscript. Negative and positive values of quality metrics for individual structures are shown as, respectively, blue and green dots. An interactive version of this plot can be explored at https://bioreproducibility.org/figures/cryoem_sbdd/fig2 . The CCMASK data is limited to structures cataloged by the CERES service(26), accessible at (27).
Evaluating the quality of cryoEM structures has sparked debate over the most effective metrics. While atom inclusions and Q-scores have been commonly used, some argue that they fall short as comprehensive quality measures. The Phenix group has advocated for the model-to-map correlation coefficient (CCMASK) as a superior alternative, particularly for high-resolution structures (28-30). Unlike atom inclusions or Q-scores, CCMASK considers essential atomic model parameters such as atomic displacement parameters (ADPs), occupancy, and atom type. Moreover, atom inclusions and Q-scores do not address density shape, missing atoms, or alternative conformations. Historically, identifying multiple conformations in cryoEM structures was challenging due to the difficulty of obtaining maps with both sufficient resolution and the ability to capture fine details. However, recent advances in cryoEM technology appear to have resolved these issues. For example, the atomic-level resolution (1.09 Å) structure 8RQB clearly illustrates double conformations within a significant region of the map (Figure S1). These improvements suggest that more nuanced quality metrics, such as CCMASK, may offer a more comprehensive assessment of cryoEM structure quality in the future.
In 2021 the Phenix group developed an online re-refinement system for cryoEM named CERES (26). This system, which is regularly updated, automates the re-refinement of structures deposited in the PDB. In addition to re-refinement, CERES computes the CCMASK for both the original and re-refined cryoEM structures; however, this information is not available for all cryoEM structures in the PDB. As of November 15, 2024, we successfully retrieved data for 15,275 of the 23,212 cryoEM structures listed in the PDB. A comparison of two structure quality measures, presented in Figure 2, demonstrates that CCMASK appears to be less resolution-dependent than the Q-score. This observation is consistent with the findings of Liebschner et al. (26), who note that CCMASK behaves differently from the R-factors used in X-ray crystallography quality assessments, which are known to be strongly correlated with resolution (31).
To test the hypothesis proposed by the Phenix group regarding the potential superiority of CCMASK over the Q-score, we analyzed the relationship between these two measures, as shown in Figure 3. Our analysis reveals that, particularly for newer and higher-resolution structures (represented by blue dots), CCMASK tends to exhibit more consistent values, whereas the Q-score shows greater variability. However, both measures present interesting outliers. Notable examples include:
Figure 3.

Comparison of Q-score and CCMASK across different resolutions. The "new and high-resolution" category (blue dots) comprises structures with a resolution of at least 2.5 Å, released by the PDB after January 1, 2022. Red dots represent structures with negative values for either CCMASK or Q-score. All remaining structures are represented as green dots. For detailed identification of each structure, an interactive version of this plot is available at https://bioreproducibility.org/figures/cryoem_sbdd/fig3.
PDB ID: 6RBE: This structure was derived from five different electron density maps, all of which have low Q-scores: EMD-4792 (0.16), EMD-4793 (0.1465), EMD-4794 (0.0285), EMD-4795 (0.032), and EMD-4796 (−0.0005). The structure’s average Q-score is exceptionally low at 0.019. However, surprisingly, the CCMASK values provided by CERES show a significantly higher value of 0.8 for the EMD-4793 map, while the other two maps (EMD-4794 and EMD-4795) have low CCMASK values of 0.02 and 0.09, respectively.
PDB ID: 7YKS: For this structure, both the EMD-33896 map and the structure itself display a decent Q-score of 0.545. CERES reports a CCMASK value of only 0.11 for the deposited structure. However, following re-refinement in CERES, the CCMASK value improves significantly to 0.8.
Additional cases of discrepancies between the Q-score and CCMASK are summarized in Table 1. This table also highlights examples where the structures, according to the PDB, serve as the structural basis for already approved drugs.
2.1. Resolution of cryoEM maps
While resolution is not a parameter directly addressed in structures obtained by NMR, the concept of resolution is inherently relevant to XRC and cryoEM. Both latter methods use resolution to quantify the level of detail, but the specific metrics used differ, making a direct comparison difficult (32,33).
In crystallography, resolution is primarily determined by the maximum diffraction angles of the X-rays scattered by the crystal that still provide measurable data. Several factors affect resolution in XRC, including the wavelength of the X-rays used for the experiment, crystal quality, both inherent and affected by the conditions of data collection, and computational methods used for processing diffraction data. The expected resolution limits can be roughly estimated by looking at the first diffraction image, so that weakly diffracting crystals could be bypassed in full data collection.
In cryoEM, resolution is primarily determined by the quality of the electron micrographs, the number of particles analyzed, and the intrinsic heterogeneity of the sample (34). In this case, the resolution depends on the quality of the images (determined by the quality of the electron source, detector, and microscope optics), quality and features of the particles (including quality of sample preparation and particle orientation in the sample), as well as the quality of data processing (including not only the quality of image processing, but also the particle picking and classification, which can be significantly affected by the lack of experience). Fourier Shell Correlation (FSC) is usually used to assess the overall resolution of a cryoEM structure. The 'gold standard' method (35), now the most prevalent in the EM community, is often used to determine the resolution of structures deposited in the PDB or EMDB. Similarly to how resolution is presented in XRC, global FSC provides a single value representing the average resolution across the entire structure. Local FSC can also be calculated to assess the resolution variation within a structure, providing more realistic information about the local quality of the structure. Various algorithms have been developed for local validation, using FSC, Q-score, or a combination of both (36,37).
Until about 15 years ago, in the “blobology” era (38), only low-resolution structures obtained by electron microscopy, primarily using negatively stained samples, were available in the PDB. All that changed with the introduction of modern direct electron-counting detectors and new image captions and processing software. We present the rapid increase in the resolution of cryoEM deposits in PDB in Figure 4. The current highest-resolution structure is that of mouse heavy-chain apoferritin (PDB ID 8RQB). It was resolved at an impressive 1.09 Å resolution. However, the paper describing this structure and the process of obtaining it (39) had to wait 6 months for publication (we are indeed slower than we were during the Gutenberg times). This remarkable achievement highlights the rapid advancements in cryoEM and makes it difficult to predict how far the technology will advance in the coming years (40). However, it is important to note that a recent study (41) highlighted a potential trade-off in pushing the limits of resolution in cryoEM. In particular, authors mention the three main resolution limiting problems: higher-order lens aberration, chromatic aberration, and axial coma. The latter one is especially important since in the case of images affected by coma the FSC will show an increase in correlation which may lead to misestimations of the resolution (42). Char and Stark (41) suggest that solvent densities may serve as more reliable indicators of map quality than the protein regions themselves and demonstrate that even among three high-resolution (1.5 Å or better) apoferritin structures, the number of resolved water molecules varies by a factor of three. The authors conclude that the number of resolved solvent molecules depends not only on the microscope but also on the software used in the processing of the structure. These results demonstrate the challenges in achieving overall model quality at extremely high resolution, suggesting that a balance must be maintained between resolution and the accuracy of the structural model.
Figure 4.

Resolution (≥ 5 Å) of cryoEM structures as reported by their authors, plotted against release date. The boxes represent the interquartile range (50% of the data), with whiskers indicating the range of 1.5 interquartile ranges from the first and third quartiles. An interactive version of this figure, which identifies outliers, is available online at: https://bioreproducibility.org/figures/cryoem_sbdd/fig4.
2.2. Metal ligands
Metal ions play a significant role in drug discovery and development, contributing to both therapeutic and diagnostic applications. Their unique chemical properties that include coordination geometry, catalytic abilities, or variable oxidation states make them useful for metal-based or metal-associated drugs. Some metals, for example silver or copper, directly exhibit antibacterial properties by disrupting bacterial membranes or affecting bacterial enzymes. Metal-containing compounds can be developed to inhibit metalloenzymes in infectious diseases and in cancer. CryoEM is particularly well-suited for studying large, complex structures, such as protein-nucleic acid assemblies or membrane proteins, which often involve inclusion of metal ions. Capturing molecules in near-native state helps to preserve important interactions with metal ions. Since cryoEM is also well suited for analysis of conformational changes in proteins that affect their function, metal-induced changes can be studied.
We analyzed the proportion of structures containing the most commonly occurring metals in all macromolecular complexes present in the PDB. The results, shown in Figure 5, are categorized by the technique used for structure determination. The high fraction of structures containing magnesium and zinc is the result of using almost exclusively cryoEM for ribosome studies. However, it is not clear whether all metal ions are correctly identified. The CheckMyMetal (CMM) server (43,44), the primary metal identification resource, was developed for XRC and was recently expanded for structures determined by cryoEM (45).
Figure 5.

Fraction of PDB deposits containing metal ions, categorized by the method used for their determination (XRC in blue, cryoEM in red, and NMR in green).
The use of the CMM server for validating metal binding sites in high-resolution structures has highlighted some challenges. In an initial test, a significant number of metal ions were incorrectly identified (45). Additional caution is needed when using an X-ray crystallography (XRC) structure as a template for solving a cryoEM structure. Metals bound in the XRC structure, which may have been obtained under different conditions, might be incorrectly retained in the final cryoEM model. Another issue with metal identification arises from the cryoEM CIF file, which lacks information on sample preparation (46). Only general details about the data acquisition process are available through the PDBe API (47). For example, the 8RQB deposit, which had minimal or no refinement data in its CIF file, still contained an important iron metal site (39). Similarly, the unusually high number of zinc sites observed in NMR metal dataset studies (Figure 5, green bar) is somewhat artificial. This is primarily due to zinc ions being retained during experiments, as they play a crucial role in protein structure and function. Without zinc, the protein may misfold or become unstable, which explains why zinc ions are frequently reported in these studies.
2.3. Water molecules in cryoEM maps
All biological molecules exist in the aqueous environment and interactions with individual water molecules play an important role in stabilizing biological material. Samples used in cryoEM experiments are rapidly frozen in a thin layer of vitreous ice (non-crystalline water), preserving the native structure of biological molecules without forming damaging ice crystals. When visible, water molecules can be modeled in high-resolution cryoEM structures. Similarly to XRC, density modification algorithms are usually used in cryoEM to improve the maps (48). It has been shown that different methods will differentially affect ligand densities, including enhancing as well as removing densities for waters and selected metals, depending on the method used (49). However, it seems that many cryoEM deposits do not contain water molecules even if the corresponding map density is present and even though solvent interactions are important for understanding intermolecular interactions. A comparison between cryoEM and XRC of the number of water molecules per residue as a function of resolution (Figure 6) shows that the number of water molecules identified by cryoEM is typically lower than that identified by XRC. However, this difference is likely influenced by the fact that the concept of "resolution" is defined differently for the two techniques, which makes direct comparisons less straightforward (33). Additionally, adding water molecules to crystal structures may lead to improvement of the parameters that indicate structure quality, whereas no such effect is present in cryoEM, since in that case a more complete model does not influence the map itself.
Figure 6.

Ratio of water molecules to amino acid residues plotted against the structure resolution determined by XRC (blue) or cryoEM (red) methods. The boxes represent the interquartile range (middle 50% of the data), with whiskers extending up to 1.5 interquartile ranges from the first and third quartiles. The horizontal line inside each box marks the median. Individual dots represent outliers. An interactive version of this plot, enabling the identification of individual outliers, is available at: https://bioreproducibility.org/figures/cryoem_sbdd/fig6.
3. Refinement
Several programs, including Phenix (29), Refmac/Servalcat (50), Isolde (51), and Coot (52) or various combinations of these packages (53) are being used for structure refinement. Differences in how these programs approach the refinement process may lead to slightly different results. However, looking into the deposit, it is often difficult to find which program was used, and sometimes it is even difficult to find whether the model was refined at all. There are three possible places to find such information, namely the validation file, PDB file, and CIF file. On top of that in the CIF file this information can be found in two independent places: 1. In the “software” list as the “refinement” classification. 2. In the “em_software” list in the category “Model Refinement”. These different sources show, in fact, different results. Figure 7 shows this data according to the “EM_software” table specific for cryoEM structures in CIF files, however, different results may be obtained from the PDBe API using the “refinement_software” field or PDB files.
Figure 7.

Software used in the refinement of cryoEM structures, as reported in PDB deposits. The plot was generated by analyzing 22,646 deposits and extracting data from the CIF file using the EM_software table. An alternative analysis, combining information from the CIF files, validation reports, and older PDB files, yields slightly different results. For further details, see the interactive version at https://bioreproducibility.org/figures/cryoem_sbdd/fig7.
In many cases, neither the validation files nor the CIF and PDB files provide the required detail. Our correspondence with the authors of deposit 8RQB revealed that the lack of information about refinement was due to a bug in one of the deposition programs. This fact identifies serious limitations to the dissemination of know-how for this very fast-developing technique.
Additional details were gathered directly from the deposit authors via phone and email. We have consistently notified the PDB about various errors and have made efforts to inform them of each new class of issues that we discovered.
4. Comparison of structure determination methods
Table 2 summarizes the pros and cons of each of the structure determination methods emphasizing the differences of cryoEM with regard to X-ray and NMR based on the aspects mentioned above.
Table 2.
Summary of advantages and disadvantages of different structure determination methods in obtaining quality information for ligand complexes.
| X-ray crystallography | CryoEM | NMR |
|---|---|---|
Pros:
|
Pros:
|
Pros:
|
Cons:
|
Cons:
|
Cons:
|
5. CryoEM around the world
5.1. CryoEM in academia and national laboratories
Several synchrotron facilities have recognized that adding capabilities to conduct cryoEM experiments on-site can significantly boost scientific output by enabling researchers to work in concert with both cryoEM and XRC. Such integration allows for more efficient and complementary structural analysis. The most successful facilities are Diamond Light Source synchrotron in the United Kingdom, Stanford Synchrotron Research Laboratory (SSRL) in USA, Paul Sherer Institute in Switzerland, and Photon Factory in Japan, although other synchrotron facilities have also been going in this direction. Also, several university-built facilities are open to external users. New York Structural Biology Center, HHMI Janelia Research Campus, The National CryoEM Facility (NCEF) at the National Cancer Institute, The Max Planck Institute of Biophysics in Frankfurt, University of Texas Southwestern in Dallas, The Scripps Research Institute in La Jolla, Rutgers University, and the University of Virginia in Charlottesville are among the best known and best equipped academic facilities. Some of these centers provide computational resources, software, and expertise that allow users to leave facilities with partially refined structures. Unfortunately, it is difficult to measure the performance of these facilities as, contrary to XRC data collected at synchrotron beamlines, the use of the academic resource is not reported in the PDB deposit.
5.2. CryoEM meets the industry
Several leading pharmaceutical and biotech companies have invested in their own cryoEM facilities to advance structural biology studies, providing critical insights that guide drug design and development. Notable examples include Roche, Novartis, Pfizer, Astra Zeneca, Merck, Amgen, and Bristol-Myers Squibb. These facilities are used for commercial studies and results are rarely published or deposited.
Several companies provide full service to industry and academia. They prepare samples, perform measurements, perform 3-D reconstruction, refine the resulting structural models, and even perform partial analysis of results. The best known are NanoImaging Services (NIS) in San Diego, Structural Biology Center at WuXi in Shanghai, SPT Labtech in Melbourne and MiTeGen in Ithaca. It is interesting that while Thermo Fisher Scientific is primarily known for manufacturing cryoEM instruments, they also provide full cryoEM services. The cutting-edge research conducted in primarily manufacturing companies is beneficial for both instrument development and, presumably, the use of prototypes of the new, better system.
5.3. Utilization of cryoEM in drug discovery
The utility of cryoEM in the process of drug discovery and design has been alluded to for almost a decade (Table S1). The first publication that explicitly showed results of such studies was by Merk et al. (58). The authors presented 3.8 Å resolution cryoEM structures of the cancer target isocitrate dehydrogenase and identified the nature of conformational changes induced by binding of the allosteric small-molecule inhibitor ML309. They also pushed the resolution of the structure of lactate dehydrogenase to 2.8 Å and of glutamate dehydrogenase to 1.8 Å. It should be stressed that all these enzymes were comparatively large (93-334kDa), emphasizing the fact that cryoEM works better for larger macromolecules than for smaller ones. Much more recently Cushing et al. (59) presented a series of structures of CDK-activating kinase determined at the resolution around 2 Å with an explicit purpose of assisting in structure-based drug design. The mass of these heterotrimeric complexes is ~87 kDa, emphasizing that current cryoEM methods could be useful for these comparatively small protein complexes. However, it is not clear if much smaller proteins could provide similar type of information and, for now, XRC is still the major technique in drug design and development studies.
CryoEM has revolutionized structural studies of G protein-coupled receptors (GPCRs) and ribosomes (60-63). GPCRs are critical targets in drug discovery due to their involvement in numerous physiological processes (64,65). Application of cryoEM in GPCR research enables the determination of high-resolution structures, particularly for challenging targets, such as large protein complexes and membrane proteins, which are very difficult to crystallize. CryoEM has allowed for insights into the active and inactive states of GPCRs, their conformational changes, and their interactions with ligands (66,67). The research is conducted entirely within public/private companies and sometimes conducted collaboratively between private companies and academic institutions. These collaborations frequently result in publications with a large number of co-authors, some of whom hold multiple affiliations. For example, two GPCR-related papers, published in joint efforts by the iHuman Institute (ShanghaiTech University), Structure Therapeutics (South San Francisco), and several other organizations (68,69), featured 12 and 19 co-authors, respectively, representing 10 and 9 institutions. Several authors listed multiple affiliations. These examples highlight the extensive collaboration behind such work but also raise questions about whether every contributor fully grasps the broader context of the research—the proverbial "forest"—rather than focusing solely on their specific "trees."
6. Challenges and the future directions
CryoEM is a relatively new, rapidly evolving technique that holds immense promise. One might assume that with the availability of structures boasting resolutions better than 1.25 Å (although it must be noticed that 22 out of the 40 cryoEM structures with resolution better than 1.8 Å represent just 2 proteins: Human (P02794) and Mouse Ferritin (P09528)), the major challenges have been overcome, and the development of life-improving drugs is just around the corner. However, the real challenge lies in the dissemination of know-how. Surprisingly, even at similar resolution, the quality indicators of structures—as measured by the Q-score—vary significantly, even for identical samples (7VZ3 Q-score -0.015; 7VYV Q-score 0.684; 7LZJ Q-score 0.632). Despite numerous courses and workshops, we are still left questioning why, for example, three highly similar structures could exhibit such different Q-scores. We suspect that the low score for 7VZ3 is due to the misalignment between the map and the coordinates not addressed during deposition, and not to any intrinsic problems of the structure itself. The PDB has now adopted the Q-score, and regardless of how we feel about it, biologists will inevitably use it as a benchmark for selecting structures. We wish the authors would do the same.
CryoEM is a very powerful technique, but it still faces several challenges. The most critical is probably sample preparation which may ultimately strongly affect map reconstruction. The cryoEM sample does not need to be crystallized, but sample preparation is still not trivial (70)). Samples must be flash-frozen in a thin layer of vitreous ice, and the procedure is not always fully reproducible. The optimal ice thickness is critical for obtaining micrographs for high resolution reconstructions, with the upper limit of ice thickness considered to be under 300Å for electron transparency (71,72). Thick ice does limit resolution, while thin ice can force out particles, induce a preferred orientation, or even damage biological structures (73-75). Microfluidics has enabled several “blot free” systems to be developed for preparing TEM grids – notably the SPT Chameleon®, CryoSol Vitrojet™, cryoWriter, and custom time resolved systems (76-79). Although these systems differ, each offers rapid vitrification on the millisecond timescale, which helps to avoid lengthy interaction with the air-water interface, as well as improves consistency over conventional blotting. A broader strategy may be to devise a range of affinity grids that can immobilize particles in ways that maintain their structural integrity (70). Detergents and other additives could always be screened, but new methods such as incorporation of Late Embryogenesis Abundant (LEA) proteins to samples could overcome environmental concerns of the commonly used polyfluoroalkyl substances (PFAS) based detergents (e.g. fluorinated octyl-maltoside) (80,81). Grid technology has paved the way for high-quality, ultra-stable gold supports with minimal beam-induced motion, enabling some of the highest reported resolutions in structural studies (82,83).
Improvements in the new generation of energy filters continue to reduce noise from inelastically scattered electrons, pushing the resolution limit and overall quality of micrographs (84). The current development of 100 kV TEM systems minimizes radiation damage, while reporting resolution at the same level of existing 200 kV and 300 kV systems (85-87).
Regardless of all these problems, there is no question that cryoEM will provide a powerful tool for drug discovery and complement XRC well as cryoEM overcomes the limits of XRC for large, heterogeneous, and flexible macromolecular complexes. Streamlining computational workflows for automated data acquisition coupled to processing pipelines, along with automated model building will make cryoEM a primary method for drug discovery in the future.
7. Expert Opinion
CryoEM has emerged as a transformative technology, revolutionizing the field of structural biology and, by extension, drug discovery. There are numerous approaches gaining importance in modern drug discovery, and cryoEM has the potential to be particularly useful among them. Since cryoEM is particularly well-suited for studying large, complex structures, one of the avenues where it can be advantageous over other methods is the quickly evolving field of induced proximity, including targeted protein degradation, that carries a promise of drugging undruggable (88). Capturing induced-proximity events is challenging, as complexes containing such interactions can be difficult to crystallize due to their transient or dynamic nature, as well as their size. Antibodies are widely used as therapeutic agents, also a rapidly evolving field. CryoEM can help to visualize the interactions between antibodies and their target antigens, providing rapid epitope mapping. Such information can be used to explain their mode of action, as well as to improve antibody design and enhance their therapeutic efficacy. Combined with other experimental techniques cryoEM can be used for approaches such as iterative vaccine design (89). In that respect cryoEM already has played a significant role in fighting COVID-19 (90), as well as contributed to a rational design of effective respiratory syncytial virus (RSV) vaccine candidates (91). CryoEM can assist in rational design for chimeric antigen receptor (CAR) T-cell receptors, where detailed understanding of their interactions with cancer antigens can facilitate maximizing their therapeutic potential by fine-tuning CAR receptors affinity and specificity (92). CryoEM has also been used to determine structures of gene editing enzymes in action. This knowledge can be used to modify the properties of gene editors and improve their speed and accuracy (93). What is more, cryoEM can be used for characterization and design of viral and non-viral gene delivery vehicles in gene therapy (94,95), as well as vaccine delivery vehicles, like lipid nanoparticles.
Even though the single particle analysis is currently the main branch of cryoEM utilized in drug discovery, cryo-electron tomography (cryoET) is gaining momentum, providing the scientists with the ability to study the molecular architecture of cells and the processes within in unprecedented detail. The key aspect in any drug discovery pipeline is to mechanistically rationalize the choice of novel targets, which requires a deep understanding of cellular processes and pathways involved in the pathogenesis. Use of cryoET allows researchers to gain a detailed understanding of how a target plays a role in disease biology, which is a fundament for all the following steps of the drug discovery process.
Artificial intelligence is playing a significant role in cryoEM data processing and analysis (96). Particle picking and identification of 2D classes are critical steps in cryoEM structure determination. AI algorithms can automatically identify and pick individual particles from cryoEM micrographs, reducing the time and effort required for manual selection. In many cases, it is extremely difficult for an untrained eye to identify quality particles and differentiate between slightly different conformational states of the particles. AI algorithms, being able to differentiate between subtle conformational changes, can facilitate the classification of particles. AI-driven methods can also be used to determine the 3D structure of macromolecules from cryoEM data without prior knowledge of the structure. Even though we are probably not there yet, especially for more challenging cases, eventually AI can automate the entire cryoEM workflow, from data acquisition to structure determination, reducing manual labor and increasing efficiency.
The 2024 Nobel Prize in Chemistry, awarded to David Baker, Demis Hassabis, and John Jumper, highlights the transformative impact of AI in biomedical research. AI not only predicts molecular structures in the absence of experimental data but also improves the accuracy of structures obtained through various techniques, including cryoEM (21,97-100). As these approaches continue to converge, cryoEM is poised to benefit even more from breakthroughs like AlphaFold3 and its future developments.
CryoEM is a cutting-edge, rapidly advancing technique with great potential. In the past two months, two groundbreaking papers have made a significant impact on the research community. The first demonstrates that cryoEM can clearly resolve hydrogen atom positions and water networks (54), while the second reports an atomic resolution (1.09 Å) protein structure (8RQB) that reveals double conformations (39).
The application of cryoEM to structure-based drug discovery is still evolving, but it is rapidly becoming a routine method of choice. As the technology continues to advance, we can expect to see more examples of drugs and therapeutic approaches that will be discovered, designed or optimized using cryoEM-derived structural information, leading to the development of new and effective therapies for numerous pressing health issues.
Supplementary Material
Figure S1 Double conformations observed in the cryoEM structure of mouse heavy-chain apoferritin (PDB ID: 8RQB). The figure highlights the iron channel, showcasing four histidine residues coordinating the iron atom. The histidine residues exhibit double conformations, demonstrating the ability of modern high-resolution cryoEM to resolve alternative conformations in critical regions of macromolecular structures.
Table S1 This table showcases a set of approved drugs that have been structurally characterized exclusively by cryo-electron microscopy (cryoEM) according to current entries in the Protein Data Bank (PDB). At the time of this study, these drugs lack structural data from alternative methods such as X-ray crystallography or NMR, underscoring the unique role of cryoEM in providing the initial structural insights for these drug-bound complexes. Although future depositions may include structures resolved by other techniques, cryoEM will remain the pioneering method that first captured these specific drug interactions. This compilation highlights the increasing impact of cryoEM in advancing drug discovery, particularly for targets and complexes challenging to study by other structural approaches.
Article Highlights.
CryoEM is a rapidly evolving and cutting-edge technique with immense potential.
Its role in structure-based drug discovery is expanding as the field continues to evolve.
CryoEM has revolutionized the study of large, complex structures, especially in drug discovery, offering unique advantages for investigating transient and dynamic interactions, including applications in induced proximity and targeted protein degradation.
When combined with cryoET, cryoEM provides unparalleled insights into cellular molecular architecture and disease biology, making it invaluable for rational drug target identification.
Recent breakthroughs in cryoEM have significantly enhanced resolution, although challenges such as lens aberrations and solvent density variations highlight the delicate balance between achieving high resolution and maintaining model accuracy.
CryoEM structures, like other structural models, require refinement and robust quality metrics. The Q-score is widely used, but the model-to-map correlation coefficient (CCMASK) has emerged as an alternative. It addresses the limitations of Q-scores by accounting for atomic displacement, density shape, and recent advances in resolving multiple conformations.
Artificial intelligence is revolutionizing cryoEM data analysis by automating particle identification, improving classification, and enhancing 3D structure determination, significantly reducing the need for manual intervention.
Metal ions play a pivotal role in drug discovery, both therapeutically and diagnostically. CryoEM excels in studying large, complex structures involving metal ions, preserving critical metal interactions, and enabling the analysis of metal-induced conformational changes in proteins.
Acknowledgments
The authors would like to thank Dr. Mark Nakasone and Dr. Zbigniew Dauter for their critical reading of the manuscript and valuable comments. We also thank Kim Pate and Pam Acker for their overall help during manuscript preparation.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was funded by Harrison Family Funds via University of Virginia and in part by the NIH Intramural Research Program, Center for Cancer Research, National Cancer Institute, National Institutes of Health. KM and MG express their gratitude for the financial support provided for this research through the departmental grant funds (N19/DBS/000023).
Footnotes
Declaration of interest
WM has contributed to the development of software, data management systems, and data-mining tools, some of which have been commercialized by HKL Research. WM is also a cofounder of HKL Research and serves on its board. The other authors declare no relevant affiliations or financial relationships with any organization or entity with a financial interest in or conflict with the subject matter or materials discussed in this manuscript, beyond the disclosures provided.
Reviewer disclosures
Peer reviewers on this manuscript have no relevant financial or other relationships to disclose.
References
- 1. Herzik MA, Wu M, Lander GC. High-resolution structure determination of sub-100 kDa complexes using conventional cryo-EM. Nat Commun [Internet]. 2019. Mar 4;10(1):1032. Available from: http://www.ncbi.nlm.nih.gov/pubmed/30833564 ** determination of sub 100 kDa complexes by cryoEM
- 2.Chiu Y-H, Ko K-T, Yang T-J, Wu K-P, Ho M-R, Draczkowski P, et al. Direct Visualization of a 26 kDa Protein by Cryo-Electron Microscopy Aided by a Small Scaffold Protein. Biochemistry [Internet]. 2021. Apr 13;60(14):1075–9. Available from: http://www.ncbi.nlm.nih.gov/pubmed/33719392 [DOI] [PubMed] [Google Scholar]
- 3. Cheng A, Kim PT, Kuang H, Mendez JH, Chua EYD, Maruthi K, et al. Fully automated multi-grid cryoEM screening using Smart Leginon. IUCrJ [Internet]. 2023. Jan 1;10(1):77–89. Available from: http://www.ncbi.nlm.nih.gov/pubmed/36598504 * automated screening of grids
- 4.Bepler T, Kelley K, Noble AJ, Berger B. Topaz-Denoise: general deep denoising models for cryoEM and cryoET. Nat Commun [Internet]. 2020. Oct 15;11(1):5208. Available from: http://www.ncbi.nlm.nih.gov/pubmed/33060581 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Wagner T, Merino F, Stabrin M, Moriya T, Antoni C, Apelbaum A, et al. SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM. Commun Biol [Internet]. 2019. Jun 19;2(1):218. Available from: http://www.ncbi.nlm.nih.gov/pubmed/31240256 * fully automated particle picker
- 6.Zhong ED, Bepler T, Berger B, Davis JH. CryoDRGN: reconstruction of heterogeneous cryo-EM structures using neural networks. Nat Methods [Internet]. 2021. Feb 4;18(2):176–85. Available from: http://www.ncbi.nlm.nih.gov/pubmed/33542510 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kimanius D, Jamali K, Wilkinson ME, Lövestam S, Velazhahan V, Nakane T, et al. Data-driven regularization lowers the size barrier of cryo-EM structure determination. Nat Methods [Internet]. 2024. Jul 11;21(7):1216–21. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38862790 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sanchez-Garcia R, Gomez-Blanco J, Cuervo A, Carazo JM, Sorzano COS, Vargas J. DeepEMhancer: a deep learning solution for cryo-EM volume post-processing. Commun Biol [Internet]. 2021. Jul 15;4(1):874. Available from: http://www.ncbi.nlm.nih.gov/pubmed/34267316 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Jamali K, Käll L, Zhang R, Brown A, Kimanius D, Scheres SHW. Automated model building and protein identification in cryo-EM maps. Nature [Internet]. 2024. Apr 11;628(8007):450–7. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38408488 * automated model builder
- 10.Punjani A, Fleet DJ. 3DFlex: determining structure and motion of flexible proteins from cryo-EM. Nat Methods [Internet]. 2023. Jun 11;20(6):860–70. Available from: http://www.ncbi.nlm.nih.gov/pubmed/37169929 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Dalal V, Arcario MJ, Petroff JT, Tan BK, Dietzen NM, Rau MJ, et al. Lipid nanodisc scaffold and size alter the structure of a pentameric ligand-gated ion channel. Nat Commun [Internet]. 2024. Jan 2;15(1):25. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38167383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Yuan Y, Kong F, Xu H, Zhu A, Yan N, Yan C. Cryo-EM structure of human glucose transporter GLUT4. Nat Commun [Internet]. 2022. May 13;13(1):2671. Available from: http://www.ncbi.nlm.nih.gov/pubmed/35562357 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Hirst IJ, Thomas WJR, Davies RA, Muench SP. CryoEM grid preparation: a closer look at advancements and impact of preparation mode and new approaches. Biochem Soc Trans [Internet]. 2024. Jun 26;52(3):1529–37. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38864435 *** Comprehensive description of Protein Data Bank
- 14.Burley SK, Berman HM, Bhikadiya C, Bi C, Chen L, Di Costanzo L, et al. Protein Data Bank: The single global archive for 3D macromolecular structure data. Nucleic Acids Res. 2019;47(D1):D520–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Burley SK, Bhikadiya C, Bi C, Bittrich S, Chen L, Crichlow GV., et al. RCSB Protein Data Bank: Celebrating 50 years of the PDB with new tools for understanding and visualizing biological macromolecules in 3D. Protein Sci. 2022;31(1):187–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Markosian C, Di Costanzo L, Sekharan M, Shao C, Burley SK, Zardecki C. Analysis of impact metrics for the Protein Data Bank. Sci data [Internet]. 2018. Dec 16;5(1):180212. Available from: http://www.nature.com/articles/sdata2018212 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lawson CL, Baker ML, Best C, Bi C, Dougherty M, Feng P, et al. EMDataBank.org: Unified data resource for CryoEM. Nucleic Acids Res. 2011;39(SUPPL. 1):456–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Pintilie G, Chiu W. Assessment of structural features in Cryo-EM density maps using SSE and side chain Z-scores. J Struct Biol [Internet]. 2018;204(3):564–71. Available from: 10.1016/j.jsb.2018.08.015 *** Comprehensive description of EMDB
- 19. Microscopy CE, Em T. EMDB - the Electron Microscopy Data Bank Growth and Statistics. 2023;(19):456–65. ** description of Q-score
- 20. World Wide PDB Download Statistics [Internet]. [cited 2024 Nov 18]. Available from: https://www.wwpdb.org/stats/download ** Use of Phenix for structure refinement
- 21.Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature [Internet]. 2021. Aug 26;596(7873):583–9. Available from: http://www.ncbi.nlm.nih.gov/pubmed/34265844 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature [Internet]. 2024. Jun 13;630(8016):493–500. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38718835 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Pintilie G, Zhang K, Su Z, Li S, Schmid MF, Chiu W. Measurement of atom resolvability in cryo-EM maps with Q-scores. Nat Methods [Internet]. 2020;17(3):328–34. Available from: 10.1038/s41592-020-0731-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.World Wide PDB 2022. News: Q-score [Internet]. [cited 2024 Nov 18]. Available from: https://www.wwpdb.org/news/news?year=2022#62de9e5235ec5bb4ddb19a43
- 25. RCSB: Assessing the Quality of 3D Structures [Internet]. [cited 2024 Nov 18]. Available from: https://www.rcsb.org/docs/general-help/assessing-the-quality-of-3d-structures ** validation of cryoEM models
- 26.Liebschner D, Afonine PV, Moriarty NW, Poon BK, Chen VB, Adams PD. CERES: A cryo-EM re-refinement system for continuous improvement of deposited models. Acta Crystallogr Sect D Struct Biol. 2021;77:48–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.CERES - the Cryo-EM re-refinement system [Internet]. [cited 2024 Nov 18]. Available from: https://cci.lbl.gov/ceres
- 28.Terwilliger TC, Adams PD, Afonine PV, Sobolev OV. A fully automatic method yielding initial models from high-resolution cryo-electron microscopy maps. Nat Methods [Internet]. 2018;15(11):905–8. Available from: 10.1038/s41592-018-0173-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Afonine PV, Poon BK, Read RJ, Sobolev OV, Terwilliger TC, Urzhumtsev A, et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr Sect D, Struct Biol [Internet]. 2018;74(Pt 6):531–44. Available from: http://www.ncbi.nlm.nih.gov/pubmed/29872004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Afonine P V, Klaholz BP, Moriarty NW, Poon BK, Sobolev OV, Terwilliger TC, et al. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr Sect D Struct Biol. 2018;74(9):814–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Joosten RP, Salzemann J, Bloch V, Stockinger H, Berglund A-C, Blanchet C, et al. PDB_REDO: automated re-refinement of X-ray structure models in the PDB. J Appl Crystallogr [Internet]. 2009. Jun 1;42(Pt 3):376–84. Available from: http://www.ncbi.nlm.nih.gov/pubmed/22477769 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wlodawer A, Dauter Z. `Atomic resolution’: a badly abused term in structural biology. Acta Crystallogr Sect D, Struct Biol [Internet]. 2017;73(Pt 4):379–80. Available from: http://www.ncbi.nlm.nih.gov/pubmed/28375149 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Wlodawer A, Li M, Dauter Z. High-Resolution Cryo-EM Maps and Models: A Crystallographer’s Perspective. Structure [Internet]. 2017;25(10):1589–1597.e1. Available from: http://www.ncbi.nlm.nih.gov/pubmed/28867613 ** validation of metal sited in cryoEM structural models
- 34.Stagg SM, Noble AJ, Spilman M, Chapman MS. ResLog plots as an empirical metric of the quality of cryo-EM reconstructions. J Struct Biol [Internet]. 2014. Mar;185(3):418–26. Available from: http://www.ncbi.nlm.nih.gov/pubmed/24384117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Scheres SHW, Chen S. Prevention of overfitting in cryo-EM structure determination. Nat Methods [Internet]. 2012. Sep 29;9(9):853–4. Available from: http://www.ncbi.nlm.nih.gov/pubmed/22842542 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Ramírez-Aportela E, Maluenda D, Fonseca YC, Conesa P, Marabini R, Heymann JB, et al. FSC-Q: a CryoEM map-to-atomic model quality validation based on the local Fourier shell correlation. Nat Commun [Internet]. 2021. Jan 4;12(1):42. Available from: http://www.ncbi.nlm.nih.gov/pubmed/33397925 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Feng M-F, Chen Y-X, Shen H-B. DeepQs: Local quality assessment of cryo-EM density map by deep learning map-model fit score. J Struct Biol [Internet]. 2024. Mar;216(1):108059. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38160703 [DOI] [PubMed] [Google Scholar]
- 38. Ghosh P. What is “blobology” and how is it transforming biology? BBC News [Internet]. 2019; Available from: https://www.bbc.com/news/science-environment-49359137 ** Refmac and Servalcat approach to refinement cryoEM models
- 39.Küçükoğlu B, Mohammed I, Guerrero-Ferreira RC, Ribet SM, Varnavides G, Leidl ML, et al. Low-dose cryo-electron ptychography of proteins at sub-nanometer resolution. Nat Commun [Internet]. 2024. Sep 14;15(1):8062. Available from: http://www.ncbi.nlm.nih.gov/pubmed/39277607 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Egelman EH. The Current Revolution in Cryo-EM. Biophys J [Internet]. 2016. Mar 8;110(5):1008–12. Available from: 10.1016/j.bpj.2016.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chari A, Stark H. Prospects and Limitations of High-Resolution Single-Particle Cryo-Electron Microscopy. Annu Rev Biophys [Internet]. 2023. May 9;52(1):391–411. Available from: http://www.ncbi.nlm.nih.gov/pubmed/37159297 [DOI] [PubMed] [Google Scholar]
- 42.Bromberg R, Guo Y, Borek D, Otwinowski Z. High-resolution cryo-EM reconstructions in the presence of substantial aberrations. IUCrJ [Internet]. 2020. May 1;7(Pt 3):445–52. Available from: http://www.ncbi.nlm.nih.gov/pubmed/32431828 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Handing KB, Niedzialkowska E, Shabalin IG, Kuhn ML, Zheng H, Minor W. Characterizing metal-binding sites in proteins with X-ray crystallography. Nat Protoc [Internet]. 2018. May 19;13(5):1062–90. Available from: http://www.ncbi.nlm.nih.gov/pubmed/29674755 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Gucwa M, Lenkiewicz J, Zheng H, Cymborowski M, Cooper DR, Murzyn K, et al. CMM-An enhanced platform for interactive validation of metal binding sites. Protein Sci [Internet]. 2023. Jan;32(1):e4525. Available from: http://www.ncbi.nlm.nih.gov/pubmed/36464767 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gucwa M, Bijak V, Zheng H, Murzyn K, Minor W. CheckMyMetal (CMM): validating metal-binding sites in X-ray and cryo-EM data. IUCrJ [Internet]. 2024. Sep 1;11(Pt 5):871–7. Available from: http://www.ncbi.nlm.nih.gov/pubmed/39141478 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Kleywegt GJ, Adams PD, Butcher SJ, Lawson CL, Rohou A, Rosenthal PB, et al. Community recommendations on cryoEM data archiving and validation. IUCrJ. 2024;11(Pt 2):140–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Velankar S, van Ginkel G, Alhroub Y, Battle GM, Berrisford JM, Conroy MJ, et al. PDBe: improved accessibility of macromolecular structure data from PDB and EMDB. Nucleic Acids Res [Internet]. 2016. Jan 4;44(D1):D385–95. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26476444 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Terwilliger TC, Ludtke SJ, Read RJ, Adams PD, Afonine PV. Improvement of cryo-EM maps by density modification. Nat Methods [Internet]. 2020. Sep 17;17(9):923–7. Available from: http://www.ncbi.nlm.nih.gov/pubmed/32807957 ** how to make good cryoEM samples
- 49.Berkeley RF, Cook BD, Herzik MA. Machine learning approaches to cryoEM density modification differentially affect biomacromolecule and ligand density quality. Front Mol Biosci [Internet]. 2024. Apr 18;11:1404885. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38698773 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Yamashita K, Wojdyr M, Long F, Nicholls RA, Murshudov GN. GEMMI and Servalcat restrain REFMAC5. Acta Crystallogr Sect D, Struct Biol. 2023;79:368–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Croll TI. ISOLDE : a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr Sect D Struct Biol [Internet]. 2018. Jun 1;74(6):519–30. Available from: http://www.ncbi.nlm.nih.gov/pubmed/29872003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Brown A, Long F, Nicholls RA, Toots J, Emsley P, Murshudov G. Tools for macromolecular model building and refinement into electron cryo-microscopy reconstructions. Acta Crystallogr Sect D Biol Crystallogr. 2015;71:136–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Lugmayr W, Kotov V, Goessweiner-Mohr N, Wald J, DiMaio F, Marlovits TC. StarMap: a user-friendly workflow for Rosetta-driven molecular structure refinement. Nat Protoc [Internet]. 2023. Jan;18(1):239–64. Available from: http://www.ncbi.nlm.nih.gov/pubmed/36323866 [DOI] [PubMed] [Google Scholar]
- 54.Hussein R, Graça A, Forsman J, Aydin AO, Hall M, Gaetcke J, et al. Cryo-electron microscopy reveals hydrogen positions and water networks in photosystem II. Science [Internet]. 2024. Jun 21;384(6702):1349–55. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38900892 [DOI] [PubMed] [Google Scholar]
- 55.Bochtler M. X-rays, electrons, and neutrons as probes of atomic matter. Structure [Internet]. 2024. May 2;32(5):630–643.e6. Available from: 10.1016/j.str.2024.01.015 [DOI] [PubMed] [Google Scholar]
- 56.Bick T, Dominiak PM, Wendler P. Exploiting the full potential of cryo-EM maps. BBA Adv [Internet]. 2024;5:100113. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38292063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Marques MA, Purdy MD, Yeager M. CryoEM maps are full of potential. Curr Opin Struct Biol [Internet]. 2019. Oct;58:214–23. Available from: http://www.ncbi.nlm.nih.gov/pubmed/31400843 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Merk A, Bartesaghi A, Banerjee S, Falconieri V, Rao P, Davis MI, et al. Breaking Cryo-EM Resolution Barriers to Facilitate Drug Discovery. Cell. 2016;165(7):1698–707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Cushing VI, Koh AF, Feng J, Jurgaityte K, Bondke A, Kroll SHB, et al. High-resolution cryo-EM of the human CDK-activating kinase for structure-based drug design. Nat Commun [Internet]. 2024. Mar 13;15(1):2265. Available from: https://www.nature.com/articles/s41467-024-46375-9 *** description of AlphaFold
- 60.Zhang X, Johnson RM, Drulyte I, Yu L, Kotecha A, Danev R, et al. Evolving cryo-EM structural approaches for GPCR drug discovery. Structure [Internet]. 2021. Sep 2;29(9):963–974.e6. Available from: http://www.ncbi.nlm.nih.gov/pubmed/33957078 [DOI] [PubMed] [Google Scholar]
- 61.Shimada I, Ueda T, Kofuku Y, Eddy MT, Wüthrich K. GPCR drug discovery: integrating solution NMR data with crystal and cryo-EM structures. Nat Rev Drug Discov [Internet]. 2019. Jan;18(1):59–82. Available from: http://www.ncbi.nlm.nih.gov/pubmed/30410121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Duan J, He X-H, Li S-J, Xu HE. Cryo-electron microscopy for GPCR research and drug discovery in endocrinology and metabolism. Nat Rev Endocrinol [Internet]. 2024. Jun;20(6):349–65. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38424377 [DOI] [PubMed] [Google Scholar]
- 63.Danev R, Belousoff M, Liang Y-L, Zhang X, Eisenstein F, Wootten D, et al. Routine sub-2.5 Å cryo-EM structure determination of GPCRs. Nat Commun [Internet]. 2021. Jul 15;12(1):4333. Available from: http://www.ncbi.nlm.nih.gov/pubmed/34267200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Hauser AS, Attwood MM, Rask-Andersen M, Schiöth HB, Gloriam DE. Trends in GPCR drug discovery: new agents, targets and indications. Nat Rev Drug Discov [Internet]. 2017. Dec;16(12):829–42. Available from: http://www.ncbi.nlm.nih.gov/pubmed/29075003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lundstrom K. An overview on GPCRs and drug discovery: structure-based drug design and structural biology on GPCRs. Methods Mol Biol [Internet]. 2009;552:51–66. Available from: http://www.ncbi.nlm.nih.gov/pubmed/19513641 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Papasergi-Scott MM, Pérez-Hernández G, Batebi H, Gao Y, Eskici G, Seven AB, et al. Time-resolved cryo-EM of G-protein activation by a GPCR. Nature [Internet]. 2024. May;629(8014):1182–91. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38480881 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Li F, Tai L, Sun X, Lv Z, Tang W, Wang T, et al. Molecular recognition and activation mechanism of short-chain fatty acid receptors FFAR2/3. Cell Res [Internet]. 2024. Apr;34(4):323–6. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38191689 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Zhang H, Qiao A, Yang L, Van Eps N, Frederiksen KS, Yang D, et al. Structure of the glucagon receptor in complex with a glucagon analogue. Nature [Internet]. 2018;553(7686):106–10. Available from: 10.1038/nature25153 [DOI] [PubMed] [Google Scholar]
- 69.Yuan S, Xia L, Wang C, Wu F, Zhang B, Pan C, et al. Conformational Dynamics of the Activated GLP-1 Receptor-G s Complex Revealed by Cross-Linking Mass Spectrometry and Integrative Structure Modeling. ACS Cent Sci [Internet]. 2023. May 24;9(5):992–1007. Available from: http://www.ncbi.nlm.nih.gov/pubmed/37252352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Han B-G, Avila-Sakar A, Remis J, Glaeser RM. Challenges in making ideal cryo-EM samples. Curr Opin Struct Biol [Internet]. 2023. Aug;81:102646. Available from: http://www.ncbi.nlm.nih.gov/pubmed/37392555 [DOI] [PubMed] [Google Scholar]
- 71.Neselu K, Wang B, Rice WJ, Potter CS, Carragher B, Chua EYD. Measuring the effects of ice thickness on resolution in single particle cryo-EM. J Struct Biol X [Internet]. 2023;7:100085. Available from: http://www.ncbi.nlm.nih.gov/pubmed/36742017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Dickerson JL, Lu P-H, Hristov D, Dunin-Borkowski RE, Russo CJ. Imaging biological macromolecules in thick specimens: The role of inelastic scattering in cryoEM. Ultramicroscopy [Internet]. 2022. Jul;237:113510. Available from: http://www.ncbi.nlm.nih.gov/pubmed/35367900 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Olek M, Cowtan K, Webb D, Chaban Y, Zhang P. IceBreaker: Software for high-resolution single-particle cryo-EM with non-uniform ice. Structure [Internet]. 2022. Apr 7;30(4):522–531.e4. Available from: http://www.ncbi.nlm.nih.gov/pubmed/35150604 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Rice WJ, Cheng A, Noble AJ, Eng ET, Kim LY, Carragher B, et al. Routine determination of ice thickness for cryo-EM grids. J Struct Biol [Internet]. 2018. Oct;204(1):38–44. Available from: http://www.ncbi.nlm.nih.gov/pubmed/29981485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Henderikx RJM, Schotman MJG, Shahzad S, Fromm SA, Mann D, Hennies J, et al. Ice thickness control and measurement in the VitroJet for time-efficient single particle structure determination. J Struct Biol [Internet]. 2024. Oct 20;216(4):108139. Available from: http://www.ncbi.nlm.nih.gov/pubmed/39433138 [DOI] [PubMed] [Google Scholar]
- 76.Schmidli C, Albiez S, Rima L, Righetto R, Mohammed I, Oliva P, et al. Microfluidic protein isolation and sample preparation for high-resolution cryo-EM. Proc Natl Acad Sci U S A [Internet]. 2019. Jul 23;116(30):15007–12. Available from: http://www.ncbi.nlm.nih.gov/pubmed/31292253 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Henderikx RJM, Mann D, Domanska A, Dong J, Shahzad S, Lak B, et al. VitroJet: new features and case studies. Acta Crystallogr Sect D, Struct Biol [Internet]. 2024. Apr 1;80(Pt 4):232–46. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38488730 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Klebl DP, Gravett MSC, Kontziampasis D, Wright DJ, Bon RS, Monteiro DCF, et al. Need for Speed: Examining Protein Behavior during CryoEM Grid Preparation at Different Timescales. Structure [Internet]. 2020. Nov 3;28(11):1238–1248.e4. Available from: http://www.ncbi.nlm.nih.gov/pubmed/32814033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Rubinstein JL, Guo H, Ripstein ZA, Haydaroglu A, Au A, Yip CM, et al. Shake-it-off: a simple ultrasonic cryo-EM specimen-preparation device. Acta Crystallogr Sect D, Struct Biol [Internet]. 2019. Dec 1;75(Pt 12):1063–70. Available from: http://www.ncbi.nlm.nih.gov/pubmed/31793900 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Zheng L, Xu J, Wang W, Gao X, Zhao C, Guo W, et al. Self-assembled superstructure alleviates air-water interface effect in cryo-EM. Nat Commun [Internet]. 2024. Aug 24;15(1):7300. Available from: http://www.ncbi.nlm.nih.gov/pubmed/39181869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Abe KM, Li G, He Q, Grant T, Lim CJ. Small LEA proteins mitigate air-water interface damage to fragile cryo-EM samples during plunge freezing. Nat Commun [Internet]. 2024. Sep 4;15(1):7705. Available from: http://www.ncbi.nlm.nih.gov/pubmed/39231985 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Naydenova K, Russo CJ. Integrated wafer-scale manufacturing of electron cryomicroscopy specimen supports. Ultramicroscopy [Internet]. 2022. Jan;232:113396. Available from: http://www.ncbi.nlm.nih.gov/pubmed/34740028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Russo CJ, Passmore LA. Ultrastable gold substrates: Properties of a support for high-resolution electron cryomicroscopy of biological specimens. J Struct Biol [Internet]. 2016. Jan;193(1):33–44. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26592474 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Fréchin L, Holvec S, von Loeffelholz O, Hazemann I, Klaholz BP. High-resolution cryo-EM performance comparison of two latest-generation cryo electron microscopes on the human ribosome. J Struct Biol [Internet]. 2023. Mar;215(1):107905. Available from: http://www.ncbi.nlm.nih.gov/pubmed/36241135 [DOI] [PubMed] [Google Scholar]
- 85.Naydenova K, McMullan G, Peet MJ, Lee Y, Edwards PC, Chen S, et al. CryoEM at 100 keV: a demonstration and prospects. IUCrJ [Internet]. 2019. Nov 1;6(Pt 6):1086–98. Available from: http://www.ncbi.nlm.nih.gov/pubmed/31709064 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.McMullan G, Naydenova K, Mihaylov D, Yamashita K, Peet MJ, Wilson H, et al. Structure determination by cryoEM at 100 keV. Proc Natl Acad Sci U S A [Internet]. 2023. Dec 5;120(49):e2312905120. Available from: http://www.ncbi.nlm.nih.gov/pubmed/38011573 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Chua EYD, Mendez JH, Rapp M, Ilca SL, Tan YZ, Maruthi K, et al. Better, Faster, Cheaper: Recent Advances in Cryo-Electron Microscopy. Annu Rev Biochem [Internet]. 2022. Jun 21;91:1–32. Available from: http://www.ncbi.nlm.nih.gov/pubmed/35320683 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Liu X, Ciulli A. Proximity-Based Modalities for Biology and Medicine. ACS Cent Sci [Internet]. 2023. Jul 26;9(7):1269–84. Available from: http://www.ncbi.nlm.nih.gov/pubmed/37521793 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Antanasijevic A, Bowman CA, Kirchdoerfer RN, Cottrell CA, Ozorowski G, Upadhyay AA, et al. From structure to sequence: Antibody discovery using cryoEM. Sci Adv [Internet]. 2022. Jan 21;8(3):eabk2039. Available from: http://www.ncbi.nlm.nih.gov/pubmed/35044813 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Rapp M, Shapiro L, Frank J. Contributions of single-particle cryoelectron microscopy toward fighting COVID-19. Trends Biochem Sci [Internet]. 2022. Feb;47(2):117–23. Available from: http://www.ncbi.nlm.nih.gov/pubmed/34799235 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Che Y, Gribenko AV, Song X, Handke LD, Efferen KS, Tompkins K, et al. Rational design of a highly immunogenic prefusion-stabilized F glycoprotein antigen for a respiratory syncytial virus vaccine. Sci Transl Med [Internet]. 2023. Apr 26;15(693):eade6422. Available from: http://www.ncbi.nlm.nih.gov/pubmed/37023209 [DOI] [PubMed] [Google Scholar]
- 92.He C, Mansilla-Soto J, Khanra N, Hamieh M, Bustos V, Paquette AJ, et al. CD19 CAR antigen engagement mechanisms and affinity tuning. Sci Immunol [Internet]. 2023. Mar 31;8(81):eadf1426. Available from: http://www.ncbi.nlm.nih.gov/pubmed/36867678 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Bravo JPK, Liu M-S, Hibshman GN, Dangerfield TL, Jung K, McCool RS, et al. Structural basis for mismatch surveillance by CRISPR-Cas9. Nature [Internet]. 2022. Mar;603(7900):343–7. Available from: http://www.ncbi.nlm.nih.gov/pubmed/35236982 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Hsu H-L, Brown A, Loveland AB, Lotun A, Xu M, Luo L, et al. Structural characterization of a novel human adeno-associated virus capsid with neurotropic properties. Nat Commun [Internet]. 2020. Jun 30;11(1):3279. Available from: http://www.ncbi.nlm.nih.gov/pubmed/32606306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Zhang R, Xu G, Cao L, Sun Z, He Y, Cui M, et al. Divergent engagements between adeno-associated viruses with their cellular receptor AAVR. Nat Commun [Internet]. 2019. Aug 21;10(1):3760. Available from: http://www.ncbi.nlm.nih.gov/pubmed/31434885 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Chung JM, Durie CL, Lee J. Artificial Intelligence in Cryo-Electron Microscopy. Life (Basel, Switzerland) [Internet]. 2022. Aug 19;12(8). Available from: http://www.ncbi.nlm.nih.gov/pubmed/36013446 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Senior AW, Evans R, Jumper J, Kirkpatrick J, Sifre L, Green T, et al. Improved protein structure prediction using potentials from deep learning. Nature [Internet]. 2020;577(7792):706–10. Available from: 10.1038/s41586-019-1923-7 [DOI] [PubMed] [Google Scholar]
- 98.Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res [Internet]. 2022. Jan 7;50(D1):D439–44. Available from: http://www.ncbi.nlm.nih.gov/pubmed/34791371 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Jumper J, Hassabis D. Protein structure predictions to atomic accuracy with AlphaFold. Nat Methods [Internet]. 2022. Jan 11;19(1):11–2. Available from: http://www.ncbi.nlm.nih.gov/pubmed/35017726 [DOI] [PubMed] [Google Scholar]
- 100.Varadi M, Bertoni D, Magana P, Paramval U, Pidruchna I, Radhakrishnan M, et al. AlphaFold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res [Internet]. 2024. Jan 5;52(D1):D368–75. Available from: http://www.ncbi.nlm.nih.gov/pubmed/3793385 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1 Double conformations observed in the cryoEM structure of mouse heavy-chain apoferritin (PDB ID: 8RQB). The figure highlights the iron channel, showcasing four histidine residues coordinating the iron atom. The histidine residues exhibit double conformations, demonstrating the ability of modern high-resolution cryoEM to resolve alternative conformations in critical regions of macromolecular structures.
Table S1 This table showcases a set of approved drugs that have been structurally characterized exclusively by cryo-electron microscopy (cryoEM) according to current entries in the Protein Data Bank (PDB). At the time of this study, these drugs lack structural data from alternative methods such as X-ray crystallography or NMR, underscoring the unique role of cryoEM in providing the initial structural insights for these drug-bound complexes. Although future depositions may include structures resolved by other techniques, cryoEM will remain the pioneering method that first captured these specific drug interactions. This compilation highlights the increasing impact of cryoEM in advancing drug discovery, particularly for targets and complexes challenging to study by other structural approaches.
