Graphical abstract
Keywords: Cryo-EM, Validation, Reconstruction, 3D refinement, Independent, Raw data, BioEM
Highlights
-
•
Use of an independent particle set to validate 3D density maps from cryo-EM.
-
•
The posterior probability of the map should increase as a function of the iteration and low-pass frequency cutoff.
-
•
The similarity between the distributions of the posterior probabilities is an indicator of the map quality.
-
•
A control particle set provides valuable information for cryo-EM map validation.
Abstract
Cryo-electron microscopy (cryo-EM) has revolutionized structural biology by providing 3D density maps of biomolecules at near-atomic resolution. However, map validation is still an open issue. Despite several efforts from the community, it is possible to overfit 3D maps to noisy data. Here, we develop a novel methodology that uses a small independent particle set (not used during the 3D refinement) to validate the maps. The main idea is to monitor how the map probability evolves over the control set during the 3D refinement. The method is complementary to the gold-standard procedure, which generates two reconstructions at each iteration. We low-pass filter the two reconstructions for different frequency cutoffs, and we calculate the probability of each filtered map given the control set. For high-quality maps, the probability should increase as a function of the frequency cutoff and the refinement iteration. We also compute the similarity between the densities of probability distributions of the two reconstructions. As higher frequencies are included, the distributions become more dissimilar. We optimized the BioEM package to perform these calculations, and tested it over systems ranging from quality data to pure noise. Our results show that with our methodology, it possible to discriminate datasets that are constructed from noise particles. We conclude that validation against a control particle set provides a powerful tool to assess the quality of cryo-EM maps.
Introduction
Cryo-electron microscopy (cryo-EM) has revolutionized structural biology by providing electron density maps of biomolecules that were difficult to resolve with X-ray crystallography or nuclear magnetic resonance (Kühlbrandt, 2014, Cheng, 2015, Murata and Wolf, 2018). The introduction of direct electron detection cameras (Wu et al., 2016, McMullan et al., 2016) and novel computational algorithms (Kervrann et al., 2016, Cossio and Hummer, 2018) has enabled the reconstruction of density maps with near-atomic details. To date, an exponential-growing number of maps, and their corresponding atomic models (Afonine et al., 2018), are being deposited in the electron microscopy (Lawson et al., 2011) and protein data banks (Berman et al., 2000) (EMDB and PDB, respectively).
Typically, cryo-EM maps are reconstructed using the gold-standard procedure (Henderson et al., 2012, Scheres and Chen, 2012). The particles are divided into two sets, and two independent reconstructions are generated. The reconstructions are refined iteratively using maximum-likelihood (Sorzano et al., 2004, Tang et al., 2007) or Bayesian techniques (Scheres, 2012, Punjani et al., 2017). At each iteration, the Fourier Shell Correlation (FSC) (Saxton and Baumeister, 1982, Harauz and van Heel, 1986) between the independent reconstructions is computed. Fixed FSC threshold criteria at 0.143 (Rosenthal and Rubinstein, 2015) or 0.5 (Harauz and van Heel, 1986) are used to determine the resolution of the reconstructions (i.e., the size of the smallest reliable detail). The refinement process is halted when the resolution of the reconstructions stops improving. In the end, the maps are masked, sharpened, and a final resolution is determined.
However, despite several efforts from the cryo-EM community, map validation is still problematic. In the recent Map Challenge, it has been shown that there is no absolute ‘gold standard’ (Heymann, 2018). The protocols are user-dependent, and there can be biases due to processing workflows. For instance, in the FSC calculation, the resolution estimate is dependent on the radius of the shell in Fourier space, and on the point symmetry of the biomolecule (Van Heel and Schatz, 2005, Sorzano et al., 2017). The use of a fixed threshold for the FSC is restricted by the assumption that the noise and the signal are orthogonal (Van Heel and Schatz, 2005). In addition, the mask can be a source for overestimating the resolution (Penczek, 2010, Pintilie et al., 2016, Rosenthal and Rubinstein, 2015). Therefore, the best criteria to estimate the map resolution are still debated in the cryo-EM community (Sorzano et al., 2017). For example, inconsistencies when reporting the resolution have been highlighted in Ref. (Afonine et al., 2018), where the values of the resolution reported in the model (from the PDB) and in the map (from the EMDB) were different for nearly one-third of the deposited data. Moreover, it has been found that more than 70 of the maps in the EMDB have moderate to low agreement with the model, mostly because of the limited resolvable features of the maps (Neumann et al., 2018). In extreme cases, maps can be reconstructed from pure-noise images (Henderson, 2013, Shatsky et al., 2009). These issues can lead to cryo-EM maps built from aligned noise.
Therefore, methods that validate the quality of the maps and models are fundamental for cryo-EM. Several methods use the FSC curve to assess the quality of the reconstructions. By discarding shells from the reference map used for the alignment, Shaikh et al. (Shaikh et al., 2003) encountered a different behaviour of the FSC curve between reconstructions that were made from pure-noise images or real particles. Signatures of overfitting can be detected by randomizing the phases beyond a certain frequency (Scheres and Chen, 2012, Chen et al., 2013), for non-overfitted maps, the FSC should drop close to zero after that frequency. Estimates of the map resolution, which take into account the symmetry of the molecule and the non-orthogonality of the signal and noise, are obtained with the 1/2 bit non-fixed FSC threshold (Van Heel and Schatz, 2005, Afanasyev et al., 2017). The local resolution in a map can be evaluated using the background noise of the reconstruction (Kucukelbir et al., 2014) or by masking different regions with the FSC (Cardone et al., 2013, Pintilie et al., 2016). For other methods, the particle alignment provides quality indicators of the reconstruction, for example, using tilt-pair analysis (Rosenthal, 2016, Rosenthal and Rubinstein, 2015, Rosenthal and Henderson, 2003, Henderson et al., 2011, Penczek et al., 1994) or by assessing the reproducibility of the orientation assignment (Vargas et al., 2017, Vargas et al., 2016). Moreover, several metrics that monitor cross-correlations in real or Fourier space between the maps and models indicate the reliability of the resolution (Afonine et al., 2018, Neumann et al., 2018, Brown et al., 2015). Recently, deep learning algorithms have been introduced to automatically classify maps into high, medium, and low resolution (Avramov et al., 2019). These methods have the limitation that they do not use the raw data, which ultimately come from the individual particles, but they only use the maps or models that are the product of processing and averaging.
In comparison to the widely used cross-validation methods for X-ray crystallography (Brünger, 1992) and nuclear magnetic resonance (Brünger et al., 1993, Clore and Garrett, 1999), there are few methods available for cryo-EM (Shaikh et al., 2003, Falkner and Schroeder, 2013). Heymann (Heymann, 2015) showed that reconstructions from sets of real particles have higher resolutions than reconstructions from pure-noise particles, which can be used as a consistency test of the data. However, this test requires processing and averaging the particles for generating the reconstructions and extracting the resolutions.
Here, we propose an unbiased strategy that validates cryo-EM reconstructions using a small control set of particles that are omitted from the refinement process. We do not focus on determining a specific value for the resolution, but the main idea is to monitor how the performance of the reconstructions evolves during the refinement procedure over unbiased and independent data. We first calculate the Bayesian inference of electron microscopy (BioEM) (Cossio and Hummer, 2013, Cossio et al., 2017) probability of the maps, given the set of independent particles, as a function of a low-pass frequency cutoff of the reconstructions. High-quality maps should increase in probability for higher frequency cutoffs and higher refinement iterations. We then show that the similarity between the probability distributions of the two reconstructions from the gold-standard procedure is an additional quality indicator. Finally, we test the method on different systems and asses its effectiveness to discriminate high quality maps, which are reconstructed from true signal instead of noise.
Methods
Benchmark systems
We used the following benchmarks that represent diverse biomolecular families and cryo-EM systems:
The human hyperpolarization-activated cyclic nucleotide-gated channel (HCN1) is a voltage-dependent ion channel, which was resolved to high resolution using cryo-EM (Lee and MacKinnon, 2017). The system was resolved in two conformational states, an apo state and a cAMP-bound state, to Åusing RELION 3D-refinement (Scheres, 2012). 55870 particles belonging to the apo state together with the defocus information are available in the Electron Microscopy Public Image Archive (EMPIAR) (Iudin et al., 2016) with code 10081.
The recombination-activating genes RAG1-RAG2 form a complex (RAG1-RAG2) that plays an essential role in the generation of antibodies and antigen-receptor genes in a process called V(D) J recombination. Two main structures of the RAG1-RAG2 complex can be distinguished during the V(D) J recombination, a synaptic paired complex and the signal end complex (SEC). These states were resolved to 3.7 and 3.4 Å, respectively, using cryo-EM (Ru and Others, 2015). 81946 processed picked particles from the SEC state are deposited in the EMPIAR data bank with code 10049. The defocus information is available for these particles.
The mammalian transient receptor potential TRPV1 ion channel (TRPV1) is the receptor for capsaicin. Its structure was determined to 3.4 Åusing cryo-EM (Maofu et al., 2013). A set of 35645 processed particles for this system are found in the EMPIAR data bank with code 10005. The defocus information is also available for these particles.
The human immunodeficiency virus type 1 envelope glycoprotein trimer (HIV-ET) is a membrane-fusing machine which mediates virus entry into host cells. The structure of the apo HIV-1 envelope glycoprotein in the trimer-conformation was determined to 6 Åusing the 0.5 FSC threshold with cryo-EM (Mao et al., 2013). A set of 124478 particles used in the refinement process is available in EMPIAR with code 10008. The defocus information is also available for these particles.
For all of the above cases, a subset of 5000 particles was randomly selected to be used as the control set. Specifically, these particles are not used in the refinement processes.
Pure-noise images: we generated a set of synthetic 1000 pure-noise particles. Each particle contains random intensities following a Gaussian distribution with zero mean and unit variance (for details see the Supplementary Information). These images were used as a “false” control set to assess the RAG1-RAG2 reconstructions.
3D refinement
The RELION (Scheres, 2012) software was used to reconstruct the cryo-EM maps. For all systems, we assume that the deposited particles correspond to the same state. Therefore, the preprocessing steps of 2D or 3D classification are not performed. As the initial reference map for the 3D refinement, we use the final map reported by the authors low-pass filtered to 60 Å. This was done to minimize the risk of model bias (Scheres and Chen, 2012). The 3D-refinement procedure implements the gold-standard approach by splitting the data into two random halves (sets ) and performing two independent reconstructions. We note that the number of particles used for these reconstructions was slightly less than those of the original works because the particles from the control set were taken out. In all cases, we used the RELION default parameters, and point-group symmetries reported by the authors. Table 1 summarizes the results obtained from the 3D refinement. The resolutions are in accordance with the reported ones, taking into account that the post-processing steps were not performed, and that the control set of particles was excluded from the refinement.
Table 1.
Summary of the results from the 3D-refinement using RELION (Scheres, 2012) for the cryo-EM systems.
| System | #Particles | Symmetry | #iterations | Final resolution∗ |
|---|---|---|---|---|
| HCN1 | 50870 | C4 | 17 | 4.2 Å |
| RAG1-RAG2 | 79946 | C2 | 21 | 3.8 Å |
| TRPV1 | 30645 | C4 | 24 | 5.3 Å |
| HIV-ET | 119478 | C3 | 10 | 9.9 Å |
using the 0.143 FSC threshold.
Low-pass filtering
Consider a map m generated from an iteration of the 3D refinement. Let be its 3D-Fourier transform, where is the reciprocal vector. We perform a low-pass filter on the map, , up to a frequency cutoff . The resulting filtered map is
We use the code lowpassmap_fftw available from the Rubinstein lab webpage (Rubinstein, xxxx) to perform this calculation. We then convert the map into real space by applying the inverse Fourier transform of . The real-space filtered map is masked and then used as input for the BioEM computation (see below).
BioEM posterior probabilities
The BioEM method (Cossio and Hummer, 2013) uses a Bayesian framework to quantify the consistency between an experimental image and a given map m (or model) by calculating a posterior probability . BioEM takes into account the relevant physical parameters () for the image formation: center displacement, normalization, offset, noise, orientation and CTF parameters (defocus, amplitude, and B-factor). is calculated by integrating-out all parameters
| (1) |
where are the prior probabilities parameters and is the likelihood function. For ensemble determinations, the posterior can be multiplied by the prior probabilities of each model, as described in Ref. Cossio and Hummer (2013). We considered the prior probabilities of maps, orientations and center displacement uniform over the integration intervals. Gaussian priors for the CTF parameters were used similarly as in Ref. Cossio et al. (2018). In Eq. (1), the integrals over the offset, noise and normalization are performed analytically (Cossio and Hummer, 2013), and that over the center displacement is described in Ref. Cossio et al. (2017). The integral over the orientations and CTF defocus is done using a double-round algorithm, which is described in the following subsection.
Similarly as in Ref. Cossio and Hummer (2013), we define a noise model where is the number of pixels and is the image variance (by default ). is used as a reference to compare the posterior probabilities to a hypothetical model composed of Gaussian noise of zero mean and unit variance. We note that is a constant for each system, and ) estimates how many log-units the map is more probable than the Gaussian noise model for particle .
BioEM algorithm
To optimize the computations, we divided the BioEM posterior calculation into two rounds. The objective of the first round is to obtain the best orientations for each particle. In this round, an all-orientations to all-particles algorithm is performed (Cossio et al., 2017). As the BioEM input map, we used the final reconstruction from the refinement with a mask and without low-pass filtering. To sample the orientations, we used 36864 quaternions that sample uniformly orientation space (Yershova et al., 2010). The particles were grouped into sets with similar experimental defocus with range, and an independent orientation search was performed for each group. In this round, the best 10 orientations for each particle are extracted. An example of the BioEM input for the first round is presented in the Supplementary Information.
Starting in round 2, we calculate the BioEM posterior probability using a fixed list of quaternions and the experimental defocus for each particle, for all low-pass filtered reconstructions from the refinement iterations. The list of quaternions contains 1250 orientations concentrated around on the 10 best orientations for each particle from BioEM round 1. Therefore, the optimal orientations and translations are determined at every iteration for every low-passed filtered map. However, we note that the space of possible angles is not the entire but it is the reduced list of 1250 quaternions determined only once for each particle in round 1. This procedure is described in detail in Ref. Cossio et al. (2018).
In round 2, we used 8 filtering-frequencies for each reconstruction; these were distributed uniformly from to where is the pixel size. If one wants to analyze the behavior at very low-resolution or very high-resolution, the frequency-cutoff range should be modified (e.g. as in Supplementary Fig.1). All reconstructions were masked using the same mask as for round 1. An example of the BioEM input file for round 2 is presented in the Supplementary Information.
BioEM code
The BioEM code has been extended with several optimizations, which drastically increase performance for the second round of calculations. Most importantly, the main code structure and algorithm were modified to allow for a parallel comparison of multiple orientations to a single particle image. Initial reading of the input files has been parallelized, and the overall memory consumption decreased. These code changes lead to more efficient utilization of the computing resources, and hence to a faster calculation of posterior probabilities, especially for the workloads specific to the second round. For more information, we refer the reader to the BioEM user manual: https://readthedocs.org/projects/bioem/.
Normalized Jensen-Shannon divergence
Measuring a distance among probability distributions is a common task in statistics. Most measures include concepts from information theory, such as the Kullback–Leibler divergence (Kullback, 1968, Lin, 1991) or the Shannon entropy (Cover and Thomas, 2006). In this work, we measure the statistical similarity between the probability distributions from reconstructions from set 1 and set 2 calculated over the control set. We define a metric that is the Jensen-Shannon divergence (Cover and Thomas, 2006, Lin, 1991) normalized by the individual Shannon entropies
| (2) |
where and are the probabilities of the reconstructions from set 1 and 2, respectively. and are not normalized in the BioEM calculation, therefore, to calculate Eq. (2), we enforce the simple normalization for each image , frequency cutoff and iteration. We note that to enforce this normalization both probabilities from set 1 and 2 are required. For simplicity of notation, we have omitted the dependency of the probabilities on the frequency cutoff .
In Eq. (2), the numerator measures the similarity between the densities of probability distributions, and the Shannon entropies in the denominator play the role of a normalization factor. Some important properties of the NJSD metric are that it is positive, symmetric and its lower bound is 0 if and only if for all particles .
Results and discussion
Validation protocol using an independent particle set
We propose a statistical framework for the validation of cryo-EM reconstructions. The validation analysis is done over a small control set of particles not used in the refinement process. Our main objective is to assess how well the maps perform over unseen data, which is the core of cross-validation. Therefore, having an independent set guarantees that the estimate of the quality of the reconstructions is free of biases from the optimization of the target function used in the refinement.
Fig. 1 shows the work-flow of the methodology. The refinement is done following the gold-standard procedure (Fig. 1-left), where two reconstructions are generated at each iteration step. These two reconstructions are validated using the independent particle set (Fig. 1-right). At each iteration, the two 3D maps are low-pass filtered to different frequency cutoffs, (see Methods). The BioEM (Cossio and Hummer, 2013) probability, , is calculated for each gold-standard reconstruction, , over the control set of images () with N particles. As a first validation test, we monitor the map evidence, , as a function of for each set i. This evidence should increase or remain constant as higher frequencies are added to the maps. Failing this test is a prime indicative that there is a problem in the refinement process.
Fig. 1.
Unbiased validation protocol for cryo-EM maps using an independent particle set. (left) Gold-standard refinement procedure in cryo-EM. Two particle sets are used to generate two independent reconstructions. These reconstructions are compared using the Fourier shell correlation (FSC). A fixed FSC threshold is used to extract the resolution of the reconstructions. The process is iterated until the resolution stops improving. (right) Novel validation protocol using a small independent particle set not included in the gold-standard refinement. At each iteration of the refinement, the reconstructions are low-pass filtered to different frequency cutoffs . The BioEM probabilities (Cossio and Hummer, 2013, Cossio et al., 2017), over the independent control set, are calculated as a function of . Two tests validate the quality of the reconstructions: 1) the map evidence of the log-posterior and 2) the statistical similarity between the probability distributions (measured with a normalized Jensen-Shannon divergence). The results from both tests should increase as a function of the frequency cutoff. The maps represented correspond to the RAG1-RAG2 comple.x (see Methods).
The second validation test consists on measuring the similarity between the probability distributions of the two reconstructions, also as a function of the frequency cutoff. For this purpose, we calculate a normalized Jensen-Shannon divergence () (see Methods). The NJSD is a positive, symmetric and bound metric that measures how distinguishable are the probability distributions from the reconstructions sets 1 and 2. We expect that, as more frequencies are added to the reconstructions, more noise is added, and the densities of probability distributions are less similar.
In the following, we describe in detail the two validation tests.
Map evidence from the log-posterior
We tested the methodology over several cryo-EM datasets: the synaptic RAG1-RAG2 complex (RAG1-RAG2) (Ru and Others, 2015), the human HCN1 channel (HCN1) (Lee and MacKinnon, 2017), and the TRPV1 ion channel (TRPV1) (Maofu et al., 2013). These systems represent a diverse set of biomolecular families, with membrane proteins and protein-nucleic acid complexes. The reconstruction refinement was performed using the gold-standard procedure in RELION (Scheres, 2012). The final resolution of these systems ranges from approximately 3 to 6 Å(see Methods). To analyze the impact of aligning noise, we studied two additional systems: cryo-EM reconstructions from the HIV-1 envelop trimer (HIV-ET) (Mao et al., 2013) and a set of synthetic pure-noise images that acts as a ‘false’ control set with the RAG1-RAG2 reconstructions (see Methods). This was motivated by the fact that some reconstructions might have been generated from pure-noise particles, and their resolution might have been over-estimated (Subramaniam, 2013, Henderson, 2013, Shatsky et al., 2009).
In Fig. 2, we examine the improvement of the maps by monitoring the sum of the log-posterior relative to noise , over the control set with , as a function of for the reconstructions . The results are shown for different refinement iterations with a gradient color scheme (first iteration: maroon; last iteration: green). These results measure how probable each filtered map is relative to . For the RAG1-RAG2, HCN1 and TRPV1 systems, we find an increase of the map evidence (given by the sum of the log-posterior) as a function of the frequency cutoff. For very high frequencies, the map evidence plateaus. We only observe minor differences between the results obtained for reconstructions and 2 (solid and dashed lines, respectively, in Fig. 2). This is an indication of the similarity between the two reconstructions. Importantly, the results highlight the ability of the BioEM posterior to correctly rank maps of different resolutions. For example, in Supplementary Fig. 1, we show the log-posterior for the first five iterations of the refinement of the TRPV1 system. Even for low-resolution maps, as the iteration increases (i.e., the map resolution) so does the map evidence. The reconstructions from the last iterations (i.e., the most refined with highest resolution) are the most probable. This is in agreement with what one expects from the 3D-refinement algorithms (Cossio and Hummer, 2018), and from previous studies that use the posterior probability to discriminate between conformational models (Cossio and Hummer, 2013, Cossio et al., 2018).
Fig. 2.
The sum of the log-posterior relative to noise , over the control set with N particles, as a function of the frequency cutoff for reconstructions from set and 2 (solid and dashed lines, respectively). The results are shown for different refinement iteration steps with a gradient color code: the first iteration is maroon and the last iteration is green. On the top row, we show the results for the cryo-EM systems where we expect refinement to work properly: HCN1, TRPV1 and RAG1-RAG2 for . Systems that present signs of treating noise as signal, the HIV-ET with and a noise particle control set with , are shown in the bottom row, highlighted with a red box.
In contrast, for the HIV-ET and noise-particle set, we find a different behavior of the map evidence. We find that the sum of the log-posterior does not increase as a function of the frequency cutoff but decreases or remains constant. For the noise-particle set, the map evidence relative to is small, and the differences between iterations are almost two orders of magnitude smaller than for the validated sets. Moreover, for this case, as the refinement iteration proceeds, the maps are slightly less probable over the control set. This analysis monitors map quality in cryo-EM: if the map evidence does not increase as a function of the frequency cutoff and the refinement iteration, then there are signs of spurious data in the refinement.
Similarity between the probability distributions
As a second validation test, we compare the distributions of the posterior probabilities generated by the reconstructions from sets over the control set. In the Supplementary Information, we show an example of the probability distributions for the HCN1 system for two frequency cutoffs at a given iteration (Supplementary Fig. 2-top). We find that the probability distributions, over the independent set, are quite similar for both reconstructions. However, there are small differences between them, and the higher-frequency maps present larger fluctuations (Supplementary Fig. 2-bottom). These differences can be quantified using a normalized Jensen-Shannon divergence (NJSD; see Methods).
In Fig. 3, we plot the as a function of the frequency cutoff . Interestingly, for the RAG1-RAG2, HCN1 and TRPV1 systems, we observe that the increases as higher frequencies are included in the filtered maps. This implies that the probability distributions between maps with higher frequencies are less similar, possibly because they are more uncorrelated due to the high-frequency noise. For these standard systems, we also find that as the iteration increases the NJSD reaches at higher frequencies a plateau value. This behavior can be fitted with an inverse exponential function (see below and solid lines in Fig. 3). By contrast, for the HIV-ET and noise-particle sets, we find that the NJSD remains constant or has random behavior, suggesting that distributions do not consistently change when higher frequencies are added to the maps. For HIV-ET, we notice further that the values are ten times larger than for any of the other systems, indicating that the two gold-standard refinements are highly inconsistent with each other.
Fig. 3.
Normalized Jensen-Shannon divergence (NJSD) as a function of the frequency cutoff. This metric calculates the similarity between the distributions of the BioEM probabilities computed for the two reconstructions from sets 1 and 2. We use a gradient color code for the refinement iteration steps: the first iteration is maroon and the last iteration is green. On the top row, we show the results for the systems where standard cryo-EM refinement is expected to work: HCN1, TRPV1 and RAG1-RAG2. For these systems, we fit the data points to an inverse exponential function (solid lines). Systems that treat noise as signal due to the alignment, a noise particle control set and HIV-ET, are shown in the bottom row with the dashed lines as a guide, and highlighted by a red box. The number of particles in the control sets are the same as for the data in Fig. 2. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Validation tests versus resolution
We explored how the results depend on the map resolution. For the HCN1, TRPV1 and RAG1-RAG2 systems, we find that the curves can be fitted to an inverse exponential function, (solid lines shown in Fig. 3). Intuitively, the frequency indicates where the plateau of the NJSD is reached. In Fig. 3, we can qualitatively see that increases with each refinement iteration. In Fig. 4, we plot the frequency as a function of the inverse of the resolution (calculated using the FSC at the threshold 0.143). Interestingly, we find that the frequency is highly correlated to the inverse of the resolution with correlation coefficient , and , for HCN1, TRPV1 and RAG1-RAG2, respectively. In Supplementary Fig. 3, we compare to the resolution obtained using the 1/2 bit non-fixed FSC threshold (Van Heel and Schatz, 2005), finding similar correlations. These results show that even from a small control set it is possible to extract unbiased information of the map resolution. We note that for the HIV-ET and noise-particle sets it is not possible to fit the NJSD data to an inverse exponential function. Therefore, we can only estimate the correlation between and the inverse of the resolution for the standard cryo-EM systems.
Fig. 4.
Frequency versus the inverse of the resolution for the standard cryo-EM systems: HCN1, TRPV1 and RAG1-RAG2. The NJSD curves for these systems were fitted to an inverse exponential function where is the frequency. We find large correlations between and the inverse of the resolution (calculated using the 0.143 criteria). The correlation coefficients are , and , for HCN1, TRPV1 and RAG1-RAG2, respectively. Solid lines show the linear fits to the individual sets. Black dashed line shows the global fit with parameters where R is the resolution. The correlation coefficient for the global fit is .
Convergence over a small control set
We assessed how the results depend on the number of particles in the control set. In Supplementary Fig. 4, we show an example of the map evidence and NJSD as a function of the number of particles in the control set. We find that after approximately 1000 particles these observables converge, suggesting that only a small set is needed to perform the analysis. This is confirmed in Supplementary Fig. 5, where we plot the map evidence and NJSD as a function of the frequency cutoff for a validation set of 1000 particles. For the same set, in Supplementary Fig. 6, we plot the frequency as a function of the inverse of the map resolution, showing high correlation between the two for the standard cryo-EM systems. These results are very similar to those obtained for the validation set with 5000 particles.
Conclusions
In this work, we have developed a novel methodology for validating cryo-EM reconstructions. The procedure is performed over a small particle set that is not used to generate the reconstructions. The independence of the particle set ensures that there are no biases due to the refinement in the validation.
Two validation tests are proposed to assess the performance of the maps over raw unbiased data. The first consists on monitoring the map evidence as a function of a low-pass filter frequency cutoff. The posterior should increase as a function of the frequency cutoff and the refinement iteration. In the second test, we assess the similarity between the probability distributions generated from the two reconstructions from the gold-standard procedure. The distributions should become less similar as higher frequencies are added to the reconstructions.
We performed the validation tests over several systems: three standard cryo-EM reconstruction sets, and two datasets that mimic the treatment of noise as signal. The results show substantial differences. While for the standard cryo-EM sets the results are as expected, the noisy datasets present almost no increment (and sometimes even decrease) of the map evidence or the NJSD. Thus, signatures of purious signal can be monitored by measuring the maps’ performance over an independent small set of particles. However, the method is still to be tested over more challenging datasets (not just those composed of mostly noisy particles). For example, for detecting the point at which noise is started to be treated as signal during the refinement.
The mathematical framework is not only valid for the BioEM posterior but also for any posterior probability that measures the likelihood of a 3D density given a particle set. However, an accurate estimate of the posterior probability is important. For example, if a large percentage of particles are misaligned then both the map evidence and NJSD will be underestimated (see Supplementary Fig. 7). With accurate posterior estimates, we find that the tests converge over a small particle set, typically only 1000 particles. We note that for the evaluated systems, the particle sets were selected from the EMPIAR dataset (Iudin et al., 2016) after 3D classification, which reduces the variability and heterogeneity of the particles. Therefore, the consequences of the particle picking and classification algorithms on the validation methodologies remain to be assessed.
The proposed methodology is still to be examined over flexible and heterogeneous biomolecules. However, preliminary tests using synthetic particles, which belonged to different conformations, showed that with the posterior probability, it is possible to determine the correct conformational ensemble (Cossio and Hummer, 2013). Moreover, novel reconstruction methods (Moscovich et al., 2019, Lederman et al., 2019, Zhong et al., 2019) that use machine learning techniques for mapping conformational variability will highly benefit from cross-validation procedures because for continuous heterogeneous ensembles the traditional FSC methods fail.
In summary, our methodology provides an unbiased basis to validate cryo-EM maps. Moreover, it has the potential to be applicable for directly validating atomic models (instead of 3D maps) using an independent set. We conclude that having a control particle set which is not tampered to generate reconstructions is beneficial for validating cryo-EM applications.
Data availability
The BioEM code is available at https://github.com/bio-phys/BioEM. A tutorial to perform the cross-validation protocol is available at: https://github.com/bio-phys/BioEM-tutorials.
Funding
S.O. and P.C. were supported by Colciencias, University of Antioquia and Ruta N, Colombia. G.H., and P.C. acknowledge the support of the Max Planck Society. Some computations were performed on a local server with an NVDIA Titan X GPU. PC gratefully acknowledges the support of NVIDIA Corporation for the donation of this GPU. Other computations were performed at the Max Planck Computing and Data Facility.
Credit Author Statement
G.H. and P.C. conceived the idea. S.O. and P.C. developed the analysis and methodology. L.S. and M.R. developed and optimized the software. B. A. R validated the results. All authors wrote and reviewed the manuscript.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
The authors thank Dr. Alessandro Laio for insightful discussions, Dr. Jose Maria Carazo for information about the experimental systems, and Dr. Frank Avila for proof reading.
Footnotes
Supplementary data associated with this article can be found, in the online version, at https://doi.org/10.1016/j.yjsbx.2020.100032.
Supplementary data
The following are the Supplementary data to this article:
References
- Afanasyev P., Seer-Linnemayr C., Ravelli R.B.G., Matadeen R., De Carlo S., Alewijnse B., Portugal R.V., Pannu N.S., Schatz M., van Heel M. Single-particle cryo-EM using alignment by classification (ABC): the structure of Lumbricus terrestris haemoglobin. IUCrJ. 2017;4:678–694. doi: 10.1107/S2052252517010922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Afonine, P.V., Klaholz, B.P., Moriarty, N.W., Poon, B.K., Sobolev, O.V., Terwilliger, T.C., Adams, P.D., Urzhumtsev, A., IUCr, 2018. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallographica Section D Structural Biology 74, 814–840. [DOI] [PMC free article] [PubMed]
- Avramov T.K., Vyenielo D., Gomez-Blanco J., Adinarayanan S., Vargas J., Si D. Deep learning for validating and estimating resolution of cryo-electron microscopy density maps. Molecules. 2019;24 doi: 10.3390/molecules24061181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berman H., Westbrook J., Feng Z., Gilliland G., Bhat T., Weissig H., Shindyalov I., Bourne P. The protein data bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown, A., Long, F., Nicholls, R.A., Toots, J., Emsley, P., Murshudov, G., IUCr, 2015. Tools for macromolecular model building and refinement into electron cryo-microscopy reconstructions. Acta Crystallographica Section D Biological Crystallography 71, 136–153. [DOI] [PMC free article] [PubMed]
- Brünger A.T. Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. Nature. 1992;355:472–475. doi: 10.1038/355472a0. [DOI] [PubMed] [Google Scholar]
- Brünger A., Clore G., Gronenborn A., Saffrich R., Nilges M. Assessing the quality of solution nuclear magnetic resonance structures by complete cross-validation. Science. 1993;261:328–331. doi: 10.1126/science.8332897. [DOI] [PubMed] [Google Scholar]
- Cardone G., Heymann J.B., Steven A.C. One number does not fit all: mapping local variations in resolution in cryo-EM reconstructions. Journal of Structural Biology. 2013;184:226–236. doi: 10.1016/j.jsb.2013.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen S., McMullan G., Faruqi A.R., Murshudov G.N., Short J.M., Scheres S.H., Henderson R. High-resolution noise substitution to measure overfitting and validate resolution in 3D structure determination by single particle electron cryomicroscopy. Ultramicroscopy. 2013;135:24–35. doi: 10.1016/j.ultramic.2013.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng Y. Single-particle cryo-EM at crystallographic resolution. Cell. 2015;161:450–457. doi: 10.1016/j.cell.2015.03.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clore G.M., Garrett D.S. R-factor, free r, and complete cross-validation for dipolar coupling refinement of nmr structures. Journal of the American Chemical Society. 1999;121:9008–9012. doi: 10.1021/ja991789k. [DOI] [Google Scholar]
- Cossio P., Hummer G. Bayesian analysis of individual electron microscopy images: Towards structures of dynamic and heterogeneous biomolecular assemblies. Journal of Structural Biology. 2013;184:427–437. doi: 10.1016/j.jsb.2013.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cossio P., Hummer G. Likelihood-based structural analysis of electron microscopy images. Current Opinion in Structural Biology. 2018;49:162–168. doi: 10.1016/j.sbi.2018.03.004. [DOI] [PubMed] [Google Scholar]
- Cossio P., Rohr D., Baruffa F., Rampp M., Lindenstruth V., Hummer G. BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images. Computer Physics Communications. 2017;210:163–171. [Google Scholar]
- Cossio P., Allegretti M., Mayer F., Mueller V., Vonck J., Hummer G. Bayesian inference of rotor ring stoichiometry from electron microscopy images of archaeal ATP synthase. Microscopy. 2018;67:266–273. doi: 10.1093/jmicro/dfy033. [DOI] [PubMed] [Google Scholar]
- Cover T.M., Thomas J.A. second ed. Wiley-Interscience; 2006. Elements of Information Theory. [Google Scholar]
- Falkner B., Schroeder G.F. Cross-validation in cryo-EM-based structural modeling. Proceedings of the National Academy of Sciences of the United States of America. 2013;110:8930–8935. doi: 10.1073/pnas.1119041110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harauz G., van Heel M. Exact filters for general geometry three dimensional reconstruction. Optik. 1986;78:6–30. [Google Scholar]
- Henderson R. Avoiding the pitfalls of single particle cryo-electron microscopy: Einstein from noise. Proceedings of the National Academy of Sciences. 2013;110:18037–18041. doi: 10.1073/pnas.1314449110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henderson R., Chen S., Chen J.Z., Grigorieff N., Passmore L.A., Ciccarelli L., Rubinstein J.L., Crowther R.A., Stewart P.L., Rosenthal P.B. Tilt-pair analysis of images from a range of different specimens in single-particle electron cryomicroscopy. Journal of Molecular Biology. 2011;413:1028–1046. doi: 10.1016/j.jmb.2011.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henderson R., Sali A., Baker M.L., Carragher B., Devkota B., Downing K.H., Egelman E.H., Feng Z., Frank J., Grigorieff N., Jiang W., Ludtke S.J., Medalia O., Penczek P.A., Rosenthal P.B., Rossmann M.G., Schmid M.F., Schroeder G.F., Steven A.C., Stokes D.L., Westbrook J.D., Wriggers W., Yang H., Young J., Berman H.M., Chiu W., Kleywegt G.J., Lawson C.L. Outcome of the first electron microscopy validation task force meeting. Structure. 2012;20:205–214. doi: 10.1016/j.str.2011.12.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heymann J.B. Validation of 3D EM Reconstructions: the phantom in the noise. AIMS Biophysics. 2015;2:21–35. doi: 10.3934/biophy.2015.1.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heymann J.B. Map challenge assessment: Fair comparison of single particle cryoEM reconstructions. Journal of Structural Biology. 2018;204:360–367. doi: 10.1016/j.jsb.2018.07.012. [DOI] [PubMed] [Google Scholar]
- Iudin A., Korir P.K., Salavert-Torres J., Kleywegt G.J., Patwardhan A. EMPIAR: a public archive for raw electron microscopy image data. Nature Methods. 2016;13:387–388. doi: 10.1038/nmeth.3806. [DOI] [PubMed] [Google Scholar]
- Kervrann C., Sanchez Sorzano C.O., Acton S.T., Olivo-Marin J.C., Unser M. A guided tour of selected image processing and analysis methods for fluorescence and electron microscopy. IEEE Journal of Selected Topics in Signal Processing. 2016;10:6–30. [Google Scholar]
- Kucukelbir A., Sigworth F.J., Tagare H.D. Quantifying the local resolution of cryo-EM density maps. Nature Methods. 2014;11:63–65. doi: 10.1038/nmeth.2727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kühlbrandt W. Cryo-EM enters a new era. eLife. 2014;3 doi: 10.7554/eLife.03678. e03678. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kullback S. Information theory and statistics. Dover. 1968 Publications. [Google Scholar]
- Lawson C.L., Baker M.L., Best C., Bi C., Dougherty M., Feng P., van Ginkel G., Devkota B., Lagerstedt I., Ludtke S.J., Newman R.H., Oldfield T.J., Rees I., Sahni G., Sala R., Velankar S., Warren J., Westbrook J.D., Henrick K., Kleywegt G.J., Berman H.M., Chiu W. EMDataBank.org: unified data resource for CryoEM. Nucleic Acids Res. 2011;39:D456–D464. doi: 10.1093/nar/gkq880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lederman, R.R., Anden, J., Singer, A., 2019. Hyper-Molecules: on the Representation and Recovery of Dynamical Structures, with Application to Flexible Macro-Molecular Structures in Cryo-EM. https://arxiv.org/abs/1907.01589. [DOI] [PMC free article] [PubMed]
- Lee C.H., MacKinnon R. Structures of the human HCN1 hyperpolarization-activated channel. Cell. 2017;168:111–120. doi: 10.1016/j.cell.2016.12.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin J. Divergence measures based on the Shannon entropy. IEEE Transactions on Information Theory. 1991;37:145–151. doi: 10.1109/18.61115. [DOI] [Google Scholar]
- Mao Y., Wang L., Gu C., Herschhorn A., Desormeaux A., Finzi A., Xiang S.H., Sodroski J.G. Molecular architecture of the uncleaved HIV-1 envelope glycoprotein trimer. Proceedings of the National Academy of Sciences. 2013;110:12438–12443. doi: 10.1073/pnas.1307382110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maofu Liao, Erhu Cao, David Julius, Yifan Cheng. Structure of the TRPV1 ion channel determined by electron cryo-microscopy. Nature. 2013;504:107. doi: 10.1038/nature12822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McMullan G., Faruqi A., Henderson R. Direct electron detectors. Methods in Enzymology. 2016;579:1–17. doi: 10.1016/bs.mie.2016.05.056. [DOI] [PubMed] [Google Scholar]
- Moscovich, A., Halevi1, A., Anden, J., Singer, A., 2019. Cryo-EM reconstruction of continuous heterogeneity by Laplacian spectral volumes. Inverse Problems. [DOI] [PMC free article] [PubMed]
- Murata K., Wolf M. Cryo-electron microscopy for structural analysis of dynamic biological macromolecules. Biochimica et Biophysica Acta. 2018;1862:324–334. doi: 10.1016/j.bbagen.2017.07.020. [DOI] [PubMed] [Google Scholar]
- Neumann P., Dickmanns A., Ficner R. Validating resolution revolution. Structure. 2018;26:785–795.e4. doi: 10.1016/j.str.2018.03.004. [DOI] [PubMed] [Google Scholar]
- Penczek P.A. Resolution measures in molecular electron microscopy. Methods in Enzymology. 2010;482:73–100. doi: 10.1016/S0076-6879(10)82003-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Penczek P., Grassucci R., Frank J. The ribosome at improved resolution: new techniques for merging and orientation refinement in 3D cryo-electron microscopy of biological particles. Ultramicroscopy. 1994;53:251–270. doi: 10.1016/0304-3991(94)90038-8. [DOI] [PubMed] [Google Scholar]
- Pintilie G., Chen D.H., Haase-Pettingell C.A., King J.A., Chiu W. Resolution and probabilistic models of components in CryoEM maps of mature p22 bacteriophage. Biophysical Journal. 2016;110:827–839. doi: 10.1016/j.bpj.2015.11.3522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Punjani A., Rubinstein J.L., Fleet D.J., Brubaker M.A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nature Methods. 2017;14:290–296. doi: 10.1038/nmeth.4169. [DOI] [PubMed] [Google Scholar]
- Rosenthal P.B. Testing the validity of single-particle maps at low and high resolution. Methods Enzymology. 2016;579:227–253. doi: 10.1016/bs.mie.2016.06.004. [DOI] [PubMed] [Google Scholar]
- Rosenthal P.B., Henderson R. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy. Journal of Molecular Biology. 2003;333:721–745. doi: 10.1016/j.jmb.2003.07.013. [DOI] [PubMed] [Google Scholar]
- Rosenthal P.B., Rubinstein J.L. Validating maps from single particle electron cryomicroscopy. Current Opinion in Structural Biology. 2015;34:135–144. doi: 10.1016/j.sbi.2015.07.002. [DOI] [PubMed] [Google Scholar]
- Ru, H., Others, 2015. Molecular Mechanism of V(D)J Recombination from Synaptic RAG1-RAG2 Complex Structures. Cell 163, 1138–1152. [DOI] [PMC free article] [PubMed]
- Rubinstein, J., Rubinstein Lab. https://sites.google.com/site/rubinsteingroup/home.
- Saxton W.O., Baumeister W. The correlation averaging of a regularly arranged bacterial cell envelope protein. Journal of Microscopy. 1982;127:127–138. doi: 10.1111/j.1365-2818.1982.tb00405.x. [DOI] [PubMed] [Google Scholar]
- Scheres S.H.W. RELION: implementation of a Bayesian approach to cryo-EM structure determination. Journal of Structural Biology. 2012;180:519–530. doi: 10.1016/j.jsb.2012.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scheres S.H.W., Chen S. Prevention of overfitting in cryo-EM structure determination. Nature Methods. 2012;9:853. doi: 10.1038/nmeth.2115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shaikh T., Hegerl R., Frank J. An approach to examining model dependence in EM reconstructions using cross-validation. Journal of Structural Biology. 2003;142:301–310. doi: 10.1016/S1047-8477(03)00029-7. [DOI] [PubMed] [Google Scholar]
- Shatsky M., Hall R.J., Brenner S.E., Glaeser R.M. A method for the alignment of heterogeneous macromolecules from electron microscopy. Journal of Structural Biology. 2009;166:67–78. doi: 10.1016/j.jsb.2008.12.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sorzano C., Marabini R., Velázquez-Muriel J., Bilbao-Castro J., Scheres S., Carazo J., Pascual-Montano A. XMIPP: a new generation of an open-source image processing package for electron microscopy. Journal of Structural Biology. 2004;148:194–204. doi: 10.1016/j.jsb.2004.06.006. [DOI] [PubMed] [Google Scholar]
- Sorzano, C.O.S., Vargas, J., Otón, J., Abrishami, V., de la Rosa-Trev⧹)⧹́)∈, J.M., Gómez-Blanco, J., Vilas, J.L., Marabini, R., Carazo, J.M., 2017. A review of resolution measures and related aspects in 3D Electron Microscopy. Progress in Biophysics and Molecular Biology 124, 1–30. [DOI] [PubMed]
- Subramaniam S. Structure of trimeric HIV-1 envelope glycoproteins. Proceedings of the National Academy of Sciences. 2013;110:E4172–E4174. doi: 10.1073/pnas.1313802110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang G., Peng L., Baldwin P.R., Mann D.S., Jiang W., Rees I., Ludtke S.J. EMAN2: An extensible image processing suite for electron microscopy. Journal of Structural Biology. 2007;157:38–46. doi: 10.1016/j.jsb.2006.05.009. [DOI] [PubMed] [Google Scholar]
- Van Heel M., Schatz M. Fourier shell correlation threshold criteria. Journal of Structural Biology. 2005;151:250–262. doi: 10.1016/j.jsb.2005.05.009. [DOI] [PubMed] [Google Scholar]
- Vargas J., Otón J., Marabini R., Carazo J.M., Sorzano C.O.S. Particle alignment reliability in single particle electron cryomicroscopy: a general approach. Scientific Reports. 2016;6:21626. doi: 10.1038/srep21626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vargas J., Melero R., Gómez-Blanco J., Carazo J.M., Sorzano C.O.S. Quantitative analysis of 3D alignment quality: its impact on soft-validation, particle pruning and homogeneity analysis. Scientific Reports. 2017;7:6307. doi: 10.1038/s41598-017-06526-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu S., Armache J.P., Cheng Y. Single-particle cryo-EM data acquisition by using direct electron detection camera. Microscopy. 2016;65:35–41. doi: 10.1093/jmicro/dfv355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yershova A., Jain S., LaValle S.M., Mitchell J.C. Generating uniform incremental grids on SO(3) using the Hopf fibration. International Journal of Robotics Research. 2010;29:801–812. doi: 10.1177/0278364909352700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhong, E.D., Bepler, T., Davis, J.H., Berger, B., 2019. Reconstructing continuously heterogeneous structures from single particle cryo-EM with deep generative models. https://arxiv.org/abs/1909.05215.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The BioEM code is available at https://github.com/bio-phys/BioEM. A tutorial to perform the cross-validation protocol is available at: https://github.com/bio-phys/BioEM-tutorials.





