Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2022 May 17;12:8162. doi: 10.1038/s41598-022-11382-7

Helical Indexing in Real Space

Chen Sun 1, Brenda Gonzalez 1, Wen Jiang 1,
PMCID: PMC9114412  PMID: 35581231

Abstract

Biological structures with helical symmetries of distinct twist, rise, and axial symmetry are abundant and span a wide range of organisms and functions. Performing de novo helical indexing remains challenging because of the steep learning curve involved in Fourier space layer lines. The unknown amount of out-of-plane tilt and the existence of multiple conformations of the helices further complicate indexing. In this work, we introduce a real-space indexing method that leverages the prior knowledge of the tilt and in-plane angles of the helical filaments/tubes, robust ab initio 3D reconstruction capabilities in single particle cryo-EM to obtain asymmetric reconstructions, and automatic indexing of helical parameters directly from the asymmetric density maps. We validated this approach using data from multiple helical structures of distinct helical symmetries, diameters, flexibility, data qualities, and heterogeneous states. The fully automated tool we introduce for real space indexing, HI3D, uses the 2D lattice in the autocorrelation of the cylindrical projection of a 3D density map to identify the helical symmetry. HI3D can often successfully determine the helical parameters of a suboptimal 3D density map, including ab initio single particle asymmetric reconstructions and sub-tomogram averages, with intermediate evidence that can also help assess the map quality. Furthermore, this open-source HI3D is usable independently as a Web application that can be accessed free of installation. With these methods, de novo helical indexing will be significantly more accessible to researchers investigating structures of helical filaments/tubes using cryo-EM.

Subject terms: Structural biology, Electron microscopy, Cryoelectron microscopy

Introduction

Nature has adopted helical arrangements for many important biological structures, from viruses to cellular components. Notable examples include double-stranded DNA, actin, microtubule, bacterial flagella and pili, filamentous viruses such as TMV and Ebola, and phage tails. In recent years, increasing evidence demonstrates that many cellular proteins undergo conformational changes to form helical filaments either in a regulated process, such as human acetyl-CoA carboxylase1, or in pathological conditions, such as distinct Tau protein filaments in Alzheimer’s diseases and other neurodegenerative diseases24. Helical structures were among the initial targets of 3-D electron microscopy in 19685 and many helical structures have been solved to near-atomic resolutions since 2003 using the classic Fourier-Bessel method6, and Iterative Helical Real Space Refinement (IHRSR) method7. Currently, RELION8 is the most frequently used program for helical reconstruction.

A helix is a 1D crystallographic arrangement of asymmetrical units that can be described in rise and twist. Rise is the distance along the helical axis between the two neighboring asymmetric units. The twist is defined as the rotation angle around the helical axis between the neighboring asymmetric units. The helical symmetry parameters are traditionally determined by indexing the layer line patterns in the 2D power spectra of helical particle images, using the same principle for indexing as was used for the “Photo 51”, the X-ray diffraction of dsDNA9 that led to the discovery of the double-helix structure. In this method, the correct orders of Fourier-Bessel functions are manually assigned to the layer lines, which is not only mathematically difficult for many users to master but also intrinsically demanding on the quality of the data. A critical step of Fourier layer line based indexing is identifying and extracting a long, straight, and well-ordered helical filament/tube in the micrograph to produce clearly defined layer lines in Fourier space. However, helical structures with a high degree of curvature and heterogeneous conformations produce poorly defined layer lines, making Fourier Bessell indexing impossible. Only when clear helical layer lines are seen in Fourier space, the helical twist and rise can be derived, and the final structure can be refined with the helical symmetry estimation. However, helical indexing could still be ambiguous and error-prone even when strong layer lines are available, as the unknown amount of out-of-plane tilt further complicates the layer line pattern10. For these reasons, incorrect helical indexing has been found in several reports10,11. The IHRSR method was developed7 to overcome the challenges of the classic Fourier-Bessel reconstruction method with flexible, curved helical structures by treating a helical filament/tube as a series of overlapping segments (i.e. single particles) of which the orientation/center parameters could be individually refined. Although this method could also automatically refine the helical parameters around an initial value, it could only converge to the correct helical parameters if the initial rise/twist values were close to the correct values. In reality, it is hard to find an accurate initial value, especially when dealing with a new system without having any prior knowledge of it.

Herein, we introduce a new method to determine helical symmetry in real-space. Using existing single-particle reconstruction (SPR) image processing tools, we have devised a strategy to obtain 3D asymmetric reconstructions of helices without knowledge of helical parameters to estimate the helical symmetry in real space using HI3D (Helical Indexing using the cylindrical projection of a 3D map). This method is based on 2D lattice indexing using the auto-correlation function (ACF) of the cylindrical projection of a 3D map. We have demonstrated that this approach can yield accurate helical symmetry parameters for multiple experimental datasets. The samples of these datasets vary in function, size, symmetry, flexibility, heterogeneity, and data quality. The easy access of the HI3D method as a Web app will also help one better understand helical symmetry, how helical symmetry and its underlying 2D crystal lattice are related. We believe this method is easier to understand and use than the Fourier layer line method, and it is openly accessible online, significantly reducing the difficulty in determining de novo helical structures.

Results

Real space helical symmetry estimation with HI3D web app

HI3D was developed to overcome the above-mentioned issues with Fourier layer line based helical indexing. It only requires an input 3D map to automatically output the helical rise, twist, and axial symmetry parameters. The internal configuration of HI3D is described in Fig. 1. Since it was designed for 3D maps reconstructed without symmetry that is arbitrarily positioned/oriented (i.e. not parallel to the Z-axis and not centered in the XY plane) (Fig. 1a), HI3D will first find the center of the density map and shift it to the center of the box and vertically align it along the Z-axis (Fig. 1b). Then, the cylindrical projection of the 3D map is obtained by resampling the map in cylindrical coordinates and summing along the radial direction (Fig. 1c). This cylindrical projection is equivalent to unwrapping a helical structure into its corresponding 2D crystal. If the input asymmetric reconstruction has clear helical structure features, the cylindrical projection appears as a well-ordered 2D crystal (Fig. 1d). Conversely, the cylindrical projection of an asymmetric reconstruction without helical structure features would not produce a 2D lattice organization. Thus, the apparent crystalline order of the cylindrical projection is a useful and intuitive diagnostic of the presence and quality of the helical structure features in the asymmetric reconstruction.

Figure 1.

Figure 1

HI3D workflow. The input asymmetric density map (a) that is arbitrarily positioned and orientated is automatically centered and vertically aligned (b). The aligned 3D map is resampled in the cylindrical coordinate to generate the cylindrical projection (c) to mathematically convert a helical structure in the original 3D map into a “2D crystal” image. The auto-correlation function of the cylindrical projection would generate a 2D lattice (d) that visually resembles the diffraction spots of a crystal. The unit cell vector (red arrow in d) with the shortest distance to the equator would correspond to the helical twist (x-coordinate) and rise (y-coordinate).

HI3D has also implemented an automated 2D lattice indexing function to not only eliminate the need for interactive indexing, but also improve the accuracy of helical twist and rise estimates by least-square fitting of all “spots”. The automated 2D lattice indexing was achieved by two different methods. The first is to treat the detected “diffraction spots” on a generic 2D lattice without considering its origin from the helical lattice and to find the two unit cell vectors that best fit the “spot” positions. The unit cell vector with the smallest distance to the equator will be chosen to report the helical twist (x-coordinate) and rise (y-coordinate). The second method considers the 2D lattice as a thin slab of lattice unwrapped from the helical lattice and identifies the rise and twist in sequential order. First, it sorts the Y-coordinates of all “spots”, calculates the distance between neighbor values, and takes the most frequent, non-zero spacing as the helical rise. Second, it sorts the X-coordinates of the “spots” in neighboring rows (i.e. with Y-coordinates separated by helical rise), calculates the spacings between neighbor values, and takes the most frequent, non-zero value as the helical twist. To ensure the resulting helical twist/rise parameters are reliable, HI3D uses both methods to generate the rise and twist and requires that the difference between the results of the two methods is less than a threshold, for example, 1 Å in rise and 1 in twist. When the density map has clear helical structure features, the two methods reliably generate consistent helical twist and rise parameters. However, the twist and rise values from these two methods tend to diverge when the quality of the helical structure becomes worse. HI3D can alert the user about inconsistency among helical parameter estimates. A major challenge for a poor “lattice” is that many false peaks can be erroneously picked. HI3D has built-in heuristics to handle false peaks and is reasonably resistant to the false peaks. In both methods, the axial-symmetry of the helix can be calculated as the number of equally spaced peaks on the equator or equivalently as Csym=360/twist if Csym is an integer. To further improve the accuracy of the helical twist and rise parameters, a local optimization of both rise and twist parameters is performed to sub-pixel accuracy by maximizing the sum of ACF values at the “diffraction spots” positions expected by the helical rise/twist.

HI3D has been set up as a Web app (Fig. 2) which is convenient to access using an internet browser, without the need for installation. Due to the resource (i.e. memory quota) limitation of the hosting services, the largest map is currently limited to about 5003 voxels that is adequate for most cases. Larger maps can be analyzed by the HI3D web app running on a local computer (Linux, Mac, or Windows) following a simple procedure. In addition to UI for data intake (Fig. 2a) and displaying the output helical twist/rise parameters (Fig. 2b), its Web UI also displays the intermediate data, such as the X/Y/Z section views, cylindrical projection, ACF with detected “spots” (Fig. 2c), and the vector representing the twist/rise parameters (Fig. 2b) for diagnostic purposes. These intermediate data clearly relate the helical lattice to the corresponding 2D lattice and how the two lattices could be interconverted through a wrapping (2D helical lattice) or unwrapping (helical 2D lattice) process. Thus, HI3D can also serve as an educational tool to illustrate the basic concepts of helical structures and the formation of helical structures via wrapping a thin slab of 2D crystals. An HI3D session can be bookmarked/shared/reproduced with the exact parameters via its URL in the browser address bar or a built-in QR display of the complete URL. The HI3D web site includes a gallery of HI3D examples with clickable URLs for users to reproduce the indexing of a wide range of structures from helical reconstructions in EMDB spanning a wide range of resolutions (2-35 Å), subtomogram averages in EMDB, to ab initio asymmetric reconstructions included in this work.

Figure 2.

Figure 2

HI3D Web app user interface. The user interface of HI3D consists of three major parts. (a) The left part shows the input panel and the X/Y/Z section of the input map. Users can input a map in three ways. The first one is to upload a map from the local directory. The second is to provide the URL, for example, of a cryoSPARC output map. The third is dedicated to the helical structures in EMDB by either entering an EMDB ID or randomly choosing a helical structure in EMDB. (b) The central panel shows the output twist, rise, and C-sym values in text and as a vector centered in the ACF image. The right panel (c) shows the cylindrical projection (top), ACF of the projection image (middle), and input fields (bottom) for the user to overwrite the rise and twist if the automated detection fails.

Case studies of helical structures deposited in EMDB

To facilitate the validation of HI3D against a wide range of helical structures, HI3D includes an input mode to retrieve EMDB-deposited helical structures (Fig. 3). A user can either input a specific EMDB ID or HI3D can randomly select one from all currently available helical structures in EMDB. HI3D keeps an up-to-date list of all helical structures in EMDB by automatically querying EMDB for the complete list of all helical structures. HI3D performs well in tests with the EMDB helical structures as shown in the three examples of distinct cases, EMD-30129 (Helical stem of the cleaved double-headed nucleocapsids of Sendai virus12, 2.9 Å ), EMD-1759 (Bacterial tubulin homologue TubZ13, 35 Å ), and EMD-23890 (Tau straight filament extracted from PrP-CAA patient brain tissue14, 3.1Å) (Fig. 3), with the HI3D-reported rise and twist values almost identical to the published values despite the wide range of resolutions (2.9-35 Å) and the drastically different qualities of the lattice in the ACF images of these structures. Therefore, HI3D is a robust reporter of helical parameters for EMDB maps. Since the helical rise and twist parameters are not consistently reported across different EMDB entries or even for the same EMDB entry across the EMDB-mirroring sites, HI3D can be useful as a validation tool to complement EMDB. The convenience provided by HI3D to quickly examine many helical structures in EMDB and obtain their helical parameters also makes it a useful educational tool for illustrating helical structure and symmetry.

Figure 3.

Figure 3

HI3D results for three helical structures in EMDB. (a) EMD-30129 (Helical stem of the cleaved double-headed nucleocapsids of Sendai virus, 2.9 Å); (b) EMD-1759 (Bacterial tubulin homologue TubZ, 35 Å, reconstructed from negative stain EM images); (c) EMD-23890 (Tau straight helical filament extracted from PrP-CAA patient brain tissue, 3.1 Å). The published rise and twist for these three datasets are (4.09 Å, -27.58), (42 Å, 21), and (4.79 Å, -1.07), respectively.

Case studies of subtomogram averages

As in situ structural biology using cryo-electron tomography of cell sections is increasingly being used to study native structures without being isolated from their cellular context, many subtomogram averages of helical structures have been deposited into EMDB. Since helical symmetry was not imposed on these subtomogram averages, we tested if HI3D could successfully index their helical symmetry parameters. As shown in Fig. 4, HI3D performs well in tests with the three examples, EMD-12289 (in situ actomyosin15, 10.2 Å) , EMD-12293 (in situ I-band thin filament including troponin complex15, 19.8 Å) , and EMD-8601 (in situ type VI secretion system16, 24 Å) despite the different map resolutions and qualities of the lattice in the ACF images of these structures. The results indicate that HI3D is a useful structural analysis tool for many in situ structural studies of cell sections involving helical structural components.

Figure 4.

Figure 4

HI3D results for three subtomogram averages in EMDB. (a) EMD-12289 (in situ actomyosin, 10.2 Å); (b) EMD-12293 (in situ I-band thin filament including troponin complex, 19.8 Å); (c) EMD-8601 (in situ type VI secretion system, 24 Å).

Case studies of ab initio asymmetric reconstructions

The ultimate goal of HI3D is to facilitate de novo determination of helical structures using single particle cryo-EM without the need for prior knowledge of the helical rise/twist parameters that are typically obtained from Fourier layer line-based indexing, a complicated, error-prone approach. To gauge its feasibility, we tested multiple experimental datasets spanning a wide range of helical symmetries, diameters, flexibility, data qualities, and heterogeneous states. In these tests, we used asymmetric reconstruction without imposing any point or helical symmetry, but took advantage of single particle reconstruction methods and constrained the orientation search for two of the three Euler angles to a small range around expected values (see Methods). Since the resulting 3D density maps were not expected to have an ideal helical structure, these tests would gauge the robustness of HI3D in its reporting of helical symmetry parameters.

  • TMV (Figure 5)

    We chose the tobacco mosaic virus (TMV) as our first test case for it is the first structure reconstructed with 3DEM5, and has been often studied as a model system throughout the years. It was also used as a robust test sample for a new algorithm in subtomogram averaging17. The TMV capsid is primarily composed of a 17 kDa coat protein (CP) that is organized into a helix. In Figure 5, the ab initio reconstructed asymmetric map generated with Euler angle constraints already displays detailed structure features. After uploading this map to HI3D, the cylindrical projection and ACF are automatically generated. Although the cylindrical projection visually displays an obvious 2D crystalline pattern, it is also obvious that the “crystalline” pattern has long range disorder (e.g. brightness variation), which explains the much weaker spots in the outer region of the ACF and also indicates that the asymmetric reconstruction was still suboptimal. Despite the suboptimal quality of this asymmetric reconstruction, HI3D reports twist and rise values close to the published values (Table 1). Of note, although we have downsampled the segment images eight times resulting in a pixel size (2.552 Å), which is much larger than the rise (1.408 Å), the error of HI3D output rise value is only 0.16 Å or is 6% of the pixel size. The HI3D-determined twist has an opposite sign due to the wrong handedness of the asymmetric map. Single particle reconstructions are intrinsically equivalent for both handedness and the correct handedness can only be determined with additional information, such as tilt experiments or the atomic structure details.

  • MAVS-CARD (Figure 6)

    We selected the MAVS CARD EMPIAR dataset 1003119 as our second case for it has been historically assigned wrong helical twist, rise, and C-symmetry10. With our method, the helical parameters can be easily determined without any ambiguity as shown by the 2D lattice of “spots” in the ACF image (Fig. 6). After ab initio asymmetric reconstruction, two of the three output classes have outputted correct 2D lattice, while the other one is obvious junk classes (surface view of all three classes are shown in Supplementary Figure 1). Figure 6 are the results of the dominant class. While the cylindrical projection clearly displays a 2D crystalline pattern, differences in the “unit cells” were also apparent. Despite the suboptimal asymmetric reconstruction, HI3D could output rise (5.04 Å) and twist (-101.2) values that are nearly identical to the published values (rise=5.088 Å, twist=-101.436) (Table 1).

  • VipA/VipB (Figure 7)

    Our third test case, VipA/VipB, is a type VI secretion system analogous to helical tails of myophages. VipA/VipB is a relatively rigid helical tube with an outer diameter of about 300 Å. We used the Vibrio Cholerae VipA/VipB dataset (EMPIAR 10019)20. HI3D analysis of our ab initio asymmetric reconstruction estimated the rise and twist at 21.83 Å and -29.31, respectively (Fig. 7), which is nearly identical to the previously published helical rise (21.8 Å) and twist (29.4)20 except for the opposite sign of the twist. HI3D also correctly detected the intrinsic C6 symmetry around the helical axis although no symmetry was used for the asymmetric reconstruction.

  • HIV Tubes (Figs. 8 and 9)

    The data for our fourth test case, HIV tubes, provided by Dr. Peijun Zhang, was a more challenging dataset due to heterogeneity. HIV capsid protein is known to organize into a variety of structures, such as cones and tubes. Tube assemblies can also vary in diameter and helical symmetry. Dr. Peijun Zhang’s recent work revealed high resolution HIV tube structures with seven helical symmetries21. The dataset that we obtained was a small subset of the data containing only 39 tubes. After ab initio asymmetric reconstruction into three classes, two classes showed well resolved hexons, indicating that both structures were correct. The largest class contains 41.5% particles with HI3D-reported helical rise of 6.97 Å and twist -31.12 (Fig. 8). The second most populated class was composed of 32.8% particles with HI3D-reported helical rise of 6.4 Å and twist 28.62 (Fig. 9). The least populated 3rd class only had 25.7% particles and did not yield a clear structure. It is interesting to note that the cylindrical projection showed well resolved and poorly resolved angular/vertical stripes of hexons and slightly varying gaps between hexons, suggesting that these asymmetric reconstructions were still far from ideal. Nevertheless, the 2D lattice in the ACF was sufficiently clear to estimate the helical symmetry very close to the reported values (Table 1).

Figure 5.

Figure 5

The surface view of the asymmetric reconstruction (a), the cylindrical projection (b), and the 2D lattice (c) with the HI3D outputted helical rise and twist and C-sym of the TMV dataset (EMPIAR-1022). The surface view was generated with UCSF Chimera18.

Table 1.

Comparison of HI3D reported helical parameter with the published values.

Dataset EMPIAR EMDB Symmetry # of particles Box size Pixel Size (Å)
TMV 10305 10129 C1 21,744 128 2.552
MAVS CARD 10031 6428 C1 118,301 150 2.1
VipA/VipB 10019 2699 C6 8,899 256 2
HIV tubes N/A 10239 C1 2,296 300 2.44
10246 C1 1,814 300 2.44
Dataset Published Rise (Å) Published Twist () HI3D Rise (Å) HI3D Twist () Reference
TMV 1.406 22.038 1.25 -19.69 27
MAVS CARD 5.088 -101.436 5.04 -101.2 19
VipA/VipB 21.8 29.4 21.83 -29.31 20
HIV tubes 6.95 -31.13 6.97 -31.12 21
6.44 -28.68 6.4 28.62

Figure 6.

Figure 6

The surface view of the asymmetric reconstruction (a), the cylindrical projection (b), and the 2D lattice (c) with the HI3D outputted helical rise and twist and C-sym of the MAVS CARD dataset (EMPIAR-10031). The surface view was generated with UCSF Chimera18.

Figure 7.

Figure 7

The surface view of the asymmetric reconstruction (a), the cylindrical projection (b), and the 2D lattice (c) with the HI3D outputted helical rise and twist and C-sym of the VipA/VipB dataset (EMPIAR-10029). The surface view was generated with UCSF Chimera18.

Figure 8.

Figure 8

The surface view of the asymmetric reconstruction (a), the cylindrical projection (b), and the 2D lattice (c) with the HI3D outputted helical rise and twist and C-sym of the major class of the HIV capsid protein dataset (provided by Dr. Peijun Zhang). The surface view was generated with UCSF Chimera18.

Figure 9.

Figure 9

The surface view of the asymmetric reconstruction (a), the cylindrical projection (b), and the 2D lattice (c) with the HI3D outputted helical rise and twist and C-sym of the minor class of the HIV capsid protein dataset (provided by Dr. Peijun Zhang). The surface view was generated with UCSF Chimera18.

Discussion

The work we present here introduces a unique method, HI3D, to analyze helical symmetry in real space using asymmetric reconstructions. It offers an alternative approach to overcome the challenges associated with Fourier layer line indexing and/or confirm previously assigned helical symmetry estimations obtained from other methods. To demonstrate its robustness, we have tested it with four experimental datasets varying in helical diameter, symmetry, flexibility, and heterogeneity. From suboptimal asymmetric reconstructions, it has successfully estimated helical parameters that are accurate enough for the downstream helical refinement of all test cases (Figs. 59 and Supplementary Figs. S1S4).

As a Web app, HI3D is installation-free and convenient to use. It can be easily integrated into any existing image processing workflow. Its user-friendly interface improves access to new users. There is no stringent requirement on the input map resolution, as demonstrated by examples including the 24 Å subtomogram average of in situ type VI secretion system (EMD-8601, Fig. 4c) and the 35 Å EMD-1759 bacterial tubulin homolog TubZ reconstructed from negative stain images (Fig. 3b). Furthermore, the “crystalline order” of the cylindrical projections and sharpness of the “diffraction spots” in the ACF images could provide the users with an intuitive assessment of the quality of the input maps. It can also be used as a validation tool for maps deposited to EMDB for easy access to the helical symmetry information.

Compared with the helical symmetry search methods in the existing software, like RELION helical_toobox and cryoSPARC Helical Symmetry Search utility, HI3D demonstrated more robust performance, especially for asymmetric reconstructions with suboptimal helical structure features. In these software, the symmetry search is cross-correlation based, which relies on user input to define a relatively small search range for both helical twist and rise parameters. For unknown systems, it can be hard to guess an accurate search range. Therefore, they are not global symmetry search methods and perform poorly when provided with full range. In contrast, HI3D is designed to directly estimate the helical parameters globally by default. For bad quality maps, for example, with varying crystalline order in different stripes of the cylindrical projection, users can select to use only the clearly defined regions in the cylindrical projection to improve the 2D lattice in the ACF. From this aspect, HI3D has the advantage of lowering the quality threshold needed for helical indexing. Coupled with HI3D indexing, some suboptimal asymmetric maps, with partial disordered regions, can still be used to decipher the helical parameters as shown in our four examples (Figs. 5-9).

Despite all these advantages, like all other methods, there is also a shortcoming of HI3D. Different ab initio asymmetric reconstructions from the same dataset can yield different apparent helical symmetries, and none may be correct. Multiple ab initio reconstruction jobs are recommended to find a consistent solution (Supplementary Fig. S4). Based on our experience, for high quality datasets, whatever reconstruction software was used, there was a high probability for at least one class to converge to the correct structure. However, for low-quality or challenging datasets, it could be that none of the reconstructed maps is correct. Using VipA/VipB dataset as an example, three parallel runs of RELION/3.1 3D classification (k=3) with the same set of particles and parameters returned nine maps as shown in Supplementary Fig. S4. Among the three classes yielded by each run, two classes are obvious junk classes with very few particles assigned to them, while one dominant class consistently showed up in all the runs. After importing it to HI3D, a clear sharp 2D lattice was observed for this class and all the three maps in red boxes gave the same set of correct helical parameters for VipA/VipB. However, for datasets with low signal-to-noise ratios and more heterogeneity, it is possible that the ab initio reconstructions become stuck in a local minimum and couldn’t find the best solution in a single run. To be cautious, we would only suggest using the map that consistently appears in the independent runs for real space indexing with HI3D. HI3D in turn could help the user gauge the quality of the map by calculating the cylindrical projection and 2D ACF lattice of the reconstructed maps. Overall, helical samples remain one of the most challenging targets of cryoEM studies because of the complexity/ambiguity involved in helical symmetry estimation and heterogeneity. We hope this work can provide more resources for education, make structural research involving helical structures more accessible, and provide a new direction for helical indexing.

Methods

Test Datasets

Among the four test datasets, three datasets were downloaded from EMPIAR, and the HIV tube dataset was kindly provided by Dr. Peijun Zhang (Table 1). For the TMV dataset, segments were extracted from the micrographs with the provided coordinate file. For the MAVS CARD dataset, segments were picked with cryoSPARC’s reference-free filament tracer22. The reference-free 2D classification was performed for all extracted segments. All good quality 2D classes were selected. For the VipA/VipB dataset, segments were first picked with the template-free cryoSPARC filament tracer. Good looking 2D classes were selected for template-guided filament tracer. Then, 2D classification was performed in cryoSPARC and good quality 2D classes were selected. For the HIV tube dataset, all filaments were manually picked and extracted with RELION/3.123,24. As above, cryoSPARC was used for reference-free 2D classification. For all datasets, the selected, good segments were subject to ab initio asymmetric reconstruction using RELION/3.1. The resulting maps were uploaded to HI3D for automated helical indexing.

Ab initio asymmetric reconstruction with constrained Euler angles

After extracting helical segments and using 2D classification to remove the poor-quality segments, the Euler angle (ϕ, ψ, θ) search–constrained ab initio asymmetric reconstruction was performed with RELION/3.1 using Class3D specifying multiple classes (typically k=3) with a bare cylinder as the initial model (Supplementary Figs. S1S3). The tilt angle search range was constrained to 90±15 given the prior knowledge that the helical axis should be nearly parallel to the image plane. The in-plane rotation angle psi is also constrained (±5) around the angle derived from filament selection. The asymmetric reconstructions were generated without imposing any point or helical symmetry. These reconstructed maps were then uploaded to HI3D to determine the helical parameters.

Implementation and availability of HI3D Web app

We have used streamlit25, which is open-source software for the development of Python-based Web apps, to implement the HI3D Web app. The hosted HI3D app and the source code are freely accessible from the authors’ Web site (http://jiang.bio.purdue.edu/HI3D). The trackpy26 library was used for automated detection of peaks (i.e. “diffraction spots”) in the ACF image of the cylindrical projection.

Statement of Ethics

In this work, we have not conducted any experiments on live vertebrates and/or higher invertebrates. No clinical data has been used. The HIV tube dataset based on HIV capsid proteins expressed in E. coli was kindly provided by Dr. Peijun Zhang.

Supplementary Information

Acknowledgements

This work was supported in part by grants from NIGMS U24GM116789, NIAID R01AI111095, NINDS U01NS110437, NIA RF1AG071177 to W.J. and the NIH T32 GM132024 Molecular Biophysics Training Program at Purdue (https://molbiophys.science.purdue.edu) to B.G. We thank Dr. Peijun Zhang for providing the HIV tube dataset as a test for HI3D. We thank the Purdue Rosen Center for Advanced Computing (RCAC) (https://www.rcac.purdue.edu) for providing the computing resources for the development and application of HI3D. Molecular graphics and analyses performed with UCSF Chimera, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from NIH P41-GM103311.

Author contributions

Conceptualization, method development, writing and supervision, W.J.; Data analysis, C.S. and B.G.; Writing—original draft and data visualization, C.S., B.G. and W.J.. All authors reviewed the manuscript.

Data availability

The ab initio maps generated in the current study are available in the Zenodo repository, (https://zenodo.org/record/6366819#.YjQKoi-B2J8).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-022-11382-7.

References

  • 1.Lynch EM, Kollman JM, Webb BA. Filament formation by metabolic enzymes-a new twist on regulation. Curr. Opin. Cell Biol. 2020;66:28–33. doi: 10.1016/j.ceb.2020.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Berriman J, et al. Tau filaments from human brain and from in vitro assembly of recombinant protein show cross-beta structure. Proc. Natl. Acad. Sci. USA. 2003;100:9034–9038. doi: 10.1073/pnas.1530287100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kidd, M. Paired helical filaments in electron microscopy of alzheimer’s disease (1963). [DOI] [PubMed]
  • 4.Terry, R. D. The fine structure of neurofibrillary tangles in Alzheimer’s disease (1963). [DOI] [PubMed]
  • 5.De Rosier, D. J. & Klug, A. Reconstruction of three dimensional structures from electron micrographs (1968). [DOI] [PubMed]
  • 6.Yonekura K, Maki-Yonekura S, Namba K. Complete atomic model of the bacterial flagellar filament by electron cryomicroscopy. Nature. 2003;424:643–650. doi: 10.1038/nature01830. [DOI] [PubMed] [Google Scholar]
  • 7.Egelman EH. A robust algorithm for the reconstruction of helical filaments using single-particle methods. Ultramicroscopy. 2000;85:225–234. doi: 10.1016/S0304-3991(00)00062-0. [DOI] [PubMed] [Google Scholar]
  • 8.He S, Scheres SHW. Helical reconstruction in RELION. J. Struct. Biol. 2017;198:163–176. doi: 10.1016/j.jsb.2017.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Franklin RE, Gosling RG. Molecular configuration in sodium thymonucleate. Nature. 1953;171:740–741. doi: 10.1038/171740a0. [DOI] [PubMed] [Google Scholar]
  • 10.Egelman, E. H. Ambiguities in helical reconstruction. (2014). [DOI] [PMC free article] [PubMed]
  • 11.Egelman EH, Wang F. Cryo-EM is a powerful tool, but helical applications can have pitfalls. Soft Matter. 2021;17:3291–3293. doi: 10.1039/D1SM00282A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zhang N, et al. Structure and assembly of double-headed sendai virus nucleocapsids. Commun. Biol. 2021;4:494. doi: 10.1038/s42003-021-02027-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Aylett CHS, Wang Q, Michie KA, Amos LA, Löwe J. Filament structure of bacterial tubulin homologue TubZ. Proc. Natl. Acad. Sci. USA. 2010;107:19766–19771. doi: 10.1073/pnas.1010176107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hallinan GI, et al. Structure of tau filaments in prion protein amyloidoses. Acta Neuropathol. 2021;142:227–241. doi: 10.1007/s00401-021-02336-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Wang Z, et al. The molecular basis for sarcomere organization in vertebrate skeletal muscle. Cell. 2021;184:2135–2150.e13. doi: 10.1016/j.cell.2021.02.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Chang, Y.-W., Rettberg, L. A. & Jensen, G. J. In vivo structures of an intact type VI secretion system revealed by electron cryotomography (2017). [DOI] [PMC free article] [PubMed]
  • 17.Sanchez RM, Zhang Y, Chen W, Dietrich L, Kudryashev M. Subnanometer-resolution structure determination in situ by hybrid subtomogram averaging - single particle cryo-EM. Nat. Commun. 2020;11:3709. doi: 10.1038/s41467-020-17466-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Pettersen EF, et al. Ucsf chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
  • 19.Xu, H. et al. Correction: Structural basis for the prion-like MAVS filaments in antiviral innate immunity (2015). [DOI] [PMC free article] [PubMed]
  • 20.Kudryashev, M. et al. Structure of the type VI secretion system contractile sheath (2015). [DOI] [PMC free article] [PubMed]
  • 21.Ni T, et al. Intrinsic curvature of the HIV-1 CA hexamer underlies capsid topology and interaction with cyclophilin a. Nat. Struct. Mol. Biol. 2020;27:855–862. doi: 10.1038/s41594-020-0467-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Punjani A, Rubinstein JL, Fleet DJ, Brubaker MA. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods. 2017;14:290–296. doi: 10.1038/nmeth.4169. [DOI] [PubMed] [Google Scholar]
  • 23.Scheres SHW. RELION: implementation of a bayesian approach to cryo-EM structure determination. J. Struct. Biol. 2012;180:519–530. doi: 10.1016/j.jsb.2012.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife7 (2018). [DOI] [PMC free article] [PubMed]
  • 25.streamlit. GitHub - streamlit/streamlit: Streamlit — the fastest way to build data apps in python. https://github.com/streamlit/streamlit. Accessed: 2021-8-26.
  • 26.Soft-matter. GitHub - soft-matter/trackpy: Python particle tracking toolkit. https://github.com/soft-matter/trackpy. Accessed: 2021-8-26.
  • 27.Weis F, Beckers M, von der Hocht I, Sachse C. Elucidation of the viral disassembly switch of tobacco mosaic virus. EMBO Rep. 2019;20:e48451. doi: 10.15252/embr.201948451. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The ab initio maps generated in the current study are available in the Zenodo repository, (https://zenodo.org/record/6366819#.YjQKoi-B2J8).


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES