Abstract
Viruses infecting hyperthermophilic archaea represent one of the most enigmatic parts of the global virome, with viruses from different families showing no genomic relatedness to each other or to viruses of bacteria and eukaryotes. Tristromaviruses, which build enveloped filamentous virions and infect hyperthermophilic neutrophiles of the order Thermoproteales, represent one such enigmatic virus families. They do not share genes with viruses from other families and have been believed to represent an evolutionarily independent virus lineage. A cryo-electron microscopic reconstruction of the tristromavirus Pyrobaculum filamentous virus 2 at 3.4 Å resolution shows that the virion is constructed from two paralogous major capsid proteins (MCP) which transform the linear dsDNA genome of the virus into A-form by tightly wrapping around it. Unexpectedly, the two MCP are homologous to the capsid proteins of other filamentous archaeal viruses, uncovering a deep evolutionary relationship within the archaeal virosphere.
Keywords: cryo-EM, hyperthermophilic archaea, major capsid proteins, virus evolution, virus structure
1. Introduction
Systematic comparison of sequences and structures of the major (nucleo)capsid proteins (MCP) from across the virosphere has shown that viruses from ∼80 per cent of known virus families can be categorized into seventeen architectural classes represented by unique MCP folds (Krupovic and Koonin 2017). However, the remaining ∼20 per cent of virus groups encode MCPs for which the structural fold is not known and could not be predicted using state-of-the-art sequence analyses, representing the ‘unknown’ of the virosphere and potentially concealing novel architectural solutions for virion organization. Notably, ∼33 per cent of this structural ‘dark matter’ corresponds to viruses infecting archaea. Filamentous archaeal viruses, all infecting hyperthermophilic hosts growing optimally at 80°–90 °C, have been classified into four families: Clavaviridae, Rudiviridae, Lipothrixviridae and Tristromaviridae (Prangishvili et al. 2017). High-resolution structures are available for representatives of the first three virus families and display MCP folds unprecedented among bacterial and eukaryotic viruses (Krupovic and Koonin 2017; Hartman et al. 2019). Clavavirus APBV1 infects Aeropyrum pernix (marine hyperthermophile; order Desulfurococcales) and encodes one of the simplest known MCPs, folded as two α-helices connected by a β-hairpin. Approximately 1,000 copies of the MCP form a hollow cylindrical virion into which the circular dsDNA genome is packed (Ptchelkine et al. 2017). In contrast, virions of rudiviruses (DiMaio et al. 2015) and lipothrixviruses (Kasson et al. 2017; Liu et al. 2018), which infect members of the order Sulfolobales (acidophilic hyperthermophiles), are formed by condensation of a linear dsDNA genome by homodimeric (in rudiviruses) or heterodimeric paralogous (in lipothrixviruses) MCPs displaying a C-terminal four-helix bundle fold with an extended N-terminal α-helical arm. The α-helical arm wraps tightly around the DNA, transforming the dsDNA into the A-form. Whereas virions of rudiviruses are fairly rigid and non-enveloped, those of lipothrixviruses are more flexible and covered with a lipid envelope containing other viral proteins. Despite these differences, rudiviruses and lipothrixviruses share extensive gene content and are thus unified into an order Ligamenvirales (Prangishvili and Krupovic 2012). Tristromaviruses infect neutrophilic hyperthermophiles of the order Thermoproteales. Although the high-resolution structure of tristromaviruses is not known, biochemical studies have shown that virions are constructed from three major structural proteins, with MCP1 and MCP2 forming a helical nucleocapsid, and VP3 being associated with an external lipid membrane (Rensen et al. 2016). Due to lack of detectable protein and nucleotide sequence similarity between tristromaviruses and those of other known viruses, tristromaviruses were suggested to represent an evolutionarily independent virus lineage (Iranzo et al. 2016). To study the virion organization of tristromaviruses, we have determined the structure of the Pyrobaculum filamentous virus 2 (PFV2) at 3.4 Å resolution.
2. Materials and methods
2.1 PFV2 production and purification
For virus production, 150-ml culture of exponentially growing Pyrobaculum arsenaticum 2GA cells (Rensen et al. 2016) was infected with a fresh preparation of PFV2 (Baquero et al. 2020) at a multiplicity of infection of ∼0.1–1. The infected culture was incubated at 90 °C for 4 days without shaking. Following the removal of cells (7,000 rpm, 20 min, Sorvall 1500 rotor), the virions were collected and concentrated by ultracentrifugation (40,000 rpm, 2 h, 10 °C, Beckman 126 SW41 rotor). The concentrated virus particles were resuspended in buffer A (Quemin et al. 2015): 20 mM KH2PO4, 250 mM NaCl, 2.14 mM MgCl2, 0.43 mM Ca(NO3)2 and <0.001% trace elements of Sulfolobales medium, pH 6.
2.2 Cryo-electron micrograph image analysis and model building
The PFV2 sample (4.5 μl) was applied to discharged lacey carbon grids and plunge frozen using a Vitrobot Mark IV (FEI). Frozen grids were imaged in a Titan Krios at 300 keV and recorded with a Falcon III camera at 1.4 Å per pixel. Micrographs were collected using a defocus range of 1.25–2.25 μm, with a total exposure dose of 55 electrons/Å2 distributed into twenty-four fractions. To get an initial helical reconstruction volume, all of the micrographs were motion corrected using MotionCorr version 2.1 (Li et al. 2013), then used for contrast transfer function (CTF) estimation by the CTFFIND3 program (Mindell and Grigorieff 2003). After the images were corrected for the CTF through multiplication by the theoretical CTF, filament images amounting to ∼20 electrons/Å2 were extracted from dose-weighted fractions using the e2helixboxer program within EMAN2 (Tang et al. 2007). A small subset containing 30,000 overlapping 384-pixel-long segments (with a shift of ten pixels between adjacent subunits, ∼4 times the axial rise per subunit) was used to determine the helical symmetry in SPIDER (Shaikh et al. 2008), using IHRSR (Egelman 2000) after searching through a number of possible symmetries by trial and error. A ∼4.5 Å reconstruction was generated from this small subset, and this volume was subsequently filtered to 8 Å as the starting reference used in RELION (Zivanov et al. 2018). The micrographs and box coordinates of the full dataset were imported into RELION. After Refine3D, CTF refinement and Bayesian polishing, the final volume was estimated to have a resolution of 3.4 Å based on the map: map FSC, model: map FSC and d99 (Afonine et al. 2018) and was sharpened with a negative B factor of −155 Å2.
First, the density corresponding to a single MCP1 or MCP2 was segmented from the experimental filament density using Chimera (Pettersen et al. 2004). The full-length MCP1/MCP2 protein was built de novo into the segmented map using Rosetta CM (Wang et al. 2015), then adjusted manually in Coot (Emsley and Cowtan 2004) and real space refined in PHENIX (Afonine et al. 2018). Then EM density corresponding to A-DNA was segmented in Chimera, and A-DNA was manually put in the map and refined in PHENIX. Finally, the refined MCP1 and MCP2 single model were used to generate a filamentous model using the determined helical symmetry, and this filament model plus A-DNA were refined against the full cryo-electron micrograph (cryo-EM) map using PHENIX. MolProbity (Williams et al. 2018) was used to evaluate the quality of the filament model. The refinement statistics are shown in Table 1.
Table 1.
Parameter | PFV2 |
---|---|
Data collection and processing | |
Magnification | 59,000× |
Voltage (kV) | 300 |
Electron exposure (e− Å−2) | 20 |
Defocus range (μm) | −1.25 to −2.25 |
Pixel size (Å) | 1.4 |
Symmetry imposed | Helical (rise: 2.86 Å; rotation: 22.95°) |
Final particle images (n) | 186,576 |
Map resolution (Å) | |
‘Gold-standard’ map: map FSC (0.143) | 3.4 |
Model: map FSC (0.38, which is √0.143) | 3.4 |
d99 | 3.5 |
Refinement | |
Initial model used (PDB code) | NA |
Map-sharpening B factor (Å2) | −155 |
Root-mean-square deviations | |
Bond lengths (Å) | 0.008 |
Bond angles (°) | 0.828 |
Validation | |
MolProbity score | 1.37 |
Clash score | 4.0 |
Poor rotamers (%) | 0 |
Ramachandran plot | |
Favored (%) | 96.9 |
Allowed (%) | 3.1 |
Disallowed (%) | 0 |
The PFV2 model used has the following IDs: Electron Microscopy Data Bank: 21094; PDB: 6V7B.
3. Results and discussion
Cryo-electron micrographs (Fig. 1a) show the membrane-enveloped virions of PFV2 to be ∼340 Å in diameter and ∼5,000 Å in length. The helical symmetry was determined to be a rise of 2.86 Å and a rotation of 22.95° per asymmetric unit, where the asymmetric unit contained a protein heterodimer and 12 bp (base pairs) of A-form DNA. The DNA, almost completely covered by the heterodimers (Fig. 1b and c), tightly supercoils in a solenoidal fashion with one right-handed supercoil per 45.4 Å turn of the capsid helix. The native twist of the DNA is therefore 11.3 bp/turn (564 bp in forty-seven local turns plus three supercoil turns). Notably, Tristromaviridae are the first family of neutrophilic and hyperthermophilic archaeal viruses in which genomic DNA adopts the A-form (all other known archaeal viruses in which the DNA was found to be A-form are acidophiles (DiMaio et al. 2015; Kasson et al. 2017; Liu et al. 2018; Wang et al. 2019)). This observation suggests that A-form DNA is a general adaptation of these viruses to extreme temperatures rather than pH.
The membrane (Fig. 1b and d) is seen as a thicker outer component and a thinner inner one. We attribute the thickened outer portion to the presence of the additional viral protein VP3. In the closely related PFV1 (97–99% pairwise sequence identity between the three major structural proteins, and 98.9% nucleotide identity over 70% of their genomes) (Baquero et al. 2020), VP3 was shown to remain associated with the membrane fraction when the envelope was removed (Rensen et al. 2016). However, we failed to see any order associated with this outer layer. In the membrane-enveloped AFV1 (Kasson et al. 2017) and SFV1 (Liu et al. 2018), the distance between the inner and outer membrane density peaks is ∼12 Å, the basis for models of a thin monolayer built from horseshoe-shaped lipids (Kasson et al. 2017). In contrast, this spacing is ∼30 Å in PFV2 suggesting that the lipids are arranged in a similar manner as in the host membrane, perhaps stabilized by VP3.
There was no ambiguity in fitting the sequences of MCP1 and MCP2 into the density map given the resolution achieved (Fig. 1e). While a 3D superposition of MCP1 and MCP2 shows significant differences in the overall folds, a flexible secondary structure alignment (Ye and Godzik 2003) reveals a one-to-one mapping of local secondary structure elements (Fig. 1f) indicative of obvious homology. Perhaps more surprisingly, the PFV2 heterodimer is quite similar to the homodimer of rudivirus SIRV2 (DiMaio et al. 2015) and the heterodimer of lipothrixvirus AFV1 (Kasson et al. 2017) (Fig. 2a and b), especially in the C-terminal four-helix bundle fold, whereas the three viruses share no apparent similarity either at the nucleotide or protein sequence level (Fig. 2c). Notably, PFV2 achieves a similar coverage of A-DNA as SIRV2 and AFV1 by swapping the upper half of the MCP1 N-terminal helix-arm to the other side of the MCP2 helix-arm (Fig. 2a and b).
Our results uncover an evolutionary relationship among three families of hyperthermophilic archaeal viruses, Tristromaviridae, Rudiviridae and Lipothrixviridae, including the homology of the MCPs and the similar compaction of the genomes as A-form DNA by these MCPs. The fact that, besides the structurally related MCPs, tristromaviruses and ligamenviruses (i.e. rudiviruses and lipothrixviruses) do not encode recognizable orthologs emphasizes the remarkable plasticity of archaeal virus gene content and suggest that the two groups of viruses have diverged in a distant past, potentially during the split of Sulfolobales and Thermoproteales from their common ancestor. Due to the shared virion organization, we suggest unifying Tristromaviridae and the order Ligamenvirales within a class ‘Tokiviricetes’ (toki means ‘thread’ in Georgian [] and viricetes is an official suffix for a virus class), the first class-rank taxon for archaeal viruses. More importantly, our results bring to light another component of the structural ‘dark matter’ of the global virosphere and suggest that the ultimate knowledge of the complete set of structural folds used by viruses for virion formation might be within reach.
Funding
This work was supported by the National Institute of General Medical Sciences (GM122510 to E.H.E.); the European Union’s Horizon 2020 Research and Innovation Program (685778, project VIRUS X to D.P.); and l’Agence Nationale de la Recherche (ANR-17-CE15-0005-01 to M.K.). D.P.B. is part of the Pasteur—Paris University (PPU) International PhD Program, which has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie grant agreement no. 665807.
Conflict of interest: None declared.
References
- Afonine P. V. et al. (2018) ‘New Tools for the Analysis and Validation of Cryo-EM Maps and Atomic Models’, Acta Crystallographica Section D: Structural Biology, 74: 814–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baquero D. P. et al. (2020) ‘New Virus Isolates from Italian Hydrothermal Environments Underscore the Biogeographic Pattern in Archaeal Virus Communities’, The ISME Journal, doi:10.1101/2020.01.16.907410. [DOI] [PMC free article] [PubMed]
- DiMaio F. et al. (2015) ‘A Virus That Infects a Hyperthermophile Encapsidates A-Form DNA’, Science, 348: 914–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Egelman E. H. (2000) ‘A Robust Algorithm for the Reconstruction of Helical Filaments Using Single-Particle Methods’, Ultramicroscopy, 85: 225–34. [DOI] [PubMed] [Google Scholar]
- Emsley P., Cowtan K. (2004) ‘Coot: Model-Building Tools for Molecular Graphics’, Acta Crystallographica Section D: Structural Biology , 60: 2126–32. [DOI] [PubMed] [Google Scholar]
- Hartman R. et al. (2019) ‘Survey of High-Resolution Archaeal Virus Structures’, Current Opinion in Virology, 36: 74–83. [DOI] [PubMed] [Google Scholar]
- Iranzo J. et al. (2016) ‘Bipartite Network Analysis of the Archaeal Virosphere: Evolutionary Connections between Viruses and Capsidless Mobile Elements’, Journal of Virology, 90: 11043–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kasson P. et al. (2017) ‘Model for a Novel Membrane Envelope in a Filamentous Hyperthermophilic Virus’, Elife, 6: e26268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krupovic M., Koonin E. V. (2017) ‘Multiple Origins of Viral Capsid Proteins from Cellular Ancestors’, Proceedings of the National Academy of Sciences of the United States of America, 114: E2401–E2410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li X. et al. (2013) ‘Electron Counting and Beam-Induced Motion Correction Enable near-Atomic-Resolution Single-Particle cryo-EM’, Nature Methods, 10: 584–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Y. et al. (2018) ‘Structural Conservation in a Membrane-Enveloped Filamentous Virus Infecting a Hyperthermophilic Acidophile’, Nature Communications, 9: 3360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mindell J. A., Grigorieff N. (2003) ‘Accurate Determination of Local Defocus and Specimen Tilt in Electron Microscopy’, Journal of Structural Biology, 142: 334–47. [DOI] [PubMed] [Google Scholar]
- Pettersen E. F. et al. (2004) ‘UCSF Chimera–A Visualization System for Exploratory Research and Analysis’, Journal of Computational Chemistry, 25: 1605–12. [DOI] [PubMed] [Google Scholar]
- Prangishvili D., Krupovic M. (2012) ‘A New Proposed Taxon for Double-Stranded DNA Viruses, the Order “Ligamenvirales”’, Archives of Virology, 157: 791–5. [DOI] [PubMed] [Google Scholar]
- Prangishvili D. et al. (2017) ‘The Enigmatic Archaeal Virosphere’, Nature Reviews Microbiology, 15: 724–39. [DOI] [PubMed] [Google Scholar]
- Ptchelkine D. et al. (2017) ‘Unique Architecture of Thermophilic Archaeal Virus APBV1 and Its Genome Packaging’, Nature Communications, 8: 1436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quemin E. R. et al. (2015) ‘Sulfolobus Spindle-Shaped Virus 1 Contains Glycosylated Capsid Proteins, a Cellular Chromatin Protein, and Host-Derived Lipids’, Journal of Virology, 89: 11681–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rensen E. I. et al. (2016) ‘A Virus of Hyperthermophilic Archaea with a Unique Architecture among DNA Viruses’, Proceedings of the National Academy of Sciences of the Unites States of America, 113: 2478–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shaikh T. R. et al. (2008) ‘SPIDER Image Processing for Single-Particle Reconstruction of Biological Macromolecules from Electron Micrographs’, Nature Protocols, 3: 1941–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang G. et al. (2007) ‘EMAN2: An Extensible Image Processing Suite for Electron Microscopy’, Journal of Structural Biology, 157: 38–46. [DOI] [PubMed] [Google Scholar]
- Wang F. et al. (2019) ‘A Packing for A-Form DNA in an Icosahedral Virus’, Proceedings of the National Academy of Sciences of the Unites States of America, 116: 22591–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang R. Y. et al. (2015) ‘De Novo Protein Structure Determination from Near-Atomic-Resolution Cryo-EM Maps’, Nature Methods, 12: 335–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Williams C. J. et al. (2018) ‘MolProbity: More and Better Reference Data for Improved All-Atom Structure Validation’, Protein Science, 27: 293–315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ye Y., Godzik A. (2003) ‘Flexible Structure Alignment by Chaining Aligned Fragment Pairs Allowing Twists’, Bioinformatics, 19: ii246–55. [DOI] [PubMed] [Google Scholar]
- Zivanov J. et al. (2018) ‘New Tools for Automated High-Resolution Cryo-EM Structure Determination in RELION-3’, Elife, 7: e42166. [DOI] [PMC free article] [PubMed] [Google Scholar]