Skip to main content
eLife logoLink to eLife
. 2022 Jul 28;11:e77607. doi: 10.7554/eLife.77607

The giant mimivirus 1.2 Mb genome is elegantly organized into a 30-nm diameter helical protein shield

Alejandro Villalta 1,, Alain Schmitt 1,, Leandro F Estrozi 2,, Emmanuelle RJ Quemin 3,, Jean-Marie Alempic 1, Audrey Lartigue 1, Vojtěch Pražák 3,4, Lucid Belmudes 5, Daven Vasishtan 3,4, Agathe MG Colmant 1, Flora A Honoré 1, Yohann Couté 5, Kay Grünewald 3,4, Chantal Abergel 1,
Editors: Adam Frost6, Sara L Sawyer7
PMCID: PMC9512402  PMID: 35900198

Abstract

Mimivirus is the prototype of the Mimiviridae family of giant dsDNA viruses. Little is known about the organization of the 1.2 Mb genome inside the membrane-limited nucleoid filling the ~0.5 µm icosahedral capsids. Cryo-electron microscopy, cryo-electron tomography, and proteomics revealed that it is encased into a ~30-nm diameter helical protein shell surprisingly composed of two GMC-type oxidoreductases, which also form the glycosylated fibrils decorating the capsid. The genome is arranged in 5- or 6-start left-handed super-helices, with each DNA-strand lining the central channel. This luminal channel of the nucleoprotein fiber is wide enough to accommodate oxidative stress proteins and RNA polymerase subunits identified by proteomics. Such elegant supramolecular organization would represent a remarkable evolutionary strategy for packaging and protecting the genome, in a state ready for immediate transcription upon unwinding in the host cytoplasm. The parsimonious use of the same protein in two unrelated substructures of the virion is unexpected for a giant virus with thousand genes at its disposal.

Research organism: Viruses

Introduction

Acanthamoeba infecting giant viruses were discovered with the isolation of mimivirus (La Scola et al., 2003; Raoult et al., 2004). Giant viruses now represent a highly diverse group of dsDNA viruses infecting unicellular eukaryotes (Abergel et al., 2015) which play important roles in the environment (Schulz et al., 2020; Moniruzzaman et al., 2020; Kaneko et al., 2021). They also challenge the canonical definitions of viruses (Forterre, 2010; Claverie and Abergel, 2010) as they can encode central translation components (Raoult et al., 2004; Abergel et al., 2007) as well as a complete glycosylation machinery (Piacente et al., 2017; Notaro et al., 2021) among other unique features.

Mimivirus has been the most extensively studied giant virus infecting Acanthamoeba (Colson et al., 2017). The virions are 0.75 µm wide and consist of icosahedral capsids of 0.45 µm diameter surrounded by a dense layer of radially arranged fibrils (Raoult et al., 2004). Structural analyses of the virions have provided some insights into the capsid structure (Kuznetsov et al., 2013; Xiao et al., 2009; Klose et al., 2010; Ekeberg et al., 2015; Schrad et al., 2020) but given the size of the icosahedral particles (and hence the sample thickness), accessing the internal organization of the core of the virions remains challenging. Consequently, little is known about the packaging of the 1.2 Mb dsDNA genome (Chelikani et al., 2014). Inside the capsids, a lipid membrane delineates an internal compartment (~340 nm in diameter, Figure 1A, Figure 1—figure supplement 1), referred to as the nucleoid, which contains the viral genome, together with all proteins necessary to initiate the replicative cycle within the host cytoplasm (Schrad et al., 2020; Kuznetsov et al., 2010; Claverie et al., 2009; Arslan et al., 2011). Acanthamoeba cells engulf mimivirus particles, fooled by their bacteria-like size and the heavily glycosylated decorating fibrils (La Scola et al., 2003; Raoult et al., 2004; Notaro et al., 2021). Once in the phagosome, the Stargate portal located at one specific vertex of the icosahedron opens up (Zauberman et al., 2008), enabling the viral membrane to fuse with that of the host vacuole to deliver the nucleoid into the host cytoplasm (Schrad et al., 2020; Claverie et al., 2009). EM studies have shown that next the nucleoid gradually loses its electron dense appearance, transcription begins and the early viral factory is formed (Arslan et al., 2011; Suzan-Monti et al., 2007; Mutsafi et al., 2014). Previous atomic force microscopy studies of the mimivirus infectious cycle suggested that the DNA forms a highly condensed nucleoprotein complex enclosed within the nucleoid (Kuznetsov et al., 2013). Here, we show that opening of the large icosahedral capsid in vitro led to the release of rod-shaped structures of about 30-nm width. These structures were further purified and the various conformations characterized using cryo-electron microscopy (cryo-EM), tomography, and mass spectrometry (MS)-based proteomics.

Figure 1. The mimivirus genomic fiber.

(A) Micrograph of an ultrathin section of resin-embedded infected cells showing the DNA tightly packed inside mimivirus capsids (C) with electron dense material inside the nucleoid (N). The string-like features, most likely enhanced by the dehydration caused by the fixation and embedding protocol, correspond to the genomic fiber (F) packed into the nucleoid. (B) Micrograph of negative stained mimivirus capsid (C) opened in vitro with the genomic fiber (F) still being encased into the membrane-limited nucleoid (N). (C) Multiple strands of the flexible genomic fiber (F) are released from the capsid (C) upon proteolytic treatment. (D) Micrograph of negative stained purified mimivirus genomic fibers showing two conformations the right fiber resembling the one in (B) and free DNA strands (white arrowheads). (E) Slices through two electron cryo tomograms of the isolated helical protein shell of the purified genomic fibers in compact (top) or relaxed conformation (bottom) in the process of losing one protein strand (Figure 1—figure supplement 2, Figure 1—videos 1, 4). (F) Different slices through the two tomograms shown in (E) reveal DNA strands lining the helical protein shell of the purified genomic fibers in compact (top) or relaxed conformation (bottom). Examples of DNA strands extending out at the breaking points of the genomic fiber are marked by black arrowheads. Note in the top panel, individual DNA strands coated by proteins (red arc). The slicing planes at which the mimivirus genomic fibers were viewed are indicated on diagrams on the top right corner as blue dashed lines and the internal colored segments correspond to DNA strands lining the protein shell. The thickness of the tomographic slices is 1.1 nm and the distance between tomographic slices in panels (E) and (F) is 4.4 nm. Scale bars as indicated.

Figure 1.

Figure 1—figure supplement 1. Negative staining micrographs of opened mimivirus reunion virions before purification of the genomic fiber.

Figure 1—figure supplement 1.

(A) Spherical nucleoids (N) released from the capsids upon specific treatment (see methods). (B) Dissolved nucleoids expelled from the opened capsids after regular treatment (the region marked with dashed lines is enlarged in Figure 1C), all of the opened capsids (>25%) are releasing the condensed genomic fiber. (C) Multiple broken genomic fibers can be seen next to debris prior to purification and an isolated Stargate (S). (D, E) Multiple segments of the flexible genomic fiber are released upon proteolytic treatment of the capsids. (F) Example of a long, flexible, genomic fiber released from an open capsid (C).

Figure 1—figure supplement 2. Cryo-electron tomography (cryo-ET) of mimivirus genomic fiber.

Figure 1—figure supplement 2.

(Upper panel) Slices through tomograms exhibiting different features of compact, unwinding, and broken genomic fibers of mimivirus. (A) Slices through a cryo-tomogram focused on a long and broken fiber. Features of the protein shell are visible in tangential slices (top and bottom), while filamentous densities corresponding to DNA strands become visible when slicing deeper inside the fiber. Scale bar, 100 nm. (B) A straight filament running along the fiber symmetry axis has been observed in several cases (blue rectangle), which may correspond to a single or a bundle of DNA strands dissociated from the protein shell. (C) Large amorphous electron dense structures have been observed inside the lumen of the fiber (highlighted with purple circles). (D) DNA strands are often observed emanating from breaks along the genomic fiber, partially disassembling (indicated with yellow arrowheads). (E) Detail of the central slices through the genomic fibers showed in the third depicted slice in A–D. Thickness of the slices is 1.1 nm. Distance between the tomographic slices (from top to bottom) is 4.4 nm between the first and second and the fourth and fifth, and 6.6 nm between the third and second or fourth. The scheme on the left represents a cross-section of a cartoon representation of the 5-start genomic fiber depicted as a cylinder containing five strands of colored DNA internally lining the helical protein shell. It also indicates the plane through which the genomic fiber is viewed as 2D slices extracted from the tomograms in Figure 1—video 1 (A), Figure 1—video 2 (B), Figure 1—video 3 (C), and Figure 1—video 4 (D). Scale bars (B–D), 50 nm.

Figure 1—figure supplement 3. Agarose gel electrophoresis of the purified genomic fiber compared to the viral genomic DNA: M: molecular weight markers (1 kbp DNA Ladder Plus, Euromedex).

Figure 1—figure supplement 3.

Lane 1, 100 ng of mimivirus genomic DNA, Lane 2: 200 ng of untreated genomic fiber, Lane 3: 200 ng of proteinase K (PK)-treated genomic fiber, Lane 4: 200 ng of DNase- and PK-treated genomic fiber, Lane 5: 200 ng of RNase- and PK-treated genomic fiber.
Figure 1—figure supplement 3—source data 1. Source data of the agarose gel.
Figure 1—figure supplement 3—source data 2. Source data of the agarose gel.

Figure 1—figure supplement 4. Bubblegrams on the mimivirus genomic fiber.

Figure 1—figure supplement 4.

Field of view for genomic fibers either (from top to bottom): long and bent, long and straight, long and broken, or a mix of short and broken, subjected to a dose series. The total electron flux applied is specified for each field of view. First column: fibers showed after the first exposure of 75 e2. Second column: representative micrographs indicating the point at which the first sign of radiation damage as ‘bubbles’ inside the fiber (red arrowheads) were detected. Nucleoproteins with high nucleic acid content are often more susceptible to bubbling than pure proteinaceous structures. In unfolded ribbons (black arrowhead), no bubbles are detected at an accumulated flux of up to 1125 e2. Scale bars, 100 nm. Third column: enlarged view where the effect of radiation damage as trapped ‘bubbles’ is more clearly visible.

Figure 1—figure supplement 5. 200 2D classes (6 empty) were obtained after reference-free 2D classification of fibers acquired (see methods) for single-particle analysis and extracted with a box size of 500 pixels in Relion 3.1.0 after motion correction, CTF estimation, and manual picking.

Figure 1—figure supplement 5.

The 2D classes are representative of the different relaxation states of the mimivirus genomic fiber observed in our highly heterogeneous dataset.

Figure 1—figure supplement 6. Clustering analysis of the 2D classes.

Figure 1—figure supplement 6.

(A) Cross-section model adjusted to one 2D class, representative of cluster Cl1b, to estimate the parameters of the widths W1 (external) and W2 (internal). (B) Automatic sorting of the 2D classes using the fiber width W1 and pairwise correlations of the 2D classes resulting in three main clusters (compact, Cl1 in dark blue; intermediate, Cl2 in cyan and relaxed, Cl3 in green and orange). Power spectra indexing with Helixplorer (http://rico.ibs.fr/helixplorer/) for the candidate helical parameters estimation. (C) Cl1a: 5-start, cyclic symmetry C1 (https://tinyurl.com/mrxnyjwn), (D) Cl2: 6-start, cyclic symmetry C3 (https://tinyurl.com/yckmk79b), and (E) Cl3a: 5-start, dihedral symmetry D5 (https://tinyurl.com/38v5f37m).

Figure 1—video 1. Cryp-ET of a long and broken fiber.

Download video file (9.1MB, mp4)

Figure 1—video 2. Cryo-ET of a fiber with a straight filament in its lumen.

Download video file (9.7MB, mp4)

Figure 1—video 3. Cryo-ET of a fiber with large amorphous dense structures inside its lumen.

Download video file (9.6MB, mp4)

Figure 1—video 4. Cryo-ET of a fiber with DNA strands emanating from breaks along the fiber.

Download video file (7.8MB, mp4)

Results

Capsid opening induces the release of a ~30-nm-wide rod-shaped structure that contains the dsDNA genome

We developed an in vitro protocol for particle opening that led to the release of ~30 nm-wide rod-shaped fibers of several microns in length (Figure 1, Figure 1—figure supplement 1). We coined this structure the mimivirus genomic fiber. Complete expulsion of the opened nucleoid content produced bundled fibers resembling a ‘ball of yarn’ (Kuznetsov et al., 2013; Figure 1C). The capsid opening procedure involves limited proteolysis and avoids harsh conditions, as we found that the structure becomes completely denatured by heat (95°C) and is also sensitive to acidic treatment, thus preventing its detection in such conditions (Schrad et al., 2020). Various conformations of the genomic fiber were observed, sometimes even on the same fiber (Figure 1D), ranging from the most compact rod-shaped structures (Figure 1D [left], and Figure 1E, F [top]) to more relaxed structures where DNA strands begin to dissociate (Figure 1D [right], and Figure 1E, F [bottom]). After optimizing the in vitro extraction on different strains of group-A mimiviruses, we focused on an isolate from La Réunion Island (mimivirus reunion), as more capsids were opened by our protocol, leading to higher yields of genomic fibers that were subsequently purified on sucrose gradient. All opened capsids released genomic fibers (Figure 1C, Figure 1—figure supplement 1).

The first confirmation of the presence of DNA in the genomic fiber was obtained by agarose gel electrophoresis (Figure 1—figure supplement 3). Cryo-EM bubblegram analysis (Wu et al., 2012; Cheng et al., 2014) gave a further indication that the nucleic acid is located in the fiber lumen. Alike other nucleoprotein complexes, fibers are expected to be more susceptible to radiation damage then pure proteinaceous structures. Surprisingly, the specimen could sustain higher electron irradiation before the appearance of bubbles compared to other studies (Mishyna et al., 2017): 600 e/Ų for relaxed helices and up to 900 e/Ų for long compact ones, while no bubbles could be detected in unfolded ribbons (Figure 1—figure supplement 4). For comparison, bacteriophage capsids containing free DNA, that is not in the form of nucleoproteins, show bubbling for doses of ~30–40 e/Ų (Mishyna et al., 2017).

Cryo-EM single-particle analysis of the different compaction states of the mimivirus genomic fiber

In order to shed light on the mimivirus genome packaging strategy and to determine the structure of the purified genomic fibers, we performed cryo-EM single-particle analysis. The different conformations of the genomic fiber initially observed by negative staining (Figure 1) and cryo-EM resulted in a highly heterogeneous dataset for single-particle analysis. In order to separate different conformations, in silico sorting through 2D classification (using Relion, He and Scheres, 2017; Scheres, 2012) was performed. Next, we carried out cluster analysis relying on the widths of the helical segments and correlations (real and reciprocal spaces) between the experimental 2D-class patterns (Figure 1—figure supplements 5 and 6 ). Three independent clusters (Cl) could be distinguished, corresponding to the compact (Cl1), intermediate (Cl2), and relaxed (Cl3) fiber conformations (Figure 1—figure supplement 6) with the latter being the widest. For each cluster, we determined their helical symmetry parameters by image power spectra analyses and performed structure determination and refinement (Figure 2, Figure 2—figure supplements 13).

Figure 2. Structures of the mimivirus genomic fiber for Cl1a (A–C), Cl3a (D–F), and Cl2 (G–I): Electron microscopy (EM) maps of Cl1a (A) and Cl3a (D) are shown with each monomer of one GMC-oxidoreductase dimer colored in green and orange and three adjacent dimers in yellow, to indicate the large conformational change taking place between the two fiber states.

The transition from Cl1a to Cl3a (5-start helix) corresponds to a rotation of each individual unit (corresponding to a GMC-oxidoreductase dimer) by ~−10° relative to the fiber longitudinal axis and a change in the steepness of the helical rise by ~−11°. Scale bars, 50 Å. Compared to Cl1a, the Cl2 6-start helix (G) shows a difference of ~2° relative to the fiber longitudinal axis and ~2° in the steepness of the helical rise. Scale bars, 50 Å. Cross-sectional (bottom) and longitudinal (top) sections through the middle of final Cl1a (B), Cl3a (E), and Cl2 (H) EM maps. Scale bars, 50 Å. Longitudinal (top) and orthogonal (bottom) views of final Cl1a (C), Cl3a (F), and Cl2 (I) EM maps color coded according to each start of the 5-start helix. Densities for some asymmetric units in the front have been removed on the side view map to show the five DNA strands lining the protein shell interior. Scale bars 50 Å.

Figure 2.

Figure 2—figure supplement 1. Workflow of the Cl1a compact helix reconstruction process and fitting.

Figure 2—figure supplement 1.

Segment extraction was performed with a box size of 400 pixels (pix) binned twice (box size 200 pix, 2.18 Å/pix). The distance between consecutive boxes was equal to the axial rise calculated by indexation on the power spectrum. From 2D classification with 200 classes, 30 classes were selected for Cl1a after clustering see methods and Figure 1—figure supplement 6. First, 3D classification was carried out using the segments from 30 2D classes, helical symmetry parameters from the power spectrum indexation and a 300-Å featureless cylinder as 3D reference. 3D refinement of the four boxed 3D classes was achieved using one low-pass filtered 3D class as reference on the unbinned segments. One step of focused refinement (with solvent flattening) was performed using a low-pass filtered 3D reconstruction (before solvent flattening) as reference. 3D refinement was then performed. The resulting map was then used as model for automatic picking. A last 3D-refinement cycle was performed using as reference a featureless cylinder. The bottom right graph presents the Fourrier shell correlation (FSC) curves for the final 3D reconstructions (helical assembly and asymmetric unit).
Figure 2—figure supplement 2. Workflow of the Cl3a relaxed helix reconstruction process and fitting.

Figure 2—figure supplement 2.

Segment extraction was performed with a box size of 400 pixels (pix) binned twice (box size 200 pix, 2.18 Å/pix) using the rise obtained from the power spectrum indexation (see methods). From 2D classifications with 200 classes, 42 classes were selected for Cl3a after clustering (see methods and Figure 1—figure supplement 6). First, 3D classification was carried out using the 42 2D classes, with D5 symmetry, rise and twist from the power spectrum indexation and a featureless cylinder of 340 Å as reference. 3D refinement of the two 3D classes (red boxes) was achieved using one low-pass filtered 3D class as reference on the unbinned segments. The resulting map was used as model for automatic picking and segment extraction. Final 3D refinement was performed on unbinned segments using as initial reference a featureless cylinder with a last step of solvent flattening.
Figure 2—figure supplement 3. Workflow of the Cl2 helix reconstruction process and fitting.

Figure 2—figure supplement 3.

Segment extraction was performed with a box size of 400 pixels (pix) binned twice (box size 200 pix, 2.18 Å/pix) using the rise obtained from the power spectrum indexation (see methods). From 2D classifications with 200 classes, 12 classes were selected for Cl2 based on correlation (see methods and Figure 5). First, 3D refinements combining 12 2D classes with solvent flattening, were performed with the C3 symmetry, rise and twist from the power spectrum indexation and a 330-Å featureless cylinder as reference. The resulting map was used as model for automatic picking and segment extraction on the unbinned segments. Final 3D refinement was performed on unbinned segments using as initial reference a featureless cylinder with a last step of solvent flattening.
Figure 2—figure supplement 4. Comparison of sequence coverages for qu_946 (A) and qu_143 (B) obtained by mass spectrometry (MS)-based proteomic analysis of genomic fibers and total virions.

Figure 2—figure supplement 4.

Identified peptides are highlighted in red, blue, and gray when identified, respectively, in three, two, and one replicate of the purified genomic fiber.
Figure 2—video 1. Illustration of the relaxation of the genomic fiber.
Download video file (5.7MB, mp4)

For both Cl1a and Cl3a conformations, after 3D refinement, we obtained helical structures of 3.7 Å resolution (Fourrier shell correlation [FSC] threshold 0.5, masked), corresponding to 5-start left-handed helices made of a ~8-nm-thick proteinaceous external shell (Figure 2—figure supplements 12). For the most compact conformation (Cl1a) five dsDNA strands were lining the interior of the protein shell leaving a ~9-nm-wide central channel (Figures 2 and 3, Supplementary file 1). The dsDNA strands appear as curved cylinders in the helical structure, the characteristic shape of the DNA (minor and major groove) becoming only visible after focused refinement of a single strand of dsDNA (Figure 3, Figure 3—figure supplements 1 and 2). In the relaxed subcluster Cl3a, the DNA strands at the interface to the ~17-nm-wide central channel are not clearly recognizable (Figure 2, Supplementary file 1, and Figure 2—figure supplement 2), most likely because they are at least partially detached inside the broken expanded fiber. The breaks after relaxation of the helix might be the result of the extraction and purification treatment, while DNA will remain in the central channel, at least in the early phase of Acanthamoeba infection.

Figure 3. Maps of the compact (Cl1a and Cl2) genomic fiber structures.

(A) Cl1a EM map corresponding to the protein shell prior focused refinement is shown as a transparent surface and the five DNA strands as solid surface. One protein dimer strand is shown yellow except for one asymmetric unit (transparent yellow) to illustrate the dimer fit. The position of a second dimer (green) from the adjacent dimer strand is shown to emphasize that the DNA strand (gold) is lining the interface between two dimers. (B) Cartoon representation of GMC-oxidoreductase qu_946 dimers fitted into one of the Cl1a 5-start helix strands in the 3.7-Å resolution map. The map is shown at a threshold highlighting the periodicity of contacts between the dsDNA and the protein shell. Charged distribution on surface representation of Cl1a protein shell made of qu_946 dimers (C) or qu_143 dimers (D). Cartoon representation of qu_946 (E) and qu_143 (F) fitted into the Cl1a cryo-electron microscopy (cryo-EM) maps highlighting the interacting residues (given as stick models) between each monomer and one dsDNA strand. The isosurface threshold chosen allows visualization of density for the manually built N-terminal residues, including terminal cysteines (stick model), of two neighboring monomers that could form a terminal disulfide bridge. (G) Zoom into the 3.3 Å resolution focused refined Cl1a map illustrating the fit of the side chains and the FAD ligand (Figure 3—video 1). (H) Cartoon representation of the DNA fitted in the focused refined DNA only map (Figure 3—figure supplement 2). (I) Focused refined Cl1a map colored by monomer, next to a cartoon representation of the qu_946 dimer (α-helices in red, β-strands in blue, and coils in yellow). Secondary structure elements are annotated in both representations (H: helix, B: beta-strand).

Figure 3.

Figure 3—figure supplement 1. 2D classification of subtracted segments.

Figure 3—figure supplement 1.

Fifty 2D classes (2 empty) were obtained after reference-free 2D classification in Relion 3.1.0 of segments (box size of 400 pixels) of the Cl1a genomic fibers after signal subtraction to keep only the information of the DNA strands inside the fiber. The 2D classes show the enhancement of the signal corresponding to the DNA.
Figure 3—figure supplement 2. Workflow of the DNA strand focused refinement process.

Figure 3—figure supplement 2.

One strand segmented from the 5-start compacted map (Cl1a) was low-pass filtered at 20 Å and used as reference for 3D refinement of the dataset corresponding to the merged dataset of the individually subtracted five strands. The result of the refinement was low-pass filtered at 30 Å and used as reference for 3D classification. The best 3D class was further refined leading to a focused refined map of the DNA strand.
Figure 3—video 1. Illustration of the fit of the GMC-oxidoreductase into the asymetric unit focused refined Cl1a map.
Download video file (87.4MB, mp4)
Figure 3—animation 1. Illustration of the fit of the GMC-oxidoreductase into the asymmetric unit focused refined Cl1a map.

Finally, the 4-Å resolution Cl2 map obtained after 3D refinement (Figure 2—figure supplement 3) corresponds to a 6-start left-handed helix made of a~8-nm-thick proteinaceous external shell, with six dsDNA strands lining the shell interior and leaving a ~12-nm-wide inner channel (Figure 2, Supplementary file 1).

The most abundant proteins in the genomic fiber are GMC oxidoreductases, the same that compose the fibrils decorating mimivirus capsid

MS-based proteomic analyses performed on three biological replicates identified two GMC oxidoreductases as the main components of the purified genomic fiber (qu_946 and qu_143 in mimivirus reunion corresponding to L894/93 and R135, respectively, in mimivirus prototype) (Supplementary file 2). The two mimivirus reunion proteins share 69% identity (81% similarity). The available mimivirus R135 GMC-oxidoreductase dimeric structure (Klose et al., 2015) (PDB 4Z24, lacking the 50 amino acid long cysteine-rich N-terminal domain) was fitted into the EM maps (Figures 2 and 3). This is quite unexpected, since GMC oxidoreductases are already known to compose the fibrils surrounding mimivirus capsids (Notaro et al., 2021; Boyer et al., 2011). The corresponding genes are highly expressed during the late phase of the infection cycle at the time of virion assembly. Notably, the proteomic analyses provided different sequence coverages for the GMC oxidoreductases depending on whether samples were intact virions or purified genomic fiber preparations, with substantial under-representation of the N-terminal domain in the genomic fiber (Figure 2—figure supplement 4). Accordingly, the maturation of the GMC oxidoreductases involved in genome packaging must be mediated by one of the many proteases encoded by the virus or the host cell. Interestingly, mimivirus M4 (Boyer et al., 2011), a laboratory strain having lost the genes responsible for the synthesis of the two polysaccharides decorating mimivirus fibrils (Notaro et al., 2021) also lacks the GMC-oxidoreductase genes. Additional studies on this specific variant will be key to establish if it exhibits a similar genomic fiber, and if yes, which proteins are composing it.

Analysis of the genomic fiber structure

The EM maps and FSC curves of Cl1a are shown in Figure 2—figure supplement 1. An additional step of refinement focused on the asymmetric unit further improved the local resolution to 3.3 Å as indicated by the corresponding FSC (Figure 3 and Figure 2—figure supplement 1). After fitting the most abundant GMC-oxidoreductase qu_946 (SWISS-MODEL model Waterhouse et al., 2018) in the final map of Cl1a, five additional N-terminal residues in each monomer were manually built using the uninterpreted density available. This strikingly brings the cysteines of each monomer (C51 in qu_946) close enough to allow a disulfide bridge, directly after the 50 amino acids domain not covered in the proteomic analysis of the genomic fiber (Figure 3G, Figure 2—figure supplement 4). The N-terminal chain, being more disordered than the rest of the structure, it is absent in the focused refined map, and also absent in the Cl3a map of the relaxed helix, suggesting that a break of the disulfide bridge could be involved in the observed unwinding process. Models of the three helical assemblies and asymmetric unit were further refined using the real-space refinement program in PHENIX 1.18.2 (Liebschner et al., 2019). In the 3.3-Å resolution map of the asymmetric unit, most side chains and notably the FAD cofactor are accommodated by density suggesting that the oxidoreductase enzyme could be active (Figure 3D). Density that can be attributed to the FAD cofactor is also present in the Cl2 and Cl3a maps. The atomic models of Cl1a and Cl3a dimers are superimposable with a core root-mean-square deviation (RMSD) of 0.68 Å based on Cα atoms.

Inspection of individual genomic fibers in the tomograms confirmed the coexistence of both 5- and 6-start left-handed helices containing DNA (Figure 1—figure supplement 2 and Figure 3—animation 1, Figure 1—video 1). Further, some intermediate and relaxed structures were also observed in which the DNA segments appeared detached from the protein shell and sometimes completely absent from the central channel of the broken fibers. Both GMC oxidoreductases (qu_946 and qu_143) can be fitted in the 5- and 6-start maps.

In relaxed or broken fibers, large electron dense structures that might correspond to proteins inside the lumen were sometimes visible (Figure 1—figure supplement 2C, E and Figure 1—video 3) as well as dissociating DNA fragments, either in the central channel (Figure 1—figure supplement 2B and Figure 1—video 2) or at the breakage points of the fibers in its periphery (Figure 1—figure supplement 2D and Figure 1—video 4). Densities corresponding to the dimer subunits composing the protein shell were also commonly observed on dissociated DNA strands (Figure 1E, F, Figure 1—figure supplement 2, and Figure 1—video 1; Figure 1—video 2; Figure 1—video 3; Figure 1—video 4).

In the Cl1a and Cl2 helices, the monomers in each dimer are interacting with two different dsDNA strands. As a result, the DNA strands are interspersed between two dimers, each also corresponding to a different strand of the protein shell helix (Figure 3). Based on the periodic contacts between protein shell and DNA strands, these interactions might involve, in the case of the Cl1a helix, one aspartate (D82 relative to the N-terminal methionine in qu_946), one glutamate (E321), two lysines (K344, K685), one arginine (R324), and a histidine (H343) or one asparagine (N80), two lysines (K319, K342), one arginine (R322), and one tyrosine (Y687), in the case of the qu_143 (Figure 3E, F, Supplementary file 3). Intra- and interstrands contacts between each dimer are presented in Supplementary file 3 for qu_143 and qu_946 in Cl1a, Cl2, and Cl3 maps.

Despite the conformational heterogeneity and the flexibility of the rod-shaped structure, we were able to build three atomic models of the mimivirus genomic fiber, in compact (5- and 6-start) and relaxed (5-start) states. Higher resolution data would still be needed to determine the precise structure of the dsDNA corresponding to the viral genome (Figure 3B, H), however, the lower resolution for this part of the map even in focused refinement runs (Figure 3H) might also mean that the DNA does not always bind in the same orientation.

Rough estimation of genome compaction to fit into the nucleoid

Since there is a mixture of five and six strands of DNA in the genomic fiber, this could correspond to five or six genomes per fiber or to a single folded genome. Assuming that the length of DNA in B-form is ~34 Å for 10 bp, the mimivirus linear genome of 1.2 × 106 bp would extend over ~400 µm and occupy a volume of 1.3 × 106 nm3 (~300 µm and ~1 × 106 nm3 if in A-form) (Li et al., 2019). The volume of the nucleoid (Kuznetsov et al., 2010) (~340 nm in diameter) is approximately 2.1 × 107 nm3 and could accommodate over 12 copies of viral genomes in a naked state, but only 40 µm of the ~30-nm-wide flexible genomic fiber. Obviously, the mimivirus genome cannot be simply arranged linearly in the genomic fiber and must undergo further compaction to accommodate the 1.2 Mb genome in a ~40-µm-long genomic fiber. As a result, the complete mimivirus genome, folded at least five times, fits into the helical shell. This structure surprisingly resembles a nucleocapsid, such as the archaea infecting APBV1 nucleocapsid (Ptchelkine et al., 2017).

Additional proteins, including RNA polymerase subunits, are enriched in the genomic fiber

The proteomic analysis of fiber preparations revealed the presence of additional proteins including several RNA polymerase subunits: Rpb1 and Rpb2 (qu_530/532 and qu_261/259/257/255), Rpb3/11 (qu_493), Rpb5 (qu_245), RpbN (qu_379), and Rpb9 (qu_219), in addition to a kinesin (qu_313), a regulator of chromosome condensation (qu_366), a helicase (qu_572), to be possibly associated with the genome (Supplementary file 2). In addition to the two GMC oxidoreductases, at least three oxidative stress proteins were also identified together with hypothetical proteins (Supplementary file 2). RNA polymerase subunits start being expressed 1hr postinfection with a peak after 5 hr and are expressed until the end the infection cycle. GMC oxidoreductases, kinesin, regulator of chromosome condensation are all expressed after 5 hr of infection until the end of the cycle.

As expected, the core protein (qu_431) composing the nucleoid and the major capsid proteins (MCP, qu_446) were significantly decreased in the genomic fiber proteome compared to intact virions. In fact, qu_431 and qu_446 represent, respectively, 4.5% and 9.4% of the total protein abundance in virions whereas they only account for 0.4% and 0.7% of the total protein abundance in the genomic fiber, suggesting that they could be contaminants in this preparation. On the contrary, we calculated enrichment factors of more than 500 (qu_946) and 26 (qu_143) in the genomic fiber samples compared to the intact virion. Finally, the most abundant RNA polymerase subunit (qu_245) is increased by a factor of eight in the genomic fiber compared to intact virion (if the six different subunits identified are used, the global enrichment is sevenfold). Furthermore, upon inspection of the negative staining micrographs, macromolecules strikingly resembling the characteristic structure of the poxviruses RNA polymerase (Grimm et al., 2019) were frequently observed scattered around the unwinding fiber and sometimes sitting on DNA strands near broken fibers (Figure 4). Together with the tomograms showing large electron dense structures in the lumen, some RNA polymerase units could occupy the center of the genomic fiber.

Figure 4. RNA polymerase could be associated to the genomic fiber.

Figure 4.

(A) Micrograph of negative stained fiber with released DNA still being connected to a relaxed and broken fiber and adjacent scattered macromolecular complexes that might resemble RNA polymerases. (B) Strikingly, some of them (black arrows) appear to sit on a DNA strand (white arrow). (C) E, particle extracted from the NS-TEM image; P, projections of vaccinia virus RNA polymerase (6RIC and 6RUI, Grimm et al., 2019) structure in preferred orientation; CE, clean extraction (see Material and methods). White and black boxed corresponds to images with white and black asterisk (CE), respectively. Scale bar, 50 Å. Negative staining imaging may dehydrate the objects and change macromolecules volumes.

Discussion

Several DNA compaction solutions have been described. For instance, the DNA of filamentous viruses infecting archaea is wrapped by proteins to form a ribbon which in turn folds into a helical rod forming a cavity in its lumen (DiMaio et al., 2015; Wang et al., 2020). In contrast, the chromatin of cellular eukaryotes consists of DNA wrapped around histone complexes (Robinson et al., 2006). It was recently shown that the virally encoded histone doublets of the Marseilleviridae can form nucleosomes (Liu et al., 2021; Valencia-Sánchez et al., 2021) and such organization would be consistent with previous evolutionary hypotheses linking giant DNA viruses with the emergence of the eukaryotic nucleus (Bell, 2001; Bell, 2020; Chaikeeratisak et al., 2017; Claverie, 2006; Takemura, 2001). Herpesviruses (Gong et al., 2019; Liu et al., 2019), bacteriophages (Sun et al., 2015; Rao and Feiss, 2015), and APBV1 archaeal virus (Ptchelkine et al., 2017) package their dsDNA genome as naked helices or coils. Yet, APBV1 nucleocapsid structure strikingly resemble the mimivirus genomic fiber with a proteinacious shell enclosing the folded dsDNA genome. Consequently, mimivirus genomic fiber is a nucleocapsid further bundled as a ball of yarn into the nucleoid, itself encased in the large icosahedral capsids. The structure of the mimivirus genomic fiber described herein supports a complex assembly process where the DNA must be folded into five or six strands prior to or concomitant with packaging, a step that may involve the repeat containing regulator of chromosome condensation (qu_366) identified in the proteomic analysis of the genomic fiber. The proteinaceous shell, via contacting residues between the dsDNA and the GMC oxidoreductases, would guide the folding of the dsDNA strands into the structure prior loading into the nucleoid. The lumen of the fiber being large enough to accommodate the mimivirus RNA polymerase, we hypothesize that it could be sitting on the highly conserved promoter sequence of early genes (Suhre et al., 2005). This central position would support the involvement of the RNA polymerase in genome packaging into the nucleoid and could determine the channel width via its anchoring on the genome (Figure 4D). According to this scenario, the available space (although tight) for the RNA polymerase inside the genomic fiber lumen suggests it could be sterically locked inside the compact form of the genomic fiber and could start moving and transcribing upon helix relaxation, initiating the replicative cycle and the establishment of the cytoplasmic viral factory. The genome and the transcription machinery would thus be compacted together into a proteinaceous shield, ready for transcription upon relaxation (Figure 2—video 1). This organization would represent a remarkable evolutionary strategy for packaging and protecting the viral genome, in a state ready for immediate transcription upon unwinding in the host cytoplasm. This is conceptually reminiscent of icosahedral and filamentous dsRNA viruses which pack and protect their genomes together with the replicative RNA polymerase into an inner core (Toriyama, 1986; Collier et al., 2016; Ding et al., 2019). As a result, replication and transcription take place within the protein shield and viral genomes remain protected during their entire infectious cycle. In the case of dsDNA viruses however, the double helix must additionally open up to allow transcription to proceed, possibly involving the helicase identified in our proteomic study (Supplementary file 2). Finally, in addition to their structural roles, the FAD containing GMC oxidoreductases making the proteinaceous shield, together with other oxidative stress proteins (Supplementary file 2), could alleviate the oxidative stress to which the virions are exposed while entering the cell by phagocytosis.

Mimivirus virion thus appears as a Russian doll, with its icosahedral capsids covered with heavily glycosylated fibrils, two internal membranes, one lining the capsid shell, the other encasing the nucleoid, in which the genomic fiber is finally folded. To our knowledge, the structure of the genomic fiber used by mimivirus to package and protect its genome in the nucleoid represents the first description of the genome organization of a giant virus. Since the genomic fiber appears to be expelled from the nucleoid as a flexible and subsequently straight structure starting decompaction upon release, we suspect that an active, energy-dependent, process is required to bundle it into the nucleoid during virion assembly. Such an efficient structure is most likely shared by other members of the Mimiviridae family infecting Acanthamoeba and could be used by other dsDNA viruses relying on exclusively cytoplasmic replication like poxviruses to immediately express early genes upon entry into the infected cell (Malkin et al., 2003; Blanc-Mathieu et al., 2021). Finally, the parsimonious use of moonlighting GMC oxidoreductases playing a central role in two functionally unrelated substructures of the mimivirus particle: (1) as a component of the heavily glycosylated peripheral fibril layer and (2) as a proteinaceous shield to package the dsDNA into the genomic fibers questions the evolutionary incentive leading to such an organization for a virus encoding close to a thousand proteins.

Materials and methods

Nucleoid extraction

Mimivirus reunion virions defibrillated, as described previously (Notaro et al., 2021; Kuznetsov et al., 2010), were centrifuged at 10,000 × g for 10 min, resuspended in 40 mM TES pH 2 and incubated for 1 hr at 30°C to extract the nucleoid from the opened capsids.

Extraction and purification of the mimivirus genomic fiber

The genomic fiber was extracted from 12 ml of purified mimivirus reunion virions at 1.5 × 1010 particles/ml, split into 12 × 1 ml samples processed in parallel. Trypsin (Sigma T8003) in 40 mM Tris–HCl pH 7.5 buffer was added at a final concentration of 50 µg/ml and the virus-enzyme mix was incubated for 2 hr at 30°C in a heating dry block (Grant Bio PCH-1). Dithiothreitol (DTT) was then added at a final concentration of 10 mM and incubated at 30°C for 16 hr. Finally, 0.001% Triton X-100 was added to the mix and incubated for 4 hr at 30°C. Each tube was vortexed for 20 s with 1.5-mm diameter stainless steel beads (CIMAP) to separate the fibers from the viral particles and centrifuged at 5000 × g for 15 min to pellet the opened capsids. The supernatant was recovered, and the fibers were concentrated by centrifugation at 15,000 × g for 4 hr at 4°C. Most of the supernatant was discarded leaving 12 × ~200 µl of concentrated fibers that were pooled and layered on top of ultracentrifuge tubes of 4 ml (polypropylene centrifuge tubes, Beckman Coulter) containing a discontinuous sucrose gradient (40%, 50%, 60%, 70% [wt/vol] in 40 mM Tris–HCl pH 7.5 buffer). The gradients were centrifuged at 200,000 × g for 16 hr at 4°C. Since no visible band was observed, successive 0.5 ml fractions were recovered from the bottom of the tube, the first one supposedly corresponding to 70% sucrose. Each fraction was dialyzed using 20 kDa Slide-A-Lyzers (ThermoFisher) against 40 mM Tris–HCl pH 7.5 to remove the sucrose. These fractions were further concentrated by centrifugation at 15,000 × g, at 4°C for 4 hr and most of the supernatant was removed, leaving ~100 µl of sample at the bottom of each tube. At each step of the extraction procedure the sample was imaged by negative staining transmission electron microscopy (TEM) to assess the integrity of the genomic fiber (Figure 1—figure supplement 1). Each fraction of the gradient was finally controlled by negative staining TEM. For proteomic analysis, an additional step of concentration was performed by speedvac (Savant SPD131DDA, Thermo Scientific).

Negative stain TEM

300-mesh ultra-thin carbon-coated copper grids (Electron Microscopy Sciences, EMS) were prepared for negative staining by adsorbing 4–7 µl of the sample for 3 min, followed by two washes with water before staining for 2 min in 2% uranyl acetate. The grids were imaged either on a FEI Tecnai G2 microscope operated at 200 keV and equipped with an Olympus Veleta 2k camera (IBDM microscopy platform, Marseille, France); a FEI Tecnai G2 microscope operated at 200 keV and equipped with a Gatan OneView camera (IMM, microscopy platform, France) or a FEI Talos L120c operated at 120 keV and equipped with a Ceta 16M camera (CSSB multi-user cryo-EM facility, Germany, Figure 1, Figure 1—figure supplement 1).

Cryo-electron tomography

Sample preparation

For cryo-electron tomography (cryo-ET) of the mimivirus genomic fiber, samples were prepared as described above for single-particle analysis except that 5 nm colloidal gold fiducial markers (UMC, Utrecht) were added to the sample right before plunge freezing at a ratio of 1:2 (sample:fiducial markers).

Data acquisition

Tilt series were acquired using SerialEM (Mastronarde, 2005) on a Titan Krios (Thermo Scientific) microscope operated at 300 keV and equipped with a K3 direct electron detector and a GIF BioQuantum energy filter (Gatan). We used the dose-symmetric tilt-scheme (Hagen et al., 2017) starting at 0° with a 3° increment to ±60° at a nominal magnification of ×64,000, a pixel size of 1.4 Å and a total fluence of 150 e/Ų over the 41 tilts, that is, ~3.7 e/Ų/tilt for an exposure time of 0.8 s fractionated into 0.2 s frames (Supplementary file 4).

Data processing

Tilt series were aligned and reconstructed using the IMOD (Kremer et al., 1996). For visualization purposes, we applied a binning of 8 and SIRT-like filtering from IMOD (Kremer et al., 1996) as well as a bandpass filter bsoft (Heymann and Belnap, 2007). The tomograms have been deposited on EMPIAR, accession number 1131 and videos were prepared with Fiji (Figure 1E, F, Figure 1—figure supplement 2, and Figure 1—video 1; Figure 1—video 2; Figure 1—video 3; Figure 1—video 4).

Agarose gel electrophoresis and DNA dosage to assess the presence of DNA into the fiber

Genomic DNA was extracted from 1010 virus particles using the PureLink TM Genomic DNA mini kit (Invitrogen) according to the manufacturer’s protocol. Purified genomic fiber was obtained following the method described above. The purified fiber was treated by adding proteinase K (PK) (Takara ST 0341) to 20 µl of sample (200 ng as estimated by dsDNA Qubit fluorometric quantification) at a final concentration of 1 mg/ml and incubating the reaction mix at 55°C for 30 min. DNase treatment was done by adding DNase (Sigma 10104159001) and MgCl2 to a final concentration of 0.18 mg/ml and 5 mM, respectively, in 20 µl of sample and incubated at 37°C for 30 min prior to PK treatment. For RNase treatment, RNase (Sigma SLBW2866) was added to 20 µl (200 ng) of sample solution to a final concentration of 1 mg/ml and incubated at 37°C for 30 min prior to PK treatment. All the samples were then loaded on a 1% agarose gel and stained with ethidium bromide after migration. The bands above the 20 kbp marker correspond to the stacked dsDNA fragments of various lengths compatible with the negative staining images of the broken fibers where long DNA fragment are still attached to the helical structure. The mimivirus purified genomic DNA used as a control migrates at the same position (Figure 1—figure supplement 3).

Cryo-EM bubblegram analysis

Samples were prepared as described for single-particle analysis. Dose series were acquired on a Titan Krios (Thermo Scientific) microscope operated at 300 keV and equipped with a K3 direct electron detector and a GIF BioQuantum (Gatan) energy filter. Micrographs were recorded using SerialEM (Mastronarde, 2005) at a nominal magnification of ×81,000, a pixel size of 1.09 Å, and a rate of 15 e/pixel/s (Figure 1—figure supplement 4 and Supplementary file 4). Dose series were acquired by successive exposures of 6 s, resulting in an irradiation of 75 e/Ų per exposure. Micrographs were acquired with 0.1 s frames and aligned in SerialEM (Mastronarde, 2005). In a typical bubblegram experiment, 12–15 successive exposures were acquired in an area of interest with cumulative irradiations of 900–1125 e/Ų total (Figure 1—figure supplement 4).

Single-particle analysis by cryo-EM

Sample preparation

For single-particle analysis, 3 µl of the purified sample were applied to glow-discharged Quantifoil R 2/1 Cu grids, blotted for 2 s using a Vitrobot Mk IV (Thermo Scientific) and applying the following parameters: 4°C, 100% humidity, blotting force 0, and plunge frozen in liquid ethane/propane cooled to liquid nitrogen temperature.

Data acquisition

Grids were imaged using a Titan Krios (Thermo Scientific) microscope operated at 300 keV and equipped with a K2 direct electron detector and a GIF BioQuantum energy filter (Gatan). 7656 movie frames were collected using the EPU software (Thermo Scientific) at a nominal magnification of ×130,000 with a pixel size of 1.09 Å and a defocus range of −1 to −3 μm. Micrographs were acquired using EPU (Thermo Scientific) with 8-s exposure time, fractionated into 40 frames and 7.5 e/pixel/s (total fluence of 50.5 e/Ų) (Supplementary file 4).

2D classification and clustering of 2D classes

All movie frames were aligned using MotionCor2 (Zheng et al., 2017) and used for contrast transfer function (CTF) estimation with CTFFIND-4.1 (Rohou and Grigorieff, 2015). Helical segments of the purified genomic fibers, manually picked with Relion 3.0 (He and Scheres, 2017; Scheres, 2012), were initially extracted with different box sizes, 400 pixels for 3D reconstructions, 500 pixels for initial 2D classifications and clustering, and 700 pixels to estimate the initial values of the helical parameters. Particles were subjected to reference-free 2D classification in Relion 3.1.0 (He and Scheres, 2017; Scheres, 2012), where multiple conformations of the fiber were identified (Figure 1—figure supplements 5 and 6).

We then performed additional cluster analysis of the 194 initial 2D classes provided by Relion (Figure 1—figure supplement 5) to aim for more homogeneous clusters, eventually corresponding to different states (Figure 1—figure supplement 6). A custom two-step clustering script was written in python with the use of Numpy (Harris et al., 2020) and Scikit-learn (Pedregosa et al., 2011) libraries. First, a few main clusters were identified by applying a DBSCAN (Hahsler, 2019) clustering algorithm on the previously estimated fiber external width values (W1). The widths values, estimated by adjusting a parameterized cross-section model on each 2D stack, range from roughly 280 to 340 Å. The cross-section model fitting process is based on adjusting a section profile (S) described in the equation bellow on the longitudinally integrated 2D-class profile (Figure 1—figure supplement 6A). The cross-section model is composed of a positive cosine component (parameterized by the center position μ, its width σ1 and amplitude a1) associated with the fiber external shell, a negative cosine component (parameterized by the center position μ, its width σ2 and amplitude a2) associated with the central hollow lumen (W2), and a constant p0, accounting for the background level, as.

S(μ, σ1,a1,σ2,a2,p0,x)=a1cosxμσ1a2cosxμσ2+p0

Then, as a second step, each main cluster was subdivided into several subclusters by applying a KMEANS (Mannor et al., 2011) clustering algorithm on a pairwise similarity matrix. This similarity metric was based on a 2D image cross-correlation scheme, invariant to image shifts, and mirroring (Guizar-Sicairos et al., 2008). The number of subclusters was manually chosen by visual inspection. For the most populated 2D classes corresponding to the most compact conformations, the number of subclusters was small: n = 2 subclusters, Cl1a and Cl1b, and n = 1 subcluster for the intermediate class, Cl2. However, the number of subclusters was higher (n = 5) for the relaxed conformation (Cl3 from green to purple in Figure 1—figure supplement 6), highlighting the overall heterogeneity of our dataset with compact, intermediate, relaxed states and even loss of one protein strands (Cl3b) and unwound ribbons.

Identification of candidate helical parameters

Fourier transform analysis methods have been used to identify helical parameters candidates (Coudray et al., 2016; Diaz et al., 2010; Sachse, 2015) for the Cl1a, Cl2, and Cl3a clusters. For each cluster, we first estimated the repeat distance by applying the method consisting in a longitudinal autocorrelation with a windowed segment of the real-space 2D class of fixed size (100 Å) (Diaz et al., 2010). Then, a precise identification of the power spectrum maxima could be achieved on a high signal-to-noise ratio power spectrum, obtained by averaging all the constituting segments in the Fourier domain, which helps lowering the noise, and fill in the CTF zeros regions. The best candidates were validated with Helixplorer (http://rico.ibs.fr/helixplorer/).

For the Cl1a cluster, the parameters of a 1-start helix have been identified with a rise of 7.937 Å and a twist of 221.05°. For the Cl2 cluster, the candidate parameters are a rise of 20.383 Å and a twist of 49.49° and C3 cyclic symmetry. For the Cl3a cluster, the candidate parameters correspond to a rise of 31.159 Å and a twist of 24° and D5 symmetry (Figure 1—figure supplement 6C–E).

Cryo-EM data processing and 3D reconstruction

- Cl1a–Cl3a

After helical parameters determination, two last 2D classifications were performed on segments extracted with a box size of 400 pixels (decimated to 200 pixels) using the proper rises for the most compact Cl1a (7.93 Å, 113,026 segments, overlap ~98.2%) and the relaxed Cl3a (31.16 Å, 16,831 segments, overlap ~92.9%). Values of the helical parameters (rise and twist) were then used for Relion 3D classification (He and Scheres, 2017; Scheres, 2012), with a ±10% freedom search range, using a featureless cylinder as initial reference (diameter of 300 Å for the compact particles Cl1a and 340 Å for the relaxed particles Cl3a). The superimposable 3D classes (same helical parameters, same helix orientation) were then selected, reducing the dataset to 95,722 segments for the compact fiber (Cl1a) and to 15,289 segments for the relaxed fiber (Cl3a). After re-extraction of the selected segments without scaling, further 3D refinement was performed with a 3D classification output low-pass filtered to 15 Å as reference. With this, the maps were resolved enough (Cl1a: 4.4 Å, Cl3a: 4.8; FSC threshold 0.5) to identify secondary structure elements (with visible cylinders corresponding to the helices) (Figure 2—figure supplements 12).

- Cl2

The 12 2D classes corresponding to segments of the Cl2 conformation, were extracted with a box size of 400 pixels (decimated to 200 pixels, rise 20.4 Å, 5,775 segments, overlap ~94.9%). They were used in Relion for 3D refinement with the helical parameters values identified previously, with a ±10% freedom search range, using a 330 Å large featureless cylinder as initial reference and resulted in a 7.1-Å map (FSC threshold 0.5) and C3 cyclic symmetry (Figure 2—figure supplement 3).

Focused refinement of a single oxidoreductase dimer

The Cl1a 3D map was used to make a mask corresponding to a single dimer of oxidoreductases through the segmentation module (Pintilie et al., 2010) of the program Chimera (Pettersen et al., 2004) (this mask was deposited as part of the EMDB D_1292117739). With Relion and this mask we performed partial signal subtraction (Bai et al., 2015) to remove the information of the other dimers from the experimental images generating a new stack of subtracted images. This was followed by focused refinement with an initial model created (command "relion_reconstruct") from the subtracted dataset and the corresponding orientation/centering parameters from the partial signal subtraction. CTF parameters refinement was performed followed by a last 3D refinement step and postprocessing (B-factor applied −45). This led to the best resolved 3.3-Å 3D map (FSC threshold 0.5, masked, Figure 2—figure supplement 1) that was used to build the atomic model of the GMC oxidoreductase and its ligands (Supplementary file 1, Figure 3). As expected, densities corresponding to DNA were still visible in the final 3D map because the 3D density used during partial signal subtraction contained no high-resolution information about the DNA structure. A Relion project directory summarizing all the steps to perform the focused refinement of the asymmetric unit (oxidoreductase dimer) is available under the link: https://src.koda.cnrs.fr/igs/genfiber_cl1a_focusrefine_relion_pipeline.

DNA focused refinement

The 3D EM maps of Cl1a and Cl2 were used to make masks for each DNA strand separately with ChimeraX (Goddard et al., 2018). Each strand was used in a partial signal subtraction removing information from all proteins keeping only information near the presumed DNA regions on the corresponding experimental images. The subtracted (DNA only) datasets were merged and subjected to 2D classification to allow visual assessment of the quality of the subtracted images (Figure 3—figure supplement 1). 3D refinement was performed followed by 3D classification and final refinement of the best 3D classes (Figure 3E, Figure 3—figure supplement 2).

For the Cl3a compaction state (where some barely visible densities could correspond to remaining DNA) a cylinder with a diameter corresponding to the lumen of the relaxed structure (140 Å) was used as mask for partial signal subtraction in an attempt to enhance the information from the presumed DNA regions. However, the signal remained too weak to be recognized as such.

Automatic picking of the three different conformations (Cl1a, Cl2, and Cl3a)

Projections from the different helical maps were used as new input for automatic picking of each Cl1a, Cl2, and Cl3a clusters to get more homogeneous datasets for each conformation (Estrozi and Navaza, 2008). For each dataset a final round of extraction (box size Cl1a: 380 pixels, 121,429 segments; Cl2: 400 pixels, 8479 segments; Cl3a: 400 pixels, 11,958 segments) and 3D refinement with solvent flattening (Central z length 30%) was performed using the appropriate helical parameters and additional symmetries (none for Cl1a, C3 for Cl2, and D5 for Cl3a). This led to the improved maps presented in Figures 2 and 3, Figure 2—figure supplements 13 (Cl1a: 3.7 Å; Cl2: 4.0 Å; Cl3a: 3.7 Å; FSC threshold 0.5, masked). Postprocessing was performed with B-factor −80 (Supplementary file 1, Figure 2—figure supplements 13).

Structures refinement

The resolution of the EM map enabled to fit the R135 dimeric structure (Klose et al., 2015) (PDB 4Z24) into the maps using UCSF Chimera 1.13.1 (Pettersen et al., 2004). The qu_143 and qu_946 models were obtained using SWISS-MODEL (Waterhouse et al., 2018) (closest PDB homolog: 4Z24). It is only at that stage that the best fitted qu_946 model was manually inspected and additional N-terminal residues built using the extra density available in the cryo-EM map of the 5-start compact reconstruction (Figure 3). The entire protein shell built using the corresponding helical parameters were finally fitted into the Cl1a and Cl3a maps and were further refined against the map using the real-space refinement program in PHENIX 1.18.2 (Liebschner et al., 2019; Figure 2—figure supplements 1 and 2). The qu_143 and qu_946 models were also used to build the entire shell using the Cl2 helical parameters and symmetry and were further refined using the real-space refinement program in PHENIX 1.18.2 (Liebschner et al., 2019; Figure 2—figure supplement 3). Validations were also performed into PHENIX 1.18.2 (Liebschner et al., 2019) using the comprehensive validation program and statistics in Supplementary file 1 correspond to the qu_143 model. The qu_946 model was manually corrected and ultimately refined and validated into PHENIX 1.18.2 (Liebschner et al., 2019) using the highest resolution focused refined Cl1a map. In that map the two first amino acid are disordered including the cysteine.

MS-based proteomic analysis of mimivirus virion and genomic fiber

Proteins extracted from total virions and purified fiber were solubilized with Laemmli buffer (4 volumes of sample with 1 volume of Laemmli 5× – 125 mM Tris–HCl pH 6.8, 10% sodium dodecyl-sulfate (SDS), 20% glycerol, 25% β-mercaptoethanol, and traces of bromophenol blue) and heated for 10 min at 95°C. Three independent infections using three different batches of virions were performed and the genomic fiber was extracted from the resulting viral particles to analyze three biological replicates. Extracted proteins were stacked in the top of an SDS–polyacrylamide gel electrophoresis PAGE gel (4–12% NuPAGE, Life Technologies), stained with Coomassie blue R-250 (BioRad) before in-gel digestion using modified trypsin (Promega, sequencing grade) as previously described Casabona et al., 2013. Resulting peptides were analyzed by online nanoliquid chromatography coupled to tandem MS (UltiMate 3000 RSLCnano and Q-Exactive Plus, Thermo Scientific). Peptides were sampled on a 300 µm × 5 mm PepMap C18 precolumn and separated on a 75 µm × 250 mm C18 column (Reprosil-Pur 120 C18-AQ, 1.9 μm, Dr. Maisch) using a 60 min gradient for fiber preparations and a 140 min gradient for virion. MS and MS/MS data were acquired using Xcalibur (Thermo Scientific). Peptides and proteins were identified using Mascot (version 2.7.0) through concomitant searches against mimivirus reunion, classical contaminant databases (homemade), and the corresponding reversed databases. The Proline software (Bouyssié et al., 2020) was used to filter the results: conservation of rank 1 peptides, peptide score ≥25, peptide length ≥6, peptide-spectrum-match identification false discovery rate <1% as calculated on scores by employing the reverse database strategy, and minimum of 1 specific peptide per identified protein group. Proline was then used to perform a compilation and MS1-based quantification of the identified protein groups. Intensity-based absolute quantification (iBAQ) (Schwanhäusser et al., 2011) values were calculated from MS intensities of identified peptides. The viral proteins detected in a minimum of two replicates are reported with their molecular weight, number of identified peptides, sequence coverage, and iBAQ values in each replicate. The number of copies per fiber for each protein was calculated according to their iBAQ value based on the GMC-oxidoreductase iBAQ value (see below, Supplementary file 2). Mapping of the identified peptides for the GMC oxidoreductases are presented in Figure 2—figure supplement 4.

Protein/DNA ratio validation

To compare the theoretical composition of the genomic fiber accommodating a complete genome with the experimental concentrations in protein and DNA of the sample, we performed DNA and protein quantification (Qubit fluorometric quantification, Thermo Fischer Scientific) on two independent samples of the purified genomic fiber. This returned a concentration of 1.5 ng/µl for the dsDNA and 24 ng/µl for the proteins in one sample and 10 ng/µl for the dsDNA and 150 ng/µl for the protein in the second sample (deposited on agarose gel, Figure 1—figure supplement 3). Considering that all proteins correspond to the GMC-oxidoreductase subunits (~71 kDa), in the sample there is a molecular ratio of 7.2 dsDNA basepairs per GMC-oxidoreductase subunit in the first sample and 7.68 in the second. Based on our model, a 5-start genomic fiber containing the complete mimivirus genome (1,196,989 bp) should be composed of ~95,000 GMC-oxidoreductases subunits (~93,000 for a 6-start). This gives a molecular ratio of 12.5 dsDNA basepairs per GMC-oxidoreductase subunit, which would be more than experimentally measured. However, in the cryo-EM dataset, there are at most 50% of genomic fibers containing the DNA genome (mostly Cl1), while some DNA strands can be observed attached to the relaxed genomic fiber Cl3 but the vast majority of released DNA was lost during the purification of the genomic fiber. Applying an estimated loss of 50% to the total DNA compared to the measured values, we obtain a ratio, which is in the same order of magnitude of the ones measured in the purified genomic fiber samples.

Model visualization

Molecular graphics and analyses were performed with UCSF Chimera 1.13.1 (Pettersen et al., 2004) and UCSF ChimeraX 1.1 (Goddard et al., 2018), developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from National Institutes of Health R01-GM129325 and the Office of Cyber Infrastructure and Computational Biology, National Institute of Allergy and Infectious Diseases.

Analysis of macromolecules in NS-TEM images of mimivirus genomic fiber

Manually picked positions of the particles of interest have been used to automatically extract 193-Å square area from the micrographs (Figure 4C (E)). Then, manual clipping of the particle from its noisy background has been achieved using GIMP’s intelligent scissors and a smooth transparent to black mask, producing the clean extraction image (Figure 4C (CE)). This clipping step was applied in order to improve the semi-automatic identification of the closest orientation in all RNA polymerase projections. The projection views of the different RNA polymerase were produced by converting the PDB model into a volume density through EMAN2’s e2pdb2mrc software, and converting the volume into a 2D projection using a dedicated python script, ultimately applying a Gaussian blur filter (σ = 2.9 Å) in order to roughly simulate the whole imaging transfer function. The projections dataset of all orientations of the vaccinia virus RNA polymerase structure was produced with a 5° rotation step in all angles (PDB: 6RIC, equivalent subunits identified by our MS-based proteomics of purified mimivirus genomic fibers). Preferred projections (Figure 4C (P)) orientations were manually assessed.

Acknowledgements

The cryo-EM work was performed at the multi-user Cryo-EM Facility at CSSB. We thank Jean-Michel Claverie for his comments on the manuscript and discussions all along the project. We thank Irina Gutsche, Ambroise Desfosses, Eric Durand, and Juan Reguera for their helpful support on structural work. We thank Carolin Seuring for support and technical help. Processing was performed on the DESY Maxwell cluster, at IBS and IGS. We thank Wolfgang Lugmayr for assistance and Sebastien Santini and Guy Schoehn for support. The preliminary electron microscopy experiments were performed on the PiCSL-FBI core facility (Nicolas Brouilly, Fabrice Richard, and Aïcha Aouane, IBDM, AMU-Marseille), member of the France-BioImaging national research infrastructure and on the IMM imaging platform (Artemis Kosta). This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement no. 832601). This work was also partially supported by the French National Research Agency ANR-16-CE11-0033-01. Proteomic experiments were partly supported by ProFI (ANR-10-INBS-08-01) and GRAL, a program from the Chemistry Biology Health (CBH) Graduate School of University Grenoble Alpes (ANR-17-EURE-0003). Cryo-EM data collection was supported by DFG grants (INST 152/772-1|152/774-1|152/775-1|152/776-1). Work in the laboratory of Kay Grünewald is funded by the Wellcome Trust (107806/Z/15/Z), the Leibniz Society, the City of Hamburg, and the BMBF (05K18BHA). Emmanuelle Quemin received support from the Alexander von Humboldt foundation (individual research fellowship no. FRA 1200789 HFST-P). Chantal Abergel received support from France-BioImaging national research infrastructure (ANR-10-INBS-04).

Funding Statement

The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication. For the purpose of Open Access, the authors have applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.

Contributor Information

Chantal Abergel, Email: Chantal.Abergel@igs.cnrs-mrs.fr.

Adam Frost, University of California, San Francisco (Adjunct), United States.

Sara L Sawyer, University of Colorado Boulder, United States.

Funding Information

This paper was supported by the following grants:

  • European Research Council 832601 to Chantal Abergel.

  • Agence Nationale de la Recherche ANR-16-CE11-0033-01 to Chantal Abergel.

  • Agence Nationale de la Recherche ANR-10-INBS-08-01 to Yohann Couté.

  • Agence Nationale de la Recherche ANR-17-EURE-0003 to Yohann Couté.

  • Wellcome Trust 107806/Z/15/Z to Kay Grünewald.

  • 152/774-1 INST 152/772-1 to Kay Grünewald.

  • Alexander von Humboldt-Stiftung FRA 1200789 HFST-P to Emmanuelle RJ Quemin.

  • Agence Nationale de la Recherche ANR-10-INBS-04 to Chantal Abergel.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – review and editing.

Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – review and editing.

Data curation, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – review and editing.

Resources, Data curation, Funding acquisition, Visualization, Writing – review and editing.

Formal analysis, Investigation, Methodology.

Formal analysis, Investigation, Methodology, Writing – review and editing.

Validation, Investigation, Visualization, Writing – review and editing.

Formal analysis, Investigation.

Validation, Investigation, Visualization.

Investigation.

Investigation.

Data curation, Formal analysis, Supervision, Funding acquisition, Investigation, Methodology, Writing – review and editing.

Resources, Supervision, Funding acquisition, Visualization, Writing – review and editing.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing - original draft, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Mimivirus Reunion genomic fibers data statistics (Liebschner et al., 2019).
elife-77607-supp1.docx (17.8KB, docx)
Supplementary file 2. Mass spectrometry-based proteomic analysis of (A) three independent preparations of mimivirus genomic fiber and (B) one sample of purified mimivirus virions.

(ND: not detected). RNA polymerase subunits are marked in red.

elife-77607-supp2.docx (53.3KB, docx)
Supplementary file 3. Contacting residues for each GMC oxidoreductases in the different maps.

Conserved or divergent amino acids are color coded in green or in red, respectively.

elife-77607-supp3.docx (116.4KB, docx)
Supplementary file 4. Data acquisition parameters for cryo-electron microscopy (cryo-EM).
elife-77607-supp4.docx (16.5KB, docx)
Transparent reporting form

Data availability

Mimivirus reunion genome has been deposited under the following accession number: BankIt2382307 Seq1 MW004169. 3D reconstruction maps and the corresponding PDB have been deposited to EMDB (Deposition number Cl1a: 7YX4, EMD-14354; Cl1a focused refined: D_1292117739; Cl3a: 7YX5, EMD-14355; Cl2: 7YX3, EMD-14353). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD021585 and 10.6019/PXD021585. The tomograms have been deposited in EMPIAR under accession number 1131 and tomograms video is provided with the article.

The following datasets were generated:

Brun V, Coute Y. 2021. Proteomic analysis of virion and genomic fibre of Mimivirus reunion. PRIDE. PXD021585

Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its compact 5-start helix form. Electron Microscopy Data Bank. EMD-14354

Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its compact 6-start helix form. Electron Microscopy Data Bank. EMD-14353

Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its relaxed 5-start helix form. Electron Microscopy Data Bank. EMD-14355

Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre asymmetric unit. Electron Microscopy Data Bank. EMD-13641

Fabre E, Poirot O, Legendre M, Abergel C, Claverie JM, Yoshida T, Ogata H. 2021. Mimivirus reunion isolate Queen, complete genome. NCBI Nucleotide. MW004169

Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15630

Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15627

Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15628

Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15629

References

  1. Abergel C, Rudinger-Thirion J, Giegé R, Claverie J-M. Virus-encoded aminoacyl-tRNA synthetases: structural and functional characterization of mimivirus TyrRS and MetRS. Journal of Virology. 2007;81:12406–12417. doi: 10.1128/JVI.01107-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Abergel C, Legendre M, Claverie JM. The rapidly expanding universe of giant viruses: Mimivirus, Pandoravirus, Pithovirus and Mollivirus. FEMS Microbiology Reviews. 2015;39:779–796. doi: 10.1093/femsre/fuv037. [DOI] [PubMed] [Google Scholar]
  3. Arslan D, Legendre M, Seltzer V, Abergel C, Claverie JM. Distant Mimivirus relative with a larger genome highlights the fundamental features of Megaviridae. PNAS. 2011;108:17486–17491. doi: 10.1073/pnas.1110889108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bai X, Rajendra E, Yang G, Shi Y, Scheres SHW. Sampling the conformational space of the catalytic subunit of human γ-secretase. eLife. 2015;4:e11182. doi: 10.7554/eLife.11182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bell PJ. Viral eukaryogenesis: was the ancestor of the nucleus a complex DNA virus? Journal of Molecular Evolution. 2001;53:251–256. doi: 10.1007/s002390010215. [DOI] [PubMed] [Google Scholar]
  6. Bell PJL. Evidence supporting a viral origin of the eukaryotic nucleus. Virus Research. 2020;289:198168. doi: 10.1016/j.virusres.2020.198168. [DOI] [PubMed] [Google Scholar]
  7. Blanc-Mathieu R, Dahle H, Hofgaard A, Brandt D, Ban H, Kalinowski J, Ogata H, Sandaa RA. A persistent giant algal virus, with A unique morphology, encodes an unprecedented number of genes involved in energy metabolism. Journal of Virology. 2021;95:e02446–e02420. doi: 10.1128/JVI.02446-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bouyssié D, Hesse AM, Mouton-Barbosa E, Rompais M, Macron C, Carapito C, Gonzalez de Peredo A, Couté Y, Dupierris V, Burel A, Menetrey JP, Kalaitzakis A, Poisat J, Romdhani A, Burlet-Schiltz O, Cianférani S, Garin J, Bruley C. Proline: an efficient and user-friendly software suite for large-scale proteomics. Bioinformatics. 2020;36:3148–3155. doi: 10.1093/bioinformatics/btaa118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Boyer M, Azza S, Barrassi L, Klose T, Campocasso A, Pagnier I, Fournous G, Borg A, Robert C, Zhang X, Desnues C, Henrissat B, Rossmann MG, La Scola B, Raoult D. Mimivirus shows dramatic genome reduction after intraamoebal culture. PNAS. 2011;108:10296–10301. doi: 10.1073/pnas.1101118108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Casabona MG, Vandenbrouck Y, Attree I, Couté Y. Proteomic characterization of Pseudomonas aeruginosa PAO1 inner membrane. Proteomics. 2013;13:2419–2423. doi: 10.1002/pmic.201200565. [DOI] [PubMed] [Google Scholar]
  11. Chaikeeratisak V, Nguyen K, Khanna K, Brilot AF, Erb ML, Coker JKC, Vavilina A, Newton GL, Buschauer R, Pogliano K, Villa E, Agard DA, Pogliano J. Assembly of a nucleus-like structure during viral replication in bacteria. Science. 2017;355:194–197. doi: 10.1126/science.aal2130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chelikani V, Ranjan T, Kondabagil K. Revisiting the genome packaging in viruses with lessons from the “Giants.”. Virology. 2014;466–467:15–26. doi: 10.1016/j.virol.2014.06.022. [DOI] [PubMed] [Google Scholar]
  13. Cheng N, Wu W, Watts NR, Steven AC. Exploiting radiation damage to map proteins in nucleoprotein complexes: the internal structure of bacteriophage T7. Journal of Structural Biology. 2014;185:250–256. doi: 10.1016/j.jsb.2013.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Claverie JM. Viruses take center stage in cellular evolution. Genome Biology. 2006;7:110. doi: 10.1186/gb-2006-7-6-110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Claverie JM, Grzela R, Lartigue A, Bernadac A, Nitsche S, Vacelet J, Ogata H, Abergel C. Mimivirus and Mimiviridae: giant viruses with an increasing number of potential hosts, including corals and sponges. Journal of Invertebrate Pathology. 2009;101:172–180. doi: 10.1016/j.jip.2009.03.011. [DOI] [PubMed] [Google Scholar]
  16. Claverie JM, Abergel C. Mimivirus: the emerging paradox of quasi-autonomous viruses. Trends in Genetics. 2010;26:431–437. doi: 10.1016/j.tig.2010.07.003. [DOI] [PubMed] [Google Scholar]
  17. Collier AM, Lyytinen OL, Guo YR, Toh Y, Poranen MM, Tao YJ. Initiation of rna polymerization and polymerase encapsidation by a small dsrna virus. PLOS Pathogens. 2016;12:e1005523. doi: 10.1371/journal.ppat.1005523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Colson P, La Scola B, Levasseur A, Caetano-Anollés G, Raoult D. Mimivirus: leading the way in the discovery of giant viruses of amoebae. Nature Reviews. Microbiology. 2017;15:243–254. doi: 10.1038/nrmicro.2016.197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Coudray N, Lasala R, Zhang Z, Clark KM, Dumont ME, Stokes DL. Deducing the symmetry of helical assemblies: Applications to membrane proteins. Journal of Structural Biology. 2016;195:167–178. doi: 10.1016/j.jsb.2016.05.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Diaz R, Rice WJ, Stokes DL. Fourier-Bessel reconstruction of helical assemblies. Methods in Enzymology. 2010;482:131–165. doi: 10.1016/S0076-6879(10)82005-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. DiMaio F, Yu X, Rensen E, Krupovic M, Prangishvili D, Egelman EH. Virology: A virus that infects A hyperthermophile encapsidates A-form DNA. Science. 2015;348:914–917. doi: 10.1126/science.aaa4181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ding K, Celma CC, Zhang X, Chang T, Shen W, Atanasov I, Roy P, Zhou ZH. In situ structures of rotavirus polymerase in action and mechanism of mRNA transcription and release. Nature Communications. 2019;10:2216. doi: 10.1038/s41467-019-10236-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ekeberg T, Svenda M, Abergel C, Maia FRNC, Seltzer V, Claverie J-M, Hantke M, Jönsson O, Nettelblad C, van der Schot G, Liang M, DePonte DP, Barty A, Seibert MM, Iwan B, Andersson I, Loh ND, Martin AV, Chapman H, Bostedt C, Bozek JD, Ferguson KR, Krzywinski J, Epp SW, Rolles D, Rudenko A, Hartmann R, Kimmel N, Hajdu J. Three-dimensional reconstruction of the giant mimivirus particle with an x-ray free-electron laser. Physical Review Letters. 2015;114:098102. doi: 10.1103/PhysRevLett.114.098102. [DOI] [PubMed] [Google Scholar]
  24. Estrozi LF, Navaza J. Fast projection matching for cryo-electron microscopy image reconstruction. Journal of Structural Biology. 2008;162:324–334. doi: 10.1016/j.jsb.2008.01.014. [DOI] [PubMed] [Google Scholar]
  25. Forterre P. Giant viruses: conflicts in revisiting the virus concept. Intervirology. 2010;53:362–378. doi: 10.1159/000312921. [DOI] [PubMed] [Google Scholar]
  26. Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH, Ferrin TE. Ucsf chimerax: Meeting modern challenges in visualization and analysis: Ucsf chimerax visualization system. Protein Science: A Publication of the Protein Society. 2018;27:14–25. doi: 10.1002/pro.3235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gong D, Dai X, Jih J, Liu YT, Bi GQ, Sun R, Zhou ZH. Dna-packing portal and capsid-associated tegument complexes in the tumor herpesvirus kshv. Cell. 2019;178:1329–1343. doi: 10.1016/j.cell.2019.07.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Grimm C, Hillen HS, Bedenk K, Bartuli J, Neyer S, Zhang Q, Hüttenhofer A, Erlacher M, Dienemann C, Schlosser A, Urlaub H, Böttcher B, Szalay AA, Cramer P, Fischer U. Structural basis of poxvirus transcription: vaccinia rna polymerase complexes. Cell. 2019;179:1537–1550. doi: 10.1016/j.cell.2019.11.024. [DOI] [PubMed] [Google Scholar]
  29. Guizar-Sicairos M, Thurman ST, Fienup JR. Efficient subpixel image registration algorithms. Optics Letters. 2008;33:156–158. doi: 10.1364/ol.33.000156. [DOI] [PubMed] [Google Scholar]
  30. Hagen WJH, Wan W, Briggs JAG. Implementation of a cryo-electron tomography tilt-scheme optimized for high resolution subtomogram averaging. Journal of Structural Biology. 2017;197:191–198. doi: 10.1016/j.jsb.2016.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hahsler M. Doran, dbscan: fast density-based clustering with R. Journal of Statistical Software. 2019;91:i01. doi: 10.18637/jss.v091.i01. [DOI] [Google Scholar]
  32. Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, Wieser E, Taylor J, Berg S, Smith NJ, Kern R, Picus M, Hoyer S, van Kerkwijk MH, Brett M, Haldane A, Del Río JF, Wiebe M, Peterson P, Gérard-Marchant P, Sheppard K, Reddy T, Weckesser W, Abbasi H, Gohlke C, Oliphant TE. Array programming with NumPy. Nature. 2020;585:357–362. doi: 10.1038/s41586-020-2649-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. He S, Scheres SHW. Helical reconstruction in RELION. Journal of Structural Biology. 2017;198:163–176. doi: 10.1016/j.jsb.2017.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Heymann JB, Belnap DM. Bsoft: image processing and molecular modeling for electron microscopy. Journal of Structural Biology. 2007;157:3–18. doi: 10.1016/j.jsb.2006.06.006. [DOI] [PubMed] [Google Scholar]
  35. Kaneko H, Blanc-Mathieu R, Endo H, Chaffron S, Delmont TO, Gaia M, Henry N, Hernández-Velázquez R, Nguyen CH, Mamitsuka H, Forterre P, Jaillon O, de Vargas C, Sullivan MB, Suttle CA, Guidi L, Ogata H. Eukaryotic virus composition can predict the efficiency of carbon export in the global ocean. IScience. 2021;24:102002. doi: 10.1016/j.isci.2020.102002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Klose T, Kuznetsov YG, Xiao C, Sun S, McPherson A, Rossmann MG. The three-dimensional structure of mimivirus. Intervirology. 2010;53:268–273. doi: 10.1159/000312911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Klose T, Herbst DA, Zhu H, Max JP, Kenttämaa HI, Rossmann MG. A mimivirus enzyme that participates in viral entry. Structure. 2015;23:1058–1065. doi: 10.1016/j.str.2015.03.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Kremer JR, Mastronarde DN, McIntosh JR. Computer visualization of three-dimensional image data using IMOD. Journal of Structural Biology. 1996;116:71–76. doi: 10.1006/jsbi.1996.0013. [DOI] [PubMed] [Google Scholar]
  39. Kuznetsov YG, Xiao C, Sun S, Raoult D, Rossmann M, McPherson A. Atomic force microscopy investigation of the giant mimivirus. Virology. 2010;404:127–137. doi: 10.1016/j.virol.2010.05.007. [DOI] [PubMed] [Google Scholar]
  40. Kuznetsov YG, Klose T, Rossmann M, McPherson A. Morphogenesis of mimivirus and its viral factories: an atomic force microscopy study of infected cells. Journal of Virology. 2013;87:11200–11213. doi: 10.1128/JVI.01372-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. La Scola B, Audic S, Robert C, Jungang L, de Lamballerie X, Drancourt M, Birtles R, Claverie JM, Raoult D. A giant virus in amoebae. Science. 2003;299:2033. doi: 10.1126/science.1081867. [DOI] [PubMed] [Google Scholar]
  42. Li S, Olson WK, Lu XJ. Web 3DNA 2.0 for the analysis, visualization, and modeling of 3D nucleic acid structures. Nucleic Acids Research. 2019;47:W26–W34. doi: 10.1093/nar/gkz394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Liebschner D, Afonine PV, Baker ML, Bunkóczi G, Chen VB, Croll TI, Hintze B, Hung LW, Jain S, McCoy AJ, Moriarty NW, Oeffner RD, Poon BK, Prisant MG, Read RJ, Richardson JS, Richardson DC, Sammito MD, Sobolev OV, Stockwell DH, Terwilliger TC, Urzhumtsev AG, Videau LL, Williams CJ, Adams PD. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallographica. Section D, Structural Biology. 2019;75:861–877. doi: 10.1107/S2059798319011471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Liu YT, Jih J, Dai X, Bi GQ, Zhou ZH. Cryo-EM structures of herpes simplex virus type 1 portal vertex and packaged genome. Nature. 2019;570:257–261. doi: 10.1038/s41586-019-1248-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Liu Y, Bisio H, Toner CM, Jeudy S, Philippe N, Zhou K, Bowerman S, White A, Edwards G, Abergel C, Luger K. Virus-encoded histone doublets are essential and form nucleosome-like structures. Cell. 2021;184:4237–4250. doi: 10.1016/j.cell.2021.06.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Malkin AJ, McPherson A, Gershon PD. Structure of intracellular mature vaccinia virus visualized by in situ atomic force microscopy. Journal of Virology. 2003;77:6332–6340. doi: 10.1128/jvi.77.11.6332-6340.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Mannor S, Jin X, Han J, Jin X, Han J, Jin X, Han J, Zhang X. K-Means Clustering. Springer; 2011. pp. 563–564. [Google Scholar]
  48. Mastronarde DN. Automated electron microscope tomography using robust prediction of specimen movements. Journal of Structural Biology. 2005;152:36–51. doi: 10.1016/j.jsb.2005.07.007. [DOI] [PubMed] [Google Scholar]
  49. Mishyna M, Volokh O, Danilova Y, Gerasimova N, Pechnikova E, Sokolova OS. Effects of radiation damage in studies of protein-DNA complexes by cryo-EM. Micron. 2017;96:57–64. doi: 10.1016/j.micron.2017.02.004. [DOI] [PubMed] [Google Scholar]
  50. Moniruzzaman M, Weinheimer AR, Martinez-Gutierrez CA, Aylward FO. Widespread endogenization of giant viruses shapes genomes of green algae. Nature. 2020;588:141–145. doi: 10.1038/s41586-020-2924-2. [DOI] [PubMed] [Google Scholar]
  51. Mutsafi Y, Fridmann-Sirkis Y, Milrot E, Hevroni L, Minsky A. Infection cycles of large DNA viruses: emerging themes and underlying questions. Virology. 2014;466–467:3–14. doi: 10.1016/j.virol.2014.05.037. [DOI] [PubMed] [Google Scholar]
  52. Notaro A, Couté Y, Belmudes L, Laugeri ME, Salis A, Damonte G, Molinaro A, Tonetti MG, Abergel C, De Castro C. Expanding the occurrence of polysaccharides to the viral world: the case of mimivirus. Angewandte Chemie. 2021;60:19897–19904. doi: 10.1002/anie.202106671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay É. Scikit-learn: machine learning in python. Journal of Machine Learning Research: JMLR. 2011;12:2825–2830. [Google Scholar]
  54. Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE. UCSF Chimera--A visualization system for exploratory research and analysis. Journal of Computational Chemistry. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
  55. Piacente F, De Castro C, Jeudy S, Gaglianone M, Laugieri ME, Notaro A, Salis A, Damonte G, Abergel C, Tonetti MG. The rare sugar N-acetylated viosamine is a major component of Mimivirus fibers. The Journal of Biological Chemistry. 2017;292:7385–7394. doi: 10.1074/jbc.M117.783217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Pintilie GD, Zhang J, Goddard TD, Chiu W, Gossard DC. Quantitative analysis of cryo-EM density map segmentation by watershed and scale-space filtering, and fitting of structures by alignment to regions. Journal of Structural Biology. 2010;170:427–438. doi: 10.1016/j.jsb.2010.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Ptchelkine D, Gillum A, Mochizuki T, Lucas-Staat S, Liu Y, Krupovic M, Phillips SEV, Prangishvili D, Huiskonen JT. Unique architecture of thermophilic archaeal virus APBV1 and its genome packaging. Nature Communications. 2017;8:1436. doi: 10.1038/s41467-017-01668-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Rao VB, Feiss M. Mechanisms of dna packaging by large double-stranded dna viruses. Annual Review of Virology. 2015;2:351–378. doi: 10.1146/annurev-virology-100114-055212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Raoult D, Audic S, Robert C, Abergel C, Renesto P, Ogata H, La Scola B, Suzan M, Claverie J-M. The 1.2-megabase genome sequence of Mimivirus. Science. 2004;306:1344–1350. doi: 10.1126/science.1101485. [DOI] [PubMed] [Google Scholar]
  60. Robinson PJJ, Fairall L, Huynh VAT, Rhodes D. EM measurements define the dimensions of the “30-nm” chromatin fiber: evidence for a compact, interdigitated structure. PNAS. 2006;103:6506–6511. doi: 10.1073/pnas.0601212103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Rohou A, Grigorieff N. CTFFIND4: Fast and accurate defocus estimation from electron micrographs. Journal of Structural Biology. 2015;192:216–221. doi: 10.1016/j.jsb.2015.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Sachse C. EMBL—european molecular biology laboratory, structural and computational biology unit, meyerhofstr. AIMS Biophysics. 2015;2:219–244. doi: 10.3934/biophy.2015.2.219. [DOI] [Google Scholar]
  63. Scheres SHW. RELION: implementation of a Bayesian approach to cryo-EM structure determination. Journal of Structural Biology. 2012;180:519–530. doi: 10.1016/j.jsb.2012.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Schrad JR, Abrahão JS, Cortines JR, Parent KN. Structural and proteomic characterization of the initiation of giant virus infection. Cell. 2020;181:1046–1061. doi: 10.1016/j.cell.2020.04.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Schulz F, Roux S, Paez-Espino D, Jungbluth S, Walsh DA, Denef VJ, McMahon KD, Konstantinidis KT, Eloe-Fadrosh EA, Kyrpides NC, Woyke T. Giant virus diversity and host interactions through global metagenomics. Nature. 2020;578:432–436. doi: 10.1038/s41586-020-1957-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Schwanhäusser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, Chen W, Selbach M. Global quantification of mammalian gene expression control. Nature. 2011;473:337–342. doi: 10.1038/nature10098. [DOI] [PubMed] [Google Scholar]
  67. Suhre K, Audic S, Claverie JM. Mimivirus gene promoters exhibit an unprecedented conservation among all eukaryotes. PNAS. 2005;102:14689–14693. doi: 10.1073/pnas.0506465102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Sun L, Zhang X, Gao S, Rao PA, Padilla-Sanchez V, Chen Z, Sun S, Xiang Y, Subramaniam S, Rao VB, Rossmann MG. Cryo-EM structure of the bacteriophage T4 portal protein assembly at near-atomic resolution. Nature Communications. 2015;6:7548. doi: 10.1038/ncomms8548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Suzan-Monti M, La Scola B, Barrassi L, Espinosa L, Raoult D. Ultrastructural characterization of the giant volcano-like virus factory of Acanthamoeba polyphaga Mimivirus. PLOS ONE. 2007;2:e328. doi: 10.1371/journal.pone.0000328. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Takemura M. Poxviruses and the origin of the eukaryotic nucleus. Journal of Molecular Evolution. 2001;52:419–425. doi: 10.1007/s002390010171. [DOI] [PubMed] [Google Scholar]
  71. Toriyama S. An rna-dependent rna polymerase associated with the filamentous nucleoproteins of rice stripe virus. Journal of General Virology. 1986;67:1247–1255. doi: 10.1099/0022-1317-67-7-1247. [DOI] [Google Scholar]
  72. Valencia-Sánchez MI, Abini-Agbomson S, Wang M, Lee R, Vasilyev N, Zhang J, De Ioannes P, La Scola B, Talbert P, Henikoff S, Nudler E, Erives A, Armache K-J. The structure of a virus-encoded nucleosome. Nature Structural & Molecular Biology. 2021;28:413–417. doi: 10.1038/s41594-021-00585-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Wang F, Baquero DP, Beltran LC, Su Z, Osinski T, Zheng W, Prangishvili D, Krupovic M, Egelman EH. Structures of filamentous viruses infecting hyperthermophilic archaea explain DNA stabilization in extreme environments. PNAS. 2020;117:19643–19652. doi: 10.1073/pnas.2011125117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research. 2018;46:W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Wu W, Thomas JA, Cheng N, Black LW, Steven AC. Bubblegrams reveal the inner body of bacteriophage φKZ. Science. 2012;335:182. doi: 10.1126/science.1214120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Xiao C, Kuznetsov YG, Sun S, Hafenstein SL, Kostyuchenko VA, Chipman PR, Suzan-Monti M, Raoult D, McPherson A, Rossmann MG. Structural studies of the giant mimivirus. PLOS Biology. 2009;7:e1000092. doi: 10.1371/journal.pbio.1000092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Zauberman N, Mutsafi Y, Halevy DB, Shimoni E, Klein E, Xiao C, Sun S, Minsky A. Distinct DNA exit and packaging portals in the virus Acanthamoeba polyphaga mimivirus. PLOS Biology. 2008;6:e114. doi: 10.1371/journal.pbio.0060114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Zheng SQ, Palovcak E, Armache JP, Verba KA, Cheng Y, Agard DA. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nature Methods. 2017;14:331–332. doi: 10.1038/nmeth.4193. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

Adam Frost 1

Giant dsDNA viruses, with genomes in excess of 1Mb encoding more than one thousand genes, were only recently discovered and their study offers new opportunities to probe life's mechanisms. Little is known how these "organisms" protect and organize their genomes. This fascinating study reveals a helical protein assembly comprised of oxidoreductase-family proteins, which assemble into multi-start helical fibers, with genomic DNA lining the lumen of the fiber.

Decision letter

Editor: Adam Frost1
Reviewed by: Adam Frost2, Jonatas S Abrahão3

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "The giant Mimivirus 1.2 Mb genome is elegantly organized into a 30 nm helical protein shield" for consideration by eLife. Your article has been reviewed by 4 peer reviewers, including Adam Frost as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by a Reviewing Editor and Sara Sawyer as the Senior Editor. The following individual involved in the review of your submission has agreed to reveal their identity: Jonatas S Abrahão (Reviewer #3).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

1. Several reviewers expressed uncertainty about the significance of the structural observations for our understanding of viral behavior. Critically, one reviewer noted: "In light of the presented results, it is reasonable to assume that GMC-type oxidoreductases of the mimivirus are very important for the formation of functional virions. However, in a previous study (PMID: 21646533), it has been shown that the genes encoding GMC-type oxidoreductases can be deleted from the virus genome (M4 mutant) without the loss of infectivity. The M4 virions were devoid of the external fibers decorating the icosahedral capsid, but the genome was still packaged. How do the authors reconcile these results with those presented in the present manuscript? This should be addressed in the Discussion section." Another review notes that the answer to this question should address "previously published data on proteomics of viral factories and transcriptomics of mimivirus: is there any temporal association between GMC-type oxidoreductases peak of expression and genome replication during the viral cycle? what about RNA pol subunits? Are all those proteins highly expressed during the late cycle? or [do] they reach the peak concomitantly with genome replication?" Another reviewer noted "The presented data do not [enable us] to estimate the amount of [the] mimivirus genome organized into 30 nm diameter filaments. Hence, the title of the paper is [overreaching]" and "The filamentous structures [result from] an extremely harsh treatment of the virus particle, which starts with a 1.5 hour-long incubation at pH 2. Do the filaments actually exist inside the virus particle as the title of the paper implies? Or [might] these filaments [form during] host take over? Or [perhaps] these filaments [result from a harsh in vitro treatment] and have nothing to do with either?" Addressing these comments is essential.

2. The reviewers requested a more comprehensive description of the GMC oxidoreductase paralogs and the properties of the helical coat. One reviewer noted "The authors describe the interactions between the monomers in the dimer of qu_946 as well as between qu_946 and DNA. I would also like to see a brief description of protein-protein interactions between subunits within the same helical strand as well as between helical strands, which hold the whole assembly together (i.e., what are the contacts between green subunits as well as between green and yellow subunits shown in Figure 2C). The authors suggest that the shell "would guide the folding of the dsDNA strands into the structure" (L310). To support this statement, the authors could show the lumen of the fiber rendered by electrostatic potential." And multiple reviewers requested a more detailed description of whether the helically averaged density map is simply unable to distinguish between qu_946 and qu_143, including a description of the amino acid percent identity versus similarity, especially for the contact-forming surface residues that govern the protein-protein and protein-nucleic acid contacts?

Another reviewer noted "Please provide some background information on the distribution of GMC-type oxidoreductases in other families of giant viruses, so that it is clearer whether the described packaging mechanism is specific to mimiviruses or is more widespread." A reviewer wrote concerning the properties of the coat "[the] authors could better explain why we only see 20 kb fragments in the gel, including in the control (in Figure S2)." Another reviewer wrote "Equally important, what is going on with the N-terminal 50-residue domain of qu_946? Is there a space for it in the cryoEM map? Is it disordered?" Another wrote, "Rough estimation of genome compaction to fit into the nucleoid" section. Even though these back-of-the-envelope-type calculations appear to be reasonable, the last sentence "As a result, the Mimivirus genome is probably organized…" throws a monkey wrench into all of this. I see no resemblance in the organizations of the APBV1 genome and the DNA in mimivirus fibers. Finally, an EM reviewer wrote "I am not yet convinced the authors have resolved the putative disulfide bridge between protomers. Please make a figure of the density around this bond at a more stringent map threshold so that the reader can appreciate the strength of the EM evidence for the disulfide." Addressing these comments is essential.

There were also less critical but still valuable requests to consider:

3. Several reviewers were unsure how to think about the "balls of yarn." One reviewer wrote "slightly puzzled by the observed "ball of yarn". It is hard for me to imagine that a cylindrical container/fiber-containing continuous dsDNA genome could be bent or fragmented into bundles because this would break the protein-protein interactions holding the fiber together. In Figures 1C and S1, are these parts of the same fiber or multiple fibers coming out of one capsid? Related to this question – is there evidence (e.g., from qPCR) that mimivirus carries a single copy of genomic dsDNA per capsid?" Another reviewer wrote "The "ball of yarn" analogy is nice, but Figure 1C shows several fibers unconnected (free) in one of their ends. I am wondering if it means that the genomic fiber is not a long-single structure covering the whole genome, but a bunch [of] several independent helical structures covering the whole genome and attached in such [a] "ball of yarn". Like several threads connected. Could [the] authors clarify?" Another reviewer wrote "wondering if authors attempted to get [scanning or transmission] electron microscopy images of mimivirus with exposed ball(s) of yarn?"

4. The methods would benefit from more detailed descriptions. Multiple reviewers agree that the methods section on focused refinements for both the dimer of oxidoreductases and the DNA within needs much more explanation for other groups to reproduce. Please enhance the methods description and consider including example scripts. Also, please include the output from Helixplorer (http://rico.ibs.fr/helixplorer/).

5. The reviewers suggest the following modifications to which and where certain figures are included. At least two EM reviewers agree that "The bubblegram analysis is not very convincing. The bubbles appear to correlate with the length or thickness of the structure – the long or overlapped structures form bubbles. The bubbles may not be due to the presence of DNA." and would favor minimizing arguments that depend on these data. Another reviewer wrote "Panels C and D of Figure 4 are too speculative to be included in the main text. Panels A and B are fascinating and pose a hypothesis worthy of further investigation, but please omit panels C and D." This request was modified with a great suggestion by another reviewer "All images of the putative RNA polymerases should be boxed out from the panel A (from the whole photograph, that is) and a collage of these boxes should be shown on the same scale as the Poxvirus RNAP. The latter should be shown in several orientations. Perhaps, projection views instead of the molecular surface (which is shown in Figure 4C) would match the negatively stained images better (but maybe not). Panel 4B should be shown on the same scale as panel A and the putative RNAP can be placed (in color) into the channel of the fiber."

6. There were a number of suggestions to improve the readability and generality of the text.

– L140: "single DNA strand" but the authors probably mean single molecule of dsDNA or single double-helix. Please revise to avoid ambiguity.

– L178: Please soften "must be" – no evidence is presented that any of the mimivirus proteases are involved in the processing of oxidoreductases. The involvement of cellular proteases cannot be a priori excluded.

– L44 Please consider the flexibility of "giant virus" concept. The term was first used in 1999 to refer to Paramecium bursaria chlorella virus, which is substantially smaller than mimivirus. As the term was inaugurated in the context of chloroviruses, would be friendly if the authors state that "Giant viruses OF AMOEBAS were discovered with the isolation of mimivirus".

– Mimivirus or mimivirus? Please see https://talk.ictvonline.org/information/w/faq/386/how-to-write-virus-species-and-other-taxa-names.

– Figure S12: fiber or fibre?

– Medusavirus genome harbored genes for all five types of histones (H1, H2A, H2B, H3, and H4). Please consider adding this to the discussion topic.

L. 303-304. APBV1 capsid structure appears strikingly similar to the Mimivirus genomic fiber but Mimivirus… I am sorry but I do not see any resemblance.

L. 334-338. This philosophical excursion might be correct, but I do not see any data supporting these hypotheses.

The grammar needs to be corrected in many places starting with the title " The giant Mimivirus 1.2 Mb genome is elegantly organized into a 30 nm-DIAMETER helical protein shield".

L. 79. "… resulted in the observation…" <- simplify by removing unnecessary words.

L. 80. "The used capsid opening procedure involves…" <- remove "used".

L. 83-84. "a same" -> THE same.

Reviewer #1 (Recommendations for the authors):

1. The methods section on focused refinements for both the dimer of oxidoreductases and the DNA within needs much more explanation for other groups to reproduce. Please enhance the methods description and consider including example scripts.

2. I am not yet convinced the authors have resolved the putative disulfide bridge between protomers. Please make a figure of the density around this bond at a more stringent map threshold so that the reader can appreciate the strength of the EM evidence for the disulfide.

3. Panels C and D of Figure 4 are too speculative to be included in the main text. Panels A and B are fascinating and pose a hypothesis worthy of further investigation, but please omit panels C and D.

4. Please include the outputs Helixplorer (http://rico.ibs.fr/helixplorer/)

Reviewer #2 (Recommendations for the authors):

L140: "single DNA strand" but the authors probably mean single molecule of dsDNA or single double-helix. Please revise to avoid ambiguity.

L178: Please soften "must be" – no evidence is presented that any of the mimivirus proteases are involved in the processing of oxidoreductases. The involvement of cellular proteases cannot be a priori excluded.

Reviewer #3 (Recommendations for the authors):

– In addition to information that is in public preview, I recommend that authors consider the flexibility of "giant virus" concept. The term was first used in 1999 to refer to Paramecium bursaria chlorella virus, which is substantially smaller than mimivirus. As the term was inaugurated in the context of chloroviruses, would be friendly if the authors state that "Giant viruses OF AMOEBAS were discovered with the isolation of mimivirus" (eg line 44);

– Mimivirus or mimivirus? please see https://talk.ictvonline.org/information/w/faq/386/how-to-write-virus-species-and-other-taxa-names.

– Figure S12: fiber or fibre?

– Please consider promoting Figure S5 as Figure 5.

– I am wondering if the authors attempted to get scanning electron microscopy images of mimivirus with an exposed ball of yarn. Sounds good.

– Medusavirus genome harbored genes for all five types of histones (H1, H2A, H2B, H3, and H4). Please consider adding this to the discussion topic.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "The giant mimivirus 1.2 Mb genome is elegantly organized into a 30 nm diameter helical protein shield" for further consideration by eLife. Your revised article has been evaluated by Sara Sawyer (Senior Editor) and Reviewing Editor Adam Frost. The manuscript has been improved but there are some remaining issues that need to be addressed, as outlined below:

First, we find the uncertainty and controversy around the role and identification of the GMC oxidoreductases confusing. Please upload the cryo-EM maps and fitted atomic coordinates for the reviewers. Second, while we understand your decision to describe the genetic system and the genomic fibers of the M4 mutant in a separate publication, we feel that an additional explanation regarding the M4 mutant is necessary for this manuscript. The observation that the genomic fibers are constructed from different proteins in other mimiviruses, and potentially in the M4 mutant which lacks qu_946 and qu_143 poses quite a puzzle and raises concerns about the validity of the structural assignment.

eLife. 2022 Jul 28;11:e77607. doi: 10.7554/eLife.77607.sa2

Author response


Essential revisions (for the authors):

1. Several reviewers expressed uncertainty about the significance of the structural observations for our understanding of viral behavior. Critically, one reviewer noted: "In light of the presented results, it is reasonable to assume that GMC-type oxidoreductases of the mimivirus are very important for the formation of functional virions. However, in a previous study (PMID: 21646533), it has been shown that the genes encoding GMC-type oxidoreductases can be deleted from the virus genome (M4 mutant) without the loss of infectivity. The M4 virions were devoid of the external fibers decorating the icosahedral capsid, but the genome was still packaged. How do the authors reconcile these results with those presented in the present manuscript? This should be addressed in the Discussion section.”

These reviewers are totally right and this point also bothered us. We thus managed to extract the genomic fiber of M4 (the isolate without GMC oxidoreductases). The fiber also has a rodshaped structure but protein composition analysis of the purified fiber shows that different proteins are involved in its assembly.

We do not think GMC-oxidoreductases are essential. The packaging machinery appears to be able to use a variety of the most abundant proteins available at the stage of genome packaging. This corresponds to the GMC-oxidoreductases for mimivirus but is another protein in the case of M4. Follow up studies we are currently conducting clearly show the GMC-oxidoreductase is not essential for the genomic fiber formation.

The focus of the current manuscript is to report the structure of mimivirus genome organization in the genomic fiber, which is already a first piece of work. We consider the study of M4 genomic fiber, and the testing of the hypotheses this finding raises, as follow up studies. We would prefer not to discuss this in the current manuscript and hope the reviewers and the editor will believe our preliminary data and will agree on keeping this information for a subsequent publication.

Another review notes that the answer to this question should address "previously published data on proteomics of viral factories and transcriptomics of mimivirus: is there any temporal association between GMC-type oxidoreductases peak of expression and genome replication during the viral cycle? what about RNA pol subunits? Are all those proteins highly expressed during the late cycle? or [do] they reach the peak concomitantly with genome replication?"

The transcriptomic data of mimivirirus prototype infectious cycle have been published and corresponding data are publicly available under this link: http://www.igs.cnrsmrs.fr/mimivirus/ (clicking on mimivirus Genome Browser). They report both 454 and Solid data.

Author response image 1 shows the table of the genes with the keyword “Polymerase” in the answers to the reviewers for Mimivirus prototype (pink) and Mimivirus reunion (blue) to help them identify the equivalence between the different subunits.

Author response image 1.

Author response image 1.

The GMC-oxidoreductase genes are expressed after DNA replication, at the time of virion assembly, starting at 5h Post Infection (PI) until the end of the cycle. For reference, the DNA polymerase starts being expressed after 1h until the end, with a peak at 3 hours PI. The RNA polymerase subunits start also being expressed after 1 h PI until the end of the cycle. The RNA polymerase subunits are loaded in the virions as well as the mRNA maturation machinery (PolyA polymerase, mRNA capping enzyme, etc…) (supplementary file 2B).We now added a sentence in the main text reporting the late expression of the GM-Coxidoreductases and the fact that they start being expressed after DNA replication (p5, main text). In the section “Additional proteins, including RNA polymerase subunits, are enriched in the genomic fiber” we also provide a sentence on RNA polymerase subunits and GM-Coxidoreductases, kinesin, regulator of chromosome condensation expression, all expressed until the end of the cycle (main text, p9).

Another reviewer noted "The presented data do not [enable us] to estimate the amount of [the] mimivirus genome organized into 30 nm diameter filaments.

We think that the entire genome can only be packaged in the capsid through its assembly within the protein shell. We also think the genomic fiber is progressively built on the genomic DNA while it progresses into the capsid, most likely by an energy driven packaging machinery. This process can be compared to bacterial pili assembly, except that pili are built on the surface of the cell, while the genomic fiber is built into a compartment, the nucleoid, forcing it to fold into this compartment. This is only possible due to the high flexibility of the genomic fiber. So, the entire genome corresponds to ~40 µm of genomic fiber, which when packaged as a ball can entirely fit into the nucleoid (~350 nm diameter).

The organization of the genome in a large tubular structure and its folding inside the nucleoid compartment has been previously reported by AFM studies of the mimivirus particles (Kuznetsov, Y. G. et al. Virology 2010; Kuznetsov YG et al. J. Virol. 2013, Figure 15), which the authors refer to as “highly condensed nucleoprotein masses about 350 nm in diameter within the inner membrane sacs of virions”, with the presence of tubular structure they refer to as “thick cables of the nucleic acid”.

We believe the Reviewers should think in terms of packaging. The folded genome is packaged through two lipid membranes (the one lining the capsid interior and the one in the nucleoid), concomitantly with its wrapping by the protein shell ribbon. Thus, there is plenty of space in the nucleoid at the beginning of the packaging and the genomic fiber is gently folded inside. But as more genome needs to be packaged, this compresses the flexible fiber into the nucleoid until it is totally encased in the nucleoid and that also defines the size of the nucleoid in the icosahedral capsid. This tight packaging is exemplified in Figure 1A for instance or in the AFM images of the nucleoid in Kuznetsov, Y. G. et al. Virology 2010; Kuznetsov YG et al. J. Virol. 2013, Figure 15.

Hence, the title of the paper is [overreaching]" and "The filamentous structures [result from] an extremely harsh treatment of the virus particle, which starts with a 1.5 hour-long incubation at pH 2.

There is a misunderstanding here. The 1 h incubation at 30°C and pH 2 was only applied to recover the nucleoids (see material and method section “Nucleoid extraction”) presented in Figure 1—figure supplement 1A. Acidic treatment was never applied to produce the genomic fiber as we noticed it is sensitive to temperature and acidic pH. All steps of the extraction protocol were performed at pH 7.5 (section: “Extraction and purification of the mimivirus genomic fiber”). We must emphasize that the release of the genomic fiber can be seen at the very first step of the extraction protocol (protease treatment). The sample was also controlled at each step of the protocol by negative staining TEM to assess the status of the genomic fiber. We had to optimize the protocol as using a too soft proteolytic treatment led to too few opened particles but with mostly a compact genomic fiber released, if it was too harsh, all particles were opened but the genomic fiber was mostly in the ribbon state. We had to compromise to get a decent amount of compact and relaxing structures to be able to perform the present work. We would like to stress out that we could reproducibly obtain the genomic fiber from many preparations and that we could observe them with different virions (including M4), even using different protocols (only the one with the better yield is reported in the manuscript).

Do the filaments actually exist inside the virus particle as the title of the paper implies?

In the Figure 1B the genomic fiber can be seen inside a virion and is still encased in the membrane compartment. These structures were not reported in previous cryo-EM analyses of the virions. As said above, they were only reported by AFM studies of the mimivirus particles (Kuznetsov, Y. G. et al. Virology 2010; Kuznetsov YG et al. J. Virol. 2013, Figure 15).

Or [might] these filaments [form during] host take over? Or [perhaps] these filaments [result from a harsh in vitro treatment] and have nothing to do with either?"

I personally have difficulties to imagine that such a complex structure could be the result of an artefact due to the treatment for several reasons:

  1. It is unlikely that by simply putting the GMC-oxidoreductases with DNA would result in a helical structure where the DNA is folded 5 times and internally lining the protein shell (Figure 3-animation 1 of one tomogram). It would be like crystallizing the proteins (in a heterogeneous sample) onto the folded DNA to form a helix with a hollow lumen. The crystallographic data obtained by others by on the mimivirus GMC-oxidoreductase did not produce tubular structures either and they reported 3 crystal forms. They overexpressed the proteins in E. coli and did not report such structures bound to DNA either.

  2. Given the presence of compact and relaxed forms, once relaxed the helix cannot go back to a compact state passively by simply rewinding suggesting the relaxed forms are the result of decompaction of a constrained structure. This is also supported by the loss of DNA in the relaxed state Cl3. Last steps of unfolding correspond to the loss of strands of the ribbon, one after the other.

  3. The contacts between chains intra and inter strand are also scarce supporting an active assembly of the structure. We now provide an additional supplementary file 3 with the different contacts for the different states of the genomic fiber. There are very few interstart contacts.

2. The reviewers requested a more comprehensive description of the GMC oxidoreductase paralogs and the properties of the helical coat. One reviewer noted "The authors describe the interactions between the monomers in the dimer of qu_946 as well as between qu_946 and DNA. I would also like to see a brief description of protein-protein interactions between subunits within the same helical strand as well as between helical strands, which hold the whole assembly together (i.e., what are the contacts between green subunits as well as between green and yellow subunits shown in Figure 2C).

A new Table (Supplementary file 3) has been provided with contacting residues intra and inter strands and their conservation. We also changed Cl2 by Cl1a maps in Figure 3F to show the qu_143 contacts with DNA in the same map as for qu_946 (Figure 3E). Finally, to comply with another reviewer’s question these contacting residues are shown for both qu_946 and qu_143 in the Cl1a, Cl2 and Cl3a maps. Conserved and divergent residues are highlighted in green and red, respectively, in Supplementary file 3. Inter- and intra-strands contacts are clearly loosening between the two compact Cl1a and Cl2 structures, compared to the relaxed Cl3, supporting the unfolding process and the use of an energy driven machinery to build and package the genomic fiber into the capsids.

The authors suggest that the shell "would guide the folding of the dsDNA strands into the structure" (L310). To support this statement, the authors could show the lumen of the fiber rendered by electrostatic potential." And multiple reviewers requested a more detailed description of whether the helically averaged density map is simply unable to distinguish between qu_946 and qu_143, including a description of the amino acid percent identity versus similarity, especially for the contact-forming surface residues that govern the protein-protein and protein-nucleic acid contacts?

We think this is an excellent suggestion and we thus replaced in Figure 3 previous panels by the corresponding electrostatic views of the protein shell with the DNA (Figure 3C-D). We believe it is interesting to note that the electrostatics are different between qu_946 or qu_143, showing that despite the high level of identity (69%) between the two proteins, there are differences and that this does not impact the packaging of the DNA. Along with the finding that M4 use a different protein to package its DNA, we believe this supports even further an energy driven packaging machinery.

We now added a sentence in the main text p5:” The two mimivirus reunion proteins share 69% identity (81% similarity).”

Another reviewer noted "Please provide some background information on the distribution of GMC-type oxidoreductases in other families of giant viruses, so that it is clearer whether the described packaging mechanism is specific to mimiviruses or is more widespread."

If reviewers still think it would be useful, we can provide a multiple alignment of the GM-Coxidoreductases of prototype members of the different clades of the family as a supplementary figure.

This is a central point, also linked to the question about M4. In fact, like the reviewers, we initially assumed that the GMC-oxidoreductases were essential. Now, we believe it might be premature to assume that GMC-type oxidoreductases are the only type of proteins that can be involved in the scaffolding of the Mimiviridae genomic fiber.

Author response image 2.

Author response image 2.

A reviewer wrote concerning the properties of the coat "[the] authors could better explain why we only see 20 kb fragments in the gel, including in the control (in Figure S2)."

Figure 1—figure supplement 3 (old Figure S2) corresponds to a regular 1% agarose gel and not to a PFGE gel. This gel was simply to show there is DNA associated with the genomic fiber and not to show the size of the DNA as the genomic fiber has been broken into pieces by the extraction protocol, pipetted several times before being loaded on the gel and we thus did not expect to have very high molecular weight. I must point out that when extracting the DNA form Mimivirus capsids using standard kits and pipetting, it also migrates at the top of the gel (Lane 1 in Figure 1—figure supplement 3) while it would likely appear as a smear above 20 kb on a PFGE. By contrast when the viral particles are put into plugs prior lysis, the genomic DNA migrates at the proper size, as shown in the publication from Boyer et al. 2011 (reference 31), showing the genome of Mimivirus is a linear genome migrating around 1.37 Mb (Figure 1, Panel B, Lane M1).

Another reviewer wrote "Equally important, what is going on with the N-terminal 50-residue domain of qu_946? Is there a space for it in the cryoEM map? Is it disordered?"

The N-terminal domain is only present in the fibrils decorating the capsids.

As illustrated in Figure 2—figure supplement 4, when analyzed by MS-based proteomics, the comparison of the peptide coverage of the GMC-oxidoreductases depending if they compose the fibrils or the genomic fiber is not the same. The N-terminal domain is clearly covered when the fibrils or intact virions are analyzed and not covered when the analysis is performed on the purified genomic fiber. That is why we propose this N-terminal domain could be an addressing signal (see main text) and that a protease could be cleaving it prior to genomic fiber assembly.

Main text: “The proteomic analyses provided different sequence coverages for the GMC-oxidoreductases depending on whether samples were intact virions or purified genomic fiber preparations, with substantial under-representation of the N-terminal domain in the genomic fiber (Figure 2—figure supplement 4). Accordingly, the maturation of the GMC-oxidoreductases involved in genome packaging must be mediated by one of the many proteases encoded by the virus or the host cell.”

Indeed, there is no space to accommodate this domain as it would prevent the interaction between the protein shell and the DNA or induce an increase of the genomic fiber diameter that would be too big to be accommodated into the nucleoid.

Another wrote, "Rough estimation of genome compaction to fit into the nucleoid" section. Even though these back-of-the-envelope-type calculations appear to be reasonable, the last sentence "As a result, the Mimivirus genome is probably organized…" throws a monkey wrench into all of this. I see no resemblance in the organizations of the APBV1 genome and the DNA in mimivirus fibers.

The APBV1 nucleocapsid is also a 30 nm diameter helical structure made of proteins with the dsDNA genome lining the interior of the shell. The genomic fiber also looks like a nucleocapsid, but the DNA is circular for APBV1 and linear for mimivirus. The most important difference being that the mimivirus genomic fiber is actually folded in the nucleoid, itself encased in the icosahedral capsid, while APBV1 nucleocapsid is the virion. If this reviewer thinks this is confusing, we could remove the reference to this archaeal virus. We tried to clarify by proposing the following sentence: “As a result, the complete mimivirus genome folded at least 5 times fits into the helical shell. This structure surprisingly resembles a nucleocapsid, such as the archaea infecting APBV1 nucleocapsid.”

Finally, an EM reviewer wrote "I am not yet convinced the authors have resolved the putative disulfide bridge between protomers. Please make a figure of the density around this bond at a more stringent map threshold so that the reader can appreciate the strength of the EM evidence for the disulfide."

We agree with this reviewer that the density does not permit to totally confidently conclude that there is a disulfide bridge. Our rational was as follow: building additional residues into the uninterpreted density brought the cysteines of the two chains close enough to allow a disulfide bridge and fully filled the available density. Clearly in the fully relaxed structure the density is weaker, suggesting there is some additional disorder at this location that led us to think that a disulfide bridge break could fragilize the helix and initiate its decompaction. In Author response image 3A (zoomed in C) is the Cl1a compact map at the same threshold than in the relaxed Cl3 in Author response image 3B (zoomed in D). If the reviewers decide this is convincing, we can add this figure as a supplementary figure.

Author response image 3.

Author response image 3.

We finally thought that since the helical structure is in a decompaction process, the density at that location is an average between locked and unlocked structures explaining why it was less defined than the rest of the structure and thus absent of the focused refined map. In the main text we only suggest a disulfide bridge break could be involved in the unwinding process. We now added “could” in the Figure 3 legend p7: “The isosurface threshold chosen allows visualization of density for the manually built N-terminal residues, including terminal cysteines (stick model), of two neighboring monomers that could form a terminal disulfide bridge.”

There were also less critical but still valuable requests to consider:

3. Several reviewers were unsure how to think about the "balls of yarn." One reviewer wrote "slightly puzzled by the observed "ball of yarn". It is hard for me to imagine that a cylindrical container/fiber-containing continuous dsDNA genome could be bent or fragmented into bundles because this would break the protein-protein interactions holding the fiber together. In Figures 1C and S1, are these parts of the same fiber or multiple fibers coming out of one capsid?

It is the same fiber but the treatments are not exactly the same to produce the relaxing fiber coming out of the capsids and the “ball of yarn” structure, for the latter, there are clearly breakages that give the impression of multiple fibers. The genomic fiber is highly flexible and rolled up in the nucleoid.

In the enclosed cryo-EM micrograph of the expelling virions (Author response image 4) the capsids were opened by a simple proteinase K treatment with 1 mM DTT. The sample was frozen and observed immediately after treatment.

Author response image 4.

Author response image 4.

There is no scale bar but the capsids are roughly half a micrometer diameter. Reviewers can also recognize the nucleoid inside most capsids, but with different sizes reflecting the release process progress. The flexibility of the fiber is also visible.

In the NS-TEM image (Author response image 5), the Reviewers can also see how long and flexible the genomic fiber can be with no breaks.

Author response image 5.

Author response image 5.

Related to this question – is there evidence (e.g., from qPCR) that mimivirus carries a single copy of genomic dsDNA per capsid?"

It would be possible to accommodate many copies of Mimivirus genome as naked DNA but only one copy can be fitted into the helical structure as ~40 µm are needed to fit the folded genome. These 40 µm also correspond to the volume of the nucleoid.

Another reviewer wrote "The "ball of yarn" analogy is nice, but Figure 1C shows several fibers unconnected (free) in one of their ends. I am wondering if it means that the genomic fiber is not a long-single structure covering the whole genome, but a bunch [of] several independent helical structures covering the whole genome and attached in such [a] "ball of yarn". Like several threads connected. Could [the] authors clarify?"

At that stage the genomic fiber has been broken due to the multiple steps of extraction, enrichment and purification, which does not happen in vivo. As can be seen in the Cryo-EM picture above, with only one step of treatment, it appears intact and is not broken into fragments.

Another reviewer wrote "wondering if authors attempted to get [scanning or transmission] electron microscopy images of mimivirus with exposed ball(s) of yarn?"

This is a good suggestion but we did not. They are globally 350 nm diameter, as for the nucleoid. The point of this manuscript was to report the structure of the mimivirus genome organization into this helical structure as well as its composition.

4. The methods would benefit from more detailed descriptions. Multiple reviewers agree that the methods section on focused refinements for both the dimer of oxidoreductases and the DNA within needs much more explanation for other groups to reproduce. Please enhance the methods description and consider including example scripts. Also, please include the output from Helixplorer (http://rico.ibs.fr/helixplorer/).

We now expanded the focused refinement procedure in the Material and method section and enclose the output of Helixplorer as supplementary figure (Figure 1—figure supplement 6, C-E).

We also prepared a Relion project directory recapitulating the different steps applied to perform the focused refinement of the asymmetric unit (oxidoreductase dimer). This project directory was made after cleaning (directory filled by different jobs) and renumbering the initial project directory. As we recovered the motion corrected micrographs from the Hamburg platform, the “Import” task of the raw micrographs is not available, but a corresponding task was added to the project directory in order to summarize and have coherence for all steps of the procedure. The different “.mrc” or “.mrcs” files were replaced by empties “.zip” to reduce the disk usage as they are not needed to summarize the processes applied. The directory can be found under this link: https://src.koda.cnrs.fr/igs/genfiber_cl1a_focusrefine_relion_pipeline

5. The reviewers suggest the following modifications to which and where certain figures are included. At least two EM reviewers agree that "The bubblegram analysis is not very convincing. The bubbles appear to correlate with the length or thickness of the structure – the long or overlapped structures form bubbles. The bubbles may not be due to the presence of DNA." and would favor minimizing arguments that depend on these data.

The point is, as demonstrated by our structural studies, that the relaxed structure lost the DNA. This is why bubble cannot be seen in the relaxed broken fibers. On long fibers still in compact form, the DNA is visible in the structure and bubble can be seen. Yet the evidence for the presence of DNA in the structure is also provided by the agarose gel of the purified genomic fiber and the cryo-EM structures. Bubblegrams were just one additional analysis.

Another reviewer wrote "Panels C and D of Figure 4 are too speculative to be included in the main text. Panels A and B are fascinating and pose a hypothesis worthy of further investigation, but please omit panels C and D." This request was modified with a great suggestion by another reviewer "All images of the putative RNA polymerases should be boxed out from the panel A (from the whole photograph, that is) and a collage of these boxes should be shown on the same scale as the Poxvirus RNAP. The latter should be shown in several orientations. Perhaps, projection views instead of the molecular surface (which is shown in Figure 4C) would match the negatively stained images better (but maybe not). Panel 4B should be shown on the same scale as panel A and the putative RNAP can be placed (in color) into the channel of the fiber."

We now changed Figure 4 accordingly and increased the size of Figure 4B. We now present projections of six orientations of the vaccinia virus RNAP corresponding to the best extracted boxes. We must mention that negative staining imaging may dehydrate the objects and change macromolecules volumes. This is also mentioned in Figure 4 legend.

Concerning the last sentence: “Panel 4B should be shown on the same scale as panel A and the putative RNAP can be placed (in color) into the channel of the fiber." We did change the scale of Panel 4B but it is unclear to us if we can leave the colored RNAP fitted in the channel?

6. There were a number of suggestions to improve the readability and generality of the text.

– L140: "single DNA strand" but the authors probably mean single molecule of dsDNA or single double-helix. Please revise to avoid ambiguity.

We modified the text accordingly:

“The dsDNA strands appear as curved cylinders in the helical structure, the characteristic shape of the DNA (minor and major groove) becoming only visible after focused refinement of a single strand of dsDNA (Figure 3, Figure 3—figure supplement 1-2).”

– L178: Please soften "must be" – no evidence is presented that any of the mimivirus proteases are involved in the processing of oxidoreductases. The involvement of cellular proteases cannot be a priori excluded.

We now changed the sentence to: “Accordingly, the maturation of the GMC-oxidoreductases involved in genome packaging must be mediated by one of the many proteases encoded by the virus or the host cell.”

– L44 Please consider the flexibility of "giant virus" concept. The term was first used in 1999 to refer to Paramecium bursaria chlorella virus, which is substantially smaller than mimivirus. As the term was inaugurated in the context of chloroviruses, would be friendly if the authors state that "Giant viruses OF AMOEBAS were discovered with the isolation of mimivirus".

This is true and it is now modified in the introduction section.

– Mimivirus or mimivirus? Please see https://talk.ictvonline.org/information/w/faq/386/how-to-write-virus-species-and-other-taxa-names.

We changed Mimivirus by mimivirus all along the manuscript to comply with ICTV rule, as requested by this reviewer.

– Figure S12: fiber or fibre?

This has now been corrected.

– Medusavirus genome harbored genes for all five types of histones (H1, H2A, H2B, H3, and H4). Please consider adding this to the discussion topic.

We did not talk about Medusavirus because it was not proved these histones could organize into nucleosomes.

L. 303-304. APBV1 capsid structure appears strikingly similar to the Mimivirus genomic fiber but Mimivirus… I am sorry but I do not see any resemblance.

The APBV1 nucleocapsid is also a 30 nm diameter helical structure made of proteins with the dsDNA genome lining the interior of the shell. The genomic fiber also looks like a nucleocapsid, but the DNA is circular for APBV1 and linear for mimivirus. The most important difference being that the mimivirus genomic fiber is actually folded in the nucleoid, itself encased in the icosahedral capsid, while APBV1 nucleocapsid is the virion. If this reviewer thinks this is confusing, we could remove the reference to this archaeal virus.

However, to clarify this point we change the sentence in the Result section to:

“As a result, the complete mimivirus genome, folded at least 5 times, fits into the helical shell. This structure surprisingly resembles a nucleocapsid, such as the archaea infecting APBV1 nucleocapsid.”

And in the Discussion section to:

“Yet, APBV1 nucleocapsid structure strikingly resemble the mimivirus genomic fiber with a proteinacious shell enclosing the folded dsDNA genome. Consequently, mimivirus genomic fiber is a nucleocapsid further bundled as a ball of yarn into the nucleoid, itself encased in the large icosahedral capsids.”

L. 334-338. This philosophical excursion might be correct, but I do not see any data supporting these hypotheses.

We now removed this paragraph.

The grammar needs to be corrected in many places starting with the title " The giant Mimivirus 1.2 Mb genome is elegantly organized into a 30 nm-DIAMETER helical protein shield".

L. 79. "… resulted in the observation…" <- simplify by removing unnecessary words.

We replace it by: “produced bundled fibers

L. 80. "The used capsid opening procedure involves…" <- remove "used".

Done

L. 83-84. "a same" -> THE same.

This is now corrected.

Reviewer #1 (Recommendations for the authors):

1. The methods section on focused refinements for both the dimer of oxidoreductases and the DNA within needs much more explanation for other groups to reproduce. Please enhance the methods description and consider including example scripts.

2. I am not yet convinced the authors have resolved the putative disulfide bridge between protomers. Please make a figure of the density around this bond at a more stringent map threshold so that the reader can appreciate the strength of the EM evidence for the disulfide.

3. Panels C and D of Figure 4 are too speculative to be included in the main text. Panels A and B are fascinating and pose a hypothesis worthy of further investigation, but please omit panels C and D.

4. Please include the outputs Helixplorer (http://rico.ibs.fr/helixplorer/)

All this reviewer comments have been addressed and answered above.

Helixplorer output are now provided in the supplementary material (Figure 1—figure supplement 6C-E).

Reviewer #2 (Recommendations for the authors):

L140: "single DNA strand" but the authors probably mean single molecule of dsDNA or single double-helix. Please revise to avoid ambiguity.

Thanks for pointing this out, this is now clarified.

L178: Please soften "must be" – no evidence is presented that any of the mimivirus proteases are involved in the processing of oxidoreductases. The involvement of cellular proteases cannot be a priori excluded.

The sentence has been modified accordingly.

Reviewer #3 (Recommendations for the authors):

– In addition to information that is in public preview, I recommend that authors consider the flexibility of "giant virus" concept. The term was first used in 1999 to refer to Paramecium bursaria chlorella virus, which is substantially smaller than mimivirus. As the term was inaugurated in the context of chloroviruses, would be friendly if the authors state that "Giant viruses OF AMOEBAS were discovered with the isolation of mimivirus" (eg line 44);

This is totally true and this has now been corrected.

– Mimivirus or mimivirus? please see https://talk.ictvonline.org/information/w/faq/386/how-to-write-virus-species-and-other-taxa-names.

This reviewer is right but it is always difficult to change old habits! This has now been changed all along the manuscript.

– Figure S12: fiber or fibre?

Thank you for pointing this out, it is now corrected. –

– Please consider promoting Figure S5 as Figure 5.

We can if requested by the editors but we think it is a bit technical.

– I am wondering if the authors attempted to get scanning electron microscopy images of mimivirus with an exposed ball of yarn. Sounds good.

No but this is something we should consider in additional studies.

– Medusavirus genome harbored genes for all five types of histones (H1, H2A, H2B, H3, and H4). Please consider adding this to the discussion topic.

As it was never demonstrated they can form nucleosomes we prefer not to speculate.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

The manuscript has been improved but there are some remaining issues that need to be addressed, as outlined below:

First, we find the uncertainty and controversy around the role and identification of the GMC oxidoreductases confusing. Please upload the cryo-EM maps and fitted atomic coordinates for the reviewers. Second, while we understand your decision to describe the genetic system and the genomic fibers of the M4 mutant in a separate publication, we feel that an additional explanation regarding the M4 mutant is necessary for this manuscript. The observation that the genomic fibers are constructed from different proteins in other mimiviruses, and potentially in the M4 mutant which lacks qu_946 and qu_143 poses quite a puzzle and raises concerns about the validity of the structural assignment.

I do not understand the Reviewers additional request as all maps and coordinates files were already provided for review purposes.

They are also available on the eLife site as Supporting Zip Documents that we thought were made available to the reviewers and editors:

  1. Cl1a : compact 5-start map and coordinates

  2. Cl1a : focus refine map and coordinates

  3. Cl1a focus refine DNA map

  4. Cl3 : relaxed 5-start genomic fiber map and coordinates

  5. Cl2 : compact 6-start map and coordinates

From the very beginning the reviewers could thus assess the quality of the reconstruction and the fit of the GMC-oxidoreductases in the various maps, including the focus refined map at 3.3 A resolution showing the FAD co-facteur into density.

Reviewers could go back to the 5 maps and coordinates of the GMC-oxidoreductase whenever they wish. They can visualize these data using a variety of software tool (coot, pymol, chimera, chimeraX…) and activate the symmetry to visualize the entire helices.

A video of the focus refined map with the fitted pdb structure was also provided as supplementary video (Video_S2.mp4) with a final zoom on the FAD co-factor showing it was in density. I wonder if the reviewers were aware of all this supplementary material.

The presence of the GMC-oxidoreductase as a main scaffolding protein in the Mimivirus genomic fiber is an experimental fact strongly supported by MS-based proteomics and by the 4 maps. Again, reviewers could assess the unambiguous fit of the protein into the 4 maps. The fact that this result is unexpected (i.e. "confusing"), and thus constitutes an important discovery, should not be used to deny its validity a priori.

As for M4, I really don't understand why it has become the focus of all attentions, as it is not the subject of our study, the purpose of which is to establish the first high resolution structure of a Mimiviridae chromosome.

Microbiology is full of examples of genes and structures once thought to be essential, then subsequently discovered lacking in related microorganisms. Considering that the GMC-oxidoreducase cannot compose Mimivirus genomic fiber simply because the homologous genes are not present in M4 is a mere opinion, not a scientific argument.

I did include in the revised version of this manuscript a sentence explicitly referring to M4:

“Interestingly, mimivirus M4 (31), a laboratory strain having lost the genes responsible for the synthesis of the two polysaccharides decorating mimivirus fibrils (11) also lacks the GMC-oxidoreductase genes. Additional studies on this specific variant will be key to establish if it exhibits a similar genomic fiber, and if yes, which proteins are composing it.”

We hope the reviewers will now have a careful look to the fit of the structure in the different maps and will get convinced on the validity of our results. I would recommend to use Coot or Pymol to be able to zoom easily into the structure and thus assess the fit of the secondary structures, the side chains as well as the one of the co-factor.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Brun V, Coute Y. 2021. Proteomic analysis of virion and genomic fibre of Mimivirus reunion. PRIDE. PXD021585
    2. Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its compact 5-start helix form. Electron Microscopy Data Bank. EMD-14354
    3. Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its compact 6-start helix form. Electron Microscopy Data Bank. EMD-14353
    4. Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its relaxed 5-start helix form. Electron Microscopy Data Bank. EMD-14355
    5. Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre asymmetric unit. Electron Microscopy Data Bank. EMD-13641
    6. Fabre E, Poirot O, Legendre M, Abergel C, Claverie JM, Yoshida T, Ogata H. 2021. Mimivirus reunion isolate Queen, complete genome. NCBI Nucleotide. MW004169
    7. Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15630
    8. Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15627
    9. Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15628
    10. Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15629

    Supplementary Materials

    Figure 1—figure supplement 3—source data 1. Source data of the agarose gel.
    Figure 1—figure supplement 3—source data 2. Source data of the agarose gel.
    Supplementary file 1. Mimivirus Reunion genomic fibers data statistics (Liebschner et al., 2019).
    elife-77607-supp1.docx (17.8KB, docx)
    Supplementary file 2. Mass spectrometry-based proteomic analysis of (A) three independent preparations of mimivirus genomic fiber and (B) one sample of purified mimivirus virions.

    (ND: not detected). RNA polymerase subunits are marked in red.

    elife-77607-supp2.docx (53.3KB, docx)
    Supplementary file 3. Contacting residues for each GMC oxidoreductases in the different maps.

    Conserved or divergent amino acids are color coded in green or in red, respectively.

    elife-77607-supp3.docx (116.4KB, docx)
    Supplementary file 4. Data acquisition parameters for cryo-electron microscopy (cryo-EM).
    elife-77607-supp4.docx (16.5KB, docx)
    Transparent reporting form

    Data Availability Statement

    Mimivirus reunion genome has been deposited under the following accession number: BankIt2382307 Seq1 MW004169. 3D reconstruction maps and the corresponding PDB have been deposited to EMDB (Deposition number Cl1a: 7YX4, EMD-14354; Cl1a focused refined: D_1292117739; Cl3a: 7YX5, EMD-14355; Cl2: 7YX3, EMD-14353). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD021585 and 10.6019/PXD021585. The tomograms have been deposited in EMPIAR under accession number 1131 and tomograms video is provided with the article.

    The following datasets were generated:

    Brun V, Coute Y. 2021. Proteomic analysis of virion and genomic fibre of Mimivirus reunion. PRIDE. PXD021585

    Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its compact 5-start helix form. Electron Microscopy Data Bank. EMD-14354

    Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its compact 6-start helix form. Electron Microscopy Data Bank. EMD-14353

    Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre in its relaxed 5-start helix form. Electron Microscopy Data Bank. EMD-14355

    Villalta A, Schmitt A, Estrozi LF, Quemin ERJ, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Grunewald K, Abergel C. 2022. Structure of the Mimivirus genomic fibre asymmetric unit. Electron Microscopy Data Bank. EMD-13641

    Fabre E, Poirot O, Legendre M, Abergel C, Claverie JM, Yoshida T, Ogata H. 2021. Mimivirus reunion isolate Queen, complete genome. NCBI Nucleotide. MW004169

    Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15630

    Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15627

    Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15628

    Villalta A, Schmitt A, Estrozi L, Quemin ERK, Alempic JM, Lartigue A, Prazak V, Belmudes L, Vasishtan D, Colmant AMG, Honore FA, Coute Y, Gruenewald K, Abergel C. 2022. Tomogram of the Mimivirus genomic fiber. Electron Microscopy Data Bank. EMD-15629


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES