Skip to main content
Journal of Virology logoLink to Journal of Virology
. 2022 Oct 3;96(20):e00783-22. doi: 10.1128/jvi.00783-22

Diverse RNA Viruses Associated with Diatom, Eustigmatophyte, Dinoflagellate, and Rhodophyte Microalgae Cultures

Justine Charon a, Tim Kahlke b, Michaela E Larsson b, Raffaela Abbriano b, Audrey Commault b, Joel Burke b, Peter Ralph b, Edward C Holmes a,
Editor: Colin R Parrishc
PMCID: PMC9599419  PMID: 36190242

ABSTRACT

Unicellular microalgae are of immense ecological importance with growing commercial potential in industries such as renewable energy, food, and pharmacology. Viral infections can have a profound impact on the growth and evolution of their hosts. However, very little is known of the diversity within, and the effect of, unicellular microalgal RNA viruses. In addition, identifying RNA viruses in these organisms that could have originated more than a billion years ago constitutes a robust data set to dissect molecular events and address fundamental questions in virus evolution. We assessed the diversity of RNA viruses in eight microalgal cultures, including representatives from the diatom, eustigmatophyte, dinoflagellate, red algae, and euglenid groups. Using metatranscriptomic sequencing combined with bioinformatic approaches optimized to detect highly divergent RNA viruses, we identified 10 RNA virus sequences, with nine constituting new viral species. Most of the newly identified RNA viruses belonged to the double-stranded Totiviridae, Endornaviridae, and Partitiviridae, greatly expanding the reported host range for these families. Two new species belonging to the single-stranded RNA viral clade Marnaviridae, commonly associated with microalgal hosts, were also identified. This study highlights that a substantial diversity of RNA viruses likely exists undetected within the unicellular microalgae. It also highlights the necessity for RNA viral characterization and for investigation of the effects of viral infections on microalgal physiology, biology, and growth, considering their environmental and industrial roles.

IMPORTANCE Our knowledge of the diversity of RNA viruses infecting microbial algae—the microalgae—is minimal. However, describing the RNA viruses infecting these organisms is of primary importance at both the ecological and economic scales because of the fundamental roles these organisms play in aquatic environments and their growing value across a range of industrial fields. Using metatranscriptomic sequencing, we aimed to reveal the RNA viruses present in cultures of eight microalgae species belonging to the diatom, dinoflagellate, eustigmatophyte, rhodophyte, and euglena major clades of algae. Accordingly, we identified 10 new divergent RNA virus species belonging to RNA virus families as diverse as the double-stranded Totiviridae, Endornaviridae, and Partitiviridae and the single-stranded Marnaviridae. By expanding the known diversity of RNA viruses infecting unicellular eukaryotes, this study contributes to a better understanding of the early evolution of the virosphere and will inform the use of microalgae in industrial applications.

KEYWORDS: diatom, evolution, microalgae, virome, virosphere, metagenomics

INTRODUCTION

Viruses are often considered the most ancient “life forms” (i.e., replicatory agents). As studies of the viromes of increasingly diverse taxa proceed, the more their remarkable ubiquity, diversity, and abundance becomes apparent (1). RNA viruses are by far the most abundant microorganisms in marine systems (2) and play fundamental roles in these environments by infecting and regulating phytoplankton populations (2). RNA viruses that infect unicellular photosynthetic microalgae are also of primary importance for marine resource management due to the significant ecotoxicological effect of some microalgal hosts, including abundant dinoflagellate species (3). There is also growing awareness of the value of microalgal cultures for biofuels, pharmacology, water treatment, food, and the aquacultural industries (47). Indeed, the intensive commercial cultivation and production of microalgae populations could be seriously affected by viral disease outbreaks (8). Accordingly, an extensive description of the RNA virus diversity in unicellular microalgae is of importance to better understand their role and impact on natural microalgal populations and in anticipating the consequences of industrial cultivation.

Knowledge of the RNA virosphere in overlooked eukaryotic lineages that evolved billions of years ago—such as the microalgae—could greatly enhance our understanding of the earliest events in RNA virus evolution. With barely 100 species of RNA viruses reported since the first isolation of a microalgae-infecting RNA virus in 2003 (9), our current knowledge of RNA viruses infecting microalgae is limited, representing less than 0.5% of the RNA viruses for which hosts have been formally established (10). This lack of knowledge most likely reflects the historical focus on viruses that cause disease in humans and bioresources (domestic animals, animal and insect vector, plants) rather than those infecting microbial eukaryotes.

The study of global viromes has been revolutionized by metagenomics. By avoiding cultivation limitations and paving the way for the exploration of very diverse environments (soil, water, etc.), the metagenomic era has multiplied the number of RNA viruses described by many thousands (1013). This is evident in the field of “phycovirology” (the study of algal viruses), for which recent studies investigating RNA viruses using metagenomic approaches have revealed a high diversity and prevalence of RNA viruses in several microalgae lineages (11, 1421). While the positive-sense single-strand (ss+) picorna-like Marnaviridae are the best described family of microalgae-infecting viruses (11, 22, 23), metagenomic studies continue to expand the diversity of microalgal viruses, including identification of the double-strand RNA (dsRNA) viruses from the orders Ghabrivirales (Totiviridae-like), Durnavirales (Partitiviridae-like), and Martellivirales (Endornaviridae-like) (19, 20, 2426), as well as ss+ RNA viruses from the Sobelivirales (Alvernaviridae), Nodamuvirales (Nodaviridae), Wolframvirales (Narnaviridae), and Cryppavirales (Mitoviridae) phyla (19, 20, 24, 27, 28). To date, the majority of the microalgal hosts documented to contain RNA viruses are from the Bacillariophyta (diatom) and Dinoflagellata (dinoflagellate) lineages. However, some viruses have been reported from other stramenopile hosts (such as Phaeophytes, Raphidophytes, and Xanthophytes) (9, 20, 2830) and in some other major groups of microalgae such as the Rhizaria (9, 20), Chlorophyta (19, 31), Rhodophyta (20, 25, 26), and, more recently, Haptophyta (20).

To increase our understanding of the RNA virosphere in microalgae, we assessed the diversity of RNA viruses in eight microalgal species, covering the major groups of stramenopiles, including eustigmatophytes (Nannochloropsis oceanica and Nannochloropsis oculata) and diatoms (Thalassiosira weissflogii), alveolates including dinoflagellates (Prorocentrum cf. balticum, Prorocentrum lima, Gambierdiscus carpenteri), red algae (Rhodella maculata), and euglenid (Euglena gracilis). By using a “culture-based” metatranscriptomic approach, we combined the power of unbiased detection of ultralarge-scale RNA sequencing with the use of monoorganism culture to assist in associating the viruses identified to their specific algae hosts. Given the high levels of sequence diversity observed in many RNA viruses, we paid particular attention to identifying divergent virus-like sequences.

RESULTS AND DISCUSSION

We searched for RNA virus sequences associated with cultures of eight unicellular microalgal species, representing four major algal groups: stramenopiles, alveolates, rhodophytes, and euglenozoa (Fig. 1). Following total RNA extraction from each microalgal culture, metatranscriptomic sequencing was used to obtain deep transcriptomes. The corresponding RNAs and data yields for each microalgal sample/library are detailed in Table 1 and Fig. S1. By combining a standard metagenomic bioinformatic pipeline with the protein hidden Markov model (HMM) profile and structural comparison developed in the RNA-dependent RNA polymerase (RdRp) scan (32), we were able to identify 10 new viral-like sequences (Table 2). With the exception of the unicellular red algae R. maculata, recently associated with the Despoena mito-like virus (20), these represent the first reports of viruses in each microalgal species investigated (Fig. 1). The 10 viral sequences found in this study were compared to the genomic sequences of the corresponding algal host whenever possible (Table S1). Accordingly, 9 of the 10 viral sequences identified were not found in the host genome and therefore were treated as exogenous viruses (Table 2). In contrast, the viral signal detected from E. gracilis using HMM-based approach was identical to Euglena genome sequences (Table 2) and hence likely corresponds to an endogenous viral element (EVE; see below).

FIG 1.

FIG 1

Phylogenetic position of the microalgal species used in this study within the global eukaryote phylogeny. Microalgal species names are indicated in italics, and their main applications (industrial, biological hazard [toxins], and model organisms) are specified by icons. Species for which no RNA viruses were reported prior to this study are indicated in bold. The eukaryote cladogram was based on data from reference 56.

TABLE 1.

Total RNA extractions and RNA-seq resultsa

Algae species Total RNA quantity (ng) Sequencing data yield (Gb)
Nannochloropsis oceanica 20 27.81
Nannochloropsis oculata 30 52.54
Thalassiosira weissflogii 105 29.39
Prorocentrum lima 700 25.16
Prorocentrum cf. balticum 7,500 40.80
Gambierdiscus carpenteri 630 51.80
Rhodella maculata 120 24.94
Euglena gracilis 5,700 24.90
a

RNA-seq, transcriptome sequencing.

TABLE 2.

New viruses and endogenous viral elements found in this studya

Virus name Host (lineage) RdRp phylum/order related RT-PCR validated? (virus/host) Coverage (quality/nb reads) Length Full-length vs. partial Exogenous vs. EVE
Taphios ghabri-like virus 1 Nannochloropsis oceanica (Eustigmatophyte) Duplornaviricota/Ghabrivirales Yes/Yes Good/13,769 4,876 Likely complete Exogenous
Taphios ghabri-like virus 1 Thalassiosira weissflogii (Eustigmatophyte) Duplornaviricota/Ghabrivirales Yes/Yes Good/876 4,835 Likely complete Exogenous
Triopas ghabri -like virus 1 Prorocentrum cf. balticum (Dinophyceae) Duplornaviricota/Ghabrivirales No/Yes Average/70 1,425 Partial Exogenous
Diktys durna-like virus 1 Prorocentrum lima (Dinophyceae) Pisuviricota/Durnavirales–Partitivirus Yes/Yes Good/24,826 1,826 Partial (one segment missing?) Exogenous
Orion durna-like virus 1 Gambierdiscus carpenteri (Dinophyceae) Pisuviricota/Durnavirales Yes/Yes Good/1,198 2,077 Partial (one segment missing?) Exogenous
Almopos endorna-like virus 1 Gambierdiscus carpenteri (Dinophyceae) Kitrinoviricota/Martellivirales Yes/Yes Good/54,823 21,494 Likely complete Exogenous
Althepos endorna-like virus 1 Gambierdiscus carpenteri (Dinophyceae) Kitrinoviricota/Martellivirales Yes/Yes Good/460 4,825 Likely complete Exogenous
Phineus pisuviri-like virus 1 Rhodella maculata (Rhodophyta) Pisuviricota/Picornavirales Yes/Yes Good/14,972 6,398 Likely complete Exogenous
Megareus marna-like virus 1 Nannochloropsis oculata (Stramenopiles) Pisuviricota/Picornavirales No/No Good/425 1,222 Partial Exogenous
Minyas marna-like virus 1 Nannochloropsis oculata (Stramenopiles) Pisuviricota/Picornavirales No/No Good/472 1,130 Partial Exogenous
Pisuviri-like signal Euglena gracilis (Euglenozoa) Pisuviricota/uncertain placement Yes/Yes Good/1,531 1,484 EVE EVE

aFull-length versus partial information was hypothesized from genomic length and organization. EVE, endogenous viral element; RdRp, RNA-dependent RNA polymerase; RT-PCR, reverse transcription PCR.

To eliminate contamination during the library preparation or sequencing, we tested the presence of all viruses in total RNA samples using reverse transcription (RT)-PCR. This resulted in the detection of 8 of the 11 virus/samples tested (Table 2 and Fig. 2). Triopas ghabri-like virus 1 could not be detected in Prorocentrum cf. balticum RNAs (Fig. 2), likely because of the very low abundance of this viral contig (Table 2). Megareus marna-like virus 1 and Minyas marna-like virus 1, both associated with the N. oculata sample, similarly could not be confirmed using RT-PCR. In addition, the positive control used to target the N. oculata internal transcribed spacer (ITS) sequence did not return any PCR signal (Fig. 2). Hence, the meager quantity of total RNA extracted from N. oculata cultures (Table 1) may explain the difficulty in validating both host gene and associated viruses using RT-PCR. As a control, all the final viral genomes sequences were cross-checked with every transcriptome sequencing (RNA-seq) library. None of the viral sequences found in this study were identified in the other samples, with the exception of Toti#2, which was present in the N. oceanica and T. weissflogii libraries.

FIG 2.

FIG 2

Reverse transcription (RT)-PCR confirmation of novel viral signals identified in this study. The expected lengths of each PCR product are indicated below each band. The RNA sequences targeted for each reaction are indicated in green. The corresponding RNA samples are indicated on top of each well. E.g, Euglena gracilis; P.b, Prorocentrum cf. balticum; T.w, Thalassiosira weissflogii; N.o, Nannochloropsis oceanica; N. ocu, Nannochloropsis oculata; (+), host gene tested.

Placement of the newly identified microalgal viruses within global RNA virus diversity.

To characterize the newly identified viruses, we first used phylogenetic analysis to place the new viral sequences within the diversity of viral RNA-dependent RNA polymerase (RdRp) sequences at the phylum level using the recently developed RdRp-scan resource (32). These large-scale phylogenies show that the viral sequences identified fell in diverse topological positions among those RNA viruses identified to date, with two belonging to the Duplornaviricota viruses, two falling into the Kitrinoviricota viruses, and six sharing homologies at amino acid level with Pisuviricota viruses (Fig. 3).

FIG 3.

FIG 3

Phylogenetic placement of the newly identified viruses within the diversity of Riboviria phyla. In these unrooted ML trees, red stars indicate the viruses newly identified. Light red stars represent RdRp-like sequences obtained using the hidden Markov model (HMM)-based RdRp-scan method. Bars represent the number of amino acid substitutions per site.

We then conducted additional phylogenetic analyses focusing on the viral subclades that contained the 10 newly identified sequences. These comprised the Ghabrivirales, Endornaviridae, Durnavirales, and Marnaviridae lineages and are described below.

New microalgae-infecting viruses suggest a TSAR-infecting Totiviridae genus.

Among the 10 viral sequences identified in this study, two were related to Totiviridae-like viruses (Table 2 and Fig. 3). Triopas ghabri-like virus 1, identified in the dinoflagellate Prorocentrum cf. balticum, forms a clade with the Arion toti-like virus identified in the dinoflagellate Pyrodinium bahamense (20). Together, these two viruses group with those previously reported in microalgae and oomycete hosts (Fig. 4).

FIG 4.

FIG 4

Phylogeny of the Ghabrivirales. Sequences in gray denote previously unclassified viruses, while those in bold refer to microalgae-associated viruses. Host lineages are indicated in circles to the right of major viral clade labels and correspond to fungi (orange), protozoa (light blue), and microalgae (blue). The new viral sequences identified in this study are indicated with red circles. The tree is mid-point rooted, and confident nodes (with SH-alrt likelihood ratio test values ≥80%) are represented as circles. The bar represents the number of amino acid substitutions per site.

The short length of the RdRp-encoding segment identified here—1.4 kb—suggests that the genome of Triopas ghabri-like virus 1 is partial (Fig. 5). While this limits the discussion of genomic attributes, an additional open reading frame (ORF) in reverse orientation and without any known function associated was predicted using the standard genetic code. Such a use of antisense ORF would constitute an original feature in the Totiviridae. Triopas ghabri-like virus 1-associated RNAs were found at very low abundance in the Prorocentrum cf. balticum sample and could not be confirmed experimentally by RT-PCR (Table 2 and Fig. 2). Although this viral sequence requires additional validation, it supports previous suggestions of dinoflagellate-infecting Totiviridae (20) and constitutes further evidence for recognizing a new genus infecting the Telonemid, Stramenopile, Alveolate, and Rhizaria supergroup (TSAR) within the Totiviridae (20).

FIG 5.

FIG 5

Genome organization of the Ghabri-like viruses identified in this study. Read coverage of each genome is represented as a blue histogram. The open reading frames (ORFs) were predicted using standard genetic codes, and their directions are represented as arrows. ORFs encoding RdRp-like signals and hypothetical functions are indicated in red and gray, respectively. A, B, and C RdRp motifs are indicated in green, yellow, and red boxes, respectively.

A second Toti-like virus, Taphios ghabri-like virus 1, was found in the eustigmatophyte N. oceanica and the diatom T. weissflogii. It forms a clade with the algae-associated Polyphemus and Ephialtes toti-like viruses, both previously identified in Astrosyne radiata (diatom) samples (20). They also form a sister clade to the Trichomonasvirus, Victorivirus, and Leishmaniavirus genera, infecting protozoan parasites and fungi (3335). To consolidate the host-virus relationship and potentially elongate the genomic sequence, we screened for the presence of the newly described viruses in additional host transcriptomes available in the Sequence Read Archive (SRA) (Table S2). Accordingly, Taphios ghabri-like virus 1 sequences were observed in one transcriptome (SRR12347810) of the diatom Phaeodactylum tricornutum, with only eight single-nucleotide polymorphisms (SNP) reported at the genome level.

The total length of the Taphios ghabri-like virus 1 sequence, at 4.8 kb, is in the range of other Totiviridae and, along with the read coverage profile, suggests that the full-length genome has been obtained (Fig. 5). The organization of the Taphios ghabri-like virus 1 genome into two overlapping ORFs, probably translated with a +1 ribosomal frameshift, corresponds to the genomic features commonly observed among the Totiviridae. The first ORF likely encodes a coat protein, while no annotations could be retrieved from InterProscan analysis for this ORF (36). We hypothesize from the placement within the Totiviridae phylogeny and the similarities in genome organization and length that this virus has a dsRNA genome. Combined, the results from RdRp phylogenies, genome organization, and host range are in accord with establishing a new Totiviridae genus infecting diatom and eustigmatophyte hosts.

The observation of Totiviridae likely infecting dinoflagellates, diatoms, and eustigmatophyte hosts aligns with the suspected ubiquity of these dsRNA viruses in microalgae (20) and unicellular eukaryotes more generally (37). Notably, the Totiviridae have been associated with changes in host fitness and to hyper- or hypovirulence of some of their hosts (3840). The effects of the newly discovered Totiviridae genus on corresponding dinoflagellate, diatom, and eustigmatophyte microalgal cultures require additional investigation and could be of interest considering their potential effects on growth, including that of harmful algal blooms (HABs), and commercial cultivation yields.

First association of Alphaendornavirus with dinoflagellates.

Two of the viruses identified in this study cluster within the Endornaviridae family of dsRNA viruses. Specifically, Althepos endorna-like virus 1 and Almopos endorna-like virus 1 (both retrieved from G. carpenteri–Dinophyceae) group with members of the Alphaendornavirus genus, a genus within the Endornaviridae previously associated with land plants, fungi, and oomycetes (41) (Fig. 6).

FIG 6.

FIG 6

Phylogeny of the Endornaviridae. Sequences in gray denote unclassified viruses and sequences in bold refer to microalgal-associated viruses. Host lineages are indicated in circles to the right of major viral clades and correspond to fungi, land plants, protozoa, and microalgae. The new viral sequences identified in this study are indicated with circles. The Sichuan Alphaendornavirus cluster includes the diatom-associated RNA virus 15, previously reported from diatom-containing samples (18). The tree is mid-point rooted, and confident nodes (with SH-alrt likelihood ratio test values ≥80%) are represented as circles. The bar represents the number of amino acid substitutions per site.

Endornaviridae dsRNA genomes are 9.7 to 17.6 kb in length and encode a single polyprotein with a RdRp domain located in the C terminus (37). The genome organization of Almopos endorna-like virus 1 therefore possesses features common to the Endornaviridae (Fig. S2), except for its genome size of ~21 kb, which is the longest genome reported to date for this group. In addition to the viral RdRp domain located in the C-terminal region of the Almopos endorna-like virus 1 protein, other protein domains and signatures could be identified that were related to the (+)RNA virus helicase core (IPR027351), the YbiA-like superfamilies (IPR037238), and the UDP-glycosyltransferase/glycogen phosphorylase superfamily (SSF53756) (Fig. S2), similar to previous studies (4245). It is very likely that other viral proteins and functions are encoded but are too divergent to be identified. The investigation of these additional divergent viral translated products could be of significant importance for both, revealing the evolutionary origins of the Endornaviridae (46).

The Althepos endorna-like virus 1 sequence is only 4.8 kb in length and likely represents a partial genome. Additional read mapping using our metagenomic or SRA-based data did not allow the retrieval of the full-length sequence (Fig. S2). Whether Althepos endorna-like virus 1 and Almopos endorna-like virus 1 affect the fitness of their G. carpenteri host remains to be investigated but may have important implications for the management of this potentially harmful species (47). Considering the persistent lifestyle reported for endornaviruses (48) and the high similarities in terms of RdRp sequence and genomic organization, it is likely that Althepos endorna-like virus 1 and Almopos endorna-like virus 1 share the same infectious properties as other members of the genus Alphaendornavirus and might therefore constitute another example of capsid-less persistent viruses associated with protist hosts. Importantly, Althepos endorna-like virus 1 and Almopos endorna-like virus 1 identified from the G. carpenteri culture represent the second microalgal-endornavirus association observed to date (18) and the first report in a dinoflagellate host, strongly suggesting that a microalgal-specific Endornaviridae clade may exist.

A new Partitiviridae genus associated with dinoflagellate hosts.

Orion durna-like virus 1 and Diktys durna-like virus 1, observed in P. lima (Dinophyceae) and G. carpenteri (Dinophyceae) cultures, respectively, form a clade with the Ourea durna-like virus previously associated with the dinoflagellate Dinophysis acuminata (20) (Fig. 7). Specifically, they form a sister clade to the genus Deltapartitivirus, belonging to the bi-segmented dsRNA Partitiviridae that infect fungi and plants (49) and recently associated with unicellular algae (20).

FIG 7.

FIG 7

Phylogeny of the Durnavirales. Sequences in gray denote unclassified viruses, while those in bold refer to microalgal-associated viruses. Host lineages are indicated with circles to the right of major viral clades and correspond to metazoa, fungi, land plants, protozoa, and microalgae. The new viral sequences identified in this study are indicated with circles. The tree is mid-point rooted, and confident nodes (with SH-alrt likelihood ratio test values ≥80%) are represented as circles. The bar represents the number of amino acid substitutions per site.

Orion durna-like virus 1 and D. durna-like virus 1 genomes (1.8 kb and 2 kb in length, respectively) with a single ORF containing the RdRp domain (Fig. S3). Partitiviridae are bisegmented viruses. Considering the placement of Orion durna-like virus 1 and D. durna-like virus 1 within the Partitiviridae phylogeny, it is very likely that they comprise a second segment, potentially encoding a coat protein not retrieved in this study due to our RdRp-based retrieval methodology. A complementary comparison of those sequences in the SRA database identified a sequence with 100% sequence identity at the amino acid level to D. durna-like virus 1 from a Gambierdiscus polynesiensis (dinoflagellate) sample (SRR3358210) (Table S2).

Together, these results are compatible with the establishment of a new Partitiviridae genus that is specific to dinoflagellates, comprising Orion durna-like virus 1, D. durna-like virus 1, and the previously identified Ourea durna-like virus. This observation expands the host range reported for this family, already comprising plants, fungi, oomycetes, apicomplexan parasites, and green algae (19, 5053). While most of the Partitiviridae do not induce symptoms in their hosts, hypovirulence has been reported in the alpha-, beta-, and gammapartitiviruses (54). Further analysis is required to assess the effects of Orion durna-like virus 1 and D. durna-like virus 1 on their potentially harmful dinoflagellate hosts, G. carpenteri and P. lima (47, 55).

Marnaviridae-like viruses associated with a N. oculata culture.

Two newly identified viruses were identified in a N. oculata (a eustigmatophyte) sample and exhibited RdRp sequence similarity with the Marnaviridae, a picorna-like family of ss+ RNA viruses that infect unicellular eukaryotes (Fig. 8). With its taxonomy recently reassessed to incorporate viruses from metagenomic studies (22), the Marnaviridae are classified into seven genera. Accordingly, Minyas marna-like virus 1 belongs to the genus Locarnavirus that comprises viruses derived from marine environment, mollusc- and fish-based metagenomic studies (Fig. 8).

FIG 8.

FIG 8

Phylogeny of the Marnaviridae. Sequences in gray denote unclassified viruses, while those in bold refer to algae-associated viruses. Host lineages are indicated in circles to the right of major viral clades and correspond to arthropods, land plants, and microalgae. The new viral sequences identified in this study are indicated with circles. The tree is mid-point rooted, and confident nodes (with SH-alrt likelihood ratio test values ≥80%) are represented as circles. The bar depicts the number of amino acid substitutions per site.

This identification of Minyas marna-like virus 1 from the N. oculata culture provides compelling evidence for a Locarnavirus directly associated with a unicellular microalga. Along with the previous identification of the Dinophyceae-associated Pelias marna-like virus (20) (Fig. 8), this supports the idea of an extensive host range of locarnaviruses among unicellular microalgae. The second Marnaviridae-like virus, Megareus marna-like virus 1, forms a cluster with Sanxia picorna-like virus 7, falling in a position basal to locarna-, kusarna-, bacillarna-, salisharna-, and sogarnaviruses (Fig. 8). It may therefore constitute a new genus of Marnaviridae (22).

The short sequences of both Minyas marna-like virus 1 and Megareus marna-like virus 1 and their average read coverages (Table 2 and Fig. S3) strongly suggest that only partial genomes have been recovered. The low quantities of RNA extracted from the N. oculata sample and the corresponding fragmented RNAs likely explain the poor coverage for the corresponding viral contigs and why RT-PCR targeting both viral and host sequences returned negatives (Fig. 2). Additional studies are needed to achieve the genomic and biological characterization of those new Marnaviridae-like viruses associated with the eustigmatophyte host N. oculata. In particular, if the two newly reported Marnaviridae caused lysis of the biofuel-producing N. oculata cells, this could represent a major concern for industrial-scale production.

Divergent viruses identified using protein profiles and structural comparisons.

To help identify viruses in basal and divergent microbial eukaryotes, we conducted an approach based on HMM and structural RdRp comparisons, using the newly developed RdRp-scan tool (32). Briefly, ORFs were predicted from each orphan contig and compared to RdRp profiles using hidden Markov models (32). Such a strategy is expected to detect distant homologs sharing less than 30% sequence identity with viral protein sequences currently available in sequence databases. As a result, two additional viral RdRp sequences were identified as distantly homologous to Pisuviricota members (Fig. 3).

Using RdRp-scan HMM profiles, a remote Pisuviricota-like RdRp signal was identified as being associated with E. gracilis. The complementary Phyre2-based homology search returned a strong hit to the picornavirus sicinivirus 3dpol RdRp, validating the Pisuviricota-like signal previously detected. As noted above, comparison with E. gracilis nuclear (GCA_900893395) and mitochondrial (GCA_001638955) revealed strong identities (Table S1). A very close sequence (seven SNPs at the genome level and six nonsynonymous substitutions at the RdRp level) could also be retrieved from the SRA database sample (SRR2294740), corresponding to the mitochondrial genome of E. gracilis. Such a mitochondrial sublocation is also suggested by the ORF found that can be expressed using the Chlorophycean mitochondrial genetic code (Fig. S3). Hence, this Pisuviri-like signal appears to be part of the host genome, likely corresponding to an endogenous viral element. The presence of such EVEs will help identify divergent viruses infecting euglenoid lineages, which are expected to be highly divergent, considering the basal placement of euglenoids within eukaryotic organism diversity (56).

The Phineus pisuviri-like virus 1, identified from R. maculata, was also confidently identified as a remote homolog of Pisuviricota RNA viruses using both RdRp-scan profiles and Phyre2 server. Although its distant and basal position in the RNA virus phylogeny prevents a robust comparison with existing Riboviria clades, its genome of 6.4 kb and the associated read coverage suggests the full-length genome was recovered. The genome encodes three ORFs that possess RdRp function at the C terminus (Fig. S3). No functions could be associated with the additional ORFs, and further studies are required to characterize this newly identified protist-infecting virus.

Additional viruses.

While it does not constitute a novel virus, one contig assembled from the R. maculata was retrieved in very high quantity and identical to the D. mito-like virus (Table 2), previously reported in another R. maculata sample (20). This strongly reinforces the idea of a D. mito-like virus infecting the red algal host and more generally the establishment of a mitovirus subclade able to infect microalgae (20). Finally, our unbiased metagenomic analysis also retrieved additional viruses related to the Tombusviridae, with identical sequences identified across several unrelated samples. It is very likely that these sequences result from contamination from kits or water used to extract or prepare RNA and cDNA libraries and were thus discarded from this study.

Virus-host assumptions based on the composition of microalgal cultures.

The composition of the major kingdoms present in each sample was obtained by comparing contigs to the nt and nr databases. The corresponding proportions and the abundance of contigs without a detectable match in nt or nr are shown in Fig. 9. Bacteria-associated contigs were present within the libraries, especially those from Nannochloropsis and Rhodella, in line with the commonly reported microalgal-bacteria interactions (57). Bacterial and eukaryotic viruses are usually too distantly related to be confounded. The presence of bacterial organisms in the samples is therefore not expected to interfere with our assumption that viruses identified as sharing homology with eukaryotic viruses very likely infect eukaryotic microalgal hosts. The proportion of undetected hits, without any match in the nt and nr databases, is highly variable between libraries, ranging from less than 15% in the T. weissflogii sample to more than 85% in Prorocentrum minimum culture (Fig. 9A). Such high variation likely arises from the lack of microalgal genomic and proteomic sequences in the NCBI nt and nr databases, with genomic sequences available only for half of the microalgae hosts analyzed here (Table S1). Such discrepancies in nucleotide and protein sequence assignment and abundance are further amplified in cases of highly abundant transcripts, such as rRNA, which very likely remain in the sample.

FIG 9.

FIG 9

Relative abundance of contigs in microalgae libraries based on their assignment to major cellular organism clades. Contigs were assigned according to the taxonomy of their best BLAST hits. Percentages of each contig were based on the abundance values and correspond to the sum of all contig TPM values belonging to each taxonomy clade. (A) Relative abundance of contigs associated with kingdoms Archaea (dark pink), Bacteria (light pink), and Eukaryota (yellow) using both BLASTn and BLASTx. The abundance of contigs with nt or nr entries lacking a taxonomy assignment are indicated in dark gray, while those without any nt or nr matches detected are indicated in light gray. (B) Relative abundances of contigs associated with major eukaryotic clades using BLASTx. Low abundance clades, counting for less than 0.5% of the total contig abundance, are not represented. TSAR, Telonemids-Stramenopiles-Alveolates-Rhizaria group as defined in reference 56.

While many unassigned entries in most of the samples analyzed here can limit the formal assignment of viruses to hosts, obtaining a clearer picture of the eukaryotic host sequences present in the sample and their relative abundance can help to discriminate between eukaryotic hosts. Indeed, our cultures were washed several times before RNA extraction. It is therefore likely that the viral sequences identified result from intracellular viral forms rather than extracellular virions circulating in the culture media: hence, we assume that viruses detected in this study are associated with cellular organisms that are also present in the sample. We therefore examined the deep taxonomy of BLASTx eukaryotic-like contigs, as well as the total contig abundance reported for major eukaryotic lineages (Fig. 9B), which helped discriminate potential hosts for the most uncertain assignments. Accordingly, the very low abundance of fungi and land plant-associated sequences in G. carpenteri (Fig. 9B) could constitute additional evidence for a microalgae-infecting endornavirus, even though members of the Endornaviridae have been traditionally associated with fungi and land plants (Streptophyta).

In the case of the P. pisuviri-like virus 1 identified from R. maculata, the majority of detectable contigs belong to the corresponding Rhodophyta host taxa, suggesting that this virus is likely associated with a Rhodophyte host rather than fungi or other contaminant organisms. The very large proportion of contigs associated with land plants (Streptophyta) in the N. oculata library might correspond to contamination. However, the unambiguous placement of the corresponding Minyas marna-like virus 1 virus within the well established microalgae-infecting Marnaviridae provides a strong argument that this virus is associated with diatoms.

Of note, all of the viruses reported here were identified in healthy algae cultures and therefore assumed to be detrimental to their algal hosts. It will be of considerable interest to characterize the phenotypic effects of such seemingly commensal infections, whether persistent, neutral, or even beneficial, and to understand their underlying molecular basis.

Conclusions.

Through metatranscriptomic sequencing of total RNA from microalgae cultures we identified 10 new RNA viruses associated with diatom, eustigmatophyte, dinoflagellate, and rhodophyte microalgae. These newly discovered viruses contribute to the establishment of new microalgae-infecting viral clades within the Totiviridae and Partitiviridae, as well as the enrichment of the positive single-stranded picorna-like family Marnaviridae. This study also extended the host range of the dsRNA endornaviruses to microalgae, raising questions about how this viral family is able to infect the plant, fungi, and TSAR eukaryotic supergroups. Considering the harmful or commercial value of their hosts, this description of new microalgal viruses paves the way for further studies of the effects of viral infections on host biology and their associated ecological and industrial consequences. Finally, this study highlights the need to reveal the hidden diversity among RNA viruses infecting microalgae and among microbial eukaryotes in general, particularly considering their fundamental and applied importance.

MATERIALS AND METHODS

Algae cultures.

Microalgal cultures were maintained on a 12-h:12-h light:dark cycle at 100 μmol m−2 s−1. Culture media and temperature conditions were specific to each species and were as follows: N. oceanica 24°C, f/2 medium; N. oculata 24°C, f/2 medium; T. weissflogii 20°C, f/2 medium; R. maculata 24°C, L1 medium (minus Si); E. gracilis 20°C, euglena medium; Prorocentrum cf. balticum (UTSPH2D4) (47); 20°C K medium-Si; P. lima 25°C modified K medium (58); and G. carpenteri (UTSHI2C4) 25°C modified K medium (47, 59). To harvest each microalgal culture, the cells from 100 to 250 mL were pelleted by centrifugation at 200 × g for 4 min and the supernatant discarded. The cells were then resuspended in 5 mL of artificial seawater and centrifuged again at 200 × g for 4 min. This wash step was repeated twice more before a final centrifugation step at 1,000 × g for 4 min followed by storage at −80°C until RNA extraction.

Total RNA extraction and sequencing.

Total RNA from the diatom (T. weissflogii) and Euglenozoa (E. gracilis) cultures were extracted using the RNeasy Plus Universal kit (Qiagen), according to the manufacturer’s instructions. Qiazol lysis buffer was then added to frozen pellets, and homogenization was performed by pipetting. Genomic DNA was removed, and RNA was extracted using 1-bromo-3-chloropropane. Supernatants were then transferred to Qiagen columns. After washing the columns, pure RNAs were collected into sterile water strictly following kit instructions.

Total RNA from dinoflagellates (P. lima, Prorocentrum cf. balticum, G. carpenteri), the Rhodophyta R. maculata, and the eustigmatophyte (N. oceanica and N. oculata) cultures was extracted using Allprep DNA/RNA kit (Qiagen), following the manufacturer’s instructions. Briefly, frozen cell pellets were supplanted with lysis RLT buffer and cells disrupted using bead beating with 0.5-mm glass beads. An additional step of sample homogenization using QIAshredder (Qiagen) was added during the RNA extraction of R. maculata sample and N. oceanica to reduce the viscosity of eluates. Cell debris was removed using a centrifugation step at high speed, and the supernatants were transferred to Qiagen columns. Total RNA fractions were then purified after several washing steps and eluted according to the kit instructions.

RNA sequencing.

RNA quality was checked using a TapeStation and individually converted by the Australian Genome Research Facility (AGRF, Melbourne) into non-rRNA RNA-seq libraries using TruSeq Stranded Total RNA with Ribo-Zero Plant (Illumina). Due to the very low RNA yields obtained for N. oculata and N. oceanica, these two libraries were prepared using the SMARTer stranded total RNA-seq kit version 2, Pico Input Mammalian libraries (TaKaRa Bio, Mountain View, CA, USA). The corresponding libraries were sequenced on the NovaSeq platform (Illumina) (paired-end, 150 bp) by the AGRF.

RNA-seq data preprocessing: read trimming, rRNA depletion, and contig assembly.

Total reads were filtered using Trimmomatic (version 0.36) (60) to remove low-quality and Illumina adapters. To maximize the completeness of the ribosomal RNA (rRNA) depletion performed during library prep, the remaining rRNA reads were removed using the SortmeRNA program (2.1b) (61). Filtered reads were then assembled into contigs using Trinity (version 2.5.1) (62), and the abundances (expected count and TPM) were calculated using RSEM (version 1.3.1) (63).

Sample taxa composition.

To help determine the taxa composition of each library, all contig sequences were compared to the nonredundant protein database nr from NCBI using Diamond BLASTx (version 2.0.9) (64) and to the nucleotide database nt from NCBI using BLAST (version 2.2.30). The best hits were reported for each contig, and their corresponding taxonomies were analyzed. For each library, contigs were grouped into major eukaryotic taxa and relative abundance determined as the sum of all the TPM (transcripts per million) within each taxon.

RNA virus identification.

(i) Sequence-based similarity detection. RdRp sequences corresponding to RNA viruses (i.e., the Riboviria) were first identified by comparing contigs to the nr database using Diamond BLASTX (version 2.0.9; e value < 1e−5) (64). To maximize the detection of RNA viruses, putative virus sequences identified from nr BLASTx, as well as those previously obtained in an algae virus study (20), were used as a database to perform a second round of BLASTx using contig libraries as queries and employing the same parameters as previously. The resulting RNA virus-like sequences were then submitted to the nr database (NCBI), and hits with the best match in cellular organism sequences were treated as false positives and discarded from the analysis.

(ii) HMM-based homology detection of ORFans. All orphan contig sequences (i.e., that had no match in the nr database denoted ORFans) were compared to the RdRp HMM profiles of the RdRp-scan resource (32) and using the HMMer3 program (version 3.3) (65).

(iii) Genome extension, genome coverage, and virus annotation. To ensure all the RNA virus-like sequences could be identified and in their longest form, additional attempts to assemble contigs were performed using the rnaSPADES (version 3.13.0) (66) and Megahit programs (version 1.2.9) (67). This did not identify additional or longer RNA virus sequences. A manual elongation step was performed on viral candidates using Geneious (version 11.1.4) (68). A virus annotation to identify RdRp motifs was performed using InterProScan (69) and RdRp-scan (32). Genome coverage profiles were obtained by mapping the non-rRNA reads back to each contig sequence using Bowtie2 (version 2.3.3.1) (70) and Samtools (version 1.6) (71). The resulting SAM files were then plotted onto viral genomes using Geneious (version 11.1.4) (68). The overall quality of each genome assembly was finally assessed based on the read coverage homogeneity along the sequence and its decrease at the extremities.

(iv) SRA mining. To help retrieve complete genome sequences, assess intraspecies variability, and associate viruses with particular algae hosts, we performed an additional step of Sequence Read Archive (SRA) mining for each of the 10 new viruses identified in this study. For each algae library, we screened the SRA using nucleotide Magic-BLAST (version 1.3.0) (72). When the number of hits exceeded 100, the corresponding SRA reads were mapped to the viral genome using Bowtie2 (version 2.3.3.1) (70) and SAMtools (version 1.6) (71).

(v) Phylogenetic analysis. RNA virus phyla-level comparisons were performed using Clustal Omega (version 1.2.4) (73) to directly compare the newly identified sequences to the prebuilt RdRp alignments from the RdRp-scan resource (32). Initial phylogenetic trees were inferred using the maximum-likelihood method available in FastTREE (version 2.1.9; default parameters) (74). Subalignments at the RNA virus order or family scale were then obtained using Clustal Omega (version 1.2.4) (73) and manually checked using Geneious (version 11.1.4) (68). Maximum-likelihood phylogenies of these subalignments were then inferred using the IQ-TREE package (version 2.0-rc1) (75) with the best-fit amino acid substitution model obtained with ModelFinder Plus (76) and using a Shimodaira-Hasegawa approximate-likelihood ratio and 1,000 replicates (-alrt 1,000) to assess nodal support.

RT-PCR confirmation.

To experimentally confirm viral contigs assembled from RNA-seq data, cDNAs from each of the total RNAs were first obtained using the SuperScript IV reverse transcriptase (Invitrogen). PCRs were then performed on each cDNA sample using corresponding host and virus primers (detailed in Table S3) using the Platinum SuperFi II DNA polymerase (Invitrogen) and following the manufacturer’s instructions.

Data availability.

Corresponding RNA-seq read files are available on the SRA under BioProject PRJNA867582, with accessions SAMN30215649 to SAMN30215656. The viruses newly identified here are available at GenBank/NCBI under the accessions OP191686 to OP191695.

ACKNOWLEDGMENTS

We thank Jean-Baptiste Raina for helpful comments on host compositions.

E.C.H. is supported by Australian Research Council Australian Laureate Fellowship FL170100022.

Footnotes

Supplemental material is available online only.

Supplemental file 1
Tables SA1 to SA3 and Fig. SA1 to SA3. Download jvi.00783-22-s0001.pdf, PDF file, 0.5 MB (514.3KB, pdf)

Contributor Information

Edward C. Holmes, Email: edward.holmes@sydney.edu.au.

Colin R. Parrish, Cornell University

REFERENCES

  • 1.Simmonds P, Adams MJ, Benkő M, Breitbart M, Brister JR, Carstens EB, Davison AJ, Delwart E, Gorbalenya AE, Harrach B, Hull R, King AMQ, Koonin EV, Krupovic M, Kuhn JH, Lefkowitz EJ, Nibert ML, Orton R, Roossinck MJ, Sabanadzovic S, Sullivan MB, Suttle CA, Tesh RB, van der Vlugt RA, Varsani A, Zerbini FM. 2017. Virus taxonomy in the age of metagenomics. Nat Rev Microbiol 15:161–168. 10.1038/nrmicro.2016.177. [DOI] [PubMed] [Google Scholar]
  • 2.Suttle CA. 2007. Marine viruses—major players in the global ecosystem. Nat Rev Microbiol 5:801–812. 10.1038/nrmicro1750. [DOI] [PubMed] [Google Scholar]
  • 3.Wang D-Z. 2008. Neurotoxins from marine dinoflagellates: a brief review. Mar Drugs 6:349–371. 10.3390/md20080016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Barkia I, Saari N, Manning SR. 2019. Microalgae for high-value products towards human health and nutrition. Mar Drugs 17:304. 10.3390/md17050304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Barolo L, Abbriano RM, Commault AS, George J, Kahlke T, Fabris M, Padula MP, Lopez A, Ralph PJ, Pernice M. 2020. Perspectives for glyco-engineering of recombinant biopharmaceuticals from microalgae. Cells 9:633. 10.3390/cells9030633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Chew KW, Yap JY, Show PL, Suan NH, Juan JC, Ling TC, Lee D-J, Chang J-S. 2017. Microalgae biorefinery: high value products perspectives. Bioresour Technol 229:53–62. 10.1016/j.biortech.2017.01.006. [DOI] [PubMed] [Google Scholar]
  • 7.Harun R, Singh M, Forde GM, Danquah MK. 2010. Bioprocess engineering of microalgae to produce a variety of consumer products. Renew Sustain Energy Rev 14:1037–1047. 10.1016/j.rser.2009.11.004. [DOI] [Google Scholar]
  • 8.Zhu Z, Jiang J, Fa Y. 2020. Overcoming the biological contamination in microalgae and cyanobacteria mass cultivations for photosynthetic biofuel production. Molecules 25:5220. 10.3390/molecules25225220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Tai V, Lawrence JE, Lang AS, Chan AM, Culley AI, Suttle CA. 2003. Characterization of HaRNAV, a single-stranded RNA virus causing lysis of Heterosigma akashiwo (Raphidophyceae). J Phycol 39:343–352. 10.1046/j.1529-8817.2003.01162.x. [DOI] [Google Scholar]
  • 10.Cobbin JC, Charon J, Harvey E, Holmes EC, Mahar JE. 2021. Current challenges to virus discovery by meta-transcriptomics. Curr Opin Virol 51:48–55. 10.1016/j.coviro.2021.09.007. [DOI] [PubMed] [Google Scholar]
  • 11.Wolf YI, Silas S, Wang Y, Wu S, Bocek M, Kazlauskas D, Krupovic M, Fire A, Dolja VV, Koonin EV. 2020. Doubling of the known set of RNA viruses by metagenomic analysis of an aquatic virome. Nat Microbiol 5:1262–1270. 10.1038/s41564-020-0755-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Shi M, Lin X-D, Tian J-H, Chen L-J, Chen X, Li C-X, Qin X-C, Li J, Cao J-P, Eden J-S, Buchmann J, Wang W, Xu J, Holmes EC, Zhang Y-Z. 2016. Redefining the invertebrate RNA virosphere. Nature 540:539–543. 10.1038/nature20167. [DOI] [PubMed] [Google Scholar]
  • 13.Zayed AA, Wainaina JM, Dominguez-Huerta G, Pelletier E, Guo J, Mohssen M, Tian F, Pratama AA, Bolduc B, Zablocki O, Cronin D, Solden L, Delage E, Alberti A, Aury J-M, Carradec Q, da Silva C, Labadie K, Poulain J, Ruscheweyh H-J, Salazar G, Shatoff E, Bundschuh R, Fredrick K, Kubatko LS, Chaffron S, Culley AI, Sunagawa S, Kuhn JH, Wincker P, Sullivan MB, Acinas SG, Babin M, Bork P, Boss E, Bowler C, Cochrane G, de Vargas C, Gorsky G, Guidi L, Grimsley N, Hingamp P, Iudicone D, Jaillon O, Kandels S, Karp-Boss L, Karsenti E, Not F, Ogata H, Poulton N, Tara Oceans Coordinators , et al. 2022. Cryptic and abundant marine viruses at the evolutionary origins of Earth’s RNA virome. Science 376:156–162. 10.1126/science.abm5847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Culley AI, Lang AS, Suttle CA. 2006. Metagenomic analysis of coastal RNA virus communities. Science 312:1795–1798. 10.1126/science.1127404. [DOI] [PubMed] [Google Scholar]
  • 15.Lang AS, Rise ML, Culley AI, Steward GF. 2009. RNA viruses in the sea. FEMS Microbiol Rev 33:295–323. 10.1111/j.1574-6976.2008.00132.x. [DOI] [PubMed] [Google Scholar]
  • 16.Urayama S-I, Takaki Y, Nishi S, Yoshida-Takashima Y, Deguchi S, Takai K, Nunoura T. 2018. Unveiling the RNA virosphere associated with marine microorganisms. Mol Ecol Resour 18:1444–1455. 10.1111/1755-0998.12936. [DOI] [PubMed] [Google Scholar]
  • 17.Culley AI, Mueller JA, Belcaid M, Wood-Charlson EM, Poisson G, Steward GF. 2014. The characterization of RNA viruses in tropical seawater using targeted PCR and metagenomics. mBio 5:e01210-14. 10.1128/mBio.01210-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Chiba Y, Tomaru Y, Shimabukuro H, Kimura K, Hirai M, Takaki Y, Hagiwara D, Nunoura T, Urayama S-I. 2020. Viral RNA genomes identified from marine macroalgae and a diatom. Microbes Environ 35:ME20016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Charon J, Marcelino VR, Wetherbee R, Verbruggen H, Holmes EC. 2020. Metatranscriptomic identification of diverse and divergent RNA viruses in green and chlorarachniophyte algae cultures. Viruses 12:1180. 10.3390/v12101180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Charon J, Murray S, Holmes EC. 2021. Revealing RNA virus diversity and evolution in unicellular algae transcriptomes. Virus Evol 7:veab070. 10.1093/ve/veab070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Chase EE, Monteil-Bouchard S, Gobet A, Andrianjakarivony FH, Desnues C, Blanc G. 2021. A high rate algal pond hosting a dynamic community of RNA viruses. Viruses 13:2163. 10.3390/v13112163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Vlok M, Lang AS, Suttle CA. 2019. Application of a sequence-based taxonomic classification method to uncultured and unclassified marine single-stranded RNA viruses in the order Picornavirales. Virus Evol 5:vez056. 10.1093/ve/vez056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sadeghi M, Tomaru Y, Ahola T. 2021. RNA viruses in aquatic unicellular eukaryotes. Viruses 13:362. 10.3390/v13030362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Urayama SI, Takaki Y, Nunoura T. 2016. FLDS: a comprehensive DSRNA sequencing method for intracellular RNA virus surveillance. Microbes Environ 31:33–40. 10.1264/jsme2.ME15171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lachnit T, Thomas T, Steinberg P. 2015. Expanding our understanding of the seaweed holobiont: RNA viruses of the red alga Delisea pulchra. Front Microbiol 6:1489. 10.3389/fmicb.2015.01489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rousvoal S, Bouyer B, López-Cristoffanini C, Boyen C, Collén J. 2016. Mutant swarms of a totivirus-like entities are present in the red macroalga Chondrus crispus and have been partially transferred to the nuclear genome. J Phycol 52:493–504. 10.1111/jpy.12427. [DOI] [PubMed] [Google Scholar]
  • 27.Tomaru Y, Katanozaka N, Nishida K, Shirai Y, Tarutani K, Yamaguchi M, Nagasaki K. 2004. Isolation and characterization of two distinct types of HcRNAV, a single-stranded RNA virus infecting the bivalve-killing microalga Heterocapsa circularisquama. Aquat Microb Ecol 34:207–218. 10.3354/ame034207. [DOI] [Google Scholar]
  • 28.Waldron FM, Stone GN, Obbard DJ. 2018. Metagenomic sequencing suggests a diversity of RNA interference-like responses to viruses across multicellular eukaryotes. PLoS Genet 14:e1007533. 10.1371/journal.pgen.1007533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Lang AS, Culley AI, Suttle CA. 2004. Genome sequence and characterization of a virus (HaRNAV) related to picorna-like viruses that infects the marine toxic bloom-forming alga Heterosigma akashiwo. Virology 320:206–217. 10.1016/j.virol.2003.10.015. [DOI] [PubMed] [Google Scholar]
  • 30.Takao Y, Mise K, Nagasaki K, Okuno T, Honda D. 2006. Complete nucleotide sequence and genome organization of a single-stranded RNA virus infecting the marine fungoid protist Schizochytrium sp. J Gen Virol 87:723–733. 10.1099/vir.0.81204-0. [DOI] [PubMed] [Google Scholar]
  • 31.Attoui H, Jaafar FM, Belhouchet M, de Micco P, de Lamballerie X, Brussaard CPD. 2006. Micromonas pusilla reovirus: a new member of the family Reoviridae assigned to a novel proposed genus (Mimoreovirus). J Gen Virol 87:1375–1383. 10.1099/vir.0.81584-0. [DOI] [PubMed] [Google Scholar]
  • 32.Charon J, Buchmann JP, Sadiq S, Holmes EC. 2022. RdRp-scan: a bioinformatic resource to identify and annotate divergent RNA viruses in metagenomic sequence data. bioRxiv 10.1101/2022.02.28.482397. [DOI] [PMC free article] [PubMed]
  • 33.Fichorova RN, Lee Y, Yamamoto HS, Takagi Y, Hayes GR, Goodman RP, Chepa-Lotrea X, Buck OR, Murray R, Kula T, Beach DH, Singh BN, Nibert ML. 2012. Endobiont viruses sensed by the human host—beyond conventional antiparasitic therapy. PLoS One 7:e48418. 10.1371/journal.pone.0048418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Grybchuk D, Kostygov AY, Macedo DH, d'Avila-Levy CM, Yurchenko V. 2018. RNA viruses in trypanosomatid parasites: a historical overview. Mem Inst Oswaldo Cruz 113:e170487. 10.1590/0074-02760170487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Ghabrial SA, Nibert ML. 2009. Victorivirus, a new genus of fungal viruses in the family Totiviridae. Arch Virol 154:373–379. 10.1007/s00705-008-0272-x. [DOI] [PubMed] [Google Scholar]
  • 36.Ghabrial SA. 2008. Totiviruses. Encycl Virol 163–174. 10.1016/B978-012374410-4.00518-5. [DOI] [Google Scholar]
  • 37.Lefkowitz EJ, Dempsey DM, Hendrickson RC, Orton RJ, Siddell SG, Smith DB. 2018. Virus taxonomy: the database of the International Committee on Taxonomy of Viruses (ICTV). Nucleic Acids Res 46:D708–D717. 10.1093/nar/gkx932. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hartley MA, Ronet C, Zangger H, Beverley SM, Fasel N. 2012. Leishmania RNA virus: when the host pays the toll. Front Cell Infect Microbiol 2:99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ives A, Ronet C, Prevel F, Ruzzante G, Fuertes-Marraco S, Schutz F, Zangger H, Revaz-Breton M, Lye L-F, Hickerson SM, Beverley SM, Acha-Orbea H, Launois P, Fasel N, Masina S. 2011. Leishmania RNA virus controls the severity of mucocutaneous leishmaniasis. Science 331:775–778. 10.1126/science.1199326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Liu J-J, Xiang Y, Sniezko RA, Schoettle AW, Williams H, Zamany A. 2019. Characterization of Cronartium ribicola dsRNAs reveals novel members of the family Totiviridae and viral association with fungal virulence. Virol J 16:118. 10.1186/s12985-019-1226-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Fukuhara T. 2019. Endornaviruses: persistent dsRNA viruses with symbiotic properties in diverse eukaryotes. Virus Genes 55:165–173. 10.1007/s11262-019-01635-5. [DOI] [PubMed] [Google Scholar]
  • 42.Okada R, Kiyota E, Sabanadzovic S, Moriyama H, Fukuhara T, Saha P, Roossinck MJ, Severin A, Valverde RA. 2011. Bell pepper endornavirus: molecular and biological properties, and occurrence in the genus Capsicum. J Gen Virol 92:2664–2673. 10.1099/vir.0.034686-0. [DOI] [PubMed] [Google Scholar]
  • 43.Okada R, Yong CK, Valverde RA, Sabanadzovic S, Aoki N, Hotate S, Kiyota E, Moriyama H, Fukuhara T. 2013. Molecular characterization of two evolutionarily distinct endornaviruses co-infecting common bean (Phaseolus vulgaris). J Gen Virol 94:220–229. 10.1099/vir.0.044487-0. [DOI] [PubMed] [Google Scholar]
  • 44.Hacker CV, Brasier CM, Buck KW. 2005. A double-stranded RNA from a Phytophthora species is related to the plant endornaviruses and contains a putative UDP glycosyltransferase gene. J Gen Virol 86:1561–1570. 10.1099/vir.0.80808-0. [DOI] [PubMed] [Google Scholar]
  • 45.Sabanadzovic S, Wintermantel WM, Valverde RA, McCreight JD, Aboughanem-Sabanadzovic N. 2016. Cucumis melo endornavirus: genome organization, host range and co-divergence with the host. Virus Res 214:49–58. 10.1016/j.virusres.2016.01.001. [DOI] [PubMed] [Google Scholar]
  • 46.Roossinck MJ, Sabanadzovic S, Okada R, Valverde RA. 2011. The remarkable evolutionary history of endornaviruses. J Gen Virol 92:2674–2678. 10.1099/vir.0.034702-0. [DOI] [PubMed] [Google Scholar]
  • 47.Larsson ME, Bramucci AR, Collins S, Hallegraeff G, Kahlke T, Raina J-B, Seymour JR, Doblin MA. 2022. Mucospheres produced by a mixotrophic protist impact ocean carbon cycling. Nat Commun 13:1301. 10.1038/s41467-022-28867-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Roossinck MJ. 2010. Lifestyles of plant viruses. Philos Trans R Soc Lond B Biol Sci 365:1899–1905. 10.1098/rstb.2010.0057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Nibert ML, Ghabrial SA, Maiss E, Lesker T, Vainio EJ, Jiang D, Suzuki N. 2014. Taxonomic reorganization of family Partitiviridae and other recent progress in partitivirus research. Virus Res 188:128–141. 10.1016/j.virusres.2014.04.007. [DOI] [PubMed] [Google Scholar]
  • 50.Shiba K, Hatta C, Sasai S, Tojo M, T Ohki S, Mochizuki T. 2018. Genome sequence of a novel partitivirus identified from the oomycete Pythium nunn. Arch Virol 163:2561–2563. 10.1007/s00705-018-3880-0. [DOI] [PubMed] [Google Scholar]
  • 51.Nibert ML, Woods KM, Upton SJ, Ghabrial SA. 2009. Cryspovirus: a new genus of protozoan viruses in the family Partitiviridae. Arch Virol 154:1959–1965. 10.1007/s00705-009-0513-7. [DOI] [PubMed] [Google Scholar]
  • 52.Park D, Goh CJ, Kim H, Hahn Y. 2018. Identification of two novel amalgaviruses in the common eelgrass (Zostera marina) and in silico analysis of the amalgavirus +1 programmed ribosomal frameshifting sites. Plant Pathol J 34:150–156. 10.5423/PPJ.NT.11.2017.0243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Walker PJ, Siddell SG, Lefkowitz EJ, Mushegian AR, Adriaenssens EM, Dempsey DM, Dutilh BE, Harrach B, Harrison RL, Hendrickson RC, Junglen S, Knowles NJ, Kropinski AM, Krupovic M, Kuhn JH, Nibert M, Orton RJ, Rubino L, Sabanadzovic S, Simmonds P, Smith DB, Varsani A, Zerbini FM, Davison AJ. 2020. Changes to virus taxonomy and the statutes ratified by the International Committee on Taxonomy of Viruses (2020). Arch Virol 165:2737–2748. 10.1007/s00705-020-04752-x. [DOI] [PubMed] [Google Scholar]
  • 54.Telengech P, Hisano S, Mugambi C, Hyodo K, Arjona-López JM, López-Herrera CJ, Kanematsu S, Kondo H, Suzuki N. 2020. Diverse partitiviruses from the phytopathogenic fungus, Rosellinia necatrix. Front Microbiol 11:1064. 10.3389/fmicb.2020.01064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Pan Y, Cembella AD, Quilliam MA. 1999. Cell cycle and toxin production in the benthic dinoflagellate Prorocentrum lima. Mar Biol 134:541–549. 10.1007/s002270050569. [DOI] [Google Scholar]
  • 56.Burki F, Roger AJ, Brown MW, Simpson AGB. 2020. The new tree of eukaryotes. Trends Ecol Evol 35:43–55. 10.1016/j.tree.2019.08.008. [DOI] [PubMed] [Google Scholar]
  • 57.Bolch CJS, Subramanian TA, Green DH. 2011. The toxic dinoflagellate Gymnodinium catenatum (Dinophyceae) requires marine bacteria for growth. J Phycol 47:1009–1022. 10.1111/j.1529-8817.2011.01043.x. [DOI] [PubMed] [Google Scholar]
  • 58.Litaker RW, Vandersea MW, Faust MA, Kibler SR, Chinain M, Holmes MJ, Holland WC, Tester PA. 2009. Taxonomy of Gambierdiscus including four new species, Gambierdiscus caribaeus, Gambierdiscus carolinianus, Gambierdiscus carpenteri and Gambierdiscus ruetzleri (Gonyaulacales, Dinophyceae). Phycologia 48:344–390. 10.2216/07-15.1. [DOI] [Google Scholar]
  • 59.Kretzschmar AL, Larsson ME, Hoppenrath M, Doblin MA, Murray SA. 2019. Characterisation of two toxic Gambierdiscus spp. (Gonyaulacales, Dinophyceae) from the Great Barrier Reef (Australia): G. lewisii sp. nov. and G. holmesii sp. nov. Protist 170:125699. 10.1016/j.protis.2019.125699. [DOI] [PubMed] [Google Scholar]
  • 60.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Kopylova E, Noé L, Touzet H. 2012. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics 28:3211–3217. 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]
  • 62.Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A. 2011. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652. 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Li B, Dewey CN. 2011. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12:323. 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Buchfink B, Xie C, Huson DH. 2015. Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]
  • 65.Eddy SR. 2011. Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195. 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Bushmanova E, Antipov D, Lapidus A, Prjibelski AD. 2019. rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data. Gigascience 8:1–13. 10.1093/gigascience/giz100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. 2015. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31:1674–1676. 10.1093/bioinformatics/btv033. [DOI] [PubMed] [Google Scholar]
  • 68.Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647–1649. 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Jones P, Binns D, Chang H-Y, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn AF, Sangrador-Vegas A, Scheremetjew M, Yong S-Y, Lopez R, Hunter S. 2014. InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236–1240. 10.1093/bioinformatics/btu031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359. 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup . 2009. The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Boratyn GM, Thierry-Mieg J, Thierry-Mieg D, Busby B, Madden TL. 2019. Magic-BLAST, an accurate RNA-seq aligner for long and short reads. BMC Bioinformatics 20:1–19. 10.1186/s12859-019-2996-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG. 2011. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539–539. 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Price MN, Dehal PS, Arkin AP. 2010. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One 5:e9490. 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 32:268–274. 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. 2017. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods 14:587–589. 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental file 1

Tables SA1 to SA3 and Fig. SA1 to SA3. Download jvi.00783-22-s0001.pdf, PDF file, 0.5 MB (514.3KB, pdf)

Data Availability Statement

Corresponding RNA-seq read files are available on the SRA under BioProject PRJNA867582, with accessions SAMN30215649 to SAMN30215656. The viruses newly identified here are available at GenBank/NCBI under the accessions OP191686 to OP191695.


Articles from Journal of Virology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES