Abstract
Whole-genome sequencing is widely used to better understand the transmission dynamics, the evolution and the emergence of new variants of viral pathogens. This can bring crucial information to stakeholders for disease management. Unfortunately, aquatic virus genomes are usually difficult to characterize because most of these viruses cannot be easily propagated in vitro. Developing methodologies for routine genome sequencing of aquatic viruses is timely given the ongoing threat of disease emergence. This is particularly true for pathogenic viruses infecting species of commercial interest that are widely exchanged between production basins or countries. For example, the ostreid herpesvirus type 1 (OsHV-1) is a Herpesvirus widely associated with mass mortality events of juvenile Pacific oyster Crassostrea gigas. Genomes of Herpesviruses are large and complex with long direct and inverted terminal repeats. In addition, OsHV-1 is unculturable. It therefore accumulates several features that make its genome sequencing and assembly challenging. To overcome these difficulties, we developed a tangential flow filtration (TFF) method to enrich OsHV-1 infective particles from infected host tissues. This virus purification allowed us to extract high molecular weight and high-quality viral DNA that was subjected to Illumina short-read and Nanopore long-read sequencing. Dedicated bioinformatic pipelines were developed to assemble complete OsHV-1 genomes with reads from both sequencing technologies. Nanopore sequencing allowed characterization of new structural variations and major viral isomers while having 99,98 % of nucleotide identity with the Illumina assembled genome. Our study shows that TFF-based purification method, coupled with Nanopore sequencing, is a promising approach to enable in field sequencing of unculturable aquatic DNA virus.
Keywords: Crassostrea gigas, Illumina, ostreid herpesvirus type 1, oxford nanopore technologies, tangential flow filtration, virus
Data Summary
All sequence data have been deposited in NCBI under Bioproject ID PRJNA823871. Nanopore data have been deposited to SRA under accession number SAMN27363714 with base-called fastq reads under accession number SRR19025776 and raw fast5 and slow5 file under accession number SRR19039499; Illumina fastq reads have been deposited to SRA under accession number SAMN27363675. OsHV-1 assembled genome from Nanopore data have been deposited to GenBank under accession number ON411413; OsHV-1 assembled genome from Illumina data have been deposited to GenBank under accession number ON411412.
Impact Statement.
Several aquatic viruses are pathogenic to species of commercial interest, leading to massive value losses during disease outbreaks. To better understand the origin, transmission and propagation of such viruses, more genomic data is required and needs to be analysed. However genomic characterization of viruses that are mainly unculturable is a major challenge. In the present study, we have successfully tested the combination of tangential flow filtration (TFF) and Nanopore sequencing for virus purification and genome sequencing. Our results demonstrate that it is now possible to easily sequence the complete genome of large DNA viruses directly from host infected tissues without prior in vitro propagation.
Introduction
Viruses found in aquatic environments are often unculturable and their genomic characterization often relies on our ability to enrich samples for viral nucleic acid. To date, several methods have been implemented to this end and different workflows are applied depending on the type of virus studied: small or large virus, RNA or DNA virus, virus with linear or circular genome, single or double strand virus. Enrichment methods range from rapid and simple steps involving low-speed centrifugation, filtration and nuclease treatment, to more complex methods involving ultracentrifugation, tilling PCR, rolling circle amplification and hybridization-based capture. While the former shows higher efficiency at purifying viral nucleic acid, these methods habitually rely on a priori knowledge of the targeted virus and are limited at detecting a new variant or new pathogen [1–4]. Recently, metagenomic next-generation sequencing (mNGS) has been successfully tested for broad-spectrum pathogen identification for clinical diagnosis of infectious diseases, outbreak surveillance, and pathogen discovery [5, 6]. However, the cost, sequencing depth and background contamination are true limitations to the accuracy of mNGS-based diagnostics relative to specific targeting testing.
Given that aquaculture is continuously faced with the challenge of identifying novel pathogenic viruses infecting different fish, crustaceans, and shellfish species, there is an urgent need to develop tools enabling those viruses to be sequenced easily and with limited bias to develop timely disease control strategies.
Ostreid herpesvirus type 1 (OsHV-1) is a double strand DNA virus belonging to the Malacoherpesviridae family and is the only species of the genus Ostreavirus [7]. OsHV-1 has been associated with mass mortality events of naive Pacific oyster Crassostrea gigas [8] from larvae to adults [9, 10]. It has plagued the oyster production worldwide, from Europe to north and south America, Australia and New Zealand, and in a lesser extent in Asia, for more than a decade [11–17]. OsHV-1 mostly affects spat and juvenile (less than 18 months) in oyster farms, and leads to mass mortalities that can reach 100 % within days [18–20]. Research efforts have revealed a series of factors contributing to the disease, including seawater temperature, oyster age and genetics [8, 9, 16, 18, 21–24]. Recently, holistic molecular approaches uncovered the pathogens associated to Pacific oyster mortality syndrome (POMS) [25]. These studies showed that an infection by OsHV-1 is the critical step of the infectious process leading to an immune-compromised state followed by a microbiota destabilization that ‘opens the door’ to opportunistic bacterial pathogens (e.g. Vibrio spp.) that target haemocytes to induce their lysis [26]. The infectious process is completed with subsequent bacteraemia ultimately leading to oyster death (reviewed in [27]).
Since the characterization of the emergent genotype OsHV-1 µVar, associated with the 2008 high mortality events [18], several variants of the µVar genotype have been described (reviewed in [28]). However, most studies on OsHV-1 genetic diversity have been based on PCR molecular markers focusing on a few variable regions of the viral genome, which do not reflect the whole genomic diversity [29, 30]. Indeed, OsHV-1 µVar whole-genome sequencing [31] and its comparison with the reference genome published in 2005 [7] showed that both genomes differed at the nucleotide level (single nucleotide polymorphism, SNP) as well as at the structural level by the loss or gain of several genomic regions carrying many open reading frames (ORFs). This observation indicates that whole-genome sequencing is needed to fully reveal viral diversity and better understand the structure and evolution of the virus populations [31].
OsHV-1 particles harbour a linear genome of approximately 207 kilobases (kb) with the majority of the 128 ORFs detected (54.7 %), having no known homologies with other organisms [31]. OsHV-1 genomic organization is characteristic of other Herpesviruses consisting of two invertible unique regions (UL for ‘Unique long’ 170 kbp; US for ‘Unique small’ 3.5 kbp) each flanked by tandem and inverted repeats (TRL/IRL: 7.5 kbp; TRS/IRS: 9.7 kbp) separated by a third unique region (X: 1.5 kbp) with a global genomic organization TRL–UL–IRL–X–IRS–US–TRS. Furthermore, several isomers of the virus cohabit at equimolar levels in the same samples [7].
To date, 50 fully assembled OsHV-1 genomes are available in public databases, those genomes have been sequenced and assembled using various protocol and sequencing technologies (Table S1, available in the online version of this article). Although recent genomes have been obtained from short-read mNGS [31–34], OsHV-1 complete genome assembly remains a complex process. Indeed, a short-read approach is ideal for the identification of small-scale variation like SNPs but this technology struggles to identify variations at a larger scale like structure variations (SV) or large inverted and repeated regions. Due to its intrinsic Herpesvirus genome architecture with high G+C content regions, long repeated regions (TRL/IRL and TRS/IRS), and the presence of four major isomers differing in the relative orientations of the two unique regions, it is extremely difficult, if not impossible, to solve the complete structure of OsHV-1 genome using short-read sequencing [35].
These limitations can now be addressed by long-read sequencing approaches of which Oxford Nanopore Technologies (ONT) is one example. Indeed, ONT has already demonstrated its ability to sequence and assemble several types of viruses from RNA [36–42] as well as DNA [43–46] matrices. Nanopore sequencing has also been successfully tested on Herpesviruses like the Human Herpes Virus Type 1 (HHV-1) [47], the Human cytomegalovirus (HCMV) [48], the Equine herpesvirus 1 (EHV-1) [49] and the koi herpesvirus (KHV) [50]. In addition, ONT already allowed researchers to sequence viruses infecting aquaculture species like the White spot syndrome virus (WSSV) infecting shrimp [51] or the Salmonid alphavirus (SAV) infecting Atlantic salmon [52]. Finally, ONT has also demonstrated efficiency to sequence viruses from sea-water samples [53–55]. However, it has never been tested for the characterization of the OsHV-1 genome.
In order to implement ONT long-read sequencing, high molecular weighted DNA is required in a large quantity. However, due to the lack of a cell line, that could be used for the propagation and isolation of viral particles, OsHV-1 must be extracted from infected animal tissues. Therefore, it is extremely challenging to obtain viral DNA in sufficient quantity to perform long-read mNGS.
To overcome these difficulties, we need a new approach for viral particle purification coupled with long-read sequencing to assemble complex OsHV-1 genomes efficiently and accurately.
One approach to purify viral particles is tangential flow filtration (TFF). This method has been widely used for purification of a large variety of virus particles, such as particles of human influenza A virus [56], insect baculovirus [57, 58], HIV-like virus [59], oncolytic measles virus [60] and of several waterborne viruses like norovirus and adenovirus [61]. While TFF has been used to reduce oyster pathogen concentration in seawater to bio-secure shellfish farms [62], it has never been used with the goal of purifying those pathogens. Recently, TFF coupled with Nanopore sequencing, has been used to concentrate a large amount of seawater for the characterization of bacteriophage population [63].
In the present study, we have developed a TFF-based method allowing purification and concentration of OsHV-1 infectious particles from infected oyster tissues and we have evaluated the relative benefits of long-read (Nanopore) and short-read (Illumina) sequencing for genome assembly. Our results show that TFF purification of OsHV-1 viral particles coupled with ONT sequencing is a promising strategy for OsHV-1 whole-genome reconstruction and to decipher genetic diversity and evolution of virus populations.
Methods
Oyster production
Experimental C. gigas spat was produced at the Ifremer hatchery in La Tremblade, (Marennes-Oléron Bay, Charente-Maritime, France). Oysters belonged to a line that had been characterized for its higher susceptibility to OsHV-1 [64]. Thirty adult oysters were conditioned in seawater at 20 °C for 2 months to favour gametogenesis. Then, ripe oysters were triggered for a mass spawning event in March 2015. Larvae and spat were cultured as described by [64, 65] and they were always maintained in our controlled facilities using filtered and UV-treated seawater to protect from pathogens naturally present in the Marennes-Oléron Bay. Prior to the experiment, spat (2 g and 4 cm) were acclimatized at 19 °C for 2 weeks in flow-through 120 l tanks using filtered and UV-treated seawater enriched in phytoplankton (Skeletonema costatum, Isochrisis galbana and Tetraselmis suecica).
Viral inoculum preparation and OsHV-1 quantification
OsHV-1 inoculum were prepared from tissue of infected oysters following protocols of Schikorsky et al. [66]. Briefly, gills and mantle were dissected from moribund oysters naturally or experimentally infected with OsHV-1 in previous trials and frozen at −80 °C. Then, ten volumes of 0.2 µm filtered and artificial seawater (ASW) were added and tissues were crushed on ice using an Ultraturax mixer (3×5 s). The tissue mash was clarified by centrifugation (1000 g , 5 min, 4 °C), and then filtered consecutively in sterile conditions using syringe filters at 8, 0.45 and 0.22 µm pore sizes (Millipore; Billerica, USA).
Quantification of OsHV-1 was carried out using quantitative PCR (qPCR) using primers amplifying a 197 bp fragment of the OsHV-1 DNA polymerase (DP) gene. DNA was extracted from viral inoculum using the QiaAmp DNA Mini Kit (Qiagen) according to the manufacturer’s protocol. DNA purity (1.8 ≤ A260 nm/280 nm ≤ 2.0; 2.0 ≤ A260 nm/230 nm ≤ 2.20) and concentration were checked using a Nano-Drop ND-1000 spectrometer (Thermo Scientific) and Qubit® dsDNA HS assay kits (Molecular Probes Life Technologies), respectively.
qPCRs were performed in duplicate using a Mx3005 P Thermocycler (Agilent). Amplification reactions were each performed in a total volume of 20 µl. Each reaction contained 5 µl of DNA extracted from viral inoculum or from TFF fractions (readjusted to initial inoculum volume) or 5 µl of Milli-Q water for the negative control, 10 µl of Brilliant III Ultra-Fast SYBR® Green PCR Master Mix (Agilent), 2 µl of primer OsHVDP For (forward) 5′-ATTGATGATGTGGATAATCTGTG-3′ and 2 µl of primer OsHVDP Rev (reverse) 5′-GGTAAATACCATTGGTCTTGTTCC-3′ [67] at the final concentration of 550 nM each, and 1 µl of Milli-Q water. qPCR cycling conditions were as follows: 3 min at 95 °C followed by 40 cycles of amplification at 95 °C for 5 s and 60 °C for 20 s. For absolute quantification, the amplification product of the DP gene was cloned into the pCR4-Topo vector and replicated in Escherichia coli DH5α (Invitrogen). Plasmid was extracted using the Wizard Plus SV miniprep DNA purification system (Promega) and standard curves of known concentration of plasmid were generated according to the Applied Biosystems manual for absolute real-time PCR quantification. Absolute quantification of OsHV-1 genome copies in oyster samples was then estimated by comparing the observed crossing point (Cp) values to known plasmid standards from 103 to 109 copies of the DP product. The results were expressed as the log-transformed copy number of viral DNA per micro litter of DNA extracts (cp μl−1).
Virus-like particle quantification by epifluorescence microscopy
Virus-like particles (VLP) were quantified using epifluorescence microscopy after staining with nucleic acid dye [68, 69]. Briefly, viral inoculum were first fixed with glutaraldehyde (Sigma-Aldrich G6257, final concentration 0.5 %) and diluted in 1 % potassium citrate solution (pH 7) to prevent aggregation of viral particles. Inoculum were vacuum filtrated to capture viruses on a 0.02 µm aluminium oxide filter (Whatman 6809–6002 Anodisc 25 inorganic membrane filter), before subsequent staining (SYBR Gold nucleic acid gel stain Invitrogen S11494, diluted at 0.2 % in Tris-EDTA) and washing by 4 ml of 1 % potassium citrate solution. Filters were then mounted on glass slides and VLP were observed and enumerated under epifluorescence microscopy (Olympus BH2-RFCA).
Experimental infections
For experimental infections, oysters were previously myorelaxed in hexahydrate MgCl2 (ACROS, 50 g l−1) [70]. Oysters were injected using 26-gauge needle attached to a multi-dispensing hand pipette, with 100 µl of inoculum (diluted in ASW to reach the desired viral concentration) in the abductor muscle. For each experiment, a group of oysters was myorelaxed and injected with ASW as the control. Oysters were replaced in seawater in plastic tanks at 20 °C and mortality was monitored daily.
Virus purification and concentration by tangential flow filtration (TFF)
To develop a TFF-based process for the concentration of OsHV-1 particles, we used MicroKros® hollow fibre filter modules (Spectrum®) with three different pore sizes: 750 kDa (CO2-E7500-05-N), 500 kDa (CO2-E500-05-N) and 300 kDa (CO2-E300-05-N).
The first step of the process was to determine the pore size of the appropriate membrane to retain the OsHV-1 particles. To this end, we have filtered 10 ml of OsHV-1 inoculum containing 5.9×104 genomic units (GU)/µl with the 750 kDa TFF-device until the volume reached 1 ml. This 1 ml retentate (R), consisting of particles larger than the pore size of the hollow fibre, was diluted to the initial volume (10 ml) with filtered (0.02 µm) seawater and stored at 4 °C. The permeate (P), composed of particles smaller than the pore size of the hollow fibre, was collected, the volume adjusted to 10 ml with filtered (0.02 µm) seawater and was then filtered on the 500 kDa TFF device. The 500 kDa retentate was then processed similarly to the 750 kDa retentate and the 500 kDa permeate was then filtered on the 300 kDa TFF-device. At the end of this filtering process, we obtained three retentates (R-750; R-500 and R-300) and one permeate (P-300) (Fig. 1a).
The second step consisted of purifying and concentrating OsHV-1 particles to extract enough DNA for both short- and long-read sequencing. For this purpose, we prepared a viral inoculum (see Viral inoculum preparation and OsHV-1 quantification for details) from 31.12 g of tissue obtained from 28 susceptible oysters (family F15) naturally infected by OsHV-1 in the Thau Lagoon in 2016 and stored at −80 °C (for more details on oyster origin and infection process see [71]). We obtained 462 ml of 0.22 µm filtrated inoculum, which were processed on a 500 kDa hollow fibre filter. Concentrated viral particles contained in 82 ml of the 500 kDa retentate were recovered by centrifugation (2 h at 18 000 r.p.m. at 4 °C). Pellets were then re-suspended in 1.2 ml of PBS and DNA was extracted and quantified as described above. We finally obtained 17.9 µg of viral DNA.
Illumina short-read sequencing
Illumina DNA-Seq library preparation and sequencing were performed by the Genome Quebec Company (Genome Quebec Innovation Center, McGill University, Montreal, Canada) using 120 ng of DNA extracted from the OsHV-1 inoculum concentrated on 500 kDa TFF-device [as described at the end of Virus purification and concentration by tangential flow filtration (TFF)], the NxSeq AmpFREE Low DNA Library Kit (Lucigen) and the NovaSeq 6000 Sequencing system (Illumina®) (paired ends, 150 bp).
Nanopore long-read sequencing
For long-read sequencing, we used the Oxford Nanopore Technologies (ONT) MinION platform (Mk1b). Briefly, 10 µg of DNA extracted from the OsHV-1 inoculum concentrated on 500 kDa TFF-device [as described at the end of Virus purification and concentration by tangential flow filtration (TFF)] were fractionated on Agencourt AMPure XP beads (Beckmann Coulter, 0.5 × final concentration) to eliminate small DNA molecules (< 300 bp). DNA size range analysis on Agilent Fragment Analyzer (Genomic DNA analysis kit DNF-467–33) revealed that the average size was over 60 kb with a genomic quality number (GQN) of 8.9 (threshold 10 kb). Then, 400 ng of AMPure purified DNA were used to construct the sequencing library using SQK-RAD0004 tagmentation kit and sequencing was performed as per manufacturer’s guidelines using R9.4 flowcells (FLO-MIN106) and run for 20 h.
Illumina genome assembly
Illumina raw reads files were quality filtered (PHRED score >31) and adapters trimmed with Fastp version 0.20.1 [72]. To evaluate the origin of reads, fastq reads were aligned to the oyster C. gigas genome (accession: GCA_902806645.1) and to OsHV-1 µVar A genome (accession: KY242785.1) with Bowtie2 version 2.4.1 [73]. All the non-oyster reads were then selected with Samtools version 1.9 [74] and Seqtk version 1.2 (https://github.com/lh3/seqtk). Those reads were then used for de novo assembly with SPAdes version 3.14.0 with parameter –meta [75]. The de novo OsHV-1 contigs generated by SPAdes were then identified with blastn version 2.6.0 [76] using OsHV-1 µVar A sequence as a query (accession number: KY242785.1). The identified OsHV-1 contigs were then manually scaffolded in Geneious Prime (version 2020.2.3) to reconstruct the common virus architecture (UL-IRL-X-IRS-US) [7]. To overcome the problem of inverted repeated regions that cannot be confidently positioned with short-read data, we constructed non-redundant genomes (NR genomes) that contain only one copy of each repeated region (UL-IRL-X-IRS-US) an approach commonly used for Herpesvirus genome analyses [34, 77, 78]. Quality filtered reads were then aligned to the new constructed OsHV-1 NR genome to evaluate coverage uniformity. This pipeline is summarized in Fig. 2a.
Nanopore genome assembly
Nanopore fast5 file were GPU base called with Guppy version 5.0.7 with standard parameters except -config dna_r9.4.1_450bps_hac.cfg (https://community.nanoporetech.com/downloads). Fastq reads were then 100 bp trimmed at their beginning and at their end to ensure proper adapter removal with SeqTk version 1.3 with parameters -b 100 and -e 100 (https://github.com/lh3/seqtk). To assign reads to either host or viral origin, fastq reads were aligned to the oyster C. gigas genome (accession: GCA_902806645.1) and to OsHV-1 µVar A genome (accession: KY242785.1) with Minimap2 version 2.22 with standard parameters [79]. Then two types of de novo assemblies were performed with non-oyster reads with two different de novo assemblers. All fastq reads or only the top 10 % of the longest reads were used as input for de novo assembly with Flye version 2.8.3 and parameters -meta [80] or with Canu version 2.1.1 and parameters -nanopore [81]. OsHV-1 de novo assembled contigs were then identified with blastn [76] against the OsHV-1 µVar A genome and were manually trimmed in Geneious version 2020.2.4 (https://www.geneious.com). All de novo contigs plus the new assembled OsHV-1 genome were then polished using Nanopore reads with three run of Racon version 1.4.20 with parameters --no-trimming -m 8x−6 g −8 w 500 [82] and one run of Medaka version 1.4.3 with parameters medaka_consensus (https://github.com/nanoporetech/medaka).
For OsHV-1 isoform coverage evaluation, the four contigs produced by Canu with the top 10 % of the longest reads were trimmed on the UL side to keep a 10 kb UL fragment with Geneious version 2020.2.4 (https://www.geneious.com). The four major isomers were then used as a reference for reads mapping with Minimap2 version 2.22 with standard parameters -axe map-ont (https://github.com/lh3/minimap2). Samtools version 1.10 [83] were then used to filter out all the shortest aligned read and all the alignment with MapQ <40, keeping only the reads spanning at least a fragment of the UL and a fragment of the US (i.e. 28 kb long). A pipeline summary can be found in Fig. 2b.
Non-OsHV-1 contigs' classification
Polished non-OsHV-1 contigs with a size larger than 10 kb were then subjected to taxonomic identification and classification with phaster [84] accessed in September 2021, with blastn version 2.0.9 with parameters -max_hsps 1 -max_target_seqs 100 -evalue 0.0001 [76] or with diamond blastx [85] version 2.0.6 with parameters --sensitive --max-hsps 1 --max-target-seqs 0 --range-culling -F 15 --evalue 0.0001 against the refseq database accessed in September 2021.
Comparison of Illumina and Nanopore OsHV-1 genomes
Before whole-genome alignment evaluation, both genomes were modified to fit the same isomers' version of the genome (i.e. TRL/UL->/IRL/X/IRS/US->/TRS). To evaluate the capacity of Nanopore sequencing to produce an accurate OsHV-1 genome, the manually constructed Illumina OsHV-1 non-redundant (NR) genome was aligned with the manually curated Nanopore OsHV-1 complete genome with maftt version 1.4.0 using default parameters [86] implemented in Geneious version 2020.2.4 (https://www.geneious.com).
OsHV-1 phylogeny
In order to illustrate the global phylogenetic relationships among our and other available genomes, 13 OsHV-1 NR-genomes (11 available in GenBank: AY509253, GQ153938, KP412538, KY242785, KY271630, MG56175, MF509813, OM811579, OM811584, OM811586, OM811594 and the two from this study) were aligned with progressiveMauve algorithm [87] implemented in Geneious version 2020.2.4 to local collinear blocks. Reordered genomes block were then aligned with MAFTT version 7.4 using default parameters [86] implemented in NGPhylogeny.fr [88]. Alignment was cleaned with BMGE with default parameters [89] implemented in NGPhylogeny.fr. The best fitting model of nucleotide substitution was chosen according to the Akaike Information Criterion with jModelTest version 2.1.10 with default parameter [90]. Tree was constructed with phyML version 3.3 [91] with default parameters except ‘-b 100 m HKY85 -f m -c 8 s BEST -a e -v e -o tlr’ implemented in NGPhylogeny.fr. Phylogenetic tree was midpoint rooted and arranged in FigTree version 1.4.4 (https://github.com/rambaut/figtree).
Results
Calibration of tangential flow filtration (TFF) device to purify and concentrate OsHV-1 particles
To develop a tangential flow filtration-based process for the purification and concentration of OsHV-1, we tested TFF-devices with three membrane pore sizes [molecular weight cut-off (MWCO) of 750, 500 and 300 kDa]. Then, we used an OsHV-1 inoculum as starting material and successively filtered it through the three TFF-devices (Fig. 1a; see Methods for further detail). Finally, we have obtained three retentates (R-750, R-500 and R-300) and one permeate (P-300), which were then used to analyse the OsHV-1 load and infectivity.
To evaluate the ability of the three TFF-devices to retain OsHV-1, we have quantified OsHV-1 load by qPCR in the initial inoculum, in the three retentates and in the permeate (Fig. 1b). The number of OsHV-1 genomic units (GU) in the R-750 fraction (5.6×104 GU) was of the same order of magnitude as it was in the initial inoculum (5.9×104 GU), which suggests that most of the viruses (95 %), were retained by 750 kDa MWCO. However, a small fraction (4.8 %) of GUs present in the initial inoculum were also found in the R-500 retentate (2.8×103 GU) and traces of amplification (below standard curve) were also observed in the R-300 (2.9×10 GU). Epifluorescence microscopy did not reveal virus-like particles (VLP) in the R-300 and P-300 fractions (data not shown) while in the R-750 and R-500 fractions we were able to enumerate 2.71×104 (7.05 x 103±sd) and 4.88×103 (2.79×103±sd) VLP µl−1, respectively (Fig. 1c). Epifluorescence microscopy gave similar results to qPCR and confirms that most of the viral particles present in the starting inoculum were preserved in the R-750 fraction.
Finally, we compared the infectious capacity of the different fractions with the initial inoculum by injecting them into oysters. No mortality was observed for oysters injected with fractions R-300 and P-300 as well as for oysters injected with filtered sea water (Fig. 1d). However, oysters that had been injected with R-750 and R-500 fractions exhibit survival curves, which are not significantly different from those of oysters injected with the starting inoculum. Interestingly, survival curves of R-750 and R-500 fractions are not significantly different although R-500 fraction contained around 20 times less OsHV-1 GU than the R-750 fraction. These results showed that filtration on 500 kDa TFF-device allowed the purification and concentration of fully infective OsHV-1 viral particles. Consequently, we used the 500 kDa TFF-device to purify and concentrate viral particles from an inoculum prepared from 28 oysters associated from mortality event, which have occurred in 2016. We have been able to extract ~18 µg of viral DNA, an amount sufficient to perform both short- and long-read sequencing (see Methods for further detail).
General statistics of the two sequencing technologies
Viral DNA extracted from TFF purified viral particles was sequenced using two different sequencing technologies: (i) a short-read approach based on the Illumina platform and (ii) a long-read approach based on the Nanopore platform.
The emergence of mNGS, has allowed researchers to put aside complex techniques such as ultracentrifugation coupled to sucrose gradient for OsHV-1 genome sequencing. The direct consequence of the lack of viral purification is the production of a large amount of non-viral data, resulting in the waste of a significant portion of the sequencing run. In recent studies, sequencing of the OsHV-1 genome from DNA extracted directly from infected tissues has led to the acquisition of only 0.01–13.21% OsHV-1 reads (Table S1).
As expected TFF purification of OsHV-1 particles allowed both sequencing technologies to obtain a large enrichment in OsHV-1 reads compared to host reads, leading to a mean coverage of 740 reads/bp and 213 790 reads/bp for Nanopore and Illumina sequencing, respectively. In more details, the Illumina sequencing run yielded more than 304 million paired-end reads of 150 base pairs (bp) corresponding to 45.68 Gb and distributed as follows: 52 % of OsHV-1 reads, 40 % of C. gigas reads and 8 % of unaligned reads with a similar proportion of sequenced base number (Fig. 3a). The Nanopore sequencing run yielded 75 403 reads corresponding to 0,19 Gb and distributed as follows: 62 and 82 % of OsHV-1 reads and bases, respectively, 4 and 0.7 % of C. gigas reads and bases, respectively, and 33 and 17 % of unaligned reads and bases, respectively (Fig. 3a). Since the Nanopore technology sequence native DNA molecules present in the DNA library, a large range of reads' size has been sequenced. In detail, OsHV-1 reads ranged from 42 to 86 234 bp with a N50 of 11 209 bp, C. gigas reads ranged from 45 to 15 635 bp with a N50 of 724 bp and unaligned reads ranged from 1 to 70 966 bp with a N50 7 329 bp (Fig. 3b). The difference in N50 between host and viral DNA (11 209 vs 724 bp) was probably due to oyster DNA fragmentation during purification steps while viral DNA was protected by the viral envelope and capsid. This observation has probably led to the observed difference in host reads' percentage between both sequencing approaches. Indeed, before and during the Nanopore libraries' construction several Ampure XP bead-based DNA purification were performed, and this type of DNA purification is known to induce a loss of small size DNA molecules.
OsHV-1 genome assembly using Illumina reads
Our short-read pipeline based on the de novo assembler metaSPAdes produced five OsHV-1 contigs ranging from 2 429 to 99 197 bp (Fig. 4a). Those five contigs were then manually assembled to construct an OsHV-1 NR genome in which only one copy of each repeat was conserved (UL-IRL-X-IRS-US, see Methods for further details). The re-alignment of the Illumina reads on the new OsHV-1 NR genome allowed us to correct the junction between contigs C1191 and C811 (addition of one A/T bp) and to confirm that the other metaSPAdes contigs were correctly assembled.
As expected, we observed that the reads' coverage within IRL and the IRS is twice the reads' coverage for unique regions UL, US and X given that the NR genome contains only one copy of theses duplicated regions (Fig. 5 upper panel). Interestingly, two decreases of reads' coverage were observed (Fig. 5 upper panel). The first decrease of coverage was at the junction of the C22 and C10 contigs and takes place in a stem-loop suspected to be the viral origin of replication [7]. The presence of this stem-loop could have induced a difficulty to access the DNA in this region by the sequencing polymerase leading to a decrease of sequencing yield around this palindromic region and the production of two distinct contigs by the de novo assembler (Fig. S1). The second decrease of reads' coverage takes place within the C135 contig, in a long C22 homopolymer of the IRL region. This type of bias in read coverage has already been observed for long homopolymer sequenced by Illumina [92] (Fig. 5 upper panel and Fig. S2).
OsHV-1 genome assembly using Nanopore reads
Since this is the first time that an OsHV-1 genome has been assembled from ONT long-read, we decided to evaluate the two most used long-read de novo assembler Flye and Canu with two datasets: (i) all the Nanopore reads and (ii) only the top 10 % longest Nanopore reads (see Methods for further detail) (Fig. 4b).
Whatever the dataset used, Flye yielded the same two contigs; a long one (164 kb) corresponding to the complete UL and a short one (40 kb) made of the US surrounded by one copy of IRL-X-IRL on its left side and one copy of TRS-X-TRL on its right side (Fig. 4b). With the dataset including all Nanopore reads, Canu yielded one large OsHV-1 contig (262 kb). This contig is made of one fragment of the UL (32 kb) followed by a fragment made of the US surrounded by one copy of IRL-X-IRS on its left side and one copy of TRS-X-TRL on its right side (40 kb), followed by the complete UL (164 kb) and followed by another fragment made of US surrounded by one copy of IRL-X-IRS on the left side and one copy of the TRL (24 kb) on its right side (Fig. 4b).
With the dataset composed of only the top 10 % longest Nanopore reads, Canu yielded six OsHV-1 contigs ranging from 28 to 210 kb (Fig. 4b). The 28 kb contig consisted of the US surrounded on its left side by one copy of TRL-X-IRS and on its right side by one copy of TRS. The four contigs ranging from 45 to 54 kb share the same general structure; that is to say, a fragment of the UL region followed by a copy of IRL-X-IRS-US-TRS. These four contigs are differentiated by the relative orientation of the UL and US fragments (same or inverted orientations) and their orientation with respect to the IRL-X-IRS fragment (Fig. 4b). It was likely that these four contigs correspond to the four major OsHV-1 isomers described by Davison et al. [7]. In order to verify this observation and to quantify the relative abundance of the four isomers, we have screened all the reads long enough to cover at least a part of both UL and US regions. We identified 92 long-read divided into 18 <-UL/<-US, 24 UL->/US->, 26 <-UL/US-> and 24 UL->/<-US (Fig. 6). This result confirms the OsHV-1 genome isomerization and shows that the viral population is composed of approximately equimolar amounts of the four isomers. Finally, Canu assembled one large contig (210 kb) made of the complete UL surrounded by one copy of IRS-X-TRL on its left side and one copy of IRL-X-IRS-US-TRS on its right side (Fig. 4b). This former contig have been manually curated to create the complete ONT OsHV-1 genome by removing the IRS-X fragment on the left side of the UL and by reverse complement the UL to fit with the most used representation of OsHV-1 genomes isomer [7].
To assess the robustness of the manual assembly, we aligned all ONT reads back to the ONT OsHV-1 genome and observed a continuity of coverage throughout the genome, which validates the assembly of the ONT OsHV-1 genome with our pipeline (Fig. 5 lower panel). Interestingly, we can see a small decrease of coverage at both extremities of the genome and an increase of coverage on IRL and IRS at the junction surrounding the X region (Fig. 5 lower panel). This observation reflects the use of one of the 16 minor isomers described by Davison et al. to represent our ONT OsHV-1 genome [7]. Indeed, in addition to the four isomers described previously, Davison et al. showed that some of these isomers contained an additional copy of the X region at the left or right terminus whereas some might have lacked the X region internally. Since only one copy of the X region is presented in the ONT OsHV-1 genome, the reads corresponding to the minor variant of the X region might tend to align around the X region between IRL and IRS instead of aligning at the extremities of the genome (TRL and TRS) leading to a higher coverage of the region surrounding X compared to the rest of the genome (Fig. 5 lower panel).
Taken together these results validates the assembly of the ONT OsHV-1 genome, confirms for the first time the OsHV-1 genome isomerization and shows that the viral population is composed of approximately equimolar amounts of the four major isomers.
Comparison of Illumina OsHV-1 NR genome and ONT OsHV-1 genome
Since it is almost impossible to assemble a complete OsHV-1 genome from short-read technology due to the presence of large inverted repeats, and because long-read technology tends to produce longer OsHV-1 genome than expected, both technologies still requires manual editing to join the different fragments or to remove extra repeats to obtain a genome structure as accurate as possible. Furthermore, ONT is known to produce a high error rate on raw sequences compared to the Illumina technology. Here, rather than looking at the raw sequence error rate, we attempted to quantify the residual error by comparing to each other, the polished ONT OsHV-1 genome, and the Illumina OsHV-1 NR genome. To do so, we used a pairwise alignment and were able to identify a total of 187 bp difference between the two genomes (Fig. 7).
Those differences could be classified in three categories: (i) SNPs, (ii) copy number variation of a 115 bp repeat and (iii) large homopolymer size variation. We observed a total of 31 SNPs between the two genomes, 24 SNPs correspond to InDel, which take place at the extremity of small homopolymers (from 5 to 11 bp), and 7 SNPs were distributed elsewhere in the genome. The 115 bp copy number variation takes place into the IRS region. While the Illumina OsHV-1 genome contains two copies of the 115 bp sequence, the ONT OsHV-1 genome harbour three copies of this repetition (Fig. S3). Finally, we observed an homopolymer size variation in the IRL region corresponding to 41 bp insertion in the ONT OsHV-1 genome compared to Illumina OsHV-1 genome. Since sequencing error in homopolymer is a well-known caveat of Nanopore sequencing [93], and because long homopolymer are known to be error prone also in Illumina sequencing [92, 94], we have performed a careful analysis of the read alignment for both technologies in this specific region. Interestingly, alignment observation in this large homopolymer identify heterogeneity in homopolymer size at the level of individual reads for both sequencing technologies. While size variation for Illumina reads ranged from 7 to 22 bp, size variation for Nanopore reads ranged from 5 to 98 bp (Fig. S2A and B respectively).
Because of the impossibility to accurately distinguish error in this large homopolymer for both Illumina and Nanopore sequencing, and because we cannot accurately assign the number of Cs inside this large homopolymer, we have decided to keep the most represented number of Cs for each of the sequencing techniques. We therefore did not consider this large homopolymer in our final pairwise homology comparison of both sequencing technologies.
Finally, if we disregard the additional 115 bp repeat highlighted by ONT sequencing, the two de novo assembled genomes only differ by a total of 31 bp and show 99.98 % of nucleotide identity.
OsHV-1 phylogenetic analysis
In order to determine the relationships of the OsHV-1 genomes from this study with those known so far, we constructed a phylogenetic tree from the alignment of the two genomes we generated in the present study (viruses collected in 2016), seven OsHV-1 genomes available in public databases (viruses collected before 2016) and four representatives of the 21 genomes acquired by Delmotte-Pelletier et al. [34] (viruses collected in 2018). This analysis confirms that the OsHV-1 virus sampled in Thau lagoon (France) in 2016 belongs to µVar clade and that genomes reconstructed from the same sample with either the Nanopore or the Illumina sequencing data are really close to each other (Fig. 8).
Non-OsHV-1 contigs' classification
Interestingly, our TFF-based purification process led to the characterization of other viral populations present in oyster tissues. To obtain a global overview of this viral metagenome, we made an extensive taxonomic classification of the non-OsHV-1 contigs produced by both ONT and Illumina sequencing. While the Nanopore sequencing produced 22 non-OsHV-1 contigs (from 14 374 to 163 592 bp) (Fig. 4b, Table S2A), Illumina sequencing produced 222 non-OsHV-1 contigs with size greater than 10 kb (from 10 017 bp to 172 478 bp) (Fig. 4a, Table S2B). Among the 22 Nanopore contigs 14 were also detected in the Illumina non-OsHV-1 contigs (Table S2A). Nanopore identified 20 contigs coding for putative bacteriophages, 1 contig coding for a CRESS DNA virus and 1 contig coding for the C. gigas mitochondrial DNA. Illumina identified 202 contigs' coding for putative bacteriophages, 19 contigs with no hit and 1 contig coding for the C. gigas mitochondrial DNA (Table S2B).
Discussion
Viral whole-genome sequencing has recently become a standard in molecular epidemiology and evolutionary genomics, however this approach can be challenging when implemented on DNA viruses with large and complex genomes such as Herpesviruses [95, 96]. Indeed, it is often difficult to purify these types of viruses in sufficient quantity to allow whole-genome sequencing and reconstruction. Moreover, whole-genome assembly is a real burden considering the viral complexity due to the presence of long inverted repeated regions and structural variants. This is even more challenging to achieve for uncultured viruses derived from clinical samples or from livestock.
We implemented TFF to purify and to concentrate OsHV-1. To this end, we tested TFF devices with different pore sizes and quantified the virus load and infectious capacity of each fraction. Our results showed that TFF device with a MWCO of 500 kDa was able to retain and to concentrate most the OsHV-1 infectious particles contained in the inoculum. Interestingly, viral load quantification by real-time PCR and epifluorescence microscopy were of the same order of magnitude suggesting that TFF not only concentrates the virus but also eliminates most of the free DNA in the starting inoculum as previously suggested by Loewe et al. [60]. This is supported by the tremendous increase of the signal (OsHV-1 reads) to noise (oyster reads) ratio obtained in the sequencing data. Indeed, more than 60 % of the sequenced reads belonged to OsHV-1 whereas, in previous work, the ratio obtained from infected tissues ranged from 0,01% to 13,21% [31, 34]. While other virus enrichment methods like PCR or capture-based approaches can reach up to 100 % of viral data, the TFF approach does not suffer from amplification or hybridization bias caused by sequence variation over time due to genomic divergence [97]. We believe this method will be very helpful to improve the quantification of infectious viral particles in samples from organisms for which there is no cell culture available. Indeed, for tissue samples, the quantification of viruses is mainly based on qPCR approaches, which quantify the genome copy number but not the number of infectious viral particles. Since infected tissues could contain viral genomes undergoing replication or non-canonical genomes [48], the quantification of genome packaged in viral particle reflect a more robust estimate of the true quantification of viable virion in the sample. In addition, we have shown that TFF is an alternative particle enrichment method to centrifugation-based purification. TFF has many advantages including low cost, transportability, rapid purification, and enrichment in infectious particles (i.e. 500 kDa for OsHV-1). We therefore demonstrate, that TFF is a powerful tool to purify unculturable viruses from clinical and environmental isolates.
Then, we performed a high-quality DNA extraction that allowed us to compare the capabilities of long- and short-read sequencing technologies (Illumina and Oxford Nanopore Technologies) to de novo assemble OsHV-1 whole genome.
To date and to our knowledge, there is no maintained bioinformatic pipeline for reference-free and de novo assembly of long DNA viruses with large inverted-repeat like OsHV-1 for either short- or long-read sequence. Usually, long- and short-read sequencing technologies are combined to de novo assemble Herpesvirus whole genomes [48, 98]. Conversely to what is usually done, we have successfully tested two bioinformatic pipelines to independently analyse the long- and short-read datasets, and to reference-free and de novo assemble two OsHV-1 whole genomes. The de novo assembly of the short-read pipeline produced five OsHV-1 contigs that have been manually scaffolded to produce a complete OsHV-1 genome. The de novo assembly of the long-read pipeline produced one large contig that has been manually curated to remove one extra duplicated region. Because of uncertainty with the short-read assembly to position duplicated regions relative to unique region, we have depicted the Illumina assembly as a Non-Redundant (NR) genome, with only one copy of each RL and RS, as frequently done in the literature [34, 77, 78, 99]. In contrast, the long-read assembly resulted in a genome with both inverted repeats with the help of a minimal manual edit.
Pairwise comparison of the two assembled OsHV-1 genomes showed that the Nanopore genome was 99,98 % similar to its Illumina counterpart. Our results demonstrate that Nanopore sequencing is now a mature sequencing technology for the small-scale genomic characterization of difficult long DNA virus with large inverted repeat. This breakthrough sequencing technology coupled with TFF virus purification is now a promising solution to drastically increase the number of sequenced OsHV-1 genomes allowing in-depth molecular evolution and epidemiology characterization.
In addition to being accurate at the base level, ONT sequencing also allowed to resolve large-scale genomic variation. For the first time here, the Nanopore sequencing allowed the identification of one extra copy of a 115 bp multi-copy repetition inside the IRS region that was not previously resolved by short-read approach. This long-read sequencing approach also confirmed the presence of the four major OsHV-1 isomers in viral particles for the first time since the acquisition of the first OsHV-1 genome [7]. Isomers are characterized by permutation of the unique regions UL and US with respect to the IRL-X-IRS fragment. Some of the long-read obtained with Nanopore sequencing were large enough to cover one part of the two unique regions allowing us to quantify and confirm that the different OsHV-1 isomers were present in a stoichiometric proportion among viral particles. Interestingly, this observation is different from the one made in herpes simplex virus 1 (HSV-1), where four isomers have also been characterized, but with variable tropisms specific to cell lines and tissues [100]. Since the presence of isomeric genome in Herpesvirus is tightly linked to its replication and recombination process [98, 101, 102], having access to isomer frequency along with full genome characterization will allow in the near future a better understanding of the complex life cycle as well as of the evolutionary processes of this type of virus.
Finally, and even if this was not the primarily goal of this study, TFF allowed to co-purify multiple DNA viruses along with OsHV-1. For this specific application of viral metagenomics, Illumina allowed, due to the huge sequencing depth, to taxonomically identify ten times more virus than Nanopore sequencing for contigs with size larger than 10 kb. The great majority of identified DNA virus belong to bacteriophage. It is interesting to note that some of them are phages infecting Vibrio , bacteria, which have been described as colonizing moribund oysters during POMS outbreak [25, 103], others are Pseudoalteromonas and Aeromonas phages, pathogens of bacteria considered as beneficial for oysters [104, 105]. Of course, our data does not disentangle the potential role of these phages in POMS but raised questions about the necessity of investigating the virome of healthy and diseased oysters as it has been done for the bacteriome [25].
Conclusion
This first evaluation of Nanopore sequencing for OsHV-1 genome characterization is extremely promising. Coupled with the appropriate bioinformatics pipeline this long-read sequencing approach enables identification of structural variation not observed with the short-read approach and allows characterization and quantification of genomic isomers. As shown in the present study, Nanopore sequencing still struggles to accurately sequence homopolymeric region. However, Nanopore is currently updating software for base calling as well as flow cell and chemistry to improve the resolution of the homopolymeric region [93, 106]. Illumina as well as Nanopore present difficulties to accurately resolve very long homopolymer (>20 bp) in GC-rich regions. It is thus of prime importance to focus efforts on methodological development for a better resolution of those regions that likely play a role for the diversity, replication and the stability of the viral genome [35, 107, 108].
The next challenge lies in combining TFF purification of virus populations harboured by a single individual and long-read sequencing to resolve viral haplotype phasing and better understand intra individual viral dynamics. Indeed, most of the whole-genome studies conducted on Herpesviruses to date only generate a consensus view of the viral population rather than a reconstruction of the viral haplotypes [96]. However, recent studies have shown that viral haplotype phasing could prove to be crucial in understanding the genetic determinants and viral evolution underpinning virulence, pathogenesis and drug resistance [109–111].
Supplementary Data
Funding information
J.D. was supported by a grant from the University of Montpellier. C.P. was supported by a grant from the Ifremer Scientific Board and from the Nouvelle-Aquitaine region. A.D.M. was financially supported by Ifremer, by EU DG SANTE through the European Reference Laboratory for mollusc diseases, and by DGAL through the National Reference Laboratory for Mollusc Diseases. This work benefited from support from the ‘Laboratoire d’excellence’ (LabEx) CeMEB, through the exploratory research project HaploFit and the use of the ‘environmental genomics’ facility (http://www.labex-cemeb.org/fr/genomique-environnementale-1). This work was also supported by the Ifremer Scientific Board, through the HemoVir and GT-oyster projects and by the ‘Fond Européen pour les Affaires Maritimes et la Pêche’ (FEAMP, GESTINNOV project n°PFEA470020FA1000007). The authors acknowledge the Pôle de Calcul et de Données Marines (PCDM; https://wwz.ifremer.fr/en/Research-Technology/Research-Infrastructures/Digital-infrastructures/Computation-Centre) for providing DATARMOR computing and storage resources. Some data were produced through the GenSeq technical facilities of the « Institut des Sciences de l’Evolution de Montpellier » with the support of LabEx CeMEB, an ANR ‘Investissements d'avenir’ programme (ANR-10-LABX-04–01). This study is set within the framework of the ‘Laboratoires d’Excellences (LABEX)’ TULIP (ANR-10-LABX-41). The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank the staff of the Ifremer station at La Tremblade (ASIM), Frédéric Girardin and his team (Plateforme des Mollusques Marins de La Tremblade PMMLT) and Christophe Stavrakakis and his team (Plateforme des Mollusques Marins de Bouin PMMB). We also thank Louis Boismorand and Marc Leroy for technical assistance. We are also grateful to Jean-Christophe Avarre and Marie-ka Tilak (ISEM) for fruitful discussions and advice concerning Nanopore sequencing technology.
Author contributions
J.M.E. and B.M. conceptualized the study; B.M., N.F., M.B. and J.M.E. collected and processed the samples, J.D. and J.M.E. performed Nanopore sequencing, A.D.M. and G.C. did the bioinformatic analysis for Nanopore data, C.P. and G.C. did the bioinformatic analysis of the Illumina data, J.M.E. and G.C. drafted the manuscript and M.J. and B.M. corrected it. All authors read and approved the final manuscript.
Conflicts of interest
The authors declare that there are no conflicts of interest.
Ethical statement
Not applicable, experimental works with bivalves are not currently subjected to ethical approval by Ifremer.
Footnotes
Abbreviations: ASW, artificial seawater; Cp, crossing point; cp, copy number; DP, DNA polymerase; FSW, filtered seawater; GU, genomic units; IRL, inverted repeats long; IRS, inverted repeats small; mNGS, metagenomic next-generation sequencing; ONT, Oxford Nanopore Technologies; ORF, open reading frames; OsHV-1, ostreid herpesvirus type; P, permeate; POMS, pacific oyster mortality syndrome; R, retentate; SNP, single nucleotide polymorphism; SV, structure variations; TFF, tangential flow filtration; TRL, tandem repeats long; TRS, tandem repeats small; UL, unique long; US, unique small; VLP, virus-like particles.
All supporting data, code and protocols have been provided within the article or through supplementary data files. Three supplementary figures and two supplementary tables are available with the online version of this article.
References
- 1.Hall RJ, Wang J, Todd AK, Bissielo AB, Yen S, et al. Evaluation of rapid and simple techniques for the enrichment of viruses prior to metagenomic virus discovery. J Virol Methods. 2014;195:194–204. doi: 10.1016/j.jviromet.2013.08.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kumar A, Murthy S, Kapoor A. Evolution of selective-sequencing approaches for virus discovery and virome analysis. Virus Res. 2017;239:172–179. doi: 10.1016/j.virusres.2017.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gaafar YZA, Ziebell H, Pappu HR. Comparative study on three viral enrichment approaches based on RNA extraction for plant virus/viroid detection using high-throughput sequencing. PLoS ONE. 2020;15:e0237951. doi: 10.1371/journal.pone.0237951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Deng X, Achari A, Federman S, Yu G, Somasekar S, et al. Metagenomic sequencing with spiked primer enrichment for viral diagnostics and genomic surveillance. Nat Microbiol. 2020;5:443–454. doi: 10.1038/s41564-019-0637-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Simner PJ, Miller S, Carroll KC. Understanding the promises and hurdles of metagenomic next-generation sequencing as a diagnostic tool for infectious diseases. Clin Infect Dis. 2018;66:778–788. doi: 10.1093/cid/cix881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gardy JL, Loman NJ. Towards a genomics-informed, real-time, global pathogen surveillance system. Nat Rev Genet. 2018;19:9–20. doi: 10.1038/nrg.2017.88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Davison AJ, Trus BL, Cheng N, Steven AC, Watson MS, et al. A novel class of herpesvirus with bivalve hosts. J Gen Virol. 2005;86:41–53. doi: 10.1099/vir.0.80382-0. [DOI] [PubMed] [Google Scholar]
- 8.EFSA Oyster mortality. EFSA Journal. 2015:1. [Google Scholar]
- 9.Azéma P, Lamy J-B, Boudry P, Renault T, Travers M-A, et al. Genetic parameters of resistance to vibrio aestuarianus, and oshv-1 infections in the pacific oyster, crassostrea gigas, at three different life stages. genetics selection evolution. BioMed Central. 2017:1–16. doi: 10.1186/s12711-017-0297-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Dégremont L, Morga B, Maurouard E, Travers M-A. Susceptibility variation to the main pathogens of Crassostrea gigas at the larval, spat and juvenile stages using unselected and selected oysters to OsHV-1 and/or V. aestuarianus . J Invertebr Pathol. 2021;183:107601. doi: 10.1016/j.jip.2021.107601. [DOI] [PubMed] [Google Scholar]
- 11.Xia J, Bai C, Wang C, Song X, Huang J. Complete genome sequence of Ostreid herpesvirus-1 associated with mortalities of Scapharca broughtonii broodstocks. Virol J BioMed Central. 2015;12:110–119. doi: 10.1186/s12985-015-0334-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Cáceres-Martínez J, Vásquez-Yeomans R, Danigo P, Reyes-Roel C. Histological alterations in pacific oysters crassostrea gigas that survived a summer mortality event in Baja California, Mexico. J Aquat Anim Health. 2018;30:31–38. doi: 10.1002/aah.10006. [DOI] [PubMed] [Google Scholar]
- 13.Hwang JY, Park JJ, Yu HJ, Hur YB, Arzul I, et al. Ostreid herpesvirus 1 infection in farmed Pacific oyster larvae Crassostrea gigas (Thunberg) in Korea. J Fish Dis. 2013;36:969–972. doi: 10.1111/jfd.12093. [DOI] [PubMed] [Google Scholar]
- 14.Jenkins C, Hick P, Gabor M, Spiers Z, Fell SA, et al. Identification and characterisation of an ostreid herpesvirus-1 microvariant (OsHV-1 µ-var) in Crassostrea gigas (Pacific oysters) in Australia. Dis Aquat Organ. 2013;105:109–126. doi: 10.3354/dao02623. [DOI] [PubMed] [Google Scholar]
- 15.Mortensen S, Strand Å, Bodvin T, Alfjorden A, Skår CK, et al. Summer mortalities and detection of ostreid herpesvirus microvariant in Pacific oyster Crassostrea gigas in Sweden and Norway. Dis Aquat Organ. 2016;117:171–176. doi: 10.3354/dao02944. [DOI] [PubMed] [Google Scholar]
- 16.Pernet F, Barret J, Le Gall P, Corporeau C, Dégremont L, et al. Mass mortalities of pacific oysters Crassostrea gigas reflect infectious diseases and vary with farming practices in the Mediterranean Thau lagoon, France. Aquacult Environ Interact. 2012;2:215–237. doi: 10.3354/aei00041. [DOI] [Google Scholar]
- 17.Prado-Alvarez M, Darmody G, Hutton S, O’Reilly A, Lynch SA, et al. Occurrence of oshv-1 in crassostrea gigas cultured in ireland during an exceptionally warm summer. Selection of Less Susceptible Oysters Front Physiol. 2016;7:492. doi: 10.3389/fphys.2016.00492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Segarra A, Pépin JF, Arzul I, Morga B, Faury N, et al. Detection and description of a particular Ostreid herpesvirus 1 genotype associated with massive mortality outbreaks of Pacific oysters, Crassostrea gigas, in France in 2008. Virus Res. 2010;153:92–99. doi: 10.1016/j.virusres.2010.07.011. [DOI] [PubMed] [Google Scholar]
- 19.Garcia C, Thébault A, Dégremont L, Arzul I, Miossec L, et al. Ostreid herpesvirus 1 detection and relationship with Crassostrea gigas spat mortality in France between 1998 and 2006. Vet Res. 2011;42:73. doi: 10.1186/1297-9716-42-73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Renault T, Faury N, Barbosa-Solomieu V, Moreau K. Suppression substractive hybridisation (SSH) and real time PCR reveal differential gene expression in the Pacific cupped oyster, Crassostrea gigas, challenged with Ostreid herpesvirus 1. Dev Comp Immunol. 2011;35:725–735. doi: 10.1016/j.dci.2011.02.004. [DOI] [PubMed] [Google Scholar]
- 21.Petton B, Pernet F, Robert R, Boudry P. Temperature influence on pathogen transmission and subsequent mortalities in juvenile Pacific oysters Crassostrea gigas . Aquacult Environ Interact. 2013;3:257–273. doi: 10.3354/aei00070. [DOI] [Google Scholar]
- 22.Petton B, Boudry P, Alunno-Bruscia M, Pernet F. Factors influencing disease-induced mortality of Pacific oysters Crassostrea gigas . Aquacult Environ Interact. 2015;6:205–222. doi: 10.3354/aei00125. [DOI] [Google Scholar]
- 23.Le Roux F, Wegner KM, MPTI Oysters and vibrios as a model for disease dynamics in wild animals. Trends Microbiol. 2016;24:568–580. doi: 10.1016/j.tim.2016.03.006. [DOI] [PubMed] [Google Scholar]
- 24.Delisle L, Pauletto M, Vidal-Dupiol J, Petton B, Bargelloni L, et al. High temperature induces transcriptomic changes in Crassostrea gigas that hinders progress of Ostreid herpesvirus (OsHV-1) and promotes survival. J Exp Biol. 2020 doi: 10.1242/jeb.226233. [DOI] [PubMed] [Google Scholar]
- 25.de Lorgeril J, Lucasson A, Petton B, Toulza E, Montagnani C, et al. Immune-suppression by OsHV-1 viral infection causes fatal bacteraemia in Pacific oysters. Nat Commun. 2018;9:1–14. doi: 10.1038/s41467-018-06659-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rubio T, Oyanedel D, Labreuche Y, Toulza E, Luo X, et al. Species-specific mechanisms of cytotoxicity toward immune cells determine the successful outcome of Vibrio infections. Proc Natl Acad Sci U S A. 2019;116:14238–14247. doi: 10.1073/pnas.1905747116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Petton B, Destoumieux-Garzón D, Pernet F, Toulza E, de Lorgeril J, et al. The Pacific Oyster Mortality Syndrome, a Polymicrobial and Multifactorial Disease: State of Knowledge and Future Directions. Front Immunol. 2021;12:630343. doi: 10.3389/fimmu.2021.630343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Barbosa Solomieu V, Renault T, Travers M-A. Mass mortality in bivalves and the intricate case of the Pacific oyster, Crassostrea gigas . J Invertebr Pathol. 2015;131:2–10. doi: 10.1016/j.jip.2015.07.011. [DOI] [PubMed] [Google Scholar]
- 29.Renault T, Moreau P, Faury N, Pépin J-F, Segarra A, et al. Analysis of clinical ostreid herpesvirus 1 (malacoherpesviridae) specimens by sequencing amplified fragments from three virus genome areas.2nd ed. J Virol. 2012;86(10):5942–5947. doi: 10.1128/JVI.06534-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Renault T, Tchaleu G, Faury N, Moreau P, Segarra A, et al. Genotyping of a microsatellite locus to differentiate clinical Ostreid herpesvirus 1 specimens. Vet Res. 2014;45:3. doi: 10.1186/1297-9716-45-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Burioli EAV, Prearo M, Houssin M. Complete genome sequence of Ostreid herpesvirus type 1 µVar isolated during mortality events in the Pacific oyster Crassostrea gigas in France and Ireland. Virology. 2017;509:239–251. doi: 10.1016/j.virol.2017.06.027. [DOI] [PubMed] [Google Scholar]
- 32.Abbadi M, Zamperin G, Gastaldelli M, Pascoli F, Rosani U, et al. Identification of a newly described OsHV-1 µvar from the North Adriatic Sea (Italy) J Gen Virol. 2018;99:693–703. doi: 10.1099/jgv.0.001042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Morga B, Jacquot M, Pelletier C, Chevignon G, Dégremont L, et al. Genomic diversity of the ostreid herpesvirus type 1 across time and location and among host species. Front Microbiol. 2021;12:711377. doi: 10.3389/fmicb.2021.711377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Delmotte J, Pelletier C, Morga B, Galinier R, Petton B, et al. Genetic diversity and connectivity of the Ostreid herpesvirus 1 populations in France: a first attempt to phylogeographic inference for A marine mollusc disease. Virus Evol. 2022;8:veac039. doi: 10.1093/ve/veac039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kuny CV, Szpara ML. Alphaherpesvirus genomics: past, present and future. Curr Issues Mol Biol. 2021;42:41–80. doi: 10.21775/cimb.042.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Riaz N, Leung P, Barton K, Smith MA, Carswell S, et al. Adaptation of Oxford Nanopore technology for hepatitis C whole genome sequencing and identification of within-host viral variants. BMC Genomics. 2021;22:148–12. doi: 10.1186/s12864-021-07460-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Rios G, Lacoux C, Leclercq V, Diamant A, Lebrigand K, et al. Monitoring SARS-CoV-2 variants alterations in Nice neighborhoods by wastewater nanopore sequencing. Lancet Reg Health Eur. 2021;10:100202. doi: 10.1016/j.lanepe.2021.100202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Toh X, Wang Y, Rajapakse MP, Lee B, Songkasupa T, et al. Use of nanopore sequencing to characterize african horse sickness virus (AHSV) from the African horse sickness outbreak in thailand in 2020. Transbound Emerg Dis. 2022;69:1010–1019. doi: 10.1111/tbed.14056. [DOI] [PubMed] [Google Scholar]
- 39.Chan WS, Au CH, Lam HY, Wang CLN, Ho DN-Y, et al. Evaluation on the use of Nanopore sequencing for direct characterization of coronaviruses from respiratory specimens, and a study on emerging missense mutations in partial RdRP gene of SARS-CoV-2. Virol J. 2020;17:183–13. doi: 10.1186/s12985-020-01454-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.King J, Harder T, Beer M, Pohlmann A. Rapid multiplex MinION nanopore sequencing workflow for Influenza A viruses. BMC Infect Dis. 2020;20:648–8. doi: 10.1186/s12879-020-05367-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Batista FM, Stapleton T, Lowther JA, Fonseca VG, Shaw R, et al. Whole genome sequencing of hepatitis a Virus Using A PCR-Free single-molecule nanopore sequencing approach. Front Microbiol. 2020;11:874. doi: 10.3389/fmicb.2020.00874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Faria NR, Quick J, Claro IM, Thézé J, de Jesus JG, et al. Establishment and cryptic transmission of Zika virus in Brazil and the Americas. Nature. 2017;546:406–410. doi: 10.1038/nature22401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Eltom KH, Althoff AC, Hansen S, Böhlken-Fascher S, Yousif A, et al. Differentiation of capripox viruses by nanopore sequencing. Vaccines. 2021;9:351. doi: 10.3390/vaccines9040351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Radukic MT, Brandt D, Haak M, Müller KM, Kalinowski J. Nanopore sequencing of native adeno-associated virus (AAV) single-stranded DNA using a transposase-based rapid protocol. NAR Genomics and Bioinformatics. 2020;2 doi: 10.1093/nargab/lqaa074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Kulikov EE, Golomidova AK, Babenko VV, Letarov AV. A simple method for extraction of the horse feces virome DNA, suitable for oxford nanopore sequencing. Microbiology. 2020;89:246–249. doi: 10.1134/S002626172002006X. [DOI] [Google Scholar]
- 46.Ben Chehida S, Filloux D, Fernandez E, Moubset O, Hoareau M, et al. Nanopore sequencing is a credible alternative to recover complete genomes of geminiviruses. Microorganisms. 2021;9:903. doi: 10.3390/microorganisms9050903. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Karamitros T, Harrison I, Piorkowska R, Katzourakis A, Magiorkinis G, et al. De novo assembly of human herpes virus type 1 (HHV-1) genome, mining of non-canonical structures and detection of novel drug-resistance mutations using short- and long-read next generation sequencing technologies. PLoS One. 2016;11:e0157600. doi: 10.1371/journal.pone.0157600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Karamitros T, van Wilgenburg B, Wills M, Klenerman P, Magiorkinis G. Nanopore sequencing and full genome de novo assembly of human cytomegalovirus TB40/E reveals clonal diversity and structural variations. BMC Genomics. 2018;19:577–12. doi: 10.1186/s12864-018-4949-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Vereecke N, Carnet F, Pronost S, Vanschandevijl K, Theuns S, et al. Genome sequences of equine herpesvirus 1 strains from a European outbreak of neurological disorders linked to a horse gathering in Valencia, Spain, in 2021. Microbiol Resour Announc. 2021;10:e00333-21. doi: 10.1128/MRA.00333-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Fuandila NN, Gosselin-Grenet A-S, Tilak M-K, Bergmann SM, Escoubas J-M, et al. Structural variation turnovers and defective genomes: key drivers for the in vitro evolution of the large double-stranded DNA koi herpesvirus (KHV). Cold Spring Harbor Laboratory. bioRxiv. 2022;2 doi: 10.24072/pcjournal.154. [DOI] [Google Scholar]
- 51.Vinaya Kumar K, Shekhar MS, Otta SK, Karthic K, Ashok Kumar J, et al. First report of a complete genome sequence of White spot syndrome virus from India. Genome Announc. 2018;6:e00055-18. doi: 10.1128/genomeA.00055-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Gallagher MD, Matejusova I, Nguyen L, Ruane NM, Falk K, et al. Nanopore sequencing for rapid diagnostics of salmonid RNA viruses. Sci Rep. 2018;8:16307–16309. doi: 10.1038/s41598-018-34464-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Liem M, Regensburg-Tuïnk T, Henkel C, Jansen H, Spaink H. Microbial diversity characterization of seawater in a pilot study using Oxford Nanopore Technologies long-read sequencing. BMC Res Notes. 2021;14:42–47. doi: 10.1186/s13104-021-05457-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Beaulaurier J, Luo E, Eppley JM, Uyl PD, Dai X, et al. Assembly-free single-molecule sequencing recovers complete virus genomes from natural microbial communities. Genome Res. 2020;30:437–446. doi: 10.1101/gr.251686.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Urban L, Holzer A, Baronas JJ, Hall MB, Braeuninger-Weimer P, et al. Freshwater monitoring by nanopore sequencing. Elife. 2021;10:e61504. doi: 10.7554/eLife.61504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wickramasinghe SR, Kalbfuss B, Zimmermann A, Thom V, Reichl U. Tangential flow microfiltration and ultrafiltration for human influenza A virus concentration and purification. Biotechnol Bioeng. 2005;92:199–208. doi: 10.1002/bit.20599. [DOI] [PubMed] [Google Scholar]
- 57.Michalsky R, Passarelli AL, Pfromm PH, Czermak P. Purification of the baculovirus Autographa californica M nucleopolyhedrovirus by tangential flow ultrafiltration. Desalination. 2009;245:694–700. doi: 10.1016/j.desal.2009.02.039. [DOI] [Google Scholar]
- 58.Eilts F, Harnischfeger J, Loewe D, Wolff MW, Salzig D, et al. Production of Baculovirus and Stem Cells for Baculovirus-Mediated Gene Transfer into Human Mesenchymal Stem Cells. Methods Mol Biol. 2021;2183:367–390. doi: 10.1007/978-1-0716-0795-4_19. [DOI] [PubMed] [Google Scholar]
- 59.Negrete A, Pai A, Shiloach J. Use of hollow fiber tangential flow filtration for the recovery and concentration of HIV virus-like particles produced in insect cells. J Virol Methods. 2014;195:240–246. doi: 10.1016/j.jviromet.2013.10.017. [DOI] [PubMed] [Google Scholar]
- 60.Loewe D, Grein TA, Dieken H, Weidner T, Salzig D, et al. Tangential Flow Filtration for the Concentration of Oncolytic Measles Virus: The Influence of Filter Properties and the Cell Culture Medium. Membranes. 2019;9:E160. doi: 10.3390/membranes9120160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Shi H, Pasco EV, Tarabara VV. Membrane-based methods of virus concentration from water: a review of process parameters and their effects on virus recovery. Environ Sci: Water Res Technol. 2017;3:778–792. doi: 10.1039/C7EW00016B. [DOI] [Google Scholar]
- 62.Cordier C, Stavrakakis C, Morga B, Degrémont L, Voulgaris A, et al. Removal of pathogens by ultrafiltration from sea water. Environ Int. 2020;142:105809. doi: 10.1016/j.envint.2020.105809. [DOI] [PubMed] [Google Scholar]
- 63.Beaulaurier J, Luo E, Eppley JM, Uyl PD, Dai X, et al. Assembly-free single-molecule sequencing recovers complete virus genomes from natural microbial communities. Genome Res. 2020;30:437–446. doi: 10.1101/gr.251686.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Azéma P, Lamy J-B, Boudry P, Renault T, Travers M-A, et al. Genetic parameters of resistance to Vibrio aestuarianus, and OsHV-1 infections in the Pacific oyster, Crassostrea gigas, at three different life stages. Genet Sel Evol. 2017;49:23. doi: 10.1186/s12711-017-0297-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Dégremont L, Bédier E, Soletchnik P, Ropert M, Huvet A, et al. Relative importance of family, site, and field placement timing on survival, growth, and yield of hatchery-produced Pacific oyster spat (Crassostrea gigas) Aquaculture. 2005;249:213–229. doi: 10.1016/j.aquaculture.2005.03.046. [DOI] [Google Scholar]
- 66.Schikorski D, Renault T, Saulnier D, Faury N, Moreau P, et al. Experimental infection of Pacific oyster Crassostrea gigas spat by ostreid herpesvirus 1: demonstration of oyster spat susceptibility. Vet Res. 2011;42:13–27. doi: 10.1186/1297-9716-42-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Webb SC, Fidler A, Renault T. Primers for PCR-based detection of ostreid herpes virus-1 (OsHV-1): Application in a survey of New Zealand molluscs. Aquaculture. 2007;272:126–139. doi: 10.1016/j.aquaculture.2007.07.224. [DOI] [Google Scholar]
- 68.Leruste A, Bouvier T, Bettarel Y. Enumerating viruses in coral mucus. Appl Environ Microbiol. 2012;78:6377–6379. doi: 10.1128/AEM.01141-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Patel A, Noble RT, Steele JA, Schwalbach MS, Hewson I, Fuhrman JA. Virus and prokaryote enumeration from planktonic aquatic environments by epifluorescence microscopy with SYBR Green I. Nat Protoc. Nature Publishing Group. 2007;2(2):269–76. doi: 10.1038/nprot.2007.6. [DOI] [PubMed] [Google Scholar]
- 70.Suquet M, de Kermoysan G, Araya RG, Queau I, Lebrun L, et al. Anesthesia in pacific oyster, crassostrea gigas. Aquat Living Resour EDP Sciences. 2009;22(1):29–34. [Google Scholar]
- 71.Dupont S, Lokmer A, Corre E, Auguet J-C, Petton B, et al. Oyster hemolymph is a complex and dynamic ecosystem hosting bacteria, protists and viruses. Anim Microbiome. 2020;2:12. doi: 10.1186/s42523-020-00032-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, et al. The sequence alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27:824–834. doi: 10.1101/gr.213959.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 77.Cunningham C, Gatherer D, Hilfrich B, Baluchova K, Dargan DJ, et al. Sequences of complete human cytomegalovirus genomes from infected cell cultures and clinical specimens. J Gen Virol. 2010;91:605–615. doi: 10.1099/vir.0.015891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Morse AM, Calabro KR, Fear JM, Bloom DC, McIntyre LM. Reliable detection of herpes simplex virus sequence variation by high-throughput resequencing. Viruses. 2017;9:E226. doi: 10.3390/v9080226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Kolmogorov M, Bickhart DM, Behsaz B, Gurevich A, Rayko M, et al. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat Methods. 2020;17:1103–1110. doi: 10.1038/s41592-020-00971-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Vaser R, Sović I, Nagarajan N, Šikić M. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res. 2017;27:737–746. doi: 10.1101/gr.214270.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10:giab008. doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Arndt D, Grant JR, Marcu A, Sajed T, Pon A, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016;44:W16–21. doi: 10.1093/nar/gkw387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Buchfink B, Reuter K, Drost H-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat Methods. 2021;18:366–368. doi: 10.1038/s41592-021-01101-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14:1394–1403. doi: 10.1101/gr.2289704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Lemoine F, Correia D, Lefort V, Doppelt-Azeroual O, Mareuil F, et al. NGPhylogeny.fr: new generation phylogenetic services for non-specialists. Nucleic Acids Res. 2019;47:W260–W265. doi: 10.1093/nar/gkz303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Criscuolo A, Gribaldo S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol Biol. 2010;10:210–221. doi: 10.1186/1471-2148-10-210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9:772–2. doi: 10.1038/nmeth.2109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–321. doi: 10.1093/sysbio/syq010. [DOI] [PubMed] [Google Scholar]
- 92.Ross MG, Russ C, Costello M, Hollinger A, Lennon NJ, et al. Characterizing and measuring bias in sequence data. Genome Biol. 2013;14:20–R51. doi: 10.1186/gb-2013-14-5-r51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Delahaye C, Nicolas J. Sequencing DNA with nanopores: Troubles and biases. PLoS One. 2021;16:e0257521. doi: 10.1371/journal.pone.0257521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Stoler N, Nekrutenko A. Sequencing error profiles of Illumina sequencing instruments. NAR Genom Bioinform. 2021;3:lqab019. doi: 10.1093/nargab/lqab019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Houldcroft CJ, Beale MA, Breuer J. Clinical and biological insights from viral genome sequencing. Nat Rev Microbiol. 2017;15:183–192. doi: 10.1038/nrmicro.2016.182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Renner DW, Szpara ML. Impacts of genome-wide analyses on our understanding of human herpesvirus diversity and evolution. J Virol. 2018;92:126–13. doi: 10.1128/JVI.00908-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Sozhamannan S, Holland MY, Hall AT, Negrón DA, Ivancich M, et al. Evaluation of signature erosion in ebola virus due to genomic drift and its impact on the performance of diagnostic assays. Viruses. 2015;7:3130–3154. doi: 10.3390/v7062763. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.López-Muñoz AD, Rastrojo A, Kropp KA, Viejo-Borbolla A, Alcamí A. Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome. Microb Genom. 2021;7 doi: 10.1099/mgen.0.000586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Hage E, Wilkie GS, Linnenweber-Held S, Dhingra A, Suárez NM, et al. Characterization of human cytomegalovirus genome diversity in immunocompromised hosts by whole-genome sequencing directly from clinical specimens. J Infect Dis. 2017;215:1673–1683. doi: 10.1093/infdis/jix157. [DOI] [PubMed] [Google Scholar]
- 100.Mahiet C, Ergani A, Huot N, Alende N, Azough A, et al. Structural variability of the herpes simplex virus 1 genome in vitro and in vivo. J Virol. 2012;86:8592–8601. doi: 10.1128/JVI.00223-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Chou J, Roizman B. Isomerization of herpes simplex virus 1 genome: identification of the cis-acting and recombination sites within the domain of the a sequence. Cell. 1985;41:803–811. doi: 10.1016/s0092-8674(85)80061-1. [DOI] [PubMed] [Google Scholar]
- 102.McVoy MA, Ramnarain D. Machinery to support genome segment inversion exists in a herpesvirus which does not naturally contain invertible elements. J Virol. 2000;74:4882–4887. doi: 10.1128/jvi.74.10.4882-4887.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Le Roux F, Wegner KM, Polz MF. Oysters and Vibrios as a Model for Disease Dynamics in Wild Animals. Trends Microbiol. 2016;24:568–580. doi: 10.1016/j.tim.2016.03.006. [DOI] [PubMed] [Google Scholar]
- 104.Desriac F, El Harras A, Simon M, Bondon A, Brillet B, et al. Alterins Produced by Oyster-Associated Pseudoalteromonas Are Antibacterial Cyclolipopeptides with LPS-Binding Activity. Mar Drugs. 2020;18:E630. doi: 10.3390/md18120630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Gibson LF, Woodworth J, George AM. Probiotic activity of Aeromonas media on the Pacific oyster, Crassostrea gigas, when challenged with Vibrio tubiashii. Aquaculture. 1998;169:111–120. doi: 10.1016/S0044-8486(98)00369-X. [DOI] [Google Scholar]
- 106.Wang Y, Zhao Y, Bollas A, Wang Y, Au KF. Nanopore sequencing technology, bioinformatics and applications. Nat Biotechnol. 2021;39:1348–1365. doi: 10.1038/s41587-021-01108-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Olson AT, Wang Z, Rico AB, Wiebe MS. A poxvirus pseudokinase represses viral DNA replication via A pathway antagonized by its paralog kinase. PLoS Pathog. 2019;15:e1007608. doi: 10.1371/journal.ppat.1007608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Castillo-Lizardo M, Henneke G, Viguera E. Replication slippage of the thermophilic DNA polymerases B and D from the Euryarchaeota Pyrococcus abyssi. Front Microbiol. 2014;5:403. doi: 10.3389/fmicb.2014.00403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Wu NC, De La Cruz J, Al-Mawsawi LQ, Olson CA, Qi H, et al. HIV-1 quasispecies delineation by tag linkage deep sequencing. PLoS One. 2014;9:e97505. doi: 10.1371/journal.pone.0097505. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Monaco DC, Zapata L, Hunter E, Salomon H, Dilernia DA. Resistance profile of HIV-1 quasispecies in patients under treatment failure using single molecule, real-time sequencing. AIDS. 2020;34:2201–2210. doi: 10.1097/QAD.0000000000002697. [DOI] [PubMed] [Google Scholar]
- 111.Della Bartola M, Byrne S, Mullins E. Characterization of potato virus Y isolates and assessment of nanopore sequencing to detect and genotype potato viruses. Viruses. 2020;12:E478. doi: 10.3390/v12040478. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.