Skip to main content
Frontiers in Microbiology logoLink to Frontiers in Microbiology
. 2021 Jul 13;12:711377. doi: 10.3389/fmicb.2021.711377

Genomic Diversity of the Ostreid Herpesvirus Type 1 Across Time and Location and Among Host Species

Benjamin Morga 1,*,, Maude Jacquot 1,, Camille Pelletier 1, Germain Chevignon 1, Lionel Dégremont 1, Antoine Biétry 1, Jean-François Pepin 2, Serge Heurtebise 1, Jean-Michel Escoubas 3, Tim P Bean 4,5, Umberto Rosani 6, Chang-Ming Bai 7, Tristan Renault 8, Jean-Baptiste Lamy 1
PMCID: PMC8313985  PMID: 34326830

Abstract

The mechanisms underlying virus emergence are rarely well understood, making the appearance of outbreaks largely unpredictable. This is particularly true for pathogens with low per-site mutation rates, such as DNA viruses, that do not exhibit a large amount of evolutionary change among genetic sequences sampled at different time points. However, whole-genome sequencing can reveal the accumulation of novel genetic variation between samples, promising to render most, if not all, microbial pathogens measurably evolving and suitable for analytical techniques derived from population genetic theory. Here, we aim to assess the measurability of evolution on epidemiological time scales of the Ostreid herpesvirus 1 (OsHV-1), a double stranded DNA virus of which a new variant, OsHV-1 μVar, emerged in France in 2008, spreading across Europe and causing dramatic economic and ecological damage. We performed phylogenetic analyses of heterochronous (n = 21) OsHV-1 genomes sampled worldwide. Results show sufficient temporal signal in the viral sequences to proceed with phylogenetic molecular clock analyses and they indicate that the genetic diversity seen in these OsHV-1 isolates has arisen within the past three decades. OsHV-1 samples from France and New Zealand did not cluster together suggesting a spatial structuration of the viral populations. The genome-wide study of simple and complex polymorphisms shows that specific genomic regions are deleted in several isolates or accumulate a high number of substitutions. These contrasting and non-random patterns of polymorphism suggest that some genomic regions are affected by strong selective pressures. Interestingly, we also found variant genotypes within all infected individuals. Altogether, these results provide baseline evidence that whole genome sequencing could be used to study population dynamic processes of OsHV-1, and more broadly herpesviruses.

Keywords: marine virus, OsHV-1, diversity, shellfish farming, measurably evolution

Introduction

A common paradigm is that double-stranded DNA (dsDNA) viruses, and particularly herpesviruses, are inherently stable (Sanjuán et al., 2010; Sanjuán and Domingo-Calap, 2016) making them unsuitable for analytical techniques derived from population genetic theory. At the molecular level, this view stems from low per-site mutation rates due to the high-fidelity polymerases and error correction systems typical of these viruses. However, recent progress has been made in understanding the genomic diversity and evolution of mainly human herpesviruses, driven by the rapid expansion and application of high throughput sequencing (HTS), bioinformatics, and comparative genomics in virology. The ability of dsDNA viruses to accrue standing variation, and to undergo recombination with neighboring genomes, creates many opportunities for selective pressures to induce rapid genetic shifts (for a review of factors see Sanjuán and Domingo-Calap, 2016). In the meantime, estimates of the substitution rates of human herpesviruses and cytomegaloviruses are substantially higher than previously though (Kitchen et al., 2010; Jaramillo et al., 2013; Parsons et al., 2015; Renner and Szpara, 2017) and evidences that several viral haplotypes can be found within one infected host (at the individual scale) have been provided (Renzette et al., 2011; Cudini et al., 2019). Together, these data have reshaped our sense of the stability of herpesvirus genomes. Because contemporary viral genomic diversity is a result of the dynamic interaction of past ecological and evolutionary processes (Retel et al., 2019), opportunities to assess their evolution, at a genome-wide scale, to infer their population dynamics from genomic data (Biek et al., 2015) offer new perspectives for better understanding and preventing herpesviruses outbreaks.

Ostreid herpesvirus 1 (OsHV-1) is a dsDNA herpesvirus and the unique member of the genus Ostreavirus (family Malacoherpesviridae, order Herpesvirales). It was the first herpesvirus isolated from invertebrates in the early 1990s (Nicolas et al., 1992) and likely is the causative agent of the last decades increased mortality events in Pacific oysters, Crassostrea gigas and therefore responsible for dramatic economic damages (EFSA, 2010; Segarra et al., 2010). Since the appearance of the OsHV-1 μVar variant in 2008, we have observed high mortality following the detection of this variant, the means to fight against the infection of this virus in oyster are limited.

Ostreid herpesvirus 1 infects mainly spat and juvenile C. gigas and is currently detected in most oyster producing area worldwide (Nicolas et al., 1992; Garcia et al., 2011; Paul-Pont et al., 2013). Furthermore, this virus is able to infect at least four more distantly related marine bivalve species (Ostrea edulis, Chlamys farreri, Pecten maximus Cerastoderma edule, and Anadara broughtonii; Arzul et al., 2001, 2017; Davison et al., 2005; Bai et al., 2015; Bookelaar et al., 2020). However, whether viral populations infecting the different host species are the same remains unknown.

The first variant of OsHV-1 was described on P. maximus larvae and was named OsHV-1 Var but the associated disease was not well documented (Arzul et al., 2001). Since 2008, microvariants of OsHV-1 have been associated with increasing mortality of C. gigas in France, and in a number of countries across Europe, as well as in Australia and New Zealand (Segarra et al., 2010; Jenkins et al., 2013). OsHV-1 μVar is a single variant which differs from the reference genome published in 2005 by Davidson (accession number NC_005881.2) mainly because of sequence variations in a microsatellite locus upstream of the Open Reading Frame (ORF) 4, in ORF4 and in ORF42/43. The term “microvariants” is now used to refer to all haplotypes with mutations in and upstream of ORF 4 and in ORF 42/43. Other variants have also been described, for example OsHV-1-SB which has been associated with mass mortalities of S. broughtonii in China (Bai et al., 2015, 2016).

Following the emergence of OsHV-1 μVar, a number of studies have been undertaken to investigate the diversity of OsHV-1 using single- (Renault et al., 2014) or multi-genomic regions (Renault et al., 2012). Three OsHV-1 genome sequences were obtained with bacteriophage lambda libraries or genome walking procedures (Davison et al., 2005; Ren et al., 2013; Xia et al., 2015). More recently, three additional OsHV-1 μVar genomes were sequenced using Illumina shotgun DNA-seq technology (Burioli et al., 2017; Abbadi et al., 2018).

While sequencing allows us to characterize polymorphism and genomic variation at the genome scale with unprecedented power, in the case of OsHV-1, diversity has not yet been explored at the genome scale, or across different dimensions such as time, location and host species. Recently a study of viral diversity was performed at the viral mRNA level, which demonstrated viral populations differ depending on the genetic background of their hosts. This result suggests a co-evolution process between OsHV-1 μVar and oyster populations (Delmotte et al., 2020).

Therefore, using a reference mapping approach from historical collection of deep sequenced infected animals, we aim to assess the measurability of evolution on epidemiological time scales of OsHV-1. We also aim at characterizing the potential of the approach to test (i) if infected individuals belonging to different host species harbor different viral populations with large genomic differences possibly due to viral speciation and, (ii) if infected individual from the same host species but originated from different world region harbor viral populations with the same genomic structure and simple polymorphisms.

Materials and Methods

Sample Collection

Samples analyzed in this study come from a combination of sources, including an historical collection and the European and National Reference Laboratory for Bivalve Mollusc Diseases. For the rest of this paper, we refer to “isolates” in the sense of viral populations isolated from infected individual host. Most of the data derives from whole-genome sequencing of infected individual host or pools of infected individual hosts (Table 1). The 22 isolates in this study were collected from several countries around the world: France (eight infected juveniles C. gigas, two pool of infected C. gigas oysters, and one C. gigas adult), New-Zealand (one infected pool of C. gigas larvae and three infected C. gigas adults), Ireland (one C. gigas infected adult), Japan (two C. gigas infected adults), Sweden (two pooled infected O. edulis larvae and one infected adult), United-Kingdom (one C. gigas individual), Netherlands (one Crassostrea gigas infected adult), and Spain (one C. gigas infected adult; Table 1).

TABLE 1.

Description of the 22 sequenced libraries in this study: country of origin, host species, year of collection, sample stage, sequencing technology, and number of reads aligned against Ostreid herpesvirus 1 (OsHV-1) reference genome (Accession number: NC_005881.2).

Sample’s ID Country of origin Species Year Stage Sequencer, sequencing protocol, platform Aligned reads
VIV46-2-m France Crassostrea gigas 2017 Juvenile HiSeq 4000, Paired-end 150 bp, LIGAN-PM Genomics 281,336
VIV56-10-m France Crassostrea gigas 2017 Juvenile HiSeq 4000, Paired-end 150 bp, LIGAN-PM Genomics 346,491
VIV48-4-m France Crassostrea gigas 2017 Juvenile HiSeq 4000, Paired-end 150 bp, LIGAN-PM Genomics 354836
VIV49-5-m88 France Crassostrea gigas 2017 Juvenile HiSeq 4000, Paired-end 150 bp, LIGAN-PM Genomics 183,665
VIV58-12-m France Crassostrea gigas 2017 Juvenile HiSeq 4000, Paired-end 150 bp, LIGAN-PM Genomics 155,976
VIV47-3-m France Crassostrea gigas 2017 Juvenile HiSeq 4000, Paired-end 150 bp, LIGAN-PM Genomics 370,770
VIV57-11-m99 France Crassostrea gigas 2017 Juvenile HiSeq 4000, Paired-end 150 bp, LIGAN-PM Genomics 49,985
Poole-Harbour United Kingdom United-Kingdom Crassostrea gigas 2015 na MiSeq, Paired-end 150 bp 55,056
LI France Crassostrea gigas 2010 Pool of larvae HiSeq 2500, Paired-end 100 bp, GenoToul 400,021
PR France Crassostrea gigas 2008 Pool of juveniles HiSeq 2500, Single-end 100 bp, GATC Biotech 344,852
MV France Crassostrea gigas 2010 Ripe Adult HiSeq 2500, Single-end 100 bp, GATC Biotech 331,278
NZ New-Zealand Crassostrea gigas 2010 Pool of larvae HiSeq 2500, Single-end 100 bp, GATC Biotech 341256
NZ16 New-Zealand Crassostrea gigas 2011 Adult HiSeq 2500, Paired-end 100 bp, GATC Biotech 105,610
NZ17 New-Zealand Crassostrea gigas 2011 Adult HiSeq 2500, Paired-end 100 bp, GenoToul 202,246
NZ18 New-Zealand Crassostrea gigas 2011 Adult HiSeq 2500, Paired-end 100 bp, GenoToul 150,315
IRL15* Ireland Crassostrea gigas 2011 Adult HiSeq 2500, Paired-end 100 bp, GenoToul 1,802
JP2* Japan Crassostrea gigas na Adult HiSeq 2500, Paired-end 100 bp, GenoToul 2,115
JP6* Japan Crassostrea gigas na Adult HiSeq 2500, Paired-end 100 bp, GenoToul 3,280
NL4* Netherlands Crassostrea gigas na Adult HiSeq 2500, Paired-end 100 bp, GenoToul 4,143
SP16* Spain Crassostrea gigas na Adult HiSeq 2500, Paired-end 100 bp, GenoToul 1,259
SW3* Sweden Ostrea edulis 2012 Pool of larvae HiSeq 2500, Paired-end 100 bp, GenoToul 1,839
SW6 Sweden Ostrea edulis 2012 Adult HiSeq 2500, Paired-end 100 bp, GenoToul 402,082

Samples for which the ID starts by “VIV” are from research project Vivaldi and the remaining samples are from historical collections. *Those samples did not pass the quality check and were discarded.

The historical samples were from moribund oysters collected during mortality outbreaks. For spat, juvenile and adults, a piece of mantle or gill was collected and stored in 90% ethanol until DNA extraction. For larvae samples, a pool of individuals was frozen (−20°C) and stored until DNA extraction.

All “VIV” oysters were sampled during a mortality outbreak in July 2017 (Table 1). Briefly, spat were placed into the field (17390 La Tremblade Ronce-les-Bains France, a sandbank calledLa Floride: LAT 45.80° and LONG -1.15°). As soon as the onset of the mortality, spat were transferred in the laboratory and checked twice a day (from 18/07/2017 to 21/07/2017) to sample fresh moribund individuals.

Genomic Library Preparation and Sequencing Runs

A range of 60–200 mg of pooled larval samples were mechanically crushed using unique piston pellet before DNA extraction. About 50 mg of fresh tissue were collected from spat-juveniles-adults. Nucleic acid extraction was performed using the QIAamp DNA Mini Kit (Qiagen) according to the manufacturer’s handbook. All extracted DNA samples were stored at −25°C to avoid any degradation and DNA concentrations were measured with the NanodropTM (Labtech, France). After storage, DNA quality was assessed with NanodropTM and checked again by the sequencing platform using fluorometric measurement of nucleic acids.

All the libraries were sequenced using Illumina technology, with HiSeq 2000 sequencers on two platforms, GATC (Mulhouse, France) and GenoToul (Toulouse, France), with the exception of United Kingdom samples that were sequenced with MiSeq on in-house sequencers, and the HiSeq 4000 for the most recent samples on LIGAN (CNRS, Lille, France). Regarding the sequencing protocols, three libraries were sequenced on single-end protocol (1 × 100 bp) and remaining libraries were sequenced with paired-end protocol (2 × 150 bp) (Table 1), with the exception of the Miseq data, which was generated using V3-600 Miseq specific cartridge. Raw sequence data are available on the NCBI Sequence Read Archive (SRA) under the BioProject ID: PRJNA721248.

Bioinformatics and Genomic Framework

Read Mapping and Filtration

A reference mapping approach was used to explore the viral genomic diversity within and between samples. This approach has been extensively used for eukaryotic organisms and is best suited to call single nucleotide polymorphisms (SNPs) and genomic structural variation (SV) from a collection of individuals or pooled libraries in comparison to the more widely used de novo assembly strategies in the viral community (Baaijens et al., 2017). All sequenced libraries were aligned against the OsHV-1 reference genome (Accession number: NC_005881.2) (Davison et al., 2005) using BWA MEM v7.1.0 (Li and Durbin, 2009; Li, 2013). Only samples with a mean of coverage depth equal to or above 100X have been included for polymorphism calling and 10X for haplotype reconstruction (Figure 1A).

FIGURE 1.

FIGURE 1

Schematic illustration of bioinformatics framework. In summary, the right part is devoted to genome-wide phylogenetic approaches and the left part aims to reconstruct local viral haplotypes. Sequences were aligned to reference genome (A) and consensus were used to detected haplotypes (B,E,H) and SNPs (C,D,F). Multiple alignement were used to built phylogenies (G,I).

Local Haplotypes Reconstruction

We first aimed to reconstruct local haplotypes. As far as we are aware, no software dedicated to haplotype reconstruction is able to manage long indels, as it is observed in OsHV-1 genomes, with short reads (i.e., ∼100 bp reads long). To circumvent this limitation, we decided to make local haplotype reconstruction on each gene coding DNA sequence (CDS) that were present in every sample and outside of inverted and repeat regions. Each CDS coordinate were extracted from the GenBank file (Clark et al., 2016) and we use them to subset the bam file for each sample (Locus_tag, Figure 1B). We feed QuasiRecomb (Töpfer et al., 2013), haploclique (Töpfer et al., 2014), and shorah (Zagordi et al., 2010, 2011) with CDS-split bam files (Figure 1E). Only results from Shorah software are showed on the main text because it represents the most conservative short read local assembler, whereas the other software outputs are showed in supplementary figures. Briefly, Shorah employs a model-based probabilistic clustering algorithm to correct errors, infer haplotypes and their frequencies, and estimate, using a Bayesian approach, the quality of the reconstruction by computing the full joint posterior probability distribution of all parameters of interest (Zagordi et al., 2010).

Subsequent plotting were made in R Development Core Team (2008) with basics functions and the ggplot2 package (Wickham, 2016) (Figure 1H).

Short and Long Polymorphism Calling

The raw bam files were also processed with Freebayes software (Garrison and Marth, 2012; Figure 1D) to call short polymorphisms. Freebayes is one of the most versatile variants calling software, particularly when the libraries included pool of viral haplotypes as it is in the present study and has been extensively tested and validated on various organisms with different protocols (Olson et al., 2005; Wang et al., 2005). We discarded genomic regions with low complexity (entropy > 1) and regions with a high number of misalignments namely TRL, IRL, IRS, and TRS genomic regions that are duplicated.

Variation of coverage depth and, when available, the orientation of the paired-reads against the reference genome were jointly analyzed to detect SV, deletions, duplications, insertions, inversions, and translocations (Davison et al., 2005). We use Sniffles to perform the SV detection (Figure 1C; Sedlazeck et al., 2018), all the inferred structural variants were checked manually and filtered out as soon as we suspected some alignments artefacts. Most inferred deletions were visually checked with IGV (Thorvaldsdóttir et al., 2013), the remaining types of SV (insertions, duplications, inversions, and translocations) were rare, difficult to check manually and consequently we did not consider those in the rest of the paper. Information from both VCF files, short and long polymorphisms, were merged using vcflib (Garrison, 2019).

Consensus Sequence Reconstruction

For each sample, we reconstructed a “consensus sequence” based on the most frequent alleles for SNPs and short insertions/deletions. Into this consensus, we also integrated all long deletions found in the SV calling step, only all “large deletions” inferred by Sniffles and visually checked (Figure 1C, Supplementary Table 1, and Supplementary Figure 1). To create these consensus sequences, we use a combination of bcftools (Li et al., 2009; Li, 2011) and bedtools (Quinlan and Hall, 2010; Figure 1F).

Phylogenomic Analysis of Consensus Sequence

We choose to work on the full-length genome (Figures 1C,D,F,G) because among the Malacoherpesviridae family most of the transcripts and proteins have no functional annotation.

All consensus sequences for each samples, plus the publicly available full length OsHV-1 genomes as of March 2021 (MF509813.1, NC_005881.2, KY242785.1, and KY271630.1) and the genome of the acute viral necrotic virus (AVNV) GQ153938.1 were merged and aligned with MAFFT (Katoh and Standley, 2013; Figure 1G). For the multiple sequence alignment, we did not perform any trimming or masking steps as recommended in Tan et al. (2015).

We used a general time reversible evolutionary model with a gamma distribution of the rate of variation among sites and a proportion of invariable sites (GTR + I + G) considered most appropriate according to jModelTest v2.1.10 (Posada, 2008) and based on the Akaike Information Criterion corrected for small sample size (AICc).

We assessed whether there is sufficient temporal signal in our data to proceed with phylogenetic molecular clock analysis and checked there were no sequences whose genetic divergence and sampling date were incongruent with TempEst v1.5.3 (Rambaut et al., 2016) from a maximum likelihood tree build using PhyML v3.1 (Guindon et al., 2010). Time-scaled Bayesian phylogenetic trees were estimated in BEAST v1.10.4 (Suchard et al., 2018) using a GTR + I + G substitution model under a relaxed molecular clock with a lognormal distribution of rates (Figure 1I). A Gaussian Markov Random Field Bayesian skyride coalescent model was used as the tree prior (Minin et al., 2008). A Monte Carlo Markov chain (MCMC) was run for the number of generations needed to achieve stationary distributions (200,000,000 generations) and sampling frequency adjusted accordingly to yield 10,000 samples from the posterior. We used Tracer v1.6 (Rambaut et al., 2016) to visualize the posterior distribution of each parameter and to obtain an estimate of the effective sample size (ESS) after removal of the initial 10% burn-in. We assumed the run had reached sufficient mixing as the ESS of all the parameters was above 200. A maximum clade credibility (MCC) tree was produced after removal of the initial 10% burn-in based on common ancestor heights method (Heled and Bouckaert, 2013) using the auxiliary program TreeAnnotator included in the BEAST package.

Statistical Analysis

To test for the correlation between polymorphism and time, host species and stage, we calculated a mean frequency of the targeted polymorphism type (snp, deletion, and standardized on 1,000 bp) across all genes for each individual (to avoid pseudo-replication). We then used a generalized linear model with fixed factors with the following syntax (we add into brackets information about variable nature and modalities numbers).

Polymorphism (SNP or deletion) ∼ sampling date [continuous variable] + host species [factor with two modalities C. gigas, O. edulis] + host stage [factor with three modalities: juvenile, pool_larvae, and adult]. All statistics were conducted with R 3.4 (R Development Core Team, 2008) using the linear model, built in “lm” function.

Results

Data Quality Observations

As expected, the sequencing depth and the viral genome coverage was correlated to the total number of reads for the different libraries (data not shown). However, the sequencing of fresh moribund individuals, or even better, a pool of infected larvae (as the cost of losing information about the within-host dynamic of viral haplotypes) improves coverage and depth of sequencing. A pragmatic rule of thumb in order to have meaningful and sufficient reads from the virus genome, when sequencing an infected individual host, is to choose a fresh moribund oyster (i.e., an adult, a juvenile or a pool of larvae) harboring more than 5.0 × 106 OsHV-1 copies/mg of fresh flesh and sequence it with a yield of at least 10 × 106 reads per sample (Supplementary Figure 1). This procedure enabled a mean coverage depth of 100X of the whole viral genome for the following samples: LI, NZW17, NZW18, SW6, VIV46-2-m, VIV47-3-m, VIV48-4-m, VIV49-5-m88, VIV56-10-m, and VIV58-12-m. However, it should be kept in mind that most of the reads from a library are from the host’s genome (approximately 95–99%, although this could be as low as 70% in case of pooled infected larvae).

Viral Haplotype Diversity

The local reconstruction of haplotype diversity gives an estimate of number of viral haplotypes segregating within one isolate (Figure 2, and Supplementary Figure 2) between 1 and 4 haplotypes (on the CDS basis). As expected, libraries sequenced with short reads and single end, namely PR, MV, and NZ, did not bridge different mutations (SNP or indels) and jeopardize the local assembler heuristics to disentangle sequencing error from true mutations. The coverage depth has a deep impact of the number of haplotypes that were reconstructed. For example, all the VIV samples showed a higher number of reconstructed haplotypes mainly due to a good sequencing depth. Indeed, it is hard to compare the number of viral haplotypes between samples from pool of larvae or juvenile and from one individual host. In summary, the number of viral haplotypes that segregate into infected individual is higher than unit except for samples Poole-Harbor United Kingdom and VIV57-11-m, but both samples have a rather low sequencing depth, and the Poole harbor isolate had previously been passaged through an oyster in a controlled environment. The variation of the number of haplotypes across the viral genome is hard to interpret but some regions seem to harbor more haplotypes nearly inverted and repeated regions.

FIGURE 2.

FIGURE 2

Reconstructed local viral haplotypes for four samples using Shorah with amplicon mode (Zagordi et al., 2011). The X-axis represents the genomic position within the reference viral genome (Davison et al., 2005). The Y-axis is the number of reconstructed viral haplotypes on the focal genomic position. The vertical-colored bands represent the span of the coding DNA sequence (CDS) of the reference viral genome (Davison et al., 2005), one color represents one CDS numbered from 1 to 127. The top panels above the plots represent the coverage depth across the viral genome. Low or no sequencing depth on a genomic region could be interpreted as genomic deletion on the considered samples.

Deletions, Substitution Rate and Most Recent Common Ancestors

The linear regression analysis showed that the accumulation of substitutions is correlated with time and not driven by a hidden species effect (DF = 6/9, F = 98.68, p value = 1.07 × 10–7, time effect estimate = 0.12, time p value = 0.023). The greatest modalities of the stage did not have any effect except for the juveniles (or pool of juveniles), however, the unbalanced nature of these factors precludes any serious inferences. In comparison, accumulation of short deletions did not increase with time (model1, DF = 6/9, F = 113.8, p value = 5.72 × 10–8, time effect estimate = 0.02, time p value = 0.68). It should be noted, the accumulation of deletions is far above the accumulation of substitutions and seems to be the main process of sequence evolution over time (Figure 3). Our reference mapping approach gives a biased picture of the insertion events into OsHV-1 genome as only sequences which are present in the reference genome can be detected and hence any new sequence inserts larger than read length will be lost during alignment process. Despite this limitation, 89 short insertions could be detected across 22 samples independently of the oyster species and the sampling date. Another important feature of OsHV-1 genome evolution is the presence of large deletions (Figure 3, Supplementary Table 1, and Supplementary Figure 4). The closest sample (PR) in time (and region) compare to OsHV-1 reference genome (Davison et al., 2005) shows no deletion whereas the sample taken from O. edulis in 2012 showed four large deletions (>>100 bp). A careful visual examination of reads alignment (Figure 3 and Supplementary Figure 1) reveals numerous abrupt drop sequencing depths associated with clipped reads in the CIGAR motif. Both local sequencing depth variation and read clipping were used to infer large deletions and other structural variations (Supplementary Table 1). These large deletions do not seem to be random across the genome, which means that between isolates, deletions did not occur exactly at the same position but are located in the same genomic regions (Supplementary Figure 3, vertical reddish bands).

FIGURE 3.

FIGURE 3

Functional view of DNA variation (mutations) density along the genome of Ostreid herpesvirus 1 (OsHV-1). Here only LI, PR, NZ17, and SW6 are shown. The X-axis is the number of CDS found in the reference annotation genome last accessed (Davison et al., 2005) and the Y-axis is a standardized count of mutations found in 1,000 bp if the SNP density where the same in the targeted CDS. Genomic regions with a low complexity content (repeated sequences) were not considered in this analysis (light gray arrows). SNP, single nucleotide polymorphism; INS, short insertion (<50 bp); DEL, short insertion (<50 bp); MNP, short complex polymorphism, dark gray arrows: large deletion (>50 bp). Figures of all samples (detailed Table 1) are given in Supplementary Files.

Regression of root-to-tip genetic distance against sampling time (Supplementary Figure 5) exposed sufficient temporal signal to proceed with phylogenetic molecular clock analyses. BEAST (Drummond et al., 2012) estimate of the coefficient of variation (CoV) was of 0.701 (95% HPD interval [0.4078, 1.0581]), indicating that strict clock-like evolution could be rejected and validating our choice to use a relaxed molecular clock. OsHV-1 viruses exhibited a mean evolutionary rate of 6.787E-05 nucleotide substitutions per site per year (95% HPD interval [3.5172E-05, 1.0335E-04]). The results indicate that the present genetic diversity seen in OsHV-1 isolates has arisen within the past three decades (Figure 4). Both herpesviruses found in C. farreri (GQ153938.1) and A. broughtonii (MF509813.1) group together (Ren et al., 2013; Xia et al., 2015). Their most recent common ancestor (MRCA) is very recent (approximatively 2001) and both species live in sympatry. The virus detected in O. edulis (SW6) is close to the reference genome (NC_005881.2) and from the OsHV-1 isolate collected from France in 2008 (PR) (Davison et al., 2005). PR and SW6 have their MRCA dated from 1993. Within the C. gigas host cluster, viral isolates cluster by provenance and sampling year. For example, all isolates from New-Zealand collected in 2010–2011 group together and appear distinct from isolates collected in Europe in 2010, 2015, and 2017 (MV, LI, Poole-Harbour United Kingdom, and all VIV samples). OsHV-1 μVars cluster is represented by the LI and MV isolates from France in 2010 and by both μVar variant A and B (KY242785.1 and KY271630.1) that were isolated from France and Ireland between 2010 and 2014 (Burioli et al., 2017). As expected, all samples from the same epidemic event in 2017 clustered together.

FIGURE 4.

FIGURE 4

Time-scaled phylogenetic tree of OsHV-1 and chlamys acute necrotic samples. Maximum clade credibility time-calibrated phylogenetic tree generated in BEAST. The tree is scaled in years, with the final sampling year being 2017. The branches are colored accordingly to their mean height across the posterior. This figure was produced with FigTree v1.4.3 (Rambaut, 2019).

Discussion

Summary of the Main Findings

This study brings new insights about the diversity of OsHV-1 at the genomic scale. We show that the OsHV-1 genome has accumulated a large number of mutations since the publication of the reference genome in 2005 obtained from an infected oyster sampled in 1999. Deletions dominate the mutational landscape at the genome level, but our mapping reference approach is much powerful to detect deletions rather than insertions. Our phylogenetic analysis reveals that OsHV-1 diversity is structured by the host species (pectinids, clams, and oyster species), the geographical origin (Europe vs. Oceania) and most importantly the sampling date.

OsHV-1, a Measurably Evolving Pathogen

Examining the correlation between root-to-tip genetic distance and time suggests OsHV-1 is a measurably evolving pathogen when working at with whole-genome sequencing. The inferred mean evolutionary rate of 6.787E-05 nucleotide substitutions per site per year of OsHV-1 was higher than would be expected under the theory of co-evolution with host species. Potential biases inherent to the analysis conducted have been pointed out (Ho et al., 2005; Emerson, 2007; Debruyne and Poinar, 2009). In particular, it has been demonstrated that molecular rates are accelerated at short timescales on population level studies (Ho, 2005). The most likely explanation for this phenomenon is the persistence of slightly deleterious mutations, which, through purifying selection, would have been eliminated over long timescales (Ho et al., 2005, 2007a,2007b; Ho and Larson, 2006). While conducting selective pressure analyses on a bigger sampling would allow to test that our results were not a consequence of the time dependency of the molecular clock due to short time range of our dataset (i.e., >20 years), our findings are consistent with studies supporting that the evolutionary rates of some dsDNA viruses are comparable to those of RNA and single strand DNA viruses (Hughes et al., 2010; Patrício et al., 2012). Although we acknowledge current limitations of our study, mainly due to the small sample size available, our results demonstrate the feasibility of applying population genetic theory tools such as phylodynamics to a larger and well-designed sampling including virus from healthy animals (Burioli et al., 2016) of whole genome sequences of OsHV-1 and other herpesviruses.

OsHV-1 Within Host Diversity: Methodological Challenges and Biological Explanations

Viral diversity within infected individuals host is still hard to characterize because of two main biological viral features (Baaijens et al., 2017): (i) the abundance of variants is unknown and (ii) the great diversity and related high mutation rate could challenge classical tools used in bioinformatics (variant calling operations). This problem is even more acute when the abundance of variants is close to the sequencing error rate. Traditionally, in malacoherpesvirus literature, most of the studies have used de novo assembly methods to unravel viral diversity (Davison et al., 2005; Ren et al., 2013; Burioli et al., 2017; Bai et al., 2019a). However, at January 2018, most of the available short read assembly methods don’t manage the mixture of haplotypes well, as they are not able to use the molecular phasing information from the reads (Baaijens et al., 2017). Indeed, we choose a pure mapping reference approach to describe viral diversity coupled with local de novo assembly. This process allows us to uncover the viral diversity within host. Our estimates about the true number of viral haplotypes within infected individuals were very conservative (lower bound) since we are not able to phase variants above 150 bp. To explain the source of the diversity within an infected individual, several non-exclusives hypotheses could be formulated: First, diversity within host is due to a multiple infection by several viral lineages. This means that several OsHV-1 haplotypes could be found in sea water during mortality outbreak. However, viral particles in sea water are hard to detect even when using passive sensor systems [lab detection of virus in seawater (Vincent-Hubert et al., 2017) and Hubert in prep (in situ detection of virus in sea water)]. Some alternatives approaches using ultra-filtration of large sea water volume are a promising way to check this hypothesis and we would help in finding if various haplotypes are present at the onset of the disease. Again, using ultra-filtration methods of sea water, we could expect an increase of the number of variants during the spread of the disease through a population. Such hypothesis could also be tested throughout the genomic relatedness among viral variants within individuals and variants from the previous years. The last hypothesis concerns de novo mutations during the infection process, viruses use host cellular machinery to replicate, there are several mechanisms which could give arise to de novo variants mediated by host antiviral defense systems (Sanjuán and Domingo-Calap, 2016).

Recently, extensive anti-viral mechanisms were characterized; Rosani et al. (2019) demonstrated that Adenosine deaminase enzymes of the ADAR family are found in the Pacific oyster and have the ability to post-transcriptionally modify OsHV-1 RNA in the form of A-to-I conversions. Indeed, investigators showed that viral genome have evolved to reduce the number of deamination targets along most of OsHV-1 CDS. The life cycle of herpes virus is also prone to mutation via recombination mechanisms which have been demonstrated in other herpesvirus or close-related as cytomegalovirus in human (Wilkinson and Weller, 2003; Norberg et al., 2007; Sijmons et al., 2015; Balloux et al., 2016; Kolb et al., 2017; Tomer et al., 2019). Although two DNA Recombination-Initiating Promoter Motifs have been reported as enriched among Malacoherpesviridae (Rosani and Venier, 2017), the evidence of such mechanisms is yet to be characterized in Malacoherpesviridae. The evolutionary advantage of having multiple viral haplotypes within one infected individual is of prime interest, since lethal or sub-lethal viral haplotypes could be maintained in the population thank to genetic complementation and be an additional reservoir of standing genetic variation (Segredo-Otero and Sanjuán, 2019).

Large Genomic Deletions Are a Good Markers of Virus Origin

In this study, we were able to compare the genomic structure of various viral isolates from several host species. All isolates were compared to the oldest reference genome published (Davison et al., 2005) which is likely to be the ancestor of most of our samples. Reads from all the samples were mapped to the Davison reference genome, however, visual inspection of sequencing depth and structural variant analysis reveals tremendous genetic variation between virus isolates. Notably virus isolates from O. edulis have two specific deletions in ORF101 (48 bp deleted) and ORF105 (49 bp deleted) that are not shared with the other isolates. We hypothesize that this particular haplotype who is attached to O. edulis (named OsHV-1 SW6) is a virus strain “adapted” to infect this species. If our expectation is true, this haplotype will be found in other infected O. edulis population. In addition, isolates from New-Zealand (NZ16-NZ17-NZ18) share only two of the three observed larges deletions identified in France isolates (LI) probably because the New-Zealand samples are from a different virus reservoir. The exact molecular mechanism(s) of such deletion(s) are unknown, but they are not deleterious for the viral replication as they are maintained in the actual viral population. In addition, deletions in the OsHV-1 are not completely random across the samples they tend to occur in the regions 17,700–19,000 bp (ORF11), 52,000–52,800 bp (ORF35, ORF36, and ORF37), 67,972–68,573 bp (mainly ORF48), 93,000–98,500 bp (ORF62, ORF63, and ORF64). Burioli et al. (2017) observed the same deletions as we observe in the LI and VIV samples. The scarcities of annotation in these regions do not help us to formulate a functional hypothesis.

Refining Malacoherpiviradae Classification to Get a Better Understanding of Viral Micro Evolution

Ostreid herpesvirus 1 is usually described as a generalist herpesvirus that is able to infect various marine bivalves species across the world (Arzul et al., 2001, 2017; Davison et al., 2005; Gao et al., 2018; Kim et al., 2019). However, our phylogenetic analysis shows that such view is probably too simple and more elaborate hypothesis are needed. Recently, OsHV-1 found on two other bivalve species (C. farreri and A. broughtonii mainly in China) (Xia et al., 2015; Bai et al., 2017, 2018, 2019b) seems to have a recent common ancestor, the MRCA is dated from 1991 to 2006. In addition, there is a growing number of report of OsHV-1 DNA in novel host species Scapharca subcrenata and Agropecten irridians (Gao et al., 2018; Kim et al., 2019). In addition, we also observed a unique type of deletion in this isolate (Figure 3, SW6 sample) compare to other isolates from C. gigas. All these inferred host shifts could be explained by two hypotheses: (1) We are revealing an old viral diversity, i.e., various viral haplotypes are adapted to various host species, but this diversity was not accessible to previous studies because of technological limitations (MLST, gene concatenation, microsatellite, and Sanger sequencing). However, this hypothesis does explain why MRCA are so recent (less than 50 years). (2) We are observing an ongoing specialization (speciation) of a generalist virus [OsHV-1 census (Davison et al., 2005)] to a specialized viral populations OsHV-1-SB on A. broughtonii, AVNV on C. farreri, OsHV-1 Var on P. maximus, OsHV-1 μVars on C. gigas, and now OsHV-1 SW6 on O. edulis, a mechanism possibly supported by the different host antiviral responses. For the later haplotype, we hypothesize a host shift from C. gigas to O. edulis and the estimated date of this shift (1992) which is congruent with the history of the introduction of C. gigas in Scandinavia (Wrange et al., 2010; Mortensen et al., 2016; Anglès d’Auriac et al., 2017). These recent host shifts could be explained by two mechanisms well documented in other farming animal and plants (Jones et al., 2013; Bai et al., 2015; Mineur et al., 2015). The first mechanism is the introduction of C. gigas (and OsHV-1) world-wide for its extraordinary adaptive capacity and the second is an intensification of C. gigas farming (imported spat, spat from hatcheries, high field density, triploid’s oyster, and extension of field surfaces). Indeed, OsHV-1 could reach quite high number of genome copies per infection events since the cultivated density of susceptible animals are much higher than natural beds, leading to an increase of contact with a larger number of bivalve’s taxa than ever. Host shift is likely to be just a matter of time because of the abundant genomic viral diversity. We postulate that OsHV-1 taxon is under specialization in novel populations and/or bivalve species due to world-wide expansion of C. gigas shellfish farming in the 1970s.

In order to discriminate between the first and the second hypothesis regarding variants, we need more detailed study of isolates from various bivalve species, most notably including old and novel sympatric species of C. gigas. In fact, we need a better understanding of how OsHV-1 is constrained, or not, to a specific host species and how frequently it could jump from one species to another species (Brito et al., 2021). Only well-replicated phylogenetic studies encompassing the whole viral genome information will help-us to better understand OsHV-1 ecology and evolution.

Conclusion and Perspectives

We have characterized the viral genomic diversity within an infected individual; our estimates are probably lower bounds of the exact number of haplotypes. To refine and notably to understand complementation between haplotypes within individuals new techniques, such as long read sequencing, will be required (Bai et al., 2019a). The observed viral diversity is generally well structured according to host species, then geographical origins and finally sampling date. There is an urgent need to assess the effect of some mutations on viral phenotype namely pathogenicity (virulence and infectivity) for a given host species and populations, including basic information about which viral haplotype can infect which host species. These crucial questions about virus ecology should be explored to be able to understand the potential of OsHV-1 to jump from one species to the next.

This study thus lays the groundwork for future studies to further analyze the relationships between viral diversity and pathogenicity across bivalve species and strains.

Data Availability Statement

The data presented in the study are deposited in the NCBI Sequence Read Archive (SRA) under repository, accession number of the BioProject ID: PRJNA721248.

Author Contributions

This study is the result of a collective work. BM, J-BL, J-FP, and TR conceived this study, and participated in its design. SH and CP performed the sample preparation for Illumina sequencing. J-BL, AB, MJ, and BM performed genomic analyses. BM, MJ, and J-BL interpreted the results and drafted the manuscript. All authors read, corrected, and approved the final manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors thank the UMR 8199 LIGAN-PM Genomics platform (Lille, France) which belongs to the “Fédération de Recherche” 3508 Labex EGID (European Genomics Institute for Diabetes; ANR-10-LABX-46) and was supported by the ANR Equipex 2010 session (ANR-10-EQPX-07-01; “LIGAN-PM”). The LIGAN-PM Genomics platform (Lille, France) is also supported by the FEDER and the Region Nord-Pas-de-Calais-Picardie. The authors acknowledge the Pôle de Calcul et de Données Marines (PCDM) for providing DATARMOR storage, computational resources, and support services (URL: http://www.ifremer.fr/pcdm) with a special attention to Bioinformatic team: Laure Quintric and Patrick Durand. The authors thank Isabelle Arzul for the fruitful discussions about OsHV-1 biology.

Footnotes

Funding. This work was financially supported by DGAL (French National Services of Food and Agriculture) through the National Reference Laboratory for Mollusc Diseases, Ifremer, La Tremblade and by the EU funded project VIVALDI (H2020 program, n°678589) within the WP 3.1.1 part about genetic of resistance to OsHV-1 infection.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2021.711377/full#supplementary-material

Supplementary Figure 1

Correlation between the assessed viral copy numbers per μL per ng of gills from infected individuals for all the individuals from VIV batches and the sequencing depth of each sample. The labels of the sample should be interpreted as the family name (i.e., viv57) followed by the individual number (viv57-11) and the final status at the end of the experimental infection “v” stand for alive (viv57-11-v3) and “m” for dead (viv57-11-m99).

Supplementary Figure 2

Reconstructed local viral haplotypes for four samples using Haploclique (Töpfer et al., 2013). The X-axis represents a genomic position within OsHV-1 reference genome in base pair; the Y-axis is the number of haplotypes reconstituted at the focal genomic position. Large deletions are visible in the sequencing top panels and in the number of reconstructed haplotypes.

Supplementary Figure 3

Local viral haplotype reconstruction using QuasiRecomb software on genomic regions without any abrupt drop of sequencing depth (large deletions). Reconstructions have been done on CDS basis (using the reference genome annotation from Davison et al. (2005). The X-axis represents the genomic position within the reference viral genome. The Y-axis is the number of quasispecies. The black number corresponding to ORF. In red, we represented the larges deletion found in the consensus viral genome of each isolates, orange illustrate genomic regions where the alignment was not good enough to allow haplotype reconstruction. Large deletions were detected after visual inspection of bam file and sniffles detection.

Supplementary Figure 4

Illustration of the variety of polymorphism found in OsHV-1 genome mapped against the reference genome from Davison et al. (2005). Some visual patterns observed in Bam files motivate the search of viral haplotypes. Here both Bam and Vcf files are viewed using IGV (Thorvaldsdóttir et al., 2013). The blue arrowed highlights some detected polymorphisms and red arrows point out undetected potential variants. Horizontal arrows showed the large deletions, i.e., structural variations, and vertical arrows pointed out short polymorphism, i.e., short indel and SNPs. Each picture (one per sample) is composed of a bandplot showing the sequencing depth (top panel) of the considered genomic regions followed by the alignment of individual reads (bottom panel). The sequencing depth is between 0 and 4,000 X for the LI and SW6 samples, and 0–190 for NZ17 samples. (A) The observed region is from 94,000 to 100,000 bp on LI, NC17, and SW6 isolates. (B) The observed region is from 71,400 to 73,000 bp. (C) Zoom in the 77,900–79,000 bp in the NZ17 samples, it seems that there is a physical linkage between a short deletion and a SNP leading to conclusion that there are at least two haplotypes segregating in this region. (D) Zoom in the 21,700–22,800 bp region of the SW6 samples. The extremely large sequence depth of this sample allows to infer that few haplotypes segregating in this population show duplication.

Supplementary Figure 5

Clock-likeness of OsHV-1 genomes. Regression of root-to-tip genetic distance against sampling time were computed using TempEst v1.5.3 (Tan et al., 2015) using two rooting methods (best fit in TempEst and midpoint). Sufficient and robust temporal signal was observed to proceed with phylogenetic molecular clock analyses.

References

  1. Abbadi M., Zamperin G., Gastaldelli M., Pascoli F., Rosani U., Milani A., et al. (2018). Identification of a newly described OsHV-1 μvar from the North Adriatic Sea (Italy). J. Gen. Virol. 99 693–703. 10.1099/jgv.0.001042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Anglès d’Auriac M. B., Rinde E., Norling P., Lapègue S., Staalstrøm A., Hjermann D. Ø, et al. (2017). Rapid expansion of the invasive oyster Crassostrea gigas at its northern distribution limit in Europe: naturally dispersed or introduced? PLoS One 12:e0177481. 10.1371/journal.pone.0177481 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Arzul I., Corbeil S., Morga B., Renault T. (2017). Viruses infecting marine molluscs. J. Invertebr. Pathol. 147 118–35. 10.1016/j.jip.2017.01.009 [DOI] [PubMed] [Google Scholar]
  4. Arzul I., Nicolas J.-L., Davison A. J., Renault T. (2001). French scallops: a new host for ostreid herpesvirus-1. Virology 290 342–359. 10.1006/viro.2001.1186 [DOI] [PubMed] [Google Scholar]
  5. Baaijens J. A., Aabidine A. Z. E., Rivals E., Schönhuth A. (2017). De novo assembly of viral quasispecies using overlap graphs. Genome Res. 27 835–848. 10.1101/gr.215038.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bai C., Gao W., Wang C., Yu T., Zhang T., Qiu Z., et al. (2016). Identification and characterization of ostreid herpesvirus 1 associated with massive mortalities of Scapharca broughtonii broodstocks in China. Dis. Aquat. Organ. 118 65–75. 10.3354/dao02958 [DOI] [PubMed] [Google Scholar]
  7. Bai C., Wang C., Xia J., Sun H., Zhang S., Huang J. (2015). Emerging and endemic types of ostreid herpesvirus 1 were detected in bivalves in China. J. Invertebr. Pathol. 124 98–106. 10.1016/j.jip.2014.11.007 [DOI] [PubMed] [Google Scholar]
  8. Bai C.-M., Morga B., Rosani U., Shi J., Li C., Xin L.-S., et al. (2019a). Long-range PCR and high-throughput sequencing of ostreid herpesvirus 1 indicate high genetic diversity and complex evolution process. Virology 526 81–90. 10.1016/j.virol.2018.09.026 [DOI] [PubMed] [Google Scholar]
  9. Bai C.-M., Rosani U., Xin L.-S., Li G.-Y., Li C., Wang Q.-C., et al. (2018). Dual transcriptomic analysis of ostreid herpesvirus 1 infected Scapharca broughtonii with an emphasis on viral anti-apoptosis activities and host oxidative bursts. Fish Shellfish Immunol. 82 554–564. 10.1016/j.fsi.2018.08.054 [DOI] [PubMed] [Google Scholar]
  10. Bai C.-M., Wang Q.-C., Morga B., Shi J., Wang C.-M. (2017). Experimental infection of adult Scapharca broughtonii with ostreid herpesvirus SB strain. J. Invertebr. Pathol. 143 79–82. 10.1016/j.jip.2016.12.001 [DOI] [PubMed] [Google Scholar]
  11. Bai C.-M., Xin L.-S., Rosani U., Wu B., Wang Q.-C., Duan X.-K., et al. (2019b). Chromosomal-level assembly of the blood clam, Scapharca (Anadara) broughtonii, using long sequence reads and Hi-C. Gigascience 8:giz067. 10.1093/gigascience/giz067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Balloux F., Lassalle F., Tutill H. J., Breuer J., Reeves M. B., Christiansen M. T., et al. (2016). Islands of linkage in an ocean of pervasive recombination reveals two-speed evolution of human cytomegalovirus genomes. Virus Evol. 2:vew017. 10.1093/ve/vew017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Biek R., Pybus O. G., Lloyd-Smith J. O., Didelot X. (2015). Measurably evolving pathogens in the genomic era. Trends Ecol. Evol. 30 306–313. 10.1016/j.tree.2015.03.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bookelaar B., Lynch S. A., Culloty S. C. (2020). Host plasticity supports spread of an aquaculture introduced virus to an ecosystem engineer. Parasites Vectors 13:498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Brito A. F., Baele G., Nahata K. D., Grubaugh N. D., Pinney J. W. (2021). Intrahost speciations and host switches played an important role in the evolution of herpesviruses. Virus Evol. 7:veab025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Burioli E. A. V., Prearo M., Houssin M. (2017). Complete genome sequence of ostreid herpesvirus type 1 μVar isolated during mortality events in the Pacific oyster Crassostrea gigas in France and Ireland. Virology 509 239–251. 10.1016/j.virol.2017.06.027 [DOI] [PubMed] [Google Scholar]
  17. Burioli E. A. V., Prearo M., Riina M. V., Bona M. C., Fioravanti M. L., Arcangeli G., et al. (2016). Ostreid herpesvirus type 1 genomic diversity in wild populations of Pacific oyster Crassostrea gigas from Italian coasts. J. Invertebr. Pathol. 137 71–83. 10.1016/j.jip.2016.05.004 [DOI] [PubMed] [Google Scholar]
  18. Clark K., Karsch-Mizrachi I., Lipman D. J., Ostell J., Sayers E. W. (2016). GenBank. Nucleic Acids Res. 44 D67–D72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Cudini J., Roy S., Houldcroft C. J., Bryant J. M., Depledge D. P., Tutill H., et al. (2019). Human cytomegalovirus haplotype reconstruction reveals high diversity due to superinfection and evidence of within-host recombination. Proc. Natl. Acad. Sci. U.S.A. 116 5693–5698. 10.1073/pnas.1818130116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Davison A. J., Trus B. L., Cheng N., Steven A. C., Watson M. S., Cunningham C., et al. (2005). A novel class of herpesvirus with bivalve hosts. J. Gen. Virol. 86 41–53. 10.1099/vir.0.80382-0 [DOI] [PubMed] [Google Scholar]
  21. Debruyne R., Poinar H. N. (2009). Time dependency of molecular rates in ancient DNA data sets, a sampling artifact? Syst. Biol. 58 348–360. 10.1093/sysbio/syp028 [DOI] [PubMed] [Google Scholar]
  22. Delmotte J., Chaparro C., Galinier R., de Lorgeril J., Petton B., Stenger P.-L., et al. (2020). Contribution of viral genomic diversity to oyster susceptibility in the Pacific oyster mortality syndrome. Front. Microbiol. 11:1579. 10.3389/fmicb.2020.01579 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Drummond A. J., Suchard M. A., Xie D., Rambaut A. (2012). Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29 1969–1973. 10.1093/molbev/mss075 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. EFSA (2010). Scientific opinion on the increased mortality events in Pacific oysters, Crassostrea gigas. EFSA J. 8:1894. 10.2903/j.efsa.2010.1894 [DOI] [Google Scholar]
  25. Emerson B. C. (2007). Alarm bells for the molecular clock? No support for Ho et al.’s model of time-dependent molecular rate estimates. Syst. Biol. 56 337–345. 10.1080/10635150701258795 [DOI] [PubMed] [Google Scholar]
  26. Gao F., Jiang J.-Z., Wang J.-Y., Wei H.-Y. (2018). Real-time quantitative isothermal detection of ostreid herpesvirus-1 DNA in Scapharca subcrenata using recombinase polymerase amplification. J. Virol. Methods 255 71–75. 10.1016/j.jviromet.2018.02.007 [DOI] [PubMed] [Google Scholar]
  27. Garcia C., Thébault A., Dégremont L., Arzul I., Miossec L., Robert M., et al. (2011). Ostreid herpesvirus 1 detection and relationship with Crassostrea gigas spat mortality in France between 1998 and 2006. Vet. Res. 42:73. 10.1186/1297-9716-42-73 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Garrison E. (2019). A Simple C++ Library for Parsing and Manipulating VCF Files, + many Command-Line Utilities. GitHub Repository. Available online at: https://github.com/vcflib/vcflib (accessed October, 2020). [Google Scholar]
  29. Garrison E., Marth G. (2012). Haplotype-based variant detection from short-read sequencing. arXiv [Preprint] 12073907. arXiv:1207.3907 [Google Scholar]
  30. Guindon S., Dufayard J.-F., Lefort V., Anisimova M., Hordijk W., Gascuel O. (2010). New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59 307–321. 10.1093/sysbio/syq010 [DOI] [PubMed] [Google Scholar]
  31. Heled J., Bouckaert R. R. (2013). Looking for trees in the forest: summary tree from posterior samples. BMC Evol. Biol. 13:221. 10.1186/1471-2148-13-221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ho S., Larson G. (2006). Molecular clocks: when timesare a-changin’. Trends Genet. 22 79–83. 10.1016/j.tig.2005.11.006 [DOI] [PubMed] [Google Scholar]
  33. Ho S. Y. W. (2005). Accuracy of rate estimation using relaxed-clock models with a critical focus on the early metazoan radiation. Mol. Biol. Evol. 22 1355–1363. 10.1093/molbev/msi125 [DOI] [PubMed] [Google Scholar]
  34. Ho S. Y. W., Kolokotronis S.-O., Allaby R. G. (2007a). Elevated substitution rates estimated from ancient DNA sequences. Biol. Lett. 3 702–705. 10.1098/rsbl.2007.0377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Ho S. Y. W., Phillips M. J., Cooper A., Drummond A. J. (2005). Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Mol. Biol. Evol. 22 1561–1568. 10.1093/molbev/msi145 [DOI] [PubMed] [Google Scholar]
  36. Ho S. Y. W., Shapiro B., Phillips M. J., Cooper A., Drummond A. J. (2007b). Evidence for time dependency of molecular rate estimates. Syst. Biol. 56 515–522. 10.1080/10635150701435401 [DOI] [PubMed] [Google Scholar]
  37. Hughes A. L., Irausquin S., Friedman R. (2010). The evolutionary biology of poxviruses. Infect. Genet. Evol. 10 50–59. 10.1016/j.meegid.2009.10.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Jaramillo N., Domingo E., Muñoz-Egea M. C., Tabarés E., Gadea I. (2013). Evidence of Muller’s ratchet in herpes simplex virus type 1. J. Gen. Virol. 94 366–375. 10.1099/vir.0.044685-0 [DOI] [PubMed] [Google Scholar]
  39. Jenkins C., Hick P., Gabor M., Spiers Z., Fell S., Gu X., et al. (2013). Identification and characterisation of an ostreid herpesvirus-1 microvariant (OsHV-1 μ-var) in Crassostrea gigas (Pacific oysters) in Australia. Dis. Aquat. Organ. 105 109–126. 10.3354/dao02623 [DOI] [PubMed] [Google Scholar]
  40. Jones B. A., Grace D., Kock R., Alonso S., Rushton J., Said M. Y., et al. (2013). Zoonosis emergence linked to agricultural intensification and environmental change. Proc. Natl. Acad. Sci. U.S.A. 110 8399–8404. 10.1073/pnas.1208059110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Katoh K., Standley D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30 772–780. 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Kim H. J., Jun J. W., Giri S. S., Yun S., Kim S. G., Kim S. W., et al. (2019). Mass mortality in Korean bay scallop (Argopecten irradians) associated with ostreid herpesvirus-1 μVar. Transbound. Emerg. Dis. 66 1442–1448. [DOI] [PubMed] [Google Scholar]
  43. Kitchen A., Shapiro B., Firth C., Holmes E. C., Suchard M. A., Rambaut A. (2010). Using time-structured data to estimate evolutionary rates of double-stranded DNA viruses. Mol. Biol. Evol. 27 2038–2051. 10.1093/molbev/msq088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Kolb A. W., Lewin A. C., Moeller Trane R., McLellan G. J., Brandt C. R. (2017). Phylogenetic and recombination analysis of the herpesvirus genus Varicellovirus. BMC Genomics 18:887. 10.1186/s12864-017-4283-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Li H. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27 2987–2993. 10.1093/bioinformatics/btr509 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Li H. (2013). Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv [Preprint] 13033997[q–bio]. arXiv:1303.3997 [q-bio.GN] [Google Scholar]
  47. Li H., Durbin R. (2009). Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25 1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25 2078-2079. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Mineur F., Provan J., Arnott G. (2015). Phylogeographical analyses of shellfish viruses: inferring a geographical origin for ostreid herpesviruses OsHV-1 (Malacoherpesviridae). Mar. Biol. 162 181–192. 10.1007/s00227-014-2566-8 [DOI] [Google Scholar]
  50. Minin V. N., Bloomquist E. W., Suchard M. A. (2008). Smooth skyride through a rough skyline: Bayesian coalescent-based inference of population dynamics. Mol. Biol. Evol. 25 1459–1471. 10.1093/molbev/msn090 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Mortensen S., Strand Å, Bodvin T., Alfjorden A., Skår C., Jelmert A., et al. (2016). Summer mortalities and detection of ostreid herpesvirus microvariant in Pacific oyster Crassostrea gigas in Sweden and Norway. Dis. Aquat. Organ. 117 171–176. 10.3354/dao02944 [DOI] [PubMed] [Google Scholar]
  52. Nicolas J. L., Comps M., Cochennec N. (1992). Herpes-like virus infecting Pacific oyster larvae, Crassostrea gigas. Bull. Eur. Assoc. Fish Pathol. 12 11–13. [Google Scholar]
  53. Norberg P., Kasubi M. J., Haarr L., Bergström T., Liljeqvist J. -Å. (2007). Divergence and recombination of clinical herpes simplex virus type 2 isolates. J. Virol. 81:13158. 10.1128/JVI.01310-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Olson J. K., Ercolini A. M., Miller S. D. (2005). A virus-induced molecular mimicry model of multiple sclerosis. Curr. Top. Microbiol. Immunol. 296 39–53. 10.1007/3-540-30791-5_3 [DOI] [PubMed] [Google Scholar]
  55. Parsons L. R., Tafuri Y. R., Shreve J. T., Bowen C. D., Shipley M. M., Enquist L. W., et al. (2015). Rapid genome assembly and comparison decode intrastrain variation in Human alphaherpesviruses. mBio 6:e02213-14. 10.1128/mBio.02213-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Patrício A. R., Herbst L. H., Duarte A., Vélez-Zuazo X., Santos Loureiro N., Pereira N., et al. (2012). Global phylogeography and evolution of chelonid fibropapilloma-associated herpesvirus. J. Gen. Virol. 93 1035-45. 10.1099/vir.0.038950-0 [DOI] [PubMed] [Google Scholar]
  57. Paul-Pont I., Dhand N. K., Whittington R. J. (2013). Spatial distribution of mortality in Pacific oysters Crassostrea gigas: reflection on mechanisms of OsHV-1 transmission. Dis. Aquat. Organ. 105 127–138. 10.3354/dao02615 [DOI] [PubMed] [Google Scholar]
  58. Posada D. (2008). jModelTest: phylogenetic model averaging. Mol. Biol. Evol. 25 1253–1256. 10.1093/molbev/msn083 [DOI] [PubMed] [Google Scholar]
  59. Quinlan A. R., Hall I. M. (2010). BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26 841–842. 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. R Development Core Team (2008). R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing. [Google Scholar]
  61. Rambaut A. (2019). FigTree. Available online at: https://github.com/rambaut/figtree (accessed April, 2021). [Google Scholar]
  62. Rambaut A., Lam T. T., Max Carvalho L., Pybus O. G. (2016). Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2:vew007. 10.1093/ve/vew007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Ren W., Chen H., Renault T., Cai Y., Bai C., Wang C., et al. (2013). Complete genome sequence of acute viral necrosis virus associated with massive mortality outbreaks in the Chinese scallop, Chlamys farreri. Virol. J. 10:110. 10.1186/1743-422X-10-110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Renault T., Moreau P., Faury N., Pepin J.-F., Segarra A., Webb S. (2012). Analysis of clinical ostreid herpesvirus 1 (Malacoherpesviridae) specimens by sequencing amplified fragments from three virus genome areas. J. Virol. 86 5942–5947. 10.1128/JVI.06534-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Renault T., Tchaleu G., Faury N., Moreau P., Segarra A., Barbosa-Solomieu V., et al. (2014). Genotyping of a microsatellite locus to differentiate clinical ostreid herpesvirus 1 specimens. Vet. Res. 45:3. 10.1186/1297-9716-45-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Renner D. W., Szpara M. L. (2017). Impacts of genome-wide analyses on our understanding of human herpesvirus diversity and evolution. J. Virol. 92:e00908-17. 10.1128/JVI.00908-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Renzette N., Bhattacharjee B., Jensen J. D., Gibson L., Kowalik T. F. (2011). Extensive genome-wide variability of human cytomegalovirus in congenitally infected infants. PLoS Pathog. 7:e1001344. 10.1371/journal.ppat.1001344 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Retel C., Märkle H., Becks L., Feulner P. (2019). Ecological and evolutionary processes shaping viral genetic diversity. Viruses 11:220. 10.3390/v11030220 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Rosani U., Bai C.-M., Maso L., Shapiro M., Abbadi M., Domeneghetti S., et al. (2019). A-to-I editing of Malacoherpesviridae RNAs supports the antiviral role of ADAR1 in mollusks. BMC Evol. Biol. 19:149. 10.1186/s12862-019-1472-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Rosani U., Venier P. (2017). Oyster RNA-seq data support the development of Malacoherpesviridae genomics. Front. Microbiol. 8:1515. 10.3389/fmicb.2017.01515 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Sanjuán R., Domingo-Calap P. (2016). Mechanisms of viral mutation. Cell. Mol. Life Sci. 73 4433–4448. 10.1007/s00018-016-2299-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Sanjuán R., Nebot M. R., Chirico N., Mansky L. M., Belshaw R. (2010). Viral mutation rates. J. Virol. 84 9733–9748. 10.1128/JVI.00694-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Sedlazeck F. J., Rescheneder P., Smolka M., Fang H., Nattestad M., von Haeseler A., et al. (2018). Accurate detection of complex structural variations using single-molecule sequencing. Nat. Methods 15 461–468. 10.1038/s41592-018-0001-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Segarra A., Pépin J. F., Arzul I., Morga B., Faury N., Renault T. (2010). Detection and description of a particular ostreid herpesvirus 1 genotype associated with massive mortality outbreaks of Pacific oysters, Crassostrea gigas, in France in 2008. Virus Res. 153 92–99. 10.1016/j.virusres.2010.07.011 [DOI] [PubMed] [Google Scholar]
  75. Segredo-Otero E., Sanjuán R. (2019). The effect of genetic complementation on the fitness and diversity of viruses spreading as collective infectious units. Virus Res. 267 41–48. 10.1016/j.virusres.2019.05.005 [DOI] [PubMed] [Google Scholar]
  76. Sijmons S., Thys K., Mbong Ngwese M., Van Damme E., Dvorak J., Van Loock M., et al. (2015). High-throughput analysis of human cytomegalovirus genome diversity highlights the widespread occurrence of gene-disrupting mutations and pervasive recombination. J. Virol. 89 7673–7695. 10.1128/JVI.00578-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Suchard M. A., Lemey P., Baele G., Ayres D. L., Drummond A. J., Rambaut A. (2018). Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4:vey016. 10.1093/ve/vey016/5035211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Tan G., Muffato M., Ledergerber C., Herrero J., Goldman N., Gil M., et al. (2015). Current methods for automated filtering of multiple sequence alignments frequently Worsen Single-gene phylogenetic inference. Syst. Biol. 64 778–791. 10.1093/sysbio/syv033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Thorvaldsdóttir H., Robinson J. T., Mesirov J. P. (2013). Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 14 178–192. 10.1093/bib/bbs017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Tomer E., Cohen E. M., Drayman N., Afriat A., Weitzman M. D., Zaritsky A., et al. (2019). Coalescing replication compartments provide the opportunity for recombination between coinfecting herpesviruses. FASEB J. 33 9388–9403. 10.1096/fj.201900032R [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Töpfer A., Marschall T., Bull R. A., Luciani F., Schönhuth A., Beerenwinkel N. (2014). Viral quasispecies assembly via maximal clique enumeration. PLoS Comput. Biol. 10:e1003515. 10.1371/journal.pcbi.1003515 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Töpfer A., Zagordi O., Prabhakaran S., Roth V., Halperin E., Beerenwinkel N. (2013). Probabilistic inference of viral quasispecies subject to recombination. J. Comput. Biol. 20 113–123. 10.1089/cmb.2012.0232 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Vincent-Hubert F., Morga B., Renault T., Le Guyader F. S. (2017). Adsorption of norovirus and ostreid herpesvirus type 1 to polymer membranes for the development of passive samplers. J. Appl. Microbiol. 122 1039–1047. 10.1111/jam.13394 [DOI] [PubMed] [Google Scholar]
  84. Wang E., Paessler S., Aguilar P. V., Smith D. R., Coffey L. L., Kang W., et al. (2005). A novel, rapid assay for detection and differentiation of serotype-specific antibodies to Venezuelan equine encephalitis complex alphaviruses. Am. J. Trop. Med. Hyg. 72 805–810. 10.4269/ajtmh.2005.72.805 [DOI] [PubMed] [Google Scholar]
  85. Wickham H. (2016). ggplot2: Elegant Graphics for Data Analysis. Cham: Springer-Verlag New York. 10.1007/978-3-319-24277-4 [DOI] [Google Scholar]
  86. Wilkinson D., Weller S. (2003). The role of DNA recombination in herpes simplex virus DNA replication. IUBMB Life 55 451–458. 10.1080/15216540310001612237 [DOI] [PubMed] [Google Scholar]
  87. Wrange A.-L., Valero J., Harkestad L. S., Strand Ø, Lindegarth S., Christensen H. T., et al. (2010). Massive settlements of the Pacific oyster, Crassostrea gigas, in Scandinavia. Biol. Invasions 12 1145–1152. 10.1007/s10530-009-9535-z [DOI] [Google Scholar]
  88. Xia J., Bai C., Wang C., Song X., Huang J. (2015). Complete genome sequence of ostreid herpesvirus-1 associated with mortalities of Scapharca broughtonii broodstocks. Virol. J. 12:110. 10.1186/s12985-015-0334-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Zagordi O., Bhattacharya A., Eriksson N., Beerenwinkel N. (2011). ShoRAH: estimating the genetic diversity of a mixed sample from next-generation sequencing data. BMC Bioinformatics 12:119. 10.1186/1471-2105-12-119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Zagordi O., Geyrhofer L., Roth V., Beerenwinkel N. (2010). Deep sequencing of a genetically heterogeneous sample: local haplotype reconstruction and read error correction. J. Comput. Biol. 17 417–428. 10.1089/cmb.2009.0164 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figure 1

Correlation between the assessed viral copy numbers per μL per ng of gills from infected individuals for all the individuals from VIV batches and the sequencing depth of each sample. The labels of the sample should be interpreted as the family name (i.e., viv57) followed by the individual number (viv57-11) and the final status at the end of the experimental infection “v” stand for alive (viv57-11-v3) and “m” for dead (viv57-11-m99).

Supplementary Figure 2

Reconstructed local viral haplotypes for four samples using Haploclique (Töpfer et al., 2013). The X-axis represents a genomic position within OsHV-1 reference genome in base pair; the Y-axis is the number of haplotypes reconstituted at the focal genomic position. Large deletions are visible in the sequencing top panels and in the number of reconstructed haplotypes.

Supplementary Figure 3

Local viral haplotype reconstruction using QuasiRecomb software on genomic regions without any abrupt drop of sequencing depth (large deletions). Reconstructions have been done on CDS basis (using the reference genome annotation from Davison et al. (2005). The X-axis represents the genomic position within the reference viral genome. The Y-axis is the number of quasispecies. The black number corresponding to ORF. In red, we represented the larges deletion found in the consensus viral genome of each isolates, orange illustrate genomic regions where the alignment was not good enough to allow haplotype reconstruction. Large deletions were detected after visual inspection of bam file and sniffles detection.

Supplementary Figure 4

Illustration of the variety of polymorphism found in OsHV-1 genome mapped against the reference genome from Davison et al. (2005). Some visual patterns observed in Bam files motivate the search of viral haplotypes. Here both Bam and Vcf files are viewed using IGV (Thorvaldsdóttir et al., 2013). The blue arrowed highlights some detected polymorphisms and red arrows point out undetected potential variants. Horizontal arrows showed the large deletions, i.e., structural variations, and vertical arrows pointed out short polymorphism, i.e., short indel and SNPs. Each picture (one per sample) is composed of a bandplot showing the sequencing depth (top panel) of the considered genomic regions followed by the alignment of individual reads (bottom panel). The sequencing depth is between 0 and 4,000 X for the LI and SW6 samples, and 0–190 for NZ17 samples. (A) The observed region is from 94,000 to 100,000 bp on LI, NC17, and SW6 isolates. (B) The observed region is from 71,400 to 73,000 bp. (C) Zoom in the 77,900–79,000 bp in the NZ17 samples, it seems that there is a physical linkage between a short deletion and a SNP leading to conclusion that there are at least two haplotypes segregating in this region. (D) Zoom in the 21,700–22,800 bp region of the SW6 samples. The extremely large sequence depth of this sample allows to infer that few haplotypes segregating in this population show duplication.

Supplementary Figure 5

Clock-likeness of OsHV-1 genomes. Regression of root-to-tip genetic distance against sampling time were computed using TempEst v1.5.3 (Tan et al., 2015) using two rooting methods (best fit in TempEst and midpoint). Sufficient and robust temporal signal was observed to proceed with phylogenetic molecular clock analyses.

Data Availability Statement

The data presented in the study are deposited in the NCBI Sequence Read Archive (SRA) under repository, accession number of the BioProject ID: PRJNA721248.


Articles from Frontiers in Microbiology are provided here courtesy of Frontiers Media SA

RESOURCES