Skip to main content
Viruses logoLink to Viruses
. 2020 Jul 28;12(8):814. doi: 10.3390/v12080814

Assessment of Metagenomic Sequencing and qPCR for Detection of Influenza D Virus in Bovine Respiratory Tract Samples

Maodong Zhang 1, Yanyun Huang 1,2, Dale L Godson 2,3, Champika Fernando 3, Trevor W Alexander 4, Janet E Hill 3,*
PMCID: PMC7472010  PMID: 32731471

Abstract

High throughput sequencing is currently revolutionizing the genomics field and providing new approaches to the detection and characterization of microorganisms. The objective of this study was to assess the detection of influenza D virus (IDV) in bovine respiratory tract samples using two sequencing platforms (MiSeq and Nanopore (GridION)), and species-specific qPCR. An IDV-specific qPCR was performed on 232 samples (116 nasal swabs and 116 tracheal washes) that had been previously subject to virome sequencing using MiSeq. Nanopore sequencing was performed on 19 samples positive for IDV by either MiSeq or qPCR. Nanopore sequence data was analyzed by two bioinformatics methods: What’s In My Pot (WIMP, on the EPI2ME platform), and an in-house developed analysis pipeline. The agreement of IDV detection between qPCR and MiSeq was 82.3%, between qPCR and Nanopore was 57.9% (in-house) and 84.2% (WIMP), and between MiSeq and Nanopore was 89.5% (in-house) and 73.7% (WIMP). IDV was detected by MiSeq in 14 of 17 IDV qPCR-positive samples with Cq (cycle quantification) values below 31, despite multiplexing 50 samples for sequencing. When qPCR was regarded as the gold standard, the sensitivity and specificity of MiSeq sequence detection were 28.3% and 98.9%, respectively. We conclude that both MiSeq and Nanopore sequencing are capable of detecting IDV in clinical specimens with a range of Cq values. Sensitivity may be further improved by optimizing sequence data analysis, improving virus enrichment, or reducing the degree of multiplexing.

Keywords: Illumina MiSeq sequencing, influenza D virus, nanopore GridION sequencing, qPCR, diagnostics, bovine respiratory disease

1. Introduction

High throughput sequencing is currently revolutionizing the genomics field and providing new approaches to the detection and characterization of viruses. The utilization of metagenomic sequencing to elucidate genome sequences of viruses, particularly RNA viruses, directly from clinical samples offers several benefits. First, metagenomics enables identification and genomic characterization of unexpected viruses or even novel viruses either as primary pathogens or as co-infectants, without prior knowledge of their clinical significance [1,2,3,4]. Second, it eliminates the need for ongoing optimization of primers and/or probes for rapidly evolving or highly diverse RNA viruses [5]. Third, it facilitates routine surveillance and early detection of outbreaks of novel virus strains that are distinct from currently circulating strains. Finally, the development of portable sequencing devices creates the potential for timely identification of routine cases or outbreaks in the field [6,7]. Sequencing technology continues to evolve rapidly. With the capability of generating long reads, relatively lower set-up cost and portability, Oxford Nanopore sequencing has attracted increasing attention for its potential advantages in some circumstances over short-read sequencing technologies [8,9].

Metagenomic sequencing has been widely used for non-targeted detection of viruses and has been applied to identify several “new” viruses associated with bovine respiratory disease (BRD) [10,11,12,13]. In dairy cattle, bovine adenovirus 3 (BAdV3), bovine rhinitis A virus (BRAV), and influenza D virus (IDV) showed significant association with BRD [12]. In beef cattle, bovine rhinitis B virus (BRBV), BRAV, and IDV showed statistical association with BRD [11,13]. Among all of the viruses detected by sequencing of the bovine respiratory tract metagenome, influenza D virus (IDV) has been identified as a common virus associated with BRD in both beef and dairy cattle, suggesting the potential contribution of IDV to BRD [11,12,13]. Influenza D virus (IDV) belongs to the Orthomyxoviridae, and is a single-stranded, enveloped, segmented and negative-sense RNA virus [14]. Since its discovery in swine in USA in 2011, IDV has been reported all over the world, and cattle are thought to be the natural host reservoir [13,15,16,17,18]. In addition to cattle and swine, IDV has been reported in sheep, goats, laboratory animals (ferrets and guinea pigs), and seropositivity has been detected in humans [17,19,20,21]. Concerns about interspecies transmission and potential zoonosis have been raised due to the high seroprevalence of IDV antibodies in people exposed to cattle [19].

Given the diversity of viruses now known to be associated with BRD and the potential for discovery of novel viruses like IDV, there is increasing interest in application of metagenomic sequencing for diagnostics, since screening for many individual viruses using targeted PCR assays quickly becomes logistically complex, expensive, and time-consuming. The relative performance of metagenomic sequencing compared to PCR in terms of analytical sensitivity, however, has not been widely explored. At the time of writing, there is only one published study focused on the assessment of performance of MiSeq and Nanopore sequencing relative to gold standard qPCR, which included a limited number of samples and a narrow range of viral loads [1].

In this study, IDV was used as a representative BRD-associated virus to examine the feasibility of using metagenomic sequencing for detection of viruses in clinical bovine respiratory samples. We compared results of long-read sequencing on the Oxford Nanopore GridION platform and previously generated Illumina MiSeq data [13] to the results of an IDV-specific qPCR. The objective was to assess IDV detection using all three approaches applied to a set of bovine respiratory tract samples containing a wide range of viral loads.

2. Materials and Methods

2.1. Ethics Statement

The samples used in this study were collected as part of a previous study [10]. Collection of the samples was approved by the University of Calgary Veterinary Sciences Animal Care Committee (AC15-0109).

2.2. Sample Preparation

The overall sample preparation workflow is shown in Figure 1a. Sample collection and preparation were described previously [13,22]. Briefly, paired nasal swabs (n = 116) and tracheal washes (n = 116) were collected from cattle with BRD and healthy controls from four different feedlots in Alberta, Canada between November 2015 and January 2016 [22]. The samples were centrifuged and supernatants were treated with DNase (Life Technologies, Carlsbad, CA, USA) and RNase (Promega, Madison, WI, USA), followed by extraction of viral nucleic acids using a commercial kit (QIAamp MinElute virus spin kit, Qiagen, Venlo, Netherlands). A portion (2.5 µL) of extracted total nucleic acids was used directly as a template for IDV qPCR and another portion (7 µL) used to generate cDNA for sequencing. The first strand was reverse transcribed with primer FR26RV-N (5′-GCC GGA GCT CTG CAG ATA TCN NNN NN-3′) using Superscript III enzyme (Life Technologies, Carlsbad, CA, USA), followed by complementary strand synthesis using Sequenase polymerase (Affymetrix, Santa Clara, CA, USA) as per manufacturer’s instructions [23]. Double-stranded DNA was purified using NucleoMag beads (Macherey-Nagel Inc., Bethlehem, PA, USA) and subsequently subjected to random amplification with primer FR20RV (5′-GCC GGA GCT CTG CAG ATA TC-3′) prior to sequencing library preparation [23].

Figure 1.

Figure 1

Workflow for GridION Nanopore sequencing, MiSeq sequencing, and qPCR of bovine respiratory tract samples (BRD). (a) The workflow illustrates both the sample and library preparation. Extracted RNA was used directly for qPCR, while DNA randomly amplified from the same extracts was used for MiSeq sequencing and GridION Nanopore sequencing; (b) Bioinformatic workflow to identify viruses in BRD samples. WIMP (What’s In My Pot) analysis was used only for Nanopore data. The remaining analysis was the same for data from both MiSeq and GridION Nanopore sequencing except Minimap2 was used instead of Bowtie2 for host sequence subtraction.

2.3. qPCR Confirmation and Quantification

Quantitative real-time PCR for IDV was performed on the extracted total nucleic acids for each sample (total number of samples = 232) using previously described primers and probe specific for IDV [24]. The qPCR was carried out using AgPath-ID One-Step RT-PCR reagents (ThermoFisher Scientific, Waltham, USA) in a total volume of 12.5 µL, which included 2.5 µL template, 4 pM forward/reverse primers, 2 pM probe and 0.5 µL AmpliTaq Gold DNA polymerase in a Bio-Rad CFX 96 Real-Time Detection System (Bio-Rad, Hercules, CA). The following cycling conditions were used: reverse transcription phase at 48 °C for 30 min; initial activation phase at 95 °C for 10 min; 40 two-step cycles of denaturation at 95 °C for 15 s; and annealing and extension at 60 °C for 1 min. To obtain a positive control template, a DNA fragment corresponding to a 170 bp portion of the PB1 gene of IDV (accession number: JQ922306) was synthesized and inserted into pUC57-Amp vector (Bio Basic, Markham, ON, Canada). A 10-fold dilution series of the positive control plasmid was used to construct a standard curve to determine the efficiency of the PCR. All samples were tested in duplicate along with the standard curve and no template controls. Samples, for which both of the duplicates gave a sigmoid amplification curve with a Cq (cycle quantification) value, were considered positive.

2.4. GridION Library Preparation and Sequencing

Nineteen samples that were IDV positive by either MiSeq virome sequencing or qPCR were selected for Nanopore sequencing. Samples were selected to represent the full range of Cq values observed in the qPCR results. Three batches of six samples were multiplexed and run on individual flow cells, while one sample (sample 129) was run individually. The DNA used for GridION Nanopore library preparation was from the same randomly amplified DNA that was used for MiSeq sequencing (Figure 1a). Ligation 1D sequencing kit SQK-LSK108 was used for library preparation. End-repair and dA-tailing were performed on randomly amplified DNA for each sample using NEBNext FFPE DNA repair mix and Ultra II End-prep enzyme mix (New England Biolabs, Ipswich, MA, USA). After purification with AMPure XP beads (Beckman Coulter, Brea, CA, USA), native barcode ligation using the EXP-NBD103 barcode kit (Oxford Nanopore Technologies, Oxford, UK) and Blunt/TA Ligase master mix (New England Biolabs, Ipswich, MA, USA) was performed as per manufacturer’s instructions. The concentration of barcoded libraries was determined using a Qubit fluorometer (ThermoFisher Scientific, Waltham, MA, USA) and subsequently equimolar amounts of each barcoded library (total amount = 1 µg) were pooled, and adaptors were added using Quick T4 DNA Ligase (New England Biolabs, Ipswich, MA, USA). Each pooled library (14.5 µL) was mixed with 35 µL Priming Buffer and 25.5 µL Loading Beads and loaded dropwise through the sample port into the flow cell (FLO-MIN106) as per manufacturer’s instructions. The MinKNOW platform QC check confirmed at least 800 available pores, and the High Accuracy Basecalling (HAC) Flip-flop model was applied.

2.5. Bioinformatic Analysis

The workflow of bioinformatic analysis is illustrated in Figure 1b. Once Nanopore raw data were demultiplexed and trimmed using Porechop (https://github.com/rrwick/Porechop) and passed the quality score (Qscore) 7, high quality reads were aligned to the bovine genome (BioProject Accessions PRJNA33843, PRJNA32899) using Minimap2 [25], and unmapped reads (i.e., non-host derived reads) from each sample were de novo assembled using Trinity [26,27]. Assembled contigs were mapped to the virus Reference Sequence (RefSeq) database using BLASTn and virus-like contigs with a minimum alignment length of 100 bp and an expectation (e) value < 10−3 were further examined by BLASTx alignment to the GenBank non-redundant protein sequence database to confirm the nucleotide sequence-based identification and to remove any spurious matches [28]. The total number of viral reads was determined as previously described [13].

Quality filtered reads from the Nanopore sequencing were also uploaded to the EPI2ME (https://epi2me.nanoporetech.com/) platform for analysis with the WIMP (What’s in My Pot, version 2.3.7) application for taxonomic classification of reads.

3. Results

3.1. Comparison of IDV Detection by MiSeq and qPCR

Samples totaling 232 (116 nasal swabs and 116 tracheal washes) that had been sequenced using MiSeq previously (500 cycle V2 chemistry, libraries of 50 multiplexed samples) were tested by an IDV-specific qPCR (Figure 1a) [10]. The detection limit of the PCR was demonstrated to be 62.5 copies per reaction (data not shown). There were 53 IDV positive samples based on the qPCR and the range of Cq values was from 16.99 (6.25 × 107 copies per reaction) to 39.46 (6.88 copies per reaction) with median Cq value being 34.07 (2.91 × 102 copies per reaction). The agreement of IDV detection between qPCR and MiSeq was 82.8%. When qPCR was regarded as the gold standard, the sensitivity and specificity of MiSeq detection were 28.3% and 98.9%, respectively. IDV was detected by MiSeq in 14 of 17 IDV qPCR-positive samples with Cq values below 31 (Figure 2), when multiplexing 50 samples in the MiSeq flow cell. Only 1 of 36 IDV qPCR-positive samples with a Cq value above 31 was detected by MiSeq (Figure 2). Nineteen samples that were positive for IDV by qPCR or MiSeq, and that represented the full range of qPCR Cq values, were selected for further analysis by Nanopore sequencing.

Figure 2.

Figure 2

Cq values and MiSeq detection results for 53 samples positive by IDV qPCR. Samples positive for both qPCR and MiSeq are indicated by red dots. Most concurrent detection by MiSeq and qPCR occurred in samples with Cq values <31 (dotted line). Samples selected for Nanopore sequencing are indicated with their respective sample identification number.

3.2. Comparison of Nanopore Sequencing Results with Previously Determined MiSeq Data

A total of 82.7 million reads were obtained from MiSeq. After removing low-quality reads and host-derived reads, 33.6 million reads remained. A total of 1.8 million high-quality viral reads was generated, accounting for 2.19% of the total reads obtained from MiSeq [13]. A total of 5.9 million raw Nanopore reads was generated, 69.5% of which passed the quality filter (Qscore 7) and were classified taxonomically when the data was uploaded to the EPI2ME platform. These classified reads included a total of 0.41 million viral reads, accounting for 6.9% of filter passed, classified reads (Figure 3). The proportion of viral reads per sample was 0.1% to 18.4% (Nanopore, WIMP analysis) compared to 0.03% to 3.1% for the previously generated MiSeq data; however, with both sequencing approaches, the majority of reads obtained were identified as host-derived or other (bacteria, fungi, unclassified) (Figure 3).

Figure 3.

Figure 3

Proportions of reads corresponding to host, viruses or other taxa (bacteria, fungi, unclassified) from 19 IDV-positive samples sequenced using Nanopore sequencing (WIMP) (left bar in each pair) and MiSeq sequencing (right bar). Labels on the x-axis indicate individual specimens; tracheal samples are denoted by “T” before animal number.

In addition to WIMP classification of quality-filtered Nanopore reads, we also performed a de novo assembly of the Nanopore reads. The largest IDV contigs assembled for each sample from Nanopore data (using the in-house bioinformatics workflow, Figure 1b) were generally longer than those from MiSeq data and ranged from 626 to 2308 bp (Nanopore), and 249 to 1584 bp (MiSeq) (Table 1). The genome segment coverage of each largest contig from each sample was from 10.5% to 95.3%. The proportion of Nanopore reads mapped to IDV for each sample by in-house analysis was higher than that from MiSeq except for sample T10 and T30. The proportion of IDV reads identified in the WIMP analysis of the Nanopore data, however, was generally comparable to that from the Nanopore (in-house) workflow (Table 1). The proportion of reads identified as IDV in Nanopore (WIMP), Nanopore (in-house) and MiSeq sequencing was generally extremely low (average 2.51%, 17.03%, and 0.46%, respectively). As expected, approximately six times more reads were obtained for the individually sequenced sample 129 than for those from the multiplexed samples (Table 1). Sample 129 also had the lowest Cq value in the IDV qPCR (16.99, corresponding to 6.25 × 107 copies per reaction) and the highest proportion of IDV reads in the metagenomic sequencing results (Nanopore-WIMP 14.69%, Nanopore-in-house 27.72%, MiSeq 1.48%) (Table 1).

Table 1.

Summary of data from Nanopore, MiSeq and qPCR on detection of influenza D virus (IDV) for each individual sample.

Sample * Cq Value Copy Number
(Per Reaction)
Number (%) of IDV Reads Largest IDV Contig (bp) Total Input Reads
Nanopore (WIMP) Nanopore
(In-House)
MiSeq Nanopore
(In-House)
MiSeq Nanopore MiSeq
129 16.99 6.25 × 107 321,638 (14.69) 606,932 (27.72) 2182 (1.48) 2030 951 2,188,805 147,341
114 17.53 1.31 × 107 8 (<0.01) ND ND N/A N/A 335,559 136,961
50 20.71 1.25 × 106 1088 (5.74) 1625 (8.57) 162 (1.1) 1161 498 18,966 14,719
69 21.69 4.48 × 105 944 (7.83) 2287 (18.97) 1812 (0.69) 980 842 12,053 263,262
42 22.04 2.86 × 105 608 (0.34) 584 (0.33) 26 (0.07) 986 499 179,559 37,560
6 23.33 2.16 × 105 281 (1.44) 350 (1.8) 183 (0.08) 1034 522 19,512 228,875
10 24.09 1.31 × 105 1656 (1.17) 1650 (1.11) 87 (0.2) 1359 485 148,381 44,433
199 26.01 4.64 × 104 4211 (10.99) 10861 (28.34) 3250 (1.49) 2308 552 38,329 218,800
T50 26.76 2.93 × 104 3 (<0.01) ND ND N/A N/A 43,193 1,256,918
T129 28.20 8.00 × 103 403 (0.23) 502 (0.28) 34 (<0.01) 843 327 177,706 444,979
70 28.71 5.70 × 103 2 (0.02) ND ND N/A N/A 12,211 160,704
T10 29.15 5.50 × 103 29 (0.54) ND 455 (0.03) N/A 913 5,413 1,415,256
13 29.22 4.15 × 103 116 (0.10) 152 (0.13) 48 (0.04) 626 470 119,073 132,847
32 33.60 2.18 × 102 ND ND ND N/A N/A 275,387 32,312
170 35.62 8.25 × 101 1080 (2.03) 1991 (3.75) 8167 (0.74) 903 1,584 53,131 1,100,167
135 36.51 4.88 × 101 10 (<0.01) ND ND N/A N/A 142,737 154,220
260 39.46 6.88 9 (<0.01) ND ND N/A N/A 164,607 959,935
T30 ND - 3 (<0.01) ND 2 (<0.01) N/A 249 139,037 1,156,213
T52 ND - 51 (0.1) 52 (0.1) 497 (0.05) 1616 1,341 51,180 930,628

Cq = quantification cycle; ND = not detected; WIMP = What’s In My Pot; bp = basepair; * Samples beginning with T are tracheal, all others are nasal swabs.

3.3. Comparison of IDV Detection by qPCR, MiSeq and Nanopore Sequencing

The 19 samples selected for sequencing on the Nanopore GridION platform represented a range of IDV concentrations based on qPCR of 6.88 to 6.25 × 107 genome copies per reaction, corresponding to Cq values ranging from 39.46 to as low as 16.99 (Table 1). The agreement of IDV detection between qPCR and Nanopore was 57.9% (in-house) and 84.2% (WIMP), and that between MiSeq and Nanopore was 89.5% (in-house) and 73.7% (WIMP). IDV was detected in the Nanopore data from all but one (18/19) of the IDV-positive samples when reads were classified using the WIMP application, but this proportion dropped to 11/19 when the in-house read assembly workflow was used. For most (7/8) of the samples with disparate results, 10 or fewer IDV reads were identified in the WIMP analysis. The exception was sample T10 with 29 IDV reads.

In order to explore qualitatively whether detection of other viruses in addition to IDV was comparable between the two metagenomic sequencing platforms, we compared the complete lists of viruses detected by MiSeq or Nanopore in the 19 IDV-positive samples. The number of different viruses detected in each sample varied from none to a maximum of four. The proportion of samples with perfect agreement between MiSeq and Nanopore (in-house) was 52.6%, by MiSeq and Nanopore (WIMP) it was 47.4%; and by Nanopore (in-house) and Nanopore (WIMP) it was 36.8% (Supplementary Table S1).

4. Discussion

Metagenomic sequencing is transforming routine detection of viruses from traditional cell culture, antibody–antigen techniques and qPCR to detection of viruses in a target-independent manner. Sequencing approaches have now been widely applied for detection of known and novel agents in various types of clinical specimens in both human and veterinary medicine [3,29,30]. The potential usefulness of viral metagenomics for virus surveillance and diagnostics is still in debate due to its performance relative to the gold-standard method of real-time qPCR routinely employed in diagnostic laboratories [6]. A recent assessment of the performance of Nanopore, MiSeq and qPCR for detection of chikungunya and dengue viruses in serum or plasma samples with relatively high viral loads (Cq values from 14 to 32) demonstrated 100% agreement between these methods [1]. In this investigation, however, a maximum of 16 samples was multiplexed and sequenced using MiSeq, and each sample was sequenced individually on Nanopore [1]. This low degree of multiplexing translates to high analytical sensitivity, but correspondingly makes these technologies relatively more expensive per sample and decreases the potential application for routine diagnostics. In our current study, we performed further exploration to assess the performance of metagenomic sequencing approaches with a higher degree of multiplexing of clinical samples in both MiSeq and Nanopore sequencing. IDV presented an excellent target for this comparison given its association with BRD in beef and dairy cattle, and the availability of specimens with a wider range of viral loads than has been included in previous investigations [19,20,31].

Different viral extraction kits have been demonstrated to have variable extraction efficiencies for different viruses in respiratory clinical samples [32]. The QIAamp MinElute Virus Spin Kit (MVSK) has been found to be generally applicable for isolating nucleic acid for qPCR or metagenomic virus identification of adenovirus, influenza virus A, human parainfluenza virus 3, human coronavirus OC43, and human metapneumovirus in respiratory clinical samples [31]. In our current study, the original nucleic acids extracted with the MVSK were used for both qPCR and metagenomic sequencing, eliminating the influence of different extraction methods and kits on our results (Figure 1a).

The IDV-specific qPCR assay detected its target in 22.8% (53/232) of specimens. While the majority of IDV positive samples with Cq value below 31 were detected by MiSeq, only 1/36 samples with a Cq above this threshold were positive by sequencing (Figure 2). These results demonstrate that for samples where the viral load exceeds 6.25 × 102 per reaction even a relatively modest MiSeq sequencing effort (50 samples multiplexed in a single flow cell) is sufficient to detect the virus. The agreement between qPCR and Nanopore of 57.9% (in-house) and 84.2% (WIMP) demonstrated that relatively modest Nanopore sequencing effort (six samples multiplexed) is also sufficient to detect the virus. The results from Nanopore sequencing (in-house), however, showed no consistent relationship between viral load and detection by sequencing; furthermore, no consistent relationship between viral load and proportion of viral reads was observed in either MiSeq or Nanopore sequencing (Table 1). For example, the two IDV positive samples 199 and 10 had Cq values of 26.01 and 24.09, respectively; however, sample 199 had a higher proportion of IDV sequence reads in Nanopore and MiSeq than sample 10 (Table 1).

There are several possible explanations for differences in both the proportion of IDV reads and total viral reads detected in each sample by MiSeq and Nanopore. First, variation in the amounts of DNA used for sequencing library preparation for the two sequencing platforms may play an important role. Second, the abundance of virus relative to host or bacterial genetic material is a critical determinant of the detection threshold of metagenomic sequencing. A greater proportional abundance of a virus increases the chance that it will be detected by sequencing and improves the genome coverage obtained. Therefore, virus enrichment is commonly applied to clinical samples and enrichment methods such as those used in this study (a combination of centrifugation and nuclease-treatment) should lead to removal of bacteria and host cells, thus improving virus detection [32]. Virus propagation in cell culture is a less appealing method for virus enrichment since it is time-consuming, requires specific expertise and creates the potential for introduction of mutations [33]. Reduction of the degree of multiplexing of samples is an alternative way to improve virus detection, but there is a corresponding increase in cost per sample and a corresponding reduction in throughput that are undesirable in research or clinical diagnostic settings. Reduction of the degree of multiplexing of samples also reduces the chances of cross-barcode contamination because barcode reagents are susceptible to cross-contamination [34].

Bioinformatic analysis in metagenomic sequencing remains challenging but is crucial for accurate identification of diagnostic targets. We used the comparable pipeline to analyze both data from MiSeq and Nanopore sequencing (in-house), which showed the feasibility of metagenomic viral whole-genome-sequencing using both Nanopore and MiSeq technology with the assembled contigs covering from 10 to 95% of each IDV genome segment. Although de novo assembly was performed on Nanopore sequencing data, for the majority of the samples, the length of the largest contig was that of one single read. Skipping the assembly step in bioinformatic analysis of Nanopore data could provide an advantage for timely identification of potential pathogens. The long reads of Nanopore sequencing are thought to provide good confidence for species level identification, but the low coverage combined with the error rates of this platform preclude its use for strain-level resolution [4].

The detection rates of IDV, the number or proportion of IDV reads (Table 1), and other viruses detected (Supplementary Table S1) from Nanopore (WIMP) and Nanopore (in-house) were different, which demonstrates that bioinformatic analysis affects the results of virus detection. Taxonomic classification in WIMP is based on Centrifuge [35], which compares query sequences to a (undescribed) reference database with a high speed and space-optimized k-mer-based algorithm. For Nanopore (in-house) and MiSeq analysis, BLAST [36] was used to compare assembled contig sequences to the NCBI RefSeq virus database, which is a more computationally intense process that produces more detailed results. The identification of a match using Centrifuge is based on probabilities of particular k-mer combinations occurring in the query and reference and not a consideration of the entire query sequence, thus increasing the possibility of false positives [35]. In contrast, Trinity assembly and then BLAST search against a reference database could lead to false negatives if the particular target sequence is very rare [37]. If there are very few reads derived from some component of the metagenome, these reads may not be included in the assembly since there is insufficient “evidence” to support building contigs from them [36,37]. The current lack of definition of the reference database or the ability to use custom databases with WIMP make this approach inappropriate for clinical diagnostic applications due to the difficulty of validating such approaches. Our results provide an illustration of the profound effects that post-sequencing analysis can have on results, and the trade-offs associated with each choice. Selection of the most appropriate analysis pipeline must consider the sequencing platform, as well as tolerance for false negatives and false positives, logistical considerations, and the required taxonomic resolution.

Analytical sensitivity is currently one of the main limitations of metagenomics. In this study, IDV was detected by MiSeq sequencing in specimens with qPCR Cq value as high as 35.62 when 50 samples were multiplexed in comparison to a maximum Cq value of 39.46 using Nanopore with multiplexing of six samples. For the IDV positive samples with low virus loads (e.g., sample 32), targeted qPCR may be preferable given its higher analytical sensitivity. Interestingly, we observed two samples that were IDV positive by both Nanopore (WIMP) and MiSeq but negative by qPCR (Samples T30 and T52, Table 1). These cases illustrate a potential advantage of metagenomic sequencing compared to qPCR since a likely explanation for this observation is that these specimens contained strain variants of IDV that were not detected by the qPCR assay. We were unable to determine if this was the case since the IDV sequence reads did not cover the region of the genome targeted by the species-specific qPCR assay. Targeted PCR assays for rapidly evolving RNA viruses require ongoing performance monitoring, and optimization of primers and probes [5]. No single method is suitable for application for all pathogens or specimen types, and each one has advantages in different circumstances.

5. Conclusions

Taken together our results demonstrate the potential of metagenomic sequencing on the Illumina MiSeq and Oxford Nanopore platforms for detection of viruses, including IDV, in clinical samples from naturally infected animals with a wide range of viral loads. While application of these approaches to screening animal populations or infectious disease research is feasible, their deployment for routine virology diagnostics in clinical settings will require additional research, laboratory and bioinformatic method development, and performance evaluation. Our exploration of two sequencing platforms, different degrees of multiplexing, including samples containing a wide range of virus loads, and comparing to a current diagnostic gold standard is an important step toward achieving these goals. Selection of appropriate methods will continue to require careful consideration of the numerous trade-offs that confront practitioners at each step of the investigation.

Acknowledgments

We thank Anju Tumber (Prairie Diagnostic Services, Inc.) for reagent purchasing and other logistics. We are grateful for Kara Toews, Anatoliy Trokhymchuk, Kazal Krishna Ghosh (PDS) for technical support.

Supplementary Materials

The following are available online at https://www.mdpi.com/1999-4915/12/8/814/s1, Table S1: Summary of viruses detected by Nanopore and MiSeq sequencing.

Author Contributions

Conceptualization, M.Z.; J.E.H., and Y.H.; methodology, M.Z.; software, M.Z. and J.E.H.; validation, M.Z.; Y.H., and J.E.H.; formal analysis, M.Z.; investigation, Y.H. and T.W.A.; resources, T.W.A.; data curation, M.Z.; D.L.G., and J.E.H.; writing—original draft preparation, M.Z.; writing—review and editing, J.E.H.; C.F.; Y.H., and D.L.G.; visualization, M.Z. and J.E.H.; supervision, Y.H. and J.E.H.; project administration, Y.H.; funding acquisition, Y.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by Saskatchewan Cattlemen’s Association (Grant/Award Number: SBIF2015-109), Agriculture Development Fund (Grant/Award Number: ADF20160092), and Beef Cattle Research Council (Grant/Award Number: AMR.10.17). Maodong Zhang is supported by China Scholarship Council (CSC).

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Kafetzopoulou L.E., Efthymiadis K., Lewandowski K., Crook A., Carter D., Osborne J., Aarons E., Hewson R., Hiscox J.A., Carroll M.W., et al. Assessment of metagenomic Nanopore and Illumina sequencing for recovering whole genome sequences of chikungunya and dengue viruses directly from clinical samples. Euro Surveill. 2018;23:100228. doi: 10.2807/1560-7917.ES.2018.23.50.1800228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Wamaitha M.J., Nigam D., Maina S., Stomeo F., Wangai A., Njuguna J.N., Holton T.A., Wanjala B.W., Wamalwa M., Lucas T., et al. Metagenomic analysis of viruses associated with maize lethal necrosis in Kenya. Virol. J. 2018;15:90. doi: 10.1186/s12985-018-0999-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Parker J., Chen J. Application of next generation sequencing for the detection of human viral pathogens in clinical specimens. J. Clin. Virol. 2017;86:20–26. doi: 10.1016/j.jcv.2016.11.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Tyler A.D., Mataseje L., Urfano C.J., Schmidt L., Antonation K.S., Mulvey M.R., Corbett C.R. Evaluation of Oxford Nanopore’s MinION Sequencing Device for Microbial Whole Genome Sequencing Applications. Sci. Rep. 2018;8:10931. doi: 10.1038/s41598-018-29334-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Andersen K.G., Shapiro B.J., Matranga C.B., Sealfon R., Lin A.E., Moses L.M., Folarin O.A., Goba A., Odia I., Ehiane P.E., et al. Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus. Cell. 2015;162:738–750. doi: 10.1016/j.cell.2015.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Filloux D., Fernandez E., Loire E., Claude L., Galzi S., Candresse T., Winter S., Jeeva M.L., Makeshkumar T., Martin D.P., et al. Nanopore-based detection and characterization of yam viruses. Sci. Rep. 2018;8:17879. doi: 10.1038/s41598-018-36042-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Quick J., Loman N.J., Duraffour S., Simpson J.T., Severi E., Cowley L., Bore J.A., Koundouno R., Dudas G., Mikhail A., et al. Real-time, portable genome sequencing for Ebola surveillance. Nature. 2016;530:228–232. doi: 10.1038/nature16996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Schadt E.E., Turner S., Kasarskis A. A window into third-generation sequencing. Hum. Mol. Genet. 2010;19:R227–R240. doi: 10.1093/hmg/ddq416. [DOI] [PubMed] [Google Scholar]
  • 9.Greninger A.L., Naccache S.N., Federman S., Yu G., Mbala P., Bres V., Stryke D., Bouquet J., Somasekar S., Linnen J.M., et al. Rapid metagenomic identification of viral pathogens in clinical samples by real-time nanopore sequencing analysis. Genome Med. 2015;7:99. doi: 10.1186/s13073-015-0220-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Buermans H.P., den Dunnen J.T. Next generation sequencing technology: Advances and applications. Biochim. Biophys. Acta. 2014;1842:1932–1941. doi: 10.1016/j.bbadis.2014.06.015. [DOI] [PubMed] [Google Scholar]
  • 11.Mitra N., Cernicchiaro N., Torres S., Li F., Hause B.M. Metagenomic characterization of the virome associated with bovine respiratory disease in feedlot cattle identified novel viruses and suggests an etiologic role for influenza D virus. J. Gen. Virol. 2016;97:1771–1784. doi: 10.1099/jgv.0.000492. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ng T.F., Kondov N.O., Deng X., Van Eenennaam A., Neibergs H.L., Delwart E. A metagenomics and case-control study to identify viruses associated with bovine respiratory disease. J. Virol. 2015;89:5340–5349. doi: 10.1128/JVI.00064-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zhang M., Hill J.E., Fernando C., Alexander T.W., Timsit E., van der Meer F., Huang Y. Respiratory viruses identified in western Canadian beef cattle by metagenomic sequencing and their association with bovine respiratory disease. Transbound. Emerg. Dis. 2019;66:1379–1386. doi: 10.1111/tbed.13172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ferguson L., Eckard L., Epperson W.B., Long L.P., Smith D., Huston C., Genova S., Webby R., Wan X.F. Influenza D virus infection in Mississippi beef cattle. Virology. 2015;486:28–34. doi: 10.1016/j.virol.2015.08.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Dane H., Duffy C., Guelbenzu M., Hause B., Fee S., Forster F., McMenamy M.J., Lemon K. Detection of influenza D virus in bovine respiratory disease samples, UK. Transbound. Emerg. Dis. 2019;66:2184–2187. doi: 10.1111/tbed.13273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hause B.M., Collin E.A., Liu R., Huang B., Sheng Z., Lu W., Wang D., Nelson E.A., Li F. Characterization of a novel influenza virus in cattle and Swine: Proposal for a new genus in the Orthomyxoviridae family. MBio. 2014;5:e00031-14. doi: 10.1128/mBio.00031-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hause B.M., Ducatez M., Collin E.A., Ran Z., Liu R., Sheng Z., Armien A., Kaplan B., Chakravarty S., Hoppe A.D., et al. Isolation of a novel swine influenza virus from Oklahoma in 2011 which is distantly related to human influenza C viruses. PLoS Pathog. 2013;9:e1003176. doi: 10.1371/journal.ppat.1003176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Su S., Fu X., Li G., Kerlin F., Veit M. Novel Influenza D virus: Epidemiology, pathology, evolution and biological characteristics. Virulence. 2017;8:1580–1591. doi: 10.1080/21505594.2017.1365216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.White S.K., Ma W., McDaniel C.J., Gray G.C., Lednicky J.A. Serologic evidence of exposure to influenza D virus among persons with occupational contact with cattle. J. Clin. Virol. 2016;81:31–33. doi: 10.1016/j.jcv.2016.05.017. [DOI] [PubMed] [Google Scholar]
  • 20.Zhai S.L., Zhang H., Chen S.N., Zhou X., Lin T., Liu R., Lv D.H., Wen X.H., Wei W.K., Wang D., et al. Influenza D Virus in Animal Species in Guangdong Province, Southern China. Emerg. Infect. Dis. 2017;23:1392–1396. doi: 10.3201/eid2308.170059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Sreenivasan C., Thomas M., Sheng Z., Hause B.M., Collin E.A., Knudsen D.E., Pillatzki A., Nelson E., Wang D., Kaushik R.S., et al. Replication and Transmission of the Novel Bovine Influenza D Virus in a Guinea Pig Model. J. Virol. 2015;89:11990–12001. doi: 10.1128/JVI.01630-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Timsit E., Workentine M., van der Meer F., Alexander T. Distinct bacterial metacommunities inhabit the upper and lower respiratory tracts of healthy feedlot cattle and those diagnosed with bronchopneumonia. Vet. Microbiol. 2018;221:105–113. doi: 10.1016/j.vetmic.2018.06.007. [DOI] [PubMed] [Google Scholar]
  • 23.Allander T., Tammi M.T., Eriksson M., Bjerkner A., Tiveljung-Lindell A., Andersson B. Cloning of a human parvovirus by molecular screening of respiratory tract samples. Proc. Natl. Acad. Sci. USA. 2005;102:12891–12896. doi: 10.1073/pnas.0504666102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Faccini S., De Mattia A., Chiapponi C., Barbieri I., Boniotti M.B., Rosignoli C., Franzini G., Moreno A., Foni E., Nigrelli A.D. Development and evaluation of a new Real-Time RT-PCR assay for detection of proposed influenza D virus. J. Virol. Methods. 2017;243:31–34. doi: 10.1016/j.jviromet.2017.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Grabherr M.G., Haas B.J., Yassour M., Levin J.Z., Thompson D.A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q., et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Li H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Brister J.R., Ako-Adjei D., Bao Y., Blinkova O. NCBI viral genomes resource. Nucleic Acids Res. 2015;43:D571–D577. doi: 10.1093/nar/gku1207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Li L., Shan T., Wang C., Côté C., Kolman J., Onions D., Gulland F.M.D., Delwart E. The Fecal Viral Flora of California Sea Lions. J. Virol. 2011;85:9909–9917. doi: 10.1128/JVI.05026-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Shan T., Li L., Simmonds P., Wang C., Moeser A., Delwart E. The Fecal Virome of Pigs on a High-Density Farm. J. Virol. 2011;85:11697–11708. doi: 10.1128/JVI.05217-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Zhang D., Lou X., Yan H., Pan J., Mao H., Tang H., Shu Y., Zhao Y., Liu L., Li J., et al. Metagenomic analysis of viral nucleic acid extraction methods in respiratory clinical samples. BMC Genom. 2018;19:773. doi: 10.1186/s12864-018-5152-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Hall R.J., Wang J., Todd A.K., Bissielo A.B., Yen S., Strydom H., Moore N.E., Ren X., Huang Q.S., Carter P.E., et al. Evaluation of rapid and simple techniques for the enrichment of viruses prior to metagenomic virus discovery. J. Virol. Methods. 2014;195:194–204. doi: 10.1016/j.jviromet.2013.08.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Quick J., Grubaugh N.D., Pullan S.T., Claro I.M., Smith A.D., Gangavarapu K., Oliveira G., Robles-Sikisaka R., Rogers T.F., Beutler N.A., et al. Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nat. Protoc. 2017;12:1261. doi: 10.1038/nprot.2017.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Esling P., Lejzerowicz F., Pawlowski J. Accurate multiplexing and filtering for high-throughput amplicon-sequencing. Nucleic Acids Res. 2015;43:2513–2524. doi: 10.1093/nar/gkv107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kim D., Song L., Breitwieser F.P., Salzberg S.L. Centrifuge: Rapid and sensitive classification of metagenomic sequences. Genome Res. 2016;26:1721–1729. doi: 10.1101/gr.210641.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 37.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Viruses are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES