Author manuscript; available in PMC: 2023 Nov 1.
Published in final edited form as: Food Environ Virol. 2023 Nov 1:10.1007/s12560-023-09569-w. doi: 10.1007/s12560-023-09569-w

Metabarcoding of Hepatitis E virus genotype 3 and Norovirus GII from wastewater samples in England using nanopore sequencing

Samantha Treagus 1,2, James Lowther 1, Ben Longdon 2, William Gaze 3, Craig Baker-Austin 1, David Ryder 1, Frederico M Batista 1
PMCID: PMC7615314  EMSID: EMS190417  PMID: 37910379

Abstract

Norovirus is one of the largest causes of gastroenteritis worldwide, and Hepatitis E virus (HEV) is an emerging pathogen that has become the most dominant cause of acute viral hepatitis in recent years. The presence of norovirus and HEV has been reported within wastewater in many countries previously. Here we used amplicon deep sequencing (metabarcoding) to identify norovirus and HEV strains in wastewater samples from England collected in 2019 and 2020. For HEV, we sequenced a fragment of the RNA-dependent RNA polymerase (RdRp) gene targeting genotype 3 strains. For norovirus, we sequenced the 5’ portion of the major capsid protein gene (VP1) of genogroup II strains. Sequencing of the wastewater samples revealed eight different genotypes of norovirus GII (GII.2, GII.3, GII.4, GII.6, GII.7, GII.9, GII.13 and GII.17). Genotypes GII.3 and GII.4 were the most commonly found. The HEV metabarcoding assay was able to identify HEV genotype 3 strains in some samples with a very low viral concentration determined by RT-qPCR. Analysis showed that most HEV strains found in influent wastewater were typed as G3c and G3e and were likely to have originated from humans or swine. However, the small size of the HEV nested PCR amplicon could cause issues with typing, and so this method is more appropriate for samples with high CTs where methods targeting longer genomic regions are unlikely to be successful. This is the first report of HEV RNA in wastewater in England. This study demonstrates the utility of wastewater sequencing and the need for wider surveillance of norovirus and HEV within host species and environments.

Keywords: Hepatitis E virus, Norovirus, nanopore sequencing, metabarcoding, environmental transmission, wastewater-based epidemiology

Introduction

Norovirus and hepatitis E virus (HEV) are enteric viruses that are often overlooked in clinical research due to their relatively non-severe nature, the short symptomatic period for norovirus, and the relatively low prevalence of HEV in most countries (Htet, Althaf and Karim, 2018; Robilotti, Deresinski and Pinsky, 2015; Lhomme et al., 2020; Public Health England, 2019). However, norovirus causes a significant annual healthcare burden of 685 million cases and $60 billion globally (Centers for Disease Control and Prevention, 2021), and HEV genotypes 1 and 2 alone were estimated to cause 20 million cases annually in 2002 (World Health Organization, 2021). The most recent data on cases available is from a study of 30 European countries, which showed that confirmed HEV cases increased from 514 to 5,617 cases between 2005 and 2015 (Aspinall et al., 2017). Both viruses can also have severe and fatal outcomes in immunocompromised people. Genome sequencing has become an invaluable tool for understanding the epidemiology and evolution of viruses. It can be used to track the evolution of new variants and to understand the phylogeography and spread of a virus (Medema et al., 2020; Martin et al., 2020; Agrawal et al., 2021). Due to the widespread adoption and success of sequencing viruses such as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) from wastewater, researchers are looking to monitor other viruses to improve knowledge of their abundance within communities.

Norovirus and HEV spread through faeco-oral and foodborne routes of transmission. Outbreaks of norovirus are usually spread through person-to-person transmission but can also be spread through consumption of contaminated foods or through fomites (Meghnath et al., 2019; Prato et al., 2004). Genome sequencing has been used to identify the genogroups, genotypes and strains of norovirus that cause outbreaks and to determine the source of infection. A large study by Verhoef et al. (2011) used sequencing data to identify the global sources of, and relationships between, norovirus outbreaks and conservatively estimated that 7% of norovirus outbreaks had an international geographical distribution. These findings contrasted with previous estimates of 0.4% obtained through standard epidemiological investigations (Robilotti et al., 2015; Lu et al., 2016; Sakon et al., 2018).

HEV, on the other hand, usually spreads through the consumption of undercooked meat, particularly pork (VanderWaal and Deen, 2018; Guillois et al., 2015). A study on the prevalence of HEV in UK pigs identified a seroprevalence of 92.8%, and HEV RNA presence in 21% of the animals analysed (Grierson et al., 2015), suggesting that consumption of pork is a significant risk factor in the UK. However, many possible host animal species for HEV have been identified (Kenney, 2019). HEV has been detected in vegetarians in the Netherlands (Slot et al., 2017), suggesting there may be routes of infection other than consumption of animal products. As such, tracing sources of outbreaks using sequencing may provide insights into other transmission routes of HEV, in particular from environmental sources.

Viruses in food and environmental samples are often present at very low abundance, which can make detection and genotyping by nucleotide sequencing difficult. Consequently, PCR-based methods are often used to amplify a specific genomic region for sequencing. Most studies on food and environmental samples to date have used PCR-based methods followed by Sanger sequencing (Rivadulla et al., 2019; Pallerla et al., 2021). Compared with high throughput sequencing (HTS) techniques, Sanger sequencing is less likely to pick up rare variants within a sample and may lead to unresolved bases at divergent loci where samples contain multiple divergent strains at similar concentrations (Mancini et al., 2019; Gao et al., 2016). HTS is becoming more commonly used to identify norovirus in different food matrices, for example in strawberries (Bartsch et al., 2018) and shellfish (Ollivier et al., 2022). However, to the best of our knowledge, HTS has not yet been applied to identify norovirus and HEV in wastewater samples in the UK.

The aim of this study was to develop an HTS metabarcoding approach using Oxford Nanopore Technologies (ONT) to identify and characterise norovirus and HEV present in wastewater samples at low levels. ONT was chosen since the cost of the sequencing devices is low, and it is a scalable sequencing platform, allowing the analysis of a variable number of samples on the same flow cell.

Materials and Methods

Wastewater samples

We collected 140 paired grab samples, comprising 70 influent and 70 effluent samples, from seven wastewater treatment plants (WWTPs) in Southern England between October 2019 and February 2020. Influent samples consisted of one litre of coarsely screened influent, and effluent samples were one litre of final effluent. Ten pairs of influent and effluent samples were collected from each WWTP over the collection period. Samples were collected weekly or fortnightly during this period. Table 1 shows the population served and treatment level for each WWTP.

Table 1. Characteristics of the WWTPs involved in the studies.

WWTP number | Population equivalent | Dry weather flow (m³/day) | Treatment level (type)
1 | 33,822 | 9,450 | Secondary (trickling filter)
2 | 166,837 | 40,486 | Tertiary (UV)
3 | 178,531 | 55,000 | Tertiary (UV)
4 | 166,931 | 47,700 | Tertiary (sand filter and UV)
5 | 141,213 | 40,007 | Secondary (activated sludge)
6 | 22,352 | 4,910 | Tertiary (membrane filtration)
7 | 93,303 | 32,141 | Secondary (aeration)

Wastewater virus extraction

Samples were concentrated using a modified ultracentrifugation method adapted from Puig et al. (1994). Briefly, 40 ml of wastewater sample was mixed, then split equally into two ultracentrifuge bottles. Mengo virus (10 μl) was then added to both bottles to act as a recovery control. Sample bottles were spun at 152,000 x g at 4°C for one hour and the supernatant was discarded. The pellet from one of the sample bottles was resuspended in 2 ml of 0.25 M glycine buffer (pH 9.5; glycine powder from Sigma-Aldrich, St. Louis, MO, USA). This suspension was then added to the pellet from the second sample bottle and the second pellet resuspended with the first. The sample was then put on ice for 20 min before adding 2 ml of cold (4°C) phosphate buffered saline (PBS tablets, VWR, Radnor, PA, USA). Samples were centrifuged at 6,090 x g for 20 min at 4°C. The supernatant was transferred to clean bottles, and 18 ml of cold PBS added before a final spin at 152,000 x g at 4°C for one hour. The pellet from this final spin was resuspended in 1 ml of cold PBS before it was used in subsequent RNA extraction.

RNA extraction

Five hundred microlitres of concentrated wastewater pellet was added to 2 ml of lysis buffer (Biomerieux, Durham, NC, USA) and incubated at room temperature for 10 min. Total RNA was then extracted using a viral RNA extraction method developed for food samples, described in ISO 15216-1:2017 and Lowther et al. (2012). Following the lysis buffer incubation, 50 μl of magnetic silica beads from the NucliSENS Magnetic Extraction Reagents kit (Biomerieux) were added, and the mixture was left to incubate for ten minutes at room temperature. The sample was then centrifuged at 1500 x g for 2 minutes and the supernatant discarded. The silica beads remaining were washed in buffers whilst held in a NucliSENS Minimag (Biomerieux). Wash buffer 1 was applied and removed using an aspirator after 30 seconds of wash spinning within the Minimag; and this step was repeated. Wash buffer 2 was then applied in the same way and repeated. Wash buffer 3 was applied for 15 seconds of wash spinning and removed, and finally 100 μl of elution buffer was added to resuspend the magnetic beads. The elution mix was incubated on a thermoshaker at 60°C and 1400 rpm for five minutes, before applying to a magnet to separate the buffer (containing extracted RNA) from the silica beads. The 100 μl of RNA extract was then transferred to a clean tube for use in subsequent qRT-PCR. A reference extraction was performed alongside the samples, consisting of 10 μl of mengo virus (same batch as used in the ultracentrifugation) in 500 μl of molecular grade water; and a negative extraction control of 500 μl of molecular grade water was also included during each extraction. The RNA extract was frozen at -80°C prior to testing.

Detection and Quantification

RNA extracts were tested for the presence of HEV and norovirus GII using RT-qPCR. The detection of norovirus was carried out as described in the international standard for quantification of viruses in foods, ISO 15216-1:2017 (International Organization for Standardization, 2017). Details of all assays, including the HEV assay, are shown in Table 2. The RT-qPCR assays for HEV and norovirus were prepared using the RNA UltraSense™ One-Step Quantitative RT-PCR System (SuperScript III, Invitrogen, ThermoFisher Scientific Waltham, MA, USA), and thermal cycling and monitoring of amplicon formation were carried out using a QuantStudio 3 Real-Time PCR System. For the HEV and mengo virus RT-qPCR assays, the final primer concentrations were 0.625 μM for the forward primer and 1.125 μM for the reverse primer. For the norovirus GII assay, the final primer concentrations were 1 μM for the forward primer and 1.8 μM for the reverse primer.

Table 2. Primer and probe sequences and details for the qRT-PCR methods.

Target | Primer and probe sequences | Lower limit of detection | Reference
HEV | FWD: 5’-GGTGGTTTCTGGGGTGAC-3’; REV: 5’-AGGGGTTGGTTGGATGAA-3’; PROBE: 5’-TGATTCTCAGCCCTTCGC-3’ (5’-FAM, 3’-MGB-NFQ) | 4 genome copies | Jothikumar et al. (2006); Garson et al. (2012)
Norovirus GII | FWD: 5’-ATGTTCAGRTGGATGAGRTTCTCWGA-3’; REV: 5’-TCGACGCCATCTTCATTCACA-3’; PROBE: 5’-AGCACGTGGGAGGGCGATCG-3’ (5’-FAM, 3’-TAMRA) | 1-10 genome copies (depending on strain) | Loisy et al. (2005); Kageyama et al. (2003)
Mengo virus | FWD: 5’-GCGGGTCCTGCCGAAAGT-3’; REV: 5’-GAAGTAACATATAGACAGACGCACAC-3’; PROBE: 5’-ATCACATTACTGGCCGAAGC-3’ (5’-FAM, 3’-MGB-NFQ) | Unknown | Pintó, Costafreda and Bosch (2009)

Cycling conditions for all PCRs were 55°C for 60 min for the reverse transcription, followed by 95°C for 5 min, then 45 cycles of 95°C for 15 sec, 60°C for 1 min and 65°C for 1 min. Synthetic DNA controls for production of standard curves for quantification were prepared following ISO 15216-1:2017 guidelines for norovirus GII, and a similar approach was used for HEV controls. Standard curves conformed to an r² value ≥0.99 and a slope of between -3.1 and -3.6. Testing of norovirus GII and HEV followed the approach of Lowther et al. (2012) and ISO 15216-1:2017 for controls (International Organization for Standardization, 2017). Virus concentrations in wastewater (copies/ml) were calculated from copies/μl in the sample RNA using conversion factors based on the volumes tested and concentration factors applied.
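The acceptance window for the standard-curve slope and the volume-based conversion can be illustrated with a minimal sketch. It uses the standard relationship between slope and amplification efficiency, E = 10^(-1/slope) - 1, and derives a conversion factor from the volumes stated in the methods (40 ml of wastewater concentrated to 1 ml, 500 μl of concentrate extracted, 100 μl of eluate). The function names are hypothetical, and the conversion assumes complete recovery, which the authors' actual conversion factors may not.

```python
import math


def amplification_efficiency(slope: float) -> float:
    """Efficiency implied by a standard-curve slope (Ct against log10 copies)."""
    return 10 ** (-1.0 / slope) - 1.0


def copies_from_ct(ct: float, slope: float, intercept: float) -> float:
    """Invert the standard curve Ct = slope * log10(copies) + intercept."""
    return 10 ** ((ct - intercept) / slope)


def copies_per_ml_wastewater(copies_per_ul_rna: float,
                             eluate_ul: float = 100.0,
                             concentrate_extracted_ml: float = 0.5,
                             concentrate_total_ml: float = 1.0,
                             wastewater_ml: float = 40.0) -> float:
    """Convert copies/ul of RNA extract to copies/ml of wastewater.

    Assumption: 100% recovery through concentration and extraction.
    """
    copies_in_eluate = copies_per_ul_rna * eluate_ul
    # Volume of original wastewater represented by the extracted concentrate.
    wastewater_equivalent_ml = (
        wastewater_ml * concentrate_extracted_ml / concentrate_total_ml
    )
    return copies_in_eluate / wastewater_equivalent_ml


for slope in (-3.1, -3.6):
    print(f"slope {slope}: efficiency {amplification_efficiency(slope):.1%}")
print(copies_per_ml_wastewater(1.0))
```

Under these assumptions the slope window of -3.1 to -3.6 corresponds to efficiencies of roughly 110% to 90%, and one copy/μl of RNA extract corresponds to five copies/ml of wastewater.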

Norovirus GII and HEV samples selected for sequencing

Forty-two (21 influent and 21 effluent) wastewater samples with norovirus GII CT values between 27 and 39 were selected for norovirus sequencing analysis. These were composed of three influent and three effluent samples with the lowest CT values from each of the seven WWTPs. Separately, 42 wastewater samples (31 influent and 11 effluent samples) that yielded CT values for HEV between 34 and 44 were selected for HEV sequencing analysis. This constituted all of the samples which tested positive for HEV by RT-qPCR out of the 140 samples analysed. The sample sets for norovirus and HEV sequencing were therefore different although some samples were selected for both sets.

HEV semi-nested PCR primer design

Due to high HEV nucleotide diversity, primers were designed targeting genotype 3 (G3) only as G3 causes the majority of cases in the UK (Oeser et al., 2019). The G3 reference sequences used in this study were downloaded from the National Center for Biotechnology Information (NCBI) and aligned using Clustal Omega (Madeira et al., 2022). The alignment of the G3 genomes can be found in Online Resource 1 (DOI: 10.6084/m9.figshare.21900873). Possible primer sequences were identified manually by identifying conserved regions by eye, before testing sequences from these regions for self-complementarity and melting temperature suitability using the Multiple Primer Analyzer from ThermoFisher Scientific (ThermoFisher Scientific, NA) and Oligo Calc from Northwestern University (Kibbe, 2007). Primers were selected if they had a melting temperature of 56-65°C, length of 17-23 bases, GC content between 30-60%, and a maximum of 3 degenerate bases. Once primer candidates had been found, primer pairs were identified by their melting temperature similarity, the ability to be used in a semi-nested PCR assay, the size of the amplicons and lack of primer complementarity. The primer set selected for the semi-nested PCR targeted the RNA-dependent RNA polymerase (RdRp). This gene was chosen since it was suitable for use according to the criteria described above and has been used previously for HEV typing (Lin et al., 2015). The length of the first-round amplicon was 258 bp and the length of target sequence in the semi-nested amplicon (i.e. not including additional primer adapter sequences added by tagging onto the primer sequences) was 254 bp.
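The sequence-based selection criteria above (length, GC content and number of degenerate bases) can be expressed as a short check. This is an illustrative sketch rather than the authors' procedure, which was manual: the helper names are hypothetical, GC content is reported as a range because degenerate bases may or may not resolve to G or C, and melting temperature is not evaluated here because it depends on the calculator and conditions used.

```python
# IUPAC nucleotide codes and the bases each one can represent.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degenerate_count(primer: str) -> int:
    """Number of positions that can represent more than one base."""
    return sum(1 for base in primer if len(IUPAC[base]) > 1)


def gc_bounds(primer: str) -> tuple:
    """Minimum and maximum GC fraction over all resolutions of the primer."""
    always_gc = sum(1 for b in primer if set(IUPAC[b]) <= {"G", "C"})
    maybe_gc = sum(1 for b in primer if set(IUPAC[b]) & {"G", "C"})
    return always_gc / len(primer), maybe_gc / len(primer)


def meets_criteria(primer: str) -> bool:
    """Length 17-23, GC content within 30-60%, at most 3 degenerate bases."""
    gc_min, gc_max = gc_bounds(primer)
    return (17 <= len(primer) <= 23
            and degenerate_count(primer) <= 3
            and gc_min >= 0.30
            and gc_max <= 0.60)


# Target-specific portions of the HEV G3 primers listed in Table 3.
for name, seq in [("G3STF1", "TGTTGCGCAGGTYTGTGT"),
                  ("G3STR1", "GCARCATAGGCARAARCACGA"),
                  ("G3STR2 (without adapter)", "CATAGGCARAARCACGARGAA")]:
    print(name, len(seq), degenerate_count(seq), meets_criteria(seq))
```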

cDNA synthesis and semi-nested PCR

Synthesis of cDNA used the Invitrogen SuperScript™ IV First-Strand Synthesis System (Invitrogen, ThermoFisher Scientific Waltham, MA, USA) and random hexamers (final concentration 2.5 μM), per the manufacturer’s protocol (template volume 10 μl, reaction volume 20 μl). An Eppendorf Mastercycler Nexus was used for cDNA synthesis as well as for the first and second rounds of the semi-nested PCRs (nPCR). The primers for the norovirus GII nPCR were described previously and target the extreme 3’ end of the RdRp gene (ORF1) and the 5’ portion of the VP1 major capsid protein gene (ORF2). The primers used in the second round for both norovirus GII and HEV assays were modified with 5’ adapter sequences to allow addition of barcode sequences to enable multiplex sequencing. The primer sequences and reaction conditions are detailed in Table 3. The final concentration of both forward and reverse primers was 0.4 μM.

Table 3. Nucleotide sequence of the primers used for amplification of HEV G3 and norovirus GII by semi-nested PCR.

Assay | Round | Primer name and nucleotide sequence | Amplicon size (bp) | Target fragment size (bp)b | Reference
Norovirus GII | First | QNIF2D: 5’-ATGTTCAGRTGGATGAGRTTCTCWGA-3’; GIISKR: 5’-CCRCCNGCATRHCCRTTRTACAT-3’ | 378 | - | Loisy et al. (2005); Kojima et al. (2002)
Norovirus GII | Second | GIISKF_T: 5’-TTTCTGTTGGTGCTGATATTGCCNTGGGAGGGCGATCGCAA-3’; GIISKR_T: 5’-ACTTGCCTGTCGCTCTATCTTCCCRCCNGCATRHCCRTTRTACAT-3’ | 344a | 302 | Kojima et al. (2002)
HEV G3 | First | G3STF1: 5’-TGTTGCGCAGGTYTGTGT-3’; G3STR1: 5’-GCARCATAGGCARAARCACGA-3’ | 258 | - | This study
HEV G3 | Second | G3STF2: 5’-TTTCTGTTGGTGCTGATATTGCTGTTGCGCAGGTYTGTGT-3’; G3STR2: 5’-ACTTGCCTGTCGCTCTATCTTCCATAGGCARAARCACGARGAA-3’ | 254a | 215 | This study

a The amplicon size given here does not include the primer adapters, which added 44 more base pairs. Sequences in bold are the primer adapters.

b The amplicons are trimmed to remove the primer sequences to give the trimmed target fragment.
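The sizes reported in Table 3 can be cross-checked with a short calculation: the target fragment is the second-round amplicon minus the target-specific portions of the two primers. The sketch below assumes that the adapters are the 22-base prefixes shared by the two forward primers and by the two reverse primers, which is consistent with the 44 bp stated in footnote a; the dictionary layout and function name are illustrative only.

```python
# Assumed adapter sequences: the shared 5' prefixes of the second-round primers.
ADAPTER_FWD = "TTTCTGTTGGTGCTGATATTGC"
ADAPTER_REV = "ACTTGCCTGTCGCTCTATCTTC"

SECOND_ROUND = {
    "Norovirus GII": {
        "fwd": "TTTCTGTTGGTGCTGATATTGCCNTGGGAGGGCGATCGCAA",
        "rev": "ACTTGCCTGTCGCTCTATCTTCCCRCCNGCATRHCCRTTRTACAT",
        "amplicon_bp": 344,  # without adapters, as given in Table 3
    },
    "HEV G3": {
        "fwd": "TTTCTGTTGGTGCTGATATTGCTGTTGCGCAGGTYTGTGT",
        "rev": "ACTTGCCTGTCGCTCTATCTTCCATAGGCARAARCACGARGAA",
        "amplicon_bp": 254,
    },
}


def target_fragment_bp(assay: dict) -> int:
    """Amplicon length after removing both target-specific primer regions."""
    assert assay["fwd"].startswith(ADAPTER_FWD)
    assert assay["rev"].startswith(ADAPTER_REV)
    fwd_core = assay["fwd"][len(ADAPTER_FWD):]
    rev_core = assay["rev"][len(ADAPTER_REV):]
    return assay["amplicon_bp"] - len(fwd_core) - len(rev_core)


for name, assay in SECOND_ROUND.items():
    with_adapters = assay["amplicon_bp"] + len(ADAPTER_FWD) + len(ADAPTER_REV)
    print(name, target_fragment_bp(assay), with_adapters)
```

With these inputs the calculation reproduces the target fragment sizes of 302 bp and 215 bp listed in the table.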

Amplification conditions for HEV G3 and norovirus GII PCRs were the same for the first and second rounds, and consisted of 95°C for 1 min, followed by 40 cycles of 95°C for 30 sec, 50°C for 30 sec, and 72°C for 30 sec. There was a final extension step of 72°C for 7 min. A negative control was utilised in all PCR reactions (water in place of sample cDNA). Negative controls were sequenced alongside samples providing positive nPCR results. A positive control was used for the first and second round PCRs for the norovirus GII assay, composed of cDNA synthesised from a norovirus GII LENTICULE (RMNOROG2, UK Health Security Agency). For G3 HEV, cDNA synthesised from RNA from cell culture was used as a positive control (RNA kindly donated by Eva Trojnar and Reimar Johne of Bundesinstitut für Risikobewertung, Berlin, Germany). Positive controls were not sequenced.

The nPCR amplicons were visualised using 2% agarose (Scientific Laboratory Supplies) gel electrophoresis with Gel Red (10,000X in water, BIOTIUM) nucleic acid gel stain. PCR products with the expected band size were stored at 4°C for up to 48 h or frozen at -20°C for up to 7 days until sequencing.

Nanopore Sequencing

Amplicons were purified using AMPure XP Reagent for PCR Purification (Beckman Coulter, Brea, CA, USA) using a 1:1 ratio of beads to amplicons and the remaining protocol carried out as per the manufacturer’s instructions. The rest of the procedure follows the Oxford Nanopore Technologies protocol “PCR barcoding (96) amplicons (SQK-LSK109)”. Library preparation was carried out using PCR barcodes (EXP-PBC096), ligation sequencing kit (SQK-LSK109), and flow cell priming kit (EXP-FLP002) per the manufacturer’s instructions. This enabled preparation of DNA libraries for sequencing on a MinION Mk1C machine. Sequencing runs took between 8 and 48 h to generate a minimum of 20,000 reads per sample on R9.4.1 flow cells.

Once the MinION sequencing runs were stopped, the fast5 files were processed using the high accuracy basecalling model (part of Guppy 4.3.4; (Oxford Nanopore Technologies, NA-a)) to generate fastq files for each barcode (minimum quality score of 7). The fastq files were then processed using a bioinformatics pipeline. Firstly, reads were trimmed using the program Cutadapt (version 3.2) to remove adapters, barcodes, and primer sequences (Martin, 2011). Trimmed reads were then aligned to reference genome sequences from Smith et al. (2020) or Kroneman et al. (2011) using Minimap2 (version 2.17) and Samtools (version 1.1) (Li, 2018; Danecek et al., 2021). Reads which aligned to a given reference with more than 1000x coverage were error corrected using Canu (version 2.1.1) with a minimum overlap of 150 bp, a minimum read length of 300 bp and a minimum coverage of 30x (Koren et al., 2017). One error corrected read per reference was then saved into a file using Seqtk (version 1.3) and used as a consensus as previously described (Li, 2008). The process was repeated for each barcode, with consensus sequences from each barcode then aligned using MAFFT (version 7.475) (Katoh and Standley, 2013).
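The minimum quality score applied at basecalling can be illustrated with a small sketch of a per-read quality filter. It assumes that the read-level score is derived from the mean per-base error probability rather than the arithmetic mean of the Phred values, and that qualities are encoded with the usual FASTQ offset of 33; the function names are hypothetical and this is not Guppy's own code.

```python
import math


def mean_qscore(quality_string: str, offset: int = 33) -> float:
    """Read-level Phred score from the mean per-base error probability."""
    error_probs = [10 ** (-(ord(ch) - offset) / 10) for ch in quality_string]
    return -10 * math.log10(sum(error_probs) / len(error_probs))


def passes_filter(quality_string: str, min_q: float = 7.0) -> bool:
    """True if the read meets the minimum quality score (7 in this study)."""
    return mean_qscore(quality_string) >= min_q


# 'I' encodes Q40 and '#' encodes Q2 under the offset-33 convention.
for quality in ("IIII", "####", "I#"):
    print(quality, round(mean_qscore(quality), 2), passes_filter(quality))
```

Averaging error probabilities means that a few very poor bases pull the read score down sharply: the mixed read in the example has an arithmetic mean Phred value of 21 but a read-level score of about 5, so it would be rejected.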

These alignments were then manually checked, and duplicate consensus sequences were removed. Sequences with a Hamming Dissimilarity distance (calculated within Unipro UGENE, Windows version 40 (Okonechnikov et al., 2012)) greater than 10 were classed as different sequences. Sequence reads were then aligned against the consensus sequences, and information such as coverage, alignment quality and the proportion of reads which aligned to a consensus were recorded, using Minimap2 and Samtools. The CPU version of Medaka (version 1.2.3) (Oxford Nanopore Technologies, NA-b) was then used to polish the consensus sequences. Consensus sequences were defined as being unique if they had a sequence dissimilarity of ≥5%, due to the possibility of errors from nanopore sequencing. Manual identification and removal of chimeric sequences present as a result of a sequencing artifact was then carried out. The G3 HEV and norovirus GII processing pipelines were very similar, differing only by the amplicon length, primer sequences and database of reference sequences. The code for these pipeline processes can be seen in Online Resources 2 and 3 (DOIs: 10.6084/m9.figshare.21900885; 10.6084/m9.figshare.21900900). The alignment files used can be seen in Online Resources 4 and 5 (DOIs: 10.6084/m9.figshare.21900903; 10.6084/m9.figshare.21900912). A minimum of 1,000 mapped reads was required to confirm the sequence was not present as an artifact due to barcode hopping or cross-contamination. All negative controls had fewer than 100 reads.
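The dissimilarity-based definition of unique consensus sequences can be sketched as follows. The authors performed this step with UGENE and manual curation; the greedy selection shown here is an assumed stand-in for illustration, it requires sequences that are already aligned to equal length, and the toy sequences are not data from the study.

```python
def hamming_distance(a: str, b: str) -> int:
    """Number of mismatched positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def percent_dissimilarity(a: str, b: str) -> float:
    """Hamming distance expressed as a percentage of the alignment length."""
    return 100.0 * hamming_distance(a, b) / len(a)


def pick_unique(consensus: list, threshold_pct: float = 5.0) -> list:
    """Keep a sequence only if it differs by >= threshold from all kept ones."""
    unique = []
    for name, seq in consensus:
        if all(percent_dissimilarity(seq, kept) >= threshold_pct
               for _, kept in unique):
            unique.append((name, seq))
    return unique


# Toy 20-base sequences: s2 duplicates s1, s3 differs at one position (5%).
toy = [("s1", "A" * 20), ("s2", "A" * 20), ("s3", "A" * 19 + "C")]
print([name for name, _ in pick_unique(toy)])
```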

Phylogenetic Analysis

For the norovirus GII phylogenetic analysis, the selected primers generated sequence fragments of 302 bp (excluding primer-derived sequence); however, 20 bp of RNA-dependent RNA polymerase gene sequence was trimmed from the start of each sequence to enable comparison with capsid sequences from NCBI. This was done to prevent distortion of the phylogenetic analysis due to the presence of polymerase/capsid recombinants in the database. Reference sequences used for the phylogenetic analysis of the nucleotide sequencing data were retrieved from NCBI; 514 sequences for HEV genotypes 1-8, and 96 sequences for norovirus GII were downloaded and merged into two separate FASTA files, using BioEdit (Hall, 1999). The downloaded HEV sequences included sequences from humans, swine, deer, macaques and mongooses. The polished G3 HEV and norovirus GII consensus sequences obtained in the present study were then added to the appropriate FASTA files before alignment using Clustal Omega. The sequences were trimmed to be the same length as the sequenced amplicons (excluding primer-derived sequence and 20 bp of RNA-dependent RNA polymerase gene in the case of norovirus GII) and alignments were curated by eye. IQTREE was used to determine the most suitable evolutionary model for phylogenetic analysis, based on Bayesian Information Criteria scores (Nguyen et al., 2015). The substitution model selected was TIM2 with gamma distribution (TIM2 + G) for norovirus GII and G3 HEV. Phylogenetic trees were visualised in iTOL (Letunic and Bork, 2016).
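Model selection by Bayesian Information Criterion can be illustrated with the standard formula BIC = k ln(n) - 2 ln(L), where k is the number of free parameters, n the sample size and L the maximised likelihood; the model with the lowest score is preferred. The sketch below is not IQTREE's implementation. It takes n to be the number of alignment sites as an assumption, and the candidate names, log-likelihoods and parameter counts are hypothetical values chosen only to show the comparison.

```python
import math


def bic(log_likelihood: float, n_free_params: int, n_sites: int) -> float:
    """Bayesian Information Criterion; lower values indicate a better trade-off."""
    return n_free_params * math.log(n_sites) - 2.0 * log_likelihood


def best_model(candidates: dict, n_sites: int) -> str:
    """Return the candidate name with the lowest BIC.

    candidates maps a model name to (log_likelihood, n_free_params).
    """
    return min(candidates, key=lambda name: bic(*candidates[name], n_sites))


# Hypothetical candidates for a 215-site alignment.
candidates = {"model_A": (-1000.0, 10), "model_B": (-999.0, 20)}
print(best_model(candidates, n_sites=215))
```

In this toy comparison the richer model gains only one log-likelihood unit for ten extra parameters, so the penalty term favours the simpler model.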

HEV sequences were typed using the RIVM Hepatitis E Virus Genotyping Tool (https://www.rivm.nl/mpf/typingtool/hev/) and norovirus GII sequences were typed using the RIVM Norovirus Genotyping Tool (Kroneman et al., 2011). HEV sequences obtained from this study were uploaded to GenBank with accession numbers OQ918704 to OQ918713. Norovirus sequences were uploaded with accession numbers OQ913488 to OQ913500.

Results

Hepatitis E virus

Semi-nested PCR products of the expected size were obtained from 33 of the 42 RT-qPCR-positive samples selected for HEV sequencing analysis. However, sequencing data were obtained from only 10 influent samples, with at least one HEV sequence per sample. No wastewater effluent samples yielded any HEV sequencing data. Online Resource 6 (DOI: 10.6084/m9.figshare.21900924) shows a box and whisker plot of the distribution of CT values for samples which failed or succeeded in providing HEV sequences.

The number of mapped reads per HEV amplicon sequence ranged from 2,115 to 401,595 after sequences of incorrect length were removed. The number of reads and coverage data can be seen in Online Resource 7 (DOI: 10.6084/m9.figshare.22152383). Samples from which HEV sequencing data were obtained mostly yielded one consensus sequence irrespective of sequencing depth, but in a single sample two different HEV consensus sequences were obtained. The eleven HEV consensus sequences generated included ten different sequences; Wastewater seq1 and Wastewater seq6 (identified from different influent samples) were identical. All consensus sequences were genotyped as G3 using the RIVM Hepatitis E Virus Genotyping Tool. Wastewater seq1/seq6, Wastewater seq4 and Wastewater seq5 were subtyped as subtype G3c. The other sequences could not be subtyped due to weak phylogenetic support.

A nucleotide BLAST of the sequences showed the majority clustered most closely with GenBank HEV sequences detected in humans (Altschul et al., 1990). The results with the highest percentage identity can be seen in Online Resource 8 (DOI: 10.6084/m9.figshare.22300027).

As only a few consensus sequences were successfully subtyped using the RIVM tool, phylogenetic analysis of the HEV sequences was performed to identify subtype clustering of the HEV consensus sequences. Fig. 1 shows the phylogenetic tree constructed using HEV consensus sequences from this study alongside previously published sequences. The tree showed that the consensus sequences obtained in this study cluster with HEV strains detected in humans and pigs. Note that the phylogenetic tree shown in Fig. 1 is pruned; the full tree includes HEV strains isolated from a wide variety of host species. These strains were all more distantly related to the consensus sequences than the human and pig-derived strains shown.

Fig. 1. Pruned phylogenetic tree of HEV sequences.


Pruned phylogenetic tree of a 215 bp fragment from 514 HEV sequences, and ten unique sequences detected in the present study. Accession numbers can be seen in Online Resource 9 (DOI: 10.6084/m9.figshare.22584049). The pruned tree retains sequences clustering closely with the consensus sequences and reference sequences from the RIVM genotyping tool (+). Genotypes other than G3 are collapsed. Bootstrap values were generated using ultrafast bootstrapping. The full tree, containing reference sequences from other human and animal hosts, can be seen in Online Resource 10 (DOI: 10.6084/m9.figshare.21900915). Wastewater_seq1 was not included in the tree as it was identical to Wastewater_seq6. The sequences from this study are shown with black bold labels. Published sequence labels contain * for human hosts or ** for swine hosts, indicating the host species in which the sequence was identified. The scale bar shows the branch length representing 0.1 substitutions per site. Tree created using IQTREE and visualised in iTOL.

The phylogenetic tree shows that sequences 9, 2, 11 and 7 cluster with subtype G3e sequences, close to both HEV strains observed in humans and swine. Sequence 10 appears to cluster most closely with G3m, close to HEV strains detected in humans and swine. Sequences 3, 4, 5, 6 and 8 cluster together within G3c, most closely with HEV strains described in humans.

Norovirus

Of the 42 influent and effluent wastewater samples which were selected for norovirus GII amplification by nPCR, 23 (12 influent and 11 effluent samples) yielded bands with the expected size and provided valid sequencing data (sequences with over 1,000 reads). Fifteen of the 42 samples did not amplify using the nPCR assay. One sample provided <1,000 reads and so was not analysed further. The remaining three samples did not provide norovirus sequences (despite showing an amplicon of the expected size by nPCR and inclusion in the library preparation). Online Resource 11 (DOI: 10.6084/m9.figshare.22299694) shows a box and whisker plot of the distribution of CT values for samples which failed and succeeded.

The number of reads which mapped to norovirus GII reference sequences varied between 1,329 and 93,423 for the samples. The negative control produced no norovirus reads. The sequence mean depth per sample can be seen in Online Resource 12 (DOI: 10.6084/m9.figshare.21900921).

Across the 23 samples that generated norovirus sequences, 93 consensus sequences in total were obtained from the wastewater samples, with 13 of these being unique sequences (where nucleotide sequence dissimilarity was greater than 5%). These 13 sequences represented eight genotypes as determined using the RIVM Norovirus Typing Tool: GII.2, GII.3, GII.4, GII.6, GII.7, GII.9, GII.13 and GII.17. The two different GII.4 consensus sequences were further identified by the Typing Tool as belonging to the Sydney 2012 strain. One of the wastewater samples contained only one genotype of norovirus GII whereas the remaining samples contained between two and six genotypes (Fig. 2).

Fig. 2. Percentage of reads attributed to each genotype of norovirus GII in wastewater samples.


An alternative version suitable for the condition tritanopia can be seen in Online Resource 13 (DOI: 10.6084/m9.figshare.21900927). Figure created in R using package ggplot2.
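The quantity plotted in Fig. 2 is the share of mapped reads assigned to each genotype within a sample. The original figure was produced in R with ggplot2; the following Python sketch only illustrates the underlying calculation, using a hypothetical function name and hypothetical read counts rather than data from the study.

```python
def genotype_percentages(read_counts: dict) -> dict:
    """Percentage of a sample's mapped reads attributed to each genotype."""
    total = sum(read_counts.values())
    if total == 0:
        raise ValueError("sample has no mapped reads")
    return {genotype: 100.0 * n / total for genotype, n in read_counts.items()}


# Hypothetical mapped-read counts for one sample.
example_counts = {"GII.3": 600, "GII.4": 300, "GII.6": 100}
print(genotype_percentages(example_counts))
```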

Eleven of the unique sequences were found in multiple samples (up to a maximum of 20 samples). Between one and eight different norovirus GII consensus sequences were observed per sample. Multiple consensus sequences for the same genotypes were identified in eight samples (Table 4).

Table 4. Number of unique sequences and genotypes observed for each sample.

Sample | Sample type | Number of unique sequences | Genotypes present (number of unique sequences)a
SW024 | Influent | 3 | GII.2, GII.3, GII.4
SW026 | Influent | 4 | GII.3, GII.4, GII.6, GII.7
SW034 | Influent | 3 | GII.2, GII.3, GII.4
SW037 | Effluent | 1 | GII.6
SW038 | Influent | 8 | GII.2, GII.3, GII.4 (2), GII.6 (3), GII.7
SW044 | Influent | 2 | GII.2, GII.3
SW046 | Influent | 5 | GII.2, GII.3 (2), GII.6, GII.17
SW048 | Influent | 6 | GII.2, GII.3, GII.6 (3), GII.7
SW050 | Influent | 3 | GII.3, GII.4, GII.6
SW059 | Effluent | 2 | GII.3, GII.9
SW061 | Effluent | 5 | GII.2, GII.3, GII.4, GII.6, GII.9
SW063 | Effluent | 5 | GII.2, GII.3, GII.4, GII.7, GII.17
SW094 | Influent | 7 | GII.2, GII.3, GII.4, GII.6 (3), GII.7
SW096 | Influent | 4 | GII.2, GII.3, GII.4, GII.6
SW101 | Effluent | 2 | GII.2, GII.3
SW103 | Effluent | 3 | GII.3, GII.4, GII.6
SW105 | Effluent | 4 | GII.3, GII.4, GII.13, GII.17
SW106 | Influent | 5 | GII.2, GII.3, GII.4, GII.6, GII.7
SW107 | Effluent | 4 | GII.2, GII.4 (2), GII.7
SW117 | Effluent | 6 | GII.2, GII.3, GII.4, GII.6, GII.13, GII.7
SW121 | Effluent | 2 | GII.3, GII.4
SW122 | Influent | 2 | GII.3, GII.4
SW133 | Effluent | 2 | GII.4, GII.6

a Each genotype was detected once in the sample unless otherwise specified in brackets.
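The bracket notation used in Table 4 can be read programmatically, which gives a simple consistency check between the genotype list and the number of unique sequences reported for a sample. The parser below is a hypothetical helper written for this illustration and is applied to three rows copied from the table.

```python
import re

# A genotype label, optionally followed by a bracketed count, e.g. "GII.6 (3)".
GENOTYPE_CELL = re.compile(r"(GII\.\d+)\s*(?:\((\d+)\))?")


def parse_genotypes(cell: str) -> dict:
    """Map each genotype in a table cell to its number of unique sequences."""
    return {genotype: int(count) if count else 1
            for genotype, count in GENOTYPE_CELL.findall(cell)}


# Rows taken from Table 4: (sample, reported unique sequences, genotype cell).
rows = [
    ("SW037", 1, "GII.6"),
    ("SW038", 8, "GII.2, GII.3, GII.4 (2), GII.6 (3), GII.7"),
    ("SW107", 4, "GII.2, GII.4 (2), GII.7"),
]

for sample, reported, cell in rows:
    parsed = parse_genotypes(cell)
    print(sample, parsed, sum(parsed.values()) == reported)
```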

The genotype which was detected most frequently was GII.3, from 20 samples, followed by GII.4, in 17 different samples. Genotypes GII.2, GII.3, GII.4, GII.6, GII.7 and GII.17 were detected in the 12 influent samples. Genotypes GII.2, GII.3, GII.4, GII.6, GII.7, GII.9, GII.13 and GII.17 were detected in the effluent samples. The number of consensus sequences attributed to each genotype can be seen in Table 5. The most common genotype found in influent samples was GII.3 (in 12 samples), followed by GII.4 and GII.2 (9 samples each). The most common genotypes within effluent were GII.3 and GII.4 (both found in 8 samples). Sequences from GII.2 and GII.3 were detected in the influent and effluent of each WWTP which yielded sequencing results (five out of seven).

Table 5. Sequences attributed to each norovirus GII genotype.

Genotype | Number of samples | Number of unique sequences
GII.2 | 14 | 1
GII.3 | 20 | 2
GII.4 | 17 | 2
GII.6 | 13 | 4
GII.7 | 7 | 1
GII.9 | 2 | 1
GII.13 | 2 | 1
GII.17 | 4 | 1
TOTAL | 23 | 13

A Megablast search with default parameters was conducted to identify similar norovirus GII sequences in the GenBank nucleotide collection (nr/nt), revealing nucleotide sequence similarity values between 91 and 100%. Phylogenetic analysis of the VP1 sequences showed identical genotyping results to the ones obtained using the RIVM Norovirus Typing Tool (Fig. 3).
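To illustrate what a nucleotide similarity value of this kind expresses, the sketch below computes a simple percent identity between two pre-aligned sequences. It is an assumption-laden illustration, not the Megablast algorithm: the function name `percent_identity` is hypothetical, gap columns are simply skipped (BLAST handles gaps differently), and the two 20-nucleotide fragments are invented for the example rather than taken from the study.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of ungapped aligned positions at which the two bases match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # simplification: columns containing a gap are ignored
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Hypothetical aligned fragments, invented for illustration only.
query = "ATGAAGATGGCGTCGAATGA"
hit = "ATGAAGATGGCGTCGAGTGA"

# One mismatch in 20 compared positions.
print(round(percent_identity(query, hit), 1))  # 95.0
```

On a 282 bp amplicon, a similarity of 91% under this simplified definition would correspond to roughly 25 mismatched positions.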

Fig. 3. Pruned phylogenetic tree of norovirus GII sequences.

Pruned phylogenetic tree constructed using a 282 bp fragment of the norovirus GII major capsid protein gene (VP1), showing reference sequences from the RIVM Norovirus Typing Tool (+). Amplicon sequences are labelled in bold text. Accession numbers are available in Online Resource 14 (DOI: 10.6084/m9.figshare.22584079). Bootstrap support was >70% for all major genotype clades. The full phylogenetic tree, including reference sequences from other genotypes, can be viewed online in Online Resource 15 (DOI: 10.6084/m9.figshare.21900933). Only sequences obtained in the present study with a minimum nucleotide dissimilarity of 5% were included. The scale bar shows the branch length that represents an amount of genetic change of 0.1. Figure created using IQ-TREE, with visualisation in iTOL and editing in GIMP.

Discussion

HEV in wastewater

Of the 42 samples of wastewater influent and effluent analysed (all of which reported high CT values during RT-qPCR for HEV), ten provided HEV sequencing data using the metabarcoding approach developed in the present study. The CT values of these samples ranged from 34 to 41, showing that even samples with low levels of virus can be sequenced using this technique, although several other samples with similar CTs could not be sequenced. It is possible that this was due to degradation of the HEV RNA within the wastewater samples.

In most cases the sequences from this study closely matched G3c and G3e HEV sequences detected in humans, while others were closely related to swine HEV sequences, suggesting that these two host species were the major sources of the viruses observed in the present study. This is unsurprising, as the same HEV strains circulate in both swine and humans. However, if more animal sequences were publicly accessible, these results might have been less biased towards a link with HEV strains infecting humans or swine, as G3 HEV is capable of infecting many different animal hosts (Kenney, 2019). This can only be assessed when more HEV sequences from other animals become available. Considering that most of the WWTPs were fed primarily by human wastewater, it seems likely that most of the HEV sequences originated from human rather than animal sources. Some subtypes have become more dominant in the UK in the past two decades. Ijaz et al. (2013) and Grierson et al. (2015) described the emergence of new G3 phylotypes within the UK, with HEV subtypes appearing to form two major clades, one of which had been dominant between 2003 and 2010 and one of which became more dominant from 2011 (Ijaz et al., 2013). Clade 1 includes subtypes 3e, 3f and 3g, and clade 2 includes subtypes 3a, 3b, 3c, 3d, 3h, 3i and 3j (Ijaz et al., 2013; Smith et al., 2021).

In the UK in 2013, pigs were shown to be infected mainly with viruses from clade 1, whilst humans were infected mainly with viruses from clade 2 (Grierson et al., 2015); meanwhile, HEV strains found in pigs from other European countries in the same time frame appeared to cluster with clade 2. A study of a limited number of UK infections in blood donors from 2018 to 2019 also showed most infections to belong to clade 2 (Smith et al., 2021). The virus subtypes identified within this study appear to fall into both clades. As swine are thought to be the main reservoir of HEV, this may mean that swine HEV strains originally identified in both the UK and other European countries circulate within the UK. However, it is possible that the swine data from the previous studies were too limited, and a phylogenetic comparison of the wastewater sequences with swine HEV strains in the UK was not possible (due to a lack of UK HEV sequences). Because of the small sample size, it was not possible to draw conclusions on whether these two major groups may be circulating more or less than previously reported, and there are currently no data to suggest which subtypes were more dominant in the UK population in 2019.

The sequences obtained were from subtypes G3c and G3e, with one undetermined. Possible reasons for the lack of subtype diversity are that these were the subtypes circulating in the populations served by the WWTPs at the time, and that the WWTPs studied may not fully represent the population of the UK. No clinical data for this time period are publicly available in the UK against which to compare the strains. However, there was an outbreak of HEV in Italy at the end of 2019 in which G3e and G3f were isolated from patients (Garbuglia et al., 2021), and strains isolated from patients in Spain in 2019 were from subtypes G3f and G3m (Muñoz-Chimeno et al., 2022). This means that G3e, G3f and G3m strains were actively circulating in Europe in 2019.

HEV presence in both wastewater and the community is likely to be infrequent in the UK, given the low prevalence identified in previous studies, such as the identification of HEV in 3% of Scottish shellfish samples sold in a supermarket (O’Hara et al., 2018), and annual reports of clinically diagnosed HEV cases (Public Health England, 2019). However, despite this low prevalence, HEV may contaminate the aquatic environment in the UK, most likely through the release of untreated human (and possibly animal) wastewater into water courses via combined sewer overflows (CSOs), which are allowed to spill into water courses during storm conditions. Considering that CSOs across the UK spilt into water courses for over 3 million hours in 2020, constituting 400,000 spills (Environment Agency, 2020; Laville, 2021), it is possible that HEV from human faecal sources regularly contaminates the aquatic environment. It seems less likely that HEV within treated effluent would be a large source of contamination for the aquatic environment, as no effluent samples from this study provided any HEV sequence data, perhaps due to low copy number or RNA degradation. However, some effluent samples were RT-qPCR positive, so this route cannot be ruled out.

Norovirus GII in wastewater

Of the 42 wastewater samples with CT values between 27 and 39, 23 samples were successfully sequenced. The fifteen samples which did not amplify using the semi-nested PCR may have been too degraded to generate these amplicons. The sequenced samples contained eight different GII genotypes, with individual samples containing as many as six different genotypes. Using nucleotide BLAST, the norovirus GII sequences identified in this study were shown to have a nucleotide sequence similarity of between 91 and 100% with sequences deposited in GenBank. Ten unique sequences were detected (all with >1,000 reads) in multiple individual samples, suggesting widely circulating strains, though this could not be confirmed because the small amplicon length means that different strains may have produced identical sequence results. Surprisingly, GII.3 sequences were detected most commonly (20 samples), followed by GII.4 (17 samples). GII.6 gave the highest number of unique sequences, with four identified across the samples. Two unique GII.4 sequences were also identified; GII.4 has been the most frequently detected genotype worldwide since the 1990s (Cannon et al., 2021) and has high diversity (Parra et al., 2017). However, there were also two unique sequences from GII.3, as well as several unique sequences from the other genotypes. This shows the high diversity of GII noroviruses present in wastewater, and therefore in the community, even in just a short five-month period. This agrees with findings reported by Ollivier et al. (2022), who also found a high level of norovirus diversity within oysters harvested between 2016 and 2018 across 12 different European countries. In the present study, GII.3 and GII.4 were jointly the most common genotypes in effluent samples, whereas GII.3 alone was the most common in influent samples (12 samples).

Sequence data was obtained from samples from five of seven WWTPs. GII.2 and GII.3 were the only genotypes which were detected at all five WWTPs. According to Public Health England (2020), these two genotypes were detected throughout 2019 in outbreaks that occurred in England or Wales.

Despite the abundance of different genotypes within the wastewater samples, clinical data from the UKHSA at the time of the study show that, although GII.4 and GII.6 made up a large proportion of cases, other genotypes detected here, such as GII.9 and GII.13, were not detected in cases from the public (Public Health England, 2020). This is likely to be explained by under-reporting of cases by the public, or by a high frequency of asymptomatic infection, especially for certain genotypes: norovirus cases are estimated to reach 3 million annually (Gherman et al., 2020), yet the UK Health Security Agency reported only 6,172 symptomatic infections between 2018 and 2019 (Public Health England, 2021). Given the finding of these potentially under-reported or asymptomatic genotypes of norovirus, a similar situation may be occurring with norovirus GI and other genotypes of HEV, such as genotype 4. However, this would require further work to confirm, as investigating norovirus GI and HEV G4 was outside the scope of this study. These findings reinforce the value of sequencing environmental samples in the surveillance of human pathogenic viruses, as techniques such as these can enable detection of circulating strains which may not have been identified through clinical surveillance. Indeed, Fontenele et al. (2021) and many other groups utilised HTS technologies to sequence strains of SARS-CoV-2 from wastewater samples, providing an insight into circulating variants in the community.
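The scale of the gap between estimated and reported norovirus infections can be made explicit with a short calculation. This is only a rough sketch: it assumes that the annual estimate and the 2018 to 2019 laboratory reports are directly comparable, which the sources do not guarantee, and the variable names are introduced here purely for illustration.

```python
# Figures quoted in the text. Treating them as directly comparable is an
# assumption, since they come from different sources and reporting periods.
ESTIMATED_ANNUAL_CASES = 3_000_000  # Gherman et al., 2020
REPORTED_INFECTIONS = 6_172         # Public Health England, 2021 (2018 to 2019)

reported_fraction = REPORTED_INFECTIONS / ESTIMATED_ANNUAL_CASES
cases_per_report = ESTIMATED_ANNUAL_CASES / REPORTED_INFECTIONS

print(f"Reported share of estimated cases: {reported_fraction:.2%}")      # 0.21%
print(f"Estimated cases per reported infection: {cases_per_report:.0f}")  # 486
```

Under these assumptions, roughly one in every 486 estimated infections appears in the reported figures, which is consistent with the suggestion that some genotypes circulating in the community are missed by clinical surveillance.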

A limitation of the study is that the small fragment size and PCR-based methods may prevent detection of new HEV and norovirus GII variants (specifically if there were mutations in the primer regions or outside the amplicon region), so the apparent diversity observed may have been lower than the actual diversity. Another limitation was that the method was unable to detect norovirus recombinants, as ORF1 and ORF2 were not sequenced simultaneously, and the nPCR assays used may have inherent biases towards certain genotypes or subtypes (Ollivier et al., 2022). However, sequencing of longer amplicons or whole genomes was unlikely to be successful due to the high CT values for most samples, and it has been observed previously that amplicons longer than those of the RT-qPCR used for detection can lead to lower sensitivity (Aprea et al., 2018; Grierson et al., 2015). The use of small amplicons nevertheless allowed sequencing data to be obtained and HEV and norovirus sequences to be genotyped successfully. A further limitation is that Nanopore sequencing is known to have higher error rates than other sequencing platforms (such as Illumina). However, Nanopore sequencing is still under active development, error rates are reducing over time, and software is being constantly updated and refined to deal better with sequencing errors. Therefore, as Nanopore technologies improve, the results from these subtyping methods are also expected to improve. Despite the limitations of this study, these methods provide a way to type HEV and norovirus from complex and difficult samples with high CT values where other methods such as shotgun sequencing or cloning/Sanger sequencing may not be feasible.

Conclusion

In this study a metabarcoding approach using nanopore sequencing to genotype HEV and norovirus GII present in wastewater samples was successfully developed and applied. This could have several advantages for the sequencing of samples with low viral concentration in future studies (e.g. shellfish samples), and the use of the ONT platform for this method could enable greater portability and scalability of HEV and norovirus GII sequencing, as well as improving affordability over other sequencing platforms. The study has shown that HEV is present in wastewater in southern England and therefore that contamination of the aquatic environment with HEV could occur relatively frequently. It has also shown that many different genotypes of norovirus GII circulate simultaneously in wastewater, and that national surveillance of clinical norovirus cases may not be fully representative of all circulating genotypes within the population.

Supplementary Material

Supplementary file 2
Supplementary file 6
Supplementary file 7
Supplementary file 8
Supplementary file 9
Supplementary file 10
Supplementary file 11
Supplementary file 12
Supplementary file 13
Supplementary file 14
Supplementary file 15

Acknowledgements

Funding has been provided by a Cefas internal Seedcorn project (DP422), a Seafood Innovation Fund, UK, contract RD011 and the University of Exeter for this work. Ben Longdon is supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (grant no. 109356/Z/15/Z) https://wellcome.ac.uk/funding/sir-henry-dale-fellowships. We would like to thank the anonymous reviewers for their help with submission of this article. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.

Declarations

Financial and non-financial interests

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript.

Ethics approval

Ethical approval was granted by the University of Exeter Research Ethics Committee for this work.

Consent to participate

No human participants were involved in this work.

Consent to publish

No human participants were involved in this work.

Availability of data and material

See Online Resources for supplementary materials, including bioinformatics scripts. Sequences can be found in GenBank under accession numbers OQ913488 to OQ913500 and OQ918704 to OQ918713.

Code availability

See Online Resources.

References

  1. Agrawal S, Orschler L, Lackner S. Long-term monitoring of SARS-CoV-2 RNA in wastewater of the Frankfurt metropolitan area in Southern Germany. Scientific reports. 2021;11(1):5372. doi: 10.1038/s41598-021-84914-2.
  2. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10. doi: 10.1016/S0022-2836(05)80360-2.
  3. Aprea G, Amoroso MG, Di Bartolo I, D’Alessio N, Di Sabatino D, Boni A, Cioffi B, D’Angelantonio D, Scattolini S, De Sabato L, Cotturone G, et al. Molecular detection and phylogenetic analysis of hepatitis E virus strains circulating in wild boars in south-central Italy. Transboundary and Emerging Diseases. 2018;65(1):e25–e31. doi: 10.1111/tbed.12661.
  4. Aspinall EJ, Couturier E, Faber M, Said B, Ijaz S, Tavoschi L, Takkinen J, Adlhoch C, on behalf of the country experts. Hepatitis E virus infection in Europe: surveillance and descriptive epidemiology of confirmed cases, 2005 to 2015. Eurosurveillance. 2017;22(26):30561. doi: 10.2807/1560-7917.ES.2017.22.26.30561.
  5. Bartsch C, Höper D, Mäde D, Johne R. Analysis of frozen strawberries involved in a large norovirus gastroenteritis outbreak using next generation sequencing and digital PCR. Food microbiology. 2018;76:390–395. doi: 10.1016/j.fm.2018.06.019.
  6. Cannon J, Bonifacio J, Bucardo F, Buesa J, Bruggink L, Chan MC-W, Fumian T, Giri S, Gonzalez M, Hewitt J, Lin J-H, et al. Global Trends in Norovirus Genotype Distribution among Children with Acute Gastroenteritis. Emerging Infectious Disease journal. 2021;27(5):1438. doi: 10.3201/eid2705.204756.
  7. Centers for Disease Control and Prevention. Norovirus Worldwide. Centre for Disease Control; USA: 2021. [Accessed: 22 January 2023]. Available at: https://www.cdc.gov/norovirus/trends-outbreaks/worldwide.html.
  8. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2). doi: 10.1093/gigascience/giab008.
  9. Environment Agency. Event Duration Monitoring - Storm Overflows - 2020. Database of CSO use throughout UK in 2020; 2020. Available at: https://environment.data.gov.uk/dataset/21e15f12-0df8-4bfc-b763-45226c16a8ac.
  10. Fontenele RS, Kraberger S, Hadfield J, Driver EM, Bowes D, Holland LA, Faleye TOC, Adhikari S, Kumar R, Inchausti R, Holmes WK, et al. High-throughput sequencing of SARS-CoV-2 in wastewater provides insights into circulating variants. Water Research. 2021;205:117710. doi: 10.1016/j.watres.2021.117710.
  11. Gao J, Wu H, Shi X, Huo Z, Zhang J, Liang Z. Comparison of Next-Generation Sequencing, Quantitative PCR, and Sanger Sequencing for Mutation Profiling of EGFR, KRAS, PIK3CA and BRAF in Clinical Lung Tumors. Clin Lab. 2016;62(4):689–96. doi: 10.7754/clin.lab.2015.150837.
  12. Garbuglia AR, Bruni R, Villano U, Vairo F, Lapa D, Madonna E, Picchi G, Binda B, Mariani R, De Paulis F, D’Amato S, et al. Hepatitis E Outbreak in the Central Part of Italy Sustained by Multiple HEV Genotype 3 Strains, June-December 2019. Viruses. 2021;13(6). doi: 10.3390/v13061159.
  13. Garson JA, Ferns RB, Grant PR, Ijaz S, Nastouli E, Szypulska R, Tedder RS. Minor groove binder modification of widely used TaqMan probe for hepatitis E virus reduces risk of false negative real-time PCR results. Journal of virological methods. 2012;186(1):157–160. doi: 10.1016/j.jviromet.2012.07.027.
  14. Gherman I, Holland D, Clifford R, Mahmoudzadeh N, Uy F, Adkin A, Kainth B, Cook P, Day A, Wilson A. Technical Report: Review of Quantitative Risk Assessment of foodborne norovirus transmission. Science. 2020;1:1.
  15. Grierson S, Heaney J, Cheney T, Morgan D, Wyllie S, Powell L, Smith D, Ijaz S, Steinbach F, Choudhury B. Prevalence of hepatitis E virus infection in pigs at the time of slaughter, United Kingdom, 2013. Emerging infectious diseases. 2015;21(8):1396. doi: 10.3201/eid2108.141995.
  16. Guillois Y, Abravanel F, Miura T, Pavio N, Vaillant V, Lhomme S, Le Guyader FS, Rose N, Le Saux J-C, King LA. High proportion of asymptomatic infections in an outbreak of hepatitis E associated with a spit-roasted piglet, France, 2013. Clinical Infectious Diseases. 2015;62(3):351–357. doi: 10.1093/cid/civ862.
  17. Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT.
  18. Htet Z, Althaf M, Karim M. Hepatitis E infection in solid organ transplant recipients: An overlooked diagnosis? Clin Nephrol. 2018;90(6):427–430. doi: 10.5414/CN109533.
  19. Ijaz S, Said B, Boxall E, Smit E, Morgan D, Tedder RS. Indigenous hepatitis E in England and Wales from 2003 to 2012: evidence of an emerging novel phylotype of viruses. The Journal of infectious diseases. 2013;209(8):1212–1218. doi: 10.1093/infdis/jit652.
  20. International Organization for Standardization. ISO 15216-1:2017: Microbiology of food and animal feed—Horizontal method for determination of hepatitis A virus and norovirus in food using real-time RT-PCR. Geneva: 2017.
  21. Jothikumar N, Cromeans TL, Robertson BH, Meng X, Hill VR. A broadly reactive one-step real-time RT-PCR assay for rapid and sensitive detection of hepatitis E virus. Journal of virological methods. 2006;131(1):65–71. doi: 10.1016/j.jviromet.2005.07.004.
  22. Kageyama T, Kojima S, Shinohara M, Uchida K, Fukushi S, Hoshino FB, Takeda N, Katayama K. Broadly reactive and highly sensitive assay for Norwalk-like viruses based on real-time quantitative reverse transcription-PCR. Journal of clinical microbiology. 2003;41(4):1548–1557. doi: 10.1128/JCM.41.4.1548-1557.2003.
  23. Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Molecular Biology and Evolution. 2013;30(4):772–780. doi: 10.1093/molbev/mst010.
  24. Kenney SP. The current host range of hepatitis E viruses. Viruses. 2019;11(5):452. doi: 10.3390/v11050452.
  25. Kibbe WA. OligoCalc: an online oligonucleotide properties calculator. Nucleic Acids Res. 2007;35(Web Server issue):W43–6. doi: 10.1093/nar/gkm234.
  26. Kojima S, Kageyama T, Fukushi S, Hoshino FB, Shinohara M, Uchida K, Natori K, Takeda N, Katayama K. Genogroup-specific PCR primers for detection of Norwalk-like viruses. J Virol Methods. 2002;100(1-2):107–14. doi: 10.1016/s0166-0934(01)00404-9.
  27. Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27(5):722–736. doi: 10.1101/gr.215087.116.
  28. Kroneman A, Vennema H, Deforche K, v d Avoort H, Peñaranda S, Oberste MS, Vinjé J, Koopmans M. An automated genotyping tool for enteroviruses and noroviruses. J Clin Virol. 2011;51(2):121–5. doi: 10.1016/j.jcv.2011.03.006.
  29. Laville S. Water firms discharged raw sewage into English waters 400,000 times last year. The Guardian; 2021. Mar 31, 2021. Available at: https://www.theguardian.com/environment/2021/mar/31/water-firms-discharged-raw-sewage-into-english-waters-400000-times-last-year.
  30. Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Research. 2016;44(W1):W242–W245. doi: 10.1093/nar/gkw290.
  31. Lhomme S, Marion O, Abravanel F, Izopet J, Kamar N. Clinical Manifestations, Pathogenesis and Treatment of Hepatitis E Virus Infections. J Clin Med. 2020;9(2). doi: 10.3390/jcm9020331.
  32. Li H. Toolkit for processing sequences in FASTA/Q formats. GitHub; 2008. [Accessed: 30 April 2023]. Available at: https://github.com/lh3/seqtk.
  33. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094–3100. doi: 10.1093/bioinformatics/bty191.
  34. Lin J, Karlsson M, Olofson AS, Belák S, Malmsten J, Dalin AM, Widén F, Norder H. High prevalence of hepatitis E virus in Swedish moose - a phylogenetic characterization and comparison of the virus from different regions. PLoS One. 2015;10(4):e0122102. doi: 10.1371/journal.pone.0122102.
  35. Loisy F, Atmar R, Guillon P, Le Cann P, Pommepuy M, Le Guyader F. Real-time RT-PCR for norovirus screening in shellfish. Journal of virological methods. 2005;123(1):1–7. doi: 10.1016/j.jviromet.2004.08.023.
  36. Lowther JA, Gustar NE, Powell AL, Hartnell RE, Lees DN. Two-year systematic study to assess norovirus contamination in oysters from commercial harvesting areas in the United Kingdom. Applied and environmental microbiology. 2012;78(16):5812–5817. doi: 10.1128/AEM.01046-12.
  37. Lu J, Fang L, Zheng H, Lao J, Yang F, Sun L, Xiao J, Lin J, Song T, Ni T, Raghwani J, et al. The Evolution and Transmission of Epidemic GII.17 Noroviruses. The Journal of Infectious Diseases. 2016;214(4):556–564. doi: 10.1093/infdis/jiw208.
  38. Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, Madhusoodanan N, Kolesnikov A, Lopez R. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic acids research. 2022:gkac240. doi: 10.1093/nar/gkac240.
  39. Mancini P, Bonanno Ferraro G, Iaconelli M, Suffredini E, Valdazo-González B, Della Libera S, Divizia M, La Rosa G. Molecular characterization of human Sapovirus in untreated sewage in Italy by amplicon-based Sanger and next-generation sequencing. Journal of applied microbiology. 2019;126(1):324–331. doi: 10.1111/jam.14129.
  40. Martin J, Klapsa D, Wilton T, Zambon M, Bentley E, Bujaki E, Fritzsche M, Mate R, Majumdar M. Tracking SARS-CoV-2 in Sewage: Evidence of Changes in Virus Variant Predominance during COVID-19 Pandemic. Viruses. 2020;12(10):1144. doi: 10.3390/v12101144.
  41. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011;17:10–12.
  42. Medema G, Heijnen L, Elsinga G, Italiaander R, Brouwer A. Presence of SARS-Coronavirus-2 RNA in sewage and correlation with reported COVID-19 prevalence in the early stage of the epidemic in the Netherlands. Environmental Science & Technology Letters. 2020;7(7):511–516. doi: 10.1021/acs.estlett.0c00357.
  43. Meghnath K, Hasselback P, McCormick R, Prystajecky N, Taylor M, McIntyre L, Man S, Whitfield Y, Warshawsky B, McKinley M, Bitzikos O, et al. Outbreaks of Norovirus and Acute Gastroenteritis Associated with British Columbia Oysters, 2016-2017. Food Environ Virol. 2019;11(2):138–148. doi: 10.1007/s12560-019-09374-4.
  44. Muñoz-Chimeno M, Bartúren S, García-Lugo MA, Morago L, Rodríguez Á, Galán JC, Pérez-Rivilla A, Rodríguez M, Millán R, Del Álamo M, Alonso R, et al. Hepatitis E virus genotype 3 microbiological surveillance by the Spanish Reference Laboratory: geographic distribution and phylogenetic analysis of subtypes from 2009 to 2019. Euro Surveill. 2022;27(23). doi: 10.2807/1560-7917.ES.2022.27.23.2100542.
  45. Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32(1):268–74. doi: 10.1093/molbev/msu300.
  46. Oeser C, Vaughan A, Said B, Ijaz S, Tedder R, Haywood B, Warburton F, Charlett A, Elson R, Morgan D. Epidemiology of Hepatitis E in England and Wales: A 10-Year Retrospective Surveillance Study, 2008-2017. The Journal of Infectious Diseases. 2019;220(5):802–810. doi: 10.1093/infdis/jiz207.
  47. O’Hara Z, Crossan C, Craft J, Scobie L. First Report of the Presence of Hepatitis E Virus in Scottish-Harvested Shellfish Purchased at Retail Level. Food and environmental virology. 2018:1–5. doi: 10.1007/s12560-018-9337-5.
  48. Okonechnikov K, Golosova O, Fursov M, the UGENE team. Unipro UGENE: a unified bioinformatics toolkit. Bioinformatics. 2012;28(8):1166–1167. doi: 10.1093/bioinformatics/bts091.
  49. Ollivier J, Lowther J, Desdouits M, Schaeffer J, Wacrenier C, Oude Munnink BB, Besnard A, Mota Batista F, Stapleton T, Schultz AC, Aarestrup F, et al. Application of Next Generation Sequencing on Norovirus-contaminated oyster samples. EFSA Supporting Publications. 2022;19(6):7348E.
  50. Oxford Nanopore Technologies. (NA-a) Python client library for Guppy. GitHub; [Accessed: 25 April 2023]. Available at: https://github.com/nanoporetech/pyguppyclient.
  51. Oxford Nanopore Technologies. (NA-b) Sequence correction provided by ONT Research. GitHub; [Accessed: 25 April 2023]. Available at: https://github.com/nanoporetech/medaka.
  52. Pallerla SR, Schembecker S, Meyer CG, Linh LTK, Johne R, Wedemeyer H, Bock CT, Kremsner PG, Velavan TP. Hepatitis E virus genome detection in commercial pork livers and pork meat products in Germany. Journal of Viral Hepatitis. 2021;28(1):196–204. doi: 10.1111/jvh.13396. [DOI] [PubMed] [Google Scholar]
  53. Parra GI, Squires RB, Karangwa CK, Johnson JA, Lepore CJ, Sosnovtsev SV, Green KY. Static and Evolving Norovirus Genotypes: Implications for Epidemiology and Immunity. PLoS Pathog. 2017;13(1):e1006136. doi: 10.1371/journal.ppat.1006136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Pintó RM, Costafreda MI, Bosch A. Risk assessment in shellfish-borne outbreaks of hepatitis A. Appl Environ Microbiol. 2009;75(23):7350–7355. doi: 10.1128/AEM.01177-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Prato R, Lopalco PL, Chironna M, Barbuti G, Germinario C, Quarto M. Norovirus gastroenteritis general outbreak associated with raw shellfish consumption in south Italy. BMC infectious diseases. 2004;4:37. doi: 10.1186/1471-2334-4-37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Public Health England. Hepatitis E: symptoms, transmission, treatment and prevention. gov.uk: Public Health England; 2019. [Accessed: 27 August 2019]. Available at: https://www.gov.uk/government/publications/hepatitis-e-symptoms-transmission-prevention-treatment/hepatitis-e-symptoms-transmission-treatment-and-prevention. [Google Scholar]
  57. Public Health England. PHE National norovirus and rotavirus Report. GOV.UK: Public Health England; 2020. Available at: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachmentdata/file/871753/Norovirusupdate2020weeks08to09.pdf. [Google Scholar]
  58. Public Health England. 2018 to 2019 Season Norovirus Report National norovirus laboratory and outbreak data in England. gov.uk: Public Health England; 2021. Available at: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachmentdata/file/966775/2021-01-29-Norovirusannualreport2018to2019.pdf. [Google Scholar]
  59. Puig M, Jofre J, Lucena F, Allard A, Wadell G, Girones R. Detection of adenoviruses and enteroviruses in polluted waters by nested PCR amplification. Appl Environ Microbiol. 1994;60(8):2963–2970. doi: 10.1128/aem.60.8.2963-2970.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Rivadulla E, Varela MF, Mesquita JR, Nascimento MS, Romalde JL. Detection of hepatitis E virus in shellfish harvesting areas from Galicia (Northwestern Spain) Viruses. 2019;11(7):618. doi: 10.3390/v11070618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Robilotti E, Deresinski S, Pinsky BA. Norovirus. Clin Microbiol Rev. 2015;28(1):134–64. doi: 10.1128/CMR.00075-14.
  62. Sakon N, Sadamasu K, Shinkai T, Hamajima Y, Yoshitomi H, Matsushima Y, Takada R, Terasoma F, Nakamura A, Komano J, Nagasawa K, et al. Foodborne Outbreaks Caused by Human Norovirus GII.P17-GII.17-Contaminated Nori, Japan, 2017. Emerging Infectious Disease journal. 2018;24(5):920. doi: 10.3201/eid2405.171733.
  63. Slot E, Zaaijer HL, Molier M, Van den Hurk K, Prinsze F, Hogema BM. Meat consumption is a major risk factor for hepatitis E virus infection. PloS one. 2017;12(4):e0176414. doi: 10.1371/journal.pone.0176414.
  64. Smith DB, Izopet J, Nicot F, Simmonds P, Jameel S, Meng X-J, Norder H, Okamoto H, van der Poel WH, Reuter G. Update: proposed reference sequences for subtypes of hepatitis E virus (species Orthohepevirus A). Journal of General Virology. 2020:jgv001435. doi: 10.1099/jgv.0.001435.
  65. Smith I, Said B, Vaughan A, Haywood B, Ijaz S, Reynolds C, Brailsford S, Russell K, Morgan D. Case-Control Study of Risk Factors for Acquired Hepatitis E Virus Infections in Blood Donors, United Kingdom, 2018-2019. Emerging Infectious Diseases. 2021;27(6):1654–1661. doi: 10.3201/eid2706.203964.
  66. ThermoFisher Scientific. (NA) Multiple Primer Analyzer. 2020. Available at: https://www.thermofisher.com/ca/en/home/brands/thermo-scientific/molecular-biology/molecular-biology-learning-center/molecular-biology-resource-library/thermo-scientific-web-tools/multiple-primer-analyzer.html.
  67. VanderWaal K, Deen J. Global trends in infectious diseases of swine. Proceedings of the National Academy of Sciences. 2018;115(45):11495–11500. doi: 10.1073/pnas.1806068115.
  68. Verhoef L, Kouyos RD, Vennema H, Kroneman A, Siebenga J, van Pelt W, Koopmans M. An integrated approach to identifying international foodborne norovirus outbreaks. Emerg Infect Dis. 2011;17(3):412–8. doi: 10.3201/eid1703.100979.
  69. World Health Organization. Hepatitis E. World Health Organization; 2021. [Accessed: 08 September 2021]. Available at: https://www.who.int/news-room/fact-sheets/detail/hepatitis-e.

Supplementary Materials

Supplementary file 2
Supplementary file 6
Supplementary file 7
Supplementary file 8
Supplementary file 9
Supplementary file 10
Supplementary file 11
Supplementary file 12
Supplementary file 13
Supplementary file 14
Supplementary file 15

Data Availability Statement

See Online Resources for supplementary materials, including bioinformatics scripts. Sequences can be found in GenBank under accession numbers OQ913488 to OQ913500 and OQ918704 to OQ918713.
