Evaluation of Oxford Nanopore Technologies workflows for genomic epidemiology of outbreak-associated bacterial isolates in the clinical setting

Stefan Neuenschwander; Loïc Borcard; Sonja Gempeler; Miguel A Terrazos Miani; Carlo Casanova; Alban Ramette

doi:10.1099/mgen.0.001626

2026 Feb 10;12(2):001626.

PMCID: PMC12890258 PMID: 41665488

Abstract

Accurate and efficient whole-genome sequencing (WGS) is crucial for clinical diagnostics and surveillance of bacterial infections. Here, we investigate the potential of a new Oxford Nanopore Technologies (ONT) workflow for WGS of clinically relevant bacterial isolates. Specifically, we assess the performance of R10.4.1 flow cells in combination with the V14 version of the transposase-based (RBK) library preparation kit to provide rapid and accurate genomic epidemiological comparisons of bacterial species of clinical importance. We focused on retrospective collections of outbreak-associated Corynebacterium diphtheriae (CDIP) and vancomycin-resistant enterococci (VRE) and benchmarked expected performance parameters such as genome assembly quality, genotyping [multi-locus sequence typing (MLST) and core-genome multi-locus sequence typing (cgMLST)], SNP profiling and antimicrobial resistance and virulence prediction, against WGS data obtained routinely by Illumina MiSeq sequencing. Complete concordance with Illumina results was observed for MLST in both species and for cgMLST in CDIP, across all ONT kits and software evaluated. For VRE, however, cgMLST results varied with strain identity, library preparation kit and analysis parameters, likely because software has difficulty calling methylated bases correctly. Yet, the use of the latest basecalling models combined with a PCR-based library preparation kit (RPB) reliably reproduced Illumina cgMLST results across all tested VRE strains. By testing two hybrid strategies, we also showed that combining PCR-free and PCR-based library preparation approaches may be a promising strategy that achieves both high accuracy and assembly completeness. Genomic-based antimicrobial resistance (AMR) prediction was consistent across sequencing methods, and we further highlight advantages and limitations of the PCR-based, PCR-free and mixed assemblies for resolving the genomic context of AMR genes.
This study demonstrates that a Nanopore-only sequencing approach may offer improved accuracy and consistency for classical bacterial typing in outbreak investigations, paving the way to wider use in clinical microbiology laboratories.

Keywords: bacterial pathogens, core-genome multi-locus sequence typing (cgMLST), clinical setting, genome assembly, genomic epidemiology, outbreak, Oxford Nanopore Technologies, plasmids, R10.4.1

Impact Statement

This study demonstrates that an Oxford Nanopore Technologies-only sequencing approach improves the accuracy and consistency of bacterial typing in outbreak investigations, even for difficult cases such as vancomycin-resistant enterococci. This advancement paves the way for wider adoption of this technology in clinical microbiology laboratories, leading to more rapid and effective responses to bacterial infections.

Data Summary

Illumina MiSeq and Oxford Nanopore Technologies (ONT) sequencing data for the Illumina, SUPD43, SUPDP43 and SUPD50 treatments are available under BioProject accessions PRJNA889706 and PRJNA1230056. NCBI SRA accession numbers for the individual libraries are listed in Table S2. ONT datasets generated using earlier basecaller versions are available upon request. The bioinformatic correction script (Fig. 1) is available at https://github.com/RametteLab/NanoporeHybridKP.

Introduction

Whole-genome sequencing (WGS) has become the gold standard for bacterial strain typing, revolutionizing our ability to elucidate infection-epidemiological connections due to its high discriminatory power [1]. Consequently, WGS is increasingly employed in clinical settings to enhance outbreak detection, implement epidemiological surveillance and improve infection control, leveraging its unparalleled resolution [2,5]. Common approaches for assessing genetic relatedness from WGS data include gene-by-gene comparisons, such as core-genome multi-locus sequence typing (cgMLST) or whole-genome multi-locus sequence typing, and SNP analysis [6]. While SNP typing offers high resolution, it is computationally intensive and time-consuming [7,8]. In contrast, cgMLST typing provides a standardized nomenclature with lower computational demands [6]. Target-free, k-mer-based methods, like split k-mer analysis, are emerging but are not yet standard for transmission analysis [7,9]. Despite the widespread use of these methods, a standardized approach for interpreting WGS data in epidemiological investigations remains undefined [7].

Healthcare facilities have traditionally relied on short-read Illumina technology for WGS and genomic pathogen surveillance [10,11]. However, long-read next-generation sequencing (NGS), particularly provided by Oxford Nanopore Technologies (ONT) [12], is gaining traction. ONT sequencing offers rapid turnaround times, real-time data, long-read capabilities and lower capital costs, potentially reducing outbreak investigation times or enabling genomic surveillance in resource-limited settings [13,14]. Studies have shown promising results when using ONT sequencing for bacterial sequence typing and epidemiology [15,16], including largely consistent findings with Illumina for vancomycin-resistant enterococci (VRE) [17], Mycobacterium tuberculosis [18], Staphylococcus aureus [19] and methicillin-resistant S. aureus [20]. Several studies also demonstrate successful Nanopore-based typing in Salmonella [21,22], with one addressing homopolymer error reduction and combined Illumina/Nanopore analysis [22]. Smaller studies also report consistent results between Illumina and Nanopore for highly pathogenic bacteria [23], Escherichia coli [10], VRE [24], improved resolution of hospital VRE isolates [25] and Klebsiella pneumoniae strain typing [26].

Despite significant improvements in ONT technology, including higher throughput and lower error rates, and increasing use for bacterial genome assembly [27,28], challenges remain. These include the sensitivity of established typing methods like multi-locus sequence typing (MLST) [29,30], cgMLST or cgSNP [11] to single base errors potentially introduced by ONT sequencing. Consequently, there is a need to compare data generated with ONT to the more accurate Illumina sequencing for outbreak detection. For clinical diagnostics, where accuracy is key for correct treatment and control measures, these errors can be critical. This re-evaluation is particularly important given the recent improvement in ONT sequencing accuracy due to the development of newer flow cells and basecalling algorithms, making it increasingly reliable for clinical applications [23,31]. For instance, the R10.4.1 flow cell version has improved modal sequencing accuracy of 99.6% at the read level, which may be crucial for high-resolution genotyping necessary in clinical diagnostics. In addition, ONT transposase-based library preparation kits (e.g. RBK) now offer very fast turnaround time from isolates to sequencing results. Further, the possibility to combine PCR and PCR-free library preparation has not been fully assessed in terms of turnaround time, cost and effects on the genomic sequencing accuracy.

To validate ONT V14 kits for genomic epidemiology in outbreak settings, we performed a retrospective, comparative study using clinical isolates of Corynebacterium diphtheriae (CDIP) and VRE, which we obtained from our routine diagnostic laboratory. Beyond their clinical relevance, these isolates were selected because they were managed through a fully integrated workflow spanning isolation, storage, DNA extraction, sequencing and bioinformatic analysis conducted by the same expert team. This approach ensures our evaluations closely replicate real-world clinical diagnostic settings and methodological consistency. Recognizing that Illumina short-read sequencing may represent the accuracy gold standard, but may also be limited in resolving genomic structures, our goal was to determine if ONT long reads provide actionable data for outbreak decision-making. We focused on evaluating ONT performance in key areas: typing (MLST and cgMLST), antimicrobial resistance (AMR) and plasmid analysis in these two bacterial outbreak examples.

Methods

Sample origin, cultivation and DNA extraction

All CDIP isolates were associated with local outbreaks in Switzerland from July to September 2022 and were described in a previous study [32]. The corresponding Illumina MiSeq data are available as BioProject PRJNA889706. VRE isolates, associated with outbreaks in a Swiss hospital, were collected between 2017 and 2021, with some of them belonging to ST796 [33]. They were isolated on CHROMagar VRE plates (CHROMagar, Paris, France) during routine analysis by the clinical microbiology laboratory of the Institute for Infectious Diseases (IFIK), University of Bern. Species were identified by MALDI-TOF (Bruker Daltonics, Bremen, Germany). Susceptibility testing for all isolates was performed during routine clinical processing at IFIK. All isolates were stored at −80°C and re-grown on CSBA plates before performing genomic extraction, thus minimizing the risk of in vitro evolution. Genomic DNA was extracted from agarose plates using the PureLink Genomic DNA kit (Thermo Fisher, Switzerland) or the Maxwell RSC Cultured Cells DNA Kit (Promega, Switzerland).

Whole-genome sequencing

NGS libraries for Illumina sequencing were produced with the Nextera DNA Flex Library Prep Kit (Illumina, Switzerland), according to the manufacturer’s recommendations, and sequenced on an Illumina MiSeq sequencer with v2 reagents in 2×150 paired-end mode, at the Next Generation Sequencing Platform of the Institute for Infectious Diseases, Bern. Nanopore sequencing libraries were produced with the rapid barcoding chemistry SQK-RBK114.96 (RBK) and Rapid PCR barcoding chemistry SQK-RPB114.96 (RPB) according to the manufacturer’s recommendations. The libraries were loaded onto standard GridION flowcells (FLO-MIN114) and sequenced in batches of 21 samples for 12–72 h on GridION X5 sequencers with real-time basecalling and under high-accuracy mode (HAC). The raw signal files (fast5_pass/pod5_pass directories) were subsequently re-basecalled either on the GridION or with standalone basecallers on a separate Linux workstation. All software versions and basecaller models are provided in Table S1 (available in the online Supplementary Material).

Ethics statement

All CDIP and VRE isolates in the present study have been anonymized, and no patient information is used in the interpretation of the bacterial genomic data. Publication of this analysis does not harm or influence either patients or institutions. Ethical committee approval was, therefore, not requested.

Bioinformatic analyses

Genome assembly and polishing. All genomes were reconstructed with the same software versions and parameters, except for the assembly parameters and for the models used for genome polishing, which were adjusted to the basecalling mode (SUP or HAC), and the basecalling model, respectively (Fig. 2). All reads were filtered to remove sequences shorter than 500 bases and those with an average quality score below 10. In addition, 30 bases were trimmed from both the 5′ and 3′ ends of each read to remove lower quality bases (NanoFilt, version 2.8.0, parameters: -q 10 -l 500 --headcrop 30 --tailcrop 30) [34]. The remaining reads were assembled with Flye (version 2.9.2) [35], and polished with Medaka (version 1.11.3) [36]. Detailed information, including the sequencing and basecalling software, basecalling models, assembly parameters and Medaka models, can be found in the supplementary material (Table S1). To mask bases with ambiguous base compositions in the corresponding reads, selected assemblies were processed with the software MPOA [37].
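The read filtering criteria above can be illustrated with a minimal Python sketch. It is not NanoFilt's implementation: the function names are hypothetical, the mean quality is assumed to be computed from averaged error probabilities, and filtering is assumed to be applied to the untrimmed read.

```python
import math

MIN_LENGTH = 500   # corresponds to -l 500
MIN_QUALITY = 10   # corresponds to -q 10
HEAD_CROP = 30     # corresponds to --headcrop 30
TAIL_CROP = 30     # corresponds to --tailcrop 30


def mean_phred(quals):
    """Average Phred score, computed via the mean error probability (assumption)."""
    probs = [10 ** (-q / 10) for q in quals]
    return -10 * math.log10(sum(probs) / len(probs))


def filter_and_trim(seq, quals):
    """Return the trimmed (seq, quals) pair, or None if the read is discarded.

    Assumption: length and quality are checked on the untrimmed read.
    """
    if len(seq) < MIN_LENGTH or mean_phred(quals) < MIN_QUALITY:
        return None
    end = len(seq) - TAIL_CROP
    return seq[HEAD_CROP:end], quals[HEAD_CROP:end]
```

Under these assumptions, a 600-base read with uniform quality 20 is kept and shortened to 540 bases, whereas a 400-base read or a read with uniform quality 5 is discarded.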

Combining PCR-based and PCR-free Nanopore reads. We tested two different approaches to combine RBK and RPB data: (a) Read mixtures. Random subsets of the reads produced with RBK and RPB were combined at theoretical coverage depths of 70× and 30×, respectively, to obtain mixed datasets with total coverage depths of 100×. These read sets were processed as described above for the non-mixed datasets. (b) Assembly correction. Read sets produced with RBK and RPB were mapped separately against an assembly produced from the RBK reads with the software minimap2 (version 2.24) [38]. The resulting SAM files were sorted (samtools, version 1.15.1) [39] and further analysed with the software Pysamstats (version 1.1.2) [40] to obtain per-strand base compositions for each assembly position. Base composition statistics were then processed using a custom Python script as follows (Fig. 1): (1) We assigned a confidence level (high or low) to each base of the original RBK assembly, based on the underlying base compositions of the mapped reads. Assembly positions were considered as ‘low confidence’ if one or more of the following criteria applied: low read coverage (coverage depth below 20×), mixed base composition (fewer than 80% of the reads matching the most frequent base) or strand bias (different dominant bases between the forward and reverse strand). Otherwise, the positions were considered as ‘high confidence’. (2a) The RPB majority consensus base was obtained for each base position of the original assembly. (2b) We assigned a confidence level (high or low) to the RPB majority consensus bases, using the same criteria as in step 1. (3) Low confidence positions in the original assembly were replaced by the majority consensus of the RPB reads if the confidence was high for the latter, or by N if the confidence in the RPB read set was also low. Each ‘high confidence’ position of the original assembly was kept.
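The per-position decision rule of the assembly correction (steps 1 to 3) can be sketched as follows. This is an illustrative reimplementation, not the script from the repository cited in the Data Summary: the function names and the input layout (per-strand base counts as dictionaries) are hypothetical, and treating a strand without any reads as low confidence is an added assumption.

```python
BASES = "ACGT"
MIN_DEPTH = 20        # positions below 20x coverage depth are low confidence
MIN_MAJORITY = 0.80   # the most frequent base must reach 80% of the reads


def majority(counts):
    """Most frequent base in a {base: count} mapping."""
    return max(BASES, key=lambda b: counts.get(b, 0))


def assess(fwd, rev):
    """Return (majority_base, is_high_confidence) for one assembly position.

    fwd and rev are {base: count} mappings for forward and reverse reads.
    """
    total = {b: fwd.get(b, 0) + rev.get(b, 0) for b in BASES}
    depth = sum(total.values())
    top = majority(total)
    if depth < MIN_DEPTH:
        return top, False                      # low read coverage
    if total[top] / depth < MIN_MAJORITY:
        return top, False                      # mixed base composition
    # Assumption: a strand without any reads cannot confirm the call.
    if sum(fwd.values()) == 0 or sum(rev.values()) == 0:
        return top, False
    if majority(fwd) != majority(rev):
        return top, False                      # strand bias
    return top, True


def correct_base(assembly_base, rbk, rpb):
    """Apply step 3; rbk and rpb are (fwd, rev) count pairs for one position."""
    _, rbk_ok = assess(*rbk)
    if rbk_ok:
        return assembly_base                   # high confidence: keep as is
    rpb_base, rpb_ok = assess(*rpb)
    return rpb_base if rpb_ok else "N"         # otherwise RPB consensus or N
```

For example, a position with forward counts `{"A": 90}` and reverse counts `{"G": 8, "A": 2}` in the RBK reads passes the depth and majority checks but is flagged for strand bias. It is then replaced by the RPB consensus if that consensus is itself high confidence, and by N otherwise.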

Typing and comparison of the different treatments. All assemblies were imported into SeqSphere+ (v.8.4.0; Ridom GmbH, Germany) and processed with the cgMLST schemes Enterococcus faecium cgMLST v1.1 [41] and a custom task template encompassing 1,319 core genes for CDIP based on the Institute Pasteur scheme, as described in [32]. Comparison tables (containing allele IDs for each target gene and assembly) were created for the Nanopore and Illumina assemblies of each analysed species and exported for further processing with custom Python and R scripts.
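As an illustration of how exported comparison tables can be evaluated, the following Python sketch counts mismatching alleles and the percentage of detected targets for one pair of assemblies. The function name and the dictionary layout are hypothetical, and counting only targets called in both assemblies as comparable is an assumption; the sketch does not reproduce the actual scripts used in this study.

```python
def compare_profiles(reference, query):
    """Compare two cgMLST profiles given as {target_gene: allele_id or None}.

    Returns (mismatches, pct_detected): the number of targets called in both
    profiles with different allele IDs, and the percentage of the reference
    scheme's targets that received an allele call in the query profile.
    """
    # Assumption: targets missing in either profile are not counted as mismatches.
    shared = [g for g in reference
              if reference[g] is not None and query.get(g) is not None]
    mismatches = sum(1 for g in shared if reference[g] != query[g])
    detected = sum(1 for g in reference if query.get(g) is not None)
    pct_detected = 100 * detected / len(reference)
    return mismatches, pct_detected


# Toy profiles with made-up gene names and allele IDs.
illumina = {"geneA": 1, "geneB": 7, "geneC": 3, "geneD": None}
nanopore = {"geneA": 1, "geneB": 8, "geneC": None, "geneD": 2}
result = compare_profiles(illumina, nanopore)
```

In this toy case, only geneA and geneB are called in both profiles and one of them differs, while three of the four targets are detected in the Nanopore profile.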

AMR and plasmid analyses. Protein sequences were predicted with the software Prokka (version 1.14.6) [42] and used as input for AMRFinderPlus (version 3.11.2, database: 2023-08-08.2, parameters -p -g -n --annotation_format prokka --organism) [43]. For plasmid analysis, WGS reads from treatments ‘SUPD43’, ‘SUPDP43’ and ‘SUPD and P43’ were reassembled and analysed with the ‘long read’ module with Flye as assembler and default parameters (SeqSphere+, version 10.0.0). Comparison tables including the task template fields ‘Chromosome and plasmid Overview’, as well as a local plasmid database (created via ‘Create Task Template for Plasmid Mash Database’), were generated to facilitate data extraction from SeqSphere+. Assembly statistics (number of replicons, circularity) and plasmid-borne resistance were extracted from the resulting output tables and further analysed using custom Python scripts.

Results

VRE genomic epidemiology using Nanopore vs. Illumina WGS

We chose a representative and diverse collection of clinical VRE isolates, consisting of 6 different MLST sequence types, 3 clonal clusters and 12 different cgMLST complex types based on Illumina assemblies (Fig. 3a). In terms of target gene recovery, we observed near-identical performance between Illumina and the top-performing Nanopore WGS methods: Percentages of successfully detected cgMLST target genes in the assemblies were highest for Illumina (98.80%), closely followed by the PCR-based Nanopore approach SUPDP43 (98.77%) and the most recent PCR-free approach SUPD50 (98.71%). The lowest percentage was found for the HAC43.masked treatment (97.46%), which was still well above the quality threshold of 90% (Fig. 3b).

A near-perfect agreement was found between Illumina and the top-performing Nanopore methods for the individual cgMLST allele assignments. The numbers of mismatching alleles between the Illumina and Nanopore assemblies varied strongly between isolates, Nanopore software versions, basecalling models and the library preparation kits: For VRE21, for example, no mismatching alleles were detected, irrespective of the approach used to create the Nanopore assemblies, whereas 0 to 44 mismatching alleles were detected in the assemblies produced from isolate VRE01 (Fig. 3c). Noticeably, PCR-based library preparation outperformed PCR-free alternatives irrespective of the basecalling models used, allowing for a perfect replication of the allelic profiles obtained with Illumina in all but one case, with a maximum number of two mismatches (VRE18). Mixtures of PCR-based and PCR-free reads, as well as PCR-free assemblies that were corrected with PCR-based reads, performed on par with PCR-based reads in these aspects.

The PCR-free approach was also more sensitive to the analysis parameters: SUP models clearly outperformed HAC models, and the most recent iterations of the software (and models) outperformed previous versions. The best combination of parameters resulted in perfectly matching allele assignments in 14 out of 20 cases with a maximum of four mismatches (Fig. 3c). Subsequent masking with the software MPOA resulted in the exclusion of all but one of the previously mentioned mismatching alleles (SeqSphere+ excludes genes with N’s from the analysis), while the percentage of successfully detected cgMLST targets dropped by about 0.8 percentage points, remaining well above the threshold of 90% (SUPD43, 98.68%; SUP43.masked, 97.90%, Fig. S1A, B).

MLST classification into sequence type (ST) was successful across all tested combinations that did not involve masking with MPOA (Fig. S1D). The latter resulted in two unassigned STs in the treatment HAC43.masked (Fig. S1D), whereas treatment SUP43.masked did not result in a missing ST assignment. cgMLST classification into complex type (CT) was more sensitive to the variability introduced by the different treatments: The PCR-based treatments SUPDP43 and HACP43 matched the Illumina CT in all cases and in 19 out of 20 cases, respectively. The single CT 1552 (instead of CT 2887) that differed between the PCR-based treatments was a result of one target gene (EFAU004_01770, hypothetical protein), which was exclusively found in SUPDP43 in otherwise identical allelic profiles (Fig. S1D). The mixtures of PCR-based and PCR-free reads matched the Illumina CT in all cases and the PCR-free assemblies that were corrected with PCR-based reads in 19 out of 20 cases, respectively. The single mismatch was caused by the absence of the same target gene, as described above for the PCR-based approaches. Among the PCR-free treatments, SUPD50 performed the best with all 20 correct CTs assigned, followed by SUPD43 with 17/20 correct assignments (Fig. 3c). Subsequent MPOA masking (SUP43.masked) corrected two of the remaining mis-assigned CTs, but another CT was assigned for VRE10, while no CT was assigned for it based on the Illumina assembly (Fig. S1).

Noticeably, a large proportion of the mismatching bases between Illumina and Nanopore assemblies were found in assembly positions with conflicting information between the forward and reverse reads (Fig. 4a). This phenomenon is referred to as strand bias (SB) in subsequent sections. We analysed the frequency of SB as a function of its genomic context and MLST types (Fig. 4b). In our dataset, SB was more often observed on plasmids than on chromosomes. No distinct pattern related to ST was identified; instead, SB levels appeared to be strain-specific rather than ST-specific (Fig. 4b).

VRE-predicted AMR and plasmids

To evaluate whether ONT assemblies are suitable for AMR analysis, we applied AMRFinderPlus to the Illumina and ONT assemblies: On the class level, predictions for the Illumina and ONT-based assemblies were matched in all cases (Fig. 5a). On the antibiotic subclass level, discrepancies were found for one isolate (VRE_02), for which gentamicin and tobramycin resistances were exclusively predicted for the ONT assemblies (Fig. 5b). On the gene level, the bifunctional aminoglycoside-modifying enzyme AAC(6′)-Ie–APH(2′′)-Ia, which confers resistance to all commercially available aminoglycosides except streptomycin [44], was exclusively detected in the ONT assemblies in five cases (Fig. 5c). However, in all but one isolate (VRE_02), the C-terminal domain of AAC(6′)-Ie–APH(2′′)-Ia was detected in the Illumina and ONT assemblies, explaining the matching resistance predictions (Fig. 6a, b).

Different Nanopore approaches (PCR-free, PCR-based and mixtures of both) were compared with respect to their performance for plasmid analysis. PCR-free and mixed Nanopore methods assembled chromosomes better than the PCR-based method, whose assemblies showed a higher degree of chromosome fragmentation. Plasmid assembly performance did not differ significantly between the tested methods (Fig. 6).

Discrepancies in the predicted genomic context of resistance genes were identified in 4 out of the 19 isolates when comparing the tested Nanopore approaches: Tetracycline resistance in VRE02, along with gentamicin and tobramycin resistance in VRE16, was determined to be plasmid-mediated in PCR-free and mixed assemblies, but not in PCR-based assemblies. Conversely, amikacin, gentamicin, kanamycin and tobramycin resistance in VRE14 were identified as plasmid-mediated only in the PCR-based assemblies. Vancomycin resistance was predicted to be plasmid-mediated in PCR-based and mixed assemblies of VRE20, but not in the PCR-free variant (Fig. 7).

CDIP genomic epidemiology using ONT vs. Illumina WGS

Similarly, a high level of diversity was expected in the chosen collection of clinical CDIP isolates with six different MLST sequence types and five clonal clusters based on Illumina assemblies (Fig. 8a). Percentages of successfully detected cgMLST target genes in the assemblies were highest for Illumina (94.96% on average), but on par with the Nanopore approach SUPD50 (94.38% on average) (Fig. 8b). Masking of ambiguous base positions reduced the number of successfully detected cgMLST target genes by 1.1 and 1.5% in the treatments SUPD43.masked and HAC43.masked as compared to their unmasked counterparts SUPD43 and HAC43, respectively (Fig. S1). Between Illumina and all tested Nanopore approaches, individual alleles were assigned in perfect agreement for MLST (Fig. S2C), and with very high agreement for cgMLST, with a maximum of one differing allele per Nanopore approach (Fig. 8c). The AMR predictions for CDIP showed complete concordance between Illumina and ONT assemblies at the class level (Fig. S3A). However, a single discrepancy was observed at the sub-class and gene levels for two isolates (CDIP_05 and CDIP_18). Specifically, the aminoglycoside phosphotransferase gene aph(3′)-Ia, conferring kanamycin resistance, was absent in the PCR-based ONT assemblies of both isolates (Fig. S3B, C).

Time and cost estimation

The ONT approaches used in this study are competitive with Illumina in terms of hands-on time, overall turnaround time and costs per sample at the tested throughput levels (Table 1). For example, preparing 20 samples takes only 0.7 h (RBK) or 1 h (RPB) of hands-on time, considerably less than the 1.8 h required for Illumina MiSeq. The overall turnaround time is also notably faster for smaller batches: For four samples, ONT (RBK or RPB) finishes sequencing in 5 h, while Illumina requires 24 h. There is a slight cost increase when combining ONT RBK with RPB workflows, as the latter involves the use of long-range PCR in addition to the transposase library preparation as in the RBK protocol. Most critically, the cost reduction is substantial: For batches of 20 samples, the cost is estimated at $310 (RBK) or $400 (RPB), compared to $2,340 for the MiSeq run. This corresponds to a cost per sample of $117 for MiSeq vs. $36 for the combination of RBK and RPB on the same sample, i.e. a 3.25-fold cost reduction. This cost reduction is possible because ONT flow cells can be washed with a DNase treatment and the sequencing run can be stopped early once enough reads have been obtained, whereas the Illumina-based approach entails fixed costs and run durations (Table 1).
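The per-sample figures and the fold reduction quoted above follow directly from the batch costs for 20 samples. A short Python check, using only the numbers given in the text and Table 1 (the variable names are illustrative):

```python
batch_cost = {"MiSeq": 2340, "RBK": 310, "RPB": 400}  # USD per batch of 20 samples
n_samples = 20

per_sample = {k: v / n_samples for k, v in batch_cost.items()}
# Table 1 reports whole dollars: 117, 16 (from 15.5) and 20.
rounded = {k: int(v + 0.5) for k, v in per_sample.items()}

combined_ont = rounded["RBK"] + rounded["RPB"]    # RBK and RPB on the same sample
fold_reduction = rounded["MiSeq"] / combined_ont  # 117 / 36
```

With the rounded per-sample costs, the combined ONT cost is $36 and the ratio is 3.25; using the unrounded $15.50 for RBK would give a ratio of about 3.3.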

Table 1. Turnaround time and cost estimates.

	Illumina MiSeq	ONT RBK*	ONT RPB†
Library preparation (hands-on/total)
n=4 samples	1.5/2.7 h	0.5/1.0 h	0.7/3.0 h
n=20 samples	1.8/2.7 h	0.7/1.2 h	1.0/3.3 h
Sequencing time
n=4 samples	24 h	5 h	5 h
n=20 samples	24 h	24 h	24 h
Cost estimates per batch (per sample)‡
n=4 samples	$468 ($117)	$200 ($50)	$300 ($75)
n=20 samples	$2,340 ($117)	$310 ($16)	$400 ($20)

*ONT transposase-based library preparation (RBK).

†ONT transposase-based followed by PCR-based library preparation (RPB).

‡Costs are estimated at $117 per sample for MiSeq based on 20 isolates per run, excluding personnel costs. For ONT runs, we hypothesized conservatively that 24 h and 5 h runs correspond to half and a fourth of the maximal lifespan of a flow cell, whose total cost is estimated at $500. The costs are indicative and may vary depending on the country and specific contract with the involved companies.

Discussion

We assessed the recovery of cgMLST target genes, allele assignments, MLST and cgMLST classification, prediction of AMR genes and plasmid content using various Nanopore library preparation methods (PCR-based, PCR-free and mixtures) and analysis pipelines. We evaluated a diverse, clinically relevant collection of outbreak isolates, originating from local outbreaks, which were classified as different MLST sequence types and clonal clusters based on Illumina WGS assemblies. Our main findings indicate that MLST classification was generally successful across Nanopore approaches, while cgMLST CT classification showed more sensitivity to the different Nanopore approaches. We also explored the suitability of Nanopore assemblies for AMR analysis, revealing generally concordant results with Illumina at the class level for AMR but some discrepancies at the subclass and gene levels. The nanopore workflows that performed best in cgMLST were further compared with respect to their ability to predict the genomic context of resistance genes using the SeqSphere+ Long read Data Plasmid Transmission Analysis Module. This analysis yielded largely consistent results, with only a few discrepancies. We selected this approach because it provides a straightforward framework for plasmid characterization and demonstrated higher consistency than a plasmid analysis in SeqSphere+ based on the preassembled contigs (Figs S4 and S5). Finally, the study provided turnaround time and cost estimation, suggesting that the tested Nanopore approaches are competitive with those offered by short-read NGS approaches. 
With more than 550 resulting assemblies analysed with MLST and cgMLST to determine clonal complexes and the accuracy of the epidemiological conclusions, we highlighted how the type of WGS data produced, the software running on Nanopore sequencers and the parameters selected by the operator can have a major influence on the quality of the resulting sequencing reads and on the epidemiological conclusions derived from the WGS data. Our results have practical consequences for genomic studies that aggregate sequencing data originating from different laboratories using different technologies or software versions.

While Nanopore demonstrated near-identical performance to Illumina for cgMLST target gene recovery and MLST typing, some discrepancies were observed in cgMLST allele assignments and AMR gene detection depending on the specific Nanopore approach used. Notably, PCR-based Nanopore library preparation generally outperformed PCR-free methods for allelic profiling in VRE. We concluded that optimized Nanopore sequencing and analysis pipelines offer a competitive alternative to Illumina for genomic surveillance of these pathogens in terms of accuracy, time and cost. Complete concordance with Illumina results was observed for MLST in both tested species and for cgMLST in CDIP across all ONT kits and software evaluated. In contrast, the accuracy of cgMLST results for VRE varied based on the strain, library preparation kit and analysis parameters, likely due to challenges in resolving base methylations. The latest software (≥Dorado 0.5.0) and basecalling model versions (≥v4.3), combined with the PCR-based library preparation kit (RPB), reliably reproduced Illumina cgMLST results across all tested VRE strains.

Previous studies assessing ONT performance for accurate genome reconstruction often did not use the latest generation of flow cells, sequencing kits and software. For instance, Foster-Nyarko et al. [45] found R9.4.1/V10 chemistry reliable for MLST and AMR in K. pneumoniae, but cgMLST/SNP cluster detection remained challenging. Greig et al. [10] noted that 95% of the nucleotide discrepancies between Illumina and ONT data for two isolates of Shiga toxin–producing E. coli (STEC) O157:H7 could be resolved by masking methylated and prophage regions in ONT data. More recent R10.4/V12 studies showed mixed results: [13] found V12 chemistry accuracy promising but throughput insufficient for large-scale studies. [23] found high-resolution cgMLST/cgSNP genotyping feasible for F. tularensis and B. anthracis, but not B. suis. [28] concluded R10.4 cells yield ‘near-finished’ bacterial genomes without short-read polishing, though they did not test high-resolution typing. Conversely, in [14], V12 chemistry accuracy was found to be sufficient for Bordetella pertussis clinical epidemiology via cgMLST. Despite V12’s accuracy improvements, especially for homopolymer errors, previous generations were limited by reduced data throughput. The latest R10.4.1/V14 chemistry was evaluated by [37] in a K. pneumoniae outbreak using cgMLST, concluding that ONT sequencing caused considerable base errors leading to incorrect exclusions in outbreak tracing. They suggested mitigating these methylation-related errors through PCR-based library prep or masking ambiguous base positions.

Our study thus expands the knowledge on ONT performance for WGS-based bacterial typing by applying a similar comparative framework to two sets of well-characterized outbreak strains from the highly relevant species CDIP and VRE, both with native and PCR-based library preparation kits. In contrast to previous studies, we put a strong focus on the influence of different sequencing and basecalling software on the accuracy of downstream analyses such as cgMLST and AMR gene contextual analyses. Thus, we tested multiple software versions released in 2023 and 2024, in combination with different parameters such as different versions of the GridION software, varying sampling rate of the signal acquisition (4 kHz and 5 kHz), basecaller software (Guppy and Dorado), basecaller versions and basecalling models. All sequencing runs were conducted with real-time basecalling in HAC mode. The acquired signals were later re-basecalled in HAC and SUP mode, with standalone versions of the basecallers Guppy and Dorado, and different basecalling models. Our study thus reflects and integrates in its evaluation the rapid evolution of the technology over the past 2 years, including both wet laboratory and software changes that initially reduced the accuracy of the tested applications. We show the significant influence such an update may have on ONT datasets when processing native DNA libraries. A further strength of our study is that we also provided estimates of both turnaround time and associated costs to offer a comprehensive perspective on implementation considerations.

Our study also identified several weaknesses and areas where ONT sequencing showed variability or discrepancies compared to Illumina. The number of mismatching alleles between Illumina and Nanopore assemblies varied strongly depending on the isolate, ONT software versions, basecalling models and library preparation kits. PCR-free library preparation was found to be more sensitive to analysis parameters than PCR-based methods. Masking of ambiguous base positions with MPOA could lead to unassigned STs and a reduction in successfully detected cgMLST target genes. Discrepancies were observed in AMR predictions at the antibiotic subclass and gene levels for certain isolates. Furthermore, the predicted genomic context of resistance genes (chromosomal vs. plasmid-mediated) showed discrepancies between different Nanopore approaches in some cases. We cannot exclude that optimizations in the plasmid reconstruction procedure could improve the consistency between different Nanopore approaches [46], but this was outside the scope of the study.
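To make the notion of mismatching alleles concrete, a minimal Python sketch of a pairwise cgMLST profile comparison is given here. The profile layout (a mapping from locus name to allele number, with `None` for loci that were not called), the locus names and the function name are hypothetical and chosen for illustration; established cgMLST software may handle missing loci differently.

```python
def compare_profiles(reference, query):
    """Compare two cgMLST profiles given as {locus: allele number or None}.

    Loci that are absent or uncalled (None) in either profile are reported as
    missing and excluded from the mismatch count, so that a failed allele call
    is not mistaken for a true allele difference.
    """
    compared = 0
    mismatching = []
    missing = []
    for locus in sorted(set(reference) | set(query)):
        ref_allele = reference.get(locus)
        query_allele = query.get(locus)
        if ref_allele is None or query_allele is None:
            missing.append(locus)
            continue
        compared += 1
        if ref_allele != query_allele:
            mismatching.append(locus)
    return {"compared": compared, "mismatching": mismatching, "missing": missing}


# Toy profiles for one isolate (allele numbers are invented).
illumina = {"locus1": 12, "locus2": 3, "locus3": 7, "locus4": None}
nanopore = {"locus1": 12, "locus2": 5, "locus3": None, "locus4": 1}
result = compare_profiles(illumina, nanopore)
```

Reporting missing loci separately matters in this context because masking ambiguous positions can turn a wrong allele call into an uncalled locus, which lowers the mismatch count while also lowering the number of comparable targets.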

Despite its strengths, the PCR-based ONT kit exhibited certain drawbacks, such as a shorter read length, ultimately leading to more fragmented chromosome assemblies. To mitigate this, we tested two hybrid strategies combining PCR-free and PCR-based Nanopore approaches and demonstrated promising results. These combined methods retained the accuracy of the PCR-based approach while achieving assembly performance comparable to PCR-free methods. AMR prediction at the genome level was consistent across all tested sequencing methods; however, resolving the genomic context of AMR genes remains a complex task [47]. Although ONT long-read sequencing may offer significant improvements in determining genomic context compared to short-read technologies, we did observe inconsistencies in the predicted genomic context of selected AMR genes when comparing datasets produced from PCR-based, PCR-free and mixed Nanopore reads. As we chose a retrospective collection of outbreak-related isolates for in-depth comparative analyses, future work would need to provide a prospective evaluation to assess the performance of the new technology in actual clinical or epidemiological settings, thus offering a more accurate reflection of real-world applicability. Also, we limited ourselves to local outbreaks, for which the source of material and reference short-read data were controlled. It remains open how international collections and inter-laboratory comparisons would perform in such a comparative framework.

The results of this study suggest that Nanopore WGS is a promising technology for genomic epidemiology of outbreak bacterial species, such as VRE and CDIP, demonstrating comparable performance to Illumina in key areas such as cgMLST target gene recovery and allele assignment. The strengths in chromosome assembly using PCR-free methods and the competitive time and cost further support this potential. The observed variability related to software and parameter specification and the discrepancies in AMR and plasmid analysis highlight the need for continued optimization of Nanopore sequencing protocols, basecalling algorithms and analysis pipelines to improve accuracy and reliability in genomic epidemiological applications. Future research could focus on refining PCR-free methods to reduce their sensitivity to analysis parameters, developing more robust methods for resolving ambiguous base positions and improving the accuracy of AMR and plasmid prediction from Nanopore assemblies. As the technology continues to evolve, Nanopore sequencing has the potential to serve as a valuable alternative or complementary approach to short-read technologies for comprehensive genomic characterization in epidemiological investigations.

Supplementary material

Uncited Supplementary Material 1.

mgen-12-01626-s001.pdf (2.4 MB, PDF)

DOI: 10.1099/mgen.0.001626

Acknowledgements

The authors thank the diagnostic department of the Institute for Infectious Diseases, University of Bern, for collecting and storing the clinical isolates.

Abbreviations

AMR: antimicrobial resistance
CDIP: Corynebacterium diphtheriae
CT: complex type
HAC: high-accuracy mode
IFIK: Institute for Infectious Diseases
MLST: multi-locus sequence typing
NGS: next-generation sequencing
ONT: Oxford Nanopore Technologies
SB: strand bias
ST: sequence type
VRE: vancomycin-resistant enterococci
WGS: whole-genome sequencing

Footnotes

Funding: AR received funding from the Institute for Infectious Diseases and the Multidisciplinary Center for Infectious Diseases, Bern, Switzerland, to support this study.

Author contributions: Conceptualization: S.N. and A.R. Data curation: S.N., S.G. and M.A.T.M. Formal analysis: S.N. and A.R. Investigation: S.N., L.B., S.G., M.A.T.M. and A.R. Methodology: S.N., M.A.T.M. and A.R. Funding acquisition: A.R. Project administration: A.R. Resources: C.C. and A.R. Software development: S.N., L.B. and A.R. Supervision: A.R. Validation: S.N., S.G., M.A.T.M., C.C. and A.R. Visualization: S.N. and A.R. Writing – original draft: S.N., M.A.T.M. and A.R. Writing – review and editing: S.N., L.B., S.G., M.A.T.M., C.C. and A.R. All authors have read and agreed to the published version of the manuscript.

Contributor Information

Stefan Neuenschwander, Email: stefan.neuenschwander@unibe.ch.

Loïc Borcard, Email: loic.borcard@unibe.ch.

Sonja Gempeler, Email: sonja.b.gempeler@unibe.ch.

Miguel A. Terrazos Miani, Email: miguel.terrazos@unibe.ch.

Carlo Casanova, Email: carlo.casanova@unibe.ch.

Alban Ramette, Email: alban.ramette@ifik.unibe.ch.

References

1. Van Goethem N, Descamps T, Devleesschauwer B, Roosens NHC, Boon NAM, et al. Status and potential of bacterial genomics for public health practice: a scoping review. Implement Sci. 2019;14:79. doi: 10.1186/s13012-019-0930-2.
2. Didelot X, Bowden R, Wilson DJ, Peto TEA, Crook DW. Transforming clinical microbiology with bacterial genome sequencing. Nat Rev Genet. 2012;13:601–612. doi: 10.1038/nrg3226.
3. Price V, Ngwira LG, Lewis JM, Baker KS, Peacock SJ, et al. A systematic review of economic evaluations of whole-genome sequencing for the surveillance of bacterial pathogens. Microb Genom. 2023;9:000947. doi: 10.1099/mgen.0.000947.
4. Schadron T, van den Beld M, Mughini-Gras L, Franz E. Use of whole genome sequencing for surveillance and control of foodborne diseases: status quo and quo vadis. Front Microbiol. 2024;15. doi: 10.3389/fmicb.2024.1460335.
5. Simar SR, Hanson BM, Arias CA. Techniques in bacterial strain typing: past, present, and future. Curr Opin Infect Dis. 2021;34:339–345. doi: 10.1097/QCO.0000000000000743.
6. Deurenberg RH, Bathoorn E, Chlebowicz MA, Couto N, Ferdous M, et al. Application of next generation sequencing in clinical microbiology and infection prevention. J Biotechnol. 2017;243:16–24. doi: 10.1016/j.jbiotec.2016.12.022.
7. Higgs C, Sherry NL, Seemann T, Horan K, Walpola H, et al. Optimising genomic approaches for identifying vancomycin-resistant Enterococcus faecium transmission in healthcare settings. Nat Commun. 2022;13:509. doi: 10.1038/s41467-022-28156-4.
8. Nadon C, Van Walle I, Gerner-Smidt P, Campos J, Chinen I, et al. PulseNet International: vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance. Euro Surveill. 2017;22:30544. doi: 10.2807/1560-7917.ES.2017.22.23.30544.
9. Harris SR. SKA: Split Kmer Analysis Toolkit for bacterial genomic epidemiology. bioRxiv. 2018. doi: 10.1101/453142.
10. Greig DR, Jenkins C, Gharbia S, Dallman TJ. Comparison of single-nucleotide variants identified by Illumina and Oxford Nanopore technologies in the context of a potential outbreak of Shiga toxin-producing Escherichia coli. Gigascience. 2019;8:giz104. doi: 10.1093/gigascience/giz104.
11. Schürch AC, Arredondo-Alonso S, Willems RJL, Goering RV. Whole genome sequencing options for bacterial strain typing and epidemiologic analysis based on single nucleotide polymorphism versus gene-by-gene-based approaches. Clin Microbiol Infect. 2018;24:350–354. doi: 10.1016/j.cmi.2017.12.016.
12. Deamer D, Akeson M, Branton D. Three decades of nanopore sequencing. Nat Biotechnol. 2016;34:518–524. doi: 10.1038/nbt.3423.
13. Sanderson ND, Kapel N, Rodger G, Webster H, Lipworth S, et al. Comparison of R9.4.1/Kit10 and R10/Kit12 Oxford Nanopore flowcells and chemistries in bacterial genome reconstruction. Microb Genom. 2023;9:mgen000910. doi: 10.1099/mgen.0.000910.
14. Wagner GE, Dabernig-Heinz J, Lipp M, Cabal A, Simantzik J, et al. Real-time Nanopore Q20+ sequencing enables extremely fast and accurate core genome MLST typing and democratizes access to high-resolution bacterial pathogen surveillance. J Clin Microbiol. 2023;61:e0163122. doi: 10.1128/jcm.01631-22.
15. Ahrenfeldt J, Skaarup C, Hasman H, Pedersen AG, Aarestrup FM, et al. Bacterial whole genome-based phylogeny: construction of a new benchmarking dataset and assessment of some existing methods. BMC Genomics. 2017;18:19. doi: 10.1186/s12864-016-3407-6.
16. Magi A, Semeraro R, Mingrino A, Giusti B, D’Aurizio R. Nanopore sequencing data analysis: state of the art, applications and challenges. Brief Bioinform. 2018;19:1256–1272. doi: 10.1093/bib/bbx062.
17. Oh S, Nam SK, Chang HE, Park KU. Comparative analysis of short- and long-read sequencing of vancomycin-resistant Enterococci for application to molecular epidemiology. Front Cell Infect Microbiol. 2022;12:857801. doi: 10.3389/fcimb.2022.857801.
18. Hall MB, Rabodoarivelo MS, Koch A, Dippenaar A, George S, et al. Evaluation of Nanopore sequencing for Mycobacterium tuberculosis drug susceptibility testing and outbreak investigation: a genomic analysis. Lancet Microbe. 2023;4:e84–e92. doi: 10.1016/S2666-5247(22)00301-9.
19. Liao Y-C, Wu H-C, Liou C-H, Lauderdale T-LY, Huang I-W, et al. Rapid and routine molecular typing using multiplex polymerase chain reaction and MinION sequencer. Front Microbiol. 2022;13:875347. doi: 10.3389/fmicb.2022.875347.
20. Ferreira FA, Helmersen K, Visnovska T, Jørgensen SB, Aamot HV. Rapid nanopore-based DNA sequencing protocol of antibiotic-resistant bacteria for use in surveillance and outbreak investigation. Microb Genom. 2021;7:000557. doi: 10.1099/mgen.0.000557.
21. Wu X, Luo H, Ge C, Xu F, Deng X, et al. Evaluation of multiplex nanopore sequencing for Salmonella serotype prediction and antimicrobial resistance gene and virulence gene detection. Front Microbiol. 2022;13:1073057. doi: 10.3389/fmicb.2022.1073057.
22. Xian Z, Li S, Mann DA, Huang Y, Xu F, et al. Subtyping evaluation of Salmonella Enteritidis using single nucleotide polymorphism and core genome multilocus sequence typing with Nanopore reads. Appl Environ Microbiol. 2022;88:e0078522. doi: 10.1128/aem.00785-22.
23. Linde J, Brangsch H, Hölzer M, Thomas C, Elschner MC, et al. Comparison of Illumina and Oxford Nanopore Technology for genome analysis of Francisella tularensis, Bacillus anthracis, and Brucella suis. BMC Genomics. 2023;24:258. doi: 10.1186/s12864-023-09343-z.
24. Tarumoto N, Sakai J, Sujino K, Yamaguchi T, Ohta M, et al. Use of the Oxford Nanopore MinION sequencer for MLST genotyping of vancomycin-resistant enterococci. J Hosp Infect. 2017;96:296–298. doi: 10.1016/j.jhin.2017.02.020.
25. Both A, Kruse F, Mirwald N, Franke G, Christner M, et al. Population dynamics in colonizing vancomycin-resistant Enterococcus faecium isolated from immunosuppressed patients. J Glob Antimicrob Resist. 2022;28:267–273. doi: 10.1016/j.jgar.2022.01.027.
26. Cao MD, Ganesamoorthy D, Elliott AG, Zhang H, Cooper MA, et al. Streaming algorithms for identification of pathogens and antibiotic resistance potential from real-time MinION(TM) sequencing. Gigascience. 2016;5:32. doi: 10.1186/s13742-016-0137-2.
27. Dilthey AT, Meyer SA, Kaasch AJ. Ultraplexing: increasing the efficiency of long-read sequencing for hybrid assembly with k-mer-based multiplexing. Genome Biol. 2020;21:68. doi: 10.1186/s13059-020-01974-9.
28. Sereika M, Kirkegaard RH, Karst SM, Michaelsen TY, Sørensen EA, et al. Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing. Nat Methods. 2022;19:823–826. doi: 10.1038/s41592-022-01539-7.
29. Maiden MC, Bygraves JA, Feil E, Morelli G, Russell JE, et al. Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc Natl Acad Sci USA. 1998;95:3140–3145. doi: 10.1073/pnas.95.6.3140.
30. Maiden MCJ, Jansen van Rensburg MJ, Bray JE, Earle SG, Ford SA, et al. MLST revisited: the gene-by-gene approach to bacterial genomics. Nat Rev Microbiol. 2013;11:728–736. doi: 10.1038/nrmicro3093.
31. Delahaye C, Nicolas J. Sequencing DNA with nanopores: troubles and biases. PLoS One. 2021;16:e0257521. doi: 10.1371/journal.pone.0257521.
32. Kofler J, Ramette A, Iseli P, Stauber L, Fichtner J, et al. Ongoing toxin-positive diphtheria outbreaks in a federal asylum centre in Switzerland, analysis July to September 2022. Euro Surveill. 2022;27:2200811. doi: 10.2807/1560-7917.ES.2022.27.44.2200811.
33. Piezzi V, Wassilew N, Atkinson A, D’Incau S, Kaspar T, et al. Nosocomial outbreak of vancomycin-resistant Enterococcus faecium (VRE) ST796, Switzerland, 2017 to 2020. Euro Surveill. 2022;27:2200285. doi: 10.2807/1560-7917.ES.2022.27.48.2200285.
34. De Coster W, D’Hert S, Schultz DT, Cruts M, Van Broeckhoven C. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics. 2018;34:2666–2669. doi: 10.1093/bioinformatics/bty149.
35. Kolmogorov M, Yuan J, Lin Y, Pevzner PA. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol. 2019;37:540–546. doi: 10.1038/s41587-019-0072-8.
36. Chris W. 2024. Medaka: sequence correction provided by ONT research. https://github.com/nanoporetech/medaka
37. Lohde M, Wagner GE, Dabernig-Heinz J, Viehweger A, Braun SD, et al. Accurate bacterial outbreak tracing with Oxford Nanopore sequencing and reduction of methylation-induced errors. Genome Res. 2024;34:2039–2047. doi: 10.1101/gr.278848.123.
38. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191.
39. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, et al. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10:giab008. doi: 10.1093/gigascience/giab008.
40. Miles A. 2024. Pysamstats. GitHub. https://github.com/alimanfoo/pysamstats
41. de Been M, Pinholt M, Top J, Bletz S, Mellmann A, et al. Core genome multilocus sequence typing scheme for high-resolution typing of Enterococcus faecium. J Clin Microbiol. 2015;53:3788–3797. doi: 10.1128/JCM.01946-15.
42. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–2069. doi: 10.1093/bioinformatics/btu153.
43. Feldgarden M, Brover V, Gonzalez-Escalona N, Frye JG, Haendiges J, et al. AMRFinderPlus and the Reference Gene Catalog facilitate examination of the genomic links among antimicrobial resistance, stress response, and virulence. Sci Rep. 2021;11:12728. doi: 10.1038/s41598-021-91456-0.
44. Arias CA, Murray BE. The rise of the Enterococcus: beyond vancomycin resistance. Nat Rev Microbiol. 2012;10:266–278. doi: 10.1038/nrmicro2761.
45. Foster-Nyarko E, Cottingham H, Wick RR, Judd LM, Lam MMC, et al. Nanopore-only assemblies for genomic surveillance of the global priority drug-resistant pathogen, Klebsiella pneumoniae. Microb Genom. 2023;9:mgen000936. doi: 10.1099/mgen.0.000936.
46. Bouras G, Sheppard AE, Mallawaarachchi V, Vreugde S. Plassembler: an automated bacterial plasmid assembly tool. Bioinformatics. 2023;39:btad409. doi: 10.1093/bioinformatics/btad409.
47. Djordjevic SP, Jarocki VM, Seemann T, Cummins ML, Watt AE, et al. Genomic surveillance for antimicrobial resistance - a One Health perspective. Nat Rev Genet. 2024;25:142–157. doi: 10.1038/s41576-023-00649-y.
