Skip to main content
PLOS Neglected Tropical Diseases logoLink to PLOS Neglected Tropical Diseases
. 2026 Jan 21;20(1):e0013931. doi: 10.1371/journal.pntd.0013931

Genomic characterization of a severe West Nile Virus transmission season using a single reaction amplicon sequencing approach

Shawn Freed Jr 1,2,#, Sarah Chandler 3,#, Sarah Uhm 1,#, Zach Pella 1,2, Dikchha Gurung 1, Hallie Smith 4, Tammy Dowdy 4, Amanda M Bartling 5, Ava Butz 1, Michael R Wiley 5,6, M Jana Broadhurst 6, Sydney Stein 4, Emily L McCutchen 5, Jeff Hamik 4, Peter C Iwen 5,6, Nick Downey 7, Kaylee S Herzog 1, Joseph R Fauver 1,*
Editor: Doug E Brackney8
PMCID: PMC12844533  PMID: 41564144

Abstract

West Nile virus (WNV) is an endemic arthropod-borne virus that has routinely caused seasonal outbreaks in the United States since it was first detected in 1999. While phylogenetic studies have shown how WNV has diversified and undergone genotype replacement since introduction, more geographically focused studies are needed to understand intricate transmission dynamics at local and regional scales. In this study, we validate the IDT xGen WNV panel, a novel single reaction amplicon-based Next-Generation Sequencing approach, to generate high-quality WNV genomes and compare it to the “Primal Scheme” assay for WNV, a common amplicon sequencing strategy. By generating >250 genomes from mosquito pools, we show that the IDT xGen WNV panel generated coding-complete and accurate WNV genomes when compared to the current sequencing approaches. Additionally, we used this approach to generate 100 coding-complete WNV genomes from surveillance pools of mosquitoes collected in Nebraska during the 2023 outbreak. Our discrete phylogeographic analysis revealed substantial genetic diversity in WNV genomes from 2023 with minimal clustering across the state. This study demonstrated the utility of a single reaction amplicon-based sequencing approach to generate quality WNV genomes from routine surveillance samples and characterize WNV transmission dynamics in a high-incidence setting.

Author summary

West Nile virus (WNV) remains the most impactful arthropod-borne virus in the United States. Genomic epidemiology of WNV can inform our understanding of virus emergence and transmission, ultimately informing mosquito surveillance and control programs. In this study, we validated a simple, single-reaction amplicon sequencing approach by generating >250 coding-complete WNV genomes from WNV positive pools of mosquitoes. We then characterized the 2023 transmission in Nebraska, a high-incidence state, and showed that the western portion of the state saw early and intense WNV transmission, as evidenced by high infection rates in both mosquitoes and humans. We showed there was significant genetic diversity in WNV genotypes with minimal clustering of WNV genomes across the state, indicating the early transmission initiation in the western portion did not seed transmission in other areas of the state. Our study demonstrated the utility of a single-reaction sequencing approach to generate high-quality WNV genomes directly from pools of mosquitoes collected for surveillance purposes while also highlighting the substantial WNV genetic diversity present during a severe WNV transmission season.

Introduction

West Nile virus (WNV) is an arthropod-borne flavivirus (arbovirus) that exists in a transmission cycle between passerine birds and Culex species mosquitoes and is widely distributed across the United States [1]. Since introduction to the United States, there have been >60,000 diagnosed clinical cases of WNV disease, of which >30,000 have resulted in severe neuroinvasive disease [2]. Importantly, clinical cases are underreported and the true burden of infections in the United States is estimated to be in the millions [3]. A substantial portion of individuals who develop symptomatic disease experience neurological sequelae [4], underscoring WNV as a significant source of both acute and chronic disease. As there is no licensed human vaccine or specific treatment for WNV, control depends entirely on reducing exposure to infectious mosquitoes [5]. Therefore, understanding WNV transmission dynamics is imperative to determine risk of human infection and ultimately implement control measures [6]. Entomological surveillance provides the best spatial-temporal indication of human risk of WNV transmission [79]. Accordingly, vector mosquitoes are systematically collected and tested for the presence of WNV seasonally throughout the United States [10]. While informative for inferring contemporary risk of infection, a simple presence/absence measure of WNV in mosquito populations does not allow for broader inference of transmission patterns. Understanding the connectivity of WNV populations across spatial scales, genotype maintenance through and across transmission seasons, and patterns of emergence and transmission initiation requires the addition of WNV genomic information.

Numerous efforts to understand WNV transmission patterns and evolution at a national scale have been undertaken using genomic approaches. These studies have shown that WNV dispersed rapidly across the United States during the early epidemic period [11,12] and dissemination was characterized by long-range movements of the virus [13]. WNV populations are thought to be loosely structured by geography across the US with substantial circulating genetic diversity in concurrent areas of transmission [11,14]. Since the introduction of WNV, multiple genotypes have arisen and come to dominate transmission. This includes the replacement of NY99 by WN02 [15,16], the rise of SW03 [17] in the Southwest, and more recently the emergence of NY10 [18] in the Northeast, Southeast, and Midwest regions of the United States. While national studies are crucial for inferring large-scale transmission patterns, such as the emergence of new genotypes, WNV transmission is highly heterogeneous across the United States. Studies on local WNV dynamics can reveal important factors influencing transmission and inform risk. Longitudinal genomic analysis of WNV in California identified the co-circulation of WN02 and SW03 and implicated overwintering mosquitoes and non-migratory bird populations for maintaining WNV genotypes through seasons [19]. An analysis of WNV genomes sampled over a decade in Connecticut indicated that control measures were unlikely to have a lasting impact beyond a disease season due to repeated WNV introductions that reinitiate transmission year to year [20]. Additionally, longitudinal genomic analysis and virus characterization from New York state identified the emergence of a novel genotype, NY10, and found its defining mutations resulted in increased infectivity and transmissibility by Culex pipiens mosquitoes and led to NY10 becoming the dominant genotype across the state and elsewhere in the United States [18,21,22]. Prior studies of WNV dynamics at local levels reveal important factors influencing transmission and potential risk. While informative, genomic studies have been limited to a few states and localities across the United States and have not included areas with the highest disease burden, such as the Central Plains [1].

Untargeted Next-Generation Sequencing (NGS) strategies, such as shotgun metagenomics, from pools of mosquitoes would be an ideal sequencing approach as it would not require prior knowledge of the pathogen, however these approaches lack sensitivity and result in a high proportion of non-arbovirus read data [23]. Therefore, sequencing arbovirus genomes from entomological samples requires either the isolation of virus in cell culture prior to sequencing and/or using target-specific amplification approaches. While isolating WNV in cell culture increases the specificity of NGS data, most mosquito surveillance programs around the United States rely on molecular detection methods, specifically RT-qPCR, to identify WNV in mosquito populations and no longer isolate viruses using cell culture [10]. Multiple amplicon-based sequencing approaches have been developed to generate high-quality WNV genomes [22,24] however these approaches rely on single amplicons to cover each region of the genome [24,25], which can result in amplicon drop-out when mutations occur in amplicon primer binding regions [26]. In this study, we described the development and validation of a single reaction amplicon-based NGS approach to generate coding-complete WNV genomes, called IDT xGen WNV. We compared our approach to the “Primal Scheme” amplicon-sequencing strategy that is commonly used for sequencing WNV. Finally, we demonstrated the utility of this approach by characterizing the 2023 WNV transmission season in Nebraska using entomological, epidemiological, and genomic data. We have shown that the IDT xGen WNV panel produced coding-complete and accurate genomes ideal for phylogenetic inference and outbreak characterization.

Materials and methods

IDT xGen WNV panel development

The IDT xGen Amplicon sequencing approach is similar to Primal Scheme, however it differs in that Primal Scheme uses an overlapping primer approach to generate amplicons to the complete virus genome by staggering primer design and conducting PCR in two separate reactions, where the IDT xGen Amplicon assay makes uses of “super amplicons” that are elongated and span multiple primer sequences, thus allowing for PCR to be conducted in a single tube (S1 Fig). A custom IDT xGen Amplicon panel was designed in partnership with Integrated DNA Technologies (IDT, Coralville, Iowa). A total of 50 publicly available WNV genomes were downloaded from GenBank [27] to create a multisequence alignment (MSA) using CLUSTAL Omega [28]. Primer sequences were designed from the consensus sequence to capture contemporary WNV Lineage 1A diversity while being robust to amplicon dropout due to nucleotide mismatches in primer binding regions. Primer sequences span the complete coding-sequence of WNV and do not amplify the 5’ and 3’ untranslated regions.

Sample selection

All WNV genomes in this study were generated from WNV positive mosquito pools collected by the Nebraska Department of Health and Human Services (NeDHHS) for routine seasonal WNV surveillance purposes from 2012-2024. Briefly, CDC light traps are placed in multiple counties across the state to collect Culex species mosquitoes from May through September. Trap contents are sent to NeDHHS on dry ice for morphological species identification and pooling. Pools are separated by location, date, and species and up to 50 individual mosquitoes stored in a tube with PBS prior to WNV testing. Mosquito pools were deemed positive for WNV via RT-qPCR [29] performed by the Nebraska Public Health Laboratory (NPHL). To characterize genomic data generated with the IDT xGen WNV panel, a total of 287 WNV positive mosquito pools collected during 2012–2024 were selected for sequencing. Between 20–50 WNV positive mosquitoes pools were selected for each year in the study in an attempt to get geographic and temporal representation across the state. This sample set included 107 samples from the 2023 WNV transmission season in Nebraska. Additionally, a subset of 30 samples were sequenced using both the IDT xGen WNV panel and the WNV Primal Scheme assay for direct comparison [24].

IDT xGen WNV library preparation, sequencing, and consensus sequence generation

Prior to nucleic acid extraction, all mosquito pools were homogenized by placing a single ball bearing in the tube containing mosquitoes/PBS and placed on a tissue homogenizer at 25 Hz for 1 minute. For WNV positive mosquito pools collected during 2012–2020, total nucleic acid was extracted using the MagMAX Viral/Pathogen Nucleic Acid Isolation Kit (ThermoFisher) using a KingFisher Flex instrument (ThermoFisher) according to manufacturer’s instructions. For WNV positive pools collected during 2021–2024, RNA was provided by NPHL. RNA was retested for the presence of WNV via a specific RT-qPCR assay using the Luna Universal Probe One-Step RT-qPCR Kit (New England Biolabs [NEB]) as previously described [30]. RNA was reverse transcribed to create cDNA using LunaScript RT SuperMix (NEB) according to manufacturer’s instructions and 10µl was used as input to the IDT xGen Amplicon Core Kit (IDT). Total RNA or cDNA was not quantified prior to library preparation. Library preparation was conducted according to manufacturer’s instruction with the following modifications: increased elution volume following panel-specific multiplex PCR to 25µl, increased elution post indexing PCR to 20µl, and removed samples from beads in each clean-up step. Samples were indexed with IDT xGen Normalase UDI Primers. Samples were normalized and pooled with the IDT xGen Normalase module according to manufacturer’s instructions. Following library normalization and pooling, final library pools were quantified with the Qubit dsDNA Quantification HS Assay (ThermoFisher). An initial pool of 20 samples was sequenced on the Illumina MiniSeq sequencing system at the University of Nebraska Medical Center (UNMC), and the remaining pools were sent to the Yale Center for Genomic Analysis (YCGA) for sequencing on Illumina NovaSeq 6000 sequencing system. A total of 100,000 reads were targeted for each sample. Following sequencing and demultiplexing, data was transferred to the Holland Computing Center (HCC) at the University of Nebraska for quality control and consensus sequence generation. Read level data were assessed for quality and had Illumina adapter sequences trimmed with fastp [31] where reads with a Q score <15 were removed. Following quality control, reads were aligned to the previously described consensus WNV genome using BWA [32], primer sequences were trimmed with fgbio [33],.bam files were created and indexed with samtools [34], and consensus sequences were generated with ViralConsensus [35]. At least 10 reads (10x depth) were required to call a base in the consensus sequence, and regions of the genome with <10 reads were assigned the ambiguous nucleotide “N”. A detailed bioinformatic script to conduct QC and consensus sequence generation from the IDT xGen WNV panel can be found in our GitHub repository.

Additionally, a subset of 10 samples prepared for Illumina sequencing were also prepared for sequencing on the Oxford Nanopore Technologies (ONT) MinION sequencing platform. These samples were processed and analyzed as described above using a modified version of the primer panel and of the IDT xGen Amplicon Core Kit reagents. The bioinformatic analysis is similar to that described above, however a constrained sequence length between 200–2000 base pairs was implemented using seqkit [36], and reads were aligned to the reference genome using minimap2 [37]. A detailed bioinformatic script to conduct QC and consensus sequence generation from the IDT xGen WNV panel sequenced on the ONT MinION platform can be found in our GitHub repository.

Primal scheme library preparation, sequencing, and consensus sequence generation

The subset of RNA from a total of 30 WNV positive mosquito pools described above also underwent library preparation using the Primal Scheme assay designed for WNV [24,25]. Following cDNA synthesis as described above, 5µl of cDNA was aliquoted to each of two tubes for multiplex PCR with Pool 1 and Pool 2 primers, respectively. Multiplex PCR was performed as previously described [24] with reduced PCR cycles to 30. Following bead-based sample purification using KAPA Pure Beads (Roche), samples were eluted into 25ul molecular grade water. Pool 1 and Pool 2 contents were pooled by sample, quantified with the Qubit dsDNA Quantification HS Assay (ThermoFisher), and used as input for library preparation using the Illumina DNA Prep Kit (Illumina) according to manufacturer’s instructions. Following library preparation, samples were quantified as previously described and pooled to equal concentrations prior to sequencing. Initially, 10 samples were sequenced on the MiniSeq system at UNMC. An additional pool of the remaining 20 samples was sent to YCGA for sequencing on Illumina NovaSeq 6000 sequencing system. A total of 100,000 reads were targeted for each sample. Following sequencing and demultiplexing, data was transferred to the HCC at the University of Nebraska for quality control and consensus sequence generation. Read level data were assessed for quality and had Illumina adapter sequences trimmed using fastp [31] where reads with a Q score <15 were removed. Following quality control, reads were aligned to a reference WNV genome using BWA [32],.bam files were created and indexed with samtools [34], and primers were trimmed and consensus sequences were generated using iVar [24]. At least 10 reads (10x depth) were required to call a base in the consensus sequence, and regions of the genome with <10 reads were assigned the ambiguous nucleotide “N”. A detailed bioinformatic script to conduct QC and consensus sequence generation can be found in our GitHub repository.

Validation of IDT xGen WNV panel

WNV genomes generated using the IDT xGen WNV panel were assessed for genome completeness by determining the number of ambiguous nucleotides (“N”) in the genome assembly using a custom bash script. The percent of reads aligning to the WNV genome was determined by parsing the output of the samtools flagstat command incorporated into our bioinformatic pipeline. The temporality of WNV genomes was assessed using TempEst [38]. Genomes generated with the IDT xGen WNV panel were aligned using MAFFT [39] and a maximum-likelihood (ML) phylogenetic analysis was conducted using PhyML [40]. The ML tree and a metadata file containing sample collection dates were imported into TempEst to calculate substitutions per site, correlation coefficient, and residual means squared. To compare genome accuracy, a multisequence alignment was generated using MAFFT [39] of the 30 samples prepared and sequenced with both the IDT xGen WNV panel and the Primal Scheme WNV assay (60 sequences total). A distance matrix for the alignment was produced in Geneious [41] and nucleotide differences between consensus sequence genomes for paired samples was generated. Nucleotide similarity was visualized using a heatmap and dendrogram produced with the heatmap.2 function in gplots [42]. To determine breadth of genome coverage associated with sequencing effort, total reads for paired samples were downsampled to 100,000 and 10,000, using seqkit [36] prior to consensus sequence generation. Genome completeness was then subsequently compared across paired samples using the full dataset and downsampled datasets.

Entomological and human WNV surveillance

To determine if WNV transmission dynamics varied across Nebraska during the 2023 transmission season, the state was partitioned into three regions, East, Central, and West, as described in a previous study assessing WNV seroprevalence in the state following the initial WNV epidemic in the state in 2003 [43]. These regions generally correlate with population size and average annual rainfall in Nebraska [44]. Weekly entomological surveillance data from the 2023 WNV transmission season in Nebraska was provided by the NeDHHS. WNV positivity of mosquito pools was determined by RT-qPCR [29] conducted by the NPHL. The vector index (VI), an estimate of the number of infected mosquitoes in a population [9], was calculated using the CDC PooledInfRate package available in R [45]. Total human cases of WNV per county in Nebraska were provided by the CDC National Arbovirus Surveillance System (ArboNET) [46]. The cumulative incidence statewide and for each of the three regions in 2023 was calculated by dividing the number of clinical cases of WNV in 2023 by the total population at each scale. Total WNV cases included both neuroinvasive and non-neuroinvasive cases. Population data was collected from the United States Census Bureau.

Phylogenetic analysis of 2023 WNV genomes from Nebraska

A total of 100 coding-complete genomes were generated from Nebraska in 2023 from the West (N = 48), Central (N = 34), and East (N = 18) regions. A custom Nextstrain build [47] was generated to characterize the WNV genomes collected during the 2023 WNV transmission season in Nebraska. Briefly, complete and publicly available WNV Lineage 1A genomes and corresponding metadata were downloaded from Pathoplexus [48]. Using the augur [49] pipeline, a multisequence alignment of 3,952 WNV genomes was created using MAFFT [39]. The alignment was used as input into IQ-TREE with 1,000 ultrafast bootstraps to generate a maximum-likelihood phylogenetic tree with nodal support values [50]. A time-calibrated tree was generated and ancestral state reconstruction of clades was conducted with TreeTime [51]. A final annotated tree was visualized in Auspice. Due to the variation in entomological and human risk observed in different regions of the state, genomes from Nebraska were annotated as either East, Central, or West depending on location of collection. To determine if genotypes were shared across the state, we identified clades containing three or more Nebraska 2023 genomes, UFBoot > 70, and with a time to most recent common ancestor (tMRCA) estimated to occur in 2019 or later using the “explode tree by” function in Nextstrain. The cutoff of 2019 was selected to identify clusters made up of WNV genomes from Nebraska that could have spread between the regions during our study time period. Additionally, the majority of publicly available WNV genomes from outside of Nebraska were generated prior to 2019 (S2 Fig). This approach would identify WNV genotypes that may have emerged in one region and were subsequently transmitted to other regions in the state during the 2023 transmission season. The discrete ancestral state reconstructions of these clades were inferred using TreeTime [51]. Phylogenetic trees presented in this study were visualized using baltic [52].

Primer sets for the IDT xGen WNV panel are available via IDT under CS#870 for the Illumina panel and CP#1082 for the ONT panel. Note: the IDT xGen WNV panel is for research use only. Unless otherwise agreed to in writing, IDT does not intend these products to be used in clinical applications and does not warrant their fitness or suitability for any clinical diagnostics use. The purchaser is solely responsible for all decisions regarding the use of these products and any associated regulatory or legal obligations. RUO25–3814_001.

Results

Characterization of genomes produced by the IDT xGen WNV panel

To validate the sequencing approach, we generated libraries from 287 WNV positive mosquito pools collected during 2012–2024 in Nebraska (Fig 1 and S3 Fig). From these libraries, we were able to generate 274 WNV genomes with >80% genome completeness (Fig 1A). Of these 274 genomes, 263 had > 95% genome completeness. Mosquito pools that produced lower CT values (i.e., higher WNV RNA concentration) produced more complete WNV genomes (Fig 1A). Samples that produced incomplete WNV genomes (N = 10) had high rates of off-target amplification as determined by the percent of reads in the library that aligned to the WNV genome (Fig 1A). We observed this phenomenon in high CT value samples with low genome completeness. These samples produced quantifiable libraries and total read counts comparable to low CT values samples with high genome completeness, however the majority of reads did not align to the WNV genome. We then assessed the temporal signal of WNV genomes produced using the IDT xGen WNV panel. We observed an expected “clock-like” mutation-rate from the 274 genomes where the genetic distance was strongly correlated with sampling time (r = 0.805, p=<00001) (Fig 1B). From these data, we estimated the nucleotide substitution rate to be 3.97 x 10-4 substitutions per site per year.

Fig 1. Validation of WNV genomes generated using the IDT xGen WNV panel.

Fig 1

(A) Genome completeness for 287 genomes generated from Nebraska as measured by the proportion of positions in the WNV coding sequence with ≥ 10 reads. The color gradient for the circles depicts the percent of total reads aligning to the WNV genome. Lighter circles have a higher overall alignment rate. (B) Root-to-tip plot of 274 WNV genomes generated from Nebraska with >80% completeness plotting the time the sample was collected compared to the genetic distance of the sample.

Comparison of the IDT xGen WNV panel to the WNV primal scheme assay

To determine the accuracy of genomes produced with the IDT xGen WNV panel, we resequenced 30 samples using the WNV Primal Scheme assay [24] to determine if both approaches produced identical consensus sequence genomes. We observed a high level of consensus sequence genome identity between approaches across a broad range of contemporary WNV genetic diversity (Fig 2A). Assessment of pairwise nucleotide identity revealed that 28 of the 30 genomes sequenced using both approaches produced from the same sample were 100% identical across the complete coding sequence (Fig 2B). The two samples that were not identical across the genome, UNMC0019 and UNMC0567, had a total of 2 and 6 nucleotide mismatches, respectively.

Fig 2. Comparison of WNV genomes by sequencing approach.

Fig 2

(A) A maximum likelihood phylogenetic tree depicting the WNV genomes from Nebraska generated as a part of this study. Tips shown in red were sequenced using both the IDT xGen WNV panel and the Primal Scheme for WNV assay. (B) A dendrogram and heatmap showing the pairwise nucleotide identity of WNV genomes from the same sample generated with both approaches. (C) Genome completeness for each paired sample sequenced using both approaches. Red dots represent genomes sequenced with Primal Scheme and blue dots represent genomes sequenced with IDT. Sequencing effort refers to how read datasets were down sampled, fewer reads are depicted with lighter colors. Max refers to the total amount of read data for that given sample, 100K refers to 100,000 reads, and 10K refers to 10,000 reads. The lowest read count is displayed most forward. (D) Box-and-whisker plot showing the difference in genome completeness by sequencing approach from the max sequencing effort compared to the most reduced read dataset of 10,000 reads. Horizontal black lines represent the median difference. For B,C, and D, “PS” refers to “Primal Scheme” and “IDT” refers to “IDT xGen WNV”.

We then compared genome completeness of paired samples with varying levels of sequencing effort by systematically downsampling the number of reads in each library (Fig 2C). At the maximum sequencing depth (i.e., the number of reads produced by a given library), both sequencing approaches produced coding-complete or nearly coding-complete genomes, with the exception UNMC0019 sequenced with Primal Scheme, which was ~ 75% complete. Notably, UNMC0261 showed amplicon dropout in the 10,000 read count datasets for both approaches, however the dropout was sharper for the genome generated using the IDT xGen WNV panel. The variation in genome completeness when comparing the maximum sequencing effort datasets to the most reduced datasets of 10,000 was wider for samples produced with the Primal Scheme assay compared to those produced with the IDT xGen WNV panel (Fig 2D).

Comparison of the IDT xGen WNV panel for ONT MinION sequencing

A subset of these samples were resequenced using the IDT xGen WNV kit for ONT MinION sequencing. Of the 10 samples sequenced with all 3 approaches/strategies (IDT xGen WNV on Illumina, IDT xGen WNV on the ONT MinION, and WNV Primal Scheme) all produced complete and identical consensus sequence genomes (S1 Table).

Entomological and epidemiological surveillance of the 2023 WNV transmission season

Entomological surveillance for WNV occurred in 21 counties across Nebraska during epidemiological weeks 22 (first week in June) to 39 (last week in September) (Fig 3A). Over the course of the 2023 transmission season, a total of 38,051 Culex spp. mosquitoes were collected and tested for the presence of WNV RNA which resulted in 224 positive mosquito pools. The West region of the state produced the largest number of Culex spp. mosquitoes and mosquito pools, followed by the Central and then East regions (Fig 3B). The West region also had an earlier peak in mosquito abundance in epidemiological week 30 compared to epidemiological 34 in the Central region and 36 in the East region. This time difference was reflected in the vector index (VI) across each region, which was highest and rises earlier in the west compared to the rest of the state (Fig 3C). In total, 150 clinical cases of WNV were diagnosed in Nebraska in 2023, resulting in an overall cumulative incidence of 7.54 cases per 100,000 individuals (Table 1). The geographic distribution of human surveillance data reflected entomological surveillance data across the state, where the West region had a cumulative incidence of 43.72 per 100,000, which far exceeded that of the East and Central regions.

Fig 3. Entomological description of the WNV outbreak in Nebraska in 2023.

Fig 3

(A) Map of Nebraska split into three zones, East, Central, and West. Counties where mosquitoes were collected and tested for WNV contain dots. Fig 3a was made in ArcGIS and the base layer was obtained from the Geographic Information Office for the State of Nebraska. Link here: https://www.arcgis.com/home/item.html?id=f8f32ae0dd254524a81477c656e3e469 (B) Mosquito abundance over the course of the 2023 WNV transmission season by zone in Nebraska presented by epidemiological week. (C) VI calculations for each zone in Nebraska over the course of the 2023 WNV transmission season presented by epidemiological week.

Table 1. Number of cases and cumulative incidence of WNV across Nebraska.

Locality Total Cases Cumulative Incidence per 100,000
Nebraska 150 7.54
East 63 4.28
Central 41 9.95
West 46 43.72

Phylodynamics of the 2023 WNV transmission season

The phylogenetic analysis was focused on genomes generated from the 2023 WNV transmission season in Nebraska. Given the clear differences in entomological and human risk identified across the state (Fig 3 and Table 1), we aimed to determine if WNV genotypes identified during the initiation of transmission in the West region, which had the highest VI and the earliest peak in transmission, were identified in the East or Central region. A total of 100 coding-complete or nearly coding-complete WNV genomes were generated from Nebraska with sequencing effort per region proportional to the number of WNV positive mosquito pools (Fig 4A). All positive mosquito pools were classified as WNV Lineage 1A. The majority of genomes sequenced were singletons on long branches that extended from clades with a MRCA that did not originate in Nebraska according to our ancestral state reconstruction. To determine whether WNV genotypes were shared across the state, we identified and annotated clades that contained 1) >3 taxa, 2) had UFBoot values >0.7, and 3) had a tMRCA post 2018. A total of six clades met these criteria (Fig 4B). The majority of samples contained within these clades were collected in either the West or Central regions, indicating that genotypes from the eastern side of the state were unlikely to be introduced from the West region. Additionally, all clades had tMRCA values prior to 2023, indicating these genotypes had been circulating in the state prior to the 2023 transmission season. The ancestral state reconstruction indicated that all six clades had originated in the West region in Nebraska, however this could be due to sparse sampling of WNV genomes from elsewhere in the United States between 2019 and 2023 (S2 Fig).

Fig 4. Phylogenetic analysis of the 2023 WNV outbreak in Nebraska.

Fig 4

(A) A time-calibrated maximum likelihood phylogenetic tree highlighting 100 WNV genomes generated from Nebraska in 2023. (B) Exploded tree view of Nebraska WNV clades identified in 2023. Confidence intervals for the tMRCA values are represented in grey.

Discussion

Incorporating genomics into routine entomological surveillance will improve the understanding of WNV transmission dynamics at local and regional scales. To this end, we sought to develop and validate a single reaction amplicon-based NGS approach to generate coding-complete and accurate WNV genomes. We generated 274 coding-complete WNV genomes from mosquito pools collected in Nebraska during 2012–2024 (Fig 1). The IDT xGen WNV panel generated coding-complete genomes from the majority of samples assessed. Genome completeness was reduced in samples with higher WNV CT values (lower viral input quantities) as seen in other amplicon-based sequencing assays [53,54]. Interestingly, samples with high CT values that failed to produce coding-complete genomes had relatively equal read distributions across regions of the WNV genome but had high rates of off-target read alignment, indicating that increasing the amount of sequence data from such samples would result in more complete genomes. WNV genomes from Nebraska demonstrated an expected clock-like mutation-rate and an inferred nucleotide substitution rate of 3.97 x 10-4 substitutions per site per year is on par with estimates from other WNV studies [11,19,20,22,55].

In addition to genome completeness, accuracy of genomic content is crucial for correct phylogenetic inference.

For 30 samples representing a diversity of WNV genotypes across Nebraska, we compared consensus sequence genomes generated using the IDT xGen WNV panel and the WNV Primal Scheme assay (Fig 2). A pairwise comparison demonstrated 100% nucleotide identity across the WNV genome in 28 of the 30 paired consensus sequence genomes. The two samples that did not produce 100% identical genomes, UNMC0019 and UNMC0567, had 2 and 6 nucleotide mismatches, respectively. Upon further inspection of these alignments, nucleotide mismatches were the result of an ambiguous nucleotide (e.g., “Y” to represent either a “C” or “T”) called in the genomes generated with the WNV Primal Scheme assay. However, as the Primal Scheme bioinformatic pipeline was originally set to call a base with a simple majority frequency of >50%, modifying the consensus sequence generation step to allow for ambiguous nucleotides resolved this discrepancy. Additionally, amplicon dropout that resulted in reduced genome completeness occurred sparingly in both approaches, and downsampled datasets that contained only 10,000 reads produced coding-complete or nearly coding-complete genomes in both sequencing approaches.

While both approaches produced high-quality genomes, there are important distinctions between the two. For one, the IDT xGen WNV panel has a streamlined laboratory protocol where amplicon generation occurs in a single tube per sample, whereas two separate PCRs are required for Primal Scheme assays [24,25]. Additionally, the IDT xGen WNV panel allows for amplicon generation and library preparation for both Illumina and Oxford Nanopore MinION sequencing platforms from a single kit. The Primal Scheme approach, however, offers more flexibility in reagents for amplicon generation and library preparation. The IDT xGen WNV panel encompasses the full WNV CDS, where the Primal Scheme assay includes targets in both the 5’ and 3’ UTR, resulting in greater proportion of the total genome represented in final consensus sequences. Incorporating primer sequences to target the UTRs is an important future direction for the IDT xGen WNV panel.

The 2023 WNV transmission season in Nebraska was particularly severe with 150 cases of WNV disease diagnosed across the state, the most on record since 2018 when 251 cases were diagnosed (Table 1) [2]. Nebraska had the fourth most reported WNV cases in 2023 behind Colorado, California, and Texas, respectively [56]. Analysis of entomological data, specifically the abundance of mosquitoes and the VI, indicated that enzootic transmission of WNV was more severe in the West region of the state compared to the Central and East regions (Fig 3). This pattern was reflected in human case data in different regions across the state, where the cumulative incidence is substantially higher in the West region, 43.72 per 100,000, compared to the Central and Eastern regions, 9.95 and 4.28 per 100,000, respectively. These results demonstrate that entomological risk metrics, such as the VI, were useful to predict human cases at large geographical scales, as observed elsewhere [8].

In addition to a higher abundance of Culex mosquitoes and a higher VI, enzootic transmission peaked earlier in the West region. This observation led us to explore if earlier and more intense transmission in the West region resulted in dispersal of WNV to the rest of the state. To answer this question, we generated 100 coding-complete WNV genomes from WNV positive mosquito pools collected across the state in 2023 using the IDT xGen WNV panel (Fig 4). We demonstrated that WNV genomes from Nebraska sequenced in 2023 were broadly distributed across the WNV Lineage 1A phylogeny, suggesting little association between genotype and region of collection. A similar study assessing WNV diversity from Suffolk County, NY in 2012 also found high levels of WNV sequence diversity from samples collected in the same location [57]. Additional studies have also found substantial genetic diversity in sequenced WNV genomes from the same locations in the same collection years [5860].

To explore this question further, we conducted a discrete phylogeographic analysis using a time-calibrated maximum-likelihood phylogenetic tree to identify clades that contained genomes from across the state that would indicate shared transmission networks. The results of our phylogenetic assessment indicated that of the 100 WNV genomes generated from Nebraska in 2023, six clades containing 26 genomes met our criteria for nodal support and tMRCA. Each of these clades had a > 95% discrete state probability of arising from Nebraska and specifically in the West region. A single genome derived from samples collected in the East region was placed in a clade with genomes from the West region, where every other clade contained genomes from either the West and Central regions, or the West region alone. No clades had an estimated tMRCA in 2023 and most had tMRCA estimates prior to 2021, suggesting these genomes have been present in the state over multiple transmission seasons. In general, these results suggest an absence of geographic structure of WNV genomes by broadly defined regions in Nebraska during the 2023 outbreak. A study of WNV dynamics in New York state by Bialosuknia et al. found a similar absence of geographic structure in their phylogenetic analysis with the exception of two well-supported clades [22], one of them being the NY10 genotype which was associated with an increase in cases in 2012 [18,21]. The persistence of specific WNV genotypes in high-incidence areas in Nebraska remains to be determined. Future work will focus on longitudinal genomic analyses to evaluate how and why genotypes persist in the state once more complete genomic datasets are generated.

Multiple limitations of this study need to be considered. First, our comparative analysis of the IDT xGen WNV panel to the Primal Scheme assay for WNV was limited to samples collected from Nebraska. Although these genomes do represent substantial contemporary WNV genetic diversity (Fig 2A), circulating genotypes such as SW03 were not compared. Additionally, our phylogeographic analysis was limited to a snapshot of the 2023 WNV transmission season in Nebraska. While we were able to generate numerous genomes from 2023, the lack of clustering of Nebraska samples from this year may be due to insufficient sampling in our sequencing data. Thus, we would likely need to sequence more genomes from 2023 to observe clustering patterns. In addition, the genomes from the 2023 outbreak in Nebraska were on long branches extending out from well supported clades, likely because there was a lack of contextual WNV sequence data from elsewhere in the United States for the same time period (S2 Fig). A temporal analysis from Nebraska as more data becomes available from around the United States will provide better estimates of clustering patterns of WNV genomes in high-incidence settings.

In conclusion, our results indicated that the IDT xGen WNV panel produces high-quality WNV genomes that can be used to characterize WNV transmission dynamics.

Supporting information

S1 Fig. Cartoon schematic highlighting the differences in amplification approaches between Primal Scheme (A) and the WNV IDT xGen approach (B).

(TIF)

pntd.0013931.s001.tif (210.2KB, tif)
S2 Fig. Publicly available WNV genomes used in this study by year or collection.

(TIF)

S3 Fig. WNV Genomes generated as a part of this study by year of collection.

(TIF)

pntd.0013931.s003.tif (1.1MB, tif)
S1 Table. Comparison of sequencing data and consensus sequence genomes from 10 matched samples prepared and sequenced using different approaches.

(XLSX)

pntd.0013931.s004.xlsx (11.5KB, xlsx)

Acknowledgments

We would like to acknowledge Nathan Grubaugh from the Yale School of Public Health for helpful comments on analysis and providing the Primal Scheme primer pools. We would also like to acknowledge the national arbovirus surveillance system ArboNET for providing human WNV case data for the state of Nebraska for 2023.

Data Availability

All data generated this study are available at the European Nucleotide Archive (ENA) under accession PRJEB96341. Final genomes and read-level data are available for each sample. These data have been cross listed on NCBI GenBank and NCBI Short Read Archive and can be found via specific accession numbers for each sample available on our GitHub Page. All code and metadata, including accession numbers, used in this analysis are available on our GitHub: github.com/josephfauver/WNV_Methods_Outbreak_Manuscript.

Funding Statement

This work was supported in part by start-up funds from the UNMC VCR Office provided to JRF. Additionally, this publication is supported by Cooperative Agreement Number NU60OE000104 (CFDA #93.322), funded by the Centers for Disease Control and Prevention (CDC) of the US Department of Health and Human Services (HHS) providing salary support to SF and ZP. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of APHL, CDC, HHS or the US Government. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Petersen LR. Epidemiology of West Nile Virus in the United States: implications for arbovirology and public health. J Med Entomol. 2019;56(6):1456–62. doi: 10.1093/jme/tjz085 [DOI] [PubMed] [Google Scholar]
  • 2.CDC. ArboNET. 2023. https://www.cdc.gov/mosquitoes/mosquito-control/professionals/ArboNET.html [Google Scholar]
  • 3.Ronca SE, Murray KO, Nolan MS. Cumulative incidence of West Nile virus infection, continental United States, 1999-2016. Emerg Infect Dis. 2019;25:325–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Patel H, Sander B, Nelder MP. Long-term sequelae of West Nile virus-related illness: a systematic review. Lancet Infect Dis. 2015;15(8):951–9. doi: 10.1016/S1473-3099(15)00134-6 [DOI] [PubMed] [Google Scholar]
  • 5.CDC. Treatment and prevention of West Nile virus disease. West Nile Virus. 2024. https://www.cdc.gov/west-nile-virus/hcp/treatment-prevention/index.html [Google Scholar]
  • 6.Pollett S, Fauver JR, Maljkovic Berry I, Melendrez M, Morrison A, Gillis LD, et al. Genomic epidemiology as a public health tool to combat mosquito-borne virus outbreaks. J Infect Dis. 2020;221(Suppl 3):S308–18. doi: 10.1093/infdis/jiz302 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Barker CM. Models and surveillance systems to detect and predict West Nile Virus outbreaks. J Med Entomol. 2019;56(6):1508–15. doi: 10.1093/jme/tjz150 [DOI] [PubMed] [Google Scholar]
  • 8.Kilpatrick AM, Pape WJ. Predicting human West Nile virus infections with mosquito surveillance data. Am J Epidemiol. 2013;178(5):829–35. doi: 10.1093/aje/kwt046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Fauver JR, et al. Temporal and spatial variability of entomological risk indices for West Nile virus infection in northern Colorado: 2006–2013. J Med Entomol. 2016;53:425–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Centers for Disease Control and Surveillance. West Nile virus surveillance and control guidelines. 2024. https://www.cdc.gov/west-nile-virus/media/pdfs/2024/08/WestNileVirus-SurveillanceControlGuidelines_508-h.pdf [Google Scholar]
  • 11.Di Giallonardo F, Geoghegan JL, Docherty DE, McLean RG, Zody MC, Qu J, et al. Fluid spatial dynamics of west nile virus in the United States: rapid spread in a permissive host environment. J Virol. 2015;90(2):862–72. doi: 10.1128/JVI.02305-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dellicour S, Lequime S, Vrancken B, Gill MS, Bastide P, Gangavarapu K, et al. Epidemiological hypothesis testing using a phylogeographic and phylodynamic framework. Nat Commun. 2020;11(1):5620. doi: 10.1038/s41467-020-19122-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Pybus OG, Suchard MA, Lemey P, Bernardin FJ, Rambaut A, Crawford FW, et al. Unifying the spatial epidemiology and molecular evolution of emerging epidemics. Proc Natl Acad Sci U S A. 2012;109(37):15066–71. doi: 10.1073/pnas.1206598109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hadfield J, Brito AF, Swetnam DM, Vogels CBF, Tokarz RE, Andersen KG, et al. Twenty years of West Nile virus spread and evolution in the Americas visualized by Nextstrain. PLoS Pathog. 2019;15(10):e1008042. doi: 10.1371/journal.ppat.1008042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ebel GD, Carricaburu J, Young D, Bernard KA, Kramer LD. Genetic and phenotypic variation of West Nile virus in New York, 2000-2003. Am J Trop Med Hyg. 2004;71:493–500. [PubMed] [Google Scholar]
  • 16.Davis CT, et al. Phylogenetic analysis of North American West Nile virus isolates, 2001-2004: evidence for the emergence of a dominant genotype. Virology. 2005;342:252–65. [DOI] [PubMed] [Google Scholar]
  • 17.McMullen AR, May FJ, Li L, Guzman H, Bueno R Jr, Dennett JA, et al. Evolution of new genotype of West Nile virus in North America. Emerg Infect Dis. 2011;17(5):785–93. doi: 10.3201/eid1705.101707 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Fautt C, Boudreau MR, Mooney AC, Cohnstaedt LW, Hudson AR, Humphreys JM. The emergence of NY10: insights into the 2012 West Nile Virus outbreak in the United States. Virus Evol. 2025;11(1):veaf037. doi: 10.1093/ve/veaf037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Duggal NK, Reisen WK, Fang Y, Newman RM, Yang X, Ebel GD, et al. Genotype-specific variation in West Nile virus dispersal in California. Virology. 2015;485:79–85. doi: 10.1016/j.virol.2015.07.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Armstrong PM, Vossbrinck CR, Andreadis TG, Anderson JF, Pesko KN, Newman RM, et al. Molecular evolution of West Nile virus in a northern temperate region: connecticut, USA 1999-2008. Virology. 2011;417(1):203–10. doi: 10.1016/j.virol.2011.06.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Bialosuknia SM, Dupuis Ii AP, Zink SD, Koetzner CA, Maffei JG, Owen JC, et al. Adaptive evolution of West Nile virus facilitated increased transmissibility and prevalence in New York State. Emerg Microbes Infect. 2022;11(1):988–99. doi: 10.1080/22221751.2022.2056521 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Bialosuknia SM, et al. Evolutionary dynamics and molecular epidemiology of West Nile virus in New York State: 1999-2015. Virus Evol. 2019;5:vez020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Fauver JR, Akter S, Morales AIO, Black WC 4th, Rodriguez AD, Stenglein MD, et al. A reverse-transcription/RNase H based protocol for depletion of mosquito ribosomal RNA facilitates viral intrahost evolution analysis, transcriptomics and pathogen discovery. Virology. 2019;528:181–97. doi: 10.1016/j.virol.2018.12.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Grubaugh ND, Gangavarapu K, Quick J, Matteson NL, De Jesus JG, Main BJ, et al. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol. 2019;20(1):8. doi: 10.1186/s13059-018-1618-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kent C, Smith AD, Tyson J, Stepniak D, Kinganda-Lusamaki E, Lee T, et al. PrimalScheme: open-source community resources for low-cost viral genome sequencing. openRxiv. 2024. doi: 10.1101/2024.12.20.629611 [DOI] [Google Scholar]
  • 26.Kuchinski KS, Nguyen J, Lee TD, Hickman R, Jassem AN, Hoang LMN, et al. Mutations in emerging variant of concern lineages disrupt genomic sequencing of SARS-CoV-2 clinical specimens. Int J Infect Dis. 2022;114:51–4. doi: 10.1016/j.ijid.2021.10.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2016;44:D67-72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Madeira F, Madhusoodanan N, Lee J, Eusebi A, Niewielska A, Tivey ARN, et al. The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Res. 2024;52(W1):W521–5. doi: 10.1093/nar/gkae241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Brault AC, Fang Y, Reisen WK. Multiplex qRT-PCR for the detection of Western Equine Encephalomyelitis, St. Louis Encephalitis, and West Nile viral RNA in mosquito pools (Diptera: Culicidae). J Med Entomol. 2015;52(3):491–9. doi: 10.1093/jme/tjv021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lanciotti RS, Kerst AJ, Nasci RS, Godsey MS, Mitchell CJ, Savage HM, et al. Rapid detection of west nile virus from human clinical specimens, field-collected mosquitoes, and avian samples by a TaqMan reverse transcriptase-PCR assay. J Clin Microbiol. 2000;38(11):4066–71. doi: 10.1128/JCM.38.11.4066-4071.2000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34(17):i884–90. doi: 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv. 2013.
  • 33.Fennell T, N H. Fgbio: tools for working with genomic and high throughput sequencing data. https://github.com [Google Scholar]
  • 34.Li H. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Moshiri N. ViralConsensus: a fast and memory-efficient tool for calling viral consensus genome sequences directly from read alignment data. Bioinformatics. 2023;39(5):btad317. doi: 10.1093/bioinformatics/btad317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Shen W, Le S, Li Y, Hu F. SeqKit: a cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLoS One. 2016;11(10):e0163962. doi: 10.1371/journal.pone.0163962 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Li H. New strategies to improve minimap2 alignment accuracy. Bioinformatics. 2021;37(23):4572–4. doi: 10.1093/bioinformatics/btab705 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Rambaut A, Lam TT, Max Carvalho L, Pybus OG. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2016;2(1):vew007. doi: 10.1093/ve/vew007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30(14):3059–66. doi: 10.1093/nar/gkf436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59(3):307–21. doi: 10.1093/sysbio/syq010 [DOI] [PubMed] [Google Scholar]
  • 41.Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28(12):1647–9. doi: 10.1093/bioinformatics/bts199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Warnes R, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, et al. Various R programming tools for plotting data. Comprehensive R Archive Network (CRAN); 2024. [Google Scholar]
  • 43.Schweitzer BK, Kramer WL, Sambol AR, Meza JL, Hinrichs SH, Iwen PC. Geographic factors contributing to a high seroprevalence of West Nile virus-specific antibodies in humans following an epidemic. Clin Vaccine Immunol. 2006;13(3):314–8. doi: 10.1128/CVI.13.3.314-318.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.PRISM Group Oregon State University. Mean annual precipitation in Nebraska (1999-2020). https://prism.oregonstate.edu/projects/gallery_view.php?state=NE [Google Scholar]
  • 45.Functions for statistical estimation with pooled or grouped binary data. GitHub. https://github.com/CDCgov/PooledInfRate [Google Scholar]
  • 46.CDC. Mosquitoes. 2024. https://www.cdc.gov/mosquitoes/php/arbonet/index.html [Google Scholar]
  • 47.Hadfield J, Megill C, Bell SM, Huddleston J, Potter B, Callender C, et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics. 2018;34(23):4121–3. doi: 10.1093/bioinformatics/bty407 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Dalla Vecchia E. Pathoplexus: towards fair and transparent sequence sharing. Lancet Microbe. 2024;5(12):100995. doi: 10.1016/j.lanmic.2024.100995 [DOI] [PubMed] [Google Scholar]
  • 49.Huddleston J, Hadfield J, Sibley TR, Lee J, Fay K, Ilcisin M, et al. Augur: a bioinformatics toolkit for phylogenetic analyses of human pathogens. J Open Source Softw. 2021;6(57):2906. doi: 10.21105/joss.02906 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32(1):268–74. doi: 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Sagulenko P, Puller V, Neher RA. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 2018;4(1):vex042. doi: 10.1093/ve/vex042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Dudas G. Baltic: Baltic - backronymed adaptable lightweight tree import code for molecular phylogeny manipulation, analysis and visualisation. Development is back on the Evogytis/baltic branch (i.e. here). 2023. https://github.com/Evogytis/baltic [Google Scholar]
  • 53.Alpert T, Vogels CBF, Breban MI, Petrone ME, Wyllie AL, Grubaugh ND, et al. Sequencing SARS-CoV-2 genomes from saliva. Virus Evol. 2022;8(1):veab098. doi: 10.1093/ve/veab098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Vogels CBF, Hill V, Breban MI, Chaguza C, Paul LM, Sodeinde A, et al. DengueSeq: a pan-serotype whole genome amplicon sequencing protocol for dengue virus. BMC Genomics. 2024;25(1):433. doi: 10.1186/s12864-024-10350-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Snapinn KW, Holmes EC, Young DS, Bernard KA, Kramer LD, Ebel GD. Declining growth rate of West Nile virus in North America. J Virol. 2007;81(5):2531–4. doi: 10.1128/JVI.02169-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Padda H, Jacobs D, Gould CV, Sutter R, Lehman J, Staples JE, et al. West Nile virus and other nationally notifiable arboviral diseases - United States, 2023. MMWR Morb Mortal Wkly Rep. 2025;74(21):358–64. doi: 10.15585/mmwr.mm7421a1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Ehrbar DJ, Ngo KA, Campbell SR, Kramer LD, Ciota AT. High levels of local inter- and intra-host genetic variation of West Nile virus and evidence of fine-scale evolutionary pressures. Infect Genet Evol. 2017;51:219–26. doi: 10.1016/j.meegid.2017.04.010 [DOI] [PubMed] [Google Scholar]
  • 58.Mann BR. Continued evolution of West Nile virus, Houston, Texas, USA, 2002-2012. Emerg Infect Dis. 2013;19:1418–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Amore G, et al. Multi-year evolutionary dynamics of West Nile virus in suburban Chicago, USA, 2005-2007. Philos Trans R Soc Lond B Biol Sci. 2010;365:1871–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Hepp CM, Cocking JH, Valentine M, Young SJ, Damian D, Samuels-Crow KE, et al. Phylogenetic analysis of West Nile Virus in Maricopa County, Arizona: evidence for dynamic behavior of strains in two major lineages in the American Southwest. PLoS One. 2018;13(11):e0205801. doi: 10.1371/journal.pone.0205801 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0013931.r001

Decision Letter 0

Doug Brackney

23 Dec 2025

Response to Reviewers Revised Manuscript with Track Changes Manuscript

Shaden Kamhawi

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-4304-636XX

Paul Brindley

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-1765-0002

Journal Requirements:

1) Please ensure that the CRediT author contributions listed for every co-author are completed accurately and in full.

At this stage, the following Authors/Authors require contributions: Shawn Freed, Sarah Chandler, Sarah Uhm, Zach Pella, Dikchha Gurung, Halie Smith, Tammy Dowdy, Amanda M. Bartling, Ava Butz, Michael R. Wiley, M. Jana Broadhurst, Sydney Stein, Emily L. McCutchen, Jeff Hamik, Peter C. Iwen, Nick Downey, Kaylee S. Herzog, and Joseph R Fauver. Please ensure that the full contributions of each author are acknowledged in the "Add/Edit/Remove Authors" section of our submission form.

The list of CRediT author contributions may be found here: https://journals.plos.org/plosntds/s/authorship#loc-author-contributions

2) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines:

https://journals.plos.org/plosntds/s/figures

3) We notice that your supplementary Figures, and Tables are included in the manuscript file. Please remove them and upload them with the file type 'Supporting Information'. Please ensure that each Supporting Information file has a legend listed in the manuscript after the references list.

Reviewers' comments:

Key Review Criteria Required for Acceptance?

As you describe the new analyses required for acceptance, please consider the following:

Methods:

-Are the objectives of the study clearly articulated with a clear testable hypothesis stated?

-Is the study design appropriate to address the stated objectives?

-Is the population clearly described and appropriate for the hypothesis being tested?

-Is the sample size sufficient to ensure adequate power to address the hypothesis being tested?

-Were correct statistical analysis used to support conclusions?

-Are there concerns about ethical or regulatory requirements being met?

Reviewer #1: (No Response)

Reviewer #2: -Are the objectives of the study clearly articulated with a clear testable hypothesis stated? Yes

-Is the study design appropriate to address the stated objectives? Yes

-Is the population clearly described and appropriate for the hypothesis being tested? Yes

-Is the sample size sufficient to ensure adequate power to address the hypothesis being tested? Yes

-Were correct statistical analysis used to support conclusions? N/A

-Are there concerns about ethical or regulatory requirements being met? No

**********

Results

-Does the analysis presented match the analysis plan?

-Are the results clearly and completely presented?

-Are the figures (Tables, Images) of sufficient quality for clarity?

Reviewer #1: (No Response)

Reviewer #2: -Does the analysis presented match the analysis plan? Yes

-Are the results clearly and completely presented? Needs minor revisions (see below)

-Are the figures (Tables, Images) of sufficient quality for clarity? Needs minor revisions (see below)

**********

Conclusions

-Are the conclusions supported by the data presented?

-Are the limitations of analysis clearly described?

-Do the authors discuss how these data can be helpful to advance our understanding of the topic under study?

-Is public health relevance addressed?

Reviewer #1: (No Response)

Reviewer #2: -Are the conclusions supported by the data presented? Yes

-Are the limitations of analysis clearly described? Yes

-Do the authors discuss how these data can be helpful to advance our understanding of the topic under study? Yes

-Is public health relevance addressed? Yes

**********

Editorial and Data Presentation Modifications?

Use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity. If the only modifications needed are minor and/or editorial, you may wish to recommend “Minor Revision” or “Accept”.

Reviewer #1: (No Response)

Reviewer #2: MAJOR COMMENTS:

1. Line 29: Authors must be cautious when using the term “complete genomes” as the IDT panel does not cover the 5’ and 3’ UTRs (as stated in lines 119-120). Authors must rewrite every instance of “complete genome” in the manuscript to “coding-complete”, which is the standard language used when genome termini are not fully determined.

2. Line 30: While the IDT xGen WNV panel undoubtedly produces coding-complete genomes, I am not convinced that the authors provide significant evidence to say that the panel is “more robust to amplicon dropout”. Although theoretically, super amplicons generated with the IDT xGen panels are expected to provide coverage when nucleotide variants occur in a primer binding site, the data presented in this manuscript do not demonstrate that this happens in any meaningful way using either sequencing approach. In fact, the only example given, that of sample UNMC0261, shows “amplicon dropout” with both methods, with the most significant dropout present in the genome generated with the IDT xGen panel. The authors themselves highlight this (lines 277-278 and line 375); therefore, I am unsure why this claim was left in line 30 of the abstract, but the authors should consider revising this sentence.

3. Line 122: “Sample selection” section needs significant clarification on the methodology. Do all 287 WNV-positive pools have the same number of individuals per pool? How were insect pools generated? Provide details on logistical workflow from field to lab (i.e., cold chain maintained? Were they stored in stabilization buffer such as RNAlater or similar?). How was species identification performed? (morphological identification, DNA barcoding?). Please provide as much detail as possible.

4. Line 131: Were pools mechanically disrupted prior to extraction? (i.e., using a tissue homogenizer, mortar and pestle, etc.?) Please be as detailed as possible.

5. Line 137: How much cDNA was used as input for the IDT Amplicon Core kit?

6. Line 165: Reference 25 is only for the PrimalScheme open-source preprint, not a specific protocol designed for WNV sequencing. Please correct, as this is misleading.

7. Line 318, Figure 3: Distribution of the 224 WNV-positive mosquito pools across the three regions surveilled should be included somewhere in figure 3 or at least mentioned in the results. Only the total is given for the whole state, followed by which region produced the largest number of Culex spp. pools.

8. Line 256, Figure 1A: The Figure is slightly confusing. Data is very informative and fine as is, but the Y-axis needs more explanation in the figure legend. I assume that the Y-axis represents the percentage of the genome sequence with a coverage depth of greater than 10x (at least 10 reads at each base), but this is not clearly explained anywhere in the manuscript.

9. Lines 148-153 and 176-180: Providing more detailed information on how the reads were quality controlled (quality score limits, removal of terminal nucleotides, minimum read length, etc.) and final genome read coverage (average depth of coverage) would significantly improve this manuscript.

10. Line 231: As currently written, I do not believe the data availability for all sequencing data generated meets the standards required for this journal. The GitHub link provided is useful, but it does not make it easy for readers to access the consensus sequences and corresponding sequencing metrics. Please edit the sample metadata table in GitHub to include all relevant metadata associated with each generated genome, including columns for GenBank accession numbers (hyperlinked to publicly available data), total number of reads, number of reads mapped to WNV, sequence coverage %, length, GC content, etc.

11. Line 231: Additionally, the data availability section makes no mention of whether the raw sequence data will be made available through a public repository such as NCBI Sequence Read Archive (SRA), which is required (link to PLOS submission guidelines and recommended data repositories in link below, as well as two publications for reference).

https://journals.plos.org/plosntds/s/recommended-repositories

1. Palinski RM, Sangula A, Gakuya F, Bertram MR,Pauszek SJ, Hartwig EJ, Smoliga GR, Obanda V,Omondi GP,VanderWaal K, Arzt J.2022.Genome Sequences of Foot-and-Mouth Disease Virus SAT1 Strains Purified from Coinfected Cape Buffalo in Kenya. Microbiol Resour Announc11:e00584-22.https://doi.org/10.1128/mra.00584-22

2. He X, Yin Q, Zhou L, Meng L, Hu W, et al. (2021) Metagenomic sequencing reveals viral abundance and diversity in mosquitoes from the Shaanxi-Gansu-Ningxia region, China. PLOS Neglected Tropical Diseases 15(4): e0009381. https://doi.org/10.1371/journal.pntd.0009381

MINOR COMMENTS:

1. Line 86: References needed for prior studies. Examples of “important factors”.

2. Line 98: No need to overstate by saying that other approaches require “multiple” PCR reactions, while (see line 112) in fact it is only two separate PCR reactions. This does not diminish the usefulness of using a single-tube approach.

3. Line 99: Though not grammatically incorrect, authors should revise every instance of “as well,” at the beginning of a sentence and perhaps substitute it with “Moreover” or “In addition”.

4. Line 149: “adapter” is the preferred spelling when referring to Illumina adapter sequences.

5. Line 168: eluted with 25uL of what?

6. Line 176: Same suggestion as in line 149.

7. Line 286: “genomes” is misspelled.

8. Line 407: The word “sequence” is repeated.

**********

Summary and General Comments

Use this section to provide overall comments, discuss strengths/weaknesses of the study, novelty, significance, general execution and scholarship. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. If requesting major revision, please articulate the new experiments that are needed.

Reviewer #1: This study validates a single-reaction amplicon-based sequencing method for generating high-quality WNV genomes. The method was compared to the commonly used Primal Scheme assay and found to be robust and efficient, especially in handling amplicon dropout. Using this approach, they sequenced >250 complete WNV genomes from mosquito surveillance samples collected in Nebraska. Phylogenetic analysis revealed high genetic diversity with minimal geographic clustering of virus strains in the state during 2023. The genomic sequences and development of a new sequencing approach provides a valuable resource and methodology for the research community. Overall, the paper reads well, the data is clearly presented and is appropriately interpreted in the context of the larger literature. I have only minor comments for consideration.

Specific Comments:

1. Abstract, line 30. It states they generated 100 complete genomes in 2023 but in the manuscript >250 genomes were produced from multiple years. I think it’s worth noting the full extent of their accomplishment in the abstract.

2. Figure 4 is low resolution and could be made larger for easier assimilation.

3. Discussion, line 412 not sure what is meant by “shared transmission cycles”. WNV is maintained in a bird-Culex transmission cycle throughout its geographic range. Perhaps change to “shared transmission networks”?

Reviewer #2: In this work, Freed et al. validate a single-reaction, amplicon-based approach for sequencing coding-complete West Nile virus genomes from mosquito pools collected during routine WNV vector surveillance. The protocol is based on a custom IDT xGen WNV panel designed in collaboration with IDT. The authors use a subset of samples to compare this method to the Primal Scheme approach, a previously established, amplicon-based sequencing protocol commonly used for sequencing WNV. The authors provide sufficient evidence to demonstrate that the IDT xGen WNV panel can generate coding-complete WNV genome sequences, the majority of which exhibit 100% nucleotide identity with those generated by the Primal Scheme assay.

Finally, the authors demonstrated the utility of this validated IDT xGen WNV sequencing protocol by generating coding-complete WNV genomes from 100 mosquito pools collected during the 2023 transmission season in Nebraska. They then used this sequence data to characterize WNV transmission dynamics in this region.

Overall, this work is relevant to researchers interested in implementing a WNV genome sequencing protocol using an efficient approach that uses a single-tube method. The manuscript is well-written and organized, with figures that clearly support its major claims.

There are minor revisions that would improve the clarity of the manuscript, particularly involving the methodology and presentation of the sequencing data. All revisions are categorized into “major comments” and “minor comments” below.

I believe the manuscript is acceptable for publication, provided the authors address these revisions.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

Reviewer #2: No

Figure resubmission:

Reproducibility:--> -->-->To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols-->?>

Attachment

Submitted filename: Freed et al. peer review.docx

pntd.0013931.s005.docx (22.4KB, docx)
PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0013931.r003

Decision Letter 1

Doug Brackney

13 Jan 2026

Dear Dr. Fauver,

We are pleased to inform you that your manuscript 'Genomic Characterization of a Severe West Nile Virus Transmission Season using a Single Reaction Amplicon Sequencing Approach' has been provisionally accepted for publication in PLOS Neglected Tropical Diseases.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases.

Best regards,

Doug E Brackney, PhD

Academic Editor

PLOS Neglected Tropical Diseases

Nigel Beebe

Section Editor

PLOS Neglected Tropical Diseases

Shaden Kamhawi

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-4304-636XX

Paul Brindley

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-1765-0002

***********************************************************

p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; line-height: 16.0px; font: 14.0px Arial; color: #323333; -webkit-text-stroke: #323333}span.s1 {font-kerning: none

PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0013931.r004

Acceptance letter

Doug Brackney

Dear Dr. Fauver,

We are delighted to inform you that your manuscript, "Genomic Characterization of a Severe West Nile Virus Transmission Season using a Single Reaction Amplicon Sequencing Approach," has been formally accepted for publication in PLOS Neglected Tropical Diseases.

We have now passed your article onto the PLOS Production Department who will complete the rest of the publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Editorial, Viewpoint, Symposium, Review, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript will be published online unless you opted out of this process. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

For Research Articles, you will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases.

Best regards,

Shaden Kamhawi

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

Paul Brindley

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Cartoon schematic highlighting the differences in amplification approaches between Primal Scheme (A) and the WNV IDT xGen approach (B).

    (TIF)

    pntd.0013931.s001.tif (210.2KB, tif)
    S2 Fig. Publicly available WNV genomes used in this study by year or collection.

    (TIF)

    S3 Fig. WNV Genomes generated as a part of this study by year of collection.

    (TIF)

    pntd.0013931.s003.tif (1.1MB, tif)
    S1 Table. Comparison of sequencing data and consensus sequence genomes from 10 matched samples prepared and sequenced using different approaches.

    (XLSX)

    pntd.0013931.s004.xlsx (11.5KB, xlsx)
    Attachment

    Submitted filename: Freed et al. peer review.docx

    pntd.0013931.s005.docx (22.4KB, docx)
    Attachment

    Submitted filename: Response to Reviewers PLoS NTD Freed 2025.docx

    pntd.0013931.s007.docx (23.5KB, docx)

    Data Availability Statement

    All data generated this study are available at the European Nucleotide Archive (ENA) under accession PRJEB96341. Final genomes and read-level data are available for each sample. These data have been cross listed on NCBI GenBank and NCBI Short Read Archive and can be found via specific accession numbers for each sample available on our GitHub Page. All code and metadata, including accession numbers, used in this analysis are available on our GitHub: github.com/josephfauver/WNV_Methods_Outbreak_Manuscript.


    Articles from PLOS Neglected Tropical Diseases are provided here courtesy of PLOS

    RESOURCES