Abstract
The patterns of spread of influenza A viruses in local populations in tropical and sub-tropical regions are unclear due to sparsity of representative spatiotemporal sequence data. We sequenced and analyzed 58 influenza A(H3N2) virus genomes sampled between December 2015 and December 2016 from nine health facilities within the Kilifi Health and Demographic Surveillance System (KHDSS), a predominantly rural region, covering approximately 891 km2 along the Kenyan coastline. The genomes were compared with 1571 contemporaneous global sequences from 75 countries. We observed at least five independent introductions of A(H3N2) viruses into the region during the one-year period, with the importations originating from Africa, Europe, and North America. We also inferred 23 virus location transition events between the nine facilities included in the study. International virus imports into the study area were captured at the facilities of Chasimba, Matsangoni, Mtondia, and Mavueni, while all four exports from the region were captured from the Chasimba facility, all occurring to Africa destinations. A strong spatial clustering of virus strains at all locations was observed associated with local evolution. Our study shows that influenza A(H3N2) virus epidemics in local populations appear to be characterized by limited introductions followed by significant local spread and evolution. Knowledge of the viral lineages that circulate within specific populations in understudied tropical and subtropical regions is required to understand the full diversity and global ecology of influenza viruses and to inform vaccination strategies within these populations.
Keywords: Next-generation sequencing, Genomic surveillance, Transmission, Influenza, Kenya
Subject terms: Molecular evolution, Phylogenetics
Introduction
Two subtypes of human influenza type A virus (IAV), A(H3N2) and A(H1N1)pdm09, and two lineages of human influenza type B virus (IBV), Victoria and Yamagata, co-circulated in human populations1 causing multiple annual seasonal epidemics globally2–4. However, the Yamagata lineage has not been detected since the emergence of the COVID-19 pandemic in 2020, raising speculation about its possible extinction5,6. The World Health Organization (WHO) and other public health agencies have not formally declared it extinct. These viruses belong to the family Orthomyxoviridae, which are enveloped, negative-sense, single-stranded RNA viruses with segmented genomes7. IAVs evolve rapidly and undergo immune driven selection through accumulation of amino acid changes, especially at antigenic sites of hemagglutinin (HA) glycoproteins8–10. These amino acid sequence drifts on HA are observed more frequently in A(H3N2) virus than A(H1N1)pdm09 virus11,12. Since 2009, antigenic drift of A(H3N2) viruses has resulted in emergence of several genetic groups (i.e., clades, subclades, and subgroups) globally, for example, clade 1–7 viruses13. Clade 3 viruses are the most genetically diverse and are divided into several genetic groups: subclades 3A, 3B, and 3C; and subgroups 3C.1, 3C.2, 3C.2a, 3C.3, and 3C.3a. We recently characterized A(H3N2) viruses that circulated in coastal Kenya between 2009 and 2017 and revealed co-circulation of multiple A(H3N2) virus genetic groups among hospitalized patients and outpatients5.
While the global spread of seasonal influenza viruses has been studied intensively using phylogenetic and phylogeographic approaches14–20, their local patterns of spread remain less clear21, especially at city-wide or town scale. Although it is important to understand how diseases spread around the globe, local spread is most often the main driver of novel cases of respiratory diseases such as COVID-19 or influenza21. Transmission patterns have been recorded at different levels of human social clustering. Household studies have been performed to investigate person-to-person influenza transmission22. Studies of college campuses using phylogenetic methods have revealed extensive mixing of influenza virus strains among college students23. City-wide and countrywide, transmission of influenza in a season is characterized by majorly multiple virus introductions into cities21 and countries24–26; viruses then spread from multiple geographical locations to multiple geographical destinations following introduction24–26. However, there is a paucity of studies that describe the patterns of spread of influenza on a local scale (city or town) due to sparsity of representative spatiotemporal sequence data from defined sub-populations residing in the same geography within a country21,27. Knowledge of the viral lineages that circulate within specific localities in understudied tropical and subtropical regions is required to understand the full diversity and global ecology of influenza viruses in humans and to inform vaccination strategies within these regions.
Given this gap in knowledge, we studied the patterns of spread of A(H3N2) virus in coastal Kenya using virus next generation sequencing (NGS) data collected over a one-year period between December 2015 and December 2016, along with 1571 global A(H3N2) virus sequence data from 75 countries to provide a phylogenetic context.
Materials and methods
Study design
The samples analysed in this study were collected from nine health facilities within the Kilifi Health and Demographic Surveillance System (KHDSS)28,29. The KHDSS, in Kilifi County, is a predominantly rural location along the Kenyan coastline covering an area of 891 km2; Fig. 1, Panel A. The KHDSS monitors a population of approximately 296,000 residents (2016 census) through household enumeration visits conducted every 4 months29. The KHDSS area has 21 public health facilities (including Kilifi County Hospital (KCH)) that receives outpatients and operates under the Kenya Ministry of Health. In total, nine of these facilities, spread throughout the KHDSS, were selected for this study: Matsangoni, Ngerenya, Mtondia, Sokoke, Mavueni, Jaribuni, Chasimba, Pingilikani, and Junju, Fig. 1, Panel A. Non-residents and residents of the KHDSS presenting to these facilities were included. A total of 5796 nasopharyngeal (NP) swab samples were taken from outpatients across all age groups presenting with acute respiratory illness (ARI), defined as patients presenting with symptoms of cough, sneezing, nasal congestion, difficulty breathing, or increased respiratory rate for age (as defined by the World Health Organization). A comprehensive description of the study area and ARI surveillance at the KHDSS outpatient health facilities has been provided in our previous publications28,29.
Sample handling and molecular screening
NP samples were stored in viral transport medium (VTM) at − 80 °C prior to molecular screening and subsequent laboratory processing29,30. Samples were screened for a range of respiratory viruses, including IAV, using a multiplex (MPX) reverse transcription (RT)-PCR assay employing Qiagen QuantiFast multiplex RT-PCR kit (https://www.qiagen.com/us/products/discovery-and-translational-research/pcr-qpcr-dpcr/real-time-pcr-enzymes-and-kits/probe-based-one-step-qrt-pcr/quantifast-multiplex-rt-pcr-kits); a real-time PCR cycle threshold (Ct) of < 35.0 was used to define virus-positive samples30. A total of 97 IAV positive specimens were available for genome sequencing from the KHDSS facilities.
RNA extraction and multi-segment real-time PCR (M-RTPCR)
Viral nucleic acid extraction from IAV positive samples was performed using QIAamp Viral RNA Mini Kit (Qiagen). Extracted RNA was reverse transcribed, and the entire IAV genome amplified in a single M-RTPCR reaction using the Uni/Inf primer set31. Successful amplification was evaluated by running the products on a 2% agarose gel and visualized on a UV transilluminator after staining with RedSafe Nucleic Acid Staining solution (iNtRON Biotechnology Inc.,).
IAV next generation sequencing and genome assembly
Following M-RTPCR, amplicon purification and library preparation was conducted as previously described5. Libraries were then diluted to 2 nmol/L, pooled, denatured, diluted to 12.5 pmol/L and sequenced on the Illumina MiSeq using 2 × 250 bp paired-end reads using the MiSeq v2 500 cycle kit with 5% Phi-X (Illumina) spike-in. Sequence assembly was performed using the Iterative Refinement Meta-Assembler (IRMA)32 in EDGE Bioinformatics environment33 using IRMA default settings: median read Q-score filter of 30; minimum read length of 125; frequency threshold for insertion and deletion refinement of 0.25 and 0.6, respectively; Smith-Waterman mismatch penalty of 5; and gap opening penalty of 10.
Collation of contemporaneous global sequence dataset
Global comparison datasets for A(H3N2) virus were retrieved from the GISAID EpiFlu™ database (https://platform.gisaid.org/epi3/cfrontend). The datasets were prepared to determine the relatedness of the viruses in this study to those circulating around the world thus understand their global context. Data processing included removal of duplicate sequences, sequences with missing dates (collection date and month), incomplete sequences, and sequences with ambiguous nucleotides (N). The data were organized into a Microsoft Excel database which also stored the associated metadata (country of origin, date of isolation, subtype, and sequence length per segment). In-house python scripts were used in the extraction and manipulation of the data. Additionally, sequences were binned by calendar year for temporal analysis. A final dataset of 1571 global sequences sampled between January 2014 and December 2016 was available (numbers in parenthesis indicate number of sequences): Africa (281); Asia (250); Europe (250); North America (250); South America (290); and Oceania (250); Figure S1. The accession numbers for the global genomes are available in the study’s GitHub repository, (https://github.com/DCollinsOwuor/H3N2_Kilifi_Kenya_molecular_epidemiology_2015-16/blob/main/H3N2_global_WGS_genome_details_2010-2016.txt).
Phylogenetic analysis
Consensus nucleotide sequences were aligned by pairwise alignment using MUSCLE program (https://www.ebi.ac.uk/Tools/msa/muscle/) and translated in AliView v1.2634. The individual genome segments were concatenated into codon-complete genomes using SequenceMatrix v1.835. The full-length hemagglutinin (HA) sequences of all viruses were used to characterize A(H3N2) virus strains into genetic groups using Phylogenetic Clustering using Linear Integer Programming (PhyCLIP)36 and the European CDC Guidelines13. Maximum-likelihood (ML) phylogenetic trees of A(H3N2) virus sequences from Kilifi and global contemporaneous sequences were reconstructed using IQTREE v2.0.7, based on general time-reversible (GTR) model and bootstrap method. The software initiates tree reconstruction after assessment and selection of the best model of nucleotide substitution for alignment. The ML trees were linked to various metadata in R programming software v4.0.2 and visualized using R ggtree v2.4.237. TempEst v1.5.3 was used to assess the presence of a molecular clock signal in the analyzed data, and linear regression of root-to-tip genetic distances against sampling dates were reconstructed (Figure S2). A total of 15 outliers were excluded from the downstream analyses.
Reassortment analysis
The concatenated codon-complete genomes based on sequences of all the eight individual gene segments of A(H3N2) viruses from Kilifi, were used to reconstruct a full-length phylogenetic tree then annotated by genetic group using ggtree v2.4.237. Reassortment events were assessed computationally using the Graph-incompatibility-based Reassortment Finder (GiRaF) tool38. The concatenated genome tree was used to infer the clustering patterns of each gene segment, show lineage classification of all eight individual gene segments, and detect reassortment events.
Estimating A(H3N2) virus imports and exports
The ML tree topology based on Kilifi and global sequences was used to estimate the number of viral transmission events between Kilifi and the rest of the world. TreeTime39 was used to transform the ML tree topology into a dated tree. Outlier sequences were identified by TreeTime and excluded during the process. A migration model was henceforth fitted to the resulting time-scaled phylogenetic tree from TreeTime, mapping the location status of the genomes from the nine health facilities at both the tip and internal nodes. Using the date and location annotated tree topology, the number of transitions between and within Kilifi and the rest of the world were counted and plotted using ggplot2 v3.3.3.
Bayesian tip-association significance (BaTS) test
The strength of geographical clustering among the Kilifi viruses was assessed using the phylogeny-trait association test implemented in the BaTS package40. Each virus genome sequence was assigned a geographic code reflecting its place of sampling within the KHDSS. The overall statistical significance of geographical clustering of all Kilifi sequences was determined by calculating observed and expected association index and parsimony score statistics, where the null hypothesis is that clustering by geographical location is not more than that expected by chance. Additionally, the maximum clade statistic was used to compare the strength of clustering at each of the KHDSS location by calculating the expected and observed mean clade size from each of the nine outpatient locations. A significance level of p ≤ 0.05 was used in all scenarios.
Ethics
Ethical clearance for the study was granted by the Kenya Medical Research Institute (KEMRI)—Scientific and Ethical Review Unit (SERU# 3103) and the University of Warwick Biomedical and Scientific Research Ethics Committee (BSREC# REGO-2015-6102). Informed consent was sought and received from the study participants for the study. All research was performed in accordance with relevant guidelines.
Results
IAV genome sequencing and assembly
Among the 97 IAV positive specimens that were available from the KHDSS, 72 (74.2%) that passed pre-sequencing quality control checks were loaded onto the Illumina MiSeq. A total of 63 (87.5%) codon-complete genomes were successfully assembled following sequencing: 58 (92.1%) A(H3N2) virus and 5 (7.9%) A(H1N1)pdm09 virus sequences, respectively; only the 58 A(H3N2) virus genomes were included in the analysis. The 58 genomes comprised 4 genetic groups: clade 3C.2A (n = 34, 58.6%), subclade 3C.2A2 (n = 3, 5.2%), subclade 3C.2A3 (n = 1, 1.7%), and subgroup 3C.2A1b (n = 20, 34.5%). The socio-demographic characteristics of the patients whose samples were analyzed in this report are shown in Table 1.
Table 1.
Outpatient (KHDSS) | ||
---|---|---|
n | % | |
Age | ||
0–11 month | 9 | (15.52) |
12–59 month | 18 | (31.04) |
6–15 year | 15 | (25.86) |
16–64 year | 13 | (22.41) |
> = 65 year | 3 | (5.17) |
Sex | ||
Female | 26 | (44.83) |
Male | 32 | (55.17) |
Cough | ||
No | 2 | (3.45) |
Yes | 56 | (96.55) |
Difficulty breathing | ||
No | 53 | (91.38) |
Yes | 5 | (8.62) |
Chest-wall in-drawing | ||
No | 53 | (91.38) |
Yes | 5 | (8.62) |
Unable to feed | ||
No | 56 | (96.55) |
Yes | 2 | (3.45) |
Oxygen saturation | ||
< 90 | 2 | (3.45) |
> = 90 | 56 | (96.55) |
Conscious level | ||
Alert/Normal | 56 | (96.55) |
Lethargic | 2 | (3.45) |
Prostrate | – | – |
Unconscious | – | – |
Spatiotemporal representativeness of sequenced samples
IAV was detected throughout the surveillance period in coastal Kenya (except in July, August, and September 2016) with the number of observed cases fluctuating from month-to-month (Fig. 1, Panel B). The proportion of samples from each health facility that were sequenced roughly reflected the overall distribution of positives that were detected in the specific health facilities (Fig. 1, Panel B). Clade 3C.2A and subgroup 3C.2A1b viruses were detected in most of the facilities (8 and 6 of the 9 health facilities, respectively), which suggests that A(H3N2) viruses were in circulation in most locations in Kilifi without geographical restriction to a particular lineage during 2015–16, Fig. 1, Panel C. Subclade 3C.2A2 and subclade 3C.2A3 viruses were characterized by three and one virus detection, respectively consistent with limited spread.
Reassortment analysis
The concatenated codon-complete phylogenetic tree of Kilifi viruses revealed a topology in which the detected A(H3N2) virus genetic groups had consistent lineage classification for each of the eight gene segments, which occurs in the absence of reassortment, Figure S3. These findings were verified computationally using the GiRaF reassortment analysis tool, which showed no evidence of reassortment in any of the A(H3N2) viruses. The full-length codon-complete sequences, which provide a greater phylogenetic resolution, were therefore analyzed to understand the patterns of spread of A(H3N2) virus in Kilifi.
A(H3N2) virus diversity in Kilifi and viral imports and exports
We first assessed how the 58 A(H3N2) virus genomes from Kilifi compared to 1571 genomes sampled from around the world by inferring their phylogenies. The Kilifi genomes span the existing global diversity (Fig. 2A), which suggests exchange (most likely introductions into Kilifi) of viruses with other areas around the globe. We then used ancestral location state reconstruction of the dated phylogeny (Fig. 2A) to infer the number of viral imports and exports. We inferred five importations originating from outside the region (two from Europe, two from Africa, and one from North America), which represent five independent introductions into Kilifi from areas outside the region (Fig. 2B). We also inferred a total of 23 virus location transition events between the nine health facilities, Fig. 2B. International virus imports into the study area were captured at the facilities of Chasimba (n = 13, 56.5%), Matsangoni (n = 6, 26.1%), Mtondia (n = 2, 8.7%), and Mavueni (n = 2, 8.7%), while all four virus exports from the region were captured from Chasimba facility, all occurring to Africa destinations (Fig. 2B).
Phylogeographic structure of A(H3N2) virus in coastal Kenya
To determine the phylogeographic structure in the Kilifi sequence data using a statistical approach, phylogeny-trait association tests were conducted to determine phylogenetic association with sampling location (health facility), Table 2. For strains from Kilifi, the results confirmed a stronger spatial clustering of sequences at all locations (p < 0.001), which is also evident in the time-resolved tree of Kilifi sequences, Figure S4. Additionally, the maximum clade statistic was significant (p ≤ 0.05) in most locations (6 out of 9 locations) reflecting predominantly local evolution in most locations. The estimated differences in observed and expected maximum clade values tentatively suggested that Pingilikani and Sokoke exhibited the least spatial structure (i.e., most mixing; difference < 0) while Matsangoni exhibited the strongest spatial structure (difference of 4).
Table 2.
Association index (95% CI) | Parsimony scores (95% CI) | |||||
---|---|---|---|---|---|---|
Location | Observed | Expected | p values | Observed | Expected | p values |
All | 2.04 (1.77–2.32) | 5.78 (5.07–6.45) | < 0.001 | 25.65 (25–26) | 37.59 (34.99–39.95) | < 0.001 |
Mean maximum clade size (95% CI) | ||||||
---|---|---|---|---|---|---|
Location | Observed | Expected | p values | Difference # | ||
Chasimba | 2.00 (2.00–2.00) | 1.21 (1.00–1.99) | 0.029 | 0.79 | ||
Jaribuni | 1.99 (2.00–2.00) | 1.06 (1.00–1.97) | 0.004 | 0.93 | ||
Junju | 3.99 (4.00–4.00) | 1.13 (1.00–1.99) | 0.009 | 2.86 | ||
Matsangoni | 6.05 (6.00–7.00) | 2.07 (1.01–3.05) | 0.002 | 3.98 | ||
Mavueni | 2.17 (2.00–3.00) | 1.10 (1.00–1.99) | 0.012 | 1.07 | ||
Mtondia | 1.99 (2.00–2.00) | 1.46 (1.00–2.23) | 0.111 | 0.53 | ||
Ngerenya | 1.99 (2.00–2.00) | 1.08 (1.00–1.99) | 0.005 | 0.91 | ||
Pingilikani | 1.00 (1.00–1.00) | 1.04 (1.00–1.01) | > 0.999 | − 0.04 | ||
Sokoke | 1.00 (1.00–1.00) | 1.04 (1.00–1.15) | > 0.999 | − 0.04 |
Discussion
Owing to the paucity of studies that describe the patterns of spread of influenza on a local scale (city or town) due to sparsity of representative spatiotemporal sequence data from defined sub-populations residing in the same geography within a country, we sought to understand the transmission dynamics of influenza in a local community in coastal Kenya. We observed at least five independent introductions of A(H3N2) viruses into the region during the one-year period, with the importations originating from Africa, Europe, and North America. Additionally, we inferred 23 virus location transition events between the nine facilities included in the study. International virus imports into the study area were captured at the facilities of Chasimba, Matsangoni, Mtondia, and Mavueni, while all four exports from the region were captured from the Chasimba facility, all occurring to Africa destinations. Chasimba borders other coastal counties, most of which are major global tourist destination sites in the country. We also observed a strong spatial clustering of virus strains at all locations, which was associated with local evolution. A major analysis of virus sequences collected from inpatient and community participants from this population revealed co-circulation of multiple genetic groups of influenza viruses, with divergence from vaccine strains associated with virus evolution3. The current study aimed to understand the patterns of introduction and spread of influenza A(H3N2) virus in a local community over a one-year period.
On a global scale, annual influenza A(H3N2) virus epidemics are proposed to result from the introduction of new genetic variants from East or Southeast Asia, where viruses circulate through a network of temporally overlapping epidemics14,19 or from temporally migrating metapopulations of influenza viruses that seed global epidemics18. Here, we show that the spread of influenza A(H3N2) virus in a local community in coastal Kenya was characterized by a few virus introductions over a one-year period, followed by frequent inter-facility populations transmission. We have previously reported up to 29 IBV introductions in this population in an epidemic year predominated by IBV in 20166. This reveals that repeated introductions of A(H3N2) virus and IBV into the local population drove the influenza season in 2015–16. However, unlike the IBV epidemic, only a few introductions of A(H3N2) virus instigated the 2015–16 season, which was then sustained through local spread and evolution of circulating viruses. Therefore, the seasonal dynamics of influenza are far more complex. Wider and deeper sampling of viruses from understudied tropical and sub-tropical regions is required for a more complete understanding of the local, regional, and global spread of influenza viruses27,41.
The genetic diversity of A(H3N2) virus in specific regions arising from multiple virus importations and subsequent local evolution, as shown in our study, could lead to predominance of circulating viruses that are not closely matched to previously selected vaccine strains. In countries like Kenya, where virus circulation is year-round42, there also exists unpredictability of which genetic strain may predominate and when. Influenza vaccines with broad coverage (“universal” vaccines) could be key to managing the disease burden in such settings. Currently, Kenya does not have a national influenza vaccination policy43, but it would be important to consider deployment of trivalent influenza vaccines with representative A(H3N2) virus, A(H1N1)pdm09 virus, and B/Victoria for optimal vaccine effectiveness. It will also be important to investigate further whether the use of southern hemisphere or northern hemisphere formulated vaccines could have a place in tropical regions like in Kenya, where virus importations from both hemispheres are common. However, the genetic strains reported in Kenya during one-year study period were reported in other African countries44,45 and in Asia46. Thus, the continuing evolution of A(H3N2) in Kilifi is in part due to the global circulation of the virus.
Inclusion of regional and global sequences deposited in GISAID significantly improved the power of our phylogenomic analyses, which showed that the Kilifi diversity was part of the global continuum. For example, we showed extensive geographical mixing of local strains with global strains from Europe and North America. The use of NGS technology to generate virus sequence data from Kilifi enables further scrutiny of the available data to answer other key molecular epidemiological questions. For example, the sequencing depth achieved with NGS may allow for further analysis of minority variant populations. These might be evolving locally in coastal Kenya region thus undermining the assumption that vaccines matched to globally dominant strains in trivalent/quadrivalent vaccines may necessarily protect against these local strains in a local population. Additionally, we have demonstrated the utility of NGS data analysis to investigate reassortment events in influenza viruses.
The study had some limitations. First, the study utilized samples collected from outpatients only, but would have benefitted from additional analysis of samples collected from hospitalized patients to investigate the patterns of spread of A(H3N2) virus in Kilifi. Second, the prioritized samples were selected based on anticipated probability of successful sequencing inferred from the sample’s viral load as indicated by the diagnosis Ct value. This strategy ultimately avoided NGS of some samples that may have been critical in reconstructing the patterns of spread and transmission clusters of A(H3N2) virus in coastal Kenya. Third, the KHDSS outpatient facilities surveillance collected a maximum of fifteen samples per week per site. This paucity of sequence data from some locations may have introduced bias in inference of the phylogeographic structure of A(H3N2) virus in coastal Kenya.
In conclusion, although there is paucity of studies that describe the patterns of spread of influenza on a local scale (city or town), our findings suggest that considerable influenza virus diversity circulates within defined sub-populations residing in the same geography within a country, including virus lineages that are unique to those locales, as reported for Kilifi. These lineages may be capable of dissemination to other populations through virus location transition events. Further knowledge of the viral lineages that circulate in specific locales is required to understand the main drivers of novel cases of respiratory diseases and to inform vaccination strategies within these populations.
Supplementary Information
Author contributions
D.C.O. conceptualized the study, involved in formal analysis, investigated the study, visualized the data, and wrote the original draft of the manuscript. J.M.N. investigated the study and validated the study. F.M.N. and J.U.N. involved in formal analysis and visualized the data. N.M. curated the data. D.J.N., C.N.A., and S.S.C. conceptualized the study, provided resources, supervised the study, wrote the original draft of the manuscript, and reviewed and edited the manuscript.
Funding
The authors D.C.O., F.M.N. and C.N.A. were supported by the Initiative to Develop African Research Leaders (IDeAL) through the DELTAS Africa Initiative [DEL-15-003]. The DELTAS Africa Initiative is an independent funding scheme of the African Academy of Sciences (AAS)'s Alliance for Accelerating Excellence in Science in Africa (AESA) and supported by the New Partnership for Africa’s Development Planning and Coordinating Agency (NEPAD Agency) with funding from the Wellcome Trust [107769/Z/10/Z] and the UK government. The study was also part funded by a Wellcome Trust grant [1029745] and the USA CDC grant [GH002133]. This paper is published with the permission of the Director of KEMRI.
Data availability
All generated sequence data were deposited in the National Center for Biotechnology Information under the accessions PP529601–PP529666 and in the Global Initiative on Sharing All Influenza Data (GISAID) EpiFluTM database (https://platform.gisaid.org/epi3/cfrontend) under the accession numbers EPI_ISL_393682, EPI_ISL_393684-393703, EPI_ISL_393705-393709, EPI_ISL_393711-393723, EPI_ISL_393725-393753, EPI_ISL_393936-393946, EPI_ISL_393949, EPI_ISL_393951-393953, EPI_ISL_393955-393956, EPI_ISL_393960, EPI_ISL_393963, EPI_ISL_393965-393969, EPI_ISL_394051-394052, EPI_ISL_394107-394112.
Declarations
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-024-74218-6.
References
- 1.World Health Organization. FluNet Summary. 2021 [cited 2021 20 August]; Available from: https://www.who.int/tools/flunet/flunet-summary.
- 2.Katz, M. A. et al. Results from the first six years of national sentinel surveillance for influenza in Kenya, July 2007–June 2013. PLoS One9(6), e98615 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Owuor, D. C. et al. Genetic characterization of influenza A(H3N2) viruses circulating in coastal Kenya, 2009–2017. Influenza Other Respir. Viruses14(3), 320–330 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Nyasimi, F. M. et al. Epidemiological and evolutionary dynamics of influenza B virus in coastal Kenya as revealed by genomic analysis of strains sampled over a single season. Virus Evolut.6(2), veaa045 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Caini, S. et al. Probable extinction of influenza B/Yamagata and its public health implications: A systematic literature review and assessment of global surveillance databases. Lancet Microbe5, 100851 (2024). [DOI] [PubMed] [Google Scholar]
- 6.Koutsakos, M. et al. Influenza lineage extinction during the COVID-19 pandemic?. Nat. Rev. Microbiol.19, 741–742 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hayden, F. & Palese, P. Influenza virus. In Clinical Virology (eds Richman, D. et al.) 1009–1058 (ASM Press, 2017). [Google Scholar]
- 8.Westgeest, K. B. et al. Genomewide analysis of reassortment and evolution of human influenza A(H3N2) viruses circulating between 1968 and 2011. J. Virol.88(5), 2844–2857 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bedford, T. et al. Integrating influenza antigenic dynamics with molecular evolution. eLife3, e01914 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Skehel, J. J. & Wiley, D. C. Receptor binding and membrane fusion in virus entry: The influenza hemagglutinin. Annu. Rev. Biochem.69(1), 531–569 (2000). [DOI] [PubMed] [Google Scholar]
- 11.Tewawong, N. et al. Assessing antigenic drift of seasonal influenza A(H3N2) and A(H1N1)pdm09 viruses. PLoS One10(10), e0139958 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Belongia, E. A. et al. Variable influenza vaccine effectiveness by subtype: A systematic review and meta-analysis of test-negative design studies. Lancet. Infect. Dis16, 942–951 (2016). [DOI] [PubMed] [Google Scholar]
- 13.ECDC. Influenza Virus Characterisation Reports, summary Europe. 2021; Available from: https://www.ecdc.europa.eu/en/seasonal-influenza/surveillance-and-disease-data/influenza-virus-characterisation.
- 14.Bedford, T. et al. Global migration dynamics underlie evolution and persistence of human influenza A(H3N2). PLoS Pathog.6(5), e1000918 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bedford, T. et al. Global circulation patterns of seasonal influenza viruses vary with antigenic drift. Nature523(7559), 217–220 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lemey, P., Suchard, M. & Rambaut, A. Reconstructing the initial global spread of a human influenza pandemic: A Bayesian spatial-temporal model for the global spread of H1N1pdm. PLoS Curr. Biol.1, RRN1031 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lemey, P. et al. Unifying viral genetics and human transportation data to predict the global transmission dynamics of human influenza H3N2. PLoS Pathog.10(2), e1003932 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Bahl, J. et al. Temporally structured metapopulation dynamics and persistence of influenza A H3N2 virus in humans. Proc. Natl. Acad. Sci. USA108(48), 19359–19364 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kosakovsky Pond, S. L. et al. Network analysis of global influenza spread. PLoS Comput. Biol.6(11), e1001005 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Russell, C. A. et al. The global circulation of seasonal influenza A(H3N2) viruses. Science320(5874), 340–346 (2008). [DOI] [PubMed] [Google Scholar]
- 21.Müller, N. F. et al. Characterising the epidemic spread of influenza A/H3N2 within a city through phylogenetics. PLOS Pathog.16(11), e1008984 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.McCrone, J. T. et al. Stochastic processes constrain the within and between host evolution of influenza virus. eLife10.7554/eLife.35962 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Holmes, E. C. et al. Extensive geographical mixing of 2009 human H1N1 influenza A virus in a single university community. J. Virol.85(14), 6923–6929 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Baillie, G. J. et al. Evolutionary dynamics of local pandemic H1N1/2009 influenza virus lineages revealed by whole-genome analysis. J. Virol.86(1), 11–18 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Owuor, D.C., et al., Characterizing the countrywide epidemic spread of influenza A (H1N1) pdm09 virus in Kenya between 2009 and 2018. MedRxiv, (2021). [DOI] [PMC free article] [PubMed]
- 26.Pollett, S. et al. Phylogeography of influenza A(H3N2) virus in peru, 2010–2012. Emerg. Infect. Dis.21(8), 1330–1338 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ng, S. & Gordon, A. Influenza burden and transmission in the tropics. Curr. Epidemiol. Rep.2(2), 89–100 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Scott, J. A. et al. Profile: The Kilifi health and demographic surveillance system (KHDSS). Int. J. Epidemiol.41(3), 650–657 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Nyiro, J. U. et al. Surveillance of respiratory viruses in the outpatient setting in rural coastal Kenya: Baseline epidemiological observations. Wellcome Open Res.3, 89. 10.12688/wellcomeopenres.14662.1 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hammitt, L. L. et al. Added value of an oropharyngeal swab in detection of viruses in children hospitalized with lower respiratory tract infection. J. Clin. Microbiol.49(6), 2318–2320 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Zhou, B. & Wentworth, D. E. Influenza A virus molecular virology techniques. Methods Mol. Biol.865, 175–192 (2012). [DOI] [PubMed] [Google Scholar]
- 32.Shepard, S. S. et al. Viral deep sequencing needs an adaptive approach: IRMA, the iterative refinement meta-assembler. BMC Genom.17, 708 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Li, P. E. et al. Enabling the democratization of the genomics revolution with a fully integrated web-based bioinformatics platform. Nucleic Acids Res.45(1), 67–80 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Larsson, A. AliView: A fast and lightweight alignment viewer and editor for large datasets. Bioinformatics30(22), 3276–3278 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Vaidya, G., Lohman, D. J. & Meier, R. SequenceMatrix: Concatenation software for the fast assembly of multi-gene datasets with character set and codon information. Cladistics27, 171–180 (2011). [DOI] [PubMed] [Google Scholar]
- 36.Han, A. X. et al. Phylogenetic clustering by linear integer programming (PhyCLIP). Mol. Biol. Evol.36(7), 1580–1595 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Yu, G. et al. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol. Evolut.8(1), 28–36 (2017). [Google Scholar]
- 38.Nagarajan, N. & Kingsford, C. GiRaF: Robust, computational identification of influenza reassortments via graph mining. Nucleic Acids Res.39(6), e34–e34 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Sagulenko, P., Puller, V. & Neher, R. A. TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol.4(1), vex042 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Parker, J., Rambaut, A. & Pybus, O. G. Correlating viral phenotypes with phylogeny: Accounting for phylogenetic uncertainty. Infect. Genet. Evolut.8(3), 239–246 (2008). [DOI] [PubMed] [Google Scholar]
- 41.Viboud, C. et al. Contrasting the epidemiological and evolutionary dynamics of influenza spatial transmission. Philos. Trans. R Soc. Lond. B Biol. Sci.368(1614), 20120199 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Emukule, G. O. et al. Influenza activity in Kenya, 2007–2013: Timing, association with climatic factors, and implications for vaccination campaigns. Influenza Other Respir. Viruses10(5), 375–385 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Dawa, J. et al. Developing a seasonal influenza vaccine recommendation in Kenya: Process and challenges faced by the National Immunization Technical Advisory Group (NITAG). Vaccine37(3), 464–472 (2019). [DOI] [PubMed] [Google Scholar]
- 44.Monamele, G. C. et al. Genetic and antigenic characterization of influenza A(H3N2) in Cameroon during the 2014–2016 influenza seasons. PLoS One12(9), e0184411 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Tivane, A. et al. Antigenic and genetic characterization of influenza viruses isolated in Mozambique during the 2015 season. PLoS One13(7), e0201248 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Suntronwong, N. et al. Genetic and antigenic divergence in the influenza A(H3N2) virus circulating between 2016 and 2017 in Thailand. PLoS One12(12), e0189511 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All generated sequence data were deposited in the National Center for Biotechnology Information under the accessions PP529601–PP529666 and in the Global Initiative on Sharing All Influenza Data (GISAID) EpiFluTM database (https://platform.gisaid.org/epi3/cfrontend) under the accession numbers EPI_ISL_393682, EPI_ISL_393684-393703, EPI_ISL_393705-393709, EPI_ISL_393711-393723, EPI_ISL_393725-393753, EPI_ISL_393936-393946, EPI_ISL_393949, EPI_ISL_393951-393953, EPI_ISL_393955-393956, EPI_ISL_393960, EPI_ISL_393963, EPI_ISL_393965-393969, EPI_ISL_394051-394052, EPI_ISL_394107-394112.