Skip to main content
eLife logoLink to eLife
. 2021 Sep 13;10:e67852. doi: 10.7554/eLife.67852

Multiple introductions of multidrug-resistant typhoid associated with acute infection and asymptomatic carriage, Kenya

Samuel Kariuki 1,2,†,, Zoe A Dyson 2,3,4,5,, Cecilia Mbae 1, Ronald Ngetich 1, Susan M Kavai 1, Celestine Wairimu 1, Stephen Anyona 1, Naomi Gitau 1, Robert Sanaya Onsare 1, Beatrice Ongandi 1, Sebastian Duchene 6, Mohamed Ali 7, John David Clemens 8, Kathryn E Holt 4,5, Gordon Dougan 3
Editors: Joseph Lewnard9, Bavesh D Kana10
PMCID: PMC8494480  PMID: 34515028

Abstract

Background:

Understanding the dynamics of infection and carriage of typhoid in endemic settings is critical to finding solutions to prevention and control.

Methods:

In a 3-year case-control study, we investigated typhoid among children aged <16 years (4670 febrile cases and 8549 age matched controls) living in an informal settlement, Nairobi, Kenya.

Results:

148 S. Typhi isolates from cases and 95 from controls (stool culture) were identified; a carriage frequency of 1 %. Whole-genome sequencing showed 97% of cases and 88% of controls were genotype 4.3.1 (Haplotype 58), with the majority of each (76% and 88%) being multidrug-resistant strains in three sublineages of the H58 genotype (East Africa 1 (EA1), EA2, and EA3), with sequences from cases and carriers intermingled.

Conclusions:

The high rate of multidrug-resistant H58 S. Typhi, and the close phylogenetic relationships between cases and controls, provides evidence for the role of carriers as a reservoir for the community spread of typhoid in this setting.

Funding:

National Institutes of Health (R01AI099525); Wellcome Trust (106158/Z/14/Z); European Commission (TyphiNET No 845681); National Institute for Health Research (NIHR); Bill and Melinda Gates Foundation (OPP1175797).

Research organism: S. enterica serovar Typhi

Introduction

Typhoid fever, caused by Salmonella enterica serovar Typhi (S. Typhi) is estimated to involve ~21.7 million illnesses and 216,000 deaths annually Crump and Mintz, 2010; Mogasale et al., 2014, with most of these occurring in lower and middle-income countries. In Africa, overall typhoid is now estimated to have an average annual pooled incidence rate of 112.1 (95% CI, 46.7–203.5) cases per 100,000 people Hopewell and Graham, 2014; Marchello et al., 2019 with a case fatality rate (CFR) of 5.4% (2.7–8.9) Marchello et al., 2020.

Control of typhoid is impeded by asymptomatic carriage, which historically was estimated to account for 2–5% of individuals infected Levine et al., 1982; Parry et al., 2002; Thanh Duy et al., 2020. However, there is a paucity of recent data on the frequency of carriers in different settings including sub-Saharan Africa (SSA) as well as the extent to which they contribute to disease transmission Gauld et al., 2018. A recent modelling study using data generated in Blantyre, Malawi, identified multidrug resistant (MDR) S. Typhi and/or the emergence of the lineage known as H58 (genotype 4.3.1) as a primary driver of an increasing number of typhoid fever cases. In this study, an estimated 45–95% of typhoid transmission was attributed to carriers Pitzer et al., 2015; Saad et al., 2017. S. Typhi H58 Wong et al., 2015 is a globally disseminated clade frequently associated with MDR (defined as resistance to chloramphenicol, ampicillin and co-trimoxazole) and an increasing frequency of reduced susceptibility to fluoroquinolones. H58 S. Typhi are rapidly displacing other lineages in many endemic areas Wong et al., 2015; Feasey et al., 2015; Kariuki et al., 2010; Park et al., 2018 and a new subclade that is extensively drug resistant (XDR), displaying resistance to ciprofloxacin and fluoroquinolones in addition to MDR, has been described in Pakistan Klemm et al., 2018.

Recent reports of epidemics of typhoid fever in SSA suggest that the disease may be becoming more widespread in the region Crump and Mintz, 2010; Park et al., 2018; Hendriksen et al., 2015; Lutterloh et al., 2012; Marks et al., 2017; Neil et al., 2012. In Kenya, the rapid growth of population has led to a huge rural-to-urban migration with people increasingly living in informal settlements where clean water and good sanitation are a major challenge Kyobutungi et al., 2008; Mberu et al., 2016. The incidence of typhoid in one such informal settlement, Kibera in Nairobi, was estimated at 247 cases per 100,000 with the highest rates in children 5–9 years old (596 per 100,000) Breiman et al., 2012. For the last two decades, the majority of cases of typhoid in Kenya have been MDR, with reduced susceptibility to fluoroquinolones rising in frequency Kariuki et al., 2010; Park et al., 2018; Mutai et al., 2018. Previously, we showed that S. Typhi H58 gained a foothold in Kenya in the 1990s, constituting >75% of the circulating S. Typhi we have characterised since 2001 Kariuki et al., 2010. Two H58 lineages were detected; lineage I being isolated between 1988 and 2008 and lineage II from 2004 onwards. We have previously observed carriage rates of 6% in households where typhoid cases were detected Kariuki et al., 2010, however these S. Typhi isolates were not characterised genetically and the role of asymptomatic carriers in transmission dynamics of typhoid in the community is still poorly understood. Over the past 7 years, we have been intensively studying typhoid and other invasive bacterial diseases in Mukuru, an informal settlement 15 km east of the city of Nairobi, Kenya. The prevalence of S. Typhi infections among 16,236 children was 1.4% (CI: 1.2–1.6%), and higher amongst males (1.8% vs 1.2% for females), with a high proportion of infections noted among older children 5–8 years in age Mbae et al., 2020. Risk factors predictive of S. Typhi infection in Mukuru were multiple but were predominantly associated with contaminated water sources and sanitation issues Mbae et al., 2020. Here, we analysed typhoid cases in Mukuru clinically and microbiologically, and identified frequent asymptomatic carriage among children below 16 years of age. By exploiting whole genome sequencing (WGS) and geospatial mapping we characterised the population structure and transmission dynamics of S. Typhi in this location.

Materials and methods

Key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
gene (Salmonella enterica serotype Typhi – S. Typhi) Wild type This study PRJEB19289 ENA sequence accession bank
Strain, strain background (S. Typhi)Wild type Wild type This study Supplementary file 1
Strain, strain background (Escherichia coli) ATCC ATCC25922 https://www.atcc.org/ › products › 25,922
Sequence-based reagent Primer vi-F This paper PCR primers GTTATTCAGC
ATAAGGAG
Sequence-based reagent Primer Vi-R This paper PCR Primers CTTCCATACCA
CTTTCCG
Sequence-based reagent Primer prt-F This paper PCR Primers CTTGCTATGGA
AGACATAACGAACC
Sequence-based reagent Primer prt-R This paper PCR Primers CGTCTCCATCA
AAAGCTCCATAGA
Commercial assay or kit Bactec Media Becton-Dickinson BACTEC 9050 Blood Culture System Blood culture media
Commercial assay or kit Selenite F/MacConkey Oxoid Ltd http://www.oxoid.com CM0395/ CM0007 Selective enrichment/
selective media
Commercial assay or kit Salmonella-Shigella agar Oxoid Ltd http://www.oxoid.com CM0099 Selective agar
Commercial assay or kit Salmonella antisera Murex Diagnostics,
Dartford, UK
https://www.dnb.com
Salmonella typing antisera
Commercial assay or kit Wizard Genomic DNA Extraction Kit https://worldwide.promega.com Whole Genome DNA extraction Cat#A1120
Chemical compound, drug Antimicrobial susceptibility test discs in cartridges Oxoid Ltd
http://www.oxoid.com
Assorted antimicrobial discs for susceptibility testing
Software, algorithm Kraken Genome Biol
2014; 15: R46–12.
ultrafast metagenomic sequence classification using exact alignment
Software, algorithm Multi-locus sequence typing (MLST) Genome Med
2014; 6: 90.
Rapid genomic surveillance for public health and hospital microbiology labs.
Software, algorithm BIGSdb software BMC Bioinformatics
2010; 11: 595.
Scalable analysis of bacterial genome variation at the population level. Genomic analysis
software
Software, algorithm Pathogen-watch for AMR prediction. Nat Commun
2021; 12: 2879–12.
AMR prediction software analysis
Software, algorithm Maximum Likelihood Analytical Tool Bioinformatics
2014; 30: 1312–3.
RAxML (v8.2.9) version 8 A tool for phylogenetic
analysis and post-
analysis of large
phylogenies
Software, algorithm MicroReact Tool (https://microreact.org/project/wViqmaRdZuFVEb6yk4i1jU) Interactive global H58 phylogeny
Software, algorithm Bandage assay Bioinformatics 2015; 31: 3350–2. Interactive visualisation of de novo genome assemblies.
Software, algorithm ISMapper BMC Genomics 2015; 16: 667. Genomics tool for phylogenetics

Study site

Mukuru informal settlement is situated East of Nairobi city, about 15 km from the city centre. It is one of the largest slums in the city with a population of around 250,000 people KNBS KNBOS, 2009. The informal settlement is made up of improvised temporary dwellings often made from scrap materials, such as corrugated metal sheets, plywood, and polythene-sheets Olack et al., 2011. In addition to poverty, a number of factors associated with informal settlements, including overcrowding, substandard housing, unclean and insufficient quantities of water, and inadequate sanitation, contribute to a high incidence of infectious diseases and increased mortality among children under five years Kyobutungi et al., 2008; Mutisya and Yarime, 2011. Mukuru informal settlement is divided into eight villages; Mukuru Lunga-Lunga, Mukuru kwa Sinai, Mukuru kwa Ruben, Mukuru kwa Njenga, Mukuru Kayaba, Fuata Nyayo, Jamaica, and Mukuru North. This study was carried out in two of the large villages, Mukuru kwa Njenga and Mukuru kwa Ruben, with a combined population of 150,000. Spatial mapping of the two villages was conducted using the Universal Transverse Mercator system Tinline and Gregory, 1988, and patient details collected as described previously Mbae et al., 2020.

The two villages in the informal settlement are served by three outpatient clinics: Ruben Health Centre located in the Ruben village (zone named Simba cool, serves approximately 30% of the population), Missionaries of Mary Located in Kwa Njenga village (zone named Vietnam, serves approximately 45% of the population), and County Government Clinic in Kwa Njenga village (Zone named MCC and serves approximately 25% of the population). The fourth site, Mbagathi District Hospital, is located on the western side of Nairobi city, 5 km from city centre and was used as a referral facility. Participants living outside of the mapped demographic surveillance site (DSS) who came to seek medical services in any of the three study site health facilities or at Mbagathi District Hospital were included for the purpose of tracking typhoid cases and carriers treated at the facilities, but are reported separately in the Results section.

Recruitment of clinical typhoid fever cases and asymptomatic typhoid carriers

Typhoid fever cases and asymptomatic carriers presented in this study were identified and recruited as part of a larger study on surveillance and genomics of invasive Salmonella disease in children and young adults less than 16 years of age Mbae et al., 2020; Kariuki et al., 2020. Children presenting as outpatients at the three study clinics and Mbagathi District Hospital between August 2013 and November 2016 were triaged to identify those with fever, headache and/or diarrhoea for recruitment into the study as potential cases. Patients with current fever ( ≥ 38°C) and reportedly febrile for ≥3 days were considered potential typhoid cases and assessed via blood or stool culture. The primary typhoid case definition (data presented in Table 1) was children aged 0–16 years with ≥3 days fever ≥38°C and positive blood or stool culture for S. Typhi (see bacterial culture methods below).

Table 1. Culture-positive typhoid cases and asymptomatic carriers.

Note the values reported for logistic regressions are from multivariate models including all indicated covariates, fit separately for cases and controls.

Participants tested, N 4,670 8,549
 Male, N (%) 2,497 (53.5%) 4,260 (49.8%)
 Female, N (%) 2,173 (46.5%) 4,289 (50.2%)
S. Typhi culture positive, N (%) 148 (3.2%) 95 (1.1%)
 Male, N (%) 99 (4.0%) 49 (1.15%)
 Female, N (%) 49 (2.3%) 46 (1.1%)
WGS confirmed S. Typhi, N (%) 100 (2.1%) 55 (0.64%)
Logistic regression for S. Typhi culture positive
 Year of isolation, OR (p-value) 1.19 (0.072) 0.94 (0.586)
 Male Sex, OR (p-value) 1.81 (0.0008*) 1.08 (0.699)
 Age in years, OR (p-value) 1.08 (0.0005*) 1.02 (0.403)
Logistic regression for S. Typhi culture positive, males only
 Year of isolation, OR (p-value) 1.19 (0.147) 1.09 (0.576)
 Age in years, OR (p-value) 1.11 (0.0001*) 1.06 (0.082)
Logistic regression for S. Typhi culture positive, females only
 Year of isolation, OR (p-value) 1.19 (0.296) 0.81 (0.158)
 Age in years, OR (p-value) 1.03 (0.551) 0.98 (0.534)
Logistic regression for S. Typhi WGS positive
 Year of isolation, OR (p-value) 1.15 (0.209) 0.59 (0.0003*)
 Male Sex, OR (p-value) 1.59 (0.028*) 0.93 (0.780)
 Age in years, OR (p-value) 1.11 (0.0001*) 1.03 (0.326)
Logistic regression for S. Typhi WGS positive, males only
 Year of isolation, OR (p-value) 1.18 (0.261) 0.74 (0.133)
 Age in years, OR (p-value) 1.15 (0.00002*) 1.07 (0.131)
Logistic regression for S. Typhi WGS positive, females only
 Year of isolation, OR (p-value) 1.12 (0.558) 0.48 (0.0003*)
 Age in years, OR (p-value) 1.03 (0.518) 1.0 (0.92)

During the study period, age-matched controls were recruited from children without current fever or diarrhoea attending the same health facilities for healthy mother and child clinics (e.g. for vaccination and nutritional advice). Those with S. Typhi-positive stool culture were designated as asymptomatic typhoid carriers as described previously Mbae et al., 2020. Hence, the inclusion criteria for asymptomatic typhoid carriers (data presented in Table 1) were children aged 0–16 years with no diarrhoea, no current fever, and no recent fever history, with stool culture positive for S. Typhi (see bacterial culture methods below). The total number of participants was computed on the basis of a 4% prevalence rate of typhoid from previous study Kariuki et al., 2010. (A structured questionnaire was used to collect demographic data for both cases and controls recruited into the study as described previously Mbae et al., 2020).

All isolates cultured from participants and identified as Salmonella were archived and later revived for WGS as detailed below. The sequence data revealed some mis-identification of Salmonella serotypes (Figure 1, Supplementary file 1), hence for genomic analyses we included all cases and controls whose cultures were found to be S. Typhi positive by WGS rather than those identified as S. Typhi positive by serotype in the microbiology laboratory.

Figure 1. Flow chart of samples collected and analysed.

Figure 1.

Red boxes indicate bacterial isolates that could not be included in downstream genetic analyses, grouped by reason for exclusion.

Bacterial culture

For blood culture, 1–3 mL for children < 5 years of age and 5–10 mL for those 5–16 years of age was collected in a syringe, placed into Bactec media bottles (Becton-Dickinson, New Jersey, USA), incubated at 37°C in a computerised BACTEC 9,050 Blood Culture System (Becton-Dickinson), and subcultured after 24–48 hr onto blood, chocolate and MacConkey agar (Oxoid, Basingstoke, UK) plates. All isolates, whether from cases or carriers were cultured on selenite F (Oxoid) broth aerobically at 37°C overnight. Broth cultures were then subcultured on MacConkey agar and Salmonella-Shigella agar (Oxoid) and incubated at 37°C overnight. Blood and stool isolates were identified using a series of standard biochemical and serological tests as described previously Mbae et al., 2020. Briefly, colonies of S. Typhi identified from biochemical and PCR testing were subjected to serological identification as S. Typhi using the slide agglutination technique and applying monovalent antisera (Murex Diagnostics, Dartford, UK). A drop of the antisera to be tested was mixed with a bacterial smear on a slide to observe for the presence or absence of agglutination in two minutes.

Antimicrobial susceptibility testing

Antimicrobial susceptibility testing was performed using the disk diffusion technique for ampicillin 10 µg, tetracycline 30 µg, co-trimoxazole 25 µg, chloramphenicol 30 µg, cefpodoxime 30 µg, ceftazidime 30 µg, ceftriaxone 30 µg, cefotaxime 30 µg, ciprofloxacin 5 µg, and nalidixic acid 10 µg as described previously Mbae et al., 2020. Results were interpreted according to the 2017 guidelines provided by the Clinical and Laboratory Standards Institute (CLSI) Patel et al., 2015.

Whole genome sequencing

All Salmonella isolated from cases and controls were subcultured at the end of the study for DNA extraction and WGS. These included 243 cultures identified as S. Typhi from cases (85 from blood and 63 from stool) and 95 from controls (all from stool) (see Figure 1, (1) Supplementary file 1), which are the subject of this study (non-typhoidal Salmonella data is reported elsewhere Kariuki et al., 2020). Twelve S. Typhi case isolates and 10 control isolates could not be revived and were not further analysed. DNA was extracted using the Wizard Genomic DNA Extraction Kit (Promega, Wisconsin, USA) and shipped on ice to the Wellcome Sanger Institute for sequencing using the Illumina platform as described previously Park et al., 2018. A total of 217 S. Typhi DNA samples were successfully sequenced (two were of insufficient quality to construct sequencing libraries, or failed sequencing). Non-S. Typhi bacterial DNA sequences were detected in 75 samples (34.6%; organisms detected are shown in Supplementary file 1), and 11 sequences originally identified as other Salmonella serotypes were later found to be S. Typhi with genomic data. Two sequences showed the presence of S. Typhi, but at low depth, and were subsequently omitted from further genomic analyses, leaving genome data for 153 S. Typhi isolates for further analysis.

The taxonomic identities of non-typhoidal Salmonella spp. were determined using Kraken (v0.10.6) Wood and Salzberg, 2014, multi-locus sequence typing (MLST) with SRST2 Inouye et al., 2014; Jolley and Maiden, 2010, as well as the Salmonella In Silico Typing Resource (SISTR) and Speciator (both available via Typhi Pathogenwatch; https://pathogen.watch/) Argimón et al., 2021.

Phylogenetic and SNP analysis of S. Typhi isolates

For SNP analysis, paired-end reads from 153 S. Typhi isolates were mapped to the reference sequence of S. Typhi CT18 (accession number: AL513382) Parkhill et al., 2001 using the RedDog mapping pipeline (v1beta.10.3), available at https://github.com/katholt/reddog (copy archived at swh:1:rev:90707ace56d6189997526273cf97013d84b84d90; Holt, 2021a) and detailed in supplementary methods. Read alignments were used to assign isolates to previously defined lineages according to the extended genotyping framework Wong et al., 2016; Britto et al., 2018 with the GenoTyphi pipeline (available at https://github.com/katholt/genotyphi, copy archived at swh:1:rev:f489d0e004e30bdca683c76c25ac7f4d79162791; Holt, 2021b). Unique SNPs defining three novel lineages were identified from the genome-wide SNP allele table and added to the GenoTyphi scheme to facilitate easy identification of these lineages in future studies (details in supplementary methods and results).

Phylogenetic analyses were restricted to WGS-confirmed pure cultures of S. Typhi H58 (genotype 4.3.1, n = 128). For some analyses, an additional 1076 S. Typhi H58 genomes from previously published WGS studies of global and African isolates Wong et al., 2015; Park et al., 2018; Wong et al., 2016; Pham Thanh et al., 2016 were also included for context, along with 61 non-H58 genomes for phylogenetic outgroup rooting, using the same mapping approach detailed above (see Supplementary file 2 for full list of genomes analysed and their public data accessions). SNPs called in phage regions or repetitive sequences were filtered from the alignment (details in supplementary methods). Analysis with Gubbins (v2.3.2) identified two regions affected by recombination (coordinates 954,115–970,731 (genes STY0961-STY0976), and 1,438,676–1,467,273 (genes STY1485-STY1508) in the CT18 reference genome), which were excluded from the alignment Croucher et al., 2015. This resulted in a final set of 8635 SNPs. From this global alignment we extracted a separate SNP alignment for the set of 239 Kenyan S. Typhi 4.3.1 genomes (n = 128 from this study and n = 111 from published studies, see Supplementary file 3; Feasey et al., 2015; Yu et al., 2016), the resulting alignment of length 489 SNPs was used for temporal analyses (described below and in supplementary methods).

Maximum likelihood (ML) phylogenetic trees were inferred from SNP alignments using RAxML (v8.2.9) Stamatakis, 2014 (as detailed in supplementary methods) and the resulting trees were visualised using Microreact (interactive global H58 phylogeny available at: https://microreact.org/project/wViqmaRdZuFVEb6yk4i1jU) Argimon et al., 2016.

Pairwise SNP distances were calculated from the SNP alignment using the dist.dna() function in the R package ape (v5.4.1) Paradis et al., 2004. Terminal branch lengths were extracted from phylogenies using R package ggtree (v2.2.4) Yu et al., 2016. Non-synonymous mutations occurring in terminal branches were detected using SNPPar (v0.4.2dev) Edwards et al., 2020 and grouped by function based on the gene in which they were found, according to the functional classification scheme in the genome annotation of S. Typhi CT18 Thanh Duy et al., 2020; Parkhill et al., 2001.

Phylodynamic analysis

To investigate temporal signal and date the introduction of S. Typhi H58 into Kenya based on the 239 available Kenyan genomes (n = 128 from this study, and n = 111 from previous studies Park et al., 2018; Wong et al., 2016), we used several methods. First, we used TempEst (v1.5.1) Rambaut et al., 2016 to assess temporal structure (i.e. clock-like evolution) by conducting a regression analysis of the root-to-tip branch distances of the ML tree as a function of sampling date, and later a date-randomisation test (full details of temporal signal assessment and model selection are provided in supplementary methods). To estimate divergence dates for the three S. Typhi H58 sublineages we detected in Kenya (EA1-3), we used BEAST (v1.10) Suchard et al., 2018 to fit a phylodynamic model to the SNP alignment and isolation dates as described in supplementary methods. The resultant MCC tree was visualised using ggtree (v2.2.4) Yu et al., 2016 and Microreact Argimon et al., 2016 (interactive phylogeny available at: https://microreact.org/project/I2KUoasUB).

Genomic determinants of antimicrobial resistance

The read mapping-based allele typer SRST2 (v0.2.0) Inouye et al., 2014 was used to detect the presence of plasmid replicons (PlasmidFinder database Carattoli et al., 2014) and antimicrobial resistance (AMR) genes (ARGannot database Gupta et al., 2014). Where AMR genes were observed without evidence of a known AMR plasmid, raw read data was assembled using Unicycler (v0.4.7) Wick et al., 2017 and then examined using Bandage (v0.8.1) Wick et al., 2015 to confirm the chromosomal location and composition of AMR-associated transposons. As the Tn2670-like composite transposon commonly associated with the acquisition of MDR genes in S. Typhi is mediated by IS1 translocation Wong et al., 2015, ISMapper (v2.0) Hawkey et al., 2015 was also used to identify the location of IS1 insertion sequences in the S. Typhi chromosome as described in supplementary methods. SRST2 was used to determine IncHI1 plasmid MLST (multi-locus sequence types) sequence types (pMLST), and minor alleles were identified by mapping to the plasmid pAKU1 reference sequence (accession number AM412236) in the same manner as described above for the S. Typhi chromosome (details in supplementary methods and Supplementary files 4-5). Point mutations located within the quinolone resistance determining region (QRDR) of genes gyrA, gyrB, and parC associated with reduced susceptibility to fluoroquinolones Pham Thanh et al., 2016 were detected using GenoTyphi Wong et al., 2016; Britto et al., 2018 as detailed in supplementary methods.

Statistical and spatial analysis

All statistical analyses unless otherwise stated were carried out using R (v4.0.2). Details of specific functions within R packages used for individual analyses are available in supplementary methods.

Nucleotide sequence and read data accession numbers

Raw Illumina sequence reads have been submitted to the European Nucleotide Archive (ENA) under accession PRJEB19289. Individual sequence accession numbers are listed in Supplementary file 1.

Results

Detection of S. Typhi cases and asymptomatic carriers

From August 2013-November 2016, a total of 4670 febrile children were recruited across the four study sites and subjected to blood and/or stool culture. S. Typhi was identified in cultures from 148 children (3.2%); the annual rate was steady over the study period but significantly higher amongst males (4.0% vs 2.3%, p = 0.0008, see Table 1). The odds of S. Typhi positive culture increased significantly with age (OR 1.08, p = 0.0005) but the effect was restricted to males (see Table 1), amongst whom the isolation rate was 1.3% in those ≤1 year, 2.0% in those aged 1–7 years, and 3.4% in those >7 years old (compared with 0.95%, 1.1%, and 0.94%, respectively amongst females). A total of 8549 age-matched control participants (with no current diarrhoea and no recent fever history) were recruited and subjected to stool culture. S. Typhi was identified in cultures from n = 95 (1.1%); these are considered asymptomatic carriers. S. Typhi culture positivity amongst controls was not significantly associated with age or sex and was stable over the study period (see Table 1 and Supplementary file 6). Significant associations identified from culture data were also observed when using WGS confirmed infections only, and no significant statistical association was found between phenotypic or genotypic AMR patterns and case/control status, age, or sex.

Global population structure and antimicrobial resistance profiles of kenyan S. Typhi

The presence of S. Typhi was confirmed by WGS in 94 cases (64%) and 50 controls (53%) that were originally identified as S. Typhi via microbiological culture (Figure 1, Supplementary file 1). S. Typhi genotype 4.3.1 (H58) was dominant throughout the study (n = 145, 95%), amongst both cases and controls (Table 2). Five other genotypes were detected: 2.2.2 (n = 1), 2.5.0 (n = 3), 3.0.0 (n = 3), and 4.1.1 (n = 1), see Table 2.

Table 2. Genotypes and AMR profiles for 153 sequenced S. Typhi isolates.

Percentages indicate genotype frequencies amongst cases or controls (first two columns); or frequency of antimicrobial resistance determinants amongst isolates of a given genotype (remaining columns). MDR, multi-drug resistant; L1, lineage I; L2, lineage II.

Genotype Cases Controls MDR GyrA mutation GyrB mutation
Plasmid Chromosome S83F S83Y D87G S464F
All 99 54 83 33 3 17 2 75
2.2.2 0 1 (1.9%) 0 0 0 0 0 0
2.5.0 1 (1.0%) 2 (3.7%) 0 0 0 0 0 0
3.0.0 2 (2.0%) 1 (1.9%) 0 0 0 0 0 0
4.1.1 0 1 (1.9%) 0 0 0 0 0 0
4.3.1 (H58) 96 (97%) 49 (91%) 83 (57%) 33 (23%) 4(2.8%) 19(13%) 2(1.4%) 75(51.7%)
H58 subgroups
EA1 (L1) 35 (35%) 20 (37%) 29 (53%) 17 (31%) 4(7.3%) 2(3.6%) 2(3.6%) 2(3.6%)
EA2 (L2) 46 (46%) 27 (50%) 54 (74%) 0 0 0 0 73(100%)
EA3 (L2) 15 (15%) 2 (3.7%) 0 16 (94%) 0 17(100%) 0 0

The few non-H58 isolates (Table 2) lacked any known AMR determinants. In contrast, the majority of H58 isolates were MDR (n = 116, 80%), often carrying acquired genes conferring resistance to ampicillin (blaTEM-1), chloramphenicol (catA1), co-trimoxazole (dfrA7 plus sul1 and/or sul2), and streptomycin (strAB). In 33 genomes (23% of H58), these genes were carried by a Tn2670-like complex transposable element inserted in the chromosome as reported previously in the region Wong et al., 2015; Feasey et al., 2015; Park et al., 2018. The remaining 83 MDR genomes (57% of H58) carried a closely related Tn2670-like transposon located within an IncHI1 plasmid, which in all but one isolate also carried an additional tetracycline resistance gene (tetB). The IncHI1 plasmids were genotyped as plasmid sequence type 6 (PST6), which is associated with MDR H58 in East Africa and South Asia Wong et al., 2015; Park et al., 2018; Holt et al., 2011. We compared single-nucleotide variant haplotypes for these plasmids with those from 534 IncHI1 PST6 plasmids sequenced previously from African and global studies (all of which were carried by H58 S. Typhi hosts, see Supplementary file 4). The wildtype PST6 plasmid haplotype was detected in S. Typhi hosts of all 6 H58 genotypes, whereas the derived plasmid haplotypes were each detected in a single S. Typhi host genotype (Figure 2—figure supplement 1). This is consistent with ongoing microevolution of the PST6 plasmid within S. Typhi lineages since the acquisition of the plasmid by the mrca of S. Typhi H58, but shows no evidence of transfer of plasmid haplotypes between S. Typhi lineages. Thus, the observed AMR phenotypes in our study site (n = 128 H58 and n = 8 non-H58 genome sequences) corresponded to the presence of known molecular determinants of AMR and mobile genetic elements. Estimates of sensitivity and specificity of AMR genotyping are presented in Supplementary file 7 and supplementary results. No statistical association was observed between the presence of MDR genes or QRDR mutations shown in Table 2 and case/control status, age, or sex.

Local subpopulations of S. Typhi H58

S. Typhi H58 (genotype 4.3.1) can be subdivided into lineages I (genotype 4.3.1.1) and II (genotype 4.3.1.2). Lineage II was more common in this setting than lineage I: n = 90 (62.1% of H58) vs n = 55 (37.9%). Examination of the global phylogeny (Figure 2, and online interactive version https://microreact.org/project/wViqmaRdZuFVEb6yk4i1jU) revealed all H58 lineage I isolates from this study shared a most recent common ancestor (mrca) whose descendants form a monophyletic clade that exclusively comprised S. Typhi from East African countries (see Figure 2,), here defined as H58 sublineage EA1 (East Africa 1) with genotype designation 4.3.1.1.EA1 (labelled in Figure 2).

Figure 2. Global population structure of H58 (4.3.1) S.Typhi showing Kenyan isolates cluster into three East African clades.

Whole genome phylogeny of 1,204 H58 isolates, including all available Kenyan genomes (n = 128 from this study, n=111 from prior studies) and globally distributed genomes for context (n=965, see Methods). Branch lengths are in substitutions per core-genome site, branches are coloured to indicate geographical origin (see inset legend), shaded boxes highlight the three East African H58 clades defined in this study. Colour bars to the right indicate (as per inset legend): 1, Kenyan strains isolated and sequenced during this study; 2, geographical location; 3, mutation(s) in the quinolone resistance determining region (QRDR) of genes gyrA, gyrB, and parC; 4, presence of multidrug resistance (MDR) IncHI1 plasmid; 5, presence of MDR genes; 6, H58 lineage. Interactive version available at https://microreactorg/project/wViqmaRdZuFVEb6yk4i1jU.

Figure 2.

Figure 2—figure supplement 1. S. Typhi IncHI1 PST6 plasmid minimum spanning tree.

Figure 2—figure supplement 1.

Nodes indicate unique IncHI1 PST6 haplotypes observed among n=534 PST6 plasmid sequences. Nodes are pie charts sized by the number of sequences among which the different plasmid haplotypes were observed and are coloured by the genotype(s) of the host S. Typhi sequences carrying the plasmid haplotype.

S. Typhi H58 lineage II (genotype 4.3.1.2) isolates from our study belonged to two distinct clades of the global phylogeny (Figure 2), which were each exclusively populated by East African isolates. The largest of these clades (n = 80 isolates, of which 81.3% derive from the current study) formed a monophyletic group nested within a deeper clade of diverse South Asian isolates (see Figure 2), and corresponds to the previously reported introduction of H58 lineage II into Kenya from South Asia Wong et al., 2015. This lineage, here defined as H58 sublineage EA2 (East Africa 2) is designated genotype 4.3.1.2.EA2 (labelled in Figure 2). The smaller East African H58 lineage II clade (n = 43 isolates) is designated genotype 4.3.1.2.EA3 (labelled in Figure 2) and comprised two sister clades, separated by ≥13 SNPs: one involving isolates from Kenya (n = 13, all from this study) and the other isolates from Uganda (n = 30), which accounted for 100% of the typhoid burden at the Ugandan site where they were identified (see Figure 2). All three East African H58 genotypes have been added to the GenoTyphi scheme using unique marker SNPs and further details on these are provided in supplementary results.

The three East African H58 subgroups circulating in our setting all had high rates of MDR (84%, 74%, and 94%, respectively); however, in EA2, MDR was exclusively associated with the PST6-IncHI1 plasmid, and in EA3 exclusively with the chromosomal insertion (see Table 2, Figure 2). In EA1, most MDR was associated with the PST6-IncHI1 plasmid. However, a subclade of isolates (associated with spread to Tanzania and Malawi) carried the chromosomal insertion instead (see Table 2, Figure 2, , supplementary methods).

Distribution of S. Typhi genotypes amongst individuals

No statistically significant differences in genotype distribution were observed between cases and controls (p = 0.077, using Chi-squared test, data in Table 2), or between males and females (p = 0.37, using Chi-squared test, data in Supplementary file 8), consistent with symptomatic and asymptomatic infections being drawn from the same general circulating pool of pathogens. The distribution of genotypes amongst cases varied by age group (p = 0.01, using Chi-square test), with the frequency of EA1 declining with age and the overall diversity increasing with age (Table 3). No significant differences in age groups was evident amongst controls (p = 0.9 using Chi-square test, see Table 3).

Table 3. Typhi genotypes associated with n = 153 cases and controls among different age groups.

Age group
≤ 1 year 1–7 years > 7 years
WGS-confirmed cases 7 66 26
 EA1 5 (71%) 24 (36%) 6 (23%)
 EA2 1 (14%) 34 (52%) 11 (42%)
 EA3 0 8 (12%) 7 (27%)
 non-H58 1 (14%) 0 2 (78%)
Shannon diversity 0.80 0.97 1.25
WGS-confirmed carriers 4 30 20
 EA1 1 (25%) 10 (33%) 9 (45%)
 EA2 3 (75%) 16 (53%) 8 (40%)
 EA3 0 1 (3%) 1 (5%)
 non-H58 0 3 (10%) 2 (10%)
Shannon diversity 0.56 1.05 1.11

Spatiotemporal distribution of S. Typhi cases and carriers

We examined the spatial and temporal distribution of all S. Typhi isolates collected at the study clinics (see methods; Figure 3—figure supplement 1 and Supplementary file 9), and the subset of 96 S. Typhi from cases and 67 from carriers living within the demographic surveillance site (DSS) (Figure 3, and Table 4). A number of peaks in monthly S. Typhi case and carrier numbers are apparent in both cohorts (Figure 3, Figure 3—figure supplement 1), with fewer cases and carriers observed in warmer months. Carrier counts remained relatively consistent throughout the study period. We tested for association between case or carrier peaks ( > 2 positives per month) and high rainfall or temperature in the same month, previous month, or 2 months prior to the month of observation (Table 4, and Supplementary file 9). For those S. Typhi from within the DSS, high temperatures were associated with lower case and carrier counts in the same month, and in the subsequent month (p < 0.05, using Fisher’s exact test), however no associations between high rainfall and elevated case or carrier counts was observed. Repeating these analyses with WGS-confirmed isolates, the association with temperature was no longer significant (although the trend remained; see Supplementary file 10, Supplementary file 11). No association was detected between patient numbers attending study clinics and climate variables (Supplementary file 12).

Figure 3. Epidemic curve of all S. Typhi cases and controls per month inside the DSS.

(A) Monthly distribution of S. Typhi genotypes from cases. (B) Monthly distribution of S. Typhi genotypes from carriers. Note that the counts include all participants who were culture-positive for S. Typhi and also those who were culture-positive for other Salmonella but identified later by WGS as S. Typhi. (C) Weather conditions throughout the study period. Blue dashed line indicates precipitation level per month (rainfall), shaded orange polygon indicates the temperature range, red circle indicates threshold for high minimum temperature for statistical testing, black circle indicates threshold for high maximum temperature for statistical testing, blue triangle indicates threshold for high rainfall for statistical testing.

Figure 3.

Figure 3—figure supplement 1. Epidemic curve of all S.Typhi cases and controls per month.

Figure 3—figure supplement 1.

(A) Monthly distribution of S. Typhi genotypes from cases. (B) Monthly distribution of S. Typhi genotypes from carriers. Note that the counts include all participants who were culture-positive for S. Typhi and also those who were culture-positive for other Salmonella but identified later by WGS as S. Typhi. Genotypes are indicated by colour as per inset legend. (C) Weather conditions throughout the study period. Blue dashed line indicates precipitation level per month (rainfall), shaded orange polygon indicates the temperature range, red circle indicates threshold for high minimum temperature for statistical testing, black circle indicates threshold for high maximum temperature for statistical testing, blue triangle indicates threshold for high rainfall for statistical testing.

Table 4. Climatic predictors of elevated case and control counts inside the DSS.

Values in cells are odds ratios and p-values for Fisher’s exact test between high case or control count ( > 2 per month) and high rainfall/temperature. * highlights p-values < 0.05.

Typhoid cases
Month Same month Previous month 2 months prior
OR (95% CI) p-value OR (95% CI) p-value OR (95% CI) p-value
Rainfall (precipitation)> 75 mm 0.21 (0.019–1.2) 0.079 1.4 (0.26–6.9) 0.73 3.7 (0.73–22.3) 0.08
Minimum temperature> 14°C 0.21 (0.041–0.95) 0.025* 0.61 (0.14–2.6) 0.52 2.2 (0.49–10.5) 0.33
Maximum temperature> 26°C 0.85 (0.20–3.6) 1 0.37 (0.080–1.6) 0.20 0.67 (0.15–2.80) 0.75
Asymptomatic controls
Month Same month Previous month 2 months prior
OR (95% CI) p-value OR (95% CI) p-value OR (95% CI) p-value
Rainfall (precipitation)> 75 mm 1.2 (0.21–6.5) 1 0.43 (0.038–2.7) 0.45 0.43 (0.038–2.7) 0.45
Minimum temperature> 14°C 0.12 (0.016–0.64) 0.005* 0.41 (0.078–1.9) 0.30 0.65 (0.13–3.2) 0.73
Maximum temperature> 26°C 0.10 (0.0090–0.61) 0.005* 0.19 (0.027–1.0) 0.04* 0.63 (0.12–3.0) 0.73

GPS coordinates were available for n = 139 (55%) S. Typhi isolates, and we endeavoured to look for geographic hotspots suggestive of major point-source single-genotype outbreaks in the informal settlement. However, our data revealed that the three H58 genotypes and non-H58 genotypes were co-circulating throughout the study area, with no evidence of geographic restriction of specific genotypes (see Figure 4—figure supplement 1). Further, we did not observe any spatially linked phylogenetic clusters of closely related sequences.

Evolutionary history of S. Typhi cases and controls

We applied Bayesian phylodynamic analysis to all available Kenyan H58 genomes to estimate the dates of emergence of each of the East African lineages. The data showed temporal structure (see methods and Figure 4—figure supplement 2), and we estimated a genome-wide substitution rate of 0.8 SNPs per genome per year (95% HPD, 0.1–1.0). This translates to a rate of 1.9 × 10–7 genome-wide substitutions per site per year (95% HPD = 1.5 × 10–7-2.2 × 10–7). The novel EA1 isolates from this study (accounting for 35% of cases and 37% of controls) were intermingled with those sequenced previously from Kenya and were genetically diverse (median pairwise distance ~16 SNPs, interquartile range 12–27). This is indicative of a well-established EA1 S. Typhi population in Nairobi for which we estimate the mrca existed circa 1990 (95% HPD, 1981–1999) (see Figure 4a, Figure 4—figure supplement 3). The most common lineage was EA2 (48%), which also showed extensive diversity and we estimate emerged circa 1988–1990 (95% HPD, 1978–1997) (see Figure 4a, Figure 4—figure supplement 2), earlier than the first recorded H58 Lineage II isolation in Kenya in 2004 Kariuki et al., 2010. We estimate the MDR fluoroquinolone non-susceptible lineage EA3, which accounts for just 11 % of isolates, arrived much more recently (Kenyan mrca circa 2012, 95% HPD 2009–2014) (see Figure 4a, Figure 4—figure supplement 3). The topology of the global H58 tree (Figure 2) supports South Asia as the most likely origin for EA3, with EA3 strains spreading between Kenya and Uganda, probably through the shared transport systems.

Figure 4. Temporal distribution of genotypes and among all cases and carriers.

(A) Dated maximum-clade credibility phylogenetic tree of Kenyan S. Typhi genotype 4.3.1 (H58), including 128 isolated from this study. Tip colours & first colour bar indicate symptom status, second colour bar indicates those isolates from children living in the defined survey area. Black triangles demarcate nodes of interest, and the accompanying bars indicate 95% HPD of node heights. Interactive phylogeny available at https://microreactorg/project/I2KUoasUB. (B) Distribution of terminal branch lengths for all sequences, extracted from the Bayesian tree shown in (A). (C) Distribution of isolate-specific SNPs detected in sequences from all cases and controls. (D) Distribution of terminal non-synonymous mutations detected in sequences from all cases and controls. In the boxplots in panels B, C, and D, black bars indicate median values, boxes indicate interquartile range. Cases and carrier samples indicated as per the inset legend.

Figure 4.

Figure 4—figure supplement 1. Spatial distribution of S. Typhi genotypes throughout the informal settlement.

Figure 4—figure supplement 1.

Each individual point represents an individual S. Typhi isolate obtained from the informal settlement which are coloured by genotype as per the inset legend.
Figure 4—figure supplement 2. Tempest regressions & BEAST date randomisation testing.

Figure 4—figure supplement 2.

(A) Kenyan H58 S. Typhi tempest regression of root-to-tip distance as a function of sampling time, with the root of the tree selected using heuristic residual mean squared (each point represents a tip of the maximum likelihood tree). The slope is a crude estimate of the substitution rate for the SNP alignment, the x-intercept corresponds to the age of the root node, and the R2 is a measure of clocklike behaviour (B) Kenyan H58 S. Typhi date randomisation test with the right most box plot showing the posterior substitution rate estimate from the SNP alignment of the data with the correct sampling times, and the remaining 20 boxplots showing the posterior distributions of the rate from replicate runs using randomised dates. The data are considered to have strong temporal signal if the estimate with the correct sampling times does not overlap with those from the randomisations.
Figure 4—figure supplement 3. Temporal and age distribution of genotypes among cases and controls inside the survey site.

Figure 4—figure supplement 3.

(A) Dated maximum-clade credibility phylogenetic tree of Kenyan S. Typhi genotype 4.3.1 (H58), including 128 isolated from this study. Tip colours and first colour bar indicate symptom status, second colour bar indicates those isolates from children living in the defined survey area. Black triangles demarcate nodes of interest, and the accompanying bars indicate 95% HPD of node heights. Interactive phylogeny available at https://microreactorg/project/I2KUoasUB. (B) Distribution of terminal branch lengths for sequences isolated from cases and controls within the survey area, extracted from the Bayesian tree shown in (A). (C) Distribution of isolate-specific SNPs detected in sequences from cases and controls resident the survey area. (D) Distribution of terminal non-synonymous mutations detected in sequences from cases and controls within the survey area. In the boxplots in panels B, C, and D, black bars indicate median values, boxes indicate interquartile range.

The Bayesian tree of Kenyan H58 isolates (Figure 4a, Figure 4—figure supplement 3) shows intermingling of sequences from acute cases and asymptomatic carriers. Sequences from carriers appeared more deeply branched than those of cases which we tested by comparing the terminal branch lengths (estimated in units of time in the Bayesian phylogeny) and isolate-specific SNP counts, for high-quality H58 sequences from acute cases (n = 85) vs those of asymptomatic carriers (n = 43) (Figure 4b, ). The mean values were higher for carriers vs cases (Figure 4b–c, Figure 4—figure supplement 3), with the trend being more pronounced among samples from within the DSS but these trends were not statistically significant (p = 0.42 for unique SNPs and p = 0.57 for terminal branches for all samples, using one-sided Wilcoxon rank sum test; p = 0.051 for unique SNPs and p = 0.58 for terminal branches in DSS). The mean number of non-synonymous (NS) mutations detected in terminal branches was greater for carrier isolates than those from cases, but again this difference was not statistically significant (0.72 vs 0.54 for all sequences, p = 0.53 using Wilcox rank sum test; 0.81 vs 0.39, p = 0.20 inside the DSS; see Figure 4d, Figure 4—figure supplement 3). There was also no significant difference in terminal branch lengths or unique SNP counts between genomes carrying vs lacking MDR genes or QRDR mutations (data not shown).

Examination of the location of terminal-branch NS mutations revealed that certain functional categories of genes carried more NS mutations arising on terminal branches associated with carriage samples vs those from acute cases (Figure 5; Supplementary file 13). Notably, carriage samples were associated with significantly higher frequencies of terminal-branch NS mutations in genes responsible for the synthesis of surface polysaccharides and antigens (9.3% of carriers vs 1.2% of acute cases (all of which were blood isolates), p = 0.043, Fisher’s exact test). Notably, in the viaB operon (responsible for Vi capsule biosynthesis) we identified n = 2/43 carriage isolates that harboured NS mutations (tviD-R159C and tviE-P263S) compared with only n = 1/85 case isolate (tviB-V1M); and in the wba cluster genes (responsible for O-antigen biosynthesis) we identified n = 2/43 carriage isolates that harboured NS mutations (wza-V137G and wzxC-L26F) whilst none were detected among case samples (Supplementary file 13). Non-significant excesses of mutations in carriage isolates were also observed for periplasmic and exported lipoproteins (6.98% vs 1.92% in blood isolates from febrile cases and 3.03% in stool isolates from febrile cases, see Figure 5).

Figure 5. Frequency of terminal non-synonymous mutations in difference gene functional categories among cases and carriers.

Figure 5.

(A) Frequency of terminal non-synonymous mutations in all sequences collected. (B) Frequency of terminal non-synonymous mutations in sequences from within the DSS area. Red bars indicate the frequency non-synonymous mutations found in acute case samples from blood, peach bars indicate non-synonymous mutations found in acute case samples from stool, and blue bars indicate the frequency of mutations found in carrier samples from stool.

Discussion

In this case-control typhoid surveillance study, we observed an asymptomatic S. Typhi carriage rate of 1.1% among children aged 16 years and under from an informal settlement with endemic Water, Sanitation, and Hygiene (WaSH)-related enteric diseases Mbae et al., 2020; Kariuki et al., 2020; Kariuki et al., 2019. To our knowledge, there has not been systematic surveillance for typhoid carriage in communities in Africa, but globally carriage and shedding of S. Typhi has mostly been associated with older age groups Levine et al., 1982; Senthilkumar et al., 2012; Gunn et al., 2014. Our data highlights a role for paediatric carriage, revealing a lower percentage of carriers amongst infants ≤ 1 year of age (0.62%), increasing to 1.2% in children between 7 and 16 years (Supplementary file 6). Thus, carriage and shedding, especially among school age children, is likely an important factor in the onward transmission of typhoid in this setting Mbae et al., 2020. Symptomatic typhoid fever is common in school age children, with a case culture positive rate of 4.3% among febrile children 7–16 years of age, though our data shows that there is also a substantial burden among younger children 1–7 years of age, and infants up to 1 year of age, with culture-positive rates among febrile participants of 3.1% and 2.2%, respectively (Supplementary file 6). Our WGS data shows that serotyping frequently leads to mis-classification of S. Typhi vs non-typhoidal Salmonella, highlighting the importance of molecular diagnostics in future studies and surveillance programs. WGS has been widely adopted by Salmonella reference labs in high-income countries Ashton et al., 2016; Köser et al., 2012; Arnold et al., 2018 but remains rare in low-income countries, although KEMRI now sequences all Salmonella isolates on-site immediately after serotyping.

We previously noted a dominance of MDR H58 S. Typhi over the last decade, essentially replacing the antimicrobial susceptible genotypes that dominated in the 1980–1990s Kariuki et al., 2015. The S. Typhi circulating in the informal settlement in the present study are largely comprised of descendants of the previously observed H58 sublineages 4.3.1.EA1 (36%) and 4.3.1.2.EA2 (48%) Wong et al., 2015; Park et al., 2018. Both EA1 and EA2 appear to be long established genotypes, with the mrca of EA1 existing circa 1990 (95% HPD, 1981–1999) (see Figure 4a ), consistent with the earliest recorded detection of H58 Lineage I in Kenya in 1988 Kariuki et al., 2010. Similarly we predict that the mrca of EA2 existed circa 1988–1990 (95% HPD, 1978–1997), earlier than the first recorded H58 Lineage II isolation in Kenya in 2004 Kariuki et al., 2010. Our data thus support contemporaneous imports of EA1 and EA2 in the late 1980s or early 1990s, shortly after the emergence of H58 in South Asia Wong et al., 2015 (circa 1982, 95% HPD 1974–1990), and show that both lineages have persisted and diversified locally alongside one another in the intervening decades. H58 sublineage EA3 was introduced later (we estimate that the Kenyan mrca existed circa ~2012 [95% HPD 2009 to 2014]), and consistent with this the lineage displays less diversity and accounts for a smaller fraction of cases and controls (11%). The topology of our global phylogeny (Figure 2) suggests that South Asia is the most likely origin of EA3 (as it is for EA1 and EA2), and that EA3 appears to have spread between Uganda and Kenya. This in line with multiple reports Feasey et al., 2015; Park et al., 2018 of H58 strains spreading through East Africa, mainly arising from intracontinental and transcontinental travel and concomitant risk factors associated with WASH conditions.

The sublineages of S. Typhi H58 in Kenya exhibit different antibiotic resistance profiles (Figure 2 and Table 2). Notably, EA1 has a large Kenyan sublineage of MDR strains with IncHI1 plasmids Wong et al., 2015; Park et al., 2018 (which are commonly associated with outbreaks in East Africa and Asia Wong et al., 2015; Park et al., 2018; Holt et al., 2011) but also a Kenyan sublineage with chromosomally integrated MDR. Chromosomal integration of MDR has not previously been reported in S. Typhi from Kenya (see supplementary data), but has been reported in Malawi and Tanzania Wong et al., 2015 and our new data suggests that the variant may have been transferred to these locations from Kenya (see Figure 2). MDR H58 isolates are now widespread across East Africa, having been detected in Malawi, Uganda, Rwanda, Tanzania, and Mozambique Wong et al., 2015; Feasey et al., 2015; Park et al., 2018; Wong et al., 2016; Ingle et al., 2019.

The three East African lineages differed markedly in their patterns of mutations conferring Decreased Ciprofloxacin Susceptibility (DCS). GyrB-S464F was conserved among all EA2, whereas all EA3 isolates carried the GyrA-S83Y mutation. The GyrA-S464F mutation was also detected at low frequency in EA1 (Table 2). This data indicates that ciprofloxacin resistance has been selected independently multiple times and is ongoing. Increasing rates of ciprofloxacin resistance have also been observed following similar introductions of H58 elsewhere in East Africa Park et al., 2018, and likely reflect a change in treatment practise following widespread dissemination of MDR S. Typhi elsewhere including South and Southeast Asia Britto et al., 2018; Britto et al., 2020; Rahman et al., 2020.

The different S. Typhi lineages appeared to be fairly evenly distributed between both acute cases and carriers, with the most common subgroup (EA2) accounting for 46% of acute cases and 50% of carriers. Similarly, all S. Typhi genotypes were identified throughout the study period and spatially across the study site, with most case/carrier monthly counts and geographic regions containing a diversity of genotypes (Figure 3, Figure 3—figure supplement 1). Our data therefore provides no evidence for major point-source single-genotype outbreaks, but is consistent with persistent contamination of water supplies with multiple S. Typhi genotypes. Higher temperatures were associated with lower S. Typhi case and carrier counts, however, no association with high rainfall was observed among our culture data. These findings are in line with previous studies focusing on seasonal trends in nearby Kibera Breiman et al., 2012. However, they contrast with trends previously observed in other settings including Malawi, where higher temperatures and rainfall were associated with increased risk of disease albeit with a time lag of multiple months Thindwa et al., 2019, and South Asia Baker et al., 2011; Dewan et al., 2013; Kanungo et al., 2008.

In our phylogenetic trees, branch lengths and SNP counts can be interpreted as measures of evolutionary time, and thus terminal branches and isolate-specific SNP counts as a measure of time since that isolate shared a mrca with another sampled isolate. This total time includes (i) time from mrca to the acquisition of infection in the sampled host, plus (ii) time from initial acquisition of the infection to time of sampling (ie time within the sampled host). Variation in the latter is more likely to be explained by the symptom status of the sampled host (rather than the former which occurs prior to the infection of the sampled host). Hence the higher mean branch lengths and SNP counts in asymptomatic controls is consistent with the expectation that carriers have had, on average, a longer duration from acquisition of the infection to sampling in the clinic (which is triggered by routine visits and unrelated to symptoms or colonisation status), compared to acute cases who are presenting due to febrile illness triggered by a recently acquired S. Typhi infection. This supports the interpretation that S. Typhi-positive controls identified in this study represent genuine medium- to long-term typhoid carriers, rather than simply reflecting transient presence in the gut. The greater diversity observed here amongst controls (Table 2) further supports this interpretation. Longer branch lengths among carrier samples were also observed in a recent study Thanh Duy et al., 2020 of S. Typhi isolated from bile samples from the gallbladders of cholecystectomy patients in Nepal. The differences in terminal branch lengths were non-significant in our study, possibly reflecting low statistical power or, perhaps less likely, that our control data constitute a mix of multiple carriage types including convalescent (3 weeks to 3 months), temporary (3e to 12 months), and chronic (more than 1 year) carriers. Also in line with previous findings Thanh Duy et al., 2020, our analyses provide evidence of positive selection among carriage isolates, with a higher proportion of non-synonymous mutations detected among carriers in specific biological pathways including surface polysaccharides and antigens, transport/binding proteins, and anaerobic respiration (Figure 5, Supplementary file 13). This is exemplified in the genes encoding surface antigens, notably those responsible for biosynthesis of Vi capsule and O-antigen lipopolysaccharide.

Our study is not without limitations, firstly, our data are from a single informal settlement community in Nairobi, and thus may not be representative of the overall population structure and AMR patterns of typhoid in Kenya more broadly, or in older age groups. Similarly, our sample size yielded a relatively small number of isolates for WGS, and we thus lack statistical power for some genetic analyses.

Conclusion

Our study is the first case-control study to identify and sequence both typhoid carriers and cases contemporaneously in an endemic community setting. High rates of AMR among both infection types in Kenya combined with high carriage and case rates, especially in the younger age groups, highlight the need for enhanced AMR and genomic surveillance in this region to inform both treatment guidelines and control strategies that keep pace with the local evolution and spread of AMR. Intervention strategies are urgently needed including the introduction of the new Vi conjugate vaccine in a programme that includes targeting of paediatric age groups in the short term, and improvements to WaSH infrastructure in the long term.

Acknowledgements

Research reported in this publication was supported by the National Institute of Allergy and Infectious Diseases of the National Institutes of Health under Award Number R01AI099525 to SK and the Wellcome Trust. ZAD was supported by a grant funded by the Wellcome Trust (STRATAA; 106158/Z/14/Z and TyVac), and received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement TyphiNET No 845,681. GD was supported by the Cambridge Biomedical Research Council NIHR AMR theme. KEH was supported by a Senior Medical Research Fellowship from the Viertel Foundation of Australia, and the Bill and Melinda Gates Foundation, Seattle (grant #OPP1175797).

Appendix 1

Supplementary methods

Phylogenetic and SNP analysis of S. Typhi isolates

For SNP analysis, paired-end reads from 153 S. Typhi isolates were mapped to the reference sequence of S. Typhi CT18 (accession number: AL513382) Parkhill et al., 2001 using the RedDog mapping pipeline (v1beta.10.3), available at http://githibcom/katholt/reddog. Briefly, RedDog uses Bowtie (v2.2.3) Langmead and Salzberg, 2012 to map reads to the reference sequence; SAMtools (v0.1.19) Li et al., 2009 to identify SNPs with phred quality scores above 30; filter out SNPs supported by <5 reads, or with >2.5 times the genome-wide average read depth (representing putative repeated sequences), or with ambiguous (heterozygous) consensus base calls. For each SNP position that passed these criteria in any one isolate, consensus base calls (i.e. alleles) for that position were extracted from all genomes, and used to construct an alignment of alleles across all SNP sites. Ambiguous base calls and those with phred quality <20 were treated as unknown alleles and represented with a gap character in the SNP alignment. Read alignments were used to assign isolates to previously defined lineages according to the S. Typhi extended genotyping framework Wong et al., 2016; Britto et al., 2018; Rahman et al., 2020, by subjecting the alignments (BAM format) to analysis with the GenoTyphi pipeline (available at http://github.com/katholt/genotyphi). Unique SNPs defining three novel lineages were identified from the genome-wide SNP allele table (with SNPs responsible for non-synonymous mutations in highly conserved genes without deletions prioritized for lineage definitions), these were added to the GenoTyphi scheme to facilitate easy identification of these lineages in future studies.

Phylogenetic analyses were restricted to WGS-confirmed pure cultures of S. Typhi H58 (genotype 4.3.1, n = 128). For some analyses, an additional 1,076 S. Typhi H58 genomes from previously published WGS studies of global and African isolates Wong et al., 2015; Park et al., 2018; Wong et al., 2016; Pham Thanh et al., 2016 were also included for context (using the same mapping approach detailed above). Alleles from 61 additional S. Typhi genomes representing all non-H58 subclades (listed in Supplementary file 2), and S. Paratyphi A str. AKU_12601 (accession FM200053) Holt et al., 2009, were also included in the phylogenetic analysis as outgroups for tree rooting. SNPs called in phage regions or repetitive sequences (354 kbp; ~ 7.4 %) of bases in the CT18 reference chromosome, as defined previously Wong et al., 2015; Holt et al., 2008 were filtered from the alignment, which was then used with the CT18 reference genome (AL513382) to produce a whole genome pseudoalignment that was subjected to analysis with Gubbins (v2.3.2) Croucher et al., 2015 to remove any further recombinant regions. This resulted in a final set of 8,635 SNPs identified from an alignment of 4,809,037 sites for the 1,266 isolates.

From the global SNP alignment, maximum likelihood (ML) phylogenetic trees were inferred using RAxML (v8.2.9) Stamatakis, 2014, with a generalized time-reversible model, a Gamma distribution to model site-specific rate variation (the GTR+ Γ substitution model; GTRGAMMA in RAxML), and 100 bootstrap pseudo-replicates to assess branch support.

From the global alignment we extracted a separate SNP alignment for the set of 239 Kenyan S. Typhi H58 genomes (n = 128 from this study and n = 111) from published studies, see Supplementary file 3; Park et al., 2018; Wong et al., 2016, which had length 489 SNPs. The same phylogenetic inference methods were used to generate a ML tree from the SNP alignment of 239 Kenyan isolates for temporal analyses (described below).

Phylogenetic trees were visualized using Microreact (interactive global H58 phylogeny available at: https://microreact.org/project/wViqmaRdZuFVEb6yk4i1jU) Argimon et al., 2016 and the R package ggtree v1.14.6 Yu et al., 2016. For the purpose of plotting the global H58 tree, clusters of S. Typhi isolates that were members of the same monophyletic clade and isolated from the same location in South East Asian countries (i.e. representing local outbreaks) were reduced to a single representative each using the drop.tip() function in the R package ape (v5.4.1) Paradis et al., 2004.

Terminal branch lengths were extracted from phylogenies using R package ggtree Yu et al., 2016. Pairwise SNP distances were calculated from alignments using the dist.dna() function in the R package ape Paradis et al., 2004. Non-synonymous mutations were detected using SNPPar (V0.4.2dev) Edwards et al., 2020 and grouped by function based on the gene in which they were found according to the S. Typhi functional classification scheme developed at the Sanger Institute (http://www.sanger.ac.uk/) using the genome annotation of CT18 Thanh Duy et al., 2020; Parkhill et al., 2001.

SNP analysis of S. Typhi IncHI1 plasmids

In order to investigate the potential for plasmid transmission among the three East African S. Typhi H58 sublineages we carried out an SNP analysis of IncHI1 plasmid minor variants. Firstly, raw read data for all S. Typhi sequences included in the chromosomal SNP analysis described above were mapped to the reference sequence of IncHI1 plasmid pAKU_1 (accession number AM412236) using RedDog (as described above). Those plasmid sequences where a read depth of at least 10-fold and coverage across the refence sequence of at least 75 % were observed (n = 669 of sequences analysed; see Supplementary file 4) were included in SNP analysis, with repetitive regions excluded as detailed in Supplementary file 5. Plasmid MLST sequence types (pMLST) were determined using SRST2 Inouye et al., 2014; Jolley and Maiden, 2010; Holt et al., 2011; Phan et al., 2009, and we then interrogated the RedDog output allele matrix, and found that among n = 534 sequences of plasmid sequence type 6 (PST6), associated with genotype 4.3.1, there were a total of 25 variable sites. We concatenated these to form alignments, selected a single representative sequence of each plasmid haplotype, omitting those with gap characters, and calculated a genetic distance matrix using the R function dist.dna() from the R package ape Paradis et al., 2004. The distance matrix was then used as input for a Minimum Spanning Tree which was inferred using the R function mst() also in the R package ape, and visualised using the R package ggnetwork (v0.5.9) Briatte, 2021.

Phylodynamic analysis of S. Typhi isolates

To investigate temporal signal and date the introduction of S. Typhi H58 into Kenya based on the 239 available genomes (n = 128 from this study, and n = 111 from previous studies Park et al., 2018; Wong et al., 2016), we used several methods. First, we used TempEst (v1.5.1) Rambaut et al., 2016 to assess temporal structure (i.e. clock-like evolution) by conducting a regression analysis of the root-to-tip branch distances of the ML tree as a function of sampling date (expressed as decimal years at a resolution of days), using the heuristic residual mean squared method with the best fitting root selected. To estimate divergence dates for the three S. Typhi H58 sublineages we detected in Kenya (EA1-3), we used BEAST (v1.10) Suchard et al., 2018 to fit a phylodynamic model to the SNP alignment and isolation dates (as decimal years at a resolution of days). Note that as EA1-3 almost exclusively comprise Kenyan strains in the global H58 tree and the parent node for each of EA1-3 is the parent node for all Kenyan isolates of EA1-3 (see Figure 2, Figure 2—figure supplement 1), the divergence dates for the EA1-3 parent nodes in the tree of 239 Kenyan isolates is taken as the divergence date for each of EA1-3 generally, as well as the lower bound for the date of introduction of each of these sublineages into Kenya.

We ran separate models using constant-coalescent population size and Bayesian skyline tree priors, in combination with a strict clock model or a relaxed (uncorrelated log normal distribution) clock model, to identify the best fitting model for our data. For BEAST analyses the GTR+Γ substitution model was selected, and sampling times (tip dates) were used to calibrate the molecular clock (for isolates collected in this study the precise day of isolation was used; for the previously published genomes, only the isolation year was known, so tip dates were assigned to the first of July for that year with an uncertainty of 0.5 years). For all tree prior and model combinations, a chain length of 100,000,000 steps with sampling every 5,000 steps was used Duchene et al., 2016. The relaxed (uncorrelated lognormal) clock model, that allows for evolutionary rate variations among branches of the tree and the constant-coalescent model were found to best fit our data. To assess the temporal signal of these Bayesian estimates, we conducted a date-randomisation test where sampling times were assigned randomly to the sequences, and the analysis re-run 20 times with the best fitting models (constant-coalescent demographic and uncorrelated lognormal clock) Duchene et al., 2016; Firth et al., 2010. The date-randomisation test revealed that these data displayed ‘strong’ temporal structure (meeting the criterion CR2 of Duchene et al., 2016). Our preliminary BEAST runs resulted in implausible tree topologies and dated the most recent common ancestor (mrca) of H58 S. Typhi in Kenya ~ 1927 (95% highest posterior density = 1847–1984), conflicting with previously inferred divergence dates for the emergence of H58 (5,7,26). Fixing the tree topology to that obtained from ML inference yielded more plausible date estimates (~1968, 95 % HPD = 1957–1977), however, sampling from the prior (without the sequence alignment) using the same model showed that this estimate was driven entirely by the priors provided to the model, with no information contributed by the sequence data. Taken together, these preliminary analyses suggested that while a temporal signal is present in the alignment, the signal is weak and would benefit from the specification of sensible priors to calibrate the root height. Previous analyses of the H58 divergence date, inferred using global data spanning a wider sampling period and with stronger temporal signal, estimate it emerged circa 1989 (95% HPD, 1981–1995) Wong et al., 2015, and we have previously reported the presence of H58 lineage one in Kenya in the late 1980s Kariuki et al., 2010. We therefore specified a log-normally distributed root height prior with mean 1989 and standard deviation 4 years. Use of this root height prior (without fixing the tree topology) yielded a plausible tree topology (i.e. consistent with the outgroup-rooted maximum likelihood tree inferred from the same alignment, with the expected separation of H58 sublineages into monophyletic clades), showed evidence of temporal signal (via date randomisation testing), and was not driven by priors alone; hence this approach was used for the final analyses presented here.

For the final analyses, 2 independent runs each conducted with a chain length of 100,000,000 steps sampling every 5,000 iterations were combined using LogCombiner (v1.10.0) Suchard et al., 2018, after removing the first 10% of steps from each as ‘burn-in’. Maximum-clade credibility trees (MCC) trees were generated with ‘common ancestor heights’ specified for node heights, using TreeAnnotator (v1.10.0) Suchard et al., 2018. The effective sample sizes (ESS) from the combined runs were >200 for all reported parameters. The resultant MCC tree was visualized using ggtree (v1.14.6) Yu et al., 2016 and Microreact Argimon et al., 2016 (interactive phylogeny available at: https://microreact.org/project/I2KUoasUB).

Genomic determinants of antimicrobial resistance

The read mapping-based allele typer SRST2 (v0.2.0) Inouye et al., 2014 was used to detect the presence of plasmid replicons (PlasmidFinder database Carattoli et al., 2014) and antimicrobial resistance (AMR genes) (ARGannot database Gupta et al., 2014) and to identify the precise alleles of AMR genes. Where AMR genes were observed without evidence of a known AMR plasmid, raw read data was de novo assembled using Unicycler (v0.4.7) Wick et al., 2017 and then examined visually using the Bandage (v0.8.1) Wick et al., 2015 assembly graph visualizer, in order to interrogate the assembly to confirm the chromosomal location and composition of AMR-associated transposons. As the Tn2670-like composite transposon commonly associated with the acquisition of MDR genes in S. Typhi is mediated by IS1 translocation Wong et al., 2015, ISMapper (v2.0) Hawkey et al., 2015 was used with default parameters to screen all read sets for insertion sites of transposases of IS1 (accession number J01730) relative to the CT18 reference chromosome sequence, in order to identify the location of any such insertion sequences in the chromosome of each Kenyan S. Typhi genome. Single point mutations located within the quinolone resistance determining region (QRDR) of genes gyrA, gyrB, and parC associated with reduced susceptibility to fluoroquinolones Pham Thanh et al., 2016 were detected from the whole genome read alignments (BAM files) described above, using GenoTyphi Wong et al., 2016; Britto et al., 2018; Rahman et al., 2020. Where a resistance phenotype was detected in the absence of known molecular determinants of AMR, DBGWAS (v0.5.4) Jaillard et al., 2018 was utilised to carry out a bacterial genome-wide association study (GWAS) to screen for genetic loci and/or variants associated with the observed resistance phenotype.

Statistical and spatial analysis

All statistical analyses unless otherwise stated were carried out using R (v4.0.2). Multivariate logistic regression analyses were conducted with the glm() function in base R, and linear regression analyses of were carried out using the lm() function in base R. Shannon diversity was calculated using the diversity() function in the R package vegan Dixon, 2003. Base R function chisq.test() was used to conduct a Chi-Squared test of age groups and S. Typhi genotypes. Fisher’s exact test was carried out using the fisher.test() function in base R to investigate the frequency of non-synonymous mutations among cases and carriers, as well as associations between binary categories (‘high’ vs ‘low’) for monthly case counts or carrier counts and weather conditions (monthly rainfall, minimum and maximum temperature) in either the same month, the previous month, or two months prior. Thresholds for the ‘high’ categories were: case or carrier counts >2; rainfall > 75 mm; minimum temperature >14 °C; maximum temperature >26 °C. For analyses of elevated patient counts attending study clinics, we used the same climate thresholds, but instead used an elevated patient count threshold of n = 85.

GPS coordinate data were visualized using Microreact Argimon et al., 2016.

Supplementary results

Precision of AMR phenotypes from genotypes

Observed AMR phenotypes for all high quality WGS data (n = 128 H58 and n = 8 Non-H58 genome sequences) were largely explained by the presence of known molecular determinants of AMR (Supplementary file 7). Sensitivity was generally high for previous first line drugs for treating typhoid including chloramphenicol, ampicillin, co-trimoxazole, as well as tetracyclines ( > 94%), as was specificity ( > 74%), and low very major errors (VME; failure to predict a resistance phenotype) were determined for these drugs ( < 4%). Major errors (ME; failure to predict a susceptible resistance phenotype) ranged between 5.1%–7.4% for these drugs, and for three sequences, this is potentially explained by the loss of an IncHI1 plasmid (n = 2) or Tn2670-like transposon (n = 1) associated with the MDR phenotype in culture, as resistance phenotypes were not observed for any of these drugs.

VME for nalidixic acid (32.4%), and ME (23.5%) for ciprofloxacin were both high. The sensitivity and specificity for these drugs (Supplementary file 7) are confounded by the presence of the GyrB-S464F mutation (Table 2), conserved in all n = 73 genome sequences of genotype 4.3.1.2.EA2 (H58 lineage two sublineage East Africa 2; discussed below in detail). The GyrB-S464F mutation has been previously reported to cause Decreased Ciprofloxacin Susceptibility (DCS), but not nalidixic acid resistance Accou-Demartin et al., 2011, however, in our data the relationship between GyrB-S464F Quinolone/Fluoroquinolone resistance in this strain background remains unclear as 50.7 % of EA2 isolates showed reduced susceptibility to nalidixic acid and 47.9 % ciprofloxacin, with the rest testing sensitive to these drugs. Neither comparative analysis of resistant and sensitive EA2 sequences, nor further bacterial GWAS analysis, revealed any evidence of either causal or compensatory mutations, or the acquisition/loss of genetic loci responsible for these phenotypes, perhaps due to low power.

Specificity estimates were high (100%), but sensitivity low (0%), for third generation cephalosporins including ceftazidime cefotaxime ceftriaxone and cefpodoxime (see Supplementary file 7). This was the result of phenotypic resistance to 3rd generation cephalosporins detected at low frequency among our high quality sequenced genomes (n = 1–4, 0.73–2.9%) in the absence of any known molecular determinants for resistance to these drugs. A GWAS was again carried out using susceptibility data for these drugs, but no candidate molecular mechanisms significantly associated with the observed resistance phenotypes were identified, again possibly due to low power.

Local subpopulations of S. Typhi H58

Salmonella Typhi H58 (genotype 4.3.1) is subdivided into lineages I (genotype 4.3.1.1) and II (genotype 4.3.1.2). Lineage II was more common in this setting than lineage I: n = 90 (62.1% of H58) vs n = 55 (37.9%). Examination of the global phylogeny (Figure 2 and online interactive version https://microreact.org/project/wViqmaRdZuFVEb6yk4i1jU) revealed all H58 lineage I isolates from this study shared a most recent common ancestor (mrca) whose descendants form a monophyletic clade that exclusively comprised S. Typhi from East African countries (see Figure 2). This clade corresponds to the previously reported introduction of H58 lineage I from South Asia into Eastern Africa, which appears to have arrived in Kenya before spreading to Tanzania and on to Malawi and South Africa (Feasey et al., 2015; Wong et al., 2016; Figure 2). Here, we define this as H58 sublineage EA1 (East Africa 1) with genotype designation 4.3.1.1.EA1 (labelled in Figure 2), which can be identified by the presence of a synonymous marker SNP STY0750-G1407A (position 751,854 in CT18; this genotype has been added to the GenoTyphi scheme available at http://github.com/katholt/genotyphi).

S. Typhi H58 lineage II (genotype 4.3.1.2) isolates from our study belonged to two distinct clades of the global phylogeny (Figure 2), which were each exclusively populated by East African isolates. The largest of these clades (n = 80 isolates, of which 81.3 % derive from the current study) formed a monophyletic group in which isolates from this study were intermingled with those obtained from previous studies in Kenya (n = 14, 17.5%), and a single Tanzanian strain isolated in 2012 (see Figure 2), suggestive of a single inter-country transmission event from Kenya into Tanzania. This clade is nested within a deeper clade of diverse South Asian isolates (see Figure 2), and corresponds to the previously reported introduction of H58 lineage II into Kenya from South Asia Wong et al., 2015; Park et al., 2018. This lineage, here defined as H58 sublineage EA2 (East Africa 2) and designated genotype 4.3.1.2.EA2 (labelled in Figure 2), can be identified by the presence of a synonymous marker SNP STY4818-C1069T (position 4680610 in CT18). The smaller East African H58 lineage II clade (n = 43 isolates) comprised two sister clades, separated by ≥13 SNPs, one involving isolates from Kenya (n = 13, all from this study) and the other isolates from Uganda (n = 30, which accounted for 100 % of the typhoid burden at the Ugandan site in the TSAP study Park et al., 2018) (see Figure 2). This clade (including both the Ugandan and Kenyan subgroups) is here defined as H58 sublineage EA3 (East Africa 3) and designated genotype 4.3.1.2.EA3 (labelled in (2) Figure 2), identified by synonymous marker SNP STY2750-G96A (position 2587488 in CT18). Both EA2 and EA3 genotypes have been added to the GenoTyphi scheme.

The three East African H58 subgroups circulating in our setting all had high rates of MDR (84%, 74% and 94%, respectively); however in EA2, MDR was exclusively associated with the PST6-IncHI1 plasmid, and in EA3 exclusively with the chromosomal insertion (see Table 2). In EA1, most MDR was associated with the PST6-IncHI1 plasmid. However, a subclade of isolates (associated with spread to Tanzania and Malawi) carried the chromosomal insertion instead (see Table 2). Chromosomal integration of MDR strains has not previously been reported in S. Typhi from Kenya, although it has been reported in the Malawi and Tanzanian sublineages of EA1 Wong et al., 2015 which our data suggests transferred to those locations from Kenya. Here, the integration of the MDR composite transposon into the chromosome of Kenyan EA1 and EA3 isolates was supported by both examination of genome assembly graphs and analysis of IS1 insertion sites (see Methods), both of which supported integration near gene cyaA as has been reported previously Wong et al., 2015; Britto et al., 2018. Interestingly, MDR in the Ugandan subclade of EA3 was associated with the IncHI1-PST6 plasmid, suggesting that migration of the transposon from plasmid to chromosome may have occurred in situ in Kenya after divergence from the Ugandan branch. The three East African lineages also differed markedly in their patterns of mutations associated with reduced susceptibility to fluoroquinolones: GyrB-S464F was conserved among all EA2, whereas all EA3 isolates carried the GyrA-S83Y mutation, and three distinct GyrA mutations, and the GyrA-S464F mutation were detected at low frequency in EA1 (see Table 2).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Samuel Kariuki, Email: samkariuki2@gmail.com.

Joseph Lewnard, University of California Berkeley, United States.

Bavesh D Kana, University of the Witwatersrand, South Africa.

Funding Information

This paper was supported by the following grants:

  • National Institutes of Health R01AI099525 to Samuel Kariuki.

  • Wellcome Trust 106158/Z/14/Z to Gordon Dougan, Kathryn E Holt.

  • European Commission TyphiNET No 845681 to Zoe A Dyson, Kathryn E Holt.

  • National Institute for Health Research AMR Theme to Gordon Dougan.

  • Bill and Melinda Gates Foundation OPP1175797 to Kathryn E Holt.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Funding acquisition, Investigation, Project administration, Supervision, Validation, Writing – original draft, Writing – review and editing.

Writing – original draft, Writing – review and editing.

Investigation, Methodology, Project administration, Writing – review and editing.

Investigation, Methodology.

Investigation, Methodology.

Formal analysis, Methodology.

Investigation, Methodology.

Investigation, Methodology.

Formal analysis, Project administration.

Formal analysis.

Formal analysis.

Data curation, Formal analysis.

Conceptualization, Formal analysis, Writing – original draft, Writing – review and editing.

Validation, Writing – original draft, Writing – review and editing.

Conceptualization, Methodology, Validation, Writing – original draft, Writing – review and editing.

Ethics

Human subjects: The study was approved by the Scientific and Ethics Review Unit (SERU) of the Kenya Medical Research Institute (KEMRI) (Scientific Steering Committee No. 2076). All parents and/or guardians of participating children were informed of the study objectives and voluntary written consent was sought and obtained before inclusion.

Additional files

Supplementary file 1. S. Typhi sequences used in this study (excel file).
elife-67852-supp1.xlsx (62.2KB, xlsx)
Supplementary file 2. Outgroup and global context H58 S. Typhi sequences used in this study (excel file).
elife-67852-supp2.xlsx (62.2KB, xlsx)
Supplementary file 3. Kenyan sequences used in temporal analyses (excel file).
elife-67852-supp3.xlsx (20.1KB, xlsx)
Supplementary file 4. Sequences used in IncHI1 plasmid analyses (excel file).
elife-67852-supp4.xlsx (40.5KB, xlsx)
Supplementary file 5. Repetitive regions excluded from plasmid pAKU1 SNP analysis (excel file).
elife-67852-supp5.xlsx (9.8KB, xlsx)
Supplementary file 6. Culture positive typhoid cases and asymptomatic carriers.
elife-67852-supp6.docx (14.4KB, docx)
Supplementary file 7. Comparison of phenotypic and genotypic AMR profiles of 136 (n=128 H58, n=8 Non-H58) high quality S. Typhi genome sequences.
elife-67852-supp7.docx (15.2KB, docx)
Supplementary file 8. Distribution of n=153 S. Typhi genotypes among each sex for cases and controls.
elife-67852-supp8.docx (12.9KB, docx)
Supplementary file 9. Climatic predictors of elevated case and carrier counts for all samples.
elife-67852-supp9.docx (13.8KB, docx)
Supplementary file 10. Climatic predictors of WGS confirmed elevated case and carrier counts inside the DSS.
elife-67852-supp10.docx (13.9KB, docx)
Supplementary file 11. Climatic predictors of WGS confirmed elevated case and carrier counts for all samples collected.
elife-67852-supp11.docx (13.8KB, docx)
Supplementary file 12. Climatic predictors of elevated cases presenting at study clinics.
elife-67852-supp12.docx (13.1KB, docx)
Supplementary file 13. Non-synonymous (NS) Mutations among S. Typhi isolates (excel file).
elife-67852-supp13.xlsx (19.7KB, xlsx)
Transparent reporting form

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files. Raw Illumina sequence reads have been submitted to the European Nucleotide Archive (ENA) under accession PRJEB19289. Individual sequence accession numbers are listed in Table S1.

The following dataset was generated:

Kariuki S, Dyson Z. 2020. Typhi_Kenya European Nucleotide Archive (ENA) NCBI BioProject. PRJEB19289

The following previously published datasets were used:

Parkhill J, Dougan G, James KD. 2001. Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature. 35101607

Argimon S, Abudahab K, Goater RJE. 2016. Visualizing and sharing data for genomic epidemiology and phylogeography. Microb Genom. PMC5320705

Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 26099265

Hawkey J, Hamidian M, Wick RR. 2015. ISMapper: identifying transposase insertion sites in bacterial genomes from short read sequence data. BMC Genomics. 26336060

References

  1. Accou-Demartin M, Gaborieau V, Song Y. Salmonella enterica Serotype Typhi with nonclassical quinolone resistance phenotype. Emerg Infect Dis. 2011;17:1091–1094. doi: 10.3201/eid/1706.101242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Argimon S, Abudahab K, Goater RJE. Microreact: visualizing and sharing data for genomic epidemiology and phylogeography. Microbial Genomics. 2016;2:e000093. doi: 10.1099/mgen.0.000093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Argimón S, Yeats CA, Goater RJ, Abudahab K, Taylor B, Underwood A, Sánchez-Busó L, Wong VK, Dyson ZA, Nair S, Park SE, Marks F, Page AJ, Keane JA, Baker S, Holt KE, Dougan G, Aanensen DM. A global resource for genomic predictions of antimicrobial resistance and surveillance of salmonella typhi at pathogenwatch. Nature Communications. 2021;12:2879. doi: 10.1038/s41467-021-23091-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Arnold C, Edwards K, Desai M, Platt S, Green J, Setup CD. Validation, and Quality Control of a Centralized Whole-Genome-Sequencing Laboratory: Lessons Learned. Journal of Clinical Microbiology. 2018;56:00261-18. doi: 10.1128/JCM.00261-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ashton PM, Nair S, Peters TM. Identification of Salmonella for public health surveillance using whole genome sequencing. PeerJ. 2016;4:e1752. doi: 10.7717/peerj.1752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Baker S, Holt KE, Clements ACA. Combined high-resolution genotyping and geospatial analysis reveals modes of endemic urban typhoid fever transmission. Open Biology. 2011;1:110008. doi: 10.1098/rsob.110008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Breiman RF, Cosmas L, Njuguna H. Population-based incidence of typhoid fever in an urban informal settlement and a rural area in Kenya: implications for typhoid vaccine use in Africa. PLOS ONE. 2012;7:e29119. doi: 10.1371/journal.pone.0029119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Briatte F. Geometries to Plot Networks with ‘ggplot2. 0.5.9R Package Ggnetwork. 2021 https://CRAN.R-project.org/package=ggnetwork
  9. Britto CD, Dyson ZA, Duchene S. Laboratory and molecular surveillance of paediatric typhoidal Salmonella in Nepal: Antimicrobial resistance and implications for vaccine policy. PLOS Neglected Tropical Diseases. 2018;12:e0006408. doi: 10.1371/journal.pntd.0006408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Britto CD, Dyson ZA, Mathias S. Persistent circulation of a fluoroquinolone-resistant Salmonella enterica Typhi clone in the Indian subcontinent. The Journal of Antimicrobial Chemotherapy. 2020;75:337–341. doi: 10.1093/jac/dkz435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Carattoli A, Zankari E, García-Fernández A. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrobial Agents and Chemotherapy. 2014;58:3895–3903. doi: 10.1128/AAC.02412-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Croucher NJ, Page AJ, Connor TR, Delaney AJ, Keane JA, Bentley SD, Parkhill J, Harris SR. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Research. 2015;43:e15. doi: 10.1093/nar/gku1196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Crump JA, Mintz ED. Global trends in typhoid and paratyphoid Fever. Clinical Infectious Diseases. 2010;50:241–246. doi: 10.1086/649541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dewan AM, Corner R, Hashizume M, Ongee ET. Typhoid Fever and its association with environmental factors in the Dhaka Metropolitan Area of Bangladesh: a spatial and time-series approach. PLOS Neglected Tropical Diseases. 2013;7:e1998. doi: 10.1371/journal.pntd.0001998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Dixon P. VEGAN, a package of R functions for community ecology. Journal of Vegetation Science. 2003;14:927–930. doi: 10.1111/j.1654-1103.2003.tb02228.x. [DOI] [Google Scholar]
  16. Duchene S, Holt KE, Weill FX. Genome-scale rates of evolutionary change in bacteria. Microbial Genomics. 2016;2:e000094. doi: 10.1099/mgen.0.000094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Edwards DJ, Duchêne S, Pope B, Holt KE. SNPPAR: identifying convergent evolution and other homoplasies from microbial whole-genome alignments. bioRxiv. 2020 doi: 10.1101/2020.07.08.194480. [DOI] [PMC free article] [PubMed]
  18. Feasey NA, Gaskell K, Wong V, Msefula C, Selemani G, Kumwenda S, Allain TJ, Mallewa J, Kennedy N, Bennett A, Nyirongo JO, Nyondo PA, Zulu MD, Parkhill J, Dougan G, Gordon MA, Heyderman RS. Rapid emergence of multidrug resistant, H58-lineage Salmonella typhi in Blantyre, Malawi. PLOS Neglected Tropical Diseases. 2015;9:e0003748. doi: 10.1371/journal.pntd.0003748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Firth C, Kitchen A, Shapiro B, Suchard MA, Holmes EC, Rambaut A. Using time-structured data to estimate evolutionary rates of double-stranded DNA viruses. Molecular Biology and Evolution. 2010;27:2038–2051. doi: 10.1093/molbev/msq088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gauld JS, Hu H, Klein DJ, Levine MM. Typhoid fever in Santiago, Chile: Insights from a mathematical model utilizing venerable archived data from a successful disease control program. PLOS Neglected Tropical Diseases. 2018;12:e0006759. doi: 10.1371/journal.pntd.0006759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gunn JS, Marshall JM, Baker S, Dongol S, Charles RC, Ryan ET. Salmonella chronic carriage: epidemiology, diagnosis, and gallbladder persistence. Trends in Microbiology. 2014;22:648–655. doi: 10.1016/j.tim.2014.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gupta SK, Padmanabhan BR, Diene SM. ARG-ANNOT, a new bioinformatic tool to discover antibiotic resistance genes in bacterial genomes. Antimicrobial Agents and Chemotherapy. 2014;58:212–220. doi: 10.1128/AAC.01310-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hawkey J, Hamidian M, Wick RR. ISMapper: identifying transposase insertion sites in bacterial genomes from short read sequence data. BMC Genomics. 2015;16:667. doi: 10.1186/s12864-015-1860-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hendriksen RS, Leekitcharoenphon P, Mikoleit M. Genomic dissection of travel-associated extended-spectrum-beta-lactamase-producing Salmonella enterica serovar typhi isolates originating from the Philippines: a one-off occurrence or a threat to effective treatment of typhoid fever? Journal of Clinical Microbiology. 2015;53:677–680. doi: 10.1128/JCM.03104-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Holt KE, Parkhill J, Mazzoni CJ. High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi. Nature Genetics. 2008;40:987–993. doi: 10.1038/ng.195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Holt KE, Thomson NR, Wain J. Pseudogene accumulation in the evolutionary histories of Salmonella enterica serovars Paratyphi A and Typhi. BMC Genomics. 2009;10:36. doi: 10.1186/1471-2164-10-36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Holt KE, Phan MD, Baker S, Duy PT, Nga TVT, Nair S, Turner AK, Walsh C, Fanning S, Farrell-Ward S, Dutta S, Kariuki S, Weill F-X, Parkhill J, Dougan G, Wain J. Emergence of a globally dominant inchi1 plasmid type associated with multiple drug resistant typhoid. PLOS Neglected Tropical Diseases. 2011;5:e1245. doi: 10.1371/journal.pntd.0001245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Holt K. Reddog. swh:1:rev:90707ace56d6189997526273cf97013d84b84d90Software Heritage. 2021a https://archive.softwareheritage.org/swh:1:dir:ca328c2159ec565754d77e7cfac2122aa34f41ac;origin=https://github.com/katholt/RedDog;visit=swh:1:snp:02487700a686fbf3a5f15ce7ea4c9bede43894d6;anchor=swh:1:rev:90707ace56d6189997526273cf97013d84b84d90
  29. Holt K. Genotyphi. swh:1:rev:f489d0e004e30bdca683c76c25ac7f4d79162791Software Heritage. 2021b https://archive.softwareheritage.org/swh:1:dir:f7c2ff7de21dc14cb011dda278424a1ef94fa7f3;origin=https://github.com/katholt/genotyphi;visit=swh:1:snp:724e03c18bc27bba59067574dbde6fbe83130e70;anchor=swh:1:rev:f489d0e004e30bdca683c76c25ac7f4d79162791
  30. Hopewell MR, Graham JP. Trends in access to water supply and sanitation in 31 major sub-Saharan African cities: an analysis of DHS data from 2000 to 2012. BMC Public Health. 2014;14:208–212. doi: 10.1186/1471-2458-14-208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ingle DJ, Nair S, Hartman H. Informal genomic surveillance of regional distribution of Salmonella Typhi genotypes and antimicrobial resistance via returning travellers. PLOS Neglected Tropical Diseases. 2019;13:e0007620. doi: 10.1371/journal.pntd.0007620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Inouye M, Dashnow H, Raven LA. SRST2: Rapid genomic surveillance for public health and hospital microbiology labs. Genome Medicine. 2014;6:90. doi: 10.1186/s13073-014-0090-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Jaillard M, Lima L, Tournoud M. A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events. PLOS Genetics. 2018;14:e1007758. doi: 10.1371/journal.pgen.1007758. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Jolley KA, Maiden MCJ. BIGSdb: Scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics. 2010;11:595. doi: 10.1186/1471-2105-11-595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kanungo S, Dutta S, Sur D. Epidemiology of typhoid and paratyphoid fever in India. Journal of Infection in Developing Countries. 2008;2:454–460. doi: 10.3855/jidc.161. [DOI] [PubMed] [Google Scholar]
  36. Kariuki S, Revathi G, Kiiru J. Typhoid in Kenya is associated with a dominant multidrug-resistant Salmonella enterica serovar Typhi haplotype that is also widespread in Southeast Asia. Journal of Clinical Microbiology. 2010;48:2171–2176. doi: 10.1128/JCM.01983-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kariuki S, Gordon MA, Feasey N, Parry CM. Antimicrobial resistance and management of invasive Salmonella disease. Vaccine. 2015;33:C21–C29. doi: 10.1016/j.vaccine.2015.03.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Kariuki S, Mbae C, Onsare R. Multidrug-resistant Nontyphoidal Salmonella Hotspots as Targets for Vaccine Use in Management of Infections in Endemic Settings. Clinical Infectious Diseases. 2019;68:S10–S15. doi: 10.1093/cid/ciy898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kariuki S, Mbae C, Van Puyvelde S. High relatedness of invasive multi-drug resistant non-typhoidal Salmonella genotypes among patients and asymptomatic carriers in endemic informal settlements in Kenya. PLOS Neglected Tropical Diseases. 2020;14:e0008440. doi: 10.1371/journal.pntd.0008440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Klemm EJ, Shakoor S, Page AJ, Qamar FN, Judge K, Saeed DK, Wong VK, Dallman TJ, Nair S, Baker S, Shaheen G, Qureshi S, Yousafzai MT, Saleem MK, Hasan Z, Dougan G, Hasan R. Emergence of an Extensively Drug-Resistant Salmonella enterica Serovar Typhi Clone Harboring a Promiscuous Plasmid Encoding Resistance to Fluoroquinolones and Third-Generation Cephalosporins. mBio. 2018;9:e00105-18. doi: 10.1128/mBio.00105-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. KNBS KNBOS . Kenya Population and Housing Census. Kenya National Bureau of Statistics; 2009. [Google Scholar]
  42. Köser CU, Ellington MJ, Cartwright EJP. Routine use of microbial whole genome sequencing in diagnostic and public health microbiology. PLOS Pathogens. 2012;8:e1002824. doi: 10.1371/journal.ppat.1002824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Kyobutungi C, Ziraba AK, Ezeh A, Yé Y. The burden of disease profile of residents of Nairobi’s slums: results from a demographic surveillance system. Population Health Metrics. 2008;6:1. doi: 10.1186/1478-7954-6-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Meth. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Levine MM, Black RE, Lanata C. Precise estimation of the numbers of chronic carriers of Salmonella typhi in Santiago, Chile, an endemic area. The Journal of Infectious Diseases. 1982;146:724–726. doi: 10.1093/infdis/146.6.724. [DOI] [PubMed] [Google Scholar]
  46. Li H, Handsaker B, Wysoker A. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Lutterloh E, Likaka A, Sejvar J. Multidrug-resistant typhoid fever with neurologic findings on the Malawi-Mozambique border. Clinical Infectious Diseases. 2012;54:1100–1106. doi: 10.1093/cid/cis012. [DOI] [PubMed] [Google Scholar]
  48. Marchello CS, Hong CY, Crump JA. Global Typhoid Fever Incidence: A Systematic Review and Meta-analysis. Clinical Infectious Diseases. 2019;68:S105–S116. doi: 10.1093/cid/ciy1094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Marchello CS, Birkhold M, Crump JA. Complications and mortality of typhoid fever: A global systematic review and meta-analysis. The Journal of Infection. 2020;81:902–910. doi: 10.1016/j.jinf.2020.10.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Marks F, von K, Aaby P. Incidence of invasive salmonella disease in sub-Saharan Africa: a multicentre population-based surveillance study. The Lancet Global Health. 2017;5:310–323. doi: 10.1016/S2214-109X(17)30022-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Mbae C, Mwangi M, Gitau N. Factors associated with occurrence of salmonellosis among children living in Mukuru slum, an urban informal settlement in Kenya. BMC Infectious Diseases. 2020;20:422. doi: 10.1186/s12879-020-05134-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Mberu BU, Haregu TN, Kyobutungi C, Ezeh AC. Health and health-related indicators in slum, rural, and urban communities: a comparative analysis. Global Health Action. 2016;9:33163. doi: 10.3402/gha.v9.33163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Mogasale V, Maskery B, Ochiai RL. Burden of typhoid fever in low-income and middle-income countries: a systematic, literature-based update with risk-factor adjustment. The Lancet Global Health. 2014;2:e570–e580. doi: 10.1016/S2214-109X(14)70301-8. [DOI] [PubMed] [Google Scholar]
  54. Mutai WC, Muigai AWT, Waiyaki P, Kariuki S. Multi-drug resistant Salmonella enterica serovar typhi isolates with reduced susceptibility to ciprofloxacin in Kenya. BMC Microbiology. 2018;18:187. doi: 10.1186/s12866-018-1332-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Mutisya E, Yarime M. Understanding the grassroots dynamics of slums in Nairobi: The dilemma of Kibera informal settlements. International Transaction Journal of Engineering, Management, and Applied Sciences and Technologies. 2011;2011:1–18. [Google Scholar]
  56. Neil KP, Sodha SV, Lukwago L. A large outbreak of typhoid fever associated with a high rate of intestinal perforation in Kasese District, Uganda, 2008-2009. Clinical Infectious Diseases. 2012;54:1091–1099. doi: 10.1093/cid/cis025. [DOI] [PubMed] [Google Scholar]
  57. Olack B, Burke H, Cosmas L. Nutritional status of under-five children living in an informal urban settlement in Nairobi, Kenya. Journal of Health, Population, and Nutrition. 2011;29:357–363. doi: 10.3329/jhpn.v29i4.8451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Paradis E, Claude J, Strimmer K. APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics. 2004;20:289–290. doi: 10.1093/bioinformatics/btg412. [DOI] [PubMed] [Google Scholar]
  59. Park SE, Pham DT, Boinett C. The phylogeography and incidence of multi-drug resistant typhoid fever in sub-Saharan Africa. Nature Communications. 2018;9:5094. doi: 10.1038/s41467-018-07370-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Parkhill J, Dougan G, James KD. Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature. 2001;413:848–852. doi: 10.1038/35101607. [DOI] [PubMed] [Google Scholar]
  61. Parry CM, Hien TT, Dougan G, White NJ, Farrar JJ. Typhoid fever. The New England Journal of Medicine. 2002;347:1770–1782. doi: 10.1056/NEJMra020201. [DOI] [PubMed] [Google Scholar]
  62. Patel JB, Cockerill FR, Bradford PA. Performance Standards for Antimicrobial Susceptibility Testing. Clinical and Laboratory Standards Institute. 2015 https://clsi.org/standards/products/microbiology/documents/m100/
  63. Pham Thanh D, Karkey A, Dongol S. A novel ciprofloxacin-resistant subclade of H58 Salmonella Typhi is associated with fluoroquinolone treatment failure. eLife. 2016;5:e14003. doi: 10.7554/eLife.14003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Phan MD, Kidgell C, Nair S. Variation in Salmonella enterica serovar typhi IncHI1 plasmids during the global spread of resistant typhoid fever. Antimicrobial Agents and Chemotherapy. 2009;53:716–727. doi: 10.1128/AAC.00645-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Pitzer VE, Feasey NA, Msefula C. Mathematical modeling to assess the drivers of the recent emergence of Typhoid fever in blantyre, malawi. Clinical Infectious Diseases. 2015;61:251–258. doi: 10.1093/cid/civ710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Rahman SIA, Dyson ZA, Klemm EJ. Population structure and antimicrobial resistance patterns of Salmonella Typhi isolates in urban Dhaka, Bangladesh from 2004 to 2016. PLOS Neglected Tropical Diseases. 2020;14:e0008036. doi: 10.1371/journal.pntd.0008036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Rambaut A, Lam TT, Max Carvalho L, Pybus OG. Exploring the temporal structure of heterochronous sequences using TEMPEST (formerly Path-o-gen) Virus Evolution. 2016;2:vew007. doi: 10.1093/ve/vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Saad NJ, Bowles CC, Grenfell BT. The impact of migration and antimicrobial resistance on the transmission dynamics of typhoid fever in Kathmandu, Nepal: A mathematical modelling study. PLOS Neglected Tropical Diseases. 2017:e0005547. doi: 10.1371/journal.pntd.0005547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Senthilkumar B, Senbagam D, Rajasekarapandian M. An epidemiological surveillance of asymptomatic typhoid carriers associated in respect to socioeconomic status in India. J Public Health. 2012;22:297–301. [Google Scholar]
  70. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Suchard MA, Lemey P, Baele G, Ayres DL, Drummond AJ, Rambaut A. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evolution. 2018;4:vey016. doi: 10.1093/ve/vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Thanh Duy P, Thieu NTV, Nguyen Thi Nguyen T, Ngoc Dan Thanh H, Dongol S, Karkey A, Carey M, Basnyat B, Dougan G, Rabaa MA, Baker S. Gallbladder carriage generates genetic variation and genome degradation in salmonella typhi. PLOS Pathogens. 2020;16:e1008998. doi: 10.1371/journal.ppat.1008998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Thindwa D, Chipeta MG, Henrion MYR, Gordon MA. Distinct climate influences on the risk of typhoid compared to invasive non-typhoid Salmonella disease in Blantyre. Malawi. Sci Rep. 2019;9:20310–20311. doi: 10.1038/s41598-019-56688-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Tinline RR, Gregory D. The Universal Transverse Mercator Code: A location code for disease reporting. The Canadian Veterinary Journal = La Revue Veterinaire Canadienne. 1988;29:825–829. [PMC free article] [PubMed] [Google Scholar]
  75. Wick RR, Schultz MB, Zobel J, Holt KE. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015;31:3350–3352. doi: 10.1093/bioinformatics/btv383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Wick RR, Judd LM, Gorrie CL, Holt KE. Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads. PLOS Computational Biology. 2017;13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Wong VK, Baker S, Pickard DJ. Phylogeographical analysis of the dominant multidrug-resistant H58 clade of Salmonella Typhi identifies inter- and intracontinental transmission events. Nature Genetics. 2015;47:632–639. doi: 10.1038/ng.3281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Wong VK, Baker S, Connor TR. An extended genotyping framework for Salmonella enterica serovar Typhi, the cause of human typhoid. Nature Communications. 2016;7:12827. doi: 10.1038/ncomms12827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biology. 2014;15:46. doi: 10.1186/gb-2014-15-3-r46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Yu G, Smith DK, Zhu H, Guan Y, TT-Y L. Ggtree: An rpackage for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution. 2016;8:28–36. [Google Scholar]

Decision letter

Editor: Joseph Lewnard1
Reviewed by: Lauren Cowley, Madikay Senghore2

Our editorial process produces two outputs: i) public reviews designed to be posted alongside the preprint for the benefit of readers; ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Acceptance summary:

This study contributes a significant advance to the field in terms of detailed genomic epidemiology of the introductions of drug resistant typhoid into Kenya, and Africa in general. The findings highlight the role of asymptomatic carriage in the spread, drug-resistance mechanisms and the emergence of typhoid strains in Kenya. The authors also contribute a small number of sequences from both carriage and acute disease, most sequenced cases are associated with acute disease, making this deposit of publicly available carriage sequences extremely valuable. This work will be of interest to a broad readership, including but not limited to clinicians, biologists, and public health experts.

Decision letter after peer review:

Thank you for submitting your article "Multiple introductions of multidrug-resistant typhoid associated with acute infection and asymptomatic carriage, Kenya" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Bavesh Kana as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Lauren Cowley (Reviewer #2); Madikay Senghore (Reviewer #3).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

As noted by the reviewers, the authors present a novel and high quality study elucidating further aspects of the role of asymptomatic carriage within typhoid epidemiology and placing East African lineages in the geographic context of introductions from south Asia. Most recommendations from the reviewers are for clarifications; a modest list of suggested analyses for the revision follows before the public reviews and evaluation summary:

1) One major factor that is overlooked in the investigation, is the sampling of acute cases both from blood and fecal culture. In the reviewers' opinion, these should be considered as separate conditions and separate sample types (so not grouped together for some of the analysis.) This would affect results in figure 5 that could be further separated out into three groups; controls, enteric cases and invasive cases. It might be expected that positive selection had happened in the differing environments of the acute cases, either in invasive blood cultured specimens or enteric fecal cultured specimens that could be investigated and detailed (see further details on this point in the comments from Reviewer 2)

(2) Another aspect of the paper that could have borne more scrutiny was the PTS6-IncHI1 plasmid present in two of the east African lineages. Although short read sequencing limits its resolution or the level of comparative analysis possible, some attempt at comparing the short reads generated in this study to previously published full sequences of PTS6-IncHI1 plasmids in H58 Typhi would be warranted. This could inform horizontal transfer of this plasmid and whether these plasmids sampled in two separate lineages in the same location contemporaneously were highly similar or not.

(3) The authors rightly mention that recombinant regions were excluded in the genomes prior to phylogenetic tree construction. The reviewers wondered if the phylogenies with and without recombinant regions filtered out could be compared. This would provide a sense of the overall impact of recombination in the evolution of the S. Typhi strains. Also, it's not clear if the recombination events removed in the phylogeny of only Kenyan isolates, specific clades and global isolates. Considering that Gubbins was developed to identify recombination events in clonal bacterial population, the authors should comment on whether it is appropriate to run the analysis using a diverse global dataset. It would also be great to mention the recombination rate inferred from Gubbins.

Reviewer #2 (Recommendations for the authors):

I really enjoyed reading this paper and think that, on the whole, the analysis produced and conclusions drawn are very well justified and carefully prepared. I have just two small suggestions to provide further detail to this paper that I think would be extremely valuable to the readers.

(1) Separate the acute infections into two groups, those sampled from fecal culture and those from blood culture. This is an important distinction that would be important to the analysis performed to produce figure 5. There could be a distinct signal in NS mutations from the fecal vs. blood acute samples. Furthermore, the carriage fecal and acute fecal samples could have mutations that differentiate them from the blood culture samples or vice versa. Some consideration of this or inclusion of this in your analysis would be worthwhile.

(2) To interrogate the PTS6-IncHI1 plasmid further, I suggest blasting the contigs from your plasmid-identified samples against previously published and finished PTS6-IncHI1 plasmid sequences. You might be able to piece together more about the content of this plasmid and how similar the PTS6-IncHI plasmids from EA1 are to those from EA2. Otherwise some similar style of analysis that investigates horizontal transfer further.

Reviewer #3 (Recommendations for the authors):

The authors present a well written manuscript based on a case contact study of diarrheal aetiology among children in Kenya. They implement robust bioinformatics pipelines and derive conclusions that can guide intervention strategies. Below are some specific comments.

Line 159 – The authors mentions that for carriers, rectal swabs or stools samples were cultured on selenite F, however, it is not specified whether the isolates from cases were recovered using the same procedures. Please clarify

Line 185-186. Could the authors explain why there is such a huge discrepancy and how they ensured that this was not the result of sample mix-ups.

Table 1 – the authors use the results from culture to determine whether there are correlates for S. typhi carriage and disease. Given the discrepancies between the genomic data and the serotyping data why did the authors choose to use the serotyping data instead of the genomic confirmation. Are the results of this analysis reproducible if the genome confirmed data are used instead of the serotyping data?

Line 406 – The authors present evidence suggesting that higher temperatures were significantly associated with decreased case counts. However, this includes isolates that were not confirmed by sequencing and generally the per month case counts were low. Was there a correlation between weather and the general numbers of patients presenting at the clinic? Additionally, when the unconfirmed isolates are excluded from the analysis does this observation still hold?

eLife. 2021 Sep 13;10:e67852. doi: 10.7554/eLife.67852.sa2

Author response


Essential revisions:

As noted by the reviewers, the authors present a novel and high quality study elucidating further aspects of the role of asymptomatic carriage within typhoid epidemiology and placing East African lineages in the geographic context of introductions from south Asia. Most recommendations from the reviewers are for clarifications; a modest list of suggested analyses for the revision follows before the public reviews and evaluation summary:

(1) One major factor that is overlooked in the investigation, is the sampling of acute cases both from blood and fecal culture. In the reviewers' opinion, these should be considered as separate conditions and separate sample types (so not grouped together for some of the analysis.) This would affect results in figure 5 that could be further separated out into three groups; controls, enteric cases and invasive cases. It might be expected that positive selection had happened in the differing environments of the acute cases, either in invasive blood cultured specimens or enteric fecal cultured specimens that could be investigated and detailed (see further details on this point in the comments from Reviewer 2)

As detailed in the Methods section, we define acute typhoid cases as children with ≥3 days fever ≥38°C and positive blood or stool culture for S. Typhi (see lines 137-141). We include stool culture within this definition because blood culture is known to be only ~50% sensitive (see e.g. Antillon et al. 2018, J Infect Dis, doi: 10.1093/infdis/jiy471), and the presence of febrile illness with culture-proven presence of S. Typhi is generally considered to represent an acute infection as opposed to asymptomatic carriage. However, we agree with the reviewer’s point that for the purposes of examining positive selection in the bacteria, there could potentially be a distinction between selective pressure on isolates recovered from stool vs blood of febrile cases. We have therefore reanalysed the data as suggested and updated Figure 5, which now shows the data stratified into three groups: (i) stool isolates from asymptomatic controls; (ii) blood isolates from febrile cases; and (iii) stool isolates from febrile cases. The results show that, for the functional groups noted in text (surface polysaccharides and lipoproteins), the stool isolates from cases are similar to blood isolates and distinct from stool isolates from asymptomatic carriers.

We have updated the Results section (lines 548-551) to reflect this:

“Notably, carriage samples were associated with significantly higher frequencies of terminal-branch NS mutations in genes responsible for the synthesis of surface polysaccharides and antigens (9.3% of carriers vs 1.2% of acute cases (all of which were blood isolates), p=0.043, Fisher’s exact test). […] Non-significant excesses of mutations in carriage isolates were also observed for periplasmic and exported lipoproteins (6.98% vs 1.92% in blood isolates from febrile cases and 3.03% in stool isolates from febrile cases, see Figure 5).”

(2) Another aspect of the paper that could have borne more scrutiny was the PTS6-IncHI1 plasmid present in two of the east African lineages. Although short read sequencing limits its resolution or the level of comparative analysis possible, some attempt at comparing the short reads generated in this study to previously published full sequences of PTS6-IncHI1 plasmids in H58 Typhi would be warranted. This could inform horizontal transfer of this plasmid and whether these plasmids sampled in two separate lineages in the same location contemporaneously were highly similar or not.

To address this, we carried out an additional SNP analysis of n=669 IncHI1 plasmid sequences detected in the first global typhoid genomics study (Wong et al. 2015, Nat Genet, doi: 10.1038/ng.3281) and a multi-site surveillance study in East and West African (Park et al., 2018, Nat Commun, doi: 10.1038/s41467-018-07370-z) also included in the Typhi SNP analyses (sequences are listed in the new Supplementary Files 2 and 4). Results are shown in Figure 2 and new Figure 2—figure supplement 1.

We have added an additional section to the Results section titled ‘Global population structure and antimicrobial resistance profiles of Kenyan S. Typhi’, from lines 362-366 that now reads:

“We compared single nucleotide variant haplotypes for these plasmids with those from 534 IncHI1 PST6 plasmids sequenced previously from African and global studies (all of which were carried by H58 S. Typhi hosts, see Supplementary File 4). […] This is consistent with ongoing microevolution of the PST6 plasmid within S. Typhi lineages since the acquisition of the plasmid by the mrca of S. Typhi H58, but shows no evidence of transfer of plasmid haplotypes between S. Typhi lineages.”

The Methods section (lines 282-286) was updated as follows:

“SRST2 was used to determine IncHI1 plasmid MLST (multi-locus sequence types) sequence types (pMLST), and minor alleles were identified by mapping to the plasmid pAKU1 reference sequence (accession number AM412236) in the same manner as described above for the S. Typhi chromosome (details in supplementary methods and Supplementary Files 4-5).”

Supplementary methods (new section ‘SNP analysis of S. Typhi IncHI1 plasmids’ lines 1173-1190) provides more details of the analysis:

“SNP analysis of S. Typhi IncHI1 plasmids

In order to investigate the potential for plasmid transmission among the three East African S. Typhi H58 sublineages we carried out an SNP analysis of IncHI1 plasmid minor variants. […] The distance matrix was then used as input for a Minimum Spanning Tree which was inferred using the R function mst() also in the R package ape, and visualised using the R package ggnetwork (v0.5.9) 23.”

(3) The authors rightly mention that recombinant regions were excluded in the genomes prior to phylogenetic tree construction. The reviewers wondered if the phylogenies with and without recombinant regions filtered out could be compared. This would provide a sense of the overall impact of recombination in the evolution of the S. Typhi strains. Also, it's not clear if the recombination events removed in the phylogeny of only Kenyan isolates, specific clades and global isolates. Considering that Gubbins was developed to identify recombination events in clonal bacterial population, the authors should comment on whether it is appropriate to run the analysis using a diverse global dataset. It would also be great to mention the recombination rate inferred from Gubbins.

Salmonella Typhi is a genetically monomorphic pathogen (a single lineage within S. enterica, comprising a single clonal complex in the MLST scheme) and has very low recombination rates (Holt et al. 2008 Nat Genet, doi: 10.1038/ng.195, 40(8); Achtman 2008 Annual Review of Microbiology, 62, doi: 10.1146/annurev.micro.62.081307.162832). Further, the H58 (4.3.1) genotype is considered a single clone (Wong et al. 2015, Nat Genet, doi: 10.1038/ng.3281; Roumagnac et al. 2006 Science, doi: 10.1126/science.1134933) and thus suitable for recombination filtering using Gubbins. Gubbins analysis of H58 (Kenya or global isolates) consistently identifies two regions of recombination occurring at the root of the H58/genotype 4.3.1 branch (see Phandango screenshot in Author response image 1). The first of these spans CT18 (accession AL513382) reference sequence coordinates 954,115–970,731 (genes STY0961-STY0976), and the second spans 1,438,676-1,467,273 (genes STY1485-STY1508). Including these two recombinant patches in the SNP alignment results in lengthening the relevant branch but has no meaningful impact on the inferred relationships between lineages or isolates. Therefore, all phylogenies reported in the manuscript are inferred from the alignment that excludes these gubbins identified regions.

Author response image 1.

Author response image 1.

We have updated the Methods section to clarify this (lines 234-245):“Analysis with Gubbins (v2.3.2) identified two regions affected by recombination (coordinates 954,115–970,731 (genes STY0961-STY0976), and 1,438,676-1,467,273 (genes STY1485-STY1508) in the CT18 reference genome), which were excluded from the alignment40.”

Reviewer #2 (Recommendations for the authors):

I really enjoyed reading this paper and think that, on the whole, the analysis produced and conclusions drawn are very well justified and carefully prepared. I have just two small suggestions to provide further detail to this paper that I think would be extremely valuable to the readers.

(1) Separate the acute infections into two groups, those sampled from fecal culture and those from blood culture. This is an important distinction that would be important to the analysis performed to produce figure 5. There could be a distinct signal in NS mutations from the fecal vs. blood acute samples. Furthermore, the carriage fecal and acute fecal samples could have mutations that differentiate them from the blood culture samples or vice versa. Some consideration of this or inclusion of this in your analysis would be worthwhile.

This comment has been addressed as detailed above (please see ‘Essential revisions’).

(2) To interrogate the PTS6-IncHI1 plasmid further, I suggest blasting the contigs from your plasmid-identified samples against previously published and finished PTS6-IncHI1 plasmid sequences. You might be able to piece together more about the content of this plasmid and how similar the PTS6-IncHI plasmids from EA1 are to those from EA2. Otherwise some similar style of analysis that investigates horizontal transfer further.

This comment has been addressed as detailed above (please see ‘Essential revisions’).

Reviewer #3 (Recommendations for the authors):

The authors present a well written manuscript based on a case contact study of diarrheal aetiology among children in Kenya. They implement robust bioinformatics pipelines and derive conclusions that can guide intervention strategies. Below are some specific comments.

Line 159 – The authors mentions that for carriers, rectal swabs or stools samples were cultured on selenite F, however, it is not specified whether the isolates from cases were recovered using the same procedures. Please clarify.

All isolates, whether from cases or carriers were recovered using the same procedure of pre-enrichment and selection in Selenite-F. We have modified lines 163-177 to now read:

“For blood culture, 1-3 mL for children <5 years of age and 5-10 mL for those 5-16 years of age was collected in a syringe, placed into Bactec media bottles (Becton-Dickinson, New Jersey, USA), incubated at 37oC in a computerized BACTEC 9050 Blood Culture System (Becton-Dickinson), and subcultured after 24-48 h onto blood, chocolate and MacConkey agar (Oxoid, Basingstoke, UK) plates. All isolates, whether from cases or carriers were cultured on selenite F (Oxoid) broth aerobically at 37oC overnight.”

Line 185-186. Could the authors explain why there is such a huge discrepancy and how they ensured that this was not the result of sample mix-ups.

This was a 3-year study and isolates were archived immediately after biochemical and serological ID to await WGS processing later, hence we believe it is more likely that we had mixed colonies in the initial culture. Regardless, we report our results both in terms of sero-identified, and WGS-confirmed, S. Typhi and there is little difference except in terms of sample size and thus statistical power for testing (see Tables 1 and Figure 4—Figure Supplement 3).

Table 1 – the authors use the results from culture to determine whether there are correlates for S. typhi carriage and disease. Given the discrepancies between the genomic data and the serotyping data why did the authors choose to use the serotyping data instead of the genomic confirmation. Are the results of this analysis reproducible if the genome confirmed data are used instead of the serotyping data?

We prioritised culture data for the logistic regression analyses (Table 1) because (a) this is the largest sample available, as the WGS process introduces a large amount of data loss not because of mis-classification but because many isolates were unable to be revived or failed sequencing (see Figure 1); and (b) it represents the primary classification that is available in clinical microbiology laboratories in most settings and therefore comparable to other studies, in contrast to WGS-confirmed. We did also show results for linear regression of infection rates on age, stratified by sex, for WGS-confirmed as well as serologically-defined infections, in Table S4 (now Supplementary File 6) which show the same associations.

However to improve clarity and robustness, we have now added logistic regression analyses for WGS-confirmed S. Typhi to Table 1, which shows the same associations with age amongst males. We have also modified the text at lines 323-326 to note this:

“Significant associations identified from culture data were also observed when using WGS confirmed infections only, and no significant statistical association was found between phenotypic or genotypic AMR patterns and case/control status, age, or sex.”

Line 406 – The authors present evidence suggesting that higher temperatures were significantly associated with decreased case counts. However, this includes isolates that were not confirmed by sequencing and generally the per month case counts were low. Was there a correlation between weather and the general numbers of patients presenting at the clinic? Additionally, when the unconfirmed isolates are excluded from the analysis does this observation still hold?

The monthly count of febrile cases presenting to the clinics was not associated with rainfall or temperature, this data is now shown in Supplementary File 12.

The choice to use culture confirmed data for the climate analysis was made for the same reasons as outlined above for the logistic regression analyses. We have repeated the analyses using WGS confirmed isolates and added two new tables to the revised manuscript (Supplementary Files 11-12). We have adjusted the text in lines 468-472 to refer to these results, as follows:

“Repeating these analyses with WGS-confirmed isolates, the association with temperature was no longer significant (although the trend remained; see Supplementary Files 10-11). No association was detected between patient numbers attending study clinics and climate variables (Supplementary File 12).”

And have added the below lines 1303-1305 to the Supplementary Methods section:

“For analyses of elevated patient counts attending study clinics, we used the same climate thresholds, but instead used an elevated patient count threshold of n=85.”

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Kariuki S, Dyson Z. 2020. Typhi_Kenya European Nucleotide Archive (ENA) NCBI BioProject. PRJEB19289
    2. Parkhill J, Dougan G, James KD. 2001. Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature. 35101607 [DOI] [PubMed]
    3. Argimon S, Abudahab K, Goater RJE. 2016. Visualizing and sharing data for genomic epidemiology and phylogeography. Microb Genom. PMC5320705 [DOI] [PMC free article] [PubMed]
    4. Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 26099265 [DOI] [PMC free article] [PubMed]
    5. Hawkey J, Hamidian M, Wick RR. 2015. ISMapper: identifying transposase insertion sites in bacterial genomes from short read sequence data. BMC Genomics. 26336060 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Supplementary file 1. S. Typhi sequences used in this study (excel file).
    elife-67852-supp1.xlsx (62.2KB, xlsx)
    Supplementary file 2. Outgroup and global context H58 S. Typhi sequences used in this study (excel file).
    elife-67852-supp2.xlsx (62.2KB, xlsx)
    Supplementary file 3. Kenyan sequences used in temporal analyses (excel file).
    elife-67852-supp3.xlsx (20.1KB, xlsx)
    Supplementary file 4. Sequences used in IncHI1 plasmid analyses (excel file).
    elife-67852-supp4.xlsx (40.5KB, xlsx)
    Supplementary file 5. Repetitive regions excluded from plasmid pAKU1 SNP analysis (excel file).
    elife-67852-supp5.xlsx (9.8KB, xlsx)
    Supplementary file 6. Culture positive typhoid cases and asymptomatic carriers.
    elife-67852-supp6.docx (14.4KB, docx)
    Supplementary file 7. Comparison of phenotypic and genotypic AMR profiles of 136 (n=128 H58, n=8 Non-H58) high quality S. Typhi genome sequences.
    elife-67852-supp7.docx (15.2KB, docx)
    Supplementary file 8. Distribution of n=153 S. Typhi genotypes among each sex for cases and controls.
    elife-67852-supp8.docx (12.9KB, docx)
    Supplementary file 9. Climatic predictors of elevated case and carrier counts for all samples.
    elife-67852-supp9.docx (13.8KB, docx)
    Supplementary file 10. Climatic predictors of WGS confirmed elevated case and carrier counts inside the DSS.
    elife-67852-supp10.docx (13.9KB, docx)
    Supplementary file 11. Climatic predictors of WGS confirmed elevated case and carrier counts for all samples collected.
    elife-67852-supp11.docx (13.8KB, docx)
    Supplementary file 12. Climatic predictors of elevated cases presenting at study clinics.
    elife-67852-supp12.docx (13.1KB, docx)
    Supplementary file 13. Non-synonymous (NS) Mutations among S. Typhi isolates (excel file).
    elife-67852-supp13.xlsx (19.7KB, xlsx)
    Transparent reporting form

    Data Availability Statement

    All data generated or analysed during this study are included in the manuscript and supporting files. Raw Illumina sequence reads have been submitted to the European Nucleotide Archive (ENA) under accession PRJEB19289. Individual sequence accession numbers are listed in Table S1.

    The following dataset was generated:

    Kariuki S, Dyson Z. 2020. Typhi_Kenya European Nucleotide Archive (ENA) NCBI BioProject. PRJEB19289

    The following previously published datasets were used:

    Parkhill J, Dougan G, James KD. 2001. Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature. 35101607

    Argimon S, Abudahab K, Goater RJE. 2016. Visualizing and sharing data for genomic epidemiology and phylogeography. Microb Genom. PMC5320705

    Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 26099265

    Hawkey J, Hamidian M, Wick RR. 2015. ISMapper: identifying transposase insertion sites in bacterial genomes from short read sequence data. BMC Genomics. 26336060


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES