ABSTRACT
Enterotoxigenic Escherichia coli (ETEC) is an important cause of diarrhea in children in low- and middle-income countries (LMICs). However, large-scale pathogen burden studies in children have identified ETEC in the guts of both symptomatic patients and controls. The factors that influence this balance are poorly understood, but it is postulated that the gut microbiome may play a role in either resistance or progression to disease. In this study, we profiled the microbiomes of children and adults from Bangladesh who were asymptomatically or symptomatically infected with ETEC. Symptomatic patients had significantly higher numbers of sequenced reads mapping to both E. coli and two ETEC toxins, suggesting higher bacterial burden. They were also significantly more likely to be coinfected with enteroaggregative E. coli (EAEC) and had higher proportions of other Gammaproteobacteria, including Klebsiella, Salmonella, and Haemophilus. Colonization with ETEC was also associated with increased prevalence of antimicrobial resistance (AMR) genes, most notably those of the β-lactamase class. Taxonomic profiles were distinctly different between all groups in both species richness and composition, although the direction of these changes was different in adults and children. As seen previously, children with high E. coli burdens also had higher proportions of Streptococcus spp., while healthy children were more heavily colonized by Bifidobacterium spp. Our study provides insight into the microbiome changes that occur upon infection with ETEC in an endemic setting and provides rationale for future studies investigating how the microbiome may protect or predispose individuals to symptomatic infections with gastrointestinal pathogens.
KEYWORDS: ETEC, microbiome, metagenomics
INTRODUCTION
Enterotoxigenic Escherichia coli (ETEC) is a common cause of watery diarrhea in low- and middle-income countries (LMICs). While normally described as the major cause of traveler’s diarrhea in visitors to these regions, ETEC also infects young children in areas of endemicity, causing an estimated 84.4 million diarrhea episodes per year worldwide (1). Although most people in these regions will generate immunity in childhood, this immune protection is not absolute, and adults may periodically endure symptomatic infections. From large-scale field studies, it is also clear that individuals can often carry ETEC without displaying significant clinical symptoms (2). Although it is not known what modulates the balance between symptomatic and asymptomatic infections, factors such as preexisting immunity and the composition of the microbiome may play a part.
In recent years, there has been increasing interest in understanding how the microbiome impacts human health. However, the majority of this microbial profiling has been performed on healthy adult individuals in developed countries and may not be generalizable or applicable to people in geographically distinct regions. Exemplar studies have found that the microbiomes of people in the United States and Europe are distinct from those in Africa and India (3–5). Some of this can be described by the Prevotella/Bacteroides gradient within the Bacteroidetes phylum, whereby Bacteroides predominates in Western populations, while Prevotella dominates elsewhere. Bacteroidetes are also more generally overrepresented in Western microbiomes, with Firmicutes being more common in other populations. This is also true in children, whereby children in Bangladesh were shown to have much lower levels of Bacteroidetes in total and higher microbial diversity than children in the United States (6). Similar patterns were also noted in adult U.S. travelers to Central America and India, with travelers having relatively more Firmicutes and fewer Bacteroidetes on return to the United States (7). These regional variations in microbiome composition are likely to be mediated by a multitude of factors, including diet, genetics, lifestyle, and the environment (4).
One environmental factor that is expected to have a major influence on gastrointestinal health is the interaction between the microbiome and gastrointestinal pathogens. Although several studies have looked at the impact of intestinal infection on the gut microbiome (8–13), due to differences in patient age, pathogen of interest, control groups and antibiotic therapy, as well as small patient sample numbers, there is no consensus on which aspects of the microbiome may be protective or endow colonization resistance and which might facilitate progression to disease. In a human ETEC challenge study, researchers identified several species that were correlated with progression to disease or resistance to infection, including several Bacteroides spp. which were overrepresented in volunteers who were protected from symptomatic infection (14). However, as this study was in healthy adults in the United States, it is not clear how relevant these findings are to populations where ETEC is endemic.
In our study we used metagenomic sequencing analysis to compare the gut microbiomes of adults and children in Bangladesh who were asymptomatically or symptomatically infected with ETEC. The whole-genome shotgun sequence data were interrogated to determine the burden of E. coli species, presence of ETEC toxins, antimicrobial resistance genes (ARGs), other bacterial pathogens, and overall microbiome composition.
RESULTS
Burden of E. coli in symptomatically and asymptomatically ETEC-infected individuals.
ETEC culture-positive stool samples were collected from 32 symptomatic and asymptomatic adults and children at the International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b) in Dhaka, Bangladesh. As controls, 16 ETEC culture-negative stool samples were collected from healthy adults and children in the same city. DNA was extracted from stool samples and shotgun sequenced using Illumina HiSeq technology. To determine the approximate ETEC load in each sample, sequenced reads were mapped to the O78 ETEC reference genome H10407. The percentage of reads that mapped to E. coli H10407 was then compared for each health status and age group.
Regardless of age, participants who were symptomatically infected with ETEC were more likely to have high percentages of reads mapping to E. coli H10407 (Fig. 1A). The average percentage of reads that mapped to E. coli H10407 was also significantly greater in symptomatic participants compared to asymptomatically infected participants and ETEC-negative healthy controls. Asymptomatic participants had a higher average percentage of E. coli H10407 reads compared to controls (P = 0.002) but were significantly lower than the symptomatic group. While this suggests that symptomatically infected participants had a higher burden of ETEC on average, it is possible that they may also have had an abundance of other E. coli or related bacteria or that they were presymptomatic and would go on to have diarrheal symptoms.
FIG 1.
Detection of E. coli and associated virulence factors from sequence data. (A and B) For each sample the sequence data were mined to determine the percentage of reads mapping to E. coli H10407 (A) and the presence of adhesins and pathogenic E. coli-associated virulence factors (B). The intensity of the heatmap is shown relative to the RPKM for the gene of interest.
To specifically identify ETEC and other diarrheagenic E. coli in the stool samples, pathotype-defining virulence gene sequences were detected in shotgun data using SRST2. ETEC-positive samples were defined as those where we could detect DNA sequences for either the heat-labile toxin (LT) or heat-stable toxin (ST) genes. Of the 15 symptomatic participants in the study, we could detect LT or ST in 14 samples (Fig. 1B). Importantly, the proportion of samples within which we could detect LT/ST and the reads per kilobase mapped (RPKM) for these genes was significantly greater in the symptomatically infected group compared to asymptomatically infected individuals (Wilcoxon rank-sum test, LT P = 0.0285; ST P = 0.0413). There was no significant difference in the detection of reads between adults and children.
In addition to ETEC-associated DNA, symptomatically infected participants had significantly higher levels of detectable DNA associated with enteroaggragative E. coli (EAEC) than asymptomatically infected participants (P = 0.003). DNA sequences normally associated with enteropathogenic E. coli (EPEC) were also detected in some samples, but there was no statistically significant difference between groups (Fig. 1B). Compared to control participants, symptomatically infected individuals had significantly higher detection of ETEC- and EAEC-associated DNA, while for asymptomatic participants, only ETEC-associated DNA signatures were significantly more prevalent (P = 0.009).
Interestingly, of the samples from which we could detect ETEC-associated DNA, the majority of symptomatic patients were infected with ETEC harboring genes for both LT and ST. In contrast, the most common ETEC found in asymptomatic patients was ST-only ETEC. ETEC positive for the LT gene only was only found in one patient in each infected group. Of the ETEC colonization factor adhesins identified, CFA and CS13 were significantly more prevalent in asymptomatic and symptomatic participants than controls (Fig. 1B). A number of additional adhesins were detected in symptomatic participants, including CS1, CS2, CS6, CS14, CS17, and CS22. Overall, the number of fimbriae detected per sample was higher in symptomatic participants than in those who were asymptomatic (Mann-Whitney U test, P < 0.01). This could be due to mixed ETEC infections or a higher average number of fimbriae produced by ETEC strains colonizing participants in the symptomatic group.
Comparison of microbiome diversity and composition between asymptomatic and symptomatic individuals.
To compare the gut microbiome composition of stool samples from ETEC-infected participants and controls, human sequence-depleted DNA reads were assigned informatically into taxonomic groups, and the microbial composition was compared at the phylum level (Fig. 2). As expected, there were noticeable differences between the microbial composition of stool between adults and children. In general, healthy adults were heavily colonized by Firmicutes, while healthy children were predominantly colonized by Actinobacteria. In contrast, symptomatic participants showed high proportions of Proteobacteria, regardless of age. While the majority of asymptomatic participants were similar to controls at the phylum level, there were some notable exceptions within the data set where asymptomatic participants looked more similar to symptomatic participants than to controls.
FIG 2.
Microbial composition of gut microbiome samples by age group and health status. Sequenced reads were classified to the phylum level, and the relative abundance of each phylum is shown as the proportion of total bacterial reads.
The differences in microbiome structure were further investigated with additional granulation by determining the richness (α-diversity) and composition (β-diversity) of the samples at the genus level. α-Diversity was measured using Shannon’s diversity index, a measure of population abundance and evenness. Higher Shannon indices are seen in samples which are more diverse in the number of species present and lower when the sample is populated by fewer species or predominantly composed of few species. As expected, the pattern of α-diversity was very different between adults and children (Fig. 3). The α-diversity of the microbiome in control and asymptomatic adults was significantly greater than that of symptomatic participants (Fig. 3A). In children however, the microbiome α-diversity in controls was found to be much lower than that of adult controls (Fig. 3A). There was a trend toward an increase in α-diversity in asymptomatic children compared to controls; however, this difference was not statistically significant. Symptomatic children had diversity profiles somewhere between controls and asymptomatic children but were not significantly different from either.
FIG 3.

Diversity of stool microbiome samples by health status group and age. The diversity of microbiome samples was compared at the genus level and stratified by age group and health status. (A) Shannon (α) diversity (B) Bray Curtis dissimilarity (β diversity). MDS, multidimensional scaling; **, P < 0.01.
β-Diversity was compared by calculating the Bray Curtis dissimilarity index, which compares the compositional dissimilarity between samples. A Bray Curtis value of 0 reflects a situation whereby samples contained exactly the same microbes, whereas a value of 1 would suggest they did not share any microbial species. In general, symptomatically infected adults and children had similar microbial composition to each other, regardless of age (Fig. 3B, dark green and dark brown ellipses). Control adults and children had distinct microbial composition, as shown by the separate β-diversity populations (Fig. 3B, light green and light brown ellipses). Asymptomatically infected participants did not cluster together but were spread between the distinct control and symptomatic populations.
Diversity of antimicrobial resistance genes between asymptomatically and symptomatically infected individuals and controls.
Antimicrobial resistance genes were identified from whole shotgun sequence data using SRST2 and the ARG-ANNOT database. To allow comparison between samples, the relative abundance of each gene in the shotgun data was determined by calculating the RPKM (Fig. 4). There were significant differences in the antibiotic genes detected in the different health status groups. In general, genes of the beta-lactamase, fluoroquinolone, sulfonamide, and trimethoprim classes were more common in symptomatic and asymptomatic patients than controls. Only aadA and tetB were significantly more prevalent in symptomatic patients than in asymptomatic patients. Conversely, ACl-1 and ermQ were both found to be more prevalent in asymptomatic patients than controls. There were few remarkable differences between the age groups; however, in control participants a small number of tetracycline genes (tet-40 and tetB-P) were more prevalent in adults than children.
FIG 4.
Antimicrobial resistance gene carriage in participants. RPKM values for ARGs detected in participant stool samples are shown as a heatmap, clustered by health status, age, and antimicrobial class. Significant differences (Fisher’s exact test) in the prevalence of specific ARGs between groups is shown next to the gene name. AGLY, aminoglycosides; BLA, beta-lactamases; FOS, fosfomycin; FLQ, fluoroquinolones; GLY, glycosides; MLS, macrolides/lincosamides/streptogramins; PHE, phenicols; SUL, sulfonamides; TET, tetracyclines; TMP, trimethoprim.
Differentially abundant taxa between individuals with high and low E. coli loads.
The above-described analysis of the microbiome samples showed significant heterogeneity between asymptomatically infected participants. It is clear, however, that despite ETEC exposure, some individuals were able to protect themselves from an outgrowth of E. coli, while others became heavily colonized. To further explore the differences in the microbiomes of people who were highly colonized with ETEC compared to those with low or no ETEC colonization, we reclassified samples into two categories, those with high E. coli burden (>10% reads mapped to E. coli H10407) and those with low E. coli burdens (<10% reads mapped to E. coli H10407). For adults, 9/25 (36%) of enrolled participants had high proportions of E. coli reads, while for children this was 10/23 (43%).
To allow us to identify underlying differences in the non-E. coli bacterial populations in participants infected with ETEC, the reads classified as genus Escherichia were removed from the data set before analysis. For comparison between groups, abundance data at the genus level were normalized, and pairwise comparisons between each group were made by calculating the log2 ratio of median proportions. The median ratios were then filtered to remove hits that were not significantly different between groups. The significant differences between the high- and low-E. coli burden groups were then visualized using heat tree maps (Fig. 5). To provide additional granularity, this analysis was repeated at the species level (Table S1).
FIG 5.
(A and B) Heat trees showing statistically significant differences in abundance of bacterial taxa in adults (A) and children (B) with high (brown) or low (green) E. coli burden. The size of the node is proportional to the number of operational taxonomic units (OTUs) detected, while the depth of color is proportional to the size of the log2 median difference between groups. Gray branches show taxa that were present but not significantly different between groups.
Significant differences in bacterial abundance between samples with high E. coli burden and those with low E. coli burden at the species level. A positive log2 median ratio represents species that were overabundant in low-E. coli burden subjects, while a negative ratio indicates bacteria more prevalent in high-E. coli samples. Download Table S1, XLSX file, 0.02 MB (16.1KB, xlsx) .
Copyright © 2022 Higginson et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
There were 101 and 43 species significantly different in abundance in adults and children, respectively. Adults and children with high E. coli burdens had higher proportions of numerous other Enterobacteriaceae compared to those with low E. coli burdens. At the species level this included Salmonella enterica, Citrobacter spp., and Klebsiella spp. Adults with high E. coli reads also harbored higher proportions of reads that mapped to the genomes of Campylobacteria, Burkholderiales, Pasteurellaceae, and undefined Veillonellaceae F0422 MGYG-HGUT-03197 (Fig. 5A). In contrast, adults with low E. coli burden had significantly higher proportions of numerous as yet undefined species, including members of the classes Bacilli and Clostridia.
For children, there were fewer statistically significant differences between the two groups, and fewer taxa that contributed greater than 0.005 reads to the population, proportionally (Fig. 5B). In addition to members of the Enterobacteriaceae, other species that were highly enriched in children with high E. coli reads were Streptococcus spp., Comamonas spp., and the newly defined species F0422 MGYG-HGUT-03197, which was also identified in adults. The only taxa that were significantly enriched in children with low E. coli burden were six Bifidobacterium species (B. infantis, B. scardovii, B. pullorum, B. gallinarum, B. breve, and B. angulatum).
DISCUSSION
In this study, we found significant differences in microbial diversity among symptomatically and asymptomatically ETEC-infected individuals compared to controls in both adults and children. The most striking difference between symptomatic and asymptomatic participants was the number of reads mapping to the widely used reference ETEC H10407 genome. Although our analysis may not provide an exact measure of ETEC burden, as it includes other E. coli, the E. coli read depths largely correlated with detected LT and ST toxin genes, which are key signatures of ETEC. This strongly suggests that asymptomatic people have a lower ETEC burden than symptomatic individuals. In this proof-of-concept study, asymptomatic participants were substantially more likely to harbor ST-only ETEC, whereas more symptomatic participants had LT/ST-ETEC. However, why ETEC is able to persist and replicate in some individuals without causing clinical disease remains an open question. While the presence of LT could increase the likelihood of a patient becoming symptomatic, data from large-scale studies suggest that ST is the more important toxin for mediating moderate-to-severe diarrhea (2).
One limitation of this small, proof-of-concept study was that samples were only collected from participants at one time point, and participants with detectable ETEC in their stool were not followed up to establish whether or not a subset would eventually go on to become symptomatic. While progression to symptomatic infection cannot be discounted within this cohort, we do know from case-control studies such as GEMS1 and GEMS1A that ETEC (LT or ST positive) is commonly identified in asymptomatic individuals (2, 15). From the GEMS1 study in Bangladesh, only ST-positive ETEC was significantly associated with disease, although this was only in the younger age groups. In children between 24 and 59 months, all-cause ETEC (ST or LT positive) was detected in 20 cases and 26 controls (https://clinepidb.org/ce/app/). Thus, there is a clear case for asymptomatic carriage of ETEC in children and adults. Building on this work, a larger study collecting longitudinal samples from participants will be needed to investigate how the microbiome changes in response to ETEC colonization over time and what factors precipitate progression to symptomatic disease.
There were no significant differences between adults and children in the E. coli read depth, ETEC toxin gene carriage, ARG carriage, or presence of other E. coli pathotypes. However, the microbiome diversity and response to infection were considerably different between these two groups. Microbial diversity in control children was relatively low, and their microbiomes were dominated by Actinobacteria. The diversity in control children was distinct from that of asymptomatically infected children, who had much higher α-diversity. This likely represents the proliferation of species normally present at low abundances in these children due to disruption of normal gut homeostasis. Whether ETEC infection precipitates this change or is a result of it is yet to be determined. In general, when looking at microbial diversity, the microbiomes of symptomatic and asymptomatic children were more similar to each other than to controls. This is an important factor to consider when deciding on appropriate controls for future microbiome research or potential microbial therapeutic development.
On the species level we also saw increases in the proportions of several Streptococcal species in children with high reads. This is consistent with results of a similar study on microbiome changes during intestinal infection in Bangladesh, which found proportional increases in E. coli and several Streptococcal species in children with diarrhea, regardless of the culpable intestinal pathogen (6). Conversely, in children with low E. coli burden we found higher proportions of several species of Bifidobacterium. These bacteria are common in breastfed infants (16) and were also increased in abundance in healthy children compared to diarrhea cases in the above-described study (7).
For adults, the overall microbiome profile was very different from that in children. Asymptomatic adults and controls had very similar gut microbiomes, which suggests that the introduction of ETEC does not singularly cause any considerable microbiome perturbation and that there may be other factors at play. The healthy adult microbiome was also much more diverse, indicating decreased diversity upon ETEC infection as E. coli became the most abundant bacterial member of the microbiome. The microbiome in healthy adults was rich in Firmicutes, similar to other published studies from Bangladesh (14), although we also observed an abundance of Actinobacteria. This finding was similar to a study on travelers returning to the United States from a LMIC, which found an abundance of Firmicutes, Actinobacteria, and Proteobacteria in healthy and ETEC-infected travelers (17). Although a study of ETEC infections in healthy volunteers in the United States found a number of Bacteroides species to be protective against infection (14), in our study the Bacteroides genus was on average greater than 2-fold more abundant in adults with high E. coli sequence reads, although no species were significantly more abundant. This may be related to baseline differences in gut microbiome composition between the United States and Bangladesh.
It is clear from these data that progression to symptomatic ETEC infection is associated with an outgrowth of Enterobacteriaceae, although interestingly, many uncharacterized species were also significantly associated with either protection from or progression toward symptomatic disease. Most of these bacteria have only recently been discovered, and many have never been cultured, as they were identified through metagenome assembly. This highlights the importance of efforts like the Unified Human Gastrointestinal Genome to shine a light on previously hidden microbial populations (17).
Finally, ARG carriage was significantly different between the groups, with asymptomatic and symptomatic patients being broadly more similar to each other than controls. In general, patients colonized with ETEC had higher prevalence of ARGs of the trimethoprim, sulfonamide, fluoroquinolone, and beta-lactamase classes. These antibiotics are commonly used in South Asia, both clinically and in farming, and high levels of resistance to these classes have been seen in other studies (18–21). Although there was increased prevalence of ARGs in people colonized with ETEC, it is unlikely that all the ARGs would be ETEC-associated, as most E. coli isolates in South Asia are not so highly multidrug resistant (18, 19). It is therefore probable that the increase in ARG prevalence is associated with changes in the microbiome and increases in the relative proportions of ARG-carrying Enterobacteriaceae. Understanding the mechanics of this interaction will require larger studies encompassing longitudinal sampling and comprehensive lifestyle and antibiotic usage surveys.
Conclusions.
Our study highlights the gastrointestinal microbiome differences between asymptomatic colonization with ETEC compared to symptomatic infection. Not only were there clear alterations in microbial composition in these patients, but the prevalence of several clinically relevant ARG classes was also impacted. This work provides important foundational information for future studies investigating the impact of gut microbiome composition on gastrointestinal disease progression and how the presence of supportive organisms may endow protection against symptomatic disease.
MATERIALS AND METHODS
Ethics.
Samples were collected from participants of the ETEC ETVAX Vaccine Trial (ClinicalTrials registration no. NCT02531802) at the International Centre for Diarrhoeal Disease Research, Bangladesh (icddr,b).
Sample collection.
Stools samples were collected from asymptomatic and symptomatic adults and children who were culture positive for ETEC, as well as uninfected controls (Table 1). Raw stool was stored at −80°C before total DNA was extracted using the QIAamp DNA stool minikit. All samples were processed within 4 days of sample collection.
TABLE 1.
Numbers of study participants by age and health status
| Age group | Control | Asymptomatic | Symptomatic | Total |
|---|---|---|---|---|
| Children | 8 | 7 | 8 | 23 |
| Adults | 8 | 10 | 7 | 25 |
| Total | 16 | 17 | 15 | 48 |
Sequencing.
DNA samples that cleared the quality-control check were sequenced in 16-plex on two lanes of a HiSeqV4 machine at the Wellcome Sanger Institute (WSI). The raw data are available on the European Nucleotide Archive under study accession no. PRJEB21793. Over 15 million 125-bp paired-end reads were generated for each sample. The data generated were screened for human reads, which were identified and removed from the raw sequence data by aligning to the Homo sapiens GRCh38 reference genome using Bowtie 2 v2.2.3 (22).
E. coli mapping, virulence factor, and ARG detection.
To identify E. coli species-specific reads in raw sequence data, human-depleted reads were mapped to the widely used O78 reference genome of H10407 (accession version no. GCA_000210475.1) using Bowtie 2. To identify relevant E. coli virulence determinants and ARGs, raw reads were mapped against databases of E. coli virulence determinants (Virulence Factor Database [23]) and ARGs (ARGannot3 [REF]) using SRST2 v2 (24). The number of reads mapping to these genes of interest was quantified by calculating the reads per kilobase mapped (RPKM). Pathotypes were defined as follows: enterotoxigenic Escherichia coli (ETEC) were defined as the presence of the genes corresponding to heat-labile toxin (LT) or heat-stable toxin (ST), enteroaggregative Escherichia coli (EAEC) were defined by the presence of aatA and at least one of either the aggR, aaiC, or aap genes, and enteropathogenic Escherichia coli (EPEC) were defined as eae positive and stx negative.
Statistical analysis of pathotype detection was calculated using a two-tailed Fisher’s exact test, and the percentage of mapped reads and RPKM (reads per kilobase of transcript, per million reads mapped) values were compared using a Wilcoxon rank-sum test. To compare the prevalence of antibiotic resistance gene detections between groups, Fisher’s exact test was used with an adjustment for false-discovery rate.
Taxonomic classification, abundance estimation, and diversity analyses.
Classification of human-depleted reads was carried out using Kraken 2 v2.0.8-beta (25). Reads were classified against the uhgg_kraken2-db database of prokaryotic genomes (17), which was downloaded from the European Bioinformatics Institute (https://www.ebi.ac.uk/metagenomics/genomes). Abundance estimation was run on Kraken 2 outputs using Bracken v2.5 (26) at the phylum and genus levels. Diversity analyses were carried out in R using vegan v2.5-6 (27) and agricolae v1.3-1 (28). Before α- and β-diversity calculations, data were rarefied to the minimum number of reads to maintain all samples in the data set. Statistical analysis of α-diversity measures was carried out using analysis of variance (ANOVA) and Tukey’s honestly significant difference (HSD) test.
Metacoder v0.3.3 (29) was used to visualize genus level differences in bacterial diversity between samples with high (>10%) and low percentages (<10%) of reads mapping to E. coli H10407. Bracken abundance data were filtered to limit the data set to bacterial taxa. To discount the contribution of Escherichia reads on the overall abundance of taxa, reads classified to the level of Escherichia were removed from the data set. Due to the differences in diversity seen between adults and children, these data sets were analyzed independently. Low-abundance taxa (<20 reads per sample) were removed, and proportions of total bacterial reads were calculated for the remaining taxa. Taxa with proportions below 0.005 for all samples were dropped from the analysis. For comparative analysis, the log2 median ratio of taxon abundances was calculated for each pairwise comparison. Statistical significance between groups was measured using the Wilcoxon rank sum test, with an adjustment for false-discovery rate. For visual heat tree comparisons, log2 median ratios were filtered to display only those that were significantly different between groups.
Data availability.
The sequence data are available on the European Nucleotide Archive under study accession no. PRJEB21793.
ACKNOWLEDGMENTS
We thank the clinical and laboratory teams at the International Centre for Diarrheal Disease Research, Bangladesh, and the sequencing team at the Wellcome Sanger Institute for their assistance.
The samples and data collected for this study were collected as part of the ETEC ETVAX Vaccine Trial (ClinicalTrials registration no. NCT02531802).
We declare we have no competing interests.
At the University of Cambridge, this work was funded by the NIHR AMR Research Capital Funding Scheme (NIHR 200640). This work was also supported by the icddr,b, which is grateful to the governments of Bangladesh, Canada, Sweden, and the United Kingdom for providing core/unrestricted support. G.D. and A.M. were supported by NIHR-BRC. A.M. was also funded by the DBT-Cambridge partnership. Additional support was provided by Wellcome Trust Strategic Award (no. 106158/Z/14/Z) and the Bill and Melinda Gates Foundation (no. 617 OPP1141321).
Authors contributions: E.E.H.: investigation, data analysis, writing-original draft, and editing. M.A.S.: investigation, writing-review, and editing. J.P.D.: data analysis, writing-review, and editing. V.S.: writing-review and editing. M.B.: writing-review and editing. S.K.S.: writing-review and editing. I.W.: conceptualization and writing-review. F.Q.: conceptualization, funding, writing-review, and editing. G.D.: conceptualization, supervision, funding, writing-review, and editing. A.M.: conceptualization, supervision, funding, writing-review, and editing.
Contributor Information
Ankur Mutreja, Email: am872@medschl.cam.ac.uk.
Rino Rappuoli, GSK Vaccines.
REFERENCES
- 1.Anderson JD IV, Bagamian KH, Muhib F, Amaya MP, Laytner LA, Wierzba T, Rheingans R. 2019. Burden of enterotoxigenic Escherichia coli and shigella non-fatal diarrhoeal infections in 79 low-income and lower middle-income countries: a modelling analysis. Lancet 7:e321–e330. doi: 10.1016/S2214-109X(18)30483-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kotloff KL, Nataro JP, Blackwelder WC, Nasrin D, Farag TH, Panchalingam S, Wu Y, Sow SO, Sur D, Breiman RF, Faruque AS, Zaidi AK, Saha D, Alonso PL, Tamboura B, Sanogo D, Onwuchekwa U, Manna B, Ramamurthy T, Kanungo S, Ochieng JB, Omore R, Oundo JO, Hossain A, Das SK, Ahmed S, Qureshi S, Quadri F, Adegbola RA, Antonio M, Hossain MJ, Akinsola A, Mandomando I, Nhampossa T, Acácio S, Biswas K, O’Reilly CE, Mintz ED, Berkeley LY, Muhsen K, Sommerfelt H, Robins-Browne RM, Levine MM. 2013. Burden and aetiology of diarrhoeal disease in infants and young children in developing countries (the Global Enteric Multicenter Study, GEMS): a prospective, case-control study. Lancet 382:209–222. doi: 10.1016/S0140-6736(13)60844-2. [DOI] [PubMed] [Google Scholar]
- 3.de Filippo C, Cavalieri D, di Paola M, Ramazzotti M, Baptiste PJ, Massart S, Collini C, Pieraccini G, Lionetti P. 2010. Impact of diet in shaping gut microbiota revealed by a comparative study in children from Europe and rural Africa. Lancet 107:14691–14696. doi: 10.1073/pnas.1005963107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Yatsunenko T, Rey FE, Manary MJ, Trehan I, Dominguez-Bello MG, Contreras M, Magris M, Hidalgo G, Baldassano RN, Anokhin AP, Heath AC, Warner B, Reeder J, Kuczynski J, Caporaso JG, Lozupone CA, Lauber C, Clemente JC, Knights D, Knight R, Gordon JI. 2012. Human gut microbiome viewed across age and geography. Nature 486:222–227. doi: 10.1038/nature11053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bhute S, Pande P, Shetty SA, Shelar R, Mane S, Kumbhare SV, Gawali A, Makhani H, Navandar M, Dhotre D, Lubree H, Agarwal D, Patil R, Ozarkar S, Ghaskadbi S, Yajnik C, Juvekar S, Makharia GK, Shouche YS. 2016. Molecular characterization and meta-analysis of gut microbial communities illustrate enrichment of Prevotella and Megasphaera in Indian subjects. Front Microbiol 7:660. doi: 10.3389/fmicb.2016.00660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lin A, Bik EM, Costello EK, Dethlefsen L, Haque R, Relman DA, Singh U. 2013. Distinct distal gut microbiome diversity and composition in healthy children from Bangladesh and the United States. PLoS One 8:e53838. doi: 10.1371/journal.pone.0053838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Youmans BP, Ajami NJ, Jiang Z-D, Campbell F, Wadsworth WD, Petrosino JF, DuPont HL, Highlander SK. 2015. Characterization of the human gut microbiome during travelers’ diarrhea. Gut Microbes 6:110–119. doi: 10.1080/19490976.2015.1019693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Midani FS, Weil AA, Chowdhury F, Begum YA, Khan AI, Debela MD, Durand HK, Reese AT, Nimmagadda SN, Silverman JD, Ellis CN, Ryan ET, Calderwood SB, Harris JB, Qadri F, David LA, LaRocque RC. 2018. Human gut microbiota predicts susceptibility to Vibrio cholerae infection. J Infect Dis 218:645–653. doi: 10.1093/infdis/jiy192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lindsay B, Oundo J, Hossain MA, Antonio M, Tamboura B, Walker AW, Paulson JN, Parkhill J, Omore R, Faruque ASG, Das SK, Ikumapayi UN, Adeyemi M, Sanogo D, Saha D, Sow S, Farag TH, Nasrin D, Li S, Panchalingam S, Levine MM, Kotloff K, Magder LS, Hungerford L, Sommerfelt H, Pop M, Nataro JP, Stine OC. 2015. Microbiota that affect risk for shigellosis in children in low-income countries. Emerg Infect Dis 21:242–250. doi: 10.3201/eid2101.140795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Weil A, Midani F, Chowdhury F, Khan A, Begum Y, Charles R, Calderwood SB, Ryan ET, Harris J, Qadri F, David L, Larocque R. 2015. The gut microbiome and susceptibility to Vibrio cholerae infection. Open Forum Infect Dis 697. doi: 10.1093/ofid/ofv131.72. [DOI] [Google Scholar]
- 11.Kieser S, Sarker SA, Sakwinska O, Foata F, Sultana S, Khan Z, Islam S, Porta N, Combremont S, Betrisey B, Fournier C, Charpagne A, Descombes P, Mercenier A, Berger B, Brüssow H. 2018. Bangladeshi children with acute diarrhoea show faecal microbiomes with increased Streptococcus abundance, irrespective of diarrhoea aetiology. Environ Microbiol 20:2256–2269. doi: 10.1111/1462-2920.14274. [DOI] [PubMed] [Google Scholar]
- 12.Monira S, Nakamura S, Gotoh K, Izutsu K, Watanabe H, Alam NH, Nakaya T, Horii T, Ali SI, Iida T, Alam M. 2013. Metagenomic profile of gut microbiota in children during cholera and recovery. Gut Pathog 5:1. doi: 10.1186/1757-4749-5-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Singh P, Teal TK, Marsh TL, Tiedje JM, Mosci R, Jernigan K, Zell A, Newton DW, Salimnia H, Lephart P, Sundin D, Khalife W, Britton RA, Rudrik JT, Manning SD. 2015. Intestinal microbial communities associated with acute enteric infections and disease recovery. Microbiome 3:45. doi: 10.1186/s40168-015-0109-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Pop M, Paulson JN, Chakraborty S, Astrovskaya I, Lindsay BR, Li S, Bravo HC, Harro C, Parkhill J, Walker AW, Walker RI, Sack DA, Stine OC. 2016. Individual-specific changes in the human gut microbiota after challenge with enterotoxigenic Escherichia coli and subsequent ciprofloxacin treatment. BMC Genomics 17:440. doi: 10.1186/s12864-016-2777-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kotloff KL, Nasrin D, Blackwelder WC, Wu Y, Farag T, Panchalingham S, Sow SO, Sur D, Zaidi AKM, Faruque ASG, Saha D, Alonso PL, Tamboura B, Sanogo D, Onwuchekwa U, Manna B, Ramamurthy T, Kanungo S, Ahmed S, Qureshi S, Quadri F, Hossain A, Das SK, Antonio M, Hossain MJ, Mandomando I, Acácio S, Biswas K, Tennant SM, Verweij JJ, Sommerfelt H, Nataro JP, Robins-Browne RM, Levine MM. 2019. The incidence, aetiology, and adverse clinical consequences of less severe diarrhoeal episodes among infants and children residing in low-income and middle-income countries: a 12-month case-control study as a follow-on to the Global Enteric Multicenter Study (GEMS). Lancet Global Health 7:e568–e584. doi: 10.1016/S2214-109X(19)30076-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bezirtzoglou E, Tsiotsias A, Welling GW. 2011. Microbiota profile in feces of breast- and formula-fed newborns by using fluorescence in situ hybridization (FISH). Anaerobe 17:478–482. doi: 10.1016/j.anaerobe.2011.03.009. [DOI] [PubMed] [Google Scholar]
- 17.Almeida A, Nayfach S, Boland M, Strozzi F, Beracochea M, Shi ZJ, Pollard KS, Parks DH, Hugenholtz P, Segata N, Kyrpides NC, Finn RD. 2019. A unified sequence catalogue of over 280,000 genomes obtained from the human gut microbiome. bioRxiv doi: 10.1101/762682. [DOI]
- 18.Talukdar PK, Rahman M, Rahman M, Nabi A, Islam Z, Hoque MM, Endtz HP, Islam MA. 2013. Antimicrobial resistance, virulence factors and genetic diversity of Escherichia coli isolates from household water supply in Dhaka, Bangladesh. PLoS One 8:e61090. doi: 10.1371/journal.pone.0061090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Natarajan M, Kumar D, Mandal J, Biswal N, Stephen S. 2018. A study of virulence and antimicrobial resistance pattern in diarrhoeagenic Escherichia coli isolated from diarrhoeal stool specimens from children and adults in a tertiary hospital, Puducherry, India. J Health Popul Nutr 37:1–11. doi: 10.1186/s41043-018-0147-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Swarthout JM, Fuhrmeister ER, Hamzah L, Harris AR, Ahmed MA, Gurley ES, Satter SM, Boehm AB, Pickering AJ. 2021. Differential overlap in human and animal fecal microbiomes and resistomes in rural versus urban Bangladesh. medRxiv doi: 10.1101/2021.05.13.21257188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ahmed I, Rabbi MB, Sultana S. 2019. Antibiotic resistance in Bangladesh: a systematic review. Int J Infect Dis 80:54–61. doi: 10.1016/j.ijid.2018.12.017. [DOI] [PubMed] [Google Scholar]
- 22.Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Chen L, Yang J, Yu J, Yao Z, Sun L, Shen Y, Jin Q. 2004. VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res 33:D325–D328. doi: 10.1093/nar/gki008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Inouye M, Dashnow H, Raven L-A, Schultz MB, Pope BJ, Tomita T, Zobel J, Holt KE. 2014. SRST2: rapid genomic surveillance for public health and hospital microbiology labs. Genome Med 6:90. doi: 10.1186/s13073-014-0090-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wood DE, Lu J, Langmead B. 2019. Improved metagenomic analysis with Kraken 2. bioRxiv doi: 10.1101/762302. [DOI] [PMC free article] [PubMed]
- 26.Lu J, Breitwieser F, Thielen P, Salzberg SL. 2017. Bracken: estimating species abundance in metagenomics data. PeerJ Comput Sci 3:e104. doi: 10.7717/peerj-cs.104. [DOI] [Google Scholar]
- 27.Oksanen J. Vegan. 2022. ecological diversity. https://cran.r-project.org/web/packages/vegan/vignettes/diversity-vegan.pdf.
- 28.de Mendiburu F. 2019. Package “agricolae”. https://cran.r-project.org/web/packages/agricolae/agricolae.pdf.
- 29.Foster ZSL, Sharpton TJ, Grünwald NJ. 2017. Metacoder: an R package for visualization and manipulation of community taxonomic diversity data. PLoS Comput Biol 13:e1005404. doi: 10.1371/journal.pcbi.1005404. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Significant differences in bacterial abundance between samples with high E. coli burden and those with low E. coli burden at the species level. A positive log2 median ratio represents species that were overabundant in low-E. coli burden subjects, while a negative ratio indicates bacteria more prevalent in high-E. coli samples. Download Table S1, XLSX file, 0.02 MB (16.1KB, xlsx) .
Copyright © 2022 Higginson et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Availability Statement
The sequence data are available on the European Nucleotide Archive under study accession no. PRJEB21793.




