Abstract
Extra-intestinal pathogenic Escherichia coli (ExPEC) can cause a variety of infections outside of the intestine and are a major causative agent of urinary tract infections. Treatment of these infections is increasingly frustrated by antimicrobial resistance (AMR) diminishing the number of effective therapies available to clinicians. Incidence of multidrug resistance (MDR) is not uniform across the phylogenetic spectrum of E. coli. Instead, AMR is concentrated in select lineages, such as ST131, which are MDR pandemic clones that have spread AMR globally. Using a gnotobiotic mouse model, we demonstrate that an MDR E. coli ST131 is capable of out-competing and displacing non-MDR E. coli from the gut in vivo. This is achieved in the absence of antibiotic treatment mediating a selective advantage. In mice colonised with non-MDR E. coli strains, challenge with MDR E. coli either by oral gavage or co-housing with MDR E. coli colonised mice results in displacement and dominant intestinal colonisation by MDR E. coli ST131. To investigate the genetic basis of this superior gut colonisation ability by MDR E. coli, we assayed the metabolic capabilities of our strains using a Biolog phenotypic microarray revealing altered carbon metabolism. Functional pangenomic analysis of 19,571 E. coli genomes revealed that carriage of AMR genes is associated with increased diversity in carbohydrate metabolism genes. The data presented here demonstrate that independent of antibiotic selective pressures, MDR E. coli display a competitive advantage to colonise the mammalian gut and points to a vital role of metabolism in the evolution and success of MDR lineages of E. coli via carriage and spread.
A mouse colonisation model reveals that a multidrug-resistant (MDR) strain of E. coli can displace commensal strains from the gut, and that unique selection pressures occurring in the MDR strain result in streamlining of their metabolism. Analysis of a curated set of 20,000 E. coli genomes shows that MDR lineages are associated with increased nucleotide sequence diversity in metabolism genes.
Introduction
Infections by multidrug resistant (MDR) gram-negative pathogens now represent one of the greatest global public health challenges of our generation, with the World Health Organisation declaring them of utmost international importance. Chief among these pathogens is MDR Escherichia coli, which are responsible for an alarming rise in the incidence of antimicrobial resistant (AMR) blood stream and urinary tract infections [1]. MDR in E. coli is heavily associated with the carriage of large MDR plasmids encoding extended spectrum beta-lactamases (ESBLs) and carbapenemases such as NDM, KPC, and Oxa-48 that confer resistance to third-generation cephalosporins and carbapenem classes of antibiotics, respectively [1]. Intriguingly, such plasmids are very rarely found in intestinal pathogenic E. coli such as E. coli O157 or common enteropathogenic and enterotoxigenic E. coli strains. Rather MDR plasmid carriage is concentrated in a number of lineages responsible for extra-intestinal pathogenesis such as blood stream and urinary tract infections [2].
Extra-intestinal pathogenic E. coli (ExPEC) is the name given to E. coli strains capable of causing extra-intestinal infections, but do not represent a phylogenetically distinct group of organisms. Rather ExPEC are found across the species phylogeny mainly in phylogroups B2, D, and F [3]. Recent longitudinal surveys of national blood stream infection isolates have shown the most common ExPEC lineages to be ST131, ST73, ST69, ST95, ST410, and members of the ST10 complex including ST167 [4,5]. Equally as intriguingly, MDR plasmids are not evenly distributed among these ExPEC lineages but rather their carriage is concentrated in a small number of highly successful, globally disseminated clones [2]. The most successful of these clones is clade C of ST131, the most common cause of MDR blood stream and urinary tract infections worldwide [6]. Other common MDR E. coli lineages include ST69, ST410, ST167, and ST648 [2].
What makes certain clones or lineages of E. coli successful MDR pathogens is an ongoing question. Recent analysis of longitudinal blood stream infection isolates from Norway shows that ST131 strains successfully emerged to be dominant in the absence of MDR plasmid carriage indicating MDR alone is not the driver of their success [5]. Analysis of longitudinal UK isolates shows that MDR alone is not sufficient to drive strains to complete dominance of the epidemiological landscape [4]. Recent evidence suggests an important phenotype that differentiates MDR E. coli from other lineages is their ability to rapidly and asymptomatically colonise the intestinal tract of humans. The COMBAT study of 1,847 people travelling from the Netherlands to Asia, Africa, and South America found that 34% of those travelling acquired an ESBL E. coli in their intestinal tract during their journey, with that number increasing to 75% of those travelling to Asia [7]. Of those colonised in the study, 11% were colonised for up to 12 months. A small scale study of University of Birmingham students found that all participants travelling to Asia became colonised by an ESBL E. coli, with genomic analysis confirming that this was due to acquisition of a new MDR strain and not the resident commensal E. coli becoming MDR [8]. A recent study of medical personnel travelling to Laos sampled travellers in real time to show that every single person was colonised by an MDR E. coli during travel, with colonisation occurring within days after arrival [9]. A very recent study by our group deploying metagenomic sequencing on the COMBAT study samples showed that when people were colonised by an MDR E. coli, there was no detectable impact on diversity or composition of the wider gut microbiome as a result of MDR colonisation [10]. While observational studies have yielded hypothesis-generating data suggesting stain-to-strain competition, this has never been directly tested in vivo.
Studies investigating genetic determinants differentiating MDR E. coli lineages from the rest of the population have also uncovered a number of parallel observations. Studies comparing clade C ST131 to its ancestral population show a collection of adaptive nucleotide substitutions in chromosomal promoter regions associated with the specific plasmid carried by the strain [11], a pattern which has also been seen in ST167 [12]. A high-resolution study of ST131 using a pangenome approach to identify allelic variations in genes found a highly elevated number of nucleotide substitutions in genes involved in mammalian colonisation including anaerobic metabolism, iron acquisition, and adhesins. This pattern was not seen in successful non-MDR ExPEC lineages such as ST73 [13]. Such allelic diversity in anaerobic metabolism genes has also been seen in ST167 and ST648 [12,14], and it was also shown that recombination of new alleles of fhu iron acquisition genes was the key evolutionary event underpinning the emergence the carbapenem-resistant B4/H24RxC clone in the ST410 lineage [15]. We hypothesise that these genetic adaptations may contribute to more effective colonisation of the mammalian gut.
Here, we use a gnotobiotic mouse model of intestinal colonisation to directly test the hypothesis that MDR E. coli can outcompete commensal E. coli via inter-strain competition to establish dominant colonisation of the intestinal tract in vivo in the absence of any antibiotic selection. We demonstrate that an MDR ST131 strain can out-compete a commensal strain to establish intestinal colonisation in gnotobiotic mice. Furthermore, when introduced into the gut of mice pre-colonised with commensal E. coli, MDR ST131 could displace commensal E. coli from the gut to establish dominant colonisation. Displacement of the resident commensal strain occurs within 48 h of cohousing with mice colonised by MDR ST131, with ST131 becoming the dominant strain in all co-housed mice. To understand the biological process underpinning this competition, we use Biolog phenotypic microarray which identifies altered utilisation of carbon sources. Targeted genomic comparison our assayed strains reveals distinct mutational signatures. We further expand this analysis to a dataset of 19,571 genomes representing the full phylogenetic and AMR diversity of E. coli to identify a significant link between metabolism and carriage of AMR genes. We find that MDR lineages of E. coli display an increased nucleotide diversity in genes associated with carbohydrate utilisation which may afford competitive advantage for colonising the mammalian gut compared to commensal E. coli strains.
Results
Non-MDR ExPEC and MDR ExPEC are efficient colonisers of germ-free mice, with both ExPEC outcompeting a commensal isolate in co-colonisation
To determine whether MDR E. coli is able to out-compete non-MDR E. coli in the intestine in vivo, we performed competitive colonisation experiments in germ-free mice using 3 different strains of E. coli: a non-MDR (ST73) commensal strain 822-E8 isolated from a healthy human volunteer, a pathogenic non-MDR ExPEC (ST73) strain F084 isolated from a bacteraemia patient, and a pathogenic MDR ExPEC (ST131) strain F016 isolated from a bacteraemia patient (S3 Table). All 3 strains possessed an equivalent virulence-associated gene profile; however, the MDR strain F016 possessed a greater number of AMR genes (S1 and S2 Figs). Mice were inoculated with 109 colony-forming unit (CFU) of each strain via oral gavage and bacterial growth was measured by enumeration of CFU from the faeces as well as strain specific qPCR. Under these conditions, all strains could individually monocolonise germ-free mice (S3 Fig). Competitive colonisation between strains was investigated by orally gavaging GF mice with a 1:1 ratio of 2 strains in combination (see Fig 1A for combinations) followed by quantification of each strain in the faeces over the subsequent week (Fig 1B–1E). Both F084 and F016 strains out-competed the commensal 822-E8 strain achieving 96.1% and 80.7% of the total growth, respectively, by day 6 post gavage (Fig 1B–1D). Neither F084 nor the MDR F016 was able to out-compete the other with F016 accounting for 47.1% of the growth by day 6 (Fig 1B and 1E).
It is possible that F084 and MDR F016 are out-competing the commensal 822-E8 strain due to phage or production of a secreted toxin. Previous data from our group has shown that an ST131 strain retards growth of both ST73 and ST10 strains in LB broth [16]. Therefore, we grew each strain in LB broth overnight, filter sterilised the spent medium before mixing in a 1:1 ratio with fresh LB, and investigated the growth kinetics of all strains in the presence of supernatant from a competing strain. No strain displayed any impairment in growth in the presence of either autologous or heterologous supernatant indicating that no strains were releasing toxins/phage to kill competing strain (Fig 1F–1H).
MDR ExPEC efficiently displaces an established commensal from the mouse intestinal tract
Next, we aimed to determine whether MDR E. coli could displace commensal E. coli that had already established a colonisation niche within the intestine. Germ-free mice were colonised with a commensal strain 822-E8 by oral gavage and colonisation allowed to stabilise for 1 week before challenging with a second E. coli strain (Fig 2A). When commensal-colonised mice were challenged with MDR strain F016, it rapidly out-competes the commensal strain 822-E8 within 4 days accounting for greater than 60% of the CFU (Fig 2B and 2C). By day 21, the MDR strain F016 accounted for 80.4% of the CFU in the faeces. In contrast, this displacement is not observed when commensal-mice colonised are challenged with the non-MDR strain F084, which results in an equilibrium of 50–50 colonisation of both strains within 4 days and remains equivalently co-colonised at 21 days post-challenge (Fig 2B and 2D). Conversely, when MDR strain F016 monocolonised mice are challenged with commensal strain 822-E8, the commensal is unable to displace F016, accounting for 20.4% of the CFU at day 4 and further diminishing to 15.3% at day 21 (Fig 2B and 2E). These results are not due to changes in commensal colonisation as growth remains stable in the control group throughout the experiment (Fig 2F). Collectively, these data demonstrate that MDR E. coli strain F016 can displace the commensal E. coli strain 822-E8 to establish itself as a dominant coloniser of the gut in vivo.
To determine whether our findings using oral gavage (with large quantities of bacteria) could be replicated in the setting of environmental exposure to MDR E. coli, we monocolonised mice wither either commensal 822-E8 or MDR ExPEC F016. After allowing colonisation to stabilise for 7 days mice were co-housed (Fig 3A). Within 48 h, F016 is observed in the faeces of all mice and becomes the dominant coloniser in all mice by day 4 accounting for nearly 80% of the CFU (Fig 3B). This colonisation dominance is persistent and sustained for many weeks (Fig 3B). Of note, F016 monocolonised mice did acquire a low level of colonisation by 822-E8 following co-housing, but F016 remained the dominant strain with 822-E8 accounting for less than 20% of faecal CFUs. This MDR F016 phenotype is not due to strain-specific differences in inherent ability to colonise the mouse gut, as the total CFU recovered from F016 monocolonised mice is equivalent to 822-E8 monocolonised mice (Fig 3C and 3D). The host response can influence the ability of certain pathogens to establish colonisation. The host immune response in the small intestine and colon of colonised mice was analysed histologically revealing no evidence of inflammatory cell recruitment or tissue damage in response to colonisation by F016 (S5 Fig). Expression of 11 cytokines was assayed by probe-based qPCR from tissue collected from the small intestine, caecum, and colon. Expression of the assayed cytokines in the small intestine and colon revealed few differences between colonisation conditions (S6–S8 Figs).
Given that our displacement findings cannot be driven by classical virulence genes (S1 Fig) nor differences in host response, we sought to perform a comprehensive comparative genomic and phenotypic analysis of the E. coli strains competed in vivo to identify the factors underpinning F016’s colonisation success.
The MDR ST131 strain F016 displays altered carbon source utilisation alongside numerous polymorphisms in metabolic genes
Stains F084, F016, and 822-E8 were subject to Biolog analysis using commercially available Phenotype MicroArray plates. Significant differences were only detected in the PM1 carbon utilisation plate. Of the 96 conditions tested in PM1, 34 showed statistical (<0.05 P-value) differences between the 3 strains (Fig 4A). The data shows that the MDR strain F016 is less efficient at utilising N-Acetyl-Glucosamine, trehalose, mannose, xylose, fructose, maltose, melibiose, methyl-D-galactoside, lactose, and lactulose than strains F084 and 822-E8. Conversely, strain F016 is far more efficient at utilising keto-butyric acid, sucrose, L-glutamine, hydroxy-butyric acid, D and L-threonine, and glyoxylic acid. Additionally, both F084 and F016 were significantly more efficient at utilising both propionic and mucic acid. To investigate the genomic factors underpinning these phenotypic differences, we examined genetic polymorphisms in 73 genes associated with the metabolites above. Using the commensal 822-E8 strain as a reference revealed that there was a very low number of polymorphisms in the F084 strain, while the F016 strain displayed a far higher number of mutations. The majority of mutations were single nucleotide polymorphisms (SNPs) that caused synonymous mutations or missense mutations (Fig 4E and 4F). In addition to mutational profiling, the strains were subject to a functional pangenomic analysis. A pangenome of the 3 strains was constructed, the resulting pangenome reference file was functionally annotated using the eggNOG database with the eMapper utility. This analysis identified 712 genes uniquely present and 466 genes uniquely absent in the F016 strain. The majority of these genes were annotated with “S -function unknown” (238/712 and 98/466) alongside a significant number with no functional annotation (106/712 and 60/466). Of the uniquely present genes, the most abundant COG categories were “Replication, recombination and repair,” “Transcription,” and “Carbohydrate metabolism” with 62, 50, and 37 genes, respectively. While the uniquely absent genes were abundant with “Replication, recombination and repair,” “Cell membrane biogenesis,” and “Transcription” with 40, 39, and 25 genes, respectively. Based on phenotypic differences, we focussed on the carbon metabolism genes that were differentially present in F016 revealing that it possessed multiple genes such as sgc operon which has been putatively annotated as a sugar uptake and isomerization operon. All 3 strains possessed the fucA gene; however, F016 possessed a duplicate allele. Moreover, the F016 strain possessed scrB and scrK for sucrose metabolism, as well as mngA and mngB, which are involved in mannose uptake and utilisation.
ST131 displays reduced selective pressure on select metabolic loci
To explore whether our observations could be applied to a wider selection of genomes, we downloaded assemblies for the ST73 and ST131 lineages from (S3 Table). We screened these assemblies for the 73 genes we selected from our phenotypic analysis, extracted gene sequences, and calculated a Tajima’s D value, which is a measurement of selection on a gene with values around 0 indicating an absence of selection. Comparisons between ST73 and ST131 reveal that treC, prpBRDCE, cyaY, yihU, glcB, lacA, and glnA have values closer to 0 than ST73, suggesting these alleles are under reduced selective pressures (Fig 4B). Further examination of the genes identified highlighted other differences between ST73 and ST131. Specifically, yihU is completely absent from ST73 but present in a significant number of ST131 genomes (Fig 4C and 4D). LacYZ, glcB, and prp operon genes display an elevated level of loss in ST131 (Fig 4C and 4D), with lacYZ showing partial gene loss in ST131 (Fig 4 H). While ST73 harbours a partial duplication of garD which is linked to the very high Tajima’s D value for this gene (Fig 4B, 4C, 4E and 4G). ST131 possess 2 allelic versions of treC and prpR evident as 2 identity peaks (Fig 4F). GlcB appears as 2 allelic variants in ST73 but in ST131, there is more diversity with a broader distribution of identity values (Fig 4E and 4F). Together, this data indicate that ST131 lineage is undergoing a complex evolutionary process targeting metabolic capabilities evident as duplications of core metabolic genes as well as allelic variation.
MDR lineages display an increase in genetic diversity of carbohydrate metabolism genes in their accessory pangenome associated with recombination of new variant alleles
We sought to examine whether our genomic observations of ST131 were unique or common to multiple MDR lineages of E. coli. We curated a dataset of 19,571 E. coli genome assemblies encompassing the major E. coli phylogroups (A, B1, B2, D, E, and F/G), representing 20 STs incorporating commensal, ExPEC, and EPEC/EHEC lineages (S17A Fig and S3 Table). Antibiotic resistance genes are concentrated in 5 ExPEC lineages: ST38, ST69, ST131, ST167, and ST648 (S17B Fig). These lineages have a high proportion of their population carrying multiple (>1) resistance genes (ST38: 89.0%, ST69: 80.3%, ST131: 85.7%, ST167: 99.1%, ST648: 93.5%). Pangenomes for the 20 different lineages of E. coli were constructed with Roary v3.10.2 using an identity threshold of 95%, a core gene frequency of 99% with paralog splitting disabled. This setting allows us to specifically look for unique alleles of core genes as those unique alleles then become part of the accessory genome [13]. The host generalist ST10 had the greatest pangenome size with 46,259 gene clusters identified followed by ST131 with 23,857 clusters (S4 Table and S14 Fig). Core genome size was consistent across all lineages averaging 3,777 genes (S4 Table and S15 Fig). There was no correlation between pangenome size and carriage of AMR genes (S13 Fig). Curiously, AMR carriage was significantly negatively associated with the proportion of phage in the pangenome but showed no correlation with recombination (S17C and S17D Fig). Pangenomes were functionally annotated using the eggNOG database with the eMapper utility, functional composition of the pangenome was explored using Clusters of Orthologous Groups (COG) categories. Links between AMR and biological function were explored in the core and accessory genome using linear regressions analysis that revealed a single significant association. Specifically, lineages with a higher proportion of AMR genes displayed an increased number of “Carbohydrate metabolism and transport” genes in their accessory genome (Fig 4I). There were no other significant correlations between COG categories and carriage of AMR genes after correcting for multiple testing. Our data suggest that diversification of carbohydrate metabolic genes is correlated to the acquisition of multidrug resistance in E. coli. We sought to determine the source driving our observed diversity of carbohydrate genes in AMR lineages. We then looked at the recombination plot created for the main MDR E. coli lineage ST131 (S18A Fig) and the main non-MDR ExPEC ST73 (S18B Fig). Our analysis clearly shows recombination occurring in key metabolic loci that does not occur in ST73 and that these loci house metabolism genes occurring as allelic variants in the accessory genome dataset. Together with previously published data, our genomic analysis provides compelling evidence for unique patterns of evolution in metabolism genes within MDR lineages of E. coli.
Discussion
The evolution and global transmission of AMR are a major threat to public health. Genomic identification of antibiotic resistance genes in our global dataset highlights that AMR is concentrated in a number pandemic lineages. This is most evident when examining the average number of resistance genes per genome; ST167, ST648, ST38, and ST131 all display on average in excess of 7 resistance genes per genome. This result is not driven by many resistance genes within a small subpopulation, as ST167, ST648, ST38, and ST131 all have in excess of 80% of their population possessing multiple resistance genes highlighting the success of these lineages. This observation is aligned with numerous other studies that frequently report these populations as major MDR pathogens, confirming previous observations that AMR carriage is not equal across the E. coli population with AMR being concentrated in certain lineages [2].
Successful pandemic clones must transmit from the environment to individuals or between individuals rapidly in order to spread globally. Here, we demonstrate that an MDR ST131 strain can readily colonise new hosts even when those hosts are pre-colonised with commensal E. coli. The invading ST131 becomes the dominant colonising strain in mice, both when the invading strain is introduced artificially via oral gavage and when mice are co-housed. It is important to emphasise that this transmission is occurring in the absence of any antibiotic treatment conferring a selective advantage to ST131. Typically, it has been required that mice are treated with streptomycin to allow E. coli strains to colonise them; however, more recent studies, alongside data presented here, indicate that ST131 strains do not require antibiotic treatment in order to competitively colonise mice [17]. In our study, the ST131 out-colonises another ExPEC strain of the common ST73 lineage. This implies that ST131 possesses some mechanism by which it can out-compete commensals that is lacking in another highly successful but non-MDR ExPEC lineage ST73. While our study is limited to the use of germ-free mice, our observations mimic those made from human traveller studies which have observed MDR E. coli as frequent colonisers of healthy travellers in the absence of antibiotics [7–9]. Within household transmission has also been observed for ST131 [18]. Studies of healthy individuals who travel to regions where antibiotic resistance is endemic have reported varying levels of colonisation of between 30% and 70% upon return [7]. Sampling travellers during their trip revealed a much higher rate of colonisation rate of 95%, of which E. coli was the most common colonising bacteria [9]. From our data, the resident commensal strain was not completely displaced, similar observations have been made of human travellers, specifically individuals colonised by MDR E. coli had a recurrence of their original commensal E. coli at the end of their travels [8].
Phenotypic microarray of our colonising strains again pointed to altered utilisation of carbon sources. Subsequent targeted genomic analysis highlighted altered mutational profiles of our assayed strains. We expanded this analysis to multiple MDR lineages of E. coli in comparison to multiple non-MDR ExPEC lineages revealing diversification of carbohydrate metabolic genes in multiple MDR lineages. The genomic signature we have identified is complex and requires further investigation. Previous pangenome analysis identified metabolic loci as being enriched in nucleotide diversity in ST131 compared to other ExPEC, specifically anaerobic metabolic genes were exhibiting increased genetic variation [13]. Our functional pangenome analysis supports these observations, revealing that there is a significant correlation between carriage of AMR genes and genetic diversity in metabolism genes. Specifically, there is increased variation in genes encoding carbohydrate metabolism in lineages with a high rate of antibiotic resistance carriage. Previous analyses have focussed on individual lineages, whereas here, we present data on multiple MDR ExPEC lineages, revealing that metabolic variation is a shared adaptation of MDR E. coli. Experimental evolution studies have identified that E. coli can evolve resistance to antibiotic stress through mutations in core metabolic genes, particularly those involved in carbon and energy metabolism [19]. These observed mutations occur at low frequency and were only detected through sequencing of multiple isolates from a population; however, they were still detectable in datasets of clinical samples demonstrating their relevance. A large-scale bacterial genetic screen to test the effect of allelic variation in key metabolism genes on the ability to colonise and displace commensal E. coli in the mammalian intestinal tract would seem attractive. However, the signal observed in our dataset occurs in multiple genes and pathways and genetically investigating such a polygenic trait is far from trivial. We suggest further investigation of our findings will need to combine classical genetics with long-term and complex experimental studies attempting to recapitulate MDR clone evolution.
Collectively, our data demonstrate that MDR E. coli is highly capable of host intestinal colonisation, displacing resident commensal E. coli to become the dominant strain and readily transmitting between hosts. Our genomic analysis implicates metabolism as a pivotal factor in the evolution of AMR linked to the incredible gut colonisation ability of MDR E. coli.
Methods
Ethics statement
Animal protocols were reviewed and approved by the University of Calgary Animal Care Committee (approved protocol numbers AC17-0090 and AC19-0139) and animal experiments were conducted in accordance with Canadian Council on Animal Care guidelines.
Mouse colonisation
Mouse colonisation experiments were conducted at the International Microbiome Centre, University of Calgary, Canada. Germ-free mice were bred and maintained in flexible film isolators in our axenic breeding facility, and germ-free status was confirmed by a combination of Gram staining, Sytox green DNA staining, anaerobic and aerobic culture, and 16S rRNA gene amplicon sequencing from faeces were maintained in isocages in our gnotobiotic facility. Germ-free C57BL/6 mice were colonised with 109 CFU of bacteria via oral gavage (see S3 Table and S4 and S5 Figs for details of strains used for colonisation) and maintained in sterile isocages in our gnotobiotic facility throughout the duration of experiments. Bacterial colonisation was monitored by CFU enumeration on UTI Chromogenic Agar (Thermo) from faecal pellets as well as by DNA extraction and strain-specific probe-based qPCR. DNA from faecal pellets was extracted using the MagMAX Microbiome Ultra Nucleic Acid isolation kit (Thermo) on a KingFisher Flex instrument (Thermo) following manufacturer’s instructions. Strain-specific primers and probes were designed to target unique genes (822-E8: clpP, F084: lon, F016: prtR–S4 Table) identified from genome data. Probes were manufactured by Integrated DNA Technologies (IDT). Reactions were performed using PrimeTime Gene Expression master mix (IDT) on a QuantStudio 1 system (Thermo). Reactions were performed following manufacturer’s recommended parameters: 3 min at 95*C, 40 cycles of 15 s at 95*C, 1 min at 60*C, fluorescence readings taken at the end of the extension stage. A standard curve was generated from a bacterial culture of known CFU.
In vitro supernatant cultures
Bacteria were grown in LB broth to an OD600 of between 0.4 and 0.6, cultures were pelleted at 4,000 rpm for 15 min. The supernatant was passed through a 0.22 μm filter, the filtrate was diluted in a 1 to 1 ratio with fresh LB. Bacteria were inoculated into the supernatant mixture and grown in a 96-well plate at 37°C with growth measured by OD600 readings at 10-min intervals by a Spark Microplate Reader (Tecan).
Functional metabolic analysis
Strains (F084, F016, and 822 E8) were grown on LB-NaCl agar (5 g/L Yeast Extract (Melford, Y20020-500.0), tryptone 10 g/L (Melford, T60060-500.0), and agar (Melford, A20020-500.0)) for 16 h. Inoculations were set up following manufacturer’s instructions with below modifications. Biomass was removed with a cell scraper and suspended in inoculation fluid 0a (IF-0a –Techno-path, 72268—PM IF-0a GN/GP Base (1.2×) 125 ml) to an O.D.600 of 0.185 +/− 0.05. This was then diluted 1:6 with IF-0a containing 1.4% (v/v) of Biolog Redox Dye Mix A (100×, Techno-path, 74221). To each well of each PM1 plate (Techno-path, 12111), 100 μl of this suspension was added and strains incubated for up to 48 h at 37°C, static, in a OmniLog PM system (imaging every 15 min). Data was extracted using Biolog softwares (conversion of D5E to OKA: D5E_OKA Data File Converter v1.1.1.15 and extraction of raw kinetic data using PM analysis software: Kinetic V1.3). Statistically relevant results were identified using a one-way ANOVA with a P-value threshold of 0.05.
Genomic dataset curation, pangenome construction, and AMR gene detection
A total of 20 lineages or sequence types (STs) were selected from the literature with a focus on ExPEC lineages but also including EHEC and EPEC clones. This resulted in a dataset of 19,571 E. coli genome sequences encompassing all the E. coli phylogroups (A, B1, B2, D, E, and F/G) (S1 and S2 Tables) [10.6084/m9.figshare.c.6147189]. The earliest samples with reliable metadata were from 1980; however, the majority was sequenced in recent decades. Humans represented the major source niche for all lineages except ST117 for which poultry was the major niche. This dataset contained samples from multiple countries; however, Europe and North America accounted for the majority.
Genome assemblies for each lineage were downloaded from Enterobase [20] using a custom python script [https://github.com/C-Connor/EnterobaseGenomeAssemblyDownload]. Duplicated assemblies were identified using Mash v1.1.1 [21] to estimate genome similarity, a custom R script then removed isolates with a Mash distance of 0 [https://github.com/C-Connor/MashDistDeReplication]. Dendrograms of Mash distances were also constructed and examined for outlier genomes that were not part of a larger cluster. The remaining genome files were annotated with Prokka v1.12 [22] and pangenomes were constructed with Roary v3.10.2 [23] using a 95% identity threshold, a 99% core genome threshold, paralog splitting was disabled, and a core genome alignment was produced using MAFFT. AMR genes were detected using Abricate v 0.8 [https://github.com/tseemann/abricate] with the Resfinder-2018 database [24], results were filtered to remove hits with less than 80% resistance gene coverage. A phylogeny of the whole dataset was constructed using MashTree v0.36.2 [25] and visualised in iTOL [26].
Targeted genomic comparisons
Genes of interest were selected based on their involvement in the utilisation of Biolog metabolites. Short read data for F084 and F016 strains was mapped to the commensal 822-E8 strain using Snippy 4.6.0 (https://github.com/tseemann/snippy). Reference gene sequences from (K12 MG1655 U00096.3) were downloaded from NCBI and genome assemblies were screened using Abricate. Gene seqeuences were extracted from each assembly using extract_genes_ABRricate.py (https://github.com/boasvdp/extract_genes_ABRicate). Sequences were aligned with MAFFT 7.487 using default parameters. Tajima’s D measurements from the alignments were calculated in R using the packages ape 5.7.1 39 and pegas 1.2
Pangenome functional annotation
Pangenome reference Fasta files produced by Roary were functionally annotated using emapper-1.0.3-3-g3e22728 [27] based on eggNOG orthology data [28]. Sequence searches were performed using DIAMOND [29]. Functional annotation data was combined with the gene presence absence matrix and analysed in R v4.0.3. To examine if there was any association between functional composition of the accessory pangenome or core genome linear regression was performed between carriage of AMR (as a proportion of the population with 2 or more AMR genes) and individual COG categories, correcting for multiple testing.
Supporting information
Abbreviations
- AMR
antimicrobial resistance
- CFU
colony-forming unit
- ESBL
extended spectrum beta-lactamase
- ExPEC
extra intestinal pathogenic Escherichia coli
- IDT
Integrated DNA Technologies
- MDR
multidrug resistance
- SNP
single nucleotide polymorphism
- ST
sequence type
Data Availability
All relevant data are within the paper and its Supporting Information files. All genomic data used in this study can be found in a dedicated figshare repository 10.6084/m9.figshare.c.6147189.
Funding Statement
This work was funded by a Wellcome Trust funded MIDAS PhD studentship awarded to CC (Grant number 203821/Z/16/A). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Poirel L, Madec J-Y, Lupo A, Schink A-K, Kieffer N, Nordmann P, et al. Antimicrobial Resistance in Escherichia coli. Microbiol Spectr. 2018;6. doi: 10.1128/microbiolspec.ARBA-0026-2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Dunn SJ, Connor C, McNally A. The evolution and transmission of multi-drug resistant Escherichia coli and Klebsiella pneumoniae: the complexity of clones and plasmids. Curr Opin Microbiol. 2019;51:51–56. [DOI] [PubMed] [Google Scholar]
- 3.Denamur E, Clermont O, Bonacorsi S, Gordon D. The population genetics of pathogenic Escherichia coli. Nat Rev Microbiol. 2021;19:37–54. [DOI] [PubMed] [Google Scholar]
- 4.Kallonen T, Brodrick HJ, Harris SR, Corander J, Brown NM, Martin V, et al. Systematic longitudinal survey of invasive Escherichia coli in England demonstrates a stable population structure only transiently disturbed by the emergence of ST131. Genome Res. 2017. doi: 10.1101/gr.216606.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gladstone RA, McNally A, Pöntinen AK, Tonkin-Hill G, Lees JA, Skytén K, et al. Emergence and dissemination of antimicrobial resistance in Escherichia coli causing bloodstream infections in Norway in 2002–17: a nationwide, longitudinal, microbial population genomic study. Lancet Microbe. 2021;2:e331–e341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Banerjee R, Johnson JR. A new clone sweeps clean: the enigmatic emergence of Escherichia coli sequence type 131. Antimicrob Agents Chemother. 2014;58:4997–5004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Arcilla MS, van Hattem JM, Haverkate MR, Bootsma MCJ, van Genderen PJJ, Goorhuis A, et al. Import and spread of extended-spectrum beta-lactamase-producing Enterobacteriaceae by international travellers (COMBAT study): a prospective, multicentre cohort study. Lancet Infect Dis. 2017;17:78–85. [DOI] [PubMed] [Google Scholar]
- 8.Bevan ER, McNally A, Thomas CM, Piddock LJV, Hawkey PM. Acquisition and Loss of CTX-M-Producing and Non-Producing Escherichia coli in the Fecal Microbiome of Travelers to South Asia. MBio. 2018;9. doi: 10.1128/mBio.02408-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kantele A, Kuenzli E, Dunn SJ, Dance DAB, Newton PN, Davong V, et al. Dynamics of intestinal multidrug-resistant bacteria colonisation contracted by visitors to a high-endemic setting: a prospective, daily, real-time sampling study. Lancet Microbe. 2021;2:e151–e158. doi: 10.1016/S2666-5247(20)30224-X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Davies M, Galazzo G, van Hattem JM, Arcilla MS, Melles DC, de Jong MD, et al. Enterobacteriaceae and Bacteroidaceae provide resistance to travel-associated intestinal colonization by multi-drug resistant Escherichia coli. Gut Microbes. 2022;14:e2060676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.McNally A, Oren Y, Kelly D, Pascoe B, Dunn S, Sreecharan T, et al. Combined Analysis of Variation in Core, Accessory and Regulatory Genome Regions Provides a Super-Resolution View into the Evolution of Bacterial Populations. PLoS Genet. 2016;12:e1006280. doi: 10.1371/journal.pgen.1006280 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zong Z, Fenn S, Connor C, Feng Y, McNally A. Complete genomic characterization of two Escherichia coli lineages responsible for a cluster of carbapenem-resistant infections in a Chinese hospital. J Antimicrob Chemother. 2018;73:2340–2346. [DOI] [PubMed] [Google Scholar]
- 13.McNally A, Kallonen T, Connor C, Abudahab K, Aanensen DM, Horner C, et al. Diversification of Colonization Factors in a Multidrug-Resistant Escherichia coli Lineage Evolving under Negative Frequency-Dependent Selection. MBio. 2019;10:e00644–e00619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schaufler K, Semmler T, Wieler LH, Trott DJ, Pitout J, Peirano G, et al. Genomic and Functional Analysis of Emerging Virulent and Multidrug-Resistant Escherichia coli Lineage Sequence Type 648. Antimicrob Agents Chemother. 2019;63:e00243–e00219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Feng Y, Liu L, Lin J, Ma K, Long H, Wei L, et al. Key evolutionary events in the emergence of a globally disseminated, carbapenem resistant clone in the Escherichia coli ST410 lineage. Commun Biol. 2019;2:322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cummins EA, Moran RA, Snaith AE, Hall RJ, Connor CH, Dunn SJ, et al. Loss of type VI secretion systems in multi-drug resistant Escherichia coli clones. bioRxiv. doi: 10.1101/2023.03.28.534550 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Sarkar S, Hutton ML, Vagenas D, Ruter R, Schüller S, Lyras D, et al. Intestinal Colonization Traits of Pandemic Multidrug-Resistant Escherichia coli ST131. J Infect Dis. 2018;218:979–990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Johnson JR, Clabots C, Kuskowski MA. Multiple-host sharing, long-term persistence, and virulence of Escherichia coli clones from human and animal household members. J Clin Microbiol. 2008;46:4078–4082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lopatkin AJ, Bening SC, Manson AL, Stokes JM, Kohanski MA, Badran AH, et al. Clinically relevant mutations in core metabolic genes confer antibiotic resistance. Science. 2021;371:eaba0862. doi: 10.1126/science.aba0862 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhou Z, Alikhan N-F, Mohamed K, Fan Y, Achtman M. The EnteroBase user’s guide, with case studies on Salmonella transmissions, Yersinia pestis phylogeny, and Escherichia core genomic diversity. Genome Res. 2020;30:138–152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, Koren S, et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 2016;17:132. doi: 10.1186/s13059-016-0997-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–2069. doi: 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
- 23.Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31:3691–3693. doi: 10.1093/bioinformatics/btv421 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O, et al. Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother. 2012;67:2640–2644. doi: 10.1093/jac/dks261 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Katz LS, Griswold T, Morrison SS, Caravas JA, Zhang S, den Bakker HC, et al. Mashtree: a rapid comparison of whole genome sequence files. J Open Source Softw. 2019;4:doi: 10.21105/joss.01762 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49:W293–W296. doi: 10.1093/nar/gkab301 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, von Mering C, et al. Fast Genome-Wide Functional Annotation through Orthology Assignment by eggNOG-Mapper. Mol Biol Evol. 2017;34:2115–2122. doi: 10.1093/molbev/msx148 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Huerta-Cepas J, Szklarczyk D, Forslund K, Cook H, Heller D, Walter MC, et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 2015;44:D286–D293. doi: 10.1093/nar/gkv1248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12:59–60. doi: 10.1038/nmeth.3176 [DOI] [PubMed] [Google Scholar]