Abstract
Long-term colonization of the gut microbiome by carbapenemase-producing Enterobacteriaceae (CPE) is a growing area of public health concern as it can lead to community transmission and rapid increase in cases of life-threatening CPE infections. Here, leveraging the observation that many subjects are decolonized without interventions within a year, we used longitudinal shotgun metagenomics (up to 12 timepoints) for detailed characterization of ecological and evolutionary dynamics in the gut microbiome of a cohort of CPE-colonized subjects and family members (n = 46; 361 samples). Subjects who underwent decolonization exhibited a distinct ecological shift marked by recovery of microbial diversity, key commensals and anti-inflammatory pathways. In addition, colonization was marked by elevated but unstable Enterobacteriaceae abundances, which exhibited distinct strain-level dynamics for different species (Escherichia coli and Klebsiella pneumoniae). Finally, comparative analysis with whole-genome sequencing data from CPE isolates (n = 159) helped identify substrain variation in key functional genes and the presence of highly similar E. coli and K. pneumoniae strains with variable resistance profiles and plasmid sharing. These results provide an enhanced view into how colonization by multi-drug-resistant bacteria associates with altered gut ecology and can enable transfer of resistance genes, even in the absence of overt infection and antibiotic usage.
Subject terms: Antimicrobial resistance, Microbiome, Microbial genetics, Microbial ecology, Genome informatics
Longitudinal shotgun metagenomics reveal changes in the gut microbial ecology upon carbapenemase-producing Enterobacteriaceae colonization and decolonization of adult subjects.
Main
The global dissemination of antibiotic resistance genes (ARGs) among pathogenic bacteria is a major public health problem that, if left unaddressed, would reduce efficacy of treatments, elevate costs and increase mortality1. Of particular concern is the spread of carbapenemase-producing Enterobacteriaceae (CPE)2, with their ability to degrade carbapenems often acquired from plasmids encoding carbapenemases3, thus rapidly endangering these antibiotics of last resort4. In addition to life-threatening infections, asymptomatic gut colonization by CPE is increasingly common5, creating ARG transmission reservoirs6. While previous studies have focused on epidemiology2,7 and molecular aspects4,8, the natural history of gut colonization including ecological and evolutionary changes linked to ARG transmission or CPE decolonization remains unexplored.
Recent studies into host–microbiome–pathogen interactions have provided important insights into pathogenesis9, immune response10 and treatment avenues11 for various pathogens. Some have leveraged metagenomics to temporally track microbiomes and understand ecological responses to overt infection11. As microbes have rapid turnover, whole-genome sequencing (WGS) of pathogens has helped characterize intra-host evolution during chronic infections, identifying key enzymes for host adaptation or colonization12. Alternatively, deep metagenomic sequencing can reveal nucleotide-level variation for many species of interest13, shedding light on strain-level dynamics in microbiomes. This approach has characterized stable microbiomes in healthy individuals as well as dynamic changes during faecal microbiota transplantation14. Asymptomatic gut colonization by CPE strains presents a unique opportunity to study an intermediate phenomenon, that is, strain competition with commensals, and associated ecological and evolutionary adaptations, without overt infection or disease.
In this Letter, we conducted longitudinal gut microbial analysis for a cohort of index subjects (n = 29, CPE colonized at recruitment) and their family members (n = 17, not CPE colonized) with up to 12 timepoints within a year, to obtain multi-scale15 (microbiome composition-, strain- and gene-level) characterization of ecological and evolutionary changes during CPE colonization. On the basis of deep shotgun metagenomics of stool DNA, we observed distinct ecological shifts marked by recovery of diversity and key commensals in association with CPE decolonization. CPE colonization was marked by elevated but unstable Enterobacteriaceae abundances, which exhibited specific strain-level dynamics for different species (Escherichia coli and Klebsiella pneumoniae). Comparative analysis with WGS data from CPE isolates (n = 159) identified the presence of highly similar E. coli and K. pneumoniae strains with variable resistance profiles and plasmid sharing. These results provide an enhanced view into how colonization by multi-drug-resistant bacteria associates with altered gut ecology and can enable transfer of resistance genes, even without overt infection.
Results
Ecological shifts and recovery after CPE decolonization
Leveraging the observation that CPE carriage is resolved within 3 months, with 98.5% probability within a year for our cohort16 (longer for other cohorts17), we tracked gut microbiome composition in these individuals for a year to understand ecological changes associated with decolonization (12 timepoints, 361 samples in total; Extended Data Table 1 and Supplementary File 1). Specifically, stool samples were obtained from hospital patients who screened positive for CPE carriage (n = 29, index subjects) and non-CPE-colonized family members (n = 17, home-environment-matched controls) and characterized via deep shotgun metagenomics (>50 million 2 × 100 bp reads; Methods). Principal coordinates analysis (PCoA) with average-linkage clustering based on taxonomic profiles identified multiple distinct community configurations (I, II, III and IV), where CPE-positive samples (from stool culture and quantitative PCR16) were less common in configurations I and II, and more common in configurations III and IV (Fig. 1a, Extended Data Fig. 1 and Supplementary File 2). This statistically significant shift of CPE-positive samples along PCoA1 (Wilcoxon P value < 1.4 × 10−8; Extended Data Fig. 2a) is defined by a gradient of abundances strongly correlated with Escherichia (negative, that is, more abundant in configuration IV samples) and Bacteroides (positive; Extended Data Fig. 2b). A similar shift was observed when comparing taxonomic profiles for configuration IV versus configuration I (Supplementary File 3). Interestingly, while configuration IV has no microbiomes from family members, a few CPE-negative samples from index subjects also cluster here.
Extended Data Table 1.
Stool samples were collected at recruitment (V00), weekly for 4 weeks (V01-V04), monthly for 5 months (V05-V09) and bimonthly for 6 months (V10-V12) when provided by study participants.
Grouping samples on the basis of temporal proximity to CPE clearance highlighted that, while CPE-positive samples have lowest average diversity18, there is a gradual increase around the time of decolonization and post-decolonization, with diversity reaching the higher levels seen in family members after 2 months (Fig. 1b). This pattern was seen even after accounting for potential confounders, including antibiotic usage, hospitalization status, multiple timepoints for an individual, gender and ethnicity, in a linear mixed-effects model (Supplementary Fig. 1 and Methods). We investigated whether CPE colonization alone could explain these changes by computationally subtracting Enterobacteriaceae from taxonomic profiles and recomputing diversity metrics. We noted that richness and Shannon diversity consistently preserved the trend of increasing during decolonization (Supplementary Fig. 2), pointing to a wider microbiome shift besides Enterobacteriaceae.
Shifts in diversity during decolonization were also reflected in microbiome similarity, with Bray–Curtis distances to family members being highest in CPE-positive samples, reducing during and post-decolonization towards baseline values seen among family members (Extended Data Fig. 3). These results highlight the ecological shift associated with CPE colonization that resolves post-decolonization, but with residual effects in some individuals.
To identify species associated with decolonization, we conducted differential-abundance analysis based on CPE status (Methods and Supplementary File 2). While most Enterobacteriaceae species were not differentially abundant, K. pneumoniae2,4 had strong association with CPE-positive status (Fig. 1c). Only one other species (Bifidobacterium breve) was enriched in CPE-positive samples, while seven species were depleted in CPE-negative samples. These included important commensals known to reduce gut inflammation through diverse pathways, including Bacteroides dorei (decreasing gut microbial lipopolysaccharide production19), Faecalibacterium prausnitzii (butyrate production20) and Bifidobacterium (bifidum, pseudocatenulatum; inhibition of NF-κB activation21), and could thus aid CPE decolonization22.
Differential pathway analysis based on CPE status provided further supporting evidence that key inflammatory pathways (for example, sulphate reduction) are enriched during colonization, highlighting their role in the process (Extended Data Fig. 4 and Supplementary File 4). Furthermore, aerobic respiration and oxidative phosphorylation pathways (for example, pentose phosphate) were enriched during CPE colonization consistent with a model of gut oxygenation proposed previously23. These results were recapitulated after excluding Enterobacteriaceae from functional profiles, emphasizing that they are not solely from CPE colonization and have substantial contributions from other species (Supplementary Fig. 3). Micro-aerophilic niches for Enterobacteriaceae due to antibiotic treatment provide another potential explanation24, as antibiotic usage was common (preceding ~25% timepoints; Supplementary File 1). Notably, while ARGs were enriched in gut microbiomes for CPE-positive timepoints, no significant differences were observed between index subjects and family members at other timepoints (Supplementary Fig. 4).
While Enterobacteriaceae species were enriched in CPE-positive samples relative to CPE-negative samples (Wilcoxon P value < 3.5 × 10−5), index subjects at CPE-negative timepoints also showed enrichment compared with family members (Wilcoxon P value = 0.01; Extended Data Fig. 5). Overall, Enterobacteriaceae composition varied across individuals, with E. coli and K. pneumoniae being most common, but other Escherichia, Klebsiella, Enterobacter and Proteus species also being moderately abundant across individuals and timepoints (Extended Data Fig. 5). The abundant Enterobacteriaceae species in an individual were not necessarily the CPE species (for example, subject 0505-T, timepoints 1–3). Enterobacteriaceae profiles also shifted rapidly (for example, in 0457-T and 0512-T, timepoint 6) with higher temporal variation in index subjects (Wilcoxon P value < 0.05; Fig. 1d and Extended Data Fig. 5). Together, these results suggest that CPE colonization is maintained by an altered, dynamic pro-inflammatory microenvironment supporting Enterobacteriaceae, which resolves concurrently with microbiome recovery25.
Distinct strain-level dynamics of Enterobacteriaceae species
Analysing the deep metagenomic data at higher resolution, we investigated strain-level dynamics for the two most prevalent Enterobacteriaceae (E. coli and K. pneumoniae). Read mapping to references identified single-nucleotide variants (SNVs), and allele frequency modes were used with a classical population genetics approach to determine strain count13 (Methods and Supplementary Fig. 5). For 53% of samples (E. coli, 63%; K. pneumoniae, 38%) where a species was detected (abundance >0.1%), read coverage was sufficient to identify strain count (one, two or multiple strains, otherwise low coverage; Fig. 2a,b and Supplementary Fig. 6). As expected for a gut commensal26, E. coli was at comparable frequencies in index subjects (86%) and family members (90%), and more frequently detected relative to K. pneumoniae (Fisher’s exact P value < 5 × 10−20; Fig. 2a). K. pneumoniae was, however, more frequently found in index subjects (70%) relative to family members (39%), consistent with the hypothesis that a pro-inflammatory environment might be facilitating colonization (Fisher’s exact P value < 2 × 10−9; Fig. 2b).
Among classified samples, E. coli was frequently single strain in index patients (44%) while family members often had multiple strains (44%; Supplementary Fig. 6), pointing to clonal dominance in pro-inflammatory environments. A few individuals maintained a single strain over several months (for example, 0505-T, 1667-T and 0506-T), indicating that this can be a stable state (Fig. 2a). For K. pneumoniae despite sporadic detection, the multi-strain state was more common (49%), consistent with the hypothesis that even in pro-inflammatory environments, no clone outcompetes others27 (Fig. 2b and Supplementary Fig. 6). Overall, in agreement with previous observations (Fig. 1d and Extended Data Fig. 5), Enterobacteriaceae strain compositions were highly variable temporally.
Capturing transition frequencies between strain compositions as first-order Markov models, we noted distinct patterns for E. coli and K. pneumoniae, and index subjects versus family members (Fig. 2c and Methods). For example, E. coli colonization is more likely to stay single strain for index patients (69%) versus family members (57%) and single-strain K. pneumoniae colonization (52%; Fig. 2c). When E. coli is not detected, this state is more likely to be maintained in index subjects (47%) versus family members (22%; Fig. 2c). Overall, the Markov model predicts that E. coli is maintained as single strains (index subjects, 41%; family members, 39%). Contrastingly, K. pneumoniae frequently converges to not detected state in subjects (42%) and family members (75%). Grouping classes of strain detection, we tested if transition probabilities differed across cohorts and species (Fig. 2d). For E. coli, transition probabilities were not significantly different between cohorts (Fisher’s exact P value > 0.05; Fig. 2d). In contrast, driven by stark detected/not detected patterns for K. pneumoniae, index subjects had significantly different transition probabilities compared with family members for various groupings involving the not detected state (‘All classifications’, ‘Detected versus not detected’ and ‘Fixed versus variable strains’; Fisher’s exact P value < 10−2; Fig. 2d). These results further highlight differences in strain-level dynamics for Enterobacteriaceae in the potentially pro-inflammatory gut microbiome milieu of index subjects.
Substrain variation and plasmid sharing in Enterobacteriaceae
Samples determined to have a single strain can still exhibit substrain variation relative to this genomic background, similar to viral quasi-species. Characterizing distribution of such intra-host variations across genes could identify adaptive changes important for CPE colonization, similar to recent studies with mouse models and strain isolates28,29. To analyse standing variation in Enterobacteriaceae, we identified low-frequency (<50%) SNVs in single-strain timepoints (E. coli, 30,155; K. pneumoniae, 13,061), annotating them for function-altering changes to reveal adaptive signatures in Enterobacteriaceae during gut colonization (Supplementary File 5 and Methods). We found 5,919 and 1,787 function-altering changes in E. coli and K. pneumoniae, including several in key polysaccharide utilization and virulence genes (for example, lacZ, lacY and ECIAI39_4258 (putative invasin/intimin protein)) similar to key genes previously identified from isolate WGS as undergoing selection for human gut colonization30 (Table 1). Visualizing function-altering SNVs in polysaccharide-utilization genes, where adaptive mutations can reflect pressures to use diet-derived polysaccharides, identified structural motifs that might be key functionally (Extended Data Fig. 6). Consistent with our study of low-frequency SNVs, most regions bear signatures of purifying selection (dN/dS < 0.5; Extended Data Fig. 6), though identified genes were significantly enriched relative to genome-wide average for non-synonymous SNVs (Table 1).
Table 1.
Gene | Protein | SNV count | P value |
---|---|---|---|
E. coli | |||
ECIAI39_4258 | Putative invasin/intimin protein | 48 | <10−10 |
ECIAI39_0530 | Host specificity protein J of prophage | 34 | <10−13 |
ydcM | Putative transposase | 24 | <10−15 |
ECIAI39_1027 | Putative GTP-binding domain | 19 | <10−12 |
ECIAI39_4240 | Putative anti-restriction protein | 18 | <10−14 |
yeeS | Putative DNA repair protein; CP4-44 prophage | 18 | <10−16 |
lacZ | β-d-galactosidase | 17 | <0.05 |
ECIAI39_4893 | Putative tail fibre component K of prophage | 14 | <10−7 |
lacY | Lactose/galactose transporter | 14 | <10−4 |
Rz | Endopeptidase from phage origin (lysis protein) | 13 | <10−9 |
ECIAI39_1018 | Hypothetical protein | 11 | <10−4 |
ECIAI39_4867 | Hypothetical protein | 10 | <10−2 |
yncK | Transposase ORF A | 10 | <10−8 |
K. pneumoniae | |||
---|---|---|---|
KPHS_51490 | Putative transposase | 30 | <10−39 |
KPHS_35720 | Putative transposase | 18 | <10−13 |
KPHS_22580 | Hypothetical protein | 12 | <10−4 |
Top ten genes with most function-altering SNVs (≥10) with low frequencies (allele frequency < 0.5) for E. coli and K. pneumoniae. SNVs in distinct individuals were counted separately, but multiple timepoints were counted as one. P values were computed by performing a binomial test on the observed number of SNVs given the probability of a gene acquiring SNVs after accounting for (1) the probabilities of acquiring mutations across all genes and (2) the codon composition of the genes where the respective SNVs are found. Bonferroni correction (α = 0.05) was applied.
To study SNVs further relative to CPE decolonization, time-series information was used to cluster co-varying SNVs (Methods). Interestingly, in some subjects multiple clusters were identified, revealing distinct substrain lineages differing by few hundred SNVs genome-wide (for example, 1674-T; Fig. 3a,b, Extended Data Figs. 7 and 8 and Supplementary Fig. 7). For subject 1674-T, both E. coli and K. pneumoniae have a dominant cluster during CPE-positive timepoints (V00–V03) that match SNV signatures seen in corresponding CPE isolates (Shared and Cluster 1 SNVs; Fig. 3c,d and Supplementary Fig. 7). The subdominant cluster (Cluster 2; Fig. 3c,d and Supplementary Fig. 7; possibly sublineage of Cluster 1) has SNV signature not seen in CPE isolates but detected post-decolonization (V05), indicating that these lineages are discordant for CPE status despite high genomic similarity. For both species, decolonization coincides with appearance of distinct strains with >1,000 SNVs relative to CPE strains (V05 unique; Fig. 3c,d and Supplementary Fig. 7). Despite shared strain patterns for E. coli and K. pneumoniae, they exhibited dissimilar relative abundance trends, with E. coli being reduced at decolonization (V05) while K. pneumoniae peaks at this point (Fig. 3e,f). While a few dense trajectories of co-varying SNVs were detected in other individuals, many SNVs varied independently of these clusters (Extended Data Figs. 7 and 8). These results suggest that Enterobacteriaceae may share substrain dynamics during CPE decolonization while having species-specific ecological properties.
Leveraging availability of multiple timepoints across subjects, we identified SNVs whose frequencies varied notably over time (>30%). These were overlapped across subjects to identify SNVs that recurrently vary (Extended Data Table 2 and Supplementary File 6), identifying several polysaccharide utilization (lacZ, lacI and treA), pyruvate metabolism (pflB and pykF) and protein synthesis (dnaK, 30S and 50S ribosomal subunits) genes implicated in adaptation to nutrient limitation31, antibiotics32 and environmental stress33 conditions. Several genes were common to E. coli and K. pneumoniae (srmB, pnp, nlpI and pheT), indicating that similar selection constraints may act on strains for both species. The lacZ gene was highlighted as having the most recurrent, frequency-varying SNVs (n = 12), with all SNVs occurring in surface-exposed regions (Fig. 3g). Comparing accessible surface area of variant residues versus others highlighted that variants are significantly more exposed to solvent (mean 73.1Å2 versus 34.1Å2, Welch’s t-test P value < 0.01). Many SNVs occurred in the activating interface, a region near amino-terminus required for tetramerization34, indicating that they could impact lacZ function via complex formation dynamics.
Extended Data Table 2.
Among genetic features prominent in CPE strains from subject 1674-T, variants in polysaccharide utilization genes were common as discussed previously (Fig. 3g). Additionally, plasmid sequence analysis identified two shared plasmids between E. coli and K. pneumoniae CPE isolates (Fig. 3h, Supplementary Fig. 8 and Methods). This included pKPC2, recently identified in hypervirulent carbapenem-resistant K. pneumoniae from Singapore, and harbouring blaKPC-2, the carbapenemase that provided CPE designation for these isolates16. Additionally, pMS6192B was shared between all E. coli and K. pneumoniae from the first visit (V00; Fig. 3h). The shared plasmids have total sequence length >140 kbp and no SNVs across species, indicating a recent common source. Plasmid transfer experiments with pKPC2 between E. coli and K. pneumoniae suggest moderate conjugation frequency under in vitro conditions (~0.1%; Methods). Furthermore, 50% of plasmid-bearing clones (3/6) were observed to have SNV in pKPC2 after 300 generations, defining an upper-bound on divergence of plasmid-bearing isolates having no SNVs being 5 months (binomial P value < 0.05).
Discussion
The availability of metagenomic data for up to 12 timepoints over a year in this study allowed examination of long-term dynamics, enabling microbiome comparisons before and after CPE decolonization in a subject-matched fashion to study microbiome shifts associated with decolonization. This revealed ecological shifts that cannot be explained solely by CPE strain loss (for example, increase in species richness), and the specific taxonomic/functional changes observed point to the role of inflammation in maintaining Enterobacteriaceae-favourable gut environment in index subjects (for example, Pantoea species; Extended Data Fig. 2 and Supplementary File 3). Additionally, our data indicate that this configuration may be unstable in many individuals, opening up the possibility that interventions that reduce gut inflammation directly or via action of probiotics could reduce Enterobacteriaceae and promote CPE decolonization.
Gut inflammation has been known to create a niche for enterics such as Salmonella35, where some species use sulphate, nitrate and tetrathionate as terminal electron acceptors for anaerobic respiration (for example, E. coli). The enriched pathways in CPE-colonized subjects are marked by menaquinol biosynthesis, glycolysis and respiration (tricarboxylic acid cycle), even after computationally subtracting Enterobacteriaceae contributions, indicating that the gut environment in this group is qualitatively different in oxygenation. Additionally, fucose/rhamnose and 1,2-propanediol degradation are enriched during CPE colonization, potentially serving as carbon sources for Enterobacteriaceae such as K. pneumoniae, which demonstrates competitive fitness with oxygen as terminal electron acceptor under such conditions27. The enrichment of pentose phosphate pathway could indicate need for reducing equivalents of NADPH+ to maintain redox conditions or serve as nucleic acid precursors to fuel growth. Overall, the shift in microbial pathways during CPE colonization is independent of Enterobacteriaceae but favours their growth. Further work is needed to understand if this shift is established by gut inflammation (as in colitis36, via direct biomarker measurement, for example, calprotectin) or if diverse factors play a strong role in an individual-specific manner (for example, antibiotics for some subjects24). Notably, while reduction in microbial diversity during CPE colonization could not be attributed solely to antibiotic usage or hospitalization at a timepoint, these could be delayed effects and would need a more controlled study design to explore further.
An alternative strategy for CPE decolonization involves introduction of species depleted in the colonized state (for example, Faecalibacterium prausnitzii or Bifidobacterium bifidum), either as probiotic formulations or through faecal microbiota transplants37. Matching donors to recipients, supplementing missing species or promoting further instability in Enterobacteriaceae abundances based on ecological models could be a promising avenue to explore, similar to studies for Clostridoides difficile38. The observed differences in colonization dynamics for Enterobacteriaceae suggest that decolonization strategies might also have to be species specific. For example, a possible explanation for presence of multiple K. pneumoniae strains is that they are not well adapted for gut colonization but instead opportunistically exploit a niche. Decolonization of K. pneumoniae strains may therefore require elimination of conditions favouring this niche such as inflammation or availability of specific sugars. The presence of single E. coli strains in many subjects supports a model where gut-adapted strains acquire ARGs, and plasmid-targeting strategies might be better suited in this case. Interestingly, our data suggest that human gut microbiomes can harbour multiple strains of commensals such as E. coli (in contrast to observations in mouse studies39), even among non-CPE-colonized family members40 (Fig. 2a). Further studies using high-throughput culturing and single-cell sequencing could help reconstruct strain genomes and unravel factors that determine niche competition41.
Identifying genes that enable CPE gut colonization provides another avenue to target interventions. As shown here, analysis of high-coverage metagenomic data to identify substrain variations can provide promising hypotheses based on in vivo evolution, similar to viral quasi-species analysis42, or mutagenesis-based experiments43. Furthermore, isolation of lineages with distinct advantages in colonizing or avoiding decolonization (for example, cluster 2 in 1674-T), could narrow down genetic features to be investigated in vitro. Finally, the role of gut microbiome as an ARG reservoir, and plasmid sharing across Enterobacteriaceae, is of particular concern. While we cannot definitively conclude that data for 1674-T represent plasmid transfer, these observations and isolated strains serve as important resources for further investigations into plasmid transmission and CPE decolonization.
Methods
Sample collection and CPE classification
A prospective cohort study involving CPE carriers was conducted from October 2016 to February 2018. Study participants were recruited with informed consent from two tertiary healthcare centres in Singapore. CPE carriers were identified by routine collection of rectal swab samples for clinical care and/or infection prevention and control measures, in accordance with local infection control policies. The study received ethics approval from the Singapore National Healthcare Group Domain Specific Review Board 74 (NHG DSRB Reference 2016/00364) before commencement. Stool samples were first collected weekly for 4 weeks, then monthly for 5 months and finally once every 2 months for 6 months. In addition to the CPE-colonized subjects, stool samples from a number of family members were also obtained to provide a control dataset. Samples obtained from index subjects were classified as either CPE positive or CPE negative, on the basis of whether CPE genes (including blaNDM-1, blaKPC, blaOXA-48, blaIMI-1 and blaIMP) were positively identified from Enterobacteriaceae isolates found to be resistant to either meropenem or ertapenem16 (Supplementary Table 1). The presence of CPE-negative samples was used to detect CPE clearance, and samples were further classified on the basis of the amount of time elapsed since clearance, that is, before clearance, within 2 months post-clearance and more than 2 months post-clearance. Owing to the focus on household transmission and CPE carriage, dietary information was not collected in the study.
Isolate sequencing and assembly
DNA for all CPE isolates obtained from stool samples in this study (all subjects, all timepoints) was collected from Tan Tock Seng Hospital and transferred to the Genome Institute of Singapore (GIS) for WGS. Library preparation was performed using the NEBNext Ultra DNA Library Prep Kit for Illumina, and 2 × 151 base-pair sequencing was performed using the Illumina HiSeq 4000. Raw FASTQ reads were processed using in-house pipelines at GIS for de novo assembly with the Velvet assembler44 (v1.2.10), parameters optimized by Velvet Optimiser (k-mer length ranging from 81 to 127), contig scaffolding with Opera45 (v1.4.1) and finishing with FinIS46 (v0.3).
Shotgun metagenomic sequencing
DNA from 361 stool samples was extracted using the PowerSoil DNA Isolation Kit (12888, MoBio Laboratories) with modifications to the manufacturer’s protocol. Specifically, to avoid spin-filter clogging, we extended the centrifugation to twice the original duration, and solutions C2, C3 and C4 were doubled in volume. DNA was eluted in 80 µl of Solution C6. Concentration of DNA was determined by Qubit dsDNA BR assay (Q32853, Thermo Fisher Scientific). For library construction, 50 ng of DNA was resuspended in a total volume of 50 µl and was sheared using Adaptive Focused Acoustics (Covaris) with the following parameters: duty factor of 30%, peak incident power of 450, 200 cycles per burst and treatment time of 240 s. Sheared DNA was cleaned up with 1.5× Agencourt AMPure XP beads (A63882, Beckman Coulter). Gene Read DNA Library I Core Kit (180434, Qiagen) was used for end-repair, A-addition and adapter ligation. Custom barcode adapters were used for cost considerations (HPLC purified, double stranded, first strand: 5′ P-GATCGGAAGAGCACACGTCT; second strand: 5′ ACACTCTTTCCCTACACGACGCTCTTCCGATCT) in replacement of Gene Read Adapter I Set for library preparation. Library was cleaned up twice using 1.5× Agencourt AMPure XP beads (A63882, Beckman Coulter). Enrichment was carried out with indexed primers according to an adapted protocol from Multiplexing Sample Preparation Oligonucleotide kit (Illumina). We pooled the enriched libraries in equimolarity and sequenced them on an Illumina HiSeq 2500 sequencing instrument at GIS to generate 2 × 101 base-pair reads, which yielded around 17.7 billion paired-end reads in total and 49 million paired-end reads on average per library.
Taxonomic and functional profiling
Read quality trimming was performed using famas (https://github.com/andreas-wilm/famas, v0.10,–no-order-check), and microbial reads were identified by mapping and filtering out reads aligned to the human reference genome (hg19) using bwa-mem47 (v0.7.9a, default parameters; >90% microbial reads on average). Taxonomic profiling was done using MetaPhlAn48 (v2.0, default parameters, filtering taxa with relative abundance <0.1%), and functional profiles were obtained with HUMAnN49 (v2.0, default parameters). As a sanity check, we confirmed that species- and genus-level taxonomic profiles were not dominated by taxa that are commonly attributed to reagent or laboratory contamination50 (Supplementary File 2). Average-linkage hierarchical clustering of taxonomic profiles was used to group samples with the number of clusters determined using Akaike information criterion. Sample α-diversity was computed using the Shannon diversity index with the vegan library in R. Differential abundance analysis was performed using LEfSe51 (v1.0.8), as a non-parametric and conservative approach to identify significantly varying taxa and functions across groups52. These results were further validated using Songbird53 (v1.0.3; –epochs 10000 –differential-prior 0.5) analysis with Bonferroni-corrected P value < 0.05. Abundances of ARGs in the metagenomes was computed using a direct read mapping approach implemented in SRST2 (ref. 54) with default parameters and the CARD_v3.0.8_SRST2 database55.
Linear mixed-effects modelling
Linear mixed-effects modelling was conducted using the lmer function from the lme4 package in R. For each model, genus-level Shannon diversity was set as the response variable, with colonization status as the fixed effect and potential confounders (for example, antibiotic usage since last visit, hospitalization status, individual subjects, gender and ethnicity; Supplementary File 1) as random-effect covariates. Residual Shannon diversity values were derived for visualization by subtracting the intercept terms corresponding to random effects.
SNV analysis
Genome assemblies were aligned to their respective reference genomes using nucmer (v3.23, -maxmatch -nosimplify), and consensus SNVs were called using the show-SNVs function in MUMmer56. References for E. coli (NC_011750) and K. pneumoniae (NC_016845.1) were selected to minimize median distance from isolate genomes. Metagenomic SNVs (consensus and low frequency) were identified on the basis of read mapping using bwa-mem47 to the E. coli and K. pneumoniae references (v0.7.10a; soft-clipped reads and reads with more than three or four mismatches for K pneumoniae and E. coli, respectively, were filtered out to avoid mismapped reads) and variant calling with LoFreq57 (v1.2.1; default parameters). Note that our stringent mapping approach restricts to only reads with >96% identity with the reference and thus will typically exclude mismapping of reads from other genomes. Additionally, genomic regions with frequent ambiguous mappings were identified on the basis of isolate sequencing data and metagenome data from samples without target species as determined from taxonomic classification (>5× coverage with E. coli reads on K. pneumoniae genome or vice versa). Calls in these regions that match positions where variants were called between isolate reads and reference sequence (allele frequency >95%) were excluded from downstream analysis. The validity of this pipeline was confirmed by noting that very few K. pneumoniae SNVs (median 2, mean 5.5) were called genome-wide when analysing metagenomes where taxonomic profiling detected few K. pneumoniae reads (ten samples with 107–288 reads). Note that SNVs from such ‘low coverage’ samples are also excluded from further analysis in this study as defined below. To assess the impact of a shared, but potentially divergent, reference on SNV calling, reads were also mapped onto CPE isolate genomes (where available) to call SNVs and compute concordance. Isolate genome-based SNVs were translated to the common reference coordinate system using the UCSC liftover tool58 with chain file generated using flo59 (-fastMap -tileSize=12 -minIdentity=90).
Strain analysis
Metagenomic coverage of samples for E. coli and K. pneumoniae was determined from bwa-mem read mappings using genomeCoverageBed60 (v2.25.0). Samples with too low relative abundance for confident identification (<0.1%) were designated as ‘not detected’, while samples with low median read coverage (<8) were designated as ‘low coverage’. Of the remaining samples, those with >90% of the SNVs at or above an allele frequency of 0.9 were designated as ‘one strain’, exhibiting a unimodal distribution as is classically expected in the single haplotype setting13 (Supplementary Fig. 5a). A k-means clustering approach (based on allele frequency values <0.98, k = 2) was used on other samples to identify ‘two strains’ (silhouette score >0.8, indicating good concordance with two clusters for a bimodal distribution) and ‘multiple strains’ cases where there may be more than two clusters (Supplementary Fig. 5a). Note that this analysis was only used to determine strain ‘states’ (Fig. 2), and the corresponding clusters were not used for downstream haplotype analysis. To confirm metagenomic SNV calling quality and strain designations, ‘one strain’ cases were compared with SNVs from corresponding isolates (where available) and noted to have high precision for both E. coli and K. pneumoniae (>98%; Supplementary Fig. 5b). A first-order Markov model of the transition frequencies between the strain compositions was estimated using the markovchain package in R (ref. 61) (maximum likelihood estimator with Laplace smoothing parameter = 1).
Substrain analysis
SNVs with mean allele frequency >0.9 across timepoints were identified as probably fixed across all strains in a sample. Non-fixed SNVs from ‘one strain’ cases were further annotated for their impact on protein function using SnpEff62 (v4.3). The ratio of the rate of non-synonymous (dN) to synonymous (dS) mutations was calculated using the package Biopython.codonalign.codonseq with the ‘NG86’ method.
Leveraging the availability of multiple ‘one strain’ timepoints in some individuals, non-fixed SNV trajectories were clustered to identify co-varying SNVs that may belong to a common substrain background. Specifically, the DBSCAN algorithm in R (ref. 63) was used to cluster SNV trajectories in selected individuals with multiple ‘one strain’ timepoints (ε = 0.2, minPts = 2n as recommended), and identified clusters were visualized as a sanity check.
Plasmid analysis
A Mash screen search approach was used with PLSDB64 to obtain a list of plasmids that are potentially present in the CPE isolate genomes. The union of all such plasmid sequences was then aligned with isolate genome assemblies to identify plasmids hits with >85% coverage at >95% identity (only alignments >500 bp). Plasmid hits were clustered into groups using hierarchical clustering at 95% identity (hclust function in R, average linkage based on Mash distance65), with the longest plasmid serving as a representative. Only plasmids longer than 10 kbp are included in the figure to avoid spurious/redundant matches to shorter plasmids.
Plasmid conjugation assay
Donor E. coli harbouring the pKPC2 plasmid with a kanamycin selection cassette (MG1655) and recipient K. pneumoniae strains (ATCC13883) were streaked on selective LBA and incubated overnight at 37 °C. Bacterial colonies were resuspended in LB (1 ml), diluted to OD600 0.5 and mixed in a 1:1 ratio and spotted onto 0.22 µm nitrocellulose membrane (Sartorius) placed on top of LBA (20 µl). After 4 hours of incubation at 37 °C, the bacterial mixture was resuspended in 2 ml of PBS, serially diluted and plated on LBA with appropriate antibiotic selection. Kanamycin (50 μg ml−1) and fosfomycin (40 μg ml−1) were used for selection of transconjugants. Plates were incubated at 37 °C overnight, and colonies were enumerated. Conjugation frequency was calculated as the total number of transconjugants per total number of recipients.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
This work was supported by grants from Biomedical Research Council Industry Alignment Fund (IAF311018) to N.N., S.L.C., K.M. and N.O.T., National Medical Research Clinician Scientist Individual Research Grant (CIRG18nov-0034) to K.M., National Medical Research Council Collaborative Grant (NMRC CGAug16C005) to K.M. and N.O.T. and National Medical Research Council Clinician Scientist Award (NMRC/CSA-INV/0002/2016 and MOH-CSAINV18nov-0004) to K.M.
Extended data
Author contributions
N.N., S.L.C., K.M. and N.O.T. conceived and designed the study. K.M. and N.O.T. oversaw the clinical work. A.N. conducted metagenomic library preparation and sequencing under C.K.R.’s supervision. S.L.C. conducted the isolate sequence analysis. J.T.L.K, J.T., D.B. and A.R. coordinated bioinformatic analysis of the metagenomic data under N.N.’s supervision. M.Y. conducted plasmid transfer experiments under Y-H.G.’s supervision. J.T.L.K, J.T., D.B., A.N., A.R., C.K.R. and N.N. wrote the manuscript with inputs from all authors.
Peer review
Peer review information
Nature Microbiology thanks Sean Gibbons and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Data availability
Isolate and shotgun metagenomic sequencing data is available from the European Nucleotide Archive (ENA; https://www.ebi.ac.uk/ena/browser/home) under project accession number PRJEB49334. Source data are provided with this paper.
Code availability
Source code for scripts used to analyse the data are available in a GitHub project at https://github.com/CSB5/CPE-microbiome.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Jonathan T. L. Kang, Jonathan J. Y. Teo, Denis Bertrand.
Extended data
is available for this paper at 10.1038/s41564-022-01221-w.
Supplementary information
The online version contains supplementary material available at 10.1038/s41564-022-01221-w.
References
- 1.von Wintersdorff CJ, et al. Dissemination of antimicrobial resistance in microbial ecosystems through horizontal gene transfer. Front. Microbiol. 2016;7:173. doi: 10.3389/fmicb.2016.00173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Suay-García B, Pérez-Gracia MT. Present and future of carbapenem-resistant Enterobacteriaceae (CRE) infections. Antibiotics. 2019;8:122. doi: 10.3390/antibiotics8030122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Blair JM, Webber MA, Baylay AJ, Ogbolu DO, Piddock LJ. Molecular mechanisms of antibiotic resistance. Nat. Rev. Microbiol. 2015;13:42–51. doi: 10.1038/nrmicro3380. [DOI] [PubMed] [Google Scholar]
- 4.Codjoe FS, Donkor ES. Carbapenem resistance: a review. Med Sci. 2017;6:1. doi: 10.3390/medsci6010001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Schechner V, et al. Asymptomatic rectal carriage of blaKPC producing carbapenem-resistant Enterobacteriaceae: who is prone to become clinically infected? Clin. Microbiol. Infect. 2013;19:451–456. doi: 10.1111/j.1469-0691.2012.03888.x. [DOI] [PubMed] [Google Scholar]
- 6.Penders J, Stobberingh EE, Savelkoul PH, Wolffs PF. The human microbiome as a reservoir of antimicrobial resistance. Front Microbiol. 2013;4:87. doi: 10.3389/fmicb.2013.00087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Nordmann P, Naas T, Poirel L. Global spread of Carbapenemase-producing Enterobacteriaceae. Emerg. Infect. Dis. 2011;17:1791–1798. doi: 10.3201/eid1710.110655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tooke CL, et al. β-Lactamases and β-lactamase inhibitors in the 21st century. J. Mol. Biol. 2019;431:3472–3500. doi: 10.1016/j.jmb.2019.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sun X, et al. Microbiota-derived metabolic factors reduce campylobacteriosis in mice. Gastroenterology. 2018;154:1751–1763.e2. doi: 10.1053/j.gastro.2018.01.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ichinohe T, et al. Microbiota regulates immune defense against respiratory tract influenza A virus infection. Proc. Natl Acad. Sci. USA. 2011;108:5354–5359. doi: 10.1073/pnas.1019378108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Buffie CG, et al. Precision microbiome reconstitution restores bile acid mediated resistance to Clostridium difficile. Nature. 2015;517:205–208. doi: 10.1038/nature13828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lieberman TD, et al. Parallel bacterial evolution within multiple patients identifies candidate pathogenicity genes. Nat. Genet. 2011;43:1275–1280. doi: 10.1038/ng.997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Garud NR, Good BH, Hallatschek O, Pollard KS. Evolutionary dynamics of bacteria in the gut microbiome within and across hosts. PLoS Biol. 2019;17:e3000102. doi: 10.1371/journal.pbio.3000102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chu ND, Smith MB, Perrotta AR, Kassam Z, Alm EJ. Profiling living bacteria informs preparation of fecal microbiota transplantations. PLoS ONE. 2017;12:e0170922. doi: 10.1371/journal.pone.0170922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ferreiro A, Crook N, Gasparrini AJ, Dantas G. Multiscale evolutionary dynamics of host-associated microbiomes. Cell. 2018;172:1216–1227. doi: 10.1016/j.cell.2018.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mo Y, et al. Duration of carbapenemase-producing Enterobacteriaceae carriage in hospital patients. Emerg. Infect. Dis. 2020;26:2182–2185. doi: 10.3201/eid2609.190592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Haverkate MR, et al. Duration of colonization with Klebsiella pneumoniae carbapenemase-producing bacteria at long-term acute care hospitals in Chicago, Illinois. Open Forum Infect. Dis. 2016;3:ofw178. doi: 10.1093/ofid/ofw178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Korach-Rechtman H, et al. Intestinal dysbiosis in carriers of carbapenem-resistant Enterobacteriaceae. mSphere. 2020;5:e00173–20. doi: 10.1128/mSphere.00173-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yoshida N, et al. Bacteroides vulgatus and Bacteroides dorei reduce gut microbial lipopolysaccharide production and inhibit atherosclerosis. Circulation. 2018;138:2486–2498. doi: 10.1161/CIRCULATIONAHA.118.033714. [DOI] [PubMed] [Google Scholar]
- 20.Lenoir M, et al. Butyrate mediates anti-inflammatory effects of. Gut Microbes. 2020;12:1–16. doi: 10.1080/19490976.2020.1826748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Riedel CU, et al. Anti-inflammatory effects of bifidobacteria by inhibition of LPS-induced NF-κB activation. World J. Gastroenterol. 2006;12:3729–3735. doi: 10.3748/wjg.v12.i23.3729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zeng MY, Inohara N, Nuñez G. Mechanisms of inflammation-driven bacterial dysbiosis in the gut. Mucosal Immunol. 2017;10:18–26. doi: 10.1038/mi.2016.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Winter SE, Bäumler AJ. A breathtaking feat: to compete with the gut microbiota, Salmonella drives its host to provide a respiratory electron acceptor. Gut Microbes. 2011;2:58–60. doi: 10.4161/gmic.2.1.14911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Rivera-Chávez F, Lopez CA, Bäumler AJ. Oxygen as a driver of gut dysbiosis. Free Radic. Biol. Med. 2017;105:93–101. doi: 10.1016/j.freeradbiomed.2016.09.022. [DOI] [PubMed] [Google Scholar]
- 25.Chng KR, et al. Metagenome-wide association analysis identifies microbial determinants of post-antibiotic ecological recovery in the gut. Nat. Ecol. Evol. 2020;4:1256–1267. doi: 10.1038/s41559-020-1236-0. [DOI] [PubMed] [Google Scholar]
- 26.Tenaillon O, Skurnik D, Picard B, Denamur E. The population genetics of commensal Escherichia coli. Nat. Rev. Microbiol. 2010;8:207–217. doi: 10.1038/nrmicro2298. [DOI] [PubMed] [Google Scholar]
- 27.Stacy A, et al. Infection trains the host for microbiota-enhanced resistance to pathogens. Cell. 2021;184:615–627.e17. doi: 10.1016/j.cell.2020.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Barreto HC, Sousa A, Gordo I. The landscape of adaptive evolution of a gut commensal bacteria in aging mice. Curr. Biol. 2020;30:1102–1109.e5. doi: 10.1016/j.cub.2020.01.037. [DOI] [PubMed] [Google Scholar]
- 29.Ernst CM, et al. Adaptive evolution of virulence and persistence in carbapenem-resistant Klebsiella pneumoniae. Nat. Med. 2020;26:705–711. doi: 10.1038/s41591-020-0825-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zhao S, et al. Adaptive evolution within gut microbiomes of healthy people. Cell Host Microbe. 2019;25:656–667.e8. doi: 10.1016/j.chom.2019.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Warsi OM, Andersson DI, Dykhuizen DE. Different adaptive strategies in E. coli populations evolving under macronutrient limitation and metal ion limitation. BMC Evol. Biol. 2018;18:72. doi: 10.1186/s12862-018-1191-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hickman RA, Munck C, Sommer MOA. Time-resolved tracking of mutations reveals diverse allele dynamics during Escherichia coli antimicrobial adaptive evolution to single drugs and drug pairs. Front. Microbiol. 2017;8:893. doi: 10.3389/fmicb.2017.00893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Auriol C, Bestel-Corre G, Claude JB, Soucaille P, Meynial-Salles I. Stress-induced evolution of Escherichia coli points to original concepts in respiratory cofactor selectivity. Proc. Natl Acad. Sci. USA. 2011;108:1278–1283. doi: 10.1073/pnas.1010431108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Juers DH, Matthews BW, Huber RE. LacZ β-galactosidase: structure and function of an enzyme of historical and molecular biological importance. Protein Sci. 2012;21:1792–1807. doi: 10.1002/pro.2165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Rogers AWL, Tsolis RM, Bäumler AJ. Salmonella versus the microbiome. Microbiol. Mol. Biol. Rev. 2021;85:e00027–19. doi: 10.1128/MMBR.00027-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hughes ER, et al. Microbial respiration and formate oxidation as metabolic signatures of inflammation-associated dysbiosis. Cell Host Microbe. 2017;21:208–219. doi: 10.1016/j.chom.2017.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gupta S, Allen-Vercoe E, Petrof EO. Fecal microbiota transplantation: in perspective. Ther. Adv. Gastroenterol. 2016;9:229–239. doi: 10.1177/1756283X15607414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Wortelboer K, Nieuwdorp M, Herrema H. Fecal microbiota transplantation beyond Clostridioides difficile infections. EBioMedicine. 2019;44:716–729. doi: 10.1016/j.ebiom.2019.05.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Lee SM, et al. Bacterial colonization factors control specificity and stability of the gut microbiota. Nature. 2013;501:426–429. doi: 10.1038/nature12447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Martinson JNV, et al. Rethinking gut microbiome residency and the Enterobacteriaceae in healthy human adults. ISME J. 2019;13:2306–2318. doi: 10.1038/s41396-019-0435-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Woyke T, Doud DFR, Schulz F. The trajectory of microbial single-cell sequencing. Nat. Methods. 2017;14:1045–1054. doi: 10.1038/nmeth.4469. [DOI] [PubMed] [Google Scholar]
- 42.Domingo E, Perales C. Viral quasispecies. PLoS Genet. 2019;15:e1008271. doi: 10.1371/journal.pgen.1008271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Yamada C, et al. Molecular insight into evolution of symbiosis between breast-fed infants and a member of the human gut microbiome Bifidobacterium longum. Cell Chem. Biol. 2017;24:515–524.e5. doi: 10.1016/j.chembiol.2017.03.012. [DOI] [PubMed] [Google Scholar]
- 44.Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gao S, Bertrand D, Chia BK, Nagarajan N. OPERA-LG: efficient and exact scaffolding of large, repeat-rich eukaryotic genomes with performance guarantees. Genome Biol. 2016;17:102. doi: 10.1186/s13059-016-0951-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Gao S, Bertrand D, Nagarajan N. FinIS: improved in silico finishing using an exact quadratic programming formulation. Lect. Notes Comput. Sci. 2012;7534:314–325. [Google Scholar]
- 47.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 1303.3997v2 (2013).
- 48.Segata N, et al. Metagenomic microbial community profiling using unique clade-specific marker genes. Nat. Methods. 2012;9:811–814. doi: 10.1038/nmeth.2066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Franzosa EA, et al. Species-level functional profiling of metagenomes and metatranscriptomes. Nat. Methods. 2018;15:962–968. doi: 10.1038/s41592-018-0176-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Salter SJ, et al. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 2014;12:87. doi: 10.1186/s12915-014-0087-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Segata N, et al. Metagenomic biomarker discovery and explanation. Genome Biol. 2011;12:R60. doi: 10.1186/gb-2011-12-6-r60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Hawinkel S, Mattiello F, Bijnens L, Thas O. A broken promise: microbiome differential abundance methods do not control the false discovery rate. Brief. Bioinformatics. 2019;20:210–221. doi: 10.1093/bib/bbx104. [DOI] [PubMed] [Google Scholar]
- 53.Morton JT, et al. Establishing microbial composition measurement standards with reference frames. Nat. Commun. 2019;10:2719. doi: 10.1038/s41467-019-10656-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Inouye M, et al. SRST2: rapid genomic surveillance for public health and hospital microbiology labs. Genome Med. 2014;6:90. doi: 10.1186/s13073-014-0090-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Alcock BP, et al. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res. 2020;48:D517–D525. doi: 10.1093/nar/gkz935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kurtz S, et al. Versatile and open software for comparing large genomes. Genome Biol. 2004;5:R12. doi: 10.1186/gb-2004-5-2-r12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Wilm A, et al. LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res. 2012;40:11189–11201. doi: 10.1093/nar/gks918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Hinrichs AS, et al. The UCSC Genome Browser Database: update 2006. Nucleic Acids Res. 2006;34:D590–D598. doi: 10.1093/nar/gkj144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Pracana R, Priyam A, Levantis I, Nichols RA, Wurm Y. The fire ant social chromosome supergene variant Sb shows low diversity but high divergence from SB. Mol. Ecol. 2017;26:2864–2879. doi: 10.1111/mec.14054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Quinlan AR. BEDTools: the Swiss-Army tool for genome feature analysis. Curr. Protoc. Bioinformatics. 2014;47:11.12.1–34. doi: 10.1002/0471250953.bi1112s47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Spedicato G. Discrete time Markov chains with R. R J. 2017;9.2:84. [Google Scholar]
- 62.Cingolani P, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Hahsler M, Piekenbrock M, Doran D. dbscan: fast density-based clustering with R. J. Stat. Softw. 2019;91:1–30. [Google Scholar]
- 64.Galata V, Fehlmann T, Backes C, Keller A. PLSDB: a resource of complete bacterial plasmids. Nucleic Acids Res. 2019;47:D195–D202. doi: 10.1093/nar/gky1050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Ondov BD, et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 2016;17:132. doi: 10.1186/s13059-016-0997-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Quan S, et al. Adaptive evolution of the lactose utilization network in experimentally evolved populations of Escherichia coli. PLoS Genet. 2012;8:e1002444. doi: 10.1371/journal.pgen.1002444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Tsuchido T, VanBogelen RA, Neidhardt FC. Heat shock response in Escherichia coli influences cell division. Proc. Natl Acad. Sci. USA. 1986;83:6959–6963. doi: 10.1073/pnas.83.18.6959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Trubetskoy D, Proux F, Allemand F, Dreyfus M, Iost I. SrmB, a DEAD-box helicase involved in Escherichia coli ribosome assembly, is specifically targeted to 23S rRNA in vivo. Nucleic Acids Res. 2009;37:6540–6549. doi: 10.1093/nar/gkp685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Garoff L, Huseby DL, Praski Alzrigat L, Hughes D. Effect of aminoacyl-tRNA synthetase mutations on susceptibility to ciprofloxacin in Escherichia coli. J. Antimicrob. Chemother. 2018;73:3285–3292. doi: 10.1093/jac/dky356. [DOI] [PubMed] [Google Scholar]
- 70.Aponte RA, Zimmermann S, Reinstein J. Directed evolution of the DnaK chaperone: mutations in the lid domain result in enhanced chaperone activity. J. Mol. Biol. 2010;399:154–167. doi: 10.1016/j.jmb.2010.03.060. [DOI] [PubMed] [Google Scholar]
- 71.Mundhada H, et al. Increased production of l-serine in Escherichia coli through adaptive laboratory evolution. Metab. Eng. 2017;39:141–150. doi: 10.1016/j.ymben.2016.11.008. [DOI] [PubMed] [Google Scholar]
- 72.Conrad TM, et al. RNA polymerase mutants found through adaptive evolution reprogram Escherichia coli for optimal growth in minimal media. Proc. Natl Acad. Sci. USA. 2010;107:20500–20505. doi: 10.1073/pnas.0911253107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Li Y, et al. LPS remodeling is an evolved survival strategy for bacteria. Proc. Natl Acad. Sci. USA. 2012;109:8716–8721. doi: 10.1073/pnas.1202908109. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Isolate and shotgun metagenomic sequencing data is available from the European Nucleotide Archive (ENA; https://www.ebi.ac.uk/ena/browser/home) under project accession number PRJEB49334. Source data are provided with this paper.
Source code for scripts used to analyse the data are available in a GitHub project at https://github.com/CSB5/CPE-microbiome.