Skip to main content
mSystems logoLink to mSystems
. 2022 Jan 11;7(1):e01169-21. doi: 10.1128/msystems.01169-21

Specificities and Commonalities of Carbapenemase-Producing Escherichia coli Isolated in France from 2012 to 2015

Rafael Patiño-Navarrete a,b,#, Isabelle Rosinski-Chupin a,b,#, Nicolas Cabanel a,b, Pengdbamba Dieudonné Zongo a,b,c, Mélanie Héry a,d, Saoussen Oueslati a,d, Delphine Girlich a,d, Laurent Dortet a,d,e,f, Rémy A Bonnin a,d,f, Thierry Naas a,d,e,f,#, Philippe Glaser a,b,✉,#
Editor: Zackery Bulmang
PMCID: PMC8751382  PMID: 35014866

ABSTRACT

Carbapenemase-producing Escherichia coli (CP-Ec) represents a major public health threat with a risk of dissemination in the community as has occurred for lineages producing extended-spectrum β-lactamases. To characterize the extent of CP-Ec spread in France, isolates from screening and infection samples received at the French National Reference Center (F-NRC) laboratory for carbapenemase-producing Enterobacterales were investigated. A total of 691 CP-Ec isolates collected between 2012 and 2015 and 22 isolates collected before 2012 were fully sequenced. Analysis of their genome sequences revealed some disseminating multidrug-resistant (MDR) lineages frequently acquiring diverse carbapenemase genes mainly belonging to clonal complex 23 (CC23) (sequence type 410 [ST410]) and CC10 (ST10 and ST167) and sporadic isolates, including rare ST131 isolates (n = 17). However, the most represented sequence type (ST) was ST38 (n = 92) with four disseminated lineages carrying blaOXA-48-like genes inserted in the chromosome. Globally, the most frequent carbapenemase gene (n = 457) was blaOXA-48. It was also less frequently associated with MDR isolates being the only resistance gene in 119 isolates. Thus, outside the ST38 clades, its acquisition was frequently sporadic with no sign of dissemination, reflecting the circulation of the IncL plasmid pOXA-48 in France and its high frequency of conjugation. In contrast, blaOXA-181 and blaNDM genes were often associated with the evolution of MDR E. coli lineages characterized by mutations in ftsI and ompC.

IMPORTANCE Carbapenemase-producing Escherichia coli (CP-Ec) might be difficult to detect, as MICs can be very low. However, their absolute number and their proportion among carbapenem-resistant Enterobacterales have been increasing, as reported by WHO and national surveillance programs. This suggests a still largely uncharacterized community spread of these isolates. Here, we have characterized the diversity and evolution of CP-Ec isolated in France before 2016. We show that carbapenemase genes are associated with a wide variety of E. coli genomic backgrounds and a small number of dominant phylogenetic lineages. In a significant proportion of CP-Ec, the most frequent carbapenemase gene blaOXA-48, was detected in isolates lacking any other resistance gene, reflecting the dissemination of pOXA-48 plasmids, likely in the absence of any antibiotic pressure. In contrast, carbapenemase gene transfer may also occur in multidrug-resistant E. coli, ultimately giving rise to at-risk lineages encoding carbapenemases with a high potential of dissemination.

KEYWORDS: drug resistance evolution, emergence, genomics, multidrug resistance, surveillance studies

INTRODUCTION

Escherichia coli is one of the first causes of diverse bacterial infections in the community and in the hospital. In particular, it is the most frequent cause of urinary tract infection (UTI), and 50 to 60% of women will suffer at least one UTI during her life (1). Therefore, multidrug resistance (MDR) in E. coli is a major public health issue making E. coli infections more difficult to treat. In addition, as carbapenemase-producing Enterobacterales (CPE) are increasingly detected (2, 3) and E. coli is a ubiquitous member of the human gut microbiome, carbapenemase-producing E. coli (CP-Ec) is also becoming a major actor for the global dissemination of carbapenemase genes (4).

The emergence and spread of carbapenem-resistant Gram-negative bacteria are mainly linked to the widespread dissemination through horizontal gene transfer (HGT) of mobile genetic elements (MGEs) encoding carbapenemases. These carbapenemases belong to Ambler class A (mainly KPC type), class B (metallo-β-lactamases IMP, VIM, and NDM types) or class D (OXA-48-like enzymes) of β-lactamases (5). The global epidemiology of extended-spectrum β-lactamase (ESBL)-producing E. coli has been characterized through multiple studies, revealing in Western countries the major contribution of the sequence type 131 (ST131) lineage in the high prevalence of blaCTX-M family ESBL genes (6). Much less is known with respect to CP-Ec.

Since 2012, the French National Reference Center (F-NRC) laboratory for carbapenemase-producing Enterobacterales has experienced a steady increase in the number of CP-Ec isolates received each year (2). A multilocus sequence typing (MLST) analysis of isolates received in 2012 and 2013 revealed a broad diversity of sequence types (STs), as the 140 analyzed isolates belonged to 50 different STs. However, a few STs were overrepresented (7), such as the ST38 (24 isolates) and ST410 (10 isolates). In that study, only one isolate belonged to ST131, contrasting with the situation in the United Kingdom where ST131 isolates represented 10% of the CP-Ec isolates received between 2014 and 2016 by Public Health England (8). On the other hand, a genome-based survey of CPE in the Netherlands between 2014 and 2018 revealed that the 264 analyzed E. coli isolates belonged to 87 different STs, with three dominant lineages, ST38 (n = 46), ST167 (n = 22), and ST405 (n = 16) representing 32% of the isolates (9). F-NRC isolates also showed a predominance of isolates producing OXA-48-like carbapenemases, followed by NDM-producing isolates and suggested clonally related isolates among ST38 OXA-48-producing isolates and ST410 OXA-181-producing isolates, respectively (7).

Whole-genome sequencing (WGS) significantly increases our ability to infer phylogenetic relationships between isolates. By sequencing 50 ST410 CP-Ec isolates received by the F-NRC between 2011 and 2015 (10), we showed that 36 (72%) belonged to a newly described ST410 lineage producing OXA-181 (11). We showed that this clade is characterized by mutations in the two porin genes ompC and ompF leading to a decreased outer membrane permeability to certain β-lactams and by a four-codon duplication (YRIN) in the ftsI gene encoding the penicillin binding protein 3 (PBP3) leading to a decreased susceptibility to β-lactams targeting this PBP (10). After a thorough analysis of CP-Ec genome sequences from public databases for mutations in these three genes, we proposed that CP-Ec followed three different evolutionary trajectories. In some lineages which are enriched in CP-Ec isolates and have disseminated globally, acquisition of carbapenem resistance genes might have been facilitated by the mutations in porin genes and in ftsI. In ST38, the genetic background and in particular a specific ompC allele with reduced permeability to some β-lactams may have similarly facilitated the acquisition of carbapenemase genes. In contrast, other CP-Ec isolates, including isolates from ST131, might have occurred sporadically following the acquisition of plasmids encoding carbapenemase and with no clue of dissemination (10).

Here, we thoroughly characterized the diversity of CP-Ec circulating in France by sequencing the genomes of almost all isolates received by the F-NRC from its creation in 2012 until 2015 (see Table S1 in the supplemental material). By combining whole-genome phylogeny with the addition of E. coli genome sequences publicly available (Table S2) and tracking mutations in candidate genes, we show that three different situations are encountered. The high transmission potency of the IncL pOXA-48 plasmids has led to a high frequency of OXA-48-producing isolates, often characterized by susceptibility to most non-β-lactam antibiotics. On the other hand, an increasing number of CP-Ec lineages, mainly producing OXA-181 and NDM carbapenemases, are observed in France as in other European countries. These lineages are multidrug-resistant (MDR) or extensively drug-resistant (XRD) lineages, they show several mutations in quinolone resistance-determining regions (QRDRs) and are often mutated in ftsI and porin genes. Finally, the rapid dissemination of four OXA-48/OXA-244 ST38 lineages might have been favored by the chromosomal integration of the carbapenemase gene.

TABLE S1

Main characteristics of the F-NRC CP-Ec. Download Table S1, XLSX file, 0.1 MB (126.9KB, xlsx) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

TABLE S2

Main characteristics of genome sequences used for phylogenetic reconstructions. Download Table S2, XLSX file, 0.3 MB (266.1KB, xlsx) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

RESULTS

CP-Ec isolates collected until 2015 by the French National Reference Center.

Isolates analyzed in this work were sent to the F-NRC laboratory on a voluntary basis from private and public clinical laboratories from different parts of France mainly between the years 2012 and 2015. During this period, we encountered a regular increase in the number of isolates received each year (Fig. 1A) corresponding to an increasing number of isolated strains and an increasing frequency of isolates sent to the F-NRC. A total of 713 CP-Ec isolates, including 22 isolates collected from 2001 to 2011 were submitted to WGS. The 691 sequenced isolates of the 2012–2015 period represented 87.5% of the 790 CP-Ec isolates received by the F-NRC during this period.

FIG 1.

FIG 1

Origin of the isolates received by the F-NRC during the 2012–2015 period. (A) Absolute number of isolates per year from infection, screening, or unknown origins, differentiating between isolates that have or have not been sequenced. (B) Proportions of isolates from infection or from screening or unknown origins showing that isolates from infection origin represent the same proportions among total isolates that have been received by the F-NRC and isolates that have been sequenced.

The majority of the sequenced isolates (66.5%, 474/713) were from rectal swab screening of patients suspected of carrying CPE (patients repatriated from a hospital abroad, patients who have visited a foreign country within the last 6 months, contact patients of a former carrier, or a previously known carrier), 24% (172/713) were considered to be responsible for infection (isolated from clinical samples), and the source was unknown for 67 isolates (Fig. 1A; see Table S1 in the supplemental material). During the 4-year period, the number of infection-related isolates was found to increase more slowly than the number of screening or of unknown source isolates. A nearly twofold decrease in the proportion of clinical versus total isolates was observed between 2013 and 2014 (Fig. 1B). Of the 172 isolates associated with disease, 122 were isolated from urine (71%), 15 from blood samples, 8 from deep site samples, 8 from wound samples, 9 from the respiratory tract, 3 from vaginal samples, 3 from pus, 3 from bile, and 1 from the skin of a newborn. All these isolates were previously identified as carbapenemase producers by PCR.

Diversity of CP-Ec isolates as assessed by whole-genome sequencing.

The genome sequences of the 713 CP-Ec isolates were first analyzed following de novo assembly. For each isolate, we determined its MLST type (Achtman scheme), phylogroup (ClermonTyping), and antibiotic resistance gene (ARG) content. We also searched for mutations in the QRDRs of gyrA, parC, and parE and for mutations in ftsI, ompC, and ompF we previously identified as associated with CP-Ec disseminated lineages (10) (Table S1). F-NRC CP-Ec bacteria were assigned to 168 different STs, including six new allelic combinations. ST38 was the most prevalent ST (n = 92, 12.9%), followed by ST10 (n = 67, 9.4%), ST410 (n = 64, 9%), and ST167 (n = 34, 4.8%). Ten additional STs were represented by at least 10 isolates (see Fig. S1A and Table S1 in the supplemental material), and 154 STs were represented by less than 10 isolates, including 102 STs with a single isolate.

FIG S1

Per-year analysis of the origin of the isolates received by the F-NRC. (A) Proportions of the isolates belonging to the different STs. The total number of isolates is indicated within parentheses. The large proportion of ST90 isolates collected in 2012 is linked to a local outbreak. (B) Proportions of the isolates collected in screening and infection situations as a function of the year and phylogroup. The absolute number of isolates for each class is indicated inside the bars. Download FIG S1, PDF file, 0.02 MB (24.9KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

We next performed a core genome phylogeny following read mapping using strain E. coli MG1655 (NCBI reference sequence NC_000913.3) as the reference genome sequence. The phylogenetic tree, estimated from 372,238 core single nucleotide polymorphisms (SNPs) was consistent with the results of phylogroup determination by using in silico ClermonTyping (12) except for a few isolates (Fig. 2). In agreement with the MLST-based analysis, this tree showed a broad diversity of CP-Ec isolates belonging to the eight phylogroups and three dominant clades corresponding to CC10 (phylogroup A; including ST10, ST167, and ST617), CC23 (phylogroup C; including ST410, ST88, and ST90) and ST38 (phylogroup D) with 161, 97, and 98 isolates, respectively, accounting for 49.9% of the F-NRC CP-Ec isolates analyzed (Fig. 2). Phylogroup B2 isolates represented 11% of the total isolates (n = 80), and only 17 CP-ST131 isolates were identified. Fluctuations in the proportions of the main STs could be observed during the 4 years of the analysis, but no clear tendency could be identified (Fig. S1A).

FIG 2.

FIG 2

Core genome phylogeny of CP-Ec isolates received by the F-NRC. ML phylogeny of the 713 CP-Ec isolates was built with RAxML (45) from 372,238 core SNPs after sequence alignment on MG1655-K12 (NC_000913.3) selected as the best reference. The genome sequence of Escherichia fergusonii strain ATCC 35469 (NC_011740.1) was used as an outgroup for the phylogenetic analysis. Genomes from Touchon et al. (40) were also incorporated into the analysis. Genomic features are indicated as in the figure key (left) from the inside to the outside circles: carbapenemases of the OXA, NDM, and other types, CTX-M ESBL, mutations in gyrA and parC QRDR region (FQ resistance), origin, main ST, phylogroups. Reference genomes from Touchon et al. (40) are indicated as Touchon. NA, not available.

Infection-related and screening isolates were intermixed throughout the phylogeny (Fig. 2). However, an enrichment in infection-related isolates was observed in phylogroup C (Pearson’s chi-squared test, P < 0.02, degrees of freedom [df] = 1) and phylogroup B2 (Pearson’s chi-squared test, P < 0.0005, df = 1) (Fig. S1B). In phylogroup B2, 5 out of 9 ST127 isolates, 5 out of 17 ST131 isolates, and 7 out of 8 ST636 isolates were responsible for UTIs (Table S1).

Diversity of antibiotic resistance genes carried by CP-Ec isolates.

The number of acquired ARGs was found to vary from 1 to 26 among the 713 CP-Ec isolates. The median was higher in phylogroup C isolates (median, 16) and lower in isolates from phylogroup B2 (median, 2) compared to other phylogroups (medians for phylogroup A, B1, and D were 9, 8, and 8, respectively) (Fig. 3A). An ESBL of the CTX-M family was present in 40.7% (n = 290) of the isolates, with a predominance of blaCTX-M-15 gene (n = 205) (Table S1). Mutations in gyrA, parC and/or parE potentially leading to fluoroquinolone (FQ) resistance occurred in 425 CP-Ec isolates (59.6%), with mutations in gyrA, parC, and parE identified in 412, 309, and 261 isolates, respectively (Table S1). Up to five mutations in QRDRs were identified in 19 isolates, and 250 isolates had four mutations in QRDRs, suggesting they have been subjected to long-term evolution under antibiotic pressure, including FQ (Table S1). Globally, a higher number of mutations in QRDRs was associated with a higher number of resistance genes (Fig. 3B), furthering the link between the number of QRDR mutations and a likely evolution under antibiotic selective pressure for CP-Ec isolates.

FIG 3.

FIG 3

Resistance gene content of the F-NRC CP-Ec isolates. (A) Box plot representation of the ARG content according to the phylogroup. The number of isolates belonging to each phylogroup is indicated within parentheses. The limits of the box indicate the lower and upper quartile. Outliers are indicated by points above the maximum value. (B) Relationships between the number of ARGs and the number of mutations in QRDRs. In the box plot representations, the median is indicated by an horizontal bar, and the mean is indicated by a × symbol.

Four isolates from 2015, two ST648 isolates collected in the same hospital at 1-week interval, one ST216 isolate and one ST744 isolate, carried a mcr-9 gene conferring resistance to colistin (Table S1). Although not associated with infection, the four isolates were MDR, carrying 13 to 20 acquired ARGs in addition to mcr-9 and carbapenemase genes (blaVIM-1 for three of the isolates, blaNDM-1 and blaOXA-48 for one isolate). Determination of the MIC for colistin for these four isolates were at the resistance breakpoint (MIC = 2 mg/L).

Diversity of carbapenemase genes.

Carbapenemase genes identified by WGS (Table S1) were in agreement with molecular data collected by the F-NRC laboratory. The blaOXA-48 and blaOXA-181 genes were the most frequent carbapenemase genes detected in 464 (65%) and 101 (14.1%) E. coli isolates, respectively. Twenty-nine other isolates carried minor blaOXA-48-like genes (Table S1). blaNDM family genes were detected in 14.9% (106/713) of the CP-Ec isolates (blaNDM-5 in 49 isolates and blaNDM-1 in 41 isolates). Three additional blaNDM alleles were identified: blaNDM-7 (n = 10) including a variant coding for a NDM7-like carbapenemase with a S24G mutation, blaNDM-4 (n = 5), and blaNDM-6 (n = 1). Four different blaVIM alleles were detected in 18 isolates: blaVIM-1 (n = 8), blaVIM-4 (n = 8), blaVIM-2 (n = 1), and blaVIM-29 (n = 1). Only five E. coli isolates expressed blaKPC-2 (n = 2) or blaKPC-3 (n = 3) alleles. In 12 isolates, two different carbapenemase genes were found; blaOXA-48/blaNDM-1 was the most frequent cooccurrence (n = 5) (Table S1).

The analysis of the number of additional ARGs and of mutations in QRDR regions showed that, compared with other carbapenemase genes, the presence of blaOXA-48 was frequently associated with less resistant isolates (Fig. 4A and B). In particular, it was the only ARG in 117 isolates of the 457 (24.5%) encoding this carbapenemase, including 23 infection-related isolates. Of those isolates, only five showed mutations in QRDRs. Conversely, only two blaNDM-1 gene-carrying isolates of 42 isolates and one blaNDM-5 gene-carrying isolates of 49 isolates carried no other ARG.

FIG 4.

FIG 4

Carbapenemase gene content of the F-NRC CP-Ec isolates. (A) Relationship between the carbapenemase allele and the ARG content. In the box plot representation, the median is indicated by an horizontal bar and the mean by a × symbol. The limits of the box indicate the lower and upper quartile. Outliers are indicated by points above the maximum value. (B) Number of isolates carrying a specific carbapenemase allele according to the number of QRDR mutations in their genomes. (C) Per-year evolution of the percentage of isolates carrying the different carbapenemase alleles in the 2012–2015 period. The year of isolation is indicated as in the figure key (right). Given their small number (n = 22), strains isolated before 2012 are not indicated.

A temporal analysis of the proportion of isolates displaying blaOXA-48 did not reveal significant variations (Pearson’s chi-squared test, P = 0.05, df = 3) during the 2012–2015 period (Fig. 4C). In contrast, the frequency of blaOXA-181 relative to other alleles was found to significantly increase from 0% in 2012 to 18.9% in 2015 (Pearson’s chi-squared test, P < 0.0002, df = 3) (Fig. 4C).

We observed some association between the carbapenemase gene and the ST. Of the 22 STs with at least seven F-NRC CP-Ec isolates, 14 predominantly displayed blaOXA-48 (Fig. 5A); blaOXA-48 was also predominant in STs represented by six or less isolates. In three lineages, ST410, ST940, and ST1284, blaOXA-181 was the predominant allele. These three STs grouped 74% of the blaOXA-181 gene carrying E. coli isolates from the F-NRC, and ST410 accounted for 50.5% of them. Finally, blaOXA-204 was the most prevalent allele in ST90, and blaNDM-5 was the predominant allele in ST636.

FIG 5.

FIG 5

Features of the main STs associated with CP-Ec isolates in France. At the top of the figure, for each ST, the phylogroup, the total number of isolates, and the number of isolates associated with urinary tract infections are indicated. Intermediate (Inter) and minor ST indicate ST represented by four to six isolates and one to three isolates, respectively. (A) Proportion of the isolates according to the carbapenemase allele in each ST. (B) Box plot representation of the ARGs content according to the ST. (C) Proportion of isolates according to the number of mutations in QRDR. (D) Proportion of isolates with a specific PBP3 allele. PBP3 characterized by a YRIP, YRIK, or YRIN insertion between positions 333 and 334, the YRIN insertion is generally associated with a A413V mutation (YRIN-L) and less frequently also with a E349K mutation (YRIN-K-L) (10). (E) Proportion of isolates carrying specific mutations in ompC: R195L, G138D or acquisition of an ST38-like ompC allele by recombination (10).

Characteristics of the STs preferentially associated with F-NRC CP-Ec isolates.

Eight of the 14 STs with more than 10 CP-Ec isolates (ST410, ST167, ST940, ST405, ST617, ST90, ST1284, and ST101) were characterized by a larger number of ARGs (median ≥ 10) and a larger number of mutations in QRDRs compared to ST38, ST10, ST131, ST69, ST88, and ST represented by less than seven isolates (Fig. 5B and C). ST648 isolates have an intermediate position, being highly mutated in QRDRs but less rich in ARGs. Analysis of CP-Ec isolates from the F-NRC for polymorphisms in ftsI revealed that 131 (18.4%) had a 4-amino-acid (aa) insertion between positions 333 and 334 of PBP3, the insertion being particularly prevalent in ST101, ST167, ST410, and ST1284 (Table S1 and Fig. 5D). Similarly, ompC alleles we previously characterized as modifying susceptibility to antibiotics were more prevalent in some STs: an ST38-like ompC allele resulting from a recombination event or a G138D mutation was present in 33 and 26 isolates belonging to six different STs, whereas the R195L mutation was observed in 50 isolates all from ST410 (Table S1 and Fig. 5E). In addition, 144 isolates belonging to phylogroups D and F possessed the ST38-like allele inherited vertically. On the other hand, the ompF gene was pseudogeneized in 47 isolates (Table S1).

To analyze the phylogenetic relationships between F-NRC Cp-Ec and other Cp-Ec isolates collected worldwide, we built ST-based maximum likelihood trees for the 14 STs with more than 10 isolates and included to this analysis genome sequences publicly available (Table S2). Together with the analysis of QRDR mutations and mutations in ftsI and ompC, this showed that 51 (80%) of the ST410 (Fig. S2A), 17 (49%) of the ST167 (Fig. S2B), 3 (33%) of the ST405 (Fig. S2C), and 8 (62%) of the ST101 (Fig. S2D) F-NRC CP-Ec isolates belonged to internationally disseminating MDR subclades enriched in CP we previously identified (10). Subclades characterized by mutations in ftsI and ompC mainly expressed blaOXA-181 (ST410) or blaNDM (other lineages). Other isolates belonging to these STs were dispersed on their respective phylogenies and with no sign of clonal dissemination for most of them.

FIG S2

Core genome phylogenies of the main STs characterized by clades disseminating internationally. (A) ST410; (B) ST167; (C) ST405; (D) ST101. Phylogenies were based on genome sequences of 64 CP-Ec isolates from the F-NRC and 146 genome sequences from the NCBI database (A), 35 CP-Ec isolates from the F-NRC and 134 genome sequences from the NCBI database (B), 15 CP-Ec isolates from the F-NRC and 145 genome sequences from the NCBI database (C), and 13 CP-Ec isolates from the F-NRC and 194 genome sequences from the NCBI database (D). Core genome (ST410, 3,522,000 nt; ST167, 3,276,000 nt; ST405, 3,685,000 nt; ST101, 3,774,000 nt) alignments of the de novo assemblies on the sequences of GCA_001442495.1 (ST410), GCA_003028815.1 (ST167), GCA_002142675.1 (ST405), and GCA_002163655.1 (ST101) used as reference were performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 6,176 (ST410), 10,393 (ST167), 37,482 (ST405), and 19,967 (ST101) core SNPs after removing recombined regions with gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR93E7 (ST88), CNR93D10 (ST746), CNR73I9 (ST115), and CNR93I2 (ST906) were used as outgroups for the phylogenetic analyses of ST410, ST167, ST405, and ST101, respectively. The origin from the F-NRC is indicated by red triangles close to the isolate name. Other genomic features are indicated as indicated in the figure key (left) from the inside to the outside lines: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance), number of ARGs, mutations in ftsI, ompC, ompF; and geographical origin. Download FIG S2, PDF file, 0.9 MB (946KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

ST38 corresponded to the most represented ST (n = 92) among the studied CP-Ec. The phylogenetic reconstruction together with 314 additional nonredundant CP-Ec genome sequences retrieved from EnteroBase (http://enterobase.warwick.ac.uk/) and 150 non-CP isolates from NCBI provided evidence that 80.4% (n = 67) of the French isolates clustered into four clades (Fig. 6). Three of the clades, G1 (n = 35), G2 (n = 25), and G4 (n = 4) only contained isolates encoding blaOXA-48, while G3 (n = 7) included isolates expressing blaOXA-48 or its single nucleotide derivative blaOXA-244. All but one isolate in G1 expressed blaCTX-M-24, while isolates of G3 expressed blaCTX-M-14b. The phylogenetic analysis provided evidence of worldwide dissemination of G1, G3, and G4 clades and multiple introduction in France. Strikingly, among the 28 isolates of the G2 clade, 24 were collected in France and four in the Netherlands, suggesting at this time a more regional dissemination. In none of the isolates of the four clades could an IncL plasmid, that generally carries blaOXA-48 (13), be identified by PlasmidFinder (14). It suggests a chromosomal integration as previously shown among ST38 isolates collected in the United Kingdom (15). Analysis of representative isolates of the G1 and G3 cluster (G1, GCA_005886035.1; G3, GCA_004759025.1), whose genome sequences were completely assembled, confirmed that blaOXA-48 and blaOXA-244, in these two isolates, respectively, were chromosomally integrated. To assess the genetic support of blaOXA-48 in the two other clades, we have determined the complete genome sequence of two isolates belonging to G2 (CNR65D6) and G4 (G4:CNR85I8) by combining long-read Pacbio and Illumina sequencing and found that both possess a chromosomally inserted blaOXA-48 gene. pOXA-48 plasmid sequences of various lengths were cointegrated with blaOXA-48, the whole sequence being surrounded by IS1 insertion sequences with different insertion sites for the four clades (Fig. 7). BlastN analyses on the assembled Illumina sequences of other isolates of each clade showed that isolates of the same clade share the same insertion sites (see Materials and Methods). A fifth clade, composed of seven closely related French isolates corresponded to a possible outbreak east of Paris, France, between December 2014 and December 2015. These isolates are predicted to be highly resistant, as they are carrying in addition to blaVIM-4, blaCTX-M-15, as well as 15 to 18 additional ARGs and five mutations in QRDRs.

FIG 6.

FIG 6

Core genome phylogeny of ST38 isolates. ML Phylogeny was based on genome sequences of 92 CP-Ec from the F-NRC and 464 genome sequences retrieved from EnteroBase and from the NCBI database, including 331 carrying a carbapenemase gene. A core genome (2,900,000-nt) alignment of the de novo assemblies on the sequence of GCA_005886035.1 used as a reference was performed by using Parsnp (47); ML phylogeny was built with RAxML (45) from 6,170 core SNPs after removing recombined regions with Gubbins (48). The genome sequence of CNRC6O47 (ST963) was used as an outgroup for the phylogenetic analysis. F-NRC isolates are indicated by red triangles (inner circle). Other genomic features are indicated as indicated in the figure key (left) from the inside to the outside circles: carbapenemases of the OXA, NDM, and other types; absence of any plasmid of any Inc type as identified by using PlasmidFinder (14); ESBL of the CTX-M types; number of mutations in gyrA and parC QRDRs (FQ resistance); number of ARGs; mutations in ftsI, ompC, and ompF; and geographical origin. The four OXA-48-like clades (G1, G2, G3, and G4) clustering most French isolates are colored in blue, red, violet, and green. A fifth clade (G5) corresponding to a possible outbreak of VIM-4 isolates in east of Paris is colored in brown.

FIG 7.

FIG 7

blaOXA-48 integration sites of ST38 CP-Ec clades 1, 2, 3, and 4. The genome sequences of GCA_005886035.1 (CP040390.1), CNR65D6, GCA_004759025.1 (CP038505.1), and CNR85I8 were chosen as representative genomes of clusters 1, 2, 3, and 4, respectively, and aligned, by using BlastN, against the sequence of GCA_001900295.1 (CP010116.1), an ST38 isolate that does not carry a carbapenemase gene and whose genome sequence was complete. Gene annotations were derived from strain MG1655 (NC_000913.3). The regions surrounding the blaOXA-48 insertion sites in each clade were more precisely aligned against the corresponding sequences of CP010116.1 and of representative genomes of the other clades by using Easyfig (52). blaOXA-48 is represented by a brown arrow, cointegrated genes of pOXA-48 origin are indicated by light green arrows, and the IS1 transposase is indicated by a green arrow. Sequences used to perform the BLAST analysis on Illumina contigs are underlined in red. It is interesting to note that, in the G3 clade, blaOXA-48 integration occurs in a genomic island inserted at tRNASec. Another genomic island is present in G1-type strains but does not carry blaOXA-48.

ST10 E. coli isolates are commensals of a variety of mammals and bird species (16), and studies in different contexts have shown that this ST is also associated with ARG carriage (17, 18). The 67 F-NRC ST10 CP-Ec isolates were distributed throughout the ST10 phylogeny. Nonetheless, 31 isolates belonged to a clade enriched with carbapenemase producers (Fig. S3A, in green) that mainly expressed blaOXA-48, with the exception of three isolates expressing blaOXA-181, three expressing blaNDM-5, and one expressing blaVIM-4. Two of the blaOXA-181-expressing isolates and the three blaNDM-5-expressing isolates were closely related, shared a CTX-M-27 ESBL, four mutations in QRDRs, and a ftsI allele with a 4-aa YRIN insertion (Fig. S3A, in red).

FIG S3

Core genome phylogenies of ST10 and ST131 isolates. (A) ST10 phylogeny based on the genome sequences of 67 CP-Ec isolates from the F-NRC and 153 genome sequences from the NCBI. (B) ST131 phylogeny based on the genome sequences of 17 CP-Ec isolates from the F-NRC and 462 genome sequences from the NCBI. Core genome (ST10, 813,000 nt; ST131, 1,641,000 nt) alignments of the de novo assemblies on the sequence of MG1655 (ST10) or GCA_000285655.3 (ST131) used as references were performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 20,245 (ST10) and 14,366 (ST131) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR93D10 (ST746) and CNRAL47G10 (ST640) were used as outgroups for the phylogenetic analyses of ST10 and ST131, respectively. F-NRC isolates are indicated by red triangles (inner circle). Other genomic features are indicated as indicated in the figure key (left) from the inside to the outside circles: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance), number of ARGs, mutations in ftsI, ompC, and ompF; and geographical origin. The ST10 clade enriched in CP-Ec is indicated in green and the subclade with the YRIN insertion in PBP3 is indicated in red. Note that the circle for ftsI mutations is absent in ST131, as no YRIN-like duplication in PBP3 was identified among the analyzed isolates. Download FIG S3, PDF file, 1.3 MB (1.4MB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

ST131 isolates are major contributors for extraintestinal infections and the main factor responsible for the dissemination of CTX-M-15 ESBL (19, 20). ST131 evolution has been thoroughly analyzed, and four main lineages A, B, C1, and C2 have been described (21). We identified only 17 ST131 CP-Ec isolates received by the F-NRC (Fig. 2), 10 belonged to lineage A, while the others were scattered in the three other lineages (Fig. S3B). The main carbapenemase associated with these isolates was OXA-48 (n = 10). We also did not observe any association with the carriage of blaCTX-M genes, as only four ST131 CP-Ec isolates carried a blaCTX-M-15 gene. These results are in accordance with the United Kingdom survey (8) and our previous observations from sequences retrieved from public databases (10).

ST940 (phylogroup B1) and ST1284 (phylogroup A) represented 2.4% (n = 17) and 1.9% (n = 14) of the sequenced isolates, while they were poorly represented in sequence databases. For instance, in May 2021, only 83 and 103 genome sequences could be retrieved from EnteroBase for these STs, respectively, but with 43% (n = 36) and 22% (n = 22) carrying a carbapenemase gene, respectively. This suggested that ST940 and ST1284 were associated with the dissemination of carbapenemase genes. Phylogenetic trees were drawn by adding the nonredundant sequences from EnteroBase to those of the F-NRC and NCBI. For ST940 (Fig. S4A), this revealed a well-differentiated subclade of 18 isolates from Asia, Australia, and Europe carrrying blaNDM-5 and sharing mutations in ftsI and ompC. Isolates from the F-NRC were distributed on the tree, showing that the overrepresentation of ST940 in France did not result from local outbreaks. Three F-NRC isolates belonged to a second smaller subclade of blaNDM-expressing isolates, also mutated in ftsI and ompC. In contrast, in ST1284, 13 isolates carrying blaOXA-181 were found to be closely related (Fig. S4B). Metadata analysis showed that 12 of them were recovered from the same health facility in the Paris suburb during a 1-month period in 2015, demonstrating a local outbreak origin. A fourteenth unrelated isolate expressed blaNDM-5. The F-NRC isolates belonged to a clade characterized by four mutations in QRDR determinants, a YRIN duplication in PBP3, a G137D mutation in OmpC and containing additional OXA-181- or NDM-producing isolates.

FIG S4

Core genome phylogeny of ST940 and ST1284 isolates. (A) ST940 phylogeny based on genome sequences of 17 CP-Ec isolates from the F-NRC and 88 genome sequences from EnteroBase and the NCBI database. (B) ST1284 phylogeny based on genome sequences of 14 CP-Ec isolates from the F-NRC and 60 genome sequences from EnteroBase and the NCBI database. A core genome (ST940, 3,639,000 nt; ST1284, 3,779,000 nt) alignment of the de novo assemblies on the sequence of ST940: ESC_LB2149 and ST1284: ESC_LB2152AA (from EnteroBase) used as reference was performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 13,859 (ST940) and 2,009 (ST1284) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR98G1 (ST3022) and MG1655 (ST10) were used as outgroups for the phylogenetic analyses of ST940 and ST1284, respectively. F-NRC isolates are indicated by red triangles in thefirst column on the left. Other genomic features are as indicated in the figure key (left) from the left to the right columns: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance); number of ARGs; mutations in ftsI, ompC, and ompF; and geographical origin. Download FIG S4, PDF file, 0.3 MB (362.8KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Of the 15 ST90 CP-Ec isolates, 9 encoding the blaOXA-204 allele were isolated in hospitals east of Paris between August 2012 to April 2013 and suspected to be associated with the use of a contaminated endoscope (22). Isolates closely related to this outbreak reappeared in 2014 (three times) and in 2015 (twice), also in hospitals located east of Paris. They belonged to a MDR clade characterized by four mutations in QRDRs (Fig. S5A). Finally, in the four last STs with more than 10 CP-Ec isolates (ST69, ST88, ST617, and ST648), French CP-Ec isolates were distributed throughout their respective phylogenetic trees (Fig. S5B and S5C and Fig. S6A and S6B) with no sign of clonal dissemination, except for a small cluster of five ST69 isolates from two geographical regions and expressing blaOXA-48.

FIG S5

Core genome phylogenies of ST69, ST88, and ST90 isolates. (A) ST90 phylogeny based on the 15 CP-Ec isolates from the F-NRC and 38 genome sequences from the NCBI database. (B) ST69 phylogeny based on the 16 CP-Ec isolates from the F-NRC and 244 genome sequences from the NCBI database. (C) ST88 phylogeny based on the 15 CP-Ec isolates from the F-NRC and 99 genome sequences from the NCBI database. Core genome (ST69, 2,735,000 nt; ST88, 3,472,000 nt; ST90, 3,801,000 nt) alignments of the de novo assemblies on the sequence of GCA_002443135.1 (ST69), GCA_002812685.1 (ST88), or GCA_001900635.1 (ST90) used as reference were performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 4,596 (ST69), 12,557 (ST88), or 2,878 (ST90) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR33D9 (ST394), CNR83B9 (ST410), and CNR88B9 (ST847) were used as outgroups for the phylogenetic analyses of ST69, ST88, and ST90, respectively. F-NRC isolates are indicated by red triangles (first column on the left). Other genomic features are indicated as indicated in the figure key (left) from the left to the right columns: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance); number of ARGs; mutations in ftsI, ompC, and ompF; and geographical origin. Download FIG S5, PDF file, 0.4 MB (465.4KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S6

Core genome phylogeny of ST617 and ST648 isolates based on genome sequences of ST617 genome sequences of 15 CP-Ec isolates from the F-NRC and 65 genome sequences from the NCBI database (A) and ST648genome sequences of 14 CP-Ec isolates from the F-NRC and 126 genome sequences from the NCBI database (B). A core genome (ST617, 3,407,000; ST648, 3,501,000 nt) alignment of the de novo assemblies on the sequences of GCA_002142695.1 (ST617) or GCA_004138645.1 (ST648) used as reference was performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 15,030 (ST617) or 3,782 (ST648) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR93D10 (ST746) and CNR71A8 (ST1485) were used as outgroups for the phylogenetic analyses of ST617 and ST648, respectively. F-NRC isolates are indicated by red triangles close to the isolate name. Other genomic features are indicated as indicated in the figure key (left) from the left to the right columns: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance); number of ARGs; mutations in ftsI, ompC, and ompF; and geographical origin. Download FIG S6, PDF file, 0.4 MB (429.4KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

In addition to potential outbreaks detected in ST38 (blaVIM-4), ST90 (blaOXA-204), and ST1284 (blaOXA-181), we also found evidence for another potential outbreak among isolates belonging to ST359. The five isolates carried blaOXA-48 and blaCTX-M-32 and had three mutations in QRDRs. They were isolated during January 2014, four of them in the southeastern part of France and one in the Parisian suburbs.

DISCUSSION

The prevalence of CP-Ec is increasing worldwide. However, whether this reveals conjugation event in E. coli isolates by circulating CP-encoding plasmids or the emergence and dissemination of at-risk clones is still largely unknown. Here, we have analyzed, by using WGS, 691 (87.8%) of the CP-Ec isolates received by the F-NRC during the2012–2015 period and 22 isolates from the Bicêtre hospital strain collection (2001 to 2011) to characterize both the diversity of carbapenemase genes and of CP-Ec isolates in France. Altogether, 713 CP-Ec isolates were sequenced, representing to our knowledge the most extended collection of CP-Ec isolates sequenced published to date.

The number of CP-Ec isolates sent to the F-NRC was found to strongly increase during the 4-year period of analysis, which might be a consequence of an increased circulation of CP-Ec in France but also of an increased screening of potential CPE carriers at their admission to the hospital. Indeed, while the number of infection-related CP-Ec isolates was regularly increasing, their proportion compared to the total number of received and sequenced isolates decreased by twofold, with a clear change observed between 2013 and 2014 (Fig. 1). This is likely a consequence of the implementation of the recommendations on MDR screening of the French Public Health Advisory Board 2013 (23). However, as CP-Ec isolates were sent on a voluntary basis by clinical laboratories, we cannot exclude some sampling bias.

The most predominant carbapenemase allele was blaOXA-48 (65%), followed by blaOXA-181 (14.1%) which detection constantly increased from no cases in 2012 to 73 in 2015. Next was blaNDM-5 that was found to progressively substitute to the blaNDM-1 gene in the frequency of detection (Fig. 4). The predominance of blaOXA-48-like genes and particularly of blaOXA-48, among CP-Ec was also noted in other studies (9, 24, 25) although in lower proportions. The small number of blaKPC genes detected in these studies is in contrast with the situation reported in Klebsiella pneumoniae for many European countries (26). However, blaOXA-48 was found to be the most prevalent allele in K. pneumoniae in France (2) and in the Netherlands (9). This difference might result from a lower capacity for blaKPC-carrying plasmids to conjugate to E. coli compared to pOXA-48 or to a higher fitness cost. It might also be linked to sampling differences, as different thresholds of carbapenem susceptibility might have been used to collect CRE isolates among studies. Indeed, OXA-48-like carbapenemases are generally associated with lower levels of carbapenem resistance than KPC. This analysis was performed on isolates collected between 2012 and 2015 and revealed early trends in the evolution of CP-Ec isolated in France. It will be essential in the characterization of the genomic evolution of CP-Ec in more recent years, which remains to be performed.

The number of ARGs identified in the CP-Ec isolates from the F-NRC varied from 1 (the carbapenemase gene only) to 26, showing that carbapenemase genes were acquired not only in MDR isolates but also in an E. coli population that can be considered “naive” relative to evolution under antibiotic selective pressure and resident of the intestinal microbiota. This was particularly true for isolates producing blaOXA-48 that were generally associated with a lower number of resistance genes than isolates producing other carbapenemases, irrespective of their clinical status (Fig. 4A). In particular, 118 out of the 121 (97.5%) isolates with no other ARG than the carbapenemase gene carried blaOXA-48. Therefore, the predominance of blaOXA-48 among the carbapenemase genes identified in the F-NRC collection might mainly rely on the higher conjugative transfer rate of pOXA-48-related plasmids compared to the rates of plasmids carrying genes encoding other carbapenemases (27). pOXA-48 has indeed been shown to rapidly conjugate among Enterobacterales in hospitalized patients (28). This could also explain the high frequency of blaOXA-48-carrying isolates belonging to ST10, ST69, ST88, ST127, ST12, and ST58 (Fig. 5) that carried a small number of resistance genes and QRDR mutations and among which four were previously identified among the most frequent E. coli STs characterized from fecal samples (29).

On the other hand, MDR CP-Ec isolates may result from the transfer of carbapenem resistance genes into isolates already selected through multiple exposures to antibiotic treatments that have already acquired multiresistance plasmids or chromosomal mutations increasing their intrinsic drug resistance, such as mutations in QRDRs, in ftsI and/or porin genes. Alternatively, the circulation of carbapenem-resistant lineages, such as the OXA-181-producing ST410 E. coli lineage, showing a stable association between the lineage and the resistance gene, may also occur and significantly contribute to the number of CP-Ec isolates collected. Discriminating between both alternatives would require a more extended comparison of plasmids carried in these lineages after long-read sequencing similarly to what been done in K. pneumoniae (30). Strikingly, the analysis of ftsI alleles coding for a PBP3 with the 4-amino-acid duplication revealed a contrasted situation among CP-Ec isolates collected in France, with the duplication in ftsI found in 72.3%, 77.8%, and 67% of the isolates carrying blaNDM-5 (34/47), blaNDM-7 (7/9), and blaOXA-181 (68/101) genes but in only 0.7% (4/458) of the isolates carrying blaOXA-48. We previously proposed that the fixation of mutations reducing the intrinsic susceptibility to carbapenems might have favored the efficient conjugative transfer of plasmids carrying the carbapenemase genes from other CPE species in the gut (10). Transconjugants would be selected in the context of low biliary excretion of carbapenems or other β-lactams after parenteral administration. Increasing the proportion of the donor and recipient bacteria would therefore be less necessary for plasmids with high conjugative rates, such as pOXA-48 (27). Alternatively, these mutations might also have favored plasmid maintenance by increasing the resistance level and selection of the isolates during β-lactam or carbapenem treatment. Of particular concern in France are the OXA-181-producing ST410 and the NDM (NDM-1, -5, and -7) ST167 lineages that significantly contribute to the circulation of carbapenem resistance. However, our study also reveals smaller lineages, that although less frequently encountered in international studies, are nevertheless circulating. In contrast, we did not obtain evidence for clonal dissemination of CP-Ec ST131 lineages derived from the B2 clade responsible for blaCTX-M-15 gene dissemination (19, 20).

Despite their high frequency among the F-NRC collection, the ST38 isolates do not fall into one of the two previous categories. Indeed, while these isolates mainly express blaOXA-48 or its single point mutant derivative blaOXA-244 (31), our phylogenetic analysis revealed that a majority of them belong to four different lineages, including one mostly associated with rapid dissemination in France. In contrast with OXA-181- and NDM-producing lineages previously described, these lineages are associated with a moderate number of ARGs and QRDR mutations. None of the ST38 isolates is mutated in ftsI; however, we previously demonstrated that all are characterized by an ompC allele that encodes a porin with a reduced permeability to certain β-lactams and has disseminated into unrelated lineages by homologous recombination (10). A common feature of the four lineages is the integration of part of the pOXA-48 plasmid carrying blaOXA-48. This could have reduced a fitness cost of this plasmid and facilitated the clonal expansion of these lineages. However, ST38 isolates were identified as a infrequent colonizer of the gastrointestinal (GI) tract (29, 32). Therefore, additional features of these CP-Ec ST38 lineages might have contributed to their dissemination.

In conclusion, by analyzing genome sequences of CP-Ec isolates collected by the F-NRC, we showed that MDR lineages, enriched in carbapenemase-producing isolates are circulating in France, and that some of them, such as ST1284 isolates, are associated with outbreaks. The results of our study also suggest that the evolutionary trajectory may depend on the carbapenemase gene, blaOXA-181 or blaNDM genes being more frequently associated with the evolution of MDR E. coli lineages characterized by mutations in ftsI and ompC. Surveillance of these mutations may be an important parameter in controlling the circulation of MDR lineages. On the other hand, carbapenemase genes are also frequently acquired through plasmid dissemination from other bacterial species. Depending on the resistance background of the receiver E. coli, this may lead to XDR or to isolates sensitive to a broad range of antibiotics. Finally, we also observed a strong and rapid dissemination of ST38 isolates that might have been favored by a reduced susceptibility to carbapenems linked to the ST38 ompC allele and by the chromosomal integration of the carbapenemase gene. These results strengthen our model of different evolutionary trajectories associated with the gain of carbapenemase genes (10). They also show that systematic genome sequencing of CP-Ec and on a larger scale, CPE isolates, irrespective of their clinical or resistance status, is able to provide useful information not only on the circulation of MDR lineages but also on the propagation of resistance genes through horizontal gene transfer.

MATERIALS AND METHODS

Isolate collection and sequencing.

CP-Ec isolates analyzed in this study were collected by the F-NRC, mainly between the years 2012 and 2015. Twenty-two additional isolates we received before the creation of the F-NRC in 2012 were included in the analysis. Information on these isolates such as the year of isolation, region and department in France, and summary of their genomic features are reported in Table S1 in the supplemental material.

Whole-genome sequencing and analyses.

DNA was extracted by using the Qiagen Blood and Tissue DNA easy kit. Sequencing libraries were constructed by using the Nextera XT kit (Illumina) following the manufacturer’s instructions and sequenced with the Illumina HiSeq2500 or NextSeq500. FastQ files were trimmed for adaptors and low-quality bases (setting the minimum base quality threshold to 25) with the Cutadapt fork Atropos (33). De novo assemblies were generated from the trimmed reads with SPAdes v3.12.0 (34), using k-mer sizes of 51, 71, 81, and 91, the coverage cutoff option was set to “auto” and the “careful” option was activated. QUAST v2.2 (35) was used to assess the assembly quality, and contigs shorter than 500 bp were filtered out for phylogenetic analyses. The ST38 isolates CNR65D6 and CNR85I8 were sequenced to completion by using the long-read Pacbio technology. PacBio reads were assembled with the RS_HGAP_Assembly.3 protocol from the SMRT analysis toolkit v2.3 (36) and with Canu (37), polished with Quiver (36), and manually corrected by mapping Illumina reads using Breseq (38). Assembled genomes were annotated with Prokka v1.9 (39). To analyze the F-NRC CP-Ec isolates belonging to the main ST recovered during the analysis in a more global context, their genome sequences were combined to the assembled genome sequences from the same ST retrieved from the NCBI database (July 2019) and in some STs from EnteroBase (http://enterobase.warwick.ac.uk/, April 2021). Retrieved genomes were annotated in the same way as the genome sequences of the F-NRC isolates (Table S2).

To generate a core genome phylogeny tree of all isolates, the best reference genome was first selected among a set of 18 genomes analyzed by Touchon et al. (40) using the software refRank (41) and three subsets of 100 randomly selected sequences from our study as input. This led to the selection of the genome sequence of strain MG1655-K12 (NC_000913.3) as the reference for read mapping and SNP identification. Sequence reads for the 17 reference genomes (40) as well as for the genome sequence of Escherichia fergusonii strain ATCC 35469 (NC_011740.1), used as an outgroup, were simulated with ART (42). Trimmed sequencing reads were mapped against the MG1655-K12 genome with BWA-MEM algorithm of the BWA v0.7.4 package (43). For SNP calling, the Genome Analysis Toolkit (GATK) v3.6.0 (44) was used with the following criteria, a minimum depth coverage (DP) of 10, a quality by depth (QD) bigger than 2, a fisher strain bias (FS) below 60, a root mean square of the mapping quality (MQ) above 40, and the mapping quality rank sum test (MQRankSum) and the read position rank sum test (ReadPosRankSum) greater than −12.5 and −8, respectively. A maximum likelihood tree was estimated with RAxML v8.28 (45) using core genome SNPs after removing positions in the accessory genome identified with the filter_BSR_variome.py script from the LS-BSR pipeline (46).

For STs with more than 10 CP-Ec isolates from the F-NRC, a core genome alignment was generated with Parsnp (47), by using a finished genome sequence as reference. A closely related isolate outside the ST lineage was selected from the global phylogenetic tree, including all F-NRC isolates and used as an outgroup to root the ST phylogenetic trees. Maximum likelihood (ML) trees were generated with RAxML v8.28 (45) after removing regions of recombination with Gubbins (48). All graphic representations were performed by using ITOL (49).

MLST type, phylogroup, resistance gene, and plasmid replicon identification.

Sequence type was assigned to each assembly through a python script that relies on BLAST (10). Antibiotic resistance genes (ARGs) and mutations in gyrA, parC, and parE quinolone resistance determining regions (QRDRs) were identified with Resfinder 4.0 and PointFinder (50) run locally, respectively. The scripts and database (retrieved 4 January 2021) were downloaded from the repositories of the Centre for Genomic Epidemiology (https://bitbucket.org/genomicepidemiology/). The identified ARGs were manually reviewed to eliminate potential redundant ARGs predicted at the same genomic position. mdf(A) that is present in E. coli core genome was not taken into account. Phylogroups were assigned by using EzClermont (51) run locally. Plasmid replicons were identified with plasmidfinder run on each assembly (14). ftsI, ompC, and ompF coding sequences (CDS) and ompF promoter sequences were identified by using BlastN. Translated or nucleotide sequences were clustered by cd-hit (cd-hit-v4.8.1) with an amino acid (FtsI, OmpC, OmpF) or nucleotide sequence (ompF promoter) identity threshold of 1. Sequences of each cluster were aligned to detect mutations in regions of interest: 4-amino-acid insertions between P333 and Y334 and E349K and I532L mutations (FtsI), mutation modifying charge in L3 constriction loop, R195L mutation, nonsense or frameshift mutations or OmpC sequence clustering with ST-38 OmpC sequences (OmpC), nonsense mutations (OmpF) or mutations affecting one of the OmpR boxes: mutation −46T/C in OmpR-F3 box and mutation −75ΔT in OmpR-F2 box (ompF promoter).

Determination of blaOXA-48 insertion site in ST38 clades 1, 2, 3, and 4.

The genome sequences of GCA_005886035.1 (CP040390.1), CNR65D6, GCA_004759025.1 (CP038505.1), and CNR85I8 were chosen as representative genomes of clusters 1, 3, 2, and 4, respectively, and aligned, by using BlastN, against the sequence of GCA_001900295.1 (CP010116.1), an ST38 isolate that does not express a carbapenemase and whose genome sequence was complete. The regions surrounding the blaOXA-48 insertion sites in each clade were more precisely aligned against the corresponding sequences of other representative genomes by using Easyfig (Easyfig_mac_2.1) (52) (see Fig. S3 in the supplemental material). To perform a BLAST analysis of insertion sites by using contigs generated from Illumina sequences, smaller sequences were chosen as queries as following: nucleotides (nt) 1240133 to 1240934, nt 964173 to 965072, nt 419599 to 421606, nt 4206783 to 4207690 from GCA_001900295.1 to screen for native sequences corresponding to the four blaOXA-48 insertion sites in G1, G2, G3, and G4 clades and sequences nt 4460332 to 4460759 from CP040390.1, nt 4763987 to 4764506 from CNR65D6, nt 100784 to 102087 from CP038505.1, and nt 1206426 to 1206985 from CNR85I8 to screen for the integration site plus IS1 insertion as occurring in G1, G2, G3, and G4 clades, respectively.

Statistical analysis.

Pearson’s chi-squared tests were performed by using standard libraries contained within the R statistics package (http://www.R-project.org).

Availability of data.

All sequence data have been deposited at DDBJ/EMBL/GenBank (BioProject PRJEB46636) and BioSample identifiers for the Illumina sequence data are listed in Table S1. Complete genome sequences of CNR65D6 and CNR85I8 and the long-read sequencing data have been deposited at DDBJ/EMBL/GenBank with the accession numbers OU701452.1 (PacBio reads, ERR6414227) and OU701449.1 (PacBio reads, ERR6414228), respectively.

ACKNOWLEDGMENTS

This work was supported by grants from the French National Research Agency (ANR-10-LABX-62-IBEID, ANR-10-LABX-33, INCEPTION project [PIA/ANR-16-CONV-0005]) and from the European Union’s Horizon 2020 research and Innovation Program under grant agreement 773830 (Project ARDIG, One Health EJP).

Pengdbamba Dieudonné Zongo is a scholar of Ed525 CDV, Sorbonne Université, Paris.

Contributor Information

Philippe Glaser, Email: pglaser@pasteur.fr.

Zackery Bulman, University of Illinois at Chicago.

REFERENCES

  • 1.Medina M, Castillo-Pino E. 2019. An introduction to the epidemiology and burden of urinary tract infections. Ther Adv Urol 11:1756287219832172. doi: 10.1177/1756287219832172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Dortet L, Cuzon G, Ponties V, Nordmann P. 2017. Trends in carbapenemase-producing Enterobacteriaceae, France, 2012 to 2014. Euro Surveill 22(6):pii=30461. 10.2807/1560-7917.ES.2017.22.6.30461. [DOI] [PMC free article] [PubMed]
  • 3.French National Reference Center for antibiotic resistance, Le Kremlin-Bicêtre, France, activity report 2019–2020. https://www.cnr-resistance-antibiotiques.fr/bilans-dactivite.html.
  • 4.Saleem AF, Allana A, Hale L, Diaz A, Salinas R, Salinas C, Qureshi SM, Hotwani A, Rahman N, Khan A, Zaidi AK, Seed PC, Arshad M. 2020. The gut of healthy infants in the community as a reservoir of ESBL and carbapenemase-producing bacteria. Antibiotics 9:286. doi: 10.3390/antibiotics9060286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bush K, Jacoby GA. 2010. Updated functional classification of beta-lactamases. Antimicrob Agents Chemother 54:969–976. doi: 10.1128/AAC.01009-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Stoesser N, Sheppard AE, Pankhurst L, De Maio N, Moore CE, Sebra R, Turner P, Anson LW, Kasarskis A, Batty EM, Kos V, Wilson DJ, Phetsouvanh R, Wyllie D, Sokurenko E, Manges AR, Johnson TJ, Price LB, Peto TE, Johnson JR, Didelot X, Walker AS, Crook DW, Modernizing Medical Microbiology Informatics Group (MMMIG) . 2016. Evolutionary history of the global emergence of the Escherichia coli epidemic clone ST131. mBio 7:e02162-15. doi: 10.1128/mBio.02162-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gauthier L, Dortet L, Cotellon G, Creton E, Cuzon G, Ponties V, Bonnin RA, Naas T. 2018. Diversity of carbapenemase-producing Escherichia coli isolates, France 2012–2013. Antimicrob Agents Chemother 62:e00266-18. doi: 10.1128/AAC.00266-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ellaby N, Doumith M, Hopkins KL, Woodford N, Ellington MJ. 2019. Emergence of diversity in carbapenemase-producing Escherichia coli ST131, England, January 2014 to June 2016. Euro Surveill 24(37):pii=1800627. 10.2807/1560-7917.ES.2019.24.37.1800627. [DOI] [PMC free article] [PubMed]
  • 9.van der Zwaluw K, Witteveen S, Wielders L, van Santen M, Landman F, de Haan A, Schouls LM, Bosch T, Dutch CPE surveillance Study Group . 2020. Molecular characteristics of carbapenemase-producing Enterobacterales in the Netherlands; results of the 2014–2018 national laboratory surveillance. Clin Microbiol Infect 26:1412.e7–1412.e12. doi: 10.1016/j.cmi.2020.01.027. [DOI] [PubMed] [Google Scholar]
  • 10.Patino-Navarrete R, Rosinski-Chupin I, Cabanel N, Gauthier L, Takissian J, Madec JY, Hamze M, Bonnin RA, Naas T, Glaser P. 2020. Stepwise evolution and convergent recombination underlie the global dissemination of carbapenemase-producing Escherichia coli. Genome Med 12:10. doi: 10.1186/s13073-019-0699-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Roer L, Overballe-Petersen S, Hansen F, Schønning K, Wang M, Røder BL, Hansen DS, Justesen US, Andersen LP, Fulgsang-Damgaard D, Hopkins KL, Woodford N, Falgenhauer L, Chakraborty T, Samuelsen Ø, Sjöström K, Johannesen TB, Ng K, Nielsen J, Ethelberg S, Stegger M, Hammerum AM, Hasman H. 2018. Escherichia coli sequence type 410 is causing new international high-risk clones. mSphere 3:e00337-18. doi: 10.1128/mSphere.00337-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Beghain J, Bridier-Nahmias A, Le Nagard H, Denamur E, Clermont O. 2018. ClermonTyping: an easy-to-use and accurate in silico method for Escherichia genus strain phylotyping. Microb Genom 4:e000192. doi: 10.1099/mgen.0.000192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hendrickx APA, Landman F, de Haan A, Witteveen S, van Santen-Verheuvel MG, Schouls LM, Dutch Cpe Surveillance Study Group . 2021. bla(OXA-48)-like genome architecture among carbapenemase-producing Escherichia coli and Klebsiella pneumoniae in the Netherlands. Microb Genom 7:e000512. doi: 10.1099/mgen.0.000512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Carattoli A, Zankari E, Garcia-Fernandez A, Voldby Larsen M, Lund O, Villa L, Moller Aarestrup F, Hasman H. 2014. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrob Agents Chemother 58:3895–3903. doi: 10.1128/AAC.02412-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Turton JF, Doumith M, Hopkins KL, Perry C, Meunier D, Woodford N. 2016. Clonal expansion of Escherichia coli ST38 carrying a chromosomally integrated OXA-48 carbapenemase gene. J Med Microbiol 65:538–546. doi: 10.1099/jmm.0.000248. [DOI] [PubMed] [Google Scholar]
  • 16.Leverstein-van Hall MA, Dierikx CM, Cohen Stuart J, Voets GM, van den Munckhof MP, van Essen-Zandbergen A, Platteel T, Fluit AC, van de Sande-Bruinsma N, Scharinga J, Bonten MJ, Mevius DJ, National ESBL surveillance group . 2011. Dutch patients, retail chicken meat and poultry share the same ESBL genes, plasmids and strains. Clin Microbiol Infect 17:873–880. doi: 10.1111/j.1469-0691.2011.03497.x. [DOI] [PubMed] [Google Scholar]
  • 17.Oteo J, Diestra K, Juan C, Bautista V, Novais A, Pérez-Vázquez M, Moyá B, Miró E, Coque TM, Oliver A, Cantón R, Navarro F, Campos J, Spanish Network in Infectious Pathology Project (REIPI) . 2009. Extended-spectrum beta-lactamase-producing Escherichia coli in Spain belong to a large variety of multilocus sequence typing types, including ST10 complex/A, ST23 complex/A and ST131/B2. Int J Antimicrob Agents 34:173–176. doi: 10.1016/j.ijantimicag.2009.03.006. [DOI] [PubMed] [Google Scholar]
  • 18.Manges AR, Mende K, Murray CK, Johnston BD, Sokurenko EV, Tchesnokova V, Johnson JR. 2017. Clonal distribution and associated characteristics of Escherichia coli clinical and surveillance isolates from a military medical center. Diagn Microbiol Infect Dis 87:382–385. doi: 10.1016/j.diagmicrobio.2017.01.007. [DOI] [PubMed] [Google Scholar]
  • 19.Price LB, Johnson JR, Aziz M, Clabots C, Johnston B, Tchesnokova V, Nordstrom L, Billig M, Chattopadhyay S, Stegger M, Andersen PS, Pearson T, Riddell K, Rogers P, Scholes D, Kahl B, Keim P, Sokurenko EV. 2013. The epidemic of extended-spectrum-beta-lactamase-producing Escherichia coli ST131 is driven by a single highly pathogenic subclone, H30-Rx. mBio 4:e00377-13. doi: 10.1128/mBio.00377-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Nicolas-Chanoine MH, Bertrand X, Madec JY. 2014. Escherichia coli ST131, an intriguing clonal group. Clin Microbiol Rev 27:543–574. doi: 10.1128/CMR.00125-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Decano AG, Downing T. 2019. An Escherichia coli ST131 pangenome atlas reveals population structure and evolution across 4,071 isolates. Sci Rep 9:17394. doi: 10.1038/s41598-019-54004-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Potron A, Bernabeu S, Cuzon G, Pontiès V, Blanchard H, Seringe E, Naas T, Nordmann P, Dortet L. 2017. Analysis of OXA-204 carbapenemase-producing Enterobacteriaceae reveals possible endoscopy-associated transmission, France, 2012 to 2014. Euro Surveill 22(49):pii=17-00048. 10.2807/1560-7917.ES.2017.22.49.17-00048. [DOI] [PMC free article] [PubMed]
  • 23.Lepelletier D, Berthelot P, Lucet JC, Fournier S, Jarlier V, Grandbastien B, National Working Group . 2015. French recommendations for the prevention of 'emerging extensively drug-resistant bacteria' (eXDR) cross-transmission. J Hosp Infect 90:186–195. doi: 10.1016/j.jhin.2015.04.002. [DOI] [PubMed] [Google Scholar]
  • 24.Ortega A, Sáez D, Bautista V, Fernández-Romero S, Lara N, Aracil B, Pérez-Vázquez M, Campos J, Oteo J, Spanish Collaborating Group for the Antibiotic Resistance Surveillance Programme . 2016. Carbapenemase-producing Escherichia coli is becoming more prevalent in Spain mainly because of the polyclonal dissemination of OXA-48. J Antimicrob Chemother 71:2131–2138. doi: 10.1093/jac/dkw148. [DOI] [PubMed] [Google Scholar]
  • 25.Grundmann H, Glasner C, Albiger B, Aanensen DM, Tomlinson CT, Andrasević AT, Cantón R, Carmeli Y, Friedrich AW, Giske CG, Glupczynski Y, Gniadkowski M, Livermore DM, Nordmann P, Poirel L, Rossolini GM, Seifert H, Vatopoulos A, Walsh T, Woodford N, Monnet DL, European Survey of Carbapenemase-Producing Enterobacteriaceae (EuSCAPE) Working Group . 2017. Occurrence of carbapenemase-producing Klebsiella pneumoniae and Escherichia coli in the European survey of carbapenemase-producing Enterobacteriaceae (EuSCAPE): a prospective, multinational study. Lancet Infect Dis 17:153–163. doi: 10.1016/S1473-3099(16)30257-2. [DOI] [PubMed] [Google Scholar]
  • 26.David S, Reuter S, Harris SR, Glasner C, Feltwell T, Argimon S, Abudahab K, Goater R, Giani T, Errico G, Aspbury M, Sjunnebo S, EuSCAPE Working Group, ESGEM Study Group, Feil EJ, Rossolini GM, Aanensen DM, Grundmann H. 2019. Epidemic of carbapenem-resistant Klebsiella pneumoniae in Europe is driven by nosocomial spread. Nat Microbiol 4:1919–1929. doi: 10.1038/s41564-019-0492-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Potron A, Poirel L, Nordmann P. 2014. Derepressed transfer properties leading to the efficient spread of the plasmid encoding carbapenemase OXA-48. Antimicrob Agents Chemother 58:467–471. doi: 10.1128/AAC.01344-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.León-Sampedro R, DelaFuente J, Díaz-Agero C, Crellen T, Musicha P, Rodríguez-Beltrán J, de la Vega C, Hernández-García M, R-GNOSIS WP5 Study Group, López-Fresneña N, Ruiz-Garbajosa P, Cantón R, Cooper BS, San Millán Á. 2021. Pervasive transmission of a carbapenem resistance plasmid in the gut microbiota of hospitalized patients. Nat Microbiol 6:606–616. doi: 10.1038/s41564-021-00879-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Matsui Y, Hu Y, Rubin J, de Assis RS, Suh J, Riley LW. 2020. Multilocus sequence typing of Escherichia coli isolates from urinary tract infection patients and from fecal samples of healthy subjects in a college community. Microbiologyopen 9:1225–1233. doi: 10.1002/mbo3.1032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.David S, Cohen V, Reuter S, Sheppard AE, Giani T, Parkhill J, European Survey of Carbapenemase-Producing Enterobacteriaceae (EuSCAPE) Working Group, ESCMID Study Group for Epidemiological Markers (ESGEM), Rossolini GM, Feil EJ, Grundmann H, Aanensen DM. 2020. Integrated chromosomal and plasmid sequence analyses reveal diverse modes of carbapenemase gene spread among Klebsiella pneumoniae. Proc Natl Acad Sci USA 117:25043–25054. doi: 10.1073/pnas.2003407117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Emeraud C, Girlich D, Bonnin RA, Jousset AB, Naas T, Dortet L. 2021. Emergence and polyclonal dissemination of OXA-244-producing Escherichia coli, France. Emerg Infect Dis 27:1206–1210. doi: 10.3201/eid2704.204459. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Massot M, Daubié AS, Clermont O, Jauréguy F, Couffignal C, Dahbi G, Mora A, Blanco J, Branger C, Mentré F, Eddi A, Picard B, Denamur E, Coliville Group . 2016. Phylogenetic, virulence and antibiotic resistance characteristics of commensal strain populations of Escherichia coli from community subjects in the Paris area in 2010 and evolution over 30 years. Microbiology (Reading) 162:642–650. doi: 10.1099/mic.0.000242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Didion JP, Martin M, Collins FS. 2017. Atropos: specific, sensitive, and speedy trimming of sequencing reads. PeerJ 5:e3720. doi: 10.7717/peerj.3720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10:563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
  • 37.Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Deatherage DE, Barrick JE. 2014. Identification of mutations in laboratory-evolved microbes from next-generation sequencing data using breseq. Methods Mol Biol 1151:165–188. doi: 10.1007/978-1-4939-0554-6_12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  • 40.Touchon M, Hoede C, Tenaillon O, Barbe V, Baeriswyl S, Bidet P, Bingen E, Bonacorsi S, Bouchier C, Bouvet O, Calteau A, Chiapello H, Clermont O, Cruveiller S, Danchin A, Diard M, Dossat C, Karoui ME, Frapy E, Garry L, Ghigo JM, Gilles AM, Johnson J, Le Bouguenec C, Lescat M, Mangenot S, Martinez-Jehanne V, Matic I, Nassif X, Oztas S, Petit MA, Pichon C, Rouy Z, Ruf CS, Schneider D, Tourret J, Vacherie B, Vallenet D, Medigue C, Rocha EP, Denamur E. 2009. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet 5:e1000344. doi: 10.1371/journal.pgen.1000344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Becker L, Fuchs S, Pfeifer Y, Semmler T, Eckmanns T, Korr G, Sissolak D, Friedrichs M, Zill E, Tung ML, Dohle C, Kaase M, Gatermann S, Rüssmann H, Steglich M, Haller S, Werner G. 2018. Whole genome sequence analysis of CTX-M-15 producing Klebsiella isolates allowed dissecting a polyclonal outbreak scenario. Front Microbiol 9:322. doi: 10.3389/fmicb.2018.00322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Huang W, Li L, Myers JR, Marth GT. 2012. ART: a next-generation sequencing read simulator. Bioinformatics 28:593–594. doi: 10.1093/bioinformatics/btr708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. 2010. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Sahl JW, Caporaso JG, Rasko DA, Keim P. 2014. The large-scale blast score ratio (LS-BSR) pipeline: a method to rapidly compare genetic content between bacterial genomes. PeerJ 2:e332. doi: 10.7717/peerj.332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Treangen TJ, Ondov BD, Koren S, Phillippy AM. 2014. The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol 15:524. doi: 10.1186/s13059-014-0524-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Croucher NJ, Page AJ, Connor TR, Delaney AJ, Keane JA, Bentley SD, Parkhill J, Harris SR. 2015. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res 43:e15. doi: 10.1093/nar/gku1196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Letunic I, Bork P. 2019. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res 47:W256–W259. doi: 10.1093/nar/gkz239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Bortolaia V, Kaas RS, Ruppe E, Roberts MC, Schwarz S, Cattoir V, Philippon A, Allesoe RL, Rebelo AR, Florensa AF, Fagelhauer L, Chakraborty T, Neumann B, Werner G, Bender JK, Stingl K, Nguyen M, Coppens J, Xavier BB, Malhotra-Kumar S, Westh H, Pinholt M, Anjum MF, Duggett NA, Kempf I, Nykäsenoja S, Olkkola S, Wieczorek K, Amaro A, Clemente L, Mossong J, Losch S, Ragimbeau C, Lund O, Aarestrup FM. 2020. ResFinder 4.0 for predictions of phenotypes from genotypes. J Antimicrob Chemother 75:3491–3500. doi: 10.1093/jac/dkaa345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Waters NR, Abram F, Brennan F, Holmes A, Pritchard L. 2020. Easy phylotyping of Escherichia coli via the EzClermont web app and command-line tool. Access Microbiol 2:acmi000143. doi: 10.1099/acmi.0.000143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Sullivan MJ, Petty NK, Beatson SA. 2011. Easyfig: a genome comparison visualizer. Bioinformatics 27:1009–1010. doi: 10.1093/bioinformatics/btr039. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

TABLE S1

Main characteristics of the F-NRC CP-Ec. Download Table S1, XLSX file, 0.1 MB (126.9KB, xlsx) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

TABLE S2

Main characteristics of genome sequences used for phylogenetic reconstructions. Download Table S2, XLSX file, 0.3 MB (266.1KB, xlsx) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S1

Per-year analysis of the origin of the isolates received by the F-NRC. (A) Proportions of the isolates belonging to the different STs. The total number of isolates is indicated within parentheses. The large proportion of ST90 isolates collected in 2012 is linked to a local outbreak. (B) Proportions of the isolates collected in screening and infection situations as a function of the year and phylogroup. The absolute number of isolates for each class is indicated inside the bars. Download FIG S1, PDF file, 0.02 MB (24.9KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S2

Core genome phylogenies of the main STs characterized by clades disseminating internationally. (A) ST410; (B) ST167; (C) ST405; (D) ST101. Phylogenies were based on genome sequences of 64 CP-Ec isolates from the F-NRC and 146 genome sequences from the NCBI database (A), 35 CP-Ec isolates from the F-NRC and 134 genome sequences from the NCBI database (B), 15 CP-Ec isolates from the F-NRC and 145 genome sequences from the NCBI database (C), and 13 CP-Ec isolates from the F-NRC and 194 genome sequences from the NCBI database (D). Core genome (ST410, 3,522,000 nt; ST167, 3,276,000 nt; ST405, 3,685,000 nt; ST101, 3,774,000 nt) alignments of the de novo assemblies on the sequences of GCA_001442495.1 (ST410), GCA_003028815.1 (ST167), GCA_002142675.1 (ST405), and GCA_002163655.1 (ST101) used as reference were performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 6,176 (ST410), 10,393 (ST167), 37,482 (ST405), and 19,967 (ST101) core SNPs after removing recombined regions with gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR93E7 (ST88), CNR93D10 (ST746), CNR73I9 (ST115), and CNR93I2 (ST906) were used as outgroups for the phylogenetic analyses of ST410, ST167, ST405, and ST101, respectively. The origin from the F-NRC is indicated by red triangles close to the isolate name. Other genomic features are indicated as indicated in the figure key (left) from the inside to the outside lines: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance), number of ARGs, mutations in ftsI, ompC, ompF; and geographical origin. Download FIG S2, PDF file, 0.9 MB (946KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S3

Core genome phylogenies of ST10 and ST131 isolates. (A) ST10 phylogeny based on the genome sequences of 67 CP-Ec isolates from the F-NRC and 153 genome sequences from the NCBI. (B) ST131 phylogeny based on the genome sequences of 17 CP-Ec isolates from the F-NRC and 462 genome sequences from the NCBI. Core genome (ST10, 813,000 nt; ST131, 1,641,000 nt) alignments of the de novo assemblies on the sequence of MG1655 (ST10) or GCA_000285655.3 (ST131) used as references were performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 20,245 (ST10) and 14,366 (ST131) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR93D10 (ST746) and CNRAL47G10 (ST640) were used as outgroups for the phylogenetic analyses of ST10 and ST131, respectively. F-NRC isolates are indicated by red triangles (inner circle). Other genomic features are indicated as indicated in the figure key (left) from the inside to the outside circles: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance), number of ARGs, mutations in ftsI, ompC, and ompF; and geographical origin. The ST10 clade enriched in CP-Ec is indicated in green and the subclade with the YRIN insertion in PBP3 is indicated in red. Note that the circle for ftsI mutations is absent in ST131, as no YRIN-like duplication in PBP3 was identified among the analyzed isolates. Download FIG S3, PDF file, 1.3 MB (1.4MB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S4

Core genome phylogeny of ST940 and ST1284 isolates. (A) ST940 phylogeny based on genome sequences of 17 CP-Ec isolates from the F-NRC and 88 genome sequences from EnteroBase and the NCBI database. (B) ST1284 phylogeny based on genome sequences of 14 CP-Ec isolates from the F-NRC and 60 genome sequences from EnteroBase and the NCBI database. A core genome (ST940, 3,639,000 nt; ST1284, 3,779,000 nt) alignment of the de novo assemblies on the sequence of ST940: ESC_LB2149 and ST1284: ESC_LB2152AA (from EnteroBase) used as reference was performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 13,859 (ST940) and 2,009 (ST1284) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR98G1 (ST3022) and MG1655 (ST10) were used as outgroups for the phylogenetic analyses of ST940 and ST1284, respectively. F-NRC isolates are indicated by red triangles in thefirst column on the left. Other genomic features are as indicated in the figure key (left) from the left to the right columns: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance); number of ARGs; mutations in ftsI, ompC, and ompF; and geographical origin. Download FIG S4, PDF file, 0.3 MB (362.8KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S5

Core genome phylogenies of ST69, ST88, and ST90 isolates. (A) ST90 phylogeny based on the 15 CP-Ec isolates from the F-NRC and 38 genome sequences from the NCBI database. (B) ST69 phylogeny based on the 16 CP-Ec isolates from the F-NRC and 244 genome sequences from the NCBI database. (C) ST88 phylogeny based on the 15 CP-Ec isolates from the F-NRC and 99 genome sequences from the NCBI database. Core genome (ST69, 2,735,000 nt; ST88, 3,472,000 nt; ST90, 3,801,000 nt) alignments of the de novo assemblies on the sequence of GCA_002443135.1 (ST69), GCA_002812685.1 (ST88), or GCA_001900635.1 (ST90) used as reference were performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 4,596 (ST69), 12,557 (ST88), or 2,878 (ST90) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR33D9 (ST394), CNR83B9 (ST410), and CNR88B9 (ST847) were used as outgroups for the phylogenetic analyses of ST69, ST88, and ST90, respectively. F-NRC isolates are indicated by red triangles (first column on the left). Other genomic features are indicated as indicated in the figure key (left) from the left to the right columns: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance); number of ARGs; mutations in ftsI, ompC, and ompF; and geographical origin. Download FIG S5, PDF file, 0.4 MB (465.4KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S6

Core genome phylogeny of ST617 and ST648 isolates based on genome sequences of ST617 genome sequences of 15 CP-Ec isolates from the F-NRC and 65 genome sequences from the NCBI database (A) and ST648genome sequences of 14 CP-Ec isolates from the F-NRC and 126 genome sequences from the NCBI database (B). A core genome (ST617, 3,407,000; ST648, 3,501,000 nt) alignment of the de novo assemblies on the sequences of GCA_002142695.1 (ST617) or GCA_004138645.1 (ST648) used as reference was performed by using Parsnp (T. J. Treangen, B. D. Ondov, S. Koren, and A. M. Phillippy, Genome Biol 15:524, 2014, https://doi.org/10.1186/preaccept-2573980311437212); ML phylogeny was built with RAxML (A. Stamatakis, Bioinformatics 30:1312−1313, 2014, https://doi.org/10.1093/bioinformatics/btu033) from 15,030 (ST617) or 3,782 (ST648) core SNPs after removing recombined regions with Gubbins (N. J. Croucher, A. J. Page, T. R. Connor, A. J. Delaney, et al., Nucleic Acids Res 43:e15, 2015, https://doi.org/10.1093/nar/gku1196). The genome sequences of CNR93D10 (ST746) and CNR71A8 (ST1485) were used as outgroups for the phylogenetic analyses of ST617 and ST648, respectively. F-NRC isolates are indicated by red triangles close to the isolate name. Other genomic features are indicated as indicated in the figure key (left) from the left to the right columns: carbapenemases of the OXA, NDM, and other types; number of mutations in gyrA and parC QRDR region (FQ resistance); number of ARGs; mutations in ftsI, ompC, and ompF; and geographical origin. Download FIG S6, PDF file, 0.4 MB (429.4KB, pdf) .

Copyright © 2022 Patiño-Navarrete et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.


Articles from mSystems are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES