Skip to main content
mSystems logoLink to mSystems
. 2025 Dec 9;11(1):e01274-25. doi: 10.1128/msystems.01274-25

Genomic and functional characterization of the L‑sorbose phosphotransferase system in high-risk Escherichia coli lineages

Lena-Sophie Swiatek 1, Elias Eger 1, Kristin Surmann 2, Fynn Meller 1, Lukas Schulig 3, Marco Harms 2, Thaddäus Echelmeyer 1, Christian Hentschker 2, Uwe Völker 2, Michael Schwabe 1, Katharina Schaufler 1,4,
Editor: Atmika Paudel5
PMCID: PMC12817954  PMID: 41363471

ABSTRACT

Certain Escherichia coli strains belong to international high-risk clonal lineages exhibiting a combination of multidrug resistance, enhanced fitness, and increased virulence, posing major public health challenges. Investigation of characteristics beyond antibiotic resistance genes and classical virulence factors is crucial to gain understanding of common traits among these pathogens. Analysis of 22,267 publicly available E. coli genomes from five pathogenic and one commensal sequence type (ST) revealed the L-sorbose phosphotransferase system (PTS ) as a pathogen-enriched marker. Although previously investigated for evolutionary insights, this system has not been functionally assessed within high-risk clonal lineages, as performed in the present study. Our multi-omics results demonstrate that the L-sorbose PTS is induced in pathogenic strains of ST131 and ST648 in the presence of L-sorbose, conferring a competitive advantage compared to a commensal ST10 strain. Transcriptomic analysis revealed co-regulated pathways involving metabolism, flagellar motility, and capsular polysaccharide production, indicating roles in niche adaptation. Despite some strain-specific variations, functional assays with knock-out/knock-in mutants and Galleria mellonella infection models confirmed the operon’s impact on competition and fitness.

IMPORTANCE

This study highlights the value of combining large-scale genomic analyses with functional validation for identifying and analyzing the L‑sorbose phosphotransferase system in different E. coli high-risk clones. This knowledge may be applied in future studies to address the need for identifying alternative, pathogen-specific targets.

KEYWORDS: E. coli, antimicrobial resistance, sorbose and associated fitness, competition, multi-omics investigation

INTRODUCTION

The Escherichia coli species comprises highly diverse strains and includes both commensal representatives, integral to the healthy human intestinal microbiome, and pathogenic types (1). Pathogenic E. coli strains are typically classified into intestinal pathogenic E. coli (InPEC) and extraintestinal pathogenic E. coli (ExPEC) (1, 2). ExPEC colonizes the intestine and can cause a wide range of diseases, including urinary tract infections, sepsis, and meningitis (3, 4). Phylogenetic analysis has separated E. coli into phylogroups, with phylogroups B2, D, and F predominating in extraintestinal infections (5).

Certain ExPEC sequence types (STs), such as the globally dominant ST131, ST648, and ST38, have emerged as high-risk clonal lineages that combine multidrug-resistant (MDR) with virulence and fitness traits (6, 7). These clonal lineages, characterized by their high international prevalence, are often equipped with plasmid-encoded resistance mechanisms (8, 9). The overuse and misuse of antibiotics exert strong selection pressure on bacterial communities selecting for resistant strains (10). Infections with resistant strains, especially when combined with high virulence, severely affect patient outcomes (11). The highest number of deaths associated with or attributable to resistance worldwide is caused by resistant isolates of E. coli (9).

In contrast, commensal E. coli strains, often from phylogroup A, such as ST10, are generally non-pathogenic. However, these strains can harbor antimicrobial resistance (AMR) genes and act as opportunistic pathogens, particularly in immunocompromised individuals (2, 5, 6). The diversity in pathogenic potential across E. coli subtypes reflects their large accessory genome, which frequently includes virulence-associated genes and contributes to their adaptability and varied roles in human health (1, 12). However, diversity goes beyond classical resistance and virulence genes (13). Therefore, this study explored the L-sorbose phosphotransferase system (PTS) (sor-operon) potentially associated with virulence or fitness in international high-risk clonal lineages of MDR E. coli, such as the L-sorbose phosphotransferase system (PTS) (sor-operon). Previous research has investigated the evolutionary and diagnostic relevance of L-sorbose utilization in E. coli and Shigella spp. (n = 266), primarily focusing on InPEC (14). That work applied PCR and Sanger sequencing, revealing the presence of the operon encoding an L-sorbose-specific PTS without demonstrating its direct role in conferring a benefit, such as a connection between this metabolic pathway and the fitness or virulence in MDR high-risk clonal lineages (14). Through homologous clustering of 22,267 publicly available E. coli genomes, we identified the L-sorbose PTS enriched in high-risk clonal lineages. Detailed multi-omics analysis of two strains from ST131 and ST648 revealed the sor-operon as a determinant of fitness and competitive advantage, which is linked to co-regulated pathways such as flagellar motility and capsular polysaccharide production. Its involvement in these pathways was further validated through knock-out and knock-in experiments.

MATERIALS AND METHODS

Bacterial strains

Homologous clustering included six STs: ST10 as a commensal representative (phylogroup A), often isolated as a colonizer, and ST38 (phylogroup D), ST131 (phylogroup B2), ST405 (phylogroup D), ST410 (phylogroup C), and ST648 (phylogroup F) as pathogenic representatives based on their global success as high-risk clonal lineages (6). Initial microbiological screenings were conducted for three strains of each ST (Table S1). We included two mutants of the wild-type strain PBIO729 (ST131): PBIO729∆sorAM, with a deletion in transport component genes, and PBIO729∆sorE, which cannot reduce L-sorbose-1-phosphate to d sorbitol-6-phosphate. Additionally, a sor-operon knock-in strain (PBIO365:sor, ST10) with insertion into the region identical to sor positive strains was included. For in-depth analysis, including transcriptomics, proteomics, and phenotypic characterization, we selected one strain from each of ST10 (PBIO365), ST131 (PBIO729), and ST648 (PBIO730). ST131 was chosen as the dominant ExPEC lineage, while ST648 was selected due to its emergence in multiple regions as a potential pandemic clone (6, 1517). Both combine epidemiological relevance with complementary biological traits and differ in phylogroups and certain phenotypic features. Within each selected ST, we chose isolates from our in-house collection that are extensively characterized with phenotypic data and high-quality whole-genome sequences, enabling robust multi-omics integration (16, 18, 19).

Data set and homologous clustering

E. coli genomes (Table S1) were downloaded from EnteroBase database (v1.2.0) (20) (n = 24,724; Dec. 2020) and filtered (L50-value ≥ 20 contigs), allowing 5% missing genes within the Benchmarking Universal Single-Copy Orthologue (BUSCO, v 5.0) and the Enterobacterales odb10 analysis (21). Twenty-eight genomes, either publicly available via NCBI or provided by cooperation partners, were integrated to the data set. Reads were quality trimmed with Trim Galore (v. 0.6.8; https://github.com/FelixKrueger/TrimGalore) and assembled with Shovill (v. 1.1.0; https://github.com/tseemann/shovill), if only raw data were available. Genome assemblies were annotated using Prokka (v. 1.14.6) (22) and homologous clustering using cd-hit (v. 4.8.1) (23) with protein sequences as input (cut-offs: 95% identity and coverage). Pathogen-enriched markers were characterized by their presence in ≥90% of each ST of the pathogenic genomes and in ≤10% of the commensal genomes. BLAST analysis was performed using the non-redundant databases of the National Center for Biotechnology Information (24) and UniProt (25). To examine potential bias from artificial laboratory strains (e.g., E. coli K-12 MG1655), E. coli genomes belonging to ST10 were filtered based on the “Name” column in EnteroBase. If a name (e.g., E. coli K-12 MG1655) occurred ≥10 times, one assembly was randomly chosen as a representative.

Cultivation and harvest of bacterial strains

Bacterial strains (including PBIO365, PBIO2, PBIO729, PBIO730, PBIO729∆sorAM, PBIO729∆sorE, PBIO365:sor) were cultivated in Nutrient Broth II (NB) without (ctrl) or with 1% L-sorbose (sor) (Carl Roth, Karlsruhe, Germany), shaking at 150–220 rpm, 37°C, unless stated otherwise (Table S1). For growth kinetics, 5 mL of NB (ctrl or sor), LB, or M9 minimal medium were inoculated with a single colony of the respective strain and incubated overnight. Either 10 mL of medium was inoculated at an optical density of 0.05 at λ = 600 nm (OD600), and growth was monitored by measurements every 30 min for 8 h, or 200 µL was transferred into a 96-well plate, and growth was assessed automatically every 30 min for 24 h using a plate reader (CLARIOstarplus, BMG Labtech, Ortenberg, Germany).

Total protein and RNA were extracted from the same cultures (PBIO365, PBIO720, and PBIO730), which were initiated through a systematic serial dilution of 5 mL NB overnight cultures originating from a glycerol stock. Pre-cultures (20 mL of NB) were inoculated starting from mid-exponential phase overnight cultures (referring to an OD600 of approx. 0.6-1.0), and main cultures (80 mL NB ctrl or sor) were initiated from mid-exponential phase pre-cultures at an OD600 of 0.05. Cell harvest was performed during the early stationary growth phase and involved cooling in liquid nitrogen and centrifugation (3 min; 4°C; 8,000 × g). Cell pellets were stored at −80°C until RNA and protein preparation.

Visualization of L-sorbose utilization in pathogenic E. coli strains

The utilization of L-sorbose was evaluated as described previously using NB-agar plates supplemented with 1% L-sorbose and 50 mg/L neutral red (14). Utilization of other sugars, including D-sorbitol (Merck), D-glucose (Carl Roth), D-fructose (Merck), and D-mannose (Sigma Aldrich; Merck), was equally tested.

Isolation of total DNA

Total DNA extraction was performed using the MasterPure DNA Purification Kit for Blood, Version 2 (Lucigen, Middleton, WI, USA), according to the manufacturer’s instructions. DNA was quantified with dsDNA HS Assay Kit using a Qubit 4 fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) (e.g., 21).

Mutant generation and sequencing

Marker-less mutant strains PBIO729∆sorAM, PBIO729∆sorE, and PBIO365:sor were generated by Creative Biogene (Shirley, NY, USA) through homologous recombination. Chromosomal DNA of the mutants was isolated and then shipped to SeqCenter (Pittsburgh, PA, USA) (Supplementary methods I).

Preparation of total RNA

For transcriptomic analysis and Northern blotting, total bacterial RNA was prepared by mechanical disruption and acid phenol-chloroform extraction as previously described, with minor modifications (26). Modifications concerned the resuspension upon mechanical disruption in 2 mL only, as a bacterial pellet equivalent to 8 OD units was used (see Table S2 for the composition of all buffers).

Northern blot analysis

The sor-operon expression levels were analyzed by using sorC or sorE digoxigenin-labeled probes (Supplementary methods II).

Database for omics analysis

To compare the expression and translation profiles in the presence and absence of L-sorbose between PBIO365, PBIO729, and PBIO730, a pangenome was built with PIRATE (v. 1.0.4, -s 95,96,97,98,99,100, -k -cd-low 100 -e 1E-12 -hsp-len 0.5). This resulted in a grouping of genes from all three strains and the assignment of a unique group ID. The resulting representative protein sequences were used as input for proteomic analysis to minimize redundancy and enable consistent peptide mapping across strains.

RNA sequencing and differential gene expression analysis

Total RNA samples of PBIO365, PBIO729, and PBIO730 were frozen and shipped to LGC Genomics GmbH (Berlin, Germany). The rRNA depletion and mRNA library preparation was performed by LGC Genomics GmbH (Berlin, Germany), and sequencing was conducted with 1 × 75 bp reads (Illumina NextSeq 550, non-stranded). Raw reads were adapter- and quality-trimmed using Trim Galore. Trimmed reads were mapped using Bowtie 2 (v. 2.4.5/mode: -very-sensitive-local) (27) with the genome assemblies as references for the individual strains. Gene counts were calculated using featureCounts (v. 2.0.1) (28) based on the annotation for each strain. The count table was imported into RStudio (v. 4.3.1). Differentially expressed genes were identified with DESeq2 (v. 1.40.0) in default mode by comparing counts from RNA preparations in the absence and presence of L-sorbose. Log2 fold-change (L2FC) was calculated by comparing individual strains grown in the presence and absence of L-sorbose.

Isolation of bacterial proteins

The bacterial cell pellets were processed as described before (29) and washed twice (4°C, 10 min, 17,000 × g) with ice-cold phosphate-buffered saline (PBS; Thermo Fisher Scientific). The pellets were then reconstituted in 100 µL of 20 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES; pH 8.0) with 2% (wt/vol) sodium dodecyl sulfate (SDS), followed by denaturation (95°C, 1 min) with vigorous shaking. Cell disruption was achieved using pre-cooled vessels and a Mixer Mill MM 400 (Retch, Haan, Germany) for 3 min at 2,600 rpm. Cell powder was resuspended in 150 µL of preheated (95°C) 20 mM HEPES (pH 8.0) and transferred into a 1.5 mL low binding pre-lubricated tube (Sorenson BioScience, Salt Lake City, UT, USA). After cooling to room temperature (RT), lysates were supplemented with 4 mM MgCl2 and 0.005 U/µL benzonase (Pierce Universal Nuclease, Thermo Fisher Scientific). Ultrasonication was carried out for 5 min, and cell debris was pelleted by centrifugation (30 min; 17,000 × g; RT). The resulting supernatants were then transferred into a fresh 1.5 mL low binding pre-lubricated tube (SorensonTM BioScience).

Bicinchoninic acid assay

Protein concentration was determined as described previously (29, 30), using the MicroBCA Protein Assay Kit (Thermo Fisher Scientific), according to the manufacturer’s instructions. Measurements were conducted using BioTek Synergy (Agilent Technologies, Waldbronn, Germany). The obtained data were analyzed through the Shiny application, as described by Reder et al., to ensure accurate quantification (30).

Single-plot solid-phase enhanced sample preparation

The single-plot solid-phase enhanced sample preparation, with minor adjustments as described by Blankenburg et al., was executed (29, 31). For trypsin digestion, 5 µg of total protein in 10 µL of 20 mM HEPES (pH 8.0) were incubated (18 min shaking at 1,400 rpm) with 10 µL of magnetic beads, comprising equal volumes of hydrophilic (Speedbead magnetic carboxylated modified particles, GE Healthcare, United Kingdom) and hydrophobic beads (Sera-Mag Speedbead carboxylated-modified particles, Thermo Fisher Scientific). The supernatant was discarded by bead sedimentation on a magnetic rack, followed by two washes with 70% (vol/vol) ethanol and one wash with 100% (vol/vol) acetonitrile (ACN) (Thermo Fisher Scientific) before air-drying. Trypsin digestion occurred in freshly prepared 20 mM ammonium bicarbonate buffer, with a trypsin-to-protein ratio of 1:25 (18 h at 37°C). The digest was stopped by adding ACN, and the protocol was performed as described before (31).

Acquisition and analysis of mass spectrometry data

Peptides were separated using an UltiMate 3000 nanoLC device (Thermo Fisher Scientific) with a pre-column (Acclaim PepMap; Thermo Fisher Scientific) and an analytical column (Accucore; Thermo Fisher Scientific) by applying a binary gradient with buffer A (0.1% [vol/vol] acetic acid in HPLC-grade water) and buffer B (0.1% [vol/vol] acetic acid in ACN) at a flow rate of 300 nL/min, as described previously (29). After ionization, peptides were analyzed with a Q Exactive HF mass spectrometer (Thermo Fisher Scientific) in data-independent acquisition mode. Specification of the used gradient and detailed settings on nanoLC-MS/MS data acquisition are provided (Table S5). Raw data were mapped against the in-house established database (see 2.9), including whole-genome sequences of analyzed strains, using Spectronaut (v. 16.0) (Biognosys, Schlieren, Switzerland). All search parameters are provided (Supplementary methods III, Table S6). Statistical comparison was accomplished on peptide level with the PECA package (v. 1.30.0) by applying a ROTS test (ROPECA approach—reproducibility-optimized peptide change averaging) (32). Protein intensities were calculated using the maxLFQ algorithm (33). Proteome data have been stored at the ProteomeXchange Consortium via the PRIDE partner repository (34) with the dataset identifier PXD056542.

Multi-omics marker analysis

The multi-omics analysis was performed using three biological replicates for RNA-Seq and four biological replicates for mass spectrometry for each condition and strain. Differential expression was defined as L2FC > 1 or < −1 (|L2FC| > 1.0) with an adjusted P ≤ 0.05. Genes not meeting these criteria were considered to exhibit no differential expression. The pathogen-enriched markers included genes that were up- or downregulated in the pathogenic strains (PBIO729 and PBIO730) under sor compared to ctrl (sorbose-free) conditions, but showed no differential expression in the commensal representative (PBIO365). Additionally, we analyzed strain-specific up- or downregulated features. For a multi-step examination, up- or downregulated features were investigated by integrating eggNOG (v. 2.1.9) annotations (35, 36) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) (37). Information regarding the cluster of orthologous groups (COG) was plotted for genes showing up- or downregulation on transcript levels.

Competition assay

Competition assays assessed the fitness of each strain relative to another strain. Main cultures (NB sor or ctrl) of the respective strain combination were inoculated at a total OD600 of 0.05, with a strain ratio of 1:3. Samples were collected at 0 h and 24 h, diluted, and plated on selective agar for differentiation of strains: either LB and LB-cefotaxime (4 µg/mL) (Thermo Fisher Scientific), LB-chloramphenicol (8 µg/mL) (Sigma Aldrich; Merck), or NB agar supplemented with 1% L-sorbose and 50 mg/L neutral red (Merck, Darmstadt, Germany). Differentiation was verified based on colony PCR detecting an amplicon of the sor-operon (Table S7).

Survival in human serum

The survival of the different bacterial strains in 50% human serum was assessed as described previously with minor modifications (38). Strains were incubated in NB without or with 1% L-sorbose overnight (37°C, shaking). Then, bacteria were washed and resuspended in PBS containing either 2% L-sorbose or 2% D-glucose to a final OD600 of 0.1. Bacterial suspensions were then mixed with equal amounts of human serum (US origin, Sigma-Aldrich, St. Louis, USA) and seeded in a 96-well plate. Determination of CFU/mL was performed immediately and after 24 h. Serum resistance was expressed as the L2FC in CFU/mL from t0 to t24.

Motility assay

Strains were incubated in either NB without or with 1% L-sorbose overnight (37°C, shaking) before diluting cultures 1:100 in fresh medium. Motility agar was prepared using NB with 0.3% agar (Carl Roth), with or without L-sorbose. Before spotting 10 µL of each culture onto agar plates in triplicates, bacterial cells were washed with PBS and adjusted to OD600 of 0.5. Plates were incubated at 37°C for 24 h. Motility was determined by measuring the radius of the colony and the swarming.

Biolog plates

Biolog plates PM1, PM2, PM3, and PM5 (Biolog, Hayward, CA, USA) were used to screen the strains PBIO365, PBIO729, PBIO729∆sorE, and PBIO730, following the manufacturer’s protocol with slight modifications (Supplementary methods IV).

Galleria mellonella infection model

For the Galleria mellonella infection model, overnight cultures were diluted 1:50 in 20 mL NB and incubated. Then, 1 mL of culture was collected (5 min, 8,000 × g) and washed twice with PBS. Optical density was set to OD600 of 0.1 (~105 CFU). The larvae of the greater wax moth (Galleria mellonella) were used as an in vivo infection model, as described previously (39).

Statistical analysis

All phenotypic experiments were conducted with three or more independent biological replicates, and statistical analyses were performed using GraphPad Prism (v. 9.3.0) for Windows (GraphPad Software, San Diego, CA, USA). Data were expressed as mean and standard deviation unless stated otherwise. Statistical significance was assessed via one-way analysis of variance (ANOVA) with uncorrected Fisher’s LSD, unpaired t with Welch’s correction, or two-way ANOVA with Tukey correction for multiple comparisons. Significance levels were indicated as follows: *P < 0.033; **P < 0.002; ***P < 0.001. For transcriptomic and proteomic analyses, an L2FC greater than 1 (|L2FC| > 1.0) with an adjusted P-value equal to or smaller than 0.05 was considered significant.

RESULTS

Discovery of the L-sorbose phosphotransferase system as a unique marker in pathogenic E. coli lineages

For homologous clustering of 22,267 E. coli genomes, we included six STs: ST10, often isolated in colonization and surveillance studies comprising a high diversity of isolates, and ST38, ST131, ST405, ST410, and ST648, which are globally successful high-risk clonal lineages. For consistency throughout the manuscript, we use “commensal” to refer to ST10 and “pathogenic” for the international high-risk clonal lineages. However, it is important to acknowledge that ST10 has been previously detected in extraintestinal infections, indicating its potential to act as an opportunistic pathogen (6, 7). Most significantly, the homologous clustering detected seven genes from an operon encoding the L-sorbose-specific PTS, which were enriched in the pathogenic STs and largely absent in ST10 (Fig. 1A; Table S8). The operon was previously explored as an evolutionary marker, i.e., a genetic feature indicating phylogenetic relationships among E. coli pathotypes and Shigella (14). The sor-operon genes (sorC, sorD, sorB, sorA, sorM, and sorE) were present in more than 95% of all pathogenic representatives and in less than 0.1% (n = 6) of the commensal genomes (Fig. 1B). Four distinct sorF clusters were identified when clustered at 95% sequence identity and length, revealing only minor sequence variations. As a result, they were pooled using a 75% sequence identity and length threshold. Further analysis of 44,208 genomes (Supplementary methods V) across all phylogroups revealed that this operon is present in the majority (>86%) of genomes, except for phylogroups A (9.56%) and B1 (12.12%) (Table S9). Investigation of the surrounding regions revealed that, while genes are generally conserved across and within STs, including ST10, there are exceptions and ST-specific insertions (e.g., ST648) (Fig. S1), warranting further investigation.

Fig 1.

The sor-operon analysis in E. coli shows high prevalence in pathogenic STs (>95%) but absence in commensal ST10 (<0.1%). Pathogens with the seven sor genes form red colonies on L-sorbose media and show superior growth upon L-sorbose utilization.

Differences in the occurrence of sor-operon reflect in unique L-sorbose utilization in pathogenic STs. (A) Venn diagram schematically illustrating the genomic intersections, including the genomic markers and structure of sor-operon. Each circle represents the genome of an exemplary strain. Overlapping regions indicate shared genes. The intersection among all pathogenic strains is referred to as marker section, including the sor-operon. (B) Heatmap of percentage occurrence of the indicated genes of the sor-operon. The occurrence in the commensal ST10 and the pathogenic STs (ST38, ST131, ST405, ST410, and ST648) is color coded from blue (0%) to red (100%) as a heatmap. *Clustering of genes was performed with 95% identity and coverage, except for sorF, where clustering based on 75% identity is shown because otherwise, genes will occur in separate clusters. (C) Single colonies of representative pathogenic and commensal E. coli STs on Nutrient Broth II agar containing 1% L-sorbose and neutral red. (D) Growth (OD600 nm) of pathogenic and commensal E. coli STs and the indicated mutants was recorded in the presence and absence of L-sorbose: The growth of each strain in NB and NBsor was compared by calculating the ratio of AUC of three independent replicates. These ratios and their respective mean values are presented for the individual strains. Additionally, the respective ST is indicated below.

Genomic analyses suggested potential differences in L-sorbose utilization between ST10 and the other STs. To investigate this, we tested L-sorbose utilization in three strains from each ST on NB L-sorbose agar supplemented with neutral red dye (Fig. S2A; Fig. 1C). Utilization of other sugars (D-sorbitol, D-glucose, D-fructose, and D-mannose) did not reveal any pathogen-specific patterns (Fig. S2B). To examine the impact on growth behavior, we selected two commensal ST10 strains (PBIO365 and PBIO2) and two pathogenic representatives from ST131 (PBIO729) and ST648 (PBIO730). Additionally, a sor-operon knock-in mutant (PBIO365:sor) and two different mutant strains of PBIO729 lacking either sorAM or sorE were tested, all exhibiting similar growth kinetics in LB medium and minimal medium supplemented with 1% dglucose (Fig. S3A and B). In minimal medium containing 1% L-sorbose, only PBIO729, PBIO729∆sorAM, and PBIO730 grew (Fig. S3C). PBIO365:sor grew in minimal medium containing 1% L-sorbose; however, evident growth was observed after 48 h. In complex medium supplemented with 1% L-sorbose (NBsor), pathogenic strains and PBIO729∆sorAM, but not PBIO729∆sorE, displayed growth with cellular densities increasing concurrently with medium acidification during the transient growth phase, indicating the initiation of L-sorbose utilization (Fig. S3D and E). This is reflected in an increased area under the curve (AUC) calculated from the growth kinetics in NBsor compared to NB (Fig. 1D). Enhanced growth was also observed when comparing PBIO365 with the integrated sor-operon to the wild-type strain. However, PBIO365 exhibited slightly enhanced growth in NB compared to NBsor, resulting in a normalized AUC below 1, likely because the inability to utilize L-sorbose disturbs growth in NBsor (Fig. 1D). This may result from uptake of L-sorbose via low-specificity PTS transport systems, leading to intracellular accumulation of phosphorylated intermediates that impose osmotic stress and disrupt phosphate homeostasis (40, 41). Notably, PBIO729∆sorE exhibited the strongest impairment in growth compared to the wild-type and in relation to growth in NB. We conclude that while the mutant deficient for the transport components, PBIO729∆sorAM, is still capable of L-sorbose uptake via other PTSs, the PBIO729∆sorE mutant, despite having an intact transport, is unable to reduce L-sorbose-1-phosphate to D-sorbitol-6-phosphate.

L-sorbose utilization is triggered during early stationary phase among other pathways

Expression was examined for the regulator sorC, which is an activator in the presence of L-sorbose and a repressor in the absence, and for sorbose-1-phosphate reductase sorE (42). This revealed sor-operon induction in the presence of L-sorbose in pathogenic strains during the transition into the stationary growth phase (sorC) and during stationary phase (sorE), respectively (Fig. S4B). Comparing early stationary growth phase samples in the presence and absence of L-sorbose, we confirmed induction of the sor-operon by both transcriptome and proteome analyses in pathogenic strains (PBIO729 and PBIO730), while no induction was detected in the commensal strain PBIO365, as expected due to gene absence (Fig. S4C). Ranking the genes revealed that the seven sor-genes exhibited the highest L2FC between the two conditions among all genes detected as differentially expressed. Expression of a shikimate dehydrogenase-like gene (sdhL), located immediately upstream of the sor-operon, was observed only in the ST648 strain, and sdhL is absent from ST131 and the other investigated STs (Fig. S1).

To gain deeper insights into the functional benefits of expressing the L-sorbose PTS and fermenting L-sorbose as an additional carbon source, the overall expression patterns were analyzed to elucidate the associated regulatory pathways. Proteomic analysis confirmed the production of proteins of the sor-operon; however, reconstructing a meaningful network of regulatory pathways was not feasible due to the limited number of proteins differently abundant in ST131 and ST648 but not in ST10. Proteomics revealed significant downregulation in metabolic pathways, including propionate metabolism, fatty acid oxidation, and chemotaxis, likely attributable to a shift in cellular resources toward the L-sorbose PTS components ( Table S10, Fig. S5). Transcriptomic analysis identified regulatory pathways shared among all three strains, along with strain-specific induction of gene expression. Comparison of sorbose and control conditions revealed a subset of 127 genes that were upregulated and 30 genes that were downregulated in both pathogenic strains (PBIO729, PBIO730), whereas these were not detected as up- or downregulated genes in the commensal strain (PBIO365) (Fig. S6A). Most up- or downregulated features in the presence of L-sorbose were detected in PBIO729, while the lowest number of genes and proteins was detected in PBIO365 (Fig. S6B). Subsequent analysis indicated that only a small proportion of strain-specific regulatory effects were directly attributable to differences in gene presence. Instead, it appeared that many of these differences occurred due to variations in the regulatory pathways upon l-sorbose utilization (Fig. S6C). Although both PBIO729 (ST131) and PBIO730 (ST648) are pathogenic ExPEC lineages, differences in accessory genome content and regulatory network architecture account for different transcriptional reprogramming.

Pathogen-specific regulatory networks activated by L-sorbose fermentation

The differentially expressed genes were further analyzed to identify their associated pathways and to better understand the potential impact of these changes. COG analysis revealed an increase and diversification of regulated pathways in pathogenic strains (Fig. S7). We identified two major pathways related to tryptophan and purine metabolism that were upregulated in both pathogenic, but not the commensal strain, in the presence of L-sorbose (Fig. 2). Six genes encoding proteins involved in the tryptophan biosynthesis are organized in an operon-like structure (trpA/B/C/G/D/E) in all three strains but are only differentially expressed in the pathogenic representatives (Fig. 2A). Moreover, we identified upregulated genes associated with the shikimate pathway. While aroL was upregulated in both pathogenic strains, the sdhL gene was differentially expressed in only PBIO730, in accordance with its unique occurrence in ST648, as described above. Purine-associated genes, such as purA, were similarly regulated in all three strains. However, genes particularly important for initial steps in this pathway were upregulated in the pathogenic strains, including purB/C/D/E/F/H/K/L/M/N/P/R/T (Fig. 2B). The pur genes were non-contiguously distributed across multiple chromosomal loci in all three strains, rather than being organized in a single operon, which influences their transcriptional regulation and purine metabolism (43). Diverse upregulation of genes concerning capsule polysaccharide (CPS) production was detected with L-sorbose utilization in pathogenic STs, such as the kps-operon (Fig. 2D). In both STs, genes encoded in region 1 and region 3 of the kps-operon were upregulated during L-sorbose fermentation (44). Strain-specific responses most prominently included the upregulation of a total of 32 flagellar genes in PBIO729 (Fig. 2C).

Fig 2.

Heatmaps of gene regulation in metabolic and virulence pathways. Purine and tryptophan metabolism shows upregulation in both pathogens, flagellar genes exhibit upregulation in PBIO729, while kps capsule genes show variable patterns across strains

Heatmaps of regulated genes mapping to different virulence- and/or fitness-associated pathways. Based on research using the KEGG database, differentially expressed genes were mapped to their respective pathways, including (A) tryptophan metabolism (KEGG-map00380), (B) purine metabolism (KEGG-map00230), (C) flagellar assembly (KEGG-T30583_02040), and (D) kps-operon. Heatmaps indicating the differential gene expression of the respective genes (upregulated (L2FC ≥ 1.0; P adjusted ≤ 0.05) [red], not differentially expressed [white], downregulated (L2FC ≤ −1.0; P adjusted ≤ 0.05) [blue]) for the analyzed strains (PBIO365, PBIO729, PBIO730).

Alignment between transcriptomic changes and observed phenotypic variations

To complement the transcriptomic analysis, selected phenotypic assays using knock-out and knock-in mutants were performed based on the detected upregulated genes, including tryptophan and purine metabolism, flagellar assembly, and capsule polysaccharide synthesis. A Biolog plate-based screening supported the observation of a general enhancement of previously described metabolic pathways (Table S11). Amino acid and peptide metabolism, including L-serine, L-isoleucine, L-leucine, L-lysine, L-methionine, glycyl-L-aspartic acid, and glycyl-L-glutamic acid, showed alterations when comparing PBIO729 under sor versus control conditions and were not detected in PBIO365 or PBIO729∆sorE. Enhanced purine metabolism, specifically guanosine, was detected in PBIO729 via Biolog screening, consistent with transcriptomic data showing upregulation of genes for purine metabolism.

To evaluate whether L-sorbose utilization provides an additional competitive fitness advantage, the competitive index (CI) was assessed in mixed bacterial cultures (Fig. 3A). The wild-type pathogenic strain PBIO729 demonstrated a clear competitive advantage over the commensal strain PBIO365. Note that PBIO365 showed reduced growth in NBsor single culture. However, the advantage of PBIO729 was significantly diminished upon deletion of the sorE gene (P < 0.001). A similar depletion of competitive fitness was observed for PBIO729 (P < 0.001) and PBIO729∆sorE (P = 0.01) when competing against PBIO365:sor compared to the wild-type PBIO365. The observed competitive advantage of PBIO365:sor over PBIO729 demonstrates a relative fitness gain conferred by enabling L-sorbose utilization in an otherwise sorbose-negative ST10 background. In contrast, PBIO365 showed a reduced CI when competing against PBIO365:sor. These results highlight that the observed competitive advantage is dependent on the ability to utilize L-sorbose. We additionally observed that PBIO729 shows no competitive advantage over PBIO365 in the absence of L-sorbose (Fig. S8), suggesting that PBIO365 can rely on L-sorbose-independent mechanisms to compete effectively.

Fig 3.

PBIO729 wild-type strains show enhanced competition, motility and serum survival compared to PBIO720 sorE-mutant in presence of sorbose. PBIO365 shows the highest survival in the larvae model, which is reduced in sor positive strains and sorE mutant.

Phenotypic analysis revealed L-sorbose-dependent changes in fitness and/or virulence. The indicated strains were tested in biological replicates (n ≥ 3). (A) Competitive benefits in nutrient broth medium supplemented with 1% L-sorbose were calculated from the CFU/mL of the individual strains comparing the ratio of test over reference strain after 24 h of competition with the inoculation ratio. Relevant changes were evaluated regarding their significance (ordinary one-way ANOVA with Fisher’s LSD test; adj. P < 0.033 [*]; <0.002 [**]; <0.0002 [***]; <0.0001 [****]). (B) Motility of strains was tested using NB agar plates (0.3% agar) either without or with 1% L-sorbose, and the radius of swarming is plotted. Relevant changes were evaluated regarding their significance (two-way ANOVA with Tukey correction for multiple comparisons) (adj. P < 0.033 [*]; <0.002 [**]; <0.001 [***]). (C) Survival was assessed in 50% human serum diluted in PBS with 1% l-sorbose (D) or 1% D-glucose. The L2FC was calculated from the inoculum and the CFU/mL after 24 h of incubation. Relevant changes were evaluated regarding their significance (one-way ANOVA, unpaired t-test with Welch’s correction; adj. P < 0.033 [*]; <0.002 [**]; <0.001 [***]). (E) Galleria mellonella larvae were infected with the indicated strains using 10 technical and three biological replicates each and PBS as injection control. Survival was assessed every 12–24 h for 72 h and the mean survival rate is plotted.

Consistent with the transcriptomic data, we detected high motility for PBIO729 (Fig. 3B). A slight but not significant increase in motility in the presence of L-sorbose was observed for PBIO729 and PBIO729∆sorAM, as well as for PBIO365 and PBIO365:sor. The PBIO729∆sorE mutant showed a significant reduction in motility in the presence of L-sorbose compared to the wild-type (P = 0.0057) and sorAM mutant (P = 0.0090). Interestingly, PBIO365 demonstrated a high level of motility, whereas PBIO365:sor exhibited a decrease in motility with considerable variation. Notably, we would not expect an increase in motility for PBIO365 in the presence of L-sorbose, as it cannot utilize L-sorbose, but the decreased motility observed in PBIO365:sor could reflect a complex interplay between L-sorbose utilization and motility regulation. This finding further emphasizes the importance of considering strain variability, as even small genetic differences can lead to considerable phenotypic divergence.

CPS plays a vital role as a virulence factor by, for example, protecting bacteria from components of the human immune system, such as those found in serum (45). Therefore, serum survival was assessed with L-sorbose and with D-glucose as a control. The commensal strain PBIO365 failed to survive under any condition (Fig. 3C and D). PBIO365: sor exhibited survival only in the presence of L-sorbose. The PBIO729ΔsorE mutant displayed significantly reduced serum survival in the presence of L-sorbose compared to the wild-type (P = 0.05) (Fig. 3C) and a notable reduction across all tested conditions compared to the wild-type strain and the PBIO729ΔsorAM mutant (Fig. 3D). The Galleria mellonella in vivo infection model was employed to evaluate the overall impact of the strains on fitness and potential virulence (Fig. 3E). Infection with PBIO365 resulted in moderate virulence, with survival rates of 80% during the early infection phase (≤48 h). By contrast, larvae infected with PBIO365:sor exhibited significantly lower survival rates, comparable to those infected with the highly virulent PBIO729, showing survival rates of only 20–30% after 48–72 h. Interestingly, the deletion of the sorE gene in PBIO729 did not affect larval mortality rates. It is important to note that L-sorbose could not be administered simultaneously during the infection experiments, and its occurrence needs to be investigated in future studies.

DISCUSSION

Our study investigating 22,267 E. coli genomes revealed the sor-operon as a genomic marker enriched in pathogenic E. coli STs. The enrichment of the sor-operon in pathogenic strains across different phylogroups, in contrast to phylogroups A and B1, suggests a benefit for sor-carrying strains and its potential as a genetic marker for distinguishing between pathogenic and commensal E. coli isolates, though it does not serve as an absolute indicator of virulence. Our study offers a significant advancement over previous research with a comprehensive multi-omics approach including globally disseminated and clinically relevant MDR isolates from ST131 and ST648 (14). This research demonstrated the operon’s induction in the presence of L-sorbose. We highlight a link to increased energy production, consistent with the known functions of ribosomal proteins and ATPases in cellular physiology, and its importance for enhancing growth, although specific contributions in this context were not directly assessed. By not only enhancing growth but also driving the expression of virulence- and fitness-associated traits, this leads to competitive advantages for pathogenic E. coli. L-sorbose is a sugar present in fruits and detectable within different human and animal body fluids, e.g., feces and urine, but detailed studies on the concentration are lacking (46). Research in rats revealed fermentation of L-sorbose within the intestine after adaptation of the intestinal microflora (46). Furthermore, it is known that L-sorbose can be synthesized from D-sorbitol by Acetobacter sp (47), which resides in the gut of Drosophila sp., a commonly used model organism for human gut microbiome studies (48, 49). Importantly, sorbitol is a sugar alcohol commonly used as a sweetener in food and pharmaceutical products (50). The enhanced growth during the transient phase in complex media supports the idea that L-sorbose utilization promotes growth in nutrient-limited environments. Further, the findings suggest that strains capable of utilizing L-sorbose outcompete those lacking the sor-operon in vitro. In conclusion, L-sorbose utilization provides the potential to outcompete others in specific niches (51). For example, upon colonizing the intestine, E. coli must secure a nutrient niche, outcompeting other bacteria by dominating the use of at least one limiting nutrient (52). The genetic diversity of MDR E. coli in carbohydrate metabolism enhances their competitive edge over commensal E. coli in the intestinal tract of mice (53). Literature also highlights the importance of specific metabolic traits, such as amino acid and energy metabolism, for the fitness of pathogenic E. coli in mouse models (54, 55).

Upregulation of purine and tryptophan pathways indicates that pathogenic bacteria channel energy from L-sorbose fermentation into metabolic pathways associated with virulence. Tryptophan biosynthesis intermediates, such as indole, are known to modulate immune responses and affect gut barrier function (56). The induction of L-sorbose (sorC) and purine metabolism (purA) has already been linked upon adhesion to human brain endothelial cells, with mutants showing reduced invasion (57). Moreover, UPEC relies on de novo purine synthesis to colonize purine-limited environments, such as the bladder, while mutations in purine biosynthesis (∆purF) impair colonization (58). L-sorbose is detectable in urine, suggesting that L-sorbose fermentation may enhance purine metabolism, boosting both bacteria’s fitness and virulence. Furthermore, PTSs can function in signal transduction in mediating the expression of virulence features (40). Comparable with the d fructose PTS, which mediates the expression of type 1 fimbriae in E. coli, supporting fitness and invasion into eukaryotic cells (59), we observed flagellar upregulation in the ST131 strain in the presence of L-sorbose. In addition, induction of capsule production alongside L-sorbose fermentation highlights the link between metabolic activities and virulence factor expression. Capsule synthesis and export serve as major virulence factors in many bacterial species, protecting the bacterium from the host immune response (60) and supporting colonization of murine kidneys and bladders (61). Although not statistically significant, the survival of PBIO365:sor and the killing of PBIO729∆sorE were specific to L-sorbose rather than glucose conditions, suggesting a direct link between L-sorbose utilization and serum survival.

Results from the G. mellonella infection model suggest that the sor-operon in the knock-in mutant of the commensal strain enhances virulence compared to the wild-type. In accordance with that, the involvement of L-sorbose utilization in pathogenesis has also been demonstrated for Legionella pneumophila (62). However, the absence of a significant effect from sorE deletion on mortality rates indicates that factors beyond L-sorbose metabolism may contribute to virulence. There is also a knowledge gap regarding the concentration and metabolism of L-sorbose in vivo, and whether the expression of the sor-operon is triggered. Future investigations in mouse models are planned to explore the competitive and fitness advantages conferred by L-sorbose utilization in vivo, especially in the presence of L-sorbose.

Overall, the regulatory profiles observed during L-sorbose fermentation underscore the critical relationship between metabolism and virulence. Previous studies have already shown that multidrug resistance alone does not explain the success of certain high-risk clonal lineages (63, 64). The link between enhanced metabolic pathways leading to increased virulence is well demonstrated, such as the association between aromatic amino acid biosynthesis and virulence in various bacterial species (65). Note that, while L-sorbose metabolism contributes to fitness and competition, its impact is not straightforward or universally applicable across all strains. Future studies should aim to explore how L-sorbose metabolism in other strains interacts with other metabolic pathways, virulence factors, and environmental conditions. Despite the observed complexities, the L-sorbose metabolism confers a competitive advantage beyond lL-sorbose utilization.

Limitations

The study’s comparative genomic analysis was limited to a single commensal ST (ST10) due to the scarcity of publicly available genomes and metadata from other “commensal” E. coli lineages. However, non-pathogenic strains may acquire virulence traits over time. Another potential limitation is the inclusion of genomes from clonal populations in the homologous clustering. Due to limited metadata, filtering relied solely on strain names, but it did not significantly affect the overall results. Functional analyses were conducted using a limited number of strains, knock-out/knock-in mutants, and experiments. This may restrict the generalizability of the findings, particularly in in vivo experiments. Moreover, it is indispensable to perform studies to gain information about specific L-sorbose concentrations within different body sites to conclude about potential induction and related advantages of the sor-operon expression.

ACKNOWLEDGMENTS

The authors thank Dr. Stephan Michalik for creating the R scripts for the statistical analysis of proteome data and Sara-Lucia Skwara for her excellent technical support.

This work was supported by the German Federal Ministry of Research Technology and Space (BMFTR) (previously called Federal Ministry of Education and Research (BMBF)) within the “DISPATch_MRGN-Disarming pathogens as a different strategy to fight antimicrobial-resistant Gram-negatives” project (grant 01KI2410).

Conceptualization, K.Schaufler; methodology, E.E., K.Surmann, L.-S.S., and M.S.; software, K.Surmann, L.-S.S., and M.S.; validation, L.-S.S.; formal analysis, E.E., F.M., K.Surmann, L.S., L.-S.S., M.H., M.S., and T.E.; investigation, K.Surmann., L.-S.S., M.S., and C.H.; resources, K.Schaufler. and U.V.; data curation, K.Surmann, L.-S.S., and M.S.; writing-original draft preparation, L.-S.S.; writing-review and editing, C.H., E.E., K.Schaufler., K. Surmann, L.S., L.-S.S., M.H., M.S., and U.V.; visualization, L.-S.S.; supervision, K.Schaufler, U.V.; project administration, K.Schaufler.; funding acquisition, K.Schaufler. All authors have read and agreed to the published version of the manuscript.

Footnotes

Writing - original draft

Contributor Information

Katharina Schaufler, Email: katharina.schaufler@helmholtz-hioh.de.

Atmika Paudel, Fluxus Inc., Sunnyvale, California, USA.

DATA AVAILABILITY

The experimental and computational data supporting the findings of this research are available in this article and its supplementary information files. Genomic and transcriptomic data of this study have been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB98906. Proteome data have been stored at the ProteomeXchange Consortium via the PRIDE partner repository (34) with the dataset identifier https://proteomecentral.proteomexchange.org/cgi/GetDataset?ID=PXD056542PXD056542.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/msystems.01274-25.

Figure S1. msystems.01274-25-s0001.png.

Visualization of sor genomic region across STs.

DOI: 10.1128/msystems.01274-25.SuF1
Supplement. msystems.01274-25-s0002.docx.

Supplemental methods, tables, and figures

DOI: 10.1128/msystems.01274-25.SuF2
Table S12. msystems.01274-25-s0003.xlsx.

List of genomes included in analysis of sor genomic region.

DOI: 10.1128/msystems.01274-25.SuF3

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. Geurtsen J, de Been M, Weerdenburg E, Zomer A, McNally A, Poolman J. 2022. Genomics and pathotypes of the many faces of Escherichia coli. FEMS Microbiol Rev 46:1–30. doi: 10.1093/femsre/fuac031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Kocsis B, Gulyás D, Szabó D. 2022. Emergence and dissemination of extraintestinal pathogenic high-risk international clones of Escherichia coli Life (Basel) 12:2077. doi: 10.3390/life12122077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Kim KS. 2016. Human meningitis-associated Escherichia coli. EcoSal Plus 7:1–25. doi: 10.1128/ecosalplus.ESP-0015-2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Laupland KB, Church DL. 2014. Population-based epidemiology and microbiology of community-onset bloodstream infections. Clin Microbiol Rev 27:647–664. doi: 10.1128/CMR.00002-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Denamur E, Clermont O, Bonacorsi S, Gordon D. 2021. The population genetics of pathogenic Escherichia coli. Nat Rev Microbiol 19:37–54. doi: 10.1038/s41579-020-0416-x [DOI] [PubMed] [Google Scholar]
  • 6. Manges AR, Geum HM, Guo A, Edens TJ, Fibke CD, Pitout JDD. 2019. Global extraintestinal pathogenic Escherichia coli (ExPEC) lineages. Clin Microbiol Rev 32:1–25. doi: 10.1128/CMR.00135-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Riley LW. 2014. Pandemic lineages of extraintestinal pathogenic Escherichia coli. Clin Microbiol Infect 20:380–390. doi: 10.1111/1469-0691.12646 [DOI] [PubMed] [Google Scholar]
  • 8. Escudeiro P, Pothier J, Dionisio F, Nogueira T. 2019. Antibiotic resistance gene diversity and virulence gene. mSphere 4:1–13. doi: 10.1128/mSphere.00135-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Murray CJL, Ikuta KS, Sharara F, Swetschinski L, Robles Aguilar G, Gray A, Han C, Bisignano C, Rao P, Wool E, et al. 2022. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. The Lancet 399:629–655. doi: 10.1016/S0140-6736(21)02724-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Holmes AH, Moore LSP, Sundsfjord A, Steinbakk M, Regmi S, Karkey A, Guerin PJ, Piddock LJV. 2016. Understanding the mechanisms and drivers of antimicrobial resistance. Lancet 387:176–187. doi: 10.1016/S0140-6736(15)00473-0 [DOI] [PubMed] [Google Scholar]
  • 11. Cosgrove SE. 2006. The relationship between antimicrobial resistance and patient outcomes: mortality, length of hospital stay, and health care costs. Clin Infect Dis 42 Suppl 2:S82–9. doi: 10.1086/499406 [DOI] [PubMed] [Google Scholar]
  • 12. McNally A, Oren Y, Kelly D, Pascoe B, Dunn S, Sreecharan T, Vehkala M, Välimäki N, Prentice MB, Ashour A, Avram O, Pupko T, Dobrindt U, Literak I, Guenther S, Schaufler K, Wieler LH, Zhiyong Z, Sheppard SK, McInerney JO, Corander J. 2016. Combined analysis of variation in core, accessory and regulatory genome regions provides a super-resolution view into the evolution of bacterial populations. PLoS Genet 12:e1006280. doi: 10.1371/journal.pgen.1006280 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Touchon M, Hoede C, Tenaillon O, Barbe V, Baeriswyl S, Bidet P, Bingen E, Bonacorsi S, Bouchier C, Bouvet O, et al. 2009. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet 5:e1000344. doi: 10.1371/journal.pgen.1000344 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Lehmacher A, Bockemühl J. 2007. L-sorbose utilization by virulent Escherichia coli and Shigella: different metabolic adaptation of pathotypes. Int J Med Microbiol 297:245–254. doi: 10.1016/j.ijmm.2007.01.007 [DOI] [PubMed] [Google Scholar]
  • 15. Ewers C, Bethe A, Semmler T, Guenther S, Wieler LH. 2012. Extended-spectrum β-lactamase-producing and AmpC-producing Escherichia coli from livestock and companion animals, and their putative impact on public health: a global perspective. Clin Microbiol Infect 18:646–655. doi: 10.1111/j.1469-0691.2012.03850.x [DOI] [PubMed] [Google Scholar]
  • 16. Schaufler Katharina, Semmler T, Pickard DJ, de Toro M, de la Cruz F, Wieler LH, Ewers C, Guenther S. 2016. Carriage of extended-spectrum beta-lactamase-plasmids does not reduce fitness but enhances virulence in some strains of pandemic E. coli lineages. Front Microbiol 7:336. doi: 10.3389/fmicb.2016.00336 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Mathers AJ, Peirano G, Pitout JDD. 2015. Escherichia coli ST131: the quintessential example of an international multiresistant high-risk clone. Adv Appl Microbiol 90:109–154. doi: 10.1016/bs.aambs.2014.09.002 [DOI] [PubMed] [Google Scholar]
  • 18. Schaufler K, Semmler T, Wieler LH, Trott DJ, Pitout J, Peirano G, Bonnedahl J, Dolejska M, Literak I, Fuchs S, Ahmed N, Grobbel M, Torres C, McNally A, Pickard D, Ewers C, Croucher NJ, Corander J, Guenther S. 2019. Genomic and functional analysis of emerging virulent and multidrug-resistant Escherichia coli lineage sequence type 648. Antimicrob Agents Chemother 63:e00243-19. doi: 10.1128/AAC.00243-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Ewers C, Bethe A, Stamm I, Grobbel M, Kopp PA, Guerra B, Stubbe M, Doi Y, Zong Z, Kola A, Schaufler K, Semmler T, Fruth A, Wieler LH, Guenther S. 2014. CTX-M-15-D-ST648 Escherichia coli from companion animals and horses: another pandemic clone combining multiresistance and extraintestinal virulence? J Antimicrob Chemother 69:1224–1230. doi: 10.1093/jac/dkt516 [DOI] [PubMed] [Google Scholar]
  • 20. Zhou Z, Alikhan N, Mohamed K, Fan Y, Group S, Achtman M. 2020. The EnteroBase user ’ s guide, with case studies on Salmonella transmissions, Yersinia pestis phylogeny, and Escherichia core genomic diversity. Genome Res 30:138–152. doi: 10.1101/gr.251678.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. 2021. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol 38:4647–4654. doi: 10.1093/molbev/msab199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
  • 23. Fu L, Niu B, Zhu Z, Wu S, Li W. 2012. Bioinformatics applications note sequence analysis CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinforma Apllications Note 28:3150–3152. doi: 10.1093/bioinformatics/bts565 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Sayers EW, Bolton EE, Brister JR, Canese K, Chan J, Comeau DC, Connor R, Funk K, Kelly C, Kim S, Madej T, Marchler-Bauer A, Lanczycki C, Lathrop S, Lu Z, Thibaud-Nissen F, Murphy T, Phan L, Skripchenko Y, Tse T, Wang J, Williams R, Trawick BW, Pruitt KD, Sherry ST. 2022. Database resources of the national center for biotechnology information. Nucleic Acids Res 50:D20–D26. doi: 10.1093/nar/gkab1112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Consortium TU . 2023. UniProt: the universal protein knowledgebase in 2023. Nucleic Acids Res 51:523–531. doi: 10.1093/nar/gkac1052 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Nicolas P, Mäder U, Dervyn E, Rochat T, Leduc A, Pigeonneau N, Bidnenko E, Marchadier E, Hoebeke M, Aymerich S, et al. 2012. Condition-dependent transcriptome reveals high-level regulatory architecture in Bacillus subtilis. Science 335:1103–1106. doi: 10.1126/science.1206848 [DOI] [PubMed] [Google Scholar]
  • 27. Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359. doi: 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Liao Y, Smyth GK, Shi W. 2014. FeatureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30:923–930. doi: 10.1093/bioinformatics/btt656 [DOI] [PubMed] [Google Scholar]
  • 29. Swiatek L-S, Surmann K, Eger E, Müller JU, Salazar MG, Heiden SE, Werner G, Hübner N-O, Bohnert JA, Becker K, Heiden SE, Völker U, Schwabe M, Schaufler K. 2025. Multi-omics investigations uncover unique pathogenic markers in clinical Klebsiella pneumoniae that could be leveraged as novel antimicrobial targets. Front Microbiol 16. doi: 10.3389/fmicb.2025.1657680 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Reder A, Hentschker C, Steil L, Salazar MG, Hammer E, Dhople VM, Sura T, Lissner U, Wolfgramm H, Dittmar D, Harms M, Surmann K, Volker U, Michalik S. 2023. MassSpecPreppy - an end-to-end solution for automated protein concentration determination and flexible sample digestion for proteomics applications. Preprints. doi: 10.22541/au.169148377.71225055/v1 [DOI] [PubMed]
  • 31. Blankenburg S, Hentschker C, Nagel A, Hildebrandt P, Michalik S, Dittmar D, Surmann K, Völker U. 2019. Improving proteome coverage for small sample amounts: an advanced method for proteomics approaches with low bacterial cell numbers. Proteomics 19:e1900192. doi: 10.1002/pmic.201900192 [DOI] [PubMed] [Google Scholar]
  • 32. Suomi T, Elo LL. 2017. Enhanced differential expression statistics for data-independent acquisition proteomics. Sci Rep 7:5869. doi: 10.1038/s41598-017-05949-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Cox J, Hein MY, Luber CA, Paron I, Nagaraj N, Mann M. 2014. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol Cell Proteomics 13:2513–2526. doi: 10.1074/mcp.M113.031591 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Perez-Riverol Y, Csordas A, Bai J, Bernal-Llinares M, Hewapathirana S, Kundu DJ, Inuganti A, Griss J, Mayer G, Eisenacher M, Pérez E, Uszkoreit J, Pfeuffer J, Sachsenberg T, Yilmaz S, Tiwary S, Cox J, Audain E, Walzer M, Jarnuczak AF, Ternent T, Brazma A, Vizcaíno JA. 2019. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res 47:D442–D450. doi: 10.1093/nar/gky1106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, Mende DR, Letunic I, Rattei T, Jensen LJ, von Mering C, Bork P. 2019. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 47:D309–D314. doi: 10.1093/nar/gky1085 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. 2021. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol Biol Evol 38:5825–5829. doi: 10.1093/molbev/msab293 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Kanehisa M, Furumichi M, Sato Y, Kawashima M, Ishiguro-Watanabe M. 2023. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res 51:D587–D592. doi: 10.1093/nar/gkac963 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Eger E, Schwabe M, Schulig L, Hübner N-O, Bohnert JA, Bornscheuer UT, Heiden SE, Müller JU, Adnan F, Becker K, Correa-Martinez CL, Guenther S, Idelevich EA, Baecker D, Schaufler K. 2022. Extensively drug-resistant Klebsiella pneumoniae counteracts fitness and virulence costs that accompanied ceftazidime-avibactam resistance acquisition. Microbiol Spectr 10:e0014822. doi: 10.1128/spectrum.00148-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Insua JL, Llobet E, Moranta D, Pérez-Gutiérrez C, Tomás A, Garmendia J, Bengoechea JA. 2013. Modeling Klebsiella pneumoniae pathogenesis by infection of the wax moth Galleria mellonella. Infect Immun 81:3552–3565. doi: 10.1128/IAI.00391-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Erni B. 2013. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS): an interface between energy and signal transduction. J IRAN CHEM SOC 10:593–630. doi: 10.1007/s13738-012-0185-1 [DOI] [Google Scholar]
  • 41. Bobrovskyy M, Vanderpool CK. 2014. The small RNA SgrS: roles in metabolism and pathogenesis of enteric bacteria. Front Cell Infect Microbiol 4:61. doi: 10.3389/fcimb.2014.00061 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Wöhrl BM, Wehmeier UF, Lengeler JW. 1990. Positive and negative regulation of expression of the l-sorbose (sor) operon by SorC in Klebsiella pneumoniae. Molec Gen Genet 224:193–200. doi: 10.1007/BF00271552 [DOI] [PubMed] [Google Scholar]
  • 43. Bachmann BJ. 1983. Linkage map of Escherichia coli K-12, edition 7. Microbiol Rev 47:180–230. doi: 10.1128/mr.47.2.180-230.1983 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Schneider G, Dobrindt U, Brüggemann H, Nagy G, Janke B, Blum-Oehler G, Buchrieser C, Gottschalk G, Emödy L, Hacker J. 2004. The pathogenicity island-associated K15 capsule determinant exhibits a novel genetic structure and correlates with virulence in uropathogenic Escherichia coli strain 536. Infect Immun 72:5993–6001. doi: 10.1128/IAI.72.10.5993-6001.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Arredondo-Alonso S, Blundell-Hunter G, Fu Z, Gladstone RA, Fillol-Salom A, Loraine J, Cloutman-Green E, Johnsen PJ, Samuelsen Ø, Pöntinen AK, Cléon F, Chavez-Bueno S, De la Cruz MA, Ares MA, Vongsouvath M, Chmielarczyk A, Horner C, Klein N, McNally A, Reis JN, Penadés JR, Thomson NR, Corander J, Taylor PW, McCarthy AJ. 2023. Evolutionary and functional history of the Escherichia coli K1 capsule. Nat Commun 14. doi: 10.1038/s41467-023-39052-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Würsch P, Welsch C, Arnaud MJ. 1979. Metabolism of L-sorbose in the rat and the effect of the intestinal microflora on its utilization both in the rat and in the human. Nutr Metab 23:145–155. doi: 10.1159/000176251 [DOI] [PubMed] [Google Scholar]
  • 47. Hanmoungjai W, Chukeatirote E, Yamada Y, Sahachaisaree V, Lumyong P, Takata G, Izumori K, Lumyong S. 2008. L-sorbose production by acidotolerant acetic acid bacteria isolated from Thailand sources. Chiang Mai J Sci 35:382–390. https://cmudc.library.cmu.ac.th/frontend/Info/item/dc:83060. [Google Scholar]
  • 48. Erkosar B, Storelli G, Defaye A, Leulier F. 2013. Host-intestinal microbiota mutualism: “learning on the fly”. Cell Host Microbe 13:8–14. doi: 10.1016/j.chom.2012.12.004 [DOI] [PubMed] [Google Scholar]
  • 49. Chouaia B, Gaiarsa S, Crotti E, Comandatore F, Degli Esposti M, Ricci I, Alma A, Favia G, Bandi C, Daffonchio D. 2014. Acetic acid bacteria genomes reveal functional traits for adaptation to life in insect guts. Genome Biol Evol 6:912–920. doi: 10.1093/gbe/evu062 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Grembecka M. 2015. Sugar alcohols—their role in the modern world of sweeteners: a review. Eur Food Res Technol 241:1–14. doi: 10.1007/s00217-015-2437-7 [DOI] [Google Scholar]
  • 51. Bryant WA, Krabben P, Baganz F, Zhou Y, Ward JM. 2009. The analysis of multiple genome comparisons in genus Escherichia and its application to the discovery of uncharacterised metabolic genes in uropathogenic Escherichia coli CFT073. Comp Funct Genomics 2009:782924. doi: 10.1155/2009/782924 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Conway T, Cohen PS. 2015. Commensal and pathogenic Escherichia coli metabolism in the gut. Metab Bact Pathog:343–362. doi: 10.1128/9781555818883.ch16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Connor CH, Zucoloto AZ, Munnoch JT, Yu IL, Corander J, Hoskisson PA, McDonald B, McNally A. 2023. Multidrug-resistant E. coli encoding high genetic diversity in carbohydrate metabolism genes displace commensal E. coli from the intestinal tract. PLoS Biol 21:e3002329. doi: 10.1371/journal.pbio.3002329 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Cummins EA, Snaith AE, McNally A, Hall RJ. 2021. The role of potentiating mutations in the evolution of pandemic Escherichia coli clones. Eur J Clin Microbiol Infect Dis. doi: 10.1007/s10096-021-04359-3 [DOI] [PubMed] [Google Scholar]
  • 55. Alteri CJ, Smith SN, Mobley HLT. 2009. Fitness of Escherichia coli during urinary tract infection requires gluconeogenesis and the TCA cycle. PLoS Pathog 5:e1000448. doi: 10.1371/journal.ppat.1000448 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Costantini C, Bellet MM, Renga G, Stincardini C, Borghi M, Pariano M, Cellini B, Keller N, Romani L, Zelante T. 2020. Tryptophan co-metabolism at the host-pathogen interface. Front Immunol 11:67. doi: 10.3389/fimmu.2020.00067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Hoffman JA, Badger JL, Zhang Y, Kim KS. 2001. Escherichia coli K1 purA and sorC are preferentially expressed upon association with human brain microvascular endothelial cells. Microb Pathog 31:69–79. doi: 10.1006/mpat.2001.0451 [DOI] [PubMed] [Google Scholar]
  • 58. Shaffer CL, Zhang EW, Dudley AG, Dixon B, Guckes KR, Breland EJ, Floyd KA, Casella DP, Algood HMS, Clayton DB, Hadjifrangiskou M. 2017. Purine biosynthesis metabolically constrains intracellular survival of uropathogenic Escherichia coli. Infect Immun 85:e00471-16. doi: 10.1128/IAI.00471-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Rouquet G, Porcheron G, Barra C, Répérant M, Chanteloup NK, Schouler C, Gilot P. 2009. A metabolic operon in extraintestinal pathogenic Escherichia coli promotes fitness under stressful conditions and invasion of eukaryotic cells. J Bacteriol 191:4427–4440. doi: 10.1128/JB.00103-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Buckles MEL, Wang X, Lane MC, Lockatell CV, Johnson DE, Rasko DA, Mobley HLT, Donnenberg MS. 2009. Role of the K2 capsule in Escherichia coli urinary tract infection and serum resistance. J Infect Dis 199:1689–1697. doi: 10.1086/598524 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Bahrani-Mougeot FK, Buckles EL, Lockatell CV, Hebel JR, Johnson DE, Tang CM, Donnenberg MS. 2002. Type 1 fimbriae and extracellular polysaccharides are preeminent uropathogenic Escherichia coli virulence determinants in the murine urinary tract. Mol Microbiol 45:1079–1093. doi: 10.1046/j.1365-2958.2002.03078.x [DOI] [PubMed] [Google Scholar]
  • 62. Gao LY, Harb OS, Kwaik YA. 1998. Identification of macrophage-specific infectivity loci (mil) of Legionella pneumophila that are not required for infectivity of protozoa. Infect Immun 66:883–892. doi: 10.1128/IAI.66.3.883-892.1998 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Kallonen T, Brodrick HJ, Harris SR, Corander J, Brown NM, Martin V, Peacock SJ, Parkhill J. 2017. Systematic longitudinal survey of invasive Escherichia coli in England demonstrates a stable population structure only transiently disturbed by the emergence of ST131. Genome Res 27:1437–1449. doi: 10.1101/gr.216606.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Gladstone RA, Mcnally A, Pöntinen AK, Tonkin-hill G, John A. 2023. Europe PMC Funders Group Emergence and dissemination of antimicrobial resistance in Escherichia coli causing bloodstream infections: a nationwide longitudinal microbial population genomic study in Norway between 2002-2017. Lancet Microbe:1–24. doi: 10.1016/S2666-5247(21)00031-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Subashchandrabose S, Smith SN, Spurbeck RR, Kole MM, Mobley HLT. 2013. Genome-wide detection of fitness genes in uropathogenic Escherichia coli during systemic infection. PLoS Pathog 9:e1003788. doi: 10.1371/journal.ppat.1003788 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1. msystems.01274-25-s0001.png.

Visualization of sor genomic region across STs.

DOI: 10.1128/msystems.01274-25.SuF1
Supplement. msystems.01274-25-s0002.docx.

Supplemental methods, tables, and figures

DOI: 10.1128/msystems.01274-25.SuF2
Table S12. msystems.01274-25-s0003.xlsx.

List of genomes included in analysis of sor genomic region.

DOI: 10.1128/msystems.01274-25.SuF3

Data Availability Statement

The experimental and computational data supporting the findings of this research are available in this article and its supplementary information files. Genomic and transcriptomic data of this study have been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB98906. Proteome data have been stored at the ProteomeXchange Consortium via the PRIDE partner repository (34) with the dataset identifier https://proteomecentral.proteomexchange.org/cgi/GetDataset?ID=PXD056542PXD056542.


Articles from mSystems are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES