Skip to main content
Gut Microbes logoLink to Gut Microbes
. 2024 Sep 18;16(1):2401944. doi: 10.1080/19490976.2024.2401944

Molecular serotyping of diarrheagenic Escherichia coli with a MeltArray assay reveals distinct correlation between serotype and pathotype

Chen Du a,b,*, Yiqun Liao a,*, Congcong Ding a, Jiayu Huang a, Shujuan Zhou a, Yiyan Xu a, Zhaohui Yang a, Xiaolu Shi b, Yinghui Li b, Min Jiang b, Le Zuo b, Minxu Li b, Shengzhe Bian c, Na Xiao d, Liqiang Li e, Ye Xu a,, Qinghua Hu b,, Qingge Li a,
PMCID: PMC11529414  PMID: 39292565

ABSTRACT

Diarrheagenic Escherichia coli serotypes are associated with various clinical syndromes, yet the precise correlation between serotype and pathotype remains unclear. A major barrier to such studies is the reliance on antisera-based serotyping, which is culture-dependent, low-throughput, and cost-ineffective. We have established a highly multiplex PCR-based serotyping assay, termed the MeltArray E. coli serotyping (EST) assay, capable of identifying 163 O-antigen-encoding genes and 53 H-antigen-encoding genes of E. coli. The assay successfully identified serotypes directly from both simulated and real fecal samples, as demonstrated through spike-in validation experiments and a retrospective study. In a multi-province study involving 637 E. coli strains, it revealed that the five major diarrheagenic pathotypes have distinct serotype compositions. Notably, it differentiated 257 Shigella isolates into four major Shigella species, distinguishing them from enteroinvasive E. coli based on their distinct serotype profiles. The assay’s universality was further corroborated by in silico analysis of whole-genome sequences from the EnteroBase. We conclude that the MeltArray EST assay represents a paradigm-shifting tool for molecular serotyping of E. coli, with potential routine applications for comprehensive serotype analysis, disease diagnosis, and outbreak detection.

KEYWORDS: Diarrheagenic Escherichia coli, Shigella, molecular serotyping, culture-independent serotyping, correlation between serotype and pathotype

1. Introduction

Diarrheal illnesses and foodborne outbreaks caused by diarrheagenic Escherichia coli (DEC) are enduring public health threats, and the major causes of morbidity and mortality in children in developing countries in particular.1–3 However, not all E. coli strains isolated from diarrheal patients are pathogenic because commensal E. coli also extensively colonize the gut,4–6 Therefore, distinguishing pathogenic strains from nonpathogenic ones is critical for controlling DEC infections.

E. coli pathotypes are classified based on the presence of specific virulence genes,7 while E. coli serotypes are defined by particular combinations of O (lipopolysaccharide) and H (flagellar) antigens.8 The identification of DEC pathotypes and serotypes, which serve as critical strain markers in outbreak analyses and epidemiological surveillance, follows well-established protocols. These protocols generally involve three steps: 1) conventional culture and biochemical methods to obtain and characterize E. coli isolates;8,9 2) molecular methods to detect virulence genes specific to five major DEC pathotypes, including eae, escV or bfpB for enteropathogenic E. coli (EPEC); stx1 or stx2 for Shiga toxin – producing E. coli (STEC); lt, stp or sth for enterotoxigenic E. coli (ETEC); aggR, pic or astA for enteroaggregative E. coli (EAEC); and ipaH for enteroinvasive E. coli (EIEC) and Shigella,10–12 and 3) serotyping using an antiserum panel against both O and H antigens.7,13,14 Notably, Shigella species can be regarded as members of the EIEC pathotype based on their virulence attributes.15 However, the manipulations involved in each of these three steps are lengthy and labor-intensive. Furthermore, most laboratories worldwide possess only a subset of the necessary antisera, which cover over 180 O antigens (O1 to O187) and 53 H antigens (H1 to H53).16 Consequently, 47.5% to 82.5% of E. coli isolates obtained from diarrheal patients cannot be practically serotyped.16–18

The association between serotype and pathogenicity of E. coli has been extensively explored. As early as 1949, EPEC was epidemiologically shown to be the etiological agent in frequent and very severe outbreaks of infantile diarrhea.19 Initially, relatively few O groups (O111 and O55) were found to be associated with diarrhea outbreaks, and several other serotypes were later on added to the list.20 A notorious example is the serotype O157:H7, which produces cytotoxic enterotoxins (Shiga toxin, stx) that target endothelial cells of vessels and are responsible for hemolytic uremic syndrome (HUS).21 Other stx producing E. coli serotypes also exist, such as O26, O45, O103, O104, O111, O121, and O145.22,23 Owing to its high immunogenicity, the O-antigen of E. coli is considered a promising vaccine target,24–28 So far, however, only a few E. coli serotypes have been consistently associated with diseases,14,17 and the correlation between serotype and pathotype is not always reproducible.29,30 Whether an inherent association exists between DEC serotype and pathotype remains an open question that urgently needs to be addressed.

To explore the association of E. coli O and H serotypes with pathogenicity, we developed a highly multiplexed, real-time PCR scheme, termed the MeltArray E. coli serotyping (EST) assay. This assay can simultaneously distinguish the five major DEC pathotypes and all available O and H groups in a single test. The utility of the MeltArray EST assay was demonstrated through direct analysis of anal swab samples in both spike-in validation experiments and a retrospective study. We then applied the MeltArray EST to investigate the association between pathotype and serotype of E. coli in a multi-province study and further to differentiate the four major species of Shigella from EIEC.

2. Materials and methods

2.1. Study design

The objective of this study was to explore a clinically applicable approach to identify all E. coli serotypes and demonstrate the association between serotypes and DEC pathotypes. We developed a molecular serotyping assay with the capacity of identifying all known antigen-encoding genes of O and H serotypes of E. coli by using 229 plasmids containing the insertion sequences of each target (Table S1) and 176 reference strains collected by the Shenzhen Centre for Disease Control and Prevention (CDC) (Table S2). Simulated samples were used to examine the ability of the assay to identify serotypes directly from fecal samples, followed by a retrospective study with a focus on the utility of the assay for culture-independent outbreak detection. After that, we used the assay to explore the association between serotypes and five major DEC pathotypes by testing 637 isolates in a multi-province study (Table S3), and went further to confirm this relationship by performing whole genome sequencing (WGS) analysis of 166 DEC isolates from EnteroBase (Table S4). Furthermore, we examined the association between E. coli serotypes and four major species of Shigella by testing 257 Shigella isolates (Table S5) and 18 EIEC isolates (Table S3) in our study and performing WGS analysis of 775 Shigella isolates (Table S6) and 360 EIEC isolates (Table S7) from EnteroBase. In all experiments, sample sizes were determined according to previous publications and experimental experience, and investigators were blinded from the nature of strains to run the assay. Data of 93 isolates in the multi-province study were classified as nonpathogenic E. coli and therefore excluded from analysis. Sample sizes, experimental replicates, and statistics are given in the corresponding figures, figure legends, and data files.

2.2. Bacterial isolates

E. coli reference strains used in this study (n = 176, Table S2) were collected by the Shenzhen CDC. All strains were grown aerobically at 37°C by shaking at 220 rpm in Luria – Bertani (LB) broth. The O-antigenic serogroups of these strains were determined using commercial E. coli antisera31 (Denka Seiken Co. Ltd., Tokyo, Japan). Strains that did not react with any of the O antisera were classified as O-nontypeable (ONT). For comparison, each isolate was analyzed via WGS by BGI Genomics Co. Ltd. (Shenzhen, China) to examine the presence of corresponding virulence genes, O-antigen biosynthesis genes, H-antigen-encoding genes, and the E. coli-specific gene yccT.

E. coli strains isolated from stool samples of outpatients with diarrhea and used retrospectively for the multi-province study (n = 637, Table S3) were collected from five provinces of China, namely, Guangdong (n = 355), Henan (n = 106), Sichuan (n = 75), Jiangsu (n = 65), and Zhejiang (n = 36).

The serovars of Shigella isolates (n = 257, Table S5) were determined using commercial Shigella antisera (Denka Seiken),32 and the collection was provided by the Shenzhen CDC.

2.3. Plasmid construction

Target genes (detailed in Figure S1 legend) of the MeltArray EST assay include 12 virulence genes targeting 5 major diarrheagenic pathotypes,11 163 O-antigen biosynthesis genes targeting 182 O groups,33,34 53 H-antigen- encoding genes targeting 53 H types35,36 and one E. coli-specific gene, yccT.37 A total of 229 plasmids that contain target sequences of the above genes (Table S1) were synthesized by Sangon Biotech (Shanghai, China). Each insertion sequence was cloned into a pUC57 vector, and the resulting plasmids were individually transformed into E. coli TOP10 strain.

2.4. Preparation of simulated samples

Stool sample negative for the EHEC virulence genes (stx1, stx2, escV, eae) was divided into 5 equal parts weighing 200 mg each for 5 multiplex PCR amplification reactions. Serially diluted (105, 104, 103, 102, 101 copies/μL, determined by digital PCR via the quantification of yccT) cultures (50 μL) of a O157 strain (GD-325, Table S3) or a O111 strain (GD-293) were spiked into the above prepared stool samples, and used as simulated stool samples.

2.5. Outbreak cases in a retrospective study

A foodborne disease outbreak occurred at a high school in Shenzhen in May 2019 and had 8 suspected patients who presented diarrhea (>3 times/day) and at least vomiting (>2 times/day) or abdominal pain. Anal swabs from 7 of them were collected by the Yantian District CDC (Shenzhen, China), and E. coli were isolated from 5 samples and preliminary identified as EPEC strain O44 using specific antisera.

2.6. DNA extraction

E. coli isolates, Shigella isolates, anal swab samples, and simulated samples were stored in a − 80°C freezer before DNA extraction. Genomic DNA of bacterial isolates was extracted using the QIAsymphony DSP DNA Mini Kit (Qiagen, Hilden, Germany) following the manufacturer’s instructions; the total DNA of simulated samples and anal swab samples was extracted using a QIAamp Fast DNA Stool Mini Kit (Qiagen). The extracted nucleic acids were stored at − 80°C before use.

2.7. MeltArray EST assay

MeltArray EST assay reactions were performed in a 25-μL volume, containing DNA template (5 μL), and PCR master buffer [(10 mM Tris – HCl (pH 8.0) and 50 mM KCl), 7 mM MgCl2, 0.2 mM deoxynucleoside triphosphates (dNTPs), 2.5 U of Taq 01 DNA polymerase (Zeesan Biotech, Xiamen, China)]. Each reaction also contained a universal primer tag, tag sequence-tailed target-specific primers, mediator probes, and universal molecular beacon reporters. Sequences and concentrations of the primers and probes are listed in Table S8. PCR and melting curve analyses were performed in a SLAN 96S real-time PCR detection system (Hongshi Medical Technology Co. Ltd., Shanghai, China) following the program below: denaturation at 95°C for 5 min; 40 cycles of 95°C for 20 s and 60°C for 1 min; 35°C for 40 min, 95°C for 2 min, 45°C for 2 min, and a final temperature increase from 45°C to 95°C at 0.04°C/step for melting curve analysis. Fluorescence intensity was measured in six detection channels [Atto 425 (450 nm), FAM (510 nm), HEX (565 nm), ROX (620 nm), Cy5 (665 nm), and Quasar 705 (705 nm)] in each step of continuous temperature increase during the melting curve analysis. Data were analyzed using the SLAN 96S real-time PCR detection system software, version 8.2.2.

2.8. Digital PCR

Singleplex digital PCR was performed for quantification of E. coli isolates in the spike-in validation experiment, and triplex digital PCR was performed for quantification of linked genes in the retrospective study. Primer and probe sequences and concentrations used are listed in Table S9. Digital PCR was performed on a TD-1 Droplet Digital PCR system (TargetingOne, Beijing, China) following the manufacturer’s instructions.

2.9. Whole genome sequencing

The procedures of WGS and comparative genomics analysis were described previously.38 Specifically, DNA libraries with an insert size of 300 bp were sequenced using either a single-end 100 bp mode (SE100) or a pair-end 100 bp mode (PE100) on a BGISEQ-500 sequencer (BGI Inc., Shenzhen, China). Raw reads were quality trimmed using SOAPnuke,39 assembled (depth = 0, minlen = 50) into contigs by Shovill (https://github.com/tseemann/shovill), and annotated using prokka 1.1340 with Swiss-Prot database of Uniprot41 to locate the open reading frames (ORFs).

2.10. Genome sequences downloaded from EnteroBase

A collection of 166 DEC genomes of common serotypes defined in our multi-province study was listed in Table S4. The genomes of 775 Shigella (52 serotypes, comprising four Shigella species, Table S6) and 360 EIEC of 45 serotypes (Table S7) were available from EnteroBase (http://enterobase.warwick.ac.uk/).42

2.11. Genome sequence analysis

For genome sequences obtained by WGS or downloaded from EnteroBase, the online tool VirulenceFinder (https://cge.food.dtu.dk/services/VirulenceFinder.) was used to examine the presence of corresponding virulence genes,43 and SerotypeFinder (https://cge.food.dtu.dk/services/SerotypeFinder) was used to examine the presence of O-antigen biosynthesis genes and H-antigen-encoding genes.44

2.12. Statistical analysis

Data analysis was carried out in R v.4.1.2. Pearson correlation coefficient (r) was used to assess the linear relationship between linked genes, and a correlation coefficient value (0.80–1.00) was defined as a “very strong positive correlation”. SP ratio (the sum of isolate number of dominant pathotype of each serotype/O group/H type divided by total number of isolates) was used to define the correlation between serotypes/O groups/H types and pathotypes.

3. Results

3.1. MeltArray E. coli serotyping (EST) assay for E. coli

To establish a rapid and comprehensive molecular serotyping assay for E. coli, we used a highly multiplexed, real-time PCR scheme termed ‘MeltArray EST assay’. The assay is composed of 5 multiplex PCR amplification reactions whereby reaction 1 targets 12 virulence genes and the yccT gene of E. coli, reactions 2–4 target 163 O-antigen biosynthesis genes, and reaction 5 targets 53 H-antigen-encoding genes (Figure S1a). Each target in each reaction is identified based on a unique 2-dimensional (2D) label consisting of a fluorescent dye (Fn, one of the six fluorophore types in correspondence with the detection channel of the real-time PCR thermocycler) and melting temperature (Tm).45 DEC pathotypes were identified based on different combinations of 12 virulence genes (Figure S1b). Serotypes were determined based on the combination of O antigens/groups and H antigens. The MeltArray EST assay is capable of identifying or classifying of 182 O antigens/groups and 53 H antigens. Among them, 149 have unique O-antigen biosynthesis genes and the remaining 33 share identical or very similar O-antigen biosynthesis genes,46 which in this study were categorized into 14 groups (Gp1 to Gp10 and Gp12 to Gp15, Figure S1c). Meanwhile, 53 H antigens all have their unique H-antigen-encoding genes.

A series of evaluations were performed to assess the analytical performance of the MeltArray EST assay. Reproducibility analysis was performed using 229 plasmids containing the insertion sequences of each target. MeltArray EST showed a maximum coefficient of Tm with variations lower than 0.4%, enabling all targets to be unambiguously distinguished (Figure 1(a)). When testing 176 reference E. coli strains, the assay exhibited 96% concordance with traditional antisera-based serotyping and 100% concordance with whole-genome sequencing (WGS) (Figure S2). The limit of detection (LOD) was determined to be 500 copies/reaction (Figure S3). LOD analysis also showed that the concentration of each target was proportional to the height of its melting peak (Rm); Rm values were thus used to define the abundance of the target as a third dimension of the target label (Figure 1(b)), becoming 3D together with Fn and Tm. Assuming that the copy number ratio of O-antigen biosynthesis genes, H-antigen-encoding genes, and virulence genes is fixed in E. coli, the combined use of Fn, Tm, and Rm helps define the exact O:H serotype of a pathogenic E. coli strain. This was true because Rm values of these genes identified by 2D label were proportional to their copy number in each clinical sample, especially with disease outbreaks in which all patients were affected by the same E. coli strain. Thus, the MeltArray EST assay can be used for culture-dependent identification of DEC based on a 2D label comprising Fn and Tm, and for culture-independent identification based on a 3D label comprising Fn, Tm, and Rm.

Figure 1.

Figure 1.

Identification (ID) of pathotypes and serotypes of E. coli through 5 MeltArray reactions (Rxns). (a) Setup of 5 MeltArray reactions. Rxn 1, 13-plex PCR targeting 12 virulence genes and one E. coli-specific gene (yccT); Rxn 2, 62-plex PCR targeting 61 O-antigen biosynthesis genes and yccT; Rxn 3, 52-plex PCR targeting 51 O-antigen biosynthesis genes and yccT; Rxn 4, 52-plex PCR targeting 51 O-antigen biosynthesis genes and yccT; Rxn 5, 54-plex PCR targeting 53 H-antigen encoding genes and yccT. In each reaction, the target genes are identified based on 2D labels comprising fluorophore types (Fn) and melting temperatures (Tm). The fluorophore types are indicated by the colours of the solid circles, and the Tm values are shown as averaged Tm values ±3-fold the standard deviations (n = 8) using a collection of 229 plasmids with the concentration of 104 copies/μL. (b) Conceptual illustration of a 3D label comprising Fn, Tm, and melting peak height (Rm) for correlation analysis among pathotypes, O groups and H types. The 13 target genes are: eae, encoding intimin for E. coli attaching and effacing; escV, gene on the locus of enterocyte effacement (LEE), encoding a type III secretion factor; bfpB, bundle-forming pilus B; stx1, shiga-like toxin I; stx2, shiga-like toxin II; lt, heat-labile enterotoxin; stp, heat-stable enterotoxins initially discovered in the isolates from pigs; sth, heat-stable enterotoxins initially discovered in the isolates from human; aggR, aggregative adhesive fimbriae regulator; pic, encoding a protein involved in intestinal colonization; astA, enteroaggregative heat-stable enterotoxin A; ipaH, invasive plasmid antigen H-gene; yccT, a gene encoding a conserved protein in E. coli with an unknown function.

3.2. Culture-independent identification of simulated samples

To examine the potential of the 3D label to identify serotypes directly from fecal samples, we spiked into each different normal fecal sample with either enterohemorrhagic E. coli (EHEC) O157:H7 or EHEC O111:H8 at a series of final concentrations (ranging from 105 to 101 copies/μL, quantified by digital PCR for E. coli) to simulate actual fecal samples from different patients containing varying numbers of E. coli. The MeltArray EST assay successfully detected the spiked-in and expected EHEC-specific (stx1, stx2, escV, and eae), O157 and H7 genes. Additionally, it detected 10 O groups and 3 H types other than O157 and H7 that preexisted in the same normal fecal sample (Figure S4a, group 1). Hence, the O157:H7 strain could not be identified by 2D label comprised of Fn and Tm only, owing to the presence of mixed O groups and H types in the simulated sample. This was also true of the second type of simulated samples in which the O111:H8 strain could not be identified by the 2D label, owing to the background O groups and H types when a different fecal material was spiked (Figure S4a, group 2). However, when Rm values as the 3rd dimention (thus, 3D label) were additionally used for correlation analysis between the genes detected in each fecal sample, the Rm values of EHEC-specific genes (stx1 or stx2, eae, and escV) were proportional to the Rm values of the serotype-specific genes of O157 and H7 in group 1, and O111 and H8 in group 2 (Figures S4b and S4c), thereby enabling specific identification of the exact serotype of EHEC spiked into the fecal sample.

3.3. Direct identification of the outbreak strain in a retrospective study

To test the ability of MeltArray EST in identifying serotypes directly from clinical samples, we retrospectively analyzed stored anal swab samples collected in 2019 during a foodborne outbreak involving 7 patients from a high school in Shenzhen. EPEC-specific genes (escV or eae) were detected in all 7 samples (Figure S5). Upon examining the Rm values of all melting peaks, we observed a strong positive correlation between the Rm values of these two genes and the serotype-specific genes Gp10 and H11 (Figure 2(a)). When comparing the melting peaks of escV, Gp10, and H11 in the 7 samples, their normalized Rm values were found to proportionally increase or decrease across different samples, showing direct correlations among the three genes (Figure 2(b)). These correlations were further confirmed by digital PCR, which demonstrated that the gene copy numbers of escV, Gp10, and H11 were nearly equal in each sample (Figure 2(c)). Additionally, strong positive correlations (r ≥ 0.989) were observed among the copy numbers of these three genes (escV, Gp10, and H11), indicating that their coexistence in the genome of the outbreak strain. We thus conclude that the outbreak strain is EPEC Gp10:H11.

Figure 2.

Figure 2.

Investigation of a food poisoning outbreak in a retrospective study. (a) Correlation analysis between the melting peak height (Rm) of identified genes. Pearson correlation coefficient (r) was indicated by the colour of the lines or the heat map where blue indicates positive correlation and red indicates negative correlation. (b) MeltArrayArray EST procedure to conduct correlation analysis of the Rm values of the escV, Gp10, and H11 marker genes from 7 samples (S1 to S7). The three panels shown from left to right are: normalized melting peaks, normalized Rm values, and correlation analysis of the Rm values of the three marker genes. (c) Digital PCR procedure to conduct correlation analysis of the Rm values of the escV, Gp10, and H11 marker genes. The three panels shown from left to right are: triplex digital PCR raw data from sample 1 (S1), copy numbers of the marker genes from 7 samples (S1 to S7), and correlation analysis of the copy numbers of the maker genes.

Isolates cultured from patient samples and subsequently subjected to the MeltArray EST assay yielded results consistent with those obtained directly from the anal swab samples without culturing. Sanger sequencing of the isolated strains using primer pairs for Gp10 and H1134, 36 further confirmed the presence of both Gp10- and H11-specific sequences, validating the results of our assay performed directly on the anal swab samples. Notably, the entire investigation process using the MeltArray EST assay, from the initial anal swab samples to the final results, was completed within 4.0 hrs (1.0 hr for nucleic acid extraction and 3.0 hrs for the assay), significantly shorter than the culture-dependent serotyping procedure, which typically takes at least 5 days.

To further corroborate our results with traditional serotyping, we retrieved the outbreak investigation records and found that the causative strain was initially identified as EPEC O44 by the local district CDC using an antisera-based serotyping method. However, the Shenzhen municipal CDC later corrected this to an O-nontypeable strain, as no agglutination was observed against O44 or any other available antisera. No H-type serotyping was performed at either the district or municipal CDC. This inconsistency in serotyping result by different institutions, along with the frequent lack of H serotyping in culture-based methods, highlights the advantages of molecular serotyping, which offer better reproducibility and higher resolution than conventional antisera-based method.

3.4. Association of serotypes with DEC pathotypes

To explore the association between serotypes and DEC pathotypes, we applied the MeltArray EST assay to 637 E. coli strains previously isolated from stool samples of diarrheal patients across five different provinces of China (Guangdong, Henan, Sichuan, Jiangsu, and Zhejiang). Of these 637 strains, 544 were classified as DEC based on the presence of virulence genes (Figure S1b). O groups and H types were identified in 93.2% (507/544) and 99.3% (540/544) of the DEC strains, respectively, resulting in an O:H serotyping rate of 92.6% (504/544). The geographical distribution patterns revealed that ETEC, EPEC, and EAEC were the dominant pathotypes, whereas EIEC and STEC were extremely rare (Figure 3). Among the 544 DEC isolates, 218 distinct O:H serotypes were identified, each comprising less than 5% of the total, except for O159:H34, which accounted for 8.3% (45/544), indicating high serotype diversity within this collection.

Figure 3.

Figure 3.

Geographical distribution patterns of diarrheagenic pathotypes and serotypes in five provinces of China.

When pathotypes were superposed onto the serotypes, nearly every serotype corresponded to a single pathotype, as depicted by single-colored bubbles in Figure 4(a), indicating that these serotypes were “pure” in terms of their pathotype attribution. We introduced the concept of serotype purity (SP), defined as the percentage of isolates belonging to dominant pathotypes within each serotype. Chord diagram connecting O types/groups, H types, and serotypes (n ≥ 5) with the five pathotypes (Figure 4(b)−4(d)) revealed SP values of 84.5% (317/375), 70.5% (368/522), and 97.2% (243/250) for the O group, H type, and serotype, respectively. This result further demonstrates that almost every serotype exhibits a single corresponding pathotype, supporting the existence of an inherent association between serotype and DEC pathotype.

Figure 4.

Figure 4.

Correlation analysis of serotypes and pathotypes. (a) Distribution of serotypes and pathotypes of DEC strains in the multi-province study (n = 544). The numbers on the x-axis from 1 to 187 indicate O1 to O187, 190 to 194 indicate Gp1 to Gp5, 195 to 198 indicate Gp7 to Gp10, 199 indicates Gp13, 200 indicates Gp15, and 0 indicates O-nontypeable (ONT); the numbers on the y-axis from 1 to 53 indicate H1 to H53, and 0 indicates H-nontypeable (HNT). (b) Correlation of common O groups (n ≥ 5) with pathotypes (n = 375). (c) Correlation of common H types (n ≥ 5) with pathotypes (n = 522). (d) Correlation of common serotypes (n ≥5) with pathotypes (n = 250). (e) Comparison between our study results (n = 250) and online data analysis results (n = 78) with respect to the correlation of serotypes with pathotypes.

To examine whether the above conclusion is applicable to other DEC strains, we analyzed 166 whole-genome sequences of DEC strains downloaded from EnteroBase (Table S4), including the common O:H serotypes identified in in our study, EnteroBase includes a larger proportion of STEC strains compared to other pathotypes, likely due to the association of STEC strains with severe diseases. To account for this bias and the presence of the de facto hybrid pathotypes of STEC strains, we divided the retrieved genome sequences into two groups: a non-STEC group (n = 78) and an STEC group (n = 88). The prediction results showed that all non-STEC strains displayed fully identical associations between their serotypes and pathotypes, similar to our findings. However, one exception was noted: O15:H18 strains was classified as EAEC in EnteroBase but as either ETEC or EAEC in our collection (Figure 4(e)). The STEC strains (n = 88) were further subdivided into EHEC (n = 61) and non-EHEC (n = 27) strains. The EHEC group displayed a hybrid pathotype containing virulence genes from both STEC (stx1 or stx2) and EPEC (eae and escV) strains, and the non-EHEC group only contained virulence genes from STEC (stx1 or stx2) strains. Sequence prediction revealed that the serotype of all EHEC strains in EnteroBase was classified as EPEC in our collection (Figure S6), reflecting the hybrid nature of EHEC with EPEC. In contrast, the serotypes of the non-EHEC strains in EnteroBase were classified as both EAEC and EPEC in our collection, indicating some rare but noteworthy exceptions to the association between serotypes and pathotypes.

3.5. Identification of distinct serotype patterns between EIEC and Shigella

Given the inherent association between serotype and DEC pathotype, we hypothesized that Shigella, despite sharing a nearly identical pathogenic mechanism with EIEC and being genetically indistinguishable from it, might possess distinct serotypes. To test this hypothesis, we applied the MeltArray EST assay to 257 Shigella and 18 EIEC isolates from our collection. Additionally, we downloaded whole-genome sequences of 775 Shigella isolates, representing 52 serotypes across four Shigella species (Table S6) and 360 EIEC isolates, representing 45 E. coli serotypes (Table S7), from EnteroBase for serotype predication. The assay results revealed 100% serotype purity (275/275) for our own collection (Figure 5(a)), while the EnteroBase genomes showed a near-perfect serotype purity of 99.8% (1133/1135). These findings demonstrate that Shigella and EIEC can indeed be differentiated based on their distinct serotypes.

Figure 5.

Figure 5.

Discrepancy analysis between EIEC and shigella based on O:H serotypes. (a) Distribution of O:H serotypes of EIEC and Shigella strains. The numbers on the x-axis from 1 to 187 indicate O1 to O187, 190 to 194 indicate Gp1 to Gp5, 195 to 198 indicate Gp7 to Gp10, 199 indicates Gp13, 200 indicates Gp15, and 0 indicates O-nontypeable (ONT); the numbers on the y-axis from 1 to 53 indicate H1 to H53, and 0 indicates H-nontypeable (HNT). (b) Discrepancy of O:H serotype compositions between EIEC and four major species of Shigella. Sf, Shigella flexneri; sd, Shigella dysenteriae; Sb, Shigella Boydii; ND, Shigella serotype non-detected.

We further explored whether the four major Shigella species could be differentiated from each other based on serotypes. Using a chord diagram, we analyzed all 378 EIEC and 1032 Shigella isolates from our collection, along with the sequences downloaded from the EnteroBase. Intriguingly, the four major species of Shigella were clearly distinguished based on their serotypes (Figure 5(b)). Specifically, the two dominant species, S. flexneri and S. sonnei, were classified into an extremely small number of E. coli serotypes: 93.2% (397/426) of S. flexneri strains were classified as serotype Gp10:H14, and 99.6% (270/271) of S. sonnei strains were identified as ONT:H16. We also observed a strong association (SP = 97.9%, 138/141) between the serotypes of E. coli and the serotypes of S. dysenteriae, but a relatively weak association for S. boydii (SP = 75.3%, 146/194). Notably, 45.4% (88/194) of S. boydii strains were classified as O-nontypeable, which contributed to the reduced resolution in serotype differentiation. These results suggest that an inherent association exists between Shigella species and E. coli serotypes, further distinguishing them from EIEC.

4. Discussion

In this study, we established a highly multiplexed real-time PCR-based serotyping scheme for E. coli, MeltArray EST, which has demonstrated its potential to directly analyze anal swab or stool samples. In a retrospective study, we successfully performed culture-independent serotyping of outbreak strains from anal swab samples. Using this novel molecular serotyping tool, we unambiguously distinguished the five major DEC pathotypes by their unique serotype patterns. Moreover, MeltArray EST differentiated EIEC from Shigella and distinguished the four major Shigella species based on their distinct serotype compositions. Our findings strongly suggest an inherent association between DEC serotype and pathotype.

Traditional comprehensive E. coli serotyping requires over 180 antisera for O antigens and more than 50 antisera for H antigens,33,35 which are only available in a limited number of laboratories globally.14 Moreover, traditional serotyping relies on isolating bacteria in pure cultures, a slow process and often impractical for clinical or outbreak investigations,47,48,49 In addition, the presence of numerous commensal E. coli in the intestinal tract complicates culture-independent identification of outbreak strains.50,51 Furthermore, agglutination tests with specific antisera are laborious, and sometimes result in cross-reactions between different serotypes, leading to ambiguous results.52

In contrast, molecular serotyping methods have proven effective in accurately identifying O-antigen biosynthesis genes (wzx, wzy, wzm or wzt) and H-antigen-encoding genes (fliC and its homologs) of E coli.33,35,53–56 However, the number of target genes detected by conventional PCR and real-time PCR assays is limited by the resolution of band sizes in gel33,35 and the number of fluorometric detection channels available,53,54 respectively. Although microarrays55 and Luminex microbead-based suspension array56 can identify dozens of serotypes, these assays require specialized equipment, various post-PCR manipulations, and are prone to amplicon contamination. By contrast, the MeltArray EST assay covers 12 virulence genes, 182 O serotypes, and 53 H serotypes in just five PCR amplification reactions, which can be completed in a single step with prepared nucleic acids, taking less than 3 h, and making it a highly efficient alternative to traditional antisera-based serotyping.

The correlation between DEC serotypes and pathotypes has been a longstanding question. Although certain E. coli serotypes are believed to be associated with specific clinical symptoms,7,8,20,57 traditional serotyping often fails to accurately identify the exact pathotypes, and the correlation between serotypes and pathotypes is not always reproducible.14,29,30 Our multi-province study demonstrates, for the first time, a near-complete correspondence between DEC serotypes and pathotypes in China. This finding is expected to aid in disease diagnosis and outbreak detection.

The O antigen, a component of lipopolysaccharide, has been considered the most promising vaccine target for E. coli.28 For example, an O157-specific polysaccharide was developed as a vaccine (O157-rEPA; lot 011094) against STEC,58 and conjugate vaccine candidates were produced against two of the most prevalent O serogroups (O148 and O78) of ETEC.59 However, studies using traditional E. coli serotyping to guide vaccine target selection are limited.60 With its 96% concordance with traditional antisera-based serotyping and alignment with findings from previous research,33,55,61 molecular serotyping (both MeltArray EST assay and WGS analysis) can be used to guide the selection of O antigens for vaccines.

Our multi-province study also provided a dynamic profile of previously unknown or notorious serotypes that require close surveillance in China. For example, STEC can cause hemorrhagic colitis and HUS.62 While O157:H7 is the most well-known STEC type, the prevalence of non-O157:H7 STEC remains largely unknown due to difficulties in detection. Our study identified 7 STEC isolates, including both O157:H7 (n = 2) and non-O157:H7 serotypes (n = 5), suggesting that non-O157 STEC might even have a higher infection rate than O157:H7 in some provinces of China. EHEC strains, considered a more pathogenic subset of STEC,8 were also identified in our study, each belonging to one of the top 7 STEC serogroups that account for over 90% of STEC infections in the United States.22 Additionally, we detected the EAEC O104:H4, associated with a major outbreak of HUS in Germany in 2011,63 in Guangdong (n = 3) and Henan provinces (n = 1), highlighting the need for close surveillance. The burden of disease caused by atypical enteropathogenic E. coli (aEPEC) has been increasing annually in both industrialized and developing countries, yet the population structure of this emerging pathogen is poorly understood.64 Our study showed that aEPEC accounted for the majority (98.3%, 169/172) of total EPEC isolates, with a significantly higher proportion than in Africa and South Asia,2 suggesting that aEPEC may have replaced typical EPEC as the predominant pathotype in China.

Differentiating Shigella strains from EIEC is crucial for clinical diagnosis and epidemiological investigations,65–67 The high similarities between Shigella and EIEC strains make this distinction challenging.68,69 Our study demonstrates that MeltArray EST assay can unequivocally distinguish EIEC from Shigella based on their distinct serotype patterns. This assay has the potential to directly identify an isolate as Shigella or EIEC in clinical settings, filling a significant gap left by traditional serotyping methods. Interestingly, although H antigens (flagellin) are rarely expressed,14 H-antigen-encoding genes were identified in nearly all Shigella and EIEC strains (98.4%, 1387/1410) in this study, which might explain why Shigella species could be distinguished by molecular serotyping of E. coli, but not by antisera-based serotyping.

Some limitations of this study should be noted. First, 33 O serotypes were classified into 14 groups (Gp1 to Gp10 and Gp12 to Gp15) based on the presence of identical or similar O-antigen biosynthesis genes,46 making it challenging to distinguish O serotypes within the same group using the MeltArray EST assay. In such cases, WGS analysis is needed to identify specific genomic differences between strains of these O groups. Second, while we attained distinct serotype patterns of enteric pathogenic E. coli, the serotype patterns of commensal E. coli in the gut remain unknown. Future studies on commensal E. coli could help distinguish pathogenic from nonpathogenic strains.

In conclusion, MeltArray EST provides a paradigm-shifting molecular serotyping tool that confirms the relationship between DEC serotype and pathotype. It provides numerous advantages over traditional antisera-based E. coli serotyping methods, including substantial savings in time, labor, and costs, while maintaining high sensitivity and accuracy. Our study suggests that, when harnessed with MeltArray EST, E. coli serotype can serve as a robust disease biomarker for DEC infection and a potential target for vaccine development in addition to its indispensable role in epidemiological studies.

Supplementary Material

Supplemental Material

Acknowledgments

We thank professors Weihong Tan, Karl Drlica, Xilin Zhao and Xinhe Lai for valuable comments and staff from five Provincial CDC (Guangdong, Henan, Sichuan, Jiangsu, and Zhejiang) for their participation in and contribution to the work of strain collection.

Funding Statement

This work was supported by the Important National Science and Technology Specific Project (2017ZX10303406), National Key R&D Program of China (KY202002008 and 2021YFC1200200), Major Science and Technology Project of the Inner Mongolia Autonomous Region (2021ZD0006), Fujian University Industry-University-Research Joint Innovation Project (2021Y4001), Xiamen Major Science and Technology Project (3502Z20191007), Shenzhen Science and Technology Program (KCXFZ20230731093959008), and Sanming Project of Medicine in Shenzhen (SZSM202311015).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data and material availability

All study data are included in the article and/or supporting information. The sequencing data of reference strains (Table S5) are available from the NCBI sequence database under accession number PRJNA890762.

Ethics approval

All procedures were approved by the Medical Ethics Committee of the School of Medicine, Xiamen University (XDYX2021016).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/19490976.2024.2401944

References

  • 1.Wang LP, Zhou SX, Wang X, Lu QB, Shi LS, Ren X, Zhang HY, Wang YF, Lin SH, Zhang CH, et al. Etiological, epidemiological, and clinical features of acute diarrhea in China. Nat Commun. 2021;12:2464. doi: 10.1038/s41467-021-22551-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kotloff KL, Nataro JP, Blackwelder WC, Nasrin D, Farag TH, Panchalingam S, Wu Y, Sow SO, Sur D, Breiman RF, et al. Burden and aetiology of diarrhoeal disease in infants and young children in developing countries (the global enteric multicenter study, GEMS): a prospective, case-control study. Lancet. 2013;382(9888):209–17. doi: 10.1016/S0140-6736(13)60844-2. [DOI] [PubMed] [Google Scholar]
  • 3.Collaborators GBDDD, Blacker BF, Khalil IA, Rao PC, Cao S, Zimsen SR, Albertson SB, Stanaway JD, Deshpande A, Abebe Z, et al. Estimates of the global, regional, and national morbidity, mortality, and aetiologies of diarrhoea in 195 countries: a systematic analysis for the global burden of disease study 2016. Lancet Infect Dis. 2018;18(11):1211–1228. doi: 10.1016/S1473-3099(18)30362-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Tenaillon O, Skurnik D, Picard B, Denamur E.. The population genetics of commensal Escherichia coli. Nat Rev Microbiol. 2010;8:207–217. doi: 10.1038/nrmicro2298. [DOI] [PubMed] [Google Scholar]
  • 5.Riley LW, Blanton RE. Distinguishing pathovars from Nonpathovars: Escherichia coli*. Microbiol Spectr. 2020;8(4):8.4.1. doi: 10.1128/microbiolspec.AME-0014-2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Foster-Nyarko E, Pallen MJ. The microbial ecology of Escherichia coli in the vertebrate gut. FEMS Microbiol Rev. 2022;(3):46. doi: 10.1093/femsre/fuac008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kaper JB, Nataro JP, Mobley HL. Pathogenic escherichia coli. Nat Rev Microbiol. 2004;2:123–140. doi: 10.1038/nrmicro818. [DOI] [PubMed] [Google Scholar]
  • 8.Nataro JP, Kaper JB. Diarrheagenic escherichia coli. Clin Microbiol Rev. 1998;11:142–201. doi: 10.1128/CMR.11.1.142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Athumani Msalale L. Isolation and Characterization of Escherichia coli from Animals, Humans, and Environment. In: Amidou S, ed. Escherichia coli. Rijeka: IntechOpen, 2017:Ch. 10.
  • 10.Gomes TA, Elias WP, Scaletsky IC, Guth BE, Rodrigues JF, Piazza RM, Ferreira LC, Martinez MB. Diarrheagenic escherichia coli. Braz J Microbiol. 2016;47 Suppl 1:3–30. doi: 10.1016/j.bjm.2016.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Yun Z, Zeng L, Huang W, Wu Q, Fan Y, Zheng S, Peng L, Han J, Huang Y, Zhou H, et al. Detection and categorization of diarrheagenic Escherichia coli with auto-microfluidic thin-film chip method. Sci Rep. 2018;8:12926. doi: 10.1038/s41598-018-30765-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Müller D, Greune L, Heusipp G, Karch H, Fruth A, Tschäpe H, Schmidt MA. Identification of unconventional intestinal pathogenic escherichia coli isolates expressing intermediate virulence factor profiles by using a novel single-step multiplex PCR. Appl Environ Microbiol. 2007;73(10):3380–3390. doi: 10.1128/aem.02855-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ørskov F, Ørskov I. Serotyping of escherichia coli. In: Bergan T. editor. Methods in Microbiology. London: Academic Press; 1984. p. 43–112. [Google Scholar]
  • 14.Croxen MA, Law RJ, Scholz R, Keeney KM, Wlodarska M, Finlay BB. Recent advances in understanding enteric pathogenic escherichia coli. Clin Microbiol Rev. 2013;26:822–880. doi: 10.1128/CMR.00022-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lan R, Alles MC, Donohoe K, Martinez MB, Reeves PR. Molecular evolutionary relationships of enteroinvasive escherichia coli and shigella spp. Infect Immun. 2004;72:5080–5088. doi: 10.1128/IAI.72.9.5080-5088.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Tamaki Y, Narimatsu H, Miyazato T, Nakasone N, Higa N, Toma C, Iwanaga M. The relationship between O-Antigens and pathogenic genes of diarrhea-associated escherichia coli. Jpn J Infect Dis. 2005;58(2):65–69. doi: 10.7883/yoken.JJID.2005.65. [DOI] [PubMed] [Google Scholar]
  • 17.Yang JR, Wu FT, Tsai JL, Mu JJ, Lin LF, Chen KL, Kuo SH, Chiang CS, Wu HS. Comparison between O serotyping method and multiplex real-time PCR to identify diarrheagenic escherichia coli in Taiwan. J Clin Microbiol. 2007;45:3620–3625. doi: 10.1128/jcm.00596-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Chen Y, Chen X, Zheng S, Yu F, Kong H, Yang Q, Cui D, Chen N, Lou B, Li X, et al. Serotypes, genotypes and antimicrobial resistance patterns of human diarrhoeagenic Escherichia coli isolates circulating in southeastern China. Clin Microbiol Infect. 2014;20(1):52–58. doi: 10.1111/1469-0691.12188. [DOI] [PubMed] [Google Scholar]
  • 19.Taylor J, Powell BW, Wright J. Infantile diarrhoea and vomiting; a clinical and bacteriological investigation. Br Med J. 1949;2(4619):117–125. doi: 10.1136/bmj.2.4619.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Orskov F, Orskov I. Escherichia coli serotyping and disease in man and animals. Can J Microbiol. 1992;38(7):699–704. doi: 10.1139/m92-115. [DOI] [PubMed] [Google Scholar]
  • 21.Tarr PI, Gordon CA, Chandler WL. Shiga-toxin-producing Escherichia coli and haemolytic uraemic syndrome. Lancet. 2005;365(9464):1073–1086. doi: 10.1016/S0140-6736(05)71144-2. [DOI] [PubMed] [Google Scholar]
  • 22.Brooks JT, Sowers EG, Wells JG, Greene KD, Griffin PM, Hoekstra RM, Strockbine NA. Non-O157 shiga toxin–producing Escherichia coli infections in the United States, 1983–2002. J Infect Dis. 2005;192(8):1422–1429. doi: 10.1086/466536. [DOI] [PubMed] [Google Scholar]
  • 23.Walker MJ, Beatson SA, Epidemiology . Outsmarting outbreaks. Science. 2012;338:1161–1162. doi: 10.1126/science.1232327. [DOI] [PubMed] [Google Scholar]
  • 24.Royer G, Clermont O, Condamine B, Mercier-Darty M, Laouénan C, Lefort A, Denamur E, de Lastours V, Wolff M, Alavoine L. O-Antigen targeted vaccines against Escherichia coli May Be useful in reducing morbidity, mortality, and antimicrobial resistance. Clin Infect Dis. 2022;74(2):364–366. doi: 10.1093/cid/ciab458. [DOI] [PubMed] [Google Scholar]
  • 25.Rojas-Lopez M, Monterio R, Pizza M, Desvaux M, Rosini R. Intestinal pathogenic Escherichia coli: insights for vaccine development. Front Microbiol. 2018;9:440. doi: 10.3389/fmicb.2018.00440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Svennerholm AM, Qadri F, Lundgren A, Kaim J, Rahman Bhuiyan T, Akhtar M, Maier N, Louis Bourgeois A, Walker RI. Induction of mucosal and systemic immune responses against the common O78 antigen of an oral inactivated ETEC vaccine in Bangladeshi children and infants. Vaccine. 2022;40:380–389. doi: 10.1016/j.vaccine.2021.10.056. [DOI] [PubMed] [Google Scholar]
  • 27.Frenck RW Jr., Ervin J, Chu L, Abbanat D, Spiessens B, Go O, Haazen W, van den Dobbelsteen G, Poolman J, Thoelen S, et al. Safety and immunogenicity of a vaccine for extra-intestinal pathogenic Escherichia coli (ESTELLA): a phase 2 randomised controlled trial. Lancet Infect Dis. 2019;19(6):631–640. doi: 10.1016/s1473-3099(18)30803-x. [DOI] [PubMed] [Google Scholar]
  • 28.Lipworth S, Vihta KD, Chau KK, Kavanagh J, Davies T, George S, Barker L, Vaughan A, Andersson M, Jeffery K, et al. Ten years of population-level genomic Escherichia coli and Klebsiella pneumoniae serotype surveillance informs vaccine development for invasive infections. Clin Infect Dis. 2021;73(12):2276–2282. doi: 10.1093/cid/ciab006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Levine MM. Escherichia coli that cause diarrhea: enterotoxigenic, enteropathogenic, enteroinvasive, enterohemorrhagic, and enteroadherent. J Infect Dis. 1987;155(3):377–389. doi: 10.1093/infdis/155.3.377. [DOI] [PubMed] [Google Scholar]
  • 30.Robins-Browne RM, Holt KE, Ingle DJ, Hocking DM, Yang J, Tauschek M. Are Escherichia coli pathotypes still relevant in the Era of whole-genome sequencing? Front Cell Infect Microbiol. 2016;6:141. doi: 10.3389/fcimb.2016.00141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Li Y, Luo Q, Shi X, Lin Y, Qiu Y, Lv D, Jiang Y, Chen Q, Jiang M, Ma H, et al. Phenotypic and genotypic characterization of clinical enterotoxigenic Escherichia coli isolates from Shenzhen, China. Foodborne Pathog Dis. 2017;14(6):333–340. doi: 10.1089/fpd.2016.2233. [DOI] [PubMed] [Google Scholar]
  • 32.Ventola E, Bogaerts B, De Keersmaecker SCJ, Vanneste K, Roosens NHC, Mattheus W, Ceyssens PJ. Shifting national surveillance of shigella infections toward geno-serotyping by the development of a tailored luminex assay and NGS workflow. Microbiologyopen. 2019;8:e00807. doi: 10.1002/mbo3.807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Iguchi A, Iyoda S, Seto K, Morita-Ishihara T, Scheutz F, Ohnishi M, Bourbeau P, Pathogenic EcWGiJ . Escherichia coli O-Genotyping PCR: a comprehensive and practical platform for molecular O serogrouping. J Clin Microbiol. 2015;53(8):2427–2432. doi: 10.1128/JCM.00321-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.DebRoy C, Fratamico PM, Yan X, Baranzoni G, Liu Y, Needleman DS, Tebbs R, O’Connell CD, Allred A, Swimley M, et al. Comparison of O-Antigen gene clusters of all O-Serogroups of Escherichia coli and proposal for adopting a new nomenclature for O-Typing. PLOS ONE. 2016;11(1):e0147434. doi: 10.1371/journal.pone.0147434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Banjo M, Iguchi A, Seto K, Kikuchi T, Harada T, Scheutz F, Iyoda S, Pathogenic EcWGiJ . Escherichia coli H-Genotyping PCR: a complete and practical platform for molecular H typing. J Clin Microbiol. 2018;56(6):56. doi: 10.1128/JCM.00190-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Beutin L, Delannoy S, Fach P. Genetic analysis and detection of fliC H1 and fliC H12 genes coding for serologically closely related flagellar antigens in human and animal pathogenic Escherichia coli. Front Microbiol. 2016;7:135. doi: 10.3389/fmicb.2016.00135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Gadsby NJ, McHugh MP, Russell CD, Mark H, Conway Morris A, Laurenson IF, Hill AT, Templeton KE. Development of two real-time multiplex PCR assays for the detection and quantification of eight key bacterial pathogens in lower respiratory tract infections. Clin Microbiol Infect. 2015;21(8):.e788.1–.e.13. doi: 10.1016/j.cmi.2015.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Bian S, Jia Y, Zhan Q, Wong NK, Hu Q, Zhang W, Zhang Y, Li L. Vpsero: rapid serotyping of Vibrio parahaemolyticus using serogroup-specific genes based on whole-genome sequencing data. Front Microbiol. 2021;12:620224. doi: 10.3389/fmicb.2021.620224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z, et al. Soapnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience. 2018;7:1–6. doi: 10.1093/gigascience/gix120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  • 41.Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O’Donovan C, Phan I, et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003;31(1):365–370. doi: 10.1093/nar/gkg095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Zhou Z, Alikhan NF, Mohamed K, Fan Y, Achtman M. The EnteroBase user’s guide, with case studies on salmonella transmissions, yersinia pestis phylogeny, and Escherichia core genomic diversity. Genome Res. 2020;30:138–152. doi: 10.1101/gr.251678.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Joensen KG, Scheutz F, Lund O, Hasman H, Kaas RS, Nielsen EM, Aarestrup FM, Carroll KC. Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli. J Clin Microbiol. 2014;52(5):1501–1510. doi: 10.1128/jcm.03617-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Joensen KG, Tetzschner AM, Iguchi A, Aarestrup FM, Scheutz F, Carroll KC. Rapid and easy in silico serotyping of escherichia coli isolates by use of whole-genome sequencing data. J Clin Microbiol. 2015;53(8):2410–2426. doi: 10.1128/jcm.00008-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Huang Q, Chen D, Du C, Liu Q, Lin S, Liang L, Xu Y, Liao Y, Li Q. Highly multiplex PCR assays by coupling the 5′-flap endonuclease activity of taq DNA polymerase and molecular beacon reporters. Proc Natl Acad Sci USA. 2022;119(9):119. doi: 10.1073/pnas.2110672119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Iguchi A, Iyoda S, Kikuchi T, Ogura Y, Katsura K, Ohnishi M, Hayashi T, Thomson NR. A complete view of the genetic diversity of the Escherichia coli O-antigen biosynthesis gene cluster. DNA Res. 2015;22(1):101–107. doi: 10.1093/dnares/dsu043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Zhang X, Payne M, Kaur S, Lan R. Improved genomic identification, clustering, and serotyping of shiga toxin-producing Escherichia coli using cluster/Serotype-specific gene markers. Front Cell Infect Microbiol. 2021;11:772574. doi: 10.3389/fcimb.2021.772574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Cocolin L, Comi G. Use of a culture-independent molecular method to study the ecology of yersinia spp. In food. Int J Food Microbiol. 2005;105(1):71–82. doi: 10.1016/j.ijfoodmicro.2005.05.006. [DOI] [PubMed] [Google Scholar]
  • 49.Bording-Jorgensen M, Parsons B, Szelewicki J, Lloyd C, Chui L. Molecular detection of non-O157 shiga toxin-producing Escherichia coli (STEC) directly from stool using multiplex qPCR assays. Microorganisms. 2022;10(2):10. doi: 10.3390/microorganisms10020329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Loman NJ, Constantinidou C, Christner M, Rohde H, Chan JZ, Quick J, Weir JC, Quince C, Smith GP, Betley JR, et al. A culture-independent sequence-based metagenomics approach to the investigation of an outbreak of shiga-toxigenic Escherichia coli O104: H4. Jama. 2013;309:1502–1510. doi: 10.1001/jama.2013.3231. [DOI] [PubMed] [Google Scholar]
  • 51.Liu X, Noll L, Shi X, Porter E, Wang Y, Stoy C, Lu N, Nagaraja TG, Anderson G, Bai J, et al. Single-cell-based digital PCR detection and association of shiga toxin-producing Escherichia coli serogroups and major virulence genes. J Clin Microbiol. 2020. 58. 58(3). doi: 10.1128/jcm.01684-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Orskov I, Orskov F, Jann B, Jann K. Serology, chemistry, and genetics of O and K antigens of Escherichia coli. Bacteriol Rev. 1977;41:667–710. doi: 10.1128/br.41.3.667-710.1977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Beutin L, Jahn S, Fach P. Evaluation of the ‘GeneDisc’ real-time PCR system for detection of enterohaemorrhagic Escherichia coli (EHEC) O26, O103, O111, O145 and O157 strains according to their virulence markers and their O- and H-antigen-associated genes. J Appl Microbiol. 2009;106(4):1122–1132. doi: 10.1111/j.1365-2672.2008.04076.x. [DOI] [PubMed] [Google Scholar]
  • 54.Duong VT, Tu LTP, Tuyen HT, Nhi LTQ, Campbell JI, Van Minh P, Le Phuc H, Chau TTH, Ngoc NM, Vi LL, et al. Novel multiplex real-time PCR assays reveal a high prevalence of diarrhoeagenic Escherichia coli pathotypes in healthy and diarrhoeal children in the south of Vietnam. BMC Microbiol. 2020;20:192. doi: 10.1186/s12866-020-01878-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Wang Q, Wang S, Beutin L, Cao B, Feng L, Wang L. Development of a DNA microarray for detection and serotyping of enterotoxigenic Escherichia coli. J Clin Microbiol. 2010;48:2066–2074. doi: 10.1128/jcm.02014-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Lin A, Nguyen L, Lee T, Clotilde LM, Kase JA, Son I, Carter JM, Lauzon CR. Rapid O serogroup identification of the ten most clinically relevant STECs by luminex microbead-based suspension array. J Microbiol Methods. 2011;87(1):105–110. doi: 10.1016/j.mimet.2011.07.019. [DOI] [PubMed] [Google Scholar]
  • 57.Achtman M, Zhou Z, Charlesworth J, Baxter L. EnteroBase: hierarchical clustering of 100 000s of bacterial genomes into species/subspecies and populations. Philos Trans R Soc Lond B Biol Sci. 2022;377(1861):20210240. doi: 10.1098/rstb.2021.0240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Ahmed A, Li J, Shiloach Y, Robbins JB, Szu SC. Safety and immunogenicity of Escherichia coli O157 O-Specific polysaccharide conjugate vaccine in 2–5-Year-Old children. J Infect Dis. 2006;193(4):515–521. doi: 10.1086/499821. [DOI] [PubMed] [Google Scholar]
  • 59.Williams AJ, Warfel KF, Desai P, Li J, Lee JJ, Wong DA, Nguyen PM, Qin Y, Sobol SE, Jewett MC, et al. A low-cost recombinant glycoconjugate vaccine confers immunogenicity and protection against enterotoxigenic Escherichia coli infections in mice. Front Mol Biosci. 2023;10:1085887. doi: 10.3389/fmolb.2023.1085887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Lerouge I, Vanderleyden J. O-antigen structural variation: mechanisms and possible roles in animal/plant–microbe interactions. FEMS Microbiol Rev. 2002;26(1):17–47. doi: 10.1111/j.1574-6976.2002.tb00597.x. [DOI] [PubMed] [Google Scholar]
  • 61.Lin A, Sultan O, Lau HK, Wong E, Hartman G, Lauzon CR. O serogroup specific real time PCR assays for the detection and identification of nine clinically relevant non-O157 STECs. Food Microbiol. 2011;28(3):478–483. doi: 10.1016/j.fm.2010.10.007. [DOI] [PubMed] [Google Scholar]
  • 62.Havira MS, Ta A, Kumari P, Wang C, Russo AJ, Ruan J, Rathinam VA, Vanaja SK. Shiga toxin suppresses noncanonical inflammasome responses to cytosolic LPS. Sci Immunol. 2020. 5. 5(53). doi: 10.1126/sciimmunol.abc0217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Rohde H, Qin J, Cui Y, Li D, Loman NJ, Hentschke M, Chen W, Pu F, Peng Y, Li J, et al. Open-source genomic analysis of shiga-Toxin–Producing E. coli O104: H4. N Engl J Med. 2011;365(8):718–724. doi: 10.1056/NEJMoa1107643. [DOI] [PubMed] [Google Scholar]
  • 64.Ingle DJ, Tauschek M, Edwards DJ, Hocking DM, Pickard DJ, Azzopardi KI, Amarasena T, Bennett-Wood V, Pearson JS, Tamboura B, et al. Evolution of atypical enteropathogenic E. coli by repeated acquisition of LEE pathogenicity island variants. Nat Microbiol. 2016;1:15010. doi: 10.1038/nmicrobiol.2015.10. [DOI] [PubMed] [Google Scholar]
  • 65.Dhakal R, Wang Q, Lan R, Howard P, Sintchenko V. Novel multiplex PCR assay for identification and subtyping of enteroinvasive Escherichia coli and differentiation from shigella based on target genes selected by comparative genomics. J Med Microbiol. 2018;67(9):1257–1264. doi: 10.1099/jmm.0.000784. [DOI] [PubMed] [Google Scholar]
  • 66.van den Beld Mjc, de Boer Rf, Reubsaet FAG, Rossen JWA, Zhou K, Kuiling S, Friedrich AW, Kooistra-Smid M, van den Beld MJC, de Boer RF. Evaluation of a culture-dependent algorithm and a molecular algorithm for identification of shigella spp. Escherichia coli, And Enteroinvasive E Coli J Clin Microbiol. 2018;56(10). doi: 10.1128/jcm.00510-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.van den Beld Mj, Reubsaet FA, Beld MJC. Differentiation between shigella, enteroinvasive Escherichia coli (EIEC) and noninvasive Escherichia coli. Eur J Clin Microbiol Infect Dis. 2012;31(6):899–904. doi: 10.1007/s10096-011-1395-7. [DOI] [PubMed] [Google Scholar]
  • 68.Lan R, Lumb B, Ryan D, Reeves PR, DiRita VJ. Molecular evolution of large virulence plasmid in shigella clones and enteroinvasive Escherichia coli. Infect Immun. 2001;69(10):6303–6309. doi: 10.1128/iai.69.10.6303-6309.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Pettengill EA, Pettengill JB, Binet R. Phylogenetic analyses of shigella and enteroinvasive Escherichia coli for the identification of molecular epidemiological markers: whole-genome comparative analysis does not support distinct genera designation. Front Microbiol. 2015;6:1573. doi: 10.3389/fmicb.2015.01573. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

Articles from Gut Microbes are provided here courtesy of Taylor & Francis

RESOURCES