Abstract
Illnesses caused by Shiga toxin-producing Escherichia coli (STECs) can be life threatening, such as hemolytic uremic syndrome (HUS). The STECs most frequently identified by USDA’s Microbiological Data Program (MDP) carried toxin gene subtypes stx1a and/or stx2a. Here we described the genome sequences of 331 STECs isolated from foods regulated by the FDA 2010–2017, and determined their genomic identity, serotype, sequence type, virulence potential, and prevalence of antimicrobial resistance. Isolates were selected from the MDP archive, routine food testing by FDA field labs (ORA), and food testing by a contract company. Only 276 (83%) strains were confirmed as STECs by in silico analysis. Foods from which STECs were recovered included cilantro (6%), spinach (25%), lettuce (11%), and flour (9%). Phylogenetic analysis using core genome MLST revealed these STEC genomes were highly variable, with some clustering associated with ST types and serotypes. We detected 95 different sequence types (ST); several ST were previously associated with HUS: ST21 and ST29 (O26:H11), ST11 (O157:H7), ST33 (O91:H14), ST17 (O103:H2), and ST16 (O111:H-). in silico virulome analyses showed ~ 51% of these strains were potentially pathogenic [besides stx gene they also carried eae (25%) or 26% saa (26%)]. Virulence gene prevalence was also determined: stx1 only (19%); stx2 only (66%); and stx1/sxt2 (15%). Our data form a new WGS dataset that can be used to support food safety investigations and monitor the recurrence/emergence of E. coli in foods.
Introduction
Shiga toxin-producing Escherichia coli (STECs) have the potential to cause infections, from mild to life-threatening outcomes such as hemolytic uremic syndrome (HUS). STECs causing HUS, hemorrhagic colitis and bloody diarrhea are known as enterohemorrhagic E. coli (EHEC). Among the most common EHECs are O157:H7, O26, O121, O103, O111, and O145. O157:H7 strains are responsible for most foodborne outbreaks in the last two decades [1] while non-O157 serogroups, O26, O121, O103, O111, and O145 are the second most common cause of EHEC foodborne infections in the US [2,3] and worldwide [4–7]. Each year in the US, O157:H7 causes approximately 95,000 cases with 2,150 hospitalizations, while non-O157 STECs are responsible for an estimated 170,000 cases [3]. These serotypes carry Shiga toxin genes (stx1 and/or stx2) and there are at least 130 EHEC serotypes that have been recovered from human patients. In 2011 the US Department of Agriculture Food Safety and Inspection Services (USDA FSIS), declared O26 and five other non-O157 serogroups, O45, O103, O111, O121, and O145 as adulterants in ground beef and non-intact beef products, and in mid-2012 began testing for these pathogens in both domestic and imported beef trimmings [8].
In order to cause illness, STEC strains need a set of genes that allow them to attach, colonize, and produce and secrete Shiga toxin protein [9–12]. STECs have the capacity to produce attaching and effacing (A/E) lesions on intestinal mucosa, mediated by proteins found on the locus of enterocyte effacement (LEE) Pathogenicity Island. These include EAE protein (intimin, coded by the eae gene), secreted effector proteins (Esp), a TIR receptor, and other T3SS effectors present in the LEE island. However, some STECs do not carry eae (LEE-negative STEC strains). They possess other genes believed to compensate for the lack of eae or LEE island (e.g. saa) and still caused sporadic cases of HUS [13,14]. Other putative virulence genes (ehxA, espP, etpD, toxB, katP, subA, saa, and sab genes, among others) are usually located in a plasmid referred to as the virulence plasmid to differentiate it from other possible plasmids that can be carried by the same strain [9,11,15]. All STECs have a virulence plasmid, although differing in sizes and gene content (either virulence or plasmid maintenance genes). Although the precise role of ehxA in STEC pathogenesis remains to be elucidated, several studies indicate an association of ehxA in clinical disease since 1) ehxA was found to be produced by many STEC associated with diarrheal disease and HUS [16–18], and 2) serum samples from HUS patients have been shown to react specifically to ehxA [19].
STECs can be transmitted by various means with food remaining the predominant transmission route [1]. Among the illnesses caused by STECs in FDA regulated food products (FRFDA), fresh produce has been implicated in several outbreaks, as well as some other atypical commodities, such as flour [20]. Leafy greens and other agricultural food crops are particularly susceptible to contamination since they are grown in close contact with the ground where runoff from livestock areas, particularly cattle, contaminated irrigation water, manure used as fertilizer, and the intrusion of wildlife into growing fields can occur [21]. Many of these same items are consumed raw and possibly with little cleaning. Noteworthy E. coli outbreaks reported by the Center of Disease and Control (CDC) in the US in the last 10 years are: in 2009—beef (O157:H7) and prepackaged cookie dough (O157:H7); in 2010—cheese (O157:H7), romaine lettuce (O145) and beef (O157:H7); in 2011—romaine lettuce (O157:H7), Lebanon bologna (O157:H7), and in-shell hazelnuts (O157:H7); in 2012—spinach and spring mix blend (O157:H7), unknown source (O145), and raw clover sprouts (O26); in 2013—ready-to-eat salads (O157:H7), and frozen food products (O121); in 2014—raw clover sprouts (O121), and ground beef (O157:H7); in 2015—rotisserie chicken salad (O157:H7), and Mexican-style restaurant chain (O26); in 2016—flour (O121 and O26), and alfalfa sprouts (O157); 2017—leafy greens (O157:H7), and soy nut butter (O157:H7); and in 2018—there has been an outbreak linked to romaine lettuce caused by O157:H7 (https://www.cdc.gov/ecoli/outbreaks.html).
Beyond the noted outbreaks, there have been several reports on STECs found in FRFDA [22,23]. The most comprehensive survey was the USDA Microbiological Data Program (MDP) that collected domestic and imported fresh fruit and vegetable samples from primarily terminal markets and wholesale distribution centers from 2001–2012 (https://www.ams.usda.gov/datasets/mdp/mdp-program-data-and-reports). This program tested approximately 15,000 samples annually, and tested for the presence of Salmonella, E. coli O157:H7, and other STECs. STEC were most frequently found in spinach samples (0.5%), and of the 132 isolated STECs, 9% were found to carry eae. The most prevalent Shiga toxin variants found were stx1a (22%) and/or stx2a (56%) [23]. However, little other information about the genome content of those strains is publicly available.
Whole genome sequencing (WGS) technology is reshaping food safety and food-borne illness investigations [24]. The use of WGS is becoming more useful as the cost of bacterial genome sequencing decreases every year. WGS’s cost per bacterial sequence is now comparable to PFGE. There are many attractive attributes with regards to the use of WGS in analyzing food samples including the potential to identify all pathogens present in that sample [25]. Other applications of WGS may include assistance in the following: identifying genes that allow for resistance/survival or virulence of certain bacterial strains [26–28], establishing phylogenetic relationships among old strains of STECs isolated from either clinical cases or environmental samples [7,29,30], and identifying matches between environmental and outbreak strains during outbreak scenarios [26,30–32]. Furthermore, using WGS can help in identifying matches among bacterial strains isolated from environmental samples in production facilities and locating contamination sources [33]. It can also be extremely helpful in establishing a mechanism of evolution among pathogens [34]. For example, the 2011 outbreak in Germany was linked to fenugreek seeds caused by an E. coli strain with a genomic backbone and virulence traits of entero-aggregative E. coli (EAEC) but had acquired a stx phage (stx2a gene variant) and caused a more aggressive disease with high HUS rate cases [31,35]. This event highlighted the high plasticity of the E. coli genomes and it constitutes a warning of the possible rise of new “hybrid pathotype” strains.
Therefore, we wanted to further characterize and catalog historical strains of STECs isolated from FRFDA by performing WGS analysis of every STEC strain isolated by the MDP and other FDA surveillance programs, as well as some FDA historical isolates. Most STEC strains were isolated during active surveillance programs and were not outbreak related with the exception of some strains isolated during a flour outbreak of 2016 (ST155 –O121:H19). This work establishes the first genomic dataset of FRFDA STEC strains, which in turn, will allow improved surveillance for both recurrence and the emergence of new strains that may impact our food supply. A total of 296 presumptive STECs were isolated during 2010–2017, and 35 additional STECs were historical isolates from our collection. The 331 presumptive STEC strains were analyzed for virulence genes [encompassing all E. coli virulent types—STEC, entero-pathogenic E. coli (EPEC), entero-toxigenic E. coli (ETEC), entero-invasive E. coli (EIEC), and EAEC], in silico MLST, and antibiotic resistance genes. Finally, their phylogenetic relationships and diversity were determined by whole genome phylogeny analysis using an allele-based whole genome multilocus sequence analysis (MLST) or core genome MLST analysis (cgMLST).
Materials and methods
Bacterial strains and media
E. coli (n = 331) presumptive Shiga toxin-positive strains used in this study are listed in S1 Table. Each strain was assigned a CFSAN number for future tracking. The FRFDA strains were isolated by MDP program (n = 196), FDA Office of Regulatory Affairs (ORA) laboratories (n = 74) during food surveillance testing, and a contracting lab (n = 63) during 2012–2017 in the US.
DNA preparation
Genomic DNA from each strain was isolated from overnight cultures using the DNeasy Blood and Tissue Kit (QIAGEN, Valencia, CA), following the manufacturer’s instructions. The resultant DNA extract was stored at -20°C until used as a template for whole genome sequencing. The concentration was determined using a Qubit double-stranded DNA HS assay kit and a Qubit 2.0 fluorometer (Thermo Fisher Scientific, Waltham, MA), according to the manufacturer's instructions.
Whole genome sequencing, contig assembly and annotation
The genomes of the strains sequenced in our laboratory used an Illumina MiSeq sequencer (Illumina, San Diego, CA), with the 2x250 bp pair-end chemistry according to the manufacturer’s instructions, at approximately 80X average coverage. The genome libraries were constructed using the Nextera XT DNA sample prep kit (Illumina). Genomic sequence contigs were de novo assembled using default settings within CLC Genomics Workbench v9.5.2 (QIAGEN) with a minimum contig size threshold of 500 bp in length.
in silico MLST phylogenetic analysis
The initial analysis and identification of the strains were performed using an in silico E. coli MLST approach, based on the information available at the E. coli MLST website (http://mlst.warwick.ac.uk/mlst/dbs/Ecoli) and using Ridom SeqSphere+ software v2.4.0 (Ridom; Münster, Germany) (http://www.ridom.com/seqsphere). Seven housekeeping genes (dnaE, gyrB, recA, dtdS, pntA, pyrC, and tnaA), described previously for E. coli [36], were used for MLST analysis. The same E. coli MLST database was also used to assign numbers for alleles and STs.
in silico serotyping, determination of virulence genes, and antimicrobial resistance genes identification
We used Ridom software for performing batch screening of determinants for each of the sequenced genomes.
The serotype of each strain analyzed in this study was confirmed using the genes deposited in the Center for Genomic Epidemiology (http://www.genomicepidemiology.org) for E. coli as part of their web-based serotyping tool (SerotypeFinder 1.1 - https://cge.cbs.dtu.dk/services/SerotypeFinder) [37,37,38]. Each whole genome sequence was screened for O-type or H-type genes. The virulence genes present in each strain were determined using the genes deposited in the Center for Genomic Epidemiology (http://www.genomicepidemiology.org) for E. coli as part of their VirulenceFinder 1.5 web-based tool (https://cge.cbs.dtu.dk/services/VirulenceFinder) [38]. Each WGS was screened for the presence of 102 virulence genes (95 virulence genes previously reported here [27] plus 7 additional genes) (S2 Table). These 102 virulence genes included a repertoire representing genes found in different E. coli pathotypes (ETEC, STEC, EAEC, and EPEC) in order to detect any possible E. coli hybrid present. Antimicrobial resistance genes present in each sequenced genomes were identified by using the genes deposited in the Center for Genomic Epidemiology (http://www.genomicepidemiology.org) as part of their Resfinder 2.1 web-based tool (https://cge.cbs.dtu.dk/services/ResFinder) [39]. Each WGS was screened for the presence of those AMR genes reported in the database.
Phylogenetic relationship of the strains by cgMLST analysis
The phylogenetic relationship of the strains was assessed by a core genome multilocus sequence typing (cgMLST) analysis using Ridom SeqSphere+ software v2.4.0. The genome of O157:H7 strain Sakai (NC_002695.1) was used as the reference for the cgMLST. This E. coli strain has 5,204 genes, of which 3,860 genes (core genes) were present in the six genomes used as comparison to generate the cgMLST scheme (NC_011353.1 –O157:H7 strain EC4115, NC_002655.2—O157:H7 strain EDL933, NC_013008.1 –O157:H7 strain TW14359, NC_013941.1—O55:H7 strain CB9615, NC_017656.1—O55:H7 strain RM12579, and NC_017906.1 –O157:H7 strain Xuzhou21). While 791 genes were found in some of the compared genomes, the remaining of the genes were eliminated from the analysis since they were paralogous or pseudogenes. Therefore, a total of 4,651 genes were used as templates for the analysis of the STECs from this study. After eliminating loci that were missing from the genome of any strain used in our analyses, we performed a cgMLST analysis. These remaining loci were considered the core genome shared by the analyzed strains. We used Nei’s DNA distance method [40] for calculating the matrix of genetic distance, taking into consideration only the number of same/different alleles in the core genes. A Neighbor-Joining (NJ) tree using the appropriate genetic distances was built after the cgMLST analysis. The discriminatory index was calculated with the Ridom software using the Simpson’s discriminatory index as described [41]; cgMLST uses the alleles number of each loci for determining the genetic distance and builds the phylogenetic tree. The use of allele numbers reduces the influence of recombination in the dataset studied and allows for fast clustering determination of genomes.
Nucleotide sequence accession numbers
The draft genome sequences of 331 E. coli strains used in our study are available in GenBank under the accession numbers listed in S1 Table.
Results
Presence of STEC in FDA regulated foods
Among 331 suspected STEC strains isolated from FRFDA between 2003–2017 and sequenced by several labs and deposited at NCBI, only 276 were confirmed to be STECs by in silico analysis for the presence of either stx1 or stx2 (S1 Table). Those that were stx negatives either might have lost their phages or were initially misidentified. Of the 196 identified by MDP and sequenced by our lab, 92% carried either stx1 or 2 (181/196). Of the 74 strains which genomes were retrieved from NCBI and were initially isolated and sequenced by FDA ORA, 94% carried either stx1 or 2 (70/74). Of the 63 E. coli strains isolated and sequenced by a FDA contracting laboratory, 43% carried either stx1 or 2 (25/61). The frequency of isolation of STECs from foods are listed in Table 1. STECs were isolated from 22 food commodities. Most of these STECs were isolated from spinach (32%), flour (21%), lettuce (13%), and cilantro (12%).
Table 1. Frequency of STEC isolated by food commodity.
Commodities | No. strains | % |
---|---|---|
Spinach | 88 | 31.88 |
Flour | 59 | 21.38 |
Lettuce | 35 | 12.68 |
Cilantro (coriander) | 32 | 11.59 |
Cheese | 9 | 3.26 |
Leafy greens | 8 | 2.90 |
Kale | 7 | 2.54 |
Basil | 6 | 2.17 |
Pepper | 6 | 2.17 |
Alfalfa sprouts | 3 | 1.09 |
Cantaloupe | 3 | 1.09 |
Parsley | 3 | 1.09 |
Tomatoes | 3 | 1.09 |
Creamy soy Nut butter | 2 | 0.72 |
pizza dough dry mix | 2 | 0.72 |
sprouts | 2 | 0.72 |
Almond | 1 | 0.36 |
oats animal feed | 1 | 0.36 |
Celery | 1 | 0.36 |
Clover sprouts | 1 | 0.36 |
cucumbers | 1 | 0.36 |
animal feed | 1 | 0.36 |
enviromental | 2 | 0.72 |
Total | 276 | 100.00 |
The stx-negative strains were eliminated from this analysis.
The frequency of STEC isolation per year, their sequence type, food commodity and state of isolation (if available) is listed in Table 2. The minimum number of STECs recovered from FDA regulated foods per year during the period 2010–2017 was 12 in 2012 and the maximum was 53 in 2016, for a median of 30 STECs per year. We used 2010 as our starting year, since the number of strains prior to that year were found sporadically and came from our STEC historical collection. The STEC analyzed strains were isolated in 22 states.
Table 2. Frequency of STECs isolated from FDA regulated food commodities per year from 2010–2017.
Yeara | Number of STECs Isolated | % | STs | Commodities | State |
---|---|---|---|---|---|
2003 | 3 | 1.09 | 3017, 641, 446 | lettuce, celery, tomato | TX |
2004 | 4 | 1.45 | 655, 205, 642 | lettuce, cantaloupe, cilantro | TX, CA, MD |
2005 | 1 | 0.36 | 677 | lettuce | CA |
2006 | 4 | 1.45 | 33,211,764,496 | alfalfa sprouts, lettuce | MI, CA, MN |
2007 | 2 | 0.72 | 297, 295 | cantaloupe, lettuce | NY, OH |
2008 | 10 | 3.62 | 21, 329, 6475, 642, 2217, 4496, 2008, 1385 | spinach, lettuce | CA, WA, MI |
2009 | 11 | 3.99 | 223, 58, 11, 6509, 718, 4496, 2389, 6638, 2008, 5530 | spinach, lettuce, flour | FL, CA, OH, MI, TX, MD, WI, NY |
2010 | 34 | 12.32 | 6640, 2520, 154, 661, 443, 205, 718, 173, 2387, 10, 706, 692, 4173, 1431, 4496, 5299, 3, 295, 5435, 2008, 1727 | spinach, lettuce, cilantro, sprouts, hot pepper, tomato | TX, CA, FL, WI, NY, CO, WA, |
2011 | 38 | 13.77 | 6642, 942, 297, 6641, 955, 11, 679, 6639, 2161, 21, 692, 2217, 4496, 88, 306, 5435, 205, 2008, 5973, 691, 724 | spinach, cantaloupe, lettuce, almond, alfalfa sprouts, cilantro, hot pepper | MI, MN, CA, TX, OH, FL, CO |
2012 | 40 | 14.49 | 993, 16, 223, 5975, 2520, 443, 1611, 119, 718, 677, 297, 173, 2387, 101, 21, 5602, 747, 906, 5395, 2385, 4496, 101, 442, 295 | cilantro, flour, spinach, lettuce, cherry tomatoes | TX, FL, NC, NY, CO, OH, MI, CA, WA |
2013 | 13 | 4.71 | 223, 297, 325, 2388, 1611, 394, 677, 156, 937, 2385, 35, 515 | cilantro, basil, sprouts, parsley, flour | GA, CA, TX, TN, AZ |
2014 | 36 | 13.04 | 993, 17, 16, 2520, 297, 329, 442, 679, 677, 297, 173, 657, 2387, 21, 342, 43, 906, 675, 6632, 2385, 88, 3759, 2217, 306, 29, 5960, 10 | lettuce, kale, cheese, clover sprouts, animal feed, basil, Leafy greens, spinach | OH, CA, WI, AZ, WA, KY, OR, PA |
2015 | 12 | 4.35 | 723, 1967, 1817, 25, 398, 32, 442, 205, 446, 21, 342, 40 | environmental, flour, lettuce, Leafy greens, spinach, pepper, kale | WA, NE, AZ, CA, OR |
2016 | 53 | 19.20 | 655, 747, 723, 4496, 154, 17, 1792, 297, 21, 33 | flour, kale, spinach | MO, CO, MI, OK, CA |
2017 | 15 | 5.43 | 1112, 5082, 662, 1086, 162 | cucumbers, soy nut butter, pepper, flour | KY |
Total | 276 | 100.00 |
a- STECs isolated during years 2003–2009 are included as historical STECs and their prevalence was not used for determining the STEC frequency in foods regulated by the FDA per year.
Characterization of STEC strains by in silico MLST
Among the 276 STECs analyzed in this study we identified 95 different sequence types (STs) by in silico MLST. Strains belonging to a ST were isolated between 1 to 12 times in the period studied (2010–2017) (Tables 3 and S3). The majority of the STs identified were observed only one time (45%).
Table 3. STs observed and number of strains included in each ST.
STa | No. Strains | % | Human cases | Reported as EHECb | Known serotypes |
---|---|---|---|---|---|
655 | 38 | 13.67 | + | + | O121:H19 |
4496 | 12 | 4.32 | + | NR | O8:H28 |
2008 | 12 | 4.32 | + | NR | Ounk:H2/40 |
21 | 8 | 2.88 | + | + | O26:H11/- |
297 | 9 | 3.24 | + | - (UPEC, APEC) | diverse serotypes |
205 | 7 | 2.52 | + | - (UPEC) | NR |
43 | 5 | 1.80 | + | - (ETEC, EAEC) | O6:H10 |
154 | 5 | 1.80 | + | - (EPEC, APEC) | diverse serotypes |
173 | 5 | 1.80 | + | NR | O181:H49 |
295 | 5 | 1.80 | + | - (EAEC, UPEC, ExPEC) | diverse serotypes |
677 | 5 | 1.80 | + | + | diverse serotypes |
747 | 8 | 2.88 | + | - (ETEC) | diverse serotypes |
11 | 3 | 1.08 | + | + | O157:H7/- |
16 | 2 | 0.72 | + | + | O111:H8/2/- |
17 | 2 | 0.72 | + | + | O103:H2/- |
25 | 1 | 0.36 | + | + | O128:H2 |
29 | 1 | 0.36 | + | + (also EPEC) | O26:H11 |
32 | 1 | 0.36 | + | + | O145:H- |
33 | 2 | 0.72 | + | + | O91:H14 |
119 | 1 | 0.36 | + | + | O165:H25/28 |
223 | 4 | 1.44 | + | + (also UPEC, EAEC) | diverse serotypes |
306 | 2 | 0.72 | + | + | O84:H2/K+ |
329 | 2 | 0.72 | + | + (also EAEC) | diverse serotypes |
657 | 1 | 0.36 | + | + | diverse serotypes |
675 | 1 | 0.36 | + | + | O76:H19 |
679 | 3 | 1.08 | + | + | O163:H19 |
724 | 1 | 0.36 | + | + | O154/Ounk:H20 |
5299 | 2 | 0.72 | + | NR | O8:H49 |
325 | 1 | 0.36 | + | NR | O15:H16/K+ |
6639 | 1 | 0.36 | + | NR | O174:H21/36 |
661 | 2 | 0.72 | + | NR | O174:H2 |
662 | 2 | 0.72 | + | NR | diverse serotypes |
691 | 2 | 0.72 | + | NR | diverse serotypes |
692 | 3 | 1.08 | + | NR | O74:H42 |
718 | 3 | 1.08 | + | NR | O168:H8 |
723 | 4 | 1.44 | + | NR | O103:H11 |
942 | 1 | 0.36 | + | NR | O116:H28 |
955 | 1 | 0.36 | + | NR | O139:H1/6 |
993 | 2 | 0.72 | + | NR | O100:H30 |
1792 | 1 | 0.36 | + | NR | O111:H8 |
1817 | 1 | 0.36 | + | NR | O104:H7 |
1967 | 2 | 0.72 | + | NR | O103:H2 |
2388 | 1 | 0.36 | + | NR | O15 |
2520 | 3 | 1.08 | + | NR | O116:H49 |
3759 | 1 | 0.36 | + | NR | NR |
5973 | 2 | 0.72 | + | NR | Ounk:H2 |
6475 | 2 | 0.72 | + | NR | O17/077:H45 |
10 | 2 | 0.72 | + | - (mainly EAEC, UPEC, ETEC) | diverse serotypes |
35 | 1 | 0.36 | + | - (EPEC or UPEC) | O154:H4, O145:H34/31 |
40 | 1 | 0.36 | + | - (EAEC) | O111ac:H21 |
58 | 1 | 0.36 | + | - (EAEC, UPEC, ExPEC, APEC) | diverse serotypes |
88 | 2 | 0.72 | + | - (EAEC, UPEC, ExPEC, APEC) | diverse serotypes |
101 | 3 | 1.08 | + | - (EAEC, UPEC, ExPEC, APEC) | diverse serotypes |
156 | 1 | 0.36 | + | - (UPEC, EXPEC) | diverse serotypes |
162 | 1 | 0.36 | + | - (UPEC, APEC) | O8:H19 |
342 | 2 | 0.72 | + | - (EPEC) | O177:NM |
394 | 1 | 0.36 | + | - (EAEC, UPEC) | diverse serotypes |
398 | 1 | 0.36 | + | - (ExPEC) | diverse serotypes |
442 | 3 | 1.08 | + | - (EPEC) | O146:H21 |
443 | 2 | 0.72 | + | - (UPEC) | NR |
446 | 2 | 0.72 | + | - (APEC) | diverse serotypes |
515 | 1 | 0.36 | + | - (EAEC) | O2:H9 |
641 | 1 | 0.36 | + | - (ExPEC) | diverse serotypes |
642 | 3 | 1.08 | + | - (EPEC) | diverse serotypes |
706 | 1 | 0.36 | + | - (UPEC) | diverse serotypes |
906 | 3 | 1.08 | + | - (UPEC) | diverse serotypes |
1431 | 1 | 0.36 | + | - (ExPEC) | O8:H19/30 |
1727 | 4 | 1.44 | + | - (mostly nonpathogen) | diverse serotypes |
937 | 1 | 0.36 | + | - (ExPEC, non pathogen) | O43:H2 |
1385 | 1 | 0.36 | - | - (APEC) | Ounk:H4 |
1611 | 2 | 0.72 | - | - (APEC) | diverse serotypes |
5082 | 1 | 0.36 | - | NR | NR |
332 | 1 | 0.36 | - | NR | O171:H2 |
5395 | 1 | 0.36 | - | NR | O74:H8 |
5435 | 2 | 0.72 | - | NR | Ounk:H16 |
5530 | 1 | 0.36 | - | NR | Ounk:H21 |
5602 | 2 | 0.72 | - | NR | 36:H28 |
5960 | 1 | 0.36 | - | NR | NR |
5975 | 1 | 0.36 | - | NR | O113:H21 |
6509 | 1 | 0.36 | - | NR | O168:H8 |
6632 | 1 | 0.36 | - | NR | O8:H16 |
6638 | 1 | 0.36 | - | NR | Ounk:H19 |
3017 | 1 | 0.36 | - | NR | O116:H21 |
6640 | 1 | 0.36 | - | NR | O113:H21 |
6641 | 1 | 0.36 | - | NR | O130:H11 |
6642 | 1 | 0.36 | - | NR | O113:H21 |
1112 | 1 | 0.36 | - | - (nonpathogen) | diverse serotypes |
1176 | 1 | 0.36 | - | - (nonpathogen) | O36:H14 |
2161 | 1 | 0.36 | - | - (nonpathogen) | O180:H14 |
2217 | 4 | 1.44 | - | - (nonpathogen) | diverse serotypes |
2389 | 1 | 0.36 | - | - (nonpathogen) | NR |
1086 | 10 | 3.60 | - | - (nonpathogen) | diverse serotypes |
2385 | 7 | 2.52 | - | - (nonpathogen) | O8:H19 |
2387 | 8 | 2.88 | - | - (nonpathogen) | O185:H7 |
4173 | 2 | 0.72 | - | - (nonpathogen) | O79:H2 |
NR- not reported, UPEC (uropathogenic E. coli), EPEC (enteropathogenic E. coli), ETEC (Enterotoxigenic E. coli), APEC (Avian pathogenic E. coli), EAEC (Enteroaggregative E. coli) and ExPEC (Extraintestinal pathogenic E. coli).
a-Determined by in silico analysis of the WGS assemblies.
b-when negative, the reported E. coli type is stated.
Characterization of STEC strains by serotyping, and virulence gene profiles
However, a strain belonging to a known ST that caused hemorrhagic colitis (HC) is not enough to predict the likelihood of it causing disease illness. Therefore, we further characterized these STECs by in silico virulence determination and their predicted serotype (Table 4). The detailed in silico analysis for presence of virulence genes and serotype is listed in S2 Table. Table 4 lists only the serotype and some of the most known virulence genes for each strain: stx1 variants, stx2 variants, eae variants, ehxA, espP, etpD, toxB, katP, subA, saa, and sab. We identified at least 81 different serotypes among the 276 STECs sequenced (Table 4). Many of the O types were not present in our O types database and were listed as unknown or the wzx and wzy gene were not assembled. Among those some of the most common clinical STECs serotypes were identified, such as: O157:H7, O26:H11, O113:H21, O121:H19, O91:H21, O103:H2, and O111:H8. Other serotypes like O153/O178:H19 could not be discerned by in silico analysis since both O types have identical wzx and wzy gene sequences. This issue is discussed in more detail elsewhere [37].
Table 4. in silico characterization of STECs from this study for presence of virulence genes and their serotype.
strains | ST | serotype | stx1 type | stx2 type | eae type | ehxA | espP | etpD | toxB | katP | subA | saa | sab |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CFSAN041120 | 10 | O2:H27 | - | a | - | + | - | - | - | - | - | - | - |
CFSAN051538 | 10 | Ounk:H32 | a | - | - | - | + | - | - | - | - | - | - |
CFSAN046715 | 11 | O157:H7 | - | a | gamma-1 | + | + | + | + | + | - | - | - |
CFSAN046720 | 11 | O157:H7 | - | a | gamma-1 | + | + | + | + | + | - | - | - |
FDA00009839 | 11 | O157:H7 | - | a | gamma-1 | + | + | + | - | + | - | - | - |
CFSAN053336 | 16 | O111:H8 | a | - | theta-2 | + | - | - | - | - | - | - | - |
CFSAN053346 | 16 | O111:H8 | a | a | theta-2 | + | - | - | - | - | - | - | - |
FDA00010429 | 17 | O103:H2 | a | - | epsilon | + | - | + | - | - | - | - | - |
IEH-NGS-ECO-00075 | 17 | O103:H2 | a | - | epsilon | + | + | - | + | - | - | - | - |
CFSAN046724 | 21 | O26:H11 | a | - | beta-1 | + | + | - | - | + | - | - | - |
CFSAN046724 | 21 | O26:H11 | a | - | beta-1 | + | - | - | - | + | - | - | - |
CFSAN053342 | 21 | O103:H11 | a | - | beta-1 | + | + | - | + | + | - | - | - |
CFSAN053343 | 21 | O26:H11 | a | - | beta-1 | + | + | - | + | + | - | - | - |
CFSAN053345 | 21 | O26:H11 | a | - | beta-1 | + | + | - | + | + | - | - | - |
FDA00010430 | 21 | O26:H11 | a | - | beta-1 | - | + | - | - | + | - | - | - |
IEH-NGS-ECO-00076 | 21 | O26:H11 | a | - | beta-1 | + | + | - | + | + | - | - | - |
IEH-NGS-ECO-00213 | 21 | O26:H11 | a | - | beta-1 | + | + | - | + | - | - | - | - |
IEH-NGS-ECO-00227 | 25 | O128ac:H2 | c | - | - | - | - | - | - | - | + | - | + |
IEH-NGS-ECO-00125 | 29 | Ounk:H11 | - | a | beta-1 | + | + | - | - | - | - | - | - |
IEH-NGS-ECO-00224 | 32 | O145:H28 | a | d | gamma-1 | + | + | - | + | + | - | - | - |
CFSAN051773 | 33 | O91:H14 | a | d | - | + | - | - | - | - | + | + | + |
CFSAN053344 | 33 | O91:H14 | a | d | - | + | - | - | - | - | + | - | + |
IEH-NGS-ECO-00232 | 40 | Ounk:H21 | c | - | - | - | - | - | - | + | - | - | - |
CFSAN051526 | 43 | O6:H10 | c | - | - | - | - | - | - | + | - | - | - |
CFSAN051527 | 43 | O6:H10 | c | - | - | - | - | - | - | + | - | - | - |
CFSAN051533 | 43 | O6:H10 | c | - | - | - | - | - | - | + | - | - | - |
CFSAN051535 | 43 | O6:H10 | c | - | - | - | - | - | - | - | - | - | - |
CFSAN051537 | 43 | O6:H10 | c | - | - | - | - | - | - | + | - | - | - |
CFSAN046659 | 58 | O116:H21 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046700 | 88 | O8:H9 | - | e | - | - | - | - | - | - | - | - | - |
CFSAN051531 | 88 | O8:H30 | - | e | - | - | - | - | - | - | - | - | - |
CFSAN046737 | 101 | O82:H8 | - | a | - | + | + | - | - | - | - | + | + |
CFSAN046738 | 101 | O82:H8 | - | a | - | + | + | - | - | - | - | + | + |
CFSAN046749 | 101 | O21:H21 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046750 | 119 | O165:H28 | a | a | epsilon-2 | + | + | - | - | + | - | - | - |
CFSAN046672 | 154 | O88:H25 | a | a | - | + | - | - | - | - | - | + | + |
CFSAN046673 | 154 | O88:H25 | a | a | - | + | - | - | - | - | - | + | + |
CFSAN046682 | 154 | O134:H38 | a | d | - | + | - | - | - | - | - | + | + |
CFSAN056112 | 154 | O88:H25 | a | a | - | + | - | - | - | - | - | + | + |
CFSAN056113 | 154 | O88:H25 | a | a | - | + | - | - | - | - | - | + | + |
CFSAN051557 | 156 | O174:H28 | - | d | - | - | - | - | - | - | - | - | - |
FDA00011520 | 162 | O8:H19 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046678 | 173 | O181:H49 | a | a | - | + | + | - | - | - | + | + | + |
CFSAN046740 | 173 | O181:H49 | - | d | - | + | + | - | - | - | + | + | + |
CFSAN046747 | 173 | O181:H49 | - | d | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00108 | 173 | O181:H49 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046751 | 173 | O181:H49 | - | d | - | + | - | - | - | - | + | + | + |
CFSAN041109 | 205 | Ounk:H19 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046650 | 205 | O153/O178:H19 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046690 | 205 | O153/O178:H19 | a | a | - | + | + | - | - | - | + | + | + |
CFSAN046704 | 205 | Ounk:H19 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN051550 | 205 | Ounk:H19 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN051551 | 205 | Ounk:H19 | - | a | - | + | + | - | - | - | + | + | + |
FDA00009731 | 205 | O153/O178:H19 | a | d | - | + | - | - | - | - | - | + | + |
CFSAN046660 | 223 | O113:H21 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046665 | 223 | O113:H21 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN053329 | 223 | O113:H21 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046734 | 223 | O113:H21 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN041114 | 295 | Ounk:H16 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046640 | 295 | Ounk:H16 | c | b | - | - | - | - | - | - | - | - | + |
CFSAN046691 | 295 | Ounk:H11 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN053347 | 295 | Ounk:H16 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046735 | 295 | Ounk:H11 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN041108 | 297 | O130:H11 | a | d | - | + | - | - | - | - | + | + | + |
CFSAN046639 | 297 | O130:H11 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046719 | 297 | O130:H11 | - | d | - | + | - | - | - | - | + | + | + |
CFSAN046753 | 297 | O179:H8 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN051516 | 297 | O130:H11 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN051524 | 297 | O130:H11 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN051549 | 297 | O130:H11 | a | d | - | + | - | - | - | - | + | + | + |
CFSAN056111 | 297 | O130:H11 | - | a | - | + | - | - | - | - | + | + | + |
IEH-NGS-ECO-00087 | 297 | O179:H8 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046717 | 306 | O98:H21 | a | - | zeta | + | + | - | - | - | - | - | - |
CFSAN051525 | 306 | O98:H21 | a | - | zeta | + | + | - | - | - | - | - | - |
CFSAN051544 | 325 | O15:H16 | - | g | - | - | - | - | - | - | - | - | - |
CFSAN046643 | 329 | O136:H16 | a | - | - | + | - | - | - | - | - | - | - |
CFSAN053337 | 329 | O136:H16 | a | - | - | + | - | - | - | - | - | - | - |
CFSAN046636 | 332 | O171:H2 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN053335 | 342 | O5:Hunk | a | - | beta-1 | + | + | - | - | - | - | - | - |
IEH-NGS-ECO-00230 | 342 | O5:Hunk | a | - | beta-1 | + | - | - | - | - | - | - | - |
CFSAN053330 | 394 | O17/O77:H18 | - | d | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00231 | 398 | O136:H20 | c | - | - | - | - | - | - | + | - | - | - |
CFSAN046746 | 442 | O91:H21 | - | a | - | + | - | - | - | - | - | + | + |
CFSAN051529 | 442 | O146:H21 | c | b | - | + | - | - | - | - | + | - | + |
CFSAN051540 | 442 | O146:H21 | c | - | - | + | - | - | - | - | + | - | + |
CFSAN046688 | 443 | O153/O178:H19 | a | d | - | + | - | - | - | - | - | + | + |
CFSAN046752 | 443 | O153/O178:H19 | a | d | - | - | - | - | - | - | - | + | + |
CFSAN046631 | 446 | O22:H8 | - | c | - | - | - | - | - | - | - | - | + |
FDA00009425 | 446 | O22:H8 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN053334 | 515 | Ounk:H29 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046632 | 641 | O117:H10 | a | - | - | - | + | - | - | - | - | + | - |
CFSAN046646 | 642 | O187:H52 | c | - | - | - | - | - | - | - | - | - | - |
CFSAN046648 | 642 | O187:H52 | c | - | - | - | - | - | - | - | - | - | - |
CFSAN046649 | 642 | O187:H52 | c | - | - | - | - | - | - | - | - | - | - |
CFSAN046651 | 655 | O121:H19 | - | a | epsilon-2 | + | - | - | - | - | - | - | - |
FDA00010253 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010254 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010255 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010256 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010257 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010258 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010259 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010276 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010277 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010278 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010279 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010280 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010281 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010282 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010283 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010284 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010285 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010296 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010297 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010298 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010299 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010300 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010301 | 655 | O121:H19 | - | a | epsilon-2 | - | + | - | - | - | - | - | - |
FDA00010302 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010303 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010304 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010305 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010306 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010307 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010308 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010309 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
FDA00010310 | 655 | O121:H19 | - | a | epsilon-2 | - | + | - | - | - | - | - | - |
FDA00010369 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010370 | 655 | O121:H19 | - | a | epsilon-2 | - | + | - | - | - | - | - | - |
FDA00010371 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010372 | 655 | O121:H19 | - | a | epsilon-2 | + | + | - | - | - | - | - | - |
FDA00010373 | 655 | O121:H19 | - | a | epsilon-2 | - | - | - | - | - | - | - | - |
IEH-NGS-ECO-00103 | 657 | O183:H18 | a | d | - | + | + | - | - | - | + | + | + |
CFSAN041119 | 661 | O174:H2 | - | c | - | + | + | - | - | - | - | + | + |
CFSAN051547 | 661 | O174:H2 | - | c | - | + | - | - | - | - | - | + | + |
FDA00011331 | 662 | O17/O77:H45 | a | d | - | + | + | - | - | - | - | + | + |
FDA00011332 | 662 | O17/O77:H45 | a | d | - | + | + | - | - | - | - | + | + |
CFSAN051530 | 675 | O76:H19 | c | - | - | + | - | - | - | - | + | - | + |
CFSAN046652 | 677 | Ounk:H21 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046748 | 677 | O174:H21 | a | d | - | + | + | - | - | - | + | + | + |
CFSAN051519 | 677 | O174:H21 | - | a | - | - | - | - | - | - | - | - | + |
CFSAN051520 | 677 | O174:H21 | - | a | - | - | - | - | - | - | - | - | + |
IEH-NGS-ECO-00080 | 677 | O174:H21 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046699 | 679 | O163:H19 | - | d | - | + | + | - | - | - | + | + | + |
CFSAN046705 | 679 | O163:H19 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN051541 | 679 | O163:H19 | a | d | - | + | + | - | - | - | + | + | + |
CFSAN053338 | 691 | Ounk:H20 | a | d | - | + | + | - | - | - | - | + | + |
CFSAN046703 | 691 | Ounk:H20 | a | - | - | + | + | - | - | - | - | + | + |
CFSAN046709 | 691 | Ounk:H20 | a | - | - | + | + | - | - | - | - | + | + |
CFSAN041113 | 692 | O74:H42 | a | d | - | + | + | - | - | - | - | + | + |
CFSAN051548 | 692 | O74:H42 | a | d | - | + | + | - | - | - | - | + | + |
CFSAN051553 | 692 | O74:H42 | a | d | - | + | + | - | - | - | - | + | + |
CFSAN046689 | 706 | O32:H1 | - | d | - | - | - | - | - | - | - | - | + |
CFSAN046666 | 718 | O168:H8 | - | a | - | + | - | - | - | - | - | - | - |
CFSAN046693 | 718 | O168:H8 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046725 | 718 | O168:H8 | - | d | - | - | - | - | - | - | - | - | - |
FDA00010428 | 723 | O103:H11 | a | - | beta-1 | - | - | - | - | + | - | - | - |
FDA00010431 | 723 | O103:H11 | a | - | beta-1 | + | - | - | - | + | - | - | - |
FDA00010457 | 723 | O103:H11 | a | - | beta-1 | + | + | - | - | + | - | - | - |
IEH-NGS-ECO-00223 | 723 | O103:H11 | a | - | beta-1 | + | + | - | + | + | - | - | - |
CFSAN046707 | 724 | Ounk:H20 | a | - | - | + | + | - | - | - | + | + | + |
CFSAN046733 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
CFSAN046741 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
CFSAN046742 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
CFSAN046743 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
FDA00010882 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
FDA00010883 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
FDA00010884 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
FDA00010885 | 747 | O17/O77:H45 | - | a | - | + | + | - | - | - | - | + | + |
CFSAN041116 | 906 | O74:H8 | - | d | - | + | + | - | - | - | + | + | + |
CFSAN051555 | 906 | O74:H8 | - | d | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00096 | 906 | O74:H8 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN051545 | 937 | O43:H2 | a | a | - | + | - | - | - | - | - | + | + |
CFSAN046721 | 942 | O116:H28 | a | - | - | + | + | - | - | - | - | + | + |
CFSAN046713 | 955 | O139:H1 | - | e | - | - | - | - | - | - | - | - | - |
CFSAN051539 | 993 | O100:H30 | - | e | - | - | - | - | - | - | - | - | - |
CFSAN046730 | 993 | O100:H30 | - | e | - | - | - | - | - | - | - | - | - |
FDA00011815 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011816 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011817 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011818 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011819 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011820 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011821 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011822 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011823 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011824 | 1086 | O8:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
FDA00011218 | 1112 | O142:H27 | - | e | - | - | - | - | - | - | - | - | - |
CFSAN046653 | 1176 | O36:H14 | - | g | - | + | - | - | - | - | - | - | - |
CFSAN046655 | 1385 | Ounk:H4 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046683 | 1431 | O8:H19 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046728 | 1611 | O159:H19 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN051515 | 1611 | O159:H19 | - | d/e | - | - | - | - | - | - | - | - | - |
CFSAN046668 | 1727 | Ounk:H7 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN046669 | 1727 | Ounk:H7 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN046671 | 1727 | Ounk:H7 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN046692 | 1727 | Ounk:H7 | - | c | - | - | - | - | - | - | - | - | - |
FDA00010432 | 1792 | O111:H8 | a | a | theta-2 | + | - | - | - | - | - | - | - |
IEH-NGS-ECO-00221 | 1817 | O104:H7 | c | - | - | - | - | - | - | + | - | - | - |
IEH-NGS-ECO-00211 | 1967 | O103:H2 | a | - | epsilon | + | + | - | - | + | - | - | - |
IEH-NGS-ECO-00212 | 1967 | O103:H2 | a | - | epsilon | + | + | - | - | - | - | - | - |
CFSAN041110 | 2008 | Ounk:H2 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN041111 | 2008 | Ounk:H2 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN041112 | 2008 | Ounk:H2 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046642 | 2008 | Ounk:H2 | - | a | - | - | - | - | - | - | - | - | - |
CFSAN046657 | 2008 | Ounk:H2 | - | d/e | - | - | - | - | - | - | - | - | - |
CFSAN046684 | 2008 | Ounk:H2 | - | d/c | - | - | - | - | - | - | - | - | - |
CFSAN046685 | 2008 | Ounk:H2 | - | d/c | - | - | - | - | - | - | - | - | - |
CFSAN046686 | 2008 | Ounk:H2 | - | d/c | - | - | - | - | - | - | - | - | - |
CFSAN046687 | 2008 | Ounk:H2 | - | d/c | - | - | - | - | - | - | - | - | - |
CFSAN046708 | 2008 | Ounk:H2 | - | d/c | - | - | - | - | - | - | - | - | - |
CFSAN046710 | 2008 | Ounk:H2 | - | d/c | - | - | - | - | - | - | - | - | - |
CFSAN051552 | 2008 | Ounk:H2 | - | d/c | - | - | - | - | - | - | - | - | - |
CFSAN046701 | 2161 | O180:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
CFSAN046641 | 2217 | O45:H16 | a | - | - | - | + | - | - | - | - | + | - |
CFSAN046712 | 2217 | O76:H21 | a | - | - | - | - | - | - | - | - | + | - |
CFSAN046714 | 2217 | O8:H16 | a | - | - | - | + | - | - | - | - | + | - |
CFSAN051522 | 2217 | O84:H38 | a | - | - | - | + | - | - | - | - | + | - |
CFSAN046723 | 2385 | O8:H19 | a | a | - | + | + | - | - | - | + | + | + |
CFSAN051517 | 2385 | O8:H19 | a | a | - | + | + | - | - | - | + | + | + |
CFSAN051518 | 2385 | O8:H19 | a | a | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00082 | 2385 | O8:H19 | a | a | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00088 | 2385 | O8:H19 | - | a | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00089 | 2385 | O8:H19 | - | a | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00090 | 2385 | O8:H19 | a | c | - | + | + | - | - | - | + | + | + |
CFSAN041115 | 2387 | O185:H7 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN041118 | 2387 | O185:H7 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN046670 | 2387 | O185:H7 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN051528 | 2387 | O185:H7 | - | c | - | + | + | - | - | - | - | - | - |
CFSAN051546 | 2387 | O185:H7 | - | c | - | + | + | - | - | - | - | - | - |
CFSAN051554 | 2387 | O185:H7 | - | c | - | - | - | - | - | - | - | - | - |
CFSAN046744 | 2387 | O185:H7 | - | c | - | - | - | - | - | - | + | + | + |
CFSAN046745 | 2387 | O185:H7 | - | c | - | - | + | - | - | - | + | + | + |
CFSAN053339 | 2388 | O15:H27 | - | d | - | - | - | - | - | - | - | - | - |
CFSAN046656 | 2389 | Ounk:H11 | - | d | - | + | + | - | - | - | + | + | + |
CFSAN046679 | 2520 | O116:H49 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046731 | 2520 | O116:H49 | - | a | - | + | + | - | - | - | + | + | + |
IEH-NGS-ECO-00204 | 2520 | O116:H49 | a | a | - | + | + | - | - | - | + | + | + |
CFSAN046633 | 3017 | O116:H21 | a | a | - | + | - | - | - | - | + | + | + |
IEH-NGS-ECO-00070 | 3759 | O8:H49 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046667 | 4173 | O79:H2 | a | - | - | - | - | - | - | - | - | - | - |
CFSAN046676 | 4173 | O79:H2 | a | - | - | - | - | - | - | - | - | - | - |
CFSAN046637 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046644 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046654 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046661 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046680 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046695 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046718 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046726 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN046727 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
FDA00009866 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
FDA00009867 | 4496 | O8:H28 | - | d/e | - | - | - | - | - | - | - | - | - |
CFSAN046729 | 4496 | O8:H28 | - | d/e | - | + | - | - | - | - | - | - | - |
FDA00011519 | 5082 | O180:H14 | - | d/e | - | - | - | - | - | - | - | - | - |
CFSAN046674 | 5299 | O8:H49 | a | - | - | - | - | - | - | - | - | + | + |
CFSAN046675 | 5299 | O8:H49 | a | - | - | + | + | - | - | - | - | + | + |
CFSAN046722 | 5395 | O74:H8 | a | - | - | + | + | - | - | - | + | + | + |
CFSAN046694 | 5435 | Ounk:H16 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046696 | 5435 | Ounk:H16 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046664 | 5530 | Ounk:H21 | - | d/e | - | - | - | - | - | - | - | - | - |
CFSAN041117 | 5602 | O36:H28 | - | g | - | + | - | - | - | - | - | - | - |
CFSAN051556 | 5602 | O36:H28 | - | g | - | + | - | - | - | - | - | - | - |
IEH-NGS-ECO-00105 | 5960 | Ounk:H19 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046702 | 5973 | Ounk:H2 | - | d | - | + | + | - | - | - | + | + | + |
CFSAN046716 | 5973 | Ounk:H2 | - | c | - | + | + | - | - | - | + | + | + |
CFSAN046739 | 5975 | O113:H21 | - | a | - | + | + | - | - | - | + | + | + |
CFSAN046645 | 6475 | O17/O77:H45 | a | a | - | + | + | - | - | - | - | + | + |
CFSAN046647 | 6475 | O17/O77:H45 | a | a | - | + | + | - | - | - | - | - | - |
CFSAN046663 | 6509 | O168:H8 | - | d/e | - | + | - | - | - | - | - | - | - |
CFSAN051521 | 6632 | O8:H16 | - | d | - | - | + | - | - | - | - | + | - |
CFSAN046662 | 6638 | Ounk:H19 | - | a | - | + | - | - | - | - | + | + | + |
CFSAN046711 | 6639 | O174:H21 | - | c | - | - | + | - | - | - | - | - | - |
CFSAN046677 | 6640 | O113:H21 | - | d | - | + | + | - | - | - | + | + | + |
CFSAN046706 | 6641 | O130:H11 | a | - | - | + | - | - | - | - | + | + | + |
CFSAN046697 | 6642 | O113:H21 | - | a | - | + | + | - | - | - | + | + | + |
Adherence factors eae and saa genes were found in 67 (24%) and 72 (26%), respectively (Table 4). Shiga toxin genes were present as follows: stx1- 53 (19%) (variants a and c), stx2- 184 (67%) (variants a, b, c, d, d/e, e, and g), while stx1+stx2–39 (15%). Among the 184 STECs carrying only stx2 genes, 144 of them carried variants 2a,d, or c; while the remaining 40 carried variants 2e, d/e, or g.Other putative virulence genes were found as follows: exhA gene was present in 169 (61%), espP was present in 118 (43%), katP in 24 (9%), etpD in 4 (2%), and finally toxB was present in 10 (4%).
Presence of antimicrobial resistance genes
Thirty-three of the 276 STEC strains (12%) carried antimicrobial resistance genes (Table 5). Thirty of them carried multiple antibiotic resistance genes while the remaining three carried a single gene (tetA- IEH-NGS-ECO-00231, FDA00011218, and CFSAN051521). Among the antimicrobial classes observed were genes resistant to aminoglycosides, beta-lactamases, macrolides, phenicols, quinolones, sulphonamides, tetracyclines, and trimethoprim.
Table 5. Presence of antimicrobial resistance genes identified by in silico analysis in the 276 STEC genomes analyzed in this study.
Strains | aadAa | aph3a | strAa | strBa | bla TEMb | mef(B)c | floRd | QnrBe | sul1f | sul2f | sul3f | tetAg | tetBg | tetCg | dfrAh |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CFSAN053338 | - | - | + | + | - | - | - | - | - | - | - | - | - | - | - |
CFSAN046714i | - | - | - | - | - | - | - | - | + | - | + | - | - | + | - |
CFSAN051552 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046710 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046642 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
IEH-NGS-ECO-00231j | - | - | - | - | - | - | - | - | - | - | - | + | - | - | - |
CFSAN053336 | - | + | + | + | - | - | - | - | - | + | - | + | - | - | - |
FDA00009425 | - | - | + | + | - | - | + | - | - | + | - | + | - | - | - |
FDA00011218 | - | - | - | - | - | - | - | - | - | - | - | + | - | - | - |
IEH-NGS-ECO- | + | - | + | + | - | - | + | - | + | + | + | - | - | - | - |
CFSAN051526 | + | - | - | - | - | - | - | - | + | - | + | + | - | - | - |
CFSAN046636 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046669 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046668 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN053334 | - | - | + | + | - | - | + | - | - | + | - | + | - | - | - |
CFSAN046687 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN051521 | - | - | - | - | - | - | - | - | - | - | - | + | - | - | - |
CFSAN046730 | - | + | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN041112 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN041111 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046671 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046708 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046685 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN046684 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN051527 | + | - | - | - | - | - | - | - | + | - | + | + | - | - | - |
CFSAN046686 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
CFSAN051535 | - | - | + | + | + | - | - | - | - | + | - | + | - | - | - |
CFSAN046693 | - | - | + | + | - | - | - | - | - | + | - | - | - | - | - |
CFSAN046713 | - | - | - | + | + | - | - | + | - | + | - | + | - | - | + |
CFSAN046725 | - | - | - | + | - | - | - | - | - | + | - | - | - | - | - |
CFSAN051531 | - | - | - | + | - | - | - | - | - | + | - | + | - | - | + |
CFSAN051539k | + | + | - | - | + | + | - | - | - | - | + | - | + | - | - |
CFSAN041110 | - | - | + | + | - | - | - | - | - | + | - | - | + | - | - |
aAminoglycoside,
bBeta-lactamase,
cMacrolide,
dPhenicol,
eQuinolone,
fSulphonamide,
gTetracycline, and
h Trimethoprim.
i strain carrying blaOXAb gene.
j strain carrying blaCMY-2b gene.
k strain carrying cmld and cmlA1d genes.
Phylogenetic relationship of the STEC strains by cgMLST analysis
The phylogenetic relationships among the 276 STECs from this study determined by cgMLST analysis is shown in Fig 1. The genome of O157:H7 strain Sakai (NC_002695.1) was used as the reference for the cgMLST. The initial phylogenetic analysis [Neighbor-Joining (NJ) tree] based on gene differences (allele based) among these 276 STECs (Fig 1) revealed a complex evolutionary history with the existence of multiple, highly diverse genomic variants of strains isolated from RFFDA. Some of these genomes formed discrete groups and clustering was consistent with their ST (ex. all ST655 strains clustered together). A further analysis by a minimum spanning tree allows visualization of allele differences between strains with the same ST that was not seen with the NJ tree (Fig 2).
eae positive Non-STEC strains virulence gene profiles
Among the 55 non-STECs (lacking either stx gene by in silico analysis) strains isolated from FDA regulated foods, we found 35 that were positive for the eae gene (S4 Table). The majority were classified as atypical EPEC (aEPEC) eae+ and bfpA-. Two of them (IEH-NGS-ECO-00094, and IEH-NGS-ECO-00100) carried bfpA -the first gene of the bfp operon that encodes a type IV pilus- (eae+ and bfpA+) but were missing most of the common genes found in typical EPEC (S4 Table, typical EPEC lineage 1 strain E2348/69). Therefore, we classified them as aEPEC.
Discussion
STECs are the most dangerous among diarrheagenic E. coli affecting public health worldwide [5,7,23,27,42,43]. Usually the most threatening STEC are those of O157:H7 serotype [44,45]. However, in recent years there has been an increase in the occurrence of many non-O157 serotypes in humans associated with consumption of contaminated food, including produce and other FDA regulated products [2,46,47]. Some studies have characterized STECs presence and their virulence potential from FDA regulated products [20,23,48]. Most of the STEC isolated from those products have been only initially screened for the presence of some virulence genes using PCR [23,49]. In the present study, we performed an in-depth analysis with whole genome sequencing of 331 presumptive STEC strains. These strains were isolated from FDA regulated foods recovered during a period of 2003–2017 by two surveillance programs (FDA ORA, and MDP USDA). STECs were isolated from 22 food commodities. It is worth mentioning that even though the sampling did not occur in all states, the food commodities had nationwide (or at least multistate) distribution.
The STEC analyzed in this study were isolated from a wide variety of foods (Table 1), with the majority isolated from spinach (32%), flour (21%), lettuce (13%), and cilantro (12%) samples during the period 2010–2017. The actual frequency of flour STECs should be assessed at a lower frequency of 9%; the spike observed in their frequency was due to the outbreak in flour in 2016, where most STECs (37 strains (66%) of total flour STECs) were isolated. A better reflection of the frequency of STEC isolated per food commodity, specifically produce, can be found in Feng and Reddy (2013) [23]. Nevertheless, the presence of STECs in FRFDA per year remained relatively low, with a median of 30 isolates per year. As pointed previously, these variations in frequency of isolation can be due to seasonal and geographical variations, or to sporadic outbreaks as was observed for O121:H19 STEC strains isolated from flour in 2016 (23).
WGS revealed that these STECs were highly variable with the existence of 95 different sequence types (STs) and belonging to at least 81 different serotypes. Some serotypes could not be predicted and might be due to the fact that the O and H type genes were not present in the database used which includes the most frequent serotypes found in clinical cases. Most STs were observed only once while some others were observed more frequently. ST655 was observed up to 38 times among the STECs analyzed and it was because 37 of those STEC strains were recovered during the flour outbreak in 2016 [20]. According to what was found in Enterobase (http://enterobase.warwick.ac.uk), the majority of the STEC STs observed in this study [69/95–73%] had been reported as causing disease in humans. Furthermore, of these potential human pathogenic STECs, strains belonging to 18 of those STs (19%) were additionally associated with strains causing EHEC-related illnesses (Table 3). Among the known ST associated with causing HC illnesses or HUS cases we found: ST21 and ST29 (O26:H11), ST11 (O157:H7), ST33 (O91:H14), ST17 (O103:H2), and ST16 (O111:H-), among others [7,42,50,51]. There are some STs, that have been described as EAEC/ETEC, however in our dataset, the strains with those STs did not carry any virulence genes for those pathotypes. These results emphasized the need to analyze the data as a whole (e.g. to include virulence genes);ST alone is not an accurate predictor of pathotype or virulence.
Although some strains have the same ST, they show differences in their virulence profile as well as their Shiga toxin gene content. For example, there were 5 strains that were ST10 and from these only 2 were classified as STECs, with one carrying stx1a while the other carried stx2a. Both were negative for any of the attaching genes (eae, or saa genes), and therefore considered low risk for causing infection in a healthy individual. This demonstrates that a single characteristic (e.g. ST or serotype) is not enough to make an inference of the potential pathogenic trait of any STECs (http://www.fao.org/documents/card/en/c/CA0032EN). The better way is to take all the information into consideration (ST, stx type, attaching genes, serotype, etc) in order to make a more informed prediction of the pathogenic potential of any STEC in conjunction with historical available data on clinical cases. For example, a strain of O113:H21 stx2a positive that doesn’t possess eae but has saa (a gene that encodes an auto-agglutinating adhesion) and has been linked to HUS cases [14], could be potentially harmful to humans. A similar analysis could be done in the case of any STEC that has all those attributes but has not been linked to any human cases. Even though we cannot predict the actual outcome of an infection with this strain, it still warrants a warning about its presence in foods that are consumed raw as is the case with fresh produce.
We tested for 95 known virulence genes [27] found in the most common E. coli pathotypes and did not find any genes present that would characterize the strains as STEC/EAEC//ETEC/EIEC hybrids. Among the adherence factors, eae and saa genes were found in 24% and 26% of the STEC strains, respectively. Strains that carry eae did not carry saa, and vice versa, as perviously observed for STEC isolated from fresh produce [23]. Regarding the presence of Shiga toxin type, there was great variation with most strains (67%) carrying only stx type 2, 19% carrying only stx type 1 while 15% carried both stx types. Among the stx2 there were 144 that were either a,d, or c variants, which are the stx2 variants found among clinical cases [31,32,52–54] and that have a specific tropism for humans [53]. The remaining 40 STECs carrying stx type 2 alone were stx variants e, d/e and g which have been found in animal reservoirs [55]. The remaining putative virulence genes were sporadically found with the most common exhA gene found in 61% of the STECs, while espP was found in 43% of the STECs. These two genes can be found in the virulence plasmid and appear to participate in STECs infection in humans [9,11,15]. In summary, 46 of the STECs analyzed in this study carried a combination of stx2a/d and eae genes. Many researchers and food safety institutions considered a STEC with this virulence gene combination to possess an elevated risk to humans [22,53]. On the other hand, 94 of these STECs carried a combination of stx2a/d/c and saa genes. In this case, the risk is harder to determine and are considered of moderate risk if serotype was previously found associated with human cases.
We also confirmed the presence of antimicrobial resistance (AMR) genes in some of the analyzed STEC strains with a low prevalence (12%). However, those few strains carried multiple antimicrobial resistance genes. The presence of strains carrying multiple AMR genes is worrisome since they can be shared amongst other E. coli and could possible participate in the dissemination of AMR in their environments, as has been observed for tetracycline genes in E. coli isolates from beef cattle [56], for colistin resistance (mcr-1 gene) through plasmid-mediated transfer [57], and for ampicillin resistance genes in E. coli in an infant treated with antibiotics [58].
Phylogenetic analysis by a custom cgMLST analysis of these 276 STECs confirmed the MLST in silico analysis, with many different defined clades among these STECs isolated from FRFDA. The cgMLST analysis is a fast method of analysis and provides an initial visualization of the relationships among the strains analyzed. Comparable results have been observed for establishing fast relationships among genomes from diverse bacterial pathogens [27,29,59–64]. A further analysis using only the genomes of strains that are located within each individual or among selected clades can produce a more detailed evolutionary history, using single nucleotide analyses. This SNP analysis can help in determining the potential source, phylogenetic nature, lineage, and timeline of transmission of each group, as has been shown for the ST36 lineage of Vibrio parahaemolyticus [65].
EPECs are the leading cause of infantile diarrhea in developing countries [66,67]. EPEC strains do not carry stx genes but typical EPECs (tEPEC) have eae and bfp genes, and their main reservoir is humans [68]. The eae gene is located in the chromosome, in the LEE operon, while the bfp operon is typically located in the large EPEC adherence factor (EAF) virulence plasmid [68]. These tEPEC also carry the perA gene, which increases the expression of LEE elements [68,69]. Interestingly, 11% of our presumptive STECs were shown by in silico analysis to be atypical EPECs (aEPEC). Among their unusual features are the absence of the EAF plasmid, and their reservoirs can be animals or humans [68]. It is possible these aEPECs might had lost their phages upon culturing, as this pattern has been observed in clinical isolates of E. coli upon sub-cultivation [70]. The aEPEC we observed in this study may have the capacity to produce A/E lesions, since they carried both the eae and tir gene, which are the effector and receptor necessary for the formation of the A/E lesion [71].
Our results suggest that finding aEPECs in food could be of particular concern, as these strains have the potential for acquiring the stx phage, as observed in the E. coli O104:H4 strain found in Germany [72]. That strain was an entero-aggregative E. coli (EAEC) that had acquired an stx2a phage, and human illnesses that resulted during 2011 became the largest known HUS outbreak of STEC-related illness in the world [72]. Similarly, an O26:H11 strain 21765, isolated in 2005 during a milk cheese outbreak in France [73] was shown to be an EPEC strain that had probably acquired a stx2a phage [27]. In Gonzalez-Escalona et al (2016), the authors demonstrated that some strains of E. coli O26:H11 isolated from US cattle were phylogenetically more closely related to ST29 O26:H11 EHECs but t because these did not carry the stx phage, they would have been classified as EHEC-like by previous methods [74]. Over the last five years, the analyses of thousands of E. coli genomes have revealed that so-called E. coli “hybrid strains”–strains that belong to one pathotype but acquire virulence markers, such as stx genes, from another pathotype–could be more common than previously believed.
We are heading to a new phase in surveillance of STECs in the US by using a genomic monitoring approach and the genomes of the STEC isolated from FRFDA provides a solid foundation to build upon [75] (https://www.cdc.gov/pulsenet/pathogens/wgs.html). A database already exists that achieves the first goal of source tracking by using core genome information (NCBI pathogen detection tool). However, there is a need for improved databases that allow for fast analysis of the WGS data for detecting virulence genes, phages and plasmids content, as well as antimicrobial resistance genes.
In conclusion, STECs were isolated from diverse FRFDA food sources during the period study. The contamination frequency was relatively low (median 30 STEC strains isolated per year). However, fifty percent of the STECs analyzed in this study carried either a combination of eae plus stx, or saa plus stx, therefore being potentially pathogenic to humans. Moreover, those STECs carried most of the virulence genes described for STECs causing infections with a diverse range from HC (e.g. ST655 O111:H19 strains) to HUS (e.g. ST21 O26:H11 strains) [20,42]. Some others have not been described as causing disease in humans but have the potential to do so (e.g. ST342 O5:H-unknown strains) since they carried all virulence genes described in pathogenic strains (stx1a, eae-beta1, exhA, tir, and many of the T3SS effectors and non-LEE effectors) (Table 4). Nonetheless, the determination of the presence of STECs in FRFDA with the potential to cause disease in humans reinforces the need to continue surveillance of this important pathogen which is of importance for food safety and public health. Furthermore, the availability of these genomes could provide early warnings of food contamination from cattle or other animals, since some of the STEC isolated were carrying stx2e that have been usually observed causing edema in pigs [76] and are considered as probably non-pathogenic to humans [53]. Here we showed that WGS enabled comparisons across isolates to establish phylogeny, helped in identification of antibiotic resistance by monitoring the presence of antimicrobial resistance genes, and determined the presence of known virulence genes that have been linked with illnesses. A freely accessible dataset of high-quality reference genome sequences of FRFDA was previously unavailable. Future food safety investigations will benefit from the comparisons made possible by this WGS dataset as it allows for the monitoring of the recurrence and emergence of strains in the food supply. It is our goal to help develop a database that will allow for fast source tracking and accurate categorization (low risk or high risk) of STEC food isolates in a more comprehensive manner.
Supporting information
Acknowledgments
The authors thank Dr. Shanker Reddy for sharing the USDA MDP program food isolates used in this study and the laboratorians within the FDA Office of Regulatory Affairs, Food and Feed Laboratories who diligently isolated and sequenced many of the isolates here within analyzed. We also want to thank Sabina Lindley for assistance in sequencing some of these E. coli strains and Lili Fox Vélez for her helpful comments on this manuscript.
Data Availability
The draft genome sequences of 331 E. coli strains used in our study are available in GenBank under the accession numbers listed in S1 Table.
Funding Statement
This work was supported by the FDA Foods Science and Research Intramural Program.
References
- 1.Rangel JM, Sparling PH, Crowe C, Griffin PM, Swerdlow DL. Epidemiology of Escherichia coli O157:H7 outbreaks, United States, 1982–2002. Emerg Infect Dis. 2005; 11: 603–609. 10.3201/eid1104.040739 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Brooks JT, Sowers EG, Wells JG, Greene KD, Griffin PM, Hoekstra RM et al. Non-O157 Shiga toxin-producing Escherichia coli infections in the United States, 1983–2002. J Infect Dis. 2005; 192: 1422–1429. JID34662 [pii]; 10.1086/466536 [DOI] [PubMed] [Google Scholar]
- 3.Scallan E, Hoekstra RM, Angulo FJ, Tauxe RV, Widdowson MA, Roy SL et al. Foodborne illness acquired in the United States—major pathogens. Emerg Infect Dis. 2011; 17: 7–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Beutin L, Krause G, Zimmermann S, Kaulfuss S, Gleier K. Characterization of Shiga toxin-producing Escherichia coli strains isolated from human patients in Germany over a 3-year period. J Clin Microbiol. 2004; 42: 1099–1108. 10.1128/JCM.42.3.1099-1108.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kuehne A, Bouwknegt M, Havelaar A, Gilsdorf A, Hoyer P, Stark K et al. Estimating true incidence of O157 and non-O157 Shiga toxin-producing Escherichia coli illness in Germany based on notification data of haemolytic uraemic syndrome. Epidemiol Infect. 2016; 144: 3305–3315. S0950268816001436; 10.1017/S0950268816001436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Byrne L, Jenkins C, Launders N, Elson R, Adak GK. The epidemiology, microbiology and clinical impact of Shiga toxin-producing Escherichia coli in England, 2009–2012. Epidemiol Infect. 2015; 143: 3475–3487. S0950268815000746; 10.1017/S0950268815000746 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.McAllister LJ, Bent SJ, Petty NK, Skippington E, Beatson SA, Paton JC et al. Genomic Comparison of Two O111:H- Enterohemorrhagic Escherichia coli Isolates from a Historic Hemolytic-Uremic Syndrome Outbreak in Australia. Infect Immun. 2016; 84: 775–781. IAI.01229-15; 10.1128/IAI.01229-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.U.S.Department of Agriculture FSaIS. Shiga Toxin-Producing Escherichia coli in Certain Raw Beef Products. Fed Regist. 2012; 77: 31975–31981. [Google Scholar]
- 9.Kaper JB, Nataro JP, Mobley HL. Pathogenic Escherichia coli. Nat Rev Microbiol. 2004; 2: 123–140. 10.1038/nrmicro818 [DOI] [PubMed] [Google Scholar]
- 10.Karmali MA, Mascarenhas M, Shen S, Ziebell K, Johnson S, Reid-Smith R et al. Association of genomic O island 122 of Escherichia coli EDL 933 with verocytotoxin-producing Escherichia coli seropathotypes that are linked to epidemic and/or serious disease. J Clin Microbiol. 2003; 41: 4930–4940. 10.1128/JCM.41.11.4930-4940.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Nataro JP, Kaper JB. Diarrheagenic Escherichia coli. Clin Microbiol Rev. 1998; 11: 142–201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Garmendia J, Frankel G, Crepin VF. Enteropathogenic and enterohemorrhagic Escherichia coli infections: translocation, translocation, translocation. Infect Immun. 2005; 73: 2573–2585. 73/5/2573; 10.1128/IAI.73.5.2573-2585.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Khaitan A, Jandhyala DM, Thorpe CM, Ritchie JM, Paton AW. The operon encoding SubAB, a novel cytotoxin, is present in shiga toxin-producing Escherichia coli isolates from the United States. J Clin Microbiol. 2007; 45: 1374–1375. JCM.00076-07; 10.1128/JCM.00076-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Paton AW, Srimanote P, Woodrow MC, Paton JC. Characterization of Saa, a novel autoagglutinating adhesin produced by locus of enterocyte effacement-negative Shiga-toxigenic Escherichia coli strains that are virulent for humans. Infect Immun. 2001; 69: 6999–7009. 10.1128/IAI.69.11.6999-7009.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Garmendia J, Ren Z, Tennant S, Midolli Viera MA, Chong Y, Whale A et al. Distribution of tccP in clinical enterohemorrhagic and enteropathogenic Escherichia coli isolates. J Clin Microbiol. 2005; 43: 5715–5720. 43/11/5715; 10.1128/JCM.43.11.5715-5720.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Beutin L, Prada J, Zimmermann S, Stephan R, Orskov I, Orskov F. Enterohemolysin, a new type of hemolysin produced by some strains of enteropathogenic E. coli (EPEC). Zentralbl Bakteriol Mikrobiol Hyg A. 1988; 267: 576–588. [DOI] [PubMed] [Google Scholar]
- 17.Schmidt H, Kernbach C, Karch H. Analysis of the EHEC hly operon and its location in the physical map of the large plasmid of enterohaemorrhagic Escherichia coli O157:h7. Microbiology. 1996; 142 (Pt 4): 907–914. [DOI] [PubMed] [Google Scholar]
- 18.Boerlin P, McEwen SA, Boerlin-Petzold F, Wilson JB, Johnson RP, Gyles CL. Associations between virulence factors of Shiga toxin-producing Escherichia coli and disease in humans. J Clin Microbiol. 1999; 37: 497–503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Schmidt H, Beutin L, Karch H. Molecular analysis of the plasmid-encoded hemolysin of Escherichia coli O157:H7 strain EDL 933. Infect Immun. 1995; 63: 1055–1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Crowe SJ, Bottichio L, Shade LN, Whitney BM, Corral N, Melius B et al. Shiga Toxin-Producing E. coli Infections Associated with Flour. N Engl J Med. 2017; 377: 2036–2043. 10.1056/NEJMoa1615910 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Erickson MC, Doyle MP. Food as a vehicle for transmission of Shiga toxin-producing Escherichia coli. J Food Prot. 2007; 70: 2426–2449. [DOI] [PubMed] [Google Scholar]
- 22.Feng PC, Reddy SP. Prevalence and diversity of enterotoxigenic Escherichia coli strains in fresh produce. J Food Prot. 2014; 77: 820–823. 10.4315/0362-028X.JFP-13-412 [DOI] [PubMed] [Google Scholar]
- 23.Feng PC, Reddy S. Prevalences of Shiga toxin subtypes and selected other virulence factors among Shiga-toxigenic Escherichia coli strains isolated from fresh produce. Appl Environ Microbiol. 2013; 79: 6917–6923. AEM.02455-13; 10.1128/AEM.02455-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Allard MW, Bell R, Ferreira CM, Gonzalez-Escalona N, Hoffmann M, Muruvanda T et al. Genomics of foodborne pathogens for microbial food safety. Curr Opin Biotechnol. 2018; 49: 224–229. S0958-1669(17)30139-8; 10.1016/j.copbio.2017.11.002 [DOI] [PubMed] [Google Scholar]
- 25.Bergholz TM, Moreno Switt AI, Wiedmann M. Omics approaches in food safety: fulfilling the promise? Trends Microbiol. 2014; 22: 275–281. S0966-842X(14)00018-3; 10.1016/j.tim.2014.01.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hoffmann M, Luo Y, Monday SR, Gonzalez-Escalona N, Ottesen AR, Muruvanda T et al. Tracing Origins of the Salmonella Bareilly Strain Causing a Food-borne Outbreak in the United States. J Infect Dis. 2016; 213: 502–508. jiv297 [pii]; 10.1093/infdis/jiv297 [DOI] [PubMed] [Google Scholar]
- 27.Gonzalez-Escalona N, Toro M, Rump LV, Cao G, Nagaraja TG, Meng J. Virulence Gene Profiles and Clonal Relationships of Escherichia coli O26:H11 Isolates from Feedlot Cattle as Determined by Whole-Genome Sequencing. Appl Environ Microbiol. 2016; 82: 3900–3912. AEM.00498-16 [pii]; 10.1128/AEM.00498-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Maiden MC, Harrison OB. The population and functional genomics of the Neisseria revealed with gene-by-gene approaches. J Clin Microbiol. 2016. JCM.00301-16 [pii]; 10.1128/JCM.00301-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lorenz SC, Kotewicz ML, Hoffmann M, Gonzalez-Escalona N, Fischer M, Kase JA. Complete Genome Sequences of Four Enterohemolysin-Positive (ehxA) Enterocyte Effacement-Negative Shiga Toxin-Producing Escherichia coli Strains. Genome Announc. 2016; 4 4/5/e00846-16 [pii]; 10.1128/genomeA.00846-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Dallman TJ, Byrne L, Launders N, Glen K, Grant KA, Jenkins C. The utility and public health implications of PCR and whole genome sequencing for the detection and investigation of an outbreak of Shiga toxin-producing Escherichia coli serogroup O26:H11. Epidemiol Infect. 2015; 143: 1672–1680. S0950268814002696 [pii]; 10.1017/S0950268814002696 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Beutin L, Martin A. Outbreak of Shiga toxin-producing Escherichia coli (STEC) O104:H4 infection in Germany causes a paradigm shift with regard to human pathogenicity of STEC strains. J Food Prot. 2012; 75: 408–418. 10.4315/0362-028X.JFP-11-452 [DOI] [PubMed] [Google Scholar]
- 32.Pennington H. Escherichia coli O104, Germany 2011. Lancet Infect Dis. 2011. S1473-3099(11)70166-9 [pii]; 10.1016/S1473-3099(11)70166-9 [doi]. [DOI] [PubMed] [Google Scholar]
- 33.Li Z, Perez-Osorio A, Wang Y, Eckmann K, Glover WA, Allard MW et al. Whole genome sequencing analyses of Listeria monocytogenes that persisted in a milkshake machine for a year and caused illnesses in Washington State. BMC Microbiol. 2017; 17: 134 10.1186/s12866-017-1043-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ogura Y, Ooka T, Iguchi A, Toh H, Asadulghani M, Oshima K et al. Comparative genomics reveal the mechanism of the parallel evolution of O157 and non-O157 enterohemorrhagic Escherichia coli. Proc Natl Acad Sci U S A. 2009; 106: 17939–17944. 10.1073/pnas.0903585106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Rasko DA, Webster DR, Sahl JW, Bashir A, Boisen N, Scheutz F et al. Origins of the E. coli strain causing an outbreak of hemolytic-uremic syndrome in Germany. N Engl J Med. 2011; 365: 709–717. 10.1056/NEJMoa1106920 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Wirth T, Falush D, Lan R, Colles F, Mensa P, Wieler LH et al. Sex and virulence in Escherichia coli: an evolutionary perspective. Mol Microbiol. 2006; 60: 1136–1151. MMI5172; 10.1111/j.1365-2958.2006.05172.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Joensen KG, Tetzschner AM, Iguchi A, Aarestrup FM, Scheutz F. Rapid and Easy In Silico Serotyping of Escherichia coli Isolates by Use of Whole-Genome Sequencing Data. J Clin Microbiol. 2015; 53: 2410–2426. JCM.00008-15 [pii]; 10.1128/JCM.00008-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Joensen KG, Scheutz F, Lund O, Hasman H, Kaas RS, Nielsen EM et al. Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli. J Clin Microbiol. 2014; 52: 1501–1510. JCM.03617-13 [pii]; 10.1128/JCM.03617-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O et al. Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother. 2012; 67: 2640–2644. dks261 [pii]; 10.1093/jac/dks261 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Nei M, Tajima F, Tateno Y. Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data. J Mol Evol. 1983; 19: 153–170. [DOI] [PubMed] [Google Scholar]
- 41.Hunter PR, Gaston MA. Numerical index of the discriminatory ability of typing systems: an application of Simpson's index of diversity. J Clin Microbiol. 1988; 26: 2465–2466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bielaszewska M, Mellmann A, Bletz S, Zhang W, Kock R, Kossow A et al. Enterohemorrhagic Escherichia coli O26:H11/H-: a new virulent clone emerges in Europe. Clin Infect Dis. 2013; 56: 1373–1381. cit055 [pii]; 10.1093/cid/cit055 [DOI] [PubMed] [Google Scholar]
- 43.Taylor EV, Nguyen TA, Machesky KD, Koch E, Sotir MJ, Bohm SR et al. Multistate outbreak of Escherichia coli O145 infections associated with romaine lettuce consumption, 2010. J Food Prot. 2013; 76: 939–944. 10.4315/0362-028X.JFP-12-503 [DOI] [PubMed] [Google Scholar]
- 44.Watanabe H, Wada A, Inagaki Y, Itoh K, Tamura K. Outbreaks of enterohaemorrhagic Escherichia coli O157:H7 infection by two different genotype strains in Japan, 1996. Lancet. 1996; 348: 831–832. [DOI] [PubMed] [Google Scholar]
- 45.Bell BP, Goldoft M, Griffin PM, Davis MA, Gordon DC, Tarr PI et al. A multistate outbreak of Escherichia coli O157:H7-associated bloody diarrhea and hemolytic uremic syndrome from hamburgers. The Washington experience. JAMA. 1994; 272: 1349–1353. [PubMed] [Google Scholar]
- 46.Ethelberg S, Smith B, Torpdahl M, Lisby M, Boel J, Jensen T et al. Outbreak of non-O157 Shiga toxin-producing Escherichia coli infection from consumption of beef sausage. Clin Infect Dis. 2009; 48: e78–e81. 10.1086/597502 [DOI] [PubMed] [Google Scholar]
- 47.Fey PD, Wickert RS, Rupp ME, Safranek TJ, Hinrichs SH. Prevalence of non-O157:H7 shiga toxin-producing Escherichia coli in diarrheal stool samples from Nebraska. Emerg Infect Dis. 2000; 6: 530–533. 10.3201/eid0605.000513 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cooper KK, Mandrell RE, Louie JW, Korlach J, Clark TA, Parker CT et al. Comparative genomics of enterohemorrhagic Escherichia coli O145:H28 demonstrates a common evolutionary lineage with Escherichia coli O157:H7. BMC Genomics. 2014; 15: 17 1471-2164-15-17 [pii]; 10.1186/1471-2164-15-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Feng PC, Delannoy S, Lacher DW, Dos Santos LF, Beutin L, Fach P et al. Genetic diversity and virulence potential of shiga toxin-producing Escherichia coli O113:H21 strains isolated from clinical, environmental, and food sources. Appl Environ Microbiol. 2014; 80: 4757–4763. AEM.01182-14; 10.1128/AEM.01182-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Mellmann A, Bielaszewska M, Kock R, Friedrich AW, Fruth A, Middendorf B et al. Analysis of collection of hemolytic uremic syndrome-associated enterohemorrhagic Escherichia coli. Emerg Infect Dis. 2008; 14: 1287–1290. 10.3201/eid1408.071082 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Bielaszewska M, Prager R, Kock R, Mellmann A, Zhang W, Tschape H et al. Shiga toxin gene loss and transfer in vitro and in vivo during enterohemorrhagic Escherichia coli O26 infection in humans. Appl Environ Microbiol. 2007; 73: 3144–3150. AEM.02937-06 [pii]; 10.1128/AEM.02937-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Bielaszewska M, Friedrich AW, Aldick T, Schurk-Bulgrin R, Karch H. Shiga toxin activatable by intestinal mucus in Escherichia coli isolated from humans: predictor for a severe clinical outcome. Clin Infect Dis. 2006; 43: 1160–1167. CID40120; 10.1086/508195 [DOI] [PubMed] [Google Scholar]
- 53.Scheutz F, Teel LD, Beutin L, Pierard D, Buvens G, Karch H et al. Multicenter evaluation of a sequence-based protocol for subtyping Shiga toxins and standardizing Stx nomenclature. J Clin Microbiol. 2012; 50: 2951–2963. JCM.00860-12; 10.1128/JCM.00860-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Beutin L, Hammerl JA, Reetz J, Strauch E. Shiga toxin 2A-encoding bacteriophages in enteroaggregative Escherichia coli O104:H4 strains. Emerg Infect Dis. 2014; 20: 1567–1568. 10.3201/eid2009.131373 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Melton-Celsa AR, O'Brien AD (1998) Structure, biology, and relative toxicity of Shiga toxin family members for cells and animals In: Kaper JB, O'Brien AD, editors. Escherichia coli O157:H7 and other Shiga toxin-producing E. coli strains. Washington, DC: ASM Press; pp. 121–128. [Google Scholar]
- 56.Shin SW, Shin MK, Jung M, Belaynehe KM, Yoo HS. Prevalence of Antimicrobial Resistance and Transfer of Tetracycline Resistance Genes in Escherichia coli Isolates from Beef Cattle. Appl Environ Microbiol. 2015; 81: 5560–5566. AEM.01511-15; 10.1128/AEM.01511-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Liu YY, Wang Y, Walsh TR, Yi LX, Zhang R, Spencer J et al. Emergence of plasmid-mediated colistin resistance mechanism MCR-1 in animals and human beings in China: a microbiological and molecular biological study. Lancet Infect Dis. 2016; 16: 161–168. S1473-3099(15)00424-7; 10.1016/S1473-3099(15)00424-7 [DOI] [PubMed] [Google Scholar]
- 58.Karami N, Martner A, Enne VI, Swerkersson S, Adlerberth I, Wold AE. Transfer of an ampicillin resistance gene between two Escherichia coli strains in the bowel microbiota of an infant treated with antibiotics. J Antimicrob Chemother. 2007; 60: 1142–1145. dkm327; 10.1093/jac/dkm327 [DOI] [PubMed] [Google Scholar]
- 59.Ruppitsch W, Pietzka A, Prior K, Bletz S, Fernandez HL, Allerberger F et al. Defining and Evaluating a Core Genome Multilocus Sequence Typing Scheme for Whole-Genome Sequence-Based Typing of Listeria monocytogenes. J Clin Microbiol. 2015; 53: 2869–2876. JCM.01193-15 [pii]; 10.1128/JCM.01193-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Chen Y, Gonzalez-Escalona N, Hammack TS, Allard MW, Strain EA, Brown EW. Core Genome Multilocus Sequence Typing for Identification of Globally Distributed Clonal Groups and Differentiation of Outbreak Strains of Listeria monocytogenes. Appl Environ Microbiol. 2016; 82: 6258–6272. AEM.01532-16 [pii]; 10.1128/AEM.01532-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Gonzalez-Escalona N, Gavilan RG, Brown EW, Martinez-Urtaza J. Transoceanic Spreading of Pathogenic Strains of Vibrio parahaemolyticus with Distinctive Genetic Signatures in the recA Gene. PLoS One. 2015; 10: e0117485 10.1371/journal.pone.0117485 [doi];PONE-D-14-38928 [pii]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Strauss L, Stegger M, Akpaka PE, Alabi A, Breurec S, Coombs G et al. Origin, evolution, and global transmission of community-acquired Staphylococcus aureus ST8. Proc Natl Acad Sci U S A. 2017; 114: E10596–E10604. 1702472114; 10.1073/pnas.1702472114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Gonzalez-Escalona N, Jolley KA, Reed E, Martinez-Urtaza J. Defining a Core Genome Multilocus Sequence Typing Scheme for the Global Epidemiology of Vibrio parahaemolyticus. J Clin Microbiol. 2017; 55: 1682–1697. JCM.00227-17; 10.1128/JCM.00227-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Ghanem M, Wang L, Zhang Y, Edwards S, Lu A, Ley D et al. Core Genome Multilocus Sequence Typing: a Standardized Approach for Molecular Typing of Mycoplasma gallisepticum. J Clin Microbiol. 2018; 56 JCM.01145-17; 10.1128/JCM.01145-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Martinez-Urtaza J, van AR, Abanto M, Haendiges J, Myers RA, Trinanes J et al. Genomic Variation and Evolution of Vibrio parahaemolyticus ST36 over the Course of a Transcontinental Epidemic Expansion. MBio. 2017; 8 mBio.01425-17; 10.1128/mBio.01425-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Okello E, Moonens K, Erume J, De GH. Enterotoxigenic Escherichia coli strains are highly prevalent in Ugandan piggeries but disease outbreaks are masked by antibiotic prophylaxis. Trop Anim Health Prod. 2015; 47: 117–122. 10.1007/s11250-014-0694-2 [DOI] [PubMed] [Google Scholar]
- 67.Ochoa TJ, Contreras CA. Enteropathogenic Escherichia coli infection in children. Curr Opin Infect Dis. 2011; 24: 478–483. 10.1097/QCO.0b013e32834a8b8b [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Trabulsi LR, Keller R, Gomes TAT. Typical and Atypical Enteropathogenic Escherichia coli. Emerg Infect Dis. 2002; 8: 508–513. 10.3201/eid0805.010385 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Elliott SJ, Sperandio V, Giron JA, Shin S, Mellies JL, Wainwright L et al. The locus of enterocyte effacement (LEE)-encoded regulator controls expression of both LEE- and non-LEE-encoded virulence factors in enteropathogenic and enterohemorrhagic Escherichia coli. Infect Immun. 2000; 68: 6115–6126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Karch H, Meyer T, Russmann H, Heesemann J. Frequent loss of Shiga-like toxin genes in clinical isolates of Escherichia coli upon subcultivation. Infect Immun. 1992; 60: 3464–3467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.McWilliams BD, Torres AG. Enterohemorrhagic Escherichia coli adhesins. Microbiol Spectrum. 2014; 2: EHEC-0003-2013. 10.1128/microbiolspec.EHEC-0003-2013 [DOI] [PubMed] [Google Scholar]
- 72.Rohde H, Qin J, Cui Y, Li D, Loman NJ, Hentschke M et al. Open-source genomic analysis of Shiga-toxin-producing E. coli O104:H4. N Engl J Med. 2011; 365: 718–724. 10.1056/NEJMoa1107643 [DOI] [PubMed] [Google Scholar]
- 73.Galia W, Mariani-Kurkdjian P, Loukiadis E, Blanquet-Diot S, Leriche F, Brugere H et al. Genome Sequence and Annotation of a Human Infection Isolate of Escherichia coli O26:H11 Involved in a Raw Milk Cheese Outbreak. Genome Announc. 2015; 3 3/1/e01568-14 [pii]; 10.1128/genomeA.01568-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Bugarel M, Beutin L, Scheutz F, Loukiadis E, Fach P. Identification of genetic markers for differentiation of Shiga toxin-producing, enteropathogenic, and avirulent strains of Escherichia coli O26. Appl Environ Microbiol. 2011; 77: 2275–2281. AEM.02832-10 [pii]; 10.1128/AEM.02832-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Allard MW, Strain E, Melka D, Bunning K, Musser SM, Brown EW et al. Practical Value of Food Pathogen Traceability through Building a Whole-Genome Sequencing Network and Database. J Clin Microbiol. 2016; 54: 1975–1983. JCM.00081-16; 10.1128/JCM.00081-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Oanh TKN, Nguyen VK, De Greve H, Goddeeris BM. Protection of Piglets against Edema Disease by Maternal Immunization with Stx2e Toxoid. Infect Immun. 2012; 80: 469–473. 10.1128/IAI.05539-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The draft genome sequences of 331 E. coli strains used in our study are available in GenBank under the accession numbers listed in S1 Table.