ABSTRACT
The whole genome sequence of Escherichia coli strain DP51 is reported here. DP51 was isolated in 2011 from a human rectal specimen in Breda, Netherlands. DP51 genome is 4.7 Mb with 50.68% GC content. DP51 possesses 34 extraintestinal virulence-related genes and is presumptive extraintestinal pathogenic E. coli (ExPEC).
KEYWORDS: extraintestinal diseases, Escherichia coli (ExPEC), human rectal specimen
ANNOUNCEMENT
Extraintestinal pathogenic Escherichia coli (ExPEC) is an E. coli subgroup that causes illness outside the gastrointestinal tract, including urinary tract infections and neonatal meningitis (1). A human fecal carriage survey was performed in Breda, Netherlands, to compare isolates from humans and chicken meat (2). DP51 was isolated from a human rectal swab by incubation for 16–18 h at 37°C in 15 mL of tryptic soy broth (TSB), then in TSB plus vancomycin and cefotaxime, followed by selection for the presence of extended-spectrum β-lactamase (ESBL).
Genomic DNA was isolated from overnight cultures (18 h) grown in LB broth at 37°C, with shaking at 180 rpm and sequenced as described previously (3) with >205× coverage. Briefly, genomic DNA was sheared and underwent size selection for 20 kb fragments. DNA sequencing was performed using a PacBio Sequel machine with Single Molecule Real-Time (SMRT) technology. Default parameters were used for all software analysis, unless otherwise specified. SMRT Link software v6.0.0 was used for quality control and filtering, which generated 21,697 reads, with 1,278,540,836 bases with a mean subread length of 9,732 and N50 value of 11,218.
A chromosome (4,764,301 bp; 50.68% GC content) and three plasmids (97,470 bp; 48.48% GC content, 69,739 bp, 54.41% GC content, and 42,753 bp, 51.47% GC content) were generated using Flye/2.9.1 software for genome assembly (4). Both chromosomes and plasmids were assembled as circular molecules. The chromosome was rotated with the dnaA gene as the origin of replication (circulator/v1.5.5) (5). The NCBI-Prokaryotic Genome Annotation Pipeline (PGAP, Software version: 6.7) (6) was used for genome annotation, which identified 4,911 genes and 117 RNA sequences, including 88 tRNA genes. Eleven intact phages were predicted on the chromosome using the PhageBoost software package (7) (accessed on 4/02/2025). SerotypeFinder-2.0 (https://cge.food.dtu.dk/services/SerotypeFinder/) (8) and MLST-2.0 databases (Software version: 2.0.9 [2022-05-11]; Database version [2023-06-19]; https://cge.food.dtu.dk/services/MLST/) (9) assigned DP51 to sequence type 93, and serotype O5:H4. ResFinder-4.3.3 (10) identified nine resistance genes (Table 1), all in plasmid sequences (10).
TABLE 1.
Antimicrobial resistance genes (AMRs) and virulence genes identified in DP51
| Gene category | Genes identified |
|---|---|
| AMRs | aph (6)-Id, aph(3'')-Ib, aadA4, blaCTX-M-1, blaTEM-1B, mph(A), sul2, tet(A), dfrA2. |
| Virulence genes | aslA, anr, capU, csgA, cvaC, etsC, fdeC, fyuA, gad, hlyE, hlyF, hra, iroN, irp2, iss, iucC, iutA, kpsE, kpsMII_K5, mchF, nlpI, ompT, papA_F13, papC, shiA, sitA, terC, traJ, traT, tsh, yehA, yehB, yehC, yehD. |
VirulenceFinder-2.0 identified a total of 34 chromosomal virulence genes (Table 1), using parameters of 90% minimum identity and 60% minimum length (11) (Software version: 2.0.5 [2024-01-31]; Database version [2022-12-02]). In addition, genes related to iron acquisition (chuA, fyuA, iucC, irp2, sitA, and iutA), adhesins/attachment (afaA, afaC, afaD, iha, nfaE, papA_F43, terC, and yfcV), protectins/serum resistance (iss, kpsE, kpsMII_K5, and ompT), and toxin production (sat and usp) were identified. PathogenFinder-1.1 (https://cge.cbs.dtu.dk/services/PathogenFinder/) estimated a 0.933 probability of DP51 being a human pathogen.
The presence of more than two out of the key virulence genes (pap, sfa/foc, afa/dra, iutA, and kpsMT II) qualifies a strain as ExPEC (12). DP51 was qualified presumptively as ExPEC based on its carriage of iutA, afaB/C, papAH, papC, and kpsE/MII (12). The sequence data could offer the possibility of determining the relatedness of strains from food and humans through comparative genomics (11).
ACKNOWLEDGMENTS
We thank Pina Fratamico for critical reading of the manuscript. This research used resources provided by the SCINet project and/or the AI Center of Excellence of the USDA Agricultural Research Service, ARS project numbers 0201-88888-003-000D and 0201-88888-002-000D. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the author(s) and do not necessarily reflect the view of the U.S. Department of Agriculture.
Contributor Information
Yanhong Liu, Email: yanhong.liu@usda.gov.
David Rasko, University of Maryland School of Medicine, Baltimore, Maryland, USA.
DATA AVAILABILITY
The whole-genome project has been deposited in BioProject number PRJNA642167, Sequence
Read Archive (SRA) number SRR29021894 BioSample number SAMN41394341
with the GenBank accession number CP156034 for the DP51 chromosome and CP156035, CP156036 and CP156037 for the plasmids.
REFERENCES
- 1. Russo TA, Johnson JR. 2003. Medical and economic impact of extraintestinal infections due to Escherichia coli: focus on an increasingly important endemic problem. Microbes Infect 5:449–456. doi: 10.1016/s1286-4579(03)00049-2 [DOI] [PubMed] [Google Scholar]
- 2. Overdevest I, Willemsen I, Rijnsburger M, Eustace A, Xu L, Hawkey P, Heck M, Savelkoul P, Vandenbroucke-Grauls C, van der Zwaluw K, Huijsdens X, Kluytmans J. 2011. Extended-spectrum β-lactamase genes of Escherichia coli in chicken meat and humans, The Netherlands. Emerg Infect Dis 17:1216–1222. doi: 10.3201/eid1707.110209 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Elder JR, Liu Y, Kanrar S, Gehring A, Xu A, Sommers CH, Johnston BD, Johnson JR. 2021. Complete genome sequence of Escherichia coli strain FEX669, a ColV plasmid-containing isolate from retail chicken meat. Microbiol Resour Announc 10:e01340-20. doi: 10.1128/MRA.01340-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8 [DOI] [PubMed] [Google Scholar]
- 5. Hunt M, Silva ND, Otto TD, Parkhill J, Keane JA, Harris SR. 2015. Circlator: automated circularization of genome assemblies using long sequencing reads. Genome Biol 16:294. doi: 10.1186/s13059-015-0849-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Sirén K, Millard A, Petersen B, Gilbert MTP, Clokie MRJ, Sicheritz-Pontén T. 2021. Rapid discovery of novel prophages using biological feature engineering and machine learning. NAR Genom Bioinform 3:lqaa109. doi: 10.1093/nargab/lqaa109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Joensen KG, Tetzschner AMM, Iguchi A, Aarestrup FM, Scheutz F. 2015. Rapid and easy in silico serotyping of Escherichia coli isolates by use of whole-genome sequencing data. J Clin Microbiol 53:2410–2426. doi: 10.1128/JCM.00008-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Larsen MV, Cosentino S, Rasmussen S, Friis C, Hasman H, Marvig RL, Jelsbak L, Sicheritz-Pontén T, Ussery DW, Aarestrup FM, Lund O. 2012. Multilocus sequence typing of total-genome-sequenced bacteria. J Clin Microbiol 50:1355–1361. doi: 10.1128/JCM.06094-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O, Aarestrup FM, Larsen MV. 2012. Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother 67:2640–2644. doi: 10.1093/jac/dks261 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Malberg Tetzschner AM, Johnson JR, Johnston BD, Lund O, Scheutz F. 2020. In silico genotyping of Escherichia coli isolates for extraintestinal virulence genes by use of whole-genome sequencing data. J Clin Microbiol 58:e0126920. doi: 10.1128/JCM.01269-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Johnson JR, Murray AC, Gajewski A, Sullivan M, Snippes P, Kuskowski MA, Smith KE. 2003. Isolation and molecular characterization of nalidixic acid-resistant extraintestinal pathogenic Escherichia coli from retail chicken products. Antimicrob Agents Chemother 47:2161–2168. doi: 10.1128/AAC.47.7.2161-2168.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The whole-genome project has been deposited in BioProject number PRJNA642167, Sequence
Read Archive (SRA) number SRR29021894 BioSample number SAMN41394341
with the GenBank accession number CP156034 for the DP51 chromosome and CP156035, CP156036 and CP156037 for the plasmids.
