ABSTRACT
Uropathogenic Escherichia coli (UPEC) remains the main etiological agent of urinary tract infections affecting females and males. The draft genome sequence of three strains of UPEC isolated from senior citizens and pregnant women in the state of Puebla, Mexico, is reported here.
KEYWORDS: genome sequence, genome analysis, uropathogenic E. coli, virulence factors, antibiotic resistance, evolution
ANNOUNCEMENT
Urinary tract infections (UTI) are the most common bacterial infections, affecting over 150 million people worldwide each year (1, 2). Uropathogenic Escherichia coli (UPEC) is the main causative agent of UTI.
UPEC 123-I, 129-I, and 42GP, isolated from urine from patients with UTI and with risk factors, belong to a strain collection previously characterized. The strains possess classical UPEC virulence genes (Table 1) and were originally isolated by plating 10 µL of urinary sample on MacConkey agar and incubating aerobically overnight at 37°C. A lactose-positive colony was selected for identification by conventional biochemical test (3–5). Each strain, maintained at −70°C, was plated on MacConkey agar, and one lactose-positive colony was inoculated in 3 mL of LB broth and incubated overnight at 37°C. Genomic DNA was isolated using the Wizard Genomic DNA Purification Kit (Promega, USA). All protocols were conducted following the manufacturer’s instructions. The genomic DNA was resolved on 0.5% agarose gel to determine its integrity.
TABLE 1.
Characteristics of three strains used in the present study and their virulence genesa
| Strain | Sex | Patient age | Risk factor for UTI | Virulence genes | Reference |
|---|---|---|---|---|---|
| 123-I | Female | 75 | Older age | fimH, iucD, hlyA, sat, cnf-1, traT | (3) |
| 129-I | Male | 73 | Older age | fimH, iha, fliC, iucD, hlyA, cnf-1, traT | (4) |
| 42GP | Woman | 25 | Pregnant | fimH, iha, fliC, iucD, hlyA, sat, cnf-1, traT | (5) |
fimH, pili fimbriae adhesin type 1; iha, IrgA/siderophore homologous adhesin; fliC, flagellin; iucD, aerobactin; hlyA, α-hemolysin; sat, secreted autotransporter toxin; cnf-1, cytotoxic necrotizing factors; and traT, serum resistance.
Genomes were sequenced using both Illumina and Nanopore Technologies; in each case, distinct DNA extraction was used. Illumina DNA Prep Kit (Illumina, USA) was used to prepare the libraries, using IDT for Illumina Indexes with a target insert size of 460 bp. The NextSeq 500 platform was used for paired-end sequencing. The read length of the sequences was 2 × 149 paired end. Strains 123-I , 129-I, and 42GP have 8,936,357, 6,567,253, and 5,454,188 paired reads, respectively, with 530× (123-I and 42GP) and 321× (129-I) of depth.
The Native Barcoding 96 kit (SQK-NBD112.96) and Guppy base caller (Oxford Nanopore Technologies, USA) were used for sequencing with Oxford Nanopore Technology. The DNA was not fragmented or size selected for library construction; there was only a wash at the end of adapter ligation using the Large Fragment Buffer. The N50 for strain 123-I was 6,955 bp; for 129-I was 14,043 bp; and for 42GP was 4,220 bp. These strains have 783,911 (123-I), 143,194 (129-I), and 340,863 (42GP) reads. In the following analysis, default settings were used unless otherwise noted. For quality control and error correction of the raw reads, FastQC (v0.12.1) and trimmomatic (v0.36) were used (6, 7). De novo assembly of the reads was performed by SPAdes (v3.15.4) (8). The quality of assemblies was determined using QUAST (v4.6.0) (9). A total of 7 (123-I), 15 (129-I), and 12 (42GP) contigs were removed as they were below 200 bp in length. Strain 123-I has a genome size of 5,492,714 bp (19 contigs); 129-I has 5,466,769 bp (41 contigs); and 42GP has 5,333,327 bp (18 contigs). The genomes have an N50 of 2,445,443 (123-I), 3,209,856 (129-I), and 3,707,725 bp (42GP). The strain 123-I has 50.71% of GC, while strains 129-I and 42GP have 50.69% of GC. The annotation was carried out following the Prokaryotic Genome Annotation Pipeline v6.6 of the NCBI (10).
ACKNOWLEDGMENTS
The present work was supported by Consejo Nacional de Humanidades, Ciencia y Tecnología (CONAHCYT) through the project "Estudio de la plataforma genética involucrada en la transferencia de la resistencia a antimicrobianos y genes de virulencia en clonas de Escherichia coli de interés médico y alimentario,” CONAHCYT A1-S-22136 and by Vicerrectoría de Investigación y Estudios de Posgrado, BUAP through the project number VIEP 00055-2023 “Análisis del Genoma de E. coli Uropatógena y Comensal para la Prevención, Control y Tratamiento Adecuados de la Infección de Tracto Urinario” I.M.C.-d.l.P. receipt, CONAHCYT scholarship number 911787.
Contributor Information
Margarita M. P. Arenas-Hernández, Email: margarita.arenas@correo.buap.mx.
David Rasko, University of Maryland School of Medicine, Baltimore, Maryland, USA.
DATA AVAILABILITY
The draft genomes of UPEC 123-I, 129-I, and 42GP were deposited in the National Center for Biotechnology Information (NCBI) under the accession numbers JBAISI000000000, JAZGOG000000000, and JAZGOH000000000, respectively. In addition, all raw reads were submitted to NCBI’s SRA under the accession numbers SRR27907043 and SRR27889912 for strain 123-I; SRR27899061 and SRR27899060 for strain 129-I; and SRR27899059 and SRR27899058 for strain 42GP.
ETHICS APPROVAL
The study protocol for 123-I and 129-I was approved by the Ethical Committee of Hospital Regional del ISSSTE, Puebla, number 188-2018. For 42GP, study approval by an ethics committee was not required because the strain was donated by a clinical laboratory in Puebla.
REFERENCES
- 1. Nielubowicz GR, Mobley HLT. 2010. Host–pathogen interactions in urinary tract infection. Nat Rev Urol 7:430–441. doi: 10.1038/nrurol.2010.101 [DOI] [PubMed] [Google Scholar]
- 2. Öztürk R, Murt A. 2020. Epidemiology of urological infections: a global burden. World J Urol 38:2669–2679. doi: 10.1007/s00345-019-03071-4 [DOI] [PubMed] [Google Scholar]
- 3. Ixtepan-Tejero C. 2018. Características de virulencia, resistencia Y transferencia del material Genético en Cepas de E. coli uropatógena Undergraduated thesis, BUAP, Puebla, Mexico. https://repositorioinstitucional.buap.mx/items/4db1a3ec-15d0-4ba5-b701-4220202f79ce. [Google Scholar]
- 4. Huelgas-Méndez D. 2020. Clasificación Filogenética, Virulencia Y Resistencia Presenta Undergraduated thesis, BUAP, Puebla, Mexico. https://repositorioinstitucional.buap.mx/items/44080abb-7f87-4e90-80ad-129a08f4bdd6. [Google Scholar]
- 5. Ballesteros-Monrreal MG, Arenas-Hernández MM, Enciso-Martínez Y, Martínez-de la Peña CF, Rocha-Gracia RDC, Lozano-Zaraín P, Navarro-Ocaña A, Martínez-Laguna Y, de la Rosa-López R. 2020. Virulence and resistance determinants of uropathogenic Escherichia coli strains isolated from pregnant and non-pregnant women from two States in Mexico. Infect Drug Resist 13:295–310. doi: 10.2147/IDR.S226215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Andrews S. Babraham bioinformatics - FastQC a quality control tool for high throughput sequence data. Available from: https://www.bioinformatics.babraham.ac.uk/projects/fastqc. Retrieved 17 Apr 2021.
- 7. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The draft genomes of UPEC 123-I, 129-I, and 42GP were deposited in the National Center for Biotechnology Information (NCBI) under the accession numbers JBAISI000000000, JAZGOG000000000, and JAZGOH000000000, respectively. In addition, all raw reads were submitted to NCBI’s SRA under the accession numbers SRR27907043 and SRR27889912 for strain 123-I; SRR27899061 and SRR27899060 for strain 129-I; and SRR27899059 and SRR27899058 for strain 42GP.
