ABSTRACT
We report the genome sequences of four Enterococcus faecium phages isolated from environmental wastewater in Kenya. They are double-stranded DNA phages with genomes varying in length from 42,231 to 43,348 bp, with G+C contents ranging from 34.96% to 35.2%. The genomes contain 78–82 coding sequences.
KEYWORDS: bacteriophage, Enterococcus faecium, Kenya, multidrug resistance, phage therapy
ANNOUNCEMENT
Enterococcus faecium is an effective nosocomial, multidrug-resistant (MDR) pathogen, causing severe human infections (1). Bacteriophages (phages) have regained interest as alternative therapeutics to address the increase in MDR bacteria (2). We present the complete genome sequences of four E. faecium phages isolated in Kenya.
Phages were isolated from wastewater samples collected from the Nairobi environs, Kenya, through an enrichment method using clinical isolates of MDR E. faecium as hosts, as described elsewhere with slight modifications (3). Briefly, 50 mL of wastewater was centrifuged (12,000 × g, 10 min), and the supernatant was filtered (0.22 µm). Eight milliliters of the supernatant was mixed with 2 mL of 5× tryptic soy broth (TSB; Oxoid Ltd., Basingstoke, Hampshire, England) and 50 µL of bacterial culture grown in 1× TSB (37°C, 200 rpm, 24 h). The mixture was incubated (37°C, 200 rpm, 24 h) and clarified through centrifugation. Phages were isolated using plaque assays (4) and purified by three rounds of single plaque isolation. Pure phages were propagated on the enrichment strain in TSB and concentrated by centrifugation (10,000 × g, 18 h) to achieve a titer of >1010 PFU/mL for DNA extraction (5). Host RNA and DNA were removed using RNase A and DNase I (ThermoFisher Scientific, USA), and deproteinization was achieved by adding Proteinase K and incubating at 56°C for 1 h 30 min (3). Genomic DNA was extracted using a Norgen phage DNA isolation kit (Norgen Biotek Corp., ON, Canada), following the manufacturer’s instructions. The genomic libraries were prepared using the Illumina Collibri PCR-free ES DNA library preparation kit (Invitrogen, CA, USA) and sequenced on an Illumina MiSeq instrument with a 600-cycle MiSeq reagent kit v3, producing 350-bp maximum paired-end reads. Sequence reads were quality-controlled using FastQC v0.12.1 (6), trimmed with fastp 0.23.4 (7), and assembled with shovill 1.1.0 (https://github.com/tseemann/shovill). The quality of the assembled sequences was assessed using QUAST 5.2.0 (8). CheckV v1.0.3 (9) predicted that the genomes have a completeness of 100%, and Bandage 0.8.1 (10) was used for genome assembly visualization. Genome annotation and detection of antimicrobial resistance (AMR) and virulence genes were performed using pharokka 1.5.1 (11). Taxonomic classification was performed using Mash alignment against the INPHARED database (12). PhageLeads (13) was used to predict lifecycle and PhageTerm version 1.0.11 (14) to determine phage termini and packaging mechanisms. Nucleic acid sequence similarity searches were performed using a web-based MegaBLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi). Default parameters were used for all software tools.
The phage genomes ranged from 42,231 to 43,348 bp, with G+C contents ranging from 34.96% to 35.2%. They have linear double-stranded DNA containing 78–82 coding sequences. All four phages were unclassified at the family and genus levels. Analysis of their DNA termini identified 3′ cohesive sequences, and the BLASTn results are shown in Table 1. All the genomes lacked predicted AMR, virulence, and lysogeny genes, indicating a strictly virulent lifecycle and promising candidates for phage therapy.
TABLE 1.
Genomic properties of the four E. faecium phagesa
Phage name | Host strain types | Sample collection site (GPS coordinates) | BLASTn results | Genome length (bp) | G+C content (%) | No. of CDSs | Genome coverage (×) | No. of raw reads | Termini | GenBank accession no. | SRA accession no. | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
% identity | Top hit | |||||||||||
Enterococcus phage vB_Efm3_KEN18 | EFM3 | Ruai Sewage treatment plant (1°14′26.6″S 37°01′01.9″E) |
90.38% | Phage BUCT630 (PP434460.1) | 43,348 | 35.16 | 82 | 68.16 | 231,931 | Cos (3′) | PP582173 | SRR28477793 |
93.74% | Phage Aramis (LR990833.1) | |||||||||||
93.69% | Phage dArtagnan (LR991625.1) | |||||||||||
Enterococcus phage vB_Efm3_KEN20 | EFM3 | Kahawa (1°11′58.7″S 36°55′16.9″E) |
94.72% | Phage Aramis (LR990833.1) | 42,434 | 34.96 | 81 | 69.82 | 463,670 | Cos (3′) | PP582174 | SRR28477792 |
94.38% | Phage dArtagnan (LR991625.1) | |||||||||||
90.75% | Phage IME-EFm1 (NC_024356.1) | |||||||||||
Enterococcus phage vB_Efm8_KEN21 | 80 | Ruai Sewage treatment plant (1°14′26.6″S 37°01′01.9″E) |
88.80% | Phage IME-EFm5 (NC_028826.1) | 42,231 | 35.19 | 80 | 67.30 | 197,902 | Cos (3′) | PP582175 | SRR28477791 |
88.18% | Phage IME-EFm1 (NC_024356.1) | |||||||||||
87.88% | Phage Aramis (LR990833.1) | |||||||||||
Enterococcus phage vB_Efm10_KEN22 | 2672 | Ruai Sewage treatment plant (1°14′26.6″S 37°01′01.9″E) |
94.89% | Phage Aramis (LR990833.1) | 42,487 | 35.2 | 78 | 69.85 | 110,640 | Unknown | PP582176 | SRR28477790 |
93.74% | Phage vB_Efm_LG62 (OP018674.1) | |||||||||||
93.58% | Phage dArtagnan (LR991625.1) |
Cos (3′), 3′ cohesive sequences; CDSs, coding sequences; SRA, Sequence Read Archive.
ACKNOWLEDGMENTS
We acknowledge the research support from the African Union Commission, the AFRICA-ai-JAPAN Project Innovation Research Funds, and the WRAIR AMR surveillance project. We also acknowledge the KEMRI CDC for the sequencing support.
Footnotes
writing - original draft
Contributor Information
Lillian Musila, Email: Lillian.Musila@usamru-k.org.
Catherine Putonti, Loyola University, Chicago, Illinois, USA.
DATA AVAILABILITY
The genome sequences of the four E. faecium phages are available in GenBank and Sequence Read Archive under the accession numbers listed in Table 1.
REFERENCES
- 1. Zhou X, Willems RJL, Friedrich AW, Rossen JWA, Bathoorn E. 2020. Enterococcus faecium: from microbiological insights to practical recommendations for infection control and diagnostics. Antimicrob Resist Infect Control 9:130. doi: 10.1186/s13756-020-00770-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Canfield GS, Chatterjee A, Espinosa J, Mangalea MR, Sheriff EK, Keidan M, McBride SW, McCollister BD, Hang HC, Duerkop BA. 2023. Lytic bacteriophages facilitate antibiotic sensitization of Enterococcus faecium. Antimicrob Agents Chemother 65:e00143-21. doi: 10.1128/AAC.00143-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. D’Souza R, Filippov AA, Sergueev KV, He Y, Ward AM, Goglin K, Vashee S, Nikolich MP, Fouts DE. 2020. Complete genome sequence of broad-host-range Staphylococcus aureus myophage ESa1. Microbiol Resour Announc 9:e00730-20. doi: 10.1128/MRA.00730-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Mutai IJ, Juma AA, Inyimili MI, Nyachieo A, Nyamache AK. 2022. Efficacy of diversely isolated lytic phages against multi-drug resistant Enterobacter cloacae isolates in Kenya. Afr J Lab Med 11:1673. doi: 10.4102/ajlm.v11i1.1673 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Sergueev KV, Filippov AA, Farlow J, Su W, Kvachadze L, Balarjishvili N, Kutateladze M, Nikolich MP. 2019. Correlation of host range expansion of therapeutic bacteriophage Sb-1 with allele state at a hypervariable repeat locus. Appl Environ Microbiol 85:e01209-19. doi: 10.1128/AEM.01209-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Andrews S. 2010. Internet. FastQC: a quality control tool for high throughput sequence data. Available from: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/. Retrieved 21 Feb 2024.
- 7. Chen S, Zhou Y, Chen Y, Gu J. 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. doi: 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. 2021. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol 39:578–585. doi: 10.1038/s41587-020-00774-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics 31:3350–3352. doi: 10.1093/bioinformatics/btv383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Bouras G, Nepal R, Houtak G, Psaltis AJ, Wormald PJ, Vreugde S. 2023. Pharokka: a fast scalable bacteriophage annotation tool. Bioinformatics 39:btac776. doi: 10.1093/bioinformatics/btac776 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Cook R, Brown N, Redgwell T, Rihtman B, Barnes M, Clokie M, Stekel DJ, Hobman J, Jones MA, Millard A. 2021. INfrastructure for a PHAge REference Database: identification of large-scale biases in the current collection of cultured phage genomes. PHAGE (N Rochelle) 2:214–223. doi: 10.1089/phage.2021.0007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Yukgehnaish K, Rajandas H, Parimannan S, Manickam R, Marimuthu K, Petersen B, Clokie MRJ, Millard A, Sicheritz-Pontén T. 2022. PhageLeads: rapid assessment of phage therapeutic suitability using an ensemble machine learning approach. Viruses 14:342. doi: 10.3390/v14020342 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Garneau JR, Depardieu F, Fortier L-C, Bikard D, Monot M. 2017. PhageTerm: a tool for fast and accurate determination of phage termini and packaging mechanism using next-generation sequencing data. Sci Rep 7:8292. doi: 10.1038/s41598-017-07910-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequences of the four E. faecium phages are available in GenBank and Sequence Read Archive under the accession numbers listed in Table 1.