Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2025 Nov 10;14(12):e00867-25. doi: 10.1128/mra.00867-25

Complete genome sequence of a Kayfunavirus bacteriophage identified in Blantyre, Malawi

Chimwemwe Mhango 1,2,3,, Wanangwa Ndovie 4, Allan Zuza 3,5, End Chinyama 3, Flywell Kawonga 2,3, Ernest Matambo 2,3,6, Benjamin Kumwenda 1, Arox W Kamng'ona 1, Chrispin Chaguza 7, Martin M Nyaga 8, Celeste M Donato 9, Khuzwayo C Jere 2,3,10
Editor: Kenneth M Stedman11
PMCID: PMC12697162  PMID: 41211825

ABSTRACT

We report a bacteriophage from Malawi recovered during rotavirus RNA sequencing. It shares 84.3% nucleotide identity with MH400309, a Kayfunavirus bacteriophage. This incidental finding shows RNA virus sequencing can also detect bacteriophages, offering insights into the broader viral diversity in clinical and environmental samples.

KEYWORDS: Kayfunavirus, bacteriophage, DNA sequencing, DNA assembly

ANNOUNCEMENT

Gastroenteritis remains a significant public health challenge, especially in children under 5 years of age living in low-income settings, and is caused by a range of pathogens, including viruses, bacteria, and parasites. Rotavirus is the leading cause of severe viral gastroenteritis worldwide, though co-infections with bacterial pathogens such as Escherichia coli are common and may complicate clinical outcomes (1, 2). The gut microbiome’s viral components, including bacteriophages (viruses that specifically infect bacteria), play essential roles in shaping bacterial population dynamics, influencing bacterial behavior, and potentially modulating host-pathogen interactions (35). Here, we report the complete genome sequence of a Kayfunavirus bacteriophage assembled during the de novo assembly of rotavirus genomes from a stool sample.

A stool sample was collected on 24 October 2019 from a child presenting with acute gastroenteritis at the Queen Elizabeth Central Hospital, Blantyre, Malawi. We screened the stool sample for rotavirus using an enzyme immunoassay (Rotaclone; Meridian Bioscience, Cincinnati, OH, USA). Nucleic acid was then extracted from the rotavirus-positive stool sample using the Trizol-chloroform method for double-strand ribonucleic acid (dsRNA) (6, 7). We enriched for dsRNA using lithium chloride precipitation (8). We further used DNase I (Sigma-Aldrich, Dorset, UK) to remove contaminating DNA (7) before cDNA synthesis (6). We used the cDNA to prepare sequencing libraries using the Nextera DNA Flex Library Preparation Kit (Illumina, San Diego, CA, USA) and sequenced them on an Illumina MiSeq platform with a V3 reagent kit, as previously described (8, 9). A total of 417,142 raw paired-end reads were generated from the MiSeq run.

For the data analysis, all tools were run with default parameters unless otherwise specified. Raw sequence reads were assessed for quality using multiQC v.1.21 (10) and trimmed to remove low-quality bases and adapters with Trimmomatic v.0.39 (11). After quality control, 408,696 paired-end reads were retained and assembled de novo using metaSPAdes v.3.15.5 (12). The resulting assemblies were annotated using Genome Detective (13), revealing a 39,229 bp contig with hallmark bacteriophage genomic features. Completeness and contamination of the contig was evaluated using checkV v.0.7.0 (14) (Table 1). To validate the discovery, we reassembled the raw reads using a different short-read metagenomic pipeline on the CZ-ID platform (15).

TABLE 1.

Summary of the complete bacteriophage genome, gene prediction, and life cycle characteristics

Sample_id CHX131
NCBI accession PV611674.1
Contigs 1
Assembly length (bp) 39,229 bp
Number of mapped reads 38,884
Source Homo sapiens
Assembly method De novo
Average sequencing depth 96.5×
Sequencing platform Illumina Miseq
Read type Paired end
Average read length (bp) 137.5
G + C content (%) 49.96
No. of genes 50
Viral genes 45
No. of annotated CDS 44
Completeness (%) 100
Contamination (%) 0
Life cycle Virulent
Genus Kayfunavirus

Open reading frames were predicted using Prodigal with the meta option v.2.6.3 (16), resulting in the identification of 50 genes, of which 45 were specific to viral functions. The predicted genes were annotated using Bakta v.2.14.1 (17). Additionally, Life Cycle Classifier v.1.6.0 in phageAI (18) was employed to assign the phage’s lifestyle (Table 1). Comparative analyses using taxMyPhage v.0.3.5 (19) revealed an average nucleotide identity of 83.9% between the assembled genome and MH400309, a Kayfunavirus genus.

The discovery of this bacteriophage highlights the hidden diversity of organisms within the gut virome and underscores the importance of using unbiased assembly pipelines to explore viral diversity. Future work will focus on recovering this bacteriophage from the fecal material and characterizing its biological properties.

ACKNOWLEDGMENTS

This work was conducted as part of the Sequencing and Antigenic Cartography for Enteric Viruses project, which is funded by the Gates Foundation (Investment ID: INV-046917). This study was conducted independently of the funders, who were not involved in the design, data collection, analysis, interpretation, or the decision to publish the findings. The authors received no financial compensation or other incentives related to the preparation of this article. The conclusions and interpretations presented are those of the authors alone and do not necessarily reflect the official policies or positions of the funders.

Contributor Information

Chimwemwe Mhango, Email: chim@liverpool.ac.uk.

Kenneth M. Stedman, Portland State University, Portland, Oregon, USA

DATA AVAILABILITY

The complete genome of this bacteriophage has been deposited at GenBank under the accession number in Table 1. The raw sequencing reads generated for this study have been deposited in the National Center for Biotechnology Information Sequence Read Archive under accession number SRX30043303 and BioProject accession number PRJNA1303374.

ETHICS APPROVAL

This study was conducted according to the guidelines of the Declaration of Helsinki and approved by the National Health Sciences Research Committee, Lilongwe, Malawi (#867).

REFERENCES

  • 1. Iturriza-Gómara M, Jere KC, Hungerford D, Bar-Zeev N, Shioda K, Kanjerwa O, Houpt ER, Operario DJ, Wachepa R, Pollock L, Bennett A, Pitzer VE, Cunliffe NA. 2019. Etiology of diarrhea among hospitalized children in blantyre, malawi, following rotavirus vaccine introduction: a case-control study. J Infect Dis 220:213–218. doi: 10.1093/infdis/jiz084 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Omotade TI, Babalola TE, Anyabolu CH, Japhet MO. 2023. Rotavirus and bacterial diarrhoea among children in Ile-Ife, Nigeria: burden, risk factors and seasonality. PLoS One 18:e0291123. doi: 10.1371/journal.pone.0291123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Zuppi M, Hendrickson HL, O’Sullivan JM, Vatanen T. 2021. Phages in the gut ecosystem. Front Cell Infect Microbiol 11:822562. doi: 10.3389/fcimb.2021.822562 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Shuwen H, Kefeng D. 2022. Intestinal phages interact with bacteria and are involved in human diseases. Gut Microbes 14:2113717. doi: 10.1080/19490976.2022.2113717 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Mills S, Shanahan F, Stanton C, Hill C, Coffey A, Ross RP. 2013. Movers and shakers. Gut Microbes 4:4–16. doi: 10.4161/gmic.22371 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Jere KC, Mlera L, O’Neill HG, Potgieter AC, Page NA, Seheri ML, van Dijk AA. 2011. Whole genome analyses of African G2, G8, G9, and G12 rotavirus strains using sequence-independent amplification and 454 pyrosequencing. J Med Virol 83:2018–2042. doi: 10.1002/jmv.22207 [DOI] [PubMed] [Google Scholar]
  • 7. Jere KC, Chaguza C, Bar-Zeev N, Lowe J, Peno C, Kumwenda B, Nakagomi O, Tate JE, Parashar UD, Heyderman RS, French N, Cunliffe NA, Iturriza-Gomara M. 2018. Emergence of double- and triple-gene reassortant G1P[8] rotaviruses possessing a DS-1-like backbone after rotavirus vaccine introduction in Malawi. J Virol 92:e01246-17. doi: 10.1128/JVI.01246-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Mhango C, Banda A, Chinyama E, Mandolo JJ, Kumwenda O, Malamba-Banda C, Barnes KG, Kumwenda B, Jambo KC, Donato CM, Esona MD, Mwangi PN, Steele AD, Iturriza-Gomara M, Cunliffe NA, Ndze VN, Kamng’ona AW, Dennis FE, Nyaga MM, Chaguza C, Jere KC. 2023. Comparative whole genome analysis reveals re-emergence of human Wa-like and DS-1-like G3 rotaviruses after Rotarix vaccine introduction in Malawi. Virus Evol 9:vead030. doi: 10.1093/ve/vead030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Mwangi PN, Page NA, Seheri ML, Mphahlele MJ, Nadan S, Esona MD, Kumwenda B, Kamng’ona AW, Donato CM, Steele DA, Ndze VN, Dennis FE, Jere KC, Nyaga MM. 2022. Evolutionary changes between pre- and post-vaccine South African group A G2P[4] rotavirus strains, 2003-2017. Microb Genom 8:000809. doi: 10.1099/mgen.0.000809 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Ewels P, Magnusson M, Lundin S, Käller M. 2016. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32:3047–3048. doi: 10.1093/bioinformatics/btw354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. 2017. metaSPAdes: a new versatile metagenomic assembler. Genome Res 27:824–834. doi: 10.1101/gr.213959.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Vilsker M, Moosa Y, Nooij S, Fonseca V, Ghysens Y, Dumon K, Pauwels R, Alcantara LC, Vanden Eynden E, Vandamme A-M, Deforche K, de Oliveira T. 2019. Genome detective: an automated system for virus identification from high-throughput sequencing data. Bioinformatics 35:871–873. doi: 10.1093/bioinformatics/bty695 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. 2021. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol 39:578–585. doi: 10.1038/s41587-020-00774-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Kalantar KL, Carvalho T, de Bourcy CFA, Dimitrov B, Dingle G, Egger R, Han J, Holmes OB, Juan Y-F, King R, et al. 2020. IDseq-An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring. Gigascience 9:giaa111. doi: 10.1093/gigascience/giaa111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:1–11. doi: 10.1186/1471-2105-11-119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Schwengers O, Jelonek L, Dieckmann MA, Beyvers S, Blom J, Goesmann A. 2021. Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microb Genom 7:000685. doi: 10.1099/mgen.0.000685 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Tynecki P, Guziński A, Kazimierczak J, Jadczuk M, Dastych J, Onisko A. 2020. PhageAI - bacteriophage life cycle recognition with machine learning and natural language processing. bioRxiv. doi: 10.1101/2020.07.11.198606 [DOI]
  • 19. Millard A, Denise R, Lestido M, Thomas MT, Webster D, Turner D, Sicheritz-Pontén T. 2025. taxMyPhage: automated taxonomy of dsdna phage genomes at the genus and species level. PHAGE (New Rochelle) 6:5–11. doi: 10.1089/phage.2024.0050 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The complete genome of this bacteriophage has been deposited at GenBank under the accession number in Table 1. The raw sequencing reads generated for this study have been deposited in the National Center for Biotechnology Information Sequence Read Archive under accession number SRX30043303 and BioProject accession number PRJNA1303374.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES