Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2023 May 31;12(6):e00253-23. doi: 10.1128/mra.00253-23

Complete Genome Sequences of Subcluster C1 Mycobacteriophages Blackbrain, Cactojaque, Kboogie, Trinitium, and YoungMoneyMata

Madison M Moore a, Mary A Ayuk a, Amber A Johnson a, Triniti Sims a, Sanaa Haamen a, Ramata Haidara a, Adrian D Allen a, Leon A Dickson a, Somiranjan Ghosh a, Ayele Gugssa a, Hemayet Ullah a, Glory B Bassey a, Lourds M Fernando a, Laricca Y London b, Esohe G Irabor a, Swagota D Roy a, Benedict K Quagraine a, Michael Smith a; Howard University SEA-PHAGES Studentsa, Winston A Anderson a, Courtney J Robinson a,
Editor: John J Dennehyc
PMCID: PMC10281172  PMID: 37255446

ABSTRACT

Five subcluster C1 mycobacteriophages, Blackbrain, Cactojaque, Kboogie, Trinitium, and YoungMoneyMata, were isolated from soil using the host Mycobacterium smegmatis mc2155. The genome sizes range from 154,512 to 156,223 bp. The largest genome encodes 237 predicted proteins, 34 tRNAs, and 1 transfer-messenger RNA (tmRNA).

ANNOUNCEMENT

With the goal of studying bacteriophage evolution and diversity, students in the Science Education Alliance—Phage Hunters Advancing Genomics and Evolutionary Sciences (SEA-PHAGES) program isolated five mycobacteriophages (Blackbrain, Cactojaque, Kboogie, Trinitium, and YoungMoneyMata) from soil collected from Howard University’s campus (Blackbrain and Cactojaque: 38°55′11.0″N, 77°01′10.0″W; Kboogie: 38°55′22.0″N, 77°01′07.0″W; Trinitium: 38°55′18.5″N, 77°01′09.0″W; and YoungMoneyMata: 38°55′21.7″N, 77°01′14.9″W) (1, 2).

As described previously, soil filtrates were generated by adding 7H9 broth to soil samples, shaking at 37°C for 2 h, and then filtering (pore size, 0.22 μm) (3). Mycobacterium smegmatis mc2155 cells were added to the filtrates, and the samples were incubated aerobically at 37°C for 48 h with shaking. The samples were then refiltered and plated with M. smegmatis mc2155 as the host. Three rounds of serial dilution were conducted for each plaque morphology of interest for purification. After single phage populations were obtained, genomic DNA was isolated from the cell lysate using the Promega Wizard DNA cleanup kit (2). Sequencing libraries were prepared using the Ultra II FS kit with dual-indexed barcoding (New England Biolabs). An Illumina MiSeq instrument was used to sequence the pooled libraries, yielding 150-base single-end reads (Blackbrain, 326,268; YoungMoneyMata, 178,427; Kboogie, 521,155; Trinitium, 167,263; and Cactojaque, 384,435). The genome reads were trimmed and assembled using Newbler v2.9, yielding a single contig for each genome. Identification of the genomic termini was completed using Consed v29.0 (46). The phages all had circularly permuted genomes.

DNA Master v5.23.3 (http://cobamide2.bio.pitt.edu/computer.htm) and the Phage Evidence Collection and Annotation Network (PECAAN) (7) were used to annotate the genomes (https://blog.kbrinsgd.org/overview/). The presence of genes and their starting points were predicted and selected using GLIMMER v3.0, GeneMark v2.5, and Starterator v1.1 (https://github.com/SEA-PHAGES/starterator) (8, 9). Predicted gene functions were assigned utilizing Phamerator Actino_Draft v402 (https://phamerator.org), BLAST v2.11.0+, PhagesDB, HHpred v3.0, and the NCBI Conserved Domain Database v3 (1014). tRNAs and transfer-messenger RNAs (tmRNAs) were detected using ARAGON v1.2.38 and tRNAscan-SE v2.0 (15, 16). PhagesDB and the NCBI SRA Taxonomy Analysis tool were used to assign taxonomy and cluster affiliations based on nucleotide similarity (12, 17). Default parameters were used for all software. Genome information is available in Table 1.

TABLE 1.

Genome characteristics and accession numbers of five C1 mycobacteriophages

Phage name Sequencing coverage (×) GenBank accession no. SRA accession no. Genome length (bp) G+C content (%) No. of genes (ORFs)a No. of tRNAs Top BLASTn match (GenBank accession no.; % identity)
Blackbrain 298 MK878897 SRX10013851 155,713 64.6 232 34 Cactojaque (MN428065.1; 99.83)
Cactojaque 354 MN428065 SRX10013856 154,918 64.6 228 34 Janiyra (MT818423.1; 99.67)b
Kboogie 473 MN234193 SRX10013841 155,680 64.7 237 34 Wally (JN699625.1; 99.74)
Trinitium 154 MN276192 SRX10013852 154,512 64.7 232 34 Quasimodo (NC_054721.1; 99.69)
YoungMoneyMata 265 MN183285 SRX10013855 156,223 64.7 235 33 JustHall (MK359333.1; 99.37)
a

ORFs, open reading frames.

b

Phage Janiyra’s genome comparison to Cactojaque’s genome resulted in a higher maximum score than the comparison to Blackbrain.

Nucleotide similarity analysis determined that the phages belong to subcluster C1 and have myovirus morphology (17). The genome sizes range from 154,512 to 156,223 bp (Table 1). The largest number of open reading frames was found in the Kboogie genome, which is the third largest of the genomes (Table 1). The genomic characteristics were consistent with other C1 genomes based on current PhagesDB entries, with each of the five having 1 tmRNA, 64.6% or 64.7% G+C content, and all except one having 34 tRNAs (Table 1) (12). Also consistent with other C1 phages, the genomes are highly similar to each other (18). Blackbrain and Cactojaque were most similar (nucleotide identity, 99.83%), and the most different genomes were YoungMoneyMata and Cactojaque (nucleotide identity, 99.19%). Despite the high nucleotide similarities across the genomes, Phamerator analysis revealed two distinct areas with multiple instances of low similarity (Fig. 1).

FIG 1.

FIG 1

Excerpts of genome maps (7650bp to 14000bp (a) and 133,850bp to 141,000bp (b)) generated using Phamerator displaying areas of low nucleotide similarity in the genomes of phages Blackbrain, Cactojaque, Kboogie, Trinitium, and YoungMoneyMata (10). Violet shading on the maps indicates 100% nucleotide identity, while blue, yellow, and red indicate decreasing nucleotide identity and white signifies no nucleotide identity (10).

Data availability.

The complete genome sequences and raw sequencing reads are available at GenBank and the NCBI Sequence Read Archives, respectively. The accession numbers are provided in Table 1.

ACKNOWLEDGMENTS

We thank Rebecca A. Garlena, Debra Jacobs-Sera, Welkin Pope, and Daniel A. Russell for conducting genome sequencing, assembly, and quality control at the University of Pittsburgh. We acknowledge the SEA-PHAGES faculty team, who provided review of the genome annotations. We also thank Kedus Ashagre, Gracie Charles, Shalom Entner, Tajahane Francis, Kayla Johnson, Ky’Lyn Lampkin, Rainaflore Louis, Lourd Norris, and Jerome J. Oliver for their contributions to this research.

This work was supported by the Howard Hughes Medical Institute (grant number 52008098) and the Science Education Alliance—Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) program. Additional support was provided by the Department of Biology at Howard University and the Department of Biological and Environmental Sciences at Alabama A&M University.

Howard University SEA-PHAGES student coauthors were Comfort Y. Adegboye, Alexandria Adigun, Brianna Adams, Deborah Adu, Sirri Akaya, Colette Akhimien, Akiria Anderson, Sydney Andrade, Ahlina C. Archibald, Amia Black, Denia L. Breaux, Olivia R. Charles, Laraine Cheung, Jirachat Chinboot, Maya Dawson, Jielle Denton, Kirk Frederick, Jarius A. Garner, SeighViance Givens, Nazareth N. Henry, Charnae’ Henry-Smith, Imani M. Hernandez, Niaya James, Syntyche Kane, Devante Kerr, Camille A. King, Camryn Lam, Briana Louis, Xavier Luellen, Trinity S. Malone, Bria Morton-Lane, Esther Nwozo, Toluwani Odu, Antonia Oduguwa, Kasyah Richardson, Delandra M. Robinson, Dylan Salters, Rajon Scott, Lauren Williams, Jalen Williams, Marc Williams, and Maya Wilson.

Contributor Information

Courtney J. Robinson, Email: Courtney.robinson@howard.edu.

John J. Dennehy, Queens College Department of Biology

Collaborators: Comfort Y. Adegboye, Alexandria Adigun, Brianna Adams, Deborah Adu, Sirri Akaya, Colette Akhimien, Akiria Anderson, Sydney Andrade, Ahlina C. Archibald, Amia Black, Denia L. Breaux, Olivia R. Charles, Laraine Cheung, Jirachat Chinboot, Maya Dawson, Jielle Denton, Kirk Frederick, Jarius A. Garner, SeighViance Givens, Nazareth N. Henry, Charnae’ Henry-Smith, Imani M. Hernandez, Niaya James, Syntyche Kane, Devante Kerr, Camille A. King, Camryn Lam, Briana Louis, Xavier Luellen, Trinity S. Malone, Bria Morton-Lane, Esther Nwozo, Toluwani Odu, Antonia Oduguwa, Kasyah Richardson, Delandra M. Robinson, Dylan Salters, Rajon Scott, Lauren Williams, Jalen Williams, Marc Williams, and Maya Wilson

REFERENCES

  • 1.Jordan TC, Burnett SH, Carson S, Caruso SM, Clase K, DeJong RJ, Dennehy JJ, Denver DR, Dunbar D, Elgin SCR, Findley AM, Gissendanner CR, Golebiewska UP, Guild N, Hartzog GA, Grillo WH, Hollowell GP, Hughes LE, Johnson A, King RA, Lewis LO, Li W, Rosenzweig F, Rubin MR, Saha MS, Sandoz J, Shaffer CD, Taylor B, Temple L, Vazquez E, Ware VC, Barker LP, Bradley KW, Jacobs-Sera D, Pope WH, Russell DA, Cresawn SG, Lopatto D, Bailey CP, Hatfull GF. 2014. A broadly implementable research course in phage discovery and genomics for first-year undergraduate students. mBio 5:e01051-13. doi: 10.1128/mBio.01051-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Poxleitner M, Pope W, Jacobs-Sera D, Sivanathan V, Hatfull GF. 2018. Phage discovery guide. Howard Hughes Medical Institute, Chevy Chase, MD. [Google Scholar]
  • 3.London LY, Ayuk MA, Effiom D, Fashina F, Louis BJ, Tolsma SS, Allen AD, Dickson LA, Ghosh S, Gugssa A, Ullah H, Bassey GB, Fernando LM, Moore MM, Oliver JJ, Irabor EG, Roy SD, Quagraine BK, Smith M, Anderson WA, RobInson CJ, Howard University SEA-PHAGES Students . 2021. Complete genome sequences of mycobacteriophages Dallas and Jonghyun. Microbiol Resour Announc 10:e0030421. doi: 10.1128/MRA.00304-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer MLI, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, et al. 2005. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437:376–380. doi: 10.1038/nature03959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gordon D, Green P. 2013. Consed: a graphical editor for next-generation sequencing. Bioinformatics 29:2936–2937. doi: 10.1093/bioinformatics/btt515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Russell DA. 2018. Sequencing, assembling, and finishing complete bacteriophage genomes. Methods Mol Biol 1681:109–125. doi: 10.1007/978-1-4939-7343-9_9. [DOI] [PubMed] [Google Scholar]
  • 7.Rinehart CA, Gaffney B, Wood JD, Smith J. 2016. PECAAN, a Phage Evidence Collection And Annotation Network. https://discover.kbrinsgd.org/login. [Google Scholar]
  • 8.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Besemer J, Borodovsky M. 2005. GeneMark: Web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395–408. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 12.Russell DA, Hatfull GF. 2017. PhagesDB: the actinobacteriophage database. Bioinformatics 33:784–786. doi: 10.1093/bioinformatics/btw711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, Geer RC, He J, Gwadz M, Hurwitz DI, Lanczycki CJ, Lu F, Marchler GH, Song JS, Thanki N, Wang Z, Yamashita RA, Zhang D, Zheng C, Bryant SH. 2015. CDD: NCBI’s conserved domain database. Nucleic Acids Res 43:D222–D226. doi: 10.1093/nar/gku1221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Katz KS, Shutov O, Lapoint R, Kimelman M, Brister JR, O'Sullivan C. 2021. STAT: a fast, scalable, MinHash-based k-mer tool to assess Sequence Read Archive next generation sequence submissions. Genome Biol 22:270. doi: 10.1186/s13059-021-02490-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Hatfull GF, Jacobs-Sera D, Lawrence JG, Pope WH, Russell DA, Ko C-C, Weber RJ, Patel MC, Germane KL, Edgar RH, Hoyte NN, Bowman CA, Tantoco AT, Paladin EC, Myers MS, Smith AL, Grace MS, Pham TT, O'Brien MB, Vogelsberger AM, Hryckowian AJ, Wynalek JL, Donis-Keller H, Bogel MW, Peebles CL, Cresawn SG, Hendrix RW. 2010. Comparative genomic analysis of 60 mycobacteriophage genomes: genome clustering, gene acquisition, and gene size. J Mol Biol 397:119–143. doi: 10.1016/j.jmb.2010.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The complete genome sequences and raw sequencing reads are available at GenBank and the NCBI Sequence Read Archives, respectively. The accession numbers are provided in Table 1.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES