Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2013 Nov 27;1(6):e00847-13. doi: 10.1128/genomeA.00847-13

Complete Genome Sequences of 63 Mycobacteriophages

Graham F Hatfull a,; Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) Programb; KwaZulu-Natal Research Institute for Tuberculosis and HIV (K-RITH) Mycobacterial Genetics Coursea,c,d,a,c,d,a,c,d; University of California—Los Angeles Research Immersion Laboratory in Virologye; Phage Hunters Integrating Research and Education (PHIRE) Programa
PMCID: PMC3869317  PMID: 24285655

Abstract

Mycobacteriophages are viruses that infect mycobacterial hosts. The current collection of sequenced mycobacteriophages—all isolated on a single host strain, Mycobacterium smegmatis mc2155, reveals substantial genetic diversity. The complete genome sequences of 63 newly isolated mycobacteriophages expand the resolution of our understanding of phage diversity.

GENOME ANNOUNCEMENT

The bacteriophage population encompasses amazing genetic diversity, reflecting its vast size, dynamic nature, and ancient origins (14). The collection of sequenced mycobacteriophage genomes offers insights into phages that infect a common host, Mycobacterium smegmatis mc2155 (5, 6), and thus are presumed to be in genetic communication with one another. Comparative genomics reveals mosaicism as the fundamental architectural feature of phage genomes, which are constructed from segments exchanged among the phage population (7, 8). Nonetheless, simple DNA comparison identifies groups of genomes more similar to one another than to other phages, and these are referred to as clusters (9). The previously reported 223 mycobacteriophages assort into a total of 15 different clusters (clusters A through O) and eight singletons that have no close relatives (5, 1013). Several of the clusters can be divided into subclusters according to nucleotide sequence similarities (5). The total number of clusters, subclusters, and singletons is 36 (5).

Bacteriophage discovery and genomic characterization provide an effective platform for introducing students to authentic scientific research (9, 14, 15). The 63 phages described here were isolated and sequenced by undergraduate students in four programs: the Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) Program, the Phage Hunters Integrating Research and Education (PHIRE) Program, the Research Immersion Laboratory in Virology at UCLA, and the Mycobacterial Genetics Course at the University of KwaZulu-Natal, Durban, South Africa (Table 1). Students and faculty isolated, purified, named, extracted, and annotated each of the new phages.

TABLE 1.

Data for newly sequenced mycobacteriophage genomes

Phage name Cluster Accession no. Length (bp) GC% Institution(s)
ABCat E KF188414 76,131 63.0 Southern Connecticut State Universitya
Aeneas A1 JQ809703 53,684 63.6 Brigham Young Universitya
ArcherS7 C1 KC748970 156,558 64.7 Trinity Collegea
Arturo A4 JX307702 51,500 64.1 Hampden-Sydney Collegea
Astraea C1 KC691257 154,872 64.7 North Carolina State Universitya
Astro A8 JX015524.1 52,494 61.4 College of Charlestona
Ava3 C1 JQ911768 154,466 64.8 Calvin Collegea
Avani F2 JQ809702 54,470 61.0 Gettysburg Collegea
Bobi F1 KF114874 59,179 61.7 Purdue Universitya
Breeniome C1 KF006817 154,434 64.8 CUNY, Queens Collegea
Breezona L2 KC691254 76,652 58.9 Bucknell Universitya
Butters N KC576783 41,491 65.8 Lehigh Universitya
Catdawg O KF017002 72,108 65.4 Cabrini Collegea
Contagion E KF024732 74,533 63.1 Loyola Marymount Universitya
Crossroads L2 KF024731 76,129 58.9 Teacher workshop, University of Pittsburghd
Daenerys F1 KF017005 58,043 61.6 University of Pittsburgha
Dhanush A4 KC661271 51,373 63.9 University of Alabama at Birminghama
Dorothy F1 JX411620 58,866 61.4 Baylor Universitya
Dumbo E KC691255 75,736 63.0 University of Maine, Fort Kenta
Dylan O KF024730 69,815 65.4 University of KwaZulu-Natalb
ElTiger69 A5 JX042578 51,505 59.8 University of North Texasa
Fishburne P KC691256 47,109 67.3 College of Charlestona
Flux A4 JQ809701 51,370 63.9 Saint Joseph’s Universitya
Gizmo C1 KC748968 157,482 64.6 Illinois Wesleyan Universitya
Goose A10 JX307704 50,645 65.1 Del Mar Collegea
Hamulus F1 KF024723 57,155 61.8 University of California, Santa Cruza
HINdeR A7 KC661275 52,617 62.8 Miami Universitya
ICleared A4 JQ896627 51,440 63.9 University of Louisiana at Monroea
Jabbawokkie F2 KF017003 55,213 61.1 University of Pittsburgha
Job42 F1 KC661280 59,626 61.2 Providence Collegea
Jobu08 A3 KC661281 50,679 64.0 Washington State Universitya
KayaCho B4 KF024729 70,838 70.0 University of KwaZulu-Natala,b
LittleCherry A5 KF017001 50,690 60.9 College of William & Marya
MacnCheese K3 JX042579 61,567 67.3 Calvin Collegea
Marcell A1 JX307705 49,186 64.0 University of Maine, Machiasa
Medusa A4 KF024733 51,384 63.9 University of Louisiana at Monroea
Methuselah A3 KC661272 50,891 64.2 University of Maryland, Baltimore Countya
Muddy Single KF024728 48,228 58.8 University of KwaZulu-Natalb
Murphy E KC748971 76,179 62.9 Ohio State Universitya
Newman B1 KC691258 68,598 66.5 University of Colorado at Bouldera
Odin A2 KF017927 52,807 62.3 University of California, Los Angelesc
PattyP A1 KC661273 52,057 63.6 Carthage Collegea
PegLeg M KC900379 80,955 61.5 University of California, Los Angelesc
Phaux E KC748969 76,479 62.9 Helena High School,d Queensboro Community Collegea
Phrux E KC661277 74,711 63.1 Miami Universitya
Rebeuca A10 JX411619 51,235 65.1 University of Texas at El Pasoa
Redno2 J KF114875 108,297 60.9 Virginia Commonwealth Universitya
Reprobate B5 KF024727 70,120 68.3 University of KwaZulu-Natalb
Sabertooth A4 JX307703 51,377 63.9 Culver-Stockton Collegea
SarFire A1 KF024726 53,701 63.8 University of KwaZulu-Natalb
SDcharge11 B1 KC661274 67,702 66.5 Loyola Marymount Universitya
Severus A10 KC661279 49,894 64.4 College of St. Scholasticaa
ShiVal B1 KC576784 68,355 66.5 Montclair State Universitya
Shrimp C1 KF024734 155,714 64.7 Illinois Wesleyan Universitya
SiSi F1 KC661278 56,279 61.5 Gonzaga Universitya
Taj F1 JX121091 58,550 61.9 University of Pittsburghd
Tiger A5 JQ684677 50,332 60.7 Oregon State Universitya
Trouble A1 KF024724 52,102 63.6 University of California, Santa Cruza
Twister A10 JQ512844 51,094 65.0 Virginia Commonwealth Universitya
Velveteen F1 KF017004 54,314 61.5 University of Pittsburgha
Wanda J KF006818 109,960 60.8 Lehigh Universitya
Whirlwind L3 KF024725 76,050 59.3 University of Pittsburghd
Winky L2 KC661276 76,653 58.9 Saint Joseph’s Universitya
a

SEA-PHAGES member institution.

b

K-RITH Mycobacterial Genetics Course.

c

UCLA Research Immersion Laboratory in Virology.

d

Phage Hunters Integrating Research and Education (PHIRE) Program, University of Pittsburgh.

Phage genomic DNA was sequenced using Ion Torrent, 454 pyrosequencing, or Illumina platforms with an average redundancy of 210-fold (range, 25- to 900-fold). Reads were assembled using Newbler (version 2.6), and the assembly quality was evaluated using Consed (version 20 or newer); targeted Sanger sequencing resolved weak areas where needed. Fifty-two of the genomes have defined ends, and the termini were determined by comparison to closely related genomes, examination of read density, and where necessary, Sanger sequencing on genomic DNA. Eleven genomes assembled circularly and are presumed to have circularly permuted terminally redundant ends, and position 1 was selected at the 5′ end of the most distally closely linked gene to the terminase large subunit (1 to 5 genes upstream in the five cluster B phages). For the six cluster C phages, position 1 was chosen for consistency with Bxz1 (8).

Phage genomes were annotated using DNA Master (http://cobamide2.bio.pitt.edu/computer.htm), Glimmer 3.0 (16), GeneMark (17, 18), Aragorn (19), tRNAscan-SE (20), and Phamerator (21), and gene functions were predicted using BLAST and HHPred (22). All sequences and student-annotated genomes were reviewed and revised as necessary at the University of Pittsburgh.

Of the 63 genomes, one (Fishburne) joins a singleton to form cluster P, one (Muddy) is a new singleton, and four add new subclusters, two in A10 and one each in B5 and L5 (Table 1). The 285 sequenced mycobacteriophage genomes form a total of 39 clusters or subclusters and 8 singletons.

Further information on these phage genomes is available at http://www.phagesdb.org.

Nucleotide sequence accession numbers.

GenBank accession numbers are shown in Table 1.

ACKNOWLEDGMENTS

We thank the many undergraduates and instructors who contributed to this phage discovery and genomics initiative. We also thank David J. Asai, Cheryl Bailey, Lucia P. Barker, William R. Bishai, Charles Bowman, Kevin W. Bradley, Darisa Clarke, Nell Eisenberg, Roger Hendrix, William R. Jacobs, Jr., Deborah Jacobs-Sera, Paras Jain, Victoria Kaspowicz, Michelle H. Larsen, Welkin H. Pope, Eric Rubin, and Daniel A. Russell for their assistance in phage isolation, genome analysis, and program administration. Phage genomes were sequenced at the Genomics and Proteomics Core Laboratories and at the Pittsburgh Bacteriophage Institute at the University of Pittsburgh, the David H. Murdock Research Institute, the Nucleic Acids Research Facilities at Virginia Commonwealth University, the Purdue University Genomics Core Facility, the University of California—Santa Cruz’s Genome Sequencing Center, and the UCLA Genotyping and Sequencing Core Facility.

This work was supported by Public Health Service grant GM93901 from the Institute of General Medical Sciences and by the Howard Hughes Medical Institute, including grant 52006944 to UCLA.

Footnotes

Citation Hatfull GF, Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science Program, KwaZulu-Natal Research Institute for Tuberculosis and HIV Mycobacterial Genetics Course, University of California—Los Angeles Research Immersion Laboratory in Virology, Phage Hunters Integrating Research Education Program. 2013. Complete genome sequences of 63 mycobacteriophages. Genome Announc. 1(6):e00847-13. doi:10.1128/genomeA.00847-13.

REFERENCES

  • 1. Hatfull GF, Hendrix RW. 2011. Bacteriophages and their genomes. Curr. Opin. Virol. 1:298–303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Hendrix RW. 2002. Bacteriophages: evolution of the majority. Theor. Popul. Biol. 61:471–480 [DOI] [PubMed] [Google Scholar]
  • 3. Suttle CA. 2007. Marine viruses—major players in the global ecosystem. Nat. Rev. Microbiol. 5:801–812 [DOI] [PubMed] [Google Scholar]
  • 4. Wommack KE, Colwell RR. 2000. Virioplankton: viruses in aquatic ecosystems. Microbiol. Mol. Biol. Rev. 64:69–114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Hatfull GF, Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science Program. KwaZulu-Natal Research Institute for Tuberculosis and HIV Mycobacterial Genetics Course Students. Phage Hunters Integrating Research and Education Program 2012. Complete genome sequences of 138 mycobacteriophages. J. Virol. 86:2382–2384 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Hatfull GF. 2012. The secret lives of mycobacteriophages. Adv. Virus Res. 82:179–288 [DOI] [PubMed] [Google Scholar]
  • 7. Hendrix RW, Smith MC, Burns RN, Ford ME, Hatfull GF. 1999. Evolutionary relationships among diverse bacteriophages and prophages: all the world’s a phage. Proc. Natl. Acad. Sci. U. S. A. 96:2192–2197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Pedulla ML, Ford ME, Houtz JM, Karthikeyan T, Wadsworth C, Lewis JA, Jacobs-Sera D, Falbo J, Gross J, Pannunzio NR, Brucker W, Kumar V, Kandasamy J, Keenan L, Bardarov S, Kriakov J, Lawrence JG, Jacobs WR, Hendrix RW, Hatfull GF. 2003. Origins of highly mosaic mycobacteriophage genomes. Cell 113:171–182 [DOI] [PubMed] [Google Scholar]
  • 9. Hatfull GF, Pedulla ML, Jacobs-Sera D, Cichon PM, Foley A, Ford ME, Gonda RM, Houtz JM, Hryckowian AJ, Kelchner VA, Namburi S, Pajcini KV, Popovich MG, Schleicher DT, Simanek BZ, Smith AL, Zdanowicz GM, Kumar V, Peebles CL, Jacobs WR, Jr, Lawrence JG, Hendrix RW. 2006. Exploring the mycobacteriophage metaproteome: phage genomics as an educational platform. PLoS Genet. 2:e92. 10.1371/journal.pgen.0020092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Stella EJ, Franceschelli JJ, Tasselli SE, Morbidoni HR. 2013. Analysis of novel mycobacteriophages indicates the existence of different strategies for phage inheritance in mycobacteria. PLoS One 8:e56384. 10.1371/journal.pone.0056384 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Pope WH, Jacobs-Sera D, Best AA, Broussard GW, Connerly PL, Dedrick RM, Kremer TA, Offner S, Ogiefo AH, Pizzorno MC, Rockenbach K, Russell DA, Stowe EL, Stukey J, Thibault SA, Conway JF, Hendrix RW, Hatfull GF. 2013. Cluster J mycobacteriophages: intron splicing in capsid and tail genes. PLoS One 8:e69273. 10.1371/journal.pone.0069273 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Fan X, Teng T, Wang H, Xie J. 2012. Biology of a novel mycobacteriophage, SWU1, isolated from Chinese soil as revealed by genomic characteristics. J. Virol. 86:10230–10231 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Mageeney C, Pope WH, Harrison M, Moran D, Cross T, Jacobs-Sera D, Hendrix RW, Dunbar D, Hatfull GF. 2012. Mycobacteriophage Marvin: a new singleton phage with an unusual genome organization. J. Virol. 86:4762–4775 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Hanauer DI, Jacobs-Sera D, Pedulla ML, Cresawn SG, Hendrix RW, Hatfull GF. 2006. Inquiry learning. Teaching scientific inquiry. Science 314:1880–1881 [DOI] [PubMed] [Google Scholar]
  • 15. Hatfull GF. 2010. Bacteriophage research: gateway to learning science. Microbe Mag. 5:243–250 [Google Scholar]
  • 16. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 27:4636–4641 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Borodovsky M, Lomsadze A. 2011. Gene identification in prokaryotic genomes, phages, metagenomes, and EST sequences with GeneMarkS suite. Curr. Protoc. Bioinformatics 35:4.5.1–4.5.17. 10.1002/0471250953.bi0405s35 [DOI] [PubMed] [Google Scholar]
  • 18. Borodovsky M, Mills R, Besemer J, Lomsadze A. 2003. Prokaryotic gene prediction using GeneMark and GeneMark.hmm. Curr. Protoc. Bioinformatics 1:4.5.1–4.5.16. 10.1002/0471250953.bi0405s01 [DOI] [PubMed] [Google Scholar]
  • 19. Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32:11–16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 33:W244–W248 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES