Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2016 Jul 7;4(4):e00389-16. doi: 10.1128/genomeA.00389-16

Complete Genome Sequences of 61 Mycobacteriophages

Graham F Hatfull a,; Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) Programb; KwaZulu-Natal Research Institute for Tuberculosis and HIV (K-RITH) Mycobacterial Genetics Coursea,c,d; University of California–Los Angeles Research Immersion Laboratory in Virologye; Phage Hunters Integrating Research and Education (PHIRE) Programa
PMCID: PMC4939774  PMID: 27389257

Abstract

Mycobacteriophages—viruses of mycobacteria—provide insights into viral diversity and evolution as well as numerous tools for genetic dissection of Mycobacterium tuberculosis. Here we report the complete genome sequences of 61 mycobacteriophages newly isolated from environmental samples using Mycobacterium smegmatis mc2155 that expand our understanding of phage diversity.

GENOME ANNOUNCEMENT

Bacteriophages are the most numerous biological entities on the planet, with a global population of 1031 particles. With an estimated 1023 productive infections per second worldwide, the population is vast, dynamic, and genetically diverse (14). As of March 2016, the National Center for Biotechnology Information (NCBI) lists 1,757 Caudovirales genomes, 318 of which infect Mycobacterium hosts. Previous comparative analyses of mycobacteriophages revealed substantial diversity and mosaic architectures resulting from nonhomologous recombination. Integrated research-education programs such as Phage Hunters Integrating Research and Education (PHIRE) (5), Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) (6), the Mycobacterial Genetics Course at the University of Kwazulu-Natal (K-RITH), and the University of California–Los Angeles’s Research Immersion Laboratory in Virology, isolated, sequenced, and annotated the phages reported here (Table 1) using M. smegmatis as a host.

TABLE 1 .

Newly sequenced mycobacteriophage genomes

Phage name Cluster Genome (bp) G+C content (%) GenBank accession no. Finding/annotating institution
Alvin A1 49,577 63.5 KP027205 University of Pittsburgha
ArcherNM A2 52,561 64.2 KU761559 Washington State University, University of Floridab
Artemis2UCLA A6 52,344 61.4 KF560333 University of California Los Angelesd
Bactobuster A2 52,129 63.1 KU568494 North Carolina A&T State Universityb
Bernardo B3 68,196 67.4 KF493879 University of California–Los Angelesd
Bipper Y 77,832 67.3 KU728633 University of Pittsburgh, Florida Gulf Coast Universityb
Bricole M1 81,128 61.6 KT591491 Old Dominion Universityb
Bruin E 74,210 63.0 KF562099 University of California–Los Angelesd
Brusacoram P 47,618 67.0 KT347313 College of St. Scholasticab
Carcharodon N 43,680 66.2 KM588359 Jacksonville State Universityb
Cedasite G1 41,901 66.6 KT355472 Morehouse Collegeb
Chandler B3 69,450 67.5 KP027207 University of Pittsburgha
CloudWang3 A6 52,873 61.4 KF560332 University of California–Los Angelesd
Conspiracy A5 50,755 60.6 KF560330 University of California–Los Angelesd
Cosmo V 78,229 56.8 KP027195 University of KwaZulu-Natalc
Eidsmoe A9 52,946 62.5 KU716094 Illinois Wesleyan Universityb
Enkosi K1 59,052 67.2 KT281789 University of KwaZulu-Natalc
Glass B2 67,509 69.0 KT880194 Hope Collegeb
Graduation A1 52,823 63.5 KF560331 University of California–Los Angelesd
HanShotFirst A1 52,390 63.8 KF493880 University of California–Los Angelesd
HufflyPuff E 76,323 63.0 KF562100 University of California–Los Angelesd
HyRo C1 153,714 64.7 KT281790 University of KwaZulu-Natalc
Iracema64 A4 51,637 64.0 KU055616 La Salle Universityb
JAMaL B4 70,841 68.8 KF493881 University of California–Los Angelesd
JenCasNa A3 50,877 64.0 KU255188 Howard Hughes Medical Instituteb
Jovo A5 51,319 60.8 KF493882 University of California–Los Angelesd
Kimberlium F1 56,826 61.4 KR935214 Gettysburg Collegeb
LadyBird A2 53,141 63.5 KT588442 St. Edward’s Universityb
Lolly9 L3 75,816 59.3 KT281791 University of KwaZulu-Natalc
Lumos L3 75,586 59.3 KT372003 Indian River State Collegeb
MichelleMyBell N 42,240 66.0 KF986246 Nyack Collegeb
Mosby E 74,533 63.1 KF493883 University of California–Los Angelesd
Nala E 75,894 63.1 KF562101 University of California–Los Angelesd
NaSiaTalie A2 52,920 63.4 KU297783 Howard Hughes Medical Instituteb
Numberten B1 68,607 66.5 KJ194583 University of Pittsburgha
Panchino N 43,516 65.9 KU935727 Lincoln Universityb
Phamished B1 68,515 66.5 KR816508 Gettysburg Collegeb
PhatBacter E 76,217 63.0 KF562102 University of California–Los Angelesd
Phatniss F1 57,293 61.3 KT279576 Johns Hopkins Universityb
Phrann N 44,872 66.3 KU935731 Southern Connecticut State Universityb
Pioneer A9 53,219 62.6 KT285706 Indian River State Collegeb
Pipsqueaks N 43,679 66.3 KU935730 College of Charlestonb
PopTart F1 55,094 61.6 KT281792 University of KwaZulu-Natalc
Potter B1 68,327 66.5 KU867907 University of Kansasb
Romney A4 51,370 63.9 KU867906 Seton Hill Universityb
Sbash I2 55,832 65.6 KP027201 University of KwaZulu-Natalc
Seabiscuit A1 51,781 63.7 KJ194585 University of Pittsburgha
Seagreen F1 57,766 61.8 KT281793 University of KwaZulu-Natalc
SkinnyPete N 43,478 66.4 KU935729 Virginia Commonwealth Universityb
Snenia L3 75,626 59.3 KT281794 University of KwaZulu-Natalc
Sparkdehlily F1 56,275 61.2 KT895280 James Madison Universityb
Tasp14 A1 51,409 63.9 KT326768 Ohio State Universityb
Texage A3 50,081 64.0 KT326767 Merrimack Collegeb
TheloniousMonk A1 52,055 63.6 KT363731 Western Kentucky Universityb
Tres B2 67,349 68.9 KT365402 James Madison Universityb
Wooldri A3 50,797 64.0 KT381277 Washington State University, University of Floridab
Xeno N 42,395 66.8 KU935728 Southern Connecticut State Universityb
Xerxes N 43,698 66.3 KU935726 University of Floridab
XFactor F1 55,617 61.7 KT281795 University of KwaZulu-Natalc
Zaka A6 52,122 61.5 KF560334 University of California–Los Angelesd
Zakhe101 O 69,653 65.5 KT281796 University of KwaZulu-Natalc
a

Phage Hunters Integrating Research and Education (PHIRE) Program, University of Pittsburgh.

b

Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES).

c

K-RITH Mycobacterial Genetics Course.

d

University of California–Los Angeles, Research Immersion Laboratory in Virology.

Phages were isolated by direct plating of filtered soil extracts or from enriched cultures, followed by plaque purification. Electron microscopy shows that 60 have siphoviral morphotypes, and HyRo is the sole member of the Myoviridae. Most have isometric capsids, the exceptions being Bipper, Sbash, and Zakhe101 with prolate heads. Genomic DNA was extracted from high titer lysates, sheared, and sequenced at the University of Pittsburgh, University of California–Los Angeles, the DOE Joint Genome Institute, or Virginia Commonwealth University using either Sanger, Illumina, Ion Torrent, or 454 technology. Sequence reads were assembled using Newbler (Roche) and Consed (7) and coverage depths range from 47-fold to 2,308-fold, with an average of 200-fold. Sequence assemblies revealed discrete genome ends for 52 phages, and the 9 with circularly permuted assemblies were bioinformatically linearized such that base one was assigned in accord with other mycobacteriophages. Genomes were annotated using DNA Master (http://cobamide2.bio.pitt.edu), Phamerator (8), Glimmer (9), GeneMark (10), Aragorn (11), and tRNAscanSE (12), and functions were determined using the public databases GenBank, Protein DataBase, pfamA, and phagesdb.org with BLAST (13), and HHPred (14). Genomes were assigned to clusters or subclusters as described previously (15).

Notwithstanding the large extant collection of sequenced mycobacteriophage genomes, these newly sequenced phages considerably expand our understanding of mycobacteriophage diversity. Twenty-two are members of the largest cluster, cluster A, but span 7 of the 15 subclusters. The others are broadly distributed across other clusters, including B, C, E, F, G, I, K, L, M, N, O, and P. Cosmo has substantial nucleotide sequence similarity to the singleton phage Wildcat, forming the new cluster V. The eight cluster N phages, Cedasite (G1), and Brusacoram (P) are notable in that they contain integration-dependent immunity systems in which the phage attachment site (attP) is located within the repressor gene (16).

As is typical of other sequenced phage genomes, functions can be assigned to only ~25% of the predicted genes, primarily those involved in virion capsid and assembly and well-conserved genes associated with DNA metabolism. Two of the cluster A genomes (Eidsmoe, ArcherNM) contain partitioning systems in place of integration cassettes; several genomes (e.g., Phrann, Xeno) encode toxin-antitoxin systems; and three encode Lsr2 homologs (Lolly9, Lumos, and Snenia).

Nucleotide sequence accession numbers.

Nucleotide sequence accession numbers for all phages are shown in Table 1.

ACKNOWLEDGMENTS

We thank the many undergraduates and instructors who contributed to the phage discovery and genomics initiative. We especially thank David J. Asai, William Biederman, Charles Bowman, Kevin W. Bradley, Steven Cresawn, Rebecca Garlena, Roger Hendrix, William R. Jacobs, Jr., Deborah Jacobs-Sera, Paras Jain, Victoria Kaspowicz, Michelle H. Larsen, Travis Mavrich, Welkin H. Pope, Eric Rubin, Daniel A. Russell, and Viknesh Sivanathan for their assistance in phage isolation, genome analysis, and program administration.

Footnotes

Citation Hatfull GF, Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) Program, KwaZulu-Natal Research Institute for Tuberculosis and HIV (K-RITH) Mycobacterial Genetics Course, University of California–Los Angeles Research Immersion Laboratory in Virology, Phage Hunters Integrating Research and Education (PHIRE) Program. 2016. Complete genome sequences of 61 mycobacteriophages. Genome Announc 4(4):e00389-16. doi:10.1128/genomeA.00389-16.

REFERENCES

  • 1.Suttle CA. 2007. Marine viruses—major players in the global ecosystem. Nat Rev Microbiol 5:801–812. doi: 10.1038/nrmicro1750. [DOI] [PubMed] [Google Scholar]
  • 2.Pedulla ML, Ford ME, Houtz JM, Karthikeyan T, Wadsworth C, Lewis JA, Jacobs-Sera D, Falbo J, Gross J, Pannunzio NR, Brucker W, Kumar V, Kandasamy J, Keenan L, Bardarov S, Kriakov J, Lawrence JG, Jacobs WR Jr, Hendrix RW, Hatfull GF. 2003. Origins of highly mosaic mycobacteriophage genomes. Cell 113:171–182. doi: 10.1016/S0092-8674(03)00233-2. [DOI] [PubMed] [Google Scholar]
  • 3.Krupovic M, Bamford DH. 2010. Order to the viral universe. J Virol 84:12476–12479. doi: 10.1128/JVI.01489-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hendrix RW, Smith MC, Burns RN, Ford ME, Hatfull GF. 1999. Evolutionary relationships among diverse bacteriophages and prophages: all the world’s a phage. Proc Natl Acad Sci USA 96:2192–2197. doi: 10.1073/pnas.96.5.2192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hatfull GF. 2010. Bacteriophage research: gateway to learning science. Microbe 5:243–250. [Google Scholar]
  • 6.Jordan TC, Burnett SH, Carson S, Caruso SM, Clase K, DeJong RJ, Dennehy JJ, Denver DR, Dunbar D, Elgin SC, Findley AM, Gissendanner CR, Golebiewska UP, Guild N, Hartzog GA, Grillo WH, Hollowell GP, Hughes LE, Johnson A, King RA, Lewis LO, Li W, Rosenzweig F, Rubin MR, Saha MS, Sandoz J, Shaffer CD, Taylor B, Temple L, Vazquez E, Ware VC, Barker LP, Bradley KW, Jacobs-Sera D, Pope WH, Russell DA, Cresawn SG, Lopatto D, Bailey CP, Hatfull GF. 2014. A broadly implementable research course in phage discovery and genomics for first-year undergraduate students. MBio 5:e01051-13. doi: 10.1128/mBio.01051-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gordon D, Green P. 2013. Consed: a graphical editor for next-generation sequencing. Bioinformatics 29:2936–2937. doi: 10.1093/bioinformatics/btt515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Besemer J, Borodovsky M. 2005. GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ackermann KH, Adams N, Adler C, Ahammed Z, Ahmad S, Allgower C, Amsbaugh J, Anderson M, Anderssen E, Arnesen H, Arnold L, Averichev GS, Baldwin A, Balewski J, Barannikova O, Barnby LS, Baudot J, Beddo M, Bekele S, Belaga VV, Bellwied R, Bennett S, Bercovitz J, Berger J, Betts W, Bichsel H, Bieser F, Bland LC, Bloomer M, Blyth CO, Boehm J, Bonner BE, Bonnet D, Bossingham R, Botlo M, Boucham A, Bouillo N, Bouvier S, Bradley K, Brady FP, Braithwaite ES, Braithwaite W, Brandin A, Brown RL, Brugalette G, Byrd C, Caines H, Calderon de la Barca Sanchez M, Cardenas A, Carr L. 2001. Elliptic flow in Au+Au collisions at square root(S)NN = 130 GeV. Phys Rev Lett 86:402–407. doi: 10.1103/PhysRevLett.86.402. [DOI] [PubMed] [Google Scholar]
  • 13.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 14.Remmert M, Biegert A, Hauser A, Söding J. 2012. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods 9:173–175. doi: 10.1038/nmeth.1818. [DOI] [PubMed] [Google Scholar]
  • 15.Hatfull GF, Jacobs-Sera D, Lawrence JG, Pope WH, Russell DA, Ko CC, Weber RJ, Patel MC, Germane KL, Edgar RH, Hoyte NN, Bowman CA, Tantoco AT, Paladin EC, Myers MS, Smith AL, Grace MS, Pham TT, O’Brien MB, Vogelsberger AM, Hryckowian AJ, Wynalek JL, Donis-Keller H, Bogel MW, Peebles CL, Cresawn SG, Hendrix RW. 2010. Comparative genomic analysis of 60 mycobacteriophage genomes: genome clustering, gene acquisition, and gene size. J Mol Biol 397:119–143. doi: 10.1016/j.jmb.2010.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Broussard GW, Oldfield LM, Villanueva VM, Lunt BL, Shine EE, Hatfull GF. 2013. Integration-dependent bacteriophage immunity provides insights into the evolution of genetic switches. Mol Cell 49:237–248. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES