Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2019 Aug 8;8(32):e00660-19. doi: 10.1128/MRA.00660-19

Complete Genome Sequences of Cluster F Mycobacteriophages EleanorGeorge, Mattes, and Spikelee

Christina Jacob a, Yunia Alvarez a, Cynthia Castro a, Katherine Clarke a, Linda Dantico a, Bernadette J Connors a,
Editor: Simon Rouxb
PMCID: PMC6687927  PMID: 31395640

EleanorGeorge, Mattes, and Spikelee are mycobacteriophages isolated from different soil samples using Mycobacterium smegmatis mc2155 as the host. Each was obtained using direct isolation techniques, purified, and then sequenced. Based on sequence similarity, all three belong to the F1 subcluster and are temperate phages.

ABSTRACT

EleanorGeorge, Mattes, and Spikelee are mycobacteriophages isolated from different soil samples using Mycobacterium smegmatis mc2155 as the host. Each was obtained using direct isolation techniques, purified, and then sequenced. Based on sequence similarity, all three belong to the F1 subcluster and are temperate phages.

ANNOUNCEMENT

EleanorGeorge, Mattes, and Spikelee were directly isolated from soil using the actinobacterial host Mycobacterium smegmatis mc2155. Two rounds of purification were done, with all methods followed as described in the Howard Hughes Medical Institute (HHMI) Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) (1) Discovery Guide (https://seaphagesphagediscoveryguide.helpdocsonline.com/home). The plaques were lightly turbid, indicating that each of these three phages is temperate, capable of integrating the genome into that of its host until the conditions are right for it to transition to the lytic cycle. DNA was purified using the Wizard Promega DNA cleanup kit, and sequencing libraries were prepared from genomic DNA using an Ultra II FS kit with dual-indexed barcoding (New England Biolabs). Libraries were then pooled and run on an Illumina MiSeq instrument, yielding at least 670,000 single-end 150-base reads and at least 1,630-fold coverage for each genome. Assembly was completed using Newbler version 2.9, and for each genome, a single and complete phage contig was produced that was checked for accuracy and phage genomic termini using Consed version 29 (2). The genomes were annotated using DNA Master (3), while GLIMMER (4), GeneMark (5), and Starterator (http://seaphages.org/software/) were used to predict gene starts. ARAGORN and tRNAScan were used to detect tRNA (6, 7). Using predicted sequences derived from autoannotation through DNA Master, function was verified using BLASTp (BLASTDBv5) on the NCBI nr and actinobacteriophage databases (E value of 10E−7 or less), as well as the HHpred database (a probability value of 90% or better was sufficient evidence for function if the query aligned the full length of the subject match or if the alignment included the functional domain) (810). Function using synteny was determined using Phamerator (11). Phage genome features are listed in Table 1.

TABLE 1.

Mycobacteriophage genome details

Phage name GenBank accession no. SRA accession no. Genome length (bp) GC content (%) No. of CDSa Shotgun coverage (×)
EleanorGeorge MH669001 SRX6383939 59,482 61.7 108 4,015
Spikelee MH669014 SRX6383942 58,604 61.0 104 1,629
Mattes MH155871 SRX6383941 58,074 61.3 105 2,103
a

CDS, coding sequences.

The genomic diversity of mycobacteriophages is evidenced by their classification into 26 parent clusters (A to Z) (12), with clusters assigned using nucleotide similarity to other sequenced phage genomes (13). Cluster F has 168 unique phages among five subclusters. F1 contains the largest number of sequenced phages (157 in total). According to the gene content tool on PhagesDB (14), EleanorGeorge and Mattes share 64.8% nucleotide similarity but were only ∼48% similar to Spikelee. According to Phamerator, which classifies predicted proteins into “phamilies” of related sequences and arranges them into genome maps, there is greater similarity in the first half of the genome, with the iconic synteny of the major tail protein, tail assembly chaperone, tape measure protein, and minor tail proteins conserved among all three phages (15). The integrases of Mattes and EleanorGeorge showed significant similarity to one another according to Clustal W Omega multiple sequence alignment (16) (97.85% similarity between Mattes and EleanorGeorge using default settings for protein alignments; 22% similarity of Mattes and EleanorGeorge to Spikelee). Many of the genes in the second half of these genomes cannot be assigned functions, and this region is the source of the greatest variability on genome maps. Some notable exceptions are multiple genes encoding glycosyltransferase, galactosyltransferase, and methylase proteins. Continued exploration of cluster F phage genomes may elucidate additional novel genes in the second half of the genomes.

Data availability.

The genome sequences reported here have been deposited in GenBank under the accession numbers listed in Table 1 (BioProject number PRJNA488469). The SRA numbers are also provided in Table 1.

ACKNOWLEDGMENTS

We thank the Dominican College Science Department and acknowledge Siobhan O’Sullivan for her invaluable assistance. We thank Graham Hatfull, Welkin Pope, Deborah Jacobs-Sera, Daniel Russell, and Rebecca Garlena for their technical support during the sequencing and annotation of these genomes. Support for this research was provided by Dominican College and by the Howard Hughes Medical Institute SEA-PHAGES program.

REFERENCES

  • 1.Jordan TC, Burnett SH, Carson S, Caruso SM, Clase K, DeJong RJ, Dennehy JJ, Denver DR, Dunbar D, Elgin SC, Findley AM, Gissendanner CR, Golebiewska UP, Guild N, Grant AH, Grillo WH, Hollowell GP, Hughes LE, Johnson A, King RA, Lewis LO, Li W, Rosenzweig F, Rubin MR, Saha MS, Sandoz J, Shaffer CD, Taylor B, Temple L, Vazquez E, Ware VC, Barker LP, Bradley KW, Jacobs-Sera D, Pope WH, Russell DA, Cresawn SG, Lopatto D, Bailey CP, Hatfull GF. 2014. A broadly implementable research course in phage discovery and genomics for first-year undergraduate students. mBio 5:e01051-13. doi: 10.1128/mBio.01051-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Russell DA. 2018. Sequencing, assembling, and finishing complete bacteriophage genomes. Methods Mol Biol 1681:109–125. doi: 10.1007/978-1-4939-7343-9_9. [DOI] [PubMed] [Google Scholar]
  • 3.Lawrence JG. 2007. DNA Master. http://cobamide2.bio.pitt.edu/computer.htm.
  • 4.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with GLIMMER. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Besemer J, Borodovsky M. 2005. GeneMark: Web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Russell DA, Hatfull GF. 2017. PhagesDB: the actinobacteriophage database. Bioinformatics 33:784–786. doi: 10.1093/bioinformatics/btw711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hatfull GF. 2012. The secret lives of mycobacteriophages, p 179–288. In Advances in virus research, vol 82 Academic Press, Cambridge, MA. [DOI] [PubMed] [Google Scholar]
  • 13.Hatfull GF, Jacobs-Sera D, Lawrence JG, Pope WH, Russell DA, Ko C-C, Weber RJ, Patel MC, Germane KL, Edgar RH, Hoyte NN, Bowman CA, Tantoco AT, Paladin EC, Myers MS, Smith AL, Grace MS, Pham TT, O'Brien MB, Vogelsberger AM, Hryckowian AJ, Wynalek JL, Donis-Keller H, Bogel MW, Peebles CL, Cresawn SG, Hendrix RW. 2010. Comparative genomic analysis of 60 mycobacteriophage genomes: genome clustering, gene acquisition, and gene size. J Mol Biol 397:119–143. doi: 10.1016/j.jmb.2010.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Mavrich TN, Hatfull GF. 2017. Bacteriophage evolution differs by host, lifestyle and genome. Nat Microbiol 2:17112. doi: 10.1038/nmicrobiol.2017.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Hatfull GF. 2008. Bacteriophage genomics. Curr Opin Microbiol 11:447–453. doi: 10.1016/j.mib.2008.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG. 2014. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The genome sequences reported here have been deposited in GenBank under the accession numbers listed in Table 1 (BioProject number PRJNA488469). The SRA numbers are also provided in Table 1.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES