Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2018 Nov 1;7(17):e01182-18. doi: 10.1128/MRA.01182-18

Complete Genome Sequences of Mycobacteriophages Kwksand96 and Cane17

Kayla M Fast a,, Tracy W Keener a, Shane Castleberry a, Kenny D Jones II a, John D Larrimore a, Caitlin A Long a, Jon T Newburn a,*, Nicholas C Pritchett a, P Kiersten Schellhammer a, Michael W Sandel a
Editor: Irene L G Newtonb
PMCID: PMC6256495  PMID: 30533760

Bacteriophages Kwksand96 and Cane17 were isolated from Mycobacterium smegmatis mc2155. M. smegmatis is host to the highest number of phages analyzed from one species.

ABSTRACT

Bacteriophages Kwksand96 and Cane17 were isolated from Mycobacterium smegmatis mc2155. M. smegmatis is host to the highest number of phages analyzed from one species. Both mycobacteriophages were isolated from soil in west Alabama. Kwksand96 and Cane17 belong to subclusters B1 and C1, respectively, based on mycobacteriophage nucleotide sequence similarity.

ANNOUNCEMENT

We report here the discovery and genome annotation of two mycobacteriophages from the Black Belt Geological Region of west Alabama. Cane17 was discovered on a farm in Emelle, Alabama (32.698678 N, 88.267897 W), and Kwksand96 was found on the University of West Alabama (UWA) campus (32.594337 N, 88.186615 W) in Livingston, Alabama. These phages were isolated and characterized by UWA students participating in the Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) program.

The phages were isolated from the soil using the nonpathogenic host bacterium Mycobacterium smegmatis mc2155. The genus Mycobacterium also contains pathogenic species that cause tuberculosis, leprosy, and Buruli ulcer. Isolation of Kwksand96 was performed in 7H9 medium directly from the soil sample at 37°C. Conversely, 7H9 medium was enriched with the host bacterium when Cane17 was discovered. Imaging using a transmission electron microscope revealed that Kwksand96 is a Siphoviridae phage with a long tail; Cane17 possesses a relatively large head and short tail consistent with the Myoviridae family.

We extracted DNA from both phages using the Wizard DNA cleanup system (Promega, Madison, WI) in a modified protocol (1). Genomic DNA libraries were generated using an New England Biolabs (NEB) Ultra II FS kit (New England Biolabs, Ipswich, MA) with dual-indexed barcoding. Pooled libraries were run on an Illumina MiSeq system at the Pittsburgh Bacteriophage Institute at the University of Pittsburgh, which yielded at least 200,000 single-end 150-base reads for each genome. Read coverage depth was 861× and 188× for Kwksand96 and Cane17, respectively. We assembled these reads using Newbler v. 2.9 with default settings (2). Newbler produced a single-phage contig, which we checked for completeness, accuracy, and phage genomic termini with Consed v. 2.0 (3). No large buildups of reads or coverage variation were observed in either genome. We thus concluded that these genomes were circularly permuted. The first base of the genome was chosen to match similar, previously published phage sequences. Sequence data place Kwksand96 in the B1 subcluster and Cane17 in C1. Mycobacteriophage clusters were determined using whole-genome analysis, which places similar phages in the same clusters according to nucleotide sequence similarity and gene placement (4). The Kwksand96 genome is 68,882 bp long and has a G+C content of 66.4% and 103 protein-coding genes. The Cane17 genome is 160,330 bp long and has a G+C content of 64.7%, 223 protein-coding genes, 32 tRNAs, 1 transfer-messenger RNA (tmRNA), a programmed translational frameshift, and a wraparound gene (i.e., a gene that begins at the terminal end of the genome and wraps around to the beginning.).

Phage genomes were annotated in DNA Master v. 5.0.2 (http://cobamide2.bio.pitt.edu/). Starterator software was used to choose correct start codons for each gene (https://github.com/SEA-PHAGES/starterator). PhagesDB BLAST, NCBI BLAST, Phamerator (http://phamerator.org), and HHPred v. 3.0 were used to determine gene function (57). ARAGORN v. 1.2.38 and tRNAscan-SE v. 2.0 were used to detect and trim tRNAs (8, 9). Coding potential was determined with GeneMark v. 3.25 (10). We assigned functions to 25 of Kwksand96’s 103 protein-coding genes and 50 of Cane17’s 223.

Data availability.

The complete genome sequences of Kwksand96 and Cane17 are available from GenBank under the accession numbers MH513973 and MH697579, respectively. Raw Illumina reads for Kwksand96 and Cane17 are available on NCBI’s Sequence Read Archive under accession numbers SRX4732908 and SRX4732909, respectively.

ACKNOWLEDGMENTS

This project was supported by The Howard Hughes Medical Institute Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science program (www.seaphages.org) and The University of West Alabama 2016–2018 Phage Hunters.

We thank Daniel A. Russell and Rebecca A. Garlena for conducting genome sequencing, quality control, and assembly at the University of Pittsburgh. We thank Denise Monti for assisting with transmission electron microscopy at the High Resolution Imaging Facility at the University of Alabama at Birmingham.

REFERENCES

  • 1.Poxleitner M, Pope W, Jacobs-Sera D, Sivanathan V, Hatfull G. 2018. Phage discovery guide. Howard Hughes Medical Institute, Chevy Chase, MD. [Google Scholar]
  • 2.Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen Y-J, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer MLI, Jarvie TP, Jirage KB, Kim J-B, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM. 2005. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437:376–380. doi: 10.1038/nature03959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Gordon D, Abajian C, Green P. 1998. Consed: a graphical tool for sequence finishing. Genome Res 8:195–202. doi: 10.1101/gr.8.3.195. [DOI] [PubMed] [Google Scholar]
  • 4.Pope WH, Bowman CA, Russell DA, Jacobs-Sera D, Asai DJ, Cresawn SG, Jacobs WR Jr., Hendrix RW, Lawrence JG, Hatfull GF, Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science, Phage Hunters Integrating Research and Education, Mycobacterial Genetics Course. 2015. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity. Elife 4:e06416. doi: 10.7554/eLife.06416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Russell DA, Hatfull GF. 2017. PhagesDB: the Actinobacteriophage database. Bioinformatics 33:784–786. doi: 10.1093/bioinformatics/btw711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 7.Söding J, Biegert A, Lupas AN. 2005. The HHPred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:244–248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lowe TM, Chan PP. 2016. tRNAscan-SE on-line: search and contextual analysis of transfer RNA genes. Nucleic Acids Res 44:54–57. doi: 10.1093/nar/gkw413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Borodovsky M, McIninch J. 1993. Recognition of genes in DNA sequence with ambiguities. Biosystems 30:161–171. doi: 10.1016/0303-2647(93)90068-N. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The complete genome sequences of Kwksand96 and Cane17 are available from GenBank under the accession numbers MH513973 and MH697579, respectively. Raw Illumina reads for Kwksand96 and Cane17 are available on NCBI’s Sequence Read Archive under accession numbers SRX4732908 and SRX4732909, respectively.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES