ABSTRACT
OKCentral2016 is the first mycobacteriophage sequenced from Oklahoma soil using the bacterial host Mycobacterium smegmatis strain mc2155. OKCentral2016 has a double-stranded DNA genome composed of 50,072 bp, with 84 predicted coding genes and 1 tRNA sequence. This mycobacteriophage has sequence similarities to members of the A10 subcluster.
GENOME ANNOUNCEMENT
OKCentral2016 was isolated from enriched topsoil at the University of Central Oklahoma (35.653371 N, 97.474118 W) using the bacterial host Mycobacterium smegmatis mc2155. The plaques produced by this phage were clear and had a diameter of approximately 4 mm after 24 h of incubation. High-titer phage lysate was obtained by using polyethylene glycol 8000 and the NaCl salt precipitation method (1, 2). Electron microscopy revealed a Siphoviridae morphotype, consisting of an icosahedral head that was approximately 60 nm in diameter and a tail that was approximately 120 nm in length. The genome was sequenced using Illumina (Solexa) next-generation sequencing with the MiSeq platform. The genome was assembled using Newbler and Consed software. OKCentral2016 has a genome that was 50,072 bases, with a GC content of 65.1%.
The genome of OKCentral2016 was annotated using DNA Master (http://cobamide2.bio.pitt.edu/). Protein-coding regions of the genome were detected using Glimmer (3) and GeneMark (4). Starterator (https://seaphages.org/software/) was used to evaluate gene start codons. Gene function was determined using Phamerator (5) and BLASTn/BLASTp on PhagesDB.org. ARAGORN (6) and tRNAscan-SE (7) were used to analyze tRNA sequences within the genome. OKCentral2016 was shown to possess 84 protein-coding genes and 1 tRNA (tRNATrp) sequence that was 76 bases in length. There were two programmed frame shifts that were observed. We also detected programmed frameshifts in the tail assembly chaperone and in the primase gene.
OKCentral2016 is the first mycobacteriophage sequenced from Oklahoma soil. The OKCentral2016 genome contains 3′ cohesive ends, which are 10 bases in length. The 3′ overhang sequence was shown to be 5′-CGGCCGGTAA-3′. The GC contents of the host bacterium, M. smegmatis mc2155, and OKCentral2016 were similar, at 67.4% and 65.1%, respectively (8). OKCentral2016 was shown to align most with two other A10 subcluster phages, Chupacabra and Goose, by using a BLASTn search on PhagesDB.org with an expected value of 0.0 and a minimum identity value of 99%. OKCentral2016 shared a strong syntenic region coding for lysin A, lysin B, terminase, portal, capsid maturation protease, scaffolding, and major capsid subunit proteins with Goose and Twister, other A10 subcluster mycobacteriophages.
Accession number(s).
The genome sequence of OKCentral2016 is available from GenBank under the accession no. MF773750.
ACKNOWLEDGMENTS
We thank Deborah Jacobs-Sera and Welkin H. Pope for checking and editing our annotations and for comments on the manuscript. We thank the Laboratory for Molecular Biology and Cytometry Research at OUHSC for the use of the Core Facility, which provided sequencing services. We are grateful to Ben Fowler at the Imaging Core Facility associated with the Oklahoma Medical Research Foundation for electron microscopy.
We also thank the University of Central Oklahoma, Office of Research and Sponsored Programs, for an RCSA grant and the OK-INBRE SURE program for providing funding for this project.
Footnotes
Citation Patton CJ, Kotturi H. 2018. Genomic sequence of mycobacteriophage OKCentral2016. Genome Announc 6:e01208-17. https://doi.org/10.1128/genomeA.01208-17.
REFERENCES
- 1.Colombet J, Sime-Ngando T. 2012. Use of PEG, polyethylene glycol, to characterize the diversity of environmental viruses, p 316–322. In Mendez-Vilas A. (ed), Current microscopy contributions to advances in science and technology, Formatex Research Center, Badajoz, Spain. [Google Scholar]
- 2.Yamamoto KR, Alberts BM, Benzinger R, Lawhorne L, Treiber G. 1970. Rapid bacteriophage sedimentation in the presence of polyethylene glycol and its application to large-scale virus purification. Virology 40:734–744. doi: 10.1016/0042-6822(70)90218-7. [DOI] [PubMed] [Google Scholar]
- 3.Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with Glimmer. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Besemer J, Borodovsky M. 2005. GeneMark: Web software for gene finding in prokaryotes, eukaryotes, and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mohan A, Padiadpu J, Baloni P, Chandra N. 2015. Complete genome sequences of Mycobacterium smegmatis laboratory strain (MC2 155) and isoniazid-resistant (4XR1/R2) mutant strains. Genome Announc 3:e01520-14. doi: 10.1128/genomeA.01520-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
