ABSTRACT
We report the genome sequence of bacteriophage NathanVaag, an actinobacteriophage isolated from soil in El Paso, Texas, that infects Arthrobacter sp. strain ATCC 21022. The 49,645-bp genome contains 73 predicted protein-coding genes. Based on gene content similarity to phages in the Actinobacteriophage Database, NathanVaag is assigned to phage cluster AO1.
ANNOUNCEMENT
Understanding the molecular evolution of bacteriophages is critical to finding novel medical solutions for antibiotic resistance in bacteria, as well as for diverse applications in agricultural and biotechnological settings. Here, we present the genome sequence of a novel actinobacteriophage, NathanVaag, that infects the soil bacterium Arthrobacter sp. strain ATCC 21022 and that represents an additional sequenced actinobacteriophage genome isolated in Texas through the Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) program at the University of Texas at El Paso (UTEP) (El Paso, Texas) and annotated through the SEA-PHAGES program at the University of Hawai`i at Mānoa (Honolulu, Hawai`i) (1, 2).
NathanVaag was isolated and purified in 2019 from a surface soil sample collected at UTEP (coordinates: 31.768794N, 106.505322W) using standard methods (3). Briefly, the soil sample was washed in peptone-yeast extract-calcium (PYCa) medium, and the filtered wash (0.22 μm) was inoculated with Arthrobacter sp. strain ATCC 21022. After incubation with shaking for 48 h at 28°C, the mixture was refiltered and plated in top agar with Arthrobacter sp. A single clear plaque was selected and isolated by enrichment following multiple rounds of plaque purification by plating in top agar with Arthrobacter sp. at 28°C. After phage particle purification, a high-titer lysate was generated from webbed plates, and genomic DNA was extracted using a Wizard DNA extraction kit (Promega). Genomic DNA was prepared for sequencing using the NEBNext Ultra II library kit and sequenced on an Illumina MiSeq instrument (v3 reagents), resulting in 167-fold coverage from 61,215 single-end 150-bp reads. Raw reads were assembled in Newbler v2.9 (Roche) using default parameters, resulting in a single genomic contig. Genome completeness and accuracy and phage genomic termini were verified using Consed v29 as described previously (4–6). The complete genome sequence of NathanVaag was circularly permuted and is 49,645 bp in length, with a G+C content of 63.6%.
The genome was autoannotated using DNA Master v5.23.6 build 2701 (http://cobamide2.bio.pitt.edu/computer.htm) embedded with Glimmer v3 (7) and GeneMark (8). Start sites from the autoannotation were manually refined using the Phage Evidence Collection and Annotation Network (PECAAN) (http://pecaan.kbrinsgd.org), Phamerator (9), and Starterator (http://phages.wustl.edu/starterator). Potential functions for predicted protein-coding genes were assigned based on top hits for searches using NCBI BLASTp (nonredundant database) (10) and HHpred (PDB mmcif70, NCBI conserved domain, SCOPe70, and Pfam databases) (11), and putative membrane proteins were identified using TMHMM v2.0 (https://services.healthtech.dtu.dk/service.php?DeepTMHMM). Default parameters were used for all software. NathanVaag is predicted to have 73 protein-coding genes, of which 36 genes (49%) have a putative function assigned and the remaining 37 genes (51%) encode hypothetical proteins with unknown functions. No tRNAs were identified using ARAGORN v1.2.38 (12) and tRNAscan-SE v2.0 (13). Of the 73 genes, all except the rightmost six genes are transcribed rightward. Based on gene content similarity of at least 35% to phages within the Actinobacteriophage Database (PhagesDB) (14), NathanVaag can be assigned to the AO1 subcluster of cluster AO actinobacteriophages (15). Similar to other AO1 phages, NathanVaag contains several genes with putative DNA cleavage functions (e.g., RecE-like exonuclease and RusA-like resolvase) and is predicted to be a lytic phage with no identifiable immunity repressor or integrase functions.
Data availability.
The complete genome sequence of the actinobacteriophage NathanVaag has been deposited in GenBank with accession number ON970601, BioProject accession number PRJNA488469, and SRA accession number SRX14483220.
ACKNOWLEDGMENTS
This project was generously supported by the Howard Hughes Medical Institute SEA-PHAGES program and the School of Life Sciences at the University of Hawaii at Mānoa (UHM).
We thank the SEA-PHAGES students and instructors at UTEP who isolated and discovered the bacteriophage NathanVaag. We thank Graham F. Hatfull, Welkin H. Pope, Deborah Jacobs-Sera, Daniel A. Russell, and Rebecca A. Garlena for their continued technical support during the sequencing and annotation of this genome.
Contributor Information
Rebecca A. Chong, Email: rachong@hawaii.edu.
Kenneth M. Stedman, Portland State University
REFERENCES
- 1.Jordan TC, Burnett SH, Carson S, Caruso SM, Clase K, DeJong RJ, Dennehy JJ, Denver DR, Dunbar D, Elgin SCR, Findley AM, Gissendanner CR, Golebiewska UP, Guild N, Hartzog GA, Grillo WH, Hollowell GP, Hughes LE, Johnson A, King RA, Lewis LO, Li W, Rosenzweig F, Rubin MR, Saha MS, Sandoz J, Shaffer CD, Taylor B, Temple L, Vazquez E, Ware VC, Barker LP, Bradley KW, Jacobs-Sera D, Pope WH, Russell DA, Cresawn SG, Lopatto D, Bailey CP, Hatfull GF. 2014. A broadly implementable research course in phage discovery and genomics for first-year undergraduate students. mBio 5:e01051-13. doi: 10.1128/mBio.01051-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hanauer DI, Graham MJ, SEA-PHAGES , Betancur L, Bobrownicki A, Cresawn SG, Garlena RA, Jacobs-Sera D, Kaufmann N, Pope WH, Russell DA, Jacobs WR, Sivanathan V, Asai DJ, Hatfull GF. 2017. An inclusive Research Education Community (iREC): impact of the SEA-PHAGES program on research outcomes and student learning. Proc Natl Acad Sci USA 114:13531–13536. doi: 10.1073/pnas.1718188115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Poxleitner M, Pope WH, Jacobs-Sera D, Sivanathan V, Hatfull GF. 2018. Phage discovery guide. Howard Hughes Medical Institute, Chevy Chase, MD. [Google Scholar]
- 4.Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen Y-J, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Ho CH, Irzyk GP, Jando SC, Alenquer MLI, Jarvie TP, Jirage KB, Kim J-B, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, et al. 2005. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437:376–380. doi: 10.1038/nature03959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gordon D, Green P. 2013. Consed: a graphical editor for next-generation sequencing. Bioinformatics 29:2936–2937. doi: 10.1093/bioinformatics/btt515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Russell DA. 2018. Sequencing, assembling, and finishing complete bacteriophage genomes, p 109–125. In Clokie MRJ, Kropinski AM, Lavigne R (ed), Bacteriophages: methods and protocols, vol 3. Springer, New York, NY. [DOI] [PubMed] [Google Scholar]
- 7.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Besemer J, Borodovsky M. 2005. GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 11.Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chan PP, Lin BY, Mak AJ, Lowe TM. 2021. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res 49:9077–9096. doi: 10.1093/nar/gkab688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Russell DA, Hatfull GF. 2017. PhagesDB: the Actinobacteriophage Database. Bioinformatics 33:784–786. doi: 10.1093/bioinformatics/btw711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Pope WH, Mavrich TN, Garlena RA, Guerrero-Bustamante CA, Jacobs-Sera D, Montgomery MT, Russell DA, Warner MH, Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) , Hatfull GF. 2017. Bacteriophages of Gordonia spp. display a spectrum of diversity and genetic relationships. mBio 8:e01069-17. doi: 10.1128/mBio.01069-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The complete genome sequence of the actinobacteriophage NathanVaag has been deposited in GenBank with accession number ON970601, BioProject accession number PRJNA488469, and SRA accession number SRX14483220.