Mycobacteriophage Arlo is a newly isolated Siphoviridae bacteriophage isolated from soil samples collected in Bluff Dale, Texas. Mycobacteriophage Arlo has a 52,960 base-pair double-stranded DNA genome that is predicted to contain 96 protein-coding genes.
ABSTRACT
Mycobacteriophage Arlo is a newly isolated Siphoviridae bacteriophage isolated from soil samples collected in Bluff Dale, Texas. Mycobacteriophage Arlo has a 52,960 base-pair double-stranded DNA genome that is predicted to contain 96 protein-coding genes. Mycobacteriophage Arlo is related to mycobacteriophage DD5 and other cluster A1 bacteriophages.
ANNOUNCEMENT
We report the whole-genome sequence of mycobacteriophage Arlo (1, 2), which was directly isolated from a strain of Mycobacterium smegmatis, a rapidly growing environmental species that is generally nonpathogenic but can act as an opportunistic pathogen in immunosuppressed individuals (3). Mycobacteriophage Arlo was isolated from compost-containing community vegetable garden soil samples collected in Bluff Dale, Texas (32°19′08.0004″, -098°01′14.9016″). Soil samples were washed with 7H9 liquid medium, and bacteriophages were extracted from the mixture through a 0.22-µm filter. For virus replication, filtered medium was incubated with Mycobacterium smegmatis mc2155 at 37°C for 48 h. Plaque assays of isolated mycobacteriophage Arlo resulted in medium-sized turbid plaques. Negative-staining transmission electron microscopy showed that mycobacteriophage Arlo has a siphoviral morphology with a 60-nm-diameter nonenveloped icosahedral capsid and a 125 nm flexible noncontractile tail, which is typical of viruses in the Caudovirales order (Fig. 1).
FIG 1.

Transmission electron microscopy (TEM) of mycobacteriophage Arlo. Purified high-titer lysate was placed on a carbon type-B 300 mesh grid, stained with uranyl acetate, and imaged with an FEI Tecnai G2 Spirit BioTWIN transmission electron microscope (NL1.160G). TEM micrographs of negatively stained mycobacteriophage Arlo show an approximately 60-nm-diameter nonenveloped icosahedral capsid and 125-nm flexible noncontractile tail. The morphology of mycobacteriophage Arlo corresponds to that of members of the Siphoviridae family.
DNA was isolated from purified bacteriophage with the Promega Wizard DNA clean-up kit, and sequencing libraries were prepared from genomic DNA with the NEBNext Ultra II kit. Libraries were sequenced with Illumina MiSeq at the Pittsburgh Bacteriophage Institute to approximately 1,987-fold coverage from 742,500 total reads of 150-base read length (4). Sequence reads were assembled with Newbler 2.9 with default settings to produce a single-bacteriophage contig, which was checked for completeness, accuracy, and genome termini using consed v29.0 (5). The virus was determined to contain a linear double-stranded DNA genome that is 52,960 base pairs in length, with 63.8% G+C content, and a 3′ single-stranded terminal overhang of 5′-CGGATGGTAA-3′. Whole-genome nucleotide alignment with NCBI BLASTn (https://blast.ncbi.nlm.nih.gov/) (6) showed 96–97% nucleotide identity to cluster A1 mycobacteriophages Oogway (GenBank accession number MH230878) and DD5 (GenBank accession number NC_011022) (2).
Autoannotation of the genome was performed using GLIMMER v3.02 (7, 8) and GeneMark v2.5p (9, 10), followed by manual inspection, refinement of start sites, and annotation revision using Phamerator (https://phamerator.org/) (11), DNA Master v5.23.2 (http://phagesdb.org/DNAMaster/), and PECAAN (https://pecaan.kbrinsgd.org/). Mycobacteriophage Arlo is predicted to contain 96 protein-coding genes. No tRNAs genes were identified by ARAGORN v1.2.38 (12) or tRNAscan-SE v2.0 (13). Start codon usage was determined to be 90.12% AUG, 8.72% GUG, and 1.16% UUG.
HHpred v3.0beta (14, 15) and NCBI BLASTp (6) software were used to assign putative functions to 34 (35.4%) of 96 predicted protein-coding genes. The mycobacteriophage Arlo genome is arranged with rightwards-transcribed genes (genes 1 to 36, 58.4% of genome) encoding virion structural and assembly proteins and a lysis cassette consisting of lysin A and lysin B genes. Leftwards-transcribed genes encode DNA polymerase I, metallophosphoesterase, DNA primase, DNA methylase, endonuclease VII, NrdH-like glutaredoxin, DnaB-like dsDNA helicase, RecB-like exonuclease/helicase, and immunity repressor proteins.
Data availability.
The mycobacteriophage Arlo genome is available at GenBank as accession number MH576971. Raw reads are available in the SRA under accession number SRX4721440.
ACKNOWLEDGMENTS
Support for this research was provided by Tarleton State University College of Science and Technology and by the Howard Hughes Medical Institute Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) research and education program.
We thank Graham Hatfull, Welkin Pope, Deborah Jacobs-Sera, Daniel Russell, Rebecca Garlena, Sally Molloy, Tamarah Adair, Phoebe Doss, and Keely Wilson for their technical support during the imaging of the virion and the isolation, sequencing, and annotation of this genome.
REFERENCES
- 1.Pope WH, Bowman CA, Russell DA, Jacobs-Sera D, Asai DJ, Cresawn SG, Jacobs WR Jr, Hendrix RW, Lawrence JG, Hatfull GF; Science Education Alliance Phage Hunters Advancing Genomics and Evolutionary Science, Phage Hunters Integrating Research and Education, Mycobacterial Genetics Course . 2015. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity. Elife 4:e06416. doi: 10.7554/eLife.06416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Russell DA, Hatfull GF. 2017. PhagesDB: the actinobacteriophage database. Bioinformatics 33:784–786. doi: 10.1093/bioinformatics/btw711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wallace RJ Jr, Nash DR, Tsukamura M, Blacklock ZM, Silcox VA. 1988. Human disease due to Mycobacterium smegmatis. J Infect Dis 158:52–59. doi: 10.1093/infdis/158.1.52. [DOI] [PubMed] [Google Scholar]
- 4.Russell DA. 2018. Sequencing, assembling, and finishing complete bacteriophage genomes, p 109–125. In Clokie MRJ, Kropinski AM, Lavigne R (ed), Bacteriophages: methods and protocols, vol 3 Springer New York, New York, NY. [DOI] [PubMed] [Google Scholar]
- 5.Gordon D, Green P. 2013. Consed: a graphical editor for next-generation sequencing. Bioinformatics 29:2936–2937. doi: 10.1093/bioinformatics/btt515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 7.Salzberg SL, Delcher AL, Kasif S, White O. 1998. Microbial gene identification using interpolated Markov models. Nucleic Acids Res 26:544–548. doi: 10.1093/nar/26.2.544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Borodovsky M, Mills R, Besemer J, Lomsadze A. 2003. Prokaryotic gene prediction using GeneMark and GeneMark.hmm. Curr Protoc Bioinformatics Chapter 4:Unit 4.5. doi: 10.1002/0471250953.bi0405s01. [DOI] [PubMed] [Google Scholar]
- 10.Besemer J, Borodovsky M. 2005. GeneMark: Web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lowe TM, Chan PP. 2016. tRNAscan-SE on-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res 44:W54–W57. doi: 10.1093/nar/gkw413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zimmermann L, Stephens A, Nam S-Z, Rau D, Kübler J, Lozajic M, Gabler F, Söding J, Lupas AN, Alva V. 2018. A completely reimplemented MPI Bioinformatics Toolkit with a new HHpred server at its core. J Mol Biol 430:2237–2243. doi: 10.1016/j.jmb.2017.12.007. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The mycobacteriophage Arlo genome is available at GenBank as accession number MH576971. Raw reads are available in the SRA under accession number SRX4721440.
