Rahel is a lytic Myoviridae bacteriophage that infects Mycobacterium smegmatis mc2155. It has 1,555,955 bp and 64.7% G+C content. Rahel has a circularly permuted genome with 270 genes, 53 of them of known function, 33 tRNAs, and 1 transfer-messenger RNA (tmRNA). Only five genes are coded on the reverse strand.
ABSTRACT
Rahel is a lytic Myoviridae bacteriophage that infects Mycobacterium smegmatis mc2155. It has 1,555,955 bp and 64.7% G+C content. Rahel has a circularly permuted genome with 270 genes, 53 of them of known function, 33 tRNAs, and 1 transfer-messenger RNA (tmRNA). Only five genes are coded on the reverse strand.
ANNOUNCEMENT
Rahel was isolated as part of the Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) at SUNY Old Westbury. Rahel was isolated in Hicksville, Nassau County, New York (lat 40.7679, long −73.5405). It is a C1 subcluster lytic phage that infects Mycobacterium smegmatis mc2155. Clusters are groups of phages with sequence similarity over 50% of their genome (1). It was isolated by enrichment and two cycles of purification and amplification in 7H9 top agar at 37°C (2). For transmission electron microscopy (TEM), phages were collected from a high-titer lysate by high-speed centrifugation, mounted on carbon-stabilized, Formvar-coated copper TEM grids stained with uranyl acetate, and imaged with a JEOL JEM-2100 TEM (2). Rahel is a Myoviridae of 178 nm total length with an isometric head and tail of 89 nm (Fig. 1). DNA was extracted from a high-titer lysate using a Promega DNA Wizard kit with a modified protocol (2). Genomic DNA libraries were generated using an Ultra II FS kit (New England Biolabs [NEB], Ipswich, MA) with dual-indexed barcoding. Pooled libraries were run on an Illumina MiSeq system at the University of Pittsburgh, which yielded at least 547,000 single-end 150-base reads. Read coverage depth was 471× without Sanger finishing reactions. The reads were assembled using Newbler and Consed (3). A full genome sequence was present, as the contigs on both ends overlapped and the genome circularized. Rahel was annotated using DNA Master (http://cobamide2.bio.pitt.edu/computer.htm) and the gene prediction tools GLIMMER v3.0 (4), GeneMark v2.5 (5), and Starterator v1.1 to determine gene start sites. tRNA and transfer-messenger RNA (tmRNA) predictions were made using ARAGORN v1.2.38 (6) and tRNAscan-SE v3.0 (7). Functional assignments were made using BLAST v2.9 (8), HHpred (9), and Phamerator (10). Default settings were used for all programs.
FIG 1.
TEM of Rahel mounted on Formvar-coated copper grids stained with uranyl acetate and imaged with a JEOL JEM-2100 microscope. Rahel’s virions are 178 nm in total length, with an isometric head and tail of 89 nm (×50,000 magnification).
Rahel has 155,955 bp, 64.7% G+C content, 270 open reading frames (ORFs), 33 tRNAs, 1 tmRNA, 53 genes of known function, and 182 hypothetical proteins (67.6%); only five ORFs are coded on the reverse strand. Rahel has a circularly permuted genome, as all of the reads assembled into a large contig and there was no large buildup of reads or coverage variation at the ends (2). When linearized, the coding region of gene 270 is disrupted. Structural genes include genes 99 (the major capsid protein) and 100 (the capsid decoration protein), genes 126 to 144 (the tail structural proteins, including the tail assembly chaperone), and genes 129 and 130. Rahel has a tail sheath protein gene, 126, and two baseplate wedge protein genes, 138 and 139, but a tail tube protein was not identified. Genes involved in nucleotide metabolism and DNA replication are found on the right end of the genome from gene 147 on to the end. In Rahel, the lysis system is encoded by genes 252 and 254, lysin A and B, respectively, followed by a terminase, gene 255. We did not find a holin gene as part the lysis system.
Data availability.
The complete sequence for Rahel’s genome can be found in GenBank under accession number MK359348 and SRA accession number SRX9162484.
ACKNOWLEDGMENTS
We thank the HHMI SEA-PHAGES program for the opportunity to be part of this project. We are grateful to the School of Arts and Sciences at SUNY Old Westbury and the Department of Biology for supporting this work. We also thank the University of Pittsburgh SEA-PHAGES team members involved in sequencing, assembly, and quality control (QC) annotation.
REFERENCES
- 1. Hatful G. 2014. Molecular genetics of mycobacteriophages. Microbiol Spectr 2:1–36. doi: 10.1128/microbiolspec.MGM2-0032-2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Poxleitner M, Pope W, Jacobs-Sera D, Sivanathan V, Hatfull G. 2018. Phage discovery guide. Howard Hughes Medical Institute, Chevy Chase, MD. [Google Scholar]
- 3. Russell DA. 2018. Sequencing, assembling, and finishing complete bacteriophage genomes. Methods Mol Biol 1681:109–125. doi: 10.1007/978-1-4939-7343-9_9. [DOI] [PubMed] [Google Scholar]
- 4. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Lukashin A, Borodovsky M. 1998. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res 26:1107–1115. doi: 10.1093/nar/26.4.1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 9. Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The complete sequence for Rahel’s genome can be found in GenBank under accession number MK359348 and SRA accession number SRX9162484.

