ABSTRACT
The microbacteriophages QuadZero and AnnaLie were isolated from soil samples from Charlotte, NC, and were classified into EA and EB clusters, respectively. QuadZero has a 40,140 base-pair double-stranded DNA genome with 62 predicted protein coding genes, whereas AnnaLie has a 41,665-bp genome with 71 predicted protein coding genes.
ANNOUNCEMENT
We report the genome sequences of two microbacteriophage, QuadZero and AnnaLie, that were isolated from two different soil samples (35.189N and 80.831W and 35.264N and 81.015W, respectively) as part of the Howard Hughes Medical Institute’s (HHMI) Science Education Alliance-Phage Hunters Advancing Genomic and Evolutionary Science program (SEA-PHAGES) (1). These genome sequences contribute to the greater understanding of the diversity and abundance of microbacteriophage species. This information can provide a better understanding of their relationship to their bacterial host and ecological role (1).
Using the protocols provided in the HHMI’s SEA-PHAGES discovery guide, (https://seaphagesphagediscoveryguide.helpdocsonline.com/home) the microbacteriophage were isolated using the enriched isolation protocol and the bacterial host Microbacterium foliorum NRRL B-24224 (provided by the Pittsburgh Bacteriophage Institute at University of Pittsburgh) grown in PYCa liquid media at 30°C for 5 days. Isolation was followed by three to four rounds of plaque assays to purify the phage, followed by production of high-volume lysates to amplify the phage. Subsequent negative staining transmission electron microscopy revealed that QuadZero and AnnaLie have siphovirus morphology. A representative virion for QuadZero had a capsid diameter of 50 nm and tail length of 113 nm, while a representative virion for AnnaLie had a capsid diameter of 47 nm and a tail length of 141 nm (Fig. 1) as measured by imageJ v.1.53k (2).
FIG 1.

Transmission electron microscopy of the microbacteriophage Quadzero and AnnaLie. High-titer lysate was attached to a 300 copper mesh grid and negatively stained with 2% uranyl acetate. Imaging performed using a JOEL JEM-1400Plus Transmission Electron Microscope revealed siphovirus morphology for both phage. Quadzero had a capsid diameter of 50 nm and tail length of 113 nm (A), whereas AnnaLie had a capsid diameter of 47 nm and tail length of 113 nm (B).
Genomic DNA was isolated from the bacteriophage using the Promega Wizard DNA cleanup kit and prepared for sequencing with a NEB Ultra II DNA kit. Genomes were sequenced at the Pittsburgh Bacteriophage Institute with an Illumina MiSeq instrument, yielding 515,401 reads of 150-base single-end reads (QuadZero) and 813,568 reads of 150-base single-end reads (AnnaLie). The raw reads were assembled using Newbler 2.7 (3) with default settings in each case yielding a single contig for each phage. Consed version 29 (4) was used to check for genomic termini, accuracy, and completeness, as previously described (5). QuadZero had a genome length of 40,140 bp with a fold coverage of 1787, a G+C% content of 63.6%, and circularly permuted ends. AnnaLie had a genome length of 41,665 bp with a fold coverage of 842, a G+C% content of 66.6%, with 10-base single-stranded 3’ extensions (5′-TCTCCCGGCA−3′).
The protein coding regions of the phage genomes were predicted using Glimmer v3.02 (6) and GeneMark v2.5p (7) embedded within DNA Master v5.02 (http://cobamide2.bio.pitt.edu/computer.htm) using the settings described in HHMI’s SEA-PHAGES Bioinformatics Guide (https://seaphagesbioinformatics.helpdocsonline.com/home). Gene starts were predicted using Starterator (http://seaphages.org/software/), while protein function was determined using NCBI BLASTp 2.7 (8), Phamerator (9) and HHPred v3.0beta (10). QuadZero contained 62 predicted protein-coding regions with 24 putative functions, whereas AnnaLie contained 71 predicted protein-coding genes with 31 putative functions. No tRNA sequences were identified in either bacteriophage genome by Aragorn v1.1 (11) and tRNAscan-SE v 2.0.6 (12). QuadZero and AnnaLie both contained putative translational frameshifts in the tail assembly chaperone genes.
Clustal Omega alignment (https://www.ebi.ac.uk/Tools/msa/clustalo/) (13) using default settings indicated a 71% average nucleotide sequence identity (ANI) between QuadZero and Alakazam (GenBank accession number MT024862) and a 70.95% average nucleotide sequence identity (ANI) with Neferthena (GenBank accession number MH697589), both of which are members of the EA cluster, a microbacteriophage cluster. AnnaLie had a 94.86% average nucleotide sequence identity (ANI) with SansAfet (GenBank accession number MN329675), and a 90.22% average nucleotide sequence identity (ANI) with BubbaBear (GenBank accession number MK814753), both of which are members of the EB cluster, also a microbacteriophage cluster.
Data availability.
GenBank and SRA accession numbers for QuadZero and AnnaLie are listed in Table 1.
TABLE 1.
Phage accession numbers for GenBank and SRA and genome assembly results
| Phage name | Genbank accession no. | SRA accession no. | Avg coverage (x) | Cluster | Genome length (bp) | GC content (%) | No. of genes |
|---|---|---|---|---|---|---|---|
| QuadZero | MZ150787 | SRX12371995 | 1787 | EA | 40,140 | 63.6% | 62 |
| AnnaLie | MT818427 | SRX12371994 | 842 | EB | 41,665 | 66.6% | 71 |
ACKNOWLEDGMENTS
This work was supported by the Howard Hughes Medical Institute SEA-PHAGES program.
Soil samples were collected and phage isolation was performed in the fall of 2019 by students enrolled in the BIO 201 courses at Queens University of Charlotte. Electron microscopy was performed at the Electron Microscopy Laboratory at Canon Research Center at Atrium Health. Phage genomes were sequenced at the Pittsburgh Bacteriophage Institute at the University of Pittsburgh. Preliminary genome annotation was performed in the spring of 2020 by the students enrolled in the BIO 303 courses at Queens University of Charlotte.
Contributor Information
Jennifer Cook-Easterwood, Email: easterwj@queens.edu.
Simon Roux, DOE Joint Genome Institute.
REFERENCES
- 1.Jordan TC, Burnett SH, Carson S, Caruso SM, Clase K, DeJong RJ, Dennehy 0.039w?>JJ, Denver DR, Dunbar D, Elgin SCR, Findley AM, Gissendanner CR, Golebiewska UP, Guild N, Hartzog GA, Grillo WH, Hollowell GP, Hughes LE, Johnson A, King RA, Lewis LO, Li W, Rosenzweig F, Rubin MR, Saha MS, Sandoz J, Shaffer CD, Taylor B, Temple L, Vazquez E, Ware VC, Barker LP, Bradley KW, Jacobs-Sera D, Pope WH, Russell DA, Cresawn SG, Lopatto D, Bailey CP, Hatfull GF. 2014. A broadly implementable research course in phage discovery and genomics for first-year undergraduate students. mBio 5:e01051-13. doi: 10.1128/mBio.01051-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Schneider CA, Rasband WS, Eliceiri KW. 2012. NIH Image to ImageJ: 25 years of image analysis. Nat Methods 9:671–675. doi: 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Miller JR, Koren S, Sutton G. 2010. Assembly algorithms for next generation sequencing data. Genomics 95:315–327. doi: 10.1016/j.ygeno.2010.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gordon D, Green P. 2013. Consed: a graphical editor for next-generation sequencing. Bioinformatics 29:2936–2937. doi: 10.1093/bioinformatics/btt515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Russell DA. Sequencing, assembling, and finishing complete bacteriophage genomes. 2018. Methods Mol Biol 1681:109–125. doi: 10.1007/978-1-4939-7343-9_9. [DOI] [PubMed] [Google Scholar]
- 6.Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Besemer J, Borodovsky M. 2005. GeneMark: web software for gene finding in prokaryotes, eukaryotes, and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 9.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zimmermann L, Stephens A, Nam SZ, Rau D, Kubler J, Lozajic M, Gabler F, Soding J, Lupas AN, Alva V. 2018. A completely reimplemented MPI Bioinformatics Toolkit with a new HHpred server at its core. J Mol Biol 430:2237–2243. doi: 10.1016/j.jmb.2017.12.007. [DOI] [PubMed] [Google Scholar]
- 11.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chan PP, Lowe TM. tRNAscan-SE: searching for tRNA genes in genomic sequences. Methods Mol Biol 2019. 1962:1–14. doi: 10.1007/978-1-4939-9173-0_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Soding J, Thompson JD, Higgins DG. 2011. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
GenBank and SRA accession numbers for QuadZero and AnnaLie are listed in Table 1.
TABLE 1.
Phage accession numbers for GenBank and SRA and genome assembly results
| Phage name | Genbank accession no. | SRA accession no. | Avg coverage (x) | Cluster | Genome length (bp) | GC content (%) | No. of genes |
|---|---|---|---|---|---|---|---|
| QuadZero | MZ150787 | SRX12371995 | 1787 | EA | 40,140 | 63.6% | 62 |
| AnnaLie | MT818427 | SRX12371994 | 842 | EB | 41,665 | 66.6% | 71 |
