ABSTRACT
MaGuCo is a temperate phage isolated from soil collected in Alton, NH, USA, using Arthrobacter globiformis. Its genome is 43,924 base pairs long and contains 63 protein-encoding genes, 44 of which were assigned putative functions. MaCuGo is assigned to cluster AZ2 based on gene content similarity to actinobacteriophages.
KEYWORDS: bacteriophage genetics, genome analysis, actinobacteriophage, cluster AZ
ANNOUNCEMENT
Bacteriophages (phages) are viruses that reproduce by infecting and lysing host bacterial cells (1). With increasing antibiotic resistance being found in pathogenic bacteria, the antibacterial properties of phages have garnered interest as a possible treatment alternative (2). The goal of this study was to isolate and characterize a unique phage from a local soil sample. The research was conducted primarily via protocols developed by Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (3). The in-depth study of individual phage species is a crucial first step to understanding the possible medical applications of phages (2).
MaGuCo is a temperate phage from cluster AZ, subcluster AZ2 that infects Arthrobacter globiformis B-2979. MaGuCo was isolated in 2022 from a soil sample collected in Alton, NH, USA (43.4432 N, 71.22524 W) using standard protocols (3). Briefly, the soil sample was washed with peptone-yeast extract-calcium chloride broth, the wash was filtered through a 0.22 µm filter, and the filtrate was plated in top agar with Arthrobacter globiformis B-2979. MaGuCo, which results in bullseye plaques approximately 6 mm in diameter (Fig. 1) after 48 hours of incubation at 30°C, was purified through three rounds of plating. The phage was then amplified, and a high-titer lysate was collected for DNA extraction and transmission electron microscopy.
Fig 1.

Plaque assay agar plate showing MaGuCo plaque morphology.
MaGuCo’s DNA was extracted from a lysate using the Phage DNA Isolation Kit from Norgen Biotek Corporation. The DNA was prepared for sequencing using the NEB Ultra II Library Kit and shotgun sequenced with an Illumina MiSeq with v3 reagents at the Pittsburgh Bacteriophage Institute to yield 353,597 150-base single-end reads that provided coverage of approximately 1,175×. Reads were assembled with Newbler v2.9 (4) and verified with Consed v29.0 (4), resulting in a genome of 43,924 base pairs (bp) long with a guanine and cytosine content of 68.2% and 3′ single-stranded overhang (5′-CGGAGAGGCAT-3′). Based on gene content similarity (GCS) of at least 35% to actinobacteriophages, using the GCS tool at the Actinobacteriophage database (https://phagesdb.org/), MaGuCo was assigned to cluster AZ, subcluster AZ2 (5, 6).
DNA Master v5.23.6 build 2705 (https://phagesdb.org/DNAMaster/) was used to auto-annotate the sequenced genome, and then Phage Evidence Collection and Annotation Network (PECAAN) v20221109 (7), Glimmer v3.02 (8), GeneMark v2.5p (09.08.06) (9), Starterator v1.2 (10), Phamerator v519 (11), BLASTp (using the PhagesDB and NCBI nonredundant databases) (12), HHPred (using the PDB mmCIF70, Pfam-A, and NCBI Conserved Domain databases) (13), TMHMM v2.0 (14), and TOPCONS v2.0 (15) were used to further refine the annotation and assign functions. A total of 63 genes were predicted, 44 of which were assigned a putative function. No tRNAs were found, and 19 genes were not assigned a function.
As with other cluster AZ phages, MaGuCo’s structure and assembly genes are encoded on the left third of the genome, while DNA metabolism genes are encoded on the right two-thirds, including a serine integrase. The endolysin of cluster AZ phages can be found encoded within the first third or last third of the genome. For MaGuCo, the endolysin is encoded by gene 20. Of the 63 genes, 4 genes have no known homologs in the Actinobacteriophage and NCBI nonredundant databases (6, 12).
ACKNOWLEDGMENTS
The research was supported by New Hampshire-INBRE through an Institutional Development Award (IDeA), P20GM103506, from the National Institute of General Medical Sciences of the NIH.
We would like to extend a special thank you to Karel Pluhar and Dr. Benjamin Moyer for going above and beyond to assist with this project, as well as SEA-PHAGES for the extensive collection of phage research information and protocols that were the backbone of our study.
Contributor Information
Beth M. Wilkes, Email: bwilkes@ccsnh.edu.
Catherine Putonti, Loyola University Chicago, Chicago, Illinois, USA.
DATA AVAILABILITY
MaGuCo is available at GenBank with accession No. OQ709203 and Sequence Read Archive (SRA) No. SRX20916068.
REFERENCES
- 1. Hatfull GF. 2020. Actinobacteriophages: genomics, dynamics, and applications. Annu Rev Virol 7:37–61. doi: 10.1146/annurev-virology-122019-070009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Sulakvelidze A, Alavidze Z, Morris JG. 2001. Bacteriophage therapy. Antimicrob Agents Chemother 45:649–659. doi: 10.1128/AAC.45.3.649-659.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Poxleitner M, Pope W, Jacobs-Sera D, Sivanathan V, Hatfull G. 2018. Phage discovery guide. Howard Hughes Medical Institute, Chevy Chase, MD. Available from: http://seaphagesphagediscoveryguide.helpdocsonline.com/home [Google Scholar]
- 4. Russell DA. 2018. Sequencing, assembling, and finishing complete bacteriophage genomes. Methods Mol Biol 1681:109–125. doi: 10.1007/978-1-4939-7343-9_9 [DOI] [PubMed] [Google Scholar]
- 5. Pope WH, Mavrich TN, Garlena RA, Guerrero-Bustamante CA, Jacobs-Sera D, Montgomery MT, Russell DA, Warner MH, Hatfull GF, Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) . 2017. Bacteriophages of Gordonia spp. display a spectrum of diversity and genetic relationships. mBio 8:e01069-17. doi: 10.1128/mBio.01069-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Russell DA, Hatfull GF. 2017. PhagesDB: the actinobacteriophage database. Bioinformatics 33:784–786. doi: 10.1093/bioinformatics/btw711 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Rinehart CA, Gaffney BL, Smith JR, Wood JD. 2016. PECAAN: phage evidence collection and annotation network. Western Kentucky University Bioinformatics and Information Science Center, Bowling Green. https://seaphages.org/media/docs/PECAAN_User_Guide_Dec7_2016.pdf. [Google Scholar]
- 8. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Lukashin AV, Borodovsky M. 1998. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res 26:1107–1115. doi: 10.1093/nar/26.4.1107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Pacey M. 2016. Edited by Pope W. Starterator guide. University of Pittsburgh, Pittsburgh, PA. Available from: https://seaphages.org/media/docs/Starterator_Guide_2016.pdf [Google Scholar]
- 11. Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinforma 12:395. doi: 10.1186/1471-2105-12-395 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2 [DOI] [PubMed] [Google Scholar]
- 13. Söding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Krogh A, Larsson B, von Heijne G, Sonnhammer ELL. 2001. Predicting transmembrane protein topology with a hidden markov model: application to complete genomes. J Mol Biol 305:567–580. doi: 10.1006/jmbi.2000.4315 [DOI] [PubMed] [Google Scholar]
- 15. Tsirigos KD, Peters C, Shu N, Käll L, Elofsson A. 2015. The TOPCONS web server for consensus prediction of membrane protein topology and signal peptides. Nucleic Acids Res 43:W401–W407. doi: 10.1093/nar/gkv485 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
MaGuCo is available at GenBank with accession No. OQ709203 and Sequence Read Archive (SRA) No. SRX20916068.
