Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2022 Nov 2;11(12):e00771-22. doi: 10.1128/mra.00771-22

Complete Genome Sequence of Arthrobacter Phage GantcherGoblin Exhibits Both Conservation with Subcluster AU6 Phages and Genetic Novelty

J J Wheeler a, Carina M Carlos a, Helen A Cedzidlo a, Xingfeiyang Liu a, Massimo S Modica a, Izaiah D Rhodes a, Leah F Truskinovsky a, Ethan M VanGosen a, Hannah E Gavin b,c,
Editor: Simon Rouxd
PMCID: PMC9753662  PMID: 36321902

ABSTRACT

GantcherGoblin is a lytic siphovirus that was isolated on Arthrobacter globiformis B-2979 from soil collected in Massachusetts. The 55,368-bp genome has a GC content of 50.1% and 91 predicted protein-coding genes. Based on gene content similarity to phages in the Actinobacteriophage Database, GantcherGoblin was assigned to phage subcluster AU6.

ANNOUNCEMENT

Bacteriophages (phages) are abundant and diverse members of the biosphere (1). Generating a comprehensive catalog of phages increases our chances of successfully applying these entities to medical and environmental challenges (2). Here, we report the discovery and characterization of the novel phage GantcherGoblin.

GantcherGoblin was isolated by undergraduate students at Tufts University Experimental College in partnership with the national Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) program, using standard procedures (3, 4). Briefly, GantcherGoblin was extracted from a soil sample that had been collected in Medford, Massachusetts (42.410N, 71.115596W), in early September 2021, by washing the soil in peptone-yeast extract-calcium (PYCa) liquid medium. The wash was filtered (0.02-μm pore size), and a filtrate sample was plated in PYCa top agar with Arthrobacter globiformis B-2979. GantcherGoblin was purified through three consecutive rounds of plating with incubations at 30°C. Negative-staining transmission electron microscopy (TEM) revealed GantcherGoblin to exhibit siphoviral morphology (Fig. 1).

FIG 1.

FIG 1

Characteristics of GantcherGoblin plaques and virions. (a) When plated on the isolation host Arthrobacter globiformis B-297 and incubated at 30°C, GantcherGoblin plaques exhibit clear centers and turbid margins. (b) Virions negatively stained with 1% uranyl acetate for TEM reveal siphoviral morphology, with prolate icosahedral heads and long tails. Scale bar = 100 nm (21, 22). (c) The dimensions of 5 virion capsids and 4 virion tails, measured using ImageJ, are plotted with a horizontal line at the calculated mean and error bars indicating the standard deviations of the mean (22). On average, GantcherGoblin tails measure 231 nm ± 5 nm. Capsids measure 53 nm ± 3 nm along the axis perpendicular to the tail (Head1) and 71 nm ± 1 nm along the same axis as the tail (Head2).

DNA was extracted from a plate-derived high-titer lysate using the Wizard DNA clean-up system (A7280; Promega) and submitted to the University of Pittsburgh, where a library was prepared for sequencing with a NEBNext Ultra II FS kit. The library was sequenced using an Illumina MiSeq system (v3 reagents), yielding 565,517 unpaired 150-bp reads (coverage, ~1,443×). Raw reads were assembled with Newbler v2.9 and checked using Consed v29 (5, 6).

Coding regions were initially identified by using DNA Master v5.0.2 to run GeneMark v4.9, Glimmer v3.02b, and ARAGORN v1.1 (79). Putative open reading frames (ORFs) and start positions were confirmed or revised through examination of Shine-Dalgarno sequences, Starterator reports, and BLAST searches against the PhagesDB Actinobacteriophage Database and NCBI BLASTn standard nucleotide databases (7, 1012). Functions were assigned to ORFs by BLASTp searches against standard databases and HHpred searches against the following databases: PDB mmCIF70_12_Oct, Pfam-A v35, NCBI Conserved Domains (CD) v3.18, and PRD v6.9. Phamerator and the Comprehensive Antibiotic Resistance Database (CARD) v3.2.0 Resistance Gene Identifier (RGI) v5.2.1 were also consulted when predicting gene functions (1315). When a function could not be assigned, TMHMM v2.0 and SOSUI were employed to identify transmembrane domains (1618).

The GantcherGoblin genome is 55,368 bp, with a GC content of 50.1%. The genome termini have 9-bp 3′ single-stranded overhangs (5′-CGCCGGCCT-3′). All of the 91 protein-coding ORFs of GantcherGoblin are transcribed in the same direction. Of these genes, 87 have homologs in the Actinobacteriophage Database as of 31 May 2022. Functions were predicted for 26 GantcherGoblin genes; 11 encode virion structure or assembly components, while 15 encode nonstructural proteins. An additional 5 ORFs are predicted transmembrane proteins. No antibiotic resistance genes or tRNAs were identified. The absence of lysogeny-associated genes in the genome is consistent with GantcherGoblin plaque morphology and suggests an obligately lytic replication cycle (Fig. 1a).

Based on gene content similarity (GCS) to phages in the Actinobacteriophage Database as of 3 October 2022, GantcherGoblin was assigned to cluster AU, subcluster AU6 (12, 19, 20). GantcherGoblin shares 83.1% and 84.2% GCS with the two other subcluster AU6 phages, i.e., Uzumaki and Zeina, respectively. Like those of Zeina, the virions of GantcherGoblin have a prolate capsid structure (Fig. 1b and c).

Unusually large intergenic regions of up to ~200 bp are interspersed among Gantcher Goblin genes 32 to 36. In addition, the genomic region from bp ~13,000 to 16,000, which includes predicted minor tail proteins, has low conservation with other AU6 phages. The GantcherGoblin genome lacks several genes that are present in both Uzumaki and Zeina and, conversely, possesses novel genes (genes 48, 51, 76, and 87) with currently unknown functions.

Data availability.

The complete GantcherGoblin genome is available in GenBank under accession number ON970564. GantcherGoblin raw sequencing data are archived under SRA accession number SRX14443506.

ACKNOWLEDGMENTS

This project was supported the Howard Hughes Medical Institute SEA-PHAGES program, the Tufts Experimental College, and the Tufts Department of Biology. We appreciate financial support from the Tufts Dean of Academic Affairs, Sam Thomas, and infrastructural and financial support from Howard Woolf and Amy Goldstein.

Genome sequencing was performed at the University of Pittsburgh. Electron microscopy imaging, consultation, and services were performed by the HMS Electron Microscopy Facility.

We particularly acknowledge and appreciate training and technical support from Becky Garlena, Dan Russell, Deborah Jacobs-Sera, and Graham Hatfull; electron microscopy sample processing and imaging services from Anja Nordstrom and Maria Ericsson; and direct course support from Michael Grossi and Brendan Carson.

Contributor Information

Hannah E. Gavin, Email: hannah.gavin@tufts.edu.

Simon Roux, DOE Joint Genome Institute.

REFERENCES

  • 1.Hatfull GF. 2015. Dark matter of the biosphere: the amazing world of bacteriophage diversity. J Virol 89:8107–8110. doi: 10.1128/JVI.01340-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hatfull GF. 2020. Actinobacteriophages: genomics, dynamics, and applications. Annu Rev Virol 7:37–61. doi: 10.1146/annurev-virology-122019-070009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Hatfull GF. 2015. Innovations in undergraduate science education: going viral. J Virol 89:8111–8113. doi: 10.1128/JVI.03003-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Jordan TC, Burnett SH, Carson S, Caruso SM, Clase K, DeJong RJ, Dennehy JJ, Denver DR, Dunbar D, Elgin SC, Findley AM, Gissendanner CR, Golebiewska UP, Guild N, Hartzog GA, Grillo WH, Hollowell GP, Hughes LE, Johnson A, King RA, Lewis LO, Li W, Rosenzweig F, Rubin MR, Saha MS, Sandoz J, Shaffer CD, Taylor B, Temple L, Vazquez E, Ware VC, Barker LP, Bradley KW, Jacobs-Sera D, Pope WH, Russell DA, Cresawn SG, Lopatto D, Bailey CP, Hatfull GF. 2014. A broadly implementable research course in phage discovery and genomics for first-year undergraduate students. mBio 5:e01051-13. doi: 10.1128/mBio.01051-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Miller JR, Koren S, Sutton G. 2010. Assembly algorithms for next-generation sequencing data. Genomics 95:315–327. doi: 10.1016/j.ygeno.2010.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Gordon D, Green P. 2013. Consed: a graphical editor for next-generation sequencing. Bioinformatics 29:2936–2937. doi: 10.1093/bioinformatics/btt515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Besemer J, Borodovsky M. 2005. GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Pope WH, Jacobs-Sera D. 2018. Annotation of bacteriophage genome sequences using DNA Master: an overview, p 217–229. In Clokie MRJ, Kropinski AM, Lavigne R (ed), Bacteriophages: methods and protocols, vol 3. Springer, New York, NY. [DOI] [PubMed] [Google Scholar]
  • 11.Washington University in St. Louis. 2022. Starterator. http://phages.wustl.edu/starterator.
  • 12.Russell DA, Hatfull GF. 2017. PhagesDB: the Actinobacteriophage Database. Bioinformatics 33:784–786. doi: 10.1093/bioinformatics/btw711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Soding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 15.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Krogh A, Larsson B, von Heijne G, Sonnhammer EL. 2001. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305:567–580. doi: 10.1006/jmbi.2000.4315. [DOI] [PubMed] [Google Scholar]
  • 17.Hirokawa T, Boon-Chieng S, Mitaku S. 1998. SOSUI: classification and secondary structure prediction system for membrane proteins. Bioinformatics 14:378–379. doi: 10.1093/bioinformatics/14.4.378. [DOI] [PubMed] [Google Scholar]
  • 18.Mitaku S, Hirokawa T, Tsuji T. 2002. Amphiphilicity index of polar amino acids as an aid in the characterization of amino acid preference at membrane-water interfaces. Bioinformatics 18:608–616. doi: 10.1093/bioinformatics/18.4.608. [DOI] [PubMed] [Google Scholar]
  • 19.Pope WH, Mavrich TN, Garlena RA, Guerrero-Bustamante CA, Jacobs-Sera D, Montgomery MT, Russell DA, Warner MH, Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) , Hatfull GF. 2017. Bacteriophages of Gordonia spp. display a spectrum of diversity and genetic relationships. mBio 8:e01069-17. doi: 10.1128/mBio.01069-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Mavrich TN, Hatfull GF. 2017. Bacteriophage evolution differs by host, lifestyle and genome. Nat Microbiol 2:17112. doi: 10.1038/nmicrobiol.2017.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ackermann HW. 1998. Tailed bacteriophages: the order Caudovirales. Adv Virus Res 51:135–201. doi: 10.1016/s0065-3527(08)60785-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Schneider CA, Rasband WS, Eliceiri KW. 2012. NIH Image to ImageJ: 25 years of image analysis. Nat Methods 9:671–675. doi: 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The complete GantcherGoblin genome is available in GenBank under accession number ON970564. GantcherGoblin raw sequencing data are archived under SRA accession number SRX14443506.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES