Abstract
Bacteriophage Wheezy, a lytic phage with siphoviral morphology isolated using the host Gordonia terrae 3612, has a genome of 67,021 base pairs and is 65.9% GC. The genome sequence of Wheezy aligns most closely with subcluster CR2 phages Tracker and NatB6. Annotation of the full-length genome sequence of Phage Wheezy revealed 92 protein-coding genes and no tRNA genes.
Figure 1. Transmission electron micrograph of Wheezy virions.
Negative-stained (1% uranyl acetate) transmission electron microscopy of virions of bacteriophage Wheezy exhibit a siphoviral morphology. Measurements of six particles showed tail lengths ranging from 231 to 249 nm (average 238 nm) and 54 to 62 nm diameter capsids (average 59 nm). Scale bar = 200 nm. Image acquired at the Nebraska Center for Biotechnology Morrison Microscopy Core Facility (Univ. Nebraska-Lincoln) using Hitachi Model HT-7800.
Description
Members of the Gram-positive Gordonia genus, in the Actinobacteria phylum, are found largely in soil and aquatic environments (Sowani et al., 2018) , and include a number of pathogenic isolates (Ramanan et al., 2013; Gulvik et al., 2021) . Capable of diverse metabolic transformations, Gordonia species are reservoirs of genes that can be exploited for bioremediation and industrial biotechnology (Arenskötter et al., 2004; Drzyzga, 2012; Sowani et al., 2019) . Bacteriophages infecting Gordonia are potential molecular tools for studying the genes coding for metabolic enzymes of Gordonia (Dyson et al., 2015) . We report the genome sequence of Wheezy, an actinobacteriophage infecting the soil bacterium Gordonia terrae 3612.
Wheezy was obtained from a soil sample collected from Black Elk Park (Millard, NE; GPS 41.198411 N, 96.168891 W) using G. terrae 3612 as the host following standard protocols for enriched isolation (Zorawik et al., 2024) . Briefly, soil was mixed with peptone-yeast extract-calcium (PYCa) liquid media and the collected extract was filter-sterilized with a 0.22-μm filter before inoculation with G. terrae 3612. After incubation at 30˚C, with shaking for 24 hours, the extract was centrifuged to remove bacteria and tested for phage using G. terrae 3612 in soft (PYCa) agar plaque assays, with plates being incubated for 48 h at 30˚C. Wheezy made clear plaques of 1 to 1.5 mm diameter. After three rounds of plaque purification, negative-stained Wheezy phage from a lysate were observed by transmission electron microscopy to have a siphoviral morphology ( Figure 1 ).
Phage DNA was isolated from a lysate using the Wizard DNA Cleanup Kit (Promega Corp., Madison, WI), prepared for sequencing with the New England Biolabs Ultra II Library Kit and sequenced at the Pittsburgh Bacteriophage Institute using Illumina MiSeq (v3 reagents). A total of 563,642 (150-base single-end) reads provided 1257-fold coverage. Genome assembly was accomplished and checked for sequence quality and completeness with Newbler v2.9 and Consed v29 (Russell, 2018) .
The Wheezy genome was 67,021 bp with a 3’ single-stranded overhang (CGCCGCGTAC) and 65.9% GC, and assigned to cluster CR (subcluster CR2) based on gene content similarity (GCS) of 35% or higher with phages in the Actinobacteriophage database (phagesdb.org), using the database GCS tool (Pope et al., 2017; Russell and Hatfull 2017) . Wheezy was most similar to GrootJr, sharing 98.38% of its phams (phage protein families) followed by NovumRegina and NatB6, sharing 97.83% and 97.3%, respectively, of their phams with Wheezy (Russell, 2018) .
The genome of Wheezy was automatically annotated with DNA Master v5.23.6 (Lawrence, 2007) that used Glimmer v3.92b (Delcher et al., 2007) and GeneMark v2.5p (Besemer and Borodovsky 2005) to identify open reading frames (ORFs). Neither tRNAs nor transfer-messenger RNAs (tmRNA) were detected using the programs tRNAscan-SE 2.0 and ARAGORN v1.2.41 (Lowe and Eddy 1997; Laslett and Canback 2004) . Web-based bioinformatic analysis software programs including NCBI BLASTp (Altschul et al., 1997) using NCBI non-redundant protein sequences and Actinobacteriophage Protein databases, Starterator (phages.wustl.edu/starterator/) and HHPred 3.18 (Soding et al., 2005) , using PDB mmCIF70, SCOPe70, Pfam-A and NCBI Conserved Domain databases, were used to confirm or revise start sites and assign functions for 40 of a total of 92 ORFs. These include six membrane proteins identified using Deep TMHMM v 1.024 (Hallgren et al., 2022) and SOSUI v1.11 (Hirokawa et al., 1998) . All tools were run with default parameters. Gene maps of Wheezy and other CR2 subcluster phage (genomes average 92 total genes) were compared using Phamerator (Actino_Draft v513) (Cresawn et al., 2011) .
The left arm of the Wheezy genome encodes genes that are transcribed in the forward direction (gp1–51). Genes for the terminase large subunit, and phage structural and assembly proteins, including the tail assembly chaperones that are predicted to be produced through a translation frameshift (Xu et al., 2004) , are clustered together (gp24–44). As seen in CR Cluster phage, the two domains of lysin A are encoded by adjacent but separate genes (gp47 and gp48). Genes transcribed in the reverse direction are located roughly in the middle portion of the genome (gp52–77) and include ones predicted to encode proteins that interact with DNA such as WhiB family transcription factors (gp54 and gp73), Dna-E like DNA polymerase III (gp60) and DNA helicase (gp72). The remaining downstream genes are transcribed in the forward direction and encode proteins with unknown functions (gp78–92). Genes for neither integrase nor repressor functions could be identified, suggesting that the phage is unlikely to establish lysogeny.
Nucleotide sequence accession numbers
Phage Wheezy is available at GenBank with Accession No. OR475262 and Sequence Read Archive (SRA) No. SRX20630260 .
Acknowledgments
Acknowledgments
Annotation of the Wheezy genome was done primarily by RHM to fulfill requirements for Independent Research credit under the supervision of CPC. Wheezy was isolated in Fall 2021 by Doane Univ. (Crete, NE) students M. Randazzo and A. Kuhl. They were guided by Dr. Dane Bowder who provided the TEM micrograph. We thank Renee Box and Chanasei Ziemann (Univ. Nebraska-Lincoln) for assisting in the review of the Wheezy genome annotation, and the HHMI SEA-PHAGES Program for support.
Funding Statement
No funding.
References
- Altschul S. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research. 1997 Sep 1;25(17):3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arenskötter Matthias, Bröker Daniel, Steinbüchel Alexander. Biology of the Metabolically Diverse Genus Gordonia . Applied and Environmental Microbiology. 2004 Jun 1;70(6):3195–3204. doi: 10.1128/aem.70.6.3195-3204.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Besemer J., Borodovsky M. GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Research. 2005 Jul 1;33(Web Server):W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cresawn Steven G, Bogel Matt, Day Nathan, Jacobs-Sera Deborah, Hendrix Roger W, Hatfull Graham F. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics. 2011 Oct 12;12(1) doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delcher Arthur L., Bratke Kirsten A., Powers Edwin C., Salzberg Steven L. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007 Jan 19;23(6):673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drzyzga Oliver. The strengths and weaknesses of Gordonia : A review of an emerging genus with increasing biotechnological potential . Critical Reviews in Microbiology. 2012 May 3;38(4):300–316. doi: 10.3109/1040841x.2012.668134. [DOI] [PubMed] [Google Scholar]
- Dyson Zoe A., Tucci Joseph, Seviour Robert J., Petrovski Steve. Lysis to Kill: Evaluation of the Lytic Abilities, and Genomics of Nine Bacteriophages Infective for Gordonia spp. and Their Potential Use in Activated Sludge Foam Biocontrol. PLOS ONE. 2015 Aug 4;10(8):e0134512–e0134512. doi: 10.1371/journal.pone.0134512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gulvik Christopher A., Batra Dhwani, Rowe Lori A., Sheth Milli, Nobles Sarah, Lee Justin S., McQuiston John R., Lasker Brent A. Complete and Circularized Bacterial Genome Sequence of Gordonia sp. Strain X0973 . Microbiology Resource Announcements. 2021 Mar 4;10(9) doi: 10.1128/mra.01479-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hallgren Jeppe, Tsirigos Konstantinos D., Pedersen Mads Damgaard, Almagro Armenteros José Juan, Marcatili Paolo, Nielsen Henrik, Krogh Anders, Winther Ole. DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks. 2022 Apr 10; doi: 10.1101/2022.04.08.487609. [DOI]
- Hirokawa T, Boon-Chieng S, Mitaku S. SOSUI: classification and secondary structure prediction system for membrane proteins. Bioinformatics. 1998 Jan 1;14(4):378–379. doi: 10.1093/bioinformatics/14.4.378. [DOI] [PubMed] [Google Scholar]
- Laslett D. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Research. 2004 Jan 2;32(1):11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawrence JG. 2007. DNA Master (Version 5.23.6). Retrieved from cobamide2.bio.pitt.edu/computer.htm
- Lowe Todd M., Eddy Sean R. tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence. Nucleic Acids Research. 1997 Mar 1;25(5):955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pope Welkin H., Mavrich Travis N., Garlena Rebecca A., Guerrero-Bustamante Carlos A., Jacobs-Sera Deborah, Montgomery Matthew T., Russell Daniel A., Warner Marcie H., Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) Hatfull Graham F. Bacteriophages of Gordonia spp. Display a Spectrum of Diversity and Genetic Relationships . mBio. 2017 Sep 6;8(4) doi: 10.1128/mbio.01069-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramanan Poornima, Deziel Paul J., Wengenack Nancy L. Gordonia Bacteremia. Journal of Clinical Microbiology. 2013 Oct 1;51(10):3443–3447. doi: 10.1128/jcm.01449-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Russell Daniel A. Sequencing, Assembling, and Finishing Complete Bacteriophage Genomes. Methods in Molecular Biology. 2017 Nov 14;:109–125. doi: 10.1007/978-1-4939-7343-9_9. [DOI] [PubMed]
- Russell Daniel A, Hatfull Graham F. PhagesDB: the actinobacteriophage database. Bioinformatics. 2016 Dec 6;33(5):784–786. doi: 10.1093/bioinformatics/btw711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soding J., Biegert A., Lupas A. N. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Research. 2005 Jul 1;33(Web Server):W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sowani Harshada, Kulkarni Mohan, Zinjarde Smita. An insight into the ecology, diversity and adaptations of Gordonia species . Critical Reviews in Microbiology. 2017 Dec 25;44(4):393–413. doi: 10.1080/1040841x.2017.1418286. [DOI] [PubMed] [Google Scholar]
- Sowani Harshada, Kulkarni Mohan, Zinjarde Smita. Harnessing the catabolic versatility of Gordonia species for detoxifying pollutants. Biotechnology Advances. 2019 May 1;37(3):382–402. doi: 10.1016/j.biotechadv.2019.02.004. [DOI] [PubMed] [Google Scholar]
- Xu Jun, Hendrix Roger W., Duda Robert L. Conserved Translational Frameshift in dsDNA Bacteriophage Tail Assembly Genes. Molecular Cell. 2004 Oct 1;16(1):11–21. doi: 10.1016/j.molcel.2004.09.006. [DOI] [PubMed] [Google Scholar]
- Zorawik Michelle, Jacobs-Sera Deborah, Freise Amanda C., SEA-PHAGES. Reddi Krisanavane. Isolation of Bacteriophages on Actinobacteria Hosts. Methods in Molecular Biology. 2024:273–298. doi: 10.1007/978-1-0716-3798-2_17. [DOI] [PubMed]

