We report here the sequences of 20 bacteriophages isolated on Gordonia terrae 3612. These phages span considerable sequence diversity, represent 12 clusters and a singleton genome, and range in genome length from 16.2 kbp to 151.3 kbp. Phages Pupper and SCentae are the first reported Myoviridae phages of Gordonia spp.
ABSTRACT
We report here the sequences of 20 bacteriophages isolated on Gordonia terrae 3612. These phages span considerable sequence diversity, represent 12 clusters and a singleton genome, and range in genome length from 16.2 kbp to 151.3 kbp. Phages Pupper and SCentae are the first reported Myoviridae phages of Gordonia spp.
ANNOUNCEMENT
Bacteriophages are the most numerous and diverse biological entities in the biosphere, with an estimated global population of 1031 particles, and the population turns over every several days (1). Gordonia terrae is widely distributed in soils, is an opportunistic pathogen in immunocompromised people, and is implicated in nonproductive foaming in wastewater treatment plants (2). To advance our understanding of viral diversity of bacteria in the phylum Actinobacteria, 20 phages of Gordonia terrae 3612 were isolated and sequenced.
Phage isolation was performed either through direct plating of soil filtrate or by enriched growth with G. terrae, as described previously (3). Phages from isolated plaques were purified and amplified, and genomic DNA was extracted using a Wizard DNA prep kit (Promega). Lysates were deposited on carbon-coated copper grids, stained with uranyl acetate, and imaged using an FEI Morgagni transmission electron microscope to determine virion morphology.
Genomic libraries were prepared from phage DNA using an Illumina TruSeq Nano kit. Libraries were sequenced using an Illumina MiSeq instrument, yielding 150-bp single-end reads sufficient to provide enough coverage for each genome (Table 1). Sequence reads were quality controlled and then assembled using Newbler version 2.9; for each genome assembly, a single contig was evaluated for completeness and accuracy using Consed version 29 as described previously (4). Genomic termini were determined using PAUSE (https://cpt.tamu.edu/computer-resources/pause/); for circularly permuted genomes, coordinate number one was chosen by alignment with similar genomes or was immediately upstream of the terminase large subunit gene (4). Genomes were annotated using DNA Master (http://cobamide2.bio.pitt.edu/), ARAGORN (5), tRNAscan-SE (6), Glimmer (7), GeneMark (8), Phamerator (9), BLAST (10), and HHpred (11) and by manual inspection (12). All software was used with default settings.
TABLE 1.
Gordonia phage genometrics
| Phage name | Cluster | Genome length (bp) | G+C content (%) | Fold coverage (no. of reads) | Direct platingc | No. of CDSsb | No. of tRNAs | Genome endsa | GenBank accession no. | SRA accession no. |
|---|---|---|---|---|---|---|---|---|---|---|
| Lozinak | CQ1 | 93,201 | 61.9 | 559 | N | 180 | 6 | CGCGACGCTC | MF919520 | SRX5726432 |
| Toniann | CQ1 | 92,546 | 61.9 | 878 | N | 180 | 6 | CGCGACGCTC | MF919537 | SRX5726440 |
| Patio | CR3 | 66,251 | 65.6 | 3,811 | N | 90 | 0 | CGCCGCGTAC | MF919542 | SRX5726435 |
| BirksAndSocks | CS2 | 77,354 | 58.9 | 1,008 | Y | 110 | 1 | DTR | MG099940 | SRX5726420 |
| Boneham | CS2 | 77,497 | 58.9 | 1,822 | N | 109 | 1 | DTR | MG757155 | SRX5726426 |
| Gorko | CS2 | 75,158 | 59 | 1,064 | Y | 103 | 1 | DTR | MK801728 | SRX5726428 |
| Anamika | CS3 | 73,851 | 59.2 | 907 | Y | 92 | 1 | DTR | MG099935 | SRX5726421 |
| DinoDaryn | CU1 | 44,936 | 66.2 | 1,416 | Y | 82 | 0 | TCCGGGCCGGTA | KY471269 | SRX5726424 |
| Lysidious | CV | 50,948 | 67 | 874 | Y | 83 | 0 | TCTCCGGTGA | MF919521 | SRX5726433 |
| William | CV | 50,678 | 66.5 | 456 | N | 83 | 0 | TCGCCGGTGA | MK801721 | SRX5726430 |
| Coeur | CW2 | 16,223 | 58 | 5,134 | Y | 26 | 0 | ACCCCT | MK801723 | SRX5726423 |
| ThankyouJordi | CZ1 | 53,453 | 66.4 | 4,321 | N | 83 | 0 | CGGCTGGGGA | MK801727 | SRX5726439 |
| Bjanes7 | CZ2 | 46,042 | 66.6 | 2,827 | N | 72 | 0 | TACCAGGGGGA | MG099941 | SRX5726425 |
| Nordenberg | DE1 | 57,572 | 67.3 | 3,348 | Y | 84 | 0 | CP | MH976514 | SRX5726434 |
| BENtherdunthat | DN | 54,867 | 63.4 | 1,622 | Y | 102 | 0 | CTCGGGGCAT | MG099939 | SRX5726422 |
| Pupper | DO | 150,830 | 66.1 | 818 | N | 234 | 1 | CP | MK977695 | SRX5726436 |
| SCentae | DO | 151,316 | 66 | 665 | Y | 233 | 1 | CP | MK977696 | SRX5726438 |
| Yago84 | DR | 61,890 | 70 | 2,874 | Y | 83 | 0 | CP | MK801725 | SRX5726429 |
| Forza | DS | 114,174 | 53.2 | 511 | Y | 164 | 28 | DTR | MK814760 | SRX5726427 |
| Reyja | Singleton | 41,500 | 67.4 | 2,021 | N | 65 | 0 | TCCGGAGGTA | MK814759 | SRX5726437 |
Genome ends are 3′ single-stranded overhangs (overhang sequence is listed), direct terminal repeats (DTR), or circularly permuted (CP).
CDSs, coding sequences.
Y, yes; N, no.
The 20 phage genomes were assigned cluster and subcluster designations using previously described criteria (3). In brief, genomes with greater than 35% shared gene content with these or previously sequenced actinobacteriophage genomes grouped in the same cluster, and average nucleotide identity values were used to divide clusters into subclusters (3, 13). Of the 20 newly sequenced phages, 13 were assigned into previously designated clusters (Table 1) (3, 14). Six of the 20 phages formed parts of five new clusters (clusters DE, DN, DO, DR, and DS; Table 1), and one phage (Reyja) is a singleton with no close relatives (Table 1). The genomes vary substantially in length from 16 kb (Coeur) to 151 kb (SCentae), with many different genome sizes among them (Table 1), and there is a wide variation in G+C contents (from 53% to 70% in Forza and Yago84, respectively; Table 1). Although most of the 20 phages do not contain any tRNA genes, phage Forza (cluster DS) has 28 tRNA genes, the most for any Gordonia phage reported to date (phagesdb.org) (3).
Most of the Gordonia phages have siphoviral morphologies. However, phages Pupper and SCentae (cluster DO) are myoviruses with virion morphologies and genome architectures similar to those of the cluster C mycobacteriophages. These are the first known contractile-tailed Gordonia phages, but they differ from the cluster C phages—which code for >30 tRNA—in lacking any tRNA genes (phagesdb.org) (13).
Data availability.
Sequence reads are deposited at NCBI under BioProject accession number PRJNA488469 and SRA accession numbers SRX5726419 to SRX5726440 (Table 1).
ACKNOWLEDGMENTS
We thank Beckie Bortz, Emily Furbee, and Sarah Grubb for assistance with course instruction and implementation, the Pitt PHIRE program, and the HHMI SEA-PHAGES program for support. Consortium membership for University of Pittsburgh SEA-PHAGES is available at https://seaphages.org/media/docs/Pitt_SEA-PHAGES_Consortium_authors.xlsx.
This work was supported by Howard Hughes Medical Institute grant 54308198 to G.F.H.
REFERENCES
- 1.Hendrix RW. 2002. Bacteriophages: evolution of the majority. Theor Popul Biol 61:471–480. doi: 10.1006/tpbi.2002.1590. [DOI] [PubMed] [Google Scholar]
- 2.Andalibi F, Fatahi-Bafghi M. 2017. Gordonia: isolation and identification in clinical samples and role in biotechnology. Folia Microbiol (Praha) 62:245–252. doi: 10.1007/s12223-017-0491-1. [DOI] [PubMed] [Google Scholar]
- 3.Pope WH, Mavrich TN, Garlena RA, Guerrero-Bustamante CA, Jacobs-Sera D, Montgomery MT, Russell DA, Warner MH, Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) , Hatfull GF. 2017. Bacteriophages of Gordonia spp. display a spectrum of diversity and genetic relationships. mBio 8:e01069-17. doi: 10.1128/mBio.01069-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Russell DA. 2018. Sequencing, assembling, and finishing complete bacteriophage genomes. Methods Mol Biol 1681:109–125. doi: 10.1007/978-1-4939-7343-9_9. [DOI] [PubMed] [Google Scholar]
- 5.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Besemer J, Borodovsky M. 2005. GeneMark: Web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res 33:W451–W454. doi: 10.1093/nar/gki487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395. doi: 10.1186/1471-2105-12-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic Local Alignment Search Tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 11.Soding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248. doi: 10.1093/nar/gki408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pope WH, Jacobs-Sera D. 2018. Annotation of bacteriophage genome sequences using DNA master: an overview. Methods Mol Biol 1681:217–229. doi: 10.1007/978-1-4939-7343-9_16. [DOI] [PubMed] [Google Scholar]
- 13.Hatfull GF, Jacobs-Sera D, Lawrence JG, Pope WH, Russell DA, Ko C-C, Weber RJ, Patel MC, Germane KL, Edgar RH, Hoyte NN, Bowman CA, Tantoco AT, Paladin EC, Myers MS, Smith AL, Grace MS, Pham TT, O’Brien MB, Vogelsberger AM, Hryckowian AJ, Wynalek JL, Donis-Keller H, Bogel MW, Peebles CL, Cresawn SG, Hendrix RW. 2010. Comparative genomic analysis of 60 mycobacteriophage genomes: genome clustering, gene acquisition, and gene size. J Mol Biol 397:119–143. doi: 10.1016/j.jmb.2010.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Pope WH, Montgomery MT, Bonilla JA, Dejong R, Garlena RA, Guerrero Bustamante C, Klyczek KK, Russell DA, Wertz JT, Jacobs-Sera D, Hatfull GF. 2017. Complete genome sequences of 38 Gordonia sp. bacteriophages Genome Announc 5:e01143-16. doi: 10.1128/genomeA.01143-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Sequence reads are deposited at NCBI under BioProject accession number PRJNA488469 and SRA accession numbers SRX5726419 to SRX5726440 (Table 1).
