ABSTRACT
Serratia marcescens is a ubiquitous Gram-negative opportunistic pathogen. This announcement describes the isolation and genome annotation of S. marcescens T5-like siphophage Slocum. Terminal repeats, 170 protein-coding genes, and 23 tRNAs were predicted in the 112,436-bp Slocum genome.
ANNOUNCEMENT
Serratia marcescens is a Gram-negative bacillus that is ubiquitous throughout nature and is a member of the family Enterobacteriaceae (1). S. marcescens causes infections in immunocompromised patients and infants and was recently reported as a pathogen of several bee species (2, 3). In this study, the S. marcescens-infecting bacteriophage Slocum was isolated and its genome was annotated.
Bacteriophage Slocum originated from filtered (0.2-μm filter) municipal wastewater in Bryan, Texas. The host, S. marcescens strain D1 (no. 8887172; Ward’s Science), was cultured aerobically at 30°C and 37°C in LB (BD), and phages were propagated via the soft-agar overlay method (4). After phages were stained with 2% (wt/vol) uranyl acetate, Slocum morphology was determined by transmission electron microscopy at the Texas A&M University Microscopy and Imaging Center (5). Phage genomic DNA was purified with the modified Promega Wizard DNA clean-up system shotgun library preparation protocol described by Summer (6), and then an Illumina TruSeq Nano low-throughput kit was used to prepare a library for Illumina MiSeq sequencing with paired-end 250-bp reads, using a 500-cycle v2 kit. Quality control of 2,820,474 total sequenced reads was performed using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc). After trimming was performed using the FastX Toolkit v0.0.14 (http://hannonlab.cshl.edu/fastx_toolkit), the raw Slocum contig was assembled using SPAdes v3.5.0, with default parameters, to 409.3-fold coverage (7). PCR from the contig ends (forward, 5′-CCGTTGTTGCACAAGATGAAG-3′; reverse, 5′-ACTAGGGTATGCCTAAGAGGAAA-3′) and Sanger sequencing of the products were used to verify a complete and accurate sequence. Structural annotations relied on gene calling by MetaGeneAnnotator v1.0 and GLIMMER v3.0 for proteins and ARAGORN v2.36 for tRNAs (8–10). Rho-independent termination sites were annotated using TransTermHP v2.09 (11). Functional predictions of genes were made using InterProScan v5.33-72, TMHMM v2.0, and BLAST v2.2.31 with NCBI nonredundant and UniProtKB Swiss-Prot/TrEMBL databases, with a maximum expectation value of 0.001 (12–15). Additionally, coding sequences were analyzed for lipoprotein signal peptides with LipoP v1.0 (16).
Structural predictions were performed with the HH-suite v3.0 tool HHpred (multiple sequence alignment generation with HHblits using the uniclust30_2018_08 database and modeling with the PDB_mmCIF70 database) (17). Genome-wide DNA sequence similarity for Slocum was calculated using progressiveMauve v2.4.0 (18). The genome termini were predicted with PhageTerm (19). All listed annotation tools and their outputs (with the exception of HHpred) are available in the Galaxy and Web Apollo instances hosted at the Center for Phage Technology (CPT) (https://cpt.tamu.edu/galaxy-pub) (20, 21). Unless otherwise stated, all tools were executed using default parameters.
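As an illustration of the expectation-value cutoff used for the BLAST searches, the following minimal Python sketch filters BLAST tabular output (the standard 12-column `-outfmt 6` layout, in which the E value is the 11th column) at a maximum of 0.001. It is not the workflow run in the CPT Galaxy instance; the function name `filter_hits` and the query and subject identifiers in the example rows are hypothetical.

```python
import csv
import io

MAX_EVALUE = 0.001  # maximum expectation value stated in the text


def filter_hits(tabular_text, max_evalue=MAX_EVALUE):
    """Keep BLAST tabular (-outfmt 6) rows whose E value is <= max_evalue.

    Returns a list of (query_id, subject_id, percent_identity, evalue) tuples.
    """
    kept = []
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if len(row) < 12:
            continue  # skip blank or malformed lines
        evalue = float(row[10])  # column 11 of the standard layout
        if evalue <= max_evalue:
            kept.append((row[0], row[1], float(row[2]), evalue))
    return kept


# Hypothetical example rows; the identifiers and scores are invented.
example = "\n".join([
    "gp001\tsubject_A\t45.2\t210\t100\t3\t1\t210\t5\t214\t1e-40\t150",
    "gp001\tsubject_B\t28.0\t50\t36\t0\t10\t59\t3\t52\t0.5\t30.1",
])
hits = filter_hits(example)
```

Only the first example row passes the cutoff, because its E value (1e-40) is below 0.001 whereas the second (0.5) is not.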
The phage Slocum genome consists of 112,436 bp of double-stranded DNA, with a G+C content of 44.8%. With 170 predicted protein-coding genes and 23 predicted tRNA genes, the genome protein coding density is 88%. Additionally, 12,521-bp predicted direct terminal repeats were used as the boundary for genome reopening. At the amino acid level, phage Slocum shares 81 similar proteins with Escherichia phage T5 (GenBank accession no. NC_005859) and T5-like Escherichia phage slur09 (GenBank accession no. LN887948), with 27 to 29% nucleotide identity to slur09 and several Salmonella phages, fragmented across the genome.
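For readers who wish to recompute summary statistics of this kind from the deposited record, the following Python sketch shows one common way to calculate G+C content and coding density. The announcement does not state how coding density was computed, so the convention used here (bases covered by at least one feature, with overlaps counted once, divided by genome length) is an assumption, as are the function names and the toy inputs.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def coding_density(genome_length, features):
    """Return the percentage of the genome covered by at least one feature.

    features: iterable of (start, end) pairs, 1-based and inclusive.
    Overlapping features are counted once (an assumed convention).
    """
    covered = 0
    current_end = 0  # last base already counted
    for start, end in sorted(features):
        start = max(start, current_end + 1)  # skip bases counted earlier
        if end >= start:
            covered += end - start + 1
            current_end = end
    return 100.0 * covered / genome_length


# Toy inputs, not Slocum data.
toy_gc = gc_content("ATGCGC")
toy_density = coding_density(50, [(1, 10), (5, 20), (31, 40)])
```

In the toy example, the two overlapping features span bases 1 to 20 and the third spans bases 31 to 40, so 30 of 50 bases are covered.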
Data availability.
The genome sequence and associated data for phage Slocum were deposited under GenBank accession no. MN095770, BioProject accession no. PRJNA222858, SRA accession no. SRR8892204, and BioSample accession no. SAMN11411459.
ACKNOWLEDGMENTS
This work was supported by funding from the National Science Foundation (grant DBI-1565146). Additional support came from the CPT, an Initial University Multidisciplinary Research Initiative supported by Texas A&M University and Texas AgriLife, and from the Texas A&M University Department of Biochemistry and Biophysics.
We are grateful for the advice and support of the CPT staff.
This announcement was prepared in partial fulfillment of the requirements for BICH464 Bacteriophage Genomics, an undergraduate course at Texas A&M University.
REFERENCES
- 1. Mahlen SD. 2011. Serratia infections: from military experiments to current practice. Clin Microbiol Rev 24:755–791. doi: 10.1128/CMR.00017-11.
- 2. Cristina ML, Sartini M, Spagnolo AM. 2019. Serratia marcescens infections in neonatal intensive care units (NICUs). Int J Environ Res Public Health 16:610. doi: 10.3390/ijerph16040610.
- 3. Funfhaus A, Ebeling J, Genersch E. 2018. Bacterial pathogens of bees. Curr Opin Insect Sci 26:89–96. doi: 10.1016/j.cois.2018.02.008.
- 4. Adams M. 1959. Bacteriophages. Interscience Publishers, New York, NY.
- 5. Valentine RC, Shapiro BM, Stadtman ER. 1968. Regulation of glutamine synthetase. XII. Electron microscopy of the enzyme from Escherichia coli. Biochemistry 7:2143–2152. doi: 10.1021/bi00846a017.
- 6. Summer EJ. 2009. Preparation of a phage DNA fragment library for whole genome shotgun sequencing. Methods Mol Biol 502:27–46. doi: 10.1007/978-1-60327-565-1_4.
- 7. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021.
- 8. Noguchi H, Taniguchi T, Itoh T. 2008. MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes. DNA Res 15:387–396. doi: 10.1093/dnares/dsn027.
- 9. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636.
- 10. Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152.
- 11. Kingsford C, Ayanbule K, Salzberg S. 2007. Rapid, accurate, computational discovery of rho-independent transcription terminators illuminates their relationship to DNA uptake. Genome Biol 8:R22. doi: 10.1186/gb-2007-8-2-r22.
- 12. Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn AF, Sangrador-Vegas A, Scheremetjew M, Yong SY, Lopez R, Hunter S. 2014. InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236–1240. doi: 10.1093/bioinformatics/btu031.
- 13. Krogh A, Larsson B, von Heijne G, Sonnhammer EL. 2001. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305:567–580. doi: 10.1006/jmbi.2000.4315.
- 14. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421.
- 15. The UniProt Consortium. 2019. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res 47:D506–D515. doi: 10.1093/nar/gky1049.
- 16. Juncker AS, Willenbrock H, Von Heijne G, Brunak S, Nielsen H, Krogh A. 2003. Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci 12:1652–1662. doi: 10.1110/ps.0303703.
- 17. Zimmermann L, Stephens A, Nam SZ, Rau D, Kubler J, Lozajic M, Gabler F, Soding J, Lupas AN, Alva V. 2018. A completely reimplemented MPI bioinformatics Toolkit with a new HHpred server at its core. J Mol Biol 430:2237–2243. doi: 10.1016/j.jmb.2017.12.007.
- 18. Darling AE, Mau B, Perna NT. 2010. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5:e11147. doi: 10.1371/journal.pone.0011147.
- 19. Garneau JR, Depardieu F, Fortier LC, Bikard D, Monot M. 2017. PhageTerm: a tool for fast and accurate determination of phage termini and packaging mechanism using next-generation sequencing data. Sci Rep 7:8292. doi: 10.1038/s41598-017-07910-5.
- 20. Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Cech M, Chilton J, Clements D, Coraor N, Gruning BA, Guerler A, Hillman-Jackson J, Hiltemann S, Jalili V, Rasche H, Soranzo N, Goecks J, Taylor J, Nekrutenko A, Blankenberg D. 2018. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res 46:W537–W544. doi: 10.1093/nar/gky379.
- 21. Lee E, Helt GA, Reese JT, Munoz-Torres MC, Childers CP, Buels RM, Stein L, Holmes IH, Elsik CG, Lewis SE. 2013. Web Apollo: a Web-based genomic annotation editing platform. Genome Biol 14:R93. doi: 10.1186/gb-2013-14-8-r93.
