We report the complete genome sequence of P22-like Salmonella enterica serovar Typhimurium phage MG40, whose prophage repressor specificity is different from that of other known temperate phages.
ABSTRACT
We report the complete genome sequence of P22-like Salmonella enterica serovar Typhimurium phage MG40, whose prophage repressor specificity is different from that of other known temperate phages.
ANNOUNCEMENT
MG40 was isolated from human stool in the mid-1960s and is a short-tailed double-stranded DNA generalized transducing phage (1). Wild-type MG40 was obtained from David Botstein and propagated on Salmonella enterica DB7000 (2) grown in LB broth (3) at 37°C. MG40 DNA was purified from CsCl gradient-purified virions by the method described by Casjens and Gilcrease (4). An Illumina TruSeq library was prepared using a TruSeq DNA PCR-free HT library preparation kit and sequenced using Illumina MiSeq 150-bp paired-end run methodology with a 350-bp insert library. The reads obtained were quality controlled using FastQC (www.bioinformatics.babraham.ac.uk/projects/fastqc). Geneious v9.0.5 was used for trimming and assembly of the reads; not including trimming and including trimming both gave identical sequences, and the Geneious de novo assembly program was used to assemble the sequence (5). A single circular contig with 373.3× mean coverage was obtained. Circular sequence assembly is expected for headful packaging phages (6, 7). The genome was annotated using the annotation pipeline hosted by the Center for Phage Technology (https://cpt.tamu.edu/galaxy-pub). All tools are hosted in the Galaxy and Web Apollo platforms and, unless otherwise stated, were executed using default parameters (8, 9). No tRNAs were detected using ARAGORN v2.36 (10). Rho-independent transcription termination sites were annotated using TransTermHP v2.09 (11). GLIMMER v3.0 and MetaGeneAnnotator v1.0 were used to predict protein-coding genes (12, 13). The prediction of gene functions was facilitated by InterProScan v5.33-72, TMHMM v2.0, LipoP v1.0, and BLAST v2.2.31 searches against the NCBI nonredundant, UniProtKB, Swiss-Prot, and TrEMBL databases (14–18).
The MG40 genome is 40,315 bp long (47.2% G+C content), and we annotated 72 genes in its chromosome. MG40 is relatively unstudied, but its virions are essentially indistinguishable from those of P22 by negative-stain electron microscopy (1), and we found that its virion assembly genes are indeed similar to those of P22 (GenBank accession no. BK000583); however, many of its other genes are quite different, and its genome is mosaically related to that of P22. MG40 was shown to have different repressor specificity and the same host chromosome integration site as P22 (1), and its repressor gene is very different from those of the studied P22-like phages, while its integrase is nearly identical to that of P22. MG40 has only 156 bp in place of the ∼3-kbp P22 immI region, and its genome encodes a putative prophage-expressed O-antigen rhamnoseacetylase that is 92% identical to the characterized homologue of P22-like phage BTP1 (19); the P22 genome encodes three proteins (GtrA, GtrB, and GtrC) that add glucose to the O-antigen at this genome location. Curiously, although it infects S. enterica serovar Typhimurium and, like P22, requires O-antigen for infection (1), the C-terminal 464-amino-acid region of the receptor-binding domain of its tailspike is only 33% identical to that region of the phage P22 tailspike, while it is 99.6% identical to the putative phage SPN9TCW tailspike (GenBank accession no. JQ691610). Both phages infect S. enterica serovar Typhimurium, but SPN9TCW is a member of the ε15-like cluster of phages (20), which have a number of differences from the P22-like phages.
Data availability.
The genome sequence and associated data for the phage MG40 genome are available in GenBank under accession no. MT774487, BioProject no. PRJNA646767, SRA accession no. SRR12282813, and BioSample no. SAMN15565527.
ACKNOWLEDGMENTS
We thank David Botstein for phage MG40.
This work was supported by NIH grant GM114817 (to S.R.C.). The University of Utah High-Throughput Genomics Core Facility was supported by grant P30CA042014 from the National Cancer Institute.
REFERENCES
- 1.Grabner M, Hartman PE. 1968. MG40 phage, a transducing phage related to P22. Virology 34:521–530. doi: 10.1016/0042-6822(68)90071-8. [DOI] [PubMed] [Google Scholar]
- 2.Winston F, Botstein D, Miller JH. 1979. Characterization of amber and ochre suppressors in Salmonella typhimurium. J Bacteriol 137:433–439. doi: 10.1128/JB.137.1.433-439.1979. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Chan R, Botstein D. 1972. Genetics of bacteriophage P22: I. Isolation of prophage deletions which affect immunity to superinfection. Virology 49:257–267. doi: 10.1016/s0042-6822(72)80027-8. [DOI] [PubMed] [Google Scholar]
- 4.Casjens SR, Gilcrease EB. 2009. Determining DNA packaging strategy by analysis of the termini of the chromosomes in tailed-bacteriophage virions. Methods Mol Biol 502:91–111. doi: 10.1007/978-1-60327-565-1_7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647–1649. doi: 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tye BK, Chan RK, Botstein D. 1974. Packaging of an oversize transducing genome by Salmonella phage P22. J Mol Biol 85:485–500. doi: 10.1016/0022-2836(74)90311-8. [DOI] [PubMed] [Google Scholar]
- 7.Jackson EN, Jackson DA, Deans RJ. 1978. EcoRI analysis of bacteriophage P22 DNA packaging. J Mol Biol 118:365–388. doi: 10.1016/0022-2836(78)90234-6. [DOI] [PubMed] [Google Scholar]
- 8.Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Cech M, Chilton J, Clements D, Coraor N, Gruning BA, Guerler A, Hillman-Jackson J, Hiltemann S, Jalili V, Rasche H, Soranzo N, Goecks J, Taylor J, Nekrutenko A, Blankenberg D. 2018. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res 46:W537–W544. doi: 10.1093/nar/gky379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lee E, Helt GA, Reese JT, Munoz-Torres MC, Childers CP, Buels RM, Stein L, Holmes IH, Elsik CG, Lewis SE. 2013. Web Apollo: a Web-based genomic annotation editing platform. Genome Biol 14:R93. doi: 10.1186/gb-2013-14-8-r93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kingsford CL, Ayanbule K, Salzberg S. 2007. Rapid, accurate, computational discovery of Rho-independent transcription terminators illuminates their relationship to DNA uptake. Genome Biol 8:R22. doi: 10.1186/gb-2007-8-2-r22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Noguchi H, Taniguchi T, Itoh T. 2008. MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes. DNA Res 15:387–396. doi: 10.1093/dnares/dsn027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn AF, Sangrador-Vegas A, Scheremetjew M, Yong SY, Lopez R, Hunter S. 2014. InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236–1240. doi: 10.1093/bioinformatics/btu031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Krogh A, Larsson B, von Heijne G, Sonnhammer EL. 2001. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305:567–580. doi: 10.1006/jmbi.2000.4315. [DOI] [PubMed] [Google Scholar]
- 16.Juncker AS, Willenbrock H, Von Heijne G, Brunak S, Nielsen H, Krogh A. 2003. Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci 12:1652–1662. doi: 10.1110/ps.0303703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.UniProt Consortium. 2018. UniProt: the universal protein knowledgebase. Nucleic Acids Res 46:2699. doi: 10.1093/nar/gky092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kintz E, Davies MR, Hammarlof DL, Canals R, Hinton JC, van der Woude MW. 2015. A BTP1 prophage gene present in invasive non-typhoidal Salmonella determines composition and length of the O-antigen of the lipopolysaccharide. Mol Microbiol 96:263–275. doi: 10.1111/mmi.12933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Grose JH, Casjens S. 2014. Understanding the enormous diversity of bacteriophages: the tailed phages that infect the bacterial family Enterobacteriaceae. Virology 468–470:421–443. doi: 10.1016/j.virol.2014.08.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequence and associated data for the phage MG40 genome are available in GenBank under accession no. MT774487, BioProject no. PRJNA646767, SRA accession no. SRR12282813, and BioSample no. SAMN15565527.
