Draft Whole-Genome Sequences of 25 Salmonella enterica Strains Representing 24 Serovars

Catherine Yoshida; Stephanie L Brumwell; Erika J Lingohr; Aaminah Ahmad; Travis M Blimkie; Benjamin A Kogan; Jessica Pilsworth; Muhammad A Rehman; Krista L Schleicher; Jenitta Shanmugaraj; Andrew M Kropinski; John H E Nash

doi:10.1128/genomeA.01718-15

. 2016 Mar 3;4(2):e01718-15. doi: 10.1128/genomeA.01718-15

Draft Whole-Genome Sequences of 25 Salmonella enterica Strains Representing 24 Serovars

Catherine Yoshida ^a,^✉, Stephanie L Brumwell ^a, Erika J Lingohr ^a, Aaminah Ahmad ^a, Travis M Blimkie ^a, Benjamin A Kogan ^a, Jessica Pilsworth ^a, Muhammad A Rehman ^a, Krista L Schleicher ^a, Jenitta Shanmugaraj ^a, Andrew M Kropinski ^b, John H E Nash ^a,^b

PMCID: PMC4777767 PMID: 26941156

Abstract

We report the draft genome sequences of 25 Salmonella enterica strains representing 24 different serotypes, many of which were not available in public repositories during our selection process. These draft genomes will provide useful reference for the genetic variation between serotypes and aid in the development of molecular typing tools.

GENOME ANNOUNCEMENT

Salmonella spp. are Gram-negative bacteria commonly responsible for human diarrheal illness through the contamination of foods including meats and produce. There are an estimated 1.7 million incidents of salmonellosis each year in North America alone, with over 80 million occurring globally (1). Having the ability to attribute illness to specific serotypes and subtypes can aid outbreak detection and source tracking. There is a momentum toward the acceptance and integration of new molecular methods for pathogen detection and classification. This places great importance on the availability of unique whole-genome sequences to accurately differentiate serotypes and subtypes and to validate methodologies. Genome sequences facilitate advancements in the detection and characterization of Salmonella serotypes, and those presented here contributed to the development of the Salmonella genoserotyping array (SGSA), a molecular-based serotyping assay (2), as well as a web-based platform for in silico serotyping of draft genome assemblies of Salmonella spp. (https://lfz.corefacility.ca/sistr-app; 3). The SGSA is able to provide the antigenic formula and serovar, preserving the nomenclature of legacy data gathered using traditional serotyping methods.

When this study began, there were numerous Salmonella serogroups and serovars not represented in GenBank, and isolates were selected to create a more complete public data set and expand the identification capacity of the SGSA. While this manuscript was being readied for publication, several of these serogroups and/or serovars became publicly available; however these will contribute to the accumulation of multiple representative genomes for the development of molecular typing tools. As of 10 July 2015, this is the first assembly and publication of whole-genome sequences for three Salmonella serogroups, O:43(U), O:56, and O:63, and 10 serovars: Blegdam, Fresno, Hillingdon, Itami, Milwaukee, Moscow, Virginia, Weslaco, subsp. salamae (II) 56:z₁₀:e,n,x, and subsp. arizonae (IIIa) 63:g,z₅₁:-.

The genomic DNA required for sequencing was extracted from Salmonella isolates grown overnight at 37°C on Luria-Bertani agar (BD Canada, Mississauga, ON, Canada) using an EZ1 DNA tissue kit (Qiagen Ltd., Mississauga, ON, Canada). Three technologies were used for sequencing: Roche’s 454 GS-FLX Titanium, obtaining an average coverage of 40-fold; Illumina HiSeq 2000 with TruSeq sample preparation of 2 × 100 paired-end runs, obtaining an average coverage of 90-fold; and Illumina MiSeq with TruSeq sample preparation of 2 × 251 paired-end runs, obtaining an average coverage of 90-fold. Of the 25 draft genomes, 20 isolates were sequenced using both Illumina and 454 technologies, while five were sequenced by 454 only. Mira assembler version 4.0 (4) was used to assemble the reads into contigs, which were then proofread and corrected in the program Gap5 of the Staden software package (5). The genomes were annotated using the NCBI Prokaryotic Genome Annotation Pipeline (http://www.ncbi.nlm.nih.gov/genome/annotation_prok).

These draft genomes can provide useful information for the development and validation of molecular diagnostic tools, as well as other research activities.

Nucleotide sequence accession numbers.

The draft genome sequences for these 25 Salmonella isolates have been deposited in DDBJ/ENA/GenBank under Bioproject no. PRJNA294295. The GenBank accession numbers are listed in Table 1. The raw sequence data are available in the Sequence Read Archive (SRA).

TABLE 1 .

Salmonella strains sequenced in this study

Serovar	Antigenic formula	Isolate no.	Sequencing method(s)^a	Accession no.
Blegdam	9,12:g,m,q: -	SA20065575	1	LHSP00000000
Enteritidis	1,9,12:g,m,p: -	SA20093032	1, 2	LHSQ00000000
Enteritidis	1,9,12:f,g,m,t: -	SA20103550	1, 2	LHSR00000000
Fresno	9,46:z₃₈: -	ST224	1, 2	LHSS00000000
Gallinarum	1,9,12: -: -	ST572	1, 2	LHST00000000
Gaminara	16:d: 1,7	SA20063285	1, 2	LHSU00000000
Hadar	6,8:z₁₀:e,n,x	SA20026260	1, 2	LHSV00000000
Hillingdon	9,46:g,m: -	S01-0588	1	LHSX00000000
Hvittingfoss	16:b:e,n,x	SA20014981	1, 2	LHSW00000000
Itami	9,12: l,z₁₃: 1,5	SA20014991	1, 3	LHSY00000000
Johannesburg	40:b:e,n,x	SA20025782	1	LHSZ00000000
Kentucky	8,20:i:z₆	SA20030505	1, 2	LHTA00000000
Manhattan	6,8:d: 1,5	SA20034532	1, 2	LHTB00000000
Milwaukee	43:f,g,[t]: -	SA19950795	1, 2	LHTC00000000
Moscow	9,12:g,q: -	SA20061414	1	LIXO00000000
Newport	6,8,20:e,h: 1,2	L0167	1	LIXP00000000
Panama	1,9,12:l,v: 1,5	SA20030878	1, 3	LHTD00000000
Paratyphi_A	2,12:a: -	SA19950809	1, 2	LHTE00000000
Pullorum	1,9,12: -: -	S4037-07	1, 2	LHTF00000000
Rubislaw	11:r:e,n,x	SA20030553	1, 2	LHTG00000000
subsp. II 56:z10:e,n,x	56:z₁₀:e,n,x	1369–1373	1, 2	LHTH00000000
subsp. IIIa 62:z36: -	62:z₃₆: -	5335/86	1, 2	LHTK00000000
subsp. IIIa 63:g,z51: -	63:g,z₅₁: -	So 20/20	1, 2	LHTL00000000
Virginia	8:d: 1,2	SA19971529	1, 2	LHTI00000000
Weslaco	42:z₃₆: -	247K	1, 3	LHTJ00000000

Open in a new tab

^{^a}

1 = Roche 454 GS-FLX titanium; 2 = Illumina MiSeq; 3 = Illumina HiSeq.

ACKNOWLEDGMENTS

This work was funded in part by the Genome Research and Development Initiative through the Office of Biotechnology and Science, Health Canada.

We sincerely thank the following for providing isolates: Roger Johnson, Linda Cole, Betty Wilkie, Ann Perets, Ketna Mistry, OIE Reference Laboratory for Salmonellosis, National Microbiology Laboratory, Public Health Agency of Canada, Guelph, ON, Canada; Vanessa Allen, Anne Maki, Analyn Peralta, Enteric Section, Public Health Ontario Laboratory, Toronto, ON, Canada; Danielle Daignault, National Microbiology Laboratory, Public Health Agency of Canada, St. Hyacinthe, QC, Canada (ASHQ); Martin Cormican, School of Medicine, National University of Ireland Galway, Galway, Ireland; Niall Delappe, National Salmonella, Shigella and Listeria Reference Laboratory, Galway University Hospital, Galway, Ireland; Muna Anjum, Sarah North, Victoria Barrett, Gail Wise, Sam Chappell, Animal and Plant Health Agency, Weybridge, New Haw, Addlestone, Surrey, United Kingdom; Durda Slavic, Animal Health Laboratory, University of Guelph, Guelph, ON, Canada; John Devenish, Canadian Food Inspection Agency, Ottawa, ON, Canada; S.M. Laboratory Services Inc., Longueuil, QC, Canada; Helen Tabor, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada; Sean Byrne, Animal Health Centre, Abbotsford, BC, Canada; Francois-Xavier Weill, Institut Pasteur, Paris, France.

We also thank the NCBI PGAP team for annotation services and the National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada, and McGill University and Génome Québec Innovation Centre, Montréal, QC, Canada, for genome sequencing.

Footnotes

Citation Yoshida C, Brumwell SL, Lingohr EJ, Ahmad A, Blimkie TM, Kogan BA, Pilsworth J, Rehman MA, Schleicher KL, Shanmugaraj J, Kropinski AM, Nash JHE. 2016. Draft whole-genome sequences of 25 Salmonella enterica strains representing 24 serovars. Genome Announc 4(2):e01718-15. doi:10.1128/genomeA.01718-15.

REFERENCES

1.Majowicz SE, Musto J, Scallan E, Angulo FJ, Kirk M, O’Brien SJ, Jones TF, Fazil A, Hoekstra RM. 2010. The global burden of nontyphoidal Salmonella gastroenteritis. Food Safety 50:882–889. doi: 10.1086/650733. [DOI] [PubMed] [Google Scholar]
2.Franklin K, Lingohr EJ, Yoshida C, Anjum M, Bodrossy L, Clark CG, Kropinski AM, Karmali MA. 2011. Rapid genoserotyping tool for classification of Salmonella serovars. J Clin Microbiol 49:2954–2965. doi: 10.1128/JCM.02347-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Yoshida CE, Kruczkiewicz P, Laing CR, Lingohr EJ, Gannon VP, Nash JH, Taboada EN. 2016. The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies. PLoS One 11(1):e0147101. doi: 10.1371/journal.pone.0147101. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Chevreux B, Wetter T, Suhai S. 1999. Genome sequence assembly using trace signals and additional sequence information, p 45–56. In Computer science and biology. Proceedings of the German Conference on Bioinformatics, GCB '99. GCB, Hannover, Germany. [Google Scholar]
5.Bonfield JK, Whitwham A. 2010. Gap5—editing the billion fragment sequence assembly. Bioinformatics 26:1699–1703. doi: 10.1093/bioinformatics/btq268. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B1] 1.Majowicz SE, Musto J, Scallan E, Angulo FJ, Kirk M, O’Brien SJ, Jones TF, Fazil A, Hoekstra RM. 2010. The global burden of nontyphoidal Salmonella gastroenteritis. Food Safety 50:882–889. doi: 10.1086/650733. [DOI] [PubMed] [Google Scholar]

[B2] 2.Franklin K, Lingohr EJ, Yoshida C, Anjum M, Bodrossy L, Clark CG, Kropinski AM, Karmali MA. 2011. Rapid genoserotyping tool for classification of Salmonella serovars. J Clin Microbiol 49:2954–2965. doi: 10.1128/JCM.02347-10. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] 3.Yoshida CE, Kruczkiewicz P, Laing CR, Lingohr EJ, Gannon VP, Nash JH, Taboada EN. 2016. The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies. PLoS One 11(1):e0147101. doi: 10.1371/journal.pone.0147101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4] 4.Chevreux B, Wetter T, Suhai S. 1999. Genome sequence assembly using trace signals and additional sequence information, p 45–56. In Computer science and biology. Proceedings of the German Conference on Bioinformatics, GCB '99. GCB, Hannover, Germany. [Google Scholar]

[B5] 5.Bonfield JK, Whitwham A. 2010. Gap5—editing the billion fragment sequence assembly. Bioinformatics 26:1699–1703. doi: 10.1093/bioinformatics/btq268. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Draft Whole-Genome Sequences of 25 Salmonella enterica Strains Representing 24 Serovars

Catherine Yoshida

Stephanie L Brumwell

Erika J Lingohr

Aaminah Ahmad

Travis M Blimkie

Benjamin A Kogan

Jessica Pilsworth

Muhammad A Rehman

Krista L Schleicher

Jenitta Shanmugaraj

Andrew M Kropinski

John H E Nash

Abstract

GENOME ANNOUNCEMENT

Nucleotide sequence accession numbers.

TABLE 1 .

ACKNOWLEDGMENTS

Footnotes

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Draft Whole-Genome Sequences of 25 Salmonella enterica Strains Representing 24 Serovars

Catherine Yoshida

Stephanie L Brumwell

Erika J Lingohr

Aaminah Ahmad

Travis M Blimkie

Benjamin A Kogan

Jessica Pilsworth

Muhammad A Rehman

Krista L Schleicher

Jenitta Shanmugaraj

Andrew M Kropinski

John H E Nash

Abstract

GENOME ANNOUNCEMENT

Nucleotide sequence accession numbers.

TABLE 1 .

ACKNOWLEDGMENTS

Footnotes

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases