Skip to main content
PLOS One logoLink to PLOS One
. 2021 Apr 6;16(4):e0249079. doi: 10.1371/journal.pone.0249079

Population structure, case clusters, and genetic lesions associated with Canadian Salmonella 4,[5],12:i:- isolates

Clifford G Clark 1,*, Ashley K Kearney 1, Lorelee Tschetter 1, James Robertson 2, Frank Pollari 3, Stephen Parker 3, Gitanjali Arya 2, Kim Ziebell 2, Roger Johnson 2, John Nash 2, Celine Nadon 1
Editor: Patrick Butaye4
PMCID: PMC8049487  PMID: 33822792

Abstract

Monophasic Salmonella 4,[5]:12:i:- are a major public health problem because they are one of the top five Salmonella serotypes isolated from clinical cases globally and because they can carry resistance to multiple antibiotics. A total of 811 Salmonella 4,[5]:12:i:- and S. Typhimurium whole genome sequences (WGS) were generated. The various genetic lesions causing the Salmonella 4,[5]:12:i:- genotype were identified and assessed with regards to their distribution in the population of 811 Salmonella 4,[5]:12:i:- and S. Typhimurium isolates, their geographical and temporal distribution, and their association with non-human sources. Several clades were identified in the population structure, and the largest two were associated almost exclusively with a short prophage insertion and insertion of a mobile element carrying loci encoding antibiotic and mercury resistance. IS26-mediated deletions and fljB point mutants appeared to spread clonally. ‘Inconsistent’ Salmonella 4,[5]:12:i:- isolates associated with specific, single amino acid changes in fljA and hin were found in a single clade composed of water, shellfish, and avian isolates. Inclusion of isolates from different case clusters identified previously by PFGE validated some of the clusters and invalidated others. Some wgMLST clusters of clinical isolates composed of very closely related isolates contained an isolate(s) with a different genetic lesion, suggesting continuing mobility of the implicated element responsible. Such cases may need to be left out of epidemiological investigations until sufficient numbers of isolates are included that statistical significance of association with sources is not impaired. Non-human sources were frequently found in or near clinical case clusters. Prospective surveillance and WGS of non-human sources and retrospective analysis by WGS of isolates from existing culture collections provides data critical for epidemiological investigations of food- and waterborne outbreaks.

Introduction

Foodborne bacteria originate in animal and environmental niches, are amplified or lost during the processing of animals into food, and are ultimately introduced into human populations from food and water [13]. After excretion from infected hosts, including humans, these organisms can be recycled back into the environment, especially through untreated sewage [4], and then into humans or animals. Bacteria may be spread by both symptomatic or asymptomatic hosts. The composition of bacterial populations infecting humans is not static, but frequently follows a pattern of establishment, rise to prominence, and subsequent decline. At the root of these phenomena are populations responding to selective pressures and changes in potential niches for growth and survival, or targeted eradication programs.

The decline of Salmonella enterica serovar Typhimurium in most regions world-wide has been accompanied by increases in the occurrence of both Salmonella Enteritidis and the monophasic Typhimurium variant, Salmonella 4,[5],12:i:- [58]. Salmonella 4,[5],12:i:- is derived from S. Typhimurium from disruptions in production of the second phase flagellar antigen locus (fljAB hin) associated with deletions, interruptions, or point mutations affecting all or part of this locus [810]. S. Typhimurium and Salmonella 4,[5],12:i:- populations appear to be in a dynamic state, with the monophasic variant either continuing to arise from S. Typhimurium or creating new genetic variants near iroB by rearrangement [8, 11], and with a non-zero probability of S. Typhimurium re-acquiring a functional second-phase antigen locus [12]. Characterizing the multiple changes that disrupt the second-phase antigen locus may have practical value in surveillance, in outbreak and trace-back investigations, in public health risk assessments, and in developing interventions to reduce transmission of the organism through the food chain.

Salmonella 4,[5],12:i:- isolates currently constitute a globally distributed group of organisms that show considerable diversity in antimicrobial and heavy metal resistance in addition to changes at the second phase antigen locus [79, 1315]. These organisms infect humans, chicken, cattle, and pigs, and may be found in food for human consumption [8, 16, 17]. Different sub-populations are responsible for human disease in disparate, partially overlapping geographical regions [7, 9, 12, 18], though the realities of our connected world may mean that there will be a certain level of mixture and homogenization over time. Like S. Typhimurium, the monophasic Salmonella 4,[5],12:i:- variants have been associated with outbreaks throughout the world [8]. A critically important sub-population of this organism defines a twenty-first century clonal expansion of Salmonella 4,[5],12:i:- in the UK, the US, Canada, and elsewhere in the world [12, 14, 15, 19]. This population carries SGI-4 (Salmonella Genomic Island-4) with metal resistance determinants and frequently acquires or re-acquires a mercury resistance element (MREL) that also includes antibiotic resistance regions RR1-RR2, and RR3 encoding determinants of the ASSuT phenotype [16, 1921], and both may contribute greatly to adaptive fitness, antimicrobial resistance, and transmission, as well as to its public health risk and emergence [15, 17, 2224]. It is closely associated with porcine, and to a lesser extent turkey and cattle, sources [24], and there is an association with the use of heavy metals as growth promoters in pig production [21, 23, 25].

Methods that facilitate the division of bacterial populations into smaller groups that are more likely to share an epidemiological association can increase the strength of outbreak signals and reduce the number of cases needed to solve the outbreak [26]. New bacterial infections of humans are now investigated using only whole genome sequencing (WGS), while much of the available historical data may have been created using PFGE or some other typing method for analysis. This can create some difficulties in data interpretation, but also allows a more systematic evaluation of the value of different analysis methods for informing epidemiological investigations [27]. Phylogenetic analysis can reveal underlying epidemiological processes and assist hypothesis generation and source attribution in outbreak investigations; it is preferred to previous methods of grouping or clustering Salmonella [28]. As has been well documented previously, whole genome sequence provides additional information compared with PFGE, and in some cases there is only partial concordance of PFGE types with the phylogenetic structure of the population [2932].

Because of the global public health importance of Salmonella 4,[5],12:i:- and the unique genetic mechanisms involved in creation and propagation of this S. Typhimurium variant, we have chosen to subject 811 Canadian clinical and non-human isolates to detailed phylogenetic analysis. In previous work [19] we used WGS to evaluate the sequences and phylogenetic placement of elements conferring heavy metal and antimicrobial resistance in Canadian and global Salmonella 4,[5],12:i:- isolates, as well as the genetic lesions associated with introduction of the Mercury Resistance Element (MREL). The phylogenetic population thus inferred contains 4 large and 8 smaller clades, with the majority of isolates with heavy metal ASSuT resistance occupying only one clade (clade IV) within the overall population.

The current work identifies the nature of genetic lesions other than the MREL that result in loss of, or mutation within, the genes (fljAB hin) of the second-phase flagellar locus that cause the monophasic phenotype in Canadian isolates. We have further mapped the distribution of these genes onto the population structure of the organism. The population structure has been developed using case clusters obtained historically by pulsed-field gel electrophoresis (PFGE), which have been compared with clusters based on whole genome multi-locus sequence typing (wgMLST) to assess the concordance of these results and determine what value may accrue to historical data. Many PFGE clusters are not well supported by the phylogenetic structure of the population. As a consequence, the use of phylogenetic analysis based on whole genome sequence analysis (WGS) is expected to result in better, cheaper, faster, and easier public health surveillance for this organism.

Materials and methods

Isolates, growth conditions, DNA preparation, and sequencing

Human and non-human isolates of Salmonella 4,[5],12:i:- and S. Typhimurium were obtained from Canadian provincial public health laboratories (via PulseNet Canada), FoodNet Canada, the Canadian Integrated Program for Antimicrobial Resistance Surveillance (CIPARS), the Canadian Food Inspection Agency (CFIA), Agriculture Canada, and provincial agriculture, veterinary, and animal health laboratories according to the selection criteria previously described [19]. Permission for use of the human isolates and associated metadata was granted by the PulseNet Canada Steering Committee and provincial public health laboratories. Isolates from water were provided by Dr. V.P.J. Gannon. Isolate numbers were replaced with randomly generated numbers for publication to comply with privacy regulations. Selected metadata for isolates, including those newly used in this study may be found in S1 Spreadsheet.

Isolates were stored and grown as previously described [19]. Clinical isolates were chosen on the basis of case clusters previously identified by Pulse Net Canada using PFGE, and additional isolates were included to provide insight into the background clinical isolates that were not identified as belonging to case clusters. PFGE was done by PulseNet Canada laboratories (provincial public health laboratories and the Canadian National Microbiology Laboratory) using the PulseNet International standard methods [33] and banding patterns were analyzed using Bionumerics software (Applied Maths). DNA isolation was carried out either with Epicentre Metagenomic DNA Isolation Kit for Water (Illumina) or the DNeasy® 96 Blood & Tissue Isolation Kit (Qiagen). Isolated DNA was sent to the Genomics Core facility at the NML for quantitation and dilution, and sample libraries were prepared using a MiSeq Nextera® XT DNA library preparation kit (Illumina). Whole genome sequencing was performed by 150 bp paired-end read sequencing on the Illumina NextSeq platform using NextSeq 500/550 Mid Output kits and sequence reads were deposited in the appropriate IRIDA Platform Beta Release databases [34]. All sequence data were deposited to the NCBI sequence read archive, with the relevant identifiers included in S1 Spreadsheet.

DNA sequence quality, assembly, and annotation

Sequence reads were assembled into contigs using the SPAdes assembler (v3.0) [35]. Contigs smaller than 1 kb and with average genome coverage less than 15× were filtered and removed from the analysis. Draft genome sequences were annotated using PROKKA [36]. Sequence and assembly quality were assessed using QUAST in IRIDA and the quality tools in Bionumerics v. 7.6.2 (Applied Maths). Quality measures for assembled genomes can be found in S2 Spreadsheet. SISTR (the Salmonella In Silico Typing Resource) [37] was used to validate the serotype data, though it was not useful for isolates with point mutants or deletions in fljA or fljB loci.

DNA sequence analysis, metadata, and bioinformatics

Closed genomes [38, 39] were used as references for analysis. Whole genome MLST (wgMLST) is the current method used by PulseNet International for analysis of WGS data [26], and was therefore used to create wgMLST UPGMA and Minimum Spanning Tree (MST) dendrograms in BioNumerics. Default PulseNet parameters were used for the analysis; for the UPGMA tree Categorical (values) and UPGMA were selected. We previously found SNP-based Maximum-Likelihood trees to be highly concordant with wgMLST UPGMA trees [19] and consider this to provide validation for use of the UPGMA dendrograms and MSTs based on them in this work. wgMLST clusters were defined for this manuscript as groups of isolates that were distinguishable from the background of isolates in the UPGMA dendrogram or within ~10–12 alleles of each other [27] in BioNumerics wgMLST analysis. The term “clade” was only used for the first level of branching within the UPGMA dendrogram, but was not applied to single branches originating from the highest level of difference (at the farthest left or the greatest distance in dendrograms). We used the term “sub-clade” to refer to groups of isolates arising from a higher level (more distant genetically) clade or sub-clade. Case clusters were assigned on the basis of PFGE identity by PulseNet Canada, which curated all isolate data and metadata arriving at the NML.

GView server (https://server.gview.ca) was used for data visualizations in the form of displaying genome features or BLAST-based pangenome analysis. GView [40] is an application that uses.gbk or.fasta sequence files for: genome mapping and visualization of individual genomes (Display genome features) or comparison of multiple genomes against a reference closed genome to determine the presence/absence and % identity of genes and proteins in BLAST-based analysis (Pangenome Analysis). Verification of features occurring in these visualizations, eg. Salmonella serotype Typhimurium vs Salmonella 4,[5]:12:i:-, was checked manually in.gbk files of the relevant genomes. Figures were prepared from BioNumerics or GView output files using Adobe Illustrator CC 2018.

Results and discussion

The population structure obtained by WGS provides a framework for understanding the organisms and informing public health investigations

In previous work we determined that Salmonella 4,[5],12:i:- and S. Typhimurium isolates assessed in this project could be classified roughly by cgMLST into four large clades (I-IV) and eight smaller clades (V-XII), with a smaller number of isolates present on additional single branches of both Maximum Likelihood SNP trees and wgMLST MSTs [19]; however, only clade IV isolates were characterized in detail. Here we describe the distribution of clades, sub-clades, and clusters using the criteria described in the Materials and Methods on the same set of isolates discussed previously plus an additional 13 water isolates (Fig 1, S1 Fig). A total of 34 clusters are present in the MST, and the clade structure is congruent with that previously described except for the splitting of clade I into clades IA and IB. The UPGMA dendrograms also exhibited the same two clades. This was different than the dendrogram of Canadian isolates presented in earlier work [19] but similar to the dendrogram of Canadian, US, and European isolates, suggesting that the use of additional isolates in dendrogram construction affected the analysis and that dendrogram topology is quite sensitive to the number and genomic composition of isolates used. We found UPGMA dendrograms to be very similar to Maximum Likelihood trees created using SNPs [19], so that the MSTs and UPGMA dendrograms used in this work should provide useful visualizations of the data. However, isolates from distant UPGMA clusters 2, 5, and 8 were found in clade IB, suggesting an artifact associated with MST construction has also been introduced. Both draft and closed genomes were used in this analysis to evaluate allele differences between each pair of genomes, so please note that there are duplicate entries for a small number of isolates.

Fig 1. MST showing the clusters of isolates differing by approximately 12 alleles of fewer.

Fig 1

A full wgMLST UPGMA dendrogram (S1 Fig) was used to create this MST using BioNumerics 7.6.3.

Clusters of closely related isolates (13 allele differences or fewer) were present in clades IA & B (6), clade II (7), clade III (2), clade IV (11), clade V (1), clade VI (3), clade VIII (1), clade XI (1), and clade XII (1). Some of these were the result of sampling the same non-human source within the same geographical location eg. (cluster 3, water, Québec, 2011; cluster 7, chicken, British Columbia, 2013). While some UPGMA clusters were also temporally clustered (eg. clusters 1, 3, 7, 32), many included isolates from different years, suggesting these isolates were members of common or long-lasting lineages (S1 Fig, Table 1). Since clinical isolates were chosen on the basis of case clusters previously identified by PulseNet Canada using PFGE, the MST/UPGMA clusters were compared with PFGE clusters (S1 Fig). It is important to note that only a fraction of possible isolates, frequently as few as four, were included from most PFGE case clusters, making it more difficult to infer much about the nature of these clusters. Consistent with PulseNet Canada practice, background isolates were also obtained for comparison with isolates in PFGE clusters at the time of the cluster and during 60 day windows before the beginning and after the end of the cluster. This provided a reasonable representation of isolates in the Salmonella 4,[5],12:i:- population throughout the sampling period of 2008–2016 (Fig 2).

Table 1. Description of wgMLST UPGMA clusters.

Cluster Isolates Provinces Non-human isolates in cluster Years
1 6 NB none 2010
2 9 ON, NL chicken 2012–2015
3 8 QC porcine 2011
4 5 BC, MB, ON none 2009–2016
5 6 QC porcine 2008–2009
6 6 SK, MB, QC none 2008
7 5 BC chicken 2013
8 34 BC, AB, SK, MB, ON, QC, PE bovine, chicken 2008–2009, 2011, 2013, 2015
9 35 BC, AB, SK, MB, ON, NL none 2008–2009, 2016
10 19 BC, AB, SK MB, PE feeder rodent 2008–2011
11 5 BC, SK none 2010–2011
12 23 BC, AB, SK, MB, ON, NB none 2008–2012, 2014
13 5 MB none 2009
14 6 AB chicken, water 2010, 2012–2013
15 6 AB, SK chicken 2014–2015
16 11 AB, ON chicken, porcine 2010, 2012–2015
17 12 BC, ON, QC porcine 2009–2014
18 4 ON, QC none 2010, 2014
19 5 QC, NL bovine 2012–2015
20 5 AB, QC porcine 2013–2015
21 6 QC none 2012–2013
22 18 BC, SK, ON, QC porcine 2008, 2012–2015
23 14 AB, SK, ON, QC porcine 2012–2013, 2015
24 14 BC, AB, SK, MB, ON, QC, NS chicken, turkey, avian 2013–2016
25 6 MB, ON porcine 2014–2016
26 12 BC, MB, ON, NB none 2012–2016
5 QC none 2013
27 5 BC, SK, ON, QC none 2013, 2015
28 4 MB, ON, NL none 2011–2013
29 7 SK, ON, QC, NL, PE none 2012–2015
30 8 ON, QC porcine 2007, 2009–2011, 2014
31 4 ON, QC porcine 2010–2011
32 4 BC none 2016
33 5 ON, QC none 2013–2015

Fig 2. MST showing the distribution of isolates by date of isolation.

Fig 2

When the date of isolation was not available the date the isolate was received by the laboratory was used. A full wgMLST UPGMA dendrogram (S1 Fig) was used to create this MST using BioNumerics 7.6.3.

PFGE types were not randomly distributed in the population, though some clustered more tightly than others (Fig 3, S1 Fig). Most PFGE type predominate in specific MST/UPGMA clades, consistent with a fairly good correlation between whole genome sequence and PFGE type, though in some cases (eg. STXAI.0399, STXAI.0008) a small number of isolates can be found in a completely different clade. Some other PFGE type were distributed over a greater diversity of genotypes within the dendrogram and MST (eg. STXAI.0010, STXAI.0324) while there were PFGE type that appeared very tightly clustered (eg. STXAI.0185, STXAI.0821; Fig 3, S1 Fig) though this may have been the result of testing fewer isolates. PFGE type STXAI.0008 and STXAI.0399 were over-represented because they were associated with large case clusters for which epidemiological information was thought to be available, and which were therefore sampled in their entirety for the period in which the cluster was identified and at higher frequencies for other time periods. This has affected the clade structure of both the UPGMA dendrogram and MST visualization by emphasizing clades containing isolates with these PFGE type.

Fig 3. Mapping of major PFGE types onto MST of Salmonella 4,[5],12:i:- isolates.

Fig 3

The MST was constructed in BioNumerics 7.6.3 using wgMLST data for all isolates sequenced in this study with the exception of four outliers. A full wgMLST UPGMA dendrogram (S1 Fig) was used to create this MST.

Case clusters defined by PFGE were also mapped onto the wgMLST dendrogram and an MST was constructed (Fig 4, Table 1, S1 Fig) in order to understand the concordance of these methods for possible future evaluation of archival data. STXAI.0399 includes wgMLST UPGMA clusters 10, 11, and 12, and predominates in both clades IA and IB spanning the years 2008–2016. PFGE cluster 1008ST399MP (see Fig 4 for explanation of cluster designations) was a large cluster associated with feeder rodents (manuscript in preparation). Canadian isolates and cases were part of a multi-national Salmonella 4,[5],12:i:- PFGE type STXAI.0399 outbreak attributed to feeder rodents that was extended over several years (see Fig 4, Table 1; manuscript in preparation) [41, 42]. This is consistent with the nature of the outbreak and suggests there are still aspects of the outbreak that are not well understood. We think that the multi-cluster nature of this outbreak is consistent with the origin of the source (manuscript in preparation). The data suggest that when using WGS-based prospective surveillance it may be sufficient to investigate phylogenetic clusters first, then identify sources, and link related clusters afterward as long as there is enough statistical power in the phylogenetic clusters to provide epidemiological insights. Most isolates in clade I were from western Canada, though a few were isolated in Ontario and Québec (S1 Fig), observations useful for epidemiological investigations. It is also clear from the data that the use of WGS for surveillance is more conducive than molecular typing methods for detecting these kinds of extended outbreaks, which in turn is extremely helpful for determining the sources and implementing interventions to prevent the organism from introduction into the human population.

Fig 4. MST showing the distribution of PFGE case clusters.

Fig 4

A full wgMLST UPGMA dendrogram (S1 Fig) was used to create this MST.

There was considerable genomic heterogeneity in clade II, with several sub-clades and multiple wgMLST UPGMA clusters (Fig 1). Phylogenetic clusters 1 and 13 correspond to PFGE clusters 1006ST306MB and 0911ST593MP, respectively (compare Figs 1 & 4). Cluster 2 partially overlaps PFGE cluster 1207ST10MP (S1 Fig) and contains both clinical and chicken isolates. Though the earliest clinical isolate was obtained before the chicken isolates, that does not preclude the possibility that chicken was the source for the clinical cases. We have noticed that isolates from animal sources can exhibit remarkable genomic stability over a period of a few years (eg. Fig 1, Cluster 2; see also S. Typhimurium in S1 Fig, clade XII), and it is likely that chickens contaminated with the cluster strain existed before the cluster occurred. It should be possible to link WGS surveillance data for clinical and animal infections to identify the establishments where the contaminated sources are housed and implement control measures to mitigate contamination of the animals with pathogens. wgMLST UPGMA cluster 3 contained porcine and water isolates from Québec in 2011. Similarly, cluster 14 contained chicken and water isolates from Alberta in 2010, 2012, and 2013. We do not currently have data on whether these sources were co-located but it is possible that traceback could elucidate the mechanisms propagating these organisms within these sources. Cluster 15 was comprised of a number of chicken isolates and one clinical isolate, suggesting the source of infection for the human case was chicken. The clinical isolate was obtained more than a year after the chicken isolates, suggesting that, though no isolates were available, chickens may have harboured the same strain during that time. Source attribution and traceback investigations using WGS data benefit from source surveillance data, and may do so more frequently as non-outbreak investigations and interventions are undertaken to mitigate contamination of foods before they reach the human population. The rest of the isolates in clade II were diverse (Fig 3). While there were loose phylogenetic clusters, they were not suggestive of potential outbreaks. Non-human isolates in clade II were obtained from chickens, ducks, a parrot, pigs, cows, a horse, and water (S1 Fig).

Most isolates in clade III were included in the study because they were part of a large cluster of cases identified as potential case clusters using PFGE patterns but were never subject to an epidemiological investigation. These were originally identified as nine PFGE case clusters (Table 1, Fig 4), as four phylogenetic clusters (Fig 1 and S1 Fig), and as four sub-clades, A-D (Fig 1 and S1 Fig). These sub-clades have large enough wgMLST allele differences between them that they would not be included in a single case cluster using our stringent interpretive guidelines. However, they may be related to each other in a way similar to the extended multi-year feeder rodent outbreak discussed earlier. Clade III contains mostly STXAI.008 PFGE type distributed in four subclades (Fig 3). Isolates within clade III were obtained between 1999 and 2016 though, because of the way the isolates were chosen for analysis, most were acquired between 2008 and 2010. No epidemiological data or investigation results were available for these case clusters and no source was previously determined. Phylogenetic clusters 8 and 9 were separated by only 16.6 alleles in the wgMLST dendrogram (S1 Fig) and contained PulseNet Canada PFGE case clusters spanning several years (Table 1, S1 Fig). Chicken and bovine isolates from Saskatchewan and Manitoba co-clustered with these isolates and may have contributed to human illness. Isolate PNCS014919 in sub-clade D was missing hin in the draft genome, unlike the adjacent isolates, though the remaining deletion of all sequence between iroB and hlyD was the same. This suggests a different genetic event may have resulted in the deletion of the hin locus, though it is possible that this could have occurred through rearrangement as well. An epidemiological investigation would be necessary to determine whether this difference was relevant in a public health context.

Clade IV has been discussed in a previous publication (19), though it should be noted that clusters have been renamed in this work. Several wgMLST UPGMA clusters were present but frequently dispersed through time (Fig 1, S1 Fig). Few of these corresponded to case clusters based on PFGE (S1 Fig). Exceptions were phylogenetic clusters18 (1006ST324 QC), 21 (1310ST821QC), 26 (includes 1307ST524MP), and PFGE cluster 1308ST1027MP. Most clusters defined by PFGE were scattered throughout the clade and many sub-clades comprised several case clusters defined by PFGE results. Placing new cases in the larger context of existing WGS data will be a valuable tool for understanding the population dynamics of the organism and helping to define case clusters for further analysis.

Though 8/14 clade V isolates were originally assigned PFGE type STXAI.0103, only two of them are present in wgMLST UPGMA cluster 33 (compare Figs 1 and 4). Given the wgMLST distances between the members of PulseNet Canada cluster 1303ST10MP in this clade many the isolates would now be considered insufficiently related for epidemiological analysis. The isolates with the closest phylogenetic relationships in cluster 33 are from different years. In addition, the monophasic genotype of one isolate in cluster 33 and a second on outside this cluster is due to insertion of the MREL rather than the short prophage fragment (S2 Fig), supporting the hypothesis that the MREL is mobile within the population studied. All the isolates in this clade are from humans (see S1 Fig).

S. Typhimurium isolates were found in clade VI along with Salmonella 4,[5].12:i:- (S1 Fig). Isolates from clusters 30 and 31, along with adjacent isolates all have the same fljB point mutant responsible for the monophasic phenotype, and likely represent a clone of isolates diversifying in time. All but one of these isolates were found in Québec and there was an association with porcine and bovine sources. Cluster 32 represented a very tight case cluster of S. Typhimurium in British Columbia. Six isolates in clade VI with lesions associated with the short prophage fragment and a truncated hin may also be clonally related. These isolates represent clinical cases and porcine sources. They are closely related to S. Typhimurium isolates comprising the PFGE 1007ST462MP case cluster in Québec and British Columbia, though the wgMLST suggests the human cases caused by the Salmonella 4,[5],12:i:- are not part of the same cluster as the S. Typhimurium. An isolate from a human case from Ontario included in this PFGE 1007ST462MP case cluster, PNCS015029, was located in clade VI more distant from the other isolates, consistent with the geographical difference (S1 Fig).

Clade VII is uniquely composed of shellfish isolates from New Brunswick, Nova Scotia, and Prince Edward Island, avian isolates from Saskatchewan and Prince Edward Island, and freshwater isolates from Alberta, Ontario, and Québec. The isolates were ST99 and were phenotypically classified as Salmonella 4,[5],12:i:- but were identified as S. Typhimurium using SISTR [37], GView analysis, and by the presence of the fljAB hin locus in annotated genomes. All isolates in this clade were finally classified as inconsistent Salmonella 4,[5],12:i:- with the characteristic point mutations conferring the monophasic phenotype (see below). We found it interesting that this genotype appeared to be stable through time (see Fig 2) in the marine aquatic niche.

S. Typhimurium and Salmonella 4,[5],12:i:- were both present in clade VIII. An IS-26-mediated deletion from iroB to STM2767 was detected in isolates from wgMLST UPGMA cluster 29 (Fig 1, S1 Fig), a sub-clade separate from the S. Typhimurium isolates (Fig 5, S2 Fig). The clinical isolates associated with cluster 29 were from Saskatchewan, Ontario, Québec, and Prince Edward Island, and were isolated from 2012–2015. A majority were PFGE type STXAI.0185. Assuming that these isolates were from a single source, our hypothesis is that at the time the source could have been traced and interventions taken to eliminate the bacteria from the food or food production chain. This type of investigation is not frequently undertaken but could become a valuable and cost-effective way of identifying and removing contaminated food before sale, thus promoting public health.

Fig 5. MST tree showing the genetic lesions responsible for the Salmonella 4,[5],12:i:- phenotype.

Fig 5

A full wgMLST UPGMA dendrogram (S1 Fig) was used to create this MST.

Clades IX and X are small and do not contain wgMLST clusters. Cluster 28 partially overlaps PFGE cluster 1201ST280MP and is located in clade XI (Fig 4, S1 Fig). The 1201ST280MP isolates were not too distant phylogenetically, and it may be reasonable to consider all of them a case cluster. All isolates in this clade had the same specific fljB point mutation (S1 and S2 Figs). Phylogenetic cluster 14 and clade XII are synonymous. All isolates in clade XII were S. Typhimurium, were from animals and, except for an isolate from Alberta, were all obtained in Ontario.

There appears to be some geographical segregation of Salmonella 4,[5],12:i:- and S. Typhimurium isolates in Canada. Isolates from Ontario are enriched in clades II, IV, V, VIII, and XII, while Québec isolates predominate in one sub-clade of clade IV, clade VI, and clade IX (Fig 6, S1 Fig). In clades I and III a majority of isolates are from western Canada, likely because isolates thought to be part of two large ongoing clusters were preferentially selected for sequencing. Isolates from Alberta and Québec slightly predominate in clade II. Almost all of the seafood isolates in clade VII are from the Maritime Provinces (New Brunswick, Nova Scotia, and Prince Edward Island). However, the limitations of the sampling strategy must be kept in mind when interpreting these data.

Fig 6. Mapping of isolates by province of origin.

Fig 6

A full wgMLST UPGMA dendrogram (S1 Fig) was used to create this MST.

Canadian Salmonella 4,[5],12:i:- isolates were derived from S. Typhimurium by several mechanisms

We were interested in the genetic lesions or scars associated with the genomes of monophasic Salmonella 4,[5],12:i:- isolates from a public health perspective. Our hypothesis was that, once created, many of them may be relatively stable and therefore useful in the time frame of public health investigations, though this will not be tested directly in the current work. We also speculated that the genetic lesions or structures affecting the second phase antigen locus may arise during the insertion of mobile elements during integration into the genome and are therefore fixed. However, it is also possible that a variety of genomic scars or configurations due to a single element may result from insertion and subsequent rearrangement, in which case they would not be fixed.

Twelve genetic lesions causing the monophasic phenotype (see below for details) were mapped onto wgMLST UPGMA dendrograms and MSTs. There were occasional instances in which contig breaks did not allow the identity and sequence of the lesion causing the monophasic genotype to be determined (Fig 5, S2 Fig). Two different variants predominate in the Canadian Salmonella 4,[5],12:i:- population characterized (Fig 5). The first type of lesion is caused by MREL insertion into the chromosome, creating the variants described earlier [19] that are predominantly found in clade IV and constitute what has previously been labelled the European lineage or clone [9]. Multiple sequence variants caused by insertion of the MREL have been collapsed into one category for this work. The second major clone constitutes clades I-III. Only one of the S. Typhimurium isolates included in this study was found in clades I-IV.

Lesions caused by the presence of a 5089 bp inserted or retained prophage fragment (“short prophage insertion”; S1 File, S2 and S3 Figs) adjacent to hin, previously identified as a unique insertion in the region [7], predominate in clades I to III, V, and X. This lesion results in the deletion of ~72 kb of adjacent DNA, including fljAB hin, between homologs of S. Typhimurium homologs STM2692 and STM2773. A small number of isolates with this element are also found in other clades in which they do not predominate, raising questions about whether this prophage fragment is, in fact, mobile in the genome (see S2 Fig). These isolates therefore belong to the U.S. clone as described previously [7]. PHASTER results for the 5089 bp short prophage insertion indicated identity with Salmonella phage SJ46 (Genbank accession number NC_031129.1), especially on the basis of the three tail proteins. Consistent with previous results [7], two proteins encoded by this insertion have some identity with S. Typhimurium LT2. UmuC has 91.76% identity (98% coverage) with STM1997 and YedK has 94.38% identity (79% coverage) with Gifsy-2 prophage protein STM1053. The hypothetical protein/DUF4236 domain-containing protein was not detected in LT2 by using NCBI blastp searches. We concur with Soyer and colleagues [7] that this insertion may be the remnant of a larger prophage that has inserted proximal to hin, but there does not appear to be a candidate prophage with the appropriate characteristics yet deposited into GenBank. The inversion of the invertible segment containing hin adjacent to the side tail fiber protein/shikimate transporter of the short prophage results in differences in the N-terminal aa sequence of this protein (S3 Fig) while the C-terminal aa ‘WRTING’ sequence is retained. This protein appears to have evolved so that it is always in-frame and potentially expressed no matter the orientation of hin, and the distribution of isolates in the wgMLST UPGMA dendrogram is consistent with the reversible hin-mediated inversion of the invertible segment rather than effects associated with the population structure Salmonella 4,[5].12:i:- (S2 Fig). In contrast, the population distribution of isolates with this short prophage (Fig 5, S2 Fig) is generally consistent with an initial introduction of genetic material and clonal expansion, as is the consistent location and orientation of the insertion.

During the analysis of the short prophage fragment we noticed that there were several instances in which hin was present in draft assemblies associated with iroB and the short prophage fragment but was not included in the complete genome assemblies (PNCS000211, PNCS014846, PNCS014850, PNCS014853, PNCS014854, PNCS015054; S2 Fig). Draft genomes of isolate PNCS014849 had hin and the intact prophage while in the complete genome the side tail fiber protein/shikimate transporter was a pseudogene, suggesting WGS assembly difficulties. There are draft genome assemblies in which hin was on its own contig (eg. PNCS015391, 1005 bp hin contig) and we suggest in some cases these small contigs may have been lost in the process of combining Illumina NextSeq and Nanopore data. Isolates PNCS014883 and PNCS015288 was the only ones that contained the short prophage fragment and lacked hin. However, iroB and the short prophage were separated by a contig break, so we cannot be certain hin was not lost in the assembly process instead of being completely absent.

U.S. clone isolates have lost the entire Fels-1 prophage from STM0893 to STM0929 (Cluster II in Soyer et al., 2009 [7]), leaving STM0892 adjacent to STM0930 (see closed genome PNCS015054, GenBank accession no. CP037877). Fels-1 is absent in U.S. clone isolates from our study. While not part of the U.S. clone, isolates with a truncated hin and an adjacent short prophage insertion were detected in clade VI.

The loci encoding the allantoin-glyoxalate pathway that was previously reported to be absent in the Spanish Salmonella 4,[5],12:i:- clone [7, 42] was missing in 43 isolates in this study. Of these, 7 (16%) were S. Typhimurium, suggesting these allantoin-glyoxalate pathway chromosomal loci may be sufficiently labile that they are not diagnostic of any clone in particular on larger temporal or geographical scales. Therefore we are reluctant to designate our isolates lacking these loci as belonging to the Spanish clone of Salmonella 4,[5],12:i:-. Isolates PNCS015049, -051, and -052 were part of a cluster of Salmonella 4,[5],12:i:- clinical cases included by PFGE in PulseNet Canada cluster 1001ST7QC (S1 Fig), while the S. Typhimurium clinical isolate PNCS014862 that was also included in this cluster by PFGE was in a separate cluster at least 72 alleles distant from the monophasic strains. Isolates that lacked the allantoin-glyoxalate pathway, stratified by source, were: human clinical (18); porcine (16); bovine (7); chicken (2).

The presence or absence of the Gifsy-1 prophage (S. Typhimurium LT2 loci STM2585- STM2636; 2,728,973 to 2,776,825 nt on the genome) is presumed to be diagnostic for the Spanish clone of Salmonella 4,[5],12:i:- [7, 42]. We have found this prophage to be naturally variable in all S. Typhimurium and monophasic variants (see Fig 7A), subject to large deletions in a smaller number of isolates (Fig 7B), and completely lost in only three isolates (PNCS014869, PNCS014878, PNCS015595; see Fig 3B). Both PNCS014869 and PNCS015595 have also completely lost the Gifsy-2 prophage. Isolate PNCS015595 is a S. Typhimurium isolate, so the loss of both prophages may occur with a certain frequency regardless of clonal origin. It is not clear if the “Spanish clone” is actually a clone of Salmonella 4,[5],12:i:- in the population genomics sense as implemented through WGS.

Fig 7. Variability of the Gifsy-1 prophage in S. Typhimurium and Salmonella 4,[5],12:i:- isolates.

Fig 7

A. Example of normal variability seen in the population. B. Compilation of all genomes with large deletions within Gifsy-1. Isolates were compared using pangenome analysis with GView server.

IS26 was associated with deletions of fljAB hin at iroB in four clinical isolates distributed between clades IV and VIII (Fig 5). (The draft and complete genomes of isolate PNCS014868 constitute two different nodes in clade VIII.) The next complete coding sequence was STM2767 in clade VIII isolates PNCS014868 and PNCS015229 and STM2759 (sgrR) in clade IV isolate PNCS015233. This is consistent with two independent deletion events and with previous reports that IS26 alone may be responsible for deletions at iroB [43]. Deletions of fljAB, leaving hin intact or truncated, were also associated with IS26 (Fig 5).

Four mutations in the fljB gene were detected in the population under study. Three (C337T, C439T, C1285T) were point mutations creating a stop codon and truncated protein. The fourth was a deletion of C470 that creates a frameshift in fljB, making it incapable of producing an active full-length protein. Most fljB C1285T mutations are associated with isolates in clade VI, suggesting there may have been a clonal expansion of isolates, possibly from pork to the human population over a period of years (see above). However, one human isolate with this mutation was also found in clade IV, indicating the mutation has also arisen independently. The fljB C337T mutation was found in all (clinical) isolates comprising clade XI (Fig 5), which were isolated during the period from September 2011 through April 2012 (Fig 3), suggesting they are clonally related. The other two fljB variants were in distant parts of the MST tree from the aforementioned fljB point mutants and from each other.

“Inconsistent” monophasic variants that are phenotypically Salmonella 4,[5],12:i:- but have an intact fljAB hin locus can result from point mutations in hin (R140L aa change) and fljA (A46T aa change) [44]; these changes have been found to reduce the frequency of phase variation to undetectable levels [10]. Clade VII (Fig 5, S1 and S2 Figs) was composed of Salmonella 4,[5],12:i:- isolates with these point mutants. Other water isolates from Québec were located in clade II and were associated with the short prophage fragment, indicating that neither genotype is exclusively associated with monophasic water isolates. Single nucleotide variants that result in the monophasic phenotype are potentially reversible and will not necessarily be detected using Salmonella geno-serotyping tools, though they will be detected phenotypically.

None of the isolates with point mutations resulting in the monophasic phenotype were identified by SISTR.

Conclusions

Phylogenetically related isolates may on occasion have different genetic lesions that arise from different genetic events and appear to be fixed in the genome. It may be necessary to account for these differences in investigations of small numbers of isolates to maximize the statistical power of epidemiological findings. The use of WGS data will result in fewer wrong and confusing case clusters when compared with PFGE or other molecular typing methods, reducing the number of clusters that must be assessed by epidemiologists for further investigation and concomitantly reducing the inherent time and effort involved. Previously the low resolution provided by PFGE made epidemiological investigation highly challenging, if not impossible. In stark contrast, the much higher specificity of WGS (cgMLST- or snp-based analysis) is highly informative for decisions regarding the definition of case clusters. Increasing surveillance of non-human isolates will aid public health investigations by providing critical, early information about non-human sources of isolates phylogenetically linked to clinical case clusters. Further development and analysis of historical datasets of WGS data for both human and non-human isolates, as was done for this work, will facilitate these improvements. Some aspects of the biology of lesions affecting the second phase antigen locus, especially questions about whether the MREL, SGI-4, and short prophage fragment are mobile within the population and at what frequency, still remain to be answered.

Supporting information

S1 Fig. wgMLST UPGMA dendrogram showing clusters and associated isolate and case metadata.

The dendrogram was created in BioNumerics v7.6.3 and annotated using Adobe Creative Cloud Illustrator CC.

(PDF)

S2 Fig. wgMLST dendrogram containing Salmonella 4,[5],12,i:- and S. Typhimurium isolates showing the genetic lesion responsible for the 4,[5],12,i:- genotype.

The dendrogram was created in BioNumerics v7.6.3 and annotated using Adobe Creative Cloud Illustrator CC.

(PDF)

S3 Fig. Genes associated with the short prophage insertion replacing fljAB, its location in the genome, and N-terminal aa sequences resulting from inversion of the hin element.

The figure was obtained using GView Server and annotated using Adobe Creative Cloud Illustrator CC.

(PDF)

S1 File. Fasta file of the short prophage inserted into Salmonella 4,[5],12:i:- genomes near the iroB locus.

(FASTA)

S1 Spreadsheet. Selected isolate metadata.

(XLSX)

S2 Spreadsheet. Quality statistics for WGS data for assembled genomes.

The data were generated using tools within BioNumerics version 7.6.2.

(XLSX)

Acknowledgments

We acknowledge the PulseNet Canada (PNC) Steering Committee and provincial Public Health laboratories (PHLs) from all Canadian provinces and territories for characterization of isolates, collection and curation of metadata, sending isolates to the NML Winnipeg, and granting permission to use isolates and associated metadata. Thanks to Lorelee Tschetter and Ashley K. Kearney for managing all interaction and inquiries associated with the PNC Steering Committee and provincial PHLs.

Isolates, isolate metadata, and permission to use non-human isolates and publish the results were provided by the following individuals and organizations: Mark Hicks, Agri-Food Laboratories, Alberta Agriculture and Forestry, Alberta; Jan Giles, Atlantic Veterinary College, PEI; Erin Zabek and Jaime Battle, Animal Health Centre, British Columbia Ministry of Agriculture; John Devenish, Canadian Food Inspection Agency, Moe Elmufti, Canadian Food Inspection Agency; Jane Parmley, Canadian Integrated Program for Antimicrobial Resistance Surveillance; Danielle Daignault, National Microbiology Laboratory at St. Hyacinthe; Neil Pople, Manitoba Agriculture, Manitoba; Olivia Labrecque and Julie Fairbrother, Ministère de l’Agriculture, des Pêcheries et de l’Alimentation du Québec, Québec; Moussa Diarra, Agriculture and Agrifood Canada, Guelph, Ontario; Susan Bach, Pacific Agri-Food Research Centre, British Columbia; Dawn Daku, Saskatchewan Health, Saskatchewan; Durda Slavic, Animal Health Laboratory, University of Guelph, Ontario; Carlos Leon-Velarde, University of Guelph, Ontario; Joseph Rubin and Janet Hill, Dept. of Veterinary Microbiology, University of Saskatchewan, Saskatchewan.

Genome sequencing was done within and by the Genomics Core Facility at NML Winnipeg. We would like to thank Dr. Morag Graham (Chief, Genomics Core) and Brynn Kaplen, Christine Bonner, Geoff Peters, Kim Melnychuk, Erika Landry, Shari Tyson, and Vanessa Laminman, for preparation of libraries and all other steps in sequencing.

Thanks to Guangzhi Zhang for DNA template preparation. Dr. V.P.J. Gannon provided water isolates and metadata for analysis. Thanks to Chantal Munyuza for helping to sort out the genetic lesions associated with monophasic Salmonella Typhimurium.

We acknowledge the work of the Bioinformatics Core of NML Winnipeg, led by Dr. Gary van Domselaar, in creating and maintaining bioinformatics tools used in this manuscript.

Data Availability

The datasets generated or analyzed in the current study are available in the National Center for Biotechnology Information (NCBI) repository under BioProject PRJNA543337. Biosample numbers for each isolate can be found in S1 Spreadsheet.

Funding Statement

Government of Canada Genomics Research and Development (GRDI) - Phase VI -CGC, RJ; Government of Canada A-base program funding - CGC, RJ, CN, JN, FP, SP.

References

  • 1.Gerner-Smidt P, Besser J, Concepcion-Acevedo J, Folster JP, Huffman J, Joseph LA, et al. Whole genome sequencing: bridging One-Health surveillance of foodborne diseases. Front Public Health. 2019;7: 172. 10.3389/fpubh.2019.00172 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hynds PD, Thomas MK, Pintar KDM. Contamination of groundwater systems in the US and Canada by enteric pathogens, 1990–2013: A review and pooled analysis. PLoS One. 2014;9(5): e93301. 10.1371/journal.pone.0093301 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Silva C, Calva E, Maloy S. One Health and food-borne disease: Salmonella transmission between humans, animals, and plants. Microbiol Spectrum. 2014;2(1): OH-0020-2013. [DOI] [PubMed] [Google Scholar]
  • 4.Cui Q, Huang Y, Wang H, Fang T. Diversity and abundance of bacterial pathogens in urban rivers impacted by domestic sewage. Environ Pollution. 2019;249: 24–35. 10.1016/j.envpol.2019.02.094 [DOI] [PubMed] [Google Scholar]
  • 5.O’Brien S. The “decline and fall” of nontyphoidal Salmonella in the United Kingdom. Clin Infect Dis. 2013;56(5): 705–10. 10.1093/cid/cis967 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Powell MR, Crim SM, Hoekstra RM, Williams MS, Gu W. Temporal patterns in principal Salmonella serotypes in the USA; 1996–2014. Epidemiol Infect. 2018;146: 437–41. 10.1017/S0950268818000195 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Soyer Y, Switt AM, Davis MA, Maurer J, McDonough PL, Schoonmaker-Bopp DJ, et al. Salmonella enterica serotype 4,5,12:i:-, an emerging Salmonella serotype that represents multiple distinct clones. J Clin Microbiol. 2009;47(11): 3546–56. 10.1128/JCM.00546-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sun H, Wan Y, Du P, Bai L. The epidemiology of monophasic Salmonella Typhimurium. Foodborne Pathogens Dis. 2020. February;17(2): 87–97. 10.1089/fpd.2019.2676 [DOI] [PubMed] [Google Scholar]
  • 9.Switt AIM, Soyer Y, Warnick LD, Wiedmann M. Emergence, distribution, and molecular and phenotypic characteristics of Salmonella enterica serotype 4,5,12:i:-. Foodborne Pathogens Dis. 2009;6(4): 407–415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ido N, Lee K-I, Iwabuchi K, Izumiya H, Uchida I, Kusumoto M, et al. Characteristics of Salmonella enterica serovar 4,[5],12:i:- as a monophasic variant of serovar Typhimurium. PLoS One. 2014;9(8): e104380. 10.1371/journal.pone.0104380 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Petrovska L, Mather AE, AbuOun M, Branchu P, Harris SR, Connor T, et al. Microevolution of monophasic Salmonella Typhimurium during epidemic, United Kingdom, 2005–2010. Emerg Infect Dis. 2016;22: 617–24. 10.3201/eid2204.150531 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Mather AE, Phuong TLT, Gao Y, Clare S, Mukhopadhyay S, Goulding DA, et al. New variant of multidrug-resistant Salmonella enterica serovar Typhimurium associated with invasive disease in immunocompromised patients in Vietnam. mBio. 2018;9: e01056–18. 10.1128/mBio.01056-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mulvey MR, Finley R, Allen V, Ang L, Bekal S, Bailey SE, et al. Emergence of multidrug-resistant Salmonella enterica serotype 4,[5],12:i:- involving human cases in Canada: results from the Canadian Integrated Program on Antimicrobial Resistance Surveillance (CIPARS), 2003–10. J Antimicrobial Chemother. 2013;68: 1982–86. [DOI] [PubMed] [Google Scholar]
  • 14.Petrovska L, Mather AE, AbuOun M, Branchu P, Harris SR, Connor T, et al. Microevolution of monophasic Salmonella Typhimurium during epidemic, United Kingdom, 2005–2010. Emerg Infect Dis. 2016;22: 617–24. 10.3201/eid2204.150531 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Elnekave E, Hong S, Mather AE, Boxrud D, Taylor AJ, Lappi V, et al. Salmonella enterica 4,[5],12:i:- in swine in the United States Midwest: an emerging multidrug resistant clade. Clin Infect Dis. 2018;66(6): 877–85. 10.1093/cid/cix909 [DOI] [PubMed] [Google Scholar]
  • 16.Hauser E, Tietze E, Helmuth R, Junker E, Blank K, Prager R, Rabsch W, Appel B, Fruth A, Malorney B. Pork contaminated with Salmonella 4,[5],12:i:-, an emerging health risk for humans. Appl Environ Microbiol. 2010;76: 4601–4610. 10.1128/AEM.02991-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.White PL, Green AL, Holt KG, Hale KR. Multidrug-resistant Salmonella enterica subspecies I serovar 4,[5],12:i:- isolates recovered from food safety and inspection service-regulated products and food animal ceca, 2007–2016. Foodborne Pathogens Dis. 2019; 16(10):679–86. 10.1089/fpd.2018.2573 [DOI] [PubMed] [Google Scholar]
  • 18.Palma F, Manfreda G, Silva M, Parisi A, Barker DOR, Taboada EN, et al. Genome-wide identification of geographical segregated genetic markers in Salmonella enterica serovar Typhimurium variant 4,[5]:12:i:-. Sci Rep. 2018;8: 15251. 10.1038/s41598-018-33266-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Clark CG, Landgraff C, Robertson J, Pollari F, Parker S, Nadon C, et al. Distribution of heavy metal resistance elements in Canadian Salmonella 4,[5],12:i:- populations and association with the monophasic genotypes and phenotype. PLoS One. 2020;15(7): e0236436. 10.1371/journal.pone.0236436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hopkins KL, Kirchner M, Guerra B, Granier SA, Lucarelli C, Porrero MC, et al. Multiresistant Salmonella enterica serovar 4,[5],12:i:- in Europe: a new pandemic strain? Eurosurveill. 2010;15(22): pii = 19580. [PubMed] [Google Scholar]
  • 21.Mourão J, Novais C, Machado J, Peixe L, Antunes P. Metal tolerance in emerging clinically relevant multidrug-resistant Salmonella enterica serotype 4,[5],12:i:- clones circulating in Europe. Int J Antimicrob Agents. 2015; 45:610–16. 10.1016/j.ijantimicag.2015.01.013 [DOI] [PubMed] [Google Scholar]
  • 22.Medardus JJ, Molla BZ, Nicol M, Morrow WM, Rajala-Schultz PJ, Kazwala R, et al. In-feed use of heavy metal micronutrients in U.S. swine production systems and its role in persistence of multidrug-resistand Salmonellae. Appl Environ Microbiol. 2014;80(7): 2317–25. 10.1128/AEM.04283-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Mastrorilli E, Pietrucci D, Barco L, Ammendola S, Petrin S, Longo A, et al. A comparative genomic analysis provides novel insights into the ecological success of the monophasic Salmonella serovar 4,[5],12:i:-. Frontiers Microbiol 2018;9: 715. 10.3389/fmicb.2018.00715 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Campos J, Mourão J, Peixe L, Antunes P. Non-typhoidal Salmonella in the pig production chain: a comprehensive analysis of its impact on human health. Pathogens. 2019;8: 19. 10.3390/pathogens8010019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Arruda BL, Burrough ER, Schwartz KJ. Salmonella enterica I 4,[5],12:i:- associated with lesions typical of swine enteric salmonellosis. Emerg Infect Dis. 2019;25(7): 1377–79. 10.3201/eid2507.181453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Besser JM, Carleton HA, Trees E, Stroika SG, Hise K, Wise M, Gerner-Schmidt P. Interpretation of whole-genome sequencing for enteric disease surveillance and outbreak investigation. Foodborne Pathogens Dis. 2019;16(7): 504–12. 10.1089/fpd.2019.2650 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Nadon C, van Walle I, Gerner-Smidt P, Campos J, Chinen I, Concepcion-Acevedo J, et al. 2017. PulseNet International: Vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance. Euro Surveill. 2017;22(23): pii = 30544. 10.2807/15607917.ES.2017.22.23.30544 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chattaway MA, Dallman TJ, Larkin L, Nair S, McCormick J, Mikhail A, et al. The transformation of reference microbiology methods and surveillance for Salmonella with the use of whole genome sequencing in England and Wales. Front Public Health 2019;7: 317. 10.3389/fpubh.2019.00317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Leekitcharoenphon P, Nielsen EM, Kaas RS, Lund O, Aarestrup FM. Evaluation of whole genome sequencing for outbreak detection of Salmonella enterica. PLoS One. 2014;9(2): e87991. 10.1371/journal.pone.0087991 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Morganti M, Bolzoni L, Scaltriti E, Casadei G, Carra E, Rossi L, et al. Rise and fall of outbreak-specific clone inside endemic pulsotype of Salmonella 4,[5],12:i:-; insights from high-resolution molecular surveillance in Emilia-Romagna, Italy, 2012 to 2015. Euro Surveill. 2018. March; 23(13):17–00375. 10.2807/1560-7917.ES2018.23.13.17-00375 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Keefer AB, Xaioli L, M’ikanatha NM, Yao K, Hoffman M, Dudley EG. Retrospective whole genome sequencing analysis distinguished PFGE and drug-resistance-matched retail meat and clinical Salmonella isolates. Microbiology. 2019;165: 270–86. 10.1099/mic.0.000768 [DOI] [PubMed] [Google Scholar]
  • 32.Kozyreva VK, Crandall J, Sabol A, Poe A, Zhang P, Concepción-Acevedo J, et al. Laboratory investigation of Salmonella enterica serovar Poona outbreak in California: comparison of Pulsed-Field Gel Electrophoresis (PFGE) and Whole Genome Sequencing (WGS) results. PLoS Curr. 2016; November 22;8: ecurrent.outbreaks.1bb3e36e74bd5779bc43ac3a8dae52e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.PulseNet International. Standard operating procedure for PulseNet PFGE of Escherichia coli O157:H7, Escherichia coli non-O157, Salmonella serotypes, Shigella sonnei, and Shigella flexneri. 2013. http://www.pulsenetinternational.org/assets/PulseNet/uploads/pfge/PNL05_Ec-Sal-ShigPFGEprotocol.pdf
  • 34.Matthews TC, Bristow FR, Griffiths EJ, Petkau A, Adam J, Dooley D, et al. The Integrated Rapid Infectious Disease Analysis (IRIDA) Platform. bioRxiv. 2018; 381830. 10.1101/381830. [DOI] [Google Scholar]
  • 35.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin MM, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its application to single-cell sequencing. J Comput Biol. 2012;19: 455–477. 10.1089/cmb.2012.0021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Seeman T. PROKKA: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14): 2068–9. 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
  • 37.Yoshida CE, Kruczkiewicz P, Laing CR, Lingohr EJ, Gannon VP, Nash JH, et al. The Salmonella In Silico typing Resource (SISTR): an open web-accessible tool for rapidly typing and subtyping draft Salmonella genome assemblies. PLoS One. 2016. January 22;11(1)e0147101. 10.1371/journal.pone.0147101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Schonfeld J, Clark C, Robertson J, Gitanjali A, Eagle S, Gurnik S, et al. Complete genome sequences for 40 Canadian Salmonella Typhimurium and I 1,4,[5],12:i:- from clinical and animal sources. Microbiol Resource Announc. 2021;10: e00734–20. 10.1128/MRA.00734-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Walker M, Landgraff C, Clark C. Complete genome sequences of six Salmonella 4,[5],12:i:- isolates from Canada. Microbiol Resource Announc. 2020; June 11;9(24): e00074–20. 10.1128/MRA.00074-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Petkau A, Stuart-Edwards M, Stothard P, Van Domselaar G. Interactive microbial visualization with GView. Bioinformatics. 2010;26: 3125–6. 10.1093/bioinformatics/btq588 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Sweat D, Valiani A, Griffin D, Springer D, Rath S, Greene S, et al. Infections with Salmonella I 4,[5],12:i:- linked to exposure to feeder rodents–United States, August 2011-February 2012. Morbid Mortal Wkly Rep. 2012;61(15): 277. [PubMed] [Google Scholar]
  • 42.Cartwright EJ, Nguyen T, Melluso C, Ayers T, Lane C, Hodges A, et al. A multistate investigation of antibiotic-resistant Salmonella enterica serotype I 4,[5]:12:i:- infections as part of an international outbreak associated with frozen feeder rodents. Zoonoses Public Health. 2016;63: 62–71. 10.1111/zph.12205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Garaizer J, Porwollik S, Echeita A, Rementeria A, Herrera S, Wong RM-Y, et al. DNA microarray-based typing of an atypical monophasic Salmonella enterica serovar. J Clin Microbiol. 2002;40(6): 2074–8. 10.1128/jcm.40.6.2074-2078.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Barco L, Longo A, Letini AA, Cortini E, Saccardin C, Minorello C, et al. Molecular characterization of “inconsistent” variants of Salmonella Typhimurium isolates in Italy. Foodborne Pathog Dis. 2014;11(6): 497–9. 10.1089/fpd.2013.1714 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Patrick Butaye

10 Nov 2020

PONE-D-20-29513

Population structure, case clusters, and genetic lesions associated with Canadian Salmonella 4,[5],12:i:- isolates

PLOS ONE

Dear Dr. Clark,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

As you can see, the reviewers well received the science and value of the manuscript, as did I. Please reply to the minor changes indicated by the reviewers.

Please submit your revised manuscript by Dec 25 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Patrick Butaye, DVM, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1.) Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.) We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere.

"Clark et al. 2020. Distribution of heavy metal resistance elements in Canadian Salmonella 4,[5],12:i:- populations and association with the monophasic genotypes and phenotype. PLoS One 15(7): e0236436. Fig 14, S1 Spreadsheet, S2 Spreadsheet were included in unmodified or modified form in the current manuscript"

Please clarify whether this publication was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Comments to the Author by Reviewer (Manuscript ID: PONE-D-20-29513)

Major comments:

The manuscript titled “Population structure, case clusters, and genetic lesions associated with Canadian Salmonella 4,[5],12:i:- isolates” by Clark et al. is an interesting, well written and well-presented study, which bringing up one of the major threat to public health. Salmonella 4,[5],12:i:- constitutes an international clone that in some circumstances harbor several AMR genes, particularly encoding resistance to colistin.

Considering the dramatic increase of mcr genes and their variants in Salmonella enterica 4,[5],12:i:-, especially, sequence type 34, the authors would possible include a short statement about mobile colistin resistance and sequence types distributed in these clades. It will allow the readership to associate the population structure with the distribution of certain AMR genes/ST, denoting their promiscuity in several hosts. Besides that, the authors denoted the worryingly identification of Salmonella 4,[5],12:i:-, which in the past were identified as S. Typhimurium, and now through high-resolution methods we can distinguish correctly these serovars. Lastly, the manuscript is technically sound, and the data support the conclusions. All data underlying the findings in their manuscript fully available and written in standard English.

Detailed comments:

Abstract

L19-40: Well written and well presented.

L24: Please insert a “S.” before 4,[5],12:i:-.

L29: IS26

Introduction

L44: not only infected hosts can make the spread. Would be relevant to considering the asymptomatic carriers.

L51: Are you talking about disease? If so, you can keep the word “incidence”. Otherwise, replace to something like prevalence/occurrence/frequency.

L64-66 and 77-81: Please, would be appropriated to cite others studies regarding Salmonella 4,[5],12:i:- and AMR, including the following references:

-Arnott A, et al. Multidrug-resistant Salmonella enterica 4,[5],12:i:- Sequence Type 34, New South Wales, Australia, 2016–2017. Emerg Infect Dis. 2018;24:751–753. doi: 10.3201/eid2404.171619.

-Mulvey MR, Bharat A, Boyd DA, Irwin RJ, Wylie J. Characterization of a colistin-resistant Salmonella enterica 4,[5],12:i:- harbouring mcr-3.2 on a variant IncHI-2 plasmid identified in Canada. J Med Microbiol. 2018;67:1673–1675. doi: 10.1099/jmm.0.000854.

-Monte DF, et al. Multidrug- and colistin-resistant Salmonella enterica 4,[5],12:i:- sequence type 34 carrying the mcr-3.1 gene on the IncHI2 plasmid recovered from a human. J Med Microbiol. 2019 Jul;68(7):986-990. doi: 10.1099/jmm.0.001012.

L71: Please insert a “S.” before 4,[5],12:i:-.

L74: Instead (Salmonella Genetic Island-4) replace by (Salmonella Genomic Island-4)

Materials and methods

L175: Please insert a “S.” before 4,[5],12:i:-.

Results and Discussion

L216: In my opinion would be relevant the replacement of “Please” by “It is important to note”…

L354: S. 4,[5],12:i:-

L393: S.

L451: Please, replace “think” to “suggest”.

L489: IS26

L494: IS26

L495: IS26

L501: Insert a space before however.

Conclusion

Well written and well presented.

Reviewer #2: This paper reports very interesting findings that resulted from the comparison of the population structure of Salmonella Typhirumium and (mostly) its monophasic variant, using the classical PFGE method vs cgMLST, on selected Canadian isolates. The paper is well written and the methodology used is very robust. The quality and the variety of bio-informatic analysis, up to the nucleic acid sequence level for understanding some cgMLST sub-clusters, including the analysis of the monophasic variant, make this paper a very informative one. The results related to the discrimination level offered by cgMLST vs PFGE is another very interesting component of this manuscript.

My only significant concern is about the representivity of the isolates. The authors claimed to have a reasonable representivity of Canadian isolates from 2008-2016 while they not only recognise the limitations of the sampling strategy but also explained some changes in their population structure due to addition of water samples and as well they recognized that in some instances, the fact that more isolates were coming from a given epidemic may have affected their results. I therefore do not concur with this claim. There is nothing such as a meaningful Salmonella sampling strategy in Canada. This country relies almost solely on a passive, and very variable one from a province to another, surveillance system. In addition, as written, the authors are providing arguments that their sampling is not really representative. In fact, there is no need for the sampling to be representative of the Salmonella situation within this country. The characterization of different isolates from various areas (of a country) and the study of epidemic strains by comparing the findings from PFGE and cgMLST is by itself very interesting. Explaining the difference between various isolates at the genomic level also is highly valuable.

On another topic, within the results and discussion section, the authors occasionaly explained their findings in relation with some outbreaks, which is it good and make it interesting. It would be nice to know which proportion of the isolates are coming from sporadic cases and where they are located among the different clusters.

Some specific comments to consider are:

Line 149: The comment on point mutations should go in the results section.

Line 194: There is no need to underline that addition of isolates (from water) affected dendrogram topology; the opposite would have been very surprising.

Line 222: Isolates recovered from 2008 to 2016 vs Table 1 where 2007 is mentioned.

line 282: Unclear as written. Chickens would have been already contaminated by a cluster strains? In addition, explain what is supporting this hypothesis?

Line 364: Why this assumption would be valid? I would rather present it as an hypothesis.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Apr 6;16(4):e0249079. doi: 10.1371/journal.pone.0249079.r002

Author response to Decision Letter 0


7 Jan 2021

Reviewers' comments:

Reviewer #1: Comments to the Author by Reviewer (Manuscript ID: PONE-D-20-29513)

Major comments:

The manuscript titled “Population structure, case clusters, and genetic lesions associated with Canadian Salmonella 4,[5],12:i:- isolates” by Clark et al. is an interesting, well written and well-presented study, which bringing up one of the major threat to public health. Salmonella 4,[5],12:i:- constitutes an international clone that in some circumstances harbor several AMR genes, particularly encoding resistance to colistin.

A. Thank you!

Considering the dramatic increase of mcr genes and their variants in Salmonella enterica 4,[5],12:i:-, especially, sequence type 34, the authors would possible include a short statement about mobile colistin resistance and sequence types distributed in these clades. It will allow the readership to associate the population structure with the distribution of certain AMR genes/ST, denoting their promiscuity in several hosts.

A. A third manuscript is in preparation, focussing on findings not included in the previous two manuscripts. It will deal with the accessory genome and mobilome using WGS data from the isolates in this study and the one previously published. Our intention is to look at the geographic and perhaps temporal distribution of prophages, plasmids and other elements that may confer antibiotic resistance. Significant contributions will be made by two researchers specializing in this type of analysis, Dr. John Nash and James Robertson, who are also authors on this manuscript.

Besides that, the authors denoted the worryingly identification of Salmonella 4,[5],12:i:-, which in the past were identified as S. Typhimurium, and now through high-resolution methods we can distinguish correctly these serovars. Lastly, the manuscript is technically sound, and the data support the conclusions. All data underlying the findings in their manuscript fully available and written in standard English.

Detailed comments:

Abstract

L19-40: Well written and well presented.

A. Thank you!

L24: Please insert a “S.” before 4,[5],12:i:-.

A. “Salmonella” has now been added to all instances where it was previously missing.

L29: IS26

A. IS26 has now been italicized wherever it occurs.

Introduction

L44: not only infected hosts can make the spread. Would be relevant to considering the asymptomatic carriers.

A. A sentence stating that “Bacteria may be spread by both symptomatic or asymptomatic hosts.” has now been added to lines 48-49 of the revision with tracked changes.

L51: Are you talking about disease? If so, you can keep the word “incidence”. Otherwise, replace to something like prevalence/occurrence/frequency.

A. We have changed the word to “occurrence” in line 54 of the revision with tracked changes.

L64-66 and 77-81: Please, would be appropriated to cite others studies regarding Salmonella 4,[5],12:i:- and AMR, including the following references:

We do intend to do an analysis of the mcr resistance genes as part of a thorough description of the resistome in our isolates. James Robertson and John Nash have already identified a very early instance of mcr-3 in our dataset. The references you have so kindly provided will be an integral part of that manuscript.

-Arnott A, et al. Multidrug-resistant Salmonella enterica 4,[5],12:i:- Sequence Type 34, New South Wales, Australia, 2016–2017. Emerg Infect Dis. 2018;24:751–753. doi: 10.3201/eid2404.171619.

-Mulvey MR, Bharat A, Boyd DA, Irwin RJ, Wylie J. Characterization of a colistin-resistant Salmonella enterica 4,[5],12:i:- harbouring mcr-3.2 on a variant IncHI-2 plasmid identified in Canada. J Med Microbiol. 2018;67:1673–1675. doi: 10.1099/jmm.0.000854.

-Monte DF, et al. Multidrug- and colistin-resistant Salmonella enterica 4,[5],12:i:- sequence type 34 carrying the mcr-3.1 gene on the IncHI2 plasmid recovered from a human. J Med Microbiol. 2019 Jul;68(7):986-990. doi: 10.1099/jmm.0.001012.

L71: Please insert a “S.” before 4,[5],12:i:-.

A. done

L74: Instead (Salmonella Genetic Island-4) replace by (Salmonella Genomic Island-4)

A. Done. Brain glitch. Thank you !

Materials and methods

L175: Please insert a “S.” before 4,[5],12:i:-.

A. Done

Results and Discussion

L216: In my opinion would be relevant the replacement of “Please” by “It is important to note”…

L354: S. 4,[5],12:i:-

L393: S.

L451: Please, replace “think” to “suggest”.

L489: IS26

L494: IS26

L495: IS26

L501: Insert a space before however.

A. All changes suggested for the Results and Discussion have been made.

Conclusion

Well written and well presented.

A. Thank you!

Reviewer #2: This paper reports very interesting findings that resulted from the comparison of the population structure of Salmonella Typhirumium and (mostly) its monophasic variant, using the classical PFGE method vs cgMLST, on selected Canadian isolates. The paper is well written and the methodology used is very robust. The quality and the variety of bio-informatic analysis, up to the nucleic acid sequence level for understanding some cgMLST sub-clusters, including the analysis of the monophasic variant, make this paper a very informative one. The results related to the discrimination level offered by cgMLST vs PFGE is another very interesting component of this manuscript.

My only significant concern is about the representivity of the isolates. The authors claimed to have a reasonable representivity of Canadian isolates from 2008-2016 while they not only recognise the limitations of the sampling strategy but also explained some changes in their population structure due to addition of water samples and as well they recognized that in some instances, the fact that more isolates were coming from a given epidemic may have affected their results. I therefore do not concur with this claim. There is nothing such as a meaningful Salmonella sampling strategy in Canada. This country relies almost solely on a passive, and very variable one from a province to another, surveillance system. In addition, as written, the authors are providing arguments that their sampling is not really representative. In fact, there is no need for the sampling to be representative of the Salmonella situation within this country. The characterization of different isolates from various areas (of a country) and the study of epidemic strains by comparing the findings from PFGE and cgMLST is by itself very interesting.

A. I agree totally with everything you wrote. I cannot find where I used the term “representative”; perhaps the previous paper? I might like to change that. As you mention, I have tried to describe in detail how isolates were selected and the view that they were not representative. It is an unfortunate word.

PulseNet Canada has been sequencing all Salmonella isolates since 2017. It is our intention to analyze the data to present (whenever that happens to be when we do the analysis) to determine the ratio of isolates with MREL/SGI-4 to isolates with the prophage fragment. That should tell us what proportion of total the multi-resistant MREL/SGI-4 isolates are and give us a baseline to see if that proportion has expanded in the future.

Explaining the difference between various isolates at the genomic level also is highly valuable.

On another topic, within the results and discussion section, the authors occasionaly explained their findings in relation with some outbreaks, which is it good and make it interesting. It would be nice to know which proportion of the isolates are coming from sporadic cases and where they are located among the different clusters.

Some specific comments to consider are:

Line 149: The comment on point mutations should go in the results section.

A. Agreed. We have made it the last sentence of the Results section so that nobody misses it.

Line 194: There is no need to underline that addition of isolates (from water) affected dendrogram topology; the opposite would have been very surprising.

A. A reviewer for the first paper was unimpressed with Minimum Spanning Trees. We defended their use, both because the MST and UPGMA dendrograms were congruent in that analysis and because MST make it much easier to intuitively understand temporal and geographic differences. However, when the additional data from water isolates was added in this manuscript, the MST and UPGMA trees were no longer topologically equivalent. That reviewer had a point, and I wanted it noted so that readers would be able to assess the analysis appropriately. I do agree with you that it probably doesn’t make any difference in interpretation.

Line 222: Isolates recovered from 2008 to 2016 vs Table 1 where 2007 is mentioned.

A. The clinical/human/PulseNet Canada isolates chosen were selected from 2008 through 2016. Far fewer non-human FoodNet Canada isolates were available so we used all that we had available, which were from 2000 through 2016. Since we were looking at clusters based on WGS and not case clusters, we included one isolate from before 2008 that was part of the WGS cluster.

line 282: Unclear as written. Chickens would have been already contaminated by a cluster strains? In addition, explain what is supporting this hypothesis?

A. All chicken isolates belonging to the cluster were obtained after the clinical/human isolates. But it is very likely that chickens were the source of human infections in this cluster.

Line 364: Why this assumption would be valid? I would rather present it as an hypothesis.

A. Done!

Attachment

Submitted filename: Reply to Reviewers.docx

Decision Letter 1

Patrick Butaye

11 Mar 2021

Population structure, case clusters, and genetic lesions associated with Canadian Salmonella 4,[5],12:i:- isolates

PONE-D-20-29513R1

Dear Dr. Clark,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Patrick Butaye, DVM, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Patrick Butaye

26 Mar 2021

PONE-D-20-29513R1

Population structure, case clusters, and genetic lesions associated with Canadian Salmonella</italic> 4,[5],12:i:- isolates

Dear Dr. Clark:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Patrick Butaye

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. wgMLST UPGMA dendrogram showing clusters and associated isolate and case metadata.

    The dendrogram was created in BioNumerics v7.6.3 and annotated using Adobe Creative Cloud Illustrator CC.

    (PDF)

    S2 Fig. wgMLST dendrogram containing Salmonella 4,[5],12,i:- and S. Typhimurium isolates showing the genetic lesion responsible for the 4,[5],12,i:- genotype.

    The dendrogram was created in BioNumerics v7.6.3 and annotated using Adobe Creative Cloud Illustrator CC.

    (PDF)

    S3 Fig. Genes associated with the short prophage insertion replacing fljAB, its location in the genome, and N-terminal aa sequences resulting from inversion of the hin element.

    The figure was obtained using GView Server and annotated using Adobe Creative Cloud Illustrator CC.

    (PDF)

    S1 File. Fasta file of the short prophage inserted into Salmonella 4,[5],12:i:- genomes near the iroB locus.

    (FASTA)

    S1 Spreadsheet. Selected isolate metadata.

    (XLSX)

    S2 Spreadsheet. Quality statistics for WGS data for assembled genomes.

    The data were generated using tools within BioNumerics version 7.6.2.

    (XLSX)

    Attachment

    Submitted filename: Reply to Reviewers.docx

    Data Availability Statement

    The datasets generated or analyzed in the current study are available in the National Center for Biotechnology Information (NCBI) repository under BioProject PRJNA543337. Biosample numbers for each isolate can be found in S1 Spreadsheet.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES