Abstract
The food industry is facing a major transition regarding methods for confirmation, characterization, and subtyping of Salmonella. Whole-genome sequencing (WGS) is rapidly becoming both the method of choice and the gold standard for Salmonella subtyping; however, routine use of WGS by the food industry is often not feasible due to cost constraints or the need for rapid results. To facilitate selection of subtyping methods by the food industry, we present: (i) a comparison between classical serotyping and selected widely used molecular-based subtyping methods including pulsed-field gel electrophoresis, multilocus sequence typing, and WGS (including WGS-based serovar prediction) and (ii) a scoring system to evaluate and compare Salmonella subtyping assays. This literature-based assessment supports the superior discriminatory power of WGS for source tracking and root cause elimination in food safety incident; however, circumstances in which use of other subtyping methods may be warranted were also identified. This review provides practical guidance for the food industry and presents a starting point for further comparative evaluation of Salmonella characterization and subtyping methods.
Keywords: Salmonella, subtyping, serotyping, WGS, PFGE, MLST, food industry
Introduction
A number of food safety incidents and recalls caused by Salmonella contamination have been associated with ready-to-eat low-moisture products (e.g., milk powder, raw almonds, dry seasonings, and peanut butter) (Pillai and Ricke, 2002; Maciorowski et al., 2004; Park et al., 2008; GMA, 2009; Hanning et al., 2009), and other food commodities (e.g., meat products, eggs, and vegetables) (Greig and Ravel, 2009; Wu et al., 2017; Ricke et al., 2018) in recent years. These cases highlight the need to reinforce Salmonella control measures in the food industry, including rapid and accurate tracking of pathogen contamination sources with appropriate subtyping tools. Tools used in incident investigations that can differentiate Salmonella beyond the species level (defined as Salmonella subtyping) are essential to improve control of this pathogen, as Salmonella contamination can occur from diverse sources at any stage of food production (Olaimat and Holley, 2012; Barco et al., 2013; Shi et al., 2015).
Conventional serotyping (White–Kauffmann–Le minor scheme) has been used as a Salmonella subtyping method for >80 years (Salmonella Subcommittee of the Nomenclature Committee of the International Society for, Microbiology, 1934; Grimont and Weill, 2007; Guibourdenche et al., 2010; Dera-Tomaszewska, 2012; Shi et al., 2015) and has been a certified approach for public health monitoring of Salmonella infections for over 50 years (CDC, 2015). This method classifies the genus Salmonella into serovars (also known as “serotypes”) based on surface antigens including somatic (O), flagellar (H), and capsular (Vi) antigens (Brenner et al., 2000). More than 2,500 serovars of Salmonella enterica, the Salmonella species responsible for virtually all salmonellosis cases have been identified by conventional serotyping (Hadjinicolaou et al., 2009; Ferrari et al., 2017), but less than 100 serovars account for the vast majority of human infections (CDC, 2015). Due to the large variety of Salmonella serovars, a laboratory needs to maintain more than 250 different high-quality typing antisera and 350 different antigens for conventional serotyping of Salmonella (McQuiston et al., 2004; Fitzgerald et al., 2006). The turnaround time (i.e., time needed from isolate submission to a laboratory to receipt of the result) for serotyping a single isolate is usually >3 days. In some cases, it can take much longer (>12 days) as multiple antibody/agglutination reactions may be needed in a step-wise fashion to assign a final classification for complex serovars (Kim et al., 2006; Boxrud, 2010). Traditional serotyping is thus time-consuming and labor intensive requiring well-trained, experienced technicians (Boxrud, 2010; Shi et al., 2015). Unfortunately, it can also be imprecise (McQuiston et al., 2011). Moreover, the low discriminatory power of conventional serotyping may result in false-positive identification of relatedness between two unrelated isolates, as strains with the same serovar (such as the serovar Salmonella Enteritidis) may originate from multiple contamination sources. Further in-depth resolution beyond the serovar level is thus required for incident investigations (Ricke, 2017; Ricke et al., 2018). Various rapid molecular-based subtyping methods have been developed to provide faster, more discriminatory, and more accurate subtyping of Salmonella thus overcoming the limitations of traditional serotyping. Nevertheless, serovar data can still provide important historical epidemiological information, as certain serovars have specific virulence characteristics or may be associated with specific contamination sources (Ricke et al., 2018). Thus, it is important to link the subtypes identified by these molecular-based methods to Salmonella serovars.
There is no current global recommendation for the application of molecular characterization methods for Salmonella, although the food industry has applied both banding pattern-based and sequence-based subtyping methods for incident investigations. This review will provide (i) a comparison between classical serotyping and selected widely used molecular-based subtyping methods including pulsed-field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and whole-genome sequencing (WGS, including WGS-based serovar prediction), and (ii) a scoring system to evaluate and compare Salmonella subtyping assays.
Banding Pattern-Based and Sequencing-Based Characterization Methods for Salmonella
There are two major types of molecular-based subtyping methods: (i) nucleotide banding pattern-based subtyping methods, representing the banding patterns generated from the restriction digestion or polymerase chain reaction (PCR) amplification of genomic or plasmid DNA (Wachsmuth et al., 1991; Hartmann and West, 1997) and (ii) sequencing-based subtyping, identifying variants at the single-nucleotide level of the selected gene markers or the entire genome of an isolate. A comparison of the resolution, turnaround time, ability of serovar prediction, cost, and feasibility of these methods is given below (Table 1).
TABLE 1.
Method | Ability to identify or predict serovars | Ability to provide sensitive subtype discrimination | Time to results from a single colony | Commercial availability (time to results that can be expected from commercial labs) | Summary of value for food industry | Estimated reagent cost per isolate (instrument and labor cost not included)1 | Service cost per isolate (provided by commercial labs)1 |
Classical White–Kauffman serotyping | While Salmonella serovars are based on White–Kauffmann serotyping, serotyping does provide frequent misclassification (Petersen et al., 2002). | Very poor subtype discrimination; only valuable as subtyping method for rare and unusual serovars. | 2–17 days (usually >5 days) (ECDC, 2015; Bopp et al., 2016) | 2–4 weeks | Classical serotyping is likely to be replaced rapidly by WGS-based serovar prediction. Main value for industry is as a rapid confirmation and subtype screen if access exists to lab that can provide rapid turnaround time. | $5–65 (ECDC, 2015; Bopp et al., 2016) | ∼$175 |
Pulsed-field gel electrophoresis (PFGE) | Intermediate ability to predict serovars | Good subtyping discrimination for most serovars. Some PFGE patterns are very common within some serovars (e.g., Pattern 4 for Salmonella Enteritidis) | 4–6 days (ECDC, 2015; Bopp et al., 2016) | 2–3 weeks | Has been the gold standard subtyping method for Salmonella, is likely to be replaced rapidly by WGS, starting with public health authorities and food regulators. | $7–50 (ECDC, 2015; Bopp et al., 2016) | $130–200 |
Multiple locus variable number of tandem repeats (VNTR) analysis (MLVA) | Intermediate ability to predict serovars | Good subtyping discrimination for most serovars. May perform better than PFGE for some serovars but worse for others. | 1–2 days | NA2 | Has been used as a secondary subtyping method to compensate the low discriminatory power of serotyping and PFGE for some Salmonella serovars; it is likely to be replaced rapidly by WGS, starting with public health authorities and food regulators. | $9–36 (Amirkhanian et al., 2006; Top et al., 2008; Schouls et al., 2009; ECDC, 2015) | NA2 |
Legacy multilocus sequence typing (legacy MLST) | Intermediate ability to predict serovars | Better than conventional serotyping and riboprinting, worse than PFGE and WGS. | 1–2 days | 2–3 days | Main value for industry is as a rapid confirmation and subtype screen, can be used to select the reference genome for WGS data analysis. | $30–82 (Ranieri et al., 2013; Shi et al., 2015) | ∼$280 |
Whole-genome sequencing (WGS) | Currently available serovar-prediction software using WGS data work well for less common serovars. May not work for extremely rare serovars. | Best discrimination among molecular subtyping approaches | 3–17 days (ECDC, 2015) (depends on sequencing capabilities. Usually 1 day after sequencing is finished). | 2–8 weeks | For companies with high demand of isolates to be subtyped, WGS is probably the most affordable and fastest method that provides the best discrimination. In addition, in silico serotyping and in silico MLST can be done from the data to allow for comparison with historical isolates that have not been whole-genome-sequenced. Other information, such as presence of antibiotic resistance genes and virulence genes can be easily retrieved from the data. For companies with low demand, the costs of real-time sequencing may be prohibitive, requiring that old isolates must wait until more isolates are collected to be submitted together. | $60–230 (ECDC, 2015) | $100 (using Illumina HiSeq X series)–up to more than $500 (using Illumina MiSeq) |
1These cost estimates per isolate are based on (i) previous cost estimation reports and studies, (ii) official prices available on Internet, and/or (iii) personal communication with service providers and product vendors, as of June 2018, true costs may vary considerably based on number of isolates tested per run, labor costs, and region/country, etc. 2NA, not available.
Banding Pattern-Based Characterization Methods
Pulsed-Field Gel Electrophoresis (PFGE)
Pulsed-field gel electrophoresis was first described in 1984 and developed as a subtyping method for Salmonella in the 1990s (Threlfall and Frost, 1990; Figure 1). PFGE is currently the gold standard for PulseNet International, and has been used by public health authorities and food regulators for outbreak investigations and source tracking globally (including USCDC, USFDA, USDA, and ECDC) (Zou et al., 2010; Wattiau et al., 2011; PulseNet, 2014; CDC, 2016a). Alternative methods for Salmonella subtyping are commonly compared against PFGE (Call et al., 2008). However, PulseNet is transitioning from using PFGE and multiple locus variable number of tandem repeats analysis (MLVA) toward using WGS as the standardized genotyping method for foodborne pathogens (CDC, 2017a; Nadon et al., 2017). PulseNet International has defined standard PFGE protocols (PulseNet, 2013; CDC, 2017b) and maintains a database of Salmonella PFGE profiles with >350,000 PFGE patterns representing >500 serovars. These PFGE patterns predominantly represent isolates collected since 1996 in North America and Europe (Zou et al., 2013). PFGE has relatively high concordance with epidemiological relatedness with two decades of data accumulation (CDC, 2018a). However, the PulseNet database for PFGE patterns is not publicly available and can only be accessed by PulseNet participating laboratories.
The PFGE approach uses restriction enzymes that recognize specific restriction sites along the genomic DNA and fragment the DNA to sizes normally ranging from 20 to 800 kb (up to 2,000 kb) (Schwartz and Cantor, 1984; Singh et al., 2006). These large fragments are separated in a flat agarose gel by constantly changing the direction of the electric current (pulsed field), which causes the DNA to separate by size, generating a specific “fingerprint pattern” for a given isolate (Foley et al., 2009). The restriction enzymes XbaI, NotI, SpeI, and SfiI have been typically used for Gram-negative bacteria including Salmonella (Barg and Goering, 1993). The primary restriction enzyme used for Salmonella PFGE is XbaI. A public health laboratory usually has access to software [e.g., BioNumerics and GelCompar (Applied Maths, Sint-Martens-Latem, Belgium); Diversity Database Fingerprinting Software (Bio-Rad Laboratories, Hercules, CA, United States)], to analyze a PFGE pattern (Nsofor, 2016) and uploads PFGE patterns to a national database. PulseNet Central’s database managers then analyze the uploaded pattern to see if a new outbreak has emerged or whether the isolate is part of an ongoing outbreak (CDC, 2018a). To make inter-laboratory comparison of DNA patterns possible, standardized protocols, molecular size standards (Salmonella Braenderup H2812, ATCC BAA-664), software, and nomenclature of PFGE patterns are required (PulseNet, 2015a). The approximate cost of the equipment and reagents required by PFGE can be accessed on the PulseNet International – PFGE site (PulseNet, 2015b).
Pulsed-field gel electrophoresis has been shown repeatedly to be more discriminatory than methods such as conventional serotyping, ribotyping, or MLST for many bacteria (Fakhr et al., 2005; Harbottle et al., 2006; Oloya et al., 2009; Soyer et al., 2010; Hauser et al., 2012). The combination of profiles generated by using additional restriction enzymes can enhance the value of this method for differentiating highly homogeneous Salmonella strains (Zheng et al., 2011); however, the cost increases as additional enzymes are used. PFGE can be used for subtyping of both Gram-positive (e.g., Listeria monocytogenes, Staphylococcus aureus) and Gram-negative (e.g., Salmonella, Escherichia coli, Shigella, Campylobacter jejuni) pathogenic bacteria. Typically, only the choice of the restriction enzyme and conditions for electrophoresis need to be optimized depending on the bacterial species investigated (PulseNet, 2015a).
Although various software platforms are available for PFGE pattern analysis, artifacts (e.g., brightly fluorescing spot) may lead to misidentification of bands. PFGE technology cannot usually be used to reliably visualize smaller fragments (e.g., <20.5 kb; Hunter et al., 2005) and has difficulty in differentiating bands differing by <5–10% in size due to the limited resolution of electrophoresis (Dijkshoorn et al., 2001; Persing et al., 2011). To address these issues, it has been recommended that users confirm PFGE pattern assignments using their experience and additional information to avoid incorrect band calling and systematic band shifts due to gel imperfections or imperfect reproducibility of electrophoretic conditions (Van Belkum et al., 2007). PFGE cannot be automated and requires high-level technical expertise and, thus, is hampered by low throughput, and may show low robustness and poor comparability of results between laboratories (Hyytia-Trees et al., 2007; Fabre et al., 2012; Kjeldsen et al., 2016).
No genetic information such as virulence potential or presence of antimicrobial resistance genes can be provided by PFGE, as the DNA fragments are separated by size rather than sequence (Ferrari et al., 2017). Observed bands of comparable size might not represent the same sequence of DNA, and a small mutation in a restriction site may result in changes in multiple bands. “Relatedness” determined by PFGE thus may not represent a true phylogenetic relationship between isolates (CDC, 2018a). Typically, multiple distinct PFGE patterns can be identified among isolates classified into the same serovar. Polyphyletic serovars, which are derived from more than one common evolutionary ancestor or ancestral group (e.g., serovars Newport, Mississippi, Saintpaul, Kentucky), show high levels of PFGE diversity (Porwollik et al., 2004; Sukhnanand et al., 2005; Alcaine et al., 2006; Harbottle et al., 2006; Sangal et al., 2010). PFGE-based prediction of these serovars is unreliable if isolates in the database are not representative of all clades of the serovar. On the other hand, PFGE may cluster epidemiologically unrelated isolates into identical PFGE types (Barco et al., 2013) and may even provide similar or identical PFGE types for isolates that represent different, but genetically very similar serovars that have a common ancestor (Barco et al., 2013; Shi et al., 2015), such as Typhimurium (antigenic formula: 1,4,[5],12:i:1,2) versus Typhimurium var. Copenhagen (antigenic formula: 1,4,12:i:1,2) (Heisig et al., 1995; Hauser et al., 2011), and Typhimurium versus 4,5,12:i:- (Guerra et al., 2000; Soyer et al., 2009; Wiedmann and Nightingale, 2009; Hoelzer et al., 2010; Ranieri et al., 2013). Furthermore, differentiation of genetically homogeneous serovars such as serovar Enteritidis challenges the usefulness of PFGE in Salmonella subtyping activities (Olson et al., 2007; Zheng et al., 2007). Approximately 45% of serovar Enteritidis isolates reported to PulseNet display the same PFGE XbaI pattern (JEGX01.0004), although many of these isolates are not epidemiologically related (Zheng et al., 2007). It is important to mention that the serovars mentioned above (i.e., Enteritidis, Typhimurium, Newport, Mississippi, Saintpaul, and Kentucky) are ranked among the most common Salmonella serovars associated with human and animal salmonellosis globally (Galanis et al., 2006; CDC, 2009).
Multiple Locus Variable Number of Tandem Repeats Analysis (MLVA)
Multiple locus variable number of tandem repeats analysis is a PCR-based typing method originating from forensic science where it is used for DNA “fingerprinting” samples of human origin. It has frequently been applied to scientific studies of prokaryotes as well as to microbial outbreak detection and source tracking (Lindstedt et al., 2003, 2013; Figure 1). MLVA is the second major genotyping tool (after PFGE) used in the PulseNet network (PulseNet, 2015c); prior to WGS, MLVA was one of the most popular subtyping methods used in public health surveillance and outbreak investigation of Salmonella, particularly in Europe (Torpdahl et al., 2007; Hopkins et al., 2011; Barco et al., 2013; Bauer et al., 2013; Lindstedt et al., 2013; Mughini-Gras et al., 2018). MLVA is usually performed following serotyping or PFGE for routine surveillance as a complementary technique for Salmonella subtyping (Torpdahl et al., 2007; Lienemann et al., 2015; Kjeldsen et al., 2016; CDC, 2017c; Ferrari et al., 2017), as it is challenging for PFGE to further differentiate isolates of genetically homogeneous serovars such as Salmonella Enteritidis (Kjeldsen et al., 2016). MVLA is especially used for typing Salmonella Typhimurium and Salmonella Enteritidis strains in reference or regulatory laboratories in Denmark, France, Germany, and United States [e.g., CDC, USDA – Food Safety and Inspection Service (FSIS) laboratories] (Barco et al., 2013; Bauer et al., 2013).
Multiple locus variable number of tandem repeats analysis is serovar specific, thus different Salmonella serovars usually require different MLVA schemes (Kruy et al., 2011). The first step toward uniform standardization of the MLVA profiles was collectively taken by PulseNet International and ECDC in defining the standard protocols of MLVA for Salmonella Typhimurium and Salmonella Enteritidis (ECDC, 2011, 2016b; PulseNet, 2015c). These serovars account for 26% of the culture-confirmed human Salmonella infections reported by US Laboratory-based Enteric Disease Surveillance (LEDS) and >60% of the salmonellosis cases reported by ECDC (ECDC, 2016a; Kjeldsen et al., 2016; CDC, 2018b). This uniform standardization of the MLVA profiles allowed direct comparison between laboratories irrespective of the platform used for MLVA (Larsson et al., 2009). Validated MLVA standard protocols for additional Salmonella serovars of clinical importance worldwide are largely missing, making MLVA use for serovars other than Enteritidis and Typhimurium difficult. However, with the advent of and transition into WGS, further development of MLVA may not occur.
Multiple locus variable number of tandem repeats analysis assesses the variation in the number of tandem repeated DNA sequences referred to as “variable-number tandem repeats” (VNTRs) in multiple regions of the bacterial genome to characterize bacterial isolates. The number of VNTRs in a given locus may vary between different microorganisms and even among bacterial isolates of the same species and serovar (Lindstedt et al., 2003; Torpdahl et al., 2007; Ngoi et al., 2015). The VNTR profiles vary in length between a few base pairs long to over 100 base pairs, enabling the development of techniques that utilize variation in the size of VNTR to discriminate closely related isolates (Lindstedt et al., 2003; Torpdahl et al., 2007; Fabre et al., 2012). The improved discriminatory power of MLVA varies with the serovar and phage type investigated (Torpdahl et al., 2007; Lienemann et al., 2015); e.g., in a study in Denmark, MLVA could differentiate distinct clusters within the most common phage types of Salmonella Typhimurium such as DT104, DT120, and DT12 even though these isolates displayed comparable PFGE patterns (Torpdahl et al., 2007). Public health laboratories usually have access to software (e.g., BioNumerics, GeneMapper, the free Peak Scanner) for analysis of MLVA patterns (ECDC, 2011; PulseNet, 2015c). Minimum spanning trees are frequently applied to MLVA profiles, yielding maps of predicted relationships among isolates based on single-locus and dual-locus variants (Van Belkum et al., 2007). However, web-accessible MLVA databases are not widely used for international collaboration (Guigon et al., 2008).
Multiple locus variable number of tandem repeats analysis is cheaper, faster, simpler to execute, and shows a relatively high-throughput compared with other molecular methods (Torpdahl et al., 2005, 2007; Lindstedt et al., 2007, 2013; Hopkins et al., 2011; Kruy et al., 2011). MLVA is less labor-intensive, time-consuming, and it is easier to perform than PFGE and MLST, as the protocol requires only a regular PCR step followed by capillary electrophoresis (Torpdahl et al., 2007; Lindstedt et al., 2013). Reduced handling time of pathogenic bacteria is beneficial for large-scale investigations. MLVA is also suitable for automation using a pipetting robot work station, automated sequencer, and analytical software (Barco et al., 2013; Lindstedt et al., 2013; Ferrari et al., 2017). Moreover, MLVA demonstrates good international repeatability and reproducibility for specific serovars such as Salmonella Typhimurium and Salmonella Enteritidis (Larsson et al., 2013). The data generated by MLVA can be readily analyzed and standardized for inter-laboratory comparisons (Torpdahl et al., 2007; Hopkins et al., 2011; Lindstedt et al., 2013; Wuyts et al., 2013).
A major drawback of MLVA for Salmonella subtyping is that the most effective MLVA protocols described so far are serovar-specific (Barco et al., 2013; Ngoi et al., 2015; Kjeldsen et al., 2016); hence, isolates have to be serotyped prior to selecting a specific MLVA scheme for further subtyping (Kjeldsen et al., 2016). At least 27 MLVA schemes have been developed to subtype different Salmonella serovars, whereas only Salmonella Typhimurium and Salmonella Enteritidis MLVA assays have been standardized in Europe and in the PulseNet network (PulseNet, 2015c; Kjeldsen et al., 2016). Another drawback is that rapid evolution of the target loci may decrease the reliability of results provided by MLVA regarding the relationship between strains under investigation (Hopkins et al., 2007, 2011; Lindstedt et al., 2013). This might hamper the use of MLVA, particularly in long-term epidemiological studies (Lindstedt, 2005; Li et al., 2009).
Repetitive Element PCR (Rep-PCR)
Repetitive element PCR targets the repetitive elements of genomic DNA to discriminate bacterial isolates. This method has been developed using three families of repeat sequences for subtyping Salmonella, including “enterobacterial repetitive intergenic consensus” (ERIC) sequences, “the repetitive extragenic palindromic” (REP) sequences, and the “BOX” sequences (Gilson et al., 1990; Hulton et al., 1991; Martin et al., 1992). The PCR products amplified from genome regions containing these repetitive elements are analyzed by agarose gel electrophoresis, and the banding patterns generated are used to investigate the genetic relatedness between bacterial isolates (Sabat et al., 2013). The DiversiLab system (bioMérieux, Marcy-l’Etoile, France) automated the whole process of the Rep-PCR subtyping approach after 2000 and has been used for subtyping pathogens in hospitals worldwide (Healy et al., 2005; Chenu et al., 2012; Sabat et al., 2013; Figure 1). As the low reproducibility of original Rep-PCR method may have resulted from variability in reagents and gel electrophoresis systems (Sabat et al., 2013), the application of the DiversiLab system with microfluidic capillary electrophoresis increased both the resolution and reproducibility of the Rep-PCR approach (Healy et al., 2005; Chenu et al., 2012; Sabat et al., 2013). However, the system has been discontinued, making Rep-PCR unavailable as a commercial platform.
The major advantages of this method include its relatively low cost (comparable to that of PFGE) and short turnaround time (within one day) (Sabat et al., 2013; Ngoi et al., 2015). However, the discriminatory power of Rep-PCR in subtyping Salmonella is reportedly lower than that of PFGE (Tiong et al., 2010; Thong and Ang, 2011; Elemfareji and Thong, 2013; Ngoi et al., 2015). Its relatively low reproducibility (which can at least be partially addressed by automation, such as in the DiversiLab system), and low accuracy of serovar prediction (Weigel et al., 2004; Wise et al., 2009) have limited its application in Salmonella subtyping.
Sequencing-Based Characterization Methods
Legacy Multilocus Sequence Typing (Legacy MLST)
Multilocus sequence typing is a nucleotide sequence-based approach that assesses DNA sequence variations (i.e., allelic type) of typically three, four, or seven selected well-conserved, housekeeping genes, usually using Sanger sequencing technology (Liu, 2010; Achtman et al., 2012). Schemes targeting seven genes are typically considered the “classical” MLST approach; this typing approach was originally proposed for isolates of Neisseria meningitidis (Liu, 2010). In this review, we focus on the most widely used Salmonella scheme targeting seven housekeeping genes [aroC, dnaN, hemD, hisD, thrA, sucA, and purE; hereafter denoted as legacy MLST to distinguish newer approaches (described below)] (Li et al., 2009; Yun et al., 2015). It was first introduced for Salmonella Typhi in 2002 (Kidgell et al., 2002), and extended to all Salmonella serovars in 2012 (Achtman et al., 2012; Figure 1). Legacy MLST is mainly used in research studies, assessing the population genetics and evolution of Salmonella. Public Health England (PHE) started adopting the seven-gene MLST (based on WGS data) approach as a replacement for traditional serotyping in 2015 (Ashton et al., 2016).
Historical MLST data including legacy MLST sequence types are maintained on EnteroBase (Alikhan et al., 2018). As of November 2017, the number of legacy MLST sequence types for Salmonella has reached 3,929 (Alikhan et al., 2018). Legacy MLST analysis can be conducted online by entering the sequences of amplified genes. Allelic variation at each locus is cataloged and a sequence type is assigned by comparing the allele set. The strains are characterized by their unique sequence type. With the advent of next-generation-sequencing technologies, legacy MLST data can also be extracted directly from WGS data using bioinformatics pipelines (Achtman et al., 2012; Ashton et al., 2016). The relatedness of isolates subtyped by legacy MLST can be displayed as a dendrogram or a minimum spanning tree based on the matrix of pairwise differences between their allelic profiles (Francisco et al., 2009), or as a phylogenetic tree built directly from the nucleotide alignment of the seven genes.
Legacy MLST can deliver results more rapidly than PFGE (Shi et al., 2015; Yun et al., 2015; Table 1), and the publicly available databases and online query system enable legacy MLST results to be highly reproducible and exchangeable between laboratories. However, legacy MLST shows lower discriminatory power than PFGE and MLVA, which limits its application to further discriminate isolates within a given serovar (Torpdahl et al., 2005; Alcaine et al., 2006; Foley et al., 2006; Harbottle et al., 2006; Hauser et al., 2012; Ngoi et al., 2015), and for source attribution (Barco et al., 2013). Protocols targeting sequences in genes that change more rapidly than housekeeping genes have been developed to improve the discriminatory power of legacy MLST (Ross and Heuzenroeder, 2005, 2008).
Clustered Regularly Interspaced Short Palindromic Repeat-Based Subtyping (CRISPR-Based Subtyping)
The clustered regularly interspaced short palindromic repeat (CRISPR) typing method uses the diversity of the content of CRISPR loci to distinguish bacterial strains. The application of the CRISPR system for subtyping foodborne pathogens is discussed in detail elsewhere (Shariat and Dudley, 2014; Shi et al., 2015; Barrangou and Dudley, 2016; Ferrari et al., 2017; Ricke et al., 2018). Although the CRISPR system has been applied to the subtyping of at least 100 Salmonella serovars (Shariat and Dudley, 2014; Barrangou and Dudley, 2016), this approach is not widely used by public health authorities or food regulators (Ferrari et al., 2017).
Clustered regularly interspaced short palindromic repeat loci contain variable lengths of CRISPR spacers obtained from foreign nucleic acids of plasmids or bacteriophages (Shariat and Dudley, 2014; Wright et al., 2017). These CRISPR spacers are acquired or lost during evolution of the pathogen over time in a sequential manner (Ricke et al., 2018), thus constructing a unique set of DNA sequence patterns that may provide sufficient resolution for pathogen subtyping (Fricke et al., 2011; Barrangou and Horvath, 2012; Shariat and Dudley, 2014; Wright et al., 2017). For subtyping, amplified CRISPR loci PCR products are sequenced by Sanger sequencing technology (Liu et al., 2011). The CRISPR spacer sequences are analyzed to assign each locus with an allelic type. The combination of the allelic types of analyzed CRISPR loci determine the isolate’s allelic profile (also referred to as the isolate’s sequence type) and is used to investigate the relationships between isolates (Liu et al., 2011).
The CRISPR approach has been shown to be feasible for subtyping of Salmonella (Liu et al., 2011; Fabre et al., 2012; DiMarzio et al., 2013; Shariat et al., 2013a, b, c; Almeida et al., 2017). Liu et al. (2011) developed a CRISPR–multi-virulence-locus sequence typing (MVLST) approach using virulence genes sseL and fimH with CRISPR1 and CRISPR2 loci; this approach was used to compare 171 isolates representing nine serovars (Typhimurium, Enteritidis, Newport, Heidelberg, Javiana, I 4,[5],12:i:-, Montevideo, Muenchen, Saintpaul) and was reported to be able to subtype Salmonella with resolution at the outbreak level. CRISPR–MVLST using different schemes of virulence genes has also been applied by others for subtyping Salmonella (DiMarzio et al., 2013; Shariat et al., 2013a; Almeida et al., 2017). The results from these studies suggest that CRISPR–MVLST has a higher discriminatory power than legacy MLST (Ferrari et al., 2017); however, discrimination is lower than PFGE in some cases (Almeida et al., 2015). While CRISPR typing has a relatively short turnaround time (comparable to MLST), current major drawbacks include high cost (Almeida et al., 2017; Ferrari et al., 2017), unstandardized protocol, and database, as well as limited research on the concordance between the diversity of Salmonella isolates reflected by CRISPR loci content and by the other standard subtyping methods (Shi et al., 2015).
Whole-Genome Sequencing (WGS)
Whole-genome sequencing captures DNA sequence changes across the entire genome of single microbial isolates. The data are useful to assess evolution, allowing accurate description of the genetic relatedness of isolates. The use of WGS for Salmonella subtyping in outbreak investigation and pathogen source tracking has proven effective by a rapidly increasing number of studies (den Bakker et al., 2011, 2014; Allard et al., 2012; Leekitcharoenphon et al., 2014; Deng et al., 2015; Taylor et al., 2015; Hoffmann et al., 2016; Inns et al., 2016). WGS was first used to trace a Salmonella multistate outbreak in the United States in 2009 (CDC, 2019), and has been used for pathogen subtyping by the public health surveillance systems in the United States (Allard et al., 2018), Canada (Vincent et al., 2018), the United Kingdom (Ashton et al., 2016), Denmark (Kvistholm Jensen et al., 2016), and France (Moura et al., 2016). PulseNet international is also making efforts to implement WGS within the PulseNet network as a routine tool to replace PFGE and MLVA (Nadon et al., 2017; Figure 1). Both PHE (Ashton et al., 2016) and the US FDA (2018) have started using “real-time” WGS to subtype Salmonella isolates. CDC is also using WGS in state laboratories for Salmonella outbreak investigations (CDC, 2016b). WGS will be used increasingly for contamination incident investigations in the food industry, particularly as cost continues to shrink and ease of use increases. WGS (as well as other sequencing approaches that use the same next-generation sequencing technologies used for WGS) also have a number of additional applications in the food industry, which will further drive implementation of these tools. Examples of other applications include (i) monitoring ingredient supplies, (ii) identification of microbial persistence in processing environments, and (iii) prediction of antimicrobial resistance (including in Salmonella) and other relevant phenotypes, facilitating the improvement of sanitary management, microbial hazard control, and microbiological risk assessment (Allard et al., 2018; Rantsiou et al., 2018; Ricke et al., 2018).
Sequenced Salmonella genomes can be deposited and made publicly available on the National Center for Biotechnology Information site1, the European Bioinformatics Institute site2, or the DNA Data Bank of Japan site3 with data shared between all three (Kodama et al., 2012; Jagadeesan et al., 2019). NCBI provides phylogenetic tree-based clustering of all publicly available sequence data at the NCBI pathogen detection site4. These phylogenetic trees show the closest matches to any newly submitted data (Allard et al., 2018). NCBI also houses the data using GenomeTrakr Network (FDA, 2018). This was developed by the US FDA and NCBI as the first distributed network of laboratories to utilize WGS, with both genomic and geographic data, for foodborne pathogen characterization. This network includes the WGS laboratories of the CDC and USDA (Allard et al., 2016; Jackson et al., 2016). As of February 2019, there are over 184,000 genome sequences or raw sequencing data of S. enterica available on NCBI. WGS data analysis can also be performed off-line without using any public databases, an approach that may sometimes be preferred by industry.
Sequencing platforms that can be used currently for WGS include Illumina, Ion Torrent, Oxford Nanopore Technologies, and Pacific Biosciences (PacBio). Procedures to validate the complete workflow for S. enterica WGS with Illumina (MiSeq and HiSeq) and PacBio platforms from subculture of isolates to bioinformatics analysis have been reported by Portmann et al. (2018). The Illumina sequencing system is one of the most widely used sequencing platforms; it produces DNA-sequence reads with the length of 50–300 bp using sequencing-by-synthesis (SBS). This process uses fragmented DNA templates to detect single bases as they are incorporated during a DNA replication reaction on a solid surface flow cell (Illumina (2019)). For applications including comparative genomics and phylogeny, these short reads of DNA sequences can be aligned to a reference genome or de novo assembled into longer sequences called contigs (Loman and Pallen, 2015). The large amount of data generated by WGS combined with a complex data analysis process generally requires expertise in bioinformatics to deploy and run (Wyres et al., 2014; Deurenberg et al., 2017). Software with a more user-friendly interface, such as CLC Genomics Workbench5, BioNumerics, and Geneious (Biomatters, New Zealand), however, is available, including for industry users with limited bioinformatics expertise and an increasing number of user-friendly bioinformatics tools are being developed.
The rapid growth of WGS data in the publicly available databases allows industry to compare isolates with global entries of pathogen sequences used by food regulators and public health authorities (Allard et al., 2018; Rantsiou et al., 2018). Despite increasing availability of data analysis software, it is still challenging to generate consistent analytical reports due to the lack of standardized approaches to data analysis and interpretation (Clooney et al., 2016); for example, even with a standard software, choice of reference genomes can have considerable effects on the data analyses (Pightling et al., 2014). Furthermore, there are currently no clearly outlined safeguards to protect companies from regulatory action if shared WGS data show a relationship between pathogen isolates identified by a company and an outbreak isolate. Development of a mechanism for sharing data through anonymous hubs may allay concerns on confidentiality and encourage data sharing (FAO, 2016). This mechanism may also enable more effective data capture and analysis for monitoring trends and identifying related incidents.
The current cost of the entire WGS process, including DNA library preparation, sequencing, data analysis, and storage, is relatively high compared with the other molecular-based subtyping methods. The cost difference is more apparent when a small number of isolates are sequenced (as could be typical for the food industry). The cost of maintaining data analysis tools and bioinformatics personnel needs to be taken into consideration (Leekitcharoenphon et al., 2014; Ferrari et al., 2017; Nadon et al., 2017).
WGS-Analysis Procedures
Interpretation of WGS data for source tracking or outbreak investigation typically uses two approaches to represent results: (i) single-nucleotide polymorphism (SNP) or allelic differences (often presented as distance matrix tables), and (ii) phylogeny or clustering of the isolates. SNP or allelic differences show objectively the genetic distance between two isolates. Hence, if isolate A shows three SNPs or allelic differences to isolate B, and 26 SNPs or allelic differences to isolate C, then we can say that isolate A is more similar to isolate B than to isolate C. If one assumes that all three isolates evolved at the same rate, then we can say that isolates A and B are evolutionarily more closely related to each other than they are to C. However, this assumption (i.e., all isolates evolve at the same rate) may not always be true. Environmental conditions or mutations in the DNA repair system may influence the rate of genetic change accumulated in a genome; e.g., a Salmonella isolate persisting in a humid, nutritious environment such as in a chicken farm may multiply much faster than an isolate persisting in a dry food processing environment. This environmental difference will allow the “chicken farm” isolate to accumulate more mutations (per year or any other time unit) than the dry food processing environment isolate, because the “chicken farm” isolate will multiply more times during the same period than the dry food processing environment isolate. Moreover, mutations in genes involved in DNA repair may result in the so-called “mutator phenotypes” (also sometime referred to as “hypermutators”). Mutator isolates accumulate mutations at a higher rate than non-mutator isolates (Muteeb and Sen, 2010). Hence, analyzing the number of SNP or allelic differences alone may result in misinterpretation of the results if the assumption that isolates evolved at the same rate does not hold true. Phylogenetic or clustering analyses are thus better suited to an investigation, as these analyses group isolates by their similarities instead of their differences (Pightling et al., 2018). To infer the evolutionary relationship of the isolates within a data set, therefore, a phylogeny must be constructed. For more detailed and technical information on reconstructing bacterial phylogenies from WGS data, the reader is referred to two in-depth reviews on this subject (Collins and Xavier, 2017; Patané et al., 2018).
WGS Analysis Approaches for Serotyping
Genetic-based approaches have been developed for in silico determination of serovars, because the phenotypic determination of Salmonella serovars is costly, time-consuming, and labor-intensive. These in silico methods have relied on two main approaches: (i) indirect determination using genetic markers associated with particular serovars and (ii) direct determination using genes responsible for the expression of the somatic O (rfb gene cluster) and flagellar H (fljB and fliC) antigens. The latter method has the advantage of relying on the same genetic information that results in the phenotype assessed by traditional serotyping, while the former method may require validation for new described serovars. These two approaches can also be combined for more reliable serovar prediction.
With the advent of whole-genome sequencing (WGS), in silico direct serovar determination has become the most used approach, and at least two Salmonella serovar databases and programs have been routinely used for in silico serotyping of Salmonella: SeqSero (Zhang et al., 2015) and SISTR (Yoshida et al., 2016a). SeqSero uses a database of 473 alleles representing 56 fliC antigenic types and 190 alleles representing 18 fljB antigenic types in a combined H-antigen database (Zhang et al., 2015). The somatic O-antigen database associated with SeqSero consists of 46 rfb gene cluster sequences corresponding to the 46 O-antigens identified in Salmonella (Zhang et al., 2015). The rfb database was specifically designed to be used with genome assemblies (as opposed to raw sequencing reads). A third database was specifically built for determination of the somatic O-antigen using raw sequencing reads (as opposed to genome assemblies). This third database consists of the genes wzx (encoding the O-antigen flipase), wzy (encoding the O-antigen polymerase), and other targets, all of which are found within the rfb gene cluster. In total, the authors claimed that the SeqSero scheme can theoretically identify 2,389 of the 2,577 serovars that were described in the White–Kauffmann–Le minor scheme by the end of 2014 (Zhang et al., 2015). The inability to predict 188 serovars is due to the absence of the DNA sequences for the antigen-encoding genes corresponding to these serovars in the SeqSero database. Empirical data showed that the SeqSero database has an accuracy of 91.5–92.6% for serotype prediction (Zhang et al., 2015).
SISTR is a platform for in silico analysis of Salmonella draft genome assemblies. SISTR includes the Salmonella Genoserotyping Array (SGSA) tool among other resources. SGSA relies on the allelic differences found within the rfb gene cluster for determination of 18 of the 46 somatic O-antigens, and fljB and fliC for determination of 41 flagellar H antigens (Yoshida et al., 2014). SGSA targets the identification of 90% (n = 2,190) of Salmonella serovars. When serovar determination using genoserotyping is not possible or is incomplete, SISTR also has the option to use the core genome MLST (cgMLST) scheme to infer the serovar based on phylogenetic context. The accuracy of SISTR in predicting Salmonella serovars has been assessed to be close to 95% (Yoshida et al., 2016a, b; Robertson et al., 2018).
Since SISTR can use genoserotyping and the cgMLST scheme to infer the serovar, higher confidence should be attributed to assignments where both genoserotyping and cgMLST agree on the serovar designation. Moderate confidence should be attributed to serovar assignments when only cgMLST is able to identify the serovar. When neither the genoserotyping nor cgMLST can identify the serovar, SeqSero may be used and may allow for serovar prediction.
WGS Analyses for Subtype Characterization
Overview of WGS data analysis approaches
Different approaches can be used for analysis of WGS data for subtyping characterization related to source incident tracking. The most common approaches are based on (i) high-quality single-nucleotide polymorphism (hqSNP) identification and pairwise comparison of hqSNP differences, or (ii) whole-genome (wg)/cgMLST typing using pre-defined schemes (i.e., databases) containing allelic differences for either the pan (wg) or core (cg) genomes of Salmonella and subsequent pairwise comparison for assessing the number of allelic differences.
High-quality SNP analyses
High-quality SNP analyses rely on identification of SNP differences across a set of closely related isolates using raw sequence data, which are mapped to a closed or draft genome assembly (also referred to as the “reference genome”). Only SNPs that have been vertically transferred from an ancestral isolate to the current isolates are subject to the hqSNP analysis, while SNPs that were supposedly horizontally transferred are filtered out from the results. The reference can be a closely related genome outside the dataset, or a genome within the dataset. The analysis consists of two main steps: (i) mapping the raw sequence reads against the reference genome and (ii) SNP calling using stringent criteria to prevent the misidentification of sequencing errors or misaligned regions as SNPs (Davis et al., 2015; Katz et al., 2017). The choice of a closely related reference has been shown to be a key step in the analysis. Reference genomes that are not closely related to the set of isolates under investigation may result in underestimation of the number of SNPs, due to specific regions of the genome that may be present in the dataset under investigation, but that are missing in the reference genome (Pightling et al., 2014). There are at least two publicly available approaches that have been commonly used for hqSNP analysis: (i) the US FDA CFSAN (The Center for Food Safety and Applied Nutrition) SNP pipeline (Davis et al., 2015) and (ii) the US CDC-developed Lyve-SET hqSNP pipeline (Katz et al., 2017). These two pipelines rely on publicly available software to carry out the mapping and SNP calling steps and offer similar results despite some methodological differences, including different criteria for filtering out low-quality SNPs and masking regions supposedly acquired through horizontal gene transfer.
High-quality SNP analysis has been applied in several outbreak investigations in the United States, Canada, and some European countries, including a Salmonella Enteritidis outbreak in the United Kingdom that was linked to a German egg producer (Inns et al., 2015). Historical Salmonella Typhimurium isolates from humans and foods involved in five outbreaks and consisting of five distinct MLVA subtypes were re-analyzed using hqSNP analysis by Octavia et al. (2015); in this study at least 11 isolates not previously linked to the outbreaks were ruled in based on less than two SNP differences to the isolates previously linked to the outbreaks. Another retrospective study used hqSNP to analyze a collection of 55 Salmonella Enteritidis from seven epidemiologically characterized outbreaks and sporadic cases. One isolate not previously linked to any outbreak (i.e., sporadic) was identified to be part of one outbreak (“ruled in”) (Taylor et al., 2015). An investigation into a multi-state outbreak caused by Salmonella Poona was carried out in 2015 using PFGE and hqSNP analysis. Analysis by PFGE demonstrated three different patterns. However, WGS results showed that isolates with different PFGE patterns were genetically linked with less than six SNP differences (Kozyreva et al., 2016). Subtyping of Salmonella Dublin with PFGE was shown to have limited value in a recent outbreak investigation due to its low discriminatory power for this Salmonella serovar (Mohammed et al., 2016). The nine clinical isolates associated with the outbreak were indistinguishable by PFGE, but they were also indistinguishable from other unrelated Salmonella Dublin isolates. The nine isolates linked to the outbreak clustered together with one to nine SNP differences when analyzed using hqSNP, and they could be distinguished from other isolates that shared the same PFGE pattern with epidemiologically unrelated isolates showing more than 50 SNP differences when compared to the outbreak isolates (Mohammed et al., 2016). These studies show that public health agencies are increasingly relying on hqSNP analysis for outbreak investigation, including tracking the source of outbreaks. High-quality SNP analysis clearly improves subtype accuracy and outbreak investigations by not only allowing for increased discriminatory power, but also reducing instances where closely related isolates are being classified as “different.”
wgMLST
Whole-genome MLST (wgMLST) analysis relies on the comparison of individual genomes against a database containing all known alleles for all the genes representing the pan genome of a defined group of strains (i.e., serovar, subspecies, species, and genus). The pan genome is defined as all the genes present in at least one genome from a defined group. Two main approaches can be used, and these are often used in combination: (i) assembly free mapping and (ii) assembly based mapping. Raw sequencing reads are directly mapped against the database in an assembly free approach. Hence, this approach does not require de novo assembly of the genome prior to its utilization. SRST2 (Inouye et al., 2014) and BWA-MEM (Li, 2013) are the most commonly used programs to carry out this task. Because this approach deals directly with the raw sequence reads, it allows filtering low-quality reads or specific nucleotides with low quality within a good-quality read. In an assembly based approach the raw sequence reads are first used to generate a high-quality draft genome (i.e., usually not a closed genome) using a genome assembler. Later, the draft genome (i.e., assembly) is used to find matches against the database. The program most commonly used to map the draft genome against the database is BLASTN (Altschul et al., 1990), although other options also exist. Independently from the approach used (i.e., assembly free or assembly based), the result of mapping a genome against a database is a list of the alleles found in the analyzed genome. When more than one genome is analyzed, the list of alleles from each genome can be compared and the number of allele differences can be computed. Assembly free and assembly based wgMLST allele assignment should match for high confidence. Results are often shown as a distance matrix of allele differences and a dendrogram constructed from this distance matrix. The wgMLST methods allow for comparison of non-closely related isolates from different groups since all genomes are compared against the same database, which is a great advantage of this method over hqSNP (Maiden et al., 2013; Nadon et al., 2017). A disadvantage of the method is that the database must be constructed and shared across different groups, who must agree in using the same database in order to make their results comparable (Nadon et al., 2017). Construction of such databases is also time-consuming and labor-intensive, with the difficulty increasing with the diversity of the organisms included in the same database (e.g., a database for S. enterica subspecies enterica serovar Agona will require less time and labor than a database for all S. enterica).
Core genome MLST (cgMLST)
The cgMLST method is very similar to the wgMLST method. The major difference is the size and nature of the database. While the wgMLST database contains alleles for all genes in the pan genome of the defined group, the cgMLST only contains alleles for those genes that are present in all (or almost all) genomes of the defined group (i.e., the “core genome”). Hence, a cgMLST database will not capture the genetic diversity present in the accessory genes (i.e., genes that are not present in all isolates) and hence tends to be much smaller than a corresponding wgMLST database. The advantages of using the cgMLST are: (i) speed; because the cgMLST database is smaller than the wgMLST database, results can be obtained faster, and (ii) construction of the cgMLST database is generally easier than the wgMLST database, as typically less genomes are needed to identify the core genome than the pan genome of a group (den Bakker et al., 2010). While allele code schemes are used by some groups to summarize the differences observed among isolates subtyped by both cgMLST and wgMLST (Nadon et al., 2017), it generally is easier to define standard, stable, cgMLST allele codes. This allele code scheme can be easily transferred in a spreadsheet and can be interpreted similarly to what has been in use for PFGE. An allele code scheme may not, however, be fully stable and may need to be revised as new cg- or wgMLST types are identified (Nadon et al., 2017). A disadvantage of cgMLST is that it may show reduced discriminatory power over wgMLST, as shown in a comparison between the Salmonella cgMLST and wgMLST schemes defined in EnteroBase (Alikhan et al., 2018), carried out using Salmonella Enteritidis historical isolates from a UK egg-associated outbreak (Inns et al., 2015), as well as closely related non-outbreak isolates identified previously (Dallman et al., 2016). The 177 isolates from this dataset resulted in 177 unique sequence types by wgMLST (Simpson’s diversity index = 1.00) and 137 unique sequence types by cgMLST (Simpson’s diversity index = 0.98) (P < 0.05), showing the superior discriminatory power of wgMLST over cgMLST. However, both approaches grouped the isolates into identical clusters (Pearce et al., 2018).
Comparison of hqSNP-based analysis and genomic MLST analysis
Theoretically, hqSNP analysis is the most discriminatory approach for molecular subtyping, as it investigates all possible SNPs between each pair of isolates in the dataset. The second most discriminatory approach is wgMLST, which is designed to investigate virtually all genes in the genomes; intergenic regions and genes not present in the wgMLST scheme will not be investigated and polymorphisms present in these regions will be missed. The cgMLST approach is the least discriminatory of the three as it relies on only a subset of the genes present in the wgMLST scheme. Hence, similarly to the wgMLST approach, polymorphisms present in intergenic regions and in genes not included in the cgMLST scheme will not be assessed (Chen et al., 2017). Both wgMLST and cgMLST are reference-independent which makes the results more reproducible and transferable than hqSNP analysis (Nadon et al., 2017). In order to reproduce the results obtained from hqSNP analysis, one needs to use the same reference and parameters that were used in the original analysis (Nadon et al., 2017). This is not an issue with wgMLST or cgMLST analysis as long as analyses use the same scheme containing the same genes and alleles to allow for comparisons. Transference and communication of the results also seem to be more complicated for hqSNP analysis than for cgMLST or wgMLST (Nadon et al., 2017). This is because hqSNP analysis, as compared to cgMLST or wgMLST analyses, requires more parameter settings, which must be communicated for better interpretation. wgMLST and cgMLST analyses are also typically integrated into commercially available software, while the hqSNP pipelines are available as free open software or integrated into commercial software. Free-of-charge hqSNP pipelines require UNIX-based systems and are run through the command line, which may require specialized expertise (Nadon et al., 2017). Commercially available software, which can run cgMLST and wgMLST (e.g., BioNumerics) tends to be more user-friendly. BioNumerics uses a graphical user interface and can be installed in Microsoft Windows computers. The hqSNP analysis can easily be kept private as the analysis can be run within a closed dataset of genomes. The cgMLST and wgMLST can also be kept private; however, it may require some additional infrastructure (i.e., a private cloud) to be built around the commercial software.
Comparison of Molecular Methods for Predicting the Serovar of Salmonella
A comparison of different molecular methods for predicting the serovar of Salmonella is shown Table 2. Acceptable correlation between PFGE patterns and serovars has been described by several researchers (Weigel et al., 2004; Nde et al., 2006; Gaul et al., 2007; Kerouanton et al., 2007; Zou et al., 2010; Shi et al., 2015; Bopp et al., 2016). Shi et al. (2015) summarized the serovar-prediction accuracy of different molecular serotyping methods with studies from 1993 to 2013. The proportion of isolates that may not be accurately serotyped with PFGE is generally comparable to the proportion that is not typeable, or that requires extensive additional labor and reagents using conventional serotyping (Bopp et al., 2016). Examples of serovars incorrectly predicted by PFGE are summarized below (Table 3). Overall, with PFGE patterns for approx. 500 Salmonella serovars in the PFGE pattern database (Ranieri et al., 2013; Shi et al., 2015) and the reported good correlation between PFGE patterns and serovars, PFGE-based serovar prediction should be possible for a large proportion of these serovars, but will not be possible for a large number of less common serovars not represented in the database.
TABLE 2.
Number of isolates tested | Number of serovars tested | Isolate sources | Serovar-prediction accuracy (%) | References |
PFGE | ||||
80 | 6 | Turkey processing plant | 99 | Nde et al.,2006 |
68 | 10 | Swine farms | 84 | Weigel et al.,2004 |
674 | 12 | Swine | 85 | Gaul et al.,2007 |
866 | 8 | Food animals, production facilities, and clinical samples | 96 | Zou et al.,2010 |
1,128 | 31 | Food, animals, humans, natural environment, and processing plants | 97 | Kerouanton et al.,2007 |
46 | 40 | Human and cattle | 75 | Ranieri et al.,2013 |
1,486 | 110 | New York State Department of Health, isolates received in 2012; human clinics | 96 | Bopp et al.,2016 |
1,437 | 131 | New York State Department of Health, isolates received in 2013; human clinics | 91 | Bopp et al.,2016 |
1,558 | 107 | New York State Department of Health, isolates received in 2014; human clinics | 90 | Bopp et al.,2016 |
Legacy MLST | ||||
25 | 7 | Chickens | 92 | Liu,2010 |
66 | 1 | Cattle, birds, horses, and other animals | 99 | Sukhnanand et al.,2005 |
110 | 25 | Human and veterinary source | 98 | Torpdahl et al.,2005 |
152 | 33 | Reference collection | 100 | Ben-Darif et al.,2010 |
4,257 | 554 | Reference collection | 88 | Achtman et al.,2012 |
46 | 40 | Human and cattle | 91 | Ranieri et al.,2013 |
42,400 | 624 | SRA collection | 91 | Robertson et al.,2018 |
7,338 | 263 | Human | 96 | Ashton et al.,2016 |
WGS-(SeqSero) | ||||
308 | 72 | CDC collection | 99 | Zhang et al.,2015 |
3,306 | 228 | Genome Trakr collection | 93 | Zhang et al.,2015 |
354 | 44 | GenBank collection | 92 | Zhang et al.,2015 |
WGS-(SISTR) | ||||
4,291 | 246 | SRA and NCBI Assembly collections | 95 | Yoshida et al.,2016a |
42,400 | 624 | SRA collection | 97 | Robertson et al.,2018 |
1This table is revised from the information provided by the review of Shi et al. (2015).
TABLE 3.
Major incorrectly predicted serovars | “O” antigens | Phase 1 “H” antigens | Phase 2 “H” antigens | References |
Montevideo (clustered with Senftenberg) | 6,7 | g,m,s | No phase 2 antigen | Nde et al.,2006 |
Senftenberg (clustered with Montevideo) | 1,3,19 | g,s,t | No phase 2 antigen | Nde et al.,2006 |
Typhimurium var. Copenhagen (clustered with 4,[5],12:i:- and Typhimurium) | 1,4,12 | I | 1,2 | Gaul et al.,2007 |
4,5,12:i:- (clustered with Typhimurium var. Copenhagen and Typhimurium) | 4,5,12 | I | No phase 2 antigen | Gaul et al.,2007 |
Typhimurium (clustered with Typhimurium var. Copenhagen and 4,[5],12:i:-) | 1,4,5,12 | I | 1,2 | Gaul et al.,2007 |
Saintpaul (clustered with Typhimurium var. Copenhagen and Typhimurium) | 1,4,5,12 | e,h | 1,2 | Ranieri et al.,2013 |
Putten (clustered with Agona) | 13, 23 | D | l, w | Gaul et al.,2007 |
Agona (clustered with Putten) | 4,12 | f,g,s | No phase 2 antigen | Gaul et al.,2007 |
Paratyphi B | 1,4,5,12 | B | 1,2 | Kerouanton et al.,2007 |
Give | 3,10 | l,v | 1,7 | Kerouanton et al.,2007 |
Newport | 6,8 | e,h | 1,2 | Kerouanton et al.,2007 |
Multiple locus variable number of tandem repeats analysis is not widely used for serovar prediction even though efforts have been made to develop MLVA subtyping schemes to subtype multiple serovars of Salmonella with one protocol (Van Cuyck et al., 2011; Kjeldsen et al., 2016). A universal MLVA scheme for most frequently isolated Salmonella serovars (accounting for 80% of the clinical isolates from humans in Europe) has been developed by Kjeldsen et al. (2016). In another study, an MLVA scheme identified 31 serovars (Van Cuyck et al., 2011). Nevertheless, further development of multiple-serovar MLVA schemes and robust MLVA profile databases is unlikely to occur given the benefits offered by WGS.
The serovar-prediction accuracy of Rep-PCR has been reported to range between 0 and 100%, indicating some limitations of this method (Shi et al., 2015). Ranieri et al. (2013) showed that Rep-PCR accurately predicted the serovar of 30 out of 46 isolates representing the top 40 Salmonella serovars isolated from human and non-human sources, with an accuracy of 65%. This accuracy was relatively lower than that obtained with PFGE or MLST, when the same set of isolates were evaluated.
Ashton et al. (2016) compared the serovars predicted by using legacy MLST sequences extracted from WGS data to the results generated by conventional serotyping, for 7,338 isolates representing 263 serovars of Salmonella enterica subspecies I. The 10 most common serovars in this S. enterica subspecies I dataset were serovars Enteritidis, Typhimurium, Infantis, Typhi, Newport, Virchow, Kentucky, Stanley, Paratyphi A, and Java. They found that the serovar prediction accuracy of legacy MLST was 96%.
The overall serovar-prediction accuracy for the CRISPR subtyping approach has been reported to range from 78 to 90% (Liu et al., 2011; Fabre et al., 2012; Shi et al., 2015). More studies are needed to further assess serovar-prediction accuracy using CRISPR.
Given the range of serovars represented in the SeqSero and SISTR databases, WGS can be used to theoretically predict 2,389 and 2,190 of the 2,577 serovars described in the White–Kauffmann–Le minor when using the serovar prediction programs SeqSero (Zhang et al., 2015) and SISTR (Yoshida et al., 2016a), respectively. Using empirical data, the accuracy of serotype prediction with SeqSero and SISTR has been reported to be approx. 92 and 95%, respectively (Zhang et al., 2015; Yoshida et al., 2016a, b; Robertson et al., 2018). By comparison, traditional Salmonella serotyping had an accuracy of 73% when 33–36 independent laboratories performed serotyping of the same eight Salmonella strains representing seven different serovars (Petersen et al., 2002), suggesting that WGS-based methods may be more reliable than traditional serotyping to assign Salmonella isolates to serovars. Nevertheless, further experimental studies are needed to continue to quantify the ability of WGS-based methods to identify Salmonella serovars.
Comparison of Molecular Methods for Subtype Differentiation of Salmonella
Molecular methods are used for subtyping Salmonella isolates that belong to the same serovar, as well as being used for serovar prediction. This section briefly provides some examples of comparative studies of subtyping methods. In one study, PFGE was compared to MLVA to subtype 163 non-typhoidal Salmonella isolates representing 15 serovars; MLVA differentiated the isolates into 79 MLVA subtypes while PFGE differentiated the same isolates into 87 subtypes. The Nei’s diversity index for MLVA was 0.979 compared to 0.999 for PFGE (Kjeldsen et al., 2016). However, for specific serovars (e.g., Salmonella Enteritidis) MLVA has been reported to provide improved discriminatory power over PFGE (Boxrud et al., 2007; Beranek et al., 2009; De Cesare et al., 2015). MLST has the advantage of being highly reproducible and easily transferable among laboratories. However, in a study of 110 Salmonella isolates from 25 serovars (Torpdahl et al., 2005), MLST resulted in 43 sequence types, while PFGE was able to differentiate the isolates into 73 PFGE subtypes. The downside of PFGE in this study was the inability to type 11 of the 110 (10%) isolates. In a study comparing different molecular methods to differentiate 52 Salmonella Enteritidis isolates, PFGE resulted in eight subtypes, while MLVA resulted in 18 subtypes and WGS resulted in 34 subtypes. The discriminatory power of PFGE, MLVA, and WGS was 0.81, 0.92, and 0.97 (Simpson’s index of diversity), respectively (Deng et al., 2015). In another study, PFGE and WGS were used to differentiate 55 Salmonella Enteritidis isolates; PFGE resulted in 10 subtypes; however, WGS was able to further differentiate the isolates into 45 unique subtypes (Taylor et al., 2015), showing the greater discriminatory power of WGS over PFGE. In a study of isolates from a Salmonella Poona outbreak (Kozyreva et al., 2016), 4 PFGE subtypes and 7 WGS subtypes were observed among the 16 isolates; in silico MLST using the WGS data resulted in one MLST sequence type. Phylogenetic analysis using WGS data showed that the distinct PFGE types did not necessarily correlate with increased genetic distance between isolates. Isolates that differed by 0 SNPs showed distinct PFGE subtypes, suggesting that PFGE results would be misleading for these isolates (Kozyreva et al., 2016). While the relative discriminatory power of different subtyping methods depends on the strains and serovars tested, WGS methods were consistently found to be most discriminatory, followed by PFGE. While some MLVA schemes provide enhanced discriminatory power over PFGE for some serovars, for other serovars PFGE may be more discriminatory than MLVA.
Criteria to Evaluate and Validate Different Salmonella Characterization Methods
Molecular-based Salmonella characterization methods including WGS are evolving very fast. Many of the characterization methods and technologies, as well as data analysis pipelines, are operated as research tools, and are under continuous development. Evaluation of these tools for Salmonella investigation, especially for those serovars/strains highly relevant to food products and processing environments, is pre-requisite for the implementation of these methods. Methods that can be used by the food industry must be thoroughly validated before implementation to ensure reliability and consistency of the method when it is used across different laboratories. Validation should cover the end-to-end workflow for source tracking from isolate subculture to bioinformatic analysis, articulating the key quality requirements and criteria (Ferrari et al., 2017; Nadon et al., 2017; Portmann et al., 2018). Proposed criteria for evaluation of Salmonella characterization methods for potential routine use in the food industry are shown below (Table 4).
TABLE 4.
Key criteria for evaluation | Description | Target | Key factors affecting performance | Quantitative evaluation (scale of 0–5) |
Stability | Consistency of the typing result for an isolate after its primary isolation and during laboratory storage and subculture. | Typing results should be stable during laboratory storage and subculture; strain markers should not mutate too rapidly to change the strain’s position in the epidemiological context; data on the stability of the markers should be available. | Rapid mutations and recombination of the marker(s) during storage and subculture could lead to poor reproducibility. | 0 – Extremely poor stability 1 – No data are available on stability 3 – Some limited data suggest that markers are stable 5 – Strong data are available supporting stability of markers (and/or data are available that can be used to correct for mutations or changes in markers during passage). |
Typeability | Ability to assign a type to all isolates tested by it. | Typeability should be as high as possible. | Poor typeability could be found in assays using a scheme that does not cover genetic variation in full; typeability may also be reduced if some isolates show high endogenous nuclease activity. | 0 – Extremely poor typeability (<80%) 1 – Data indicate between 80 and 90% typeability; or no evaluation of typeability performed 2 – Data indicate between 90 and 93% typeability 3 – Data indicate between 94 and 96% typeability 4 – Data indicate between 97 and 99% typeability 5 – Data indicate >99% typeability. |
Discriminatory power | Ability to assign a different type to two unrelated strains; discriminatory power can be expressed using Simpson’s index of diversity (SID) | Discriminatory power should be as high as possible. For highly discriminatory methods, clustering using phylogenetic analysis tools can be used to define isolates that share a recent common ancestor. | Discriminatory power is highly dependent on the marker(s) selected for typing. | 0 – Extremely poor discriminatory power (<80%, SID <0.80) 1 – Data indicate between 80 and 90% discriminatory power (SID 0.80–0.90); or no evaluation of discriminatory power performed 2 – Data indicate between 90 and 93% discriminatory power (SID 0.90–0.93) 3 – Data indicate between 94 and 96% discriminatory power (SID 0.94–0.96) 4 – Data indicate between 97 and 99% discriminatory power (SID 0.97–0.99) 5 – Data indicate >99% discriminatory power (i.e., SID > 0.99). Note: we recommend that data are generated using appropriate strain collection and >100 isolates. |
Epidemiological concordance | Ability to reflect, agree with, and possibly further illuminate the available epidemiological information about the cases under study. | Epidemiological concordance should be as high as possible; strains from the same outbreak or strains that are otherwise linked by epidemiological evidence should be classified into the same subtype (or phylogenetically characterized as sharing a recent common ancestor). | Low epidemiological concordance could be found in assays that either target “low stability markers” or an assay with limited discriminatory power, which will group together isolates that are epidemiologically unrelated. | 0 – Extremely poor epidemiological concordance; <80% isolates are classified correctly. 1 – Poor epidemiological concordance; data indicate between 80 and 90% isolates are classified correctly; or no evaluation of epidemiological concordance 2 – Low epidemiological concordance; data indicate between 90 and 93% isolates are classified correctly 3 – Intermediate level of epidemiological concordance; data indicate between 94 and 96% isolates are classified correctly) 4 – Good epidemiological concordance; data indicate between 97 and 99% isolates are classified correctly 5 – Strong epidemiological concordance; data indicate all isolates are classified correctly Note: we recommend that data are generated by using at least 20 sets of epidemiologically related isolates. Ideally, a given subtyping method classifies all of these isolates correctly. |
Reproducibility | Ability to perform reproducibly in different laboratories and with different personnel. | Results should be highly reproducible (>99%). | Poor reproducibility could be the results of (i) technically difficult assay (leading to technical errors by personnel, e.g., cross-contamination), (ii) reagents not standardized sufficiently, (iii) equipment not performing reproducibly, (iv) poorly optimized typing system, (v) sensitivity of equipment or assay system to environmental factors (e.g., humidity, temperature), (vi) bias in observing, recording, analysis, and interpretation of the results; (vii) or assays targeting biologically highly variable markers (e.g., some of the surface antigens targeted by classical serotyping). | 0 – Extremely poor reproducibility; <80%; meaning for >20% of isolates results are not reproducible between labs 1 – Poor reproducibility; data indicate between 80 and 90% of isolates results are reproducible between labs 2 – Low reproducibility; data indicate between 91 and 93% of isolates results are reproducible between labs 3 – Intermediate reproducibility; data indicate between 94 and 96% of isolates results are reproducible between labs 4 – Good reproducibility; data indicate between 97 and 99% of isolates results are reproducible between labs 5 – Strong reproducibility; data indicate >99% of isolates results are reproducible between labs Note: we recommend that data are generated based on an evaluation by at least four laboratories. |
Repeatability | Ability to produce the same results in the same laboratory with the same equipment and personnel | Results should be highly repeatable ( > 99%) | Poor repeatability could be the result of i) technically difficult assay (leading to technical errors by personnel, e.g., cross-contamination), ii) reagents not standardized sufficiently, iii) equipment not performing reproducibly. | 0 – Extremely low repeatability (<90%; meaning for >10% of isolates results are not repeatable) 1 – No evaluation of repeatability performed 2 – Data indicate between 90 and 93% repeatability 3 – Data indicate between 94 and 96% repeatability; or repeatability evaluated with small number of isolates (<40) 4 – Data indicate between 97 and 99% repeatability 5 – Data indicate >99% repeatability Note: we recommend that repeatability evaluation performed with at least 40 isolates, ideally with 100 isolates. |
Serovar prediction ability | Ability to accurately predict the serovar of a given strain. | Range, as the number of identifiable serovars, and accuracy (i.e., percentage of isolates with correct serovar identification) should be maximized. Accuracy should be given priority over range as misclassification may lead to worse decisions than non-classification. | Poor serovar prediction could be a result of (i) limited database coverage of different serovars, (ii) low discriminatory power, (iii) low typeability, (iv) no standard protocol of serovar prediction with produced data. | 0 – Extremely low serovar prediction accuracy (serovar is correctly predicted for <70% of serovars) 1 – No evaluation of serovar prediction ability, or weak prediction accuracy (data indicate between 70 and 80% serovar prediction accuracy) 2 – Data indicate between 80 and 85% serovar prediction accuracy 3 – Data indicate between 85 and 90% serovar prediction accuracy; or serovar prediction ability evaluated with small number of serovars 4 – Data indicate between 90 and 98% serovar prediction accuracy 5 – Data indicate >98% serovar prediction accuracy); serovars are correctly predicted for all common isolates2 Note: we recommend that data are generated by using at least 40 different serovars, ideally more than 100 serovars. |
Speed | Time to results from pure single colony | <5 days | Speed can be influenced by throughput, equipment, and data analysis program used for a given assay | 0 – >1 month 1 – 3–4 weeks 2 – 2–3 weeks 3 – 1–2 weeks 4 – ≤5 days 5 – ≤2 days |
Ease of use | Ease of use encompasses technical simplicity, workload, suitability for high throughput test, ease of data analysis, and result interpretation | Ease of use is important for the implementation of an assay in the internal laboratories of food industry, less important when using services provided by a commercial laboratory. | Poor ease of use is usually caused by the high level of expertise and experience required by a given assay, e.g., bioinformatics expertise to analyze data produced by the assay. | 0 – The given assay requires extremely high level of expertise and experience in specific techniques (PhD level scientist with >4 days of specialized training) 3 – The given assay requires average level of expertise and experience of a microbiological technician 5 – No specific expertise or experience required; assay can be completed by high school diploma and <1 day training. |
Cost | Total cost encompasses cost of equipment reagent/consumables, data analysis platform, and staffing. For routine use, we usually just assess the reagent cost per isolate. Staffing cost can vary considerably in different regions/countries within a given turnaround time, thus needs to be assessed separately with actual local situations. | A balance between efficiency/effectiveness and cost of a given assay is more important than pursuing low cost, because low cost may potentially lead to larger economic loss and extra investigation time caused by poor quality of typing result. | High cost per isolate for routine test is usually caused by high reagent cost and long turnaround time (leading to high staffing cost). | We recommend to use the actual reagent cost per isolate plus staffing cost estimated with given turnaround time to compare the assay being validated to the currently/previously used methods by food industry; data here are based on costs from commercial laboratories in North America and Europe: 0 – >$1,000 per isolate 1 – $500–$1,000 per isolate 2 – $200–$500 per isolate 3 – $150–200 per isolate 4 – $100–150 per isolate 5 – ≤$100 |
1The parameters and information in this table are adapted from Van Belkum et al. (2007) and Wiedmann et al. (2014) with industry-specific practical needs. 2The serovar typing ability of conventional serotyping method (Kaufmann–White Le Minor scheme) is around 90% taking the typeability and accuracy of it into consideration (Bopp et al., 2016).
Implementation of Molecular-Based Salmonella Subtyping Methods by the Food Industry
We consider that WGS is the most suitable method to characterize Salmonella for incident investigation at production facilities in the food industry. This opinion is based on comparison of the resolution, turnaround time, ability of serovar prediction, cost, and feasibility of the available methods. Bioinformatics is a key capability required for WGS. The food industry may choose to invest in in-house capability that can interface with outside resources (e.g., academic partners, industry partners, government agencies), however, there are also opportunities to outsource data analyses to commercial or academic partner labs. Both the CFSAN pipeline and the Lyve-SET pipeline have been widely tested and seem to provide comparable and reliable results for hqSNP analysis. Implementation of wgMLST and cgMLST within BioNumerics has been successfully completed for L. monocytogenes in the United States. A cgMLST scheme is publicly available from EnteroBase (EnteroBase URL: https://enterobase.warwick.ac.uk/) and it is likely to be implemented within BioNumerics in the future. Other data analysis methods such as genome distance analysis (Pinho et al., 2009; Auch et al., 2010) can also become possible future approaches that allow for the food industry to develop data analysis capabilities for contamination source tracking.
The turnaround time of in-house WGS subtyping can be comparable to many conventional subtyping methods including conventional serotyping and PFGE (Table 1). WGS, however, provides much more information about an isolate with one single experimental procedure, enabling full characterization of the pathogen (including in silico serovar prediction and antimicrobial resistance gene identification) and more accurate clustering/discrimination of the isolates investigated. This is faster than using multiple conventional subtyping approaches in a stepwise approach to get equal information. The cost of WGS is also comparable to that of the conventional subtyping tools, considering the high quality and volume of information provided by WGS within one experimental procedure. In silico serotyping should be performed instead of traditional serotyping for determination of serovars once WGS is implemented as the subtyping method for Salmonella. This approach will greatly reduce the costs and time associated with serotyping.
Legacy MLST targeting variants of seven housekeeping genes of Salmonella can be used in combination with WGS. While legacy MLST classification can be obtained using Sanger sequencing technology (also known as first-generation sequencing technology) within 1 week, it can also be obtained by using the sequence information extracted from WGS data. Although legacy MLST has relatively lower discriminatory power compared with PFGE and MLVA, it is faster than PFGE when using an in-house Sanger sequencer such as Applied Biosystems Genetic Analysis Systems (Thermo Fisher Scientific). It is also more universal to all Salmonella serovars than MLVA which usually requires a specific scheme for each serovar. In addition, the serovar prediction ability of legacy MLST has been demonstrated to be comparable to that of PFGE (Tables 1, 3).
PFGE is currently still the “gold standard” and most widely used Salmonella DNA fingerprinting method used by public health authorities and food regulators to characterize and track this pathogen in outbreaks, although it is being replaced by WGS. PFGE remains a valuable tool for foodborne pathogen characterization by the food industry, while a transition to WGS occurs. PFGE has been repeatedly shown to be more discriminatory than methods such as conventional serotyping, automated ribotyping, or MLST for many bacteria including Salmonella. In addition to these methods, single-plex or multiplex PCR assays that can detect and identify specific Salmonella serotypes have been described (Kim et al., 2006; Akiba et al., 2011; Zhu et al., 2015; Xiong et al., 2018; Xu L. et al., 2018; Xu Y. et al., 2018); these tools provide an alternative approach for detection and identification of specific Salmonella serovars.
The results of any subtyping approach can be used to assess the relationship of isolates in an investigation. Nevertheless, the epidemiological context is indispensable in final decision making in incident investigation and to determine further actions for food safety management improvement. High-resolution WGS subtyping results should not be interpreted in the absence of epidemiological information.
The raw sequence data generated by molecular-based subtyping methods, especially WGS, require both physical and virtual space for storage. It is desirable to retain the original sequence reads (usually files with >200 MB for each Salmonella isolate) for potential future analysis using alternative data analysis methods or for a retrospective investigation. Commercial clouds can provide a storage solution, provided that special attention is paid to data security. A robust Internet connection and high band-width is needed to transfer WGS data if data storage is outsourced. Subtyping analysis needs to be supported by complete metadata providing the relevant epidemiological context to identify the root cause of the contamination. Thus, the capability for metadata collection, organization, and storage is needed together with building the capability for WGS. The metadata should include information such as the geographic and temporal background of the isolates, the sample type, and sample source (e.g., raw ingredients, finished products, environment), etc. The Consortium for Sequencing the Food Supply Chain, founded by IBM and Mars Incorporated, represents industrial groups putting effort into collecting genomic information on pathogenic bacteria across the food supply chain6. This consortium represents one part of the broader goal to increase knowledge of foodborne pathogens at the genomic level.
Conclusion
The application of DNA-based methods for characterization of pathogens such as Salmonella has become common practice. Our literature-based assessment supports the superior discriminatory power of WGS and its advantages compared with other methods for Salmonella subtyping and source tracking for the food industry. We also identified circumstances under which use of other subtyping methods may be warranted. Implementation of molecular-based Salmonella characterization methods, including WGS, provides improvement of source tracking and root cause elimination; however, these methods require investment in bioinformatics capability. Routine use of WGS or complete replacement of current subtyping methods by WGS will require attention to key issues including standardization, robustness, and validation of the analytical methodology. High resolution WGS subtyping of Salmonella promises to vastly improve the ability of the food industry to track and control Salmonella and is poised to become standard methodology in food safety for characterization of foodborne pathogens by public health authorities and food regulators. Nevertheless, standardization of WGS operation and data analysis, in particularly source tracking analysis, is required at a global level. A common agreement of understanding and the application of WGS between the food industry, public health, and food safety regulators are expected to guide the implementation of WGS in food safety management.
Author Contributions
ST and MW conceived and designed the work. ST, RO, and HL collected the data, conducted data analysis, and interpreted it. ST, RO, HL, and MW drafted the article. MW, AS, RB, CG, and GZ critically revised the article.
Conflict of Interest Statement
ST, HL, CG, GZ, RB, and AS were employed by the Mars Global Food Safety Center. MW serves as a compensated scientific advisor for BioMérieux, Mérieux NutriSciences, Mars, and Neogen and has served as a paid speaker for 3M and IBM. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We thank Peter Markwell, Dr. Bala Ganesan, and Dr. Kristel Hauben for comments that greatly improved the manuscript.
Funding. The authors declare that this study received funding from Mars Global Food Safety Center. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.
References
- Achtman M., Wain J., Weill F. X., Nair S., Zhou Z., Sangal V. (2012). Multilocus sequence typing as a replacement for serotyping in Salmonella enterica. PLoS Pathog. 8:e1002776. 10.1371/journal.ppat.1002776 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Akiba M., Kusumoto M., Iwata T. (2011). Rapid identification of Salmonella enterica serovars, Typhimurium, Choleraesuis, Infantis, Hadar, Enteritidis, Dublin and Gallinarum, by multiplex PCR. J. Microbiol. Methods 85 9–15. 10.1016/j.mimet.2011.02.002 [DOI] [PubMed] [Google Scholar]
- Alcaine S. D., Soyer Y., Warnick L. D., Su W. L., Sukhnanand S., Richards J., et al. (2006). Multilocus sequence typing supports the hypothesis that cow- and human-associated Salmonella isolates represent distinct and overlapping populations. Appl. Environ. Microbiol. 72 7575–7585. 10.1128/aem.01174-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alikhan N. F., Zhou Z., Sergeant M. J., Achtman M. (2018). A genomic overview of the population structure of Salmonella. PLoS Genet. 14:e1007261. 10.1371/journal.pgen.1007261 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allard M. W., Bell R., Ferreira C. M., Gonzalez-Escalona N., Hoffmann M., Muruvanda T., et al. (2018). Genomics of foodborne pathogens for microbial food safety. Curr. Opin. Biotechnol. 49 224–229. 10.1016/j.copbio.2017.11.002 [DOI] [PubMed] [Google Scholar]
- Allard M. W., Luo Y., Strain E., Li C., Keys C. E., Son I., et al. (2012). High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach. BMC Genomics 13:32. 10.1186/1471-2164-13-32 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allard M. W., Strain E., Melka D., Bunning K., Musser S. M., Brown E. W., et al. (2016). Practical value of food pathogen traceability through building a whole-genome sequencing network and database. J. Clin. Microbiol. 54 1975–1983. 10.1128/JCM.00081-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Almeida F., Medeiros M. I., Rodrigues P., Dos D., Falcao J. P. (2015). Genotypic diversity, pathogenic potential and the resistance profile of Salmonella typhimurium strains isolated from humans and food from 1983 to 2013 in Brazil. J. Med. Microbiol. 64 1395–1407. 10.1099/jmm.0.000158 [DOI] [PubMed] [Google Scholar]
- Almeida F., Seribelli A. A., da Silva P., Medeiros M. I. C., Dos Prazeres Rodrigues D., Moreira C. G., et al. (2017). Multilocus sequence typing of Salmonella typhimurium reveals the presence of the highly invasive ST313 in Brazil. Infect. Genet. Evol. 51 41–44. 10.1016/j.meegid.2017.03.009 [DOI] [PubMed] [Google Scholar]
- Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. (1990). Basic local alignment search tool. J. Mol. Biol. 215 403–410. 10.1006/jmbi.1990.9999 [DOI] [PubMed] [Google Scholar]
- Amirkhanian V., Lui M., Guttman A., Szantai E. (2006). Cost Benefit Analysis of a Multicapillary Electrophoresis System American Laboratory. Available at: https://www.americanlaboratory.com/913-Technical-Articles/35725-Cost-Benefit-Analysis-of-a-Multicapillary-Electrophoresis-System/ (accessed June 1, 2006). [Google Scholar]
- Ashton P. M., Nair S., Peters T. M., Bale J. A., Powell D. G., Painset A. (2016). Identification of Salmonella for public health surveillance using whole genome sequencing. PeerJ 4:e1752. 10.7717/peerj.1752 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Auch A. F., Klenk H., Göker M. (2010). Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs. Stand. Genomic Sci. 2 142–148. 10.4056/sigs.541628 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barco L., Barrucci F., Olsen J. E., Ricci A. (2013). Salmonella source attribution based on microbial subtyping. Int. J. Food Microbiol. 163 193–203. 10.1016/j.ijfoodmicro.2013.03.005 [DOI] [PubMed] [Google Scholar]
- Barg N. L., Goering R. V. (1993). Molecular epidemiology of nosocomial infection: analysis of chromosomal restriction fragment patterns by pulsed-field gel electrophoresis. Infect. Control Hosp. Epidemiol. 14 595–600. 10.1086/646645 [DOI] [PubMed] [Google Scholar]
- Barrangou R., Dudley E. G. (2016). CRISPR-based typing and next-generation tracking technologies. Annu. Rev. Food Sci. Technol. 7 395–411. 10.1146/annurev-food-022814-015729 [DOI] [PubMed] [Google Scholar]
- Barrangou R., Horvath P. (2012). CRISPR: new horizons in phage resistance and strain identification. Annu. Rev. Food Sci. Technol. 3 143–162. 10.1146/annurev-food-022811-101134 [DOI] [PubMed] [Google Scholar]
- Bauer N., Evans P., Leopold B., Levine J., White P. (2013). USDA-FSIS Subtyping Work Group: Current and Future Development and Use of Molecular Subtyping by USDA-FSI. Available at: https://www.fsis.usda.gov/wps/wcm/ connect/6c7f71fd-2c0c-4ff0-b2bc-4977c7947516/Molecular-Subtyping-White- Paper.pdf?MOD=AJPERES (accessed October 27, 2018). [Google Scholar]
- Ben-Darif E., De Pinna E., Threlfall E. J., Bolton F. J., Upton M., Fox A. J., et al. (2010). Comparison of a semi-automated rep-PCR system and multilocus sequence typing for differentiation of Salmonella enterica isolates. J. Microbiol. Methods 81 11–16. 10.1016/j.mimet.2010.01.013 [DOI] [PubMed] [Google Scholar]
- Beranek A., Mikula C., Rabold P., Arnhold D., Berghold C., Lederer I., et al. (2009). Multiple-locus variable-number tandem repeat analysis for subtyping of Salmonella enterica subsp. enterica serovar Enteritidis. Int. J. Med. Microbiol. 299 43–51. 10.1016/j.ijmm.2008.06.002 [DOI] [PubMed] [Google Scholar]
- Bopp D. J., Baker D. J., Thompson L., Saylors A., Root T. P., Armstrong L., et al. (2016). Implementation of Salmonella serotype determination using pulsed-field gel electrophoresis in a state public health laboratory. Diagn. Microbiol. Infect. Dis. 85 416–418. 10.1016/j.diagmicrobio.2016.04.023 [DOI] [PubMed] [Google Scholar]
- Boxrud D. (2010). Advances in subtyping methods of foodborne disease pathogens. Curr. Opin. Biotechnol. 21 137–141. 10.1016/j.copbio.2010.02.011 [DOI] [PubMed] [Google Scholar]
- Boxrud D., Pederson-Gulrud K., Wotton J., Medus C., Lyszkowicz E., Besser J., et al. (2007). Comparison of multiple-locus variable-number tandem repeat analysis, pulsed-field gel electrophoresis, and phage typing for subtype analysis of Salmonella enterica serotype Enteritidis. J. Clin. Microbiol. 45 536–543. 10.1128/jcm.01595-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brenner F. W., Villar R. G., Angulo F. J., Tauxe R., Swaminathan B. (2000). Salmonella nomenclature - Guest commentary. J. Clin. Microbiol. 38 2465–2467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Call D. R., Orfe L., Davis M. A., Lafrentz S., Kang M. S. (2008). Impact of compounding error on strategies for subtyping pathogenic bacteria. Foodborne Pathog. Dis. 5 505–516. 10.1089/fpd.2008.0097 [DOI] [PMC free article] [PubMed] [Google Scholar]
- CDC (2009). U Centers, S., for Disease Control and Prevention: Salmonella Surveillance: Annual Summary. Available at: https://www.cdc.gov/ncezid/dfwed/pdfs/salmonellaannualsummarytables2009.pdf (accessed October 27, 2018). [Google Scholar]
- CDC (2015). Serotypes and the Importance of Serotyping Salmonella. Available at: https://www.cdc.gov/salmonella/reportspubs/salmonella-atlas/serotyping-importance.html (accessed February 8, 2019). [Google Scholar]
- CDC (2016a). Frequently Asked Questions. Available at: https://www.cdc.gov/pulsenet/about/faq.html (accessed February 16, 2016). [Google Scholar]
- CDC (2016b). Whole Genome Sequencing (WGS). Available at: https://www.cdc.gov/pulsenet/pathogens/wgs.html (accessed February 11, 2016). [Google Scholar]
- CDC (2017a). PulseNet International - On the Path to Implementing Whole Genome Sequencing for Foodborne Disease Surveillance. Available at: https://www.cdc.gov/pulsenet/participants/international/wgs-vision.html (accessed June 9, 2017). [Google Scholar]
- CDC (2017b). Standard Operating Procedure for PulseNet PFGE of Escherichia coli O157:H7, Escherichia coli non-O157 (STEC), Salmonella serotypes, Shigella sonnei and Shigella flexneri. Available at: https://www.cdc.gov/pulsenet/pdf/ecoli-shigella-salmonella-pfge-protocol-508c.pdf (accessed October 27, 2018). [Google Scholar]
- CDC (2017c). US Centers for Disease Control and Prevention (CDC): PulseNet Methods - Multiple Locus Variable-Number Tandem Repeat Analysis (MLVA). Available at: https://www.cdc.gov/pulsenet/pathogens/mlva.html (accessed October 27, 2018). [Google Scholar]
- CDC (2018a). Pulsed-Field Gel Electrophoresis (PFGE). Available at: https://www.cdc.gov/pulsenet/pathogens/pfge.html (accessed December 30, 2018). [Google Scholar]
- CDC (2018b). US Centers for Disease Control and Prevention (CDC): National Enteric Disease Surveillance - Salmonella Annual Report, 2016. Available at: https://www.cdc.gov/nationalsurveillance/salmonella-surveillance.html (accessed December 30, 2018). [Google Scholar]
- CDC (2019). Outbreak of Salmonella Infections Linked to Pet Hedgehogs. Available at: https://www.cdc.gov/salmonella/typhimurium-01-19/index.html (accessed May 30, 2019). [Google Scholar]
- Chen Y., Luo Y., Carleton H., Timme R., Melka D., Muruvanda T. (2017). Whole genome and core genome multilocus sequence typing and single nucleotide polymorphism analyses of Listeria monocytogenes isolates associated with an outbreak linked to cheese, United States, 2013. Appl. Environ. Microbiol. 83:e00633-17. 10.1128/AEM.00633-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chenu J. W., Cox J. M., Pavic A. (2012). Classification of Salmonella enterica serotypes from Australian poultry using repetitive sequence-based PCR. J. Appl. Microbiol. 112 185–196. 10.1111/j.1365-2672.2011.05172.x [DOI] [PubMed] [Google Scholar]
- Clooney A. G., Fouhy F., Sleator R. D., O’Driscoll A., Stanton C., Cotter P. D., et al. (2016). Comparing apples and oranges?: next generation sequencing and its impact on microbiome analysis. PLoS One 11:e0148028. 10.1371/journal.pone.0148028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Collins C., Xavier D. (2017). “Reconstructing the ancestral relationships between bacterial pathogen genomes,” in Bacterial Pathogenesis: Methods and Protocols, eds Nordenfelt P., Collin M. (New York, NY: Springer; ), 109–137. 10.1007/978-1-4939-6673-8_8 [DOI] [PubMed] [Google Scholar]
- Dallman T., Inns T., Jombart T., Ashton P., Loman N., Chatt C. (2016). Phylogenetic structure of European Salmonella Enteritidis outbreak correlates with national and international egg distribution network. Microb. Genom. 2:e000070. 10.1099/mgen.0.000070 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Davis S., Pettengill J. B., Luo Y., Payne J., Shpuntoff A., Rand H., et al. (2015). CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data. PeerJ Comput. Sci. 1:e20 10.7717/peerj-cs.20 [DOI] [Google Scholar]
- De Cesare A., Krishnamani K., Parisi A., Ricci A., Luzzi I., Barco L., et al. (2015). Comparison between Salmonella enterica serotype Enteritidis genotyping methods and phage type. J. Clin. Microbiol. 53 3021–3031. 10.1128/JCM.01122-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- den Bakker H. C., Allard M. W., Bopp D., Brown E. W., Fontana J., Iqbal Z., et al. (2014). Rapid whole-genome sequencing for surveillance of Salmonella enterica serovar Enteritidis. Emerg. Infect. Dis. 20 1306–1314. 10.3201/eid2008.131399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- den Bakker H. C., Cummings C. A., Ferreira V., Vatta P., Orsi R. H., Degoricija L., et al. (2010). Comparative genomics of the bacterial genus Listeria: genome evolution is characterized by limited gene acquisition and limited gene loss. BMC Genomics 11:688. 10.1186/1471-2164-11-688 [DOI] [PMC free article] [PubMed] [Google Scholar]
- den Bakker H. C., Switt A. I., Cummings C. A., Hoelzer K., Degoricija L., Rodriguez-rivera L. D., et al. (2011). A whole-genome single nucleotide polymorphism-based approach to trace and identify outbreaks linked to a common Salmonella enterica subsp. enterica serovar Montevideo pulsed-field gel electrophoresis type. Appl. Environ. Microbiol. 77 8648–8655. 10.1128/AEM.06538-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deng X., Shariat N., Driebe E. M., Roe C. C., Tolar B., Trees E., et al. (2015). Comparative analysis of subtyping methods against a whole-genome-sequencing standard for Salmonella enterica serotype enteritidis. J. Clin. Microbiol. 53 212–218. 10.1128/JCM.02332-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dera-Tomaszewska B. (2012). Salmonella serovars isolated for the first time in Poland, 1995-2007. Int. J. Occup. Med. Environ. Health 25 294–303. 10.2478/S13382-012-0038-2 [DOI] [PubMed] [Google Scholar]
- Deurenberg R. H., Bathoorn E., Chlebowicz M. A., Couto N., Ferdous M., García-Cobos S. (2017). Application of next generation sequencing in clinical microbiology and infection prevention. J. Biotechnol. 243 16–24. 10.1016/j.jbiotec.2016.12.022 [DOI] [PubMed] [Google Scholar]
- Dijkshoorn L., Towner K. J., Struelens M. (2001). New Approaches for the Generation and Analysis of Microbial Typing Data. Amsterdam: Elsevier. [Google Scholar]
- DiMarzio M., Shariat N., Kariyawasam S., Barrangou R., Dudley E. G. (2013). Antibiotic resistance in Salmonella enterica serovar typhimurium associates with CRISPR sequence type. Antimicrob. Agents Chemother. 57 4282–4289. 10.1128/AAC.00913-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- ECDC (2011). European Centre for Disease Prevention and Control (ECDC) Technical Document: Laboratory Standard Operating Procedure for MLVA of Salmonella enterica Serotype Typhimurium. Available at: https://ecdc.europa. eu/sites/portal/files/media/en/publications/Publications/1109_SOP_Salmonella _Typhimurium_MLVA.pdf (accessed October 27, 2018). [Google Scholar]
- ECDC (2015). European Centre for Disease Prevention and Control (ECDC): Expert Opinion on the Introduction of Next-Generation Typing Methods for Food-and Waterborne Diseases in the EU and EEA. Stockholm: ECDC. [Google Scholar]
- ECDC (2016a). European Centre for Disease Prevention and Control (ECDC): Salmonellosis - Annual Epidemiological Report 2016 [2014 Data]. Available at: https://ecdc.europa.eu/en/publications-data/salmonellosis-annual-epidemiological-report-2016-2014-data (accessed January 31, 2016). [Google Scholar]
- ECDC (2016b). European Centre for Disease Prevention and Control (ECDC) Technical Document: Laboratory Standard Operating Procedure for Multiple-Locus Variable-Number Tandem Repeat Analysis of Salmonella enterica Serotype Enteritidis. Available at: https://ecdc.europa.eu/sites/portal/files/media/en/publications/Publications/Salmonella-Enteritidis-Laboratory-standard-operating-procedure.pdf (accessed October 27, 2018). [Google Scholar]
- Elemfareji O. I., Thong K. L. (2013). Comparative virulotyping of Salmonella typhi and Salmonella enteritidis. Indian J. Microbiol. 53 410–417. 10.1007/s12088-013-0407-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fabre L., Zhang J., Guigon G., Le Hello S., Guibert V., Accou-Demartin M. (2012). CRISPR typing and subtyping for improved laboratory surveillance of Salmonella infections. PLoS One 7:e36995. 10.1371/journal.pone.0036995 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fakhr M. K., Nolan L. K., Logue C. M. (2005). Multilocus sequence typing lacks the discriminatory ability of pulsed-field gel electrophoresis for typing Salmonella enterica serovar typhimurium. J. Clin. Microbiol. 43 2215–2219. 10.1128/JCM.43.5.2215-2219.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- FAO (2016). Food and Agriculture Organization of the United Nations: Applications of Whole Genome Sequencing in Food Safety Management. Rome: FAO. [Google Scholar]
- FDA (2018). GenomeTrakr Network. Available at: https://www.fda.gov/Food/FoodScienceResearch/WholeGenomeSequencingProgramWGS/ucm363134.htm (accessed March 21, 2019). [Google Scholar]
- Ferrari R. G., Panzenhagen P. H. N., Conte C. A., Jr. (2017). Phenotypic and genotypic eligible methods for Salmonella typhimurium source tracking. Front. Microbiol. 8:2587. 10.3389/fmicb.2017.02587 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fitzgerald C., Gheesling L., Collins M., Fields P. I. (2006). Sequence analysis of the rfb loci, encoding proteins involved in the biosynthesis of the Salmonella enterica O17 and O18 antigens: serogroup-specific identification by PCR. Appl. Environ. Microbiol. 72 7949–7953. 10.1128/aem.01046-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Foley S. L., Lynne A. M., Nayak R. (2009). Molecular typing methodologies for microbial source tracking and epidemiological investigations of Gram-negative bacterial foodborne pathogens. Infect. Genet. Evol. 9 430–440. 10.1016/j.meegid.2009.03.004 [DOI] [PubMed] [Google Scholar]
- Foley S. L., White D. G., McDermott P. F., Walker R. D., Rhodes B., Fedorka-Cray P. J., et al. (2006). Comparison of subtyping methods for differentiating Salmonella enterica serovar typhimurium isolates obtained from food animal sources. J. Clin. Microbiol. 44 3569–3577. 10.1128/jcm.00745-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Francisco A. P., Bugalho M., Ramirez M., Carrico J. A. (2009). Global optimal eBURST analysis of multilocus typing data using a graphic matroid approach. BMC Bioinformatics 10:152. 10.1186/1471-2105-10-152 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fricke W. F., Mammel M. K., McDermott P. F., Tartera C., White D. G., Leclerc J. E., et al. (2011). Comparative genomics of 28 Salmonella enterica isolates: evidence for CRISPR-mediated adaptive sublineage evolution. J. Bacteriol. 193 3556–3568. 10.1128/JB.00297-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galanis E., Lo Fo Wong D. M., Patrick M. E., Binsztein N., Cieslik A., Chalermchikit T. (2006). Web-based surveillance and global Salmonella distribution, 2000-2002. Emerg. Infect. Dis. 12 381–388. 10.3201/eid1205.050854 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gaul S. B., Wedel S., Erdman M. M., Harris D. L., Harris I. T., Ferris K. E., et al. (2007). Use of pulsed-field gel electrophoresis of conserved XbaI fragments for identification of swine Salmonella serotypes. J. Clin. Microbiol. 45 472–476. 10.1128/jcm.00962-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gilson E., Bachellier S., Perrin S., Perrin D., Grimont P. A., Grimont F., et al. (1990). Palindromic unit highly repetitive DNA sequences exhibit species specificity within Enterobacteriacea. Res. Microbiol. 141 1103–1116. 10.1016/0923-2508(90)90084-4 [DOI] [PubMed] [Google Scholar]
- GMA (2009). The Association of Food, Beverage and Consumer Products Companies (GMA): Control of Salmonella in Low-Moisture Foods. Available at: https://www.gmaonline.org/downloads/technical-guidance-and-tools/SalmonellaControlGuidance.pdf (accessed March 16, 2009). [Google Scholar]
- Greig J. D., Ravel A. (2009). Analysis of foodborne outbreak data reported internationally for source attribution. Int. J. Food Microbiol. 130 77–87. 10.1016/j.ijfoodmicro.2008.12.031 [DOI] [PubMed] [Google Scholar]
- Grimont P., Weill F. (2007). Antigenic Formulae of the Salmonella Serovars, 9th Edn Paris: WHO Collaborating Centre for Reference and Research on Salmonella. [Google Scholar]
- Guerra B., Laconcha I., Soto S. M., Gonzalez-Hevia M. A., Mendoza M. C. (2000). Molecular characterisation of emergent multiresistant Salmonella enterica serotype [4,5,12:i:-] organisms causing human salmonellosis. FEMS Microbiol. Lett. 190 341–347. 10.1111/j.1574-6968.2000.tb09309.x [DOI] [PubMed] [Google Scholar]
- Guibourdenche M., Roggentin P., Mikoleit M., Fields P. I., Bockemuhl J., Grimont P. A., et al. (2010). Supplement 2003-2007 (No. 47) to the White-Kauffmann-Le Minor scheme. Res. Microbiol. 161 26–29. 10.1016/j.resmic.2009.10.002 [DOI] [PubMed] [Google Scholar]
- Guigon G., Cheval J., Cahuzac R., Brisse S. (2008). MLVA-NET–a standardised web database for bacterial genotyping and surveillance. Euro Surveill. 13:18863 10.2807/ese.13.19.18863-en [DOI] [PubMed] [Google Scholar]
- Hadjinicolaou A. V., Demetriou V. L., Emmanuel M. A., Kakoyiannis C. K., Kostrikis L. G. (2009). Molecular beacon-based real-time PCR detection of primary isolates of Salmonella typhimurium and Salmonella enteritidis in environmental and clinical samples. BMC Microbiol. 9:97. 10.1186/1471-2180-9-97 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hanning I. B., Nutt J. D., Ricke S. C. (2009). Salmonellosis outbreaks in the United States due to fresh produce: sources and potential intervention measures. Foodborne Pathog. Dis. 6 635–648. 10.1089/fpd.2008.0232 [DOI] [PubMed] [Google Scholar]
- Harbottle H., White D. G., McDermott P. F., Walker R. D., Zhao S. (2006). Comparison of multilocus sequence typing, pulsed-field gel electrophoresis, and antimicrobial susceptibility typing for characterization of Salmonella enterica serotype Newport isolates. J. Clin. Microbiol. 44 2449–2457. 10.1128/jcm.00019-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hartmann F. A., West S. E. (1997). Utilization of both phenotypic and molecular analyses to investigate an outbreak of multidrug-resistant Salmonella anatum in horses. Can. J. Vet. Res. 61 173–181. [PMC free article] [PubMed] [Google Scholar]
- Hauser E., Tietze E., Helmuth R., Junker E., Prager R., Schroeter A., et al. (2012). Clonal dissemination of Salmonella enterica serovar infantis in Germany. Foodborne Pathog. Dis. 9 352–360. 10.1089/fpd.2011.1038 [DOI] [PubMed] [Google Scholar]
- Hauser E., Tietze E., Helmuth R., Malorny B. (2011). Different mutations in the oafA gene lead to loss of O5-antigen expression in Salmonella enterica serovar typhimurium. J. Appl. Microbiol. 110 248–253. 10.1111/j.1365-2672.2010.04877.x [DOI] [PubMed] [Google Scholar]
- Healy M., Huong J., Bittner T., Lising M., Frye S., Raza S., et al. (2005). Microbial DNA typing by automated repetitive-sequence-based PCR. J. Clin. Microbiol. 43 199–207. 10.1128/JCM.43.1.199-207.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heisig P., Kratz B., Halle E., Graser Y., Altwegg M., Rabsch W., et al. (1995). Identification of DNA gyrase A mutations in ciprofloxacin-resistant isolates of Salmonella typhimurium from men and cattle in Germany. Microb. Drug Resist. 1 211–218. 10.1089/mdr.1995.1.211 [DOI] [PubMed] [Google Scholar]
- Hoelzer K., Soyer Y., Rodriguez-Rivera L. D., Cummings K. J., McDonough P. L., Schoonmaker-Bopp D. J. (2010). The prevalence of multidrug resistance is higher among bovine than human Salmonella enterica serotype Newport, Typhimurium, and 4,5,12:i:- isolates in the United States but differs by serotype and geographic region. Appl. Environ. Microbiol. 76 5947–5959. 10.1128/AEM.00377-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffmann M., Luo Y., Monday S. R., Gonzalez-Escalona N., Ottesen A. R., Muruvanda T. (2016). Tracing origins of the Salmonella bareilly strain causing a food-borne outbreak in the united states. J. Infect. Dis. 213 502–508. 10.1093/infdis/jiv297 [DOI] [PubMed] [Google Scholar]
- Hopkins K. L., Maguire C., Best E., Liebana E., Threlfall E. J. (2007). Stability of multiple-locus variable-number tandem repeats in Salmonella enterica serovar typhimurium. J. Clin. Microbiol. 45 3058–3061. 10.1128/jcm.00715-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hopkins K. L., Peters T. M., de Pinna E., Wain J. (2011). Standardisation of multilocus variable-number tandem-repeat analysis (MLVA) for subtyping of Salmonella enterica serovar Enteritidis. Euro Surveill. 16:19942. 10.2807/ese.16.32.19942-en [DOI] [PubMed] [Google Scholar]
- Hulton C. S. J., Higgins C. F., Sharp P. M. (1991). ERIC sequences: a novel family of repetitive elements in the genomes of Escherichia coli, Salmonella typhimurium and other enterobacteria. Mol. Microbiol. 5 825–834. 10.1111/j.1365-2958.1991.tb00755.x [DOI] [PubMed] [Google Scholar]
- Hunter S. B., Vauterin P., Lambert-Fair M. A., Van Duyne M. S., Kubota K., Graves L., et al. (2005). Establishment of a universal size standard strain for use with the pulsenet standardized pulsed-field gel electrophoresis protocols: converting the national databases to the new size standard. J. Clin. Microbiol. 43 1045. 10.1128/JCM.43.3.1045-1050.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hyytia-Trees E. K., Cooper K., Ribot E. M., Gerner-Smidt P. (2007). Recent developments and future prospects in subtyping of foodborne bacterial pathogens. Future Microbiol. 2 175–185. 10.2217/17460913.2.2.175 [DOI] [PubMed] [Google Scholar]
- Illumina (2019). Illumina - Introduction to SBS Technology. Available at: https://www.illumina.com/science/technology/next-generation-sequencing/sequencing-technology.html (accessed October 27, 2018). [Google Scholar]
- Inns T., Ashton P., Herrera-Leon S., Lighthill J., Foulkes S., Jombart T. (2016). Prospective use of whole genome sequencing (WGS) detected a multi-country outbreak of Salmonella enteritidis. Epidemiol. Infect. 145 289–298. 10.1017/S0950268816001941 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Inns T., Lane C., Peters T., Dallman T., Chatt C., McFarland N. (2015). A multi-country Salmonella enteritidis phage type 14b outbreak associated with eggs from a German producer: ‘near real-time’ application of whole genome sequencing and food chain investigations, United Kingdom, May to September 2014. Euro Surveill. 20:21098. 10.2807/1560-7917.ES2015.20.16.21098 [DOI] [PubMed] [Google Scholar]
- Inouye M., Dashnow H., Raven L. A., Schultz M. B., Pope B. J., Tomita T., et al. (2014). SRST2: rapid genomic surveillance for public health and hospital microbiology labs. Genome Med. 6:90. 10.1186/s13073-014-0090-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jackson B. R., Tarr C., Strain E., Jackson K. A., Conrad A., Carleton H. (2016). Implementation of nationwide real-time whole-genome sequencing to enhance listeriosis outbreak detection and investigation. Clin. Infect. Dis. 63 380–386. 10.1093/cid/ciw242 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jagadeesan B., Gerner-Smidt P., Allard M. W., Leuillet S., Winkler A., Xiao Y. (2019). The use of next generation sequencing for improving food safety: translation into practice. Food Microbiol. 79 96–115. 10.1016/j.fm.2018.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katz L. S., Griswold T., Williams-Newkirk A. J., Wagner D., Petkau A., Sieffert C., et al. (2017). A comparative analysis of the Lyve-SET phylogenomics pipeline for genomic epidemiology of foodborne pathogens. Front. Microbiol. 8:375. 10.3389/fmicb.2017.00375 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kerouanton A., Marault M., Lailler R., Weill F. X., Feurer C., Espie E., et al. (2007). Pulsed-field gel electrophoresis subtyping database for foodborne Salmonella enterica serotype discrimination. Foodborne Pathog. Dis. 4 293–303. 10.1089/fpd.2007.0090 [DOI] [PubMed] [Google Scholar]
- Kidgell C., Reichard U., Wain J., Linz B., Torpdahl M., Dougan G., et al. (2002). Salmonella typhi, the causative agent of typhoid fever, is approximately 50,000 years old. Infect. Genet. Evol. 2 39–45. 10.1016/s1567-1348(02)00089-8 [DOI] [PubMed] [Google Scholar]
- Kim S., Frye J. G., Hu J., Fedorka-Cray P. J., Gautom R., Boyle D. S., et al. (2006). Multiplex PCR-based method for identification of common clinical serotypes of Salmonella enterica subsp. enterica. J. Clin. Microbiol. 44 3608–3615. 10.1128/jcm.00701-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kjeldsen M. K., Torpdahl M., Pedersen K., Nielsen E. M. (2016). Development and comparison of a generic multiple-locus variable-number tandem repeat analysis with PFGE for typing of Salmonella enterica subsp. enterica. J. Appl. Microbiol. 119 1707–1717. 10.1111/jam.12965 [DOI] [PubMed] [Google Scholar]
- Kodama Y., Shumway M., Leinonen R. (2012). The Sequence Read Archive: explosive growth of sequencing data. Nucleic Acids Res. 40 54–56. 10.1093/nar/gkr854 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozyreva V. K., Crandall J., Sabol A., Poe A., Zhang P., Concepcion-Acevedo J., et al. (2016). Laboratory investigation of Salmonella enterica serovar Poona outbreak in California: comparison of pulsed-field gel electrophoresis (PFGE) and whole genome sequencing (WGS) results. PLoS Curr. 8:ecurrents.outbreaks.1bb3e36e74bd5779bc43ac3a8dae52e6. 10.1371/currents.outbreaks.1bb3e36e74bd5779bc43ac3a8dae52e6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kruy S. L., van Cuyck H., Koeck J. L. (2011). Multilocus variable number tandem repeat analysis for Salmonella enterica subspecies. Eur. J. Clin. Microbiol. Infect. Dis. 30 465–473. 10.1007/s10096-010-1110-0 [DOI] [PubMed] [Google Scholar]
- Kvistholm Jensen A., Nielsen E. M., Bjorkman J. T., Jensen T., Muller L., Persson S., et al. (2016). Whole-genome sequencing used to investigate a nationwide outbreak of listeriosis caused by ready-to-eat delicatessen meat, Denmark, 2014. Clin. Infect. Dis. 63 64–70. 10.1093/cid/ciw192 [DOI] [PubMed] [Google Scholar]
- Larsson J. T., Torpdahl M. Mlva working group Moller N. E. (2013). Proof-of-concept study for successful inter-laboratory comparison of MLVA results. Euro Surveill. 18:20566. 10.2807/1560-7917.ES2013.18.35.20566 [DOI] [PubMed] [Google Scholar]
- Larsson J. T., Torpdahl M., Petersen R. F., Sorensen G., Lindstedt B. A., Nielsen E. M., et al. (2009). Development of a new nomenclature for Salmonella typhimurium multilocus variable number of tandem repeats analysis (MLVA). Euro Surveill. 14:19174. 10.2807/ese.14.15.19174-en [DOI] [PubMed] [Google Scholar]
- Leekitcharoenphon P., Nielsen E. M., Kaas R. S., Lund O., Aarestrup F. M. (2014). Evaluation of whole genome sequencing for outbreak detection of Salmonella enterica. PLoS One 9:e87991. 10.1371/journal.pone.0087991 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H. (2013). Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM. Available at: http://arxiv.org/abs/1303.3997 (accessed March 16, 2013). [Google Scholar]
- Li W., Raoult D., Fournier P. E. (2009). Bacterial strain typing in the genomic era. FEMS Microbiol. Rev. 33 892–916. 10.1111/j.1574-6976.2009.00182.x [DOI] [PubMed] [Google Scholar]
- Lienemann T., Kyyhkynen A., Halkilahti J., Haukka K., Siitonen A. (2015). Characterization of Salmonella typhimurium isolates from domestically acquired infections in Finland by phage typing, antimicrobial susceptibility testing, PFGE and MLVA. BMC Microbiol. 15:131. 10.1186/s12866-015-0467-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindstedt B. A. (2005). Multiple-locus variable number tandem repeats analysis for genetic fingerprinting of pathogenic bacteria. Electrophoresis 26 2567–2582. 10.1002/elps.200500096 [DOI] [PubMed] [Google Scholar]
- Lindstedt B. A., Heir E., Gjernes E., Kapperud G. (2003). DNA fingerprinting of Salmonella enterica subsp. enterica serovar typhimurium with emphasis on phage type DT104 based on variable number of tandem repeat loci. J. Clin. Microbiol. 41 1469–1479. 10.1128/JCM.41.4.1469-1479.203 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindstedt B. A., Torpdahl M., Nielsen E. M., Vardund T., Aas L., Kapperud G., et al. (2007). Harmonization of the multiple-locus variable-number tandem repeat analysis method between Denmark and Norway for typing Salmonella typhimurium isolates and closer examination of the VNTR loci. J. Appl. Microbiol. 102 728–735. 10.1111/j.1365-2672.2006.03134.x [DOI] [PubMed] [Google Scholar]
- Lindstedt B. A., Torpdahl M., Vergnaud G., Le Hello S., Weill F. X., Tietze E., et al. (2013). Use of multilocus variable-number tandem repeat analysis (MLVA) in eight European countries, 2012. Euro Surveill. 18:20385. 10.2807/ese.18.04.20385-en [DOI] [PubMed] [Google Scholar]
- Liu F. (2010). Virulence Gene and CRISPR Multilocus Sequence Typing Scheme for Subtyping the Major Serovars of Salmonella enterica Subspecies enterica. Available at: https://etda.libraries.psu.edu/files/final_submissions/1960 (accessed October 27, 2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu F., Barrangou R., Gernersmidt P., Ribot E. M., Knabel S. J., Dudley E. G., et al. (2011). Novel virulence gene and clustered regularly interspaced short palindromic repeat (CRISPR) multilocus sequence typing scheme for subtyping of the major serovars of Salmonella enterica subsp. enterica. Appl. Environ. Microbiol. 77 1946–1956. 10.1128/AEM.02625-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loman N. J., Pallen M. J. (2015). Twenty years of bacterial genome sequencing. Nat. Rev. Microbiol. 13 787–794. 10.1038/nrmicro3565 [DOI] [PubMed] [Google Scholar]
- Maciorowski K. G., Jones F. T., Pillaiand S. D., Ricke S. C. (2004). Incidence, sources, and control of food-borne Salmonella spp. in poultry feeds. Worlds Poult. Sci. J. 60 446–457. 10.1079/WPS200428 [DOI] [Google Scholar]
- Maiden M. C., Jansen van Rensburg M. J., Bray J. E., Earle S. G., Ford S. A., Jolley K. A., et al. (2013). MLST revisited: the gene-by-gene approach to bacterial genomics. Nat. Rev. Microbiol. 11 728–736. 10.1038/nrmicro3093 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin B., Humbert O., Camara M., Guenzi E., Walker J., Mitchell T., et al. (1992). A highly conserved repeated DNA element located in the chromosome of Streptococcus pneumoniae. Nucleic Acids Res. 20 3479–3483. 10.1093/nar/20.13.3479 [DOI] [PMC free article] [PubMed] [Google Scholar]
- McQuiston J. R., Parrenas R., Ortiz-Rivera M., Gheesling L., Brenner F., Fields P. I., et al. (2004). Sequencing and comparative analysis of flagellin genes fliC, fljB, and flpA from Salmonella. J. Clin. Microbiol. 42 1923–1932. 10.1128/JCM.42.5.1923-1932.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- McQuiston J. R., Waters R. J., Dinsmore B. A., Mikoleit M. L., Fields P. I. (2011). Molecular determination of H antigens of Salmonella by use of a microsphere-based liquid array. J. Clin. Microbiol. 49 565–573. 10.1128/JCM.01323-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mohammed M., Delappe N., O’Connor J., McKeown P., Garvey P., Cormican M., et al. (2016). Whole genome sequencing provides an unambiguous link between Salmonella Dublin outbreak strain and a historical isolate. Epidemiol. Infect. 144 576–581. 10.1017/S0950268815001636 [DOI] [PubMed] [Google Scholar]
- Moura A., Criscuolo A., Pouseele H., Maury M. M., Leclercq A., Tarr C., et al. (2016). Whole genome-based population biology and epidemiological surveillance of Listeria monocytogenes. Nat. Microbiol. 2:16185. 10.1038/nmicrobiol.2016.185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mughini-Gras L., Franz E., van Pelt W. (2018). New paradigms for Salmonella source attribution based on microbial subtyping. Food Microbiol. 71 60–67. 10.1016/j.fm.2017.03.002 [DOI] [PubMed] [Google Scholar]
- Muteeb G., Sen R. (2010). “Random mutagenesis using a mutator strain,” in In Vitro Mutagenesis Protocols, 3rd Edn, ed. Braman J. (Totowa, NJ: Humana Press; ), 411–419. 10.1007/978-1-60761-652-8_29 [DOI] [PubMed] [Google Scholar]
- Nadon C., Van Walle I., Gerner-Smidt P., Campos J., Chinen I., Concepcion-Acevedo J., et al. (2017). PulseNet International: vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance. Euro Surveill. 22:30544. 10.2807/1560-7917.ES.2017.22.23.30544 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nde C. W., Sherwood J. S., Doetkott C., Logue C. M. (2006). Prevalence and molecular profiles of Salmonella collected at a commercial turkey processing plant. J. Food Prot. 69 1794–1801. 10.4315/0362-028X-69.8.1794 [DOI] [PubMed] [Google Scholar]
- Ngoi S. T., Teh C. S., Chai L. C., Thong K. L. (2015). Overview of molecular typing tools for the characterization of Salmonella enterica in Malaysia. Biomed. Environ. Sci. 28 751–764. 10.3967/bes2015.105 [DOI] [PubMed] [Google Scholar]
- Nsofor C. (2016). Pulsed-field gel electrophoresis (PFGE): principles and applications in molecular epidemiology: a review. Int. J. Curr. Res. Med. Sci. 2 38–51. 10.1111/j.1863-2378.2009.01259.x [DOI] [PubMed] [Google Scholar]
- Octavia S., Wang Q., Tanaka M. M., Kaur S., Sintchenko V., Lan R., et al. (2015). Delineating community outbreaks of Salmonella enterica serovar Typhimurium by use of whole-genome sequencing: insights into genomic variability within an outbreak. J. Clin. Microbiol. 53 1063–1071. 10.1128/JCM.03235-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olaimat A. N., Holley R. A. (2012). Factors influencing the microbial safety of fresh produce: a review. Food Microbiol. 32 1–19. 10.1016/j.fm.2012.04.016 [DOI] [PubMed] [Google Scholar]
- Oloya J., Doetkott D., Khaitsa M. L. (2009). Antimicrobial drug resistance and molecular characterization of Salmonella isolated from domestic animals, humans, and meat products. Foodborne Pathog. Dis. 6 273–284. 10.1089/fpd.2008.0134 [DOI] [PubMed] [Google Scholar]
- Olson A. B., Andrysiak A. K., Tracz D. M., Guard-Bouldin J., Demczuk W., Ng L. K., et al. (2007). Limited genetic diversity in Salmonella enterica serovar Enteritidis PT13. BMC Microbiol. 7:87. 10.1186/1471-2180-7-87 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park S. Y., Woodward C. L., Kubena L. F., Nisbet D. J., Birkhold S. G., Ricke S. C., et al. (2008). Environmental dissemination of foodborne Salmonella in preharvest poultry production: reservoirs, critical factors, and research strategies. Crit. Rev. Environ. Sci. Technol. 38 73–111. 10.1080/10643380701598227 [DOI] [Google Scholar]
- Patané J. S. L., Martins J., Setubal J. C. (2018). “Phylogenomics,” in Comparative Genomics: Methods and Protocols, eds Setubal J. C., Stoye J., Stadler P. F. (New York, NY: Springer; ). [Google Scholar]
- Pearce M. E., Alikhan N. F., Dallman T. J., Zhou Z., Grant K., Maiden M. C. J., et al. (2018). Comparative analysis of core genome MLST and SNP typing within a European Salmonella serovar Enteritidis outbreak. Int. J. Food. Microbiol. 274 1–11. 10.1016/j.ijfoodmicro.2018.02.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Persing D. H., Tenover F. C., Versalovic J., Tang Y. W., Unger E. R., Relman D. A. (2011). Molecular Microbiology: Diagnostic Principles and Practice. Washington, DC: ASM Press. [Google Scholar]
- Petersen A., Aarestrup F. M., Angulo F. J., Wong S., Stöhr K., Wegener H. C., et al. (2002). WHO global salm-surv external quality assurance system (EQAS): an important step toward improving the quality of Salmonella serotyping and antimicrobial susceptibility testing worldwide. Microb. Drug Resist. 8 345–353. 10.1089/10766290260469615 [DOI] [PubMed] [Google Scholar]
- Pightling A. W., Petronella N., Pagotto F. (2014). Choice of reference sequence and assembler for alignment of Listeria monocytogenes short-read sequence data greatly influences rates of error in SNP analyses. PLoS One 9:e104579. 10.1371/journal.pone.0104579 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pightling A. W., Pettengill J. B., Luo Y., Baugher J. D., Rand H., Strain E., et al. (2018). Interpreting whole-genome sequence analyses of foodborne bacteria for regulatory applications and outbreak investigations. Front. Microbiol. 9:1482. 10.3389/fmicb.2018.01482 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pillai S. D., Ricke S. C. (2002). Bioaerosols from municipal and animal wastes: background and contemporary issues. Can. J. Microbiol. 48 681–696. 10.1139/w02-070 [DOI] [PubMed] [Google Scholar]
- Pinho A. J., Bastos C. A. C., Ferreira P. J. S. G., Garcia S. P., Afreixo V. (2009). Genome analysis with inter-nucleotide distances. Bioinformatics 25 3064–3070. 10.1093/bioinformatics/btp546 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Portmann A. C., Fournier C., Gimonet J., Ngom-Bru C., Barretto C., Baert L. (2018). A validation approach of an end-to-end whole genome sequencing workflow for source tracking of Listeria monocytogenes and Salmonella enterica. Front. Microbiol. 9:446. 10.3389/fmicb.2018.00446 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Porwollik S., Boyd E. F., Choy C., Cheng P., Florea L., Proctor E., et al. (2004). Characterization of Salmonella enterica subspecies I genovars by use of microarrays. J. Bacteriol. 186 5883–5898. 10.1128/JB.186.17.5883-5898.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- PulseNet (2013). Standard Operating Procedure for PulseNet PFGE of Escherichia coli O157:H7, Escherichia coli non-O157 (STEC), Salmonella Serotypes, Shigella sonnei and Shigella flexneri. Available at: http://www.pulsenetinternational.org/assets/PulseNet/uploads/pfge/PNL05_Ec-Sal-ShigPFGEprotocol.pdf (accessed March 24, 2013). [Google Scholar]
- PulseNet (2014). PulseNet Europe. Available at: http://www.pulsenetinternational.org/networks/europe/ (accessed November 4, 2014). [Google Scholar]
- PulseNet (2015a). PulseNet Explained. Available at: http://www.pulsenetinternational.org/international/pulsenetexplained/ (accessed May 13, 2015). [Google Scholar]
- PulseNet (2015b). PulseNet Protocols A - Pulse Field Gel Electrophoresis (PFGE). Available at: http://www.pulsenetinternational.org/protocols/pfge/ (accessed October 22, 2015). [Google Scholar]
- PulseNet (2015c). PulseNet Protocols B - Molecular Typing for MLVA. Available at: http://www.pulsenetinternational.org/protocols/mlva/ (accessed May 13, 2015). [Google Scholar]
- Ranieri M. L., Shi C., Switt M. A. I., den Bakker H. C., Wiedmann M. (2013). Comparison of typing methods with a new procedure based on sequence characterization for Salmonella serovar prediction. J. Clin. Microbiol. 51 1786–1797. 10.1128/JCM.03201-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rantsiou K., Kathariou S., Winkler A., Skandamis P., Saint-Cyr M. J., Rouzeau-Szynalski K., et al. (2018). Next generation microbiological risk assessment: opportunities of whole genome sequencing (WGS) for foodborne pathogen surveillance, source tracking and risk assessment. Int. J. Food Microbiol. 287 3–9. 10.1016/j.ijfoodmicro.2017.11.007 [DOI] [PubMed] [Google Scholar]
- Ricke S. C. (2017). Insights and challenges of Salmonella infection of laying hens. Curr. Opin. Food Sci. 18 43–49. 10.1016/j.cofs.2017.10.012 [DOI] [Google Scholar]
- Ricke S. C., Kim S. A., Shi Z., Park S. H. (2018). Molecular-based identification and detection of Salmonella in food production systems: current perspectives. J. Appl. Microbiol. 125 313–327. 10.1111/jam.13888 [DOI] [PubMed] [Google Scholar]
- Robertson J., Yoshida C., Kruczkiewicz P., Nadon C., Nichani A., Taboada E. N., et al. (2018). Comprehensive assessment of the quality of Salmonella whole genome sequence data available in public sequence databases using the Salmonella in silico typing resource (SISTR). Microb. Genom. 4:e000151. 10.1099/mgen.0.000151 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ross I. L., Heuzenroeder M. W. (2005). Discrimination within phenotypically closely related definitive types of Salmonella enterica serovar typhimurium by the multiple amplification of phage locus typing technique. J. Clin. Microbiol. 43 1604–1611. 10.1128/JCM.43.4.1604-1611.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ross I. L., Heuzenroeder M. W. (2008). A comparison of three molecular typing methods for the discrimination of Salmonella enterica serovar Infantis. FEMS Immunol. Med. Microbiol. 53 375–384. 10.1111/j.1574-695X.2008.00435.x [DOI] [PubMed] [Google Scholar]
- Sabat A. J., Budimir A., Nashev D., Sá-Leão R., van Dijl J. M., Laurent F., et al. (2013). Overview of molecular typing methods for outbreak detection and epidemiological surveillance. Euro Surveill. 18:20380. 10.2807/ese.18.04.20380-en [DOI] [PubMed] [Google Scholar]
- Salmonella Subcommittee of the Nomenclature Committee of the International Society for, Microbiology (1934). The genus Salmonella Lignières, 1900. J. Hyg. 34 333–350. 10.1017/s0022172400034677 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sangal V., Harbottle H., Mazzoni C. J., Helmuth R., Guerra B., Didelot X., et al. (2010). Evolution and population structure of Salmonella enterica serovar Newport. J. Bacteriol. 192 6465–6476. 10.1128/JB.00969-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schouls L. M., Spalburg E. C., van Luit M., Huijsdens X. W., Pluister G. N., van Santen-Verheuvel M. J., et al. (2009). Multiple-locus variable number tandem repeat analysis of Staphylococcus aureus: comparison with pulsed-field gel electrophoresis and spa-typing. PLoS One 4:e5082. 10.1371/journal.pone.0005082 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwartz D. C., Cantor C. R. (1984). Separation of yeast chromosome-sized DNAs by pulsed field gradient gel electrophoresis. Cell 37 67–75. 10.1016/0092-8674(84)90301-5 [DOI] [PubMed] [Google Scholar]
- Shariat N., DiMarzio M. J., Yin S., Dettinger L., Sandt C. H., Lute J. R., et al. (2013a). The combination of CRISPR-MVLST and PFGE provides increased discriminatory power for differentiating human clinical isolates of Salmonella enterica subsp. enterica serovar Enteritidis. Food Microbiol. 34 164–173. 10.1016/j.fm.2012.11.012 [DOI] [PubMed] [Google Scholar]
- Shariat N., Sandt C. H., DiMarzio M. J., Barrangou R., Dudley E. G. (2013b). CRISPR-MVLST subtyping of Salmonella enterica subsp. enterica serovars Typhimurium and Heidelberg and application in identifying outbreak isolates. BMC Microbiol. 13:254. 10.1186/1471-2180-13-254 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shariat N., Sandt C. H., Kirchner M. K., Trees E., Barrangou R., Dudley E. G. (2013c). Subtyping of Salmonella enterica serovar Newport outbreak isolates by CRISPR-MVLST and determination of the relationship between CRISPR-MVLST and PFGE results. J. Clin. Microbiol. 51 2328–2336. 10.1128/JCM.00608-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shariat N., Dudley E. G. (2014). CRISPRs: molecular signatures used for pathogen subtyping. Appl. Environ. Microbiol. 80 430–439. 10.1128/AEM.02790-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shi C., Singh P., Ranieri M. L., Wiedmann M., Moreno Switt A. I. (2015). Molecular methods for serovar determination of Salmonella. Crit. Rev. Microbiol. 41 309–325. 10.3109/1040841X.2013.837862 [DOI] [PubMed] [Google Scholar]
- Singh A., Goering R. V., Simjee S., Foley S. L., Zervos M. J. (2006). Application of molecular techniques to the study of hospital infection. Clin. Microbiol. Rev. 19 512–530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soyer Y., Alcaine S. D., Schoonmaker-Bopp D. J., Root T. P., Warnick L. D., McDonough P. L., et al. (2010). Pulsed-field gel electrophoresis diversity of human and bovine clinical Salmonella isolates. Foodborne Pathog. Dis. 7 707–717. 10.1089/fpd.2009.0424 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soyer Y., Moreno Switt A., Davis M. A., Maurer J., McDonough P. L., Schoonmaker-Bopp D. J., et al. (2009). Salmonella enterica serotype 4,5,12:i:-, an emerging Salmonella serotype that represents multiple distinct clones. J. Clin. Microbiol. 47 3546–3556. 10.1128/JCM.00546-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sukhnanand S., Alcaine S., Warnick L. D., Su W. L., Hof J., Craver M. P., et al. (2005). DNA sequence-based subtyping and evolutionary analysis of selected Salmonella enterica serotypes. J. Clin. Microbiol. 43 3688–3698. 10.1128/JCM.43.8.3688-3698.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taylor A. J., Lappi V., Wolfgang W. J., Lapierre P., Palumbo M. J., Medus C. (2015). Characterization of foodborne outbreaks of Salmonella enterica serovar enteritidis with whole-genome sequencing single nucleotide polymorphism-based analysis for surveillance and outbreak detection. J. Clin. Microbiol. 53 3334–3340. 10.1128/JCM.01280-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thong K. L., Ang C. P. (2011). Genotypic and phenotypic differentiation of Salmonella enterica serovar Paratyphi B in Malaysia. Southeast Asian J. Trop. Med. Public Health 42 1178–1189. [PubMed] [Google Scholar]
- Threlfall E. J., Frost J. A. (1990). The identification, typing and fingerprinting of Salmonella: laboratory aspects and epidemiological applications. J. Appl. Bacteriol. 68 5–16. 10.1111/j.1365-2672.1990.tb02542.x [DOI] [PubMed] [Google Scholar]
- Tiong V., Thong K. L., Yusof M. Y., Hanifah Y. A., Sam J. I., Hassan H. (2010). Macrorestriction analysis and antimicrobial susceptibility profiling of Salmonella enterica at a University Teaching Hospital, Kuala Lumpur. Jpn. J. Infect. Dis. 63 317–322. [PubMed] [Google Scholar]
- Top J., Banga N. M., Hayes R., Willems R. J., Bonten M. J., Hayden M. K. (2008). Comparison of multiple-locus variable-number tandem repeat analysis and pulsed-field gel electrophoresis in a setting of polyclonal endemicity of vancomycin-resistant Enterococcus faecium. Clin. Microbiol. Infect. 14 363–369. 10.1111/j.1469-0691.2007.01945.x [DOI] [PubMed] [Google Scholar]
- Torpdahl M., Skov M. N., Sandvang D., Baggesen D. L. (2005). Genotypic characterization of Salmonella by multilocus sequence typing, pulsed-field gel electrophoresis and amplified fragment length polymorphism. J. Microbiol. Methods 63 173–184. 10.1016/j.mimet.2005.03.006 [DOI] [PubMed] [Google Scholar]
- Torpdahl M., Sorensen G., Lindstedt B. A., Nielsen E. M. (2007). Tandem repeat analysis for surveillance of human Salmonella typhimurium infections. Emerg. Infect. Dis. 13 388–395. 10.3201/eid1303.060460 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Belkum A., Tassios P. T., Dijkshoorn L., Haeggman S., Cookson B., Fry N. K., et al. (2007). Guidelines for the validation and application of typing methods for use in bacterial epidemiology. Clin. Microbiol. Infect. 13 1–46. 10.1111/j.1469-0691.2007.01786.x [DOI] [PubMed] [Google Scholar]
- Van Cuyck H., Farbos-Granger A., Leroy P., Yith V., Guillard B., Sarthou J. L., et al. (2011). MLVA polymorphism of Salmonella enterica subspecies isolated from humans, animals, and food in Cambodia. BMC Res. Notes 4:306. 10.1186/1756-0500-4-306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vincent C., Usongo V., Berry C., Tremblay D. M., Moineau S., Yousfi K., et al. (2018). Comparison of advanced whole genome sequence-based methods to distinguish strains of Salmonella enterica serovar Heidelberg involved in foodborne outbreaks in Quebec. Food Microbiol. 73 99–110. 10.1016/j.fm.2018.01.004 [DOI] [PubMed] [Google Scholar]
- Wachsmuth I. K., Kiehlbauch J. A., Bopp C. A., Cameron D. N., Strockbine N. A., Wells J. G., et al. (1991). The use of plasmid profiles and nucleic acid probes in epidemiologic investigations of foodborne, diarrheal diseases. Int. J. Food Microbiol. 12 77–89. 10.1016/0168-1605(91)90049-U [DOI] [PubMed] [Google Scholar]
- Wattiau P., Boland C., Bertrand S. (2011). Methodologies for Salmonella enterica subsp. enterica subtyping: gold standards and alternatives. Appl. Environ. Microbiol. 77 7877–7885. 10.1128/AEM.05527-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weigel R. M., Qiao B., Teferedegne B., Suh D. K., Barber D. A., Isaacson R. E. (2004). Comparison of pulsed field gel electrophoresis and repetitive sequence polymerase chain reaction as genotyping methods for detection of genetic diversity and inferring transmission of Salmonella. Vet. Microbiol. 100 205–217. 10.1016/j.vetmic.2004.02.009 [DOI] [PubMed] [Google Scholar]
- Wiedmann M., Nightingale K. (2009). DNA-based subtyping methods facilitate identification of foodborne pathogens. Food Technol. 63 44–49. [Google Scholar]
- Wiedmann M., Wang S., Post L., Nightingale K. (2014). Assessment criteria and approaches for rapid detection methods to be used in the food industry. J. Food Prot. 77 670–690. 10.4315/0362-028X.JFP-13-138 [DOI] [PubMed] [Google Scholar]
- Wise M. G., Siragusa G. R., Plumblee J., Healy M., Cray P. J., Seal B. S. (2009). Predicting Salmonella enterica serotypes by repetitive sequence-based PCR. J. Microbiol. Methods 76 18–24. 10.1016/j.mimet.2008.09.006 [DOI] [PubMed] [Google Scholar]
- Wright A. V., Liu J. J., Knott G. J., Doxzen K. W., Nogales E., Doudna J. A. (2017). Structures of the CRISPR genome integration complex. Science 357 1113–1118. 10.1126/science.aao0679 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu S., Ricke S. C., Schneider K. R., Ahn S. (2017). Food safety hazards associated with ready-to-bake cookie dough and its ingredients. Food Control 73 986–993. 10.1016/j.foodcont.2016.10.010 [DOI] [Google Scholar]
- Wuyts V., Mattheus W., De Laminne de Bex G., Wildemauwe C., Roosens N. H., Marchal K., et al. (2013). MLVA as a tool for public health surveillance of human Salmonella typhimurium: prospective study in Belgium and evaluation of MLVA loci stability. PLoS One 8:e84055. 10.1371/journal.pone.0084055 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wyres K., Conway T., Garg S., Queiroz C., Reumann M., Holt K. (2014). WGS analysis and interpretation in clinical and public health microbiology laboratories: what are the requirements and how do existing tools compare? Pathogens 3 437–458. 10.3390/pathogens3020437 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xiong D., Song L., Pan Z., Jiao X. (2018). Identification and discrimination of Salmonella enterica serovar gallinarum biovars pullorum and gallinarum based on a one-step multiplex PCR assay. Front. Microbiol. 9:1718. 10.3389/fmicb.2018.01718 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu L., Liu Z., Li Y., Yin C., Hu Y., Xie X., et al. (2018). A rapid method to identify Salmonella enterica serovar Gallinarum biovar Pullorum using a specific target gene ipaJ. Avian Pathol. 47 238–244. 10.1080/03079457.2017.1412084 [DOI] [PubMed] [Google Scholar]
- Xu Y., Hu Y., Guo Y., Zhou Z., Xiong D., Meng C., et al. (2018). A new PCR assay based on the new gene-SPUL_2693 for rapid detection of Salmonella enterica subsp. enterica serovar Gallinarum biovars Gallinarum and Pullorum. Poult. Sci. 97 4000–4007. 10.3382/ps/pey254 [DOI] [PubMed] [Google Scholar]
- Yoshida C., Ahmad A., Gurnik S., Blimkie T., Murphy S. A., Kropinski A. M. (2016a). Evaluation of molecular methods for identification of Salmonella serovars. J. Clin. Microbiol. 54 1992–1998. 10.1128/JCM.00262-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yoshida C., Kruczkiewicz P., Laing C. R., Lingohr E. J., Gannon V. P., Nash J. H. (2016b). The Salmonella in silico typing resource (SISTR): an open web-accessible tool for rapidly typing and subtyping draft Salmonella genome assemblies. PLoS One 11:e0147101. 10.1371/journal.pone.0147101 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yoshida C., Lingohr E. J., Trognitz F., MacLaren N., Rosano A., Murphy S. A., et al. (2014). Multi-laboratory evaluation of the rapid genoserotyping array (SGSA) for the identification of Salmonella serovars. Diagn. Microbiol. Infect. Dis. 80 185–190. 10.1016/j.diagmicrobio.2014.08.006 [DOI] [PubMed] [Google Scholar]
- Yun Y. S., Chae S. J., Na H. Y., Chung G. T., Yoo C. K., Lee D. Y. (2015). Modified method of multilocus sequence typing (MLST) for serotyping in Salmonella species. J. Bacteriol. Virol. 45 314–318. 10.4167/jbv.2015.45.4.314 [DOI] [Google Scholar]
- Zhang S., Yin Y., Jones M. B., Zhang Z., Kaiser D. B. L., Dinsmore B. A., et al. (2015). Salmonella serotype determination utilizing high-throughput genome sequencing data. J. Clin. Microbiol. 53 1685–1692. 10.1128/JCM.00323-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng J., Keys C. E., Zhao S., Ahmed R., Meng J., Brown E. W. (2011). Simultaneous analysis of multiple enzymes increases accuracy of pulsed-field gel electrophoresis in assigning genetic relationships among homogeneous Salmonella strains. J. Clin. Microbiol. 49 85–94. 10.1128/JCM.00120-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng J., Keys C. E., Zhao S., Meng J., Brown E. W. (2007). Enhanced subtyping scheme for Salmonella enteritidis. Emerg. Infect. Dis. 13 1932–1935. 10.3201/eid1312.070185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu C., Yue M., Rankin S., Weill F. X., Frey J., Dieter Schifferli M. (2015). One-step identification of five prominent chicken Salmonella serovars and biotypes. J. Clin. Microbiol. 53 3881–3883. 10.1128/JCM.01976-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zou W., Lin W. J., Foley S. L., Chen C. H., Nayak R., Chen J. J., et al. (2010). Evaluation of pulsed-field gel electrophoresis profiles for identification of Salmonella serotypes. J. Clin. Microbiol. 48 3122–3126. 10.1128/JCM.00645-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zou W., Tang H., Zhao W., Meehan J., Foley S. L., Lin W., et al. (2013). Data mining tools for Salmonella characterization: application to gel-based fingerprinting analysis. BMC Bioinformatics 14 (Suppl. 14):S15. 10.1186/1471-2105-14-S14-S15 [DOI] [PMC free article] [PubMed] [Google Scholar]