Skip to main content
Microbial Genomics logoLink to Microbial Genomics
. 2022 Nov 24;8(11):mgen000884. doi: 10.1099/mgen.0.000884

Novel application of metagenomics for the strain-level detection of bacterial contaminants within non-sterile industrial products – a retrospective, real-time analysis

Edward Cunningham-Oakes 1,2,*, Tom Pointon 3,4, Barry Murphy 3, Stuart Campbell-Lee 3, Thomas R Connor 1, Eshwar Mahenthiralingam 1,*
PMCID: PMC9836090  PMID: 36748522

Abstract

The home and personal care (HPC) industry generally relies on initial cultivation and subsequent biochemical testing for the identification of microorganisms in contaminated products. This process is slow (several days for growth), labour intensive, and misses organisms which fail to revive from the harsh environment of preserved consumer products. Since manufacturing within the HPC industry is high-throughput, the process of identification of microbial contamination could benefit from the multiple cultivation-independent methodologies that have developed for the detection and analysis of microbes. We describe a novel workflow starting with automated DNA extraction directly from a HPC product, and subsequently applying metagenomic methodologies for species and strain-level identification of bacteria. The workflow was validated by application to a historic microbial contamination of a general-purpose cleaner (GPC). A single strain of Pseudomonas oleovorans was detected metagenomically within the product. The metagenome mirrored that of a contaminant isolated in parallel by a traditional cultivation-based approach. Using a dilution series of the incident sample, we also provide evidence to show that the workflow enables detection of contaminant organisms down to 100 CFU/ml of product. To our knowledge, this is the first validated example of metagenomics analysis providing confirmatory evidence of a traditionally isolated contaminant organism, in a HPC product.

Keywords: microbial contamination, metagenomics, strain identification, bioinformatics, metagenome binning

Data Summary

The authors confirm that all supporting data, code and protocols have been provided within the article. Illumina raw sequence reads have been deposited in the European Nucleotide Archive (ENA) under ENA project accession number PRJEB53377.

Impact Statement.

Home and personal care (HPC) product microbiology is an area where the rapid and early detection of contamination is essential to protect the health of the consumer, and minimise economic and reputational damage to the industry, if contaminants are pathogenic microbes. In contrast to the food industry, where sequencing is routinely used to monitor outbreaks of organisms such as Salmonella , Escherichia coli (STEC), and Listeria monocytogenes , the use of genomics for bacterial surveillance in the HPC industry is relatively unexplored. The metagenomics workflow presented in this study benefits both industry and the consumer, by providing risk understanding, and comparative data for long-term surveillance of industrial contamination. Moreover, the study also demonstrates the ability to expand the traditional microbiology toolkit currently used by the HPC industry, in a way that has the potential to improve speed, taxonomic accuracy and sensitivity of detection.

Introduction

Industries that manufacture home and personal care (HPC) products, such as body wash and dish wash liquid, are major users of antimicrobial compounds in the form of preservatives [1]. Preservatives are used because HPC products are largely based on water and other components such as nutrients, oils and proteins that readily support the growth of microorganisms [2]. Regulatory guidelines state that these products are not expected to be sterile, but should neither contain specified microorganisms with the potential to affect product quality and consumer safety, nor permit the growth of any non-specified microorganisms [3]. Preservatives are therefore used to limit microbial growth in HPC products and preserve the shelf life of the product [3] which could potentially be contaminated at multiple stages, including inadvertently during manufacture, and during consumer use via repeated microbial challenge from the home environment and skin flora [4, 5].

However, the HPC industry is facing increasing pressure in the face of regulatory, safety and public relation pressures, which limit the available palette of preservative chemistries [3, 6]. Intrinsically preservative-tolerant Gram-negative bacteria, which may occasionally overcome product preservation and cause contamination during manufacture or long-term storage, add further complexity to the preservation challenge [7]. Bacterial species of the genera Pseudomonas and Burkholderia have also been shown to acquire a tolerant phenotype following prolonged exposure to preservatives [1, 8]. ISO 11930 and European Pharmacopoeia challenge testing methods require an absence of Pseudomonas aeruginosa from non-food products, while Burkholderia are known opportunistic pathogens in cystic fibrosis patients and immunocompromised individuals [9, 10]. Thus, early detection of bacterial contamination is critical to protect consumers and industry.

Traditionally, the identification of organisms has revolved around cultivation-dependent methods that assess phenotype [11]. This involves subjecting microorganisms to a plethora of biochemical and physiological assays, observation of growth characteristics on selective and differential media, and assessing antibiotic susceptibility and microscopic morphology [12, 13]. These and other culture-based techniques are currently the mainstay of identification in industrial settings, in part due to their ease of use, but also owing to acceptance and regulatory requirements [3]. An increasing number of non-colony-forming unit (CFU) detection methods such as ATP-bioluminescence, impedance, flow cytometry and PCR-based assays have been introduced to areas such as food microbiology in order to screen for contamination [14–16]. While these methods can be complementary to CFU-based alternatives [17], techniques such as ATP-bioluminescence still require a culturing step to reliably distinguish between different microorganisms [18]. These methods also do not resolve bacterial identification to the strain-level which, in the context of the changing landscape of HPC manufacturing, is useful for surveillance of contamination.

The use of cultivation-independent techniques is relatively unexplored territory in the HPC industry. However, the aforementioned research, alongside a desire for detection limits of 100 CFU/ml or less [3], suggest that a cultivation-independent detection methodology that can be implemented in a high-throughput manner would be desirable in an industrial environment. Metagenomic approaches also have the benefit of generating genomic DNA sequence data which can provide accurate taxonomic identification of microbial contaminants, a gap which needs to be addressed in the international product recall databases which report spoilage incidents [7]. Genomic-level taxonomy is also vital for the accurate resolution of Burkholderia cepacia complex contaminants which belong to novel species groups such as Burkholderia aenigmatica , for which techniques such as 16S ribosomal RNA (rRNA) gene sequencing provide insufficient resolution for accurate identification [19]. With extensive DNA databases in place, genomic identification also enables strain-level identification of bacterial contaminants [19].

Herein, we describe a novel cultivation-independent workflow, using a one-step automated DNA extraction method to extract directly from a HPC product, followed by next-generation sequencing for species and strain-level identification of bacteria. The workflow was applied to a historic microbial contamination of a general-purpose cleaner (GPC). In parallel, traditional, cultivation-based microbiology followed by genome sequencing of the isolated bacterium was used to validate our metagenomic data and show that the workflow offers an improvement over existing methodology. Finally, we also provide evidence that the metagenomics protocol enabled detection of the contaminating bacterial strain down to 100 CFU/ml. This unique and direct application of metagenomics for detection of HPC product contaminating microorganisms has multiple benefits for the surveillance and understanding of preservative-tolerant bacteria which occur as industrial contaminants.

Methods

Cultivation-dependent and independent DNA extraction

A workflow enabling parallel metagenomic sequencing and bacterial cultivation analysis of a HPC contaminant was developed (Fig. 1). Bacterial isolates were cultivated by plating serial dilutions in sterile deionised water (to 10−7; 50 µl drops) of the GPC onto a nutrient-rich agar, tryptic soya agar (TSA), and a minimal agar, Reasoner’s 2A (R2A) agar, to account for the presence of slow growing organisms. The same dilution series enabled colony counts to be performed and enumeration of the detection limits of the metagenomics in comparison to cultivation (see below). Colonies isolated from TSA were inoculated into 3 ml tryptic soya broth for overnight culture (18 h), while colonies from R2A were inoculated into 3 ml R2A broth. Overnight cultures were pelleted, then suspended in 300 µl of the lysis agent Guanidinium isothiocyanate (4M concentration), before extraction. The GPC dilution series used for counts was also mixed in a 1 : 1 ratio with Guanidinium isothiocyanate, before 400 µl was transferred for cultivation-independent DNA extraction. A series of dilutions were also performed using a non-contaminated GPC, and DNA extraction cartridges from the same kit used to extract both cultures, and contaminated GPC. The dilutions and extractions were performed identically to those performed for the contaminated product and served as controls to enable the removal of background DNA within the generic product formulation or extraction kit (referred to herein as the ‘productome’ and ‘kitome’ respectively).

Fig. 1.

Fig. 1.

Workflow used for cultivation-dependent and cultivation-independent detection of contaminant organisms in this study. Genomes chosen from reference databases can be changed based on known organisms of concern, a priori knowledge, or site specific needs. Parts of this workflow that are exclusive to cultivation-independent (purple) and traditional microbiology (green) arms of analysis are highlighted.

In order to validate the assumption that Pseudomonas -assigned reads from the contaminated product were distinguishable from those in the kitome and productome, raw reads were annotated using Kraken2 (v2.1.2) [20]. Assignments were made using a custom database, containing RefSeq complete genomes and proteins for archaea, bacteria, human, fungi, viruses, plants, and protozoa, as well as RefSeq complete plasmid nucleotide and protein sequences, and a subset of the NCBI UniVec database. A confidence threshold of 0.1 was set for read assignments, and reports were generated for downstream biom file generation. Kraken-biom (v1.0.1) [21] was used to generate a biom file from Kraken2 report files. Biom (v2.1.6) [22] was then used to assign tabulated metadata to this biom file. Ordination plots representing taxa assigned to reads were then visualised using Phyloseq (v1.36.0) [23], with non-metric multidimensional scaling (NMDS) and Bray-Curtis dissimilarity.

Sample processing was performed during: (i) the week of contamination (week 1), and (ii) repeated 2 weeks post-contamination (week 3); this was used to detect whether the contamination signatures were stable or increased over time. All culture and product DNA extractions were then sequenced on a NovaSeq 6000, using 150 bp paired-end sequencing by a commercial sequencing provider (Novogene, Cambridge, UK) as described previously [19].

Establishing a reference database of complete Pseudomonas genomes

Initial 16S rRNA gene sequence-based identification of the bacterial contamination of the GPC highlighted P. aeruginosa as a potential cause; this analysis was performed by neutralising the product in peptone-tween, before plating onto TSA. Once growth was recovered, colonies were then washed in nuclease-free water, and then suspended in absolute ethanol. This suspension was then sent to a third party for DNA extraction and 16S rRNA identification. Based on this, we opted to generate a reference Pseudomonas genome database, against which we could compare both the genomes from culture, and metagenome-assembled genomes (MAGs). This database was generated by obtaining all complete genomes available from The Pseudomonas Genome database [24] (n=612). This reference database was further simplified by using dRep (v3.0.0) [25] to identify redundant genomes within the dataset, leaving only representative genomes (n=288). Finally, sequencing reads generated from isolated pure bacterial colonies on week 1 and week 3 were used to generate draft assemblies of the contaminant organism, using SPAdes (v3.15.3) [26] in isolate mode. The draft assemblies were then corrected using Pilon (v1.24) [27] and dereplicated using dRep, resulting in one representative genome, from colonies grown on TSA at week 3. This representative draft genome was amalgamated with the simplified Pseudomonas Genome database genomes to make the final reference database (n=289). To confirm that dereplication to a single representative genome was correct, Average Nucleotide Identity (ANI) analysis was also performed using PyANI (v0.2.11) [28] with MUMmer as an alignment method (ANIm). This enabled confirmation that the draft genomes generated from culture at each timepoint were, at a minimum, identical species (95 % or greater nucleotide identity in comparison to each other).

Detection limits of metagenome binning for species-level contaminant identification

In order to standardise analysis as far as possible, we made use of the MetaWRAP (v1.3.2) pipeline [29], which functions as a wrapper around multiple utilities for metagenome analysis. The pipeline was deployed in a dedicated conda environment, using the ‘manual installation’ guide (see https://github.com/bxlab/metaWRAP). For each control dilution, decoy metagenomes were assembled using MetaSPAdes (v3.15.3) [30]. These assemblies were then used with the MetaWRAP ‘read_qc’ module to remove any productome or kitome-aligned reads from the contaminated GPC reads (e.g. the 10−1 control dilution was used to remove kitome reads from the 10−1 dilution of the contaminated GPC). The remaining reads from the contaminated GPC were then assembled using MetaSPAdes, before being binned with the ‘binning’ module of MetaWRAP, using CONCOCT [31], MaxBin 2.0 [32] and MetaBAT 2 [33] as binning methods. The quality of any resulting bins was then assessed using CheckM (v1.0.12) [34]. Any bins that fell below the threshold required for a medium-quality draft MAG [35] (over 50 % completion and less than 10 % contamination), were removed, using the ‘bin_refinement’ module of MetaWRAP. All bins were then compared to the simplified Pseudomonas genome database using FastANI (v1.32) [36] to run a faster, less-computationally intensive analysis to identify the most related genomes to the contaminant organism. This analysis was then used as a basis to obtain the genome for appropriate type strains. Final species identity was then confirmed by comparing all binned metagenomes and culture genomes to the identified type strains, using PyANI.

PopANI detection limits for cultivation-independent strain-level identification

We aimed to determine whether the cultivation-independent workflow devised was sufficient to i) accurately strain-type a contaminant organism via recruitment of metagenomic reads to the correct reference genome and ii) determine the detection limits of this methodology. To achieve this, the productome and kitome-subtracted reads for all dilutions were then recruited to our Pseudomonas genome database (described below). Reads were first mapped to all reference genomes using Bowtie 2 (v2.4.5) [37]. The genomes of identical strains were then identified within the database by using inStrain (v1.3.4) [38] and population ANI (popANI) metric, which uses a stringent cut-off of 99.999 % as the definition of an identical strain. Several metrics were generated by inStrain as follows: Coverage overlap= percentage of bases that are either covered or not covered in both of the profiles (Formula=length[coveredInBoth] / length[coveredInEither]); Compared base count=The number of considered bases; popANI=The Average Nucleotide Identity among compared bases between the two scaffolds (Formula = [compared_bases_count - population_SNPs] / compared_bases_count); Compared=percentage of the genome compared. The breadth and depth of coverage in higher-quality samples was assessed by i) using BWA (v0.7.17) [39] with samtools (v1.13) [40] to map to our representative genome from culture (described above), and ii) using the coverage command within samtools to provide coverage statistics.

Results

A single bacterial colony type was present in the general-purpose cleaner at >104 CFU/ml

The workflow employed both cultivation-independent and growth-based microbiology in parallel (Fig. 1). Traditional microbiology enabled the background level of bacterial contamination within the GPC to be enumerated and morphologically characterised as follows. Counts performed prior to DNA extraction indicated that a single colony type observed on TSA was present in the contaminated GPC at 3.60×105 CFU/ml in week 1, and 2.44×105 CFU/ml in week 3. The second pure morphology observed on R2A was present at 6.52×104 CFU/ml at week 1, and 8.86×104 CFU/ml at week 3. Conversely, nothing was cultivated from the control GPC. Assembly of the genomic DNA reads from the pure cultures on TSA and R2A across the timepoints resulted in four draft genomes of total length ranging from 4.98 to 5.01 Mbp. ANIm confirmed that all the genomes were near taxonomically identical (99.9–100 % identity when compared to each other; Fig. 2). As the organisms isolated on each media type were identical, TSA was taken as the optimal nutrient-rich growth medium for the contaminant organism. As such, all cultivation-independent detection limit calculations in subsequent analyses were made in reference to counts observed on TSA for each time point.

Fig. 2.

Fig. 2.

Genomic taxonomy of the HPC contaminant strain. ANIm analysis, showing pairwise comparisons between MAGs, assemblies from culture, and the P. oleovorans type strain, NCTC 10692. This analysis shows that i) cultivation-independent MAGs and those assembled from culture-based sequencing data are nearly identical and ii) that the species identity of the contaminant organism was P. oleovorans . The darker the shade of red between pairwise comparisons, the higher the percentage shared nucleotide identity. Colours around the outside of the heatmap are used to delineate the positions of individual samples on the heatmap.

Unsupervised metagenome binning enabled reliable identification of the contaminant species group down to a presence of 103 CFU/ml

In parallel to the traditional microbial cultivation and enumeration of the bacterial contaminant within the GPC, the novel metagenomic workflow (Fig. 1) demonstrated the following characteristics for the bacterial DNA within the sample. Initial metagenome binning produced bins for dilutions down to 101 CFU/ml across all binning methods. For dilutions beyond this, no bins were produced. Subsequent refinement of bins based on MIMAG standards [34] removed any bins from dilutions containing counts under 103 CFU/ml (see Table 1). The closest matching species for both the pure culture assemblies and MIMAG-quality metagenome bins was a Pseudomonas oleovorans genome (Accession: GCF_000953455.1, 96.8–97.0 % identity by FastANI). A subsequent comparison to the type strain genome P. oleovorans NCTC 10692 (Accession: GCF_900455615.1) via ANIm confirmed this species identity, which was consistent across all isolated culture genomes and MAGs (96.6–96.7 % identity, see Fig. 2). This also demonstrated that the initial 16S rRNA gene based molecular identification as Pseudomonas aeruginosa did not have sufficient resolution for accurate identification of the contaminant.

Table 1.

Genome completion metrics for MAGs from cultivation-independent DNA samples taken from the contaminated GPC. Key: Timepoint=timepoint at which DNA extraction was performed; Completeness=estimate of MAG completeness, based on presence/absence of single-copy genes; Contamination=estimate of MAG contamination, based on whether a single copy gene appears more than once; Mean coverage breadth=percentage of bases sequenced across the target genome; Mean coverage depth=average number of times the bases of the target genome have sequenced in a sample; GC=MAG GC content, Lineage=estimation of taxonomy, based on lineage-specific marker genes; N50=shortest contig for which longer and equal length contigs cover at least 50 % of the assembly; Size=total MAG size

Dilution

Timepoint

Completeness (%)

Contamination (%)

Mean coverage breadth (%)

Mean coverage depth (fold)

GC (%)

Lineage

N50 (bp)

Size (bp)

NEAT

Week 1

97.51

1.825

98.45

336

62.6

Pseudomonadales

54 936

4 434 208

NEAT

Week 3

96.91

1.501

98.47

355

62.6

Pseudomonadales

53 432

4 359 468

101

Week 1

96.71

1.906

98.31

202

62.6

Pseudomonadales

51 769

4 351 642

101

Week 3

96.5

2.718

97.50

23.8

61.9

Pseudomonadales

19 205

4 801 974

102

Week 1

78.79

0

97.46

14.4

62.7

Bacteria

15 061

4 189 333

102

Week 3

71.89

0

97.03

15.2

62.8

Bacteria

15 503

4 116 775

PopANI improves cultivation-independent detection limits to 102 CFU/ml

Microdiversity analysis using inStrain [37] confirmed that the same isolate cultured from the contaminated GPC could be detected in our cultivation-independent sequencing data (>99.999 % popANI). Data from samples containing ≥103 CFU/ml produced data with a high breadth and depth of coverage (see Table S1, available in the online version of this article). This was also evidenced in the unsupervised binning approach, where coverage was sufficient to produce MAGs from these samples. The organism detected in these samples were nearly identical in terms of popANI and shared a high breadth of coverage between samples (95.05–99.80 % – see Fig. 3 and Table S1). These samples were also placed into a single strain cluster, showing that they were identical to the cultured pure isolate.

Fig. 3.

Fig. 3.

Dendrogram comparing all cultivation-independent samples based on popANI (a) and shared bases (b). All samples are labelled with both their cell counts (CFU/ml) relative to counts performed on TSA, and time point at which the sample was taken.

The closest matching reference genome using popANI was a Pseudomonas oleovorans genome (Accession: GCF_000953455.1), which was also identified as the closest match using binning and ANIm approaches. The popANI approach also enabled the detection of the cultivated isolate at week 1 in the sample containing ~102 CFU/ml (see Table S2). The coverage depth of this sample was comparatively low (2.68×), thus preventing meaningful comparison to samples with higher cell counts and coverage. However, this sample possessed 99.999 % identity to the cultured isolate genome, with a high breadth of coverage (79.20%). Moreover, no matches were seen when compared to other genomes within the simplified Pseudomonas database, including the closest matching P. oleovorans genome seen for other samples. This signal was absent at the same dilution by week 3. This finding was supported by our ordination analysis, (see Fig. S1), which showed that Pseudomonas -assigned reads from the contaminant organism and kitome/productome were distinguishable down to a dilution factor of 10−3.

Discussion

The workflow presented shows that cultivation-independent, automated DNA extraction can be applied directly to HPC products and provides a promising alternative to cultivation-based methodologies for contamination surveillance within preserved industrial products. It can also overcome issues seen in growth-based approaches, which can take up to 7 days for microorganism recovery [41] and are potentially inaccurate depending on the methodology used for identification [19]. Moreover, the use of genomic methodologies such as ANIm and popANI reliably enables detection of down to 103 CFU/ml and identification with species or consensus MAG-level accuracy, respectively.

It is important to note that this analysis does not come without limitations. Firstly, the use of 16S rRNA sequencing from a cultured isolate in this study allowed us to limit the scope of our genomic database to Pseudomonas . For this analysis to be truly cultivation-independent, another methodology, such as further interrogation of the k-mer based taxa-assignment results (see Fig. S1), would need to be used to focus analysis towards the likely taxonomy of the contaminant organism. Second, assuming a closely related genome is present within the reference database used, it is possible to extend this detection limit down to 102 CFU/ml, thereby meeting lower detection thresholds required for certain HPC product types [3]. However, this detection limit was only achieved due to the acquisition of the genome from the cultured contaminant and its presence in our database; coverage was too low to identify the closest matching reference species without this. Thus, in instances of low coverage, an exactly matching genome is required to achieve any form of detection. Future work should look to improve coverage from samples with lower cell counts, by investigating methods such as organism-specific amplification steps before sequencing [42]. This would, in turn, enable species identification in the absence of a reference genome that perfectly matches a contaminant organism.

The use of low-input library preparation for both short [43] and long-read [44] sequencing technologies has previously seen success when applied to metagenomics. Such approaches could improve coverage in HPC products with low-level contaminant bioburden, thereby improving the sensitivity of detection [45]. All future work should also seek to validate this workflow further, via application to larger numbers of samples containing a range of organisms of interest to the industry [7], and sample types reflecting the wide variety of products sold to consumers [46]. Methods which attempt to distinguish between viable and non-viable recoveries (e.g. viability PCR) [47], should also be considered. The current workflow does not make this distinction, which would be necessary in cases where background DNA of a related microorganism is abundant. An ideal workflow should also consider emerging cost-saving methodologies [48] to make routine implementation feasible. If the cost could be reduced, then regular surveillance of production facilities could be considered by manufacturers. The workflow could then be used to determine whether there is a build-up of bacterial DNA within industrial product types and could then act as an early warning of a potential manufacturing incident. Implementation will also require this workflow and downstream analysis to be entirely automated, to address both the need for standardisation, and gaps in expertise.

Overall, the HPC industry is undergoing considerable change to reduce environmental impact. This includes working with consumers to incorporate desired natural and milder preservatives and moving away from single-use plastic packaging towards consumer refillable approaches. These changes mean that the detection of microbial contamination will become more vital than ever within the industry, especially since preservative-tolerant bacteria such as Pseudomonas , Burkholderia and the Enterobacteriaceae are frequently found as problematic HPC product contaminants [7]. The sensitive and accurate metagenomic workflow provided here could be highly beneficial in enabling industry-level surveillance of microbial contamination going forward.

Supplementary Data

Supplementary material 1
Supplementary material 2

Funding information

This work was supported by the Biotechnology and Biological Sciences Research Council (BBSRC) and Unilever R and D industrial CASE studentship award, BB/M009122/1.

Acknowledgements

All library preparation and sequencing was performed by Novogene UK. All analysis was performed using the Cloud Infrastructure for Microbial Bioinformatics, funded by Medical Research Council grant MR/L015080/1. We thank Dr Rebecca Weiser for her contributions to the preliminary DNA extraction methodology.

Author contributions

We describe author contributions to the paper using the CRedit taxonomy. Conceptualization: E.C.-O. and E.M. Data curation: E.C.-O. Formal analysis: E.C.-O. Funding acquisition: E.M., T.R.C., T.P. and B.M. Investigation: E.C.-O. and E.M. Methodology: E.C.-O; R.W. Project administration: E.M. Resources: E.M., T.R.C. Software: E.C.-O. Supervision: E.M., T.R.C., T.P., S.C.-L. and B.M. Validation: E.C.-O. Visualization: E.C.-O. Writing – original draft: E.C.-O. and E.M. Writing – review and editing: all authors.

Conflicts of interest

At the time of analysis, T.P. was an employee of Unilever R and D, and B.M. and S.C.-L. are current employees of Unilever R and D, the company that part-funded the studentship to E.C.-O. The remaining authors have no conflicts of interest.

Footnotes

Abbreviations: ANI, average nucleotide identity; ANIm, average nucleotide identity, using MUMmer as an alignment method; ATP, adenosine 5′-triphosphate; CFU, colony-forming unit; DNA, deoxyribonucleic acid; GPC, general-purpose cleaner; HPC, home and personal care; ISO, International Organization for Standardization; MAG, metagenome-assembled genome; NMDS, non-metric multidimensional scaling; PCR, polymerase chain reaction; popANI, population ANI; R2A, Reasoner's 2A; RNA, ribonucleic acid; rRNA, ribosomal RNA; TSA, tryptic soya agar.

All supporting data, code and protocols have been provided within the article or through supplementary data files. One supplementary figure and two supplementary tables are available with the online version of this article.

References

  • 1.Rushton L, Sass A, Baldwin A, Dowson CG, Donoghue D, et al. Key role for efflux in the preservative susceptibility and adaptive resistance of Burkholderia cepacia complex bacteria. Antimicrob Agents Chemother. 2013;57:2972–2980. doi: 10.1128/AAC.00140-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Smart R, Spooner DF. Microbiological spoilage in pharmaceuticals and cosmetics. J Soc Cosmet Chem Soc Cosmet Chem Gt Britain. 1972;23:721–737. [Google Scholar]
  • 3.Halla N, Fernandes IP, Heleno SA, Costa P, Boucherit-Otmani Z, et al. Cosmetics preservation: a review on present strategies. Molecules. 2018;23:E1571. doi: 10.3390/molecules23071571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sutton S. Cosmetic Microbiology: A Practical Approach. 2006. Epub ahead of print 2006. [DOI] [Google Scholar]
  • 5.Orth DS, Kabara JJ, Denyer SP, Tan SK. Cosmetic and drug microbiology. New York, USA: Informa Healthcare; 2006. [Google Scholar]
  • 6.Silva V, Silva C, Soares P, Garrido EM, Borges F, et al. Isothiazolinone Biocides: Chemistry, Biological, and Toxicity Profiles. Molecules. 2020;25:991. doi: 10.3390/molecules25040991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Cunningham-Oakes E, Weiser R, Pointon T, Mahenthiralingam E. Understanding the challenges of non-food industrial product contamination. FEMS Microbiol Lett. 2019;366:fnaa010. doi: 10.1093/femsle/fnaa010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Abdel Malek SMA, Badran YR. Pseudomonas aeruginosa PAO1 adapted to 2-phenoxyethanol shows cross-resistance to dissimilar biocides and increased susceptibility to antibiotics. Folia Microbiol (Praha) 2010;55:588–592. doi: 10.1007/s12223-010-0094-6. [DOI] [PubMed] [Google Scholar]
  • 9.Zlosnik JEA, Zhou G, Brant R, Henry DA, Hird TJ, et al. Burkholderia species infections in patients with cystic fibrosis in British Columbia, Canada. 30 years’ experience. Ann Am Thorac Soc. 2015;12:70–78. doi: 10.1513/AnnalsATS.201408-395OC. [DOI] [PubMed] [Google Scholar]
  • 10.Ashish A, Shaw M, Winstanley C, Ledson MJ, Walshaw MJ. Increasing resistance of the Liverpool Epidemic Strain (LES) of Pseudomonas aeruginosa (Psa) to antibiotics in cystic fibrosis (CF)--A cause for concern? J Cyst Fibros. 2012;11:173–179. doi: 10.1016/j.jcf.2011.11.004. [DOI] [PubMed] [Google Scholar]
  • 11.Laupland KB, Valiquette L. The changing culture of the microbiology laboratory. Can J Infect Dis Med Microbiol. 2013;24:125–128. doi: 10.1155/2013/101630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Woo PCY, Lau SKP, Teng JLL, Tse H, Yuen KY. Then and now: use of 16S rDNA gene sequencing for bacterial identification and discovery of novel bacteria in clinical microbiology laboratories. Clin Microbiol Infect. 2008;14:908–934. doi: 10.1111/j.1469-0691.2008.02070.x. [DOI] [PubMed] [Google Scholar]
  • 13.Henry DA, Mahenthiralingam E, Vandamme P, Coenye T, Speert DP. Phenotypic methods for determining genomovar status of the Burkholderia cepacia complex. J Clin Microbiol. 2001;39:1073–1078. doi: 10.1128/JCM.39.3.1073-1078.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Postollec F, Falentin H, Pavan S, Combrisson J, Sohier D. Recent advances in quantitative PCR (qPCR) applications in food microbiology. Food Microbiol. 2011;28:848–861. doi: 10.1016/j.fm.2011.02.008. [DOI] [PubMed] [Google Scholar]
  • 15.van Belkum A, Durand G, Peyret M, Chatellier S, Zambardi G, et al. Rapid clinical bacteriology and its future impact. Ann Lab Med. 2013;33:14–27. doi: 10.3343/alm.2013.33.1.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Maukonen J, Mättö J, Wirtanen G, Raaska L, Mattila-Sandholm T, et al. Methodologies for the characterization of microbes in industrial environments: A review. J Ind Microbiol Biotechnol. 2003;30:327–356. doi: 10.1007/s10295-003-0056-y. [DOI] [PubMed] [Google Scholar]
  • 17.Rubio E, Zboromyrska Y, Bosch J, Fernandez-Pittol MJ, Fidalgo BI, et al. Evaluation of flow cytometry for the detection of bacteria in biological fluids. PLoS One. 2019;14:e0220307. doi: 10.1371/journal.pone.0220307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Linklater N, Örmeci B. Evaluation of the adenosine triphosphate (ATP) bioluminescence assay for monitoring effluent quality and disinfection performance. Water Quality Research Journal. 2014;49:114–123. doi: 10.2166/wqrjc.2013.110. [DOI] [Google Scholar]
  • 19.Cunningham-Oakes E, Pointon T, Murphy B, Campbell-Lee S, Webster G. Genomics reveals the novel species placement of industrial contaminant isolates incorrectly identified as Burkholderia lata . Microb Genom. 2021;7:000564. doi: 10.1099/mgen.0.000564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20:257. doi: 10.1186/s13059-019-1891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zampieri A, Babbucci M, Carraro L, Milan M, Fasolato L, et al. Combining Culture-Dependent and Culture-Independent Methods: New Methodology Insight on the Vibrio Community of Ruditapes philippinarum . Foods. 2021;10:1271. doi: 10.3390/foods10061271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.McDonald D, Clemente JC, Kuczynski J, Rideout JR, Stombaugh J, et al. The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome. GigaSci. 2012;1:7. doi: 10.1186/2047-217X-1-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.McMurdie PJ, Holmes S, Watson M. phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS ONE. 2013;8:e61217. doi: 10.1371/journal.pone.0061217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Winsor GL, Griffiths EJ, Lo R, Dhillon BK, Shay JA, et al. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database. Nucleic Acids Res. 2016;44:D646–53. doi: 10.1093/nar/gkv1227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Olm MR, Brown CT, Brooks B, Banfield JF. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 2017;11:2864–2868. doi: 10.1038/ismej.2017.126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Pritchard L, Glover RH, Humphris S, Elphinstone JG, Toth IK. Genomics and taxonomy in diagnostics for food security: soft-rotting enterobacterial plant pathogens. Anal Methods. 2016;8:12–24. doi: 10.1039/C5AY02550H. [DOI] [Google Scholar]
  • 29.Uritskiy GV, DiRuggiero J, Taylor J. MetaWRAP-a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome. 2018;6:158. doi: 10.1186/s40168-018-0541-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27:824–834. doi: 10.1101/gr.213959.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, et al. Binning metagenomic contigs by coverage and composition. Nat Methods. 2014;11:1144–1146. doi: 10.1038/nmeth.3103. [DOI] [PubMed] [Google Scholar]
  • 32.Wu Y-W, Simmons BA, Singer SW. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics. 2016;32:605–607. doi: 10.1093/bioinformatics/btv638. [DOI] [PubMed] [Google Scholar]
  • 33.Kang DD, Li F, Kirton E, Thomas A, Egan R, et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ. 2019;7:e7359. doi: 10.7717/peerj.7359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Bowers RM, Kyrpides NC, Stepanauskas R, Harmon-Smith M, Doud D, et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat Biotechnol. 2017;35:725–731. doi: 10.1038/nbt.3893. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun. 2018;9:5114. doi: 10.1038/s41467-018-07641-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Olm MR, Crits-Christoph A, Bouma-Gregson K, Firek BA, Morowitz MJ, et al. inStrain profiles population microdiversity from metagenomic data and sensitively detects shared microbial strains. Nat Biotechnol. 2021;39:727–736. doi: 10.1038/s41587-020-00797-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Moser CL, Meyer BK. Comparison of compendial antimicrobial effectiveness tests: a review. AAPS PharmSciTech. 2011;12:222–226. doi: 10.1208/s12249-010-9575-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Chiu CY, Miller SA. Clinical metagenomics. Nat Rev Genet. 2019;20:341–355. doi: 10.1038/s41576-019-0113-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chen L, Liu W, Zhang Q, Xu K, Ye G, et al. RNA based mNGS approach identifies a novel human coronavirus from two individual pneumonia cases in 2019 Wuhan outbreak. Emerg Microbes Infect. 2020;9:313–319. doi: 10.1080/22221751.2020.1725399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Mojarro A, Hachey J, Ruvkun G, Zuber MT, Carr CE. CarrierSeq: a sequence analysis workflow for low-input nanopore sequencing. BMC Bioinformatics. 2018;19:108. doi: 10.1186/s12859-018-2124-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Rodriguez-R LM, Konstantinidis KT. Estimating coverage in metagenomic data sets and why it matters. ISME J. 2014;8:2349–2351. doi: 10.1038/ismej.2014.76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Corby-Edwards AK. Cosmetics and FDA Regulation. 2013. FDA regulation of cosmetics and personal care products. [Google Scholar]
  • 47.Zeng D, Chen Z, Jiang Y, Xue F, Li B. Advances and Challenges in Viability Detection of Foodborne Pathogens. Front Microbiol. 2016;7:1833. doi: 10.3389/fmicb.2016.01833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Gaio D, Anantanawat K, To J, Liu M, Monahan L, et al. Hackflex: low-cost, high-throughput, Illumina Nextera Flex library construction. Microb Genom. 2022;8 doi: 10.1099/mgen.0.000744. Epub ahead of print January 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material 1
Supplementary material 2

Articles from Microbial Genomics are provided here courtesy of Microbiology Society

RESOURCES