ABSTRACT
We report the whole-genome sequences, along with annotations, of five Pediococcus ethanolidurans and three Pediococcus pentosaceus isolates from commercial cucumber fermentations performed in North Carolina (n = 3) and Minnesota (n = 5), USA.
ANNOUNCEMENT
Pediococcus ethanolidurans and Pediococcus pentosaceus are lactic acid bacteria species relevant in the production of fermented foods. P. ethanolidurans is associated with spoilage in both olive fermentation (1) and in the production of cider (2) but is also important for the development of flavors in pickled radishes (3). P. pentosaceus is a significant organism for sauerkraut production (4). Both organisms were detected in unspoiled cucumber fermentations conducted in 2009 and 2010 in Mount Olive, NC, and Chaska, MN, respectively.
Pediococcus isolates were collected on days 1, 3, 7, 14, and 30 of commercial cucumber fermentations at depths of 2 and 8 ft from the brine surface in the collection tank (5). The isolation day and depth are incorporated into the genome sequence identification numbers. For example, genome sequence 14.2.45 was generated from a single colonial isolate (number 45) plated from fermentation brine collected on day 14 at a depth of 2 ft.
The isolates were derived from cucumber fermentation brine by spiral plating on Lactobacillus de Man Rogosa and Sharpe agar (MRS) supplemented with 0.0001% cycloheximide solution and incubated at 30°C (5). Anaerobic growth conditions were maintained via the BD GasPak EZ system (Franklin Lakes, NJ), which maintained an anaerobic atmosphere with ≥10% carbon dioxide. The isolated colonies were streaked onto MRS agar prior to preparing frozen stocks in MRS broth supplemented with 1.5% glycerol. Pure cultures were generated on MRS broth from the frozen stocks for DNA extraction. The MRS broth was incubated statically at 30°C for 24 h under aerobic conditions. DNA extraction was conducted using the Promega Wizard genomic DNA extraction kit (Madison, WI). The bacterial isolates had been previously identified using partial 16S rRNA gene sequences, as described by Pérez-Díaz and others (5). CosmosID (Rockville, MD) prepared libraries for Illumina reads using the Illumina Nextera XT kit and assessed the libraries for quality using Qubit (Thermo Fisher, Waltham, MA), prior to whole-genome sequencing. The samples were sequenced on an Illumina NextSeq 550 platform (San Diego, CA), producing paired-end reads at a maximum length of 150 bases. The estimated average raw read lengths are reported in Table 1, based on back calculation from the genome coverage (6). CosmosID also performed raw sequence data trimming for adapters and low-quality bases using BBDuk (https://sourceforge.net/projects/bbmap) in the spring of 2022, applying standard parameters (phred quality trimq = 22, and minimum length minlen = 36).
TABLE 1.
Accession numbers and genome statistics from P. ethanolidurans and P. pentosaceus isolates
| Species | Isolate ID | GenBank accession no. | SRA accession no. | Assembly size (bp) | No. of contigs | Estimated coverage (×) | No. of read pairs | Avg raw read length (bp) | GC content (%) | N50 (bp) | Collection site (U.S. state) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Pediococcus ethanolidurans | 14.2.5 | JANJYK000000000 | SRR20829250 | 2,148,770 | 175 | 442.575 | 3,525,283 | 134.8 | 37.45 | 33,153 | NC |
| 3.8.1 | JANQAL000000000 | SRR20829249 | 2,208,096 | 164 | 360.416 | 2,906,518 | 136.9 | 37.34 | 34,310 | NC | |
| 1.2.28 | JANQAM000000000 | SRR20829248 | 2,170,406 | 166 | 433.77 | 3,601,207 | 130.7 | 37.46 | 31,828 | NC | |
| 14.2.45 | JANJPN000000000 | SRR20829251 | 2,163,868 | 167 | 277.284 | 2,164,405 | 138.6 | 37.39 | 33,670 | MN | |
| 30.2.11 | JANQCL010000000 | SRR21954066 | 2,129,602 | 162 | 409.327 | 3,261,290 | 133.6 | 37.42 | 37,022 | MN | |
| Pediococcus pentosaceus | 7.2.43 | JANKYZ000000000 | SRR20964832 | 1,853,217 | 24 | 549.323 | 3,780,940 | 134.6 | 37.2 | 295,794 | MN |
| 3.2.19 | JANKZA000000000 | SRR20964831 | 1,854,702 | 20 | 428.985 | 2,826,910 | 140.7 | 37.16 | 375,985 | MN | |
| 7.2.36 | JANKZB000000000 | SRR20964830 | 1,881,615 | 35 | 371.876 | 2,506,629 | 139.5 | 37.17 | 286,823 | MN |
The genome sequences were assembled and annotated using the software described below with default parameters unless otherwise specified. De novo assembly was performed using PATRIC v. p3-build-178 (7) via Unicycler v. 0.4.8 (8), with a minimum contig cutoff of 300. Quality assessment of the assemblies was performed using QUAST v. 5.0.2 (9), SAMtools v. 13 (10), and Pilon v. 1.23 (11). The closest reference genomes were identified using Mash/MinHash with the PATRIC database (12). Upon submission to GenBank, the assemblies were annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v. 2022-04-14.build6021 (13).
Data availability.
The assemblies have been submitted to GenBank under BioProject accession number PRJNA674638. Strain identification and accession numbers for each genome annotation are listed in Table 1.
ACKNOWLEDGMENTS
We thank Echo Pan, a graduate student in genetics and genomics at North Carolina State University, for reviewing this article.
Mention of a trademark or proprietary product does not constitute a guarantee or warranty of the product by the U.S. Department of Agriculture or North Carolina Agricultural Research Service, nor does it imply approval to the exclusion of other products that may be suitable.
Contributor Information
Ilenys M. Pérez-Díaz, Email: ilenys.perez-diaz@usda.gov.
J. Cameron Thrash, University of Southern California.
REFERENCES
- 1.Penland M, Mounier J, Pawtowski A, Tréguer S, Deutsch S-M, Coton M. 2021. Use of metabarcoding and source tracking to identify desirable or spoilage autochthonous microorganism sources during black olive fermentations. Food Res Int 144:110344. doi: 10.1016/j.foodres.2021.110344. [DOI] [PubMed] [Google Scholar]
- 2.Llamas-Arriba MG, Pérez-Ramos A, Puertas AI, López P, Dueñas MT, Prieto A. 2018. Characterization of Pediococcus ethanolidurans CUPV141: a β-D-glucan- and heteropolysaccharide-producing bacterium. Front Microbiol 9:2041. doi: 10.3389/fmicb.2018.02041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Yun L, Mao B, Cui S, Tang X, Zhang H, Zhao J, Chen W. 2021. Gas chromatography-mass spectrometry-based metabolomics analysis of metabolites in commercial and inoculated pickles. J Sci Food Agric 101:1436–1446. doi: 10.1002/jsfa.10757. [DOI] [PubMed] [Google Scholar]
- 4.Plengvidhya V, Breidt F, Lu Z, Fleming HP. 2007. DNA fingerprinting of lactic acid bacteria in sauerkraut fermentations. Appl Environ Microbiol 73:7697–7702. doi: 10.1128/AEM.01342-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pérez-Díaz IM, Hayes J, Medina E, Anekella K, Daughtry K, Dieck S, Levi M, Price R, Butz N, Lu Z, Azcarate-Peril MA. 2017. Reassessment of the succession of lactic acid bacteria in commercial cucumber fermentations and physiological and genomic features associated with their dominance. Food Microbiol 63:217–227. doi: 10.1016/j.fm.2016.11.025. [DOI] [PubMed] [Google Scholar]
- 6.Sims D, Sudbery I, Ilott NE, Heger A, Ponting CP. 2014. Sequencing depth and coverage: key considerations in genomic analyses. Nat Rev Genet 15:121–132. doi: 10.1038/nrg3642. [DOI] [PubMed] [Google Scholar]
- 7.Davis JJ, Wattam AR, Aziz RK, Brettin T, Butler R, Butler RM, Chlenski P, Conrad N, Dickerman A, Dietrich EM, Gabbard JL, Gerdes S, Guard A, Kenyon RW, Machi D, Mao C, Murphy-Olson D, Nguyen M, Nordberg EK, Olsen GJ, Olson RD, Overbeek JC, Overbeek R, Parrello B, Pusch GD, Shukla M, Thomas C, VanOeffelen M, Vonstein V, Warren AS, Xia F, Xie D, Yoo H, Stevens R. 2020. The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities. Nucleic Acids Res 48:D606–D612. doi: 10.1093/nar/gkz943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. 2021. Twelve years of SAMtools and BCFtools. Gigascience 10:giab008. doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, Koren S, Phillippy AM. 2016. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol 17:132. doi: 10.1186/s13059-016-0997-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The assemblies have been submitted to GenBank under BioProject accession number PRJNA674638. Strain identification and accession numbers for each genome annotation are listed in Table 1.
