ABSTRACT
We report 36 whole-genome sequences, along with annotations, of fermentative (n = 12) and spoilage associated (n = 6) lactic acid bacteria, Lysinibacillus (n = 3), Streptococcus (n = 1), and Proteobacteria (n = 14) isolated from commercial cucumber fermentations. Fifty-three percent of the genome sequence assemblies consist of 1–4 contigs, and the remainder have fewer than 16.
KEYWORDS: fermentation, food microbiology
ANNOUNCEMENT
We document genome sequences of multiple Bacilli and Proteobacteria isolated from commercial cucumber fermentation. Nine genome sequences are higher quality versions of previous submissions (1). Three genome sequences represent the nisin producer Lactococcus lactis (2) and putative bacteriocin producing Pediococcus pentosaceus and Leuconostoc mesenteroides strains (3). The remainder are Proteobacteria (n = 14) associated with fermented cucumber bloater defect (4) and Bacilli (n = 9) relevant to the instability of cucumber fermentation (5).
Table 1 describes the collection site of the isolates. Most fermentative lactic acid bacteria were collected, classified, and identified in surveys of commercial cucumber fermentations conducted between 2009 and 2010 as described by Pérez-Díaz et al. (6). LA0445 was derived from anaerobic fermentation of cucumbers (7). The isolations of Lactococcus lactis NCK401 (2) and Pediococcus pentosaceus LA0061 (previously identified as P. cerevisiae L-7230) (3) were conducted in Raleigh, NC, USA from cover brine samples derived from undefined sources. Leuconostoc mesenteroides FFL48 (previously identified as P. cerevisiae FBB-61) was collected and isolated in Marshall, MI, USA (8). The Proteobacteria and spoilage associated lactic acid bacteria included here were isolated as described by Pérez-Díaz et al. (6) and Pérez-Díaz et al. (4), respectively. Lysinibacillus strains were isolated from spoiled commercial cucumber fermentations in Mt. Olive, NC, USA as described by Medina et al. (5).
TABLE 1.
Accession numbers and genome statistics for isolates described in this announcementa
| Strain IDs | NCBI accession number | SRA accession number | Assembly size (bp) | Contigs | Estimated coverage (×) | Reads | Average raw read length | GC % | N50 (bp) | Collection site |
|---|---|---|---|---|---|---|---|---|---|---|
| Lactiplantibacillus pentosus | ||||||||||
| 7.2.15 | JAVLAU000000000 | SRR25916667 | 3,919,042 | 8 | 157.542 | 63,240 | 9,772 | 45.93 | 3,645,019 | Mount Olive, NC |
| LA0445 | JAHLEN000000000.2 | SRR25916659 | 3,945,202 | 11 | 71.017 | 42,596 | 6,588 | 45.84 | 3,580,249 | Mount Olive, NC |
| 1.8.18 | JAVLAN000000000 | SRR25916663 | 3,897,544 | 15 | 108.48 | 47,598 | 8,910 | 45.82 | 2,062,758 | Mount Olive, NC |
| 1.8.9 | JAVLAO000000000 | SRR25916661 | 3,734,812 | 12 | 90.879 | 50,246 | 6,761 | 45.92 | 3,419,212 | Mount Olive, NC |
| 14.8.42 | JAVGXB000000000 | SRR25916660 | 4,050,922 | 16 | 62.214 | 32,012 | 7,882 | 45.78 | 789,853 | Mount Olive, NC |
| 7.8.46 | JAVLAQ000000000 | SRR25916658 | 3,966,095 | 12 | 106.639 | 46,231 | 9,160 | 45.73 | 2,973,648 | Mount Olive, NC |
| 7.8.2 | CP134788-CP134793 | SRR26196798 | 3,919,042 | 8 | 157.542 | 63,240 | 9,772 | 45.93 | 3,645,019 | Chaska, MN |
| Lactiplantibacillus plantarum | ||||||||||
| 7.8.4 | JAVLAR000000000 | SRR25916657 | 3,490,038 | 15 | 44.239 | 23,524 | 6,609 | 44.23 | 2,712,281 | Mount Olive, NC |
| Lactococcus lactis | ||||||||||
| NCK401 | CP137623 | SRR26043614 | 2,551,975 | 1 | 75.178 | 25,782 | 7,450 | 34.86 | 2,551,975 | Raleigh, NC |
| LA0312 | CP134164 | SRR25916659 | 2,563,933 | 1 | 172.327 | 49,924 | 8,857 | 34.86 | 2,563,933 | Mount Olive, NC |
| Pediococcus pentosaceus | ||||||||||
| LA0061 | CP137627-CP137629 | SRR26043613 | 1,945,268 | 3 | 250.134 | 51,502 | 9,455 | 37.19 | 1,847,864 | Raleigh, NC |
| Leuconostoc mesenteroides | ||||||||||
| FFL48 | CP137625-CP137626 | SRR26043612 | 1,816,909 | 2 | 252.299 | 53,393 | 8,589 | 38.02 | 1,799,974 | Marshall, MI |
| Companilactobacillus alimentarius | ||||||||||
| 7.2.6 | JAVKYT000000000 | SRR25916668 | 2,665,037 | 10 | 192.679 | 56,034 | 9,171 | 35.5 | 1,641,583 | Mount Olive, NC |
| Levilactobacillus namurensis | ||||||||||
| 1.2.9 | JAVLAL000000000 | SRR25916666 | 2,932,609 | 10 | 173.356 | 64,197 | 7,925 | 50.77 | 2,656,307 | Mount Olive, NC |
| 3.8.38 | JAVLAM000000000 | SRR25916665 | 2,983,047 | 14 | 152.717 | 57,348 | 7,938 | 50.68 | 2,566,380 | Mount Olive, NC |
| 90.8.26 | CP134159-CP134163 | SRR25916664 | 2,776,426 | 5 | 196.758 | 59,866 | 9,290 | 51.4 | 2,674,724 | Mount Olive, NC |
| Latilactobacillus zymae | ||||||||||
| 90.8.36 | JAVLAS000000000 | SRR25916656 | 2,892,482 | 8 | 170.533 | 56,995 | 8,701 | 51.49 | 2,645,371 | Mount Olive, NC |
| Latilactobacillus curvatus | ||||||||||
| 3.8.43 | JAVLAK000000000 | SRR25916651 | 2,056,476 | 4 | 237.549 | 62,061 | 7,860 | 42.03 | 1,616,338 | Chaska, MN |
| Lysinibacillus capsici | ||||||||||
| Lys1 | CP137622 | SRR26060012 | 4,682,737 | 1 | 94.303 | 49,304 | 8,969 | 37.58 | 4,682,737 | Mount Olive, NC |
| Lys24 | CP134502 | SRR26060010 | 4,687,025 | 1 | 87.806 | 48,098 | 8,564 | 37.62 | 4,687,025 | Mount Olive, NC |
| Lysinibacillus louembei | ||||||||||
| Ll15 | CP137624S | SRR26043611 | 3,960,254 | 1 | 93.807 | 45,092 | 8,245 | 38.53 | 3,960,254 | Mount Olive, NC |
| Streptococcus parasanguinis | ||||||||||
| 30.8.10 | CP134147-CP134148 | SRR25916648 | 2,128,578 | 2 | 199.9238186 | 51,124 | 8,334 | 42.01 | 2,118,583 | Mount Olive, NC |
| Enterobacter cancerogenus | ||||||||||
| 3.2.13 | JAVLAJ000000000 | SRR25916674 | 4,902,923 | 4 | 72.304 | 53,067 | 6,685 | 55.64 | 4,466,362 | Chaska, MN |
| 3.2.17 | CP134403-CP134405 | SRR25916652 | 4,882,814 | 3 | 131.134 | 68,845 | 9,309 | 55.63 | 4,668,754 | Chaska, MN |
| Enterobacter hormaechei | ||||||||||
| 1.2.3 | CP134406-CP134411 | SRR25916671 | 3,862,785 | 6 | 121.153 | 51,423 | 9,112 | 46.03 | 3,635,834 | Mount Olive, NC |
| 3.2.8 | JAVLAI000000000 | SRR25916670 | 5,031,271 | 6 | 68.791 | 52,737 | 6,568 | 55.41 | 3,591,916 | Mount Olive, NC |
| Kluyvera cryocrescens | ||||||||||
| 1.8.5 |
CP134165 | SRR25916669 | 4,849,988 | 1 | 120.883 | 60,604 | 9,685 | 53.92 | 4,849,988 | Mount Olive, NC |
| Leclercia pneumoniae | ||||||||||
| 1.2.7 | CP134503-CP134506 | SRR26196799 | 4,932,015 | 4 | 129.539 | 68,633 | 9,317 | 54.49 | 4,521,154 | Mount Olive, NC |
| Pantoea agglomerans | ||||||||||
| 1.2.4 | CP134149-CP134151 | SRR25916650 | 4,870,469 | 3 | 125.758 | 69,211 | 8,857 | 55.14 | 4,084,927 | Mount Olive, NC |
| Pseudomonas putida | ||||||||||
| 1.8.4 | CP137621 | SRR26043609 | 5,946,927 | 1 | 124.581 | 80,233 | 9,246 | 61.9 | 5,946,927 | Chaska, MN |
| Pseudomonas inefficax | ||||||||||
| 7.8.24 | CP134401 | SRR25916649 | 5,927,592 | 1 | 113.28 | 73,885 | 9,099 | 62.78 | 5,927,592 | Chaska, MN |
| Erwinia toletana | ||||||||||
| 1.2.20 | CP134152-CP134153 | SRR25916662 | 5,014,558 | 2 | 117.097 | 67,679 | 8,696 | 52.22 | 4,923,789 | Chaska, MN |
| Siccibacter colletis | ||||||||||
| 3.2.4 | CP134402 | SRR25916654 | 4,158,854 | 1 | 135.985 | 67,089 | 8,438 | 57.41 | 4,158,854 | Chaska, MN |
| Hafnia alvei | ||||||||||
| 3.8.4 | CP134154-CP134158 | SRR25916653 | 4,855,803 | 5 | 109.27 | 56,367 | 9,421 | 48.81 | 4,631,589 | Chaska, MN |
| Stenotrophomonas indicatrix | ||||||||||
| 7.2.7 | JAVYIJ000000000 | SRR26060011 | 4,573,902 | 9 | 108.262 | 72,052 | 6,878 | 66.29 | 2,356,763 | Chaska, MN |
| Brucella pseudogrignonensis | ||||||||||
| 1.2.1 | JAVLAT000000000 | SRR25916655 | 4,453,607 | 7 | 130.227 | 63,658 | 9,118 | 51.87 | 2,323,104 | Chaska, MN |
sites of sample collection are abbreviated as NC, MI, and MN for North Carolina, Michigan, and Minnesota, USA, respectively.
Pure cultures of lactic acid bacteria and bacilli were transferred from frozen stocks to Lactobacilli deMan, Rogosa, and Sharpe (MRS) broth, while proteobacteria were transferred to Brain Heart Infusion (BHI) broth and incubated statically and aerobically at 30°C prior to DNA extraction. The Promega Wizard Genomic DNA Extraction Kit (Madison, WI) was used to lyse cells and precipitate proteins. To produce a higher DNA yield than possible with the standard kit protocol, cell lysates were treated with 25:24:1 phenol:chloroform:isoamyl alcohol followed by ethanol precipitation (9).
Genomic DNA was sheared using Covaris G-tubes (Woburn, MA) targeting 10 kb fragments. Sheared DNA was prepared for PacBio sequencing using the SMRTbell Prep Kit 3.0 (PacBio, Menlo Park, CA). Samples were barcoded and libraries were size selected with AMpure PB beads (Pacific Biosciences, Menlo Park, CA) to remove fragments less than 3 kb. Sequencing was performed on a Sequel IIe System (Pacific Biosciences, Menlo Park, CA) using Binding Kit 3.2, Sequel II Sequencing Kit 2.0 and SMRTCell 8M. To target HiFi reads, the library was sequenced using a 30-hour movie time using Instrument Control Software Version 11. Raw subreads were converted to HiFi data by processing with CCS to call a single high-quality consensus sequence for each molecule, using a 99.5% consensus accuracy cutoff. De novo assembly was performed in BV-BRC v. 330.19a (10) via Unicycler version 0.4.8 (11) with a minimum contig cut-off of 300. Quality assessment of assemblies was performed with QUAST version 5.0.2 (12), SamTools version 13 (13), and Pilon version 1.23 (14). Closest reference genomes were identified by Mash/MinHash employing the PATRIC database (15). Upon submission to GenBank (Bioproject PRJNA674638), assemblies were reannotated using the NCBI Prokaryotic Genome Annotation Pipeline v. 6.5 (16). Default parameters for the software were used.
ACKNOWLEDGMENTS
We thank Sandra Parker and Research Leaders Drs. Muquarrab Qureshi and Brian Scheffler with the United States Department of Agriculture (USDA) - Agricultural Research Service for their administrative efforts to facilitate this collaborative research. We also thank Linda Ballard, IT specialist, with the USDA-ARS Genomics and Bioinformatics Research Unit in Stoneville, MS, USA for her assistance with SCINet downloads. We thank Dr. Michael Taveirne, Associate Teaching Professor in the Department of Biological Sciences, and Dr. Sarah O’Flaherty, Research Scholar in the Department of Food, Bioprocessing and Nutrition Sciences, both at North Carolina State University for reviewing the manuscript.
This work was performed in part at the Analytical Instrumentation Facility (AIF) at North Carolina State University, which is supported by the State of North Carolina and the National Science Foundation (award number ECCS-2025064). This work made use of instrumentation at AIF acquired with support from the National Science Foundation (DMR-1726294). The AIF is a member of the North Carolina Research Triangle Nanotechnology Network (RTNN), a site in the National Nanotechnology Coordinated Infrastructure (NNCI).
Contributor Information
Ilenys M. Pérez-Díaz, Email: ilenys.perez-diaz@usda.gov.
J. Cameron Thrash, University of Southern California, USA.
DATA AVAILABILITY
Strain identification and accession numbers for each genome annotation and sequence read archive are included in Table 1.
REFERENCES
- 1. Page CA, Pérez-Díaz IM. 2021. Whole-genome sequencing and annotation of selected Lactobacillales isolated from commercial cucumber fermentation. Microbiol Resour Announc 10:e00625-21. doi: 10.1128/MRA.00625-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Harris LJ, Fleming HP, Klaenhammer TR. 1992. Characterization of two nisin-producing Lactococcus lactis subsp. lactis strains isolated from commercial sauerkraut fermentation. Appl Environ Microbiol 58:1477–1483. doi: 10.1128/aem.58.5.1477-1483.1992 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Fleming HP, Etchells JL, Costilow RN. 1975. Microbial inhibition by an isolate of Pediococcus from cucumber brines. Appl Microbiol 30:1040–1042. doi: 10.1128/am.30.6.1040-1042.1975 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Zhai Y, Pagán-Medina CG, Pérez-Díaz IM. 2023. CO2-mediated bloater defect can be induced by the uncontrolled growth of Enterobacteriaceae in cucumber fermentation. Food Sci Nutr 2048:7177. doi: 10.1002/fsn3.3557 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Medina E, Pérez-Díaz IM, Breidt F, Hayes J, Franco W, Butz N, Azcarate-Peril MA. 2016. Bacterial ecology of fermented cucumber rising pH spoilage as determined by non-culture-based methods. J Food Sci 81:M121–MI29. doi: 10.1111/1750-3841.13158 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Pérez-Díaz IM, Hayes JS, Medina E, Webber AM, Butz N, Dickey AN, Lu Z, Azcarate-Peril MA. 2019. Assessment of the non-lactic acid bacteria microbiota in fresh cucumbers and commercially fermented cucumber pickles brined with 6% NaCl. Food Microbiol 77:10–20. doi: 10.1016/j.fm.2018.08.003 [DOI] [PubMed] [Google Scholar]
- 7. Fleming HP, McFeeterS RF, Daeschel MA, Humphries EG, Thompson RL. 1988. Fermentation of cucumbers in anaerobic tanks. J Food Science 53:127–133. doi: 10.1111/j.1365-2621.1988.tb10192.x [DOI] [Google Scholar]
- 8. Costilow RN, Coughlin FM, Robach DL, Ragheb HS. 1956. A study of the acid-forming bacteria from cucumber fermentation in Michigan. J Food Scie 21:27–33. doi: 10.1111/j.1365-2621.1956.tb16888.x [DOI] [Google Scholar]
- 9. Thermo-Fisher technical reference library 2023. how to use phenol/chloroform for DNA purification. 2023. Available from: https://www.thermofisher.com/us/en/home/references/protocols/nucleic-acid-purification-and-analysis/dna-extraction-protocols/phenol-chloroform-extraction.html
- 10. Olson RD, Assaf R, Brettin T, Conrad N, Cucinell C, Davis JJ, Dempsey DM, Dickerman A, Dietrich EM, Kenyon RW, et al. 2022. Introducing the bacterial and viral bioinformatics resource center (BV-BRC): a resource combining PATRIC, IRD and ViPR. Nucleic Acids Research 51:D678–D689. doi: 10.1093/nar/gkac1003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. 2021. Twelve years of SAMtools and BCFtools. Gigascience 10:giab008. doi: 10.1093/gigascience/giab008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM, Wang J. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9:e112963. doi: 10.1371/journal.pone.0112963 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, Koren S, Phillippy AM. 2016. Mash: fast genome and metagenome distance estimation using minhash. Genome Biol 17:132. doi: 10.1186/s13059-016-0997-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Strain identification and accession numbers for each genome annotation and sequence read archive are included in Table 1.
