Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2024 Nov 22;13(12):e00910-24. doi: 10.1128/mra.00910-24

Whole-genome sequences of fermentative and spoilage-associated lactic acid bacteria, Lysinibacillus capsici and a Serratia marcescens isolated from commercial cucumber fermentations

Clinton A Page 1, Sheron Simpson 2, Carol B M D’Souza 1, Ilenys M Pérez-Díaz 1,, Adam R Rivers 2
Editor: Leighton Pritchard3
PMCID: PMC11640280  PMID: 39576126

ABSTRACT

We report the annotated, whole-genome sequences of 16 bacteria isolated from commercial cucumber fermentations. The collection consists of nine fermentative and two spoilage-associated lactic acid bacteria, four Lysinibacillus strains, and a Serratia marcescens. Half the genome assemblies have fewer than 10 contigs, and the remainder have fewer than 30.

KEYWORDS: fermentation, food microbiology, Lactobacilli

ANNOUNCEMENT

We document genome sequences of Bacilli and a Serratia marcensens isolated from commercial cucumber or sauerkraut fermentations between 1985 and 2015 to augment understanding of the genetics, diversity, and functions hosted in such habitats. Table 1 describes the collection site and year of isolation for the pertaining bacterial cultures. The genome sequences described for Lactococcus lactis 1.8.12, Lactiplantibacillus pentosus MU045, Lactiplantibacillus plantarum 3.2.8, and Pediococcus ethanolidurans 7.8.34 are higher quality versions of previous submissions (1). Additional genome sequences include those for a Lactiplantibacillus paraplantarum, strain 1049, isolated from sauerkraut fermentation by Plengvidhya et al. (2), a Lentilactobacillus buchneri, strain LA1147, isolated from a spoiled commercial cucumber fermentation by Franco et al. (3), and five isolates from commercial cucumber fermentations obtained by Pérez-Díaz et al. (4). S. marcensens was associated with fermented cucumber bloater defect (5) and was isolated from a commercial cucumber fermentation by Pérez-Díaz et al. (6). The four genome sequences derived from Lysinibacillus are relevant to the instability of cucumber fermentation (7).

TABLE 1.

Accession numbers, assembly size, number of contigs, estimated coverage, read pairs, GC content, N50, and site of collection for the genome sequences described in this announcementa

Strain IDs NCBI accession number SRA accession number Assembly Size (bp) Contigs Estimated coverage (x) Reads Average raw read length GC % N50 (bp) Collection site Year of isolation
Lactiplantibacillus pentosus MU045b JAGYHB000000000 SRR30004943 3,918,757 15 343.29 206,977 6,599.9 45.87 743,007 Mount Olive,
NC
1985
Lactiplantibacillus paraplantarum 1049 JBFQYK000000000 SRR30005391 3,381,196 18 288.41 208,150 6,747.6 43.77 401,827 Raleigh, NC 2007
Lactiplantibacillus plantarum 3.2.8b JAGYGQ000000000 SRR30005648 3,474,346 19 307.15 161,957 6,654.4 44.22 538,455 Mount Olive,
NC
2009
Lactiplantibacillus plantarum 7.2.2 JBFRXW000000000 SRR30005937 3,386,939 20 499.998 248,256 6,862.1 44.35 703,285 Mount Olive,
NC
2009
Lactococcus lactis 1.8.12b JAGXCF000000000 SRR30004724 2,638,785 6 288.43 142,637 7,065 34.89 669,663 Mount Olive,
NC
2009
Pediococcus pentosaceus 7.2.5 JBFRXV000000000 SRR30006351 1,901,884 15 658.8 214,163 7,048.7 37.2 294,304 Chaska, MN 2009
Weissella cibaria 7.8.34b JBFWFW000000000 SRR30004637 2,499,177 7 430.154 192,976 5,587.6 44.81 778,684 Chaska, MN 2009
Weissella cibaria 7.8.48 JBFSEU000000000 SRR30006352 2,465,329 9 525.509 211,757 7,375.2 44.85 627,387 Chaska, MN 2009
Weissella paramesenteroides 3.8.13 JBFSEV000000000 SRR30151572 1,946,180 9 576.422 164,515 7,055.6 38.16 573,469 Mount Olive,
NC
2009
Lentilactobacillus buchneri LA1147 JBFYGS000000000 SRR30151571 2,712,685 29 405.195 194,664 6,397.1 44.07 334,936 Mount Olive,
NC
2010
Lentilactobacillus buchneri 90.8.10 JBFPKA000000000 SRR29918579 2,760,920 17 288.37 113,900 7,051.1 43.92 716,767 Mount Olive,
NC
2009
Lysinibacillus capsici Lys06 JBFRXU000000000 SRR30006354 4,676,048 6 304.616 209,238 6,811.7 37.56 1,387,322 Mount Olive,
NC
2015
Lysinibacillus capsici Lys16 JBGGIA010000000 SRR30005219 4,639,468 8 340.349 221,361 7,156.5 37.53 1,191,594 Mount Olive,
NC
2015
Lysinibacillus capsici Lys19 JBFRXT000000000 SRR30006353 4,644,541 6 444.197 299,160 6,906.2 37.54 4,316,120 Mount Olive,
NC
2015
Lysinibacillus capsici Lys21 JBFRYD000000000 SRR30151570 4,709,305 11 323.559 212,901 7,175.9 37.35 1,054,235 Mount Olive,
NC
2015
Serratia marcescens 1.8.3 CP163407 SRR30151569 5,241,778 1 295.691 226,499 6,848 59.4 5,241,778 Mount Olive,
NC
2009
a

Sites of sample collection are abbreviated as NC and MN for North Carolina and Minnesota, USA, respectively.

b

An asterisk (*) after the strain identification on the first column identifies isolates with previously published genome sequences of lower quality (1).

Pure cultures were transferred from frozen stocks to Lactobacilli deMan, Rogosa, and Sharpe (MRS) broth to obtain a pellet for DNA extraction, while the Lysinibacillus and S. marcensens were transferred to brain heart infusion (BHI) broth. Lysinibacillus was incubated with shaking at 23°C for 48 h. All other cultures were incubated without shaking and aerobically at 30°C in the media described above for 48 h. The Promega Wizard Genomic DNA Extraction Kit (Madison, WI) was used to lyse cells. The cell lysates were treated with 25:24:1 phenol:chloroform:isoamyl alcohol followed by ethanol precipitation (8). A second protein precipitation was performed as needed using the kit described above. The extracted DNA was used for sequencing performed as previously described (9). Briefly, the genomic DNA was sheared using Covaris G-tubes (Woburn, MA) targeting 10 kb fragments. Sheared DNA was prepared for PacBio sequencing using the SMRTbell Prep Kit 3.0 (PacBio, Menlo Park, CA). Samples were barcoded, and libraries were size selected with AMpure PB beads (Pacific Biosciences, Menlo Park, CA) to remove fragments less than 3 kb. Sequencing was performed on a Revio System (Pacific Biosciences, Menlo Park, CA) using the corresponding Polymerase Kit, Sequencing Reagents, and SMRTcell. To target HiFi reads, the library was sequenced using a 30 h movie time using Instrument Control Software Version 13. Raw subreads were converted to HiFi data by processing with circular consensus sequences (CCS) to call a single high-quality consensus sequence for each molecule, using a 99.5% consensus accuracy cutoff. De novo assembly was performed in BV-BRC v. 330.19a (10) via Metaflye v. 2.9.1-b1780 (11) with a minimum contig cut-off of 300 nucleotides. Quality assessment of assemblies was performed with QUAST version 5.0.2 (12), and SamTools version 13 (13). The closest reference genomes were identified with minhash search by Mash (14) using the PATRIC database (15) within BV-BRC. Upon submission to GenBank (Bioproject PRJNA674638), assemblies were reannotated using the NCBI Prokaryotic Genome Annotation Pipeline v. 6.7 (16). Default parameters were used for all software, unless otherwise specified.

ACKNOWLEDGMENTS

We thank Sandra Parker, and Research Leaders, Dr. Muquarrab Qureshi and Dr. Brian Scheffler, with the United States Department of Agriculture-Agricultural Research Service (USDA-ARS) for their administrative efforts to facilitate this collaborative research. We also thank Linda Ballard and Dr. Cal Youngblood from USDA-ARS Genomics and Bioinformatics Research Unit for their assistance in facilitating SCINet downloads. This research was funded by the United States Department of Agriculture-Agricultural Research Service-Food Science & Market Quality and Handling Research Unit located in Raleigh, North Carolina, USA under National Program Project Plan no. 6070–41000-010-00D.

Mention of a trademark or proprietary product does not constitute a guarantee or warranty of the product by the US Department of Agriculture or North Carolina Agricultural Research Service, nor does it imply approval to the exclusion of other products that may be suitable. USDA is an equal opportunity provider and employer.

Contributor Information

Ilenys M. Pérez-Díaz, Email: ilenys.perez-diaz@usda.gov.

Leighton Pritchard, University of Strathclyde, Glasgow, United Kingdom.

DATA AVAILABILITY

Table 1 describes strain identification and accession numbers for each genome annotation and sequence read archive entry.

REFERENCES

  • 1. Page CA, Pérez-Díaz IM. 2021. Whole-genome sequencing and annotation of selected Lactobacillales isolated from commercial cucumber fermentation. Microbiol Resour Announc 10:e00625–21. doi: 10.1128/MRA.00625-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Plengvidhya V, Breidt F, Lu Z, Fleming HP. 2007. DNA fingerprinting of lactic acid bacteria in sauerkraut fermentations. Appl Environ Microbiol 73:7697–7702. doi: 10.1128/AEM.01342-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Franco W, Pérez-Díaz IM, Johanningsmeier SD, McFeeters RF. 2012. Characteristics of spoilage-associated secondary cucumber fermentation. Appl Environ Microbiol 78:1273–1284. doi: 10.1128/AEM.06605-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Pérez-Díaz IM, Hayes J, Medina E, Anekella K, Daughtry K, Dieck S, Levi M, Price R, Butz N, Lu Z, Azcarate-Peril MA. 2017. Reassessment of the succession of lactic acid bacteria in commercial cucumber fermentations and physiological and genomic features associated with their dominance. Food Microbiol 63:217–227. doi: 10.1016/j.fm.2016.11.025 [DOI] [PubMed] [Google Scholar]
  • 5. Zhai Y, Pagán-Medina CG, Pérez-Díaz IM. 2023. CO2-mediated bloater defect can be induced by the uncontrolled growth of Enterobacteriaceae in cucumber fermentation. Food Sci Nutr:7177. doi: 10.1002/fsn3.3557 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Pérez-Díaz IM, Hayes JS, Medina E, Webber AM, Butz N, Dickey AN, Lu Z, Azcarate-Peril MA. 2019. Assessment of the non-lactic acid bacteria microbiota in fresh cucumbers and commercially fermented cucumber pickles brined with 6% NaCl. Food Microbiol 77:10–20. doi: 10.1016/j.fm.2018.08.003 [DOI] [PubMed] [Google Scholar]
  • 7. Medina E, Pérez-Díaz IM, Breidt F, Hayes J, Franco W, Butz N, Azcarate-Peril MA. 2016. Bacterial ecology of fermented cucumber rising ph spoilage as determined by nonculture-based methods. J Food Sci 81:M121–9. doi: 10.1111/1750-3841.13158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Thermo-Fisher technical reference library . 2023. How to use phenol/ chloroform for DNA purification. Available from: https://www. thermofisher.com/us/en/home/references/protocols/nucleic-acid- purification-and-analysis/dna-extraction-protocols/phenol-chloroform- extraction.html
  • 9. Page CA, Simpson SA, Pérez-Díaz IM, Rivers AR. 2024. Annotated whole-genome sequences of fermentative and spoilage associated Bacilli and proteobacteria autochthonous to commercial cucumber fermentation. Microbiol Resour Announc 13:e0092623. doi: 10.1128/mra.00926-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Olson RD, Assaf R, Brettin T, Conrad N, Cucinell C, Davis JJ, Dempsey DM, Dickerman A, Dietrich EM, Kenyon RW, et al. 2023. Introducing the bacterial and viral bioinformatics resource center (BV-BRC): a resource combining PATRIC, IRD and ViPR. Nucleic Acids Res 51:D678–D689. doi: 10.1093/nar/gkac1003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kolmogorov M, Bickhart DM, Behsaz B, Gurevich A, Rayko M, Shin SB, Kuhn K, Yuan J, Polevikov E, Smith TPL, Pevzner PA. 2020. metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat Methods 17:1103–1110. doi: 10.1038/s41592-020-00971-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. 2021. Twelve years of SAMtools and BCFtools. Gigascience 10:giab008. doi: 10.1093/gigascience/giab008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, Koren S, Phillippy AM. 2016. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol 17:132. doi: 10.1186/s13059-016-0997-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Gillespie JJ, Wattam AR, Cammer SA, Gabbard JL, Shukla MP, Dalay O, Driscoll T, Hix D, Mane SP, Mao C, Nordberg EK, Scott M, Schulman JR, Snyder EE, Sullivan DE, Wang C, Warren A, Williams KP, Xue T, Seung Yoo H, Zhang C, Zhang Y, Will R, Kenyon RW, Sobral BW. 2011. PATRIC: the comprehensive bacterial bioinformatics resource with a focus on human pathogenic species. Infect Immun 79:4286–4298. doi: 10.1128/IAI.00207-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Table 1 describes strain identification and accession numbers for each genome annotation and sequence read archive entry.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES