ABSTRACT
Here, we report the complete genome sequence of Levilactobacillus acidifarinae type strain JCM 15949 (DSM 19394), which was isolated from a Belgian artisanal wheat sourdough. The genome consisted of a circular chromosome (2,915,962 bp, 51.71% GC content) and a circular plasmid (30,910 bp, 39.78% GC content).
KEYWORDS: lactic acid bacteria, sourdough, Levilactobacillus, Nanopore, long read, complete genome
ANNOUNCEMENT
Levilactobacillus are free-living heterofermentative lactic acid bacteria (1). The genome sequence of the Levilactobacillus acidifarinae type strain isolated from a Belgian artisanal wheat sourdough (2) was reported using strain DSM 19394 (3) as a draft sequence. This study reports the complete genome sequence of L. acidifarinae type strain JCM 15949.
A frozen glycerol stock of the strain JCM 15949 was thawed and purified on a De Man–Rogosa–Sharpe (MRS) agar plate at 30°C. A single colony grown on the plate was picked up and inoculated to MRS broth. The broth cultured cells were harvested, washed, and suspended in TESS solution (25 mM Tris-HCl, 5 mM EDTA, 50 mM NaCl, 25% sucrose, pH 7.5). Genomic DNA was then extracted using a previously described method with some modifications (4–6). The cell suspension was incubated at 37°C with lysozyme and RNase. It was further incubated at 50°C with 10% sodium dodecyl sulfate and proteinase K. Genomic DNA was extracted using phenol-chloroform-isoamyl alcohol with 6 M NaCl, followed by purification with chloroform-isoamyl alcohol. Purified DNA was precipitated with 2-propanol at –20°C and then centrifuged. The resulting precipitate was rinsed with 90% ethanol, dried, and dissolved in Milli-Q water. The DNA was concentrated using a DNA Clean and Concentrator Kit (Zymo Research), which was quantified using a Qubit fluorometer. DNA quality was assessed using an Agilent TapeStation. A sequencing library was constructed using a Ligation Sequencing gDNA Native Barcode Kit 24 V14 (SQK-NBD114.24). Genomic DNA (1 µg) was sequenced on the FLO-MIN114 flow cell of a MinION Mk1B. Raw sequencing reads were obtained using MinKNOW v24.02.10, and bases were called using the super-accurate mode v5.0.0. Dorado v0.9.1 was used to classify the native barcodes. The NanoGalaxy platform (7) was used for all bioinformatic analyses. Default parameters were used, except where otherwise noted. Adapter sequences were trimmed using Porechop v0.2.4 (https://github.com/rrwick/Porechop). Filtlong v0.2.1 (https://github.com/rrwick/Filtlong) was used to filter the reads, retaining 1G bases with a minimum length, mean quality score, and window quality score of 1,000 bp, 10, and 10, respectively. Those processes yielded 66,100 reads and 16,416 bases at N50 read length. Two circular contigs, which meant a chromosome (350× coverage) and a plasmid (545× coverage), were generated via de novo assembly using Flye v2.9.5 (8) in nano-raw mode with five polishing iterations. Additionally, Illumina HiSeq paired-end reads (2 × 100 bp) of the DSM 19394 (=JCM 15949) genome sequence (3) were obtained from the NCBI Sequence Read Archive (ERR387483). A filtering process using fastp v0.24.0 (9) yielded 3.64M Illumina reads with 122× coverage before alignment to two circular contigs using BWA-MEM2 v2.2.1 (10). The contigs were polished twice with Illumina reads using Pilon v1.20.1 (11).
Quality checks using CheckM (12), reorientation to dnaA, and genome annotation were performed using a web-based DFAST genome annotation tool (13). The genome completeness and contamination rates were 99.35% and 0.65%, respectively. The complete genome contains a circular chromosome (2,915,962 bp, 51.71% GC content) and a circular plasmid, pJCM15949 (30,910 bp, 39.78% GC content). Annotation predicted 2,729 coding sequences (CDSs), 65 tRNA genes, and 18 rRNA genes in the chromosome.
ACKNOWLEDGMENTS
This work was supported by JSPS KAKENHI Grant Number JP23K13873, a cooperative research grant of the genome research for bioresource from NODAI Genome Research Center in Tokyo University of Agriculture, a research grant from the Iijima Tojuro Memorial Foundation for Food Science and Technology in FY2023, and a research grant from the Mishima Kaiun Memorial Foundation in FY2024. We would like to thank Editage (www.editage.jp) for English language editing.
Footnotes
writing - review and editing
Contributor Information
Mugihito Oshiro, Email: oshiro@agr.kyushu-u.ac.jp.
David A. Baltrus, The University of Arizona, Tucson, Arizona, USA
DATA AVAILABILITY
The complete genome sequence of L. acidifarinae type strain JCM 15949 has been deposited in DDBJ (the Nucleotide accession numbers AP040118 and AP040119) under BioProject PRJDB20195. The raw sequence data are available in the DDBJ Sequence Read Archive with the accession number DRX636836. The detailed DNA extraction protocol is shown in figshare (https://doi.org/10.6084/m9.figshare.28787192.v1).
REFERENCES
- 1. Zheng J, Wittouck S, Salvetti E, Franz C, Harris HMB, Mattarelli P, O’Toole PW, Pot B, Vandamme P, Walter J, Watanabe K, Wuyts S, Felis GE, Gänzle MG, Lebeer S. 2020. A taxonomic note on the genus Lactobacillus: description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae. Int J Syst Evol Microbiol 70:2782–2858. doi: 10.1099/ijsem.0.004107 [DOI] [PubMed] [Google Scholar]
- 2. Vancanneyt M, Neysens P, De Wachter M, Engelbeen K, Snauwaert C, Cleenwerck I, Van der Meulen R, Hoste B, Tsakalidou E, De Vuyst L, Swings J. 2005. Lactobacillus acidifarinae sp. nov. and Lactobacillus zymae sp. nov., from wheat sourdoughs. Int J Syst Evol Microbiol 55:615–620. doi: 10.1099/ijs.0.63274-0 [DOI] [PubMed] [Google Scholar]
- 3. Sun Z, Harris HMB, McCann A, Guo C, Argimón S, Zhang W, Yang X, Jeffery IB, Cooney JC, Kagawa TF, et al. 2015. Expanding the biotechnology potential of lactobacilli through comparative genomics of 213 strains and associated genera. Nat Commun 6:8322. doi: 10.1038/ncomms9322 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Sato K, Ikagawa Y, Niwa R, Nishioka H, Horie M, Iwahashi H. 2023. Genome sequencing unveils nomadic traits of Lactiplantibacillus plantarum in Japanese post-fermented tea. Curr Microbiol 81:52. doi: 10.1007/s00284-023-03566-9 [DOI] [PubMed] [Google Scholar]
- 5. Colombini L, Santoro F, Tirziu M, Lazzeri E, Morelli L, Pozzi G, Iannelli F. 2023. The mobilome of Lactobacillus crispatus M247 includes two novel genetic elements: Tn7088 coding for a putative bacteriocin and the siphovirus prophage ΦM247. Microb Genom 9:001150. doi: 10.1099/mgen.0.001150 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Pinzauti D, Iannelli F, Pozzi G, Santoro F. 2022. DNA isolation methods for nanopore sequencing of the Streptococcus mitis genome. Microb Genom 8:000764. doi: 10.1099/mgen.0.000764 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. de Koning W, Miladi M, Hiltemann S, Heikema A, Hays JP, Flemming S, van den Beek M, Mustafa DA, Backofen R, Grüning B, Stubbs AP. 2020. NanoGalaxy: nanopore long-read sequencing data analysis in Galaxy. Gigascience 9:giaa105. doi: 10.1093/gigascience/giaa105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8 [DOI] [PubMed] [Google Scholar]
- 9. Chen S, Zhou Y, Chen Y, Gu J. 2018. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. doi: 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Vasimuddin M, Misra S, Li H, Aluru S. Efficient architecture-aware acceleration of BWA-MEM for multicore systems. 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS); Rio de Janeiro, Brazil: , p 314–324IEEE, Brazil. doi: 10.1109/IPDPS.2019.00041 [DOI] [Google Scholar]
- 11. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Tanizawa Y, Fujisawa T, Nakamura Y. 2018. DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication. Bioinformatics 34:1037–1039. doi: 10.1093/bioinformatics/btx713 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The complete genome sequence of L. acidifarinae type strain JCM 15949 has been deposited in DDBJ (the Nucleotide accession numbers AP040118 and AP040119) under BioProject PRJDB20195. The raw sequence data are available in the DDBJ Sequence Read Archive with the accession number DRX636836. The detailed DNA extraction protocol is shown in figshare (https://doi.org/10.6084/m9.figshare.28787192.v1).
