ABSTRACT
The whole-genome sequence of a Weizmannia (Bacillus) coagulans strain (ProBC Plus) isolated from fermented rice is reported here. The genome sequence provides a basis for assessing the safety of the strain and offers insights into its potential probiotic properties.
ANNOUNCEMENT
The strain was isolated from fermented rice samples and cultured in de Man-Rogosa-Sharpe (MRS) medium for 24 to 48 h at 37°C. The 16S rRNA gene region was amplified using the forward primer 27F 5′ (AGA GTT TGA TCM TGG CTC AG) 3′ and the reverse primer 1492R 5′ (TAC GGY TAC CTT GTT ACG ACT T) 3′. The gene was sequenced by the Sanger dideoxy sequencing method at Macrogen (Seoul, South Korea). The strain exhibited 100% similarity to Weizmannia coagulans LMG S-31876 (GenBank accession number MZ687045) (1).
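The two primers above contain IUPAC degenerate bases (M in 27F and Y in 1492R), so each denotes a small pool of concrete oligonucleotides. The following minimal sketch, which is not part of the original workflow, expands a degenerate primer into the sequences it represents. The function name `expand_primer` and the lookup table name `IUPAC` are illustrative assumptions; only the primer sequences come from the text.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_primer(primer):
    """Return every concrete sequence encoded by a degenerate primer.

    Spaces (used in the text to group bases in triplets) are ignored.
    """
    seq = primer.replace(" ", "").upper()
    choices = [IUPAC[base] for base in seq]
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    for name, primer in [
        ("27F", "AGA GTT TGA TCM TGG CTC AG"),
        ("1492R", "TAC GGY TAC CTT GTT ACG ACT T"),
    ]:
        for variant in expand_primer(primer):
            print(name, variant)
```

Each primer carries a single two-fold degenerate position, so each expands to two sequences.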
Genomic DNA from a culture pellet was extracted with the phenol-chloroform method (2) and fragmented by sonication in separate, generic, polypropylene tubes on a Qsonica Q800R2 sonicator. One hundred nanograms of fragmented DNA was used to prepare a paired-end sequencing library with the NEBNext Ultra II DNA library preparation kit for Illumina (New England Biolabs), and libraries were sequenced on an Illumina MiSeq instrument using the MiSeq reagent kit v2 (500 cycles).
A total of 17,528,596 paired-end reads with an average read length of 150 bp, corresponding to genome coverage of 782.52×, were generated. Of these, 14,985,902 high-quality paired-end reads were retained after filtering with the NGS QC Toolkit, using a Phred quality score cutoff value of 20 (3). The high-quality reads were assembled de novo with SPAdes v3.15.5 (4). The draft genome consists of 183 contigs totaling 3,461,262 bp, with a GC content of 46% and an N50 value of 64,268 bp; the largest assembled scaffold is 214,117 bp (3). The genome sequence was annotated with the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v6.2 (5). Genes were predicted and translated with Prodigal (6), followed by pathway identification with the KEGG Automatic Annotation Server (KAAS) (7). A total of 3,388 genes, 3,289 coding sequences (CDSs), 17 rRNAs, and 77 tRNAs were predicted. A total of 140 genes are involved in the synthesis of proteins and enzymes needed for the dormancy and sporulation stages, and the strain is predicted to encode approximately 324 proteins involved in carbohydrate metabolism and 264 proteins involved in amino acid metabolism. The strain also has genes for biotin, riboflavin, cobalamin, thiamine, vitamin B6, and folate production.
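Two of the quantities reported above have simple definitions that can be sketched in code. A Phred score Q corresponds to a base-calling error probability of 10^(-Q/10), so the cutoff of 20 means at most a 1% chance of error per base. The N50 is the length of the contig at which the cumulative length, counted from the longest contig downward, first reaches half of the assembly size. The snippet below is a minimal illustration under those standard definitions. It does not reproduce the NGS QC Toolkit or SPAdes output, and the contig lengths in the usage example are hypothetical, not taken from this assembly.

```python
def phred_error_probability(q):
    """Base-calling error probability implied by a Phred quality score."""
    return 10 ** (-q / 10)


def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are visited from longest to shortest; the N50 is the length
    of the contig at which the running total first reaches at least
    half of the summed length of all contigs.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("n50() requires at least one contig length")


if __name__ == "__main__":
    # Hypothetical contig lengths, for illustration only.
    example_contigs = [100, 80, 60, 40, 20]
    print("Error probability at Q20:", phred_error_probability(20))
    print("N50 of example contigs:", n50(example_contigs))
```

For real data, the list of lengths would be read from the assembled FASTA file rather than typed in by hand.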
The genome was screened for putative virulence factors (with the Virulence Factor Database [VFDB] [8]), plasmids (with PlasmidFinder v2.0 [9]), and antibiotic resistance genes (with the Antibiotic Resistance Genes Database [ARDB] [10]). No plasmids or antibiotic resistance genes were detected, and genes encoding putative virulence factors such as the hemolytic enterotoxin complex HBL (hblA, hblB, hblC, and hblD), nonhemolytic enterotoxin (NHE) (nheA, nheB, and nheC), and cytotoxin K (hemolysin IV) were not found. BLASTx comparisons were performed between the Weizmannia coagulans LMG S-31876 draft genome and protein sequences involved in biogenic amine production, and genes such as those for histidine decarboxylase (hdc), tyrosine decarboxylase (tdc), ornithine decarboxylase, agmatine dihydrolase (deiminase), and putrescine production were absent (11). CRISPRFinder was used to screen for clustered regularly interspaced short palindromic repeat (CRISPR) sequences, and Weizmannia coagulans LMG S-31876 contained one confirmed CRISPR sequence (12). Analysis of the genome sequence of ProBC Plus Weizmannia coagulans LMG S-31876 indicates that it meets probiotic safety criteria at the genome level.
Data availability.
The whole-genome shotgun sequence of Weizmannia coagulans LMG S-31876 has been deposited in DDBJ/EMBL/GenBank under the accession number JANKOH000000000. The version described in this paper is the first version, JANKOH010000000. The raw sequence reads have been submitted to the NCBI SRA with accession number SRR21429472.
ACKNOWLEDGMENTS
We are grateful to everyone who directly or indirectly contributed to the completion of this research activity.
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Contributor Information
Ranjith Kumar Kallur, Email: noreply@abodebiotec.com.
Vanja Klepac-Ceraj, Wellesley College.
REFERENCES
1. Sreenadh M, Kumar KR, Nath S. 2022. In vitro evaluation of Weizmannia coagulans strain LMG S-31876 isolated from fermented rice for potential probiotic properties, safety assessment and technological properties. Life 12:1388. doi: 10.3390/life12091388.
2. Chan PKS, Chan DP, To KF, Yu MY, Cheung JL, Cheng AF. 2001. Evaluation of extraction methods from paraffin wax embedded tissues for PCR amplification of human and viral DNA. J Clin Pathol 54:401–403. doi: 10.1136/jcp.54.5.401.
3. Patel RK, Jain M. 2012. NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS One 7:e30619. doi: 10.1371/journal.pone.0030619.
4. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021.
5. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569.
6. Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi: 10.1186/1471-2105-11-119.
7. Moriya Y, Itoh M, Okuda S, Yoshizawa A, Kanehisa M. 2007. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res 35:W182–W185. doi: 10.1093/nar/gkm321.
8. Chen L, Zheng D, Liu B, Yang J, Jin Q. 2016. VFDB 2016: hierarchical and refined dataset for big data analysis—10 years on. Nucleic Acids Res 44:D694–D697. doi: 10.1093/nar/gkv1239.
9. Carattoli A, Zankari E, García-Fernández A, Voldby Larsen M, Lund O, Villa L, Møller Aarestrup F, Hasman H. 2014. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrob Agents Chemother 58:3895–3903. doi: 10.1128/AAC.02412-14.
10. Liu B, Pop M. 2008. ARDB—Antibiotic Resistance Genes Database. Nucleic Acids Res 37:D443–D447. doi: 10.1093/nar/gkn656.
11. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421.
12. Grissa I, Vergnaud G, Pourcel C. 2007. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res 35:W52–W57. doi: 10.1093/nar/gkm360.
