ABSTRACT
Lactiplantibacillus plantarum strain VHProbi P06 is a probiotic that was isolated from kimchi soup. Here, we investigate the whole-genome sequence of this strain, which contains a chromosome and seven plasmids.
KEYWORDS: complete genome sequence, Lactiplantibacillus plantarum, VHProbi P06
ANNOUNCEMENT
One milliliter of kimchi soup purchased from the supermarket was diluted in 99 mL of de Man-Rogosa-Sharpe (MRS) agar (Haibo, China) broth medium. Then, 100 µL of dilution was spread onto the MRS agar plate and incubated under anaerobic conditions at 37°C for 24 h to enrich bacteria. A colony named VHProbi P06 was picked up and identified as the species Lactiplantibacillus plantarum using 16S rRNA sequencing, as described by Liu et al. (1). The Illumina HiSeq 2500 and the PacBio RS II Single Molecule Real Time (SMRT) sequencing platforms were combined to sequence the genome of this strain. The strain was cultured in the same growth condition as isolation for DNA extraction. Total DNA was extracted using the Genomic DNA Purification Kit (Promega, USA) following its instruction and was used for the preparation of both the Illumina and the PacBio libraries. Template DNA was sheared into 400-bp fragments for the Illumina library creation using the TruSeq library preparation kit from Illumina. The sheared fragments were used to prepare the Illumina libraries using the NEXTflex Rapid DNA-Seq Kit (BioScientific, USA). The prepared libraries were then used for paired-end Illumina sequencing (2 × 150 bp). For the PacBio sequencing, DNA was spun in a Covaris G-Tube (Covaris, Woburn, MA, USA) to be sheared into fragments. Then, ~10-kb fragments were purified, end-repaired, and ligated with the SMRTbell sequencing adapters following the PacBio recommendation. Next, a ~10-kb insert library was prepared using SMRTbell Express Template Prep Kit v.2.0 and sequenced on one SMRT cell using standard methods. Finally, 8,091,694 raw reads and 7,991,932 clean reads were generated using the Illumina sequencing platform to reach a depth of 1165.49-fold coverage and 385,591 raw reads (N 50; 218,077 bp) were generated using the PacBio sequencing platform. For Illumina raw data, adapters, non-A, G, C, and T bases of the 5′ end reads containing >10% unknown N and low-quality reads were removed by using Sickle v.1.33 (https://github.com/najoshi/sickle). Clean data were assembled into a scaffold using SOAPdenovo v.2.04 (2). The PacBio raw reads were then assembled into a scaffold using Hierarchical Genome Assembly Process v.4.0 (3) and Canu v.2.2 (4). Error correction of the PacBio assembly results was performed by Pilon v.1.22 (5) using the Illumina reads. If there was an overlap at the two ends with 5,000 bp, one end of the overlap was cut off to circulate the sequences. The final assembly generated a complete genome with a seamless chromosome and seven plasmids. Glimmer v.3.02 (6), tRNA-scan-SE v.2.0 (7), and Barrnap v.0.9 were used to predict protein-coding genes, tRNA and rRNA, respectively (https://github.com/tseemann/barrnap/). CheckM v.1.1.3 was used to check the quality of the assembled genome sequence, with a completeness of 99.45%, contamination of 0%, and strain heterogeneity of 0% (8). Genome annotations were conducted using the National Center for Biotechnology Information Prokaryotic Genome Annotation Pipeline v.5.3 (9). Default parameters were used for all software unless otherwise specified.
The whole-genome sequence of L. plantarum VHProbi P06 contains a 3,206,682-bp circular chromosome with GC contents of 44.66%. There were 3,115 protein-coding genes, 16 rRNA genes (5S rRNA, 16S rRNA, and 23S rRNA), 69 tRNA genes, 4 ncRNA genes, and 63 pseudogenes in the genome. There are also seven plasmids (Table 1). We stated that these molecules are plasmids verified by Pilon v.1.22.
TABLE 1.
Summary data of the plasmids
ACKNOWLEDGMENTS
We appreciate the individuals involved in this study.
This research was financially supported by the Mountain Tai New Strategy Industry Leader Program (grant no. tscy20180317).
Contributor Information
Zhi Duan, Email: duanzhi@vlandgroup.com.
David Rasko, University of Maryland School of Medicine, Baltimore, Maryland, USA .
DATA AVAILABILITY
The 16S rRNA sequence and the whole-genome sequence of Lactiplantibacillus plantarum VHProbi P06 have been deposited in the database with GenBank accession numbers OQ592686 and CP104084 to CP104091, SRA accession numbers SRR21288989 and SRR21288990, BioProject accession number PRJNA874163, and BioSample accession number SAMN30526907.
REFERENCES
- 1. Liu W, Bao Q, Qing M, Chen X, Sun T, Li M, Zhang J, Yu J, Bilige M, Sun T, Zhang H. 2012. Isolation and identification of lactic acid bacteria from Tarag in Eastern inner Mongolia of China by 16S rRNA sequences and DGGE analysis. Microbiol Res 167:110–115. doi: 10.1016/j.micres.2011.05.001 [DOI] [PubMed] [Google Scholar]
- 2. Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu S-M, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam T-W, Wang J. 2012. Soapdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1:18. doi: 10.1186/2047-217X-1-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Chin C-S, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10:563–569. doi: 10.1038/nmeth.2474 [DOI] [PubMed] [Google Scholar]
- 4. Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi: 10.1101/gr.215087.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Ingram S, Munzner T, Olano M. 2009. Glimmer: multilevel MDS on the GPU. IEEE Trans Vis Comput Graph 15:249–261. doi: 10.1109/TVCG.2008.85 [DOI] [PubMed] [Google Scholar]
- 7. Chan PP, Lowe TM. 2019. tRNAscan-SE: searching for tRNA genes in genomic sequences. Methods Mol Biol 1962:1–14. doi: 10.1007/978-1-4939-9173-0_1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. Checkm: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The 16S rRNA sequence and the whole-genome sequence of Lactiplantibacillus plantarum VHProbi P06 have been deposited in the database with GenBank accession numbers OQ592686 and CP104084 to CP104091, SRA accession numbers SRR21288989 and SRR21288990, BioProject accession number PRJNA874163, and BioSample accession number SAMN30526907.