Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2022 Dec 1;12(1):e01071-22. doi: 10.1128/mra.01071-22

Complete Genome Sequence of Lacticaseibacillus paracasei Strain VHProbi O44, Isolated from Feces from a Healthy Baby

Qian Wang a, Hongchang Cui a, Chaoqun Guo a, Zhi Duan a,
Editor: David Raskob
PMCID: PMC9872572  PMID: 36453935

ABSTRACT

Lacticaseibacillus paracasei strain VHProbi O44 is a Chinese commercial lactic acid bacterium with several probiotic functions. The whole genome contains a chromosome and three plasmids.

ANNOUNCEMENT

Infant fecal samples were collected in accordance with the Declaration of Helsinki, serially diluted with 0.85% (wt/vol) NaCl, and homogenized using a Stomacher. The dilution was spread onto De Man-Rogosa-Sharpe (MRS) agar (Luqiao, China) plates and incubated at 37°C for 48 h to enrich bacteria. A selected bacterial strain was named VHProbi O44. The bacterium was identified by Gram staining, biochemical and molecular biological analyses, such as with API 50 CHL medium, 16S rRNA gene Sanger sequencing, and mass spectrometry (1). The results showed that this strain belongs to the species Lacticaseibacillus paracasei. The Illumina HiSeq 2500 and Pacific Biosciences (PacBio) RS II sequencing platforms were combined for sequencing of this strain. Strain VHProbi O44 was cultured using the same method as for isolation to extract genomic DNA. Total DNA was extracted using a genomic DNA purification kit (Promega, USA) (2). DNA samples were quality controlled using a TBS-380 fluorometer (Turner BioSystems Inc.), and high-quality DNA was used for further research (3). Illumina template DNA was sheared into 400-bp fragments for library creation using the Illumina TruSeq library preparation kit. The prepared libraries were then used for paired-end Illumina sequencing (2 × 150 bp). For PacBio sequencing, DNA was centrifuged in g-TUBES (Covaris, Woburn, MA) for shearing into fragments. Then, ~10-kb fragments were purified, end repaired, and ligated with SMRTbell sequencing adapters following the manufacturer’s recommendations (PacBio, Menlo Park, CA) (4, 5). Next, an insert library was prepared using the SMRTbell Express template preparation kit v2.0 and sequenced in one single-molecule real-time (SMRT) Cell using standard methods. Finally, 7,024,462 raw reads and 6,801,162 clean reads were generated using the Illumina sequencing platform, to reach a depth of 337.42-fold coverage, and 87,960 raw reads (N50, 12,674 bp) were generated using the PacBio sequencing platform. Adapters, non-A/G/C/T bases at the 5′ end, reads containing >10% unknown N bases, and low-quality reads were removed by using Sickle v1.33 (6). Clean data were assembled into a scaffold using SOAPdenovo v2.04 (7). The PacBio raw reads were then assembled into a scaffold using the Hierarchical Genome Assembly Process (HGAP) and Canu v2.2 (8). Error correction of the PacBio assembly results was performed with Pilon v1.22 (9) using the Illumina reads. If there was a 5,000-bp overlap of the two ends, then one end of the overlap was cut off to circularize the sequence. The final assembly generated a complete genome with a seamless chromosome and three plasmids. Glimmer v3.02 (10), tRNAscan-SE v2.0 (11), and Barrnap v0.9 (https://github.com/tseemann/barrnap) were used for prediction of coding sequences (CDSs), tRNAs, and rRNAs, respectively. Genome annotations were conducted using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v6.2 (12). Default parameters were used for all software unless otherwise specified.

The genome of Lacticaseibacillus paracasei strain VHProbi O44 consists of one circular chromosome (3,105,961 bp) and three plasmids (79,930 bp, 68,258 bp, and 64,133 bp), with GC contents of 46.24%. There were 3,096 protein-coding genes, 15 rRNA genes (5S rRNA, 16S rRNA, and 23S rRNA), 59 tRNA genes, 3 noncoding RNA genes, and 92 pseudogenes in the genome.

Data availability.

The final annotated genome sequence of Lacticaseibacillus paracasei strain VHProbi O44 has been deposited in the GenBank database with accession numbers CP104303, CP104304, CP104305, and CP104306. The 16S rRNA gene sequence has been deposited in the GenBank database with accession number OP692715. The SRA accession numbers are SRR21285515 and SRR21285516. The BioProject accession number is PRJNA874474, and the BioSample accession number is SAMN30550144.

ACKNOWLEDGMENTS

Many thanks go to everyone who participated in the research.

This study was supported by the Mountain Tai New Strategy Industry Leader Program (grant tscy20180317).

Contributor Information

Zhi Duan, Email: duanzhi@vlandgroup.com.

David Rasko, University of Maryland School of Medicine.

REFERENCES

  • 1.Liu W, Bao Q, Jirimutu Qing M, Siriguleng Chen X, Sun T, Li M, Zhang J, Yu J, Bilige M, Sun T, Zhang H. 2012. Isolation and identification of lactic acid bacteria from Tarag in Eastern Inner Mongolia of China by 16S rRNA sequences and DGGE analysis. Microbiol Res 167:110–115. doi: 10.1016/j.micres.2011.05.001. [DOI] [PubMed] [Google Scholar]
  • 2.Ariel O, Brouard J-S, Marete A, Miglior F, Ibeagha-Awemu E, Bissonnette N. 2021. Genome-wide association analysis identified both RNA-seq and DNA variants associated to paratuberculosis in Canadian Holstein cattle ‘in vitro’ experimentally infected macrophages. BMC Genomics 22:162. doi: 10.1186/s12864-021-07487-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Qin H, Liu Y, Cao X, Jiang J, Lian W, Qiao D, Xu H, Cao Y. 2020. RpoS is a pleiotropic regulator of motility, biofilm formation, exoenzymes, siderophore and prodigiosin production, and trade-off during prolonged stationary phase in Serratia marcescens. PLoS One 15:e0232549. doi: 10.1371/journal.pone.0232549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Garagnani P, Marquis J, Delledonne M, Pirazzini C, Marasco E, Kwiatkowska KM, Iannuzzi V, Bacalini MG, Valsesia A, Carayol J, Raymond F, Ferrarini A, Xumerle L, Collino S, Mari D, Arosio B, Casati M, Ferri E, Monti D, Nacmias B, Sorbi S, Luiselli D, Pettener D, Castellani G, Sala C, Passarino G, De Rango F, D'Aquila P, Bertamini L, Martinelli N, Girelli D, Olivieri O, Giuliani C, Descombes P, Franceschi C. 2021. Whole-genome sequencing analysis of semi-supercentenarians. Elife 10:e57849. doi: 10.7554/eLife.57849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Felestrino ÉB, Sanchez AB, Caneschi WL, Lemes CGC, Assis RAB, Cordeiro IF, Fonseca NP, Villa MM, Vieira IT, Kamino LHY, do Carmo FF, da Silva AM, Thomas AM, Patané JSL, Ferreira FC, de Freitas LG, Varani AM, Ferro JA, Silva RS, Almeida NF, Garcia CCM, Setubal JC, Moreira LM. 2020. Complete genome sequence and analysis of Alcaligenes faecalis strain Mc250, a new potential plant bioinoculant. PLoS One 15:e0241546. doi: 10.1371/journal.pone.0241546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Hawaz E. 2014. Isolation and identification of probiotic lactic acid bacteria from curd and in vitro evaluation of its growth inhibition activities against pathogenic bacteria. Afr J Microbiol Res 8:1419–1425. [Google Scholar]
  • 7.Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu SM, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam TW, Wang J. 2015. Erratum: SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 4:30. doi: 10.1186/s13742-015-0069-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chan PP, Lowe TM. 2019. tRNAscan-SE: searching for tRNA genes in genomic sequences. Methods Mol Biol 1962:1–14. doi: 10.1007/978-1-4939-9173-0_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The final annotated genome sequence of Lacticaseibacillus paracasei strain VHProbi O44 has been deposited in the GenBank database with accession numbers CP104303, CP104304, CP104305, and CP104306. The 16S rRNA gene sequence has been deposited in the GenBank database with accession number OP692715. The SRA accession numbers are SRR21285515 and SRR21285516. The BioProject accession number is PRJNA874474, and the BioSample accession number is SAMN30550144.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES