Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2024 May 2;13(6):e00082-24. doi: 10.1128/mra.00082-24

Complete genome sequence of Lacticaseibacillus paracasei A02

Jiahao Li 1, Xiuyun Wu 1, Sheng Xing 2,3, Bo Ding 2,3,
Editor: David Rasko4
PMCID: PMC11237647  PMID: 38695579

ABSTRACT

Lacticaseibacillus paracasei, a facultative anaerobic gram-positive bacterium, is commonly found in the gut of humans and animals, as well as in dairy products and plant ferments. Here, we report the complete genome sequence of L. paracasei A02, whose total genome length is 3,038,472 bp, with a GC content of 46.41%.

KEYWORDS: genome sequence

ANNOUNCEMENT

Lacticaseibacillus paracasei is a species of facultative anaerobic gram-positive bacteria and commonly found in the gut of humans and animals, dairy products, and plant ferments (1). The L. paracasei A02 was isolated from stool sample of healthy children stored in – 80°C refrigerator. We adhered to the principles of the Declaration of Helsinki. An amount of 0.1 g sample was added to 1 mL 0.9% sterile sodium chloride solution. Also, 0.1 mL of the mixture was inoculated in 10 mL of Li-Mupirocin-modified Man Rogosa Sharpe (MRS) medium (QingDao Hopebio-Technology Co., Ltd.) and cultured at 37°C for 72 h under anaerobic conditions. The enriched bacterial solution with sterile saline gradient dilution was coated on an AGAR plate containing MRS medium. A single colony purified by three rounds of scribing was selected for further identification. The 16S rRNA gene sequence was amplified using PCR with the primer pair 27F and 1492R. The 16S rRNA gene was obtained (with the GenBank accession number PP439829) by Sanger dideoxy sequencing. The strain A02 was identified as L. paracasei.

L. paracasei A02 was cultured in Li-Mupirocin-modified MRS medium at 37°C for 48 h under anaerobic conditions for DNA isolation using the QIAGEN Genomic-tip 20/G Kit (QIAGEN). Oxford Nanopore Technologies (ONT) and Illumina libraries were prepared from the same DNA preparation. For Illumina sequencing, DNA samples were sheared into 400–500 bp fragments using a Covaris M220 Focused Acoustic Shearer following manufacturer’s protocol. Then, the library was prepared from the sheared fragments using the TrueLib DNA Library Rapid Prep Kit and sequenced on an Illumina Novaseq 6000 platform. For ONT sequencing, DNA fragments were repaired, purified, and then attach sequencing adapters supplied in the SQK-LSK109 Kit to the DNA ends. Then, the library was sequenced on a PromethION48 sequencer with PromethION Flow Cell (R9 Version). Illumina sequencing generated 4,644,856× 2 raw pair reads, 1,390,758,874 bases, with 99.04% coverage base on reads mapping and raw Q30 of 95.23%. The raw reads generated from the paired-end library were subjected to quality filtering using fastp v0.23.0. ONT sequencing generated 186,597 reads, 792,889,124 bases, with 98.07% coverage base on reads mapping and the largest, average length and reads N50 value of 112,940, 4249.21, and 8,408 bp. ONT reads were extracted, base called and demultiplexed, and trimmed with the minimum Q-score cutoff of 7. Guppy v5.0.16 was used as the base caller (2). Unicycler v0.4.8 (3) was used for hybrid assembly of Illumina data after quality control and ONT data, and finally, Pilon v1.22 (4) was used to map Illumina short sequences to the assembled genome for correction. Genome annotation was performed using NCBI Prokaryotic Genome Annotation Pipeline (PGAP v5.3) (5). Default parameters were used for all software unless otherwise specified.

The complete genome sequence of L. paracasei A02 contained one chromosome (length, 3,020,466 bp; GC content, 46.45%) and two plasmids (plasmid A: length, 11,833 bp; GC content, 40.69%; plasmid B: length, 6,173 bp; GC content, 39.28%). The total genome length is 3,038,472 bp, containing 2,823 protein-coding genes, 59 tRNAs, 15 rRNAs, and the average GC content is 46.41%.

ACKNOWLEDGMENTS

This work was supported by Funds for horizontal cooperation (22450023001).

Contributor Information

Bo Ding, Email: 13791051023@126.com.

David Rasko, University of Maryland School of Medicine, Baltimore, Maryland, USA.

DATA AVAILABILITY

The genome sequence of L. paracasei A02 is available in NCBI GenBank under genome accession numbers CP139216-CP139218. The raw sequence data are available under Sequence Read Archive (SRA) accession numbers SRR27324570 (Illumina) and SRR27324569 (ONT).

REFERENCES

  • 1. Stefanovic E, McAuliffe O. 2018. Comparative genomic and metabolic analysis of three Lactobacillus paracasei cheese isolates reveals considerable genomic differences in strains from the same niche. BMC Genomics 19:205. doi: 10.1186/s12864-018-4586-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Wick RR, Judd LM, Holt KE. 2019. Performance of neural network basecalling tools for Oxford Nanopore sequencing. Genome Biol 20:129. doi: 10.1186/s13059-019-1727-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng QD, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 44:6614–6624. doi: 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The genome sequence of L. paracasei A02 is available in NCBI GenBank under genome accession numbers CP139216-CP139218. The raw sequence data are available under Sequence Read Archive (SRA) accession numbers SRR27324570 (Illumina) and SRR27324569 (ONT).


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES