De novo genome assembly of a probiotic Lacticaseibacillus rhamnosus ISO20, isolated from raw milk in South Africa

Goitsemang Makete; Tshifhiwa Paris Mamphogoro

2024 Feb 20;13(3):e01227-23. doi: 10.1128/mra.01227-23

Editor: Vanja Klepac-Ceraj

PMCID: PMC10927639 PMID: 38376337

ABSTRACT

Lactic acid bacteria are known to exhibit probiotic properties through various mechanisms, including the production of antimicrobial substances and bile salt tolerance. Here, we report a draft genome sequence of Lacticaseibacillus rhamnosus ISO20, a lactic acid bacterium isolated from raw goat’s milk, to provide genomic insight into its strategies as a probiotic strain.

KEYWORDS: Lacticaseibacillus rhamnosus, goat's milk, potential probiotic bacterium, facultative anaerobic heterofermentative bacteria, de novo assembly

ANNOUNCEMENT

Lacticaseibacillus rhamnosus is a facultatively anaerobic, heterofermentative, rod-shaped bacterium isolated from different ecological niches, such as the gastrointestinal tract, fermented dairy products, and plant-associated environments (1–3). L. rhamnosus has a long history of safe use in applications where health and industrial benefits are associated with different strains (4).

L. rhamnosus ISO20 was isolated from goat’s milk sourced at the small-stock Division of the Agricultural Research Council, Animal Production (Irene, South Africa; 25° 53' 59.6"S, 28° 12' 51.6"E). Forty raw milk samples were collected in sterile plastic containers and transported on ice to the laboratory. One milliliter of each milk sample was suspended in 9 mL of sterile saline solution (0.85% wt/vol NaCl), and the mixture was serially diluted to 10⁻⁵. The sample suspension was then inoculated onto de Man, Rogosa, and Sharpe (MRS) agar supplemented with 0.05 g/L cysteine-HCl (MRS-cysHCl) and incubated for 24–48 hours at 37°C under anaerobic conditions. Distinct colonies formed on the plates were selected, and a pure culture was obtained by subculturing onto sterile MRS-cysHCl (5).
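The dilution scheme described above can be illustrated numerically. The following is a minimal Python sketch, assuming that every step is a tenfold dilution (1 mL carried into 9 mL of diluent) and that five such steps reach 10⁻⁵; the function name `dilution_series` and its parameters are hypothetical and are not part of the original protocol.

```python
from fractions import Fraction

def dilution_series(sample_ml=1, diluent_ml=9, steps=5):
    """Return the cumulative dilution reached after each serial step.

    Assumes the same sample and diluent volumes are used at every step.
    """
    # 1 mL of sample in a final volume of 10 mL gives a 1/10 dilution per step
    per_step = Fraction(sample_ml, sample_ml + diluent_ml)
    return [per_step ** n for n in range(1, steps + 1)]

series = dilution_series()
print([str(d) for d in series])
# ['1/10', '1/100', '1/1000', '1/10000', '1/100000']
```

Exact fractions are used instead of floating-point numbers so that each dilution prints as a clean ratio.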

Genomic DNA of ISO20 was extracted from an overnight liquid culture using the Quick-DNA Fungal/Bacterial Miniprep Kit (Zymo Research, Irvine, CA) following the manufacturer’s instructions. The DNA concentration was measured using a NanoDrop (Thermo Fisher Scientific, Carlsbad, CA, USA), and DNA quality was evaluated on a 2% agarose gel. Paired-end (2 × 150 bp) libraries were generated using the NEBNext Ultra II FS DNA Library Prep Kit (New England Biolabs, Ipswich, MA) and sequenced on an Illumina NextSeq platform at Inqaba Biotechnical Industries (Pty) Ltd. (Pretoria, South Africa), yielding a total of 1,519,878 paired-end reads. Read quality was evaluated using FastQC v0.11.5 (6) via KBase (7); the raw reads were then trimmed to remove low-quality reads and sequencing adaptors using Trimmomatic v0.36 (8). The trimmed reads were assembled de novo using SPAdes v3.15.3 (9). Assembly quality was assessed using QUAST v5.0.2 (Table 1) (10), and genome completeness and contamination were evaluated using CheckM v1.0.18 (11).

TABLE 1.

QUAST assembly statistics

Assembly	SPAdes v3.15.3
No. of contigs	30
Largest contig (bp)	831,503
Total length (bp)	2,957,989
GC (%)	46.5
N₅₀ (bp)	251,390
N₇₅ (bp)	141,836
L₅₀	4
L₇₅	6
No. of Ns per 100 kbp	13.15
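
For readers unfamiliar with the N and L statistics in Table 1, the sketch below shows how they are conventionally defined: contigs are sorted from longest to shortest, and Nx is the length of the contig at which the running total first reaches x% of the assembly, while Lx is the number of contigs needed to get there. This is a minimal Python illustration of that definition, not the QUAST implementation; the function name `nx_lx` and the toy contig lengths are hypothetical.

```python
def nx_lx(lengths, fraction=0.5):
    """Return (Nx, Lx) for a list of contig lengths in bp.

    fraction=0.5 gives N50/L50 and fraction=0.75 gives N75/L75.
    """
    ordered = sorted(lengths, reverse=True)
    threshold = fraction * sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= threshold:
            return length, count
    raise ValueError("no contigs supplied")

toy = [500, 400, 300, 200, 100]  # hypothetical contig lengths, total 1,500 bp
print(nx_lx(toy))        # (400, 2)
print(nx_lx(toy, 0.75))  # (300, 3)
```

Read this way, the L₅₀ of 4 in Table 1 means that the four longest contigs together span at least half of the 2,957,989 bp assembly.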


Identification of ISO20 was conducted using Kaiju v1.7.3 (12), and the results were visualized using Krona v2.7.1 (13). The assembly yielded a genome sequence of 2,957,989 bp in 30 contigs, with a G + C content of 46.5%, a coverage of 154×, and N₅₀ and L₅₀ values of 251,390 bp and 4, respectively. Genome completeness was estimated at 98.56%. Gene annotation was performed using RASTtk v1.073 and the NCBI Prokaryotic Genome Annotation Pipeline v6.5 (14, 15). All software programs were run with default parameters. Genome analysis revealed 2,751 total genes and 62 RNAs. The subsystem statistics distributed the protein-coding genes across 27 subsystem categories, with a total of 2,646 protein-coding genes (PCGs). A total of 969 genes were grouped into biological processes, cellular components, and molecular functions. The three largest groups were carbohydrates (n = 240), protein metabolism (n = 120), and amino acids and derivatives (n = 112) (Fig. 1). RASTtk revealed the presence of genes encoding acid tolerance, antioxidant activity, bile salt tolerance, adhesion, and bacteriocin production, all of which are essential characteristics for potential probiotic strains (16).
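
The reported coverage can be cross-checked with simple arithmetic. The sketch below is a minimal Python illustration, assuming that the 1,519,878 reads are read pairs (two 150-bp reads each) and that coverage is estimated as total sequenced bases divided by assembly length; the announcement does not state how the figure was derived, and the function name `mean_coverage` is hypothetical.

```python
def mean_coverage(read_pairs, read_length_bp, genome_length_bp):
    """Estimate mean depth as total sequenced bases / genome length."""
    total_bases = read_pairs * 2 * read_length_bp  # two reads per pair
    return total_bases / genome_length_bp

cov = mean_coverage(read_pairs=1_519_878,
                    read_length_bp=150,
                    genome_length_bp=2_957_989)
print(f"{cov:.0f}x")  # 154x
```

Under these assumptions the estimate agrees with the reported 154×. Because it uses untrimmed read lengths, it is an upper bound on the depth available after trimming.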

Fig 1. Subsystem category distribution of key protein-coding genes (PCGs) of L. rhamnosus strain ISO20, annotated using the RAST SEED Viewer online server. The green/blue bar represents the subsystem coverage as a percentage. The blue bar corresponds to the percentage (%) of proteins present.

ACKNOWLEDGMENTS

This research was financially supported by the Department of Agriculture, Land Reform and Rural Development in collaboration with the Agricultural Research Council of South Africa, project number P02000272.

Contributor Information

Goitsemang Makete, Email: makete@arc.agric.za.

Tshifhiwa Paris Mamphogoro, Email: MamphogoroT@arc.agric.za.

Vanja Klepac-Ceraj, Wellesley College Department of Biological Sciences, USA.

DATA AVAILABILITY

This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number JASVVP000000000. The version described in this paper is the first version. The SRA accession number is SRR24904735, the BioProject accession number is PRJNA896361, and the BioSample accession number is SAMN35721675.

REFERENCES

1. Lukjancenko O, Ussery DW, Wassenaar TM. 2012. Comparative genomics of Bifidobacterium, Lactobacillus and related probiotic genera. Microb Ecol 63:651–673. doi: 10.1007/s00248-011-9948-y
2. Broadbent JR, Neeno-Eckwall EC, Stahl B, Tandee K, Cai H, Morovic W, Horvath P, Heidenreich J, Perna NT, Barrangou R, Steele JL. 2012. Analysis of the Lactobacillus casei supragenome and its influence in species evolution and lifestyle adaptation. BMC Genomics 13:533. doi: 10.1186/1471-2164-13-533
3. Mahony J, Ainsworth S, Stockdale S, van Sinderen D. 2012. Phages of lactic acid bacteria: the role of genetics in understanding phage-host interactions and their co-evolutionary processes. Virology 434:143–150. doi: 10.1016/j.virol.2012.10.008
4. Ceapa C, Davids M, Ritari J, Lambert J, Wels M, Douillard FP, Smokvina T, de Vos WM, Knol J, Kleerebezem M. 2016. The variable regions of Lactobacillus rhamnosus genomes reveal the dynamic evolution of metabolic and host-adaptation repertoires. Genome Biol Evol 8:1889–1905. doi: 10.1093/gbe/evw123
5. Makete G, Aiyegoro OA, Thantsha MS. 2017. Isolation, identification and screening of potential probiotic bacteria in milk from South African Saanen goats. Probiotics Antimicrob Proteins 9:246–254. doi: 10.1007/s12602-016-9247-5
6. Andrews S. 2010. FastQC: a quality control tool for high throughput sequence data. Babraham Institute, Cambridge, United Kingdom. Available from: http://www.bioinformatics.babraham.ac.uk/projects/fastqc
7. Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, et al. 2018. KBase: the United States Department of Energy systems biology knowledgebase. Nat Biotechnol 36:566–569. doi: 10.1038/nbt.4163
8. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170
9. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021
10. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086
11. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114
12. Menzel P, Ng KL, Krogh A. 2016. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat Commun 7:11257. doi: 10.1038/ncomms11257
13. Ondov BD, Bergman NH, Phillippy AM. 2011. Interactive metagenomic visualization in a web browser. BMC Bioinformatics 12:385. doi: 10.1186/1471-2105-12-385
14. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, et al. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. doi: 10.1186/1471-2164-9-75
15. Li W, O’Neill KR, Haft DH, DiCuccio M, Chetvernin V, Badretdin A, Coulouris G, Chitsaz F, Derbyshire MK, Durkin AS, Gonzales NR, Gwadz M, Lanczycki CJ, Song JS, Thanki N, Wang J, Yamashita RA, Yang M, Zheng C, Marchler-Bauer A, Thibaud-Nissen F. 2021. RefSeq: expanding the prokaryotic genome annotation pipeline reach with protein family model curation. Nucleic Acids Res 49:D1020–D1028. doi: 10.1093/nar/gkaa1105
16. Tareb R, Bernardeau M, Vernoux JP. 2015. Genome sequence of Lactobacillus rhamnosus strain CNCM I-3698. Genome Announc 3:e00582-15. doi: 10.1128/genomeA.00582-15
