Skip to main content
Brazilian Journal of Microbiology logoLink to Brazilian Journal of Microbiology
. 2021 Apr 23;52(3):1557–1563. doi: 10.1007/s42770-021-00479-2

Draft genome sequence of Lactobacillus delbrueckii subsp. bulgaricus LBP UFSC 2230: a tool for preliminary identification of enzymes involved in CLA metabolism

Gabriela Christina Kuhl 1,#, Ricardo Ruiz Mazzon 2,#, Rubens Tadeu Delgado Duarte 2, Juliano De Dea Lindner 1,
PMCID: PMC8324633  PMID: 33891284

Abstract

Several Lactobacillus ssp. are recognized as potential conjugated linoleic acid (CLA) producers. We have previously reported the ability of a range of Lactobacillus delbrueckii subsp. bulgaricus strains to produce CLA in fermented milk, being a potential candidate for the fermented dairy food chain. This study reports the draft genome sequence of L. bulgaricus strain LBP UFSC 2230, isolated from Italian Grana Padano cheese. Draft genome sequence originated in a total of 4,310,842 paired-end reads that were quality trimmed and assembled into 135 contigs with a total length of 604,745,873 bp, including 2086 protein coding genes and an average GC content of 49.7%. Draft genome sequence represents an important tool to identify the enzymes involved in this strain’s CLA metabolism. We identified a gene encoding an enzyme involved in biohydrogenation of linoleic acid pathway, oleate hydratase.

Keywords: Functional food; Bioprocess; Lactic acid bacteria; Lactobacillus ssp.; Genome sequencing; cis-9, trans-11 CLA

Introduction

Certain foods consist of a relevant number of functional microbial species. Among these, lactic acid bacteria (LAB) play an essential role in the development of starter cultures [1]. LAB are an important group of microorganisms used for a variety of food fermentations, and most of these LAB acquire the status of “Qualified Presumption of Safety” (QPS) or “Generally Recognized As Safe” (GRAS) by the European Food Safety Authority (EFSA) and by the US Food and Drug Administration (FDA), respectively. In particular, these LAB typically constitute a diverse group of Gram-positive, nonspore-forming, and nonmotile bacteria, producing lactic acid as major primary metabolite [2]. The use of these microorganisms and their enzymes for bioprocesses development has been extended to several bioactivities in the food industry, by enhancing nutrient bioavailability, providing flavor and texture [3], or converting unsaturated fatty acids into conjugated fatty acids, an example of which includes conjugated linoleic acid (CLA) [4] conversion.

CLA is a collective term used to designate a sequence of positional and geometric isomers of linoleic acid with conjugated double bonds. In particular, cis-9, trans-11 CLA is the major CLA isomer among Lactobacillus species and has been implicated as the most important isomer in terms of biological activity, such as anticarcinogenic [5], atherosclerotic [6], and body fat modulation [7]. A triple-component linoleic acid isomerase consisting of hydratase (CLA-HY), dehydrogenase/oxidoreductase (CLA-DH), and acetoacetate decarboxylase (CLA-DC), produced by Lactobacillus plantarum, was identified to convert linoleic acid into CLA [8].

Our previous analyses on CLA production in fermented milk demonstrated that Lactobacillus delbrueckii subsp. bulgaricus LBP UFSC 2230 is able to synthesize cis-9, trans-11 CLA [4]. In light of this, a genomic analysis was carried out to identify the genes responsible for CLA-HY, CLA-DH, and CLA-DC expression. In this study, we report phenotypic characterizations and the draft genome sequence of L. bulgaricus strain LBP UFSC 2230 isolated from Italian Grana Padano cheese. This strain has demonstrated a potent ability to produce CLA in dairy matrices and thus high biotechnological potential for use in CLA enrichment nourishments. The L. bulgaricus LBP UFSC 2230 strain presented as Gram-positive under optical microscopy, long rod-shaped and showed 2.7 h of generation time at 37 °C in the De Man, Rogosa, and Sharpe (MRS) broth without agitation. Colonies cultivated on MRS agar had a regular plain uncolored morphology with irregular borders.

Material and methods

Bacterial identification and phenotypic characterization

Preliminary microbial identification and phenotypic characterization regarding the metabolism of carbon sources were performed by using API 50CH (BioMérieux, France) (Table 1). The 16S rRNA gene sequence analysis was performed by Macrogen (Korea). Amplicons for the 16S rRNA sequences were generated using universal primers 1401R and 27F [9] on both strands. The consensus sequence of 1110 bp length was compared using the Basic Local Alignment Search Tool (BLAST) on the US National Center for Biotechnology Information (NCBI) Nucleotide Blast (BLASTn).

Table 1.

API 50CH fermentation profile of Lactobacillus delbrueckii subsp. bulgaricus LBP UFSC 2230

Carbon source Result Carbon source Result Carbon source Result
Glycerol D-Mannitol I D-Raffinose
Erythritol D-Sorbitol Methyl-γD-Manopyranoside
D-Arabinose Starch I Methyl-γD-Glucopyranoside
L-Arabinose Glucogen N-Acetylglucosamine +
D-Ribose Xylitol Gentiobiose
D-Xylose Amygdalin D-Turanose
L-Xylose Arbutin + D-Lyxose
D-Adonitol Esculin + Methyl-βD-Xylopyranoside
Salicin D-Tagatose D-Fucose
D-Galactose + D-Cellobiose L-Fucose
D-Glucose + D-Maltose + D-Arabitol
D-Fructose + D-Lactose + L-Arabitol
D-Mannose + D-Melibiose Potassium gluconate
L-Sorbose D-Saccharose + Potassium 2-Ketogluconate
L-Rhamnose D-Trehalose + Potassium 5-Ketogluconate
Dulcitol Inulin Inositol
D-Melezitose

(-) No fermentation up to 48 h of incubation; (+) Fermentation up to 48 h of incubation; (I) inconclusive.

Genome sequencing

Genome sequencing was carried out by WemSeq (Brazil). Genomic DNA was extracted using the UltraClean Tissue & Cells DNA Isolation Kit (Mo Bio, USA), and its integrity was ascertained visually in electrophoretic 1% agarose gel, while DNA purity and quantification were established by using Nanodrop (Thermo Scientific, USA) and Qubit (Invitrogen, USA), respectively. For library preparation, the Nextera XT kit (Illumina) was used according to the manufacturer’s instructions. Samples were sequenced on an Illumina MiSeq with a 500V2 kit 2 × 250 bp paired-end format. Total paired-end reads assembled from sequencing were trimmed for quality and de novo assembled using CLCBio Genomics Workbenchv8.5 (QIAGEN, Denmark) [10]. The average coverage of the genome was about 300×, and automatic annotation of resulting contigs was performed using RAST server [11].

Clustering unrooted tree

The evolutionary relationship was inferred by using the Maximum Likelihood method based on the Tamura-Nei model [12]. The tree with the highest log likelihood (− 17867.84) is shown. Initial tree (s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach and then selecting the topology with superior log likelihood value. The tree was drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 13 nucleotide sequences represented by MLST concatemers for all 13 L. bulgaricus strains depicted in Table 2. Evolutionary analyses were conducted in MEGA X [13].

Table 2.

Complete genome sequences of Lactobacillus delbrueckii species available on GenBank

GenBank BioProject no. Microorganism Strain Genome size (Mb) GC (%) Genes Proteins
PRJNA591475 Lactobacillus delbrueckii TS1-06 1.854 49.8 1928 1567
PRJNA607413 Lactobacillus delbrueckii subsp. bulgaricus LJJ 1.891 49.5 1948 1604
PRJNA545627 Lactobacillus delbrueckii subsp. bulgaricus KLDS1.1011 1.887 49.8 1992 1634
PRJNA304935 Lactobacillus delbrueckii subsp. bulgaricus MN-BM-F01 1.875 49.7 1933 1585
PRJNA491249 Lactobacillus delbrueckii subsp. bulgaricus KLDS1.0207 1.869 49.8 1974 1607
PRJNA354491 Lactobacillus delbrueckii subsp. bulgaricus DSM 20080 1.868 49.8 1942 1564
PRJNA327771 Lactobacillus delbrueckii subsp. bulgaricus ND04 1.862 49.6 1932 1538
PRJEB21527 Lactobacillus delbrueckii subsp. bulgaricus ACA-DC 87 1.856 49.8 1928 1579
PRJNA331039 Lactobacillus delbrueckii subsp. bulgaricus L99 1.848 49.7 1944 1592
PRJNA16120 Lactobacillus delbrueckii subsp. bulgaricus 2038 1.873 49.7 1941 1562
PRJNA16871 Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842 1.865 49.7 1940 1561
PRJNA403 Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 1.857 49.7 1965 1579
PRJNA49147 Lactobacillus delbrueckii subsp. bulgaricus ND02 2.132 49.59 2139 2011
PRJNA615231 Lactobacillus delbrueckii subsp. bulgaricus LBP UFSC 2230 2.009 49.7 2165 2086
PRJDB5979 Lactobacillus delbrueckii subsp. delbrueckii NBRC 3202 1.91 50.1 1930 1653
PRJNA383854 Lactobacillus delbrueckii subsp. delbrueckii TUA4408L 2.012 49.9 1999 1718
PRJNA355324 Lactobacillus delbrueckii subsp. delbrueckii KCTC 13731 1.911 50 1905 1600
PRJNA354490 Lactobacillus delbrueckii subsp. delbrueckii DSM 20074 1.954 49.6 1965 1577
PRJNA355325 Lactobacillus delbrueckii subsp. indicus JCM 15610 2.022 49.37 2066 1939
PRJNA355248 Lactobacillus delbrueckii subsp. jakobsenii DSM 26046 1.892 50.1 1941 1614
PRJNA355327 Lactobacillus delbrueckii subsp. lactis KCCM 34717 2.263 49.1 2270 1905
PRJNA398581 Lactobacillus delbrueckii subsp. lactis KCTC 3034 2.238 49 2240 1889
PRJEB25625 Lactobacillus delbrueckii subsp. lactis 1 2.05 49.6 2072 1694
PRJNA350761 Lactobacillus delbrueckii subsp. lactis KCTC 3035 1.973 50 1982 1697
PRJNA454439 Lactobacillus delbrueckii subsp. lactis NWC_1_2 2.26 48.58 2297 2176
PRJNA398701 Lactobacillus delbrueckii subsp. lactis DSM 20072 2.166 49 2141 1793
PRJNA355244 Lactobacillus delbrueckii subsp. sunkii JCM 17838 2.004 50.1 1975 1726
PRJNA356 Lactobacillus plantarum WCFS1 3.308 44.5 3042 3059

Nucleotide sequence accession numbers

Raw sequencing reads were deposited on the GenBank SRA database under accession no. SRR11741240. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession JABSNS000000000. The version described in this paper is version JABSNS010000000.

Results and discussion

The phenotypic profile was not accurate since carbohydrate usage and fermentation was not fully compliant with the standards discriminated by the API test manufacturer, showing 36% Lactobacillus acidophilus and 27.5% Lactobacillus delbrueckii subsp. delbrueckii identity and T index values of 0.73 and 0.69, respectively. The phenotypic characterization of carbohydrate fermentation is shown in Table 1.

Since phenotypic identification of the strain was not successful, molecular identification was performed by 16S rRNA gene sequencing in which the consensus sequence returned a 98% query cover with 96.22% identity with L. bulgaricus strain LGM2 (GenBank accession no. AY675257.1).

Sequencing genome originated in a total of 4,310,842 paired-end reads that were quality trimmed and assembled into 135 contigs with a total length of 604,745,873 bp and an average GC content of 49.7%. The N50 of contig L. bulgaricus LBP UFSC 2230 genome is 39,590 bp, with the longest contig being 192,104 bp. The automatic annotation of the resulting contigs presented 2165 open reading frames (ORFs), of which 2,086 were protein-coding sequences and 79 were encoding RNAs. According to subsystem statistics of RAST shown in Fig. 1, 26% of features were categorized in subsystems while the majority of the genome could not be related to any subsystem by evidence. In this computation, there are 25 ORFs implicated in the category “Virulence, Disease and Defense”; however, all elements are present in the subcategory “Resistance to antibiotics and toxic compounds” of which only 5 are antibiotic-related.

Fig. 1.

Fig. 1

Subsystem statistics generated by RAST automatic annotation

Subsystem computation by RAST covered only 26% of all present features including non-hypothetical and hypothetical genes in present subsystems. The majority of the features present in the L. bulgaricus LBP UFSC 2230 genome remains with uncertain functionality related to the categories above according to the automatic annotation pipeline. Nowadays (June 2020), there are 27 completed genomes available on the GenBank database from L. delbrueckii, and among them, 12 belong to the bulgaricus subspecies, which were isolated from yogurt or fermented milk, instead of cheese as did LBP UFSC 2230. There are low discrepancies among L. delbrueckii strains concerning GC content, but a number of genes annotated are widely variable between them, varying from 1905 up to 2297 genes (Table 2).

Song et al. [14] studied by using a multi-locus (8 genes) sequence typing (MLST) method, a collection with 298 L. bulgaricus strains obtained from naturally fermented products. The authors observed 106 Sequence Types (STs) grouped then into 6 Lineages (L) and 5 clonal complexes (CC). The genes encoding the following proteins were chosen for analysis: ATP-dependent protease ATP-binding subunit ClpX (clpX), chromosomal replication initiation protein (dnaA), CTP synthetase (pyrG), chaperon in GroEL (groEL), UDP-N-acetylmuramoyl-L-alanyl-D-glutamate-L-lysine ligase (murE), phenylalanyl-tRNAsynthetase subunit alpha (pheS), recombinase A (recA), and DNA-directed RNA polymerase subunit beta (rpoB).

In order to verify similarity between available genomes of bulgaricus subspecies and LBP UFSC 2230 strain studied in the present work, a MLST analysis was also performed using the genes described by Song et al. [14]. The murE gene was excluded from the analysis because it was only present in half of the analyzed genomes. Unrooted phylogenetic trees were constructed from the concatenated MLST sequences (about 11.6 kb) using the maximum-likelihood method (Fig. 2). According to the generated tree, clusters were independent of the region of isolation of the strains (e.g., DSM20080 from Bulgaria and KLDS10207 from China instead of KLDS10207 and KLDS11011 both from China). Being the only strain obtained from cheese, LBP UFSC 2230 isolated in Italy has shown higher similarity to ND02, which was obtained from yogurt and isolated in the Inner Mongolia Autonomous Region.

Fig. 2.

Fig. 2

Clustering MLST concatenated sequences of L. bulgaricus strains with completely sequenced genomes available in GenBank. Circles indicate the type of matrix from where strains were isolated from yogurt (yellow), traditional Chinese dairy product (blue), fermented milk (orange), starter culture (purple), and Grana Padano cheese (grey)

Regarding the biohydrogenation of LA, a possible metabolic pathway involving four enzymes, three of which are in an operon, was described [8]. Among the four genes described by the authors, only the hydratase is present in the sequenced genome of LBP UFSC 2230 strain. Comparing the genomic organization and especially the region related to the locus in where the hydratase gene is placed in LBP UFSC 2230 strain (Fig. 3), leads us to believe that the genomes of the species and subspecies of Lactobacillus tolerate wide rearrangements. Indeed, the same was suggested by Song and co-workers who reported that there has been significant evidence of frequent recombination across the whole subspecies [14].

Fig. 3.

Fig. 3

Genome region comparison among LAB species. a Genome alignment and comparison among Lactobacillus plantarum WCSF1, Lactobacillus delbrueckii subsp. bulgaricus ND02, L. bulgaricus ND04, and L. bulgaricus LBP UFSC 2230 performed using Mauve [15]. Black arrow indicates position of CLA-DH, CLA-DC, and CLA-ER genes. Red arrows indicate position of CLA-HY. b Genome of L. plantarum WCSF1 indicating CLA-HY, CLA-DH, CLA-DC, and CLA-ER positions. Red arrow indicates both enzymes belonging to oleate hydratase superfamily (DH, dehydrogenase; HY, hydratase/dehydratase) while black arrows indicate other classes of enzymes (DC, isomerase; ER, desaturase) involved in linoleic acid biohydrogenation. c Oleate hydratase (red arrows) found in the L. bulgaricus LBP UFSC 2230 strain genome (first genome) compared to Lactobacillus acidophilus NCFM (two regions), Lactobacillus reuteri JCM 1112, and Lactobacillus gasseri ATCC 33323 (two regions) available on RAST database

Conclusion

Metabolic pathway that generates CLA in L. bulgaricus LBP UFSC 2230 remains unclear, once we only identified one gene encoding an enzyme involved in the biohydrogenation of linoleic acid. Further studies will be carried out in order to establish the role of oleate hydratase in CLA production in the strain. The use of LAB as a source of enzymes or as a heterologous host could have significant advantages in the food industry. Understanding these LAB and their enzymes allow for the generation of novel functional products, presenting a large opportunity for research and industry.

Repository GenBank Accession number: BioProject accession number PRJNA615231 (SRA accession number SRR11741240, WGS accession number JABSNS010000000)

Acknowledgements

We are grateful to Dr. Fabienne Ferreira (Department of Microbiology, Immunology and Parasitology, Federal University of Santa Catarina, Brazil), Dr. Harsh Mathur (Teagasc Food Reasearch Centre, Ireland) for useful comments on this manuscript, and Iwona Kozak, MSc (APC Microbiome Ireland, Biosciences Institute, University College Cork, Ireland) for contributing with the raw data GenBank submission.

Funding

This research was supported by the Brazilian federal government agency Coordenação de Aperfeiçoamento Pessoal de Nível Superior (CAPES), financial code 001.

Declarations

Conflict of interest

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Gabriela Christina Kuhl and Ricardo Ruiz Mazzon contributed equally to this work.

References

  • 1.De Dea Lindner J, Bernini V, De Lorentiis A, Pecorari A, Neviani E, Gatti M (2008) Parmigiano Reggiano cheese: evolution of cultivable and total lactic microflora and peptidase activities during manufacture and ripening. Dairy Sci Technol 88(4-5):511–523. 10.1051/dst:2008019
  • 2.Rezaul M, Rakib H, Kabir A, Amanullah SM. Starter cultures used in the production of probiotic dairy products and their potential applications: a review. Chem Biomol Eng. 2017;2(2):83–89. doi: 10.11648/j.cbe.20170202.12. [DOI] [Google Scholar]
  • 3.García-Cano I, Rocha-Mendoza D, Kosmerl E, Zhang L, Jiménez-Flores R. Technically relevant enzymes and proteins produced by LAB suitable for industrial and biological activity. Appl Microbiol Biotechnol. 2020;104(4):1401–1422. doi: 10.1007/s00253-019-10322-2. [DOI] [PubMed] [Google Scholar]
  • 4.Kuhl GC, Gusso AP, Simas Porto BL, Olivera Müller CM, Mazzon RR, Leal de Oliveira MA, Pereira dos Santos Richards NS, De Dea Lindner J (2017) Selection of lactic acid bacteria for the optimized production of sheep’s milk yogurt with a high conjugated linoleic acid content. J Food Res 6(4):44. 10.5539/jfr.v6n4p44
  • 5.Pariza MW, Ha YL. Conjugated dienoic derivatives of linoleic acid: a new class of anticarcinogens. Med Oncol Tumor Pharmacother. 1990;7(2):169–171. doi: 10.1007/BF02988544. [DOI] [PubMed] [Google Scholar]
  • 6.Lee KN, Kritchevsky D, Parizaa MW. Conjugated linoleic acid and atherosclerosis in rabbits. Atherosclerosis. 1994;108(1):19–25. doi: 10.1016/0021-9150(94)90034-5. [DOI] [PubMed] [Google Scholar]
  • 7.Park Y, Pariza MW. Mechanisms of body fat modulation by conjugated linoleic acid (CLA) Food Res Int. 2007;40(3):311–323. doi: 10.1016/j.foodres.2006.11.002. [DOI] [Google Scholar]
  • 8.Kishino S, Takeuchi M, Park S-B, Hirata A, Kitamura N, Kunisawa J, Kiyono H, Iwamoto R, Isobe Y, Arita M, Arai H, Ueda K, Shima J, Takahashi S, Yokozeki K, Shimizu S, Ogawa J. Polyunsaturated fatty acid saturation by gut lactic acid bacteria affecting host lipid composition. Proc Natl Acad Sci. 2013;110(44):17808–17813. doi: 10.1073/pnas.1312937110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Yu J, Zhou XF, Yang SJ, Liu WH, Hu XF. Design and application of specific 16S rDNA-targeted primers for assessing endophytic diversity in Dendrobium officinale using nested PCR-DGGE. Appl Microbiol Biotechnol. 2013;97(22):9825–9836. doi: 10.1007/s00253-013-5294-y. [DOI] [PubMed] [Google Scholar]
  • 10.Baruzzo G, Hayer KE, Kim EJ, DI Camillo B, Fitzgerald GA, Grant GR. Simulation-based comprehensive benchmarking of RNA-seq aligners. Nat Methods. 2017;14(2):135–139. doi: 10.1038/nmeth.4106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Aziz RK, Bartels D, Best A, et al. (2008) The RAST Server: rapid annotations using subsystems technology. BMC Genomics 9(August 2019). 10.1186/1471-2164-9-75 [DOI] [PMC free article] [PubMed]
  • 12.Tamura K, Nei M. <1993-Tamura-Nei.Pdf>. 1993;10(3).
  • 13.Kumar S, Mukherjee MM, Varela MF. Modulation of bacterial multidrug resistance efflux pumps of the major facilitator superfamily. Int J Bacteriol. 2013;2013:1–15. doi: 10.1155/2013/204141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Song Y, Sun Z, Guo C, Wu Y, Liu W, Yu J, Menghe B, Yang R, Zhang H. Genetic diversity and population structure of Lactobacillus delbrueckii subspecies bulgaricus isolated from naturally fermented dairy foods. Sci Rep. 2016;6(February):1–8. doi: 10.1038/srep22704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Darling A, Tritt A, Eisen J, Facciotti M. Mauve assembly metrics. Bioinformatics. 2011;27(19):2756–2757. doi: 10.1093/bioinformatics/btr451. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Brazilian Journal of Microbiology are provided here courtesy of Brazilian Society of Microbiology

RESOURCES