Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2015 Jul 30;3(4):e00821-15. doi: 10.1128/genomeA.00821-15

Draft Genome Sequence of Lactobacillus sp. Strain TCF032-E4, Isolated from Fermented Radish

Yuejian Mao a,, Meng Chen a, Philippe Horvath b
PMCID: PMC4520894  PMID: 26227596

Abstract

Here, we report the draft genome sequence of Lactobacillus sp. strain TCF032-E4 (= CCTCC AB2015090 = DSM 100358), isolated from a Chinese fermented radish. The total length of the 57 contigs is about 2.9 Mb, with a G+C content of 43.5 mol% and 2,797 predicted coding sequences (CDSs).

GENOME ANNOUNCEMENT

Lactobacillus sp. strain TCF032-E4 was isolated in 2013 from a homemade fermented white radish (brine pH 3.7) originating from Guilin, Guangxi Province, China. The 16S rRNA gene sequence of this strain shared high similarity with that of Lactobacillus plantarum subsp. plantarum ATCC 14917T (99.2%), Lactobacillus pentosus NCDO 363T (99.2%), and Lactobacillus paraplantarum CNRZ 1885T (99.1%). However, an abnormal amplification pattern (no product) was obtained with the recA multiplex PCR, a method described by Torriani et al. (1) for the differentiation of L. plantarum, L. paraplantarum, and L. pentosus species, which share highly similar 16S rRNA genes. TCF032-E4 also showed a different growth temperature range (no growth at temperatures of >32°C) and sugar fermentation pattern (e.g., no sucrose utilization). These distinguishing phenotypic characters motivated a deeper analysis of TCF032-E4 through whole-genome sequencing.

The genome of Lactobacillus sp. TCF032-E4 was sequenced by Majorbio Biopharm Technology Co., Ltd. (Shanghai, China) using an Illumina HiSeq 2500 platform. A paired-end library with a DNA insert length of 300 bp was generated, and 11,896,210 reads with a mean length of 101 nucleotides (nt) were obtained, representing ~413-fold genome coverage. Velvet 1.2.10 (2) was used to assemble the reads using a k-mer value of 31 (-ins_length 300, −cov_cutoff 50, -exp_cov 410, -min_contig_lgth 500). The resulting draft genome sequence comprised 57 contigs >500 bp in length, with a maximum contig length of 279,565 bp. The total length of all contigs is 2,904,660 bp, with a G+C content of 43.5 mol%. The contigs were ordered with Mauve 2.3.1 (3) using the complete genome of L. plantarum WCFS1 (4) (accession no. AL935263) as the reference, which shared 99% 16S rRNA gene similarity with TCF032-E4. The reordered draft genome sequence of TCF032-E4 was annotated with RAST (5), a fully automated online service for genome annotation. In total, 2,797 protein-coding sequences (CDS) were predicted, with an average length of 867 bp, representing a coding density of 83.5%. A total of 792 CDSs (28.3%) were identified to code for hypothetical proteins, while 1,225 CDS (44%) were classified into the RAST subsystems. Fifty-four tRNAs and one putative bacteriocin gene were predicted. Two intact prophages (45.3 kb and 34.4 kb) were detected by PHAST (6). No clustered regularly interspaced short palindromic repeat (CRISPR) array was detected using CRISPRFinder (7).

Nucleotide sequence accession numbers.

The strain is publicly available from two collections under references CCTCC AB2015090 and DSM 100358. This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession no. LFEE00000000. The version described in this paper is version LFEE01000000.

ACKNOWLEDGMENT

This work was funded by DuPont Nutrition & Health.

Footnotes

Citation Mao Y, Chen M, Horvath P. 2015. Draft genome sequence of Lactobacillus sp. strain TCF032-E4, isolated from fermented radish. Genome Announc 3(4):e00821-15. doi:10.1128/genomeA.00821-15.

REFERENCES

  • 1.Torriani S, Felis GE, Dellaglio F. 2001. Differentiation of Lactobacillus plantarum, L. pentosus, and L. paraplantarum by recA gene sequence analysis and multiplex PCR assay with recA gene-derived primers. Appl Environ Microbiol 67:3450–3454. doi: 10.1128/AEM.67.8.3450-3454.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Darling AE, Mau B, Perna NT. 2010. progressiveMauve: multiple genome alignment with gene gain, loss, and rearrangement. PLoS One 5:e11147. doi: 10.1371/journal.pone.0011147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kleerebezem M, Boekhorst J, van Kranenburg R, Molenaar D, Kuipers OP, Leer R, Tarchini R, Peters SA, Sandbrink HM, Fiers MW, Stiekema W, Lankhorst RM, Bron PA, Hoffer SM, Groot MN, Kerkhoven R, de Vries M, Ursing B, de Vos WM, Siezen RJ. 2003. Complete genome sequence of Lactobacillus plantarum WCFS1. Proc Natl Acad Sci U S A 100:1990–1995. doi: 10.1073/pnas.0337704100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M, Vonstein V, Wattam AR, Xia F, Stevens R. 2014. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res 42:D206–D214. doi: 10.1093/nar/gkt1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. 2011. PHAST: a fast phage search tool. Nucleic Acids Res 39:W347–W352. doi: 10.1093/nar/gkr485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Grissa I, Vergnaud G, Pourcel C. 2007. CRISPRFinder: a Web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res 35:W52–W57. doi: 10.1093/nar/gkm360. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES