Abstract
We announce the genome sequence of one of the most prevalent lactic acid bacteria present during the manufacturing process of cane juice, the type strain Leuconostoc pseudomesenteroides KCTC 3652 (3,244,985 bp, with a G+C content of 38.3%), which consists of 1,160 large contigs (>100 bp in size). All of the contigs were assembled by the Newbler Assembler 2.3 software program (454 Life Sciences).
GENOME ANNOUNCEMENT
Cane juice is the juice extracted from pressed sugarcane. It is consumed as a beverage worldwide and especially in regions where sugarcane is commercially grown, such as Southeast Asia, South Asia, and Latin America. The history of cane juice dates back thousands of years, running parallel to the history of sugar. Evaporated cane juice is chemically made up of a combination of sugars, such as fructose, glucose, and sucrose.
Our laboratory received the Leuconostoc pseudomesenteroides KCTC 3652 strain, which is known to be present in cane juice (2), from the Korean Collection for Type Cultures (KCTC), and it was grown under standard conditions (Lactobacilli MRS broth, 0881; Difco), at 30°C and 200 rpm). The genomic DNA was extracted from the cultured bacteria using the alkaline lysis method (3). We then sequenced the genome of Leuconostoc pseudomesenteroides KCTC 3652; genome sequencing of this organism had not been completed or initiated when our sequencing project was begun, according to the Genomes OnLine Database (GOLD) (7).
Here we report the genome sequence of Leuconostoc pseudomesenteroides KCTC 3652, obtained using a whole-genome shotgun strategy (5) using Roche 454 GS (FLX Titanium) pyrosequencing (190,210 reads, totaling ∼68 Mb; ∼21-fold coverage of the genome) at the Genome Resource Center, KRIBB (Korea Research Institute of Bioscience and Biotechnology). Genome sequences from pyrosequencing were processed by Roche's software according to the manufacturer's instructions. All of the reads were assembled using the Newbler Assembler 2.3 software program (454 Life Science), which generated 1,160 large contigs (>100 bp). The annotation was done by merging the results obtained from the RAST (Rapid Annotation using Subsystem Technology) server (1), the Glimmer 3.02 modeling software package (4), and the tRNAscan-SE 1.21 (8) and RNAmmer 1.2 (6) servers. In addition, the contigs were searched against the KEGG, UniProt, and Clusters of Orthologous Groups (COG) databases to annotate the gene descriptions. The G+C mole percent measurements were calculated using the genome sequence. The DNA base composition was calculated using a Java programme developed.
The uncompleted draft genome includes 3,244,985 bases and is comprised of 3,451 predicted coding sequences (CDSs), with a G+C content of 38.3%. There are single predicted copies of the 5S, 16S, and 23S rRNA genes and 56 predicted tRNAs. There are 264 subsystems that are represented in the genome, and we used this information to reconstruct the metabolic network (determined using the RAST server). There are many carbohydrate subsystem features, including genes involved in di- and oligosaccharides, central carbohydrate metabolism, monosaccharides, and sugar alcohols. The results from the study of Leuconostoc pseudomesenteroides derived from cane juice help to explain the rich characteristics of sugar cane.
Nucleotide sequence accession number.
The draft genome sequence of Leuconostoc pseudomesenteroides KCTC 3652 is available in GenBank under the accession number AEOQ00000001 to AEOQ00001160.
Acknowledgments
This work was supported by grant 2009-0084206 from the Ministry of Education, Science and Technology.
We thank Kun-Hyang Park and Min-Young Kim for their work in sequencing and assembling the genome, respectively.
Footnotes
Published ahead of print on 24 June 2011.
REFERENCES
- 1. Aziz R. K., et al. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Bae J. W., et al. 2005. Development and evaluation of genome-probing microarrays for monitoring lactic acid bacteria. Appl. Environ. Microbiol. 71:8825–8835 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Birnboim H. C., Doly J. 1979. A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucleic Acids Res. 7:1513–1523 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Delcher A. L., Bratke K. A., Powers E. C., Salzberg S. L. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Fleischmann R. D., et al. 1995. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269:496–512 [DOI] [PubMed] [Google Scholar]
- 6. Lagesen K., et al. 2007. RNAmmer: consistent and rapid annotation of rRNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Liolios K., et al. 2010. The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res. 38:D346–D354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Lowe T. M., Eddy S. R. 1997. tRNAscan-SE: a program for improved detection of tRNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
