Skip to main content
Journal of Bacteriology logoLink to Journal of Bacteriology
. 2011 Sep;193(18):5023. doi: 10.1128/JB.05602-11

Draft Genome Sequence of Lactobacillus zeae KCTC 3804

Dong-Wook Kim 1,, Sang-Haeng Choi 1,, Aram Kang 1,2, Seong-Hyeuk Nam 1, Dae-Soo Kim 1, Ryong Nam Kim 1, Aeri Kim 1,2, Hong-Seog Park 1,2,*
PMCID: PMC3165655  PMID: 21868802

Abstract

We announce the draft genome sequence of the type strain Lactobacillus zeae KCTC 3804 (3,110,326 bp, with a G+C content of 47.8%), which is one of the most prevalent lactic acid bacteria present during the processing of raw cow's milk. The genome consists of 113 large contigs (>100 bp). All of the contigs were assembled by Newbler Assembler 2.3 (454 Life Science).

GENOME ANNOUNCEMENT

Cow's milk has all 8 essential amino acids in varying amounts, depending on the stage of lactation. About 80% of the proteins in milk are caseins, which are reasonably heat stable and, for the most part, easy to digest. The remaining 20% or so are classed as whey proteins, many of which have important physiological effects (bioactivity). Also easy to digest, but very heat sensitive (6), these include key enzymes (specialized proteins) and enzyme inhibitors, immunoglobulins (antibodies) (10), metal-binding proteins, vitamin-binding proteins, and several growth factors.

Our laboratory received the strain Lactobacillus zeae KCTC 3804, which is known to be present in raw cow's milk (2), from the Korean Collection for Type Cultures (KCTC), and it was grown under standard conditions (lactobacillus MRS broth [Difco 0881], 30°C, and 200 rpm). The genomic DNA was extracted using the alkaline lysis method (3) from the cultured bacteria. We then sequenced the genome of L. zeae KCTC 3804; genome sequencing of this organism had not been completed or initiated when our sequencing project was begun, according to the Genomes OnLine Database (GOLD) (8).

Here we report the genome sequence of L. zeae KCTC 3804, which was obtained using a whole-genome shotgun strategy (5) using Roche 454 GS (FLX Titanium) pyrosequencing (507,651 reads totaling ∼67.9 Mb; ∼26-fold coverage of the genome) at the Genome Resource Center, KRIBB (Korea Research Institute of Bioscience and Biotechnology). Genome sequences from pyrosequencing were processed by Roche's software according to the manufacturer's instructions. All of the reads were assembled using the Newbler assembler 2.3 (454 Life Science), which generated 113 large contigs (>100 bp). The annotation was done by merging the results obtained from the RAST (Rapid Annotation using Subsystem Technology) server (1), the Glimmer 3.02 modeling software package (4), tRNAscan-SE 1.21 (9), and RNAmmer 1.2 (7). In addition, the contigs were searched against the KEGG, UniProt, and Clusters of Orthologous Groups (COG) databases to annotate the gene descriptions. The G+C content (moles percent) was calculated using the genome sequence. The DNA base composition was calculated using Java.

The uncompleted draft genome includes 3,110,326 bases and is comprised of 2,965 predicted coding sequences (CDSs), with a G+C content of 47.8%. There are single predicted copies of the 5S, 16S, and 23S rRNA genes and 57 predicted tRNAs. There are 310 subsystems represented in the genome, and we used this information to reconstruct the metabolic network (determined using the RAST server). There are many carbohydrate subsystem features, including genes involved in metabolism of mono-, di-, and oligosaccharides, central carbohydrate metabolism, B, and fermentation. In particular, the ability to ferment cow's milk is a special feature of this strain.

Nucleotide sequence accession numbers.

The draft genome sequence of Lactobacillus zeae KCTC 3804 is available in GenBank under the accession numbers BACQ01000001 to BACQ101000113.

Acknowledgments

This work was supported by grant 2009-0084206 from the Ministry of Education, Science and Technology.

We thank Kun-Hyang Park and Min-Young Kim for their work in sequencing and assembling the genome, respectively.

REFERENCES

  • 1. Aziz R. K., et al. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Bae J. W., et al. 2005. Development and evaluation of genome-probing microarrays for monitoring lactic acid bacteria. Appl. Environ. Microbiol. 71:8825–8835 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Birnboim H. C., Doly J. 1979. A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucleic Acids Res. 7:1513–1523 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Delcher A. L., Bratke K. A., Powers E. C., Salzberg S. L. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Fleischmann R. D., et al. 1995. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269:496–512 [DOI] [PubMed] [Google Scholar]
  • 6. Labropoulos A. E., Palmer J. K., Lopez A. 2007. Whey protein denaturation of UHT processed milk and its effect on rheology of yogurt. J. Texture Studies 12(3):365–374 [Google Scholar]
  • 7. Lagesen K., et al. 2007. RNAmmer: consistent and rapid annotation of rRNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Liolios K., et al. 2010. The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata. Nucleic Acids Res. 38:D346–D354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Lowe T. M., Eddy S. R. 1997. tRNAscan-SE: a program for improved detection of tRNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Ustunol Z., Sypien C. 2006. Heat stability of bovine milk immunoglobulins and their ability to bind lactococci as determined by an ELISA. J. Food Sci. 62(6):1218–1222 [Google Scholar]

Articles from Journal of Bacteriology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES