Skip to main content
Journal of Bacteriology logoLink to Journal of Bacteriology
. 2012 Nov;194(22):6294–6295. doi: 10.1128/JB.01380-12

Genome Sequence of the Thermophile Bacillus coagulans Hammer, the Type Strain of the Species

Fei Su 1, Fei Tao 1, Hongzhi Tang 1, Ping Xu 1,
PMCID: PMC3486356  PMID: 23105047

Abstract

Here we announce a 3.0-Mb assembly of the Bacillus coagulans Hammer strain, which is the type strain of the species within the genus Bacillus. Genomic analyses based on the sequence may provide insights into the phylogeny of the species and help to elucidate characteristics of the poorly studied strains of Bacillus coagulans.

GENOME ANNOUNCEMENT

Bacillus coagulans is known to be a slightly acidophilic, thermotolerant, and safe species, and it is also an economically important species that is frequently involved in the production of high concentrations of optically pure lactic acid (11, 12), coagulin (7), and other thermostable enzymes. Various morphologies of the cells, spore surfaces, and sporangia have been reported in previous studies; however, these morphologies have complicated the taxonomy of the species (4). B. coagulans Hammer (ATCC 7050; DSM1) was first described as the type strain of B. coagulans in 1915 by Hammer, who isolated this organism from spoiled canned milk (4). It has been involved in many phylogenetic studies as a reference strain, but little genetic information for this strain is available (3, 9, 14). High-throughput sequence technologies for DNA sequencing have enabled genome-scale phylogenetic studies, which may resolve the incongruences in phylogeny (13). Here we present the whole-genome sequence of this strain; the genomic-level phylogenetic analysis may help us to clarify the diversity of B. coagulans strains and elucidate the taxonomic status of the species.

The whole genome of B. coagulans Hammer was sequenced using the Illumina HiSeq2000 system; sequencing was performed by the Chinese National Human Genome Center at Shanghai. A total of 10.9 million high-quality-read pairs (2 × 100 bp), which achieved a more-than-200-fold coverage of the genome, were produced for de novo assembly by using the VELVET program (version 1.2.03) (16). The protein coding sequences (CDSs) in this genome were predicted using the RAST (rapid annotation using subsystem technology) server (1). Genes encoding tRNAs and rRNAs were identified by using tRNAscan-SE (8) and RNAmmer (6), respectively. In addition, genes were searched against the KEGG (10), UniProt (2), and COG (Clusters of Orthologous Groups) (15) databases to annotate the gene description. To detect the CRISPR clusters, the genome sequence was submitted to the CRISPRFinder program (5).

The draft genome sequence is comprised of 307 assembled contigs with a total length of 3,018,045 bp (N50, 35,029 bp), with a G+C content of 47.2%. About 3,437 CDSs (average length, 750 bp) and 82 tRNA and 10 rRNA genes were predicted. There were 412 subsystems determined by RAST, of which the system of carbohydrate utilization contained 366 proteins. According to the annotation, B. coagulans Hammer is predicted to possess complete metabolic pathways, including glycolysis, the tricarboxylic acid cycle, and the pentose phosphate pathway. However, the absence of the gene encoding xylose isomerase, which initializes the utilization of xylose, prevents B. coagulans Hammer from producing lactic acid from low-cost xylose. The genome sequence contains a series of membrane transport systems, including 20 CDSs involved in ATP-binding cassette transporters and 22 CDSs involved in protein translocation, but there is not a complete phosphotransferase system (PTS). The genome also contains about 10 CRISPR-associated proteins, as well as four CRISPR repeats, which may constitute a defense mechanism that allows the host bacterium to survive exposure to foreign genetic elements.

Nucleotide sequence accession numbers.

The results of this whole-genome shotgun project have been deposited at DDBJ/EMBL/GenBank under the accession number ALAS00000000. The version described in this paper is the first version, ALAS01000000.

ACKNOWLEDGMENTS

We acknowledge Huajun Zheng for performance of genome sequencing and analysis at the Chinese National Human Genome Center at Shanghai.

This work was partly supported by the Chinese National Programs for High Technology Research and Development (2011AA02A202 and 2011AA02A207) and the National Natural Science Foundation of China (30821005).

REFERENCES

  • 1. Aziz R, et al. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75 doi:10.1186/1471-2164-9-75 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Bairoch A, et al. 2005. The Universal Protein Resource (UniProt). Nucleic Acids Res. 33:D154–D159 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Becker ME, Pederson CS. 1950. The physiological characters of Bacillus coagulans (Bacillus thermoacidurans). J. Bacteriol. 59:717–725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. De Clerck E, et al. 2004. Polyphasic characterization of Bacillus coagulans strains, illustrating heterogeneity within this species, and emended description of the species. Syst. Appl. Microbiol. 27:50–60 [DOI] [PubMed] [Google Scholar]
  • 5. Grissa I, Vergnaud G, Pourcel C. 2007. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 35:W52–W57 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Lagesen K, et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Le Marrec C, Hyronimus B, Bressollier P, Verneuil B, Urdaci MC. 2000. Biochemical and genetic characterization of coagulin, a new antilisterial bacteriocin in the pediocin family of bacteriocins, produced by Bacillus coagulans I4. Appl. Environ. Microbiol. 66:5213–5220 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Nakamura L, Blumenstock I, Claus D. 1988. Taxonomic study of Bacillus coagulans Hammer 1915 with a proposal for Bacillus smithii sp. nov. Int. J. Syst. Evol. Microbiol. 38:63 [Google Scholar]
  • 10. Ogata H, Goto S, Fujibuchi W, Kanehisa M. 1998. Computation with the KEGG pathway database. Biosystems 47:119–128 [DOI] [PubMed] [Google Scholar]
  • 11. Patel MA, et al. 2006. Isolation and characterization of acid-tolerant, thermophilic bacteria for effective fermentation of biomass-derived sugars to lactic acid. Appl. Environ. Microbiol. 72:3228–3235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Qin J, et al. 2009. Non-sterilized fermentative production of polymer-grade L-lactic acid by a newly isolated thermophilic strain Bacillus sp. 2-6. PLoS One 4:e4359 doi:10.1371/journal.pone.0004539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Rokas A, Williams BL, King N, Carroll SB. 2003. Genome-scale approaches to resolving incongruence in molecular phylogenies. Nature 425:798–804 [DOI] [PubMed] [Google Scholar]
  • 14. Seki T, Chung C, Mikami H, Oshima Y. 1978. Deoxyribonucleic acid homology and taxonomy of the genus Bacillus. Int. J. Syst. Evol. Microbiol. 28:182 [Google Scholar]
  • 15. Tatusov R, et al. 2003. The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4:41 doi:10.1186/1471-2105-4-41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18:821–829 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Bacteriology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES