Abstract
Paenibacillus sp. strain ICGEB2008 (MTCC 5639) is a Gram-positive cellulolytic bacterium, isolated from the gut of Helicoverpa armigera. Here, we report the draft genome sequence of Paenibacillus sp. ICGEB2008. The annotation of the ~5.7-Mb sequence indicated a cluster of genes related to the glycosyl hydrolase family and the butanediol biosynthesis pathway.
GENOME ANNOUNCEMENT
The cellulosic biomass is the most abundant biopolymer on earth and is a promising alternative to fossil fuels. However, the absence of effective enzyme systems for the hydrolysis of cellulosic biomass has become a major hurdle in the process of lignocellulosic biofuel production. Recently, focus has been given towards identifying efficient hydrolytic enzymes from cellulolytic microbes residing in the gut of insects feeding on lignocellulosic biomass. We isolated a novel microbe from the gut of the cotton bollworm, Helicoverpa armigera, and characterized its potential to produce various cellulolytic enzymes (1). The strain, named Paenibacillus sp. ICGEB2008 (MTCC 5639), is a rod-shape facultative anaerobic bacterium. It produces enzymes for the efficient hydrolysis of lignocellulosic biomass into monomeric fermentable sugars (1, 2) and has an inherent capability to synthesize ethanol and butanediol as major fermentation products (unpublished data); this makes it an attractive host for use in the biofuel production process.
The genome of strain ICGEB2008 was sequenced using an Ion Torrent Personal Genome Machine (PGM) instrument (603.22 Mb; 111-fold coverage). De novo assembly using MIRA 3.4.0 (a de novo assembler) produced 79 contigs totaling 5,691,612 bp (45.5% G+C content) with an N50 of 283,871 bp and the maximum contig size of 910,143 bp. The genome annotation was performed using the RAST server (http://rast.nmpdr.org/) and the output was downloaded in GenBank format (3). Among the predicted 5,153 protein-coding genes, 70% have been assigned putative functions according to the subsystem categorization. A total of 110 tRNA genes encompassing all 20 amino acids were identified using the tRNAscan-SE program (4).
The complete gene clusters coding for cellulolytic genes were predicted from the genome sequence. The genes for cellulose degradation, such as endocellulase and endoxylanase, were present in the contigs that were placed far apart in the genome, but the disaccharidases and specific transporters of such disaccharides were found to be present in the same operon. Few glycosyl hydrolases were predicted to have multiple functional domains for cellulase, xylanase, lichenase, or mannanase activities.
Genes involved in the syntheses of ethanol, 2,3-butanediol, and several other important extracellular metabolites were also identified. 2,3-Butanediol is a precursor for a variety of chemical feedstocks and liquid fuels. The genome sequencing data helped in predicting the pathway involved in the microbial production of butanediol. The genes involved in the 2,3-butanediol pathway coding for alpha-acetolactate decarboxylase, alpha-acetolactate synthase, and butanediol dehydrogenase were shown to be located in two operons, in contrast to a previous report for Klebsiella and Enterobacter, where all the genes were found to be present in a single operon (5).
A whole-genome comparison with the two completely sequenced Paenibacillus polymyxa genomes revealed the close relatedness between the P. polymyxa M1 and SC2 strains. The genome information provided here will allow for the genetic manipulation of Paenibacillus sp. ICGEB2008 for enhanced biofuel and chemical productions.
Nucleotide sequence accession numbers.
This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession no. AMQU00000000. The version described in this article is the first version, accession no. AMQU01000000.
ACKNOWLEDGMENT
We thank Invitrogen Bioservices India Pvt Ltd. for assistance with sequencing and analyses.
This work was supported financially by the Department of Biotechnology of India.
Footnotes
Citation Adlakha N, Ritturaj Kushwaha H, Rajagopal R, Yazdani SS. 2013. Draft genome sequence of the Paenibacillus sp. strain ICGEB2008 (MTCC 5639) isolated from the gut of Helicoverpa armigera. Genome Announc. 1(1):e00026-12. doi:10.1128/genomeA.00026-12.
REFERENCES
- 1. Adlakha N, Rajagopal R, Kumar S, Reddy VS, Shams Yazdani S. 2011. Synthesis and characterization of chimeric proteins based on cellulase and xylanase from an insect gut bacterium. Appl. Environ. Microbiol. 77:4859–4866 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Adlakha N, Sawant S, Anil A, Lali A, Shams Yazdani S. 2012. Specific fusion of 1,4-β-endoglucanase and 1,4-β-glucosidase enhances the cellulolytic activity and helps in channeling of the intermediates. Appl. Environ. Microbiol. 78:7447–7454 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Blomqvist K, Nikkola M, Lehtovaara P, Suihko ML, Airaksinen U, Stråby KB, Knowles JK, Penttilä ME. 1993. Characterization of the genes of the 2,3-butanediol operons from Klebsiella terrigena and Enterobacter aerogenes. J. Bacteriol. 175:1392–1404 [DOI] [PMC free article] [PubMed] [Google Scholar]
