Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2013 Oct 17;1(5):e00869-13. doi: 10.1128/genomeA.00869-13

Draft Genome Sequence of the Extremely Halophilic Bacillus sp. Strain SB49, Isolated from a Salt Crystallizer Pond of the Little Rann of Kutch, India

Kamal Krishna Pal 1,, Rinku Dey 1, Manesh Thomas 1, Dharmesh Sherathia 1, Trupti Dalsania 1, Ilaxi Patel 1, Kinjal Savsani 1, Sucheta Ghorai 1, Sejal Vanpariya 1, Bhoomika Sukhadiya 1, Mona Mandaliya 1, Rupal Rupapara 1, Priya Rawal 1
PMCID: PMC3798458  PMID: 24136852

Abstract

Here we report the draft whole-genome sequence (3.72 Mbp) of Bacillus sp. strain SB49, an extremely halophilic bacterium isolated from a salt crystallizer pond of the Little Rann of Kutch in India. Unraveling the genome of this organism will facilitate understanding and isolation of the genes involved in imparting extreme osmotolerance.

GENOME ANNOUNCEMENT

An extremely halophilic and endospore-forming bacterium, Bacillus sp. strain SB49 (16S rRNA GenBank accession number JF802167), was isolated from a salt crystallizer pond of the Little Rann of Kutch, Gujarat, India. This organism grows optimally at 15% NaCl (range, 0 to 30%) concentration in medium at a temperature of 37°C and pH 7.5 (range, 5 to 9). The genome of Bacillus sp. strain SB49 was sequenced to understand the mechanism(s) of extreme osmotolerance and to isolate the relevant gene(s).

The genome of Bacillus sp. SB49 was sequenced by shotgun as well as mate-paired libraries at Macrogen Inc., South Korea, through Sequencher Tech Pvt. Ltd., Ahmedabad, India. In shotgun sequencing, an average read length of 453 bp was generated from 845,614 reads of 382,673,464 bases. However, average read lengths of 466 bp and 465 bp, respectively, were generated in mate-pair libraries from 155,818 reads of 72,629,111 bases and 152,361 reads of 70,866,553 bases, respectively.

The reads were assembled using GS De Novo Assembler v 2.6 (1). The genome assembly of Bacillus sp. SB49 (G+C content of 46.97%) has approximately 138-fold coverage and contains 12 scaffolds of 3,726,330 bp with an average length of 310,527 bp. The scaffolds consist of 44 contigs of 3,709,667 bp with an average length of 84,310 bp. N50 scaffold lengths of 567,128 bp, with the smallest scaffold of 2,486 bp and the largest scaffold of 1,122,558 bp, were obtained. Similarly, N50 contig lengths of 121,935 bp, with the smallest scaffold contig of 1,919 bp and the largest scaffold contig of 388,308 bp, were obtained. All assembly data were deposited in the DDBJ/EMBL/GenBank nucleotide sequence database.

The draft genome sequence was annotated by use of the RAST server (2), Glimmer 3 (3, 4), GeneMark (5, 6), the KEGG database (7), tRNAScan-SE (8), RNAmmer (9), and Signal P4.1 (10) for identification of subsystems, genes, pathways, RNAs, signal peptides, etc.

Using the various software programs, we predicted 3,907 coding sequences (CDS), with 3,293,355 bp in the CDS. There were 124 RNA-encoding genes (119 tRNA and 5 rRNA genes) and 416 subsystems. Among the CDS, 2,302 are not in a subsystem (948 nonhypothetical and 1,354 hypothetical), whereas 1,605 CDS (1,529 nonhypothetical and 76 hypothetical) are in a subsystem. RAST annotation also revealed associations of 96 genes in stress responses in this organism, including 11 genes involved in osmotic stress, 2 in osmoregulation, 9 in choline and betaine uptake and betaine biosynthesis, 32 in oxidative stress (6 in protection from reactive oxygen species [ROS], 16 in oxidative stress, 2 in glutathione:nonredox reactions, 6 in redox-dependent regulation of nuclear processes, and 2 in glutaredoxins), 3 in cold shock, 15 in heat shock, 9 in detoxification, and 26 in no subcategory. Similarly, 37, 96, and 9 genes associated with serine-glyoxalate, branched-chain amino acids, and glycerol and glycerol-3-phosphate pathways, respectively, have been mapped. Using the Signal P4.1 server, we also predicted 160 signal peptides in this organism. A total of 1,836 CDS were mapped to different biochemical pathways of KEGG (K00003 to K16703, including KEGG orthologs [KOs]).

We are exploring the genome of Bacillus sp. SB49 further to isolate the gene(s) involved in imparting extreme osmotolerance and the ability to sustain life in the salt pans of the Rann of Kutch, India.

Nucleotide sequence accession numbers.

This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number ATWS00000000. The version described in this paper is version ATWS01000000.

ACKNOWLEDGMENTS

The work was carried out in the subproject “Exploring bacterial and archaebacterial diversity in Kutch eco-region of Gujarat for agricultural and industrial applications” under Application of Microorganisms in Agriculture and Allied Sectors (AMAAS) of the Indian Council of Agricultural Research (ICAR). We thank ICAR for funding through AMAAS.

We also thank the directors of the Directorate of Groundnut Research, Junagadh, and NBAIM, Mau, for help during the course of this study. We thank Macrogen Inc., South Korea, and Sequencher Tech Pvt. Ltd., Ahmedabad, India, for the genomic services provided.

Footnotes

Citation Pal KK, Dey R, Thomas M, Sherathia D, Dalsania T, Patel I, Savsani K, Ghorai S, Vanpariya S, Sukhadiya B, Mandaliya M, Rupapara R, Rawal P. 2013. Draft genome sequence of the extremely halophilic Bacillus sp. strain SB49, isolated from a salt crystallizer pond of the Little Rann of Kutch, India. Genome Announc. 1(5):e00869-13. doi:10.1128/genomeA.00869-13.

REFERENCES

  • 1. 454 Life Sciences Corporation 2011. 454 sequencing system software manual, version 2.6, p 1–288 454 Life Sciences Corporation, Branford, CT [Google Scholar]
  • 2. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with glimmer. Bioinformatics 23:673–679 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Salzberg SL, Delcher AL, Kasif S, White O. 1998. Microbial gene identification using interpolated Markov models. Nucleic Acids Res. 26:544–548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Besemer J, Lomsadze A, Borodovsky M. 2001. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 29:2607–2618 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Lukashin AV, Borodovsky M. 1998. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res. 26:1107–1115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. 2004. The KEGG resource for deciphering the genome. Nucleic Acids Res. 32:277–280 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Petersen TN, Brunak S, von Heijne G, Nielsen H. 2011. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat. Methods 8:785–786 [DOI] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES