Skip to main content

Some NLM-NCBI services and products are experiencing heavy traffic, which may affect performance and availability. We apologize for the inconvenience and appreciate your patience. For assistance, please contact our Help Desk at info@ncbi.nlm.nih.gov.

Genomics Insights logoLink to Genomics Insights
. 2013 Aug 18;6:25–36. doi: 10.4137/GEI.S12732

Comparative Genomics of Bacillus species and its Relevance in Industrial Microbiology

Archana Sharma 1, T Satyanarayana 1,
PMCID: PMC4510601  PMID: 26217108

Abstract

With the advent of high throughput sequencing platforms and relevant analytical tools, the rate of microbial genome sequencing has accelerated which has in turn led to better understanding of microbial molecular biology and genetics. The complete genome sequences of important industrial organisms provide opportunities for human health, industry, and the environment. Bacillus species are the dominant workhorses in industrial fermentations. Today, genome sequences of several Bacillus species are available, and comparative genomics of this genus helps in understanding their physiology, biochemistry, and genetics. The genomes of these bacterial species are the sources of many industrially important enzymes and antibiotics and, therefore, provide an opportunity to tailor enzymes with desired properties to suit a wide range of applications. A comparative account of strengths and weaknesses of the different sequencing platforms are also highlighted in the review.

Keywords: Bacillus, genome sequencing, comparative genomics, sequencing platforms, industrial enzymes

Introduction

Whole genome sequencing is a powerful approach for gaining the reference sequence information of multiple organisms. It helps in identifying genomic variations associated with phenotypes. Whole genome sequencing enables the identification of individual nodes of multi-component genetic interaction networks simultaneously, and also helps in mapping the evolutionary pathways that can support the growth of a genetically compromised strain.

Whole genome sequences of industrially important organisms and bacterial pathogens provide prospectives for industry, human health, and the environment. A wide diversity is present in bacterial genomes. Most of the bacterial proteins are highly or moderately conserved in evolution as extensive gene shuffling occurs in a few conserved gene arrays in distantly related bacteria. The outcome of whole genome sequencing has revealed new insights into the evolution of bacterial lifestyles including strategies for adaptation to new niches and overcoming competitors.

Bacillus is the most important workhorse among industrial microorganisms. They have ancient history of more than thousand years since the production of natto (a traditional Japanese dish of fermented soya beans) using B. subtilis, and their roles are continuously expanding and evolving. Bacillus species are considered as generally regarded as safe (GRAS) in the list of Food and Drug Administration (FDA). They are known to secrete copious amounts of extracellular enzymes and have high growth rates and short fermentation cycles and therefore these are one of the most important industrial enzyme producers.15 For further development and exploitation of these bacteria in industrial processes, the complete genomes of several species of Bacillus have been sequenced in order to understand their biochemistry, physiology, and genetics. B. subtilis is the first gram positive bacterium whose genome was sequenced and since then, the genomes of several Bacillus species have been sequenced.

Strategies to Sequence Complete Bacterial Genomes

The random shotgun sequencing method was developed and perfected for prokaryotic genomes which are smaller in size and contain less repetitive DNA. Various methods are used to fragment DNA including nebulization, ultrasonication, enzymatic fragmentation, and hydrodynamic shearing. After fragmentation, adaptors are ligated to these fragments. Template amplification is the next step and most of the platforms support mate pair sequencing, in which the ends of DNA fragments of size 3–20 kb are connected to form circular molecules. These molecules are then again fragmented, and the fragments flanking the joints are then selected and the end adaptors are added. Such sequencing offers significant information about the location of sequences scattered across the genome, which aids in assembling the genome.6 Another platform is paired end sequencing which is similar to mate pair sequencing however in the former DNA fragments are sequenced from each end with no requirement for additional library preparation steps. The pictorial representation of the whole genome sequencing is summarized in Figure 1.

Figure 1.

Figure 1

Pictorial representation of whole genome sequencing.

In 1977, Frederick Sanger introduced DNA sequencing technology, based on chain-termination method (also known as Sanger sequencing). Walter Gilbert developed another sequencing technology which relies on chemical modification of DNA and successive cleavage at specific bases. Sanger sequencing was accepted as the primary technology in the ‘first generation’ of laboratory and commercial sequencing applications on the basis of its high efficiency and low radioactivity. During that time, DNA sequencing was tedious and radioactive materials were required. The first automatic sequencing machine (AB370) was launched by Applied Biosystems in 1987, with capillary electrophoresis making the sequencing faster and more accurate. With the advent of new informatic approaches, holistic integration, and significant analysis of the genome sequence data, the sequencing has become easy and large information can be extracted from the complete genome. A variety of sequencing methods, softwares for the assembly of contigs, and annotation of gene are employed for sequencing the genome. The next generation sequencing (NGS) reads millions of fragments in a single run. It offers several advantages such as extensive parallel sequencing of shotgun libraries, high throughput with maximum genome coverage, and cost effectiveness. The most popular NGS platform includes (Roche 454: GS FLX, XL+; Illumina: Genome analyzer, HiSeq2000; Life Technologies: Ion Torrent, SOLiD, Pacific Biosciences: Pac Bio RS) sequencing. Illumina and Roche are the most commonly used platforms for sequencing genomes of Bacillus spp. The comparison of various sequencing platforms is depicted in Table 1.

Table 1.

Technical specifications of some important NGS platforms.

Platforms Illumina MiSeq Ion Torrent PGM PacBio RS Illumina GAIIx Illumina HiSeq 2000
Chemistry Reversible terminator Proton detection Real time fluorescence DNA polymerization Reversible terminator Reversible terminator
Instrument Cost* ($) 128 K 80 K (includes costing of PGM, server, OneTouch and OneTouch ES) 695 K 256 K 654 K
Sequence yield per run 1.5–2 Gb 20–50 Mb on 314 chip 100–200 Mb on 316 chip 1 Gb on 318 chip 100 Mb 30 Gb 600 Gb
Sequencing cost ($)per Gb* 502 1000 (318 chip) 2000 148 41
Run Time 27 h (along with 2 h cluster generation) 2 h 2 h 10 days 11 days
Reported Accuracy Mostly > Q30 Mostly Q20 <Q10 Mostly > Q30 Mostly > Q30
Observed Raw Error Rate (%) 0.80 1.71 12.86 0.76 0.26
Read length (bases) Up to 150 ~200 Average 1500** Up to 150 Up to 150
Correct SNp calls (%) 76 ~82 ~71 70 69
Paired reads Yes Yes No Yes Yes
Insert size Up to 700 bases Up to 250 bases Up to 10 kb Up to 700 bases Up to 700 bases
Typical DNA Requirements 50–1000 ng 100–1000 ng ~1 μg 50–1000 ng 50–1000 ng
*

All cost calculations are based on list price quotations obtained from the manufacturer and assume expected sequence yield.

**

Mean mapped read length (adapter and reverse strand sequences). Subread lengths, i.e. the individual stretches of sequence from the sequenced fragment, are significantly shorter.

Adapted from Quail et al.52

The first bacterial genome project started in 1991 for sequencing the complete Escherichia coli genome.7 Since then, genome projects have been started on a number of bacteria, archaea, and eukaryotes. Bacillus is one of the best described genera in the genomic database. According to Alcaraz et al8 more than 108 complete and draft genome sequences of Bacillus are available. The extensive research on Bacilli includes classical microbiology, biochemistry, and modern genomic and proteomic approaches. The details of genomes of various Bacillus spp. and their comparative analyses are highlighted in Tables 2 and 3. The dendrogram drawn using 16S rDNA sequences suggests close phylogenetic relationship among industrially important (B. licheniformis, B. subtilis, B. halodurans, B. megaterium) and pathogenic (B. anthracis, B. cereus) Bacillus spp. (Fig. 2).

Table 2.

Details of whole genome sequences of Bacillus spp.

S. no. Microorganism Genome size (Mbp) GC (%) Sequencing Annotation Fold coverage Reference
1. Bacillus anthracis Ames 5.22 35.4 TIGR microbial shotgun projects 13 5
2. Bacillus amyloliquefaciens LL3 3.99 45.7 Roche 454 GS-FLX system RAST 70 53
3. Bacillus anthracis H9401 5.23 35.0 454 GS-FLX UniRef90, NCBI nr, COG, KEGG 30 54
4. Bacillus cereus 5.43 35.3 Applied Biosystems 3700 DNA sequencers ERGO 6 34
5. Bacillus coagulans 36D1 3.55 46.5 Combination of Roche 454 & Sanger UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG 9 13
6. Bacillus halodurans C-125 4.20 43.7 ABI Prism 377 7.1 9
7. Bacillus isronensis 4.0 38.77 Roche 454 GS PROKKA 34 35
8. Bacillus licheniformis DSM13 4.22 46.2 Mega-BACE 1000/4000 ABI Prism 377 DNA ERGO tool 7.2 4
9. Bacillus megaterium WSH-002 4.14 454 Newbler Phred/Phrap/Consed 30
10. Bacillus subtilis 4.21 43.5 3
11. Bacillus subtilis BSn5 4.09 43.85 Illumina Solexa GA IIx PGAAP 169 55
12. Bacillus subtilis subsp. Spizizenii W23 3.60 43.89 Illumina HiSeq 2000 PGAAP 56

Table 3.

Comparative analysis of some industrially important Bacillus spp.

Characteristics B. subtilis B. halodurans B. licheniformis B. amyloliquefaciens B. megaterium
Pathways Embden-Meyerhof-Parnas Glycolytic pathway Tricarboxylic acid cycle (TCA) TCA cycle aminoacyl-tRNA synthetases Glycolysis, pentose cycle, TCA cycle glyoxylate bypass Embden-Meyerhof pathway, Krebs cycle glyoxylate pathway
Open reading frames (%) 87 85 86 89.6 83.3
Genome coding
Predicted number 4100 4066 4208 4325 5124
Conserved with function assigned 2379 2144 3691
Conserved with unknown function 668 1182 211
Non-conserved 1053 743 423
Gene start (%)
ATG 78 78
TTG 13 10
GTG 9.0 12
Avg. gene Length (bp) 890 877 873
RNAs
rRNAs 10 9 7 22 11
tRNAs 86 78 72 72 115
ABC transporters 77 75
Secondary Metabolism 4% 82 secreted protein & enzymes
Phage-associated genes 268 42 71
Reference 5 9 4 53 29

Figure 2.

Figure 2

Phylogenetic relationships among Bacillus spp. Closely related industrially important Bacillus spp. are highlighted in blue and the pathogenic Bacillus spp. are highlighted in red.

Metabolism of the Genus Bacillus

The type and range of metabolism provides information regarding the natural environment of the organism and biological activity. The key feature of B. subtilis metabolism includes the requirement of branched short chain carboxylic acids for lipid biosynthesis. It can also synthesize and utilize branched short chain carboxylic acids and alcohols.10 The high growth yield of B. licheniformis is due to its ability to utilize carbohydrates under conditions of varying oxygen tensions.11 B. anthracis shows reduced ability for sugar utilization as compared to B. subtilis. It lacks catabolic pathways for mannose, arabinose and rhamnose, less phophotransferase system, and sugar transporters. However, the existence of genes for extracellular chitin and chitosan hydrolysis and N-acetylglucosamine utilization reflects the association of this bacterium with insects.12 B. cereus has fewer genes for the degradation of carbohydrates and therefore this group lacks the metabolic versatility for the uptake and assimilation of plant-derived carbohydrates present in the soil as compared to B. subtilis.13

Horizontal Gene Transfer

The comparisons among bacterial, archaeal, and eukaryotic genomes show that an important fraction of genes in the prokaryotic genomes have been subjected to horizontal gene transfer (HGT). The occurrence of HGT can be demonstrated by the discovery of pathogenicity islands, i.e., gene clusters that possess pathogenicity determinants in parasitic bacteria, and similar ‘symbiosis islands’ in symbiotic bacteria.14,15 The comparative genomic analysis of the enterohemorrhagic E. coli strain O157:H7 and the laboratory K12 strain of E. coli has shown that nearly 30% of the genes in the pathogenic E. coli strain O157:H7 have been acquired by HGT.16 Massive archaeal-bacterial HGT has been shown between hyperthermophilic bacteria Aquifex aeolicus17 and Thermotoga maritime.18 Comparisons with mesophilic bacteria showed the presence of more ‘archaeal’ proteins in hyperthermophiles than in mesophiles.17 In another study, mesophilic archaea with relatively larger genomes, such as Methanosarcina and Halobacteria, had acquired more ‘bacterial’ genes than thermophilic archaea with smaller genomes.1921 This implies that ~20% of the genes in an organism might have been acquired via archaeal-bacterial HGT in shared habitats.19

The presence of ten prophages and their remains in B. subtilis genome showed the evolutionary role in HGT, mainly in the propagation of bacterial pathogenesis.10 Alkaliphilic Bacillus strains cannot grow or grow poorly at neutral pH. The complete genome sequence of B. halodurans C125 has thrown light on the mechanism of adaptation of this strain to alkaline environments. The presence of a number of transposase genes (112 genes) played an evolutionary role in HGT and internal genetic rearrangement in the genome.22 The HGT in two strains of B. amyloliquefaciens (DSM7T and FZB42) has not affected the main metabolic pathways.23 The role of HGT is unclear as several of these genes are annotated as conserved hypotheticals.

Comparative analysis of the genomes of B. subtilis and E. coli

The B. subtilis and E. coli genomes are similar in size. The evolutionary divergence of eubacteria into gram positive and gram negative groups has also been shown by sequencing B. subtilis and E. coli genomes. Nearly 1000 B. subtilis genes have orthologous counterparts in E. coli (one-quarter of the genome). These genes neither belong to the prophage-like regions nor to regions coding for secondary metabolism (15% of the B. subtilis genome), signifying that a huge portion of these genomes shared similar functions. One hundred putative operons or parts of the operons were conserved between E. coli and B. subtilis. Twelve of these operons showed a reshuffled gene order (typically, the arabinose operon is araABD in B. subtilis and araBAD in E. coli). Some other classes of operons, including major integrated functions like ATP synthesis (atp operon) and electron transfer (cta and qox operons), are also conserved among these microorganisms. Many operons with unknown functions are also conserved in E. coli and B. subtilis; these could be the main targets for functional analysis of these model genomes.10

Comparative Analysis of Genomes of the Genus Bacillus

The B. subtilis genome contains five signal peptidase genes along with the components of secretion apparatus which help in the secretion of macromolecular hydrolases (proteases and carbohydrases in g L−1) and antibiotics.10 Brown et al24 reported the evolution of a new strain of B. subtilis, named WN716, having different colony morphology as compared to known strains. This strain is also lacking sporulation, competence, acetoin production, motility, multiple auxotrophies, and enhanced competitive strength.25 Genome sequencing of this strain revealed no large chromosomal rearrangements, 34 single- nucleotide polymorphisms (SNPs) and +1 frameshifts. In response, many genes were affected including biosynthetic pathways, sporulation, competence and DNA repair. The whole genome sequencing within the species specified how genetic variation in the nucleotides add to the competitive robustness within species.

The genome sequence of B. subtilis natto was compared to its closely related strain B. subtilis 168. A number of insertion sequences harbored by B. subtilis natto and lacked by the latter strain, showing significant relation to natto production.24 The genome of sporogenic lactic acid bacterium B. coagulans highlights the potential of this organism to produce biocatalysts for the production of fuels and chemicals. Comparative analysis of the genome of B. coagulans with other Bacillus and Lactobacillus spp. reveals the presence of some genes common to both genera however this strain falls into the genus Bacillus. The presence of D-lactate dehydrogenase (D-LDH) encoding gene marks the potential of this organism for probiotic use in humans.26

The genome of industrially important B. licheniformis DSM 13 has been studied in detail. Some industrially significant genes include those which encode carbohydrate-, lipid-, and protein-degrading enzymes. The genes having extracellular protein secretory functions are presented in Table 4. The presence of conserved regulatory DNA motives, the glyoxylate bypass, and the anaerobic ribonucleotide reductase signifies that B. licheniformis can grow on acetate and 2, 3-butanediol as well as anaerobically on glucose.11 The complete genome of B. licheniformis ATCC 14580 was sequenced and compared with closely related B. subtilis. It produces many extracellular enzymes useful in nature in nutrient cycling. The B. licheniformis and B. subtilis shared organizational similarities. However, their genomes differed in numbers and locations of prophages, transposable elements, a number of extracellular enzymes, and secondary metabolic pathway operons.27 Many genes for exoenzyme production are also encoded in the genome including protease Subtilisin Carlsberg precursor,28,29 the glutamic acid-specific protease,30,31 the maltogenic α-amylase,32 and the temperature- and pH-stable α-amylase.3336 The analysis of the genome of industrially important alkaliphilic B. halodurans offered remarkable and significant information.37 This strain has been reported as a producer of β-galactosidase38 and xylanase.39

Table 4.

Enzymes identified in the industrially important strain of B. licheniformis and the corresponding orthologs present in B. subtilis.

Gene ID Gene function Gene designation in B. subtilis
BLi00656 α-Amylase precursor (EC 3.2.1.1)
BLi02117 α-Glucosidase (EC 3.2.1.20)
BLi03021 α-L-Arabinofuranosidase abfA
BLi01295 Arabinan endo-1,5-L-arabinase abnA
BLi04220 Arabinan endo-1,5-α-L-arabinosidase yxiA
BLi04276 Arabinogalactane endo-1,4-α-galactosidase yvfO
BLi00447 β-Galactosidase lacA
BLi04214 β-Glucosidase bglH
BLi01882 Cellulase (EC 3.2.1.4)
BLi01881 Cellulose 1,4-β-cellobiosidase
BLi00338 Chitinase (EC 3.2.1.14)
BLi02088 Endo-1,4-β-glucanase bglC
BLi01883 Endo-1,4-β-mannosidase
BLi00655 Endo-1,4-β-xylanase yjeA
BLi01880 Endo-1,4-glucanase (EC 3.2.1.4)
BLi00545 Esterase/lipase
BLi00340 Glutamic acid-specific protease mpr
BLi02827 Levanase sacC
BLi03707 Levanase yveB
BLi03706 Levansucrase sacB
BLi03370 Lipase/esterase
BLi00658 Maltogenic α-amylase (EC 3.2.1.1)
BLi04019 Minor extracellular serine protease vpr
BLi01123 Minor extracellular serine protease epr
BLi01404 Pectate lyase pel
BLi03053 Pectate lyase pelB
BLi03741 Pectate lyase yvpA
BLi03498 Pectin methylesterase
BLi04177 Peptidase T pepT
BLi01399 Polysugar-degrading enzyme ykfC
BLi02863 Protease yrr
BLi02862 Protease yrrO

Adapted from Veith et al.4

The establishment of relation between intermediary metabolism and genome structure, function, and evolution is important. Therefore, B. subtilis proteins were compared with themselves and with other complete genomes of Bacillus species. Nearly all of the essential genes in B. subtilis have orthologs in B. licheniformis, and many of them are present in a broad range of bacterial taxa. A number of paralogues are composed of big families of functionally associated proteins engaged in the transport of compounds in and out of the cell or in transcription regulation10 (Fig. 3).

Figure 3.

Figure 3

Comparison of the orthologous gene complements of B. licheniformis ATCC 14580, B. subtilis 168 and B. halodurans C-125. Numbers in the rectangular boxes shows the number of pairwise orthologs between neighboring species (BLAST threshold E = 1 × 10−5). Numbers in the outer circles indicates the total number of CDSs predicted in each genome, numbers in areas of overlap represents the number of orthologs predicted by reciprocal BLASTP analysis (threshold E = 1 × 10−5), and the number in the center presents the number of orthologous sequences common to all three genomes. The outer circles show the paralogs in the genome of these Bacillus spp (Modified from Rey et al).27

B. subtilis possesses less genes involved in replication, recombination, and repair showing a relationship with the scarce repetitive elements. On the other hand, the average number of genes related to carbohydrate transport and metabolism are high, as it is a soil microbe and is in close contact with plants and their products.40 Some studies imply that the chromosome remodeling and genome reduction4143 is not a common characteristic of B. subtilis. However, B. pumilus consists of fewer genes involved in DNA repair, oxidative stress, and acid soluble proteins that lessen the DNA damage due to desiccation and formation of UV resistant spores when compared to B. subtilis and related species.43,44

B. megaterium is a commercially available, industrially important non-pathogenic host for the production of products such as vitamin B12, penicillin acylase, and amylases. It employs Sec and Tat systems for effective secretion of proteins into the medium and accounts for a total of 30 secreted proteases genes in the genome. It is also the first biotechnological vitamin B12 producer. It is deep-rooted in the Bacillus phylogeny and is therefore significant in understanding genome evolution, dynamics, and agility in the Bacilli. The presence of an asymmetric region around the origin of replication was syntenic across the genus, a characteristic feature of the genome architecture of Bacillus spp. and sporulating lifestyle. B. megaterium is the largest of all Bacilli because of the presence of a second ftsZ gene.45 The draft genome of B. megaterium WSH-002 contains 9.26% of genes responsible for the synthesis and secretion of proteins.46

Read et al12 sequenced and analyzed the complete genome sequence of the chromosome of B. anthracis Ames for understanding anthrax pathogenesis. Many proteins that contributed to pathogenicity (haemolysins, phospholipases, and iron acquisition functions) and surface proteins, as well as the targets for vaccines and drugs have been identified. The B. anthracis genome was compared with the closely related B. cereus and B. thuringiensis which shared similarity of chromosomal genes but are not associated with anthrax. The high pathogenicity of B. anthracis H9401 is due to a tripartite exotoxin and a poly- D-glutamic acid capsule, the main virulence determinants, responsible for complete pathogenicity and are encoded by two plasmids, pXO1 and pXO2.4749 B. cereus is an opportunistic pathogen that causes food poisoning and is closely related to B. anthracis (animal and human pathogen) and B. thuringiensis (insect pathogen). Ivanova et al13 reported sequencing and analysis of B. cereus ATCC 14579 and compared it with another pathogenic strain B. anthracis A2012. The pathogenicity related genes required for invasion, establishment, and propagation of bacteria in the host are well known in B. anthracis and are also present in B. cereus, showing the similarity among the strains. Whole genome sequence of B. cereus reveals the presence of protein coding sequences (CDSs) for pathogenicity that indicates the relatedness of B. cereus to B. anthracis and B. thuringiensis. Nearly 75%–80% genes are conserved between B. cereus ATCC 14579 and B. anthracis A2012, signifying that both have a common ancestor. The analysis of the metabolic potential encoded by the sets of common genes challenges the assumption of the cereus group ancestor being a soil bacterium. The characteristic feature of soil bacteria (Streptomyces spp. or B. subtilis) is the existence of multiple carbohydrate catabolic pathways, as a variety of carbohydrates are present in the soil and plant-derived material is the major source of nutrients. A total of 41 genes for degradation of carbohydrate polymers were identified in the B. subtilis genome, in contrast to only 14 and 15 CDSs in B. cereus and B. anthracis, respectively. Furthermore, a total of 51 and 48 protease-encoding CDSs were identified in B. cereus and B. anthracis, respectively, as compared to only 30 in B. subtilis. It has been suggested on the basis of certain observations that the insect intestine may have been the natural habitat for the common ancestor of the cereus group. The major collection of antibiotic resistance genes are also the characteristic feature of B. cereus group. This underlines the probability of the cereus group ancestor as an opportunistic insect pathogen than a benign soil bacterium and the nature and origin of the B. cereus.

Shivaji et al50 have recently sequenced the complete genome of B. isronenesis isolated from an air sample collected at an altitude of 27 to 30 kilometers. As it survives in the upper atmosphere and exposed to UV radiation, it carries a uvsE gene encoding a UV DNA damage endonuclease enzyme responsible for a novel form of excision repair of DNA damaged by UV light.51 A number of antibiotic resistance proteins (penicillin acylase 2 precursor, putative niacin/nicotinamide, erythromycin esterase, putative vancomycin resistance protein, tunicamycin resistance protein, putative phosphinothricin acetyltransferase, daunorubicin/doxorubicin resistance ATP binding protein, daunorubicin, doxorubicin, and fosmidomycin resistance) are present in the genome of B. isronenesis. The genome of this novel bacterial strain can be useful in comparative genomics among phylogenetically related Bacillus isolates from different environments.

Conclusions

The genome sequences of Bacillus spp. provide a wealth of information related to gene conservation, diversity among species and systematic information, which cannot be obtained by any other approach. The understanding of the biochemistry of the organism also offers clues regarding its adaptation to the extreme environmental conditions. It can also be applied in genotype association studies, mutation screening, evolutionary studies, and environmental profiling of microorganisms. Genome sequencing has provided insights into the genetic framework of the microbial world and is a leading light of advancing technologies such as microarrays and proteomics, which have reinvigorated the field of microbiological research. There is still a need to explore new industrially important Bacillus species which possess novel genes which encode hitherto unknown products for a variety of applications.

Footnotes

Author Contributions

All authors reviewed and approved of the final manuscript.

Funding

Archana Sharma is grateful to the Council of Scientific and Industrial Research (CSIR), Government of India, New Delhi for awarding research fellowship while writing this review.

Competing Interests

Authors disclose no potential conflicts of interest.

Disclosures and Ethics

As a requirement of publication the authors have provided signed confirmation of their compliance with ethical and legal obligations including but not limited to compliance with ICMJE authorship and competing interests guidelines, that the article is neither under consideration for publication nor published elsewhere, of their compliance with legal and ethical guidelines concerning human and animal research participants (if applicable), and that permission has been obtained for reproduction of any copyrighted material. This article was subject to blind, independent, expert peer review. The reviewers reported no competing interests.

References

  • 1.Harwood CR. Bacillus subtilis and its relatives: molecular biological and industrial workhorses. Trends Biotechnol. 1992;10(7):247–256. doi: 10.1016/0167-7799(92)90233-l. [DOI] [PubMed] [Google Scholar]
  • 2.Satyanarayana T, Sharma DC, Rao JLUM, Ezhilvannan M, Bharadwaj M, Adhikari S. Potential applications of enzymes produced by Bacillus and Geobacillus. In: Satyanarayana T, Johri BN, editors. Microbial Diversity: Current Perspectives and Potential Applications. IK International Pvt Ltd; New Delhi: 2005. pp. 741–768. [Google Scholar]
  • 3.Satyanarayana T, Sharma A, Mehta D, et al. Biotechnological applications of biocatalysts from the Firmicutes Bacillus and Geobacillus species. In: Satyanarayana T, Johri BN, Prakash A, editors. Microorganisms in Sustainable Agriculture and Biotechnology. Springer Verlag; 2012. pp. 343–379. [Google Scholar]
  • 4.Nisha M, Satyanarayana T. Thermostable archeal and bacterial pullulanases and amylopullulanses. In: Satyanarayana T, Littlechild J, Kawarabayasi Y, editors. Thermostable Microbes in Environmental and Industrial Biotechnology. Springer Verlag; 2013. pp. 535–587. [Google Scholar]
  • 5.Kumar V, Verma D, Archana A, Satyanarayana T. Thermostable bacterial xylanases. In: Satyanarayana T, Littlechild J, Kawarabayasi Y, editors. Thermostable Microbes in Environmental and Industrial Biotechnology. Springer Verlag; 2013. pp. 813–857. [Google Scholar]
  • 6.Fleischmann RD, Adams MD, White O, et al. Whole genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269(5223):496–512. doi: 10.1126/science.7542800. [DOI] [PubMed] [Google Scholar]
  • 7.Daniels DL, Plunkett G, Burland V, Blattner FR. Analysis of the Escherichia coli genome: DNA sequence of the region from 84.5 to 86.5 min. Science. 1992;257(5071):771–778. doi: 10.1126/science.1379743. [DOI] [PubMed] [Google Scholar]
  • 8.Alcaraz LD, Moreno-Hagelsieb G, Eguiarte LE, Souza V, Herrera-Estrella L, Olmedo G. Understanding the evolutionary relationships and major traits of Bacillus through comparative genomics. BMC Genomics. 2010;11:332. doi: 10.1186/1471-2164-11-332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ravel J, Fraser CM. Genomics at the genus scale. Trends Microbiol. 2005;13(3):95–97. doi: 10.1016/j.tim.2005.01.004. [DOI] [PubMed] [Google Scholar]
  • 10.Kunst F, Ogasawara N, Moszer I, et al. The complete genome sequence of the Gram-positive bacterium Bacillus subtilis. Nature. 1997;390(6657):249–256. doi: 10.1038/36786. [DOI] [PubMed] [Google Scholar]
  • 11.Veith B, Herzberg C, Steckel S, et al. The complete genome sequence of Bacillus licheniformis DSM13, an organism with great industrial potential. J Mol Microbiol Biotechnol. 2004;7(4):204–211. doi: 10.1159/000079829. [DOI] [PubMed] [Google Scholar]
  • 12.Read TD, Peterson SN, Tourasse N, et al. The genome sequence of Bacillus anthracis Ames and comparison to closely related bacteria. Nature. 2003;423(6935):81–86. doi: 10.1038/nature01586. [DOI] [PubMed] [Google Scholar]
  • 13.Ivanova N, Sorokin A, Anderson I, et al. Genome sequence of Bacillus cereus and comparative analysis with Bacillus anthracis. Nature. 2003;423(6935):87–91. doi: 10.1038/nature01582. [DOI] [PubMed] [Google Scholar]
  • 14.Ochman H, Lawrence JG, Groisman EA. Lateral gene transfer and the nature of bacterial innovation. Nature. 2000;405(6784):299–304. doi: 10.1038/35012500. [DOI] [PubMed] [Google Scholar]
  • 15.Ochman H, Moran NA. Genes lost and genes found: evolution of bacterial pathogenesis and symbiosis. Science. 2001;292(5519):1096–1099. doi: 10.1126/science.1058543. [DOI] [PubMed] [Google Scholar]
  • 16.Perna NT, Plunkett G, III, Burland V, et al. Genome sequence of enterohaemorrhagic Escherichia coli O157:H7. Nature. 2001;409(6819):529–533. doi: 10.1038/35054089. [DOI] [PubMed] [Google Scholar]
  • 17.Aravind L, Tatusov RL, Wolf YI, Walker DR, Koonin EV. Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles. Trends Genet. 1998;14(11):442–444. doi: 10.1016/s0168-9525(98)01553-4. [DOI] [PubMed] [Google Scholar]
  • 18.Nelson KE, Clayton RA, Gill SR, et al. Evidence for lateral gene transfer between Archaea and bacteria from genome sequence of Thermotoga maritima. Nature. 1999;399(6734):323–329. doi: 10.1038/20601. [DOI] [PubMed] [Google Scholar]
  • 19.Kennedy SP, Ng WV, Salzberg SL, Hood L, DasSarma S. Understanding the adaptation of Halobacterium species NRC-1 to its extreme environment through computational analysis of its genome sequence. Genome Res. 2001;11(10):1641–1650. doi: 10.1101/gr.190201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Deppenmeier U, Johann A, Hartsch T, et al. The genome of Methanosarcina mazei: evidence for lateral gene transfer between bacteria and archaea. J Mol Microbiol Biotechnol. 2002;4(4):453–461. [PubMed] [Google Scholar]
  • 21.Koonin EV, Wolf YI. Genomics of bacteria and archaea: the emerging dynamic view of the prokaryotic world. Nucleic Acids Res. 2008;36(21):6688–6719. doi: 10.1093/nar/gkn668. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Takami H, Nakasone K, Takaki Y, et al. Complete genome sequence of the alkaliphilic bacterium Bacillus halodurans and genomic sequence comparison with Bacillus subtilis. Nucleic Acid Res. 2000;28(21):4317–4331. doi: 10.1093/nar/28.21.4317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Rückert C, Blom J, Chen X, Reva O, Borriss R. Genome sequence of B. amyloliquefaciens type strain DSM7T reveals differences to plant- associated B. amyloliquefaciens FZB42. J Biotechnol. 2011;155(1):78–85. doi: 10.1016/j.jbiotec.2011.01.006. [DOI] [PubMed] [Google Scholar]
  • 24.Maughan H, Nicholson WL. Increased fitness and alteration of metabolic pathways during Bacillus subtilis evolution in the laboratory. Appl Environ Microbiol. 2011;77(12):4105–4118. doi: 10.1128/AEM.00374-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Brown CT, Fishwick LK, Chokshi BM, et al. Whole-genome sequencing and phenotypic analysis of Bacillus subtilis mutants following evolution under conditions of relaxed selection for sporulation. Appl Environ Microbiol. 2011;77(19):6867–6877. doi: 10.1128/AEM.05272-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rhee MS, Moritz BE, Xie G, et al. Complete genome sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1. Stand Genomic Sci. 2011;5(3):331–340. doi: 10.4056/sigs.2365342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Rey MW, Ramaiya P, Nelson BA, et al. Complete genome sequence of the industrial bacterium Bacillus licheniformis and comparisons with closely related Bacillus species. Genome Biol. 2004;5(10):R77. doi: 10.1186/gb-2004-5-10-r77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Smith EL, DeLange RJ, Evans WH, Landon M, Markland FS. Subtilisin Carlsberg. V. The complete sequence: Comparison with subtilisin BPN’; evolutionary relationships. J Biol Chem. 1968;243(9):2184–2191. [PubMed] [Google Scholar]
  • 29.Syed R, Wu ZP, Hogle JM, Hilvert D. Crystal structure of selenosubtilisin at 2.0-A resolution. Biochemistry. 1993;32(24):6157–6164. doi: 10.1021/bi00075a007. [DOI] [PubMed] [Google Scholar]
  • 30.Kakudo S, Kikuchi N, Kitadokoro K, et al. Purification, characterization, cloning, and expression of a glutamic acid-specific protease from Bacillus licheniformis ATCC14580. J Biol Chem. 1992;267(33):23782–23788. [PubMed] [Google Scholar]
  • 31.Svendsen I, Breddam K. Isolation and amino acid sequence of a glutamic acid specific endopeptidase from Bacillus. 1992. Eur J Biochem. 1992;204(1):165–171. doi: 10.1111/j.1432-1033.1992.tb16619.x. [DOI] [PubMed] [Google Scholar]
  • 32.Kim IC, Cha JH, Kim JR, et al. Catalytic properties of the cloned amylase from Bacillus licheniformis. J Biol Chem. 1992;267(31):22108–22114. [PubMed] [Google Scholar]
  • 33.Gray GL, Mainzer SE, Rey MW, et al. Structural genes encoding the thermophilic α-amylases of Bacillus stearothermophilus and Bacillus licheniformis. J Bacteriol. 1986;166(2):635–643. doi: 10.1128/jb.166.2.635-643.1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kandra L, Gyemant G, Remenyik J, Hovanszki G, Liptak A. Action pattern and subsite mapping of Bacillus licheniformis α-amylase (BLA) with modified maltooligosaccharide substrates. FEBS Lett. 2002;518(1–3):79–82. doi: 10.1016/s0014-5793(02)02649-2. [DOI] [PubMed] [Google Scholar]
  • 35.Stephens MA, Ortlepp SA, Ollington JF, McConnell DJ. Nucleotide sequence of the 5' region of the Bacillus licheniformis α-amylase gene: Comparison with the B. amyloliquefaciens gene. J Bacteriol. 1984;158(1):369–372. doi: 10.1128/jb.158.1.369-372.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Yuuki T, Nomura T, Tezuka H, et al. Complete nucleotide sequence of a gene coding for heat- and pH-stable α-amylase of Bacillus licheniformis: Comparison of the amino acid sequences of three bacterial liquefying α-amylases deduced from the DNA sequences. J Biochem (Tokyo) 1985;98(5):1147–1156. doi: 10.1093/oxfordjournals.jbchem.a135381. [DOI] [PubMed] [Google Scholar]
  • 37.Takami H, Horokoshi K. Analysis of the genome of an alkaliphilic Bacillus strain from an industrial point of view. Extremophiles. 2000;4:99–108. doi: 10.1007/s007920050143. [DOI] [PubMed] [Google Scholar]
  • 38.Ikura Y, Horokoshi K. Isolation and some properties of beta-galacosidase-producing bacteria. Agric Biol Chem. 1979;43:85–88. [Google Scholar]
  • 39.Honda H, Kudo T, Ikura Y, Horikoshi K. Two types of xylanases of alkalophilic Bacillus sp. No. C-125. Can J Microbiol. 1985;31(6):538–542. [Google Scholar]
  • 40.Takami H, Takaki Y, Chee GJ, et al. Thermoadaptation trait revealed by the genome sequence of thermophilic Geobacillus kaustophilus. Nucleic Acids Res. 2004;32(21):6292–6303. doi: 10.1093/nar/gkh970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Dufresne A, Garczarek L, Partensky F. Accelerated evolution associated with genome reduction in a free-living prokaryote. Genome Biol. 2005;6(2):R14. doi: 10.1186/gb-2005-6-2-r14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Pushker R, Mira A, Rodriguez-Valera F. Comparative genomics of gene-family size in closely related bacteria. Genome Biol. 2004;5(4):R27. doi: 10.1186/gb-2004-5-4-r27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Setlow P. Spores of Bacillus subtilis: their resistance to and killing by radiation, heat and chemicals. J Appl Microbiol. 2006;101(3):514–525. doi: 10.1111/j.1365-2672.2005.02736.x. [DOI] [PubMed] [Google Scholar]
  • 44.Gioia J, Yerrapragada S, Qin X, et al. Paradoxical DNA Repair and Peroxide Resistance Gene Conservation in Bacillus pumilus SAFR-032. PLoS ONE. 2007;2(9):e928. doi: 10.1371/journal.pone.0000928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Eppinger M, Bunk B, Johns MA, et al. Genome sequence of the biotechnologically important Bacillus megaterium strains QM B1551 and DSM319. J Bacteriol. 2011;193(16):4199–4213. doi: 10.1128/JB.00449-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Liu L, Li Y, Zhang J, et al. Complete Genome Sequence of the Industrial Strain Bacillus megaterium WSH-002. J Bacteriol. 2011;193(22):6389–6390. doi: 10.1128/JB.06066-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Brittingham KC, Ruthel G, Panchal RG, et al. Dendritic cells endocytose Bacillus anthracis spores: implications for anthrax pathogenesis. J Immunol. 2005;174(9):5545–5552. doi: 10.4049/jimmunol.174.9.5545. [DOI] [PubMed] [Google Scholar]
  • 48.Leppla SH, Robbins JB, Schneerson R, Shiloach J. Development of an improved vaccine for anthrax. J Clin Invest. 2002;110(2):141–144. doi: 10.1172/JCI16204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Little SF, Ivins BE. Molecular pathogenesis of Bacillus anthracis infection. Microbes Infect. 1999;1(2):131–139. doi: 10.1016/s1286-4579(99)80004-5. [DOI] [PubMed] [Google Scholar]
  • 50.Shivaji S, Ara S, Singh SK, Bandi S, Singh A, Pinnaka AK. Draft genome sequence of Bacillus isronensis Strain B3W22, isolated from the upper atmosphere. J Bacteriol. 2012;194(23):6624–6625. doi: 10.1128/JB.01651-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Kanno S-I, Iwai S, Takao M, Yasui A. Repair of apurinic/apyrimidinic sites by UV damage endonuclease; a repair protein for UV and oxidative damage. Nucleic Acids Res. 1999;27(15):3096–3103. doi: 10.1093/nar/27.15.3096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Quail MA, Smith M, Coupland P, et al. A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers. BMC Genomics. 2012;13:341. doi: 10.1186/1471-2164-13-341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Geng W, Cao M, Song C, et al. Complete genome sequence of Bacillus amyloliquefaciens LL3, which exhibits glutamic acid-independent production of poly-γ-glutamic acid. J Bacteriol. 2011;193(13):3393–3394. doi: 10.1128/JB.05058-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Chun JH, Hong KJ, Cha SH, et al. Complete genome sequence of Bacillus anthracis H9401, an isolate from a Korean patient with Anthrax. J Bacteriol. 2012;194(15):4116–4117. doi: 10.1128/JB.00159-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Deng Y, Zhu Y, Wang P, et al. Complete genome sequence of Bacillus subtilis BSn5, an endophytic bacterium of Amorphophallus konjac with antimicrobial activity for the plant pathogen Erwinia carotovora subsp. carotovora. J Bacteriol. 2011;193(8):2070–2071. doi: 10.1128/JB.00129-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Zeigler DR. The genome sequence of Bacillus subtilis subsp. spizizenii W23: insights into speciation within the B. subtilis complex and into the history of B. subtilis genetics. Microbiology. 2011;157(Pt 7):2033–2041. doi: 10.1099/mic.0.048520-0. [DOI] [PubMed] [Google Scholar]

Articles from Genomics Insights are provided here courtesy of SAGE Publications

RESOURCES