Abstract
Bacillus sphaericus strain C3-41 is an aerobic, mesophilic, spore-forming bacterium that has been used with great success in mosquito control programs worldwide. Genome sequencing revealed that the complete genome of this entomopathogenic bacterium is composed of a chromosomal replicon of 4,639,821 bp and a plasmid replicon of 177,642 bp, containing 4,786 and 186 potential protein-coding sequences, respectively. Comparison of the genome with other published sequences indicated that the B. sphaericus C3-41 chromosome is most similar to that of Bacillus sp. strain NRRL B-14905, a marine species that, like B. sphaericus, is unable to metabolize polysaccharides. The lack of key enzymes and sugar transport systems in the two bacteria appears to be the main reason for this inability, and the abundance of proteolytic enzymes and transport systems may endow these bacteria with exclusive metabolic pathways for a wide variety of organic compounds and amino acids. The genes shared between B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905, including mobile genetic elements, membrane-associated proteins, and transport systems, demonstrated that these two species are a biologically and phylogenetically divergent group. Knowledge of the genome sequence of B. sphaericus C3-41 thus increases our understanding of the bacilli and may also offer prospects for future genetic improvement of this important biological control agent.
Bacillus sphaericus is a naturally occurring, aerobic, mesophilic, spore-forming bacterium, commonly isolated from the soil. Isolates of B. sphaericus can be grouped into 49 serotypes on the basis of flagellar agglutination, but relatively few biochemical and morphological tests are available to distinguish B. sphaericus as a species. Analysis of DNA homology between strains indicated five major groups (I to V), each probably corresponding to a separate species (34). Group II was further subdivided into groups IIA and IIB. Mosquitocidal B. sphaericus strains are all found within DNA subgroup IIA and in association with nine serotypes (H1, H2, H3, H5, H6, H9, H25, H26, and H48). High-toxicity strains exhibit toxicity against mosquito larvae and thus are utilized in insect control programs to reduce the populations of vector species that transmit tropical diseases, such as malaria, filariasis, and arboviral diseases (e.g., yellow fever, dengue fever, and West Nile virus). The mosquitocidal properties are due to the action of binary toxin (Bin proteins), which forms crystal inclusions during sporulation, and mosquitocidal toxins (Mtx proteins), produced during vegetative growth (51). Some strains also produce a further two-component toxin on sporulation (Cry48 and Cry49 proteins) (31). Compared to Bacillus thuringiensis subsp. israelensis, which is the other major bacterium used in the biological control of mosquitoes, B. sphaericus offers a distinct advantage, having higher levels of efficacy and environmental persistence (50, 66). Besides being an important bioinsecticide for mosquito control, B. sphaericus has several important phenotypic properties, including those of being incapable of polysaccharide utilization and having exclusive metabolic pathways for a wide variety of organic compounds and amino acids (3, 28, 54, 59, 72).
To date, no extensive chromosomal DNA sequencing of B. sphaericus isolates has been reported. Moreover, no other genome sequencing of a bacterium incapable of polysaccharide utilization has been completed. B. sphaericus C3-41, a highly active strain isolated from a mosquito breeding site in China in 1987, shows toxicity against Culex sp., Anopheles sp., and Aedes sp. and has significantly higher activity against Culex sp. than the commercialized B. sphaericus strain 2362 (75). The C3-41 strain belongs to the flagellar serotype H5a5b, like B. sphaericus strains 2362 and 1593 (74), and it has been developed as a commercial larvicide (JianBao) and successfully used for the control of mosquito larvae for more than 10 years in China. In some cities, such as Shenzhen, Foshan, and Dongguan in southern China, the C3-41 mosquitocidal formulation has been chosen as the sole larvicidal agent for breeding-site management in the integrated mosquito control program. Here, we report the sequencing of the B. sphaericus C3-41 genome and a comparative analysis with genomes of other species. These data provide a global view of the genes possessed by the organism and an insight into evolutionary relationships among the bacilli.
MATERIALS AND METHODS
Sequencing of the B. sphaericus C3-41 genome.
High-molecular-mass genomic DNA isolated from B. sphaericus C3-41 was used to construct small (1.5 to 3 kb) and large (6 to 8 kb) random sequencing libraries. A whole-genome shotgun sequencing was performed by using Applied Biosystems 3700 DNA sequencers (Perkin-Elmer). The genome was first assembled into 418 contigs by using the Phred-Phrap-Consed package (17, 18, 24). Gaps were closed by primer walking over clone inserts, genome PCR, and pooling. Finally, the genome was assembled into two contigs representing the circular chromosome (with an average of 8.9-times coverage) and the circular plasmid (21.4 times). An estimate of the copy number of the plasmid was obtained by dividing the coverage depth of the plasmid by that of the chromosome. The repeats of the chromosome were categorized by means of a suffix tree algorithm (37).
Sequence annotation.
Glimmer 3 gene finder (56) was utilized to identify potential coding regions. The annotation was accomplished by BlastP analysis of sequences in the Nr, Nt, and Swissprot databases, respectively, and by manual curation of the outputs of a variety of similarity searches and was completed as described previously (63). The possible orthologs of the genome were identified based on the COG (clusters of orthologous groups of proteins) database and classified accordingly (62). The potential coding sequences (CDSs) involved in different pathways were determined by KEGG analysis (47, 30). The protein motifs and domains of all CDSs were documented based on intensive searches against publicly available databases and by using their application tools, including Pfam, PRINTS, PROSITE, ProDom, and SMART. The results were summarized with InterPro (6). tRNA genes were identified by using tRNAscan-SE (41). GC skew analysis and the circular-genome-map drawing were performed by using CGView software (60).
Construction of orthologous/paralogous families and comparison analysis.
Orthologous/paralogous families for B. sphaericus C3-41, five other completely sequenced organisms (i.e., Bacillus anthracis strain Ames, Bacillus subtilis strain 168, Thermoanaerobacter tengcongensis MB4, Clostridium perfringens strain 13, and Escherichia coli K-12), and two gapped genomes, those of Bacillus sp. strain NRRL B-14911 and Bacillus sp. strain NRRL B-14905, were built by using Treefam's method of comparing the gene tree with the species tree (http://www.treefam.org/) in stages (38). The method of Treefam defines a gene family as a group of genes that evolved after the speciation, and the orthologs and paralogs in TreeFam are inferred from the phylogenetic tree of a gene family and are different from those inferred by BLAST matches (i.e., Inparanoid, KOGs, and OrthoMCL) or BLAST matches and synteny (i.e., Ensembl-Compara and HomoloGene). Thus, it also tries to include outgroup genes to reveal the distant members (38). In this study, besides the reference Bacillus genomes, we have taken C. perfringens and E. coli as the outgroup species. In addition, B. sphaericus is an archaic organism and its spores have apparently been found in 25- to 40-million-year-old amber (9). Previous studies have also suggested that this mesophilic bacterium is closely related to some thermophilic archaea (29, 45, 70). T. tengcongensis was, therefore, included in the construction of orthologous/paralogous families.
There are four main steps: (i) an all-versus-all BLAST with proteins and conjoined fragmental alignments by Solar (http://treesoft.svn.sourceforge.net/viewrc/treesoft/branches/dev/solar); (ii) clustering of gene families by using Hcluster_sg (http://treesoft.svn.sourceforge.net/viewrc/treesoft/trunk/hcluster); (iii) performing multiple alignments by Muscle (http://www.ebi.ac.uk/muscle) and converting protein alignments to CDS alignments by using a Perl script; and (iv) building phylogenetic trees and inferring orthologs and paralogs by the neighbor-joining method.
Nucleotide sequence accession numbers.
The B. sphaericus C3-41 genome is available in GenBank under accession numbers CP000817 and CP000818.
RESULTS
General features of the genome sequence.
The genome of B. sphaericus C3-41 consists of a circular chromosome of 4,639,821 bp with an average G+C content of 37.29% and a two-copy plasmid (named pBsph) of 177,642 bp with an average G+C content of 33.10% (Table 1; Fig. 1A and B). A total of 4,786 and 186 CDSs were identified in the chromosome and plasmid, respectively. There are 85 tRNA genes representing all 20 amino acids and 10 rRNA operons in the chromosome. The likely origin of replication of the chromosome of B. sphaericus C3-41 was identified by similarities to several features of the corresponding regions in B. subtilis and other bacteria, including colocalization of four genes (rpmH, dnaA, dnaN, and recF) near the origin, GC nucleotide skew [(G − C)/(G + C)] analysis, and the presence of multiple dnaA boxes and AT-rich sequences immediately upstream of the dnaA gene (13, 36, 40, 42). The replication termination site of the chromosome is believed to be localized near 2.9 megabases (Mb), according to GC skew analysis, and the coding bias for the two strands of the genome is for the majority of CDSs to be on the outer strand from 0 to ∼2.9 Mb and on the inner strand from ∼2.9 Mb to the origin (Fig. 1, circles 1, 4, and 5). This is also reflected by the presence of several genes near the 2.9-Mb site, including parC and parE, which encode the subunits of topoisomerase IV, involved in chromosome partitioning (1, 12). However, we did not find the homolog of rtp (replication terminator protein-encoding gene) in the chromosome of B. sphaericus C3-41, and it can be seen from Fig. 1 that the putative replication termination site is significantly offset from the point diametrically opposite to the origin. This is not unique to B. sphaericus C3-41, but it is unusual. The potential origin of replication of the large, 178-kb plasmid was also analyzed by GC skew (Fig. 1B). The possible replication proteins were identified through database comparisons (Bsph_014-017). Close to this putative origin, a predicted CDS shows high similarities to FtsZ/tubulin family proteins, which are known to be involved in plasmid replication (7, 67).
TABLE 1.
General features of whole-chromosome sequences of B. sphaericus C3-41
Genomic location | Size (bp) | No. of CDSs | Replicon coding (%) | G+C content (%) | No. of:
|
||||
---|---|---|---|---|---|---|---|---|---|
rRNA operons | tRNAs | Putative prophages | Transposase genes | Insertion elements | |||||
Chromosome | 4,639,821 | 4,786 | 82.7 | 37 | 10 | 85 | 3 | 45 | 7 |
Plasmid | 177,642 | 186 | 80.2 | 33.1 | 0 | 0 | 0 | 1 | 1 |
FIG. 1.
Circular representations of the genome of B. sphaericus C3-41. (A) Chromosome. (B) Plasmid pBsph. From the inside: circles 1 and 2, GC skew and G+C content (20-kb window with 5-kb step); circle 3, blue and green bars show positions of tRNA and rRNA, respectively, and black bars show positions of repeats; circles 4 and 5, CDSs on the − and + strands. Colors reflect functional categories of CDSs. Teal, chromatin structure and dynamics; blue, energy production and conversion; orange, cell cycle control, cell division, and chromosome partitioning; maroon, amino acid transport and metabolism; dark blue, nucleotide transport and metabolism; silver, carbohydrate transport and metabolism; dark green, coenzyme transport and metabolism; dark purple, lipid transport and metabolism; navy, translation, ribosomal structure, and biogenesis; light brown, transcription; aqua, replication, recombination, and repair; green, cell wall/membrane/envelope biogenesis; fuchsia, cell motility; gray, posttranslational modification, protein turnover, and chaperones; dark yellow, inorganic ion transport and metabolism; dark blue, secondary metabolite biosynthesis, transport, and catabolism; dark red, general function prediction only; dark gray, function unknown; lime, signal transduction mechanisms; yellow, intracellular trafficking, secretion, and vesicular transport; olive, defense mechanisms; black, not classified by COG. The “0” coordinates marked on the outmost circles correspond to the putative replication origins, and the putative replication termination site is located near 2.9 Mb.
Ortholog analysis of a total of 34,008 CDSs from the genome of B. sphaericus C3-41; five other genomes, those of B. anthracis strain Ames, B. subtilis strain 168, T. tengcongensis MB4, C. perfringens strain 13, and E. coli K-12; and two gapped genomes, those of Bacillus sp. strain NRRL B-14911 and Bacillus sp. strain NRRL B-14905 (GenBank accession numbers NC_003997, NC_000964, NC_003869, NC_003366, NC_000913, NZ_AAOX00000000, and NZ_AAXV00000000) was performed. From this analysis, 29,321 CDSs with obvious homologs were classified in relation to 4,601 protein families. The largest two families of B. sphaericus C3-41 are ATP-binding iron transport system proteins (66 CDSs) and sensory transduction proteins (57 CDSs). About 339 CDSs from B. sphaericus C3-41 have no matches in the other seven species and were unclassified, and another 102 CDSs from B. sphaericus C3-41 that have only low similarities to other CDSs were classified singly.
Genome comparisons between B. sphaericus C3-41 and other bacteria.
Direct comparisons between the predicted CDSs of the B. sphaericus C3-41 chromosome and those of seven other bacterial species (B. anthracis strain Ames, B. subtilis strain 168, T. tengcongensis MB4, C. perfringens strain 13, E. coli K-12, Bacillus sp. strain NRRL B-14911, and Bacillus sp. strain NRRL B-14905) were performed by BLAST analyses (5). The results revealed that the putative CDSs of B. sphaericus C3-41 share greater similarities with those of Bacillus sp. NRRL B-14905 than with those of other Bacillus species (Table 2). About 3,716 CDSs (77.64% of the total genes predicted in B. sphaericus C3-41) have matches in the gapped genome sequences of Bacillus sp. strain NRRL B-14905, and the average identity of these genes is 91.62%. Bacillus sp. strain NRRL B-14905 is a marine Bacillus species, reported previously to be closely related to B. sphaericus C3-41 according to a phylogenetic tree based on 16S rRNA sequencing, and both of them share similar physiological characteristics, such as strict aerobiosis, lack of sugar metabolism, and the production of round spores (58). Additionally, T. tengcongensis MB4 and C. perfringens strain 13 are more closely related than E. coli K-12 to B. sphaericus C3-41, these bacteria having 42.77%, 41.73%, and 27.48% CDSs, respectively, that match those of B. sphaericus C3-41 (Table 2).
TABLE 2.
Genome comparisons between B. sphaericus C3-41 and seven other species
Strain | Size (bp) | Total no. of genes | No. of genes aligned | % of genes in B. sphaericus C3-41 | % of genes in query genomes | Mean DNA identityb |
---|---|---|---|---|---|---|
Bacillus sp. strain NRRL B-14905 | 4,497,271a | 4,624 | 3,716 | 77.64 | 80.36 | 91.62 |
Bacillus sp. strain NRRL B-14911 | 5,085,825a | 5,691 | 2,315 | 48.37 | 40.68 | 50.24 |
B. anthracis strain Ames | 5,227,293 | 5,311 | 2,221 | 46.41 | 41.82 | 48.89 |
B. subtilis strain 168 | 4,214,630 | 4,105 | 2,025 | 42.31 | 49.33 | 48.53 |
T. tengcongensis MB4 | 2,689,445 | 2,588 | 1,107 | 23.13 | 42.77 | 38.64 |
C. perfringens strain 13 | 3,031,430 | 2,660 | 1,110 | 23.19 | 41.73 | 37.91 |
E. coli K-12 | 4,639,675 | 4,243 | 1,166 | 24.36 | 27.48 | 36.28 |
Gapped genome sequences from NCBI database.
Quantitated according to the identity value of the alignment of each gene.
The synteny between genomes of B. sphaericus C3-41 and B. anthracis strain Ames (GenBank accession number NC_003997), B. subtilis strain 168 (GenBank accession number NC_000964), T. tengcongensis MB4 (GenBank accession number NC_003869), C. perfringens strain 13 (GenBank accession number NC_003366), E. coli K-12 (GenBank accession number NC_000913), Bacillus cereus 10987 (GenBank accession number NC_003909), B. cereus ATCC 14579 (GenBank accession number NC_004722), and B. thuringiensis serovar Konkukian strain 97-27 (GenBank accession number NC_005957), respectively, is low or absent (Fig. 2 illustrates this for B. subtilis, B. anthracis, and T. tencongensis). Two main long-fragment syntenies (about 460 kb and 760 kb, respectively) are observed between B. sphaericus C3-41, B. subtilis strain 168, and the three B. cereus group strains. B. sphaericus C3-41 and B. subtilis strain 168 display a better synteny than other Bacillus species. These species may have shared a common “chromosome backbone” in a very ancient stage.
FIG. 2.
Plots of gene pairs based on genomic location. Each dot indicates a single protein plotted on the 5′ end of the coding region in the reference genome and the best match in the query genome. From the top down: comparison of B. sphaericus C3-41 and B. subtilis subsp. strain 168; comparison of B. sphaericus C3-41 and B. anthracis strain Ames; comparison of B. sphaericus C3-41 and T. tengcongensis MB4.
Further comparisons among B. sphaericus C3-41, Bacillus sp. strain NRRL B-14905, and B. subtilis strain 168 were performed. There are a significant number of proteins that are homologous (BLAST E value, <10−5) between B. sphaericus C3-41 (1,227) and Bacillus sp. strain NRRL B-14905 (1,152) but are not present in B. subtilis strain 168 (Fig. 3).
FIG. 3.
Venn diagram illustrating the number of putative proteins associated with each organism and the number shared between these organisms. Bsu, B. subtilis strain 168; Bac, Bacillus sp. strain NRRL B-14905; Bsp, B. sphaericus C3-41; n, total number of putative proteins encoded by the organism.
Mobile genetic elements and prophages.
B. sphaericus C3-41 exhibits 45 and 1 CDSs in the chromosome and the plasmid, respectively, that are predicted to encode transposases. The 45 transposases of the chromosome are assigned to 11 families according to the ortholog analysis among eight species (see Table S1 in the supplemental material). The largest family, containing 18 transposases in B. sphaericus C3-41, has 14 matches in Bacillus sp. strain NRRL B-14905. The CDSs of this family member appear homologous to the transposase sequences of the IS5 and ISNCY families. The second- and third-largest families contain 11 and 5 transposases in B. sphaericus C3-41, having 22 and 19 matches in Bacillus sp. strain NRRL B-14905, respectively. The CDSs of both families are homologous to the IS3 family (The definitions and the sequences of insertion sequence [IS] families are from the IS Finder database [http://www-is.biotoul.fr/is.html]). Obviously, B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905 share transposase sequences with higher similarities than the six other species.
The transposase sequences of B. sphaericus C3-41 were submitted to the IS Finder database to search for possible insertion elements. At least seven IS elements were found in the chromosome (named ISBsph3 to ISBsph9), and one copy of ISBsph9 exists in plasmid pBsph (Table 3). ISBsph3, ISBsph4, ISBsph5, ISBsph6, ISBsph7, and ISBsph9 share a number of features with other IS3 family elements, including inverted repeats, similarity between transposases, and the DD-(35)-E-(7)-K or DD-(35)-E-(7)-R motifs, which are highly conserved patterns among ISs (19, 33, 35).
TABLE 3.
IS elements predicted in genome of B. sphaericus C3-41
Element | Familya | CDS(s) and identity to known ISs | IR(s)b (bp/bp) | Length(s) (bp) |
---|---|---|---|---|
ISBsph3 | IS3 | Bsph_1044-1045, 58% and 34% identity to ISPsy8_PEP2 and ISBcen9_PEP1 | 19/26 | 1,444 |
ISBsph4 | IS3 | Bsph_1068-1070, 35%, 52% and 51% identity to PEP1, PEP2, and PEP3 of ISCpe6 | 24/35 | 1,491 |
ISBsph5 | IS3 | Bsph_1073-1075, the first two CDSs having 39% and 60% identity to ISEnfa3_PEP1 and ISPsy8_PEP3 | 19/28 | 2,140 |
ISBsph6 | IS3 | Bsph_1796-1798, the first two CDSs having 33% and 68% identity to PEP1 and PEP3 of ISBam1 | 14/19 | 2,005 |
ISBsph7 | IS3 | Bsph_2816-2818, the first two CDSs having 44% and 55% identity to PEP2 and PEP3 of ISSau2 | 25/42 | 1,112 |
ISBsph8 | IS607 | Bsph_2862-2863, 44% and 53% identity to PEP1 and PEP2 of ISCbt4 | 17/24 | 1,961 |
ISBsph9 | IS3 | Bsph_3190-3191, Bsph_152-153 | 19/28, 19/28 | 1,554, 1,554 |
The IS classification into families is according to IS Finder (http://www-is.biotoul.fr/is.html).
IR, terminal inverted-repeat sequences.
According to the AT-rich signatures and similarities to other known phage and prophage sequences, three putative prophages or prophage-related regions (i.e., Bsph_1772-1792, Bsph_1936-1952, and Bsph_3729-3788, respectively) were predicted to be in the chromosome. The putative amino acid sequences of Bsph_1945 and Bsph_3765 showed high similarities to the large subunit of terminase, a signature protein that is highly conserved among prophages (10). The first putative prophage was found to be similar to the B. subtilis phage-like element PBSX but lacked a terminase gene. This prophage region is also present in Bacillus sp. strain NRRL B-14905. The third putative prophage is similar to the possible prophage of Salmonella enterica serovar Typhimurium. Only three CDSs in the second putative prophage-related region were similar to known phage-associated proteins, including Bsph_1945 (similar to the putative bacteriophage terminase large subunit), Bsph_1947 (similar to the portal vertex protein), and Bsph_1949 (similar to the phage-related holin protein). Thus, the second putative prophage-related region might be the remnant of phage invasion and genome deterioration during evolution.
Carbohydrate metabolism and transport systems.
The inability to metabolize carbohydrates is a well-known feature of both B. sphaericus and Bacillus sp. strain NRRL B-14905, but the details of their energy metabolism remain to be investigated. Three CDSs, encoding CcpA (catabolite control protein A), HPr, and Crh (the regulatory paralogue of HPr), were present in the genomes of B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905. CcpA is a pleiotropic transcriptional regulator that acts as the key factor in the regulation of carbon and nitrogen metabolism, interacting with regulatory sites in the control regions of the regulated operon to either repress or activate transcription (8, 44, 69, 71). An alignment of CcpA sequences among B. sphaericus C3-41 (Bsph_4200) and 50 other bacterial isolates was performed (data not shown). Despite the variability among different species, the phylogenetic relationships among the 51 samples revealed sequence conservation of this pleiotropic transcriptional regulator in closely related species. B. sphaericus was grouped with the other Bacillus species and classified into a clade with Bacillus sp. strain NRRL B-14905 (see Fig. S1 in the supplemental material). In B. sphaericus C3-41, 15 examples of the TGWAANCGNTNWCA consensus, a cis-acting palindromic sequence called cre (catabolite-responsive element), which might be the regulatory binding sites of CcpA (69), were found throughout the genome where they may influence genes encoding proteins involved in amino acid metabolism, transport, and transcriptional regulators. This number is far less than the numbers in B. subtilis and other low-GC, gram-positive bacterial strains (8, 30, 44, 61).
The function of HPr, which acts as a cofactor for CcpA, is to participate in the phosphotransferase system (PTS)-catalyzed transport and phosphorylation of carbohydrates (15, 21, 30, 61). The presence in B. sphaericus C3-41 of the conserved enzyme I (EI)- and of HPr-encoding genes, such as Bsph_2351, Bsph_2352, and Bsph_0434, which might be involved in the phosphoenolpyruvate-dependent protein phosphorylation chain, was predicted. The direct and specific interaction of CcpA and HPr-Ser46-P or HPr-His15-P has been demonstrated previously and is assumed to provide a direct link between glycolytic activity of the cell and Cre binding by CcpA (15). The predicted HPr of B. sphaericus C3-41 (i.e., Bsph_2351) might be phosphorylated by the EI (PtsI) encoded by Bsph_2352 at His-15 or phosphorylated at a regulatory serine (Ser-46) by ATP and the HPr kinase encoded by Bsph_0434, as occurs in the phosphoenolpyruvate-dependent PTS system of B. subtilis (16, 22, 23, 53), and the phosphoryl group might be transferred to the sugar-specific EIIA binding proteins. This ATP-dependent phosphorylation might regulate the induction and carbon catabolite repression of some catabolic genes, as described previously (55).
However, with the exception of the sugar-binding protein EII that is specific for N-acetylglucosamine, no other sugar-binding protein was identified in B. sphaericus C3-41. Bacterial genome comparison analysis revealed that B. sphaericus C3-41 lacks many PTS systems, such as glucose- or fructose-specific PTS enzyme components. Compared with B. subtilis strain 168 (25 PTS systems) and B. anthracis strain Ames (19 PTS systems), B. sphaericus C3-41 has fewer PTS systems (only 9), and these are probably specific for the transport of N-acetylglucosamine, cellobiose, and an unknown pentitol, according to their KEGG orthologs (http://www.genome.jp/kegg/). Furthermore, ortholog analysis indicated that some ABC sugar transporters functioning in sugar binding and transport are present in the other six species but absent from B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905, both of which may, therefore, be defective in the transport of sugars.
The KEGG pathways were compared between B. sphaericus C3-41, B. subtilis strain 168, and the B. cereus group strains. The results indicated that at least glucose-6-phosphate isomerase (Pgi), phosphomannose isomerase (ManB), PTS system glucose-specific EII, and fructose-specific EII components were absent from B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905. Furthermore, both B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905 lack an additional 17 enzymes involved in carbohydrate metabolism in B. subtilis (data not shown). The absence of a pgi gene from B. sphaericus was reported previously by other authors (28), and the introduction of a heterologous pgi gene into this bacterium was able to restore polysaccharide utilization in vitro but not in vivo (B. Han, unpublished data). Thus, the absence of key metabolizing enzymes and a sugar transport system may explain the metabolic inactivity toward glucose, fructose, and most polysaccharides.
In contrast, a fragment of the N-acetylglucosamine utilization operon present in B. sphaericus C3-41 (Bsph_2343-Bsph_2352), including genes encoding a GntR family transcriptional regulator, three ABC transporters, N-acetylglucosamine-6-phosphate deacetylase (NagA), glucosamine-6-phosphate deaminase (NagB), HPr, and PtsI, might be involved in the whole pathway for N-acetylglucosamine metabolism and transport, which supports previous indications that B. sphaericus can degrade N-acetylglucosamine (4). It is interesting to note the presence of another N-acetylglucosamine utilization operon (Bsph_3892-Bsph_3899). The nagA gene in this region contains several frame shifts, causing a premature stop, and might be a pseudogene (Fig. 4).
FIG. 4.
CDSs probably involved in N-acetylglucosamine metabolism. From left to right: (a) Bsph_2343, transcriptional regulator (GntR family); Bsph_2344-2346, ABC transporters; Bsph_2347, PTS system N-acetylglucosamine-specific EIIC component (NagE); Bsph_2348, N-acetylglucosamine-6-phosphate deacetylase (NagA); Bsph_2349, hypothetical protein; Bsph_2350, glucosamine-6-phosphate deaminase (NagB); Bsph_2351, HPr (PtsH); and Bsph_2352 (PtsI); and (b) Bsph_3892-Bsph_3894, PTS system EIIC/A/B components; Bsph_3895, hypothetical protein; Bsph_3896: HPr (PtsH); Bsph_3897, hypothetical protein; Bsph_3898, transcriptional regulator (GntR family); and Bsph_3899, similar to NagA but containing several frame shifts. The possible transcriptional regulators are indicated by gray arrows, ABC transporters by arrows with dots, CDSs involved in sugar PTS systems by hatched arrows, hypothetical proteins by open arrows, and key enzymes in N-acetylglucosamine metabolism by dark arrows.
Other metabolic enzymes and transport systems.
Although B. sphaericus C3-41 is incapable of polysaccharide utilization, it has exclusive metabolic pathways for a wide variety of organic compounds and amino acids (3, 54). A 12-gene urease gene cluster (Bsph_2699-Bsph_2710) encoding the urease structural (UreABC) and accessory proteins (UreDEFG), as well as five ABC transporters, which may be a transcriptional unit, is located in the chromosome of B. sphaericus C3-41. The urease gene cluster is not present in B. anthracis strain Ames, B. thuringiensis serovar Konkukian strain 97-27, B. cereus ATCC 14579, E. coli K-12, or T. tengcongensis MB4 according to our comparative analysis, but is present in B. cereus ATCC 10987 as described before (52). Also, the 12 genes of the urease cluster are located in the gapped chromosome of Bacillus sp. strain NRRL B-14905, but only 4 genes of the urease cluster are located in B. subtilis strain 168. Upstream of the urease gene cluster, a putative transcriptional regulator and an aspartase are identified. The presence of the urease enzymes may increase the fitness of B. sphaericus C3-41 under different nutrient conditions in the environment.
A fragment of a putative ethanolamine utilization operon consisting of 13 CDSs, including eutA, eutB, eutE, eutH, eutL, eutM, eutP, and eutS, was identified in B. sphaericus C3-41 (Bsph_2095-Bsph_2108). Similar operons are also found in Bacillus sp. strain NRRL B-14905, C. perfringens strain 13, and E. coli K-12, but not in the other five genomes, those of B. subtilis strain 168, B. anthracis strain Ames, B. thuringiensis serovar Konkukian strain 97-27, B. cereus ATCC 14579, and T. tengcongensis MB4, in our comparison. It has been proposed that a metabolic pathway for ethanolamine utilization may be an important survival strategy against the constant famine that microorganisms face in nature (20, 43). In addition, ethanolamine found in the mammalian gastrointestinal tract may present an important alternative source of nitrogen and carbon for bacteria living in the gut. Therefore, like many enterobacteria, such as E. coli K-12, B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905 may use ethanolamine as a source of both carbon and nitrogen (57).
In contrast to the reduced number of sugar-specific phosphoenolpyruvate-dependent PTS system genes, the genome of B. sphaericus C3-41 harbors abundant CDSs for ABC transporters (253 CDSs in total). Among them, 99 transporters appear to be involved in amino acid transport and metabolism in B. sphaericus C3-41, compared to only 32 in B. subtilis strain 168. Among the ABC peptide transporters in B. sphaericus C3-41, there are 32 putative ATP-binding proteins and 46 permeases. Only seven ABC transporters appear to be used in polysaccharide or sugar transport and metabolism, including Bsph_1250, Bsph_0763, Bsph_1252, Bsph_0765, Bsph_1251, Bsph_1260, and Bsph_0764. Furthermore, B. sphaericus C3-41 has more peptidases and proteases than B. subtilis strain 168 (93 versus 30 and 75 versus 24, respectively). Among these is Bsph_0341, which encodes sphaericase (also known as sfericase), the enzyme responsible for the degradation of the Mtx1 mosquitocidal toxin in B. sphaericus (68). In addition, the prediction of four amino acid efflux systems and 17 other transport systems involved in amino acid transport and metabolism implies that B. sphaericus C3-41 should be able to carry out active scavenging and secretion processes. The inability to metabolize carbohydrates and the abundant presence of protein-metabolizing systems suggests that the ancestor of B. sphaericus may have an animal- or insect-associated origin.
Structural component, S layer, and membrane proteins.
Another interesting similarity between B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905 that may affect the phenotype is the presence of 22 CDSs probably functioning as surface (S) layer proteins or S layer homologs and of 17 membrane proteins. The counterparts of the S layer and membrane proteins in B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905 were not found in B. subtilis strain 168. These membrane- or S layer-associated proteins may be related to the marine environment of Bacillus sp. strain NRRL B-14905 or the multiple environmental sources of bacteria designated as B. sphaericus, including soil, aquatic habitats, and even waste piles from uranium mines (49). In the latter case, the complex of S layer proteins appears to be important in heavy metal sequestration by B. sphaericus strains, although it is not clear whether such strains belong to the same group (IIA) of B. sphaericus strains as C3-41.
Recently it was proposed that B. fusiformis and B. sphaericus be reclassified into the genus of Lysinibacillus (2). We suggest that Bacillus sp. strain NRRL B-14905 also should be classified into one genus with B. fusiformis and B. sphaericus. This is also in accordance with the phylogenetic relationship among the three Bacillus species based on 16S rRNA genes (58).
Sporulation and germination.
As with other spore-forming bacteria, the spore formation of B. sphaericus occurs in response to nutrient limitation in the environment. A list of protein families related to sporulation and germination according to ortholog/paralog analysis of B. sphaericus C3-41 and seven other species was compiled (see Table S2 in the supplemental material). The sporulation of B. sphaericus C3-41 initiated by a deficiency in energy might be linked to the expression of several genes, probably functioning as sporulation initiation proteins or regulators of genetic competence, such as Bsph_1217. Four CDSs, including two AbrB genes (Bsph_0057 and Bsph_113, present in the chromosome and plasmid, respectively), Bsph_0076, and Bsph_2825, might regulate the developmental pathways of spores and biofilms in B. sphaericus C3-41, according to their predicted function. The predicted CDSs encoding spore coat proteins of B. sphaericus C3-41 are classified into 13 protein families according to their homologies. The 13 protein families are present in all five Bacillus species included in this study. For example, many of the B. subtilis strain 168 spore coat proteins, including CotA, CotE, CotY, CotJ, YhcQ, YabP, YabQ, YkuD, and SspF, are included in the 13 protein families. Fifty-three CDSs predicted in the chromosome of B. sphaericus C3-41, whose products are classified into 13 protein families according to ortholog analysis, are related to spore germination. Most of these CDSs can be classified into the GerC, GerK, GerH, GerQ, and GerX families, according to homology to these operons. Seventeen CDSs probably associated with sporulation and germination, such as GerP, can be found in other Bacillus species but are absent from B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905. Moreover, gene orthologs and COG analysis show that at least 64 possible CDSs in B. sphaericus C3-41 might be involved in the regulation and posttranslational modification of the sporulation process and assembly of the spore coat and exosporium (COG data are not shown).
However, sporulation and germination are complex, multistage events and a number of genes involved in transcription, signal transduction, posttranslational modification, and other functions may also be involved in these processes, including the assembly of the spore coat and exosporium. Thus, many CDSs not included in Table S2 in the supplemental material and some hypothetical proteins with as-yet-unknown functions might also be linked to the sporulation and germination processes, and the resolution of this issue will require further proteomic analysis.
Evolution of mosquitocidal toxin genes.
Unlike other bacterial pathogens, whose virulence genes are assembled within putative pathogenicity islands, the insecticidal toxins of B. sphaericus C3-41 are widely distributed around the chromosome. Comparison of the region encoding the mtx1 genes (64) in strain 2297 (GenBank accession number AB126007) and strain SSII-1 (GenBank accession number M60446) with the equivalent region from B. sphaericus C3-41 (nucleotides 1,141,180 to 1,138,569) reveals that the mtx1 gene in C3-41 might be a pseudogene with a frame shift (Bsph_1076). The mtx1 gene is poorly expressed in B. sphaericus and has a repressor binding site between its putative promoter and its ribosome binding site (64). The mtx2 toxin gene (Bsph_1071) (previously described by Thanabalu and Porter) is located close to the mtx1 toxin gene (65). The Mtx2 protein is known to be related to the Mtx3 toxin of B. sphaericus (39), which is present in the C3-41 genome as Bsph_2822. It is interesting to note a further open reading frame (ORF), Bsph_2821, located upstream of mtx3 that has homology to both mtx2 and mtx3 but appears to be a pseudogene.
Our results indicated that the mtx1, mtx2, and mtx3 genes of B. sphaericus C3-41 might have some direct relationship to mobile genetic elements. Two insertion elements that were named ISBsph4 and ISBsph5, located within the mtx1-mtx2 cluster region, were found (Fig. 5A). orf2 and orf3 of the mtx2 distal insertion element appear to be a transposase disrupted into two nonfunctional genes (i.e., Bpsh_1069 and Bpsh_1070) by a premature stop, which suggests that this element may have been integrated for some time. This notion may be supported by the fact that, of the known B. sphaericus toxins, the Mtx2 proteins are the least conserved, with interstrain variations having significant effects on host range (11). There is also an insertion element (ISBsph7) upstream of Bsph_2821 and mtx3 (Bsph_2822) (Fig. 5B). This implies that these mobile genetic elements might play an important role in Mtx toxin evolution. With the exception of mtx1 (Bsph_1076), the other predicted mtx toxin genes, mtx2 (Bsph_1071) and mtx3 (Bsph_2822), have close orthologs in Bacillus sp. strain NRRL B-14905.
FIG. 5.
Mosquitocidal toxin gene clusters and relationship with mobile genetic elements. (a) Gene cluster from Bsph_1068 to Bsph_1080. (b) Gene cluster from Bsph_2814 to Bsph_2825. Gray arrows, transposases; arrows with lattice, mosquitocidal toxins; dark arrows, possible transcriptional regulators; open arrows, hypothetical proteins. Insertion elements and terminal inverted-repeat sequences (IR) are marked in the figures. The region encoding the possible Mtx1 contains a frame shift. pseudo, pseudogene.
The binary toxin genes binA and binB of B. sphaericus C3-41, which are the main source of its activity against mosquito larvae, were found to be located in an ∼35-kb duplicate fragment present both in the chromosome and in the large plasmid. In total, 21 CDSs were predicted within this duplicated fragment (Fig. 6). Apart from binA and binB (Bsph_3193/Bsph_155 and Bsph_3192/Bsph_154, respectively), the nearby Bsph_3195/Bsph_157 appears to encode a further protein with homology to the Mtx2/3 toxins. Genes encoding a putative peptide synthase and a chitin-binding protein are located in this region (Bsph_3196/Bsph_158 and Bsph_3198/Bsph_160). Two CDSs for phage integrase family proteins were observed upstream of the 35-kb fragment. Similar to those of the mtx cluster, a putative transposase gene (Bsph_3188) and insertion element (ISBsph9) are also present within the ∼35-kb duplicate fragment, suggesting that the pathogenicity locus of B. sphaericus C3-41 may have phage or transposon origins. The GerXB-XA-XC gene cluster is found upstream of the putative transposase gene. This is exactly in accordance with B. anthracis plasmid pXO1 that also has a GerXB-XA-XC gene cluster upstream of a transposase (48), where it appears to influence the germination rate of B. anthracis spores (26, 27). Comparison of the region following the binary toxin genes in B. sphaericus strains C3-41 and strain 1593 (GenBank accession number AJ224477) with the equivalent region from strain 2297 (GenBank accession number AJ224478) reveals that the strain 2297 sequence contains a probable transposase pseudogene. The transposase homology begins at a small CDS capable of encoding a peptide of only 14 amino acids, but the homology continues in various reading frames, with at least five frame shifts, for 1,110 nucleotides. In strains C3-41 and 1593, however, a 1,554-bp insertion is seen at the end of the small CDS of strain 2297. Several repeat sequences characteristic of transposition/rearrangement events are found in the above transposase regions. Downstream of the initial CDS in strain 2297, there is a direct repeat (TAAAGAATATAA) separated by 25 bases within the disrupted transposase. This feature is maintained in strain C3-41, downstream of the inserted sequence. In the latter strain, there is a long direct-repeat sequence (AAATAAAGTCgtGAtGTTTATAAAAAAGaTGCGAtgTTTaAAATAAAGTCacGAcGTTTATAAAAAAGcTGCGAag cTTT; centered around the a nucleotide) beginning just 11 nucleotides from the 5′ end of the inserted sequence and a smaller inverted-repeat sequence 62 nucleotides upstream of the 3′ end of the insertion (ATAGAAAAAGCGTGTTTTGAtcAAtctTTttTCAAAACACGCTTTTTCTAT; centered around the tct nucleotides) (Fig. 6).
FIG. 6.
CDSs of the 35-kb duplicates in B. sphaericus C3-41 and comparisons of the transposase homologs downstream from the binary toxins in B. sphaericus strains C3-41, 1593, and 2297. The 35-kb duplicates presented both in the chromosome and in the large plasmid contain 21 CDSs. Binary toxin BinA and BinB, CDSs for phage integrase family protein, insertion elements, the GerXB-XA-XC operon, a putative peptide synthase, and a chitin-binding protein are indicated. Some other, hypothetical proteins are indicated by open arrows. The transposase homolog of B. sphaericus 2297 begins at a small CDS capable of encoding a peptide of only 14 amino acids, but the homolog continues in various reading frames for 1,110 nucleotides. In strains C3-41 and 1593, a 1,554-bp insertion is located at the end of the small CDS of strain 2297. Downstream of the initial CDS in strain 2297, there is a direct repeat (TAAAGAATATAA) separated by 25 bases within the disrupted transposase. This feature is maintained downstream of the inserted sequence in strains 1593 and C3-41. The two opposite-facing triangles represent the long, direct-repeat sequence at the 3′ end of the inserted sequence and the shorter, inverted-repeat sequence upstream of the 3′ end of the IS, respectively.
Comparative analysis indicated that BinA, BinB, and the two phage integrase family proteins are unique to B. sphaericus and are not present in the other seven species analyzed. We hypothesize that the binary toxins and even the ∼35-kb fragment may be the remnant of the phage infection that may have happened only in B. sphaericus, and not in the other seven species listed in this study. The plasmid location, putative phage infection, and transposition may all provide explanations of the fact that bin genes are not present in all B. sphaericus strains, while the bin genes of different serotypes and isolates show extremely high levels of similarity (73, 74).
A further insecticidal toxin from B. sphaericus was recently reported to be active against the German cockroach Blattela germanica (46). The gene for this sphaericolysin is identified in the B. sphaericus C3-41 genome as Bsph_4094 and has high similarity with the gene encoding cereolysin O, which is present in many Bacillus species. In addition, the chromosome of B. sphaericus C3-41 contains several homologs of genes known to be involved in the pathogenicity of other gram-positive bacteria, such as B. cereus and Listeria monocytogenes. These include haemolysin III (25) or putative haemolysin CDSs (Bsph_2727, Bsph_3508, Bsph_3583, and Bsph_3651), internalin-like genes (Bsph_4728), sigma factor B (Bsph_4295), and P60 extracellular protease (Bsph_1123) (14). These shared virulence genes might be part of the common arsenal associated with the pathogenicity of some gram-positive bacteria.
Duplication in genome sequences.
The predicted proteins of B. sphaericus C3-41 were used for finding segmental duplication and tandem duplication. The results of BLAST analysis revealed a large number of predicted CDSs with similar matches within the chromosome of B. sphaericus C3-41 (BLAST E value, <10−5), indicating gene duplication (Fig. 2A). This includes single-gene duplication with the two genes widely separated, duplication of neighboring genes, fragment duplication with many CDSs, and even the duplication of almost one-third to one-half of the circular chromosome. The duplication of the chromosome itself suggested that the chromosome of B. sphaericus C3-41 might have evolved by a drastic doubling in genome size. Similar phenomena were observed in other Bacillus species, such as B. anthracis strain Ames (GenBank accession number NC_003997) (data not shown). On the other hand, by genome comparison two main long-fragment syntenies (about 460 kb and 760 kb, respectively) were observed between B. sphaericus C3-41, B. subtilis strain 168, and the three B. cereus group strains, as we described before. Thus, we made an assumption that the Bacillus species shared a common “chromosome backbone” in a very ancient stage, after which a duplication process happened, the chromosome size increased, and then variation developed as a consequence of complex and dynamic evolutionary mechanisms, finally resulting in the divergence of species. In addition, the presence of the ∼35-kb duplication between the chromosome and the large plasmid pBsph, containing the CDSs of binary toxin, transposase, and phage integrase family proteins, was confirmed by analyses, including the average sequencing coverage of the chromosome (8.9 times), plasmid (21.4 times), and 35-kb duplicate fragment (25 times), respectively, multiple PCRs, single-nucleotide polymorphism analysis, and the coding bias (data not shown). We suggest that this large fragment may be the remnant of a phage infection and, thus, that bin genes are only present in a subset of B. sphaericus strains and show extremely high levels of similarity among different serotypes and isolates.
DISCUSSION
There is a considerable debate surrounding the evolutionary process and systematic classification of B. sphaericus. The completion of the genomic sequence of B. sphaericus C3-41 and comparative analysis with other bacterial species now lays the foundation for understanding the special metabolic abilities and some common physiological and biochemical characteristics on a molecular level and, accordingly, offers new information regarding the evolution and ecology of this species.
Based on overall nucleotide and protein similarities, B. sphaericus C3-41 is most similar to a marine bacterial species, Bacillus sp. strain NRRL B-14905. The similarities of these two species, including mobile genetic elements, membrane-associated proteins, and metabolic and transport systems, provide solid data to explain the common features of B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905, such as their inability to utilize polysaccharide, and suggest that these two species are a biologically and phylogenetically divergent group whose members have developed to adapt to particular environmental conditions over evolutionary time. This is in accordance with previous suggestions that B. sphaericus has some special features similar to those of some archaic organisms and bacteria that can grow in extreme environments (9, 49) and is, therefore, distinct from most Bacillus species (45). It was recently proposed that B. fusiformis and B. sphaericus be reclassified into the genus of Lysinibacillus, with the renaming of B. sphaericus as Lysinibacillus sphaericus (2). We would propose from the comparative analysis in our study that Bacillus sp. NRRL B-14905 should be classified into one genus with B. fusiformis and B. sphaericus.
On the other hand, despite the approximately 80% identity of the B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905 genomes, there are many CDSs that are individually unique in B. sphaericus C3-41. These include prophage and IS elements, sporulation- and germination-related proteins, and virulence factors. Some unique genes, such as bin genes and mtx1, might have been obtained by insertion through mobile genetic elements. This means of gene acquisition may explain why these genes are only present in some B. sphaericus strains and show extremely high levels of similarity among different serotypes and isolates.
Together, the similarities and differences may hint at overlapping but nonidentical environmental and ecological niches for the taxa of these species. In performing a comparative analysis of the synteny and duplication among eight species, we postulated that, although B. sphaericus C3-41 is quite different from other Bacillus species, it still shares a common “chromosome backbone” with them. The variation of the chromosomes might be due to duplication and a complex and dynamic evolutionary process producing the current bacterial species.
Supplementary Material
Acknowledgments
We thank Quanxin Cai and Dongyuan Liu for technical assistance.
This project was supported by grant number KSCX2-SW-315 from the Chinese Academy of Sciences, 973 project number 2003CB114201, and grant number 30470037 from NFSC, China, and with financial assistance from Valent Biosciences Corp.
Footnotes
Published ahead of print on 22 February 2008.
Supplemental material for this article may be found at http://jb.asm.org/.
REFERENCES
- 1.Adams, D. E., E. M. Shekhtman, E. L. Zechiedrich, M. B. Schmid, and N. R. Cozzarelli. 1992. The role of topoisomerase IV in partitioning bacterial replicons and the structure of catenated intermediates in DNA replication. Cell 71277-288. [DOI] [PubMed] [Google Scholar]
- 2.Ahmed, I., A. Yokota, A. Yamazoe, and T. Fujiwara. 2007. Proposal of Lysinibacillus boronitolerans gen. nov. sp. nov., and transfer of Bacillus fusiformis to Lysinibacillus fusiformis comb. nov. and Bacillus sphaericus to Lysinibacillus sphaericus comb. nov. Int. J. Syst. Evol. Microbiol. 571117-1125. [DOI] [PubMed] [Google Scholar]
- 3.Alexander, B., and F. G. Priest. 1990. Numerical classfication and identification of Bacillus sphaericus including some strains pathogenic for mosquito larvae. J. Gen. Microbiol. 136367-376. [DOI] [PubMed] [Google Scholar]
- 4.Alice, A. F., G. Pérez-Martínez, and C. Sánchez-Rivas. 2003. Phosphoenolpyruvate phosphotransferase system and N-acetylglucosamine metabolism in Bacillus sphaericus. Microbiology 1491687-1698. [DOI] [PubMed] [Google Scholar]
- 5.Altschul, S. F., W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. 1990. Basic local alignment search tool. J. Mol. Biol. 215403-410. [DOI] [PubMed] [Google Scholar]
- 6.Apweiler, R., T. K. Attwood, A. Bairoch, A. Bateman, E. Birney, M. Biswas, P. Bucher, L. Cerutti, F. Corpet, and M. D. Croning. 2001. The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 2937-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Berry, C., S. O'Neil, E. Ben-Dov, A. F. Jones, L. Murphy, M. A. Quail, M. T. G. Holden, D. Harris, A. Zaritsky, and J. Parkhill. 2002. Complete sequence and organization of pBtoxis, the toxin-coding plasmid of Bacillus thuringiensis subsp. israelensis. Appl. Environ. Microbiol. 685082-5095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Blencke, H. M., G. Homuth, H. Ludwig, U. Mäder, M. Hecker, and J. Stülke. 2003. Transcriptional profiling of gene expression in response to glucose in Bacillus subtilis: regulation of the central metabolic pathways. Metab. Eng. 5133-149. [DOI] [PubMed] [Google Scholar]
- 9.Cano, R. J., and M. K. Borucki. 1995. Revival and identification of bacterial spores in 20- to 40-million-year-old Dominican amber. Science 2681060-1064. [DOI] [PubMed] [Google Scholar]
- 10.Casjens, S. 2003. Prophages and bacterial genomics: what have we learned so far? Mol. Microbiol. 49277-300. [DOI] [PubMed] [Google Scholar]
- 11.Chan, S. W., T. Thanabalu, B. Y. Wee, and A. G. Porter. 1996. Unusual amino acid determinants of host range in the Mtx2 family of mosquitocidal toxins. J. Biol. Chem. 27114183-14187. [DOI] [PubMed] [Google Scholar]
- 12.Chen, Y., and H. P. Erickson. In vitro assembly studies of FtsZ/tubulin-like proteins (TubZ) from Bacillus plasmids—evidence for a capping mechanism. J. Biol. Chem., in press. doi: 10.1074/jbc.M709163200. [DOI] [PMC free article] [PubMed]
- 13.Christensen, B. B., T. Atlung, and F. G. Hansen. 1999. DnaA boxes are important elements in setting the initiation mass of Escherichia coli. J. Bacteriol. 1812683-2688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Cossart, P. 2002. Molecular and cellular basis of the infection by Listeria monocytogenes: an overview. Int. J. Med. Microbiol. 291401-409. [DOI] [PubMed] [Google Scholar]
- 15.Deutscher, J., E. Küster, U. Bergstedt, V. Charrier, and W. Hillen. 1995. Protein kinase-dependent HPr/CcpA interaction links glycolytic activity to carbon catabolite repression in gram-positive bacteria. Mol. Microbiol. 151049-1053. [DOI] [PubMed] [Google Scholar]
- 16.Deutscher, J., and M. H. Saier, Jr. 1983. ATP-dependent protein kinase catalyzed phosphorylation of a seryl residue in HPr, a phosphate carrier protein of the phosphotransferase system in Streptococcus pyogenes. Proc. Natl. Acad. Sci. USA 806790-6794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ewing, B., and P. Green. 1998. Base-calling of automated sequencer traces using Phred. II. Error probabilities. Genome Res. 8186-194. [PubMed] [Google Scholar]
- 18.Ewing, B., L. Hillier, M. C. Wendl, and P. Green. 1998. Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res. 8175-185. [DOI] [PubMed] [Google Scholar]
- 19.Fayet, O., P. Ramond, P. Polard, M. F. Prere, and M. Chandler. 1990. Functional similarities between retroviruses and the IS3 family of bacterial insertion sequences? Mol. Microbiol. 41771-1777. [DOI] [PubMed] [Google Scholar]
- 20.Freter, R., H. Brickner, M. Botney, D. Cleven, and A. Aranki. 1983. Mechanisms that control bacterial populations in continuous-flow culture models of mouse large intestinal flora. Infect. Immun. 39676-685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Galinier, A., J. Haiech, M. C. Kilhoffer, M. Jaquinod, J. Stülke, J. Deutscher, and I. Martin-Verstraete. 1997. The Bacillus subtilis crh gene encodes a HPr-like protein involved in catabolite repression. Proc. Natl. Acad. Sci. USA 948439-8444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Galinier, A., M. Kravanja, R. Engelmann, W. Hengstenberg, M. C. Kilhoffer, J. Deutscher, and J. Haiech. 1998. New protein kinase and protein phosphatase families mediate signal transduction in bacterial catabolite repression. Proc. Natl. Acad. Sci. USA 951823-1828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gassner, M., D. Stehlik, O. Schrecker, W. Hengstenberg, W. Maurer, and H. Rüterjans. 1977. The phosphoenolpyruvate-dependent phosphotransferase system of Staphylococcus aureus. 2.1H and 31P nuclear-magnetic-resonance studies on the phosphocarrier protein HPr, phosphohistidines and phosphorylated HPr. Eur. J. Biochem. 75287-296. [DOI] [PubMed] [Google Scholar]
- 24.Gordon, D., C. Abajian, and P. Green. 1998. Consed: a graphical tool for sequence finishing. Genome Res. 8195-202. [DOI] [PubMed] [Google Scholar]
- 25.Granum, P. E., and T. Lund. 1997. Bacillus cereus and its food poisoning toxins. FEMS Microbiol. Lett. 157223-228. [DOI] [PubMed] [Google Scholar]
- 26.Guidi-Rontani, C., M. Weber-Levy, E. Labruyere, and M. Mock. 1999. Germinationof Bacillus anthracis spores with alveolar macrophages. Mol. Microbiol. 319-17. [DOI] [PubMed] [Google Scholar]
- 27.Guidi-Rontani, C., Y. Pereira, S. Ruffie, J.-C. Sirard, M. Weber-Levy, and M. Mock. 1999. Identification and characterisation of a germination operon on the virulence plasmid pXO1 of Bacillus anthracis. Mol. Microbiol. 33407-414. [DOI] [PubMed] [Google Scholar]
- 28.Han, B., H. Liu, X. Hu, Y. Cai, D. Zheng, and Z. Yuan. 2007. Molecular characterization of a glucokinase of broad hexose specificity from Bacillus sphaericus strain C3-41. Appl. Environ. Microbiol. 733581-3586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Han, B., H. Liu, X. Hu, and Z. Yuan. 2006. Preliminary characterization of a thermostable DNA polymerase I from a mesophilic Bacillus sphaericus strain C3-41. Arch. Microbiol. 186203-209. [DOI] [PubMed] [Google Scholar]
- 30.Henkin, T. M. 1996. The role of the CcpA transcriptional regulator in carbon metabolism in Bacillus subtilis. FEMS Microbiol. Lett. 1359-15. [DOI] [PubMed] [Google Scholar]
- 31.Jones, G. W., C. Nielsen-Leroux, Y. Yang, Z. Yuan, V. Fiúza Dumas, R. Gomes Monnerat, and C. Berry. 2007. A new Cry toxin with a unique two-component dependency from Bacillus sphaericus. FASEB J. 214112-4120. [DOI] [PubMed] [Google Scholar]
- 32.Reference deleted.
- 33.Katzman, M., J. P. Mack, A. M. Skalka, and J. Leis. 1991. A covalent complex between retroviral integrase and nicked substrate DNA. Proc. Natl. Acad. Sci. USA 884695-4699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Krych, V. K., J. L. Johnson, and A. A. Yousten. 1980. Deoxyribonucleic acid homologies among strains of Bacillus sphaericus. Int. J. Sys. Bacteriol. 30476-484. [Google Scholar]
- 35.Kulkosky, J., K. S. Jones, R. A. Katz, J. P. Mack, and A. M. Skalka. 1992. Residues critical for retroviral integrative recombination in a region that is highly conserved among retroviral/retrotransposon integrases and bacterial insertion sequence transposases. Mol. Cell. Biol. 122331-2338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kunst, F., N. Ogasawara, I. Mozser, A. M. Albertini, G. Alloni, V. Azebedo, M. G. Bertero, P. Bessieres, A. Bolotin, S. Borchert, R. Borriss, L. Boursier, A. Brans, M. Braun, S. C. Brignell, S. Bron, S. Brouillet, C. V. Bruschi, B. Caldwell, V. Capuano, N. M. Carter, S.-K. Choi, J.-J. Codani, I. F. Connerton, N. J. Cummings, R. A. Daniel, F. Denizot, K. M. Devine, A. Dü Sterhöft, S. D. Ehrlich, P. T. Emmerson, K. D. Entian, J. Errington, C. Fabret, E. Ferrari, D. Foulger, C. Fritz, M. Fujita, Y. Fujita, S. Fuma, A. Galizzi, N. Galleron, S.-Y. Ghim, P. Glaser, A. Goffeau, E. J. Golightly, G. Grandi, G. Guiseppi, B. J. Guy, K. Haga, J. Haiech, C. R. Harwood, A. Hénaut, H. Hilbert, S. Holsappel, S. Hosono, M.-F. Hullo, M. Itaya, L. Jones, B. Joris, D. Karamata, Y. Kasahara, M. Klaerr-Blanchard, C. Klein, Y. Kobayashi, P. Koetter, G. Koningstein, S. Krogh, M. Kumano, K. Kurita, A. Lapidus, S. Lardinois, J. Lauber, V. Lazarevic, S.-M. Lee, A. Levine, H. Liu, S. Masuda, C. Mauë, C. Médigue, N. Medina, R. P. Mellado, M. Mizuno, D. Moestl, S. Nakai, M. Noback, D. Noone, M. O'Reilly, K. Ogawa, A. Ogiwara, B. Oudega, S.-H. Park, V. Parro, T. M. Pohl, D. Portetelle, S. Porwollik, A. M. Prescott, E. Presecan, P. Pujic, B. Purnelle, G. Rapoport, M. Rey, S. Reynolds, M. Rieger, C. Rivolta, E. Rocha, B. Roche, M. Rose, Y. Sadaie, T. Sato, E. Scanlan, S. Schleich, R. Schroeter, F. Scoffone, J. Sekiguchi, A. Sekowska, S. J. Seror, P. Serror, B.-S. Shin, B. Soldo, A. Sorokin, E. Tacconi, T. Takagi, H. Takahashi, K. Takemaru, M. Takeuchi, A. Tamakoshi, T. Tanaka, P. Terpstra, A. Tognoni, V. Tosato, S. Uchiyama, M. Vandenbol, F. Vannier, A. Vassarotti, A. Viari, R. Wambutt, E. Wedler, H. Wedler, T. Weitzenegger, P. Winters, A. Wipat, H. Yamamoto, K. Yamane, K. Yasumoto, K. Yata, K. Yoshida, H.-F. Yoshikawa, E. Zumstein, H. Yoshikawa, and A. Danchin. 1997. The complete genome sequence of the gram-positive bacterium Bacillus subtilis. Nature 390249-256. [DOI] [PubMed] [Google Scholar]
- 37.Kurtz, S., J. V. Choudhuri, E. Ohlebusch, C. Schleiermacher, J. Stoye, and R. Giegerich. 2001. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 294633-4642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Li, H., A. Coghlan, J. Ruan, L. J. Coin, J. K. Hériché, L. Osmotherly, R. Li, T. Liu, Z. Zhang, L. Bolund, G. K. Wong, W. Zheng, P. Dehal, J. Wang, and R. Durbin. 2006. TreeFam: a curated database of phylogenetic trees of animal gene families. Nucleic Acids Res. 34572-580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Liu, J. W., A. G. Porter, B. Y. Wei, and T. Thanabalu. 1996. New gene from nine Bacillus sphaericus strains encoding highly conserved 35.8-kilodalton mosquitocidal toxins. Appl. Environ. Microbiol. 622174-2176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lobry, J. R. 1996. Asymmetric substitution patterns in the two DNA strands of bacteria. Mol. Biol. Evol. 13660-665. [DOI] [PubMed] [Google Scholar]
- 41.Lowe, T. M., and S. R. Eddy. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25955-964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Majka, J., D. Jakimowicz, W. Messer, H. Schrempf, M. Lisowski, and J. Zakrzewska-Czerwińska. 1999. Interactions of the Streptomyces lividans initiator protein DnaA with its target. Eur. J. Biochem. 260325-335. [DOI] [PubMed] [Google Scholar]
- 43.Mason, T. G., and G. Richardson. 1981. Escherichia coli and the human gut: some ecological considerations. J. Appl. Bacteriol. 11-16. [DOI] [PubMed] [Google Scholar]
- 44.Moreno, M. S., B. L. Schneider, R. R. Maile, W. Weyler, and M. H. Saier, Jr. 2001. Catabolite repression mediated by CcpA protein in Bacillus subtilis: novel modes of regulation revealed by whole genome analyses. Mol. Microbiol. 391366-1381. [DOI] [PubMed] [Google Scholar]
- 45.Nakamura, L. K. 2000. Phylogeny of Bacillus sphaericus-like organisms. Int. J. Syst. Evol. Microbiol. 501715-1722. [DOI] [PubMed] [Google Scholar]
- 46.Nishiwaki, H., K. Nakashima, C. Ishida, T. Kawamura, and K. Matsuda. 2007. Cloning, functional characterization, and mode of action of a novel insecticidal pore-forming toxin, sphaericolysin, produced by Bacillus sphaericus. Appl. Environ. Microbiol. 733404-3411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ogata, H., S. Goto, K. Sato, W. Fujibuchi, H. Bono, and M. Kanehisa. 1999. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2729-34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Okinaka, R. T., K. Cloud, O. Hampton, A. R. Hoffaster, K. K. Hill, P. Keim, T. M. Koehler, G. Lamke, S. Kumano, J. Mahillon, D. Manter, Y. Martinez, D. Ricke, R. Svensson, and P. J. Jackson. 1999. Sequence and organization of pXO1, the large Bacillus anthracis plasmid harboring the anthrax toxin genes. J. Bacteriol. 1816509-6515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Pollmann, K., J. Raff, M. Merroun, K. Fahmy, and S. Selenska-Pobell. 2006. Metal binding by bacteria from uranium mining waste piles and its technological applications. Biotechnol. Adv. 2458-68. [DOI] [PubMed] [Google Scholar]
- 50.Priest, F. G. 1992. Biological control of mosquitoes and other biting flies by Bacillus sphaericus and Bacillus thuringiensis. J. Appl. Bacteriol. 72357-369. [DOI] [PubMed] [Google Scholar]
- 51.Priest, F. G., L. Ebdrup, V. Zahner, and P. Carter. 1997. Distribution and characterization of mosquitocidal toxin genes in some strains of Bacillus sphaericus. Appl. Environ. Microbiol. 631195-1198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Rasko, D. A., J. Ravel, O. A. Økstad, E. Helgason, R. Z. Cer, L. Jiang, K. A. Shores, D. E. Fouts, N. J. Tourasse, S. V. Angiuoli, J. Kolonay, W. C. Nelson, A. B. Kolstø, C. M. Fraser, and T. D. Read. 2004. The genome sequence of Bacillus cereus ATCC 10987 reveals metabolic adaptations and a large plasmid related to Bacillus anthracis pXO1. Nucleic Acids Res. 32977-988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Reizer, J., C. Hoischen, F. Titgemeyer, C. Rivolta, R. Rabus, J. Stülke, D. Karamata, M. H. Saier, Jr., and W. Hillen. 1998. A novel protein kinase that controls carbon catabolite repression in bacteria. Mol. Microbiol. 271157-1169. [DOI] [PubMed] [Google Scholar]
- 54.Russell, B. L., S. A. Jelley, and A. A. Yousten. 1989. Carbohydrate metabolism in the mosquito pathogen Bacillus sphaericus 2362. Appl. Environ. Microbiol. 55294-297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Saier, M. H., Jr. 1996. Cyclic AMP-independent catabolite repression in bacteria. FEMS Microbiol. Lett. 13897-103. [DOI] [PubMed] [Google Scholar]
- 56.Salzberg, S. L., A. L. Delcher, S. Kasif, and O. White. 1998. Microbial gene identification using interpolated Markov models. Nucleic Acids Res. 26544-548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Scarlett, F. A., and J. M. Turner. 1976. Microbial metabolism of amino alcohols. Ethanolamine catabolism mediated by coenzyme B12-dependent ethanolamine ammonia lyase in Escherichia coli and Klebsiella aerogenes. J. Gen. Microbiol. 95173-176. [DOI] [PubMed] [Google Scholar]
- 58.Siefert, J. L., M. Larios-Sanz, L. K. Nakamura, R. A. Slepecky, J. H. Paul, E. R. Moore, G. E. Fox, and P. Jurtshuk, Jr. 2000. Phylogeny of marine Bacillus isolates from the Gulf of Mexico. Curr. Microbiol. 4184-88. [DOI] [PubMed] [Google Scholar]
- 59.Stanley, C. H., J. J. Gauthier, and D. J. Tipper. 1975. Ultrastructural studies of sporulation in Bacillus sphaericus. J. Bacteriol. 1221322-1338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Stothard, P., and D. S. Wishart. 2005. Circular genome visualization and exploration using CGView. Bioinformatics 21537-539. [DOI] [PubMed] [Google Scholar]
- 61.Stülke, J., and W. Hillen. 2000. Regulation of carbon catabolism in Bacillus species. Annu. Rev. Microbiol. 54849-880. [DOI] [PubMed] [Google Scholar]
- 62.Tatusov, R. L., D. A. Natale, I. V. Garkavtsev, T. A. Tatusova, U. T. Shankavaram, B. S. Rao, B. Kiryutin, M. Y. Galperin, N. D. Fedorova, and E. V Koonin. 2001. The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 2922-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Tettelin, H., K. E. Nelson, I. T. Paulsen, J. A. Eisen, T. D. Read, S. Peterson, J. Heidelberg, R. T. DeBoy, D. H. Haft, R. J. Dodson, A. S. Durkin, M. Gwinn, J. F. Kolonay, W. C. Nelson, J. D. Peterson, L. A. Umayam, O. White, S. L. Salzberg, M. R. Lewis, D. Radune, E. Holtzapple, H. Khouri, A. M. Wolf, T. R. Utterback, C. L. Hansen, L. A. McDonald, T. V. Feldblyum, S. Angiuoli, T. Dickinson, E. K. Hickey, I. E. Holt, B. J. Loftus, F. Yang, H. O. Smith, J. C. Venter, B. A. Dougherty, D. A. Morrison, S. K. Hollingshead, and C. M. Fraser. 2001. Complete genome sequence of a virulent isolate of Streptococcus pneumoniae. Science 293498-506. [DOI] [PubMed] [Google Scholar]
- 64.Thanabalu, T., J. Hindley, J. Jackson-Yap, and C. Berry. 1991. Cloning, sequencing, and expression of a gene encoding a 100-kilodalton mosquitocidal toxin from Bacillus sphaericus SSII-1. J. Bacteriol. 1732776-2785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Thanabalu, T., and A. G. Porter. 1996. A Bacillus sphaericus gene encoding a novel type of mosquitocidal toxin of 31.8 kDa. Gene 17085-89. [DOI] [PubMed] [Google Scholar]
- 66.Thiery, I., C. Back, P. Barbazan, and G. Sinègre. 1996. Application de Bacillus thuringiensis et de B. sphaericus dans la démoustication et la lutte contre les vecteurs de maladies tropicales. Ann. Inst. Pasteur Actual. 7247-260. [Google Scholar]
- 67.Tinsley, E., and S. A. Khan. 2006. A novel FtsZ-like protein is involved in replication of the anthrax toxin-encoding pXO1 plasmid in Bacillus anthracis. J. Bacteriol. 1882829-2835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Wati, M., T. Thanabalu, and A. G. Porter. 1997. Gene from tropical Bacillus sphaericus encoding a protease closely related to subtilisins from Antarctic bacilli. Biochim. Biophys. Acta 135256-62. [DOI] [PubMed] [Google Scholar]
- 69.Weickert, M. J., and G. H. Chambliss. 1990. Site-directed mutagenesis of a catabolite repression operator sequence in Bacillus subtilis. Proc. Natl. Acad. Sci. USA 876238-6242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Xu, D., and J. C. Cote. 2003. Phylogenetic relationships between Bacillus species and related genera inferred from comparison of 3′ end 16S rDNA and 5′ end 16S-23S ITS nucleotide sequences. Int. J. Syst. Evol. Microbiol. 53695-704. [DOI] [PubMed] [Google Scholar]
- 71.Yoshida, K. I., K. Kobayashi, Y. Miwa, C. M. Kang, M. Matsunaga, H. Yamaguchi, S. Tojo, M. Yamamoto, R. Nishi, N. Ogasawara, T. Nakayama, and Y. Fujita. 2001. Combined transcriptome and proteome analysis as a powerful approach to study genes under glucose repression in Bacillus subtilis. Nucleic Acids Res. 296683-6692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Yousten, A. A., and W. D. Elizabeth. 1982. Ultrastructural analysis of spores and parasporal crystals formed by Bacillus sphaericus 2297. Appl. Environ. Microbiol. 441449-1455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Yuan, Z. M., C. Neilsen-LeRoux, N. Pasteur, A. Delecluse, J. F. Charles, and R. Frutos. 1999. Cloning and expression of the bin genes of Bacillus sphaericus C3-41 in a crystal minus B. thuringiensis subsp. israelensis. Acta Biol. Sin. 3929-35. [PubMed] [Google Scholar]
- 74.Yuan, Z. M., C. Rang, R. C. Maroun, J. P. Victor, R. Frutos, N. Pasteur, C. Vendrely, C. Jean-Francois, and C. Nielsen-LeRoux. 2001. Identification and molecular structural prediction analysis of a toxicity determinant in the Bacillus sphaericus crystal larvicidal toxin. Eur. J. Biochem. 2682751-2760. [DOI] [PubMed] [Google Scholar]
- 75.Zhang, Y., E. Liu, C. Cai, and Z. Chen. 1987. Isolation of two highly toxic Bacillus sphaericus strains. Insecticidal Microorg. 198-99. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.