Abstract
Spore formation by Bacillus subtilis is governed by global changes in gene transcription. We used nylon-substrate DNA arrays representing ≈96% of the predicted open reading frames in the B. subtilis chromosome to compare the pattern of transcripts from wild-type cells with the pattern from cells mutant for the sporulation transcription factors Spo0A or σF. We found 520 genes whose transcript levels were at least 3-fold dependent on Spo0A but not on σF, and an additional 66 genes whose transcript levels were dependent upon both regulatory proteins. Two strategies were used to help assign genes to the direct control of a particular developmental regulatory protein. In one approach, we analyzed the effects on global gene expression of artificially producing a constitutively active form of Spo0A during growth. In a second approach, Hidden Markov models were used to identify promoters likely to be activated by Spo0A, σF, or a third sporulation transcription factor, σE. In addition to detecting known sporulation genes, we identified many genes of unknown function whose patterns of expression and regulation suggest that they could be involved in sporulation. Disruption of two such newly identified genes, yabP and yabQ, blocked sporulation at a late stage.
Keywords: hidden Markov models, functional genomics, DNA arrays
Sporulation by Bacillus subtilis is governed by a program of gene transcription involving more than 125 genes (reviewed in ref. 1). However, a description of global changes in gene expression during the course of spore formation has been lacking. Recent methods for creating DNA arrays with large numbers of individual genes has made it possible to visualize genome-wide changes in gene transcript levels, as, for example, during sporulation in budding yeast and during the response of human fibroblasts to serum (2, 3). Taking advantage of the availability of a complete genome sequence for B. subtilis (4), we sought to apply transcriptional profiling to the process of sporulation in this bacterium.
Entry into sporulation is principally governed by the DNA-binding protein Spo0A, which is both an activator and a repressor of transcription. The activity of Spo0A, which is a member of the response regulator family of proteins, is controlled through phosphorylation by a phosphorelay that integrates environmental and physiological signals in the decision to sporulate (5). Among the targets of the phosphorylated form, Spo0A∼P, are one or more unknown genes involved in the formation of a polar septum, which divides the cell into forespore and the mother cell compartments (6). Spo0A∼P also directs the transcription of genes and operons (spoIIA, spoIIE, spoIIG) involved in the activation of the compartment-specific regulatory proteins σF and σE, which direct transcription in the forespore and the mother cell, respectively (1). Spo0A also activates transcription of certain genes indirectly by repressing the gene for the “transition-state regulator” AbrB (7). Among the targets of AbrB is the gene (sigH) for the early-appearing sporulation transcription factor σH (8). Later in development, σF is replaced in the forespore by σG and σE is replaced in the mother cell by σK (1).
We sought to identify genes that are activated or repressed at early to middle stages of sporulation by comparing transcripts present during growth and sporulation of the wild type with transcripts present in mutants for Spo0A and σF. Because σF is required for the appearance of σE and all other transcription factors downstream in the sporulation pathway (1), genes whose expression depends on Spo0A, but not on σF, would be candidates for targets of Spo0A. Genes whose expression depends on both Spo0A and σF could be under the direct control of σF or of a downstream transcription factor in the sporulation pathway. Two additional approaches were used to identify genes under the control of Spo0A and other regulatory proteins. In one, we monitored gene expression in cells engineered to produce a constitutively active form of Spo0A (Spo0A-Sad67) during growth (9). In a second approach, we used hidden Markov models (HMMs) trained on known promoter sequences to identify genes that were likely to be under the control of Spo0A, σF, or σE. This analysis enabled us to discover many additional genes that are likely to be part of the sporulation program, two of which are shown to be essential for spore formation.
Materials and Methods
Strains and Media.
The strains used were as follows: RL2242 (spo0A∷spc); RL1104 (amyE∷Pspac-spo0A-sad67 cat, spo0AΔerm); RL1265 (sigF∷kan); PY79 (prototroph); RL2243 (yabP∷spc); RL2244 (yabQ∷tet); RL2245 [yabP∷spc thrC∷pCW63(cotD-lacZ erm)]; and RL2246 [yabQ∷tet thrC∷pCW63(cotD-lacZ erm)]. Cells were grown overnight on LB agar and inoculated into 30 ml of Difco sporulation medium (DSM) at OD600 ≈ 0.05 (10). Parallel cultures of PY79, RL2198, and RL1265 were grown at 37°C until they reached an OD600 of ≈0.3, and samples were then diluted into 30 ml of fresh DSM to achieve similar cell densities. At times T−1.5, T0, and T2 relative to the end of the exponential growth, 7 ml of culture was withdrawn, and the cells were pelleted and frozen in liquid N2. Induction of Pspac-spo0A-sad67 was accomplished by growing RL1104 in LB at 37°C until OD600 ≈ 0.3. The culture was split, and isopropyl β-d-thiogalactoside (IPTG) was added to one half at a final concentration of 1 mM. Cells were harvested at 0, 15, 30, and 60 min, and handled as before. Deletion mutants of yabP (RL2243) and yabQ (RL2246) were created by using the long-flanking homology PCR strategy (11).
Transcriptional Profiling.
We used custom nylon-substrate PCR-product-based arrays. PCR amplification in 96-well plates was carried out with a set of primers for 4,100 B. subtilis ORFs (Eurogentech; Seraing, Belgium). PCR products were subjected to electrophoresis on agarose gels and scored, and ≈96% of the 4,100 ORFs of B. subtilis were represented. RNA was prepared by using the hot acid/phenol method (protocols available at http://mcb.harvard.edu/losick). Random hexamer-primed reverse transcription reactions in the presence of [α-33P]dCTP using 7.5 μg of total RNA were used to create cDNA probes. Spin-purified probes were divided and hybridized to replicate array membranes at 68°C for 18 h. Membranes were washed, baked, and exposed overnight to phosphoimager screens. A Fuji phosphoimager and ArrayVision software (Imaging Research, St. Catherines, ON, Canada) were used to quantify hybridization intensities, and for background subtraction. Spots were associated with ORFs by using Expression Explorer (Millennium Pharmaceuticals), and intensity distributions were median normalized to a value of 1. A minimum threshold value of 0.1 was set for all data points to avoid spurious expression level ratios at the bottom of the spot intensity range. Datasets were imported into a FileMaker Pro (FileMaker) database (http://mcb.harvard.edu/losick). Expression level ratios were calculated with averaged values obtained from replicate filters, and we considered only those ORFs showing at least a 3-fold difference. We superimposed on this basic criterion a statistical heuristic to eliminate ORFs that gave inconsistent hybridization by calculating a 90% confidence interval on the replicate data and asking that ratios calculated at the extremes of the confidence intervals remained greater than 3-fold.
HMMs.
Multiple alignments of known promoter recognition sequences for each specific transcription factor were annotated with hand-specified model structures; regions spanning the −35 to −10 of each promoter were assigned to match states, and bases in the spacer region were replaced with ambiguity characters representing the background nucleotide distribution of B. subtilis. Separate models were built in both orientations, and for known allowed sizes of the spacer. The hmmbuild module of the hmmer 2.1.1 package (http://hmmer.wustl.edu) was used to create HMMs global with respect to the model, but local with respect to the search string (12). To model Spo0A-activated σA-dependent promoters, we made hybrid HMMs that combined the probabilities of known “−35” regions for promoters recognized by σA–RNA polymerase with the derived probabilities of Spo0A-binding sites, separated by spacers the same size as for known Spo0A-activated σA-dependent promoters (spoIIG, 21 bp; spoIIE, 22 bp). In the case of the σF model, little training data are available, but since promoter-recognition sequences for σF and σG have significant similarities, we also constructed a model trained on known σF and σG promoters. The hmmsearch module was then used to search genome release 14.2 (4, 13). Custom software was used to correlate the position of hits with respect to nearby ORFs.
Results and Discussion
Transcriptional Profiling During Sporulation.
Experiments were carried out with RNA from wild-type cells and from cells mutant for Spo0A or for σF. Cells were harvested 1.5 h before (T−1.5), at (T0), and 2 h after (T2) the start of sporulation. Radioactive cDNAs were generated from the RNAs and incubated with nylon filters spotted with PCR products corresponding to almost all of the annotated ORFs in the genome. As an indication of reproducibility, spot intensities were compared between pairs of filters hybridized with the same radioactive cDNAs. The results (Fig. 1A is a representative example) showed good reproducibility between filters; in all cases the Pearson correlation coefficient (r) of replicate filter pairs was >0.96. In contrast, a scatterplot (Fig. 1B) comparing the intensity of spots generated with cDNAs from wild-type cells with those generated with cDNAs from a Spo0A mutant showed that the mutation caused significant changes in the expression profile. As an example of the primary data, Fig. 2 shows a close-up of the autoradiographs for a small region of arrays that had been hybridized with radioactive cDNA generated from RNAs from wild-type and σF mutant cells undergoing sporulation.
We used the clustering technique of Eisen et al. (14) to represent the expression profiles of all genes (586) whose transcript levels were at least 3-fold different between the wild type and the Spo0A mutant during at least one of the time points (Fig. 3A). Of these 586 genes, 266 exhibited at least a 5-fold dependence in transcript levels on Spo0A, and 83 exhibited greater than a 10-fold dependence. The first three columns of the cluster diagram compare the ratio of the hybridization signals between the wild type and the Spo0A mutant for RNAs from cells at T−1.5, T0, and T2. The fourth column compares the ratio of signals between the wild type and the σF mutant at T2.
Fig. 3A shows that the genes fell into three categories. Category I were genes whose expression was higher in the wild type than in the Spo0A mutant (during at least one of the time points) but whose expression was not significantly different between wild type and the σF mutant. Category I therefore represented genes whose expression was dependent on Spo0A but not on σF. Category II was genes with lower expression in the wild type than in the Spo0A mutant during at least one of the time points. This category represented genes whose expression was inhibited by Spo0A. Finally, category III was genes whose expression was higher in the wild type than in both the Spo0A mutant and the σF mutant. This category represented genes whose expression was under the control of σF or of a downstream transcription factor in sporulation.
Genes Whose Expression Depended on Spo0A but Not σF.
The largest group of genes (283) were in category I. Representative members are shown in the zoom in Fig. 3A. Consistent with the idea that genes in this category are directly or indirectly dependent on Spo0A∼P for their expression, only 47 of the 283 genes in category I showed a significant dependence on spo0A during the mid-exponential phase of growth (the T−1.5 time point); it was not until at or after the start of sporulation, when Spo0A∼P becomes abundant, that the majority of genes in this category showed a clear difference in expression levels between the wild type and the spo0A mutant.
Category I included sporulation genes whose transcription was previously known to be under the direct control of Spo0A, such as spo0F, spoIIA, and spoIIG (1, 15). It also included sporulation genes whose transcription was known to be indirectly dependent on Spo0A, such as kinA and spoVG (16, 17). Included in category I were various stationary-phase genes, including aprE, nprB, vpr, and epr, which encode proteases, pel, which encodes a pectate lyase (18), and csn, which specifies a chitosanase. Also in category I were genes for a nonribosomal peptide synthase (srfAB, srfAD), a peptide antibiotic (sbo), and polyketide synthetases (pksD and pksE). Several metabolic genes were also in this category, including genes involved in glutamine uptake (glnH) and histidine utilization (hutH, hutI, hutG).
In addition, category I included an operon (ctaC,D,E,F,G) encoding an alternative terminal cytochrome c oxidase of the caa3 family. This is of interest because sporulation defects arise from mutations in genes involved with energy production, notably those in the tricarboxylic acid cycle (19). A sporulation phenotype is also seen in a mutant of the divergently transcribed ctaA gene, which is needed for the biosynthesis of heme A, a prosthetic group required by the caa3 oxidases (20, 21). Significant expression of ctaC,D,E,F,G occurs only during stationary phase, and it is subject to catabolite control (22). If the encoded cytochrome oxidase is involved with sporulation, its regulation could contribute to the catabolite control of sporulation.
Other notable genes in this category were those involved in oligopeptide transport (dppB, dppD, dppE, appD, appF, appB), genes for peptide pheromones (phrE, phrF, phrG, phrI), and genes (rapC, rapE, rapF, rapI) for regulatory aspartyl phosphatases, which are the presumptive targets of the pheromones (23). Many of the previously known genes in category I are indirectly linked to Spo0A. For example, some of the extracellular protease genes are negatively regulated by AbrB, the gene for which is negatively regulated by Spo0A∼P or by the product of hpr, which is activated by AbrB (24).
The large majority of genes in category I were of unknown function. Many either exhibited no significant homology to other genes in the databases or were homologous to other proteins of unknown function. Some of the previously unknown genes in category I did exhibit homology to genes of known function. For example, ykuA encodes a protein with similarity to penicillin-binding proteins. Several of the genes of unknown function in this category appear to be organized in operons. Two examples are the gene cluster that includes yxbB, yxbA, and yxnB and the cluster that includes ybcO, ybcP, ybcQ, ybcS, ybcT, ybdA, and ybdB.
Genes Whose Expression Was Inhibited by Spo0A.
Category II represented genes (242) whose expression was directly or indirectly inhibited by Spo0A. Representative members of category II are shown in the zoom in the bottom right of Fig. 3A. Category II included genes that were previously known to be subject to repression by Spo0A, such as the transition state regulator abrB (7). As expected, we found that abrB transcript levels were much higher in the Spo0A mutant than in the wild type at all of the time points. Category II also included genes whose expression was indirectly linked to Spo0A. For example, it included the ribose transport operon (rbsR,K,D,A,C,B), whose expression under conditions of catabolite repression is known to be dependent on AbrB (25). Thus, expression of the ribose transport operon is enhanced in a Spo0A mutant because of overexpression of abrB. Other notable genes in category II were chemotaxis genes (mcpA, fliJ, cheB, cheA, cheW, and cheC), motility genes (flgC, flgE, flhO, fliD, fliF, fliG, fliK, and fliY) and autolysin genes (lytD and lytE), which were overexpressed in the spo0A mutant. The presence of chemotaxis, motility, and autolysin genes in this category illustrates the complex pleiotropy of spo0A mutations. These genes are transcribed by the σD transcription factor, which is encoded by sigD, and expression of sigD is positively regulated by the SinR transition state regulator (26). However, SinR activity is impaired when it interacts with SinI (27), the gene for which is transcribed in a Spo0A∼P-dependent manner. A spo0A mutation therefore results in abnormally high levels of σD-dependent gene transcription.
Genes Whose Expression Depended on Both Spo0A and σF.
Category III consisted of 66 genes, which are shown in the middle zoom on the right of Fig. 3A and in Fig. 3B. Consistent with the idea that expression of these genes was dependent on σF, genes in category III were expressed more highly in the wild type than in either the Spo0A mutant or the σF mutant, and in almost all cases this selective expression was not observed until T2, the earliest time point by which σF would have become active in directing sporulation gene transcription. Because the appearance of σE requires the prior activation of σF (1) and because some σE-directed transcription would have commenced by T2, category III was expected to include genes under the control of σF or σE.
Category III included many genes known to be under the control of σF (or both σF and σG), such as spoIIQ, dacF, sleB, sspE, and ywlB, or σE, such as cotE, spoIVA, spoIVCA, spoIID, spoIIID, spoIIIAH, spoVR, spoVID, usd, mbl, and ysxE. However, certain other genes known to be under the control of σF, such as spoIIR and katX, or σE, such as the spoIIIA operon and spoIVCB, were missing from category III because of the low level and/or timing of their expression. Category III included many genes not known to be expressed during sporulation. Some of these newly identified sporulation-controlled genes encode proteins with homology to proteins with known functions. For example, we detected an apparent operon (yngJ,I,H,G,F,E) whose products appear to be involved in the metabolism of long-chain fatty acids. Thus, the products of yngJ, yngI, yngH, yngG, yngF, and yngE are similar, respectively, to a butyryl-CoA dehydrogenase, a long-chain acyl-CoA synthetase, a biotin carboxylase, a hydroxymethylglutaryl-CoA ligase, a hydroxybutyryl-CoA dehydratase, and a propionyl-CoA carboxylase. Likewise, yodT, yodS, and yodR appear to constitute an operon whose products are similar, respectively, to an adenosylmethionine-8-amino-7-oxononanoate aminotransferase, a 3-oxoadipate CoA-transferase, and a butyrate-acetoacetate CoA-transferase. In keeping with a sporulation role for fatty acid metabolism, mmg, a previously identified operon involved with fatty acid utilization, is known to exhibit a σE-dependent mode of transcription (28).
Category III also included several previously annotated genes that had not been recognized as being expressed under sporulation control. Examples are murA, which is homologous to an Escherichia coli gene involved in peptidoglycan biosynthesis (29), prkA, which encodes a serine protein kinase of unknown function (30), and comER (31).
Alterations in the Global Pattern of Gene Expression in Cells Engineered to Produce a Constitutively Active Form of Spo0A During Growth.
As a complementary approach to identifying genes under the control of Spo0A, we examined the effect on global gene expression of producing a constitutively active form of Spo0A during growth. Spo0A-Sad67 is a mutant form of Spo0A whose activity does not require phosphorylation (9). Synthesis of Spo0A-Sad67 was induced during growth by using the inducer IPTG and a fusion of spo0A-sad67 to the IPTG-inducible promoter Pspac. RNA was extracted from cells collected at 0, 15, 30, and 60 min after the addition of inducer and from untreated cells collected at the same times. Transcriptional profiling revealed 29 genes for which there was at least a 3-fold difference in transcript levels between the treated and the untreated cells by 15 min after the addition of inducer. By 30 min this number increased to 67 and, finally, by 60 min, 246 genes were observed whose transcript levels were significantly altered by treatment with inducer. Genes whose expression was altered in the earlier time points were more likely to be under the direct control of Spo0A than genes for which an alteration in transcript levels was not observed until an hour after the addition of inducer.
Among genes whose expression rapidly increased in response to IPTG were known targets of Spo0A, such as spoIIGA, sigE, spoIIB, and sinI. These genes are transcribed from Spo0A-controlled promoters that are recognized by σA–RNA polymerase (1). For example, after 15 min the level of spoIIGA transcripts was 25-fold higher in the treated than in the untreated cells. Conversely, transcript levels for abrB, which is repressed by Spo0A (7), were 26-fold lower in the induced cells than in the uninduced cells at the same time point. Of the 67 genes whose expression was altered by the 30-min time point, 37 genes had higher expression in the treated than in the untreated cells. Of these, 26 were previously uncharacterized genes, and several had similarities to known genes: ykuA is similar to penicillin-binding gene 2b of Streptococcus pneumonia; ywqD, yveR, and yveN are similar to spore coat or capsular polysaccharide biosynthetic genes (ywqD to capB of Staphylococcus aureus; yveR to numerous galactosyl- and glycosyltransferases, including the spsA spore coat polysaccharide biosynthesis gene of Synechocystis sp.; yveN to lipopolysaccharide synthesis genes, including wbnE of E. coli); and yvhJ, yerO, and slr are similar to transcriptional regulators (yvhJ to lytR of B. subtilis; yerO to members of the tetR/acrR family, and slr to sinR of B. subtilis). Also noteworthy was an apparent operon formed by ywcI and sacT, the antiterminator of a sucrose utilization operon.
Among the genes whose transcript levels decreased in response to inducer were several flagellar protein genes (fliH, fliK, flgC, fliE), an uncharacterized (in B. subtilis) homolog of the E. coli cell division gene ftsE, and several genes involved in purine biosynthesis (purD, purE, purH, purM). A possible role of Spo0A in the repression of purine biosynthetic genes is intriguing because it has long been appreciated that the onset of sporulation is accompanied by a decrease in the cellular concentrations of GDP and GTP, and inhibition of guanine synthesis can induce sporulation in rich medium (32). Conceivably, Spo0A sets up a self-reinforcing loop for entry into sporulation by repressing purine biosynthetic genes.
Finally, we note that among the genes whose expression was enhanced or repressed in response to IPTG were 24 genes whose expression was influenced by Spo0A during sporulation in the transcriptional profiling experiment of Fig. 3A. These 24 genes, which are annotated with red text in Fig. 3C, are our best candidates for genes that are directly under the control of Spo0A. Included in this group is one anomalous case [cotD (1)], which we presume is attributable to a misamplified PCR product.
Promoter Recognition Sequence Prediction.
To gain further insight into the regulatory mechanisms governing the expression of the genes identified in our transcriptional profiling experiments, we constructed a series of HMMs that probabilistically modeled promoter sequences recognized by RNA polymerase containing σF and σE, as well as Spo0A-activated promoters recognized by σA–RNA polymerase (12). Fig. 3B shows that of the 48 novel or poorly characterized genes that were found to have transcription profiles characteristic of regulation by σF or σE, we were able to predict a likely promoter for 34, of which 11 were assigned to control by σF, and 22 were assigned to control by σE (one gene, ybaN, was predicted to have both σE- and σF-recognized promoters). Although there are doubtless false positives in our assignments, the percentage (70%) of promoters that could be assigned to control by σF or σE was higher than could occur by chance, and the promoters predicted for the genes identified by transcriptional profiling tended to be located within 100 bp of the initiation codon.
Fig. 3C presents the results of the HMM analysis for Spo0A-activated promoters recognized by σA–RNA polymerase. Among genes identified as potential targets of Spo0A through the use Spo0A-Sad67 (Fig. 3C), yjcP, yneE, ybdA, and ybcO,P were preceded by sequences that significantly conformed to our model for Spo0A-activated promoters. Of particular interest is a cluster of seven genes consisting of the genes ybcO, ybcP, ybcQ, ybcS, and ybcT and ybdA and ybdB, which could constitute an operon. This operon is a strong candidate for a transcription unit under the direct control of Spo0A: it was activated in a Spo0A-dependent manner during sporulation; it was switched on during growth in response to the induced synthesis of Spo0A-Sad67 (Fig. 3B); and it appears to be preceded by a sequence conforming to that of Spo0A-activated promoters recognized by σA–RNA polymerase. None of the products encoded by this putative operon are similar to previously characterized proteins, except for the products of ybdA and ybdB, which resemble, respectively, the binding and permease components of ABC transporters.
yabP and yabQ Are Required for Sporulation.
Our profiling and HMM analyses have revealed many previously uncharacterized genes that appear to be under sporulation control. We have begun to build null mutations of many of these genes in an effort to identify those that play a role in sporulation. One such case is that of the adjacent yabP and yabQ genes, which appear to constitute an operon. The upstream member of the putative operon, yabP, had an expression profile consistent with regulation by σF or σE. HMM analysis predicted a sequence, beginning 72 bp upstream of the initiation codon, with similarity to promoter sequences recognized by σE-containing RNA polymerase. yabP is predicted to encode a 100-residue protein, and yabQ is predicted to encode a 211-residue protein with five membrane-spanning domains. The putative yabPQ operon is immediately upstream of the cell division gene divIC and is located near and between known sporulation genes spoVT and spoIIE. We found that the replacement of yabP or yabQ with a drug-resistance cassette led to a severe sporulation defect. Similar conclusions have been reached by the Japanese functional analysis network (http://bacillus.genome.ad.jp). Examination of sporulating cells of the yabP and the yabQ mutants by phase-contrast microscopy revealed the production of phase-gray spores that failed to brighten, indicating a block at a late stage of sporulation (data not shown). We found orthologs of both yabP and yabQ in Clostridium acetobutylicum ATCC824 (Genome Therapeutics Corp.), Clostridium difficile (Sanger Centre), and Bacillus anthracis (The Institute for Genomic Research) (an alignment is shown in Fig. 4). In these organisms, the local gene order is the same as in B. subtilis (yobO-yabP-yabQ-divIC), but the same region in Listeria monocytogenes, a non-spore-forming Gram-positive bacterium, is simply yobO-divIC (as brought to our attention by P. Stragier, personal communication).
Summary.
Transcriptional profiling reinforces the view that sporulation involves global changes in gene expression and has led to the identification of many previously uncharacterized genes whose expression is under sporulation control. In total, we identified 586 genes, representing more than 10% of the ORFs in the B. subtilis genome (4), whose level of expression was altered relative to that of a Spo0A mutant at least 3-fold during the early to middle stages of sporulation. We were able to tentatively assign many of these genes to regulatory classes through the use of mutants for the sporulation regulatory proteins Spo0A and σF, through the use of cells engineered to produce Spo0A-Sad67 during growth, and through an informatics approach based on HMMs trained on known sporulation promoters.
The large majority of genes identified in our analysis, corresponding to categories I and III, were activated or repressed early in sporulation under the direct or indirect control of Spo0A. This indicates that the master regulator for entry into sporulation causes profound changes in the global pattern of gene expression as cells switch from growth to differentiation. Among the newly identified candidates for genes under the direct control of Spo0A are yjcP, yneE, and a putative seven-member operon consisting of the genes ybcO, ybcP, ybcQ, ybcS, ybcT, ybdA, and ybdB. Transcriptional profiling and informatics also led to the identification of several previously uncharacterized genes that appear to be under the control of σF or σE. Two of these genes, yabP and yabQ, were shown to be essential for spore formation, and we suspect that others in this category play a role in sporulation. Disruption of previously uncharacterized genes from our analysis should provide a more complete view of how global changes in gene expression help govern differentiation.
Acknowledgments
We thank N. Su and C. Murphy for assistance with profiling and R. Britton, E. Gonzalez, F. Gueiros-Filho, C. Price, and A. L. Sonenshein for advice. Work at Harvard was supported by National Institutes of Health Grant GM18568 to R.L. and a gift from Millennium Pharmaceuticals. P.E. was a postdoctoral fellow of the Human Frontier Science Program.
Abbreviations
- HMM
hidden Markov model
- IPTG
isopropyl β-d-thiogalactoside
Footnotes
Article published online before print: Proc. Natl. Acad. Sci. USA, 10.1073/pnas.140209597.
Article and publication date are at www.pnas.org/cgi/doi/10.1073/pnas.140209597
References
- 1.Stragier P, Losick R. Annu Rev Genet. 1996;30:297–241. doi: 10.1146/annurev.genet.30.1.297. [DOI] [PubMed] [Google Scholar]
- 2.Chu S, DeRisi J, Eisen M, Mulholland J, Botstein D, Brown P O, Herskowitz I. Science. 1998;282:699–705. doi: 10.1126/science.282.5389.699. [DOI] [PubMed] [Google Scholar]
- 3.Iyer V R, Eisen M B, Ross D T, Schuler G, Moore T, Lee J C F, Trent J M, Staudt L M, Hudson J, Jr, Boguski M S, et al. Science. 1999;283:83–87. doi: 10.1126/science.283.5398.83. [DOI] [PubMed] [Google Scholar]
- 4.Kunst F, Ogasawara N, Moszer I, Albertini A M, Alloni G, Azevedo V, Bertero M G, Bessieres P, Bolotin A, Borchert S, et al. Nature (London) 1997;390:249–256. doi: 10.1038/36786. [DOI] [PubMed] [Google Scholar]
- 5.Burbulys D, Trach K A, Hoch J A. Cell. 1991;64:545–552. doi: 10.1016/0092-8674(91)90238-t. [DOI] [PubMed] [Google Scholar]
- 6.Levin P A, Losick R. Genes Dev. 1996;10:478–488. doi: 10.1101/gad.10.4.478. [DOI] [PubMed] [Google Scholar]
- 7.Strauch M, Webb V, Spiegelman G, Hoch J A. Proc Natl Acad Sci USA. 1990;87:1801–1805. doi: 10.1073/pnas.87.5.1801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Weir J, Predich M, Dubnau E, Nair G, Smith I. J Bacteriol. 1991;173:521–529. doi: 10.1128/jb.173.2.521-529.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ireton K, Rudner D Z, Siranosian K J, Grossman A D. Genes Dev. 1993;7:283–294. doi: 10.1101/gad.7.2.283. [DOI] [PubMed] [Google Scholar]
- 10.Harwood C R, Cutting S M. Molecular Biological Methods for Bacillus. New York: Wiley; 1990. [Google Scholar]
- 11.Wach A. Yeast. 1996;12:259–265. doi: 10.1002/(SICI)1097-0061(19960315)12:3%3C259::AID-YEA901%3E3.0.CO;2-C. [DOI] [PubMed] [Google Scholar]
- 12.Durbin R, Eddy S, Krogh A, Mitchison G. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge, U.K.: Cambridge Univ. Press; 1998. [Google Scholar]
- 13.Moszer I, Glaser P, Danchin A. Microbiology. 1995;141:261–268. doi: 10.1099/13500872-141-2-261. [DOI] [PubMed] [Google Scholar]
- 14.Eisen M B, Spellman P T, Brown P O, Botstein D. Proc Natl Acad Sci USA. 1998;95:14863–14868. doi: 10.1073/pnas.95.25.14863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Asayama M, Yamamoto A, Kobayashi Y. J Mol Biol. 1995;250:11–23. doi: 10.1006/jmbi.1995.0354. [DOI] [PubMed] [Google Scholar]
- 16.Predich M, Nair G, Smith I. J Bacteriol. 1992;174:2771–2778. doi: 10.1128/jb.174.9.2771-2778.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Zuber P, Losick R. J Bacteriol. 1987;169:2223–2230. doi: 10.1128/jb.169.5.2223-2230.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Nasser W, Awade A C, Reverchon S, Robert-Baudouy J. FEBS Lett. 1993;335:319–326. doi: 10.1016/0014-5793(93)80410-v. [DOI] [PubMed] [Google Scholar]
- 19.Ireton K, Jin S, Grossman A D, Sonenshein A L. Proc Natl Acad Sci USA. 1995;92:2845–2849. doi: 10.1073/pnas.92.7.2845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Mueller J P, Taber H W. J Bacteriol. 1989;171:4967–4978. doi: 10.1128/jb.171.9.4967-4978.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Svensson B, Lubben M, Hederstedt L. Mol Microbiol. 1993;10:193–201. doi: 10.1111/j.1365-2958.1993.tb00915.x. [DOI] [PubMed] [Google Scholar]
- 22.Liu X, Taber H W. J Bacteriol. 1998;180:6154–6163. doi: 10.1128/jb.180.23.6154-6163.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Perego M. Trends Microbiol. 1998;6:366–370. doi: 10.1016/s0966-842x(98)01350-x. [DOI] [PubMed] [Google Scholar]
- 24.Perego M, Hoch J A. J Bacteriol. 1988;170:2560–2567. doi: 10.1128/jb.170.6.2560-2567.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Strauch M A. J Bacteriol. 1995;177:6727–6731. doi: 10.1128/jb.177.23.6727-6731.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rashid M H, Sekiguchi J. J Bacteriol. 1996;178:6640–6643. doi: 10.1128/jb.178.22.6640-6643.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Bai U, Mandic-Mulec I, Smith I. Genes Dev. 1993;7:139–148. doi: 10.1101/gad.7.1.139. [DOI] [PubMed] [Google Scholar]
- 28.Bryan E M, Beall B W, Moran C P., Jr J Bacteriol. 1996;178:4778–4786. doi: 10.1128/jb.178.16.4778-4786.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Marquardt J L, Siegele D A, Kolter R, Walsh C T. J Bacteriol. 1992;174:5748–5752. doi: 10.1128/jb.174.17.5748-5752.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fischer C, Geourjon C, Bourson C, Deutscher J. Gene. 1996;168:55–60. doi: 10.1016/0378-1119(95)00758-x. [DOI] [PubMed] [Google Scholar]
- 31.Inamine G S, Dubnau D. J Bacteriol. 1995;177:3045–3051. doi: 10.1128/jb.177.11.3045-3051.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lopez J M, Marks C L, Freese E. Biochim Biophys Acta. 1979;587:238–252. doi: 10.1016/0304-4165(79)90357-x. [DOI] [PubMed] [Google Scholar]
- 33.Besemer J, Borodovsky M. Nucleic Acids Res. 1999;27:3911–3920. doi: 10.1093/nar/27.19.3911. [DOI] [PMC free article] [PubMed] [Google Scholar]