ABSTRACT
The current classification of the phylum Firmicutes (new name, Bacillota) features eight distinct classes, six of which include known spore-forming bacteria. In Bacillus subtilis, sporulation involves up to 500 genes, many of which do not have orthologs in other bacilli and/or clostridia. Previous studies identified about 60 sporulation genes of B. subtilis that were shared by all spore-forming members of the Firmicutes. These genes are referred to as the sporulation core or signature, although many of these are also found in genomes of nonsporeformers. Using an expanded set of 180 firmicute genomes from 160 genera, including 76 spore-forming species, we investigated the conservation of the sporulation genes, in particular seeking to identify lineages that lack some of the genes from the conserved sporulation core. The results of this analysis confirmed that many small acid-soluble spore proteins (SASPs), spore coat proteins, and germination proteins, which were previously characterized in bacilli, are missing in spore-forming members of Clostridia and other classes of Firmicutes. A particularly dramatic loss of sporulation genes was observed in the spore-forming members of the families Planococcaceae and Erysipelotrichaceae. Fifteen species from diverse lineages were found to carry skin (sigK-interrupting) elements of different sizes that all encoded SpoIVCA-like recombinases but did not share any other genes. Phylogenetic trees built from concatenated alignments of sporulation proteins and ribosomal proteins showed similar topology, indicating an early origin and subsequent vertical inheritance of the sporulation genes.
IMPORTANCE Many members of the phylum Firmicutes (Bacillota) are capable of producing endospores, which enhance the survival of important Gram-positive pathogens that cause such diseases as anthrax, botulism, colitis, gas gangrene, and tetanus. We show that the core set of sporulation genes, defined previously through genome comparisons of several bacilli and clostridia, is conserved in a wide variety of sporeformers from several distinct lineages of Firmicutes. We also detected widespread loss of sporulation genes in many organisms, particularly within the families Planococcaceae and Erysipelotrichaceae. Members of these families, such as Lysinibacillus sphaericus and Clostridium innocuum, could be excellent model organisms for studying sporulation mechanisms, such as engulfment, formation of the spore coat, and spore germination.
KEYWORDS: Bacillus, Clostridia, gene loss, genome analysis, phylogenetic analysis, spore coat, spores, taxonomy
INTRODUCTION
A variety of bacteria are capable of producing resting forms, commonly referred to as spores. However, the ability to form heat-, solvent- and UV-resistant endospores has been observed only in members of the phylum Firmicutes (low-G+C Gram-positive bacteria, recently renamed Bacillota) (1–8). Occasional reports of endosporeformers from other phyla have not been validated so far (9, 10). Sporulation enables bacteria to survive adverse environmental conditions. Thus, when this ability is found in human pathogens that cause severe diseases, such as anthrax, botulism, colitis, infectious diarrhea, gas gangrene, sepsis, tetanus, and food poisoning, it makes them particularly dangerous and difficult to eradicate (3, 4). Even sporeformers that used to be considered benign could turn out to be opportunistic pathogens, as exemplified by the recently described involvement of Paenibacillus thiaminolyticus in postinfectious hydrocephalus (11).
Formation of endospores is a complex process that starts with the asymmetric division of a vegetative cell, producing a mother cell and a prespore, and proceeds through several stages of spore maturation. In the best-studied model organism, Bacillus subtilis, the sporulation process affects expression of more than 500 genes, some of which are essential for sporulation, whereas others appear to be involved in various regulatory circuits (12–20). While some genes are exclusively involved in sporulation and are expressed only at certain stages of the process, spores also contain many housekeeping proteins that function during spore germination and subsequent vegetative growth (21–23). Studies on sporulation mechanisms in clostridia identified a somewhat smaller set of sporulation genes than in bacilli, leading to the conclusion that clostridia encode a streamlined version of the sporulation machinery (22, 24–27).
The ability to form spores is widespread in the two major classes of Firmicutes, Bacilli and Clostridia, and also has been observed in the other, more recently described, firmicute lineages (3, 28–30). However, some well-studied taxa within the Bacilli, such as lactobacilli, listeria, staphylococci, and streptococci, do not include any spore-forming members (3). Similarly, no sporeformers have been identified in the clostridial family Halanaerobiaceae (order Halanaerobiales) and in several families of the order Clostridiales (recently renamed Eubacteriales) (3, 31). Many other firmicute lineages include both spore-forming and non-spore-forming members (3).
These observations, coupled with the attempts to identify potential drug targets for extermination of spores, call for identification of the core set of essential sporulation genes that are necessary and, possibly, sufficient for a bacterium to be a sporeformer. The most obvious candidate, the sporulation master regulator Spo0A, indeed, appeared to be essential, as no Spo0A– organism had been shown to form spores (3, 32). However, Spo0A is encoded in the genomes of many nonsporeformers and, therefore, could not be used as a reliable predictor of the sporulation ability (3, 32). Three more genes, sspA, dpaA (spoVFA), and dpaB (spoVFB), initially proposed as sporulation signatures (33), presented the same problem, being present in a variety of nonsporeformers (32).
The availability of complete genome sequences of many diverse firmicutes offers an opportunity to identify sporulation genes through comparative genomics. Following the approach pioneered by Stragier (34), several studies took advantage of the constantly expanding genome list to define the conserved core of sporulation genes that are present in all (or at least most) known sporeformers (2, 32, 35–37). The resulting list included about 60 genes, many of which had been previously shown to be essential for sporulation because mutations in these genes caused sporulation arrest and/or production of immature spores. However, deletion of some other genes from the core set appeared to have only a minor effect on the sporulation efficiency, indicative of a substantial redundancy of the sporulation machinery. Indeed, in B. subtilis, many sporulation genes are found in two or more paralogous forms (37, 38).
Here, we used the latest update of the Clusters of Orthologous Genes (COG) database (39) to analyze the sporulation gene sets in the current genome collection of diverse firmicutes, including members of the recently defined classes Negativicutes, Tissierellia, Erysipelotrichia, and Limnochordia. The COG database covers a limited number of selected, completely sequenced microbial genomes (typically, a single representative per bacterial genus) and features COG-specific patterns of presence-absence of genes in the respective organisms (40, 41). Thus, COG profiles offer an easy way to identify genomes in which a given gene, e.g., one involved in sporulation, is missing (39, 41, 42). Analysis of COG profiles of 180 genomes representing 76 spore-forming and 102 asporogenous species allowed us to detect numerous events of lineage-specific gene loss in various components of the sporulation machinery. Of particular interest was the widespread loss of sporulation genes in the families Planococcaceae and Erysipelotrichaceae, which resulted in a greatly streamlined machinery for engulfment, formation of the spore coat, and spore germination. These observations suggest that certain members of these families could be excellent model organisms for studying the fundamental mechanisms of sporulation in greater molecular detail.
RESULTS
Sporeformer genome collection.
The analyzed set consisted of 180 genomes from 160 genera, representing every named genus of Firmicutes that included at least one completely sequenced genome by 1 April 2019 and several genomes released after that date (39). These organisms belong to six currently recognized classes of Firmicutes, including Bacilli, Clostridia, Erysipelotrichia, Limnochordia, Negativicutes, and Tissierellia, and represent 76 species whose original descriptions mentioned their ability to sporulate, 102 nonsporeformers (based on the descriptions of the respective strains), and two species with an unclear sporulation status (see Table S1 in the supplemental material). At the time of this study, there were no completely sequenced genomes from any representatives of the classes Culicoidibacteria and Thermolithobacteria; members of these classes have been described as non-spore forming (43, 44).
To ensure reliability of the gene presence-absence patterns, this work relied on high-quality genomes included in the recent release of the COG database (39), most of which had been vetted by the NCBI’s RefSeq team and selected as RefSeq reference genomes (45). For two organisms, Caproiciproducens sp. strain NJN-50 and Thermincola potens JR, the sporulation status had not been described, and they had both spore-forming and asporogenous relatives. To ensure proper coverage of the class Erysipelotrichia, its two members featured in the COG database were supplemented with five additional genomes, including two from known sporeformers, [Clostridium] innocuum (the brackets indicate that this organism has been misnamed, which remains to be rectified [28]) and Erysipelatoclostridium (formerly Clostridium) ramosum (28, 46, 47).
Sporulation and genome size.
As reported previously, sporeformers generally have larger genomes than nonsporeformers (32). In our set, the mean genome size of the former was ~3.9 Mb, compared to ~2.7 Mb for the latter (Fig. S1 in the supplemental material). Although the genome size distributions overlapped, the previously reported 2.3-Mb boundary for cultivated sporeformers (32) held for the expanded genome set analyzed here. Only two sporeformers in the set, both still uncultured, “Candidatus Arthromitus” SFB-mouse and “Candidatus Desulforudis audaxviator” MP104C, had genome sizes of less than 2.4 Mb (see Table S1; the recently cultivated spore-forming strain of “Ca. Desulforudis audaxviator” with a 2.2-Mb genome [48] was not part of the COG set). In agreement with the previous reports (3, 49, 50), the ability to sporulate often differed even between closely related organisms: the families Bacillaceae and Planococcaceae in the class Bacilli, most families in the class Clostridia, and the families Tissierellaceae and Erysipelotrichaceae all included both spore-forming and asporogenous members (Table S1). A comparison of the genome sizes of sporeformers and nonsporeformers from different classes is presented in Fig. S1B.
Selection of sporulation genes.
For the purposes of this work, sporulation genes were defined as genes that participate in spore formation but have no known housekeeping roles in vegetative cells. This approach excluded most metabolic enzymes, as well as proteins involved in DNA replication and repair, transcription, translation, motility, secretion, and other processes. However, we included the cell division genes that are (also) involved in asymmetric cell division, as well as Spo0A-regulated genes that function at the onset of sporulation, prespore and forespore genes expressed under the control of SigF and/or SigG, mother cell genes that are expressed under the control of SigE and/or SigK, genes involved in the formation of the spore cortex and spore coat, and genes involved in spore germination. The list of the genes belonging to these groups was compiled based on the previous studies (2, 15, 16, 28, 32, 36, 37) and the data from the SubtiWiki database (http://subtiwiki.uni-goettingen.de/) (38). In the COG database, products of these genes formed 237 clusters of orthologous groups (COGs). We extracted COG phyletic profiles (the patterns of gene presence-absence in the respective genomes) for these sporulation genes and sorted them by their functions and representation among the 76 spore-forming organisms. The list of the most common sporulation genes identified in this manner is presented in Table 1. The patterns of distribution of these genes among all 180 analyzed species are listed in Table S2 in the supplemental material and are shown graphically in Fig. S2.
TABLE 1.
Conservation of sporulation genes in spore-forming Firmicutes
Sporulation stage or regulon | Sporulation gene(s)a |
|
---|---|---|
Universally conserved in Firmicutes | Conserved mostly in Bacilli | |
Sporulation onset and checkpoints; asymmetric cell division | spo0A, spo0H (sigH), spoIIE, spoIIIE, spoIIIJ, spoVC (pth), spoVG,b spoVS,b divIB,c divIC,c divIVA,b ftsA,b ftsE,c ftsH, ftsX,c ftsY, ftsZ, jag, minC,b minD, minJ, obgE, sweCc | spo0B, spo0F, spo0E, spoVN (ald), ftsL, ricA (ymcA), ricF (ylbF), ricT (yaaT), sda |
Spo0A regulon | sigE, sigF, sigG, spoIIAA, spoIIAB, spoIIGA, spo0JA (parA),c spo0JB (parB) | yisK, yusE |
Engulfment | spoIID,c spoIIM,b spoIIP,c spoIIQ, spoIIIAA,b spoIIIAB,b spoIIIAC,b spoIIIAD,b spoIIIAE,b spoIIIAF,b spoIIIAG,c spoIIIAH | spoIIB, spoIIIL, fisB (yunB) |
Forespore-expressed genes | ||
SigF regulon | spoIIR,c spoIVB, spoVT,c dacB/dacF, gerW (ytfJ), ydfS/yetF,c yhcV/ylbB,b ylbC,b yloC,b yuiC,c yyaCb | bofC, rsfA, fin (yabK), yjbA, ymfJ, yqhG, ywzB |
SigG regulon | spoVAC, spoVAD, spoVAEB, nfo (yqfS),b sspA/sspB/sspC/sspD | spoVAA, spoVAB, spoVAF, sspE, sspF, sspH, sspI, tlp, yqfX |
Mother cell-expressed genes | ||
SigE regulon | sigK, spoIIID, spoIVFB,c spoVB (spoIIIF)/ykvU, spoVE, spoVK,b ctpB, dacB/dacF, alr (yncD), spmA,b spmB,b yisY/yfhM, yitE/yqfU,c ylmC/ymxH,c ytaF,c ytvI, yyaD/ykvI | spoIVFA, spoVM, bofA, ydcA, ydcC, yhbH |
SigK regulon | dpaA (spoVFA),b dpaB (spoVFB),b cgeD,b ykuD/yciB,b ytdA,b ytlDb | |
Spore cortex | spoVD, spoVV (ylbJ), cwlC/cwlD, lytH,b stoA (spoIVH), yabP, yabQ,c yqfC,b yqfD | cotD |
Spore coat | spoIVA, cotJC/yjqC,b cotSA, gerM,b lipC (ycsK),b safA,b sleL/ydhD,b yhaX | spoVID, spoVIF, cotE, cotJA, cotJB, cotM/cotP, yhjR |
Germination | gerA,b gerB,b gerC,b gerF(lgt), gpr, cwlJ/sleB,b cspA,c gdh | gerD, gerE, gerQ, ypeB |
Distribution of the COGs that contain the indicated Bacillus subtilis sporulation genes. No distinction is made for members of the same COG. Paralogs are separated with a slash; parentheses indicate alternative names of the same gene (see Table S2 for details). Genes whose products were used to build the consensus tree (Fig. 3) are shown in bold typeface.
Gene conserved in diverse sporeformers but missing in three or more sporeformer genomes.
Gene missing in one or two genomes of sporeformers (not counting the frameshifts); see Table S3 for a detailed list.
Distribution of spo0A, dpaAB, and sspA genes.
In 2004, Onyenwoke and colleagues identified four conserved sporulation genes, namely, spo0A, dpaA, dpaB, and sspA, and proposed using these genes as markers of the ability of a bacterium to form endospores (33). We have previously reported the presence of these genes in several clostridial nonsporeformers and argued that the presence of these genes was insufficient—and, in the case of dpaAB, unnecessary—for sporulation (32). It was instructive to explore the distribution and evolution of these four genes in the current, much more diverse collection of firmicute genomes.
(i) spo0A.
Analysis of the current genome set confirmed that the master cell regulator Spo0A is an essential component of the sporulation machinery. Indeed, the genome of every experimentally characterized spore-forming member of the Firmicutes encodes Spo0A, and its absence is an excellent predictor of the organism’s inability to sporulate. Among the 180 species analyzed in this work, the spo0A gene was present in 118, including all 76 sporeformers and 40 nonsporeformers (Table 1; Table S1). These observations confirm that the presence of spo0A is insufficient to conclude that a bacterium is a sporeformer, although its absence unequivocally indicates that it is not. In agreement with the previous reports (32, 33, 51), Spo0A-encoding nonsporeformers were found in the classes Bacilli, Clostridia, and Erysipelotrichia. Among Bacilli, our set included six Spo0A-encoding nonsporeformers, all in the order Bacillales: Lentibacillus amyloliquefaciens in the family Bacillaceae, Kurthia zopfii and Planococcus antarcticus in Planococcaceae; Macrococcus caseolyticus in Staphylococcaceae, Novibacillus thermophilus in Thermoactinomycetaceae, and Exiguobacterium sp. from Bacillales family XII, incertae sedis. No Spo0A-encoding genomes and accordingly no sporeformers were represented among the 23 members of the order Lactobacillales. The current version of GenBank includes several spo0A genes from several clinical isolates of Streptococcus pneumoniae, but it remains unclear whether these isolates are correctly classified.
Spo0A is encoded by 32 of the 42 non-spore-forming members of Clostridia in our set and the two members with an unclear sporulation status. Among the 10 members of Negativicutes and nine members of Tissierellia, all four Spo0A-encoding organisms were sporeformers. Among the seven representatives of Erysipelotrichia, Spo0A was encoded in two sporeformers, C. innocuum and E. ramosum, and two nonsporeformers, Amedibacterium intestinale and Turicibacter sp. (Table S1). Most Spo0A+ genomes, 107 of 118, carried a single spo0A gene. The only exceptions were observed in the order Clostridiales, where 10 genomes encompassed two paralogous spo0A genes each, and one, the asporogen Anaerostipes hadrus, had three spo0A paralogs.
The phylogenetic tree of Spo0A proteins (Fig. S3 in the supplemental material) showed only minor deviations from the 16S rRNA-based phylogeny of the Firmicutes: Spo0As from the members of the same bacterial family typically formed distinct clades. Among the Bacilli, monophyly was observed for the families Alicyclobacillaceae and Planococcaceae, whereas Spo0As from members of Bacillaceae and Paenibacillaceae each formed two separate clades (Fig. S3). In Clostridia, Spo0As from most members of the families Clostridiaceae, Lachnospiraceae (formerly known as Ruminococcaceae or Hungateiclostridiaceae), and Peptostreptococcaceae (Clostridioides difficile and Paeniclostridium sordellii), as well as members of the order Halanaerobiales, appeared to be monophyletic. At the class level, Spo0As from spore-forming members of the Erysipelotrichia formed a strongly supported clade that mapped within the Bacilli, whereas members of the Negativicutes (family Sporomusaceae) and Tissierellia mapped within the clostridial clade (Fig. S3). Altogether, these groupings were consistent with vertical inheritance of the spo0A genes across the entire Firmicutes phylum.
Notably, the monophyly of Spo0As from the members of the same family did not depend on whether they came from sporeformers or nonsporeformers. Thus, Spo0As from all members of Planococcaceae formed a single clade, with Spo0As from nonsporeformers Kurthia zopfii and Planococcus antarcticus grouping together with Spo0As from planococcal sporeformers (Fig. S3). Likewise, Spo0A from the sporeformer Lachnoclostridium phytofermentans fit into a clade with non-spore-forming members of the family Lachnospiraceae, whereas Spo0A from the nonsporeformer Dehalobacter sp. strain CF mapped together with spore-forming members of the family Peptococcaceae (Fig. S3).
Irrespective of their ability to form spores, Spo0A-encoding organisms had larger genomes than Spo0A− ones (Fig. S1B) and possessed many more sporulation genes than Spo0A− genomes (Fig. 1). This trend was most pronounced for widespread sporulation genes (Fig. 1A) but also held for narrowly conserved sporulation genes (Fig. 1B) and for those sporulation genes that were conserved mostly in Bacilli (not shown).Thus, Spo0A-encoding bacteria, whether spore forming or not, differ from their Spo0A− relatives in terms of their sporulation gene content. In other words, non-spore-forming Spo0A+ organisms still carry a fair number of sporulation genes. Because stepwise loss of sporulation genes in formerly spore-forming lineages appears far more likely than acquisition of such genes by nonsporulating organisms, these observations serve as additional evidence of the ancient origin of sporulation (see below).
FIG 1.
Distribution of sporulation genes in Spo0A+ and Spo0A− firmicutes. The graphs show the numbers of widespread sporulation genes (out of 112) (A) and more narrowly conserved sporulation genes (out of 74) (B) encoded in sporeformers (which are all Spo0A+; blue columns), Spo0A+ nonsporeformers (orange columns), and Spo0A− nonsporeformers (gray columns). “Other classes” includes Negativicutes, Tissierellia, Erysipelotrichia, and Limnochordia.
(ii) dpaAB.
Dipicolinate, a key spore component, is produced by oxidation of dihydrodipicolinate, which is catalyzed by dipicolinate synthase, whose two subunits are encoded by dpaA (spoVFA) and dpaB (spoVFB), genes that are expressed in the mother cell under σK control. Among the members of the class Bacilli in our genome set, the dpaAB gene pair was found in all sporeformers and was absent in nearly all nonsporeformers. Therefore, within Bacilli, these genes indeed could serve as markers of sporulation. The dpaAB gene pair was also found in Limnochorda pilosa and in both spore-forming members of Erysipelotrichia. In clostridia, however, the picture was more complicated. The dpaA and dpaB genes have been previously shown to be missing in Clostridium perfringens, C. botulinum, and C. tetani, in which dihydrodipicolinate oxidation is catalyzed by the electron transfer flavoprotein, encoded by the etfA-etfB gene pair (32, 52). Nonorthologous displacement of dpaAB by etfAB was also observed in several other clostridial sporeformers, including Clostridium acetobutylicum, Alkaliphilus metalliredigens, Paeniclostridium sordellii, and Tepidanaerobacter acetatoxydans, as well as two spore-forming representatives of Negativicutes, Methylomusa anaerophila and Pelosinus fermentans, and in Tissierella sp. strain JN-28 (also called Sporanaerobacter sp. strain NJN-17), a member of the class Tissierellia. Finally, Gottschalkia acidurici does not encode either DpaAB or EtfAB, suggesting that in this organism, dipicolinate production is catalyzed by yet another, currently unidentified oxidoreductase.
(iii) sspA.
The distribution of the sspA gene, which encodes an α/β-type small acid-soluble sporulation protein (SASP), is also dramatically different between the classes Bacilli and Clostridia. Among bacilli, sspA is found in all sporeformers and is missing in nearly all nonsporeformers (Table S2), making it another good marker of the ability of bacilli to sporulate. Essentially the same picture was observed in the classes Negativicutes, Tissierellia, and Erysipelotrichia. However, in Clostridia, the sspA gene is represented both in sporeformers and in many nonsporeformers, albeit only in Spo0A-encoding ones. The only sporeformer in our set that did not carry the sspA gene was C. innocuum. The ability of this organism to form spores in the absence of SspA (or any other known SASPs; see below) merits further study.
Identification of the core sporulation gene set.
The COGs that included the B. subtilis sporulation genes were sorted by their representation in the 76 sporeformer genomes and, specifically, in the genomes of spore-forming bacilli and clostridia. We classified these COGs into four groups (Table 1): (i) genes (COGs) that were represented in every sporeformer genome, with the possible exception of one or two; (ii) genes that were widespread in sporeformers but were missing in three or more genomes, often those from a specific lineage; (iii) genes that were widespread in bacilli but poorly represented in other classes; and (iv) genes that were present in only a limited number of sporeformers. Representation of these genes (COGs) in all analyzed firmicute genomes was recorded as well (Table S2).
As mentioned above, the analyzed genomes were extracted from the COG collection (39), most of which were selected from the NCBI’s RefSeq database (45). Nevertheless, it has been shown that even highly conserved ribosomal proteins are occasionally missed during genome assembly, left untranslated because of sequencing errors, or simply overlooked in the course of genome annotation (53). Therefore, cases of widespread sporulation genes missing in one or more sporeformer genomes were individually verified by checking for the potential presence of highly diverged or partial open reading frames (ORFs) using a DELTA-BLAST search (54) against the NCBI protein database and/or a TBLASTn search (55) against the nucleotide sequence database, each time selecting the respective species or strain as the target database and a representative protein from a closely related organism as a query (56). This analysis identified frameshifts and nonsense mutations in several widely conserved sporulation genes in several genomes (Table S3 in the supplemental material). However, the occurrence of such suspicious frameshifts was typically limited to one or two per genome, even in Sulfobacillus acidophilus strain TPY (GenBank accession number CP003179.1), which had several unannotated ribosomal proteins (53) and was excluded from RefSeq because of the missing rRNA genes. This genome also lacked an unusually high number of widespread sporulation genes (Table S4 in the supplemental material). Another exception was the genome of Jeotgalibacillus malaysiensis D5 (GenBank number CP009416.1), which also had several frameshifted and missing sporulation genes (Tables S3 and S4), some of which were found in other Jeotgalibacillus spp. (Table S2). The peculiar phylogenetic position of the genus Jeotgalibacillus between Bacillaceae and Planococcaceae (57, 58) warranted keeping this genome in the analyzed set, but its sporulation genes patterns were considered unreliable.
Comparison of Table 1 with the previously identified sets of core sporulation genes (2, 32, 36, 37) showed that the list remained largely stable, despite the substantially expanded coverage of Bacilli and Clostridia and the inclusion of representatives of three more classes of Firmicutes, Negativicutes, Tissierellia, and Limnochordia (Table S5 in the supplemental material). Most genes previously assigned to the conserved core were found in all or nearly all of the 76 sporeformers in the current genome set. Further, several genes, such as spoIVFB, spoVE, cotS, yfhM, yhaX, yisY, and yyaA, turned out to be more widespread than previously thought (Table S5). The only deviation from this pattern was observed in the class Erysipelotrichia, in which the spore-forming members, C. innocuum and E. ramosum, were missing more than a dozen genes from the conserved sporulation gene set (see below). This comparison also showed that, despite the overall conservation of the core sporulation genes in all classes of Firmicutes, many of these genes were found to be completely missing (as opposed to being disrupted by frameshifting) in one, two, or more genomes (Table S6 in the supplemental material). Although some of these gaps in phyletic patterns could be caused by errors in genome sequencing and/or assembly, the consistency of such patterns in certain phylogenetic groups appeared to reflect an actual evolutionary trend. In the following sections, we describe the streamlining of the sporulation gene complement in sporeformers from the families Planococcaceae and Erysipelotrichaceae, followed by discussion of the gene loss within specific sporulation systems.
Sporulation genes in Planococcaceae.
The analyzed genome set included six spore-forming members of the family Planococcaceae (also known as Caryophanaceae), order Bacillales: Paenisporosarcina sp. strain K2R23-3, Rummeliibacillus stabekisii, Solibacillus silvestris, Sporosarcina psychrophila, and Ureibacillus thermosphaericus. In addition, we included Jeotgalibacillus malaysiensis, which is currently assigned to the Planococcaceae, and Lysinibacillus sphaericus, which is usually assigned to Bacillaceae but in phylogenetic analyses robustly falls within the Planococcaceae (58). These seven organisms showed a consistent pattern of sporulation gene loss, with most of them lacking four of the nine components of the SpoIIQ–SpoIIIA complex (spoIIIAA, spoIIIAB, spoIIIAD, and spoIIIAF genes), as well as such conserved sporulation genes as spmA, spmB, spoIIM, gerM, gerW (ytfJ), yqfC, and ytxC, which are found in most sporeformers (Table 1; Tables S2 and S4). Furthermore, these seven organisms also lacked some genes that are generally conserved among the spore-forming members of the Bacillaceae, including bofC, csfB, spoIIIL, spoIVFA, spoVAA, spoVAB, spoVAEA, spoVID, yabG, yhbB, yrrD, ysxE, and ytxC (Table S2), although they encoded SpoVAC, SpoVAD, and SpoVAEB, proteins that collectively are sufficient for the calcium dipicolinate uptake by the developing spore (59). They also lack the genes that encode such SASPs as SspK, SspL, SspN, SspO, SspP, and CsgA, spore coat proteins CotM, CotI, CotO, CotP, and CotW, and spore maturation proteins SpsJ and CgeB, which are found in most spore-forming bacilli (Table S2). The most dramatic deviations from the common phyletic pattern were detected in J. malaysiensis, in accordance with its uncertain phylogenetic position (58). The consistent absence of all these genes suggests that members of the Planococcaceae possess substantially streamlined sporulation machinery. The analyzed genome set also included two asporogenous representatives of Planococcaceae, K. zopfii and P. antarcticus, which have spo0A but lack almost all other sporulation genes (Table S2).
Sporulation genes in Erysipelotrichaceae.
The evolution of the Erysipelotrichia lineage likely involved a massive loss of genes, including many related to sporulation (60). Indeed, the genomes of C. innocuum and E. ramosum, despite their relatively large sizes (4.7 and 3.25 Mb, respectively), show patterns of extensive loss of sporulation genes. Consistent with the previous observations (60, 61), they both lack seven of the nine genes of the SpoIIQ-SpoIIIA complex, retaining only SpoIIQ and SpoIIIAH components. Further, they both lack such widely conserved sporulation genes as spoIIID, bofA, cotQ, safA, yabQ, ypeB, yyaC, yyaD, sleL (yyaH), and gerW (ytfJ) (Tables S2 and S4). Despite the evolutionary proximity of Erysipelotrichia and Bacilli (28, 60), many widely conserved bacillar sporulation genes are missing as well, including bofC, csfB, spoIIIL, spoIVFA, spoVAA, spoVAB, spoVAEA, spoVID, yabG, yhbB, yrrD, ysxE, and ytxC. Some sporulation genes appear to have been lost in only one of these genomes. In particular, C. innocuum lost spoIIR and spoVS, whereas E. ramosum retained these genes but lost spoVG and spoVT (Table S4). Another system with an unusual phyletic pattern is the spore germination receptor GerABC. While non-spore-forming members of Erysipelotrichia encode either all three subunits of GerABC (Turicibacter sp. strain H121) or none at all (three other genomes), the sporeformers C. innocuum and E. ramosum encode the membrane subunit GerA (GerAA) but not the membrane transporter GerB (GerAB) or the lipoprotein GerC (GerAC) subunits. Although the GerABC receptors are not strictly required for germination (they are missing in Clostridioides difficile [62, 63] and could be dispensable in C. botulinum [64]), they are nearly ubiquitous among sporeformers (Table S2). The presence of GerAA but not GerAB or GerAC subunits is so far unique among known sporeformers.
Sporulation onset and asymmetric division.
Along with spo0A, genes involved in sporulation onset and asymmetric division are highly conserved in sporeformers of the class Bacilli, and many, although not all, of these genes are also conserved in spore-forming members of Clostridia and four other classes of Firmicutes (Table 1; Table S2). As discussed above, many clostridia lack the Spo0B-Spo0F-Spo0A phosphorelay system, and their Spo0As are phosphorylated by orphan histidine kinases that are not orthologous to any of the five sporulation histidine kinases (KinA to KinE) of B. subtilis. However, many of these orphan kinases contain at least one PAS sensor domain (25, 51, 65). Nevertheless, the spo0B-spo0F gene pair is present in several clostridia, primarily members of the families Heliobacteriaceae, Peptococcaceae, and Thermoanaerobacteraceae (51), as well as in Negativicutes and Tissierellia (Table S2).
The genes involved in asymmetric cell division, such as divIB, divIC, divIVA, spoIIE, ftsE, ftsH, ftsX, ftsY, ftsZ, minC, minD, and minJ, are widespread in bacteria, including most firmicutes and nearly all sporeformers (Table S2). As expected, genes encoding the four sporulation-specific sigma subunits, sigF, sigG, sigE, and sigK, are found in every sporeformer, but the Spo0A-regulated genes spoIIAA, spoIIAB, spoIIGA, spo0JA (parA), and spo0JB (parB) are also highly conserved in sporeformers of every firmicute lineage (Table 1). However, such Spo0A-dependent genes of B. subtilis as ykuJ, ykuK, and yneF, all three with unknown functions, are not conserved even within Bacillales (Table S2). It should be noted that sporulation of Sporosarcina ureae has been observed to involve symmetric division into two daughter cells (66). We do not know how widespread this phenomenon is, but it could reflect the above-mentioned genome streamlining in the Planococcaceae.
Formation and activation of σK.
While σK is universally encoded in the genomes of all spore-forming firmicutes, its activation is tightly controlled, preventing premature expression of late-stage sporulation genes in the mother cell (67–69). In B. subtilis strain 168, the σK-encoding gene is interrupted by a 48-kb prophage-like skin (sigK-intervening) element, which results in two separate coding regions designated spoIVCB and spoIIIC, neither of which is expressed. These fragments of sigK are joined as a result of SpoIVCA-dependent excision of skin to form the complete, intact sigK gene that encodes pro-σK (70, 71). Pro-σK is then activated in the mother cell through cleavage by the intramembrane metalloprotease SpoIVFB (67, 69). In C. difficile 630, the skin element is only 14.6 kb long, and its excision yields a sigK gene encoding a mature σK, which does not require proteolytic activation (72, 73). SigK-interrupting skin elements have also been identified in C. tetani and C. perfringens (74, 75). We confirmed the presence of split sigK genes in B. subtilis, C. difficile, and C. tetani and identified split sigK genes in 12 additional organisms from Bacilli, Clostridia, Negativicutes, and Tissierellia (Fig. 2). These skin elements range in size from 2.6 to 48 kb and encode from 2 to 67 proteins (see Table S7 in the supplemental material).
FIG 2.
Organization of skin elements in 15 firmicute genomes. spoIVCB genes are shown in light green, spoIIIC in dark green, and spoIVCA in orange. The organism names are followed by the GenBank genome entries and genomic coordinates of the regions between spoIVCB and spoIIIC. The numbers above the gaps indicate the distances between spoIVCB or spoIIIC and spoIVCA (~1.5 kb less than the total length of the skin element). R. cellulolyticum Ccel_0434 is annotated as a single 7,394-nucleotide pseudogene.
The orientation of skin elements with respect to the sigK component genes is variable, but the spoIVCA gene is always located at the edge of the element, usually immediately upstream of the spoIIIC gene, which encodes the C-terminal fragment of σK (Fig. 2). The SpoIVCA proteins from different skin elements (see Fig. S4A in the supplemental material for a sequence alignment) are members of a single family of site-specific DNA recombinases (COG1961) that is represented by multiple copies in most firmicute genomes, including up to 26 genes in (non-spore-forming) Flavonifractor plautii and Oscillibacter valericigenes. In the phylogenetic tree of SpoIVCA-like recombinases/integrases (Fig. S4B), SpoIVCAs from skin elements are interspersed with those from tailed phages (Caudovirales), suggesting a (pro)phage origin of the skin elements. Remarkably, aside from spoIIIC, spoIVCA, and spoIVCB, no genes are shared by all skin elements (Fig. S4C). The presence of skin elements that split sigK genes in two nonadjacent fragments often causes confusion in genome annotation. Thus, in the current genomic entries for B. subtilis strain 168 (GenBank accession numbers AL009126.3, CP051860.2, CP053102.1, and CP052842.1), sigK is erroneously marked as a pseudogene, whereas in genomic entries for C. tetani E88 (GenBank number AE015927.1) and Pelosinus fermentans (GenBank number CP010978.1), the C-terminal spoIIIC-like fragment is left untranslated (Tables S5 and S7). Remarkably, sigK is not unique as the skin insertion site: skin-like elements were also found interrupting the cotJC gene of Laceyella sacchari and spoIIID gene of Ruminiclostridium cellulolyticum (Table S3).
In B. subtilis, activation of pro-σK by SpoIVFB is regulated by BofA and also SpoIVFA, which is itself regulated by proteolysis by the forespore-expressed protease SpoIVB (76). σK is encoded in every sporeformer genome, as is, with a single exception, SpoIVFB, which is a member of a widespread metalloprotease family (COG1994). However, SpoIVFB-interacting BofA and SpoIVFA are far less widespread. BofA is missing in 10 of the 76 sporeformers, including six clostridia, C. innocuum, E. ramosum, and Limnochorda pilosa. The distribution of SpoIVFA is generally limited to the members of the class Bacilli. It is missing in the Planococcaceae, in nearly all clostridia, and, again, in C. innocuum, E. ramosum, and L. pilosa. In B. subtilis, the SpoIVFA-cleaving serine protease SpoIVB also cleaves another signaling serine protease, CtpB (77), and plays an additional, essential role in sporulation that appears to involve cleavage of SpoIIQ (78, 79). This cleavage of SpoIIQ has been proposed to be the function of SpoIVB in C. difficile, which has a pro-less sigK gene and does not need SigK processing (80). Accordingly, SpoIVB is nearly universal among sporeformers, being represented in numerous genomes that do not encode SpoIVFA. The details of SpoIVFB-BofA interaction in the absence of SpoIVFA and the functions of SpoIVB homologs in these organisms remain to be elucidated. This system also includes BofC, which has been proposed to interact with SpoIVB (81). BofC consists of two distinct domains, and its full-length version is found almost exclusively in Bacillaceae and Paenibacillaceae but not in other families of Bacillales, including Planococcaceae (Table S2). The C-terminal domain of BofC, PF08955, is more widespread but is still missing in most spore-forming clostridia. The functions of BofC and its C-terminal domain remain obscure.
Conservation of the engulfment complex.
In B. subtilis, engulfment involves two key protein complexes, SpoIID-SpoIIM-SpoIIP (the DMP complex) and SpoIIIA-SpoIIQ, the first of which is targeted to the growing septum by either SpoIIB or the SpoIVFA-SpoIVFB pair (82–84), with additional involvement of GerM (85). Both protein complexes are found in C. difficile (86) and are conserved in almost all sporeformers (Table 1), whereas the targeting proteins are less widespread: as noted previously (32), both spoIIB and spoIVFA are missing in clostridia (Table S2).
Among the three genes encoding the peptidoglycan-degrading DMP complex, spoIID was missing in a single genome in our 76-sporeformer genome set, and spoIIP was missing in two (Table S6); these three cases could be due to the sequencing problems. In contrast, spoIIM, although widespread as well, was missing in nine genomes (Table S2). This pattern is consistent with the reports that SpoIIM is not essential for engulfment in C. difficile (87, 88), whereas SpoIID and SpoIIP appear to be indispensable.
The transmembrane SpoIIIA–SpoIIQ complex consists of nine proteins, eight of which are encoded in the mother cell by the genes of the spoIIIAA-AB-AC-AD-AE-AF-AG-AH operon, which is under SigE control. The ninth protein, SpoIIQ, is produced in the forespore under SigF control and contains the M23-type metallopeptidase LytM domain (84, 89). The core of the SpoIIIA-SpoIIQ complex consists of SpoIIIAH and SpoIIQ proteins. Both these proteins form dodecameric (or even larger) rings that interact to form a 60-Å (or even a 140-Å) transmembrane channel, which connects the mother cell and the forespore and could potentially enable trafficking of small molecules (or even proteins) between the two compartments (90, 91). The structure of this complex from B. subtilis (Fig. S5A) revealed a key role of the LytM domain of SpoIIQ in its interaction with SpoIIIAH. However, despite the presence of the common LytM domain, SpoIIQ proteins from bacilli and clostridia are substantially distinct: the M23 peptidase is inactivated in the former but appears to be intact in the latter (92). Accordingly, these proteins belong to two different COGs, COG5820 and COG5821, in the COG database (39). Both varieties of SpoIIQ are often encoded in similar spoIID-spoIIQ-spoIIID operons, although the genomic neighborhoods of spoIIQ are highly variable even within the Bacillaceae (Fig. S6). Although we previously reported not being able to find orthologs of clostridial spoIIQ (CD0125 in C. difficile) in the genomes of several members of the family Peptococcaceae (32), in this work, using synteny and DELTA-BLAST (54) searches, we identified these in the genomes of all clostridial sporeformers but very few nonsporeformers (Table S2). Two spore-forming members of Erysipelotrichia, C. innocuum and E. ramosum, encode the bacillar form of SpoIIQ, whereas spore-forming members of Negativicutes, Tissierellia, and Limnochordia encode the clostridial form. Thus, all 76 sporeformers in our set encode both SpoIIIAH and one of the two forms of SpoIIQ, indicating strict conservation of the SpoIIQ-SpoIIIAH pore among spore-forming bacteria. Despite its conservation, the functions of the SpoIIQ-SpoIIIAH complex are not yet fully resolved. There is growing evidence that this complex plays a key role in keeping the two forespore membranes together, functioning as a zipper (93, 94) and/or a noncanonical piliation system (95).
As noted above, two groups of Firmicutes, Erysipelotrichaceae and Planococcaceae, have apparently lost multiple components of the SpoIIQ-SpoIIIA complex. The Erysipelotrichia members C. innocuum and E. ramosum retain only the spoIIIAH and spoIIQ genes. In addition, they possess SpoIVFB and GerM but lack SpoIIB and SpoIVFA (Table S2; Fig. S5B and C) (61). Six members of the family Planococcaceae show stronger conservation of the SpoIIQ-SpoIIIA complex: in addition to SpoIIIAH and SpoIIQ, they retain SpoIIIAC, SpoIIIAE, and SpoIIIAG components (Fig. S3). However, all these organisms lack SpoIIB, SpoIVFA, SpoIVFB, and GerM proteins, which participate in the assembly of the SpoIIQ-SpoIIIA complex in B. subtilis. Members of Planococcaceae, such as Lysinibacillus sphaericus, could be useful model organisms for studying the core mechanisms of engulfment in the Firmicutes.
SASPs.
SASPs are short proteins that constitute up to 20% of the total spore protein content and are the major source of amino acids during spore germination (96–100). In B. subtilis, the majority of the SASPs belong to two types, α/β (COG5852, which unifies the SspA, SspB, SspC, and SspD families, and COG5854 [SspF]) and γ (COG5853), whereas several minor SASPs (96, 99, 101) make up COGs from COG5855 through COG5864. As observed previously, the α/β-type SASPs, which bind double-stranded DNA and protect spores from heat, UV radiation, and other adverse conditions, are nearly universal in spore-forming firmicutes, whereas the distribution of the other SASP families varies (32, 102). This observation is supported by the data from our new set. Indeed, COG5852 is represented in 75 of the 76 sporeformers and in 31 of the 40 Spo0A-encoding nonsporeformers (Table S2). The only sporeformer missing SspA was C. innocuum; the other spore-forming member of Erysipelotrichaceae, E. ramosum, has two sspA-like paralogous genes and a single sspI gene for a minor SASP. Homologs of sspA are also found in other members of Erysipelotrichaceae, such as [Clostridium] spiroforme and [Clostridium] cocleatum (Fig. S7). However, C. innocuum lacks known ssp genes, so the nature of its SASPs, if any, remains enigmatic. Most of the minor SASPs described in B. subtilis are also encoded in at least some members of Bacillales (102). In contrast, members of other classes of Firmicutes encode only α/β-type SASPs of the SspF and Ssp4 families (103, 104); some clostridia also encode Tlp-like SASPs. Thus, the diversity of SASPs in B. subtilis does not appear to be shared by other groups of Firmicutes, which is consistent with the reports of nonessentiality of minor SASPs (97, 104).
Spore cortex.
Genes involved in the biosynthesis of the spore cortex, the peptidoglycan layer that surrounds the forespore membrane, have been previously found to be conserved throughout Bacilli and Clostridia (32). Examination of the current expanded genome set showed that this conclusion still holds for spoVB, spoVD (ftsI), spoVE (ftsW), spoVV (ylbJ), yabP, and yqfD genes (Table 1). One more gene, yabQ, which is essential in B. subtilis (105, 106), was missing only in the genomes of C. innocuum and E. ramosum, whereas the yqfC gene was missing in the genomes of six Planococcaceae members and E. ramosum (Table S6).
Spore coat.
As noted previously (5, 32, 107), bacilli and clostridia have dramatically different spore coats: many of the ~70 proteins that form the coat of B. subtilis (7) are conserved only within Bacillaceae or not even in all members of this family (32, 108). Among the 10 morphogenetic spore coat proteins of B. subtilis (SpoIVA, SpoVM, SpoVID, SafA, CotE, CotH, CotO, CotX, CotY, and CotZ), only SpoIVA is universally conserved in all spore-forming firmicutes (Table S2). In bacilli, SpoIVA interacts with SpoVID (109, 110), whereas in C. difficile and some other clostridia, its interaction partner is SipL (CD3567), which shares a C-terminal LysM domain with SpoVID (111, 112). Of the other SpoIVA- and/or SpoVID-interacting proteins, SpoVM is found in some clostridia but is not essential for sporulation (113), CotE is found in only a few clostridia, and the full-length SafA (as opposed to its LytM domain) is not represented outside bacilli. CotH, CotO, CotX, CotY, and CotZ are missing even in some members of Bacillaceae, and, except for CotH, none of these genes is found in clostridia or members of other classes of Firmicutes (Table S2).
Among the proteins of the B. subtilis inner coat, the most widespread are proteins of the CotJC/YjqC family, which belong to COG3546, “Mn-containing catalase.” Three members of this family (CotC, CotD, and CotE) were found in the spore coat of C. difficile, and the latter two have been shown to retain enzymatic activity (107, 114). Two more spore coat proteins of C. difficile, the peroxiredoxin CotE and the superoxide dismutase SodA, could be involved in protection from oxidative stress (107). Other widely conserved proteins of the B. subtilis basement layer and inner coat are also (not necessarily active) enzymes: the acetyltransferase CgeE, the peptidoglycan hydrolases CwlJ and SleL (YaaH), the phospholipase LipC, the HAD family phosphatase YhaX, the ATP-grasp family enzymes YheC and YheD, the MenH-related esterase YisY, and the glycosyltransferases CotSA and YdhD (Table S2). A similar pattern is apparent in the outer coat, which contains the alanine racemase YncD, the nucleotidyl transferase YtdA, the predicted FAD-dependent dehydrogenase CotQ, and the LysM domain-containing protein YkzQ. Both inner and outer coats of B. subtilis contain members of the small heat shock protein (HSP20) family, CotP and CotM, respectively. These examples demonstrate the widespread recruitment (exaptation) of common housekeeping enzymes as structural components of the spore coat. Their broad representation among bacillar sporeformers, as opposed to the scarcity in nonsporeformers (Table S2), suggests that such recruitment is a common mechanism of spore coat assembly in Bacilli. There are still insufficient data on spore coat proteins in Clostridia and none on those in other classes of Firmicutes, but studies in C. difficile (107, 114) suggest that such recruitment occurs in clostridia as well.
Germination proteins.
As mentioned above, the common germination receptors consisting of GerA, GerB, and GerC components are encoded in all classes of Firmicutes (Table S2). In our set of 76 sporeformers, the only exceptions were two members of the family Peptostreptococcaceae, C. difficile and Paeniclostridium sordellii, which lacked all three components of the receptor, and two members of Erysipelotrichaceae, C. innocuum and E. ramosum, which lacked GerB and GerC. In C. difficile, the bile acid germinant receptor has been identified as the pseudoprotease CspC (5, 115, 116), a member of the vast family of subtilisin-like serine proteases (COG1404). The presence of multiple members of this protein family in most spore-forming firmicutes suggests that the CspC-dependent germination mechanism is not limited to C. difficile and could potentially function in other organisms, as an addition to the better-studied GerABC-mediated signaling.
The germination process involves degradation of the spore protective layers, the cortex, and the inner and outer coats, as well as hydrolysis of DNA-shielding SASPs. In B. subtilis, hydrolysis of the spore cortex is catalyzed by peptidoglycan hydrolases CwlJ and SleB, members of the same COG3773, which is widespread in spore-forming bacilli and clostridia and is also found in spore-forming members of Tissierellia and Limnochordia, albeit not in the Negativicutes or C. innocuum (Table S2). The regulators of the activity of these enzymes, however, are less widespread. YpeB, which regulates SleB activity in bacilli and many clostridia (117–119), is missing in C. difficile and several other clostridia, as well as in all members of Negativicutes and Erysipelotrichia in the current set (Table S2). In C. difficile, the principal spore cortex-hydrolyzing enzyme is SleC (CD630_05510), a multidomain protein that consists of a structurally, but not functionally, characterized DUF3869 domain (Pfam [120] domain PF12985) (121), a SpoIID-like amidase domain, and a C-terminal peptidoglycan-binding (PF01471) domain. SleC homologs are found in many clostridia, as well as C. innocuum and E. ramosum.
Degradation of SASPs is catalyzed by the germination protease, Gpr, an aspartate protease (104, 122) that is universally conserved in spore-forming firmicutes (Table 1) but is absent in non-spore-forming members of Bacilli, Negativicutes, Tissierellia, and Erysipelotrichia. Among the clostridial nonsporeformers, the distribution of Gpr is generally similar to that of Spo0A: Spo0A-encoding sporeformers usually have the gpr gene, whereas genomes that lack spo0A also lack gpr (Table S2).
Among other genes involved in spore germination in B. subtilis, only a few, such as spoVAD, gdh, gerF (lgt), ykvU, and ypeB, are widespread, whereas others, such as csgA, gerD, gerE, gerPA/gerPF, gerPB, gerPC, gerPD, gerPE, and spoVAEA, are found mostly (or exclusively) in spore-forming bacilli (Table S2).
Uncharacterized sporulation proteins.
Despite numerous studies on sporulation in B. subtilis and other model organisms, functions of several widespread sporulation genes have been clarified only recently, whereas functions of several others remain unknown (Table 1). Altogether, the SubtiWiki database (38) lists >250 y genes that appear to be specifically expressed during sporulation (15, 16, 20, 123, 124), but have not been characterized experimentally. Assignment of such genes to the COGs allowed for making at least tentative functional predictions for several y genes (Table S8 in the supplemental material). Examples include YdgB and YteA proteins, both annotated as “uncharacterized” in UniProt, which belong to COG1734, “RNA polymerase-binding transcription factor DksA,” and the uncharacterized proteins YqcK and YqjC that fall into COG0346, “Catechol 2,3-dioxygenase-related enzyme, vicinal oxygen chelate (VOC) family.” Experimental characterization of these genes and other widespread genes of unknown function could provide further insight into the details of sporulation mechanisms in diverse firmicutes.
Evolution of sporulation.
We employed the updated list of core sporulation genes (Table 1) to investigate the evolution of sporulation and compare it with the overall phylogeny of Firmicutes. To this end, we constructed a ribosomal protein-based phylogenetic tree for all 180 analyzed organisms as well as sporulation protein-based and ribosomal protein-based trees for the 76 sporeformers. The maximum-likelihood phylogenetic tree of 180 firmicutes (Fig. S8) was built using IQ-TREE 2 (125) from a concatenated alignment of 54 ribosomal proteins (6,951 total positions, 6,366 unique) essentially as described previously (28, 126). This tree had excellent bootstrap support and largely agreed with the currently accepted 16S rRNA gene-based phylogeny of the Firmicutes. Members of most previously defined Firmicutes families clustered together, typically as clades. The notable deviations from the current taxonomy (56) included members of the class Erysipelotrichia (family Erysipelotrichaceae), mapped within Bacilli as a sister group to Listeriaceae and Lactobacillales, and members of three other classes mapping within Clostridia as follows: (i) all three families of class Tissierellia as a sister group to Peptostreptococcaceae; (ii) members of the class Negativicutes as a sister group to families Peptococcaceae and Syntrophomonadaceae, and (iii) Limnochorda pilosa, the sole member of the class Limnochordia, as a sister group to Sulfobacillus acidophilus and Thermaerobacter marianensis (Eubacteriales family XVII, incertae sedis). This tree also suggested the placement of some organisms with previously obscure taxonomy: Intestinimonas butyriciproducens mapped into Oscillospiraceae, whereas Ndongobacter massiliensis and Ezakiella massiliensis fell within the radiation of the family Peptoniphilaceae (class Tissierellia).
For a bird’s-eye view of the phylogeny of sporulation, we constructed a maximum-likelihood tree based on a concatenated alignment of 41 core sporulation proteins (as listed in Table 1) from all 76 sporeformers in the current set (Fig. 3, left) and compared it with the ribosomal protein-based tree for the same 76 species (Fig. 3, right). This comparison showed a strong, clear trend: with a few exceptions, sporulation protein sets from members of the same family formed well-supported clades. As an example, members of the Planococcaceae in both trees formed a sister group to the Bacillaceae, the only differences being the branching order of Sporolactobacillus terrae, a member of the family Sporolactobacillaceae, and Alkalihalobacillus (formerly Bacillus) halodurans, and two members of the Erysipelotrichaceae, C. innocuum and E. ramosum. The clostridial portions of both trees had closely similar topologies with essentially the same branching order for most groups. Members of the recently recognized classes Negativicutes (family Sporomusaceae), Tissierellia, and Limnochordia mapped within the clostridial lineage but, again, in the same positions on both trees (Fig. 3). The congruence of both trees indicated that the sporulation genes largely evolved by vertical inheritance, without substantial horizontal gene transfer.
FIG 3.
Phylogenic trees for sporulation and ribosomal proteins from Firmicutes. Maximum-likelihood phylogenetic trees were built using IQ-TREE2 (125) with the LG+F+R9 substitution model from concatenated alignments of 41 core sporulation proteins (left panel) and 54 ribosomal proteins (right panel) encoded in the genomes of 76 spore-forming members of the Firmicutes.
The evolutionary history of gains and losses of these sporulation genes (the same 41 except for spoIIQ) was reconstructed using GLOOME, by mapping the patterns of gene presence and absence in each lineage onto the phylogeny of 180 representative Firmicutes, to infer the posterior probabilities of gene presence in the ancestral tree nodes (127). This reconstruction placed all 40 core sporulation genes at the root of the Firmicutes phylogenetic tree and suggested only a few gene gains (jag in three Lactobacillus spp, the dipicolinate transporter-encoding spoVV/yjiH/ylbJ genes in several nonsporeformers, and ycaP in Veillonella and Megasphaera). An interesting example of a likely gain was the spo0A gene in Macrococcus caseolyticus, one of the few instances of this gene in non-spore-forming bacilli and the only such gene in the Staphylococcaceae members in our set (in GenBank, spo0A is also found in the members of the newly recognized genus Mammaliicoccus). The maximum-likelihood approach implemented in GLOOME considered the gain of spo0A by M. caseolyticus more likely than the loss of this gene in five other staphylococcal genomes, although the highly degraded sequence of this gene and its unusual placement in the phylogenetic tree of Spo0A (Fig. S3) suggested the latter alternative as more realistic.
With all 40 core genes mapped to the root of the Firmicutes, GLOOME predicted extensive loss of sporulation genes in non-spore-forming lineages, such as lactobacilli, staphylococci, and listeria. A major loss of sporulation genes was also inferred for the asporogenous members of the order Bacillales, such as Salimicrobium jeotgali, Gemella haemolysans, and Exiguobacterium sp. (see Table S1). A similar picture was observed in the classes Negativicutes and Tissierellia. Many sporulation genes are conserved in the sporeformers Methylomusa and Pelosinus in the former class and Gottschalkia and Tissierella in the latter, whereas asporogenous representatives of these classes retained very few sporulation genes (Table S1). Accordingly, GLOOME interpreted the absence of these genes in the nonsporeformers as lineage-specific gene losses. The two lineages mentioned above, Planococcaceae and Erysipelotrichaceae, showed loss of some sporulation genes in their spore-forming representatives and Spo0A-encoding nonsporeformers and a far more extensive loss in the Spo0A− bacteria and nonsporeformers.
Among Clostridia, massive, family-wide gene loss was seen only in Eubacteriaceae, as most other families included both spore-forming and asporogenic genera. Here, dramatic differences in sporulation gene content were often observed within the same family. Thus, in the family Peptostreptococcaceae, the analyzed genome set included five members, two sporeformers (C. difficile and Paeniclostridium sordellii) and three nonsporeformers (Acetoanaerobium sticklandii, Filifactor alocis, and Peptoclostridium acidaminophilum) (49). Accordingly, the first two encompassed nearly complete sets of core sporulation genes (with the exception of those coding for the germination receptor GerABC and several others; see above), whereas the three nonsporeformers showed the loss of at least 29 of the 40 analyzed genes (see Table S2). Further, many clostridial nonsporeformers still carried spo0A and retained a fair number of sporulation genes (Fig. 1; Table S1). As an example, all six representatives of the order Halanaerobiales in the current set were nonsporeformers, but they all carried spo0A, and a major loss of sporulation genes was observed only in Halanaerobium hydrogenoformans (Table S1). This case, as well as the examples of lineage-specific gene loss in Butyrivibrio proteoclasticus (family Lachnospiraceae) or Fastidiosipila sanguinis and Mageeibacillus indolicus (family Oscillospiraceae) (Tables S1 and S2), shows that dramatic loss of sporulation genes can occur over relatively short evolutionary distances.
DISCUSSION
Sporulation is a complex process that involves dozens of genes that function in a tightly regulated fashion to ensure the survival of the bacterial cell under adverse conditions. Spores are resistant to a variety of damaging factors, including extreme conditions, and can persist for thousands—and potentially millions—of years (128). A recent study suggested that spores of B. subtilis could even survive on a simulated Martian surface (100). A better understanding of the sporulation mechanisms could help in finding new approaches to eradicate spore-forming human and animal pathogens. Furthermore, recent studies of the human microbiome have revealed a variety of sporeformers that remain to be characterized in detail (129), suggesting that the impact of Firmicutes sporeformers on human health could be even greater than currently appreciated.
An essential step toward understanding the sporulation process is delineation of the set of genes that are necessary and, possibly, sufficient for sporulation across the entire diversity of bacterial sporeformers. Given that the functions of many genes involved in sporulation remain unknown, those genes that belong to the core set would become priority targets for structural and functional characterization. A major stumbling block for the identification of the core set of sporulation genes is the uncertainty of the sporulation status of many bacteria with sequenced genomes. While many genomes come from well-characterized type species, many others do not, and it is usually not known whether the sequenced strain exactly corresponds to the one that was described previously, in some cases many years ago. A case in point is Acetohalobium arabaticum, which was initially described as a sporeformer (130) but apparently lost the ability to sporulate during the laboratory passages (31, 131); we listed it as a nonsporeformer (Table S1). Conversely, Turicibacter sanguinis, a Spo0A-encoding member of the class Erysipelotrichia, was originally described as asporogenous (132) but subsequently shown to form typical subterminal spores (see Extended Data Figure 4 in reference 129). The genome of Turicibacter sp. strain H121 included in this work has almost the same set of sporulation genes as C. innocuum and E. ramosum but lacks, among others, divIVA, dpaB, spoIIGA, spoVK, and spoVN (Table S2), making this strain unlikely to sporulate. The spore-forming representatives of Erysipelotrichia analyzed in this work, C. innocuum and E. ramosum, were occasionally described as forming very low numbers of spores (133, 134). Nevertheless, there was no doubt that these organisms are sporeformers, which made defining their set of sporulation genes a worthwhile exercise.
Previous studies by several independent groups (2, 32, 34–37) identified essentially the same set of ~60 sporulation genes of B. subtilis that were shared by spore-forming members of Bacilli and Clostridia. However, in the genomes of some sporeformers, certain genes from this conserved core were found to be missing (32, 36, 37). The goal of this work was to reassess the distribution of the key sporulation genes using the widest possible selection of complete genomes from all branches of the Firmicutes. To this end, we employed the latest version of the COG database (39) and supplemented it with five recently sequenced genomes from various members of the Erysipelotrichia. This resulted in a set of 180 species from 160 genera, representing 45 different families from six classes of the Firmicutes. At this scale, most genera were represented only by a single genome of a single species, which provided a different perspective from most previous studies (2, 32, 34–37). Therefore, this work complements, rather than supersedes, those studies.
As discussed elsewhere, the COG database displays the patterns of presence-absence of orthologous genes (COG members) across all covered genomes and therefore allows straightforward identification of those genes that are missing in a given genome (39, 42). However, there is an important caveat: a single COG often includes several paralogous genes, which can give the impression that each of these paralogs has a wider phylogenetic distribution than it actually does. For example, cwlJ and sleB, members of COG3773, “Cell wall hydrolase CwlJ,” are both shown as universal (Table 1), although some genomes, such as that of C. perfringens or C. difficile, may encode only a single member of that family. Likewise, dacB and dacF are paralogous genes that encode closely related d-alanyl-d-alanine carboxypeptidases (penicillin-binding proteins 5* and 1, respectively, members of COG1686) that are involved in peptidoglycan metabolism. However, the former is expressed in the mother cell under SigE control, whereas the latter is expressed in the prespore under the control of SigF and/or SigG. Members of this COG, which also contains a sporulation-independent paralog, dacA, are found in most firmicutes, both spore forming and not.
Further, certain gaps in phyletic patterns can arise from errors in genome sequencing and annotation. In some cases, genes that were frameshifted in the examined genomes (Table S3) were found in full-size versions in the genomes of other strains or closely related species. Although we made an effort to address the potential annotation problems, fixing possible sequencing errors was not part of this project. As a result, we recorded the instances of rarely missing genes in Table S6 but were mostly interested in cases where a certain gene was missing in more than one or two genomes, particularly when these genomes came from the same bacterial lineage.
In our previous work (32), we noted the absence of many key sporulation genes in the relatively large (4.8 Mb) genome of Lysinibacillus sphaericus C3-41 (135) but blamed it on the poor quality of the genome sequence, calling it “the worst offender” among all bacilli and excluding it from most analyses. Here, many of those genes were also found to be missing in the genome of the type strain of this species, L. sphaericus DSM 28. Further, such genes as spoIIB, spoIIM, spoIIIAA, spoIIIAB, spoIIIAD, spoIIIAF, spoVAA, spoVAB, spoVID, bofC, gerM, spmA, spmB, and yqfC were missing in the genomes of all members of the family Planococcaceae (Table S4), to which L. sphaericus was recently reassigned (58). These observations showed that, rather than being “the worst offender” in terms of genome content, L. sphaericus could be a valuable model organism to study the basics of the engulfment process.
The expansion of the initial genome set to include two spore-forming members of Erysipelotrichia, C. innocuum and E. ramosum, provided a window into an even greater loss of core sporulation genes (Table S4). In these organisms, sporulation appears to be less frequent than in members of other lineages, making them somewhat similar to oligo-sporulating mutants of B. subtilis. Nevertheless, the ability of C. innocuum and E. ramosum to form spores in the absence of numerous core genes (Table S4) makes them useful model organisms for studying minimal requirements for sporulation among the Firmicutes.
The conservation of core sporulation genes in Bacilli and Clostridia suggested that the ability to form spores was a common ancient feature of the Firmicutes (2, 3). Although the scenarios of the origin of Gram-negative (diderm) bacteria from spore-forming Gram-positive (monoderm) ones (136, 137) do not seem to be supported by phylogenetic trees of universal genes (126, 138, 139), understanding the fundamental mechanisms of sporulation and clarifying the origin of sporeformers remains a major goal in microbiology. Here, we used an expanded collection of the firmicute genomes to trace the phylogeny of the core sporulation genes within this phylum. Despite a certain variability in the branching order, proteins coming from the members of the same phylum typically formed well-supported clades. This was particularly clearly demonstrated by a comparison of the trees built from concatenated alignments of sporulation proteins and ribosomal proteins (Fig. 3). We and others have previously demonstrated that such a ribosomal protein-based tree is closely similar to the 16S rRNA gene-based tree and faithfully represents the organismal phylogeny of the Firmicutes (28, 49, 140). The remarkable congruence of these trees strongly suggested vertical inheritance of the sporulation genes from a common ancestor of all Firmicutes. Further, in the Spo0A tree (Fig. S3), proteins from nonsporeformers often grouped with Spo0As from sporeformers of the same bacillar or clostridial family, suggesting that Spo0A-encoding nonsporeformers lost the ability to form spores, rather than acquired their spo0A genes. Accordingly, non-spore-forming Spo0A+ bacteria typically retain many more sporulation genes than Spo0A− nonsporeformers (Fig. 1). These observations lend further support to the hypothesis that the ability to form (endo)spores was an ancestral feature of the Firmicutes that emerged in this phylum at the early stages of bacterial evolution (3, 32). This ability remains a unique feature of some Firmicutes, distinguishing them from all other bacterial lineages. The evolution of Firmicutes from their spore-forming last common ancestor involved extensive, but in most cases incomplete, loss of sporulation genes in lineages that lost the sporulation capacity. Subsets of the sporulation genes were also lost in several lineages of sporeformers. Experimental study of sporulation in these organisms with reduced complements of sporulation genes can be expected to yield further insight into the minimal biochemical requirements for sporulation.
Despite the substantial progress made in the past several years, a large fraction of sporulation genes remain uncharacterized. For some genes, this is reflected in their y names, but many genes with specific names have been characterized only with respect to the timing of their expression, as being controlled by sporulation-specific sigma factors, or as encoding proteins of the spore coat, whereas their biochemical activities remain obscure. For example, spore maturation proteins SpmA and SpmB, which are conserved in all sporeformers except for the members of Planococcaceae (Table S6), participate in spore core dehydration and are required for heat resistance of spores (141, 142). However, the biochemical mechanisms of spore dehydration by SpmA and SpmB remain unknown. We hope that phylogenetic profiles of the key sporulation proteins derived in this work (Table S2) will help in identifying priority targets for experimental research and allow the selection of suitable model organisms for such studies.
MATERIALS AND METHODS
Genome coverage.
The list of Firmicute genomes used in this work was taken from the recent release of the COG database (39). The COG genome set includes complete genomes of 175 members of the Firmicutes: 73 members of the class Bacilli, representing 59 different genera; 79 members of the class Clostridia, representing 76 genera; 10 members of the class Negativicutes; 9 members of the class Tissierellia; 2 members of the class Erysipelotrichia; 1 member of the class Limnochordia; and an additional genome of “Ndongobacter massiliensis,” which remained unclassified at the time of this work (143). Most genera were represented by a single genome, except for five members each of Bacillus spp. and Streptococcus spp., four members each of Clostridium spp. and Lactobacillus spp., three Streptococcus spp., and two Listeria spp. The list of these organisms is available in Table S1 in the supplemental material and also on the NCBI website https://ftp.ncbi.nih.gov/pub/COG/COG2020/data/cog-20.org.csv. Examination of the available publications allowed extraction of sporulation characteristics for 173 of these 175 organisms. For 74 of these, the ability to form spores was documented in the descriptions of the respective species (Table S1). Ninety-nine organisms were non-spore forming, based on the explicit descriptions of the respective strains and/or their lineages (such as the order Lactobacillales and families Acidaminococcaceae, Halanaerobiaceae, Listeriaceae, Peptoniphilaceae, Selenomonadaceae, Staphylococcaceae, and Veillonellaceae). For two organisms, the sporulation status remained unknown, as their (in)ability to form spores had not been properly documented and the respective lineages included both sporeformers and asporogens (Table S1).
Since both members of the Erysipelotrichia included in the COGs were asporogenic, the genomic list for this class was supplemented with five recently sequenced complete genomes: two from spore-forming members of the genus Erysipelatoclostridium, [Clostridium] innocuum ATCC 14501 (GenBank accession number CP048838.1) and Erysipelatoclostridium ramosum DSM 1402 (GenBank accession number CP036346.1), and three genomes of nonsporeformers, Amedibacterium intestinale (GenBank accession number AP019711.1), Faecalibaculum rodentium (GenBank accession number CP011391.1), and Intestinibaculum porci (GenBank accession number AP019309.1). This brought the total set to 180 genomes, 76 of which came from proven sporeformers (Table S1).
Protein selection.
The list of sporulation proteins analyzed in this work was based on the lists of genes that were previously demonstrated to play a role in the sporulation of B. subtilis and/or were regulated either by Spo0A or by the sporulation-specific sigma subunit SigE, SigF, SigG, or SigK (2, 7, 12, 13, 15–18, 33, 34, 123, 144), as documented in the SubtiWiki database (38).
The selected genes were matched against the COG database (39), and the patterns of presence or absence of the respective COGs in the selected organisms were recorded in Table S2 in the supplemental material (in some cases, a single COG included several paralogous sporulation genes). The resulting set consisted of 237 protein families (COGs), which included 112 COGs for widely conserved genes from the previous COG releases, such as ald (spoVN), ftsH (spoVK), ftsI (spoVE), ftsK (spoIIIE), spmA, spmB, spoVG, spoVS, and so on (Table S2). That list was expanded by the addition of 125 sporulation COGs that were made available in the recent release of the COG database (39). While this list mostly contained genes (proteins) characterized in B. subtilis, it also included clostridial forms of SpoIIQ (COG5833) and small acid-soluble spore protein (SASP) (COG5864), as well as the Amidase_6 domain (COG5877). The widely conserved sporulation genes (proteins) were assigned to one of the following eight functional groups: (i) sporulation onset and checkpoints (22 genes); (ii) Spo0A-regulated genes (9 genes); (iii) engulfment (12 genes); (iv) spore maturation, SigF- or SigG-regulated forespore expression (19 genes); (v) spore maturation, SigE- or SigK-regulated mother cell expression (25 genes); (vi) spore cortex synthesis (9 genes); (vii) spore coat and crust proteins (8 genes); and (viii) germination proteins (8 genes) (see Table S2). Two more groups included sporulation genes conserved in Bacilli (51 genes) and narrowly represented genes (74 genes) (Table S2). This selection was based on the previous studies (2, 15, 16, 28, 32, 36, 37) and the data from the SubtiWiki database (http://subtiwiki.uni-goettingen.de/) (38).
Verification of the COG profiles.
The patterns of presence-absence of the sporulation genes in the 180 analyzed genomes were extracted from the COG master file cog-20.cog.csv (available at https://ftp.ncbi.nih.gov/pub/COG/COG2020/data/ folder) and matched against the assembly numbers listed in Table S1, and the counts of each gene in the 76 sporeformer genomes were calculated, ignoring the paralogs. The cases of apparently essential sporulation proteins missing in certain sporeformer genomes were verified using a recent version of the TBLASTn program (55) that allows selection of specific target organisms (or lineages) based on their entries in the NCBI Taxonomy database (56). Proteins from either the B. subtilis reference set or closely related organisms (same genus or family) were used as queries. The resulting BLAST hits (cutoff E value, 0.1) were verified using CD-search (145) and compared against the protein sets in GenBank and RefSeq databases. Confirmed sporulation genes were used to correct the COG profiles. Newly translated sporulation ORFs will be reported to GenBank and/or RefSeq.
Phylogenetic analysis.
Unless indicated otherwise, protein sequences for phylogenetic analyses were aligned with MUSCLE v5 (146), and the maximum-likelihood trees were inferred using a locally installed version of IQ-TREE 2 (125) with the best-fit model automatically selected by ModelFinder (147) and branch support calculated using a Bayesian-like approximate-likelihood test (148). For each COG that included paralogs, a consensus was produced as described in reference 149; a pairwise similarity score between each sequence and the consensus sequence was calculated using the BLOSUM62 matrix (with the value of −4 assigned to a gap in the sequence, matching a nongap character of the consensus sequence); the paralog with the maximum score was used in the alignment. Individual phylogenetic trees, constructed using IQ-TREE 2 (125), are available from the authors.
For the phylogenetic analysis of Spo0A, the alignment included sequences from all 118 Spo0A-encoding organisms in the analyzed set: 76 sporeformers, 40 nonsporeformers, and 2 organisms with uncertain sporulation status (Table S1). The MUSCLE alignment was edited to remove variable-length linkers between the REC and DNA-binding domains (corresponding to residues 131 to 148 in the Spo0A of B. subtilis), which left 225 informative sites. The maximum-likelihood tree of Spo0A (Fig. S3) was constructed with IQ-TREE 2 (125) using the LG+R6 substitution model.
The SpoIVCA tree (Fig. S4B) was constructed from an alignment of 1,220 SpoIVCA-like site-specific DNA recombinases/integrases from bacteria and phages that were identified in a PSI-BLAST (55) search. The alignment contained 476 informative positions. The approximately maximum-likelihood tree was built with FastTree 2 (150) using the “–gamma” option to rescale the branch lengths and the WAG (151) model of amino acid evolution.
For phylogenetic analysis of sporulation, alignments of 41 core sporulation proteins from 76 sporeformers were constructed (the 40 genes marked in Table 1 with the addition of YmfB [COG0740], a SASP-degrading paralog of ClpP [152]). The few missing proteins (Table S2) were replaced with the respective sequences from closely related organisms (the same genus, where available). For the overall phylogenetic tree of sporulation core, these 41 alignments were concatenated, resulting in an alignment with a total of 10,129 informative sites, and the maximum-likelihood tree (Fig. 3, left) was built using IQ-TREE 2 (125) with the LG+F+R9 substitution model. The ribosomal protein-based tree (Fig. 3, right) was built from a concatenated sequence alignment of 54 ribosomal proteins from 76 sporeformers with IQ-TREE 2 and the LG+R8 substitution model.
ACKNOWLEDGMENT
This study was supported by the Intramural Research Program of the U.S. National Library of Medicine at the National Institutes of Health.
Footnotes
Supplemental material is available online only.
Contributor Information
Michael Y. Galperin, Email: galperin@ncbi.nlm.nih.gov.
Eugene V. Koonin, Email: koonin@ncbi.nlm.nih.gov.
Elizabeth Anne Shank, University of Massachusetts Medical School.
REFERENCES
- 1.Errington J. 1993. Bacillus subtilis sporulation: regulation of gene expression and control of morphogenesis. Microbiol Rev 57:1–33. 10.1128/mr.57.1.1-33.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.de Hoon MJ, Eichenberger P, Vitkup D. 2010. Hierarchical evolution of the bacterial sporulation network. Curr Biol 20:R735–R745. 10.1016/j.cub.2010.06.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Galperin MY. 2013. Genome diversity of spore-forming Firmicutes. Microbiol Spectr 1:TBS-0015-2012. 10.1128/microbiolspectrum.TBS-0015-2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Setlow P. 2014. Spore resistance properties. Microbiol Spectr 2:TBS-0003-2012. 10.1128/microbiolspec.TBS-0003-2012. [DOI] [PubMed] [Google Scholar]
- 5.Paredes-Sabja D, Shen A, Sorg JA. 2014. Clostridium difficile spore biology: sporulation, germination, and spore structural proteins. Trends Microbiol 22:406–416. 10.1016/j.tim.2014.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tan IS, Ramamurthi KS. 2014. Spore formation in Bacillus subtilis. Environ Microbiol Rep 6:212–225. 10.1111/1758-2229.12130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Driks A, Eichenberger P. 2016. The spore coat. Microbiol Spectr 4:TBS-0023-2016. 10.1128/microbiolspec.TBS-0023-2016. [DOI] [PubMed] [Google Scholar]
- 8.Khanna K, Lopez-Garrido J, Pogliano K. 2020. Shaping an endospore: architectural transformations during Bacillus subtilis sporulation. Annu Rev Microbiol 74:361–386. 10.1146/annurev-micro-022520-074650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Traag BA, Driks A, Stragier P, Bitter W, Broussard G, Hatfull G, Chu F, Adams KN, Ramakrishnan L, Losick R. 2010. Do mycobacteria produce endospores?.Proc Natl Acad Sci USA 107:878–881. 10.1073/pnas.0911299107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Beskrovnaya P, Fakih D, Morneau I, Hashimi A, Bello DG, Xing S, Nanci A, Huan T, Tocheva EI. 2020. No endospore formation confirmed in members of the phylum Proteobacteria. Appl Environ Microbiol 87:e02312-20. 10.1128/AEM.02312-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Paulson JN, Williams BL, Hehnly C, Mishra N, Sinnar SA, Zhang L, Ssentongo P, Mbabazi-Kabachelor E, Wijetunge DSS, von Bredow B, Mulondo R, Kiwanuka J, Bajunirwe F, Bazira J, Bebell LM, Burgoine K, Couto-Rodriguez M, Ericson JE, Erickson T, Ferrari M, Gladstone M, Guo C, Haran M, Hornig M, Isaacs AM, Kaaya BN, Kangere SM, Kulkarni AV, Kumbakumba E, Li X, Limbrick DD, Jr, Magombe J, Morton SU, Mugamba J, Ng J, Olupot-Olupot P, Onen J, Peterson MR, Roy F, Sheldon K, Townsend R, Weeks AD, Whalen AJ, Quackenbush J, Ssenyonga P, Galperin MY, Almeida M, Atkins H, Warf BC, Lipkin WI, et al. 2020. Paenibacillus infection with frequent viral coinfection contributes to postinfectious hydrocephalus in Ugandan infants. Sci Transl Med 12:eaba0565. 10.1126/scitranslmed.aba0565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Stragier P, Losick R. 1996. Molecular genetics of sporulation in Bacillus subtilis. Annu Rev Genet 30:297–241. 10.1146/annurev.genet.30.1.297. [DOI] [PubMed] [Google Scholar]
- 13.Piggot PJ, Losick R. 2002. Sporulation genes and intercompartmental regulation, p 483–518. In Sonenshein AL, Hoch JA, Losick R (ed), Bacillus subtilis and its closest relatives: from genes to cells. ASM Press, Washington, DC. [Google Scholar]
- 14.Piggot PJ, Hilbert DW. 2004. Sporulation of Bacillus subtilis. Curr Opin Microbiol 7:579–586. 10.1016/j.mib.2004.10.001. [DOI] [PubMed] [Google Scholar]
- 15.Eichenberger P, Jensen ST, Conlon EM, van Ooij C, Silvaggi J, Gonzalez-Pastor JE, Fujita M, Ben-Yehuda S, Stragier P, Liu JS, Losick R. 2003. The σE regulon and the identification of additional sporulation genes in Bacillus subtilis. J Mol Biol 327:945–972. 10.1016/S0022-2836(03)00205-5. [DOI] [PubMed] [Google Scholar]
- 16.Eichenberger P, Fujita M, Jensen ST, Conlon EM, Rudner DZ, Wang ST, Ferguson C, Haga K, Sato T, Liu JS, Losick R. 2004. The program of gene transcription for a single differentiating cell type during sporulation in Bacillus subtilis. PLoS Biol 2:e328. 10.1371/journal.pbio.0020328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Steil L, Serrano M, Henriques AO, Völker U. 2005. Genome-wide analysis of temporally regulated and compartment-specific gene expression in sporulating cells of Bacillus subtilis. Microbiology (Reading) 151:399–420. 10.1099/mic.0.27493-0. [DOI] [PubMed] [Google Scholar]
- 18.Wang ST, Setlow B, Conlon EM, Lyon JL, Imamura D, Sato T, Setlow P, Losick R, Eichenberger P. 2006. The forespore line of gene expression in Bacillus subtilis. J Mol Biol 358:16–37. 10.1016/j.jmb.2006.01.059. [DOI] [PubMed] [Google Scholar]
- 19.Veening JW, Murray H, Errington J. 2009. A mechanism for cell cycle regulation of sporulation initiation in Bacillus subtilis. Genes Dev 23:1959–1970. 10.1101/gad.528209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Meeske AJ, Rodrigues CD, Brady J, Lim HC, Bernhardt TG, Rudner DZ. 2016. High-throughput genetic screens identify a large and diverse collection of new sporulation genes in Bacillus subtilis. PLoS Biol 14:e1002341. 10.1371/journal.pbio.1002341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Liu H, Bergman NH, Thomason B, Shallom S, Hazen A, Crossno J, Rasko DA, Ravel J, Read TD, Peterson SN, Yates J, III, Hanna PC. 2004. Formation and composition of the Bacillus anthracis endospore. J Bacteriol 186:164–178. 10.1128/JB.186.1.164-178.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Lawley TD, Croucher NJ, Yu L, Clare S, Sebaihia M, Goulding D, Pickard DJ, Parkhill J, Choudhary J, Dougan G. 2009. Proteomic and genomic characterization of highly infectious Clostridium difficile 630 spores. J Bacteriol 191:5377–5386. PubMed:19542279. 10.1128/JB.00597-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zheng L, Abhyankar W, Ouwerling N, Dekker HL, van Veen H, van der Wel NN, Roseboom W, de Koning LJ, Brul S, de Koster CG. 2016. Bacillus subtilis spore inner membrane proteome. J Proteome Res 15:585–594. 10.1021/acs.jproteome.5b00976. [DOI] [PubMed] [Google Scholar]
- 24.Alsaker KV, Papoutsakis ET. 2005. Transcriptional program of early sporulation and stationary-phase events in Clostridium acetobutylicum. J Bacteriol 187:7103–7118. 10.1128/JB.187.20.7103-7118.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Paredes CJ, Alsaker KV, Papoutsakis ET. 2005. A comparative genomic view of clostridial sporulation and physiology. Nat Rev Microbiol 3:969–978. 10.1038/nrmicro1288. [DOI] [PubMed] [Google Scholar]
- 26.Jones SW, Paredes CJ, Tracy B, Cheng N, Sillers R, Senger RS, Papoutsakis ET. 2008. The transcriptional program underlying the physiology of clostridial sporulation. Genome Biol 9:R114. 10.1186/gb-2008-9-7-r114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Shen A, Edwards AN, Sarker MR, Paredes-Sabja D. 2019. Sporulation and germination in clostridial pathogens. Microbiol Spectr 7:GPP3-0017-2018. 10.1128/microbiolspec.GPP3-0017-2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Yutin N, Galperin MY. 2013. A genomic update on clostridial phylogeny: Gram-negative spore formers and other misplaced clostridia. Environ Microbiol 15:2631–2641. 10.1111/1462-2920.12173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Watanabe M, Kojima H, Fukui M. 2015. Limnochorda pilosa gen. nov., sp. nov., a moderately thermophilic, facultatively anaerobic, pleomorphic bacterium and proposal of Limnochordaceae fam. nov., Limnochordales ord. nov. and Limnochordia classis nov. in the phylum Firmicutes. Int J Syst Evol Microbiol 65:2378–2384. 10.1099/ijs.0.000267. [DOI] [PubMed] [Google Scholar]
- 30.Lawson PA. 2019. Tissierella, p 1–12. In Whitman WB (ed) Bergey's manual of systematics of Archaea and Bacteria. John Wiley & Sons, New York, NY. 10.1002/9781118960608.gbm00721.pub2. [DOI] [Google Scholar]
- 31.Oren A. 2014. The order Halanaerobiales, and the families Halanaerobiaceae and Halobacteroidaceae, p 153–177. In Rosenberg E, DeLong EF, Lory S, Stackebrandt E, Thompson F (ed), The Prokaryotes. Firmicutes and Tenericutes, 4th ed. Springer-Verlag, Berlin, Germany. [Google Scholar]
- 32.Galperin MY, Mekhedov SL, Puigbo P, Smirnov S, Wolf YI, Rigden DJ. 2012. Genomic determinants of sporulation in Bacilli and Clostridia: towards the minimal set of sporulation-specific genes. Environ Microbiol 14:2870–2890. 10.1111/j.1462-2920.2012.02841.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Onyenwoke RU, Brill JA, Farahi K, Wiegel J. 2004. Sporulation genes in members of the low G+C Gram-type-positive phylogenetic branch (Firmicutes). Arch Microbiol 182:182–192. 10.1007/s00203-004-0696-y. [DOI] [PubMed] [Google Scholar]
- 34.Stragier P. 2002. A gene odyssey: exploring the genomes of endospore-forming bacteria, p 519–526. In Sonenshein AL, Hoch JA, Losick R (ed), Bacillus subtilis and its closest relatives: from genes to cells. ASM Press, Washington, DC. [Google Scholar]
- 35.Wu M, Ren Q, Durkin AS, Daugherty SC, Brinkac LM, Dodson RJ, Madupu R, Sullivan SA, Kolonay JF, Haft DH, Nelson WC, Tallon LJ, Jones KM, Ulrich LE, Gonzalez JM, Zhulin IB, Robb FT, Eisen JA. 2005. Life in hot carbon monoxide: the complete genome sequence of Carboxydothermus hydrogenoformans Z-2901. PLoS Genet 1:e65. 10.1371/journal.pgen.0010065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Abecasis AB, Serrano M, Alves R, Quintais L, Pereira-Leal JB, Henriques AO. 2013. A genomic signature and the identification of new sporulation genes. J Bacteriol 195:2101–2115. 10.1128/JB.02110-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ramos-Silva P, Serrano M, Henriques AO. 2019. From root to tips: sporulation evolution and specialization in Bacillus subtilis and the intestinal pathogen Clostridioides difficile. Mol Biol Evol 36:2714–2736. 10.1093/molbev/msz175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Pedreira T, Elfmann C, Stülke J. 2022. The current state of SubtiWiki, the database for the model organism Bacillus subtilis. Nucleic Acids Res 50:D875–D882. 10.1093/nar/gkab943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Galperin MY, Wolf YI, Makarova KS, Vera Alvarez R, Landsman D, Koonin EV. 2021. COG database update: focus on microbial diversity, model organisms, and widespread pathogens. Nucleic Acids Res 49:D274–D281. 10.1093/nar/gkaa1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Natale DA, Galperin MY, Tatusov RL, Koonin EV. 2000. Using the COG database to improve gene recognition in complete genomes. Genetica 108:9–17. 10.1023/A:1004031323748. [DOI] [PubMed] [Google Scholar]
- 41.Tatusov RL, Koonin EV, Lipman DJ. 1997. A genomic perspective on protein families. Science 278:631–637. 10.1126/science.278.5338.631. [DOI] [PubMed] [Google Scholar]
- 42.Galperin MY, Makarova KS, Wolf YI, Koonin EV. 2015. Expanded microbial genome coverage and improved protein family annotation in the COG database. Nucleic Acids Res 43:D261–D269. 10.1093/nar/gku1223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Sokolova T, Hanel J, Onyenwoke RU, Reysenbach AL, Banta A, Geyer R, Gonzalez JM, Whitman WB, Wiegel J. 2007. Novel chemolithotrophic, thermophilic, anaerobic bacteria Thermolithobacter ferrireducens gen. nov., sp. nov. and Thermolithobacter carboxydivorans sp. nov. Extremophiles 11:145–157. 10.1007/s00792-006-0022-5. [DOI] [PubMed] [Google Scholar]
- 44.Neupane S, Ghosh A, Gunther S, Martin K, Zurek L. 2020. Culicoidibacter larvae gen. nov., sp. nov., from the gastrointestinal tract of the biting midge (Culicoides sonorensis) larva, belongs to a novel lineage Culicoidibacteraceae fam. nov., Culicoidibacterales ord. nov. and Culicoidibacteria classis nov. of the phylum Firmicutes. Int J Syst Evol Microbiol 70:6482–6490. 10.1099/ijsem.0.004543. [DOI] [PubMed] [Google Scholar]
- 45.Haft DH, DiCuccio M, Badretdin A, Brover V, Chetvernin V, O'Neill K, Li W, Chitsaz F, Derbyshire MK, Gonzales NR, Gwadz M, Lu F, Marchler GH, Song JS, Thanki N, Yamashita RA, Zheng C, Thibaud-Nissen F, Geer LY, Marchler-Bauer A, Pruitt KD. 2018. RefSeq: an update on prokaryotic genome annotation and curation. Nucleic Acids Res 46:D851–D860. 10.1093/nar/gkx1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Smith LD, King E. 1962. Clostridium innocuum, sp. n., a spore-forming anaerobe isolated from human infections. J Bacteriol 83:938–939. 10.1128/jb.83.4.938-939.1962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Holdeman LV, Cato EP, Moore WEC. 1971. Clostridium ramosum (Vuillemin) comb. nov.: emended description and proposed neotype strain. Int J Syst Evol Microbiol 21:35–39. 10.1099/00207713-21-1-35. [DOI] [Google Scholar]
- 48.Karnachuk OV, Frank YA, Lukina AP, Kadnikov VV, Beletsky AV, Mardanov AV, Ravin NV. 2019. Domestication of previously uncultivated Candidatus Desulforudis audaxviator from a deep aquifer in Siberia sheds light on its physiology and evolution. ISME J 13:1947–1959. 10.1038/s41396-019-0402-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Galperin MY, Brover V, Tolstoy I, Yutin N. 2016. Phylogenomic analysis of the family Peptostreptococcaceae (Clostridium cluster XI) and proposal for reclassification of Clostridium litorale (Fendrich et al. 1991) and Eubacterium acidaminophilum (Zindel et al. 1989) as Peptoclostridium litorale gen. nov. comb. nov. and Peptoclostridium acidaminophilum comb. nov. Int J Syst Evol Microbiol 66:5506–5513. 10.1099/ijsem.0.001548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Poehlein A, Yutin N, Daniel R, Galperin MY. 2017. Proposal for the reclassification of obligately purine-fermenting bacteria Clostridium acidurici (Barker 1938) and Clostridium purinilyticum (Durre et al. 1981) as Gottschalkia acidurici gen. nov. comb. nov. and Gottschalkia purinilytica comb. nov. and of Eubacterium angustum (Beuscher and Andreesen 1985) as Andreesenia angusta gen. nov. comb. nov. in the family Gottschalkiaceae fam. nov. Int J Syst Evol Microbiol 67:2711–2719. 10.1099/ijsem.0.002008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Davidson P, Eutsey R, Redler B, Hiller NL, Laub MT, Durand D. 2018. Flexibility and constraint: evolutionary remodeling of the sporulation initiation pathway in Firmicutes. PLoS Genet 14:e1007470. 10.1371/journal.pgen.1007470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Orsburn BC, Melville SB, Popham DL. 2010. EtfA catalyses the formation of dipicolinic acid in Clostridium perfringens. Mol Microbiol 75:178–186. 10.1111/j.1365-2958.2009.06975.x. [DOI] [PubMed] [Google Scholar]
- 53.Galperin MY, Wolf YI, Garushyants SK, Vera Alvarez R, Koonin EV. 2021. Non-essential ribosomal proteins in bacteria and archaea identified using Clusters of Orthologous Genes. J Bacteriol 203:e00058-21. 10.1128/JB.00058-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Boratyn GM, Schaffer AA, Agarwala R, Altschul SF, Lipman DJ, Madden TL. 2012. Domain enhanced lookup time accelerated BLAST. Biol Direct 7:12. 10.1186/1745-6150-7-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. 1997. Gapped BLAST and PSI-BLAST—a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402. 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Schoch CL, Ciufo S, Domrachev M, Hotton CL, Kannan S, Khovanskaya R, Leipe D, Mcveigh R, O’Neill K, Robbertse B, Sharma S, Soussov V, Sullivan JP, Sun L, Turner S, Karsch-Mizrachi I. 2020. NCBI Taxonomy: a comprehensive update on curation, resources and tools. Database (Oxford) 2020:baaa062. 10.1093/database/baaa062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Yaakop AS, Chan KG, Ee R, Kahar UM, Kon WC, Goh KM. 2015. Isolation of Jeotgalibacillus malaysiensis sp. nov. from a sandy beach, and emended description of the genus Jeotgalibacillus. Int J Syst Evol Microbiol 65:2215–2221. 10.1099/ijs.0.000242. [DOI] [PubMed] [Google Scholar]
- 58.Gupta RS, Patel S. 2019. Robust demarcation of the family Caryophanaceae (Planococcaceae) and its different genera including three novel genera based on phylogenomics and highly specific molecular signatures. Front Microbiol 10:2821. 10.3389/fmicb.2019.02821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Luo Y, Korza G, DeMarco AM, Kuipers OP, Li YQ, Setlow P. 2021. Properties of spores of Bacillus subtilis with or without a transposon that decreases spore germination and increases spore wet heat resistance. J Appl Microbiol 131:2918–2928. 10.1111/jam.15163. [DOI] [PubMed] [Google Scholar]
- 60.Davis JJ, Xia F, Overbeek RA, Olsen GJ. 2013. Genomes of the class Erysipelotrichia clarify the firmicute origin of the class Mollicutes. Int J Syst Evol Microbiol 63:2727–2741. 10.1099/ijs.0.048983-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Soto-Avila L, Merce RC, Santos W, Castaneda N, Gutierrez-Rios RM. 2021. Distribution and preservation of the components of the engulfment. What is beyond representative genomes? PLoS One 16:e0246651. 10.1371/journal.pone.0246651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Paredes-Sabja D, Setlow P, Sarker MR. 2011. Germination of spores of Bacillales and Clostridiales species: mechanisms and proteins involved. Trends Microbiol 19:85–94. 10.1016/j.tim.2010.10.004. [DOI] [PubMed] [Google Scholar]
- 63.Xiao Y, Francke C, Abee T, Wells-Bennik MH. 2011. Clostridial spore germination versus bacilli: genome mining and current insights. Food Microbiol 28:266–274. 10.1016/j.fm.2010.03.016. [DOI] [PubMed] [Google Scholar]
- 64.Clauwers C, Lood C, Van den Bergh B, van Noort V, Michiels CW. 2017. Canonical germinant receptor is dispensable for spore germination in Clostridium botulinum group II strain NCTC 11219. Sci Rep 7:15426. 10.1038/s41598-017-15839-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Al-Hinai MA, Jones SW, Papoutsakis ET. 2015. The Clostridium sporulation programs: diversity and preservation of endospore differentiation. Microbiol Mol Biol Rev 79:19–37. 10.1128/MMBR.00025-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Zhang L, Higgins ML, Piggot PJ. 1997. The division during bacterial sporulation is symmetrically located in Sporosarcina ureae. Mol Microbiol 25:1091–1098. 10.1046/j.1365-2958.1997.5341892.x. [DOI] [PubMed] [Google Scholar]
- 67.Kroos L, Zhang B, Ichikawa H, Yu YT. 1999. Control of σ factor activity during Bacillus subtilis sporulation. Mol Microbiol 31:1285–1294. 10.1046/j.1365-2958.1999.01214.x. [DOI] [PubMed] [Google Scholar]
- 68.Fimlaid KA, Shen A. 2015. Diverse mechanisms regulate sporulation sigma factor activity in the Firmicutes. Curr Opin Microbiol 24:88–95. 10.1016/j.mib.2015.01.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Sun G, Yang M, Jiang L, Huang M. 2021. Regulation of pro-σK activation: a key checkpoint in Bacillus subtilis sporulation. Environ Microbiol 23:2366–2373. 10.1111/1462-2920.15415. [DOI] [PubMed] [Google Scholar]
- 70.Stragier P, Kunkel B, Kroos L, Losick R. 1989. Chromosomal rearrangement generating a composite gene for a developmental transcription factor. Science 243:507–512. 10.1126/science.2536191. [DOI] [PubMed] [Google Scholar]
- 71.Kunkel B, Losick R, Stragier P. 1990. The Bacillus subtilis gene for the development transcription factor σK is generated by excision of a dispensable DNA element containing a sporulation recombinase gene. Genes Dev 4:525–535. 10.1101/gad.4.4.525. [DOI] [PubMed] [Google Scholar]
- 72.Haraldsen JD, Sonenshein AL. 2003. Efficient sporulation in Clostridium difficile requires disruption of the σK gene. Mol Microbiol 48:811–821. 10.1046/j.1365-2958.2003.03471.x. [DOI] [PubMed] [Google Scholar]
- 73.Serrano M, Kint N, Pereira FC, Saujet L, Boudry P, Dupuy B, Henriques AO, Martin-Verstraete I. 2016. A recombination directionality factor controls the cell type-specific activation of σK and the fidelity of spore development in Clostridium difficile. PLoS Genet 12:e1006312. 10.1371/journal.pgen.1006312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Kim KP, Born Y, Lurz R, Eichenseher F, Zimmer M, Loessner MJ, Klumpp J. 2012. Inducible Clostridium perfringens bacteriophages PhiS9 and PhiS63: different genome structures and a fully functional sigK intervening element. Bacteriophage 2:89–97. 10.4161/bact.21363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Cohen JE, Wang R, Shen RF, Wu WW, Keller JE. 2017. Comparative pathogenomics of Clostridium tetani. PLoS One 12:e0182909. 10.1371/journal.pone.0182909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Ramirez-Guadiana FH, Rodrigues CDA, Marquis KA, Campo N, Barajas-Ornelas RDC, Brock K, Marks DS, Kruse AC, Rudner DZ. 2018. Evidence that regulation of intramembrane proteolysis is mediated by substrate gating during sporulation in Bacillus subtilis. PLoS Genet 14:e1007753. 10.1371/journal.pgen.1007753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Campo N, Rudner DZ. 2007. SpoIVB and CtpB are both forespore signals in the activation of the sporulation transcription factor σK in Bacillus subtilis. J Bacteriol 189:6021–6027. 10.1128/JB.00399-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Oke V, Shchepetov M, Cutting S. 1997. SpoIVB has two distinct functions during spore formation in Bacillus subtilis. Mol Microbiol 23:223–230. 10.1046/j.1365-2958.1997.2091573.x. [DOI] [PubMed] [Google Scholar]
- 79.Jiang X, Rubio A, Chiba S, Pogliano K. 2005. Engulfment-regulated proteolysis of SpoIIQ: evidence that dual checkpoints control sigma activity. Mol Microbiol 58:102–115. 10.1111/j.1365-2958.2005.04811.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Saujet L, Pereira FC, Serrano M, Soutourina O, Monot M, Shelyakin PV, Gelfand MS, Dupuy B, Henriques AO, Martin-Verstraete I. 2013. Genome-wide analysis of cell type-specific gene transcription during spore formation in Clostridium difficile. PLoS Genet 9:e1003756. 10.1371/journal.pgen.1003756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Patterson HM, Brannigan JA, Cutting SM, Wilson KS, Wilkinson AJ, Ab E, Diercks T, de Jong RN, Truffault V, Folkers GE, Kaptein R. 2005. The structure of bypass of forespore C, an intercompartmental signaling factor during sporulation in Bacillus. J Biol Chem 280:36214–36220. 10.1074/jbc.M506910200. [DOI] [PubMed] [Google Scholar]
- 82.Aung S, Shum J, Abanes-De Mello A, Broder DH, Fredlund-Gutierrez J, Chiba S, Pogliano K. 2007. Dual localization pathways for the engulfment proteins during Bacillus subtilis sporulation. Mol Microbiol 65:1534–1546. 10.1111/j.1365-2958.2007.05887.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Chastanet A, Losick R. 2007. Engulfment during sporulation in Bacillus subtilis is governed by a multi-protein complex containing tandemly acting autolysins. Mol Microbiol 64:139–152. 10.1111/j.1365-2958.2007.05652.x. [DOI] [PubMed] [Google Scholar]
- 84.Crawshaw AD, Serrano M, Stanley WA, Henriques AO, Salgado PS. 2014. A mother cell-to-forespore channel: current understanding and future challenges. FEMS Microbiol Lett 358:129–136. 10.1111/1574-6968.12554. [DOI] [PubMed] [Google Scholar]
- 85.Rodrigues CD, Ramirez-Guadiana FH, Meeske AJ, Wang X, Rudner DZ. 2016. GerM is required to assemble the basal platform of the SpoIIIA-SpoIIQ transenvelope complex during sporulation in Bacillus subtilis. Mol Microbiol 102:260–273. 10.1111/mmi.13457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Kelly A, Salgado PS. 2019. The engulfasome in C. difficile: variations on protein machineries. Anaerobe 60:102091. 10.1016/j.anaerobe.2019.102091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Dembek M, Kelly A, Barwinska-Sendra A, Tarrant E, Stanley WA, Vollmer D, Biboy J, Gray J, Vollmer W, Salgado PS. 2018. Peptidoglycan degradation machinery in Clostridium difficile forespore engulfment. Mol Microbiol 110:390–410. 10.1111/mmi.14091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Ribis JW, Fimlaid KA, Shen A. 2018. Differential requirements for conserved peptidoglycan remodeling enzymes during Clostridioides difficile spore formation. Mol Microbiol 110:370–389. 10.1111/mmi.14090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Doan T, Marquis KA, Rudner DZ. 2005. Subcellular localization of a sporulation membrane protein is achieved through a network of interactions along and across the septum. Mol Microbiol 55:1767–1781. 10.1111/j.1365-2958.2005.04501.x. [DOI] [PubMed] [Google Scholar]
- 90.Levdikov VM, Blagova EV, McFeat A, Fogg MJ, Wilson KS, Wilkinson AJ. 2012. Structure of components of an intercellular channel complex in sporulating Bacillus subtilis. Proc Natl Acad Sci USA 109:5441–5445. 10.1073/pnas.1120087109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Meisner J, Maehigashi T, Andre I, Dunham CM, Moran CP, Jr.. 2012. Structure of the basal components of a bacterial transporter. Proc Natl Acad Sci USA 109:5446–5451. 10.1073/pnas.1120113109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Serrano M, Crawshaw AD, Dembek M, Monteiro JM, Pereira FC, Pinho MG, Fairweather NF, Salgado PS, Henriques AO. 2016. The SpoIIQ-SpoIIIAH complex of Clostridium difficile controls forespore engulfment and late stages of gene expression and spore morphogenesis. Mol Microbiol 100:204–228. 10.1111/mmi.13311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Blaylock B, Jiang X, Rubio A, Moran CP, Jr, Pogliano K. 2004. Zipper-like interaction between proteins in adjacent daughter cells mediates protein localization. Genes Dev 18:2916–2928. 10.1101/gad.1252704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Ojkic N, Lopez-Garrido J, Pogliano K, Endres RG. 2014. Bistable forespore engulfment in Bacillus subtilis by a zipper mechanism in absence of the cell wall. PLoS Comput Biol 10:e1003912. 10.1371/journal.pcbi.1003912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Rodrigues CD, Henry X, Neumann E, Kurauskas V, Bellard L, Fichou Y, Schanda P, Schoehn G, Rudner DZ, Morlot C. 2016. A ring-shaped conduit connects the mother cell and forespore during sporulation in Bacillus subtilis. Proc Natl Acad Sci USA 113:11585–11590. 10.1073/pnas.1609604113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Setlow P. 1988. Small, acid-soluble spore proteins of Bacillus species: structure, synthesis, genetics, function, and degradation. Annu Rev Microbiol 42:319–338. 10.1146/annurev.mi.42.100188.001535. [DOI] [PubMed] [Google Scholar]
- 97.Setlow P. 2007. I will survive: DNA protection in bacterial spores. Trends Microbiol 15:172–180. 10.1016/j.tim.2007.02.004. [DOI] [PubMed] [Google Scholar]
- 98.Mason JM, Setlow P. 1986. Essential role of small, acid-soluble spore proteins in resistance of Bacillus subtilis spores to UV light. J Bacteriol 167:174–178. 10.1128/jb.167.1.174-178.1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Moeller R, Setlow P, Reitz G, Nicholson WL. 2009. Roles of small, acid-soluble spore proteins and core water content in survival of Bacillus subtilis spores exposed to environmental solar UV radiation. Appl Environ Microbiol 75:5202–5208. 10.1128/AEM.00789-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Cortesao M, Fuchs FM, Commichau FM, Eichenberger P, Schuerger AC, Nicholson WL, Setlow P, Moeller R. 2019. Bacillus subtilis spore resistance to simulated Mars surface conditions. Front Microbiol 10:333. 10.3389/fmicb.2019.00333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Bagyan I, Setlow B, Setlow P. 1998. New small, acid-soluble proteins unique to spores of Bacillus subtilis: identification of the coding genes and regulation and function of two of these genes. J Bacteriol 180:6704–6712. 10.1128/JB.180.24.6704-6712.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Vyas J, Cox J, Setlow B, Coleman WH, Setlow P. 2011. Extremely variable conservation of gamma-type small, acid-soluble proteins from spores of some species in the bacterial order Bacillales. J Bacteriol 193:1884–1892. 10.1128/JB.00018-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Li J, McClane BA. 2008. A novel small acid soluble protein variant is important for spore resistance of most Clostridium perfringens food poisoning isolates. PLoS Pathog 4:e1000056. 10.1371/journal.ppat.1000056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Wetzel D, Fischer RJ. 2015. Small acid-soluble spore proteins of Clostridium acetobutylicum are able to protect DNA in vitro and are specifically cleaved by germination protease GPR and spore protease YyaC. Microbiology (Reading) 161:2098–2109. 10.1099/mic.0.000162. [DOI] [PubMed] [Google Scholar]
- 105.Fawcett P, Eichenberger P, Losick R, Youngman P. 2000. The transcriptional profile of early to middle sporulation in Bacillus subtilis. Proc Natl Acad Sci USA 97:8063–8068. 10.1073/pnas.140209597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Asai K, Takamatsu H, Iwano M, Kodama T, Watabe K, Ogasawara N. 2001. The Bacillus subtilis yabQ gene is essential for formation of the spore cortex. Microbiology (Reading) 147:919–927. 10.1099/00221287-147-4-919. [DOI] [PubMed] [Google Scholar]
- 107.Permpoonpattana P, Phetcharaburanin J, Mikelsone A, Dembek M, Tan S, Brisson MC, La Ragione R, Brisson AR, Fairweather N, Hong HA, Cutting SM. 2013. Functional characterization of Clostridium difficile spore coat proteins. J Bacteriol 195:1492–1503. 10.1128/JB.02104-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Secaira-Morocho H, Castillo JA, Driks A. 2020. Diversity and evolutionary dynamics of spore-coat proteins in spore-forming species of.Bacillales Microb Genom 6:mgen000451. 10.1099/mgen.0.000451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Wang KH, Isidro AL, Domingues L, Eskandarian HA, McKenney PT, Drew K, Grabowski P, Chua MH, Barry SN, Guan M, Bonneau R, Henriques AO, Eichenberger P. 2009. The coat morphogenetic protein SpoVID is necessary for spore encasement in Bacillus subtilis. Mol Microbiol 74:634–649. 10.1111/j.1365-2958.2009.06886.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.de Francesco M, Jacobs JZ, Nunes F, Serrano M, McKenney PT, Chua MH, Henriques AO, Eichenberger P. 2012. Physical interaction between coat morphogenetic proteins SpoVID and CotE is necessary for spore encasement in Bacillus subtilis. J Bacteriol 194:4941–4950. 10.1128/JB.00914-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Putnam EE, Nock AM, Lawley TD, Shen A. 2013. SpoIVA and SipL are Clostridium difficile spore morphogenetic proteins. J Bacteriol 195:1214–1225. 10.1128/JB.02181-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Touchette MH, Benito de la Puebla H, Ravichandran P, Shen A. 2019. SpoIVA-SipL complex formation is essential for Clostridioides difficile spore assembly. J Bacteriol 201:e00042-19. 10.1128/JB.00042-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Ribis JW, Ravichandran P, Putnam EE, Pishdadian K, Shen A. 2017. The conserved spore coat protein SpoVM Is largely dispensable in Clostridium difficile spore formation. mSphere 2:e00315-17. 10.1128/mSphere.00315-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Permpoonpattana P, Tolls EH, Nadem R, Tan S, Brisson A, Cutting SM. 2011. Surface layers of Clostridium difficile endospores. J Bacteriol 193:6461–6470. 10.1128/JB.05182-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Donnelly ML, Li W, Li YQ, Hinkel L, Setlow P, Shen A. 2017. A Clostridium difficile-specific, gel-forming protein required for optimal spore ggermination mBio 8:e02085-16. 10.1128/mBio.02085-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Rohlfing AE, Eckenroth BE, Forster ER, Kevorkian Y, Donnelly ML, Benito de la Puebla H, Doublie S, Shen A. 2019. The CspC pseudoprotease regulates germination of Clostridioides difficile spores in response to multiple environmental signals. PLoS Genet 15:e1008224. 10.1371/journal.pgen.1008224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Meaney CA, Cartman ST, McClure PJ, Minton NP. 2015. Optimal spore germination in Clostridium botulinum ATCC 3502 requires the presence of functional copies of SleB and YpeB, but not CwlJ. Anaerobe 34:86–93. 10.1016/j.anaerobe.2015.04.015. [DOI] [PubMed] [Google Scholar]
- 118.Sayer CV, Popham DL. 2019. YpeB dimerization may be required to stabilize SleB for effective germination of Bacillus anthracis spores. BMC Microbiol 19:169. 10.1186/s12866-019-1544-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Amon JD, Yadav AK, Ramirez-Guadiana FH, Meeske AJ, Cava F, Rudner DZ. 2020. SwsB and SafA are required for CwlJ-dependent spore germination in Bacillus subtilis. J Bacteriol 202:e00668-19. PubMed:31871031. 10.1128/JB.00668-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA, Sonnhammer ELL, Tosatto SCE, Paladin L, Raj S, Richardson LJ, Finn RD, Bateman A. 2021. Pfam: the protein families database in 2021. Nucleic Acids Res 49:D412–D419. 10.1093/nar/gkaa913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Natarajan P, Punta M, Kumar A, Yeh AP, Godzik A, Aravind L. 2015. Structure and sequence analyses of Bacteroides proteins BVU_4064 and BF1687 reveal presence of two novel predominantly-beta domains, predicted to be involved in lipid and cell surface interactions. BMC Bioinformatics 16:7. 10.1186/s12859-014-0434-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Nessi C, Jedrzejas MJ, Setlow P. 1998. Structure and mechanism of action of the protease that degrades small, acid-soluble spore proteins during germination of spores of Bacillus species. J Bacteriol 180:5077–5084. 10.1128/JB.180.19.5077-5084.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Molle V, Fujita M, Jensen ST, Eichenberger P, Gonzalez-Pastor JE, Liu JS, Losick R. 2003. The Spo0A regulon of Bacillus subtilis. Mol Microbiol 50:1683–1701. 10.1046/j.1365-2958.2003.03818.x. [DOI] [PubMed] [Google Scholar]
- 124.Bate AR, Bonneau R, Eichenberger P. 2014. Bacillus subtilis systems biology: applications of -omics techniques to the study of endospore formation. Microbiol Spectr 2:TBS-0019–2013. PubMed:26105826. 10.1128/microbiolspec.TBS-0019-2013. [DOI] [PubMed] [Google Scholar]
- 125.Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. 2020. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol 37:1530–1534. 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Yutin N, Puigbo P, Koonin EV, Wolf YI. 2012. Phylogenomics of prokaryotic ribosomal proteins. PLoS One 7:e36972. 10.1371/journal.pone.0036972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Cohen O, Ashkenazy H, Belinky F, Huchon D, Pupko T. 2010. GLOOME: gain loss mapping engine. Bioinformatics 26:2914–2915. 10.1093/bioinformatics/btq549. [DOI] [PubMed] [Google Scholar]
- 128.Cano RJ, Borucki MK. 1995. Revival and identification of bacterial spores in 25- to 40-million-year-old Dominican amber. Science 268:1060–1064. 10.1126/science.7538699. [DOI] [PubMed] [Google Scholar]
- 129.Browne HP, Forster SC, Anonye BO, Kumar N, Neville BA, Stares MD, Goulding D, Lawley TD. 2016. Culturing of 'unculturable' human microbiota reveals novel taxa and extensive sporulation. Nature 533:543–546. 10.1038/nature17645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Kevbrin VV, Zhilina TN, Zavarzin GA. 1995. Physiology of the halophilic homoacetic bacterium Acetohalobium arabaticum. Microbiology (Moscow) 64:134–138. [Google Scholar]
- 131.Sikorski J, Lapidus A, Chertkov O, Lucas S, Copeland A, Glavina Del Rio T, Nolan M, Tice H, Cheng JF, Han C, Brambilla E, Pitluck S, Liolios K, Ivanova N, Mavromatis K, Mikhailova N, Pati A, Bruce D, Detter C, Tapia R, Goodwin L, Chen A, Palaniappan K, Land M, Hauser L, Chang YJ, Jeffries CD, Rohde M, Goker M, Spring S, Woyke T, Bristow J, Eisen JA, Markowitz V, Hugenholtz P, Kyrpides NC, Klenk HP. 2010. Complete genome sequence of Acetohalobium arabaticum type strain (Z-7288). Stand Genomic Sci 3:57–65. 10.4056/sigs.1062906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Bosshard PP, Zbinden R, Altwegg M. 2002. Turicibacter sanguinis gen. nov., sp. nov., a novel anaerobic, Gram-positive bacterium. Int J Syst Evol Microbiol 52:1263–1266. 10.1099/00207713-52-4-1263. [DOI] [PubMed] [Google Scholar]
- 133.Alexander CJ, Citron DM, Brazier JS, Goldstein EJ. 1995. Identification and antimicrobial resistance patterns of clinical isolates of Clostridium clostridioforme, Clostridium innocuum, and Clostridium ramosum compared with those of clinical isolates of Clostridium perfringens. J Clin Microbiol 33:3209–3215. 10.1128/jcm.33.12.3209-3215.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Crum-Cianflone N. 2009. Clostridium innocuum bacteremia in a patient with acquired immunodeficiency syndrome. Am J Med Sci 337:480–482. 10.1097/MAJ.0b013e31819f1e95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Hu X, Fan W, Han B, Liu H, Zheng D, Li Q, Dong W, Yan J, Gao M, Berry C, Yuan Z. 2008. Complete genome sequence of the mosquitocidal bacterium Bacillus sphaericus C3-41 and comparison with those of closely related Bacillus species. J Bacteriol 190:2892–2902. 10.1128/JB.01652-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Tocheva EI, Matson EG, Morris DM, Moussavi F, Leadbetter JR, Jensen GJ. 2011. Peptidoglycan remodeling and conversion of an inner membrane into an outer membrane during sporulation. Cell 146:799–812. 10.1016/j.cell.2011.07.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Tocheva EI, Ortega DR, Jensen GJ. 2016. Sporulation, bacterial cell envelopes and the origin of life. Nat Rev Microbiol 14:535–542. 10.1038/nrmicro.2016.85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Megrian D, Taib N, Witwinowski J, Beloin C, Gribaldo S. 2020. One or two membranes? Diderm Firmicutes challenge the Gram-positive/Gram-negative divide. Mol Microbiol 113:659–671. 10.1111/mmi.14469. [DOI] [PubMed] [Google Scholar]
- 139.Taib N, Megrian D, Witwinowski J, Adam P, Poppleton D, Borrel G, Beloin C, Gribaldo S. 2020. Genome-wide analysis of the Firmicutes illuminates the diderm/monoderm transition. Nat Ecol Evol 4:1661–1672. 10.1038/s41559-020-01299-7. [DOI] [PubMed] [Google Scholar]
- 140.Teeling H, Gloeckner FO. 2006. RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits. BMC Bioinformatics 7:66. 10.1186/1471-2105-7-66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Popham DL, Illades-Aguiar B, Setlow P. 1995. The Bacillus subtilis dacB gene, encoding penicillin-binding protein 5*, is part of a three-gene operon required for proper spore cortex synthesis and spore core dehydration. J Bacteriol 177:4721–4729. 10.1128/jb.177.16.4721-4729.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Orsburn B, Sucre K, Popham DL, Melville SB. 2009. The SpmA/B and DacF proteins of Clostridium perfringens play important roles in spore heat resistance. FEMS Microbiol Lett 291:188–194. 10.1111/j.1574-6968.2008.01454.x. [DOI] [PubMed] [Google Scholar]
- 143.Brahimi S, Cadoret F, Fournier PE, Moal V, Raoult D. 2017. ‘Ndongobacter massiliensis’ gen. nov., sp. nov., a new bacterial genus isolated from a human urine sample after de novo kidney transplantation. New Microbes New Infect 16:34–36. 10.1016/j.nmni.2016.12.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Lai EM, Phadke ND, Kachman MT, Giorno R, Vazquez S, Vazquez JA, Maddock JR, Driks A. 2003. Proteomic analysis of the spore coats of Bacillus subtilis and Bacillus anthracis. J Bacteriol 185:1443–1454. 10.1128/JB.185.4.1443-1454.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Marchler GH, Song JS, Thanki N, Yamashita RA, Yang M, Zhang D, Zheng C, Lanczycki CJ, Marchler-Bauer A. 2020. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res 48:D265–D268. 10.1093/nar/gkz991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Edgar RC. 2021. MUSCLE v5 enables improved estimates of phylogenetic tree confidence by ensemble bootstrapping. bioRxiv 10.1101/2021.06.20.449169. [DOI]
- 147.Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. 2017. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods 14:587–589. 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 148.Anisimova M, Gil M, Dufayard JF, Dessimoz C, Gascuel O. 2011. Survey of branch support methods demonstrates accuracy, power, and robustness of fast likelihood-based approximation schemes. Syst Biol 60:685–699. 10.1093/sysbio/syr041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 149.Esterman ES, Wolf YI, Kogay R, Koonin EV, Zhaxybayeva O. 2021. Evolution of DNA packaging in gene transfer agents. Virus Evol 7:veab015. 10.1093/ve/veab015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Price MN, Dehal PS, Arkin AP. 2010. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS One 5:e9490. 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 151.Whelan S, Goldman N. 2001. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol 18:691–699. 10.1093/oxfordjournals.molbev.a003851. [DOI] [PubMed] [Google Scholar]
- 152.Traag BA, Pugliese A, Setlow B, Setlow P, Losick R. 2013. A conserved ClpP-like protease involved in spore outgrowth in Bacillus subtilis. Mol Microbiol 90:160–166. 10.1111/mmi.12355. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figures S1-S8 and Tables S3-S8. Download jb.00079-22-s0001.pdf, PDF file, 2.4 MB (2.4MB, pdf)
Table S1. Download jb.00079-22-s0002.xlsx, XLSX file, 0.04 MB (39.9KB, xlsx)
Table S2. Download jb.00079-22-s0003.xlsx, XLSX file, 0.4 MB (401.2KB, xlsx)