Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
letter
. 2019 May 10;11(7):2035–2044. doi: 10.1093/gbe/evz096

Revisiting the Evolution and Taxonomy of Clostridia, a Phylogenomic Update

Pablo Cruz-Morales 1,3, Camila A Orellana 1, George Moutafis 2, Glenn Moonen 2, Gonzalo Rincon 2, Lars K Nielsen 1, Esteban Marcellin 1,
Editor: Eric Bapteste
PMCID: PMC6656338  PMID: 31076745

Abstract

Clostridium is a large genus of obligate anaerobes belonging to the Firmicutes phylum of bacteria, most of which have a Gram-positive cell wall structure. The genus includes significant human and animal pathogens, causative of potentially deadly diseases such as tetanus and botulism. Despite their relevance and many studies suggesting that they are not a monophyletic group, the taxonomy of the group has largely been neglected. Currently, species belonging to the genus are placed in the unnatural order defined as Clostridiales, which includes the class Clostridia. Here, we used genomic data from 779 strains to study the taxonomy and evolution of the group. This analysis allowed us to 1) confirm that the group is composed of more than one genus, 2) detect major differences between pathogens classified as a single species within the group of authentic Clostridium spp. (sensu stricto), 3) identify inconsistencies between taxonomy and toxin evolution that reflect on the pervasive misclassification of strains, and 4) identify differential traits within central metabolism of members of what has been defined earlier and confirmed by us as cluster I. Our analysis shows that the current taxonomic classification of Clostridium species hinders the prediction of functions and traits, suggests a new classification for this fascinating class of bacteria, and highlights the importance of phylogenomics for taxonomic studies.

Keywords: Clostridium, phylogenomics, pangenome, taxonomy

Introduction

Clostridia are an important genus of Gram-positive, often anaerobic, rod shaped, spore-forming bacteria. The group includes important human and animal pathogens such as C. botulinum, C. tetani, and C. difficile as well as industrially relevant microorganisms such as C. acetobutylicum. The importance of the genus is reflected by the more than 42,000 entries in the PubMed database, and about 1,700 genome sequences from this group deposited in the GenBank database.

Early molecular analyses in the 1970s demonstrated considerable diversity and ambiguities among the genus (Johnson and Francis 1975). In fact, this early classification of the genus Clostridium does not respect the identity thresholds established for 16s rRNA (Rossi-Tamisier et al. 2015), a widely used taxonomic marker. In consequence, this classification has been revisited several times (Collins et al. 1994; Yutin and Galperin 2013; Lawson 2016). Currently, it is well known that there are at least three C. botulinum lineages and that C. difficile belongs to a distantly related genus leading to the recent reclassification of C. difficile as a Clostridioides difficile (Lawson et al. 2016). Furthermore, the genus Sarcina (Skerman et al. 1980) has been phylogenetically located within the “cluster I” (sensu stricto) group, which is widely accepted as the “true” Clostridium genus. The paroxysm of the conflicting organization of the Clostridium genus is the fact that the Sarcina genus was proposed before (Goodsir 1842) the Clostridium genus (Prazmowski 1880), giving priority to the name Sarcina for the whole genus. Although such change may be excessive and could cause a great deal of confusion, it highlights the need to revisit the taxonomy using modern approaches (see Lawson and Rainey 2016; Tindall 2016 for an interesting discussion on this subject).

The recent availability of sequenced genomes provides a new opportunity to revisit the clostridial taxonomy beyond 16S rRNA-based classification sequencing. Such an opportunity enables a comprehensive taxonomic and evolutionary analysis to confirm that they are not a monophyletic group and there is a need to redefine the group taxonomically.

In this work, we have compiled the genomes classified as “Clostridium” and “Clostridioides” in the GenBank database (Benson et al. 2006) to identify a set of conserved genes that were used to define taxonomy. Once the classification was established, we focused on what has been called “cluster I” species (sensu stricto) (Lawson and Rainey (2016)) to identify differences between the core/pan genomes of cluster I strains and to reveal general evolutionary trends and specific traits linked to adaptation to different lifestyles.

Materials and Methods

All genomes assemblies were downloaded from the NCBI FTP site and filtered by number of contigs (cut-off <=400), N50 (>=20,000 bases), and completeness (>=80%) based on benchmarking Universal Single-Copy Orthologs (Waterhouse et al. 2017) implemented in QUAST v5.0.2 (Gurevich et al. 2013) resulting in 779 genomes which were annotated in RAST (Aziz et al. 2008). The conserved proteins present in the selected genomes were identified using BPGA v1.3 (Chaudhari et al. 2016) with an identity cut-off of 0.4 for clustering of groups of orthologs using Usearch (Edgar 2010).

The resulting 27 groups of orthologs were aligned using Muscle v3.8 (Edgar 2004) and the alignments were manually curated and concatenated using SeaView v4 (Gouy et al. 2010). The final amino acid matrix included 12,836 amino acids. The best amino acid substitution model for each of the 27 partitions (supplementary table S1, Supplementary Material online) was selected using the ModelFinder tool implemented in IQ-tree (Kalyaanamoorthy et al. 2017) and the phylogeny was constructed using IQ-tree (Nguyen et al. 2015), using the partitioned models with 10,000 bootstrap replicates.

Pangenome analysis of cluster I subgroups was performed using BPGA following the same approach described above. Homologs of BotA and AroA were mined and retrieved from the database using BlastP (Altschul et al. 1990) with an e-value cut-off of 1E-9 and bit score of 200. Phylogenetic trees for clostridial toxins and AroA were obtained using the same approach. Synteny analysis was performed using CORASON-BGC (Cruz-Morales et al. 2017) with an e-value cut-off of 1E-9 and a bit score of 200.

The full noncollapsed aroA and species trees are available as supplementary Tree 1, Supplementary Material online, deposited at TreeBASE (Vos et al. 2012): http://purl.org/phylo/treebase/phylows/study/TB2:S23279. Last accessed May 13, 2019.

Results and Discussion

General Taxonomy

We first retrieved more than 1,700 genomes and draft genomes deposited as “Clostridium” and “Clostridioides” from the GenBank as of July 2017. The data set was filtered by removing low-quality genomes (genomes with more than 400 contigs) and by eliminating redundancy at the strain level. This filtering resulted in a subset of 779 genomes (supplementary table S1, Supplementary Material online) used hereafter. We used the taxonomic definition of clostridial “clusters” as reference and annotated those strains with a species name accordingly (Tamburini et al. 2001; Liou et al. 2005; Rainey et al. 2006; Sakuma et al. 2006; Warren et al. 2006; Slobodkina et al. 2008; Shiratori et al. 2009; Bowman et al. 2010; Jung et al. 2010).

From this subset of genomes, we calculated the core genome at different sequence identity cut-offs and selected the largest set of core proteins that could confidently be used for phylogenomic analysis (fig. 1) . Using this approach we identified 27 conserved protein sequences (supplementary table S2, Supplementary Material online) that were used for the construction of a clostridial species tree (fig. 1). This tree defined seven major clades that were consistent with the previous established clostridial “clusters” classification (Rainey et al. 2006). Accordingly, clusters III, IV, Xia, XIVa, XIVb, and XVI were distantly related to cluster I (sensu stricto), which contains 369 strains including C. butyricum, the type strain for the genus, the most toxin-producer pathogens and industrially relevant strains, but clearly excludes difficile species. Based on this analysis, we defined the members of cluster I as the authentic members of the Clostridium genus.

Fig. 1.

Fig. 1.

—Phylogenetic reconstruction of Clostridium species. Left corner: core protein families identified at different sequence identity cut-offs in all genomes and in genomes located in cluster I. The phylogeny was constructed using 27 markers conserved across 779 genomes (supplementary table S1, Supplementary Material online) deposited in the GenBank database and taxonomically defined as Clostridium. The main clades outside (blue lines) and within (369 taxa, red lines) the cluster I sensu stricto group (real clostridia) have been collapsed and defined as 17 taxonomic subgroups (table 1). Branch support is shown at each node. Uncollapsed clades for subgroups 1–17 are shown in supplementary figures S1–S14, Supplementary Material online.

Cluster I was further divided into 17 subgroups (table 1) using the species tree presented in figure 1. Our analysis also showed that strains named C. botulinum are found in subgroups 1, 5, and 8 (table 1). These clades include C. botulinum strains defined by their toxin types as A/B/F (subgroup 1), C/D/CD (subgroup 5), and E (subgroup 8). Clades 1 and 5 also include other species namely: C. sporogenes in clade 1 and C. haemolyticum and C. novyi in clade 5.

Table 1.

Subgroup Core Genome Analysis

Subgroup a Number of Strains Species Proteins Families in Subgroup Core Core of Cores b
Proteins in Core Accessory Proteins Unique Proteins c
1 106 botulinum A, B, F, sporogenes 890 205 673 2
2 21 autoethanogenum, carboxidivorans, coskatii, drakei, kluyveri, ljungdahlii, magnum, ragsdalei, scatologenes, tyrobutyricum 1,071 205 755 99
3 10 tetani 2,214 205 1,115 877
4 16 acetobutylicum, akagii, arbusti, aurantibutyricum, felsineum, pasteurianum, roseum 1,063 205 752 96
5 42 botulinum C and D, haemolyticum, novyi 1,367 205 824 324
6 11 argentinense, collagenovorans, estertheticum, proteolyticum, senegalense, sulfidigenes, tepidiprofundi, tunisiense 462 205 245 4
7 48 beijerinckii, butyricum(Clostridium type strain), puniceum, saccharobutylicum, saccharoperbutylacetonicum 1,319 205 925 178
8 20 botulinum E 994 205 769 10
9 4 paraputrificum 2,704 205 1,452 1,021
10 8 baratii, colicanis 1,350 205 1,037 94
11 15 chauvoei, disporicum, sartagoforme, saudiense, septicum 1,028 205 779 32
12 55 perfringens 2,044 205 1,276 547
13 3 (Sarcina) ventriculi 2,040 205 1,091 732
14 2 Spp. 1,505 205 1,043 242
15 1 fallax 2,415 205 1,336 854
16 3 cavendishii, intestinale 1,500 205 1,104 178
17 5 algidicarnis, cadaveris, hydrogeniformans 992 205 719 58
a

Subgroups within cluster I are defined by the species tree presented in figure 1.

b

Calculated using representative proteins for each protein family in the core of the subgroups at an identity cut-off of 0.4.

c

This category includes protein families that are only found in the core of a given subgroup, therefore, they represent subgroup-specific protein families.

Comparison of the overall synteny between C. botulinum strains from clades 1, 5, and 8 showed divergence among them. High synteny could be observed between C. botulinum strains from clade 1 and C. sporogenes as well as C. botulinum strains from clade 5 and C. novyi, respectively (supplementary fig. S15, Supplementary Material online). The fact that C. sporogenes, C. novyi, and C. haemolyticum species show little divergence with their respective C. botulinum relatives suggest that these strains are either artificially defined as distinct species or have just recently diverged. Together, these observations indicate that the strains defined as C.botulinum should be split into three species found within groups 1, 5, and 8. Clostridiumbotulinum strains in subgroups 1 and 5 may be called, C. sporogenes and C. haemolyticum, respectively, since these species have been previously defined, whereas strains within subgroup 8 may remain as members of the authentic C. botulinum species. However, as highlighted by Lawson (2016b), changing names of medically relevant organisms can cause great confusion in the healthcare community. As these three species produce botulinum neurotoxins, the change of name might be rejected under Rule 56a (5) of the International Code of Nomenclature of Prokaryotes (Parker et al. 2015), which states that “names whose application is likely to lead to accidents endangering health or life or both” can be rejected.

As this analysis uses draft genomes to include as many genomes as possible, and only 27 proteins were conserved among these genomes, the analysis was repeated using a smaller subset of high-quality genomes (179, N50 > 600 kb) to validate our results. As such, a higher number of conserved proteins (79) were obtained and used (supplementary table S3, Supplementary Material online). This new analysis (supplementary fig. S16, Supplementary Material online) showed that the taxonomic groups maintained the same distribution (tree topology) when using a data set of 179 or 779 genomes and a matrix containing 79 or 27 protein sequences respectively. The same clusters and cluster I subgroups were observed (supplementary figs. S16–S29, Supplementary Material online), with the exception of clades that disappeared as they did not pass the stringent genome quality cut-off (clusters IV and XVI, and subgroups 9, 13, 14, and 15 in cluster I).

In agreement with our species tree, a new calculation of the clostridial core genome using only the 369 genomes of cluster I strains (fig. 1) yielded a set of conserved protein families one order of magnitude larger than when using all the genomes in our database. Overall, our results demonstrate the large divergence of clusters III, IV, XIa, XIVa, XIVb, and XVI relative to cluster I and highlight the need for their reclassification into at least five new genera.

Toxin Evolution

Pathogenic clostridia produce the highest number of life-threatening toxins of any genus. This includes enterotoxins that affect the gut, such as C. difficile toxins A and B, histotoxins that affect soft tissue such as C. perfringens and C. septicum alpha-toxins, and neurotoxins affecting nervous tissue such as tetanus (C. tetani) and botulinum (C. botulinum) toxins. Diseases range from gastroenteritis to abdominal disorders, colitis, muscle necrosis, soft tissue infections, tetanus, and botulism among others (Hatheway 1990). These toxin-encoding genes are often located on mobile genetic elements or in variable regions of the chromosome (Hatheway 1990; Petit et al. 1999; Skarin and Segerman 2011), resulting in gene transfer between species. Here, we analyzed different toxins evolution to compare taxonomy with phylogeny.

The botulinum neurotoxin (BotA) for example, represents the most poisonous biological protein known and has been used as a phenotypic and genotypic marker for taxonomic classification. In fact, C. botulinum strains are often classified as members of groups A–F, in direct relationship with the production of antigenically distinguishable variants of the neurotoxin. In this work, homologs of BotA were found exclusively among members of cluster I and were distributed among C. botulinum, C. tetani, C. argentinense, C. baratii, and C. butyricum species. A phylogenetic reconstruction of these homologs (fig. 2A) showed little divergence except for three homologs: two on botulinum species and one on C. argentinense that seem to be more divergent.

Fig. 2.

Fig. 2.

—(A) Phylogenetic reconstruction of BotA toxin proteins. Six A–F clades are consistent with previous reports. Including non-botulinum strains, argentinense, tetani, butyricum, and baratti (marked with a dot). Three new sequences (in black) account for new unclassified toxin diversity. (B) Genome context of BotA homologs found in cluster I strains.

The topology of the BotA phylogeny agrees with previous definitions of the C. botulinum subgroups A–F, with clades populated by strains with similar toxin types (i.e., clade A has only C. botulinum A strains, etc.). However, toxin markers were not consistent with the species tree, for which C. botulinum toxins types A, B, and F were in clade I while grouping independently in the toxin tree.

A gene context analysis (fig. 2B) showed two major synteny groups {A, B, C, D} and {E, F}. The presence of toxin accessory proteins (Lam et al. 2017) was found to be the main difference between them, namely hemagglutinin coding genes in groups A–D, and protein p47 in groups E and F. These observations are consistent with C. botulinum strains located in subgroups 1, 5, and 8 being distinct species that acquired the toxin genes by horizontal gene transfer hindering taxonomic classification.

The analysis of other important toxins also shows many horizontal gene transfer events of toxin genes between subgroups. C. difficile toxins A and B homologs were distributed among C. difficile, C.sordellii, C. acetobutylicum, and C novyi species (supplementary fig. S30, Supplementary Material online). TpeL from C. perfringens has been previously defined as a homolog of toxins A and B from C. difficile (Amimoto et al. 2007). However, TpeL proteins are largely divergent and therefore were not included in this analysis. Homologs of C. perfringens alpha-toxin were observed in C. perfringens, C. novyi, C. botulinum C and D, C. baratii, C. hemolyticum, C. cavendishii, C. argentinense, C. sordellii, and C. dakarense species (supplementary fig. S31, Supplementary Material online). Finally, C. septicum toxin alpha homologs were distributed among C. septicum, C. novyi, C. haemolyticum, and C. botulinum C and D species (supplementary fig. S32, Supplementary Material online). A summary of these findings can be found in supplementary table S4, Supplementary Material online. Interestingly, C. botulinum C and D (subgroup 5) also have C. perfringens and C. septicum alpha-toxins orthologs, whereas C. botulinum A, B, E, and F do not. According to these observations, we suggest that toxin production should not be used to define taxonomic groups, as it uncouples taxonomy from phylogeny.

Core Genome Analysis

Once the taxonomic framework was established, we used it to study the evolutionary dynamics and to identify general differences among the subgroups within cluster I. For this purpose, we calculated core/pangenomes for each subgroup having more than ten genomes (table 1). This analysis (fig. 3) showed that subgroups 3 (C. tetani), 5 (C. botulinum toxin group C and D, C. haemolyticum, and C. novyi), 8 (C. botulinum toxin group E), and 12 (C. perfringens) have almost closed pangenomes, implying loss of genetic diversity. This observation is consistent with the evolutionary dynamic observed in pathogenic species by other authors attributed to “Specialist” species (Georgiades and Raoult 2010).

Fig. 3.

Fig. 3.

—Pangenome analysis of selected subgroups. The y axis shows the number of gene families and the x axis shows the number of genomes analyzed. The number of conserved genes was calculated by randomly adding genomes, with 20 replicates (if n > 20) or the same numbers as genomes (if n < 20). This analysis shows large differences in the genetic diversity of the subgroups, with less diversity and almost closed pangenomes in pathogenic subgroups.

In contrast, the remaining lineages showed open pangenomes. Subgroup 1, which includes important pathogens such as C. botulinum toxin groups A, B, and F, and the closely related C. sporogenes strains, has open pangenomes, implying larger genetic diversity and probably more recent adaptation to a pathogenic lifestyle. These observations further emphasize the presence of three distinct lineages among C. botulinum strains that may be reclassified as distinct species.

After establishing general differences between the evolutionary dynamics of the subgroups, we took a closer look at differences at the functional level. For this purpose, we extracted amino acid sequences of the core genes of each subgroup and identified conserved functions among them (i.e., a core of cores) and functions that are distinctive of each subgroup (table 1). This analysis revealed that cluster I has a core of 205 genes. As expected, many of these conserved genes are associated with housekeeping functions such as nucleotide biosynthesis, replication and repair (supplementary fig. S33, Supplementary Material online). Unique genes were abundantly classified as members of carbohydrate metabolism and for membrane transport. Interestingly, the largest number of accessory functions was related to amino acid metabolism, implying that multiple genes for this category are conserved at the subgroup level only. This observation is illustrated by the example described in the following section.

The Divergence of the Shikimate Pathway in Pathogenic Clostridia

To investigate adaptive traits that could define differences within each subgroup, we mined the pangenomes for functions that were uniquely found in each group. From a taxonomic point of view, unique genomic traits are important as they can be used for the development of genetic markers and to identify distinctive phenotypes that can be used for classification. The rationale for searching unique traits within subgroups was that the use of such a large genomic database would enable, for the first time, to find unique functions conserved in all members of a subgroup but absent in other subgroups, thereby enabling to dissect for subgroup-specific adaptations. This was the case for the essential enzyme 3-phosphoshikimate 1-carboxyvinyltransferase (AroA), which was found in the pangenome of cluster I. AroA is part of the shikimate pathway and is essential for the biosynthesis of aromatic amino acids phenylalanine, tyrosine, and tryptophan. This seemed unusual given that all subgroup cores include AroA.

We reasoned that the presence of AroA among the pangenome may be due to 1) divergence among AroA orthologs beyond the cut-off for orthology defined in our pangenome strategy leading to fragmentation of the gene family or 2) duplication events in certain subgroups and divergence, which have been previously linked to adaptive evolution in bacteria (Schniete et al. 2018). To explore this idea, we searched for homologs of the AroA enzyme in all the strains from cluster I and found a single ortholog conserved in most strains. Thus, we assumed that AroA has divergently evolved within the cluster I species.

Phylogenetic reconstruction of AroA (fig. 4A) confirmed the presence of two largely divergent AroA clades, one including homologs from strains in most subgroups and the other including subgroups 1, 3, and 11. Interestingly, most strains in this clade can colonize human hosts and are toxin-producing pathogens, except for C. sporogenes. Inspection of the genome context of representative AroA homologs from different subgroups (fig. 4B) revealed that despite sequence divergence, AroA homologs from C. tetani, C. botulinum toxin groups A, B and F, and C. sporogenes are located within a gene neighborhood that includes enzymes from the shikimate pathway. Thus, the gene context topology indicates that the function of these enzymes is linked to the production of aromatic amino acids. However, the divergent AroA homologs were found associated with the pyrimidine-associated regulator pyrR and a uracil permease. Such genomic organization suggests a link between aromatic amino acid biosynthesis and pyrimidine utilization. A recent in-depth molecular characterization of the C. tetani toxin production fermentation showed a potential link between extracellular uracil concentration and toxin production (Licona-Cassani et al. 2016). However, this link is yet to be fully understood.

Fig. 4.

Fig. 4.

—Phylogenomic analysis of AroA in Clostridium group I. (A) The phylogeny shows that AroA, has significantly diverged in all members of the subgroup 3 (C. tetani; clear blue), subgroup 6 (C. argentinense; yellow) and subgroup 1 (C. sporogenesC. botulinum B; red) from the rest of the subgroups in cluster I (black). The full tree is provided as supplementary Tree 2, Supplementary Material online. (B) Genome context of AroA homologs. C. autoethanogenum is shown as a typical Group I AroA genome context, whereas divergent homologs show a genome context that includes enzymes from pyrimidine metabolism.

Studies have also shown that C. sporogenes, a soil bacterium rarely pathogenic for humans (Inkster et al. 2011) although it may be found in the gut, and C. botulinum (cluster I), a toxin-producing pathogen, copiously produce tryptophan, phenylalanine and tyrosine. It has been suggested that secretion of these amino acids and intermediates of its degradation may influence intestinal permeability and systemic immunity of the host (Dodd et al. 2017). We speculate that the divergence in AroA may be related to the evolution of new metabolic interactions that do not affect the enzymatic activity of AroA, and unique regulation that occurs in clostridial species as a result of their ability to colonize hosts. Given the presence of this trait in pathogen and commensal strains, we reasoned that this trait likely evolved prior to the acquisition of toxin genes. Following the same argumentation, we suggest that C. botulinum subgroup 1 toxin groups A, B, and F have only recently evolved into pathogenic organisms.

By selecting amino acid biosynthesis to illustrate the use of the new classification, we show here that a correlation between traits, function gain and loss cannot be extracted from the current taxonomic classification of Clostridium species. Through this effort, we hope that our work serves to inspire the research community to study the evolution of clostridia at the genome-scale level and suggest a new classification for this fascinating class of bacteria.

Conclusions

Here, we present an inclusive framework for phylogenomic analysis aimed at providing an updated view of the Clostridium genus. Our work shows that the current definition of clostridia encompasses a large and diverse group of species that is inconsistent with its definition as a genus. Instead, the group includes multiple genera. Furthermore, within the group I, arguably the authentic Clostridium genus, further taxonomic inconsistencies exist due to the use of BotA for taxonomic classification as a taxonomic marker. This has previously been observed by others (Yutin and Galperin 2013; Weigand et al. 2015; Lawson and Rainey 2016; Tindall 2016; Udaondo et al. 2017) but to the best of our knowledge, clostridial taxonomy and evolution has not been revisited using the opportunity offered by next-generation sequencing for phylogenomic reclassification until now. Given the pervasiveness of the misclassification in clostridial species, we wonder whether the current system of classification should be kept, or if it should be revisited and simplified using genomic analyses or a combination of “Classic” and phylogenomic approaches. Indeed, such synergy has already been shown to be useful in a smaller scale to reclassify former members of the genus Clostridium (Gerritsen et al. 2014).

The recent explosion of available annotated genomes offers an unprecedented opportunity to answer intriguing questions surrounding pathogenic clostridial evolution. For example, the incredible diversity and the number of toxins produced by some strains are yet to be fully understood. So is the astonishing potency of some of the toxins produced by these pathogens, which must confer an evolutionary advantage that remains to be elucidated.

Supplementary Material

Supplementary data are available at Genome Biology and Evolution online.

Supplementary Material

Supplementary_Matrial_evz096

Acknowledgments

P.C.-M. thanks the Australian Department of Education and Training for the Endeavour Fellowship. The authors thank Robin Palfreyman for technical support, Cuauhtemoc Licona-Cassani and Nicolas Zaragoza for helpful discussions, and the anonymous reviewers for constructive criticism. This research was funded through an Australian Research Council Linkage grant LP150100087 with Zoetis as the Industrial Partner. Elements of this research utilized support provided by the Queensland Node of Metabolomics Australia, an initiative of the Australian Government being conducted as part of the NCRIS National Research Infrastructure for Australia.

Literature Cited

  1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ.. 1990. Basic local alignment search tool. J Mol Biol. 215(3):403–410. [DOI] [PubMed] [Google Scholar]
  2. Amimoto K, Noro T, Oishi E, Shimizu M.. 2007. A novel toxin homologous to large clostridial cytotoxins found in culture supernatant of Clostridium perfringens type C. Microbiology (Reading, Engl). 153(Pt 4):1198–1206. [DOI] [PubMed] [Google Scholar]
  3. Aziz et al 2008. The RAST server:rapid annotation using subsystems technology. BMC Genomics. 9:75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL.. 2006. GenBank. Nucleic Acids Res. 34 (Database issue):D16–D20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bowman KS, Dupré RE, Rainey FA, Moe WM.. 2010. Clostridium hydrogeniformans sp. nov. and Clostridium cavendishii sp. nov., hydrogen-producing bacteria from chlorinated solvent-contaminated groundwater. Int J Syst Evol Microbiol. 60(Pt 2):358–363. [DOI] [PubMed] [Google Scholar]
  6. Chaudhari NM, Gupta VK, Dutta C.. 2016. BPGA—an ultra-fast pan-genome analysis pipeline. Sci Rep. 6:24373.. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Collins MD, et al. 1994. The phylogeny of the genus Clostridium: proposal of five new genera and eleven new species combinations. Int J Syst Bacteriol. 44(4):812–826. [DOI] [PubMed] [Google Scholar]
  8. Cruz-Morales P, et al. 2017. Actinobacteria phylogenomics, selective isolation from an iron oligotrophic environment and siderophore functional characterization, unveil new desferrioxamine traits. FEMS Microbiol Ecol. 93(9):fix086.. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Dodd D, et al. 2017. A gut bacterial pathway metabolizes aromatic amino acids into nine circulating metabolites. Nature 551(7682):648–652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Edgar RC. 2004. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5:13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Edgar RC. 2010. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26(19):2460–2461. [DOI] [PubMed] [Google Scholar]
  12. Georgiades K, Raoult D.. 2010. Defining pathogenic bacterial species in the genomic era. Front Microbiol. 1:151.. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gerritsen J, et al. 2014. Characterization of Romboutsia ilealis gen. nov., sp. nov., isolated from the gastro-intestinal tract of a rat, and proposal for the reclassification of five closely related members of the genus Clostridium into the genera Romboutsia gen. nov., Intestinibacter gen. nov., Terrisporobacter gen. nov. and Asaccharospora gen. nov. Int J Syst Evol Microbiol. 64(Pt 5):1600–1616. [DOI] [PubMed] [Google Scholar]
  14. Goodsir J. 1842. History of a case in which a fluid periodically ejected from the stomach contained vegetable organisms of an undescribed form. Edinb Med Surg J. 57:430–443. [PMC free article] [PubMed] [Google Scholar]
  15. Gouy M, Guindon S, Gascuel O.. 2010. SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol. 27(2):221–224. [DOI] [PubMed] [Google Scholar]
  16. Gurevich A, Saveliev C, Vyahhi N, Tesler G. 2013. QUAST: aulity assessment tool for genome assemblies. Bioinformatics. 29(8):1072-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hatheway CL. 1990. Toxigenic clostridia. Clin Microbiol Rev. 3(1):66–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Inkster T, Cordina C, Siegmeth A.. 2011. Septic arthritis following anterior cruciate ligament reconstruction secondary to Clostridium sporogenes; a rare clinical pathogen. J Clin Pathol. 64(9):820–821. [DOI] [PubMed] [Google Scholar]
  19. Johnson JL, Francis BS.. 1975. Taxonomy of the Clostridia: ribosomal ribonucleic acid homologies among the species. J Gen Microbiol. 88(2):229–244. [DOI] [PubMed] [Google Scholar]
  20. Jung MY, et al. 2010. Clostridium arbusti sp. nov., an anaerobic bacterium isolated from pear orchard soil. Int J Syst Evol Microbiol. 60(Pt 9):2231–2235. [DOI] [PubMed] [Google Scholar]
  21. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS.. 2017. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 14:587–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lam KH, et al. 2017. The hypothetical protein P47 of Clostridium botulinum E1 strain Beluga has a structural topology similar to bactericidal/permeability-increasing protein. Toxicon 147:19–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lawson PA. 2016. The taxonomy of the genus Clostridium: current status and future perspectives. Conference: the 7th National Conference of Microbial Resources & the International Symposium on Microbial Systematics and Taxonomy, at Hangzhou, China. Microbiol China 43(5):1070–1074. [Google Scholar]
  24. Lawson PA, Citron DM, Tyrrell KL, Finegold SM.. 2016. Reclassification of Clostridium difficile as Clostridioides difficile (Hall and O’Toole 1935) Prévot 1938. Anaerobe 40:95–99. [DOI] [PubMed] [Google Scholar]
  25. Lawson PA, Rainey FA.. 2016. Proposal to restrict the genus Clostridium Prazmowski to Clostridium butyricum and related species. Int J Syst Evol Microbiol. 66(2):1009–1016. [DOI] [PubMed] [Google Scholar]
  26. Licona-Cassani C, et al. 2016. Tetanus toxin production is triggered by the transition from amino acid consumption to peptides. Anaerobe 41:113–124. [DOI] [PubMed] [Google Scholar]
  27. Liou JS, Balkwill DL, Drake GR, Tanner RS.. 2005. Clostridium carboxidivorans sp. nov., a solvent-producing Clostridium isolated from an agricultural settling lagoon, and reclassification of the acetogen Clostridium scatologenes strain SL1 as Clostridium drakei sp. nov. Int J Syst Evol Microbiol. 55(Pt 5):2085–2091. [DOI] [PubMed] [Google Scholar]
  28. Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ.. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum likelihood phylogenies. Mol Biol Evol. 32(1):268–274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Parker CT, Tindall BJ, Garrity GM.. 2015. International code of nomenclature of prokaryotes. Int J Syst Evol Microbiol. 69(1A):S1–S111. [DOI] [PubMed] [Google Scholar]
  30. Petit L, Gibert M, Popoff MR.. 1999. Clostridium perfringens: toxinotype and genotype. Trends Microbiol. 7(3):104–110. [DOI] [PubMed] [Google Scholar]
  31. Prazmowski A. 1880. Untersuchung über die Entwickelungsgeschichteund Fermentwirking einiger Bacterien-Arten Inaugural Dissertation. Leipzig (Germany): Hugo Voigt.
  32. Rainey F, Tanner R, Wiegel J.. 2006. Family Clostridiaceae In: The Prokaryotes: a handbook on the biology of bacteria: Vol. 4: bacteria: firmicutes, cyanobacteria. 3rd ed release 3.20. New York: Springer-Verlag; p. 654–678. [Google Scholar]
  33. Rossi-Tamisier M, Benamar S, Raoult D, Fournier PE. 2015. Cautionary tale of 16S rRNA gene sequence similarity values in identification of huma-associated-bacterial species. Int J Syst Evol Microbiol. 65(pt 6):1929–34. [DOI] [PubMed] [Google Scholar]
  34. Sakuma K, et al 2006. Clostridium glycyrrhizinilyticum sp. nov., a glycyrrhizin-hydrolysing bacterium isolated from human faeces. Microbiol Immunol. 50(7):481–5. [DOI] [PubMed] [Google Scholar]
  35. Schniete JK, et al. 2018. Expanding primary metabolism helps generate the metabolic robustness to facilitate antibiotic biosynthesis in Streptomyces. mBio 9:e02283–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Shiratori H, et al. 2009. Clostridium clariflavum sp. nov. and Clostridium caenicola sp. nov., moderately thermophilic, cellulose-/cellobiose-digesting bacteria isolated from methanogenic sludge. Int J Syst Evol Microbiol. 59(Pt 7):1764–1770. [DOI] [PubMed] [Google Scholar]
  37. Skarin H, Segerman B.. 2011. Horizontal gene transfer of toxin genes in Clostridium botulinum: involvement of mobile elements and plasmids. Mob Genet Elements 1(3):213–215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Skerman VBD, McGowan V, Sneath PHA, editors. 1980. Approved lists of bacterial names. Int J Syst Bacteriol. 30(1):225–420. [Google Scholar]
  39. Slobodkina GB, et al. 2008. Clostridium tepidiprofundi sp. nov., a moderately thermophilic bacterium from a deep-sea hydrothermal vent. Int J Syst Evol Microbiol. 58(Pt 4):852–855. [DOI] [PubMed] [Google Scholar]
  40. Tamburini E, Daly S, Steiner U, Vandini C, Mastromei G.. 2001. Clostridium felsineum and Clostridium acetobutylicum are two distinct species that are phylogenetically closely related. Int J Syst Evol Microbiol. 51(Pt 3):963–966. [DOI] [PubMed] [Google Scholar]
  41. Tindall BJ. 2016. Priority of the genus name Clostridium Prazmowski 1880 (approved lists 1980) vs Sarcina Goodsir 1842 (approved lists 1980) and the creation of the illegitimate combinations Clostridium maximum (Lindner 1888) Lawson and Rainey 2016 and Clostridium ventriculi (Goodsir 1842) Lawson and Rainey 2016 that may not be used. Int J Syst Evol Microbiol. 66:4890–4894. [DOI] [PubMed] [Google Scholar]
  42. Udaondo Z, Duque E, Ramos JL.. 2017. The pangenome of the genus Clostridium. Environ Microbiol. 19(7):2588–2603. [DOI] [PubMed] [Google Scholar]
  43. Vos RA, et al. 2012. NeXML: rich, extensible, and verifiable representation of comparative data and metadata. Syst Biol. 61(4):675–689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Warren YA, Tyrrell KL, Citron DM, Goldstein EJ.. 2006. Clostridium aldenense sp. nov. and Clostridium citroniae sp. nov. isolated from human clinical infections. J Clin Microbiol. 44(7):2416–2422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Waterhouse RM, et al. 2017. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol. doi:10.1093/molbev/msx319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Weigand MR, et al. 2015. Implications of genome-based discrimination between Clostridium botulinum group I and Clostridium sporogenes strains for bacterial taxonomy. Appl Environ Microbiol. 81(16):5420–5429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Yutin N, Galperin MY.. 2013. A genomic update on clostridial phylogeny: gram-negative spore formers and other misplaced clostridia. Environ Microbiol. 15(10):2631–2641. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary_Matrial_evz096

Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES