eLife. 2021 Jun 11;10:e64325. doi: 10.7554/eLife.64325

Table 2. Major clade-specific gene clusters identified by Pangenome-Wide Association Study (pan-GWAS).

| Protein | Gene | Clade specificity | Functional insights |
| --- | --- | --- | --- |
| Ethanolamine kinase | ETNK, EKI | Unique to C-III and is in addition to the highly conserved eut cluster found in all lineages. Has a unique composition and includes six additional genes that are not present in the traditional CD630 eut operon or any other non-C-III strains. | An alternative process for the breakdown of ethanolamine and its utilisation as a source of reduced nitrogen and carbon. |
| Agmatinase | speB | | |
| 1-propanol dehydrogenase | pduQ | | |
| Ethanolamine utilisation protein EutS | eutS | | |
| Ethanolamine utilisation protein EutP | eutP | | |
| Ethanolamine ammonia-lyase large subunit | eutB | | |
| Ethanolamine ammonia-lyase small subunit | eutC | | |
| Ethanolamine utilisation protein EutL | eutL | | |
| Ethanolamine utilisation protein EutM | eutM | | |
| Acetaldehyde dehydrogenase | E1.2.1.10 | | |
| Putative phosphotransacetylase | K15024 | | |
| Ethanolamine utilisation protein EutN | eutN | | |
| Ethanolamine utilisation protein EutQ | eutQ | | |
| TfoX/Sxy family protein | - | | |
| Iron complex transport system permease protein | ABC.FEV.P | Unique to C-III. | Multicomponent transport system with specificity for chelating heavy metal ions. |
| Iron complex transport system ATP-binding protein | ABC.FEV.A | | |
| Iron complex transport system substrate-binding protein | ABC.FEV.S | | |
| Hydrogenase nickel incorporation protein HypB | hypB | | |
| Putative ABC transport system ATP-binding protein | yxdL | | |
| Class I SAM-dependent methyltransferase | - | | |
| Peptide/nickel transport system substrate-binding protein | ABC.PE.S | | |
| Peptide/nickel transport system permease protein | ABC.PE.P | | |
| Peptide/nickel transport system permease protein | ABC.PE.P1 | | |
| Peptide/nickel transport system ATP-binding protein | ddpD | | |
| Oligopeptide transport system ATP-binding protein | oppF | | |
| Class I SAM-dependent methyltransferase | - | | |
| Heterodisulfide reductase subunit D (EC:1.8.98.1) | hdrD | Unique to C-III and is in addition to the highly conserved spermidine uptake cluster found in all other lineages. | Alternative spermidine uptake processes that may play a role in stress response to nutrient limitation. The additional cluster has homologs in Romboutsia, Paraclostridium, and Paeniclostridium spp. |
| CDP-L-myo-inositol myo-inositolphosphotransferase | dipps | | |
| Spermidine/putrescine transport system substrate-binding protein | ABC.SP.S | | |
| Spermidine/putrescine transport system permease protein | ABC.SP.P1 | | |
| Spermidine/putrescine transport system permease protein | ABC.SP.P | | |
| Spermidine/putrescine transport system ATP-binding protein | potA | | |
| Sigma-54-dependent transcriptional regulator | gfrR | Present in all lineages except C-I. Cluster found in a different genomic position in C-III. | Mannose-type PTS system essential for utilisation of fructosamines such as fructoselysine and glucoselysine, abundant components of rotting fruit and vegetable matter. |
| Fructoselysine/glucoselysine PTS system EIIB component | gfrB | | |
| Mannose PTS system EIIA component | manXa | | |
| Fructoselysine/glucoselysine PTS system EIIC component | gfrC | | |
| Fructoselysine/glucoselysine PTS system EIID component | gfrD | | |
| SIS domain-containing protein | - | | |
| Fur family transcriptional regulator, ferric uptake regulator | furB | Unique to C-II and C5. | Associated with EDTA resistance in E. coli, helping the bacteria survive in Zn-depleted environment. |
| Zinc transport system substrate-binding protein | znuA | | |
| Fe-S-binding protein | yeiR | | |
| Rrf2 family transcriptional regulator | - | | |
| Putative signalling protein | - | Unique to C-I and C5 STs 163, 280, and 386. | In E. coli, AbgAB proteins enable uptake and cleavage of the folate catabolite p-aminobenzoyl-glutamate, allowing the bacterium to survive on exogenous sources of folic acid. |
| Aminobenzoyl-glutamate utilisation protein B | abgB | | |
| MarR family transcriptional regulator | - | | |
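
The "Clade specificity" column describes genes that are present in one lineage and absent from the others. As a rough illustration of that idea only, the Python sketch below applies a strict set-based check to a gene presence/absence map. It is not the pan-GWAS procedure used in the study, which is not described in this table. The function name `clade_specific_genes`, the genome identifiers, the `shared_gene` placeholder, and the toy gene assignments are all hypothetical and are not taken from the study's data.

```python
def clade_specific_genes(presence, clade_of, target_clade):
    """Return genes found in every genome of target_clade and in no genome outside it.

    presence: dict mapping genome ID -> set of gene names present in that genome
    clade_of: dict mapping genome ID -> clade label
    """
    inside = [g for g, c in clade_of.items() if c == target_clade]
    outside = [g for g, c in clade_of.items() if c != target_clade]
    if not inside:
        return []
    # Genes shared by all genomes of the target clade.
    core_inside = set.intersection(*(presence[g] for g in inside))
    # Genes seen in at least one genome outside the target clade.
    seen_outside = set().union(*(presence[g] for g in outside))
    return sorted(core_inside - seen_outside)


# Toy input (hypothetical genomes and assignments, for illustration only).
presence = {
    "genome_A": {"shared_gene", "pduQ", "speB"},
    "genome_B": {"shared_gene", "pduQ", "speB"},
    "genome_C": {"shared_gene"},
    "genome_D": {"shared_gene", "znuA"},
}
clade_of = {
    "genome_A": "C-III",
    "genome_B": "C-III",
    "genome_C": "C1",
    "genome_D": "C-II",
}

print(clade_specific_genes(presence, clade_of, "C-III"))  # ['pduQ', 'speB']
```

A real analysis would normally relax the all-or-nothing rule and use a statistical association test across many genomes, since assembly gaps and annotation differences make strict presence/absence calls unreliable.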