Maximum likelihood trees and domain architecture diagrams for DUF179-1,3, DUF179-2, DUF3143, ARM, HugZ-1,2,3, and CLPF protein families. Inferred gene duplication events are indicated with blue circles at the corresponding node in the tree. Diagrams of domains were predicted by the NCBI conserved domain search tool CDsearch. For full species names and their lineage see legend to Figure 3. Information about the functional domains and superfamily listed in the figure (see also Table S2): Arm, Armadillo/beta-catenin-like repeat of ∼40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands; PLN03200, cellulose synthase-interactive protein; SRP1, Karyopherin (importin) alpha; DUF2470, putative heme-iron utilization family; PKc_like superfamily, there are 60 domains in this superfamily. The protein kinase superfamily is mainly composed of the catalytic domains of serine/threonine-specific and tyrosine-specific protein kinase; DUF179, superfamily consists of pfam02622 (Uncharacterized ACR), COG1678 (AlgH), and PRK00228 (YqgE/AlgH family protein); ER_PDI_fam superfamily, protein disulfide isomerase; PDI_a_family, Protein Disulfide Oxidoreductases and Other Proteins with a Thioredoxin fold; Thioredoxin_like superfamily, Protein Disulfide Oxidoreductases and Other Proteins with a Thioredoxin fold; DUF3143, Protein of unknown function 3143, pfam11341 is the only member of this superfamily; PRK14904, 16S rRNA methyltransferase B; EnvC,superfamily, Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB; F-box-Like, ∼50 amino acids long mediating protein–protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding, and translational repression; MscA, superfamily Protein-arginine kinase activator protein McsA; SirB1 superfamily, transglutaminaselike and TPR domain; Transglut_core2 superfamily, Transglutaminase-like superfamily has two domains: pfam13369 - Transglut_core2 and PRK10941 - tetratricopeptide repeat-containing protein; UVR, pfam02151 is the only member in this superfamily; yccV, domain in the small protein from E. coli YccV and its homologs in other Proteobacteria; YccV-like, superfamily has five domains pfam08755, TIGR02097, PRK14129 (HSPQ), smart00992, COG3785.