Clustering of the strains based on the distribution of all the Cluster of Orthologous Groups
categories. J, translation, ribosomal structure and biogenesis; K, transcription; L, replication,
recombination and repair; B, chromatin structure and dynamics; D, cell cycle control, cell division,
chromosome partitioning; O, post-translational modification, protein turnover, chaperones; M, cell
wall/membrane/envelope biogenesis; N, cell motility and secretion; P, inorganic ion transport and
metabolism; T, signal transduction mechanisms; U, intracellular trafficking, secretion and vesicular
transport; V, defence mechanisms; W, extracellular structures; Z, cytoskeleton; C, energy production
and conversion; G, carbohydrate transport and metabolism; E, amino acid transport and metabolism; F,
nucleotide transport and metabolism; H, coenzyme transport and metabolism; I, lipid transport and
metabolism; Q, secondary metabolite biosynthesis, transport and catabolism; R, general function
prediction only; S, function unknown. Colours depend on the number of proteins implied in each
category for each strain. The scale is presented in the figure.