Similarity clustering, evidence of gene expression, and overall domain organizations of Arabidopsis ARM repeat proteins. The cluster diagram on the left shows the extent of similarity between ARM repeat proteins. Higher similarity is demonstrated by shorter branch length between any two sequences. The three columns of circles indicate the presence of expression tags: C, cDNA from SIGnAL database; E, ESTs from GenBank; M, massive parallel signature sequencing (MPSS) tags from the Arabidopsis MPSS database (see “Materials and Methods”). The sequence names and their correspondence to GenBank accessions can be found in supplemental Table S-I, where the accessions for ESTs and cDNAs are also tabulated. ARM proteins with similar domain organizations are grouped together with alternate shaded boxes. The representative domain organization of each group is shown on the right. The domain names follow those in Pfam and/or SMART except the U-box N-terminal domain (UND) defined in this study. Italics, Gene names for AtPUBARM family members; arrow, divergent AtPUB-ARM members; asterisk, region where the sequence was truncated to fit the width of the graphics (no known protein domains were present in this region).