Table 2. Structural diversity among proteins identified and aligned by MAPGAPS.
Superfamily | structures* | RMSD† (Å) | Domain length‡ | Resolution (Å) | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
% ID | No. | Avg | Min | Max | S.D. | MSA | Avg | S.D. | Avg | Max | |
GNAT | 27 | 16 | 3.25 | 1.0 | 6.7 | 1.4 | 125 | 139.8 | 17.0 | 1.94 | 2.61 |
GTPases | 30 | 20 | 3.96 | 0.6 | 14.7 | 3.5 | 164 | 195.9 | 41.6 | 2.31 | 3.10 |
Helicases | 40 | 12 | 6.39 | 2.6 | 9.8 | 1.8 | 466 | 482.8 | 60.7 | 2.86 | 3.56 |
EEP | 40 | 16 | 3.02 | 0.8 | 5.2 | 0.95 | 241 | 259.0 | 27.6 | 2.07 | 2.99 |
UDG/TDG | 40 | 8 | 2.54 | 1.1 | 3.6 | 0.69 | 125 | 135.9 | 12.7 | 1.83 | 2.58 |
*NMR and poor resolution structures were not used; no two proteins in each set contained more than the indicated level of percent sequence identity (% ID); pdb identifies for these are given in supplementary file 1.
†RMSDs were computed using MUSTANG (Konagurthu et al., 2006) with default parameters; the structural coordinates used for the analysis were limited to the domain of interest.
‡The number of aligned columns in the MSA, and the average length and standard deviation of the domain ‘footprint’.