Skip to main content
. 2018 Jan 16;7:e29880. doi: 10.7554/eLife.29880

Table 2. Structural diversity among proteins identified and aligned by MAPGAPS.

Superfamily structures* RMSD (Å) Domain length Resolution (Å)
% ID No. Avg Min Max S.D. MSA Avg S.D. Avg Max
GNAT 27 16 3.25 1.0 6.7 1.4 125 139.8 17.0 1.94 2.61
GTPases 30 20 3.96 0.6 14.7 3.5 164 195.9 41.6 2.31 3.10
Helicases 40 12 6.39 2.6 9.8 1.8 466 482.8 60.7 2.86 3.56
EEP 40 16 3.02 0.8 5.2 0.95 241 259.0 27.6 2.07 2.99
UDG/TDG 40 8 2.54 1.1 3.6 0.69 125 135.9 12.7 1.83 2.58

*NMR and poor resolution structures were not used; no two proteins in each set contained more than the indicated level of percent sequence identity (% ID); pdb identifies for these are given in supplementary file 1.

RMSDs were computed using MUSTANG (Konagurthu et al., 2006) with default parameters; the structural coordinates used for the analysis were limited to the domain of interest.

The number of aligned columns in the MSA, and the average length and standard deviation of the domain ‘footprint’.