Table 1.
The content of the datasets and the query lists used for PRIDE testing
Dataset | Number of domains in the dataset | Number of histograms used for the domain structure representation | Number of domains in the query list | ||||||
E* | D** | Total | |||||||
α | β | α/β | α | β | α/β | ||||
1 | 29 098 | > 30 | 24 | 25 | 25 | 25 | 25 | 25 | 149 |
2 | 4 937 | 10 – 30 | 6 | 6 | 6 | 8 | 8 | 8 | 42 |
*E corresponds to the "easy" cases when the queries belong to highly populated groups of investigated datasets containing at least 50 domains at the homologous superfamily classification level of CATH;
**D corresponds to the "difficult cases" when queries belonged to small groups having no more than 3 domains at the homologous superfamily classification level of CATH