Table 1. The Hits database at the end of October 2000.
 | SWISS-PROT | TrEMBL | TrEMBLnew | trEST | trGEN |
 | (88166) | (301497) | (102633) | (165758) | (501714) |
Prosite patterns | 36.8% | 20.4% | 27.4% | 13.5% | 8.7% |
(1304) | 74465 | 130516 | 46913 | 39834 | 95558 |
Prosite profiles | 28.0% | 19.7% | 24.3% | 18.6% | 13.9% |
(330) | 54716 | 155292 | 51573 | 64822 | 211439 |
Pfam HMMs | 56.1% | 34.4% | 51.3% | 21.9% | 13.9% |
(2216) | 94973 | 190537 | 88541 | 64928 | 161337 |
The columns are the five collections of proteins and the rows are the three collections of protein-domain predictors; the figures in parentheses give the number of entries in each collection. In each cell, the percentage is the fraction of the protein sequences with at least one match by a predictor of that collection, and the count below it is the total number of significant matches. Because the three collections of predictors are partially redundant, a protein is commonly hit by more than one predictor.
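
The two figures in each cell can be illustrated with a minimal Python sketch. It assumes matches are available as (collection, sequence, predictor) records, which is a hypothetical layout chosen for illustration and not the actual Hits schema; the function names and the toy data are likewise invented. The second function estimates the average number of matches per matched sequence from a published cell, and that estimate is only approximate because the percentages in the table are rounded.

```python
from collections import defaultdict


def summarize(matches, collection_sizes):
    """Return {collection: (coverage_percent, total_matches)}.

    matches: iterable of (collection, sequence_id, predictor_id) tuples
             (hypothetical record layout, not the Hits schema).
    collection_sizes: {collection: number of sequences in the collection}.
    """
    matched_ids = defaultdict(set)  # distinct sequences with >= 1 match
    totals = defaultdict(int)       # every significant match counts once
    for collection, sequence_id, _predictor_id in matches:
        matched_ids[collection].add(sequence_id)
        totals[collection] += 1
    return {
        name: (100.0 * len(matched_ids[name]) / size, totals[name])
        for name, size in collection_sizes.items()
    }


def matches_per_matched_sequence(coverage_percent, collection_size, total_matches):
    """Estimate the mean number of matches per sequence that has any match."""
    matched_sequences = coverage_percent / 100.0 * collection_size
    return total_matches / matched_sequences


# Toy data: collection "A" has 4 sequences, collection "B" has 5.
toy_matches = [
    ("A", "seq1", "pattern_1"),
    ("A", "seq1", "pattern_2"),  # second match on the same sequence
    ("A", "seq2", "pattern_1"),
    ("B", "seq9", "pattern_1"),
]
toy_sizes = {"A": 4, "B": 5}
summary = summarize(toy_matches, toy_sizes)
print(summary)  # {'A': (50.0, 3), 'B': (20.0, 1)}

# Prosite patterns on SWISS-PROT, using the figures from Table 1.
ratio = matches_per_matched_sequence(36.8, 88166, 74465)
print(round(ratio, 1))  # 2.3
```

In the toy data, collection "A" has three matches but only two matched sequences, which is why the percentage and the count in a cell cannot be derived from one another.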