Table 2.
Dataset | MFO | MFOa | BPO | BPOa | CCO | CCOa | All-GO | All-GOa | Pfam | Pfam and All-GO | bPDB |
---|---|---|---|---|---|---|---|---|---|---|---|
Cluster (25 989)c,d | |||||||||||
Sequences | 12 755 | 9147 | 13 611 | 11 323 | 13 749 | 11 480 | 15 500 | 12 918 | 15 488 | 16 675 | 7284 |
Clusters | 6491 | 3835 | 7064 | 5344 | 7155 | 5496 | 8597 | 6523 | 8598 | 9552 | 3421 |
Terms | 3902 | 3215 | 12 520 | 12 020 | 1517 | 1370 | 17 939 | 16 605 | 3962 | 21 901 | – |
Singleton (217)c,d | |||||||||||
Sequences | 121 | 8 | 131 | 9 | 167 | 9 | 179 | 9 | 133 | 181 | 0 |
Terms | 132 | 18 | 222 | 19 | 79 | 9 | 433 | 46 | 118 | 551 | – |
Total (26 206)d | |||||||||||
Sequences | 12 876 | 9155 | 13 742 | 11 332 | 13 916 | 11 489 | 15 679 | 12 927 | 15 621 | 16 856 | 7284 |
Terms | 3904 | 3218 | 12 521 | 12 020 | 1517 | 1370 | 17 942 | 16 608 | 3968 | 21 910 | – |
aTerms that are statistically validated and have an experimental evidence code with the corresponding number of sequences that inherit them in a given number of clusters.
bPig protein sequences in clusters that inherit a structure.
cNumbering considers only unique GO terms and Pfam domains.
dClusters are generated as described in the SUS-BAR section. Singletons are pig sequences that do not belong to clusters and carry along only their original UniProtKB or Ensembl annotation.