Table 1. Distribution of sequences in clusters and singletons with their annotations.
In clusters | In singletons | |
---|---|---|
# Sequences | 28 869 663 | 3 399 026 |
From SwissProt | 519 015 | 17 478 |
From TrEMBL | 28 350 648 | 3 381 548 |
# Sequences with experimental GO annotations | 82 672 | 6 092 |
From SwissProt | 57 391 | 3 684 |
From TrEMBL | 25 281 | 2 408 |
# Sequences with GO annotations | 20 556 103 | 1 506 125 |
From SwissProt | 494 047 | 14 277 |
From TrEMBL | 20 062 056 | 1 491 848 |
# Sequences with PFAM annotation | 23 263 014 | 1 509 339 |
From SwissProt | 487 946 | 12 111 |
From TrEMBL | 22 775 068 | 1 497 228 |
# Sequences with PDB | 35 660 | 1 185 |
As defined by gene ontology Consortium, experimental GO terms are those associated to evidence codes EXP, IDA, IPI, IMP, IGI, IEP (http://geneontology.org/page/guide-go-evidence-codes)