Skip to main content
. 2023 Jun 30;39(Suppl 1):i318–i325. doi: 10.1093/bioinformatics/btad208

Table 1.

The statistics of the curated protein function prediction dataset.a

Sequence identity threshold
Ontology No. of protein No. GO terms 0.3 0.5 0.9 0.95
MF 35 507 600 14 667 19 512 26 876 28 067
CC 50 340 547 20 679 26 808 36 721 38 509
BP 50 320 3774 20 180 26 647 37 536 39 348
a

The first three columns are the GO ontology category, the total number of proteins in each category and the number of GO terms in each category. The remaining four columns list the number of protein clusters at each sequence identity threshold (0.3, 0.5, 0.9, 0.95).