Table 1. Non-redundant glycosylated and non-glycosylated (positive+negative) patterns at different level of similarity cut-off.
Redundancy cut-off | Number of total patterns (glycosylated plus non-glycosylated) | ||
N-linked (Positive+Negative) | O-linked (Positive+Negative) | C-linked (Positive +Negative) | |
Standard dataset | 39024 = (2604+36420) | 10403 = (451+9952) | 157 = (48+109) |
100% | 39019 = (2604+36415) | 10371 = (451+9920) | 157 = (48+109) |
90% | 35293 = (2588+32705 | 7314 = (339+6975) | 150 = (48+102) |
80% | 32245 = (2549+29696) | 5669 = (289+5380) | 116 = (32+84) |
70% | 29376 = (2506+26870) | 4566 = (258+4308) | 106 = (27+79) |
60% | 26505 = (2454+24051) | 3776 = (235+3541) | 105 = (27+78) |
50% | 23076 = (2361+20715) | 3234 = (214+3020) | 99 = (23+7) |
40% | 10102 = (1599+8503) | 2390 = (174+2216) | 90 = (16+74) |