PLoS One. 2014 May 14;9(5):e97446. doi: 10.1371/journal.pone.0097446

Table 1. Number of protein sequences after removing redundant proteins at thresholds of 90%, 70% and 40% using CD-HIT.

Redundancy cutoff    Positive dataset (208)    Negative dataset (1321)
90%                  142                       949
70%                  118                       815
40%                  66                        555
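
The counts are easier to compare as the share of each dataset that survives filtering. The following is a minimal sketch of that arithmetic, not part of the original analysis. It assumes that the numbers in parentheses in the column headers (208 and 1321) are the dataset sizes before redundancy removal; the names ORIGINAL, REMAINING and retained_percent are hypothetical and chosen only for illustration.

```python
# Counts transcribed from Table 1.
# Assumption: 208 and 1321 are the dataset sizes before CD-HIT filtering.
ORIGINAL = {"positive": 208, "negative": 1321}

# Sequences remaining at each redundancy cutoff (percent).
REMAINING = {
    90: {"positive": 142, "negative": 949},
    70: {"positive": 118, "negative": 815},
    40: {"positive": 66, "negative": 555},
}


def retained_percent(cutoff, dataset):
    """Return the percentage of the assumed original dataset kept at a cutoff."""
    return 100.0 * REMAINING[cutoff][dataset] / ORIGINAL[dataset]


if __name__ == "__main__":
    # Print one row per cutoff, from the loosest filter to the strictest.
    for cutoff in sorted(REMAINING, reverse=True):
        pos = retained_percent(cutoff, "positive")
        neg = retained_percent(cutoff, "negative")
        print(f"{cutoff}%: positive {pos:.1f}% retained, negative {neg:.1f}% retained")
```

Under that assumption, the positive dataset shrinks proportionally more than the negative dataset at every cutoff.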