Author manuscript; available in PMC: 2009 Jan 1.
Published in final edited form as: J Proteome Res. 2007 Dec 8;7(1):113–122. doi: 10.1021/pr070361e

Table 1.

Clustering performance with different similarity thresholds. Results are shown for a single run from the human dataset (793000 spectra searched against the human IPI sequence database). For each similarity threshold we report the number of spectra or clusters searched, the number of spectra identified, the number of peptides identified, and the number of proteins identified. Each value is compared with the corresponding value from a regular non-clustered search of the same dataset, and the difference is reported as a percentage of the non-clustered value.

| Similarity threshold | Spectra/clusters searched | Spectra identified | Peptides identified | Proteins identified |
|---|---|---|---|---|
| Non-clustered | 793000 | 86682 | 21090 | 6191 |
| 0.30 | 167407 (−78.9%) | 116571 (+34.5%) | 18352 (−13.0%) | 5772 (−6.8%) |
| 0.35 | 204851 (−74.2%) | 114196 (+31.7%) | 19503 (−7.5%) | 5991 (−3.2%) |
| 0.40 | 241489 (−69.5%) | 111309 (+28.4%) | 20142 (−4.5%) | 6096 (−1.5%) |
| 0.45 | 276059 (−65.2%) | 104983 (+21.1%) | 20592 (−2.4%) | 6178 (−0.2%) |
| 0.50 | 309501 (−61.0%) | 102859 (+18.7%) | 20978 (−0.5%) | 6229 (+0.6%) |
| 0.55 | 340847 (−57.0%) | 99488 (+14.8%) | 21142 (+0.2%) | 6282 (+1.5%) |
| 0.60 | 369159 (−53.4%) | 95764 (+10.5%) | 21163 (+0.3%) | 6275 (+1.4%) |
| 0.65 | 394990 (−50.2%) | 93511 (+7.9%) | 21224 (+0.6%) | 6266 (+1.2%) |
| 0.70 | 417576 (−47.3%) | 92666 (+6.9%) | 21349 (+1.2%) | 6300 (+1.8%) |
| 0.75 | 436973 (−44.9%) | 91269 (+5.3%) | 21412 (+1.5%) | 6310 (+1.9%) |
| 0.80 | 452294 (−43.0%) | 90018 (+3.8%) | 21386 (+1.4%) | 6289 (+1.6%) |
| 0.85 | 467361 (−41.1%) | 89137 (+2.8%) | 21414 (+1.5%) | 6286 (+1.5%) |
| 0.90 | 478978 (−39.6%) | 88406 (+2.0%) | 21367 (+1.3%) | 6268 (+1.2%) |
| 0.95 | 487833 (−38.5%) | 87689 (+1.2%) | 21276 (+0.9%) | 6245 (+0.9%) |
| 1.00 (only filtering) | 493023 (−37.8%) | 87276 (+0.7%) | 21239 (+0.7%) | 6242 (+0.8%) |
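
The percentages in Table 1 appear to be relative differences from the non-clustered baseline row. As a minimal sketch of that arithmetic (the function name `percent_change` and the dictionary keys are illustrative assumptions, not taken from the paper), the 0.30 threshold row can be recomputed as follows:

```python
def percent_change(value: int, baseline: int) -> float:
    """Relative difference from the baseline, in percent, rounded to one decimal."""
    return round(100.0 * (value - baseline) / baseline, 1)


# Counts copied from Table 1: the non-clustered baseline and the 0.30 threshold row.
# The dictionary keys are hypothetical labels chosen for this sketch.
baseline = {"searched": 793000, "spectra": 86682, "peptides": 21090, "proteins": 6191}
row_030 = {"searched": 167407, "spectra": 116571, "peptides": 18352, "proteins": 5772}

# Percent change of each column relative to the non-clustered search.
changes = {key: percent_change(row_030[key], baseline[key]) for key in baseline}

for key, change in changes.items():
    print(f"{key}: {change:+.1f}%")
```

Under this assumption the recomputed values match the 0.30 row as reported: about 79% fewer searches, with roughly a third more spectra identified.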