Skip to main content
. Author manuscript; available in PMC: 2009 Jan 1.
Published in final edited form as: J Proteome Res. 2007 Dec 8;7(1):113–122. doi: 10.1021/pr070361e

Table 5.

Distributions of cluster sizes. Results of clustering 11.4 million spectra from 14 runs of the Human dataset. The left side holds the cluster size distribution for 5.6 million spectra that passed spectral quality filtration and were grouped into 1.85 million clusters. The middle holds the distribution for the subset of clusters that were identified in the database search. The right hand side holds the distribution of “perfect” clustering, in which all the spectra belonging to a single peptide are grouped into a single cluster.

Clust. size All Clusters Identified Clusters “Perfect” Clusters
#Clust. (%) #Spec. (%) #Clust. (%) #Spec. (%) #Clust. (%) #Spec. (%)
1 1275893 68.9% 1275893 22.7% 143487 53.6% 143487 8.9% 15830 24.6% 15830 1.0%
2 235387 12.7% 470774 8.4% 30993 11.6% 61986 3.8% 8027 12.5% 16054 1.0%
3–5 156917 8.5% 558491 10.0% 31008 11.6% 113935 7.1% 12474 19.4% 47641 3.0%
6–10 58085 3.1% 447501 8.0% 17902 6.7% 138702 8.6% 9447 14.7% 72646 4.5%
11–15 38758 2.1% 506373 9.0% 12484 4.7% 163030 10.1% 4905 7.6% 62629 3.9%
16–25 75335 4.1% 1290671 23.0% 26742 10.0% 461884 28.7% 5169 8.0% 101827 6.4%
26–50 6065 0.32% 210775 3.8% 2636 1.0% 91294 5.7% 4169 6.5% 146599 9.2%
51–100 2590 0.14% 179410 3.2% 1203 0.4% 82743 5.1% 2074 3.2% 143300 9.0%
101–500 1689 0.09% 332853 5.9% 832 0.3% 166587 10.3% 1820 2.8% 375502 23.5%
500+ 373 0.02% 335727 6.0% 205 0.1% 187019 11.6% 403 0.6% 617369 38.6%
Total 1851092 5608468 267492 1610667 64318 1599397