2012 Apr 30;12(Suppl 1):S3. doi: 10.1186/1472-6947-12-S1-S3

Table 4.

Cluster quality metrics for Lingo and UTC

Metric	Lingo	UTC
Cluster purity	0.423	0.825
Pairwise cluster contamination	0.644	0.242
Within-cluster similarity	0.363	0.531

Scores for cluster purity, pairwise cluster contamination, and within-cluster similarity achieved by the clustering and cluster-labeling algorithms Lingo and UTC. Experiments used 5-fold cross-validation and were performed on 1800 clinical trial protocols containing 9 frequently occurring query words.
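For readers unfamiliar with the first metric, the following is a minimal sketch of cluster purity under its common textbook definition: the fraction of items whose reference class matches the majority class of the cluster they were assigned to. This is an assumption for illustration; the exact formulation used to produce the scores above may differ, and the function name `cluster_purity` and the toy labels are hypothetical.

```python
from collections import Counter


def cluster_purity(cluster_ids, class_labels):
    """Return purity: share of items that fall in their cluster's majority class.

    Assumes the common textbook definition; not necessarily the exact
    computation behind the scores reported in the table.
    """
    if len(cluster_ids) != len(class_labels):
        raise ValueError("cluster_ids and class_labels must have equal length")
    if len(cluster_ids) == 0:
        raise ValueError("at least one item is required")

    # Count reference classes inside each cluster.
    counts_by_cluster = {}
    for cluster, label in zip(cluster_ids, class_labels):
        counts_by_cluster.setdefault(cluster, Counter())[label] += 1

    # Sum the size of the majority class over all clusters.
    majority_total = sum(max(c.values()) for c in counts_by_cluster.values())
    return majority_total / len(cluster_ids)


# Toy example (hypothetical data): two clusters of three items each.
toy_clusters = [0, 0, 0, 1, 1, 1]
toy_labels = ["a", "a", "b", "b", "b", "c"]
toy_purity = cluster_purity(toy_clusters, toy_labels)
```

Under this definition a score of 1.0 means every cluster contains items of a single class, so higher purity is better, while for a contamination-style metric lower values are better.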