Skip to main content

View full-text article in PMC

. 2010 Jul;12(4):476–486. doi: 10.2353/jmoldx.2010.090197

Table 4.

Confusion Matrices and Accuracies for the Training and Independent Test Sets for the Three Second Tier Primary Site of Origin Classifiers

Adenocarcinoma
Train–Test Split
Kidney	Ovary	Uterus	Colon	Lung	Prostate	Breast
19	0	0	0	0	1	0
1	9	9	0	0	0	1
0	4	14	0	0	0	2
0	0	0	20	0	0	0
0	0	0	0	20	0	0
0	0	1	0	0	19	0
0	1	1	0	2	0	17
CM Accuracy = 84%

Independent Test Set
Kidney	Ovary	Uterus	Colon	Lung	Prostate	Breast
8	0	2	0	0	0	0
0	10	0	0	0	0	0
3	0	6	1	0	0	0
0	0	1	8	1	0	0
0	0	0	0	10	0	0
0	0	0	1	0	9	0
0	0	0	0	0	0	10
CM Accuracy = 87%

Squamous
Train–Test Split
Tongue-larynx	Vulva	Cervix	Penis	Lung	Anal-rectum
24	0	1	0	0	0
0	3	0	0	0	0
2	0	6	0	1	0
4	1	0	0	0	0
1	0	1	0	16	0
0	0	0	0	0	8
CM Accuracy = 83%

Independent Test Set
Tongue-larynx	Vulva	Cervix	Penis	Lung	Anal-rectum
11	0	0	0	0	0
2	0	0	0	0	0
0	0	6	0	0	0
0	0	0	1	1	0
0	0	0	0	4	0
0	0	0	0	0	4
CM Accuracy = 90%

Neuroendocrine
Train–Test Split
Small-bowel	Pancreas	Lung
9	0	2
0	7	0
0	0	11
CM Accuracy = 93%

Independent Test Set
Small-bowel	Pancreas	Lung
4	0	1
0	4	0
0	0	11
CM Accuracy = 87%

Samples from the Moffitt and expO data sets were first grouped and tumors randomly selected for both training and independent validation.