Skip to main content
. 2010 Jul;12(4):476–486. doi: 10.2353/jmoldx.2010.090197

Table 4.

Confusion Matrices and Accuracies for the Training and Independent Test Sets for the Three Second Tier Primary Site of Origin Classifiers

Adenocarcinoma
Train–Test Split
Kidney Ovary Uterus Colon Lung Prostate Breast
19 0 0 0 0 1 0
1 9 9 0 0 0 1
0 4 14 0 0 0 2
0 0 0 20 0 0 0
0 0 0 0 20 0 0
0 0 1 0 0 19 0
0 1 1 0 2 0 17
CM Accuracy = 84%
Independent Test Set
Kidney Ovary Uterus Colon Lung Prostate Breast
8 0 2 0 0 0 0
0 10 0 0 0 0 0
3 0 6 1 0 0 0
0 0 1 8 1 0 0
0 0 0 0 10 0 0
0 0 0 1 0 9 0
0 0 0 0 0 0 10
CM Accuracy = 87%
Squamous
Train–Test Split
Tongue-larynx Vulva Cervix Penis Lung Anal-rectum
24 0 1 0 0 0
0 3 0 0 0 0
2 0 6 0 1 0
4 1 0 0 0 0
1 0 1 0 16 0
0 0 0 0 0 8
CM Accuracy = 83%
Independent Test Set
Tongue-larynx Vulva Cervix Penis Lung Anal-rectum
11 0 0 0 0 0
2 0 0 0 0 0
0 0 6 0 0 0
0 0 0 1 1 0
0 0 0 0 4 0
0 0 0 0 0 4
CM Accuracy = 90%
Neuroendocrine
Train–Test Split
Small-bowel Pancreas Lung
9 0 2
0 7 0
0 0 11
CM Accuracy = 93%
Independent Test Set
Small-bowel Pancreas Lung
4 0 1
0 4 0
0 0 11
CM Accuracy = 87%

Samples from the Moffitt and expO data sets were first grouped and tumors randomly selected for both training and independent validation.