2023 Jan 7;7:4. doi: 10.1038/s41698-022-00340-1

Table 1.

Model performance metrics on blind holdout test sets.

| Dataset | Variant category | Method | AUC | MCC | TPR | TNR | PPV | NPV | Balanced accuracy | FP | TP | FN | TN | Call rate (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Blind test set: BRCA, SARC, UCEC | Overall: 30,270 somatic, 31,359 germline | TabNet | 0.942 | 0.762 | 0.931 | 0.828 | 0.839 | 0.926 | 0.879 | 5397 | 28,184 | 2086 | 25,962 | 100 |
| | | XGBoost | 0.946 | 0.757 | 0.865 | 0.892 | 0.885 | 0.873 | 0.878 | 3392 | 26,186 | 4084 | 27,967 | 100 |
| | | LightGBM | 0.949 | 0.766 | 0.874 | 0.892 | 0.886 | 0.88 | 0.883 | 3399 | 26,451 | 3819 | 27,960 | 100 |
| | | PureCN | 0.85 | 0.592 | 0.662 | 0.912 | 0.882 | 0.729 | 0.787 | 2237 | 16,792 | 8588 | 23,065 | 82.2 |
| | SNVs: 29,593 somatic, 30,122 germline | TabNet | 0.945 | 0.775 | 0.943 | 0.827 | 0.843 | 0.937 | 0.885 | 5199 | 27,918 | 1675 | 24,923 | 100 |
| | | XGBoost | 0.949 | 0.77 | 0.879 | 0.891 | 0.888 | 0.882 | 0.885 | 3286 | 26,006 | 3587 | 26,836 | 100 |
| | | LightGBM | 0.951 | 0.776 | 0.884 | 0.892 | 0.889 | 0.887 | 0.888 | 3268 | 26,159 | 3434 | 26,854 | 100 |
| | | PureCN | 0.851 | 0.593 | 0.662 | 0.913 | 0.887 | 0.724 | 0.788 | 2082 | 16,343 | 8360 | 21,983 | 81.7 |
| | Indels: 839 somatic, 1762 germline | TabNet | 0.784 | 0.433 | 0.838 | 0.624 | 0.515 | 0.89 | 0.731 | 662 | 703 | 136 | 1100 | 100 |
| | | XGBoost | 0.799 | 0.408 | 0.666 | 0.757 | 0.566 | 0.827 | 0.712 | 428 | 559 | 280 | 1334 | 100 |
| | | LightGBM | 0.815 | 0.455 | 0.765 | 0.718 | 0.564 | 0.865 | 0.742 | 497 | 642 | 197 | 1265 | 100 |
| | | PureCN | 0.827 | 0.529 | 0.606 | 0.893 | 0.756 | 0.805 | 0.749 | 132 | 410 | 267 | 1105 | 73.6 |
| Blind test set: metastatic melanoma | Overall: 15,813 somatic, 12,871 germline | TabNet | 0.852 | 0.55 | 0.797 | 0.753 | 0.799 | 0.751 | 0.775 | 3176 | 12,598 | 3215 | 9695 | 100 |
| | | XGBoost | 0.861 | 0.558 | 0.801 | 0.756 | 0.802 | 0.756 | 0.779 | 3135 | 12,670 | 3143 | 9736 | 100 |
| | | LightGBM | 0.867 | 0.57 | 0.823 | 0.744 | 0.798 | 0.774 | 0.784 | 3289 | 13,020 | 2793 | 9582 | 100 |
| | | PureCN | 0.824 | 0.52 | 0.789 | 0.732 | 0.794 | 0.726 | 0.76 | 2951 | 11,340 | 3041 | 8050 | 88.5 |
| | SNVs: 15,688 somatic, 12,412 germline | TabNet | 0.85 | 0.545 | 0.796 | 0.749 | 0.8 | 0.744 | 0.773 | 3114 | 12,490 | 3198 | 9298 | 100 |
| | | XGBoost | 0.862 | 0.565 | 0.815 | 0.748 | 0.804 | 0.762 | 0.782 | 3125 | 12,788 | 2900 | 9287 | 100 |
| | | LightGBM | 0.868 | 0.573 | 0.833 | 0.737 | 0.8 | 0.777 | 0.785 | 3266 | 13,061 | 2627 | 9146 | 100 |
| | | PureCN | 0.823 | 0.516 | 0.788 | 0.729 | 0.797 | 0.717 | 0.758 | 2856 | 11,228 | 3028 | 7686 | 88.2 |
| | Indels: 147 somatic, 611 germline | TabNet | 0.82 | 0.361 | 0.837 | 0.619 | 0.346 | 0.94 | 0.728 | 233 | 123 | 24 | 378 | 100 |
| | | XGBoost | 0.833 | 0.384 | 0.837 | 0.646 | 0.363 | 0.943 | 0.742 | 216 | 123 | 24 | 395 | 100 |
| | | LightGBM | 0.84 | 0.343 | 0.898 | 0.534 | 0.317 | 0.956 | 0.716 | 285 | 132 | 15 | 326 | 100 |
| | | PureCN | 0.887 | 0.624 | 0.888 | 0.826 | 0.581 | 0.964 | 0.857 | 80 | 111 | 14 | 379 | 77 |

AUC: area under the receiver operating characteristic curve; MCC: Matthews correlation coefficient; TPR: true positive rate; TNR: true negative rate; PPV: positive predictive value; NPV: negative predictive value; TP: true positives (somatic mutations correctly classified as somatic); FP: false positives (rare germline variants misclassified as somatic mutations); FN: false negatives (somatic mutations misclassified as germline variants); TN: true negatives (rare germline variants correctly classified as germline). Bold values indicate the best performance for a given metric, variant category, and test set.
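The rate metrics in the table follow from the four confusion counts by their standard definitions. The sketch below is illustrative only and is not the authors' evaluation code; the function name `confusion_metrics` and the dictionary keys are hypothetical. It uses the TabNet counts from the overall BRCA/SARC/UCEC row. AUC is omitted because it depends on per-variant scores, not on the counts alone.

```python
from math import sqrt


def confusion_metrics(tp: int, fp: int, fn: int, tn: int) -> dict[str, float]:
    """Derive rate metrics from confusion counts (somatic = positive class)."""
    tpr = tp / (tp + fn)  # true positive rate (sensitivity)
    tnr = tn / (tn + fp)  # true negative rate (specificity)
    ppv = tp / (tp + fp)  # positive predictive value
    npv = tn / (tn + fn)  # negative predictive value
    # Matthews correlation coefficient, standard binary form
    mcc = (tp * tn - fp * fn) / sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return {
        "TPR": tpr,
        "TNR": tnr,
        "PPV": ppv,
        "NPV": npv,
        "balanced_accuracy": (tpr + tnr) / 2,
        "MCC": mcc,
    }


# Counts taken from the TabNet row, overall BRCA/SARC/UCEC blind test set.
tabnet_overall = confusion_metrics(tp=28184, fp=5397, fn=2086, tn=25962)
for name, value in tabnet_overall.items():
    print(f"{name}: {value:.3f}")
```

Rounded to three decimals, these values agree with the TabNet entries reported for that row (TPR 0.931, TNR 0.828, PPV 0.839, NPV 0.926, balanced accuracy 0.879, MCC 0.762).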

Benchmark accuracy metrics for tumor-only somatic vs germline classification by TabNet, XGBoost, LightGBM, and PureCN on blind test datasets. Overall performance considers all single-nucleotide variants (SNVs) and indels.

Call rate: percentage of total coding variants classified.
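As a rough check on this definition, the call rate can be recomputed from the table by dividing the number of classified variants (TP + FP + FN + TN) by the size of the test set. The sketch below assumes the denominator is the sum of the somatic and germline counts listed in the variant category column; that assumption, and the helper name `call_rate`, are not stated in the source.

```python
def call_rate(tp: int, fp: int, fn: int, tn: int,
              n_somatic: int, n_germline: int) -> float:
    """Percentage of test-set variants that received a classification.

    Assumes the total is n_somatic + n_germline, as listed in the
    variant category column of the table.
    """
    classified = tp + fp + fn + tn
    return 100.0 * classified / (n_somatic + n_germline)


# PureCN, overall BRCA/SARC/UCEC blind test set.
purecn_overall = call_rate(
    tp=16792, fp=2237, fn=8588, tn=23065,
    n_somatic=30270, n_germline=31359,
)
print(f"PureCN overall call rate: {purecn_overall:.1f}%")
```

Under this assumption the result rounds to the 82.2 reported for PureCN on that row, while the three machine-learning models classify every variant and so reach 100.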