TABLE 3.
Model performance analysis of the models using ROC metrics
AUC (95%CI) | Accuracy | Sensitivity | Specificity | HL test | ||
---|---|---|---|---|---|---|
Radiomics | Training | 0.894 (0.837–0.951) | 0.811 (0.645–0.875) | 0.692 (0.587–0.840) | 0.926 (0.821–0.979) | 0.415 |
Test | 0.886 (0.784–0.989) | 0.841 (0.585–0.941) | 0.818 (0.640–0.964) | 0.864 (0.542–0.913) | ||
Combined | Training | 0.932 (0.889–0.976) | 0.859 (0.698–0.922) | 0.750 (0.649–0.884) | 0.963 (0.873–0.995) | 0.931 |
Test | 0.899 (0.805–0.992) | 0.863 (0.615–0.952) | 0.818 (0.640–0.964) | 0.909 (0.589–0.940) |
Notes: The best operating point of the ROC was chosen at the point, whose Youden index is maximal. (Youden index = Sensitivity+Specificity−1). Delong test between two models: p = 0.650.
Abbreviations: AUC, area under curve; HL, Hosmer–Lemeshow test.