Table 3.
| Method | Set | SEN (%) | SPE (%) | PPV (%) | NPV (%) | AUC | ACC | P |
|---|---|---|---|---|---|---|---|---|
| MSKCC nomogram | previous research [17] | 93.1 | – | – | – | 0.78 | – | – |
| Tenon score | previous research [17,40] | 92.1 | 70.1 | 54.7 | 95.8 | 0.81 | – | – |
| DLR model proposed | Training set | 100 (97.4–100) | 47.3 (42.0–52.8) | 63.6 (58.3–69.5) | 100 (98.5–100) | 0.909 (0.864–0.953) | 72.6 (65.3–79.0) | – |
| Test set | 98.4 (95.6–99.9) | 39.3 (32.5–46.4) | 78.5 (71.3–85.6) | 91.7 (88.8–97.9) | 0.812 (0.740–0.884) | 80.2 (73.7–85.7) | – | |
| DLR model proposed | Trainingset (32) | 100 (91.2–100) | 50.0 (39.2–61.8) | 66.7 (50.3–75.0) | 100 (88.5–100) | 0.914 (0.787–1) | 75.0 (56.7–88.5) | <0.001 |
| MSKCC nomogram | 25.0 (12.4–35.8) | 100 (95.5–100) | 100 (96.1–100) | 57.1 (50.1–64.2) | 0.822 (0.675–0.970) | 62.5 (43.7–78.9) | ||
| DLR model proposed | Test set (57) | 95.8 (91.1–99.3) | 33.3 (27.8–39.5) | 51.1 (42.8–57.6) | 91.7 (87.2–97.8) | 0.846 (0.742–0.950) | 59.7 (45.8–72.4) | <0.001 |
| MSKCC nomogram | 41.7 (35.9–46.4) | 84.9 (79.9–91.5) | 66.7 (59.1–72.8) | 66.7 (61.0–72.3) | 0.742 (0.614–0.870) | 66.7 (52.9–78.6) |
Abbreviations: MSKCC, Memorial Sloan-Kettering Cancer Centre; Tenon score, a scoring system; AUC, area under the receiver operating characteristic curve; ACC, accuracy; SEN, sensitivity; SPE, specificity; PPV, positive predictive value; NPV, negative predictive value; DLR model, deep learning radiomics model;.
P represents the sensitivity difference between MSKCC nomogram and DLR model (McNemar's test).
Data in parentheses are the 95% confidence interval.