Table 2.
The performance metrics, including AUC, ACC, TPR, and TNR, for all three models
| Model | Cohort | AUC(95% CI) | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|---|
| Rad | Training | 0.828 (0.7866—0.8695) | 0.751 | 0.702 | 0.794 |
| Validation | 0.770 (0.6948—0.8449) | 0.719 | 0.712 | 0.725 | |
| External test | 0.801 (0.6599—0.9426) | 0.783 | 0.733 | 0.796 | |
| DLRresnet18 | Training | 0.837 (0.7965—0.8783) | 0.773 | 0.839 | 0.714 |
| Validation | 0.805 (0.7374—0.8729) | 0.712 | 0.849 | 0.587 | |
| External test | 0.857 (0.7592—0.9544) | 0.710 | 0.867 | 0.667 | |
| DLRresnet50 | Training | 0.847 (0.8075—0.8863) | 0.776 | 0.780 | 0.772 |
| Validation | 0.810(0.7433—0.8769) | 0.732 | 0.808 | 0.662 | |
| External test | 0.860 (0.7433—0.8769) | 0.783 | 0.800 | 0.778 |