Table 3. The performance of deep learning-based method.
Cross-validation | Data | AUC | P value | ACC | SEN | SPE | PPV | NPV |
---|---|---|---|---|---|---|---|---|
Fold 1 | GTV | 0.866 (0.822–0.907) | 0.237 | 0.727 (0.674–0.783) | 0.553 (0.469–0.633) | 0.937 (0.897–0.977) | 0.913 (0.862–0.969) | 0.634 (0.565–0.701) |
GPTV | 0.898 (0.86–0.938) | – | 0.857 (0.815–0.902) | 0.844 (0.792–0.902) | 0.873 (0.814–0.946) | 0.890 (0.837–0.956) | 0.821 (0.756–0.866) | |
Fold 2 | GTV | 0.931 (0.902–0.958) | 0.100 | 0.856 (0.815–0.891) | 0.829 (0.774–0.894) | 0.889 (0.833–0.951) | 0.900 (0.854–0.956) | 0.812 (0.744–0.881) |
GPTV | 0.972 (0.954–0.991) | – | 0.914 (0.88–0.946) | 0.896 (0.849–0.942) | 0.937 (0.897–0.937) | 0.945 (0.913–0.98) | 0.881 (0.826–0.936) | |
Fold 3 | GTV | 0.942 (0.917–0.968) | 0.014* | 0.871 (0.826–0.913) | 0.855 (0.8– 0.917) | 0.889 (0.833–0.950) | 0.903 (0.857–0.956) | 0.836 (0.773–0.900) |
GPTV | 0.990 (0.982–0.997) | – | 0.914 (0.880–0.946) | 0.896 (0.851–0.942) | 0.937 (0.897–0.977) | 0.945 (0.911–0.98) | 0.881 (0.829–0.936) | |
Fold 4 | GTV | 0.931 (0.903–0.961) | 0.400 | 0.871 (0.837–0.913) | 0.882 (0.837– 0.939) | 0.857 (0.795–0.923) | 0.882 (0.830–0.938) | 0.857 (0.800–0.923) |
GPTV | 0.956 (0.932–0.985) | – | 0.936 (0.913–0.967) | 0.948 (0.917–0.981) | 0.921 (0.875–0.976) | 0.936 (0.900–0.980) | 0.935 (0.895–0.977) | |
Fold 5 | GTV | 0.937 (0.910–0.966) | 0.349 | 0.871 (0.837–0.913) | 0.882 (0.836–0.938) | 0.857 (0.800–0.923) | 0.882 (0.837–0.936) | 0.857 (0.795–0.925) |
GPTV | 0.961 (0.94–0.986) | – | 0.900 (0.870–0.935) | 0.883 (0.836–0.938) | 0.921 (0.875–0.975) | 0.932 (0.894–0.979) | 0.866 (0.810–0.930) | |
Mean | GTV | 0.921 (0.896–0.937) | 0.003* | 0.839 (0.812–0.868) | 0.800 (0.759–0.843) | 0.899 (0.851–0.924) | 0.897 (0.863–0.927) | 0.787 (0.745–0.832) |
GPTV | 0.955 (0.939–0.971) | – | 0.904 (0.881–0.927) | 0.893 (0.861–0.925) | 0.917 (0.884–0.947) | 0.929 (0.901–0.955) | 0.876 (0.841–0.912) |
P value was derived from the DeLong test of comparing AUCs between GTV and GPTV. Statistics in the brackets showed 95% confidence intervals (CIs). *, denotes P<0.05. Fold 1–5, 5-fold cross-validation; GTV, gross tumor volume; GPTV, tumor incorporating peritumoral region; AUC, area under the curve; ACC, accuracy; SEN, sensitivity; SPE, specificity; PPV, positive predictive value; NPV, negative predictive value.