Skip to main content
. 2022 Aug 17;50(1):67–79. doi: 10.1007/s00259-022-05927-1

Table 2.

Fully automated model performance at all levels (patient level classification through to voxel-level segmentation) calculated on the dedicated test set

Task Metric Value
Patient-level classification Accuracy (%) 94.5 (121/128)
Sensitivity (%) 93.3 (70/75)
PPV (%) 97.2 (70/72)
Specificity (%) 96.2 (51/53)
NPV (%) 91.1 (51/56)
Lesion-level detection PPV (%) 88.2 (224/254)
Sensitivity (%) 73.0 (224/307)
F1 score (%) 79.9
Lesion sub-groups detection
   Local prostate Sensitivity (%) 90.0 (36/40)
   Regional nodal Sensitivity (%) 68.3 (41/60)
   Distant nodal Sensitivity (%) 76.2 (115/151)
   Osseous Sensitivity (%) 58.2 (32/55)
   Visceral Sensitivity (%) 0 (0/1)
Voxel-level segmentation DSC (mean ± SD) 43.5 ± 21.5
Sensitivity (mean ± SD) 45.0 ± 29.2
PPV (mean ± SD) 58.5 ± 28.2