Table VI.
Confidence intervals on the change in segmentation metrics for the algorithm comparison. Each cell gives the lower and upper bounds of the interval. Positive changes in Dice score and negative changes in boundary distance indicate that DenseVNet performs better.
| | Spleen | Left Kidney | Gallbladder | Esophagus | Liver | Stomach | Pancreas | Duodenum |
|---|---|---|---|---|---|---|---|---|
| Dice coefficient (%) | ||||||||
| DEEDS+JLF | 0.03,0.09 | 0.01,0.03 | 0.03,0.18 | 0.04,0.12 | 0.01,0.03 | 0.02,0.08 | -0.00,0.08 | -0.01,0.09 |
| VoxResNet | 0.04,0.05 | 0.03,0.05 | 0.00,0.05 | 0.03,0.09 | 0.01,0.01 | 0.02,0.04 | 0.02,0.05 | 0.01,0.05 |
| VNet | 0.01,0.03 | 0.01,0.03 | 0.04,0.12 | 0.02,0.09 | 0.01,0.02 | 0.02,0.07 | 0.04,0.10 | 0.03,0.09 |
| Mean boundary distance (mm) | ||||||||
| DEEDS+JLF | -1.8,-0.5 | -0.5,-0.2 | -1.5,-0.1 | -0.7,-0.1 | -0.9,-0.1 | -1.3,-0.2 | -0.8,0.1 | -0.9,0.3 |
| VoxResNet | -1.1,-0.8 | -0.8,-0.5 | -0.5,-0.1 | -0.7,-0.3 | -0.5,-0.2 | -0.8,-0.3 | -0.5,-0.2 | -0.5,0.2 |
| VNet | -0.5,-0.2 | -0.4,-0.2 | -1.4,-0.4 | -0.7,-0.2 | -0.8,-0.3 | -1.7,-0.2 | -1.1,-0.4 | -1.1,-0.1 |
| 95% Hausdorff distance (mm) | ||||||||
| DEEDS+JLF | -3.5,-0.7 | -0.7,0.2 | -4.2,-1.1 | -1.6,0.0 | -1.2,0.1 | -4.9,-0.0 | -1.7,0.1 | -1.9,1.9 |
| VoxResNet | -1.7,-1.0 | -1.2,-0.5 | -1.0,0.3 | -1.0,-0.0 | -0.6,0.1 | -1.2,0.2 | -1.3,-0.1 | -1.7,2.1 |
| VNet | -1.7,-0.5 | -1.2,-0.1 | -3.4,-0.7 | -1.5,-0.3 | -1.8,-0.5 | -5.0,-0.3 | -4.1,-1.1 | -3.0,0.4 |
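
The sign convention in the caption can be applied to each cell mechanically. The following is a minimal sketch of one way to read a cell, not the analysis used to produce the table. The helper names `parse_interval` and `favours_densevnet` are hypothetical, and treating an interval that contains zero (at the displayed precision) as inconclusive is an assumption about interpretation.

```python
from typing import Optional, Tuple


def parse_interval(cell: str) -> Tuple[float, float]:
    """Split a table cell such as "-1.8,-0.5" into (lower, upper) bounds."""
    lower, upper = (float(part) for part in cell.split(","))
    return lower, upper


def favours_densevnet(cell: str, higher_is_better: bool) -> Optional[bool]:
    """Interpret one cell using the caption's sign convention.

    higher_is_better is True for Dice scores and False for boundary distances.
    Returns None when the interval contains zero (assumed inconclusive),
    True when the whole interval lies on the side that favours DenseVNet,
    and False when it lies on the other side.
    """
    lower, upper = parse_interval(cell)
    if lower <= 0.0 <= upper:
        return None  # interval contains zero, including rounded "-0.0" bounds
    interval_is_positive = lower > 0.0
    return interval_is_positive == higher_is_better


# Cells taken from the DEEDS+JLF rows of the table.
print(favours_densevnet("0.03,0.09", higher_is_better=True))    # spleen, Dice
print(favours_densevnet("-0.00,0.08", higher_is_better=True))   # pancreas, Dice
print(favours_densevnet("-1.8,-0.5", higher_is_better=False))   # spleen, mean boundary distance
```

Bounds printed as `-0.00` or `-0.0` are rounded values, so a cell that touches zero is reported as inconclusive here rather than as a difference in either direction.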