2024 Jan 11;110(4):1975–1982. doi: 10.1097/JS9.0000000000001067

Table 3.

Qualitative assessment of the deep learning model's performance compared with the initially masked image (original) and the ground truth (GT).

| Structure | Initially masked model: Complete | Initially masked model: Partial | Initially masked model: Absent | Ground truth: Complete | Ground truth: Partial | Ground truth: Absent | Inference of auto-segmentation model: Complete | Inference of auto-segmentation model: Partial | Inference of auto-segmentation model: Absent | Assessment for accuracy: Against original | Assessment for accuracy: Against GT | Modified assessment for accuracy: Against original | Modified assessment for accuracy: Against GT |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CBD | 22 (88%) | 2 (8%) | 1 (4%) | 25 (100%) | | | 22 (88%) | 2 (8%) | 1 (4%) | 22/22 (100%) | 22/25 (88%) | 23/23 (100%) | 23/25 (92%) |
| CHD | 22 (88%) | 3 (12%) | | 25 (100%) | | | 23 (92%) | 2 (8%) | | 23/22 (104.5%) | 23/25 (92%) | 24/23.5 (102%) | 24/25 (96%) |
| Cystic | 20 (80%) | 3 (12%) | 2 (8%) | 23 (92%) | 1 (4%) | 1 (4%) | 15 (60%) | 3 (12%) | 7 (28%) | 15/20 (75%) | 15/23 (65%) | 16.5/21.5 (76.7%) | 16.5/23.5 (70.2%) |
| GB | 20 (80%) | 4 (16%) | 1 (4%) | 24 (96%) | | 1 (4%) | 19 (76%) | 5 (20%) | 1 (4%) | 19/20 (95%) | 19/24 (79%) | 21.5/22 (97.7%) | 21.5/24 (89.6%) |
| Hilum | 23 (92%) | 1 (4%) | 1 (4%) | 25 (100%) | | | 24 (96%) | 1 (4%) | | 24/23 (104%) | 24/25 (96%) | 24.5/23.5 (104%) | 24.5/25 (98%) |
| RHD | 25 (100%) | | | 25 (100%) | | | 25 (100%) | | | 25/25 (100%) | 25/25 (100%) | 25/25 (100%) | 25/25 (100%) |
| RAHD | 20 (80%) | 2 (8%) | 3 (12%) | 25 (100%) | | | 19 (76%) | 5 (20%) | 1 (4%) | 19/20 (95%) | 19/25 (76%) | 21.5/22 (97.7%) | 21.5/25 (86%) |
| RPHD | 17 (68%) | 5 (20%) | 3 (12%) | 25 (100%) | | | 14 (56%) | 8 (32%) | 3 (12%) | 14/17 (82.3%) | 14/25 (56%) | 18/19.5 (92.3%) | 18/25 (72%) |
| 3rd order | 6 (24%) | 10 (40%) | 9 (36%) | 22 (88%) | 1 (4%) | 2 (8%) | 4 (16%) | 13 (52%) | 8 (32%) | 4/6 (66.6%) | 4/22 (18.2%) | 10.5/11 (95.5%) | 10.5/22.5 (46.7%) |
| LHD | 25 (100%) | | | 25 (100%) | | | 24 (96%) | 1 (4%) | | 24/25 (96%) | 24/25 (96%) | 24.5/25 (98%) | 24.5/25 (98%) |
| 2nd order | 15 (60%) | 5 (20%) | 5 (20%) | 21 (84%) | 1 (4%) | 3 (12%) | 15 (60%) | 5 (20%) | 5 (20%) | 15/15 (100%) | 15/21 (71.4%) | 17.5/17.5 (100%) | 17.5/21.5 (81.4%) |

CBD, common bile duct; CHD, common hepatic duct; GB, gall bladder; GT, ground truth; LHD, left hepatic duct; RAHD, right anterior hepatic duct; RHD, right hepatic duct; RPHD, right posterior hepatic duct.
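
The accuracy columns of Table 3 can be reproduced from the count columns. The sketch below does so for the cystic duct row. It rests on an assumption that this excerpt does not state: in the modified assessment, each partially visualized structure appears to be counted as 0.5 of a completely visualized one. That weighting is inferred from the table values (for example, 15 + 0.5 × 3 = 16.5), and the names `PARTIAL_WEIGHT`, `weighted_count`, and `accuracy_pct` are illustrative only.

```python
# Sketch: recompute the accuracy columns of Table 3 for one row.
# ASSUMPTION: a partial visualization is weighted 0.5 in the "modified"
# assessment. This weight is inferred from the table values and is not
# stated in the excerpt.
PARTIAL_WEIGHT = 0.5


def weighted_count(complete: int, partial: int = 0) -> float:
    """Complete count plus the (assumed) half credit for each partial."""
    return complete + PARTIAL_WEIGHT * partial


def accuracy_pct(numerator: float, denominator: float) -> float:
    """Ratio of inferred count to reference count, as a percentage."""
    return round(100.0 * numerator / denominator, 1)


# Cystic duct row: (complete, partial) counts out of 25 cases.
original = (20, 3)      # initially masked model
ground_truth = (23, 1)  # ground truth (GT)
inferred = (15, 3)      # inference of the auto-segmentation model

# Plain assessment: only completely visualized structures count.
plain_vs_original = accuracy_pct(inferred[0], original[0])
plain_vs_gt = accuracy_pct(inferred[0], ground_truth[0])

# Modified assessment: partial visualizations receive half credit.
modified_vs_original = accuracy_pct(
    weighted_count(*inferred), weighted_count(*original)
)
modified_vs_gt = accuracy_pct(
    weighted_count(*inferred), weighted_count(*ground_truth)
)

print(plain_vs_original, plain_vs_gt, modified_vs_original, modified_vs_gt)
```

Because the denominator is the reference count rather than the number of cases, a ratio can exceed 100% when the model completely visualizes more structures than the reference, as in the CHD row (23/22).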