Skip to main content
. 2023 Jun 13;18:426. doi: 10.1186/s13018-023-03909-z

Table 2.

Diagnostic performance of 2D and 3D CNN models and participated reading clinicians on surgery and internal test sets

Metrics Accuracy Precision Sensitivity Specificity F1-score Youden-Index
Surgery test set
 2D CNN 0.87 (60/69) 0.75 (21/28) 0.913 (21/23) 0.848 (39/46) 0.824 0.761
 3D CNN 0.71 (49/69) 0.54 (17/31) 0.739 (17/23) 0.696 (32/46) 0.624 0.435
 Senior surgeon 1 and 2 0.891 (123/138) 0.782 (43/55) 0.935 (43/46) 0.870 (80/92) 0.852 0.805
 Junior surgeon 3 and 4 0.761 (105/138) 0.592 (42/71) 0.913 (42/46) 0.685 (63/92) 0.718 0.598
 Senior radiologist 5 and 6 0.862 (119/138) 0.729 (43/59) 0.935 (43/46) 0.826 (76/92) 0.819 0.761
 Junior radiologist 7 and 8 0.775 (107/138) 0.612 (41/67) 0.891 (41/46) 0.717 (66/92) 0.726 0.608
Internal test set
 2D CNN 0.818 (117/143) 0.72 (39/54) 0.78 (39/50) 0.839 (78/93) 0.75 0.619
 3D CNN 0.783 (112/143) 0.679 (36/53) 0.72 (36/50) 0.817 (76/93) 0.699 0.537
 Senior surgeon 1 and 2 0.857 (245/286) 0.748 (89/119) 0.89 (89/100) 0.839 (156/186) 0.813 0.729
 Junior surgeon 3 and 4 0.801 (229/286) 0.669 (85/127) 0.85 (85/100) 0.753 (140/186) 0.749 0.603
 Senior radiologist 5 and 6 0.839 (240/286) 0.725 (87/120) 0.87 (87/100) 0.823 (153/186) 0.791 0.693
 Junior radiologist 7 and 8 0.759 (217/286) 0.637 (72/113) 0.74 (74/100) 0.801 (149/186) 0.685 0.541