Skip to main content
. 2025 Mar 7;25:350. doi: 10.1186/s12903-025-05730-y

Table 2.

Study descriptors including the anatomical site, imaging modality, segmentation approach and evaluation matrix

Study Subjects Part Training and evaluation data Image Modality Segmentation approach Evaluation metrics Results
Chen, Wang, et al. [4] 30 subjects, 20 females maxilla 30 scans for training, 6 for testing CBCT ML, Learning-based multi-source IntegratioN frameworK for Segmentation (LINKS) DSC The average Dice ratio of the maxilla was 0.800 ± 0.029.
Chen, Du, et al. [27] 25 subjects, 11 females, age range of 10 to 49 years teeth 20 scans for training, 5 for testing CBCT DL, multi-task 3D fully convolutional network (FCN), and marker-controlled watershed transform (MWT) DSC, Jaccard index The average Dice similarity coefficient, and Jaccard index were 0.936 (± 0.012) and 0.881 (± 0.019).
Chung et al. [28] NR teeth 150 scans for training, 25 for testing CBCT DL, single CNN, base architecture of the 3D U-net Aggregate Jaccard index (AJI), ASSD AJI score of 0.86 ± 0.01, and ASSD [mm] of 0.20 ± 0.10
Cui et al. [29] 4,215 subjects, 2312 females, mean age of 38.4 years teeth, alveolar bone 4531 images for training, 407 for testing CBCT DL Dice ratio, ASD

Average Dice scores of 94.1%, ASD (mm) of 0.17 for tooth.

Average Dice scores of 94.1 (76.9–96.9) %, ASD (mm) of 0.35 (0.18–0.84) for maxillary bone.

Average Dice scores of 94.8 (80.3–97.3) %, ASD (mm) of 0.29 (0.13–0.77) for mandible bone.

Duan et al. [30] NR teeth 20 sets of images for training and testing CBCT DL, Region Proposal Network (RPN) with Feature Pyramid Network (FPN) method DSC, ASD, RVD

They achieved an average dice of 95.7% (0.957 ± 0.005) for single-rooted tooth ST, 96.2% (0.962 ± 0.002) for multirooted tooth MT.

The ASD was 0.104 ± 0.019 for ST and 0.137 ± 0.019 for MT.

The RVD was 0.049 ± 0.017 for ST and 0.053 ± 0.010 for MT.

Hsu et al. [31] 24 subjects, 9 females, mean age of 29.1 ± 14.7 years teeth 24 sets of images for training and testing CBCT DL, 3.5D U-Net DSC The 3.5 Dv5 U-Net achieved highest DSC among all U-Nets.
Ileșan et al. [32] NR mandible 120 scans for training, 40 for validation CBCT and CT DL, CNN, a 3D U-Net DSC Mean DSC was 0.884 for teeth segmentation and 0.894 for mandible segmentation
Jaskari et al. [33] 594 subjects mandibular canal 457 scans for training, 52 for validation, 128 for testing CBCT DL, fully CNN DSC, MCD

DSC was 0.57 for the left canal and 0.58 for the right canal,

MCD was 0.61 mm for the left canal and 0.50 mm for the right canal.

Kim et al. [34] 25 subjects mandibular condyle 18 subject cases for training, 5 for validation, 2 for training CBCT DL, modified U-Net, and a CNN IoU, HD

Intersection over union (IoU) was 0.870 ± 0.023 for marrow bone and 0.734 ± 0.032 for cortical bone.

The Hausdorff distance was 0.928 ± 0.166 mm for marrow bone and 1.247 ± 0.430 mm for cortical bone.

Kwak et al. [5] 102 subjects, age range of 18–90 years mandibular canal 29, 456 images for training, 9818 for validation, 9818 for testing CBCT DL, deep CNN Global classification accuracy Global accuracy of 0.99 using a 3D U-Net CNN model for segmentation of mandibular canal
Lahoud et al. [35] 46 subjects teeth 2095 samples for training, 501 for optimization, 328 for validation CBCT DL, deep CNN IoU, The mean intersection over union IOU for full-tooth segmentation was 0.87 (60.03) and 0.88 (60.03) for semiautomated (SA)
Lee et al. [24] 102 subjects teeth 1066 images for training, 400 for validation, 151 for testing CBCT DL, CNN Dice ratio Dice value of 0.918.
Lin et al. [36] 220 subjects, 136 females, mean age of 36.93 ± 13.77 years mandibular canal 132 images for training, 44 for validation, 44 for testing CBCT DL, CNN, two-stage 3D-Unet DSC, 95% HD The mean DSC was 0.875 ± 0.045 and the mean 95% HD was 0.442 ± 0.379.
Lo Giudice et al. [37] 40 subjects, 20 females, mean age of 23.37 ± 3.34 years mandible 20 scans for training, 20 for testing CBCT DL, CNN DSC, surface-to-surface matching DSC of 0.972.
Macho et al. [38] 40 subjects teeth 36 images for training, 4 for validation CT DL, CNN employing 3D volumetric convolutions dice ratio Dice value between 0.88 and 0.94.
Miki et al. [39] 52 subjects teeth 42 subject cases for training, 10 for testing CBCT DL, deep CNN classification accuracy Classification accuracy 88.8%
Minnema et al. [20] NR teeth 20 scans for training,2 for validation, 18 for testing CBCT DL, mixed-scale dense (MS-D) CNN DSC DSC of 0.87 ± 0.06
Pankert et al. [40] 307 subjects, 112 females, mean age of 63 years mandible 248 images for testing, 30 for validation, 29 for testing CT DL, two-stepped CNN DSC, ASD, HD Dice coefficient of 94.824% and an average surface distance of 0.31 mm.
Park et al. [41] 171 subjects mandible, maxilla 146 sets for training, 10 for validation, 15 for testing CT DL, hierarchical, parallel, and multi-scale residual block to the U-Net (HPMR-U-Net) DSC, ASD, 95HD DC of 90.2 ± 19.5 for maxilla segmentation and 97.4 ± 0.4 for mandible segmentation
Qiu et al. [6] 11 subjects mandible 8 scans for training, 2 for validation, 1 for testing CT DL, CNN DSC Average dice coefficient of 0.89
Qiu et al. [42] 48 subjects mandible 52 scans for training, 8 for validation, 49 for testing CT DL, Single-planar CNN DSC, 3D surface error Average dice score of0.9328(± 0.0144), 95HD (mm) of 1.4333(± 0.5564)
Qiu, Guo, et al. [43] 48 subjects mandible 90 scans for training, 2 for validation, 17 for testing CT DL Recurrent Convolutional Neural Networks for Mandible Segmentation (RCNNSeg) DSC, ASD, 95HD Average DSC of 97.48%, ASD of 0.2170 mm, and 95HD of 2.6562 mm
Qiu, van der Wel, et al. [44] 59 subjects mandible 38 subject cases for training, 1 for validation, 20 for testing CBCT DL, SASeg DSC, ASD, 95HD Average DSC of 95.35 (± 1.54)%, ASD of0.9908 (± 0.4128) mm, and 95HD of 2.5723 (± 4.1192) mm
Qiu, van der Wel, et al. [8] 59 subjects mandible 38 scans for training, 1 for validation, 20 for testing CBCT DL, 3D CNN and recurrent SegUnet DSC, ASD, 95HD Average DSC of 95.31 (± 1.11)%, ASD of 1.2827 (± 0.2780) mm, and 95HD of 3.1258 (± 3.2311) mm
Usman et al. [45] NR mandibular canal 400 scans for training, 500 for testing CBCT DL, Attention-Based CNN model with Multi-Scale input Residual UNet (MSiR-UNet) Dice ratio, mean IoU, A dice score of 0.751, mean IoU of 0.795
Verhelst et al. [46] NR mandible 160 scans for training, 30 for testing CBCT DL, 2-stageed 3D U-Net DSC, IoU, HD, IoU of 94.6 ± 1.17%, DSC of 0.9722 ± 0.0062, HD (mm) of 4.1583 ± 2.8549,
Vinayahalingam et al. [47] 81 subjects mandibular condyles and glenoid f 130 scans for training, 24 for validation, 8 for testing CBCT DL, 3D U-Net Dice ratio, IoU,

The dice ratio was 0.976 (0.01) and IoU was 0.954 (0.02) for mandibular condyle segmentation.

The dice ratio was 0.966 (0.03) and IoU was 0.936 (0.04) for Glenoid fossa segmentation.

Wang et al. [7] 30 subjects, 19 females, mean age of 14.2 ± 3.4 years mandible, maxilla, teeth 21 scans for training, 7 for testing CBCT DL, mixed-scale dense (MS-D) CNN DSC, surface deviation Dice similarity coefficient: 0.934 ± 0.019, jaw; 0.945 ± 0.021, teeth
Wang et al. [48] 15 subjects, 9 females, mean age of 26 ± 10 years mandible, maxilla 30 scans for training, 15 scans for validation CBCT ML, patch-based sparse representation, and convex optimization DSC Dice ratio of 0.92 ± 0.02 for mandible segmentation and 0.87 ± 0.02 for maxilla segmentation.
Wang et al. [49] 30 subjects, 18 females, mean age of 24 ± 10 years mandible, maxilla 30 scans for training, 30 CBCT scans, and 60 spiral multislice CT (MSCT) scans for validation CBCT ML, prior-guided sequential random forests DSC Dice ratio of 0.94 ± 0.02 for mandible segmentation and 0.91 ± 0.03 for maxilla segmentation

DL, Deep learning, ML, Machine learning, CNN, convolutional neural network; DSC, Dice similarity Coefficient; HD, Hausdorff distance; 95HD, 95% Hausdorff distance; ASD, average surface distance; RVD, Relative Volume Difference; ASSD, Average Symmetric Surface Distance; JSC, Jaccard similarity coefficient, IoU, intersection over union; RMS, root mean square; MCD, mean curve distance; RVD, relative volume difference