Skip to main content
. 2023 Oct 18;11(20):2760. doi: 10.3390/healthcare11202760

Table 4.

The application of AI in treatment planning.

Author (Year) Data Type Dataset Size
(Training/Test)
Algorithms Purpose Performance
Xie et al. (2010) [121] Cephalometric variables, cast measurement. 180/20 ANN To predict tooth extraction diagnosis. Accuracy: 80%.
Jung et al. (2016) [122] Cephalometric variables, dental variable, profile variables, and chief complaint for protrusion. 64/60
(additional 32 samples than validation set)
ANN To predict tooth extraction diagnosis, and extraction patterns. Success rate:
Tooth extraction diagnosis: 93%.
Extraction patterns: 84%.
Li et al. (2019) [123] Demographic data, cephalometric data, dental data, and soft tissue data. A total of 302 samples MLP (ANN) To predict tooth extraction diagnosis, extraction patterns and anchorage patterns. Accuracy:
For extraction diagnosis: 94%.
For extraction patterns: 84.2%.
For anchorage patterns: 92.8%.
Suhail et al. (2020) [124] Diagnosis, feature identification of photos, models and X-rays. A total of 287 samples ANN, LR, RF To predict tooth extraction diagnosis, and extraction patterns. For extraction diagnosis: LR outperformed the ANN.
For extraction patterns: RF outperformed ANN.
Etemad et al. (2021) [125] Demographic data, cephalometric data, dental data, and soft tissue data. A total of 838 samples RF, MLP (ANN) To predict tooth extraction diagnosis. Accuracy of RF with 22/117/all inputs: 0.75/0.76/0.75.
Accuracy of MLP with 22/117/all inputs: 0.79/0.75/0.79.
Shojaei et al. (2022) [126] Medical records, extra and intra oral photos, dental model records, and radiographic images. A total of 126 samples LR, SVM, DT, RF, Gaussian NB, KNN Classifier, ANN To predict tooth extraction diagnosis, extraction patterns and anchorage patterns. Accuracy for extraction decision:
ANN: 93%, LR:86%, SVM:83%, DT: 76%, RF: 83%, Gaussian NB: 72%, KNN Classifier: 72%.
Accuracy for extraction pattern: ANN: 89%, RF:40%.
Accuracy for extraction and anchorage pattern: ANN: 81%, RF:23%.
Real et al. (2022) [127] Sex, model variables, cephalometric variables, outcome variables. -/214 Commercially available software (Auto-WEKA) To predict tooth extraction diagnosis. Accuracy:
93.9%: input model and cephalometric data.
87.4%: input only model data.
72.7%: input only cephalometric data.
Leavitt et al.
(2023) [128]
Cephalometric variables, dental variables, demographic characteristics. 256/110 RF, LR, SVM To predict tooth extraction patterns. Overall accuracy:
RF: 54.55%, SVM: 52.73%, LR: 49.09%.
Ryu et al. (2023) [78] Intraoral photographs, extraction decision. 2736/400 ResNet (ResNet50, ResNet101), VggNet (VGG16, and VGG19) To predict tooth extraction diagnosis. Accuracy:
Maxilla: VGG19 (0.922) > ResNet101 (0.915) > VGG16 (0.910) > ResNet50 (0.909).
Mandible: VGG19 (0.898) = VGG16 (0.898) > ResNet50 (0.895) > ResNet101 (0.890).
Prasad et al. (2022) [129] Clinical data, cephalometric data, cast and photographic data. A total of 700 samples RF, XGB, LR, DT, K-Neighbors, Linear SVM, NB To predict skeletal jaw base, extraction diagnosis for Class 1 jaw base, and functional/camouflage/surgical strategies for Class 2/3 jaw base. Different algorithms showed different accuracies in different layers. RF performed best in 3 out of 4 layers.
Knoops et al. (2019) [130] 3D face scans A total of 4261 SVM for classification
LR, RR, LARS, and LASSO for regression
To predict surgery/non-surgery decision and surgical outcomes. For surgery/non-surgery decision:
Accuracy: 95.4%. Sensitivity: 95.5%. Specificity: 95.2%.
For surgical outcomes simulation:
Average error: LARS and RR (1.1 ± 0.3 mm). LASSO (1.3 ± 0.3 mm). LR (3.0 ± 1.2 mm).
Choi et al. (2019) [131] Lateral cephalometric variables, dental variable, profile variables, chief complaint for protrusion. 136/112
(additional 68 samples than validation set)
ANN To predict surgery/non-surgery decision, extraction/non-extraction for surgical treatment. Accuracy for all dataset:
Diagnosis of surgery/non-surgery: 96%.
Diagnosis of extraction/non-extraction for Class II surgery: 97%.
Diagnosis of extraction/non-extraction for Class III surgery: 88%.
Diagnosis of extraction/non-extraction for surgery: 91%.
Lee et al. (2020) [132] Lateral cephalograms. 220/40 (additional 73 samples than validation set) CNN (Modified-Alexnet, MobileNet, and Resnet50) To predict the need for orthognathic surgery. Average accuracy for all dataset:
Modified-Alexnet: 96.4%.
MobileNet: 95.4%.
Resnet50: 95.6%.
Jeong et al. (2020) [133] Facial photos (front and right). A total of 822 samples.
Group 1: 207/204.
Group 2: 205/206
CNN To predict the need for orthognathic surgery. Accuracy: 0.893.
Precision: 0.912.
Recall: 0.867.
F1 scores:0.889.
Shin et al. (2021) [134] Lateral cephalograms and posteroanterior cephalograms. A total of 840 samples.
Group 1: 273/304 (additional 30 samples than validation set).
Group 2: 98/109 (additional 11 samples than validation set)
CNN To predict the diagnosis of orthognathic surgery. Accuracy: 0.954.
Sensitivity: 0.844.
Specificity: 0.993.
Kim et al. (2021) [135] Lateral cephalograms. 810/150 CNN
(ResNet-18, 34, 50, 101)
To predict the diagnosis of orthognathic surgery. Accuracy for test dataset:
ResNet-18/34/50/101: 93.80%/93.60%/91.13%/91.33%.
Lee et al. (2022) [136] Cephalometric measurements, demographic characteristics, dental analysis, and chief complaint. 136/60 RF, LR To predict the diagnosis of orthognathic surgery. Accuracy (RF/LR): 90%/78%.
Sensitivity (RF/LR): 84%/89%.
Specificity (RF/LR): 93%/73%.
Woo et al. (2023) [137] Intraoral scan data -/30 Three commercially available software packages (Autolign, Outcome Simulator Pro, Ortho Simulation) To evaluate the accuracy of automated digital setup accuracy. Mean error of three pieces of software:
Linear movement: 0.39~1.40 mm.
Angular movement: 3.25~7.80°.
Park et al. (2021) [138] Lateral cephalograms A total of 284 cases CNN (U-Net) To predict the cephalometric changes of Class II patients after using modified C-palatal plates. Total mean error: 1.79 ± 1.77 mm.
Tanikawa et al. (2021) [139] 3D facial images A total of 72 cases in surgery group and 65 cases in extraction group Deep learning To predict facial morphology change after orthodontic or orthognathic surgical treatment. Average system errors:
Surgery group: 0.94 ± 0.43 mm; orthodontic group: 0.69 ± 0.28 mm.
Success rates (<1 mm): Surgery group: 54%; orthodontic group: 98%.
Success rates (<2 mm): Surgery group: 100%; orthodontic group: 100%.
Park et al. (2022) [140] CBCT 268/44 cGAN To predict post-orthodontic facial changes. Mean prediction error: 1.2 ± 1.01 mm.
Accuracy within 2 mm: 80.8%.
Xu et al. (2022) [141] Total of 17 clinical features A total of 196 cases ANN To predict patient experience of Invisalign treatment. Predictive success rate:
Pain: 87.7%. Anxiety: 93.4%. Quality of life: 92.4%.

ANN, artificial neural network; DT, decision tree; RF, random forest; LR, logistic regression; SVM, support vector machine; NB, naive bayes; KNN, k-nearest neighbors; MLP, multilayer perceptron; XGB, eXtreme Gradient Boosting; RR, ridge regression; LARS, least-angle regression; LASSO, least absolute shrinkage and selection operator regression; CNN, convolutional neural network; CBCT, cone-beam computed tomography; cGAN, conditional generative adversarial networks.