Skip to main content
. 2025 May 1;21(5):843–854. doi: 10.5664/jcsm.11560

Table 3.

Evaluation of discriminating performances of OSA risk screening models.

Study Sample Size (Whole; Test) Moderate/Severe OSA Definition Diagnostic Cut-Off Criteria for Labeling Method Features Analyzed, (n) AUROC AUPRC Sen Spe Acc PPV NPV F1 Score Threshold
Our model 342; 86 AHI ≥ 15 50% CNN + LR Facial photo, (1); SBQ, (8) 97.2% 97.0% 93.0% 90.7% 91.9% 90.9% 92.9% 92.0% 0.547
CNN Facial photo, (1) 85.7% 84.6% 79.1% 88.4% 76.7% 87.2% 80.9% 82.9% 0.573
RF SBQ, (8) 85.7% 81.1% 78.1% 82.8% 80.5% 82.0% 79.1% 80.0% 0.56
SBQ ≥ 4, (1) 79.1% 71.4% 90.7% 67.4% 79.1% 73.6% 87.9% 81.3%
He et al 202114 197; — AHI ≥ 10 74% LR Facial photo, (4) 90% 85.6% 84.3%
Facial photo, (4); physical measurements, (2) 93% 88.4% 86.3%
Chen et al 202315 653;187 AHI ≥ 15 55.3% CatBoost Facial photo, (68); clinical variables, (19) 76% 75% 71% 72% 73%
SBQ ≥ 3, (1) 69%
Remya et al 201716 76; — AHI ≥ 10 68.4% LR Facial photo, (28); Anthropometric parameters, (11) 93.1% 20.0% 74.4%
Huo et al 202340 2,357; 1,237 AHI ≥ 15 45.2% LR Clinical variables (6) 78% 72% 77% 68%
SBQ ≥ 3, (1) 69% 59%
He et al 202241 202; 101 AHI ≥ 15 62.3 LR Clinical variables (4) 83.7% 81.1% 76.0% 81.2%
SBQ ≥ 3, (1) 73.8%

The F1 score is calculated by 2 × sensitivity × PPV/(sensitivity + PPV). Acc = accuracy, AHI = apnea-hypopnea index, AUROC = area under the receiver operating characteristic, AUPRC = area under the precision–recall curve, CNN = convolutional neural network, LR = logistic regression, NPV = negative predictive value, OSA = obstructive sleep apnea, PPV = positive predictive value, RF = random forest, Sen = sensitivity, Spe = specificity, SBQ = STOP-BANG questionnaire.