Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Aug 3.
Published in final edited form as: Proc SPIE Int Soc Opt Eng. 2023 Apr 7;12465:124652P. doi: 10.1117/12.2654353

Predicting obstructive sleep apnea severity from craniofacial images using ensemble machine learning models

Ziyu Su 1, Sandhya Kumar 1, Thomas E Tavolara 1, Metin N Gurcan 1,*, Scott Segal 1,*, M Khalid Khan Niazi 1,*
PMCID: PMC10399208  NIHMSID: NIHMS1857752  PMID: 37538448

Abstract

Obstructive sleep apnea (OSA) is a prevalent disease affecting 10 to 15% of Americans and nearly one billion people worldwide. It leads to multiple symptoms including daytime sleepiness; snoring, choking, or gasping during sleep; fatigue; headaches; non-restorative sleep; and insomnia due to frequent arousals. Although polysomnography (PSG) is the gold standard for OSA diagnosis, it is expensive, not universally available, and time-consuming, so many patients go undiagnosed due to lack of access to the test. Given the incomplete access and high cost of PSG, many studies are seeking alternative diagnosis approaches based on different data modalities. Here, we propose a machine learning model to predict OSA severity from 2D frontal view craniofacial images. In a cross-validation study of 280 patients, our method achieves an average AUC of 0.780. In comparison, the craniofacial analysis model proposed by a recent study only achieves 0.638 AUC on our dataset. The proposed model also outperforms the widely used STOP-BANG OSA screening questionnaire, which achieves an AUC of 0.52 on our dataset. Our findings indicate that deep learning has the potential to significantly reduce the cost of OSA diagnosis.

Keywords: obstructive sleep apnea, machine learning, craniofacial image analysis

Description of Purpose

Obstructive sleep apnea (OSA) is a sleep disorder that occurs when the upper airway gets obstructed, preventing normal breathing [1]. The disease is characterized by episodes of decreased breathing (hypopnea) or cessation of breathing for 10 seconds or more (apnea) during sleep. It has been recognized to be associated with multiple severe complications including drowsy driving and motor vehicle crashes; neuropsychiatric dysfunction; cardiovascular disorders; metabolic syndromes; and liver disease. Overall, OSA is a highly prevalent disease that affects 10 to 15% of Americans and nearly one billion people worldwide [2].

OSA is diagnosed by polysomnography (PSG), a formal in-lab study that monitors patients’ respiration, oxygen saturation, heart rate and electrocardiogram, and electroencephalography (EEG) during sleep. Although PSG is the gold standard, it is expensive (approximately $2100), not universally available, and time-consuming, so about 80% of OSA patients cannot be diagnosed due to lack of access to the test. To make the OSA diagnosis more accessible, many studies have sought alternative diagnosis approaches based on different data modalities, such as computed tomography, magnetic resonance imaging, and craniofacial scan [37]. However, radiology imaging data is cumbersome and expensive to acquire. As one of the most easy-to-access modalities, frontal view craniofacial images have significant interaction with the OSA pathogenesis and thus have been studied as a predictor for OSA diagnosis [3, 7, 8]. Here, we propose to develop a machine learning-based model for 2D frontal view craniofacial images to predict OSA severity.

Methods

Dataset

Our dataset contains data from 280 patients who seen our Atrium Health Wake Forest Baptist (AHWFB) sleep medicine clinics. Patients were included in our dataset if they are over 18 years old and provided consent to take part in our study. Craniofacial images from patients’ frontal view were taken by cameras and relevant clinical variables were also collected. The ground truth of OSA was determined by patients’ PSG results in the form of apnea-hypopnea index (AHI). The data were labeled positive (mild/moderate/severe OSA) if their AHI ≥ 5 or negative (none/minimal OSA) if their AHI < 5.

DeepXGBoost

We propose a deep-learning based craniofacial image analysis model, DeepXGBoost, for OSA prediction. It is composed of two modules – an image feature extractor and XGBoost classifier.

For the image feature extractor, we propose a spectral decoupled neural network (SDNet) to learn features related to OSA from 2D frontal-view craniofacial images. The backbone of our SDNet is ResNet50 initialized with ImageNet pretrained weights [9]. To improve the feature learning ability of our model on small datasets, we propose to add a spectral decoupling constraint on top of the regular binary cross-entropy (BCE) loss function [10]. We then train the SDNet on craniofacial images. After the training of SDNet, we use it as a feature extractor to extract feature vectors from images. Then, we train an XGBoost classifier [11] based on the features vectors to predict OSA severity.

Integration of craniofacial images and clinical data

We also use patients’ clinical data as a secondary modality to predict OSA severity. We use a separate XGBoost model to predict OSA from clinical variables. We select variables according to their correlation with AHI using Pearson correlation coefficients. After training, the clinical model outputs are aggregated with DeepXGBoost outputs by average pooling.

Experimental Design

We perform five-fold cross-validation of each model on our dataset to classify patient’s OSA severity. We compare our method with a commonly used OSA screening questionnaire (i.e. STOP-BANG) [12] and a 2D craniofacial model for OSA prediction recently published [8]. We report AUC to evaluate the performance of the models.

Results

Table 1 reports the performance of our DeepXGBoost (craniofacial), ensemble model of DeepXGBoost (craniofacial) and clinical variable model, and the comparison methods. Overall, our craniofacial image-based DeepXGBoost outperform the comparison methods. After combining our DeepXGBoost with clinical variable model’s outputs, the performance is further improved to 0.780 AUC.

Table 1.

Results of OSA severity prediction of different methods

Modalities Methods AUC (± standard variation)
Questionnaire STOP-BANG 0.529±0.061
Craniofacial EfficienNet-OSA (He et al. 2022) [8] 0.638±0.115
Craniofacial DeepXGBoost 0.733±0.045
Clinical variables XGBoost 0.749±0.037
Craniofacial + clinical variables DeepXGBoost (craniofacial) +
XGBoost (clinical variables)
0.780 ± 0.066

New or breakthrough work to be presented

We are among the first to develop deep learning based methods to predict OSA severity from 2D craniofacial images. Compared to the widely studied 3D craniofacial scan based methods [7, 13], our proposed modality is much easier to acquire and thus much more practical. To the best of our knowledge, there is only one existing study predict OSA from 2D craniofacial images using deep learning [8]. However, this study is a simple application of EfficientNet model [14] and only achieves moderate results on our dataset (see Table 1 for details). On the other hand, we innovatively utilize spectral decoupling [10] to tackle craniofacial analysis and achieve reasonable prediction performance.

Conclusions

The main purpose of this study was to develop a machine learning based model to predict OSA severity from 2D craniofacial images. To achieve our purpose, we innovatively developed a deep-learning based model: DeepXGBoost which shows the potential of deep learning in identifying OSA patients. We have shown that our DeepXGBoost outperforms the comparison craniofacial-base OSA prediction model and a commonly used OSA screening questionnaire. After combining our DeepXGBoost with clinical model’s output, our method achieves an average AUC of 0.780. With this prediction performance and accessible data modality, our method has the potential to serve as a convenient pre-assessment tool for OSA diagnosis.

Acknowledgements

The work was partly supported through a National Institutes of Health Trailblazer award R21EB029493 (PIs: Niazi, Segal). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This study was also partly funded by the Anesthesia Patient Safety Foundation Award 2020 (“Development of machine learning algorithms to predict difficult airway management”) and a pilot award provided by Center for Biomedical Informatics at Wake Forest School of Medicine.

References

  • [1].Kapur VK et al. , “Clinical practice guideline for diagnostic testing for adult obstructive sleep apnea: an American Academy of Sleep Medicine clinical practice guideline,” Journal of Clinical Sleep Medicine, vol. 13, no. 3, pp. 479–504, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Benjafield AV et al. , “Estimation of the global prevalence and burden of obstructive sleep apnoea: a literature-based analysis,” Lancet Respir Med, vol. 7, no. 8, pp. 687–698, Aug 2019, doi: 10.1016/S2213-2600(19)30198-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Lee RWW, Petocz P, Prvan T, Chan ASL, Grunstein RR, and Cistulli PA, “Prediction of Obstructive Sleep Apnea with Craniofacial Photographic Analysis,” Sleep, vol. 32, no. 1, pp. 46–52, 2009, doi: 10.5665/sleep/32.1.46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Ogawa T, Enciso R, Shintaku WH, and Clark GT, “Evaluation of cross-section airway configuration of obstructive sleep apnea,” Oral Surgery, Oral Medicine, Oral Pathology, Oral Radiology, and Endodontology, vol. 103, no. 1, pp. 102–108, 2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Schwab RJ et al. , “Identification of upper airway anatomic risk factors for obstructive sleep apnea with volumetric magnetic resonance imaging,” American journal of respiratory and critical care medicine, vol. 168, no. 5, pp. 522–530, 2003. [DOI] [PubMed] [Google Scholar]
  • [6].Lowe AA, Fleetham JA, Adachi S, and Ryan CF, “Cephalometric and computed tomographic predictors of obstructive sleep apnea severity,” American Journal of Orthodontics and Dentofacial Orthopedics, vol. 107, no. 6, pp. 589–595, 1995. [DOI] [PubMed] [Google Scholar]
  • [7].Hanif U. et al. , “Estimation of apnea-hypopnea index using deep learning on 3-D craniofacial scans,” IEEE Journal of Biomedical and Health Informatics, vol. 25, no. 11, pp. 4185–4194, 2021. [DOI] [PubMed] [Google Scholar]
  • [8].He S, Su H, Li Y, Xu W, Wang X, and Han D, “Detecting obstructive sleep apnea by craniofacial image–based deep learning,” Sleep and Breathing, pp. 1–11, 2022. [DOI] [PubMed]
  • [9].He K, Zhang X, Ren S, and Sun J, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778. [Google Scholar]
  • [10].Pezeshki M, Kaba O, Bengio Y, Courville AC, Precup D, and Lajoie G, “Gradient starvation: A learning proclivity in neural networks,” Advances in Neural Information Processing Systems, vol. 34, pp. 1256–1272, 2021. [Google Scholar]
  • [11].Chen T. and Guestrin C, “Xgboost: A scalable tree boosting system,” in Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016, pp. 785–794. [Google Scholar]
  • [12].Chung F, Abdullah HR, and Liao P, “STOP-Bang questionnaire: a practical approach to screen for obstructive sleep apnea,” Chest, vol. 149, no. 3, pp. 631–638, 2016. [DOI] [PubMed] [Google Scholar]
  • [13].Eastwood P. et al. , “Predicting sleep apnea from three-dimensional face photography,” Journal of Clinical Sleep Medicine, vol. 16, no. 4, pp. 493–502, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Tan M. and Le Q, “Efficientnet: Rethinking model scaling for convolutional neural networks,” in International conference on machine learning, 2019: PMLR, pp. 6105–6114. [Google Scholar]

RESOURCES