Skip to main content
. 2022 Mar 9;27(1):39–55. doi: 10.1007/s11325-022-02592-4

Table 1.

Examples of studies using machine learning algorithms for sleep stage and respiratory scoring

Author, year Population studied Dataset source Channels and sensors Data preprocess Classifier used Performance measures Other findings
Perslev [87], 2021 15,660 participants 16 clinical datasets Single channel EEG and EOG sugnal U Sleep software Convoluted Neural Network(CNN) U sleep performed as well as other algorithms, even though U sleep was not trained on similar datasets It predicts sleep stages in a single forward pass
Sharma [88] 2021 80 subjects comprising of healthy controls as well as various sleep disorders Cyclic Alternating Pattern (CAP) database Dual channel EEG Optimized wavelet filters Bagged tree (EBT) classifier with tenfold cross validation

Accuracy 85.3%

κ = 0.786

Accuracy improved in a balanced dataset created using over-sampling and under-sampling techniques
Sun, 2020 [6] 8682 PSG N/A ECG and respiratory signals 270 s time windows used CNN

κ = 0.585 (sleep stages)

κ = 0.76 (wake vs NREM vs REM)

Performance is better for younger ages
Jaoude 2020 [89] 6431 patients

MGH-PSG dataset;

Ambulatory scalpEEG dataset

4 EEG channels Bandpass filter, downsampling to 100 Hz, generated bipolar montage to make it “reference channel-free” CNN followed by RNN

κ = 0.74 (MGH held out dataset)

κ = 0.78 (scalpEEG dataset)

Performance was consistent across common EEG background abnormalities
Zhange et al., 2020 [90] 294 sleep studies (122 training data set, 20 validation dataset, 152 testing data set) Prospectively collected 2 channel EEG, EOG, EMG, ECG, airflow Filtered signal at 66 Hz, downsampled signal sampling frequency to 66 Hz CNN

Accuracy 81.81%

κ = 0.7276

Number of arousals affected model’s performance
Peter-Derex 2020 [91] 23 patients with insomnia, 24 patients with idiopathic hypersomnia, 24 patients with narcolepsy, 24 patients with OSA Lyon sleep database Single channel EEG ASEEGA software ASEEGA software Agreement between software and consensual scorer was: insomnia 85.5% (κ = 0.80), narcolepsy 83.8% (κ = 0.78), idiopathic hypersomnia 86.1% (κ = 0.68), and obstructive sleep disorder 87.2% (κ = 0.82)
Sridhar, 2020 [92]

800 (561 subjects);

993 nights (993 subjects)

Sleep Heart Health Study;

Multi-ethnic study of Atherosclerosis;

Physionet Computing in Cardiology (CinC)

ECG Normalize ECG signal, interbeat interval time series computed CNN

Accuracy 77%

κ = 0.66

(SHHS)

Accuracy 72%

κ = 0.55

(CinC)

Zhu, 2020 [93] 8 recordings (4 healthy, 4 sleep disorders); 20 recordings (healthy) Sleep EDF, Sleep EDFX Single channel EEG Z score normalization of data CNN + attention mechanism

Accuracy 93.7%

F1 score 84.5%

Attention mechanism helped in learning inter and intra-epoch features
Xu, 2020 [94] 5793 participants (sleep disorders) Sleep Heart Health Study Multichannel EEG, EOG Time frequency spectra LSTM (Long short term memory)/RNN

Accuracy 87.4%

κ = 0.8216

RNNtakes temporal information into account
Zhang et al. Sleep, 2019 [95] 5213 patients Sleep Heart Health Study Multichannel EEG, EOG, EMG Raw signal, short term Fourier transform for spectogram Recurrent and convolutional neural networks

κ = 0.82

Validation MrOS

κ = 0.68

Validation SOF

κ = 0.70

Yildrim, 2019 [96]

8 recordings;

61 recordings

(healthy and insomnia)

Sleep EDF, Sleep EDFX Single channel EEG, Single channel EOG Raw PSG signal CNN Accuracy 98.06%
Phan, 2019 [97] 200 recordings MGH sleep lab Single channel EEG, EOG, EMG Time frequency images CNN + RNN

Accuracy 87%

κ = 0.81

Trained network in end to end fashion
Zhang and Wu, 2018 [98]

25 recordings (sleep disorders)

16 recordings

MIT-BIH database, Sleep EDF Single channel EEG Phase encoder, unsupervised training CNN

Accuracy 87%

κ = 0.81

Stephansen, 2018 [57] 3000 recordings (healthy and sleep disorders) 10 databases Multichannel EEG, EOG, Filter + octave encoding CNN + RNN Accuracy = 87% Automates type 1 narcolepsy diagnosis
Sors et al., 2018 [99] 5793 recordings (sleep disorders) Sleep Heart Healthy Study Single channel EEG raw CNN

Accuracy = 87%

κ = 0.81

Biswal et al. 2018 [50]

1000 recordings

5804 recordings

Sleep Heart Health study;

ISRUC-sleep

Multichannel EEG Spectogram Recurrent and convolutional neural networks

accuracy 87.5%

κ = 0.805

Validation SHHS

accuracy 77.7%

κ = 0.732

Minimal reduction in accuracy noted when working on single channel
Patanaik et al. Sleep, 2018 [18]

1046 recordings (healthy adolescents)

284 recordings (healthy young adults)

210 recordings(sleep disorders)

77 recordings (Parkinson disease adults)

CNL lab, CSL lab, Singapore; UCSD sleep lab, UC database Multichannel EEG, EOG, Spectral images Convolutional neural network

accuracy 89.2%

κ = 0.86

Validation set 1-

accuracy 81.4%

κ = 0.740

Validation set 2 (PD)-

accuracy 72.1%

κ = 0.597

Faster compared to human experts (F sec compared to 30–60 min)
Olesen 2018 [7] 2310 recordings (healthy and sick) CNL lab, Singapore Multichannel EEG, EOG, EMG Raw data CNN κ = 0.75 Most errors made in stage N1 and N3
Malafeev, 2018 [56]

54 recordings (healthy);

43 recordings (22 PSG and 21 MSLT, hypersomnia patients)

Warsaw (healthy);

Wisconsin Sleep Cohort (hypersomnia)

Single channel EEG, EMG, EOG Spectrogram CNN κ = 0.8 (except stage N1) Performance in healthy subjects were better compared to those on hypersomnia patients
Cui 2018 [46] 116 recordings including healthy and sick population University of Zurich Multichannel EEG, EOG, EMG Time series CNN Accuracy = 92.2%
Chambon, 2018 [100] 61 recordings from healthy adults MGH sleep lab

Multichannel EEG

Three chin EMG

Linear spatial filtering CNN Sensitivity = 52% Utilized 1 min of data before and after each data segment which offered the strongest improvement
Vilamala, 2017 [101] 40 recordings from 20 healthy adults Montreal archive Single channel EEG Multitaper spectral analysis CNN Acc = 84–88%
Supratak,2017 [55] 62 healthy recordings MGH sleep lab, Montreal Archive Single channel EEG Raw data CNN + RNN

Acc = 86.2%

K = 0.8

Used multiple datasets
Tsinalis, 2016 [42] 40 recordings from 20 Healthy young adults Montreal archive Single channel EEG Class balanced random sampling CNN and 2D stack of frequency-specific activity in time

Accuracy = 71–76%

Per stage Accuracy = 80–84%

Performance balanced across classes

CNN convolutional neural network, RNN recurrent neural network, LSTM long-short-term memory, EBT ensemble bagged tree, Κ kappa