. 2022 Mar 9;27(1):39–55. doi: 10.1007/s11325-022-02592-4

Table 1.

Examples of studies using machine learning algorithms for sleep stage and respiratory scoring

Author, year	Population studied	Dataset source	Channels and sensors	Data preprocess	Classifier used	Performance measures	Other findings
Perslev [87], 2021	15,660 participants	16 clinical datasets	Single channel EEG and EOG sugnal	U Sleep software	Convoluted Neural Network(CNN)	U sleep performed as well as other algorithms, even though U sleep was not trained on similar datasets	It predicts sleep stages in a single forward pass
Sharma [88] 2021	80 subjects comprising of healthy controls as well as various sleep disorders	Cyclic Alternating Pattern (CAP) database	Dual channel EEG	Optimized wavelet filters	Bagged tree (EBT) classifier with tenfold cross validation	Accuracy 85.3% κ = 0.786	Accuracy improved in a balanced dataset created using over-sampling and under-sampling techniques
Sun, 2020 [6]	8682 PSG	N/A	ECG and respiratory signals	270 s time windows used	CNN	κ = 0.585 (sleep stages) κ = 0.76 (wake vs NREM vs REM)	Performance is better for younger ages
Jaoude 2020 [89]	6431 patients	MGH-PSG dataset; Ambulatory scalpEEG dataset	4 EEG channels	Bandpass filter, downsampling to 100 Hz, generated bipolar montage to make it “reference channel-free”	CNN followed by RNN	κ = 0.74 (MGH held out dataset) κ = 0.78 (scalpEEG dataset)	Performance was consistent across common EEG background abnormalities
Zhange et al., 2020 [90]	294 sleep studies (122 training data set, 20 validation dataset, 152 testing data set)	Prospectively collected	2 channel EEG, EOG, EMG, ECG, airflow	Filtered signal at 66 Hz, downsampled signal sampling frequency to 66 Hz	CNN	Accuracy 81.81% κ = 0.7276	Number of arousals affected model’s performance
Peter-Derex 2020 [91]	23 patients with insomnia, 24 patients with idiopathic hypersomnia, 24 patients with narcolepsy, 24 patients with OSA	Lyon sleep database	Single channel EEG	ASEEGA software	ASEEGA software	Agreement between software and consensual scorer was: insomnia 85.5% (κ = 0.80), narcolepsy 83.8% (κ = 0.78), idiopathic hypersomnia 86.1% (κ = 0.68), and obstructive sleep disorder 87.2% (κ = 0.82)
Sridhar, 2020 [92]	800 (561 subjects); 993 nights (993 subjects)	Sleep Heart Health Study; Multi-ethnic study of Atherosclerosis; Physionet Computing in Cardiology (CinC)	ECG	Normalize ECG signal, interbeat interval time series computed	CNN	Accuracy 77% κ = 0.66 (SHHS) Accuracy 72% κ = 0.55 (CinC)
Zhu, 2020 [93]	8 recordings (4 healthy, 4 sleep disorders); 20 recordings (healthy)	Sleep EDF, Sleep EDFX	Single channel EEG	Z score normalization of data	CNN + attention mechanism	Accuracy 93.7% F1 score 84.5%	Attention mechanism helped in learning inter and intra-epoch features
Xu, 2020 [94]	5793 participants (sleep disorders)	Sleep Heart Health Study	Multichannel EEG, EOG	Time frequency spectra	LSTM (Long short term memory)/RNN	Accuracy 87.4% κ = 0.8216	RNNtakes temporal information into account
Zhang et al. Sleep, 2019 [95]	5213 patients	Sleep Heart Health Study	Multichannel EEG, EOG, EMG	Raw signal, short term Fourier transform for spectogram	Recurrent and convolutional neural networks	κ = 0.82 Validation MrOS κ = 0.68 Validation SOF κ = 0.70
Yildrim, 2019 [96]	8 recordings; 61 recordings (healthy and insomnia)	Sleep EDF, Sleep EDFX	Single channel EEG, Single channel EOG	Raw PSG signal	CNN	Accuracy 98.06%
Phan, 2019 [97]	200 recordings	MGH sleep lab	Single channel EEG, EOG, EMG	Time frequency images	CNN + RNN	Accuracy 87% κ = 0.81	Trained network in end to end fashion
Zhang and Wu, 2018 [98]	25 recordings (sleep disorders) 16 recordings	MIT-BIH database, Sleep EDF	Single channel EEG	Phase encoder, unsupervised training	CNN	Accuracy 87% κ = 0.81
Stephansen, 2018 [57]	3000 recordings (healthy and sleep disorders)	10 databases	Multichannel EEG, EOG,	Filter + octave encoding	CNN + RNN	Accuracy = 87%	Automates type 1 narcolepsy diagnosis
Sors et al., 2018 [99]	5793 recordings (sleep disorders)	Sleep Heart Healthy Study	Single channel EEG	raw	CNN	Accuracy = 87% κ = 0.81
Biswal et al. 2018 [50]	1000 recordings 5804 recordings	Sleep Heart Health study; ISRUC-sleep	Multichannel EEG	Spectogram	Recurrent and convolutional neural networks	accuracy 87.5% κ = 0.805 Validation SHHS accuracy 77.7% κ = 0.732	Minimal reduction in accuracy noted when working on single channel
Patanaik et al. Sleep, 2018 [18]	1046 recordings (healthy adolescents) 284 recordings (healthy young adults) 210 recordings(sleep disorders) 77 recordings (Parkinson disease adults)	CNL lab, CSL lab, Singapore; UCSD sleep lab, UC database	Multichannel EEG, EOG,	Spectral images	Convolutional neural network	accuracy 89.2% κ = 0.86 Validation set 1- accuracy 81.4% κ = 0.740 Validation set 2 (PD)- accuracy 72.1% κ = 0.597	Faster compared to human experts (F sec compared to 30–60 min)
Olesen 2018 [7]	2310 recordings (healthy and sick)	CNL lab, Singapore	Multichannel EEG, EOG, EMG	Raw data	CNN	κ = 0.75	Most errors made in stage N1 and N3
Malafeev, 2018 [56]	54 recordings (healthy); 43 recordings (22 PSG and 21 MSLT, hypersomnia patients)	Warsaw (healthy); Wisconsin Sleep Cohort (hypersomnia)	Single channel EEG, EMG, EOG	Spectrogram	CNN	κ = 0.8 (except stage N1)	Performance in healthy subjects were better compared to those on hypersomnia patients
Cui 2018 [46]	116 recordings including healthy and sick population	University of Zurich	Multichannel EEG, EOG, EMG	Time series	CNN	Accuracy = 92.2%
Chambon, 2018 [100]	61 recordings from healthy adults	MGH sleep lab	Multichannel EEG Three chin EMG	Linear spatial filtering	CNN	Sensitivity = 52%	Utilized 1 min of data before and after each data segment which offered the strongest improvement
Vilamala, 2017 [101]	40 recordings from 20 healthy adults	Montreal archive	Single channel EEG	Multitaper spectral analysis	CNN	Acc = 84–88%
Supratak,2017 [55]	62 healthy recordings	MGH sleep lab, Montreal Archive	Single channel EEG	Raw data	CNN + RNN	Acc = 86.2% K = 0.8	Used multiple datasets
Tsinalis, 2016 [42]	40 recordings from 20 Healthy young adults	Montreal archive	Single channel EEG	Class balanced random sampling	CNN and 2D stack of frequency-specific activity in time	Accuracy = 71–76% Per stage Accuracy = 80–84%	Performance balanced across classes

CNN convolutional neural network, RNN recurrent neural network, LSTM long-short-term memory, EBT ensemble bagged tree, Κ kappa