Abstract
Obstructive sleep apnea (OSA) is a prevalent respiratory condition in children and is characterized by partial or complete obstruction of the upper airway during sleep. The respiratory events in OSA induce transient alterations of the cardiovascular system that ultimately can lead to increased cardiovascular risk in affected children. Therefore, a timely and accurate diagnosis is of utmost importance. However, polysomnography (PSG), the standard diagnostic test for pediatric OSA, is complex, uncomfortable, costly, and relatively inaccessible, particularly in low-resource environments, thereby resulting in substantial underdiagnosis. Here, we propose a novel deep-learning approach to simplify the diagnosis of pediatric OSA using raw electrocardiogram tracing (ECG). Specifically, a new convolutional neural network (CNN)-based regression model was implemented to automatically predict pediatric OSA by estimating its severity based on the apnea-hypopnea index (AHI) and deriving 4 OSA severity categories. For this purpose, overnight ECGs from 1,610 PSG recordings obtained from the Childhood Adenotonsillectomy Trial (CHAT) database were used. The database was randomly divided into approximately 60%, 20%, and 20% for training, validation, and testing, respectively. The diagnostic performance of the proposed CNN model largely outperformed the most accurate previous algorithms that relied on ECG-derived features (4-class Cohen’s kappa coefficient of 0.373 versus 0.166). Specifically, for AHI cutoff values of 1, 5, and 10 events/hour, the binary classification achieved sensitivities of 84.19%, 76.67%, and 53.66%; specificities of 46.15%, 91.39%, and 98.06%; and accuracies of 75.92%, 86.96%, and 91.97%, respectively. Therefore, pediatric OSA can be readily identified by our proposed CNN model, which provides a simpler, faster, and more accessible diagnostic test that can be implemented in clinical practice.
Keywords: Obstructive sleep apnea, Pediatrics, Electrocardiogram, Convolutional Neural Network, Apnea-hypopnea index, Childhood Adenotonsillectomy Trial
1. Introduction
Obstructive Sleep Apnea (OSA) is a common breathing disorder characterized by multiple episodes of partial or total upper airway obstruction during sleep, resulting in either reduction or cessation of the airflow with attendant alterations in gas exchange, and recurrent arousals fostering the occurrence of sleep fragmentation [1]. In otherwise healthy children, the prevalence of OSA ranges between 1% and 5%, affecting both sexes similarly [2]. Enlarged adenoids and tonsils are one of the major pathophysiological processes enhancing the risk of OSA occurrence [3], and the disease can impose a significant deleterious impact on the cardiovascular and central nervous systems when left untreated. Indeed, the obstructive respiratory events and accompanying manifestations (i.e., intermittent hypoxia and hypercapnia and recurrent arousals along with enhanced intrathoracic pressure swings) induce increased cardiac workload as well tachy- and brady-arrhythmias and catecholaminergic and sympathetic nervous system bursts [1,4–8]. Moreover, OSA has been linked to an elevated risk of developing cardiovascular disease during adulthood, particularly if left untreated [2,7,9,10].
OSA is routinely diagnosed using overnight polysomnography (PSG), which involves monitoring various physiological parameters, such as the electrocardiogram (ECG), oral and nasal airflow (AF), peripheral blood oxygen saturation (SpO2), and the electroencephalogram (EEG), among others [1,2]. The signals obtained from PSG are manually analyzed by trained medical personnel to derive the apnea-hypopnea index (AHI), the standard metric used in the diagnosis of OSA [11]. AHI measures the frequency of apnea and hypopnea events per hour of sleep (e/h) and helps determine the severity of the disease [2]. However, nocturnal PSG is an uncomfortable, time-consuming, and complex test, especially in children. It requires a sleep laboratory with appropriately trained staff, and pediatric subjects have to spend the night in the hospital facility while being monitored with sensors, thus potentially disrupting their natural sleep patterns [1]. Limited availability of such facilities for children, coupled with the cost of conducting overnight PSG, leads to underdiagnosis of pediatric OSA, particularly in developing countries where resources are limited [12,13].
To simplify OSA diagnosis in children, investigators have focused on developing approaches that require a limited number of signals coupled with artificial intelligence techniques [14,15]. These proposed methodologies have mainly focused on the use of overnight SpO2 and AF [15]. However, none of these approaches relied on cardiac signals derived from the ECG which contains a plethora of relevant information that can be used for diagnostic purposes involving both cardiorespiratory coupling and changes in heart rate frequency and variability associated with the respiratory events [2–4,7,16]. These physiological behaviors, together with the increased risk for developing cardiovascular disease, make ECG signal analysis worthy of special interest in the study of OSA [7,9,10]. Furthermore, because the ECG is one of the most widely analyzed signals in clinical practice around the world, an ECG-based application to aid in the diagnosis of OSA could be readily implementable and accessible [17–19].
Several investigative groups have focused on the analysis of cardiac information using deep learning (DL) methods, such as convolutional neural networks (CNN), recurrent neural networks (RNN), combinations of CNN and RNN, and hybrid methods to automatically detect the severity of OSA in adult patients, achieving overall robust performances [20–26]. However in children, most studies have relied on derivatives of cardiac function such as heart rate variability (HRV) or photoplethysmography (PPG), but not used the raw ECG signal [3,27–33]. These methodologies, while yielding satisfactory results, rely on existing knowledge of the effects of pediatric OSA on the ECG to extract features from different analytical approaches [34,35]. However, these approaches do not use all the information available in the ECG, which could be crucial for achieving a more thorough understanding and accurate diagnosis of the disease. Moreover, most of these previous studies relied on a feature-engineering approach, which is demanding and time-consuming due to comprehensive signal preprocessing and feature extraction [3,27–33].
To the best of our knowledge, no studies have investigated the use of raw ECG signals coupled with DL methods to explore their feasibility in the diagnosis of pediatric OSA. Such combinatorial approach is essential for two reasons: on the one hand, DL methods are gaining great interest in many fields due to a large number of existing databases [36], especially highlighting their application in the biomedical sector to help in the diagnosis and treatment of diseases [37–39]. Specifically, DL methods can handle high-dimensional data due to multiple-layer processing that allows for extracting relevant information intrinsically without exhaustive signal preprocessing [40]. On the other hand, analyzing the ECG signal could be very helpful for pediatric OSA due to associated cardiovascular risk, particularly in more severe cases [2].
Furthermore, analyzing previous studies focused on automatically detecting the severity of OSA using DL, most of them used architectures based on CNNs [20–26]. Although CNNs were originally created for image analysis [41], these networks have proven to be appropriate for time series analysis in a wide range of fields [42], especially in the domain of biomedical signal analysis [35,43,44]. CNNs contain a multi-layer architecture and their design is characterized by weight sharing, sparse connections, and pooling operations [41]. This structure allows them to identify short- and long-term patterns happening in distinct regions of the data sequences [43], with a computational efficiency that outperforms other DL models [41]. This property of CNNs can be essential in identifying patterns in the ECG signal associated with apneic events, which may manifest at different times during the night. In addition, CNNs have the ability to generate higher-level representations [41], which may allow them to learn complex patterns in prolonged segments of the ECG signal, such as heart rate fluctuations that are triggered in response to apneic events [2,7].
For these reasons, we here propose the development of a CNN-based algorithm for the classification of pediatric patients at risk of OSA based on their overnight ECG recordings that permits delineation of pediatric OSA severity according to the conventional AHI categories. This study presents two main novelties: 1) it is the first time that a DL approach is used along with raw ECG signals for the diagnosis of pediatric OSA; and 2) our approach is a new CNN-based regression model trained to aid in the diagnosis and severity estimation of pediatric OSA, in which we first estimate the number of apneic events per signal segment and then the AHI values per subject. For this purpose, we propose a data augmentation technique by dividing the whole night recordings into 10-minute segments with a 50% overlap to increase the volume of the dataset during model training. Thus, we hypothesize that a CNN-based architecture fed with the raw ECG signal can enhance and streamline the diagnosis of pediatric OSA by utilizing all relevant cardiac signal data to estimate disease severity. Accordingly, the main objective of our proposal is to evaluate a CNN-based model with the ECG signal to estimate the AHI, and accurately establish a diagnosis of OSA and its severity in pediatric patients.
2. Database and signals
The Childhood Adenotonsillectomy Trial (CHAT) database was used in this study (number of clinical trial: NCT00560859). Access to CHAT data is public upon request from the National Sleep Research Resource website (https://sleepdata.org/datasets/chat). A total of 1,610 overnight PSG recordings performed on children between the ages of 5 and 9.9 years old with suspected clinical symptoms of OSA were analyzed. Sleep studies were collected from 6 pediatric sleep centers in the United States of America (Children’s Hospital of Boston, Boston, MA; Cardinal Glennon Children’s Hospital, St. Louis, MO; Children’s Hospital of Philadelphia, Philadelphia, PA; Cincinnati Children’s Hospital, Cincinnati, OH; Montefiore Children’s Hospital, New York, NY; Rainbow Babies and Children’s Hospital, Cleveland, OH) [45]. All nocturnal PSGs were conducted following the 2007 American Academy of Sleep Medicine (AASM) recommendations [11]. Inclusion and exclusion criteria for the study can be found in previous literature [45,46]. CHAT is a randomized, controlled, single-blind, multicenter trial aimed at assessing the effectiveness of a surgical treatment for pediatric OSA. Details of the trial design, performance, and initial results obtained in the original study are explicitly documented in published reports [45,46]. Studies of nocturnal PSG were divided into three groups. Initially, the baseline cohort (451 subjects) that met the inclusion criteria, completed an initial PSG, and individuals were then randomly assigned to either early surgical adenotonsillectomy (eAT) or watchful waiting with supportive care (WWSC) [46]. In addition, the nonrandomized group (755 subjects) corresponded to individuals who did not meet the inclusion criteria conditions in the original study but had undergone an initial PSG. Finally, the follow-up group (404 subjects) comprised the subjects in the baseline group who underwent a follow-up PSG seven months after the initial PSG. Approximately half of the children in the follow-up group were treated with eAT and the other half randomly allocated to WWSC.
In this study, all recordings were randomly divided into training (60%), validation (20%) and test (20%) subsets. This partition was made so that no subject could be present in two subsets. In turn, following the recommendations from the AASM [11], all PSG recordings from CHAT included annotated data on the beginning and duration of apnea and hypopnea events. Accordingly, OSA diagnosis was established by calculating the AHI [11]. Then, pediatric OSA severity was divided into four categories based on AHI values: no OSA (AHI < 1e/h), mild OSA (1≤AHI<5 e/h), moderate OSA (5≤AHI<10 e/h), or severe OSA (AHI≥10 e/h) [11]. Table 1 presents the demographic and clinical characteristics of the subjects included in the training, validation, and test subsets.
Table 1:
Features | Training subset | Validation subset | Test subset |
---|---|---|---|
Baseline subjects (n) | 278 (61.64%) | 88 (19,51%) | 85 (18.85%) |
Non-randomized subjects (n) | 461 (61.06%) | 157 (20.79%) | 137 (18.15%) |
Follow-up subjects (n) | 249 (61.63%) | 78 (19.31%) | 77 (19.06%) |
Total subjects (n) | 988 (61.37%) | 323 (20.06%) | 299 (18.57%) |
Age (years) | 7.00 [2.00] | 7.00 [2.00] | 6.90 [2.00] |
Females (n) | 477 (48.28%) | 164 (50.77%) | 161 (53.85%) |
BMI (kg/m2) | 17.31 [5.92] | 17.12 [6.25] | 17.43 [6.04] |
AHI (e/h) | 2.64 [4.77] | 2.45 [4.77] | 2.32 [5.11] |
AHI < 1 (e/h) * | 212 (21.46%) | 67 (20.74%) | 65 (21.74%) |
1≤AHI<5 (e/h) * | 488 (49.39%) | 167 (51.70%) | 144 (48.16%%) |
5≥AHI<10 (e/h) * | 159(16.09%) | 44 (13.62%) | 49 (16.39%) |
AHI ≥ 10 (e/h) * | 129 (13.06%) | 45 (13.93%) | 41 (13.71%) |
Baseline ECG segments (n) | 33,402 | 5,350 | 5,059 |
Non-randomized ECG segments (n) | 53,137 | 9,057 | 7,972 |
Follow-up ECG segments | 29,587 | 4,519 | 4,442 |
Total ECG segments (n) | 116,126 | 18,927 | 17,473 |
Data are shown as number (percentage) or median [interquartile range], depending on the feature type. BMI: body mass index; AHI: apnea-hypopnea index; e/h: apneic events per hour.
AHI<1 (e/h): no OSA; 1≤AHI<5 (e/h): mild OSA; 5≤AHI<10(e/h): moderate OSA; AHI≤10(e/h): severe OSA.
3. Methods
This section presents the methodology to obtain the proposed solution for predicting pediatric OSA and its severity. The workflow followed is introduced (see Figure 1), based on six principal stages: 1) ECG signals were extracted from pediatric PSG recordings; 2) A minimal preprocessing was conducted to obtain 10-minute ECG segments; 3) CHAT database was divided into training (60% of the subjects), validation (20% of the subjects), and test (20% of the subjects) subsets; 4) A regression CNN-based architecture was designed. This CNN model was trained using the preprocessed 10-minute ECG segments belonging to the training subset as input data. was calculated for each subject by adding the estimated events in all segments of a pediatric overnight recording and dividing them by the total recording time in hours. The value of this rate calculated after training with CNN underestimates the value of the actual AHI (AHIactual) extracted from the CHAT database because the index in our initial proposal uses the total signal recording time rather than the total sleep time. To correct this tendency, we calculated the final estimated AHI (AHIest) by implementing a support vector regression (SVR) model; 5) The validation subset was used to adjust the algorithm hyperparameters and select the optimal model; and 6) The optimal model was applied to the test data to evaluate the diagnostic ability of the CNN-based algorithm.
3.1. ECG signal preprocessing
Following the AASM recommendations, ECG was obtained from the bipolar lead II [47]. ECG was minimally preprocessed before its use on the CNN. The raw signals were resampled at a sampling frequency of 100 Hz, which is consistent with the sampling frequencies used in prior studies [20,25,48–51]. Subsequently, the signals underwent a two-step filtering process. First, the continuous component was corrected by eliminating the signal mean within 30-second duration windows. Then, we applied a low pass filter with a pass band between 0 and 25 Hz to reduce noise, being a less restrictive approach than in previous studies which was intent on avoiding loss of important frequency components [25,50,52]. Specifically, we employed a linear-phase finite impulse response digital filter using a Hamming window to smooth out any discontinuities at the start and end of the signal [53]. After the complete records were filtered, ECG signals were divided into 10-minute segments, with a 50% overlap between segments to increase the volume of data for model training, i.e., as a data augmentation technique [54]. The final preprocessing step consisted of normalizing the amplitude of each ECG segment by extracting the mean value to the segment and dividing it by the standard deviation [55].
Finally, the determination of segment labels was performed by considering the annotations regarding the duration, beginning, and end of apneic events from CHAT annotations of the recordings [56,57]. The labels were calculated as the number of apneic events in each segment. In this way, both complete events and the proportion of incomplete events found at the edges of the segments were included as part of the target. For instance, when a single event was identified within a segment, it was labeled as 1. For segments containing a complete event along with, for example, 70% of another event (equivalent to 1.7 events), a label of 1.7 was assigned to that segment.
3.2. Design of the CNN architecture
Although CNN were initially created to deal with image data, they have been highly effective in processing time series in many fields [58], including biomedical signal processing [59]. In this work, we implemented a one-dimensional (1D) CNN-based approach since our input data responded to physiological single-channel ECG signals (see Figure 2). The network began with a data input layer containing a tensor of 60,000 samples (10-minute ECG segment) and a batch normalization (BN) layer of the input data [60]. Subsequently, the network processed the segments, which was composed of three convolutional blocks , and each of them consisting of convolutional sub-blocks with the layers described below.
First, we incorporated a 1D convolutional layer to extract feature maps based on relevant patterns [40]. The 1D convolution operation in this layer was calculated by following the next equation [41]:
(1) |
where is the m-th feature map and m=[1:convolutional filters ()], is the kernel size, which determines the filter size, values are the weights of the convolutional filter, are the input ECG segments, and is the bias term. The convolutional layer comprised a set of 1D filters with kernel size , a step length strides = 1, and zero padding (padding= ‘same’) to obtain the output with the same dimension as the input [41]. Secondly, we integrated a BN layer to normalize the feature maps obtained from the previous convolutional layer [60], followed by an activation layer. This layer used the rectified linear unit activation (ReLU) function [41]:
(2) |
where x is the value of each sample in the feature map. This activation is the standard choice in DL architectures and was used to introduce nonlinearity and solve the problem of vanishing gradient during training [41]. A max-pooling layer with a window size 2 (pool size = 2) and a two-step window offset (strides = 2) was then included, which reduces the dimensionality of the input data by half while keeping the most relevant features [41]. In the last step of each sub-block, we applied regularization through a dropout layer that randomly deactivated a fraction of the neurons with probability p during training. This layer was used to prevent the network from overfitting and improve the generalization of the model. Particularly, we used spatial dropout, an approach derived from the standard dropout for CNN [61]. The conventional dropout discards individual units with a certain probability in the training process. In contrast, in the case of spatial dropout, complete feature maps are discarded to improve generalization between the different feature maps [61,62].
Finally, after the configurations, we added a flattening layer to rearrange the spatial information into 1D vectors, and a densely/fully connected layer with linear activation to provide the final estimation [41]. Linear activation was selected for a regression problem to estimate apnea events by 10-minute segment as output from the proposed CNN.
The proposed CNN model was trained using the He-normal method to randomly initialize layer weights following a normal distribution [41]. We used the adaptative moment estimation (Adam) algorithm to optimize weight updates, with an initial learning rate of 0.001 [63]. Training data was presented to the network in batches of 15 samples for a maximum of 250 epochs, with randomized samples at the start of each epoch to improve convergence [41]. The loss function used to minimize the Adam algorithm in the validation subset was Huber loss with a delta parameter of , which is robust in the presence of outliers [64]. During training, we monitored the validation loss and decreased the learning rate by a factor of 2 if there was no improvement after 10 epochs. We also used an early stop method to prevent overfitting, which stopped training after 30 epochs without a decrease in validation loss. The weights were then readjusted to the epoch in which the minimal validation loss achieved [41].
3.3. AHI estimation
As previously explained in Section 1, AHI is the standard respiratory metric for diagnosing OSA and its severity. In this study, once we obtained the predictions of the apnea and hypopnea events for each 10-minute ECG segment in the CNN, we calculated for each subject the overnight rate of events per hour of recording . This was done by adding the estimated events in all segments of an overnight recording and dividing them by the total recording time in hours. It is essential to highlight that the value of this rate calculated after CNN training underestimates the value of the actual AHI , extracted from the CHAT database. This underestimation occurs because the rate of our initial proposal uses the total recording time of the signal rather than the total sleep time. Total recording time is typically greater than the total sleep time which is used to calculate the AHI, and which requires EEG and other signals for this purpose. To address this issue, we calculated the final estimated AHI , through the implementation of a support vector regression (SVR) model [65]. This model is more robust than ordinary least squares when dealing with outliers. It determines the best regression function using a kernel function and minimizes the -insensitive loss function. This function allows a certain degree of prediction error within a margin defined by the hyperparameter [66]. The loss function is defined as follows:
(3) |
where y is the actual value of the target variable, is the model prediction and is the acceptable margin of error [66]. It is necessary to set a minimum value for the margin , being , to delimit an area around where the difference between and does not contribute to the error. The formulation of the loss function ensures that only errors exceeding the threshold are penalized, while smaller errors do not contribute to the loss [67]. The purpose of the parameter is twofold: to control the model’s sensitivity to errors and to determine the size of the tolerance band around the regression line [66]. In this way, the loss function penalizes errors outside the tolerance band. In addition to the hyperparameter, other hyperparameters influence the SVR model optimization, such as the penalty parameter C and the kernel coefficient. The C hyperparameter balances the accuracy of model predictions and complexity, determining how prediction errors are penalized. The coefficient is responsible for controlling the form that the kernel function takes [65,66].
3.4. Model hyperparameter search and algorithm evaluation
To achieve optimal algorithm performance, we tuned a set of hyperparameters to minimize the generalization error of the CNN-based architecture. First, we heuristically selected the parameter of the convolutional layers for each block . We used an approach with ascending value in powers of two as the depth of the network increased: . This choice was made because the deeper the layers, the more complex the characteristics that are extracted [41,68]. Subsequently, for choosing the remaining CNN hyperparameters, we implemented a more exhaustive tuning strategy using a grid search method. With this method, different combinations of hyperparameters were tested in the search space. Specifically, we searched for the following values and dropout withp = {0.0,0.05,0.1}.
Considering the initial analyses performed, we selected this set of search hyperparameters so that the network had enough complexity to train and generalize properly. If we included smaller values in the implementation of CNN, it could not adjust sufficiently. If we increased the values, CNN took too long to train and adapted too much to the training data. In addition, the increased complexity of the network led to a computational cost not supported by the training of the CNN-based algorithm (NVIDIA GeForce RTX 2080 GPU; Keras 2.4.3 framework with TensorFlow 2.3 backend).
Then, for each of the CNN models trained with a specific combination of hyperparameters, we conducted a sequential search of three more hyperparameters in the SVR model , C = [0:100] with logarithmic scale value search, and , Ultimately, to determine the optimal hyperparameter configuration, we evaluated the performance of the CNN-based approach. This involved calculating the 4-class Cohen’s kappa coefficient for the subject-wise OSA severity classification (no OSA, mild, moderate, and severe OSA) in the validation subset. We selected the architecture with the highest k. Cohen’s kappa is a statistical metric used to measure agreement, particularly appropriate in classification tasks with unbalanced class distributions. Its usefulness comes from its capability to consider the likelihood of chance agreement, which reduces the bias towards the predominate class typically associated with accuracy metric [69].
3.5. Statistical analysis and figures of merit
To establish the agreement between the with our proposed algorithm and the extracted from the CHAT database, we performed a statistical analysis. First, we calculated the intraclass correlation coefficient (ICC), which consists of a statistical measure used to evaluate the reliability or consistency of measurements made by different observers [70]. In addition to this metric, we obtained Bland-Altman plots [71] to have a more representative view of the agreement between and .
To assess the effectiveness of the proposed algorithm in diagnosing pediatric OSA, we classified subjects into four severity groups based on their estimated AHI values: AHI < 1 e/h (no OSA),1 ≤ AHI ≤ 5 e/h (mild OSA), 5 ≤ AHI < 10 e/h (moderate OSA), AHI ≥ 10 e/h (severe OSA). After establishing these categories, we calculated the confusion matrix and 4-class accuracy (Acc4) to assess the overall performance of the model across classes. Additionally, k coefficient was calculated to assess the agreement between the actual classes, i.e., the true OSA severity level of each subject, and the classes derived from the AHI predicted by the model. The importance of k should be emphasized, as it has the advantage of correcting for agreement that occurs by chance [69]. Furthermore, we calculated different merit figures for each of the OSA severity thresholds (AHI = 1,5, and 10 e/h) from the values obtained in the confusion matrix. In this way, we determined accuracy (Acc which reflects the proportion of correct predictions overall. In addition, we obtained values for sensitivity (Se) and specificity (Sp), which measure the ability of the model to detect true positives and negatives, respectively. Additionally, we used the positive predictive value (PPV) and negative predictive value (NPV) to assess the proportion of true positive and negative predictions. Moreover, we calculated positive likelihood Ratio (LR+) and negative likelihood Ratio (LR−), which provide information on how the predictions affect the probability of a positive or negative case, respectively. These metrics, in addition to giving information on crucial aspects of model performance, allow us to fully and fairly evaluate and compare the efficacy and generalizability of our model relative to previous approaches in the literature [3,29,31,72]. Finally, we calculated the 2-class kappa to assess the agreement between the actual classes and the classes predicted by the model for each AHI threshold. With this metric, we can determine at which threshold the model performs higher in terms of classification [69].
4. Results
4.1. Optimal CNN-based approach configuration
As described in Section 3.3, to select the optimal configuration of the CNN-based model, we performed an exhaustive search of different hyperparameters in the CNN network . Subsequently, sequential adjustment of the hyperparameters of the SVR model was performed. The training of the algorithms with each of the combinations of search values was conducted with the training subset. The evaluation was made with the validation subset, calculating the k coefficient when classifying OSA severity. Figure 3 shows the result of the CNN optimization process throughout the model training. On the one hand, the graph on the left shows how the Huber loss value decreases as the training epochs progress. In addition, it can be seen that training stopped with early stopping at epoch 155, 30 epochs after the minimum validation loss value was reached. On the other hand, the graph on the right shows how the mean absolute error obtained from the and is also decreasing in each training epoch. Figure 4 shows the boxplots (median and interquartile range) of the k values obtained in the models for each search hyperparameter. The model that obtained the highest k value (k = 0.3964) presented the CNN hyperparameters , and (Figures 4(a), 4(b), and 4(c), respectively). To optimize the model with this configuration learning rate reached a final value of 3 × 10−5. Finally, after adjustment of the SVR regression, the optimal hyperparameters of , and were selected. Additionally, Table 1 of the supplementary material shows all the combinations of hyperparameters tested and the k results obtained in the validation subset.
4.2. AHI estimation
After training the model with the training subset and establishing the optimal configuration using the validation subset, the test subset was used to estimate the and evaluate the performance of the CNN-based algorithm. The Bland-Altman plot was calculated to establish the similarity between the estimated final AHI of each subject in the proposed model and the AHI extracted from the PSG in the CHAT database (see Figure 5). The solid black line of the Bland-Altman plot shown in Figure 5 indicates the average difference between and . A negative value indicates that our proposed model underestimates in the test subset slightly. However, despite this underestimation, the limits of the confidence interval ([7.40, −9.40]), together with the ICC = 0.79, show that there is high agreement between and .
4.3. Diagnostic ability of pediatric OSA
To evaluate the diagnostic ability of the model, we determined the severity of pediatric OSA after estimating the in the test subset. We calculated the confusion matrix, , and k (Figure 6). On the one hand, the cells of the confusion matrix show the proportion of subjects of the actual class assigned to each severity group. These values are associated with a cell color, indicating the performance of the model. On the other hand, the values of the number of subjects estimated in each of the four groups are shown. We observe an overestimation of the OSA severity in healthy subjects (no OSA) and an underestimation of the OSA severity in mild OSA subjects. According to the number of correctly classified subjects on the main diagonal, the performance of the model is better in subjects with mild OSA (65.28%), followed by those with moderate (55.10%), severe OSA (53.66%), and no OSA subjects (46.15%). Finally, it is important to note that all the subjects estimated with an are patients who have at least moderate OSA.
Table 2 presents the diagnostic performance of our pediatric OSA estimation algorithm in the three severity thresholds (AHI = 1 e/h, AHI = 5 e/h, AHI = 10 e/h). We can observe that the values of Sp are higher as the severity of OSA increases and, conversely, the values of Se decrease. The AHI threshold with the most balanced Se and Sp is 5 e/h. If we focus on the values of PPV and NPV, the highest values are obtained in 1 e/h and 10 e/h, respectively. Regarding LR values, it is essential to highlight the LR+ value obtained for the threshold 10 e/h (27.69), which indicates a relevant diagnostic utility for this threshold. Finally, as the severity of OSA increases, the Acc of the model is higher, with 91.97% Acc for 10 e/h. If we analyze the 2-class kappa at each threshold, we can observe that the highest value is obtained for 5 e/h (0.69). Thus, in strict terms of classification, we can state that the highest 2-class kappa results are obtained for 5 e/h. However, we consider it essential to emphasize that determining the most suitable threshold for model performance depends significantly on aligning it with the specific objectives of the clinicians using our approach. Consequently, we show various metrics to let the potential users of the model make the last decision on its application. This inherent versatility constitutes a notable advantage of our proposal.
Table 2:
Test subset | ||||||||
---|---|---|---|---|---|---|---|---|
AHI threshold | Se (%) | Sp (%) | PPV (%) | NPV (%) | LR + | LR − | Acc (%) | 2-class kappa |
1 e/h | 84.19 | 46.15 | 84.91 | 44.78 | 1.56 | 0.34 | 75.92 | 0.30 |
5 e/h | 76.67 | 91.39 | 79.31 | 90.09 | 8.90 | 0.26 | 86.96 | 0.69 |
10 e/h | 53.66 | 98.06 | 81.48 | 93.01 | 27.69 | 0.47 | 91.97 | 0.60 |
5. Discussion
This work evaluated a CNN-based model reliant on raw overnight ECG signals to estimate respiratory events per segment and AHI per subject for diagnosing OSA severity in pediatric patients. A CNN-based algorithm reached a moderate-to-high diagnostic performance for the thresholds of 1 e/h, 5 e/h, and 10 e/h. Results demonstrate that using single-channel ECG signal and a CNN shows an excellent potential for identifying pediatric OSA severity, thus enabling a simple, fast, objective, and accurate diagnosis. Furthermore, the diagnostic performance obtained with the proposed approach could be a valuable starting point to assess for cardiac co-morbidities, a significant risk factor in children, particularly those with severe OSA [2].
5.1. Comparison between adult and pediatric OSA
Pediatric OSA disease is a serious health problem due to its elevated prevalence and the associated morbidities it can cause [2]. As a result of the cost of PSG and the human and instrument resources required for its detection, much effort has been invested in simplifying the signals required to enable automatic OSA diagnosis in children [14]. Diagnosing pediatric OSA using a single-channel ECG signal is particularly interesting for its simplicity of recording and comfort for children. Analysis of this signal is crucial due to the relevant information it contains including potential underlying cardiovascular pathology [2,10].
In adults, some approaches have analyzed the ECG signals through DL methods. Specifically, CNN networks have been implemented for event detection and classification of OSA, achieving high performance in the OSA classification, with Acc values ranging from approximately 90% to 99% [20–23,25,26]. However, DL methods have yet to be exhaustively evaluated in conjunction with cardiac signals in children. Therefore, we developed a CNN-based model to analyze ECG signals aimed at identifying OSA in children and establishing its severity. Lower diagnostic performances emerged when compared to adults, with differences likely due to the lower AHI traditionally seen in clinical practice among children, the latter reflecting the reduced upper airway collapsible of the upper airway during childhood. As a corollary, the criteria required to establish the presence of OSA are markedly different, being substantially more restrictive for the pediatric population [1,4]. Additionally, AHI thresholds for determining the degree of severity of OSA are lower in children compared to adults (1 e/h, 5 e/h, and 10 e/h versus 5 e/h, 15 e/h, and 30 e/h, respectively) [73,74]. Another critical factor involves the differences in the kinetics of cardiac and other physiological responses during apnea episodes. Accordingly, the analysis of the ECG signals must be specifically restricted to data in children [2,10].
5.2. Configuration of CNN-based model
Regarding the architecture of the model, a regression CNN was implemented. Initially, the number of events per signal segment was estimated, and later, the AHI index per subject was calculated. This methodology had already been used in previous studies by our research group while analyzing other PSG signals such as SpO2 or AF to identify the severity of pediatric OSA, achieving high accuracy in the classification [56,57].
During the preliminary testing runs, we trained the model with 5-minute segments to avoid any impact on the delay between apnea events and the patterns of tachycardia/bradycardia onset [7]. Later, we used 10-minute segments to train the CNN. The algorithm showed a higher k coefficient with 10-minute segments (0.3762 versus 0.3244 in the validation set), indicating that larger segment sizes have a lesser effect on inter-event delay and bradycardia/tachycardia patterns. However, if the training included segments longer than 10 minutes, there was a high computational cost. Then, once the 10-minute segments were selected, we trained the network initially with 10-minute ECG segments and then applied a 50% overlap as a data augmentation strategy. This approach served to minimize overfitting during training and to facilitate the optimal performance of the model. The highest results in terms of k in the validation subset were obtained in the CNN trained with segment overlap (0.3964 versus 0.3762 ). Therefore, we selected 10-minute segments with 50% overlap to evaluate the model on the test subset.
Concerning the selection of CNN hyperparameters, Figure 4 shows that the highest value of k was obtained with , p = 0.1, and . Figure 4 (a) shows that increasing the value and, therefore, increasing the complexity of the network, would not lead the CNN to an improvement in the capture of characteristic patterns in the ECG. Figure 4 (b) indicates that further increasing the dropout would lead to lower median values of the boxplots and lower k values for most models, as shown by the trend of interquartile ranges. From Figure 4 (c), we can extract that those models with of similar values obtain lower median and interquartile ranges in the boxplots. This finding highlights the importance of optimizing this hyperparameter.
5.3. Proposal of a diagnostic protocol
Regarding the confusion matrix of Figure 6, it can be observed that 96.9% of healthy children (no OSA) were estimated with an e/h (no OSA or mild OSA). In addition, of the subjects with , 91.4% were predicted as no OSA or mild OSA. All subjects belonging to the no OSA or mild OSA class were estimated with an . Finally, of the children who were estimated with an (severe OSA), 100% were subjects with at least moderate OSA.
Based on these results, a diagnostic protocol could be created from the developed CNN algorithm to demonstrate its clinical utility, according to the following criteria:
If our estimate is (no OSA), the presence of OSA can be discarded since most of these subjects (95.5%) would have an e/h. In these children, a follow-up evaluation of the symptoms would need to be done periodically. If the symptoms were to persist over time (i.e., 2-3 months), a determination of the need for PSG would be then made by the sleep specialist.
If e/h is estimated by the algorithm, 77.2% of the subjects could have e/h (OSA presence), so the specialist would then recommend a PSG to establish a more precise diagnosis.
If the model estimates , the clinician would consider treatment because 96.7% of these subjects would suffer from at least mild OSA.
If the model estimates , treatment would be established since 100% of children would have an (moderate OSA or severe OSA).
As a result, 51.5% of the PSG would be avoided with this screening protocol. Moreover, only 3.1% of children with an e/h would be indicated for treatment and 3.3% with an e/h would not be referred to PSG/treatment in the first visit to the specialist. This solution would reduce waiting lists in the hospital service, achieving a more accurate pediatric OSA diagnosis and timely treatment of the more severe cases.
5.4. Comparison with previous studies
Table 3 summarizes the diagnostic performance of previous publications that used cardiac information, but not the raw ECG signal, and where conventional machine learning (ML) techniques were applied for feature extraction. In the study by Shouldice et al. [3], the binary classification of segments (normal/apnea) and subsequent binary classification of subjects was performed using a single AHI threshold of 1 e/h. RR interval features were used to achieve Se = 85.7%, Sp = 81.8%, PPV = 85.7%, NPV = 81.8%, and Acc = 84%. Comparing the results, our approach at that threshold achieved lower values of Sp, NPV, and Acc, but similar values of Se and NPV. It is important to highlight that in their study they used only 50 total recordings and 25 for the test subset, compared to 1,610 total recordings and 299 in the test subset used in this study. Thus, we can conclude that our results are more powerful and generalizable.
Table 3:
Publication | Recordings | Signal | Algorithm | Validation | AHI threshold (e/h) | Se (%) | Sp (%) | Acc (%) | PPV (%) | NPV (%) | LR+ | LR− |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Shouldice et al. [3] | 50 | RR Interval* | QD | Training/test Loo-cv | 1 | 85.70 | 81.80 | 84.00 | 85.70 | 81.80 | 4.71 | 0.17 |
|
||||||||||||
Gil et al. [28][29] | 21 | PPG* | QDA | - | >18 OSA <5 No OSA |
87.50 | 71.40 | 80.00 | - | - | 3.05 | 0.17 |
|
||||||||||||
Lázaro et al. [30] | 21 | PPG* | LDA | Training/test Loo-cv | >18 OSA <5 No OSA |
100.00 | 71.40 | 86.70 | - | - | 3.5 | 0.00 |
|
||||||||||||
Martin-Montero et al. [33] | 1738 | HRV* | LDA | Training/test | 1 5 10 |
85.47 64.44 53.66 |
35.38 93.78 97.67 |
74.58 84.95 91.64 |
82.64 81.69 78.57 |
40.35 85.96 92.99 |
1.32 10.36 23.07 |
0.41 0.38 0.47 |
|
||||||||||||
Martin-Montero et al. [32] | 1738 | HRV* | MLP | Training/test | 1 5 10 |
76.30 62.50 66.70 |
38.30 84.20 91.60 |
63.40 81.00 89.30 |
70.70 40.70 44.20 |
45.50 92.80 96.50 |
1.20 4.00 7.90 |
0.60 0.40 0.40 |
|
||||||||||||
Martin-Montero et al. [31] | 1610 | HRV* | LSBoost | Training/validation/test | 1 5 10 |
90.76 66.67 40.00 |
23.40 61.17 92.03 |
80.07 63.18 84.12 |
86.26 49.66 47.37 |
32.35 76.16 89.53 |
1.18 1.72 5.02 |
0.39 0.54 0.65 |
|
||||||||||||
This work | 1610 | ECG | CNN | Training/validation/test |
1
5 10 |
84.19
76.67 53.66 |
46.15
91.39 98.06 |
75.92
86.96 91.97 |
84.91
79.31 81.48 |
44.78
90.09 93.01 |
1.56
8.09 27.69 |
0.34
0.26 0.47 |
Classification was performed using features extracted from these signals.
PSG (polysomnography); RR interval (interval between two R peaks); PPG (photoplethysmography); HRV( heart rate variability); PTTV (pulse transit time variability); PRV (pulse rate variability); ECG (electrocardiogram); QD (quadratic discriminant); LDA (linear discriminant analysis); MLP (multilayer perceptron); FCBF (fast correlation based filter); LSBoost (least-squares boosting); CNN (convolutional neural network); Loo-cv (leave-one-out cross-validation); e/h (events/hour); Se (sensitivity); Sp (specificity); PPV (positive predictive value), NPV (negative predictive value), LR+ (positive likelihood ratio); LR− (negative likelihood ratio); AHI (apnea-hypopnea index); OSA (obstructive sleep apnea).
Other investigators used extracted features from the PPG signal for binary classification of pediatric OSA (AHI < 5 No OSA and AHI > 18 OSA) [28–30]. Compared to these studies, although Se was lower, our approach achieved higher Sp, LR+ and Acc for the most restrictive threshold of 10 e/h. Thus, we can conclude that the current methodology is preferable to correctly identifying healthy children without severe OSA. Moreover, these studies had smaller samples (50 and 21 recordings compared to 1,610 recordings), limiting the generalizability of their results.
Martín-Montero et al. [32,33] used HRV spectral characteristics in two studies to automatically classify pediatric OSA with the common AHI thresholds. In [33], the authors achieved slightly higher Se for 1 e/h (85.47% vs. 84.19%) and Sp for 5 e/h (93.78% vs. 91.39%) than our approach. However, our algorithm achieved a better balance between Se-Sp for both thresholds, resulting in higher Acc (75.95% vs. 74.58% for 1 e/h and 86.96% vs. 84.95% for 5 e/h). We also obtained better predictive accuracy for the threshold of 10 e/h for all measures, thus highlighting the ability of our algorithm to identify severe OSA cases more accurately. Compared to Martín-Montero et al. [32] who obtained slightly higher NPV values for all thresholds, our approach achieved higher Sp values, providing a better balance between NPV and Sp results. Notably, our LR+ value of 27.69 at the threshold of 10 e/h (versus 7.90) demonstrates the clinical utility of our algorithm in identifying children with severe OSA. Additionally, our algorithm showed higher classification measures for Se, Sp, Acc, PPV, LR+, and LR− at the thresholds of 1 e/h and 5 e/h. This indicates the ability of this approach to distinguish between healthy children and patients with OSA and differentiate between mild and moderate pediatric OSA.
Finally, in another study, authors automatically classified pediatric OSA using features derived from HRV signal segments and a conventional ML technique [31]. This study was the most suitable to compare our results since the same study database (CHAT), number of subjects (1610), and AHI thresholds (1, 5, and10 e/h) were used. In addition, they used the same validation strategy as our proposal with three independent data subsets (training, validation, and test). Our algorithm performed better in terms of Se, Sp, Acc, PPV, NPV, LR+, and LR− at the thresholds of 5 e/h and 10 e/h, indicating its clinical applicability in identifying the more severe cases of pediatric OSA. These findings are particularly encouraging as children with moderate to severe OSA are at increased risk of developing cardiovascular and neurocognitive morbidities, and therefore will benefit most from early diagnosis and timely treatment [10,15]. While Se value was lower at the 1 e/h threshold in this work, our Sp value was higher. In this sense, our model achieved a more balanced Se-Sp and PPV-NPV relationships, respectively. These findings indicate that the model proposed here can better identify healthy children, thereby enabling improved utilization of limited PSG resources to those patients more likely to benefit from PSG. Overall, our proposal reached higher values of k (0.372 versus 0.166) and (57.86% versus 41.89%). On the one hand, the difference in the k value indicates that the model proposed in this study shows a higher level of agreement between the OSA severity degrees derived from the estimated AHI and the actual OSA severity degrees. Furthermore, it can also be concluded from these results that our model can better capture patterns and relationships in the data, resulting in predictions that are more consistent with the actual OSA severity degrees. On the other hand, the results conclude that the overall performance of the proposed model on all classes is approximately 16% higher. This makes our CNN-based algorithm a more reliable diagnostic model for clinical practice.
Despite the most interesting discussion relies on comparing our results with those studies using cardiac information, different previous studies also used the CHAT database along with DL alternatives for diagnosing pediatric OSA [56,57]. They focused on the analysis of overnight SpO2 and/or AF signals and reached promising results. However, they only used respiratory information, thus obviating possible OSA pathophysiological effects on the cardiac system, including those associated with cardiac morbidities. Moreover, Jimenez-Garcia et al. [56] showed higher technical complexity as they used the two signals for the automated diagnosis instead of a single one. On the other hand, other approaches worked with DL and cardiac information in contexts other than pediatric OSA [75], as for example the investigation of the impact of DL architectures (FC, VGGs, ResNet50, U-Net, and others) on accelerated cardiac T1 mapping, reaching U-Net the highest performance [76]. Despite all these interesting approaches, we consider that studying the ECG signal in pediatric OSA brings added value. First, there is demonstrated evidence of the lack of ECG signal analysis in the pediatric OSA context despite being an extensively studied signal in the clinical field [15]. Furthermore, the ECG contains a plethora of relevant information that can be used for diagnostic purposes, including cardiorespiratory coupling, heart rate changes, and heart rate variability associated with respiratory events [2,7,16]. Importantly, by analyzing overnight ECG the possibility of assessing cardiovascular risk in children is enabled, which is particularly relevant for severe pediatric OSA patients in which cardiovascular morbidity is a significant risk [7,10]. Finally, because the ECG is one of the most analyzed signals in clinical practice worldwide, an ECG-based application to aid in diagnosing OSA could be easily interpretable, implementable, and accessible [17–19].
Therefore, we can conclude that the application of DL-based algorithms using the raw ECG signal offers three notable advantages: 1) All the information contained in the ECG is potentially used; 2) it eliminates the need to develop specifically designed feature extraction methods; 3) it demonstrates higher diagnostic performance of pediatric OSA.
5.5. Limitations
This work has obviously several limitations that must be considered. Although we used a large sample size (1,610 PSG recordings), it would always be better to have even more recordings to perform the analysis to reach more generalized conclusions. Another limitation to consider is that all the subjects included in the CHAT database were initially suspected of having OSA, which could imply that the clinical characteristics of these subjects are more specific of OSA presence. Furthermore, all subjects in our dataset are between 5 and 9.9 years old, which prevents us from drawing conclusions beyond these age limits. Considering the characteristics of the subjects and the study signals, it is challenging to compare our results with other studies. This is because the raw ECG signal was not analyzed before in the previous existing publications of pediatric OSA [28–30,32,33]. Also, of the studies in the literature, there is only one in which the same database was used (CHAT) [31], which provides the most accurate basis for comparison. If we analyze the DL method used here, a CNN-based model network was implemented based on its suitability in the field of time series analysis and its excellent performance in previous studies in adults to diagnose OSA [20–25,35,44,49]. However, RNN/LSTM architectures could be tested to furtherly evaluate the usefulness of our proposal. Finally, there is a real limitation of DL methods in explaining the features extracted by the models that should be considered for future work.
6. Conclusions
To the best of our knowledge, this study reflects the first evaluation of a CNN-based algorithm using single-channel ECG signals to identify pediatric OSA and estimate its severity. Our findings demonstrate that this approach outperforms previous studies that used cardiac information (PPG and HRV signals) with conventional ML methods. This suggests the potential value of combining ECG signals and DL methods in pediatric OSA. Based on the diagnostic performance achieved by our approach, we suggest a diagnostic protocol that could serve as a valuable clinical tool. It would effectively reduce the need for unnecessary PSG by half, thus alleviating waiting lists in clinical facilities. In this sense, our solution could facilitate the objectivity and accuracy of pediatric OSA diagnosis, while ensuring that urgent and appropriate treatment is administered to the most severe cases. Overall, using ECG recordings and our CNN-based proposal provides a highly promising and viable alternative to PSG for diagnosing pediatric OSA.
For future works, it would always improve our study to have more recordings to perform the analysis. Moreover, it would be desirable to use other child AOS databases to assess the generalizability of the model. Additionally, a study based on screening children at risk for OSA would allow a more general population group to be analyzed. Furthermore, it would be appropriate to evaluate and compare the performance of different architectures, as has already been done in other approaches within the cardiac setting [76]. Specifically, it would be interesting to use different alternative models, such as RNN, hybrid models, or transformers, to evaluate their performance and test their diagnostic ability. Furthermore, applying explainable artificial intelligence methods may be useful for future work [77]. These approaches could provide greater clarity in interpreting the results obtained in the CNN-based model, thus enabling a better understanding of the extracted features and the relationship between apneic events and the behavior of the ECG. Finally, future development of user-friendly tools could be used to furtherly validate our proposal in a clinical environment.
Supplementary Material
Highlights.
A CNN aimed at simplifying the diagnosis of pediatric OSA using raw ECG signals.
First study to use this signal coupled with DL methods to diagnose pediatric OSA.
The model outperformed previous approaches using conventional ML methods.
A diagnostic protocol is suggested as a clinical tool, halving unnecessary PSGs.
Accurate detection of moderate and severe OSA cases, aiding in early OSA diagnosis.
ACKNOWLEDGEMENTS
This research was supported by ‘Ministerio de Ciencia, Innovación y Universidades - Agencia Estatal de Investigación/10.13039/501100011033/’, ‘ERDF A way of making Europe’, and ‘European Union NextGenerationEU/PRTR’ under projects PDC2021-120775-I00 and PID2020-115468RB-I00, and by ‘CIBER -Consorcio Centro de Investigación Biomédica en Red-(CB19/01/00012)’ through ‘Instituto de Salud Carlos III’, as well as under the project TinyHeart from 2022 Early Stage call. The Childhood Adenotonsillectomy Trial (CHAT) was supported by the National Institutes of Health (HL083075, HL083129, UL1-RR-024134, UL1 RR024989). The National Sleep Research Resource was supported by the National Heart, Lung, and Blood Institute (R24 HL114473, 75N92019R002). C. García-Vicente was in receipt of a ‘Ayudas para contratos predoctorales para la Formación de Doctores’ grant from the ‘Minisiterio de Ciencia, Innovación y Universidades (PRE2021-100792)’. J. Jiménez-García was in receipt of a PIF-UVa grant of the University of Valladolid. A. Martín-Montero was in receipt of a ‘Ayudas para contratos predoctorales para la Formación de Doctores’ grant from the ‘Minisiterio de Ciencia, Innovación y Universidades (PRE2018-085219)’. GC. Gutiérrez-Tobal is supported by a post-doctoral grant from the University of Valladolid.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
CONFLICT OF INTEREST STATEMENT
There are no conflicts of interest that could inappropriately influence this research work.
ETHICAL APPROVAL
This work has been carried out according to the Declaration of Helsinki. As part of the research protocol, written consent was obtained from all parents. Additionally, assent was also obtained from children over 7 years old. The clinical trial identifier of the original CHAT database is NCT00560859.
Declaration of interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
REFERENCES
- [1].Tan HL, Gozal D, Kheirandish-Gozal L, Obstructive sleep apnea in children: A critical update, Nat. Sci. Sleep 5 (2013) 109–123. 10.2147/NSS.S51907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Marcus CL, Brooks LJ, Ward SD, Draper KA, Gozal D, Halbower AC, Jones J, Lehmann C, Schechter MS, Sheldon S, Shiffman RN, Spruyt K, Diagnosis and Management of Childhood Obstructive Sleep Apnea Syndrome, Pediatrics. 130 (2012) e714–e755. 10.1542/peds.2012-1672. [DOI] [PubMed] [Google Scholar]
- [3].Shouldice RB, O’Brien LM, O’Brien C, de Chazal P, Gozal D, Heneghan C, Detection of Obstructive Sleep Apnea in Pediatric Subjects using Surface Lead Electrocardiogram Features, Sleep. 27 (2004) 784–792. 10.1093/sleep/27.4.784. [DOI] [PubMed] [Google Scholar]
- [4].Aljadeff G, Gozal D, Schechtman VL, Burrell B, Harper RM, Davidson Ward SL, Heart Rate Variability in Children With Obstructive Sleep Apnea, Sleep. 20 (1997) 151–157. 10.1093/sleep/20.2.151. [DOI] [PubMed] [Google Scholar]
- [5].O’Driscoll DM, Foster AM, Ng M. l., Yang JSC, Bashir F, Wong S, Nixon GM, Davey MJ, Anderson V, Walker AM, Trinder J, Horne RSC, Central apnoeas have significant effects on blood pressure and heart rate in children, J. Sleep Res 18 (2009) 415–421. 10.1111/j.1365-2869.2009.00766.x. [DOI] [PubMed] [Google Scholar]
- [6].Vitelli O, Del Pozzo M, Baccari G, Rabasco J, Pietropaoli N, Barreto M, Villa MP, Autonomic imbalance during apneic episodes in pediatric obstructive sleep apnea, Clin. Neurophysiol 127 (2016) 551–555. 10.1016/j.clinph.2015.05.025. [DOI] [PubMed] [Google Scholar]
- [7].Guilleminault C, Winkle R, Connolly S, Melvin K, Tilkian A, Cyclical variation of the heart rate in sleep apnoea syndrome. Mechanisms, and Usefulness of 24 h Electrocardiography as a Screening Technique, Lancet. 323 (1984) 126–131. 10.1016/S0140-6736(84)90062-X. [DOI] [PubMed] [Google Scholar]
- [8].Eckert DJ, Malhotra A, Jordan AS, Mechanisms of Apnea, Prog. Cardiovasc. Dis 51 (2009) 313–323. 10.1016/j.pcad.2008.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Tauman R, Gozal D, Obstructive sleep apnea syndrome in children, Expert Rev. Respir. Med 5 (2011) 425–440. 10.1586/ers.11.7. [DOI] [PubMed] [Google Scholar]
- [10].Bhattacharjee R, Kheirandish-Gozal L, Pillar G, Gozal D, Cardiovascular Complications of Obstructive Sleep Apnea Syndrome: Evidence from Children, Prog. Cardiovasc. Dis 51 (2009) 416–433. 10.1016/j.pcad.2008.03.002. [DOI] [PubMed] [Google Scholar]
- [11].Iber C, Ancoli-Israel S, Chesson AL, Quan SF, The AASM Manual for the Scoring of Sleep and Associated Events: Rules Terminology and Technical Specification, (2007). [Google Scholar]
- [12].Sogebi OA, Ogunwale A, Risk factors of obstructive sleep apnea among nigerian outpatients, Braz. J. Otorhinolaryngol 78 (2012) 27–33. 10.5935/18088694.20120029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Chiu H-Y, Chen P-Y, Chuang L-P, Chen N-H, Tu Y-K, Hsieh Y-J, Wang Y-C, Guilleminault C, Diagnostic accuracy of the Berlin questionnaire, STOP-BANG, STOP, and Epworth sleepiness scale in detecting obstructive sleep apnea: A bivariate meta-analysis, Sleep Med. Rev 36 (2017) 57–70. 10.1016/j.smrv.2016.10.004. [DOI] [PubMed] [Google Scholar]
- [14].Bertoni D, Isaiah A, Towards Patient-centered Diagnosis of Pediatric Obstructive Sleep Apnea—A Review of Biomedical Engineering Strategies, Expert Rev. Med. Devices 16 (2019) 617–629. 10.1080/17434440.2019.1626233. [DOI] [PubMed] [Google Scholar]
- [15].Gutiérrez-Tobal GC, Álvarez D, Kheirandish-Gozal L, del Campo F, Gozal D, Hornero R, Gutiérrez-Tobal GC, Álvarez D, Kheirandish-Gozal L, del Campo F, Gozal D, Hornero R, Reliability of machine learning to diagnose pediatric obstructive sleep apnea: Systematic review and meta-analysis, Pediatr. Pulmonol 57 (2022) 1931–1943. 10.1002/ppul.25423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Penzel T, Kantelhardt JW, Bartsch RP, Riedl M, Kraemer JF, Wessel N, Garcia C, Glos M, Fietze I, Schöbel C, Modulations of Heart Rate, ECG, and Cardio-Respiratory Coupling Observed in Polysomnography, Front. Physiol 7 (2016). 10.3389/fphys.2016.00460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Longo DL, Fauci AS, Kasper DL, Hauser SL, Jamenson J, Harrinson’s manual of medicine, 2013. [Google Scholar]
- [18].Goldberger A, Clinical Electrocardiography: A Simplified Approach: Seventh Edition, Elsevier, 2006. 10.1016/B0-323-04038-1/X5001-X. [DOI] [Google Scholar]
- [19].Lyon A, Mincholé A, Martínez JP, Laguna P, Rodriguez B, Computational techniques for ECG analysis and interpretation in light of their contribution to medical advances, J. R. Soc. Interface 15 (2018). 10.1098/rsif.2017.0821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Dey D, Chaudhuri S, Munshi S, Obstructive sleep apnoea detection using convolutional neural network based deep learning framework, Biomed. Eng. Lett 8 (2018) 95–100. 10.1007/s13534-017-0055-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Urtnasan E, Park J-U, Lee K-J, Multiclass classification of obstructive sleep apnea/hypopnea based on a convolutional neural network from a single-lead electrocardiogram, Physiol. Meas 39 (2018) 065003. 10.1088/13616579/aac7b7. [DOI] [PubMed] [Google Scholar]
- [22].Urtnasan E, Park J-U, Joo EY, Lee KJ, Identification of Sleep Apnea Severity Based on Deep Learning from a Short-term Normal ECG, J. Korean Med. Sci 35 (2020). 10.3346/jkms.2020.35.e399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Erdenebayar U, Kim YJ, Park J-U, Joo EY, Lee K-J, Deep learning approaches for automatic detection of sleep apnea events from an electrocardiogram, Comput. Methods Programs Biomed 180 (2019) 105001. 10.1016/j.cmpb.2019.105001. [DOI] [PubMed] [Google Scholar]
- [24].Wang T, Lu C, Shen G, Detection of Sleep Apnea from Single-Lead ECG Signal Using a Time Window Artificial Neural Network, Biomed Res. Int 2019 (2019) 1–9. 10.1155/2019/9768072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Chang H-Y, Yeh C-Y, Lee C-T, Lin C-C, A Sleep Apnea Detection System Based on a One-Dimensional Deep Convolution Neural Network Model Using Single-Lead Electrocardiogram, Sensors. 20 (2020) 4157. 10.3390/s20154157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Mostafa Mendonça, Ravelo-García Morgado-Dias, A Systematic Review of Detecting Sleep Apnea Using Deep Learning, Sensors. 19 (2019) 4934. 10.3390/s19224934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Ye P, Qin H, Zhan X, Wang Z, Liu C, Song B, Kong Y, Jia X, Qi Y, Ji J, Chang L, Ni X, Tai J, Diagnosis of obstructive sleep apnea in children based on the XGBoost algorithm using nocturnal heart rate and blood oxygen feature, Am. J. Otolaryngol. - Head Neck Med. Surg 44 (2023) 103714. 10.1016/j.amjoto.2022.103714. [DOI] [PubMed] [Google Scholar]
- [28].Gil E, Mendez M, Vergara JM, Cerutti S, Bianchi AM, Laguna P, Discrimination of Sleep-Apnea-Related Decreases in the Amplitude Fluctuations of PPG Signal in Children by HRV Analysis, IEEE Trans. Biomed. Eng 56 (2009) 1005–1014. 10.1109/TBME.2008.2009340. [DOI] [PubMed] [Google Scholar]
- [29].Gil E, Bailon R, Vergara JM, Laguna P, PTT Variability for Discrimination of Sleep Apnea Related Decreases in the Amplitude Fluctuations of PPG Signal in Children, IEEE Trans. Biomed. Eng 57 (2010) 1079–1088. 10.1109/TBME.2009.2037734. [DOI] [PubMed] [Google Scholar]
- [30].Lazaro J, Gil E, Vergara JM, Laguna P, Pulse Rate Variability Analysis for Discrimination of Sleep-Apnea-Related Decreases in the Amplitude Fluctuations of Pulse Photoplethysmographic Signal in Children, IEEE J. Biomed. Heal. Informatics. 18 (2014) 240–246. 10.1109/JBHI.2013.2267096. [DOI] [PubMed] [Google Scholar]
- [31].Martín-Montero A, Armañac-Julián P, Gil E, Kheirandish-Gozal L, Álvarez D, Lázaro J, Bailón R, Gozal D, Laguna P, Hornero R, Gutiérrez-Tobal GC, Pediatric sleep apnea: Characterization of apneic events and sleep stages using heart rate variability, Comput. Biol. Med 154 (2023) 106549. 10.1016/j.compbiomed.2023.106549. [DOI] [PubMed] [Google Scholar]
- [32].Martín-Montero A, Gutiérrez-Tobal GC, Gozal D, Barroso-García V, Álvarez D, del Campo F, Kheirandish-Gozal L, Hornero R, Bispectral Analysis of Heart Rate Variability to Characterize and Help Diagnose Pediatric Sleep Apnea, Entropy. 23 (2021) 1016. 10.3390/e23081016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Martín-Montero A, Gutiérrez-Tobal GC, Kheirandish-Gozal L, Jiménez-García J, Álvarez D, del Campo F, Gozal D, Hornero R, Heart rate variability spectrum characteristics in children with sleep apnea, Pediatr. Res 89 (2021) 1771–1779. 10.1038/s41390-020-01138-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].Moridian P, Shoeibi A, Khodatars M, Jafari M, Pachori RB, Khadem A, Alizadehsani R, Ling SH, Automatic diagnosis of sleep apnea from biomedical signals using artificial intelligence techniques: Methods, challenges, and future works, WIREs Data Min. Knowl. Discov 12 (2022). 10.1002/widm.1478. [DOI] [Google Scholar]
- [35].Faust O, Hagiwara Y, Hong TJ, Lih OS, Acharya UR, Deep learning for healthcare applications based on physiological signals: A review, Comput. Methods Programs Biomed 161 (2018) 1–13. 10.1016/J.CMPB.2018.04.005. [DOI] [PubMed] [Google Scholar]
- [36].Alkan A, Abdullah MÜ, Abdullah HO, Assaf M, Zhou H, A smart agricultural application: automated detection of diseases in vine leaves usinghybrid deep learning, Turkish J. Agric. For 45 (2021) 717–729. 10.3906/tar-2007-105. [DOI] [Google Scholar]
- [37].Sunnetci KM, Kaba E, Celiker FB, Alkan A, Deep Network-Based Comprehensive Parotid Gland Tumor Detection, Acad. Radiol (2023). 10.1016/j.acra.2023.04.028. [DOI] [PubMed] [Google Scholar]
- [38].Muhammed Sunnetci K, Ulukaya S, Alkan A, Periodontal bone loss detection based on hybrid deep learning and machine learning models with a user-friendly application, Biomed. Signal Process. Control 77 (2022) 103844. 10.1016/J.BSPC.2022.103844. [DOI] [Google Scholar]
- [39].Sunnetci KM, Alkan A, Biphasic majority voting-based comparative COVID-19 diagnosis using chest X-ray images, Expert Syst. Appl 216 (2023) 119430. 10.1016/J.ESWA.2022.119430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].LeCun Y, Bengio Y, Hinton G, Deep learning, Nature. 521 (2015) 436–444. 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
- [41].Goodfellow I, Bengio Y, Courville A, Deep learning MIT Press, 2016. [Google Scholar]
- [42].Ismail Fawaz H, Forestier G, Weber J, Idoumghar L, Muller P-A, Deep learning for time series classification: a review, Data Min. Knowl. Discov 33 (2019) 917–963. 10.1007/s10618-019-00619-1. [DOI] [Google Scholar]
- [43].Ebrahimi Z, Loni M, Daneshtalab M, Gharehbaghi A, A review on deep learning methods for ECG arrhythmia classification, Expert Syst. with Appl. X 7 (2020) 100033. 10.1016/j.eswax.2020.100033. [DOI] [Google Scholar]
- [44].Murat F, Yildirim O, Talo M, Baloglu UB, Demir Y, Acharya UR, Application of deep learning techniques for heartbeats detection using ECG signals-analysis and review, Comput. Biol. Med 120 (2020) 103726. 10.1016/j.compbiomed.2020.103726. [DOI] [PubMed] [Google Scholar]
- [45].Redline S, Amin R, Beebe D, Chervin RD, Garetz SL, Giordani B, Marcus CL, Moore RHRH, Rosen CL, Arens R, Gozal D, Katz ES, Mitchell RB, Muzumdar H, Taylor HGG, Thomas N, Ellenberg S, The Childhood Adenotonsillectomy Trial (CHAT): Rationale, Design, and Challenges of a Randomized Controlled Trial Evaluating a Standard Surgical Procedure in a Pediatric Population, Sleep. 34 (2011) 1509–1517. 10.5665/sleep.1388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46].Marcus CL, Moore RH, Rosen CL, Giordani B, Garetz SL, Taylor HG, Mitchell RB, Amin R, Katz ES, Arens R, Paruthi S, Muzumdar H, Gozal D, Thomas NH, Ware J, Beebe D, Snyder K, Elden L, Sprecher RC, Willging P, Jones D, Bent JP, Hoban T, Chervin RD, Ellenberg SS, Redline S, A Randomized Trial of Adenotonsillectomy for Childhood Sleep Apnea, N. Engl. J. Med 368 (2013) 2366–2376. 10.1056/NEJMoa1215881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [47].Berry RB, Budhiraja R, Gottlieb DJ, Gozal D, Iber C, Kapur VK, Marcus CL, Mehra R, Parthasarathy S, Quan SF, Redline S, Strohl KP, Ward SLD, Tangredi MM, Rules for scoring respiratory events in sleep: Update of the 2007 AASM manual for the scoring of sleep and associated events, J. Clin. Sleep Med 8 (2012) 597–619. 10.5664/JCSM.2172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Wang L, Lin Y, Wang J, A RR interval based automated apnea detection approach using residual network, Comput. Methods Programs Biomed 176 (2019) 93–104. 10.1016/j.cmpb.2019.05.002. [DOI] [PubMed] [Google Scholar]
- [49].Mashrur FR, Islam MS, Saha DK, Islam SMR, Moni MA, SCNN: Scalogram-based convolutional neural network to detect obstructive sleep apnea using single-lead electrocardiogram signals, Comput. Biol. Med 134 (2021) 104532. 10.1016/j.compbiomed.2021.104532. [DOI] [PubMed] [Google Scholar]
- [50].Zhang J, Tang Z, Gao J, Lin L, Liu Z, Wu H, Liu F, Yao R, Automatic Detection of Obstructive Sleep Apnea Events Using a Deep CNN-LSTM Model, Comput. Intell. Neurosci 2021 (2021) 1–10. 10.1155/2021/5594733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [51].Sheta A, Turabieh H, Thaher T, Too J, Mafarja M, Hossain MS, Surani SR, Diagnosis of Obstructive Sleep Apnea from ECG Signals Using Machine Learning and Deep Learning Classifiers, Appl. Sci 11 (2021) 6622. 10.3390/app11146622. [DOI] [Google Scholar]
- [52].Urtnasan E, Park J-U, Lee K-J, Automatic detection of sleep-disordered breathing events using recurrent neural networks from an electrocardiogram signal, Neural Comput. Appl 32 (2020) 4733–4742. 10.1007/s00521-018-3833-2. [DOI] [Google Scholar]
- [53].HARRIS FJ, Multirate FIR Filters for Interpolating and Desampling, in: Handb. Digit. Signal Process, Elsevier, 1987: pp. 173–287. 10.1016/B978-0-08050780-4.50008-4. [DOI] [Google Scholar]
- [54].van Dyk DA, Meng X-L, The Art of Data Augmentation, J. Comput. Graph. Stat 10 (2001) 1–50. 10.1198/10618600152418584. [DOI] [Google Scholar]
- [55].Patro SGK, Sahu KK, Normalization: A Preprocessing Stage, IARJSET. 2 (2015) 20–22. 10.17148/IARJSET.2015.2305. [DOI] [Google Scholar]
- [56].Jiménez-García J, García M, Gutiérrez-Tobal GC, Kheirandish-Gozal L, Vaquerizo-Villar F, Álvarez D, del Campo F, Gozal D, Hornero R, A 2D convolutional neural network to detect sleep apnea in children using airflow and oximetry, Comput. Biol. Med 147 (2022). 10.1016/j.compbiomed.2022.105784. [DOI] [PubMed] [Google Scholar]
- [57].Vaquerizo-Villar F, Alvarez D, Kheirandish-Gozal L, Gutierrez-Tobal GC, Barroso-Garcia V, Santamaria-Vazquez E, Del Campo F, Gozal D, Hornero R, A Convolutional Neural Network Architecture to Enhance Oximetry Ability to Diagnose Pediatric Obstructive Sleep Apnea, IEEE J. Biomed. Heal. Informatics 25 (2021) 2906–2916. 10.1109/JBHI.2020.3048901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [58].Kiranyaz S, Avci O, Abdeljaber O, Ince T, Gabbouj M, Inman DJ, 1D convolutional neural networks and applications: A survey, Mech. Syst. Signal Process 151 (2021) 107398. 10.1016/j.ymssp.2020.107398. [DOI] [Google Scholar]
- [59].Faust O, Hagiwara Y, Hong TJ, Lih OS, Acharya UR, Deep learning for healthcare applications based on physiological signals: A review, Comput. Methods Programs Biomed 161 (2018) 1–13. 10.1016/j.cmpb.2018.04.005. [DOI] [PubMed] [Google Scholar]
- [60].Bjorck N, Gomes CP, Selman B, Weinberger KQ, Understanding Batch Normalization, Adv. Neural Inf. Process. Syst 31 (2018). [Google Scholar]
- [61].Tompson J, Goroshin R, Jain A, LeCun Y, Bregler C, Efficient object localization using Convolutional Networks, in: 2015 IEEE Conf. Comput. Vis. Pattern Recognit., IEEE, 2015: pp. 648–656. 10.1109/CVPR.2015.7298664. [DOI] [Google Scholar]
- [62].Choudakkanavar G, Mangai JA, A Hybrid 1D-CNN-Bi-LSTM based Model with Spatial Dropout for Multiple Fault Diagnosis of Roller Bearing, Int. J. Adv. Comput. Sci. Appl 13 (2022) 637–644. 10.14569/IJACSA.2022.0130873. [DOI] [Google Scholar]
- [63].Kingma DP, Ba JL, Adam: A method for stochastic optimization, in: 3rd Int. Conf. Learn. Represent. ICLR 2015 - Conf. Track Proc., International Conference on Learning Representations, ICLR, San Diego, 2015: pp. 1–15. https://arxiv.org/abs/1412.6980v9. [Google Scholar]
- [64].Huber PJ, Robust Estimation of a Location Parameter, Ann. Math. Stat 35 (1964) 73–101. 10.1214/aoms/1177703732. [DOI] [Google Scholar]
- [65].Smola AJ, Schölkopf B, A tutorial on support vector regression, Stat. Comput 14 (2004) 199–222. 10.1023/B:STCO.0000035301.49549.88. [DOI] [Google Scholar]
- [66].Vapnik VN, The nature of statistical learning theory, Springer science & business media, 1999. 10.1007/978-1-4757-3264-1. [DOI] [Google Scholar]
- [67].Witten MA, Frank IH, Hall E, Data Mining: Practical machine learning tools and techniques, Morgan Kaufmann, 2005. [Google Scholar]
- [68].Chollet F, Deep Learning with Python, Manning Publications, New York, NY, 2017. [Google Scholar]
- [69].Cohen J, A coefficient of agreement for nominal scales, Educ. Psychol. Meas 20 (1960) 37–46. 10.1177/001316446002000104. [DOI] [Google Scholar]
- [70].Bartko JJ, The Intraclass Correlation Coefficient as a Measure of Reliability, Psychol. Rep 19 (1966) 3–11. 10.2466/pr0.1966.19.1.3. [DOI] [PubMed] [Google Scholar]
- [71].Giavarina D, Understanding Bland Altman analysis, Biochem. Medica 25 (2015) 141–151. 10.11613/BM.2015.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [72].Barroso-García V, Gutiérrez-Tobal GC, Gozal D, Vaquerizo-Villar F, Álvarez D, Del Campo F, Kheirandish-Gozal L, Hornero R, Wavelet analysis of overnight airflow to detect obstructive sleep apnea in children, Sensors. 21 (2021) 1–19. 10.3390/s21041491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [73].Epstein LJ, Kristo D, Strollo PJ, Friedman N, Malhotra A, Patil SP, Ramar K, Rogers R, Schwab RJ, Weaver EM, Weinstein MD, Clinical Guideline for the Evaluation, Management and Long-term Care of Obstructive Sleep Apnea in Adults, J. Clin. Sleep Med 05 (2009) 263–276. 10.5664/jcsm.27497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [74].Church GD, The Role of Polysomnography in Diagnosing and Treating Obstructive Sleep Apnea in Pediatric Patients, Curr. Probl. Pediatr. Adolesc. Health Care. 42 (2012) 2–25. 10.1016/j.cppeds.2011.10.001. [DOI] [PubMed] [Google Scholar]
- [75].Musa N, Gital AY, Aljojo N, Chiroma H, Adewole KS, Mojeed HA, Faruk N, Abdulkarim A, Emmanuel I, Folawiyo YY, Ogunmodede JA, Oloyede AA, Olawoyin LA, Sikiru IA, Katb I, A systematic review and Meta-data analysis on the applications of Deep Learning in Electrocardiogram, J. Ambient Intell. Humaniz. Comput 14 (2023) 9677–9750. 10.1007/s12652-022-03868-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [76].Amyar A, Guo R, Cai X, Assana S, Chow K, Rodriguez J, Yankama T, Cirillo J, Pierce P, Goddu B, Ngo L, Nezafat R, Impact of deep learning architectures on accelerated cardiac T 1 mapping using MyoMapNet, NMR Biomed. 35 (2022). 10.1002/nbm.4794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [77].Barredo Arrieta A, Díaz-Rodríguez N, Del Ser J, Bennetot A, Tabik S, Barbado A, Garcia S, Gil-Lopez S, Molina D, Benjamins R, Chatila R, Herrera F, Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI, Inf. Fusion 58 (2020) 82–115. 10.1016/j.inffus.2019.12.012. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.