Abstract
Children with type 1 diabetes (T1D) frequently have nocturnal hypoglycemia, daytime physical activity being the most important risk factor. The risk for late post-exercise hypoglycemia depends on various factors and is difficult to anticipate. The availability of continuous glucose monitoring (CGM) enabled the development of various machine learning approaches for nocturnal hypoglycemia prediction for different prediction horizons. Studies focusing on nocturnal hypoglycemia prediction in children are scarce, and none, to the best knowledge of the authors, investigate the effect of previous physical activity. The primary objective of this work was to assess the risk of hypoglycemia throughout the night (prediction horizon 9 h) associated with physical activity in children with T1D using data from a structured setting. Continuous glucose and physiological data from a sports day camp for children with T1D were input for logistic regression, random forest, and deep neural network models. Results were evaluated using the F2 score, adding more weight to misclassifications as false negatives. Data of 13 children (4 female, mean age 11.3 years) were analyzed. Nocturnal hypoglycemia occurred in 18 of a total included 66 nights. Random forest using only glucose data achieved a sensitivity of 71.1% and a specificity of 75.8% for nocturnal hypoglycemia prediction. Predicting the risk of nocturnal hypoglycemia for the upcoming night at bedtime is clinically highly relevant, as it allows appropriate actions to be taken—to lighten the burden for children with T1D and their families.
Keywords: diabetes management, digital health, machine learning, supervised learning, biomedical signal processing
1 Introduction
Type 1 Diabetes (T1D) affects more than 8 million people worldwide, 1.5 million of them being younger than 20 years of age (1, 2). The disease results from autoimmune destruction of pancreatic beta cells leading to insulin deficiency. The missing hormone insulin needs to be replaced several times a day, either with subcutaneous injections or an insulin pump. Good blood glucose control is important to avoid acute and long-term complications (3). However, especially for young children and adolescents, this represents an everyday challenge (4). Low blood sugar (hypoglycemia) is the most feared and common acute complication of T1D (5, 6), and the constant risk of hypoglycemia represents a great burden, in particular for children and their caregivers (7).
Bachmann et al. (8) showed that asymptomatic nocturnal hypoglycemia is frequent, and episodes often are prolonged for several hours. The most important risk factor for hypoglycemia during the night was physical activity during the day. The risk of nocturnal hypoglycemia increased with vigorous-intensity physical activity (8, 9). On the other hand, regular physical activity is essential in long-term diabetes treatment (10) and has many additional beneficial effects (11). Simultaneously, people with diabetes need to know how to avoid exercise-induced hypoglycemia, as physical activity has a direct and delayed glucose-lowering effect (12). Physical activity induces different glucose responses in different individuals and depends on characteristics of the activity itself, like type, duration, and intensity (10). Therefore, with the current state of knowledge, it is challenging to provide the right personalized recommendations to prevent exercise-associated hypoglycemia, in particular, late onset post-exercise hypoglycemia. Developing preventive measures to avoid such nocturnal hypoglycemia can increase the children's safety overnight, improve sleep quality, and quality of life of the children and caregivers.
Several machine learning approaches exist in the literature to determine hypoglycemia—the review of (13) summarizes the current state-of-the-art, considering in total 79 studies. 31 of them focused on T1D. The most used machine learning method for hypoglycemia prediction was logistic regression (in 28 studies), followed by random forest (in 14 studies). Further applied algorithms incorporated support vector machines (SVMs), autoregressive and neural networks, and XGBoost. About 50% of the considered studies used continuous glucose monitoring (CGM) data or CGM-derived parameters for hypoglycemia prediction. Of these studies, twelve included parameters of physiological signals, and seven included parameters of exercise and physical activity. Previous work focused on different prediction horizons for hypoglycemia, such as short-term (< 180 min), mid-term (180 min to 24 h), and long-term (several days, months, or even years) (13). In the area of nocturnal hypoglycemia prediction, prediction horizons between 15 min and 7 h or nighttime without an exact length or start and end time were considered: 15 min (14–16), 30 min (14–17), 45 min (15, 16), 1 h (15, 16, 18), 3 h (19), 6 h (19–26), 7 h (27), or nighttime without an exact length or start and end time (28).
Studies focusing on nocturnal hypoglycemia prediction in children and adolescents are scarce (16, 18, 22, 23). Sampath et al. (22) and Tkachenko et al. (23) used the ChildrenData dataset that contains data from about 179 children, however, no CGM data were available. The available n = 476 records contained nine blood glucose measurements of the following distinct time points within 24 h: 08:00, 11:30, 13:30, 16:00, 18:00, 21:00, 00:00, 03:00, and 06:00. The three measurements at 00:00, 03:00, and 06:00 were used to identify if nocturnal hypoglycemia was present, while the other measurements of the day formed the input for the nocturnal hypoglycemia prediction. Both used a method consisting of aggregating ranking algorithms with a stochastic model. Dave et al. (16) and Duckworth et al. (18) used data with CGM for a prediction horizon of up to 60 min. The dataset used by (16) consisted of 112 participants (aged 1–21 years). They used logistic regression and random forest on data collected over 90 days. The dataset used by (18) contained 153 participants (aged 14–24 years). They obtained best results with the XGBoost framework that uses an ensemble of weak learners which are trained stagewise through gradient boosting.
In this study, the primary objective was to predict nocturnal hypoglycemia associated with physical activity in children with T1D using CGM and physiological data acquired during day and night in a structured setting. We incorporate the children's particularities like longer sleep duration, focusing on the entire night (prediction horizon of 9 hours), or investigating the effect of previous physical activity. Second, as the children performed various structured physical activities during the day in the dataset that we are considering, we want to analyze if including data from a wearable device improves the outcomes. Third, as more advanced machine learning techniques such as deep learning are currently underrepresented in literature, we want to investigate the performances of Deep Neural Network (DNN) models like Recurrent Neural Networks (RNNs) and Multilayer Perceptron (MLP) compared to the most used approaches in literature like logistic regression and random forest.
2 Materials and methods
This part describes the used data and presents the implemented machine learning pipeline (Figure 1).
2.1 Data
Data of children with T1D participating in a one-week sports day camp were considered (29). The responsible Ethics Committee [Ethikkommission Nordwest- und Zentralschweiz (EKNZ)], Gesuchsnummer: 2020 - 00543, approved the study. For the data acquisition, the hardware equipment contained of a glucose sensor [intermittently scanned continuous glucose monitoring (isCGM), Freestyle libre 2 (Abbott Diabetes Care Inc., Alameda, US) or a CGM device, Dexcom (Dexcom, San Diego, US)] where the sensor was put into the subcutaneous tissue on the abdomen or the upper arm. The CGM device saved data every 5 min, and the isCGM device every 15 min. With all glucose sensors used, intersitital glucose was measured continuously. With the intermittent scanning system, however, the values are only displayed on demand, while they are visible constantly with the CGM systems. The accuracy of the sensors is comparable with mean absolute relative difference (MARD) values between 9.0% (Dexcom) and 9.7% (Freestyle libre 2) (30–34). The values are therefore reliable and comparable, even if different devices were used. Also, we trusted the devices. The alarms were not used during the day because hypoglycemia was well perceived by the participants and glucose checks were performed regularly.
In addition to CGM devices, the children were equipped with a physiological wearable sensor (Everion, Biofourmis, Boston, US). This sensor is a CE-certified research device and captured 22 signals and seven associated quality measures in real-time. In this work, we selected ten signals—and their associated quality measures, if available—for further processing (marked in bold in Table 1). The sampling rate was 1 Hz. Two sensors were used for each study participant and worn alternately. At the beginning of each study day (around 9 am), the Everion sensors were changed. The Everion sensor was attached to the upper arm (right or left) with an appropriately sized armband.
Table 1.
Measured signal |
Activity classification* |
Activity score |
Barometer pressure |
Blood pulse wave |
Core temperature * |
Energy* |
Galvanic Skin Response (GSR) electrode |
Health score |
Heart rate * |
Heart rate variability* |
Motion activity |
Number of steps |
Oxygen saturation* |
Perfusion index |
Relax stress intensity score |
Respiration rate* |
Richness score |
Sleep quality index score |
Temperature barometer |
Temperature local |
Temperature object |
Training effect score |
The signals marked in bold were considered in this work. The * next to the signals states if an associated quality measure exists.
The glucose sensor and the Everion sensor were worn day and night, and the Everion sensor had to be taken off for showering and water-based activities. Additionally to sensor glucose measurements, self-monitoring blood glucose (SMBG) were performed hourly during exercise sessions, in each case of symptoms of hypoglycemia, and in case of sensor glucose values below 70 mg/dl or above 270 mg/dl. Self-monitoring of blood glucose was performed hourly during exercise i.e. at the beginning of each exercise session as recommended in the exercise management guidelines (35). As it was mandatory to perform a capillary blood glucose measurement in case of symptoms of hypoglycemia or a sensor glucose value below 70 mg/dl, a proportion of children had to do a fingerprick testing. It turned out to be more convenient/practical to do so for all participants.
Preprocessing was necessary for combining the glucose sensor data with the SMBG from the logbook and for the signals of the Everion sensor. The preprocessing steps are illustrated in Figure 2. In case of two different glucose values at the same timestamp, sensor data was overwritten with SMBG and the lower glucose sensor data were kept (Figure 2A). We decided to keep the lower value as hypoglycemia detection was the focus of this work. We curated all time series such that they have a sampling frequency of 5 min, where we use time-weighted linear interpolation to assign the glucose measurements to a time stamp. All data gaps in the glucose were filled with the respective mean of the corresponding data of the day. For data from the Everion sensor, we replaced values of duplicated timestamps with their mean (Figure 2B). For signals with an associated quality measure, we ignored values when the quality measure was less than 50% as this percentage was recommended by the manufacturer (36). Further, we considered only days with at least 30% of available data. To obtain uniform temporal data, we set the sampling interval to 5 min (corresponding to a sampling frequency of 3.33 mHz) by calculating the mean in windows of 5 min length, with no overlap of the individual windows. Similar to the glucose data, all data gaps were filled with the respective mean of the corresponding data of the day.
This above described procedure led to the exclusion of a child as not enough Everion data were measured. These processing steps resulted in using data from 13 children with a total of 66 days (ranging from 9 to 12 children per day) in this work. Of them eight children used an isCGM device and five a CGM device.
The activity classes of the parameter activity classification (Table 1) were transferred to Metabolic Equivalent of Task (MET) values (Table 2) (37) with 1 MET = 1 kcal / kg / h.
Table 2.
Activity | Assigned MET values |
---|---|
Undefined | NaN |
Resting | 1.5 |
Walking flat | 3 |
Running flat | 7 |
Biking flat | 7 |
Walking up | 5.5 |
Running up | 13 |
Biking up | 10 |
Rowing | 6 |
Other | NaN |
Biking | 7 |
Running | 7 |
Walking | 3 |
Walking down | 3.5 |
Running down | 6 |
Biking down | 3 |
Sitting | 1.5 |
Standing | 2 |
Driving car | 2.5 |
Driving public | 1.5 |
Sleeping | 1 |
Awake | 1.5 |
Ctrl rest med ee | NaN |
Measured and relevant improvement in Relax | NaN |
Measured and relevant improvement in Sleep | NaN |
Measured and relevant improvement in Exercise | NaN |
Measured and relevant improvement in Move | NaN |
We assigned, according to the “compendium of physical activities” (37) the following Metabolic Equivalent of Task (MET) values to single activities with 1 MET = 1 kcal / kg / h.
2.2 Machine learning algorithm
The general idea of this work was to develop a classification system to answer the research question whether nocturnal hypoglycemia can be predicted with physiological and glucose data collected during the day. The night was defined between 10 pm and 7 am. The day was defined between 10 am and 10 pm.
A hypoglycemic event was defined as either (1) a single or multiple SMBG < 3.9 mmol/l or (2) an interval greater than 15 min, in which all continuous glucose measurements were < 3.9 mmol/l (14, 16, 38). In the present data, 48 nights were found without nocturnal hypoglycemia and 18 with nocturnal hypoglycemia.
Figure 1 gives an overview of the different steps of the designed machine learning system. In this work, we use a supervised learning approach. In the training phase of a supervised machine learning system, the information about the actual label is known to the classifier. Then in the testing phase, the actual labels are predicted. The standard pipeline of a supervised machine learning model consists of the following four steps: (1) data, (2) preprocessing, (3) features, and (4) classification (39). These steps were also used in this work (Figure 1). For the last step, the classification, we decided to use two baseline classifiers, logistic regression and random forest, as these are the most used algorithms in literature for hypoglycemia prediction (13). In case of the DNN models, the third steps (features) was left out as the complete data were directly used.
We used glucose measurements, the logbook, the Everion sensor, and participant information as data sources (Section 2.1). The sensor glucose measurements and the logbook data (SMBG) were combined to form the glucose data (details also available in Figure 3). The chosen signals of the Everion sensor (Table 1) form the physiological data. From the available participant information, the age, weight, height, Body mass index (BMI), and gender (male or female) were extracted to form the static data.
With the available temporal and static data, four dataset combinations were chosen as input data for the algorithms:(1) glucose data only, (2) glucose and static data, (3) glucose and physiological data and (4) glucose, physiological, and static data.
In the case of the DNN models, these four input datasets were directly used in the following two scenarios:
-
A RNN for the temporal data.
The RNN included a masking layer followed by a bidirectional Gated Recurrent Unit (GRU) (40) layer, a dropout layer, a Long Short-Term Memory (LSTM) (41) layer, and another dropout layer.
-
A RNN for the temporal data, and a MLP for the static data.
Both were combined afterward. The output of both the MLP and the RNN were concatenated and processed by an additional MLP. The RNN included a single LSTM layer.
Considering every distinct combination of input datasets, four different architectures were designed by composing individual network blocks. The RNN is predominantly used for sequential and temporal data, whereas the MLP finds applications in all domains of machine learning (42). Here, we can only pass data according to a predefined time-window into the MLP, which can not consider data outside of the window. RNNs on the other hand, continuously update a hidden state while processing the data. Therefore, the hidden state can potentially capture long-ranging dependencies.
To find the best architectures, considering each composition individually, the neural networks were subject to hyperparameter optimization using the Hyperband algorithm, introduced by Li et al. (43). The grid of hyperparameters to be optimized included the number of layers, number of nodes per layer, application of GRU or LSTM units, addition of bidirectionality to the recurrent units, addition of dropout layers, and masking zero-values. In order to properly treat the missing values of the input data, which have been substituted with the parameter-specific mean values, the masking ignores the mean values. Since all parameters have been standardized, the mean values are equal to zero. We introduced class weights to the loss function during training to account for the class imbalance (18 nights with nocturnal hypoglycemia, 48 nights without nocturnal hypoglycemia). We chose the class weights inversely proportional to their respective frequencies. This led to increased attention to the underrepresented class samples and forced the model to improve equally in both classes during the training phase. We used the Rectified Linear Unit (ReLU) activation function throughout the dense layers to avoid computational complexity and vanishing gradients, as suggested in literature (44, 45). The hyperbolic tangent (tanh) activation function was used for the recurrent layers based on research conducted by Chung et al. (46) and Hochreiter and Schmidhuber (41), as it has the property of limiting the issue of exploding gradients while still providing a strong gradient. We used adaptive moment estimation (Adam) as the optimizer with a learning rate of 0.001 since the optimizer Adam performs on average better on deep learning task than than Stochastic Gradient Descent (SGD) (47, 48). We set the batch size to 1 due to the small number of samples in the dataset and the intention to counteract for overfitting and poor generalization (49). Although, Adam with batch size 1 reduces to an update rule similar to the vanilla SGD algorithm, it still preserves the features regarding the moments of the gradient (47).
In the case of the baseline classifiers, features were engineered from the four input datasets and selected before being the input to either logistic regression or random forest. Logistic regression uses a logistic function (the sigmoid function) to model the probability that a given input belongs to a particular class (50). Random forest is an ensemble learning method that operates by constructing multiple decision trees and using the majority vote of these trees to determine the correct class (50).
For the glucose data, we calculated eight features from literature (14) to reflect glucose dynamics. These were coefficient of variation, lability index, low blood glucose index, 1 h continuous overlapping net glycemic action, minimal value, the difference between the last two values, acceleration over the last values, and linear trend coefficient. In addition, we used for all four input datasets the Python library tsfresh to calculate time series characteristics, which was followed by a feature selection step. This library was specifically developed to create new features aggregated on temporal dependencies. With tsfresh, approximately 800 new features for every single initial feature were engineered. To reduce the number of features to the best 15 features, we conducted a performance-based, sequential feature selection using the Python library scikit-learn. The options of scikit-learn were set in accordance to the classifier that was used afterward.
This resulted in five different feature-data-combinations that were used for the following two baseline classifiers:
Logistic regression with Least Absolute Shrinkage and Selection Operator (LASSO) regularization
Random forest with 10 trees.
We chose to focus on these two classifiers because they are the most commonly used algorithms in the literature for hypoglycemia prediction (13), and are also applied in the field of nocturnal hypoglycemia prediction (14, 16).
2.3 Comparison to algorithm in literature
To assess the quality of our dataset, we decided to re-implemented an algorithm in literature (14). Berikov et al. (14) applied the two most used algorithms for hypoglycemia prediction (logistic regression and random forest) in the field of nocturnal hypoglycemia and obtained best results with a prediction horizon of 15 min, which was much shorter than the used prediction horizon of 9 h in this work. Still, we considered it valuable to evaluate the quality of our dataset. We reimplemented the algorithm as described in the associated manuscript (14), adapted where necessary, and applied it to the described data of this work. Additionally, we used an adapted version using elements of the published algorithm and the algorihm of this work. More details about the reimplemented algorithm and the adapted version is given in the next paragraph. Figure 4 illustrates the steps of the algorithm from (14) applied in this work. The glucose measurements, the logbook, and participant information were included as data sources.
As described previously, the glucose data consisted of a combination of the glucose measurements and the SMBG entries of the logbook. From the participant information, age, BMI, and gender formed the selected static data. The glucose data was processed in sliding windows of the acquired nights, in the time frame of 0 am (midnight) to 6 am. Each sliding window had a duration of 45 min. The next window started 20 min after the start of the previous window. Single windows were used to predict a hypoglycemic event exactly 15 min after the end of the window. In each window for the glucose data, we extracted the same eight features from literature (14) as mentioned in Section 2.2. These features were combined with the selected static data to form the feature set. Similar to (14), a logistic regression with LASSO regularization and random forest were trained with this feature set. Additionally, not mentioned by (14), we decided to train a MLP with the same feature set.
2.4 Performance measures and evaluation
We compared all predicted nocturnal hypoglycemic events with all true nocturnal hypoglycemic events (Table 3).
Table 3.
Predicted | |||
---|---|---|---|
no NH | NH | ||
True | no NH | True Negatives (TN) (correct rejection) | False Positives (FP) (overestimation) |
NH | False Negatives (FN) (underestimation) | True Positives (TP) (hit) |
The classification systems were compared regarding six evaluation metrics: the specificity, the sensitivity, the precision, the F2 score, the F1 score, and the Area Under the Receiver Operating Characteristic Curve (AUC) score (Table 4). The specificity represents the proportion of nights without nocturnal hypoglycemia, which were correctly classified as nights without nocturnal hypoglycemia. The sensitivity (also known as recall) gives the proportion of nights with nocturnal hypoglycemia, which were correctly classified as nights with nocturnal hypoglycemia. The precision (also known as positive predictivity) gives the proportion of correctly classified nights with nocturnal hypoglycemia over all nights predicted as nights with nocturnal hypoglycemia.
Table 4.
Metric | Definition |
---|---|
Specificity | |
Sensitivity | |
Precision | |
Fβ score | , with either β = 2 (F2 score) or β = 1 (F1 score) |
AUC score | Area Under the Receiver Operating Characteristic Curve |
TN, true negatives; FP, false positives; FN, false negatives; TP, true positives.
The F1 score gives precision and sensitivity the same weight, is broadly used in literature, and is known to be a good metric for imbalanced classification tasks (51, 52). The Fβ score is a generalization of the F1 score adding the configuration parameter β. In the F2 score, β is set to 2, giving more weight to sensitivity than to precision (52). In the present case and generally in medical settings, the misclassification as false negative (underestimation) is worse than the misclassification as false positive (overestimation). We decided to use the F2 score as the metric on the validation set when it came to stopping the training or in determining the relevant parameteres and features. The AUC score gives the area under the receiver operating characteristic curve.
Each machine learning model was subjected to six-fold cross-evaluation to counteract the small sample size and to support reliable and reproducible results. The available data after preprocessing (Figure 3) was divided into individual days with their corresponding label from the following night (Figure 5). 66 days with available label from the night formed the dataset. The dataset was shuffled and divided into six-folds, with five folds forming the training set and one fold forming the test set. In each case, 20% of the training set was subtracted to form the validation set for the DNN models (Figure 6). We considered a population-based approach and it was possible, that data from the same child was in the training, validation and test set. The shuffle and split were done in a stratified fashion so that classes were distributed almost identically among the different sets. This above mentioned procedure was executed five times, using the same randomly generated partitions of the data for all dataset combinations. The results are given averaged over all executions with the belonging standard deviation.
3 Results
Table 5 gives the characteristics of the 13 children that were included in the data analysis. Table 6 shows the mean values with standard deviation of three performance measures (Table 4) for the developed machine learning pipeline (Figure 1). Figure 6 displays the three remaining performance measures, the specificity, the sensitivity, and the precision, to the same machine learning pipeline as in Table 6. Table 6 gives the obtained results for the (adapted) implemented algorithm from literature (14) (Figure 4).
Table 5.
Mean ± SD | Range | |
---|---|---|
Age (years) | 11.3 ± 2.2 | [7.5, 13.9] |
BMI (kg/m2) | 19.8 ± 4.4 | [13.2, 27.7] |
Diabetes duration (years) | 4.1 ± 2.8 | [0.9, 9.5] |
HbA1c (%) | 7.1 ± 0.9 | [5.1, 8.5] |
Diabetes duration is the time since Type 1 Diabetes (T1D) diagnosis. HbA1c refers to glycated hemoglobin measured in the range of 54 days before the first study day and 74 days after the first study day. SD, Standard deviation; BMI, Body mass index.
Table 6.
Time series characteristics | ||||||
---|---|---|---|---|---|---|
Data sets | Glucose literature features | Glucose | Glucose & static | Glucose & physiological | Glucose, physiological & static | |
LR | F2 | 56.4 ± 20.9 | 50.6 ± 24.0 | 52.1 ± 19.8 | 56.3 ± 27.8 | 63.0 ± 23.7 |
F1 | 56.0 ± 12.0 | 42.7 ± 18.6 | 43.6 ± 15.4 | 49.3 ± 24.2 | 54.8 ± 20.3 | |
AUC | 68.3 ± 13.1 | 59.4 ± 15.4 | 58.9 ± 14.0 | 66.0 ± 17.2 | 69.0 ± 16.2 | |
RF | F2 | 40.8 ± 23.1 | 64.4 ± 26.0 | 56.0 ± 26.3 | 61.3 ± 30.3 | 56.6 ± 29.6 |
F1 | 47.3 ± 25.2 | 58.3 ± 22.6 | 48.3 ± 22.3 | 60.9 ± 27.6 | 55.4 ± 25.9 | |
AUC | 66.4 ± 11.9 | 73.5 ± 14.5 | 65.3 ± 15.3 | 75.2 ± 16.6 | 72.6 ± 14.9 | |
DNN | F2 | n.a. | 57.5 ± 19.1 | 59.5 ± 12.9 | 62.0 ± 11.3 | 49.1 ± 22.3 |
F1 | n.a. | 41.9 ± 14.1 | 40.4 ± 7.5 | 42.1 ± 6.9 | 37.6 ± 16.6 | |
AUC | n.a. | 53.9 ± 13.7 | 48.2 ± 8.6 | 51.0 ± 7.6 | 51.7 ± 14.0 |
The standard deviations are calculated as average values of the five runs. For LR and RF, different features were calculated (Literature Features or Time Series Characteristics). The used machine learning pipeline is illustrated in Figure 1. AUC, Area Under the Receiver Operating Characteristic Curve; n.a., not applicable. Best F2 scores per data set are marked in bold.
In the implemented machine learning pipeline, different feature-data-combinations were the input for the classification task using logistic regression, random forest or DNN. Best results (F2 score 64.4%) were obtained with time series characteristics of glucose data using random forest (Table 6). Time series characteristics of glucose, physiological, and static data with logistic regression (F2 score 63.0%) achieved a similar F2 score then the DNN with glucose and physiological data (F2 score 62.0%) (Table 6). Single high values (above 80%) were also reached for specificity and sensitivity for a variety of options (Figure 6).
4 Discussion
The focus of this work was the prediction of nocturnal hypoglycemia (prediction horizon 9 h) concentrating on children with T1D. We used CGM, physiological data from a wearable device, and children characteristics.
4.1 General discussion
Despite achieving single high values (above 80%) for specificity and sensitivity for a variety of options (Figure 6), the selected and implemented machine learning pipeline could not achieve values over 65% for the combined F2 score, e.g. due to the small data set. The high standard deviations for all models and dataset combinations indicate that, depending on the distribution of the dataset (cross-validation), it is unclear which model is actually the best (Table 6, error bars in Figure 6).
Comparing the baseline models to DNNs, it is apparent that the DNNs achieved smaller standard deviations. This shows the potential of the DNN models as this indicates the more consistent performance over all six-folds for the five runs. The consistency suggests that DNNs better capture complex patterns and relationships than traditional models. DNN models typically require large datasets—ideally large in number of samples to generalize well in tuning the parameters and in the number of children, study days, and class balance to learn a comprehensive representation. We assume that either (1) combining different datasets into a larger one or (2) applying transfer learning would improve the results for the DNN models. In transfer learning, either this dataset is used to train the model, and fine-tuning is done on the other dataset, or vice versa.
Investigating the results for incorporating physiological data, it is visible that there is an improvement in the F2 scores for the baseline models logistic regression and random forest (Table 6). As mentioned in the last paragraph, we assume that DNN models need a larger dataset to realize their full potential. An improvement was not (random forest) or, at most, slightly (logistic regression) visible when only static data was included. Our assumption here is that static data describes the children's general characteristics and could impact the general risk for hypoglycemia. In contrast, it does not improve the short-term prediction for the upcoming night. When assessing the upcoming night, the influence of physiological data is greater, particularly regarding activity and energy expenditure during the day, and has a direct impact on blood glucose at night, which is also reflected in the general trend.
Improvements for future work could be targeted hand-crafted features, with specific feature selection, and incorporating ML strategies to enlarge the dataset.
4.2 Comparison to reimplemented algorithm from literature
For evaluating the quality of our dataset, we reimplemented an algorithm from literature (14) that uses logistic regression and random forest for a prediction horizon of 15 min (Figure 4). berikov et al. (14) achieved best results with random forest (Specificity 91.4%, Sensitivity 94.5%, AUC score 0.97). Best results using our dataset were attained with logistic regression (Specificity 90.4%, Sensitivity 90.4%, F2 score 63.9%, AUC score 90.7%) (Table 7).
Table 7.
Logistic regression | Random forest | Multilayer perceptron | |
---|---|---|---|
Specificity | 90.4% | 98.9% | 77.1% |
Sensitivity | 90.4% | 20.5% | 61.1% |
Precision | 29.2% | 45.0% | 50.0% |
F2 score | 63.9% | 23.0% | 58.5% |
F1 score | 43.3% | 35.8% | 55.0% |
AUC score | 90.7% | 59.7% | 69.1% |
Single values for specificity, sensitivity, and precision range from 20.5% to 98.9%, for the F2 score from 23.0% to 63.9%. The previously best performing algorithm random forest achieves the worst results. Differences between the reimplemented algorithm and the other algorithm presented in this work are 1) different prediction horizon, 2) selected hand-crafted features are input to random forest and logistic regression, and 3) different data were used. Data between midnight to 6 am were used. Previously this data served only to determine one of the two classes: Nocturnal Hypoglycemia and No Nocturnal Hypoglycemia. Hence, more samples were used in the 15 min prediction. This could indicate that the effect of exercise is relevant whereas cannot be depicted with data only during the night and emphasizes that the combination—data, if needed features, and classifiers— is relevant.
Berikov et al. (14) did not present their results using precision or the F2 score. They used a dataset of 406 adults (aged 18 to 70 years) having a mean CGM duration of each participant of 6.7 days. Since the sensitivity and specificity in the different works are in the same ranges, we conclude that our dataset is comparable to the larger dataset (14) despite the smaller sample size and other differences (participants were children, mean CGM duration of 5.1 days).
4.3 Comparison with literature
Three studies from the literature (20, 21, 26) concentrated on nocturnal hypoglycemia prediction (prediction horizon of 6 h for all three studies) and included wearable data. Bertachi et al. (26) received best results (78.75% median sensitivity, 82.15% median specificity) with SVM, evaluating the generated classifiers with . Parcerisas et al. (20) used the same dataset as Bertachi et al. (26) and achieved with SVM a median sensitivity of 74% and a median specificity of 77% for their population models, evaluating the generated models with the F1 score. Veh´ et al.(21) used artificial neural networks and obtained a mean sensitivity of 44.0% and mean specificity of 85.9%. In this work, mean sensitivity and specificity to the best F2 score were 77.8% and 62.5%, respectively (Table 6). If we consider the differences in the studies, such as children compared to adults or longer prediction horizon, we conclude that the results in this work are comparable to or even exceed the results in the literature.
We used data collected during a sports day camp. During the day, the children were supervised by pediatric endocrinologists. The children were supervised by their parents in the evenings and at night. This study setting is less controlled than an inpatient hospitalized setting as used in literature (14, 22, 23). Other studies use an even less controlled outpatient setting, where participants continue their daily routines and come to the study center only at agreed times (16, 18, 20, 26). The chosen study setting allows the imitation of everyday daily life but offers opportunities for intervention and offers information about meals and insulin doses. Data from a less supervised setting including a higher number of participants will also be considered in the future.
Population-based models and individual-based (or personalized) models are used in the literature (20). Population-based models describe models in which one or more participants are not seen by the algorithm in the training phase. Individual-based models are models that explicitly consider data from the individual in the training phase to (ideally) produce better results for the individual. Due to the scarcity of our data, we combined all available study days and ignored the individual participant. In the future, we plan to evaluate the performance of population-based and individual-based models.
4.4 Limitations
In this work, we decided to use logistic regression and random forest as these were the most used algorithms in literature for hypoglycemia prediction (13). In principle, no particular classifier is appropriate for all classification tasks (No Free Lunch Theorem) (53). Hence it might be that for our research questions and the specific dataset including children a different classifier would improve the performance. For example, in future work we want to incorporate XGBoost or SVM as the algorithms have a lower complexity than DNN and have shown good performances in similar work.
We transferred the activity classes to MET values (Table 2) based on the adult compendium (37). This comprehensive list included all available activity classes. The 'Youth Compendium of Physical Activities' (54) contains only 200 activities and their assigned MET value for four different age groups (6-9, 10-12, 13-15, 16-18 years). Since not all required activity classes were available in the “Youth Compendium”, we decided to stay with the established adult compendium.
For the logistic regression and random forest, we performed a sequential forward feature selection and limited the number of features to 15. We did not investigate other feature selection approaches or perform an extensive search regarding best number of features. In future work, we will address this issue together with newly created hand-crafted physiological features.
4.5 Outlook
Vu et al. (19) noted, the difference between the early phases of sleep and the late phases of sleep has not been widely studied in the field of predicting nocturnal hypoglycemia. The literature often distinguishes between the first three hours of sleep (midnight to 3 am) and the remaining three hours of sleep (3 am to 6 am). During the beginning of sleep, slow wave sleep (SWS)—like non-rapid eye movement (REM) sleep—dominates, while REM sleep becomes more prevalent toward the end of sleep (55). The brain's need for glucose is significantly decreased during SWS, reaching a minimum level that is distinctively lower than during wakefulness or REM sleep (55). Research studies (55, 56) indicate that more nocturnal hypoglycemia events are present during late sleep. In addition, adults and children with T1D report worse sleep quality than healthy controls (19). In the future, we want to investigate nocturnal hypoglycemia prediction in the early phases of sleep compared to the late phases of sleep, adjusted to data from children that have a higher sleep requirement. For this, we will also consider (night) parameters of the wearable sensor like sleep quality index score (Table 1).
The logbook also contains information about the structured sports activities that were performed on a daily basis. The inclusion of this information in combination with the activity classes and other activity parameters could be used to establish rules for hand-crafted features for the signals of the Everion sensor. This could then lead to better results when incorporating physiological data from the wearable sensor.
Lehmann et al. (57) created a decision tree-based machine learning method to detect hypoglycemia in unknown individuals using only data collected by non-invasive wearables. They concentrated on adults with diabetes. Their pilot study indicates that wearable data can provide valuable insights into hypoglycemia prediction, even without using CGM. Analyzing signals from the Everion sensor separately from CGM data and then combining them with CGM and child characteristics could be another aspect of future work.
In addition, there are other options for using the signals of the Everion sensor (Table 1). Studies (58–60) have shown a relationship between heart rate variability and hypoglycemia. In future work, we want to investigate this association. Further, we want to incorporate insulin and carbohydrates in the future as it was shown in literature that the inclusion increased the performance (16). We want to also compare the performance of the developed algorithms on public datasets, e.g. the OhioT1DM Dataset (61) or data from the CITY study (18).
5 Conclusion
In this work, we utilized a dataset recorded in a structured setting to assess the risk of nocturnal hypoglycemia associated with physical activity in children with T1D. In contrast to previous studies we aimed for longer-term predictions, i.e. up to 9 h vs. up to 1 h. From our point of view, the results obtained in this study are acceptable with a sensitivity of the best F2 score close to 80%. Understanding the hypoglycemia risk for the entire upcoming “critical” night is clinically relevant as it permits children and their parents to either sleep soundly or to take appropriate action such as reducing basal insulin doses, administering additional carbohydrates, or scheduling a nocturnal glucose measurement.
Acknowledgments
Alexander Marx was supported with a Postdoctoral Fellowship by the ETH AI Center. We thank the Schweizerische Diabetesstiftung (SDS) and Stiftung Pro UKBB for their support of the Diacamp Project and the Freiwillige Akadamische Gesellschaft Basel (FAG) for supporting Sara Bachmann. Marie-Anne Burckhardt was supported by a research fellowship provided by the Research Fonds (Special Program “Nachwuchsförderung Klinische Forschung”) of the University of Basel and a Young Investigator Grant of the Swiss Society for Endocrinology and Diabetes (SSED).
Funding Statement
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. The work leading to this publication was supported by the PRIME programme of the German Academic Exchange Service.
Data availability statement
The presented datasets in this article are not publicly available to ensure participant privacy. Requests to access the dataset should be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Ethikkommission Nordwest- und Zentralschweiz (EKNZ), Gesuchsnummer: 2020 - 00543. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants' legal guardians/next of kin.
Author contributions
HL: Conceptualization, Data curation, Funding acquisition, Methodology, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. MB: Data curation, Formal analysis, Methodology, Validation, Writing – original draft, Writing – review & editing. AM: Conceptualization, Methodology, Supervision, Writing – original draft, Writing – review & editing. MP: Funding acquisition, Resources, Supervision, Writing – original draft, Writing – review & editing. M-AB: Conceptualization, Methodology, Resources, Writing – original draft, Writing – review & editing. SB: Conceptualization, Funding acquisition, Methodology, Resources, Writing – original draft, Writing – review & editing. JV: Conceptualization, Funding acquisition, Resources, Supervision, Writing – original draft, Writing – review & editing.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
- 1.Gregory GA, Robinson TI, Linklater SE, Wang F, Colagiuri S, de Beaufort C, et al. Global incidence, prevalence, and mortality of type 1 diabetes in 2021 with projection to 2040: a modelling study. Lancet Diabetes Endocrinol. (2022) 10:741–60. 10.1016/S2213-8587(22)00218-2 [DOI] [PubMed] [Google Scholar]
- 2.Patterson CC, Karuranga S, Salpea P, Saeedi P, Dahlquist G, Soltesz G, et al. Worldwide estimates of incidence, prevalence and mortality of type 1 diabetes in children and adolescents: results from the International Diabetes Federation Diabetes Atlas. Diabetes Res Clin Pract. (2019) 157:107842. 10.1016/j.diabres.2019.107842 [DOI] [PubMed] [Google Scholar]
- 3.Diabetes Control and Complications Trial Research Group . The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N Engl J Med. (1993) 329:977–86. 10.1056/NEJM199309303291401 [DOI] [PubMed] [Google Scholar]
- 4.Foster NC, Beck RW, Miller KM, Clements MA, Rickels MR, DiMeglio LA, et al. State of type 1 diabetes management and outcomes from the T1D exchange in 2016-2018. Diabetes Technol Ther. (2019) 21:66–72. 10.1089/dia.2018.0384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Nordfeldt S, Ludvigsson J. Fear and other disturbances of severe hypoglycaemia in children and adolescents with type 1 diabetes mellitus. J Pediatr Endocrinol Metab. (2005) 18:83. 10.1515/JPEM.2005.18.1.83 [DOI] [PubMed] [Google Scholar]
- 6.Glocker V, Bachmann S, Hess M, Szinnai G, Burckhardt MA. Fear of hypoglycemia and quality of life in young people with type 1 diabetes and their parents in the era of sensor glucose monitoring. Front Endocrinol. (2022) 13:958671. 10.3389/fendo.2022.958671 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Patton SR, Clements MA, Marker AM, Nelson EL. Intervention to reduce hypoglycemia fear in parents of young kids using video-based telehealth (REDCHiP). Pediatr Diabetes. (2020) 21:112–9. 10.1111/pedi.12934 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bachmann S, Hess M, Martin-Diener E, Denhaerynck K, Zumsteg U. Nocturnal hypoglycemia and physical activity in children with diabetes: new insights by continuous glucose monitoring and accelerometry. Diabetes Care. (2016) 39:e95–6. 10.2337/dc16-0411 [DOI] [PubMed] [Google Scholar]
- 9.Jaggers JR, King KM, Watson SE, Wintergerst KA. Predicting nocturnal hypoglycemia with measures of physical activity intensity in adolescent athletes with type 1 diabetes. Diabetes Technol Ther. (2019) 21:406–8. 10.1089/dia.2019.0048 [DOI] [PubMed] [Google Scholar]
- 10.García-Hermoso A, Ezzatvar Y, Huerta-Uribe N, Alonso-Martínez AM, Chueca-Guindulain MJ, Berrade-Zubiri S, et al. Effects of exercise training on glycaemic control in youths with type 1 diabetes: a systematic review and meta-analysis of randomised controlled trials. Eur J Sport Sci. (2022) 23:1741–9. 10.1080/17461391.2022.2086489 [DOI] [PubMed] [Google Scholar]
- 11.Giannini C, Mohn A, Chiarelli F. Physical exercise and diabetes during childhood. Acta Biomed. (2006) 77:18–25. [PubMed] [Google Scholar]
- 12.McMahon SK, Ferreira LD, Ratnam N, Davey RJ, Youngs LM, Davis EA, et al. Glucose requirements to maintain euglycemia after moderate-intensity afternoon exercise in adolescents with type 1 diabetes are increased in a biphasic manner. J Clin Endocrinol Metab. (2007) 92:963–8. 10.1210/jc.2006-2263 [DOI] [PubMed] [Google Scholar]
- 13.Zhang L, Yang L, Zhou Z. Data-based modeling for hypoglycemia prediction: Importance, trends, and implications for clinical practice. Front Public Health. (2023) 11:1044059. 10.3389/fpubh.2023.1044059 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Berikov VB, Kutnenko OA, Semenova JF, Klimontov VV. Machine learning models for nocturnal hypoglycemia prediction in hospitalized patients with type 1 diabetes. J Pers Med. (2022) 12:1262. 10.3390/jpm12081262 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Li J, Ma X, Tobore I, Liu Y, Kandwal A, Wang L, et al. A novel CGM metric-gradient and combining mean sensor glucose enable to improve the prediction of nocturnal hypoglycemic events in patients with diabetes. J Diabetes Res. (2020) 2020:8830774. 10.1155/2020/8830774 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Dave D, DeSalvo DJ, Haridas B, McKay S, Shenoy A, Koh CJ, et al. Feature-based machine learning model for real-time hypoglycemia prediction. J Diabetes Sci Technol. (2021) 15:842–55. 10.1177/1932296820922622 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wang W, Wang S, Wang X, Liu D, Geng Y, Wu T, et al. glucose-insulin mixture model and application to short-term hypoglycemia prediction in the night time. IEEE Trans Biomed Eng. (2021) 68:834–45. 10.1109/TBME.2020.3015199 [DOI] [PubMed] [Google Scholar]
- 18.Duckworth C, Guy MJ, Kumaran A, OKane AA, Ayobi A, Chapman A, et al. Explainable machine learning for real-time hypoglycemia and hyperglycemia prediction and personalized control recommendations. J Diabetes Sci Technol. (2022) 18:113–23. 10.1177/19322968221103561 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Vu L, Kefayati S, Idé T, Pavuluri V, Jackson G, Latts L, et al. Predicting nocturnal hypoglycemia from continuous glucose monitoring data with extended prediction horizon. In: AMIA Annual Symposium Proceedings. Bethesda, MD: American Medical Informatics Association; (2019). p. 874. [PMC free article] [PubMed] [Google Scholar]
- 20.Parcerisas A, Contreras I, Delecourt A, Bertachi A, Beneyto A, Conget I, et al. A machine learning approach to minimize nocturnal hypoglycemic events in type 1 diabetic patients under multiple doses of insulin. Sensors. (2022) 22:1665. 10.3390/s22041665 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.VehíJ, Contreras I, Oviedo S, Biagi L, Bertachi A. Prediction and prevention of hypoglycaemic events in type-1 diabetic patients using machine learning. Health Informat J. (2020) 26:703–18. 10.1177/1460458219850682 [DOI] [PubMed] [Google Scholar]
- 22.Sampath S, Tkachenko P, Renard E, Pereverzev SV. Glycemic control indices and their aggregation in the prediction of nocturnal hypoglycemia from intermittent blood glucose measurements. J Diabetes Sci Technol. (2016) 10:1245–50. 10.1177/1932296816670400 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tkachenko P, Kriukova G, Aleksandrova M, Chertov O, Renard E, Pereverzyev SV. Prediction of nocturnal hypoglycemia by an aggregation of previously known prediction approaches: proof of concept for clinical application. Comput Methods Programs Biomed. (2016) 134:179–86. 10.1016/j.cmpb.2016.07.003 [DOI] [PubMed] [Google Scholar]
- 24.Klimontov VV, Myakina NE. Glucose variability indices predict the episodes of nocturnal hypoglycemia in elderly type 2 diabetic patients treated with insulin. Diabetes Metab Syndr. (2017) 11:119–24. 10.1016/j.dsx.2016.08.023 [DOI] [PubMed] [Google Scholar]
- 25.Jensen MH, Dethlefsen C, Vestergaard P, Hejlesen O. Prediction of nocturnal hypoglycemia from continuous glucose monitoring data in people with type 1 diabetes: a proof-of-concept study. J Diabetes Sci Technol. (2020) 14:250–6. 10.1177/1932296819868727 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Bertachi A. Vi nals C, Biagi L, Contreras I, VehíJ, Conget I, et al. Prediction of nocturnal hypoglycemia in adults with type 1 diabetes under multiple daily injections using continuous glucose monitoring and physical activity monitor. Sensors. (2020) 20:1705. 10.3390/s20061705 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mosquera-Lopez C, Dodier R, Tyler NS, Wilson LM, El Youssef J, Castle JR, et al. Predicting and preventing nocturnal hypoglycemia in type 1 diabetes using big data analytics and decision theoretic analysis. Diabetes Technol Ther. (2020) 22:801–11. 10.1089/dia.2019.0458 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Calhoun P, Levine RA, Fan J. Repeated measures random forests (RMRF): Identifying factors associated with nocturnal hypoglycemia. Biometrics. (2021) 77:343–51. 10.1111/biom.13284 [DOI] [PubMed] [Google Scholar]
- 29.Marx A, Di Stefano F, Leutheuser H, Chin-Cheong K, Pfister M, Burckhardt MA, et al. Blood glucose forecasting from temporal and static information in children with T1D. Front Pediatr. (2023) 11:1296904. 10.3389/fped.2023.1296904 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cohen O, Andersen G, Huang S, Zijlstra E, O'Sullivan O, Welsh JB, et al. Accuracy of the continuous glucose sensor used with the predictive low glucose management system. Diabetes Technol Ther. (2016) 18:A81. [Google Scholar]
- 31.Shah VN, Laffel LM, Wadwa RP, Garg SK. Performance of a factory-calibrated real-time continuous glucose monitoring system utilizing an automated sensor applicator. Diabetes Technol Ther. (2018) 20:428–33. 10.1089/dia.2018.0143 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Medtronic . Enlite Glukosesensor. (2018). Available at: https://www.medtronic.com/de-de/diabetes/home/produkte/kontinuierliche-glukosemessung/enlite-glukosesensor.html (accessed August 2, 2024).
- 33.Abbott Diabetes. Care Fact sheet: FreeStyle Libre 2 Messgenauigkeit (2018). [Google Scholar]
- 34.Alva S, Bailey T, Brazg R, Budiman ES, Castorino K, Christiansen MP, et al. Accuracy of a 14-day factory-calibrated continuous glucose monitoring system with advanced algorithm in pediatric and adult population with diabetes. J Diabetes Sci Technol. (2022) 16:70–7. 10.1177/1932296820958754 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Adolfsson P, Taplin CE, Zaharieva DP, Pemberton J, Davis EA, Riddell MC, et al. ISPAD clinical practice consensus guidelines 2022: exercise in children and adolescents with diabetes. Pediatr Diabetes. (2022) 23:1341–72. 10.1111/pedi.13452 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Biofourmis . Data Glossary. (2020). [Google Scholar]
- 37.Ainsworth BE, Haskell WL, Herrmann SD, Meckes N, Bassett Jr DR, Tudor-Locke C, et al. 2011 compendium of physical activities: a second update of codes and MET values. Med Sci Sports Exerc. (2011) 43:1575–81. 10.1249/MSS.0b013e31821ece12 [DOI] [PubMed] [Google Scholar]
- 38.Danne T, Nimri R, Battelino T, Bergenstal RM, Close KL, DeVries JH, et al. International consensus on use of continuous glucose monitoring. Diabetes Care. (2017) 40:1631–40. 10.2337/dc17-1600 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Niemann H. Pattern Analysis and Understanding. Berlin: Springer Berlin Heidelberg. (2013). [Google Scholar]
- 40.Cho K, van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, et al. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Doha, Qatar: Association for Computational Linguistics; (2014). p. 1724–34. Available at: https://aclanthology.org/D14-1179 (accessed September 12, 2024). [Google Scholar]
- 41.Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. (1997) 9:1735–80. 10.1162/neco.1997.9.8.1735 [DOI] [PubMed] [Google Scholar]
- 42.Murphy KP. Probabilistic Machine Learning: Advanced Topics. Cambridge, MA: MIT Press. (2023). [Google Scholar]
- 43.Li L, Jamieson K, DeSalvo G, Rostamizadeh A, Talwalkar A. Hyperband: a novel bandit-based approach to hyperparameter optimization. J Mach Learn Res. (2017) 18:6765–816. [Google Scholar]
- 44.He K, Zhang X, Ren S, Sun J. Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: 2015 IEEE International Conference on Computer Vision (ICCV) (2015). p. 1026–1034. 10.1109/ICCV.2015.123 [DOI] [Google Scholar]
- 45.Nair V, Hinton GE. Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on International Conference on Machine Learning. ICML'10. Madison, WI, USA: Omnipress; (2010). p. 807–814. [Google Scholar]
- 46.Chung J, Gulcehre C, Cho K, Bengio Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. In: NIPS 2014 Workshop on Deep Learning, December 2014 (2014). [Google Scholar]
- 47.Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. arXiv [preprint] arXiv:14126980. (2014). Available at: https://hdl.handle.net/11245/1.505367
- 48.Ruder S. An overview of gradient descent optimization algorithms. CoRR abs/1609.04747 (2016). Available at: http://arxiv.org/abs/1609.04747 (accessed September 12, 2024).
- 49.Masters D, Luschi C. Revisiting small batch training for deep neural networks. arXiv [preprint] arXiv:180407612. (2018). [Google Scholar]
- 50.Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. Cham: Springer. (2009). [Google Scholar]
- 51.Sun Y, Wong AK, Kamel MS. Classification of imbalanced data: a review. Int J Pattern Recognit Artif . (2009) 23:687–719. 10.1142/S0218001409007326 [DOI] [Google Scholar]
- 52.Chinchor N. MUC-4 Evaluation Metrics. In: Fourth Message Uunderstanding Conference (MUC-4): Proceedings of a Conference Held in McLean, Virginia, June 16-18, 1992. (1992). Available at: https://aclanthology.org/M92-1002 (accessed September 12, 2024).
- 53.Duda RO, Hart PE, Stork D. Pattern Classification. Hoboken, NJ: John Wiley & Sons. (2006). [Google Scholar]
- 54.Butte NF, Watson KB, Ridley K, Zakeri IF, McMurray RG, Pfeiffer KA, et al. A youth compendium of physical activities: activity codes and metabolic intensities. Med Sci Sports Exerc. (2018) 50:246. 10.1249/MSS.0000000000001430 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Jauch-Chara K, Hallschmid M, Gais S, Oltmanns KM, Peters A, Born J, et al. Awakening and counterregulatory response to hypoglycemia during early and late sleep. Diabetes. (2007) 56:1938–42. 10.2337/db07-0044 [DOI] [PubMed] [Google Scholar]
- 56.Robinson R, Harris N, Ireland R, Macdonald I, Heller S. Changes in cardiac repolarization during clinical episodes of nocturnal hypoglycaemia in adults with type 1 diabetes. Diabetologia. (2004) 47:312–5. 10.1007/s00125-003-1292-4 [DOI] [PubMed] [Google Scholar]
- 57.Lehmann V, Föll S, Maritsch M, van Weenen E, Kraus M, Lagger S, et al. Noninvasive hypoglycemia detection in people with diabetes using smartwatch data. Diabetes Care. (2023) 22:2290. 10.2337/figshare.21995150.v1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Bachmann S, Auderset A, Burckhardt MA, Szinnai G, Hess M, Zumsteg U, et al. Autonomic cardiac regulation during spontaneous nocturnal hypoglycemia in children with type 1 diabetes. Pediatr Diabet. (2021) 22:1023–30. 10.1111/pedi.13262 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Koeneman M, Olde Bekkink M. Meijel Lv, Bredie S, de Galan B. Effect of hypoglycemia on heart rate variability in people with type 1 diabetes and impaired awareness of hypoglycemia. J Diabetes Sci Technol. (2022) 16:1144–9. 10.1177/19322968211007485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Cichosz SL, Frystyk J, Tarnow L, Fleischer J. Combining information of autonomic modulation and CGM measurements enables prediction and improves detection of spontaneous hypoglycemic events. J Diabetes Sci Technol. (2014) 9:132–7. 10.1177/1932296814549830 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Marling C, Bunescu R. The OhioT1DM dataset for blood glucose level prediction: update 2020. In: CEUR Workshop Proceedings. Milwaukee, WI: NIH Public Access; (2020). p. 71. [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The presented datasets in this article are not publicly available to ensure participant privacy. Requests to access the dataset should be directed to the corresponding authors.