Comparison of machine learning models for the prediction of mortality of patients with unplanned extubation in intensive care units

Meng Hsuen Hsieh; Meng Ju Hsieh; Chin-Ming Chen; Chia-Chang Hsieh; Chien-Ming Chao; Chih-Cheng Lai

doi:10.1038/s41598-018-35582-2

. 2018 Nov 20;8:17116. doi: 10.1038/s41598-018-35582-2

Comparison of machine learning models for the prediction of mortality of patients with unplanned extubation in intensive care units

Meng Hsuen Hsieh ^1,^#, Meng Ju Hsieh ^2,^#, Chin-Ming Chen ^3,^4,^✉, Chia-Chang Hsieh ⁵, Chien-Ming Chao ⁶, Chih-Cheng Lai ^6,^✉

PMCID: PMC6244193 PMID: 30459331

Abstract

Unplanned extubation (UE) can be associated with fatal outcome; however, an accurate model for predicting the mortality of UE patients in intensive care units (ICU) is lacking. Therefore, we aim to compare the performances of various machine learning models and conventional parameters to predict the mortality of UE patients in the ICU. A total of 341 patients with UE in ICUs of Chi-Mei Medical Center between December 2008 and July 2017 were enrolled and their demographic features, clinical manifestations, and outcomes were collected for analysis. Four machine learning models including artificial neural networks, logistic regression models, random forest models, and support vector machines were constructed and their predictive performances were compared with each other and conventional parameters. Of the 341 UE patients included in the study, the ICU mortality rate is 17.6%. The random forest model is determined to be the most suitable model for this dataset with F₁ 0.860, precision 0.882, and recall 0.850 in the test set, and an area under receiver operating characteristic (ROC) curve of 0.910 (SE: 0.022, 95% CI: 0.867–0.954). The area under ROC curves of the random forest model was significantly greater than that of Acute Physiology and Chronic Health Evaluation (APACHE) II (0.779, 95% CI: 0.716–0.841), Therapeutic Intervention Scoring System (TISS) (0.645, 95% CI: 0.564–0.726), and Glasgow Coma scales (0.577, 95%: CI 0.497–0.657). The results revealed that the random forest model was the best model to predict the mortality of UE patients in ICUs.

Introduction

Acute respiratory failure is a common clinical condition in the intensive care unit (ICU), and most patients with acute respiratory failure require endotracheal intubation with mechanical ventilation (MV) support. For these patients, both endotracheal tube and mechanical ventilator are essential for life support devices. However, unplanned extubation (UE) – an accidental removal of an endotracheal tube (ETT), may develop in about 2–16% of acute respiratory failure patients requiring MV^1–8. Although some UE patients may have successful extubation without re-intubation, other patients experiencing UE may have severe complications, such as aspiration pneumonia, unstable hemodynamic status, airway obstruction, bronchospasm, respiratory failure, prolonged MV uses, and a prolonged length of stay in ICUs and hospitals^9–13. Most unfortunately, some UE patients may present a fatal outcome; the mortality rate can even be as high as 25%^7,13,14.

Several studies reported that tachypnea before UE, underlying uremia or liver cirrhosis, severe conditions with higher APACHE II scores, not undergoing the weaning process, reintubation, chronic neurological disease, and emergency surgery were found to be significantly associated with high mortality among UE patients^4,15. Since clinical assessment of illness severity is especially important for the prediction of the mortality and morbidity of critically ill patients, several researchers have attempted to design various scoring systems to determine illness severity for the prediction of patient prognosis. The Acute Physiology, Age, Chronic Health Evaluation (APACHE-II) score predicts the mortality risk for critically ill patients in the ICU¹⁶, whereas the Therapeutic Intervention Scoring System (TISS) assesses illness severity. Both systems are often used in conjunction to assess patient prognosis¹⁷. The usage of these scales, however, comes with limitations. The TISS-28 includes 28 items, divided into seven groups: basic activities, ventilatory support, cardiovascular support, renal support, neurological support, metabolic support, and specific interventions¹⁸. However, its accuracy in predicting actual mortality, alongside the APACHE II system, is limited¹⁹.

In recent years, multivariate outcome prediction models such as artificial neural networks (ANN), logistic regression models (LR), random forest models (RF), and support vector machines (SVM) have been developed in many areas of health care research^20–25. In this study, we aim to construct several machine learning models to predict the mortality of UE patients and compare their predicting performance with other conventional parameters.

Materials and Methods

Patients and setting

This study was conducted in eight adult ICUs of Chi-Mei Medical Center from December 1, 2008 through July 31, 2017. This is a 1288-bed tertiary medical center with 96 ICU beds: 48 medical ICU beds, 9 cardiac beds, and 39 surgical beds for adults. Every year, an average of more than 5,000 patients are admitted to the ICU. The ICU is covered by intensivists, senior residents, nurses, respiratory therapists, dietitians, physical therapists, and clinical pharmacists. Each shift had the same workload and the patient-to-nursing staff ratio is of 2:1. There were no differences in nursing experience by shift. Each respiratory therapist was responsible for fewer than 10 patients at the same time on every shift. The ICU team made rounds at least once daily, and respiratory therapists were responsible for all the weaning processes and spontaneous breathing trials of all MV patients. An UE was defined as the dislodgement or removal of the ETT from the trachea in a patient undergoing invasive MV at a time that was not specifically planned for or ordered by the physicians in charge of the patient. During the study period, a total of 341 patients experiencing UE were enrolled in this study, and their demographic and clinical information, laboratory results, comorbidities, severity scores, mortality, and length of stays for both ICU and hospital were collected for analysis. Our elderly patients (≥65 years) were about 58.7% (201/341) of all UE patients. The data were retrospectively collected and then analyzed. Therefore, informed consent was specifically waived and the study was approved by the Institutional Review Board of Chi Mei Medical Center (IRB: 10705–011). All methods were performed in accordance with the relevant guidelines and regulations.

Constructing data sets

All features are extracted from the original dataset. The categorical data is cleaned and one-hot encoded in RStudio. The age data, which is continuous, is unity-based normalized and standardized. After data processing, there are 16 input features. The features include subject sex, age, APACHE II scores, Glasgow Coma Scale (GCS), TISS scale, comorbidities, ICU length of stay in days, and hospital length of stay in days. These features are chosen due to their wide availability in ICUs.

Data description

The entire data set is comprised of 341 data points. The data is split into training and test sets at an approximate 9:1 ratio. 307 subjects are placed into the train set while 34 subjects are placed in the test set. In the overall dataset, the data distribution is unbalanced: ratio between patient death and survival is 1:4.683. To ensure that the output of the prediction model does not overfit the data, the data is weighted according to their outcome ratios when training the models.

Algorithm and training

The configuration of each model was reached through a hyperparameter selection process using the k-fold cross-validation accuracy (k = 10). We decided to use k-fold cross-validation instead of holdout cross-validation due to the limited number of subjects. The hyperparameter selection process was independently performed for each model type.

Artificial Neural Network Model

We began the model construction process by testing a three-layer perceptron network with a hidden layer that has half the number of neurons compared to the input layer. Using this configuration, we tested various activation functions in the hidden layer (Rectified Linear Unit (ReLU), Scaled Exponential Linear Unit (SeLU), and inverse tangent (tanh)) and the output layer (Softmax and sigmoid). If the model was under-fitting, we increased the number of hidden layers and neurons. If the model was over-fitting, we added regularization or decreased the number of hidden layers and neurons. Once the model achieved high precision and recall, we applied various optimizers (Root Mean Square Propagation (RMSProp), Adam, Adadelta, Adam with Nesterov Momentum, and PowerSign) to the same architecture and determined if the model would perform better. We would decrease the learning rate if the model did not converge, but this was not found necessary.

The final ANN model consists of one input layer of 16 dimensions, a hidden layer of 24 dimensions, and an output layer of 2 dimensions. The network is trained using stochastic gradient descent and optimized using Adam with default parameters outlined by Kingma et al.²⁶. The neural network is trained for 200 epochs. We used the SeLU activation function at each layer and the Softmax at the output layer²⁷. Dropout regularization of 20% is applied at the input layer and 50% at the output layer²⁸. The categorical cross entropy error function for binary classification is used as the loss function. The ANN model is implemented using the Tensorflow framework (version 1.9.0)²⁹.

Logistic Regression Model

First, we used the default configuration to determine the best optimizer. We tested different solvers, which include the Newton-conjugate gradient method (Newton-CG), Limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) algorithm, stochastic average gradient descent, and the liblinear solver, and compared their performances. Then, we trained models with different regularization strengths on a linear scale.

The final LR model used L₂ regularization with primal formulation. Primal formulation was used because there are more samples than features. Stochastic average gradient descent was used as the optimizer. The one-vs-rest scheme was used as the loss function. The regularization strength was set to 1.0, and the model was trained for 100 iterations before convergence. The LR software was implemented using the scikit-learn library (version 0.19.1)³⁰ and the LIBLINEAR library (version 3.21)³¹.

Random Forest Model

First, we trained the data on a single decision tree model to determine the optimal depth. We started with a depth of one and increased the depth until the model began to overfit, or when the precision and recall of the train and test sets began to diverge. Then, we trained random forest models with various number of trees. We started with one tree and increased the number of trees until the out-of-bag error did not decrease further.

The final RF model used ten separate decision tree estimators. Each decision tree used Gini impurity to measure the quality of split. The minimum number of samples required to split a node was set to two, and the minimum samples per leaf is set to one. All trees had a maximum depth of four; this was done to prevent the model from overfitting the training set. Probability estimates were used to plot the ROC curve. The RF model was implemented with the scikit-learn framework (version 0.19.2)³⁰.

Support Vector Machine Model

The SVM model is a C-support vector classification (C-SVC) model. We began by testing out various kernel types (linear, polynomial, sigmoid, and radial-basis function kernels) using the default kernel coefficient (gamma) and C value. We tested the polynomial kernel with degree three. Then, we trained different models with varying gamma and C values on a logarithmic scale.

The final SVM model used a radial basis function (RBF) as its kernel with the shrinking heuristic enabled. The model used a C value of one and a gamma value of the reciprocal of the number of features. Additionally, probability estimates were calculated in order to plot a ROC curve for the model. The SVM model was implemented using the LIBSVM library (version 3.21)³².

Statistical analyses

Mean values, standard deviations, and group sizes were used to summarize the results for continuous variables. The differences between the survival and non-survival group at hospital discharge were examined by univariate analysis with a Student t test and a Chi-square test. A p value < 0.05 was considered statistically significant. Statistical analysis of the data was done with SPSS 13.0 for Windows (SPSS, Inc., Il, USA).

Since the data distribution is unbalanced, accuracy is not a reliable measurement of prediction model performance³³. Instead, we used the weighted averaged F₁, precision and recall values to measure model performance. These three metrics are calculated for the train set, the test set, and all data.

The Receiving Operating Characteristic (ROC) curve is also used as a metric to measure prediction model performance. The area under ROC curve (AUROC) of each prediction model was pairwise-compared using the DeLong test³⁴. The area under ROC curve of the prediction models were also compared to the those of the control predictors of the original data set, which include the APACHE II score, GCS, and TISS scale.

Results

Clinical features of UE patients

Table 1 shows the demographic and clinical characteristics of the study population. Of the 341 patients included in the study, 67.1% were male and 32.8% were female. The mean age of survivors was 64.96 years, while the mean age of non-survivors was 66.85 years. The ICU mortality rate is 17.6%. The mean APACHE II score among non-survivors is 24.23, which is significantly higher than that of the survivors, at 15.92 (p < 0.001). The non-survivor group had higher risks of underlying cancer, liver cirrhosis and uremia than the survivor group. The mean length of stay in ICU and hospital were 13.7 days and 36.12 days respectively for the survivor group, and 21.66 and 44.55 days respectively for the non-survivor group.

Table 1.

The demographic and clinical characteristics of ICU patients.

Variable	Survivor n = 281	Non-survivor n = 60	P value
Sex (male/female)	183/98	46/14
Age, year	64.96 ± 16.86	66.85 ± 14.36	0.420
BMI	23.56 ± 4.33	23.47 ± 5.02	0.894
APACHE II scores	15.92 ± 7.83	24.23 ± 8.55	<0.001
Glasgow Coma scales	10.46 ± 3.73	9.50 ± 4.13	0.076
TISS scales	26.75 ± 8.00	31.00 ± 9.21	0.001
Comorbidities
Cancer	18 (6)	12 (20)	0.001
COPD	41 (15)	7 (12)	0.554
Coronary artery disease	69 (25)	14 (23)	0.841
Liver cirrhosis	5 (2)	10 (17)	<0.001
Uremia	14 (5)	13 (22)	<0.001
Stroke	92 (33)	13 (22)	0.092
Diabetes	67 (24)	20 (33)	0.126
ICU days	13.70 ± 11.64	21.66 ± 20.79	<0.001
Hospital days	36.12 ± 26.64	44.55 ± 41.99	0.049

Open in a new tab

Data are presented as mean ± standard deviation or n (%).

BMI = Body Mass Index (kg/m²).

APACHE = Acute Physiology and Chronic Health Evaluation.

TISS = Therapeutic Intervention Scoring System.

COPD = Chronic obstructive lung disease.

ICU = intensive care unit.

Results of Prediction Models and Control Predictors

The F₁, precision, and recall values of all models are shown in Table 2. In the test set, the RF model has the greatest F₁ value among all models, followed by the SVM, ANN, and LR models. The RF model also has the greatest recall and precision values among all models in the test set.

Table 2.

F₁, Precision, and Recall values for all prediction models.

Dataset	Metric	ANN	LR	RF	SVM
All	F₁	0.819	0.819	0.863	0.831
	Precision	0.846	0.848	0.888	0.848
	Recall	0.805	0.805	0.853	0.821
Train	F₁	0.829	0.706	0.824	0.775
	Precision	0.840	0.788	0.824	0.790
	Recall	0.824	0.676	0.824	0.765
Test	F₁	0.820	0.808	0.860	0.825
	Precision	0.844	0.842	0.882	0.842
	Recall	0.806	0.792	0.850	0.815

Open in a new tab

The area under ROC curves of the control predictors are outlined in Table 3 and Fig. 1. The area under ROC curves of all control predictors are significantly greater than the null hypothesis area of 0.5. The area under ROC curve for all prediction models is summarized in Table 4 and Fig. 2. The RF model had the highest area under ROC curve among all prediction models. There was no significant difference between any of the prediction models using the standard 95% confidence interval criteria. However, the p values between the RF model and all other models (p ≈ 0.08) were lower than the other comparisons.

Table 3.

Area under ROC of Control Variables.

Control Variable	AUROC	SE	95% CI
APACHE II scores	0.779	0.032	0.716–0.841
Glasgow Coma Scales	0.577	0.041	0.497–0.657
TISS scales	0.645	0.041	0.564–0.726

Open in a new tab

AUROC = Area under ROC curve.

SE = Standard Error of area under ROC curve.

CI = Confidence interval.

Table 4.

Area under ROC curve for all prediction models.

Model	AUROC	SE	95% CI
ANN	0.846	0.028	0.791–0.902
LR	0.853	0.025	0.804–0.903
RF	0.910	0.022	0.867–0.954
SVM	0.843	0.028	0.787–0.898

Open in a new tab

AUROC = Area under ROC curve.

SE = Standard Error of area under ROC curve.

CI = Confidence interval.

ROC curve of ANN, LR, RF, and SVM models.

The area under ROC curves were also pair-wise compared with those of the control variables. The RF model was the only model that had a significantly better prediction power than APACHE II, GCS, and TISS according to the area under ROC curve (p < 0.0001) (Table 5). The ANN, LR, and SVM models have a significantly better prediction power than GCS and TISS and does not have a significantly better prediction power than APACHE II (Table 6).

Table 5.

Pairwise p values of area under ROC curves of prediction models using the DeLong test.

AUROC	ANN	LR	RF	SVM
ANN	1	0.844	0.085	0.912
LR	0.844	1	0.086	0.792
RF	0.085	0.086	1	0.074
SVM	0.912	0.792	0.074	1

Open in a new tab

Table 6.

Pairwise p values of area under ROC curves of prediction models and the control variables using the DeLong test.

Control Variable	ANN	LR	RF	SVM
APACHEII	0.085	0.063	0.002	0.101
GCS	<0.0001	<0.0001	<0.0001	<0.0001
TISS	<0.0001	<0.0001	<0.0001	<0.0001

Open in a new tab

Discussion

In terms of the F₁ value across all data, the RF model achieves the best performance among all models. The RF model is also the only model that performed significantly better than APACHE II, GCS, and TISS according to the area under ROC curve. The RF model has F₁ 0.860, precision 0.882, and recall 0.850 in the test set, and an area under ROC curve of 0.910 (SE: 0.022, 95% CI: 0.867–0.954). Therefore, in this study, we demonstrated that a random forest model is a good predictor of UE patient mortality.

We showed that when a model with a combination of multiple existing physiological scores, including APACHE II, GCS, TISS scores and eight comorbidities, are analyzed using a random forest model, the outcome of death or survival can be predicted reasonably. The usage of random forests in this study opens the feasibility for an aggregate index for a reliable patient prognosis modelling using existing scoring systems. Furthermore, the model used in this study includes features that are not included in standard APACHE II scoring such as chronic liver disease, which in Table 1 shows a significant difference between survivors and non-survivors. Ultimately, the incorporation of multiple ICU scoring systems in our proposed model shows its flexibility to extend and enhance existing frameworks for better prognosis prediction.

Several studies had tried to compare the predictive power in ICU mortality between different machine learning models^35–37. In a medical-neurological Indian ICU, Nimgaonkar et al.³⁵ showed that an ANN using 15 variables was superior to APACHE II in predicting hospital mortality (p < 0.001). Another study using the University of Kentucky Hospital’s data showed that the performance of ANN was as good as APACHE II in predicting ICU mortality³⁶. Though this study had similar findings in that this ANN model had a slight edge over APACHE II, TISS, and GCS, for predicting ICU mortality, this study further compared among different machine learning models including ANN, LR, RF, and SVM models and found that the RF model results in the highest AUROC, hence the best predictive power. It indicates that RF model may be a better machine learning method for prediction of the outcome of UE patients. In summary, while all of the tentative models can help in predicting the outcome of ICU patients, even for a specific group – UE patients; the RF model may even outperform conventional predicting tools, such as APACHE scores.

In this study, the overall mortality rate of UE patients was 17.6%, and the mortality cases had more underlying cancer, liver cirrhosis, and uremia. All these findings are consistent with the previous study⁴, and remind intensivists that they should be aware of the high mortality of UE patients, especially for the patients with multiple co-morbidities. In contrast to mortality cases, more than 80% of patients had successful extubation and survival-to-discharge. This finding might hint that the extubation was delayed in these cases with successful UE. The possible cause of delayed extubation may be due to that the final decision regarding extubation should be made by intensivists even though hospitals have standard weaning and extubation protocols.

Conclusion

The results revealed that the random forest model was the best model for predicting the mortality of UE patients in the ICUs. Such a model will be helpful for predicting ICU patients’ mortality.

Limitations of our study include the lack of more patient data and features. Goodfellow et al. recommends that a supervised deep learning algorithm will generally achieve acceptable performance with more than 5000 data points³⁸. However, this study shows that we can still develop a good prediction model using a limited data set. Future studies can be performed to determine whether similar datasets with a larger number of samples will produce comparable results. In addition, clinical lab data, such as liver and renal function, are omitted because these features fluctuate frequently during the patient’s stay. We chose features that were consistent for each patient and were known to predict patient mortality, which could yield good results according to this study.

Author Contributions

C.M. Chen is the guarantor of this manuscript, M.H. Hsieh, M.J. Hsieh, C.C. Hsieh, C.M. Chao and C.C. Lai contributed to the conception and design of the study, M.H. Hsieh, M.J. Hsieh and C.C. Hsieh analysed and interpreted the data, C.C. Lai, C.M. Chen and M.H. Hsieh drafted the manuscript. All authors reviewed the manuscript.

Competing Interests

The authors declare no competing interests.

Footnotes

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Meng Hsuen Hsieh and Meng Ju Hsieh contributed equally.

Contributor Information

Chin-Ming Chen, Email: chencm3383@yahoo.com.tw.

Chih-Cheng Lai, Email: dtmed141@gmail.com.

References

1.Betbese AJ, Perez M, Bak E, Mancebo J. A prospective study of unplanned endotracheal extubation in intensive care unit patients. Crit. Care Med. 1998;26:1180–1186. doi: 10.1097/00003246-199807000-00016. [DOI] [PubMed] [Google Scholar]
2.Boulain T. Unplanned extubations in the adult intensive care unit: a prospective multicenter study. Association des Reanimateurs du Centre-Ouest. Am. J. Respir. Crit. Care Med. 1998;157:1131–1137. doi: 10.1164/ajrccm.157.4.9702083. [DOI] [PubMed] [Google Scholar]
3.Chao CM, et al. Multidisciplinary interventions and continuous quality improvement to reduce unplanned extubation in adult intensive care units: A 15-year experience. Medicine (Baltimore) 2017;96:e6877. doi: 10.1097/MD.0000000000006877. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Chao CM, et al. Prognostic factors and outcomes of unplanned extubation. Sci. Rep. 2017;7:8636. doi: 10.1038/s41598-017-08867-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Christie JM, Dethlefsen M, Cane RD. Unplanned endotracheal extubation in the intensive care unit. J. Clin. Anesth. 1996;8:289–293. doi: 10.1016/0952-8180(96)00037-2. [DOI] [PubMed] [Google Scholar]
6.Coppolo DP, May JJ. Self-extubations. A 12-month experience. Chest. 1990;98:165–169. doi: 10.1378/chest.98.1.165. [DOI] [PubMed] [Google Scholar]
7.de Groot RI, Dekkers OM, Herold IH, de Jonge E, Arbous MS. Risk factors and outcomes after unplanned extubations on the ICU: a case-control study. Crit. Care. 2011;15:R19. doi: 10.1186/cc9964. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Vassal T, et al. Prospective evaluation of self-extubations in a medical intensive care unit. Intensive Care Med. 1993;19:340–342. doi: 10.1007/BF01694708. [DOI] [PubMed] [Google Scholar]
9.Pandey CK, et al. Self-extubation in intensive care and re-intubation predictors: a retrospective study. J. Indian Med. Assoc. 2002;100(11):14–16. [PubMed] [Google Scholar]
10.Birkett KM, Southerland KA, Leslie GD. Reporting unplanned extubation. Intensive Crit. Care Nurs. 2005;21:65–75. doi: 10.1016/j.iccn.2004.07.012. [DOI] [PubMed] [Google Scholar]
11.Epstein SK, Nevins ML, Chung J. Effect of unplanned extubation on outcome of mechanical ventilation. Am. J. Respir. Crit. Care Med. 2000;161:1912–1916. doi: 10.1164/ajrccm.161.6.9908068. [DOI] [PubMed] [Google Scholar]
12.de Lassence A, et al. Impact of unplanned extubation and reintubation after weaning on nosocomial pneumonia risk in the intensive care unit: a prospective multicenter study. Anesthesiology. 2002;97:148–156. doi: 10.1097/00000542-200207000-00021. [DOI] [PubMed] [Google Scholar]
13.Krinsley JS, Barone JE. The drive to survive: unplanned extubation in the ICU. Chest. 2005;128:560–566. doi: 10.1378/chest.128.2.560. [DOI] [PubMed] [Google Scholar]
14.Phoa LL, Pek WY, Syap W, Johan A. Unplanned extubation: a local experience. Singapore Med. J. 2002;43:504–508. [PubMed] [Google Scholar]
15.Lee JH, et al. Clinical outcomes after unplanned extubation in a surgical intensive care population. World J. Surg. 2014;38:203–210. doi: 10.1007/s00268-013-2249-5. [DOI] [PubMed] [Google Scholar]
16.Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit. Care Med. 1985;13:818–829. doi: 10.1097/00003246-198510000-00009. [DOI] [PubMed] [Google Scholar]
17.Vincent JL, Moreno R. Clinical review: scoring systems in the critically ill. Crit. Care. 2010;14:207. doi: 10.1186/cc8204. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Miranda DR, de Rijk A, Schaufeli W. Simplified Therapeutic Intervention Scoring System: the TISS-28 items–results from a multicenter study. Crit. Care Med. 1996;24:64–73. doi: 10.1097/00003246-199601000-00012. [DOI] [PubMed] [Google Scholar]
19.Saleh AAM, Sultan I, Abdel-Lateif A. comparison of the mortality prediction of different ICU scoring systems (APACHE II and III, SAPS II, and SOFA) in a single-center ICU subpopulation with acute respiratory distress syndrome. Egyptian. J. Chest Dis.Tuberc. 2015;64:843–848. doi: 10.1016/j.ejcdt.2015.05.012. [DOI] [Google Scholar]
20.DiRusso Stephen M., Sullivan Thomas, Holly Cheryl, Cuff Sara Nealon, Savino John. An Artificial Neural Network as a Model for Prediction of Survival in Trauma Patients: Validation for a Regional Trauma Area. The Journal of Trauma: Injury, Infection, and Critical Care. 2000;49(2):212–223. doi: 10.1097/00005373-200008000-00006. [DOI] [PubMed] [Google Scholar]
21.LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
22.Hsieh MH, et al. An artificial neural network model for predicting successful extubation in intensive care units. J. Clin. Med. 2018;7:240. doi: 10.3390/jcm7090240. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Tu JV. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J. Clin. Epidemiol. 1996;49:1225–31. doi: 10.1016/S0895-4356(96)00002-9. [DOI] [PubMed] [Google Scholar]
24.Yang F, Wang HZ, Mi H, Lin CD, Cai WW. Using random forest for reliable classification and cost-sensitive learning for medical diagnosis. BMC. Bioinformatics. 2009;10(Suppl 1):S22. doi: 10.1186/1471-2105-10-S1-S22. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Furey TS, et al. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics. 2000;16:906–914. doi: 10.1093/bioinformatics/16.10.906. [DOI] [PubMed] [Google Scholar]
26.Kingma, D. & Adam, J. B. A method for stochastic optimization. International Conference on Learning Representations (ICLR) (2015).
27.Klambauer, G. et al. Self-normalizing neural networks. Advances in Neural Information Processing Systems (2017).
28.Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: A simple way to prevent neural networks from overfitting. J. Machine Learning. Res. 2014;15:1929–1958. [Google Scholar]
29.Abadi, M. et al. Tensor Flow: A System for Large-Scale Machine Learning. OSDI. Vol. 16 (2016).
30.Pedregosa F, et al. Scikit-learn: Machine learning in Python. J of Machine Learning res. 2011;12:2825–2830. [Google Scholar]
31.Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ. LIBLINEAR: A library for large linear classification. Journal of machine learning research. 2008;9:1871–1874. [Google Scholar]
32.Chang Chih-Chung, Lin Chih-Jen. LIBSVM. ACM Transactions on Intelligent Systems and Technology. 2011;2(3):1–27. doi: 10.1145/1961189.1961199. [DOI] [Google Scholar]
33.He H, Garcia EA. Learning from imbalanced data. IEEE Transactions on knowledge and data engineering. 2009;21:1263–1284. doi: 10.1109/TKDE.2008.239. [DOI] [Google Scholar]
34.De Long, E. R., De Long, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 837–845 (1988). [PubMed]
35.Nimgaonkar A, et al. Prediction of mortality in an Indian intensive care unit. Comparison between APACHE II and artificial neural networks. Intensive Care. Med. 2004;30:248–253. doi: 10.1007/s00134-003-2105-4. [DOI] [PubMed] [Google Scholar]
36.Kim S, Kim W, Park RW. A comparison of intensive care unit mortality prediction models through the use of data mining techniques. Healthc. Inform. Res. 2011;17:232–243. doi: 10.4258/hir.2011.17.4.232. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Wong LS, Young JD. A comparison of ICU mortality prediction using the APACHE II scoring system and artificial neural networks. Anaesthesia. 1999;54:1048–1054. doi: 10.1046/j.1365-2044.1999.01104.x. [DOI] [PubMed] [Google Scholar]
38.Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning. MIT Press (2016).

[CR1] 1.Betbese AJ, Perez M, Bak E, Mancebo J. A prospective study of unplanned endotracheal extubation in intensive care unit patients. Crit. Care Med. 1998;26:1180–1186. doi: 10.1097/00003246-199807000-00016. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Boulain T. Unplanned extubations in the adult intensive care unit: a prospective multicenter study. Association des Reanimateurs du Centre-Ouest. Am. J. Respir. Crit. Care Med. 1998;157:1131–1137. doi: 10.1164/ajrccm.157.4.9702083. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Chao CM, et al. Multidisciplinary interventions and continuous quality improvement to reduce unplanned extubation in adult intensive care units: A 15-year experience. Medicine (Baltimore) 2017;96:e6877. doi: 10.1097/MD.0000000000006877. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Chao CM, et al. Prognostic factors and outcomes of unplanned extubation. Sci. Rep. 2017;7:8636. doi: 10.1038/s41598-017-08867-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Christie JM, Dethlefsen M, Cane RD. Unplanned endotracheal extubation in the intensive care unit. J. Clin. Anesth. 1996;8:289–293. doi: 10.1016/0952-8180(96)00037-2. [DOI] [PubMed] [Google Scholar]

[CR6] 6.Coppolo DP, May JJ. Self-extubations. A 12-month experience. Chest. 1990;98:165–169. doi: 10.1378/chest.98.1.165. [DOI] [PubMed] [Google Scholar]

[CR7] 7.de Groot RI, Dekkers OM, Herold IH, de Jonge E, Arbous MS. Risk factors and outcomes after unplanned extubations on the ICU: a case-control study. Crit. Care. 2011;15:R19. doi: 10.1186/cc9964. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Vassal T, et al. Prospective evaluation of self-extubations in a medical intensive care unit. Intensive Care Med. 1993;19:340–342. doi: 10.1007/BF01694708. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Pandey CK, et al. Self-extubation in intensive care and re-intubation predictors: a retrospective study. J. Indian Med. Assoc. 2002;100(11):14–16. [PubMed] [Google Scholar]

[CR10] 10.Birkett KM, Southerland KA, Leslie GD. Reporting unplanned extubation. Intensive Crit. Care Nurs. 2005;21:65–75. doi: 10.1016/j.iccn.2004.07.012. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Epstein SK, Nevins ML, Chung J. Effect of unplanned extubation on outcome of mechanical ventilation. Am. J. Respir. Crit. Care Med. 2000;161:1912–1916. doi: 10.1164/ajrccm.161.6.9908068. [DOI] [PubMed] [Google Scholar]

[CR12] 12.de Lassence A, et al. Impact of unplanned extubation and reintubation after weaning on nosocomial pneumonia risk in the intensive care unit: a prospective multicenter study. Anesthesiology. 2002;97:148–156. doi: 10.1097/00000542-200207000-00021. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Krinsley JS, Barone JE. The drive to survive: unplanned extubation in the ICU. Chest. 2005;128:560–566. doi: 10.1378/chest.128.2.560. [DOI] [PubMed] [Google Scholar]

[CR14] 14.Phoa LL, Pek WY, Syap W, Johan A. Unplanned extubation: a local experience. Singapore Med. J. 2002;43:504–508. [PubMed] [Google Scholar]

[CR15] 15.Lee JH, et al. Clinical outcomes after unplanned extubation in a surgical intensive care population. World J. Surg. 2014;38:203–210. doi: 10.1007/s00268-013-2249-5. [DOI] [PubMed] [Google Scholar]

[CR16] 16.Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit. Care Med. 1985;13:818–829. doi: 10.1097/00003246-198510000-00009. [DOI] [PubMed] [Google Scholar]

[CR17] 17.Vincent JL, Moreno R. Clinical review: scoring systems in the critically ill. Crit. Care. 2010;14:207. doi: 10.1186/cc8204. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Miranda DR, de Rijk A, Schaufeli W. Simplified Therapeutic Intervention Scoring System: the TISS-28 items–results from a multicenter study. Crit. Care Med. 1996;24:64–73. doi: 10.1097/00003246-199601000-00012. [DOI] [PubMed] [Google Scholar]

[CR19] 19.Saleh AAM, Sultan I, Abdel-Lateif A. comparison of the mortality prediction of different ICU scoring systems (APACHE II and III, SAPS II, and SOFA) in a single-center ICU subpopulation with acute respiratory distress syndrome. Egyptian. J. Chest Dis.Tuberc. 2015;64:843–848. doi: 10.1016/j.ejcdt.2015.05.012. [DOI] [Google Scholar]

[CR20] 20.DiRusso Stephen M., Sullivan Thomas, Holly Cheryl, Cuff Sara Nealon, Savino John. An Artificial Neural Network as a Model for Prediction of Survival in Trauma Patients: Validation for a Regional Trauma Area. The Journal of Trauma: Injury, Infection, and Critical Care. 2000;49(2):212–223. doi: 10.1097/00005373-200008000-00006. [DOI] [PubMed] [Google Scholar]

[CR21] 21.LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]

[CR22] 22.Hsieh MH, et al. An artificial neural network model for predicting successful extubation in intensive care units. J. Clin. Med. 2018;7:240. doi: 10.3390/jcm7090240. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Tu JV. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J. Clin. Epidemiol. 1996;49:1225–31. doi: 10.1016/S0895-4356(96)00002-9. [DOI] [PubMed] [Google Scholar]

[CR24] 24.Yang F, Wang HZ, Mi H, Lin CD, Cai WW. Using random forest for reliable classification and cost-sensitive learning for medical diagnosis. BMC. Bioinformatics. 2009;10(Suppl 1):S22. doi: 10.1186/1471-2105-10-S1-S22. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Furey TS, et al. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics. 2000;16:906–914. doi: 10.1093/bioinformatics/16.10.906. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Kingma, D. & Adam, J. B. A method for stochastic optimization. International Conference on Learning Representations (ICLR) (2015).

[CR27] 27.Klambauer, G. et al. Self-normalizing neural networks. Advances in Neural Information Processing Systems (2017).

[CR28] 28.Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: A simple way to prevent neural networks from overfitting. J. Machine Learning. Res. 2014;15:1929–1958. [Google Scholar]

[CR29] 29.Abadi, M. et al. Tensor Flow: A System for Large-Scale Machine Learning. OSDI. Vol. 16 (2016).

[CR30] 30.Pedregosa F, et al. Scikit-learn: Machine learning in Python. J of Machine Learning res. 2011;12:2825–2830. [Google Scholar]

[CR31] 31.Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ. LIBLINEAR: A library for large linear classification. Journal of machine learning research. 2008;9:1871–1874. [Google Scholar]

[CR32] 32.Chang Chih-Chung, Lin Chih-Jen. LIBSVM. ACM Transactions on Intelligent Systems and Technology. 2011;2(3):1–27. doi: 10.1145/1961189.1961199. [DOI] [Google Scholar]

[CR33] 33.He H, Garcia EA. Learning from imbalanced data. IEEE Transactions on knowledge and data engineering. 2009;21:1263–1284. doi: 10.1109/TKDE.2008.239. [DOI] [Google Scholar]

[CR34] 34.De Long, E. R., De Long, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 837–845 (1988). [PubMed]

[CR35] 35.Nimgaonkar A, et al. Prediction of mortality in an Indian intensive care unit. Comparison between APACHE II and artificial neural networks. Intensive Care. Med. 2004;30:248–253. doi: 10.1007/s00134-003-2105-4. [DOI] [PubMed] [Google Scholar]

[CR36] 36.Kim S, Kim W, Park RW. A comparison of intensive care unit mortality prediction models through the use of data mining techniques. Healthc. Inform. Res. 2011;17:232–243. doi: 10.4258/hir.2011.17.4.232. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Wong LS, Young JD. A comparison of ICU mortality prediction using the APACHE II scoring system and artificial neural networks. Anaesthesia. 1999;54:1048–1054. doi: 10.1046/j.1365-2044.1999.01104.x. [DOI] [PubMed] [Google Scholar]

[CR38] 38.Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning. MIT Press (2016).

PERMALINK

Comparison of machine learning models for the prediction of mortality of patients with unplanned extubation in intensive care units

Meng Hsuen Hsieh

Meng Ju Hsieh

Chin-Ming Chen

Chia-Chang Hsieh

Chien-Ming Chao

Chih-Cheng Lai

Abstract

Introduction

Materials and Methods

Patients and setting

Constructing data sets

Data description

Algorithm and training

Artificial Neural Network Model

Logistic Regression Model

Random Forest Model

Support Vector Machine Model

Statistical analyses

Results

Clinical features of UE patients

Table 1.

Results of Prediction Models and Control Predictors

Table 2.

Table 3.

Figure 1.

Table 4.

Figure 2.

Table 5.

Table 6.

Discussion

Conclusion

Author Contributions

Competing Interests

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases