Skip to main content
Journal of Clinical Medicine logoLink to Journal of Clinical Medicine
. 2021 Oct 28;10(21):5021. doi: 10.3390/jcm10215021

Machine Learning Prediction Models for Mortality in Intensive Care Unit Patients with Lactic Acidosis

Pattharawin Pattharanitima 1,*, Charat Thongprayoon 2,*, Wisit Kaewput 3, Fawad Qureshi 2, Fahad Qureshi 4, Tananchai Petnak 5, Narat Srivali 6, Guido Gembillo 7, Oisin A O’Corragain 8, Supavit Chesdachai 9, Saraschandra Vallabhajosyula 10, Pramod K Guru 11, Michael A Mao 12, Vesna D Garovic 2, John J Dillon 2, Wisit Cheungpasitporn 2,*
Editor: Didier Payen
PMCID: PMC8584535  PMID: 34768540

Abstract

Background: Lactic acidosis is the most common cause of anion gap metabolic acidosis in the intensive care unit (ICU), associated with poor outcomes including mortality. We sought to compare machine learning (ML) approaches versus logistic regression analysis for prediction of mortality in lactic acidosis patients admitted to the ICU. Methods: We used the Medical Information Mart for Intensive Care (MIMIC-III) database to identify ICU adult patients with lactic acidosis (serum lactate ≥4 mmol/L). The outcome of interest was hospital mortality. We developed prediction models using four ML approaches consisting of random forest (RF), decision tree (DT), extreme gradient boosting (XGBoost), artificial neural network (ANN), and statistical modeling with forward stepwise logistic regression using the testing dataset. We then assessed model performance using area under the receiver operating characteristic curve (AUROC), accuracy, precision, error rate, Matthews correlation coefficient (MCC), F1 score, and assessed model calibration using the Brier score, in the independent testing dataset. Results: Of 1919 lactic acidosis ICU patients, 1535 and 384 were included in the training and testing dataset, respectively. Hospital mortality was 30%. RF had the highest AUROC at 0.83, followed by logistic regression 0.81, XGBoost 0.81, ANN 0.79, and DT 0.71. In addition, RF also had the highest accuracy (0.79), MCC (0.45), F1 score (0.56), and lowest error rate (21.4%). The RF model was the most well-calibrated. The Brier score for RF, DT, XGBoost, ANN, and multivariable logistic regression was 0.15, 0.19, 0.18, 0.19, and 0.16, respectively. The RF model outperformed multivariable logistic regression model, SOFA score (AUROC 0.74), SAP II score (AUROC 0.77), and Charlson score (AUROC 0.69). Conclusion: The ML prediction model using RF algorithm provided the highest predictive performance for hospital mortality among ICU patient with lactic acidosis.

Keywords: lactic acid, lactic acidosis, lactate, mortality, intensive care unit, machine learning, artificial intelligence, critical care, critical care medicine, nephrology, precision medicine, personalized medicine, individualized medicine

1. Introduction

Lactic acidosis is the most common cause of anion gap metabolic acidosis in the intensive care unit (ICU) [1,2], and it is associated with poor clinical outcomes [1,3,4,5,6]. While ICU patients with lactic acidosis typically have classic type A lactic acidosis due to tissue hypoperfusion, certain patients with liver disease, cancer, or predisposing drug use (such as metformin, salicylate) can also have coexisting type B lactic acidosis [7,8]. Lactic acidosis (serum lactate ≥4 mmol/L) is associated with high mortality in various ICU settings, including sepsis, trauma, or cardiac surgeries [3,4,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25]. Hence, the incorporation of ICU admission lactate levels in illness severity scores has been proven to improve predictive performance for in-hospital mortality [26,27].

Recently, artificial intelligence (AI) and machine learning (ML) have been increasingly utilized for precision medicine [28,29], including prediction of clinical outcomes among critically ill patients [30,31,32,33,34]. Due to the ability of ML to cope with nonlinear, complex, and multidimensional data [31,35], recent studies have demonstrated that ML approaches using ICU data provided high predictive performances that outperformed traditional analysis [32,33]. While the use of lactate levels in mortality prediction among critically ill patients has been investigated [26,27], data on mortality risk prediction among the subgroup of ICU patients with lactic acidosis are limited. Given the heterogeneity of impacts of lactic acidosis on clinical outcomes in a variety of different patient characteristics and ICU settings (such as lactic acidosis in patients with trauma, cardiac surgery, and septic shock) [9,10,11,12,13,14,15,16,17,18,19,20], an ML-based mortality prediction model for ICU patients with lactic acidosis can provide a novel individualized approach to clinical decision making for critically ill patients.

In this study, we aimed to develop and then assess various ML-based prediction model performances in predicting mortality of ICU patients with lactic acidosis in comparison to the traditional statistical model.

2. Methods

2.1. Patient Population

The Mayo Clinic Institutional Review Board approved this observational study (IRB number-21-009222). We used the Medical Information Mart for Intensive Care III (MIMIC III) database to conduct this study. MIMIC-III provides deidentified comprehensive clinical data from ICU patients at Beth Israel Deaconess Medical Center in Boston, Massachusetts, United States between 2001 and 2012 [36]. The database is widely accessible to researchers internationally under a data use agreement. If patients had multiple ICU admissions, we analyzed only the first admission.

Inclusion criteria were (1) age ≥18 years and (2) presence of lactic acidosis at ICU admission, defined as the first serum lactate measured within 48 h of ICU admission of ≥4.0 mmol/L. The exclusion criteria were (1) no serum lactate measurements within 48 h of ICU admission or (2) being admitted to the ICU for ≤24 h.

2.2. Data Collection

We abstracted data on patient characteristics, comorbidities, vital signs, organ support, and laboratory results for prediction model development. As our goal was to develop and assess a prediction model for mortality in lactic acidosis patients based on the available data at the time of ICU admission, we only used data that were present within 48 h of ICU admission for analysis. When multiple values existed, we selected the closest vital sign or laboratory value to lactic acidosis occurrence. We excluded laboratory results with more than 10% missing data. Otherwise, we imputed missing data through multiple imputation using Random Forest (RF).

2.3. Model Development

In order to utilize ML models to predict the risk of in-hospital mortality in ICU patients with lactic acidosis, we followed the TRIPOD to build these ML models (Online Supplementary) [37]. Spearman’s rank correlation was applied to assess the separate correlation of variables in the dataset and demonstrate no significant correlations (Supplementary Figure S1). Numeric data were normalized to have a standard deviation of 1 and a mean of 0 [38]. The overall study cohort was randomized into a training (80%) and testing dataset (20%) as per the Pareto principle [39]. We used the training dataset to develop ML models. The testing cohort was blinded to all methods until the final evaluation. As a reference model, we used multivariable logistic regression analysis. We conducted forward stepwise variable selection using criteria of p < 0.20 for entry cut-off.

ML models include decision tree (DT), RF, extreme gradient boosting (XGBoost), and deep learning. RF and XGBoost are both DT ensemble algorithms [40,41]. However, RF forests rely on bagging, which is a democratic process to “elect” the best decision among the subgroups of trees [40]. XGBoost is based on a gradient descent–boosting process, which is an ensemble of weak learners that is reinforced depending on the quality of the assessment [41]. We used deep learning based on a multi-layer feedforward artificial neural network (ANN) that is trained with stochastic gradient descent using back-propagation.

For DT analysis, the number of terminal nodes was determined considering the scree plot showing the relationship between the tree size and coefficient of variance. The decision tree was pruned based on cross-validated error results using the complexity parameter associated with the minimal error (Supplementary Figure S2). For the RF model, the number of trees was 500, which yielded the lowest error rate (Supplementary Figure S3), and mtry value was calculated by the square root of the number of variables [42]. For XGBoost and ANN, we created a hyperparameter tuning grid to identify the best combination of hyperparameters using cross-validation methods [43]. Detailed hyperparameters are provided in the Online Supplementary data.

2.4. Model Evaluation and Calibration

Model performance was assessed with area under the receiver operating characteristic curve (AUROC), accuracy, precision, error rate (ERR), Matthews correlation coefficient (MCC), and F1 score in the testing dataset [44,45,46]. The formula for each measure is provided in the Online Supplementary data. The Brier score was used to evaluate model calibration [47].

2.5. Explanations of the Features in the ML-Based Prediction Model That Drive Patient-Specific Predictions of Mortality

After we identified ML model with highest predictive performances, we applied the Shapley additive explanations (SHAP) values to explain which features initiate patient-specific estimates. In addition, we also applied the local interpretable model-agnostic explanations (LIME) approach to approximate a complex nonlinear model to a linear model near variables of interest.

2.6. Statistical Analysis

All analyses were performed using R, version 4.0.3 (RStudio, Inc., Boston, MA, USA; http://www.rstudio.com/ (accessed on 15 January 2021)). We used the “rpart” package for DT, “randomForest” and “randomForestExplainer” for RF, “caret” package for XGBoost and grid search, “h2o” package for ANN and SHAP, “LIME” for LIME, and the “missForest” package for missing data imputation [48].

3. Results

A total of 1919 ICU patients with lactic acidosis were eligible for analysis. Of these, 1535 and 384 were included in the training and testing dataset, respectively. Table 1 shows the clinical characteristics of patients in the training and testing datasets. Clinical characteristics between the training and testing datasets were comparable. Hospital mortality was also similar between training and testing datasets (29.8% vs. 29.7%; p = 0.97).

Table 1.

Patient characteristics in the training and testing datasets.

Characteristics All
(n = 1919)
Training Set
(n = 1535)
Testing Set
(n = 384)
p-Value
Age (years) 61.8 ± 17.1 61.5 ± 17.0 63.0 ± 17.3 0.13
Male sex 1118 (58) 889 (58) 229 (60) 0.54
Race 0.85
White 1560 (81) 1246 (81) 314 (82)
Black 152 (8) 124 (8) 28 (7)
Hispanic 79 (4) 56 (4) 14 (4)
Other 128 (7) 100 (7) 28 (7)
ICU type 0.75
Cardiac ICU 206 (11) 164 (11) 42 (11)
Cardiac surgery ICU 467 (24) 375 (24) 92 (24)
Medical ICU 605 (32) 475 (31) 130 (34)
Surgical ICU 295 (15) 243 (16) 52 (13)
Trauma/surgical ICU 346 (18) 278 (18) 68 (18)
Elixhauser Comorbidities
Congestive heart failure 456 (24) 370 (24) 86 (22) 0.48
Valvular disease 352 (18) 282 (18) 70 (18) 0.95
Pulmonary circulation disorders 133 (7) 110 (7) 23 (6) 0.42
Peripheral vascular disease 286 (15) 227 (15) 59 (15) 0.78
Hypertension 884 (46) 694 (45) 190 (49) 0.13
Paralysis 55 (3) 36 (2) 19 (5) 0.006
Neurologic disorders 174 (9) 131 (9) 43 (11) 0.10
Chronic pulmonary disease 266 (14) 204 (13) 62 (16) 0.15
Uncomplicated diabetes 385 (20) 307 (20) 78 (20) 0.89
Complicated diabetes 73 (4) 61 (4) 12 (3) 0.44
Hypothyroidism 134 (7) 108 (7) 26 (7) 0.86
Liver disease 291 (15) 240 (16) 51 (13) 0.25
Peptic ulcer 1 (0.05) 1 (0.05) 0 (0) 0.62
AIDS/HIV 27 (1) 21 (1) 6 (2) 0.77
Lymphoma 52 (3) 41 (3) 11 (3) 0.83
Metastatic cancer 136 (7) 105 (7) 31 (8) 0.40
Solid tumor 128 (7) 103 (7) 25 (7) 0.97
Rheumatoid arthritis 41 (2) 36 (2) 5 (1) 0.20
Coagulopathy 500 (26) 395 (26) 105 (27) 0.52
Obesity 97 (5) 74 (5) 23 (6) 0.35
Weight loss 68 (4) 55 (4) 13 (3) 0.85
Fluid and electrolyte disorders 843 (44) 676 (44) 167 (43) 0.85
Blood loss anemia 36 (2) 29 (2) 7 (2) 0.93
Deficiency anemia 275 (14) 232 (15) 43 (11) 0.05
Alcohol abuse 199 (10) 168 (11) 31 (8) 0.10
Drug abuse 70 (4) 57 (4) 13 (3) 0.76
Psychosis 71 (4) 60 (4) 11 (3) 0.33
Depression 104 (5) 78 (5) 26 (7) 0.19
Chronic kidney disease 25 (1) 23 (1) 2(1) 0.13
Body weight (kg) 81.7 ± 21.0 81.9 ± 20.7 81.4 ± 22.1 0.72
Vital signs
Temperature (F) 97.2 ± 2.2 97.2 ± 2.2 97.4 ± 2.0 0.23
Heart rate (per minutes) 97 ± 21 97 ± 21 97 ± 22 0.47
Systolic blood pressure (mmHg) 117 ± 26 117 ± 26 117 ± 24 0.86
Diastolic blood pressure (mmHg) 62 ± 15 62 ± 15 62 ± 15 0.91
Mean blood pressure (mmHg) 81 ± 21 82 ± 22 80 ± 18 0.32
Respiratory rate (per minutes) 17 ± 9 17 ± 9 17 ± 9 0.95
Oxygen saturation (%) 97 ± 5 97 ± 5 97 ± 5 0.18
Glasgow coma score 7.9 ± 4.9 8.3 ± 4.9 7.8 ± 4.9 0.05
Vasopressor use 1230 (64) 984 (64) 246 (64) 0.99
Ventilator use 1608 (84) 1285 (84) 323 (84) 0.85
Any renal replacement therapies 54 (3) 44 (3) 10 (3) 0.78
Hemodialysis 35 (2) 29 (2) 6 (2) 0.67
CRRT 22 (1) 18 (1) 4 (1) 0.83
Acute kidney injury 1401 (73) 1117 (73) 284 (74) 0.64
Laboratory data
BUN (mg/dL) 27 ± 21 27 ± 20 28 ± 23 0.28
eGFR (mL/min/1.73 m2) 68 ± 31 68 ± 31 67 ± 29 0.65
Sodium (mEq/L) 138 ± 5 138 ± 6 139 ± 5 0.38
Potassium (mEq/L) 4.4 ± 0.9 4.3 ± 0.9 4.4 ± 0.9 0.45
Chloride (mEq/L) 106 ± 7 107 ± 6 106 ± 7 0.79
Bicarbonate (mEq/L) 20 ± 5 20 ± 5 20 ± 5 0.81
Anion gap (mEq/L) 18 ± 6 18 ± 5 18 ± 6 0.57
Total calcium (mg/dL) 8.2 ± 1.2 8.2 ± 1.2 8.2 ± 1.1 0.91
Ionized calcium (mmol/L) 1.1 ± 0.2 1.1 ± 0.2 1.1 ± 0.1 0.60
Phosphate (mg/dL) 4.1 ± 1.8 4.1 ± 1.7 4.2 ± 1.9 0.29
Magnesium (mg/dL) 1.9 ± 0.5 1.9 ± 0.5 2.0 ± 0.5 0.60
Lactate (mmol/L) 6.2 ± 2.6 6.2 ± 2.6 6.1 ± 2.5 0.45
Glucose (mg/dL) 179 ± 89 179 ± 88 180 ± 91 0.89
Hemoglobin (g/dL) 10.6 ± 2.3 10.6 ± 2.4 10.6 ± 2.3 0.98
WBC (109 cells/L) 14.1 ± 8.3 14.0 ± 8.6 14.2 ± 7.2 0.73
Platelet (109 cells/L) 170 ± 103 178 ± 102 187 ± 105 0.13
pH 7.31 ± 0.12 7.31 ± 0.12 7.31 ± 0.12 0.94
pCO2 (mmHg) 39 ± 11 39 ± 11 39 ± 11 0.96
pO2 (mmHg) 209 ± 133 209 ± 133 210 ± 134 0.80
INR 1.8 ± 1.0 1.8 ± 1.1 1.8 ± 1.0 0.98
PTT (second) 49 ± 30 49 ± 30 48 ± 31 0.82
Culture data
Positive blood culture 197 (10) 158 (10) 39 (10) 0.94
Positive urine culture 205 (11) 171 (11) 34 (9) 0.19
Positive sputum culture 284 (15) 220 (14) 64 (17) 0.25
Hospital death 571 (30) 457 (30) 114 (30) 0.97

Abbreviations: AIDS, Acquired Immune Deficiency Syndrome; AKI, Acute Kidney Injury; BUN, Blood Urea Nitrogen; CCU, Coronary Care Unit; CHF, Chronic Heart Failure; Cl, Chloride; CRRT Continuous Renal Replacement Therapy; CSRU, Cardiac Surgery Recovery Unit; DBP, Diastolic Blood Pressure; eGFR, estimated Glomerular Filtration Rate; GCS, Glasgow Coma Scale; Hb Hemoglobin; HR, Heart Rate; ICU, Intensive Care Unit; IHD, Intermittent Hemodialysis; INR, International Normalized Ratio; K, Potassium; MAP, Mean Arterial Pressure; Mg, Magnesium; MICU, Medical Intensive Care Unit; Na, Sodium; pH, potential hydrogen; pCO2, partial pressure of carbon dioxide; pO2, partial pressure of oxygen; PT, Prothrombin time; PTT, Partial Thromboplastin Time; PVD, Peripheral vascular disease; RR, Respiratory Rate; RRT, Renal Replacement Therapy; SAPS II Score, Simplified Acute Physiology Score II; SPO2, Saturation of Peripheral Oxygen; Systolic Blood Pressure; SICU, Surgical Intensive Care Unit; WBC, White Blood Cell.

The ERRs and AUROCs of all ML models and the multivariable logistic regression model for mortality prediction in the test data set are shown in Table 2 and Figure 1. RF showed the lowest ERR (21.4%) and the highest accuracy (0.79), precision (0.72), MCC score (0.45), F1 score (0.56), and AUROC (0.83, 95% confidence interval (CI) 0.79–0.87). While the decision tree demonstrated a simple algorithm to follow (Figure 2), it had the highest ERR (26.7%) and lowest accuracy (0.73), precision (0.59), MCC score (0.30), F1 score (0.44), and AUROC (0.71, 95%CI 0.66–0.77) among all ML models (Figure 1).

Table 2.

Comparison of evaluation and calibration among the different models.

Model Error Rate of Test Data Set Accuracy Precision MCC F1 Score AUROC in the Test Set Brier Score
Random forest model 21.4% 0.79 0.72 0.45 0.56 0.83 (0.79–0.87) 0.15
Decision tree 26.7% 0.73 0.59 0.30 0.44 0.71 (0.66–0.77) 0.19
XGBoost 25.0% 0.75 0.60 0.36 0.52 0.81 (0.76–0.85) 0.18
ANN 25.0% 0.75 0.67 0.33 0.42 0.79 (0.74–0.84) 0.19
Multivariable logistic regression 22.9% 0.77 0.67 0.41 0.54 0.81 (0.79–0.83) 0.16
SOFA score 25.5% 0.74 0.67 0.30 0.39 0.74 (0.68–0.80) 0.17
SAPS II score 23.2% 0.77 0.71 0.39 0.49 0.77 (0.71–0.82) 0.17
Charlson score 28.4% 0.72 0.73 0.16 0.13 0.69 (0.63–0.74) 0.19

MCC: worst value −1 and best value +1. F1 score, accuracy, and precision: worst value 0 and best value 1. The Brier score is a combined measure of discrimination and calibration that ranges between 0 and 1, where the best score is 0 and the worst is 1. ANN, artificial neural network; MCC, Matthews correlation coefficient; AUROC, area under the receiver operating characteristic curve; SOFA, Sequential Organ Failure Assessment; SAPS II, Simplified Acute Physiology Score.

Figure 1.

Figure 1

Comparison of AUROC among the different ML models, logistic regression model, SAPS II score, Charlson score, and SOFA score. ROC, operating characteristic.

Figure 2.

Figure 2

Simple decision tree model showing the classification of ICU patients with lactic acidosis who died (1) and did not (0) die during hospitalization. The numbers with two decimals in each cell mean the probability of mortality in each classification tree. The blue or green color becomes dense when it is more likely to die or not. The % number in the boxes denotes the percentage of patients with each discriminating variable from CART (Classification and Regression Tree) analysis. WBC, White Blood Cell; INR, International Normalized Ratio; RR, Respiratory Rate; MAP, Mean Arterial Pressure; GCS, Glasgow Coma Scale.

The results of multivariable logistic regression analysis with stepwise variable selection are shown in Table 3. The AUROC of the multivariable logistic prediction model with forward stepwise variable selection was 0.81 (95%CI 0.79–0.83). We also compared our predictive models with Sequential Organ Failure Assessment (SOFA) score, Simplified Acute Physiology Score (SAPS II) score (acute severity score) and Charlson score (comorbidity score). The RF model outperformed the multivariable logistic regression model, SOFA score (AUROC 0.74), SAP II score (AUROC 0.77), and Charlson score (AUROC 0.69) (Table 2).

Table 3.

Development of multivariable logistic regression model to predict mortality using stepwise variable selection in the training dataset.

KERRYPNX Univariate Analysis Multivariable Analysis
Characteristics OR (95% CI) p-Value OR (95% CI) p-Value
Age per 10 years 1.10 (1.03–1.17) 0.005 1.16 (1.07–1.26) 0.001
Male sex 0.93 (0.75–1.16) 0.52
Race
White 1 (reference) 1 (reference)
Black 1.05 (0.70–1.56) 0.83
Hispanic 0.37 (0.18–0.75) 0.006
Other 0.93 (0.59–1.46) 0.75
ICU type
Cardiac ICU 1.21 (0.85–1.73) 0.30 0.73 (0.48–1.12) 0.15
Cardiac surgery ICU 0.21 (0.15–0.30) <0.001 0.26 (0.16–0.42) <0.001
Medical ICU 1 (reference) 1 (reference) 1 (reference) 1 (reference)
Surgical ICU 0.56 (0.40–0.77) 0.001 0.67 (0.45–1.00) 0.05
Trauma/surgical ICU 0.43 (0.31–0.60) <0.001 0.82 (0.54–1.25) 0.35
Elixhauser Comorbidities 1.20 (0.93–1.54) 0.16
Congestive heart failure 0.51 (0.37–0.71) <0.001
Valvular disease 1.21 (0.80–1.83) 0.36
Pulmonary circulation disorders 0.80 (0.58–1.11) 0.18
Peripheral vascular disease 0.75 (0.60–0.94) 0.01
Hypertension 0.91 (0.43–1.89) 0.79
Paralysis 1.17 (0.80–1.71) 0.43
Neurologic disorders 0.98 (0.71–1.35) 0.90
Chronic pulmonary disease 0.95 (0.73–1.26) 0.74
Uncomplicated diabetes 0.69 (0.38–1.27) 0.24
Complicated diabetes 0.81 (0.52–1.27) 0.37
Hypothyroidism 1.80 (1.35–2.39) <0.001
Liver disease 2.17 (0.92–5.15) 0.08
AIDS/HIV 3.12 (1.67–5.84) <0.001
Lymphoma 2.29 (1.53–3.41) <0.001
Metastatic cancer 0.74 (0.47–1.18) 0.21
Solid tumor 0.91 (0.43–1.89) 0.79
Rheumatoid arthritis 1.77 (1.39–2.25) <0.001 1.96 (1.22–3.15) 0.005
Coagulopathy 0.49 (0.27–0.90) 0.02
Obesity 0.97 (0.53–1.75) 0.91
Weight loss 1.92 (1.54–2.40) <0.001
Fluid and electrolyte disorders 0.90 (0.39–2.04) 0.80 0.49 (0.25–0.99) 0.04
Blood loss anemia 0.61 (0.43–0.85) 0.003
Deficiency anemia 0.94 (0.66–1.34) 0.72
Alcohol abuse 1.00 (0.56–1.79) 0.99
Drug abuse 0.50 (0.34–0.74) <0.001
Psychosis 0.64 (0.34–1.20) 0.16
Depression 0.92 (0.56–1.53) 0.76
Chronic kidney disease 0.49 (0.17–1.46) 0.20
Body weight per 5 kg 0.98 (0.96–1.01) 0.25
Vital signs
Temperature per 1 F 0.96 (0.91–1.01) 0.08
Heart rate per 10 times/minute 1.11 (1.05–1.16) <0.001
Systolic per 10 mmHg 0.92 (0.88–0.96) <0.001
Diastolic BP per 5 mmHg 0.97 (0.93–1.01) 0.09 1.05 (1.01–1.10) 0.03
Mean BP per 5 mmHg 0.97 (0.94–0.99) 0.02
Respiratory rate per 1 time/minute 1.05 (1.04–1.06) <0.001 1.02 (1.00–1.03) 0.04
Oxygen saturation per 1 percent 0.93 (0.91–0.95) <0.001
Glasgow coma score per 1 unit 1.00 (0.98–1.02) 0.91
Vasopressor use 2.19 (1.71–2.79) <0.001 2.11 (1.54–2.89) <0.001
Ventilator use 1.31 (0.96–1.79) 0.09 1.81 (1.21–2.70) 0.004
Any renal replacement therapies 1.36 (0.73–2.54) 0.33
Hemodialysis 1.68 (0.80–3.55) 0.17
CRRT 0.91 (0.32–2.56) 0.85
Acute kidney injury 3.45 (2.55–4.67) <0.001 2.10 (1.49–2.96) <0.001
Laboratory data
BUN per 1 mg/dL 1.03 (1.02–1.03) <0.001
eGFR per 10 mL/min/1.73 m2 0.90 (0.87–0.94) <0.001
Sodium per 1 mEq/L 1.00 (0.98–1.02) 0.72
Potassium per 1 mEq/L 1.03 (0.92–1.16) 0.60
Chloride per 1 mEq/L 0.95 (0.94–0.97) <0.001 0.97 (0.95–0.99) 0.01
Bicarbonate per 1 mEq/L 0.90 (0.87–0.92) <0.001
Anion gap per 1 mEq/L 1.14 (1.12–1.17) <0.001 1.04 (1.01–1.08) 0.009
Total calcium per 1 mg/dL 0.88 (0.80–0.96) 0.006
Ionized calcium per 1 mmol/L 0.06 (0.03–0.13) 0.06 0.19 (0.08–0.46) <0.001
Phosphate per 1 mg/dL 1.29 (1.21–1.37) <0.001
Magnesium per 1 mg/dL 1.48 (1.21–1.81) <0.001 1.54 (1.18–2.02) 0.002
Lactate per 1 mmol/L 1.25 (1.20–1.31) <0.001 1.11 (1.04–1.17) 0.001
Glucose per 1 mg/dL 1.00 (1.00–1.00) 0.14
Hemoglobin per 1 g/dL 1.06 (1.01–1.11) 0.02
WBC per 109 cells/L 1.01 (1.00–1.02) 0.13
Platelet per 109 cells/L 1.00 (1.00–1.00) 0.12
pH per 1 unit 0.04 (0.02–0.10) <0.001
pCO2 per 1 mmHg 0.99 (0.98–0.99) 0.04
pO2 per 1 mmHg 1.00 (1.00–1.00) <0.001 0.99 (0.99–1.00) 0.004
INR per 1 unit 1.62 (1.43–1.84) <0.001 1.17 (1.03–1.33) 0.02
PTT per 1 s 1.01 (1.01–1.01) <0.001 1.01 (1.00–1.01) 0.003
Culture data
Positive blood culture 2.49 (1.79–3.48) <0.001
Positive urine culture 2.05 (1.53–2.74) <0.001
Positive sputum culture 1.90 (1.37–2.63) <0.001

Abbreviations: AIDS, Acquired Immune Deficiency Syndrome; AKI, Acute Kidney Injury; BUN, Blood Urea Nitrogen; CCU, Coronary Care Unit; CHF, Chronic Heart Failure; Cl, Chloride; CRRT Continuous Renal Replacement Therapy; CSRU, Cardiac Surgery Recovery Unit; DBP, Diastolic Blood Pressure; eGFR, estimated Glomerular Filtration Rate; GCS, Glasgow Coma Scale; Hb Hemoglobin; HR, Heart Rate; ICU, Intensive Care Unit; IHD, Intermittent Hemodialysis; INR, International Normalized Ratio; K, Potassium; MAP, Mean Arterial Pressure; Mg, Magnesium; MICU, Medical Intensive Care Unit; Na, Sodium; pH, potential hydrogen; pCO2, partial pressure of carbon dioxide; pO2, partial pressure of oxygen; PT, Prothrombin time; PTT, Partial Thromboplastin Time; PVD, Peripheral vascular disease; RR, Respiratory Rate; RRT, Renal Replacement Therapy; SPO2, Saturation of Peripheral Oxygen; Systolic Blood Pressure; SICU, Surgical Intensive Care Unit; WBC, White Blood Cell.

The RF model was the most well-calibrated among all models (Supplementary Figures S4–S10). The Brier score for RF, DT, XGBoost, ANN, and multivariable logistic regression was 0.15, 0.19, 0.18, 0.19 and 0.16, respectively (Table 2). Variable importance analysis of RF, the best model, was performed. The top important variables of RF combined the mean decrease in Gini (how much each variable decreases the node impurity), decrease in accuracy, and p values of the clinical indices include BUN, anion gap, lactate level, INR, pO2, phosphate level, PTT, platelet count, pH, and baseline eGFR (Figure 3).

Figure 3.

Figure 3

Multiway feature importance analysis of RF model combining the mean decrease in Gini, decrease in accuracy, and p values of the features (pink circle; p < 0.01). Abbreviations: BUN, blood urea nitrogen; eGFR, estimated glomerular filtration rate; pH, potential hydrogen; pO2, partial pressure of oxygen; PTT, partial thromboplastin time.

To identify the features that influenced the prediction model the most, we depicted the SHAP summary plot of RF model (Figure 4) and the top 20 features of the prediction model. This plot depicts how high and low features’ values were in relation to SHAP values in the testing dataset. According to the prediction model, the higher the SHAP value of a feature, the higher probability of mortality occurring. Additionally, we applied LIME into RF model to illustrate the impact of key features at the individual level (Figure 5).

Figure 4.

Figure 4

SHAP summary plot of the top 20 features of the RF model. The higher the SHAP value of a feature, the higher the probability of mortality. A dot is created for each feature attribution value for the model of individual patient. Each patient is allocated one dot on the line for each feature. Dots are colored according to the values of features for the respective patient and accumulate vertically to depict density. Red represents higher feature values, and blue represents lower feature values. Abbreviations: AKI, acute kidney injury; BUN, blood urea nitrogen; CRRT, continuous renal replacement therapy; HCO3, bicarbonate; ICU, intensive care unit; INR, international normalized ratio.

Figure 5.

Figure 5

Local interpretable model explainer (LIME) for 6 individual cases (case# 379 to 384) from the testing dataset. Label “1” means prediction of mortality and label “0” means prediction of no mortality (survival). Probability shows the probability of the observation belong to the label “1” or “0”. The five most influential variables that best explain the linear model in that observation’s local region are provided and whether the variable causes an increase in the probability (supports/blue bar) or a decrease in the probability (contradicts/red bar). The x-axis shows how much each feature added or subtracted to the final probability value for the patient. Abbreviations: BUN, blood urea nitrogen; GCS, Glasgow Coma Scale; PTT, partial thromboplastin time; pO2, partial pressure of oxygen.

4. Discussion

Significant efforts have been invested into the development of predictive risk models of mortality for ICU patients. Traditional statistical models such as logistic regression analysis have been previously utilized to construct such prognostication tools [49,50,51,52]. In recent years, ML predictive algorithms have emerged as a method to handle high-dimensional, unstructured, and complex structure data [28,29,30,31,32,33,34]. In this study, we compared ML models and a conventional multivariable logistic regression model to assess the best-performing model for in-hospital mortality among ICU patients with lactic acidosis. The findings from our study suggest that the RF algorithm demonstrated superior performance in prediction of mortality among critically ill patients with lactic acidosis compared to other predictive tools.

Modern ICUs and advances in electronic health records (EHRs) generate vast amounts of complex and multidimensional data that provide valuable information on patient outcomes. This has led to considerable advances in precision medicine [53]. While elevated serum lactate levels have been shown to be associated with increased mortality [54,55,56] and its incorporation improves predictive performance in traditional logistic regression models among critically ill patients [26,27], mortality risk prediction among the subgroup of ICU patients with lactic acidosis [54,55,56] is limited, especially utilizing ML approaches. Furthermore, patients with lactic acidosis are heterogenous and the impact of lactic acidosis on ICU mortality varies based on the clinical ICU setting, such as trauma, cardiac surgery, and sepsis [54,55,56]. Given the heterogeneity of ICU patients with lactic acidosis and the lack of adequate tools for patient-level prognostication, clinicians may often resort to subjective gestalt judgment, which is prone to bias [57]. Thus, we investigated whether ML methods improved mortality prognostication for ICU lactic acidosis in this study to improve precision medicine. Our best model was reached using the RF algorithm, which was associated with the highest AUROC and lowest ERR compared to all other models. We acknowledged that AUROC has several flaws [58], and thus, we also investigated other evaluation metrics including accuracy, precision, MCC, and F1 score. These confirmed the robustness of our RF prediction model. Finally, the findings that the predicted probabilities are close to the expected probability distribution supports that our RF model for mortality predication among ICU patients with lactic acidosis is well-calibrated.

RF is a widely used ML approach that can effectively predict outcomes [40]. It does this by utilizing additive combinations of trees that are built using different subsets of data and variables [42,59]. This nonparametric and nonlinear machine learning RF method can resist noise, and thus, it is expected to build accurate prediction models using aggregated data [40,60]. As a type of robust nonparametric model, RF can simulate complex relationships and it does not depend on data distribution, such as in logistic regression [40]. In addition, RF works well on large datasets, particularly when there are many categorical independent variables and unbalanced data [42]. On the other hand, the logistic regression analysis approach uses a generalized linear equation and the stepwise variable selection method is based on the likelihood ratio test to describe the directed dependencies among a set of variables. These approaches require that a number of statistical assumptions must be met. Thus, logistic regression analysis possesses inherent bias and, consequently, low variance due to the rigid nature from the shape of the line. Our RF prediction model also outperformed acute severity scores (SOFA and SAPS II scores) and comorbidities score (Charlson score) for prediction of ICU patients with lactic acidosis hospital mortality. In addition, this study provided important information on variables in our RF model including BUN, anion gap, lactate level, INR, pO2, phosphate level, PTT, platelet count, pH, and baseline eGFR. These variables in the RF model for critically ill patients with lactic acidosis using a MIMICIII database will help each institution develop their individualized RF model to better prognosticate mortality risk. Additionally, we applied model-agnostic approaches including feature relevant explanation through SHAP and local explanations through the LIME approach [61], which demonstrated how our RF model can be used to explain how each feature contributes to mortality prediction among patients with lactic acidosis.

Although our study includes a large sample size of ICU patients with lactic acidosis and ICU admission data, there are several important limitations. First, our models utilized data obtained during the time of ICU admission in order to prognosticate mortality risk in the early ICU course. Thus, events that markedly altered the prognosis for an individual patient were not included. In addition, our study is retrospective and based on the MIMIC III database, a large single-center tertiary care hospital in the United States. Hence, the model might have been influenced by the specific clinical guidelines, practice, and treatment decisions for that institution. A future validation study with the updated MIMIC-IV database and external validation studies of ML prediction models are needed.

5. Conclusions

In conclusion, an ML prediction model using the RF algorithm (available online as a shiny app at https://wisitc.shinyapps.io/RandomForestLacticAcid/ created on 15 September 2021)) provided the highest predictive performance for hospital mortality among ICU patients with lactic acidosis. While future external validation studies are required, the findings of our study provide support towards the utilization of RF algorithms to improve risk stratification among critically patients with lactic acidosis.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/jcm10215021/s1, Figure S1: Correlation of variables in the dataset; Figure S2: Pruned DT based on cross-validated error results using the complexity parameter associated with the minimal error; Figure S3: Error rate of RF model by number of trees; Figures S4–S10: Calibration plot of among all models; Supplementary method: detailed model development and TRIPOD checklist.

Author Contributions

Conceptualization, P.P., C.T., W.K., F.Q. (Fawad Qureshi), F.Q. (Fahad Qureshi), T.P., N.S., G.G., O.A.O., S.C., S.V., P.K.G., M.A.M., V.D.G., J.J.D. and W.C.; Data curation, P.P., C.T. and W.C.; Formal analysis, P.P., C.T. and W.C.; Investigation, P.P., C.T., W.K. and W.C.; Methodology, P.P., C.T., W.K. and W.C.; Project administration, P.P., C.T., W.K., F.Q. (Fawad Qureshi) and W.C.; Resources, P.P., F.Q. (Fawad Qureshi) and W.C.; Software, P.P. and W.C.; Supervision, P.P., F.Q. (Fawad Qureshi), N.S., G.G., O.A.O., V.D.G., J.J.D. and W.C.; Validation, P.P., C.T., W.K., T.P. and W.C.; Visualization, P.P., C.T. and W.C.; Writing—original draft, P.P., C.T. and F.Q. (Fahad Qureshi); Writing—review and editing, P.P., C.T., W.K., F.Q. (Fawad Qureshi), F.Q. (Fahad Qureshi), T.P., N.S., G.G., O.A.O., S.C., S.V., P.K.G., M.A.M., V.D.G. and J.J.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board of Mayo Clinic (IRB number-21-009222).

Informed Consent Statement

Patient consent was waived due to the minimal risk nature of observational chart review studies.

Data Availability Statement

Data are available upon reasonable request to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Vernon C., LeTourneau J.L. Lactic Acidosis: Recognition, Kinetics, and Associated Prognosis. Crit. Care Clin. 2010;26:255–283. doi: 10.1016/j.ccc.2009.12.007. [DOI] [PubMed] [Google Scholar]
  • 2.Levy B. Lactate and shock state: The metabolic view. Curr. Opin. Crit. Care. 2006;12:315–321. doi: 10.1097/01.ccx.0000235208.77450.15. [DOI] [PubMed] [Google Scholar]
  • 3.Trzeciak S., Dellinger R.P., Chansky M.E., Arnold R.C., Schorr C., Milcarek B., Hollenberg S.M., Parrillo J.E. Serum lactate as a predictor of mortality in patients with infection. Intensiv. Care Med. 2007;33:970–977. doi: 10.1007/s00134-007-0563-9. [DOI] [PubMed] [Google Scholar]
  • 4.Jansen T.C., van Bommel J., Woodward R., Mulder P.G.H., Bakker J. Association between blood lactate levels, Sequential Organ Failure Assessment subscores, and 28-day mortality during early and late intensive care unit stay: A retrospective observational study. Crit. Care Med. 2009;37:2369–2374. doi: 10.1097/CCM.0b013e3181a0f919. [DOI] [PubMed] [Google Scholar]
  • 5.Levraut J., Ciebiera J.-P., Chave S., Rabary O., Jambou P., Carles M., Grimaud D. Mild Hyperlactatemia in Stable Septic Patients Is Due to Impaired Lactate Clearance Rather Than Overproduction. Am. J. Respir. Crit. Care Med. 1998;157:1021–1026. doi: 10.1164/ajrccm.157.4.9705037. [DOI] [PubMed] [Google Scholar]
  • 6.Filho R.R., Rocha L.L., Corrêa T.D., Pessoa C.M.S., Colombo G., Assuncao M.S.C. Blood Lactate Levels Cutoff and Mortality Prediction in Sepsis—Time for a Reappraisal? A Retrospective Cohort Study. Shock. 2016;46:480–485. doi: 10.1097/SHK.0000000000000667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Cheungpasitporn W., Zand L., Dillon J.J., Qian Q., Leung N. Lactate clearance and metabolic aspects of continuous high-volume hemofiltration. Clin Kidney J. 2015;8:374–377. doi: 10.1093/ckj/sfv045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kreisberg R.A. Lactate Homeostasis and Lactic Acidosis. Ann. Intern. Med. 1980;92:227–237. doi: 10.7326/0003-4819-92-2-227. [DOI] [PubMed] [Google Scholar]
  • 9.Bakker J., Gris P., Coffernils M., Kahn R.J., Vincent J.-L. Serial blood lactate levels can predict the development of multiple organ failure following septic shock. Am. J. Surg. 1996;171:221–226. doi: 10.1016/S0002-9610(97)89552-9. [DOI] [PubMed] [Google Scholar]
  • 10.Sammour T., Kahokehr A., Caldwell S., Hill A.G. Venous glucose and arterial lactate as biochemical predictors of mortality in clinically severely injured trauma patients—A comparison with ISS and TRISS. Injury. 2009;40:104–108. doi: 10.1016/j.injury.2008.07.032. [DOI] [PubMed] [Google Scholar]
  • 11.Lavery R.F., Livingston D.H., Tortella B.J., Sambol J.T., Slomovitz B.M., Siegel J.H. The utility of venous lactate to triage injured patients in the trauma center. J. Am. Coll. Surg. 2000;190:656–664. doi: 10.1016/S1072-7515(00)00271-4. [DOI] [PubMed] [Google Scholar]
  • 12.O’Connor E., Fraser J.F. The Interpretation of Perioperative Lactate Abnormalities in Patients Undergoing Cardiac Surgery. Anaesth. Intensiv. Care. 2012;40:598–603. doi: 10.1177/0310057X1204000404. [DOI] [PubMed] [Google Scholar]
  • 13.Ranucci M., De Toffol B., Isgrò G., Romitti F., Conti D., Vicentini M. Hyperlactatemia during cardiopulmonary bypass: Determinants and impact on postoperative outcome. Crit. Care. 2006;10:R167. doi: 10.1186/cc5113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Toraman F., Evrenkaya S., Yuce M., Aksoy N., Karabulut H., Bozkulak Y., Alhan C. Lactic Acidosis after Cardiac Surgery Is Associated with Adverse Outcome. Hear. Surg. Forum. 2004;7:E155–E159. doi: 10.1532/HSF98.20041002. [DOI] [PubMed] [Google Scholar]
  • 15.Renew J.R., Barbara D.W., Hyder J.A., Dearani J.A., Rivera M., Pulido J.N. Frequency and outcomes of severe hyperlactatemia after elective cardiac surgery. J. Thorac. Cardiovasc. Surg. 2016;151:825–830. doi: 10.1016/j.jtcvs.2015.10.063. [DOI] [PubMed] [Google Scholar]
  • 16.Maillet J.-M., Le Besnerais P., Cantoni M., Nataf P., Ruffenach A., Lessana A., Brodaty D. Frequency, Risk Factors, and Outcome of Hyperlactatemia After Cardiac Surgery. Chest. 2003;123:1361–1366. doi: 10.1378/chest.123.5.1361. [DOI] [PubMed] [Google Scholar]
  • 17.Demers P., Elkouri S., Martineau R., Couturier A., Cartier R. Outcome with high blood lactate levels during cardiopulmonary bypass in adult cardiac operation. Ann. Thorac. Surg. 2000;70:2082–2086. doi: 10.1016/S0003-4975(00)02160-3. [DOI] [PubMed] [Google Scholar]
  • 18.Ranucci M., Ballotta A., Castelvecchio S., Baryshnikova E., Brozzi S., Boncilli A. Intensive Care Unit Admission Parameters Improve the Accuracy of Operative Mortality Predictive Models in Cardiac Surgery. PLoS ONE. 2010;5:e13551. doi: 10.1371/journal.pone.0013551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Peters N., Jay N., Barraud D., Cravoisy A., Nace L., Bollaert P.-E., Gibot S. Metformin-associated lactic acidosis in an intensive care unit. Crit. Care. 2008;12:R149. doi: 10.1186/cc7137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Biradar V., Moran J.L., Peake S.L., Peter J.V. Metformin-associated lactic acidosis (MALA): Clinical profile and outcomes in patients admitted to the intensive care unit. Crit. Care Resusc. 2010;12:191–195. [PubMed] [Google Scholar]
  • 21.Dellinger R.P., Levy M.M., Carlet J.M., Bion J., Parker M.M., Jaeschke R., Reinhart K., Angus D.C., Brun-Buisson C., Beale R., et al. Surviving Sepsis Campaign: International guidelines for management of severe sepsis and septic shock: 2008. Intensive Care Med. 2008;36:296–327. doi: 10.1097/01.ccm.0000298158.12101.41. [DOI] [PubMed] [Google Scholar]
  • 22.Weil M.H., Afifi A.A. Experimental and Clinical Studies on Lactate and Pyruvate as Indicators of the Severity of Acute Circulatory Failure (Shock) Circulation. 1970;41:989–1001. doi: 10.1161/01.CIR.41.6.989. [DOI] [PubMed] [Google Scholar]
  • 23.Baysan M., Baroni G.D., van Boekel A.M., Steyerberg E.W., Arbous M.S., van der Bom J.G. The Added Value of Lactate and Lactate Clearance in Prediction of In-Hospital Mortality in Critically Ill Patients With Sepsis. Crit. Care Explor. 2020;2:e0087. doi: 10.1097/CCE.0000000000000087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Zhang Z., Chen K., Ni H., Fan H. Predictive value of lactate in unselected critically ill patients: An analysis using fractional polynomials. J. Thorac. Dis. 2014;6:995–1003. doi: 10.3978/j.issn.2072-1439.2014.07.01. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Yang C.-C., Chan K.-S., Tseng K.-L., Weng S.-F. Prognosis of alcohol-associated lactic acidosis in critically ill patients: An 8-year study. Sci. Rep. 2016;6:35368. doi: 10.1038/srep35368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Drolz A., Horvatits T., Rutter K., Landahl F., Roedl K., Meersseman P., Wilmer A., Kluwe J., Lohse A.W., Kluge S., et al. Lactate Improves Prediction of Short-Term Mortality in Critically Ill Patients With Cirrhosis: A Multinational Study. Hepatology. 2019;69:258–269. doi: 10.1002/hep.30151. [DOI] [PubMed] [Google Scholar]
  • 27.Hayashi Y., Endoh H., Kamimura N., Tamakawa T., Nitta M. Lactate indices as predictors of in-hospital mortality or 90-day survival after admission to an intensive care unit in unselected critically ill patients. PLoS ONE. 2020;15:e0229135. doi: 10.1371/journal.pone.0229135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Fröhlich H., Balling R., Beerenwinkel N., Kohlbacher O., Kumar S., Lengauer T., Maathuis M.H., Moreau Y., Murphy S.A., Przytycka T.M., et al. From hype to reality: Data science enabling personalized medicine. BMC Med. 2018;16:1–15. doi: 10.1186/s12916-018-1122-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sohail A., Arif F. Supervised and unsupervised algorithms for bioinformatics and data science. Prog. Biophys. Mol. Biol. 2020;151:14–22. doi: 10.1016/j.pbiomolbio.2019.11.012. [DOI] [PubMed] [Google Scholar]
  • 30.Lin K., Hu Y., Kong G. Predicting in-hospital mortality of patients with acute kidney injury in the ICU using random forest model. Int. J. Med. Inform. 2019;125:55–61. doi: 10.1016/j.ijmedinf.2019.02.002. [DOI] [PubMed] [Google Scholar]
  • 31.Raita Y., Goto T., Faridi M.K., Brown D.F.M., Camargo C.A., Jr., Hasegawa K. Emergency department triage prediction of clinical outcomes using machine learning models. Crit. Care. 2019;23:64. doi: 10.1186/s13054-019-2351-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Johnson A.E.W., Ghassemi M.M., Nemati S., Niehaus K.E., Clifton D.A., Clifford G. Machine Learning and Decision Support in Critical Care. Proc. IEEE. 2016;104:444–466. doi: 10.1109/JPROC.2015.2501978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Nielsen A.B., Thorsen-Meyer H.-C., Belling K., Nielsen A.P., Thomas C.E., Chmura P., Lademann M., Moseley P.L., Heimann M., Dybdahl L., et al. Survival prediction in intensive-care units based on aggregation of long-term disease history and acute physiology: A retrospective study of the Danish National Patient Registry and electronic patient records. Lancet Digit. Health. 2019;1:e78–e89. doi: 10.1016/S2589-7500(19)30024-X. [DOI] [PubMed] [Google Scholar]
  • 34.Hsieh M.H., Chen C.-M., Hsieh C.-C., Chao C.-M., Lai C.-C. Comparison of machine learning models for the prediction of mortality of patients with unplanned extubation in intensive care units. Sci. Rep. 2018;8:17116. doi: 10.1038/s41598-018-35582-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Manz C.R., Chen J., Liu M., Chivers C., Regli S.H., Braun J., Draugelis M., Hanson C.W., Shulman L.N., Schuchter L.M., et al. Validation of a Machine Learning Algorithm to Predict 180-Day Mortality for Outpatients With Cancer. JAMA Oncol. 2020;6:1723. doi: 10.1001/jamaoncol.2020.4331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Johnson A.E.W., Pollard T.J., Shen L., Lehman L.-W.H., Feng M., Ghassemi M., Moody B., Szolovits P., Anthony Celi L., Mark R.G. MIMIC-III, a freely accessible critical care database. Sci. Data. 2016;3:160035. doi: 10.1038/sdata.2016.35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Collins G.S., Reitsma J.B., Altman D.G., Moons K.G.M. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): The TRIPOD Statement. J. Br. Surg. 2015;102:148–158. doi: 10.1002/bjs.9736. [DOI] [PubMed] [Google Scholar]
  • 38.Singh D., Singh B. Investigating the impact of data normalization on classification performance. Appl. Soft Comput. 2020;97:105524. doi: 10.1016/j.asoc.2019.105524. [DOI] [Google Scholar]
  • 39.Harvey H.B., Sotardi S.T. The Pareto Principle. J. Am. Coll. Radiol. 2018;15:931. doi: 10.1016/j.jacr.2018.02.026. [DOI] [PubMed] [Google Scholar]
  • 40.Breiman L. Random forests. Mach. Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
  • 41.Huang J.-C., Tsai Y.-C., Wu P.-Y., Lien Y.-H., Chien C.-Y., Kuo C.-F., Hung J.-F., Chen S.-C., Kuo C.-H. Predictive modeling of blood pressure during hemodialysis: A comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method. Comput. Methods Programs Biomed. 2020;195:105536. doi: 10.1016/j.cmpb.2020.105536. [DOI] [PubMed] [Google Scholar]
  • 42.Muchlinski D., Siroky D., He J., Kocher M. Comparing Random Forest with Logistic Regression for Predicting Class-Imbalanced Civil War Onset Data. Political Anal. 2016;24:87–103. doi: 10.1093/pan/mpv024. [DOI] [Google Scholar]
  • 43.Bergstra J., Bardenet R., Bengio Y., Kégl B. Algorithms for hyper-parameter optimization. Adv. Neural Inf. Process. Syst. 2011;24:2546–2554. [Google Scholar]
  • 44.McGee S. Simplifying likelihood ratios. J. Gen. Intern. Med. 2002;17:647–650. doi: 10.1046/j.1525-1497.2002.10750.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Zou K.H., O’Malley A.J., Mauri L. Receiver-Operating Characteristic Analysis for Evaluating Diagnostic Tests and Predictive Models. Circulation. 2007;115:654–657. doi: 10.1161/CIRCULATIONAHA.105.594929. [DOI] [PubMed] [Google Scholar]
  • 46.Hanley J.A., McNeil B.J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143:29–36. doi: 10.1148/radiology.143.1.7063747. [DOI] [PubMed] [Google Scholar]
  • 47.Huang Y., Li W., Macheret F., Gabriel R.A., Ohno-Machado L. A tutorial on calibration measurements and calibration models for clinical prediction models. J. Am. Med. Inform. Assoc. 2020;27:621–633. doi: 10.1093/jamia/ocz228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Stekhoven D.J., Bühlmann P. MissForest—Non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28:112–118. doi: 10.1093/bioinformatics/btr597. [DOI] [PubMed] [Google Scholar]
  • 49.Moreno R.P., Metnitz P.G.H., Almeida E., Jordan B., Bauer P., Campos R.A., Iapichino G., Edbrooke D., Capuzzo M., Le Gall J.-R. SAPS 3—From evaluation of the patient to evaluation of the intensive care unit. Part 2: Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med. 2005;31:1345–1355. doi: 10.1007/s00134-005-2763-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Iwashyna T.J., Ma C., Wang X.Q., Seelye S., Zhu J., Waljee A.K. Variation in model performance by data cleanliness and classification methods in the prediction of 30-day ICU mortality, a US nationwide retrospective cohort and simulation study. BMJ Open. 2020;10:e041421. doi: 10.1136/bmjopen-2020-041421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Fika S., Nanas S., Baltopoulos G., Charitidou E., Myrianthefs P. A novel mortality prediction model for the current population in an adult intensive care unit. Heart Lung. 2018;47:10–15. doi: 10.1016/j.hrtlng.2017.10.009. [DOI] [PubMed] [Google Scholar]
  • 52.Loreto M., Lisboa T., Moreira V.P. Early prediction of ICU readmissions using classification algorithms. Comput. Biol. Med. 2020;118:103636. doi: 10.1016/j.compbiomed.2020.103636. [DOI] [PubMed] [Google Scholar]
  • 53.Handelman G.S., Kok H.K., Chandra R.V., Razavi A.H., Lee M.J., Asadi H. eDoctor: Machine learning and the future of medicine. J. Intern. Med. 2018;284:603–619. doi: 10.1111/joim.12822. [DOI] [PubMed] [Google Scholar]
  • 54.Chik C., Hayes G.M., Menard J. Development of a veterinary trauma score (VetCOT) in canine trauma patients with per-formance evaluation and comparison to the animal trauma triage score: A VetCOT registry study. J. Vet. Emerg. Crit. Care. 2021 doi: 10.1111/vec.13135. [DOI] [PubMed] [Google Scholar]
  • 55.Kasapoğlu U.S., Kaçmaz O., Gök A., Yildiz Eglen M., Şayan H., Çolak F. Prognostic factors for 30-days mortality in eighty years aged and older critically ill patients: A single center retrospec-tive cohort study. Turk. J. Med. Sci. 2021 doi: 10.3906/sag-2104-359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Lim J.-H., Jeon Y., Ahn J.-S., Kim S., Kim D.K., Lee J.P., Ryu D.-R., Seong E.Y., Ahn S.Y., Baek S.H., et al. GDF-15 Predicts In-Hospital Mortality of Critically Ill Patients with Acute Kidney Injury Requiring Continuous Renal Replacement Therapy: A Multicenter Prospective Study. J. Clin. Med. 2021;10:3660. doi: 10.3390/jcm10163660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kahneman D., Lovallo D., Sibony O. Before you make that big decision. Harv. Bus. Rev. 2011;89:50. [PubMed] [Google Scholar]
  • 58.Lobo J.M., Jiménez-Valverde A., Real R. AUC: A misleading measure of the performance of predictive distribution models. Glob. Ecol. Biogeogr. 2008;17:145–151. doi: 10.1111/j.1466-8238.2007.00358.x. [DOI] [Google Scholar]
  • 59.Rashidi H.H., Tran N.K., Betts E.V., Howell L.P., Green R. Artificial Intelligence and Machine Learning in Pathology: The Present Landscape of Supervised Methods. Acad. Pathol. 2019;6:2374289519873088. doi: 10.1177/2374289519873088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Wasserman L., Roeder K. High-dimensional variable selection. Ann. Stat. 2009;37:2178–2201. doi: 10.1214/08-AOS646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Lundberg S.M., Nair B., Vavilala M.S., Horibe M., Eisses M.J., Adams T., Liston D.E., Low D.K.-W., Newman S.-F., Kim J., et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat. Biomed. Eng. 2018;2:749–760. doi: 10.1038/s41551-018-0304-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

Data are available upon reasonable request to the corresponding author.


Articles from Journal of Clinical Medicine are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES