Abstract
OBJECTIVE
The Sequential Organ Failure Assessment (SOFA) score, a measure of multiple organ dysfunction syndrome (MODS), is used to predict mortality in critically ill patients by assigning equally weighted scores across six different organ systems. We hypothesized that specific organ systems would have a greater association with mortality than others.
DESIGN
We retrospectively studied patients admitted over a period of 4.2 years to a mixedprofile intensive care unit (ICU). We recorded age and comorbidities, and calculated SOFA organ scores. The primary outcome was 30-day all-cause mortality. We determined which organ sub-scores of the SOFA score were most associated with mortality using multiple analytic methods: random forests, conditional inference trees, distanced-based clustering techniques, and logistic regression.
SETTING
A 24-bed mixed-profile adult ICU that cares for medical, surgical, and trauma (Level 1) patients at an academic referral center.
PATIENTS
All patients’ first admission to the study ICU during the study period.
MEASUREMENTS AND MAIN RESULTS
We identified 9,120 first admissions during the study period. Overall 30-day mortality was 12%. Multiple analytical methods all demonstrated that the best initial prediction variables were age and the central nervous system (CNS) SOFA subscore, which is determined solely by Glasgow Coma Scale score.
CONCLUSIONS
In a mixed population of critically ill patients, the Glasgow Coma Scale score dominates the association between admission SOFA score and 30-day mortality. Future research into outcomes from multiple organ dysfunction may benefit from new models for measuring organ dysfunction with special attention to neurologic dysfunction.
Keywords: SOFA, multiple organ dysfunction, mortality, ICU, Glasgow Coma Score, trauma, sepsis, age, outcomes, random forests, classification trees
INTRODUCTION
Multiple organ dysfunction syndrome (MODS) occurs in a severely ill patient in whom organismal homeostasis cannot be maintained without external therapeutic support.1 MODS typically occurs in shock states in which arterial blood pressure is too low to adequately perfuse organs, and can occur even after successful initial resuscitation of a shock state.1 Over half of intensive care unit (ICU) patients have at least one organ system dysfunction, and 20% have multiple organ dysfunction.2 As much as half of the mortality observed in modern ICUs is attributable to MODS.3,4
The Sequential Organ Failure Assessment (SOFA) score was developed to assess the extent of MODS.5 As opposed to more complex, sometimes proprietary predictive models, the SOFA score is easily calculated from routinely collected data and provides categorizations directly relevant to MODS. While SOFA was originally developed as a tool for evaluating and comparing organ failure in patients with sepsis, it has been used widely to predict mortality in general ICU populations.6–11 The SOFA score sums dysfunction across six organ systems, assigning equal ranges of sub-scores, 0 to 4, to each organ system (Table 1).12 The equal weighting of organ failure allows for easier calculation, but fails to recognize that organ systems may contribute unequally to the sequelae of MODS, including death. Recognition of unequal contribution of organ systems towards mortality could allow for a more refined scoring system or for targeted research or more patient-tailored treatments, based on the pattern of organ dysfunction. We selected the SOFA score for study because of its widespread clinical use, ease of calculation from routine clinical data, and its direct applicability to MODS.
Table 1.
SOFA Score
| SOFA Sub-score | 0 | 1 | 2 | 3 | 4 |
|---|---|---|---|---|---|
| Respiratory PaO2/FiO2 (torr)a |
>340 |
≤ 340 |
≤ 255 |
≤ 170b |
≤ 85b |
| Central Nervous System Glasgow Coma Score |
15 |
13–14 |
10–12 |
6–9 |
<6 |
| Cardiovascular MAP (mmHg) Vasopressor ratec (µg/kg/min) |
≥ 70 |
< 70 |
Dopamine ≤ 5 OR Dobutamine > 0 |
Dopamine > 5 OR Epinephrine ≤ 0.1 OR Norepinephrine ≤ 0.1 OR Phenylephrine ≤ 0.22 |
Dopamine > 15 OR Epinephrine > 0.1 OR Norepinephrine > 0.1 OR Phenylephrine > 0.22 |
| Liver Bilirubin (mg/dl) |
<1.2 |
≥1.2 |
≥2.0 |
≥6.0 |
≥ 12.0 |
| Coagulation Platelet (×103/µL) |
>150 |
≤ 150 |
≤ 100 |
≤ 50 |
≤ 20 |
| Renal Creatinine (mg/dl) Urine output (ml/day) |
<1.2 |
≥1.2 |
≥2.0 |
≥3.5 |
≥5.0 OR ≥3.5 AND < 200 mL/day |
PaO2/FiO2 ratio was adjusted for altitude (1500m) per ARDS-Network methodology of adjusting the cutoffs by a factor of (Hospital Patm/Sea level Patm)52
Requires the use of positive pressure ventilation
Per original description, vasopressor dose must have been present for ≥ 60 minutes to qualify5
We added phenylephrine to the standard list of vasopressors according to standard equivalency53
Various groups have attempted to refine the SOFA score,6,13–19 but there is no consensus about which elements of the SOFA score are most relevant to mortality. In this study, we evaluated the relative importance of the organ systems measured with the SOFA score in a mixed ICU population with regard to 30-day mortality. Secondarily, we assessed the relevance of age and comorbidities to the association between admission SOFA score and 30-day mortality and investigated whether clustering and classification techniques could identify relevant patterns of organ dysfunction within MODS as measured by the SOFA score.
METHODS
We performed a retrospective study of all patients admitted from November 2007 through January 2012 to the Shock Trauma ICU, a 24-bed ICU that cares for adult medical, surgical, and trauma patients, at Intermountain Medical Center, a university-affiliated 452-bed Level 1 Trauma Center and referral hospital in Murray, Utah, USA. The Intermountain Medical Center Institutional Review Board (IRB #1014318) approved this study with waiver of consent.
We included all patients admitted to the ICU from the emergency department (ED), hospital floor, or other facilities. We only included the first ICU admission for a given patient during the study period. We excluded no other ICU admissions. For purposes of analysis, we assigned patients into one of five admitting diagnostic categories: medical, surgical, neurological, cardiac, or acute trauma. Cardiac diagnoses included primarily heart failure, endocarditis, and acute coronary syndrome admissions; the study ICU does not routinely admit cardiac surgery patients. We measured comorbidities using the Elixhauser score, according to standard methodology.20,21 Our primary outcome was 30-day all-cause mortality. We determined mortality from the Intermountain Master Death Record, which incorporates results from Utah state vital statistics. We queried the electronic medical record (EMR) for SOFA data, admission diagnosis, comorbidities from discharge diagnostic codes, gender, age and ethnicity.
Calculating SOFA scores
For the admission SOFA score, we included data obtained up to 6 hours preceding ICU admission and 24 hours after ICU admission, in order to capture the first ICU day. We included data up to 6 hours prior to ICU admission in order to avoid excluding relevant values obtained in the ED (at the study hospital, most ED stays are < 5 hours). We calculated SOFA scores for each patient.2,5,22 We used the standard method with the exception of our treatment of urine output in the renal sub-score which we solely used to reclassify patients with a renal sub-score of 3 to a renal sub-score of 4 due to less reliable charting of hourly urine flow rates in our system (Table 1). We calculated the SOFA by using the highest score for each organ system measured during the 30-hour scoring period, and then summed the organ system sub-scores to calculate the final SOFA score.
The SOFA central nervous system sub-score is based on the Glasgow Coma Scale (GCS) score., which is routinely measured in the study ICU according to standard methods.23 All patients undergoing mechanical ventilation through an endotracheal tube were assigned a GCS verbal score of 1, irrespective of the clinician’s estimate of the patient’s underlying verbal capacity due to the fact that estimated verbal capacity was not available in all study patients.
Calculation of the PaO2/FiO2 was performed from arterial blood gas, when available. When arterial blood gases were not available, PaO2/FiO2 was estimated from SpO2/FiO2, using the Severinghaus equation24 to convert SpO2 to PaO2. This method has previously been applied to estimate oxygenation in clinical studies on similar populations.25–29
Statistical Methods
Summary statistics are reported as: 1) mean with standard deviation, 2) median with interquartile range, or 3) count and percentage as dictated by the data. We compared between- or among-group central tendencies with Chi-square or Kruskall-Wallis statistic where appropriate.
For our primary analysis, we utilized random forests30–32 and conditional inference trees33 to identify the SOFA sub-scores most predictive of 30 day-mortality. The random forest method uses bootstrapped samples with a random number of variables to create a large number of regression trees and assess the most influential variables in terms of classification. Conditional inference trees are a supervised classification method for analyzing data that select covariates by permutation-based significance tests, thereby avoiding potential bias of the more traditional decision tree algorithms.33 We also employed a multivariate naïve Bayesian classification model of mortality.34 For the Bayesian classification model, we utilized a feature selection strategy that employed repeated bootstrap sampling to maximize the area under the receiver operating characteristic curve (AUC) in the “out of bag” sample (those sample observations not used to build the given model), a technique used to avoid over-fitting of the model.35,36
To evaluate the presence of clusters of organ dysfunction, we utilized three standard clustering techniques: k-means partitioning, partitioning around k-medoids and hierarchical agglomerative clustering.37–39 The k-means and k-medoids algorithms partition observations into clusters wherein each observation belongs to the cluster with the nearest central data point, while the agglomerative clustering technique is a “bottom-up” approach wherein each observation starts in its own cluster and pairs of clusters are merged as one moves up the hierarchy. We generated silhouette plots and gap statistics to evaluate the optimal number of clusters.40,41
By convention GCS assigns a verbal sub-score of 1 to all intubated patients, which could lead to over-weighting of respiratory failure in the overall SOFA score because it is also represented in the pulmonary sub-score. Some investigators have modified the GCS by imputing a verbal score to intubated patients.42–44 To determine whether an imputed verbal sub-score changed our main findings we performed a sensitivity analysis in which we imputed the verbal sub-score for both intubated patients and, separately, for all patients, using the methodology of Meredith et al.43
While our primary analysis was of the entire patient population, given the statistically significant differences between patients in different admission diagnostic groups, we performed a sensitivity analysis to confirm overall findings in each diagnostic group.45 To report the results of our regression analysis we generated effect plots (rather than display coefficients from the regression equation), which display the association between a given predictor and outcome, with other predictors held constant, over the range of values observed in the predictor.46 We performed all analyses in the R Statistical Package, version 2.14.2 (www.r-project.org) using packages randomForests, party, gbm, effects, cluster, fpc and MASS.47
Missing Data
Missing data were imputed using data from the 24-hour period preceding ICU admission. Where missing data persisted after the initial imputation, we employed a mixed approach to evaluate the robustness of our findings because these data were likely to be Missing Not at Random. Conditional inference trees and our bootstrap logistic regression technique accommodate missing data natively. For random forests and the clustering analyses, we performed several sensitivity analyses: incomplete case (also known as list-wise) deletion, built-in imputation techniques for the clustering analysis, and imputation of normal to missing values i.e. if data were missing, the associated SOFA sub-score was considered to be 0.18
RESULTS
During the study period, 9,558 patients were admitted to the study ICU: 9,120 admissions were the patients’ first admission. SOFA sub-scores varied by admitting diagnosis with trauma and neurological patients being younger and having fewer comorbidities, albeit with slightly higher CNS sub-scores (i.e., lower GCS score), than patients admitted with other diagnoses (Table 2). Overall mortality was 12%. Histogram and line charts displaying the distribution and mortality of the SOFA sub-scores are displayed in Figure 1.
Table 2.
Descriptive statistics.
| Total | Medical | Surgical | Trauma | Cardiac | Neurologic | |
|---|---|---|---|---|---|---|
| Patients | 9120 | 4507 | 1289 | 2733 | 386 | 205 |
| Total SOFA Score | 5 (3–8) | 6 (3–9) | 5 (4–8) | 4 (3–7) | 7 (5–11) | 6 (3–8) |
| Liver | 0 (0–1) | 0 (0–1) | 0 (0–2) | 0 (0–0) | 0 (0–1) | 0 (0–0) |
| Renal | 0 (0–1) | 1 (0–2) | 0 (0–1) | 0 (0–1) | 1 (0–2) | 0 (0–1) |
| Coagulation | 0 (0–1) | 0 (0–1) | 0 (0–1) | 0 (0–1) | 0 (0–1) | 0 (0–1) |
| CNS | 1 (0–3) | 1 (0–2) | 1 (0–2) | 1 (0–3) | 1 (0–4) | 2 (1–3.25) |
| Cardiovascular | 1 (1–1) | 1 (1–1) | 1 (1–1) | 1 (0–1) | 1 (1–2) | 1 (0–1) |
| Respiratory | 2 (1–2) | 2 (1–2) | 2 (1–2) | 2 (1–2) | 2 (2–3) | 2 (1–2) |
| 30-day Mortality | 1086 (12%) | 559 (12%) | 82 (6%) | 295 (11%) | 115 (30%) | 35 (17%) |
| Age (years) | 54 (±20) | 56 (±19) | 59 (±17) | 49 (±22) | 61 (±18) | 54 (±18) |
| Elixhauser Score | 3.3 (2.3) | 4.1 (2.2) | 3.4 (2.3) | 1.9 (1.7) | 4.6 (2.2) | 3.6 (2.3) |
| % Receiving Vasopressors for >1 hr | 1328 (15%) | 846 (19%) | 119 (15%) | 233 (14%) | 113 (16%) | 17 (19%) |
| % Mechanically Ventilated | 2098 (23%) | 876 (23%) | 306 (25%) | 688 (22%) | 164 (25%) | 64 (31%) |
| % on Hemodialysis | 329 (4%) | 237 (5%) | 33 (3%) | 22 (1%) | 32 (8%) | 5 (2%) |
| Female sex | 4115 (45%) | 2265 (50%) | 675 (52%) | 885 (32%) | 196 (51%) | 94 (46%) |
| Race: | ||||||
| American Indian/ Alaskan Native | 55 (1%) | 22 (0%) | 7 (1%) | 22 (1%) | 3 (1%) | 1 (0%) |
| Asian/Pacific Island | 291 (3%) | 146 (3%) | 32 (2%) | 92 (3%) | 10 (3%) | 11 (5%) |
| Black | 140 (2%) | 57 (1%) | 11 (1%) | 57 (2%) | 9 (2%) | 6 (3%) |
| Hispanic | 627 (7%) | 282 (6%) | 66 (5%) | 231 (8%) | 29 (8%) | 19 (9%) |
| Other | 65 (1%) | 40 (1%) | 5 (0%) | 19 (1%) | 1 (0%) | 0 (0%) |
| Unknown | 409 (4%) | 126 (3%) | 13 (1%) | 251 (9%) | 12 (3%) | 7 (3%) |
| White | 7533 (83%) | 3834 (85%) | 1155 (90%) | 2061 (75%) | 322 (83%) | 161 (79%) |
Legend: Descriptive statistics organized by diagnostic group. Data reported as mean (standard deviation), median (25%–75% interquartile range) or count (percentage of total). Each group was statistically different from one another in all variables with a p<0.01.
Figure 1.
Histogram and line charts demonstrating the distribution of the SOFA organ subscores, age and Elixhauser comorbidities.
Random forests analysis produces a variable importance plot, which measures how important to the predictive model each variable is. We display the variable importance plot from the study cohort in Figure 2‥ We tested for interactions between the variables and found interactions between the Elixhauser comorbidity score and both CNS and Cardiovascular sub-scores (h-statistic > 0.15).48 The SOFA CNS sub-score and age were the most relevant variables for prediction of death, while Renal and Coagulation sub-scores were the least relevant. A representative consensus classification tree is depicted in Figure 3, suggesting that a CNS SOFA sub-score of 3 (actual GCS threshold of 6) is the best initial selection variable. This tree is made without manual direction, i.e., all division points are selected by the algorithm automatically. The naïve Bayes classification technique demonstrated age and CNS sub-score were the most important for predicting death, followed by cardiovascular and liver sub-scores. The AUC for the prediction of mortality increased to 0.757, 0.802, 0.809 and 0.817 respectively with each additional variable (age, CNS sub-score, cardiovascular sub-score, liver sub-score) as shown in eTable 1. Effect plots of the prediction model are displayed in eFigure 1.
Figure 2. Plot of Variable Importance Scores from Random Forest Analysis.
Legend: The variable importance plot shows how important to the model each variable is.
Figure 3. Consensus classification tree.
Legend: This is a computer-generated “flowchart” to predict 30-day mortality. A p<0.01 is achieved at each division and each final grouping contains at least 800 patients. The bar charts at the base of the figure depict 30-day mortality for that patient population.
Although each model generated slightly different rankings of variable importance, all models included age and CNS sub-scores as most predictive of death and Renal and Coagulation sub-scores as least predictive of mortality. The sensitivity analysis in which we imputed the GCS verbal sub-score did not change the main findings of the study.
The secondary clustering analyses revealed no meaningful clusters within the SOFA sub-scores (gap statistic confirmed the ideal number of clusters = 1). Silhouette plots with varying cluster sizes and techniques demonstrated that no meaningful clusters had a silhouette width > 0.25 (maximum average silhouette width = 0.24 with two clusters), as shown in eTable 2.38
Our burden of missing data after our standard imputation of using values in the prior 24 hour period was 9% for bilirubin and <1% for all other organ systems. Sensitivity analyses employing various approaches to missing data as well as analyses utilizing diagnostic subgroups versus the entire population revealed minimal differences and did not influence the main findings of the study.
DISCUSSION
In a large, unselected cohort of patients admitted to a mixed-profile ICU, the SOFA CNS sub-score dominated the association between admission SOFA score and 30-day mortality. Age, Cardiovascular sub-score, Liver sub-score, Respiratory sub-score and Elixhauser comorbidity index were all moderately predictive of 30-day mortality. Renal and Coagulation were the least predictive SOFA sub-scores. It may be relevant to the practicing intensivist that not all SOFA sub-scores are equally predictive of mortality. Similarly, this finding may be relevant to applications of the SOFA score to questions of ICU triage.19,49–51
This study highlights several important issues with the SOFA scoring system. While we included trauma patients in this study, the domination of the SOFA mortality prediction by the CNS sub-score was present in patients with all other admitting diagnoses. Our sensitivity analysis imputing the GCS verbal sub-score for the SOFA CNS sub-score confirms the importance of the CNS component of the SOFA score. By utilizing three different methods (imputed verbal component in all patients, imputed verbal component in only intubated patients, and assigning all intubated patients a verbal sub-score of 1 as per the standard GCS methodology) we show that the association between the SOFA CNS sub-score and mortality is independent of intubation status.
Second, we found the Cardiovascular sub-score to be poorly discriminatory. Employing non-standard cutoffs/conversion factors for different vasopressors, and specifying single vasopressors rather than the sum of simultaneous vasopressor infusion rates may have contributed to the poor discrimination associated with the Cardiovascular sub-score. There appears to be room for improvement in the SOFA cardiovascular sub-score.
Third, the addition of age to the SOFA score appears to improve mortality prediction substantially. Age was the second most predictive variable after the CNS sub-score in our patient population. It is likely that age is a combined marker of multiple factors including comorbidities, physiologic reserve, desire for aggressive medical care, and ability to recover after injury.
We hypothesized the existence of meaningful sub-groups, “clusters” or “phenotypes”, of MODS measured by SOFA in a general ICU population. However, standard clustering techniques did not demonstrate the presence of meaningful sub-groups. In a general ICU population, SOFA appears unable to identify meaningful MODS phenotypes. We speculate that within specific disease groupings, such as sepsis, the SOFA score or a modification thereof might still be able to identify meaningful MODS phenotypes.
This study has several limitations. The data were obtained retrospectively and are subject to the limitations inherent in all retrospective studies. We were unable to capture urine output in a reliable manner to use it as described in the original SOFA publications, which may have contributed to the poor predictive ability of the renal component. The study population is from a single ICU staffed mainly by one group of intensivists and may not generalize to other healthcare environments. This study also has several strengths. The broad patient composition increases the generalizability of this study. The EMR at the study hospital is highly detailed, with all study data entered prospectively, allowing for considerably less missing data than in most other retrospective studies.18 Furthermore, we tested our hypothesis with multiple statistical methods, all of which arrived at the same conclusion.
CONCLUSION
In a mixed population of critically ill patients, the Glasgow Coma Scale score dominates the association between admission SOFA score and 30-day mortality. The addition of age to the SOFA score may be beneficial in prediction of 30-day mortality in a mixed ICU population. Future research into outcomes from multiple organ dysfunction may benefit from new models of meaningful organ dysfunction, or perhaps developing or validating a differently weighted SOFA scoring system.
Supplementary Material
Legend: Stepwise bootstrap feature selection for multivariate naïve Bayesian classification model of 30-day mortality with 95% confidence intervals. Each additional variable is added to the prior variable(s) to increases the predictive nature of the model.
Legend: A cluster with a silhouette width > 0.25 suggests the clustering may be significant. The lack of a solution with a silhouette width > 0.25 suggests all the clustering solutions are artificial categorizations and not meaningful.
Legend: Effect plots detailing the association, with other predictors held constant, between a given predictor and 30-day mortality. Dotted lines represent 95% confidence intervals for the association.
Acknowledgments
This study was funded by National Institute of General Medical Sciences (K23GM094465 to SMB), the Intermountain Research and Medical Foundation and the Easton Fund. The authors thank Colin K Grissom, MD for a close reading of the manuscript.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Authors’ contributions:
DK organized and designed the study, participated in data acquisition and data analysis, as well as drafting the manuscript. JJ helped design the study, analyze the data and revised the manuscript for important intellectual content. CP helped acquire the data and revised the manuscript for important intellectual content. KK helped acquire the data and revised the manuscript for important intellectual content. ML assisted in study design and drafting and revision of the manuscript. SB participated in study design, data analysis and manuscript revision for important intellectual content. All authors read and approved the final manuscript.
Competing Interests:
The authors declare that they have no competing interests.
References
- 1.American College of Chest Physicians/Society of Critical Care Medicine Consensus Conference: definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. Crit Care Med. 1992;20:864–874. [PubMed] [Google Scholar]
- 2.Vincent JL, de Mendonca A, Cantraine F, et al. Use of the SOFA score to assess the incidence of organ dysfunction/failure in intensive care units: results of a multicenter, prospective study. Working group on "sepsis-related problems" of the European Society of Intensive Care Medicine. Crit Care Med. 1998;26:1793–1800. doi: 10.1097/00003246-199811000-00016. [DOI] [PubMed] [Google Scholar]
- 3.Tran DD, Groeneveld AB, van der Meulen J, et al. Age, chronic disease, sepsis, organ system failure, and mortality in a medical intensive care unit. Crit Care Med. 1990;18:474–479. doi: 10.1097/00003246-199005000-00002. [DOI] [PubMed] [Google Scholar]
- 4.Tran DD, Van Onselen EB, Wensink AJ, et al. Factors related to multiple organ system failure and mortality in a surgical intensive care unit. Nephrol Dial Transplant. 1994;9(Suppl 4):172–178. [PubMed] [Google Scholar]
- 5.Vincent JL, Moreno R, Takala J, et al. The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure. On behalf of the Working Group on Sepsis-Related Problems of the European Society of Intensive Care Medicine. Intensive Care Med. 1996;22:707–710. doi: 10.1007/BF01709751. [DOI] [PubMed] [Google Scholar]
- 6.Vosylius S, Sipylaite J, Ivaskevicius J. Sequential organ failure assessment score as the determinant of outcome for patients with severe sepsis. Croat Med J. 2004;45:715–720. [PubMed] [Google Scholar]
- 7.Nfor TK, Walsh TS, Prescott RJ. The impact of organ failures and their relationship with outcome in intensive care: analysis of a prospective multicentre database of adult admissions. Anaesthesia. 2006;61:731–738. doi: 10.1111/j.1365-2044.2006.04707.x. [DOI] [PubMed] [Google Scholar]
- 8.Toma T, Abu-Hanna A, Bosman RJ. Discovery and inclusion of SOFA score episodes in mortality prediction. J Biomed Inform. 2007;40:649–660. doi: 10.1016/j.jbi.2007.03.007. [DOI] [PubMed] [Google Scholar]
- 9.Minne L, Abu-Hanna A, de Jonge E. Evaluation of SOFA-based models for predicting mortality in the ICU: A systematic review. Crit Care. 2008;12:R161. doi: 10.1186/cc7160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Jones AE, Trzeciak S, Kline JA. The Sequential Organ Failure Assessment score for predicting outcome in patients with severe sepsis and evidence of hypoperfusion at the time of emergency department presentation. Crit Care Med. 2009;37:1649–1654. doi: 10.1097/CCM.0b013e31819def97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cabré L, Mancebo J, Solsona JF, et al. Multicenter study of the multiple organ dysfunction syndrome in intensive care units: the usefulness of Sequential Organ Failure Assessment scores in decision making. Intensive Care Med. 2005;31:927–933. doi: 10.1007/s00134-005-2640-2. [DOI] [PubMed] [Google Scholar]
- 12.Arts DG, de Keizer NF, Vroom MB, et al. Reliability and accuracy of Sequential Organ Failure Assessment (SOFA) scoring. Crit Care Med. 2005;33:1988–1993. doi: 10.1097/01.ccm.0000178178.02574.ab. [DOI] [PubMed] [Google Scholar]
- 13.van Soest J. Score engineering Optimizing the SOFA score for predicting intensive care mortality using evolutionary strategies. Department of Medical Informatics: University of Amsterdam; 2012. [Google Scholar]
- 14.Ferreira FL, Bota DP, Bross A, et al. Serial evaluation of the SOFA score to predict outcome in critically ill patients. JAMA. 2001;286:1754–1758. doi: 10.1001/jama.286.14.1754. [DOI] [PubMed] [Google Scholar]
- 15.Kajdacsy-Balla Amaral AC, Andrade FM, Moreno R, et al. Use of the sequential organ failure assessment score as a severity score. Intensive Care Med. 2005;31:243–249. doi: 10.1007/s00134-004-2528-6. [DOI] [PubMed] [Google Scholar]
- 16.Levy MM, Macias WL, Vincent JL, et al. Early changes in organ function predict eventual survival in severe sepsis. Crit Care Med. 2005;33:2194–2201. doi: 10.1097/01.ccm.0000182798.39709.84. [DOI] [PubMed] [Google Scholar]
- 17.Junger A, Engel J, Benson M, et al. Discriminative power on mortality of a modified Sequential Organ Failure Assessment score for complete automatic computation in an operative intensive care unit. Crit Care Med. 2002;30:338–342. doi: 10.1097/00003246-200202000-00012. [DOI] [PubMed] [Google Scholar]
- 18.Nates JL, Cárdenas-Turanzas M, Wakefield C, et al. Automating and simplifying the SOFA score in critically ill patients with cancer. Health Informatics J. 2010;16:35–47. doi: 10.1177/1460458209353558. [DOI] [PubMed] [Google Scholar]
- 19.Grissom CK, Brown SM, Kuttler KG, et al. A modified sequential organ failure assessment score for critical care triage. Disaster Med Public Health Prep. 2010;4:277–284. doi: 10.1001/dmp.2010.40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Elixhauser A, Steiner C, Harris DR, et al. Comorbidity measures for use with administrative data. Med Care. 1998;36:8–27. doi: 10.1097/00005650-199801000-00004. [DOI] [PubMed] [Google Scholar]
- 21.Quan H, Sundararajan V, Halfon P, et al. Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care. 2005;43:1130–1139. doi: 10.1097/01.mlr.0000182534.19832.83. [DOI] [PubMed] [Google Scholar]
- 22.Moreno R, Vincent JL, Matos R, et al. The use of maximum SOFA score to quantify organ dysfunction/failure in intensive care. Results of a prospective, multicentre study. Working Group on Sepsis related Problems of the ESICM. Intensive Care Med. 1999;25:686–696. doi: 10.1007/s001340050931. [DOI] [PubMed] [Google Scholar]
- 23.Teasdale G, Jennett B. Assessment of coma and impaired consciousness. A practical scale. Lancet. 1974;2:81–84. doi: 10.1016/s0140-6736(74)91639-0. [DOI] [PubMed] [Google Scholar]
- 24.Severinghaus JW. Simple, accurate equations for human blood O2 dissociation computations. J Appl Physiol. 1979;46:599–602. doi: 10.1152/jappl.1979.46.3.599. [DOI] [PubMed] [Google Scholar]
- 25.Hinkelbein J, Osika A, Fiedler F. Accuracy and precision of pulse oximetry in ventilated patients of an intensive care unit: A-729. European Journal of Anaesthesiology (EJA) 2004;21:179. [Google Scholar]
- 26.Riley PD, Aronsky D, Dean NC. Validation of the 2001 American Thoracic Society criteria for severe community-acquired pneumonia. Crit Care Med. 2004;32:2398–2402. doi: 10.1097/01.ccm.0000147443.38234.d2. [DOI] [PubMed] [Google Scholar]
- 27.Dean NC, Jones JP, Aronsky D, et al. Hospital admission decision for patients with communityacquired pneumonia: variability among physicians in an emergency department. Ann Emerg Med. 2012;59:35–41. doi: 10.1016/j.annemergmed.2011.07.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lanspa MJ, Jones BE, Brown SM, et al. Mortality, morbidity, and disease severity of patients with aspiration pneumonia. J Hosp Med. 2013;8:83–90. doi: 10.1002/jhm.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lobete Prieto CJ. Predicción del índice PaO2/FiO2 a partir del índice SpO2/FiO2 en niños. 2012. [DOI] [PubMed] [Google Scholar]
- 30.Breiman L. Random forests. Machine learning. 2001;45:5–32. [Google Scholar]
- 31.Breiman L. Manual on setting up, using, and understanding random forests v3. 1. CA, USA: Statistics Department University of California Berkeley; 2002. [Google Scholar]
- 32.Liaw A, Wiener M. Classification and Regression by randomForest. R news. 2002;2:18–22. [Google Scholar]
- 33.Hothorn T, Hornik K, Zeileis A. Unbiased recursive partitioning: A conditional inference framework. Journal of Computational and Graphical Statistics. 2006;15:651–674. [Google Scholar]
- 34.Zhang H. The Optimality of Naive Bayes. In: Barr V, Markov Z, editors. Seventeenth International Florida Artificial Intelligence Research Society Conference; Miami Beach, FL: AAAI Press; 2004. pp. 562–567. [Google Scholar]
- 35.Efron B, Tibshirani R. Improvements on Cross-Validation: The .632+ Bootstrap Method. Journal of the American Statistical Association. 1997;92:548–560. [Google Scholar]
- 36.Harrell FE. Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. New York: Springer; 2001. [Google Scholar]
- 37.Hartigan JA, Wong MA. Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics) 1979;28:100–108. [Google Scholar]
- 38.Kaufman L, Rousseeuw P. Finding groups in data: An introduction to cluster analysis. New York: Wiley Series in Probability and Mathematical Statistics; 1990. [Google Scholar]
- 39.Murtagh F. Multidimensional clustering algorithms. Vienna: Physika Verlag; 1985. [Google Scholar]
- 40.Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics. 1987;20:53–65. [Google Scholar]
- 41.Tibshirani R, Walther G, Hastie T. Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2001;63:411–423. [Google Scholar]
- 42.Prasad K, Menon GR. Comparison of the three strategies of verbal scoring of the Glasgow Coma Scale in patients with stroke. Cerebrovascular Diseases. 1998;8:79–85. doi: 10.1159/000015822. [DOI] [PubMed] [Google Scholar]
- 43.Meredith W, Rutledge R, Fakhry SM, et al. The conundrum of the Glasgow Coma Scale in intubated patients: a linear regression prediction of the Glasgow verbal score from the Glasgow eye and motor scores. Journal of Trauma-Injury, Infection, and Critical Care. 1998;44:839–845. doi: 10.1097/00005373-199805000-00016. [DOI] [PubMed] [Google Scholar]
- 44.Rutledge R, Lentz CW, Fakhry S, et al. Appropriate use of the Glasgow Coma Scale in intubated patients: a linear regression prediction of the Glasgow verbal score from the Glasgow eye and motor scores. Journal of Trauma and Acute Care Surgery. 1996;41:514–522. doi: 10.1097/00005373-199609000-00022. [DOI] [PubMed] [Google Scholar]
- 45.Czitrom V. One-factor-at-a-time versus designed experiments. The American Statistician. 1999;53:126–131. [Google Scholar]
- 46.Fox J. Effect displays in R for generalised linear models. Journal of Statistical Software. 2003;8:1–27. [Google Scholar]
- 47.R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2009. [Google Scholar]
- 48.Popescu B, Friedman J. Predictive learning via rule ensembles: Tech. Report. Stanford University; 2005. [Google Scholar]
- 49.Craig D, Zafar S, Reid T, et al. The sequential organ failure assessment (SOFA) score is an effective triage marker following staggered paracetamol (acetaminophen) overdose. Alimentary pharmacology & therapeutics. 2012;35:1408–1415. doi: 10.1111/j.1365-2036.2012.05102.x. [DOI] [PubMed] [Google Scholar]
- 50.Khan Z, Hulme J, Sherwood N. An assessment of the validity of SOFA score based triage in H1N1 critically ill patients during an influenza pandemic. Anaesthesia. 2009;64:1283–1288. doi: 10.1111/j.1365-2044.2009.06135.x. [DOI] [PubMed] [Google Scholar]
- 51.Shahpori R, Stelfox HT, Doig CJ, et al. Sequential Organ Failure Assessment in H1N1 pandemic planning*. Critical care medicine. 2011;39:827–832. doi: 10.1097/CCM.0b013e318206d548. [DOI] [PubMed] [Google Scholar]
- 52.Schuller D, Schuster DP. Fluid-management strategies in acute lung injury. N Engl j Med. 2006;355:1175. doi: 10.1056/NEJMc061857. [DOI] [PubMed] [Google Scholar]
- 53.Brown SM, Lanspa MJ, Jones JP, et al. Survival After Shock Requiring High-Dose Vasopressor TherapyShock Requiring High-Dose Vasopressor Therapy. CHEST Journal. 2013;143:664–671. doi: 10.1378/chest.12-1106. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Legend: Stepwise bootstrap feature selection for multivariate naïve Bayesian classification model of 30-day mortality with 95% confidence intervals. Each additional variable is added to the prior variable(s) to increases the predictive nature of the model.
Legend: A cluster with a silhouette width > 0.25 suggests the clustering may be significant. The lack of a solution with a silhouette width > 0.25 suggests all the clustering solutions are artificial categorizations and not meaningful.
Legend: Effect plots detailing the association, with other predictors held constant, between a given predictor and 30-day mortality. Dotted lines represent 95% confidence intervals for the association.



