We read with interest the article by van den Boogaard and colleagues, which proposed that delirium measured within 24 hours of admission did not improve the Acute Physiology and Chronic Health Evaluation (APACHE) II in-hospital mortality prediction [1]. Their data should be interpreted after considering the study design and statistical limitations.
First, the Confusion Assessment Method for the Intensive Care Unit (CAM-ICU) measurements include assessing the level of consciousness (using any valid sedation scale), which is highly correlated with the Glasgow Coma Score. Therefore it is not surprising that addition of delirium to the APACHE score (which includes the Glasgow Coma Score) on the first intensive care unit day does not alter predictions; however, earlier detection of delirium at the initial evaluation of Emergency Department patients is an important predictor of death [2]. We have found that the level of consciousness (via the Richmond Agitation-Sedation Scale) has been predictive of in-hospital mortality, but this relationship is not as strong as the independent value of delirium duration (via the CAM-ICU) for predicting long-term survival, even after adjusting for APACHE II score and sedatives [3,4].
Second, the authors base their conclusions upon comparisons of areas under the curve using the c statistic. Recent insights suggest that this analytic method is insensitive and open to type II error [5]. A more sensitive method to assess additive predictive ability applies likelihood ratio testing between models with and without additional risk factors. In addition, substantial improvements in risk reclassification may be apparent despite limited increases in the c statistic.
In sum, it may be true (but confirmation is required) that adding delirium to a measurement such as the APACHE score is not of value. Clinicians and hospital quality officers should continue to consider early detection of delirium and ongoing delirium detection as an important prognostic tool.
Authors' response
Mark van den Boogaard, Pieter Leffers and Lisette Schoonhoven
We thank Dr Vasilevskis and coworkers for their interest in our publication [1]. We are fully aware of the limitations of the c statistic as a measure for clinical usefulness of a predictive model - that is why we did not base our conclusions only on the lack of improvement of the c statistic, but also on the deteriorating ability to predict mortality [1].
As Cook pointed out in her publication, the evaluation of the clinical usefulness of risk-stratification models is not at all straightforward [5]; others make it clear that the last word about proper analysis and its interpretations has not yet been written [6,7]. This complicated issue needs further methodological development and thorough discussion. In addition to this, we would like to stress that showing the independent contribution of delirium after controlling for covariables in a Cox regression model is not a valid method to show the clinical usefulness of delirium as a predictor of mortality, not even when the corrected hazard ratio is high [5,8]. Also, showing the improved model fit from adding a variable to a model with the log-likelihood test does not serve that purpose [8].
As Vasilevskis and colleagues correctly point out, the probable reason why delirium does not add to the predictive properties of the APACHE score is that the latter already contains variables that essentially measure the same information about the clinical state of the patient. The predictive validity of a model is usually and mainly determined by its power to discriminate and/or by its ability to predict outcome (calibration). The reclassification index is a potentially interesting tool for evaluation of predictive models. Unfortunately this index is highly dependent on the width of the chosen categories of predicted risk. We do not know of category boundaries that would have a direct meaning for clinical decision-making [5,8]. Because proper interpretation of the index will not be possible, we have chosen not to include such an analysis.
In summary, despite shortcomings of various methods to determine the predictive value, our conclusion remains that delirium does not improve the predictive value of the APACHE score.
Abbreviations
APCHE: Acute Physiology and Chronic Health Evaluation; CAM-ICU: Confusion Assessment Method for the Intensive Care Unit.
Competing interests
The authors declare that they have no competing interests.
See related research by van den Boogaard et al., http://ccforum.com/content/14/4/R146
Contributor Information
Eduard E Vasilevskis, Email: eduard.vasilevskis@vanderbilt.edu.
Jin H Han, Email: jin.h.han@vanderbilt.edu.
Ayumi Shintani, Email: ayumi.shintani@vanderbilt.edu.
Timothy D Girard, Email: timothy.girard@vanderbilt.edu.
E Wesley Ely, Email: wes.ely@vanderbilt.edu.
References
- van den Boogaard M, Peters SAE, van der Hoeven JG, Dagnelie PC, Leffers P, Pickkers P, Schoonhoven L. The impact of delirium on the prediction of in-hospital mortality in intensive care patients. Crit Care. 2010;14:R146. doi: 10.1186/cc9214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han JH, Shintani A, Eden S, Morandi A, Solberg LM, Schnelle J, Dittus RS, Storrow AB, Ely EW. Delirium in the emergency department: an independent predictor of death within 6 months. Ann Emerg Med. 2010;56:244–252. doi: 10.1016/j.annemergmed.2010.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ely EW, Shintani A, Truman B, Speroff T, Gordon SM, Harrell FE Jr, Inouye SK, Bernard GR, Dittus RS. Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. JAMA. 2004;291:1753–1762. doi: 10.1001/jama.291.14.1753. [DOI] [PubMed] [Google Scholar]
- Pisani MA, Kong SYJ, Kasl SV, Murphy TE, Araujo KLB, Van Ness PH. Days of delirium are associated with 1-year mortality in an older intensive care unit population. Am J Respir Crit Care Med. 2009;180:1092–1097. doi: 10.1164/rccm.200904-0537OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115:928–935. doi: 10.1161/CIRCULATIONAHA.106.672402. [DOI] [PubMed] [Google Scholar]
- Janket SJ, Shen Y, Meurman JH. Letter by Janket et al regarding article, 'Use and misuse of the receiver operating characteristic curve in risk prediction'. Circulation. 2007;116:e133. doi: 10.1161/CIRCULATIONAHA.107.714360. [DOI] [PubMed] [Google Scholar]
- Pepe MS, Janes H, Gu JW. Letter by Pepe et al regarding article, 'Use and misuse of the receiver operating characteristic curve in risk prediction'. Circulation. 2007;116:e132. doi: 10.1161/CIRCULATIONAHA.107.709253. [DOI] [PubMed] [Google Scholar]
- Steyerberg EW. Clinical Prediction Models; A Practical Approach to Development, Validation, and Updating. 1. Rotterdam: Springer Science+Business Media, LCC; 2009. [Google Scholar]