Skip to main content
Chest logoLink to Chest
. 2009 Apr 10;136(1):89–101. doi: 10.1378/chest.08-2591

Mortality Probability Model III and Simplified Acute Physiology Score II

Assessing Their Value in Predicting Length of Stay and Comparison to APACHE IV

Eduard E Vasilevskis 1,, Michael W Kuzniewicz 1, Brian A Cason 1, Rondall K Lane 1, Mitzi L Dean 1, Ted Clay 1, Deborah J Rennie 1, Eric Vittinghoff 1, R Adams Dudley 1
PMCID: PMC3198495  PMID: 19363210

Abstract

Background:

To develop and compare ICU length-of-stay (LOS) risk-adjustment models using three commonly used mortality or LOS prediction models.

Methods:

Between 2001 and 2004, we performed a retrospective, observational study of 11,295 ICU patients from 35 hospitals in the California Intensive Care Outcomes Project. We compared the accuracy of the following three LOS models: a recalibrated acute physiology and chronic health evaluation (APACHE) IV-LOS model; and models developed using risk factors in the mortality probability model III at zero hours (MPM0) and the simplified acute physiology score (SAPS) II mortality prediction model. We evaluated models by calculating the following: (1) grouped coefficients of determination; (2) differences between observed and predicted LOS across subgroups; and (3) intraclass correlations of observed/expected LOS ratios between models.

Results:

The grouped coefficients of determination were APACHE IV with coefficients recalibrated to the LOS values of the study cohort (APACHE IVrecal) [R2 = 0.422], mortality probability model III at zero hours (MPM0 III) [R2 = 0.279], and simplified acute physiology score (SAPS II) [R2 = 0.008]. For each decile of predicted ICU LOS, the mean predicted LOS vs the observed LOS was significantly different (p ≤ 0.05) for three, two, and six deciles using APACHE IVrecal, MPM0 III, and SAPS II, respectively. Plots of the predicted vs the observed LOS ratios of the hospitals revealed a threefold variation in LOS among hospitals with high model correlations.

Conclusions:

APACHE IV and MPM0 III were more accurate than SAPS II for the prediction of ICU LOS. APACHE IV is the most accurate and best calibrated model. Although it is less accurate, MPM0 III may be a reasonable option if the data collection burden or the treatment effect bias is a consideration.


The ICU provides advanced and resource-intensive treatment for the sickest hospitalized patients. Care in the ICU accounts for approximately 13% of hospital costs and 4.2% of national health expenditures.1 These costs are largely explained by the length of stay (LOS) in the ICU.2,3 There is significant variation in ICU LOS among hospitals that persists even after adjusting for patient risk factors.46 This possibly reflects variations in ICU organization, safety, quality, or other hospital or community factors such as the availability of non-ICU beds.710

An important objective is to identify ICUs requiring longer or shorter LOSs after accounting for differences in patient characteristics. Comparing risk-adjusted ICU LOSs among ICUs may prove complementary to risk-adjusted mortality and process measures in assessing ICU performance.11 The Joint Commission12 and others13 have expressed interest in public reporting of risk-adjusted ICU LOS.

The acute physiology and chronic health evaluation (APACHE [a registered trademark of Cerner Corporation; Kansas City, MO])14,15 system is the only validated ICU risk-adjustment model that provides performance information about two separate outcomes of care (mortality and ICU LOS). The APACHE IV model is the most recent version. Two other validated ICU mortality prediction models, the mortality probability model III at zero hours (MPM0 III) and the simplified acute physiology score (SAPS) II, use alternative risk-adjustment methods to assess mortality, although they have not been used for LOS prediction.16,17 MPM0 III and SAPS are important to consider for LOS risk adjustment because, as with APACHE, using the data collected for mortality prediction may provide an efficient means of assessing LOS. In addition, both models are used for the purposes of risk adjustment.18,19 In contrast to APACHE, they have fewer risk factors and impose less of a data collection burden.20

We used data from > 11,000 patients in the California Intensive Care Outcomes (CALICO) project to develop and compare the performance of APACHE IV, MPM0 III, and SAPS II models in LOS prediction. In addition, we explored additional patient and hospital factors that may influence ICU LOS or hospital rankings.

Materials and Methods

Hospital Selection

All California hospitals were sent a recruiting packet. A network of volunteer hospitals was established through mailings and regional presentations.

Patient Selection

Data were collected between 2001 and 2004. Inclusion criteria were age ≥ 18 years and ICU stay ≥ 4 h. We excluded patients with conditions that were not examined across each risk-adjustment model, including burns, trauma, and coronary artery bypass graft (CABG) patients. In addition, we excluded patients who had been readmitted to the ICU, consistent with prior studies, and only abstracted data from the index ICU admission. We utilized a proportional sampling method where the goal sample size depended on the hospitals' annual number of ICU admissions.20

Risk Models and Variables

We used the MPM0 III and SAPS II variables specified in their mortality model publications to create a LOS predictive model.16,17 For the APACHE IV model, we used predictor variables detailed in the ICU LOS model publication.15 Trained nurses from participating hospitals abstracted data for all models. ICU LOS, defined in hours and minutes, was the time at discharge from the ICU (either death or physical departure from the unit) minus the time of admission (first recorded vital sign on the ICU flow sheet). The LOS was calculated in days to the second significant digit and truncated at 30 days to minimize the impact of outliers, as previous investigators have done.14,15 MPM0 III required collection of variables within 1 h of admission to the ICU. The other models used the most abnormal physiologic values in the first day after ICU admission. A list of diagnoses organized by system and condition was used to code the reason for ICU admission.21 Data collection methods and interrater reliability have been previously described.20

Statistical Analysis

We compared CALICO hospital characteristics with all California hospitals that had > 50 hospital beds using the 2004 American Hospital Association survey.22 Next, we divided data into development (60%) and validation (40%) samples, and used the χ2 test, Student t test, and Mann-Whitney test, where appropriate, to compare characteristics of the samples.

Due to the hierarchical nature of the data (patients clustered within hospitals), we then used mixed-effects, multilevel modeling to generate ICU LOS prediction models for APACHE IV, MPM0 III, and SAPS II using all variables in the original models. Due to known calibration limitations arising from using estimates of predictive performance on populations other than the one on which a risk model was developed,23,24 we also reestimated the APACHE IV coefficients on the CALICO data set. This was necessary given the different time period, as well as reports of regional variations in health-care utilization patterns,25,26 demographic mix,27,28 and quality of care.29,30 Our recalibration procedure maintained the original variable weights in the APACHE acute physiology score, as well as the spline knot values. The final models are APACHE IV models using coefficients described by the original publication of the APACHE IV LOS model (APACHE IVorig), APACHE IV with coefficients recalibrated to LOS values of the study cohort (APACHE IVrecal), MPM0 III LOS model, and SAPS II LOS model.

Multiple methods were used to assess model performance in the validation sample. First, we used the paired Student t test to compare mean observed ICU LOS to mean predicted ICU LOS for the entire validation population and for specific subgroups (age groups, medical vs surgical patients, and patients grouped by primary clinical system deranged). Second, we divided the sample into deciles of predicted LOS and used the paired Student t test and calibration curves to compare mean predicted LOS to observed LOS for each model. Third, to measure the variance in LOS explained by the models, we calculated coefficients of determination (R2) equal to the square of the correlation coefficient between the individual predicted LOS and the observed LOS. To assess the proportion of variation across hospitals explained by the models, we performed bivariate regressions of the mean observed LOS against the mean predicted LOS (grouped R2) for hospitals with > 100 admissions, which was consistent with the intent of the developers of the original APACHE LOS model.15

Finally, we compared the assessments by the three models of the performance of the ICU of each hospital. The hospital LOS predictions were standardized by calculating a standardized LOS ratio (SLOSR) that was equal to the mean observed LOS divided by the mean predicted LOS for each hospital. Confidence intervals (CIs) were calculated by the Fieller method.31 SLOSRs were limited to hospitals with > 100 admissions, which was consistent with prior studies.15,32 We then assessed intraclass correlations between SLOSRs produced by the models.

Additional Risk Factors and Sensitivity Analyses

Due to the potential relationship of demographic and hospital factors with LOS, we developed additional models using data from the 2004 American Hospital Association survey and the California Office of Statewide Health Planning and Development. We adjusted for “do not resuscitate” (DNR) orders at hospital admission, payor status (Medicare, Medicaid, private, other), and hospital bed size.33,34 We also used Spearman rank correlations to assess the relationship between demographic patient mix (eg, percentage of Medicaid patients) and hospital SLOSR performance assessed by the APACHE IVrecal.

Next, to determine whether hospital SLOSR was sensitive to hospital admission thresholds or the availability of step-down units,35,36 we developed models after excluding patients with very short (< 24 h) LOSs. In addition, to assess the impact of case mix on performance, we assessed the Spearman correlation between the hospital mean severity of illness and the SLOSR.

Finally, we tested an additional SAPS II model treating each variable as an independent predictor, rather than a summed score, to evaluate for differences in model accuracy. The institutional review boards of the University of California, San Francisco, and the state of California approved the study. All analyses were performed using a statistical software package (STATA, version 9.2; Stata Corp; College Station, TX).

Results

Hospital Characteristics

The 35 participating hospitals included 57% not-for-profit institutions, 29% teaching hospitals, 9% hospitals with < 100 beds, 51% with 100 to 300 beds, and 41% with > 300 beds. Additional information on the CALICO hospitals has been previously published.20

Patient Characteristics

A total of 11,366 patients met our inclusion criteria. Of those, 71 patients (0.6%) had missing or indeterminate ICU LOS data, leaving a final data set of 11,295 patients. The overall mean and median LOSs were 4.0 and 2.0 days, respectively. The characteristics between the estimation and validation data sets were statistically similar across all characteristics (Table 1).

Table 1.

Demographic and Clinical Characteristics

Characteristics Total Sample (n = 11,295) Estimation Sample (n = 6,684) Validation Sample (n = 4,611) p Value*
Age, yr 62.2 (17.4) 62.2 (17.6) 62.2 (17.3) 0.94
Age categories 0.33
    18–44 yr 1,919 (17.0) 1,150 (17.2) 769 (16.7)
    45–64 yr 3,852 (34.1) 2,244 (33.6) 1,608 (34.9)
    65–84 yr 4,578 (40.5) 2,711 (40.6) 1,867 (40.5)
    > 85 yr 946 (8.4) 579 (8.7) 367 (8.0)
Race 0.17
    White 6,510 (57.6) 3,787 (56.7) 2,723 (59.1)
    Black 669 (5.9) 409 (6.1) 260 (5.6)
    Hispanic 1,960 (17.4) 1,193 (17.9) 767 (16.6)
    Asian/Pacific Islander 630 (5.6) 379 (5.7) 251 (5.4)
    Native American/other 319 (2.8) 184 (2.8) 135 (2.9)
    Unknown 1,207 (10.7) 732 (11.0) 475 (10.3)
Expected payor 0.27
    Medicare 5,021 (44.5) 2,989 (44.7) 2,032 (44.1)
    Medicaid 1,605 (14.2) 962 (14.4) 643 (13.9)
    Private coverage 2,597 (23.0) 1,490 (22.3) 1,107 (24.0)
    Other (eg, self-pay, workers' compensation, other government) 865 (7.7) 511 (7.7) 354 (7.7)
    Unknown 1,207 (10.7) 732 (11.0) 475 (10.3)
DNR patients at admission 541 (4.8) 313 (4.7) 228 (4.9) 0.52
Operative status 0.65
    Nonoperative 8,789 (77.8) 5,181 (77.5) 3,608 (78.3)
    Elective surgery 2,016 (17.9) 1,208 (18.1) 808 (17.5)
    Emergency surgery 490 (4.3) 295 (4.4) 195 (4.2)
Severity of illness
    APACHE score 44.9 (27.6) 44.7 (27.4) 45.2 (28.0) 0.31
    SAPS II score 33.2 (17.6) 33.1 (17.5) 33.4 (17.7) 0.41
Location prior to ICU admission 0.51
    Emergency department 5,548 (49.1) 3,270 (48.9) 2,278 (49.4)
    Operating room/recovery room 2,506 (22.2) 1,503 (22.5) 1,003 (21.8)
    Floor 2,426 (21.5) 1,421 (21.3) 1,005 (21.8)
    Transfer from another hospital 440 (3.9) 255 (3.8) 185 (4.0)
    Other 375 (3.3) 235 (3.5) 140 (3.0)
Primary reason for admission: system 0.49
    Cardiac 4,699 (41.6) 2,759 (41.3) 1,940 (42.1)
    Pulmonary 2,181 (19.3) 1,286 (19.2) 895 (19.4)
    GI 1,480 (13.1) 900 (13.5) 580 (12.6)
    Neurologic 1,582 (14.0) 923 (13.8) 659 (14.3)
    GU 269 (2.4) 172 (2.6) 97 (2.1)
    Overdose/poisoning 379 (3.4) 216 (3.2) 163 (3.5)
    Metabolic 392 (3.5) 232 (3.5) 160 (3.5)
    Hematologic/oncologic 115 (1.0) 71 (1.1) 44 (1.0)
    Other 198 (1.8) 125 (1.9) 73 (1.6)
LOS
    Prior LOS,§ d 0.3 (0.1–0.8) 0.3 (0.1–0.8) 0.3 (0.1–0.8) 0.98
    ICU LOS, d 4.0 (6.4) 4.0 (6.7) 4.0 (6.2) 0.93
    ICU LOS,§ d 2.0 (1.0–4.1) 2.0 (1.0–4.2) 1.9 (1.0–4.1) 0.24
    ICU mortality 1,279 (11.4) 752 (11.3) 527 (11.4) 0.77
    In-hospital mortality 1,766 (15.6) 1,036 (15.5) 730 (15.8) 0.63

GU = genitourinary.

*The p values are based on χ2 test of statistical independence for categorical data, Student t test for parametric data, or Mann-Whitney test for nonparametric data. Totals may not add to 100% due to rounding.

†Values are given as the mean (SD).

‡Values are given as the No. (%).

§Values are given as the median (interquartile range).

Predictive Performance of Four Models

The development sample (n = 6,684) was used to estimate coefficients for each model. Coefficients for MPM0 III LOS and SAPS II LOS models are given in Table 2. Original coefficients for APACHE IV LOS are publicly available,12 and reestimated coefficients are given in the Appendix.

Table 2.

Coefficients for MPM0 III LOS and SAPS II LOS Models

Variables Coefficient for Estimation Sample (n = 6,684) 95% CI
MPM0 III LOS model
Heart rate ≥ 150 beats/min 1.6517 0.9290 to 2.3744
SBP ≤ 90 mm Hg 0.1442 −1.0821 to 1.3704
Chronic kidney disease −0.5952 −1.1567 to −0.0337
Cirrhosis 1.3865 −1.4989 to 4.2718
Coma/deep stupor −1.4622 −3.4426 to 0.5182
Metastatic neoplasm 3.4601 1.1031 to 5.8171
Acute renal failure 0.6548 −0.1365 to 1.4461
Cardiac dysrhythmia −0.9552 −3.0329 to 1.1225
Cerebrovascular incident 1.1122 0.5227 to 1.7016
GI bleed −0.7975 −1.3560 to −0.2390
Intracranial mass effect 1.8107 −0.0294 to 3.6508
CPR before ICU admission 1.9279 −0.5657 to 4.4215
Mechanical ventilation 2.4888 2.1530 to 2.8246
Unscheduled surgical admission or medical admission 1.3964 1.0410 to 1.7518
Age (per 10 yr) 0.1369 0.0562 to 0.2176
Full code on ICU admission 0.8537 0.2926 to 1.4147
Zero risk factors (no factors other than age) −0.6006 −0.9936 to −0.2076
Interaction terms
Age coma/deep stupor 0.1247 −0.1714 to 0.4208
Age SBP ≤ 90 mm Hg 0.0165 −0.1667 to 0.1997
Age cirrhosis −0.0546 −0.5703 to 0.4610
Age metastatic neoplasm −0.4949 −0.8649 to −0.1249
Age cardiac dysrhythmia −0.0051 −0.2941 to 0.2838
Age intracranial mass effect −0.3209 −0.6210 to −0.0208
Age CPR prior to admission −0.2442 −0.6078 to 0.1193
Intercept 0.5566 −0.3409 to 1.4541
SAPS II LOS model
SAPS score 0.0178 0.0019 to 0.0337
Log (SAPS score) 1.6057 1.1150 to 2.0965
Intercept −2.2334 −3.4928 to −0.9741

CPR = cardiopulmonary resuscitation; SBP = systolic BP.

Model performance was assessed in the 40% validation sample (n = 4,611). The difference between the mean observed LOS and the predicted ICU LOS for the validation sample was 4.6 h for APACHE IVorig (p = 0.006), 1.7 h for APACHE IVrecal (p = 0.32), 0.2 h for MPM0 III LOS (p = 0.90), and 0.4 h for SAPS II LOS (p = 0.82). Observed LOS vs predicted LOS for strata of age, medical vs surgical admission status, and the primary system affected leading to ICU admission are displayed in Table 3. APACHE IVorig, APACHE IVrecal, and MPM0 III LOS each had a single age stratum with significant differences between observed and predicted LOS. SAPS II LOS systematically underpredicted LOS for younger patients and overpredicted LOS for older patients. APACHE IVrecal and MPM0 III-LOS accurately predicted ICU LOS for medical and elective surgical patients. For more specific diagnostic categories, including emergency surgery, APACHE IVrecal was the most accurate.

Table 3.

Difference Between Observed and Predicted LOS for Age and Primary Medical/Surgical System Categories on Validation Sample

APACHE IVorig Model
APACHE IVrecal Model
MPM0 III LOS Model
SAPS II LOS Model
Variables Patients, No. Difference of Observed Minus Predicted, d p Value* Difference of Observed Minus Predicted, d p Value* Difference of Observed Minus Predicted, d p Value* Difference of Observed Minus Predicted, d p Value*
Age
    18–30 yr 224 0.5 0.08 0.2 0.4 0.0 0.93 0.8 0.006
    31–45 yr 602 0.1 0.46 0.0 1.0 −0.1 0.61 0.4 0.03
    46–60 yr 1,209 0.4 0.003 0.1 0.45 0.3 0.04 0.5 0.002
    61–70 yr 864 0.2 0.14 0.0 0.89 0.1 0.39 0.1 0.74
    71–80 yr 1,012 −0.1 0.31 −0.3 0.02 −0.2 0.11 −0.6 < 0.001
    ≥ 81 yr 700 0.2 0.37 −0.2 0.18 −0.2 0.18 −0.8 < 0.001
Medical vs surgical status
    Elective surgery 808 0.3 0.04 −0.1 0.64 0.0 0.8 −0.1 0.27
    Emergency surgery 195 0.3 0.45 0.2 0.67 0.8 0.05 1.4 0.002
    Medical 3608 0.2 0.04 −0.1 0.29 0.0 0.75 −0.1 0.45
Medical/surgical system
    Cardiac medical 1,670 0.0 0.88 −0.2 0.02 −0.4 < 0.001 −0.8 < 0.001
    Cardiac surgical 270 0.5 0.03 0.2 0.46 −0.2 0.48 −0.4 0.1
    Pulmonary medical 759 0.7 0.006 0.3 0.28 1.5 < 0.001 1.9 < 0.001
    Pulmonary surgical 136 0.7 0.10 −0.2 0.61 0.5 0.21 0.4 0.38
    GI medical 297 −0.1 0.53 −0.4 0.12 −0.4 0.1 −0.7 0.002
    GI surgical 283 0.2 0.60 0.0 0.88 0.7 0.03 0.8 0.009
    Neurologic medical 441 −0.1 0.56 −0.3 0.22 −0.3 0.28 0.3 0.29
    Neurologic surgical 218 0.1 0.77 −0.1 0.8 0.0 0.91 0.1 0.63
    GU medical 63 0.7 0.37 0.5 0.55 0.4 0.66 0.0 0.97
    GU surgical 34 −0.1 0.77 −0.1 0.83 −0.4 0.26 −0.7 0.04
Overdose/poisoning 163 0.21 0.38 0.2 0.5 −1.4 < 0.001 −0.8 0.001
Metabolic 160 0.5 0.03 0.2 0.31 −0.7 0.003 −1.0 < 0.001
Hematology/oncology 44 −0.8 0.03 −1.4 < 0.001 −1.3 0.003 −1.6 < 0.001
Other 73 1.1 0.12 1.0 0.13 0.3 0.7 0.6 0.37

*Based on paired Student t tests. See Table 1 for abbreviation not used in the text.

For each decile of predicted ICU LOS, the difference between mean observed and predicted LOS differed significantly (p ≤ 0.05) for 6, 3, 2, and 6 of the 10 deciles, respectively, using APACHE IVorig, APACHE IVrecal, MPM0 III LOS, and SAPS II LOS (Table 4). This is graphically represented in Figure 1 as calibration curves. The calibration curve of APACHE IVorig demonstrates poor fit at the lowest deciles. APACHE IVrecal demonstrates excellent fit, with the poorest calibration in the lowest decile. MPM0 III LOS demonstrates an excellent fit as well. SAPS II LOS appears to have a poor fit across multiple deciles.

Table 4.

Differences Between Observed and Predicted LOS Across Decile of Predicted LOS for Each Model in Validation Data Set

APACHE IVorig Model
APACHE IVrecal Model
MPM III0 LOS Model
SAPS II LOS Model
Decile of Predicted ICU LOS,* % Patients, No. Mean Observed ICU LOS, d Mean Predicted ICU LOS, d Difference of Observed-Predicted LOS, d p Value Patients, No. Mean Observed ICU LOS, d Mean Predicted ICU LOS, d Difference of Observed-Predicted LOS, d p Value Patients, No. Mean Observed ICU LOS, d Mean Predicted ICU LOS, d Difference of Observed-Predicted LOS, d p Value Patients, No. Mean Observed ICU LOS, d Mean Predicted ICU LOS, d Difference of Observed-Predicted LOS, d p Value
0–10 462 1.5 0.6 0.9 < 0.001 462 1.5 0.9 0.7 < 0.001 462 2.4 2.0 0.3 0.05 564 2.0 1.8 0.2 0.08
11–20 461 1.8 1.2 0.6 < 0.001 461 1.8 1.6 0.1 0.19 462 2.4 2.5 −0.1 0.45 381 2.4 2.8 −0.4 0.03
21–30 461 2.1 1.6 0.5 < 0.001 461 2.1 2.0 0.1 0.33 461 2.7 2.7 0.0 0.84 525 2.5 3.1 −0.6 < 0.001
31–40 461 2.4 2.1 0.4 0.003 461 2.5 2.4 0.1 0.48 464 3.1 3.0 0.1 0.53 426 2.7 3.4 −0.7 < 0.001
41–50 461 2.9 2.7 −0.2 0.23 461 2.8 3.0 −0.2 0.16 457 3.2 3.1 0.1 0.67 470 3.6 3.7 −0.1 0.77
51–60 461 3.2 3.5 −0.3 0.11 461 3.2 3.6 −0.4 0.03 461 2.8 3.3 −0.5 < 0.001 446 3.6 4.0 −0.4 0.10
61–70 461 3.9 4.3 −0.4 0.05 461 4.2 4.5 −0.3 0.17 461 3.8 3.9 −0.1 0.66 454 4.7 4.3 0.4 0.11
71–80 461 4.9 5.3 −0.4 0.16 461 4.9 5.4 −0.5 0.04 462 4.4 4.8 −0.4 0.15 457 5.4 4.6 0.8 < 0.01
81–90 461 6.1 6.5 −0.4 0.18 461 6.6 6.6 0.0 0.92 464 6.1 5.8 0.3 0.33 456 6.8 5.1 1.7 < 0.001
91–100 461 9.1 8.3 0.8 0.05 461 8.4 8.7 −0.3 0.48 457 7.1 6.7 0.4 0.29 432 4.6 5.9 −1.4 < 0.001

*Population sorted by increasing predicted risk and then split into deciles.

†Based on paired Student t test.

Figure 1.

Figure 1

Calibration curves comparing mean observed and mean predicted ICU LOS for four ICU LOS models.

The coefficients of determination for patient-level ICU LOS predictions were as follows: APACHE IVorig, R2 = 0.182; APACHE IVrecal, R2 = 0.202; MPM0 III LOS, R2 = 0.098; and SAPS II LOS, R2 = 0.049. Grouped R2 analysis for the 29 hospitals with > 100 admissions were as follows: APACHE IVorig, R2 = 0.439; APACHE IVrecal, R2 = 0.422; MPM0 III LOS, R2 = 0.279; and SAPS II LOS, R2 = 0.008. This indicates that 42% and 28%, respectively, of the ICU LOS variations are accounted for by APACHE IVrecal and MPM0 III-LOS.

Finally, Figure 2 displays a comparison of the predictions of the models for hospital-level SLOSRs, excluding the original APACHE model. Regardless of the model used, there was significant variation in SLOSRs among 29 hospitals with > 100 admissions. There were similar ranges among the SLOSRs of the hospitals for each model as follows: APACHE IVrecal, 0.47 to 1.60; MPM0 III LOS, 0.40 to 1.68; and SAPS II LOS, 0.38 to 1.69. The intraclass correlations of the SLOSRs between each pair of models were high: APACHE IVrecal and MPM0 III-LOS, r = 0.89 (95% CI, 0.74 to 0.96); APACHE IVrecal and SAPS II-LOS, r = 0.85 (95% CI, 0.70 to 0.93); and MPM0 III-LOS and SAPS II-LOS, r = 0.96 (95% CI, 0.92 to 0.98).

Figure 2.

Figure 2

Plot of LOS prediction model-specific SLOSRs for each hospital with at least 100 admissions.

Additional Risk Factors and Sensitivity Analyses

The addition of DNR status and Medicaid payment (when compared to private insurance) to APACHE IV models independently predicted shorter LOS (−1.10 days; 95% CI, −0.57 to −1.65) and longer LOS (0.74 days; 95% CI, 0.38 to 1.09), respectively. The number of hospital beds had no effect. Each of these factors did not significantly improve the accuracy, calibration, or agreement of hospital SLOSRs between each model. In addition, there was no statistically significant correlation between percentages of DNR patients (r = 0.18; p = 0.36) or Medicaid patients (r = 0.35; p = 0.06) of the hospital and the SLOSR. Likewise, there was no statistically significant correlation between bed size (r = −0.25; p = 0.22) and SLOSR.

Models developed on the population excluding patients with the short ICU LOS (< 24 h) maintained excellent calibration for APACHE IVrecal and improved calibration for MPM0 III LOS. The range of SLOSRs for each model when excluding patients with LOS < 24 h (SLOSR range: APACHE IVrecal, 0.58 to 1.49; MPM0 III LOS, 0.61 to 1.46; and SAPS II LOS, 0.55 to 1.53) was smaller than the range of SLOSRs produced when using all patients in the sample, with comparable agreement. There was no correlation between the mean severity of illness of the hospitals (r = −0.05; p = 0.80) and the SLOSR. The mean SLOSRs of the five hospitals with the lowest and highest mean severity of illness were 1.0 (SD, 0.2) and 1.0 (SD, 0.3), respectively.

Finally, a model based on the SAPS II LOS independent variables revealed no meaningful differences in accuracy (R2 = 0.061) and calibration between that and the primary SAPS II model used in the analyses just cited. No further data from that model are presented.

Discussion

Our study is the first description of the use of MPM0 III LOS and SAPS II LOS variables for the additional purpose of predicting risk-adjusted ICU LOS. In addition, our study is the first independent validation of the APACHE IV LOS model. We have shown MPM0 III LOS, an alternative risk-adjustment model originally developed for mortality prediction, can also be used for predicting LOS in a broad medical and surgical population. However, SAPS II LOS did not appear well suited for LOS prediction. The MPM0 III LOS model explains the lower variation in hospital-level LOSs but requires substantially fewer resources to implement than the APACHE IV LOS model. Individual hospitals received similar rankings with these two models.

Regardless of the model, we observed sizable variations in risk-adjusted LOS performance among hospitals that could not be accounted for by patient risk factors. The apparent variation in ICU LOS after accounting for differences in patient severity of illness supports the need to assess risk-adjusted ICU LOS as one aspect of performance.

The primary objective of our study was to assess the utility of two established mortality prediction models in predicting an alternative outcome, ICU LOS, and to compare these models to the APACHE IVorig and APACHE IVrecal models. With regard to model accuracy, APACHE IVrecal has the best predictive accuracy across clinical categories, excellent calibration, and the highest grouped R2. The APACHE IVrecal model proved more accurate when compared to the APACHE IVorig model. There are many potential reasons for this, as follows: (1) the CALICO cohort had a different patient mix, including more nonsurgical patients and higher mean APACHE score; (2) when compared to APACHE IVorig, the coefficients for individual risk factors differed across many domains, including, but not limited to, acute and chronic diagnoses; (3) patterns in health-care utilization may differ in the CALICO cohort; and (4) in contrast to CALICO hospitals, the APACHE IV cohort hospitals were users of the APACHE system,15 which could be a marker of increased attention toward quality, efficiency, and information technology.

The superior predictive accuracy of APACHE IVrecal compared to the other models may be explained by having more variables. Including the ICU admitting diagnosis may be particularly influential because prior research15 has shown that they account for up to 17% of the explanatory power of the original APACHE IV model. In addition, the use of linear splines to model nonlinearities in predictor response (eg, acute physiology score) address the reality that patients with both the lowest and highest acute physiology scores will generally have shorter average LOSs.15 Alternatively, it may be that part of the additional predictive power comes from including variables that reflect pre-ICU care, such as pre-ICU LOS and admission source, or response to treatment (because the worst physiology values for the first 24 h are included). Further research is needed to define the source of the additional predictive power and to assess whether including these variables is actually desirable. For instance, if the model predicts LOS better because it “risk adjusts” for undertreatment, that may not be desirable.

The poor accuracy of SAPS II LOS suggests that this model is inadequate for predicting LOS. The limited value of the SAPS II LOS model might be improved by reweighting the individual variables that make up the SAPS II LOS score or modeling their relationships to LOS as nonlinear. Treating the individual variables as independent rather than summarized did not provide significant additional benefit.

With > 100 fewer model coefficients than APACHE IVrecal and without modeling nonlinear relationships, the MPM0 III LOS model nonetheless displayed fair accuracy and excellent calibration. Despite a low R2 for predicting an individual patient's LOS, MPM0 III LOS was effective in predicting LOS across hospital, demographic, and broad clinical groups. The inability of the MPM0 III LOS model to predict LOS especially well for derangements of an individual physiologic system reflects the absence in the MPM0 III model of a variable indicating the system involved. This suggests that MPM0 III LOS may be poorly suited for assessing the performance of individual specialty ICUs. MPM0 III LOS may also be poorly suited for assessing ICUs that care for a large proportion of emergency surgery patients (eg, trauma ICU). Despite being statistically significant, differences between predicted LOS and actual LOS did not always appear to be clinically significant (eg, for the medical cardiac system, a difference of < 12 h). Therefore, if predictions for clinical subgroups are an important goal, the MPM0 III LOS model may be considered, albeit with caution.

MPM0 III LOS and APACHE IVrecal were also similar in their appraisals of hospital performance. Performance assessments from the two models were highly correlated (r = 0.89) and were not significantly affected by additional patient and hospital factors (eg, DNR status, payor status, number of hospital beds). Limiting the sample to patients with an ICU stay of at least 24 h maintained high correlation (r = 0.85) and improved calibration of the MPM0 III LOS model. Improvement in calibration may reflect difficulty in predicting LOS for patients with very short ICU stays due to low severity of illness or early mortality. Performance estimates on this reduced sample were more conservative, as evidenced by a narrower range of SLOSRs. Therefore, one would expect fewer performance outliers in the restricted sample.

With respect to model accuracy, the APACHE IV LOS model is a superior tool for LOS risk adjustment. APACHE IV is an excellent tool for hospital mortality risk adjustment and, unlike the MPM0 III model, has been applied as well to CABG patients. However, there are real-world limitations in data collection, so using MPM0 III may be a legitimate consideration. First, MPM0 III is a validated tool for risk-adjusted mortality,18 and it involves about a third the data collection time of APACHE IV.20 Few hospitals currently have ICU risk variables available electronically, and the degree to which hospitals face resource and technology barriers may influence the preferences for MPM0 III LOS vs APACHE IV LOS.37,38 However, this benefit of the MPM0 III LOS model may be lessened if hospitals are not currently using a risk-adjustment model for CABG patients and are considering the measurement of ICU and CABG outcomes. Second, because model performance deteriorates over time or when applied to populations that differ from the one used for model development, another factor to consider is the ability to reestimate the model to the study population. With substantially fewer coefficients, reestimation of the MPM0 III LOS requires a smaller database and, hence, can be performed more often or when the size of the database does not allow for the recalibration of APACHE. This problem with APACHE would be lessened if the Joint Commission was to adopt a national ICU performance set, therefore creating a large national database with which frequent recalibration would be possible with any model. Finally, the MPM0 III LOS model only uses risk information from the first hour after a patient's ICU admission, whereas the APACHE IV LOS model requires data be collected throughout the first day of ICU care. Limiting the data collection period may decrease the resources needed to collect data and limits the influence of treatment on the predicted LOS. For example, although hypotension that results from sepsis should be included as a risk factor, hypotension caused by failure to treat appropriately (eg, not starting appropriate therapy with antibiotics in sepsis patients) should not. Models that use post-hospital admission data cannot distinguish between these cases, so their better predictive ability may not always serve the purpose of identifying the best performing ICUs.

Our study has important limitations to consider. One is that we used a convenience sample of volunteer hospitals from California. Despite this, the sampling strategy is more likely to affect the estimation of individual model coefficients and is less likely to affect the comparisons between the models. We would recommend a reestimation of the coefficients for all models if applied to a national sample. Second, our hospital sample has a limited number of performance outliers. A larger sample of hospitals is needed to draw more reliable conclusions about the validity of the three models for identifying performance outliers. Third, the recently updated SAPS III model39 became available after our data collection began, so we did not capture all of its required data elements. Finally, although LOS may be a useful measure, it is likely affected by hospital discharge policies, bed availability, and community resources. Adding information about these factors might improve the predictive capacity of LOS models, although it would require frequently updated hospital-level information (eg, the number of stepdown unit or regular ward beds that are available on each hospital day). In addition, adding these factors to LOS models would mask the extent to which the management of these resources by a hospital contributes to its ICU LOS. Because understanding (and eliminating) the impact of such factors is a goal of clinicians and policymakers who seek to assess ICU LOS, their inclusion in predictive models would improve accuracy but might reduce the relevance of the assessments. In any case, risk-adjusted LOS should be used as a complementary measure to a suite of ICU performance measures, including structural, process, and outcomes measures of performance, because these other measures may both help to explain variations in ICU LOS and contribute to efforts to improve performance.33,40,41

In summary, the APACHE IVrecal and MPM0 III LOS model are more accurate than the SAPS II LOS model for the prediction of ICU LOS. APACHE IVrecal is the most accurate LOS prediction model for specific ICU subpopulations. This is in part due to its larger number of variables, but it also likely reflects a longer window of data collection (the first 24 h, instead of the first hour, in the ICU). It is the preferred model when either ample resources are available for data collection or the APACHE IV variables can be generated by an electronic medical record, and there are no concerns about treatment impacting measured severity of illness over the first day of treatment. The MPM0 III LOS model is less accurate, although it performs well across broad hospital populations, imposes less of a data collection burden, uses a shorter data collection window, and, therefore, is less likely to be influenced by treatment. The final choice of a model by physicians, hospitals, quality-reporting groups, or payers must reflect value judgments regarding the balance between predictive accuracy and data burden. Only with a wider application of risk-adjusted LOS and mortality measures will we understand those factors that account for the large observed differences in hospital outcomes and be able to accelerate improvements in ICU care.

Acknowledgment:

We acknowledge Teresa Chipps, BS, Department of Medicine (General Internal Medicine and Public Health), Center for Health Services Research, Vanderbilt University, Nashville, TN, for her administrative and editorial assistance in the preparation of this article.

Abbreviations:

APACHE

acute physiology and chronic health evaluation

APACHE IVorig

acute physiology and chronic health evaluation using coefficients described by the original publication of the acute physiology and chronic health evaluation IV length-of-stay model

APACHE IVrecal

acute physiology and chronic health evaluation IV with coefficients recalibrated to the length-of-stay values of the study cohort

CABG

coronary artery bypass graft

CALICO

California Intensive Care Outcomes

CI

confidence interval

DNR

do not resuscitate

LOS

length of stay

MPM0 III

mortality probability model III at zero hours

SAPS

simplified acute physiology score

SLOSR

standardized length of stay ratio

Appendix

Appendix 1.

Reestimated Coefficients for APACHE IV LOS Model

Variables Coefficient Estimation Sample(n = 6,684) 95% CI
Age 0.0078 −0.0234 to 0.0390
    Knot = 27 0.000001 −0.00003 to 0.00003
    Knot = 51 −0.000059 −0.0003 to 0.0001
    Knot = 64 0.00027 −0.0003 to 0.0009
    Knot = 74 −0.00066 −0.0016 to 0.0003
    Knot = 86 0.0021 −0.0003 to 0.0045
Comorbidity
    None Reference Reference
    Cirrhosis −0.0547 −0.8426 to 0.7334
    Immunosuppressed −0.0917 −0.6706 to 0.4873
    Cancer, metastatic −0.2231 −0.8596 to 0.4134
    Lymphoma 0.0901 −1.1180 to 1.2981
    Hepatic failure 2.3535 1.2357 to 3.4713
    AIDS −0.4178 −1.8666 to 1.0310
    Leukemia, myeloma 0.8278 −0.3980 to 2.0537
APS 0.0411 −0.0204 to 0.1025
    Knot = 10 −0.000034 −0.0002 to 0.0001
    Knot = 22 0.00016 −0.0002 to 0.0006
    Knot = 32 −0.00021 −0.0006 to 0.0001
    Knot = 48 0.000085 −0.00002 to 0.0002
    Knot = 89 0.000001 −0.00003 to 0.00003
Pao2/Fio2 ratio −0.0052 −0.0063 to −0.0041
Ventilated on ICU day 1 1.8966 1.5566 to 2.2366
Admission source
    Other Reference Reference
    Floor 0.3217 −0.0208 to 0.6643
    Other hospital 1.3000 0.6194 to 1.9807
    Operating/recovery room −1.0302 −2.2836 to 0.2233
Emergency surgery 1.1476 0.5190 to 1.7762
Previous LOS −0.2760 −1.4315 to 0.8795
    Knot = 0.121 1.7218 −1.0812 to 4.5249
    Knot = 0.423 −3.3143 −8.8047 to 2.1762
    Knot = 0.794 1.6265 −1.1756 to 4.4285
    Knot = 2.806 −0.0392 −0.1899 to 0.1114
Thrombolytic therapy for AMI 0.3031 −0.6018 to 1.2080
GCS score 0.0215 −0.0214 to 0.0645
Unable to assess GCS 0.7593 0.3503 to 1.1682
Nonoperative diagnostic groups
    Cardiovascular diagnoses
    AMI
    Anterior 0.0926 −0.8988 to 1.0841
    Inferior/lateral −0.2644 −1.2252 to 0.6964
    Non-Q wave −0.6638 −2.2126 to 0.8849
    Other Reference Reference
    Cardiac arrest 1.8213 0.2694 to 3.3731
    Cardiogenic shock 0.8254 −0.5682 to 2.2191
    Cardiomyopathy −0.2542 −2.3527 to 1.8442
    Congestive heart failure −0.1450 −0.9686 to 0.6785
    Chest pain, rule out AMI 1.0292 −2.1827 to 4.2410
    Hypertension −0.3278 −1.5456 to 0.8899
    Hypovolemia/dehydration (not shock) −0.5539 −2.9398 to 1.8320
    Hemorrhage (not related to GI bleeding) −1.8497 −4.8867 to 1.1873
    Aortic aneurysm, dissecting 1.5569 −0.9018 to 4.0156
    Peripheral vascular disease 0.1520 −1.7145 to 2.0185
    Rhythm disturbance −0.3191 −1.1107 to 0.4725
    Sepsis
    Cutaneous 0.2151 −1.8327 to 2.2629
    GI 0.3856 −1.3586 to 2.1298
    Pulmonary 2.2312 0.8886 to 3.5737
    Urinary tract 0.6214 −0.5759 to 1.8188
    Other 0.0842 −2.9556 to 3.1241
    Unknown 0.4545 −0.5199 to 1.4289
    Cardiac drug toxicity 0.4403 −1.9391 to 2.8198
    Unstable angina −0.2866 −1.2664 to 0.6932
    Cardiovascular, other −0.0935 −0.9220 to 0.7351
    Respiratory diagnoses
    Airway obstruction −1.1566 −2.5816 to 0.2683
    Asthma −0.9504 −2.4029 to 0.5021
    Aspiration pneumonia 1.8594 0.6822 to 3.0366
    Bacterial pneumonia 1.3593 0.5127 to 2.2059
    Viral pneumonia 11.9734 7.9610 to 15.9858
    Parasitic/fungal pneumonia −0.3144 −2.4677 to 1.8390
    COPD −0.5337 −1.4327 to 0.3653
    Pleural effusion 2.3729 0.2764 to 4.4693
    Pulmonary edema (noncardiac, ARDS) 1.8502 0.5768 to 3.1236
    Pulmonary embolism 0.0365 −1.3239 to 1.3969
    Respiratory arrest 5.5090 2.4528 to 8.5652
    Respiratory cancer 1.6241 −0.7706 to 4.0187
    Restrictive lung disease −0.3943 −3.4324 to 2.6439
    Respiratory, other 0.6541 −0.2716 to 1.5797
    GI diagnoses
    GI bleeding, upper −0.1162 −1.0717 to 0.8393
    GI bleeding, lower 0.0846 −1.2942 to 1.4634
    GI bleeding, varices 0.0706 −1.1279 to 1.2691
    GI inflammatory disease 2.0000 0.1665 to 3.8335
    Neoplasm −0.1524 −2.4206 to 2.1158
    Obstruction −1.5949 −4.4752 to 1.2853
    Perforation 2.3205 −2.1588 to 6.7999
    Vascular insufficiency 0.3367 −5.9729 to 6.6464
    Hepatic failure 1.3973 −0.8488 to 3.6434
    Intra/retroperitoneal hemorrhage −0.0192 −4.0357 to 3.9974
    Pancreatitis −0.0271 −2.1165 to 2.0623
    GI, other 1.0184 −0.7128 to 2.7496
    Neurologic diagnoses
    Intracerebral hemorrhage 1.1529 0.2131 to 2.0927
    Neurologic neoplasm 0.1640 −1.9908 to 2.3188
    Neurologic infection 0.2610 −1.4320 to 1.9541
    Neuromuscular disease −0.3268 −2.9793 to 2.3256
    Drug overdose −0.9729 −1.8955 to −0.0502
    Subdural/epidural hematoma 0.4392 −1.0542 to 1.9326
    Subarachnoid hemorrhage, intracranial aneurysm 2.9454 1.6706 to 4.2203
    Seizures (no structural disease) −0.0589 −1.2930 to 1.1753
    Stroke 0.9552 −0.3379 to 2.2483
    Neurologic, other 2.3299 0.8984 to 3.7615
    Metabolic/endocrine diagnoses
    Acid-base, electrolyte disorder 0.1873 −1.6411 to 2.0156
    Diabetic ketoacidosis −0.6338 −1.6196 to 0.3521
    Diabetic HHNC −0.5630 −1.7594 to 0.6334
    Metabolic/endocrine, other −0.2237 −1.6751 to 1.2277
    GU diagnoses
    Renal, other 0.1151 −0.9682 to 1.1983
    Miscellaneous diagnoses
    General, other −0.2454 −1.7009 to 1.2101
Operative diagnoses
    Cardiovascular surgery
    Valvular heart surgery −1.0431 −2.8156 to 0.7295
    Aortic aneurysm, elective repair 0.4275 −1.3497 to 2.2047
    Aortic aneurysm, ruptured 0.5937 −4.0943 to 5.2817
    Aortic aneurysm, dissection 0.3527 −2.8310 to 3.5364
    Femoral-popliteal bypass graft 0.3356 −1.5599 to 2.2311
    Aortoiliac, aortofemoral bypass graft 0.9262 −2.5131 to 4.3654
    Peripheral ischemia (emobolectomy, thrombectomy, dilation) −0.4225 −4.6282 to 3.7832
    Carotid endarterectomy 0.8925 −0.7279 to 2.5129
    Cardiovascular surgery, other 0.1896 −1.5406 to 1.9198
    Respiratory surgery
    Thoracotomy, malignancy 0.9806 −0.7279 to 2.6892
    Neoplasm, mouth, larynx 1.5202 −0.9609 to 4.0013
    Thoracotomy, lung biopsy, pleural disease 4.8600 1.7232 to 7.9968
    Thoracotomy, respiratory infection 0.2357 −2.3060 to 2.7774
    Respiratory surgery, other 1.7429 −0.0452 to 3.5310
    GI surgery
    GI malignancy 1.7652 0.0896 to 3.4409
    GI bleeding 0.8628 −1.4034 to 3.1291
    Fistula, abscess −0.8891 −3.8190 to 2.0408
    Cholecystitis, cholangitis −0.0360 −2.0664 to 1.9945
    GI inflammation 1.8150 −0.9391 to 4.5692
    GI obstruction 0.1693 −1.6523 to 1.9909
    GI perforation 2.5490 0.6072 to 4.4909
    GI vascular ischemia 5.2939 1.8331 to 8.7548
    Liver transplant −3.1338 −7.3945 to 1.1270
    GI surgery, other 0.0103 −1.5726 to 1.5932
    Neurologic surgery
    Craniotomy or transsphenoidal procedure for neoplasm 0.7337 −0.8877 to 2.3552
    Intracranial hemorrhage 1.8154 −1.0389 to 4.6697
    Subarachnoid hemorrhage, intracranial aneurysm 2.9454 1.6706 to 4.2203
    Subdural/epidural hematoma 0.4392 −1.0542 to 1.9326
    Laminectomy, fusion, spinal cord surgery 0.7094 −1.0999 to 2.5188
    Neurologic surgery, other 0.6249 −1.1605 to 2.4102
    Genitourinary surgery
    Renal/bladder/prostate neoplasm 0.2622 −3.2134 to 2.4526
    Renal transplant −2.0178 −7.3254 to 3.2897
    Hysterectomy −0.1985 −3.2134 to 2.8164
    Genitourinary surgery, other 0.4942 −1.9733 to 2.9617
    Miscellaneous surgery
    Amputation, nontraumatic −0.3057 −9.2631 to 8.6516
Intercept 2.2550 −4.4486 to 8.9587

Knot = numerical cut point for each splined variable; APS = acute physiology score; Fio2 = fraction of inspired oxygen; GCS = Glasgow coma scale; AMI = acute myocardial infarction; HHNC = hyperglycemic hyperosmolar nonketotic coma. See Table 1 for abbreviations not used in the text.

Footnotes

Dr. Vasilevskis had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. Responsibility for areas of the study were as follows: study concept and design: Drs. Vasilevskis, Kuzniewicz, and Dudley; acquisition of data: Drs. Kuzniewicz, Cason, Lane, and Dudley, and Ms. Dean; analysis and interpretation of data: Drs. Vasilevskis, Kuzniewicz, Cason, Lane, Vittinghoff, and Dudley, Ms. Dean, Mr. Clay, and Ms. Rennie; drafting of the manuscript: Drs. Vasilevskis and Dudley; critical revision of the manuscript for important intellectual content: Drs. Vasilevskis, Kuzniewicz, Cason, Lane, Vittinghoff, and Dudley, Ms. Dean, Mr. Clay, and Ms. Rennie; statistical analysis: Drs. Vasilevskis and Vittinghoff, and Mr. Clay; obtained funding: Dr. Dudley; administrative, technical, or material support: Drs. Cason and Lane, Ms. Dean, and Ms. Rennie; and study supervision: Ms. Dean and Dr. Dudley.

The views expressed in this article are those of the authors and do not necessarily represent the views of the US Department of Veterans Affairs.

This work was supported by the California Office of Statewide Health Planning and Development and the Agency for Healthcare Research and Quality (R01 HS13919-01). Dr. Dudley's work was also supported by an Investigator Award in Health Policy from the Robert Wood Johnson Foundation. Dr. Vasilevskis was supported by a Ruth L. Kirschstein National Research Service Award institutional research training grant T32, the Veterans Affairs Clinical Research Center of Excellence, and the Geriatric Research Education and Clinical Center, Veterans Affairs, Tennessee Valley Healthcare, Nashville, TN.

The authors have reported to the ACCP that no significant conflicts of interest exist with any companies/organizations whose products or services may be discussed in this article.

Reproduction of this article is prohibited without written permission from the American College of Chest Physicians (www.chestjournal.org/site/misc/reprints.xhtml).

References

  • 1.Halpern NA, Pastores SM, Greenstein RJ. Critical care medicine in the United States 1985–2000: an analysis of bed numbers, use, and costs. Crit Care Med. 2004;32:1254–1259. doi: 10.1097/01.ccm.0000128577.31689.4c. [DOI] [PubMed] [Google Scholar]
  • 2.Rapoport J, Teres D, Lemeshow S, et al. Explaining variability of cost using a severity-of-illness measure for ICU patients. Med Care. 1990;28:338–348. doi: 10.1097/00005650-199004000-00005. [DOI] [PubMed] [Google Scholar]
  • 3.Rapoport J, Teres D, Lemeshow S, et al. A method for assessing the clinical performance and cost-effectiveness of intensive care units: a multicenter inception cohort study. Crit Care Med. 1994;22:1385–1391. doi: 10.1097/00003246-199409000-00006. [DOI] [PubMed] [Google Scholar]
  • 4.Render ML, Kim HM, Deddens J, et al. Variation in outcomes in Veterans Affairs intensive care units with a computerized severity measure. Crit Care Med. 2005;33:930–939. doi: 10.1097/01.ccm.0000162497.86229.e9. [DOI] [PubMed] [Google Scholar]
  • 5.Rosenthal GE, Harper DL, Quinn LM, et al. Severity-adjusted mortality and length of stay in teaching and nonteaching hospitals: results of a regional study. JAMA. 1997;278:485–490. [PubMed] [Google Scholar]
  • 6.Woods AW, MacKirdy FN, Livingston BM, et al. Evaluation of predicted and actual length of stay in 22 Scottish intensive care units using the APACHE III system. Anaesthesia. 2000;55:1058–1065. doi: 10.1046/j.1365-2044.2000.01552.x. [DOI] [PubMed] [Google Scholar]
  • 7.Lilly CM, Sonna LA, Haley KJ, et al. Intensive communication: four-year follow-up from a clinical practice study. Crit Care Med. 2003;31(suppl):S394–S399. doi: 10.1097/01.CCM.0000065279.77449.B4. [DOI] [PubMed] [Google Scholar]
  • 8.Pronovost PJ, Angus DC, Dorman T, et al. Physician staffing patterns and clinical outcomes in critically ill patients: a systematic review. JAMA. 2002;288:2151–2162. doi: 10.1001/jama.288.17.2151. [DOI] [PubMed] [Google Scholar]
  • 9.Pronovost PJ, Jenckes MW, Dorman T, et al. Organizational characteristics of intensive care units related to outcomes of abdominal aortic surgery. JAMA. 1999;281:1310–1317. doi: 10.1001/jama.281.14.1310. [DOI] [PubMed] [Google Scholar]
  • 10.Acute Respiratory Distress Syndrome Network. Ventilation with lower tidal volumes as compared with traditional tidal volumes for acute lung injury and the acute respiratory distress syndrome. N Engl J Med. 2000;342:1301–1308. doi: 10.1056/NEJM200005043421801. [DOI] [PubMed] [Google Scholar]
  • 11.Gupta N, Kotler PL, Dudley RA. Considerations in the development of intensive care unit report cards. J Intensive Care Med. 2002;17:211–217. [Google Scholar]
  • 12.Joint Commission on Accreditation of Healthcare Organizations. National hospital quality measures: ICU. [Accessed May 18, 2009]. Available at: http://www.jointcommission.org/PerformanceMeasurement/MeasureReserveLibrary/Spec+Manual+-+ICU.htm.
  • 13.Hospital Association of Southern California. Quality/patient safety resources: 2008 CHART hospital performance measures. [Accessed June 4, 2009]. Available at: http://www.hasc.org/download.cfm?ID=28358.
  • 14.Knaus WA, Wagner DP, Zimmerman JE, et al. Variations in mortality and length of stay in intensive care units. Ann Intern Med. 1993;118:753–761. doi: 10.7326/0003-4819-118-10-199305150-00001. [DOI] [PubMed] [Google Scholar]
  • 15.Zimmerman JE, Kramer AA, McNair DS, et al. Intensive care unit length of stay: benchmarking based on Acute Physiology and Chronic Health Evaluation IV. Crit Care Med. 2006;34:2517–2529. doi: 10.1097/01.CCM.0000240233.01711.D9. [DOI] [PubMed] [Google Scholar]
  • 16.Higgins TL, Teres D, Copes WS, et al. Assessing contemporary intensive care unit outcome: an updated Mortality Probability Admission Model (MPM0-III) Crit Care Med. 2007;35:827–835. doi: 10.1097/01.CCM.0000257337.63529.9F. [DOI] [PubMed] [Google Scholar]
  • 17.Le Gall JR, Lemeshow S, Saulnier F. A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study. JAMA. 1993;270:2957–2963. doi: 10.1001/jama.270.24.2957. [DOI] [PubMed] [Google Scholar]
  • 18.Tri-Analytics, Inc. Project IMPACT CCM's Critical Care Data Systems. [Accessed June 4, 2009]. Available at: http://www.trianalytics.com/programs_pi.html.
  • 19.California HealthCare Foundation. Rating hospital quality in California, 2008. [Accessed May 18, 2009]. Available at: http://www.calhospitalcompare.org.
  • 20.Kuzniewicz MW, Vasilevskis EE, Lane R, et al. Variation in ICU risk-adjusted mortality: impact of methods of assessment and potential confounders. Chest. 2008;133:1319–1327. doi: 10.1378/chest.07-3061. [DOI] [PubMed] [Google Scholar]
  • 21.Young JD, Goldfrad C, Rowan K. Development and testing of a hierarchical method to code the reason for admission to intensive care units: the ICNARC coding method; Intensive Care National Audit & Research Centre. Br J Anaesth. 2001;87:543–548. doi: 10.1093/bja/87.4.543. [DOI] [PubMed] [Google Scholar]
  • 22.American Hospital Association. AHA Annual Survey Database. 2004 ed. Chicago, IL: American Hospital Association; 2004. [Google Scholar]
  • 23.Steyerberg EW, Bleeker SE, Moll HA, et al. Internal and external validation of predictive models: a simulation study of bias and precision in small samples. J Clin Epidemiol. 2003;56:441–447. doi: 10.1016/s0895-4356(03)00047-7. [DOI] [PubMed] [Google Scholar]
  • 24.van Houwelingen HC. Validation, calibration, revision and combination of prognostic survival models. Stat Med. 2000;19:3401–3415. doi: 10.1002/1097-0258(20001230)19:24<3401::aid-sim554>3.0.co;2-2. [DOI] [PubMed] [Google Scholar]
  • 25.Fisher ES, Wennberg DE, Stukel TA, et al. The implications of regional variations in Medicare spending: part 1. The content, quality, and accessibility of care. Ann Intern Med. 2003;138:273–287. doi: 10.7326/0003-4819-138-4-200302180-00006. [DOI] [PubMed] [Google Scholar]
  • 26.Wennberg JE, Fisher ES, Stukel TA, et al. Use of hospitals, physician visits, and hospice care during last six months of life among cohorts loyal to highly respected hospitals in the United States. BMJ. 2004;328:607. doi: 10.1136/bmj.328.7440.607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.US Census Bureau. United States Census 2000, migration by race and Hispanic origin for the population 5 years and over for the United States, regions, states, and Puerto Rico: 2000 (PHC-T-25); 2008. [Accessed May 18, 2009]. Available at: http://www.census.gov/population/www/cen2000/briefs/phc-t25/tables/tab01.pdf.
  • 28.Nelson DE, Bolen J, Wells HE, et al. State trends in uninsurance among individuals aged 18 to 64 years: United States, 1992–2001. Am J Public Health. 2004;94:1992–1997. doi: 10.2105/ajph.94.11.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Burwen DR, Galusha DH, Lewis JM, et al. National and state trends in quality of care for acute myocardial infarction between 1994–1995 and 1998–1999: the Medicare health care quality improvement program. Arch Intern Med. 2003;163:1430–1439. doi: 10.1001/archinte.163.12.1430. [DOI] [PubMed] [Google Scholar]
  • 30.Jencks SF, Cuerdon T, Burwen DR, et al. Quality of medical care delivered to Medicare beneficiaries: a profile at state and national levels. JAMA. 2000;284:1670–1676. doi: 10.1001/jama.284.13.1670. [DOI] [PubMed] [Google Scholar]
  • 31.Fieller EC. A fundamental formula in the statistics of biological assay, and some applications. Q J Pharm Pharmacol. 1944;17:117–123. [Google Scholar]
  • 32.Nathanson BH, Higgins TL, Teres D, et al. A revised method to assess intensive care unit clinical performance and resource utilization. Crit Care Med. 2007;35:1853–1862. doi: 10.1097/01.CCM.0000275272.57237.53. [DOI] [PubMed] [Google Scholar]
  • 33.Angus DC, Linde-Zwirble WT, Sirio CA, et al. The effect of managed care on ICU length of stay: implications for Medicare. JAMA. 1996;276:1075–1082. [PubMed] [Google Scholar]
  • 34.Jayes RL, Zimmerman JE, Wagner DP, et al. Variations in the use of do-not-resuscitate orders in ICUs: findings from a national study. Chest. 1996;110:1332–1339. doi: 10.1378/chest.110.5.1332. [DOI] [PubMed] [Google Scholar]
  • 35.Arabi Y, Venkatesh S, Haddad S, et al. The characteristics of very short stay ICU admissions and implications for optimizing ICU resource utilization: the Saudi experience. Int J Qual Health Care. 2004;16:149–155. doi: 10.1093/intqhc/mzh025. [DOI] [PubMed] [Google Scholar]
  • 36.Rosenthal GE, Sirio CA, Shepardson LB, et al. Use of intensive care units for patients with low severity of illness. Arch Intern Med. 1998;158:1144–1151. doi: 10.1001/archinte.158.10.1144. [DOI] [PubMed] [Google Scholar]
  • 37.Ash J, Gorman P, Seshadri V, et al. Computerized physician order entry in U.S. hospitals: results of a 2002 survey. J Am Med Inform Assoc. 2004;11:95–99. doi: 10.1197/jamia.M1427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Poon E, Jha A, Christino M, et al. Assessing the level of healthcare information technology adoption in the United States: a snapshot. BMC Med Inform Decis Mak. 2006;6:1. doi: 10.1186/1472-6947-6-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Moreno R, Metnitz P, Almeida E, et al. From evaluation of the patient to evaluation of the intensive care unit: part 2. Development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med. 2005;31:1345–1355. doi: 10.1007/s00134-005-2763-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Mant J, Hicks N. Detecting differences in quality of care: the sensitivity of measures of process and outcome in treating acute myocardial infarction. BMJ. 1995;311:793–796. doi: 10.1136/bmj.311.7008.793. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wagner DP, Knaus WA, Harrell FE, et al. Daily prognostic estimates for critically ill adults in intensive care units: results from a prospective, multicenter, inception cohort analysis. Crit Care Med. 1994;22:1359–1372. doi: 10.1097/00003246-199409000-00004. [DOI] [PubMed] [Google Scholar]

Articles from Chest are provided here courtesy of American College of Chest Physicians

RESOURCES