Table 1.
Subgroup # | Phenotype | AUROC [95% Bootstrap CI] | Number of patients |
---|---|---|---|
AFISP 1 | Anemia and nonspecific lung disease | 0.81 [0.78, 0.84] | 562 |
AFISP 2 | Nonspecific lung disease and Hypoxemia | 0.82 [0.77, 0.86] | 310 |
AFISP 3 | Sepsis and acute respiratory failure | 0.83 [0.79, 0.87] | 306 |
AFISP 4 | Sepsis and anemia | 0.85 [0.82, 0.88] | 606 |
AFISP 5 | Acute respiratory failure and no bronchitis | 0.89 [0.88, 0.91] | 1095 |
AFISP 6 | Nonspecific lung disease and hypoxemia | 0.89 [0.88, 0.91] | 1468 |
AFISP 7 | Acute respiratory failure | 0.89 [0.88, 0.91] | 1137 |
AFISP 8 | Sepsis | 0.91 [0.90, 0.93] | 1361 |
AFISP 9 | Hypoxemia | 0.91 [0.90, 0.93] | 1587 |
AFISP 10 | Nonspecific lung disease and not admitted from ED | 0.92 [0.90, 0.94] | 468 |
AFISP 11 | Nonspecific lung disease and no infection | 0.92 [0.90, 0.95] | 422 |
AFISP 12 | Transferred from nursing facility and admitted from ED | 0.93 [0.91, 0.94] | 2196 |
AFISP 13 | Transferred from a nursing facility | 0.93 [0.92, 0.94] | 3035 |
Phenotypes of subgroups found by AFISP in the worst 10% subset of the evaluation dataset, and the performance of the AAM-inspired model on these subgroups. For reference, the full evaluation dataset contained 60,998 patients and the AUROC of the AAM-inspired model on the full dataset was 0.986 [0.985, 0.987]. Confidence intervals computing using 100 bootstrap resamples. All subgroups had a statistically significant difference (at the 0.05 significance level) in performance from the performance threshold (0.944) with all p values less than 1 × 10−4 after correcting for multiple comparisons.