Table 3.
Test data set for prediction of 30-day readmission (n = 44 705)
Feature | ROC AUC (95% CI) | F1 Score | Precision/PPV (95% CI) | Recall/Sensitivity (95% CI) | Specificity (95% CI) | NPV (95% CI) |
---|---|---|---|---|---|---|
SNOMED | 0.75 (0.74–0.75) | 0.67 | 0.20 (0.19–0.20) | 0.70 (0.68–0.71) | 0.67 (0.67–0.68) | 0.95 (0.95–0.95) |
ICD | 0.72 (0.72–0.73) | 0.65 | 0.19 (0.18–0.19) | 0.69 (0.67–0.70) | 0.65 (0.64–0.65) | 0.95 (0.94–0.95) |
LOINC | 0.74 (0.73–0.74) | 0.66 | 0.19 (0.19–0.20) | 0.68 (0.67–0.70) | 0.67 (0.66–0.67) | 0.95 (0.94–0.95) |
RxNORM | 0.72 (0.71–0.73) | 0.66 | 0.19 (0.18–0.19) | 0.66 (0.65–0.68) | 0.66 (0.66–0.67) | 0.94 (0.94–0.95) |
MeSH | 0.75 (0.74–0.75) | 0.68 | 0.20 (0.19–0.21) | 0.68 (0.67–0.69) | 0.68 (0.68–0.69) | 0.95 (0.95–0.95) |
SNOMED + ICD + LOINC + RxNORM + MeSH | 0.75 (0.74–0.76) | 0.68 | 0.19 (0.18–0.20) | 0.70 (0.68–0.71) | 0.70 (0.67–0.68) | 0.95 (0.95–0.96) |
n-gram (raw text) | 0.75 (0.74–0.76) | 0.74 | 0.23 (0.22–0.23) | 0.57 (0.57–0.60) | 0.77 (0.76–0.77) | 0.94 (0.94–0.94) |
The test data set covers the period from June 11, 2015 to September 30, 2017. Both unigrams and bigrams of the raw text were examined; the n-gram results shown are for unigrams, which performed better.
Abbreviations: CI, confidence interval; F1 score, micro F1; ICD, international classification of diseases; LOINC, logical observation identifiers names and codes; MeSH, medical subject headings; NPV, negative predictive value; PPV, positive predictive value (precision); ROC AUC, receiver operating characteristic area under the curve; RxNORM, standard names given to clinical drugs and drug delivery devices; SNOMED CT, systematized nomenclature of medicine, clinical terms.
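
The threshold-based columns in Table 3 all follow from a 2×2 confusion matrix. The footnote labels the F1 column as micro F1. If that means micro-averaging over both the readmitted and non-readmitted classes (an assumption; the source does not spell out the computation), micro F1 reduces to overall accuracy. That would be consistent with F1 values near 0.67 alongside a precision near 0.20. The sketch below illustrates the relationships. The function name `binary_metrics` and the counts are hypothetical, chosen only to mimic a low-prevalence outcome, and are not taken from the study.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute Table 3 style metrics from confusion-matrix counts.

    tp/fp/fn/tn are counts for the positive (readmitted) class.
    """
    precision = tp / (tp + fp)        # PPV
    recall = tp / (tp + fn)           # sensitivity
    specificity = tn / (tn + fp)
    npv = tn / (tn + fn)
    accuracy = (tp + tn) / (tp + fp + fn + tn)

    # F1 for the positive class only (harmonic mean of precision and recall).
    positive_f1 = 2 * precision * recall / (precision + recall)

    # Micro-averaged F1 pools counts over both classes. For the negative
    # class, TP = tn, FP = fn, FN = fp, so pooled FP and pooled FN are equal
    # and micro precision = micro recall = accuracy.
    pooled_tp = tp + tn
    pooled_fp = fp + fn
    pooled_fn = fn + fp
    micro_p = pooled_tp / (pooled_tp + pooled_fp)
    micro_r = pooled_tp / (pooled_tp + pooled_fn)
    micro_f1 = 2 * micro_p * micro_r / (micro_p + micro_r)

    return {
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "npv": npv,
        "accuracy": accuracy,
        "positive_f1": positive_f1,
        "micro_f1": micro_f1,
    }


if __name__ == "__main__":
    # Hypothetical counts (NOT from the study): 100 positives, 900 negatives.
    metrics = binary_metrics(tp=70, fp=300, fn=30, tn=600)
    for name, value in metrics.items():
        print(f"{name}: {value:.2f}")
```

With these illustrative counts, micro F1 matches accuracy while the positive-class F1 is much lower, which is the pattern to expect whenever precision is low and negatives dominate.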