Table 2.
Comparison of model performance metrics.
Model | AUROCa | Recall | Specificity | Precision | AUPRCb |
ICD code vectorization + LSTMc | 0.812 | 0.646 | 0.812 | 0.192 | 0.462 |
Patient level vectorization + XGBoostd | 0.790 | 0.670 | 0.758 | 0.161 | 0.358 |
Ensemble | 0.818 | 0.693 | 0.777 | 0.177 | 0.463 |
Baseline (age, sex) | 0.667 | 0.787 | 0.416 | 0.0855 | 0.119 |
Baseline (age, sex, diagnosis count) | 0.668 | 0.547 | 0.707 | 0.114 | 0.130 |
aAUROC: area under the receiver operating characteristics curve.
bAUPRC: area under the precision-recall curve.
cLSTM: long short-term memory.
dXGBoost: extreme gradient boosting.