Table 2.
Comparison of model performance metrics.
| Model | AUROC<sup>a</sup> | Recall | Specificity | Precision | AUPRC<sup>b</sup> |
|---|---|---|---|---|---|
| ICD code vectorization + LSTM<sup>c</sup> | 0.812 | 0.646 | 0.812 | 0.192 | 0.462 |
| Patient level vectorization + XGBoost<sup>d</sup> | 0.790 | 0.670 | 0.758 | 0.161 | 0.358 |
| Ensemble | 0.818 | 0.693 | 0.777 | 0.177 | 0.463 |
| Baseline (age, sex) | 0.667 | 0.787 | 0.416 | 0.0855 | 0.119 |
| Baseline (age, sex, diagnosis count) | 0.668 | 0.547 | 0.707 | 0.114 | 0.130 |
<sup>a</sup>AUROC: area under the receiver operating characteristic curve.
<sup>b</sup>AUPRC: area under the precision-recall curve.
<sup>c</sup>LSTM: long short-term memory.
<sup>d</sup>XGBoost: extreme gradient boosting.
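
Recall, specificity, and precision in Table 2 are linked through the prevalence of the positive class, which helps explain why precision stays low even where AUROC exceeds 0.8. The sketch below is illustrative only and is not part of the original analysis. It assumes that the three metrics in each row were computed at the same decision threshold on the same test set; the function name `implied_prevalence` and the short row labels are hypothetical. It inverts the identity precision = r·p / (r·p + (1 − s)·(1 − p)), where r is recall, s is specificity, and p is prevalence.

```python
def implied_prevalence(recall: float, specificity: float, precision: float) -> float:
    """Positive-class prevalence implied by recall, specificity, and precision.

    Rearranging precision = r*p / (r*p + (1 - s)*(1 - p)) gives the odds
    p / (1 - p) = precision * (1 - s) / (r * (1 - precision)).
    """
    odds = precision * (1.0 - specificity) / (recall * (1.0 - precision))
    return odds / (1.0 + odds)


# (recall, specificity, precision) copied from Table 2; labels are shortened.
rows = {
    "LSTM": (0.646, 0.812, 0.192),
    "XGBoost": (0.670, 0.758, 0.161),
    "Ensemble": (0.693, 0.777, 0.177),
    "Baseline (age, sex)": (0.787, 0.416, 0.0855),
    "Baseline (age, sex, diagnosis count)": (0.547, 0.707, 0.114),
}

# Implied prevalence per row; similar values across rows suggest a shared test set.
prevalences = {name: implied_prevalence(*metrics) for name, metrics in rows.items()}

for name, p in prevalences.items():
    print(f"{name}: implied prevalence = {p:.3f}")
```

Under these assumptions, every row implies a prevalence of roughly 0.065, so the reported values appear mutually consistent, and at such a low base rate even a model with good discrimination would be expected to show modest precision.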