Table 3:
Training set | Testing set | Accuracy | Specificity | F1-score | Recall | Precision | AUROC | AUPRC |
---|---|---|---|---|---|---|---|---|
MGH+BIDMC | BIDMC | 0.86 [0.84–0.87] | 0.86 [0.83–0.87] | 0.84 [0.82–0.86] | 0.85 [0.83–0.88] | 0.82 [0.80–0.84] | 0.92 [0.91–0.94] | 0.91 [0.90–0.93] |
MGH | BIDMC | 0.90 [0.89–0.92] | 0.92 [0.91–0.93] | 0.83 [0.81–0.85] | 0.88 [0.86–0.90] | 0.79 [0.77–0.83] | 0.95 [0.93–0.96] | 0.90 [0.89–0.92] |
BIDMC | MGH | 0.94 [0.94–0.95] | 0.96 [0.96–0.97] | 0.91 [0.89–0.92] | 0.90 [0.88–0.91] | 0.91 [0.91–0.93] | 0.98 [0.97–0.98] | 0.98 [0.97–0.98] |
MGH+BIDMC | MGH | 0.93 [0.92–0.95] | 0.96 [0.95–0.98] | 0.93 [0.92–0.95] | 0.91 [0.88–0.92] | 0.96 [0.94–0.99] | 0.98 [0.97–0.99] | 0.98 [0.98–0.99] |
MGH+BIDMC | MGH+BIDMC | 0.89 [0.88–0.90] | 0.90 [0.89–0.92] | 0.88 [0.87–0.90] | 0.88 [0.86–0.90] | 0.88 [0.88–0.91] | 0.95 [0.94–0.96] | 0.95 [0.94–0.96] |
Values are point estimates with bootstrapped 95% confidence intervals in brackets. ACC: accuracy; Spec: specificity; AP: average precision; AUROC: area under the receiver operating characteristic curve; AUPRC: area under the precision-recall curve. Data sets: MGH, data derived from Massachusetts General Hospital; BIDMC, data derived from Beth Israel Deaconess Medical Center; MGH+BIDMC, data derived from both Massachusetts General Hospital and Beth Israel Deaconess Medical Center.
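The table reports bootstrapped 95% confidence intervals but does not say which bootstrap variant or how many resamples were used. As an illustration only, the sketch below assumes a percentile bootstrap over test-set samples; the function names `accuracy` and `bootstrap_ci`, the default of 1000 resamples, and the fixed seed are hypothetical choices, not taken from the source.

```python
import random


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def bootstrap_ci(y_true, y_pred, metric=accuracy, n_resamples=1000,
                 alpha=0.05, seed=0):
    """Return (point estimate, lower, upper) using a percentile bootstrap.

    Assumption: test samples are resampled with replacement, the metric is
    recomputed on each resample, and the interval is read off the empirical
    alpha/2 and 1 - alpha/2 quantiles of the resampled metric values.
    """
    rng = random.Random(seed)
    n = len(y_true)
    stats = []
    for _ in range(n_resamples):
        # Draw n indices with replacement and score the resampled test set.
        idx = [rng.randrange(n) for _ in range(n)]
        stats.append(metric([y_true[i] for i in idx],
                            [y_pred[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_resamples)]
    upper = stats[min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))]
    return metric(y_true, y_pred), lower, upper
```

Calling `bootstrap_ci(labels, predictions)` on a test set would give a triple that can be formatted in the same way as the table cells, point estimate followed by the interval in brackets. Other metrics, such as specificity or F1-score, could be passed through the `metric` argument.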