[Preprint]. 2024 Nov 6:rs.3.rs-5046441. Originally published 2024 Sep 9. [Version 2] doi: 10.21203/rs.3.rs-5046441/v2

Table 3:

Average performance and [95% confidence intervals] for logistic regression model using all features in the different testing sets

Training set	Testing set	Accuracy	Specificity	F1-score	Recall	Precision	AUROC	AUPRC
MGH+BIDMC	BIDMC	0.86 [0.84–0.87]	0.86 [0.83–0.87]	0.84 [0.82–0.86]	0.85 [0.83–0.88]	0.82 [0.80–0.84]	0.92 [0.91–0.94]	0.91 [0.90–0.93]
MGH	BIDMC	0.90 [0.89–0.92]	0.92 [0.91–0.93]	0.83 [0.81–0.85]	0.88 [0.86–0.90]	0.79 [0.77–0.83]	0.95 [0.93–0.96]	0.90 [0.89–0.92]
BIDMC	MGH	0.94 [0.94–0.95]	0.96 [0.96–0.97]	0.91 [0.89–0.92]	0.90 [0.88–0.91]	0.91 [0.91–0.93]	0.98 [0.97–0.98]	0.98 [0.97–0.98]
MGH+BIDMC	MGH	0.93 [0.92–0.95]	0.96 [0.95–0.98]	0.93 [0.92–0.95]	0.91 [0.88–0.92]	0.96 [0.94–0.99]	0.98 [0.97–0.99]	0.98 [0.98–0.99]
MGH+BIDMC	MGH+BIDMC	0.89 [0.88–0.90]	0.90 [0.89–0.92]	0.88 [0.87–0.90]	0.88 [0.86–0.90]	0.88 [0.88–0.91]	0.95 [0.94–0.96]	0.95 [0.94–0.96]

The bootstrapping results in 95% confidence intervals are in parenthesis. ACC - accuracy, Spec - specificity, AP - average precision, AUROC - Area under the receiver operating characteristic curve, AUPRC - area under the precision-recall curve. Data Sets: MGH: Data derived from Massachusetts General Hospital. BI: Data derived from Beth Israel Deaconess Medical Center. MGH+ BIDMC: Data derived from Massachusetts General Hospital and Beth Israel Deaconess Medical Center.