Table 2.
Massachusetts General Hospital
(N=4,166) |
Brigham and Women’s Hospital
(N=37,963) |
UK Biobank
(N=41,033) |
|||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Model | Hazard ratio (per 1-SD) | 5-year AUROC | 5-year average precision | Calibration slope | ICI‡ | Hazard ratio (per 1-SD) | 5-year AUROC | 5-year average precision | Calibration slope | ICI | Hazard ratio (per 1-SD) | 2-year AUROC | 2-year average precision | Calibration slope | ICI |
Deep learning architectures | |||||||||||||||
ECG-AI | - | 0.823* (0.790–0.856) |
0.27 (0.21–0.34) |
- | 0.0231 | - | 0.747* (0.736–0.759) |
0.19*† (0.17–0.20) |
- | 0.0124 | - | 0.705 (0.659–0.724) |
0.060* (0.043–0.087) |
- | 0.0768 |
Cox proportional hazards models | |||||||||||||||
Age and sex | 2.91 (2.44–3.47) |
0.768 (0.732–0.805) |
0.16 (0.13–0.20) |
1.05 (0.88–1.23) |
0.0074 | 2.48 (2.35–2.62) |
0.730 (0.717–0.743) |
0.14 (0.13–0.15) |
0.94 (0.88–1.00) |
0.0072 | 2.21 (1.96–2.50) |
0.728 (0.702–0.755) |
0.018 (0.015–0.024) |
1.48 (1.25–1.71) |
0.0019§ |
CHARGE-AF | 3.36 (2.98–4.30) |
0.802* (0.767–0.836) |
0.21* (0.17–0.26) |
0.68 (0.58–0.77) |
0.0320 | 2.78 (2.63–2.94) |
0.752* (0.741–0.763) |
0.17* (0.15–0.18) |
0.57 (0.53–0.60) |
0.0344 | 2.26 (2.00–2.55) |
0.732 (0.704–0.759) |
0.020 (0.016–0.026) |
0.87 (0.75–1.00) |
0.0011§ |
ECG-AI | 2.45 (2.23–2.69) |
0.823* (0.790–0.856) |
0.27* (0.21–0.34) |
1.06 (0.95–1.17) |
0.0212 | 2.05 (1.98–2.11) |
0.747* (0.736–0.759) |
0.19*† (0.17–0.20) |
0.81 (0.77–0.84) |
0.0129 | 2.01 (1.88–2.14) |
0.705 (0.673–0.737) |
0.060*† (0.044–0.090) |
0.75 (0.68–0.82) |
0.0035§ |
CH-AI | 3.74 (3.24–4.33) |
0.838*† (0.807–0.869) |
0.30*† (0.24–0.38) |
1.13 (1.01–1.25) |
0.0120 | 2.76 (2.64–2.88) |
0.777*† (0.766–0.788) |
0.21*† (0.19–0.23) |
0.77 (0.74–0.81) |
0.0108 | 2.27 (2.11–2.44) |
0.746 (0.716–0.776) |
0.059*† (0.042–0.083) |
1.01 (0.92–1.10) |
0.0001§ |
p<0.05 for comparison against age and sex
p<0.05 for comparison against CHARGE-AF
Integrated calibration index (ICI), a quantitative measure of the average difference between predicted event risk and observed event incidence, weighted by the empirical distribution of event risk.30 Smaller values indicate better calibration.
Values reflect ICI after recalibration to the baseline 2-year AF risk in the UK Biobank
Difference in c-index for CH-AI vs ECG-AI: AUROC MGH p=NS, BWH p<0.05, UK Biobank p<0.05; average precision MGH p<0.05, BWH p<0.05, p=NS
AUROC = area under the receiver operating characteristic curve; ICI = integrated calibration index; SD = standard deviation