Table 4.
Comparison of the hazard regression model with repeated measurements and the deep learning model with repeated measurements using validation datasets from the National Health Insurance Service-Health Screening Cohort (40-79 years of age).
| Performance index | All-cause dementia | Alzheimer dementia | |
| DL-Ra,b versus HR-Rc | DL-Ra versus HR-R | ||
| Discrimination |
|
|
|
|
|
Difference between AUCsd | 0.034 (0.029-0.039)e | 0.024 (0.018-0.031)e |
|
|
Absolute IDIf | 0.334 | 0.423 |
|
|
Relative IDI | 3.200 | 5.351 |
| Reclassification |
|
|
|
|
|
Patients move to higher, n (%) | 4163 (76.30) | 2163 (86.42) |
|
|
Patients move to lower, n (%) | 0 (0.00) | 0 (0.00) |
|
|
Controls move to higher, n (%) | 17,664 (19.52) | 19,231 (21.25) |
|
|
Controls move to lower, n (%) | 0 (0.00) | 0 (0.00) |
|
|
NRIg (%) | 56.79h | 65.17h |
aNRIs were calculated to determine the improvement in the performance of each model to identify individuals whose risk of dementia was more than 50%.
bDL-R: deep learning model with repeated measurements.
cHR-R: hazard regression model with repeated measurements.
dAUC: area under the receiver operating characteristic curve.
eDifference between AUCs was significant with P<.001.
fIDI: integrated discrimination improvement.
gNRI: net reclassification improvement.
hNRI was significant with P<.001.