Abstract
Background:
Lack of external validation of dementia risk tools is a major limitation for generalizability and translatability of prediction scores in clinical practice and research.
Objectives:
We aimed to validate a new dementia risk prediction tool, CogDrisk, and a version of it, CogDrisk-AD, for predicting Alzheimer’s disease (AD), using cohort studies.
Design, Setting, Participants and Measurements:
Four cohort studies were identified that included the majority of the dementia risk factors from the CogDrisk tool. Participants who were free of dementia at baseline were included. The predictors are component variables in the CogDrisk tool that include self-reported demographic factors, medical risk factors, and lifestyle habits. Risk scores for Any Dementia and AD were computed and the area under the curve (AUC) was assessed. To examine modifiable risk factors for dementia, the CogDrisk tool was also tested after excluding age and sex estimates from the model.
Results:
The performance of the tool varied between studies. The overall AUC (95% CI) for predicting dementia was 0.77 (0.57, 0.97) for the Swedish National study on Aging and Care in Kungsholmen, 0.76 (0.70, 0.83) for the Health and Retirement Study - Aging, Demographics and Memory Study, 0.70 (0.67, 0.72) for the Cardiovascular Health Study Cognition Study, and 0.66 (0.62, 0.70) for the Rush Memory and Aging Project.
Conclusions:
The CogDrisk and CogDrisk-AD performed well in the four studies. Overall, this tool can be used to assess individualized risk factors of dementia and AD in various population settings.
Keywords: dementia, Alzheimer’s disease, risk tool, accuracy, prediction, ROC curve
1. INTRODUCTION
Currently, over 55 million people live with dementia worldwide, and by 2030 the number of cases is projected to increase to 78 million1. This, along with unsuccessful clinical trials on dementia treatment, has led to urgent calls for dementia prevention. As the evidence on both risk factors and effective risk reduction interventions grows, there is a need to implement findings in policy and practice. Hence, accurate risk assessment tools are needed that clinicians, researchers, policy makers, and the general population can use to support the implementation of dementia risk reduction programs.
Several dementia and Alzheimer’s disease (AD) risk models have been developed in community settings2–5. Systematic reviews6–8 have compared different prediction models based on mid- and late-life age groups, variables included, follow-up time of dementia diagnosis, study setting, and the discriminative accuracy of the tools to predict dementia and its subtypes. However, direct comparisons are limited due to the different methodologies used for tool development and the different target samples and outcomes. Few of these prediction models have been translated into practical tools. A tool for implementing dementia risk reduction in practice should be designed to inform prevention strategies and should not be judged from a purely statistical perspective. Studies reporting high area under the curve (AUC) values for risk tools typically include disease indicators that are not independent of disease onset, such as memory loss, imaging biomarkers, or genetics, which are not modifiable or widely accessible9. In contrast, our aim was to develop a tool that informs preventive action and can be widely applied. Of the available risk assessment tools that similarly focus on providing preventive information, two have been developed from more than one dataset. The Brief Dementia Screening Index (BDSI)10 was developed from four international cohort studies, and the ANU-Alzheimer’s Disease Risk Index (ANU-ADRI)3 was developed using an evidence-based synthesis approach that collated data from various countries and is available online. Two other tools, the Cardiovascular Risk Factors, Aging and Dementia (CAIDE)4 and the Lifestyle for Brain Health (LIBRA)5, were developed from single studies but have been validated on several cohorts and developed into apps. The evidence base has progressed substantially since the development of dementia risk tools such as CAIDE and ANU-ADRI11, 12.
Recently, cardiovascular disease (CVD) risk estimates from SCORE2 (a risk tool for CVD) were shown to predict all-cause dementia in the UK Biobank (Zheng et al 2022)13.
To incorporate the newest evidence on dementia risk factors in a practical tool, we recently developed the CogDrisk tool for predicting dementia, and a version of the tool for specifically predicting AD called CogDrisk-AD14. This tool incorporates risk and protective factors identified through systematic synthesis of the latest evidence base and selects predictors of dementia and AD based on strength of evidence as well as availability of measures that are practicable in a range of clinical and research contexts. To our knowledge, the CogDrisk includes the largest number of modifiable risk factors and incorporates age group and sex differences. Lack of external validation of risk tools is a major limitation for generalizability and translatability of prediction scores in clinical practice and research. Therefore, in this study, we aimed to validate the CogDrisk and CogDrisk-AD tools in four international studies to evaluate how well they predict Any Dementia and AD in different populations. Once validated, the tool may be used by clinicians, the public, and policy makers to assess the level of risk in individuals or communities and to guide specific feedback for dementia risk reduction.
2. METHODS
2.1. Validation cohorts
We shortlisted datasets through database searches, review of consortia, and consultation with experts, and then evaluated them in terms of the outcome measures, i.e., availability of a clinical diagnosis of dementia, inclusion of the majority of risk and protective factors measured in the CogDrisk assessment tool (refer to Supplementary Information 1 Part B), and long follow-up time (average of more than 5 years) for AD and dementia. Four cohorts met our criteria and are briefly described below. We also considered the Framingham Heart Study (FHS), but its participants were much younger at baseline (midlife for FHS versus late life for the other studies) and its baseline assessment took place far earlier (1975 compared with the late 20th to early 21st century). We decided to exclude the FHS so that these differences would not confound comparison of the validation results. Supplementary Information 1 Table A1 describes the study characteristics, the number and age of study participants, the follow-up scheme, and the criteria used for diagnosing dementia and AD.
The Swedish National study on Aging and Care in Kungsholmen (SNAC-K)15:
The SNAC-K study was initiated in 2001–2004 (baseline), and comprised 3,363 participants aged 60 years and above. In our analysis, we have included 3122 participants, where exclusions were due to dementia at baseline (n=241). At baseline, the mean age was 73·6 years and 36·6% of the participants were males.
The Health and Retirement Study - Aging, Demographics and Memory Study (HRS ADAMS)16:
The HRS ADAMS is a supplementary study in the HRS17 that conducted in-person clinical assessments to gather information on cognitive status. The study consisted of 856 community-based individuals aged 70 years and above who were assessed in 2001 (baseline) and followed through to 2008. The mean age of participants at baseline was 81·6 years and 41·5% were males.
The Cardiovascular Health Study Cognition Study (CHS-CS)18:
The CHS-CS was an ancillary study of the main Cardiovascular Health Study. The CHS-CS was initiated in 1991–1994 and was followed up until 1999–2000 with 3,602 community-based participants who had a cerebral MRI and Modified Mini-Mental State Examination (3MSE)19. In our analysis, participants with dementia at baseline were excluded (n=227) leaving 3,375 participants. At baseline, the mean age was 74·8 years and 40·9% were males.
The Rush Memory and Aging Project (MAP)20, 21:
The MAP comprised 2,184 participants aged 60 years and older who undertook the baseline examination in 1997–1998 and were followed annually for up to 22 years at the time of these analyses. The participants’ mean age at baseline was 80·0 years and 26·5% were males.
2.2. Data harmonization and coding of predictors
Data harmonization was carried out across studies using a common scoring system to standardize the measurement of the variables (refer to Supplementary Information 1 Tables B1–B17 for details). Selection of predictors, their definition and coding have been described previously14. The predictors used in this validation study are based on the component variables included in the CogDrisk assessment tool and details are described in the Supplementary Information 1 Part B. Seventeen risk/protective factors were identified for inclusion in the algorithm to estimate the risk of Any Dementia: age, sex, education, mid-life obesity, high cholesterol and hypertension, diabetes, stroke, traumatic brain injury (TBI), atrial fibrillation, insomnia, depression, physical inactivity, cognitive and social engagement, fish intake, and smoking. The CogDrisk-AD tool included similar risk factors, with the omission of atrial fibrillation and insomnia (due to insufficient evidence that these factors are associated with an increased risk of AD) and the addition of pesticide exposure. We could not include pesticide exposure in the validation of the CogDrisk-AD model as this variable was not available in the validation cohorts.
2.3. Statistical analysis
The accuracy of the statistical models for identifying participants at risk of dementia and AD using the CogDrisk score was quantified by calculating the AUC and associated 95% Confidence Interval22. To exclusively check the effect of modifiable risk factors, the predictive accuracy of the proposed CogDrisk score was further evaluated without age and sex for both dementia and AD. We also evaluated the predictive ability of mid-life risk factors on late-life dementia with data included from mid-life participants (40 to 65 years) across all cohorts, where available.
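As a rough illustration of the discrimination measure described above, the following Python sketch computes an AUC and an approximate 95% confidence interval. It is not the study's implementation: the analyses were run in Stata, and the interval method is the one cited in the text. The sketch assumes a rank-based (Mann-Whitney) AUC and swaps in the Hanley-McNeil approximation for the standard error; the function names and toy scores are hypothetical.

```python
import math


def auc_mann_whitney(case_scores, control_scores):
    """AUC as the probability that a randomly chosen case has a higher
    risk score than a randomly chosen control (ties count as 0.5)."""
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


def hanley_mcneil_ci(auc, n_cases, n_controls, z=1.96):
    """Approximate CI for an AUC using the Hanley-McNeil standard error.
    Assumption: this approximation stands in for the paper's cited method."""
    q1 = auc / (2.0 - auc)
    q2 = 2.0 * auc ** 2 / (1.0 + auc)
    variance = (
        auc * (1.0 - auc)
        + (n_cases - 1) * (q1 - auc ** 2)
        + (n_controls - 1) * (q2 - auc ** 2)
    ) / (n_cases * n_controls)
    se = math.sqrt(variance)
    # Clip to the valid AUC range [0, 1].
    return max(0.0, auc - z * se), min(1.0, auc + z * se)


# Toy risk scores (hypothetical, not study data).
cases = [12, 15, 9, 20]        # participants who developed dementia
controls = [5, 9, 11, 3, 7]    # participants who did not

auc = auc_mann_whitney(cases, controls)
lower, upper = hanley_mcneil_ci(auc, len(cases), len(controls))
print(f"AUC = {auc:.3f}, 95% CI ({lower:.3f}, {upper:.3f})")
```

Because the variance is divided by the product of the number of cases and controls, the interval widens sharply when either group is small.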
The CogDrisk score was calculated by adding points allocated to individual risk/protective factors. The methodology for the scoring system has been described previously14. Briefly, risk algorithms were developed for dementia and AD by first converting relative risk ratios (described in detail in the development of CogDrisk manuscript14) to points that were added to form a risk score. Conditional equations were specified for risk factors that only had an effect in mid-life (high cholesterol, obesity and overweight, and hypertension).
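The additive scoring described above can be sketched as follows. This is an illustration only, not the published CogDrisk algorithm: it covers a subset of factors, uses the point values listed in Table 1, and makes three labeled assumptions (the ">90" band is applied from age 90, ages below 60 receive no age points, and "mid-life" means younger than 65 years). All names in the code are hypothetical.

```python
# Point values taken from Table 1; only a subset of factors is included.
AGE_POINTS = {
    # (lower bound of age band, points)
    "male": [(60, 0), (65, 6), (70, 8), (75, 13), (80, 17), (85, 20), (90, 22)],
    "female": [(60, 0), (65, 4), (70, 7), (75, 11), (80, 15), (85, 19), (90, 23)],
}
EDUCATION_POINTS = {"primary": 4, "secondary": 2, "tertiary": 0}
BINARY_POINTS = {
    "diabetes": 2,
    "stroke": 2,
    "tbi": 2,
    "atrial_fibrillation": 2,
    "insomnia": 2,
    "loneliness": 2,
    "current_smoker": 1,
    "physically_active": -3,  # protective factor, so negative points
}
# Factors that the text says only have an effect in mid-life.
MIDLIFE_ONLY_POINTS = {"hypertension": 1, "high_cholesterol": 3}
MIDLIFE_CUTOFF = 65  # assumption: mid-life means age < 65 years


def age_points(sex, age):
    """Points for the highest age band whose lower bound is <= age.
    Assumption: ages under 60 receive 0 points."""
    points = 0
    for lower_bound, band_points in AGE_POINTS[sex]:
        if age >= lower_bound:
            points = band_points
    return points


def illustrative_points_score(person):
    """Sum the points for every factor present in `person` (a dict)."""
    total = age_points(person["sex"], person["age"])
    total += EDUCATION_POINTS[person["education"]]
    for factor, points in BINARY_POINTS.items():
        if person.get(factor, False):
            total += points
    # Conditional rule: mid-life factors count only for mid-life participants.
    if person["age"] < MIDLIFE_CUTOFF:
        for factor, points in MIDLIFE_ONLY_POINTS.items():
            if person.get(factor, False):
                total += points
    return total


older_man = {"sex": "male", "age": 72, "education": "secondary",
             "diabetes": True, "current_smoker": True, "hypertension": True}
print(illustrative_points_score(older_man))  # 8 + 2 + 2 + 1 = 13
```

In this example the man's hypertension contributes nothing because he is past the assumed mid-life cutoff, which is how a conditional equation for a mid-life-only factor behaves.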
Not all risk/protective factors were measured at baseline in the validation cohorts (Table 1). In such cases, measures were taken from visits closest to the baseline under the assumption that the specific characteristics were constant over time (see Supplementary Information 1 Table C1). To assess the impact of missing data we ran two different sensitivity analyses including (i) a reduced model by removing risk/protective factors with a large number of missing observations (ranging from 24% to 79·7%) which improved the sample sizes, and (ii) multiple imputations. To carry out multiple imputations, we excluded participants missing education status or who had missing data for more than five covariates (exclusions: 30 in SNAC-K, five in HRS-ADAMS and CHS-CS, and eight in MAP). We considered twenty imputed datasets (equivalent to the proportion of missing data in most covariates) following fully conditional specifications/imputations by chained equation23. In the multiple imputation models, appropriate covariates were chosen to ensure compatibility between the imputation and analysis model. All analyses were conducted using Stata Statistical Software: Release 16·0 (StataCorp, College Station, Texas, USA).
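The exclusion rule applied before multiple imputation can be expressed as a simple filter. The sketch below is a hypothetical Python rendering of that rule (the study itself used Stata); it assumes missing values are stored as `None` and that education is checked separately from the count of missing covariates, which the text does not specify.

```python
def eligible_for_imputation(record, covariates, max_missing=5):
    """Return True if a participant is kept for multiple imputation.

    Excluded: participants missing education status, or with more than
    `max_missing` missing covariates. Missing values are assumed to be None.
    """
    if record.get("education") is None:
        return False
    n_missing = sum(1 for name in covariates if record.get(name) is None)
    return n_missing <= max_missing


# Hypothetical covariate list and participants (not study data).
covariates = ["diabetes", "stroke", "tbi", "depression",
              "insomnia", "loneliness", "smoking"]

complete = {"education": "tertiary", "diabetes": 0, "stroke": 0, "tbi": 0,
            "depression": 1, "insomnia": 0, "loneliness": 0, "smoking": 0}
no_education = dict(complete, education=None)
sparse = {"education": "primary", "diabetes": 1}  # six covariates missing

print([eligible_for_imputation(p, covariates)
       for p in (complete, no_education, sparse)])
```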
Table 1:
Descriptive statistics for the four evaluation cohorts, their measured risk and protective factors and the points allocated to each factor on CogDrisk.
Characteristic | SNAC-K | HRS ADAMS | CHS-CS | MAP | Points |
---|---|---|---|---|---|
Sample size (n) | 3122 | 856 | 3375 | 2184 | |
Mean age at baseline in years, (SD) | 73·6 (10·7) | 81·6 (7·1) | 74·8 (4·9) | 80·0 (7·6) | |
Age range, years | 60–104 | 70–110 | 65–97 | 54–100 | |
Males, n (%) | 1144 (36·6) | 355 (41·5) | 1381 (40·9) | 578 (26·5) | |
Age of males (years), n (%) | |||||
<60 | --- | --- | --- | 3 (0·1) | - |
60–64 | 330 (28·9) | --- | --- | 14 (0·6) | 0 |
65–69 | 238 (20·8) | --- | 130 (9·4) | 35 (1·6) | 6 |
70–74 | 186 (16·3) | 85 (9·9) | 610 (44·2) | 61 (2·8) | 8 |
75–79 | 147 (12·9) | 89 (10·4) | 391 (28·3) | 123 (5·6) | 13 |
80–84 | 134 (11·7) | 92 (10·8) | 181 (13·1) | 182 (8·3) | 17 |
85–89 | 39 (3·4) | 52 (6·1) | 55 (4·0) | 113 (5·2) | 20 |
>90 | 70 (6·1) | 37 (4·3) | 14 (1·0) | 47 (2·2) | 22 |
Age of females (years), n (%) | |||||
<60 | --- | --- | --- | 20 (0·9) | - |
60–64 | 409 (20·7) | --- | --- | 43 (2·0) | 0 |
65–69 | 326 (16·5) | --- | 239 (12·0) | 127 (5·8) | 4 |
70–74 | 280 (14·2) | 81 (9·5) | 890 (44·6) | 204 (9·3) | 7 |
75–79 | 301 (15·2) | 99 (11·6) | 521 (26·1) | 370 (16·9) | 11 |
80–84 | 280 (14·2) | 126 (14·7) | 267 (13·4) | 432 (19·8) | 15 |
85–89 | 112 (5·7) | 97 (11·3) | 69 (3·5) | 280 (12·8) | 19 |
>90 | 270 (13·7) | 98 (11·5) | 8 (0·4) | 130 (6·0) | 23 |
Education level, n (%) | |||||
Primary | 802 (25·7) | 291 (34·0) | 368 (10·9) | 70 (3·2) | 4 |
Secondary | 1235 (39·6) | 347 (40·5) | 442 (13·1) | 87 (4·0) | 2 |
Tertiary | 1064 (34·1) | 218 (25·5) | 2560 (75·9) | 2027 (92·8) | 0 |
Missing | 21 (0·7) | . | 5 (0·2) | . | - |
Midlife BMI, n (%) | |||||
Underweight | 6 (0·2) | NA | NA | - | 2 |
Normal | 294 (9·4) | NA | NA | 18 (0·8) | 0 |
Overweight | 286 (9·2) | NA | NA | 26 (1·2) | 1 |
Obese | 109 (3·5) | NA | NA | 33 (1·5) | 3 |
Not applicable | 2427 (77·8) | NA | NA | 2107 (96·4) | - |
Diabetes, n (%) | |||||
Yes | 183 (5·9) | 172 (20·1) | 508 (15·1) | 274 (12·6) | 2 |
Missing | 36 (1·2) | 9 (1·1) | 76 (2·3) | 117 (5·4) | - |
Midlife Cholesterol >6.5mmol/litre | NA | NA | NA | NA | 3 |
Stroke, n (%) | |||||
Yes | 188 (6·0) | 198 (23·1) | 152 (4·5) | 169 (7·7) | 2 |
Missing | 34 (1·1) | 15 (1·8) | 285 (13·1) | - | |
TBI, n (%) | |||||
Yes | 424 (13·6) | 48 (5·6) | NA | 131 (6·0) | 2 |
Missing | 48 (1·5) | 95 (11·1) | NA | 275 (12·6) | |
Midlife Hypertension, n (%) | |||||
Yes | 118 (3·8) | NA | NA | 33 (2·2) | 1 |
Missing | 12 (0·4) | NA | NA | 0 (0·0) | |
Not applicable | 2427 (77·8) | NA | NA | 2104 (96·3) | |
Atrial fibrillation, n (%) | |||||
Yes | 103 (3·3) | 28 (3·3) | 97 (2·9) | NA | 2 |
Missing | 552 (17·7) | 522 (61·0) | | NA | |
Insomnia, n (%) | |||||
Yes | 401 (12·8) | 4 (0·5) | 991 (29·4) | NA | 2 |
Missing | 2453 (78·6) | 5 (0·6) | 58 (1·7) | NA | |
Depression, n (%) | |||||
Yes | 237 (7·6) | 10 (1·2) | 612 (18·1) | 218 (10·0) | 3 |
Missing | 132 (3·9) | 81 (9·5) | --- | 119 (5·5) | |
Physical activity, n (%) | |||||
Moderate/vigorous | 715 (22·9) | NA | 2130 (63·1) | 1370 (62·7) | −3 |
Missing | 1138 (36·5) | NA | 8 (0·2) | 0 (0·0) | |
Cognitive stimulating activity, n (%) | |||||
Low | 818 (26·2) | 596 (69·6) | NA | 826 (37·8) | 0 |
Moderate | 1290 (41·3) | 122 (14·3) | NA | 1058 (48·4) | −5 |
High | 595 (19·1) | 19 (2·2) | NA | 182 (8·3) | −4 |
Missing | 419 (13·4) | 119 (13·9) | NA | 118 (5·4) | - |
Loneliness, n (%) | |||||
Yes | 762 (24·4) | NA | 266 (7·9) | 177 (8·1) | 2 |
Missing | 58 (1·9) | NA | 12 (0·4) | 317 (14·5) | |
Fish serves/week, median (range) | 2·5 (0–22·5) | NA | 1·5 (0–17) | 4 (2–11) | −0·25 |
Missing | 742 (23·8) | NA | --- | 1121 (51·3) | |
Smoking, n (%) | |||||
Never smoked | 1435 (46·0) | 423 (49·4) | 1507 (44·7) | 1251 (57·6) | 0 |
Former smoker | 1200 (38·4) | 342 (40·0) | 1555 (46·1) | 862 (39·7) | 0 |
Current smoker | 452 (14·5) | 77 (9·0) | 312 (9·2) | 58 (2·7) | 1 |
Missing | 35 (1·1) | 14 (1·6) | ---- | 13 (0·6) | - |
Dementia, n (%) | |||||
Yes | 255 (8·2) | 414 (51·6) | 480 (14·2) | 589 (27·0) | |
Missing | 10 (0·3) |
Abbreviations:
TBI: Traumatic brain injury
BMI: Body mass index
3. RESULTS
3.1. Baseline characteristics of participants in the four studies
Table 1 shows the baseline characteristics of the four validation samples. Out of the 17 risk/protective factors included in the CogDrisk score, SNAC-K had the most risk/protective factors (16) followed by MAP (14), CHS-CS (12), and HRS ADAMS had the fewest (11) risk/protective factors. Mid-life BMI and mid-life hypertension measurements were available in MAP and SNAC-K studies. Age, sex, education, diabetes, stroke, and smoking status measures were available in all four cohorts.
The average age of the participants in every cohort was 70 years or above. In all the studies, females comprised a greater proportion of the sample than males, especially in MAP (73·5%) and SNAC-K (63·4%). The proportion of participants with tertiary education was lowest in HRS ADAMS (25·5%), followed by SNAC-K (34·1%), while MAP had the highest proportion (92·8%). The studies differed in the prevalence of several risk factors. Missing data in risk/protective factors are reported in Table 1. The points attributed to each of the risk and protective factors are also reported in Table 1. The numbers of incident dementia cases available in these datasets were 255 (7·6%) for SNAC-K (baseline: 2001–2004, followed through till 2018), 414 (51·6%) for HRS ADAMS (baseline: 2001–2003, followed till 2008–2009), 480 (14·2%) for CHS-CS (baseline: 1991–1994, followed up till 1998–1999) and 589 (27%) for MAP (baseline: 1997–1998, followed through till 2020).
3.2. Performance of the CogDrisk tool across the studies
When applied to the full datasets, including both sexes and retaining risk/protective factors with missing data, the CogDrisk score showed good discrimination in SNAC-K (AUC 0·77; 95% CI: 0·57, 0·97), HRS ADAMS (0·76; 95% CI: 0·70, 0·83), and CHS-CS (0·70; 95% CI: 0·67, 0·72), and moderate discrimination in MAP (0·66; 95% CI: 0·62, 0·70) (refer to Table 2). For males, the AUC was highest for HRS ADAMS followed by CHS-CS, SNAC-K, and MAP. For females, the highest AUC was observed when the CogDrisk score was applied to SNAC-K, followed by HRS ADAMS, CHS-CS, and MAP (refer to Table 2 for AUCs). Overall, we observed good performance of the CogDrisk score when applied to SNAC-K, HRS ADAMS and CHS-CS.
Table 2:
Parameters of CogDrisk for predicting dementia in the four cohort studies.
Parameter | SNAC-K | HRS ADAMS | CHS-CS | MAP |
---|---|---|---|---|
Risk and protective factors | (15) Age, Gender, Education, Diabetes, Stroke, Hypertension, Smoking, TBI, Atrial fibrillation, Insomnia, Depression, Loneliness, Physical activity, Cognitive activities and Fish intake | (11) Age, Gender, Education, Diabetes, Stroke, Smoking, TBI, Atrial fibrillation, insomnia, Depression and Cognitive activity | (12) Age, Gender, Education, Diabetes, Stroke, Smoking, Atrial fibrillation, Insomnia, Depression, Loneliness, Physical activity and Fish intake | (14) Age, Gender, Education, Obesity, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake |
n | N=1473, M=602, F=871 | N=229 M=111, F=118 | N=3226, M=1307, F=1919 | N=850, M=210, F=640 |
AUC Male | 0·68 (0·41, 0·95) | 0·81 (0·73, 0·89) | 0·70 (0·66, 0·74) | 0·65 (0·58, 0·73) |
AUC Female | 0·89 (0·80, 0·98) | 0·73 (0·63, 0·82) | 0·70 (0·67, 0·73) | 0·66 (0·62, 0·70) |
AUC Overall | 0·77 (0·57, 0·97) | 0·76 (0·70, 0·83) | 0·70 (0·67, 0·72) | 0·66 (0·62, 0·70) |
Reduced variable Model | (13) Age, Gender, Education, Diabetes, Stroke, Hypertension, Smoking, TBI, Atrial fibrillation, Depression, Loneliness, Cognitive activity and Fish intake | (9) Age, Gender, Education, Diabetes, Stroke, Smoking, TBI, Depression and Cognitive activity | (11) Age, Gender, Education, Stroke, Smoking, Atrial Fibrillation, Insomnia, Depression, Loneliness, Physical activity and Fish intake | (13) Age, Gender, Education, Obesity, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity and Cognitive activity |
n | N=1987, M=770, F=1217 | N=586 M=258, F=328 | N=3295, M=1330, F=1965 | N=1478, M=370, F=1108 |
AUC Male | 0·79 (0·59, 0·99) | 0·78 (0·72, 0·84) | 0·70 (0·67, 0·73) | 0·68 (0·62, 0·73) |
AUC Female | 0·72 (0·54, 0·90) | 0·75 (0·70, 0·81) | 0·70 (0·66, 0·74) | 0·66 (0·62, 0·69) |
AUC Overall | 0·76 (0·63, 0·88) | 0·75 (0·71, 0·79) | 0·70 (0·67, 0·72) | 0·67 (0·64, 0·70) |
Multiple imputed dataset | (15) Age, Gender, Education, Diabetes, Stroke, Hypertension, Smoking, TBI, Atrial fibrillation, Insomnia, Depression, Loneliness, Physical activity, Cognitive activity, and Fish intake | (11) Age, Gender, Education, Diabetes, Stroke, Smoking, TBI, Atrial fibrillation, Insomnia, Depression and Cognitive activity | (12) Age, Gender, Education, Diabetes, Stroke, Smoking, Atrial fibrillation, Insomnia, Depression, Loneliness, Physical activity and Fish intake | (14) Age, Gender, Education, Obesity, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake. |
n | N=2943, M=1081, F=1862 | N=851 M=353, F=498 | N=3370, M=1381, F=1989 | N=2176, M=575, F=1601 |
AUC Male | 0·77 (0·62, 0·92) | 0·77 (0·72, 0·82) | 0·70 (0·66, 0·74) | 0·66 (0·62, 0·71) |
AUC Female | 0·84 (0·78, 0·90) | 0·76 (0·72, 0·80) | 0·71 (0·67, 0·74) | 0·65 (0·62, 0·68) |
AUC Overall | 0·83 (0·78, 0·89) | 0·75 (0·72, 0·79) | 0·70 (0·68, 0·73) | 0·66 (0·63, 0·68) |
Abbreviations:
TBI: Traumatic brain injury
3.3. Sensitivity analysis
To assess the impact of missing data on the discriminatory performance of the CogDrisk score, we evaluated the score on the reduced variable model and using multiple imputations with all the available variables. Although the sample size improved, the resulting AUC was similar when variables were dropped from studies, i.e., when physical activity was dropped from SNAC-K, atrial fibrillation and insomnia from HRS ADAMS, TBI from CHS-CS and fish intake from MAP (refer to Table 2). To evaluate the application of CogDrisk to middle-aged people, CogDrisk was also validated in participants less than 65 years of age at baseline, where available (refer to Table 4). Obesity and hypertension were only evaluated for midlife participants as the evidence for these relates only to midlife. Midlife participants were available in SNAC-K (n=736) and MAP (n=80), with six and two cases, respectively. We used the same methodology to apply the CogDrisk score in these subsets as in the full cohorts. As the number of incident dementia cases was low, the performance of the CogDrisk score for midlife was poor, i.e., 0·51 (95% CI: 0·27, 0·75) for SNAC-K. There was a slight improvement in AUCs with multiple imputations on the respective datasets (see Table 4).
Table 4:
Parameters of CogDrisk for predicting dementia in midlife participants from SNAC-K and MAP studies.
Parameter | SNAC-K | MAP |
---|---|---|
Risk and protective factors | (16) Age, Gender, Education, Obesity, Diabetes, Stroke, Hypertension, Smoking, TBI, Atrial fibrillation, Insomnia, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake | (14) Age, Gender, Education, Obesity, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake |
n | N=584, M=253, F=331 | |
AUC Male | 0·51 (0·34, 0·67) | |
AUC Female | 0·50 (0·00, 1·00) | |
AUC Overall (a) | 0·51 (0·27, 0·75) | |
Multiple imputed dataset | (16) Age, Gender, Education, Obesity, Diabetes, Stroke, Hypertension, Smoking, TBI, Atrial fibrillation, Insomnia, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake | (14) Age, Gender, Education, Obesity, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake |
n | N=736, M=329, F=407 | N=80, M=17, F=63 |
AUC Male | 0·57 (0·36, 0·77) | 0·94 (*) |
AUC Female | 0·54 (0·00, 1·00) | 1·00 (*) |
AUC Overall | 0·57 (0·33, 0·81) | 0·97 (0·93, 1·00) |
(a) Estimates are not available due to missing data in covariates.
(*) Standard error not available.
Abbreviations: TBI: Traumatic brain injury
3.4. Characteristics and accuracy of CogDrisk without age and gender estimates for predicting dementia
As age and sex are non-modifiable risk factors for dementia, we also assessed the performance of the CogDrisk score without age and sex estimates. The resulting AUCs (95% CI) were 0·61 (0·55, 0·68) for SNAC-K, 0·61 (0·56, 0·65) for HRS ADAMS, 0·59 (0·56, 0·62) for CHS-CS, and 0·54 (0·49, 0·58) for MAP (see Table 3).
Table 3:
Parameters of CogDrisk without age and gender for predicting dementia in the four cohort studies.
Parameter | SNAC-K | HRS ADAMS | CHS-CS | MAP |
---|---|---|---|---|
Risk and protective factors | (13) Education, Diabetes, Stroke, TBI, Hypertension, Depression, Physical activity, Cognitive activity, Social isolation, Smoking, Insomnia, Atrial fibrillation and Fish intake | (7) Education, Diabetes, Stroke, TBI, Depression, Cognitive activity, and Smoking | (10) Education, Atrial fibrillation, Diabetes, Stroke, Insomnia, Depression, Social isolation, Smoking, Physical activity and Fish intake | (12) Education, Obesity, Diabetes, Stroke, TBI, Hypertension, Depression, Physical activity, Cognitive activity, Loneliness, Smoking and Fish intake. |
n | N=1479, M=603, F=876 | N=586, M=258, F=328 | N=3226, M=1307, F=1919 | N=850, M=210, F=640 |
AUC Male | 0·68 (0·60, 0·76) | 0·60 (0·53, 0·67) | 0·60 (0·56, 0·65) | 0·55 (0·47, 0·64) |
AUC Female | 0·55 (0·44, 0·65) | 0·62 (0·56, 0·67) | 0·58 (0·54, 0·62) | 0·53 (0·48, 0·58) |
AUC Overall | 0·61 (0·55, 0·68) | 0·61 (0·56, 0·65) | 0·59 (0·56, 0·62) | 0·54 (0·49, 0·58) |
Abbreviations:
TBI: Traumatic brain injury
3.5. Performance of CogDrisk-AD score in predicting Alzheimer’s disease across the studies
We also evaluated the validity of the CogDrisk-AD score in all four datasets to assess its predictive ability for Alzheimer’s disease, using the diagnosis provided by the constituent cohorts (refer to Table 5). Specifically, the CogDrisk-AD score performed best when applied to the HRS ADAMS and CHS-CS data, followed by SNAC-K and MAP. When the score was defined on a reduced set of variables, or computed on multiple imputed datasets, it performed similarly to the full model that included covariates with missing data (refer to Table 5).
Table 5:
Parameters of CogDrisk-AD for predicting Alzheimer’s disease in the four cohort studies.
Parameter | SNAC-K | HRS ADAMS | CHS-CS | MAP |
---|---|---|---|---|
Risk and protective factors | (13) Age, Gender, Education, Diabetes, Stroke, Hypertension, Smoking, TBI, Depression, Loneliness, Physical activity, Cognitive Activity and Fish intake | (9) Age, Gender, Education, Diabetes, Stroke, Smoking, TBI, Depression and Cognitive activity | (10) Age, Gender, Education, Diabetes, Stroke, Smoking, Depression, Loneliness, Physical activity and Fish intake | (13) Age, Gender, Education, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake |
n | N=1579, M=638, F=941 | N=594 M=336, F=258 | N=2701, M=1105, F=1596 | N=850, M=210, F=640 |
AUC Male | 0·65 (0·50, 0·80) | 0·73 (0·66, 0·80) | 0·72 (0·68, 0·77) | 0·66 (0·59, 0·74) |
AUC Female | 0·72 (0·62, 0·83) | 0·73 (0·67, 0·78) | 0·72 (0·68, 0·76) | 0·65 (0·61, 0·70) |
AUC Overall | 0·69 (0·60, 0·78) | 0·72 (0·68, 0·77) | 0·72 (0·69, 0·75) | 0·66 (0·62, 0·70) |
Reduced variable Model | (12) Age, Gender, Education, Diabetes, Stroke, Hypertension, Smoking, TBI, Depression, Loneliness, Cognitive activity and Fish intake | (8) Age, Gender, Education, Diabetes, Stroke, Smoking, TBI, and Depression | (9) Age, Gender, Education, Stroke, Smoking, Depression, Loneliness, Physical activity and Fish intake | (12) Age, Gender, Education, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity and Cognitive activity |
n | N=2137, M=816, F=1321 | N=671 M=284, F=387 | N=2757, M=1122, F=1635 | N=1511, M=382, F=1129 |
AUC Male | 0·66 (0·52, 0·80) | 0·72 (0·75, 0·75) | 0·72 (0·67, 0·76) | 0·69 (0·63, 0·74) |
AUC Female | 0·71 (0·62, 0·80) | 0·70 (0·65, 0·75) | 0·72 (0·68, 0·75) | 0·62 (0·62, 0·69) |
AUC Overall | 0·69 (0·61, 0·76) | 0·71 (0·67, 0·75) | 0·72 (0·69, 0·74) | 0·67 (0·64, 0·70) |
Multiple imputed dataset | (13) Age, Gender, Education, Diabetes, Stroke, Hypertension, Smoking, TBI, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake | (9) Age, Gender, Education, Diabetes, Stroke, Smoking, TBI, Depression and Cognitive activity | (10) Age, Gender, Education, Diabetes, Stroke, Smoking, Depression, Loneliness, Physical activity and Fish intake | (13) Age, Gender, Education, Diabetes, Stroke, TBI, Hypertension, Smoking, Depression, Loneliness, Physical activity, Cognitive activity and Fish intake |
n | N=3068, M=1133, F=1935 | N=853 M=353, F=498 | N=2771, M=1132, F=1639 | N=2176, M=575, F=1601 |
AUC Male | 0·70 (0·60, 0·81) | 0·76 (0·71, 0·82) | 0·73 (0·69, 0·78) | 0·67 (0·63, 0·72) |
AUC Female | 0·70 (0·64, 0·76) | 0·75 (0·71, 0·80) | 0·73 (0·70, 0·77) | 0·64 (0·61, 0·67) |
AUC Overall | 0·70 (0·65, 0·76) | 0·75 (0·72, 0·79) | 0·73 (0·70, 0·75) | 0·65 (0·63, 0·68) |
Abbreviations:
TBI: Traumatic brain injury
4. DISCUSSION
In this study, we externally validated the CogDrisk on four high-quality cohort studies. Our results indicate that the CogDrisk has adequate predictive ability for dementia and AD in late-life adults and is of comparable accuracy to other risk tools that have been developed to inform preventive actions4, 5, 24. Overall, our findings demonstrate that the CogDrisk tool can be used to assess individualized risk factors of dementia and AD in various population settings.
On externally evaluating the CogDrisk, we found that the AUCs varied between the studies, with the best predictive performance in SNAC-K and HRS ADAMS, followed by CHS-CS and MAP. There are two potential reasons for the variation in the AUCs. Firstly, the mean age of the participants at baseline differed, with HRS ADAMS and MAP having a higher mean age than CHS-CS and SNAC-K. Secondly, the studies differed in the number and prevalence of predictors available in their datasets.
A study by Licher et al. (2018) compared different dementia risk models and identified age as a major contributing factor for dementia occurrence while other risk factors had marginal contributions25. On analysing the model without age and sex, the AUCs were reduced for all studies with HRS ADAMS, SNAC-K, and CHS-CS maintaining an adequate predictive ability. In clinical practice, knowledge of areas where individuals can reduce risk is more important than non-modifiable factors such as age and sex.
The CogDrisk did not perform well on midlife participants probably due to the low number of incident dementia cases in these samples. Further testing of the tool on larger datasets with mid-life participants is underway. Risk scores for mid- and late-life adults can be used as surrogate outcomes for mid- and late-life dementia preventive interventions.
We found that the CogDrisk was predictive of Any Dementia and that the CogDrisk-AD version was predictive of AD. The CogDrisk-AD may be useful for clinical trials focussing on AD, and to guide risk reduction advice for individuals at increased risk of AD due to family history or having an APOE ɛ4 genotype.
This study has several strengths. The CogDrisk tool was assessed on four studies, for two outcomes, in both mid and late-life participants. To our knowledge, this is the most extensive validation conducted on any dementia risk assessment tool. The external validation samples include two countries from four different datasets bringing in different population characteristics, supporting the generalizability of the instrument. The tool is cost-effective because the measures are self-reported which will enable it to be used in universal health initiatives and by clinicians. The CogDrisk tool also has the potential to inform patient and health practitioners in targeting specific risk factors for individuals thus providing personalized advice in clinical setting. It includes a wider range of factors than that captured by tools that focus on cardiovascular risk factors such as SCORE2. Given the overlap in risk and protective factors of dementia with other chronic diseases, a single predictive tool for dementia and other conditions may be efficient in clinical practice. Our team is working to evaluate with such risk prediction models that can be developed.
Our study has some limitations. Firstly, not all variables needed to calculate the full CogDrisk risk score were available in the existing datasets. In practice, the CogDrisk tool is available online, so all factors can be easily assessed. Secondly, although we harmonised the measures of the predictors between studies, some variability remains in how risk factors were measured across studies, for example in the questions measuring moderate and vigorous physical activity. The addition of new literature to the field since earlier tools were developed has extended the number of modifiable risk factors but has not led to noticeably larger AUCs. Our interpretation is that our findings demonstrate the upper limit of what is possible for low-cost, convenient dementia risk assessment when validating using cohort study data. Measurement of individual risk factors involves a degree of error, and clinical thresholds lack specificity where patients or clinicians endorse binary responses. Co-occurrence of risk factors within individuals may also in part explain the limit on the AUC that can be reached in developing practical dementia risk tools. Finally, the long prodromal period of dementia means that, unless a cohort study has followed every participant to completion, there will be undetected cases of preclinical dementia, or individuals who will ultimately develop dementia but who do not obtain a diagnosis during the observation period. This reduces the accuracy of predictive models.
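The last limitation can be made concrete with a toy example. The sketch below scores eight hypothetical participants, computes the AUC against their eventual dementia status, and then recomputes it after one high-scoring future case is recorded as a non-case, as would happen if follow-up ended before diagnosis. All values and names are assumptions for illustration only and are not drawn from the cohorts.

```python
def auc(scores, labels):
    """AUC as the share of (case, non-case) pairs ranked correctly; ties count 0.5."""
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one non-case")
    concordant = 0.0
    for c in cases:
        for n in controls:
            if c > n:
                concordant += 1.0
            elif c == n:
                concordant += 0.5
    return concordant / (len(cases) * len(controls))


# Hypothetical risk scores for eight participants (illustrative only).
risk_score = [9, 8, 7, 6, 4, 3, 2, 1]

# Status if every participant were followed to completion.
eventual_status = [1, 1, 1, 0, 1, 0, 0, 0]

# Status recorded during a limited observation period: the participant
# with score 7 will develop dementia later but is counted as a non-case.
observed_status = [1, 1, 0, 0, 1, 0, 0, 0]

auc_eventual = auc(risk_score, eventual_status)
auc_observed = auc(risk_score, observed_status)

print(f"AUC against eventual status: {auc_eventual:.3f}")
print(f"AUC against observed status: {auc_observed:.3f}")
```

Here the same scores give a lower AUC against the observed labels than against the eventual ones, because a correctly high-ranked future case is treated as a misranked non-case.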
We conclude that the CogDrisk is a valid assessment tool that predicts dementia and AD. It incorporates a large number of modifiable risk factors for dementia that are available in a range of clinical and research contexts, making the tool practical and available for use in dementia prevention interventions. The CogDrisk tool can be used by clinicians, researchers, policy makers and the public to identify individuals at risk of dementia and to monitor risk reduction efforts.
Supplementary Material
FUNDING
This research was funded by a NeuRA Grant to KJA, the NHMRC Dementia Collaborative Research Centre, and NHMRC GNT1171279. KJA is funded by Australian Research Council Fellowship FL190100011, and SK is funded by NHMRC GNT1171279. CQ is partly funded by grants from the Swedish Research Council (grant no.: 2020-01574), the Swedish Foundation for International Cooperation in Research and Higher Education (STINT) (grant no.: CH2019-8320), and Karolinska Institutet, Stockholm, Sweden. LF received grants from the Swedish Research Council (grant no.: 2017-06088) and the Swedish Research Council for Health, Working Life and Welfare (grant no.: 2016-07175). MAP is supported by NIA grant R01AG17917. MAP data can be requested at https://www.radc.rush.edu.
Footnotes
Conflict of interest disclosure
SDH is an advisor to Staying Sharp. MCC is a member of the AARP Scientific Advisory Board. There are no other conflicts of interest.
Ethics approval and consent to participate
Ethical approval for this project was obtained from the UNSW Human Research Ethics Committee (Protocols # HC200108, HC200331, 3349). Each cohort study was approved by its Institutional Review Board. Informed consent was obtained by each study for all participants.
Availability of data and materials
Data from the HRS ADAMS, CHS-CS, and MAP datasets are publicly available. We obtained approval to access these from the data custodians. As the SNAC-K data could not be transferred, DR carried out the analysis on our behalf. These data cannot be shared with other investigators.
REFERENCES
- 1. Gauthier S, Rosa-Neto P, Morais J, Webster C. World Alzheimer Report 2021: Journey through the diagnosis of dementia. Alzheimer’s Disease International: London, UK. 2021.
- 2. Capuano AW, Shah RC, Blanche P, Wilson RS, Barnes LL, Bennett DA, et al. Derivation and validation of the Rapid Assessment of Dementia Risk (RADaR) for older adults. PLoS One. 2022;17(3):e0265379. doi:10.1371/journal.pone.0265379.
- 3. Anstey KJ, Cherbuin N, Herath PM. Development of a new method for assessing global risk of Alzheimer’s disease for use in population health approaches to prevention. Prev Sci. 2013;14(4):411–21. doi:10.1007/s11121-012-0313-2.
- 4. Kivipelto M, Ngandu T, Laatikainen T, Winblad B, Soininen H, Tuomilehto J. Risk score for the prediction of dementia risk in 20 years among middle aged people: a longitudinal, population-based study. The Lancet Neurology. 2006;5(9):735–41.
- 5. Schiepers OJG, Kohler S, Deckers K, Irving K, O’Donnell CA, van den Akker M, et al. Lifestyle for Brain Health (LIBRA): a new model for dementia prevention. International journal of geriatric psychiatry. 2018;33(1):167–75. doi:10.1002/gps.4700.
- 6. Hou XH, Feng L, Zhang C, Cao XP, Tan L, Yu JT. Models for predicting risk of dementia: a systematic review. J Neurol Neurosurg Psychiatry. 2019;90(4):373–9. doi:10.1136/jnnp-2018-318212.
- 7. Stephan BC, Tang E, Muniz-Terrera G. Composite risk scores for predicting dementia. Curr Opin Psychiatry. 2016;29(2):174–80. doi:10.1097/YCO.0000000000000235.
- 8. Tang EY, Harrison SL, Errington L, Gordon MF, Visser PJ, Novak G, et al. Current Developments in Dementia Risk Prediction Modelling: An Updated Systematic Review. PLoS One. 2015;10(9):e0136181. doi:10.1371/journal.pone.0136181.
- 9. You J, Zhang YR, Wang HF, Yang M, Feng JF, Yu JT, et al. Development of a novel dementia risk prediction model in the general population: A large, longitudinal, population-based machine-learning study. EClinicalMedicine. 2022;53:101665. doi:10.1016/j.eclinm.2022.101665.
- 10. Barnes DE, Beiser AS, Lee A, Langa KM, Koyama A, Preis SR, et al. Development and validation of a brief dementia screening indicator for primary care. Alzheimers Dement. 2014;10(6):656–65.e1. doi:10.1016/j.jalz.2013.11.006.
- 11. Yu J-T, Xu W, Tan C-C, Andrieu S, Suckling J, Evangelou E, et al. Evidence-based prevention of Alzheimer’s disease: systematic review and meta-analysis of 243 observational prospective studies and 153 randomised controlled trials. Journal of Neurology, Neurosurgery & Psychiatry. 2020;91(11):1201–9.
- 12. Anstey KJ, Ee N, Eramudugolla R, Jagger C, Peters R. A Systematic Review of Meta-Analyses that Evaluate Risk Factors for Dementia to Evaluate the Quantity, Quality, and Global Representativeness of Evidence. Journal of Alzheimer’s disease : JAD. 2019;70(s1):S165–S86. doi:10.3233/jad-190181.
- 13. Zheng F, Xie W, Li C, Gao D, Liang J. Prediction abilities of SCORE2 risk algorithms for incident dementia and all-cause mortality: results from the UK Biobank cohort study. The Journals of Gerontology: Series A. 2022.
- 14. Anstey KJ, Kootar S, Huque MH, Eramudugolla R, Peters R. Development of the CogDrisk tool to assess risk factors for dementia. Alzheimers Dement (Amst). 2022;14(1):e12336. doi:10.1002/dad2.12336.
- 15. Lagergren M, Fratiglioni L, Hallberg IR, Berglund J, Elmstahl S, Hagberg B, et al. A longitudinal study integrating population, care and social services data. The Swedish National study on Aging and Care (SNAC). Aging Clin Exp Res. 2004;16(2):158–68. doi:10.1007/BF03324546.
- 16. Langa KM, Plassman BL, Wallace RB, Herzog AR, Heeringa SG, Ofstedal MB, et al. The Aging, Demographics, and Memory Study: study design and methods. Neuroepidemiology. 2005;25(4):181–91. doi:10.1159/000087448.
- 17. Health and Retirement Study (ADAMS) public use dataset. Produced and distributed by the University of Michigan with funding from the National Institute on Aging (grant number NIA U01AG009740). Ann Arbor, MI. 2009.
- 18. Lopez OL, Kuller LH, Fitzpatrick A, Ives D, Becker JT, Beauchamp N. Evaluation of dementia in the cardiovascular health cognition study. Neuroepidemiology. 2003;22(1):1–12. doi:10.1159/000067110.
- 19. Teng EL, Chui HC. The Modified Mini-Mental State (3MS) examination. J Clin Psychiatry. 1987;48(8):314–8.
- 20. Bennett DA, Schneider JA, Buchman AS, Mendes de Leon C, Bienias JL, Wilson RS. The Rush Memory and Aging Project: study design and baseline characteristics of the study cohort. Neuroepidemiology. 2005;25(4):163–75. doi:10.1159/000087446.
- 21. Bennett DA, Buchman AS, Boyle PA, Barnes LL, Wilson RS, Schneider JA. Religious Orders Study and Rush Memory and Aging Project. Journal of Alzheimer’s disease : JAD. 2018;64(s1):S161–S89. doi:10.3233/JAD-179939.
- 22. Zou KH, O’Malley AJ, Mauri L. Receiver-operating characteristic analysis for evaluating diagnostic tests and predictive models. Circulation. 2007;115(5):654–7. doi:10.1161/CIRCULATIONAHA.105.594929.
- 23. Huque MH, Carlin JB, Simpson JA, Lee KJ. A comparison of multiple imputation methods for missing data in longitudinal studies. BMC Med Res Methodol. 2018;18(1):168. doi:10.1186/s12874-018-0615-6.
- 24. Anstey KJ, Cherbuin N, Herath PM, Qiu C, Kuller LH, Lopez OL, et al. A self-report risk index to predict occurrence of dementia in three independent cohorts of older adults: the ANU-ADRI. PLoS One. 2014;9(1):e86141.
- 25. Licher S, Yilmaz P, Leening MJ, Wolters FJ, Vernooij MW, Stephan BC, et al. External validation of four dementia prediction models for use in the general community-dwelling population: a comparative analysis from the Rotterdam Study. European journal of epidemiology. 2018;33(7):645–55.