Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2021 Mar 1;11:4822. doi: 10.1038/s41598-021-83911-9

Machine learning methods to predict amyloid positivity using domain scores from cognitive tests

Guogen Shan 1,, Charles Bernick 2, Jessica Z K Caldwell 2, Aaron Ritter 2
PMCID: PMC7921140  PMID: 33649452

Abstract

Amyloid-β (Aβ) is the target in many clinical trials for Alzheimer’s disease (AD). Preclinical AD patients are heterogeneous with regards to different backgrounds and diagnosis. Accurately predicting Aβ status of participants by using machine learning (ML) models based on easily accessible data, could improve the effectiveness of AD clinical trials. We will develop optimal ML models for each subpopulation stratified by sex and disease stages using sub scores from screening neurological tests. Data from the AD Neuroimaging Initiative (ADNI) were used to build the ML models, for three groups: individuals with significant memory concern, early mild cognitive impairment (MCI), and late MCI. Data were further separated into 6 groups by disease stage (3 levels) and sex (2 categories). The outcome was defined as the Aβ status confirmed by the PET imaging, and the features include demographic data, newly identified risk factors, screening tests, and the domain scores from screening tests. Monte Carlo simulation studies were used together with k-fold cross-validation technique to compute model performance metric. We also develop a new feature selection method based on the stochastic ordering to avoiding searching all possible combinations of features. Accuracy of the identified optimal model for SMC male was over 90% by using domain scores, and accuracy for LMCI female was above 86%. Domain scores can improve the ML model prediction as compared to the total scores. Accurate ML prediction models can identify the proper population for AD clinical trials.

Subject terms: Neuroscience, Neurology

Introduction

The global impact of Alzheimer’s disease (AD) is immense1. With worldwide rates expected to triple in the next decade, the development of successful strategies to combat AD has become a global health imperative2. One such strategy is the identification of individuals with AD prior to the onset of dementia3. Alzheimer’s disease is now conceptualized as a continuous disease with a long asymptomatic phase in which neuropathological substrate accumulates eventually leading to stages of mild cognitive impairment (MCI) and finally to overt functional decline and dementia. Early diagnosis of AD has been associated with a variety of benefits including increased survival time4, improved psychological well-being for patients and their families5, and lower health care costs6,7. Perhaps most compelling is emerging data from clinical trials of disease modifying therapies (DMT) clearly demonstrating that meaningful therapeutic success will likely require from early intervention8.

Amyloid-β (Aβ) is one of the two hallmark pathologies for diagnosis of AD9. AD is characterized by a long preclinical stage which is referred to be as mild cognitive impairment (MCI). Aβ has been the target of disease modified therapies (DMTs) in many AD clinical trials. One of the most recent drugs is aducanumab which is believed to be able to reduce deposits of Aβ. The results from its Phase 3 study indicated a statistically significant reduction after 78 weeks in the primary outcome: Clinical Dementia Rating-Sum of Boxes (CDR-SB) score, in the high-dose aducanumab group as compared to the placebo group. This Aβ targeted DMT could be the first new AD treatment in nearly two decades. Aβ status can be confirmed by using either cerebrospinal fluid (CSF) or positron emission tomography (PET) imaging. CSF is invasive and potentially painful for patients, and sometimes a participant can’t have a lumbar puncture because of a back deformity, infection, or possible brain herniation. Amyloid PET imaging is preferable in certain scenarios, but its utilization in clinical and trial settings is limited due to patients’ concerns (e.g., radiation), and high costs which are often not covered by insurances. Thus, developing tools to accurately predict the Aβ status offers an attractive approach10,11.

One potential solution for overcoming this problem is to utilize affordable global screening tools that can be rapidly and inexpensively administered to diverse populations. The Montreal Cognitive Assessment (MoCA) is one such candidate12. Now in widespread use, the MoCA is a brief screening tool that takes approximately 10 min to administer and score. It includes 12 individual tasks grouped into seven cognitive domains (visuospatial/executive; naming; memory; attention; language; abstraction; and orientation). Scores on each task are summed to yield a total score, with maximum total of 30. Using a cutoff of 26, quantitative analysis of the MoCA shows that it has good sensitivity and specificity for individuals with dementia but variable specificity in MCI stages (76% with a range of 19–98%)13. Because the narrow range of scores within each cognitive domain limit traditional statistically inquiry, less research has been conducted to indicate whether qualitative analysis (domain-level performance) improves its diagnostic accuracy14,15.

MCI and AD patients are characterized by heterogeneity in sex and APOE ε4 status16. Women with AD decline more rapidly in cognition than men with AD from longitudinal studies including the ADNI1721. In a study to investigate the longitudinal change in ADAS-Cog in MCI patients using the ADNI data, cognitive decline is greater in females than males, and APOE ε4 carriers have a significant effect on both slope and curvature of ADAS-cog change as compared to non-carriers22. Women have better verbal memory than men on average, across the lifespan. In the context of AD, this memory advantage appears to persist in women with normal cognition, despite presence of measurable pathological changes, including presence of brain beta-amyloid10,15,19,2325. This advantage has also been suggested to result in memory measures being less effective in screening women versus men for early AD-related changes26,27.

Machine learning (ML) methods hold promise for improving diagnostic classification above current processes and have been successfully applied to studies of individuals with early AD. However, current approaches utilized only a very few ML methods based on commonly used measures, and the k-fold cross-validation resampling procedure was traditionally used to evaluate model performance11,2830, but the results are not reliable with only one simulation. We will use Monte Carlo simulations along with the k-fold cross-validation technique to provide reliable comparisons between the considered ML models3133. The primary purpose of the current paper is to explore whether incorporating domain level scoring on the MoCA, in combination with several other widely available screening tests such as the Alzheimer’s Disease Cognitive Assessment (ADAS-Cog) and the Mini-Mental Status Examination (MMSE) into a novel machine learning (ML) algorithm improves diagnostic classification of AD in early stage individuals. It was hypothesized that incorporation of domain level scoring of these screening tests would improve performance above using the total scores in the ML models for each subpopulation.

Methods

Study designs and participants

Data used in this project were obtained from the ADNI database in June 2020 (http://adni.loni.usc.edu/)34,35. The ADNI is an ongoing longitudinal cohort of early stage AD research participants that has enrolled more than 1800 participants since 2004. Although a continuous study, there have been several phases of ADNI: ADNI-1, ADNI-Go, ADNI-2, and ADNI-3 (current). For our analysis, we wanted to select research participants at the earliest stages of symptomatic disease. This required us to select participants from different ADNI studies. Individuals with significant memory concern (SMC) were selected from ADNI-2, and ADNI-3 because the SMC cohort was added in the ADNI starting from ADNI-2 to address the gap between healthy controls and MCI. Individuals with early MCI (EMCI) or late MCI (LMCI) were selected from ADNI-GO, ADNI-2, and ADNI-3. Because ADNI-1 used Pittsburgh Compound-B (PIB) to determine amyloid positivity, we did not use data from ADNI-1, but LMCI participants initially enrolled in ADNI-1 were included if they had follow up visits in the following three phases. In the ADNI study an individual’s diagnosis is rendered based on current clinical criteria used in conjunction with performance on psychometric testing. SMC is defined by having a significant memory concern but no impairment on the Logical Memroy II subscale (Delayed Paragraph Recall, Paragraph A only) from the Wechsler Memory Scale-Revised, while EMCI and LMCI are the two complementary groups of mild cognitive impairment (MCI), and are distinguished by performance on the Logical Memroy II subscale36.

Amyloid positivity was determined quantitatively. We computed the standardized uptake value ratio (SUVR): the average of weighted cortical retention means divided by the whole cerebellum SUVR, where frontal, cingulate, parietal, and temporal regions were used in the calculation of cortical retention means with a threshold of 1.11 used to define the binary amyloid status35,37,38.

In ADNI, participants are assessed at regular visits. These assessments are used to render a diagnosis. As a result, an individual’s diagnosis may change during the course of the study. For our study we analyzed data collected from the baseline visit as this is the visit when amyloid positron emission tomography (PET) occurs. The sample size and characteristics of individuals used in our analysis are presented in Table 2. To account for the important moderating factors of sex we further stratified each subgroup by sex.

Table 2.

Characteristics of the 6 subgroups from the ADNI.

SMC EMCI LMCI pvalue
F M pvalue F M pvalue F M pvalue
N 100 70 141 176 102 134
Amyloid (%) 41 (41.0) 19 (27.1) 0.063 65 (46.1) 89 (50.6) 0.429 69 (67.6) 87 (64.9) 0.662 < 0.0001
Age 70.9 (5.9) 73.4 (6.1) 0.010 70.6 (7.8) 72.4 (7.0) 0.035 72.6 (8.5) 75.7 (8.3) 0.005 < 0.0001
Edu 16.5 (2.7) 17.3 (2.2) 0.044 15.6 (2.7) 16.5 (2.6) 0.003 15.7 (2.6) 16.5 (2.8) 0.015 0.0075
Hisp (%) 5 (5.0) 1 (1.4) 0.214 8 (5.7) 7(4.0) 0.480 4 (3.9) 1 (0.7) 0.093 0.3653
Race 0.279 0.607 0.035 0.0662
Whites 87 (87.0) 66 (94.3) 129 (91.5) 166 (94.3) 93 (91.2) 131 (97.8)
African American 8 (8.0) 2 (2.9) 4 (2.8) 3 (1.7) 7 (6.9) 1 (0.7)
Other 5 (5.0) 2 (2.9) 8 (5.7) 7 (4.0) 2 (2.0) 2 (1.5)
Marry status 0.079 <0.01 <0.01 0.5913
Married 67 (67.0) 56 (80.0) 89 (63.1) 156 (88.6) 59 (57.8) 116 (86.6)
Never married 9 (9.0) 2 (2.9) 9 (6.4) 5 (2.8) 4 (3.9) 1 (0.7)
Divorced 9 (9.0) 8 (11.4) 25 (17.7) 11 (6.2) 14 (13.7) 9 (6.7)
Widowed 15 (15.0) 4 (5.7) 18 (12.8) 4 (2.3) 25 (24.5) 8 (6.0)
APOE ε4 0.176 0.122 0.485 0.0001
0 copy 59 (59.0) 50 (71.4) 89 (63.1) 91 (51.7) 45 (44.1) 69 (51.5)
1 copy 36 (36.0) 19 (27.1) 43 (30.5) 69 (39.2) 44 (43.1) 48 (35.8)
2 copies 5 (5.0) 1 (1.4) 9 (6.4) 16 (9.1) 13 (12.7) 17 (12.7)
ADAS-13 9.4 (4.3) 11.7 (4.5) 0.001 12.2 (5.7) 13.5 (5.4) 0.045 18.0 (8.2) 18.6 (7.0) 0.573 < 0.0001
MoCA 26.2 (2.6) 25.2 (2.7) 0.017 24.1 (3.1) 23.7 (2.9) 0.252 22.0 (3.8) 22.3 (3.1) 0.659 < 0.0001
CDRSB 0.1 (0.4) 0.1 (0.3) 0.193 1.2 (0.8) 1.3 (0.8) 0.180 1.7 (1.0) 1.7 (1.2) 0.931 < 0.0001
MMSE 29.1 (1.2) 29.0 (1.2) 0.884 28.4 (1.7) 28.2 (1.6) 0.162 27.4 (2.0) 27.5 (1.8) 0.816 < 0.0001
Family hist (%) 67 (67.0) 42 (60.0) 0.349 95 (67.4) 109 (61.9) 0.315 62 (60.8) 75 (56.0) 0.458 0.1776
Hypertension (%) 41 (41.0) 33 (47.1) 0.427 64 (45.4) 97 (55.1) 0.085 43 (42.2) 66 (49.3) 0.279 0.7189
GDS 1.0 (1.0) 0.9 (0.9) 0.625 2.0 (1.6) 1.6 (1.5) 0.019 2.2 (1.9) 1.8 (1.6) 0.137 < 0.0001

The pvalues in the last column are for the difference between the three groups (SMC, EMCI, and LMCI).

The three diagnosis groups (SMC, EMCI, and LMCI) were defined by their baseline diagnostic results. Paticipants’ amyloid status were obtained from the baseline visit, or the nearest visit having the amyloid status outcome and having the same diagnosis as baseline when the amyloid status was not available at baseline. We used that visit date to merge with other data files (e.g., cognitive measures). Sex is an important moderation factor in AD research19,20. For each diagnosis group, we stratified data into two subgroups by sex: Female or Male. The sample sizes for each subgroup were presented in Table 2.

Model creation

Demographics

To build our model we attempted to incorporate known risk factors for amyloid positivity. Five demographic data were obtained from the ADNI: age, race (White, African American, or others), years of eduction, Hispanic ethnicity, and marital status (married, never married, divorced, or widowed). Due to small percentages of participants other than White or African American, we combined them as one group. APOE ε4 was one of the three strong risk factors for amyloid status prediction in addition to age and ADAS-cog39. Family history of dementia40, history of hypertension41, and the Geriatric Depression Scale (GDS-15) scores were included in the ML models.

Sex

Women and men differ significantly in terms of neuropsychological test performance, disease trajectory, and interaction with APOE ε4 status. Women’s advantage in verbal memory has been suggested to result in memory measures being less effective in screening women for early AD changes. Similarly, given that women’s strong memory might have a masking effect early in the disease process, predicting presence of brain amyloid with memory test scores is expected to be less effective for women, particularly for women with no detectable memory deficits. Based on the finding of the heterogeneity in sex and APOE ε4 status in MCI and AD, it is critical to build separate statistical prediction models for each subpopulation stratified by sex and APOE ε4 status. Based on these differences we built separate models for men and women at each disease stage.

Cognitive tests

The neuropsychological scores from the following four tests were included as features in the machine learning models: (1) Clinical Dementia Rating-Sum of Boxes (CDR-SB), (2) Mini Mental State Exam (MMSE), (3) Montreal Cognitive Assessment (MoCA), and (4) the 13-item ADAS-cog. For the MoCA score and the ADAS-cog score, we also included their domain level scores. Standard administration of the MoCA consists of 12 individual tasks grouped into seven cognitive domains: (M1) visuospatial/executive, (M2) naming, (M3) attention, (M4) language, (M5) abstraction, (M6) memory, and (M7) orientation. The ADAS-cog-13 include 13 domain areas: (A1) Word Recall, (A2) Commands, (A3) Constructional Praxis, (A4) Delayed recall, (A5) Naming, (A6) Ideational Praxis, (A7) Orientation, (A8) Word Recognition, (A9) Recall instructions, (A10) Spoken language, (A11) Word finding, (A12) Comprehension, and (A13) Number cancellation42.

The narrow range of scores within each domain (range from 0 to 12), makes application of traditional statistical methods to domain-specific performance difficult. As a result, the predominance of MoCA-related research has focused on total scores and likely underestimates the full utility it may provide as a screening tool. Domain level scores provide in essence, a “mini-cognitive profile” that may provide a more granular view of an individual’s cognition.

Machine learning models

We built ML models with by using both Monte-Carlo simulations and ten-fold cross-validation procedure. In each simulation, the complete data were split into a training data set (80%) and a testing data set (20%), where the training data set will be used in ten-fold cross-validation to build the prediction model, and the testing data set will be used for validation and calculating model performance metrics.

ML models can be used to improve amyloid positivity prediction by using the easily accessible data. We applied widely used ML methods to build an optimal model with the highest average accuracy from 1000 simulations. Due to variation in splitting data into a training data set and a testing data set, a few simulations are not sufficient enough to provide reliable results. Thus, we run the simulation for 1000 times to identify the optimal ML model with reliable conclusions.

ML methods

We built ML predictive models with the statistical package caret in R43,44, using the following supervised ML methods: linear discriminant analysis (LDA), k-nearest neighbor (kNN), Decision trees (DT), support vector machines (SVM) and random forests (RF). The LDA classifier finds a linear combination of features that characterizes or separates two or more classes. SVM finds a decision function that maximizes the margin around the separating hyperplane by modeling a mapping from features to labels as a combination of kernels. In Table 1, we list the 15 ML models along with the method values used in the R function.

Table 1.

The 15 ML models from the R package caret.

Model ID ML model Method value in R
1 Linear discriminant analysis lda
2 Factor-based linear discriminant analysis RFlda
3 Generalized linear model glm
4 Random forest ranger
5 Recursive partitioning and regression trees rpart
6 k-nearest neighbors knn
7 Support vector machines with radial basis function kernel svmRadial
8 Support vector machines with linear kernel svmLinear
9 Support vector machines with polynomial kernel svmPoly
10 Random forest rf
11 Stochastic gradient boosting gbm
12 Boosted logistic regression LogitBoost
13 tree models or rule-based models C5.0
14 Bagged CART treebag
15 Boosted classification trees ada

Performance metrics

The optimal ML model is identified as the one having the highest average accuracy. Accuracy is commonly used to assess the performance of a ML model: the proportion of all classes that are correctly predicted10,45 which is defined as:

Accuracy=TP+TNTP+TN+FP+FN,

where TP, FN, TN, and FP are the numbers of true positive, false negative, true negative, and false positive, respectively. It is easy to show that the total sample size is N = TP+TN+FP+FN, and N+ = TP+FN and N- = TN+FP are the number of participants with positive and negative amyloid, respectively.

The Matthews Correlation Coefficient (MCC) can be considered as an alternative of accuracy to assess the model performance. The MCC is equivalent to the Pearson correlation coefficient between actual and predicted amyloid status, with the range from − 1 (perfect misclassification) to 1 (perfect classification)46,47. The MCC is defined as

MCC=TP×TN-FP×FN(TP+FN)(TP+FP)(TN+FP)(TN+FN).

The MCC is a reliable statistical measure, and it has a high score only if the prediction obtained good results in all of the four confusion matrix categories (high values of TP and TN, and low values of FN and FP)48. The Other performance metrics were also calculated and compared: sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV).

Feature selections

For a study with a total of F features, the total number of all possible feature combinations is 2F. It increases exponentially as F goes up. For a study with 20 features, the number of all possible combinations is over 1 million. It is not computationally feasible to search over all possible combinations to identify the optimal feature set for each ML method. To reduce the computational intensity, we propose using a stochastic ordering approach in conjunction with the forward model selection approach49. The stochastic ordering approach is traditionally used in exact statistical inference to order the sample space which is sorted by a test statistic including point estimates and confidence limits11,47,49,50.

We used the forward model selection approach with the Akaike Information Criterion (AIC) as the criteria to determine the ordering of these features. The first step is to fit F models with one of the F features in each model. The model with the smallest AIC is selected and its associated feature is assigned as the feature in the first place, denoted as X(1). In the second step, we fit F-1 models with one of the remaining features after the first step, and X(1) in the model. The second feature is the one from the model with the smallest AIC among these F-1 models. Suppose the second feature is X(2). Following this procedure, the following ordered F-2 features are determined: X(3),,X(F). In this article, a multiple logistic regression model is the statistical model used to determine the feature ordering.

Instead of 2F feature combinations, we used the F combinations: Zi={X(1),,X(i)}, where i=1,2,, and F. The new stochastic ordering provides an efficient way to determine the importance of these features to predict amyloid positivity, and it provides an efficient route to search for the optimal set of features.

Results

We built ML models for female and male within each diagnosis group, with a total of 6 subgroups: SMC female, SMC male, EMCI female, EMCI male, LMCI female, and LMCI male. Table 2 presents the demographics and clinical characteristics of these 6 subgroups. The rate of amyloid positivity in female was close to that in men in the EMCI group and the LMCI group, while female had a much higher rate than male in the SMC group. Male were generally older and tended to have a higher level of education than female in each group. The ADAS-Cog-13 appeared to increase as disease was progressed, and females had better performances than men in general. We also present the pvalue in comparing the three groups (SMC, EMCI, and LMCI) for each characteristic. The following characteristics are not significant in comparing the three groups: hispanic ethnicity, race, marital status, family history of dementia, and history of hypertension. The remaining demographics and clinical characteristics show statistical differences between the three groups.

For ML models using total scores, the 13 features in Table 2 were all included in the model. For ML models using domain scores, the 7 domain scores from MoCA and the 13 domain scores from the ADAS-cog-13 were included as features in addition to the 13 features in ML models using total scores. For a categorical feature (e.g., Hispanic ethnicity), it is possible that almost all the participants belong to one category which could cause the failure of the ML model building. If that dominate category (e.g., non-Hispanic) had the participants more than the sample size in that subgroup minus 5, that feature was removed from the features in building ML models for that subgroup.

We utilized the proposed stochastic ordering method for the feature importance ordering in each subgroup. It should be noted that the ordering of features in each subgroup could be different because the importance of features in predicting amyloid positivity varies in each subgroup. Figure 1 shows the accuracy of each ML method using domain scores, as a function of the numbers of features in each subgroup. The accuracy lines are quite smooth under the new stochastic ordering feature selection method. It can be seen that accuracy for male is much higher that that for female within SMC or EMCI, while it is reversed in LMCI with a higher accuracy for female. The highest accuracy is often achieved with less than half of the features, except the case for SMC male.

Figure 1.

Figure 1

Accuracy of the 15 ML methods using the domain scores for the 6 subgroups stratified by diagnosis and sex.

We presented the optimal ML method using domain scores and the associated number of features for each subgroup in Table 3. It should be noted that the stochastic ordering of features in each subgroup is often different from each other. The optimal numbers of features are often small, except the case for the SMC male group. The SVM methods were the best in half of the cases: the SVM with polynominal kernel (svmPoly) for both subgroups in the LMCI, and the SVM with radial kernel (svmRadial) for EMCI male. The other 3 optimal methods were: a generalized linear model (glm), a boosted Logistic Regression (LogitBoost), and a boosted classification trees method (ada).

Table 3.

The optimal ML model for each subgroup.

Diagnosis Sex Optimal ML method Num Features
LMCI F SVM with polynomial kernel 7 APOE ε4, ADAS total, ADAS-Q2, 4, 9, 12, Family Hist, GDS
LMCI M SVM with polynomial kernel 3 APOE ε4, ADAS total, ADAS-Q2,
EMCI F Generalized linear model 2 Age, APOE ε4
EMCI M SVM with radial kernel 6 Age, APOE ε4, marry status, ADAS-Q4, Q5, Q6
SMC F Boosted logistic regression 4 APOE ε4, MoCA total, ADAS-Q8, ADAS-Q11
SMC M Boosted classification trees 27 All features

We compared accuracy of optimal ML models using domain scores and total scores in Table 4. The ML models using domain scores had substantial accuracy gain as compared to those based on total scores in the following three subgroups: LMCI female (3.4% increase), LMCI male (3.1% increase), and SMC male (4%). In the EMCI groups, the optimal ML models using domain scores were similar to those using total scores. In addition to accuracy, we presented the other five ML model performance matrix (MCC, sen, spe, PPV, and NPV) in Table 4 for the identified optimal ML models. When the overall accuracy was similar between female and male (e.g, the EMCI group), all other model performance matrix were similar as well. When the accuracy of the optimal ML models using domain scores was higher, the MCC was higher and other performance measures (sen, spe, PPV, NPV) were better balanced (e.g., sen and spe were close to each other).

Table 4.

Comparing ML models using domain scores or total scores based on 1000 simulations.

Diagnosis Sex Scores Accuracy (%) MCC Sen (%) Spe (%) PPV (%) NPV (%)
LMCI F Domain scores 86.1 0.694 88.8 80.4 91.2 78.8
LMCI Total scores 82.7 0.668 78.6 91.7 95.6 67.9
LMCI M Domain scores 78.1 0.534 80.9 72.7 85.2 68.0
LMCI Total scores 75.0 0.424 88.2 50.0 76.8 70.2
EMCI F Domain scores 71.2 0.428 66.3 75.5 71.0 72.8
EMCI Total scores 71.3 0.430 66.7 75.3 70.9 73.1
EMCI M Domain scores 78.4 0.578 73.0 83.8 82.7 76.1
EMCI Total scores 77.2 0.551 74.8 79.6 79.2 76.5
SMC F Domain scores 67.0 0.326 56.5 74.6 63.7 70.9
SMC Total scores 68.8 0.364 58.4 76.3 66.2 72.2
SMC M Domain scores 90.4 0.726 66.7 97.5 91.7 90.7
SMC Total scores 86.4 0.596 59.8 94.4 78.2 89.1

Discussion

Deposition of Aβ is an early recognized marker of AD, detectable as much as a decade prior to symptom onset. A popular therapeutic strategy focuses on amyloid removal, which if implemented preclinically, could potentially change the trajectory of the disease course. Current means of amyloid recognition are either invasive or expensive. Thus, methods that could predict at an individual level who may be most likely to have elevated brain amyloid using easily obtainable clinical data has the potential to reduce costs and speed enrollment in clinical trials of AD disease modifying agents51,52.

This study assessed whether incorporation of domain level scoring from cognitive screening tests into a multi-variable ML model could improve diagnostic classification of individuals with early stage AD above total scores. Screening tests have been shown to insensitive to the earliest cognitive changes in AD and we hypothesized that building a model that could presumably integrate a more granular picture of an individual’s cognition (such as isolated weaknesses in verbal memory) would significantly improve model accuracy over total scores53,54. As hypothesized, in most of the subgroups analyzed, incorporation of domain level performance but this was neither robust nor true for each subgroup. In particular, we found no benefit of incorporating domain level performance in the women groups in the two earliest stages of AD (SMC and EMCI). This is not entirely unexpected given that women have known advantages over men in verbal memory19 and the screening tests sampled—MoCA and ADAS-Cog—rely heavily on verbal memory tasks. The model’s discriminative accuracy lagged significantly behind that for men in the earliest stages of disease followed by a significant improvement in those women who had been diagnosed with LMCI. Women have been shown to decline more rapidly than men in AD and the improved accuracy of the model to classify women in LMCI as opposed to SMC or EMCI may reflect the accelerated failure of memory networks that may occur later in women compared to men55. The low discriminative accuracy of the model in SMC (68.8) and EMCI women (71.3), even with the incorporation of APOE ε4 status indicates that it is challenging to accurately predict amyloid status in women in early stages of AD.

A novel feature of our approach is the development of a new feature selection method based on the stochastic ordering of features within each subgroup45,56. This new feature selection method reduced the computational intensity from exponential to linear, making the search for the optimal set of features computationally feasible. The proposed stochastic ordering seemed to work very well in general. The presence of the APOE ε4 allele is highly correlated with amyloid positivity and we saw that this was the optimal feature seen in each diagnostic group. It was notable that performance on the delayed memory of the MoCA did not contribute significantly to the model’s ability to predict underlying amyloid status. It is likely that having only 5-items in the delayed memory task is neither sensitive nor specific for predicting amyloid accumulation in early stage AD.

One of the limitations of this study is that the samples utilized in this study are not demographically representative of the general population and thus not fully representative of populations who would participate in community screenings. In addition, it would be unlikely to know the APOE ε4 status of individuals participating in screening events. Diagnostic confirmation of AD is completed with expensive (amyloid PET) and invasive (CSF for amyloid beta) confirmatory studies and there are limited datasets that would allow us to confirm AD in large enough datasets to confirm diagnostic status. Future work should focus on developing ML algorithms using data collected from community samples to test whether these strategies are adequate to meet the challenge of the affordable and accurate diagnosis of individuals with early stage AD. In addition, the sample sizes in each subgroup are not large enough to conduct a three-fold separation into training30,33, testing, and validation data sets, as suggested by one of the reviewers. We consider this is an attractive approach to overcome the challenge of identifying an independent data set from another study as the validation data. Due to the lack of a validation data set, the presented performance matrix may be lower as the variations of data sets.

Current approaches to early identification of AD still rely on cost prohibitive, labor-intensive, and expensive diagnostic tests57. These diagnostic tests are typically only available in a limited number of tertiary care centers58. Furthermore, many psychometric tests used to support a diagnosis are available in only a limited number of languages and may not have options for hearing or visually impaired individuals59. This often means an AD diagnosis can be missed or delayed for years60. The impact is even more dramatic on the AD drug development pipeline which has seen significant bottlenecks in recruitment for early stage trials and studies composed of largely homogeneous populations61,62.

We proposed a new feature selection method based on the stochastic ordering of features within each subgroup. This new feature selection method reduces the computational intensity from exponential to linear, which makes the search for the optimal set of features computationally feasible10,56,63,64. The proposed stochastic ordering works very well in general. We did notice the issue of the optimal ML model for the SMC male group where the optimal model was achieved when all the features were included in the model. This was partially caused by the method to determine the feature ordering. For simplicity, the binary logistic regression with the AIC criteria was used for the stochastic ordering. For that subgroup (the SMC male), the final optimal ML method is a tree based method which could be very different from a logistic regression. We would consider this as future work to identity simple statistical models for feature ordering for each ML method.

Patients could be pre-screened with these tools and those that are most likely to have brain amyloid would undergo confirmatory testing with PET amyloid imaging or CSF studies. In conjunction with identifying those at a high risk of amyloid/tau pathology, we hypothesize that ML approaches will be able to estimate the likelihood of disease progression over a defined period. Enrolling patients with a high likelihood of progression will help reduce the chance of a failed trial due to lack of decline in the placebo group. AD and other neurodegenerative disorders cause characteristic patterns of cognitive decline that can be captured by neuropsychological assessments (e.g., the Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-Cog) or a Neuropsychological Test battery (NTB)). Using novel ML methods based on newly discovered risk factors and biomarkers (e.g., stroke, diabetes, and basal forebrain volume65,66) for cognitive decline, our research will increase the understanding of how newly discovered risk factors and biomarkers contribute to prediction of AD biomarkers.

As predicted, the present study showed that for women with no measurable memory deficits, only three cognitive test features, only one related to memory, were included in the optimal model for predicting presence of brain beta amyloid in women with SMC. In contrast, the model for SMC men included all input features. For women with EMCI, no cognitive tests were included as input features in the optimal model, with only age and APOE ε4 status as most useful in prediction of brain amyloid beta. The model for EMCI men featured only three cognitive test features, including delayed recall memory and object naming, deficits in which are typically thought of as hallmarks of early AD. This pattern again suggests that cognitive assessments may not predict presence of AD pathology in women as effectively as they do in men at early disease stages. In contrast, at the LMCI stage, a broader range of cognitive test scores were included as features in optimal model for women than the one for men. Recall and recognition memory scores were among those included. This finding is consistent with studies showing that women with brain beta amyloid decline cognitively more quickly than men—therefore, cognitive test scores would be expected to better differentiate amyloid positive vs. negative women than men.

The current approach differs from many machine learning analyses, which attempt to predict future cognitive decline using current biomarker status6769. Our analysis adds uniquely to the literature by showing that current cognitive status can accurately predict current amyloid status, particularly in men with SMC. This knowledge could be applied to improve the odds that clinical trials and research studies without access to amyloid PET imaging are including amyloid positive SMC men in their cohorts (i.e., those with preclinical AD), and excluding those with non-AD SMC. This would increase the power of such studies to find results relevant to early disease process in preclinical AD men. Unfortunately, our finding is also consistent with our and others’ prior work showing cognitive tests may not be sufficient to identify women with preclinical AD27,70. Practically, this means that without biomarker confirmation or more comprehensive cognitive assessment, women included in SMC groups in clinical trials and research may be more heterogeneous than SMC men. Such heterogeneity could lead to lack of effects or could underlie some findings of sex differences in Alzheimer’s disease.

Acknowledgements

The authors are very grateful to Editor, Associate Editor, and two reviewers for their insightful comments that help improve the manuscript significantly. Shan’s research is partially supported by grants from the National Institutes of Health: P20GM109025, R01AG070849, and R03CA248006.

Author contributions

G.S. ran the data analysis. All authors wrote and reviewed the manuscript.

Data availability

Data used in preparation of this article were obtained from the Alzheimer’s disease Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). Thus, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data, but did not participate in this analysis or the writing of this report. A complete listing of ADNI investigators can be found at its website.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Association A. 2020 Alzheimer’s disease facts and figures. Alzheimer’s Dementia. 2020;16(3):391–460. doi: 10.1002/alz.12068. [DOI] [Google Scholar]
  • 2.Global Action Against Dementia. G8 Dementia Summit Declaration (2013).
  • 3.Sperling RA, Aisen PS, Beckett LA, Bennett DA, Craft S, Fagan AM, et al. Toward defining the preclinical stages of Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s Dementia. 2011;7(3):280–292. doi: 10.1016/j.jalz.2011.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bruandet A, Richard F, Bombois S, Maurage CA, Deramecourt V, Lebert F, et al. Alzheimer disease with cerebrovascular disease and vascular dementia: Clinical features and course compared with Alzheimer disease. J. Neurol. Neurosurg. Psychiatry. 2009;80(2):133–139. doi: 10.1136/jnnp.2007.137851. [DOI] [PubMed] [Google Scholar]
  • 5.Rasmussen J, Langerman H. Alzheimer’s disease—Why we need early diagnosis. Degener. Neurol. Neuromuscul. Dis. 2019;9:123–130. doi: 10.2147/DNND.S228939. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Duboisa B, Padovanib A, Scheltensc P, Rossid A, Agnello GD. Timely diagnosis for Alzheimer’s disease: A literature review on benefits and challenges. J. Alzheimer’s Dis. 2015;49(3):617–631. doi: 10.3233/JAD-150692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Weimer DL, Sager MA. Early identification and treatment of Alzheimer’s disease: Social and fiscal outcomes. Alzheimer’s Dementia. 2009;5(3):215–226. doi: 10.1016/j.jalz.2009.01.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sevigny J, Chiao P, Bussière T, Weinreb PH, Williams L, Maier M, et al. The antibody aducanumab reduces Aβ plaques in Alzheimer’s disease. Nature. 2016;537(7618):50–56. doi: 10.1038/nature19323. [DOI] [PubMed] [Google Scholar]
  • 9.Murphy, MP. Amyloid-beta solubility in the treatment of Alzheimer’s disease. Massachussetts Medical Society (2018). [DOI] [PubMed]
  • 10.Shan, G. Exact Statistical Inference for Categorical Data. 1st ed (Academic Press, San Diego, 2015). http://www.worldcat.org/isbn/0081006810.
  • 11.Shan G, Wilding GE, Hutson AD, Gerstenberger S. Optimal adaptive two-stage designs for early phase II clinical trials. Stat. Med. 2016;35(8):1257–1266. doi: 10.1002/sim.6794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Nasreddine ZS, Phillips NA, Bédirian V, Charbonneau S, Whitehead V, Collin I, et al. The Montreal Cognitive Assessment, MoCA: A brief screening tool for mild cognitive impairment. J. Am. Geriatr. Soc. 2005;53(4):695–699. doi: 10.1111/j.1532-5415.2005.53221.x. [DOI] [PubMed] [Google Scholar]
  • 13.Blanco-Campal, A., Diaz-Orueta, U., Navarro-Prados, A. B., Burke, T., Libon, D. J. & Lamar, M. Features and psychometric properties of the Montreal Cognitive Assessment: Review and proposal of a process-based approach version (MoCA-PA). Appl. Neuropsychol. Adult. https://pubmed.ncbi.nlm.nih.gov/31718290/, 10.1080/23279095.2019.1681996 (2019). [DOI] [PubMed]
  • 14.Ritter A, Hawley N, Banks SJ, Miller JB. The association between Montreal cognitive assessment memory scores and hippocampal volume in a neurodegenerative disease sample. J. Alzheimer’s Dis. 2017;58(3):695–699. doi: 10.3233/JAD-161241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Shan G, Banks S, Miller JB, Ritter A, Bernick C, Lombardo J, et al. Statistical advances in clinical trials and clinical research. Alzheimer’s Dementia Transl. Res. Clin. Interv. 2018;4:366–371. doi: 10.1016/j.trci.2018.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Safieh M, Korczyn AD, Michaelson DM. ApoE4: An emerging therapeutic target for Alzheimer’s disease. BMC Med. 2019;17(1):1–17. doi: 10.1186/s12916-019-1299-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lin KA, Choudhury KR, Rathakrishnan BG, Marks DM, Petrella JR, Doraiswamy PM. Marked gender differences in progression of mild cognitive impairment over 8 years. Alzheimer’s Dementia Transl. Res. Clin. Interv. 2015;1(2):103–110. doi: 10.1016/j.trci.2015.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Pradier C, Sakarovitch C, Le Duff F, Layese R, Metelkina A, Anthony S, et al. The mini mental state examination at the time of Alzheimer’s disease and related disorders diagnosis, according to age, education, gender and place of residence: A cross-sectional study among the French National Alzheimer database. PLoS ONE. 2014;9(8):e103630. doi: 10.1371/journal.pone.0103630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Caldwell JZK, Berg JL, Cummings JL, Banks SJ. Moderating effects of sex on the impact of diagnosis and amyloid positivity on verbal memory and hippocampal volume. Alzheimer’s Res. Ther. 2017;9(1):72. doi: 10.1186/s13195-017-0300-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Caldwell JZK, Berg JLL, Shan G, Cummings JL, Banks SJ. Alzheimer’s disease neuroimaging initiative sex moderates the impact of diagnosis and amyloid PET positivity on hippocampal subfield volume. J. Alzheimer’s Dis. 2018;64(1):79–89. doi: 10.3233/JAD-180028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Shan G, Dodge-Francis C, Wilding GE. Exact unconditional tests for dichotomous data when comparing multiple treatments with a single control. Ther. Innov. Regul. Sci. 2020;54(2):411–417. doi: 10.1007/s43441-019-00070-w. [DOI] [PubMed] [Google Scholar]
  • 22.Sohn D, Shpanskaya K, Lucas JE, Petrella JR, Saykin AJ, Tanzi RE, et al. Sex differences in cognitive decline in subjects with high likelihood of mild cognitive impairment due to Alzheimer’s disease. Sci. Rep. 2018;8(1):1–9. doi: 10.1038/s41598-017-17765-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Caldwell JZK, Cummings JL, Banks SJ, Palmqvist S, Hansson O. Cognitively normal women with Alzheimer’s disease proteinopathy show relative preservation of memory but not of hippocampal volume. Alzheimer’s Res. Ther. 2019;11(1):109. doi: 10.1186/s13195-019-0565-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Sundermann EE, Katz MJ, Lipton RB. Sex differences in the relationship between depressive symptoms and risk of amnestic mild cognitive impairment. Am. J. Geriatr. Psychiatry. 2017;25(1):13–22. doi: 10.1016/j.jagp.2016.08.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sundermann EE, Tran M, Maki PM, Bondi MW. Sex differences in the association between apolipoprotein E ϵ4 allele and Alzheimer’s disease markers. Alzheimer’s Dementia Diagn. Assess. Disease Monit. 2018;10:438–447. doi: 10.1016/j.dadm.2018.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Brunet HE, Miller JB, Shi J, Chung B, Munter BT, Sabbagh MN. Does informant-based reporting of cognitive symptoms predict amyloid positivity on positron emission tomography? Alzheimer’s Dementia Diagn. Assess. Disease Monit. 2019;11:424–429. doi: 10.1016/j.dadm.2019.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Sundermann EE, Maki P, Biegon A, Lipton RB, Mielke MM, Machulda M, et al. Sex-specific norms for verbal memory tests may improve diagnostic accuracy of amnestic MCI. Neurology. 2019;93(20):E1881–E1889. doi: 10.1212/WNL.0000000000008467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Shan G, Ma C, Hutson AD, Wilding GE. Randomized two-stage phase II clinical trial designs based on Barnard’s exact test. J. Biopharm. Stat. 2013;23(5):1081–1090. doi: 10.1080/10543406.2013.813525. [DOI] [PubMed] [Google Scholar]
  • 29.Shan G, Ma C, Hutson AD, Wilding GE. An efficient and exact approach for detecting trends with binary endpoints. Stat. Med. 2012;31(2):155–164. doi: 10.1002/sim.4411. [DOI] [PubMed] [Google Scholar]
  • 30.Zhang, H. & Shan, G. Letter to the Editor: A novel confidence interval for a single proportion in the presence of clustered binary outcome data (SMMR, 2019). (SAGE Publications Ltd, 2020). [DOI] [PubMed]
  • 31.Zhang, H., Jiang, T. & Shan, G. Identification of hot spots in protein structures using Gaussian network model and Gaussian naive bayes. BioMed Res. Int. 4354901. 10.1155/2016/4354901 (2016). [DOI] [PMC free article] [PubMed]
  • 32.Zhang, H., Song, Y., Jiang, B., Chen, B. & Shan, G. Two-stage bagging pruning for reducing the ensemble size and improving the classification performance. Math. Probl. Eng.8906034. 10.1155/2019/8906034 (2019). [DOI]
  • 33.Shan G, Bayram E, Caldwell JZK, Miller JB, Shen JJ, Gerstenberger S. Partial correlation coefficient for a study with repeated measurements. Stat. Biopharm. Res. 2020;00:1–7. doi: 10.1080/19466315.2020.1784780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Weiner MW, Veitch DP, Aisen PS, Beckett LA, Cairns NJ, Cedarbaum J, et al. Impact of the Alzheimer’s disease neuroimaging initiative, 2004 to 2014. Alzheimer’s Dementia. 2015;11(7):865–884. doi: 10.1016/j.jalz.2015.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Jagust WJ, Bandy D, Chen K, Foster NL, Landau SM, Mathis CA, et al. The Alzheimer’s disease neuroimaging initiative positron emission tomography core. Alzheimer’s Dementia. 2010;6(3):221–229. doi: 10.1016/j.jalz.2010.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Aisen PS, Petersen RC, Donohue MC, Gamst A, Raman R, Thomas RG, et al. Clinical core of the Alzheimer’s disease neuroimaging initiative: Progress and plans. Alzheimer’s Dementia. 2010;6(3):239–246. doi: 10.1016/j.jalz.2010.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Landau SM, Breault C, Joshi AD, Pontecorvo M, Mathis CA, Jagust WJ, et al. Amyloid-β imaging with Pittsburgh compound B and florbetapir: Comparing radiotracers and quantification methods. J. Nucl. Med. 2013;54(1):70–77. doi: 10.2967/jnumed.112.109009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Landau SM, Fero A, Baker SL, Koeppe R, Mintun M, Chen K, et al. Measurement of longitudinal β-amyloid change with 18F-florbetapir PET and standardized uptake value ratios. J. Nucl. Med. 2015;56(4):567–574. doi: 10.2967/jnumed.114.148981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ba M, Ng KP, Gao X, Kong M, Guan L, Yu L. The combination of apolipoprotein E4, age and Alzheimer’s Disease Assessment Scale–Cognitive Subscale improves the prediction of amyloid positron emission tomography status in clinically diagnosed mild cognitive impairment. Eur. J. Neurol.. 2019;26(5):733–e53. doi: 10.1111/ene.13881. [DOI] [PubMed] [Google Scholar]
  • 40.Honea, RA., Vidoni, ED., Swerdlow, RH. & Burns, JM. Maternal family history is associated with Alzheimer’s disease biomarkers (IOS Press, 2012). https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608420/. [DOI] [PMC free article] [PubMed]
  • 41.Chiang GC, Cruz Hernandez JC, Kantarci K, Jack CR, Weiner MW. Cerebral microbleeds, CSF p-tau, and cognitive decline: Significance of anatomic distribution. Am. J. Neuroradiol. 2015;36(9):1635–1641. doi: 10.3174/ajnr.A4351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Grochowalski JH, Liu Y, Siedlecki KL. Examining the reliability of ADAS-Cog change scores. Aging Neuropsychol. Cogn. 2016;23(5):513–529. doi: 10.1080/13825585.2015.1127320. [DOI] [PubMed] [Google Scholar]
  • 43.Kuhn M. Building predictive models in R using the caret package. J. Stat. Softw. 2008;28(5):1–26. doi: 10.18637/jss.v028.i05. [DOI] [Google Scholar]
  • 44.Shan G, Wang W. ExactCIdiff: An R package for computing exact confidence intervals for the difference of two proportions. R J. 2013;5(2):62–71. doi: 10.32614/RJ-2013-026. [DOI] [Google Scholar]
  • 45.Shan G, Gerstenberger S. Fisher’s exact approach for post hoc analysis of a chi-squared test. PLoS ONE. 2017;12(12):e0188709. doi: 10.1371/journal.pone.0188709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Shan G, Amei A, Young D. Efficient noninferiority testing procedures for simultaneously assessing sensitivity and specificity of two diagnostic tests. Comput. Math. Methods Med. 2015;2015:128930. doi: 10.1155/2015/128930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Shan G, Wilding G. Unconditional tests for association in 2 * 2 contingency tables in the total sum fixed design. Statistica Neerlandica. 2015;69(1):67–83. doi: 10.1111/stan.12047. [DOI] [Google Scholar]
  • 48.Parikh R, Mathai A, Parikh S, Sekhar GC, Thomas R. Understanding and using sensitivity, specificity and predictive values. Indian J. Ophthalmol. 2008;56(1):45–50. doi: 10.4103/0301-4738.37595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Shan G. Accurate confidence intervals for proportion in studies with clustered binary outcome. Stat. Methods Med. Res. 2020;29(10):3006–3018. doi: 10.1177/0962280220913971. [DOI] [PubMed] [Google Scholar]
  • 50.Shan G. Exact confidence limits for the response rate in two-stage designs with over- or under-enrollment in the second stage. Stat. Methods Med. Res. 2018;27(4):1045–1055. doi: 10.1177/0962280216650918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bernick C, Cummings J, Raman R, Sun X, Aisen P. Age and rate of cognitive decline in Alzheimer disease: Implications for clinical trials. Arch. Neurol. 2012;69(7):901–905. doi: 10.1001/archneurol.2011.3758. [DOI] [PubMed] [Google Scholar]
  • 52.Cummings J, Fox N, Vellas B, Aisen P, Shan G. Biomarker and clinical trial design support for disease-modifying therapies: Report of a survey of the EU/US: Alzheimer’s Disease Task Force. J. Prev. Alzheimer’s Dis. 2018;5(2):103–109. doi: 10.14283/jpad.2018.13. [DOI] [PubMed] [Google Scholar]
  • 53.Shan G. Optimal two-stage designs based on restricted mean survival time for a single-arm study. Contemporary Clinical Trials Communications 100732 (2021). [DOI] [PMC free article] [PubMed]
  • 54.Shan G, Wang W. Advanced statistical methods and designs for clinical trials for COVID-19. Int. J. Antimicrob. Agents. 2021;57(1):106167. doi: 10.1016/j.ijantimicag.2020.106167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Koran MEI, Wagener M, Hohman TJ. Sex differences in the association between AD biomarkers and cognitive decline. Brain Imaging Behav. 2017;11(1):205–213. doi: 10.1007/s11682-016-9523-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Shan G. Exact confidence limits for the response rate in two-stage designs with over or under enrollment in the second stage. Stat. Methods Med. Res. 2018;27(4):1045–1055. doi: 10.1177/0962280216650918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Jedenius E, Wimo A, Strömqvist J, Jönsson L, Andreasen N. The cost of diagnosing dementia in a community setting. Int. J. Geriatr. Psychiatry. 2010;25(5):476–482. doi: 10.1002/gps.2365. [DOI] [PubMed] [Google Scholar]
  • 58.Waldemar G, Phung KTT, Burns A, Georges J, Hansen FR, Iliffe S, et al. Access to diagnostic evaluation and treatment for dementia in Europe. Int. J. Geriatr. Psychiatry. 2007;22:47–54. doi: 10.1002/gps.1652. [DOI] [PubMed] [Google Scholar]
  • 59.Hill-Briggs F, Dial JG, Morere DA, Joyce A. Neuropsychological assessment of persons with physical disability, visual impairment or blindness, and hearing impairment or deafness. Arch. Clin. Neuropsychol. 2007;22(3):389–404. doi: 10.1016/j.acn.2007.01.013. [DOI] [PubMed] [Google Scholar]
  • 60.Knopman D, Donohue JA, Gutterman EM. Patterns of care in the early stages of Alzheimer’s disease: Impediments to timely diagnosis. J. Am. Geriatr. Soc. 2000;48(3):300–304. doi: 10.1111/j.1532-5415.2000.tb02650.x. [DOI] [PubMed] [Google Scholar]
  • 61.Cummings J, Lee G, Ritter A, Sabbagh M, Zhong K. Alzheimer’s disease drug development pipeline: 2019. Alzheimer’s Dementia Transl. Res. Clin. Interv. 2019;5:272–293. doi: 10.1016/j.trci.2019.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Bernick C, Shan G, Zetterberg H, Banks S, Mishra VR, Bekris L, et al. Longitudinal change in regional brain volumes with exposure to repetitive head impacts. Neurology. 2020;94(3):e232–e240. doi: 10.1212/WNL.0000000000008817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Shan G, Zhang H, Jiang T, Peterson H, Young D, Ma C. Exact p-values for Simon’s two-stage designs in clinical trials. Stat. Biosci. 2016;8(2):351–357. doi: 10.1007/s12561-016-9152-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Shan G, Ma C. Unconditional tests for comparing two ordered multinomials. Stat. Methods Med. Res. 2016;25(1):241–254. doi: 10.1177/0962280212450957. [DOI] [PubMed] [Google Scholar]
  • 65.Casanova R, Saldana S, Lutz MW, Plassman BL, Kuchibhatla M, Hayden KM. Investigating predictors of cognitive decline using machine learning. J. Gerontol. Ser. B. 2020;75(4):733–742. doi: 10.1093/geronb/gby054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Teipel SJ, Cavedo E, Hampel H, Grothe MJ. Basal forebrain volume, but not hippocampal volume, is a predictor of global cognitive decline in patients with Alzheimer’s disease treated with cholinesterase inhibitors. Front. Neurol. 2018;9:642. doi: 10.3389/fneur.2018.00642/full. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Chételat G, La Joie R, Villain N, Perrotin A, De la Sayette V, Eustache F, et al. Amyloid imaging in cognitively normal individuals, at-risk populations and preclinical Alzheimer’s disease. NeuroImage Clin. 2013;2(1):356–365. doi: 10.1016/j.nicl.2013.02.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Hellwig S, Frings L, Bormann T, Vach W, Buchert R, Meyer PT. Amyloid imaging for differential diagnosis of dementia: Incremental value compared to clinical diagnosis and [18 F]FDG PET. Eur. J. Nucl. Med. Mol. Imaging. 2019;46(2):312–323. doi: 10.1007/s00259-018-4111-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Rice, L. & Bisdas, S. The Diagnostic Value of FDG and Amyloid PET in Alzheimer’s Disease—A Systematic Review (Elsevier Ireland Ltd, Amsterdam, 2017). [DOI] [PubMed]
  • 70.Brunet HE, Caldwell JZK, Brandt J, Miller JB. Influence of sex differences in interpreting learning and memory within a clinical sample of older adults. Aging Neuropsychol. Cogn. 2020;27(1):18–39. doi: 10.1080/13825585.2019.1566433. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data used in preparation of this article were obtained from the Alzheimer’s disease Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). Thus, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data, but did not participate in this analysis or the writing of this report. A complete listing of ADNI investigators can be found at its website.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES