Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2016 Oct 5;6:34567. doi: 10.1038/srep34567

COMPASS: A computational model to predict changes in MMSE scores 24-months after initial assessment of Alzheimer’s disease

Fan Zhu 1,*, Bharat Panwar 1,*, Hiroko H Dodge 2,3,*, Hongdong Li 1, Benjamin M Hampstead 4,5, Roger L Albin 2,6,7, Henry L Paulson 2, Yuanfang Guan 1,8,9,a
PMCID: PMC5050516  PMID: 27703197

Abstract

We present COMPASS, a COmputational Model to Predict the development of Alzheimer’s diSease Spectrum, to model Alzheimer’s disease (AD) progression. This was the best-performing method in recent crowdsourcing benchmark study, DREAM Alzheimer’s Disease Big Data challenge to predict changes in Mini-Mental State Examination (MMSE) scores over 24-months using standardized data. In the present study, we conducted three additional analyses beyond the DREAM challenge question to improve the clinical contribution of our approach, including: (1) adding pre-validated baseline cognitive composite scores of ADNI-MEM and ADNI-EF, (2) identifying subjects with significant declines in MMSE scores, and (3) incorporating SNPs of top 10 genes connected to APOE identified from functional-relationship network. For (1) above, we significantly improved predictive accuracy, especially for the Mild Cognitive Impairment (MCI) group. For (2), we achieved an area under ROC of 0.814 in predicting significant MMSE decline: our model has 100% precision at 5% recall, and 91% accuracy at 10% recall. For (3), “genetic only” model has Pearson’s correlation of 0.15 to predict progression in the MCI group. Even though addition of this limited genetic model to COMPASS did not improve prediction of progression of MCI group, the predictive ability of SNP information extended beyond well-known APOE allele.


Alzheimer’s disease (AD) is a chronic neurodegenerative disorder accounting for approximately 60% of dementia cases1. Approximately 21 million people worldwide have AD2, with approximately 2.4 million people affected in the United States3,4,5. AD is the 6th leading cause of death in the United States. The majority (96%) of individuals with AD are older than 65 years with approximately exponential increases in incidence and prevalence accompanying aging4. About 11% of people at age 65 are affected by AD, and prevalence increases to 30% to 50% among people age 85 or older6.

It is now widely accepted that AD is a multi-decade process with a long asymptomatic phase7. The spectrum of AD includes a period preceding overt dementia when affected persons experience identifiable cognitive deficits without substantial functional impairments; the phase of Mild Cognitive Impairment (MCI)8. It is also widely believed that effective preventive treatment will need to be provided at early phases of the AD spectrum9. Disease progression is slow during the earliest or preclinical stages of AD10. Early identification of those at risk for developing AD and accurate prediction of cognitive trajectories in individuals with MCI are particularly important as preventive therapies are sought for AD. The MCI phase is of particular interest because individuals in this phase are easier to identify than presymptomatic individuals and consequently a key population for natural history and intervention studies.

Several attempts have been made to use clinical and biomarker data to predict progression of MCI to AD. Kovacevic et al. developed an automated method that measured segmental Medial temporal lobe (MTL) volumes to predict clinical decline in MCI patients11. Davatzikos et al. predicted MCI to AD conversion by combining information from MRI, CSF biomarkers, and pattern classification12. Zhang et al. predicted progression of MCI patients using longitudinal and multimodal biomarkers13. While the hazard levels of clinical parameters were evaluated in previous studies, accurate models designed to identify those at risk of developing AD are not available. Prior prediction methods also used different datasets, making it difficult to compare predictions. Use of multimodal biomarkers may not be feasible for clinical practice and is inconvenient for clinical research.

It was on this background of prior studies that the Dialogue for Reverse Engineering And Methods (DREAM) Alzheimer’s Disease Big Data challenge14 was organized to identify efficient and comparable methods using a ‘wisdom of the crowd’ approach. In this Challenge competition participants were asked to come up with an algorithm which predicts the decline in Mini-Mental State Examination (MMSE) scores over 24 months using a pre-specified data set. The MMSE is a 30-point questionnaire used widely in clinical and research settings to measure cognitive status in dementias15,16. Significant community-wide efforts17 are devoted to developing methods to predict significant declines in MMSE scores18,19,20. As part of the DREAM Challenge, we developed COmputational Model to Predict the development of Alzheimer’s diSease Spectrum (COMPASS) to predict significant declines in MMSE score (delta MMSE) 24 months after initial assessments. In this blinded, crowdsourcing competition, our method outperformed other participating methods21. The short computational time of this method, requiring only a few seconds to establish the model and make predictions, makes it good candidate for clinical research use and for potential clinical applications.

In an effort to improve COMPASS and to explore personalized predictions, we further included cognitive composite scores as predictors of MMSE declines, and evaluated the feasibility of using genetic information other than APOE to predict delta MMSE in the MCI population.

Materials and Methods

Data acquisition

Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI is to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD).

For the current study, we used ADNI-1 data downloaded on February 10th, 2015. ADNI-1 general eligibility criteria are described at ADNI website. Normal subjects had MMSE scores between 24 and 30 (inclusive), Clinical Dementia Ratings (CDR) of 0, and did not meet criteria for MCI or AD. MCI subjects had MMSE scores between 24 and 30 (inclusive), a memory complaint, objective memory loss measured by education adjusted scores on the Wechsler Memory Scale Logical Memory II test22, CDRs of 0.523, absence of significant levels of impairment in other cognitive domains, essentially preserved activities of daily living, and absence of dementia. Mild AD subjects had MMSE scores between 20–26 (inclusive), CDRs of 0.5 or 1.0, and met NINCDS/ADRDA criteria for probable AD24.

We downloaded the following information: age, gender, education, Apolipoprotein E (APOE) genotype, neuropsychological tests and validated composite scores based on the results of neuropsychological test results (ADNI-MEM for memory and ADNI-EF for executive functions) at baseline, and the MMSE scores at baseline and at the 24-months follow up assessments. ADNI-MEM and ADNI-EF are composite measures of memory and executive function, respectively, derived from the ADNI neuropsychological battery25,26. Since we were also interested in exploring predictive genetic information, we limited data to individuals for whom Single Nucleotide Polymorphism (SNP) data were available. Of the 489 individuals in this dataset, 152 of them are cognitively normal (CN), 230 of them are MCI, and 107 of them are AD patients at baseline (Table 1).

Table 1. Data Summary.

  CN MCI AD All
#Samples 152 230 107 489
Age 76.1 75.0 75.2 75.4
Gender (male) 56.5% 66.1% 50.0% 59.7%
Education (years) 16.3 15.8 14.7 15.7
MMSE at baseline 29.2 27.1 23.5 27.0
MMSE at 24 months 29.1 25.3 18.9 25.2
#ε2ε3 18 (11.8%) 10 (4.3%) 2 (1.9%) 30 (6.1%)
#ε2ε4 2 (1.3%) 7 (3.0%) 2 (1.9%) 11 (2.2%)
#ε3ε3 94 (61.8%) 91 (39.6%) 28 (26.1%) 233 (43.6%)
#ε3ε4 35 (23.0%) 89 (38.7%) 54 (50.1%) 178 (36.4%)
#ε4ε4 3 (2.0%) 33 (14.3%) 21 (19.6%) 57 (11.7%)
ADNI-MEM 0.996 −0.058 −0.889 0.0876
ADNI-EF 0.743 0.0434 −0.905 0.0532
#Significantly declined (MMSE drops >= 3) 2 (1.3%) 77 (33.5%) 60 (56%) 139 (28.4%)

This table shows features and statistics of the ADNI samples used in this study.

Data pre-processing

We reformed APOE genotype as two features: the number of APOE2 (rs429358, *the number of APOE2 is 2 minus the frequency of rs429358) and the number of APOE4 (rs7412). For genetic features, the dosage of each SNP is calculated from their genotypes as 2 × AA + 1 × AB + 0 × BB.

Individuals were split into three groups (CN, MCI and AD) based on their diagnosis. Within each group, all features are then scaled to [0, 1], including genetic features.

Genetic Data Retrieval

To explore personalized predictions, we made use of patients’ SNP data in our genetic model. Besides the well-known APOE bio-marker27, we are also interested in exploring how other genetic information could contribute to our method. To select the SNPs features, we built a brain-specific functional relationship network by integrating multi-species genomic data using Bayesian network28,29,30,31. In this network, each node represents a gene and an edge indicates the probability that two genes participate in the same biological process or a common pathway. We identified 10 genes closely functionally related to APOE, including PRNP (prion protein), APOA1 (apolipoprotein A-I), APP (amyloid beta precursor protein), GRID2 (glutamate receptor, ionotropic, delta 2), MT1H (metallothionein 1H), PSAP (prosaposin), MT1E (metallothionein 1E), GJA1 (gap junction protein, alpha 1, 43 kDa), EDNRB (endothelin receptor type B) and CBS (cystathionine-beta-synthase). We then queried these genes using NCBI dbSNP32 and obtained 13,347 associated SNPs (Supplementary Dataset S1). As we explored the contributions of other genetic information to our prediction method, APOE genotype was excluded as it is already a well-known bio-marker.

Implementation

Within each group, we used multiple learning methods to model delta MMSE. Our method is implemented using Support Vector Machine (SVM) light33 with linear and Radial Basis Function (RBF) kernel functions. SVM is a maximal margin classifier that maximize the distance of the nearest correctly classified examples to optimize classification performance. SVM aims at minimizing the cost function:

graphic file with name srep34567-m1.jpg

where w defines the plane that separates the positive and negative examples, ζ represents the degree of misclassification for each sample, C is a constant which is empirically optimized.

SVM can locate hidden disease development patterns through only a few samples in a training dataset and SVM may automatically identify disease development patterns whose combination maximally separates individuals developing into AD from controls.

Linear kernel is widely used and the simplest kernel function, defined as:

graphic file with name srep34567-m2.jpg

where xTy is the inner product between samples x and y, c is an optional constant parameter.

RBF kernel is a popular kernel function in learning algorithms. RBF kernel calculates the distance between x and y by:

graphic file with name srep34567-m3.jpg

where ||x − y|| is the Euclidean distance between the feature vectors of training samples. γ is a free parameter determined using cross-validation.

Our method to predict individuals meeting the delta MMSE criterion (discussed later) in the all three groups (CN, MCI, and AD), we tried SVM with both linear kernel and RBF kernel. The trade-off parameter c and gamma (γ) were optimized using computational cross-validation. Age, gender, education, baseline MMSE evaluation, and APOE genotype were used as the input features for SVM. We evaluated predictive accuracy with and without baseline ADNI-MEM and ADNI-EF.

Comparison Methods

The comparison methods, i.e. Gaussian process regression (noise = 1, γ = 2)34,35, linear regression, RBF Network, support vector machine for regression (or SMOreg, c = 1.0, E = 1, 2, or 3), decision tree, are implemented using WEKA36. Gaussian process regression, or Kriging, is a regression method based on Gaussian process governed by covariance function. RBF network, or radial basis function network, is an artificial neural network that uses RBF functions as activation functions. SMOreg is the support vector machine for regression implemented by WEKA, where the polynomial kernel is used and E parameter of indicates its exponent value.

As described above, individuals were separated into three groups based on their baseline diagnoses of CN MCI and AD. Separate models were trained for each group (Supplementary Info S2).

Significant Decline in MMSE

Predicting declines in MMSE score as continuous variables from baseline to 24 months using ADNI data was a DREAM Challenge Question14. In the current study, we further identified those with a significant decline in MMSE. We defined significant delta MMSE as a decline of 3 points, as indicated by prior research18,37. Predictive accuracy was evaluated using two criteria: the magnitudes of Pearson and Spearman’s correlation coefficients. We used both because both are widely used measures with somewhat different but relevant properties. The Pearson correlation coefficient is the standard method for parametric data while Spearman’s correlation coefficient is the preferred method for non-parametric data and more robust with respect to the effects of outliers. The correlation coefficients between the observed changes in observed delta MMSE scores and predicted delta MMSE were calculated. Predictive ability of our model in identifying those with significant MMSE decline was estimated by Receiver Operating Characteristics Area Under Curve (ROC AUC).

Results

We developed customized kernel-based models to predict delta MMSE. Because of differences between AD, Mild Cognitive Impairment (MCI) and Cognitive Normal (CN) groups, we developed separate models to predict delta MMSE for each group. We also evaluated accuracy when combining individuals from all three groups.

Clinical Data Summary

489 individuals from ADNI-1 study were used in our analysis, including 152 CN, 230 MCI and 107 AD. As shown in Table 1, average MMSE scores were 27.0 at baseline and 25.2 at the 24-month assessments. The average age was 75.4 years, and average year of education was 15.7 where 59.7% were male. There were 2 (out of 152) significantly declining individuals in the CN group, 77 (out of 230) in the MCI group, and 60 (out of 107) in the AD group, for a total number of 139 (out of 489) significantly declining individuals.

MMSE changes from baseline to 24-months

Figure 1 is a workflow figure indicating how we established the COMPASS model to predict delta MMSE from baseline to the 24 month assessment. We divided individuals into three diagnosis groups and different models were applied to each group.

Figure 1. Workflow of our method.

Figure 1

This figure shows the overview of our method. Individuals are separated into three groups based on their diagnosis and separate models are applied to each group. Additionally, to explore the predictive power of new genetic information, genetic features are extracted from SNP data as the input of the integrative model to predict the development of MMSE for MCI individuals.

(a) COMPASS predictive accuracy without ADNI-MEM and ADNI-EF

We performed computational cross-validation to evaluate the predictive accuracy of COMPASS. For each diagnosis group and parameter, correlation coefficients between predicted and observed changes in MMSE scores are calculated by repeating ten-fold cross-validation 10 times. COMPASS is implemented based on SVM light33. The comparison methods are evaluated using WEKA, where 10 fold cross-validation is also repeated 10 times.

Figure 2 shows the performance comparisons between our method and other popular machine learning methods including linear regression, RBF network, support vector machine, decision tree, and Gaussian process regression.

Figure 2. The performance of COMPASS vs. other methods (without ADNI-MEM and ADNI-EF).

Figure 2

The performance of all methods is evaluated by repeating 10-fold cross-validation 10 times. Gaussian process regression, or Kriging, is a regression method based on Gaussian process governed by covariance function. RBF network, or radial basis function network, is an artificial neural network that uses RBF functions as activation functions. SMOreg is the support vector machine for regression implemented by WEKA, where the polynomial kernel is used and parameter E indicates its exponent value.

Using COMPASS, the Pearson (Spearman’s) correlation between predicted and observed delta MMSE was 0.579 (0.469) for the CN group and 0.306 (0.30) for the AD group. MCI group predictions based on SVM provided the lowest correlation with a correlation of 0.092 and 0.109 for Pearson correlation and Spearman’s correlation, respectively. The WEKA based RBF Network algorithm performed relatively better for MCI group and provided 0.128 Pearson correlations. We tried both SVM linear and RBF kernel but linear kernel performed better for all three groups (CN, MCI, and AD).

Parameter optimization performed in COMPASS might improve predictive accuracy. Accordingly, we performed computational cross-validation to select model parameters. Figure 3 shows the performance improvement from our parameter adjustment for the different groups. As shown in Fig. 3, the c parameter is optimized to 100 for the all groups. (The results shown in Fig. 2 were obtained using the optimal parameters selected).

Figure 3. Cross validations for different diagnosis groups without ADNI-MEM and ADNI-EF.

Figure 3

This figure shows the performance of COMPASS method measured by repeating 10-fold computational cross validation 10 times. Subfigures (ac) are the results measured by Pearson correlation coefficient between the observed and predicted MMSE scores under different parameters. Subfigures (df) are the results measured by Spearman’s correlation coefficient.

(b) Predicting accuracy of COMPASS with ADNI-MEM and ADNI-EF

Two additional baseline values were obtained from ADNI-1 data: composite scores of memory (ADNI-MEM)20,38 and executive function (ADNI-EF)25,26. These two scores are psychometrically optimized scores derived from ADNI tests and externally validated26,39.

The feasibility of using baseline ADNI-MEM and ADNI-EF values to predict delta MMSE over 24-months was evaluated. Although ADNI-MEM and ADNI-EF are available at both first screening and 24-month follow up from the ADNI database, we only used the first assessment information.

Figure 4 shows the performance comparison between our method and other popular machine learning methods. For the CN group, the improvement brought by ADNI-MEM and ADNI-EF is minor and SVM with linear kernel has a Pearson’s correlation around 0.595 between predicted and observed changes in MMSE. For the MCI group, introducing ADNI-MEM and ADNI-EF significantly boosted predictive accuracy. The Pearson’s correlation coefficient between observed and predicted changes in MMSE is improved from 0.128 to 0.442. Among all the compared methods, Gaussian process regression (GPR) also delivers results with accuracy similar to our method. For the AD group, the performance improvement by adding the two composite scores was also dramatic; the Pearson’s correlation coefficient almost doubled from 0.306 to 0.690.

Figure 4. The performance of COMPASS vs. other methods (with ADNI-MEM and ADNI-EF).

Figure 4

This figure shows the performance comparison of our method and other popular machine learning methods when the two ADNI cognitive composite scores are included. The performance of all methods is evaluated by repeating 10-fold cross-validation 10 times. The methods used here are the same as in Fig. 2.

Figure 5 shows how parameter optimization improves the accuracy. The optimal c parameters for CN, MCI, and AD group are 0.1, 10, and 100, respectively. The performance significantly improved with this parameter optimization.

Figure 5. Cross validations for different diagnosis groups with ADNI-MEM and ADNI-EF.

Figure 5

This figure shows the performance of our method measured by repeating 10-fold computational cross validation 10 times. Subfigures (ac) are the results measured by Pearson correlation coefficient between the observed and predicted MMSE scores under different parameters. Subfigures (df) are the results measured by Spearman’s correlation coefficient.

(c) Predictions across all three groups

Figure 6 shows the overall performance for all individuals. Without two ADNI cognitive composite scores, the overall correlation is around 0.40 for Pearson and Spearman’s correlation coefficient. With ADNI-MEM and ADNI-EF, the accuracy is improved to approximately 0.45 for Pearson and Spearman’s correlation coefficients under default parameter. With parameter optimization, the predicting accuracy is further improved to 0.65/0.52 for Pearson/Spearman’s correlation.

Figure 6. Cross validations across all three groups.

Figure 6

This figure shows the cross validation performance of COMPASS across all three groups; ten-fold cross validation was repeated 100 times.

Predicting a large decline in MMSE within 24-months as measured by ROC and precision recall curves

MCI subjects vary widely in their rate of cognitive decline40, with some remaining as MCI, others transitioning to AD, and others reverting to normal. We measured how well our model could identify MCI individuals experiencing a clinically meaningful decline, defined as a MMSE decline of three or more points (77 individuals out of 230, about one third of the top decliners)41,42.

Figure 7 shows the receiver operating characteristic (ROC) and precision recall curves. COMPASS delivered results with an area under ROC (AUROC) of 0.814 and an area under precision recall curve (AUPRC) as high as 0.666, compared with the 0.5 AUROC and 0.33 AUPRC for random prediction. The false discovery rate (FDR) is 0%, 9%, 20% at the 5%, 10%, 20% true positive rates (TPR or recall), respectively. In other words, the top 5% of significantly declining individuals predicted using our method is 100% correct; and our method maintains 80% accuracy for its top 20% predictions.

Figure 7. Prediction of MMSE decline of 3 points or more over 24-months in ADNI.

Figure 7

This figure shows the ROC and precision recall curves for models with and without ADNI composite scores to predict those with MMSE declines of 3 or more points at 24-months from baseline.

Genetic model for MCI group

We introduced a genetic model to explore how genetic information could be used to predict delta MMSE in the MCI group. To select the genetic features for the integrative model, we queried genes that are closely related to APOE based on our gene co-functionality network28,29,30 and found the corresponding SNPs for the 10 most related genes (see Methods: Genetic data retrieving).

As APOE is a well-known genetic risk factor of AD, we employed it as a seed gene but excluded it from the genetic model. We built a brain-specific functional relationship network by integrating multi-species genomic data using a Bayesian network followed in our previous work28. This network indicates the gene pairs that are functionally related in brain context. We then identified the 10 most related genes with APOE from the network, and collected a list of SNPs corresponding to these 10 genes from the dbSNP database32. 13,347 SNPs were selected as the input features of our genetic model. Figure 8 shows the top 10 predicted neighbors of APOE. We performed pathway enrichment using commercial software, QIAGEN’S Ingenuity Pathway Analysis (IPA), for APOE and its 10 neighbors. The top enriched canonical pathways include Reelin Signaling in Neurons (p = 8.2E-04) and Atherosclerosis Signaling (p = 2.1E-03). Among the top disease families and biological functions enriched for these genes are neurological disease (p = 1.1E-03).

Figure 8. 10 top connected genes of APOE.

Figure 8

We used our co-functionality network to identify 10 neighbors of APOE, and extract the SNPs that correspond to these genes. These SNPs are then used as the input feature of our genetic model. The two sub tables show the top canonical pathways and top disease and bio functions of the 11 genes generated using Ingenuity IPA. These eleven genes are enriched to AD related pathways (Reelin Signaling in Neurons and Atherosclerosis Signaling) as well as neurological disease.

This genetic only model is able to deliver results with an accuracy of 0.15 (0.12) in terms of Pearson (Spearman’s) correlation coefficients, compared with the 0.13 (0.13) correlation for clinical only model (which includes APOE genotype) without ADNI composite memory and executive scores. This indicates that the 10 most connected neighbors of APOE (or other genes of interest in AD and other neurodegenerative diseases) could potentially be used as biomarkers, and that genome-wide association studies (GWAS) could provide good candidates to predict individuals’ delta MMSE. Adding the genetic model on top of our clinical model (with two ADNI cognitive composite scores) did not, however, further improve accuracy. We also applied the genetic only model to CN and AD groups. However, the results are close to random (correlation coefficient < 0.05).

DREAM’s validation of methods on independent dataset

COMPASS was the best-performing approach in the DREAM21 Alzheimer’s Disease Big Data Challenge43. In the DREAM AD challenge, a blind assessment was made on data from the Religious Orders Study44 and the Memory and Aging Project45. That is, each competitor submitted their algorithm derived from ADNI data and DREAM competition organizers applied the models to the above cohort data (two cohort combined). Overall, 587 individuals (407 CN, 146 MCI, 34 AD) were used in the evaluation. MMSE was provided for the baseline, but not for the 24-month screening. Each participating group in the competition predicted MMSE scores at the 24-months using their best performing methods. Competition organizers checked the correlations between their data and the predicted scores provided by participating groups. 525 people across the world signed up for the challenge and 32 teams completed the challenge. Our proposed method was the best performing method in this challenge with correlation coefficients between predicted and observed MMSE scores of 0.382 and 0.433 for Pearson correlation and Spearman’s correlation, respectively14. This method was also the best performing method in a 100,000 bootstrapped assessment.

Computational Time

COMPASS is computationally efficient. The clinical model of COMPASS takes less than one second to train and to make predictions on a common commercial computer (Dell PowerEdge R410). The running time of genetic and clinical combined model is less than one minute.

Discussion

Predicting progression of cognitive impairment in individuals along the AD spectrum is important. While significant efforts have been devoted to the identification of hazard levels of clinical parameters, sophisticated models to accurately predict changes in cognitive status are not available. We used ADNI data to develop and evaluate a computational model, COMPASS, to predict significant changes in MMSE scores 24-months after initial assessment. Additional analyses beyond the question set by the DREAM AD challenge were conducted in the current study.

Without including the two ADNI cognitive composite scores, the Pearson (Spearman’s) correlations provided by COMPASS between observed and predicted values for changes in MMSE scores were 0.58 (0.47), 0.12 (0.13) and 0.31 (0.30) for CN, MCI and AD groups, respectively. Correlations were significantly improved by adding the cognitive composite scores at baseline, ADNI-MEM and ADNI-EF. The Pearson (Spearman’s) correlations were improved to 0.59 (0.50), 0.44 (0.54) and 0.69 (0.66) for CN, MCI and AD groups, respectively. The results indicate that these two ADNI measurements are highly predictive for MCI and AD groups, as their inclusion improves the performance significantly. On the other hand, the correlation is still in a moderate range, suggesting that using only clinical variables together with APOE information has a limitation in predicting changes in MMSE within 24 months. This could reflect the variability associated with MMSE test scores. For example, changes in MMSE by 1 point could be simply random variations associated with the test. When the outcome was decline in MMSE by 3 points, which is regarded as a clinically meaningful change, then COMPASS was able to deliver results with an area under ROC of detecting this outcome of 0.814. As indicated by precision recall curve, the top 5% of significantly declined individuals predicted using our method is 100% correct (i.e. 0% FDR), while the method remains 91% accurate for its top 10% recall (or TPR). The high accuracy in predicting individuals who most likely will experience significant decline in cognitive status makes COMPASS a strong candidate to assist in the identification of individuals progressing along the AD spectrum, and a potentially useful tool for enriching clinical trials.

The effectiveness of COMPASS was tested in the recent DREAM9 AD Big Data challenge, and found to be the best performing method (judged by both the Pearson and Spearman’s correlation between observed and predicted MMSE in an unseen dataset from Religious Orders Study and the Memory and Aging Project)46. The success of COMPASS in the DREAM challenge indicates that it is likely to work accurately in other datasets.

A first generation, limited “genetic only” model was developed to measure the predictive ability of SNP information beyond the well-known APOE allele effects. 13,347 SNP features are selected using our previously established co-functional networks. Our genetic model delivered results with an accuracy of 0.15 and 0.12 in terms of Pearson and Spearman’s correlation coefficients. However, adding this limited genetic model to our clinical model did not improve the accuracy. Possible reasons include: (a) APOE effects are relatively powerful and were properly excluded from this analysis; and (b) in a complex, polygenic phenotype, a few loci are not likely to be more informative than phenotypic markers like ADNI-MEM or ADNI-EF that reflect the cumulative effect of many genes/alleles as well as individual specific cognitive reserve47. It is worth investigating the interdependence between these genetic features and clinical features in future studies. More robust genetic models covering more pathogenic pathways may be good candidates to further improve COMPASS. Finally, it may be useful to include vascular disease factors in the model to enhance the prediction algorithm. Future studies will consider including these factors in the model.

COMPASS is efficient in terms of computational cost, which makes it highly scalable. It is ready to be released for the general research/clinical community. In its present form, COMPASS will allow clinical researchers to forecast significant changes in MMSE scores of potential research subjects in real time. This may be useful for selecting or stratifying subjects in clinical trials. With extensions, it may find a role in clinical practice.

Additional Information

How to cite this article: Zhu, F. et al. COMPASS: A computational model to predict changes in MMSE scores 24-months after initial assessment of Alzheimer’s disease. Sci. Rep. 6, 34567; doi: 10.1038/srep34567 (2016).

Supplementary Material

Supplementary Dataset S1
srep34567-s1.xls (672KB, xls)
Supplementary Info S2
srep34567-s2.doc (23KB, doc)

Acknowledgments

This work is supported by NSF 1452656, Alzheimer’s Foundation (BAND-15-367116), and Michigan Alzheimer’s Disease Core Center (P30AG053760). Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

Footnotes

Author Contributions F.Z. and Y.G. designed and carried out the experiments. F.Z., B.P., H.H.D., H.L., B.M.H., R.L.A., H.L.P. and Y.G. analyzed the results. F.Z. and Y.G. wrote the manuscript, with editing and revisions provided by all coauthors.

References

  1. Burns A. Alzheimer’s disease: on the verges of treatment and prevention. Lancet. Neurol. 8, 4–5 (2009). [DOI] [PubMed] [Google Scholar]
  2. Querfurth H. W. & LaFerla F. M. Alzheimer’s disease. N. Engl. J. Med. 362, 329–44 (2010). [DOI] [PubMed] [Google Scholar]
  3. Hebert L. E., Weuve J., Scherr P. A. & Evans D. A. Alzheimer disease in the United States (2010–2050) estimated using the 2010 census. Neurology 80, 1778–83 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Alzheimer’s Association. Alzheimer’s disease facts and figures. Alzheimers. Dement. 11, 332–84 (2015). [DOI] [PubMed] [Google Scholar]
  5. Prince M. et al. The global prevalence of dementia: A systematic review and metaanalysis. Alzheimer’s Dement. 9, 63–75.e2 (2013). [DOI] [PubMed] [Google Scholar]
  6. Wimo A., Ljunggren G. & Winblad B. Costs of dementia and dementia care: a review. Int. J. Geriatr. Psychiatry 12, 841–56 (1997). [PubMed] [Google Scholar]
  7. Dubois B. et al. Preclinical Alzheimer’s disease: Definition, natural history, and diagnostic criteria. Alzheimer’s Dement. 12, 292–323 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Gauthier S. et al. Mild cognitive impairment. Lancet 367, 1262–1270 (2006). [DOI] [PubMed] [Google Scholar]
  9. Jedynak B. M. et al. A computational neurodegenerative disease progression score: method and results with the Alzheimer’s disease Neuroimaging Initiative cohort. Neuroimage 63, 1478–86 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Sperling R. A. et al. Toward defining the preclinical stages of Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers. Dement. 7, 280–92 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Kovacevic S., Rafii M. S., Brewer J. B., Alzheimer’s Disease Neuroimaging Initiative. High-throughput, fully automated volumetry for prediction of MMSE and CDR decline in mild cognitive impairment. Alzheimer Dis. Assoc. Disord. 23, 139–45 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Davatzikos C., Bhatt P., Shaw L. M., Batmanghelich K. N. & Trojanowski J. Q. Prediction of MCI to AD conversion, via MRI, CSF biomarkers, and pattern classification. Neurobiol. Aging 32, 2322.e19–27 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Zhang D. et al. Predicting Future Clinical Changes of MCI Patients Using Longitudinal and Multimodal Biomarkers. Plos One 7, e33182 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Allen G. I. et al. Crowdsourced estimation of cognitive decline and resilience in Alzheimer’s disease. Alzheimers. Dement. 12, 645–53 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Rogers J. et al. Clinical trial of indomethacin in Alzheimer’s disease. Neurology 43, 1609–11 (1993). [DOI] [PubMed] [Google Scholar]
  16. Mega M. S., Cummings J. L., Fiorello T. & Gornbein J. The spectrum of behavioral changes in Alzheimer’s disease. Neurology 46, 130–5 (1996). [DOI] [PubMed] [Google Scholar]
  17. Peters M. Alzheimer’s Disease Big Data DREAM Challenge 1. 10/01/2016, at https://www.synapse.org/#!Synapse:syn2290704/wiki/60828 (2016).
  18. Hensel A., Angermeyer M. C. & Riedel-Heller S. G. Measuring cognitive change in older adults: reliable change indices for the Mini-Mental State Examination. J. Neurol. Neurosurg. Psychiatry 78, 1298–1303 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Tapiola T. et al. Three-year follow-up of cerebrospinal fluid tau, beta-amyloid 42 and 40 concentrations in Alzheimer’s disease. Neurosci. Lett. 280, 119–22 (2000). [DOI] [PubMed] [Google Scholar]
  20. Dodge H. H. et al. Biomarker progressions explain higher variability in stage-specific cognitive decline than baseline values in Alzheimer disease. Alzheimer’s Dement. 10, 690–703 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Stolovitzky G., Monroe D. & Califano A. Dialogue on reverse-engineering assessment and methods: the DREAM of high-throughput pathway inference. Ann. N. Y. Acad. Sci. 1115, 1–22 (2007). [DOI] [PubMed] [Google Scholar]
  22. Wechsler D. Wechsler memory scale (WMS-III). (Psychological Corporation San Antonio, TX, 1997). [Google Scholar]
  23. Morris J. C. et al. Clinical dementia rating training and reliability in multicenter studies: the Alzheimer’s Disease Cooperative Study experience. Neurology 48, 1508–10 (1997). [DOI] [PubMed] [Google Scholar]
  24. Dubois B. et al. Research criteria for the diagnosis of Alzheimer’s disease: revising the NINCDS-ADRDA criteria. Lancet. Neurol. 6, 734–46 (2007). [DOI] [PubMed] [Google Scholar]
  25. Nho K. et al. Voxel and surface-based topography of memory and executive deficits in mild cognitive impairment and Alzheimer’s disease. Brain Imaging Behav. 6, 551–67 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Gibbons L. E. et al. A composite score for executive functioning, validated in Alzheimer’s Disease Neuroimaging Initiative (ADNI) participants with baseline mild cognitive impairment. Brain Imaging Behav. 6, 517–27 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Dodge H. H., Chang C.-C. H., Kamboh I. M. & Ganguli M. Risk of Alzheimer’s disease incidence attributable to vascular disease in the population. Alzheimers. Dement. 7, 356–60 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Zhu F. et al. Modeling dynamic functional relationship networks and application to ex vivo human erythroid differentiation. Bioinformatics 30, 3325–33 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Guan Y. et al. Tissue-specific functional networks for prioritizing phenotype and disease genes. PLoS Comput. Biol. 8, e1002694 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Zhu F., Shi L., Engel J. D. & Guan Y. Regulatory network inferred using expression data of small sample size: application and validation in erythroid system. Bioinformatics 31, 2537–44 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Guan Y. et al. A genomewide functional network for the laboratory mouse. Plos Comput. Biol. 4, e1000165 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Smigielski E. M., Sirotkin K., Ward M. & Sherry S. T. dbSNP: a database of single nucleotide polymorphisms. Nucleic Acids Res. 28, 352–5 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Joachims T. Making large-Scale SVM Learning Practical. Adv. Kernel Methods Support Vector Learn, 169–184, 10.1109/ICEMI.2009.5274151 (1999). [DOI] [Google Scholar]
  34. Zhu F., Carpenter T., Rodriguez Gonzalez D., Atkinson M. & Wardlaw J. Computed tomography perfusion imaging denoising using gaussian process regression. Phys. Med. Biol. 57, N183–98 (2012). [DOI] [PubMed] [Google Scholar]
  35. Zhu Fan, Rodriguez Gonzalez D., Carpenter T., Atkinson M. & Wardlaw J. Lesion area detection using source image correlation coefficient for CT perfusion imaging. IEEE J. Biomed. Heal. informatics 17, 950–8 (2013). [DOI] [PubMed] [Google Scholar]
  36. Hall M. et al. The WEKA data mining software: an update. SIGKDD Explor. 11, 10–18 (2009). [Google Scholar]
  37. Stein J. et al. Assessing cognitive changes in the elderly: Reliable Change Indices for the Mini-Mental State Examination. Acta Psychiatr. Scand. 126, 208–218 (2012). [DOI] [PubMed] [Google Scholar]
  38. Crane P. K. et al. Development and assessment of a composite score for memory in the Alzheimer’s Disease Neuroimaging Initiative (ADNI). Brain Imaging Behav. 6, 502–16 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Habeck C. et al. Relationship between baseline brain metabolism measured using [18F]FDG PET and memory and executive function in prodromal and early Alzheimer’s disease. Brain Imaging Behav. 6, 568–83 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Ritchie K., Artero S. & Touchon J. Classification criteria for mild cognitive impairment: a population-based validation study. Neurology 56, 37–42 (2001). [DOI] [PubMed] [Google Scholar]
  41. Peters R. et al. Incident dementia and blood pressure lowering in the Hypertension in the Very Elderly Trial cognitive function assessment (HYVET-COG): a double-blind, placebo controlled trial. Lancet. Neurol. 7, 683–9 (2008). [DOI] [PubMed] [Google Scholar]
  42. Paterniti S., Verdier-Taillefer M.-H., Dufouil C. & Alpérovitch A. Depressive symptoms and cognitive decline in elderly people. Longitudinal study. Br. J. Psychiatry 181, 406–10 (2002). [DOI] [PubMed] [Google Scholar]
  43. Peters M. Q1 Final Scoreboard. 20/11/2014, at https://www.synapse.org/#!Synapse:syn2290704/wiki/68511 (2014).
  44. Wilson R. S., Bienias J. L., Evans D. A. & Bennett D. A. Religious Orders Study: Overview and Change in Cognitive and Motor Speed. Aging, Neuropsychol. Cogn. 11, 280–303 (2004). [Google Scholar]
  45. Bennett D. A. et al. The Rush Memory and Aging Project: study design and baseline characteristics of the study cohort. Neuroepidemiology 25, 163–75 (2005). [DOI] [PubMed] [Google Scholar]
  46. Peters M. Description of Final Results. 22/06/2015, at https://www.synapse.org/#!Synapse:syn2290704/wiki/70719 (2015).
  47. Stern Y. Cognitive reserve and Alzheimer disease. Alzheimer Dis. Assoc. Disord. 20, 112–7 (2006). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Dataset S1
srep34567-s1.xls (672KB, xls)
Supplementary Info S2
srep34567-s2.doc (23KB, doc)

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES