Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Dec 4.
Published in final edited form as: Proc IEEE Int Symp Biomed Imaging. 2019 Jul 11;2019:368–371. doi: 10.1109/ISBI.2019.8759397

EARLY PREDICTION OF ALZHEIMER’S DISEASE DEMENTIA BASED ON BASELINE HIPPOCAMPAL MRI AND 1-YEAR FOLLOW-UP COGNITIVE MEASURES USING DEEP RECURRENT NEURAL NETWORKS

Hongming Li 1, Yong Fan 1; Alzheimer’s Disease Neuroimaging Initiative1
PMCID: PMC6892161  NIHMSID: NIHMS1060600  PMID: 31803346

Abstract

Multi-modal biological, imaging, and neuropsychological markers have demonstrated promising performance for distinguishing Alzheimer’s disease (AD) patients from cognitively normal elders. However, it remains difficult to early predict when and which mild cognitive impairment (MCI) individuals will convert to AD dementia. Informed by pattern classification studies which have demonstrated that pattern classifiers built on longitudinal data could achieve better classification performance than those built on cross-sectional data, we develop a deep learning model based on recurrent neural networks (RNNs) to learn informative representation and temporal dynamics of longitudinal cognitive measures of individual subjects and combine them with baseline hippocampal MRI for building a prognostic model of AD dementia progression. Experimental results on a large cohort of MCI subjects have demonstrated that the deep learning model could learn informative measures from longitudinal data for characterizing the progression of MCI subjects to AD dementia, and the prognostic model could early predict AD progression with high accuracy.

Keywords: Prognosis, recurrent neural networks, longitudinal data, Alzheimer’s disease

1. INTRODUCTION

Alzheimer’s disease (AD) is the most prevalent neurodegenerative disorder, and individuals with mild cognitive impairment (MCI) are at a higher risk to develop AD [1]. Although promising performance has been achieved for distinguishing progressive MCI (pMCI) subjects from stable MCI (sMCI) subjects in a pattern classification setting, it remains difficult at baseline to predict which and when MCI individuals will convert to AD dementia.

Neuropsychological tests and a variety of different biological and imaging markers have been explored for diagnosis and following the progression of AD [28], such as genetic data, cerebrospinal fluid (CSF) biomarkers, magnetic resonance imaging (MRI), and positron emission tomography (PET). It has been demonstrated that pattern classifiers built upon these measures could distinguish AD patients from subjects with normal cognition (NC) with high accuracy, and recent studies have demonstrated that better performance can be achieved if longitudinal rather than cross-sectional data are used to build the classifiers [911].

Longitudinal changes of biomarkers and imaging markers have been investigated for AD diagnosis/prognosis, for instance, changes of neuropsychological measures [12], atrophy rate in cortical thickness and subcortical volume [9, 1315], and changes in brain tissue intensity/density map [10]. Most longitudinal data based prediction models require different subjects to have data at the same time points. However, missing data is a ubiquitous problem in longitudinal studies. Such a problem is typically circumvented by imputing missing data [11]. Multivariate functional principal component (MFPC) scores [12] have been used to represent longitudinal makers for handling missing or irregular data. However, certain assumption is adopted in MFPC to model the latent longitudinal process, which may not be appropriate for different types of markers.

The early prediction of AD dementia has been typically modeled as a classification problem, for instance, distinguishing sMCI subjects from pMCIs. However, the classification performance is dependent on a cut-off threshold of follow-up duration that is used to define pMCI and sMCI. Moreover, the cohorts of pMCI and sMCI subjects are typically heterogeneous regardless of the threshold used. Furthermore, the classification setting for prediction of AD dementia does not provide timing information about when MCI individuals will cross the threshold to AD dementia. Several studies [11, 12, 16] have focused on the prediction of time of progression to AD under a time-to-event analysis setting, and promising performance have been obtained.

Recently, deep learning techniques built upon recurrent neural networks (RNNs) with a long short term memory (LSTM) [17] structure have achieved remarkable advances in sequence modeling, such as machine translation and functional MRI modeling [18, 19], indicating RNNs might be better tools for characterizing longitudinal data.

In this study, a LSTM autoencoder is adopted to learn compact and informative representation from longitudinal cognitive measures for predicting progression of MCI subjects to AD dementia. These representations could encode temporal dynamics of longitudinal cognitive measures and characterize the progression trajectory of MCI subjects without any explicit assumption regarding the longitudinal process behind the measures. Based on the learned representations and baseline hippocampal MRI data, a prognostic model is built in a time-to-event setting. Particularly, Cox regression model is adopted to estimate the risk of progression of MCI subjects to AD dementia. The proposed model is applied to a large cohort obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI), and the experimental results have demonstrated that the proposed model could obtain promising prognostic performance, and cognitive measures and imaging based measures could provide complementary information for the prognosis.

2. METHODS AND MATERIAL

To build an early prediction model of AD dementia based on longitudinal data, we first train a LSTM autoencoder [20] to learn compact representations and encode temporal dynamics of longitudinal measures for each subject. The learned representations are then combined with baseline imaging data as features to build a prognosis model under a time-to-event analysis setting.

2.1. Data

Cognitive measures of 822 MCI subjects at baseline, 6 months, and 12 months were obtained from ADNI-1, GO & 2, including 13-item version of the Alzheimer’s Disease Assessment Scale-Cognitive subscale (ADAS-Cog13), Rey Auditory Verbal Learning Test (RAVLT) immediate, RAVLT learning, Functional Assessment Questionnaire (FAQ), and Mini-Mental State Examination (MMSE). Baseline characteristics of the subjects included are summarized in table 1. Baseline structural MRI scans were also obtained for these subjects to extract hippocampal imaging measures.

Table1.

Demographic information of the dataset used in this study.

ADNI sMCI pMCI
1 Age (years) 74.92±7.51 74.58±7.16
Gender (M/F) 114/61 132/76
MMSE 27.29±1.78 26.78±1.74
Conversion/censor time (months) 49.2±36.6 30.5±25.6
GO&2 Age 71.39±7.47 72.68±6.97
Gender (M/F) 182/150 60/47
MMSE 28.29±1.62 27.35±1.77
Conversion/censor time (months) 43.2±16.7 24.88±14.5

sMCI: stable MCI who remained as MCI at the last visit; pMCI: progressive MCI who converted to AD before the last visit.

2.2. LSTM based feature representation

Given the longitudinal cognitive measures at multiple time points for each subject, we learn informative and compact representations to encode the subject’s overall longitudinal cognitive performance and its temporal changes/trajectory across multiple time points. The LSTM autoencoder [20] provides an ideal tool for achieving this goal. The architecture of the LSTM autoencoder adopted in this study is illustrated in Fig. 1. It contains two parts, encoder and decoder. The encoder receives the input data of multiple time points and handles the encoding of input measures and their temporal dynamics between consecutive time points. The decoder is utilized to reconstruct the input measures at different time points step by step in a reverse order, based on the learned representations by the encoder. While the network is optimized to minimize the deviation between the reconstructed and input measures, the learned representation by the encoder is expected to characterize the overall cognitive performance and its dynamics of the input longitudinal measures.

Fig. 1.

Fig. 1.

LSTM autoencoder for longitudinal feature extraction.

As illustrated in Fig. 1, our autoencoder contains 2 LSTM layers for the encoder and decoder respectively. ft and f˜t are the input and reconstructed cognitive measures at time point t (t=1,2,3 for illustration), Wei are the trainable parameters of the i -th LSTM layer of the encoder, and Wdi are the trainable parameters of the i-th LSTM layer of the decoder. The trainable parameters are those involved in the forget gate, input gate, cell state and hidden state within one LSTM layer. Euclidean distance between the reconstructed and input measures is used as the objective function to optimize the trainable parameters. The number of LSTM layers was chosen to achieve generalizable performance with a small number of trainable parameters.

In this study, the autoencoder of cognitive measures was built on longitudinal cognitive measures of subjects from ADNI-1 cohort. Once the autoencoder was obtained, the encoder was then applied to all the MCI patients from ADNI-1 and GO&2 cohorts to extract their latent features from longitudinal cognitive measures, which were then used for the following prognostic analysis.

2.3. Prognostic modeling

Given the latent representations of longitudinal cognitive information, they were combined with baseline hippocampal MRI based measures to build a prognostic model using Cox regression [21].

Particularly, cognitive measures including ADAS-Cog13, RAVLT immediate, RAVLT learning, FAQ, and MMSE were used to learn LSTM encoded cognitive measures. Imaging features from baseline hippocampal MRI data was extracted as an imaging based risk of progression to AD using a deep learning based prognostic framework [22]. Age, gender, education years, and APOEε4 status at baseline were used as covariates in the modeling.

Cox regression model was built on data of MCI subjects of ADNI-1, and its prognostic performance was evaluated based on the data of MCI subjects of ADNI-GO&2. The schematic illustration of the prognostic modeling is illustrated in Fig. 2. We built prediction models on longitudinal cognitive data of the first 2 and 3 time points separately (within 1-year follow-up) along with the baseline hippocampal imaging based measures.

Fig. 2.

Fig. 2.

Schematic diagram of the time-to-event prognosis model.

3. RESULTS

3.1. Experimental setting

We implemented the LSTM autoencoder using Tensorflow. Two LSTM layers were adopted as shown in Fig. 1, and the number of hidden nodes in each LSTM layer was set to 5 (the dimension of the LSTM encoded cognitive measures, the same as the number of input cognitive measures at each time point). Adam optimization technique was adopted to optimize the autoencoder, with a base learning rate set to 0.01 and updated using a stepwise policy by dropping the learning rate by a factor of 0.1 after every 20000 iterations. The maximum iteration number of the training procedure was set to 100000, and the batch size was set to 64. The autoencoder was trained on a Nvidia Titan Xp graphics processing unit (GPU).

We compared the proposed prediction model with those built on longitudinal cognitive measures only or cognitive measures at single time points. The prognostic performance was evaluated using concordance measure (C-index).

3.2. Experimental results

The prognosis performance of the proposed prediction models is demonstrated in Fig. 3. For the LSTM based prediction models built on the longitudinal data, the 6m model was built on data at baseline (bl) and 6-month visits, and the 12m model was built on data at bl, 6-month, and 12-month visits. It is worth noting that if a pMCI subject converted to AD at Visit-ID or data at Visit-ID were not available for a sMCI subject, only data at visits before the Visit-ID were used. When the prediction models built upon ADNI-1 longitudinal cognitive data of 3 and 2 time points applied to ADNI-GO&2 subjects, they achieved C-index values of 0.896 and 0.873 respectively, better than the prediction models built upon single-visit data at 12 and 6 months (p=0.019 and 0.552 respectively). All these prediction models had better prediction performance than the model built upon the baseline cognitive measures (a C-index value of 0.848), while the LSTM based prediction models were significantly better (p<0.05). When both longitudinal cognitive measures and baseline imaging measures were used, prediction models achieved C-index values of 0.901 and 0.889 at 12 and 6 months respectively, significantly better (p<0.006) than a model built on baseline cognitive measures and imaging measures (a C-index value of 0.866).

Fig. 3.

Fig. 3.

Prognostic performance of prediction models built on single visit cognitive data (blue), longitudinal cognitive data with 5 LSTM encoded features (red), and longitudinal cognitive data with hippocampal MRI based features (magenta).

As shown in Fig. 3, the prediction models built upon data of any single visit had worse prediction performance than the models built upon longitudinal data, and the models built on data of later time points had better performance than those built on data of earlier time points. Best prognostic performance was obtained when LSTM encoded cognitive representation was combined with imaging based features from baseline hippocampal MRI data [22], indicating that clinical measures and imaging data could provide complementary information for the prognosis. Moreover, we expect that the prognostic performance could be further improved if longitudinal imaging data are incorporated in the prediction model.

We have also evaluated how the number of hidden nodes in each LSTM layer impacted the prognostic performance. The prognostic performance of prediction models built on longitudinal data with different numbers (3, 5, 7, and 9) of features learned by the LSTM autoencoder and baseline imaging data are shown in Fig. 4, demonstrating that the performance was relatively stable and all the longitudinal data based models outperformed the model built on baseline cognitive and imaging measures when the number of hidden nodes was larger than 3.

Fig. 4.

Fig. 4.

Prediction performance based on longitudinal data with different numbers of features learned by LSTM autoencoder.

4. CONCLUSION

In this study, we developed a deep learning based method to characterize longitudinal dynamics of cognitive measures and built prognostic models based on baseline hippocampal MRI measures and the learned longitudinal dynamics to predict individual MCI subjects’ progression to AD. Evaluation results have demonstrated that the proposed model achieved promising performance for predicting MCI subjects’ progression to AD using data within 1-year follow-up. Future work will be devoted to model optimization and more validation on external data cohorts.

5. ACKNOWLEDGEMENTS

This work was supported in part by National Institutes of Health grants [CA223358, EB022573, and AG054409].

6. REFERENCES

  • [1].Jack CR Jr. et al. , “NIA-AA Research Framework: Toward a biological definition of Alzheimer’s disease,” Alzheimers Dement, vol. 14, no. 4, pp. 535–562, April 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Khachaturian AS, Corcoran CD, Mayer LS, Zandi PP, Breitner JS, and Cache County Study I, “Apolipoprotein e ϵ4 count affects age at onset of alzheimer disease,but not lifetime susceptibility: The cache county study,” Archives of General Psychiatry, vol. 61, no. 5, pp. 518–524, 2004. [DOI] [PubMed] [Google Scholar]
  • [3].Bilgel M et al. , “Trajectories of Alzheimer disease-related cognitive measures in a longitudinal sample,” Alzheimers Dement, vol. 10, no. 6, pp. 735–742 e4, November 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Davatzikos C, Fan Y, Wu X, Shen D, and Resnick SM, “Detection of prodromal Alzheimer’s disease via pattern classification of magnetic resonance imaging,” Neurobiol Aging, vol. 29, no. 4, pp. 514–23, April 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Fan Y, Batmanghelich N, Clark CM, Davatzikos C, and Alzheimer’s Disease Neuroimaging I, “Spatial patterns of brain atrophy in MCI patients, identified via high-dimensional pattern classification, predict subsequent cognitive decline,” Neuroimage, vol. 39, no. 4, pp. 1731–43, February 15 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Fan Y, Resnick SM, Wu X, and Davatzikos C, “Structural and functional biomarkers of prodromal Alzheimer’s disease: a high-dimensional pattern classification study,” Neuroimage, vol. 41, no. 2, pp. 277–85, June 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Davatzikos C, Xu F, An Y, Fan Y, and Resnick SM, “Longitudinal progression of Alzheimer’s-like patterns of atrophy in normal older adults: the SPARE-AD index,” Brain, vol. 132, no. Pt 8, pp. 2026–35, August 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Misra C, Fan Y, and Davatzikos C, “Baseline and longitudinal patterns of brain atrophy in MCI patients, and their use in prediction of short-term conversion to AD: results from ADNI,” Neuroimage, vol. 44, no. 4, pp. 1415–22, February 15 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Farzan A, Mashohor S, Ramli AR, and Mahmud R, “Boosting diagnosis accuracy of Alzheimer’s disease using high dimensional recognition of longitudinal brain atrophy patterns,” Behavioural brain research, vol. 290, pp. 124–130, 2015. [DOI] [PubMed] [Google Scholar]
  • [10].Huang M, Yang W, Feng Q, Chen W, and Alzheimer’s Disease Neuroimaging I, “Longitudinal measurement and hierarchical classification framework for the prediction of Alzheimer’s disease,” Sci Rep, vol. 7, p. 39880, January 12 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Thung KH, Yap PT, Adeli E, Lee SW, Shen D, and Alzheimer’s Disease Neuroimaging I, “Conversion and time-to-conversion predictions of mild cognitive impairment using low-rank affinity pursuit denoising and matrix completion,” Med Image Anal, vol. 45, pp. 68–82, April 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Li K, O’Brien R, Lutz M, Luo S, and Alzheimer’s Disease Neuroimaging I, “A prognostic model of Alzheimer’s disease relying on multiple longitudinal measures and time-to-event data,” Alzheimers Dement, vol. 14, no. 5, pp. 644–651, May 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Jack CR Jr. et al. , “Comparison of different MRI brain atrophy rate measures with clinical disease progression in AD,” Neurology, vol. 62, no. 4, pp. 591–600, February 24 2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Aguilar C et al. , “Application of a MRI based index to longitudinal atrophy change in Alzheimer disease, mild cognitive impairment and healthy older individuals in the AddNeuroMed cohort,” Front Aging Neurosci, vol. 6, p. 145, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Chincarini A et al. , “Integrating longitudinal information in hippocampal volume measurements for the early detection of Alzheimer’s disease,” Neuroimage, vol. 125, pp. 834–847, January 15 2016. [DOI] [PubMed] [Google Scholar]
  • [16].Kong DH et al. , “Predicting Alzheimer’s Disease Using Combined Imaging-Whole Genome SNP Data,” Journal of Alzheimers Disease, vol. 46, no. 3, pp. 695–702, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Hochreiter S and Schmidhuber J, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997. [DOI] [PubMed] [Google Scholar]
  • [18].Li H and Fan Y, “Brain Decoding from Functional MRI Using Long Short-Term Memory Recurrent Neural Networks,” in Medical Image Computing and Computer Assisted Intervention – MICCAI 2018, Cham, 2018, pp. 320–328: Springer International Publishing. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Li H and Fan Y, “Identification of Temporal Transition of Functional States Using Recurrent Neural Networks from Functional MRI,” in Medical Image Computing and Computer Assisted Intervention – MICCAI 2018, Cham, 2018, pp. 232–239: Springer International Publishing. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].Srivastava N, Mansimov E, and Salakhudinov R, “Unsupervised learning of video representations using lstms,” in International conference on machine learning, 2015, pp. 843–852. [Google Scholar]
  • [21].Cox DR, “Regression models and life-tables,” in Breakthroughs in statistics: Springer, 1992, pp. 527–541. [Google Scholar]
  • [22].Li H, Habes M, and Fan Y, “A deep learning prognostic model for early prediction of Alzheimer’s disease based on hippocampal MRI data,” presented at the Alzheimer’s Association International Conference 2018ll, Chicago, 2018. [Google Scholar]

RESOURCES