Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2014 Aug 20;9(8):e105542. doi: 10.1371/journal.pone.0105542

Predicting Progression of Alzheimer’s Disease Using Ordinal Regression

Orla M Doyle 1,*, Eric Westman 1,2, Andre F Marquand 1, Patrizia Mecocci 5, Bruno Vellas 6, Magda Tsolaki 7, Iwona Kłoszewska 8, Hilkka Soininen 9, Simon Lovestone 3, Steve C R Williams 1,3,10,11, Andrew Simmons, for the AddNeuroMed consortium and for the Alzheimer Disease Neuroimaging Initiative1,3,4,10,11
Editor: Karl Herholz12
PMCID: PMC4139338  PMID: 25141298

Abstract

We propose a novel approach to predicting disease progression in Alzheimer’s disease (AD) – multivariate ordinal regression – which inherently models the ordered nature of brain atrophy spanning normal aging (CTL) to mild cognitive impairment (MCI) to AD. Ordinal regression provides probabilistic class predictions as well as a continuous index of disease progression – the ORCHID (Ordinal Regression Characteristic Index of Dementia) score. We applied ordinal regression to 1023 baseline structural MRI scans from two studies: the US-based Alzheimer’s Disease Neuroimaging Initiative (ADNI) and the European based AddNeuroMed program. Here, the acquired AddNeuroMed dataset was used as a completely independent test set for the ordinal regression model trained on the ADNI cohort providing an optimal assessment of model generalizability. Distinguishing CTL-like (CTL and stable MCI) from AD-like (MCI converters and AD) resulted in balanced accuracies of 82% (cross-validation) for ADNI and 79% (independent test set) for AddNeuroMed. For prediction of conversion from MCI to AD, balanced accuracies of 70% (AUC of 0.75) and 75% (AUC of 0.81) were achieved. The ORCHID score was computed for all subjects. We showed that this measure significantly correlated with MMSE at 12 months (ρ = –0.64, ADNI and ρ = –0.59, AddNeuroMed). Additionally, the ORCHID score can help fractionate subjects with unstable diagnoses (e.g. reverters and healthy controls who later progressed to MCI), moderately late converters (12–24 months) and late converters (24–36 months). A comparison with results in the literature and direct comparison with a binary classifier suggests that the performance of this framework is highly competitive.

Introduction

Alzheimer’s disease (AD) is a neurodegenerative disorder characterized by progressive dementia that occurs in later life. In addition to devastating cognitive impairment, AD is characterized by progressive cerebral atrophy. The anatomical hallmarks of AD, cerebral atrophy and ventricular expansion, can be detected using magnetic resonance imaging (MRI). To model the spatial pattern of atrophy and predict conversion to AD, pattern recognition (PR) has been extensively applied, in particular to the Alzheimer’s disease neuroimaging initiative (ADNI) dataset [1][10]. Good performance has been well-established for discriminating healthy controls (CTL) and AD patients [11]. Therefore, most studies have focused on the more challenging problem of predicting conversion from mild cognitive impairment (MCI) to AD. The PR approach employed most commonly has been to train binary classifiers to discriminate CTL from AD, then apply these classifiers to make predictions for the MCI group [2], [6][9]. In these cases, the model is configured to discriminate between the extremities of disease progression while ignoring the intermediate states. A potential shortcoming of this approach is that it overlooks the ordinal nature of the disease progression: AD is associated with higher rates of brain atrophy than normal ageing with MCI between the two [12][14].

Here, we propose to use structural MRI acquired at baseline to predict disease progression across four clinical time points: CTL, MCI stable (MCI-s), MCI converter (MCI-c) and AD. To model these four groups simultaneously (distinct from pairwise comparisons) while considering the ordered relationship between the classes, we use multivariate ordinal regression [15]. Under this framework, the clinical groups are considered to lie on a continuum of disease progression, which provides a source of information that is not utilized by the conventional approach of producing predictions from models trained on the contrast between CTL (disease-free state) versus AD (disease state). We hypothesize that this method is therefore particularly well-suited to AD considering that brain atrophy/damage cannot be assumed to occur at a uniform rate [16][18]. The main advantages of performing multivariate ordinal regression over a mass univariate approach are that the method is able to make use of correlations between brain regions and provides predictions at a single subject level based on patterns in the data. In more detail, we have implemented multivariate ordinal regression using Gaussian processes in a Bayesian framework. This framework furnishes probabilistic predictions of which clinical time point (CTL, MCI-s, MCI-c or AD) a test case belongs too. In addition, to help alleviate the possibility of overfitting, this method implicitly regularises the solution via the prior over the parameters.

In this study, we explore the use of ordinal regression applied to baseline structural MRI data for automated early detection as well as diagnosis of AD using two large multicentre studies – the North American ADNI cohort and the European AddNeuroMed cohort. For the ADNI cohort, the performance of ordinal regression will be assessed using cross-validation with the AddNeuroMed cohort will be used as a completely independent test set. While cross-validation provides an estimate of the model generalizability; an independent test set provides the optimal way to assess the model generalizability [19]. In this case, the independent data set (AddNeuroMed) has been acquired so that it is compatible with ADNI but using different MRI scanners and in a European cohort.

Materials and Methods

Subjects

Subjects from two open access multicentre studies were used: AddNeuroMed (http://www.innomed-addneuromed.com) and ADNI (http://www.adni-info.org/). For AddNeuroMed follow-up is only available up to 12 months therefore, we only consider 12 month follow-up for the ADNI dataset for the main analysis. A total of 1023 subjects were included, 348 subjects from the AddNeuroMed study (119 AD, 119 MCI and 110 CTL, representing the entire sample) and 564 subjects from the ADNI study (147 AD, 226 MCI and 191 CTL; participant identifiers are listed in Table S1). For further validation, ADNI subjects with unstable diagnostic labels (reverters) and late converters (those who convert between 12–36 months) were selected using follow-up data to 36 months (4 AD, 89 MCI and 18 CTL; participant identifiers are listed in Table S2). Often these data have been previously discarded [1], [8], [20]. Here we propose to use these data as an additional test set to map these subjects onto the CTL to AD continuum. The subject characteristics are presented in Table 1. For more information about these datasets; specifically the inclusion/exclusion criteria and MRI acquisition see Text S1.

Table 1. Baseline subject characteristics.

ADNI ADNI Subset AddNeuroMed
CTL MCI AD CTL MCI AD CTL MCI AD
N 191 226 147 18 89 4 110 119 119
Age 75.8±5.0 74.7±6.9 75.0±7.4 77.7±5.4 74.7±7.5 77.2±6.6 72.9±6.5 74.3±5.7 75.5±6.0
Female/Male 94/97 84/142 71/76 7/11 37/52 2/2 60/50 59/50 60/40
Education 16.1±2.8 15.8±2.9 14.8±3.0 16.6±2.6 15.6±3.0 13±3.8 10.8±4.8 8.9±4.3 8.0±4.0
MMSE 29.1±1.0 27.1±1.8 23.4±1.9 29.1±0.8 26.7±1.6 25.0±1.4 29.1±1.2 27.1±1.7 20.9±4.7
CDR-SOB 0.03±0.1 1.56±0.9 4.26±1.6 0.03±0.1 1.66±0.9 4.00±0.9 0.06±0.2 1.38±0.9 6.50±3.2
ApoE 4+ (%) 25.1 48.7 64.0 38.9 62.9 100 28.2 32.8 53.8

Data shown as mean ± standard deviation. Education is presented in years. ADNI subset refers to the group of reverters and later converters which were used for validation of the model but were not included in the training of the model. AD = Alzheimer’s disease, MCI = Mild Cognitive Impairment, CTL = healthy control, MMSE = Mini Mental State Examination. CDR-SOB – Clinical Dementia Rating – Sum Of Boxes.

The following diagnostic criteria were used:

  • AD: Mini mental state examination (MMSE) scores between 20–26, CDR of 0.5 or 1, and met NINCDS/ADRDA criteria for probable AD.

  • MCI: MMSE scores between 24–30, memory complaints with objective memory loss measured with Wechsler Memory Scale Logical Memory II (education adjusted scores), CDR of 0.5, absence of significant levels of impairment in other cognitive domains, preserved activities of daily leaving and absence of dementia.

  • CTL: MMSE scores between 24–30, CDR of 0, not depressed, non MCI, non-demented.

The subjects were re-assessed at several time points following baseline. Using the diagnostic criteria defined earlier and at the 12 month time point MCI subjects were divided into two groups: those that did not progress to AD (MCI-s) and those that progressed to AD (MCI-c). MCI subjects who converter after 12 months were not used to train the model but were used as a test validation set.

Ethics Statement

We used ADNI subject data collected from 50 clinic sites. Ethics approval was obtained for each institution involved. The AddNeuroMed study was approved by ethical review boards in each participating country (local ethical review board at University of Perugia, University of Toulouse, Aristotle University of Thessaloniki, Medical University of Lodz, University of Eastern Finland and University Hospital of Kuopio and King’s College London). Both studies were conducted according to Good Clinical Practice guidelines, the Declaration of Helsinki, US 21CFR Part 50– Protection of Human Subjects, and Part 56– Institutional Review Boards, and pursuant to state and federal HIPAA regulations. Written consent was obtained where the research participant had capacity, and in those cases where dementia compromised capacity then assent from the patient and written consent from a relative, according to local law and process, was obtained. Consent was requested for data collection, sample storage and subsequent use of samples for research. The completed questionnaires were approved by each participating site’s Institutional Review Board. The data were anonymized before being shared.

Regional volume segmentation and cortical thickness parcellation

For both studies, the imaging protocol included a high-resolution sagittal 3D T1-weighted Magnetization Prepared RApid Gradient Echo (MPRAGE) volume. Volumetric segmentation, cortical surface reconstruction and cortical parcellation, based on the FreeSurfer package (4.5.0, http://surfer.nmr.mgh.harvard.edu/), were used to quantify baseline thicknesses and volumes of brain regions, as detailed previously [21]. Sixty-eight cortical thickness measures (34 from each hemisphere) and 50 regional volumes were generated. Volumes of white matter hypointensities, optic chiasm, right and left vessel, and right and left choroid plexus were excluded. White matter hypointensities were excluded since most subjects were characterized by zero values. Volumetric measures were normalized by their intracranial volume while cortical thickness measures were not normalized [22]. Right and left measures were averaged [23]. In total this results in 57 measures to be used as input features for ordinal regression; 34 regional cortical thickness measures and 23 regional volumes (Table S3).

Multivariate ordinal regression

Multivariate ordinal regression (ORGP) was performed using Gaussian processes in a Bayesian framework [15], [24], providing probabilistic predictions for class membership. The likelihood function captures the ordinal nature of the data using a soft threshold model. Crucially, these thresholds are learned from the training data to provide flexibility in the distances between classes. Additionally, the predictive mean of the latent function that models the ordinal continuum per test case (see Text S1) is used to measure the Ordinal Regression Characteristic Index of Dementia (ORCHID) whereby more positive values indicate a more AD-like brain structure and more negative values indicate a more CTL-like brain structure. To visualise the spatial pattern driving the discrimination multivariate maps can be constructed. This is achieved by visualising the weight vector to provide a spatial representation of the ordinal continuum.

Methodology of ordinal regression using Gaussian processes

Consider a training dataset Inline graphic of N observations, Inline graphic where each sample is a pair consisting of the input data vector Inline graphic of dimension M and corresponding label Inline graphic where L is a finite set of R ordinal categories, denoted Inline graphic. The column data vectors which in our case represent the cortical thickness and subcortical volumes for all N subjects are aggregated in the data matrix X with dimensions N×M. The targets are collected in vector y which represents the state of disease progression and are ranked from one to four whereby one: CTL, two: MCI-s, three: MCI-c and four: AD.

The main principle here is to assume an unobservable latent function Inline graphic associated with each Inline graphic and assume a Gaussian process (GP) prior over Inline graphic, where Inline graphic is a vector collecting all latent function values at the training points. The ordinal variable Inline graphic is dependent on the latent function Inline graphic by modelling the ranks as intervals on the real line. This is achieved using a Bayesian framework. First, a GP prior Inline graphic is placed on the latent function. The GP prior can be fully defined by a mean function Inline graphic and a covariance function Inline graphic. Here we define the GP as zero mean with a linear covariance matrix:

graphic file with name pone.0105542.e015.jpg

where Inline graphic is a bias term that also controls the scaling of the latent function which in turn affects the variance of the predictive weights. We refer to this quantity as a hyperparameter, collected in the vector Inline graphic, which will be optimised within this framework.

The joint probability of observing the ordinal variables, i.e. the likelihood is defined as

graphic file with name pone.0105542.e018.jpg

where Inline graphic and Inline graphic with Inline graphic representing a threshold variable and Inline graphic representing a noise term. Inline graphic is the cumulative unit Gaussian whereby Inline graphic. At the extremities of the ordinal continuum, the threshold variables serve as limits and are defined as Inline graphic and Inline graphic, and the intermediate thresholds are further defined as Inline graphic with positive padding variables Inline graphic where k = 2,…,R–1. This formulation enforces ordinal constraints by dividing the real line into R contiguous intervals Inline graphic which map Inline graphic to the discrete variable Inline graphic. Note that the thresholds are not constrained to be equidistant.

Bayes theorem can be used to compute the posterior probability, hence enabling predictions to be made. The marginal likelihood, Inline graphic is known as the model evidence and is the metric used to learn the hyperparameters. To approximate the posterior distribution and model evidence we use the Laplace approximation at the maximum a posteriori (MAP) estimate [25]. Powell’s method is used to maximise the evidence and hence, infer the optimal hyperparameters [26].

Having set the hyperparameters, we now want to make predictions about a test case Inline graphic for which the target Inline graphic is unknown. Under the Laplace approximation, the predictive distribution for the latent function can be written as a Gaussian Inline graphic where the predictive mean and variance can be written as

graphic file with name pone.0105542.e036.jpg

where Inline graphic is the covariance between the test case and the training data, Inline graphic is the variance of the test case, Inline graphic is the MAP estimate of the latent function, Inline graphic is a diagonal matrix whose ii-th entry is second derivative of the likelihood function training sample Inline graphic with respect to Inline graphic.

The predictive distribution over the ordinal target Inline graphic is

graphic file with name pone.0105542.e044.jpg

This distribution is used to assign the test case to an ordinal scale using

graphic file with name pone.0105542.e045.jpg

To visualise the spatial pattern driving the discrimination multivariate maps can be constructed. This is achieved by visualising the MAP estimate of the weight vector to provide a spatial representation of the decision boundary. This is analogous to the weight vector used for mapping SVM discrimination. For ORGP we can extract a vector Inline graphic, which is analogous to the weight vector in the function-space view of GP learning, whereby Inline graphic. The maps are constructed by computing the posterior expectation Inline graphic of the weight vector in the weight-space.

graphic file with name pone.0105542.e049.jpg

Implementation of ordinal regression

To assess the generalizability of the model two validation approaches were used. For ADNI data, the ordinal regression approach was embedded within a stratified 10-fold cross-validation scheme that preserves the relative frequencies of samples in each class. AddNeuroMed data were used as an independent validation set whereby the ordinal regression model was trained on the ADNI dataset and subsequently tested on the AddNeuroMed dataset.

To account for imbalanced subject numbers per class, the probabilistic predictions for each test case were recalibrated whereby the prediction per class was divided by the proportion of that class represented in the training set. The probabilistic predictions per test case and across all four classes were then re-normalised to sum to one [27].

This study aimed to predict the class label of each subject along an ordinal continuum representative of disease state. In addition, we aimed to develop a potentially useful tool for clinical decision-making and that can be easily compared with current approaches. For this purpose, we also consider the accuracy obtained when the ordinal labels were aggregated into two classes defined to reflect “CTL-like” (CTL or MCI-s) or “AD-like” (MCI-c or AD) subjects. Specifically, the prediction of CTL-like is considered correct if the subject belongs to the CTL or MCI-s groups and the prediction of AD-like is correct if the subject belongs to the MCI-c or AD group.

Multivariate maps were constructed to visualise the spatial pattern driving the ordinal regression. For Gaussian process models, this is achieved by visualising the maximum a posteriori estimate of the weight vector to provide a visualisation of the projection of the data ranging from CTL to AD (see Text S1 for more details). Since multivariate techniques are sensitive to spatial correlations across features (ROIs in this case), and the performance of the model is based on the entire pattern rather than individual regions, local inference should be avoided when interpreting these maps.

ORGP was implemented by the authors in MATLAB (The MathWorks, Natick, Massachusetts). Custom likelihood and inference scripts were written for compatibility with the GPML toolbox [25].

Performance Metrics

The sensitivity and specificity of binary predictions were calculated for each pair. We report the balanced accuracy (mean of the sensitivity and specificity) which avoids inflated performance estimates for imbalanced datasets.

Confusion matrices and receiver operator characteristic (ROC) curves were used for visualisation. For the confusion matrices, the rows represent the true class labels and the columns represent the labels predicted by the learning machine. The diagonal elements represent correctly classified test cases whereas the off-diagonal elements represent misclassifications. An ROC curve involves plotting true positive rate (sensitivity) against the false positive rate (1-specificity) achieved by varying the threshold on the probabilities for a binary contrast. The area under this curve can also be used as a metric for assessing performance; an area of 1 implies perfect discrimination whereas as area of 0.5 implies random chance.

Differences in the distribution of the predictive mean from ORGP were assessed using a two sample Kolmogorov-Smirnov test; whereby p<0.05 implies that they are from different distributions. Spearman’s correlation was used to assess the relationship between ORCHID scores and MMSE scores.

Results

Ordinal regression was applied to the four-class continuum spanning healthy controls (CTL), stable mild cognitive impairment (MCI-s), mild cognitive impairment with subsequent conversion (MCI-c) and Alzheimer’s disease (AD). The four class predictions were subsequently summarised to provide binary prediction of CTL-like or AD-like.

Cross-validated ADNI results

The performance of ordinal regression for the ADNI dataset using 10-fold cross-validation is presented in terms of three confusion matrices (Figure 1) and ROC curves (Figure 2(a)): the main contrast was CTL-like versus AD-like and two additional contrasts of interest were CTL vs. AD and MCI-s vs. MCI-c. For CTL-like versus AD-like, a balanced accuracy of 82% was achieved with an AUC of 0.88. Considering CTL versus AD, a balanced accuracy of 91% was achieved with an AUC of 0.95. For MCI-s versus MCI-c contrast a balanced accuracy of 70% and an AUC of 0.75 were found. The multivariate pattern of brain regions driving the discrimination is shown by Figure 3.

Figure 1. Confusion matrices for ordinal regression applied to two AD-related cohorts.

Figure 1

The confusion matrix for the binarised CTL-like vs. AD-like (CTL and MCI-s vs. MCI-c and AD) is displayed on the left. For illustration purposes, on the right confusion matrices for two contrasts of interest: CTL vs. AD and MCI-s vs. MCI-c (note: training scheme is unchanged). The top panel displays the results achieved using 10-fold cross-validation on the ADNI dataset. The bottom panel displays the results achieved by applying the ordinal regression model trained on the ADNI dataset to the AddNeuroMed data set whereby the AddNeuroMed dataset represents an independent unseen validation of the performance of the method.

Figure 2. Performance curves and correlation plots for ordinal regression.

Figure 2

(A): ROC curves for ordinal regression applied to the ADNI data set using 10-fold cross-validation (first panel) and using the AddNeuroMed data set as an independent test set. In both cases the ROC curves are shown from three contrasts. (B): Correlation plots of the MMSE score assessed at the 12 month follow-up against the Ordinal Regression Characteristic Index of Dementia (ORCHID) score for both the ADNI dataset and the AddNeuroMed dataset.

Figure 3. Multivariate discriminative weights computed using ordinal regression.

Figure 3

For ordinal regression the weights can be interpreted as the projection of the data along the function space weight vector spanning CTL to MCI-s to MCI-s to AD. Note that the weights are symmetric across hemispheres. These weights are sensitive the spatial correlations in the data and therefore should not be interpreted in a univariate manner.

Independent validation using AddNeuromed

To independently validate ordinal regression the model was trained on the ADNI data and tested on the AddNeuroMed data. The performance of the model is illustrated as confusion matrices (Figure 1) and ROC curves (Figure 2(a)). Comparing CTL and MCI-s to MCI-c and AD a balanced accuracy of 79% was obtained with an AUC of 0.88. Considering the CTL versus AD contrast a balanced accuracy of 83% was achieved with an AUC of 0.93. For MCI-c versus MCI-s a balanced accuracy of 75% with an AUC of 0.81 was obtained.

Combined ADNI and AddNeuroMed Data

The performance of ordinal regression for the combined ADNI and AddNeuroMed datasets using stratified 10-fold cross-validation is presented in terms of three confusion matrices (Figure S2). Comparing CTL and MCI-s to MCI-c and AD a balanced accuracy of 82% was achieved with an AUC of 0.90. Considering the CTL versus AD contrast a balanced accuracy of 88% was achieved with an AUC of 0.95. For MCI-c versus MCI-s a balanced accuracy of 74% with an AUC of 0.80 were obtained.

Correlation with MMSE at 12 months

The correlation between the ORCHID score and MMSE scores at 12 months was calculated (Figure 2(b)). For the ADNI dataset, a correlation coefficient of −0.643 (p<0.0001, Spearman’s ρ) was obtained following 10-fold cross-validation For the AddNeuroMed dataset, a correlation coefficient of −0.589 (p<0.0001, Spearman’s ρ) was obtained using this data as an independent test set.

ADNI participants with unstable follow-up/late conversion

Ordinal regression trained on the ADNI dataset was tested using three subsets of the ADNI cohort: 1) Subjects with unstable diagnosis across follow-up time points including AD to CTL, MCI to CTL, AD to MCI and CTL subjects who progressed to MCI past 12 months. In their most recent follow-ups these subjects were labelled CTL or MCI-s. 2) Moderately late converters: subjects who convert to AD between the 12–24 month follow-up and 3) Late converters: subjects who convert to AD between the 24–36 month follow-up. In Figure 4, we map these subjects using the ORCHID score. For comparison, we also display distributions of the ORCHID scores for CTL and AD classes as well as MCI subjects who converted before 12 months. Subjects with unstable diagnoses overlap well with the CTL distribution, implying that these two distributions are similar (p>0.05, Kolmogorov-Smirnov test). For moderately late converters, the distribution is shifted towards AD with some overlap at the boundary. For the late converters, the distribution is more flat and spread across the continuum, implying these data are not accurately predicted by the model. The moderately late converters are likely to have been drawn from the same distribution as the converters at 12 months (p>0.05, Kolmogorov-Smirnov test) whereas the late converters are not drawn from the same sample (p = 0.00023, Kolmogorov-Smirnov test).

Figure 4. Distribution of the ORCHID (Ordinal Regression Characteristic Index of Dementia) score extracted from for the ADNI dataset representing the disease progressive continuum spanning CTL to MCI-s to MCI-c to AD.

Figure 4

The lower portion of each plot represents the distributions for the CTL class (green) and the AD class (red). (a) represents the distribution of subjects with unstable labels across follow-ups, most of which appear to belong to either the CTL or MCI-s classes (N = 24). (b) represents the distribution of those who convert to AD by 12 month follow-up (N = 62) (i.e. MCI-c: the sample used for training and testing the ADNI-based ORGP model). (c) represents the distribution of those who convert to AD between the 12 and 24 month follow-up (N = 58). (d) represents the distribution of those who convert to AD between the 24 and 36 month follow-up (N = 29).

Discussion

We have presented a novel application of ordinal regression which models structural brain changes as a continuum across healthy controls, stable MCI patients, MCI patients who convert to AD and AD patients. Conventionally, prediction of conversion to AD is achieved using binary classification trained on CTL and AD classes. Therefore, MCI classes are often not used to inform the discriminative model. We model all classes simultaneously as a continuum of disease progression. This is achieved using multivariate ordinal regression from which we can extract a probabilistic prediction of class membership as well as an index of AD progression – the ORCHID score. This method enables us to map subjects with unstable diagnosis (e.g. those who revert) and also those who are late converters to a continuum which is derived using all four classes as opposed to the extreme classes (CTL and AD).

A large body of literature exists on predicting conversion to AD from MCI [1][10]. Comparison between different studies in this literature is not trivial because of varying data types and feature construction, subject inclusion and exclusion criteria, criteria for determining conversion to AD, classification methods, classifier performance metrics, and most critically some papers that have not used independent training and test sets, ultimately resulting in biased (i.e. circular) analyses, as highlighted by Eskildsen et al. [3]. The contribution of this paper is a novel pattern recognition approach for AD diagnosis, therefore we have selected an appropriate subset of studies for comparison (i.e. having similar data selection heuristics and feature construction methods but varying pattern recognition techniques). The performance of ordinal regression in predicting conversion/stability at 12 months was 70% (AUC of 0.75) for the ADNI dataset. This is highly competitive considering recent studies. For example, Cuingnet et al. [1] achieved a balanced accuracy of 56% considering ROI-based cortical thickness (albeit excluding subcortical regions) and a balanced accuracy of 66% considering the volume of the hippocampus, both using support vector machine (SVM) classification. Using the same dataset as [1], Wolz et al. [10] achieved a balanced accuracy of 67% considering hippocampal volume only and linear discriminant classification. Eskildsen et al. [3] obtained comparable balanced accuracy (73%) and AUC (0.762) to the results reported here using cortical thickness measures and a linear discriminant classifier. However, these authors grouped subjects into time-homogenous groups of MCI converters, i.e. scans were selected 6, 12, 24 and 36 months prior to conversion. Using this rationale, 128 subjects fall into the 12 month converter group, which is approximately double the sample size used for training here (i.e. considering only the baseline data). Westman et al. [28] achieved a balanced accuracy of 63% using orthogonal partial least squares (OPLS) applied to measures of cortical thickness and subcortical volumes. Cui et al. [7] achieved a balanced accuracy of 63% using SVM on cortical thickness and subcortical volume features and considering follow-up to 24 months.

In order to optimally assess the generalizability of ordinal regression for predicting conversion to AD, we tested the model on a completely independent test set – the AddNeuroMed dataset. The balanced accuracy for predicting conversion/stability at 12 months was 75% (AUC of 0.81, sensitivity of 77% and specificity of 72%). Using an identical cohort and feature set to this study and OPLS, Westman et al. [28] obtained a balanced accuracy of 68% (sensitivity of 64%, specificity of 71%). The results reported by Westman et al. were obtained by training the OPLS classifier on the CTL and AD data from the combined AddNeuroMed and ADNI dataset and then tested on the MCI-s/-c classes from AddNeuroMed. In contrast, our results were obtained by applying ordinal regression to all four classes from the ADNI dataset and testing on the MCI-c/-s from AddNeuroMed. Given that both cohorts display similar patterns of brain atrophy [28], we suggest that the improvement in performance achieved here is likely to have been strongly driven by the use of ordinal regression to model brain atrophy as a continuum.

Fan et al. [4] published the first paper proposing a multivariate ranking approach for predicting conversion to AD. Fan et al. developed a pairwise ranking approach using binary classifiers with ordinal rules and presented their results using 4 class confusion matrices therefore, we can calculate the accuracy in the same manner as here. Using a subset of the ADNI dataset, a balanced accuracy of 54.2% was obtained for predicting conversion/stability. This is markedly lower than the performance obtained using our ordinal regression model which considers all classes simultaneously, albeit using a smaller sample and including later converters (>12months).

The spatial pattern of weights driving the ordinal regression model are presented in Figure 3. These weights are influenced by the value of either the cortical thickness or subcortical volume, the variance of the data and the parameterisation of the classifier. Therefore, a negative weight cannot be directly interpreted as an ROI displaying atrophy. Nonetheless we qualitatively note that the pattern of regions for ordinal regression is similar to discriminating regions for CTL vs. AD reported in previous studies [9], [28] with the entorhinal cortex, hippocampus and temporal lobe among the most negatively weighted regions and the ventricles and CSF among the most positively weighted regions.

Ordinal regression provides a single summary index of the state of disease progression - the ORCHID score. The rationale of the ORCHID score is similar to that of the SPARE-AD [2] and OPLS [29] indices. Specifically, a more positive ORCHID score implies a more AD-like brain structure and a more negative score implies a more CTL-like brain structure. The SPARE-AD study utilised a similar sized ADNI cohort, (170 MCI-s and 69 MCI-c) using CDR to define conversion; a balanced accuracy of 63% was obtained. For the OPLS index subjects from both the ADNI and AddNeuroMed datasets were used to achieve a balance accuracy of 68%. For comparison, we evaluated ordinal regression using both datasets and achieved a balanced accuracy of 74%. Both the SPARE-AD and OPLS indices are computed using the extremal ends of disease progression – CTL versus AD. In contrast, the ORCHID score developed here is derived from data across 4 states of disease progression (CTL to MCI-s to MCI-c to AD). Moreover, this method inherently models the ordinal nature of the states of disease progression which we infer from increasing levels of brain atrophy across the states. Ordinal regression also provides probabilistic predictions which enable us to recalibrate the predictions to account for the uneven number of subjects per class. Furthermore, ordinal regression was found to outperform binary classification using Gaussian processes [25], [30] (4% increase for ADNI, 6% increase for AddNeuroMed, Figure S1) trained in a similar manner to SPARE-AD and OPLS whereby the CTL versus AD subjects from ADNI were used for training and MCI-s/c subjects from AddNeuroMed and ADNI were used for testing (see Text S1).

The ORCHID score was also found to be significantly correlated with the MMSE score at 12 month follow-up for both the ADNI dataset ρ = −0.64 (cross-validation) and the AddNeuroMed dataset ρ = −0.59 (independent validation). Stonnington et al. [31] investigated the utility of baseline MRI for directly regressing baseline MMSE scores on structural imaging data. Using a similar sized ADNI cohort they achieved a correlation coefficient of ρ = 0.48. The magnitude of the correlation coefficient reported here is higher while also being compared to the MMSE at 12 month follow-up rather than baseline.

We employed the ORCHID score to help fractionate subjects with unstable diagnoses (e.g. reverters and healthy controls who later progressed to MCI), moderately late converters (12–24 months) and late converters (24–36 months). We found the distribution of the predictive mean for the unstable diagnosis group was similar to the distribution for healthy controls. We consider this to be appropriate given that most people in this group had reverted from a diagnosis of AD or MCI-c to CTL or MCI-s. The moderately late converters were found to have similar distribution to the 12 month converters. However, the late converters were found to have a significantly different distribution of predictive means. Qualitatively, the predictive means are spread across the continuum (Figure 4). This implies that late converters are more difficult to characterise which may be as at baseline they are at an early stage of disease progression in terms of brain pathology. This is in keeping with other studies which found that predictive performance was highest closer to the point of conversion [3], [32]. Adaszewsk et al. [32] proposed that atrophy is restricted to a few brain regions in the early stages of disease progression as atrophy in AD is assumed to follow the spreading pattern of beta amyloid of tau depositions with a delay of several years,. Therefore, the pattern of brain changes for late converters (24 to 36 months) may be distinct from the pattern of earlier converters and those already diagnosed with AD and hence this may explain why late converters are not well-characterised by the ORCHID score.

We have presented a probabilistic multivariate framework for ordinal regression. A comparison with results in the literature and direct comparison with a binary classifier suggests that the performance of this framework is highly competitive. To further explore the performance of this technique, it may be interesting to extend this framework to the multi-modal case incorporating multiple data sources. This may provide further improved performance if complimentary information is spread across data sources.

Methodologically, this technique offers several advantages: the inherent ordinal nature of the disease progression of AD is captured and two quantitative predictions are provided which can be readily interpreted. The first is a probabilistic prediction at the individual subject level, which indicates the categorical group membership (e.g. CTL-like or AD-like) and can be directly applied to assist clinical decision making, following appropriate validation. Moreover, the method quantifies the certainty of this prediction, which is equally crucial for clinical applications, for example to account for class imbalance [27]. The second type of prediction is the ORCHID score, which quantifies disease progression providing a visualisation of where an individual subject lies on a continuum spanning healthy to disease state. Overall, we propose that multivariate ordinal regression is a potentially powerful method for identifying those at risk of progressing to AD.

Supporting Information

Figure S1

Confusion matrices obtained for MCI stable versus converters from the ADNI and AddNeuroMed cohorts using a binary Gaussian process classification trained on CTL versus AD subjects from the ADNI cohort.

(TIF)

Figure S2

Confusion matrices for ordinal regression applied to the combined data from ADNI and AddNeuroMed using 10-fold cross validation. The confusion matrix for the binarised CTL-like vs. AD-like (CTL and MCI-s vs. MCI-c and AD) is displayed on the left. For illustration purposes, on the right confusion matrices for two contrasts of interest: CTL vs. AD and MCI-s vs. MCI-c (note: training scheme is unchanged).

(TIF)

Table S1

List of participants selected from the ADNI dataset for training and testing the ordinal regression model.

(DOCX)

Table S2

List of participants selected from the ADNI dataset for validating the ordinal regression model.

(DOCX)

Table S3

Variables included in the ordinal regression analysis. 57 variables in total, 34 cortical thickness measures and 23 volumetric measures.

(DOCX)

Text S1

(DOCX)

Acknowledgments

We would like to thank the centres who collected the data used in this study as well as the participants and their families. Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.ucla.edu) and the AddNeuroMed Study (http://www.innomed-addneuromed.com/). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. Michael Weiner (University of California, Michael.weiner@ucsf.edu) is the principle investigator of ADNI. A complete listing of ADNI investigators can be found at: http://adni.loni.ucla.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf. The AddNeuroMed Consortium includes AS, EW, SL, MT, PM, IK, HS and BV with SL (University of Oxford, simon.lovestone@psych.ox.ac.uk) as the principle investigator.

Data Availability

The authors confirm that all data underlying the findings are fully available without restriction. Data from AddNeuroMed are available from http://dataverse.brc.iop.kcl.ac.uk/dvn/dv/ANM Data from ADNI are available from: http://adni.loni.usc.edu.

Funding Statement

OMD is supported by the Innovative Medicines Initiative Joint Undertaking under Grant Agreement No 115008 (NEWMEDS). The Innovative Medicines Initiative Joint Undertaking is a public-private partnership between the European Union and the European Federation of Pharmaceutical Industries and Associations. AFM gratefully acknowledges support from King’s College London Centre of Excellence in Medical Engineering, funded by the Wellcome Trust and EPSRC under grant no. WT088641/Z/09/Z. EW wishes to thank Swedish Brain Power, the Strategic Research Programme in Neuroscience at Karolinska Institutet (StratNeuro), the regional agreement on medical training and clinical research between Stockholm County Council and Karolinska Institutet, Health Research Council of Academy of Finland, University of Eastern Finland UEFBRAIN and EVO funding from Kuopio University Hospital. AS is supported by funds from NIHR Biomedical Research Centre for Mental Health, the NIHR Biomedical Research Unit for Dementia at the South London and Maudsley NHS Foundation Trust and Institute of Psychiatry, Kings College London, Alzheimer’s Research UK and the Innovative Medicines Initiative funded European Medical Information Framework. The AddNeuroMed study was supported by InnoMed, (Innovative Medicines in Europe) an Integrated Project funded by the European Union of the Sixth Framework program priority FP6-2004-LIFESCIHEALTH-5, Life Sciences, Genomics and Biotechnology for Health. Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; BioClinica, Inc.; Biogen Idec Inc.; Bristol-Myers Squibb Company; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; GE HealtCTLare; Innogenetics, N.V.; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Medpace, Inc.; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Synarc Inc.; and Takeda Pharmaceutical Company. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer's Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of California, Los Angeles. This research was also supported by NIH grants P30 AG010129 and K01 AG030514. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Cuingnet R, Gerardin E, Tessieras J, Auzias G, Lehericy S, et al. (2011) Automatic classification of patients with Alzheimer’s disease from structural MRI: a comparison of ten methods using the ADNI database. Neuroimage 56: 766–781. [DOI] [PubMed] [Google Scholar]
  • 2.Davatzikos C, Bhatt P, Shaw LM, Batmanghelich KN, Trojanowski JQ (2011) Prediction of MCI to AD conversion, via MRI, CSF biomarkers, and pattern classification. Neurobiol Aging 32. [DOI] [PMC free article] [PubMed]
  • 3. Eskildsen SF, Coupe P, Garcia-Lorenzo D, Fonov V, Pruessner JC, et al. (2013) Prediction of Alzheimer’s disease in subjects with mild cognitive impairment from the ADNI cohort using patterns of cortical thinning. Neuroimage 65: 511–521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Fan Y, ADNI (2011) Ordinal ranking for detecting mild cognitive impairment and Alzheimer’s disease based on multimodal neuroimages and CSF biomarkers. Proceedings of the First international conference on multimodal brain image analysis. Toronto, Canada: Springer-Verlag. 44–51.
  • 5. Gray KR, Aljabar P, Heckemann RA, Hammers A, Rueckert D (2013) Random forest-based similarity measures for multi-modal classification of Alzheimer’s disease. Neuroimage 65: 167–175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Westman E, Muehlboeck JS, Simmons A (2012) Combining MRI and CSF measures for classification of Alzheimer’s disease and prediction of mild cognitive impairment conversion. Neuroimage 62: 229–238. [DOI] [PubMed] [Google Scholar]
  • 7. Cui Y, Liu B, Luo S, Zhen X, Fan M, et al. (2011) Identification of conversion from mild cognitive impairment to Alzheimer’s disease using multivariate predictors. PLoS One 6: e21896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Young J, Modat M, Cardoso MJ, Mendelson A, Cash D, et al. (2013) Accurate multimodal probabilistic prediction of conversion to Alzheimer’s disease in patients with mild cognitive impairment. NeuroImage: Clinical 2: 735–745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. McEvoy LK, Fennema-Notestine C, Roddey JC, Hagler DJ Jr, Holland D, et al. (2009) Alzheimer disease: quantitative structural neuroimaging for detection and prediction of clinical and structural changes in mild cognitive impairment. Radiology 251: 195–205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Wolz R, Julkunen V, Koikkalainen J, Niskanen E, Zhang DP, et al. (2011) Multi-method analysis of MRI images in early diagnostics of Alzheimer’s disease. PLoS One 6: e25446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kloppel S, Stonnington CM, Chu C, Draganski B, Scahill RI, et al. (2008) Automatic classification of MR scans in Alzheimers disease. Brain 131: 681–689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Leung KK, Bartlett JW, Barnes J, Manning EN, Ourselin S, et al. (2013) Cerebral atrophy in mild cognitive impairment and Alzheimer disease: rates and acceleration. Neurology 80: 648–654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Fox NC, Schott JM (2004) Imaging cerebral atrophy: normal ageing to Alzheimer’s disease. Lancet 363: 392–394. [DOI] [PubMed] [Google Scholar]
  • 14. Jack CR Jr, Shiung MM, Gunter JL, O’Brien PC, Weigand SD, et al. (2004) Comparison of different MRI brain atrophy rate measures with clinical disease progression in AD. Neurology 62: 591–600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Chu W, Ghahramani Z (2005) Gaussian processes for ordinal regression. J Mach Learn Res 6: 1019–1041. [Google Scholar]
  • 16. Sluimer JD, Vrenken H, Blankenstein MA, Fox NC, Scheltens P, et al. (2008) Whole-brain atrophy rate in Alzheimer disease: identifying fast progressors. Neurology 70: 1836–1841. [DOI] [PubMed] [Google Scholar]
  • 17. Jack CR Jr, Knopman DS, Jagust WJ, Shaw LM, Aisen PS, et al. (2010) Hypothetical model of dynamic biomarkers of the Alzheimer’s pathological cascade. Lancet Neurol 9: 119–128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Schuff N, Tosun D, Insel PS, Chiang GC, Truran D, et al. (2012) Nonlinear time course of brain volume loss in cognitively normal and impaired elders. Neurobiol Aging 33: 845–855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Hastie T, Tibshirani R, Friedman JH (2009) The elements of statistical learning: data mining, inference, and prediction. New York, NY: Springer. xxii, 745 p. p. [Google Scholar]
  • 20. Liu X, Tosun D, Weiner MW, Schuff N (2013) Locally linear embedding (LLE) for MRI based Alzheimer’s disease classification. Neuroimage 83C: 148–157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Fischl B, van der Kouwe A, Destrieux C, Halgren E, Segonne F, et al. (2004) Automatically parcellating the human cerebral cortex. Cereb Cortex 14: 11–22. [DOI] [PubMed] [Google Scholar]
  • 22. Westman E, Aguilar C, Muehlboeck JS, Simmons A (2013) Regional magnetic resonance imaging measures for multivariate analysis in Alzheimer’s disease and mild cognitive impairment. Brain Topogr 26: 9–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Walhovd KB, Westlye LT, Amlien I, Espeseth T, Reinvang I, et al. (2011) Consistent neuroanatomical age-related volume differences across multiple samples. Neurobiol Aging 32: 916–932. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Doyle OM, Ashburner J, Zelaya FO, Williams SC, Mehta MA, et al. (2013) Multivariate decoding of brain images using ordinal regression. Neuroimage 81C: 347–357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Rasmussen CE, Nickisch H (2010) Gaussian Processes for Machine Learning (GPML) Toolbox. J Mach Learn Res 11: 3011–3015. [Google Scholar]
  • 26.Powell MJD (1964) Efficient Method for Finding Minimum of Function of Several-Variables without Calculating Derivatives. Computer Journal 7: 155-&.
  • 27.Bishop CM (2006) Pattern recognition and machine learning. New York: Springer. xx, 738 p. [Google Scholar]
  • 28. Westman E, Simmons A, Muehlboeck JS, Mecocci P, Vellas B, et al. (2011) AddNeuroMed and ADNI: similar patterns of Alzheimer’s atrophy and automated MRI classification accuracy in Europe and North America. Neuroimage 58: 818–828. [DOI] [PubMed] [Google Scholar]
  • 29. Spulber G, Simmons A, Muehlboeck JS, Mecocci P, Vellas B, et al. (2013) An MRI-based index to measure the severity of Alzheimer’s disease-like structural pattern in subjects with mild cognitive impairment. J Intern Med 273: 396–409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Doyle OM, De Simoni S, Schwarz AJ, Brittain C, O’Daly OG, et al. (2013) Quantifying the attenuation of the ketamine pharmacological magnetic resonance imaging response in humans: a validation using antipsychotic and glutamatergic agents. J Pharmacol Exp Ther 345: 151–160. [DOI] [PubMed] [Google Scholar]
  • 31. Stonnington CM, Chu C, Kloppel S, Jack CR, Ashburner J, et al. (2010) Predicting clinical scores from magnetic resonance scans in Alzheimer’s disease. Neuroimage 51: 1405–1413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Adaszewski S, Dukart J, Kherif F, Frackowiak R, Draganski B (2013) How early can we predict Alzheimer’s disease using computational anatomy? Neurobiol Aging 34: 2815–2826. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1

Confusion matrices obtained for MCI stable versus converters from the ADNI and AddNeuroMed cohorts using a binary Gaussian process classification trained on CTL versus AD subjects from the ADNI cohort.

(TIF)

Figure S2

Confusion matrices for ordinal regression applied to the combined data from ADNI and AddNeuroMed using 10-fold cross validation. The confusion matrix for the binarised CTL-like vs. AD-like (CTL and MCI-s vs. MCI-c and AD) is displayed on the left. For illustration purposes, on the right confusion matrices for two contrasts of interest: CTL vs. AD and MCI-s vs. MCI-c (note: training scheme is unchanged).

(TIF)

Table S1

List of participants selected from the ADNI dataset for training and testing the ordinal regression model.

(DOCX)

Table S2

List of participants selected from the ADNI dataset for validating the ordinal regression model.

(DOCX)

Table S3

Variables included in the ordinal regression analysis. 57 variables in total, 34 cortical thickness measures and 23 volumetric measures.

(DOCX)

Text S1

(DOCX)

Data Availability Statement

The authors confirm that all data underlying the findings are fully available without restriction. Data from AddNeuroMed are available from http://dataverse.brc.iop.kcl.ac.uk/dvn/dv/ANM Data from ADNI are available from: http://adni.loni.usc.edu.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES