Skip to main content
. Author manuscript; available in PMC: 2019 Oct 29.
Published in final edited form as: Alzheimers Dement. 2017 Mar 22;13(4):e1–e85. doi: 10.1016/j.jalz.2016.11.007

Table 9.

Summary of classification and prediction methods

Classification challenge
Modality Feature selection/dimensionality reduction Classifier N AD versus CN MCI versus CN MCI versus AD MCIc versus MCInc Other Unique approach Reference
MRI Multi-atlas SVM 459 ACC 91.6 ACC 72.4 Subjects registered to multiple atlases, features selected from each atlas, then jointly selected [372]
SEN 88.6 SEN 72.1
SPE 93.9 SPE 72.6
AUC 0.87 AUC 0.67
MRI Multi-atlas SVM 459 ACC 90.7 ACC 73.7 Joint learning of optimum representation of features from multiple atlases and classifier using maximum margin approach [373]
SEN 87.6 SEN 76.4
SPE 93.0 SPE 70.8
MRI Multitemplate SVM 459 ACC 93.3 ACC 80.9 Inherent structure-based multiview learning (ISML) clusters subjects within a specific class into subclasses based on features learned using multiple templates [374]
SEN 92.8 SEN 86.0
SPE 95.7 SPE 78.4
MRI Independent component analysis SVM 818 ACC 86.4 ACC 70.2 ACC 69.8 Uses novel ICA-based feature extraction method and linear SVM for classification [375]
SEN 88.3 SEN 72.9 SEN 73.4
SPE 84.0 SPE 67.5 SPE 66.2
MRI Spatially weighted principal component analysis SVM 390 Average misclassification %: 0.216 Incorporates spatial structure by using both global and local spatial weights for feature selection to improve on PCA alone [376]
MRI Circular harmonic function + principal component analysis SVM NA ACC 83.8 ACC 69.5 ACC 62.1 Uses circular harmonic functions to extract local features from the hippocampus and posterior cingulate cortex and PCA for dimensionality reduction [377]
SEN 88.2 SEN 74.8 SEN 75.1
SPE 79.1 SPE 62.5 SPE 49.0
MRI Gaussian process models 415 AUC 0.94 AUC 0.91 Generates quantitative maps of z-scores for WM, GM, and CSF abnormalities for clinical decision support [378]
MCIc versus CN
MRI Manifold learning: relevant variable selection (learned ROIs) MR1 disease state score 363 ACC 61 Novel MRI disease state score (MRI DSS). Also used to predict MMSE scores and achieved separation between CN, EMCI, MCInc, MCIc, and AD groups [379]
ADNI-GO SEN 50
SPE 72
EMCI versus CN
SVM 511 ACC 86 ACC 71 ACC 82
ADNI SEN 86 SEN 75 SEN 81
SPE 85 SPE 67 SPE 83
MCIc versus CN
MRI Manifold learning: self-organizing map SVM 818 ACC 90 ACC 84 Uses self-organizing map to select maximally discriminative ROIs and computes relative importance measure for each ROI [380]
SEN 87 SEN 82
SPE 92 SPE 87
AUC 0.92 AUC 0.84
MRI Manifold learning: local binary map + custom masks SVM 363 ACC 82.8 ACC 61.5 Uses local binary maps to directly select disease-specific patterns on a voxelwise basis without nonrigid registration. Also uses seven knowledge-based ROIs [381]
SEN 80.4 SEN 61.5
SPE 82.7 SPE 63.5
AUC 0.87 AUC 0.64
MRI Multivariate regression: elastic net Low density separation 834 AUC 0.73 Used a semisupervised learning method, low density separation, which uses unlabeled data to improve prediction of progression over supervised methods [382]
MRI Multivariate regression: Ordinal regression ORCHID score: probabilistic (not binary) 564 ACC 91 ACC 70 Models all clinical groups simultaneously as a continuum of disease, generates a summary score, ORCHID (Ordinal Regression Characteristic Index of Dementia) [383]
AUC 0.95 AUC 0.75
MRI Multivariate regression: knockout strategy Random Forests 245 SEN 98.0 Iterative multivariate knockout algorithm that uses Random Forests to construct set of relevant features [384]
MRI Tree-structured sparse learning SVM 830 ACC 90.2 ACC 70.7 ACC 87.2 Uses new tree construction method to cluster similar discriminative voxels, then a tree-structured sparse learning step and SVM classifier [385]
SEN 85.3 SEN 56.2 SEN 80.1
SPE 94.3 SPE 80.9 SPE 92.2
MCIc versus CN
MRI FreeSurfer to select ROIs, cortical thickness measures v-one class SVM 814 ACC 84.3 ACC 54.4 Focuses on detection of outliers who have abnormal brain patterns in a cognitively normal group. Trained on CN. Produces a brain abnormality index. [386]
Labeled as outliers. In control group, ACC = 67.5%
MRI Recursive feature elimination significance (p) maps from SVM weights 509 AUC 0.92 AUC 0.74 AUC 0.69 AUC 0.61 Uses p-maps of SVM weights as selection features in a wrapper method [387]
MRI Recursive feature elimination—SVM 370 ACC 95.1 Wrapper method based on recursive feature elimination and SVM selected GM and WM ROIs for classification with no need for dimensionality reduction step [388]
MRI (VBM) Probability distribution function based SVM 260 ACC 89.7 Extracts statistical patterns at multiple levels including use of probability distribution function of VOIs to represent statistical patterns [389]
SEN 87.7
SPE 91.6
AUC 0.95
MRI Hierarchical fusion of multilevel classifiers 652 ACC 92.0 ACC 85.3 Divides classification of high dimensional features into multiple low-dimensional classification problems and integrates features hierarchically [390]
SEN 90.9 SEN 82.3
SPE 93.0 SPE 88.2
MRI Predefined ROIs and routine feature ranking (Pearson Correlation Coefficient) Distance informed metric learning 321 ACC 83.1 ACC 71.6 Semisupervised metric learning framework that uses pairwise constraints that specify the relative distance between a pair of patients, according to their classification group [391]
SEN 75.0 SEN 77.6
SPE 91.2 SPE 65.6
MRI Graph-based multiple instance learning SVM 834 ACC 89.2 ACC 69.3 Extracts local intensity patches as features and uses a novel graph-based multiple instance learning approach to assign disease labels to patches [392]
SEN 85.1 SEN 66.7
SPE 92.6 SPE 71.2
MRI t-tests of predefined cortical thickness measures C4.5 decision tree 364 SEN 80.0 Encodes selected features into an ontology and uses C4.5 algorithm for clinical decision support [393]
SPE 80.0
MRI Pseudo-Zernike moments Neural network 500 ACC 97.3 ACC 95.6 ACC 94.9 Uses Pseudo-Zernike moments (30) for feature selection and neural network scaled conjugate gradient back propagation algorithm as classifier [394]
SEN 96.6 SEN 95.9 SEN 94.2
SPE 97.8 SPE 95.3 SPE 95.6
fMRI Sparse inverse covariance estimation matrix + Kernel-based PCA SVM 82 ACC 73.2 Uses a sparse inverse covariance estimation technique to model brain connectivity [395]
Note: EMCI
fMRI Sparse learning using dictionary learning algorithm SVM 210 ACC 94.1 ACC 92.0 ACC 92.3 Functional features are represented in a weighted dictionary matrix [396]
Note: SMC
FDG PET Spatially weighted principal component regression 196 Average misclassification 0.117% Spatially weighted selection of both individual features and neighboring patterns [397]
FDG PET Voxel-based longitudinal SVM 233 ACC 92.6 ACC 70.2 Extension of [398] using whole-brain voxel approach for longitudinal analysis of FDG PET [399]
FDG PET Multi-channel pattern analysis + 3D Gabor wavelets feature extraction SVM 369 MAP 56.3 MAP 76.2 Multi-channel approach integrates multiple patterns of hypometabolism from different patient groups using different analysis tools [400]
FDG PET Gaussian mixture model SVM 84 ACC 89.1 ACC 63.2 ACC 80.2 Semi-data-driven approach to feature selection defines clusters of ROIs from an NC image and uses these to extract features from scans of MCI and AD patients [401]
SEN 92.0 SEN 65.0 SEN 80.0
SPE 86.0 SPE 61.0 SPE 80.0
FDG PET Optimization of preprocessing steps for SPM8 t-sum score 108 ACC 68.0 Modified preprocessing steps using motion correction, custom FDG template, different regions for intensity scaling and varied smoothing. [402]
SEN 70.0
SPE 68.0
AUC 0.83
MRI Recursive feature elimination Random Forests 575 ACC 89.6 SEN 78.0 Compares efficacy of Random Forest classifiers in several datasets including ADNI. RF model outperformed SVM, best model combined volumetric and cortical thickness measures with APOE [403]
APOE4 demographics SEN 90.7 Note: 12-month progression
SPE 82.9
MRI Regularized logistic regression Low density separation (SVM) 825 AUC 0.90 Uses semisupervised learning to construct an aggregate biomarker by leading a separate MRI biomarker first and subsequently combining it with age and cognitive measures [404]
Cognitive
MRI Multivariate regression: LASSO based + novel loss function SVM 202 ACC 95.9 ACC 82.0 ACC 72.6 Uses a novel loss function combined with a group lasso method for joint feature selection. Multimodal classification outperformed single modalities [405]
PET SEN 95.7 SEN 98.0 SEN 48.5
CSF SPE 98.6 SPE 60.1 SPE 94.4
AUC 0.99 AUC 87.0 AUC 0.79
MRI Multivariate regression: fused LASSO + LDDMM SVM 103 SEN 84.0 Comparison of spatial regularization techniques for detecting longitudinal hippocampal shape changes in MCI progression [406]
HC shape longitudinal SPE 48.0
NPV 0.93
PPV 0.27
MRI Multivariate regression: weak hierarchical LASSO Random Forest 293 ACC 74.8 Uses hierarchical constraints and sparsity regularization to capture underlying interactions between biosignatures [407]
APOE4 SEN 66.7
Cognitive SPE 81.4
Demographics
MRI Multitask learning:: K means clustering + l 2,1 regression SVM 202 ACC 95.1 ACC 79.5 Uses a novel clustering approach to discover mulitpeak data distributions. These are accounted for in a subsequent a multitask learning step in a l 2,1 regression framework [408]
FDG PET SEN 94.0 SEN 88.9
SPE 96.3 SPE 62.0
AUC 0.96 AUC 0.78
MRI Multitask learning: K means clustering + l 2,1 regression SVM 202 ACC 94.3 ACC 80.1 ACC 74.6 Similar approach to [408]. NOTE: MCI versus AD classification uses ONLY MRI + PET data [409]
FDG PET SEN 94.0 SEN 86.8 SEN 46.7
CSF SPE 94.3 SPE 67.3 SPE 89.0
AUC 0.96 AUC 0.82 AUC 0.72
MRI Multitask learning: relational function + l 2,1 regression SVM 202 ACC 95.7 ACC 79.9 ACC 72.4 Introduces a function to conserve information about feature-feature relations, response-response relations, and sample-sample relations while jointly solving both classification and prediction of continuous variables [410]
FDG PET SEN 96.6 SEN 97.0 SEN 49.1
SPE 98.2 SPE 59.2 SPE 94.6
AUC 0.98 AUC 0.85 AUC 0.83
MRI Multitask learning: label-aligned regularization SVM 202 ACC 95.6 ACC 80.3 ACC 69.8 Uses a label-aligned regularization term in multitask learning step to use relationships across subjects, as well as across modalities in the feature selection step [411]
FDG PET SEN 95.1 SEN 85.0 SEN 66.7
SPE 96.5 SPE 70.8 SPE 71.4
AUC 0.97 AUC 0.81 AUC 0.69
MRI Multitask learning: graph-guided + latent 199 ACC 93.0 ACC 80.0 Uses new latent group LASSO penalty combined with an undirected graph approach to select correlated features that can jointly predict class label and clinical scores [412]
FDG PET LASSO
CSF group penalty
MRI, demographics, clinical, APOE4, PICALM Statistical learning embedded in SVM-based multiple kernel framework 213 ACC 91.0 Longitudinal, multimodal data are leveraged by an embedded novel statistical learning approach in a multikernel framework. [413]
SEN 95.0
SPE 80.0
MRI Domain transfer feature and sample selection SVM 202 ACC 76.5 Uses both target domain (MCI) and auxiliary domains (CN and AD) to extract features from imaging and CSF modalities in domain transfer method. Fuses selected features with a domain transfer SVM [414]
FDG PET SEN 81.2
CSF SPE 71.9
AUC 0.84
MRI Multimodal manifold-regularized transfer learning 202 ACC 80.1 Uses auxiliary domain information (as in [414]). Integrates kernel-based maximum mean discrepancy criterion and a manifold regularization function into single learning algorithm for both feature selection and classification [415]
FDG PET SEN 85.3
CSF SPE 73.3
AUC 0.85
MRI Multimodal canonical correlation analysis 103 ACC 95.1 Used canonical correlation analysis to fuse data in multimodal classification as it preserves intermodal relationships. [416]
FDG PET
CSF
MRI Deep architecture + multitask learning SVM 202 ACC 95.1 ACC 80.193.9 ACC 74.1 Uses novel deep learning architecture that discards uninformative features in a hierarchical manner during multitask learning [417]
PET SEN 92 SEN 53.7 SEN 50.5
CSF SPE 98.3 SPE SPE 92.7
MRI/PET/CSF MRI/PET MRI/PET
MRI Deep learning with Deep Boltzman Machine Hierarchical SVM 398 ACC 95.4 ACC 85.7 ACC 75.9 Uses deep learning with a Boltzman Machine to find hierarchical feature representation from MRI features and then fuses these with complementary information from PET [418]
PET SEN 94.7 SEN 95.4 SEN 48
SPE 95.2 SPE 65.9 SPE 95.2
AUC 0.99 AUC 0.88 AUC 0.75
MRI PCA/LASSO + multitask deep learning with dropout technique SVM 202 ACC 91.4 ACC 77.4 ACC 57.4 Uses dropout technique as method of regularization for deep learning with small data [419]
PET
CSF
Demographics
MRI Independent component analysis Discriminant classification analysis 320 ACC 94.3 ACC 83.3 ACC 84.1 ACC 80.0 Derived GM covariates patterns using independent component analysis and used for diagnostic classification in combination with cognitive performance [304]
Demographics SEN 94.9 SEN 76.7 SEN 86.1 SEN 78.3
SPE 94.0 SPE 89.1 SPE 73.0 SPE 81.5
MRI Stacked auto-encoder + multitask deep learning with zero-mask strategy Softmax logistic regression 331 ACC 91.4 ACC 82.1 Uses stacked autoencoders to extract high-level features and a zero-masking strategy to extract synergy between modalities [420]
PET SEN 92.3 SEN 60.0
SPE 90.4 SPE 92.3

Abbreviations: AD, Alzheimer’s disease; CN, cognitively normal; MCI, mild cognitive impairment; MCIc, mild cognitive impairment converters (progressive MCI); MCInc, mild cognitive impairment non-converters (stable MCI); MRI, magnetic resonance imaging; SVM, support vector machine; ACC, accuracy; SEN, sensitivity; SPE, specificity; AUC, area under the receiver operating curve; ICA, independent component analysis; PCA, principal component analysis; WM, white matter; GM, gray matter; CSF, cerebrospinal fluid; ADNI, Alzheimer’s Disease Neuroimaging Initiative; EMCI, early mild cognitive impairment; ROI, regions of interest; VBM, voxel-based morphometry; fMRI, functional magnetic resonance imaging; SMC, subjective memory concern; FDG PET, 18F-flurodeoxyglucose positron emission tomography; MAP, mean average precision; APOE4, apolipoprotein ε4 allele; PET, positron emission tomography; HC, hippocampus; LASSO, least absolute shrinkage and selection operator; LDDMM, large deformations via diffeomorphisms.