Abstract
Alzheimer’s disease (AD) diagnosis often requires invasive examinations (e.g., liquor analyses), expensive tools (e.g., brain imaging) and highly specialized personnel. The diagnosis commonly is established when the disorder has already caused severe brain damage, and the clinical signs begin to be apparent. Instead, accessible and low-cost approaches for early identification of subjects at high risk for developing AD years before they show overt symptoms are fundamental to provide a critical time window for more effective clinical management, treatment, and care planning. This article proposes an ensemble-based machine learning algorithm for predicting AD development within 9 years from first overt signs and using just five clinical features that are easily detectable with neuropsychological tests. The validation of the system involved both healthy individuals and mild cognitive impairment (MCI) patients drawn from the ADNI open dataset, at variance with previous studies that considered only MCI. The system shows higher levels of balanced accuracy, negative predictive value, and specificity than other similar solutions. These results represent a further important step to build a preventive fast-screening machine-learning-based tool to be used as a part of routine healthcare screenings.
Keywords: ADAS score, Cerebellar impairment, Clinical Dementia Rating Scale, Early diagnosis, Machine learning, Renal and genitourinary dysfunctions
Introduction
Alzheimer’s disease (AD) is the most worldwide diffused neurodegenerative disorder affecting elders [1, 2]. It causes progressive impairments of memory, language, visuospatial skills, and executive functions together with progressive reduction of functional autonomy in daily life. Depression and apathy are also frequent in the early and middle stages of the disease, whereas neurological signs and motor impairments (e.g., dystonia, tremor) could emerge in later stages [3]. AD diagnosis is commonly based on the analysis of the patient’s medical history, clinical tests, clinical and neurological exams, and brain imaging data. Usually, the diagnostic evaluations are started when the first clinical symptoms begin to manifest. However, the progressive neurocognitive diseases underlying AD starts 10–15 years before deficits become clinically noticeable and disease is diagnosed [4]; therefore the diagnostic process takes usually place when severe damages of brain are already present [5–8].
The early, pre-clinical identification of individuals at high risk for developing AD is fundamental to provide a critical time window for early clinical management, treatment, and care planning, thus also reducing healthcare costs. Indeed, when supplied at the earlier pre-clinical disease phases, treatments could produce more important benefits [9, 10]. Moreover, during the pre-clinical stages lifestyle changes can be made that will slow or prevent AD development. For example, it could be possible to delay neurodegeneration by early modifying the exposure to certain risk factors such as hypertension, smoking, obesity, and diabetes [11, 12]. An early diagnosis and subsequent access to the proper services could help people live independently in their own homes for longer time and maintain a good quality of life for themselves, their families, and caregivers; also, it could allow people to plan and participate in their own legal, financial, and future support/care options and treatment when they still have the capacity to do so [13]. Early diagnosis gives patient’s relatives the time to adjust to the changes in function, mood, and personality that will occur when facing AD and their transition to a caregiver role, thus allowing them to feel more competent, acquire specific skills, reduce the stress and, as a consequence to suffer less from psychological problems such as anxiety and depression [14, 15].
Currently, MCI represents the earliest detectable stage of a potential ongoing progression toward AD. However, data indicate that only 20–40% of MCI individuals will convert to AD within 3 years from diagnosis [16, 17]. Researchers are investigating several promising biomarker candidates for AD onset anticipation, including brain imaging, proteins in cerebrospinal fluid (CSF), blood and urine tests, and genetic risk profiling [7, 8, 18]. Accuracy and timing are two critical aspects of these diagnostic approaches. While the literature shows that changes in biomarkers correlate with AD development, no single biomarker adequately predicts the conversion to AD of MCI patients and of healthy individuals, with an acceptable level of accuracy and well in advance with respect to the first manifestation of AD overt signs. Another critical aspect of current diagnostic approaches is that they require expensive tools (e.g., brain imaging), invasive clinical exams (amyloid-PET scan, CSF analysis), often also involving highly specialized personnel [13, 14].
Recent works support the use of Machine Learning (ML) tools into AD research and clinical practice to provide predictions with a certain degree of confidence, pivoting on information about the specific person (personalized medicine; [19–21]). These predictions support improved and more effective decision-making by researchers and clinicians [22, 23]. So far, many of these AI tools focus on predicting the AD conversion in MCI patients using different combinations of data from different sources, including genotyping, CSF biomarkers, brain imaging, demographic and clinical information, and cognitive performance ([18, 24–29]; see [30, 31], for recent reviews). Although some of these models could reach high levels of accuracy [32], consistency regarding what combination of features is more informative to predict AD as well as the translation into clinical practice are still lacking. One possible reason for this is that current AI algorithms still generally rely on expensive and invasive predictors, such as brain imaging or CSF biomarkers. As such, these studies only serve the purpose of a proof of concept, but do not represent a viable substitute of standard approaches with which they share application complexities and economic costs. To overcome these limitations, recent works proposed ML algorithms elaborating only non-invasive and easy-to-collect predictors (e.g., neuropsychological test scores, sociodemographic and clinical features, blood biomarkers) [20, 33].
In this paper, we developed, tested, and compared several ML algorithms and a weighted average rank ensemble ML system on the predictions provided by the various algorithms. The computer simulations show how the ensemble-based approach is a valuable AI tool for early detection of subjects at risk for developing AD. In particular, our system has four critical added values compared with similar approaches proposed in the literature. First, it extends the cohort of subjects by considering both healthy individuals and MCI patients drawn from the ADNI open dataset whereas previous studies mainly focused on MCI population; in this view, the system we proposed is aimed to provide a support for the early diagnosis in pre-clinical stages of AD in absence of MCI, that lacked in previous attempts. Second, it employs individuals whose diagnostic follow-up was available within 9 years after the baseline assessment. Most of the ML works proposed in literature focus on identifying biomarkers for early diagnosis starting from individuals whose diagnostic follow-up reached up to 3 years after the baseline assessment and mainly using a combination of neuroimaging, genetic and clinical data [34–36]. To the best of our knowledge, only few works investigated a greater time window to study the time point for conversion (from normal/MCI to AD) over 8 years using a combination of multi-scale genetic, neuroimaging and clinical data [37] or up to 5 years using MRI data [38]. The ML algorithm we proposed allows us to reach similar time windows (up to 9 years), but using only non-invasive and easily detectable clinical features. Third, it uses an optimized feature selection procedure to identify only five very easy-to-collect predictors based on neuropsychological test scores. This number of features is lower than that used by similar AI approaches [20, 33]. Finally, it shows higher balanced accuracy, negative predictive value, and specificity than previous similar approaches. Overall, these aspects make the AI system we propose here a clinically translatable early diagnostic tool to predict the conversion to AD within 9 years of healthy individuals and MCI patients, based on a low number of cost-effective, fast and easily collectable predictors.
Materials and methods
ADNI dataset
The data used in the preparation of this paper were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public–private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of the ADNI project has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of MCI and early AD. For the selection and extraction of the dataset, the data were imported into a MySql database. In order to obtain the best possible dataset, the imported data were checked, cleaned from errors and missing data (such as checking for null values), and organized for the next stage of processing to eliminate redundant or incomplete data and select high-quality data. The database consisted of several tables, one table for each file downloaded from ADNI. The cleaned and selected data were collapsed into a single table through SQL and exported to a single CSV file for subsequent processing.
Cohort chosen for the study
For this study, we employed data from n = 525 participants, using identification numbers (RID; each uniquely assigned to a subject). The data were downloaded on Jan 30, 2021. We first manually select 69 features (i.e., test scores) from the ADNI database based on their availability and facility administration in the clinical context (most are already routinely assessed in clinical practice, see below). We used data chosen from ADNI 1 first exam date, then we extracted the data on the same patients (based on RID) in ADNI 2 collected at last 5 years apart. We indicate each feature with the same name used in ADNI. In particular, for each recording related to each patient, we combined demographic measures (sex, age, marital status, handedness, education) (Table 1), data from different neuropsychological tests such as:
American National Adult Reading Test (ANARTERR) which is used to estimate premorbid verbal levels of intelligence in dementing individuals [39].
Boston Naming Test (BNTTOTAL) which is used to assess naming ability [40].
Category Fluency Test (CATANIMSC,CATANINTR,CATANPERS) which is a test used to measure ability to spontaneously generate a set of semantically related words in 1 min [39].
Clinical Dementia Rating (CDR) which is a five-point semi-structured interview between the patient and a reliable informant (e.g., caregivers) designed to stage the severity of dementia considering different aspects (memory (CDMEMORY), orientation (CDORIENT), judgment and problem solving (CDJUDGE), community affairs (CDCOMMUN), home and hobbies (CDHOME), personal care (CDCARE), global summary (CDGLOBAL)) [41, 42].
Clock Drawing Test (CLOCKSCOR, COPYSCOR) in which subjects draw a clock and set the hands to 10 after 11 [43]
Cognitive Subscale Alzheimer’s Disease Assessment Scale (ADAS14) (85 points including Q4 (Delayed Word Recall) and Q14 (Number Cancellation)) which is composed of two parts, the noncognitive subscale and the cognitive subscale, and returns a measure index of global cognition [44, 45].
Geriatric Depression Scale (GDTOTAL) which is a self-report assessment used to identify mood changes in elderly patients [46].
Neuropsychiatric Inventory Questionnaires, a short version of the Neuropsychiatric Inventory (NPISCORE), which is a brief self-administered questionnaire [47].
Mini Mental State (MMSCORE) which is a brief questionnaire measuring the global cognitive impairment [48].
Rey Auditory Verbal Learning Test (RAVLT_forgetting_bl, RAVLT_immediate_bl, RAVLT_learning_bl, RAVLT_perc_forgetting_bl) that is a cognitive test used to evaluate verbal learning and memory [49].
Trail Making Test (TRAASCOR,TRABSCOR), a test with two parts, the first is relative to psychomotor process, the second is relative to cognitive flexibility [50].
And other data such as:
Family history (FHQMOM = mother, FHQDAD = father, FHQSIB = siblings) relative to dementia.
Comorbidity with Parkinson’s disease (DXPARK).
Medical history diseases (psychiatric (MHPSYCH), neurological (MH2NEURL), head problem (MH3HEAD), cardiovascular (MH4CARD), respiratory (MH5RESP), hepatic (MH6HEPAT), dermatological (MH7DERM), musculoskeletal (MH8MUSCL), endocrine-metabolic (MH9ENDO), gastrointestinal (MH10GAST), hematopoietic-lymphatic (MH11HEMA), renal (MH12RENA), allergies (MH13ALLE), alcohol abuse (MH14ALCH), smoking (MH16SMOK), malignancy (MH17MALI), other kind of problems (MH19OTHR)).
Physical and neurological exams (general appearance (PXGENAPP), head general aspect (PXHEADEY), neck (PXNECK), chest (PXCHEST), heart (PXHEART), abdomen (PXABDOM), peripheral vascular (PXPERIPH), musculoskeletal (PXMUSCUL), visual (NXVISUAL) and auditory (NXAUDITO) impairment, tremor (NXTREMOR), cranial nerves (NXNERVE), motor strength (NXMOTOR), Cerebellar—Finger to Nose (NXFINGER), Cerebellar—Heel to Shin (NXHELL), sensory (NXSENSOR), deep tendon reflexes (NXTENDON), plantar reflexes (NXPLANTA), gait (NXGAIT)).
Table 1.
Subjects composition
| Variable | |
|---|---|
| Male | 292 |
| Female | 233 |
| Age (years) | [55 to 90]; mean 73 |
| Married | 397 |
| Divorced | 43 |
| MCI | 241 (EMCI = 55; LMCI = 186); conv. to AD 104 |
| NT | 284 (SMC = 59); conv. to AD 20 |
| Right hand | 474 |
| Left hand | 51 |
| Education (years) | [6 to 20]; mean 16 |
| Evaluation time lap (years) | [5 to 13]; mean 9 |
EMCI: early MCI; LMCI: late MCI; SMC: subjective memory complaints; NT: normotypical
Data pre-processing
A logistic lasso regression method was applied as supervised feature selection method with L1 regularization [51, 52]. We used this regularization method because it has the effect of keeping in the final model only the most significant features, in particular the method forces the coefficients of less discriminating features toward zero. Furthermore, to face the dataset unbalance we applied the class weights technique modifying the training algorithm to take into account the different numerosity of the classes, giving different weights to the majority and minority classes [53]. Before applying the method all data were subject to standardization (null mean and standard deviation equal to one) in order to homogenize the feature scale. The classification used two classes, ‘convert to Alzheimer’ and ‘non convert to Alzheimer’, as indicated by the last test of each participant of her/his dataset after the evaluation time lapse. As shown in Eq. 1, the logistic regression estimates a binary decision function where the logit can be modeled as a linear function of features:
| 1 |
where “i” is the index of sample, “q” the index of feature, and is the intercept and is coefficient of jth feature and with Y. The L1 penalty parameter is introduced into the model to reduce the estimates of the regression coefficients towards zero and to set some of them against the maximum likelihood estimates:
| 2 |
where L is the log-likelihood function and is the regularization parameter. We also perform standard statistical data analysis (Tables 2 and 3).
Table 2.
Descriptive statistics for the ordinal data of all subjects (525)
| Feature | Min | Max | Mean | Std. dev. |
|---|---|---|---|---|
| ADAS14 | 1 | 40 | 15.79 | 8.64 |
| AGE | 55 | 90 | 73.27 | 6.54 |
| ANARTERR | 0 | 47 | 10.78 | 8.39 |
| BNTTOTAL | 11 | 30 | 27.61 | 2.83 |
| CATANIMSC | 6 | 38 | 19.39 | 5.35 |
| CATANINTR | 0 | 6 | 0.06 | 0.46 |
| CATANPERS | 0 | 13 | 0.74 | 1.25 |
| CDCARE | 0 | 1 | 0.01 | 0.12 |
| CDCOMMUN | 0 | 1 | 0.07 | 0.18 |
| CDGLOBAL | 0 | 0.5 | 0.23 | 0.25 |
| CDHOME | 0 | 1 | 0.07 | 0.18 |
| CDJUDGE | 0 | 1 | 0.16 | 0.25 |
| CDMEMORY | 0 | 2 | 0.25 | 0.29 |
| CDORIENT | 0 | 2 | 0.10 | 0.22 |
| CLOCKSCOR | 1 | 5 | 4.59 | 0.71 |
| COPYSCOR | 0 | 5 | 4.81 | 0.50 |
| GDTOTAL | 0 | 6 | 1.13 | 1.29 |
| MMSCORE | 24 | 30 | 28.44 | 1.58 |
| NPISCORE | 0 | 17 | 1.25 | 2.23 |
| PTEDUCAT | 6 | 20 | 16.28 | 2.74 |
| RAVLT_forgetting_bl | − 3 | 15 | 4.31 | 2.51 |
| RAVLT_immediate_bl | 13 | 70 | 40.72 | 11.24 |
| RAVLT_learning_bl | 0 | 11 | 5.15 | 2.58 |
| RAVLT_perc_forgetting_bl | − 37.5 | 100 | 47.18 | 31.46 |
| TRAASCOR | 13 | 150 | 36.01 | 15.35 |
| TRABSCOR | 32 | 300 | 90.92 | 47.92 |
In bold the feature selected by the optimized procedure used for the features selection
Table 3.
Descriptive statistics for the nominal data of all subjects (525)
| Feature | Mode | Min | Max |
|---|---|---|---|
| DXPARK | 0 | 0 | 1 |
| FHQDAD | 0 | 0 | 2 |
| FHQMOM | 0 | 0 | 2 |
| FHQSIB | 1 | 0 | 1 |
| MHPSYCH | 0 | 0 | 1 |
| MH2NEURL | 0 | 0 | 1 |
| MH3HEAD | 1 | 0 | 1 |
| MH4CARD | 1 | 0 | 1 |
| MH5RESP | 0 | 0 | 1 |
| MH6HEPAT | 0 | 0 | 1 |
| MH7DERM | 0 | 0 | 1 |
| MH8MUSCL | 1 | 0 | 1 |
| MH9ENDO | 0 | 0 | 1 |
| MH10GAST | 0 | 0 | 1 |
| MH11HEMA | 0 | 0 | 1 |
| MH12RENA | 0 | 0 | 1 |
| MH13ALLE | 0 | 0 | 1 |
| MH14ALCH | 0 | 0 | 1 |
| MH16SMOK | 0 | 0 | 1 |
| MH17MALI | 0 | 0 | 1 |
| MH19OTHR | 0 | 0 | 1 |
| NXVISUAL | 1 | 1 | 2 |
| NXAUDITO | 1 | 1 | 2 |
| NXTREMOR | 1 | 1 | 2 |
| NXNERVE | 1 | 1 | 2 |
| NXMOTOR | 1 | 1 | 2 |
| NXFINGER | 1 | 1 | 2 |
| NXHEEL | 1 | 1 | 2 |
| NXSENSOR | 1 | 1 | 2 |
| NXTENDON | 1 | 1 | 2 |
| NXPLANTA | 1 | 1 | 2 |
| NXGAIT | 1 | 1 | 2 |
| PTGENDER | 1 | 1 | 2 |
| PTHAND | 1 | 1 | 2 |
| PTMARRY | 1 | 1 | 5 |
| PXGENAPP | 1 | 1 | 2 |
| PXHEADEY | 1 | 1 | 2 |
| PXNECK | 1 | 1 | 2 |
| PXCHEST | 1 | 1 | 2 |
| PXHEART | 1 | 1 | 2 |
| PXABDOM | 1 | 1 | 2 |
| PXPERIPH | 1 | 1 | 2 |
| PXMUSCUL | 1 | 1 | 2 |
In bold the feature selected by the optimized procedure used for the features selection
We selected the best parameter weighting the effect of the regularization of the feature selection algorithm through a tenfold cross-validation grid search on a range of the parameter described by the Python function logspace (0.1, 4, 20) that generates a row vector of 20 logarithmically spaced points between decades and . Small values of C imply a strong regularization which leads to find simple models underfitting the data. Large values of C imply a low regularization which allows a higher complexity of the model overfitting the data.
The features selection process used a tenfold cross-validation method. To this purpose, we divided the data into tenfolds (sets). Out of the tenfolds, nine sets were used for training while the remaining set was used for testing; this process was then repeated 10 times using a different fold for each test. The score used in the test directed to isolate the best C was based on the average recall of the two classes. This process led to find as the regularization value leading to the maximum scores. To select only the most relevant features and implement a tighter dimensionality reduction on the method with the best parameter C, we selected only features with a coefficient greater than 0.5. In this way, we apply a stricter feature selection by selecting only those features that have an odds ratio greater than and so a odds to have a discriminating impact greater of , in fact .
Classification model
To face the binary classification problem we used an Multi-Experts Ensemble model (MEE) composed of a random forest [54], a Neural Net [55], and a Support Vector Machine [56]. Ensemble methods usually produce more accurate solutions than single models do. This approach obtains the final prediction in the test phases by averaging the predictions of three classifiers with the hard majority voting rule. In developing the assembled classifier in addition to preliminary results, we chose a combination of classifiers that would allow us to analyze three different feature representation spaces based on the main learning paradigms Decision Tree (RF), Kernel Method (SVM) and Deep Learning (NN). To train the system and evaluate its performance we used the 10-Repeated-Nested-10-Fold-Cross-Validation procedure. In particular, we used this method to select the hyperparameters of each model of the ensemble classifier, and to achieve the average performance of ensemble method [57, 58]. In this way, we avoid model overfitting and optimistically biased estimates of model performance.
This procedure was composed of two cross-validation (CV) loops, each implementing a tenfold stratified CV:
In the outer CV loop designed to obtain an unbiased estimate of model performance, the dataset was partitioned into the ‘Model Development Set’ and the ‘Test Set’. This is schematized in the upper left part of Fig. 1;
For each iteration of outer CV loop, an entire inner CV loop was performed. The inner CV loop was designed to select the optimal hyperparameters for the final model through a Grid Search technique with the accuracy on validation set as selection score [59]. The ‘Model Development Set’ was further partitioned into the ‘Training Set’ and ’Validation Set’. This is schematized in the upper right part of Fig. 1.
The above reported whole procedure was repeated 10 times to verify the robustness of the method and the low influence of the initial random choice of the samples in the tenfolds. The completed procedure is outlined in the lower part of Fig. 1. Table 4 shows details of the three models forming the ensemble as well as the ranges of the hyperparameters used for the grid search. The neural network we used was composed of one hidden layer with rectified linear units, and one output layer with 2 logistic units. The network size was set small due to the small size of the input patterns and to avoid overfitting.
Fig. 1.
Nested tenfold cross-validation (CV) procedure for model development and evaluation. In the outer CV loop (on top left), the dataset was partitioned into the ‘Model Development Set’ and ‘Test Set’. In the inner CV loop (on top right), the ‘Model Development Set’ was further partitioned into the ‘Training Set’ and ‘Validation Set’. The inner loop was composed of tenfold cross-validation Grid Search with the aim of obtaining the best parameters for each of the three classifiers assembled. On the bottom of figure, the procedure for one single iteration of the outer CV loop is graphed in diagram form
Table 4.
Hyperparameters of the three models forming the MEE, and their range used by the grid search method
| Models | Hyperparameters | Range | |
|---|---|---|---|
| Ensemble proposed | Neural Net | Optimizer | {‘SGD’, ‘Adagrad’, ‘Adadelta’, ‘Adam’, ‘Adamax’, ‘Nadam’} |
| Batch size | {10, 20, 40, 60, 80, 100} | ||
| Epochs | {10, 50, 100} | ||
| Number of hidden units | {2:2:50} | ||
| Random Forest | Max depth | {5, 20, 50, 80, 110} | |
| Min samples for leaf | {3, 4, 5, 10]} | ||
| Min samples for split | {8, 10, 12, 24, 32} | ||
| Number of estimators | {30, 200, 300, 1000} | ||
| Support Vector Machine | C parameter | {0.1,1, 10, 100} | |
| parameter | {1, 0.1, 0.01, 0.001} | ||
| Kernel | {‘linear’, ‘poly’, ‘rbf’, ‘sigmoid’} |
Results and discussion
All tests were developed in Python and used Scikit-learn and Keras as main libraries [60]. The first key result of our study comes from the optimized procedure used for the features selection. This isolated only five critical features (on 69 initially considered, see Sect. 2.2) for very early prediction of AD development: one from the CDR, one from the ADAS14, two from the medical history questionnaires (MH3HEAD for Head, Eyes, Ears, Nose and Throat problems, and MH12RENA for Renal-Genitourinary problems), and one from neurological exams (NXHELL Cerebellar Heel to Shin, for cerebellar dysfunction). CDR and ADAS14 are two of the most common tests used in clinical practice for AD diagnosis and evaluation. CDR is a global clinical scale to evaluate different cognitive performances through six specific subscales with established diagnostic and severity-ranking utility and used for research in epidemiological studies and clinical trials as well as for patient evaluation in clinical practice [41, 61]. In particular, the optimized feature selection procedure described in Sect. 2 identified the CDR memory subscale (CDMEMORY) as one of the most relevant features to predict AD development. This result agrees with data suggesting that early episodic memory impairments related to pathologic changes in the hippocampus and entorhinal cortex are common AD initial symptoms. Several data show that memory impairment could be a good predictor for the conversion of MCI in AD [62], and memory dysfunctions could appear up to 7 years before AD diagnosis [63]. Aside from CDMEMORY, the features selection procedure underlined how the ADAS14 score is another critical feature to predict AD development. This result is in line with the crucial role that ADAS14 plays as a gold standard for assessing the efficacy of antidementia treatments [44, 45].
The optimized feature selection procedure also evidenced how some impairments (apparently) far from traditional AD neurodegenerative processes, like head injury and renal and cerebellar dysfunctions, could be critical features to predict AD development. Several studies support this result. Head injuries could lead to long-term problems with cognitive functioning and increase the risk of cognitive decline, which progresses faster in older individuals who suffer from head injuries than in those who did not [64, 65]. In addition, traumatic brain injuries could contribute to AD development, and if present in early or middle life, could increase the risk of late-life AD occurring [66–69].
There is complex pathophysiology of cognitive decline in chronic kidney disease (see [70] for a review). Kidney dysfunctions could contribute to impairments in semantic, episodic, and working memory. Furthermore, a lower estimated glomerular filtration rate at baseline was associated with a more rapid rate of cognitive decline [71].
Genetic mutations in Presenilin-1 protein have been described both in patients with cerebellar ataxia and in early AD onset [7, 72, 73]. In addition, MCI patients show lower cerebellar grey matter volumes compared with age-matched individuals, and total cerebellar grey matter volume decreases as the disease evolves. Furthermore, the decrease of cerebellar grey matter volume appears to be a predictable pattern to cerebellar grey matter atrophy in AD. This cerebellar impairment first affects the vermis and the posterior lobe and then the anterior lobe (for a review see [74]). Overall, these results suggest framing AD according to a system-level perspective, where the interactions between brain–body dysfunctions could be critical for early diagnosis [19, 75].
The second interesting result of the present study comes from the analysis of the predictive power of the ML algorithms. The first row of Table 5 reports the performance achieved by the proposed system in terms of sensitivity, specificity, accuracy, negative predictive value, balanced accuracy, and F1-score. To develop a complete comparison, we tested and optimized other classifiers belonging to different learning paradigms, including a Multi-Layer Perceptron (MLP) as a neural network, a k-Nearest Neighbor (kNN) as an instance-based classifier, a Support Vector Machine (SVM) as a kernel machine, a Naive Bayes (NB) as a Bayesian classifier, a Decision Tree (DT) as a non-parametric classifier model, a Logistic Regression (LR) as a probabilistic regression model for classification, and finally a Random Forest (RF) and a Adaptive Boosting (AdaBoost) as a classification ensemble. All these systems were tested and trained with the same technique described in Sect. 2.4.
Table 5.
Performance of the ML algorithms
| Classifier models | Sensitivity (%) | Specificity (%) | Precision (%) | NPV (%) | BA (%) | F1-score |
|---|---|---|---|---|---|---|
| MEE proposed | 73.5 | 88.3 | 68.3 | 91.5 | 80.9 | 70.8 |
| AdaBoost | 54.2 | 93.3 | 73.8 | 87.0 | 73.7 | 62.5 |
| MLP | 76.3 | 78.4 | 54.4 | 91.6 | 77.4 | 63.5 |
| NB | 48.5 | 90.0 | 61.1 | 85.4 | 69.4 | 54.1 |
| DT | 59.1 | 87.0 | 66.0 | 87.5 | 73.1 | 62.4 |
| KNN | 52.8 | 93.5 | 73.4 | 87.4 | 74.6 | 61.4 |
| LR | 75.0 | 79.5 | 55.2 | 91.3 | 77.3 | 63.6 |
| RF | 68.6 | 86.0 | 63.1 | 89.9 | 77.3 | 65.7 |
| SVM | 65.6 | 89.0 | 66.9 | 89.4 | 77.3 | 66.2 |
Sensitivity: ratio between the AD converter subjects correctly labeled by the algorithm and all subjects that actually converted; Specificity: ratio between the non-AD converter subjects correctly labeled by the algorithm and all subjects that have not actually converted; Precision: ratio between the correctly AD converter subjects labeled by the algorithm and the AD converters; Negative predictive value (NPV): the proportion of predicted negatives which are real negatives. It reflects the probability that a predicted negative is a true negative; Balanced accuracy (BA): the average between sensitivity and specificity; F1-score: the harmonic average of the sensitivity and precision
For all systems, the values of the hyperparameters that were most frequently found to be optimal during the optimization procedure and the average score obtained with the grid search are reported in Table 6, whereas their performance is reported in the remaining rows of Table 5.
Table 6.
Reports for each method the average score obtained during Grid Search and values of hyperparameters most frequently selected during k-fold nested-cross-validation
| Classifier models | Average score Accuracy on validation set (%) |
Best hyperparameters | |
|---|---|---|---|
| Ensemble proposed | NN | 78.5 | Optimizer: Adam |
| Batch size: 60 | |||
| Epochs: 100 | |||
| Number of hidden units: 32 | |||
| RF | 81.7 | Max depth: 80 | |
| Min samples for leaf: 3 | |||
| Min samples for split: 12 | |||
| Number of estimators: 100 | |||
| SVM | 83.5 | C: 0.1; | |
| : 1; | |||
| Kernel: radial basic function | |||
| AdaBoost | 80.6 | Algorithm: SAMME | |
| Learning rate: 0.1 | |||
| Number of estimators: 250 | |||
| MLP | 72.3 | Activation: identity | |
| Batch size: 20 | |||
| Epochs: 80 | |||
| Optimizer: Adam | |||
| Number of hidden units: 16 | |||
| NB | – | – | |
| DT | 78.3 | Criterion of split: Gini | |
| Max depth: 2 | |||
| Min samples for leaf: 5 | |||
| Split method: best | |||
| KNN | 54.2 | Distance metric: manhattan | |
| Number of neighbors: 19 | |||
| LR | 78.6 | C: 0.0885 | |
| Penalty: L1 | |||
| Solver: Newton-cg |
Table 5 shows that the ensemble solution produces, on average, better predictive performances than the other algorithms we tested. In addition, compared with similar works that used only non-invasive and easily detectable clinical features [20], our system has a better negative predictive power. In particular, it can predict if a subject will not develop AD with higher performances in terms of specificity, negative predictive value, and balanced accuracy. This result could be critical for developing fast-screening protocols. The other metrics (sensitivity, precision, and F1-score) are similar to those obtained with similar approaches proposed in literature. These works, however, used a substantially larger number of features, prediction windows up to 3 years, and focused only on MCI patients. The ensemble-based ML algorithm proposed here can predict AD development within 9 years from first overt signs not only in MCI patients, but also in healthy individuals.
Despite these encouraging results, future improvements of our approach, for example in terms of generalization, could be obtained by enhancing the heterogeneity of the training set, and including data from different countries (e.g., Asia and Europe). In this way, it would be possible to detect different lifestyle and epigenetic elements that could act as risk or protective factors in AD development.
Conclusion
The current approaches for AD diagnosis often require invasive and expensive tools (e.g., brain imaging) and highly specialized personnel, and start at a time-point where the disorder has already caused severe brain damages, the underlying neuropathology may be less sensitive to treatment and the clinical signs are apparent [5, 6, 8]. A critical challenge of our years is to develop an artificial tool able to detect AD onset with many years of advances in order to limit or stop symptoms altogether ([36] for a review). Several works try to answer this question by integrating different aspects of AD pathophysiology, such as neuroimaging, plasma biomarkers, and genetic data [76–79]. The proposed approaches could be very accurate, but also expensive. This aspect could limit their use since another challenge is to make early diagnosis accessible to all [80]. Moreover, most of the ML works proposed in literature focus on identifying biomarkers for early diagnosis starting from individuals whose diagnostic follow-up reached up to 3 years [34, 35].
This article proposes an ensemble-based ML algorithm for predicting AD development within 9 years from first overt signs and using only five non-invasive and easily detectable clinical tests. The results we obtained represent a first important step towards building a preventive fast-screening machine-learning tool usable as part of a routine healthcare visit. In this way, it could help to identify individuals that might develop AD at an early pre-clinical stage and in cost-effective ways without raising undue anxiety associated with attending a specialized clinic [13].
Acknowledgements
This research was supported by the Advanced School in Artificial Intelligence (www.as-ai.org) and by AI2Life s.r.l. (www.ai2life.com). Data collection and sharing for this project was funded by the ADNI (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
Author contributions
MM: conceptualization, formal analysis, investigation, methodology, software, validation, visualization, supervision, writing—original draft, writing—review and editing; SLD’A: data curation, formal analysis, investigation, resources, software, validation, visualization, writing—original draft, writing—review and editing; PM: data curation, resources, software, validation, writing—review and editing; FB: formal analysis, investigation, software, validation, writing—review and editing; CG: resources, validation, writing—review and editing; RV: validation, writing—review and editing; AC: funding acquisition, investigation, software; GB: funding acquisition, methodology, validation, writing—review and editing; MS: methodology, validation, writing—review and editing; DC: conceptualization, funding acquisition, methodology, project administration, supervision, validation, visualization, writing—original draft, writing—review and editing. All authors read and approved the final manuscript.
Funding
This research was supported by the FETPROACT-EIC IM-TWIN project, Grant Number 952095.
Availability of data and materials
Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu), which is easily available for download from the Laboratory of Neuroimaging (LONI) website to the research public. As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data, but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.
Code availability
Not applicable.
Declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Wolters FJ, Chibnik LB, Waziry R, et al. Twenty-seven-year time trends in dementia incidence in Europe and the United States. Neurology. 2020;95:e519–e531. doi: 10.1212/WNL.0000000000010022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zhang XX, Tian Y, Wang ZT, et al. The epidemiology of Alzheimer’s disease modifiable risk factors and prevention. J Prev Alzheimer’s Dis. 2021;8:313–321. doi: 10.14283/jpad.2021.15. [DOI] [PubMed] [Google Scholar]
- 3.Scheltens P, Strooper BD, Kivipelto M, et al. Alzheimer’s disease. Lancet. 2021;397:1577–1590. doi: 10.1016/S0140-6736(20)32205-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Amieva H, Le Goff M, Millet X, et al. Prodromal Alzheimer’s disease: successive emergence of the clinical symptoms. Ann Neurol. 2008;64:492–498. doi: 10.1002/ana.21509. [DOI] [PubMed] [Google Scholar]
- 5.Beason-Held LL, Goh JO, An Y, et al. Changes in brain function occur years before the onset of cognitive impairment. J Neurosci. 2013;33:18008–18014. doi: 10.1523/JNEUROSCI.1402-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rajan KB, Wilson RS, Weuve J, et al. Cognitive impairment 18 years before clinical diagnosis of Alzheimer disease dementia. Neurology. 2015;85:898–904. doi: 10.1212/WNL.0000000000001774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Reiman EM, Quiroz YT, Fleisher AS, et al. Brain imaging and fluid biomarker analysis in young adults at genetic risk for autosomal dominant Alzheimer’s disease in the presenilin 1 E280A kindred: a case-control study. Lancet Neurol. 2012;11:1048–1056. doi: 10.1016/S1474-4422(12)70228-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Younes L, Albert M, Moghekar A, et al. Identifying changepoints in biomarkers during the preclinical phase of Alzheimer’s disease. Front Aging Neurosci. 2019;11:74. doi: 10.3389/FNAGI.2019.00074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Isaacson R, Ganzer C, Hristov H, et al. The clinical practice of risk reduction for Alzheimer’s disease: a precision medicine approach. Alzheimer’s & Dementia. 2018;14:1663–1673. doi: 10.1016/j.jalz.2018.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yiannopoulou KG, Papageorgiou SG. Current and future treatments in Alzheimer disease: an update. J Cent Nervous Syst Dis. 2020 doi: 10.1177/1179573520907397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Matthews FE, Stephan BCM, Robinson L, et al. A two decade dementia incidence comparison from the Cognitive Function and Ageing Studies I and II. Nat Commun. 2016;7:1–8. doi: 10.1038/ncomms11398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Norton S, Matthews F, Barnes D, et al. Potential for primary prevention of Alzheimer’s disease: an analysis of population-based data. Lancet Neurol. 2014;13:788–794. doi: 10.1016/S1474-4422(14)70136-X. [DOI] [PubMed] [Google Scholar]
- 13.Rasmussen J, Langerman H. Alzheimer’s disease—why we need early diagnosis. Degener Neurol Neuromuscul Dis. 2019;9:123–130. doi: 10.2147/DNND.S228939. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.De Vugt ME, Verhey FR. The impact of early dementia diagnosis and intervention on informal caregivers. Prog Neurobiol. 2013;110:54–62. doi: 10.1016/j.pneurobio.2013.04.005. [DOI] [PubMed] [Google Scholar]
- 15.Frias CE, Cabrera E, Zabalegui A. Informal caregivers’ roles in dementia: the impact on their quality of life. Life (Basel) 2020;10:251. doi: 10.3390/life10110251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Petersen R, Parisi J, Dickson D, et al. Neuropathologic features of amnestic mild cognitive impairment. Arch Neurol. 2006;63:665–672. doi: 10.1001/archneur.63.5.665. [DOI] [PubMed] [Google Scholar]
- 17.Roberts R, Knopman D, Mielke M, et al. Higher risk of progression to dementia in mild cognitive impairment cases who revert to normal. Neurology. 2014;82:317–325. doi: 10.1212/WNL.0000000000000055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Dukart J, Sambataro F, Bertolino A. Accurate prediction of conversion to Alzheimer’s disease using imaging, genetic, and neuropsychological biomarkers. J Alzheimer’s Dis. 2015;49:1143–1159. doi: 10.3233/JAD-150570. [DOI] [PubMed] [Google Scholar]
- 19.Caligiore D, Silvetti M, D’Amelio M, et al. Computational modeling of catecholamines dysfunction in Alzheimer’s disease at pre-plaque stage. J Alzheimer’s Dis. 2020;77:275–290. doi: 10.3233/JAD-200276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Grassi M, Rouleaux N, Caldirola D, et al. A novel ensemble-based machine learning algorithm to predict the conversion from mild cognitive impairment to Alzheimer’s disease using socio-demographic characteristics, clinical information, and neuropsychological measures. Front Neurol. 2019;10:756. doi: 10.3389/fneur.2019.00756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Moustafa AA. Alzheimer’s disease : understanding biomarkers, big data, and therapy. London: Academic Press; 2021. [Google Scholar]
- 22.Hampel H, Vergallo A, Perry G, et al. The Alzheimer precision medicine initiative. J Alzheimer’s Dis. 2019;68:1–24. doi: 10.3233/JAD-181121. [DOI] [PubMed] [Google Scholar]
- 23.Perna G, Grassi M, Caldirola D, et al. The revolution of personalized psychiatry: will technology make it happen sooner? Psychol Med. 2018;48:705–713. doi: 10.1017/S0033291717002859. [DOI] [PubMed] [Google Scholar]
- 24.Grassi M, Perna G, Caldirola D, et al. A clinically-translatable machine learning algorithm for the prediction of Alzheimer’s disease conversion in individuals with mild and premild cognitive impairment. J Alzheimer’s Dis. 2018;61:1555–1573. doi: 10.3233/JAD-170547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hojjati S, Ebrahimzadeh A, Khazaee A, et al. Predicting conversion from MCI to AD using resting-state fMRI, graph theoretical approach and SVM. J Neurosci Methods. 2017;282:69–80. doi: 10.1016/j.jneumeth.2017.03.006. [DOI] [PubMed] [Google Scholar]
- 26.Liu M, Cheng D, Wang K, et al. Multi-modality cascaded convolutional neural networks for Alzheimer’s disease diagnosis. Neuroinformatics. 2018;16:295–308. doi: 10.1007/s12021-018-9370-4. [DOI] [PubMed] [Google Scholar]
- 27.Long X, Chen L, Jiang C, et al. Prediction and classification of Alzheimer disease based on quantification of MRI deformation. PLOS ONE. 2017;12:e0173372. doi: 10.1371/JOURNAL.PONE.0173372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Pan D, Zeng A, Jia L, et al. Early detection of Alzheimer’s disease using magnetic resonance imaging: a novel approach combining convolutional neural networks and ensemble learning. Front Neurosci. 2020;14:259. doi: 10.3389/fnins.2020.00259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Platero C, Lin L, Tobar MC. Longitudinal neuroimaging hippocampal markers for diagnosing Alzheimer’s disease. Neuroinformatics. 2019;17:43–61. doi: 10.1007/s12021-018-9380-2. [DOI] [PubMed] [Google Scholar]
- 30.Grueso S, Viejo-Sobera R. Machine learning methods for predicting progression from mild cognitive impairment to Alzheimer’s disease dementia: a systematic review. Alzheimer’s Res Ther. 2021;13:1–29. doi: 10.1186/s13195-020-00736-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Pradhan N, Singh AS, Singh A (2021) Alzheimer disease early diagnosis and prediction using deep learning techniques: a survey. In: Recent trends in communication and electronics, pp 590–593
- 32.Odusami M, Maskeliūnas R, Damaševičius R, et al. Analysis of features of Alzheimer’s disease: detection of early stage from functional brain changes in magnetic resonance images using a Finetuned ResNet18 Network. Diagnostics. 2021;11:1071. doi: 10.3390/diagnostics11061071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Beltran J, Wahba B, Hose N, et al. Inexpensive, non-invasive biomarkers predict Alzheimer transition using machine learning analysis of the Alzheimer’s Disease Neuroimaging (ADNI) database. PLoS ONE. 2020;15:e0235663. doi: 10.1371/journal.pone.0235663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cammisuli DM, Cipriani G, Castelnuovo G. Technological solutions for diagnosis, management and treatment of Alzheimer’s disease-related symptoms: a structured review of the recent scientific literature. Int J Environ Res Public Health. 2022 doi: 10.3390/IJERPH19053122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Odusami M, Maskeliūnas R, Damaševičius R. An intelligent system for early recognition of Alzheimerrsquo;s disease using neuroimaging. Sensors. 2022 doi: 10.3390/S22030740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Silva-Spínola A, Baldeiras I, Arrais JP, et al. The road to personalized medicine in Alzheimer’s disease: the use of artificial intelligence. Biomedicines. 2022 doi: 10.3390/BIOMEDICINES10020315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Khanna S, Domingo-Fernández D, Iyappan A, et al. Using multi-scale genetic, neuroimaging and clinical data for predicting Alzheimer’s disease and reconstruction of relevant biological mechanisms. Sci Rep. 2018 doi: 10.1038/S41598-018-29433-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Moscoso A, Silva-Rodríguez J, Aldrey JM, et al. Prediction of Alzheimer’s disease dementia with MRI beyond the short-term: implications for the design of predictive models. NeuroImage: Clin. 2019 doi: 10.1016/j.nicl.2019.101837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Battista P, Salvatore C, Castiglioni I. Optimizing neuropsychological assessments for cognitive, behavioral, and functional impairment classification: a machine learning study. Behav Neurol. 2017 doi: 10.1155/2017/1850909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kaplan E, Goodglass H, Weintraub S. Boston naming test. Philadelphia: Lea & Febiger; 1983. [Google Scholar]
- 41.Hughes CP, Berg L, Danziger WL, et al. A new clinical scale for the staging of Dementia. Br J Psychiatry. 1982;140:566–572. doi: 10.1192/bjp.140.6.566. [DOI] [PubMed] [Google Scholar]
- 42.Morris JC. The Clinical Dementia Rating (CDR): current version and scoring rules. Neurology. 1993;43:2412–2414. doi: 10.1212/WNL.43.1_Part_1.241-a. [DOI] [PubMed] [Google Scholar]
- 43.Pinto E, Peters R. Literature review of the Clock Drawing Test as a tool for cognitive screening. Dementia Geriatr Cognit Disord. 2009;27:201–213. doi: 10.1159/000203344. [DOI] [PubMed] [Google Scholar]
- 44.Kueper JK, Speechley M, Montero-Odasso M. The Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-Cog): modifications and responsiveness in pre-dementia populations. A narrative review. J Alzheimer’s Dis. 2018;63:423–444. doi: 10.3233/JAD-170991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Rosen W, Mohs R, Davis K. A new rating scale for Alzheimer’s disease. Am J Psychiatry. 1984;141:1356–1364. doi: 10.1176/ajp.141.11.1356. [DOI] [PubMed] [Google Scholar]
- 46.Yesavage JA, Brink TL, Rose TL, et al. Development and validation of a geriatric depression screening scale: a preliminary report. J Psychiatr Res. 1982;17:37–49. doi: 10.1016/0022-3956(82)90033-4. [DOI] [PubMed] [Google Scholar]
- 47.Cummings JL, Mega M, Gray K, et al. The Neuropsychiatric Inventory: comprehensive assessment of psychopathology in dementia. Neurology. 1994;44:2308–2314. doi: 10.1212/WNL.44.12.2308. [DOI] [PubMed] [Google Scholar]
- 48.Folstein MF, Robins LN, Helzer JE. The mini-mental state examination. Arch Gen Psychiatry. 1983;40:812. doi: 10.1001/archpsyc.1983.01790060110016. [DOI] [PubMed] [Google Scholar]
- 49.Rey A. The clinical psychological examination. Paris: Presses Universitaires de France; 1964. [Google Scholar]
- 50.Reitan RM. Trail making test results for normal and brain-damaged children. Percept Motor Skills. 1971;33:575–581. doi: 10.2466/pms.1971.33.2.575. [DOI] [PubMed] [Google Scholar]
- 51.Fonti V, Belitser E (2017) Feature selection using lasso. In: VU Amsterdam Research Paper in Business Analytics 30:1–25
- 52.Muthukrishnan R, Rohini R (2016) Lasso: a feature selection technique in predictive modeling for machine learning. In: 2016 IEEE international conference on advances in computer applications (ICACA), IEEE, pp 18–20
- 53.Bekkar M, Alitouche TA. Imbalanced data learning approaches review. Int J Data Mining Knowl Manag Process. 2013;3:15–33. doi: 10.5121/ijdkp.2013.3402. [DOI] [Google Scholar]
- 54.Cutler A, Cutler DR, Stevens JR (2012) Random forests. In: Ensemble machine learning. Springer, p 157–175
- 55.Goodfellow I, Bengio Y, Courville A. Deep learning. Cambridge: MIT Press; 2016. [Google Scholar]
- 56.Svensén M, Bishop CM. Pattern recognition and machine learning. Berlin: Springer; 2007. [Google Scholar]
- 57.Abdar M, Zomorodi-Moghadam M, Zhou X, et al. A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recogn Lett. 2020;132:123–131. doi: 10.1016/j.patrec.2018.11.004. [DOI] [Google Scholar]
- 58.Zhong Y, Chalise P, He J (2020) Nested cross-validation with ensemble feature selection and classification model for high-dimensional biological data. In: Communications in statistics-simulation and computation, pp 1–18
- 59.Ndiaye E, Le T, Fercoq O, et al (2019) Safe grid search with optimal complexity. In: International conference on machine learning, PMLR, pp 4771–4780
- 60.Kyriakides G, Margaritis KG. Hands-on ensemble learning with python: build highly optimized ensemble machine learning models using scikit-learn and Keras. Birmingham: Packt Publishing Ltd; 2019. [Google Scholar]
- 61.Lim WS, Chin JJ, Lam CK, et al. Clinical dementia rating experience of a multi-racial Asian population. Alzheimer Dis Assoc Disord. 2005;19:135–142. doi: 10.1097/01.wad.0000174991.60709.36. [DOI] [PubMed] [Google Scholar]
- 62.Lee YM, Park JM, Lee BD, et al. Memory impairment, in mild cognitive impairment without significant cerebrovascular disease, predicts progression to Alzheimer’s disease. Dementia Geriatr Cognit Disord. 2012;33:240–244. doi: 10.1159/000339159. [DOI] [PubMed] [Google Scholar]
- 63.Grober E, Cb Hall, Lipton RB, et al. Memory impairment, executive dysfunction, and intellectual decline in preclinical Alzheimer’s disease. J Int Neuropsychol Soc. 2008;14:266–278. doi: 10.1017/S1355617708080302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Luukinen H, Viramo P, Koski K, et al. Head injuries and cognitive decline among older adults a population-based study. Neurology. 1999;52:557–557. doi: 10.1212/WNL.52.3.557. [DOI] [PubMed] [Google Scholar]
- 65.Whiteneck GG, Gerhart KA, Cusick CP. Identifying environmental factors that influence the outcomes of people with traumatic brain injury. J Head Trauma Rehabil. 2004;19:191–204. doi: 10.1097/00001199-200405000-00001. [DOI] [PubMed] [Google Scholar]
- 66.Plassman BL, Havlik RJ, Steffens DC, et al. Documented head injury in early adulthood and risk of Alzheimer’s disease and other dementias. Neurology. 2000;55:1158–1166. doi: 10.1212/WNL.55.8.1158. [DOI] [PubMed] [Google Scholar]
- 67.Rasmusson D, Brandt J, Martin D, et al. Head injury as a risk factor in Alzheimer’s disease. Brain Inj. 1995;9:213–219. doi: 10.3109/02699059509008194. [DOI] [PubMed] [Google Scholar]
- 68.Schofield P, Tang M, Marder K, et al. Alzheimer’s disease after remote head injury: an incidence study. J Neurol Neurosurg Psychiatry. 1997;62:119–124. doi: 10.1136/jnnp.62.2.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Sivanandam TM, Thakur MK. Traumatic brain injury: a risk factor for Alzheimer’s disease. Neurosci Biobehav Rev. 2012;36:1376–1381. doi: 10.1016/j.neubiorev.2012.02.013. [DOI] [PubMed] [Google Scholar]
- 70.Etgen T. Kidney disease as a determinant of cognitive decline and dementia. Alzheimer’s Res Ther. 2015;7:29. doi: 10.1186/s13195-015-0115-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Buchman AS, Tanne D, Boyle PA, et al. Kidney function is associated with the rate of cognitive decline in the elderly. Neurology. 2009;73:920–927. doi: 10.1212/WNL.0b013e3181b72629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Braga-Neto P, Pedroso JL, Alessi H, et al. Early-onset familial Alzheimer’s disease related to presenilin 1 mutation resembling autosomal dominant spinocerebellar ataxia. J Neurol. 2013;260:1177–1179. doi: 10.1007/s00415-013-6879-1. [DOI] [PubMed] [Google Scholar]
- 73.Testi S, Peluso S, Fabrizi GM, et al. A novel PSEN1 mutation in a patient with sporadic early-onset Alzheimer’s disease and prominent cerebellar ataxia. J Alzheimer’s Dis. 2014;41:709–714. doi: 10.3233/JAD-140081. [DOI] [PubMed] [Google Scholar]
- 74.Jacobs HIL, Hopkins DA, Mayrhofer HC, et al. The cerebellum in Alzheimer’s disease: evaluating its role in cognitive decline. Brain. 2018;141:37–47. doi: 10.1093/brain/awx194. [DOI] [PubMed] [Google Scholar]
- 75.Caligiore D, Helmich RC, Hallett M, et al. Parkinson’s disease as a system-level disorder. NPJ Parkinson’s Dis. 2016;2:1–9. doi: 10.1038/npjparkd.2016.25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Jo T, Nho K, Risacher SL, et al. Deep learning detection of informative features in tau PET for Alzheimer’s disease classification. BMC Bioinform. 2020 doi: 10.1186/S12859-020-03848-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Lin CH, Chiu SI, Chen TF, et al. Classifications of neurodegenerative disorders using a multiplex blood biomarkers-based machine learning model. Int J Mol Sci. 2020;21:1–15. doi: 10.3390/ijms21186914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Nguyen DT, Ryu S, Qureshi MNI, et al. Hybrid multivariate pattern analysis combined with extreme learning machine for Alzheimer’s dementia diagnosis using multi-measure rs-fMRI spatial patterns. PLOS ONE. 2019 doi: 10.1371/JOURNAL.PONE.0212582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Nunes A, Silva G, Duque C, et al. Retinal texture biomarkers may help to discriminate between Alzheimer’s, Parkinson’s, and healthy controls. PLoS ONE. 2019 doi: 10.1371/JOURNAL.PONE.0218826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Clute-Reinig N, Jayadev S, Rhoads K, et al. Alzheimer’s disease diagnostics must be globally accessible. J Alzheimer’s Dis. 2021;84:1453–1455. doi: 10.3233/JAD-210663. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu), which is easily available for download from the Laboratory of Neuroimaging (LONI) website to the research public. As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data, but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.
Not applicable.

