Skip to main content
BMC Medical Informatics and Decision Making logoLink to BMC Medical Informatics and Decision Making
. 2020 Dec 2;20:319. doi: 10.1186/s12911-020-01339-z

Cognitive biomarker prioritization in Alzheimer’s Disease using brain morphometric data

Bo Peng 1, Xiaohui Yao 2, Shannon L Risacher 3, Andrew J Saykin 3, Li Shen 2, Xia Ning 1,; for the ADNI
PMCID: PMC7709267  PMID: 33267852

Abstract

Background

Cognitive assessments represent the most common clinical routine for the diagnosis of Alzheimer’s Disease (AD). Given a large number of cognitive assessment tools and time-limited office visits, it is important to determine a proper set of cognitive tests for different subjects. Most current studies create guidelines of cognitive test selection for a targeted population, but they are not customized for each individual subject. In this manuscript, we develop a machine learning paradigm enabling personalized cognitive assessments prioritization.

Method

We adapt a newly developed learning-to-rank approach PLTR to implement our paradigm. This method learns the latent scoring function that pushes the most effective cognitive assessments onto the top of the prioritization list. We also extend PLTR to better separate the most effective cognitive assessments and the less effective ones.

Results

Our empirical study on the ADNI data shows that the proposed paradigm outperforms the state-of-the-art baselines on identifying and prioritizing individual-specific cognitive biomarkers. We conduct experiments in cross validation and level-out validation settings. In the two settings, our paradigm significantly outperforms the best baselines with improvement as much as 22.1% and 19.7%, respectively, on prioritizing cognitive features.

Conclusions

The proposed paradigm achieves superior performance on prioritizing cognitive biomarkers. The cognitive biomarkers prioritized on top have great potentials to facilitate personalized diagnosis, disease subtyping, and ultimately precision medicine in AD.

Keywords: Alzheimer’s Disease, Learning to rank, Bioinformatics, Machine learning

Background

Identifying structural brain changes related to cognitive impairments is an important research topic in Alzheimer’s Disease (AD) study. Regression models have been extensively studied to predict cognitive outcomes using morphometric measures that are extracted from structural magnetic resonance imaging (MRI) scans [1, 2]. These studies are able to advance our understanding on the neuroanatomical basis of cognitive impairments. However, they are not designed to have direct impacts on clinical practice. To bridge this gap, in this manuscript we develop a novel learning paradigm to rank cognitive assessments based on their relevance to AD using brain MRI data.

Cognitive assessments represent the most common clinical routine for AD diagnosis. Given a large number of cognitive assessment tools and time-limited office visits, it is important to determine a proper set of cognitive tests for the subjects. Most current studies create guidelines of cognitive test selection for a targeted population [3, 4], but they are not customized for each individual subject. In this work, we develop a novel learning paradigm that incorporate the ideas of precision medicine and customizes the cognitive test selection process to the characteristics of each individual patient. Specifically, we conduct a novel application of a newly developed learning-to-rank approach, denoted as PLTR [5], to the structural MRI and cognitive assessment data of the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort [6]. Using structural MRI measures as the individual characteristics, we are able to not only identify individual-specific cognitive biomarkers but also prioritize them and their corresponding assessment tasks according to AD-specific abnormality. We also extend PLTR to PLTRh using hinge loss [7] to more effectively prioritize individual-specific cognitive biomarkers. The study presented in this manuscript is a substantial extension from our preliminary study [8].

Our study is unique and innovative from the following two perspectives. First, conventional regression-based studies for cognitive performance prediction using MRI data focus on identifying relevant imaging biomarkers at the population level. However, our proposed model aims to identify AD-relevant cognitive biomarkers customized to each individual patient. Second, the identified cognitive biomarkers and assessments are prioritized based on the individual’s brain characteristics. Therefore, they can be used to guide the selection of cognitive assessments in a personalized manner in clinical practice; it has the potential to enable personalized diagnosis and disease subtyping.

Literature review

Learning to rank

Learning-to-Rank (LETOR) [9] is a popular technique used in information retrieval [10], web search [11] and recommender systems [12]. Existing LETOR methods can be classified into three categories [9]. The first category is point-wise methods [13], in which a function is learned to score individual instance, and then instances are sorted/ranked based on their scores. The second category is pair-wise methods [14], which maximize the number of correctly ordered pairs in order to learn the optimal ranking structure among instances. The last category is list-wise methods [15], in which a ranking function is learned to explicitly model the entire ranking. Generally, pairwise and listwise methods have superior performance over point-wise methods due to their ability to leverage order structure among instances in learning [9]. Recently, LETOR has also been applied in drug discovery and drug selection [1619]. For example, Agarwal et al. [20] developed a bipartite ranking method to prioritize drug-like compounds. He et al. [5] developed a joint push and learning-to-rank method to select cancer drugs for each individual patient. These studies demonstrate the great potential of LETOR in computational biology and computational medicine, particularly for biomarker prioritization.

Machine learning for AD biomarker discovery

The importance of using big data to enhance AD biomarker study has been widely recognized [6]. As a result, numerous data-driven machine learning models have been developed for early AD detection and AD-relevant biomarker identification including cognitive measures. These models are often designed to accomplish tasks such as classification (e.g., [21]), regression (e.g., [1, 2, 22]) or both (e.g., [23, 24]), where imaging and other biomarker data are used to predict diagnostic, cognitive and/or other outcome(s) of interest. A drawback of these methods is that, although outcome-relevant biomarkers can be identified, they are identified at the population level and not specific to any individual subject. To bridge this gap, we adapt the PLTR method for biomarker prioritization at the individual level, which has greater potential to directly impact personalized diagnosis.

Methods

Materials

The imaging and cognitive data used in our study were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database [6]. The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial MRI, PET, other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI, a prodromal stage of AD) and early AD. For up-to-date information, Please refer to [25] for more detailed, up-to-date information.

Participants include 819 ADNI-1 subjects with 229 healthy control (HC), 397 MCI and 193 AD participants. We consider both MCI and AD subjects as patients, and thus we have 590 cases and 229 controls. We downloaded the 1.5T baseline MRI scans and cognitive assessment data from the ADNI website [25]. We processed the MRI scans using Freesurfer version 5.1 [26], where volumetric and cortical thickness measures of 101 regions relevant to AD were extracted to characterize brain morphometry.

We focus our analysis on 151 scores assessed in 15 neuropsychological tests. For convenience, we denote these measures as cognitive features and these tests as cognitive tasks. The 15 studied tasks include Alzheimer’s Disease Assessment Scale (ADAS), Clinical Dementia Rating Scale (CDR), Functional Assessment Questionnaire (FAQ), Geriatric Depression Scale (GDS), Mini-Mental State Exam (MMSE), Modified Hachinski Scale (MODHACH), Neuropsychiatric Inventory Questionnaire (NPIQ), Boston Naming Test (BNT), Clock Drawing Test (CDT), Digit Span Test (DSPAN), Digit Symbol Test (DSYM), Category Fluency Test (FLUENCY), Weschler’s Logical Memory Scale (LOGMEM), Rey Auditory Verbal Learning Test (RAVLT) and Trail Making Test (TRAIL).

Joint push and learning-to-rank using scores—PLTR

We use the joint push and learning-to-rank method that we developed in He et al. [5], denoted as PLTR, for personalized cognitive feature prioritization. PLTR has also been successfully applied in our preliminary study [8]. We aim to prioritize cognitive features for each individual patient that are most relevant to his/her disease diagnosis. We will use patients’ brain morphometric measures that are extracted from their MRI scans for the cognitive feature prioritization. The cognitive features are in the form of scores or answers in the cognitive tasks that the patients take. The prioritization outcomes can potentially be used in clinical practice to suggest the most relevant cognitive features or tasks that can most effectively facilitate diagnosis of an individual subject.

In order to prioritize MCI/AD cognitive features, PLTR learns and uses patient latent vector representations and their imaging features to score each cognitive feature for each individual patient. Then, PLTR ranks the cognitive features based on their scores. Patients with similar imaging feature profiles will have similar latent vectors and thus similiar ranking of cognitive features [27, 28]. During the learning, PLTR explicitly pushes the most relevant cognitive features on top of the less relevant features for each patient, and therefore optimizes the latent patient vectors and cognitive feature vectors in a way that they will reproduce the feature ranking structures [9]. In PLTR, these latent vectors are learned via solving the following optimization problem:

minU,VLs=(1-α)Ps+αOs++β2Ruv+γ2Rcsim, 1

where α, β and γ[0,1] are coefficients of Os+, Ruv and Rcsim terms, respectively; U=[u1,u2,,um] and V=[v1,v2,,vn] are the latent matrices for patients and features, respectively (u and v are column latent patient vector and feature vector, respectively); Ls is the overall loss function. In Problem 1, Ps measures the average number of relevant cognitive features ranked below an irrelevant cognitive feature, defined as follows,

Ps=p=1m1np+np-fi-Ppfj+Pp+I(sp(fj+)sp(fi-)), 2

where m is the number of patients, fj+ and fi- are the relevant and irrelevant features of patient Pp, np+ and np- are their respective numbers, and I(x) is the indicator function (I(x)=1 if x is true, otherwise 0). In Eq. (2), sp(fi) is a scoring function defined as follows,

sp(fi)=upTvi, 3

that is, it calculates the score of feature fi on patient Pp using their respective latent vectors up and vi [29]. By minimizing Ps, PLTR learns to assign higher scores to relevant features than irrelevant features so as to rank the relevant features at the top of the final ranking list. Note that, PLTR learns different latent vectors and ranking lists for different subjects, and therefore enables personalized feature prioritization. In Problem (1), Os+ measures the ratio of mis-ordered feature pairs over the relevant features among all the subjects, defined as follows,

Os+=p=1m1|{fi+Ppfj+}|fi+Ppfj+I(sp(fi+)<sp(fj+)), 4

where fiPpfj represents that fi is ranked higher than fj for patient Pp. By minimizing Os, PLTR learns to push the most relevant features on top of the less relevant features. Thus, most relevant features are pushed to the very top of the ranking list. In Problem (1), Ruv is a regularizer on U and V to prevent overfitting, defined as,

Ruv=1mUF2+1nVF2, 5

where XF is the Frobenius norm of matrix X. Rcsim is a regularizer on patients to constrain patient latent vectors, defined as

Rcsim=1m2p=1mq=1mwpqup-uq22, 6

where wpq is the similarity between subject Pp and Pq that is calculated using the imaging features of the subjects. The assumption here is that patients who are similar in terms of imaging features could also be similar in terms of cognitive features.

Joint push and learning-to-rank with marginalization—PLTRh

The objective of PLTR is to score relevant features higher than less relevant features as shown in Eqs. 2 and 4. However, in some cases, the score of relevant features is expected to be higher than that of less relevant features by a large margin. For example, patients can be very sensitive to a few cognitive tasks but less sensitive to many others. In order to incorporate such information, we propose a new hinge loss [7] based PLTR, denoted as PLTRh. In PLTRh, the overall loss function is very similar to Eq. 1, defined as follows,

minU,VLh=(1-α)Ph+αOh++β2Ruv+γ2Rcsim, 7

where Lh is the overall loss function; U, V, Ruv and Rcsim are identical as those in Eq. 1. In PLTRh, Ph measures the average loss between the relevant features and irrelevant features using hinge loss as follows,

Ph=p=1m1np+np-fi-Ppfj+Pp+max(0,tp-(sp(fj+)-sp(fi-))), 8

where max(0,tp-(sp(fj+)-sp(fi-))) is the hinge loss (max(0,x)=xifx>0, otherwise 0) between the relevant feature fj+ and the irrelevant feature fi-, and tp is the pre-defined margin. Specifically, only when sp(fj+)-sp(fi-)>tp will not induce any loss during optimization. Otherwise, the hinge loss will be positive and increase as sp(fj+)-sp(fi-) gets smaller than tp. Thus, the hinge loss forces the scores of relevant features higher than those of irrelevant features by at least tp. By doing this, the relevant features are ranked higher than irrelevant features in the ranking list. Similarly, Oh+ measures the average loss among the relevant features also using hinge loss as follows,

Oh+=p=1m1|{fi+Ppfj+}|fi+Ppfj+max(0,to-(sp(fi+)-sp(fj-))), 9

where to is also the pre-defined margin.

Data processing

Data normalization

Following the protocol in our preliminary study [8], we selected all the MCI and AD patients from ADNI and conducted the following data normalization for these patients. We first performed a t test on each cognitive feature between patients and controls, and selected those features if there is a significant difference between patients and controls on these features. Then, we converted the selected features into [0, 1] by shifting and scaling the feature values. We also converted all the normalized feature values according to the Cohen’s d of the features between patients and controls, and thus, smaller values always indicate higher AD possibility. After that, we filtered out features with values 0, 1 or 0.5 for more than 95% patients. This is to discard features that are either not discriminative, or extremely dominated by patients or controls. After the filtering step, we have 112 cognitive features remained and used in experiments. Additional file 1: Table S1 presents these 112 cognitive features. We conducted the same process as above on the imaging features. Additional file 1: Table S2 presents these imaging features used in experiments.

Patient similarities from imaging features

Through the normalization and filtering steps as in “Data normalization” section, we have 86 normalized imaging features remained. We represent each patient using a vector of these features, denoted as rp=[rp1,rp2,,rp86], in which rpi (i=1,,86) is an imaging feature for patient p. We calculate the patient similarity from imaging features using the radial basis function (RBF) kernel, that is, wpq=exp(-rp-rq22σ2), where wpq is the patient similarity used in Rcsim.

Results

Baseline methods

We compare PLTR and PLTRh with two baseline methods: the Bayesian Multi-Task Multi-Kernel Learning (BMTMKL) method [30] and the Kernelized Rank Learning (KRL) method [31].

Bayesian multi-task multi-kernel learning (BMTMKL)

BMTMKL is a state-of-the-art baseline for biomarker prioritization. It was originally proposed to rank cell lines for drugs and won the DREAM 7 challenge [32]. In our study, BMTMKL uses the multi-task and multi-kernel learning within kernelized regression to predict cognitive feature values and learns parameters by conducting Bayesian inference. We use the patient similarity matrix calculated from FreeSurfer features as the kernels in BMTMKL.

Kernelized rank learning (KRL)

KRL represents another state-of-the-art baseline for biomarker prioritization. In our study, KRL uses kernelized regression with a ranking loss to learn the ranking structure of patients and to predict the cognitive feature values. The objective of KRL is to maximize the hits among the top k of the ranking list. We use the patient similarity matrix calculated from FreeSurfer features as the kernels in KRL.

Training-testing data splits

Following the protocol in our preliminary study [8], we test our methods in two different settings: cross validation (CV) and leave-out validation (LOV). In CV, we randomly split each patient’s cognitive tasks into 5 folds: all the features of a cognitive task will be either split into training or testing set. We use 4 folds for training and the rest fold for testing, and do such experiments 5 times, each with one of the 5 folds as the testing set. The overall performance of the methods is averaged over the 5 testing sets. This setting corresponds to the goal to prioritize additional cognitive tasks that a patient should complete. In LOV, we split patients (not patient tasks) into training and testing sets, and a certain patient and all his/her cognitive features will be either in the training set or in the testing set. This corresponds to the use scenario to identify the most relevant cognitive tasks that a new patient needs to take, based on the existing imaging information of the patient, when the patient has not completed any cognitive tasks. Figures 1 and 2 demonstrate the CV and LOV data split processes, respectively.

Fig. 1.

Fig. 1

Data split for cross validation (CV)

Fig. 2.

Fig. 2

Data split for leave-out validation (LOV)

Please note that as presented in “Data normalization” section, for normalized cognitive features, smaller values always indicate more AD possibility. Thus, in both settings, we use the ranking list of normalized cognitive features of each patient as ground truth for training and testing.

Parameters

We conduct grid search to identify the best parameters on each evaluation metric for each model. We use 0.3 and 0.1 as the value of tp and to, respectively. In the experimental results, we report the combinations of parameters that achieve the best performance on evaluation metrics. We implement PLTR and PLTRh using Python 3.7.3 and Numpy 1.16.2, and run the experiments on Xeon E5-2680 v4 with 128G memory.

Evaluation metrics

Metrics on cognitive feature level

We use a metric named average feature hit at k (QH@k) as in our preliminary study [8] to evaluate the ranking performance,

QH@k(τq,τ~q)=i=1kI(τ~iqτq(1:k)), 10

where τq is the ground-truth ranking list of all the features in all the tasks, τq(1:k) is the top k features in the list, τ~q is the predicted ranking list of all the features, and τ~iq is the ith ranked features in τ~q. That is, QH@k calculates the number of features among top k in the predicted feature lists that are also in the ground truth (i.e., hits). Higher QH@k values indicate better prioritization performance.

We use a second evaluation metric weighted average feature hit at k (WQH@k) as follows:

WQH@k(τq,τ~q)=j=1kQH@j(τq,τ~q)/k, 11

that is, WQH@k is a weighted version of QH@k that calculates the average of QH@j (j=1,,k) over top k. Higher WQH@k indicates more feature hits and those hits are ranked on top in the ranking list.

Metrics on cognitive task level

In in Peng et al. [8], we use the mean of the top-g normalized ground-truth scores/predicted scores on the features of each cognitive task for a patient as the score of that task for that patient. For each patient, we rank the tasks using their ground-truth scores and use the ranking as the ground-truth ranking of these tasks. Thus, these scores measure how much relevant to AD the task indicates for the patients. We use the predicted scores to rank cognitive tasks into the predicted ranking of the tasks. We define a third evaluation metric task hit at k (NHg@k) as follows to evaluate the ranking performance in terms of tasks,

NHg@k(τgn,τ~gn)=i=1kI(τ~ginτgn(1:k)), 12

where τgn/τ~gn is the ground-truth/predicted ranking list of all the tasks using top-g question scores.

Experimental results

Overall Performance on CV

Table 1 presents the performance of PLTR, PLTRh and two baseline methods in the CV setting. Note that overall, PLTR and PLTRh have similar standard deviations; KRL and BMTMKL have higher standard deviations compared to PLTR and PLTRh. This indicates that PLTR and PLTRh are more robust than KRL and BMTMKL for the prioritization tasks.

Table 1.

Overall performance in CV

Method Parameters Feature level Task level
d λ QH@5 WQH@5 NH1@1 NH2@1 NH3@1 NH5@1 NHall@1
PLTR 10 2.665_±0.07 3.136±0.12 0.605±0.03 0.701±0.04 0.713±0.05 0.725±0.05 0.683±0.04
10 2.647±0.08 3.191_±0.14 0.599±0.03 0.677±0.04 0.707±0.04 0.725±0.05 0.677±0.04
10 2.569±0.08 2.957±0.11 0.635±0.03 0.707±0.04 0.689±0.05 0.719±0.04 0.653±0.04
10 2.623±0.06 3.073±0.09 0.623±0.03 0.713_±0.05 0.707±0.04 0.719±0.04 0.671±0.04
50 2.467±0.07 2.992±0.11 0.605±0.03 0.695±0.04 0.725±0.06 0.725±0.04 0.653±0.04
30 2.491±0.07 3.080±0.14 0.563±0.04 0.689±0.05 0.713±0.04 0.749±0.04 0.689±0.03
PLTRh 10 2.599 ± 0.09 3.111 ± 0.12 0.623 ± 0.02 0.671 ± 0.03 0.713 ± 0.03 0.719 ± 0.04 0.707 ± 0.03
10 2.575±0.08 3.115 ± 0.13 0.623 ± 0.03 0.677 ± 0.03 0.737 ± 0.04 0.749 ± 0.03 0.695 ± 0.03
10 2.419±0.09 2.827±0.12 0.647_ ± 0.03 0.695 ± 0.03 0.671 ± 0.03 0.707 ± 0.03 0.635 ± 0.03
30 2.138±0.10 2.583±0.18 0.629±0.02 0.701±0.02 0.695 ± 0.03 0.695 ± 0.04 0.593 ± 0.05
50 2.102±0.07 2.470±0.10 0.533±0.03 0.677±0.03 0.743_ ± 0.04 0.754 ± 0.03 0.629 ± 0.05
30 2.281±0.07 2.768±0.18 0.563±0.03 0.689±0.03 0.707 ± 0.04 0.760 ± 0.05 0.701 ± 0.05
KRL 2 2.102 ± 0.26 2.167 ± 0.37 0.569 ± 0.03 0.611 ± 0.05 0.635 ± 0.04 0.683 ± 0.03 0.689 ± 0.07
1.5 2.078 ± 0.15 2.143 ± 0.25 0.503 ± 0.04 0.575 ± 0.05 0.617 ± 0.05 0.677 ± 0.04 0.760_ ± 0.06
BMTMKL 2.443 ± 0.12 2.614 ± 0.20 0.413 ± 0.07 0.491 ± 0.08 0.593 ± 0.05 0.784_ ± 0.05 0.749 ± 0.05

The column “d” corresponds to the latent dimension. The numbers in the form of x±y represent the mean (x) and standard deviation (y). The best performance of each method is in italic. The best performance under each evaluation metric is underlined

Comparison on cognitive feature level

For cognitive features from all tasks, PLTR is able to identify on average 2.665±0.07 out of the top-5 most relevant ground-truth cognitive features among its top-5 predictions (i.e., QH@5 = 2.665 ± 0.07). PLTRh achieves similar performance as PLTR, and identifies on average 2.599±0.09 most relevant ground-truth cognitive features on its top-5 predictions (i.e., QH@5 =2.599±0.09). PLTR and PLTRh significantly outperform the baseline methods in terms of all the evaluation metrics on cognitive feature level (i.e., QH@5 and WQH@5). Specifically, PLTR outperforms the best baseline method BMTMKL at 9.1±3.7% and 22.1±9.5% on QH@5 and WQH@5, respectively. PLTRh also outperforms BMTMKL at 6.4±4.3% and 19.2±10.1% on QH@5 and WQH@5, respectively. These experimental results demonstrate that among the top 5 features in the ranking list, PLTR and PLTRh are able to rank more relevant features on top than the two state-of-the-art baseline methods and the positions of those hits are also higher than those in the baseline methods.

Comparison on cognitive task level

For the scenario to prioritize cognitive tasks that each patient should take, PLTR and PLTRh are able to identify the top-1 most relevant task for 72.5±6.0% and 74.3±4.0% of all the patients when using 3 features to score cognitive tasks, respectively (i.e., NH3=0.725±0.06 for PLTR and NH3=0.743±0.04 for PLTRh). This indicates the strong power of PLTR and PLTRh in prioritizing cognitive features and in recommending relevant cognition tasks for real clinical applications. We also find that PLTR and PLTRh are able to outperform baseline methods on most of the metrics on cognitive task level (i.e., NHg@1). PLTR outperforms the best baseline method at 11.6±5.6%, 16.7±6.1% and 14.2±6.6% on NH1@1, NH2@1 and NH3@1, respectively. PLTRh performs even better than PLTR on NH1@1 and NH3@1, in addition to that it outperforms the best performance of baseline methods at 13.7±5.3%, 14.7±4.8% and 17.0±8.8% on NH1@1, NH2@1 and NH3@1, respectively. PLTR and PLTRh perform slightly worse than baseline methods on NH5@1 and NHall@1 (0.760±0.05 vs 0.784±0.05 on NH5@1 and 0.707±0.03 vs 0.760±0.06 on NHall@1). These experimental results indicate that PLTR and PLTRh are able to push the most relevant task to the top of the ranking list than baseline methods when using a small number of features to score cognitive tasks. Note that in CV, each patient has only a few cognitive tasks in the testing set. Therefore, we only consider the evaluation at the top task in the predicted task rankings (i.e., only NHg@1 in Table 1).

Table 1 also shows that PLTRh outperforms PLTR on most of the metrics on cognitive task level (i.e., NHg@1). PLTRh outperforms PLTR at 1.9 ± 0.5%, 2.5 ± 1.2%, 1.5 ± 0.3% and 2.6 ± 0.9% on NH1@1, NH3@1, NH5@1 and NHall@1, respectively. This indicates that generally PLTRh is better than PLTR on ranking cognitive tasks in CV setting. The reason could be that the hinge-based loss functions with pre-defined margins can enable significant difference between the scores of relevant features and irrelevant features, and thus effectively push relevant features upon irrelevant features.

Overall performance on LOV

Tables 2 and 3 present the performance of PLTR, PLTRh and two baseline methods in the LOV setting. Due to space limit, we did not present the standard deviations in the tables, but they have similar trends as those in Table 1. We first hold out 26 (Table 2) and 52 (Table 3) AD patients as testing patients, respectively. We determine these hold-out AD patients as the ones that have more than 10 similar AD patients in the training set with corresponding patient similarities higher than 0.67 and 0.62, respectively.

Table 2.

Overall performance in LOV on 26 testing patients

Method Feature level Task level
QH@5 WQH@5 NH1@1 NH1@5 NH2@1 NH2@5 NH3@1 NH3@5 NH5@1 NH5@5 NHall@1 NHall@5
PLTR 1.615_ 1.906_ 0.846 3.231 0.577 3.385 0.231 3.654 0.308 3.346 0.808 3.692
1.500 1.778 0.846_ 3.269 0.577_ 3.538_ 0.269 3.654 0.269 3.269 0.808 3.577
1.538 1.856 0.846 3.192 0.577 3.423 0.308 3.731_ 0.346 3.346 0.808 3.615
1.577 1.851 0.846 3.192 0.577 3.462 0.308 3.654 0.346_ 3.462 0.808 3.654
1.615 1.906 0.846 3.231 0.577 3.385 0.231 3.654 0.308 3.346 0.808_ 3.692
PLTRh 1.615_ 1.836 0.846_ 3.192 0.577_ 3.500 0.269 3.731 0.346_ 3.731 0.808_ 4.154
1.538 1.891 0.846 3.192 0.577 3.500 0.269 3.731 0.346 3.615 0.808 4.038
1.538 1.856 0.769 3.308 0.577 3.462 0.269 3.615 0.308 3.385 0.808 3.500
1.538 1.712 0.846 3.115 0.577 3.423 0.154 3.731_ 0.308 3.808 0.808 4.269_
KRL 1.423 1.656 0.615 2.615 0.577_ 3.308 0.038 3.577 0.346_ 3.962_ 0.808_ 4.269_
1.346 1.881 0.577 2.615 0.577 3.308 0.038 3.577 0.346 3.962 0.808 4.269
1.346 1.435 0.808 3.423_ 0.538 3.500 0.346_ 3.731_ 0.154 3.423 0.808 3.538
BMTMKL 0.423 0.212 0.846_ 2.615 0.577_ 3.308 0.038 3.577 0.346_ 3.769 0.808_ 4.269_

The column “n” corresponds to the number of hold-out testing patients. The bset performance of each method is in italic.The best performance under each evaluation metric is underlined

Table 3.

Overall Performance in LOV on 52 testing patients

Method Feature level Task level
QH@5 WQH@5 NH1@1 NH1@5 NH2@1 NH2@5 NH3@1 NH3@5 NH5@1 NH5@5 NHall@1 NHall@5
PLTR 1.385 1.668 0.788 3.212 0.423 3.654 0.115 3.750 0.288 3.423 0.788 3.423
1.327 1.616 0.808_ 3.269_ 0.423 3.654 0.115 3.731 0.173 3.423 0.788 3.404
1.327 1.652 0.788 3.212 0.423 3.712_ 0.115 3.750_ 0.269 3.423 0.788 3.404
1.308 1.616 0.788 3.154 0.423 3.654 0.115 3.712 0.288 3.481 0.788 3.615
1.288 1.581 0.808 3.173 0.423 3.596 0.115 3.750 0.192 3.519 0.788 3.635
1.269 1.616 0.808 3.115 0.423 3.635 0.115 3.731 0.250 3.481 0.788_ 3.635
PLTRh 1.404_ 1.656 0.750 2.827 0.404 3.250 0.173 3.481 0.385_ 3.596 0.788_ 4.154_
1.365 1.695_ 0.731 2.808 0.365 3.308 0.173 3.462 0.365 3.596 0.788 4.154
1.327 1.562 0.808_ 3.077 0.404 3.365 0.135 3.577 0.250 3.673 0.788 4.115
1.327 1.605 0.769 3.154 0.385 3.596 0.135 3.712 0.212 3.519 0.788 3.577
1.308 1.609 0.769 2.904 0.385 3.308 0.192_ 3.442 0.365 3.654 0.788 4.154
1.327 1.605 0.769 3.154 0.385 3.596 0.135 3.712 0.212 3.519 0.788 3.577
1.288 1.545 0.788 3.000 0.404 3.385 0.154 3.558 0.308 3.712 0.788 4.154
KRL 1.173 1.548 0.096 2.577 0.385 3.231 0.077 3.385 0.346 3.808_ 0.788_ 4.154_
1.173 1.534 0.154 2.615 0.250 3.192 0.077 3.385 0.346 3.712 0.788 4.154
1.096 1.437 0.077 2.577 0.462 3.231 0.077 3.385 0.346 3.808 0.788 4.154
0.423 0.504 0.019 2.019 0.038 2.500 0.115 2.481 0.115 2.712 0.019 2.673
BMTMKL 0.403 0.255 0.808_ 2.577 0.481_ 3.231 0.077 3.385 0.346 3.596 0.788_ 4.154_

The column “n” corresponds to the number of hold-out testing patients. The best performance of each model is in italic. The best performance under each evaluation metric is upon underline.

Comparison on cognitive feature level

Tables 2 and 3 show that PLTR and PLTRh significantly outperform the baseline methods in terms of all the evaluation metrics on cognitive feature level (i.e., QH@5 and WQH@5), which is consistent with the experimental results in CV setting. When 26 patients are hold out for testing, with parameters α=0.5, β=1.5, γ=1.0 and d=30, PLTR outperforms the best baseline method KRL at 13.4% and 1.3% on QH@5 and WQH@5, respectively. The performance of PLTRh is very comparable with that of PLTRPLTRh outperforms KRL at 13.4% and 0.5% on QH@5 and WQH@5, respectively. When 52 patients are hold out for testing, with parameters α=0.5, β=0.5, γ=1.0 and d=50, PLTR outperforms the best baseline method KRL at 18.1% and 7.8% on QH@5 and WQH@5, respectively. PLTRh even performs better than PLTR in this setting. In addition, PLTRh outperforms KRL at 19.7% and 9.5% on QH@5 and WQH@5, respectively. These experimental results demonstrate that for new patients, PLTR and PLTRh are able to rank more relevant features to the top of the ranking list than the two baseline methods. They also indicate that for new patients, ranking based methods (e.g., PLTR and PLTRh) are more effective than regression based methods (e.g., KRL and BMTMKL) for biomarker prioritization.

Comparison on cognitive task level

Table 2 also shows that when 26 patients are hold out for testing, PLTR and PLTRh are both able to identify the top most relevant questionnaire for 84.6% of the testing patients (i.e., 22 patients) under NH1@1. Table 3 shows that when 52 patients are hold out for testing, PLTR and PLTRh are both able to identify for 80.8% of the testing patients (i.e., 42 patients) under NH1@1. Note that the hold-out testing patients in LOV do not have any cognitive features. Therefore, the performance of PLTR and PLTRh as above demonstrates their strong capability in identifying most AD related cognitive features based on imaging features only. We also find that PLTR and PLTRh are able to achieve similar or even better results compared to baseline methods in terms of the evaluation metrics on cognitive task level (i.e., NHg@1 and NHg@5). When 26 patients are hold out for testing, PLTR and PLTRh outperform the baseline methods in terms of NHg@1 (i.e., g=1,25). They are only slightly worse than KRL on ranking relevant tasks on their top-5 of predictions when g=1 or g=5 (3.308 vs 3.423 on NH1@5 and 3.808 vs 3.962 on NH5@5). When 52 patients are hold out for testing, PLTR and PLTRh also achieve the best performance on most of the evaluation metrics. They are only slightly worse than KRL on NH2@1, NH5@5 (0.423 vs 0.481 on NH2@1 and 3.712 vs 3.808 on NH5@5). These experimental results demonstrate that among top 5 tasks in the ranking list, PLTR and PLTRh rank more relevant task on top than KRL.

It’s notable that in Tables 2 and 3, as the number of features used to score cognitive tasks (i.e., g in NHg@k) increases, the performance of all the methods in NHg@1 first declines and then increases. This may indicate that as g increases, irrelevant features which happen to have relatively high scores will be included in scoring tasks, and thus degrade the model performance on NHg@1. However, generally, the scores of irrelevant features are considerably lower than those of relevant ones. Thus, as more features are included, the scores for tasks are more dominated by the scores of relevant features and thus the performance increases.

We also find that BMTMKL performs poorly on NH3@1 in both Tables 2 and 3. This indicates that BMTMKL, a regression-based method, could not well rank relevant features and irrelevant features. It’s also notable that generally the best performance for the 26 testing patients is better than that for 52 testing patients. This may be due to that the similarities between the 26 testing patients and their top 10 similar training patients are higher than those for the 52 testing patients. The high similarities enable accurate latent vectors for testing patients.

Tables 2 and 3 also show that PLTRh is better than PLTR on ranking cognitive tasks in LOV setting. When 26 patients are hold out for testing, PLTRh outperforms PLTR on NH1@5, NH5@5 and NHall@5 and achieves very comparable performance on the rest metrics. When 52 patients are hold out for testing, PLTRh is able to achieve better performance than PLTR on QH@5, WQH@5, NH3@1, NH5@1, NH5@5 and NHall@5 and also achieves very comparable performance on the rest metrics. Generally, PLTRh outperforms PLTR in terms of metrics on cognitive task level. This demonstrates the effectiveness of hinge loss-based methods in separating relevant and irrelevant features during modeling.

Discussion

Our experimental results show that when NH1@1 achieves its best performance of 0.846 for the 26 testing patients in the LOV setting (i.e., the first row block in Table 2), the task that is most commonly prioritized for the testing patients is Rey Auditory Verbal Learning Test (RAVLT), including the following cognitive features: (1) trial 1 total number of words recalled; (2) trial 2 total number of words recalled; (3) trial 3 total number of words recalled; (4) trial 4 total number of words recalled; (5) trial 5 total number of words recalled; (6) total Score; (7) trial 6 total number of words recalled; (8) list B total number of words recalled; (9) 30 min delay total; and (10) 30 min delay recognition score. RAVLT is also the most relevant task in the ground truth if tasks are scored correspondingly. RAVLT assesses learning and memory, and has shown promising performance in early detection of AD [33]. A number of studies have reported high correlations between various RAVLT scores with different brain regions [34]. For instance, RAVLT recall is associated with medial prefrontal cortex and hippocampus; RAVLT recognition is highly correlated with thalamic and caudate nuclei. In addition, genetic analysis of APOE ε4 allele, the most common variant of AD, reported its association with RAVLT score in an early-MCI (EMCI) study [26]. The fact that RAVLT is prioritized demonstrates that PLTR is powerful in prioritizing cognitive features to assist AD diagnosis.

Similarly, we find the top-5 most frequent cognitive tasks corresponding to the performance at NH3@5 = 3.731 for the 26 hold-out testing patients. They are: Functional Assessment Questionnaire (FAQ), Clock Drawing Test (CDT), Weschler’s Logical Memory Scale (LOGMEM), Rey Auditory Verbal Learning Test (RAVLT), and Neuropsychiatric Inventory Questionnaire (NPIQ). In addition to RAVLT discussed above, other top prioritized cognitive tasks have also been reported to be associated with AD or its progression. In an MCI to AD conversion study, FAQ, NPIQ and RAVLT showed significant difference between MCI-converter and MCI-stable groups [35]. We also notice that for some testing subjects, PLTR is able to very well reconstruct their ranking structures. For example, when NH3@5 achieves its optimal performance 3.731, for a certain testing subject, her top-5 predicted cognitive tasks RAVLT, LOGMEM, FAQ, NPIQ and CDT are exactly the top-5 cognitive tasks in the ground truth. These evidences further demonstrate the diagnostic power of our method.

Conclusions

We have proposed a novel machine learning paradigm to prioritize cognitive assessments based on their relevance to AD at the individual patient level. The paradigm tailors the cognitive biomarker discovery and cognitive assessment selection process to the brain morphometric characteristics of each individual patient. It has been implemented using newly developed learning-to-rank method PLTR and PLTRh. Our empirical study on the ADNI data has produced promising results to identify and prioritize individual-specific cognitive biomarkers as well as cognitive assessment tasks based on the individual’s structural MRI data. In addition, PLTRh shows better performance than PLTR on ranking cognitive assessment tasks. The resulting top ranked cognitive biomarkers and assessment tasks have the potential to aid personalized diagnosis and disease subtyping, and to make progress towards enabling precision medicine in AD.

Supplementary information

12911_2020_1339_MOESM1_ESM.pdf (142.3KB, pdf)

Additional file 1. Supplementary materials.

Acknowledgements

Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

For the ADNI: Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: https://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Data_Use_Agreement.pdf.

Abbreviations

AD

Alzheimer’s Disease

MRI

Magnetic resonance imaging

ADNI

Alzheimer’s Disease Neuroimaging Initiative

LETOR

Learning-to-Rank

PET

Positron emission tomography

MCI

Mild Cognitive Impairment

HC

Healthy control

ADAS

Alzheimer’s Disease Assessment Scale

CDR

Clinical Dementia Rating Scale

FAQ

Functional Assessment Questionnaire

GDS

Geriatric Depression Scale

MMSE

Mini-Mental State Exam

MODHACH

Modified Hachinski Scale

NPIQ

Neuropsychiatric Inventory Questionnaire

BNT

Boston Naming Test

CDT

Clock Drawing Test

DSPAN

Digit Span Test

DSYM

Digit Symbol Test

FLUENCY

Category Fluency Test

LOGMEM

Weschler’s Logical Memory Scale

RAVLT

Rey Auditory Verbal Learning Test

TRAIL

Trail Making Test

RBF

Radial Basis Function

PLTR

Joint Push and Learning-to-Rank Method

PLTRh

Joint Push and Learning-to-Rank Method using Hinge Loss

BMTMKL

Bayesian Multi-Task Multi-Kernel Learning

KRL

Kernelized Rank Learning

CV

Cross Validation

LOV

Leave-Out Validation

QH@k

Average Feature Hit at k

WQH@k

Weighted Average Feature Hit at k

NHg@k

Task Hit at k

APOE

Apolipoprotein E

EMCI

Early-MCI

Authors’ contributions

XN and LS designed the research study. BP and XY contributed to the conduct of the study: XY extracted and processed the data from ADNI; BP conduced the model development and data analysis. The results were analyzed, interpreted and discussed by BP, XY, SL, AJ, LS and XN. BP and XN drafted the manuscript, all co-authors revised the manuscript, and all authors read and approved the final manuscript.

Funding

This work was supported in part by NIH R01 EB022574, R01 AG019771, and P30 AG010133; NSF IIS 1837964 and 1855501. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funding agencies.

Availability of data and materials

The dataset supporting the conclusions of this article is available in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) [25].

Ethics approval and consent to participate

The dataset supporting the conclusions of this article is available in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) [25]. ADNI data can be requested by all interested investigators; they can request it via the ADNI website and must agree to acknowledge ADNI and its funders in the papers that use the data. There are also some other reporting requirements; the PI must give an annual report of what the data have been used for, and any publications arising. More details are available at http://adni.loni.usc.edu/data-samples/access-data/.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Bo Peng, peng.707@buckeyemail.osu.edu.

Xiaohui Yao, Email: Xiaohui.Yao@pennmedicine.upenn.edu.

Shannon L. Risacher, Email: srisache@iupui.edu

Andrew J. Saykin, Email: asaykin@iupui.edu

Li Shen, Email: li.shen@pennmedicine.upenn.edu.

Xia Ning, Email: ning.104@osu.edu.

Supplementary information

Supplementary information accompanies this paper at 10.1186/s12911-020-01339-z.

References

  • 1.Wan J, Zhang Z, et al. Identifying the neuroanatomical basis of cognitive impairment in Alzheimer’s disease by correlation- and nonlinearity-aware sparse Bayesian learning. IEEE Trans Med Imaging. 2014;33(7):1475–87. doi: 10.1109/TMI.2014.2314712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yan J, Li T, et al. Cortical surface biomarkers for predicting cognitive outcomes using group l2,1 norm. Neurobiol Aging. 2015;36(Suppl 1):185–93. doi: 10.1016/j.neurobiolaging.2014.07.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cordell CB, Borson S, et al. Alzheimer’s Association recommendations for operationalizing the detection of cognitive impairment during the medicare annual wellness visit in a primary care setting. Alzheimers Dement. 2013;9(2):141–50. doi: 10.1016/j.jalz.2012.09.011. [DOI] [PubMed] [Google Scholar]
  • 4.Scott J, Mayo AM. Instruments for detection and screening of cognitive impairment for older adults in primary care settings: a review. Geriatr Nurs. 2018;39(3):323–329. doi: 10.1016/j.gerinurse.2017.11.001. [DOI] [PubMed] [Google Scholar]
  • 5.He Y, Liu J, Ning X. Drug selection via joint push and learning to rank. IEEE/ACM Trans Comput Biol Bioinform. 2020;17(1):110–123. doi: 10.1109/TCBB.2018.2848908. [DOI] [PubMed] [Google Scholar]
  • 6.Weiner MW, Veitch DP, et al. The Alzheimer’s disease neuroimaging initiative 3: continued innovation for clinical trial improvement. Alzheimers Dement. 2017;13(5):561–571. doi: 10.1016/j.jalz.2016.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gentile C, Warmuth MK. Linear hinge loss and average margin. In: Proceedings of the 11th International Conference on Neural Information Processing Systems. NIPS'98. MA, USA: MIT Press, Cambridge; 1999. p. 225–31.
  • 8.Peng B, Yao X, Risacher SL, Saykin AJ, Shen L, Ning X. Prioritization of cognitive assessments in alzheimer’s disease via learning to rank using brain morphometric data. In: Proceedings of 2019 IEEE EMBS International Conference on Biomedical Health Informatics. New York, NY: IEEE; 2019. p. 1–4 . 10.1109/BHI.2019.8834618. [DOI] [PMC free article] [PubMed]
  • 9.Liu T-Y. Learning to rank for information retrieval. 1. Berlin: Springer; 2011. pp. 1–285. [Google Scholar]
  • 10.Li, H. Learning to rank for information retrieval and natural language processing. 1st ed. In: Synthesis lectures on human language technologies, p. 114. San Rafael, California USA: Morgan & Claypool Publishers; 2011. 10.2200/S00607ED2V01Y201410HLT026.
  • 11.Agichtein E, Brill E, Dumais S, Brill E, Dumais S. Improving web search ranking by incorporating user behavior. In: Proceedings of SIGIR 2006; 2006.
  • 12.Karatzoglou A, Baltrunas L, Shi Y. Learning to rank for recommender systems. In: Proceedings of the 7th ACM conference on recommender systems. RecSys’13. New York: ACM; 2013. p. 493–4. 10.1145/2507157.2508063.
  • 13.Cao Z, Qin T, Liu T-Y, Tsai M-F, Li H. Learning to rank: from pairwise approach to listwise approach. In: Proceedings of the 24th international conference on machine learning. ACM; 2007. p. 129–36.
  • 14.Burges CJC, Ragno R, Le QV. Learning to rank with nonsmooth cost functions. In: Proceedings of the 19th international conference on neural information processing systems. NIPS’06. Cambridge: MIT Press; 2006. p. 193–200
  • 15.Lebanon G, Lafferty J. Cranking: Combining rankings using conditional probability models on permutations. In: ICML, 2002; vol. 2, p. 363–70. Citeseer.
  • 16.Liu J, Ning X. Multi-assay-based compound prioritization via assistance utilization: a machine learning framework. J Chem Inf Model. 2017;57(3):484–498. doi: 10.1021/acs.jcim.6b00737. [DOI] [PubMed] [Google Scholar]
  • 17.Zhang W, Ji L, Chen Y, Tang K, Wang H, Zhu R, Jia W, Cao Z, Liu Q. When drug discovery meets web search: learning to rank for ligand-based virtual screening. J Cheminform. 2015;7(1):5. doi: 10.1186/s13321-015-0052-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Liu J, Ning X. Differential compound prioritization via bi-directional selectivity push with power. In: Proceedings of the 8th ACM international conference on bioinformatics, computational biology, and health informatics. ACM-BCB’17. New York: ACM; 2017. p. 394–9. 10.1145/3107411.3107486.
  • 19.Liu J, Ning X. Differential compound prioritization via bi-directional selectivity push with power. J Chem Inf Model. 2017;57(12):2958–2975. doi: 10.1021/acs.jcim.7b00552. [DOI] [PubMed] [Google Scholar]
  • 20.Agarwal S, Dugar D, Sengupta S. Ranking chemical structures for drug discovery: a new machine learning approach. J Chem Inf Model. 2010;50(5):716–731. doi: 10.1021/ci9003865. [DOI] [PubMed] [Google Scholar]
  • 21.Wang X, Liu K, Yan J, Risacher SL, Saykin AJ, Shen L, Huang H et al. Predicting interrelated alzheimer’s disease outcomes via new self-learned structured low-rank model. In: International conference on information processing in medical imaging. Springer; 2017. p. 198–209. [DOI] [PMC free article] [PubMed]
  • 22.Yan J, Deng C, Luo L, Wang X, Yao X, Shen L, Huang H. Identifying imaging markers for predicting cognitive assessments using wasserstein distances based matrix regression. Front Neurosci. 2019;13:668. doi: 10.3389/fnins.2019.00668. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Wang H, Nie F, Huang H, Risacher SL, Saykin AJ, Shen L. Alzheimer’s disease neuroimaging I. Identifying disease sensitive and quantitative trait-relevant biomarkers from multidimensional heterogeneous imaging genetics data via sparse multimodal multitask learning. Bioinformatics. 2012;28(12):127–36. doi: 10.1093/bioinformatics/bts228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Brand L, Wang H, Huang H, Risacher S, Saykin A, Shen L et al. Joint high-order multi-task feature learning to predict the progression of alzheimer’s disease. In: International conference on medical image computing and computer-assisted intervention. Springer; 2018. p. 555–62. [DOI] [PMC free article] [PubMed]
  • 25.Weiner MW, Veitch DP et al. Alzheimer’s Disease Neuroimaging Initiative. https://adni.loni.usc.edu. Accessed 22 July 2020
  • 26.Risacher S, Kim S, et al. The role of apolipoprotein e (apoe) genotype in early mild cognitive impairment (e-mci) Front Aging Neurosci. 2013;5:11. doi: 10.3389/fnagi.2013.00011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Sarwar B, Karypis G, Konstan J, Riedl J. Item-based collaborative filtering recommendation algorithms. In: Proceedings of the 10th international conference on world wide web. WWW’01. New York: Association for Computing Machinery; 2001. p. 285–95. 10.1145/371920.372071.
  • 28.Wang J, De Vries AP, Reinders MJ. Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval; 2006. p. 501–8.
  • 29.Koren Y, Bell R, Volinsky C. Matrix factorization techniques for recommender systems. Computer. 2009;42(8):30–37. doi: 10.1109/MC.2009.263. [DOI] [Google Scholar]
  • 30.Costello JC, Heiser LM, Georgii E, Gönen M, Menden MP, Wang NJ, Bansal M, Hintsanen P, Khan SA, Mpindi J-P, et al. A community effort to assess and improve drug sensitivity prediction algorithms. Nat Biotechnol. 2014;32(12):1202. doi: 10.1038/nbt.2877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.He X, Folkman L, Borgwardt K. Kernelized rank learning for personalized drug recommendation. Bioinformatics. 2018;34(16):2808–2816. doi: 10.1093/bioinformatics/bty132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Challenge D. DREAM 7 NCI-DREAM drug sensitivity prediction challenge. http://dreamchallenges.org/project/dream-7-nci-dream-drug-sensitivity-prediction-challenge/. Accessed 23 July 2020
  • 33.Moradi E, Hallikainen I, et al. Rey’s Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer’s disease. NeuroImage: Clinical. 2017;13:415–427. doi: 10.1016/j.nicl.2016.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Balthazar MLF, Yasuda CL, et al. Learning, retrieval, and recognition are compromised in aMCI and mild AD: Are distinct episodic memory processes mediated by the same anatomical structures? J Int Neuropsychol Soc. 2010;16(1):205–209. doi: 10.1017/S1355617709990956. [DOI] [PubMed] [Google Scholar]
  • 35.Risacher SL, Saykin AJ, et al. Baseline MRI predictors of conversion from MCI to probable AD in the ADNI cohort. Curr Alzheimer Res. 2009;6(4):347–361. doi: 10.2174/156720509788929273. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12911_2020_1339_MOESM1_ESM.pdf (142.3KB, pdf)

Additional file 1. Supplementary materials.

Data Availability Statement

The dataset supporting the conclusions of this article is available in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) [25].


Articles from BMC Medical Informatics and Decision Making are provided here courtesy of BMC

RESOURCES