Influence of medical domain knowledge on deep learning for Alzheimer's disease prediction

Branimir Ljubic; Shoumik Roychoudhury; Xi Hang Cao; Martin Pavlovski; Stefan Obradovic; Richard Nair; Lucas Glass; Zoran Obradovic

doi:10.1016/j.cmpb.2020.105765

. 2020 Sep 20;197:105765. doi: 10.1016/j.cmpb.2020.105765

Influence of medical domain knowledge on deep learning for Alzheimer's disease prediction

Branimir Ljubic ^a, Shoumik Roychoudhury ^a, Xi Hang Cao ^a, Martin Pavlovski ^a, Stefan Obradovic ^b, Richard Nair ^c, Lucas Glass ^c, Zoran Obradovic ^a,^⁎

PMCID: PMC7502243 PMID: 33011665

Highlights

•
LSTM, RNN model for prediction of Alzheimer's diseases(AD) is developed from EMR data.
•
Information from 3 EMR domains were used: conditions, measurements and drugs.
•
We created positive AD cohorts using relevant medical knowledge as model inputs.
•
Selection of relevant input cohorts was crucial for overall RNN model prediction.
•
We efficiently applied the drugs and the measurement domain in prediction of AD.

Keywords: Electronic medical records, Alzheimer's disease prediction, Cognitive impairment, Deep learning, Recurrent Neural Networks

Abstract

Background and objective

Alzheimer's disease (AD) is the most common type of dementia that can seriously affect a person's ability to perform daily activities. Estimates indicate that AD may rank third as a cause of death for older people, after heart disease and cancer. Identification of individuals at risk for developing AD is imperative for testing therapeutic interventions. The objective of the study was to determine could diagnostics of AD from EMR data alone (without relying on diagnostic imaging) be significantly improved by applying clinical domain knowledge in data preprocessing and positive dataset selection rather than setting naïve filters.

Methods

Data were extracted from the repository of heterogeneous ambulatory EMR data, collected from primary care medical offices all over the U.S. Medical domain knowledge was applied to build a positive dataset from data relevant to AD. Selected Clinically Relevant Positive (SCRP) datasets were used as inputs to a Long-Short-Term Memory (LSTM) Recurrent Neural Network (RNN) deep learning model to predict will the patient develop AD.

Results

Risk scores prediction of AD using the drugs domain information in an SCRP AD dataset of 2,324 patients achieved high out-of-sample score - 0.98-0.99 Area Under the Precision-Recall Curve (AUPRC) when using 90% of SCRP dataset for training. AUPRC dropped to 0.89 when training the model using less than 1,500 cases from the SCRP dataset. The model was still significantly better than when using naïve dataset selection.

Conclusion

The LSTM RNN method that used data relevant to AD performed significantly better when learning from the SCRP dataset than when datasets were selected naïvely. The integration of qualitative medical knowledge for dataset selection and deep learning technology provided a mechanism for significant improvement of AD prediction.

Accurate and early prediction of AD is significant in the identification of patients for clinical trials, which can possibly result in the discovery of new drugs for treatments of AD. Also, the contribution of the proposed predictions of AD is a better selection of patients who need imaging diagnostics for differential diagnosis of AD from other degenerative brain disorders.

1. Introduction

According to the National Institute of Aging, more than 5.5 million Americans are diagnosed with AD [1]. Estimates indicate that AD may rank third as a cause of death for older people, only second to heart disease and cancer [2], [3]. Identification of individuals at risk for developing AD is imperative for testing therapeutic interventions [4]. Many researchers have presented an overview of the classification of Mild Cognitive Impairment (MCI) [5]. Early diagnosis could help with the recruitment of patients to participate in clinical trials and the testing of possible new drug therapies for AD. Several studies indicate that the use of imaging for early detection of AD is imperative to early diagnosis [6]. Diagnostic imaging can be very costly and the question is what is the cost-effectiveness of imaging in AD [7].

Heterogeneous structures of Electronic Medical Records (EMR) data pose a challenge for machine learning (ML) algorithms [8], [9], [10]. For our study, we used a repository of heterogeneous ambulatory EMR data, collected from primary care medical offices spread over the U.S. These data are typical of ambulatory EMR data from medical practices and not data from clinical trials. The data for this study was sourced from IQVIA and EMR vendors which were then mapped into the Observational Medical Outcomes Partnership (OMOP) format.

ML algorithms utilizing Magnetic Resonance Imaging (MRI) for prediction of AD have been developed [11], [12], [13]. An application of RNN models was designed to differentiate AD patients from healthy control individuals using neuroimaging data [14]. Longitudinal EMRs were used to study the progression of chronic diseases like AD [15], [16]. LSTM RNN can effectively predict AD progression by fully leveraging the temporal and medical patterns derived from patient's office visits.

Our research goal was to implement LSTM RNN deep learning configuration to predict AD diagnosis using EMR data alone (without relying on diagnostic imaging) [17], [18]. The study objective was to show that selection of relevant input datasets is important for overall LSTM RNN model predictive performance. We wanted to determine if applying medical domain knowledge in data preprocessing and positive dataset selection significantly improves the prediction of AD comparing to the naïve model. The most current health problem with Coronavirus (COVID19) which severely affected the entire world, proves the importance of relevant medical data in the creation of analytical and predictive models as well as in the application of adequate preventive health measures [19].

Furthermore, we attempted to efficiently apply the drugs domain in prediction of AD. The objective was also to evaluate the contribution of individual clinical domains as well as the ensemble of few domains to the prediction of AD.

An accurate early prediction of AD could help with the recruitment of patients for clinical trials, which could help find new drug therapies for AD. Also, the contribution of predictions of AD is a better selection of patients who need imaging diagnostics for differential diagnosis of AD.

2. Methods

Our comprehensive methodology comprises relevant inputs selection using medical domain knowledge, and construction of the LSTM RNN deep learning model. We used the ambulatory EMR database to predict the occurrence of AD. Data are in the OMOP common data model (CDM) format. The terminologies used to describe the clinical conditions vary from database to database. In the OMOP concept, data contained in different types of observational databases are transformed into a common format. The OMOP CDM provides a common data standard to analyze multiple data sources concurrently [20]. The OMOP concept allows the evaluation of individual clinical domains separately as well as an ensemble of different domains.

Patients with AD diagnosis in the EMR database were extracted using the OMOP code for AD. We found 24,734 AD patients in the EMR database. This dataset does not contain patients treated with experimental therapies for dementia or AD. It contains only medications that are approved by the FDA and available for everyday use. The measurement domain includes labs measured in ambulatory settings such as blood tests and vital signs. It does not contain data obtained in clinical settings such as biological results about CSF biomarkers related to tau and amyloid. The condition domain data contain only basic diagnoses, not including specific cognitive measurements such as episodic memory deficits according to psychometric tests. We did not use any imaging data.

In the initial experiment, we built positive datasets by setting certain filters, calling this approach naïve. We filtered out conditions (diagnosis) that appeared less than 30 times in the whole dataset which are rare diseases that have small information value and represent noise. We also filtered out patients who had less than four visits. The number of visits was set to four or more to capture the temporal nature of patients’ histories [21].

The negative dataset was selected randomly from the entire ambulatory EMR database and it consists of patients who had 4 visits, who didn't develop AD, and who were born before 1950. By selecting this age group, we made the age distribution of patients in the negative dataset almost identical to the age distribution of patients in positive datasets. The average age in positive datasets when AD was diagnosed was 80.2. About 51% of patients in positive datasets were older than 80 at the time of AD diagnosis. The interval between the last visit before AD and the visit when AD was diagnosed varies between a few days and many years.

Further, we applied medical domain knowledge to build positive datasets. Development of AD could roughly be classified into three stages: preclinical stage, MCI stage, and clinical AD stage. Since the OMOP vocabulary does not have a hierarchical organization like ICD coding, we had to search for terms that match ICD codes for MCI. We included the following conditions from OMOP vocabulary to define the pre-AD stage: Mild Cognitive Impairment, Memory impairment, Organic mental disorder, Amnesia, Forgetful, Cognitive disorder. The final dataset represents the union of the above-mentioned conditions.

Initially, in the naïve setup, we had cases that some patients had for example: knee injury, common cold, and flu in their medical history, and the naïve model was predicting the occurrence of AD. We were instructed by clinicians that data should have some relevance to AD to be accepted by medical experts.

We applied medical domain knowledge and constructed datasets with MCI stage included, to avoid prediction of AD only from unrelated diseases. All selected patients had the MCI condition in the pre-AD stage. None of the visits designated to the pre-AD stage contained the OMOP code for AD diagnosis. We selected all data over a continuous period before the first AD visit date in our positive datasets. The first AD visit date is the date when the patient was diagnosed with AD for the first time. We didn't include visits after the first AD diagnosis, to avoid data leak. Experiments were conducted using the following three domains: conditions, measurements, and drugs. We selected patients who had at least 4 visits and data in each of the three domains to be able to compare prediction results for AD among these domains. This preprocessing resulted in the final relevant dataset of 2,324 patients. We will call this dataset Selected Clinically Relevant Positive (SCRP) dataset. One SCRP positive dataset was constructed for each of the domains.

Experiments with patients who had data in different combinations of two domains were also conducted. We present datasets used in experiments in Table 1 .

Table 1.

Datasets used in the experiments and the number of patients in each of them.

Dataset	Number of patients
Naïve model, patients with 4 visits, all studied domains	2,600
SCRP model, 4 visits, all studied domains	2,324
SCRP model, 3 visits, all studied domains	3,199
SCRP model, 2 visits, all studied domains	3,568
SCRP, 4 visits, conditions and drugs domains	3,726
SCRP, 4 visits, conditions and measurements domains	3,846
SCRP, 4 visits, conditions domain	6,418

Open in a new tab

We also conducted experiments by excluding particular conditions that were used to define the pre-AD stage and making different combinations of relevant conditions.

We developed a model in Python (PyTorch library) that uses LSTM RNN to model a sequence of medical codes and their temporal associations. RNN LSTM contains hidden units that can analyze sequences of events, like EMR. RNN LSTM is suitable for longitudinal datasets with different sizes of time intervals between events which is the case in our application.

The created model consists of an input layer, an embedding layer, a sequence modeling layers, and an output layer ( Fig. 1).

We generated and presented a sequence of visits for each patient to the input layer. Initially, a dictionary was created to map all distinct OMOP codes into indices (integers). Each visit was then represented with a one-hot (feature) vector which contains a value of one at the indices corresponding to the OMOP codes from that visit. Note that the dimensionality of the visit vectors depends on the domain that is being considered for learning (for instance, if the patients’ sequences of visits are extracted from the condition domain, the dimension of the visit vectors will equal the number of distinct OMOP condition codes). The total number of different OMOP codes for conditions (diagnoses) in the starting dataset was 7880. After applying filters in the final naïve setup, the number of different OMOP codes for conditions was 1445, for measurements 782, procedures 545, and for drugs 828. In the SCRP setup, after applying all filters, the number of different OMOP codes for conditions was 1325, for measurements 752, procedures 495, and for drugs 145.

After processing, patients’ records were in the form of sequences of one-hot encoded visit vectors. In the embedding layer, the bags of indices were converted to bags of embedding. The next layer was an RNN sequence modeling layer with LSTM cells and the last layer was the output layer.

First, we conducted all experiments using the naïve positive datasets and then all experiments using the SCRP datasets. We performed experiments with different training: testing ratios with both types of positive datasets. We randomly selected 90% of patients for training and the remaining 10% of patients for testing (234 patients in the SCRP dataset, and 260 patients in the case of the naïve dataset). Furthermore, in the next group of experiments, we kept the size of the testing dataset fixed at 10%. We were decreasing the size of the training dataset, from the starting 2,090 patients to 1,800, then to 1,500, 1,250, and 1,000 patients to identify at what size of the dataset the performance of our model starts decreasing. We also performed experiments with different split ratios of training and testing datasets. We used 80:20, as well as 70:30 ratio in the random split for training and testing datasets.

In the training phase, the AD positive to negative ratio was 1:2 (majority sub-sampling) and in the testing phase, the AD positive to negative ratio was 1:9 in experiments with the naïve dataset and between 1:5 and 1:25 in experiments with SCRP datasets. The ratio 1:9 positive to negative corresponds to the percentage of AD in the general population (10-15%). In SCRP datasets we used ratios between 5-25% in the testing phase to simulate different possible subpopulations. Negative examples that complemented positive datasets, were selected randomly from the preprocessed dataset of patients, that we described earlier. For each domain, we had at least 20 trials. In all experiments number of hidden LSTM units was set to 100, the number of epochs was 100 and the batch size was 100. We used Adam optimizer.

The objective was to predict whether the patient will be diagnosed with AD at the next visit. The evaluation metric was Area Under the Precision-Recall Curve (AUPRC). A precision-recall curve is a plot of the precision (y-axis) and the recall (x-axis) for different thresholds. We used the Softmax function to output a risk score for AD.

We ran our LSTM RNN code separately for each of the three domains. In the end, we combined the results of each domain into an ensemble model (average of the outputs from best performing domains).

3. Results

In the naïve model, after applying filters described in Section 2, 2,600 patients with at least 4 visits were in the positive dataset. Risk score predictions in the naïve model in the form of AUPRCs for models trained separately for two domains (conditions, measurements) are shown in Fig. 2 . The best results were obtained with the measurements (labs) domain. An ensemble of two domains led to the improvement of predictions of risk scores of AD development. Drugs and procedures domains had low accuracy in the naïve model. The precision/recall comparison of models using condition, measurement, and an ensemble of two domains (c.m.) is presented in Fig. 3 .

Fig. 2 — The naïve model results, AUPRC score obtained by LSTM RNN for condition and measurement domains separately and for their ensemble (c.m.) when using the naïve AD dataset selection for patients with at least 4 visits.

Fig. 3 — The average precision/recall comparison of models using condition, measurement, and an ensemble of two domains (combined).

The final SCRP dataset with all three domains and at least 4 visits, resulted in 2,324 patients. We ran the LSTM RNN model using the condition, measurement, and drug domains separately. Initially, we used 10% of data for testing and 90% for training of our model. AD prediction based on the condition domain was similar to the naïve model. However, the measurement (labs) domain produced significantly better results than the naïve model (AUPRC 0.98-0.99).

We achieved a successful application of the drugs domain in our model. Selected drugs (145 distinct drugs) were used to treat different conditions, considered as pre-AD, and they were administered before patients had been diagnosed with AD. These were standard drugs already used in everyday medical practices, and not experimental drugs. We excluded Acetylcholinesterase Inhibitors from this list. Results of AD prediction using only the drugs domain achieved AUPRC of 0.98-0.99. The selection of relevant drugs that are given to treat one of pre-AD conditions and symptoms is the factor that provided the difference in predictions of AD, comparing to the naïve dataset. This approach decreased the number of considered drugs from 828 (naïve model) to 145 in the SCRP model, which reduced noise and allowed improved prediction results.

After we developed the LSTM RNN model for each of domains separately, we joined results into an ensemble model where outputs of three single domain-based models were averaged. Results of the ensemble prediction of all three domains achieved out-of-sample AUPRC above 0.99.

For SCRP dataset experiments, in Fig. 4 we present results of risk scores for the development of AD for each of the studied domains and for the ensemble model (c.m.d) using information from all three domains. The procedure domain had low accuracy and we will not show those results.

Fig. 5 shows Precision-Recall curves for three domains and for the ensemble of three domains for the SCRP dataset. The ensemble of three domains (c.m.d.) achieved the best AD prediction results.

Changes of AUPRC scores when different ratios of training and testing datasets were used in experiments are shown in Table 2 . We kept the fixed size of the testing set and slowly decreased the size of the training set.

Table 2.

Prediction of AD by LSTM RNN trained on the SCRP dataset using the drugs, the measurements, the condition domain, and their ensemble (c.m.d.), with the fixed size of the testing dataset (234 patients) and different sizes of the training dataset. The evaluation metric - AUPRC.

No. of patients in training dataset	No. of patients in testing dataset	AUPRC Drugs	AUPRC Measurements	AUPRC Conditions	AUPRC c.m.d.
2,090	234	0.985 ± 0.040	0.986 ± 0.048	0.651 ± 0.031	0.991 ± 0.038
1,800	234	0.982 ± 0.048	0.985 ± 0.050	0.650 ± 0.040	0.990 ± 0.033
1,500	234	0.869 ± 0.053	0.889 ± 0.060	0.648 ± 0.043	0.908 ± 0.035
1,250	234	0.860 ± 0.049	0.866 ± 0.052	0.647 ± 0.037	0.871 ± 0.042
1,000	234	0.810 ± 0.058	0.814 ± 0.059	0.645 ± 0.052	0.835 ± 0.055

Open in a new tab

When training on 80% and testing on 20% of available data, results of risk score prediction of AD development were almost identical to the initial setup (90:10 split ratio) for all three domains as well as for the ensemble of domains. However, when using 70% of the SCRP data for training and the remaining 30% of data for testing, a decrease of AUPRC for drugs and measurements domains was evident (AUPRC for the drugs domain was 0.87 and AUPRC for measurements was 0.9).

The influence of different splits of the dataset on training and testing sets is illustrated in Fig. 6 for the drugs domain-specific LSTM RNN model. It appears that for each of the scenarios that we attempted when our model started dropping accuracy the critical size of the dataset for the training phase was about 1,500 patients.

Experiments that estimated the effect of increased coverage on prediction quality as well as different numbers of visits (2, 3, or 4) are presented in Table 3 .

Table 3.

Results of experiments with SCRP datasets with different numbers of visits (2, 3, or 4) for drugs, measurements, and conditions domains as well as for an ensemble of all 3 domains (c.m.d.).

Datasets Positive cohorts	No. of patients	AUPRC Drugs	AUPRC Measurements	AUPRC Conditions	AUPRC c.m.d.
4 visits	2,324	0.985 ± 0.040	0.986 ± 0.048	0.651 ± 0.031	0.991 ± 0.038
3 visits	3,199	0.974 ± 0.042	0.973 ± 0.039	0.640 ± 0.032	0.982 ± 0.035
2 visits	3,568	0.893 ± 0.046	0.908 ± 0.048	0.601 ± 0.030	0.924 ± 0.042

Open in a new tab

We conducted additional experiments by making different combinations of relevant conditions and tested the contribution of different pre-AD conditions to the accuracy results. In these experiments, the AURPC score dropped at least 10-15%, because the size of the training dataset was reduced to less than 1,500 AD positive patients which were not sufficiently large to conduct experiments with combinations of different relevant conditions.

4. Discussion

This research demonstrates that our comprehensive method which incorporates SCRP positive datasets based on the application of clinical domain knowledge and design of LSTM RNN deep learning models produced excellent prediction accuracy of AD. The way the inputs were selected proves to be important for the overall quality of the model.

We identified 125 papers on PubMed (MeSH thesaurus vocabulary search) directly describing the application of various ML techniques in diagnostics and prediction of AD. Aghili and collaborators used LSTM RNN models on longitudinal imaging data (MRI, PET) to predict AD and distinguish it from normal control cases [14]. Albright studied whether serial MRI, PET, biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of MCI and early AD and achieved prediction results of the model's average AUC=0.866 [13]. Marzban and colleagues constructed CNN deep learning neatwork using MRI images and achieved prediction results of the onset of AD (ROC-AUC between 0.84 – 0.94) [22]. Arbabshirani and colleagues presented a review of more than 200 studies describing ML techniques in the prediction of AD and other brain diseases, using imaging [23]. Moore designed the path signature model to predict AD, using imaging data and considering conversion to AD stage from healthy individuals or from individuals with MCI [24]. Martinez-Murcia and colleagues implemented data analysis of AD, based on deep convolutional autoencoders. The MRI imaging-derived markers could predict clinical variables with correlations above 0.6, achieving a classification accuracy of over 80% for the diagnosis of AD [25]. MRI images were utilized for the construction of ML models for a diagnosis of MCI and AD, with achieved accuracy up to 75.51% (CNN networks). The prediction that patients with stable MCI will develop AD achieved ROC-AUC of 92.5% [26], [27], [28], [29]. Carpenter and Huang analyzed ML methods (naïve Bayes, kNN, SVM, random forest, and neural networks) for virtual screening (VS) [30]. They presented a workflow for applying ML-based VS to the search for potential therapeutic drugs for AD. VS is important in the drug development process because it performs efficient searches over millions of compounds increasing chances for potential AD drug discovery. Huang and collaborators presented a nonlinear supervised sparse regression-based random forest framework to predict longitudinal AD clinical scores [31]. Ford et al, created ML models (logistic regression, naïve Bayes, SVM, random forest, and neural networks) to predict early dementia using EHR data. They included data on 93,120 patients, with a median age of 82.6 years, and achieved ROC-AUC of 74% [32].

We opted to use an LSTM RNN deep learning approach on heterogeneous EMR data alone (without relying on diagnostic imaging). None of the papers currently indexed in PubMed uses only EMR data for prediction of AD. We hypothesized that for a large number of patients less costly and easier to obtain EMR data have sufficient representational power to incorporate temporal heterogeneous information into the successful prediction of AD. This was a more difficult task then prediction from imaging data, because of the irregular temporal nature of EMR data. Furthermore, EMR data are cheaper and more abundant than expensive imaging MRI and PET data.

The increasing adoption of EMR systems has enabled significant opportunities for the development of ML algorithms in health care [9.32]. EMR patient records provide a valuable resource for creating diagnostic support algorithms for the detection of dementia and AD [9], [32]. Tang and collaborators attempted to predict differential diagnoses of the top 25 most common conditions in the MIMC-III dataset [9]. They developed traditional and sequential (LSTM RNN) models, using ICD9 codes. The advantage of our model, comparing to Tang and collaborators is in the usage of the OMOP data concept which provides the ability to include more data from different sources and create ML models separately for different clinical domains and combine them into an ensemble model. Our approach allows the evaluation of performances of ML models on individual clinical domains as well as on combinations of different clinical domains, and subsequently the selection of the most optimal ML models.

Our results suggest that special consideration is required to identify a medically relevant positive dataset as the optimal input to the LSTM RNN prediction model. The conditions domain was crucial in the selection of relevant information that contributed to the creation of datasets with patients who had MCI conditions before AD was diagnosed. The drugs domain was successfully applied in our model. Other researchers usually attempt to apply all drugs contained in datasets [9]. Tang and colleagues attempted to apply all drugs from the MIMIC dataset in their models and they did not achieve good prediction results of diseases using this domain [9]. Most drugs are not relevant to diseases of interest and contribute to bad prediction as noise. Our research shows that the optimal approach with the drugs domain is to apply class-by-class of drugs and evaluate the contributions of each class and combinations of classes to predictions of diseases, which is different than the model described by Tang, where they attempted to apply all drugs [9]. Although we achieved good results with the drugs domain, the main contribution of this study is in the approach on how to use this domain. The measurements domain provided good predictions in all approaches. Considering the number of doctor's visits, RNN models achieved the best results when data include 4 ambulatory visits vs. relying on fewer visits data [21].

5. Conclusion

Significance of accurate and early prediction of AD could be found in the identification of patients for clinical trials, which can possibly result in the discovery of new drugs for the treatment of AD. Also, the contribution of predictions of AD is a better selection of patients who need some form of imaging diagnostics for AD.

This research sets the framework for future analyses of disease-relevant temporal heterogeneous EMR data. Further research is necessary for the evaluation of different groups of drugs, measurements, and conditions and their contribution to the successful prediction of AD.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This study was supported in part by the IQVIA grant Disease Detection and Disease Progression Modeling and by Pennsylvania Department of Health CURE Health Data Science Research Project.

Footnotes

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.cmpb.2020.105765.

Appendix. Supplementary materials

mmc1.xml^{(212B, xml)}

References

1.https://www.nia.nih.gov/health/alzheimers-disease-fact-sheet.
2.https://www.alz.org/media/Documents/alzheimers-facts-and-figures-2019-r.pdf.
3.National Institutes of Health. National Institute on Aging. What Happens to the Brain in Alzheimer's Disease?Available at:https://www.nia.nih.gov/health/what-happens-brain-alzheimers-disease. September 14, 2018.
4.Wang T., Qiu R.G., Yu M. Predictive modeling of the progression of Alzheimer's disease with recurrent neural networks. Sci. Rep. 2018;8(1):9161. doi: 10.1038/s41598-018-27337-w. 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Roberts R., Knopman D.S. Classification and epidemiology of MCI. Clin. Geriatr. Med. 2013;29(4):753–772. doi: 10.1016/j.cger.2013.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Mosconi L., Brys M., Glodzik-Sobanska L., De Santi S., Rusinek H., de Leon M.J. Early detection of Alzheimer's disease using neuroimaging. Exp. Gerontol. 2007;42(1-2):129–138. doi: 10.1016/j.exger.2006.05.016. [DOI] [PubMed] [Google Scholar]
7.McMahon P.M., Araki S.S., Sandberg E.A., Neumann P.J., Gazelle G.S. Cost-effectiveness of PET in the diagnosis of Alzheimer's disease. Radiology. 2003;228(2):515–522. doi: 10.1148/radiol.2282020915. [DOI] [PubMed] [Google Scholar]
8.J. Zhao, P. Papapetrou, L. Asker, H. Bostrom. Learning from heterogeneous temporal data in electronic health records. 10.1016/j.jbi.2016.11.006. [DOI] [PubMed]
9.Tang F., Xiao C., Wang F., Zhou J. Predictive modeling in urgent care: a comparative study of machine learning approaches. JAMIA Open. 2018;1(1):87–98. doi: 10.1093/jamiaopen/ooy011. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Qiu R.G., Qiu J.L., Badr Y. Proceedings of the International Conference on Grey Systems and Intelligent Services (GSIS) 2017. Predictive modeling of the severity/progression of Alzheimer's diseases; pp. 400–403. [DOI] [Google Scholar]
11.Moradi E., Pepe A., Gaser C., Huttunen H., Tohka J. Machine learning framework for early MRI-based Alzheimer's conversion prediction in MCI subjects. Neuroimage. 2015;104:398–412. doi: 10.1016/j.neuroimage.2014.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Esmaeilzadeh S., Belivanis D.I., Pohl K.M., Adeli E. End-To-End Alzheimer's Disease diagnosis and biomarker identification. In: Shi Y., Suk HI., Liu M., editors. Vol. 11046. Springer; Cham: 2018. (Machine Learning in Medical Imaging). MLMI 2018. Lecture Notes in Computer Science. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.J. Albright. Forecasting the progression of Alzheimer's disease using neural networks and a novel preprocessing algorithm. 10.1016/j.trci.2019.07.001 [DOI] [PMC free article] [PubMed]
14.Aghili M, Tabarestani S, Adjouadi M, Adeli E. Predictive modeling of longitudinal data for Alzheimer's disease diagnosis using RNNs. Proceedings of the International Workshop on Predictive Intelligence In Medicine; Cham; Springer; 2018. [DOI] [Google Scholar]
15.Goldstein B.A., Navar A.M., Pencina M.J., Ioannidis J.P.A. Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review. J. Am. Med. Inform. Assoc. 2017;24(1):198–208. doi: 10.1093/jamia/ocw042. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Yang J., McAuley J, Leskovec J., LePendu P., Shah N. Proceedings of the 23rd international conference on World wide web. 2014. Finding progression stages in time-evolving event sequences. [DOI] [Google Scholar]
17.Hochreiter S., Schmidhuber J. Long Short Term Memory. Neural Comput. 1997;9(8):1735–1780. doi: 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]
18.A. Sherstinsky. Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network. arXiv:1808.03314v4.
19.https://www.cdc.gov/coronavirus/2019-ncov/index.html.
20.https://www.ohdsi.org/data-standardization/the-common-data-model.
21.Ljubic B., Abdel Hai A., Stanojevic M., Diaz W., Polimac D., Pavlovski M., Obradovic Z. Predicting Complications of Diabetes Mellitus Using Advanced Machine Learning Algorithms. JAMIA. 2020 doi: 10.1093/jamia/ocaa120. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Marzban E.N., Eldeib A.M., Yassine I.A., Kadah Y.M. Alzheimer's disease diagnosis from diffusion tensor images using convolutional neural networks. PLoS One. 2020;15(3) doi: 10.1371/journal.pone.0230409. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Arbabshirani M.R., Plis S., Sui J., Calhoun V.D. Single subject prediction of brain disorders in neuroimaging: promises and pitfalls. Neuroimage. 2017;145(Pt B):137–165. doi: 10.1016/j.neuroimage.2016.02.079. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Moore P.J., Lyons T.J., Gallacher J. Using path signatures to predict a diagnosis of Alzheimer's disease. PLoS One. 2019;14(9) doi: 10.1371/journal.pone.0222212. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Martinez-Murcia F.J., Ortiz A., Gorriz JM., Ramirez J., Castillo-Barnes D. Studying the manifold structure of alzheimer's disease: a deep learning approach using convolutional autoencoders. IEEE J. Biomed. Health Inform. 2020;24(1):17–26. doi: 10.1109/JBHI.2019.2914970. [DOI] [PubMed] [Google Scholar]
26.Kam TE., Zhang H., Shen D. A novel deep learning framework on brain functional networks for early MCI diagnosis. Med. Image Comput. Comput. Assist. Interv. 2018;11072:293–301. doi: 10.1007/978-3-030-00931-1_34. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Moscoso A., Silva-Rodríguez J., Aldrey J.M., Cortés J., Fernández-Ferreiro A., Gómez-Lado N., Ruibal A, Aguiar P. Prediction of Alzheimer's disease dementia with MRI beyond the short-term: implications for the design of predictive models. Neuroimage Clin. 2019;23 doi: 10.1016/j.nicl.2019.101837. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Moore P J, Lyons T J, Gallacher J. Random forest prediction of Alzheimer's disease using pairwise selection from time series data. PLoS One. 2019;14(2) doi: 10.1371/journal.pone.0211558. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Spasov S., Passamonti L., Duggento A, Liò P., Toschi N. A parameter-efficient deep learning approach to predict conversion from mild cognitive impairment to Alzheimer's Disease. Neuroimage. 2019;189:276–287. doi: 10.1016/j.neuroimage.2019.01.031. [DOI] [PubMed] [Google Scholar]
30.Carpenter K.A., Huang X. Machine learning-based virtual screening and its applications to Alzheimer's drug discovery: a review. Curr. Pharm Des. 2018;24(28):3347–3358. doi: 10.2174/1381612824666180607124038. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.L. Huang, Y. Jin, Y. Gao, KH. Thung, D Shen. Longitudinal clinical score prediction in Alzheimer's disease with soft-split sparse regression-based random forest. doi: 10.1016/j.neurobiolaging.2016.07.005. [DOI] [PMC free article] [PubMed]
32.Ford E., Rooney P., Oliver S., Hoile R., Hurley P., Banerjee S., van Marwijk H., Cassell J. Identifying undetected dementia in uk primary care patients: a retrospective case-control study comparing machine-learning and standard epidemiological approaches. BMC Med. Inform. Decis. Mak. 2019;19(1):248. doi: 10.1186/s12911-019-0991-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.xml^{(212B, xml)}

[bib0001] 1.https://www.nia.nih.gov/health/alzheimers-disease-fact-sheet.

[bib0002] 2.https://www.alz.org/media/Documents/alzheimers-facts-and-figures-2019-r.pdf.

[bib0003] 3.National Institutes of Health. National Institute on Aging. What Happens to the Brain in Alzheimer's Disease?Available at:https://www.nia.nih.gov/health/what-happens-brain-alzheimers-disease. September 14, 2018.

[bib0004] 4.Wang T., Qiu R.G., Yu M. Predictive modeling of the progression of Alzheimer's disease with recurrent neural networks. Sci. Rep. 2018;8(1):9161. doi: 10.1038/s41598-018-27337-w. 15. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0005] 5.Roberts R., Knopman D.S. Classification and epidemiology of MCI. Clin. Geriatr. Med. 2013;29(4):753–772. doi: 10.1016/j.cger.2013.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0006] 6.Mosconi L., Brys M., Glodzik-Sobanska L., De Santi S., Rusinek H., de Leon M.J. Early detection of Alzheimer's disease using neuroimaging. Exp. Gerontol. 2007;42(1-2):129–138. doi: 10.1016/j.exger.2006.05.016. [DOI] [PubMed] [Google Scholar]

[bib0007] 7.McMahon P.M., Araki S.S., Sandberg E.A., Neumann P.J., Gazelle G.S. Cost-effectiveness of PET in the diagnosis of Alzheimer's disease. Radiology. 2003;228(2):515–522. doi: 10.1148/radiol.2282020915. [DOI] [PubMed] [Google Scholar]

[bib0008] 8.J. Zhao, P. Papapetrou, L. Asker, H. Bostrom. Learning from heterogeneous temporal data in electronic health records. 10.1016/j.jbi.2016.11.006. [DOI] [PubMed]

[bib0009] 9.Tang F., Xiao C., Wang F., Zhou J. Predictive modeling in urgent care: a comparative study of machine learning approaches. JAMIA Open. 2018;1(1):87–98. doi: 10.1093/jamiaopen/ooy011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0010] 10.Qiu R.G., Qiu J.L., Badr Y. Proceedings of the International Conference on Grey Systems and Intelligent Services (GSIS) 2017. Predictive modeling of the severity/progression of Alzheimer's diseases; pp. 400–403. [DOI] [Google Scholar]

[bib0011] 11.Moradi E., Pepe A., Gaser C., Huttunen H., Tohka J. Machine learning framework for early MRI-based Alzheimer's conversion prediction in MCI subjects. Neuroimage. 2015;104:398–412. doi: 10.1016/j.neuroimage.2014.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0012] 12.Esmaeilzadeh S., Belivanis D.I., Pohl K.M., Adeli E. End-To-End Alzheimer's Disease diagnosis and biomarker identification. In: Shi Y., Suk HI., Liu M., editors. Vol. 11046. Springer; Cham: 2018. (Machine Learning in Medical Imaging). MLMI 2018. Lecture Notes in Computer Science. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0013] 13.J. Albright. Forecasting the progression of Alzheimer's disease using neural networks and a novel preprocessing algorithm. 10.1016/j.trci.2019.07.001 [DOI] [PMC free article] [PubMed]

[bib0014] 14.Aghili M, Tabarestani S, Adjouadi M, Adeli E. Predictive modeling of longitudinal data for Alzheimer's disease diagnosis using RNNs. Proceedings of the International Workshop on Predictive Intelligence In Medicine; Cham; Springer; 2018. [DOI] [Google Scholar]

[bib0015] 15.Goldstein B.A., Navar A.M., Pencina M.J., Ioannidis J.P.A. Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review. J. Am. Med. Inform. Assoc. 2017;24(1):198–208. doi: 10.1093/jamia/ocw042. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0016] 16.Yang J., McAuley J, Leskovec J., LePendu P., Shah N. Proceedings of the 23rd international conference on World wide web. 2014. Finding progression stages in time-evolving event sequences. [DOI] [Google Scholar]

[bib0017] 17.Hochreiter S., Schmidhuber J. Long Short Term Memory. Neural Comput. 1997;9(8):1735–1780. doi: 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]

[bib0018] 18.A. Sherstinsky. Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network. arXiv:1808.03314v4.

[bib0019] 19.https://www.cdc.gov/coronavirus/2019-ncov/index.html.

[bib0020] 20.https://www.ohdsi.org/data-standardization/the-common-data-model.

[bib0021] 21.Ljubic B., Abdel Hai A., Stanojevic M., Diaz W., Polimac D., Pavlovski M., Obradovic Z. Predicting Complications of Diabetes Mellitus Using Advanced Machine Learning Algorithms. JAMIA. 2020 doi: 10.1093/jamia/ocaa120. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0022] 22.Marzban E.N., Eldeib A.M., Yassine I.A., Kadah Y.M. Alzheimer's disease diagnosis from diffusion tensor images using convolutional neural networks. PLoS One. 2020;15(3) doi: 10.1371/journal.pone.0230409. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0023] 23.Arbabshirani M.R., Plis S., Sui J., Calhoun V.D. Single subject prediction of brain disorders in neuroimaging: promises and pitfalls. Neuroimage. 2017;145(Pt B):137–165. doi: 10.1016/j.neuroimage.2016.02.079. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0024] 24.Moore P.J., Lyons T.J., Gallacher J. Using path signatures to predict a diagnosis of Alzheimer's disease. PLoS One. 2019;14(9) doi: 10.1371/journal.pone.0222212. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0025] 25.Martinez-Murcia F.J., Ortiz A., Gorriz JM., Ramirez J., Castillo-Barnes D. Studying the manifold structure of alzheimer's disease: a deep learning approach using convolutional autoencoders. IEEE J. Biomed. Health Inform. 2020;24(1):17–26. doi: 10.1109/JBHI.2019.2914970. [DOI] [PubMed] [Google Scholar]

[bib0026] 26.Kam TE., Zhang H., Shen D. A novel deep learning framework on brain functional networks for early MCI diagnosis. Med. Image Comput. Comput. Assist. Interv. 2018;11072:293–301. doi: 10.1007/978-3-030-00931-1_34. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0027] 27.Moscoso A., Silva-Rodríguez J., Aldrey J.M., Cortés J., Fernández-Ferreiro A., Gómez-Lado N., Ruibal A, Aguiar P. Prediction of Alzheimer's disease dementia with MRI beyond the short-term: implications for the design of predictive models. Neuroimage Clin. 2019;23 doi: 10.1016/j.nicl.2019.101837. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0028] 28.Moore P J, Lyons T J, Gallacher J. Random forest prediction of Alzheimer's disease using pairwise selection from time series data. PLoS One. 2019;14(2) doi: 10.1371/journal.pone.0211558. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0029] 29.Spasov S., Passamonti L., Duggento A, Liò P., Toschi N. A parameter-efficient deep learning approach to predict conversion from mild cognitive impairment to Alzheimer's Disease. Neuroimage. 2019;189:276–287. doi: 10.1016/j.neuroimage.2019.01.031. [DOI] [PubMed] [Google Scholar]

[bib0030] 30.Carpenter K.A., Huang X. Machine learning-based virtual screening and its applications to Alzheimer's drug discovery: a review. Curr. Pharm Des. 2018;24(28):3347–3358. doi: 10.2174/1381612824666180607124038. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0031] 31.L. Huang, Y. Jin, Y. Gao, KH. Thung, D Shen. Longitudinal clinical score prediction in Alzheimer's disease with soft-split sparse regression-based random forest. doi: 10.1016/j.neurobiolaging.2016.07.005. [DOI] [PMC free article] [PubMed]

[bib0032] 32.Ford E., Rooney P., Oliver S., Hoile R., Hurley P., Banerjee S., van Marwijk H., Cassell J. Identifying undetected dementia in uk primary care patients: a retrospective case-control study comparing machine-learning and standard epidemiological approaches. BMC Med. Inform. Decis. Mak. 2019;19(1):248. doi: 10.1186/s12911-019-0991-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Influence of medical domain knowledge on deep learning for Alzheimer's disease prediction

Branimir Ljubic

Shoumik Roychoudhury

Xi Hang Cao

Martin Pavlovski

Stefan Obradovic

Richard Nair

Lucas Glass

Zoran Obradovic

Highlights

Abstract

Background and objective

Methods

Results

Conclusion

1. Introduction

2. Methods

Table 1.

Fig. 1.

3. Results

Fig. 2.

Fig. 3.

Fig. 4.

Fig. 5.

Table 2.

Fig. 6.

Table 3.

4. Discussion

5. Conclusion

Declaration of Competing Interest

Acknowledgments

Footnotes

Appendix. Supplementary materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases