An Optimized Deep Learning Model for Predicting Mild Cognitive Impairment Using Structural MRI

Esraa H Alyoubi; Kawthar M Moria; Jamaan S Alghamdi; Haythum O Tayeb

doi:10.3390/s23125648

Sensors (Basel). 2023 Jun 16;23(12):5648.

PMCID: PMC10302234 PMID: 37420812

Abstract

Early diagnosis of mild cognitive impairment (MCI) with magnetic resonance imaging (MRI) has been shown to positively affect patients’ lives. To save the time and cost associated with clinical investigation, deep learning approaches have been widely used to predict MCI. This study proposes optimized deep learning models for differentiating between MCI and normal control samples. Previous studies have relied extensively on the hippocampus region of the brain to diagnose MCI. The entorhinal cortex is a promising area for diagnosing MCI because it shows severe atrophy before the hippocampus shrinks. Because the entorhinal cortex is small relative to the hippocampus, limited research has been conducted on using it to predict MCI. This study constructs a dataset containing only the entorhinal cortex area to implement the classification system. To extract the features of the entorhinal cortex area, three neural network architectures are optimized independently: VGG16, Inception-V3, and ResNet50. The best outcomes were achieved using the convolutional neural network classifier with the Inception-V3 architecture for feature extraction, with accuracy, sensitivity, specificity, and area under the curve scores of 70%, 90%, 54%, and 69%, respectively. Furthermore, the model has an acceptable balance between precision and recall, achieving an F1 score of 73%. The results of this study support the effectiveness of our approach in predicting MCI and may contribute to diagnosing MCI through MRI.

Keywords: mild cognitive impairments, deep learning, entorhinal cortex, magnetic resonance imaging, transfer learning

1. Introduction

A computer-aided diagnosis system (CADS) helps to automate the diagnosis process of diseases. Clinicians are increasingly using CADS to help them detect and interpret diseases. CADS identifies regions of an image that may reveal certain problems and notifies doctors during image interpretation [1]. Typically, CADS includes pre-processing, feature extraction, and classification [2].

Alzheimer’s disease (AD) is a neurological disease most commonly linked to memory and cognitive loss. Neurodegenerative disorders are not curable [3], so the goal of medicine is to enhance patients’ well-being and slow the progression of the disease. Even so, early diagnosis may give AD patients better treatment outcomes than those who discover the disease late. Mild cognitive impairment (MCI) is an initial stage of AD. It is a form of memory loss or a decline in other cognitive skills, such as language or vision. Patients with MCI experience greater memory or thinking difficulties than other people of the same age, but their cognitive deficiencies are not severe enough to cause dementia. Studies show that individuals with MCI are more likely than people with normal cognitive abilities to develop Alzheimer’s disease within a few years [4]. According to the National Institute on Aging [5], dementia is estimated to develop in 10% to 20% of MCI patients aged 65 or older within one year.

Brain imaging can detect MCI before clinical symptoms appear [6]. Researchers use medical imaging modalities, including positron emission tomography (PET) and magnetic resonance imaging (MRI) scans, to diagnose AD and MCI. These studies classify MCI based on its biomarkers (indicators that the disease is present), such as decreased grey matter volume. Figure 1 illustrates that the grey matter volume in an AD patient’s brain is lower than that in a healthy brain, whereas the grey matter volume in an MCI patient’s brain falls between the two.

Figure 1. T1-weighted MRI showing the volume of grey matter for normal, MCI, and AD subjects [7]. The left subject shows a normal amount of grey matter, and the grey matter decreases gradually from the MCI subject to the AD subject.

Another significant biomarker related to MCI is the shrinkage of the hippocampus and the entorhinal cortex (EC) area. The EC area is especially important for the early detection of MCI because studies have demonstrated that it shrinks before the hippocampus region [8]. However, few studies have explored the EC area for the diagnosis of MCI because its small size makes it challenging to analyze and difficult to assess with the human eye.

Recently, machine learning approaches have become more widely used to automatically classify medical images [9,10,11]. However, the feature engineering step in machine learning approaches is time-consuming and requires expert knowledge. In contrast, deep learning models can automatically learn relevant features from the raw data, eliminating the need for explicit feature engineering. A recent study [12] employed a mathematical approach called ellipse fitting to detect abnormalities in medical images. However, the ellipse fitting algorithm is sensitive to outliers when the organ being analyzed has an irregular shape, which can significantly reduce the algorithm’s accuracy.

Significant advancements have been made in the field of brain tumor detection. Deep learning algorithms have been used to analyze brain images and identify patterns of brain tumors. Recent studies have demonstrated that convolutional neural networks (CNNs) are highly accurate at detecting brain tumors [13,14,15]. In addition, deep learning algorithms have been shown to distinguish between various forms of brain tumors, assisting medical professionals in determining the best strategy for treatment [16].

The Inception architecture has demonstrated promising outcomes in brain studies, achieving notable performance in brain tumor detection [17,18]. It adopts several advanced techniques, including factorized convolutions and auxiliary classifiers. A factorized convolution reduces the number of parameters in the network by replacing a large convolutional filter with two smaller filters, lowering the overall complexity of the model. Auxiliary classifiers are classifiers added to the network at intermediate layers to reduce overfitting and improve prediction accuracy.
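The parameter saving from factorized convolution can be checked with simple arithmetic. The sketch below is illustrative only: the helper name `conv_weights` and the channel count of 64 are assumptions made for this example, biases are ignored, and the two factorizations shown (a 5×5 filter replaced by two stacked 3×3 filters, and a 3×3 filter replaced by a 1×3 filter followed by a 3×1 filter) are the ones commonly described for Inception-V3 rather than details taken from this study.

```python
def conv_weights(kernel_h, kernel_w, in_channels, out_channels):
    """Number of weights in one convolutional layer (biases ignored)."""
    return kernel_h * kernel_w * in_channels * out_channels


channels = 64  # hypothetical channel count, chosen only for illustration

# One 5x5 convolution versus two stacked 3x3 convolutions
# (both cover the same 5x5 receptive field).
single_5x5 = conv_weights(5, 5, channels, channels)
stacked_3x3 = 2 * conv_weights(3, 3, channels, channels)

# One 3x3 convolution versus a 1x3 convolution followed by a 3x1 convolution.
single_3x3 = conv_weights(3, 3, channels, channels)
asymmetric = conv_weights(1, 3, channels, channels) + conv_weights(3, 1, channels, channels)

saving_5x5 = 1 - stacked_3x3 / single_5x5
saving_3x3 = 1 - asymmetric / single_3x3

print(f"5x5: {single_5x5} weights, two 3x3: {stacked_3x3} weights, saving {saving_5x5:.0%}")
print(f"3x3: {single_3x3} weights, 1x3 + 3x1: {asymmetric} weights, saving {saving_3x3:.0%}")
```

The relative saving does not depend on the channel count, because the channel terms cancel in the ratio.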

A support vector machine (SVM) is a supervised classification algorithm that separates the classes with a hyperplane (decision boundary) [19]. The distance between the hyperplane and the closest data point is called the margin, and the SVM selects the hyperplane that maximizes it. This maximum margin is what gives the SVM its robustness, meaning that it is dependable and avoids errors as much as possible. In practice, the arguments passed when creating the SVM classifier strongly influence the model outcome. The kernel, gamma, and C are arguments that must be tuned to achieve the highest accuracy possible. The combination of a CNN as a feature extractor and an SVM as a classifier has been shown to perform medical imaging tasks effectively [20].
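To make the notion of a margin concrete, the following minimal sketch computes the geometric margin of a given hyperplane for a small, linearly separable toy dataset. The function name `geometric_margin`, the four data points, and the two candidate hyperplanes are all hypothetical and chosen for illustration. The code does not train an SVM; it only compares two fixed hyperplanes.

```python
import math


def geometric_margin(w, b, points, labels):
    """Smallest signed distance from any point to the hyperplane w.x + b = 0.

    Labels are +1 or -1. A negative result means at least one point lies on
    the wrong side of the hyperplane.
    """
    norm = math.sqrt(sum(wi * wi for wi in w))
    return min(
        y * (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm
        for x, y in zip(points, labels)
    )


# Hypothetical 2D toy data: two points per class.
points = [(2.0, 2.0), (3.0, 3.0), (0.0, 0.0), (-1.0, 0.0)]
labels = [1, 1, -1, -1]

# Two candidate decision boundaries that both separate the classes.
margin_a = geometric_margin((1.0, 1.0), -2.0, points, labels)  # x + y - 2 = 0
margin_b = geometric_margin((1.0, 0.0), -1.0, points, labels)  # x - 1 = 0

print(f"margin of x + y - 2 = 0: {margin_a:.3f}")
print(f"margin of x - 1 = 0:     {margin_b:.3f}")
```

Both hyperplanes classify every point correctly, but the first leaves a wider gap to the nearest points, so a maximum-margin classifier would prefer it.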

Using pre-trained CNN architectures (such as VGG, Inception, and ResNet) to extract features of the EC is a powerful technique that can save time and resources compared to training a model from scratch. The effectiveness of this technique can be explained theoretically in two ways. (1) A pre-trained network used as a feature extractor can capture both low-level and high-level visual information of the EC area because these models have been trained on large datasets, such as ImageNet, with 1,000 classes. (2) Training a CNN from scratch on a small dataset can lead to overfitting, so extracting the features of the EC area with our proposed method reduces this risk.

In this study, we aim to perform automatic classification of MCI disease from MRIs using the EC area. To achieve this goal, we define the study objectives as follows: (1) construct a dataset for the EC area from MRIs with normal and MCI subjects; (2) investigate using different collections of MRI slices as inputs for the classification system; (3) explore different neural network architectures, including VGG16, Inception-V3, and ResNet50, to extract the features of the EC area; and (4) classify subjects using machine learning algorithms, including CNN and SVM. There are two important contributions of this study. First, this work expands on the studies conducted on the EC area since there is a limited body of knowledge addressing EC and linking it with neurological diseases. Second, to our knowledge, no dataset for the EC area is available. In this work, we used MRIs from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset to extract the left and right EC areas and use them as inputs for the proposed classification models.

The remaining sections of this document are organized as follows: Section 2 presents the state-of-the-art classification systems for predicting MCI, Section 3 explains the proposed system, Section 4 reports the experimental results, and Section 5 presents the conclusions and recommendations for further studies.

2. Related Work

Many studies have been conducted to classify MCI and normal control (NC) samples, some of which used the whole brain to diagnose MCI. For this literature review, we focused on studies based on the region of interest (ROI) approach because they relate more closely to the purpose of this study. The ROI approach considers only the informative part of the brain, since using the whole brain also includes areas that are unaffected by the disease [21]. This approach reduces the overall complexity of the system by reducing the inputs. In these studies, the ROI was either the hippocampus region or the entorhinal cortex area. The related works are divided into three sections based on the inputs to the model: the whole brain, the hippocampus region, and the EC area.

2.1. Classification Systems Based on the Whole Brain Image

Mehmood et al. [22] proposed a deep learning system for classifying three groups: NC, late MCI (LMCI), and early MCI (EMCI). The authors used the grey matter biomarker to differentiate between the samples, employing the VGG-19 network architecture. The convolutional layers were frozen, while the last two fully connected layers and the classification layers were modified. The authors proposed two models: Model 1 freezes the first eight convolutional layers with three max pooling layers, and Model 2 freezes the first twelve convolutional layers with four max pooling layers. Model 2 achieved better results than Model 1. The highest accuracy obtained was 87% for differentiating between NC and EMCI and 89% for classifying NC vs. LMCI. Several reports used the transfer learning approach to train systems with limited data [22,23], producing comparable classification performance. However, the computational requirement was enormous because whole brain MRIs were used as inputs.
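The two freezing schemes described for [22] can be put in perspective by counting how many convolutional parameters each one fixes. The sketch below is a rough illustration that assumes the standard VGG-19 layout (3×3 kernels, a three-channel input, and five blocks of 2, 2, 4, 4, and 4 convolutional layers with 64, 128, 256, 512, and 512 output channels). The names `VGG19_BLOCKS`, `conv_layer_params`, and `frozen_params` are introduced here for illustration and do not come from the cited work.

```python
# Standard VGG-19 convolutional stack: (output channels, number of conv layers) per block.
# Each block is followed by one max pooling layer, which has no parameters.
VGG19_BLOCKS = [(64, 2), (128, 2), (256, 4), (512, 4), (512, 4)]


def conv_layer_params(in_channels=3, kernel=3):
    """Return the parameter count (weights + biases) of every conv layer, in order."""
    params = []
    for out_channels, n_layers in VGG19_BLOCKS:
        for _ in range(n_layers):
            params.append(kernel * kernel * in_channels * out_channels + out_channels)
            in_channels = out_channels
    return params


def frozen_params(n_frozen_conv_layers):
    """Parameters fixed when the first n convolutional layers are frozen."""
    return sum(conv_layer_params()[:n_frozen_conv_layers])


total = sum(conv_layer_params())
for name, n_frozen in [("Model 1", 8), ("Model 2", 12)]:
    frozen = frozen_params(n_frozen)
    print(f"{name}: {frozen:,} of {total:,} conv parameters frozen ({frozen / total:.0%})")
```

Under this assumed layout, the first eight convolutional layers end at the third block and the first twelve end at the fourth block, which is consistent with the three and four max pooling layers mentioned for Model 1 and Model 2.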

Senanayake et al. [24] combined two modalities, MRI and neuropsychological measures with 35 features, to diagnose MCI; however, the neuropsychological features are not presented clearly in the paper. The obtained accuracy was 75% for classifying MCI samples from NC. Other previous studies removed the skull and the neck from the MRI scans [25,26,27]. In Wang et al. [26], intensity normalization was also performed on the MRIs before they were fed into the CNN model. The authors achieved an accuracy of 86%. However, the dataset used to validate the system was relatively small. Table 1 lists the parameters used in previous studies conducted on whole brain images.

Table 1.

Mild cognitive impairment classification systems based on whole brain image.

Reference	Input	Dataset	Architecture	Accuracy
[23] 2017	MRI	ADNI (MCI 400, NC 229)	CNN 9 layers	MCI vs. NC: 91%
[24] 2018	MRI	ADNI (MCI 193, NC 161)	CNN: dilated, residual, and dense connections	MCI vs. NC: 75%
[28] 2018	MRI	ADNI (MCI 70, NC 70)	CNN: autoencoder	MCI vs. NC: 86%
[26] 2019	MRI	ADNI (MCI 297, NC 315)	CNN: dense connections	MCI vs. NC: 98%
[27] 2020	MRI	ADNI (NC 162, MCIc 76)	CNN and ensemble learning	MCIc vs. NC: 79%
[22] 2021	MRI	ADNI (NC 85, EMCI 70, LMCI 70)	Transfer learning using VGG-19	EMCI vs. NC: 87%; LMCI vs. NC: 89%

Table 2.

Reference	Input	Dataset	Segmentation	Feature Extraction	Classification	Accuracy (MCI vs. NC)
[30] 2017	MRI	ADNI (MCI 399, NC 228)	AAL Template	CNN	DL: CNN	66%
[31] 2018	MRI + DTI	ADNI (MCI 399, NC 228)	AAL Template	CNN	DL: CNN	80%
[33] 2019	MRI	ADNI (MCI 50, NC 50)	ICBM Template	GLCM and Gabor filters	ML: SVM	82%
[32] 2020	MRI	ADNI (MCI 60, NC 60)	3D Slicer	CNN	DL: CNN	78.1%

Table 3.

Reference	Input	Dataset	ROI	Segmentation	Features	Classification	Performance
[36] 2019	MRI	ADNI + BU-ADC (NC 46, MCI 50)	EC, Hip, and other cortical measures	FreeSurfer	Volume	Logistic Regression	Accuracy, NC vs. MCI: 94%
[35] 2020	MRI	ADNI (NC 194, MCI 200)	EC	FreeSurfer	Texture + Volume	Logistic Regression	AUC, NC vs. MCI: 71%
[37] 2021	MRI	MDCN (aMCI-s 29, aMCI-m 22, NC 26)	EC	FreeSurfer	Thickness, Surface Area, and Volume	Statistical Analyses: ANOVA	AUC, NC vs. aMCI-s: 76%; NC vs. aMCI-m: 79%

Table 4.

Characteristic	MCI	NC
Sample Size	337	442
Age (year, mean ± SD)	76.3 ± 7.6	76.3 ± 6.3
Gender (male:female)	220:117	247:177

Table 5.

Experiment	MRI Slices Included	Feature Extraction	Classifier	Accuracy
Expt 1.1	Slice 0–255	VGG16	CNN	0.55
Expt 1.2	Slice 130–140	VGG16	CNN	0.58
Expt 1.3	Exclude uninformative slices	VGG16	CNN	0.60

Table 6.

Classifier	Number of PCs	Feature Extraction	Accuracy
SVM	2500	VGG16	0.45
SVM	5000	VGG16	0.51
SVM	7500	VGG16	0.53
SVM	10,000	VGG16	0.52
SVM	17,500	VGG16	0.52

Classifier	Feature Extraction	Epoch	Optimization Method	Learning Rate	Accuracy
CNN	VGG16	300	SGD	0.1	0.62
CNN	VGG16	300	SGD	0.01	0.63
CNN	VGG16	300	SGD	0.001	0.63
CNN	VGG16	300	SGD	0.0001	0.60
CNN	VGG16	300	SGD	0.00001	0.59

Classifier	Feature Extraction	Epoch	Accuracy
CNN	Inception-V3	25	0.62
CNN	Inception-V3	50	0.63
CNN	Inception-V3	100	0.65
CNN	Inception-V3	200	0.67
CNN	Inception-V3	300	0.64

Classifier	Feature Extraction	Epoch	Optimization Method	Accuracy
CNN	Inception-V3	300	Adam	0.67
CNN	Inception-V3	300	SGD	0.56
CNN	Inception-V3	300	RMSprop	0.49

Classifier	Feature Extraction	Epoch	Accuracy
CNN	ResNet50	25	0.54
CNN	ResNet50	50	0.61
CNN	ResNet50	100	0.63
CNN	ResNet50	200	0.61
CNN	ResNet50	300	0.63

Table 17.

Experiment	Classifier	Feature Extraction	Accuracy	F1 Score	Sensitivity	Specificity	AUC
Expt 5.1	CNN	VGG16	0.70	0.66	0.69	0.70	0.68
Expt 5.2	CNN	Inception-V3	0.70	0.73	0.90	0.54	0.69
Expt 5.3	CNN	ResNet50	0.73	0.65	0.58	0.84	0.69
Expt 5.4	SVM	VGG16	0.76	0.47	0.34	0.64	0.63
Expt 5.5	SVM	Inception-V3	0.66	0.54	0.66	0.45	0.63
Expt 5.6	SVM	ResNet50	0.69	0.58	0.48	0.68	0.67
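
For readers less familiar with the reported metrics, the sketch below shows how accuracy, sensitivity, specificity, precision, and the F1 score follow from the four confusion-matrix counts. The counts are hypothetical: they assume a balanced test set of 100 MCI and 100 NC subjects and were chosen only so that sensitivity and specificity equal 0.90 and 0.54. The study's actual test-set composition is not given here, so the resulting accuracy and F1 score are not expected to reproduce the values in the table. The function name `classification_metrics` is likewise an assumption for this example.

```python
def classification_metrics(tp, fn, tn, fp):
    """Compute common binary classification metrics from confusion-matrix counts.

    tp/fn count MCI subjects predicted as MCI/NC; tn/fp count NC subjects
    predicted as NC/MCI.
    """
    sensitivity = tp / (tp + fn)  # recall for the MCI class
    specificity = tn / (tn + fp)  # recall for the NC class
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
    }


# Hypothetical counts: 100 MCI subjects (90 detected), 100 NC subjects (54 correct).
metrics = classification_metrics(tp=90, fn=10, tn=54, fp=46)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

The example illustrates why high sensitivity paired with low specificity can still give a comparatively high F1 score: the F1 score depends on precision and recall for the positive (MCI) class and does not reward correct NC predictions directly.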
