Abstract
This study aims to develop an optimally performing convolutional neural network to classify Alzheimer’s disease into mild cognitive impairment, normal controls, or Alzheimer’s disease classes using a magnetic resonance imaging dataset. To achieve this, we focused the study on addressing the challenge of image noise, which impacts the performance of deep learning models. The study introduced a scheme for enhancing images to improve the quality of the datasets. Specifically, an image enhancement algorithm based on histogram equalization and bilateral filtering techniques was deployed to reduce noise and enhance the quality of the images. Subsequently, a convolutional neural network model comprising four convolutional layers and two hidden layers was devised for classifying Alzheimer’s disease into three (3) distinct categories, namely mild cognitive impairment, Alzheimer’s disease, and normal controls. The model was trained and evaluated using a 10-fold cross-validation sampling approach with a learning rate of 0.001 and 200 training epochs at each instance. The proposed model yielded notable results, such as an accuracy of 93.45% and an area under the curve value of 0.99 when trained on the three classes. The model further showed superior results on binary classification compared with existing methods. The model recorded 94.39%, 94.92%, and 95.62% accuracies for Alzheimer’s disease versus normal controls, Alzheimer’s disease versus mild cognitive impairment, and mild cognitive impairment versus normal controls classes, respectively.
Introduction
Alzheimer’s disease (AD) is a neurological condition contributing to the progressive degeneration of cognitive abilities, including memory, visual-spatial perception, and alterations in personality and behavior. This ailment is commonly linked to the geriatric population, although it can impact individuals of all ages. It is postulated that a global population exceeding 1.5 million individuals exhibits indications of this particular ailment, with a probable escalation anticipated within the forthcoming two to three decades [1]. The absence of a viable remedy for AD underscores the importance of timely identification to manage the condition and forestall further deterioration effectively. Individuals in the initial stages of the disease, referred to as mild cognitive impairment (MCI), primarily manifest indications such as memory impairment or diminished decision-making abilities, which may subsequently deteriorate into AD. Detecting MCI early enough would be beneficial for clinicians in preventing further progression [2, 3].
Brain imaging is a known non-invasive technique for detecting diseases through the use of brain scans in the form of magnetic resonance imaging (MRI), computed tomography (CT), or positron emission tomography (PET) [4, 5]. Brain imaging has substantially contributed to the timely diagnosis of AD and detecting the transformation of MCI into AD using a deep learning methodology. Various alternative deep-learning methods have been successfully implemented to accurately categorize AD, including those of Y. Zhang et al. [6], Zhang et al. [7].
Despite these notable achievements, using MRI imaging to diagnose AD or predict its conversion using deep learning is still challenging. One of the key obstacles is the insufficiency of appropriate and uncontaminated data, which adversely impacts the training of deep learning models and consequently undermines their accuracy and generalizability [8–12]. As the data is collected over time, there is a tendency for the datasets not to be enough for training models, and again, due to the varying image scanning protocols, imaging artefacts, and moving artefacts being used during the data collection, MRI images are highly susceptible to noise. Image quality can also vary due to the different imaging equipment being used. To address these challenges, it requires developing more robust deep learning models that are more tolerant to noisy images [13, 14] or applying a data preprocessing approach to addressing image quality [15–18].
The motivation of our study is to propose a more reliable approach to improving image quality with ease for classifying AD with improved accuracy. As such, the research devised a proficient strategy for improving image quality by utilizing bilateral filtering and histogram equalization techniques. The proposed method aimed to reduce noise and enhance brightness in images, thereby improving the quality of images used in categorizing AD. The primary contribution of this study comprises two key components:
First, an image enhancement technique that effectively reduces noise in MRI images while preserving image quality was designed and implemented. The image enhancement algorithm for color images was developed using two image processing techniques: image equalization and bilateral filtering. The algorithm mainly improves the quality of the images while maintaining their edges and details. The bilateral filtering was used to keep the edges and details of the image, while histogram equalization modifies the intensities of the image to improve its contrast.
Second, a CNN model consisting of four convolutional layers and two hidden layers successfully classifies AD more accurately than existing deep learning models. The model was trained using the k-fold cross-validation technique, ensuring that all the images were effectively used in training and testing the model.
The remaining sections of the paper are the literature review, materials and methods, results, and discussion. The literature review section presents a review of some existing state-of-the-art studies. A description of the dataset, the proposed method, and a detailed description of the experiment are presented under the materials and methods section. The results section reveals the study’s experimental results, which are further discussed in the discussion section.
Literature review
This section analyzes prior studies that employed MRI images to construct machine learning and deep learning models to classify AD. A particular emphasis was placed on methods that address data inadequacy and image quality challenges.
Ebrahimi et al. [19] designed a pre-trained ResNet in their deep sequencing model to classify AD. The prognostication of the AD recorded an accuracy of approximately 91.78%, signifying a 10% enhancement compared to other CNNs. Kang et al. [20] developed a deep ensemble 2D CNN architecture incorporating multiple models and slides. The model’s accuracy was 90.36% on normal controls (NC)/AD, 77.19% on AD/MCI classes, and 72.36% on NC/MCI class labels.
Alinsaif and Lang [21] designed a methodology that integrated 3D shearlet-based descriptors to decrease the dimensionality of MRI datasets. The method showed a noteworthy degree of efficacy in identifying AD. Mahendran and P. M. [22] developed a framework to detect AD that achieved an accuracy of 88.7% using a limited number of data records.
Zhang et al. [23] proposed a multi-modal model to facilitate timely AD detection. The proposed method showed superior performance in diagnosing AD, as evidenced by its training on two image modalities, namely PET and MRI, obtained from ADNI.
Spasov et al. [24] deployed a parameter-efficient model to check the conversion of MCI to AD. The approach combines various data types, including MRI images, demographic information, neuropsychological data, and genetic data. The method showed adaptability and has the potential to incorporate additional imaging techniques, including PET images and various collections of medical information.
In their scholarly work, Sharma et al. [25] presented a model capable of extracting all-level features. The model, FDN-ADNet, designed for early AD diagnosis using MRI images, yielded encouraging outcomes. Abrol et al. [26] applied ResNet in analyzing medical images to investigate the conversion of MCI to AD. Initially, the deep models were trained solely on MCI individuals for prediction purposes. Subsequently, a domain transfer learning version was employed, which involved additional training on AD and NC. The frameworks demonstrated significant prominence in the use of the model. The authors of [27] tackled the issue of insufficient data and missing data in diagnosing AD. The accuracy of all four classes of subjects used was observed to have improved by up to 3% when both complete and incomplete samples were considered during the model generation and testing phases.
In their study, Liu et. al. [28] presented a CNN-based approach for detecting AD. The model was specifically designed to train on a limited quantity of MRIs and accurately identify cases of AD. The model’s accuracy was approximately 78.02%, and it was observed that its portability was enhanced as an added advantage. Janghel and Rathore [29] applied a distinctive approach to improve the efficiency of Convolutional Neural Network (CNN) models. They implemented a preprocessing technique on an image dataset before submitting it to the CNN architecture for feature extraction. The experimental findings indicate that an fMRI dataset yielded a precision of 99.95%, whereas the PET data stood at 73.46%.
In their study, Bae et. al. [30] employed transfer learning to forecast the progression of MCI to AD dementia using a 3-D CNN. An 82.4% accuracy recorded for the target task surpassed the performance of existing models in the respective domain. Sathiyamoorthi et al. [31] proposed a method that exploits an adaptive histogram adjustment image correction algorithm to improve the quality of the image in terms of brightness and contrast. The algorithm was applied to segment the interested AD regions in an adaptive and modified manner. The diverse characteristics were computed using a second-order Gray Level Co-Occurrence Matrix. The method classified diseased images and their respective stages based on distinctive features. The empirical findings demonstrate that the suggested approach yields superior levels of precision and efficacy compared to the current system.
Ferri et al. [32] proposed a classification system with LORETA source estimation derived from rsEEG and sMRI variables. The study implemented two artificial neural networks (ANNs) to classify participants as having AD or being part of the control group. The results indicated classification accuracy of 83%, and 86% for the two ANNs, respectively.
In their study, Tong et al. [33] developed a CNN-Sparse Coding Network (CNN-SCN) architecture to detect MCI before it converts. The empirical findings indicate that the model exhibits good stability, accuracy, and generalization. The experiment recorded an accuracy rate of 92.6% in the AD/CN class, while the AD/MCI and MCI/CN classes recorded an accuracy of 74.9% and 76.3%, respectively.
Hridhee et al. [34] investigated and proposed an early detection 2D CNN model for classifying AD using MRI data samples. The researchers applied various image processing and augmentation strategies to the dataset. Subsequently, they trained the data on a VGG16, Xception, and a custom model. The customized model achieved the best performance of the selected models with an accuracy of 94.77%.
Allada et al. [35] aimed to enhance the accuracy of classifying AD and developed a swarm multi-verse algorithm to optimize a deep neuro-fuzzy network for AD classification at various stages. Their study incorporated techniques such as median filtering for preprocessing to improve image quality, a channel-wise feature pyramid network for image segmentation, and CNN for feature extraction. Their model achieved an accuracy of 89.9%, assessed based on k-fold values.
Gowhar et al. [36] introduced a deep learning framework centered on CNN for early AD detection. They implemented various data preprocessing and augmentation techniques on the dataset. The classifier was constructed through feature extraction, reduction, and classification. Their proposed model surpassed existing models, achieving an accuracy of 96.22%.
Tufail et al. [37] developed an early-stage AD classifier using 2D and 3D CNN with PET images. Their study differentiated between binary and multiclass categories of AD. Experimental results highlighted superior performance with an accuracy of 89.21% for the AD/NC binary classification using the 3D-CNN model. They employed data augmentation and 5-fold cross-validation to boost the model’s performance.
This section examined the performance of some deep learning models in classifying AD and is summarized in Table 1.
Table 1. Summary of existing CNN models.
| S/No. | Author | Cite | Method | Dataset | Classification | Accuracy |
|---|---|---|---|---|---|---|
| 1 | (Hridhee et al., 2023) | [34] | CNN | ADNI | AD/NC/MCI/ EMCI/LMCI | 94.77 |
| 2 | (Allada et al., 2023) | [35] | Neuro-fuzzy | Kaggle | AD/NC/MCI/ EMCI/LMCI | 89.9 |
| 3 | (Gowhar et al., 2023) | [36] | CNN | ADNI | AD/NC/MCI/ EMCI/LMCI | 96.22 |
| 4 | (Tufail et al., 2022) | [37] | CNN | ADNI | AD/NC | 89.21 |
| AD/MCI | 71.70 | |||||
| NC/MCI | 62.25 | |||||
| AD/NC/MCI | 59.73 | |||||
| 5 | (Kang et al., 2021) | [20] | CNN | ADNI | AD/NC | 90.36 |
| AD/MCI | 77.19 | |||||
| NC/MCI | 72.36 | |||||
| 6 | (Mahendran & P, 2022) | [22] | EDRNN | GEO Omnibus | AD/MCI | 88.7 |
| 7 | (Tong et al., 2022) | [33] | CNN | ADNI | AD/NC | 92.6 |
| AD/MCI | 74.9 | |||||
| NC/MCI | 76.3 | |||||
| 8 | (Zhang et al., 2022) | [6] | ResNet | ADNI | AD/NC | 90 |
| AD/MCI | 74.9 | |||||
| NC/MCI | 62.6 | |||||
| 9 | (J. Liu et al., 2021) | [28] | CNN | OASIS | AD/NC/ MCI | 78.02 |
| 10 | (Ferri et al., 2021) | [32] | autoencoders | ADNI | AD/NC | 89.0 |
Materials and methods
Subjects/datasets
The datasets were acquired from the AD Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu) [38]. In 2003, the ADNI was established as a public-private collaboration with the primary objective of evaluating the feasibility of integrating MRI, PET, and other biological markers to investigate and curb the degeneration of MCI to AD.
This study extracted MRI images from a sample of 396 subjects diagnosed with AD, 255 subjects classified as NC, and 104 subjects with MCI from the ADNI database captured from 2005 to 2021 and within an age range of 55 to 97 years old. The MRI datasets were initially in DICOM format and were subsequently transformed into JPG format. The images collected for the experiment included 1,581 AD images, 1,310 MCI images, and 1,591 NC images. Each image was initially dimensioned as 256 x 256 pixels and later resized to 64 x 64 pixels.
Method
In this study, we sought to build a deep learning model with optimal performance to classify AD into three categories: MCI, AD, and NC. To achieve this, we proposed an approach to improve the quality of the image datasets during the data preprocessing stage, after which the data volume was increased using data augmentation. The data was then sampled using the K-fold cross-validation technique to ensure that all data samples were used in the training and testing. The method divides the datasets into K folds, of which one-fold is used for testing and the remaining for training. The technique assesses the model’s capability to handle new examples, with K referring to the number of groups into which the data samples are divided. As presented in the subsequent section, a convolutional neural network was proposed, which was trained and evaluated iteratively using 10-fold cross-validation. For each iteration, the model is tested with the one-fold test data, after which the recorded test results are averaged. A detailed representation of the flow of activities in the proposed method is shown in Fig 1.
Fig 1. The proposed method defining the flow of activities.
The experiment was implemented with an NVIDIA GeForce GTX 1060 Graphic Processing Unit (GPU) machine operating on CUDA 10.1 with 8 GB of memory allocation. The model was implemented in Python using the TensorFlow framework and other libraries such as Sklearn, Numpy, Keras, Pillow and Opencv-python.
Data preprocessing
Data preprocessing is an essential step in conducting a deep learning experiment, as the efficacy of the dataset can significantly influence the learning outcomes of the models. The MRI image datasets utilized in this study were initially transformed into JPG format to facilitate manual preprocessing of the datasets. MRI images that were corrupt or of insufficient quality were manually removed. Following manual preprocessing, it was observed that the dataset still contained noisy images with varying brightness levels, highlighting the necessity for additional preprocessing measures.
An algorithm was proposed to address the issue of image noise and varying brightness. The algorithm was designed with histogram equalization and bilateral filtering, as presented under Algorithm 1.
Algorithm 1 Image enhancement algorithm
I ← RGBimage
[Ir, Ig, Ib] ← GetImageChannels(I)
Ir ← histogramEqualize(Ir)
Ig ← histogramEqualize(Ig)
Ib ← histogramEqualize(Ib)
Ir ← bilateralFilter(Ir)
Ig ← bilateralFilter(Ig)
Ib ← bilateralFilter(Ib)
I ← mergeImageChannels(Ir, Ig, Ib)
I ← normalizedImage(I)
Algorithm 1 was designed to improve the contrast of an input image by performing histogram equalization on each color channel, followed by bilateral filtering on the equalized channels. The bilateral filtering aims to maintain the edges and details of the image. The filtered channels are recombined into a unified RGB image and subjected to normalization procedures to enhance overall contrast. Eq (1) defines bilateral filtering.
| (1) |
| (2) |
The equation for the image value at pixel position p is defined as Ip, where Gσ represents a 2D Gaussian kernel. The normalization factor, Wp, is utilized to ensure that the pixel weights sum up to 1.0. The parameters s and r determine the extent of filtering applied to the image. Eq (3) defines the histogram equalization process, which is used to modify an image’s intensities to improve its contrast.
| (3) |
| (4) |
where L is the number of possible intensity values, In is the number of pixels with intensity, n and Tp is the total number of pixels.
Convolutional neural network model
The CNN model in Fig 2 comprised four convolutional layers to extract features from 64 x 64-sized color image inputs. In this model, every convolutional layer is paired with a corresponding max-pooling layer, batch normalization layer, and a dropout layer to reduce the learnable parameters. Each layer also incorporated a rectified linear activation function, a 3 by 3 kernel filter and L2-regularization. All convolutional layers had a stride of one (1), and same padding was used to keep the dimensions of the output feature map the same as the input dimension. Two (2) hidden layers were incorporated into the model, wherein a batch normalization layer and a dropout layer succeeded every layer. The model generated 426,979 parameters, consisting of 425,187 trainable and 1,792 nontrainable. Additional hyperparameters include a learning rate of 0.001, a batch size of 64, and 200 training epochs.
Fig 2. The proposed CNN model for classifying Alzheimer’s disease.
Performance evaluation
Various metrics were used to measure the model’s performance during training and testing. These include recall, precision, accuracy and area under the curve (AUC).
The accuracy metric represents the aggregate count of accurately predicted labels by the model and is defined as Eq (5).
| (5) |
TN is the total number of true positives, TN is the true negatives, FP refers to the false positives, and FN is the false negatives. As shown in Eq (6), precision is the ratio of true positives to the total number of predictions made.
| (6) |
Recall calculates the true positive rates as presented in Eq (7).
| (7) |
The ROC curve’s integral, AUC, corresponds to the likelihood of a positive sample being ranked at random higher than a negative sample. Given that the ROC curve is constructed through the successive joining of coordinates (x1, y1), (x2, y2), …, (xm, ym), the AUC is approximated using Eq (8).
| (8) |
Results
In this study, an algorithm for enhancing images was developed by integrating two image processing techniques: image equalization and bilateral filtering. The algorithm primarily enhances the image quality while preserving its edges and details. Bilateral filtering is employed to maintain the edges and details of the image, whereas histogram equalization adjusts the image’s intensity levels to improve contrast. Fig 3a shows the original images, while Fig 3b shows the images processed by the proposed algorithm.
Fig 3. Applying the proposed image enhancement algorithm.
(a) Original images (b) Transformed images.
In order to ascertain the impact of the image transformation by the algorithm, image quality parameters such as mean pixel intensity, contrast, and entropy were used. The mean pixel intensity calculates the average of the pixel intensities across the image, and contrast evaluates the variability (standard deviation) of pixel intensities, contributing to the sharpness or clarity of the image. An increase in mean pixel intensity indicates that the image has become brighter, while a higher contrast suggests enhanced sharpness. Entropy measures the richness of information in the image’s intensity distribution, with an increase in entropy indicating an improvement in image quality. The mean pixel intensity, contrast, and entropy values were calculated for both the original and enhanced images and visualized in Fig 4. The results indicate that the enhanced images showed more brightness, sharpness and richness of information since the mean pixel intensity, contrast, and entropy are higher.
Fig 4. Image quality assessment.
(a) Mean pixel intensity (b) Contrast (c) Entropy.
This study developed a CNN model to classify AD based on three distinct class labels: AD, MCI, and NC. The study further investigated the binary classification of AD versus MCI, AD versus NC, and MCI versus NC. The model was trained using a 10-fold cross-validation technique, with the learning rate hyperparameter set to 0.001, a batch size of 64, and 200 epochs. The training and testing were done on both the original and enhanced datasets to aid in assessing the impact of the image transformation. The test outcomes are displayed in Table 2.
Table 2. Testing results for classifying AD, MCI and NC.
| Fold | No Image Enhancement | With Image Enhancement | |||
|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Precision(%) | Recall (%) | AUC | |
| 1 | 92.37 | 93.09 | 93.49 | 92.87 | 0.98 |
| 2 | 87.66 | 93.42 | 93.51 | 93.20 | 0.99 |
| 3 | 91.98 | 90.30 | 90.56 | 89.86 | 0.97 |
| 4 | 89.57 | 93.76 | 94.27 | 93.53 | 0.99 |
| 5 | 89.44 | 94.20 | 94.51 | 94.08 | 0.99 |
| 6 | 89.95 | 93.64 | 93.84 | 93.53 | 0.99 |
| 7 | 90.59 | 94.64 | 94.62 | 94.20 | 0.99 |
| 8 | 92.37 | 94.53 | 94.95 | 94.42 | 0.99 |
| 9 | 91.46 | 92.75 | 92.85 | 92.75 | 0.98 |
| 10 | 88.03 | 94.20 | 94.39 | 93.97 | 0.99 |
| Average | 90.34 | 93.45 | 93.70 | 93.24 | 0.99 |
| Standard Deviation | 1.62 | 1.20 | 1.21 | 1.25 | 0.005 |
The model attained an accuracy of 90.34% on the original dataset, while on the transformed dataset, an accuracy of approximately 93.45% was recorded, accompanied by a standard deviation of 1.20%, suggesting the model’s capacity to forecast the labels accurately. The precision of the model in identifying true positives was 93.70%, with a standard deviation of 1.21%. The recorded recall value is 93.24%, with a standard deviation of 1.25%. The model achieved a high AUC score of 0.99 and a low standard deviation of 0.005, suggesting the accuracy of the model’s predictions. The performance of the model is further visualized using the receiver operating characteristic (ROC) curve and confusion matrix in Fig 5.
Fig 5. ROC curve and confusion matrix for classifying AD, MCI and NC.
(a) ROC curve (b) Confusion matrix.
The model underwent further training with binary class examples, namely AD and MCI, AD and NC, and MCI and NC, while maintaining the same hyperparameters and model architecture. The model demonstrated significant efficacy in forecasting the diverse labels, as outlined in Table 3. The test results show a consistent accuracy, precision, and recall score. The model’s accuracy rates for classifying AD versus MCI, AD versus NC, and MCI versus NC stood at 94.92%, 94.39%, and 95.62%, respectively. Our proposed model demonstrated good performance, as evidenced by AUC scores of 0.98, 0.98, and 0.99 for comparing AD versus MCI, AD versus NC, and MCI versus NC, respectively. The confusion matrix and the ROC curve for the binary classification are presented in Figs 6 and 7, respectively.
Table 3. Testing results for classifying AD/MCI, AD/NC and MCI/NC.
| Fold | AD/MCI | AD/NC | MCI/NC | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | Precision | Recall | AUC | Accuracy | Precision | Recall | AUC | Accuracy | Precision | Recall | AUC | |
| 1 | 94.30 | 94.30 | 94.30 | 0.97 | 94.02 | 94.02 | 94.02 | 0.99 | 96.04 | 96.04 | 96.04 | 0.99 |
| 2 | 88.77 | 88.77 | 88.77 | 0.94 | 95.28 | 95.28 | 95.28 | 0.98 | 95.01 | 95.01 | 95.01 | 0.99 |
| 3 | 96.19 | 96.19 | 96.19 | 0.99 | 96.54 | 96.54 | 96.54 | 0.99 | 95.86 | 95.86 | 95.86 | 0.99 |
| 4 | 94.64 | 94.64 | 94.64 | 0.98 | 92.91 | 92.91 | 92.91 | 0.98 | 96.38 | 96.38 | 96.38 | 0.99 |
| 5 | 95.16 | 95.16 | 95.16 | 0.99 | 94.79 | 94.79 | 94.79 | 0.98 | 95.52 | 95.52 | 95.52 | 0.99 |
| 6 | 95.85 | 95.85 | 95.85 | 0.99 | 95.90 | 95.90 | 95.90 | 0.98 | 96.03 | 96.03 | 96.03 | 0.99 |
| 7 | 95.67 | 95.67 | 95.67 | 0.99 | 94.01 | 94.01 | 94.01 | 0.98 | 96.38 | 96.38 | 96.38 | 0.98 |
| 8 | 95.85 | 95.85 | 95.85 | 0.98 | 92.27 | 92.27 | 92.27 | 0.98 | 96.21 | 96.21 | 96.21 | 0.98 |
| 9 | 97.75 | 97.75 | 97.75 | 0.99 | 92.43 | 92.43 | 92.43 | 0.98 | 94.31 | 94.31 | 94.31 | 0.98 |
| 10 | 94.98 | 94.98 | 94.98 | 0.98 | 95.74 | 95.74 | 95.74 | 0.99 | 94.48 | 94.48 | 94.48 | 0.99 |
| AVG | 94.92 | 94.92 | 94.92 | 0.98 | 94.39 | 94.39 | 94.39 | 0.98 | 95.62 | 95.62 | 95.62 | 0.99 |
| S. D. | 2.24 | 2.24 | 2.24 | 0.01 | 1.43 | 1.43 | 1.43 | 0.01 | 0.73 | 0.73 | 0.73 | 0.004 |
Fig 6. Confusion matrix for binary classification.
(a) AD/MCI (b) AD/NC (c) MCI/NC.
Fig 7. ROC curve for binary classification.
(a) AD/MCI (b) AD/NC (c) MCI/NC.
To adequately examine the model’s performance, we conducted a comparative study with prevailing models employed for AD classification. The accuracy metric was the baseline metric for comparing this study to the existing models. The findings suggest the superior efficacy of our research in AD classification, as demonstrated in Table 4.
Table 4. Comparative analysis of existing and proposed model.
| S/No. | Various Models | Cite | Class labels | |||
|---|---|---|---|---|---|---|
| AD/MCI | AD/NC | MCI/NC | Multiclass (AD/MCI/NC or more) | |||
| 1 | (Hridhee et al., 2023) | [34] | - | - | - | 94.77 |
| 2 | (Allada et al., 2023) | [35] | - | - | - | 89.9 |
| 3 | (Gowhar et al., 2023) | [36] | - | - | - | 96.22 |
| 4 | (Tufail et al., 2022) | [37] | 71.70 | 89.21 | 62.25 | 59.73 |
| 5 | (Kang et al., 2021) | [20] | 77.19 | 90.36 | 72.36 | - |
| 6 | (Mahendran and P M, 2022) | [22] | 88.7 | - | - | - |
| 7 | (Tong et al., 2022) | [33] | 74.9 | 92.6 | 76.3 | - |
| 8 | (Zhang et al., 2022) | [6] | 82.5 | 90.00 | 62.6 | - |
| 9 | (J. Liu et al., 2021) | [28] | - | - | - | 78.02 |
| 10 | (Ferri et al., 2021) | [32] | 89.0 | - | - | - |
| 11 | Proposed Model | 94.92 | 94.39 | 95.62 | 93.45 | |
Discussion
The experimental results highlight the superior performance of the proposed method in relation to other methods discussed in the literature, especially in the binary and multiclass classification of AD. As evident from Table 4, the accuracy of the proposed model surpassed that of [6, 20, 22, 32, 33, 37] in binary classification. In multiclass scenarios, our model outperformed the methods from [28, 37] that considered the same number of classes as in this study. Conversely, methods [34, 36] which encompassed more than the three classes in our study exhibited higher accuracies than our proposed model. Thus, the findings suggest that our model stands out as the most effective for binary classification, as well as for multiclass classification, when specifically focusing on AD, MCI, and NC labels.
Conclusion
AD is a prevalent neurological disorder, predominantly affecting the elderly, and remains without a definitive cure. Early detection of AD is paramount to manage and potentially slow its progression from a mild stage to more severe stages. While deep learning offers promising avenues for detecting AD, it also presents specific challenges, with image quality being a prominent one. The study proposed and implemented an image quality scheme using histogram equalization and bilateral filtering to improve the dataset’s quality, which was then used to train the proposed convolutional neural network. The proposed model consisted of four convolutional layers and two fully connected layers. It was trained using 10-fold cross-validation on three class labels: AD, MCI, and NC. The resulting accuracy, precision, recall score, and AUC score stood at 93.45%, 93.70%, 93.24%, and 0.99, respectively. These scores were comparatively higher than those of some selected state-of-the-art existing models.
Supporting information
(PDF)
Acknowledgments
We acknowledge the ADNI for granting us access to use their datasets for this research.
Data Availability
Data cannot be shared publicly because of institutional data access policy. Data are available from the ADNI Institutional Data Access / Ethics Committee (contact via adni.loni.usc.edu) for researchers who meet the criteria for access to confidential data.
Funding Statement
The author(s) received no specific funding for this work.
References
- 1. Liu CF, Padhy S, Ramachandran S, Wang VX, Efimov A, Bernal A, et al. Using deep Siamese neural networks for detection of brain asymmetries associated with Alzheimer’s Disease and Mild Cognitive Impairment. Magn Reson Imaging [Internet]. 2019;64(July):190–9. doi: 10.1016/j.mri.2019.07.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Reiss AB, de Levante Raphael D, Chin NA, Sinha V. The physician’s Alzheimer’s disease management guide: Early detection and diagnosis of cognitive impairment, Alzheimer’s disease and related dementia. AIMS Public Heal. 2022;9(4):661–89. doi: 10.3934/publichealth.2022047 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Sabbagh MN, Boada M, Borson S, Doraiswamy PM, Dubois B, Ingram J, et al. Early Detection of Mild Cognitive Impairment (MCI) in an At-Home Setting. J Prev Alzheimer’s Dis. 2020;7(3):171–8. [DOI] [PubMed] [Google Scholar]
- 4. Hussain S, Mubeen I, Ullah N, Shah SSUD, Khan BA, Zahoor M, et al. Modern Diagnostic Imaging Technique Applications and Risk Factors in the Medical Field: A Review. Biomed Res Int. 2022;2022. doi: 10.1155/2022/5164970 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Kim B, Kim H, Kim S, Hwang Y ran. A brief review of non-invasive brain imaging technologies and the near-infrared optical bioimaging. Appl Microsc. 2021;51(1) doi: 10.1186/s42649-021-00058-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Zhang Y, Teng Q, Liu Y, Liu Y, He X. Diagnosis of Alzheimer’s disease based on regional attention with sMRI gray matter slices. J Neurosci Methods [Internet]. 2022;365(June 2021):109376. doi: 10.1016/j.jneumeth.2021.109376 [DOI] [PubMed] [Google Scholar]
- 7. Zhang J, Zheng B, Gao A, Feng X, Liang D, Long X. A 3D densely connected convolution neural network with connection-wise attention mechanism for Alzheimer’s disease classification. Magn Reson Imaging. 2021;78:119–26. doi: 10.1016/j.mri.2021.02.001 [DOI] [PubMed] [Google Scholar]
- 8. Althnian A, Alsaeed D, Al-baity H, Samha A, Bin Dris A, Alzakari N, et al. Impact of Dataset Size on Classification Performance: An Empirical Evaluation in the Medical Domain. Appl. Sci 2021. doi: 10.3390/app11020796 [DOI] [Google Scholar]
- 9. Gupta S, Gupta A. Dealing with Noise Problem in Machine Learning Data-sets: A Systematic Review. Procedia Comput Sci [Internet]. 2019;161:466–74. doi: 10.1016/j.procs.2019.11.146 [DOI] [Google Scholar]
- 10.Nazari Z, Sayed M, Danish S. Evaluation of Class Noise Impact on Performance of Machine Learning Algorithms. IJCSNS 2018;(August).
- 11. Roxana A, Florin T, Gheorghe L, Grigorovici R, Iancu S, Hlusneac M, et al. Impact of quality, type and volume of data used by deep learning models in the analysis of medical images. Informatics Med Unlocked [Internet]. 2022;29(May 2021):100911. [Google Scholar]
- 12. Varoquaux G, Cheplygina V. Machine learning for medical imaging: methodological failures and recommendations for the future. npj Digital Medicine. 2022; doi: 10.1038/s41746-022-00592-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Ghafari S, Ghobadi Tarnik M, Sadoghi Yazdi H. Robustness of convolutional neural network models in hyperspectral noisy datasets with loss functions. Comput Electr Eng [Internet]. 2021;90(January) [Google Scholar]
- 14. Lin CY, Lin RS. A Noise Robust Convolutional Neural Network by using Noise Removal Techniques. J Inf Hiding Multimed Signal Process. 2022;13(3):178–87. [Google Scholar]
- 15.Zhao M, Shi P, Xu X, Xu X, Liu W, Yang H. Identification System Using Different. 2022; [DOI] [PMC free article] [PubMed]
- 16.Chen X. Image enhancement effect on the performance of convolutional neural networks. 2019;(June):1–40. Available from: www.bth.se
- 17. Heidari M, Mirniaharikandehei S, Khuzani AZ, Danala G, Qiu Y, Zheng B. Improving the performance of CNN to predict the likelihood of COVID-19 using chest X-ray images with preprocessing algorithms. Int J Med Inform. 2020;144 doi: 10.1016/j.ijmedinf.2020.104284 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Caseneuve G, Valova I, LeBlanc N, Thibodeau M. Chest X-Ray image preprocessing for disease classification. Procedia Comput Sci [Internet]. 2021;192:658–65. doi: 10.1016/j.procs.2021.08.068 [DOI] [Google Scholar]
- 19. Ebrahimi A, Luo S, Chiong R. Deep sequence modelling for Alzheimer’s disease detection using MRI. Comput Biol Med. 2021;134(June). [DOI] [PubMed] [Google Scholar]
- 20. Kang W, Lin L, Zhang B, Shen X, Wu S. Multi-model and multi-slice ensemble learning architecture based on 2D convolutional neural networks for Alzheimer’s disease diagnosis. Comput Biol Med. 2021;136(July). [DOI] [PubMed] [Google Scholar]
- 21. Alinsaif S, Lang J. 3D shearlet-based descriptors combined with deep features for the classification of Alzheimer’s disease based on MRI data. Comput Biol Med. 2021;138(September). [DOI] [PubMed] [Google Scholar]
- 22. Mahendran N, P M DRV. A deep learning framework with an embedded-based feature selection approach for the early detection of the Alzheimer’s disease. Comput Biol Med [Internet]. 2022;141(September 2021):105056. doi: 10.1016/j.compbiomed.2021.105056 [DOI] [PubMed] [Google Scholar]
- 23. Zhang F, Li Z, Zhang B, Du H, Wang B, Zhang X. Multi-modal deep learning model for auxiliary diagnosis of Alzheimer’s disease. Neurocomputing [Internet]. 2019;361:185–95. doi: 10.1016/j.neucom.2019.04.093 [DOI] [Google Scholar]
- 24. Spasov S, Passamonti L, Duggento A, Liò P, Toschi N. A parameter-efficient deep learning approach to predict conversion from mild cognitive impairment to Alzheimer’s disease. Neuroimage. 2019;189(January):276–87. doi: 10.1016/j.neuroimage.2019.01.031 [DOI] [PubMed] [Google Scholar]
- 25. Sharma R, Goel T, Tanveer M, Murugan R. FDN-ADNet: Fuzzy LS-TWSVM based deep learning network for prognosis of the Alzheimer’s disease using the sagittal plane of MRI scans. Appl Soft Comput [Internet]. 2022;115:108099. doi: 10.1016/j.asoc.2021.108099 [DOI] [Google Scholar]
- 26. Abrol A, Bhattarai M, Fedorov A, Du Y, Plis S, Calhoun V. Deep residual learning for neuroimaging: An application to predict progression to Alzheimer’s disease. J Neurosci Methods [Internet]. 2020;339(September 2019):108701. doi: 10.1016/j.jneumeth.2020.108701 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Aghili M, Tabarestani S, Adjouadi M. Addressing the missing data challenge in multi-modal datasets for the diagnosis of Alzheimer’s disease. J Neurosci Methods [Internet]. 2022;375(March):109582 doi: 10.1016/j.jneumeth.2022.109582 [DOI] [PubMed] [Google Scholar]
- 28. Liu J, Li M, Luo Y, Yang S, Li W, Bi Y. Alzheimer’s disease detection using depthwise separable convolutional neural networks. Comput Methods Programs Biomed [Internet]. 2021;203:106032. doi: 10.1016/j.cmpb.2021.106032 [DOI] [PubMed] [Google Scholar]
- 29. Janghel RR, Rathore YK. Deep Convolution Neural Network Based System for Early Diagnosis of Alzheimer’s Disease. Irbm [Internet]. 2021;42(4):258–67. doi: 10.1016/j.irbm.2020.06.006 [DOI] [Google Scholar]
- 30. Bae J, Stocks J, Heywood A, Jung Y, Jenkins L, Hill V, et al. Transfer learning for predicting conversion from mild cognitive impairment to dementia of Alzheimer’s type based on a three-dimensional convolutional neural network. Neurobiol Aging. 2021;99:53–64. doi: 10.1016/j.neurobiolaging.2020.12.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Sathiyamoorthi V, Ilavarasi AK, Murugeswari K, Thouheed Ahmed S, Aruna Devi B, Kalipindi M. A deep convolutional neural network based computer aided diagnosis system for the prediction of Alzheimer’s disease in MRI images. Meas J Int Meas Confed [Internet]. 2021;171(June 2020):108838. doi: 10.1016/j.measurement.2020.108838 [DOI] [Google Scholar]
- 32. Ferri R, Babiloni C, Karami V, Triggiani AI, Carducci F, Noce G, et al. Stacked autoencoders as new models for an accurate Alzheimer’s disease classification support using resting-state EEG and MRI measurements. Clin Neurophysiol [Internet]. 2021;132(1):232–45. doi: 10.1016/j.clinph.2020.09.015 [DOI] [PubMed] [Google Scholar]
- 33. Tong Y, Tong Y, Li Z, Huang H, Gao L, Xu M, et al. Application of CNN-SCN in early diagnosis of Alzheimer’ s disease Application of CNN-SCN in early diagnosis of Alzheimer’s disease. 2022; 10.21203/rs.3.rs-2187429/v1 [DOI] [Google Scholar]
- 34.Hridhee RA, Bhowmik B, Hossain QD. Alzheimer’s Disease Classification From 2D MRI Brain Scans Using Convolutional Neural Networks. 3rd Int Conf Electr Comput Commun Eng ECCE 2023. 2023;(August):1–6.
- 35. Allada A, Bhavani R, Chaduvula K, Priya R. Alzheimer’s disease classification using competitive swarm multi-verse optimizer-based deep neuro-fuzzy network. Concurr Comput Pract Exp. 2023;35(21):1–19. doi: 10.1002/cpe.7696 [DOI] [Google Scholar]
- 36. Gowhar M, Bhagat A, Ansarullah SI, Ben Othman MT, Hamid Y, Alkahtani HK, et al. A Novel Framework for Classification of Different Alzheimer’s Disease Stages Using CNN Model. Electron. 2023;12(2):1–14. [Google Scholar]
- 37. Bin Tufail A, Anwar N, Ben Othman MT, Ullah I, Khan RA, Ma Y-K, et al. Early-Stage Alzheimer’s Disease Categorization Using PET Neuroimaging Modality and Convolutional Neural Networks in the 2D and 3D Domains. Sensors [Internet]. 2022. Jun 18;22(12):4609. Available from: https://www.mdpi.com/1424-8220/22/12/4609 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.adni.loni.usc.edu. (n.d.). ADNI | Alzheimer’s Disease Neuroimaging Initiative. Retrieved June 21, 2023; https://adni.loni.usc.edu/







