Abstract
Parkinson’s disease (PD) is a chronic and progressive neurological disease that mostly shakes and compromises the motor system of the human brain. Patients with PD can face resting tremors, loss of balance, bradykinesia, and rigidity problems. Complex patterns of PD, i.e., with relevance to other neurological diseases and minor changes in brain structure, make the diagnosis of this disease a challenge and cause inaccuracy of about 25% in the diagnostics. The research community utilizes different machine learning techniques for diagnosis using handcrafted features. This paper proposes a computer-aided diagnostic system using a convolutional neural network (CNN) to diagnose PD. CNN is one of the most suitable models to extract and learn the essential features of a problem. The dataset is obtained from Parkinson’s Progression Markers Initiative (PPMI), which provides different datasets (benchmarks), such as T2-weighted MRI for PD and other healthy controls (HC). The mid slices are collected from each MRI. Further, these slices are registered for alignment. Since the PD can be found in substantia nigra (i.e., the midbrain), the midbrain region of the registered T2-weighted MRI slice is selected using the freehand region of interest technique with a 33 × 33 sized window. Several experiments have been carried out to ensure the validity of the CNN. The standard measures, such as accuracy, sensitivity, specificity, and area under the curve, are used to evaluate the proposed system. The evaluation results show that CNN provides better accuracy than machine learning techniques, such as naive Bayes, decision tree, support vector machine, and artificial neural network.
Keywords: Parkinson’s disease, convolutional neural networks, MRI
1. Introduction
Parkinson’s disease (PD) is one of the brain diseases that occur due to disorder in the neurological system of the brain. The thalamus is a region in the human brain that contains neurons and has an important role in transmitting sensory information to the brain. Another region of the human brain is the substantia nigra, which contains dopaminergic neurons. Dopamine, a neurotransmitter essential for motor coordination and control, is produced and released by these neurons [1]. Dopamine provides signals to the brain and other parts of the body related to movement and coordination. During Parkinson’s disease, dopamine chemical generation decreases and causes neuron death [2]. Parkinson’s disease symptoms include shakes, slowness in muscle movement, stiffness, imbalance, or postural instability. There are some other symptoms as well, such as slowness in thinking, voice disorder, fatigue, anxiety, and depression. Sleep may also become disturbed and concentration may be lost [3]. There is no medical lab or medical test to diagnose this disease [4]. Traditionally, medical experts have used past records and neurological investigations. However, this approach is not that accurate because of many reasons and similar neurodegeneration diseases. It is difficult to diagnose after much loss of dopamine chemicals. The correct detection of PD is very important. If a patient is diagnosed as healthy, with time, this disease becomes worse, which is difficult to control. Machine learning is widely used in many medical disease diagnoses, like heart disease detection, cancer detection, Alzheimer’s disease detection, and many more. Regarding PD, there are many symptoms that can be present in a Parkinson’s disease patient. These symptoms or features can be age, voice, brain images, etc., in different patterns. So, on the basis of these features, we can classify this disease as PD if a patient has these features or symptoms by using machine learning techniques. In the current era of technology, the trend of making everything automated has been started, which is reaching medical diagnosis as well. Automation can increase the speed and precision of medical diagnosis. Healthcare professionals can gain from the helpful decision making assistance that automated technologies can provide. By employing vast amounts of medical information and data, these technologies can assist physicians in making informed decisions, providing likely diagnoses, and prescribing appropriate tests or treatments. This can help in standardizing diagnostic processes, ensuring consistency in evaluations, and reducing diagnostic variability. These tools can aid in the early identification and prevention of sickness. Automation enables scalability and improved accessibility of medical diagnostics. Different automated and semi-automated systems have been developed for disease classifications [4,5,6,7,8,9]. In the same way, different researchers have attempted to classify PD by using machine-learning-based techniques. Most of these techniques are suport vector machine, neural network, Bayesian learning, decision tree method, etc. In articles [10,11,12], different machine-learning-based approaches that have been applied on people with PD are discussed. A research work in [13] applied the random forest approach on a dataset adopted from ADRC, which contains voice recordings of people with PD and healthy controls. The simulation results showed a 99.25% accuracy rate. However, this technique is not applied on different features and datasets.
Parkinson’s Classification Based on Machine Learning (ML) and Deep Learning (DL) Techniques
This section is dedicated to recent literature on Parkinson’s classification using different machine learning (ML) and deep learning (DL) techniques. Most of the techniques are fully automated, while some are semi-automated.
A work in [14] proposed a novel intelligent model using DL techniques that analyzes gait information. In order to build deep neural network architecture, a 1D convolutional network is used. The model receives 18-ID signals from foot sensors, which measure vertical ground reaction force (VGRF). The algorithm is tested on Parkinson’s detection and prediction of severity of Parkinson’s. The authors claimed an accuracy of 98.7% achieved by the proposed model.
In [15], the authors introduced an intelligent system that can detect PD from vowels. The features from the vowels are extracted by using singular value decomposition (SVD) and minimum average maximum (MAMa) tree. Further, 50 distinctive features are selected using feature selection techniques. For classification purposes, they used KNN classifier and obtained 92% accuracy.
In [16], they presented a CNN-based model for classification of PD and HC from neuromelanin-sensitive magnetic resonance imaging (NMS-MRI). Neuromelanin-sensitive MRI is a medical imaging technique that allows experts to study the abnormities with detail in substantia nigra pars compacta (SNc). The dataset used in this study comprises the NMS-MRI of 45 subjects in total, where 25 are PD and 35 are HC. The authors claim a superior testing accuracy of 80%.
In [17], the authors proposed a novel intelligent system, where all regions of the brain are covered by a network. Feature vectors are collected from every region of the brain and random forest is used to select relevant features. Lastly, support vector machine is applied in order to combine all the futures along with the ground truth. This model is trained and tested on the Parkinson’s Progression Markers Initiative (PPMI) dataset, including 169 HC and 374 PD subjects. The authors claimed an accuracy of 93%.
The article in [18] proposed a machine-learning-based technique to diagnose Parkinson’s disease by developing a multilayer feed forward neural network (MLFNN). They obtained the dataset from Oxford Parkinson’s datasets, which include the voice measurements of 31 subjects, where 21 of them are PD patients, while rest of the subjects are healthy controls. In total, eight different attributes on the basis of frequency (tremor) are selected. For classification, the k-means algorithm is used. The simulation results showed sensitivity 83.3%, specificity 63.6%, and accuracy 80%.
In [19], another model of PD classification was introduced. The dataset used in this study is adopted from the UCI repository. The swarm optimization technique has been applied for features extraction, while naive Bayes has been applied for classification. The authors claimed 97.5% accuracy.
In [20], the authors used non-motor features for diagnosis purposes of PD. These features are the collection of olfactory loss, sleep behavior disorder, and rapid eye movement (REM). Further, the non-motor features were combined with dopaminergic imaging markers and cerebrospinal fluid measurement features. The dataset used in the experiments was obtained from PPMI, in which 401 were PD subjects while 183 were healthy controls. Boosted tree, SVM, random forest, and Bayes were used for classification purposes. The results showed 96.4% in terms of accuracy with SVM. In the literature, it was studied that non-motor symptoms, including cognitive decline, trouble sleeping, mood problems, and autonomic dysfunction, may show up in the early stages of Parkinson’s disease (PD), even before the appearance of motor symptoms. By considering non-motor traits in addition to motor symptoms, clinical experts can make a more accurate and speedy diagnosis, leading to appropriate treatment and therapy. In addition to Parkinson’s disease, other neurological illnesses can also cause non-motor symptoms. The specific pattern and combination of non-motor symptoms can assist differentiating PD from other disorders to aid in the differential diagnosis process. In PD, non-motor symptoms could manifest before those that are motor.
In [21], the author proposed a novel intelligent model for classifation of PD. This approach is based on GA)-Walvet kernet(WK)-Extreme learning machine (ELM). The neural network was trained by ELM. WK-ELM uses three different parameters, which are adjustable. The ideal values for parameters are calculated with the support of genetic algorithm. The authors obtained a 96% accuracy rate with a dataset taken from the UCI library, which contains voice measurements of 31 subjects, where 23 are PD patients.
In [22], a CNN model, AlexNet, was presented for classifation of PD. The model is trained on 2820 HC and 3296 PD MR images and tested on 705 HC and 824 PD MR images using the transfer learning technique. The PPMI dataset was used in this study. This model achieved 88.9%, 89.30%, and 88.40% results in terms of accuracy, sensitivity, and specificity, respectively.
In [13], the authors performed experiments on Parkinson’s and Alzheimer’s diseases. A fully automated system was introduced based on different intelligence and deep learning algorithms, such as decision tree, random forest, boosted tree, bagging, and MLP. The dataset used in the research was adopted form Alzheimer’s Disease Research Center (ADRC), which contained a total of 890 subjects’ data, where 65% of cases belonged to Alzheimer’s, while 40% were PD subjects. According to this paper, alcohol, genes, and age are the main influencing factors regarding AD and PD. According to the author, accuracy of 99.25% has been achieved on random forest and MLP. A research work in [23] worked on susceptibility weighted imaging (SWI) scan. SWI is a medical imaging technique in MRI. This technique has the capability to visualize the susceptible variations in detail for many issues like blood iron, with the support of contract enhancement. SVM is used for classification of Parkinson’s and Parkinsonisms at an isolated level and obtained an accuracy of 86%. A local dataset is used, having 36 subjects’ records, where 16 were PD patients and 20 were Parkinsonisms.
In [24], the authors worked on three classes of classification regrading PD, progressive supranuclear palsy (PSP), and HC. The advanced stage of PD is PSP; its progression is very high and it is less reactive to medication. The dataset used in this study consists of the MRIs of 84 subjects. The authors applied principal computer analysis (PCA) for feature extraction, while SVM was used as a classifier. Their accuracy is about 88% on an average basis.
In [25], a multimodel on MR images was proposed. In this study, SVM was applied as a classifier. This model obtained the results 86.96 %, 92.59 %, and 78.95% in terms of accuracy, specificity, and sensitivity, respectively. A local dataset was used in this research, which contained a total of 46 subjects, where 19 belonged to PD and 27 belonged to HC.
An author in [26] worked on TRODAT and SPECT images to detect the PD. In this regard, the authors presented an ANN-based model. Striatal and striatum pixel values were obtained from images, and these were then fed to ANN as input. This model obtained an accuracy of 94%.
A comprehensive analysis of prior work is presented in Table 1.
Table 1.
Reference | Data Type | Number of Subjects | Methods Used | Accuracy | Year |
---|---|---|---|---|---|
[24] | MRI Scans | PD (n = 28) | Voxel-based morphometry | PD vs. HC: 83.2 | 2014 |
HC (n = 28) | Principal component analysis | PSP vs. PD: 84.7 | |||
PSP (n = 28) | Support vector machine | PSP vs. HC: 86.2 | |||
[27] | MRI Scans | Tremor dominant PD (n = 15) | Voxel-based morphometry | 100 | 2014 |
ET with rest tremor (n = 15) | Diffusion tensor imaging | ||||
Support vector machine | |||||
[28] | MRI Scans | PD (n = 518) | Self-organizing maps | 99.9 | 2015 |
HC (n = 245) | Support vector machine | ||||
SWEDD (n = 68) | |||||
[29] | MRI Scans | PD (n = 30) | Region-of-interest-based | 86.67 | 2015 |
HC (n = 30) | Support vector machine | ||||
[30] | MRI Scans | PD (n = 204) | Volumetry | 80 | 2016 |
MSA-C (n = 21) | Support vector machine | ||||
PSP-RS (n = 106) | |||||
MSA-P (n = 60) | |||||
[31] | MRI Scans | PPMI cohort | Joint feature-sample selection | 81.9 | 2016 |
HC (n = 169) | |||||
PD (n = 374) | |||||
[32] | MRI Scans | HC (n = 38) | Functional connectome | 80 | 2017 |
PD (n = 27) | Support vector machine | ||||
[17] | MRI Scans | HC (n = 169) | Connectivity measures | 93 | 2018 |
PD (n = 374) | Support vector machine | ||||
[33] | MRI Scans | PD (n = 26) | Voxel-based morphometry | PD vs. MSA: 96 | 2018 |
HC (n = 26) | T2 relaxometry, DTI | ||||
MSA-P (n = 16) | Self-organizing maps | ||||
MSA-C (n = 13) | |||||
[34] | MRI Scans | HC (n = 39) | NM-MRI-based atlas of Substantia nigra | 79.9 | 2019 |
PD (n = 40) | |||||
[16] | MRI Scans | HC (n = 35) | NM-MRI-based atlas of Substantia nigra | 89 | 2019 |
PD (n = 25) | |||||
[22] | MRI Scans | PPMI | CNN model AlexNet | 88.9 | 2019 |
HC = 82 | Transfer learning | ||||
PD = 100 |
There are different factors involved in PD patients, like olfactory loss, rapid eye disorder, sleep disturbance, postural unbalancing, cerebrospinal fluid, and dopaminergic imaging. There is a need to consider all these features and apply a classification technique that can correctly diagnose people with PD. CNN has shown state-of-the-art accuracy in a number of biomedical image classifications. Recently, Billones, Ciprian D. et al. [35] adjusted the parameters of a VGGNet model for Alzheimer’s detection and succeeded with 91.85% accuracy. Likewise, [36] obtained an accuracy of 93.16% for cerebral microbleeds in MRI. Due to the high accuracy of CNN with MR images, it is applied for PD detection and succeeded in obtaining satisfactory results. The main advantage of the proposed system is that it is a simple convolutional network with limited training parameters; hence, the training time is shorter than state-of-the-art models. A general limitation of the proposed model is that it deals with Parkinson’s disease as a binary classification problem; however,; there are some other diseases closely related to Parkinson’s, such as Parkinsonism, dementia, and Alzheimer’s, etc. It would be good to develop a system that can classify these diseases in a multiclass classification. Figure 1 shows the overall operation of the proposed system.
Regarding the order of the remainder of this research paper, Section 2 covers the materials and methods. The results and experiments of the proposed methods are discussed in Section 3. Section 4 is reserved for the discussion regarding the results. Finally. Section 5 concludes the research and also presents the future work. The main contributions of this paper are four-fold:
Achieved the state-of-the-art mean accuracy, sensitivity, specificity, and area under the curve as 96, , , and percent, respectively.
Dealing with limited data, this model was developed in such a manner that reduces the overfitting problem.
Low-computational-power GPU was used and obtained satisfactory results as compared to other techniques.
Specific patches were extracted from the samples.
2. Materials and Methods
2.1. Data Acquisition
The dataset utilized in this analysis was made available by the PPMI. The PPMI is a multi-study facility with the goal of discovering trustworthy biomarkers and performing an early Parkinson’s disease diagnosis. Additionally, it is the project with the greatest data, which includes a sizable number of clinical, imaging, and biological samples. It is claimed that PPMI offers the largest dataset of its kind, and their samples are known as the benchmark of PD for research purposes across the globe [37]. A total of 500 samples (T2 weighted MR scan) were obtained in Digital Imaging and Communications in Medicine (DICOM) format with the followed parameters, Plane=AXIAL Acquisition Flip Angle = 150.0 degree, Matrix X = 228.0 pixels, Matrix Y = 256.0 pixels, Matrix Z = 54.0, Slice Thickness = 6.0 mm, Pulse Sequence=Spin echo, Pixel Spacing Y = 0.9375 mm, Pixel Spacing X = 0.9375 mm. The data contained 250 numbers of PD and 250 HC samples, balanced data. Dataset is used in such a way that 70% is used for training, 20% for testing, while 10% for validation. The dataset is available on (http://www.ppmi-info.org). Table 2 represents the details of the subject in terms of gender and age, while Figure 2 shows the difference between the MRI scan of a healthy subject and Parkinson’s patient.
Table 2.
Total | Male | Female | Age (Years) | |
---|---|---|---|---|
PD | 250 | 173 | 77 | 60 ± 10 |
HC | 250 | 136 | 114 | 60 ± 10 |
2.2. Pre-Processing
The MR images were initially stored in the DICOM format and then converted into JPEG using publicly available software known as DICOM to JPEG. Each subject’s data consisted of 45 slices, and only slice number 22 was collected per subject since this slice provides the accurate image of the substantia nigra in PD class. Substantia nigra is a structure in the mid-brain area that controls movement and motor coordination. Dopamine is a substance that is produced in this area and is employed as a signal transmitter. This sends signals about movement and coordination to the brain and other parts of the body. A stack was created by combining slice number 22 from all the subjects. To align the images, intensity-based image registration was carried out using the OpenCV library on the stack. Image registration is the procedure of lining up scans of the brain or other pertinent regions taken from people with Parkinson’s disease. Using image registration techniques, this alignment establishes the spatial relationship between the pictures, enabling a consistent and uniform analysis. By ensuring that the pictures are in a uniform coordinate system, image registration eliminates variances brought on by changes in patient placement or scanning procedures. The primary objective of image registration was to eliminate unwanted and irrelevant information, which could lead our model to learn unnecessary and redundant features. For obtaining a perfect image of substantia nigra, the mid-brain section was cropped using the freehand region of interest (ROI) technique with a window size of 33 × 33. Freehand ROI was used for cropping because the size of the specific organ varies in different patients, and, instead of using fix ROI cropping, the freehand region of interest (ROI) technique provides us better control in cropping the exact position of the organ. This image was the final input to the CNN model. Figure 3 provides a visual representation of the preprocessing steps.
2.3. Convolutional Neural Network Architecture
CNN architecture have been widely used for image-relevant tasks, such as image recognition and classification of images, etc. The use of CNNs has effectively improved the performance of many image-relevant tasks. For example, a deep-CNN-based COVID-19 diagnosis system was proposed in [38].The author claimed that the deep-CNN-based dignosis of COVID-19 from sounds like dry cough outperforms other models. In an another article, CNN has been proposed for classification of lung diseases [39]. In this article, the authors applied CNN on chest X-ray images and classified lung diseases into five different disease classes. The results of CNN-based classification were higher than existing methods. The main building blocks in CNN architecture are convolutional layers, activation functions, feature maps, max pooling, and regularization. The CNN architecture begins with convolutional layer that accepts input and uses the convolutional kernels to process the spatial information in local receptive field and report activation value using activation function. The convolutional layers can be stacked over one another, which enables CNN to extract and learn features in increasingly complex hierarchy and provides a features map. The number of generated feature maps depends on the number of convolutional filters used. The activation function encodes the pixel-level spatial neighborhood activation at respective pixel location in feature map. The max pooling layer is after the feature map layer. The main purpose of using the max pooling layer is to reduce the input dimensionality, reduce the risk of overfitting, and reduce computational costs. The result of the max pooling layer can be given to another convolutional layer to create a hierarchical structure. The final feature maps are fully connected to every neuron on dense layer. Finally, softmax function as activation function is also used for classification purpose. Following are the main building blocks of CNN model.
2.3.1. Weights Initialization
Right weight initialization has the key role of deep learning, which reduces the convergence time and brings stability in loss function even after thousands of iterations. Xavier initializer is incorporated in this study that maintains activation variance and back propagation gradient in controlled levels [40,41].
(1) |
In Equation (1), U is the normal distribution, where w is tensor, the weight of input layer, and is that of output layer.
2.3.2. Convolution of Kernels
After the start of convolution on image, feature maps are generated. Each kernel has a feature map. Feature map F can be calculated using the equation below.
(2) |
where M shows kernel and N shows input channel (2).
2.3.3. Activation Function
Non-linearity into the system is introduced by the activation function. A number of activation functions have been proposed and are still under research. Each activation function has some limitations and is not suitable for every situation: for instance, sigmoid kill gradient. However, ReLU obtained better results when compared with sigmoid and hyperbolic tangent function but suffers from dying ReLU problem. For instance, the large gradient flows through the ReLU update the weights that will never activate at any data point. The other issue with ReLU activation function is that it ignores gradients smaller than zero. LeakyReLU is the improved form of ReLU and tackles the dying ReLU problem by bringing the negative gradient into it. ReLU is defined as
(3) |
LeakyReLU is defined as
(4) |
Here, is the leakiness parameter, which may be a real number between 0 and 1.
2.3.4. Pooling
Dimensionality of the feature map is reduced by pooling; it makes the system ignore small changes, such as small intensity and illumination change. The prominent pooling layer is max pooling, min pooling, average pooling. The min and max select features with the minimum, maximum value in the pooling kernel, respectively, while the average pooling calculates the average of the features in pooling kernel and returns the average effect of all features. Max pooling is used here in this study, which can be formalized as
(5) |
where k and l correspond to the spatial positions.
2.3.5. Regularization
The main purpose of regulation is to avoid model overfitting. A number of overfitting techniques are available; however, L1 and L2, global average pooling, global max pooling, and batch normalization are well-known among them. Dropout is another effective regulation technique that randomly switches the neurons on and off to learn effectively and contribute in the overall output individually. In this paper, we use Dropout, which removes neurons with probability p. The value of the Dropout can be any real value between 0 and 1. The working of Dropout can be observed by the following formula [42]:
(6) |
where is the probable result of the unit k, is the set of all thinned network, is the output of unit M, and shows the probability function.
2.3.6. Fully Connected layers or Dense layers
It is the last layer after convolutional layers. Here, each pixel of the image is considered as neuron and given to each neuron in the fully connected layer. A classifier is used for classification at the end of architecture. Softmax is most common classifier in deep neural networks. It can be defined using Bayes theorem [43].
(7) |
where is the targeted class to find, and is the j = 1,2,3, …, class. Its exponential form is as under [43]:
(8) |
2.3.7. Loss Function
It is used to calculate the compatibility between the given ground truth label and predicted values. The loss function can be custom-designed for a particular task. There are many loss functions based on the nature of the learning problem, but the most common lost function that is used in classification task is categorical cross-entropy. The categorical cross-entropy is used as the cost function. It can be formalized as
(9) |
Here, c is the actual target class, while is the predicted class in Equation (9).
2.4. Proposed Network Architecture
Our proposed model receives MRI as input and eventually labels it as PD or HC. This method takes advantage of the deeper CNN with a small convolutional kernel of size 3 × 3 throughout the network. The smaller convolutional kernel has fewer parameters to estimate and allows learning and generalizing from limited training data. Conversely, the larger convolutional kernel has larger parameters to estimate, is difficult to generalize, and demands high availability of training data. Each convolutional kernel is followed by advanced activation function (i.e., LeakyReLU). LeakReLU addresses simple ReLU issues, such as dying ReLU by adjusting negative gradients on back propagation. The recently developed batch normalization is used before every LeakyReLU layer to improve the performance. It has the ability to accelerate the training process of the network. The proposed network takes input patch of dimension 33 × 33. The first three convolutional layers are followed by max pooling layer with kernel dimension of 3 × 3 and stride 2 × 2. The output of the first max pooling layer (i.e., feature maps) is 64 (number of channels) 16 × 16 in dimension. The max pooling layer is used to reduce overall dimensionality, which results in fewer learnable parameters. The output feature maps of first max pooling layer are forwarded to next three convolutional layers. The output feature maps of the sixth convolutional layer (i.e., 128 × 16 × 16 in dimension) are forwarded to the second max pooling layer with kernel size of 3 × 3 and stride of 2 × 2. The output feature maps of this pooling layer have dimensions of 128 × 7 × 7. These feature maps are then fully connected to the FC (fully connected) layers. There are two FC layers. The first FC layer has 512 neurons, while the second has 256 neurons. Advanced regularization technique is used, which is dropout with 0.1 value in both FC layers to reduce the risk of network overfitting. At the end of network, softmax layer is used to obtain the classification probabilities. Figure 4 shows graphical representation of the proposed model, while Table 3 shows the architecture along with used parameters of proposed model. In Table 3, “Type” column conv means convolutional layer and Max-pool means max pooling layer. In “Inputs” column, the first value is the number of input channels and next two values are the dimension of the feature map or patch size.
Table 3.
Layer No. | Type | Filter Size | Stride | # Filters | FC Units | Input |
---|---|---|---|---|---|---|
Layer 1 | Conv. | 3 × 3 | 1 × 1 | 64 | - | 33 × 33 |
Layer 2 | Conv. | 3 × 3 | 1 × 1 | 64 | - | 64 × 33 × 33 |
Layer 3 | Conv. | 3 × 3 | 1 × 1 | 64 | - | 64 × 33 × 33 |
Layer 4 | Maxpool. | 3 × 3 | 2 × 2 | - | - | 64 × 33 × 33 |
Layer 5 | Conv. | 3 × 3 | 1 × 1 | 128 | - | 64 × 16 × 16 |
Layer 6 | Conv. | 3 × 3 | 1 × 1 | 128 | - | 128 × 16 × 16 |
Layer 7 | Conv. | 3 × 3 | 1 × 1 | 128 | - | 128 × 16 × 16 |
Layer 8 | Maxpool. | 3 × 3 | 2 × 2 | - | - | 128 × 16 × 16 |
Layer 9 | FC | - | - | - | 512 | 6272 |
Layer 10 | FC | - | - | - | 256 | 512 |
Layer 11 | FC | - | - | - | 2 | 256 |
3. Results
This section discusses the outcome performance of the network on the Parkinson’s dataset.
3.1. Performance Measures
Area under curve (AUC), classification accuracy, sensitivity, and specificity are used to evaluate the performance of the proposed model. These are calculated as in Equations (10)–(12).
(10) |
(11) |
(12) |
3.2. Experimental Setup
The NVIDIA Geforce 940MX GPU, which supports Keras, has been utilized to run the CNN. The Keras Python deep learning API enables the usage of both Theano and Tensorflow. The Theano library and Sequential model are used as a CNN model.
3.3. Experiments
In order to evaluate the robustness of the network, a number of experiments have been performed. The proposed model has been applied to the patches as well as to the complete image. Different model parameters have been tested to obtain a robust model. Furthermore, several times, the model has been executed to find the performance validity of the proposed model. The accuracy of the proposed model has been recorded after each epoch. The system is evaluated on the training set as well as on the test set. Four of the experiments are elaborated below. Experiments 1 and 2 show the highest and the lowest accuracy, respectively, archived during training and validation with the same input and network settings. In experiment 3, the last convolutional layer is eliminated in order to reduce the computational cost while keeping the input the same as in experiments 1 and 2. However, in experiment 4, the same networking settings are maintained. The network is tested on the full mid-brain as input rather than ROI patches.
3.4. First Experiment
In experiment 1, Figure 5a shows the training and testing accuracy. The x-axis shows the number of epochs, while the Y-axis shows accuracy. The line in green color is for validation accuracy, while blue is for training accuracy. The accuracy increases with the number of epochs. The validation accuracy reached up to 98%. Figure 5b shows the training loss vs. validation loss. The × axis represents the numbers of epochs and the y-axis represents the loss. Figure 5c shows the ROC curve for the proposed architecture on the test set. The x-axis represents the false positive rate and the y-axis represents the true positive rate. In this experiment, the proposed model obtained an AUC of 0.94 on the test set.
3.5. Second Experiment
In experiment 2, Figure 6a–c show the results of the same model repeated for 50 epochs on the same input patches for validating the performance of the model. In this experiment, the training and validation accuracy decreased to 95%. This is the least accuracy obtained by the proposed model. The 3% decrement in the accuracy is due to the noise in input.
3.6. Third Experiment
In experiment 3, Figure 7a–c show the results of the third experiment in which the last convolutional layer is eliminated. This last layer is removed to reduce the computational cost, but the accuracy of the model is greatly affected. The AUC remains constant on the test set, but the accuracy is reduced to 65% on validation as well as on testing. The AUC remains the same up to 94% on the test set.
3.7. Fourth Experiment
In experiment 4, Figure 8a–c show the results of the next experiment in which the model is applied to a full slice of MRI instead of patches. The AUC is constant, but the accuracy is reduced to 85%. The comparison of the several experiments shows that the proposed architecture performed better with patches and produced high AUC and accuracy on validation and test sets.
4. Discussion
Numerous experiments have employed various network setups. The network parameters include layer count, input size, and other network features. The accuracy has consistently ranged from 94 to 98 percent. In Table 4, the AUC, sensitivity, specificity, and average classification accuracy have been shown. Also, we have generated the confusion matrix representing the true and false classification in Table 5.
Table 4.
Accuracy | Specificity | Sensitivity | AUC |
---|---|---|---|
96 ± 2 | 96.87 ± 3.13 | 95.83 ± 0 | 94.5 ± 0.5 |
Table 5.
Predicted | |||
---|---|---|---|
Yes | No | ||
Actual | Yes | 49 | 1 |
No | 3 | 47 |
Detection of PD from MR imaging cannot be considered a novel task since many researchers have attempted to classify PD and HC. Table 1 represents a comprehensive analysis of the prior work. In these studies, different groups of ML techniques are applied that contain supervised models, such as SVM [17,24,27,30,32,44], and unsupervised models, like [28,33]. These models achieved promising results, but their accuracies are more likely variable accuracies. In many of the mentioned works, the authors used millions of features of single or multiple modalities with limited datasets using SVM, which creates a hyperplane in the n-feature dimensional space. Using this strategy can achieve high accuracy, but it has a chance of overfitting.
To compare the results, with Alexnet [22] being the pioneers to use a novel strategy of using ROI base patches, when it comes to the biological domain, PD is associated with the substantia nigra. There is a chance of high structural changes in this organ as compared to the rest of the brain. Providing the input of the specific image (substantia nigra) to the network rather than the full image is the key factor of achieving promising results. The performance comparison of our model with other classifiers can be seen in Table 6. The results show that the proposed model suppresses the previous models, while experiment 4 confirms the involvement of mid-brain (patches) in PD classification.
Table 6.
Method | Accuracy | Sensitivity | Specificity | AUC |
---|---|---|---|---|
PDF, PCA, SVM [45] | 73.1 | 67.5 | 78.7 | - |
Complex network, SVM [17] | 88 | 85 | 88 | - |
CNN(AlexNet), Transfer learning [22] | 88.9 | 89.3 | 88.4 | - |
SVM [46] | 92.3 | 90 | 94 | 97 |
GA-ELM [47] | 89.22 | 92.35 | 92.35 | - |
SVM [44] | 86.67 | - | - | - |
ResNet50 | 82 | 83.5 | 85 | 85.4 |
Proposed CNN Model | 96 | 96.8 | 95.8 | 95 |
5. Conclusions
In conclusion, this paper proposes a customized CAD system that utilizes convolutional neural networks to accurately classify MRI patches into Parkinson’s and healthy patterns. The model successfully extracts and learns the patterns from the training samples of the benchmark PPMI dataset, resulting in improved results. The findings demonstrate that the proposed model can autonomously learn accurate features of Parkinson’s disease. However, the study highlights the challenge of overfitting in working with a limited dataset. Nevertheless, the proper design and integration of the dropout layer in the model enable effective suppression of the overfitting problem. Overall, the proposed CNN-based model offers a promising approach for the automatic and precise classification of Parkinson’s disease, and it has the potential to benefit clinical practice in the future.
5.1. Contribution
With the increasing trend of computer-aided diagnosis, it has become feasible to avail these technologies for diagnosing complex diseases. Despite limited resources, these technologies are being used along with machine learning approaches for diagnosis of different diseases in many biomedical research labs. A computer-aided diagnosis based on convolutional neural network is presented in this paper. The performance of the model has been analyzed in detail on the basis of accuracy, sensitivity, specificity, and AUC. One of the main objectives of the proposed system is to reduce the incorrect diagnosis of PD and to detect the disease in early stages to improve the QoF of patients. To the best of our knowledge, this is the very first attempt to apply convolutional neural network on ROI for classification of Parkinson’s disease. Although there is no cure for the disease itself, there are treatments available that help in reducing the symptoms for newly diagnosed patients. This maintains QoF for as long as possible.
5.2. Future Work
The proposed network is more simple, with fewer feature maps and layers. The complex features can be learned by complex organization of the network; however, a complex network requires a huge amount data. In the future, it is intended to work on the problem as the dataset is also updating with new patient records. The efforts will continue to improve the correct diagnosis of Parkinson’s disease.
Acknowledgments
Researchers supporting project number (RSPD2023R576), King Saud University, Riyadh, Saudi Arabia.
Author Contributions
Conceptualization, S.S.H. and M.A.; methodology, S.S.H.; software, X.D.; validation, S.S.H., X.D. and P.M.S.; formal analysis, S.U.I. and X.D.; investigation, X.D.; resources, S.S.H. and X.D.; data curation, S.U.I.; writing—original draft preparation, S.S.H.; writing—review and editing, S.S.H. and X.D.; visualization, S.S.H. and I.A.K.; supervision, X.D.; project administration. X.D.; funding acquisition, F.A.A. and E.A.A.I. All authors have read and agreed to the published version of the manuscript.
Data Availability Statement
The data will be made available on request.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This project is funded by King Saud University, Riyadh, Saudi Arabia.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Dauer W., Przedborski S. Parkinson’s disease: Mechanisms and models. Neuron. 2003;39:889–909. doi: 10.1016/S0896-6273(03)00568-3. [DOI] [PubMed] [Google Scholar]
- 2.Shetty S., Rao Y. SVM based machine learning approach to identify Parkinson’s disease using gait analysis; Proceedings of the 2016 International Conference on Inventive Computation Technologies (ICICT); Coimbatore, India. 26–27 August 2016; pp. 1–5. [Google Scholar]
- 3.Nackaerts E., Ginis P., Heremans E., Swinnen S.P., Vandenberghe W., Nieuwboer A. Retention of touchscreen skills is compromised in Parkinson’s disease. Behav. Brain Res. 2020;378:112265. doi: 10.1016/j.bbr.2019.112265. [DOI] [PubMed] [Google Scholar]
- 4.Shah P.M., Zeb A., Shafi U., Zaidi S.F.A., Shah M.A. Detection of Parkinson Disease in Brain MRI using Convolutional Neural Network; Proceedings of the 2018 24th International Conference on Automation and Computing (ICAC); Newcastle Upon Tyne, UK. 6–7 September 2018; pp. 1–6. [Google Scholar]
- 5.Khan H., Shah P.M., Shah M.A., ul Islam S., Rodrigues J.J. Cascading handcrafted features and Convolutional Neural Network for IoT-enabled brain tumor segmentation. Comput. Commun. 2020;153:196–207. doi: 10.1016/j.comcom.2020.01.013. [DOI] [Google Scholar]
- 6.Khan M., Shah P.M., Khan I.A., Islam S.u., Ahmad Z., Khan F., Lee Y. IoMT-Enabled Computer-Aided Diagnosis of Pulmonary Embolism from Computed Tomography Scans Using Deep Learning. Sensors. 2023;23:1471. doi: 10.3390/s23031471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Shah P.M., Khan H., Shafi U., Islam S.u., Raza M., Son T.T., Le-Minh H. 2d-cnn based segmentation of ischemic stroke lesions in mri scans; Proceedings of the Advances in Computational Collective Intelligence: 12th International Conference, ICCCI 2020; Da Nang, Vietnam. 30 November–3 December 2020; Berlin/Heidelberg, Germany: Springer; 2020. pp. 276–286. Proceedings 12. [Google Scholar]
- 8.Shah P.M., Ullah F., Shah D., Gani A., Maple C., Wang Y., Abrar M., Islam S.U. Deep GRU-CNN model for COVID-19 detection from chest X-rays data. IEEE Access. 2021;10:35094–35105. doi: 10.1109/ACCESS.2021.3077592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Shah P.M., Ullah H., Ullah R., Shah D., Wang Y., Islam S.u., Gani A., Rodrigues J.J. DC-GAN-based synthetic X-ray images augmentation for increasing the performance of EfficientNet for COVID-19 detection. Expert Syst. 2022;39:e12823. doi: 10.1111/exsy.12823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Thapa S., Adhikari S., Ghimire A., Aditya A. Feature selection based twin-support vector machine for the diagnosis of Parkinson’s disease; Proceedings of the 2020 IEEE 8th R10 Humanitarian Technology Conference (R10-HTC); Kuching, Malaysia. 1–3 December 2020; pp. 1–6. [Google Scholar]
- 11.Zhao H., Cao J., Wang R., Lei Y., Liao W.H., Cao H. Accurate identification of Parkinson’s disease by distinctive features and ensemble decision trees. Biomed. Signal Process. Control. 2021;69:102860. doi: 10.1016/j.bspc.2021.102860. [DOI] [Google Scholar]
- 12.Bind S., Tiwari A.K., Sahani A.K., Koulibaly P., Nobili F., Pagani M., Sabri O., Borght T., Laere K., Tatsch K. A survey of machine learning based approaches for Parkinson disease prediction. Int. J. Comput. Sci. Inf. Technol. 2015;6:1648–1655. [Google Scholar]
- 13.Joshi S., Shenoy D., Rrashmi P., Venugopal K., Patnaik L. Classification of Alzheimer’s disease and Parkinson’s disease by using machine learning and neural network methods; Proceedings of the 2010 Second International Conference on Machine Learning and Computing; Bangalore, India. 9–11 February 2010; pp. 218–222. [Google Scholar]
- 14.El Maachi I., Bilodeau G.A., Bouachir W. Deep 1D-Convnet for accurate Parkinson disease detection and severity prediction from gait. Expert Syst. Appl. 2020;143:113075. doi: 10.1016/j.eswa.2019.113075. [DOI] [Google Scholar]
- 15.Tuncer T., Dogan S., Acharya U.R. Automated detection of Parkinson’s disease using minimum average maximum tree and singular value decomposition method with vowels. Biocybern. Biomed. Eng. 2020;40:211–220. doi: 10.1016/j.bbe.2019.05.006. [DOI] [Google Scholar]
- 16.Shinde S., Prasad S., Saboo Y., Kaushick R., Saini J., Pal P.K., Ingalhalikar M. Predictive markers for Parkinson’s disease using deep neural nets on neuromelanin sensitive MRI. Neuroimage: Clin. 2019;22:101748. doi: 10.1016/j.nicl.2019.101748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Amoroso N., La Rocca M., Monaco A., Bellotti R., Tangaro S. Complex networks reveal early MRI markers of Parkinson’s disease. Med. Image Anal. 2018;48:12–24. doi: 10.1016/j.media.2018.05.004. [DOI] [PubMed] [Google Scholar]
- 18.Olanrewaju R.F., Sahari N.S., Musa A.A., Hakiem N. Application of neural networks in early detection and diagnosis of Parkinson’s disease; Proceedings of the 2014 International Conference on Cyber and IT Service Management (CITSM); Washington, DC, USA. 27 June–2 July 2014; pp. 78–82. [Google Scholar]
- 19.Ghanad N.K., Ahmadi S. Combination of PSO algorithm and naive Bayesian classification for Parkinson disease diagnosis. Adv. Comput. Sci. Int. J. 2015;4:119–125. [Google Scholar]
- 20.Prashanth R., Roy S.D., Mandal P.K., Ghosh S. High-accuracy detection of early parkinson’s disease through multimodal features and machine learning. Int. J. Med. Inform. 2016;90:13–21. doi: 10.1016/j.ijmedinf.2016.03.001. [DOI] [PubMed] [Google Scholar]
- 21.Avci D., Dogantekin A. An expert diagnosis system for parkinson disease based on genetic algorithm-wavelet kernel-extreme learning machine. Parkinson’s Dis. 2016;2016 doi: 10.1155/2016/5264743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Sivaranjini S., Sujatha C. Deep learning based diagnosis of Parkinson’s disease using convolutional neural network. Multimed. Tools Appl. 2019;79:15467–15479. doi: 10.1007/s11042-019-7469-8. [DOI] [Google Scholar]
- 23.Haller S., Badoud S., Nguyen D., Barnaure I., Montandon M., Lovblad K., Burkhard P. Differentiation between Parkinson disease and other forms of Parkinsonism using support vector machine analysis of susceptibility-weighted imaging (SWI): Initial results. Eur. Radiol. 2013;23:12–19. doi: 10.1007/s00330-012-2579-y. [DOI] [PubMed] [Google Scholar]
- 24.Salvatore C., Cerasa A., Castiglioni I., Gallivanone F., Augimeri A., Lopez M., Arabia G., Morelli M., Gilardi M., Quattrone A. Machine learning on brain MRI data for differential diagnosis of Parkinson’s disease and Progressive Supranuclear Palsy. J. Neurosci. Methods. 2014;222:230–237. doi: 10.1016/j.jneumeth.2013.11.016. [DOI] [PubMed] [Google Scholar]
- 25.Long D., Wang J., Xuan M., Gu Q., Xu X., Kong D., Zhang M. Automatic classification of early Parkinson’s disease with multi-modal MR imaging. PloS ONE. 2012;7:e47714. doi: 10.1371/journal.pone.0047714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Acton P.D., Newberg A. Artificial neural network classifier for the diagnosis of Parkinson’s disease using [99mTc] TRODAT-1 and SPECT. Phys. Med. Biol. 2006;51:3057. doi: 10.1088/0031-9155/51/12/004. [DOI] [PubMed] [Google Scholar]
- 27.Cherubini A., Morelli M., Nisticó R., Salsone M., Arabia G., Vasta R., Augimeri A., Caligiuri M.E., Quattrone A. Magnetic resonance support vector machine discriminates between Parkinson disease and progressive supranuclear palsy. Mov. Disord. 2014;29:266–269. doi: 10.1002/mds.25737. [DOI] [PubMed] [Google Scholar]
- 28.Singh G., Samavedham L. Unsupervised learning based feature extraction for differential diagnosis of neurodegenerative diseases: A case study on early-stage diagnosis of Parkinson disease. J. Neurosci. Methods. 2015;256:30–40. doi: 10.1016/j.jneumeth.2015.08.011. [DOI] [PubMed] [Google Scholar]
- 29.Rana B., Juneja A., Saxena M., Gudwani S., Kumaran S.S., Agrawal R., Behari M. Regions-of-interest based automated diagnosis of Parkinson’s disease using T1-weighted MRI. Expert Syst. Appl. 2015;42:4506–4516. doi: 10.1016/j.eswa.2015.01.062. [DOI] [Google Scholar]
- 30.Huppertz H.J., Möller L., Südmeyer M., Hilker R., Hattingen E., Egger K., Amtage F., Respondek G., Stamelou M., Schnitzler A., et al. Differentiation of neurodegenerative parkinsonian syndromes by volumetric magnetic resonance imaging analysis and support vector machine classification. Mov. Disord. 2016;31:1506–1517. doi: 10.1002/mds.26715. [DOI] [PubMed] [Google Scholar]
- 31.Adeli E., Shi F., An L., Wee C.Y., Wu G., Wang T., Shen D. Joint feature-sample selection and robust diagnosis of Parkinson’s disease from MRI data. NeuroImage. 2016;141:206–219. doi: 10.1016/j.neuroimage.2016.05.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Abós A., Baggio H.C., Segura B., García-Díaz A.I., Compta Y., Martí M.J., Valldeoriola F., Junqué C. Discriminating cognitive status in Parkinson’s disease through functional connectomics and machine learning. Sci. Rep. 2017;7:45347. doi: 10.1038/srep45347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Péran P., Barbagallo G., Nemmi F., Sierra M., Galitzky M., Traon A.P.L., Payoux P., Meissner W.G., Rascol O. MRI supervised and unsupervised classification of Parkinson’s disease and multiple system atrophy. Mov. Disord. 2018;33:600–608. doi: 10.1002/mds.27307. [DOI] [PubMed] [Google Scholar]
- 34.Ariz M., Abad R.C., Castellanos G., Martínez M., Muñoz-Barrutia A., Fernández-Seara M.A., Pastor P., Pastor M.A., Ortiz-de Solórzano C. Dynamic atlas-based segmentation and quantification of Neuromelanin-rich brainstem structures in Parkinson disease. IEEE Trans. Med. Imaging. 2018;38:813–823. doi: 10.1109/TMI.2018.2872852. [DOI] [PubMed] [Google Scholar]
- 35.Billones C.D., Demetria O.J.L.D., Hostallero D.E.D., Naval P.C. Demnet: A convolutional neural network for the detection of Alzheimer’s Disease and Mild Cognitive Impairment; Proceedings of the 2016 IEEE Region 10 Conference (TENCON); Singapore. 22–25 November 2016; pp. 3724–3727. [Google Scholar]
- 36.Dou Q., Chen H., Yu L., Zhao L., Qin J., Wang D., Mok V.C., Shi L., Heng P.A. Automatic detection of cerebral microbleeds from MR images via 3D convolutional neural networks. IEEE Trans. Med. Imaging. 2016;35:1182–1195. doi: 10.1109/TMI.2016.2528129. [DOI] [PubMed] [Google Scholar]
- 37.Marek K., Jennings D., Lasch S., Siderowf A., Tanner C., Simuni T., Coffey C., Kieburtz K., Flagg E., Chowdhury S., et al. The Parkinson progression marker initiative (PPMI) Prog. Neurobiol. 2011;95:629–635. doi: 10.1016/j.pneurobio.2011.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lella K.K., Pja A. Automatic diagnosis of COVID-19 disease using deep convolutional neural network with multi-feature channel from respiratory sound data: Cough, voice, and breath. Alex. Eng. J. 2022;61:1319–1334. doi: 10.1016/j.aej.2021.06.024. [DOI] [Google Scholar]
- 39.Alshmrani G.M.M., Ni Q., Jiang R., Pervaiz H., Elshennawy N.M. A deep learning architecture for multi-class lung diseases classification using chest X-ray (CXR) images. Alex. Eng. J. 2023;64:923–935. doi: 10.1016/j.aej.2022.10.053. [DOI] [Google Scholar]
- 40.Initializers—Keras Documentation. [(accessed on 25 August 2023)]. Available online: https://faroit.com/keras-docs/2.0.6/initializers/
- 41.Glorot X., Bengio Y. Understanding the difficulty of training deep feedforward neural networks; Proceedings of the Aistats; Chia, Italy. 13–15 May 2010; pp. 249–256. [Google Scholar]
- 42.Hinton G.E., Srivastava N., Krizhevsky A., Sutskever I., Salakhutdinov R.R. Improving Neural Networks by Preventing Co-Adaptation of Feature Detectors 2012. [(accessed on 25 August 2023)]; Available online: http://xxx.lanl.gov/abs/1207.0580.
- 43.Bishop C.M. Pattern recognition. Mach. Learn. 2006;128:1–736. [Google Scholar]
- 44.Rana B., Juneja A., Saxena M., Gudwani S., Kumaran S.S., Behari M., Agrawal R. Graph-theory-based spectral feature selection for computer aided diagnosis of Parkinson’s disease using T 1-weighted MRI. Int. J. Imaging Syst. Technol. 2015;25:245–255. doi: 10.1002/ima.22141. [DOI] [Google Scholar]
- 45.Cigdem O., Yilmaz A., Beheshti I., Demirel H. Comparing the performances of PDF and PCA on Parkinson’s disease classification using structural MRI images; Proceedings of the 2018 26th Signal Processing and Communications Applications Conference (SIU); Izmir, Turkey. 2–5 May 2018; pp. 1–4. [Google Scholar]
- 46.Peng B., Zhou Z., Geng C., Tong B., Zhou Z., Zhang T., Dai Y. Computer aided analysis of cognitive disorder in patients with Parkinsonism using machine learning method with multilevel ROI-based features; Proceedings of the 2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI); Datong, China. 15–17 October 2016; pp. 1792–1796. [Google Scholar]
- 47.Pahuja G., Nagabhushan T. A novel GA-ELM approach for parkinson’s disease detection using brain structural T1-weighted MRI data; Proceedings of the 2016 Second International Conference on Cognitive Computing and Information Processing (CCIP); Mysuru, India. 12–13 August 2016; pp. 1–6. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data will be made available on request.