Histogram-Based Features Selection and Volume of Interest Ranking for Brain PET Image Classification

Imene Garali; Mouloud Adel; Salah Bourennane; Eric Guedj

doi:10.1109/JTEHM.2018.2796600

. 2018 Mar 16;6:2100212. doi: 10.1109/JTEHM.2018.2796600

Histogram-Based Features Selection and Volume of Interest Ranking for Brain PET Image Classification

Imene Garali ^1,², Mouloud Adel ^1,^✉, Salah Bourennane ³, Eric Guedj ^2,⁴

PMCID: PMC5881487 PMID: 29637029

Abstract

Positron emission tomography (PET) is a molecular medical imaging modality which is commonly used for neurodegenerative diseases diagnosis. Computer-aided diagnosis, based on medical image analysis, could help quantitative evaluation of brain diseases such as Alzheimer’s disease (AD). A novel method of ranking the effectiveness of brain volume of interest (VOI) to separate healthy control from AD brains PET images is presented in this paper. Brain images are first mapped into anatomical VOIs using an atlas. Histogram-based features are then extracted and used to select and rank VOIs according to the area under curve (AUC) parameter, which produces a hierarchy of the ability of VOIs to separate between groups of subjects. The top-ranked VOIs are then input into a support vector machine classifier. The developed method is evaluated on a local database image and compared to the known selection feature methods. Results show that using AUC outperforms classification results in the case of a two group separation.

Keywords: Machine learning, computer-aided diagnosis, first order statistics, feature selection, positron emission tomography, classification, Alzheimer’s disease

General scheme of region ranking and feature selection for computer-aided diagnosis of Alzheimer's Disease on PET images.

graphic file with name jtehm-gagraphic-2796600.jpg

I. Introduction

Alzheimer Disease (AD) is a degenerative and incurable brain disease which is considered the main cause of dementia in elderly people worldwide. It is expected that 1 in 85 people will be affected by 2050 and the number of affected people will double in the next decades [1]. The diagnosis of this disease is done by clinical, neuroimaging and neuropsychological assessments. Neuroimaging evaluation is based on nonspecific features such as cerebral atrophy, which appears very late in the progression of the disease. Therefore, developing new approaches for early and specific recognition of AD is of crucial importance.

Several imaging biomarkers to identify AD are effective [2], including structural Magnetic Resonance Imaging (MRI) [3]–[7], Single Photon Emission Computed Tomography (SPECT) [8], electroencephalographic EEG rhythms [9], [10] and Positron Emission Tomography (PET) [11], [12]. In addition to these imaging bio-markers, biological biomarkers such as Cerebral Spinal Fluid (CSF) [6] have also been developed for AD diagnosis. It has been shown that PET functional imaging associated with FluoroDeoxyGlucose (18F-FDG) as a tracer is able to increase the confidence of neurodegenerative diagnosis [13]–[16].

Early detection of AD is important because it increases the chances of early treatment and may enable pre-emptive and even preventative therapies in the future.

The clinical diagnosis of AD uses a variety of tests including patient’s family history, physical examination, mini-mental state examination, and/or neuroimaging [17]. Recently, functional neuroimaging technologies, such as the Positron Emission Tomography (PET), has become a powerful tools in the diagnosis of AD since it is now possible to reveal pathophysiological changes before irreversible anatomical changes are present. 18F-Fluoro-deoxyglucose (FDG) is a widely used radioactive tracer in PET imaging. FDG-PET provides useful information about the cerebral glucose metabolic rate.

Studies have demonstrated reduced glucose metabolism in a small number of brain regions in AD patients comparing to normal subjects [18], [19]. As such difference becomes noticeable, researchers are increasingly interested in distinguishing AD patients from normal subjects by utilizing their brain images. As a non-negligible complementary way in the diagnosis of AD, PET imaging has high specificity and sensitivity, even a long period before the full-blown dementia has developed.

Visual evaluation of brain PET images is qualitative and deals with two problems: it is time consuming and operator dependent [20]. Visual method depends greatly on the observer’s experience and lacks a clear cut-off between normal and pathological finding. This has been highlighted in the imaging literature [16]. FDG-PET is today recommended and largely used to contribute to the diagnosis of AD in case of atypical presentations, especially in young patients, to distinguish AD from focal neurodegenerative diseases and from non-neurodegenerative diseases (encephalitis, psychiatric disorders). This evaluation is usually conducted on visual interpretation and depends on the experience of the physician. An automated classification method may contribute to improve the robustness and the reproducibility of the diagnosis across different centres, of course if this evaluation remains compatible with a clinical work-up in terms of time spending and necessary resources. Moreover, in the perspective of future disease modifying therapies in AD, early diagnosis at initial stages of the disease will be required to stop or slow down the disease before the diffusion of the brain lesions, with at this step a greater uncertainty of the visual PET interpretation because of possible limited abnormalities, which may increase the need of computer-aided methods. Computer-Aided Diagnosis (CAD) could bring a valuable tool to quantitatively evaluate brain disease such as AD.

Image processing techniques applied to PET images have been widely used for CAD purposes [21], including feature selection and extraction [12], [22]–[31], segmentation: Fuzzy C-Means [32], Gaussian Mixture Model [33], [34], Dynamic Neuro-Fuzzy technique [35], and K-Means clustering [36] and classification: Random Forest (RF) [23], Support Vector Machine (SVM) [22], [12], [11], [37], [38], K-Nearest Neighbour (KNN) [37], [39] and Neural Networks [25], [40]–[42].

The general scheme of analyzing PET images for classification purposes consists in acquiring and preprocessing images first, then applying a feature selection or extraction technique in order to reduce the amount of redundant information and finally input these chosen feature to a classifier. Many approaches have been used to tackle the problem of feature selection and extraction including Fischer Discriminates Ratio (FDR) [22], Non-Negative Matrix Factorization (NMF) [22], [30], Partial Least squares (PLS) [23], Gaussian Mixture Models (GMM) [12], Wavelets Packet Transform [25], Principal Component Analysis (PCA) [26]–[29] and Independent Component Analysis [31] (ICA).

The curse of dimensionality problem or the small sample size is a well-known problem encountered in neuroimaging where the features used for the classification process are higher than the sample used for the training process. This is the case of the Voxel-Based Analysis (VBA), where the brain is considered as a set of raw voxels, on which some statistical tests Student’s t-Test [43]–[45] or Mann-Whitney-Wilcoxon U-Test [46], [47] are computed and then input to a classifier. A lot of voxels are redundant and irrelevant for classification tasks and therefore feature selection is needed.

Feature selection methods focus on the selection of a subset of features that describes well the input data while reducing effect noise or irrelevant variables and still provides good classification results [48]. They can be broadly classified into three groups: filters, wrapper and embedded methods.

Wrapper methods evaluate the utility of feature subsets using the results of a specified classifier. These methods allow to detect the possible interactions between variables. A search procedure within the possible feature subsets space is done. As the number of subsets grow exponentially with the number of features, a heuristic or a sequential selection algorithms are used for search purposes [49]. The two main disadvantages of these methods are the increasing overfitting risk when the number of observations is insufficient and the significant computation time when the number of variables is large [50], [51]. In the context of AD diagnosis, Chyzhyk et al. [52] used a wrapper method with a genetic algorithm for optimal subset search.

Embedded methods reduce the computational time compared to wrapper methods by incorporating the feature selection in the training process of a classifier. These are very sensitive to the learning algorithm used to set feature subset. Support Vector Machine (SVM) approaches [53] or decision tree [54] algorithms are some examples of embedded methods. This approach has been used for MRI brain classification in the case of AD diagnosis [55].

Filter methods do not depend on any classifier but can be considered as pre-processing steps based on specific criterion to evaluate the relevance of features. One of the main disadvantages of these approaches is that they ignore the interaction between features and hence are unable to remove redundant features. The most proposed techniques are univariate, this means that each feature is considered separately, for instance on mutual information [56], Fisher discriminant ratio [30], [57], Pearson correlation coefficient [58], Laplacian score [59], gain ratio [60], [61], chi-square test [62], Relief [63]. On the other hand, some multivariate methods which can handle both irrelevant and redundant features have been used to take into account dependence between features. Minimal-Redundancy Maximal-Relevance (mRMR) proposed by Peng et al. [64] is a well-known method where the selection of the feature subset is based on mutual information. Other criteria have also been proposed including conditional mutual information [65] or the second approximation of the joint mutual information [66], clustering technique [67], Fast correlation based filter [68], [69] Relevance-redundancy feature selection [70]. More details can be found in a recent feature selection survey [71], in [72] for Alzheimer’s disease and in [73] in the general case.

There has been a growing interest in using the FDG rate for AD/HC classification [74]. Four main groups of methods have been studied: voxels as feature (VAF)-based [75], discriminative voxel selection-based [76], [77], atlas-based [78], [79], and projection-based methods [30].

A lot of radiomic methods exist and have been used on PET images especially for diagnosis, staging, prognosis and therapy responses assessment in oncology. The proposed method, based on statistical features extraction from histograms and described in our paper, derives from the expertise transfer that we obtained, when discussing with nuclear medicine doctors with whom we were working. In neurodegenerative diseases there is no particular tumour region in which a Standardized Uptake Value (SUV) could be computed. The main important features we need to compute, according to doctors, is the anatomical Volumes of Interest (VOI) uptake statistical properties (average, variance, and heterogeneity through skewness and asymmetry). These are the first order statistical moments extracted from regions’ histograms. The information being captured using histograms is the difference of FDG uptake across VOI, as it important for Alzheimer’s Disease to detect reduced glucose metabolism in some VOI in the brain.

In this paper a novel supervised filter-based feature selection is developed. It allows to select the best feature subset and rank them according to the “Area Under Curve” (AUC), a criterion based on Receiver Operating Characteristic (ROC) in the feature space. With a view to applying this approach on PET brain images, a brain mapping using an atlas is applied to segment PET into Volumes-Of-Interest (VOIs). Statistical moments of each VOI’s grey level histogram are computed and using the developed technique it was possible to rank the ability of a VOI or a subset of VOIs, characterized each by a subset feature vector, to distinguish between AD and Healthy Control (HC) brain images. The performance of the proposed method is compared with other well-known feature selection methods in terms of classification success rates using the feature subset selected by each method as an input to a Support Vector Machines (SVM) classifier. The evaluation is done on our own FDG PET image database and results show that the proposed approach is amongst the algorithms with the best results.

The rest of the paper is organized as follows. Section II gives a brief description of the developed approach. The main contribution on feature selection and region ranking is presented in section III. Section IV reports the obtained results, performance evaluation and comparison. Finally, a conclusion and future work are given in section V.

II. Proposed Method Description

A. Overview of the Proposed Method

The flowchart of the proposed method is shown in Fig. 1. As for any CAD system, it consists in several important stages:

1.
18F-FDG-PET data acquisition and preprocessing.
2.
Computed features
3.
Mono-parametric analysis feature selection.
4.
Multi-parametric Volume-Of-Interest (VOI) selection and ranking.
5.
Classification.

FIGURE 1. — Flowchart of the proposed CAD system.

The different stages appearing in this flowchart are detailed in the next sections.

B. Image Database

FDG PET scans were collected from the “La Timone” University Hospital, in the Nuclear Medicine Department (Marseille, France). The database was built up based on imaging studies of subjects that followed the standardized protocol of a hospital-based service. PET scans were performed using an integrated PET/CT camera (Discovery ST, GE Healthcare, Waukesha, USA), with 6.2 mm axial resolution, allowing 47 contiguous transverse sections of the brain of 3.27 mm thickness. 150 MBq of 18FDG were injected intravenously in an awake and resting state, with eyes closed, in a quiet environment. Image acquisition started 30 min after injection and ended 15 min later. Images were reconstructed using ordered subsets expectation maximization algorithm, with 5 iterations and 32 subsets, and corrected for attenuation using CT transmission scan. The local database image enrolled 171 adults 50–90 years of age, including 81 patients with AD and 61 HC and 29 MCI (Mild Cognitive Impairment). Their demographic characteristics are summarized in Table 1. HC were free from neurological/psychiatric disease and cognitive complaints, and had a normal brain MRI. Patients exhibited NINCDS-ADRDA [80] clinical criteria for probable AD.

TABLE 1. Demographic Details of the Dataset.

	HC	AD	MCI
Number	61	81	29
Male/female	(24/37)	(32/49)	(12/17)
Ages (Mean/[Min..Max])	68.18 [50..86]	70. 60 [50..90]	67.55 [50..85]

Open in a new tab

C. Image Preprocessing

Images were spatially normalized because of brain volume variation, shape and position from one patient to another. Spatial normalization consisted in applying deformations on volumes so that the whole-brain statistical analysis was performed at voxel-level using SPM8 software [43] (Wellcome Department of Cognitive Neurology, University College, London, UK). The data were spatially normalized onto the Montreal Neurological Institute atlas (MNI).This method assumes an affine generic model with 12 parameters [22] and a cost function which presents an extreme value when the image and a template that represents our ‘brain space’ correspond to each other. The spatial normalization is achieved by optimizing the quadratic difference between the template and each image. After this spatial normalization, the dimensions of the resulting voxels were Inline graphic mm and the resulting images of voxels size. These images were then smoothed with a Gaussian filter (with different values (0, 4 and 8) of Full Width at Half Maximum (FWHM) to blur individual variations in gyral anatomy and to increase the Signal to Noise Ratio (SNR) [82]. The best FWHM appeared to be 8.

After spatial normalization, intensity normalization was required in order to perform direct images comparison between different subjects. Intensity normalization was necessary since global brain activity varies from one subject to another. Different normalization methods were tested [83] to determine the most appropriate one. The retained method consisted in dividing the intensity level of each voxel by the intensity level mean of the gray-matter global brain VOI.

III. Feature Selection and Volume of Interest Ranking

In this section, we present the main idea of both feature selection and VOI ranking. Brain images are segmented into Inline graphic VOIs according to AAL of WFU-Pickatlas tool, version 2.4. [84]. Each VOI, V_v, is characterized by a set of features . The main question that we want to answer is: how to quantify VOI ability to best distinguish between AD and HC?

Our approach in this work consists in selecting VOIs which best separate between AD and HC classes. Different parameter combinations for each VOI are used to select and rank VOIs according to the “Area Under Curve” (AUC) parameter, defined in the following. The top-ranked VOIs are then introduced into a SVM classifier.

A. Notation

We consider a dataset Inline graphic . A is a 3-D tensor, where denotes the number of parameters computed for each VOI among VOIs and is the number of acquired PET images. Let us consider their corresponding metricized version and ]. As depicts in Fig. 2, each matrix A_i is of dimension and represents all regional parameters values for cortical VOIs for a selected simple Inline graphic . Each column of A_i is defined as volume parameter values computed for all VOIs on the selected simple . A_v is of dimension and represents all simples with parameters values computed on a selected VOI, .

FIGURE 2. — Frontal (A_i) and lateral (A_v) slices of the tensor A that are handled within PET images. (a) A_i matrix of the tensor A. (b) A_v matrix of the tensor A

B. Features Definition

The brain is mapped into 116 cortical VOIs according to AAL of WFU-Pickatlas tool ( Inline graphic ). Five parameters (K = 5) are computed for Each VOI, . The first order statistics and the entropy were extracted from the histogram of each VOI:

where Inline graphic is a gray level value of a voxel belonging to volume V_v and and are the minimum and the maximum gray level values in V_v respectively.

C. Area Under Curve (AUC) Parameter

For each VOI, a set of parameters Inline graphic is computed. In the following, we will use instead of for easier readability. HC and AD subjects are plotted in a N feature space, which represents a subset of , denoted , among subsets. An N-Dimensional sphere (N-D sphere) is created over the group of healthy subjects (HC) (N equals 1 for an interval, 2 for a disk and N is greater than 2 for a sphere). The N-D sphere’s center is the mass center of healthy subjects. At various radii of the N-D sphere, we compute the following parameters:

•
nb_HC_in, number of HC subjects inside the N-D sphere,
•
nb_AD_in, number of subjects with AD inside the N-D sphere.
•
nb_HC_in/nb_HC⁽¹⁾, True Positive Rate (TPR).
•
nb_AD_in/nb_AD⁽²⁾, False Positive Rate (FPR).

⁽¹⁾ number of HC subjects in the database ⁽²⁾ number of AD subjects in the database

The ROC curve is created by plotting the true positives rate (TPR) (i.e. HC subjects well classified) vs. the false positives rate (FPR) (i.e. the AD subjects misclassified) for different radii of the N-D sphere. The AUC is defined as the Area Under ROC Curve (AUC) and is within the range [0, 1]. This AUC quantifies the ability of a given VOI, in a N-D feature space (N≤K) from a subset of K parameters, to separate between groups. This AUC value is all the more near 1 that the subset feature space on which the corresponding VOIs are plotted is able to separate well between AD and HC subjects. Fig. 3 shows the case of ‘Cingulum_Post_Left’ VOI based on three parameters: the mean, the standard deviation and the kurtosis (P₁, P₂ and P₄). The corresponding ROC curve is shown in Fig. 4.

FIGURE 3. — The separation between AD and HC groups relative to the region ’Cingulum_Post_Left with three parameters: the mean, the standard deviation and the kurtosis.

FIGURE 4. — ROC curve obtained for region ’Cingulum_Post_Left using three parameters: the mean, the standard-deviation and the kurtosis.

D. “Combination_Matrix” Analysis

For each VOI, we examine the combination of these parameters (noted P_p) with a length varying from 1 to Inline graphic . Therefore, we start by the combinations of length 1 (), then those of length 2 ({P_j, P_q}, 1 ≤ j,q , ), and so on until reaching the combination length of parameters. Thereby, we create a “Combination_Matrix” of columns.

For each column, which represents a subset Inline graphic _N, of N elements of , we compute the AUC (noted or ) for each VOI, , according to the N-D sphere. The “Combination_Matrix” has then lines and columns.

1)
“Combination_Matrix” 1: mono-parametric analysis Based on the “Combination_Matrix”, VOIs are ranked according to their higher values of AUC. Each VOI is characterized by its own combination parameters N that gives the highest AUC. The set of the top ranked VOIs (associated to the higher values of AUC) are selected to be used in AD/HC classification. This is a mono-parametric analysis, in which the selection of VOIs depends only on the combination of the parameters _N for each VOI.

Table 2 gives, for each VOI, the corresponding subset of parameters for which the highest AUC was obtained. The set of best ranked VOIs obtained using AUC approach (21 are given in Table 3 and shown in Fig. 5) are concordant with a recent review of the 18F-FDG PET literature about the positive diagnosis of AD, with VOIs involving the temporo-parietal cortex, including the precuneus and the adjacent posterior cingulate cortex [85].
2)
“Combination_Matrix” 2: multi-parametric analysis Multi-parametric analysis for “Combination_Matrix” depends on both the combination of the parameters, (subset of ), for each VOI, and the combination of the VOIs (subsets of ). For easier readability, we use the expression instead of . Depending on the “Combination_Matrix” 1, each VOI is characterized by its own combination of parameters, as it is presented in the Table 2. The procedure begins with the choice of the first two combination VOIs depending on all the possible combinations of VOIs of length 2 ({V_j, , 1 ≤ j, , j q). Thereafter, a Sequential Forward Selection (SFS) [86] is used where we add one VOI at each step to a VOI list.

So, we start by the combinations of the two VOIs already selected in the previous step, then those of length 3 ({V_j, V_q, , 1 ≤ j,q,t , j q t), and so on until reaching the combination length of L VOIs. At each step of the iterative procedure, HC and AD subjects are plotted in a new feature space by combination of parameters for that selection of VOIs. The AUC value based on this combination is named Cumulated AUC (CAUC). For easier readability, we use instead of . The VOIs that provide the best CAUC are then added to the final VOIs combination. Our algorithm stops when the same or a lower CAUC is obtained. Then, we select the feature combination with the lowest number of VOIs that achieves the highest CAUC. Fig. 6 shows that the cumulated AUC reaches 0.9071 value when combining V₃ and V₃₁, the AUC of which are respectively 0.8012 and 0.8484. The CAUC reached a limit value when more than 15 VOIs were added. This is depicted in Fig. 7.

TABLE 2. Each VOI is Caracterized By its Own Combination Parameters .


1. ‘Amygdala_L’		59. ‘Insula_L’
2. ‘Amygdala_R’		60. ‘Insula_R’
3. ‘Angular_L’		61. ‘Lingual_L’
4. ‘Angular_R’		62. ‘Lingual_R’
5. ‘Calcarine_L’		63. ‘Occipital_Inf_L’
6. ‘Calcarine_R’		64. ‘Occipital_Inf_R’
7. ‘Caudate_L’		65. ‘Occipital_Mid_L’
8. ‘Caudate_R’		66. ‘Occipital_Mid_R’
9. ‘Cerebelum_10_L’		67. ‘Occipital_Sup_L’
10. ‘Cerebelum_10_R’		68. ‘Occipital_Sup_R’
11. ‘Cerebelum_3_L’		69. ‘Olfactory_L’
12. ‘Cerebelum_3_R’		70. ‘Olfactory_R’
13. ‘Cerebelum_4_5_L’		71. ‘Pallidum_L’
14. ‘Cerebelum_4_5_R’		72. ‘Pallidum_R’
15. ‘Cerebelum_6_L’		73. ‘ParaHippocampal_L’
16. ‘Cerebelum_6_R’		74. ‘ParaHippocampal_R’
17. ‘Cerebelum_7b_L’		75. ‘Paracentral_Lobule_L’
18. ‘Cerebelum_7b_R’		76. ‘Paracentral_Lobule_R’
19. ‘Cerebelum_8_L’		77. ‘Parietal_Inf_L’
20. ‘Cerebelum_8_R’		78. ‘Parietal_Inf_R’
21. ‘Cerebelum_9_L’		79. ‘Parietal_Sup_L’
22. ‘Cerebelum_9_R’		80. ‘Parietal_Sup_R’
23. ‘Cerebelum_Crus1_L’		81. ‘Postcentral_L’
24. ‘Cerebelum_Crus1_R’		82. ‘Postcentral_R’
25. ‘Cerebelum_Crus2_L’		83. ‘Precentral_L’
26. ‘Cerebelum_Crus2_R’		84. ‘Precentral_R’
27. ‘Cingulum_Ant_L’		85. ‘Precuneus_L’
28. ‘Cingulum_Ant_R’		86. ‘Precuneus_R’
29. ‘Cingulum_Mid_L’		87. ‘Putamen_L’
30. ‘Cingulum_Mid_R’		88. ‘Putamen_R’
31. ‘Cingulum_Post_L’		89. ‘Rectus_L’
32. ‘Cingulum_Post_R’		90. ‘Rectus_R’
33. ‘Cuneus_L’		91. ‘Rolandic_Oper_L’
34. ‘Cuneus_R’		92. ‘Rolandic_Oper_R’
35. ‘Frontal_Inf_Oper_L’		93. ‘Supp_Motor_Area_L’
36. ‘Frontal_Inf_Oper_R’		94. ‘Supp_Motor_Area_R’
37. ‘Frontal_Inf_Orb_L’		95. ‘SupraMarginal_L’
38. ‘Frontal_Inf_Orb_R’		96. ‘SupraMarginal_R’
39. ‘Frontal_Inf_Tri_L’		97. ‘Temporal_Inf_L’
40. ‘Frontal_Inf_Tri_R’		98. ‘Temporal_Inf_R’
41. ‘Frontal_Med_Orb’		99. ‘Temporal_Mid_L’
42. ‘Frontal_Med_Orb’		100. ‘Temporal_Mid_R’
43. ‘Frontal_Mid_L’		101. ‘Temporal_Pole_Mid_L’
44. ‘Frontal_Mid_Orb_L’		102. ‘Temporal_Pole_Mid_R’
45. ‘Frontal_Mid_Orb_R’		103. ‘Temporal_Pole_Sup_L’
46. ‘Frontal_Mid_R’		104. ‘Temporal_Pole_Sup_R’
47. ‘Frontal_Sup_L’		105. ‘Temporal_Sup_L’
48. ‘Frontal_Sup_Medial_L’		106. ‘Temporal_Sup_R’
49. ‘Frontal_Sup_Medial_R’		107. ‘Thalamus_L’
50. ‘Frontal_Sup_Orb_L’		108. ‘Thalamus_R’
51. ‘Frontal_Sup_Orb_R’		109. ‘Vermis_10’
52. ‘Frontal_Sup_R’		110. ‘Vermis_1_2’
53. ‘Fusiform_L’		111. ‘Vermis_3’
54. ‘Fusiforrm_R’		112. ‘Vermis_4_5’
55. ‘Heschl_L’		113. ‘Vermis_6’
56. ‘Heschl_R’		114. ‘Vermis_7’
57. ‘Hippocampus_L’		115. ‘Vermis_8’
58. ‘Hippocampus_R’		116. ‘Vermis_9’

Open in a new tab

TABLE 3. The List of the Top Ranked VOIs Selected With the Best Parameters.

Name of the Top ranked VOIs		AUC
‘Cingulum_Post_L’		0.8484
‘Angular_R’		0.8124
‘Cingulum_Post_R ‘		0.8041
‘Angular_L’		0.8013
‘Parietal_Inf_L’		0.7958
‘Temporal_Mid_L’		0.7949
‘Cingulum_Mid_R’		0.7940
‘SupraMarginal_L’		0.7919
‘Precuneus_L’		0.7907
‘Rolandic_Oper_R’		0.7861
‘Temporal_Mid_R’		0.7842
‘Precuneus_R’		0.7809
‘Cerebelum_6_L’		0.7697
‘Parietal_Inf_R’		0.7615
‘Cerebelum_6_R’		0.7542
‘Postcentral_R’		0.7542
‘Temporal_Inf_L’		0.7526
‘Temporal_Sup_L’		0.7476
‘Fusiform_ L’		0.7471
‘Cerebelum_8_L’		0.7449
‘Postcentral_L’		0.7391
…	…	…

Open in a new tab

The AUC denoted Inline graphic is computed for a region depending on the contribution of the parameters .

FIGURE 5. — The top VOIs (21 VOIs) selected are presented on a 3D brain image

FIGURE 6. — ROC curve obtained for V₃, V₃₁ and V₃, V₃₁.

FIGURE 7. — Cumulated AUC value based on the different combination of the VOIs, .

Inline graphic — Cumulated AUC value based on the different combination of the VOIs, .

IV. Classification Results and Discussion

The result of the proposed algorithm is presented and assessed in this section. However, direct comparison with existing work is hard to achieve due to several factors such as the different datasets, sampling methods, and different features used. We focus on the development and performance of an algorithm as a whole, while exploring several potential features. In assessing the proposed framework, we provide performance comparison to other feature selection methods using the same local database. The normalized PET database is selected as input data for the CAD tool. PET image of each subject is of Inline graphic voxels size, which yields voxels. In order to reduce the curse of dimensionality, the 116 VOIs are used first and then the most discriminative VOI among them are then selected. The resulting reduced feature vector obtained from the different PET images is finally taken for classification. In order to validate the effectiveness of the proposed method, we performed experiments using a comparison with the state-of-art of Feature Selection methods (FS) including Student’s t-Test analysis [39], Fisher score [86], [87], Support vector Machine feature elimination (SVM-RFE) [88], feature selection with Random Forest [86], [87], [89], minimum Redundancy Maximum Relevance (mRMR) [64], and ReliefF [63], [90], [91]. The Voxel Based Analysis (VBA), considered as a baseline classification approach is also used for comparison purposes. It consists in inputting to the classifier the whole brain voxels that represent metabolic activity in the gray matter for each subject.

For comparison purposes with feature selection methods but for VBA and Student’s t-Test, a Inline graphic matrix, whose lines are VOIs and column are histogram extracted features defined in section III was used. This matrix was input to each FS method in order to obtain the optimal set of features to be input to the SVM classifier.

A. Classification Results

Classification was performed using a Support vector machines (SVM) classifier. This classifier map pattern vectors to a high-dimensional feature space where a ‘best’ separating hyperplane (the maximal margin hyperplane) is built. SVM maximize the separation margin by the distance between a hyperplane and the closest data samples. A linear kernel is used our study [86], [87]. Several experiments were carried out to evaluate the “Combination_Matrix” feature selection process and the SVM classifier.

The reduced number of subjects (142 patients) suggests the adaptation of a leave-one-out (LOO) strategy as the most suitable for classification validation. It is a technique that iteratively holds out a subject for test, while training the classifier with the remaining subjects, so that each subject is left out once. Therefore 142 training and testing experiments are carried out. Parameter C is used during the training phase and tells how many outliers are taken into account in calculating Support Vectors. For large values of Inline graphic , the optimization will choose a smaller-margin hyperplane. Conversely, a very small value of will cause the optimizer to look for a larger-margin separating hyperplane, even if that hyperplane misclassifies more subjects. A good way to estimate relevant value to be used is to perform it with cross-validation. As a result, the estimation value is fixed to Inline graphic depending upon database. In order to evaluate the developed CAD tool, which depends on the choice of , several values of are used. The soft-margin constant of the optimal hyperplane uses a value range from 10⁻⁷ to .

Subsequently, performance is also tested using a permutation test [92], repeating the classification procedure after randomizing and permuting labels. The p-value is then given by the percentage of runs for which the score obtained is greater than the classification score obtained in the first case. The error of our results were estimated at 0.09% ( Inline graphic -value) for a permutation number equal to 1000.

The results achieved using the proposed methods and the SVM are depicted in Fig. 8 and Fig. 9. It illustrates classification success rate with the different features selection methods outlined in previous sections. These figures show detailed cross-validation average accuracy results as a function of the number of selected VOIs input to the SVM classifier. For the whole Feature selection methods, good classification results (above 80% accurate classification rate) are obtained when inputting a low number of VOIs. When the number of VOIs input to the classifier is more than 20, classification results reach a value that remains constant. There is no way to increase classification results by injecting more VOIs. The best accuracy rate obtained by “Combination_Matrix” 1 equals 95.07% (with 19 VOIs), which is higher than the one obtained using VBA which equals 94.36% (see Table 4). The “Combination_Matrix” 2 achieved better results than all the other methods (96.47%) with a lower number of VOIs which are depicted in Fig. 10 (15 VOIs).

FIGURE 8. — Average Accuracy obtained with SVM classifier varying number of features for different VOI selection analyses and applying LOO cross-validation with estimation value .

FIGURE 9. — Average Accuracy obtained with SVM classifier varying number of features for different VOI selection analyses and applying LOO cross-validation with estimation value .

TABLE 4. Voxel Selection and VOI Selection Analyses To Discriminate Between Groups (AD, HC AND MCI).

Method	LOO cross-validation with a constant .			LOO cross-validation with estimation value .			Computati on Time
Method	AD vs HC	MCI vs AD	MCI vs HC	AD vs HC	MCI vs AD	MCI vs HC	Computati on Time
VBA 182413 (voxels)	94.36%	86.36%	67.77%	94.36%	86.36%	67.77%	a day
T-test	93.66% 28.413 (voxels)	88.18% 32.982 (voxels)	77.77% 3.877 (voxels)	93.66% 28.413 (voxels)	88.18% 32.982 (voxels)	77.77% 3.877 (voxels)	few minutes
116 VOIs (mean)	91.54%	85.45%	68.88%	91.54%	85.45%	68.88%
116 VOIs (5 parameters)	92.95%	87.72%	67.77%	92.95%	87.72%	67.77%
SVM-RFE	88.02% {24} (VOIs)	76.36%	67,77%	91.54 {25} VOIs	80% {15} VOIs	67.77%	few seconds
RandomForest	91.54% {21} (VOIs)	88.18% {8} (VOIs)	76.66% {21} (VOIs)	92.25% {11} VOIs	90% {24} VOIs	77.77% {19} VOIs
mRMR	92.25% {13} (VOIs)	89.09% {11} (VOIs)	67.77%	95.77% {19}VOIs	91.81% {27} VOIs	67.77%
Score Fisher	93.66% {27} (VOIs)	89.09% {21} (VOIs)	74.44% {11} VOIs	93.66% {27} VOIs	91.81% {12} VOIs	76.66% {23} VOIs
ReliefF	90.84% {19} (VOIs)	73.63%	67.77%	97.88% {25}VOIs	87.27%	74.44%
“Combination_Matrix”1	95.07% {20, 21, 22} (VOIs)	88.18% {11} (VOIs)	75.05% {23} (VOIs)	95.07% {20, 21, 22} (VOIs)	89.09% {12} (VOIs)	75.05% {22} (VOIs)
“Combination_Matrix”2	96.47% {15} (VOIs)	-------------------	-------------------	96.47% {14} (VOIs)	-------------------	-------------------

Open in a new tab

FIGURE 10. — 15 VOIs representation of the “Combination Matrix” 2 on a coronal plane (a), on a transverse plane (b) and on a sagittal plane (c).

B. Discussion

A novel approach of feature selection multi-parametric analysis is presented in this paper. The main innovative aspects of this approach rely on the ranking of each anatomical VOI according to its ability to distinguish between groups of subjects. This method could have been applied to any image data set where a feature selection and ranking is needed. It is also able to give a ranking of a set of VOIs according to their ability to separate groups of subjects. Our study explored two methods: voxel selection and VOI selection analysis applied to 18F-FDG PET data, as well as how to use the parameters in the former approach: mono-parametric or multi-parametric one. The results achieved using the proposed methods are depicted in Table 4. In the case of VBA and t-Test, we talk about voxel approach mono-parametric analysis (voxel is characterized by its intensity). In the case of other classical feature selection methods and “Combination_Matrix” 1, we used a VOI selection mono-parametric analysis. Each VOI was characterized with Inline graphic parameters. When, we selected the most discriminant VOIs to discriminate between HC and AD patients, each VOI was characterized by one parameter or a set of parameters computed on it, .

In the context of our study, each feature is either a VOI with a combination of parameters computed on it (Combination Matrix 1) or a combination of VOIs, (Combination_Matrix 2). After selecting the relevant VOIs, the classification performance was tested by varying the features vectors dimension and summarized in Tab IV. Our study reduces considerably feature space and computational time. The number of VOIs was reduced to 15 thanks to “Combination_Matrix” 2 and the computation time was reduced to a few seconds rather than VBA which lasted a day computation time to analyze the whole database.

Even if the main results presented in this paper concern HC versus AD classification, we studied two other possibilities. To evaluate the discrimination of MCI (Mild Cognitive Impairment) and AD, we had only 29 patients. Discrimination of MCI from HC is more difficult in comparison to classification of AD from HC (or AD from MCI). We achieved a low performance in discrimination of these groups due to the small number of samples (67.77% with VBA). The best accuracy achieved for discriminating between MCI and AD was 88.18% with “Combination_Matrix” 1, which needs to be evaluated on a larger database. We notice that our approach increases the classification rate for each pair of groups compared to the VBA classification rate, unlike the various classical feature selection methods. ReliefF performed well in the classification between AD and HC, rather than in the case of MCI vs AD and MCI vs HC. The mRMR achieved high accuracy in the case of AD vs HC and MCI vs AD, whereas in the case of MCI vs HC, mRMR features selection achieved lower accuracy values. The obtained results using “Combination_Matrix” 2 for both “MCI” vs “HC” or “MCI” vs “AD” did not achieve good results. This is due to the small number of MCI patients and to the fact that VOIs, in these cases, are more correlated, which did not allow to choose an optimal set of VOIs. Moreover within MCI group we did not distinguish between progressive MCI (pMCI) and stable MCI (sMCI) due to the low number of subjects. Progressive MCI are subjects that are evolving to an AD state whereas sMCI are subjects that are not going to evolve. The less good results obtained in trying to compare AD to MCI and HC to MCI could be also explained by the fact that AD and pMCI, and also HC and sMCI, are subjects that are hard to distinguish due to the fact that these sub-groups are very close to each other.

VOIs identified and selected in the present work are classically involved in FDG-PET studies of AD, with a bilateral metabolic impairment of bilateral posterior associative cortices, especially of temporo-parietal regions including the precuneus, and also the posterior cingulate cortex [93].

These results show that the proposed classification results can improve the Computer-Aided Diagnosis of AD on PET images. Therefore, the proposed features selection “Combination_Matrix” is very effective providing a good discrimination between groups and in some cases (due to the small number of samples) comparable results with the various classical feature selection methods.

V. Conclusion

In this paper, a novel method for VOIs ranking is developed to better classify brain PET images. This ranking is obtained using ROC curves and quantifies the ability of a VOI to classify HC from AD subjects. A first combination matrix was proposed in order to select relevant features extracted from VOIs then a second combination matrix was able to define best VOIs association for classification purposes.

The Area Under Curve used in this paper could be easily computed for feature selection for any other medical image modality. The evaluation of this parameter was done on a local image database but needs to be evaluated on a larger database.

To go further in the computer aided-diagnosis tasks, other features like texture, gradient computed on VOIs have to be joined to first order statistical parameters in order to enrich information and obtain higher classification results.

Funding Statement

This work was supported in part by DHU-Imaging through the A * MIDEX Project funded by the “Investissements d’Avenir” French Government Program and managed by the French National Research Agency under Grant ANR-11-IDEX-0001-02, the region PACA, and Nicesoft.

References

[1].Brookmeyer R., Johnson E., Ziegler-Graham K., and Arrighi H. M., “Forecasting the global burden of Alzheimer’s disease,” Alzheimer’s Dementia, vol. 3, no. 3, pp. 186–191, Jul. 2007. [DOI] [PubMed] [Google Scholar]
[2].Zhou Q.et al. , “An optimal decisional space for the classification of Alzheimer’s disease and mild cognitive impairment,” IEEE Trans. Biomed. Eng., vol. 61, no. 8, pp. 2245–2253, Aug. 2014. [DOI] [PubMed] [Google Scholar]
[3].Fox N. C. and Schott J. M., “Imaging cerebral atrophy: Normal ageing to Alzheimer’s disease,” Lancet, vol. 363, no. 9406, pp. 392–394, Jan. 2004. [DOI] [PubMed] [Google Scholar]
[4].Walhovd K. B.et al. , “Combining MR imaging, positron-emission tomography, and CSF biomarkers in the diagnosis and prognosis of Alzheimer disease,” Amer. J. Neuroradiol., vol. 31, no. 2, pp. 347–354, Feb. 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
[5].Fan Y., Resnick S. M., Wu X., and Davatzikos C., “Structural and functional biomarkers of prodromal Alzheimer’s disease: A high-dimensional pattern classification study,” Neuroimage, vol. 41, no. 2, pp. 277–285, Jun. 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
[6].Westman E., Muehlboeck J.-S., and Simmons A., “Combining MRI and CSF measures for classification of Alzheimer’s disease and prediction of mild cognitive impairment conversion,” NeuroImage, vol. 62, no. 1, pp. 229–238, 2012. [DOI] [PubMed] [Google Scholar]
[7].Vemuri P.et al. , “MRI and CSF biomarkers in normal, MCI, and AD subjects: Predicting future clinical change,” Neurology, vol. 73, no. 4, pp. 294–301, Jul. 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
[8].Johnson K. A.et al. , “Preclinical prediction of Alzheimer’s disease using SPECT,” Neurology, vol. 50, no. 6, pp. 1563–1571, Jun. 1998. [DOI] [PubMed] [Google Scholar]
[9].Huang C., Wahlund L., Dierks T., Julin P., Winblad B., and Jelic V., “Discrimination of Alzheimer’s disease and mild cognitive impairment by equivalent EEG sources: A cross-sectional and longitudinal study,” Clin. Neurophysiol., vol. 111, no. 11, pp. 1961–1967, Nov. 2000. [DOI] [PubMed] [Google Scholar]
[10].Henderson G.et al. , “Development and assessment of methods for detecting dementia using the human electroencephalogram,” IEEE Trans. Biomed. Eng., vol. 53, no. 8, pp. 1557–1568, Aug. 2006. [DOI] [PubMed] [Google Scholar]
[11].Zhang D., Wang Y., Zhou L., Yuan H., and Shen D., “Multimodal classification of Alzheimer’s disease and mild cognitive impairment,” NeuroImage, vol. 55, no. 3, pp. 856–867, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
[12].Segovia F.et al. , “A comparative study of feature extraction methods for the diagnosis of Alzheimer’s disease using the ADNI database,” Neurocomputing, vol. 75, no. 1, pp. 64–71, 2012. [Google Scholar]
[13].Dubois B.et al. , “Research criteria for the diagnosis of Alzheimer’s disease: Revising the NINCDS-ADRDA criteria,” Lancet. Neurol., vol. 6, no. 8, pp. 734–746, 2007. [DOI] [PubMed] [Google Scholar]
[14].McKhann G. M.et al. , “The diagnosis of dementia due to Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s association workgroups on diagnostic guidelines for Alzheimer’s disease,” Alzheimer’s Dementia, vol. 7, no. 3, pp. 263–269, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
[15].Mosconi L., Berti V., Glodzik L., Pupi A., De Santi S., and de Leon M. J., “Pre-clinical detection of Alzheimer’s disease using FDG-PET, with or without amyloid imaging,” J. Alzheimers Disease, vol. 20, no. 3, pp. 843–854, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
[16].Perani D.et al. , “A survey of FDG- and amyloid-PET imaging in dementia and GRADE analysis,” BioMed. Res. Int., vol. 2014, Mar. 2014, Art. no. 785039 [Online]. Available: http://dx.doi.org/10.1155/2014/785039 [DOI] [PMC free article] [PubMed] [Google Scholar]
[17].Wang X., Nan B., Zhu J., Koeppe R., and Frey K., “Classification of ADNI PET images via regularized 3D functional data analysis,” Biostat., Epidemiol., vol. 1, no. 1, pp. 3–19, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
[18].Hoffman J. M.et al. , “FDG PET imaging in patients with pathologically verified dementia,” J. Nucl. Med., vol. 41, no. 11, pp. 1920–1928, 2000. [PubMed] [Google Scholar]
[19].Langbaum J. B.et al. , “Categorical and correlational analyses of baseline fluorodeoxyglucose positron emission tomography images from the Alzheimer’s disease neuroimaging initiative (ADNI),” Neuroimage, vol. 45, no. 4, pp. 1107–1116, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
[20].Tuszynski T.et al. , “Evaluation of software tools for automated identification of neuroanatomical structures in quantitative -amyloid PET imaging to diagnose Alzheimer’s disease,” Eur. J. Nucl. Med. Mole. Imag., vol. 43, no. 6, pp. 1077–1087, Jun. 2016. [DOI] [PubMed] [Google Scholar]
[21].Mareeswari S. and Jiji G. W., “A survey: Early detection of Alzheimer’s disease using different techniques,” Int. J. Comput. Sci., Appl., vol. 5, no. 1, pp. 27–37, Feb. 2015. [Google Scholar]
[22].Padilla P.et al. , “Analysis of SPECT brain images for the diagnosis of Alzheimer’s disease based on NMF for feature extraction,” Neurosci. Lett., vol. 479, no. 3, pp. 192–196, 2010. [DOI] [PubMed] [Google Scholar]
[23].Ramírez J.et al. , “Computer aided diagnosis system for the Alzheimer’s disease based on partial least squares and random forest SPECT image classification,” Neurosci. Lett., vol. 472, no. 2, pp. 99–103, 2010. [DOI] [PubMed] [Google Scholar]
[24].Fan M.et al. , “Identification of conversion from mild cognitive impairment to Alzheimer’s disease using multivariate predictors,” PLoS ONE, vol. 6, no. 7, p. e21896, Jul. 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
[25].Aiswarya V. S. and Jemimah S., “Diagnosis of Alzheimer’s disease in brain images using pulse coupled neural network,” Int. J. Innov. Technol. Exploring Eng., vol. 2, no. 6, pp. 2278–3075, May 2013. [Google Scholar]
[26].Illán I. A.et al. , “18F-FDG PET imaging analysis for computer aided Alzheimer’s diagnosis,” Inf. Sci., vol. 181, no. 4, pp. 903–916, 2011. [Google Scholar]
[27].Makeig S., Bell A. J., Jung T. P., and Sejnowski T. J., “Independent component analysis of electroencephalographic data,” in Advances in Neural Information Processing Systems, vol. 8 Cambridge, MA, USA: MIT Press, 1996, pp. 145–151. [Google Scholar]
[28].Segovia F., Bastin C., Salmon E., Górriz J. M., Ramírez J., and Phillips C., “Combining PET images and neuropsychological test data for automatic diagnosis of Alzheimer’s disease,” PLoS ONE, vol. 9, no. 2, p. e8868, Feb. 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
[29].Dessouky M. M., Elrashidy M. A., and Abdelkader H. M., “Selecting and extracting effective features for automated diagnosis of Alzheimer’s disease,” Int. J. Comput. Appl., vol. 81, no. 4, pp. 17–28, Nov. 2013. [Google Scholar]
[30].Padilla P., López M., Górriz J. M., Ramírez J., Salas-Gonzalez D., and Álvarez I., “NMF-SVM based CAD tool applied to functional brain images for the diagnosis of Alzheimer’s disease,” IEEE Trans. Med. Imag., vol. 31, no. 2, pp. 207–216, Feb. 2012. [DOI] [PubMed] [Google Scholar]
[31].Martínez-Murcia F. J., Górriz J. M., Ramírez J., Puntonet C. G., and Illán I. A., “Functional activity maps based on significance measures and independent component analysis,” Comput. Methods Programs Biomed., vol. 111, no. 1, pp. 255–268, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
[32].Selvy P. T., Palanisamy V., and Radhai M. S., “A proficient clustering technique to detect CSF level in MRI brain images using PSO algorithm,” WSEAS Trans. Comput., vol. 7, no. 7, pp. 298–308, Jul. 2013. [Google Scholar]
[33].Greenspan H., Ruf A., and Goldberger J., “Constrained Gaussian mixture model framework for automatic segmentation of MR brain images,” IEEE Trans. Med. Imag., vol. 25, no. 9, pp. 1233–1245, Sep. 2006. [DOI] [PubMed] [Google Scholar]
[34].Li Perneczky R. R., Yakushev I., Förster S., Kurz A., Drzezga A., and Kramer S., “Gaussian mixture models a nd model selection for [18F] fluorodeoxyglucose positron emission tomography classification in Alzheimer’s disease,” PLoS ONE, vol. 10, no. 4, p. e0122731, Apr. 2015, doi: 10.1371/journal.pone.0122731. [DOI] [PMC free article] [PubMed] [Google Scholar]
[35].Hussain S. J., Savithri T. S., and Devi P. V. S., “Segmentation of tissues in brain MRI images using dynamic neuro-fuzzy technique,” Int. J. Soft Comput. Eng., vol. 1, no. 6, pp. 2231–2307, Jan. 2012. [Google Scholar]
[36].Meena A. and Raja K., “Segmentation of Alzheimer’s disease in PET scan datasets using MATLAB,” Int. J. Comput. Appl., vol. 57, no. 10, pp. 15–19, Nov. 2012. [Google Scholar]
[37].Abidi A., Tufail A. B., Siddiqui A. M., and Younis M. S., “Automatic classication of initial categories of Alzheimer’s disease from structural MRI phase images: A comparison of PSVM, KNN and ANN methods,” Int. J. Biomed. Biol. Eng., vol. 6, no. 12, pp. 713–717, 2012. [Google Scholar]
[38].Dessouky M. M., Elrashidy M. A., Taha E. T., and Abdelkader H. M., “Dimension and complexity study for Alzheimer’s disease feature extraction,” Int. J. Eng. Comput. Sci., vol. 3, no. 7, pp. 7132–7137, Jul. 2014. [Google Scholar]
[39].Long X. and Wyatt C., “An automatic unsupervised classification of MR images in Alzheimer’s disease,” in Proc. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2010, pp. 2910–2917. [Google Scholar]
[40].Hagan M. T. and Menhaj M. B., “Training feedforward networks with the Marquardt algorithm,” IEEE Trans. Neural Netw., vol. 5, no. 6, pp. 989–993, Nov. 1994. [DOI] [PubMed] [Google Scholar]
[41].Rumelhart D. E., Hinton G. E., and Williams R. J., “Learning representations by back-propagating errors,” Nature, vol. 323, pp. 533–536, Oct. 1986. [Google Scholar]
[42].Yang S.-T.et al. , “Discrimination between Alzheimer’s disease and mild cognitive impairment using SOM and PSO-SVM,” Comput. Math. Methods Med., vol. 2013, Apr. 2013, Art. no. 253670. [DOI] [PMC free article] [PubMed] [Google Scholar]
[43].Friston K., Ashburner J., Kiebel S., Nichols T., and Penny W., Statistical Parametric Mapping: The Analysis of Functional Brain Images. Amsterdam, The Netherlands: Academic, 2007. [Google Scholar]
[44].Salas-González D.et al. , “Feature selection using factor analysis for Alzheimer’s diagnosis using F-FDG pet images,” Med. Phys., vol. 37, no. 11, pp. 6084–6095, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
[45].Stoeckel J., Ayache N., Malandain G., Koulibaly P. M., Ebmeier K. P., and Darcourt J., “Automatic classification of SPECT images of Alzheimer’s disease patients and control subjects,” in Medical Image Computing and Computer-Assisted Intervention—MICCAI (Lecture Notes in Computer Science), vol. 3217 Heidelberg, Germany: Springer, 2004, pp. 654–662. [Google Scholar]
[46].Salas-Gonzalez D.et al. , “Selecting regions of interest in SPECT images using Wilcoxon test for the diagnosis of Alzheimer’s disease,” in Proc. 5th Int. Conf. HAIS, San Sebastián, Spain, Jun. 2010, pp. 446–451. [Google Scholar]
[47].Zhao W., Wu C., Yin K., Young T. Y., and Ginsberg M. D., “Pixel-based statistical analysis by a 3D clustering approach: Application to autoradiographic images,” Comput. Methods Programs Biomed., vol. 83, no. 1, pp. 18–28, 2006. [DOI] [PubMed] [Google Scholar]
[48].Guyon I. and Elisseeff A., “An introduction to variable and feature selection,” J. Mach. Learn. Res., vol. 3, pp. 1157–1182, Jan. 2003. [Google Scholar]
[49].Chandrashekar G. and Sahin F., “A survey on feature selection methods,” Comput. Elect. Eng., vol. 40, no. 1, pp. 16–28, 2014. [Google Scholar]
[50].Kohavi R. and John G. H., “Wrappers for feature subset selection,” Artif. Intell., vol. 97, no. 1, pp. 273–324, 1997. [Google Scholar]
[51].Inza I., Larrañaga P., Blanco R., and Cerrolaza A. J., “Filter versus wrapper gene selection approaches in DNA micro array domains,” Artif. Intell. Med., vol. 31, no. 2, pp. 91–103, 2004. [DOI] [PubMed] [Google Scholar]
[52].Chyzhyk D., Savio A., and Graña M., “Evolutionary ELM wrapper feature selection for Alzheimer’s disease CAD on anatomical brain MRI,” Neurocomputing, vol. 128, pp. 73–80, Mar. 2013. [Google Scholar]
[53].Weston J., Elisseeff A., Schiölkopf B., and Tipping M., “Use of the zero norm with linear models and kernel methods,” J. Mach. Learn. Res., vol. 3, pp. 1439–1461, Mar. 2003. [Google Scholar]
[54].Breiman L., Friedman J., Olshen R., and Stone C., Classification and Regression Trees. Monterey, CA, USA: Wadsworth and Brooks, 1984. [Google Scholar]
[55].Casanova R.et al. , “High dimensional classification of structural MRI Alzheimer’s disease data based on large scale regularization,” Front. Neuroinform., vol. 5, no. 22, pp. 1–9, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
[56].Morgado P. M., Silveira M., and Marques J. S., “Diagnosis of Alzheimer’s disease using 3D local binary patterns,” Comput. Methods Biomech. Biomed. Eng. Imag. Vis., vol. 1, no. 1, pp. 2–12, 2013. [Google Scholar]
[57].Chaves R.et al. , “SVM-based computer-aided diagnosis of the Alzheimer’s disease using t-test NMSE feature selection with feature correlation weighting,” Neurosci. Lett., vol. 461, no. 3, pp. 293–297, 2009. [DOI] [PubMed] [Google Scholar]
[58].Bicacro E., Silveira M., and Marques J. S., “Alternative feature extraction methods in 3D brain image-based diagnosis of Alzheimer’s disease,” in Proc. 9th IEEE Int. Conf. Image Process. (ICIP), Sep./Oct. 2012, pp. 1237–1240. [Google Scholar]
[59].He X., Cai D., and Niyogi P., “Laplacian score for feature selection,” in Proc. Adv. Neural Inf. Process. Syst., 2005, pp. 507–514. [Google Scholar]
[60].Mitchell T. M., Machine Learning. New York, NY, USA: McGraw-Hill, 1997. [Google Scholar]
[61].Quinlan J. R., “Induction of decision trees,” Mach. Learn., vol. 1, no. 1, pp. 81–106, 1986. [Google Scholar]
[62].Yang J., Liu Y., Liu Z., Zhu X., and Zhang X., “A new feature selection algorithm based on binomial hypothesis testing for spam filtering,” Knowl.-Based-Syst., vol. 24, no. 1, pp. 904–914, 2011. [Google Scholar]
[63].Kira K. and Rendell L. A., “The feature selection problem: Traditional methods and a new algorithm,” in Proc. 10th Nat. Conf. Artif. Intell., 1992, pp. 129–134. [Google Scholar]
[64].Peng H., Long F., and Ding C., “Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 8, pp. 1226–1238, Aug. 2005. [DOI] [PubMed] [Google Scholar]
[65].Fleuret F., “Fast binary feature selection with conditional mutual information,” J. Mach. Learn. Res., vol. 5, pp. 1531–1555, Nov. 2004. [Google Scholar]
[66].Guo B. and Nixon M. S., “Gait feature subset selection by mutual information,” IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 39, no. 1, pp. 36–46, Jan. 2009. [Google Scholar]
[67].Mitra P., Murthy C. A., and Pal S. K., “Unsupervised feature selection using feature similarity,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 3, pp. 301–312, Mar. 2002. [Google Scholar]
[68].Yu L. and Liu H., “Feature selection for high-dimensional data: A fast correlation-based filter solution,” in Proc. 20th Int. Conf. Mach. Learn., 2003, pp. 856–863. [Google Scholar]
[69].Yu L. and Liu H., “Efficient feature selection via analysis of relevance and redundancy,” J. Mach. Learn. Res., vol. 5, pp. 1205–1224, Oct. 2004. [Google Scholar]
[70].Ferreira A. J. and Figueiredo M. A. T., “An unsupervised approach to feature discretization and selection,” Pattern Recognit., vol. 45, no. 9, pp. 3048–3060, Sep. 2012. [Google Scholar]
[71].Chandrashekar G. and Sahin F., “A survey on feature selection methods,” Comput. Elect. Eng., vol. 40, no. 9, pp. 16–28, 2014. [Google Scholar]
[72].Morgado P. M. and Silveira M., “Minimal neighborhood redundancy maximal relevance: Application to the diagnosis of Alzheimer’s disease,” Neurocomputing, vol. 155, pp. 230–295, May 2015. [Google Scholar]
[73].Tabakhi S. and Moradi P., “Relevance–redundancy feature selection based on ant colony optimization,” Pattern Recognit., vol. 48, no. 9, pp. 2798–2811, 2015. [Google Scholar]
[74].Rathore S., Habes M., Iftikhar M. A., Shacklett A., and Davatzikos C., “A review on neuroimaging-based classification studies and associated feature extraction methods for Alzheimer’s disease and its prodromal stages,” NeuroImage, vol. 155, pp. 530–548, Jul. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
[75].Hinrichs C., Singh V., Mukherjee L., Xu G., Chung M. K., and Johnson S. C., “Spatially augmented LPboosting for AD classification with evaluations on the ADNI dataset,” Neuroimage, vol. 48, no. 1, pp. 138–149, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
[76].Cabral C., Morgado P. M., Costa D. C., and Silveira M., “Predicting conversion from MCI to AD with FDG-PET brain images at different prodromal stages,” Comput. Biol. Med., vol. 58, pp. 101–109, Mar. 2015. [DOI] [PubMed] [Google Scholar]
[77].Eskildsen S. F., Coupé P., García-Lorenzo D., Fonov V., Pruessner J. C., and Collins D. L., “Prediction of Alzheimer’s disease in subjects with mild cognitive impairment from the ADNI cohort using patterns of cortical thinning,” Neuroimage, vol. 65, pp. 511–521, Jan. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
[78].Pagani M.et al. , “Volume of interest-based [18F] fluorodeoxyglucose PET discriminates MCI converting to Alzheimer’s disease from healthy controls. A European Alzheimer’s disease consortium (EADC) study,” Neuroimage Clin., vol. 7, pp. 34–42, Jan. 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
[79].Gray K. R., Wolz R., Heckemann R. A., Aljabar P., Hammers A., and Rueckert D., “Multi-region analysis of longitudinal FDG-PET for the classification of Alzheimer’s disease,” Neuroimage, vol. 60, no. 1, pp. 221–229, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
[80].McKhann G., Drachman D., Folstein M., Katzman R., Price D., and Stadlan E. M., “Clinical diagnosis of Alzheimer’s disease report of the NINCDS-ADRDA work group under the auspices of department of health and human services task force on Alzheimer’s disease,” Neurology, vol. 34, no. 7, pp. 939–944, Jul. 1984. [DOI] [PubMed] [Google Scholar]
[81].Woods R. P., Grafton S. T., Holmes C. J., Cherry S. R., and Mazziotta J. C., “Automated image registration: I. General methods and intrasubject, intramodality validation,” J. Comput. Assist. Tomograp., vol. 22, no. 1, pp. 139–152, 1998. [DOI] [PubMed] [Google Scholar]
[82].Maldjian J. A., Laurienti P. J., Kraft R. A., and Burdette J. H., “An automated method for neuroanatomic and cytoarchitectonic atlas-based interrogation of fMRI data sets,” NeuroImage, vol. 19, no. 3, pp. 1233–1239, 2003. [DOI] [PubMed] [Google Scholar]
[83].Garali I., Adel M., Takerkart S., Bourennane S., and Guedj E., “Brain region of interest selection for 18FDG positrons emission tomography computer-aided image classification,” in Proc. IEEE Int. Conf. Image Process. Theory Appl. (IPTA), Paris, France, Oct. 2014, pp. 1–5. [Google Scholar]
[84].Woods R. P., Spatial Transformation Models. San Diego, CA, USA: Academic, 2000, ch. 29, pp. 465–490. [Google Scholar]
[85].Bohnen N. I., Djang D. S., Herholz K., Anzai Y., and Minoshima S., “Effectiveness and safety of 18F-FDG PET in the evaluation of dementia: A review of the recent literature,” J. Nucl. Med., vol. 53, no. 1, pp. 59–71, 2012. [DOI] [PubMed] [Google Scholar]
[86].Webb A. R. and Copsey K. D., Statistical Pattern Recognition, 3rd ed. Chichester, U.K.: Wiley, 2011. [Google Scholar]
[87].Duda R. O., Hart P. E., and Stork D. G., Pattern Classification, 2nd ed. New York, NY, USA: Wiley, 2001. [Google Scholar]
[88].Guyon I., Weston J., Barnhill S., and Vapnik V., “Gene selection for cancer classification using support vector machines,” Mach. Learn., vol. 46, nos. 1–3, pp. 389–422, 2002. [Google Scholar]
[89].Liaw A. and Wiener M., “Classification and regression by random forest,” R Newslett., vol. 2, no. 3, pp. 18–22, 2002. [Google Scholar]
[90].Robnik-Šikonja M. and Kononenko I., “Theoretical and empirical analysis of ReliefF and RReliefF,” Mach. Learn. J., vol. 53, nos. 1–2, pp. 23–69, Oct. 2003. [Google Scholar]
[91].Kononenko I., “Estimating attributes: Analysis and extensions of relief,” in Proc. Eur. Conf. Mach. Learn. (ECML), vol. 784 1994, pp. 171–182. [Google Scholar]
[92].Nichols T. E. and Holmes A. P., “Nonparametric permutation tests for functional neuroimaging: A primer with examples,” Hum. Brain Mapping, vol. 15, no. 1, pp. 1–25, 2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
[93].Koric L.et al. , “Molecular imaging in the diagnosis of Alzheimer’s disease and related disorders,” (Paris), Rev. Neurol., vol. 172, no. 12, pp. 725–734, Dec. 2016. [DOI] [PubMed] [Google Scholar]

[ref1] [1].Brookmeyer R., Johnson E., Ziegler-Graham K., and Arrighi H. M., “Forecasting the global burden of Alzheimer’s disease,” Alzheimer’s Dementia, vol. 3, no. 3, pp. 186–191, Jul. 2007. [DOI] [PubMed] [Google Scholar]

[ref2] [2].Zhou Q.et al. , “An optimal decisional space for the classification of Alzheimer’s disease and mild cognitive impairment,” IEEE Trans. Biomed. Eng., vol. 61, no. 8, pp. 2245–2253, Aug. 2014. [DOI] [PubMed] [Google Scholar]

[ref3] [3].Fox N. C. and Schott J. M., “Imaging cerebral atrophy: Normal ageing to Alzheimer’s disease,” Lancet, vol. 363, no. 9406, pp. 392–394, Jan. 2004. [DOI] [PubMed] [Google Scholar]

[ref4] [4].Walhovd K. B.et al. , “Combining MR imaging, positron-emission tomography, and CSF biomarkers in the diagnosis and prognosis of Alzheimer disease,” Amer. J. Neuroradiol., vol. 31, no. 2, pp. 347–354, Feb. 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref5] [5].Fan Y., Resnick S. M., Wu X., and Davatzikos C., “Structural and functional biomarkers of prodromal Alzheimer’s disease: A high-dimensional pattern classification study,” Neuroimage, vol. 41, no. 2, pp. 277–285, Jun. 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref6] [6].Westman E., Muehlboeck J.-S., and Simmons A., “Combining MRI and CSF measures for classification of Alzheimer’s disease and prediction of mild cognitive impairment conversion,” NeuroImage, vol. 62, no. 1, pp. 229–238, 2012. [DOI] [PubMed] [Google Scholar]

[ref7] [7].Vemuri P.et al. , “MRI and CSF biomarkers in normal, MCI, and AD subjects: Predicting future clinical change,” Neurology, vol. 73, no. 4, pp. 294–301, Jul. 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref8] [8].Johnson K. A.et al. , “Preclinical prediction of Alzheimer’s disease using SPECT,” Neurology, vol. 50, no. 6, pp. 1563–1571, Jun. 1998. [DOI] [PubMed] [Google Scholar]

[ref9] [9].Huang C., Wahlund L., Dierks T., Julin P., Winblad B., and Jelic V., “Discrimination of Alzheimer’s disease and mild cognitive impairment by equivalent EEG sources: A cross-sectional and longitudinal study,” Clin. Neurophysiol., vol. 111, no. 11, pp. 1961–1967, Nov. 2000. [DOI] [PubMed] [Google Scholar]

[ref10] [10].Henderson G.et al. , “Development and assessment of methods for detecting dementia using the human electroencephalogram,” IEEE Trans. Biomed. Eng., vol. 53, no. 8, pp. 1557–1568, Aug. 2006. [DOI] [PubMed] [Google Scholar]

[ref11] [11].Zhang D., Wang Y., Zhou L., Yuan H., and Shen D., “Multimodal classification of Alzheimer’s disease and mild cognitive impairment,” NeuroImage, vol. 55, no. 3, pp. 856–867, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref12] [12].Segovia F.et al. , “A comparative study of feature extraction methods for the diagnosis of Alzheimer’s disease using the ADNI database,” Neurocomputing, vol. 75, no. 1, pp. 64–71, 2012. [Google Scholar]

[ref13] [13].Dubois B.et al. , “Research criteria for the diagnosis of Alzheimer’s disease: Revising the NINCDS-ADRDA criteria,” Lancet. Neurol., vol. 6, no. 8, pp. 734–746, 2007. [DOI] [PubMed] [Google Scholar]

[ref14] [14].McKhann G. M.et al. , “The diagnosis of dementia due to Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s association workgroups on diagnostic guidelines for Alzheimer’s disease,” Alzheimer’s Dementia, vol. 7, no. 3, pp. 263–269, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref15] [15].Mosconi L., Berti V., Glodzik L., Pupi A., De Santi S., and de Leon M. J., “Pre-clinical detection of Alzheimer’s disease using FDG-PET, with or without amyloid imaging,” J. Alzheimers Disease, vol. 20, no. 3, pp. 843–854, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref16] [16].Perani D.et al. , “A survey of FDG- and amyloid-PET imaging in dementia and GRADE analysis,” BioMed. Res. Int., vol. 2014, Mar. 2014, Art. no. 785039 [Online]. Available: http://dx.doi.org/10.1155/2014/785039 [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref17] [17].Wang X., Nan B., Zhu J., Koeppe R., and Frey K., “Classification of ADNI PET images via regularized 3D functional data analysis,” Biostat., Epidemiol., vol. 1, no. 1, pp. 3–19, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref18] [18].Hoffman J. M.et al. , “FDG PET imaging in patients with pathologically verified dementia,” J. Nucl. Med., vol. 41, no. 11, pp. 1920–1928, 2000. [PubMed] [Google Scholar]

[ref19] [19].Langbaum J. B.et al. , “Categorical and correlational analyses of baseline fluorodeoxyglucose positron emission tomography images from the Alzheimer’s disease neuroimaging initiative (ADNI),” Neuroimage, vol. 45, no. 4, pp. 1107–1116, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref20] [20].Tuszynski T.et al. , “Evaluation of software tools for automated identification of neuroanatomical structures in quantitative -amyloid PET imaging to diagnose Alzheimer’s disease,” Eur. J. Nucl. Med. Mole. Imag., vol. 43, no. 6, pp. 1077–1087, Jun. 2016. [DOI] [PubMed] [Google Scholar]

[ref21] [21].Mareeswari S. and Jiji G. W., “A survey: Early detection of Alzheimer’s disease using different techniques,” Int. J. Comput. Sci., Appl., vol. 5, no. 1, pp. 27–37, Feb. 2015. [Google Scholar]

[ref22] [22].Padilla P.et al. , “Analysis of SPECT brain images for the diagnosis of Alzheimer’s disease based on NMF for feature extraction,” Neurosci. Lett., vol. 479, no. 3, pp. 192–196, 2010. [DOI] [PubMed] [Google Scholar]

[ref23] [23].Ramírez J.et al. , “Computer aided diagnosis system for the Alzheimer’s disease based on partial least squares and random forest SPECT image classification,” Neurosci. Lett., vol. 472, no. 2, pp. 99–103, 2010. [DOI] [PubMed] [Google Scholar]

[ref24] [24].Fan M.et al. , “Identification of conversion from mild cognitive impairment to Alzheimer’s disease using multivariate predictors,” PLoS ONE, vol. 6, no. 7, p. e21896, Jul. 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref25] [25].Aiswarya V. S. and Jemimah S., “Diagnosis of Alzheimer’s disease in brain images using pulse coupled neural network,” Int. J. Innov. Technol. Exploring Eng., vol. 2, no. 6, pp. 2278–3075, May 2013. [Google Scholar]

[ref26] [26].Illán I. A.et al. , “18F-FDG PET imaging analysis for computer aided Alzheimer’s diagnosis,” Inf. Sci., vol. 181, no. 4, pp. 903–916, 2011. [Google Scholar]

[ref27] [27].Makeig S., Bell A. J., Jung T. P., and Sejnowski T. J., “Independent component analysis of electroencephalographic data,” in Advances in Neural Information Processing Systems, vol. 8 Cambridge, MA, USA: MIT Press, 1996, pp. 145–151. [Google Scholar]

[ref28] [28].Segovia F., Bastin C., Salmon E., Górriz J. M., Ramírez J., and Phillips C., “Combining PET images and neuropsychological test data for automatic diagnosis of Alzheimer’s disease,” PLoS ONE, vol. 9, no. 2, p. e8868, Feb. 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref29] [29].Dessouky M. M., Elrashidy M. A., and Abdelkader H. M., “Selecting and extracting effective features for automated diagnosis of Alzheimer’s disease,” Int. J. Comput. Appl., vol. 81, no. 4, pp. 17–28, Nov. 2013. [Google Scholar]

[ref30] [30].Padilla P., López M., Górriz J. M., Ramírez J., Salas-Gonzalez D., and Álvarez I., “NMF-SVM based CAD tool applied to functional brain images for the diagnosis of Alzheimer’s disease,” IEEE Trans. Med. Imag., vol. 31, no. 2, pp. 207–216, Feb. 2012. [DOI] [PubMed] [Google Scholar]

[ref31] [31].Martínez-Murcia F. J., Górriz J. M., Ramírez J., Puntonet C. G., and Illán I. A., “Functional activity maps based on significance measures and independent component analysis,” Comput. Methods Programs Biomed., vol. 111, no. 1, pp. 255–268, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref32] [32].Selvy P. T., Palanisamy V., and Radhai M. S., “A proficient clustering technique to detect CSF level in MRI brain images using PSO algorithm,” WSEAS Trans. Comput., vol. 7, no. 7, pp. 298–308, Jul. 2013. [Google Scholar]

[ref33] [33].Greenspan H., Ruf A., and Goldberger J., “Constrained Gaussian mixture model framework for automatic segmentation of MR brain images,” IEEE Trans. Med. Imag., vol. 25, no. 9, pp. 1233–1245, Sep. 2006. [DOI] [PubMed] [Google Scholar]

[ref34] [34].Li Perneczky R. R., Yakushev I., Förster S., Kurz A., Drzezga A., and Kramer S., “Gaussian mixture models a nd model selection for [18F] fluorodeoxyglucose positron emission tomography classification in Alzheimer’s disease,” PLoS ONE, vol. 10, no. 4, p. e0122731, Apr. 2015, doi: 10.1371/journal.pone.0122731. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref35] [35].Hussain S. J., Savithri T. S., and Devi P. V. S., “Segmentation of tissues in brain MRI images using dynamic neuro-fuzzy technique,” Int. J. Soft Comput. Eng., vol. 1, no. 6, pp. 2231–2307, Jan. 2012. [Google Scholar]

[ref36] [36].Meena A. and Raja K., “Segmentation of Alzheimer’s disease in PET scan datasets using MATLAB,” Int. J. Comput. Appl., vol. 57, no. 10, pp. 15–19, Nov. 2012. [Google Scholar]

[ref37] [37].Abidi A., Tufail A. B., Siddiqui A. M., and Younis M. S., “Automatic classication of initial categories of Alzheimer’s disease from structural MRI phase images: A comparison of PSVM, KNN and ANN methods,” Int. J. Biomed. Biol. Eng., vol. 6, no. 12, pp. 713–717, 2012. [Google Scholar]

[ref38] [38].Dessouky M. M., Elrashidy M. A., Taha E. T., and Abdelkader H. M., “Dimension and complexity study for Alzheimer’s disease feature extraction,” Int. J. Eng. Comput. Sci., vol. 3, no. 7, pp. 7132–7137, Jul. 2014. [Google Scholar]

[ref39] [39].Long X. and Wyatt C., “An automatic unsupervised classification of MR images in Alzheimer’s disease,” in Proc. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2010, pp. 2910–2917. [Google Scholar]

[ref40] [40].Hagan M. T. and Menhaj M. B., “Training feedforward networks with the Marquardt algorithm,” IEEE Trans. Neural Netw., vol. 5, no. 6, pp. 989–993, Nov. 1994. [DOI] [PubMed] [Google Scholar]

[ref41] [41].Rumelhart D. E., Hinton G. E., and Williams R. J., “Learning representations by back-propagating errors,” Nature, vol. 323, pp. 533–536, Oct. 1986. [Google Scholar]

[ref42] [42].Yang S.-T.et al. , “Discrimination between Alzheimer’s disease and mild cognitive impairment using SOM and PSO-SVM,” Comput. Math. Methods Med., vol. 2013, Apr. 2013, Art. no. 253670. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref43] [43].Friston K., Ashburner J., Kiebel S., Nichols T., and Penny W., Statistical Parametric Mapping: The Analysis of Functional Brain Images. Amsterdam, The Netherlands: Academic, 2007. [Google Scholar]

[ref44] [44].Salas-González D.et al. , “Feature selection using factor analysis for Alzheimer’s diagnosis using F-FDG pet images,” Med. Phys., vol. 37, no. 11, pp. 6084–6095, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref45] [45].Stoeckel J., Ayache N., Malandain G., Koulibaly P. M., Ebmeier K. P., and Darcourt J., “Automatic classification of SPECT images of Alzheimer’s disease patients and control subjects,” in Medical Image Computing and Computer-Assisted Intervention—MICCAI (Lecture Notes in Computer Science), vol. 3217 Heidelberg, Germany: Springer, 2004, pp. 654–662. [Google Scholar]

[ref46] [46].Salas-Gonzalez D.et al. , “Selecting regions of interest in SPECT images using Wilcoxon test for the diagnosis of Alzheimer’s disease,” in Proc. 5th Int. Conf. HAIS, San Sebastián, Spain, Jun. 2010, pp. 446–451. [Google Scholar]

[ref47] [47].Zhao W., Wu C., Yin K., Young T. Y., and Ginsberg M. D., “Pixel-based statistical analysis by a 3D clustering approach: Application to autoradiographic images,” Comput. Methods Programs Biomed., vol. 83, no. 1, pp. 18–28, 2006. [DOI] [PubMed] [Google Scholar]

[ref48] [48].Guyon I. and Elisseeff A., “An introduction to variable and feature selection,” J. Mach. Learn. Res., vol. 3, pp. 1157–1182, Jan. 2003. [Google Scholar]

[ref49] [49].Chandrashekar G. and Sahin F., “A survey on feature selection methods,” Comput. Elect. Eng., vol. 40, no. 1, pp. 16–28, 2014. [Google Scholar]

[ref50] [50].Kohavi R. and John G. H., “Wrappers for feature subset selection,” Artif. Intell., vol. 97, no. 1, pp. 273–324, 1997. [Google Scholar]

[ref51] [51].Inza I., Larrañaga P., Blanco R., and Cerrolaza A. J., “Filter versus wrapper gene selection approaches in DNA micro array domains,” Artif. Intell. Med., vol. 31, no. 2, pp. 91–103, 2004. [DOI] [PubMed] [Google Scholar]

[ref52] [52].Chyzhyk D., Savio A., and Graña M., “Evolutionary ELM wrapper feature selection for Alzheimer’s disease CAD on anatomical brain MRI,” Neurocomputing, vol. 128, pp. 73–80, Mar. 2013. [Google Scholar]

[ref53] [53].Weston J., Elisseeff A., Schiölkopf B., and Tipping M., “Use of the zero norm with linear models and kernel methods,” J. Mach. Learn. Res., vol. 3, pp. 1439–1461, Mar. 2003. [Google Scholar]

[ref54] [54].Breiman L., Friedman J., Olshen R., and Stone C., Classification and Regression Trees. Monterey, CA, USA: Wadsworth and Brooks, 1984. [Google Scholar]

[ref55] [55].Casanova R.et al. , “High dimensional classification of structural MRI Alzheimer’s disease data based on large scale regularization,” Front. Neuroinform., vol. 5, no. 22, pp. 1–9, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref56] [56].Morgado P. M., Silveira M., and Marques J. S., “Diagnosis of Alzheimer’s disease using 3D local binary patterns,” Comput. Methods Biomech. Biomed. Eng. Imag. Vis., vol. 1, no. 1, pp. 2–12, 2013. [Google Scholar]

[ref57] [57].Chaves R.et al. , “SVM-based computer-aided diagnosis of the Alzheimer’s disease using t-test NMSE feature selection with feature correlation weighting,” Neurosci. Lett., vol. 461, no. 3, pp. 293–297, 2009. [DOI] [PubMed] [Google Scholar]

[ref58] [58].Bicacro E., Silveira M., and Marques J. S., “Alternative feature extraction methods in 3D brain image-based diagnosis of Alzheimer’s disease,” in Proc. 9th IEEE Int. Conf. Image Process. (ICIP), Sep./Oct. 2012, pp. 1237–1240. [Google Scholar]

[ref59] [59].He X., Cai D., and Niyogi P., “Laplacian score for feature selection,” in Proc. Adv. Neural Inf. Process. Syst., 2005, pp. 507–514. [Google Scholar]

[ref60] [60].Mitchell T. M., Machine Learning. New York, NY, USA: McGraw-Hill, 1997. [Google Scholar]

[ref61] [61].Quinlan J. R., “Induction of decision trees,” Mach. Learn., vol. 1, no. 1, pp. 81–106, 1986. [Google Scholar]

[ref62] [62].Yang J., Liu Y., Liu Z., Zhu X., and Zhang X., “A new feature selection algorithm based on binomial hypothesis testing for spam filtering,” Knowl.-Based-Syst., vol. 24, no. 1, pp. 904–914, 2011. [Google Scholar]

[ref63] [63].Kira K. and Rendell L. A., “The feature selection problem: Traditional methods and a new algorithm,” in Proc. 10th Nat. Conf. Artif. Intell., 1992, pp. 129–134. [Google Scholar]

[ref64] [64].Peng H., Long F., and Ding C., “Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 8, pp. 1226–1238, Aug. 2005. [DOI] [PubMed] [Google Scholar]

[ref65] [65].Fleuret F., “Fast binary feature selection with conditional mutual information,” J. Mach. Learn. Res., vol. 5, pp. 1531–1555, Nov. 2004. [Google Scholar]

[ref66] [66].Guo B. and Nixon M. S., “Gait feature subset selection by mutual information,” IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 39, no. 1, pp. 36–46, Jan. 2009. [Google Scholar]

[ref67] [67].Mitra P., Murthy C. A., and Pal S. K., “Unsupervised feature selection using feature similarity,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 3, pp. 301–312, Mar. 2002. [Google Scholar]

[ref68] [68].Yu L. and Liu H., “Feature selection for high-dimensional data: A fast correlation-based filter solution,” in Proc. 20th Int. Conf. Mach. Learn., 2003, pp. 856–863. [Google Scholar]

[ref69] [69].Yu L. and Liu H., “Efficient feature selection via analysis of relevance and redundancy,” J. Mach. Learn. Res., vol. 5, pp. 1205–1224, Oct. 2004. [Google Scholar]

[ref70] [70].Ferreira A. J. and Figueiredo M. A. T., “An unsupervised approach to feature discretization and selection,” Pattern Recognit., vol. 45, no. 9, pp. 3048–3060, Sep. 2012. [Google Scholar]

[ref71] [71].Chandrashekar G. and Sahin F., “A survey on feature selection methods,” Comput. Elect. Eng., vol. 40, no. 9, pp. 16–28, 2014. [Google Scholar]

[ref72] [72].Morgado P. M. and Silveira M., “Minimal neighborhood redundancy maximal relevance: Application to the diagnosis of Alzheimer’s disease,” Neurocomputing, vol. 155, pp. 230–295, May 2015. [Google Scholar]

[ref73] [73].Tabakhi S. and Moradi P., “Relevance–redundancy feature selection based on ant colony optimization,” Pattern Recognit., vol. 48, no. 9, pp. 2798–2811, 2015. [Google Scholar]

[ref74] [74].Rathore S., Habes M., Iftikhar M. A., Shacklett A., and Davatzikos C., “A review on neuroimaging-based classification studies and associated feature extraction methods for Alzheimer’s disease and its prodromal stages,” NeuroImage, vol. 155, pp. 530–548, Jul. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref75] [75].Hinrichs C., Singh V., Mukherjee L., Xu G., Chung M. K., and Johnson S. C., “Spatially augmented LPboosting for AD classification with evaluations on the ADNI dataset,” Neuroimage, vol. 48, no. 1, pp. 138–149, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref76] [76].Cabral C., Morgado P. M., Costa D. C., and Silveira M., “Predicting conversion from MCI to AD with FDG-PET brain images at different prodromal stages,” Comput. Biol. Med., vol. 58, pp. 101–109, Mar. 2015. [DOI] [PubMed] [Google Scholar]

[ref77] [77].Eskildsen S. F., Coupé P., García-Lorenzo D., Fonov V., Pruessner J. C., and Collins D. L., “Prediction of Alzheimer’s disease in subjects with mild cognitive impairment from the ADNI cohort using patterns of cortical thinning,” Neuroimage, vol. 65, pp. 511–521, Jan. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref78] [78].Pagani M.et al. , “Volume of interest-based [18F] fluorodeoxyglucose PET discriminates MCI converting to Alzheimer’s disease from healthy controls. A European Alzheimer’s disease consortium (EADC) study,” Neuroimage Clin., vol. 7, pp. 34–42, Jan. 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref79] [79].Gray K. R., Wolz R., Heckemann R. A., Aljabar P., Hammers A., and Rueckert D., “Multi-region analysis of longitudinal FDG-PET for the classification of Alzheimer’s disease,” Neuroimage, vol. 60, no. 1, pp. 221–229, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref80] [80].McKhann G., Drachman D., Folstein M., Katzman R., Price D., and Stadlan E. M., “Clinical diagnosis of Alzheimer’s disease report of the NINCDS-ADRDA work group under the auspices of department of health and human services task force on Alzheimer’s disease,” Neurology, vol. 34, no. 7, pp. 939–944, Jul. 1984. [DOI] [PubMed] [Google Scholar]

[ref81] [81].Woods R. P., Grafton S. T., Holmes C. J., Cherry S. R., and Mazziotta J. C., “Automated image registration: I. General methods and intrasubject, intramodality validation,” J. Comput. Assist. Tomograp., vol. 22, no. 1, pp. 139–152, 1998. [DOI] [PubMed] [Google Scholar]

[ref82] [82].Maldjian J. A., Laurienti P. J., Kraft R. A., and Burdette J. H., “An automated method for neuroanatomic and cytoarchitectonic atlas-based interrogation of fMRI data sets,” NeuroImage, vol. 19, no. 3, pp. 1233–1239, 2003. [DOI] [PubMed] [Google Scholar]

[ref83] [83].Garali I., Adel M., Takerkart S., Bourennane S., and Guedj E., “Brain region of interest selection for 18FDG positrons emission tomography computer-aided image classification,” in Proc. IEEE Int. Conf. Image Process. Theory Appl. (IPTA), Paris, France, Oct. 2014, pp. 1–5. [Google Scholar]

[ref84] [84].Woods R. P., Spatial Transformation Models. San Diego, CA, USA: Academic, 2000, ch. 29, pp. 465–490. [Google Scholar]

[ref85] [85].Bohnen N. I., Djang D. S., Herholz K., Anzai Y., and Minoshima S., “Effectiveness and safety of 18F-FDG PET in the evaluation of dementia: A review of the recent literature,” J. Nucl. Med., vol. 53, no. 1, pp. 59–71, 2012. [DOI] [PubMed] [Google Scholar]

[ref86] [86].Webb A. R. and Copsey K. D., Statistical Pattern Recognition, 3rd ed. Chichester, U.K.: Wiley, 2011. [Google Scholar]

[ref87] [87].Duda R. O., Hart P. E., and Stork D. G., Pattern Classification, 2nd ed. New York, NY, USA: Wiley, 2001. [Google Scholar]

[ref88] [88].Guyon I., Weston J., Barnhill S., and Vapnik V., “Gene selection for cancer classification using support vector machines,” Mach. Learn., vol. 46, nos. 1–3, pp. 389–422, 2002. [Google Scholar]

[ref89] [89].Liaw A. and Wiener M., “Classification and regression by random forest,” R Newslett., vol. 2, no. 3, pp. 18–22, 2002. [Google Scholar]

[ref90] [90].Robnik-Šikonja M. and Kononenko I., “Theoretical and empirical analysis of ReliefF and RReliefF,” Mach. Learn. J., vol. 53, nos. 1–2, pp. 23–69, Oct. 2003. [Google Scholar]

[ref91] [91].Kononenko I., “Estimating attributes: Analysis and extensions of relief,” in Proc. Eur. Conf. Mach. Learn. (ECML), vol. 784 1994, pp. 171–182. [Google Scholar]

[ref92] [92].Nichols T. E. and Holmes A. P., “Nonparametric permutation tests for functional neuroimaging: A primer with examples,” Hum. Brain Mapping, vol. 15, no. 1, pp. 1–25, 2003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref93] [93].Koric L.et al. , “Molecular imaging in the diagnosis of Alzheimer’s disease and related disorders,” (Paris), Rev. Neurol., vol. 172, no. 12, pp. 725–734, Dec. 2016. [DOI] [PubMed] [Google Scholar]

PERMALINK

Histogram-Based Features Selection and Volume of Interest Ranking for Brain PET Image Classification

Imene Garali

Mouloud Adel

Salah Bourennane

Eric Guedj

Abstract

I. Introduction

II. Proposed Method Description

A. Overview of the Proposed Method

FIGURE 1.

B. Image Database

TABLE 1. Demographic Details of the Dataset.

C. Image Preprocessing

III. Feature Selection and Volume of Interest Ranking

A. Notation

FIGURE 2.

B. Features Definition

C. Area Under Curve (AUC) Parameter

FIGURE 3.

FIGURE 4.

D. “Combination_Matrix” Analysis

TABLE 2. Each VOI is Caracterized By its Own Combination Parameters .

TABLE 3. The List of the Top Ranked VOIs Selected With the Best Parameters.

FIGURE 5.

FIGURE 6.

FIGURE 7.

IV. Classification Results and Discussion

A. Classification Results

FIGURE 8.

FIGURE 9.

TABLE 4. Voxel Selection and VOI Selection Analyses To Discriminate Between Groups (AD, HC AND MCI).

FIGURE 10.

B. Discussion

V. Conclusion

Funding Statement

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases