Abstract
Alzheimer’s disease has been extensively studied using undirected graphs to represent the correlations of BOLD signals in different anatomical regions through functional magnetic resonance imaging (fMRI). However, there has been relatively little analysis of this kind of data using directed graphs, which offer the potential to capture asymmetries in the interactions between different anatomical brain regions. Detecting these asymmetries is relevant to detecting the disease at an early stage. For this reason, in this paper, we analyze data extracted from fMRI images using the net4Lap algorithm to infer a directed graph from the available BOLD signals, and then seek to determine asymmetries between the left and right hemispheres of the brain using a directed version of the Return Random Walk (RRW). Experimental evaluation of this method reveals that it leads to the identification of anatomical brain regions known to be implicated in the early development of Alzheimer’s disease in clinical studies.
Keywords: Alzheimer’s disease, neural embedding, random walk, link prediction, brain asymmetries, fMRI networks, directed graphs
1. Introduction
Alzheimer’s disease (AD) is a progressive irreversible pathology (neurodegenerative disease) that most frequently affects older people. It has been widely studied by applying network analysis methods to activation patterns using functional magnetic resonance imaging (fMRI). More precisely, the blood oxygen level-dependent (BOLD) signal in fMRI images is a good indicator of activation potentials in different brain regions, and the neuronal activity between the different brain regions can be determined by measuring the correlation between the BOLD activation signals.
These images allow us to study the neuropathophysiology associated with Alzheimer’s disease in its different stages of development. These are usually labelled as: (a) healthy normal control subjects (NORMAL), (b) early mild cognitive impairment (EMCI), (c) late mild cognitive impairment (LMCI) and (d) patients with fully developed Alzheimer’s disease [1]. The pattern of inter-region activation is usually represented as an undirected graph. This network representation of region activity has proved to be a useful tool for understanding the functional workings of the human brain [2]. Furthermore, in [3], such networks have proved very useful in understanding the neuronal disorders associated with Alzheimer’s disease through the analysis of the intrinsic brain connectivity patterns. Alzheimer’s disease gradually affects the function of different regions of the brain. The symptoms are the progressive impairment of memory, motor tasks, learning and other cognitive abilities. This results in dementia and, finally, death [4]. Converging evidence exists concerning which regions are affected in fMRI images of patients with the disease, but it is not totally clear how these cognitive problems or abnormalities are reflected in the functional structure of the whole brain, or how the functional structure changes with different stages of the disease.
In [3], tools from complex network analysis are used with the aim of characterizing the topological structures present in the brain. Specifically, they quantify the functional interaction between all regions of the brain using the mathematical study of activation networks based on graph theory. This methodology offers an attractive approach since it provides useful and effective tools for characterizing network structures together with their intrinsic complexity. There are several approaches that use global and local structures encoded by undirected graphs [4,5,6,7,8,9,10], and machine learning [11,12,13,14].
Graph theory provides meaningful and easily computable measurements to characterize network connectivity and asymmetries associated with both neurological and psychiatric pathologies [4]. In [15], functional brain analysis is performed using features based on the global and local measurement of connectivity.
Almost all the techniques described above use undirected graphs. However, if the origins of the activation patterns are associated with neuronal activity, then directed graphs may be a more natural representation which captures centrality, since they let us measure the temporal causality of activation signals for different anatomical structures in the brain. This opens a new way to establish effective methods for measuring the structural properties of the inter-regional causal networks extracted from fMRI brain data.
To this end, a strategy for identifying the differences in fMRI activation network structures for patients with EMCI, LMCI and AD is proposed in [16]. Directed graphs are combined with entropic measurements to differentiate patients falling into the three affected categories from normal group subjects (healthy patients). The authors explore three different approaches: (i) a method based on applying linear discriminant analysis to vectors representing the in-degree and out-degree statistics of different anatomical regions, (ii) a method that uses an entropic measure of node assortativity to estimate the asymmetries between the in-degree and out-degree of the nodes, and (iii) a method that selects the most salient anatomical brain regions and uses the degree statistics of the connecting directed links. However, the structure of the directed networks representing the activation patterns and the proposed measures need to be better developed to understand the differences between healthy subjects and the early development of Alzheimer’s disease when BOLD signal noise is a limiting factor.
Several studies of the early clinical diagnosis of Alzheimer’s disease suggest that there exist asymmetries between the right and left brain hemispheres in specific regions of the brain [17,18,19,20], and so there is an imperative to develop methods of analysis that can be used to understand the pattern of asymmetries. In other words, the structure of the directed networks representing the activation patterns needs to be better understood to find the differences between healthy brains and those showing early signs of Alzheimer’s disease when signal noise is present. To reduce the effects of BOLD signal noise and the resulting confusion in the inferred interaction structures between different brain regions, we propose to use a powerful unsupervised network analysis tool, namely the Return Random Walk [21,22]. This method allows us to reduce the inter-class noise while increasing the density of the structure through intra-class link prediction. However, this method uses undirected graphs and it needs a pre-processing step to obtain a better-conditioned input graph (directed network).
To solve this problem, in this paper we propose a method for analyzing asymmetries in different regions of the brain for healthy normal patients (NORMAL) and patients with early mild cognitive impairment (EMCI). In particular, we use the structural properties of directed graphs extracted from fMRI brain data, and use an embedding (net4Lap) [23] to infer an activation graph that is more locally isotropic and harmonic than the original graph. After that, to analyze the brain data of different patients, we demonstrate that a directed version of the Return Random Walk provides a better-conditioned graph whose dense structure expands the asymmetries and allows us to analyze them with more precision. We propose to quantify these asymmetries through in-out-degree measures of different regions of the brain.
Our experimental results show that the in-degree/out-degree distributions of asymmetries of Normal and EMCI subjects can be useful as a method for early detection of Alzheimer’s disease.
2. Materials
2.1. Subjects
In this paper, all participants were drawn from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset. We select 147 subjects, of whom 38 were classified as healthy (Normal), 47 as EMCI, 32 as LMCI, and 30 as AD. The criteria used to distinguish between EMCI and LMCI subjects are described in the ADNI procedure manual (http://www.adni-info.org/).
A subject may have several fMRI acquisitions taken at different time steps. In our study, for each patient we choose only one acquisition (mean). The subjects’ demographic information is summarized in Table 1. This dataset stands at around 1000 patients, but not every patient has both morphological and functional images. Moreover, the initial status of some subjects in the dataset can change (e.g., from Normal to EMCI). Those patients were excluded from our study (we only deal with patients whose ADNI classification did not change).
Table 1.
Group | Number of Patients | Gender | Age Range (Years) | Mean Age (SD) |
---|---|---|---|---|
Normal | 38 | 17 M/21 F | 66–87 | 74.19 (±5.8) |
EMCI | 47 | 19 M/28 F | 62–88 | 71.59 (±7.4) |
LMCI | 32 | 18 M/14 F | 61–85 | 72.42 (±6.9) |
AD | 30 | 11 M/19 F | 61–90 | 71.56 (±7.5) |
Finally, the ADNI dataset was obtained with the written consent of all participants, according to the recommendations of the Code of Ethics and the Good Clinical Practice guidelines of the World Medical Association (Declaration of Helsinki) and U.S. 21 CFR Part 50 (Protection of Human Subjects) and Part 56 (Institutional Review Boards).
2.2. Data Acquisition and Image Preprocessing
In the ADNI study, rs-fMRI data were collected yearly at baseline, one, and two-year follow-ups (three time points in total). The rs-fMRI scans take advantage of simultaneous multi-slice acceleration for echo-planar images with the following parameters: slice thickness = 3.3 mm, matrix = 256 × 256, spatial resolution = 3 × 3 × 3 mm, number of volumes = 140, and number of slices = 48. From these 140 volumes, the first 10 volumes of each patient were excluded to avoid possible noise related to the balance of the scanner. The remaining volumes were used in the subsequent processing, including slice time correction, motion correction, normalization, and Gaussian spatial smoothing. Finally, to work in a single reference frame, it is useful to realign the MRI images with the 64 × 64 fMRI images.
Each image volume is acquired every two seconds with Blood-Oxygenation-Level-Dependent (BOLD) signals. Neuroanatomy experts have manually drawn several regions of interest (ROIs) from the fMRI voxels. In the dataset, 96 ROIs have been created in each fMRI image. They correspond to different anatomical regions of the brain and are labelled with anatomical names to distinguish them (one observation per person; the data is not longitudinal). The correlation between the average time series in different ROIs represents the degree of functional connectivity between regions, which is driven by neural activity [24].
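As an illustration of the last point, the following minimal sketch computes a correlation-based connectivity matrix from ROI-averaged time series. It is not taken from the ADNI pipeline or from the authors' code: the function name `roi_correlation_matrix`, the use of the Pearson correlation, and the random toy signals are assumptions made only for this example.

```python
import numpy as np

def roi_correlation_matrix(bold):
    """Pearson correlation between ROI-averaged BOLD time series.

    bold: array of shape (n_rois, n_timepoints), one row per ROI.
    Returns an (n_rois, n_rois) symmetric matrix with a zero diagonal.
    """
    bold = np.asarray(bold, dtype=float)
    corr = np.corrcoef(bold)        # rows are treated as variables
    np.fill_diagonal(corr, 0.0)     # ignore self-connections
    return corr

# Toy data (assumption): 96 ROIs and 130 volumes (140 minus the first 10).
rng = np.random.default_rng(0)
n_rois, n_volumes = 96, 130
bold = rng.standard_normal((n_rois, n_volumes))
# Make ROI 1 closely follow ROI 0 so that one strong link is present.
bold[1] = bold[0] + 0.1 * rng.standard_normal(n_volumes)

corr = roi_correlation_matrix(bold)
```

A matrix of this kind is symmetric, which is why it leads naturally to an undirected graph; the directed graphs used in this paper require additional information such as time lags.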
3. Methods and Procedure
In this paper, we propose to find which regions of the brain are affected in the early stage of Alzheimer’s disease. This is relevant if the early detection of Alzheimer’s disease is attempted. Alzheimer’s is a progressive and slow disease, which takes a long time (20 years) to develop. Recent studies aim to detect and classify the disease with respect to healthy brains using functional magnetic resonance imaging (fMRI). Here we aim to explore the functional relationships between different brain regions using directed graphs. The novel contribution (see Figure 1) is to infer the directed graphs from fMRI information using the net4Lap algorithm (see Section 3.1). The resulting directed graphs have improved local isotropy and harmonic properties when compared to those inferred directly from the thresholded correlation of the BOLD signals for different brain regions. Moreover, the directed graph representation allows us to identify asymmetries between left and right hemispheres of the brain using a directed version of the Return Random Walk. Specifically, we characterize these asymmetries by measuring the in- and out-degrees for each region of interest of the brain. Finally, in our experiments (see Section 4) we identify which anatomical brain regions give an early indication of the development of Alzheimer’s disease, and we then discuss the meaning of these results (see Section 5) and show them to be consistent with recent clinical studies [17,18,19]. In Section 6 we present the conclusions of our work and discuss potential future directions.
3.1. Directed-RRW in fMRI Networks Using net4Lap
In this paper, we demonstrate the utility of the Return Random Walk as a tool for early detection of the disease in fMRI activation networks, allowing us to distinguish between Normal subjects and EMCI patients (patients in the early stage of Alzheimer’s Disease or AD). We first compare the differences between the in-degree and out-degree distributions of directed graphs generated using the net4Lap algorithm and analyzed using RRW. Then we compare the right and left sections of different regions of the brain (e.g., left vs right temporal cortex), identifying the regions with the strongest asymmetries between the hemispheres. Finally, we use these regions to classify subjects as normal or EMCI using linear discriminant analysis (LDA).
3.1.1. The net4Lap Architecture
To create a directed graph from the fMRI BOLD signal data we use net4Lap. This is a novel architecture for Laplacian-based ranking [23]. This architecture is based on two main ingredients: (i) pre-processing graphs with neural embeddings prior to performing Laplacian ranking, and (ii) introducing a global measure of data centrality (Katz centrality [25]) to modulate a diffusion process on the graph. In this context, ranking can be considered to be an optimization problem where regularization is emphasized. Given an input kNN graph G, net4Lap (neural networks for Laplacian-based regularization) learns an embedding of the nodes of the graph into a vector space via SGD (stochastic gradient descent [26], see Figure 2) from bags-of-paths sampled through different random walks (different empirical studies show that the choice of random walk algorithm is not critical provided it has well-defined return path policies).
A modified kNN graph based on the embedding is more locally isotropic and harmonic than the original one. This modified graph is fed into a Laplacian regularizer based on global centrality. The result is a new kNN graph based on node ranking relationships that is re-fed into the stochastic gradient descent neural model for a re-ranking cycle.
As we can see in Figure 2, the architecture of net4Lap is the following: (i) a given kNN graph (a sparse and potentially structurally noisy graph) is processed by SGD (with negative sampling), (ii) this leads to a neural embedding of the nodes of the graph into a vector space, which yields a harmonic version of the original graph, (iii) this modified graph is itself fed into a Laplacian regularizer. The required neural embeddings are constructed from a bag-of-paths extracted from the original graph using classical random walks. As an output of the initial embedding step, we obtain a denser graph which is locally isotropic and has some structural noise filtered from it. This graph can be used either for ranking or for iteratively obtaining an improved kNN graph [27] which is in turn fed into a further step of stochastic gradient descent for re-ranking.
As a result, we have a modified, locally isotropic and harmonic graph for use in subsequent processing tasks.
This novel neural-regularization ranking architecture exploits the flexibility and scalability of SGD to pre-process the input graph so that it is well conditioned for Laplacian regularization. In this paper, we investigate the following strategy: the structural noise of fMRI data must be filtered out of the raw activation network, because although it is relatively easy to compare a diseased brain with a healthy brain, it is difficult to make the same comparison for a brain at an early stage of Alzheimer’s disease due to the confusion caused by structural noise. Our approach involves pre-processing a kNN graph using neural embedding, where SGD infers a node embedding from a sampling of the graph through random walks with return policies. This gives a modified graph that is both locally isotropic and harmonic (the Laplacian-based ranking minimizes the harmonic loss and absorption probabilities are constrained by Katz centrality [25], which can be summarized as follows: a node is important if it is linked to other important nodes). However, this step is only the first part of a pre-processing stage to obtain better graphs for analysis. Later on, we show how to obtain better-conditioned graphs by applying Return Random Walks to remove the structural noise.
Algorithm. Given an input kNN graph G, net4Lap (Neural Networks for Laplacian-based Regularization) learns an embedding via SGD from bags-of-paths sampled through random walks. A second kNN graph based on the embedding is more harmonic (locally isotropic) than the original, and it feeds a Laplacian regularizer based on global centrality (see Figure 2). This algorithm can be structured in a few steps as follows:
STEP 1: Neural embedding and Random Walk. Neural embeddings are built by SGD by sampling bags-of-paths in G, so that the context of each node is captured from its statistical co-occurrences with neighboring nodes.
We construct the undirected weighted kNN graph G as follows: , where are the pairwise affinities between nodes , is a sub-Gaussian function, and if . Finally, SGD aims at inferring a function from:
(1) |
where (log-probability proportional to correlation) and is the local partition function. Sampling is driven by random walks with some return probability. As a result, we obtain a contextualized graph , whose weights are locally isotropic, filtering some of the structural noise. We exploit both the flexibility and scalability of SGD to pre-process the input graph.
STEP 2: Laplacian Regularization. The similarity relies on the probabilities A:
(2) |
where is the Laplacian of , and define an affinity function. The harmonic loss predicts that = I is better for moving around dense graphs and = D′ in case of locally sparse graphs.
After that, ranking is formulated as an optimization problem:
(3) |
where = 1. The red term penalizes large deviations associated with linked nodes:
(4) |
The differential absorption flow () can only grow when or . The local isotropy helps us to constrain this term. The resulting structure contained in imposes new links or affinities based on the absorption probabilities.
Setting a balanced we have a clever trade-off that adapts ranking to the underlying manifold, as well as relaxing our optimization problem. The result is a harmonic centralized and regularized graph.
STEP 3: Global centrality. The role of global centrality is twofold: (a) capture the underlying density of the manifold using a global measure, and (b) increase the accuracy of re-ranking processes. We build in terms of a more global centrality (Katz):
(5) |
where C is the centrality. Given the red term (Equations (3) and (4)), we must minimize this equation. The result is that global centrality modulates the diffusion process.
In conclusion, we use this method to preprocess a kNN graph with neural embeddings, where SGD infers a node embedding from sampling through RWs, yielding a contextualized graph that is locally isotropic. The choice of RWs is not critical (provided they have return probabilities). Laplacian-based ranking minimizes the harmonic loss, and absorption probabilities are constrained by centrality. The diffusion process is modulated by global centrality (Katz), which is better than degree centrality. Our approach improves both ranking and re-ranking (up to 14%). For more mathematical details, see [23].
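Since Katz centrality plays the role of the global measure in STEP 3, a minimal sketch of one common form of it may help fix ideas. It is not the net4Lap implementation of [23]: the function name `katz_centrality`, the attenuation parameter `alpha`, the convention that `A[u, v] = 1` encodes an edge from u to v, and the toy star graph are all assumptions made for illustration. The centrality solves x = alpha * A^T x + 1, so a node gains importance from the nodes that link to it.

```python
import numpy as np

def katz_centrality(A, alpha):
    """Katz centrality x = (I - alpha * A^T)^{-1} 1 for a directed graph.

    A[u, v] = 1 means an edge from u to v. The series behind the closed
    form only converges if alpha is below 1 / spectral_radius(A).
    """
    A = np.asarray(A, dtype=float)
    n = A.shape[0]
    rho = np.max(np.abs(np.linalg.eigvals(A)))
    if alpha * rho >= 1.0:
        raise ValueError("alpha must be smaller than 1 / spectral radius")
    # Solve the linear system instead of forming the inverse explicitly.
    return np.linalg.solve(np.eye(n) - alpha * A.T, np.ones(n))

# Toy star graph (assumption): nodes 1, 2 and 3 all point to node 0.
A = np.zeros((4, 4))
A[1:, 0] = 1.0
c = katz_centrality(A, alpha=0.5)
```

In this toy graph node 0 receives all the links and therefore obtains the largest centrality, while the three source nodes keep the baseline value of 1.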
3.1.2. Directed Return Random Walks
Given the preprocessed graphs from net4Lap, we apply a Return Random Walk (RRW) [28]. This is a structural filtering process which minimizes the probability that a random walk starting and ending at a given node traverses inter-class links. As a result, we obtain better-conditioned weighted adjacency matrices (denser graphs) with which to identify asymmetries. The underlying idea is simple: the method reinforces intra-class edges while penalizing inter-class weights, which improves efficiency and captures more intra-cluster edges while removing inter-cluster edges (noise). The RRW method is designed as follows:
STEP 1: Design of . Given a weighted adjacency matrix W, we calculate a reweighted adjacency matrix as follows: (i) we track the two-step random walks from an origin node to a destination node via a transition node , (ii) we return to from , maximizing the probability of returning through a different transition node . For the first step (outgoing from to through ) we have as well as a standard return . This return works well if and in healthy brains, where left-right paths are balanced (see red-blue arrows in Figure 3a). However, the transition node for the return could be constrained so that . For this case, travelling out of a region of interest is penalized since the random walker must choose a different path, which in turn is difficult to find on average. Therefore, we obtain from as follows:
(6) |
i.e., we compute the outgoing and return probabilities for each possible transition node (product of independent probabilities) through a different transition node . We choose the maximum product of probabilities for each referred to a given k and finally we retain the supremum of these maxima. As a result, when far (inter-class) transition regions of interest are frequently selected for a given (compare the asymmetries between the red and blue arrows in Figure 3b), its weight is significantly reduced. The weight measures the connectedness of two different nodes in a given region or cluster of nodes. If we have a high value, this means that the two nodes belong to a region with strong neighbors and thus they are strongly connected. That is, the region is highly cohesive.
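The maximization described in STEP 1 can be sketched as follows. This is only one possible reading of the text, not the authors' implementation (which is linked in Section 4): it assumes that transition probabilities are obtained by row-normalizing W, that the outgoing probability from i to j via k is p(k|i) p(j|k), that the return probability via l is p(l|j) p(i|l), and that the new weight is the maximum of their product over all pairs with l different from k. The function name `return_random_walk_weights` and the toy graph are hypothetical.

```python
import numpy as np

def return_random_walk_weights(W):
    """Reweight edges by the best go-and-return walk using two different
    transition nodes (an assumed reading of the RRW design step)."""
    W = np.asarray(W, dtype=float)
    n = W.shape[0]
    row_sums = W.sum(axis=1, keepdims=True)
    # Row-stochastic transition matrix; rows without edges stay at zero.
    P = np.divide(W, row_sums, out=np.zeros_like(W), where=row_sums > 0)
    We = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            go = P[i, :] * P[:, j]        # go[k]   = p(k|i) * p(j|k)
            back = P[j, :] * P[:, i]      # back[l] = p(l|j) * p(i|l)
            prod = np.outer(go, back)     # prod[k, l] for every pair (k, l)
            np.fill_diagonal(prod, 0.0)   # enforce l != k
            We[i, j] = prod.max()
    return We

# Toy graph (assumption): two 4-node cliques joined by one bridge edge 3-4.
W = np.zeros((8, 8))
for block in (range(0, 4), range(4, 8)):
    for a in block:
        for b in block:
            if a != b:
                W[a, b] = 1.0
W[3, 4] = W[4, 3] = 1.0

We = return_random_walk_weights(W)
```

In this toy example the intra-clique pair (0, 1) keeps a positive weight, whereas the bridge (3, 4) receives weight zero because its endpoints share no transition node. This mirrors the intended filtering of inter-class edges.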
STEP 2: Our main hypothesis is that the number of edges is small on average, since the amount of noise (inter-class edges) tends to be small in comparison with the total number of edges. However, patterns can be confused in realistic situations due either to their intrinsic similarity or to the use of an improper similarity measure. This fact leads to a significant decrease of many weights of W.
To filter this inter-class noise, we can study the relationship between the sum of different weights of the Return Random Walks and the shortest path (between all pairs of nodes):
(7) |
where and is the shortest path between i and j.
STEP 3: When the Euclidean distance between nodes is high or the path between them is weak (small weight), we have a worst-case value of .
This equation does not account for the difference between the outgoing and return paths. For this case we take
(8) |
where
Finally, we measure the difference between outgoing and return paths, which gauges the ease with which node j is reached from node i with a different return path. If the value of b is low, the edge is considered to be an inter-class edge because the connectivity is poor.
We expect the connectedness values to be higher in healthy brains (Figure 3a) because of the better quality of the connections. For instance, connections may be destroyed by Alzheimer’s pathology, and the corresponding probability values are penalized due to the difficulty the RRW has in moving through the graph (Figure 3b).
Once we have filtered the graph, we use it to find asymmetries in our experiments with fMRI data using a measure of the imbalance between in- and out-degree.
3.1.3. In-Out-Degree Measure
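Let $G = (V, E)$ be a directed graph (a set of vertices connected by edges which have a direction associated with them), where $V$ is the set of nodes and $E$ is the set of edges, and the directed edge $(u, v) \in E$ starts at node $u$ and ends at node $v$. The directed adjacency matrix $A$ is defined as follows:
$$A_{uv} = \begin{cases} 1 & \text{if } (u, v) \in E, \\ 0 & \text{otherwise.} \end{cases} \tag{9}$$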
In this paper, we measure the connectivity of different regions of the brain through the in-degree ($d_u^{in}$) and out-degree ($d_u^{out}$) of each node $u$ corresponding to an anatomical brain region of interest:
$$d_u^{in} = \sum_{v \in V} A_{vu}, \qquad d_u^{out} = \sum_{v \in V} A_{uv}, \tag{10}$$
where $A_{uv} = A_{vu} = 1$ implies a bidirectional connection between two regions of the brain, and $A_{uv} \neq A_{vu}$ implies a unidirectional connection.
To measure the asymmetries between two regions (left l and right r) of the same anatomical area of the brain, we define the difference or ratio in-out-degree as follows
(11) |
which quantifies the degree dissimilarity or asymmetry of two subregions of the same area of the brain.
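The degree measures of this subsection can be sketched in a few lines. The in- and out-degrees below follow the usual column-sum and row-sum convention for a directed adjacency matrix with `A[u, v] = 1` for an edge from u to v. The left-right comparison is only an assumed stand-in for the asymmetry measure, taken here as the absolute difference between the (in minus out) imbalances of the two ROIs; the function names and the toy graph are hypothetical.

```python
import numpy as np

def in_out_degrees(A):
    """In- and out-degree of every node of a directed adjacency matrix."""
    A = np.asarray(A)
    in_deg = A.sum(axis=0)    # column sums: edges arriving at each node
    out_deg = A.sum(axis=1)   # row sums: edges leaving each node
    return in_deg, out_deg

def left_right_asymmetry(A, left, right):
    """Assumed asymmetry: gap between the in-out imbalance of two ROIs."""
    in_deg, out_deg = in_out_degrees(A)
    imbalance = in_deg - out_deg
    return abs(imbalance[left] - imbalance[right])

# Toy graph (assumption) with edges 0->1, 0->2, 2->3 and 3->0.
A = np.array([[0, 1, 1, 0],
              [0, 0, 0, 0],
              [0, 0, 0, 1],
              [1, 0, 0, 0]])

in_deg, out_deg = in_out_degrees(A)
asym = left_right_asymmetry(A, left=0, right=1)
```

Here node 0 sends more links than it receives and node 1 only receives, so treating them as a left-right pair gives a nonzero asymmetry, while a pair with identical imbalances would give zero.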
4. Experiments
In this section, we describe our method for early Alzheimer’s detection in fMRI activation networks for Normal and EMCI subjects. We first compare the difference between the in-degree and out-degree of the directed graphs generated with net4Lap and the RRW (the code is available at https://github.com/manuelcurado/ALZcode). Then we compare right and left portions of different regions of the brain, identifying those regions with the strongest asymmetries between the left and right hemispheres. Finally, we evaluate whether these anatomical regions can be used to classify subjects as normal or EMCI using linear discriminant analysis (LDA).
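For readers unfamiliar with the final classification step, the following is a minimal sketch of two-class Fisher linear discriminant analysis. It is not the code from the repository above: the helper names `fit_lda` and `predict_lda`, the equal-prior midpoint threshold, the small ridge term, and the synthetic feature vectors (which merely reuse the group sizes 38 and 47) are assumptions made for illustration, and the numbers it produces say nothing about the real data.

```python
import numpy as np

def fit_lda(X0, X1):
    """Two-class Fisher LDA: returns direction w and offset b such that
    a sample x is assigned to class 1 when w @ x + b > 0."""
    X0, X1 = np.asarray(X0, dtype=float), np.asarray(X1, dtype=float)
    mu0, mu1 = X0.mean(axis=0), X1.mean(axis=0)
    # Pooled within-class scatter matrix.
    Sw = (X0 - mu0).T @ (X0 - mu0) + (X1 - mu1).T @ (X1 - mu1)
    Sw += 1e-6 * np.eye(Sw.shape[0])     # small ridge for numerical stability
    w = np.linalg.solve(Sw, mu1 - mu0)
    b = -0.5 * w @ (mu0 + mu1)           # threshold halfway between the means
    return w, b

def predict_lda(X, w, b):
    return (np.asarray(X, dtype=float) @ w + b > 0).astype(int)

# Synthetic features (assumption): six asymmetry values per subject.
rng = np.random.default_rng(0)
X_normal = rng.normal(0.0, 1.0, size=(38, 6))
X_emci = rng.normal(2.0, 1.0, size=(47, 6))

w, b = fit_lda(X_normal, X_emci)
labels = np.array([0] * 38 + [1] * 47)
pred = predict_lda(np.vstack([X_normal, X_emci]), w, b)
accuracy = (pred == labels).mean()
```

In practice a library implementation with cross-validation would be preferable; the sketch only shows the projection and thresholding that LDA performs on the selected ROI features.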
4.1. Synthetic Experiments
To evaluate the implications of our algorithm, we generate a synthetic dataset of 500 subjects (or graphs), each with 96 regions of interest or areas of the brain (nodes) and with a different level of edge density (links between different regions of interest) drawn randomly between 0 and 1. This dataset does not distinguish between different groups (Normal, EMCI, LMCI or AD). In other words, all subjects are mixed. The main characteristic of the disease is a progressive disconnection of the areas of the brain, which is represented through sparser graphs (links are missing, or their weights are weak). In contrast, a dense graph represents a healthy brain with strong links. Figure 4a,b shows the impact of using RRW on our dataset. If we do not use RRW (Figure 4a), the in-degree/out-degree distribution is symmetric (a balanced in-degree/out-degree distribution, along the diagonal), which makes it very difficult to differentiate between dense and sparse graphs. In contrast, using RRW (Figure 4b) we can see the expanding effect of the algorithm, where ROIs are more asymmetric. This helps us to differentiate better between normal and affected patients.
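A synthetic dataset of this kind could be generated along the following lines. The text does not specify the sampling scheme, so this sketch assumes that each subject receives a density drawn uniformly from [0, 1], that each directed edge is present independently with that probability, and that edge weights are uniform in [0, 1]. The function name `synthetic_subjects` is hypothetical, and only 20 subjects are generated to keep the example light.

```python
import numpy as np

def synthetic_subjects(n_subjects=500, n_rois=96, seed=0):
    """Random weighted directed graphs with per-subject edge density."""
    rng = np.random.default_rng(seed)
    graphs, densities = [], []
    for _ in range(n_subjects):
        density = rng.uniform(0.0, 1.0)                    # target edge density
        present = rng.random((n_rois, n_rois)) < density   # which edges exist
        weights = rng.random((n_rois, n_rois))             # weights in [0, 1)
        W = np.where(present, weights, 0.0)
        np.fill_diagonal(W, 0.0)                           # no self-loops
        graphs.append(W)
        densities.append(density)
    return graphs, np.array(densities)

graphs, densities = synthetic_subjects(n_subjects=20, n_rois=96, seed=0)
```

Sorting the subjects by `densities` then gives the ascending-density ordering used when comparing sparse and dense graphs.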
As a complementary experiment, we select a random subset of 100 subjects and plot (in ascending order of density) the sum of degree differences (in-degree and out-degree) over all ROIs of each subject. A high value of this sum means a more asymmetric distribution (affected brain). In Figure 4c,d, we can see how the asymmetries are higher with denser graphs (Figure 4c), but after applying RRW (Figure 4d) we increase the asymmetry of dense graphs (sparse graphs), and increase the possibility of detecting and classifying healthy and affected patients.
In the following subsection, we can see the difficulty of differentiating between healthy patients and the early stages of Alzheimer’s disease due to the disconnection of links of the graph structure (sparse graphs). For a correct study, we retain the highest c correlation coefficients. To choose c, we select a subset with the 30 sparsest subjects, and we compare the evolution of asymmetries (in order of decreasing sparsity) with different coefficient values ( and ) in Figure 5. We choose the threshold (which maximizes the asymmetries and makes subsequent tasks such as classification easier).
4.2. Real Data: Creating Meaningful Directed Graphs from fMRI Data
In this subsection, we test our hypothesis with real data. We use the fMRI data coming from the ADNI initiative [29].
We construct a directed graph with 96 nodes using the net4Lap method, where we deal with the sign of the time-lag between different time series for different ROIs and the magnitude of the correlation. Moreover, to model causal interaction, we use the time-lagged cross-correlation values for the average time series for all pairs of regions of interest in the directed graph. For that, we use a kNN graph, varying k from 5 to 50. Then, we apply a directed version of the Return Random Walk to measure the connectivity between two regions: a high value means both nodes have a good degree of connection and probably belong to the same cluster or share an important workflow (information). We use k = 15, since this value has proved adequate in similar graph problems using the Return Random Walk [28,30].
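The use of the sign of the time lag and the magnitude of the correlation can be illustrated with the following sketch. It is an assumed simplification rather than the authors' pipeline: for each pair of ROIs it scans lags up to a hypothetical `max_lag`, keeps the lag with the largest absolute cross-correlation, and orients the edge from the leading series to the lagging one. The function names and the toy signals are illustrative only.

```python
import numpy as np

def lagged_edge(x, y, max_lag=5):
    """Return (weight, lag) of the strongest lagged correlation.
    A positive lag means that x leads y."""
    x = (x - x.mean()) / x.std()
    y = (y - y.mean()) / y.std()
    n = len(x)
    best_w, best_lag = 0.0, 0
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            c = np.dot(x[:n - lag], y[lag:]) / (n - lag)   # x[t] vs y[t + lag]
        else:
            c = np.dot(x[-lag:], y[:n + lag]) / (n + lag)  # y[t] vs x[t - lag]
        if abs(c) > best_w:
            best_w, best_lag = abs(c), lag
    return best_w, best_lag

def lagged_directed_graph(bold, max_lag=5):
    """Weighted directed graph: edge from the leading ROI to the lagging one."""
    n = bold.shape[0]
    W = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            w, lag = lagged_edge(bold[i], bold[j], max_lag)
            if lag > 0:
                W[i, j] = w
            elif lag < 0:
                W[j, i] = w
            else:                      # no lead: keep the link in both directions
                W[i, j] = W[j, i] = w
    return W

# Toy signals (assumption): ROI 1 repeats ROI 0 two volumes later.
rng = np.random.default_rng(0)
x = rng.standard_normal(130)
bold = np.vstack([x, np.roll(x, 2), rng.standard_normal(130)])
W = lagged_directed_graph(bold, max_lag=5)
```

In the toy example the only strong edge runs from ROI 0 to ROI 1, since ROI 0 leads by two volumes; the reverse entry stays at zero.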
For the directed-RRW graphs, we focus on the adjacency matrix, where the diagonal elements are set to zero, and then apply a threshold to retain only the edges with the top correlation coefficients (as we have seen in the synthetic experiments), to obtain a binary directed adjacency matrix with the most relevant ROI information for each subject. In addition, any ROI with missing time series data is discarded. The resulting dataset has four categories which represent different levels of the disease. In this paper, we are interested in patients at an early stage of Alzheimer’s disease. Therefore, our experiments deal with 47 EMCI and 38 normal healthy control subjects.
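The thresholding step can be sketched as follows, assuming that "top" means keeping a fixed fraction of the largest off-diagonal weights. The function name `binarize_top_fraction`, the parameter `fraction`, and the value 0.4 used in the example are illustrative assumptions, not the authors' code.

```python
import numpy as np

def binarize_top_fraction(W, fraction=0.4):
    """Zero the diagonal and keep the largest `fraction` of the remaining
    entries as 1, setting everything else to 0."""
    W = np.array(W, dtype=float)           # work on a copy
    n = W.shape[0]
    np.fill_diagonal(W, 0.0)
    off = ~np.eye(n, dtype=bool)           # mask of off-diagonal entries
    n_keep = int(round(fraction * off.sum()))
    A = np.zeros((n, n), dtype=int)
    if n_keep == 0:
        return A
    rows, cols = np.nonzero(off)           # same row-major order as W[off]
    order = np.argsort(W[off])[::-1][:n_keep]
    A[rows[order], cols[order]] = 1
    return A

# Toy weighted directed graph (assumption).
rng = np.random.default_rng(0)
W = rng.random((5, 5))
A = binarize_top_fraction(W, fraction=0.4)
```

Because the input is not required to be symmetric, the resulting binary matrix stays directed, which is what the in-degree and out-degree measures rely on.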
4.3. Global Degree Asymmetries
In the first experiment, we analyze the degrees of all ROIs and compare the average differences between left and right hemispheres of the same regions of the brain. When nodes do not differ much in centrality, the rank order of centrality is hard to interpret because it becomes mostly random; our method partially solves this problem by expanding the asymmetries (the centrality is highly significant when we retain the top 40% of correlations). First, we calculate the degrees (in and out) of all graphs (or patients) for the normal and EMCI subjects. Then we sort the differences between in-degree and out-degree values in ascending order and rank the nodes by assigning a value from 96 to 1 to each existing ROI, obtaining significant differences in the rank order. Finally, we compare the difference between left and right ROIs of the same area of the brain. In Figure 6, we show the six ROIs or regions with the highest difference between the averages for the normal and EMCI subjects. These results are obtained using the directed Return Random Walk of a kNN graph, and are compared with the original directed graphs obtained after just applying net4Lap’s neural embedding but without computing RRWs. We can see relevant asymmetries in the RRW in comparison with the original graphs. The information for these regions is given in Table 2. In the table, the Brodmann Area refers to the Brodmann code associated with each of our ROIs.
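The ranking step described above can be sketched as follows. The reading adopted here is an assumption: the ROI with the smallest in-degree minus out-degree difference receives the highest value (n, i.e., 96 for the real data) and the ROI with the largest difference receives 1. The function names and the four-node example are hypothetical.

```python
import numpy as np

def rank_rois(in_deg, out_deg):
    """Rank ROIs by (in-degree - out-degree), sorted in ascending order.
    The first ROI in that order gets rank n and the last one gets rank 1."""
    diff = np.asarray(in_deg) - np.asarray(out_deg)
    order = np.argsort(diff, kind="stable")        # ascending differences
    ranks = np.empty(len(diff), dtype=int)
    ranks[order] = np.arange(len(diff), 0, -1)     # n, n - 1, ..., 1
    return ranks

def left_right_rank_gap(ranks, left, right):
    """Absolute rank difference between the two halves of one brain area."""
    return abs(int(ranks[left]) - int(ranks[right]))

# Toy degrees (assumption) for four ROIs.
in_deg = [3, 1, 2, 0]
out_deg = [0, 1, 3, 2]
ranks = rank_rois(in_deg, out_deg)
gap = left_right_rank_gap(ranks, left=0, right=3)
```

Averaging such gaps over the subjects of each group, per brain area, gives the kind of left-right comparison summarized in Table 2.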
Table 2.
Area of Brain | #ROI (Left) | #ROI (Right) | #Brodmann Area | p-Value (Original) | p-Value (RRW) |
---|---|---|---|---|---|
Temporal Fusiform Cortex, anterior division | 37 | 85 | 36 | 0.069 | <0.001 |
Parahippocampal Gyrus, anterior division | 34 | 82 | 36 | 0.089 | 0.009 |
Occipital Fusiform Gyrus | 40 | 88 | 19 | 0.071 | <0.001 |
Temporal Occipital Fusiform Cortex | 39 | 87 | 37 | 0.077 | 0.008 |
Heschl’s Gyrus (includes H1 and H2) | 45 | 93 | 48 | 0.058 | 0.007 |
Parahippocampal Gyrus, posterior division | 35 | 83 | 30 | 0.121 | <0.001 |
4.4. In-Degree/Out-Degree ROI Analysis
In the second experiment, we explore the in-degree/out-degree scatter for our dataset. With state-of-the-art methods, it is possible to obtain improved classification accuracy when comparing (in directed or undirected approaches) Normal versus AD patients (see Figure 7b) with respect to the original data (see Figure 7a). Our method finds a better expansion of the asymmetries in the in-out degree distributions, as we show in Figure 7c.
The main problems with the state-of-the-art methods are that they focus on two different approaches: (i) the comparison between early (EMCI) and late (LMCI) patients, which overlooks the interesting comparison between normal subjects and patients at an early stage of the disease [5,31], and (ii) the comparison between normal subjects and those with mild cognitive impairment (MCI, a joint set of EMCI and LMCI), which loses the emphasis on early detection [32,33,34,35,36]. The comparison between EMCI and normal subjects is difficult due to the almost complete overlap of the degree distributions. We compare normal vs EMCI (Figure 8a) and normal vs Alzheimer’s disease (Figure 8b) after applying our method. This figure (and the previous one) shows the relationship between in-degree and out-degree links. In AD patients, there are areas of the brain where one of these degrees is larger than the other. These asymmetries are reflected through an expanded distribution (in red). In the previous figure, we can compare the difference between healthy and AD patients for the original data, our method, and the state of the art (our method achieves a more expanded distribution). However, Figure 8 shows the main problem: differentiating between Normal subjects and patients at an early stage of the disease. Our method increases the scatter of the EMCI subjects about the diagonal in comparison with the state-of-the-art methods, but it is still very difficult to compare the distributions. Our hypothesis is that there exist more asymmetries in specific anatomical regions of the brain. For this reason, we perform a deeper analysis of the ROIs implicated in the early phase of Alzheimer’s disease in the following experiments.
4.5. ROI Asymmetries Detection
In the third experiment, we analyze the in-degree/out-degree distribution for regions of the brain with a high level of asymmetry (Table 2). A symmetric left-right hemisphere distribution would be one where all points lie close to the diagonal or at the origin (0,0). Consequently, the farther a point lies from this balance, the greater its asymmetry. First, we show the importance of using the RRW to amplify the evidence for the existing asymmetries in the graphs. For instance, in a specific area (Parahippocampal Gyrus, posterior division), the distribution of the in-out-degrees of the left and right hemispheres is sparser with the RRW (Figure 9a) than without it (Figure 9b). Next, we compare the in-degree/out-degree distributions of the six anatomical brain regions with the largest difference between the left and right hemispheres obtained in the previous experiments. Figure 10 shows the results for the original graph (without the directed RRW). In all cases, the in-out-degree differences are close to zero for both hemispheres. In Figure 11, however, the distribution is sparser, and patients with early symptoms show more asymmetries (points far from the diagonal). In particular, two regions, namely the (a) anterior and (b) posterior divisions of the Parahippocampal Gyrus, show this effect strongly.
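The notion of distance from the point of balance can be made concrete with a small sketch. It assumes, for illustration only, that each region is described by the in-degree minus out-degree imbalance of its left and right nodes, and that the asymmetry score is the distance of that pair from the diagonal. The score, the function names, and the region data are hypothetical and may differ from the exact measure used in the experiments.

```python
from math import sqrt


def roi_asymmetry(left, right):
    """Distance of the point (left imbalance, right imbalance) from the diagonal.

    left and right are (in_degree, out_degree) pairs for the same anatomical
    region in each hemisphere. Assumed score, not the paper's exact definition.
    """
    x = left[0] - left[1]    # in - out, left hemisphere
    y = right[0] - right[1]  # in - out, right hemisphere
    return abs(x - y) / sqrt(2)


def most_asymmetric(rois, k):
    """Return the names of the k regions with the largest asymmetry score."""
    ranked = sorted(rois, key=lambda name: roi_asymmetry(*rois[name]), reverse=True)
    return ranked[:k]


# Hypothetical degrees: name -> ((in_L, out_L), (in_R, out_R)).
rois = {
    "region_a": ((5, 5), (5, 5)),  # balanced in both hemispheres
    "region_b": ((8, 2), (3, 7)),  # opposite imbalance in each hemisphere
    "region_c": ((6, 4), (5, 5)),  # mild imbalance on the left only
}
top = most_asymmetric(rois, k=2)
print(top)
```

A region whose two hemispheres are equally imbalanced scores zero under this definition, matching the description of points on the diagonal or at the origin as symmetric.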
These results are very consistent with previous studies [17,18,19]. Moreover, the regions studied are included in the Entorhinal cortex. This region is not considered to be the first area of the brain to be affected in the development of Alzheimer’s disease [37,38], but the disease leads to asymmetries in the volume of the Entorhinal cortex.
Finally, Figure 12 shows the effect of using the RRW on the balance between hemispheres. In Figure 12a, almost all patients are concentrated at the point of balance (green bar). In contrast, when we apply the RRW (Figure 12b), the distribution is sparser (expanded asymmetries).
4.6. Alzheimer’s Classification
To conclude, we perform a linear discriminant analysis (LDA) on the two classes of our dataset (normal and EMCI subjects), as in [16], for a better comparison with the state of the art. In this work, we represent each graph with a multidimensional feature vector that includes the in-degree/out-degree of the left and right ROIs of the six most asymmetric areas as a measure of the level of asymmetry (the representation of the difference between them). We analyze the classification accuracy obtained using LDA, both with and without the directed RRW. In this classification experiment, we use both normal and EMCI subjects for training. The performance of the LDA classifier is computed using 10-fold cross-validation (10-CV). That is, we randomly divide our 85 subjects (38 normal and 47 EMCI) into 10 disjoint subsets of approximately the same size. In each trial, we hold out one subset, train LDA on the remaining nine, and use the held-out subset to measure the classification accuracy of the trained classifier. This procedure is repeated 10 times, holding out a different subset in each trial, and the classification results are averaged. We obtain an accuracy of 74.51% with the directed RRW versus 66.93% without it (Table 3). Other studies that use directed graphs (such as [16]) obtain good results (90% accuracy), but they compare AD and Normal patients.
Table 3.
LDA | Accuracy | Sensitivity | Specificity | Precision |
---|---|---|---|---|
Non-filtered | 66.93% (±0.54) | 49.36% | 80.97% | 68.12% |
With RRW | 74.51% (±0.81) | 66.82% | 80.42% | 74.11% |
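The evaluation protocol described in this section can be sketched with scikit-learn as follows. This is a minimal illustration under stated assumptions: the feature matrix is synthetic (random Gaussian data standing in for the degree features, not ADNI data), the choice of 24 features (6 regions, 2 hemispheres, in-degree and out-degree) is one possible reading of the feature vector, and sensitivity, specificity and precision are computed from the pooled out-of-fold predictions with EMCI treated as the positive class. The numbers it prints are therefore unrelated to those in Table 3.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import KFold, cross_val_predict, cross_val_score

rng = np.random.default_rng(0)

# Synthetic stand-in for the feature vectors (NOT ADNI data):
# 6 regions x 2 hemispheres x (in-degree, out-degree) = 24 features (assumed).
n_normal, n_emci, n_features = 38, 47, 24
X = np.vstack([
    rng.normal(0.0, 1.0, size=(n_normal, n_features)),
    rng.normal(0.5, 1.0, size=(n_emci, n_features)),
])
y = np.array([0] * n_normal + [1] * n_emci)  # 0 = normal, 1 = EMCI

# Random split into 10 disjoint folds of approximately equal size.
cv = KFold(n_splits=10, shuffle=True, random_state=0)
lda = LinearDiscriminantAnalysis()

# Accuracy per fold, then averaged over the 10 trials.
fold_acc = cross_val_score(lda, X, y, cv=cv, scoring="accuracy")

# Out-of-fold prediction for every subject, used for the remaining metrics.
y_pred = cross_val_predict(lda, X, y, cv=cv)
tn, fp, fn, tp = confusion_matrix(y, y_pred, labels=[0, 1]).ravel()

sensitivity = tp / (tp + fn)  # EMCI subjects correctly detected
specificity = tn / (tn + fp)  # normal subjects correctly identified
precision = tp / (tp + fp)    # predicted EMCI that are truly EMCI

print(f"accuracy    {fold_acc.mean():.4f} (+/- {fold_acc.std():.4f})")
print(f"sensitivity {sensitivity:.4f}")
print(f"specificity {specificity:.4f}")
print(f"precision   {precision:.4f}")
```

Running the same script on feature vectors extracted with and without the directed RRW would give the two rows of a table like Table 3. A stratified split (`StratifiedKFold`) is a common alternative when class proportions should be preserved in every fold.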
5. Discussion
There is an extensive literature on the study of Alzheimer’s disease from a computer science perspective. The main goal is to obtain good discrimination between different subjects, and thus a good classification accuracy between healthy subjects and subjects at different stages of Alzheimer’s pathology. Our study makes two distinct contributions:
We characterize the BOLD signal data using directed graphs. We consider this to be a more natural representation for brain structure. Moreover, graph theory provides meaningful and easily computable measurements to detect connectivity abnormalities, in contrast with other state-of-the-art methods [4,5,6,7,8,9,10].
The main difference between our methodology and the state of the art is the comparison performed. Our goal is to compare normal subjects with those suffering from early mild cognitive impairment (Normal vs. EMCI), because the objective of this work is the early detection of the disease and the identification of the regions of the brain involved in this phase. The most common strategy in the literature is to merge the early and late phases (EMCI and LMCI) into a single group (MCI) [39,40,41,42,43]. Our comparison therefore represents a novel direction with respect to the state of the art.
We have proposed a method for refining the directed graphs extracted from fMRI brain data using the net4Lap embedding. This yields more locally isotropic and harmonic graphs. These are then subjected to further analysis using a directed version of the RRW, which amplifies the asymmetries between the left and right hemispheres of some affected regions of the brain. We have evaluated the importance of our method in Section 4.5, and shown that the classification accuracy improves by approximately 8 percentage points, from 66.93% to 74.51% (see Section 4.6).
Finally, we have highlighted two specific regions (the anterior and posterior divisions of the Parahippocampal Gyrus) as strong indicators of early AD, which is consistent with some clinical studies [17,18,19,20]. In [16], these regions are included in the list of the ten anatomical regions with the largest entropy differences between normal and AD subjects. Related work [31,32,33,34,35,36,44] has highlighted the asymmetries between the right and left hemispheres of the hippocampus as among the most relevant (besides the hippocampal subfields and entorhinal cortices), but these studies use a subset of MCI patients.
Limitations: Our current study is limited by two factors. First, besides MRI and PET, there are other data modalities, such as APOE. However, we decided to discard this modality because not every subject has information for all modalities, and the number of patients with all modalities available is too small for reasonable classification. The second limitation is the lack of methods that separate the MCI groups (EMCI and LMCI) with directed graphs in their experiments (see [14,40,45]). Other limitations include the cross-sectional nature of this database and the absence of longitudinal RS-fMRI data.
6. Conclusions
In conclusion, in this paper we propose a novel approach to the analysis of fMRI regional brain interaction networks with the aim of detecting Alzheimer’s disease at an early stage. In contrast with the state of the art, which focuses on the classification of full Alzheimer’s versus healthy normal patients, or aggregates the LMCI and EMCI subsets into a single MCI group, we focus on distinguishing normal and EMCI patients. We follow a strategy based on directed graphs because this representation is more natural for brain structure. Our method is based on a pre-processing step where the net4Lap embedding is applied to the input directed graph to obtain refined graphs, which are more locally isotropic and harmonic than the input ones. However, we still face an important obstacle due to the overlap between the distributions of normal and EMCI subjects. To reduce this problem, we use a directed version of the RRW to filter inter-class noise and to highlight the regions with significant asymmetries. In other words, this method helps us to identify the brain regions that are implicated in the early stages of Alzheimer’s by amplifying the existing asymmetries in the graph.
In our experiments, we have detected asymmetries in the degree distributions between the left and right regions of the brain associated with the same anatomical area. We have isolated six potential brain regions, and highlighted two specific regions with an important asymmetry (Parahippocampal Gyrus, anterior and posterior divisions), which are indicators of the early development of Alzheimer’s and are consistent with clinical studies. Finally, we classify the data, distinguishing between EMCI and normal subjects. Our approach increases the classification accuracy from 66.93% to 74.51% (approximately 8 percentage points). This improvement demonstrates the usefulness of the RRW in activation structure analysis problems. In future work, we will study how these asymmetries evolve to late mild cognitive impairment and develop tests for possible changes in different anatomical regions associated with the development of Alzheimer’s disease.
Acknowledgments
Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012).
Author Contributions
Conceptualization, M.C., F.E., M.A.L. and E.R.H.; methodology, M.C., F.E., M.A.L. and E.R.H.; software, M.C. and F.E.; validation, M.C.; formal analysis, M.C., F.E. and E.R.H.; investigation, M.C., F.E., M.A.L. and E.R.H.; writing—original draft preparation, M.C.; writing—review and editing, M.C.; english-review, M.C. and E.R.H.; visualization, M.C.; foundational work on entropy in directed graphs and random walk algorithms, E.R.H. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1. van den Heuvel M.P., Hulshoff Pol H.E. Exploring the brain network: A review on resting-state fMRI functional connectivity. Eur. Neuropsychopharmacol. 2010;20:519–534. doi: 10.1016/j.euroneuro.2010.03.008.
- 2. Anwar A.R., Hashmy M.Y., Imran B., Riaz M.H., Mehdi S.M.M., Muthalib M., Perrey S., Deuschl G., Groppa S., Muthuraman M. Complex network analysis of resting-state fMRI of the brain; Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); Orlando, FL, USA. 16–20 August 2016; pp. 3598–3601.
- 3. Rubinov M., Sporns O. Complex network measures of brain connectivity: Uses and interpretations. NeuroImage. 2010;52:1059–1069. doi: 10.1016/j.neuroimage.2009.10.003.
- 4. Rombouts S.A.R.B., Barkhof F., Goekoop R., Stam C.J., Scheltens P. Altered resting state networks in mild cognitive impairment and mild Alzheimer’s disease: An fMRI study. Hum. Brain Mapp. 2005;26:231–239. doi: 10.1002/hbm.20160.
- 5. Ye C., Wilson R.C., Comin C.H., Costa L.d.F., Hancock E.R. Approximate von Neumann entropy for directed graphs. Phys. Rev. E. 2014;89:052804. doi: 10.1103/PhysRevE.89.052804.
- 6. Yadav R., Gautam A., Mishra R.B. Classification of alzheimer using fmri data and brain network. Comput. Sci. Inf. Technol. 2018;2018:109–119.
- 7. Ford J., Farid H., Makedon F., Flashman L.A., McAllister T.W., Megalooikonomou V., Saykin A.J. Patient classification of fMRI activation maps; Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention; Montreal, QC, Canada. 15–18 November 2003; pp. 58–65.
- 8. Challis E., Hurley P., Serra L., Bozzali M., Oliver S., Cercignani M. Gaussian process classification of Alzheimer’s disease and mild cognitive impairment from resting-state fMRI. NeuroImage. 2015;112:232–243. doi: 10.1016/j.neuroimage.2015.02.037.
- 9. Khazaee A., Ebrahimzadeh A., Babajani-Feremi A. Identifying patients with Alzheimer’s disease using resting-state fMRI and graph theory. Clin. Neurophysiol. 2015;126:2132–2141. doi: 10.1016/j.clinph.2015.02.060.
- 10. Wang K., Jiang T., Liang M., Wang L., Tian L., Zhang X., Li K., Liu Z. Discriminative analysis of early Alzheimer’s disease based on two intrinsically anti-correlated networks with resting-state fMRI; Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention; Copenhagen, Denmark. 1–6 October 2006; pp. 340–347.
- 11. Sarraf S., Tofighi G. DeepAD: Alzheimer’s disease classification via deep convolutional neural networks using MRI and fMRI. BioRxiv. 2016:070441. doi: 10.1101/070441v1.abstract.
- 12. Dai Z., Yan C., Wang Z., Wang J., Xia M., Li K., He Y. Discriminative analysis of early Alzheimer’s disease using multi-modal imaging and multi-level characterization with multi-classifier (M3). Neuroimage. 2012;59:2187–2195. doi: 10.1016/j.neuroimage.2011.10.003.
- 13. Sarraf S., Tofighi G. Deep learning-based pipeline to recognize Alzheimer’s disease using fMRI data; Proceedings of the 2016 Future Technologies Conference (FTC); San Francisco, CA, USA. 6–7 December 2016; pp. 816–820.
- 14. Wen J., Thibeau-Sutre E., Samper-Gonzalez J., Routier A., Bottani S., Durrleman S., Burgos N., Colliot O. Convolutional Neural Networks for Classification of Alzheimer’s Disease: Overview and Reproducible Evaluation. arXiv. 2019;1904.07773. doi: 10.1016/j.media.2020.101694.
- 15. Khazaee A., Ebrahimzadeh A., Babajani-Feremi A. Classification of patients with MCI and AD from healthy controls using directed graph measures of resting-state fMRI. Behav. Brain Res. 2017;322:339–350. doi: 10.1016/j.bbr.2016.06.043.
- 16. Wang J., Wilson R.C., Hancock E.R. Detecting Alzheimer’s disease Using Directed Graphs. In: Foggia P., Liu C.L., Vento M., editors. Graph-Based Representations in Pattern Recognition. Springer International Publishing; Cham, Switzerland: 2017. pp. 94–104.
- 17. Fennema-Notestine C., Hagler D.J., McEvoy L.K., Fleisher A.S., Wu E.H., Karow D.S., Dale A.M. Structural MRI Biomarkers for Preclinical and Mild Alzheimer’s disease. Hum. Brain Mapp. 2009;30:3238–3253. doi: 10.1002/hbm.20744.
- 18. Khan U.A., Liu L., Provenzano F.A., Berman D.E., Profaci C.P., Sloan R., Mayeux R., Duff K.E., Small S.A. Molecular drivers and cortical spread of lateral entorhinal cortex dysfunction in preclinical Alzheimer’s disease. Nat. Neurosci. 2013;17:304–311. doi: 10.1038/nn.3606.
- 19. López M.E., Bruña R., Aurtenetxe S., Pineda-Pardo J.Á., Marcos A., Arrazola J., Reinoso A.I., Montejo P., Bajo R., Maestú F. Alpha-Band Hypersynchronization in Progressive Mild Cognitive Impairment: A Magnetoencephalography Study. J. Neurosci. 2014;34:14551–14559. doi: 10.1523/JNEUROSCI.0964-14.2014.
- 20. Lawal M., Olotu F.A., Soliman M.E. Across the blood-brain barrier: Neurotherapeutic screening and characterization of naringenin as a novel CRMP-2 inhibitor in the treatment of Alzheimer’s disease using bioinformatics and computational tools. Comput. Biol. Med. 2018;98:168–177. doi: 10.1016/j.compbiomed.2018.05.012.
- 21. Escolano F., Curado M., Lozano M.A., Hancock E.R. Dirichlet Graph Densifiers. In: Robles-Kelly A., Loog M., Biggio B., Escolano F., Wilson R., editors. Structural, Syntactic, and Statistical Pattern Recognition. Springer International Publishing; Cham, Switzerland: 2016. pp. 185–195.
- 22. Curado M., Escolano F., Lozano M.A., Hancock E.R. Dirichlet densifiers for improved commute times estimation. Pattern Recognit. 2019;91:56–68. doi: 10.1016/j.patcog.2019.02.012.
- 23. Curado M., Escolano F., Lozano M., Hancock E.R. Net4lap: Neural Laplacian Regularization for Ranking and Re-Ranking; Proceedings of the 24th International Conference on Pattern Recognition; Beijing, China. 20–24 August 2018.
- 24. Wang J., Wilson R.C., Hancock E.R. fMRI Activation Network Analysis Using Bose-Einstein Entropy. In: Robles-Kelly A., Loog M., Biggio B., Escolano F., Wilson R., editors. Structural, Syntactic, and Statistical Pattern Recognition. Springer International Publishing; Cham, Switzerland: 2016. pp. 218–228.
- 25. Katz L. A new status index derived from sociometric analysis. Psychometrika. 1953;18:39–43. doi: 10.1007/BF02289026.
- 26. Burges C., Shaked T., Renshaw E., Lazier A., Deeds M., Hamilton N., Hullender G. Learning to rank using gradient descent; Proceedings of the 22nd International Conference on Machine Learning; Bonn, Germany. 7–11 August 2005; pp. 89–96.
- 27. Eppstein D., Paterson M.S., Yao F.F. On nearest-neighbor graphs. Discret. Comput. Geom. 1997;17:263–282. doi: 10.1007/PL00009293.
- 28. Curado M. Return random walks for link prediction. Inf. Sci. 2020;510:99–107. doi: 10.1016/j.ins.2019.09.017.
- 29. Alzheimer Disease Neuroimaging Initiative (ADNI). Available online: http://adni.loni.usc.edu/ (accessed on 30 July 2017).
- 30. Curado M., Escolano F., Lozano M.A., Hancock E.R. Semi-Supervised Graph Rewiring with the Dirichlet Principle; Proceedings of the 24th International Conference on Pattern Recognition; Beijing, China. 20–24 August 2018.
- 31. Talamonti D., Koscik R., Johnson S., Bruno D. Predicting Early Mild Cognitive Impairment With Free Recall: The Primacy of Primacy. Arch. Clin. Neuropsychol. 2019;35:133–142. doi: 10.1093/arclin/acz013.
- 32. DeVivo R., Zajac L., Mian A., Cervantes-Arslanian A., Steinberg E., Alosco M.L. Differentiating between healthy control participants and those with mild cognitive impairment using volumetric MRI data. J. Int. Neuropsychol. Soc. 2018;25:80–810. doi: 10.1017/S135561771900047X.
- 33. Chen J., Chen G., Shu H., Chen G., Ward B.D., Wang Z., Liu D., Antuono P.G., Li S.J., Zhang Z., et al. Predicting progression from mild cognitive impairment to Alzheimer’s disease on an individual subject basis by applying the CARE index across different independent cohorts. Aging. 2019;11:2185. doi: 10.18632/aging.101883.
- 34. Mitolo M., Stanzani-Maserati M., Capellari S., Testa C., Rucci P., Poda R., Oppi F., Gallassi R., Sambati L., Rizzo G., et al. Predicting conversion from mild cognitive impairment to Alzheimer’s disease using brain 1H-MRS and volumetric changes: A two-year retrospective follow-up study. Neuroimage Clin. 2019;23:101843. doi: 10.1016/j.nicl.2019.101843.
- 35. Wu M., Zeng Y., Zhang Z., Hong H., Xu Z., Cui L., Bai L., Hancock E.R. Directed Network Analysis Using Transfer Entropy Component Analysis; Proceedings of the Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR); Beijing, China. 17–19 August 2018; pp. 491–500.
- 36. Amezquita-Sanchez J., Mammone N., Morabito F., Marino S., Adeli H. A Novel Methodology for Automated Differential Diagnosis of Mild Cognitive Impairment and the Alzheimer’s disease using EEG signals. J. Neurosci. Methods. 2019;322. doi: 10.1016/j.jneumeth.2019.04.013.
- 37. Mega M.S. The entorhinal cortex in Alzheimer’s disease. J. Neurol. Neurosurg. Psychiatry. 2001;71:431–432. doi: 10.1136/jnnp.71.4.431a.
- 38. Velayudhan L., Proitsi P., Westman E., Muehlboeck J., Mecocci P., Vellas B., Tsolaki M., Kłoszewska I., Soininen H., Spenger C., et al. Entorhinal cortex thickness predicts cognitive decline in Alzheimer’s disease. J. Alzheimer’s Dis. 2013;33:755–766. doi: 10.3233/JAD-2012-121408.
- 39. Dachena C., Casu S., Fanti A., Lodi M.B., Mazzarella G. Combined Use of MRI, fMRI and Cognitive Data for Alzheimer’s Disease: Preliminary Results. Appl. Sci. 2019;9:3156. doi: 10.3390/app9153156.
- 40. Maturana-Candelas A., Gómez C., Poza J., Pinto N., Hornero R. EEG characterization of the Alzheimer’s disease continuum by means of multiscale entropies. Entropy. 2019;21:544. doi: 10.3390/e21060544.
- 41. Chai X., Weng X., Zhang Z., Lu Y., Liu G., Niu H. World Congress on Medical Physics and Biomedical Engineering 2018. Springer; Berlin/Heidelberg, Germany: 2019. Quantitative EEG in mild cognitive impairment and Alzheimer’s disease by AR-spectral and multi-scale entropy analysis; pp. 159–163.
- 42. Sharma N., Kolekar M., Jha K., Kumar Y. EEG and Cognitive Biomarkers Based Mild Cognitive Impairment Diagnosis. IRBM. 2019;40:113–121. doi: 10.1016/j.irbm.2018.11.007.
- 43. Gutman B.A., Hua X., Rajagopalan P., Chou Y.Y., Wang Y., Yanovsky I., Toga A.W., Jack C.R., Jr., Weiner M.W., Thompson P.M., et al. Maximizing power to track Alzheimer’s disease and MCI progression by LDA-based weighting of longitudinal ventricular surface features. Neuroimage. 2013;70:386–401. doi: 10.1016/j.neuroimage.2012.12.052.
- 44. Beheshti I., Demirel H., Matsuda H. Classification of Alzheimer’s disease and prediction of mild cognitive impairment-to-Alzheimer’s conversion from structural magnetic resource imaging using feature ranking and a genetic algorithm. Comput. Biol. Med. 2017;83:109–119. doi: 10.1016/j.compbiomed.2017.02.011.
- 45. Sun J., Wang B., Niu Y., Tan Y., Fan C., Zhang N., Xue J., Wei J., Xiang J. Complexity Analysis of EEG, MEG, and fMRI in Mild Cognitive Impairment and Alzheimer’s Disease: A Review. Entropy. 2020;22:239. doi: 10.3390/e22020239.