Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2020 Jul 24;15(7):e0235039. doi: 10.1371/journal.pone.0235039

Estimating sparse functional brain networks with spatial constraints for MCI identification

Yanfang Xue 1,#, Limei Zhang 1,#, Lishan Qiao 1,*,#, Dinggang Shen 2,3,¤a
Editor: Irene Sendiña-Nadal4
PMCID: PMC7381102  PMID: 32707574

Abstract

Functional brain network (FBN), estimated with functional magnetic resonance imaging (fMRI), has become a potentially useful way of diagnosing neurological disorders in their early stages by comparing the connectivity patterns between different brain regions across subjects. However, this depends, to a great extent, on the quality of the estimated FBNs, indicating that FBN estimation is a key step for the subsequent task of disorder identification. In the past decades, researchers have developed many methods to estimate FBNs, including Pearson’s correlation and (regularized) partial correlation, etc. Despite their widespread applications in current studies, most of the existing methods estimate FBNs only based on the dependency between the measured blood oxygen level dependent (BOLD) signals, which ignores spatial relationship of signals associated with different brain regions. Due to the space and material parsimony principle of our brain, we believe that the spatial distance between brain regions has an important influence on FBN topology. Therefore, in this paper, we assume that spatially neighboring brain regions tend to have stronger connections and/or share similar connections with others; based on this assumption, we propose two novel methods to estimate FBNs by incorporating the information of brain region distance into the estimation model. To validate the effectiveness of the proposed methods, we use the estimated FBNs to identify subjects with mild cognitive impairment (MCI) from normal controls (NCs). Experimental results show that the proposed methods are better than the baseline methods in the sense of MCI identification accuracy.

Introduction

Alzheimer’s disease (AD) is an age-related, progressive neurodegenerative disease with the main characteristics of memory loss and cognitive decline. Currently, more than 35 million people suffer from AD all over the world [1], not only causing agony of losing memory for patients, but also bringing heavy financial burdens to the patient family and society. Unfortunately, researchers have not yet found an effective way of treating AD completely. Intervention is currently believed to play an important role in preventing or delaying AD at the stage of mild cognitive impairment (MCI).

To predict MCI (or AD as early as possible), researchers have explored many approaches from multiple aspects, including biochemistry [2], genetic [3], and brain imaging [4]. Especially, in recent years, functional magnetic resonance imaging (fMRI), which achieves blood oxygen level dependent (BOLD) signals, provides a noninvasive way of identifying subjects with MCI from normal controls (NCs). In practice, however, fMRI signals are arbitrarily scaled and have no unit [5], which causes the difficulty of comparing fMRI time series directly between different subjects. In contrast, fMRI-based functional brain network (FBN) has become a potentially effective tool to find informative patterns in fMRI data, and has been used to investigate MCI identification. In fact, with the help of FBN analysis, researchers recently have made a considerable progress in probing the mechanism of neurological diseases [6].

Normally, FBN is estimated by calculating the statistical dependency between the measured BOLD signals associated with different brain regions of interest (ROIs). The most classic method is Pearson’s correlation (PC), which is unfortunately sensitive to both direct and indirect relationship between ROIs and thus often results in false functional connections [5]. In contrast, partial correlation [7], as an alternative to PC, is only sensitive to direct relationship by regressing out the confounding effect from other ROIs. In fact, regardless of using which correlation-based method (without regularizers or postprocessing), the estimated FBN tends to be dense, when simply interpreting pairwise correlations between ROIs as weights of brain network connections. Therefore, a threshold or L1-regularizer is generally employed in correlation-based models for estimating sparse FBNs. The representative methods include regularized PC [8], sparse inverse covariance estimation [9], and sparse representation (SR) [10]. In addition, researchers have also developed many complex higher-order FBN estimation methods, as summarized in recent review papers [11, 12] for most of these methods.

Despite their continuous emergence, the current methods, to our best knowledge, only employ the data in fMRI signals to estimate FBNs, which ignores spatial relationship between these signals. In fact, due to the parsimony principle of our brain for saving the space and material, the strength of connection between two brain ROIs will decrease as their distance increases [13]. Here, we simply consider the friendship network as an analogy to FBN for explaining the meaning of spatial “constraints”. In particular, each person in the friendship network is a node (in analogy with ROI in FBN), while the social relations between two persons are defined as the edge (in analogy with the functional connection in FBN). If two persons are close in space, for example, studying in the same class or working in the same group, they 1) are likely to have a closer relationship, and/or 2) tend to share a more similar circle of friends. Therefore, we have reasons to assume that those spatially close ROIs may have a stronger functional connection, and/or share a more similar connection topology with other ROIs. Based on such an assumption, in this paper, we incorporate spatial distance information into the traditional model in two different ways, and in turn develop two novel FBN estimation methods. In order to verify the effectiveness of the proposed methods, we apply the estimated FBNs to identify subjects with MCI from NCs. The results show that our methods can achieve higher classification accuracy than the baseline methods.

The rest of this paper is organized as follows. In Section of Related Works, we review two representative FBN estimation methods. In Section of Materials and Methods, we introduce the source of the data and our proposed methods including our motivation, models and algorithms. In Section of Experiments and Results, we report the experimental setting and results. In Section of Discussion, we discuss our findings, and the strength and limitation of the proposed method. In the last section, we conclude the whole paper.

Related work

As described earlier, researchers have developed many methods to estimate FBNs, including PC [14], partial correlation [7], regularized partial correlation [10], and some complex higher-order variants [12]. In this paper, we only focus on correlation-based methods, since they are empirically verified to be more sensitive than complex higher-order methods according to a recent comparative study [11].

Pearson’s correlation

PC is the simplest correlation-based method to estimate FBNs. Suppose that the brain has been parcellated into P ROIs according to a certain atlas. Then, the functional connection wij between the ith and jth ROIs can be defined via PC can be computed as follows:

wij=(xixi)T(xjxj)(xixi)T(xixi)(xjxj)T(xjxj) (1)

wherewhere xiRN is the xi is the mean vector of the signal xi. Without loss of generality, we redefine xi=xixi(xixi)T(xixi). Then, the PC-based functional connectivity can be simplified as wij=xiTxj that exactly corresponds to the optimal solution of the following problem:

minwiji,j=1Pxiwijxj22 (2)

where P is the number of ROIs.

Sparse representation

As an alternative to PC, partial correlation is another commonly used FBN estimation method. Different from PC, partial correlation is only sensitive to indirect dependency between ROIs by regressing out the confounding effect from other ROIs. However, partial correlation needs to calculate the inverse of the sample covariance matrix explicitly or implicitly, causing an ill-posed problem. In order to obtain a stable estimation of FBN, an L1-regularizer is generally introduced into the partial correlation model, which results in many popular FBN estimation methods such as SR [15] given as follows:

minwij(12i=1Pxijiwijxj22+λji|wij|) (3)

Equivalently, Eq (3) can be simplified into the following matrix form:

minW(12XXWF2+λW1) (4)

where X = [x1, x2, ⋯, xP] ∈ RN×P is the fMRI data matrix, W = (wij) ∈ RP×P is the adjacency matrix of the estimated FBN, λ is a regularized parameter for controlling the balance of two terms in the objective function, ∥·∥F and ∥·∥1 denote the F-norm and L1-norm of a matrix, respectively.

Materials and methods

Data acquisition and preprocessing

In this paper, we use the dataset that were collected in Geneva University Hospital. It shares the same data source as previous works [1618]. The raw dataset includes 60 subjects with MCI and 350 NCs. However, in order to avoid the class imbalance problem, the same number of NCs as the subjects with MCI was randomly selected from the dataset. After data preprocessing, 45 MCIs and 46NCs were left in our experiment, and the demographic information of these subjects is shown in Table 1. All the preprocessed data can be downloaded from https://github.com/X-Yanfang/xyf-DATA. The original fMRI signals of the subjects were obtained by 3.0T Philips MR scanner, and the imaging parameters are set as follows: the obtained matrix size is 74 × 74 with 45 slices, voxel size is 2.97 × 2.97 × 3 mm3, TE is 30ms, and TR is 3000ms with 180 repetition. The 3-dimensional coordinates of voxels are extracted from the automated anatomical labeling (AAL) template [19]. All procedures performed in this study were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards, and this study is supported by the ethics committee of Geneva University Hospital [16].

Table 1.

MCI NC
Number of subjects (male/female) 25/20 14/32
Age (mean ± SD) 74.13 ± 6.68 73.5 ± 3.50
MMSE (mean ± SD) 27.71 ± 1.73 28.10 ± 1.35

Noises caused by, for example, the scanner and head motion, have large influence on the FBN estimation and analysis. In order to reduce the influence, a preprocessing pipeline is used in this paper to improve fMRI data quality prior to FBN estimation. In particular, for each subject the first ten volumes in the fMRI time course are removed for signal stabilization. Then, the remaining volumes were processed via Statistical Parametric Mapping (http://www.fil.ion.ucl.ac.uk/spm/) and DPABI [20] according to a popularly-used scheme. The main steps include 1) correcting slice timing and head motion; 2) regressing out nuisance signals with Friston 24-parameters of head motion; 3) registering the corrected images to Montreal Neurological Institute standard space; 4) spatially smoothing the signals with the full-width-half-maximum of 4mm, and filtering the signals using a band pass frequencies between 0.01 and 0.1. After that, we parcel the brain according to AAL atlas, and extract the mean signal from each ROI. Note that we use AAL atlas mainly due to its popularity. In the discussion section, we will give more details of the atlas selection and its possible effect on the results. As a result, we get a data matrix XRN×P (with N = 91 as the number of ROIs, and P = 80 as the length of time series) that is the main material for estimating FBNs. Please see (16, 18) for more details of the data preprocessing.

Proposed methods

As discussed in the introduction section, our brain, despite its extreme complexity, is believed to organize similar to many existing physical system/networks in which the spatially close elements tend to have a stronger relationship. In fact, recent researches have also shown that spatially proximal ROIs have a higher possibility of connection than spatially distant ROIs due to the parsimony principle of wiring cost [13].

In Fig 1, we provide a simple toy network for explaining the above viewpoint intuitively, as well as illustrating our motivation or assumption used in the newly proposed FBN estimation methods that will be discussed shortly. First, as shown in Fig 1, there are edges between some spatially close nodes (e.g., nodes 3 and 6), while usually no edge between spatially distant nodes (e.g., nodes 3 and 9). Regarding the relationship between the connection strength and spatial distance, Kaiser and Hilgetag investigated the fiber length distribution of cortical/nervous connections in the macaque and C. elegans, and found that most connections tend to be short in space [21]. Similar trend has been noted in human FBNs estimated with fMRI data [22]. In particular, several studies revealed that the strength of functional connections decreases with the spatial distance following a power-law [2225]. Therefore, a natural assumption (Assumption I) is that the connection strength in an FBN is inversely proportional to the distance of different ROIs.

Fig 1. An illustrative example for explaining the motivation or assumption behind the proposed methods.

Fig 1

The binary column vector in the figure denotes the connection pattern between the current node and other nodes. If two nodes (e.g., 3 and 6) are spatially closed, we assume they have a direct connection with a higher probability and share a similar connection pattern with other nodes.

Further, more complex interaction between ROIs has been characterized by a higher-order pattern (e.g., the correlations among different edges, or correlation’s correlation) in several recent studies [26, 27]. This inspires us to give Assumption II that the spatial distance also affects the network connections in a higher-order way. In other words, spatially close nodes share the similar connection topology with other nodes. For example, the spatially close nodes 3 and 6 have a similar connection pattern with other nodes, as indicated by the binary vector in Fig 1. In contrast, the connection pattern associated with the remote node 9 is different.

Now, based on the above assumptions, we develop two novel FBN estimation methods by incorporating the spatial distance information into the traditional SR model in both low- and high-order ways, respectively.

Method 1: Estimating FBN based on assumption I

Based on Assumption I, we propose to estimate FBN by the following optimization problem:

minwij(12i=1Pxijiwijxj22+λjicij|wij|) (5)

In Eq (5), we use the same data-fitting term as SR for capturing the partial correlation structure. The only difference is that a weight cij is introduced into the regularized term for strengthening/weakening the functional connectivity according to the spatial distance between different ROIs. In our method, we define cij = dij/dmax, where dij is the spatial distance between the ith and jth ROIs, and dmax is the maximum value of dij for i, j = 1, 2, ⋯, P. More specifically, the spatial distance dij is defined as follows,

dij=1ninja=1nib=1njRiaRjb2 (6)

where Ria denotes the 3-dimensional coordinates of the ath voxels in the ith ROI, ni and nj are the numbers of voxels in the ith and jth ROIs, respectively. Mathematically, Eq (5) can be simplified into the following matrix form:

minW(12XXWF2+λCW1) (7)

where C = (cij) ∈ RP×P is the coefficient matrix whose elements are the normalized distance between ROIs, and ∘ denotes Hadamard product of two matrices.

Note that the objective function of Eq (7) is convex, but non-differentiable due to the weighted L1-regularizer. Therefore, in this paper we use the proximal method [28] to solve Eq (7). In particular, we first calculate the gradient of the data fitting term denoted by f(W)=12XXWF2, and get ∇Wf(W) = XT(XWX). As a result, we have the following gradient descent step:

Wk+1=Wkαkf(Wk) (8)

where αk is the step size of the kth iteration. Then, we use the proximal operation for λCW1, defined as follows:

proximalλCW1=sign(wij)*max{|wij|λ×cij,0} (9)

to map the current W into a feasible region. As a result, we get a simple algorithm for solving Eq (7), as shown in Table 2.

Table 2. Algorithm for solving Eq (7).
Input: X, C, λ
Iterate
1. WWαk(XT(XWX))
2. WproximalλCW1=sign(wij)*max{|wij|λ×cij,0}
End
Output: W

Method 2: Estimating FBN based on assumption II

According to our previous discussion, if two ROIs are close in space, they are more likely to share a similar connection topology/pattern. Therefore, based on Assumption II we propose the second FBN estimation model as follows:

minwij(12i=1Pxijiwijxj22+λ12i,j=1Pwiwj22sij+λ2ji|wij|) (10)

where wi is the ith column of the network adjacency matrix W, indicating the connection pattern of the ith node with other nodes. λ1 and λ2 are the control parameters used to balance the tradeoff among the three terms in the objective function.

Compared with the traditional SR reviewed in Eq (3), a new regularizer, i.e., i,j=1Pwiwj22sij, is introduced into the proposed model for constraining the spatially close ROIs to have more similar connection patterns. In particular, sij is inversely proportional to the spatial distance, reflecting the similarity of the connection topology between two ROIs, and is defined as follows:

sij=edij (11)

where dij is given in Eq (6).

With a series of mathematical formulation, Eq (10) can be simplified into the following matrix form:

minW(12XXWF2+λ12tr(W(DS)WT)+λ2W1) (12)

where S = (sij) ∈ RP×P is the defined similarity matrix in Eq (11), D is a diagonal matrix whose diagonal element is the row sum of S, i.e., Djj=i=1Psij,j=1,2,,P. It can be noted that DS is in fact the graph Laplacian of S.

Similar to Method 1, the proximal method is used to solve Eq (12). Specifically, we consider the first two terms in Eq (12) as a whole, and denote it as f(W)=12XXWF2+λ12tr(W(DS)WT). Then, we can easily get its derivative ∇Wf(W) = XT(XWX) + λ1W(DS). As a result, we have the following algorithm to solve Eq (12), as shown in Table 3.

Table 3. Algorithm for solving Eq (12).
Input: X, S, λ1, λ2
Iterate
1. WWαk(XT(XWX) + λ1W(DS))
2. Wproximalλ2W1=sign(wij)*max{|wij|λ2,0}
End
Output: W

Experiments and result

Feature selection and classification based on estimated FBNs

After obtaining the “clean” fMRI data, we estimate FBNs based on different methods, and compare their performance by a systematical experiment consisting of several main steps as shown in Fig 2.

  • Step 1: Estimating FBNs based on different methods and parametric values. For PC, we estimate FBNs by removing a percentage of weak connections, where the candidate percentage is selected in the set of [100%, 90%, ⋯, 10%]. For SR and Method 1, we use different values of the regularization parameter in the range of [0.001, 0.002, ⋯, 0.009, 0.01]. For Method 2, the two regularized parameters, λ1 and λ2, are selected in the ranges of [2−10, 2−9, ⋯, 2−2, 2−1] and [0.001, 0.002, ⋯, 0.009, 0.01], respectively.

  • Step 2: Feature selection and classification. We use t-test (with p-value equal to 0.01) for selecting discriminative features, and use the linear support vector machines (SVM) (with default parameter C = 1) for performing classification tasks. Due to the limitation of sample size, we use the leave-one-out (LOO) cross validation to calculate the performance of different methods. Specifically, for the 91 subjects in the dataset, 90 of them are used for training, while the remaining one is used for testing. After a complete loop, we will obtain 91 classifiers, and the final classification accuracy is defined as:
    accuracy=thenumberofcorrectclassificationthenumberoftrainedclassifiers, (13)

Fig 2. The experimental pipeline.

Fig 2

Since the parameters involved in the FBN estimation models may affect the structure of brain network, we select the optimal parametric values by an inner LOO cross validation on the training set, as shown in Fig 2. In light of this, we will obtain a series of classification results in the inner LOO. Lastly, we determine the parametric values corresponding to the best classification accuracy, and use the FBNs estimated by the optimal parametric value to conduct the outer LOO for achieving the final classification results.

Experimental results

A set of quantitative measurements, including accuracy, sensitivity and specificity, are used to evaluate the classification performance. Their definitions are given as follows:

Accuracy=TP+TNTP+FP+TN+FN (14)
Sensitivity=TPTP+FN (15)
specificity=TNTN+FP (16)

where TP is the number of positive subjects correctly classified in the MCI identification task, and FN is the number of negative subjects incorrectly classified in the MCI identification task. Accordingly, TN and FP are the numbers of their corresponding subjects, respectively. As a result, the classification performances are reported in Table 4.

Table 4. The classification performance of Method 2.

Accuracy Sensitivity specificity
PC 60.44% 57.78% 63.04%
SR 67.03% 62.22% 71.74%
Method 1 71.43% 75.56% 67.39%
Method 2 76.92% 82.22% 71.74%

As shown in Table 4, we can find that the proposed methods are superior to the baseline methods. Hence, we argue that the spatial distance between different brain regions may play a potentially important role in estimating FBNs. Furthermore, we note that Method 2 works better than Method 1. In our opinion, this result is possibly caused by the fact that Method 1 is based on a too harsh assumption that two brain regions have stronger functional connection if they are close in spatial location, while Method 2 is based on a relatively moderate assumption that considers the distance information in a higher-order way.

For an intuitive comparison, we visualize the FBNs estimated by PC, SR and our proposed methods, respectively, in Fig 3. It can be seen that the PC-based FBN is dense, since the pairwise full correlation is used to model the network adjacency matrix. As a result, it will lead to more false connections, and in turn affect the final classification accuracy. In contrast, SR and our methods can obtain spare FBNs by removing some weak or potentially noisy connections from the estimated FBN due to the L1 regularizer. Note that, the sparsity plays an important role in FBN estimation, but over-sparsity will cause the loss of useful discriminative connections. Therefore, we determine the suitable sparsity when estimating FBNs by a nested cross validation on the training data. Particularly, compared with the traditional SR and Method 1, the proposed Method 2 tends to achieve a cleaner FBN with clear modularity structures, as shown in Fig 3(d). Note that the density of networks estimated with the proposed methods may vary from subject to subject, due to the fact that the L_1-regularizer controls the network density indirectly, and the nested LOO CV automatically assigns an optimal value for the regularized parameter.

Fig 3. FBNs estimated by 4 different methods.

Fig 3

Sensitivity to parameters

Parameters generally have an important influence on the FBN structure and the subsequent classification accuracy. In order to investigate the sensitivity of different methods to the involved parameters, we compute the classification accuracy under different parametric combinations, and report the results in Fig 4. Since there are two parameters in Method 2, we show the classification accuracy in a 2-dimensional bar graph. It can be observed from Fig 4 that the classification accuracy fluctuates heavily with the change of the parametric values, indicating that most of the FBN estimation methods are sensitive to the free parameters.

Fig 4. Classification accuracy by PC, SR and the proposed methods based on different parametric values.

Fig 4

The horizontal axis represents network density (a) or regularization parameter (b-d), and the vertical axis represents the classification accuracy.

As shown in Fig 4, the classification accuracies fluctuate with the change of the parametric values. In particular, for PC-based FBNs, the optimal parametric values (i.e., network density) is 70%, and we note that an over-sparsity instead results in a drop of classification accuracy. For SR and Method 1, the L1-regularizer is introduced into the FBN estimation model for removing the weak connections and controlling the density of the estimated FBNs. From Fig 4(b) and 4(c), we observe that the optimal values of parameter λ are 0.009 and 0.01 for SR and Method 1, respectively. Note that two regularized parameters are involved in Method 2. One (λ2) is used for controlling the density of the FBNs, while the other (λ1) is used for modelling the similarity of two ROIs. As shown in Fig 4(d), λ1 = 2−5 and λ2 = 0.003 is the optimal combination for the two parameters. Therefore, for our proposed methods, we suggest setting the parametric value for L1-regularizer within the range of [10−3, 10−2], while the parametric value for high-order regularizer (in Method 2) within the range of [2−5, 2−3].

Influence of parameter cij on results

Parameter cij, as the function of the dij, is used for strengthening/weakening the functional connectivity. We expect that the larger distance suppresses the connection, while the closer distance promotes the connection. In light of this, there are many definitions for cij. To investigate their effects, we compare the classification results by defining different cij, and report the results in Table 5. As shown in Table 5, the classification results are slightly affected by different definitions of cij.

Table 5. The classification results of Method 1 under different value of cij.

Accuracy Sensitivity specificity
cij=dijdmax 71.43% 75.56% 67.39%
cij=dij2dmax 71.43% 73.33% 69.57%
cij=dij3dmax 70.33% 71.11% 69.57%

Discriminative features

In addition to the classification accuracy itself, a more informative aspect is which features/connections in FBN contribute to the final accuracy. In this paper, we find the discriminative connections in two different ways. For the first way, we record the top features selected by t-test with the default p-value of 0.01 in each loop of the cross validation, and report the common features in Fig 5(a). For the second way, we select discriminative features according to the mean of the absolute weights derived from SVM classifier in all loops of cross validation. It is generally believed that the larger the absolute weight of the corresponding feature is, the more discriminative this feature could be. Accordingly, the result is shown in Fig 5(b). Finally, we consider the features selected simultaneously by the two ways contributing the most to the MCI identification. Specially, the brain regions involved in the selected features by both ways include precuneus, amygdala, para-hippocampal. This finding is consistent with several previous studies on MCI [29, 30].

Fig 5. The selected features/connections based on t-test (a) and the weighs of SVM classifier (b).

Fig 5

The color is set randomly for better visualization, and the width of the arc represents the discriminability of the corresponding connection.

FBN estimation with simulated data

Besides experiments on the real data set, we also estimate FBNs based on a set of simulated BOLD signals for evaluating the generalizability of the proposed methods and analyzing their ability to detect the network structures. First, we extract a subnetwork with 10 nodes from Fig 1 as the ground truth. As shown in Fig 6(a)–6(b), the nodes in the given network can be parcellated into two groups, indicating that the ground truth of FBN has a clear modularity structure. Then, based on the ground-truth network, we generate simulated BOLD signals with 1500 time points using MULAN toolbox [31]. Note that all the parametric values used in MULAN generator are provided by default in the supplementary material of [31]. In Fig 6(c), we show the 10 simulated BOLD signal series associated with 10 nodes. Finally, we estimate FBNs based on the simulated BOLD signals using different methods, and visualize the estimated results in Fig 6(d)–6(g).

Fig 6.

Fig 6

(a-b) The given network as ground truth; (c) The simulated BOLD signals generated by MULAN toolbox (https://github.com/HuifangWang/MULAN); (d)-(g) The estimated FBNs by running four different methods based on the simulated BOLD signals in (c).

From Fig 6, we have the following observations: 1) PC can only detect one of the network modules, and lead to some false connections. 2) SR can remove the weak (potentially false) connections, which makes the adjacency matrix look cleaner. However, it cannot recover the original network structure as PC. 3) The proposed Method 1 can roughly detect the network structure, but, comparing to the ground truth, there are some false connections between (and within) the two modules. 4) Different from the other three methods, our Method 2 can not only detect the original network structure, but also include the fewest false connections. Further, we also quantify the similarity between the estimated brain networks and ground truth by Pearson’s correlation coefficient as shown in Table 6. The results show that Method 2 achieves the highest similarity to the ground truth. This is consistent with the above observations in Fig 6(d)–6(g).

Table 6. The similarity between ground truth network and the generated FBNs.

PC SR Method 1 Method 2
Ground truth 43.99% 50.13% 51.02% 70.04%

Discussion

In this work, we propose two SR-based methods by combining the spatial information into the FBN estimation model in both low-order (Method 1) and high-order (Method 2) ways. Especially for Method 2, the experimental results show that its performance is better than other methods, including PC, SR and Method 1. This may be due to the fact that we do not only take the distance between different ROIs into account, but also attempt to model higher correlation (e.g., the correlation among different edges [30] or correlations’ correlation [24]) in the model. In addition, we note that there are more free parameters (λ1 & λ2) in Method 2 than other methods, which provides more flexibility to the model. However, the fact of more free parameters is not necessarily the reason for improving the performance that largely depends on whether the optimal parameter can be found in the training dataset. In our experiment, the optimal parametric value is selected by an inner CV on the training data, and Method 2 (under the selected optimal parametric value) is better than other methods. Although empirical results show the effectiveness of our proposed methods, there are many aspects that need to be further explored in the future. Therefore, in what follows, we give a brief discussion on several issues that the readers might be interested in.

For defining ROIs, researchers have developed many schemes, which can be roughly separated into atlas-based and data-driven methods. Atlas-based methods, such as AAL [19], anatomical Harvard-Oxford (HO) [32], Automatic Non-linear Imaging Matching and Anatomical Labeling (ANIMAL) [33], and Bootstrap Analysis of Stable Clusters (BASC) atlas [34], expect that the voxels within the same ROI tend to share the similar structure or function. In practice, however, this kind of methods generally suffer from low consistency [33, 35], partially due to individual difference, boundary vagueness and data unreliability. Additionally, the final BOLD signal is extracted by averaging on the voxels within a ROI, which results in information loss, and in turn makes the estimated FBNs include some unreliable connections [36].

As an alternative to atlas-based methods, data-driven methods directly work on the used dataset, and thus can relieve the low consistency issue to some extent. The popular methods for learning ROIs from data include clustering methods [37] and group independent component analysis methods (GICA) [38]. The clustering methods separate voxels into functionally homogeneous parcels, and define each parcel as a ROI. In contrast, GICA methods decompose the data into some components, each of which corresponds to a spatial map as the ROI. However, for data-driven methods, selecting the number of clusters or components is an exceedingly challenging problem. This is one of reasons why we simply use AAL atlas for parceling the brain in this paper. Another reason we use AAL is its popularity. For example, in a recent study [39], Brown et al. reviewed 77 works, in which AAL is the most commonly used method to define ROIs. In light of this, it is convenient for researchers to compare their findings with other studies. In summary, there is no consensus as to which ROI definition scheme is the best [5]. A recent study [40] systematically evaluated the influences of different aspects (including atlas selection, FBN estimation and classification design) on the FBN-based classification accuracy, and shown that, relative to the two others, the selection of atlas brings smaller influence on the final results.

For the performance improvement of the proposed methods in terms of classification accuracy, more information (e.g., spatial distance between ROIs) is included in this paper for estimating FBNs. For the spatial distance information, it can be uniquely (at least easily) determined in the voxel scale. However, in the ROI level, determining the spatial distance is not a trivial problem, because it heavily depends on both the calculation methods and the definition of ROIs. In [41], Alexander-Bloch et al. calculate spatial information using Euclidean distance between different ROIs centroids. In this paper, we define the distance as the average of the distances between all the voxel pairs in different ROIs. However, it is difficult for these methods to accurately assign physical distance between the ROIs with different sizes. For example, the distance between the two physically adjacent ROIs with big sizes may be greater than the distance between two nonadjacent ROIs with small sizes. Therefore, application of the distance between ROIs should be cautious, and further study is needed for designing a “good” metric to measuring distance between the ROIs with different shapes and sizes.

Conclusion

PC and SR are the two baseline methods to estimate FBNs. PC, as the simplest method, is sensitive to indirect relationship between BOLD signals. In contrast, SR not only regresses out the indirect confounding effect before calculating the correlation between ROIs, but also incorporates the sparse prior into the FBN estimation model. However, SR estimates FBN only based on BOLD signals, yet ignoring their spatial location. Therefore, in this paper we propose two novel methods to estimate FBN by incorporating spatial distance information into the estimation model in two different ways. In order to verify the effectiveness of proposed methods, we conduct experiments based on the estimated FBNs. The results show that the proposed methods obtain a superior performance in the task of MCI classification. Therefore, we argue that the distance between different brain regions may play an important role in improving the quality of the estimated FBNs. In the future, we plan to conduct more experiments to evaluate the influence of the spatial distance on other FBN estimation methods.

Data Availability

All relevant data are available from the GitHub: https://github.com/X-Yanfang/xyf-DATA.

Funding Statement

This work is supported by National Natural Science Foundation of China (http://www.nsfc.gov.cn/)(61976110, 11931008) and Natural Science Foundation of Shandong Province(http://cloud.sdstc.gov.cn/) (ZR2018MF020). There was no additional external funding received for this study.

References

  • 1.Martinez B, Vpeplow P. MicroRNAs as diagnostic and therapeutic tools for Alzheimer’s disease: advances and limitations. NEURAL REGENERATION RESEARGH. 2019;14(2):242–255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Forman MS, Mufson EJ, Leurgans S, Pratico D, Joyce S, Leight S, et al. Cortical biochemistry in MCI and Alzheimer disease: lack of correlation with clinical diagnosis. Neurology. 2007;68(10):757–763. 10.1212/01.wnl.0000256373.39415.b1 [DOI] [PubMed] [Google Scholar]
  • 3.Drzezga A, Grimmer T, Riemenschneider M, Lautenschlager N, Siebner H, Alexopoulus P, et al. Prediction of individual clinical outcome in MCI by means of genetic assessment and (18)F-FDG PET. Journal of Nuclear Medicine. 2005;46(10):1625–1632. [PubMed] [Google Scholar]
  • 4.Smith SM, Vidaurre D, Beckmann CF, Glasser MF, Jenkinson M, Miller KL, et al. Functional connectomics from resting-state fMRI. Trends in Cognitive Sciences. 2013;17(12):666–682. 10.1016/j.tics.2013.09.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bijsterbosch J, Smith SM, Beckmann CF. Introduction to resting state FMRI functional connectivity: Oxford University Press; 2017. [Google Scholar]
  • 6.Poldrack RA, Farah MJ. Progress and challenges in probing the human brain. Nature. 2015;526(7573):371–379. 10.1038/nature15692 [DOI] [PubMed] [Google Scholar]
  • 7.Marrelec G, Krainik A, Duffau H, Pélégrini-Issac M, Lehéricy S, Doyon J, et al. Partial correlation for functional brain interactivity investigation in functional MRI. Neuroimage. 2006;32(1):228–237. 10.1016/j.neuroimage.2005.12.057 [DOI] [PubMed] [Google Scholar]
  • 8.Li W, Wang Z, Zhang L, Qiao L, Shen D. Remodeling Pearson’s Correlation for Functional Brain Network Estimation and Autism Spectrum Disorder Identification. Frontiers in Neuroinformatics. 2017;11:55 10.3389/fninf.2017.00055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Huang S, Li J, Sun L, Ye J, Fleisher A, Wu T, et al. Learning brain connectivity of Alzheimer’s disease by sparse inverse covariance estimation. Neuroimage. 2010;50(3):935–949. 10.1016/j.neuroimage.2009.12.120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Lee H, Soo LD, Hyejin K, Boong-Nyun K, Chung MK. Sparse brain network recovery under compressed sensing. IEEE Transactions on Medical Imaging. 2011;30(5):1154–1165. 10.1109/TMI.2011.2140380 [DOI] [PubMed] [Google Scholar]
  • 11.Smith SM, Miller KL, Gholamreza SK, Matthew W, Beckmann CF, Nichols TE, et al. Network modelling methods for FMRI. Neuroimage. 2011;54(2):875–891. 10.1016/j.neuroimage.2010.08.063 [DOI] [PubMed] [Google Scholar]
  • 12.Zhou Z, Chen X, Zhang Y, Qiao L, Yu R, Pan G, et al. Brain Network Construction and Classification Toolbox (BrainNetClass). arXiv e-prints [Internet]. 2019 June 01, 2019. https://ui.adsabs.harvard.edu/abs/2019arXiv190609908Z.
  • 13.Fornito A, Zalesky A, Bullmore E. Fundamentals of brain network analysis: Academic Press; 2016. [Google Scholar]
  • 14.Biswal B, Yetkin FZ, Haughton VM, Hyde JS. Functional connectivity in the motor cortex of resting human brain using echo-planar MRI. Magnetic Resonance in Medicine. 1995;34(4):537–541. 10.1002/mrm.1910340409 [DOI] [PubMed] [Google Scholar]
  • 15.Lee H, Lee DS, Kang H, Kim B-N, Chung MK. Sparse brain network recovery under compressed sensing. IEEE Transactions on Medical Imaging. 2011;30(5):1154–1165. 10.1109/TMI.2011.2140380 [DOI] [PubMed] [Google Scholar]
  • 16.Gao Y, Adeli-M. E., Kim M, Giannakopoulos P, Haller S, Shen D. Medical Image Retrieval Using Multi-graph Learning for MCI Diagnostic Assistance. [DOI] [PMC free article] [PubMed]
  • 17.Preti MG, Haller S, Giannakopoulos P, De Ville DV, editors. Decomposing dynamic functional connectivity onto phase-dependent eigenconnectivities using the Hilbert transform. international symposium on biomedical imaging; 2015.
  • 18.Qiao L, Zhang H, Kim M, Teng S, Zhang L, Shen DJN. Estimating functional brain networks by incorporating a modularity prior. 2016;141:399–407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Tzourio-Mazoyer N, Landeau B, Papathanassiou D, Crivello F, Etard O, Delcroix N, et al. Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain. Neuroimage. 2002;15(1):273–289. 10.1006/nimg.2001.0978 [DOI] [PubMed] [Google Scholar]
  • 20.Yan CG, Zang YF. DPARSF: A MATLAB Toolbox for "Pipeline" Data Analysis of Resting-State fMRI. Frontiers in Systems Neuroscience. 2010;4(13):13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kaiser M, Hilgetag CC. Nonoptimal component placement, but short processing paths, due to long-distance projections in neural systems. PLoS computational biology. 2006;2(7):e95 10.1371/journal.pcbi.0020095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Rubinov M, Ypma RJ, Watson C, Bullmore ET. Wiring cost and topological participation of the mouse brain connectome. Proceedings of the National Academy of Sciences. 2015;112(32):10032–10037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Choi H, Mihalas S. Synchronization dependent on spatial structures of a mesoscopic whole-brain network. PLoS computational biology. 2019;15(4):e1006978 10.1371/journal.pcbi.1006978 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Achard S, Salvador R, Whitcher B, Suckling J, Bullmore E. A resilient, low-frequency, small-world human brain functional network with highly connected association cortical hubs. Journal of Neuroscience. 2006;26(1):63–72. 10.1523/JNEUROSCI.3874-05.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Hagmann P, Kurant M, Gigandet X, Thiran P, Wedeen VJ, Meuli R, et al. Mapping human whole-brain structural networks with diffusion MRI. PloS one. 2007;2(7):e597 10.1371/journal.pone.0000597 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Zhang H, Chen X, Shi F, Li G, Kim M, Giannakopoulos P, et al. Topographical information-based high-order functional connectivity and its application in abnormality detection for mild cognitive impairment. Journal of Alzheimer’s Disease. 2016;54(3):1095–1112. 10.3233/JAD-160092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chen X, Zhang H, Gao Y, Wee CY, Li G, Shen D, et al. High-order resting-state functional connectivity network for MCI classification. Human brain mapping. 2016;37(9):3282–3296. 10.1002/hbm.23240 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Liu J, Yuan L, SLEP JY. Sparse learning with efficient projections. techincal report. Computer Science Center, Arizona State University. 2011.
  • 29.Albert MS, Dekosky ST, Dickson D, Dubois B, Feldman HH, Fox NC, et al. The diagnosis of mild cognitive impairment due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s & dementia: the journal of the Alzheimer’s Association. 2011;7(3):270–279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Amit A, Yu L, Yang W, Jingwei W, Sujuan G, Lubna B, et al. Activity and connectivity of brain mood regulating circuit in depression: a functional magnetic resonance study. Biological Psychiatry. 2005;57(10):1079–1088. 10.1016/j.biopsych.2005.02.021 [DOI] [PubMed] [Google Scholar]
  • 31.Wang HE, Friston KJ, Bénar CG, Woodman MM, Chauvel P, Jirsa V, et al. MULAN: Evaluation and ensemble statistical inference for functional connectivity. NeuroImage. 2018;166:167–184. 10.1016/j.neuroimage.2017.10.036 [DOI] [PubMed] [Google Scholar]
  • 32.Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 2006;31(3):968–980. 10.1016/j.neuroimage.2006.01.021 [DOI] [PubMed] [Google Scholar]
  • 33.Collins DL, Holmes CJ, Peters TM, Evans AC. Automatic 3-D model-based neuroanatomical segmentation. Human brain mapping. 1995;3(3):190–208. [Google Scholar]
  • 34.Bellec P, Rosa-Neto P, Lyttelton OC, Benali H, Evans AC. Multi-level bootstrap analysis of stable clusters in resting-state fMRI. Neuroimage. 2010;51(3):1126–1139. 10.1016/j.neuroimage.2010.02.082 [DOI] [PubMed] [Google Scholar]
  • 35.Korhonen O, Saarimäki H, Glerean E, Sams M, Saramäki J. Consistency of regions of interest as nodes of fMRI functional brain networks. Network Neuroscience. 2017;1(3):254–274. 10.1162/NETN_a_00013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Stanley ML, Moussa MN, Paolini B, Lyday RG, Burdette JH, Laurienti PJ. Defining nodes in complex brain networks. Frontiers in computational neuroscience. 2013;7:169 10.3389/fncom.2013.00169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Cordes D, Haughton V, Carew JD, Arfanakis K, Maravilla K. Hierarchical clustering to measure connectivity in fMRI resting-state data. Magnetic resonance imaging. 2002;20(4):305–317. 10.1016/s0730-725x(02)00503-9 [DOI] [PubMed] [Google Scholar]
  • 38.Calhoun VD, Adali T, Pearlson GD, Pekar JJ. A method for making group inferences from functional MRI data using independent component analysis. Human brain mapping. 2001;14(3):140–151. 10.1002/hbm.1048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Brown CJ, Hamarneh G. Machine learning on human connectome data from MRI. arXiv preprint arXiv:161108699. 2016.
  • 40.Dadi K, Rahim M, Abraham A, Chyzhyk D, Milham M, Thirion B, et al. Benchmarking functional connectome-based predictive models for resting-state fMRI. Neuroimage. 2019;192:115–134. 10.1016/j.neuroimage.2019.02.062 [DOI] [PubMed] [Google Scholar]
  • 41.Alexander-Bloch AF, Vertes PE, Stidd R, Lalonde F, Clasen L, Rapoport J, et al. The anatomical distance of functional connections predicts brain network topology in health and schizophrenia. Cerebral cortex. 2012;23(1):127–138. 10.1093/cercor/bhr388 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All relevant data are available from the GitHub: https://github.com/X-Yanfang/xyf-DATA.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES