Abstract
Sparse representation-based brain network modeling, although popular, often results in relatively large inter-subject variability in network structures. This inevitably makes it difficult for inter-subject comparison, thus eventually deteriorating the generalization capability of personalized disease diagnosis. Accordingly, group sparse representation has been proposed to alleviate such limitation by jointly estimating connectivity weights for all subjects. However, the constructed brain networks based on this method often fail in providing satisfactory separability between the subjects from different groups (e.g., patients vs. normal controls), which will also affect the performance of computer-aided disease diagnosis. Based on the hypothesis that subjects from the same group should have larger similarity in their functional connectivity (FC) patterns than subjects from other groups, we propose an “inter-subject FC similarity-guided” group sparse network modeling method. In this method, we explicitly include the inter-subject FC similarity as a constraint to conduct group-wise FC network modeling, while retaining sufficient between-group differences in the resultant FC networks. This improves the separability of brain functional networks between different groups, thus facilitating better personalized brain disease diagnosis. Specifically, the inter-subject FC similarity is roughly estimated by comparing the Pearson’s correlation based FC patterns of each brain region to other regions for each pair of the subjects. Then, this is implemented as an additional weighting term to ensure the adequate inter-subject FC differences between the subjects from different groups. Of note, our method retains the group sparsity constraint to ensure the overall consistency of the resultant individual brain networks. Experimental results show that our method achieves a balanced trade-off by not only generating the individually consistent FC networks, but also effectively maintaining the necessary group difference, thereby significantly improving connectomics-based diagnosis for mild cognitive impairment (MCI).
1 Introduction
Characterized by progressive perceptive and cognitive deficits, Alzheimer’s disease (AD) is an irreversible neurological disease with no cure [1]. Mild cognitive impairment (MCI) is known as a prodromal stage of AD, and individuals with MCI tend to progress to AD at a rate of about 10–15% per year [2]. Early detection of MCI is of great clinical significance, thus imaging-based diagnosis has attracted much attention in the past few years [3]. However, accurate MCI diagnosis is considerably challenging due to subtle brain anatomical and functional changes during this early stage [4, 5].
As an in vivo brain functional imaging technique, the resting-state functional magnetic resonance imaging (rs-fMRI) measures blood oxygenation level-dependent (BOLD) signals related to spontaneous neural activity when subject is in the natural rest. In recent years, rs-fMRI has been successfully applied to abnormality detection of various neurological diseases and psychiatric disorders, including schizophrenia, autism spectrum disorder, and AD/MCI [6]. The most-widely used methods with rs-fMRI are based on functional connectivity (FC), which measures temporal correlation of BOLD signals between any pair of brain regions. Based on FC between each pair of brain regions, the brain FC network can be constructed for characterizing the intrinsic functional architecture of the human brain and its abnormality caused by the possible pathological attack [7, 8]. Accurate construction of the brain FC network based on rs-fMRI is one of the most essential problems for more reliable early detection of MCI and also better understanding of potential biomarkers at the whole-brain systematical level [9].
Of various brain FC network modeling methods, the most widely-used one is Pearson’s correlation (PC). Despite of its simplicity and biological intuitiveness, this method also suffers drawbacks that only pair-wise linear interactions are revealed without accounting for more complex influences from other brain regions. To solve this problem, sparse representation (SR) has been adopted for constructing sparse brain FC networks based on the observations that the brain network is largely sparse and the wiring is efficient and cost-effective [10]. Based on SR, the BOLD signals of a brain region can be represented by a linear combination of the signals from a small number of other brain regions, under a sparsity constraint by forcing insignificant connections to be zero. However, because of the data-driven nature, a potential issue of the SR is that it often results in brain networks with a relatively large inter-subject variability due to unpredicted interferences of rs-fMRI noise and artifact. This often leads to a consequence of poor generalization ability of a trained classifier for brain disease diagnosis and, more problematically, for MCI diagnosis because the subtle pathological changes in MCI could be overwhelmed by the large inter-subject variability [11]. Targeting at increasing inter-subject comparability, group sparse representation (GSR) has been proposed by jointly estimating the FC weights for all subjects using a group lasso constraint with l2,1-norm [12]. This will make certain connectivity links in all subjects being zero or non-zero jointly, which alleviates inter-subject variability, but potentially sacrificing inter-subject separability especially for the subjects from different (e.g., patient and control) groups. Therefore, it will yield suboptimal classification performance when applied to brain disease diagnosis [3].
More recently, connectivity strength-weighted SR method was proposed for individual brain network construction by exploiting FC connectivity prior [13]. With guidance from connectivity strength of PC, the network modeled at the individual level achieved better biological meaning and yielded improved classification accuracy of MCI. One previous study about unsupervised cluster analysis on major depression [14] suggested that subjects from the same group often have larger FC similarity than those from different groups. Accordingly, in this study, we introduce such a prior into GSR-based brain FC network modeling to preserve systematical group difference, without losing the merit of inter-subject consistency in sparse learning framework. Specifically, we propose an “inter-subject FC similarity-guided group sparse” brain network modeling method to increase the between-group separability of FC networks, thus facilitating personalized brain disease diagnosis. In particular, the inter-subject similarity of FC networks is estimated by comparing the PC-based FC pattern of each brain region (i.e., the one-to-all FC pattern) between each pair of subjects. This inter-subject PC-based FC-pattern resemblance is then used as a weighting to form an additional constraint that could penalize those excessive “inter-subject differences in SR-weights” for subjects from the same group, while retaining the sufficient SR-weight differences between subjects from different groups. Next, we will show how this idea can be seamlessly integrated into the GSR-based brain network modeling. To validate the effectiveness of our proposed method, we carry out experiments by applying the new network modeling method to MCI patient identification, a challenging problem due to subtle pathological changes in MCI compared with large inter-subject variability. Results show that our method can not only effectively detect group difference, but also significantly improve the brain functional connectomics-based MCI diagnosis.
2 Methods
2.1 GSR for Brain Network Modeling
Suppose that contains the mean time series of a total of K regions-of-interest (ROIs) for the i-th subject. Without loss of generality, let us assume that has been de-meaned and variance-standardized (i.e., divided by the standard deviation). With PC, the brain FC network of each subject i can be roughly estimated by calculating the full correlation , such that the k-th column in Ci characterizes the functional interactions between the k-th ROI and all other ROIs (i.e., one-to-all FC-pattern of the k-th ROI for the i-th subject). Different from PC, SR estimates such a one-to-all FC-pattern through linearly regressing BOLD signals from the k-th ROI by BOLD signals of all other regions using a l1-norm sparse regularization:
(1) |
where λ is a regularization parameter controlling the sparsity of . SR models a brain network for each subject separately; such an independent modeling strategy may easily lead to relatively large inter-subject variability in . GSR-based FC network modeling can alleviate such a problem by jointly estimating non-zero connections across subjects via l2,1-norm regularization-based group lasso:
(2) |
where consists of the one-to-all FC-patterns of the k-th ROI for all N subjects, and λ controls the amount of group sparsity. The brain networks modeled by GSR will share similar topological structures (by enforcing similar nonzero or zero connectivities for all subjects) to reduce inter-subject variability. However, an inherent problem of the GSR-based network modeling roots in the group lasso constrain term, which also sacrifices the potentially important between-group differences that often benefit the disease diagnosis. Thus, the suboptimal classification performance is achieved when applied to computer-aided disease diagnosis.
2.2 Inter-subject Similarity-Guided GSR for Brain Network Modeling
We propose to estimate inter-subject FC similarity based on PC values of the same FC-patterns between different subjects, and then use this as a prior to control the inter-subject similarity/difference in the final estimated FC. In the following, we explain how the constraint can be integrated into the GSR-based method for guiding the construction of better brain networks. Let and denote the regional FC-patterns (estimated by PC) of the k-th ROI for the i-th and the j-th subjects, respectively. A graph Laplacian [15, 16] is constructed with a similarity matrix with defining the pairwise similarity of subjects in terms of their FC patterns for the k-th ROI. For simplicity, we set the kernel width σ = 1 in our experiments. Then, a similarity-preserving regularization term can be defined to control the inter-subject similarity/difference as follows:
(3) |
where Lk = Dk − Sk, and Dk ∈ ℝN×N is a diagonal matrix with its diagonal elements defined as . By integrating the regularization term Θ into GSR, a new model for constructing brain network is derived as:
(4) |
where λ1 and λ2 denote the regularization parameters for group sparsity and inter-subject FC-pattern similarity, respectively. In Eq. (4), by further adding the second regularization term Θ, we encourage inter-subject FC-pattern resemblance if their PC-based FC patterns are similar, and penalize inter-subject FC-pattern resemblance if their PC-based FC patterns are quite distinct. This will suppress within-group FC difference while retaining sufficient group differences. That is, the proposed new model will allow us to both achieve the good separability without losing the merit of group sparsity and promote the brain connectomics-based diagnosis. The optimization problem in Eq. (4) can be solved by an iterative approach, based on the accelerated proximal gradient algorithm [17]. Figure 1 illustrates the framework of our proposed inter-subject FC similarity constraint based GSR modeling. Specifically, for the i-th subject, the constructed brain network is formed as . Note that, the network matrix Gi is typically asymmetric. Thus, a symmetry operation is further carried out to achieve the symmetric network.
3 Experiments
3.1 Data Preprocessing
We use a publicly available Alzheimer’s Disease Neuroimaging Initiative (ADNI) phase-2 dataset (http://adni.loni.usc.edu/) to evaluate our proposed brain FC network modeling method. Specifically, 52 normal control (NC) subjects and 52 MCI subjects with rs-fMRI data are selected. These selected subjects are age- and gender-matched, and were all scanned using 3.0T Philips scanners with the same imaging parameters. The rs-fMRI data of these subjects are preprocessed using SPM8 toolbox (http://www.fil.ion.ucl.ac.uk/spm/software/spm8/). In particular, the first three volumes of each rs-fMRI data are discarded for magnetization equilibrium. After head motion correction, the remaining rs-fMRI data are nonlinearly registered into Montreal Neurological Institute (MNI) space and further spatially smoothed by an isotropic Gaussian kernel with full-width-at-half-maximum (FWHM) of 6 mm. Of note, we do not perform data scrubbing (i.e., removing the rs-fMRI frames with larger than 0.5-mm frame-wise displacement), since this would introduce additional artifacts. However, the subjects with large frame-wise displacement for 2.5-min data are excluded in data screening. The rs-fMRI data are then parcellated into 116 ROIs based on the Automatic Anatomical Labeling (AAL) template. In each ROI, the mean time series is extracted and band-pass filtered (0.015–0.15 Hz); also, head motion parameters and the mean BOLD time series of both white matter and cerebrospinal fluid are regressed out for reducing the potential interference to the subsequent FC estimation.
3.2 Performance Evaluation
The upper triangle elements of the constructed brain network of each subject are concatenated to form a feature vector. The dimensionality of this feature vector is 116 × (116 – 1)/2 = 6670. Then, we carry out two-sample t-tests with a significance level of p < 0.05 to reduce the redundant features. Lasso [18] is further employed to select a small number of features with better discriminative ability. Finally, a support vector machine (SVM) with a linear kernel is trained on the selected features for MCI classification. The leave-one-out cross-validation (LOOCV) scheme is adopted for evaluation of diagnosis performance. In each fold of LOOCV procedure, an additional inner LOOCV is also carried out on the training data to determine the optimal hyper-parameters for the sparse regression (λ1 and λ2) as well as for the SVM. Classification performance is evaluated based on classification accuracy (ACC), area under ROC curve (AUC), sensitivity (SEN), and specificity (SPE). To further evaluate the effectiveness of our proposed framework, we have made extensive comparisons with the brain networks constructed by Pearson’s correlation (PC), sparse representation (SR), group sparse representation (GSR), and our method, using the same dataset.
3.3 Experimental Results
Table 1 summarizes the classification results by different brain network modeling methods. Compared with PC, all of other sparse representation-based methods improved the classification accuracy. Among them, GSR performed better than SR, and our proposed method performed better than GSR. In particular, our proposed method produced the highest accuracy of 85.6%, with improvements of 21.2%, 15.4% and 7.7% compared with PC, SR, and GSR, respectively.
Table 1.
Method | ACC (%) | AUC | SEN (%) | SPE (%) |
---|---|---|---|---|
PC | 64.4 | 0.709 | 61.5 | 67.3 |
SR | 70.2 | 0.750 | 69.2 | 71.2 |
GSR | 77.9 | 0.835 | 76.9 | 78.9 |
Proposed | 85.6 | 0.927 | 82.7 | 88.5 |
To intuitively illustrate the superiority of our proposed network modeling method, we also present the separability values (r2 value, with larger value indicating higher separability [19, 20]) between two groups, for each connection of the brain FC networks modeled using GSR (Fig. 2a) and our proposed method (Fig. 2b). Our method reveals larger and more group differences compared to the GSR-based method. Moreover, the potential biomarkers detected by our method are largely non-overlapped with those detected by GSR, indicating that including an additional constraint term has resulted in systematical changes of the constructed brain network. Our newly identified connectivities with large MCI/NC separability (r2 > 0.07) mainly include three sets of connections: (1) within-hemisphere connections that specifically located in the left hemisphere (Fig. 2c), (2) inter-hemisphere connections (Fig. 2c), and (3) cerebro-cerebellar connections (Fig. 2d). Most of these connections involve brain regions at the frontal lobe, which indicates that the pathological changes of AD start from the frontal area.
4 Conclusion
In this paper, we propose a group-level brain network modeling method (i.e., with inter-subject similarity-guided group sparse representation) for better brain functional network construction. Experimental results show that our method is capable of improving network separability between different groups, which facilitates brain disease diagnosis. The effectiveness of our method has also been proven by significantly improving MCI classification, compared with the traditional Pearson’s correlation or sparse representation based brain network modeling methods. All these suggest the promise of our method for future clinical brain imaging studies, especially for biomarker detection and personalized brain connectomics-based disease diagnosis.
Acknowledgments
This work is partially supported by NIH grants (EB006733, EB008374, EB009634, MH107815, AG041721, and AG042599).
References
- 1.Association, A. Alzheimer’s disease facts and figures. Alzheimers Dement. 2013;9(2):208–245. doi: 10.1016/j.jalz.2013.02.003. [DOI] [PubMed] [Google Scholar]
- 2.Gauthier S, Reisberg B, Zaudig M, Petersen RC, Ritchie K, Broich K, Belleville S, Brodaty H, Bennett D, Chertkow H, Cummings JL. Mild cognitive impairment. Lancet. 2006;367(9518):1262–1270. doi: 10.1016/S0140-6736(06)68542-5. [DOI] [PubMed] [Google Scholar]
- 3.Suk HI, Wee CY, Lee SW, Shen D. Supervised discriminative group sparse representation for mild cognitive impairment diagnosis. Neuroinformatics. 2015;13(3):277–295. doi: 10.1007/s12021-014-9241-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhu X, Suk HI, Wang L, Lee SW, Shen D. A novel relational regularization feature selection method for joint regression and classification in AD diagnosis. Med Image Anal. 2017;38:205–214. doi: 10.1016/j.media.2015.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhu X, Suk HI, Shen D. A novel matrix-similarity based loss function for joint regression and classification in AD diagnosis. NeuroImage. 2014;100:91–105. doi: 10.1016/j.neuroimage.2014.05.078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lee MH, Smyser CD, Shimony JS. Resting-state fMRI: a review of methods and clinical applications. Am J Neuroradiol. 2013;34(10):1866–1872. doi: 10.3174/ajnr.A3263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zhang Y, Zhang H, Chen X, Lee SW, Shen D. Hybrid high-order functional connectivity networks using resting-state functional MRI for mild cognitive impairment diagnosis. Scientific reports. 2017 doi: 10.1038/s41598-017-06509-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chen X, Zhang H, Lee SW, Shen D. Hierarchical High-order functional connectivity networks and selective feature fusion for MCI classification. Neuroinformatics. 2017;15:271–284. doi: 10.1007/s12021-017-9330-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Chen X, Zhang H, Gao Y, Wee CY, Li G, Shen D. High-order resting-state functional connectivity network for MCI classification. Hum Brain Mapp. 2016;37(9):3282–3296. doi: 10.1002/hbm.23240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lee H, Lee DS, Kang H, Kim BN, Chung MK. Sparse brain network recovery under compressed sensing. IEEE Trans Med Imaging. 2011;30(5):1154–1165. doi: 10.1109/TMI.2011.2140380. [DOI] [PubMed] [Google Scholar]
- 11.Yuan M, Lin Y. Model selection and estimation in regression with grouped variables. J R Stat Soc Series B. 2006;68(1):49–67. [Google Scholar]
- 12.Wee CY, Yap PT, Zhang D, Wang L, Shen D. Group-constrained sparse fMRI connectivity modeling for mild cognitive impairment identification. Brain Struct Funct. 2014;219(2):641–656. doi: 10.1007/s00429-013-0524-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Yu R, Zhang H, An L, Chen X, Wei Z, Shen D. Connectivity strength-weighted sparse group representation-based brain network construction for MCI classification. Hum Brain Mapp. 2017;38(5):2370–2383. doi: 10.1002/hbm.23524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zeng LL, Shen H, Liu L, Hu D. Unsupervised classification of major depression using functional connectivity MRI. Hum Brain Mapp. 2014;35(4):1630–1641. doi: 10.1002/hbm.22278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhou T, Bhaskar H, Liu F, Yang J. Graph regularized and locality-constrained coding for robust visual tracking. IEEE Trans Circuits Syst Video Technol. 2016 [Google Scholar]
- 16.Zhu X, Li X, Zhang S, Ju C, Wu X. Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans Neural Netw Learn Syst. 2017;28(6):1263–1275. doi: 10.1109/TNNLS.2016.2521602. [DOI] [PubMed] [Google Scholar]
- 17.Liu M, Zhang D, Shen D. View-centralized multi-atlas classification for Alzheimer’s disease diagnosis. Hum Brain Mapp. 2015;36(5):1847–1865. doi: 10.1002/hbm.22741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zhang Y, Zhou G, Jin J, Zhao Q, Wang X, Cichocki A. Aggregation of sparse linear discriminant analysis for event-related potential classification in brain-computer interface. Int J Neural Syst. 2014;24(1):1450003. doi: 10.1142/S0129065714500038. [DOI] [PubMed] [Google Scholar]
- 19.Zhang Y, Zhou G, Zhao Q, Jin J, Wang X, Cichocki A. Spatial-temporal discriminant analysis for ERP-based brain-computer interface. IEEE Trans Neural Syst Rehabil Eng. 2013;21(2):233–243. doi: 10.1109/TNSRE.2013.2243471. [DOI] [PubMed] [Google Scholar]
- 20.Zhang Y, Zhou G, Jin J, Zhao Q, Wang X, Cichocki A. Sparse Bayesian classification of EEG for brain-computer interface. IEEE Trans Neural Netw Learn Syst. 2016;27(11):2256–2267. doi: 10.1109/TNNLS.2015.2476656. [DOI] [PubMed] [Google Scholar]