Abstract
We present a deep semi-nonnegative matrix factorization method for identifying subject-specific functional networks (FNs) at multiple spatial scales with a hierarchical organization from resting state fMRI data. Our method is built upon a deep semi-nonnegative matrix factorization framework to jointly detect the FNs at multiple scales with a hierarchical organization, enhanced by group sparsity regularization that helps identify subject-specific FNs without loss of inter-subject comparability. The proposed method has been validated for predicting subject-specific functional activations based on functional connectivity measures of the hierarchical multi-scale FNs of the same subjects. Experimental results have demonstrated that our method could obtain subject-specific multi-scale hierarchical FNs and their functional connectivity measures across different scales could better predict subject-specific functional activations than those obtained by alternative techniques.
Keywords: Brain functional networks, Multi-scale, Hierarchical Subject-specific, Deep matrix factorization
1. Introduction
The human brain can be represented as a multiscale hierarchical network [1, 2]. However, existing functional brain network analysis studies of resting-state functional magnetic resonance imaging (rsfMRI) data typically define network nodes at a specific scale, based on regions of interest (ROIs) obtained from anatomical atlases or functional brain parcellations [3–5]. Recent work has demonstrated important subject-specific variation in the functional neuroanatomy of large-scale brain networks [6], emphasizing the need for tools which can flexibly adapt to individuals’ variation while simultaneously maintaining correspondence for group-level analyses.
To capture subject-specific brain network without loss of inter-subject comparability, data-driven brain decomposition methods have been widely adopted to identify spatial intrinsic functional networks (FNs) and estimate functional network connectivity. In order to obtain subject-specific FNs from rsfMRI data of individual subjects while facilitating groupwise inference, independent vector analysis (IVA) and group-information guided ICA (GIGICA) methods have been proposed [7, 8]. More recently, several methods have been proposed to discover FNs from rsfMRI data with non-independence assumptions [9–11], and directly work on individual subject fMRI data and simultaneously enforce correspondence across FNs of different subjects by assuming that loadings of corresponding FNs of different subjects follow Gaussian [9] or delta-Gaussian [10] distributions. Non-negative matrix decomposition techniques have been adopted to simultaneously compute subject-specific FNs for a group of subjects regularized by group sparsity in order to separate anti-correlated FNs properly so that anti-correlation information between them could be preserved [11, 12]. However, these methods are not equipped to characterize multi-scale hierarchical organization of the brain networks [2]. Although clustering and module detection algorithms could be adopted to detect the hierarchical organization of brain networks, their performance is hinged on the network nodes/FNs used [13, 14].
To address the aforementioned limitations of existing techniques, we develop a novel brain decomposition model based on a collaborative sparse brain decomposition approach [11] and deep matrix factorization techniques [15], aiming to identify subject-specific, multi-scale hierarchal FNs from rsfMRI data. Based on rsfMRI data and task activation maps of unrelated subjects from the HCP dataset [16], we have quantitatively evaluated our method for predicting subject-specific functional activations based on functional connectivity measures of the hierarchical multi-scale FNs of the same subjects. Experimental results have demonstrated that the multi-scale hierarchical subject-specific FNs identified by our method from rsfMRI data could better predict the subject-specific functional activations evoked by different tasks than those identified by alternative techniques.
2. Methods
A deep matrix decomposition framework is proposed to identify FNs at multiple scales as schematically illustrated in Fig. 1. Particularly, a deep semi-nonnegative matrix factorization is adopted to jointly detect a hierarchy of FNs from fine to coarse spatial scales in a data-driven way, and a group sparsity regularization term is adopted for FNs of different subjects at each scale to ensure the subject-specific FNs to share similar spatial patterns. Besides enforcing the groupwise correspondence of FNs across subjects, the group sparsity term also encourages FNs at the finest scale to have sparse spatial patterns and FNs at coarse scales to comprise functionally correlated FNs at finer scales. Our decomposition model is further enhanced by a data locality regularization term that makes the decomposition robust to imaging noise and improves spatial smoothness and functional coherences of the subject specific FNs.
2.1. Deep Semi-nonnegative Matrix Factorization for Brain Decomposition
Given rsfMRI data Xi ∈ RT×S of subject i, consisting of S voxels and T time points, we aim to find Kj nonnegative FNs and their corresponding time courses at j = 1, …,h scales, so that ,,. The FNs at 1 ≤ j ≤ h scales are constrained to have a hierarchical structure and to be non-negative so that each FN does not contain any anti-correlated functional units. A deep semi-nonnegative matrix factorization (DSNMF) framework similar to [15] is adopted to identify the multi-scale hierarchical FNs by optimizing
(1) |
The FNs at different scales are represented by , ∀1 ≤ j ≤ h for subject i, and the FNs at 2 consecutive scales are linked hierarchically according to weights determined during the joint decomposition. The decomposition model does not constrain time courses to be non-negative so that it can be applied to preprocessed fMRI data with negative values.
2.2. DSNMF Based Collaborative Brain Decomposition
Given a group of n subjects, each having fMRI data Xi ∈ RT×S, i = 1,…, n, we identify subject-specific, multiscale hierarchical FNs by optimizing a joint model with integrated data fitting and regularization terms as illustrated in Fig. 1:
(2) |
where and are parameters for balancing data fitting and regularization terms, T is the number of time points, Kj is the number of FNs at scale j, nM is the number of spatially neighboring voxels at voxel level, α and β are 2 parameters, Rc,j and are regularization terms. Particularly, Rc,j is an inter-subject group sparsity term that enforces FNs of different subjects to have common spatial structures at the same scale j. The group sparsity regularization term is defined as for each row of , ∀1 ≤ i ≤ n, 1 ≤ j ≤ h where Sj = kj–1 when j > 1 and S1 = S The group sparsity regularization (term 1 in Fig. 1) enforces corresponding FNs of different subjects to have non-zero elements at the same spatial locations. Moreover, it encourages FNs to have spatially localized loadings. It is worth noting that the group sparsity term does not force different FNs to be non-overlapping, thus certain functional units may be included in multiple FNs simultaneously at each scale. We also adopt a data locality regularization term (term 2 in Fig. 1), , to encourage spatial smoothness and functional coherence of the FNs using a graph regularization technique [17] at the finest spatial scale, which is defined as , where is a Laplacian matrix for subject i, is a pairwise affinity matrix to measure spatial closeness or functional similarity between voxels, and is its corresponding degree matrix, the affinity between each pair of spatially connected voxels is calculated as , where is the Pearson correlation coefficient between their rsfMRI signals, and others are set to zero so that has a sparse structure.
We optimize the joint model using an alternative update strategy. When and (k ≠ j) are fixed, is updated as
(3) |
where ☉ denotes element-wise multiplication, repmat (b, r, c) denotes matrix obtained by replicating r and c copies of vector b in theðrow Þand column dimensions, , , Mj:k,. denotes the k−th row of matrix Mj, and Mj:k;s denotes the (k,s)-th element in the matrix. is normalized by the row-wise maximum value along the row dimension after each update iteration.
(4) |
where M† denotes the Moore-Penrose pseudo-inverse of matrix M.
To expedite the convergence of the optimization, a pre-training step at group level is adopted before the joint optimization. In particular, we compute (U1, V1) ⟵ sparseNMF(X) first, where X denotes the temporal concatenated data of {Xi,i =1,…,n} and , respectively. are then used to initialize for the joint optimization. contains a hierarchical structure with overlapping as they are obtained by decomposition in a greedy manner. Once the initialization is done, all FNs at different scales are optimized jointly. We set the parameters α to 1, and β to 10 according to [11] in the present study.
3. Experimental Results
We evaluated our method based on rsfMRI data and task activation maps of 40 unrelated subjects obtained from the Human Connectome Project (HCP) [16], aiming to evaluate the performance of predicting task-evoked activation responses based on functional connectivity measures of FNs at multiple scales. The FNs derived from rsfMRI data have demonstrated promising performance for predicting task-evoked brain activations [18].
The proposed decomposition model was applied to the minimal-preprocessed, cortical gray-coordinates based rsfMRI data. The number of scales was set to 3, and the number of FNs at the first scale was set to 90, which was estimated by MELODIC automatically [19], the numbers of FNs at the 2nd and 3rd scales were set to 50 and 25 respectively, which were decreased by half approximately along the scales.
We compared the proposed model with two alternative decomposition strategies: multi-scale decomposition performed independently at different scales, and greedy agglomerative hierarchical multi-scale decomposition, with the same setting of scales and the same numbers of FNs. For the independent decomposition, the collaborative sparse decomposition model was adopted to obtain subject-specific FNs independent at 3 scales (with independent initialization). For the agglomerative one, the initialization obtained by greedy decomposition as described Sect. 2.2 was adopted for each scale, but the final decomposition at each scale was obtained using the collaborative sparse model separately.
3.1. Multi-scale Brain Functional Networks with a Hierarchical Organization
An example hierarchy of FNs (mean of 40 subjects) corresponding to sensorimotor function obtained by the proposed method is illustrated in Fig. 2(a). The FN at the 3rd scale (top left) comprised the sensorimotor networks and part of visual networks, and was a weighted composition of FNs at the 2nd scale while the FNs at the 2nd scale were composed of FNs at the 1st scale. This example hierarchy of FNs illustrated that FNs corresponding to sensorimotor function gradually merged from fine to coarse scales in the hierarchy. However, no clear hierarchical organization was observed for FNs independently identified at different scales. Particularly, as shown in Fig. 2(b) sensorimotor regions appeared in separate FNs at the coarsest scale (the 3rd), instead of forming a single FN. We postulated that the independent decomposition at different scales favored to better data-fitting and therefore was affected by data noise, while the joint decomposition model was more robust to data noise, facilitating accurate identification of FNs with coherent functions, such as the FN comprising sensorimotor regions shown in Fig. 2(a).
3.2. Prediction of Task Evoked Activations Based on Multi-scale FN Connectivity
As no ground truth is available for FNs derived from rsfMRI data, we evaluated the multi-scale hierarchical FNs for predicting functional activations evoked by different tasks based on their functional connectivity measures with an assumption that better FNs could provide more discriminative information for predicting the brain activations. We also compared FNs obtained using different strategies in terms of their prediction performance.
Particularly, a whole brain voxelwise functional connectivity (FC) map was obtained for each FN by computing voxelwise Pearson correlation coefficient between the FN’s time course and every cortical gray-coordinate’s time course of the rsfMRI data. All the FC maps were then transferred to z-score maps using Fisher Z transformation. All the z-score values of FNs on each cortical gray-coordinate were used as features to predict its activation measures under different tasks of the same subject. Similar to [18], the whole cortical surface was divided into 90 parcels according to FNs obtained at the finest scale, and one ordinary least square model was trained for each parcel and every task event. The prediction performance was evaluated using a leave-one-subject-out cross-validation, where one subject’s activation was predicted by a model built upon data of the remaining 39 subjects. The prediction was conducted using single-scale FNs (90, 50, 25) or multi-scale FNs (165) obtained using different strategies respectively, and the prediction accuracy was evaluated as the Pearson correlation coefficients between the predicted and real activation maps of 47 task events from 7 tasks categories.
The prediction performance of 6 randomly selected task events is illustrated in Fig. 3. For all task events and FNs identified by different strategies, the prediction models built upon multi-scale FNs outperformed all prediction models built upon any single scale FNs alone, indicating that multi-scale FNs could provide complementary information for the task activation prediction. The prediction models built upon multi-scale hierarchical FNs obtained by our method had significantly better performance than those build upon multi-scale FNs obtained by either the independent decomposition or the greedy agglomerative decomposition (Wilcoxon signed rank test, p<0:02), indicating that the joint optimization of multi-scale hierarchical FNs could benefit from each other and characterize the intrinsic FNs better.
4. Conclusions
In this study, we have developed a deep decomposition model to identify multi-scale, hierarchical, subject-specific FNs with group level correspondence across different subjects. Our method is built upon deep semi-nonnegative matrix factorization framework, enhanced by a group sparsity regularization and graph regularization for maintaining inter-subject correspondence and better functional coherence. Experimental results based on rsfMRI data and task activation maps of the same subjects have demonstrated that the multi-scale hierarchical subject-specific FNs could capture informative intrinsic functional networks and improve the prediction performance of task activations evoked by different tasks, compared to FNs identified at different scales independently or in a greedy agglomerative way. In conclusion, our method provides an improved solution for characterizing subject-specific, multi-scale hierarchical organization of the brain functional networks.
Acknowledgements.
This work was supported in part by National Institutes of Health grants [CA223358, EB022573, DK114786, DA039215, and DA039002].
References
- 1.Doucet G, et al. : Brain activity at rest: a multiscale hierarchical functional organization. J. Neurophysiol 105(6), 2753–2763 (2011) [DOI] [PubMed] [Google Scholar]
- 2.Park HJ, Friston K: Structural and functional brain networks: from connections to cognition. Science 342(6158), 1238411 (2013) [DOI] [PubMed] [Google Scholar]
- 3.Bullmore E, Sporns O: Complex brain networks: graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci 10(3), 186–198 (2009) [DOI] [PubMed] [Google Scholar]
- 4.Honnorat N, et al. : GraSP: geodesic graph-based segmentation with shape priors for the functional parcellation of the cortex. Neuroimage 106, 207–221 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Li H, Fan Y: Individualized brain parcellation with integrated functional and morphological information. In: 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI) (2016) [Google Scholar]
- 6.Satterthwaite TD, Davatzikos C: Towards an individualized delineation of functional neuroanatomy. Neuron 87(3), 471–473 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Du Y, Fan Y: Group information guided ICA for fMRI data analysis. Neuroimage 69, 157–197 (2013) [DOI] [PubMed] [Google Scholar]
- 8.Lee JH, et al. : Independent vector analysis (IVA): multivariate approach for fMRI group study. Neuroimage 40(1), 86–109 (2008) [DOI] [PubMed] [Google Scholar]
- 9.Abraham A, Dohmatob E, Thirion B, Samaras D, Varoquaux G: Extracting brain regions from rest fmri with total-variation constrained dictionary learning In: Mori K, Sakuma I, Sato Y, Barillot C, Navab N (eds.) MICCAI 2013. LNCS, vol. 8150, pp. 607–615. Springer, Heidelberg: (2013). 10.1007/978-3-642-40763-5_75 [DOI] [PubMed] [Google Scholar]
- 10.Harrison SJ, et al. : Large-scale probabilistic functional modes from resting state fMRI. Neuroimage 109, 217–231 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Li H, Satterthwaite TD, Fan Y: Large-scale sparse functional networks from resting state fMRI. Neuroimage 156, 1–13 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Li H, Satterthwaite T, Fan Y: Identification of subject-specific brain functional networks using a collaborative sparse nonnegative matrix decomposition method. In: 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI) (2016) [Google Scholar]
- 13.Li H, Fan Y: Hierarchical organization of the functional brain identified using floating aggregation of functional signals. In: 2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI) (2014) [Google Scholar]
- 14.Li H, Fan Y: Functional brain atlas construction for brain network analysis In: SPIE Medical Imaging. SPIE; (2013) [Google Scholar]
- 15.Trigeorgis G, et al. : A deep matrix factorization method for learning attribute representations. IEEE Trans. Pattern Anal. Mach. Intell 39(3), 417–429 (2017) [DOI] [PubMed] [Google Scholar]
- 16.Glasser MF, et al. : The minimal preprocessing pipelines for the human connectome project. Neuroimage 80, 105–124 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cai D, et al. : Graph regularized nonnegative matrix factorization for data representation. IEEE Trans. Pattern Anal. Mach. Intell 33(8), 1548–1560 (2011) [DOI] [PubMed] [Google Scholar]
- 18.Tavor I, et al. : Task-free MRI predicts individual differences in brain activity during task performance. Science 352(6282), 216–220 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Jenkinson M, et al. : Fsl. Neuroimage 62(2), 782–790 (2012) [DOI] [PubMed] [Google Scholar]