Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Nov 10.
Published in final edited form as: Mach Learn Med Imaging. 2020 Sep 29;12436:292–302. doi: 10.1007/978-3-030-59861-7_30

Structural Connectivity Enriched Functional Brain Network using Simplex Regression with GraphNet

Mansu Kim 1, Jingxaun Bao 2, Kefei Liu 1, Bo-yong Park 3, Hyunjin Park 4,5, Li Shen 1
PMCID: PMC8580103  NIHMSID: NIHMS1735587  PMID: 34766171

Abstract

The connectivity analysis is a powerful technique for investigating a hard-wired brain architecture as well as flexible, functional dynamics tied to human cognition. Recent multi-modal connectivity studies had the challenge of combining functional and structural connectivity information into one integrated network. In this paper, we proposed a simplex regression model with graph-constrained Elastic Net (GraphNet) to estimate functional networks enriched by structural connectivity in a biologically meaningful way with a low model complexity. Our model constructed the functional networks using sparse simplex regression framework and enriched structural connectivity information based on GraphNet constraint. We applied our model on the real neuroimaging datasets to show its ability for predicting a clinical score. Our results demonstrated that integrating multi-modal features could detect more sensitive and subtle brain biomarkers than using a single modality.

Keywords: Structural connectivity, functional connectivity, simplex regression, GraphNet, depression

1. Introduction

Connectivity analysis is a powerful technique for investigating a hard-wired brain architecture as well as flexible functional dynamics tied to human cognition [16, 23]. Indeed, the whole-brain structural connectome can be measured via diffusion magnetic resonance imaging (dMRI) data through tractography algorithms by approximating structural wiring in white matter. Functional connectivity is constructed by measuring statistical associations of temporal coherence between different brain regions, and often computed from the resting-state functional MRI (rs-fMRI) data.

In the functional domain, various network modeling methods were introduced to measure the degree of coherence in the functional network [3, 14, 19], such as Pearson correlation (PearC), partial correlation (PartC), and graphical LASSO (GL). These approaches provided a novel perspective for understanding a large-scale functional organization of the brain, which was often used for distinguishing healthy and diseased brains in the studies of psychiatric and neurological disorders [2, 13, 20]. Unlike functional connectivity that infers statistical association, structural connectivity provides information of physical neuronal connections of the complex brain network, which can be used for identifying disrupted physical wiring between distinct brain regions. Recent multi-modal studies found that imaging features combining structural and functional connectivity information provided better imaging biomarkers for common diseases [21, 22], indicating the integration of multi-modal features may help detect more sensitive and subtle brain biomarkers than using a single modality alone.

Recently, some studies have proposed various sparse models to estimate brain networks from structural, functional and/or genomic data. Huang et al. proposed a sparse simplex model (Simplex) to build a brain network using whole brain gene expression data, but their methods did not consider spatial proximity and structural connectivity [7]. Pineda-Pardo et al. applied adaptive GL to estimate an MEG connectivity network guided by a structural connectivity network [15]. Li et al. proposed an ultra-weighted-LASSO approach to efficiently estimate functional networks by considering structural connectivity and derivatives of the temporal signal [10]. These methods incorporated the adaptive LASSO regularization approach to incorporate multi-modal information. However, if one node is linked to two highly connected regions, this approach tends to select only one of the two regions randomly, thus inadequate to capture all the signals.

To over these limitations, in this paper, we proposed a simplex regression model with graph-constrained Elastic Net (GraphNet) to estimate functional networks enriched by structural connectivity in a biologically meaningful way with low model complexity. Our major contributions are as follows: i) We designed a simplex regression model to build a functional network. ii) We extended the simplex regression model to include the GraphNet penalty to incorporate structural connectivity computed from dMRI data using a tractography algorithm. iii) We applied our proposed algorithm to the Human Connectome Project (HCP) database to demonstrate its ability to predict a clinical score and showed the promise of our algorithm compared with multiple competing methods.

2. Materials

2.1. Data description

We obtained neuroimaging (i.e., fMRI and dMRI) and genotyping datasets from the HCP database. Specifically, genetically unrelated, non-twins, non-Hispanic, white participants with full demographic information were considered in this study; see Table 1 for their characteristics. Of those, we randomly selected 100 participants and divided them into two groups (depression vs healthy) with equal size based on the Diagnostic and Statistical Manual of Mental Disorders 5th edition depression (DSM-dep) scores. Participants with a DSM-dep larger than 6 were classified as depression subjects, and the remaining participants were classified as healthy subjects [12]. The age, sex, and mini-mental state examination were matched between healthy and depression groups.

Table 1.

Participant characteristics.

Healthy Depression p-value
Number of subjects 50 50
Age 29.06 ± 3.89 28.66 ± 3.54 0.5925
Sex M:28, F:22 M:24, F:26 0.4284
DSM-dep 0.66 ± 0.47 8.44 ± 2.22 <0.0001
Mini-Mental State Examination 29.16 ± 0.91 28.86 ± 1.16 0.1538

2.2. Data pre-processing

HCP database provided minimally pre-processed neuroimaging (i.e., rs-fMRI and dMRI) and genotyping datasets. For the rs-fMRI data, the CIFTI dense time series data in standard grayordinate space were obtained using the minimal pre-processing pipeline, which includes corrections for EPI distortions and head motion, registration to the T1-weighted data and subsequently MNI space, skull removal, and intensity normalization [5]. Then, the artifacts of head movement, white matter, cardiac pulsation, arterial, and large vein related contributions were removed by FMRIB’s ICA-based X-noisifier (FIX) [17]. Finally, we averaged the vertex-wise time-series into parcel-level using HCP multi-modal parcellation atlas (HCP-MMP) [4]. We added 12 subcortical regions to the 360 cortical areas yielding a total of 372 brain regions.

The dMRI data were processed using a procedure similar to the one described by Kim et al. [8]. Head motion and eddy current were corrected and then probabilistic tractography was performed to build structural connectivity using FSL [18]. Graph nodes were defined by HCP-MMP atlas and the edges were defined using connection density between the nodes. Finally, the structural connectivity information was used as the constraint in our proposed approach described later.

3. Methods

3.1. Simplex representation

Herein, we used the boldface lowercase letter to denote a vector, and the boldface uppercase letter to denote a matrix. Specifically, given the datasets Xn×p, where X corresponded to the pre-processed rs-fMRI data as described in Section 2.2., n denoted time points of rs-fMRI data, and p denoted number of brain regions. The sparse simplex learning model proposed by Huang et al. was originally proposed to construct the neuroanatomical and transcriptomic networks [7]. The model is defined as follows:

β˜i=minβiX(:,i)X(:,i)βi22+λβi1s.t. βi0,βiT1=1, (1)

where X(:,≠i) is the matrix X with the i-th column (i.e., i-th region) removed, X(:,i) is the i-th column of X, βi is p − 1 dimensional coefficient vector for the i-th brain region, and ≥ denotes “componentwise larger than or equal to”. One advantage of the simplex regression model is that the simplex constraint yields network edge weights (i.e., regression coefficients) that can be treated as probability values.

3.2. Functional brain network construction with simplex regression framework and GraphNet constraint

We herein proposed an algorithm for constructing functional network enriched by structural connectivity. Fig. 1 showed overall procedure of our approach. Specifically, we proposed a sparse simplex regression model penalized by GraphNet penalty. GraphNet penalty, proposed by Grosenick et al., has an advantage for integrating biological graph constraint, such as structural connectivity, by encouraging the coefficients to be similar between two highly connected nodes [6]. For example, when the structural connectivity between the i-th and j-th regions is high, the GraphNet penalty forces the corresponding coefficients to be similar. Additionally, the structural connectivity contains white matter fiber information, which is a good source for a biological constraint. Thus, we applied the GraphNet penalty on the sparse simplex regression model. The formula for the algorithm is defined as follows:

β˜i=minβiX(:,i)X(:,i)βi22+λβi1+λGβiTLSCβis.t. βi0,βiT1=1, (2)

where, LSC is the Laplacian matrix of the structural connectivity C with the i-th brain region excluded. The Laplacian matrix is defined as LSC = DC, where D is the degree matrix of structural connectivity. Since simplex constraint yield l1 penalty to be the constant term, we can rewrite eq. (2) as the following:

β˜i=minβiX(:,i)X(:,i)βi22+λGβiTLSCβi s.t. βi0,βiT1=1. (3)

The constraints in Eq. (3) is also simplex, so we can optimize it using the accelerated projected gradient method, as described in the next section.

Fig.1.

Fig.1.

The overall procedures of proposed algorithm.

We constructed the functional network by repeating the proposed algorithm p times for every brain region. The constructed network S=[β1˜,β2˜,,βp˜] is sparse and asymmetric. The βi˜ denoted a p dimensional vector, zero was inserted for the i-th coefficient of estimated coefficients vector βi. We defined the symmetric functional network by replacing S(i,j) and S(j,i) with the maximum value between them.

3.3. Optimization details and the proposed algorithm

In this section, we describe an iterative algorithm for minimizing the cost function. The cost function, Eq. (3), can be solved by taking the derivative with respect to βi and setting it to zero:

X(:,i)TX(:,i)βi+λGLSCβiX(:,i)TX(:,i)=0. (4)

Thus, the solution can be obtained by solving Eq. (4) as follows:

β˜i=(X(:,i)TX(:,i)+λGLSC)1X(:,i)TX(:,i). (5)

Next, we applied the accelerated projected gradient methods to solve the simplex problem as follow:

minβi12βiv22s.tβi0,βiT1=1, (6)

where v denoted βi˜. We rewrite the Eq (6) using unconstrained formulation as

12βiv22γ(βiT11)λTβi, (7)

where γ and λ is a Lagrangian multiplier and Lagrangian multiplier vector, respectively, both of which are to be determined. Suppose the optimal solution to the proximal problem (6) is β*, the associated Lagrangian multipliers are γ* and λ*. We then derived the following equations, according to the KKT conditions [1]:

j,βij*vjγ*λj*=0, (8)
j,βij*0, (9)
j,λj*0, (10)
j,βij*λj*=0, (11)

where βij* denoted the j-th element of βi*. We can rewrite Eq. (8) as βij*vjγ*1λj*=0. We have γ*=11Tv1Tλ*n using the constraint βiT1=1 and derive β*=(v1n1Tv+1n11Tλ*n1)+λ*. We rewrite it as β*=u+λ*λ*¯1, where λ*¯=1Tλ*n and u=v1n1Tv+1n1. Thus, ∀j we have

βij*=uj+λj*λ*¯. (12)

According to Eqs. (9)–(12), we have uj+λj*λ*¯=(ujλ*¯)+, where x+ = max(x, 0). We then have βij*=(ujλ*¯)+. Therefore, given we know λ*¯, we can compute the optimal solution β*.

To obtain λ*¯, we rewrite Eq. (12) as λj*=λ*¯+βij*uj. According to Eqs. (9)–(11), we have λj*=(λ*¯uj)+. Since v is a p − 1 dimensional vector, we have λ*¯=1p1j=1p1(λ*¯uj)+. Thus, we define a function as follow:

f(λ*¯)=1p1j=1p1(λ*¯uj)+λ¯, (13)

and we obtain λ*¯ by solving Eq. (13) to be zero. Since λ* ≥ 0, f(λ*¯)0, and f(λ*¯) is a piecewise linear and convex function, we can compute the root of f(λ*¯) using Newton method efficiently.

4. Experiments and Results

4.1. Experimental results on Human Connectome Project data

In our experiments, we used pre-processed rs-fMRI and dMRI data, as described in Section 2.2. The rs-fMRI pre-processing procedure resulted in 2, 400 time points for 372 regions, and the dMRI pre-processing yielded structural connectivity matrix (i.e., 372 × 372 matrix) based on probability tractography for each subject.

We applied the proposed model to construct functional networks. We compared our approach with five different functional network construction methods (i.e., PearC, PartC, GL, GL with structural connectivity [GLs], and simplex) according to previous studies [3, 9, 11]. When we constructed functional networks using our algorithm, GL, and GLs, we applied different sets of hyper-parameters for each model. We used [1, 10, 100, 1000, 10000] for the proposed algorithm, [0.01, 0.1, 1, 10, 100] for GL and GLs.

Overall, the constructed functional networks showed similar patterns except the PerC and SC, as shown in Fig. 2. We noted that our approach yielded more relevant and sparser patterns in sub-cortical structures compared to the methods using either fMRI and dMRI alone, as shown in the white box of Fig. 2. We compared the sparsity (ratio of non-zero connections in the functional network) for all methods. Fig. 3(a) shows the boxplots of sparsity across all subjects for each method. The sparsity decreased for every method, as we increase the hyper-parameter value. For our approach, the sparsity did not change much when λ was larger than 100. The GL and GLs resulted in networks with almost zero sparsity, when λ was larger than 1. We also evaluated the similarity of the network patterns by reporting the pair-wise correlation among each method. We computed the correlation for network edge weights across subjects between two different methods (Fig. 3-(b)). Interestingly, the networks based on our approach were very similar to those of partial correlation networks and the networks with GL. Finally, our results suggested that our approach constructed relatively sparse networks that are to some extent consist with existing methods but with better enriched sub-cortical structural connectivity information.

Fig.2. Example of functional network map for each model.

Fig.2.

The first and second row are the functional network for depression and healthy group, respectively. The third row shows the absolute difference map between depression and healthy group. The fourth row shows an enlarged portion of each absolute difference map.

Fig.3. The sparsity and similarity comparison of networks.

Fig.3.

Sub-figure (a) and (b) denoted a boxplot of the sparsity for each model and pair-wise correlation between models with varying hyper-parameters.

4.2. Evaluations using prediction task

In this section, we demonstrated and compared the efficacy of the constructed networks. However, there is no ground-truth for network constructions, thus we cannot directly compare the performances of all tested methods. Instead, we used the prediction task to compare the performances among different functional network construction methods. We first selected relevant features based on two-sample t-tests and built the ridge regression model to predict a DSM-dep score on the training set. A nested 10-fold-cross-validation was conducted to test the prediction performance. Specifically, the hyper-parameter for ridge regression was tuned using 10-fold-cross-validation on the training set. The trained prediction model was applied to testing set to measure the prediction performance.

After 10-fold-cross-validation, there were 349.98 ± 6.87 features were selected with λ = 1000. The lowest root-mean-square-error (RMSE) of 3.783 was obtained using our algorithm with λ = 1000. Our algorithm also led to the highest correlation of 0.475 between the actual and predicted DSM5-Depression scores. Additionally, the GLs with λ = 0.01 yielded a RMSE of 5.117 and a correlation of 0.279, the GL with λ = 0.01 yielded a RMSE of 4.113 and correlation of 0.208, and the PartC yielded a RMSE 4.218 and correlation of 0.227. The detailed performances of seven different models with varying parameters were shown in Table 2. Interestingly, functional networks from GL and GLs with λ = 0.01, and our algorithm with λ = 1000 showed similar network pattern and sparsity. However, our algorithm showed higher prediction performance than those from GL and GLs. Thus, we believe our approach leads to a more robust network with sparser connections.

Table 2.

The prediction performances of various methods. Nested 10-fold-cross-validation was conducted to select the features and tune the hyper-parameters. The performance was reported in terms of RMSE and correlation coefficients (CC) between actual and predicted DSM-dep score.

Method RMSE CC Method RMSE CC
PearC 5.077 0.110 GLs (λ = 1) 5.252 −0.248*
PartC 4.218 0.227* GLs (λ = 10) - -
SC 4.597 0.139 GL (λ = 100) - -
GL (λ = 0.01) 4.113 0.208* Ours (λ = 1) 4.799 0.119
GL (λ = 0.1) 5.153 −0.065 Ours (λ = 10) 4.557 0.261*
GL (λ = 1) - - Ours (λ = 100) 4.191 0.240*
GL (λ = 10) - - Ours (λ = 1000) 3.783 0.475*
GL (λ = 100) - - Ours (λ = 10000) 4.119 0.232*
GLs (λ = 0.01) 5.117 0.279* Simplex 4.548 0.110
GLs (λ = 0.1) 4.711 0.152

The asterisk denoted the CC with p < 0.05.

5. Conclusion

In this work, we proposed a simplex regression model with GraphNet penalty to estimate functional networks enriched by structural connectivity. We demonstrated the feasibility of our algorithm on the HCP database. Compared to the existing methods, our model has two advantages. First, the functional network based on simplex regression can be interpreted as a probability, which can help further analysis. Second, the simplex representation with GraphNet can efficiently combine structural and functional information. Furthermore, we validated our proposed algorithm on real neuroimaging data and compared the results with those obtained using the existing competing methods.

In the future, we will further look into generating the whole-brain connectivity at once by applying the matrix optimization algorithm. Furthermore, there is no ground-truth for network constructions. Thus, our results should be further confirmed by future independent replications.

Acknowledgments

This work was supported by the National Institutes of Health [R01 EB022574] and National Science Foundation [IIS 1837964]. Data were provided by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

References

  • 1.Boyd S, Vandenberghe L: Convex Optimization. Cambridge University Press; (2004) [Google Scholar]
  • 2.Damaraju E, Allen E, Belger A, Ford J, McEwen S, Mathalon D, Mueller B, Pearlson G, Potkin S, Preda A, Turner J, Vaidya J, van Erp T, Calhoun V: Dynamic functional connectivity analysis reveals transient states of dysconnectivity in schizophrenia. NeuroImage: Clinical 5, 298–308 (2014) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Friedman J, Hastie T, Tibshirani R: Sparse inverse covariance estimation withthe graphical lasso. Biostatistics 9(3), 432–441 (2008) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Glasser M, C.T.R.E.e.a.: A multi-modal parcellation of human cerebral cortex. Nature 536, 171–178 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Glasser MF, Sotiropoulos SN, Wilson JA, Coalson TS, Fischl B, Andersson JL, Xu J, Jbabdi S, Webster M, Polimeni JR, et al. : The minimal preprocessing pipelines for the human connectome project. Neuroimage 80, 105–124 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Grosenick L, Klingenberg B, Katovich K, Knutson B, Taylor JE: Interpretable whole-brain prediction analysis with graphnet. NeuroImage 72, 304–321 (2013) [DOI] [PubMed] [Google Scholar]
  • 7.Huang H, Yan J, Nie F, Huang J, Cai W, Saykin AJ, Shen L: A new sparse simplex model for brain anatomical and genetic network analysis. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 625–632 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kim M, Won JH, Youn J, Park H: Joint-connectivity-based sparse canonicalcorrelation analysis of imaging genetics for detecting biomarkers of parkinson’s disease. IEEE Transactions on Medical Imaging 39(1), 23–34 (2020) [DOI] [PubMed] [Google Scholar]
  • 9.Li K, Guo L, Nie J, Li G, Liu T: Review of methods for functional brain connectivity detection using fmri. Computerized Medical Imaging and Graphics 33(2), 131–139 (2009) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li Y, Liu J, Luo M, Li K, Yap PT, Kim M, Wee CY, Shen D: Structural connectivity guided sparse effective connectivity for mci identification. In: International Workshop on Machine Learning in Medical Imaging. pp. 299–306 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Marrelec G, Krainik A, Duffau H, Pélégrini-Issac M, Lehéricy S, Doyon J,Benali H: Partial correlation for functional brain interactivity investigation in functional mri. Neuroimage 32(1), 228–237 (2006) [DOI] [PubMed] [Google Scholar]
  • 12.Nedley N, Ramirez FE: Nedley depression hit hypothesis: identifying depressionand its causes. American Journal of Lifestyle Medicine 10(6), 422–428 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Park B.y., Seo J, Park H: Functional brain networks associated with eatingbehaviors in obesity. Scientific reports 6(1), 1–8 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Pervaiz U, Vidaurre D, Woolrich MW, Smith SM: Optimising network modelling methods for fmri. NeuroImage 211, 116604 (2020) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Pineda-Pardo JA, Bruña R, Woolrich M, Marcos A, Nobre AC, Maestú F,Vidaurre D: Guiding functional connectivity estimation by structural connectivity in meg: an application to discrimination of conditions of mild cognitive impairment. Neuroimage 101, 765–777 (2014) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rubinov M, Sporns O: Complex network measures of brain connectivity: usesand interpretations. Neuroimage 52(3), 1059–1069 (2010) [DOI] [PubMed] [Google Scholar]
  • 17.Salimi-Khorshidi G, Douaud G, Beckmann CF, Glasser MF, Griffanti L,Smith SM: Automatic denoising of functional mri data: combining independent component analysis and hierarchical fusion of classifiers. Neuroimage 90, 449–468 (2014) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Smith SM, Jenkinson M, Woolrich MW, Beckmann CF, Behrens TE,Johansen-Berg H, Bannister PR, Luca MD, Drobnjak I, Flitney DE, Niazy RK, Saunders J, Vickers J, Zhang Y, Stefano ND, Brady JM, Matthews PM: Advances in functional and structural mr image analysis and implementation as fsl. NeuroImage 23, S208–S219 (2004) [DOI] [PubMed] [Google Scholar]
  • 19.Smith SM, Miller KL, Salimi-Khorshidi G, Webster M, Beckmann CF,Nichols TE, Ramsey JD, Woolrich MW: Network modelling methods for fmri. NeuroImage 54(2), 875–891 (2011) [DOI] [PubMed] [Google Scholar]
  • 20.Stam CJ, Jones B, Nolte G, Breakspear M, Scheltens P: Small-world networks and functional connectivity in alzheimer’s disease. Cerebral cortex 17(1), 92–99 (2007) [DOI] [PubMed] [Google Scholar]
  • 21.Wee CY, Yap PT, Zhang D, Denny K, Browndyke JN, Potter GG, Welsh-Bohmer KA, Wang L, Shen D: Identification of mci individuals using structural and functional connectivity networks. Neuroimage 59(3), 2045–2056 (2012) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Werring D, Clark C, Barker G, Miller D, Parker G, Brammer M, Bullmore E, Giampietro V, Thompson A: The structural and functional mechanisms of motor recovery: complementary use of diffusion tensor and functional magnetic resonance imaging in a traumatic injury of the internal capsule. Journal of Neurology, Neurosurgery & Psychiatry 65(6), 863–869 (1998) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zhang Z, Liao W, Chen H, Mantini D, Ding JR, Xu Q, Wang Z, Yuan C, Chen G, Jiao Q, et al. : Altered functional–structural coupling of large-scale brain networks in idiopathic generalized epilepsy. Brain 134(10), 2912–2928 (2011) [DOI] [PubMed] [Google Scholar]

RESOURCES