Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Nov 9.
Published in final edited form as: Proc IEEE Int Symp Biomed Imaging. 2020 May 22;2020:288–291. doi: 10.1109/isbi45749.2020.9098552

A GENERALIZED FRAMEWORK OF PATHLENGTH ASSOCIATED COMMUNITY ESTIMATION FOR BRAIN STRUCTURAL NETWORK

Yurong Chen 1, Haoteng Tang 1, Lei Guo 1, Jamie C Peven 2, Heng Huang 1, Alex D Leow 3, Melissa Lamar 4,5, Liang Zhan 1
PMCID: PMC7652406  NIHMSID: NIHMS1640247  PMID: 33173559

Abstract

Diffusion MRI-derived brain structural network has been widely used in brain research and community or modular structure is one of popular network features, which can be extracted from network edge-derived pathlengths. Conceptually, brain structural network edges represent the connecting strength between pair of nodes, thus non-negative. The pathlength. Many studies have demonstrated that each brain network edge can be affected by many confounding factors (e.g. age, sex, etc.) and this influence varies on each edge. However, after applying generalized linear regression to remove those confounding’s effects, some network edges may become negative, which leads to barriers in extracting the community structure. In this study, we propose a novel generalized framework to solve this negative edge issue in extracting the modular structure from brain structural network. We have compared our framework with traditional Q method. The results clearly demonstrated that our framework has significant advantages in both stability and sensitivity.

Index Terms: brain, structural network, community structure, diffusion MRI, generalized linear regression

1. INTRODUCTION

Non-invasive MRI is one of the most popular tools to study the brain. Traditional MRI studies focus on voxelwise or region-of-interest (ROI)-based measures. Considering most brain disease-induced deterioration is a systematic process [1], these MRI measures are thus suboptimal. The brain functional [2] and structural networks [3], derived from functional MRI and diffusion-weighted MRI respectively, have the potential to gain system-level insights into the mechanisms of brain diseases, thus providing a novel platform for developing new diagnostic strategies [4]. Over the last decade, the brain network approach has emerged as a new omics approach that can potentially revolutionize our understanding of the brain by showing how complex brain activities interact in both time and space.

Currently, standard network analyses commonly involve the comparison of summary graph-theoretical metrics such as clustering coefficient, etc. While easy to compute and widely popular among network researchers, these simple scalar statistics defined at either the nodal or the global level are problematic in that they reduce the complex network data to a dimension of 1 and discard important informative graph structure encoding complex network inter-relationship for every intermediate scale between nodes and the entire graph. Just as social networks can be divided into cliques that describe modes of association (e.g. family, school) [5, 6], a brain network can be divided into modules or communities. Modules contain a collection of nodes that are densely interconnected (via edges) with one another but weakly connected with nodes in other modules. Thus, modularity or community structure best describes the intermediate scale of network organization, rather than the global or local scale. In many networks, modules can be divided into smaller sub-modules, thus can be said to exhibit hierarchical modularity or near decomposability.

The concept of community structure has been extensively used in many areas, not just brain network [7, 8]. For example, [9] adopted this concept to understand species geographic distributions with the relative abundance of coexisting species. Bader et al. [10] detected the phylogenetic and morphological global desert rodent communities using community structure patterns. Moreover, email networks provide a precise and nonintrusive depiction of the human organization information which analyzed as the social network of selfsimilar community structure [11]. Therefore, identifying the community structure in large networks is always a hot topic [12, 13]. However, there is one important issue, especially in brain network, which has not been well addressed in the literatures.

In the network, each single edge represents some mathematical relationship between pair of nodes and it’s a common sense that each edge can be affected by different confounding factors to some degree. For example, the edge in brain structural network is extracted based on the fiber bundles and represents the connecting strength between pair of brain ROIs and this connecting strength can be affected by age [14, 15], sex [16, 17], health status [18, 19], and biological knowledge [20] etc. Therefore, directly extracting the community structure from raw network without accounting for these confounding factors could be biased. The most popular approach to remove confounding factors is using generalized linear regression (GLR). However, GLR-adjusted edge values may become negative, which leads to barriers in extracting the modular structure[21].

Currently the Louvain algorithm [22] is one of the most popular methods for the community extraction. Although the implementation of Louvain algorithm in Brain Connectome Toolbox (BCT, https://sites.google.com/site/bctnet/) [23] can take into account the negative edges in the network, however, the result is not stable. (We will demonstrate this in Section 3.3). In this paper, we proposed a new generalized framework to extract the community structure from GLR-adjusted network.

2. METHODS

The proposed method is a generalized version of Path Length Associated Community Estimation (PLACE) algorithm [24]. The main idea of PLACE is to maximize the difference between inter-community versus intra-community path lengths (Eq. 1):

{ψPL=interPLCiCj12(intraPLCi+intraPLCj)interPLC1C2=nCi,mCjdnmNiNjintraPLCi=n,mCidnm(Ni2Ni)/2 (1)

Here the dnm denotes shortest distance or path length between node n and node m. Thus, maximizing ψPL is searching for a partition with stronger intra-community integration and stronger between-community separation. Since the original PLACE was designed for diffusion MRI-derived brain structural network, the dnm is derived using network edges, which are all non-negative values. dnm = 0 if node n cannot be reached from node m. If node n can be reached directly from node m, dnm = 1/wnm and wnm is the connecting strength (e.g. number of fibers) between two nodes. If node n can be reached indirectly from node m, dnm can be defined by the shortest pathlength between nodes.

Since each network edge can be affected by confounding factors and this influence varies on each edge, we adopt generalized linear regression model to remove confounding’s effects from each edge. (Eq. 2)

Y=β0+β1X1+β2X2++βnXN+R (2)

Here, Y=[yij1,yij2,,yijM]T represents the edge value between node i and node j for M subjects. Xk = [xk1, xk2, …, xkM]T (k = 1,2, …, N) is the kth confounding factor for M subjects. R=[Rij1,Rij2,,RijM]T are the regression residuals for edge (i, j) for M subjects. T represents transpose. All the confounding factors, except categorized variables (e.g. sex, race, etc.), are all normalized using Z score. After applying GLR, the residual R will replace Y to become the new network edge value and in theory R is free of confounding’s influences.

However, the pathlength cannot be computed from the new GLR-adjusted network because of the negative values caused by GLR. Here we propose to use Euclidean Distance (ED) to define the nodal distance (Eq. 3).

EDnm=i=1,in,imPRniRmi2 (3)

Here EDnm is the Euclidean distance between node n and node m, thus non-negative. P is the total number of nodes and R is GLR-adjusted value. The lager EDnm value means the longer distance between nodes, and EDnm → 0 means two nodes are closer to each other. Therefore, EDnm can replace dnm in Eq. 1.

In summary, the proposed framework adopts GLR to remove confounding factors’ influence from each edge and uses nodal Euclidean distance to replace the nodal distance. The resulted ED matrix will be fed into PLACE algorithm to extract the modular structure. Fig. 1 illustrates the pipeline of the proposed framework.

Fig. 1.

Fig. 1.

The pipeline of the proposed framework which includes three main steps. The first step is to remove confounding factors’ influence on each network edge separately using the generalized linear regression model. The second step is to quantify the pairwise nodal distance using Euclidean distance; then the last step is to adopt PLACE [24] to derive the community structure.

3. RESULTS AND DISCUSSIONS

3.1. Data description and network reconstruction

In this study, at the final inclusion criteria consisted of an absence of psychiatric symptoms based on the SCID and a score ⩽ 8 on the HAM-D, as well as an MMSE ⩾ 24, we analyzed totally 106 subjects (mean age=67.85±6.72, 55 men) from [25]. For each subject, diffusion-weighed MRI (dMRI) was acquired using 2-D spin-echo EPI sequence (FOV=20mm; voxel size=0.78×0.78×3.0mm3; TR/TE=5,525/93.5ms; flip angle=90°). 40 contiguous axial slices aligned to the AC-PC line were collected in 32 gradient directions with b=1400s/mm2 and 6 b0 images. T1-weighted MRIs were acquired using a spoiled gradient recalled-echo (SPGR) sequence (FOV=22cm; 120 interleaved axial slices 1.5mm thick; TR/TE=12.0/5.3ms; flip angle=13°; and voxel size=0.42×0.42×1.5mm3). We used FreeSurfer (V6.0) (https://surfer.nmr.mgh.harvard.edu) to parcellate T1-weighted MRI into 82 ROIs and adopted Fiber Assignment by Continuous Tracking algorithm [26] to extract wholebrain tractography from dMRI. The brain structural network was then reconstructed using the standard pipeline described in [27].

3.2. Extracting the modular structure

Each subject underwent the Judgement of Line Orientation Test (JLO) [28], which measures a person’s ability to match the angle and orientation of lines in space. The Oblique Line Errors is defined as the total number of incorrect oblique lines normalized by the total number of possible oblique line responses. We used the z score of Oblique Line Errors (zOblique) to split 106 subjects into two groups: low vs. high zOblique error group. Low zOblique error group has 52 subjects (mean age=67.84±6.83, 22 females) and high zOblique error group has 54 subjects (mean age=67.87±6.59, 33 females). Each subject’s network dimension is 82×82. We selected Age, Race and Years of Education as the confounding factors. For Race, we use 0 to represent black, 1 as Latinx and 3 as white. Age and Years of Education are Z scored. We applied GLR to remove the confounding factors’ influence on each edge, then applied the Louvain algorithm to extract the modular structure as the baseline result. Then we applied our new framework (Section 2) to extract the modular structure. Fig. 2 illustrates the community structures. The modular structure not only varies between groups but also between methods.

Fig. 2.

Fig. 2.

The derived community structures. A and B for low and high zOblique error group respectively using our new method; C and D for low and high zOblique error group respectively using Louvain method. Each color indicates a unique community branch and there are totally 8 community branches in each row.

3.3. Stability analysis

We adopted normalized mutual information (NMI) [29] to evaluate the stability of the results. The NMI equals to 1 means two results are exact same, and NMI less than 1 means there are some differences among results. We repeated our method and Louvain method 100 times on each subject, and our method achieved mean NMI = 1.00 ± 0.00 while Louvain method achieved mean NMI = 0.91 ± 0.04 which presents unstable results.This clearly suggests that our method has an advantage in the stability and Louvain result is not reliable, which is consistent with previous literature [30].

3.4. Identifying group differences in modular structure

For each branch that was clustered by our new methods in the community structure, Hotelling T test [31] was used to quantify the group difference and false discovery rate (FDR) [32] was adopted to correct for multiple comparison. By using the proposed method, isthmus cingulate in left hemisphere (highlighted using red color in Fig. 3) shows a significant difference (P value=1.04E-4) in the community assignment or membership. The lh-cuneus, lh-paracentral, lh-postcentral, lh-posteriorcingulate, lh-precentral, and lh-precuneus cortex are shared by both high and low zOblique error groups and highlighted using blue color in Fig. 3. The green regions in Fig. 3 include lh-caudalmiddlefrontal, lh-parsopercularis and lh-superiorfrontal cortex and they are associated with red region only in low zOblique error group. The orange regions in Fig. 3 include lh-frontalpole, rh-isthmuscingulate, rh-postcentral, and rh-precuneus cortex and they are associated with red region only in high zOblique error group.

Fig. 3.

Fig. 3.

The connectome modular differences between groups. The isthmus cingulate in left hemisphere (highlighted using red color) shows the significant difference (P value=1.04E-4) in the community assignment; the orange region is associated with high zOblique error group; the green region is associated with low zOblique error group; the blue region is shared by both groups.

In the meantime, no significant group difference can be identified in the community structure derived using Louvain method that is it cannot find any different community among those different subject groups.

The proposed method has two advantages. Firstly, this method can be easily modified to test any confounding factors hypothesis, in other words, is very flexible in adjusting the GLR model. In the current study, we only selected three confounding factors (Age, Year of Education and Race) for the GLR model for the purpose of the demonstration and we can examine some other confounding factors (e.g. sex) as well. Secondly, this method can be easily applied to other type of networks for exploring any factor impacts, not just brain structural network. For example, we can apply GLR on functional network and use similar strategy to define the ED on the residue matrix, and then apply PLACE to extract the modular structure.

4. CONCLUSION

In this paper, we proposed a generalized framework to extract the modular structure after accounting for confounding factors’ heterogeneous effects on each brain network edge. Compared with traditional Louvain method, our results demonstrated that the proposed method has significant advantages in stability, sensitivity and application scope.

5. ACKNOWLEDGMENT

This study is funded in part by NIH AG056782, AG040192, AG048176, and NSF-IIS 1837956.

6. REFERENCE

  • [1].Langlois JA, Rutland-Brown W, and Wald MM, “The epidemiology and impact of traumatic brain injury: a brief overview,” The Journal of head trauma rehabilitation, vol. 21, no. 5, pp. 375–378, 2006. [DOI] [PubMed] [Google Scholar]
  • [2].Finn ES et al. , “Functional connectome fingerprinting: identifying individuals using patterns of brain connectivity,” Nat Neurosci, vol. 18, no. 11, pp. 1664–71, November 2015, doi: 10.1038/nn.4135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Leow A et al. , “Altered structural brain connectome in young adult fragile X premutation carriers,” Hum Brain Mapp, vol. 35, no. 9, pp. 4518–4530, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Fortel Igor et al. , “Brain Dynamics Through the Lens of Statistical Mechanics by Unifying Structure and Function,” MICCAI2019, pp. 503–511, 2019 [Google Scholar]
  • [5].Dias AM, “Reconsidering the modularity of social cognition in the human brain,” Frontiers in behavioral neuroscience, vol. 3, p. 55, 2009, doi: 10.3389/neuro.08.055.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Kurzban R and Aktipis CA, “Modularity and the social mind: are psychologists too self-ish?,” Pers Soc Psychol Rev, vol. 11, no. 2, pp. 131–49, May 2007, doi: 10.1177/1088868306294906. [DOI] [PubMed] [Google Scholar]
  • [7].Tang Haoteng et al. , “Classifying Stages of Mild Cognitive Impairment via Augmented Graph Embedding,” MBIA2019, MFCA2019, pp. 30–38, 2019 [Google Scholar]
  • [8].Zhang Yanfu et al. , “Integrating Heterogeneous Brain Networks for Predicting Brain Disease Conditions,” MICCAI 2019, pp.214–222, 2019. [Google Scholar]
  • [9].Montesinos-Navarro A et al. , “Community structure informs species geographic distributions,” Plos One, vol. 13, no. 7, July 9 2018, doi: 10.1371/journal.pone.0200556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Alhajeri BH and Steppan SJ, “Community structure in ecological assemblages of desert rodents,” (in English), Biol J Linn Soc, vol. 124, no. 3, pp. 308–318, July 2018, doi: 10.1093/biolinnean/bly068. [DOI] [Google Scholar]
  • [11].Guimera R, Danon L, Diaz-Guilera A, Giralt F, and Arenas A, “Selfsimilar community structure in a network of human interactions,” Phys Rev E, vol. 68, no. 6, December 2003, doi: 10.1103/PhysRevE.68.065103. [DOI] [PubMed] [Google Scholar]
  • [12].Ken W and Toshiyuki T, “Finding community structure in mega-scale social networks,” in Proceedings of the 16th international conference on World Wide Web 978-1-59593-654-7, ed. Banff Alberta, Canada: ACM, 2007, pp. 1275–1276. [Google Scholar]
  • [13].Clauset A, Newman MEJ, and Moore C, “Finding community structure in very large networks,” Phys Rev E, vol. 70, no. 6, December 2004, doi: 10.1103/PhysRevE.70.066111. [DOI] [PubMed] [Google Scholar]
  • [14].Douaud G et al. , “A common brain network links development, aging, and vulnerability to disease,” Proceedings of the National Academy of Sciences, vol. 111, no. 49, pp. 17648–17653, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Boots Elizabeth A. et al. , “Cardiovascular disease risk factors, tractbased structural connectomics, and cognition in older adults,” NeuroImage, vol. 196, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Ingalhalikar M et al. , “Sex differences in the structural connectome of the human brain,” Proceedings of the National Academy of Sciences, vol. 111, no. 2, pp. 823–828, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Ajilore O, Vizueta N, Walshaw P, Zhan L, Leow A, and Altshuler LL, “Connectome signatures of neurocognitive abnormalities in euthymic bipolar I disorder,” Journal of psychiatric research, vol. 68, pp. 37–44, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Conrin SD et al. , “From Default Mode Network to the Basal Configuration: Sex Differences in the Resting-State Brain Connectivity as a Function of Age and Their Clinical Correlates,” Front Psychiatry, vol. 9, pp. 365, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Zhang A et al. , “White matter tract integrity of anterior limb of internal capsule in major depression and type 2 diabetes,” Neuropsychopharmacology, vol. 38, no. 8, p. 1451, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].Zhang Yanfu et al. , “ Biological Knowledge Guided Deep Neural Network for Brain Genotype-Phenotype Association Study,” MBIA 2019, MFCA 2019, pp. 84–92, 2019 [Google Scholar]
  • [21].Schwarz Adam J., McGonigle John E “Negative edges and soft thresholding in complex network analysis of resting state functional connectivity data,” NeuroImage, vol. 55, pp. 1132–1146, 2011. [DOI] [PubMed] [Google Scholar]
  • [22].Newman MEJ and Girvan M, “Finding and evaluating community structure in networks,” Phys Rev E, vol. 69, no. 2, February 2004, doi: 10.1103/PhysRevE.69.026113. [DOI] [PubMed] [Google Scholar]
  • [23].Rubinov M and Sporns O, “Complex network measures of brain connectivity: uses and interpretations,” Neuroimage, vol. 52, no. 3, pp. 1059–1069, 2010. [DOI] [PubMed] [Google Scholar]
  • [24].GadElkarim JJ et al. , “Investigating brain community structure abnormalities in bipolar disorder using path length associated community estimation,” Hum Brain Mapp, vol. 35, no. 5, pp. 2253–2264, May 2014, doi: 10.1002/hbm.22324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Peven J et al. , “The oblique effect: The relationship between profiles of visuospatial preference, cognition, and brain connectomics in older adults,” Neuropsychologia, vol. 135, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Mori S, Crain BJ, Chacko VP, and van Zijl PC, “Threedimensional tracking of axonal projections in the brain by magnetic resonance imaging,” Ann Neurol, vol. 45, no. 2, pp. 265–9, February 1999. [DOI] [PubMed] [Google Scholar]
  • [27].Zhan L et al. , “Comparison of nine tractography algorithms for detecting abnormal structural brain networks in Alzheimer’s disease,” Frontiers in Aging Neuroscience, Original Research vol. 7, no. 48, April 2015, doi: 10.3389/fnagi.2015.00048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Benton A, Hamsher K, Varney N, and Spreen O, “Judgement of line orientation test,” Iowa City, IA: University of Iowa, 1983. [Google Scholar]
  • [29].MacKay DJC, Information theory, inference, and learning algorithms. Cambridge, UK; New York: Cambridge University Press, 2003, pp. xii, 628 p. [Google Scholar]
  • [30].Zhan L et al. , “The significance of negative correlations in brain connectivity,”, J Comp Neurol, vol. 525, no. 15, pp. 3251–3265, October 15 2017, doi: 10.1002/cne.24274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].Hotelling H, “A Generalized T Measure of Multivariate Dispersion,” Ann Math Stat, vol. 18, no. 2, pp. 298–298, 1947. [Google Scholar]
  • [32].Benjamini Y, “Discovering the false discovery rate,”, J R Stat Soc B, vol. 72, pp. 405–416, 2010, doi: DOI 10.1111/j.1467-9868.2010.00746.x. [DOI] [Google Scholar]

RESOURCES