The relationship between spatial configuration and functional connectivity of brain regions

Janine Diane Bijsterbosch; Mark W Woolrich; Matthew F Glasser; Emma C Robinson; Christian F Beckmann; David C Van Essen; Samuel J Harrison; Stephen M Smith

doi:10.7554/eLife.32992

. 2018 Feb 16;7:e32992. doi: 10.7554/eLife.32992

The relationship between spatial configuration and functional connectivity of brain regions

Janine Diane Bijsterbosch ^1,^✉, Mark W Woolrich ², Matthew F Glasser ^3,⁴, Emma C Robinson ⁵, Christian F Beckmann ^6,⁷, David C Van Essen ³, Samuel J Harrison ^1,^†, Stephen M Smith ^1,^†

Editor: Chris Honey⁸

PMCID: PMC5860869 PMID: 29451491

Abstract

Brain connectivity is often considered in terms of the communication between functionally distinct brain regions. Many studies have investigated the extent to which patterns of coupling strength between multiple neural populations relates to behaviour. For example, studies have used ‘functional connectivity fingerprints’ to characterise individuals' brain activity. Here, we investigate the extent to which the exact spatial arrangement of cortical regions interacts with measures of brain connectivity. We find that the shape and exact location of brain regions interact strongly with the modelling of brain connectivity, and present evidence that the spatial arrangement of functional regions is strongly predictive of non-imaging measures of behaviour and lifestyle. We believe that, in many cases, cross-subject variations in the spatial configuration of functional brain regions are being interpreted as changes in functional connectivity. Therefore, a better understanding of these effects is important when interpreting the relationship between functional imaging data and cognitive traits.

Research organism: Human

eLife digest

People differ a lot from one another in terms of their personality, behaviour and lifestyle. This individuality is attributed to the different regions in the brain, and the strength of communication between them. The connectivity pattern between these areas is thought to be as unique as a fingerprint. If the connections are weak or disrupted it can play a role in conditions such as schizophrenia, depression or Alzheimer’s disease. It is thought that the strength of the connection depends on how strongly the nerve cells in these regions communicate. But are these individual differences solely caused by different strengths of connection, or could other factors contribute to them?

Now, Bijsterbosch et al. found that the size, shape and exact position of the brain regions was also strongly linked to the different behaviours of individuals. The study used brain scans, behavioural tests and questionnaires from a large database about lifestyle choices and demographics, to analyse the relationship between the different brain features of healthy individuals. The results showed that the variations in the brain regions were linked to many behavioural factors including intelligence, life satisfaction, drug use and aggression problems. Moreover, Bijsterbosch et al. showed that the existing methods for estimating the strength of connection between brain regions could reveal more about the spatial layout of these regions than the actual connection strength between them. This suggests that new approaches are needed to properly evaluate the strength of the connections.

Some psychiatric and neurological diseases may be associated with changes in size and position of the different regions in the brain. In future, the findings of this study could be applied to individuals affected by such conditions, to see if the location of a region could be used as a diagnostic indicator.

Introduction

The organisation of the human brain into large-scale functional networks has been investigated extensively over the past two decades using resting state functional magnetic resonance imaging (rfMRI). Spontaneous fluctuations in distinct brain regions (as measured with rfMRI) show temporal correlations with each other, revealing complex patterns of functional connectivity (FC) (Biswal et al., 1995; Friston, 1994, 2011). Extensive connectivity between cortical areas and with subcortical brain regions has long been considered a core feature of brain anatomy and function (Crick and Jones, 1993), and dysfunctional coupling is associated with a variety of neurological and psychiatric disorders including schizophrenia, depression, and Alzheimer’s disease (Castellanos et al., 2013). Given the great potential neuroscientific and clinical value of rfMRI, it is important to determine which aspects of rfMRI data most sensitively and interpretably reflect trait variability across subjects. At a neural level, potential sources of meaningful cross-subject variability include: (i) the strength of the functional coupling (i.e. interactions) between two different neural populations (‘coupling’), and (ii) the spatial configuration and organisation of functional regions (‘topography’). In this study, we aim to identify how these key aspects of rfMRI data influence derived measures of functional connectivity and how they relate to interesting trait variability in behaviour and lifestyle across individuals. Our findings reveal variability in the spatial topography of functional regions across subjects, and suggest that this variability is the primary driver of cross-subject trait variability in correlation-based FC measures obtained via group-level rfMRI parcellation approaches. These results have important implications for future rfMRI research, and for the interpretation of FC findings.

A commonly applied approach used to derive FC measures from rfMRI data is to parcellate the brain into a set of functional regions (‘nodes’), and estimate the temporal correlations between pairs of node timeseries (‘edges’) to build a network matrix (Smith et al., 2013b). This approach has previously been likened to a fingerprint, enabling the unique identification of individuals, and the prediction of behavioural traits such as intelligence (Finn et al., 2015; Passingham et al., 2002). Of particular interest is the ability of network matrices to explain cross-subject variability in behaviour and performance on psychometric tests. To this end, Cross Correlation Analysis (CCA) was previously adopted to link a ‘positive-negative’ axis of behaviour to network matrices in data from the Human Connectome Project (Smith et al., 2015). CCA allows the comparison of a set of variables obtained from rfMRI (such as network matrices of edges) to a set of behavioural variables by estimating independent linear transformations for the two sets of variables such that they are maximally correlated. Here, we replicated this previous work in a larger subject sample (almost double the number of individuals), and adopt CCA to determine which key aspect of rfMRI data is uniquely associated with behaviour.

Parcellation methods that can be used to estimate network matrices include the use of anatomical, functional, and multi-modal atlases (Glasser et al., 2016; Tzourio-Mazoyer et al., 2002; Yeo et al., 2011), with functional parcellations often being data driven via techniques such as clustering and independent component analysis (ICA) (Beckmann et al., 2005; Craddock et al., 2012). Data-driven approaches such as ICA have been used to identify consistent large-scale resting state networks (Damoiseaux et al., 2006) and to characterise FC abnormalities in a variety of mental disorders (Littow et al., 2015; Pannekoek et al., 2015). Any given parcellation is typically defined at the group level, and hence additional steps are required to map a group-level parcellation onto individual subjects’ data (that has undergone registration to a common space), in order to obtain subject-specific parcel timeseries and associated connectivity edge estimates. Timeseries derived from hard (binary, non-overlapping) parcellations are often obtained using a simple masking approach (i.e. extracting the averaged BOLD timeseries across all voxels or vertices in a node), whereas ICA parcellations (partially overlapping, soft parcellations that contain continuous weights) are mapped onto single-subject data using dual regression analysis or back projection (Calhoun et al., 2001; Filippini et al., 2009). The first stage of a dual regression approach involves multiple spatial regression of group ICA maps into each preprocessed individual dataset to obtain subject-specific timeseries; the second stage is a multiple temporal regression of these stage one timeseries into the same preprocessed dataset to obtain subject-specific spatial maps. Note, dual regression is, to some extent, expected to underestimate subject-specific spatial variability because it involves post-hoc regressions of a group-level set of spatial maps, which are unlikely to be an accurate model for the data of individual subjects. Indeed, previous work has shown that, in the presence of spatial variability or inaccurate intersubject alignment, these common methods for mapping group parcellations onto individuals do not recover accurate subject-specific functional regions, and this can severely impact the accuracy of estimated timecourses and derived FC edges (Allen et al., 2012; Smith et al., 2011).

More recently, several studies have developed more thorough characterisations of the patterns of spatial variability in network topography across subjects (i.e. spatial shape, size and position of functional regions) (Glasser et al., 2016; Gordon et al., 2017a, 2017b; Laumann et al., 2015; Swaroop Guntupalli and Haxby, 2017; Wang et al., 2015). For example, Glasser et al. showed that the subject-specific spatial topology of area 55b in relation to the frontal and premotor eye fields substantially diverged from the group average in 11% of subjects (Glasser et al., 2016). In addition, the size of all cortical areas, including large ones like V1, varies by twofold or more across individuals (Amunts et al., 2000; Glasser et al., 2016). This extensive presence of spatial variability across individuals highlights the need for analysis methods that are adaptive and better able to accurately capture functional regions in individual subjects. Another approach that aims to achieve a more accurate subject-specific description of this spatial variability is PROFUMO, which simultaneously estimates subject and group probabilistic functional mode (PFM) maps and network matrices (instead of separate parcellation and mapping steps). Specifically, PROFUMO is a matrix factorisation model that decomposes data into estimates of subject-specific spatial maps, time courses, and amplitudes using a variational Bayesian approach with both spatial and temporal priors that seek to optimise for both spatial map sparsity and temporal dynamics consistent with haemodynamically regularised neural activity (Harrison et al., 2015). PROFUMO adopts a hierarchical approach by iteratively optimising subject and group estimates (instead of first estimating group components using group ICA and separately mapping these onto subjects using dual regression), and is therefore expected to more accurately capture subject-specific spatial variability than does dual regression. Other approaches are available to obtain group and subject parcellations in one step, for example using a groupwise normalised cut spectral clustering approach (Shen et al., 2013). In the present study, we show that the spatial variability across subjects captured in PFMs is strongly associated with behaviour.

Conceptually, network edges are commonly thought of as reflecting coupling strength between spatially separated neuronal populations. However, as discussed above, edge estimates are highly sensitive to spatial misalignments across individuals. Additionally, correlation-based edge estimates are influenced by the amplitudes of localised spontaneous rfMRI fluctuations (Duff et al., 2018), which have been shown to capture trait variability across subjects, and state variability within an individual over time (Bijsterbosch et al., 2017). These findings demonstrate the sensitivity of edge-strength estimates to many different types of subject variability, and highlight the need to identify which aspects of FC tap most directly into behaviourally relevant population-level variability. Here, we investigate the complex relationships between different features of an rfMRI dataset and also the associations with variability across individuals in terms of their performance on behavioural tests, their lifestyle choices, and demographic information. Using data from the Human Connectome Project (HCP), we provide evidence for systematic differences in the spatial organisation of functional regions. We then use simulations that manipulate aspects of the data such that, for example, only cross-subject spatial variability is present in the data (i.e. by fixing edge strength to be the group average for each individual) to investigate whether these differences reflect meaningful cross-subject information and drive edge estimates for several common FC approaches.

Results

Cross-subject information in fMRI-derived measures

To determine whether a given rfMRI-derived FC measure contains meaningful cross-subject information rather than random variability, we adopted an approach that makes use of the extensive set of behavioural, demographic, and lifestyle data acquired in the HCP. Our first analysis aims to determine which measures obtained from rfMRI and task data most strongly relate to interesting behavioural variability across individuals. Using Canonical Correlation Analysis (CCA), we extracted population modes of cross-subject covariation that represent maximum correlations between combinations of variables in the subject behavioural measures and in the fMRI-derived measures, uncovering multivariate relationships between brain and behaviour. For example, previous work has used CCA on HCP data to identify a mode of population covariation that linked a positive-negative axis of behavioural variables to patterns of FC edge strength (Smith et al., 2015). A specific pattern of connectivity, primarily between ‘task-negative’ (default mode) regions (Raichle et al., 2001), was found to be linked to scores on positive factors such as life satisfaction and intelligence, and inversely associated with scores on negative factors such as drug use.

CCA works by finding a linear combination of behavioural measures (V) that is maximally correlated with a linear combination of rfMRI-derived measures (U). CCA scores for each subject are obtained for the behavioural and fMRI-derived measures (V and U), which represent the subject’s position along the population continuum for the latent CCA variable(s). The key result of a CCA analysis for each mode of covariation is the correlation between U and V, denoted r_UV, which describes the strength of the multivariate brain-behaviour relationship. Given that CCA explicitly optimises r_UV, it is essential to perform permutation testing in order to test the significance of the CCA result. To determine which behavioural measures contribute strongly to the CCA result, V is subsequently regressed into original non-imaging variables (Figure 1B; although interpretation of these results is complicated by behaviour-behaviour correlations). Additionally, U is used to visualise variation at both the population extremes (see Figure 2 below and Figure 2—figure supplements 2–7), and across the full population continuum (Supplementary video files).

(A) Comparison of strength of CCA result for network matrices, spatial maps and amplitudes (node timeseries standard deviation) derived from several distinct group-average spatial parcellations/decompositions: ICA decompositions at two scales of detail (dimensionalities of 25 and 200, with ‘ICA200 partial network matrix’ corresponding to the measures used previously [Smith et al., 2015]); a PROFUMO decomposition (PFM; dimensionality 50); an atlas-based hard parcellation (108 parcels [Yeo et al., 2011]), task contrast spatial maps (86 contrasts, 47 unique), and warp field from native space to MSMAll alignment. Each bar reports a separate CCA analysis (first CCA mode shown), performed against behaviour/life-factors. A similar mode of variation is found across most of the parcellation methods and different fMRI measures. r_UV is the strength of the canonical correlation between imaging and non-imaging measures. Error bars indicate confidence intervals (2.5–97.5%) estimated using surrogate data (generated with the same correlation structure), and red lines reflect the p<0.002 significant threshold compared with a null distribution obtained with permutation testing (i.e. family-wise-error corrected across all CCA components and Bonferroni corrected across a total of 25 CCAs performed, see Supplementary file 1a and b for the full set of results). CCA estimates the highest possible r_uv given the dataset; therefore, the null distribution for low-dimensional brain data (e.g. ICA 25 amplitude) is expected to be lower than for high-dimensional brain data. (B) Set of non-imaging variables that correlate most strongly with the CCA mode (averaged subject weights V across results marked with * in A; i.e. p=0.00001) with behavioural variables. Position against the y-axis and font size indicate strength of correlation.

Figure 1—source data 1. Source data for Figure 1.

elife-32992-fig1-data1.mat^{(43.2MB, mat)}

DOI: 10.7554/eLife.32992.004

Figure 1—figure supplement 1. — (A) Comparison of strength of CCA result for network matrices, spatial maps and amplitudes (node timeseries standard deviation) derived from several distinct group-average spatial parcellations/decompositions: ICA decompositions at two scales of detail (dimensionalities of 25 and 200, with ‘ICA200 partial network matrix’ corresponding to the measures used previously [Smith et al., 2015]); a PROFUMO decomposition (PFM; dimensionality 50); an atlas-based hard parcellation (108 parcels [Yeo et al., 2011]), task contrast spatial maps (86 contrasts, 47 unique), and warp field from native space to MSMAll alignment. Each bar reports a separate CCA analysis (first CCA mode shown), performed against behaviour/life-factors. A similar mode of variation is found across most of the parcellation methods and different fMRI measures. r_UV is the strength of the canonical correlation between imaging and non-imaging measures. Error bars indicate confidence intervals (2.5–97.5%) estimated using surrogate data (generated with the same correlation structure), and red lines reflect the p<0.002 significant threshold compared with a null distribution obtained with permutation testing (i.e. family-wise-error corrected across all CCA components and Bonferroni corrected across a total of 25 CCAs performed, see Supplementary file 1a and b for the full set of results). CCA estimates the highest possible r_uv given the dataset; therefore, the null distribution for low-dimensional brain data (e.g. ICA 25 amplitude) is expected to be lower than for high-dimensional brain data. (B) Set of non-imaging variables that correlate most strongly with the CCA mode (averaged subject weights V across results marked with * in A; i.e. p=0.00001) with behavioural variables. Position against the y-axis and font size indicate strength of correlation.

Figure 1—source data 1. Source data for Figure 1.

elife-32992-fig1-data1.mat^{(43.2MB, mat)}

DOI: 10.7554/eLife.32992.004

Figure 2. — The top row shows that the inferior parietal node of the DMN differs in shape and extends into the intraparietal sulcus in subjects who score high on the positive-negative CCA mode (right), compared with subjects who score lower (left). The bottom row shows that medial prefrontal and posterior cingulate/precuneus regions of the DMN differ in size and shape as a function of the CCA positive-negative mode. The representative maps at both extremes are thresholded at ±2 (arbitrary units specific to the PFM algorithm) for visualisation purposes (the differences are not affected by the thresholding; for unthresholded video-versions of these maps, please see the Supplementary video files. The grey contours are identical on the left and right to aid visual comparison and are based on the group-average maps (thresholded at 0.75). Spatial changes of all PFM modes can be seen in the Supplementary video files and in Figure 2—figure supplements 2–7. B: difference maps (positive - negative; thresholded at ±1) are shown to aid comparison. C: A summary of topographic variability across all PFM modes, showing PFM correlations with CCA subject weights (at each grayordinate the maximum absolute r across all PFMs is displayed). An extended version of C is available in Figure 2—figure supplement 7. Data of Figure 2 available at: https://balsa.wustl.edu/8lVx.

Figure 2—figure supplement 1. — The top row shows that the inferior parietal node of the DMN differs in shape and extends into the intraparietal sulcus in subjects who score high on the positive-negative CCA mode (right), compared with subjects who score lower (left). The bottom row shows that medial prefrontal and posterior cingulate/precuneus regions of the DMN differ in size and shape as a function of the CCA positive-negative mode. The representative maps at both extremes are thresholded at ±2 (arbitrary units specific to the PFM algorithm) for visualisation purposes (the differences are not affected by the thresholding; for unthresholded video-versions of these maps, please see the Supplementary video files. The grey contours are identical on the left and right to aid visual comparison and are based on the group-average maps (thresholded at 0.75). Spatial changes of all PFM modes can be seen in the Supplementary video files and in Figure 2—figure supplements 2–7. B: difference maps (positive - negative; thresholded at ±1) are shown to aid comparison. C: A summary of topographic variability across all PFM modes, showing PFM correlations with CCA subject weights (at each grayordinate the maximum absolute r across all PFMs is displayed). An extended version of C is available in Figure 2—figure supplement 7. Data of Figure 2 available at: https://balsa.wustl.edu/8lVx.

We applied a separate CCA analysis for each of the various fMRI-derived measures (including spatial, network matrix, and amplitude measures). The results (Figure 1 and Supplementary file 1a and b) reveal that highly similar associations with behaviour and life factors occur across a wide range of different fMRI-derived measures. Correlating the behavioural subject weights (V) across the different CCA instances in Figure 1 shows that a similar behavioural mode is obtained from the independent instances of CCA (particularly for those CCAs that have a high r_U-V and low P_U-V; Figure 1—figure supplement 1). Mapping these subject weights onto behaviour through correlation reveals consistent positive associations with, for example, fluid intelligence, life satisfaction, and delayed discounting, and consistent negative correlations with use of tobacco, alcohol and cannabis. All behavioural correlations with mean correlation r>|0.25| (chosen for visualisation purposes) are shown in Figure 1B. The results show that spatial features such as PFM subject spatial maps and subject task contrast maps are strongly associated with behaviour. Overall, these findings reveal that a large variety of fMRI measures have similarly strong associations with behaviour.

Direct comparison between the results in Figure 1—figure supplement 1a) and the HCP_MMP1.0 parcellation (e.g. the 360-region ‘Glasser parcellation’ [Glasser et al., 2016]) and against associated fractional surface area (in native space as a ratio to total surface area, for each of the 360 parcels in the HCP_MMP1.0 parcellation) is challenging due to the large difference in the number of subjects (n = 819 for Figure 1 and n = 441 for HCP_MMP1.0). Therefore, we have included an analysis on all PFM metrics in a reduced number of subjects (the same n = 441 subjects) in order to facilitate direct comparison between these two recent parcellation approaches that both aim to achieve accurate detection of subject-specific spatial boundaries (Supplementary file 1b). These results show that spatial features from a variety of sources (surface area, multimodal parcellation and PFMs) are strongly associated with measures of behaviour and lifestyle. Also note that network matrices obtained by the HCP_MMP1.0 parcellation are more predictive of behaviour than are PFM network matrices.

For correlation-based parcellated FC estimates (network edges), a common assumption is that functional coupling is primarily reflected in the edges. In theory, true network coupling information can be manifested along a continuum ranging from spatial maps to network matrices. On one extreme, coupling information is purely contained in spatial maps, as is the case when performing temporal ICA (where the temporal correlation matrix is by definition the identity matrix [Smith et al., 2012]). On the other extreme, coupling information can be fully contained in network matrices as is often assumed to be the case when using an individualised hard parcellation (however, coupling can only be represented fully in edge estimates if all subjects are perfectly functionally aligned to the parcellation, and if the node timeseries amplitudes do not contain useful cross-subject information). It is likely that the dimensionality of the decomposition may influence this; for example, for a low-dimensional decomposition (into a small number of large-scale networks), much cross-subject variation in functional coupling is likely to occur between sub-nodes of the networks, which is therefore more likely to be represented in the spatial maps, whereas in a higher dimensionality decomposition this information is more likely to be represented in the network matrix. However, the results in Figure 1 show that this CCA mode of population covariation is significantly present in both spatial maps and network matrices for both low- and high-dimensional decompositions (ICA 25 and 200). Therefore, the potential role of dimensionality is not sufficient to explain the common information present in spatial maps, timeseries amplitudes, and network matrices.

The presence of this behaviourally meaningful spatial variability is somewhat surprising, because these data were aligned using a Multimodal Surface Matching (MSM) approach (Robinson et al., 2014; 2018), driven by both structural and functional cortical features (including myelin maps and resting state network maps). MSM has been shown to achieve very good functional alignment compared with other methods, and particularly compared with volumetric alignment approaches or surface-based approaches that use cortical folding patterns rather than areal features (Coalson et al., 2018). However, residual cross-subject spatial variability is still present in the HCP data after the registration to a common surface atlas space (in part due to the constrained parameterisation of MSM and in part because weighted regression subject maps used to drive MSM may not fully capture all spatial variability). In line with this, approaches which are expected to better identify residual subject spatial variability (specifically, PFM spatial maps and subject task contrast maps in Figure 1) show strong correspondence between spatial variability and behaviour/life-factor measures.

To better understand what spatial features represent behaviourally relevant cross-subject information, we visually explored what aspects of the PFM spatial maps contributed to the CCA result in Figure 1 by calculating representative maps at extremes of the CCA mode of population covariation (based on CCA subject scores). While the PFM maps are estimated using the full set of cortical and subcortical grayordinates, we focus on cortical findings because these contribute most strongly to the CCA results. The results reveal complex changes in spatial topography (Figure 2, Figure 2—figure supplements 2–7, and Videos 1–9. For example, comparing left versus right panels shows the right inferior parietal node of the DMN extending farther into the intraparietal sulcus (in the vicinity of area IP1 [Choi et al., 2006; Glasser et al., 2016]) in subjects who score higher on the behavioural positive-negative mode of covariation. Qualitative inspection of Figure 2—figure supplements 2–7 suggests that many of the difference maps show notable bilateral symmetry.

Video 1. Unthresholded maps are shown for the 4 PFMs that contribute most strongly to the CCA result (14, 45, 35, 33; corresponding stills in Figure 2 and Figure 2—figure supplement 1).