Summary
We propose a novel Bayesian hierarchical model for brain imaging data that unifies voxel level (the most localized unit of measure) and region level brain connectivity analyses, and yields population level inferences. Functional connectivity generally refers to associations in brain activity between distinct locations. The first level of our model summarizes brain connectivity for cross-region voxel pairs using a two component mixture model consisting of connected and non-connected voxels. We use the proportion of connected voxel pairs to define a new measure of connectivity strength, which reflects the breadth of between-region connectivity. Furthermore, we evaluate the impact of clinical covariates on connectivity between region-pairs at a population level. We perform parameter estimation using Markov chain Monte Carlo (MCMC) techniques, which can be executed quickly relative to the number of model parameters. We apply our method to resting-state functional magnetic resonance imaging (fMRI) data from 32 subjects with major depression and simulated data to demonstrate the properties of our method.
Keywords: Bayesian hierarchical model, brain imaging, functional connectivity, MCMC, resting-state fMRI
1. Introduction
Functional brain connectivity refers to associations between neural processing units from distinct brain locations. Specific patterns of connectivity may be linked to corresponding actions, emotions, cognition, or resting-state (reflecting default mode of neural processing). Disruptions to these connectivity patterns are often associated with psychiatric and neurological disorders. Functional magnetic resonance imaging (fMRI) enables the assessment of functional connectivity by calculating the temporal coherence of neural activity measures arising from spatially distinct regions of the brain. There are several connectivity metrics available to measure the coherence of the temporal profiles. Examples include Pearson correlation; spectral coherence, which measures the coherence between the two temporal profiles in the frequency domain and mutual information, which measures the mutual dependence between the random variables giving rise to the observed two time series (Sun et al., 2004). Modeling connectivity directly on fMRI time series often leads to a prohibitive computational burden, and thus most methods first calculate the connectivity metrics, then model these summaries to draw statistical inferences across subjects (Turk-Browne, 2013; Zatorre, 2013).
There are numerous intrinsic challenges to evaluating functional connectivity in the brain. Typical neuroimaging data in standard 2mm Montreal Neuroimaging Institute (MNI) space usually include roughly 300,000 voxels within the brain (on a regular lattice of 91×109×91 voxels). Therefore, there are more than 40 billion voxel pairs for a single whole brain connectivity analysis. It is desirable for connectivity analysis to provide spatially detailed inferences between all voxel pairs (Turk-Browne, 2013). However, such ultra-high dimensionality poses a formidable computational load. Zalesky et al., 2010 summarized the numbers of regions (or voxels) used for analysis in recent highly cited functional connectivity studies. A common strategy is to choose a small set of seed voxels among all voxels and calculate the associations between each seed voxel and all of the remaining voxels (Cordes et al., 2001). A second approach is to define regions of interest (ROIs) using an existing anatomical parcellation of the brain, for example automated anatomical labeling (AAL) or the Brodmann brain atlas (Tzourio-Mazoyer et al., 2002; Garey, 1994) and to quantify connectivity at the region level rather than voxel level (Greicius et al., 2003; Zalesky et al., 2012b). This approach usually first summarizes the temporal profiles of all voxels within each region by taking averages (or similar dimension reduction) and then estimating the connectivity by using the regional summaries. In addition, independent component analysis (ICA) has been applied to assess connectivity by separating spatially independent sources of signals (Calhoun et al., 2001).
Recently, there has been a focus on the application of graph theory metrics to depict complex brain networks (Bullmore and Sporns, 2009; Simpson et al., 2013; Bowman et al., 2014). Graph theory metrics reflect important topological properties of brain networks. For example, the “small-worldness” property indicates that most neural units (voxels) are often directly connected to neighbors (e.g. high positive Pearson correlation), but distant units can be connected through only a small number of steps via highly connected nodes (‘hub’ nodes) (Achard et al., 2005). The “small-worldness” property is common in highly organized networks such as the internet, vehicle traffic, and social networks. Nevertheless, those metrics usually only function as summary statistics rather than as motifs for establishing brain connectivity.
The seed voxel approach is simple for computation, but it is inherently limited in scope as it ignores functional connections other than the ones associated with the seeds. The region based method neglects the variations within regions, which may lead to substantial bias and information loss, and these problems are exacerbated as the size of an ROI increases. Moreover, region-based methods lack detail on the particular voxels that are connected between pairs of regions. ICA is often conducted on the voxel level where detected components may be difficult to interpret and may span multiple brain regions. Bayesian modeling has been applied to estimate localized brain activity with corresponding group/task differences and to perform region level connectivity analysis using task-induced fMRI data (Bowman et al., 2008; Patel et al., 2006; Xue et al., 2015). In summary, most existing statistical methods for brain connectivity analysis are conducted either on the region level or on the voxel level alone. There is an unmet need to develop analytical methods to unify connectivity analyses at these two levels of spatial units. We provide a new statistical framework that unifies voxel and region level connectivity analyses.
We propose a novel Bayesian hierarchical model for inferring brain connectivity. We use Pearson correlation, which is the most widely used connectivity metric in the neuroimaging community as the input data for our method (Zalesky et al., 2012a; Zatorre, 2013; Greicius et al., 2003). The base level of our framework assumes that the connections between voxel pairs from two brain regions follow a mixture distribution with two modes reflecting connected and non-connected voxel pairs (i.e. high and low connectivity metrics). The assumption follows naturally from the “small-worldness” property of brain networks, since two distant ROIs may be well connected through a small proportion of efficiently connected nodes (represented by the mixture component with higher connectivity metric values). The rest of the nodes in these two regions may not have direct across-region connections, but they could communicate via those efficiently connected voxels(connected component). Our model utilizes the connectivity information from all cross-region voxel pairs. From parameters in our mixture model, we introduce a new statistic called connectivity breadth, which reflects a measure of connectivity strength. We adjust for an individual’s covariate effects such as age, gender, and psychiatric conditions in a generalized linear model (GLM).
The hierarchical Bayesian model unifies voxel level and region level connectivity by recognizing the distribution of voxel connectivity between regions, which allows us to fully utilize rich voxel level connectivity information while keeping the more interpretable region level connectivity findings. At the voxel level, our model provides inferences to identify voxels (within a region) acting as ‘hubs’ by exhibiting many connections to another region. In addition, our model identifies cross-region voxel pairs that exhibit strong connectivity. Thus, two regions may be functionally connected via a hub in at least one of the regions or by two high-traffic ports directly linking the regions. Our method could have important applications, for example to reveal differences in connectivity properties (or other potential biomarkers) between patients with Parkinson’s disease and healthy controls. We estimate the parameters from our model using Markov chain Monte Carlo (MCMC) methods and draw inferences based on the joint posterior probability distribution for all of the parameters.
2. Motivating Example
There is currently a major focus in the neuroimaging literature on resting-state functional connectivity, which examines connectivity properties during an undirected mental state without the explicit performance of tasks. We consider resting-state fMRI data from 32 patients with major depressive disorder (MDD) participating in a depression treatment biomarker study (Dunlop et al., 2012). The sample includes 12 males and 20 females ages 24 to 57 years. We have data from baseline fMRI scans acquired prior to treatment randomization, a baseline 17 item Hamilton Depression Rating Scale (HAM-D17), and covariates age and gender. The patient cohort was divided into two groups based on the HAM-D17 score, as suggested by Maruish (1999) and Schutte and John (1995). The resulting classification includes 11 mildly depressed subjects, with HAM-D17 greater than 15 and less than 18, and 21 severely depressed subjects with HAM-D17 18 or above. For each subject, 150 fMRI volumes were obtained in 7.5 minutes during a resting-state, in which subjects are left to think for themselves while visually focusing on a cross displayed on a monitor in the scanner. We performed standard data preprocessing steps, including motion correction to offset subject movement in the scanner, slice timing correction to adjust for the fact that each 3D scan representing a single time point is actually acquired as a series of 2D slices over time, normalization to adopt a common reference space across subjects, and spatial smoothing. Also, we apply de-trending and demeaning steps to remove low frequency trends unrelated to neural processing (e.g. due to scanner hardware) and to adjust for spatial variation in brain tissue properties that may affect the fMRI signal. Pre-whitening is performed to remove the temporal correlation of the time course at each voxel.
Models of depression emphasize dysfunctional interactions among specific cortical and limbic brain areas. Particularly, the interactions between subdivisions of frontal cortex and cingulate gyrus have been repeatedly reported in depression studies by using multiple complementary imaging methods (Seminowicz et al., 2004; Mayberg et al., 2005; Greicius et al., 2007; and Fox et al., 2012). We focus on the interactions between some of those reported ROIs (based on the Brodmann atlas) in the prefrontal cortex (PFC) including medial frontal cortex (mF10), orbital frontal cortex (OF11), and lateral prefrontal cortex (latF9); subgenual cingulate cortex (Cg25) and anterior cingulate (Cg24). We show these regions of interest in Figure 1. The goal is to assess functional connections between the five ROIs and to make inferences about how depression severity influences them.
Exploring the distribution of voxel-level functional connections (using correlations for example) between ROIs, we note the appearance of a mixture distribution with two components: a larger component of relatively weakly connected voxel pairs and a smaller component of more strongly connected voxel pairs (see Figure 2). Moreover, each subject has a different number of connected voxel pairs. The size of the connected component represents the connectivity breadth, and reflects aspects of the the “small-worldness” property. Therefore, we develop a method to study such connectivity breadth and to identify what factors may influence this connectivity breadth.
3. Methods
We formulate a model that builds on the calculated voxel-level functional connections within ROIs and between ROIs. ROIs are labeled as g = 1, …, G, where we may set G as high as 116 for the AAL map and 48 for the Brodmann map (for our data example G = 5). We use zig̈v̈ to denote connectivity (here, we consider correlations with a Fisher transformation) between the cross-region pair of voxels v and v′ denoted by v̈; for regions g (v ∈ g)and g′ (v′ ∈ g′), labeled by g̈; and for subject i (i = 1, …, n). Throughout, we use the two-dot notation to indicate pairs of voxels or regions. For a region pair g̈, the total number of voxel pairs is Vg̈. Thus, if regions g and g′ include Vg and Vg′ voxels respectively, then Vg̈ = Vg×Vg′.
3.1 Bayesian Hierarchical Model
We use a Bayesian hierarchical framework to jointly model functional connections between pairs of voxels and pairs of ROIs along with clinical covariate effects while tracking the variability at each level (see Figure 3).
3.1.1 Distribution for connectivity
The first level of our model specifies functional links between ROIs for each subject via a distribution of connections from cross-region voxel pairs. A driving heuristic is that according to the “small-worldness” property of brain networks, two regions can be efficiently connected through only a small proportion of connected voxels, and this property is also borne out by descriptive assessments of our data. In the two component mixture model, we limit our attention to only non-negative correlations, following a strong precedent in the neuroimaging literature, which facilitates interpretations. (Weissenbacher et al., 2009; Fox et al., 2009).
At the first level, we assume that zig̈v̈ follows a mixture distribution with two modes representing connected and non-connected voxel pairs. The two components vary around means μ0ig̈ (non-connected) and μ1ig̈ (connected), with the variances and respectively. We set μ1ig̈ > μ0ig̈ to aid identifiability. Specifically, the model is given by
(1) |
where λig̈ = Vg̈πig̈.
The latent indicator variable ωig̈v̈ classifies a voxel pair as either connected or non-connected. Our interest lies in detecting connectivity breadth between ROIs, which is represented by the average number of connected voxel pairs λig̈ = Vg̈πig̈. Note that ωig̈v̈ in line 2 may be correlated with ωig̈v̈′, and then the indicator variables of all cross-region voxel pairs are a set of dependent Bernoulli trials (connected 1 or not connected 0). Moreover, we let the marginal sum follow a Poisson distribution and link the hyperprior parameters with λig̈ because i) the information of parameters from Bernoulli distribution and Poisson distribution are almost equivalent (λig̈ = Vg̈πig̈) and ii) Ωig̈v̈ is more sensible and computationally convenient for group level inferences. It has been shown that the Poisson distribution has a bounded approximation for the sum of dependent Bernoulli trials (Chen, 1975). The approximation should be very close in our application since Vg̈ is on the order of hundreds of thousands in many cases. Therefore, using independent or dependent correlation structure between all voxel pairs ωig̈v̈ does not impact the marginal parameter estimation of λig̈. Our approximation is analogous to the areal/lattice spatial data analysis in disease mapping, where each incident (dependent on each other) follows Bernoulli distribution yet within an area (e.g. county) the marginal sum incident count is often used as an outcome variable for association analysis.
Why two component mixture?
We adopt the two component mixture model reflecting that voxels exhibit connectivity or not, rather than specifying a finite mixture model with k > 2 or an infinite mixture model. Our formulation mirrors the use of two component mixtures in other massive hypothesis testing settings, e.g. local fdr (Efron, 2004; George and McCulloch, 1993; Do et al., 2005; Müller et al., 2007). The common target is to identify the true positive elements from a mixture model consisting of a dominating null distribution and a small number of positive events (here, functional connections). Whereas the local fdr method tends to focus on control rates at specific z values or events, we are interested in the sum of latent indicator variables Ωig̈. Marginally, , where f(zig̈v̈) = πig̈f1(zig̈v̈) + (1 − πig̈)f0(zig̈v̈), f0 and f1 represent the probability density functions associated with the null and positive components, and P(zig̈v̈) is the probability of “true positive”, which is identical to 1 − fdr. The marginal variable Ωig̈ is subject to less variance and is invariant to the choice of distributions for f0 and f1.
In addition, the conceptualization as a two component mixture mitigates the cross-subject identifiablility issues. If each subject includes k (k > 2) components, we ought to ensure that not only all components (with different center parameters) are identifiable, but also that the components can be matched across subjects for group-level inferences. The matching procedure pose challenges with respect to numerical optimization.
3.1.2 Prior specifications
The second level of the hierarchical model in (2) expresses a prior belief that each subject’s mixture component means μ0ig̈ and μ1ig̈ arise from normal distributions with means u0g̈ and u1ig̈ and variances and respectively. We anticipate that the mean parameter u1ig̈ for the cross-region voxel pairs from the connected component will be relatively large. Thus, we empirically inform u1ig̈ by calculating the mode of voxel pair functional connections within an ROI at geometric distances up to 30 mm for each subject, which seems to represent a reasonable starting point since voxel pairs within anatomically-defined regions generally exhibit high association.
(2) |
We inform the mode of u0g̈ by using the mode of voxel pair functional connections between ROIs across all subjects which is close to zero. The variances and reflect the confidence of the prior mean specification, and we set and as a relatively small value as the mean priors are informative. For the first level variance parameters and , we assume Gamma distributions with hyperprior parameters a0 = 0.1, b0 = 0.005, a1 = 0.1, b1 = 0.005, reflecting vague priors.
We model the region pair mean connectivity breadth λg̈ through a GLM with a log link to accommodate the Poisson distribution. The parameters βg̈ of the GLM capture covariate effects such as disease status, age, and treatment group. The priors for βg̈ are set vaguely enough to let the large amount of observed data weigh heavily in determining the posterior distribution, specifically using
(3) |
3.2 Estimation and posterior inference
We apply MCMC algorithms to estimate the parameters in our hierarchical probability model. The priors for the main effects β in the GLM are not conjugate, so we use a Gibbs sampler with Metropolis updates where needed for posterior sampling. The full conditional distributions are given by the following:
(4) |
where , z̄1ig̈ is the mean for all cross-region voxel pairs v̈ from the connected component, i.e. ωig̈v̈ = 1, z̄0ig̈ is the mean for all v̈ of ωig̈v̈ = 0, and πig̈ = eβg̈0+ βg̈xi/Vg̈.
When implementing the MCMC procedure, we first sample the latent indicator variable, then sample the four parameters of the Gaussian mixture model. In formula 4 line 1, each latent indicator variable ωig̈v̈ for the corresponding voxel pair v̈ is conditional on the region level parameter πig̈, which is equivalent to λig̈/Vg̈. λig̈ is determined by βg̈ (along with the subject covariates) and βg̈0, and both βg̈ and βg̈0 are sampled by incorporating (see the last two lines of formula 4). Note that Ωig̈ is updated by taking the summation of updated ωig̈v̈ rather than sampled directly from Poi(λig̈). Therefore, similar to other mixture model settings, our sampling scheme includes an implicit constraint that each ωig̈v̈ is dependent on [ωig̈v̈′, ωig̈v̈″, …] because the update of πig̈ (λig̈/Vg̈) depends on connections of all voxel pairs between a region pair i.e. πig̈ = λig̈/Vg̈ = g−1(βg̈0, βg̈) and the updates of βg̈0 and βg̈ are both dependent on Ωig̈ (the sum of [ωig̈v̈, ωig̈v̈′, ωig̈v̈″, …]).
At the second level, the GLM parameters are updated through Metropolis steps with input of Ωig̈ following a Poisson (log link). We favor a Poisson distribution, rather than a Binomial, because with Ωig̈ and Vg̈ being on the order of thousands and millions, respectively, the ratio function of the Metropolis step may easily generate zeros in the denominator if the logit link is used. In addition, since the number of voxel pairs Vg̈ is often large, the inferences for πig̈ can be directly computed as λg̈/Vg̈ from the posterior simulations. The ratios of βs in the Metropolis step can be expressed as:
where β′ are proposed values, p0 indicates the prior distribution, and we accordingly obtain ηig̈ from ηig̈ = βg̈0 + βg̈xi. The proposed value will be accepted with probability min(1, rβ). All other parameters can be sampled from known distributions through the Gibbs sampler as shown in (4).
Implementation of our Bayesian model takes samples from the joint posterior distribution for all of the model parameters, which enables statistical inferences regarding connectivity and brain network properties. First, we can make inferences about voxel-level connectivity for each subject. For example, we can target inferences pertaining to the probability of a cross region voxel pair being connected, say Pig̈(v̈), by calculating the proportion of ωig̈(v̈) equaling one. Secondly, we can identify which voxels operate as hubs, with a hub defined here as a voxel that connects with numerous cross-region voxels. Thirdly, we can determine which specific pairs of voxels are highly connected to each other. We can also make inferences about region-level connectivity. For example, we can draw inference about the connectivity breadth of a region pair by calculating the posterior probability of eβg̈0 + βg̈xi. Relevant to our upcoming data analysis, our modeling framework yields inferences at the population level, and we can investigate the impact of covariate effects such as age, disease groups, and treatment responses on connectivity breadth between ROIs.
4. Results
We apply our Bayesian hierarchical model to resting-state fMRI data from the depression study and empirically evaluate our model using simulated data. We compare our method with the region average method which is the most commonly used group-wise region level analysis approach under the predefined brain ROI atlases.
4.1 MDD Study Using Resting-state fMRI Data
In the MDD study, our goal is to explore the neural circuits associated with medial and lateral regions of prefrontal cortex (e.g. latF9, mF10, OF11) and effective subdivisions of the anterior Cingulate Cg25 and Cg24 as described in section 2. We employ the commonly used Pearson correlation coefficient as a measure of the temporal coherence between fMRI time series from distinct voxels and further apply the Fisher normalizing transformation. As demonstrated in Figure 2, the functional connections appear to follow a mixture distribution. We apply our Bayesian hierarchical model to the transformed correlation metrics to investigate functional connectivity. For illustration, we highlight results for the region pair mF10 and Cg25.
Inferences for voxel pair connectivity
After applying our Bayesian model, we estimate the probability of functional connectivity between each cross-region voxel pair. Then, we identify voxels in one brain region that are highly connected to voxels in the other region, representing hubs at the population level. In Figure 4a, each row represents a voxel in mF10, each column represents a subject, and the intensity indicates the corresponding number of functionally connected voxels in Cg25. For convenience, we have ordered the voxels in mF10 according to the number of functionally connected voxels (across subjects), with the most highly connected voxels appearing at the top and with the least connected voxels appearing at the bottom. We note that there are several mF10 voxels that are highly connected to region Cg25 across all 32 subjects, while most voxels are weakly connected. These highly connected voxels serve as “hubs” connecting the two regions since they reveal coherent activity with many other voxels. Reciprocally, Figure 4b shows the voxels in Cg25 connected to mF10. Specifically, we consider the voxels that are cumulatively connected to roughly 30% or more of the voxels in the other region. For example, in considering hubs linking voxels in mF10 to those in Cg25, we identify voxels with degree > 60 (the degree of a voxel equals to the number of its connected voxels) and voxels in Cg25 with degree > 150. The joint posterior distribution also allows also allow us to identify particular voxel pairs connecting the two regions. Figure 4c shows the highly connected voxel pairs between the two regions, and Figure 4d illustrates locations of the functionally connected voxel pairs linking regions mF10 and Cg25. The posterior also yields inferences to identify highly connected voxels for each subject, and note from Figure 4 that connections between voxels vary across subjects. Our framework has particular utility in reforming models of depression relevant to focal neuromodulation where targeting of discrete pathways may be critical (Mayberg et al., 2005; Gutman et al., 2009; and Fox et al., 2012).
Inferences for region pair connectivity
At the region level, we summarize the breadth of functional connectivity, stratified by different depression severity groups, by counting the number of the functionally connected voxel pairs. In Table 1, we note that the region pairs mF10 and Cg25, OF11 and Cg25, and Cg24 and Cg25 are relatively highly connected, but not latF9 and Cg25C. There are marked differences in connectivity breadth between mildly and severely depressed patients for mF10 and Cg25 (7.5%), latF9 and Cg25 (6%), and OF11 and Cg25 (10%) with more severely depressed patients having increased functional connections. These findings may reveal that with increased depression severity, the brain locations for emotion processing interact more with the prefrontal cortex neural units. There is very little difference, however, in extent of functional connections between groups for Cg24 and Cg25 (2% difference), with severely depressed patients exhibiting slightly reduced connectivity breadth.
Table 1.
Parameters | Median (πg̈ %) | std | 25 pct | 75 pct | Mode | Difference(CI) | % Difference |
---|---|---|---|---|---|---|---|
Cg25 & mF10 (122,591 voxel pairs) | |||||||
| |||||||
(Mildly depressed exp(β0+β1)) | 36526 (29.8) | 143 | 36466 | 36526 | 36600 | 2879 | 7.5 % |
(Severely depressed expβ0) | 39423(32.1) | 155 | 39325 | 39422 | 39437 | (2705,3084) | |
| |||||||
Cg25 & latF9 (119,426 voxel pairs) | |||||||
| |||||||
(Mildly depressed exp(β0+β1)) | 8654 (7.2) | 26 | 8334 | 8921 | 8688 | 488 | 6 % |
(Severely depressed expβ0) | 9124(7.6) | 31 | 8968 | 9306 | 9132 | (440,518) | |
| |||||||
Cg25 & OF11 (214,376 voxel pairs) | |||||||
| |||||||
(Mildly depressed exp(β0+β1)) | 45632 (21.3) | 318 | 45273 | 45987 | 45686 | 4729 | 10 % |
(Severely depressed expβ0) | 50361 (23.5) | 387 | 50186 | 50626 | 50357 | (4461,4891) | |
| |||||||
Cg25 & Cg24 (32,705 voxel pairs) | |||||||
| |||||||
(Mildly depressed exp(β0+β1)) | 10162 (31.1) | 71 | 10083 | 10239 | 10168 | −149 | −2 % |
(Severely depressed expβ0) | 10013 (30.5) | 68 | 9972 | 10045 | 10018 | (−120,−171) |
For comparison, we apply a commonly used approach that calculates correlations between the regional temporal profiles obtained as an average of all temporal profiles in mF10 and Cg25 for all subjects. There is no difference between the two groups with t test p value at 0.57 (Supplementary Figure 2). We further address differences between our proposed method and the regional representative approach in a simulation study. We conclude that our model is able to reveal the subtle difference driven by the relatively few connected voxel pairs, and accordingly to detect the group difference. Overall, we find that that cg25 and frontal cortices appeared hyper-connected in more depressed subjects, which is consistent with previous research findings about the frontal-limbic circuitry (Greicius et al., 2007; Greicius, 2008; Zeng et al., 2012). Besides the similar region-level connectivity findings above, we provide additional voxel level inferences that previous methods do not permit (e.g. Figure 4 results); for instance we can identify those highly-connected voxel pairs that effectively connect a pair of regions.
We check our MCMC runs to ensure the stability of the results and the convergence of the chains. However, the massive number of voxel pairs prevents formal use of MCMC convergence tools for all model parameters. We monitor the trace plots for all region level parameters and for parameters corresponding to select voxel pairs. For each MCMC, we sample 20,000 iterations in total and set the first 500 iterations as burn-in. The resulting Markov chain converges within 500 burn-in samples (Supplementary Figure 1).
4.2 Simulation Study
We conduct a simulation study to further examine properties of our proposed connectivity modeling framework. We generate resting-state fMRI time series based on one region pair with sizes of a 7×7×7 voxel cube and a 5×5×5 voxel cube, from 20 subjects (10 controls and 10 cases). Therefore, for each subject we simulate 73 +53 = 468 fMRI time series, which allows us to calculate the connectivity metrics of Vg̈ = 73 ×53 = 42875 between-region voxel pairs. First, we randomly generate the numbers of connected cross-region voxel pairs using Poisson distributions with mean parameter:
We use βg̈0 = 8 and βg̈1 = 0.9 to indicate that the controls have more breadth in connectivity than cases do. Second, we construct the correlation matrix R (468 × 468) for all voxels in the two regions, which determines the correlations between the fMRI time series. We set each group with different Rs since we use different Ωig̈ and Vg̈ − Ωig̈ to indicate the numbers of highly correlated (e.g. correlation coefficient=0.6) and uncorrelated (e.g. correlation coefficient=0) between-region voxel pairs for each group. For the within region voxel pairs, we use correlations on the geometric distances between voxels so that voxels close to each other are more highly correlated and distant voxel pairs are less correlated. We slightly adjust R to ensure that it is positive definite. Based on different Rs of case and control groups, we generate 468 resting-state fMRI time series for each of the 20 subjects by using multivariate normal distribution with mean zero and covariane of R (Supplementary Figure 3 demonstrates the resting-state fMRI time series for 10 voxels in one subject). We assume that the pre-whitening has been performed, so there is no temporal autocorrelation within each voxel. We also apply different variance parameters of multivariate normal distribution (power magnitude) of the BOLD signal and band filtering to remove the low and high frequency components. The tuning of these simulation parameters does little change to the resulting distribution of correlation (connectivity metric) which is the input our of method.
Based on the simulated resting-state fMRI time series, we calculate correlation coefficients of the between region voxel pairs for each subject (see histograms in Supplementary Figure 4), where the controls have a larger component of connected voxel pairs. Our main goal is to detect a group effect and to check the number of voxel pairs that are determined to be connected for each subject. We apply our proposed hierarchical Bayesian framework to the simulated data.
Based on trace plots of parameters for the group effects from our MCMC samples of 20,000 iterations (Supplementary Figure 5), we note that the chains converge after around 300 iterations and henceforth tightly vary around the true values. The results shown in Table 2 reveal that our model captures the group effect accurately. Thus, this simulation supports that the proposed method and estimation algorithm can detect differential connectivity breadth when they exist. The small bias of βs may come from the false negative classification of the mixture model due to the heavy inclination of the predominant null component.
Table 2.
Parameters | Median | std | 25 pct | 75 pct | Mode |
---|---|---|---|---|---|
Ω̄0 | 2785 | 12.38 | 2591 | 2889 | 2781 |
Ω̄1 | 6778 | 15.32 | 6691 | 7153 | 6776 |
β0 (8) | 7.95 | 0.022 | 7.91 | 8.01 | 7.97 |
β1 (0.9) | 0.89 | 0.009 | 0.88 | 0.90 | 0.89 |
As a comparison, we use the the regional representative method described in the analysis of our depression data, to detect the connectivity difference between the two groups. We first calculate the regional temporal profiles for each subject using the the average all of the fMRI time series in each region. Then, we measure the correlation coefficients between the two regional fMRI time series for each subject. We conduct a t-test to assess the null hypothesis that the two regions have the same connectivity between the two groups, and the result shows non-significant difference with p = 0.673. We note that our method detects the informative group difference that the region representative method fails to identify. Our method also provides richer information about voxel level connectivity. There are many methods that quantify connectivity, in a general sense, between pairs of ROIs across different clinical groups. However, as we limit our attention to compare connectivity between one pair of ROIs, we only compare our approach with the region representative method.
5. Discussion
Our proposed method is the first model that is able to use all available voxel pair connectivity information, while also capturing connectivity between region pairs. Given the massive number of voxel pairs across the entire brain (tens of billions), it is imperative for any voxel level approach to reduce the number of pairs considered, and we do so by focusing on particular ROIs. Another attribute of our model is that it naturally lends itself to network graph theoretical properties characterizing brain connectivity. Our Bayesian model yields inferences on numerous quantities reflecting brain connectivity. Our model enables the examination of covariate effects, and we illustrate this by considering how depression severity impacts the extent of brain connectivity. Estimation of regional brain connectivity is robust to potential bias due to imperfect image registration because the posterior inferences are not dependent on the specific locations of voxel pairs between a region pair but only on the marginal connectivity between a region pair. At the voxel level, we may also detect inter-region “hub” voxels as those that are connected to many other voxels with high probabilities, and the voxel pair connectivity pathways that connect the two regions with high probabilities across subjects. The relative size of the connected component, i.e. the proportion of connected voxel pairs, could arguably be affected by the physical distance between the pair of ROIs, e.g. yielding increased connectivity breadth between two adjacent regions. However, increased connectivity breadth between adjacent (or relatively close) regions arguably may have a valid neurophysiological basis. Regardless, the influence of distance should have little bearing on our statistical inferences, for instance the between group comparisons, since all subjects share the identical spatial adjacency information.
The mixture model in the first level determines the number of connected voxel pairs in a fashion that is similar to local fdr, rather than choosing hard cutoff points, which improves the accuracy. Although we choose Gaussian distributions for both components of the mixture model (which seems to work well for our data example), we could adopt more flexible distributions (e.g. a gamma distribution for the skewed connected component). Both the MDD study and our simulation study results indicate that the MCMC parameter estimation is satisfactory, and the trace plots of the chains behave normally. Although there are numerous voxel pairs per subject and many subjects are included, the computational load is manageable. All the programming including MCMC and imaging processing was implemented in MATLAB. With a sample size of 20 subjects and more than 40,000 voxel pairs, the MCMC procedure takes around 6 minutes for the simulation study and on average 15 minutes for the example data to run for one region pair on an Intel i7 3.4 GHz CPU and 24G memory PC. When modeling a larger number of region pairs, we would leverage the appropriate parallelization algorithms for faster implementation. It will cost approximately 60 hours to calculate all region pairs of the Brodmann atlas in 2mm standard space by using parallel computing.
In this article, our model does not explicitly account for the correlation between different pairs of voxel connections. In contrast to activity analyses, where spatial correlations are more readily addressed (Bowman et al., 2008; Derado et al., 2010), connectivity studies typically consider the association between time courses of two brain units as input data. Thus, the dependence structure between two connectivity metrics may rely on the spatial information of four nodes. Numerous complexities are involved in modeling this spatial dependence structure, and the commonly used assumptions for spatial dependence (e.g. second-order stationarity) may need to be cautiously tested and verified. Thus, while accounting for the spatial dependence structure between connections of brain areas (including region level and voxel level) is certainly an important consideration, the topic will require in depth investigation in future work. Our proposed model is also applicable to other functional connectivity metrics (e.g. mutual information and spectral coherence), and other brain connectivity measures such as structural connectivity from DTI data.
Supplementary Material
Acknowledgments
This work was partially supported by the National Institutes of Health grant U18 NS082143.
Footnotes
Supplementary figures referenced in Sections (4.1, 4.2) and the software are available with this article at the Biometrics website on Wiley Online Library.
Contributor Information
Shuo Chen, Email: shuochen@umd.edu, Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD, 20742, USA.
F. DuBois Bowman, Department of Biostatistics, Columbia University, Manhattan, NY, 10032, USA.
Helen S. Mayberg, School of Medicine, Emory University, Atlanta, GA, 30322, USA
References
- Achard S, Salvador R, Whitcher B, Suckling J, Bullmore E. A resilient, low-frequency, small-world human brain functional network with highly connected association cortical hubs. Journal of Neuroscience. 2005;26:63–72. doi: 10.1523/JNEUROSCI.3874-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowman FD, Caffo B, Bassett SS, Kilts C. A Bayesian hierarchical framework for spatial modeling of fMRI data. NeuroImage. 2008;39(1):146–156. doi: 10.1016/j.neuroimage.2007.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowman FD. Imaging Analysis. Annual Review of Statistics and Its Application. 2014;1:61–85. doi: 10.1146/annurev-statistics-022513-115611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bullmore E, Sporns O. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience. 2009;10:186–198. doi: 10.1038/nrn2575. [DOI] [PubMed] [Google Scholar]
- Calhoun VD, Adali T, Pearlson GD, Pekar JJ. A method for making group inferences from fMRI data using ICA. Human Brain Mapping. 2001;14(3):140–151. doi: 10.1002/hbm.1048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen LH. Poisson approximation for dependent trials. The Annals of Probability. 1975:534–545. [Google Scholar]
- Cordes D, Haughton VM, Arfanakis K, Carew JD, Turski PA, Moritz CH, Meyerand ME. Frequencies contributing to functional connectivity in the cerebral cortex in rs data. American Journal of Neuroradiology. 2001;22(7):1326–1333. [PMC free article] [PubMed] [Google Scholar]
- Derado G, Bowman FD, Kilts C. Modeling the spatial and temporal dependence in fMRI data. Biometrics. 2010;66:949–957. doi: 10.1111/j.1541-0420.2009.01355.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Do KA, Muller P, Tang PA. Bayesian mixture model for differential gene expression. Journal of the Royal Statistical Society: Series C. 2005;54(3):627–44. [Google Scholar]
- Dunlop BW, Binder B, Cubells F, Goodman G, Kelley E, Kinkead B, et al. Predictors of Remission in Depression to Individual and Combined Treatments (PReDICT): Study Protocol for a Randomized Controlled Trial. Trials. 2012;13(1):106. doi: 10.1186/1745-6215-13-106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Efron B. Large-scale simultaneous hypothesis testing. Journal of American Statistical Association. 2004;99:96–104. [Google Scholar]
- Fox MD, Zhang D, Snyder Z, Raichle E. The global signal and observed anticorrelated rs brain networks. Journal of Neurophysiology. 2009;101(6):3270–3283. doi: 10.1152/jn.90777.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fox MD, Buckner RL, White MP, Greicius MD, Pascual-Leone A. Efficacy of transcranial magnetic stimulation targets for depression is related to intrinsic functional connectivity with the subgenual cingulate. Biological Psychiatry. 2012;72(7):595–603. doi: 10.1016/j.biopsych.2012.04.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- George EI, McCulloch RE. Variable Selection via Gibbs Sampling. Journal of American Statistics Association. 1993;88(423):881–889. [Google Scholar]
- Greicius MD, Flores BH, Menon V, Glover H, Solvason B, Schatzberg F. rs functional connectivity in major depression: abnormally increased contributions from subgenual cingulate cortex and thalamus. Biological Psychiatry. 2007;62(5):429–37. doi: 10.1016/j.biopsych.2006.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greicius M. Resting-state functional connectivity in neuropsychiatric disorders. Current Opinion in Neurology. 2008;21(4):424–430. doi: 10.1097/WCO.0b013e328306f2c5. [DOI] [PubMed] [Google Scholar]
- Greicius MD, Krasnow B, Reiss AL, Menon V. Functional connectivity in the resting brain: a network analysis of the default mode hypothesis. Proceedings of the National Academy of Sciences. 2003;100(1):253–258. doi: 10.1073/pnas.0135058100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gutman D, Holtzheimer P, Behrens T, Johansen-Berg H, Mayberg H. A tractography analysis of two deep brain stimulation white matter targets for depression. Biological Psychiatry. 2009;65(4):276–82. doi: 10.1016/j.biopsych.2008.09.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garey L. Brodman’s localisation in the cerebral cortex: the principles of comparative localisation based on cytoarchitectonics. Springer; London: 1994. [Google Scholar]
- Jirsa K, McIntosh R. Handbook of Brain Connectivity. Springer; 2007. [Google Scholar]
- Maruish MR. The Use of Psychological Testing for Treatment Planning and Outcomes Assessment. Lawrence Erlbaum Associates; 1999. [Google Scholar]
- Mayberg HS, Lozano AM, Voon V, McNeely HE, Seminowicz D, Hamani C, et al. DBS for treatment-resistant depression. Neuron. 2005;45(5):51–60. doi: 10.1016/j.neuron.2005.02.014. [DOI] [PubMed] [Google Scholar]
- Müller P, Parmigiani G, Rice K. FDR and Bayesian multiple comparisons rules. Proc. Valencia/ISBA 8th World Meeting on Bayesian Statistics.2006. [Google Scholar]
- Patel RS, Bowman FD, Rilling JK. A Bayesian approach to determining connectivity of the human brain. Human Brain Mapping. 2006;27(3):267–276. doi: 10.1002/hbm.20182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simpson S, Bowman FD, Laurienti P. Analyzing Complex Functional Brain Networks: Fusing Statistics and Network Science. Statistics Surveys. 2013;7:1–36. doi: 10.1214/13-SS103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sporns O. Networks of the Brain. MIT Press; 2010. [Google Scholar]
- Schutte NS, John M. Sourcebook of Adult Assessment Strategies. New York: 1995. [Google Scholar]
- Seminowicz A, Mayberg H, Segal Z, Rafi-Tari S. Limbic-frontal circuitry in major depression: a path modeling metanalysis. Neuro Image. 2004;22(1):409–18. doi: 10.1016/j.neuroimage.2004.01.015. [DOI] [PubMed] [Google Scholar]
- Sun F, Miller L, D’Esposito M. Measuring interregional functional connectivity using coherence and partial coherence analyses of fmri data. Neuro Image. 2004;21:647–658. doi: 10.1016/j.neuroimage.2003.09.056. [DOI] [PubMed] [Google Scholar]
- Turk-Browne NB. Functional Interactions as Big Data in the Human Brain. Science. 2013;342(6158):580–584. doi: 10.1126/science.1238409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tzourio-Mazoyer N, Landeau B, Papathanassiou D, Crivello F, Mazoyer B, Joliot M. Automated anatomical labelling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single subject brain. Neuro Image. 2002;15:273–289. doi: 10.1006/nimg.2001.0978. [DOI] [PubMed] [Google Scholar]
- Weissenbacher A, Kasess C, Gerstl F, Lanzenberger R, Moser E, Windischberger C. Correlations and anticorrelations in resting-state functional connectivity MRI: a quantitative comparison of preprocessing strategies. Neuro Image. 2009;47(4):1408–1416. doi: 10.1016/j.neuroimage.2009.05.005. [DOI] [PubMed] [Google Scholar]
- Xue W, Bowman FD, Pileggi AV, Mayer AR. A multimodal approach for determining brain networks by jointly modeling functional and structural connectivity. Frontiers in Computational Neuroscience. 2015:9. doi: 10.3389/fncom.2015.00022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zatorre RJ. Predispositions and plasticity in music and speech learning: neural correlates and implications. Science. 2013;342(6158):585–589. doi: 10.1126/science.1238414. [DOI] [PubMed] [Google Scholar]
- Zalesky A. Whole-brain anatomical networks: does the choice of nodes matter? Neuro Image. 2010;50(3):970–983. doi: 10.1016/j.neuroimage.2009.12.027. [DOI] [PubMed] [Google Scholar]
- Zalesky A, Fornito A, Bullmore E. On the use of correlation as a measure of network connectivity. Neuro Image. 2012;60(4):2096–2106. doi: 10.1016/j.neuroimage.2012.02.001. [DOI] [PubMed] [Google Scholar]
- Zalesky A, Cocchi L, Fornito A, Murray MM, Bullmore E. Connectivity differences in brain networks. Neuro Image. 2012;60(2):1055–1062. doi: 10.1016/j.neuroimage.2012.01.068. [DOI] [PubMed] [Google Scholar]
- Zeng LL, Shen H, Liu L, Wang L, Li B, Fang P, et al. Identifying major depression using whole-brain functional connectivity: a multivariate pattern analysis. Brain. 2012;59:1498–1507. doi: 10.1093/brain/aws059. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.