Abstract
Alzheimer’s disease is the most common neurodegenerative disease. The aim of this study is to infer structural changes in brain connectivity resulting from disease progression using cortical thickness measurements from a cohort of participants comprising healthy controls, individuals with mild cognitive impairment, and Alzheimer’s disease patients. For this purpose, we develop a novel approach for inference of multiple networks with related edge values across groups. Specifically, we infer a Gaussian graphical model for each group within a joint framework, where we rely on Bayesian hierarchical priors to link the precision matrix entries across groups. Our proposal differs from existing approaches in that it flexibly learns which groups have the most similar edge values, and accounts for the strength of connection (rather than only edge presence or absence) when sharing information across groups. Our results identify key alterations in structural connectivity that may reflect disruptions to the healthy brain, such as decreased connectivity within the occipital lobe with increasing disease severity. We also illustrate the proposed method through simulations, where we demonstrate its performance in structure learning and precision matrix estimation with respect to alternative approaches.
Keywords: AIBL study, Alzheimer’s disease, Bayesian inference, Gaussian graphical model, MRI data
1 |. INTRODUCTION
Dementia is a leading cause of death, disability, and health expenditure in the elderly, with Alzheimer’s disease (AD) accounting for the majority of cases. Much research in AD aims at understanding how the disease mechanisms affect the brain, in an effort to aid in the diagnosis and treatment of those with AD. Here we are interested in exploring the changes in structural connectivity for different brain regions through the progression of the disease.
Traditional approaches to structural neuroimaging studies have focused on investigating cortical thickness, volume, and the rate of tissue loss as specific neurodegenerative biomarkers that relate to changes in the aging brain. More recently, attention has been given to the estimation of networks that capture the connectivity between cortical regions of interest and to the changes in connectivity that result from the progression of the neurological disease. It is widely known that correlated regions of interest are more likely to be part of a network and that networks are related to specific cognitive functions (Alexander-Bloch et al., 2013). During the progression of neurodegenerative disease, a person has a varying amount of cortical tissue loss, depending on their disease stage. As such, “connections” assessed throughout the disease trajectory represent coordinated changes in brain tissue, which are reflected in cortical thickness measures.
Statistical methods for network inference are a powerful tool to gain insight into the complex interactions that govern brain connectivity networks. When all samples are collected under similar conditions or reflect a single type of disease, methods such as the graphical lasso (Friedman et al., 2008) or Bayesian graphical approaches (Wang, 2012; Wang and Li, 2012) can be applied to infer a sparse graph and thereby learn the underlying network. These have been successfully used for the estimation of structural brain connectivity networks.
In studies where samples are obtained for different groups or subtypes of a disease, like the Australian Imaging, Biomarkers and Lifestyle (AIBL) study of ageing described below, separate estimation for each subgroup reduces statistical power by ignoring potential similarities across groups, while applying standard graphical model inference approaches to the pooled data across conditions leads to spurious findings. Recently, estimation methods for multiple graphical models have been proposed in the statistical literature, including penalization-based approaches that encourage either common edge selection or precision matrix similarity (Guo et al., 2011; Cai et al., 2015). In particular, Danaher et al. (2014) developed convex penalization schemes designed to encourage similar edge values (the fused graphical lasso) or shared structure (the group graphical lasso). More recent proposals encourage network similarity in a more tailored manner, assuming that the networks for each sample group are related within a tree structure (Oates and Mukherjee, 2014; Pierson et al., 2015), or, more generally, within an undirected weighted graph (Ma and Michailidis, 2016; Saegusa and Shojaie, 2016). These methods assume that the relationships across groups are either known a priori or learned via hierarchical clustering. More flexible approaches that employ a Bayesian framework to simultaneously learn the networks for each group and the extent to which these networks are similar have been proposed in Peterson et al. (2015) and Shaddox et al. (2018). More specifically, Peterson et al. (2015) proposed representing the inclusion of edges using latent binary indicators, and the sharing of edges across groups was encouraged via a Markov random field prior linking the indicators. Shaddox et al. (2018) improved upon Peterson et al.’s (2015) study by replacing the G-Wishart prior on the precision matrix within each group with a mixture prior that is more amenable to efficient sampling. However, Shaddox et al. (2018) still addresses only the inclusion or exclusion of edges, without consideration of edge strength or direction.
For the analyses of this paper, we propose a Bayesian Gaussian graphical modeling approach that retains the advantages of the approaches by Peterson et al. (2015) and Shaddox et al. (2018) in flexibly learning cross-group similarities within a joint framework, but that accounts for the similarity of edge values across groups, rather than only the binary presence or absence of those edges. Our framework allows us to not only learn the precision matrices within each group, but also to characterize the extent of shared edge values across the groups. Empirically, we demonstrate that this key feature results in a more accurate inference of the precision matrices. Unlike related approaches in the frequentist framework (Pierson et al., 2015; Saegusa and Shojaie, 2016), which require a separate, ad hoc step to learn the cross-group relationships, we can simultaneously learn both the within-group and cross-group relationships. Furthermore, even though penalization approaches are more scalable, they provide only point estimates of large networks, which are often unstable given limited sample sizes. Within our Bayesian approach, we can better quantify uncertainty in the estimates.
When applied to the data from the AIBL study, our method demonstrates that the majority of structural connections are preserved across all groups, but participants with AD have structural connectivity that is most unique compared to the other groups. In comparison to separate Bayesian estimation methods, the proposed method is able to identify a larger number of connections, reflecting the benefit of borrowing strength across groups. The fused graphical lasso, on the other hand, selects very dense graphs, which likely include a larger proportion of false positive edges, as also suggested by simulation studies in our current work and in previous investigations (Peterson et al., 2015; Shaddox et al., 2018). This issue was noted by Danaher et al. (2014), who recommended an approximation of the Akaike information criterion (AIC), which we apply here, as the best objective method for parameter selection, but acknowledged that cross-validation, AIC, and Bayesian information criterion (BIC) tend to favor models that are too large; the tendency to select overly dense graphs was also observed for standard graphical lasso (Liu et al., 2010).
1.1 |. The AIBL study
Here, we focus on cortical thickness measurements from participants in the AIBL cohort who were classified as healthy controls (HC), as having mild cognitive impairment (MCI), or as having AD. As a marker for neurodegeneration, cortical thickness is used to assess the atrophy of the cortical gray matter (GM) using MR images, and has been proposed as a more stable parameter for AD diagnosis than volume/density measures, because it is a more direct measure of GM atrophy (Singh et al., 2006). Investigation into GM atrophy allows the approximate measurement of neuronal loss, which is one of the underlying hallmarks of neurodegenerative diseases. Analyses using cortical thickness have been shown to successfully separate AD from MCI and HC (Querbes et al., 2009). Our aim is to examine how the progression of AD affects the structural networks of the brain.
The rest of the paper is organized as follows: In Section 2, we describe the proposed Bayesian joint graphical modeling approach and the posterior inference. We return to the case study in Section 3 and apply our method to estimate structural connectivity networks in subjects from cognitively normal to AD. In Section 4, we perform a simulation study and compare performance with alternative approaches. We conclude with a discussion in Section 5.
2 |. PROPOSED MODEL
Let K represent the number of sample groups (eg, HC, MCI, and AD) and let Xk be the nk × p data matrix (eg, cortical thickness on p brain regions) for the kth group, with k = 1, …, K. We assume that the observed values within each group arise from a multivariate normal distribution, where each row of Xk corresponds to an independent observation following the Np(μk, Σk) distribution. As we are interested in the covariance structure, rather than the means, we assume that the data are centered by group, so that μk = 0k for k = 1, …, K. The group-specific covariance matrix Σk has inverse Ωk = Σk⁻¹, with entries ωk,ij. The multivariate normal distribution has the special property that ωk,ij = 0 if and only if variables i and j are conditionally independent given the remaining variables (Dempster, 1972). Nonzero entries in the precision matrix Ωk therefore correspond to edges in the group-specific conditional dependence graph Gk, which can be represented as a symmetric binary matrix with elements gk,ij = 1 if edge (i, j) is included in graph k, and equal to zero otherwise.
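To make this correspondence concrete, the following R sketch (with illustrative values only, not the AIBL data) constructs a small precision matrix containing a structural zero, simulates Gaussian data from the implied covariance, and recovers a near-zero estimated partial correlation for that pair of variables.

```r
# Minimal sketch: zero precision entries <-> conditional independence
# (illustrative values only, not AIBL data)
library(MASS)  # for mvrnorm

set.seed(1)
Omega <- matrix(c( 1.0, 0.4, 0.0,
                   0.4, 1.0, 0.3,
                   0.0, 0.3, 1.0), 3, 3)   # omega_13 = 0
Sigma <- solve(Omega)                      # covariance implied by the precision
X     <- mvrnorm(n = 500, mu = rep(0, 3), Sigma = Sigma)

# Estimated partial correlations: -omega_ij / sqrt(omega_ii * omega_jj)
Omega_hat <- solve(cov(X))
pcor_hat  <- -cov2cor(Omega_hat)
diag(pcor_hat) <- 1
round(pcor_hat, 2)   # entry (1,3) should be close to 0
```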
In the Bayesian framework, inference of a graphical model is performed by tackling two interrelated sub-problems: selecting the model and learning the model parameters. Model selection is driven by identifying the graph structures Gk, while the precision matrices Ωk are the key model parameters. Unlike many of the existing Bayesian approaches for multiple undirected graphical models, which are based on prior distributions that link groups through the graph structures Gk, in this paper we propose a novel prior that links the groups through the parameters Ωk, accounting for edge strength rather than only edge presence or absence. The specification of such a prior requires some care as all precision matrices are constrained to be positive semidefinite.
2.1 |. Prior formulation
Our goal is to construct a prior on the precision matrices Ω1 …, ΩK that enables inference of a graphical model for each group, encourages similar edge values when appropriate, and allows for computationally tractable posterior inference. There have been a number of prior distributions proposed for the precision matrix Ω in a Gaussian graphical model. Early approaches required restrictive assumptions on the graph structure (in particular, decomposability) to allow tractable sampling (Dawid and Lauritzen, 1993; Giudici and Green, 1999). Later methods included shrinkage priors (Wang, 2012), which offered computational scalability but not graph selection, and conjugate priors with no restriction on the graph structure (Wang and Li, 2012), which, due to limited computational scalability, could only be applied in the moderate p setting with less than 100 variables in a single network.
Here, we build on the stochastic search structure learning model of Wang (2015), which assumes a normal mixture prior on the off-diagonal entries of the precision matrix, enabling graph selection with no restrictions on the graph structure within a computationally efficient sampling framework. To achieve this, we define a joint prior distribution on the precision matrices Ω1, …, ΩK that encourages similarity across groups in terms of the off-diagonal elements of the precision matrices. Specifically, we consider the continuous shrinkage prior (Wang, 2012, 2015) for K networks defined as
$$p(\Omega_1, \ldots, \Omega_K \mid \{\Theta_{ij}\}, \lambda) \propto \prod_{i<j} \mathrm{N}_K(\boldsymbol{\omega}_{ij} \mid \mathbf{0}, \Theta_{ij}) \prod_{k=1}^{K} \Bigg[ \prod_{i=1}^{p} \mathrm{Exp}\Big(\omega_{k,ii} \mid \tfrac{\lambda}{2}\Big) \Bigg] \mathbf{1}\{\Omega_k \in M^{+}\}, \quad (1)$$
where ωij = (ω1,ij, …, ωK,ij) is the vector of precision matrix entries corresponding to edge (i, j) across the K groups, λ > 0 is a fixed hyperparameter, and M+ denotes the space of p × p positive definite symmetric matrices. The first term in the joint prior specifies a multivariate normal prior with covariance matrix Θij on the vector of precision matrix entries ωij corresponding to edge (i, j) across groups. To define a prior on Θij, we work with the decomposition Θij = diag(νij) ⋅ Φ ⋅diag(νij), where νij is a K × 1 vector of standard deviations specific to edge (i, j), and Φ is a K × K matrix shared across all (i, j) pairs with 1s along the diagonal. To ensure that Θij is positive definite, the only requirements are that the standard deviations νk,ij must be positive and Φ must be a valid correlation matrix. Given these constraints, we can then define a mixture prior on the edge-specific elements of νij that enables the selection of edges in each graph, and a prior on the off-diagonal entries of Φ that allows us to model the relatedness of edge values across the sample groups. Following Wang (2015), the standard deviations νk,ij are set to either a large or small value depending on whether edge (i, j) is included in graph k, that is, νk,ij = υ1 if gk,ij = 1, and νk,ij = υ0 otherwise. The hyperparameters υ1 > 0 and υ0 > 0 are fixed to large and small values, respectively. Small values of υ0 will shrink the value of ωk,ij for edges that are not included in the graph toward 0. This prior indirectly encourages the selection of similar graphs in related networks. Specifically, a small value of ωk,ij will encourage small values of ωl,ij for any other group l and in turn the exclusion of edge (i, j) in both groups k and l. Similarly, a large value of ωk,ij will encourage large values of ωl,ij and the inclusion of edge (i, j) in groups k and l. Networks k and l are considered related if the posterior distribution of the (k, l) element of Φ is concentrated on relatively larger values.
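As a small illustration of this construction, the R sketch below (with made-up values of the hyperparameters and of Φ, not those used in our analysis) assembles Θij = diag(νij) ⋅ Φ ⋅ diag(νij) for a single edge across K = 3 groups, with the edge included in two groups and excluded from the third, and verifies that the resulting matrix is positive definite.

```r
# Minimal sketch of Theta_ij = diag(nu_ij) %*% Phi %*% diag(nu_ij).
# Hyperparameter values below are illustrative, not those used in the paper.
K    <- 3
v0   <- 0.01         # small sd for excluded edges
v1   <- 10           # large sd for included edges
g_ij <- c(1, 1, 0)   # edge (i,j) included in groups 1 and 2, excluded in group 3
nu_ij <- ifelse(g_ij == 1, v1, v0)

Phi <- matrix(c(1.0, 0.6, 0.3,
                0.6, 1.0, 0.4,
                0.3, 0.4, 1.0), K, K)   # a valid correlation matrix

Theta_ij <- diag(nu_ij) %*% Phi %*% diag(nu_ij)
all(eigen(Theta_ij, symmetric = TRUE, only.values = TRUE)$values > 0)  # TRUE
```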
For the prior on the graphs G1, …, GK, we assume an independent Bernoulli distribution
$$p(G_1, \ldots, G_K \mid \pi) \propto \prod_{k=1}^{K} \prod_{i<j} \pi^{g_{k,ij}} (1 - \pi)^{1 - g_{k,ij}}. \quad (2)$$
This prior is analytically defined only up to a normalizing constant. As discussed in Wang (2015), the unknown normalizing constants of priors (1) and (2) are proportional and cancel out in the joint prior on (Ωk, Gk). Consequently, the parameter π is not exactly the prior probability of edge inclusion; however, as shown by Wang (2015), the effect of these unknown normalizing constants on the posterior inference is extremely mild, and the parameter π can be easily calibrated to achieve a prespecified level of sparsity.
Recall that Φ is a correlation matrix, and must therefore have all diagonal entries fixed to 1 and be positive definite. To specify the prior on Φ, we rely on the joint uniform prior:
$$p(\Phi) \propto \mathbf{1}\{\Phi \in \mathcal{C}_K\}, \quad (3)$$
where 𝒞K denotes the space of valid K × K correlation matrices, that is, positive definite symmetric matrices Φ such that ϕjk = 1 for all j = k and |ϕjk| < 1 for all j ≠ k. When Φ = I, the precision matrices for each group are independent, and the proposed model reduces to that of Wang (2015) applied separately to each sample group.
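To see this reduction explicitly, note that the factorization below follows directly from the decomposition of Θij given above:

$$\Phi = I_K \;\Longrightarrow\; \Theta_{ij} = \operatorname{diag}(\nu_{ij})\, I_K\, \operatorname{diag}(\nu_{ij}) = \operatorname{diag}\!\big(\nu_{1,ij}^2, \ldots, \nu_{K,ij}^2\big), \qquad \mathrm{N}_K(\boldsymbol{\omega}_{ij} \mid \mathbf{0}, \Theta_{ij}) = \prod_{k=1}^{K} \mathrm{N}\!\big(\omega_{k,ij} \mid 0, \nu_{k,ij}^2\big),$$

so that prior (1) factorizes over groups into K independent copies of the prior of Wang (2015).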
Alternative priors could be defined on the precision matrices Ω1, …, ΩK that ensure the support to be constrained to the space of symmetric positive semidefinite matrices M+. However, our proposed prior has the key advantage of computational tractability. In the next section, we show how we can define a sampler that is automatically restricted to the targeted support M+. In our model, cross-group similarity is defined by Φ, which links the elements of the precision matrices, whereas previous approaches (Peterson et al., 2015; Shaddox et al., 2018) encouraged similarity through a joint prior on the adjacency matrices G1, …, GK.
2.2 |. MCMC algorithm for posterior inference
We rely on Markov chain Monte Carlo (MCMC) to generate a sample from the joint posterior. At a high level, the sampling steps are as follows (see also Supplementary Material):
Step 1: For each sample group k = 1, …, K, we first update the precision matrix Ωk using a block Gibbs sampler with closed-form conditional distributions for each column, as in Wang (2015), and then update Gk by drawing each edge from an independent Bernoulli.
Step 2: We sample the entire correlation matrix Φ at once using a Metropolis-within-Gibbs step following the parameter expansion method of Liu and Daniels (2006).
After discarding the results from the burn-in period, we take the median model (Barbieri and Berger, 2004) as the posterior selected value for the graph Gk for each group. Specifically, we select edges gk,ij with marginal posterior probability of inclusion ≥ 0.5, as in Wang (2015). To obtain a posterior estimate of the precision matrix consistent with the selected graph, we resample Ωk conditional on the posterior estimate of Φ and the selected value of Gk.
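As a simple illustration of the median-model rule, the R sketch below estimates the PPI of each edge in one group as the proportion of post-burn-in MCMC samples in which that edge is included, and retains the edges with PPI ≥ 0.5; the array of sampled adjacency matrices here is fabricated for illustration, not output from our sampler.

```r
# Minimal sketch of median-model graph selection for one group.
# G_samples is assumed to be a p x p x n_iter array of sampled 0/1 adjacency
# matrices after burn-in; here it is fabricated for illustration.
set.seed(2)
p <- 5; n_iter <- 1000
edge_prob <- matrix(runif(p * p), p, p)                  # hypothetical "true" PPIs
G_samples <- array(rbinom(p * p * n_iter, 1, rep(edge_prob, n_iter)),
                   dim = c(p, p, n_iter))

ppi   <- apply(G_samples, c(1, 2), mean)   # posterior probability of inclusion per edge
G_hat <- (ppi >= 0.5) * 1                  # median-model adjacency for this group
G_hat[lower.tri(G_hat, diag = TRUE)] <- 0  # keep the upper triangle (i < j)
```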
3 |. STRUCTURAL CONNECTIVITY PATTERNS IN THE AIBL COHORT
3.1 |. Subjects and MRI data processing
We have disease stage information and measurements of cortical thickness across 100 regions of interest in the brain from a total of 584 subjects. Here we focus on imaging data and cognitive assessments from the last follow-up time point available. The subjects were divided into four groups: high performing HC (hpHC, n = 143), HC (n = 145), MCI (n = 148), and AD (n = 148). To obtain this classification, subjects were first evaluated by a clinician for current diagnosis and categorized as HC, MCI, or AD. HC subjects were further divided into hpHC and HC using eight different cognitive composite scores representing different cognitive domains. Magnetic resonance imaging (MRI) was performed on each subject, and the resulting images were parcellated into 100 regions of interest (ROIs). Mean cortical thickness was computed in each ROI, and used in subsequent analysis. This gave us data on p = 100 brain regions for the K = 4 groups of subjects. Within each group, data were centered. Additional details on the cognitive scoring and MRI data processing, along with a list of ROIs grouped by lobe of the brain, are provided in the Supplementary Material.
3.2 |. Application of the proposed method
The application of our model requires the specification of a few hyperparameters. Here we provide details on the specification we used to obtain the results reported below and refer readers to the sensitivity analysis found in the Supplementary Material for more insights on parameter selection. In particular, priors (1) and (2) require the choice of the hyperparameters ν0, ν1, and π. These were set to ν0 = 0.01, ν1 = 15, and π = 2/(p − 1). The parameters ν0 and ν1 were chosen so that the network structure results were sparse, while the selection of π was based on the default setting recommended in Wang (2015). As a guideline, increasing ν0 while holding the ratio between ν0 and ν1 fixed will result in sparser graphs, as shown in the sensitivity analysis, which agrees with the sensitivity analysis provided in Wang (2015). Increasing the ratio between ν0 and ν1 while holding ν0 fixed will likewise increase the sparsity of the inferred graphs.
The results we report below were obtained by running two MCMC chains with 20 000 iterations, after a burn-in of 5000 iterations. Posterior probabilities of inclusion (PPI) for each edge were compared for the two chains to check for convergence. A correlation of 0.997 was found between these two posterior samples. We also used the Gelman and Rubin’s convergence diagnostic (Gelman and Rubin, 1992) to check for signs of nonconvergence of the individual parameters of the estimated Φ matrix and the estimated precision matrices. Those statistics were all below 1.1, clearly indicating that the MCMC chains were run for a sufficient number of iterations. The results reported here were obtained by pooling together the outputs from the two chains to give a total of 20 000 MCMC samples.
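The convergence checks described above can be reproduced along the lines of the R sketch below, which uses the coda package; the object names and the fabricated chains are placeholders, not output from our sampler.

```r
# Minimal sketch of the convergence checks, assuming two chains.
# ppi_chain1 / ppi_chain2: vectors of edge PPIs from each chain;
# omega_trace1 / omega_trace2: matrices of MCMC draws (iterations x parameters)
# for selected precision-matrix entries. All objects here are hypothetical.
library(coda)

set.seed(3)
ppi_chain1 <- runif(100)
ppi_chain2 <- ppi_chain1 + rnorm(100, sd = 0.01)
cor(ppi_chain1, ppi_chain2)                # agreement of PPIs across chains

omega_trace1 <- matrix(rnorm(2000), ncol = 4)
omega_trace2 <- matrix(rnorm(2000), ncol = 4)
chains <- mcmc.list(mcmc(omega_trace1), mcmc(omega_trace2))
gelman.diag(chains, multivariate = FALSE)  # PSRF values below 1.1 suggest convergence
```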
3.3 |. Results
Figure 1 shows histograms of the PPIs for each group and scatter plots of the PPIs across pairs of groups. The off-diagonal panels show, for each pair of groups, scatter plots of the PPIs (upper triangle) and the percentage of PPIs falling in each quadrant (lower triangle). In the scatter plots, the points in the upper right quadrants indicate edges that belong to the median model in both groups (shared edges), whereas points in the lower right and upper left quadrants indicate edges that were selected in one group but not the other (differential edges). The points in the lower left quadrant correspond to edges selected in neither group. These plots illustrate that the edge selection is fairly sparse overall, with a high concentration of PPIs close to 0 in the histograms, and that there are a number of edges that are strongly supported as shared across groups, as shown by the dense cluster of points in the upper right corner of the off-diagonal plots. Finally, we can see that many of the PPI values are the same across groups, as shown by the linear trend in the upper triangle plots. Although we generally do not observe a strong trend in terms of network differences across groups, we note that AD differentiates itself most from the other groups, as its PPI values are relatively more dispersed from the linear trend in the comparisons with the other groups. Additionally, heatmaps of the PPIs within each group are shown in Figure 2. This figure appears in color in the electronic version of this article, and any mention of color refers to that version. In these plots, the ROIs are grouped by brain lobe, specifically, frontal, temporal, parietal, occipital, and limbic cortex. These probabilities, which can only be obtained via a Bayesian approach, represent the confidence we have in the presence of each edge, and provide a useful summary of the uncertainty regarding edge selection. As expected, larger PPI values are observed within lobes than across lobes for all disease stages.
To allow an in-depth view of the estimated networks, sub-networks corresponding to the individual lobes are shown in Figure 3, where the edges shown are those selected in the median model; the estimated graphs Gk for each group across all lobes are plotted in Supplementary Figure S2. In these circular plots, the left side represents the left brain hemisphere, and the right side represents the right brain hemisphere. In all plots, blue lines indicate edges shared by all four groups, red lines indicate edges unique to an individual group, and black lines those shared by two or more groups. This figure appears in color in the electronic version of this article, and any mention of color refers to that version. The strongest pattern visible in the graphs is the set of horizontal blue lines connecting the corresponding regions in the right and left hemispheres of the brain. The pattern of strong correlations between contralateral homologous regions of the cortex in structural imaging has been previously observed, for example by Mechelli et al. (2005).
Our findings are quantified in Table 1, which summarizes the numbers of edges included per group and shared across groups in the networks for all ROIs of Supplementary Figure S2 and the lobe-specific networks of Figure 3. Within each subtable, the diagonal values represent the numbers of edges present in each group, and the off-diagonal values are the numbers of shared edges between pairs of groups. Finally, the numbers of edges that are unique to a specific group are reported in parentheses along the diagonals. From this, we see that the healthy control groups have slightly more edges than the cognitively impaired groups. We can also see that there is a decrease in connections in the occipital lobe as AD progresses. Additional ROI-specific patterns can be found in Table S2 in Supplementary Material, which shows total number of edges for each ROI pair in each group.
TABLE 1. Numbers of edges included in each group (diagonal), shared between pairs of groups (off-diagonal), and unique to a single group (in parentheses), for all ROIs and for each lobe.

| All ROIs | hpHC | HC | MCI | AD |
|---|---|---|---|---|
| hpHC | 231 (1) | | | |
| HC | 223 | 231 (3) | | |
| MCI | 222 | 217 | 223 (1) | |
| AD | 219 | 222 | 214 | 227 (3) |

| Frontal | hpHC | HC | MCI | AD |
|---|---|---|---|---|
| hpHC | 89 (1) | | | |
| HC | 86 | 91 (2) | | |
| MCI | 87 | 85 | 87 (0) | |
| AD | 86 | 89 | 85 | 89 (0) |

| Temporal | hpHC | HC | MCI | AD |
|---|---|---|---|---|
| hpHC | 25 (0) | | | |
| HC | 25 | 25 (0) | | |
| MCI | 25 | 25 | 25 (0) | |
| AD | 25 | 25 | 25 | 25 (0) |

| Parietal | hpHC | HC | MCI | AD |
|---|---|---|---|---|
| hpHC | 19 (0) | | | |
| HC | 19 | 19 (0) | | |
| MCI | 19 | 19 | 19 (0) | |
| AD | 19 | 19 | 19 | 19 (0) |

| Occipital | hpHC | HC | MCI | AD |
|---|---|---|---|---|
| hpHC | 30 (0) | | | |
| HC | 29 | 29 (0) | | |
| MCI | 27 | 26 | 27 (0) | |
| AD | 26 | 26 | 25 | 27 (1) |

| Limbic | hpHC | HC | MCI | AD |
|---|---|---|---|---|
| hpHC | 12 (0) | | | |
| HC | 12 | 13 (0) | | |
| MCI | 11 | 11 | 11 (0) | |
| AD | 11 | 12 | 11 | 13 (1) |
Our method also produces estimated values of the elements of the Φ matrix, which capture similarity in the precision matrix entries between the different subject groups. Notably, as they are based on the joint posterior distribution, these values account for uncertainty in the estimation of the group-specific precision matrices.
These values, which reflect the similarity in edge strength across groups, provide a complementary look at the patterns of structural connectivity. In particular, the values of Φ show that hpHC and AD are the least similar. They also show that HC and AD are related, which is consistent with Table 1, where HC and AD share a large number of edges. The similarity of HC and AD may be caused by the way hpHC and HC were separated, as the HC group may have a higher propensity to develop AD. Our results also support similarity of the hpHC and MCI groups. Although these findings suggest there may be an underlying classification other than AD that influences the structural connectivity, the values we observe are generally large, supporting a high degree of network similarity across groups.
We conclude our analysis by summarizing the network structure of the estimated graphs via some graph metrics commonly used in neuroimaging (Yao et al., 2010). Specifically, we calculated the clustering coefficient γ, the characteristic path length λ, and the small world coefficient σ = γ/λ. See Yao et al. (2010), and references therein, for a formal definition. From a quantitative perspective, if both λ ≈ 1 and γ > 1, and consequently σ > 1, a network is said to exhibit small-world characteristics, which means in a qualitative sense that any node can be reached from any other node in a small number of steps. Disconnected nodes were removed when calculating the characteristic path length. Based on the estimated values of λ and γ, we obtain small world coefficients σ of 1.717, 1.635, 1.627, and 1.475 for hpHC, HC, MCI, and AD, respectively. We observe that σ is greater than 1 for all the groups, but steadily decreases during the progression of AD. Small-world characteristics in the brain networks of AD patients have also been observed by other authors (He et al., 2008). Our conclusions on the differences in structural connectivity across groups are descriptive in nature, as our findings generally support a high degree of overlap in the structural connectivity networks.
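For a single estimated adjacency matrix, the raw clustering coefficient and characteristic path length can be computed with the igraph R package as sketched below; note that the γ and λ reported above are normalized versions of these quantities (relative to matched random networks, as in Yao et al., 2010), so an additional normalization step, not shown here, is required.

```r
# Minimal sketch of the graph metrics, using igraph on a single estimated graph.
# G_hat is assumed to be a symmetric 0/1 adjacency matrix (fabricated here).
library(igraph)

set.seed(4)
p <- 20
G_hat <- matrix(rbinom(p * p, 1, 0.15), p, p)
G_hat[lower.tri(G_hat, diag = TRUE)] <- 0
G_hat <- G_hat + t(G_hat)

g <- graph_from_adjacency_matrix(G_hat, mode = "undirected")
g <- delete_vertices(g, which(degree(g) == 0))   # drop disconnected nodes

C_obs <- transitivity(g, type = "global")        # clustering coefficient
L_obs <- mean_distance(g)                        # characteristic path length
# gamma and lambda in the text divide C_obs and L_obs by the corresponding
# quantities from matched random graphs; sigma = gamma / lambda.
```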
3.4 |. Results from alternative approaches
For additional perspective, we compare our results to those of the fused graphical lasso (Danaher et al., 2014), separate graph estimation in the Bayesian framework (Wang, 2015), and the joint estimation approach of Shaddox et al. (2018). For the fused graphical lasso, λ1 and λ2 were selected by performing a grid search to find the combination of values minimizing the AIC, as recommended in Danaher et al. (2014). Separate Bayesian inference was applied with the same settings for ν0, ν1, λ, π as in the linked method. Shaddox et al. (2018) was applied with ν0 = 0.50, ν1 = 15, λ = 1, a = 1, b = 4, α = 2, β = 5, and w = 0.5.
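As a rough illustration of this comparison, the R sketch below fits the fused graphical lasso with the JGL package of Danaher et al. (2014) and selects (λ1, λ2) over a grid using an AIC-style criterion in the spirit of that paper; the data list, the grid values, and the exact form of the criterion are assumptions made for illustration rather than the settings used in our analysis.

```r
# Minimal sketch of a fused graphical lasso fit with an AIC grid search.
# Assumes the JGL package is installed; X_list is a hypothetical list of
# centered n_k x p data matrices, one per group.
library(JGL)

set.seed(5)
p <- 10
X_list <- lapply(c(50, 60, 55), function(n) matrix(rnorm(n * p), n, p))

# AIC-style criterion in the spirit of Danaher et al. (2014)
aic_jgl <- function(fit, X_list) {
  sum(sapply(seq_along(X_list), function(k) {
    n_k     <- nrow(X_list[[k]])
    S_k     <- cov(X_list[[k]])
    Theta_k <- fit$theta[[k]]
    E_k     <- sum(Theta_k[upper.tri(Theta_k)] != 0)   # number of selected edges
    n_k * sum(diag(S_k %*% Theta_k)) -
      n_k * as.numeric(determinant(Theta_k)$modulus) + 2 * E_k
  }))
}

grid <- expand.grid(lambda1 = c(0.05, 0.1, 0.2), lambda2 = c(0.01, 0.05, 0.1))
aics <- apply(grid, 1, function(par) {
  fit <- JGL(Y = X_list, penalty = "fused",
             lambda1 = par[["lambda1"]], lambda2 = par[["lambda2"]],
             return.whole.theta = TRUE)
  aic_jgl(fit, X_list)
})
grid[which.min(aics), ]   # selected (lambda1, lambda2) combination
```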
For each of the brain regions, Table 2 shows the number of total edges for each method on the diagonal, and the number of common edges between pairs of methods on the off-diagonal. Although the ground truth is not known, these results suggest that the proposed linked precision matrix method generally improves power over separate estimation: a large majority of the edges selected using separate estimation are also discovered under the proposed method, which selects a considerably larger number of edges overall, while under separate estimation the number of selected edges increases only slightly across disease stages. We see a similarly large overlap of selected edges with the joint Bayesian estimation, although the joint Bayesian method leads to models that are more dense, due, in part, to the larger number of parameters of that model that control the sparsity. The fused graphical lasso tends to select models that are even denser. This is because the AIC is not optimal for variable selection, tending to result in models that are not sufficiently sparse.
TABLE 2. For each group and set of brain regions, numbers of edges selected by each method (diagonal) and numbers of edges selected in common by pairs of methods (off-diagonal).

| All regions | hpHC Fused | hpHC Separate | hpHC Joint | hpHC Linked | HC Fused | HC Separate | HC Joint | HC Linked | MCI Fused | MCI Separate | MCI Joint | MCI Linked | AD Fused | AD Separate | AD Joint | AD Linked |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fused | 1486 | | | | 1495 | | | | 1345 | | | | 1218 | | | |
| Separate | 167 | 168 | | | 175 | 175 | | | 181 | 181 | | | 185 | 185 | | |
| Joint | 578 | 168 | 670 | | 576 | 175 | 679 | | 534 | 181 | 652 | | 587 | 185 | 688 | |
| Linked | 229 | 142 | 215 | 231 | 229 | 147 | 218 | 231 | 222 | 160 | 221 | 223 | 226 | 165 | 223 | 227 |

| Frontal lobe | hpHC Fused | hpHC Separate | hpHC Joint | hpHC Linked | HC Fused | HC Separate | HC Joint | HC Linked | MCI Fused | MCI Separate | MCI Joint | MCI Linked | AD Fused | AD Separate | AD Joint | AD Linked |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fused | 459 | | | | 418 | | | | 421 | | | | 399 | | | |
| Separate | 62 | 62 | | | 68 | 68 | | | 66 | 66 | | | 68 | 68 | | |
| Joint | 204 | 62 | 216 | | 198 | 68 | 220 | | 196 | 66 | 212 | | 211 | 68 | 225 | |
| Linked | 88 | 53 | 82 | 89 | 90 | 59 | 85 | 91 | 87 | 61 | 86 | 87 | 89 | 62 | 88 | 89 |

| Temporal lobe | hpHC Fused | hpHC Separate | hpHC Joint | hpHC Linked | HC Fused | HC Separate | HC Joint | HC Linked | MCI Fused | MCI Separate | MCI Joint | MCI Linked | AD Fused | AD Separate | AD Joint | AD Linked |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fused | 73 | | | | 76 | | | | 66 | | | | 65 | | | |
| Separate | 23 | 23 | | | 20 | 20 | | | 24 | 24 | | | 24 | 24 | | |
| Joint | 45 | 23 | 48 | | 49 | 20 | 50 | | 46 | 24 | 47 | | 49 | 24 | 50 | |
| Linked | 25 | 22 | 24 | 25 | 25 | 19 | 25 | 25 | 25 | 22 | 25 | 25 | 25 | 23 | 25 | 25 |

| Parietal lobe | hpHC Fused | hpHC Separate | hpHC Joint | hpHC Linked | HC Fused | HC Separate | HC Joint | HC Linked | MCI Fused | MCI Separate | MCI Joint | MCI Linked | AD Fused | AD Separate | AD Joint | AD Linked |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fused | 55 | | | | 52 | | | | 54 | | | | 48 | | | |
| Separate | 16 | 16 | | | 19 | 19 | | | 15 | 15 | | | 16 | 16 | | |
| Joint | 40 | 16 | 40 | | 40 | 19 | 40 | | 37 | 15 | 37 | | 34 | 16 | 35 | |
| Linked | 19 | 16 | 19 | 19 | 19 | 16 | 19 | 19 | 19 | 15 | 19 | 19 | 19 | 13 | 19 | 19 |

| Occipital lobe | hpHC Fused | hpHC Separate | hpHC Joint | hpHC Linked | HC Fused | HC Separate | HC Joint | HC Linked | MCI Fused | MCI Separate | MCI Joint | MCI Linked | AD Fused | AD Separate | AD Joint | AD Linked |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fused | 97 | | | | 104 | | | | 89 | | | | 75 | | | |
| Separate | 22 | 22 | | | 27 | 27 | | | 25 | 25 | | | 23 | 23 | | |
| Joint | 49 | 22 | 52 | | 56 | 27 | 56 | | 46 | 25 | 46 | | 48 | 23 | 48 | |
| Linked | 30 | 22 | 29 | 30 | 29 | 23 | 29 | 29 | 27 | 23 | 27 | 27 | 27 | 22 | 27 | 27 |

| Limbic lobe | hpHC Fused | hpHC Separate | hpHC Joint | hpHC Linked | HC Fused | HC Separate | HC Joint | HC Linked | MCI Fused | MCI Separate | MCI Joint | MCI Linked | AD Fused | AD Separate | AD Joint | AD Linked |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fused | 31 | | | | 27 | | | | 24 | | | | 29 | | | |
| Separate | 9 | 9 | | | 10 | 10 | | | 10 | 10 | | | 12 | 12 | | |
| Joint | 14 | 9 | 16 | | 16 | 10 | 16 | | 13 | 10 | 13 | | 17 | 12 | 17 | |
| Linked | 12 | 9 | 12 | 12 | 13 | 10 | 13 | 13 | 11 | 10 | 11 | 11 | 13 | 11 | 12 | 13 |
4 |. SIMULATION STUDY
We present here a simulation study to compare performance across methods in learning graphs with related structure. The simulation is designed to mimic the real data application in terms of the number of variables, number of subjects per group, and graph structures.
We consider a setting with K = 3 groups, p = 100 variables, and n = 150 observations per group, where the underlying graph and precision matrix for each group are constructed as follows. G1, the graph for the first group, consists of five communities, each with 20 variables. Within each community, the nodes are connected via a scale-free network. There are no connections across communities in G1. The precision matrix entries in Ω1 for edges are sampled independently from the uniform distribution on [−0.6, −0.4] ⋃ [0.4, 0.6], whereas entries for missing edges are set to 0. To obtain G2, five edges are removed from G1 and five new edges added at random, so that now there are some cross-community connections. The entries in Ω2 for the new edges are generated in a similar fashion as for Ω1, whereas the entries for the edges removed are set to zero. To ensure positive definiteness, Ω1 and Ω2 are each adjusted following the approach in Danaher et al. (2014). To obtain G3, 20 edges are removed from the graph for group 2, and the corresponding 20 entries in Ω2 are set to zero to obtain Ω3. These steps result in graphs G1 and G2 that share 180 of 185 edges (97.3%), graphs G2 and G3 that share 165 of 185 edges (89.2%), and graphs G1 and G3 that share 162 of the 185 edges in G1 (87.6%). The correlations between the off-diagonal elements of the precision matrices are 0.98 between Ω1 and Ω2, 0.94 between Ω2 and Ω3, and 0.93 between Ω1 and Ω3. To simulate the data, we generate n samples per group from the multivariate normal distribution Np(0, Ωk⁻¹), for k = 1, 2, 3. Below we report results obtained over 25 simulated data sets.
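The R sketch below gives a rough analogue of this construction for a single group: five scale-free communities of 20 nodes, precision entries for edges drawn from [−0.6, −0.4] ⋃ [0.4, 0.6], and a simple diagonal-dominance adjustment to enforce positive definiteness (a simplification of the adjustment of Danaher et al. (2014) used in the actual simulation).

```r
# Minimal sketch of the simulated-data construction for one group.
# The positive-definiteness adjustment below (diagonal dominance) is a
# simplification; the paper follows the adjustment of Danaher et al. (2014).
library(igraph)
library(MASS)

set.seed(6)
p <- 100; n <- 150; n_comm <- 5; size <- p / n_comm

# Five scale-free communities of 20 nodes each, no cross-community edges
A <- matrix(0, p, p)
for (c in seq_len(n_comm)) {
  idx <- ((c - 1) * size + 1):(c * size)
  g_c <- sample_pa(size, directed = FALSE)        # scale-free community
  A[idx, idx] <- as.matrix(as_adjacency_matrix(g_c))
}

# Precision entries for edges drawn from [-0.6, -0.4] U [0.4, 0.6]
Omega <- matrix(0, p, p)
ut      <- upper.tri(Omega) & A == 1
n_edges <- sum(ut)
Omega[ut] <- runif(n_edges, 0.4, 0.6) * sample(c(-1, 1), n_edges, replace = TRUE)
Omega <- Omega + t(Omega)
diag(Omega) <- rowSums(abs(Omega)) + 0.1          # crude positive-definite adjustment

X <- mvrnorm(n, mu = rep(0, p), Sigma = solve(Omega))  # one group's data
```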
4.1 |. Performance comparison
We compare the following methods: fused graphical lasso (Danaher et al., 2014), group graphical lasso (Danaher et al., 2014), Bayesian inference applied separately for each group (Wang, 2015), Bayesian joint inference relating edge probabilities (Shaddox et al., 2018), and the proposed Bayesian joint inference method linking the precision matrix entries. For the lasso methods, the within-group penalty λ1 and cross-group penalty λ2 were selected using a grid search to identify the combination that minimizes the AIC. Both separate Bayesian inference and the proposed linked precision matrix approach were applied using the parameter setting ν0 = 0.01, ν1 = 0.1, λ = 1, and π = 2/(p − 1). Shaddox et al. (2018) was applied using ν0 = 0.05, ν1 = 0.5, λ = 1, a = 1, b = 16, α = 2, β = 5, and w = 0.5, where the parameters were chosen to achieve a similar number of selected edges as obtained under the proposed linked precision matrix approach.
All Bayesian methods were run with 10 000 iterations as burn-in and 20 000 iterations for posterior inference. For the Bayesian methods, we take the posterior selected graph as the median model, and compute the posterior estimate of the precision matrices Ωk as the MCMC average when the precision matrices are resampled conditional on the graphs and the posterior estimate of Φ from the initial run (for our method), or conditional on the graph using separate mixture priors (for separate and joint estimation approaches).
The performance across methods in terms of edge selection and differential edge selection is compared on the basis of true positive rate (TPR), false positive rate (FPR), Matthews correlation coefficient (MCC), and area under the curve (AUC). A detailed description of how these performance metrics were computed is provided in the Supplementary Material. The performance results for graph and precision matrix learning are given in Table 3. In general, the Bayesian methods tend to favor sparser graphs, and achieve quite low FPRs. The lasso methods tend to select somewhat denser graphs, and have correspondingly higher TPRs and FPRs. The proposed linked precision matrix method achieves the best overall performance, as demonstrated by its high MCC value. The AUC, which is computed across a range of model sizes, shows that the lasso methods and the proposed linked precision matrix approach have very good accuracy. For the lasso methods, the AUC was computed for multiple values of the cross-group penalty parameter while varying the within-group penalty, and the best was included here. Thus, the reported AUCs for these methods are likely to err on the optimistic side. Finally, the Frobenius loss is minimized under the proposed method.
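For reference, the short R function below computes TPR, FPR, and MCC for edge selection in one group by comparing the upper triangles of an estimated and a true adjacency matrix; this is a generic implementation, and the exact conventions used in our study (eg, the definition of differential edges) are given in the Supplementary Material.

```r
# Minimal sketch of edge-selection metrics for one group (generic definitions).
edge_metrics <- function(G_hat, G_true) {
  est <- G_hat[upper.tri(G_hat)]
  tru <- G_true[upper.tri(G_true)]
  tp <- sum(est == 1 & tru == 1); fp <- sum(est == 1 & tru == 0)
  tn <- sum(est == 0 & tru == 0); fn <- sum(est == 0 & tru == 1)
  mcc <- (tp * tn - fp * fn) /
    sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))   # undefined if a factor is 0
  c(TPR = tp / (tp + fn), FPR = fp / (fp + tn), MCC = mcc)
}
```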
TABLE 3. Performance for graph and precision matrix learning in the simulation study, for all edges and for differential edges: true positive rate (TPR), false positive rate (FPR), Matthews correlation coefficient (MCC), area under the curve (AUC), Frobenius loss (Fr Loss), and number of selected edges.

All edges:

| Method | TPR | FPR | MCC | AUC | Fr Loss | # edges |
|---|---|---|---|---|---|---|
| Fused graphical lasso | 0.80 (0.01) | 0.07 (0.003) | 0.48 (0.01) | **0.97** (0.001) | 0.065 (0.001) | 461 (15.1) |
| Group graphical lasso | 0.73 (0.01) | 0.08 (0.003) | 0.40 (0.005) | 0.96 (0.001) | 0.077 (0.001) | 508 (16.3) |
| Separate estimation with mixture priors | 0.17 (0.002) | 0.0002 (3.0×10−5) | 0.40 (0.003) | 0.89 (0.001) | 0.099 (0.001) | 31 (0.5) |
| Joint estimation with mixture priors | 0.57 (0.004) | 0.03 (3.0×10−4) | 0.47 (0.003) | 0.89 (0.002) | 0.327 (0.003) | 236 (1.6) |
| Linked precision matrix approach | 0.43 (0.01) | 0.0002 (2.6×10−5) | **0.64** (0.004) | 0.95 (0.001) | **0.057** (7.4×10−4) | 77 (1.1) |

Differential edges:

| Method | TPR | FPR | MCC | AUC |
|---|---|---|---|---|
| Fused graphical lasso | 0.74 (0.01) | 0.14 (0.001) | 0.11 (0.003) | 0.24 (0.01) |
| Group graphical lasso | 0.68 (0.02) | 0.14 (0.004) | 0.10 (0.003) | 0.13 (0.004) |
| Separate estimation with mixture priors | 0.16 (0.01) | 0.01 (2.0×10−4) | 0.10 (0.01) | 0.84 (0.01) |
| Joint estimation with mixture priors | 0.53 (0.02) | 0.06 (0.001) | 0.12 (0.004) | 0.84 (0.01) |
| Linked precision matrix approach | 0.22 (0.01) | 0.003 (9.9×10−5) | **0.23** (0.019) | **0.87** (0.01) |

For MCC, AUC, and Frobenius loss (Fr Loss), the result reflecting the best performance among the methods compared is marked in bold.
Based on the results in Table 3, the proposed method is conservative in the identification of differential edges, as indicated by its fairly low sensitivity and very high specificity. The proposed method achieves both the highest MCC and AUC across methods compared. The high FPR of the lasso methods in selecting differential edges is partly due to the fact that they select a larger number of false positive edges overall, and may also reflect that they use a single penalty parameter to control cross-group similarity, which is not optimal when some groups have more similar dependence structure than others.
Finally, the proposed linked precision matrix approach provides a posterior summary of cross-group similarity through the posterior estimate of Φ. Although the entries of the estimated Φ are fairly similar across groups, we can see that groups 1 and 2, which are the most similar to each other, have a higher value in the Φ matrix.
Additional simulated scenarios with varying degrees of shared structure and edge values are included in the Supplementary Material. Results demonstrate that although the proposed method has the largest performance advantage when edge values across groups are in fact similar, it is robust to deviations from this setting, and performs similarly to separate Bayesian inference when there is no more overlap across groups than by random chance.
5 |. DISCUSSION
We have introduced a novel method for the joint analysis of multiple brain networks. The proposed approach allows flexible modeling of the cross-group relationships, resulting in relative measures of precision matrix similarity that fall in the (0,1) interval. With respect to other methods for joint estimation, the proposed method shares information not only about the presence or absence of edges between groups, but also about the strength of those connections. Building on the sampling framework of Wang (2015) has allowed the proposed method to scale up to around 100–150 variables; the posterior sampling for a data set comprising p = 100 ROIs and K = 4 groups took approximately 55 minutes for 1000 MCMC iterations in MATLAB on a laptop with a single Intel(R) Core(TM) i5-5200U CPU @ 2.20GHz and 16GB RAM. The proposed method has proven suitable for the analysis of multiple brain networks based on ROI measurements; if interest lies in larger networks, such as networks of voxels, more scalable approaches focused on point estimation, such as lasso or EM algorithms (Danaher et al., 2014; Li and McCormick, 2019), should be used.
We have applied our method to the analysis of structural data from the AIBL study on AD, with the purpose of exploring the changes in structural connectivity for different brain regions through the progression of the disease. Our method has demonstrated that the majority of structural connections are preserved across all groups. Some of our findings are consistent with the literature on structural connectivity networks in Alzheimer patients: networks are fairly sparse and a number of edges are shared across groups.
In theory, structural connectivity networks in Alzheimer’s patients do not change dramatically with disease progression. Our findings confirm this theory, and support our assumption that all networks are similar to some extent, that is, all elements of the Φ matrix are nonzero. However, from a statistical modeling perspective, it might be of interest to replace the prior given in Equation (3) with a prior that assumes sparsity of the cross-group relationships. Such an extension is nontrivial due to the combination of constraints that Φ must both be a positive-definite matrix and have all diagonal entries fixed to 1.
Supplementary Material
ACKNOWLEDGMENTS
CBP, NO, and MV are partially supported by NSF/DMS 1811568/1811445. CBP is partially supported by NIH/NCI CCSG grant P30CA016672.
Funding information
National Cancer Institute, Grant/Award Number: CCSG grant P30CA016672; National Science Foundation, Grant/Award Number: NSF/DMS 1811568/1811445
Footnotes
SUPPORTING INFORMATION
Web Appendices, Tables, and Figures referenced in Sections 3– 5 are available with this paper at the Biometrics website on Wiley Online Library, along with Matlab scripts, R code and example data designed to resemble that of our real data application, also available online at https://github.com/cbpeterson/Linked_precision_matrices.
REFERENCES
- Alexander-Bloch A, Giedd JN and Bullmore E (2013) Imaging structural co-variance between human brain regions. Nature Reviews Neuroscience, 14, 322–336.
- Barbieri M and Berger J (2004) Optimal predictive model selection. Annals of Statistics, 32, 870–897.
- Cai T, Li H, Liu W and Xie J (2015) Joint estimation of multiple high-dimensional precision matrices. Statistica Sinica, 38, 2118–2144.
- Danaher P, Wang P and Witten D (2014) The joint graphical lasso for inverse covariance estimation across multiple classes. Journal of the Royal Statistical Society Series B, 76, 373–397.
- Dawid A and Lauritzen S (1993) Hyper Markov laws in the statistical analysis of decomposable graphical models. Annals of Statistics, 21, 1272–1317.
- Dempster A (1972) Covariance selection. Biometrics, 28, 157–175.
- Friedman J, Hastie T and Tibshirani R (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics, 9, 432–441.
- Gelman A and Rubin DB (1992) Inference from iterative simulation using multiple sequences. Statistical Science, 7, 457–472.
- Giudici P and Green PJ (1999) Decomposable graphical Gaussian model determination. Biometrika, 86, 785–801.
- Guo J, Levina E, Michailidis G and Zhu J (2011) Joint estimation of multiple graphical models. Biometrika, 98, 1–15.
- He Y, Chen Z and Evans A (2008) Structural insights into aberrant topological patterns of large-scale cortical networks in Alzheimer’s disease. Journal of Neuroscience, 28(18), 4756–4766.
- Li ZR and McCormick TH (2019) An expectation conditional maximization approach for Gaussian graphical models. Journal of Computational and Graphical Statistics, 28, 1–11.
- Liu X and Daniels M (2006) A new algorithm for simulating a correlation matrix based on parameter expansion and reparameterization. Journal of Computational and Graphical Statistics, 15, 897–914.
- Liu H, Roeder K and Wasserman L (2010) Stability approach to regularization selection for high dimensional graphical models. Advances in Neural Information Processing Systems, 24, 1432–1440.
- Ma J and Michailidis G (2016) Joint structural estimation of multiple graphical models. Journal of Machine Learning Research, 17, 1–48.
- Mechelli A, Friston KJ, Frackowiak RS and Price CJ (2005) Structural covariance in the human cortex. Journal of Neuroscience, 25, 8303–8310.
- Oates C and Mukherjee S (2014) Joint structure learning of multiple non-exchangeable networks. International Conference on Artificial Intelligence and Statistics (AISTATS), 33, 687–695.
- Peterson CB, Stingo F and Vannucci M (2015) Bayesian inference of multiple Gaussian graphical models. Journal of the American Statistical Association, 110, 159–174.
- Pierson E, the GTEx Consortium, Koller D, Battle A and Mostafavi S (2015) Sharing and specificity of co-expression networks across 35 human tissues. PLOS Computational Biology, 11, e1004220.
- Querbes O, Aubry F, Pariente J, Lotterie J, Démonet J-F, Duret V, et al. (2009) Early diagnosis of Alzheimer’s disease using cortical thickness: impact of cognitive reserve. Brain, 132, 2036–2047.
- Saegusa T and Shojaie A (2016) Joint estimation of precision matrices in heterogeneous populations. Electronic Journal of Statistics, 10, 1341–1392.
- Shaddox E, Stingo FC, Peterson CB, Jacobson S, Cruickshank-Quinn C, Kechris K, Bowler R and Vannucci M (2018) A Bayesian approach for learning gene networks underlying disease severity in COPD. Statistics in Biosciences, 10, 59–85.
- Singh V, Chertkow H, Lerch JP, Evans AC, Dorr AE and Kabani NJ (2006) Spatial patterns of cortical thinning in mild cognitive impairment and Alzheimer’s disease. Brain, 129, 2885–2893.
- Wang H (2012) Bayesian graphical lasso models and efficient posterior computation. Bayesian Analysis, 7, 771–790.
- Wang H (2015) Scaling it up: stochastic search structure learning in graphical models. Bayesian Analysis, 10, 351–377.
- Wang H and Li S (2012) Efficient Gaussian graphical model determination under G-Wishart prior distributions. Electronic Journal of Statistics, 6, 168–198.
- Yao Z, Zhang Y, Lin L, Zhou Y, Xu C, Jiang T and Alzheimer’s Disease Neuroimaging Initiative (2010) Abnormal cortical networks in mild cognitive impairment and Alzheimer’s disease. PLOS Computational Biology, 6, e1001006.