Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2014 Dec 16;16(2):240–251. doi: 10.1093/biostatistics/kxu054

Detecting differential patterns of interaction in molecular pathways

Masanao Yajima 1,*, Donatello Telesca 2, Yuan Ji 3, Peter Müller 4
PMCID: PMC4441103  PMID: 25519431

Abstract

We consider statistical inference for potentially heterogeneous patterns of association characterizing the expression of bio-molecular pathways across different biologic conditions. We discuss a modeling approach based on Gaussian-directed acyclic graphs and provide computational and methodological details needed for posterior inference. Our application finds motivation in reverse phase protein array data from a study on acute myeloid leukemia, where interest centers on contrasting refractory versus relapsed patients. We illustrate the proposed method through both synthetic and case study data.

Keywords: Conditional independence, Directed acyclic graphs, Gaussian Markov models, Reversible jumps MCMC

1. Introduction

In “omics” studies, data often manifests as an Inline graphic matrix Inline graphic, which forms the empirical basis for the exploration of scientific hypotheses relating a set of Inline graphic genes, metabolites, proteins, etc., to an underlying biological process. When inference focuses on molecular interactions, bio-melecular pathways are often interpreted as describing marginal or conditional independence restrictions, compatible with a data generating mechanism for Inline graphic (see, e.g., Dobra and others, 2004 or Telesca and others, 2012). More precisely, the joint sampling distribution of Inline graphic is assumed to factor according to stochastic restrictions encoded in a graphical model Inline graphic (Lauritzen, 1996) and inference about molecular pathways is interpreted as inference about a graph Inline graphic supporting the observed data Inline graphic.

These biomedical studies are often designed to contrast characteristics of population subgroups. In this manuscript, we consider the problem of contrasting differential association structures characterizing labeled subsets of a data matrix Inline graphic. Our application finds motivation in a proteomics study of acute myeloid leukemia (AML) patients (Tibes and others, 2006). This study targets a specific proteomic pathway, thought to be associated with disease progression. Interest centers on understanding what components of this pathway behave differently when one compares refractory patients to patients experiencing relapse after standard treatment. Figure 1 shows the targeted protein expression level for AML patients, measured using a high-throughput proteomic technology called reverse-phase protein array (RPPA) (Tibes and others, 2006), along with the sample partial correlation matrix of the expression levels for both refectory and relapsed patients in the upper and lower triangle, respectively. The two sample partial correlation mostly agree with each other, yet there are clear discrepancies that may signify differing interactions mechanism (Table 1).

Fig. 1.

Fig. 1.

The observed expression levels of targeted proteins for AML patients quantified using RPPA (left) and a image plot of the sample partial correlation coefficients for refractory patients in the upper triangle and relapsed patients in the lower triangle (right).

Table 1.

The list of differential edges

Extra edges
CASP9 Inline graphic YAP, STAT3 Inline graphic YAP, SRC Inline graphic YAP, P38 Inline graphic YAP, FOXO3A Inline graphic YAP, RB Inline graphic YAP,
GSK3 Inline graphic AKT, PTEN Inline graphic GSK3, CASP9 Inline graphic GSK3, BAX Inline graphic GSK3, BCL2 Inline graphic BAD
Canceled edges
NF2 Inline graphic EGFR, PDK1 Inline graphic P53, PDK1 Inline graphic STAT3, BCL2 Inline graphic PDK1, BCL2 Inline graphic HER2,
GAB2 Inline graphic HER2, PDK1 Inline graphic HER2, P38 Inline graphic BAD, PRC Inline graphic BAX, FOXO3A Inline graphic GAB2

There is a large body of literature focused on the estimation of a single graphical model. However, limited attention has been given to the estimation of a collection of graphs accounting for data heterogeneity. Flexible models for heterogeneous association have been introduced by Ickstadt and others (2010) and Rodriguez and others (2011). These models focus on relaxing the assumption of multivariate Gaussianity, rather then testing specific contrasts in biological pathways. More in line with our inferential goals are the procedures introduced by Guo and others (2011) and Danaher and others (2011). Both manuscripts consider regularized estimation of inverse covariance matrices, using hierarchical extensions of the graphical lasso, originally proposed by Friedman and others (2008). The proposed procedures are shown to be scalable to large undirected graphs, with estimators that enjoy asymptotic consistency.

Beyond statistical exploration of structured dependence, proteomics studies, like the one motivating this manuscript, is inherently designed to test statistical hypotheses that go beyond simpler hypothesis generation exercises. This need finds available statistical techniques to be wanting in several directions. First, substantive understanding of biomolecular interactions is often based on directed graphs. Therefore, statistical inference should consider more general classes of graphical models that go beyond the estimation of structural zeros in inverse covariances. Secondly, even in the simplified setting of undirected graphs, the use of lasso-based estimation techniques, beyond exploratory analyses, does raise methodological and theoretical concerns regarding multiplicity correction and error estimation (Kyung and others, 2010). Third, gene and protein interactions have been studied for decades, with comprehensive databases providing the opportunity for informed priors on model spaces (The Gene Ontology Consortium, 2000, http://geneontology.org; The Kyoto Encyclopedia of Genes and Genomes http://genome.jp/kegg/; the BioCarta pathways, http://biocarta.com/genes/index/asp; etc.).

This manuscript addresses these issues considering a model of differential interaction based on directed acyclic graphs (DAGs). A DAG is a directed graph with no directed cycles, encoding Markov properties by d-separation (Pearl, 1986, 2000). We discuss how scientific knowledge about pathways known a priori can be incorporated in the inference through different classes of prior probabilities. Finally, we propose inference based on Bayesian model determination through decision theoretic principles aimed at controlling false discoveries.

This paper is structured as follows. In Section 2, we propose a Gaussian differential DAG model followed by computational detail in Section 3. We illustrate the application of the proposed method with a simulated example and an application to the RPPA data. We conclude the manuscript with a critical discussion in Section 5.

2. A model for differential interactions

We consider data in the form of an Inline graphic matrix Inline graphic, where the rows of Inline graphic are assumed to be indexed by two known sample subgroups and are labelled by Inline graphic, with Inline graphic. We refer to the group indexed with Inline graphic as the baseline group and the ones indexed with Inline graphic as the differential group. Within each group, we assume that the joint sampling distribution factors according to a DAG Inline graphic, Inline graphic; where Inline graphic indexes the set of Inline graphic genes or proteins and Inline graphic is the set of edges characterizing Inline graphic. Finally, the strength of conditional dependence is indexed by two parameter vectors Inline graphic and Inline graphic.

Let Inline graphic denote the collection of graphs associated with Inline graphic, the proposed joint probability model is constructed as follows:

2. (2.1)

The model includes two separate graphs, Inline graphic for the baseline sample (Inline graphic) and Inline graphic for the differential sample (Inline graphic). Our inference will focus on identifying a set of differential interactions partially indexed by the set Inline graphic. Each sub model is discussed in individual sections, as indicated by the under-braces.

2.1. Sampling model: Inline graphic

We assume that Inline graphic can be subdivided into two groups Inline graphic and Inline graphic each of size Inline graphic and Inline graphic, where Inline graphic. Given the graphical structures Inline graphic and Inline graphic, let Inline graphic denote the parent nodes of vertex Inline graphic, induced by graph Inline graphic, (Inline graphic), and Inline graphic, denote the Inline graphicth column of Inline graphic, Inline graphic. The joint likelihood is defined as

2.1.

In the multivariate Gaussian framework, we can express each of Inline graphic as a conditional linear model of the form

2.1. (2.2)

for Inline graphic, Inline graphic, and Inline graphic. Here, Inline graphic parametrizes a conditional mean value and Inline graphic is a conditional variance. In the foregoing formulation, Inline graphic indexes the strength of association between Inline graphic and Inline graphic, with the convention Inline graphic when Inline graphic. The strength of association between Inline graphic and Inline graphic is defined by Inline graphic, with Inline graphic whenever Inline graphic. In this setting, the parameter Inline graphic becomes the main quantity of interest as it directly informs the differences in association between subgroup random quantities.

Throughout the manuscript we will refer to Inline graphic and Inline graphic to index full sets of interaction coefficients and a will use a subscript to denote their column subsets. Details about how Inline graphic is used to index the differences between Inline graphic and Inline graphic and final inference about differential interactions are discussed in Section 2.2.

2.2. Priors on continuous parameters Inline graphic and Inline graphic

The strength of association between random quantities in the baseline group is parametrized through the Inline graphic coefficients. Differential parameters Inline graphic distinguish the strength of association between the baseline and differential groups. We explicitly address two inferential questions. First, are there differences in patterns of conditional dependence between baseline and differential groups? This question relates to the identification of the set Inline graphic. Secondly, when considering edges that are shared between both baseline and differential groups, are there significant differences in the way these edges define conditional dependence patters? Here, we consider the set Inline graphic, but we are specifically interested in the size of Inline graphic.

We encode these inferential goals directly in the prior. In particular, it is convenient to define latent trinary indicators Inline graphic distinguishing between three cases: (1) Inline graphic, whenever the association implied by the edge Inline graphic is invariant with group label; (2) Inline graphic, whenever Inline graphic, but Inline graphic; and (3) Inline graphic, whenever the strength of association implied by Inline graphic varies with group label, beyond the scenario of case (2). Note that Inline graphic and Inline graphic together deterministically determine Inline graphic. However, Inline graphic codes information beyond topological differences between Inline graphic and Inline graphic by also indicating common edges with varying strength across conditions.

For any configuration of Inline graphic and Inline graphic (implying also a configuration of Inline graphic), and for all Inline graphic; let Inline graphic denote the matrix including all parents of Inline graphic in Inline graphic. Similarly, let Inline graphic denote the matrix including all parents of Inline graphic in Inline graphic, for which Inline graphic. We define a differential autoregression matrix Inline graphic s.t.

2.2.

Using this notation, the conditional likelihood in (2.2) is represented as

2.2.

A conditional Zellner g-prior (Zellner, 1986) is then defined as

2.2.

Different choices of Inline graphic have been proposed in the model selection literature (Celeux and others, 2012). We consider the benchmark prior Inline graphic (Fernandez and others, 2001), where Inline graphic is the number of columns in Inline graphic. Alternatively, data-dependent choices have been discussed in Consonni and La Rocca (2012).

Given this parametrization, posterior inference over differential patterns of interaction focuses on Inline graphic, as this quantity is directly informative about the size of differences in partial correlation and about the significance of such differences, as measured by Inline graphic. Finally, dispersions Inline graphic, Inline graphic are modeled independently as a conjugate inverse gamma priors with hyper parameters Inline graphic and Inline graphic, so that Inline graphic.

2.3. Model space priors

Let Inline graphic be a prior graph derived through direct expert opinion or constructed using public databases (e.g., a DAG-derived through Gene Ontology, http://genontology.org). In order to define a prior centered on Inline graphic, we follow Telesca and others (2012) and define an exponentially decaying function centered around the prior graph Inline graphic, so that Inline graphic where the graph distance function is defined as Inline graphic. This prior depends on inclusion probabilities Inline graphic, on which we place a hierarchical beta Inline graphic prior, as suggests in Scott and Berger (2006) to control for error control in the multiple comparison problem. Marginalizing over Inline graphic the model space prior becomes Inline graphic which is strictly decaying function with respect to Inline graphic. Given the prior graph Inline graphic, we model Inline graphic and Inline graphic as partially exchangeable, thus we define the overall model space prior as Inline graphic We note that this prior becomes a simple sparsity prior when Inline graphic is empty, making it useful for de-novo analyses. Our simulation (see supplementary material available at Biostatistics online) do however outline the advantage of partially informative model space priors, even in the context of partial misspecification.

3. Posterior simulation

To obtain draws from the posterior distribution Inline graphic, we use a Gibbs sampling algorithm similar to Smith and Kohn (1996), with slight modification in the proposal strategy. The conditionally conjugate structure of priors and likelihood allows for the computation of models marginal likelihood (see Appendix D of supplementary material available at Biostatistics online for details) and efficient transitions in the model space (Eklund and Karlsson, 2007).

The proposed transition sequence extends the method proposed by Fronk and Giudici (2004). to allow for transitions over the additional structure Inline graphic. We note that, in our formulation, Inline graphic is fully determined by Inline graphic and latent components Inline graphic. It follows that systematic or random scans through the following transition sequence define an ergodic Markov chain that we can use to sample from posterior quantities of interest. In particular, we consider the following transition sequence

3.

Details about each transition are explained in the corresponding sections.

3.1. Updating the baseline DAG Inline graphic

To update Inline graphic, we select an edge Inline graphic at random, that is, using a uniform distribution over all possible edges. If Inline graphic we propose its addition to Inline graphic (birth); if Inline graphic we propose removal (death); if Inline graphic but Inline graphic then propose to remove Inline graphic and add Inline graphic (switch). This transition can be represented through changes in the differential autoregressive matrix Inline graphic consistent with the proposed Inline graphic and Inline graphic. Acceptance of a move is defined as

3.1.

The moves are deterministic except for the case when Inline graphic but Inline graphic where we propose Inline graphic as 0 or 2 with equal probability. For the list of all possible moves, see Table 3 of supplementary material available at Biostatistics online. Computational efficiency can be gained by pre-computing Inline graphic and Inline graphic for saturated models. Details are discussed in Eklund and Karlsson (2007).

3.2. Updating the differential model space through latent indicators Inline graphic

Given the baseline graph Inline graphic, we propose to move over the differential model space updating the latent variables Inline graphic. Updates in the state of Inline graphic will also define changes in Inline graphic. We select an edge Inline graphic at random. Depending on the current state of Inline graphic and Inline graphic, we consider the proposal transitions summarized in Table 2. The proposed extension on simple birth, death, and switch moves allows for transitions that do not alter the structure of Inline graphic but change the magnitude of association (moveInline graphic4,7). As before, all moves define a new differential autoregression matrix Inline graphic. The acceptance ratio is defined as

3.2.

Table 2.

Proposal transition scheme for exploration of the differential model space to update Inline graphic

Inline graphic Current Inline graphic Proposed Inline graphic Probability Move # Type
Inline graphic Inline graphic Inline graphic 1 1 Birth
Inline graphic Inline graphic 1 2 Death
Inline graphic Inline graphic Inline graphic 3 Death
Inline graphic Inline graphic 4 Differential
Inline graphic Inline graphic Inline graphic Inline graphic 5 Birth
Inline graphic Inline graphic 6 Birth
Inline graphic Inline graphic Inline graphic 7 Differential
Inline graphic Inline graphic 8 Death

The transition probabilities Inline graphic through Inline graphic include four pairs of moves that are each other's inverse: Inline graphic and Inline graphic.

3.3. Updating Inline graphic and Inline graphic

Given Inline graphic and Inline graphic, sampling Inline graphic from their marginal posterior distributions and Inline graphic, Inline graphic, and Inline graphic given Inline graphic from their conditional posterior distributions is achieved through direct simulation. Detailed calculations are reported in Appendix of supplementary material available at Biostatistics online.

3.4. Posterior summaries

Posterior probabilities Inline graphic, Inline graphic and corresponding MCMC samples characterize our knowledge about baseline and differential interactions in light of the data. Based on these quantities, the main inferential goal is to select representative baseline and differential graphs, say Inline graphic and Inline graphic. While posterior probabilities do summarize evidence about interaction structures, selecting a point estimate in these model spaces requires further consideration.

Given a joint model on edge and parameter inclusion probabilities, in the Bayesian framework, selection of point estimators for interaction structures Inline graphic and Inline graphic usually translates into the appropriate definition of a cutoff value for posterior inclusion probabilities (Scott and Berger, 2006; Müller and others, 2006). A cutoff threshold is often determined in order to ensure optimization of a chosen loss function.

An alternative operational strategy is to select a point estimator, on the basis of multiple comparison arguments. An often used error rate is the false discovery rate (FDR) (Benjamini and Hochberg, 1995). Rules as discussed in Benjamini and Hochberg (1995) control the frequentist expectation of the error rate across repeat experimentation. Several authors chose instead to control the posterior expectation of the same error rate (see, e.g., Newton, 2004). From a Bayesian perspective, Müller and others (2006) illustrate a decision-theoretic procedure aimed at minimizing the posterior expected false negative rate (Inline graphic), while controlling for the posterior expected Inline graphic at a level Inline graphic. The rest of this article is based on results obtained under Bayesian FDR control proposed by Müller and others (2006). Details, along with simulations, can be found in supplementary material available at Biostatistics online. Our studies show that the procedure controls for empirical FDR and FNR at the specified error rate.

4. Case study: acute myeloid leukemia

We apply our model to the data from a study of acute myeloid leukemia (AML) obtained using the RPPA (Tibes and others, 2006). RPPA is a high-throughput proteomic technology that provides a quantification of the expression for specifically targeted proteins selected from molecular pathways. Our study is based on RPPA of bone marrow specimens collected from 435 newly diagnosed AML patients; 332 primary refractory patients and 103 relapsed patients. We will call the refractory patients the baseline group and the relapsed patients the differential group.

Scientific interest centers on the identification of differential signal transduction pathways (STPs), which are suspected to contribute to leukemogenesis by perturbing the rates of proliferation, differentiation, and apoptosis (Kornblau and others, 2006). The objective of this study is to investigate the difference in interactions of important protein markers related to AML for the refractory patients and the relapsed patients. We selected 29 proteins in signal transduction, apoptosis, and cell cycle regulatory pathways and studied their expression profiles in all 435 samples. An attractive feature of the AML data under study is that the number of samples Inline graphic is much greater than the number of proteins Inline graphic, which provides an opportunity for principled inference about differential interaction structures on the basis of a highly structured stochastic system.

The prior signaling network was developed based on published articles from PubMed searches as well as from the connections map in Signal Transduction Knowledge Environment (http://www.stke.org). Other prior distributions on the parameters were selected as vague as possible to show that it is suitable for initial studies since the likelihood will dominate the posterior when the sample size is large. The two parameters of the dispersion parameter Inline graphic where set to 0.5 and 0.5. The prior on Inline graphic is set to beta Inline graphic. We ran our algorithm for 2 million iteration saving every 100th sample while discarding the initial 1 million samples.

For the decision rule, we employed the criteria of Bayesian FDR control method proposed by Müller and others (2006) that simultaneously controls for the expected posterior FDRs and FNRs. We control the expected posterior FDR at 1% for both baseline and differential associations separately.

Figure 2 is a network representation of the estimated graph in the form of essential graph (Chickering, 2002) for the refractory and relapsed patients. The bottom three figures show edges that patients in two conditions agree on, that exists only in the refractory patients and that exists only in the relapsed patients. A solid line indicates a positive coefficient estimate and a dotted line indicates a negative coefficient.

Fig. 2.

Fig. 2.

Network representation of the estimated protein network for refectory patients and relapsed patients for associations chosen at Inline graphic level of 0.01 using the decision criteria of Müller and others (2006). The positive associations are shown as a solid line and negative associations are shown as a dotted line. The bottom three plots classify the edges into three categories: the edges that two groups agree on, the edges that do not exist in the differential graph, and edges that only exist in the differential graph.

While we maintain that our findings are exploratory, we have observed that selected differential interaction patterns have been confirmed in the literature as potential indicators of more aggressive forms of AML. For example, we find a the striking differential role of the YAP protein (Yes-associated protein) as a hub node in the relapse sample. This protein plays a key role in the Hippo pathway, usually associated with the regulation of proliferation and apoptosis. While the role of Hippo has been established in solid tumors, only recent reports (Kornblau and others, 2013) associate YAP with differential prognostics in AML.

Figure 3 is the estimated posterior inclusion probability. The figures for the estimated mixing proportion and the posterior density plot of the coefficients can also be found in supplementary material available at Biostatistics online. A comprehensive bio-medical interpretation of our findings is perhaps out of the scope or this paper, but it is our hope that our illustration shows the potential and practical relevance of the proposed method.

Fig. 3.

Fig. 3.

Barplot of the estimated edge inclusion probability for the refractory patients (left) and the relapsed patients (right) for each edge. Rows/columns correspond to the 29 proteins AKT(1), AMPKa(2), BAD(3), BAK(4), BAX(5), BCL2(6), EGFR(7), FOXO3A(8), CASP9(9), CateninB(10), GAB2(11), GSK3(12), ERK2(13), HER2(14), MEK(15), MCL1(16), NF2(17), P38(18), P53(19), P70S6K(20), PDK1(21), PTEN(22), RB(23), S6RP(24), SRC(25), STAT1(26), STAT3(27), STAT5(28), YAP(29).

To validate our modeling approach, we compared our proposal with a model ignoring sample labeling and fitting a Gaussian DAG model without differential interactions. The result showed that the difference in average log marginal likelihood was 325.19 and the difference in log conditional predictive ordinate was 75.04, both suggesting strong evidence for the differential association.

5. Discussion

We proposed a probability model for inference on differential interaction in Gaussian DAGs. The proposed framework is likely to be particularly useful when primary interest focuses on potential contrasts characterizing the association structure between known subgroups of a given sample. Although we worked on a case where there are only two subgroups, the method is directly generalizable to the case of Inline graphic subgroups. Monte Carlo studies (supplementary material available at Biostatistics online) show the model to be robust against prior graph misspecification. Inference yields well-calibrated empirical FDR and FNR scaling to larger graphs and sample sizes. We evaluated our method analyzing data generated from a synthetic experiment and showed that our inferences have desirable operative characteristics (see supplementary material available at Biostatistics online for details). The application of the proposed model to the analysis of RPPA data in AML identified interesting differential regulation patterns, distinguishing refractory from relapsed patients. While we are well aware that our model belongs to the class of hypothesis generation tools, we remark that the proposed methodology avoids the use of step-wise analyses and ad hoc penalization choices, providing a principled tool for inference on differential networks.

Although we avoided all the costly matrix inversion steps in our computation, when dealing with large and dense graphs, the sheer size of the space to explore could make the computation challenging. Alternative strategies may be necessary for these large problems.

The proposed framework of differential network inference could be extended beyond the multivariate Gaussian distribution. Models space and interaction parameters could, for example, be applied to the approach of Telesca and others (2012), who show how to incorporate heavy tails in the observations by the use of a mixture model. As for the case of discrete and mixed data, the copula Gaussian graphical model framework proposed by Dobra and Lenkoski (2011) could be easily expanded using a modeling strategy similar to the one proposed in this paper.

Extension beyond DAGs may be desirable in many applied settings. For example, in the setting of Reciprocal Graphs (Koster, 1996), used in Telesca and others (2011) one may allow baseline and differential models, to be defined in terms of undirected edges as well as the directed ones, with the possibility of including cycles and reciprocal relations. We should also point out that the same idea could of course be applied to undirected graphical models. While these extension are conceptually trivial, coherent multivariate representation and computational constraints may require extensive additional work.

Supplementary material

Supplementary Material is available at http://biostatistics.oxfordjournals.org.

Funding

Yuan Ji's research is in part supported by NIH R01 CA132897. Peter Mueller's research is in part supported by NIH R01 CA132897 and NIH/NCI R01CA075981. Donatello Telesca's research was in part supported by NIH grant R01 CA158113-01.

Supplementary Material

Supplementary Data

Acknowledgement

The authors acknowledge two anonymous referees for their important feedback on the original manuscript. Conflict of Interest: None declared.

References

  1. Benjamini Y., Hochberg Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological) 57(1), 289–300. [Google Scholar]
  2. Celeux G., El Anbari M., Marin J. M., Robert C. P. (2012). Regularization in regression: comparing Bayesian and frequentist methods in a poorly informative situation. Bayesian Analysis 7(2), 477–502. [Google Scholar]
  3. Chickering D. M. (2002). Learning equivalence classes of Bayesian-network structures. The Journal of Machine Learning Research 2, 445–498. [Google Scholar]
  4. Consonni G., La Rocca L. (2012). Objective Bayes factors for Gaussian directed acyclic graphical models. Scandinavian Journal of Statistics 39(4), 743–756. [Google Scholar]
  5. Danaher P., Wang P., Witten D. M. (2011). The joint graphical lasso for inverse covariance estimation across multiple classes. arXiv preprint arXiv:1111.0324. [DOI] [PMC free article] [PubMed]
  6. Dobra A., Hans C., Jones B., Nevins J. R., Yao G., West M. (2004). Sparse graphical models for exploring gene expression data. Journal of Multivariate Analysis 90(1), 196–212. [Google Scholar]
  7. Dobra A., Lenkoski A. (2011). Copula Gaussian graphical models and their application to modeling functional disability data. The Annals of Applied Statistics 5(2A), 969–993. [Google Scholar]
  8. Eklund J., Karlsson S. (2007). Computational Efficiency in Bayesian Model and Variable Selection. Central Bank of Iceland, Economics Department. [Google Scholar]
  9. Fernandez C., Ley E., Steel M. F. J. (2001). Benchmark priors for Bayesian model averaging. Journal of Econometrics 100(2), 381–427. [Google Scholar]
  10. Friedman J. H., Hastie T. J., Tibshirani R. (2008). Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9, 432–441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Fronk E.-M., Giudici P. (2004). Markov Chain Monte Carlo model selection for DAG models. Statistical Methods & Applications 13, 259–273. [Google Scholar]
  12. Guo J., Levina E., Michailidis G., Zhu J. (2011). Joint estimation of multiple graphical models. Biometrika 98(1), 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Ickstadt K., Bornkamp B., Grzegorczyk M., Wieczorek J., Sheriff M., Grecco H., Zamir E. (2010). Nonparametric Bayesian networks. Bayesian Statistics 9, 283–316. [Google Scholar]
  14. Kornblau S. M., Hu C. W., Qiu Y. H., Yoo S. Y., Zhang N., Kadia T. M., Ferrajoli A., Coombes K. R., Qutub A. A. (2013). Hippo pathway (hp) activity in acute myelogenous leukemia (aml): Different prognostic implications of taz versus yap inactivation by phosphorylation. Blood 15(21), 1337. [Google Scholar]
  15. Kornblau S. M., Womble M., Qiu Y. H., Jackson C. E., Chen W., Konopleva M., Estey E. H., Andreeff M. (2006). Simultaneous activation of multiple signal transduction pathways confers poor prognosis in acute myelogenous leukemia. Blood 108(7), 2358–2365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Koster J. T. A. (1996). Markov properties of nonrecursive causal models. The Annals of Statistics 24(5), 2148–2177. [Google Scholar]
  17. Kyung M., Gill J., Ghosh M., Casella G. (2010). Penalized regression, standard errors, and Bayesian lassos. Bayesian Analysis 5(2), 369–411. [Google Scholar]
  18. Lauritzen S. L. (1996). Graphical Models, Volume 17 USA: Oxford University Press. [Google Scholar]
  19. Müller P., Parmigiani G., Rice K. (2006) FDR and Bayesian multiple comparisons rules. Johns Hopkins University, Department of Biostatistics; Working Papers, 115. [Google Scholar]
  20. Newton M. A. (2004). Detecting differential gene expression with a semiparametric hierarchical mixture method. Biostatistics (Oxford) 5(2), 155–176. [DOI] [PubMed] [Google Scholar]
  21. Pearl J. (1986). Fusion, propagation, and structuring in belief networks. Artificial Intelligence 29(3), 241–288. [Google Scholar]
  22. Pearl J. (2000). Causality: Models, Reasoning and Inference. The Edinburgh Building, Cambridge CB2 2RU, UK: Cambridge University Press. [Google Scholar]
  23. Rodriguez A., Lenkoski A., Dobra A. (2011). Sparse covariance estimation in heterogeneous samples. Electronic Journal of Statistics 5, 981–1014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Scott J. G., Berger J. O. (2006). An exploration of aspects of Bayesian multiple testing. Journal of Statistical Planning and Inference 136(7), 2144–2162. [Google Scholar]
  25. Smith M., Kohn R. (1996). Nonparametric regression using bayesian variable selection. Journal of Econometrics 75(2), 317–343. [Google Scholar]
  26. Telesca D., Muller P., Kornblau S., Suchard M., Ji Y. (2011). Modeling protein expression and protein signaling pathways. COBRA Preprint Series, 87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Telesca D., Müller P., Parmigiani G., Freedman R. S. (2012). Modeling dependent gene expression. The Annals of Applied Statistics 6(2), 542–560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Tibes R., Qiu Y., Lu Y., Hennessy B., Andreeff M., Mills G. B., Kornblau S. M. (2006). Reverse phase protein array: Validation of a novel proteomic technology and utility for analysis of primary leukemia specimens and hematopoietic stem cells. Molecular Cancer Therapeutics 5(10), 2512–2521. [DOI] [PubMed] [Google Scholar]
  29. Zellner A. (1986). On assessing prior distributions and Bayesian regression analysis with g-prior distributions. Bayesian Inference and Decision Techniques: Essays in Honor of Bruno de Finetti. Amsterdam: North-Holland, pp. 233–243. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES