Bayesian meta-analysis models for microarray data: a comparative study

Erin M Conlon; Joon J Song; Anna Liu

doi:10.1186/1471-2105-8-80

. 2007 Mar 7;8:80. doi: 10.1186/1471-2105-8-80

Bayesian meta-analysis models for microarray data: a comparative study

Erin M Conlon ^1,^✉, Joon J Song ², Anna Liu ¹

PMCID: PMC1851021 PMID: 17343745

Abstract

Background

With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods.

Results

Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets were pre-scaled.

Conclusion

The Bayesian meta-analysis model that combines probabilities across studies does not aggregate gene expression measures, thus an inter-study variability parameter is not included in the model. This results in a simpler modeling approach than aggregating expression measures, which accounts for variability across studies. The probability integration model identified more true discovered genes and fewer true omitted genes than combining expression measures, for our data sets.

Background

Due to the growing accumulation of publicly available microarray data, it is increasingly important to develop methods to integrate findings across studies. Combining results will increase sample sizes and thus the power to detect differentially expressed genes. While meta-analysis has been used extensively in medical and public health applications, it has only recently been developed for microarray studies [1-15]. The two primary methods for data integration consist of either combining gene expression measures across studies or combining summary measures of expression such as p-values, probabilities or ranks. In the first approach, Wang et al. [1] used a weighted average procedure to combine standardized mean expression differences across three independent studies. Through this method they identified genes that were consistently differentially expressed between leukemia and normal B cells. Choi et al. [2] and Stevens and Doerge [3] combined standardized gene effects into an overall mean effect using statistical modeling. The models accounted for different sources of variation in microarray studies, including differences between studies. Since inter-study variability was assumed to be constant for each gene, the uncertainty of this parameter was not included in the subsequent analyses (this refers only to the non-Bayesian model of Choi et al. [2]). Hu et al. [4] extended the method of Choi et al. [2] by incorporating an individual study quality index for each gene into the effect size estimate. The authors combined two lung cancer data sets and demonstrated that their method identified more differentially expressed genes than previous analyses. Other investigators combined "raw" gene expression data rather than gene effects when the data was comparable across studies. Examples include Morris et al. [5], which combined Affymetrix studies by creating new probesets across arrays, and Park et al. [6], which integrated log-expression ratios in an ANOVA meta-analysis model for cDNA microarrays.

Due to the difficulty in comparing cross-laboratory and cross-platform expression measures, several microarray meta-analysis methods have combined summary measures of expression rather than expression measures themselves. Rhodes et al. [7] calculated p-values in individual lung cancer studies and aggregated the p-values to provide an overall estimate of gene significance. Parmigiani et al. [9] introduced an integrative correlation approach that identified genes with consistent expression patterns across multiple platforms. Other approaches convert gene expression values within each study to rank orderings or probabilities of expression. The transformed data is then aggregated across studies to identify disease marker genes or prognostic signatures (Shen et al. [10], Xu et al. [11], Warnat et al. [12]).

Recently, Bayesian meta-analysis models have been introduced that are analogous to the classical methods described above. Bayesian approaches have several advantages over traditional methods and have been used widely in individual microarray studies [16-18]; in particular the discrete mixture model has been developed extensively [19-30]. Bayesian models are well-suited to the small sample sizes of microarray studies since they borrow information from all genes to estimate model parameters. They also provide a framework for incorporating all available information in a systematic manner, and explicitly include model and parameter variability. A third important benefit of the Bayesian approach is that a predictive distribution for future data is produced (Stangl and Berry [31], Tweedie et al. [32]). Bayesian models can also handle the large amounts of missing data inherent in microarray studies relatively easily.

The two primary Bayesian approaches to meta-analysis for microarray studies correspond to the traditional approaches: one combines standardized gene effects and the other combines probabilities, as follows. Choi et al. [2] introduced the first Bayesian meta-analysis model for microarray data, which integrated standardized gene effects in individual studies into an overall mean effect. Inter-study variability was included in the model with an associated uninformative prior distribution. This type of model, termed hierarchical Bayesian random effects, has been used broadly in non-microarray contexts (see, for example, DuMouchel and Harris [33], Smith et al. [34], Tweedie et al. [32], Normand [35], DuMouchel and Normand [36], Pauler and Wakefield [37], Sargent et al. [38], Gelman et al. [39]). The hierarchical Bayesian random effects meta-analysis model has several favorable features: it provides an overall effect size among all studies, and it accounts for inter-study variability, which may improve accuracy of results. However, many microarray meta-analyses include a small number of studies, e.g. between two and four studies [1,2,4-7,9-11,13-15]. Due to these typical small study numbers, the inter-study variability is challenging to model. An uninformative prior distribution for this parameter may not provide enough information, and thus more informative priors need to be considered. Alternatively, Conlon et al. [15] introduced a Bayesian meta-analysis model that combined study probabilities rather than gene effect levels, eliminating the need to estimate inter-study variability. The model produced the overall gene-specific posterior probability of differential expression while incorporating several sources of data replication. Here, we compare two Bayesian meta-analysis models: the standardized expression integration model and the probability integration model of Conlon et al. [15]. The standardized expression integration model is similar in approach to hierarchical Bayesian random effects models (such as that of Choi et al. [2]) in that mean values are combined across studies to provide an overall mean value, with inter-study variability included as a parameter in the model with an associated prior distribution. However, instead of combining effect sizes as in Choi et al., we combine standardized gene expression values, since our data is assumed to be from the same platform and comparable across studies (for further detail, see Background: Background on effect sizes and standardized gene expression levels). In simulations, we illustrate that combining probabilities improves performance versus combining standardized gene expression levels based on three comparison measures: the increase in true discovered genes in meta-analysis versus separate studies, the decrease in true omitted genes in meta-analysis versus individual studies, and the number of true genes identified for fixed levels of Bayesian false discovery. Our findings are similar when analyzing biological data in two studies of Bacillus (B.) subtilis.

While many meta-analytic methods incorporate data across multiple microarray platforms, several recent reports have shown the difficulty in such approaches. Kuo et al. [40] and Jarvinen et al. [41] both worked with cell lines and concluded that combining data across cDNA and oligonucleotide platforms was not reliable. Mah et al. [42] found that there is only moderate overlap in gene expression levels between cDNA and oligonucleotide platforms. Due to these findings, we focus on meta-analysis for one platform, cDNA microarrays.

Background on effect sizes and standardized gene expression levels

Traditional random effects models and hierarchical Bayesian random effects models first summarize the data from two conditions into study-specific mean effects and associated variances; the models use these summary effect measures rather than the individual data values (i.e. "raw" data). For instance, the hierarchical Bayesian random effects model of Choi et al. [2] used the well-studied estimator of Hedges and Olkin [43], which estimates the effect size as the difference of the means of two groups divided by the pooled standard deviation. The estimator is sample-size adjusted to obtain an unbiased effect size; the associated variance is also based on Hedges and Olkin's work. However, Hedges and Olkin's method requires two separate groups of data with only one level of replication, which is not always available. For one sample data with replicate slides within repeated experiments, the Hedges and Olkin effect size estimates do not apply. For these reasons, for our one-sample cDNA microarray studies with multiple sources of replication, we use the standardized logarithms of the red/green (Cy5/Cy3) fluorescent intensity ratios in our meta-analysis models. We use log-ratios since they create more symmetric distributions and stabilize the variances (Dudoit et al. [44]). The log-expression ratios are standardized so that each slide has zero mean and unit standard deviation (see also Shen et al. [10] and Dominici and Parmigiani [45]). For further background on cDNA microarrays, see [46-49].

Results and discussion

Bayesian standardized expression integration meta-analysis model

Researchers often conduct multiple independent microarray studies for the same biological system. For example, Eichenberger et al. [50] designed two studies to identify genes under the control of a primary transcription factor sigma E (σ^E) in B. subtilis. The first study was a mutation of σ^Eand the second was an overexpression of σ^E(referred to as the mutant and induction studies, respectively; for details of the data sets, see Methods: Biological data). Thus, genes that were overexpressed in one study should be underexpressed in the other. Combining results of the two studies in a meta-analysis will increase sample size and help more precisely identify target genes. In the Bayesian standardized expression integration model, expression levels are smoothed across studies to produce an overall expression measure for each gene. This model assumes that the standardized expression means are not the same in each study, but that each study-specific mean is a random sample from a common population distribution. Inter-study variability is included as a parameter in the model. More specifically,

$\begin{array}{l} y_{j g s e} | ξ_{j g e} & ~ & N (ξ_{j g e}, φ_{j g}^{2}), j = 1, ..., J; g = 1, ..., G; s = 1, ..., S_{e}; e = 1, ..., E \\ ξ_{j g e} | θ_{j g} & ~ & N (θ_{j g}, σ_{j g}^{2}), j = 1, ..., J; g = 1, ..., G; e = 1, ..., E \\ θ_{j g} | μ_{g} & ~ & N (μ_{g}, τ_{g}^{2}), j = 1, ..., J; g = 1, ..., G \\ μ_{g} | I_{g} = 0 & ~ & N (0, η_{g 0}^{2}) \\ μ_{g} | I_{g} = 1 & ~ & N (0, c \times η_{g 0}^{2}) \\ I_{g} & ~ & Bernoulli (p) \\ p & ~ & Uniform (0, 1), \end{array} (1)$

for J independent studies. Here, y_jgseare the observed microarray data, and are the normalized log-expression ratios for study j, gene g, slide s, and experiment e. For cDNA microarrays, the expression ratio is the ratio of fluorescent intensity levels for the red and green-labeled mRNA (Cy5 and Cy3) samples, or treatment and control. We standardized the y_jgsevalues so that each slide has zero mean and unit standard deviation (see also [10,45]). In this model, we take into account that the y_jgseare affected by variability due to slides, experiments (cultures) and studies. In individual studies, y_jgseis a sample from a normal distribution of the slide values within the same experiment for gene g. We represent this in the model as y_jgse~ N(ξ_jge, $φ_{j g}^{2}$ ) where ξ_jgeis the mean among all slide values of an experiment for gene g. The parameter $φ_{j g}^{2}$ represents the variability of the slide value distribution for each gene. The within-experiment mean ξ_jgeis a sampling from a normal distribution of experiment values; we denote this as ξ_jge~ N(θ_jg, $σ_{j g}^{2}$ ). Here, θ_jgis the study-specific mean log-expression ratio of gene g, and $σ_{j g}^{2}$ represents the variability across experiments. In turn, the study-specific mean θ_jgis a sampling from a normal distribution of study values. Thus, θ_jg~ N(μ_g, $τ_{g}^{2}$ ), where μ_grepresents the overall mean log-expression ratio across studies, and $τ_{g}^{2}$ is the variability across studies. The μ_gvalues are assumed to be normally distributed with a mean of zero with a small variance for non-differentially expressed genes, and with a large variance for differentially expressed genes. Note that only y_jgsevalues are observed data, while the remaining parameters are unobserved. Based on the overall mean value μ_g, the model produces the posterior distribution for I_g, which is used to calculate the probability of differential expression D_g= Prob(I_g= 1 | data). More specifically, I_g~ Bernoulli(p) is the indicator variable for differential expression of gene g corresponding to μ_g≠ 0, and p is the fraction of expressed genes. Thus, Prob(I_g= 1) = p, where

$I_{g} = {\begin{matrix} 0 if μ_{g} = 0 \\ 1 if μ_{g} \neq 0 \end{matrix}$

With this model, genes are divided into two groups, differentially expressed (I_g= 1) and non-expressed (I_g= 0), with probabilities p and (1-p), respectively. For each gene, the posterior probability of differential expression over all studies, D_g= Prob(I_g= 1 | data), is produced, and we compare results based on D_g. In assigning prior distributions, when I_g= 0, the μ_gare assumed to be normally distributed with mean zero and small variance $η_{g 0}^{2}$ ; when I_g= 1, the μ_gare assumed to be normally distributed with mean zero and large variance (c × $η_{g 0}^{2}$ ).

The inter-study variability parameter $τ_{g}^{2}$ influences the results to a large degree and requires careful consideration. We detail several specifications for the prior distribution of $τ_{g}^{2}$ in the sections: Two-study simulation results; and Methods: Standardized expression integration model: prior distributions for inter-study variation.

We assign conjugate scaled inverse chi-squared prior distributions to the slide and experiment variation parameters, $φ_{j g}^{2}$ and $σ_{j g}^{2}$ . The scale parameters are derived from the data, by pooling information from all genes (similar to Tseng et al. [17], Gottardo et al. [23], Lönnstedt and Speed [24]; see Methods). The prior framework for a single study is similar to Gottardo et al. [23] except that we calculate a posterior distribution for p rather than using an iterative algorithm to estimate p. Our data also has one more level of replication than that of Gottardo et al. [23], i.e. repeated slides within repeated experiments. Our hierarchical Gaussian structure for one study also resembles the Bayesian ANOVA models (BAM) of Ishwaran and Rao [29,30]. BAM restructures the problem of identifying overexpressed genes as a variable selection procedure and uses a Bayesian model designed toward selective shrinkage. The difference between our model in a single study context and BAM is that we have one-sample data, while BAM models are tailored to a two-sample format; our data also has more levels of replication.

Each study is assumed to contain only two conditions: a treatment and a control. We simulate posterior distributions for each parameter using a Markov chain Monte Carlo (MCMC) implementation of the model [51]. See the Methods section for more details of the prior distributions and the MCMC procedure.

Bayesian probability integration meta-analysis model

The Bayesian model to combine probabilities was introduced in Conlon et al. [15] and is similar to the Bayesian standardized expression integration model, except that the mean expression levels of each study are not combined and thus inter-study variability is not modeled. We call this the probability integration model because, for each gene, it calculates the overall probability of differential expression given the data of each separate study; this effectively smoothes the probability of differential expression across studies. This differs from the standardized expression integration model, which first calculates an overall mean gene expression measure and determines the probability of differential expression given the estimated mean. We specify the probability integration model as follows:

$\begin{array}{l} y_{j g s e} | ξ_{j g e} & ~ & N (ξ_{j g e}, φ_{j g}^{2}), j = 1, ..., J; g = 1, ..., G; e = 1, ..., E; s = 1, ..., S_{e} \\ ξ_{j g e} | θ_{j g} & ~ & N (θ_{j g}, σ_{j g}^{2}), j = 1, ..., J; g = 1, ..., G; e = 1, ..., E \\ θ_{j g} | I_{g} = 0 & ~ & N (0, η_{j g 0}^{2}) \\ θ_{j g} | I_{g} = 1 & ~ & N (0, c_{j} \times η_{j g 0}^{2}) \\ I_{g} & ~ & Bernoulli (p) \\ p & ~ & Uniform (0, 1) . \end{array} (2)$

The parameters common to both Models (1) and (2) are as defined for Model (1) (see also Conlon et al. [15]). The y_jgseare again the observed microarray data, and are the normalized log-expression ratios for study j, gene g, slide s, and experiment e. In this model, we again take into account that the y_jgseare affected by variability due to slides and experiments, and the y_jgsevalues are standardized to have zero mean and unit standard deviation. The θ_jgis again the study-specific mean log-expression ratio of gene g. However, rather than modeling the study-specific mean θ_jgas a sampling from a normal distribution of study values, this model treats each study separately, and does not combine the mean values from each study into one mean value. Thus, an overall mean value μ_gis not produced for this model. The gene-specific posterior probability of differential expression is again produced; however, this probability is based upon the separate mean levels of each study rather than an overall mean level, as in Model (1). Note that the differences between Model (1) and Model (2) occur at the inter-study level; the models are the same for a single study. The MCMC implementation is similar to that for Model (1), with details in the Methods section. We compare Models (1) and (2) using integration-driven discovery rates, integration-driven revision rates and Bayesian false discovery rates, defined in the following.

Integration-driven discovery and revision

Choi et al. [2] define the integration-driven discovery rate (IDR) as the proportion of genes that are identified as differentially expressed in the meta-analysis that were not identified in any of the individual studies alone. IDR represents the increase in information based on combining studies versus individual studies. For our models, for a given threshold of posterior probability of differential expression, γ, genes are identified as differentially expressed if (D_g≥ γ). The IDR is defined as the percent of differentially expressed genes in the meta-analysis that are not differentially expressed in any of the individual analyses:

$IDR (γ) = \frac{# genes [(D_{g} \geq γ) in meta-analysis] and [(D_{g} < γ) in all individual studies]}{# genes [(D_{g} \geq γ) in meta-analysis]} .$

In simulations, we define true genes as genes that were simulated to be differentially expressed. The corresponding true integration-driven discovery rate, tIDR, is the percent of true genes discovered in the meta-analysis that were not discovered in any of the individual analyses:

$t IDR (γ) = \frac{# true genes [(D_{g} \geq γ) in meta-analysis] and [(D_{g} < γ) in all individual studies]}{# true genes [(D_{g} \geq γ) in meta-analysis]} .$

Stevens and Doerge [3] define the integration-driven revision rate (IRR) as the percent of genes that are declared differentially expressed in individual studies but not in meta-analysis. IRR represents the genes that are missed or "dropped" in meta-analysis versus separate study analyses:

$IRR (γ) = \frac{# genes [(D_{g} \geq γ) in at least one individual study] and [(D_{g} < γ) in meta-analysis]}{# genes [(D_{g} \geq γ) in at least one individual study]} .$

The corresponding true integration-driven revision rate, tIRR, is the percent of true genes that are identified as differentially expressed in at least one individual study but not in meta-analysis:

$t IRR (γ) = \frac{# true genes [(D_{g} \geq γ) in at least one individual study] and [(D_{g} < γ) in meta-analysis]}{# true genes [(D_{g} \geq γ) in at least one individual study]} .$

Bayesian false discovery rate

The false discovery rate (FDR) was introduced by Benjamini and Hochberg [52], and is defined as the expected number of discovered genes that are not truly differentially expressed divided by the total number of discovered genes. Further discussion and application of FDR in a microarray context include Tusher et al. [53], Storey [54], Storey and Tibshirani [55] and Genovese and Wasserman [56]. In a Bayesian approach, Genovese and Wasserman [57] defined the posterior expected FDR (peFDR) as:

$p e FDR = E (FDR | Y) = \frac{\sum_{g} (1 - D_{g}) δ_{g}}{\sum_{g} δ_{g}},$

where δ_gis an indicator for differential expression and Y represents the data (see also Do et al. [28]).

Two-study simulation results

We simulated data for two studies, Study 1 and Study 2, with 3,000 genes and a format similar to the B. subtilis mutant and induction studies. We used three levels for the percent of differentially expressed genes (denoted p_sto indicate simulated), p_s= 5%, 10%, 25%, and three mean levels of inter-study variation $τ_{g}^{2}$ . We refer to the mean levels of inter-study variability as low, medium and high, based on comparison to the biological data, as follows. Each level of $τ_{g}^{2}$ was simulated from a Normal distribution with the following means: low variability: mean $τ_{g}^{2}$ = 0.1 for differentially expressed and mean $τ_{g}^{2}$ = 0.01 for non-expressed genes; medium: mean $τ_{g}^{2}$ = 0.3 and 0.03; and high: mean $τ_{g}^{2}$ = 0.7 and 0.07, for expressed and non-expressed genes, respectively. The variances of the Normal distributions were equivalent to the biological data (for more detail, see Methods). Again, we refer to genes that were simulated to be differentially expressed as true genes. Each array was standardized to have mean zero and unit standard deviation.

Bayesian standardized expression integration model: modeling inter-study variation

For the Bayesian standardized expression integration meta-analysis model, the prior distribution assigned to $τ_{g}^{2}$ has a large influence on the results. We considered three prior distributions with extensive use in hierarchical Bayesian meta-analytic models: inverse-gamma prior distributions (see, for example, Choi et al. [2], Smith et al. [34], Normand [35], Sargent et al. [38]), log-logistic (DuMouchel and Normand [36]), and locally uniform (Gelman et al. [39]). See Methods for a detailed description of these prior assignments. Briefly, the informative inverse-gamma and locally uniform distributions pooled information from sets of genes to provide better estimates of inter-study variability. The log-logistic prior distribution treated each gene separately without pooling information from sets of genes, and was a function of the weighted average of the sampling variabilities across studies for each gene. The log-logistic distribution prevents the meta-analysis results from being overly influenced by studies with large sampling variability. Of the prior specifications, only the locally uniform prior improved upon individual study analyses. In particular, this prior distribution was centered at the median of the inter-study variability based on the data, with separate distributions for the differentially expressed and non-expressed genes (see Methods). We show results hereafter based on this prior distribution.

Standardized expression integration versus probability integration model

We implemented the Bayesian standardized expression integration model (SEI hereafter, Model (1)) and the Bayesian probability integration model (PI hereafter, Model (2)) to combine the two simulated studies for the three levels of percent differentially expressed genes and three levels of inter-study variation. We also analyzed each study individually. Note again that in individual studies, the SEI and PI models are equivalent, i.e. the only differences between the SEI and PI models are at the inter-study level. In order to compare the SEI and PI models, we calculated for each model the true integration-driven discovery rate (tIDR) and the true integration-driven revision rate (tIRR) for thresholds of γ ≥ 0.50, i.e. the posterior probability of differential expression greater or equal to 50%. The PI model produced higher tIDR and lower tIRR than the SEI model for all values of γ ≥ 0.50 for the simulated data. Figures 1 and 2 display the tIDR and tIRR results, respectively, for the three simulated levels of p_sand high mean $τ_{g}^{2}$ . Table 1 presents results for all simulated data sets, for representative threshold value γ = 0.95.

**tIDR versus posterior probability of differential expression for the two-study simulation data**. True integration-driven discovery rate (tIDR) versus threshold values of posterior probability of differential expression γ ≥ 0.50, for the standardized expression integration model (blue circles) and probability integration model (black diamonds) for the two-study simulation data with high mean $τ_{g}^{2}$ = 0.7 (differentially expressed); 0.07 (non-differentially expressed) and the following simulated percent differentially expressed genes p_s: a) 5%; b) 10%; c) 25%.

**tIRR versus posterior probability of differential expression for the two-study simulation data**. True integration-driven revision rate (tIRR) versus threshold values of posterior probability of differential expression γ ≥ 0.50, for the standardized expression integration model (blue circles) and probability integration model (black diamonds) for the two-study simulation data with high mean $τ_{g}^{2}$ = 0.7 (differentially expressed); 0.07 (non-differentially expressed) and the following simulated percent differentially expressed genes p_s: a) 5%; b) 10%; c) 25%.

Table 1.

Results for two-study simulation data. True integration-driven discovery rate (tIDR) and true integration-driven revision rate (tIRR) for threshold value of posterior probability of differential expression γ = 0.95, and the number of true discovered genes for posterior expected false discovery rate peFDR = 5%, for the Bayesian standardized expression integration (SEI) and probability integration (PI) models. Results are shown for the three levels of simulated percent differentially expressed genes p_sand three levels of mean inter-study variability $τ_{g}^{2}$ for the two-study simulation data.

		p_s= 5%		p_s= 10%		p_s= 25%

Two-Study Simulation Data		SEI	PI	SEI	PI	SEI	PI
Low: mean $τ_{g}^{2}$ = 0.1 (DE); 0.01 (non DE)	tIDR, γ = 0.95	24.6%	29.9%	12.9%	18.9%	3.8%	9.0%
	tIRR, γ = 0.95	3.7%	0%	2.9%	1.4%	6.2%	1.5%
	True Genes, peFDR = 5%	84	92	183	196	479	513

Medium: mean $τ_{g}^{2}$ = 0.3 (DE); 0.03 (non DE)	tIDR, γ = 0.95	11.2%	19.5%	10.8%	16.2%	2.6%	6.4%
	tIRR, γ = 0.95	16.4%	7.5%	15.9%	4.5%	16.4%	3.1%
	True Genes, peFDR = 5%	80	97	185	211	493	547

High: mean $τ_{g}^{2}$ = 0.7(DE); 0.07(non DE)	tIDR, γ = 0.95	11.0%	15.3%	5.4%	7.6%	2.9%	4.9%
	tIRR, γ = 0.95	16.7%	7.7%	19.9%	6.6%	16.0%	4.8%
	True Genes, peFDR = 5%	93	105	212	235	572	617

Open in a new tab

We also fixed levels of peFDR and compared the number of true discoveries for the two models. We found that both models improved the number of true discoveries versus individual analyses, and the PI model identified more true genes than the SEI model for the same levels of peFDR < 20%. Figure 3 illustrates the results for the three p_slevels and high mean $τ_{g}^{2}$ . Table 1 reports results for all simulations, for representative peFDR = 5%.

**Number of true discovered genes versus peFDR for the two-study simulation data**. The maximum number of true discovered genes versus posterior expected false discovery rate (peFDR) for the standardized expression integration model (blue circles), probability integration model (black diamonds), individual analyses of Study 1 (red checks) and Study 2 (green triangles), for the two-study simulation data with high mean $τ_{g}^{2}$ = 0.7 (differentially expressed); 0.07 (non-differentially expressed) and the following simulated percent differentially expressed genes p_s: a) 5%; b) 10%; c) 25%.

The primary difference between the two models is that the SEI model first combines the mean standardized gene expression levels from each study into an overall mean and then calculates the probability of differential expression based on this overall mean, while the PI model calculates the probability of differential expression based on the separate study means. For genes that have high mean standardized expression in one study but lower mean standardized expression in a second study, the SEI model tends to identify these genes as non-differentially expressed based on an approximately medium overall mean; however, the PI model identifies more of these types of genes as differentially expressed. This is due to keeping the study means separate in the PI model, and thus genes with high probability of differential expression in one study are not overly offset by genes with lower probability of differential expression in the second study. Due to these model differences, the SEI model declares more true genes as non-differentially expressed, thus producing higher tIRR than the PI model. For genes that are declared non-differentially expressed in both individual studies, the PI model identifies more of these genes as differentially expressed in meta-analysis versus the SEI model, resulting in higher tIDR and more true genes identified for the same level of false discovery than the SEI model.

Five-study simulation results

We also compared the SEI and PI models for five independent studies. For this, we simulated three additional studies with format similar to Study 1, but with different parameter values for slide and experiment variation (see Methods). All other parameter specifications were similar to the two-study simulations. In all data sets, the PI model again identified higher tIDR and lower tIRR than the SEI model for all thresholds of γ ≥ 0.50. Figures 4a and 4b show results for p_s= 10% and high mean $τ_{g}^{2}$ ; Table 2 details the results for all data sets, with representative threshold value γ = 0.95. Note that the SEI model produced tIDR = 0% for many levels of γ. In these incidents, for all true genes identified by the SEI model, at least one of the five individual studies had D_gat least as high as γ. When compared to the two-study simulations, combining more studies resulted in lower tIDR in most cases for both the SEI and PI models. This occurred since, for a larger number of studies, it was more likely that some genes had D_g≥ γ in at least one individual study, which reduced tIDR. The tIRR was also lower in most cases in comparison to the two-study simulations for both the SEI and PI models. This was due to the increase in D_gin the meta-analysis models when combining more studies, which reduced tIRR.

**tIDR, tIRR and true discovered genes versus peFDR for the five-study simulation data**. a) True integration-driven discovery rate (tIDR) versus threshold values of posterior probability of differential expression γ ≥ 0.50, for the standardized expression integration model (blue circles) and probability integration model (black diamonds) for the five-study simulation data with high mean $τ_{g}^{2}$ = 0.7 (differentially expressed); 0.07 (non-differentially expressed) and simulated percent differentially expressed genes p_s= 10%; b) True integration-driven revision rate (tIRR) versus threshold values of posterior probability of differential expression γ ≥ 0.50, for the standardized expression integration model (blue circles) and probability integration model (black diamonds) for the five-study simulation data with high mean $τ_{g}^{2}$ = 0.7 (differentially expressed); 0.07 (non-differentially expressed) and simulated percent differentially expressed genes p_s= 10%; c) The maximum number of true discovered genes versus posterior expected false discovery rate (peFDR) for the standardized expression integration model (blue circles), probability integration model (black diamonds), individual analyses of Study 1 (red checks), Study 2 (green triangles), Study 3 (turquoise pluses), Study 4 (pink inverted triangles), Study 5 (gold stars) for the five-study simulation data with high mean $τ_{g}^{2}$ = 0.7 (differentially expressed); 0.07 (non-differentially expressed) and simulated percent differentially expressed genes p_s= 10%.

Table 2.

Results for five-study simulation data. True integration-driven discovery rate (tIDR) and true integration-driven revision rate (tIRR) for threshold value of posterior probability of differential expression γ = 0.95, and the number of true discovered genes for posterior expected false discovery rate peFDR = 5%, for the Bayesian standardized expression integration (SEI) and probability integration (PI) models. Results are shown for the three levels of simulated percent differentially expressed genes p_sand three levels of mean inter-study variability $τ_{g}^{2}$ for the five-study simulation data.

		p_s= 5%		p_s= 10%		p_s= 25%

Five-Study Simulation Data		SEI	PI	SEI	PI	SEI	PI
Low : mean $τ_{g}^{2}$ = 0.1 (DE); 0.01(non DE)	tIRR, γ = 0.95	13.7%	34.7%	3.8%	20.6%	0%	11.4%
	tIRR, γ = 0.95	4.5%	0%	6.7%	0.6%	15.4%	1.4%
	True Genes, peFDR = 5%	88	106	187	215	496	576

Medium : mean $τ_{g}^{2}$ = 0.3 (DE); 0.03 (non DE)	tIDR, γ = 0.95	7.2%	21.3%	2.2%	15.2%	0%	6.7%
	tIRR, γ = 0.95	9.4%	0%	10.2%	3.6%	16.8%	3.0%
	True Genes, peFDR = 5%	99	118	212	241	571	642

High : mean $τ_{g}^{2}$ = 0.7 (DE); 0.07 (non DE)	tIDR, γ = 0.95	3.8%	13.3%	0%	5.4%	0%	3.4%
	tIRR, γ = 0.95	9.8%	0.9%	7.9%	2.8%	10.7%	3.5%
	True Genes, peFDR = 5%	122	135	259	270	654	688

Open in a new tab

For all simulated data sets, both the SEI and PI models again identified more true discoveries than the individual analyses for the same levels of peFDR < 20%; the PI model also found more true discoveries than the SEI model, similar to the two-study simulations. Figure 4c displays the results for p_s= 10% and high mean $τ_{g}^{2}$ ; Table 2 reports the results for all data sets, with representative peFDR = 5%. When compared to two-study simulations, combining more studies resulted in more true discovered genes for the same levels of peFDR, for both the SEI and PI models. This indicates that pooling more data improves the accuracy of peFDR.

The PI model showed improved performance over the SEI model in simulations of five studies for similar reasons as discussed in the simulations of two studies. By calculating the probability of differential expression based on the separate study means in the PI model, genes with high probability of differential expression in at least one study produce a higher overall probability of differential expression in the PI meta-analysis. However, the SEI model first produces an overall mean and then calculates the probability of differential expression based on this overall mean, which results in fewer true genes being declared differentially expressed. Due to these meta-analytic model differences, the PI model results in higher tIDR, lower tIRR and more true differentially expressed genes identified for the same level of false discovery than the SEI model.

Biological data results

We implemented the SEI and PI models to combine the B. subtilis mutant and induction studies, using 2,509 genes that had at least one expression value in each study. We also analyzed each study individually. Each slide was standardized to have zero mean and unit standard deviation. Since the truly differentially expressed genes are unknown for the biological data, we report results somewhat differently than for the simulated data. For both models, we show IDR and IRR for fixed levels of γ ≥ 0.50 with corresponding peFDR (Figures 5a, 5b and Table 3). The PI model produced higher IDR than the SEI model for most levels of γ ≥ 0.50, with corresponding lower peFDR. In a few instances, the IDR was higher for the SEI than the PI model, but the difference was less than 1%, and the corresponding peFDR was lower for the PI model. On average, the PI model produced an IDR 3.5% higher than the SEI model for γ ≥ 0.50, and 6.5% higher for γ ≥ 0.95; the corresponding average peFDR was 1% and 0.3% lower for the PI model, respectively. The IRR was lower for the PI model for all values of γ ≥ 0.50. In addition, for fixed levels of peFDR < 20%, both the PI and SEI models discovered more genes than the individual studies alone, and the PI model discovered more genes than the SEI model in all cases (Figure 5c).

**IDR, IRR and discovered genes versus peFDR for the biological data**. a) Integration-driven discovery rate (IDR) versus threshold values of posterior probability of differential expression γ ≥ 0.50, for the standardized expression integration model (blue circles) and probability integration model (black diamonds) for the *B. subtilis* mutant and induction biological study data; b) Integration-driven revision rate (IRR) versus threshold values of posterior probability of differential expression γ ≥ 0.50, for the standardized expression integration model (blue circles) and probability integration model (black diamonds) for the *B. subtilis* mutant and induction biological study data; c) The maximum number of differentially expressed genes versus posterior expected false discovery rate *(peFDR)* for the standardized expression integration model (blue circles), probability integration model (black diamonds), individual analyses of *B. subtilis* mutant study (red checks) and induction study (green triangles).

Table 3.

Results for Bacillus subtilis biological data. Integration-driven discovery rate (IDR), integration-driven revision rate (IRR) and posterior expected false discovery rate (peFDR) for various threshold values of posterior probability of differential expression γ ≥ 0.50, for the standardized expression integration (SEI) and probability integration (PI) models applied to the B. subtilis mutant and induction biological study data.

	Integration-Driven Discovery Rate		Integration-Driven Revision Rate		Posterior Expected FDR

γ	SEI	PI	SEI	PI	SEI	PI
0.995	51.1%	64.6%	32.4%	14.7%	0.0008	0.0004
0.99	46.3%	61.4%	28.2%	12.7%	0.002	0.001
0.95	35.7%	37.9%	20.3%	8.9%	0.013	0.007
0.90	26.7%	27.1%	7.1%	3.1%	0.022	0.014
0.50	4.7%	7.1%	12.4%	8.6%	0.098	0.085

Open in a new tab

Sensitivity analysis

The prior distributions of the slide effect and experiment effect variance parameters, $φ_{j g}^{2}$ and $σ_{j g}^{2}$ , respectively, require some information from the data in order for the models to converge. When assigning uninformative distributions to these parameters, i.e. $φ_{j g}^{2}$ ~ Inverse Gamma(0.001,0.001) and $σ_{j g}^{2}$ ~ Inverse Gamma(0.001,0.001), the models do not converge. We thus assigned inverse chi-squared distributions to $φ_{j g}^{2}$ and $σ_{j g}^{2}$ , with scale parameters based on pooling variance information from all genes, similar to other authors ([17,23,24]). We used 3 degrees of freedom so that the prior distributions were as uninformative as possible (similar to [17]). To examine the sensitivity of the results to the degrees of freedom of the scaled inverse chi-squared distributions, we performed sensitivity analyses for the two-study and five-study simulation data sets as well as the biological data. For the simulation data, we used the data sets with percent of differentially expressed genes p_s= 10% and medium mean $τ_{g}^{2}$ . We repeated the analyses with the following degrees of freedom for both $φ_{j g}^{2}$ and $σ_{j g}^{2}$ : 6, 10, 20, 40. Larger degrees of freedom correspond to more informative priors, i.e. smaller means and variances imposed upon the variance parameters. In all analyses, we found that with more informative priors, the mean posterior probabilities of differential expression for all genes increased. For the two-study simulation data, this resulted in larger numbers of true genes identified for the same levels of peFDR for both the SEI and PI models. tIDR decreased for larger degrees of freedom for thresholds of γ ≥ 0.50 for both the SEI and PI models, since the individual studies as well as the meta-analyses produced higher posterior probabilities of differential expression, which lowered tIDR. tIRR decreased for larger degrees of freedom for thresholds of γ ≥ 0.50 for both the SEI and PI models, again due to the higher posterior probabilities of differential expression.

The five-study simulation results were similar to the two-study results, except that the tIRR increased slightly for the SEI model for larger degrees of freedom for some thresholds of γ ≥ 0.50. This was due to the larger number of individual studies; with more individual studies, there were more genes with higher posterior probabilities of differential expression in at least one individual study versus the SEI meta-analysis, which increased tIRR for the SEI model. For the biological data, we found similar results to the two-study simulation data: with larger degrees of freedom, IDR, IRR and peFDR decreased on average for thresholds of γ ≥ 0.50 for both the SEI and PI models.

Overall, the PI model outperformed the SEI model for all prior degrees of freedom imposed, and using 3 degrees of freedom resulted in the most conservative findings for the posterior probabilities of differential expression. We show results for the two-study and five-study simulation data in Table 4 and Supplemental Figures S1 and S2 (see Additional file 1); the biological data results are displayed in Table 5 and Supplemental Figure S3 (see Additional file 1).

Table 4.

Sensitivity analysis results for the two-study and five-study simulation data. Sensitivity analysis results for the prior degrees of freedom assigned to the slide and experiment effect variance parameters, $φ_{j g}^{2}$ and $σ_{j g}^{2}$ , respectively, for the two-study and five-study simulation data sets with medium mean $τ_{g}^{2}$ = 0.3 (differentially expressed); 0.03 (non-differentially expressed) and simulated percent differentially expressed genes p_s= 10%. Shown are true integration-driven discovery rate (tIDR) and true integration-driven revision rate (tIRR) for threshold value of posterior probability of differential expression γ = 0.95, and the number of true discovered genes for posterior expected false discovery rate peFDR = 5%, for the Bayesian standardized expression integration (SEI) and probability integration (PI) models.

Two-Study Simulation Data		True Integration-Driven Discovery Rate, γ = 0.95		True Integration-Driven Revision Rate, γ = 0.95		True Genes, peFDR = 5%
Prior Degrees of Freedom
$φ_{j g}^{2}$	$σ_{j g}^{2}$	SEI	PI	SEI	PI	SEI	PI

3	3	10.8%	16.2%	15.9%	4.5%	185	211
6	6	4.4%	11.9%	11.6%	1.2%	197	216
10	10	3.6%	10.2%	7.9%	0.6%	202	218
20	20	1.7%	8.1%	4.4%	0.5%	206	219
40	40	1.6%	6.6%	5.9%	0.5%	208	221

Five-Study Simulation Data

3	3	2.2%	15.2%	10.2%	3.6%	212	241
6	6	0%	8.8%	13.0%	2.8%	219	246
10	10	0%	6.9%	13.1%	2.7%	225	248
20	20	0%	3.9%	14.4%	3.1%	228	250
40	40	0%	5.1%	12.8%	1.8%	227	250

Open in a new tab

Table 5.

Sensitivity analysis results for the Bacillus subtilis biological data. Sensitivity analysis results for the prior degrees of freedom assigned to the slide and experiment effect variance parameters, $φ_{j g}^{2}$ and $σ_{j g}^{2}$ , respectively, for the B. subtilis mutant and induction biological study data. Shown are the integration-driven discovery rate (IDR), integration-driven revision rate (IRR) and posterior expected false discovery rate (peFDR) for threshold value of posterior probability of differential expression γ = 0.95, for the standardized expression integration (SEI) and probability integration (PI) models applied to the B. subtilis mutant and induction biological study data.

Prior Degrees of Freedom		Integration-Driven Discovery Rate, γ = 0.95		Integration-Driven Revision Rate, γ = 0.95		Posterior Expected FDR, γ = 0.95

$φ_{j g}^{2}$	$σ_{j g}^{2}$	SEI	PI	SEI	PI	SEI	PI
3	3	35.7%	37.9%	20.3%	8.9%	0.013	0.0073
6	6	13.9%	20.1%	7.9%	5.3%	0.0088	0.0049
10	10	7.1%	14.0%	7.1%	2.4%	0.0059	0.0037
20	20	6.3%	14.5%	7.7%	0.0%	0.0044	0.0040
40	40	5.3%	13.3%	10.1%	0.7%	0.0038	0.0041

Open in a new tab

Conclusion

We compared two Bayesian approaches to meta-analysis of microarray data: the standardized expression integration and probability integration models. The standardized expression integration model includes inter-study variability and may thus improve accuracy of findings; it also produces an overall estimate of standardized gene expression among all studies. However, due to the typical small number of studies in meta-analyses for microarrays, the inter-study variability is difficult to model. Alternatively, the probability integration approach eliminates the need to specify inter-study variability since each study is modeled separately, with overall smoothing of probabilities across studies. In our simulations of two and five studies, the probability integration model produced higher tIDR and lower tIRR than the standardized expression integration model for fixed posterior probabilities of differential expression, and also identified more true discoveries for the same levels of peFDR. We found similar results for the biological data, with the probability integration model producing higher IDR on average and lower IRR with corresponding lower values of peFDR, for fixed probabilities of differential expression. We conclude that, for our data sets, aggregating probabilities across studies rather than combining gene expression levels improves IDR, IRR and the number of discovered genes versus peFDR.

In the standardized expression integration meta-analysis model, the prior assignment for the inter-study variability had a large impact on the results. We assigned some of the most common prior distributions used in practice: inverse gamma, log-logistic and locally uniform. The uninformative inverse gamma and log-logistic distributions were gene-specific and did not pool information from similar genes; these distributions did not improve results versus individual analyses. The informative inverse gamma distributions pooled information either from all genes or sets of differentially and non-differentially expression genes, but also did not improve upon separate study analyses. We found the most improvement in true integration-driven discovery rates and increases in true discoveries versus peFDR using the locally uniform prior distribution centered at the medians of the differentially expressed and non-expressed genes; this emphasizes the need for priors that pool information across genes rather than using individual gene information or uninformative priors, for small data sets.

The probability integration model does not produce an overall measure of expression for each gene, similar to the classical meta-analysis methods of combining p-values, probabilities and ranks. However, study-specific gene expression values are produced, and these can be examined for individual genes of interest.

The standardized expression integration and probability integration models presented here were developed for studies from the same platform. A common control sample is not required across studies, and the studies are assumed to be independent. In addition, the models do not require the same array-layout across studies. For example, some studies could have replicate slides within repeated experiments, while other studies could have only replicate slides within a single study. The models are thus applicable to a wide range of study designs.

Methods

Two study simulation data

We simulated data for two studies, with similar format to the B. subtilis mutant and induction biological data (see Methods: Biological data), with simulated proportion of differentially expressed genes p_s= 5%, 10%, 25%. Each study contained 3,000 genes. Study 1 had 5 replicate slides within 3 repeated experiments, and Study 2 had 4 replicate slides within 3 repeated experiments. We simulated data from Model (1), with model parameters chosen to resemble the biological data. We set $η_{g 0}^{2}$ = 0.025 and c = 48. For Study 1, we assigned variance across slides to 0.074 and across experiments to 0.026. For Study 2, we assigned slide variation to 0.023 and experiment variation to 0.022. The biological data had inter-study variability that was approximately Normally distributed with mean of 0.33 and variation of 0.12 for the top 10% of genes, and mean 0.031 and variation 0.004 for the remaining genes. We simulated $τ_{g}^{2}$ from a Normal distribution, with mean values that were lower, similar to, and higher than these values for the overexpressed and non-expressed genes, respectively: low mean: 0.1 and 0.01; medium mean: 0.3 and 0.03; high mean: 0.7 and 0.07; with variation values equivalent to the biological data. We refer to these as low, medium and high mean levels of inter-study variability. Each slide was standardized to have mean zero and unit standard deviation. Although correlation of expression is expected among genes, this has been shown to be difficult to simulate. We thus assumed independence among genes in simulations, similar to other studies (see, for example, Gottardo et al. [23], Lönnstedt and Speed [24]).

Simulation data for five studies

For the simulation of five studies, Study 1 and Study 2 were the same as in the previous section, and we simulated data for 3 additional studies, with proportion of differentially expressed genes p_s= 5%, 10%, 25%. In total there were 3,000 genes. For Studies 3, 4 and 5, we used the study format similar to Study 1, with 5 replicate slides within 3 replicate experiments. For the slide and experiment variance parameters, we assigned values that were either within the range of values for Study 1 and Study 2, or somewhat outside the range. For Study 3, the slide variance was assigned 0.05, and experiment variance 0.02. For Study 4, the slide variance was assigned 0.04, and experiment variance 0.022. For Study 5, the slide variance was set to 0.06, and experiment variance 0.03. We again set $η_{g 0}^{2}$ = 0.025 and c = 48 and simulated $τ_{g}^{2}$ from Normal distributions with three mean levels: low, medium and high, with variation values equivalent to the biological data, similar to the two-study simulation data. Each slide was standardized to have zero mean and unit standard deviation.

Biological data

B. subtilis is a bacterium that responds to starvation by forming spores, which allow it to survive in extreme environmental conditions. Two independent B. subtilis microarray studies were designed to identify genes in the sporulation pathway controlled by the sigma factor σ^E. The studies had reciprocal designs; the first was a mutation of σ^E, and the second was an induction of σ^E, with details in the following (see also [50,58]).

Mutant study

For the mutant study, cells with a deletion in the gene for σ^E(the mutant sample) were compared to wild-type cells. The wild-type/mutant ratios identified genes under the control of σ^E. In total, five microarrays were created from three independent repeated experiments; the first experiment produced three replicate slides and the second and third experiments each produced one slide. Each array contained somewhat more spots than the B. subtilis genome size of 4,106 due to multiple spotting of specific genes on the arrays. The percent of missing data due to low quality spots ranged from 18.6% to 64.5% across the five slides; a total of 3,713 genes had measurable expression values for at least one slide. We used log-ratios of intensities [44], normalized slides using a rank-invariant method [17,59] and standardized each slide to have zero mean and unit standard deviation.

Induction study

The induction study was an overexpression of σ^Ein which cells treated with an inducer were compared to unaltered cells. The induction/wild-type ratios identified genes in the σ^Eregulon. In total, four microarrays were created from three independent repeated experiments. The first two experiments each produced one slide and the third experiment produced two replicate slides. The percent of data removed due to low quality spots ranged from 52.6% to 67.0% across the four slides; a total of 2,552 had measurable expression for at least one slide. We again analyzed the post-normalized log-ratios of intensities, and standardized each array to have zero mean and unit standard deviation.

Standardized expression integration model: prior distributions for inter-study variation

We assigned the following inverse gamma, log-logistic and locally uniform prior distributions for the inter-study variability parameter $τ_{g}^{2}$ in the standardized expression integration model (Model (1)) for both the simulation and biological data. We applied both uninformative and informative priors; the informative priors either used individual gene information or pooled information from sets of genes. We found that uninformative priors and priors that used individual gene information did not improve upon individual study analyses. Only one prior distribution produced true integration-driven discovery rates > 0% for γ ≥ 0.50, and an increase in true discoveries versus peFDR < 20% compared to individual study analyses in the two and five study simulations: 3) locally uniform centered at the median of the inter-study variability (see below).

1) Inverse gamma distribution (Choi et al. [2], Smith et al. [34], Normand [35], Sargent et al. [38]): this is a standard conjugate prior distribution for variance parameters. We first applied an uninformative prior distribution, in order to allow the data to inform on the posterior distribution. However, since this distribution did not result in improved performance, we also applied two informative distributions, as follows.

a) Uninformative inverse gamma distribution: IG(0.001,0.001), corresponding to mean 1, variance 1000.

b) Informative inverse gamma distribution: IG(α, β), with α, β chosen so that the mean and variance were equal to the mean and variance of the inter-study variability based on data of all genes. This prior distribution pools data from all genes in order to provide a more accurate measure of inter-study variability. Pooling variance information from all genes is similar to the methods of Tseng et al. [17], Gottardo et al. [23] and Lönnstedt and Speed [24].

c) Informative inverse gamma distribution: separate priors for differentially and non-differentially expressed genes. Since the inter-study variability is higher for differentially expressed genes than non-expressed genes, we assigned two different priors conditioned on I_g= 1 and I_g= 0. For I_g= 1, we assigned an inverse-gamma distribution with mean and variance equal to that of the inter-study variability based on the top p% of data. The proportion p was estimated from the average of the individual study analyses. Similarly for I_g= 0, we assigned the prior based on the remaining (1-p)% of data. Estimating inverse-gamma parameters separately for the top and remaining proportions of genes is similar to the method of Gottardo et al. [23].

2) Log-logistic distribution (DuMouchel and Normand [36]):

$p (τ_{g}) = \frac{s_{g 0}}{{(s_{g 0} + τ_{g})}^{2}} for τ_{g} \geq 0, where s_{g 0}^{2} = \frac{K}{\sum s_{g j}^{- 2}} .$

Here, K is the number of studies and $s_{g j}^{2}$ is sampling variability for gene g in study j. Note that this prior assignment is calculated for each gene individually. Our data included slide and experiment variance; we thus calculated $s_{g j}^{2}$ for gene g and study j as follows:

$s_{g j}^{2} = \frac{n_{1} ν_{1}^{2} + n_{2} ν_{2}^{2} + ... + n_{E} ν_{E}^{2}}{{(n_{1} + n_{2} + ... + n_{E})}^{2}},$

where E is the total number of experiments, n_eis the number of slides within experiment e, and $ν_{e}^{2}$ is the sample variance of the y_jgsewithin experiment e.

As discussed in DuMouchel and Normand [36], the $s_{g 0}^{2}$ is the harmonic mean of the K sampling variances, $s_{g j}^{2}$ , and the density p(τ_g) has median equal to s_g0. The quartiles of the distribution of τ_gare s_g0/3, s_g0and 3s_g0, so that the distribution covers a sensible range of values. If the sample standard deviations are not equal across microarray studies, then s_g0will be weighted toward the studies with smaller s_gj; this prevents the meta-analysis results from being overly influenced by a few studies that have large s_gjvalues.

3) Locally uniform distribution (Gelman et al. [39]): separate priors for differentially and non-differentially expressed genes. The uniform prior is used when a variable is known to lie within a specific interval and is equally likely to be found anywhere within the interval. For inter-study variability of gene expression, this prior assignment pools information from the differentially and non-differentially expressed genes in order to provide a more accurate estimate of the variability. Again, since the inter-study variability is higher for differentially expressed than non-expressed genes, we assigned different priors conditioned on I_g= 1 and I_g= 0. For I_g= 1, we assigned a locally uniform prior centered at the median inter-study variability based on the top p% of data. The percent p was determined from the average of the individual study analyses. Similarly for I_g= 0, we assigned a locally uniform prior based on the remaining (1-p)% of data. The ranges of the uniform distributions were ± one standard error of the median. The standard error of the median was estimated as 1.253 × standard error of the mean (Kendall et al. [60]). We selected the range of ± one standard error of the median based on exploratory calculations. In simulations, this range produced higher tIDR for γ ≥ 0.50 and more true discoveries for levels of peFDR < 20% versus individual study analyses compared to the following alternative ranges: fixed medians (range of zero); and ± two standard errors of the medians. We also repeated prior assignment 3) using means in place of medians, but this prior specification did not result in improvements in discoveries versus individual study analyses.

Normal versus t-distribution modeling of the study-specific mean values

As discussed and implemented by several authors (Smith et al. [34], Sargent et al. [38], Choi et al. [2]), an alternative to the Normal distribution assumption for the study-specific mean values θ_jgis to model the values as t-distributions, due to the small number of studies. We repeated each of the prior assignments 1) to 3) above assuming a t-distribution with degrees of freedom less than 30 (with degrees of freedom 30 equivalent to a Normal distribution). For simulated data, we found that the tIDR versus γ ≥ 0.50 and the number of true discoveries versus peFDR decreased with fewer degrees of freedom. Based on these findings, we model the mean values θ_jgas Normal distributions in the standardized expression integration model. The Normal assumption is commonly used in hierarchical Bayesian meta-analysis models, even for small numbers of studies (see, for example, Normand [35], Pauler and Wakefield [37], Sargent et al. [38]).

Markov chain Monte Carlo implementation

We implement a Markov chain Monte Carlo (MCMC) procedure to simulate the posterior distributions of each of the parameters. MCMC methods generate samples from a density p(ψ) for a parameter ψ (with p(ψ) possibly known only to a constant of proportionality) by creating a Markov chain on the state space of ψ which has p as its true (stationary) distribution (see Liu et al. [51] for further details).

Joint posterior distributions

For Model (1), the joint distribution of the data and parameters is:

$\begin{array}{l} p (y_{j g s e}, ξ_{j g e}, φ_{j g}^{2}, σ_{j g}^{2}, θ_{j g}, τ_{g}^{2}, μ_{g}, I_{g}, p, η_{g 0}^{2}, c) = \\ [\prod_{j = 1}^{J} \prod_{g = 1}^{G} \prod_{e = 1}^{E} {\prod_{s = 1}^{S_{e}} p (y_{j g s e} | ξ_{j g e}, φ_{j g}^{2}) p (ξ_{j g e} | θ_{j g}, σ_{j g}^{2})} p (θ_{j g} | μ_{g}, τ_{g}^{2}) p (μ_{g}, I_{g} | p, Ω_{j}) \times \\ p (φ_{j g}^{2}) p (σ_{j g}^{2}) p (τ_{g}^{2}) p (p) p (Ω_{j})], \end{array}$

where Ω_j= ( $η_{g 0}^{2}$ , c), j = study, g = gene, e = experiment, s = slide.

For Model (2), the joint distribution of the data and parameters is:

$\begin{array}{l} p (y_{j g s e}, ξ_{j g e}, φ_{j g}^{2}, σ_{j g}^{2}, θ_{j g}, I_{g}, p, η_{j g 0}^{2}, c_{j}) = \\ \prod_{j = 1}^{J} \prod_{g = 1}^{G} \prod_{e = 1}^{E} {\prod_{s = 1}^{S_{e}} p (y_{j g s e} | ξ_{j g e}, φ_{j g}^{2}) p (ξ_{j g e} | θ_{j g}, σ_{j g}^{2})} p (θ_{j g}, I_{g} | p, Ω_{j}) p (φ_{j g}^{2}) p (σ_{j g}^{2}) p (p) p (Ω_{j}), \end{array}$

where Ω_j= ( $η_{j g 0}^{2}$ , c_j), j = study, g = gene, e = experiment, s = slide.

Prior distributions

We assigned as uninformative prior distributions as possible to the parameters of Models (1) and (2) that still resulted in convergence of the models. For the parameter p, we assigned a non-informative Uniform(0, l) distribution. The prior distributions of the slide effect and experiment effect variance parameters, $φ_{j g}^{2}$ and $σ_{j g}^{2}$ , respectively, required some information from the data in order for the models to converge. We assigned the following prior distributions for these parameters (see also [17,23,24]):

$\begin{matrix} φ_{j g}^{2} ~ k {\tilde{φ}}_{j}^{2} / χ_{k}^{2} \\ σ_{j g}^{2} ~ h {\tilde{σ}}_{j}^{2} / χ_{h}^{2} \end{matrix}$

Here, ${\tilde{φ}}_{j}^{2}$ and ${\tilde{σ}}_{j}^{2}$ are scale parameters of the inverse chi-squared distribution obtained from the data. ${\tilde{φ}}_{j}^{2}$ is computed as follows:

${\tilde{φ}}_{j}^{2} = \frac{1}{G (\sum S_{e} - 1)} \sum_{g = 1}^{G} \sum_{e = 1}^{E} \sum_{s = 1}^{S_{e}} {(y_{j g s e} - y_{j g . e})}^{2},$

where y_jg.eis the log-expression ratio averaged over the slides within each experiment:

$y_{j g . e} = \frac{1}{S_{e}} \sum_{s = 1}^{S_{e}} y_{j g s e} .$

The scale parameter for $σ_{j g}^{2}$ is similarly produced as follows:

${\tilde{σ}}_{j}^{2} = \frac{1}{G (E - 1)} \sum_{g = 1}^{G} \sum_{e = 1}^{E} {(y_{j g . e} - y_{j g ..})}^{2},$

where y_jg..is the log-expression ratio averaged over both slides and experiments. We assigned 3 degrees of freedom in each study for $φ_{j g}^{2}$ and $σ_{j g}^{2}$ , i.e. h = k = 3, so that the distribution is as uninformative as possible (see also Results and Discussion: Sensitivity analysis). The remaining parameters were assigned the following prior distributions, which were as uninformative as possible while still allowing the models to converge.

$\begin{matrix} η_{g 0}^{2} ~ a s_{1}^{2} / χ_{a}^{2} \\ c ~ b s_{2}^{2} / χ_{b}^{2} \end{matrix}$

The degrees of freedom and scale parameters a, $s_{1}^{2}$ , respectively, were assigned so that the prior mean of $η_{g 0}^{2}$ was 1 with variance 0.1, and b, $s_{2}^{2}$ were assigned so that the prior mean of c was 100 with variance 10,000. For a description of the prior distribution for $τ_{g}^{2}$ , see the Methods section: Standardized expression integration model: prior distributions for inter-study variation.

Full conditional posterior distributions

Each parameter was sampled from the full conditional posterior distributions by an MCMC algorithm using the WinBUGS software [61]. We used 5,000 iterations for all analyses, which was more than sufficient. Further MCMC implementation details can be found in Conlon et al. [15].

WinBUGS running time

The WinBUGS running time ranged from approximately 7 minutes for 3,000 genes and two studies for the PI model to 2.41 hours for 20,000 genes and five studies for the SEI model, for 5,000 iterations, using a personal computer with an Intel Core Duo T2500 2.0 GHz Processor. Table 6 details running times in seconds for the number of genes: 3,000, 10,000, 20,000, for two and five studies and 5,000 iterations for both the PI and SEI models.

Table 6.

Runtime values for the WinBUGS MCMC implementation. Runtime values in seconds for 5,000 iterations of the WinBUGS MCMC implementation of SEI and PI models for various numbers of genes and studies, using a personal computer with an Intel Core Duo T2500 2.0 GHz Processor.

Runtime (Seconds)
	Number of Genes

Two Studies	3,000	10,000	20,000

SEI model	561	2,652	4,621
PI model	415	1,947	3,375

Five Studies

SEI model	1,511	4,981	8,679
PI model	1,204	4,287	8,008

Open in a new tab

Availability and requirements

The WinBUGS code for implementing the models is included in the Supplemental Files (see Additional files 2 and 3).

Operating system: Windows 98 or later.

Other requirements: WinBUGS software version 1.4 or later [61].

License: free.

Authors' contributions

All authors contributed to development of the methodology. EMC and JJS contributed to writing the computer code and writing the manuscript.

Supplementary Material

Additional File 1

Supplemental Figures. containing Figures for the Sensitivity Analysis Results.

Click here for file^{(137.9KB, pdf)}

Additional File 2

WinBUGS code for the SEI model. containing the WinBUGS code for the SEI model.

Click here for file^{(3.5KB, txt)}

Additional File 3

WinBUGS code for the PI model. containing the WinBUGS code for the PI model.

Click here for file^{(3.3KB, txt)}

Acknowledgments

Acknowledgements

We thank Joseph Horowitz for helpful discussion, and Patrick Eichenberger and the laboratory of Richard Losick for the B. subtilis microarray data and helpful advice. We also thank three anonymous reviewers for insightful comments that enhanced the manuscript. EMC was supported by a University of Massachusetts Healey Endowment Grant.

Contributor Information

Erin M Conlon, Email: econlon@mathstat.umass.edu.

Joon J Song, Email: jjsong@uark.edu.

Anna Liu, Email: anna@math.umass.edu.

References

Wang J, Coombes KR, Highsmith WE, Keating MJ, Abruzzo LV. Differences in gene expression between B-cell chronic lymphocytic leukemia and normal B cells: a meta-analysis of three microarray studies. Bioinformatics. 2004;20:3166–3178. doi: 10.1093/bioinformatics/bth381. [DOI] [PubMed] [Google Scholar]
Choi JK, Yu U, Kim S, Yoo OJ. Combining multiple microarray studies and modeling inter-study variation. Bioinformatics. 2003. pp. i84–i90. [DOI] [PubMed]
Stevens JR, Doerge RW. Combining Affymetrix microarray results. BMC Bioinformatics. 2005;6:57. doi: 10.1186/1471-2105-6-57. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hu P, Greenwood CMT, Beyene J. Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models. BMC Bioinformatics. 2005;6:128. doi: 10.1186/1471-2105-6-128. [DOI] [PMC free article] [PubMed] [Google Scholar]
Morris JS, Yin G, Baggerly KA, Wu C, Zhang L. Pooling information across different studies and oligonucleotide microarray chip types to identify prognostic genes for lung cancer. In: Shoemaker JS, Lin SM, editor. Methods of Microarray Data Analysis IV. New York: Springer-Verlag; 2005. pp. 51–66. [Google Scholar]
Park T, Yi SG, Shin YK, Lee S. Combining multiple microarrays in the presence of controlling variables. Bioinformatics. 2006;22:1682–1689. doi: 10.1093/bioinformatics/btl183. [DOI] [PubMed] [Google Scholar]
Rhodes DR, Barrette TR, Rubin MA, Ghosh D, Chinnaiyan AM. Meta-analysis of microarrays: inter-study validation of gene expression profiles reveals pathway dysregulation in prostate cancer. Cancer Research. 2002;62:4427–4433. [PubMed] [Google Scholar]
Rhodes DR, Yu J, Shanker K, Deshpande N, Varambally R, Ghosh D, Barrette T, Pandey A, Chinnaiyan AM. Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. Proc Natl AcadSci USA. 2004;101:9309–9314. doi: 10.1073/pnas.0401994101. [DOI] [PMC free article] [PubMed] [Google Scholar]
Parmigiani G, Garrett-Mayer ES, Anbazhagan R, Gabrielson E. A cross-study comparison of gene expression studies for the molecular classification of lung cancer. Clinical Cancer Research. 2004;10:2922–2927. doi: 10.1158/1078-0432.ccr-03-0490. [DOI] [PubMed] [Google Scholar]
Shen R, Ghosh D, Chinnaiyan AM. Prognostic meta-signature of breast cancer developed by two-stage mixture modeling of microarray data. BMC Genomics. 2004;5:94. doi: 10.1186/1471-2164-5-94. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xu L, Tan AC, Naiman DQ, Geman D, Winslow RL. Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data. Bioinformatics. 2005;21:3905–3911. doi: 10.1093/bioinformatics/bti647. [DOI] [PubMed] [Google Scholar]
Warnat P, Eils R, Brors B. Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes. BMC Bioinformatics. 2005;6:265. doi: 10.1186/1471-2105-6-265. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jiang H, Deng Y, Chen H, Tao L, Sha Q, Chen J, Tsai C, Zhang S. Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes. BMC Bioinformatics. 2004;5:81. doi: 10.1186/1471-2105-5-81. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ghosh D, Barette TR, Rhodes D, Chinnaiyan AM. Statistical issues and methods for meta-analysis of microarray data: a case study in prostate cancer. Functional & Integrative Genomics. 2003;3:180–188. doi: 10.1007/s10142-003-0087-5. [DOI] [PubMed] [Google Scholar]
Conlon EM, Song JJ, Liu JS. Bayesian models for pooling microarray studies with multiple sources of replications. BMC Bioinformatics. 2006;7:247. doi: 10.1186/1471-2105-7-247. [DOI] [PMC free article] [PubMed] [Google Scholar]
Baldi P, Long AD. A Bayesian framework for the analysis of microarray expression data: reguralized t-test and statistical inferences of gene changes. Bioinformatics. 2001;17:509–519. doi: 10.1093/bioinformatics/17.6.509. [DOI] [PubMed] [Google Scholar]
Tseng GC, Oh MK, Rohlin L, Liao JC, Wong WH. Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects. Nucleic Acids Res. 2001;29:2549–2557. doi: 10.1093/nar/29.12.2549. [DOI] [PMC free article] [PubMed] [Google Scholar]
Townsend JP, Hartl DL. Bayesian analysis of gene expression levels: statistical quantification of relative mRNA level across multiple treatments or samples. Genome Biology. 2002;3 doi: 10.1186/gb-2002-3-12-research0071. research0071.1-71.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
Efron B, Tibshirani R, Storey JD, Tusher VG. Empirical Bayes Analysis of a Microarray Experiment. Journal of the American Statistical Association. 2001;96:1151–1160. [Google Scholar]
Newton MA, Kendziorski CM, Richmond CS, Blattner FR, Tsui KW. On Differential Variability of Expression Ratios: Improving Statistical Inference About Gene Expression Changes From Microarray Data. Journal of Computational Biology. 2001;8:37–52. doi: 10.1089/106652701300099074. [DOI] [PubMed] [Google Scholar]
Ibrahim JG, Chen M-H, Gray RJ. Bayesian Models for Gene Expression With DNA Microarray Data. Journal of the American Statistical Association. 2002;97:88–99. [Google Scholar]
Broët P, Richardson S, Radvanyi F. Bayesian hierarchical model for identifying changes in gene expression from microarray experiments. Journal of'Computational Biology. 2002;9:671–683. doi: 10.1089/106652702760277381. [DOI] [PubMed] [Google Scholar]
Gottardo R, Pannucci JA, Kuske CR, Brettin T. Statistical analysis of microarray data: a Bayesian approach. Biostatistics. 2003;4:597–620. doi: 10.1093/biostatistics/4.4.597. [DOI] [PubMed] [Google Scholar]
Lönnstedt I, Speed TP. Replicated microarray data. Statistica Sinica. 2002;12:31–46. [Google Scholar]
Pan W. A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments. Bioinformatics. 2002;18:546–554. doi: 10.1093/bioinformatics/18.4.546. [DOI] [PubMed] [Google Scholar]
Kendziorski CM, Newton MA, Lan H, Gould MN. On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles. Statistics in Medicine. 2003;22:3899–3914. doi: 10.1002/sim.1548. [DOI] [PubMed] [Google Scholar]
Newton MA, Noueiry A, Sarkar D, Ahlquist P. Detecting differential gene expression with a semiparametric hierarchical mixture method. Biostatistics. 2004;5:155–176. doi: 10.1093/biostatistics/5.2.155. [DOI] [PubMed] [Google Scholar]
Do KA, Müller P, Tang F. A Bayesian mixture model for differential gene expression. Journal of the Royal Statistical Society C. 2005;54:627–644. [Google Scholar]
Ishwaran H, Rao JS. Detecting Differentially Expressed Genes in Microarrays Using Bayesian Model Selection. Journal of the American Statistical Association. 2003;98:438–55. [Google Scholar]
Ishwaran H, Rao JS. Spike and Slab Gene Selection for Multipgroup Microarray Data. Journal of the American Statistical Association. 2005;100:764–780. [Google Scholar]
Stangl DK, Berry DA. Meta-analysis: past and present challenges. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 1–28. [Google Scholar]
Tweedie RL, Scott DJ, Biggerstaff BJ, Mengersen KL. Bayesian meta-analysis, with application to studies of ETS and lung cancer. Lung Cancer. 1996;14:S171–S194. doi: 10.1016/s0169-5002(96)90222-6. [DOI] [PubMed] [Google Scholar]
DuMouchel WH, Harris JE. Bayes methods for combining the results of cancer studies in humans and other species. Journal of the American Statistical Association. 1983;78:293–315. [Google Scholar]
Smith TC, Spiegelhalter DJ, Thomas A. Bayesian approaches to random-effects meta-analysis: a comparative study. Stat Med. 1995;14:2685–2699. doi: 10.1002/sim.4780142408. [DOI] [PubMed] [Google Scholar]
Normand SL. Meta-analysis: formulating, evaluating, combining, and reporting. Stat Med. 1999;18:321–359. doi: 10.1002/(sici)1097-0258(19990215)18:3<321::aid-sim28>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]
DuMouchel W, Normand SL. Computer-modeling and graphical strategies for meta-analysis. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 127–178. [Google Scholar]
Pauler DK, Wakefield J. Modeling and implementation issues in Bayesian meta-analysis. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 205–230. [Google Scholar]
Sargent DJ, Zee BC, Milan C, Torri V, Francini G. Meta-analysis of individual-patient survival data using random-effect models. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 255–275. [Google Scholar]
Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian Data Analysis. 2. New York: Chapman & Hall; 2003. [Google Scholar]
Kuo WP, Jenssen TK, Butte AJ, Ohno-Machado L, Kohane IS. Analysis of matched mRNA measurements from two different microarray technologies. Bioinformatics. 2002;18:405–412. doi: 10.1093/bioinformatics/18.3.405. [DOI] [PubMed] [Google Scholar]
Jarvinen AK, Hautaniemi S, Edgren H, Auvinen P, Saarela J, Kallioniemi OP, Monni O. Are data from different gene expression microarray platforms comparable? Genomics. 2004;83:1164–1168. doi: 10.1016/j.ygeno.2004.01.004. [DOI] [PubMed] [Google Scholar]
Mah N, Thelin A, Lu T, Nikolaus S, Kuhbacher T, Gurbuz Y, Eickhoff H, Kloppel G, Lehrach H, Mellgard B, Costello CM, Schreiber S. A comparison of oligonucleotide and cDNA-based microarray systems. Physiol Genomics. 2004;16:361–370. doi: 10.1152/physiolgenomics.00080.2003. [DOI] [PubMed] [Google Scholar]
Hedges LV, Olkin I. Statistical Methods for Meta-Analysis. Orlando: Academic Press; 1985. [Google Scholar]
Dudoit S, Yang YH, Callow MJ, Speed TP. Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Statistica Sinica. 2002;12:111–139. [Google Scholar]
Dominici F, Parmigiani G. Combining studies with continuous and dichotomous responses: a latent-variables approach. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 105–125. [Google Scholar]
Lockhart DJ, Winzeler EA. Genomics, gene expression and DNA arrays. Nature. 2000;405:827–836. doi: 10.1038/35015701. [DOI] [PubMed] [Google Scholar]
Wu TD. Analyzing gene expression data from DNA microarrays to identify candidate genes. Journal of Pathology. 2001;195:53–65. doi: 10.1002/1096-9896(200109)195:1<53::AID-PATH891>3.0.CO;2-H. [DOI] [PubMed] [Google Scholar]
Hardiman G. Microarray technologies – an overview. Pharmacogenomics. 2002;3:293–297. doi: 10.1517/14622416.3.3.293. [DOI] [PubMed] [Google Scholar]
Southern EM. DNA microarrays. History and overview. Methods Mol Biol. 2000;170:1–15. doi: 10.1385/1-59259-234-1:1. [DOI] [PubMed] [Google Scholar]
Eichenberger P, Jensen ST, Conlon EM, van Ooij C, Silvaggi J, Gonzalez-Pastor JE, Fujita M, Ben-Yehuda S, Stragier P, Liu JS, Losick R. The sigmaE regulon and the identification of additional sporulation genes in Bacillus subtilis. Journal of Molecular Biology. 2003;327:945–972. doi: 10.1016/s0022-2836(03)00205-5. [DOI] [PubMed] [Google Scholar]
Liu JS. Monte Carlo Strategies in Scientific Computing. New York: Springer-Verlag; 2001. [Google Scholar]
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society B. 1995;85:289–300. [Google Scholar]
Tusher VG, Tibshirani R, Chu G. Significance Analysis of Microarrays Applied to the Ionizing Radiation Response. Proceedings of the National Academy of Sciences USA. 2001;98:5116–5121. doi: 10.1073/pnas.091062498. [DOI] [PMC free article] [PubMed] [Google Scholar]
Storey JD. A Direct Approach to False Discovery Rates. Journal of the Royal Statistical Society B. 2002;64:479–498. [Google Scholar]
Storey JS, Tibshirani R. SAM Thresholding and False Discovery Rates for Detecting Differential Gene Expression in DNA Microarrays. In: Parmigiani G, Garrett ES, Irizarry RA, Zeger SL, editor. The Analysis of Gene Expression Data: Methods and Software. Springer, NY; 2003. pp. 272–290. [Google Scholar]
Genovese C, Wasserman L. Operating characteristics and extensions of the false discovery rate procedure. Journal of the Royal Statistical Society B. 2002;64:499–518. [Google Scholar]
Genovese C, Wasserman L. Bayesian and frequentist multiple testing. In: Bernardo JM, Bayarri JM, Berger JO, Dawid AP, Heckerman D, Smith AFM, West M, editor. Bayesian Statistics 7. Oxford: Oxford University Press; 2003. pp. 145–162. [Google Scholar]
Conlon EM, Eichenberger P, Liu JS. Determining and analyzing differentially expressed genes from cDNA microarray experiments with complementary designs. Journal of Multivariate Analysis. 2004;90:1–18. [Google Scholar]
Schadt EE, Li C, Ellis B, Wong WH. Feature extraction and normalization algorithms for high-density oligonucleotide gene expression array data. J Cell Biochem Suppl. 2001;37:120–125. doi: 10.1002/jcb.10073. [DOI] [PubMed] [Google Scholar]
Kendall M, Stuart A, Ord JK. Kendall's Advanced Theory of Statistics. 5. London: Charles Griffin; 1992. [Google Scholar]
The BUGS Project http://www.mrc-bsu.cam.ac.uk/bugs

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional File 1

Supplemental Figures. containing Figures for the Sensitivity Analysis Results.

Click here for file^{(137.9KB, pdf)}

Additional File 2

WinBUGS code for the SEI model. containing the WinBUGS code for the SEI model.

Click here for file^{(3.5KB, txt)}

Additional File 3

WinBUGS code for the PI model. containing the WinBUGS code for the PI model.

Click here for file^{(3.3KB, txt)}

[B1] Wang J, Coombes KR, Highsmith WE, Keating MJ, Abruzzo LV. Differences in gene expression between B-cell chronic lymphocytic leukemia and normal B cells: a meta-analysis of three microarray studies. Bioinformatics. 2004;20:3166–3178. doi: 10.1093/bioinformatics/bth381. [DOI] [PubMed] [Google Scholar]

[B2] Choi JK, Yu U, Kim S, Yoo OJ. Combining multiple microarray studies and modeling inter-study variation. Bioinformatics. 2003. pp. i84–i90. [DOI] [PubMed]

[B3] Stevens JR, Doerge RW. Combining Affymetrix microarray results. BMC Bioinformatics. 2005;6:57. doi: 10.1186/1471-2105-6-57. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4] Hu P, Greenwood CMT, Beyene J. Integrative analysis of multiple gene expression profiles with quality-adjusted effect size models. BMC Bioinformatics. 2005;6:128. doi: 10.1186/1471-2105-6-128. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] Morris JS, Yin G, Baggerly KA, Wu C, Zhang L. Pooling information across different studies and oligonucleotide microarray chip types to identify prognostic genes for lung cancer. In: Shoemaker JS, Lin SM, editor. Methods of Microarray Data Analysis IV. New York: Springer-Verlag; 2005. pp. 51–66. [Google Scholar]

[B6] Park T, Yi SG, Shin YK, Lee S. Combining multiple microarrays in the presence of controlling variables. Bioinformatics. 2006;22:1682–1689. doi: 10.1093/bioinformatics/btl183. [DOI] [PubMed] [Google Scholar]

[B7] Rhodes DR, Barrette TR, Rubin MA, Ghosh D, Chinnaiyan AM. Meta-analysis of microarrays: inter-study validation of gene expression profiles reveals pathway dysregulation in prostate cancer. Cancer Research. 2002;62:4427–4433. [PubMed] [Google Scholar]

[B8] Rhodes DR, Yu J, Shanker K, Deshpande N, Varambally R, Ghosh D, Barrette T, Pandey A, Chinnaiyan AM. Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. Proc Natl AcadSci USA. 2004;101:9309–9314. doi: 10.1073/pnas.0401994101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] Parmigiani G, Garrett-Mayer ES, Anbazhagan R, Gabrielson E. A cross-study comparison of gene expression studies for the molecular classification of lung cancer. Clinical Cancer Research. 2004;10:2922–2927. doi: 10.1158/1078-0432.ccr-03-0490. [DOI] [PubMed] [Google Scholar]

[B10] Shen R, Ghosh D, Chinnaiyan AM. Prognostic meta-signature of breast cancer developed by two-stage mixture modeling of microarray data. BMC Genomics. 2004;5:94. doi: 10.1186/1471-2164-5-94. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] Xu L, Tan AC, Naiman DQ, Geman D, Winslow RL. Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data. Bioinformatics. 2005;21:3905–3911. doi: 10.1093/bioinformatics/bti647. [DOI] [PubMed] [Google Scholar]

[B12] Warnat P, Eils R, Brors B. Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes. BMC Bioinformatics. 2005;6:265. doi: 10.1186/1471-2105-6-265. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B13] Jiang H, Deng Y, Chen H, Tao L, Sha Q, Chen J, Tsai C, Zhang S. Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes. BMC Bioinformatics. 2004;5:81. doi: 10.1186/1471-2105-5-81. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14] Ghosh D, Barette TR, Rhodes D, Chinnaiyan AM. Statistical issues and methods for meta-analysis of microarray data: a case study in prostate cancer. Functional & Integrative Genomics. 2003;3:180–188. doi: 10.1007/s10142-003-0087-5. [DOI] [PubMed] [Google Scholar]

[B15] Conlon EM, Song JJ, Liu JS. Bayesian models for pooling microarray studies with multiple sources of replications. BMC Bioinformatics. 2006;7:247. doi: 10.1186/1471-2105-7-247. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] Baldi P, Long AD. A Bayesian framework for the analysis of microarray expression data: reguralized t-test and statistical inferences of gene changes. Bioinformatics. 2001;17:509–519. doi: 10.1093/bioinformatics/17.6.509. [DOI] [PubMed] [Google Scholar]

[B17] Tseng GC, Oh MK, Rohlin L, Liao JC, Wong WH. Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects. Nucleic Acids Res. 2001;29:2549–2557. doi: 10.1093/nar/29.12.2549. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] Townsend JP, Hartl DL. Bayesian analysis of gene expression levels: statistical quantification of relative mRNA level across multiple treatments or samples. Genome Biology. 2002;3 doi: 10.1186/gb-2002-3-12-research0071. research0071.1-71.16. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B19] Efron B, Tibshirani R, Storey JD, Tusher VG. Empirical Bayes Analysis of a Microarray Experiment. Journal of the American Statistical Association. 2001;96:1151–1160. [Google Scholar]

[B20] Newton MA, Kendziorski CM, Richmond CS, Blattner FR, Tsui KW. On Differential Variability of Expression Ratios: Improving Statistical Inference About Gene Expression Changes From Microarray Data. Journal of Computational Biology. 2001;8:37–52. doi: 10.1089/106652701300099074. [DOI] [PubMed] [Google Scholar]

[B21] Ibrahim JG, Chen M-H, Gray RJ. Bayesian Models for Gene Expression With DNA Microarray Data. Journal of the American Statistical Association. 2002;97:88–99. [Google Scholar]

[B22] Broët P, Richardson S, Radvanyi F. Bayesian hierarchical model for identifying changes in gene expression from microarray experiments. Journal of'Computational Biology. 2002;9:671–683. doi: 10.1089/106652702760277381. [DOI] [PubMed] [Google Scholar]

[B23] Gottardo R, Pannucci JA, Kuske CR, Brettin T. Statistical analysis of microarray data: a Bayesian approach. Biostatistics. 2003;4:597–620. doi: 10.1093/biostatistics/4.4.597. [DOI] [PubMed] [Google Scholar]

[B24] Lönnstedt I, Speed TP. Replicated microarray data. Statistica Sinica. 2002;12:31–46. [Google Scholar]

[B25] Pan W. A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments. Bioinformatics. 2002;18:546–554. doi: 10.1093/bioinformatics/18.4.546. [DOI] [PubMed] [Google Scholar]

[B26] Kendziorski CM, Newton MA, Lan H, Gould MN. On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles. Statistics in Medicine. 2003;22:3899–3914. doi: 10.1002/sim.1548. [DOI] [PubMed] [Google Scholar]

[B27] Newton MA, Noueiry A, Sarkar D, Ahlquist P. Detecting differential gene expression with a semiparametric hierarchical mixture method. Biostatistics. 2004;5:155–176. doi: 10.1093/biostatistics/5.2.155. [DOI] [PubMed] [Google Scholar]

[B28] Do KA, Müller P, Tang F. A Bayesian mixture model for differential gene expression. Journal of the Royal Statistical Society C. 2005;54:627–644. [Google Scholar]

[B29] Ishwaran H, Rao JS. Detecting Differentially Expressed Genes in Microarrays Using Bayesian Model Selection. Journal of the American Statistical Association. 2003;98:438–55. [Google Scholar]

[B30] Ishwaran H, Rao JS. Spike and Slab Gene Selection for Multipgroup Microarray Data. Journal of the American Statistical Association. 2005;100:764–780. [Google Scholar]

[B31] Stangl DK, Berry DA. Meta-analysis: past and present challenges. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 1–28. [Google Scholar]

[B32] Tweedie RL, Scott DJ, Biggerstaff BJ, Mengersen KL. Bayesian meta-analysis, with application to studies of ETS and lung cancer. Lung Cancer. 1996;14:S171–S194. doi: 10.1016/s0169-5002(96)90222-6. [DOI] [PubMed] [Google Scholar]

[B33] DuMouchel WH, Harris JE. Bayes methods for combining the results of cancer studies in humans and other species. Journal of the American Statistical Association. 1983;78:293–315. [Google Scholar]

[B34] Smith TC, Spiegelhalter DJ, Thomas A. Bayesian approaches to random-effects meta-analysis: a comparative study. Stat Med. 1995;14:2685–2699. doi: 10.1002/sim.4780142408. [DOI] [PubMed] [Google Scholar]

[B35] Normand SL. Meta-analysis: formulating, evaluating, combining, and reporting. Stat Med. 1999;18:321–359. doi: 10.1002/(sici)1097-0258(19990215)18:3<321::aid-sim28>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]

[B36] DuMouchel W, Normand SL. Computer-modeling and graphical strategies for meta-analysis. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 127–178. [Google Scholar]

[B37] Pauler DK, Wakefield J. Modeling and implementation issues in Bayesian meta-analysis. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 205–230. [Google Scholar]

[B38] Sargent DJ, Zee BC, Milan C, Torri V, Francini G. Meta-analysis of individual-patient survival data using random-effect models. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 255–275. [Google Scholar]

[B39] Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian Data Analysis. 2. New York: Chapman & Hall; 2003. [Google Scholar]

[B40] Kuo WP, Jenssen TK, Butte AJ, Ohno-Machado L, Kohane IS. Analysis of matched mRNA measurements from two different microarray technologies. Bioinformatics. 2002;18:405–412. doi: 10.1093/bioinformatics/18.3.405. [DOI] [PubMed] [Google Scholar]

[B41] Jarvinen AK, Hautaniemi S, Edgren H, Auvinen P, Saarela J, Kallioniemi OP, Monni O. Are data from different gene expression microarray platforms comparable? Genomics. 2004;83:1164–1168. doi: 10.1016/j.ygeno.2004.01.004. [DOI] [PubMed] [Google Scholar]

[B42] Mah N, Thelin A, Lu T, Nikolaus S, Kuhbacher T, Gurbuz Y, Eickhoff H, Kloppel G, Lehrach H, Mellgard B, Costello CM, Schreiber S. A comparison of oligonucleotide and cDNA-based microarray systems. Physiol Genomics. 2004;16:361–370. doi: 10.1152/physiolgenomics.00080.2003. [DOI] [PubMed] [Google Scholar]

[B43] Hedges LV, Olkin I. Statistical Methods for Meta-Analysis. Orlando: Academic Press; 1985. [Google Scholar]

[B44] Dudoit S, Yang YH, Callow MJ, Speed TP. Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Statistica Sinica. 2002;12:111–139. [Google Scholar]

[B45] Dominici F, Parmigiani G. Combining studies with continuous and dichotomous responses: a latent-variables approach. In: Stangl DK, Berry DA, editor. Meta-Analysis in Medicine and Health Policy. New York: Marcel Dekker; 2000. pp. 105–125. [Google Scholar]

[B46] Lockhart DJ, Winzeler EA. Genomics, gene expression and DNA arrays. Nature. 2000;405:827–836. doi: 10.1038/35015701. [DOI] [PubMed] [Google Scholar]

[B47] Wu TD. Analyzing gene expression data from DNA microarrays to identify candidate genes. Journal of Pathology. 2001;195:53–65. doi: 10.1002/1096-9896(200109)195:1<53::AID-PATH891>3.0.CO;2-H. [DOI] [PubMed] [Google Scholar]

[B48] Hardiman G. Microarray technologies – an overview. Pharmacogenomics. 2002;3:293–297. doi: 10.1517/14622416.3.3.293. [DOI] [PubMed] [Google Scholar]

[B49] Southern EM. DNA microarrays. History and overview. Methods Mol Biol. 2000;170:1–15. doi: 10.1385/1-59259-234-1:1. [DOI] [PubMed] [Google Scholar]

[B50] Eichenberger P, Jensen ST, Conlon EM, van Ooij C, Silvaggi J, Gonzalez-Pastor JE, Fujita M, Ben-Yehuda S, Stragier P, Liu JS, Losick R. The sigmaE regulon and the identification of additional sporulation genes in Bacillus subtilis. Journal of Molecular Biology. 2003;327:945–972. doi: 10.1016/s0022-2836(03)00205-5. [DOI] [PubMed] [Google Scholar]

[B51] Liu JS. Monte Carlo Strategies in Scientific Computing. New York: Springer-Verlag; 2001. [Google Scholar]

[B52] Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society B. 1995;85:289–300. [Google Scholar]

[B53] Tusher VG, Tibshirani R, Chu G. Significance Analysis of Microarrays Applied to the Ionizing Radiation Response. Proceedings of the National Academy of Sciences USA. 2001;98:5116–5121. doi: 10.1073/pnas.091062498. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B54] Storey JD. A Direct Approach to False Discovery Rates. Journal of the Royal Statistical Society B. 2002;64:479–498. [Google Scholar]

[B55] Storey JS, Tibshirani R. SAM Thresholding and False Discovery Rates for Detecting Differential Gene Expression in DNA Microarrays. In: Parmigiani G, Garrett ES, Irizarry RA, Zeger SL, editor. The Analysis of Gene Expression Data: Methods and Software. Springer, NY; 2003. pp. 272–290. [Google Scholar]

[B56] Genovese C, Wasserman L. Operating characteristics and extensions of the false discovery rate procedure. Journal of the Royal Statistical Society B. 2002;64:499–518. [Google Scholar]

[B57] Genovese C, Wasserman L. Bayesian and frequentist multiple testing. In: Bernardo JM, Bayarri JM, Berger JO, Dawid AP, Heckerman D, Smith AFM, West M, editor. Bayesian Statistics 7. Oxford: Oxford University Press; 2003. pp. 145–162. [Google Scholar]

[B58] Conlon EM, Eichenberger P, Liu JS. Determining and analyzing differentially expressed genes from cDNA microarray experiments with complementary designs. Journal of Multivariate Analysis. 2004;90:1–18. [Google Scholar]

[B59] Schadt EE, Li C, Ellis B, Wong WH. Feature extraction and normalization algorithms for high-density oligonucleotide gene expression array data. J Cell Biochem Suppl. 2001;37:120–125. doi: 10.1002/jcb.10073. [DOI] [PubMed] [Google Scholar]

[B60] Kendall M, Stuart A, Ord JK. Kendall's Advanced Theory of Statistics. 5. London: Charles Griffin; 1992. [Google Scholar]

[B61] The BUGS Project http://www.mrc-bsu.cam.ac.uk/bugs

PERMALINK

Bayesian meta-analysis models for microarray data: a comparative study

Erin M Conlon

Joon J Song

Anna Liu

Abstract

Background

Results

Conclusion

Background

Background on effect sizes and standardized gene expression levels

Results and discussion

Bayesian standardized expression integration meta-analysis model

Bayesian probability integration meta-analysis model

Integration-driven discovery and revision

Bayesian false discovery rate

Two-study simulation results

Bayesian standardized expression integration model: modeling inter-study variation

Standardized expression integration versus probability integration model

Figure 1.

Figure 2.

Table 1.

Figure 3.

Five-study simulation results

Figure 4.

Table 2.

Biological data results

Figure 5.

Table 3.

Sensitivity analysis

Table 4.

Table 5.

Conclusion

Methods

Two study simulation data

Simulation data for five studies

Biological data

Mutant study

Induction study

Standardized expression integration model: prior distributions for inter-study variation

Normal versus t-distribution modeling of the study-specific mean values

Markov chain Monte Carlo implementation

Joint posterior distributions

Prior distributions

Full conditional posterior distributions

WinBUGS running time

Table 6.

Availability and requirements

Authors' contributions

Supplementary Material

Acknowledgments

Acknowledgements

Contributor Information

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases