Skip to main content
Advances and Applications in Bioinformatics and Chemistry : AABC logoLink to Advances and Applications in Bioinformatics and Chemistry : AABC
. 2008 Jul 1;1:13–18. doi: 10.2147/aabc.s3624

Identification of significant genes in genomics using Bayesian variable selection methods

Eugene Lin 1, Lung-Cheng Huang 2,3,
PMCID: PMC3169938  PMID: 21918603

Abstract

In the studies of genomics, it is essential to select a small number of genes that are more significant than the others for research ranging from candidate gene studies to genome-wide association studies. In this study, we proposed a Bayesian method for identifying the promising candidate genes that are significantly more influential than the others. We employed the framework of variable selection and a Gibbs sampling based technique to identify significant genes. The proposed approach was applied to a genomics study for persons with chronic fatigue syndrome. Our studies show that the proposed Bayesian methodology is effective for deriving models for genomic studies and for providing information on significant genes.

Keywords: Bayesian variable selection, genomics, Gibbs sampling, variable selection

Introduction

In the studies of genomics, the problem of identifying significant genes remains a challenge for researchers. Single nucleotide polymorphisms (SNPs) can be used in clinical association studies to determine the contribution of genes to disease susceptibility or drug efficacy. By using candidate gene approaches or genome-wide association studies, the key goal is to find responsible genes and SNPs for certain events (for example, certain diseases or certain drug efficacy). It is vital to select a small number of SNPs that are more significant than the others and ignoring the SNPs of lesser significance, thereby allowing researchers to focus on the most promising candidate genes and SNPs for diagnostics and therapeutics (Lee et al 2003; Lin et al 2007a). As we have 2p models with p SNPs, exhaustive computation over this model space is not feasible when the model space is very large (Lee et al 2003).

A variety of Bayesian variable selection methods based on Markov chain Monte Carlo (MCMC) approaches have been proposed for variable selection including the stochastic search variable selection (SSVS) of George and McCulloch (1993), the unconditional priors (UP) approach of Kuo and Mallick (1998), and the Gibbs variable selection (GVS) by Dellaportas and colleagues (2000, 2002). These three Bayesian variable selection methods utilize one particular MCMC method, the Gibbs sampler. The SSVS method has been applied to the identification of quantitative trait loci (QTL) and treats mapping QTL as a problem of model determination and variable selection (Yi et al 2003). Lee and colleagues (2003) applied the SSVS method to the problem of gene selection with microarray data for discovering significant disease genes on breast tumors. Similarly, Oh and colleagues (2003) utilized the SSVS method to identify the markers that are associated with the disease genes related to a high rate of increase in cholesterol. Furthermore, the SSVS approach was extended to the multivariate regression model in the multivariate Bayesian variable selection method (Brown et al 1998). Sha and colleagues (2004) used the multivariate Bayesian variable selection method to classify disease stages in microarray data. In addition, Swartz and colleagues (2006, 2007a) utilized the SSVS method with a conditional logistic regression likelihood to identify genetic loci relevant to a disease using case-parent triads. Oh (2007) also coupled the SSVS method with the new Haseman-Elston method to perform linkage analysis in rheumatoid arthritis. Similarly, Kwon and colleagues (2007) applied an iterative SSVS method to find SNPs that are associated with rheumatoid arthritis.

The rest of the paper is organized as follows. First, we propose a Bayesian-based methodology to identify the promising candidate genes that are significantly more influential than the others. Secondly, we evaluate and compare the proposed methods using a real dataset in a candidate gene study. Finally, we present the discussion and provide the conclusion.

Methods

Population

The study population was original to the previous study by the CDC Chronic Fatigue Syndrome Research Group. More information is available on the website (http://www.camda.duke.edu/camda06/datasets/index.html). In the present study, we only focused on the 42 SNPs as described in Table 1. There were 109 subjects, including 55 subjects having had experienced chronic fatigue syndrome (CFS) and 54 nonfatigued controls. In this analysis, we employed the 71 subjects, including 35 CFS subjects and 36 nonfatigue subjects, without any missing SNP values.

Table 1.

A panel of 42 SNPs by the CDC Chronic Fatigue Syndrome Research Group.

Gene SNPs
POMC rs12473543
TH rs4074905, rs2070762
MAOA rs1801291, rs979606, rs979605
MAOB rs3027452, rs2283729, rs1799836
TPH2 rs2171363, rs4760816, rs4760750, rs1386486, rs1487280, rs1872824, rs10784941
COMT rs4646312, rs740603, rs6269, rs4633, rs165722, rs933271, rs5993882
NR3C1 rs2918419, rs1866388, rs860458, rs852977, rs6196, rs6188, rs258750
SLC6A4 rs2066713, hCV7911132, rs140701
CRHR1 rs110402, rs1396862, rs242940, rs173365, rs242924, rs7209436
CRHR2 rs2267710, rs2267714, rs2284217

Gibbs variable selection

Assume that we observe p SNPs along the genome. Among the p SNPs, some may be tightly linked with large effects, and others may have only weak effects. Our aim is to identify a small number of SNPs that have the greatest discriminating power.

We consider binary responses as Yi = 1 indicates that the subject has a certain disease and Yi = 0 indicates that the subject is a control, for i = 1, ..., n. The observed phenotypic value Yj can be described by the linear model as follows (Dellaportas et al 2000; Oh et al 2003):

Yi=β0+j=1pγjXjβj+ɛ, (1)

where Xj is the design matrix, βj the parameter vector related to the j th term, and ɛ ~ N(0,σ2). In GVS, a set of binary indicator variables γj (j = 1, …, p), where γj = 1 or 0 represents the presence or absence of covariate j in the model, respectively.

The prior for (γ, β ) is specified as f (γ, β ) = f (γ ) f ( β |γ ). Furthermore, β can be partitioned into two vectors βγ and β corresponding to those components of β that are included (γj = 1) or not included (γj = 0) in the model. Then, the prior f (β |γ ) may be partitioned into model prior f (βγ |γ ) and pseudoprior f (β |βγ,γ ).

The sampling procedure is summarized by the following three steps (Dellaportas et al 2000):

  1. We sample the parameters included in the model by the posterior

    f(βγβ\γ,γ,y)f(yβ,γ)f(βγγ)f(β\γβγ,γ), (2)

    where y denotes the observed data.

  2. We sample the parameters excluded from the model from the pseudoprior

    f(β\γβγ,γ,y)f(β\γβγ,γ). (3)
  3. We sample each variable indicator j from a Bernoulli distribution with success probability Oj /(1 + Oj); where Oj is given by

    Oj=f(yβ,γj=1,γ\j)f(βγj=1,γ\j)f(γ=1,γ\j)f(yβ,γj=0,γ\j)f(βγj=0,γ\j)f(γ=0,γ\j), (4)

    where γ \ j denotes all terms of γ except γ j.

For the simplest approach, it is assumed that the prior βj depends only on γj and is given by

f(βjγj)=(1-γj)f(βjγj=0)+γjf(βjγj=1). (5)

The simplified prior (5) results in the following full conditional posterior distribution

f(βjγ,β\j,y){f(yγ,β)f(βjγj=1)γj=1f(βjγj=0)γj=0. (6)

A mixture of Normal distribution is used for model parameters as follows:

f(βjγj=1)N(0,Σj)and f(βjγj=0)N(μ¯j,Sj), (7)

where μ̄j, Sj are the mean and variance respectively in the corresponding pseudoprior distributions and ∑j is the prior variance, when the j term is included in the model.

Stochastic search variable selection and unconditional priors

In summary, we present the similarities and differences between the three Bayesian variable selection methods including GVS, SSVS, and UP as follows. In the SSVS strategy, unlike GVS and UP, variables corresponding to γj = 0 are included in the model as follows (George and McCulloch 1993; Oh et al 2003):

Yi=β0+j=1pXjβj+ɛ. (8)

And in the SSVS strategy, βj parameters are constrained to be close to zero when γj = 0 (George and McCulloch 1993). In this situation, f (y| β,γ ) is independent of γ. Thus, the first term on the right hand side of (4) can be omitted as follows:

Oj=f(βγj=1,γ\j)f(γ=1,γ\j)f(βγj=0,γ\j)f(γ=0,γ\j). (9)

In the UP approach, a prior distribution for (γ,β ) is specified with β independent of γ (Kuo and Mallick 1998). Then, the second term on the right hand side of (4) is absent as follows:

Oj=f(yβ,γj=1,γ\j)f(γ=1,γ\j)f(yβ,γj=0,γ\j)f(γ=0,γ\j). (10)

Two-stage Bayesian variable selection methodology

In this study, we propose a two-stage selection methodology based on GVS. In Stage I, we conduct GVS on all potential variables (that is, genetic markers) and calculate the estimated posterior probabilities for all potential variables. After ranking the variables according to the posterior probabilities, we then select a subset of N variables with top main effects based on the estimated posterior probabilities. That is, we identify the top N candidate genetic markers in Stage I.

In Stage II, we perform GVS again only on the N variables selected in Stage I and rank the selected N variables according to the estimated posterior probabilities. Next, we choose a small subset of M variables with top main effects based on the sorted posterior probabilities. That is, we identify the top M candidate genetic markers as a panel of significant genetic markers in Stage II.

Similarly, we can utilize SSVS or UP with the above two-stage selection methodology.

OpenBUGS software

The proposed Bayesian methodology can be implemented using the OpenBUGS software (Thomas et al 2006). The implementation involves the definition with a likelihood of the model f (y| β,γ ) and the specification of the prior distributions f ( β,γ )and f (γ )using OpenBUGS (Ntzoufras 2002). The posterior probabilities are calculated using OpenBUGS and can be monitored using the command “summaryStats” in the OpenBUGS environment (Ntzoufras 2002).

When no restrictions on the model space are imposed, a common prior for the indicator variables γj is f (γj) = Bernoulli(1/2) (Ntzoufras 2002). According to George and McCulloch (1993, 1997), the Gibbs sampler should begin with all γj = 1, which corresponds to starting with the full model. A selection of μ̄j = 0 and Sj = ∑j /k2 with k =10 has been proven to be an adequate choice (Ntzoufras 2002). The pseudoprior parameters μ̄j, Sj, and k are only relevant to the behavior of the MCMC chain and do not affect the posterior distribution (Ntzoufras 2002). Dellaportas and colleagues (2000) suggested that the Gibbs sampler is run for 100,000 iterations for GVS, 500,000 iterations for SSVS, and 500,000 iterations for UP, respectively, after discarding the first 10,000 iterations for the burn-in period.

Results

We applied the proposed Bayesian strategy to the published dataset in CFS as described previously for discovering significant genes.

First, we calculated the estimated marginal posterior probabilities based on GVS, SSVS, and UP for all the potential SNPs by using OpenBUGS. Figure 1 shows the results of the estimated marginal posterior probabilities in Stage I. Then, we ranked the SNPs according to the estimated marginal posterior probabilities and selected ten SNPs with top main effects. Table 2 summarizes the top ten SNPs based on the calculated marginal posterior probabilities in Stage I. The results in Figure 1 were based on 100,000 iterations for GVS, 500,000 iterations for SSVS, and 500,000 iterations for UP, respectively. For all methods, we discarded 10,000 iterations as a burn-in period. All three methods provided similar marginal posterior probabilities in Stage I. As shown in Table 2, the top ten SNPs in GVS were the same as the ones in UP, although the ranking in GVS was slightly different from the one in UP. And there were two different SNPs between GVS and SSVS among the top ten SNPs.

Figure 1.

Figure 1

Table 2.

In Stage I, the top ten SNPs based on their marginal posterior probabilities using the OpenBUGS software for three Bayesian variable selection methods including the Gibbs variable selection (GVS), the Stochastic Search Variable Selection (SSVS), the unconditional priors (UP) approach.

Posterior Ranking GVS SSVS UP
1 hCV7911132 hCV7911132 hCV7911132
2 rs173365 rs1866388 rs173365
3 rs2070762 rs2070762 rs2070762
4 rs6196 rs173365 rs6196
5 rs1866388 rs1801291 rs1866388
6 rs140701 rs6196 rs2267710
7 rs2267710 rs2267710 rs140701
8 rs1801291 rs852977 rs1801291
9 rs852977 rs933271 rs852977
10 rs1396862 rs165722 rs1396862

Secondly, we calculated the estimated marginal posterior probabilities based on GVS, SSVS, and UP again for the selected ten SNPs on the first run. Then, we ranked the SNPs according to the estimated marginal posterior probabilities and selected five SNPs with top main effects as a panel of significant SNPs. Table 3 shows the top five SNPs based on the calculated marginal posterior probabilities in Stage II. The results in Table 3 were based on 100,000 iterations for GVS, 500,000 iterations for SSVS, and 500,000 iterations for UP, respectively. For all methods, we discarded 10,000 iterations as a burn-in period. All three methods provided similar marginal posterior probabilities in Stage II. Furthermore, all three methods selected the same top five SNPs, although the ranking in GVS was different from the one in SSVS and was the same as the one in UP.

Table 3.

In Stage II, the top five SNPs based on their marginal posterior probabilities using the OpenBUGS software for three Bayesian variable selection methods including the Gibbs variable selection (GVS), the Stochastic Search Variable Selection (SSVS), the unconditional priors (UP) approach.

Posterior Ranking GVS SSVS UP
1 hCV7911132 hCV7911132 hCV7911132
2 rs173365 rs2070762 rs173365
3 rs2070762 rs1866388 rs2070762
4 rs6196 rs173365 rs6196
5 rs1866388 rs6196 rs1866388

For all three methods, the OpenBUGS programs were run on a 2.4 GHz processor. The CPU execution time in Stage I was approximately 139 minutes for GVS, 45 minutes for SSVS, and 703 minutes for UP, respectively. Furthermore, the CPU execution time in Stage II was approximately 9.8 minutes for GVS, 4 minutes for SSVS, and 60 minutes for UP, respectively. Based on the above CPU execution time, SSVS seemed to be most efficient among these three methods.

Discussion

We have developed a Bayesian-based methodology to address the problem of identifying genetic markers such as SNPs and genes that are more significant than the others. This problem occurs frequently in genomic and epidemiologic studies ranging from candidate gene studies to high-density genome scans. Our method treats the mapping of genetic markers as a problem of model determination and variable selection. Variable selection approaches for gene mapping include Bayesian methods and frequentist methods (Swartz et al 2007b). Several reports compared Bayesian methods to frequentist methods and found that the Bayesian methods may provide fewer false positives (Swartz et al 2007b). Because the dimensionality is kept constant across all possible models, the Bayesian-based methodology can be easily implemented via the Gibbs sampler (Dellaportas et al 2000). The Bayesian procedure can even be implemented using the publicly available software OpenBUGS (Thomas et al 2006) and thus can be widely used in genomic studies.

As shown in the MCMC results, we compared three Bayesian variable selection strategies including GVS, SSVS, and UP. The proposed SSVS method was shown to be more efficient than two other methods for discovering significant genes under typical situations of a genomics study. These three methods are all based on the Gibbs sampler. Compared with the reversible-jump MCMC, the Gibbs sampling approach has advantages on simplicity of computation and diagnosis of convergence (George and McCulloch 1997). Another major advantage is that these methods can be easily applied with the Gibbs sampling software such as OpenBUGS (Dellaportas et al 2000). The UP approach is extremely easy to implement, but may be insufficiently flexible for many practical problems (Dellaportas et al 2000). In the cases of hundreds of genetic markers, a second iteration of SSVS might be considered with a subset of variables based on the first run (George and McCulloch 1993). Accuracy of estimating the main effects and the posterior probabilities may be improved by using this two-stage strategy (Yi et al 2003). Similarly, our proposed two-stage Bayesian method may have better accuracy by conducting a second run with a reduced set of genetic markers based on the first run. Moreover, Beattie and colleagues (2002) proposed a two-stage Bayesian variable selection strategy that incorporates the SSVS method with the intrinsic Bayes factor (IBF). In the first stage, the SSVS procedure is employed on all factors. Then, in the second stage, the factors identified in the first stage are used as the input for the IBF analysis. The difference between our proposed two-stage Bayesian method and theirs was that only Bayesian variable selection strategies such as GVS, SSVS, and UP were used for both stages in our study.

To the best of our knowledge, this is the first study that proposes to use the Bayesian-based approach to provide a way to find a panel of genetic markers that is more significant than the others in CFS. It has been reported that subjects with CFS were distinguished by genetic markers that were involved in either hypothalanmic-pituitary-adrenal (HPA) axis function or neurotransmitter systems, including monoamine oxidase A (MAOA), monoamine oxidase B (MAOB), nuclear receptor subfamily 3; group C, member 1 glucocorticoid receptor (NR3C1), proopiomelanocortin (POMC) and tryptophan hydroxylase 2 (TPH2) genes (Smith et al 2006). Moreover, it has been shown that genetic markers, including catechol-O-methyltransferase (COMT), NR3C1 and TPH2 genes, could predict whether a person has CFS (Geortzel et al 2006). In this study, we identified significant SNPs in solute carrier family 6 member 4 (SLC6A4), corticotropin releasing hormone receptor 1 (CRHR1), tyrosine hydroxylase (TH), and NR3C1 genes. An interesting finding was that an association of NR3C1 with CFS compared with nonfatigued controls appeared to be consistent across several studies. Thus, this significant association strongly suggests that NR3C1 may be involved in biological mechanisms with CFS. However, these two previous studies (Smith et al 2006; Geortzel et al 2006) identified the TPH2 gene among the reported associations, which was not included in this study. The potential reason for the discrepancies between the results of this study and those of other studies may be the sample sizes. The studies conducted on small populations may have biased a particular result. Future research with independent replication in large sample sizes is needed to confirm the role of the candidate genes identified in this study.

In this study, we focused the context of this paper being a candidate gene approach. In future research, we will investigate the identifiability (Gelfand and Sahu 1999) of the proposed method and explore the possibility of extension to larger scale problems such as genome-wide association studies, where thousands of SNPs for a chromosome scan are examined. Moreover, the proposed Bayesian-based methodology was employed for modeling genetic markers associated with diseases and may be suitable for association studies in pharmacogenomics. In the studies of pharmacogenomics, genetic markers such as SNPs can be used to understand the relationship between genetic inheritance and the body’s response to drugs (Lin et al 2006a, 2006b). In future work, we aim to investigate the Bayesian-based methodology for application in pharmacogenomics.

Furthermore, we focused on the issue of selecting the significant genes without considering epistatic models in this study. Epistasis analysis for gene-gene and gene-environment interactions have been advocated for deciphering these complex mechanisms, particularly when each involved genetic marker only demonstrates a minor marginal effect (Lin et al 2007a, 2007b). It is important to address gene–gene and gene–environment interactions for describing a trait involving complex disease-related, pharmacokinetic and pharmacodynamic mechanisms. In future work, we will investigate gene–gene and gene–environment interactions based on the Bayesian variable selection strategies.

Conclusion

In this study, we propose an alternative Bayesian method for assessing significant genes in genomic studies. Our method is based on the Bayesian variable selection methods. Our findings suggest that our approach may provide a plausible way to identify a panel of genetic markers that is more significant than the others. Over the next few years, the results of our studies could be utilized to develop molecular diagnostic/prognostic tools. However, application of genomics in routine clinical practice will become a reality after a prospective clinical trial has been conducted to validate genetic markers.

Acknowledgments

The authors extend their sincere thanks to Vita Genomics, Inc. for funding this research. The authors would also like to thank Dr. Charles Wang for helpful suggestions and thank the anonymous reviewers for their constructive comments, which improved the context and the presentation of this paper.

References

  1. Beattie SD, Fong DKH, Lin DKJ. A two-stage Bayesian model selection strategy for supersaturated designs. Technometrics. 2002;44:55–63. [Google Scholar]
  2. Brown PJ, Vannucci M, Fearn T. Multivariate Bayesian variable selection and prediction. J Roy Stat Soc B. 1998;60:627–41. [Google Scholar]
  3. Dellaportas P, Forster JJ, Ntzoufras I. Bayesian variable selection using the Gibbs sampler. In: Dey DK, Ghosh S, Mallick B, editors. Generalized Linear Models: A Bayesian Perspective. New York: Marcel Dekker; 2000. [Google Scholar]
  4. Dellaportas P, Forster JJ, Ntzoufras I. On Bayesian model and variable selection using MCMC. Statist Comput. 2002;12:27–36. [Google Scholar]
  5. Gelfand AE, Sahu SK. Identifiability, improper priors and Gibbs sampling for generalized linear models. J Am Statist Assoc. 1999;94:247–53. [Google Scholar]
  6. George EI, McCulloch RE. Variable selection via Gibbs sampling. J Am Statist Assoc. 1993;88:881–9. [Google Scholar]
  7. George EI, McCulloch RE. Approaches for Bayesian variable selection. Statistica Sinica. 1997;7:339–74. [Google Scholar]
  8. Goertzel BN, Pennachin C, de Souza Coelho L, et al. Combinations of single nucleotide polymorphisms in neuroendocrine effector and receptor genes predict chronic fatigue syndrome. Pharmacogenomics. 2006;7:475–83. doi: 10.2217/14622416.7.3.475. [DOI] [PubMed] [Google Scholar]
  9. Kuo L, Mallick B. Variable selection for regression models. Sankhya. 1998;B60:65–81. [Google Scholar]
  10. Kwon S, Wang D, Guo X. Application of an iterative Bayesian variable selection method in a genome-wide association study of rheumatoid arthritis. BMC Proc. 2007;1(Suppl 1):S109. doi: 10.1186/1753-6561-1-s1-s109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Lee KE, Sha N, Dougherty ER, et al. Gene selection: a Bayesian variable selection approach. Bioinformatics. 2003;19:90–7. doi: 10.1093/bioinformatics/19.1.90. [DOI] [PubMed] [Google Scholar]
  12. Lin E, Hwang Y, Wang SC, et al. An artificial neural network approach to the drug efficacy of interferon treatments. Pharmacogenomics. 2006a;7:1017–24. doi: 10.2217/14622416.7.7.1017. [DOI] [PubMed] [Google Scholar]
  13. Lin E, Hwang Y, Tzeng CM. A case study of the utility of the HapMap database for pharmacogenomic haplotype analysis in the Taiwanese population. Mol Diagn Ther. 2006b;10:367–70. doi: 10.1007/BF03256213. [DOI] [PubMed] [Google Scholar]
  14. Lin E, Hwang Y, Liang KH, et al. Pattern-recognition techniques with haplotype analysis in pharmacogenomics. Pharmacogenomics. 2007a;8:75–83. doi: 10.2217/14622416.8.1.75. [DOI] [PubMed] [Google Scholar]
  15. Lin E, Hwang Y, Chen EY. Gene-gene and gene-environment interactions in interferon therapy for chronic hepatitis C. Pharmacogenomics. 2007b;8:1327–35. doi: 10.2217/14622416.8.10.1327. [DOI] [PubMed] [Google Scholar]
  16. Ntzoufras I. Gibbs variable selection using BUGS. J Statist Soft. 2002;7(7) [Google Scholar]
  17. Oh C, Ye KQ, He Q, et al. Locating disease genes using Bayesian variable selection with the Haseman-Elston method. BMC Genet. 2003;4(Suppl 1):S69. doi: 10.1186/1471-2156-4-S1-S69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Oh C. A Bayesian genome-wide linkage analysis of quantitative traits for rheumatoid arthritis via perfect sampling. BMC Proc. 2007;1(Suppl 1):S110. doi: 10.1186/1753-6561-1-s1-s110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Sha N, Vannucci M, Tadesse MG, et al. Bayesian variable selection in multinomial probit models to identify molecular signatures of disease stage. Biometrics. 2004;60:812–19. doi: 10.1111/j.0006-341X.2004.00233.x. [DOI] [PubMed] [Google Scholar]
  20. Smith AK, White PD, Aslakson E, et al. Polymorphisms in genes regulating the HPA axis associated with empirically delineated classes of unexplained chronic fatigue. Pharmacogenomics. 2006;7:387–94. doi: 10.2217/14622416.7.3.387. [DOI] [PubMed] [Google Scholar]
  21. Swartz MD, Kimmel M, Mueller P, et al. Stochastic search gene suggestion: a Bayesian hierarchical model for gene mapping. Biometrics. 2006;62:495–503. doi: 10.1111/j.1541-0420.2005.00451.x. [DOI] [PubMed] [Google Scholar]
  22. Swartz MD, Shete S. The null distribution of stochastic search gene suggestion: a Bayesian approach to gene mapping. BMC Proc. 2007a;1(Suppl 1):S113. doi: 10.1186/1753-6561-1-s1-s113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Swartz MD, Thomas DC, Daw EW, et al. Model selection and Bayesian methods in statistical genetics: summary of group 11 contributions to Genetic Analysis Workshop 15. Genet Epidemiol. 2007b;31(Suppl 1):S96–102. doi: 10.1002/gepi.20285. [DOI] [PubMed] [Google Scholar]
  24. Thomas A, O’Hara B, Ligges U, et al. Making BUGS open. R News. 2006;6:12–17. [Google Scholar]
  25. Yi N, George V, Allison DB. Stochastic search variable selection for identifying multiple quantitative trait loci. Genetics. 2003;164:1129–38. doi: 10.1093/genetics/164.3.1129. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Advances and applications in bioinformatics and chemistry : AABC are provided here courtesy of Dove Press

RESOURCES