Genomic Prediction from Multiple-Trait Bayesian Regression Methods Using Mixture Priors

Hao Cheng; Kadir Kizilkaya; Jian Zeng; Dorian Garrick; Rohan Fernando

doi:10.1534/genetics.118.300650

. 2018 Mar 7;209(1):89–103. doi: 10.1534/genetics.118.300650

Genomic Prediction from Multiple-Trait Bayesian Regression Methods Using Mixture Priors

Hao Cheng ^*,¹, Kadir Kizilkaya ^†, Jian Zeng ^‡, Dorian Garrick ^§, Rohan Fernando ^**

PMCID: PMC5937171 PMID: 29514861

Abstract

Bayesian multiple-regression methods incorporating different mixture priors for marker effects are used widely in genomic prediction. Improvement in prediction accuracies from using those methods, such as BayesB, BayesC, and BayesCπ, have been shown in single-trait analyses with both simulated and real data. These methods have been extended to multi-trait analyses, but only under the restrictive assumption that a locus simultaneously affects all the traits or none of them. This assumption is not biologically meaningful, especially in multi-trait analyses involving many traits. In this paper, we develop and implement a more general multi-trait BayesC $Π$ and BayesB methods allowing a broader range of mixture priors. Our methods allow a locus to affect any combination of traits, e.g., in a 5-trait analysis, the “restrictive” model only allows two situations, whereas ours allow all 32 situations. Further, we compare our methods to single-trait methods and the “restrictive” multi-trait formulation using real and simulated data. In the real data analysis, higher prediction accuracies were observed from both our new broad-based multi-trait methods and the “restrictive” formulation. The broad-based and restrictive multi-trait methods showed similar prediction accuracies. In the simulated data analysis, higher prediction accuracies to the “restrictive” method were observed from our general multi-trait methods for intermediate training population size. The software tool JWAS offers open-source routines to perform these analyses.

Keywords: multi-trait, mixture priors, genomic prediction, Bayesian regression, pleiotropy, GenPred, Shared data resources, Genomic Selection

GENOMIC prediction was proposed by Meuwissen et al. (2001) to incorporate marker effects from whole-genome data into genetic evaluation. In genomic prediction, all the marker or haplotype effects are estimated simultaneously, and these estimates can then be used to predict breeding values of individuals not in the training population used to estimate the effects.

Bayesian multiple-regression methods incorporating mixture priors for marker effects are used widely in genomic prediction, including various extensions to the BayesB method of Meuwissen et al. (2001). BayesB accommodates models where the prior for each marker effect follows a mixture distribution with a point mass at zero with probability π and a univariate-t distribution with probability $1 - π$ (Meuwissen et al. 2001; Gianola et al. 2009; Cheng et al. 2015b). Another model, BayesC, assumes a mixture with a point mass at zero with probability π and a univariate normal distribution with probability $1 - π$ for all marker effects, and its extension known as BayesCπ further treats π as an unknown parameter with a uniform prior distribution (Habier et al. 2011).

Bayesian multiple-regression methods were first proposed for single-trait analyses but have been extended to some particular forms of multi-trait analyses (Calus and Veerkamp 2011; Jia and Jannink 2012). Those extensions have pertained to a particular, somewhat restrictive mixture model. The “restrictive” multi-trait BayesC $Π$ presented by Jia and Jannink (2012) assumes any particular locus affects none of the traits or simultaneously affects all traits. This assumption of genetic architecture in that multi-trait BayesC $Π$ model is violated if some loci have no effect on at least one of the traits while having an effect on the remaining traits.

In this paper, we propose a more general class of multi-trait BayesC $Π$ and BayesB methods, where each locus can have an effect on any combination of traits. For example, in a 5-trait analysis, the restricted model only allows two situations, whereas ours allows all 32 situations. The previous restrictive multi-trait models are special cases of this general class of models. Further, our model allows the use of a single-site Gibbs sampler that requires less computing effort than some alternative Markov chain Monte Carlo approaches, especially for analyses involving many traits. Methodologies for the new models are compared to single-trait methods and the previous multi-trait methods using real and simulated data.

Materials and Methods

Multi-trait marker effects model

For simplicity of our description, but without loss of generality, we will assume individuals have all traits measured with a general mean as the only fixed effect, and write the multi-trait model for individual i from n genotyped individuals as

y_{i} = μ + \sum_{j = 1}^{p} m_{i j} α_{j} + e_{i},

where $y_{i}$ is a vector of phenotypes of t traits for individual i, $μ$ is a vector of overall means for t traits, $m_{i j}$ is the genotype covariate at locus j for individual i (coded as 0,1, and 2), p is the number of genotyped loci, $α_{j}$ is a vector of allele substitution effects or marker effects of t traits for locus j, and $e_{i}$ is a vector of random residuals of t traits for individual i. The fixed effects, or general mean in this case, are assigned flat priors. The residuals, $e_{i},$ are a priori assumed to be independently and identically distributed multivariate normal vectors with null mean and covariance matrix $R,$ which, in turn, is assumed to have an inverse Wishart prior distribution, $W_{t}^{- 1} (S_{e}, ν_{e}) .$

We will show that, employing the concept of data augmentation, the vector of marker effects at a particular locus $α_{j}$ can be written as $α_{j} = D_{j} β_{j},$ where $D_{j}$ is a diagonal matrix whose kth diagonal entry is an indicator variable indicating whether the marker effect of locus j for trait k is zero or nonzero, and $β_{j}$ follows a multivariate normal distribution in multi-trait BayesC $Π$ or a multivariate t distribution in multi-trait BayesB.

Multi-trait BayesC $Π$ model

Priors for marker effects:

The prior for $α_{j k},$ the allele substitution or marker effect of trait k for locus j, is a mixture with a point mass at zero and a univariate normal distribution conditional on $σ_{k}^{2} :$

α_{j k} | π_{k}, σ_{k}^{2} {\begin{array}{l} \sim N (0, σ_{k}^{2}) & p r o b a b i l i t y (1 - π_{k}) \\ 0 & p r o b a b i l i t y π_{k} \end{array}

and the covariance between effects for traits k and $k^{'}$ at the same locus, i.e., $α_{j k}$ and $α_{j k^{'}}$ is

c o v (α_{j k}, α_{j k^{'}} | σ_{k k^{'}}) = {\begin{array}{l} σ_{k k^{'}} & i f b o t h α_{j k} \neq 0 a n d α_{j k^{'}} \neq 0 \\ 0 & o t h e r w i s e \end{array} .

The vector of marker effects at a particular locus $α_{j}$ is written as $α_{j} = D_{j} β_{j},$ where $D_{j}$ is a diagonal matrix with elements $d i a g (D_{j}) = δ_{j} = (δ_{j 1}, δ_{j 2}, δ_{j 3} \dots δ_{j t}),$ where $δ_{j k}$ is an indicator variable indicating whether the marker effect of locus j for trait k is zero or nonzero, and the vector $β_{j}$ follows a multivariate normal distribution with null mean and covariance matrix $G = [\begin{matrix} σ_{1}^{2} & \dots & σ_{1 t} \\ ⋮ & ⋱ & ⋮ \\ σ_{1 t} & \dots & σ_{t}^{2} \end{matrix}]$ The covariance matrix $G$ is a priori assumed to follow an inverse Wishart distribution, $W_{t}^{- 1} (S_{β}, ν_{β}) .$ Thus, the multi-trait BayesC $Π$ model with data augmentation is written as

y_{i} = μ + \sum_{j = 1}^{p} m_{i j} D_{j} β_{j} + e_{i} .

(1)

In the most general case, any marker effect might be zero for any possible combination of t traits resulting in $2^{t}$ possible combinations of $δ_{j} .$ For example, in a t=2 trait model, there are $2^{t} = 4$ combinations for $δ_{j} :$ $(0, 0),$ $(0, 1),$ $(1, 0),$ $(1, 1) .$ In the restrictive special case of this model described by Jia and Jannink (2012), only two combinations, i.e., $(0, 0)$ and $(1, 1),$ have nonzero probability. Suppose, in general, we use numerical labels “1,” “2,” $\dots,$ “l” for the $2^{t}$ possible outcomes for $δ_{j},$ then the prior for $δ_{j}$ is a categorical distribution

p (δ_{j} = “ i ”) = Π_{1} I (δ_{j} = “ 1 ”) + Π_{2} I (δ_{j} = “ 2 ”) + \dots + Π_{l} I (δ_{j} = “ l ”),

where $\sum_{i = 1}^{l} Π_{i} = 1$ with $Π_{i}$ being the prior probability that the vector $δ_{j}$ corresponds to the vector labeled $“ i ” .$ A Dirichlet distribution with all parameters equal to one, i.e., a uniform distribution, can be used for a prior for $Π = (Π_{1}, Π_{2}, \dots, Π_{l}) .$

As shown below, we consider two Gibbs samplers to draw samples for all the parameters in this model. Gibbs sampler I is a single-site sampler, where only one of the t indicator labels is sampled at a time. Thus, in a 2-trait model, for example, this sampler cannot move from $(0, 0)$ to $(1, 1)$ in a single step without stepping through $(1, 0)$ or $(0, 1)$ for $δ_{j} .$ Therefore, Gibbs sampler I cannot be used for the restrictive model which excludes $(1, 0)$ and $(0, 1)$ from the state space for $δ_{j} .$ Gibbs sampler II, however, samples all elements of $δ_{j}$ jointly, and can move from $(0, 0)$ to $(1, 1)$ in a single step. However, Gibbs sampler II is computationally more intensive because it requires drawing samples from a multivariate normal distribution of order t, the number of traits.

Gibbs sampler I for multi-trait BayesC $Π$ :

Suppose the prior for $δ_{j}$ is a categorical distribution for which the support is the set of $2^{t}$ outcomes of $δ_{j} .$ For convenience, from now on let “1” denote trait k and “2” the other $t - 1$ traits. In our sampling scheme, $β_{j 1}$ and $δ_{j 1}$ are sampled from their joint full conditional distributions, which can be written as the product of the full conditional distribution of $β_{j 1}$ given $δ_{j 1}$ and the marginal full conditional distribution of $δ_{j 1} .$ Let $θ$ denote all other parameters except $δ_{j 1}$ and $β_{j 1},$ then our sampling scheme can be written as

f (β_{j 1}, δ_{j 1} | θ, y) = f (β_{j 1} | δ_{j 1}, θ, y) f (δ_{j 1} | θ, y) .

The full conditional distributions of $β_{j 1},$ $δ_{j 1},$ $Π,$ $G$ and $R$ for Gibbs sampler I, whose derivations are in the Appendix, are given below. The full conditional distributions of $β_{j 1}$ is

p (β_{j 1} | δ_{j 1}, θ, y) = {\begin{matrix} N ({\hat{β}}_{j 1}^{0}, {(G^{11})}^{- 1}) & w h e n δ_{j 1} = 0 \\ N ({\hat{β}}_{j 1}^{1}, {(C_{j, 11}^{1})}^{- 1}) & w h e n δ_{j 1} = 1 \end{matrix},

with

{\hat{β}}_{j 1}^{0} = - {(G^{11})}^{- 1} G^{12} β_{j 2},

{\hat{β}}_{j 1}^{1} = {(C_{j, 11}^{1})}^{- 1} (r_{j 1} - C_{j, 12}^{1} β_{j 2}),

C_{j, 11}^{1} = G^{11} + R^{11} \sum_{i = 1}^{n} m_{i j}^{2}

C_{j, 12}^{1} = G^{12} + R^{12} D_{j 2} \sum_{i = 1}^{n} m_{i j}^{2},

r_{j 1} = (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) [\begin{matrix} R^{11} \\ R^{21} \end{matrix}],

where $w_{i} = y_{i} - μ - \sum_{j^{'} \neq j} m_{i j^{'}} D_{j^{'}} β_{j^{'}},$ $G^{11}$ and $G^{12}$ are the partitions of $G^{- 1}$ corresponding to trait k and covariances between trait k and other traits, respectively. $R^{11}$ and $R^{12}$ are the partitions of $R^{- 1}$ corresponding to trait k and covariances between trait k and other traits, respectively.

The marginal full conditional probability that $δ_{j 1} = 1$ is

f (δ_{j 1} = 1 | θ, y) = {1 + {(\frac{P r (δ_{j 1} = 0, δ_{j 2} | Π)}{P r (δ_{j 1} = 1, δ_{j 2} | Π)} H)}^{- 1}}^{- 1},

where $H =$

e x p {- \frac{1}{2} (l o g C_{j, 11}^{1} - {\hat{β_{j 1}^{1}}}^{2} C_{j, 11}^{1}) - (- \frac{1}{2} (l o g G^{11} - {\hat{β_{j 1}^{0}}}^{2} G^{11}))} .

The full conditional distribution for $Π$ can be written as

f (Π | β, D, G, R, y) \propto D i r i c h l e t (n_{1} + 1, n_{2} + 1, \dots),

where $n_{i}$ is the number of loci or markers for which $δ_{j} = “ i ” .$

The full conditional distributions for $R,$ the covariance matrix for residuals, is an inverse Wishart distribution, $W_{t}^{- 1} (S_{e} + e' e, ν_{e} + n),$ where $e$ is the $n \times t$ matrix for residuals whose ith row is $e_{i}^{'} .$ The full conditional distribution for $G,$ the covariance matrix for $β_{j},$ is an inverse Wishart distribution, $W_{t}^{- 1} (S_{β} + β' β, ν_{β} + p),$ where $β$ is the $p \times t$ matrix whose ith row is $β_{i}^{'} .$

Gibbs sampler II for multi-trait BayesC $Π$ :

The Gibbs sampler above, where only one of the t indicator labels is sampled at a time, cannot be used for the restrictive model assuming any particular locus affects all traits or none of them. Further, if some particular $Π_{i}$ are near zero, the chain might exhibit mixing problems. Another, more general, but computationally intensive, Gibbs sampler that samples all elements of $δ_{j}$ jointly and may exhibit improved mixing is proposed below.

The full conditional distributions of $β_{j},$ $δ_{j},$ $Π,$ $G,$ and $R$ for Gibbs sampler II, whose derivations are in the Appendix, are given below.

Let $θ$ denote all other parameters except $β_{j}$ and $δ_{j},$ then our sampling scheme can be written as

f (β_{j}, δ_{j} | θ, y) = f (δ_{j} | θ, y) f (β_{j} | δ_{j}, θ, y) .

The full conditional distribution of $β_{j}$ is

f (β_{j} | δ_{j}, θ, y) \propto N (C_{j}^{- 1} r_{j}, C_{j}^{- 1}),

where $C_{j} = D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G^{- 1}$ and $r_{j}^{'} = (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) R^{- 1} D_{j} .$

The marginal full conditional probability of $δ_{j} = “ i ”$ is

f (δ_{j} = “ i ” | θ, y) = \frac{f (y | δ_{j} = “ i ”, θ) f (δ_{j} = “ i ” | Π)}{\sum_{“ i ” \in {“ 1 ”, “ 2 ”, \dots, “ l ”}} f (y | δ_{j} = “ i ”, θ) f (δ_{j} = “ i ” | Π)},

where

f (y | δ_{j}, θ) = {| C_{j}^{- 1} |}^{\frac{1}{2}} e x p {\frac{1}{2} r_{j}^{'} C_{j}^{- 1} r_{j}} .

This Gibbs sampler can accommodate the “restrictive” multi-trait BayesC $Π$ that was proposed by Jia and Jannink (2012), which only allows $δ_{j}$ to be a vector of all ones or a vector of all zeros.

Multi-trait BayesB model

The multi-trait BayesC $Π$ model proposed above can be modified to accommodate the general multi-trait BayesB model. Model Equation (1) can also be used for the multi-trait BayesB method. The differences in multi-trait BayesB method is that the prior for $β_{j}$ is a multivariate t distribution, rather than a multivariate normal distribution. This is equivalent to assuming $β_{j}$ has a multivariate normal distribution with null mean and locus-specific covariance matrix $G_{j},$ which is assigned an inverse Wishart prior, $W_{t}^{- 1} (S_{β}, ν_{β}) .$

The derivations of the full conditional distributions of parameters of interest for Gibbs samplers are shown in the Appendix. In the multi-trait BayesB model, the full conditional distributions for all parameters except $G_{j}$ are similar to the multi-trait BayesC $Π$ model. The full conditional distribution for $G_{j},$ the covariance matrix for $β_{j},$ is an inverse Wishart distribution, $W_{t}^{- 1} (S_{β} + β_{j} β_{j}^{'}, ν_{β} + 1) .$

Data analyses

Real data:

Published genotypic and deregressed breeding values based on phenotypic data for Loblolly Pine (Pinus taeda L.) were used (Resende et al. 2012; Daetwyler et al. 2013). Two disease traits, namely presence or absence of rust (Rust_bin) and gall volume (Rust_gall_vol) were analyzed. These are the two traits used in Jia and Jannink (2012). The reported heritabilities were 0.21 for Rust_bin and 0.12 for Rust_gall_vol. Loci with missing genotypes were imputed as the mean of the observed genotype covariates at that locus but loci with a missing rate $>$ 50% were excluded. After these quality control edits, 4828 SNPs on 807 individuals with deregressed phenotypes and genotypes on both traits remained.

Prediction accuracy was calculated as the correlation between the vector of deregressed phenotypes and the vector of estimated breeding values. Cross-validation using 10 folds formed the basis for comparing these methods. Paired t-tests were used for tests of significance of difference in prediction accuracies between two methods, where prediction accuracies for two different methods from each validation fold were considered as paired samples. The general multi-trait BayesC $Π$ model (MT-BayesC $Π$ -G) was compared to a similar model where the prior for $α_{j}$ is a multivariate normal rather than a mixture of multivariate normals (MT-BayesC0), the more restricted multi-trait BayesC $Π$ proposed by Jia and Jannink (2012) (MT-BayesC $Π$ -R) and the usual single trait formulations of the mixture models (ST-BayesC0, ST-BayesCπ). Since BayesC0 is equivalent to random regression best linear unbiased prediction (RR-BLUP), ST-BayesC0 and MT-BayesC0 are denoted as ST-RR-BLUP and MT-RR-BLUP below. The prior for the residual covariance matrix $R$ in all multi-trait methods was an inverse Wishart distribution, $W^{- 1} = ([\begin{matrix} 0.003 & 0 \\ 0 & 0.003 \end{matrix}], 6),$ for which the mean of $R$ is $[\begin{matrix} 0.001 & 0 \\ 0 & 0.001 \end{matrix}],$ the SD of diagonal elements are $1.4 \times 10^{- 3},$ and the SD of off-diagonal elements are 0. This same prior was used for the marker effects covariance matrix $G .$ The priors for the residual variance and marker effects variance in single-trait analyses were scaled inverted chi-squared distributions with scale parameter $S^{2} = 0.0005$ and degrees of freedom $ν = 4,$ for which the mean of the prior was also 0.001. In the data analyses, multi-trait BayesB methods provided similar results as multi-trait BayesC $Π$ methods. Thus, only results from BayesC $Π$ analyses were presented below to demonstrate the superiority of our multi-trait methods.

Simulated data:

Simulated data described below were used to quantify the superiority of the general multi-trait Bayesian methods. Two scenarios were simulated. In scenario 1, as a known ideal condition, the simulated genome consisted of 100 loci on each of two chromosomes that were in Hardy-Weinberg and linkage equilibria. All these loci were considered as QTL or causative variants and used in the analyses. The QTL on the first chromosome had effects only on trait 1 and those on the second chromosome only on trait 2. The effects of these QTL were simulated from a standard normal distribution and then were equally scaled to provide unit genetic variance for each trait in the simulated population of 8000 unrelated individuals. The phenotypes for these traits were obtained by adding independent residuals to the genetic values. Two situations were simulated: (1) heritabilities for both traits were 0.5; (2) heritability for trait 1 was 0.2 and for trait 2 was 0.8. The XSim package was used in the simulation (Supplemental Material, File S1) (Cheng et al. 2015a).

In scenario 2, both markers and QTL were simulated. The simulated genome consisted of 100 evenly spaced loci on each of three chromosomes of length 10 cM. Ten loci were randomly selected on each chromosome as QTL. Allele states were sampled from a Bernoulli distribution with frequency 0.5 in the base population. Starting from a base population of 500 males and 500 females, random mating was simulated for 500 generations to generate linkage disequilibrium. Random mating was continued for five more generations to increase the population size to 4000 males and 4000 females, which were used in the analyses. The effects of QTL on the first two chromosomes were simulated following the same strategy in scenario 1, i.e., the QTL on the first chromosome had effects only on trait 1, and those on the second chromosome only on trait 2. All QTL on the third chromosome had effects on both traits. The effects of these QTL on the third chromosome were simulated from a standard bivariate normal distribution with correlation 0.5. The phenotypes for these traits were obtained by adding independent residuals to the genetic values. In total, 8000 individuals were simulated with heritability 0.2 for trait 1 and 0.8 for trait 2.

The same validation approaches were used for these two simulation scenarios. A total of 500 individuals were used for testing, and for each training population of size N, 100 replicates of the training population were sampled from the remaining individuals. The values considered for N were 50, 100, 200, 400, 1000, 2000, 4000, or 7000. The true genetic and residual variances were used to compute the scale parameters for the priors of the variance components. The general multi-trait BayesC $Π$ model (MT-BayesC $Π$ -G) was compared to the more restricted multi-trait BayesC $Π$ (MT-BayesC $Π$ -R) using this dataset.

All analyses were performed using JWAS (Cheng et al. 2018), an open-source, publicly available package for single-trait and multi-trait whole-genome analyses written in the freely available Julia language.

Data availability

The genotypic and phenotypic data used in the real data analysis are publicly available (Resende et al. 2012). The scripts used to generate the simulated data are provided as supplementary information. The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article.

Results

Real data

The prediction accuracies from all methods for Rust_bin and Rust_gall_vol are in Figure 1. The prediction accuracies from all single-trait analyses using JWAS are similar to those in Resende et al. (2012).

Comparison of single-trait and multi-trait methods for Rust_bin and Rust_gall_vol traits. ST, MT-G and MT-R indicate single-trait, our general multi-trait and restricted multi-trait analyses, respectively. * indicates a statistically significant (P < 0.01) difference between methods.

The predictions of Rust_bin exhibited no significant difference in accuracy between multi-trait and single-trait analyses within each method (ST-RR-BLUP vs. MT-RR-BLUP; ST-BayesCπ vs. MT-BayesC $Π$ -R; ST-BayesCπ vs. MT-BayesC $Π$ -G).

In contrast, prediction accuracies for the lower heritability Rust_gall_vol with MT-BayesC $Π$ -G were significantly higher than those from ST-BayesCπ. MT-BayesC $Π$ -G and MT-BayesC $Π$ -R showed similar prediction accuracies. The posterior means of $Π$ for both methods are in Table 1. When RR-BLUP was used for the analysis, however, the advantage of the multiple-trait analysis (MT-RR-BLUP) over the single-trait analysis (ST-RR-BLUP) for Rust_gall_vol was not observed.

Table 1. Estimation of π for alternative multi-trait BayesC $Π$ methods.

	Different categories of δ
	$(0, 0)$	$(1, 1)$	$(0, 1)$	$(1, 0)$
MT-BayesC $Π$ -G	0.966	0.029	0.002	0.003
MT-BayesC $Π$ -R	0.971	0.029	NA^a	NA^a

Open in a new tab

Posterior mean of $Π$ were given for different categories of δ. Different categories of δ are denoted as $(k_{1}, k_{2}),$ where $k_{1} = 0$ if a marker has a null effect on Rust_bin, otherwise $k_{1} = 1,$ and similarly for $k_{2}$ representing sampled effects for Rust_gall_vol.

Combinations that do not exist in the restricted model.

Simulated data

The prediction accuracies from MT-BayesC $Π$ -G and MT-BayesC $Π$ -R methods were compared for varying size (N) of training populations under two simulation scenarios. In simulation scenario 1, Figure 2 shows the prediction accuracies where heritabilities for both traits were 0.5. Figure 3 shows the prediction accuracies where heritabilities for trait 1 and trait 2 were 0.2 and 0.8, respectively. When $N = 50,$ both methods had similar prediction accuracy. For both traits, as N increased, initially, MT-BayesC $Π$ -G became superior to MT-BayesC $Π$ -R, but, as expected, the accuracies of these methods asymptotically converged (Karaman et al. 2016). In most cases, the differences in accuracies for both traits were small. However, in Figure 3, the differences in accuracies for trait 1, for which the heritability was 0.2, were substantial for intermediate values of N. Figure 4 shows the prediction accuracies for simulation scenario 2. The pattern observed is similar to Figure 3 under simulation scenario 1. MT-BayesC $Π$ -G was superior to MT-BayesC $Π$ -R for intermediate training population, but as N increased, the accuracies of these methods asymptotically converged (Karaman et al. 2016).

Comparison of multi-trait BayesC $Π$ methods for situation 1 under simulation scenario 1.

Comparison of multi-trait BayesC $Π$ methods for situation 2 under simulation scenario 1.

Comparison of multi-trait BayesC $Π$ methods under simulation scenario 2.

Discussion

Real data

Significant differences between multi-trait and single-trait analyses were only observed for Rust_gall_vol within BayesCπ methods (MT-BayesC $Π$ -G vs. ST-BayesCπ; MT-BayesC $Π$ -R vs. ST-BayesCπ). MT-BayesC $Π$ -G and MT-BayesC $Π$ -R outperformed ST-BayesCπ for Rust_gall_vol, and the accuracy gain was $26 %$ (from 0.287 to 0.364). The lower-heritability trait Rust_gall_vol benefited from information on the other correlated trait Rust_bin. Thus higher prediction accuracy from MT- BayesC $Π$ -G were observed in trait Rust_gall_vol but not for the high heritability Rust_bin. Results in Jia and Jannink (2012) showed no difference between MT-BayesC $Π$ and ST-BayesCπ because a reduced marker panel (500 markers) was used.

The fact that RR-BLUP showed no improvement in multi-trait analyses suggested that benefits from MT-BayesC $Π$ -G may be due to the estimation of the hyper-parameter $Π .$ In the MT-BayesC $Π$ -G, the mean of the posterior probability that a marker has a null effect on Rust_gall_vol was ∼0.97, calculated as the summation of posterior mean of $Π$ for categories $(0, 0)$ and $(1, 0) .$ The posterior mean of π, the probability that a marker has a null effect, in ST-BayesCπ for Rust_gall_vol was 0.74, different from the equivalent value, 0.97, in MT-BayesC $Π$ -G shown above. Thus a ST-BayesC analysis with $π = 0.97$ was undertaken. Prediction accuracy from this ST-BayesCπ analysis with $π = 0.97$ was 0.361, which was similar to the accuracy from MT-BayesC $Π$ -G. This shows that including an additional correlated trait, especially one with high heritability, will bring in more data into the analysis, helping variable selection in a low-heritability trait to become more effective and result in improved prediction accuracy.

The difference between MT-BayesC $Π$ -G and MT-BayesC $Π$ -R is that MT-BayesC $Π$ -R assumes a locus has an effect on all traits or none of them. This assumption regarding genetic architecture is likely to be seldom true. MT-BayesC $Π$ -G and MT-BayesC $Π$ -R, however, showed similar prediction accuracies. This can be explained by the estimation of $Π$ in MT-BayesC $Π$ -G and MT-BayesC $Π$ -R in Table 1. The posterior probability means for $(0, 1)$ and $(1, 0)$ were almost zero in MT-BayesC $Π$ -G and for $(0, 0)$ and $(1, 1)$ are similar in MT-BayesC $Π$ -G and MT-BayesC $Π$ -R, suggesting that the assumption of genetic architecture whereby the same loci affect both traits as explicit in MT-BayesC $Π$ -R may be valid for these two disease traits. Note that the lack of difference between the methods may also result from the limited size of the training population.

Simulated data

In scenario 1, we simulated bivariate data where each QTL had an effect on only one or the other of the traits. In MT-BayesC $Π$ -R, if a locus has an effect on one of the traits, that locus is included in the model for all traits. So, in the simulated data, MT-BayesC $Π$ -R would need to include all loci in the model for both traits. Thus for the trait that had heritability 0.2, the contribution of noise to the prediction from loci on chromosome 2, which had no effect on this trait, is large relative to the real signal from QTL on chromosome 1. In contrast, the general variable selection in MT-BayesC $Π$ -G allows loci on chromosome 2, which have no effect on trait 1, to be excluded from the model for trait 1. Thus when sufficient data were available for variable selection to exclude loci on chromosome 2 for trait 1, MT-BayesC $Π$ -G showed a substantial advantage over MT-BayesC $Π$ -R. On the other hand, for the trait with heritability 0.8, the contribution of noise to the prediction from the loci on chromosome 1, which had no effect on this trait, is small relative to the signal from loci on chromosome 2. Thus MT-BayesC $Π$ -G and MT-BayesC $Π$ -R had similar accuracies. As the training population size increased, the contribution of noise to the prediction of a trait from loci which had no effect on this trait, vanished even when the heritability was low. This was observed for both traits as apparent in Figure 2 and Figure 3. Since only bivariate data with different heritabilities showed substantial differences in prediction accuracies, traits with different heritabilities were simulated in scenario 2. In scenario 2, both markers and QTL were simulated. As expected, MT-BayesC $Π$ -G showed higher prediction accuracy to MT-BayesC $Π$ -R for intermediate training population, but as N increased, the accuracies of these methods asymptotically converged (Karaman et al. 2016).

Further, in both real and simulated analyses, MT-BayesC $Π$ -G gave equal or higher prediction accuracy than MT-BayesC $Π$ -R. In addition, MT-BayesC $Π$ -R requires drawing samples from a multivariate normal distribution of order t, whereas Gibbs sampler I, which can be used for MT-BayesC $Π$ -G, requires sampling from a univariate normal. Thus, in addition to MT-BayesC $Π$ -G giving equal or better performance than MT-BayesC $Π$ -R, MT-BayesC $Π$ -G can also be computationally more efficient.

Priors

In practice, genetic variances from previous conventional analyses are usually used to construct priors for marker effect variances. For single trait analyses, under some assumptions, it can be shown that the marker effect variance $σ_{α}^{2}$ can be obtained as

σ_{α}^{2} = \frac{σ_{g}^{2}}{(1 - π) \sum 2 p_{j} (1 - p_{j})},

(2)

where $σ_{g}^{2}$ is the genetic variance, $p_{j}$ is the allele frequency for locus j, and π is the probability that a marker has a null effect (Habier et al. 2007; Gianola et al. 2009; Fernando and Garrick 2013). Following a similar strategy, the marker effect covariance matrix $G$ in a two-trait analysis can be obtained as

G = \frac{1}{\sum 2 p_{j} (1 - p_{j})} [\begin{matrix} \frac{Q_{11}}{p (δ = (1, 1)) + p (δ = (1, 0))} & \frac{Q_{12}}{p (δ = (1, 1))} \\ \frac{Q_{21}}{p (δ = (1, 1))} & \frac{Q_{22}}{p (δ = (1, 1)) + p (δ = (0, 1))} \end{matrix}],

(3)

where $Q = [\begin{matrix} Q_{11} & Q_{12} \\ Q_{21} & Q_{22} \end{matrix}]$ is the genetic covariance matrix and $p (δ = (0, 1)),$ $p (δ = (1, 0)),$ and $p (δ = (1, 1))$ are the probabilities a marker has null effects on the first trait but not the second trait, on the second trait but not the first trait, or on neither trait. Thus the probability that a marker has an effect on the first trait can be obtained as $p (δ = (1, 1)) + p (δ = (1, 0)),$ which is the denominator of the upper left element in (3). This strategy relating marker effect covariance matrix to genetic covariance matrix can be readily extended to >2 traits. Note that positive definite matrix $Q$ may result in negative definite matrix $G$ using (3), especially when the prior for the probability a marker has null effects is far from the real value. In that case, the diagonal elements of $G,$ which are the marker effect variances for different traits, can be obtained using (2), where π may be estimated from previous single-trait analyses, and the off-diagonal elements of $G$ may be set to zero to guarantee positive definiteness of $G .$

Multi-trait variable selection:

In regard to a single trait, a locus either has an effect, or it does not. Hence, the scalar parameter π (and its complement $1 - π$ ) completely defines this circumstance. In a multi-trait setting, it is conceivable that loci that influence one trait, may or may not influence other traits. In that circumstance, a vector $Π$ is required to define the genetic architecture. The number of parameters that constitute the vector $Π$ is $2^{t},$ which grows rapidly with the number of traits. In most cases, the researcher will have little or no knowledge of the likely extent of pleiotropy of loci that influence two traits, other than knowing or having an estimate of the genetic covariance. There are two simple ways to reduce this complexity in priors.

First, one can assume, as did Jia and Jannink (2012), that, in the context of variable selection, a locus should be selected for all of the traits or selected for none of the traits, reducing the required probabilities to being analogous to the single trait π and $(1 - π) .$ This approach has the advantage of simplicity, but the disadvantage that many effects might need to be estimated for loci that have no effect on a trait, and this may erode the accuracy of prediction. This should not be a problem for asymptotically large datasets, as in that case the fitted locus effects should converge to zero for those traits not influenced by that locus.

A second simple way to accommodate the multiple trait circumstance is to assume the $2^{t}$ parameters can be derived from t trait-specific parameters. However, when the probability that a single trait locus has an effect is small for each of two or more traits, the pair-wise probability that a locus affects all the traits will be the product of those small probabilities, making it very difficult for loci to enter the model for all traits simultaneously.

The better way to solve this problem is to use a hyper-parameter $Π$ that completely defines the alternative models that are required to capture all the alternative forms of genetic architecture. We have shown here how this can be done, with two alternative Gibbs sampling strategies. One involves single-site sampling for one locus and trait at a time. The other samples all the alternative combinations of effects for one locus considering all traits simultaneously. We have shown that both are practical with real data and can result in improved accuracies of prediction in certain circumstances in terms of genetic architecture and size of dataset.

Conclusions

Many researchers are interested in genome-wide association studies and finding causal genes and variants. For those researchers, pleiotropy is of considerable interest, and they would want to know which loci affect which traits, from a purely biological perspective. Practitioners are often interested in “breaking” the genetic correlation, by selecting parents to give a favorable selection response in respect to multiple trait consequences. In either of these circumstances, with intermediate- rather than asymptotically large datasets, we believe the methods described here and available in the open-source, freely-available JWAS package offer real promise.

Supplementary Material

Supplemental material is available online at www.genetics.org/lookup/suppl/doi:10.1534/genetics.118.300650/-/DC1.

Click here for additional data file.^{(55.1KB, pdf)}

Acknowledgments

We thank bioRxiv for making this manuscript available early online as preprints. This work was supported by the United States Department of Agriculture, Agriculture and Food Research Initiative National Institute of Food and Agriculture Competitive grant no. 2015-67015-22947.

Appendix

Gibbs Sampler Algorithm for Multi-Trait BayesC $Π$ -G

Single-site Gibbs sampler for multi-trait BayesC $Π$ -G

The full conditional distribution of $β_{j 1}$ can be written as

\begin{array}{l} f (β_{j 1} | δ_{j 1}, β_{- j 1}, D_{- j 1}, G, R, y) \propto f (y | μ, β, D, G, R) f (β_{j 1}, β_{j 2} | G) \\ \propto e x p [- \frac{1}{2} \sum_{i = 1}^{n} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G^{- 1} β_{j}), \end{array}

where $w_{i} = y_{i} - μ_{i} - \sum_{j^{'} \neq j} m_{i j^{'}} D_{j^{'}} β_{j^{'}} .$ Further, by dropping factors that do not involve $β_{j 1},$

\begin{matrix} f (β_{j 1} | δ_{j 1}, β_{- j 1}, D_{- j 1}, G, R, y) \propto e x p {- \frac{1}{2} [β_{j}^{'} (D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G^{- 1}) β_{j} - 2 \sum_{i = 1}^{n} w_{i}^{'} m_{i j} R^{- 1} D_{j} β_{j}]} \\ \propto e x p {- \frac{1}{2} [β_{j}^{'} C_{j} β_{j} - 2 r_{j}^{'} β_{j}]} \\ \propto e x p {- \frac{1}{2} [[\begin{matrix} β_{j 1} & β_{j 2}^{'} \end{matrix}] [\begin{matrix} C_{j, 11} & C_{j, 12} \\ C_{j, 21} & C_{j, 22} \end{matrix}] [\begin{matrix} β_{j 1} \\ β_{j 2} \end{matrix}] - 2 [\begin{matrix} r_{j 1} & r_{j 2}^{'} \end{matrix}] [\begin{matrix} β_{j 1} \\ β_{j 2} \end{matrix}]]} \\ \propto e x p {- \frac{1}{2} (C_{j, 11} β_{j 1}^{2} + (2 C_{j, 12} β_{j 2} - 2 r_{j 1}) β_{j 1})} \\ \propto e x p {- \frac{C_{j, 11}}{2} {(β_{j 1} + (C_{j, 12} β_{j 2} - r_{j 1}) C_{j, 11}^{- 1})}^{2}} \\ \propto N (C_{j, 11}^{- 1} (r_{j 1} - C_{j, 12} β_{j 2}), C_{j, 11}^{- 1}) \\ \propto N (β_{_{j 1}}^{^}, C_{j, 11}^{- 1}) \end{matrix}

where $C_{j} = D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G^{- 1}$ and $r_{j}^{'} = (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) R^{- 1} D_{j} .$

Note that when $δ_{j 1} = 0,$

\begin{matrix} C_{j} = [\begin{matrix} C_{j, 11}^{0} & C_{j, 12}^{0} \\ C_{j, 21}^{0} & C_{j, 22}^{0} \end{matrix}] \\ = [\begin{matrix} G^{11} & G^{12} \\ G^{21} & G^{22} + D_{j 2}^{'} R^{22} D_{j 2} \sum_{i = 1}^{n} m_{i j}^{2} \end{matrix}] \\ r_{j}^{'} = [\begin{matrix} r_{j 1}^{0} & r_{j 2}^{0^{'}} \end{matrix}] \\ = [\begin{matrix} 0 & (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) [\begin{matrix} R^{12} \\ R^{22} \end{matrix}] D_{j 2} \end{matrix}] \end{matrix}

When $δ_{j 1} = 1,$

\begin{matrix} C_{j} = [\begin{matrix} C_{j, 11}^{1} & C_{j, 12}^{1} \\ C_{j, 21}^{1} & C_{j, 22}^{1} \end{matrix}] \\ = [\begin{matrix} G^{11} + R^{11} \sum_{i = 1}^{n} m_{i j}^{2} & G^{12} + R^{12} D_{j 2} {\sum_{i = 1}^{n} m_{i j}^{2}}_{i j}^{2} \\ G^{21} + D_{j 2}^{'} R^{21} \sum_{i = 1}^{n} m_{i j}^{2} & G^{22} + D_{j 2}^{'} R^{22} D_{j 2} \sum_{i = 1}^{n} m_{i j}^{2} \end{matrix}] \\ r_{j}^{'} = [\begin{matrix} r_{j 1}^{1} & r_{j 2}^{1^{'}} \end{matrix}] \\ = [\begin{matrix} (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) [\begin{matrix} R^{11} \\ R^{21} \end{matrix}] & (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) [\begin{matrix} R^{12} \\ R^{22} \end{matrix}] D_{j 2} \end{matrix}] \end{matrix}

Thus when $δ_{j 1} = 0,$ the full conditional distribution of $β_{j 1}$ is

f (β_{j 1} | δ_{j 1} = 0, β_{- j 1}, D_{- j 1}, G, R, y) \propto N (\hat{β_{j 1}^{0}}, {(C_{j, 11}^{0})}^{- 1}) = N (- {(G^{11})}^{- 1} G^{12} β_{j 2}, {(G^{11})}^{- 1}) .

When $δ_{j 1} = 1,$ the full conditional distribution of $β_{j 1}$ becomes

f (β_{j 1} | δ_{j 1} = 1, β_{- j 1}, D_{- j 1}, G, R, y) \propto N (\hat{β_{j 1}^{1}}, {(C_{j, 11}^{1})}^{- 1}) = N ({(C_{j, 11}^{1})}^{- 1} (r_{j 1} - C_{j, 12}^{1} β_{j 2}), {(C_{j, 11}^{1})}^{- 1}) .

The marginal full conditional distribution of $δ_{j 1}$ can be written as

\begin{matrix} f (δ_{j 1} = 1 | θ, y) = \frac{f (δ_{j 1} = 1, θ, y)}{\sum_{δ_{j 1} \in (0, 1)} f (δ_{j 1}, θ, y)} \\ = \frac{f (y | δ_{j 1} = 1, θ) f (δ_{j 1} = 1, δ_{j 2} | Π)}{\sum_{δ_{j 1} \in (0, 1)} f (y | δ_{j 1}, θ) f (δ_{j} | Π)} . \\ = {1 + \frac{f (y | δ_{j 1} = 0, θ) f (δ_{j 1} = 0, δ_{j 2} | Π)}{f (y | δ_{j 1} = 1, θ) f (δ_{j 1} = 1, δ_{j 2} | Π)}}^{- 1} \end{matrix}

The factor $f (y | δ_{j 1}, θ)$ can be written as

\begin{matrix} f (y | δ_{j 1}, θ) \propto \int f (y | μ, β_{j 1}, β_{- j 1}, D, G, R) f (β_{j 1}, β_{j 2} | G) d β_{j 1} \\ \propto \int e x p [- \frac{1}{2} \sum_{i = 1}^{n} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G^{- 1} β_{j}) d β_{j 1} \\ \propto e x p {- \frac{1}{2} (\sum_{i} w_{i}^{'} R^{- 1} w_{i} - 2 r_{j 2}^{'} β_{j 2} + β_{j 2}^{'} C_{j, 22} β_{j 2} - {(r_{j 1} - C_{j, 12} β_{j 2})}^{2} C_{j, 11}^{- 1})} \\ \times \int e x p [- \frac{1}{2} {(β_{j 1} - \hat{β_{j 1}})}^{2} C_{j, 11}] d β_{j 1} \\ \propto {(C_{j, 11})}^{- \frac{1}{2}} e x p {- \frac{1}{2} (\sum_{i} w_{i}^{'} R^{- 1} w_{i} - 2 r_{j 2}^{'} β_{j 2} + β_{j 2}^{'} C_{j, 22} β_{j 2} - {(r_{j 1} - C_{j, 12} β_{j 2})}^{2} C_{j, 11}^{- 1})} \\ \propto {(C_{j, 11})}^{- \frac{1}{2}} e x p {- \frac{1}{2} (\sum_{i} w_{i}^{'} R^{- 1} w_{i} - 2 r_{j 2}^{'} β_{j 2} + β_{j 2}^{'} C_{j, 22} β_{j 2} - {\hat{β_{j 1}}}^{2} C_{j, 11})} \end{matrix}

Note that $\sum_{i} w_{i}^{'} R^{- 1} w_{i}, r_{j 2}^{'} β_{j 2}, β_{j 2}^{'} C_{j, 22} β_{j 2}$ are same when $δ_{j 1} = 0$ or 1. Thus the ratio $f (y | δ_{j 1} = 1, θ) / f (y | δ_{j 1} = 0, θ)$ becomes

\begin{array}{l} H = {(C_{j, 11}^{1})}^{- \frac{1}{2}} {(G^{11})}^{\frac{1}{2}} e x p (- \frac{1}{2} ({\hat{β_{j 1}^{0}}}^{2} G^{11} - {\hat{β_{j 1}^{1}}}^{2} C_{j, 11}^{1})) \\ = e x p {- \frac{1}{2} (l o g C_{j, 11}^{1} - {\hat{β_{j 1}^{1}}}^{2} C_{j, 11}^{1}) - (- \frac{1}{2} (l o g G^{11} - {\hat{β_{j 1}^{0}}}^{2} G^{11}))} \end{array}

Thus the conditional probability of $δ_{j 1} = 1$ is

{1 + \frac{f (y | δ_{j 1} = 0, θ) f (δ_{j 1} = 0, δ_{j 2} | Π)}{f (y | δ_{j 1} = 1, θ) f (δ_{j 1} = 1, δ_{j 2} | Π)}}^{- 1} = {1 + {(\frac{Π_{j 0}}{Π_{j 1}} H)}^{- 1}}^{- 1},

where $Π_{j 0} = P r (δ_{j 1} = 0, δ_{j 2} | Π)$ and $Π_{j 1} = P r (δ_{j 1} = 1, δ_{j 2} | Π) .$

The full conditional distribution for $Π$ can be written as

\begin{matrix} f (Π | β, D, G, R, y) \propto f (δ | Π) f (Π) \\ \propto Π_{1}^{n_{1}} Π_{2}^{n_{2}} \dots Π_{l}^{n_{l}} \\ \propto D i r i c h l e t (n_{1} + 1, n_{2} + 1, \dots), \end{matrix}

where $n_{i}$ is the number of markers with $δ_{j} = “ i ” .$

Joint Gibbs sampler for multi-trait BayesC $Π$ -G

Let $θ$ denote all other parameters except $β_{j}$ and $δ_{j},$ then our sampling scheme can be written as

f (β_{j}, δ_{j} | θ, y) = f (δ_{j} | θ, y) f (β_{j} | δ_{j}, θ, y)

The marginal full conditional distribution of $δ_{j}$ can be written as

\begin{matrix} f (δ_{j} | θ, y) = \frac{f (δ_{j}, θ, y)}{\sum_{δ_{j}} f (δ_{j}, θ, y)} \\ = \frac{f (y | δ_{j}, θ) f (δ_{j} | Π)}{\sum_{δ_{j}} f (y | δ_{j}, θ) f (δ_{j} | Π)} . \end{matrix}

Denote $w_{i} = y_{i} - μ_{i} - \sum_{j^{'} \neq j} m_{i j^{'}} D_{j^{'}} β_{j^{'}},$ then

\begin{matrix} f (y | δ_{j}, θ) \propto \int f (y | β, D, R) f (β_{j} | G) d β_{j} \\ \propto \int e x p [- \frac{1}{2} \sum_{i = 1}^{n} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G^{- 1} β_{j}) d β_{j} \\ \propto \int e x p {- \frac{1}{2} [β_{j}^{'} (D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G^{- 1}) β_{j} - 2 \sum_{i = 1}^{n} w_{i}^{'} m_{i j} R^{- 1} D_{j} β_{j} + \sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i}]} d β_{j} \\ \propto \int e x p {- \frac{1}{2} [β_{j}^{'} C_{j} β_{j} - 2 r_{j}^{'} β_{j} + \sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i}]} d β_{j} \\ \propto \int e x p {- \frac{1}{2} [(β_{j}^{'} - r_{j}^{'} C_{j}^{- 1}) C_{j} (β_{j} - C_{j}^{- 1} r_{j}) + \sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i} - r_{j}^{'} C_{j}^{- 1} r_{j}]} d β_{j} \\ \propto e x p {- \frac{1}{2} [\sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i} - r_{j}^{'} C_{j}^{- 1} r_{j}]} \\ \times {| C_{j}^{- 1} |}^{\frac{1}{2}} \int {| C_{j}^{- 1} |}^{- \frac{1}{2}} e x p [- \frac{1}{2} (β_{j}^{'} - r_{j}^{'} C_{j}^{- 1}) C_{j} (β_{j} - C_{j}^{- 1} r_{j})] d β_{j} \\ \propto {| C_{j}^{- 1} |}^{\frac{1}{2}} e x p {- \frac{1}{2} [\sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i} - r_{j}^{'} C_{j}^{- 1} r_{j}]}, \end{matrix}

where $C_{j} = D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G^{- 1}$ and $r_{j}^{'} = ({\sum^{}}_{i = 1}^{n} w_{i}^{'} m_{i j}) R^{- 1} D_{j} .$

Note that $\sum_{i}^{} w_{i}^{'} R^{- 1} w_{i}$ is same for different $δ_{j} .$ Thus the marginal full conditional distribution of $δ_{j}$ can be written as

f (δ_{j} | θ, y) = \frac{f (y | δ_{j}, θ) f (δ_{j} | Π)}{\sum_{δ_{j}} f (y | δ_{j}, θ) f (δ_{j} | Π)},

where

f (y | δ_{j}, θ) \propto {| C_{j}^{- 1} |}^{\frac{1}{2}} e x p {\frac{1}{2} r_{j}^{'} C_{j}^{- 1} r_{j}} .

The full conditional distribution of $β_{j}$ is

\begin{matrix} f (β_{j} | δ_{j}, θ, y) \propto e x p [- \frac{1}{2} \sum_{i = 1}^{n} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G^{- 1} β_{j}), \\ \propto e x p {- \frac{1}{2} [β_{j}^{'} (D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G^{- 1}) β_{j} - 2 \sum_{i = 1}^{n} w_{i}^{'} m_{i j} R^{- 1} D_{j} β_{j}]} \\ \propto e x p {- \frac{1}{2} [β_{j}^{'} C_{j} β_{j} - 2 r_{j}^{'} β_{j}]} \\ \propto e x p {- \frac{1}{2} (β_{j}^{'} - r_{j}^{'} C_{j}^{- 1}) C_{j} (β_{j} - C_{j}^{- 1} r_{j})} \\ \propto N (C_{j}^{- 1} r_{j}, C_{j}^{- 1}) \end{matrix}

Gibbs Sampler Algorithm for Multi-Trait BayesB

Single-site Gibbs sampler for multi-trait BayesB

For convenience, from now on let “1” denote trait k and “2” the other traits. Thus, $β_{j}$ can be denoted as $[\begin{matrix} β_{j 1} \\ β_{j 2} \end{matrix}]$ and $D_{j}$ can be denoted as $[\begin{matrix} δ_{j 1} & 0 \\ 0 & D_{j 2} \end{matrix}] .$ The Gibbs sampler for $β_{j k}$ and $δ_{j k}$ is derived as below. In our sampling scheme, $β_{j 1}$ and $δ_{j 1}$ are sampled from their joint full conditional distributions, which can be written as the product of the full conditional distribution of $β_{j 1}$ given $δ_{j 1}$ and the marginal full conditional distribution of $δ_{j} .$ Let $θ$ denote all other parameters except $δ_{j 1}$ and $β_{j 1},$ then our sampling scheme can be written as

f (β_{j 1}, δ_{j 1} | θ, y) = f (β_{j 1} | δ_{j 1}, θ, y) f (δ_{j 1} | θ, y) .

The full conditional distribution of $β_{j}$ can be written as

\begin{matrix} f (β_{j 1} | δ_{j 1}, β_{- j 1}, D_{- j 1}, G_{j}, G_{- j}, R, y) \propto f (y | μ, β, D, G_{j}, G_{- j}, R) f (β_{j 1}, β_{j 2} | G_{j}) \\ \propto e x p [- \frac{1}{2} \sum_{i = 1}^{n} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G_{j}^{- 1} β_{j}), \end{matrix}

where $w_{i} = y_{i} - μ_{i} - \sum_{j^{'} \neq j} m_{i j^{'}} D_{j^{'}} β_{j^{'}} .$ Further, by dropping factors that do not involve $β_{j 1},$

\begin{matrix} f (β_{j 1} | δ_{j 1}, β_{- j 1}, D_{- j 1}, G_{j}, G_{- j}, R, y) \propto e x p {- \frac{1}{2} [β_{j}^{'} (D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G_{j}^{- 1}) β_{j} - 2 \sum_{i = 1}^{n} w_{i}^{'} m_{i j} R^{- 1} D_{j} β_{j}]} \\ \propto e x p {- \frac{1}{2} [β_{j}^{'} C_{j} β_{j} - 2 r_{j}^{'} β_{j}]} \\ \propto e x p {- \frac{1}{2} [[\begin{matrix} β_{j 1} & β_{j 2}^{'} \end{matrix}] [\begin{matrix} C_{j, 11} & C_{j, 12} \\ C_{j, 21} & C_{j, 22} \end{matrix}] [\begin{matrix} β_{j 1} \\ β_{j 2} \end{matrix}] - 2 [\begin{matrix} r_{j 1} & r_{j 2}^{'} \end{matrix}] [\begin{matrix} β_{j 1} \\ β_{j 2} \end{matrix}]]} \\ \propto e x p {- \frac{1}{2} (C_{j, 11} β_{j 1}^{2} + (2 C_{j, 12} β_{j 2} - 2 r_{j 1}) β_{j 1})} \\ \propto e x p {- \frac{C_{j, 11}}{2} {(β_{j 1} + (C_{j, 12} β_{j 2} - r_{j 1}) C_{j, 11}^{- 1})}^{2}} \\ \propto N (C_{j, 11}^{- 1} (r_{j 1} - C_{j, 12} β_{j 2}), C_{j, 11}^{- 1}) \\ \propto N (\hat{β_{j 1}}, C_{j, 11}^{- 1}) \end{matrix}

where $C_{j} = D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G_{j}^{- 1}$ and $r_{j}^{'} = (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) R^{- 1} D_{j} .$

Note that, when $δ_{j 1} = 0,$

\begin{matrix} C_{j} = [\begin{matrix} G_{j}^{11} & G_{j}^{12} \\ G_{j}^{21} & G_{j}^{22} + D_{j 2}^{'} R^{22} D_{j 2} \sum_{i = 1}^{n} m_{i j}^{2} \end{matrix}] \\ r_{j}^{'} = [\begin{matrix} 0 & (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) [\begin{matrix} R^{12} \\ R^{22} \end{matrix}] D_{j 2} \end{matrix}] \end{matrix}

When $δ_{j 1} = 1,$

\begin{matrix} C_{j} = [\begin{matrix} C_{j, 11}^{1} & C_{j, 12}^{1} \\ C_{j, 21}^{1} & C_{j, 22}^{1} \end{matrix}] \\ = [\begin{matrix} G_{j}^{11} + R^{11} \sum_{i = 1}^{n} m_{i j}^{2} & G_{j}^{12} + R^{12} D_{j 2} \sum_{i = 1}^{n} m_{i j}^{2} \\ G_{j}^{21} + D_{j 2}^{'} R^{21} \sum_{i = 1}^{n} m_{i j}^{2} & G_{j}^{22} + D_{j 2}^{'} R^{22} D_{j 2} \sum_{i = 1}^{n} m_{i j}^{2} \end{matrix}] \\ r_{j}^{'} = [\begin{matrix} r_{j 1}^{1} & r_{j 2}^{1'} \end{matrix}] \\ = [\begin{matrix} (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) [\begin{matrix} R^{11} \\ R^{21} \end{matrix}] & (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) [\begin{matrix} R^{12} \\ R^{22} \end{matrix}] D_{j 2} \end{matrix}] \end{matrix}

Thus, when $δ_{j 1} = 0,$ the full conditional distribution of $β_{j 1}$ is

f (β_{j 1} | δ_{j 1} = 0, β_{- j 1}, D_{- j 1}, G_{j}, G_{- j}, R, y) \propto N (- {(G_{j}^{11})}^{- 1} G_{j}^{12} β_{j 2}, {(G_{j}^{11})}^{- 1}) .

When $δ_{j 1} = 1,$ the full conditional distribution of $β_{j 1}$ becomes

f ((β_{j 1} | δ_{j 1} = 1, β_{- j 1}, D_{- j 1}, G_{j}, G_{- j}, R, y) \propto N (C_{j, 11}^{1 - 1} (r_{j 1} - C_{j, 12}^{1} β_{j 2}), C_{j, 11}^{1 - 1}) .

The marginal full conditional distribution of $δ_{j 1}$ can be written as

\begin{matrix} f (δ_{j 1} = 1 | θ, y) = \frac{f (δ_{j 1}, θ, y)}{\sum_{δ_{j 1} \in (0, 1)} f (δ_{j 1}, θ, y)} \\ = \frac{f (y | δ_{j 1} = 1, θ) f (δ_{j 1} = 1, δ_{j 2} | Π)}{\sum_{δ_{j 1} \in (0, 1)} f (y | δ_{j 1}, θ) f (δ_{j} | Π)} . \\ = {1 + \frac{f (y | δ_{j 1} = 0, θ) f (δ_{j 1} = 0, δ_{j 2} | Π)}{f (y | δ_{j 1} = 0, θ) f (δ_{j 1} = 1, δ_{j 2} | Π)}}^{- 1} \end{matrix}

The factor $f (y | δ_{j 1}, θ)$ can be written as

\begin{matrix} f (y | δ_{j 1}, θ) \propto \int f (y | μ, β_{j 1}, β_{- j 1}, D, G, R) f (β_{j 1}, β_{j 2} | G_{j}) d β_{j 1} \\ \propto \int e x p [- \frac{1}{2} \sum_{i = 1}^{n} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G_{j}^{- 1} β_{j}) d β_{j 1} \\ \propto e x p {- \frac{1}{2} (\sum_{i} w_{i}^{'} R^{- 1} w_{i} - 2 r_{j 2}^{'} β_{j 2} + β_{j 2}^{'} C_{j, 22} β_{j 2} - {(r_{j 1} - C_{j, 12} β_{j 2})}^{2} C_{j, 11}^{- 1})} \\ \times \int e x p [- \frac{1}{2} {(β_{j 1} - β_{_{j 1}}^{^})}^{2} C_{j, 11}] d β_{j 1} \\ \propto {(C_{j, 11})}^{- \frac{1}{2}} e x p {- \frac{1}{2} (\sum_{i} w_{i}^{'} R^{- 1} w_{i} - 2 r_{j 2}^{'} β_{j 2} + β_{j 2}^{'} C_{j, 22} β_{j 2} - {(r_{j 1} - C_{j, 12} β_{j 2})}^{2} C_{j, 11}^{- 1})} \\ \propto {(C_{j, 11})}^{- \frac{1}{2}} e x p {- \frac{1}{2} (\sum_{i} w_{i}^{'} R^{- 1} w_{i} - 2 r_{j 2}^{'} β_{j 2} + β_{j 2}^{'} C_{j, 22} β_{j 2} - {\hat{β_{j 1}}}^{2} C_{j, 11})} . \end{matrix}

\begin{matrix} H = {(C_{j, 11}^{1})}^{- \frac{1}{2}} {(G_{j}^{11})}^{\frac{1}{2}} e x p (- \frac{1}{2} ({\hat{β_{j 1}^{0}}}^{2} G_{j}^{11} - \hat{β_{j 1}^{1}} C_{j, 11}^{1})) \\ = e x p {- \frac{1}{2} (l o g C_{j, 11}^{1} - {\hat{β_{j 1}^{1}}}^{2} C_{j, 11}^{1}) - (- \frac{1}{2} (l o g G_{j}^{11} - {\hat{β_{j 1}^{0}}}^{2} G_{j}^{11}))} \end{matrix}

Thus, the conditional probability of $δ_{j 1} = 1$ is

{1 + \frac{f (y | δ_{j 1} = 0, θ) f (δ_{j 1} = 0, δ_{j 2} | Π_{1}, Π_{2...})}{f (y | δ_{j 1} = 1, θ) f (δ_{j 1} = 1, δ_{j 2} | Π_{1}, Π_{2...})}}^{- 1} = {1 + {(\frac{Π_{j 0}}{Π_{j 1}} H)}^{- 1}}^{- 1},

where $Π_{j 0} = P r (δ_{j 1} = 0, δ_{j 2} | Π)$ and $Π_{j 1} = P r (δ_{j 1} = 1, δ_{j 2} | Π) .$

Joint Gibbs sampler for multi-trait BayesB

Let $θ$ denote all other parameters except $β_{j}$ and $δ_{j},$ then our sampling scheme can be written as

f (β_{j}, δ_{j} | θ, y) = f (δ_{j} | θ, y) f (β_{j} | δ_{j}, θ, y)

The marginal full conditional distribution of $δ_{j}$ can be written as

f (δ_{j} | θ, y) = \frac{f (δ_{j}, θ, y)}{\sum_{δ_{j}} f (δ_{j}, θ, y)} = \frac{f (y | δ_{j}, θ) f (δ_{j} | Π)}{\sum_{δ_{j}} f (y | δ_{j}, θ) f (δ_{j} | Π)} .

Denote $w_{i} = y_{i} - μ_{i} - \sum_{j^{'} \neq j} m_{i j^{'}} D_{j^{'}} β_{j^{'}},$ then

\begin{matrix} f (y | δ_{j}, θ) \propto \int f (y | β, D, R) f (β_{j} | G_{j}) d β_{j} \\ \propto \int e x p [- \frac{1}{2} \overset{n}{\sum_{i = 1}} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G_{j}^{- 1} β_{j}) d β_{j} \\ \propto \int e x p {- \frac{1}{2} [β_{j}^{'} (D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G_{j}^{- 1}) β_{j} - 2 \sum_{i = 1}^{n} w_{i}^{'} m_{i j} R^{- 1} D_{j} β_{j} + \sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i}]} d β_{j} \\ \propto \int e x p {- \frac{1}{2} [β_{j}^{'} C_{j} β_{j} - 2 r_{j}^{'} β_{j} + \sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i}]} d β_{j} \\ \propto \int e x p {- \frac{1}{2} [(β_{j}^{'} - r_{j}^{'} C_{j}^{- 1}) C_{j} (β_{j} - C_{j}^{- 1} r_{j}) + \sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i} - r_{j}^{'} C_{j}^{- 1} r_{j}]} d β_{j} \\ \propto e x p {- \frac{1}{2} [\sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i} - r_{j}^{'} C_{j}^{- 1} r_{j}]} \\ \times {| C_{j}^{- 1} |}^{\frac{1}{2}} \int {| C_{j}^{- 1} |}^{- \frac{1}{2}} e x p [- \frac{1}{2} (β_{j}^{'} - r_{j}^{'} C_{j}^{- 1}) C_{j} (β_{j} - C_{j}^{- 1} r_{j})] d β_{j} \\ \propto {| C_{j}^{- 1} |}^{\frac{1}{2}} e x p {- \frac{1}{2} [\sum_{i = 1}^{n} w_{i}^{'} R^{- 1} w_{i} - r_{j}^{'} C_{j}^{- 1} r_{j}]}, \end{matrix}

where $C_{j} = D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G_{j}^{- 1}$ and $r_{j}^{'} = (\sum_{i = 1}^{n} w_{i}^{'} m_{i j}) R^{- 1} D_{j} .$

Note that $\sum_{i} w_{i}^{'} R^{- 1} w_{i}$ is same for different $δ_{j} .$ Thus the marginal full conditional distribution of $δ_{j}$ can be written as

f (δ_{j} | θ, y) = \frac{f (y | δ_{j}, θ) f (δ_{j} | Π)}{\sum_{δ_{j}} f (y | δ_{j}, θ) f (δ_{j} | Π)},

where

f (y | δ_{j}, θ) \propto {| C_{j}^{- 1} |}^{\frac{1}{2}} e x p {\frac{1}{2} r_{j}^{'} C_{j}^{- 1} r_{j}} .

The full conditional distribution of $β_{j}$ is

\begin{matrix} f (β_{j} | δ_{j}, θ, y) \propto e x p [- \frac{1}{2} \sum_{i = 1}^{n} {(w_{i} - m_{i j} D_{j} β_{j})}^{'} R^{- 1} (w_{i} - m_{i j} D_{j} β_{j})] e x p (- \frac{1}{2} β_{j}^{'} G_{j}^{- 1} β_{j}), \\ \propto e x p {- \frac{1}{2} [β_{j}^{'} (D_{j}^{'} R^{- 1} D_{j} \sum_{i = 1}^{n} m_{i j}^{2} + G_{j}^{- 1}) β_{j} - 2 \sum_{i = 1}^{n} w_{i}^{'} m_{i j} R^{- 1} D_{j} β_{j}]} \\ \propto e x p {- \frac{1}{2} [β_{j}^{'} C_{j} β_{j} - 2 r_{j}^{'} β_{j}]} \\ \propto e x p {- \frac{1}{2} (β_{j}^{'} - r_{j}^{'} C_{j}^{- 1}) C_{j} (β_{j} - C_{j}^{- 1} r_{j})} \\ \propto N (C_{j}^{- 1} r_{j}, C_{j}^{- 1}) \end{matrix}

Footnotes

Communicating editor: M. Calus

Literature Cited

Calus M. P., Veerkamp R. F., 2011. Accuracy of multi-trait genomic selection using different methods. Genet. Sel. Evol. 43: 26 10.1186/1297-9686-43-26 [DOI] [PMC free article] [PubMed] [Google Scholar]
Cheng H., Garrick D., Fernando R., 2015a XSim: simulation of descendants from ancestors with sequence data. G3 5: 1415–1417. 10.1534/g3.115.016683 [DOI] [PMC free article] [PubMed] [Google Scholar]
Cheng H., Qu L., Garrick D. J., Fernando R. L., 2015b A fast and efficient Gibbs sampler for BayesB in whole-genome analyses. Genet. Sel. Evol. 47: 80 10.1186/s12711-015-0157-x [DOI] [PMC free article] [PubMed] [Google Scholar]
Cheng H., Fernando R. L., Garrick D. J., 2018. JWAS: Julia implementation of whole-genome analysis software. Proceedings of the World Congress on Genetics Applied to Livestock Production, 11.859. Auckland, New Zealand. [Google Scholar]
Daetwyler H. D., Calus M. P. L., Pong-Wong R., de los Campos G., Hickey J. M., 2013. Genomic prediction in animals and plants: simulation of data, validation, reporting, and benchmarking. Genetics 193: 347–365. 10.1534/genetics.112.147983 [DOI] [PMC free article] [PubMed] [Google Scholar]
Fernando R. L., Garrick D., 2013. Bayesian methods applied to GWAS, pp. 237–274 in Genome-Wide Association Studies and Genomic Prediction. Humana Press, Totowa, NJ: 10.1007/978-1-62703-447-0_10 [DOI] [PubMed] [Google Scholar]
Gianola D., de los Campos G., Hill W. G., Manfredi E., Fernando R., 2009. Additive genetic variability and the Bayesian alphabet. Genetics 183: 347–363. 10.1534/genetics.109.103952 [DOI] [PMC free article] [PubMed] [Google Scholar]
Habier D., Fernando R. L., Dekkers J. C. M., 2007. The impact of genetic relationship information on genome-assisted breeding values. Genetics 177: 2389–2397. [DOI] [PMC free article] [PubMed] [Google Scholar]
Habier D., Fernando R. L., Kizilkaya K., Garrick D. J., 2011. Extension of the Bayesian alphabet for genomic selection. BMC Bioinformatics 12: 186 10.1186/1471-2105-12-186 [DOI] [PMC free article] [PubMed] [Google Scholar]
Jia Y., Jannink J.-L., 2012. Multiple-trait genomic selection methods increase genetic value prediction accuracy. Genetics 192: 1513–1522. 10.1534/genetics.112.144246 [DOI] [PMC free article] [PubMed] [Google Scholar]
Karaman E., Cheng H., Firat M. Z., Garrick D. J., Fernando R. L., 2016. An upper bound for accuracy of prediction using GBLUP. PLoS One 11: e0161054 10.1371/journal.pone.0161054 [DOI] [PMC free article] [PubMed] [Google Scholar]
Meuwissen T. H. E., Hayes B. J., Goddard M. E., 2001. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157: 1819–1829. [DOI] [PMC free article] [PubMed] [Google Scholar]
Resende M. F. R., Muñoz P., Resende M. D. V., Garrick D. J., Fernando R. L., et al. , 2012. Accuracy of genomic selection methods in a standard data set of Loblolly Pine (Pinus taeda L.). Genetics 190: 1503–1510. 10.1534/genetics.111.137026 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Click here for additional data file.^{(55.1KB, pdf)}

Data Availability Statement

[bib1] Calus M. P., Veerkamp R. F., 2011. Accuracy of multi-trait genomic selection using different methods. Genet. Sel. Evol. 43: 26 10.1186/1297-9686-43-26 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] Cheng H., Garrick D., Fernando R., 2015a XSim: simulation of descendants from ancestors with sequence data. G3 5: 1415–1417. 10.1534/g3.115.016683 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] Cheng H., Qu L., Garrick D. J., Fernando R. L., 2015b A fast and efficient Gibbs sampler for BayesB in whole-genome analyses. Genet. Sel. Evol. 47: 80 10.1186/s12711-015-0157-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Cheng H., Fernando R. L., Garrick D. J., 2018. JWAS: Julia implementation of whole-genome analysis software. Proceedings of the World Congress on Genetics Applied to Livestock Production, 11.859. Auckland, New Zealand. [Google Scholar]

[bib5] Daetwyler H. D., Calus M. P. L., Pong-Wong R., de los Campos G., Hickey J. M., 2013. Genomic prediction in animals and plants: simulation of data, validation, reporting, and benchmarking. Genetics 193: 347–365. 10.1534/genetics.112.147983 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Fernando R. L., Garrick D., 2013. Bayesian methods applied to GWAS, pp. 237–274 in Genome-Wide Association Studies and Genomic Prediction. Humana Press, Totowa, NJ: 10.1007/978-1-62703-447-0_10 [DOI] [PubMed] [Google Scholar]

[bib7] Gianola D., de los Campos G., Hill W. G., Manfredi E., Fernando R., 2009. Additive genetic variability and the Bayesian alphabet. Genetics 183: 347–363. 10.1534/genetics.109.103952 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Habier D., Fernando R. L., Dekkers J. C. M., 2007. The impact of genetic relationship information on genome-assisted breeding values. Genetics 177: 2389–2397. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] Habier D., Fernando R. L., Kizilkaya K., Garrick D. J., 2011. Extension of the Bayesian alphabet for genomic selection. BMC Bioinformatics 12: 186 10.1186/1471-2105-12-186 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] Jia Y., Jannink J.-L., 2012. Multiple-trait genomic selection methods increase genetic value prediction accuracy. Genetics 192: 1513–1522. 10.1534/genetics.112.144246 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] Karaman E., Cheng H., Firat M. Z., Garrick D. J., Fernando R. L., 2016. An upper bound for accuracy of prediction using GBLUP. PLoS One 11: e0161054 10.1371/journal.pone.0161054 [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] Meuwissen T. H. E., Hayes B. J., Goddard M. E., 2001. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157: 1819–1829. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Resende M. F. R., Muñoz P., Resende M. D. V., Garrick D. J., Fernando R. L., et al. , 2012. Accuracy of genomic selection methods in a standard data set of Loblolly Pine (Pinus taeda L.). Genetics 190: 1503–1510. 10.1534/genetics.111.137026 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Genomic Prediction from Multiple-Trait Bayesian Regression Methods Using Mixture Priors

Hao Cheng

Kadir Kizilkaya

Jian Zeng

Dorian Garrick

Rohan Fernando

Abstract

Materials and Methods

Multi-trait marker effects model

Multi-trait BayesCΠ model

Priors for marker effects:

Gibbs sampler I for multi-trait BayesCΠ:

Gibbs sampler II for multi-trait BayesCΠ:

Multi-trait BayesB model

Data analyses

Real data:

Simulated data:

Data availability

Results

Real data

Figure 1.

Table 1. Estimation of π for alternative multi-trait BayesCΠ methods.

Simulated data

Figure 2.

Figure 3.

Figure 4.

Discussion

Real data

Simulated data

Priors

Multi-trait variable selection:

Conclusions

Supplementary Material

Acknowledgments

Appendix

Gibbs Sampler Algorithm for Multi-Trait BayesCΠ-G

Single-site Gibbs sampler for multi-trait BayesCΠ-G

Joint Gibbs sampler for multi-trait BayesCΠ-G

Gibbs Sampler Algorithm for Multi-Trait BayesB

Single-site Gibbs sampler for multi-trait BayesB

Joint Gibbs sampler for multi-trait BayesB

Footnotes

Literature Cited

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Multi-trait BayesC $Π$ model

Gibbs sampler I for multi-trait BayesC $Π$ :

Gibbs sampler II for multi-trait BayesC $Π$ :

Table 1. Estimation of π for alternative multi-trait BayesC $Π$ methods.

Gibbs Sampler Algorithm for Multi-Trait BayesC $Π$ -G

Single-site Gibbs sampler for multi-trait BayesC $Π$ -G

Joint Gibbs sampler for multi-trait BayesC $Π$ -G