A Genomic Bayesian Multi-trait and Multi-environment Model

Osval A Montesinos-López; Abelardo Montesinos-López; José Crossa; Fernando H Toledo; Oscar Pérez-Hernández; Kent M Eskridge; Jessica Rutkoski

2016 Jun 24;6(9):2725–2744. doi: 10.1534/g3.116.032359


PMCID: PMC5015931 PMID: 27342738

Abstract

When information on multiple genotypes evaluated in multiple environments is recorded, a multi-environment single-trait model for assessing genotype × environment interaction (G × E) is usually employed. Comprehensive models that simultaneously take into account correlated traits and trait × genotype × environment interaction (T × G × E) are lacking. In this research, we propose a Bayesian whole-genome prediction (WGP) model for analyzing multiple traits and multiple environments. For this model, we used Half-$t$ priors on each standard deviation term and uniform priors on each correlation of the covariance matrix. These priors were noninformative and led to posterior inferences that were insensitive to the choice of hyper-parameters. We also developed a computationally efficient Markov Chain Monte Carlo (MCMC) algorithm under the above priors, which allowed us to obtain all required full conditional distributions of the parameters, leading to an exact Gibbs sampler for the posterior distribution. We used two real data sets to implement and evaluate the proposed Bayesian method and found that when the correlation between traits was high (>0.5), the proposed model (with unstructured variance–covariance) improved prediction accuracy compared to the model with diagonal and standard variance–covariance structures. The R-software package Bayesian Multi-Trait and Multi-Environment (BMTME) offers optimized C++ routines to efficiently perform the analyses.

Keywords: multi-trait, multi-environment, Bayesian estimation, genome-enabled prediction, genomic selection, GenPred, shared data resource

Since the whole-genome prediction (WGP) model of Meuwissen et al. (2001), practical results have shown that genomic selection (GS) using Bayesian and non-Bayesian linear regression models improves prediction accuracy compared to conventional and pedigree selection (de los Campos et al. 2009, 2010; Crossa et al. 2010, 2011; Heslot et al. 2012; Pérez-Rodríguez et al. 2012). With GS, genomic breeding values are estimated as the sum of marker effects for genotyped individuals in the testing or prediction population. The marker effects are estimated simultaneously using a training population that contains phenotyped and genotyped individuals.

In plant breeding, most of the available methods for WGP are useful for analyzing a single trait measured either in a single environment or in multiple environments with the incorporation of genotype × environment interaction (G × E) (Burgueño et al. 2012; Heslot et al. 2014; Jarquín et al. 2014; Montesinos-López et al. 2015; López-Cruz et al. 2015). However, researchers often face situations in which multiple traits are measured across multiple environments. For example, crop breeders record phenotypic data for multiple traits such as grain yield and its components (e.g., grain type, grain weight, biomass), grain quality (e.g., taste, shape, color, nutrient content), and tolerance to biotic and abiotic stresses. They often aim to improve all these correlated traits simultaneously or to predict the ones that are difficult to measure from those that are easy to measure. Nevertheless, it is common practice to perform an independent analysis and genomic prediction for each phenotypic trait.

The advantage of jointly modeling multiple traits compared to analyzing each trait separately is that the inference process appropriately accounts for the correlation among the traits, which helps to increase prediction accuracy, statistical power, parameter estimation accuracy, and reduce trait selection bias (Henderson and Quaas 1976; Pollak et al. 1984; Schaeffer 1984). In the context of WGP, Jia and Jannink (2012), Guo et al. (2014), and Jiang et al. (2015) found that joint prediction of multiple traits benefits from genetic correlation between traits and significantly improves prediction accuracy compared to single trait methods, specifically for low-heritability traits that are genetically correlated with a high-heritability trait. Jia and Jannink (2012) also found better prediction accuracy for multiple traits than for single traits when phenotypes are not available for all individuals and traits. Therefore, there is evidence that multiple trait analysis is useful to predict yet-to-be observed phenotypes in plant and animal breeding when selecting unphenotyped candidates early through the prediction of their genomic breeding values. Multi-trait analysis has also been found to substantially increase prediction accuracy when some traits are observed in all individuals but the trait of interest is not observed in the individuals in the test set (Pszczola et al. 2013; Rutkoski et al. 2016).

Multivariate analysis of continuous outcomes is well established in the statistical literature (Johnson and Wichern 1992). However, the available methods cannot be applied in a straightforward manner for WGP, since the number of independent variables ($p$) is usually larger than the available sample size ($n$). The genomic best linear unbiased predictor (GBLUP) WGP model can be implemented in standard software for multiple traits and multiple environments by taking into account two-way interaction terms and estimating separable unstructured covariance matrices of the form $A \otimes B$ (where $A$ and $B$ are the corresponding covariance matrices of factors A and B, respectively). However, these software programs are unable to estimate separable unstructured variance–covariance matrices of the form $A \otimes B \otimes C$ for three-way interaction terms. For this reason, in this situation, either at least one of the variance–covariance components is assumed to be an identity matrix, or a new variable is created by merging two factors and a covariance matrix with only two components is estimated as $A \otimes B^{*}$, where $B^{*}$ contains the variance–covariance of two factors whose components cannot be separated. Also, univariate Bayesian inference has been proposed and extensively implemented in WGP models (Gianola 2013). The Bayesian alphabet methods (Bayes A, Bayes C, and Bayes C$π$) have been extended for multiple trait analysis (de los Campos and Gianola 2007; Calus and Veerkamp 2011; Jia and Jannink 2012; Guo et al. 2014) and, most recently, Jiang et al. (2015) proposed a Bayesian multivariate antedependence model.

Despite evidence of the increased prediction accuracy of WGP models incorporating G × E (Burgueño et al. 2012; Jarquín et al. 2014; Montesinos-López et al. 2015; López-Cruz et al. 2015) and of WGP models for multi-trait data, statistical models for continuous data that simultaneously assess multiple traits and multiple environments are lacking. Thus, the integration of these two approaches in one unified WGP model is required (Jiang et al. 2015). This unified WGP model would be useful in two cases: (i) when individuals are measured for all traits in one environment, but only for some traits in other environments; and (ii) when some traits are recorded in only a subset of individuals in all environments. This model would be useful not only in plant breeding but also in animal breeding, where genetic evaluation of many traits is performed on a weekly basis by many breeding programs globally. It is also possible to integrate other advantageous strategies such as the antedependence model to incorporate dominant and epistatic effects.

All the Bayesian methods developed so far for multiple trait analysis use the Inverse-Wishart (IW) conjugate family of distributions as priors for the covariance matrices between traits. However, Gelman (2006) and Huang and Wand (2013) argued against using IW priors for covariance matrices because they impose a degree of informativity and the posterior inferences are sensitive to the choice of hyper-parameters. Recently, Huang and Wand (2013) proposed a scale mixture approach involving an IW distribution and independent Inverse-Gamma (IG) distributions for each dimension as priors for the covariance matrix parameters. The ensuing covariance matrix distribution is such that all standard deviation parameters have Half-$t$ distributions and the correlation parameters have uniform distributions on (−1, 1) for a particular choice of the IW shape parameter. The advantage of this approach is that it is possible to choose shape and scale parameters that achieve arbitrarily high noninformativity of all standard deviation and correlation parameters (Huang and Wand 2013). However, the model proposed by Huang and Wand (2013) is a standard mixed model with correlated errors that does not include interaction terms of any kind and therefore does not consider three-way interaction.

In this study, we propose a Bayesian method that integrates the analysis of multi-traits and multi-environments and takes into account trait × genotype × environment interaction (T × G × E) in a unified WGP model. We used Half-$t$ priors on each standard deviation term and uniform priors on each correlation to achieve high noninformativity and posterior inferences that are not sensitive to the choice of hyper-parameters. We illustrate the use of the unified Bayesian Multi-Trait and Multi-Environment (BMTME) method in simulated data sets and two real data sets (one maize and one wheat) including multiple traits measured on wheat and maize lines evaluated in multiple environments and genotyped with dense molecular markers. We also provide an R package called BMTME that can be used to fit the proposed methods.

Methods

Statistical model

We use $y_{i j k}^{(l)}$ to represent the normal response from the $k$th replication of the $j$th line in the $i$th environment for the $l$th trait ($i = 1, 2, \dots, I$; $j = 1, 2, \dots, J$; $k = 1, 2, \dots, K$; $l = 1, \dots, L$), where $K$ represents the number of replicates of each line in each environment and $L$ denotes the number of traits under study. To present the theory in a simple manner, we will use $I = 3$ and $L = 3$. The total number of observations for the $l$th trait is $n = I \times J \times K$. We propose the following linear mixed model for each trait:

$$y_{i j k}^{(l)} = E_{i}^{(l)} + g_{j}^{(l)} + g E_{i j}^{(l)} + e_{i j k}^{(l)} \tag{1}$$

where $E_{i}^{(l)}$ represents the effect of the $i$th environment for the $l$th trait and is assumed to be a fixed effect, $g_{j}^{(l)}$ represents the genomic effect of the $j$th line for the $l$th trait and is assumed to be a random effect, $g E_{i j}^{(l)}$ is the interaction between the genomic effect of the $j$th line and the $i$th environment for the $l$th trait and is assumed to be a random effect, and $e_{i j k}^{(l)}$ is a random error term associated with the $k$th replication of the $j$th line in the $i$th environment for the $l$th trait. To take into account the correlation between traits, one could use the following $L$-variate linear mixed model:

$$y_{i j k} = X_{i j k} β + Z_{1 i j k} b_{1 j} + Z_{2 i j k} b_{2 i j} + e_{i j k} \tag{2}$$

where $y_{i j k} = {[y_{i j k}^{(1)}, \dots, y_{i j k}^{(3)}]}^{T}$; $X_{i j k} = \begin{bmatrix} x_{i}^{T (1)} & 0 & 0 \\ 0 & x_{i}^{T (2)} & 0 \\ 0 & 0 & x_{i}^{T (3)} \end{bmatrix}$; $β = {[β^{T (1)}, β^{T (2)}, β^{T (3)}]}^{T}$; $Z_{1 i j k} = Z_{2 i j k} = I_{3}$; $b_{1 j} = {[b_{1 j}^{(1)}, \dots, b_{1 j}^{(3)}]}^{T} \sim N_{L} (0, Σ_{t})$, where $Σ_{t}$ is the genetic covariance matrix between traits and is assumed unstructured; $b_{2 i j} = {[b_{2 i j}^{(1)}, \dots, b_{2 i j}^{(3)}]}^{T}$; and $e_{i j k} = {[e_{i j k}^{(1)}, \dots, e_{i j k}^{(3)}]}^{T} \sim N_{L} (0, R_{e})$, where $R_{e}$ is the residual covariance matrix between traits and is assumed unstructured. In addition, $x_{i}^{T (l)} = [x_{i 1}^{(l)}, x_{i 2}^{(l)}, x_{i 3}^{(l)}]$, with $x_{i r}^{(l)} = 1$ if the environment $i$ is observed and 0 otherwise for the $l$th trait, for $r = 1, 2, 3$ and $l = 1, 2, 3$; $β^{T (l)} = [β_{1}^{(l)}, β_{2}^{(l)}, β_{3}^{(l)}]$; $x_{i}^{T (l)} β^{(l)} = E_{i}^{(l)}$; $b_{1 j}^{(l)} = g_{j}^{(l)}$; and $b_{2 i j}^{(l)} = g E_{i j}^{(l)}$. With model (1), we can perform a separate analysis for each trait, with the inconvenience that independence between the $L$ traits is assumed. Model (2) can take into account and exploit the correlation between traits.
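As a small worked illustration, the fixed-effect part of model (2) can be written out for an observation in environment $i = 2$. This assumes one reading of the indicator coding, namely that $x_{i r}^{(l)} = 1$ exactly when $r = i$; under that assumption the term reduces to the three environment-by-trait means:

```latex
x_{2}^{T (l)} = [0, 1, 0], \qquad
X_{2 j k} \, \beta =
\begin{bmatrix}
x_{2}^{T (1)} \beta^{(1)} \\
x_{2}^{T (2)} \beta^{(2)} \\
x_{2}^{T (3)} \beta^{(3)}
\end{bmatrix}
=
\begin{bmatrix}
\beta_{2}^{(1)} \\
\beta_{2}^{(2)} \\
\beta_{2}^{(3)}
\end{bmatrix}
=
\begin{bmatrix}
E_{2}^{(1)} \\
E_{2}^{(2)} \\
E_{2}^{(3)}
\end{bmatrix}.
```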

In matrix notation, the model given in equation (2) including all the information is expressed as:

$$Y = X β + Z_{1} b_{1} + Z_{2} b_{2} + e \tag{3}$$

where $Y$ is of order $L n \times 1$, $X$ is of order $L n \times I L$, $β$ is of order $I L \times 1$, $Z_{1}$ is of order $L n \times L J$, $b_{1}$ is of order $L J \times 1$, $Z_{2}$ is of order $L n \times I J L$, $b_{2}$ is of order $I J L \times 1$, and $e$ is of order $L n \times 1$. Then $b_{1} \sim N (0, G_{1})$, $b_{2} \sim N (0, G_{2})$, and $e \sim N (0, R)$, where $G_{1} = G_{g} \otimes Σ_{t}$, $\otimes$ denotes the Kronecker product, and $G_{2} = Σ_{E} \otimes G_{1}$. Here $Σ_{E}$ is assumed to be a diagonal matrix of order $I \times I$, which means that we are assuming independence between environments. It is important to point out that the trait × environment (T × E) interaction term is included in the fixed effect $β$, while the trait × genotype (T × G) interaction term is included in the random effect $b_{1}$ and the three-way (T × G × E) interaction term is included in $b_{2}$. The errors are assumed to be correlated, with covariance $R = I_{n} \otimes R_{e}$. Other variance–covariance structures, such as diagonal or identity, are straightforward to implement. Also note that $G_{g}$ is of order $J \times J$; therefore, $G_{1}$ is of order $J L \times J L$ and $G_{2}$ is of order $I J L \times I J L$. The matrix of the genomic relationship between lines, $G_{g}$, also known as the Genomic Relationship Matrix (GRM), was calculated using the method of VanRaden (2008).
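The Kronecker structure of $G_{1}$ and $G_{2}$ and their orders can be checked on a toy example. The sketch below is only an illustration in plain Python: the helper `kron`, the two-line matrix `G_g`, and the toy sizes are assumptions made for this example, while `Sigma_t` and `Sigma_E` reuse the values of the simulation study.

```python
def kron(a, b):
    """Kronecker product of two matrices stored as lists of rows."""
    return [
        [a_ij * b_kl for a_ij in row_a for b_kl in row_b]
        for row_a in a
        for row_b in b
    ]


# Toy sizes (hypothetical): I = 3 environments, J = 2 lines, L = 3 traits.
I, J, L = 3, 2, 3

# Hypothetical 2 x 2 genomic relationship matrix for the two lines.
G_g = [[1.0, 0.3],
       [0.3, 1.0]]

# Trait covariance and diagonal environment covariance (simulation values).
Sigma_t = [[0.600, 0.466, 0.551],
           [0.466, 0.500, 0.503],
           [0.551, 0.503, 0.700]]
Sigma_E = [[0.65, 0.0, 0.0],
           [0.0, 0.55, 0.0],
           [0.0, 0.0, 0.75]]

G_1 = kron(G_g, Sigma_t)   # covariance of b_1, order JL x JL
G_2 = kron(Sigma_E, G_1)   # covariance of b_2, order IJL x IJL

print(len(G_1), len(G_1[0]))  # J*L rows and columns
print(len(G_2), len(G_2[0]))  # I*J*L rows and columns
```

Because `Sigma_E` is diagonal, `G_2` is block diagonal: the blocks that link two different environments are zero, which is the independence between environments stated in the text.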

Joint posterior density and prior specification

In this section, we provide the joint posterior density and prior specification for the Bayesian WGP Multiple Trait and Multiple Environment (BMTME) model. The joint posterior density of the parameter vector becomes:

$$
\begin{aligned}
P (β, b_{1}, b_{2}, Σ_{t}, Σ_{E}, σ_{β}^{2}, R_{e}, a_{β}, a, a_{E}, a_{e}) \propto {} & P (Y | β, b_{1}, b_{2}, R_{e}) \, P (β | σ_{β}^{2}) \, P (σ_{β}^{2} | a_{β}) \, P (a_{β}) \\
& \times P (b_{1} | Σ_{t}) \, P (Σ_{t} | a_{1}, \dots, a_{L}) \, P (a_{1}, \dots, a_{L}) \\
& \times P (b_{2} | Σ_{t}, Σ_{E}) \, P (Σ_{E} | a_{E 1}, \dots, a_{E I}) \, P (a_{E 1}, \dots, a_{E I}) \\
& \times P (R_{e} | a_{e 1}, \dots, a_{e L}) \, P (a_{e 1}, \dots, a_{e L})
\end{aligned}
\tag{4}
$$

where $a = (a_{1}, \dots, a_{L}), a_{E} = (a_{E 1}, \dots, a_{E I}), a_{e} = (a_{e 1}, \dots, a_{e L})$ .

The notation $Ω \sim IW (κ, B)$ indicates that $Ω$ follows an Inverse-Wishart distribution with density $P (Ω) \propto {| B |}^{\frac{κ}{2}} {| Ω |}^{- \frac{κ + p + 1}{2}} \exp [- \frac{1}{2} \mathrm{tr} (B Ω^{- 1})]$, where $κ > 0$ and $B$ and $Ω$ are both positive definite matrices of order $p \times p$. We assume that $β | σ_{β}^{2} \sim N_{I L} (β_{0}, Σ_{0} σ_{β}^{2})$ and $σ_{β}^{2} | a_{β} \sim IW (ν_{β}, 2 ν_{β} / a_{β})$, where $IW (ν_{β}, 2 ν_{β} / a_{β})$ denotes an Inverse-Wishart distribution with shape parameter $ν_{β}$ and scale parameter $2 ν_{β} / a_{β}$, with $a_{β} \sim IG (\frac{1}{2}, 1 / A_{β}^{2})$, where $IG (\frac{1}{2}, 1 / A_{β}^{2})$ denotes an IG distribution with shape parameter $1 / 2$ and scale parameter $1 / A_{β}^{2}$. For the genomic effects, $b_{1} | Σ_{t} \sim N_{J L} (0, G_{1})$, $Σ_{t} | a_{1}, \dots, a_{L} \sim IW (ν_{t} + L - 1, 2 ν_{t} \, \mathrm{diag} (\frac{1}{a_{1}}, \dots, \frac{1}{a_{L}}))$, and $a_{l} \sim IG (\frac{1}{2}, 1 / A_{l}^{2})$ for $l = 1, \dots, L$. For the interaction and residual terms, $b_{2} | Σ_{t}, Σ_{E} \sim N_{I J L} (0, G_{2})$, $R_{e} | a_{e 1}, \dots, a_{e L} \sim IW (ν_{e} + L - 1, 2 ν_{e} \, \mathrm{diag} (\frac{1}{a_{e 1}}, \dots, \frac{1}{a_{e L}}))$, and $a_{e l} \sim IG (\frac{1}{2}, 1 / A_{e l}^{2})$ for $l = 1, \dots, L$. Since $Σ_{E} = \mathrm{diag} (σ_{E 1}^{2}, \dots, σ_{E I}^{2})$, the prior for each environmental variance is $σ_{E i}^{2} | a_{E i} \sim IW (ν_{E i}, 2 ν_{E i} / a_{E i})$, with $a_{E i} \sim IG (\frac{1}{2}, 1 / A_{E i}^{2})$ for $i = 1, \dots, I$.
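The scale-mixture construction behind these priors can be illustrated in the scalar case. With the density above and $p = 1$, $IW (ν, 2 ν / a)$ is the same as $IG (ν / 2, ν / a)$, and mixing over $a \sim IG (\frac{1}{2}, 1 / A^{2})$ gives a Half-$t$ distribution with $ν$ degrees of freedom and scale $A$ for the standard deviation. The sketch below is a minimal simulation check, not part of the BMTME package; the function names are hypothetical, and $A = 1$ is used only to keep the numbers readable. For $ν = 2$ the Half-$t$ median is $A \sqrt{2 / 3}$, which the empirical median should approach.

```python
import math
import random


def sample_inverse_gamma(rng, shape, scale):
    """Draw from IG(shape, scale): the reciprocal of a Gamma(shape, rate=scale) draw."""
    # random.gammavariate takes (shape, scale), so rate = scale_IG means scale = 1 / scale_IG.
    return 1.0 / rng.gammavariate(shape, 1.0 / scale)


def sample_half_t_sd(rng, nu, A):
    """Draw a standard deviation through the IG scale mixture (scalar case)."""
    a = sample_inverse_gamma(rng, 0.5, 1.0 / A ** 2)       # a ~ IG(1/2, 1/A^2)
    sigma2 = sample_inverse_gamma(rng, nu / 2.0, nu / a)   # sigma^2 | a ~ IG(nu/2, nu/a)
    return math.sqrt(sigma2)


rng = random.Random(1)
nu, A = 2.0, 1.0  # A = 1 is an illustrative choice, not the value used in the paper
draws = sorted(sample_half_t_sd(rng, nu, A) for _ in range(20000))
median_sd = draws[len(draws) // 2]
theoretical_median = A * math.sqrt(2.0 / 3.0)  # Half-t median for nu = 2

print(median_sd, theoretical_median)
```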

Next we combine the joint posterior density of the parameter vector (4) with the priors to obtain the full conditional distribution for parameters $β$ , $σ_{β}^{2}$ , $a_{β}$ , $b_{1}$ , $b_{2}$ , $Σ_{t},$ $a, R_{e}, a_{e}$ . All full conditionals, as well as details of their derivations, are given in Appendix A.

Gibbs sampler

In order to produce posterior means for all relevant model parameters, below we outline the exact Gibbs sampler procedure that we propose for estimating the parameters of interest. As is the case with Markov Chain Monte Carlo (MCMC) techniques, the ordering of draws is somewhat arbitrary; however, we suggest the following order:

Step 1. Simulate $β$ according to the normal distribution given in Appendix A (A.1).
Step 2. Simulate $σ_{β}^{2}$ according to the IW distribution given in Appendix A (A.2).
Step 3. Simulate $a_{β}$ according to the IG distribution given in Appendix A (A.3).
Step 4. Simulate $b_{h}$, for $h = 1, 2$, according to the normal distribution given in Appendix A (A.4 and A.5).
Step 5. Simulate $Σ_{t}$ according to the IW distribution given in Appendix A (A.6).
Step 6. Simulate $a_{l}$, for $l = 1, 2, \dots, L$, according to the IG distribution given in Appendix A (A.7).
Step 7. Simulate $σ_{E i}^{2}$, for $i = 1, \dots, I$, according to the IW distribution given in Appendix A (A.8).
Step 8. Simulate $a_{E i}$, for $i = 1, \dots, I$, according to the IG distribution given in Appendix A (A.9).
Step 9. Simulate $R_{e}$ according to the IW distribution given in Appendix A (A.10).
Step 10. Simulate $a_{e l}$, for $l = 1, 2, \dots, L$, according to the IG distribution given in Appendix A (A.11).
Step 11. Return to step 1 or terminate when chain length is adequate to meet convergence diagnostics.
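The cycle of draws above can be illustrated with a much smaller analogue. The sketch below is not the BMTME sampler, whose full conditionals are in Appendix A. It is a toy univariate model, $y_{i} \sim N (μ, σ^{2})$, with a flat prior on $μ$ (an assumption made here) and the same kind of scale-mixture prior on $σ^{2}$, for which the three full conditionals have closed forms. It mirrors the pattern of Steps 1 to 3: a location draw, a variance draw, and a draw of the auxiliary variable, followed by burn-in removal. All names and settings are hypothetical.

```python
import random


def inv_gamma(rng, shape, scale):
    """Draw from IG(shape, scale) as the reciprocal of a Gamma(shape, rate=scale) draw."""
    return 1.0 / rng.gammavariate(shape, 1.0 / scale)


def toy_gibbs(y, n_iter=2000, burn_in=1000, nu=2.0, A=100000.0, seed=0):
    """Gibbs sampler for y_i ~ N(mu, sigma2) with a Half-t(nu, A) prior on sigma."""
    rng = random.Random(seed)
    n = len(y)
    ybar = sum(y) / n
    mu, sigma2, a = ybar, 1.0, 1.0          # starting values
    kept_mu, kept_sigma2 = [], []
    for it in range(n_iter):
        # mu | sigma2, y ~ N(ybar, sigma2 / n) under a flat prior on mu
        mu = rng.gauss(ybar, (sigma2 / n) ** 0.5)
        # sigma2 | mu, a, y ~ IG((n + nu) / 2, SS / 2 + nu / a)
        ss = sum((yi - mu) ** 2 for yi in y)
        sigma2 = inv_gamma(rng, (n + nu) / 2.0, ss / 2.0 + nu / a)
        # a | sigma2 ~ IG((nu + 1) / 2, nu / sigma2 + 1 / A^2)
        a = inv_gamma(rng, (nu + 1.0) / 2.0, nu / sigma2 + 1.0 / A ** 2)
        if it >= burn_in:                   # keep only post-burn-in samples, no thinning
            kept_mu.append(mu)
            kept_sigma2.append(sigma2)
    return sum(kept_mu) / len(kept_mu), sum(kept_sigma2) / len(kept_sigma2)


# Simulated toy data: true mean 15 and true standard deviation 0.5.
data_rng = random.Random(42)
y = [data_rng.gauss(15.0, 0.5) for _ in range(200)]
post_mu, post_sigma2 = toy_gibbs(y)
print(post_mu, post_sigma2)
```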

Model implementation

The Gibbs sampler described above for the BMTME model was implemented as an R-software package. We performed a total of 60,000 iterations; the first 30,000 were discarded as burn-in to decrease the MCMC errors in prediction accuracy, and the remaining 30,000 samples were used for inference. We did not thin the chains, following the suggestions of Geyer (1992), MacEachern and Berliner (1994), and Link and Eaton (2012), who provide justification for the ban on subsampling MCMC output when approximating simple features of the target distribution (e.g., means, variances, and percentiles).

We implemented the prior specification given in the previous section, where the BMTME model was defined. The hyper-parameters were as follows: for $β | σ_{β}^{2}$ we used $N_{I L} (β_{0} = 0_{I L}^{T}, I_{I L} \times 10{,}000)$; for $σ_{β}^{2} | a_{β}$ we used $ν_{β} = 2$; for $a_{β}$ we used $A_{β} = 100{,}000$; for $b_{1} | Σ_{t}$ we used $N_{J L} (0, G_{1})$; for $Σ_{t} | a_{1}, \dots, a_{L}$ we used $ν_{t} = 2$; for $a_{l}$ we used $A_{l} = 100{,}000$ for $l = 1, 2, \dots, L$; for $b_{2} | Σ_{t}, Σ_{E}$ we used $N_{I J L} (0, G_{2})$; for $σ_{E i}^{2} | a_{E i}$ we used $ν_{E i} = 2$ and for $a_{E i}$ we used $A_{E i} = 100{,}000$ for $i = 1, 2, \dots, I$; for $R_{e} | a_{e 1}, \dots, a_{e L}$ we used $ν_{e} = 2$; and for $a_{e l}$ we used $A_{e l} = 100{,}000$ for $l = 1, \dots, L$. All these hyper-parameters were chosen to yield weakly informative priors.

Assessing prediction accuracy

We used two cross-validation schemes for generating training and validation sets that mimic two real situations a breeder might face. Cross-validation 1 (CV1) mimics a situation where lines were evaluated in some environments for all traits but some lines are missing in other environments; this is similar to cross-validation 2 of Burgueño et al. (2012). The other cross-validation scheme is CV2, which mimics a situation where a trait is lacking in all lines in one environment but present in the remaining environments (see Table D1 in Appendix D). In this case, information from relatives is used, and prediction assessment can benefit from borrowing information between lines across environments, and among correlated traits.

We implemented a 10-fold cross-validation with 80% of the observations in the training set and 20% in the testing set. Of the variety of methods for comparing the predictive posterior distribution to the observed data (generally termed “posterior predictive checks”), we used two criteria: the mean square error of prediction (MSEP) and the Pearson correlation. Smaller MSEP values and higher correlation values indicate better predictions. The predicted observations were calculated from the $S$ collected Gibbs samples as ${\hat{Y}}^{(s)} = X β^{(s)} + Z_{1} b_{1}^{(s)} + Z_{2} b_{2}^{(s)}$, where $β^{(s)}$, $b_{1}^{(s)}$, and $b_{2}^{(s)}$ are the values of $β$, $b_{1}$, and $b_{2}$ in the $s$th collected sample.
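The two criteria can be sketched in a few lines of plain Python. The example assumes that the final prediction is the element-wise average of the $S$ sampled prediction vectors ${\hat{Y}}^{(s)}$, which the text does not state explicitly; the function names and the tiny numeric example are hypothetical.

```python
import math


def posterior_mean_prediction(sampled_predictions):
    """Average S sampled prediction vectors element-wise (assumed combination rule)."""
    s = len(sampled_predictions)
    return [sum(column) / s for column in zip(*sampled_predictions)]


def msep(observed, predicted):
    """Mean square error of prediction."""
    return sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)


def pearson(x, y):
    """Pearson correlation between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Two hypothetical Gibbs samples of predictions for three test observations.
samples = [[1.0, 2.0, 3.0],
           [1.2, 1.8, 3.4]]
observed = [1.0, 2.0, 3.0]

y_hat = posterior_mean_prediction(samples)
print(msep(observed, y_hat), pearson(observed, y_hat))
```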

Simulation data

To illustrate the parameter estimation of the proposed BMTME method, a small simulation experiment was conducted. The data were simulated based on model (3) with three environments, three traits, 80 genotypes, and 20 replications. We assumed that $β^{T} = [15, 8, 7, 12, 6, 7, 14, 9, 8]$, where the first three β coefficients belong to traits 1, 2, and 3 in environment 1, the second three to the three traits in environment 2, and the last three to the three traits in environment 3; $Σ_{t} = \begin{bmatrix} 0.600 & 0.466 & 0.551 \\ 0.466 & 0.500 & 0.503 \\ 0.551 & 0.503 & 0.700 \end{bmatrix}$; and $R_{e} = \begin{bmatrix} 0.150 & 0.114 & 0.119 \\ 0.114 & 0.120 & 0.106 \\ 0.119 & 0.106 & 0.130 \end{bmatrix}$. Each of these two variance–covariance matrices implies a correlation of approximately 0.85 between every pair of traits. Also, we assumed that the GRM is known, $G_{g} = 0.7 I_{80} + 0.3 J_{80}$, where $I_{80}$ is an identity matrix of order 80 and $J_{80}$ is a matrix of ones of order $80 \times 80$. The covariance between environments was assumed to be $Σ_{E} = \mathrm{diag} (0.65, 0.55, 0.75)$. Therefore, the total number of observations was $3 \times 80 \times 3 \times 20 = 14{,}400$, i.e., $4800$ for each trait. With these parameters, 50 data sets were simulated according to model (3) and, for each data set, parameters $β^{T}$, $Σ_{t}$, $Σ_{E}$, and $R_{e}$ were estimated with the BMTME model using the Gibbs sampler given above. We used the priors given in the section on model implementation, which were also used for the applications with real data sets. For each simulated data set, we computed 20,000 MCMC samples, and Bayes estimates were computed with 10,000 samples, since the first 10,000 were discarded as burn-in. In Table 1, we report average estimates along with standard deviations (SD).
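The correlations implied by the two simulation matrices, and the observation counts, can be verified directly. The following is a short check in plain Python; `cov_to_corr` is a helper written for this illustration.

```python
import math


def cov_to_corr(cov):
    """Convert a covariance matrix (list of rows) into a correlation matrix."""
    sd = [math.sqrt(cov[i][i]) for i in range(len(cov))]
    return [[cov[i][j] / (sd[i] * sd[j]) for j in range(len(cov))]
            for i in range(len(cov))]


Sigma_t = [[0.600, 0.466, 0.551],
           [0.466, 0.500, 0.503],
           [0.551, 0.503, 0.700]]
R_e = [[0.150, 0.114, 0.119],
       [0.114, 0.120, 0.106],
       [0.119, 0.106, 0.130]]

corr_t = cov_to_corr(Sigma_t)
corr_e = cov_to_corr(R_e)

# Off-diagonal correlations of both matrices (all close to 0.85).
off_diagonal = [m[i][j] for m in (corr_t, corr_e)
                for i in range(3) for j in range(i + 1, 3)]
print([round(r, 3) for r in off_diagonal])

# Observation counts: environments x genotypes x traits x replications.
n_total = 3 * 80 * 3 * 20
n_per_trait = n_total // 3
print(n_total, n_per_trait)
```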

Table 1. Simulated data with three traits and three environments.

	Posterior Mean of ${\hat{Σ}}_{t}$			Posterior SD of ${\hat{Σ}}_{t}$
	T1	T2	T3	T1	T2	T3
T1	0.591	0.458	0.530	0.094	0.078	0.090
T2	—	0.500	0.488	—	0.080	0.084
T3	—	—	0.670	—	—	0.109
	Posterior Mean of ${\hat{R}}_{e}$			Posterior SD of ${\hat{R}}_{e}$
	T1	T2	T3	T1	T2	T3
T1	0.151	0.115	0.119	0.003	0.002	0.003
T2	—	0.121	0.107	—	0.002	0.002
T3	—	—	0.131	—	—	0.003
	Posterior Mean of ${\hat{Σ}}_{E}$			Posterior SD of ${\hat{Σ}}_{E}$
	E1	E2	E3	E1	E2	E3
	0.854	0.740	0.937	0.167	0.184	0.210
	Posterior Mean of $\hat{β}$			Posterior SD of $\hat{β}$
	T1	T2	T3	T1	T2	T3
E1	15.046	8.006	7.054	0.406	0.326	0.365
E2	12.004	5.980	7.003	0.307	0.254	0.378
E3	14.104	9.003	8.053	0.464	0.407	0.434


Posterior mean and standard deviation (SD) of the β coefficients ( $\hat{β}$ ) of three traits (T1, T2, and T3) in three environments (E1, E2, E3) and the estimated variance–covariance components for the traits ( ${\hat{Σ}}_{t}$ ), for the residuals ( ${\hat{R}}_{e}$ ), and for the environments ( ${\hat{Σ}}_{E}$ ).

Also, with the proposed BMTME model, we simulated two data sets similar to the simulation study explained above, except that the environmental covariance matrix we used was an identity matrix. The first data set assumes that the genetic and residual correlation between traits was 0.85 for all pairs of traits under study, while the second data set assumes that the correlation between all pairs of traits was 0.2 for both covariance matrices ( $Σ_{t}$ and $R_{e}$ ). We implemented a 10-fold cross-validation (CV1). The training data set has 80% of the lines (64 lines), while the testing data set has the remaining 20% (16 lines). We assessed the prediction performance using the simulated data set under three conditions: (1) unstructured (Appendix A): assuming both variance–covariances are unstructured ( $Σ_{t}$ and $R_{e}$ ); (2) diagonal (Appendix B): assuming both variance–covariances are diagonal; and (3) standard (Appendix C): assuming both variance–covariances are identity multiplied by the scale parameters $σ_{t}^{2}$ and $σ_{e}^{2}$ , respectively.
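The partition of the 80 simulated lines into 64 training and 16 testing lines can be sketched as follows. This is only one possible reading: it assumes that each of the 10 folds is an independent random 80/20 split of lines, which the text does not spell out, and it does not show how test lines are then masked per environment under CV1. The function name and seed are hypothetical.

```python
import random


def random_line_splits(n_lines=80, n_folds=10, test_fraction=0.2, seed=1):
    """Return n_folds (training, testing) partitions of line indices."""
    rng = random.Random(seed)
    n_test = round(n_lines * test_fraction)
    splits = []
    for _ in range(n_folds):
        lines = list(range(n_lines))
        rng.shuffle(lines)                    # random order of the lines
        testing = sorted(lines[:n_test])      # first 20% go to the testing set
        training = sorted(lines[n_test:])     # remaining 80% form the training set
        splits.append((training, testing))
    return splits


splits = random_line_splits()
for training, testing in splits[:2]:
    print(len(training), len(testing))
```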

Real data sets

Maize data set:

A total of 309 doubled haploid maize lines were phenotyped and genotyped; this is part of the data set used by Crossa et al. (2013) that comprised a total of 504 doubled haploid lines derived by crossing and backcrossing eight inbred lines to form several full-sib families. Traits available in this data set include grain yield (Yield), anthesis-silking interval (ASI), and plant height (PH); each of these traits was evaluated in three optimum rainfed environments (E1, E2, and E3). The experimental field design in each of the three environments was an α-lattice incomplete block design with two replicates. Data were preadjusted using estimates of block and environmental effects derived from a linear model that accounted for the incomplete block design within environment and for environmental effects.

Information about genotyping-by-sequencing (GBS) data for each maize chromosome, the number of markers after initial filtering, and the number of markers after imputation, was summarized in Crossa et al. (2013). Filtering was first done by removing markers that had >80% of the maize lines with missing values, and then markers with minor allele frequency lower than or equal to 0.05 were deleted. The total number of GBS data was 681,257 single nucleotide polymorphisms (SNPs) and, after filtering for missing values and minor allele frequency, 158,281 SNPs were used for the analyses. About 20% of cells were missing in the filtered GBS information used for prediction; these missing values were replaced by their expected values before doing the prediction.
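The filtering and imputation steps described above can be sketched on a toy marker set. The code assumes a 0/1/2 allele-count coding with `None` for missing calls, and it takes the "expected value" used for imputation to be twice the observed allele frequency (the mean of the observed genotypes). Both are assumptions of this illustration and not details confirmed by the text; the function name is hypothetical.

```python
def filter_and_impute(markers, max_missing=0.80, min_maf=0.05):
    """Filter marker columns by missing rate and MAF, then impute missing calls.

    markers: list of marker columns, each a list with 0, 1, 2 or None per line.
    """
    kept = []
    for column in markers:
        observed = [g for g in column if g is not None]
        missing_rate = 1.0 - len(observed) / len(column)
        if missing_rate > max_missing:        # more than 80% of lines missing
            continue
        p = sum(observed) / (2.0 * len(observed))   # frequency of the counted allele
        maf = min(p, 1.0 - p)
        if maf <= min_maf:                    # minor allele frequency <= 0.05
            continue
        expected = 2.0 * p                    # assumed expected genotype value
        kept.append([expected if g is None else g for g in column])
    return kept


toy_markers = [
    [0, 1, 2, None, 1],                # kept; the missing call is imputed as 1.0
    [None, None, None, None, None],    # removed: 100% missing
    [0, 0, 0, 0, 0],                   # removed: monomorphic (MAF = 0)
    [2, None, None, None, None],       # removed: MAF = 0 among observed calls
]
filtered = filter_and_impute(toy_markers)
print(filtered)
```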

Wheat data set:

A total of 250 wheat lines were extracted from a large set of 39 yield trials grown during the 2013–2014 crop season in Ciudad Obregon, Sonora, Mexico (Rutkoski et al. 2016). The trials were sown in mid-November and grown on beds with 5 and 2 irrigations plus drip irrigation. Days to heading (DTHD) were recorded as the number of days from germination until 50% of spikes had emerged in each plot, in the first replicate of each trial. Grain yield (GRYLD) was the total plot grain yield measured after maturity, and plant height (PTHT) was recorded in centimeters.

Image data of the yield trials were collected using a hyper-spectral camera (A-series, Micro-Hyperspec VNIR, Headwall Photonics, Fitchburg, Massachusetts) mounted on a manned aircraft. From these data, vegetative indices for each plot were calculated. The green normalized difference vegetation index (GNDVI) was one of the traits used in this study. GNDVI is considered a good predictor when used with pedigree and/or genomic prediction of GRYLD in wheat due to its high heritability and genetic correlation with GRYLD. Also, GNDVI can be measured remotely in large numbers of candidates for selection.

Genotyping-by-sequencing was used for genome-wide genotyping. Single nucleotide polymorphisms were called across all lines using the TASSEL GBS pipeline anchored to the genome assembly of Chinese Spring. Single nucleotide polymorphism calls were extracted and markers were filtered so that percent missing data did not exceed 80% and 20%, respectively. Individuals with >80% missing marker data were removed, and markers were recorded as −1, 0, and 1, indicating homozygous for the minor allele, heterozygous, and homozygous for the major allele, respectively. Next, markers with <0.01 minor allele frequency were removed, and missing data were imputed with the marker mean. A total of 12,083 markers remained after marker editing.
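With markers coded −1, 0, and 1 as described, a genomic relationship matrix can be computed along the lines of VanRaden (2008). The sketch below assumes his first method, $G = Z Z^{T} / (2 \sum_{k} p_{k} (1 - p_{k}))$, with allele frequencies estimated from the genotyped lines themselves; the text does not say which variant was used, so this is an illustration and not the authors' exact computation. The function name and the toy matrix are hypothetical.

```python
def vanraden_grm(M):
    """Genomic relationship matrix from an n_lines x n_markers matrix coded -1/0/1."""
    n, m = len(M), len(M[0])
    # Column means; with -1/0/1 coding the allele frequency is p = (mean + 1) / 2.
    means = [sum(M[i][k] for i in range(n)) / n for k in range(m)]
    p = [(mu + 1.0) / 2.0 for mu in means]
    # Centering by 2(p - 0.5) is the same as subtracting the column mean.
    Z = [[M[i][k] - means[k] for k in range(m)] for i in range(n)]
    denom = 2.0 * sum(pk * (1.0 - pk) for pk in p)
    return [[sum(Z[i][k] * Z[j][k] for k in range(m)) / denom
             for j in range(n)]
            for i in range(n)]


# Toy example: three lines and three markers, no missing values.
M = [[1, 0, -1],
     [0, 1, 1],
     [-1, -1, 0]]
G = vanraden_grm(M)
for row in G:
    print([round(value, 3) for value in row])
```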

Data and codes repository

The phenotypic and genotypic information of the two data sets included in this study as well as the R package for performing the analyses can be downloaded from the link: http://hdl.handle.net/11529/10646. This link contains the phenotypic data on maize (Data.maize) and wheat (Data.trigo), as well as genomic data on maize (G.maize) and wheat (G.trigo). Also, the link includes the BMTME.zip with the R package used to perform the analyses under the BMTME model.

The BMTME R package

Nowadays, the R programming language is a popular tool in statistical science for analyzing and visualizing data (R Core Team 2015). However, in the context of big data with complex models, R can be slow. For this reason, R is often combined with C++ code to produce high-performance programs that run considerably faster (Stroustrup 2000; Eddelbuettel and Sanderson 2014). The R package we developed for fitting the BMTME models merges R and C++ through the use of Rcpp together with the Armadillo C++ library (Sanderson 2010; Eddelbuettel 2013). Appendix E describes how the three-way data should be arranged and Appendix F explains the basic input needed to run the routines built in the R package for fitting the BMTME.

Data availability

The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article.

Results

Results for the simulated data sets and for the real data sets (maize and wheat) are shown below.

Simulated data set

Table 1 gives the posterior mean and posterior SD for β coefficients ( $β^{T}$ ) for each trait and for the variance–covariance matrices ( $Σ_{t},$ $R_{e}, Σ_{E}$ ). The estimates of the posterior means for the β coefficients ( $β^{T}$ ) and for the variance–covariance matrices ( $Σ_{t},$ $R_{e}$ ) are very close to the true values, while the estimates of the diagonal covariance matrix ( $Σ_{E}$ ) are slightly overestimated. Although the diagonal covariance matrix of $Σ_{E}$ is slightly overestimated according to the performed simulation study, we have evidence that the proposed BMTME model does reasonably well in terms of parameter estimation. We also tested the proposed BMTME model with another set of parameters and our results agree with the above mentioned results.

Table 2 shows the resulting prediction accuracy (Pearson correlation and MSEP) for each environment–trait combination for the two simulated data sets; we also present the ranking of the BMTME model under the three conditions (unstructured, diagonal, and standard) for each environment–trait combination. Based on the average rankings given in Table 2, the best prediction accuracy for both data sets (low and high correlation between traits) and under both criteria was achieved when the model assumed an unstructured variance–covariance matrix for both $Σ_{t}$ and $R_{e}$. The second-best condition was the diagonal one (a diagonal matrix for $Σ_{t}$ and $R_{e}$) in terms of MSEP, but the standard one in terms of the Pearson correlation. In terms of the Pearson correlation, for both data sets the unstructured condition gave the best prediction accuracy in five of the nine environment–trait combinations, the standard condition in three of nine, and the diagonal condition in only one of nine.
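The ranks and average ranks reported in Table 2 can be recomputed from the tabulated means. The sketch below does this for the Pearson correlations of the low-correlation data set. Because the tabulated values are rounded to two decimals, ties can occur (for example U and D in E3-T3); here they are broken by the order U, D, S, which is an assumption of this illustration that happens to reproduce the published ranks.

```python
# Mean Pearson correlations (low-correlation data set) for the nine
# environment-trait combinations E1-T1, ..., E3-T3, copied from Table 2.
correlation = {
    "U": [0.17, 0.30, 0.51, 0.69, 0.66, 0.72, 0.59, 0.80, 0.66],
    "D": [0.14, 0.24, 0.25, 0.71, 0.64, 0.67, 0.65, 0.61, 0.66],
    "S": [0.22, 0.27, 0.43, 0.66, 0.52, 0.71, 0.70, 0.71, 0.69],
}


def rank_methods(scores, higher_is_better=True):
    """Rank the methods within each case; rank 1 is the best."""
    n_cases = len(next(iter(scores.values())))
    sign = -1.0 if higher_is_better else 1.0
    ranks = {method: [] for method in scores}
    for k in range(n_cases):
        # sorted() is stable, so ties keep the insertion order U, D, S.
        ordered = sorted(scores, key=lambda method: sign * scores[method][k])
        for position, method in enumerate(ordered, start=1):
            ranks[method].append(position)
    return ranks


ranks = rank_methods(correlation)
average_rank = {method: sum(r) / len(r) for method, r in ranks.items()}
first_places = {method: r.count(1) for method, r in ranks.items()}
print(average_rank, first_places)
```

For MSEP the same function applies with `higher_is_better=False`, since smaller values indicate better predictions.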

Table 2. Simulated data with three traits and three environments.

Method	E-T	Low Correlation Between Traits (0.20)						High Correlation Between Traits (0.85)
		Correlation			MSPE			Correlation			MSPE
		Mean	SE	R^a	Mean	SE	R^a	Mean	SE	R^a	Mean	SE	R^a
	E1-T1	0.17	0.15	2	1.27	0.15	2	0.54	0.17	1	0.96	0.21	2
	E1-T2	0.30	0.22	1	0.74	0.11	1	0.66	0.11	1	0.88	0.12	1
	E1-T3	0.51	0.12	1	0.93	0.15	1	0.59	0.09	1	1.10	0.22	1
	E2-T1	0.69	0.09	2	0.87	0.17	2	0.72	0.06	2	0.85	0.14	1
U	E2-T2	0.66	0.10	1	0.73	0.08	2	0.74	0.07	1	0.79	0.08	1
	E2-T3	0.72	0.04	1	0.79	0.14	1	0.70	0.07	2	0.99	0.18	2
	E3-T1	0.59	0.14	3	1.51	0.30	2	0.66	0.10	3	1.27	0.22	2
	E3-T2	0.80	0.06	1	0.95	0.15	1	0.77	0.07	1	1.11	0.16	1
	E3-T3	0.66	0.05	2	1.79	1.79	1	0.67	0.07	3	1.70	0.36	1
	Ave	0.57	0.11	1.56	1.06	0.34	1.44	0.67	0.09	1.67	1.07	0.19	1.33
	E1-T1	0.14	0.16	3	1.07	0.13	1	0.48	0.18	3	0.84	0.14	1
	E1-T2	0.24	0.20	3	0.78	0.07	2	0.43	0.17	3	1.01	0.11	2
	E1-T3	0.25	0.12	3	1.18	0.12	2	0.55	0.09	3	1.21	0.22	2
	E2-T1	0.71	0.07	1	0.85	0.16	1	0.76	0.04	1	0.92	0.14	2
D	E2-T2	0.64	0.07	2	0.71	0.16	1	0.70	0.05	2	0.90	0.13	2
	E2-T3	0.67	0.08	3	0.91	0.18	3	0.66	0.08	3	1.15	0.24	3
	E3-T1	0.65	0.11	2	1.26	0.35	1	0.73	0.06	2	1.19	0.27	1
	E3-T2	0.61	0.16	3	1.43	0.26	3	0.63	0.13	3	1.66	0.24	3
	E3-T3	0.66	0.04	3	2.02	2.02	3	0.69	0.07	2	1.81	0.34	3
	Ave	0.51	0.11	2.56	1.13	0.38	1.89	0.63	0.10	2.44	1.19	0.20	2.11
	E1-T1	0.22	0.17	1	1.32	0.20	3	0.53	0.18	2	1.08	0.23	3
	E1-T2	0.27	0.19	2	0.99	0.25	3	0.52	0.16	2	1.22	0.30	3
	E1-T3	0.43	0.09	2	1.23	0.18	3	0.55	0.13	2	1.48	0.38	3
	E2-T1	0.66	0.07	3	1.10	0.23	3	0.70	0.06	3	1.17	0.20	3
S	E2-T2	0.52	0.11	3	1.02	0.13	3	0.60	0.07	3	1.16	0.13	3
	E2-T3	0.71	0.07	2	0.91	0.19	2	0.73	0.07	1	0.94	0.18	1
	E3-T1	0.70	0.09	1	1.54	0.32	3	0.77	0.05	1	1.34	0.23	3
	E3-T2	0.71	0.10	2	1.03	0.18	2	0.69	0.09	2	1.17	0.18	2
	E3-T3	0.69	0.06	1	1.96	0.39	2	0.73	0.07	1	1.72	0.36	2
	Ave	0.55	0.11	1.89	1.23	0.23	2.67	0.65	0.10	1.89	1.25	0.24	2.56


Mean and standard error (SE) of the estimated correlations and Mean Squared Prediction Error (MSPE) from the 10-fold cross-validation CV1. The BMTME model was fitted using unstructured (U), diagonal (D), and standard (S) variance–covariance matrix. Environment (E1, E2, E3)–trait (T1, T2, T3) combination. Method stands for the variance–covariance matrix used with the BMTME, E-T for the environment–trait combination, R for rank, and Ave for average.

Since three conditions are compared (unstructured, diagonal, and standard), the values of the ranks range from 1 to 3, and the lower the values, the better the prediction accuracy. For ties, we assigned the average of the ranks that would have been assigned had there been no ties.

In terms of MSEP, the unstructured BMTME performed best in five (low correlation between traits) and six (high correlation between traits) of the nine environment–trait combinations, the diagonal BMTME in four (low) and two (high) of nine combinations, and the standard BMTME in zero (low) and one (high) of nine combinations. Averaged over the nine environment–trait combinations, the unstructured BMTME also gave the best prediction under both criteria: correlation = 0.57 and MSEP = 1.06 for low correlation between traits, and correlation = 0.67 and MSEP = 1.07 for high correlation between traits. Thus, in both data sets the unstructured BMTME model had the best prediction accuracy, and its accuracy in terms of the Pearson correlation increased with the correlation between traits: the average prediction correlation under the unstructured BMTME was 17.5% higher when the correlation between traits was 0.85 (0.67) than when it was 0.2 (0.57).
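The two prediction criteria and the relative gain quoted above can be illustrated with a minimal sketch. The function names and the toy observed/predicted values are hypothetical and are not taken from the BMTME package; only the averages 0.57 and 0.67 come from Table 2.

```python
import math

def pearson_correlation(observed, predicted):
    """Pearson correlation between observed and predicted values (higher is better)."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)

def mspe(observed, predicted):
    """Mean squared prediction error (lower is better)."""
    return sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)

def relative_gain_percent(new, baseline):
    """Percentage change of `new` relative to `baseline`."""
    return 100.0 * (new - baseline) / baseline

# Hypothetical testing fold: four observed phenotypes and their predictions.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 1.5, 3.5, 3.5]
fold_correlation = pearson_correlation(observed, predicted)
fold_mspe = mspe(observed, predicted)

# Average correlations of the unstructured BMTME in Table 2 (high vs. low scenario).
gain = relative_gain_percent(0.67, 0.57)
```

In a 10-fold cross-validation, both criteria would be computed per fold and then summarized by their mean and SE, as in Table 2.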

Maize data set

Table 3 shows that, for each trait, the β coefficients differ moderately among environments. For Yield and PH, the largest and smallest β coefficients were observed in environments E1 and E2, respectively, while for trait ASI, the largest β coefficient was observed in E3 and the smallest in E2. The genetic estimates of the variance–covariance components of traits are given in ${\hat{Σ}}_{t}$ , where the correlation between traits is moderate. Yield and ASI have a negative correlation (−0.27), and the correlation between ASI and PH is also negative (−0.25), while the correlation between Yield and PH is 0.41. The same pattern is observed in the residual correlations between traits, although they are smaller in magnitude.

Table 3. Maize data.

	Posterior Mean of ${\hat{Σ}}_{t}$			Posterior SD of ${\hat{Σ}}_{t}$
	Yield	ASI	PH	Yield	ASI	PH
Yield	1.666	−0.260	0.069	0.430	0.210	0.030
ASI	−0.158	1.631	−0.046	—	0.430	0.030
PH	0.315	−0.212	0.028	—	—	0.010
	Posterior Mean of ${\hat{R}}_{e}$			Posterior SD of ${\hat{R}}_{e}$
	Yield	ASI	PH	Yield	ASI	PH
Yield	0.506	−0.077	0.022	0.050	0.030	0.000
ASI	−0.151	0.512	−0.012	—	0.050	0.000
PH	0.278	−0.153	0.013	—	—	0.000
	Posterior Mean of ${\hat{Σ}}_{E}$			Posterior SD of ${\hat{Σ}}_{E}$
	E1	E2	E3	E1	E2	E3
	0.663	0.655	0.898	0.0369	0.0320	0.0349
	Posterior Mean of $\hat{β}$			Posterior SD of $\hat{β}$
	Yield	ASI	PH	Yield	ASI	PH
E1	6.445	1.872	2.354	0.210	0.280	0.030
E2	4.958	1.147	2.066	0.280	0.330	0.040
E3	6.102	2.276	2.341	0.290	0.300	0.040
	Correlation in the Entire Data			MSPE in the Entire Data
	Yield	ASI	PH	Yield	ASI	PH
E1	0.796	0.756	0.646	0.428	0.233	0.012
E2	0.769	0.798	0.757	0.245	0.605	0.007
E3	0.799	0.794	0.763	0.480	0.338	0.011


Posterior mean and SD of the β coefficients ( $\hat{β}$ ) for three traits, Yield, anthesis-silking interval (ASI), and plant height (PH), in three environments (E1, E2, and E3). Estimated variance–covariance components for the traits ( ${\hat{Σ}}_{t}$ ), the environments ( ${\hat{Σ}}_{E}$ ), and the residuals ( ${\hat{R}}_{e}$ ). In ${\hat{Σ}}_{t}$ and ${\hat{R}}_{e}$ , the upper triangle contains the variance–covariance components and the lower triangle contains the correlations. ${\hat{Σ}}_{E}$ is a diagonal matrix.
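The relation between the upper triangle (covariances) and the lower triangle (correlations) of ${\hat{Σ}}_{t}$ in Table 3 can be checked with a short sketch. The helper name `cov_to_corr` is hypothetical; the matrix entries are the rounded posterior means from Table 3, so the recomputed correlations agree with the tabulated ones only up to rounding.

```python
import math

def cov_to_corr(cov):
    """Convert a covariance matrix (list of lists) into a correlation matrix."""
    p = len(cov)
    sd = [math.sqrt(cov[i][i]) for i in range(p)]
    return [[cov[i][j] / (sd[i] * sd[j]) for j in range(p)] for i in range(p)]

# Symmetric matrix rebuilt from the diagonal and upper triangle of Table 3
# (trait order: Yield, ASI, PH).
sigma_t = [
    [1.666, -0.260, 0.069],
    [-0.260, 1.631, -0.046],
    [0.069, -0.046, 0.028],
]
corr_t = cov_to_corr(sigma_t)

for row in corr_t:
    print([round(value, 3) for value in row])
```

Because the inputs are rounded to three decimals, small differences in the last digit relative to the lower triangle of Table 3 are expected.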

Table 4 shows the prediction accuracies (Correlation and MSEP) for each environment–trait combination and the ranking of the three conditions studied for each criterion in the maize testing data set for cross-validation CV1. From the ranking, the best condition is the standard model, since it was the best in five of nine environment–trait combinations in terms of correlation, while in terms of MSEP, the diagonal model was the best in three of nine environment–trait combinations. As for the averages of the environment–trait combinations, the standard model was also the best in terms of both criteria. The second-best model was the diagonal, and the unstructured model was the worst in terms of both criteria. This can be explained by the low correlation between traits that exists for this maize data set.

Table 4. Maize data.

		Correlation			MSPE
BMTME	Environment–Trait	Mean	SE	Rank^a	Mean	SE	Rank
	E1-Yield	0.28	0.07	3	0.74	0.08	3.00
	E2-Yield	0.40	0.09	1.5	0.39	0.06	2.50
	E3-Yield	0.37	0.08	2.5	0.02	0.01	1.50
	E1-ASI	0.39	0.08	2	0.37	0.03	1.50
Unstructured	E2-ASI	0.46	0.06	2.5	1.26	0.35	3.00
	E3-ASI	0.42	0.05	2	0.01	0.00	1.50
	E1-PH	0.37	0.06	2	0.86	0.07	3.00
	E2-PH	0.26	0.08	3	0.48	0.07	3.00
	E3-PH	0.44	0.07	2.5	0.02	0.00	2.00
	Average	0.37	0.07	2.33	0.46	0.08	2.33
	E1-Yield	0.30	0.07	1.5	0.73	0.07	2.00
	E2-Yield	0.40	0.08	1.5	0.36	0.03	1.00
	E3-Yield	0.37	0.05	2.5	0.85	0.07	3.00
	E1-ASI	0.40	0.09	1	0.39	0.06	3.00
Diagonal	E2-ASI	0.46	0.06	2.5	1.25	0.35	2.00
	E3-ASI	0.27	0.08	3	0.48	0.07	3.00
	E1-PH	0.36	0.07	3	0.02	0.01	1.00
	E2-PH	0.41	0.06	1	0.01	0.00	1.00
	E3-PH	0.44	0.06	2.5	0.02	0.00	2.00
	Average	0.38	0.07	2.06	0.46	0.08	2.00
	E1-Yield	0.30	0.07	1.5	0.72	0.07	1.00
	E2-Yield	0.39	0.09	3	0.39	0.06	2.50
	E3-Yield	0.38	0.08	1	0.02	0.01	1.50
	E1-ASI	0.38	0.08	3	0.37	0.03	1.50
Standard	E2-ASI	0.48	0.06	1	1.24	0.35	1.00
	E3-ASI	0.43	0.05	1	0.01	0.00	1.50
	E1-PH	0.39	0.06	1	0.84	0.06	2.00
	E2-PH	0.27	0.08	2	0.47	0.07	2.00
	E3-PH	0.45	0.07	1	0.02	0.00	2.00
	Average	0.39	0.07	1.61	0.45	0.07	1.67


Mean and SE of the estimated correlations and MSPE from the 10-fold cross-validation CV1. The BMTME model was fitted using unstructured, diagonal, and standard variance–covariance matrices. Environment (E1, E2, E3)–trait (Yield, ASI, PH) combination.

Since three BMTME models are fitted (unstructured, diagonal, and standard) the values of the ranks ranged from 1 to 3, and the lower the values, the better the prediction accuracy. For ties, we assigned the average of the ranks that would have been assigned had there been no ties.
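The ranking rule with averaged ties can be sketched as follows. The function `average_ranks` is a hypothetical helper, not part of the BMTME package; the input values are the rounded means reported in Table 4, and the published ranks may have been computed from unrounded values.

```python
def average_ranks(values, higher_is_better):
    """Rank values from best (1) to worst; tied values share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i],
                   reverse=higher_is_better)
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the block while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared_rank = ((pos + 1) + (end + 1)) / 2.0
        for k in range(pos, end + 1):
            ranks[order[k]] = shared_rank
        pos = end + 1
    return ranks

# Model order: unstructured, diagonal, standard.
# E2-Yield correlations in Table 4 (higher is better): two models tie for first.
correlation_ranks = average_ranks([0.40, 0.40, 0.39], higher_is_better=True)
# E1-Yield MSPE in Table 4 (lower is better): no ties.
mspe_ranks = average_ranks([0.74, 0.73, 0.72], higher_is_better=False)
```

For these two rows the sketch returns ranks 1.5, 1.5, 3 for the correlation and 3, 2, 1 for the MSPE, which match the corresponding entries of Table 4.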

Table 5 provides the results of cross-validation CV2 for the maize testing data set. In this scheme, the trait Yield is unobserved for all lines in one environment (for example, E1), while data on the other two traits are available for this environment (E1) and data on all three traits are available for the other two environments (E2 and E3). The best model in terms of both correlation and MSEP for predicting Yield for all lines in E1 was the standard model (0.215, 41.714), followed by the diagonal (0.168, 43.691) and the unstructured model (0.163, 46.407). For E2-Yield and E3-Yield, the unstructured model was the best in terms of the Pearson correlation. In terms of MSEP, the unstructured model was also the best for predicting Yield in E2 (23.671), followed by the standard (24.943) and diagonal (31.586) models. For E3-Yield, the best predictive model in terms of MSEP was the BMTME standard (37.383), followed by the unstructured model (39.946).
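The CV2 partition described above can be sketched as a simple split of long-format records. The record layout and the function `cv2_split` are hypothetical illustrations and do not reflect how the BMTME package stores data; the sketch only shows which observations are withheld.

```python
def cv2_split(records, target_trait, target_env):
    """Withhold one trait in one environment for all lines; keep everything else."""
    train, test = [], []
    for record in records:
        if record["trait"] == target_trait and record["env"] == target_env:
            test.append(record)   # to be predicted
        else:
            train.append(record)  # available for model fitting
    return train, test

# Hypothetical design: 2 lines x 3 environments x 3 traits.
lines = ["line1", "line2"]
envs = ["E1", "E2", "E3"]
traits = ["Yield", "ASI", "PH"]
records = [{"line": l, "env": e, "trait": t}
           for l in lines for e in envs for t in traits]

train, test = cv2_split(records, target_trait="Yield", target_env="E1")
```

Under this split, ASI and PH in E1 and all three traits in E2 and E3 remain in the training set, so the prediction of Yield in E1 can borrow information across both traits and environments.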

Table 5. Maize data.

	Unstructured		Diagonal		Standard
Environment–Trait	Correlation	MSPE	Correlation	MSPE	Correlation	MSPE
E1-Yield	0.163	46.407	0.168	43.691	0.215	41.714
E2-Yield	0.405	23.671	0.156	31.586	0.214	24.943
E3-Yield	0.298	39.946	0.243	42.735	0.247	37.383


Mean of the estimated correlations and MSPE for predicting the trait Yield for all lines in each environment. The BMTME was fitted using unstructured, diagonal, and standard variance–covariance matrices. Environment (E1, E2, and E3)–trait Yield.

Wheat data set

Table 6 shows that the β coefficients differ markedly between traits and environments for the wheat data set. In environment Bed2IR, the largest β coefficients were observed in traits DTHD and GNDVI, respectively, while in environment Drip, the largest β coefficients were observed in traits GNDVI and GRYLD, respectively. The genetic estimates of the variance–covariance components of traits are given in ${\hat{Σ}}_{t}$ , where the largest correlations were observed between trait DTHD and traits GNDVI, GRYLD, and PTHT; the same is true for the residual correlation between traits ( ${\hat{R}}_{e}$ ). For the entire data set, prediction accuracies are high in terms of correlation but less precise in terms of MSEP, mostly for trait PTHT in the three environments.

Table 6. Wheat data.

	Posterior Mean of ${\hat{Σ}}_{t}$				Posterior SD of ${\hat{Σ}}_{t}$
	DTHD	GNDVI	GRYLD	PTHT	DTHD	GNDVI	GRYLD	PTHT
DTHD	16.172	0.028	−0.413	−4.505	0.696	0.002	0.047	0.619
GNDVI	0.7348	0.000	0.000	−0.010	—	0.000	0.000	0.002
GRYLD	−0.386	−0.19	0.071	0.111	—	—	0.008	0.061
PTHT	−0.386	−0.35	0.144	8.442	—	—	—	1.023
	Posterior Mean of ${\hat{R}}_{e}$				Posterior SD of ${\hat{R}}_{e}$
	DTHD	GNDVI	GRYLD	PTHT	DTHD	GNDVI	GRYLD	PTHT
DTHD	0.523	−0.003	0.112	0.606	0.214	0.001	0.035	0.393
GNDVI	−0.453	0.000	0.000	−0.002	—	0.000	0.000	0.002
GRYLD	0.569	−0.192	0.074	0.561	—	—	0.008	0.077
PTHT	0.215	−0.048	0.530	15.214	—	—	—	1.327
	Posterior Mean of ${\hat{Σ}}_{E}$				Posterior SD of ${\hat{Σ}}_{E}$
	Bed2IR	Bed5IR	Drip		Bed2IR	Bed5IR	Drip
	0.461	1.326	0.014	—	0.076	0.189	0.020	—
	Posterior Mean of $\hat{β}$				Posterior SD of $\hat{β}$
	DTHD	GNDVI	GRYLD	PTHT	DTHD	GNDVI	GRYLD	PTHT
Bed2IR	−3.202	−4.061	−0.312	−0.004	0.227	0.233	0.223	0.001
Bed5IR	−0.011	0.006	−0.135	−0.341	0.001	0.001	0.023	0.024
Drip	−0.407	−4.595	−7.368	−0.576	0.023	0.292	0.295	0.291
	Correlation in the Entire Data				MSPE in the Entire Data
	DTHD	GNDVI	GRYLD	PTHT	DTHD	GNDVI	GRYLD	PTHT
Bed2IR	0.999	0.930	0.906	0.906	0.115	0.000	0.018	10.187
Bed5IR	0.998	0.943	0.885	0.666	0.227	0.000	0.059	8.284
Drip	0.992	0.908	0.873	0.871	0.352	0.000	0.069	13.839


Posterior mean and SD of the β coefficients ( $\hat{β}$ ) for four traits (DTHD, GNDVI, GRYLD, and PTHT) in three environments (Bed2I, Bed5I, and Drip). Estimated variance–covariance components for traits ( ${\hat{Σ}}_{t}$ ) and for residuals ( ${\hat{R}}_{e}$ ). In ${\hat{Σ}}_{t}$ and ${\hat{R}}_{e}$ , the upper triangle contains the variance–covariance components and the lower triangle contains the correlations.

Table 7 gives the prediction accuracy of the wheat data set for the testing data set for each environment–trait combination; it also gives the ranking of the three conditions studied under both criteria for cross-validation CV1. The best case is when the BMTME model assumes an unstructured variance–covariance matrix for both $Σ_{t}$ and $R_{e}$ and a diagonal matrix for the variance–covariance for $Σ_{E}$, followed by BMTME with a diagonal matrix for $Σ_{t}$, $R_{e}$, and $Σ_{E}$. As for the ranking in terms of Pearson correlation, in 6 of 12 groups the BMTME unstructured model performed better in terms of prediction accuracy, while the BMTME diagonal model was the second-best model since it was the best model in 3 of 12 cases; the BMTME standard model was the worst model in terms of prediction accuracy since it was the best in only 1 of 12 cases. In terms of MSEP, the BMTME unstructured model performed better in 5 of 12 cases, the BMTME diagonal model was the best model in only 3 of 12 cases, and the BMTME standard model was the best in 1 of 12 cases. The BMTME unstructured model also had the best average prediction accuracy of the 12 groups.

Table 7. Wheat data.

		Correlation			MSEP
Method	Environment–trait	Mean	SE	Rank^a	Mean	SE	Rank
	Bed2I-DTHD	0.93	0.03	2	4.82	1.60	2
	Bed2I-GNVI	0.79	0.08	1.5	6.8E-05	0.00	2
	Bed2I-GRYLD	0.64	0.12	1	0.05	0.01	1
	Bed2I-PTHT	0.60	0.18	1	26.55	11.90	2
	Bed5I-DTHD	0.76	0.13	2	17.61	7.63	1
U	Bed5I-GNVI	0.60	0.20	1	9.7E-05	0.00	2
	Bed5I-GRYLD	0.35	0.32	2	0.23	0.09	2
	Bed5I-PTHT	0.46	0.16	3	11.90	3.40	3
	Drip-DTHD	0.95	0.02	1	2.66	0.81	1
	Drip-GNVI	0.68	0.21	1.5	0.00	0.00	2
	Drip-GRYLD	0.67	0.17	1	0.13	0.05	1
	Drip-PTHT	0.69	0.08	1	22.68	10.21	1
	Ave	0.68	0.14	1.50	7.22	2.98	1.67
	Bed2I-DTHD	0.95	0.01	1	4.44	0.57	1
	Bed2I-GNVI	0.79	0.01	1.5	6.3E-05	0.00	2
	Bed2I-GRYLD	0.60	0.04	2	0.06	0.00	2
	Bed2I-PTHT	0.56	0.06	2	28.24	3.70	2
	Bed5I-DTHD	0.79	0.04	1	16.51	2.37	1
D	Bed5I-GNVI	0.66	0.06	2	8.4E-05	0.00	2
	Bed5I-GRYLD	0.38	0.09	1	0.22	0.02	1
	Bed5I-PTHT	0.47	0.06	2	11.86	1.10	2
	Drip-DTHD	0.94	0.01	2	4.34	0.53	2
	Drip-GNVI	0.68	0.06	1.5	0.00	0.00	2
	Drip-GRYLD	0.59	0.06	3	0.14	0.02	2
	Drip-PTHT	0.62	0.03	2	23.83	3.14	2
	Ave	0.67	0.04	1.75	7.47	0.95	1.75
	Bed2I-DTHD	0.94	0.05	3	17.37	6.94	3
	Bed2I-GNVI	0.33	0.21	3	0.00	0.00	2
	Bed2I-GRYLD	0.58	0.18	3	0.07	0.01	3
	Bed2I-PTHT	0.56	0.23	3	32.70	14.42	3
	Bed5I-DTHD	0.78	0.14	3	30.94	8.83	3
S	Bed5I-GNVI	0.46	0.25	3	0.00	0.00	2
	Bed5I-GRYLD	0.38	0.33	3	0.24	0.09	3
	Bed5I-PTHT	0.41	0.18	1	9.88	3.18	1
	Drip-DTHD	0.93	0.05	3	7.27	2.56	3
	Drip-GNVI	0.43	0.14	3	0.00	0.00	2
	Drip-GRYLD	0.55	0.18	2	0.17	0.08	3
	Drip-PTHT	0.61	0.16	3	28.84	12.68	3
	Ave	0.58	0.17	2.75	10.62	4.07	2.58


Mean and SE of the estimated correlations and MSPE from the 10-fold cross-validation CV1. The BMTME model was fitted using unstructured (U), diagonal (D), and standard (S) variance–covariance matrices. Environment (Bed2I, Bed5I, Drip)–trait [days to heading (DTHD), GNDVI, grain yield (GRYLD), and plant height (PTHT)] combination. Method stands for the three variance–covariance matrices used with the BMTME.

Since three BMTME models are fitted (unstructured, diagonal, and standard), the values of the ranks ranged from 1 to 3, and the lower the values, the better the prediction accuracy. For ties, we assigned the average of the ranks that would have been assigned had there been no ties.

Table 8 gives the results of cross-validation CV2, which assumes that trait GRYLD is lacking in one environment for all lines but not in the other environments. The results given are only for the testing data set (trait GRYLD missing for all lines in one environment). In terms of the Pearson correlation, the best model for predicting GRYLD for all lines in environment Bed2I was the BMTME unstructured model (0.648), followed by the BMTME diagonal (0.589) and, in the last position, the BMTME standard model (0.580). In terms of MSEP, the ranking is exactly the opposite (0.085, 0.079, and 0.076, respectively). In environment Bed5I, the best predictive model in terms of the Pearson correlation was the BMTME standard (0.187), then the BMTME unstructured (0.173), and, in the last position, the BMTME diagonal model (0.164). In terms of MSEP, the best model was the unstructured model, then the standard, and, at the end, the diagonal model. In environment Drip, the ranking of models based on both criteria was as follows: BMTME unstructured, BMTME diagonal, and BMTME standard.

Table 8. Wheat data.

	Unstructured		Diagonal		Standard
Environment–trait	Correlation	MSEP	Correlation	MSEP	Correlation	MSEP
Bed2I-GRYLD	0.648	0.085	0.589	0.079	0.580	0.076
Bed5I-GRYLD	0.173	0.342	0.164	0.408	0.187	0.343
Drip-GRYLD	0.634	0.246	0.516	0.264	0.420	0.304


Mean of the estimated correlations and MSPE for the prediction of the trait grain yield (GRYLD) for all lines in each environment (Bed2I, Bed5I, Drip). The BMTME was fitted using unstructured, diagonal, and standard variance–covariance matrices.

Discussion

To our knowledge, this is the first statistical three-way genomic model for assessing the prediction accuracy of trait × genotype × environment. Other models for assessing multi-traits or multi-environments have been extensively studied in the related literature (see, for example, Jarquín et al. 2014; Montesinos-López et al. 2015); however, none of them have simultaneously assessed and modeled the three-way variance–covariance structure. The BMTME model does this task simultaneously using Bayesian estimation and the package for performing such a task is given in this article.

Performance of the BMTME model in simulated and real data sets

In the simulated data sets, the best prediction accuracies were achieved with the BMTME model that assumes an unstructured variance–covariance matrix for the genetic and residual components of traits, even when the correlation between traits was low. In terms of MSEP, it was followed by the model that assumed a diagonal variance–covariance matrix for both matrices, and then by the standard model, which was formed by an identity matrix multiplied by $σ_{t}^{2}$ and $σ_{e}^{2}$ for the genetic and residual variance–covariance matrices, respectively; in terms of the Pearson correlation, the standard model ranked second and the diagonal model third. The simulation study provides evidence that when the correlation between traits is high, it is important to use a multivariate model that takes this correlation into account to improve prediction accuracies.

This evidence is also supported by the results obtained with the wheat data set, where the BMTME unstructured model was the most accurate model, followed by the diagonal and finally by the standard model. However, with the maize data set, we did not observe any gain from using the unstructured variance–covariance matrix in comparison to the other two variance–covariance structures (diagonal and standard), possibly because in this data set the genetic and residual correlations between traits were low. Therefore, the main message is that when the correlation between traits is high (>0.5), it is important to estimate the unstructured variance–covariance matrix; when this correlation is low, it is enough to use the BMTME standard model, because with the unstructured model the results could be worse than those of the standard model. These suggestions are not new; they were also made by Calus and Veerkamp (2011), Jia and Jannink (2012), Guo et al. (2014), and Jiang et al. (2015) in the context of multi-trait analysis. Here we only point out that they are also valid in the multi-trait, multi-environment context, taking into account the T × G × E interaction term.

Our contribution to the traditional multi-trait model (proposed by Calus and Veerkamp 2011; Jia and Jannink 2012; Guo et al. 2014; Jiang et al. 2015) is that our model is also valid for multiple environments and includes the three-way (T × G × E) interaction term, which more realistically mimics the type of data that are very common in plant breeding programs, where genotypes are evaluated for multiple traits in multiple environments. We are also aware that normally distributed traits are not the only traits commonly measured in plant breeding programs. For this reason, models for multiple categorical ordinal traits, multiple count traits, or a mixture of types of traits are also needed to help breeders improve the process of selecting candidate genotypes.

Prediction assessment of the BMTME model

We introduced a Gibbs sampler for Bayesian analysis of multi-traits and multi-environments that takes into account the three-way (T × G × E) interaction term that uses simple conditional distribution to simulate the joint posterior distribution of all required unknown parameters in the WGP model. This model has the advantage that it uses Half- $t$ priors on each standard deviation term and uniform priors between −1 and 1 on each correlation of the covariance matrix of traits in order to achieve noninformativity and posterior inferences with low sensitivity to the choice of hyper-parameters for the variance–covariance matrices.

Since we modeled the correlation patterns separately for each factor as $G_{1} = G_{g} \otimes Σ_{t}$ and $G_{2} = Σ_{E} \otimes G_{g} \otimes Σ_{t}$ , this facilitates the interpretation of the contribution of every factor to the overall correlation structure. It also allows choosing specific covariance structures for each factor, which improves accuracy and makes model fitting easier. In addition, fewer parameters than an unstructured model are required. For example, for modeling $G_{2}$ under an unstructured model, we need to estimate $I J L (I J L + 1) / 2$ unknown parameters; this number of parameters is larger than the number of parameters required to be estimated using Kronecker products for a three-factor separable model that only needs $\frac{I (I + 1)}{2} + \frac{J (J + 1)}{2} + \frac{L (L + 1)}{2} - 2$ parameters. In our context, the number of parameters is lower, since we assumed a diagonal matrix for the variance–covariance matrix of environments and the matrix $G_{g}$ is given. Also, if needed, partial derivatives, inverse computation, and Cholesky decomposition of the overall covariance matrix are performed more easily on the factor-specific covariances because they have smaller dimensions. Therefore, the use of separable covariance matrices with Kronecker products has substantial computational advantages, besides improving interpretation and model fitting (Simpson et al. 2014). However, care needs to be exercised with the assumption of a Kronecker product structured variance–covariance matrix, especially in three-way multivariate data, because incorrect assumptions may lead to invalid conclusions (Roy and Leiva 2008).
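The difference between the two parameter counts given above can be made concrete with a small sketch. The function names and the example dimensions (3 environments, 10 lines, 3 traits) are hypothetical and chosen only for illustration; the two formulas are the ones stated in the text.

```python
def n_params_unstructured(I, J, L):
    """Free parameters of an unstructured (IJL x IJL) covariance matrix."""
    d = I * J * L
    return d * (d + 1) // 2

def n_params_separable(I, J, L):
    """Free parameters of a three-factor separable (Kronecker) covariance,
    using the count I(I+1)/2 + J(J+1)/2 + L(L+1)/2 - 2 given in the text."""
    return I * (I + 1) // 2 + J * (J + 1) // 2 + L * (L + 1) // 2 - 2

# Hypothetical dimensions: I environments, J lines, L traits.
I, J, L = 3, 10, 3
unstructured = n_params_unstructured(I, J, L)
separable = n_params_separable(I, J, L)
print(unstructured, separable)
```

Even for these small hypothetical dimensions the unstructured matrix requires thousands of parameters while the separable form requires fewer than one hundred; the count is lower still in the BMTME setting, where $Σ_{E}$ is diagonal and $G_{g}$ is given.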

Contributions and limitations of the BMTME model

This study clearly described the full conditional distributions for modeling the three-way (T × G × E) interaction term with multi-traits and multi-environments, which is of paramount importance for evaluating genotypic performance in target environments and for predicting yet-to-be observed phenotypes when the relative performance of genotypes varies across environments. Because the proposed model takes into account the correlation between traits and includes the three-way (T × G × E) interaction term, the BMTME can be a useful tool for efficiently selecting superior genotypes. The proposed BMTME model can be considered a Bayesian GBLUP for multiple traits and multiple environments since the marker information is taken into account in the GRM ( $G_{g}$ ). Some of the advantages of our model over standard software are: (a) it is able to estimate separable covariance matrices of the form $A \otimes B \otimes C$ , which is not possible with other software; (b) the estimation of three-way terms with covariance matrices of the form $A \otimes B \otimes C$ is more parsimonious, since fewer parameters are needed than when two factors are joined and the estimation process is performed using two separable covariance structures as $A \otimes B^{*}$ , where $B^{*}$ contains the covariance of the two factors $B$ and $C$ ; (c) convergence was not a major problem for our model, in contrast to the convergence problems of other software for complex data; and (d) our model facilitates the interpretation of the covariance matrices because we can estimate the three covariance matrices.

On the other hand, as expected, the disadvantage of the BMTME model is its high computational cost even under the optimized C++ developed and made available in this research article. Large numbers of lines might indeed cause some delays in the computation of such large numbers of parameters in the full conditionals. However, constant developments in computing science will soon reduce the computing time of the three-way BMTME model.

Finally, our proposed BMTME model can also be useful (a) in QTL-mapping studies, since some WGP methods are also commonly used for GWAS (Peters et al. 2012; Garrick and Fernando, 2013; Jiang et al. 2015), and (b) to include spatial information in the residual $R$ matrix of the proposed model. This information is often available from breeding programs since they measure geographical information of the plots where genotypes are tested in each environment; this could help improve prediction accuracy.

Conclusions

In this paper, we extended the multi-trait WGP model to the multi-trait and multi-environment WGP model. This unified WGP model takes into account the correlation between traits and the three-way interaction term (T × G × E). Additionally, a transparent derivation of all full conditional distributions required is given that allows us to propose an efficient Gibbs sampler that is easy to implement and produces precise parameter estimates with high noninformativity and posterior inferences with low sensitivity to the choice of hyper-parameters for the variance–covariance matrices. Finally, we successfully applied the proposed method to simulated and real data and found that when the correlation between the traits is high (>0.5), the proposed BMTME model with an unstructured covariance matrix should be preferred over the diagonal and standard methods to help improve prediction accuracy. However, when correlations are low, it is enough to use the BMTME standard model because if we use the unstructured model, the results could be worse than those of the standard model. The R-software package BMTME offers specialized and optimized C++ routines to efficiently perform the analyses under the proposed model.

Acknowledgments

We very much appreciate the International Maize and Wheat Improvement Center (CIMMYT) field collaborators, laboratory assistants, and technicians who collected the phenotypic and genotypic data used in this study.

Appendix A

Derivation of full conditional distributions for the BMTME unstructured model

Full conditional for $β$

\begin{matrix} P (β | ELSE) \propto P (Y | β, b_{1}, b_{2}, R_{e}) P (β | σ_{β}^{2}) \\ \propto \exp \left(- \frac{1}{2} {(Y - Xβ - \sum_{h = 1}^{2} Z_{h} b_{h})}^{T} R^{- 1} (Y - Xβ - \sum_{h = 1}^{2} Z_{h} b_{h}) - \frac{1}{2} {(β - β_{0})}^{T} Σ_{0}^{- 1} σ_{β}^{- 2} (β - β_{0})\right) \\ \propto \exp \left(- \frac{1}{2} {(β - {\tilde{β}}_{0})}^{T} {\tilde{Σ}}_{0}^{- 1} (β - {\tilde{β}}_{0})\right) \propto N ({\tilde{β}}_{0}, {\tilde{Σ}}_{0}) \end{matrix}

(A.1)

where ${\tilde{Σ}}_{0} = {(Σ_{0}^{- 1} σ_{β}^{- 2} + X^{T} R^{- 1} X)}^{- 1}$ and ${\tilde{β}}_{0} = {\tilde{Σ}}_{0} (Σ_{0}^{- 1} σ_{β}^{- 2} β_{0} - X^{T} R^{- 1} \sum_{h = 1}^{2} Z_{h} b_{h} + X^{T} R^{- 1} Y)$ , with $R^{- 1} = I_{n} \otimes R_{e}^{- 1}$ . Also, if we had assumed $P (β) \propto 1$ as the prior for $β$ , we would still have obtained a multivariate Normal full conditional distribution because of the conjugacy of the multivariate Normal distribution; however, the mean vector and covariance matrix would be slightly modified.
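The mean vector and covariance matrix of the full conditional in (A.1) can be computed as in the following sketch. It assumes NumPy is available; the function names, argument names, and the toy inputs are hypothetical, and the random effects enter only through the vector `Zb`, which stands for $\sum_{h = 1}^{2} Z_{h} b_{h}$. It is an illustration of the formulas, not the optimized C++ routine of the BMTME package.

```python
import numpy as np

def beta_full_conditional(Y, X, Zb, R_inv, Sigma0_inv, sigma2_beta, beta0):
    """Mean and covariance of the Normal full conditional of beta in (A.1)."""
    prior_precision = Sigma0_inv / sigma2_beta            # Sigma_0^{-1} sigma_beta^{-2}
    cov = np.linalg.inv(prior_precision + X.T @ R_inv @ X)
    mean = cov @ (prior_precision @ beta0 + X.T @ R_inv @ (Y - Zb))
    return mean, cov

def sample_beta(mean, cov, rng):
    """One Gibbs draw of beta from N(mean, cov)."""
    return rng.multivariate_normal(mean, cov)

# Toy check with hypothetical inputs: X = I, R = I, Sigma_0 = I,
# sigma_beta^2 = 1, beta_0 = 0, and no random effects.
Y = np.array([2.0, 4.0])
X = np.eye(2)
Zb = np.zeros(2)
mean, cov = beta_full_conditional(Y, X, Zb, np.eye(2), np.eye(2), 1.0, np.zeros(2))
draw = sample_beta(mean, cov, np.random.default_rng(0))
```

In this toy case the posterior precision is twice the identity, so the mean is half of `Y` and the covariance is half the identity, which gives a quick sanity check of the implementation.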

Full conditional for $σ_{β}^{2}$

\begin{array}{l} P (σ_{β}^{2} | ELSE) \propto P (β | σ_{β}^{2}) P (σ_{β}^{2} | a_{β}) \\ \propto \frac{1}{{(σ_{β}^{2})}^{\frac{ν_{β} + 1 + IL + 1}{2}}} \exp \left(- \frac{{(β - β_{0})}^{T} Σ_{0}^{- 1} (β - β_{0}) + 2 ν_{β} / a_{β}}{2 σ_{β}^{2}}\right) \\ \propto IW ({\tilde{k}}_{β^{*}} = ν_{β} + IL, {\tilde{B}}_{β} = {(β - β_{0})}^{T} Σ_{0}^{- 1} (β - β_{0}) + 2 ν_{β} / a_{β}) \end{array}

(A.2)

Full conditional for $a_{β}$

\begin{array}{l} P (a_{β} | ELSE) \propto P (σ_{β}^{2} | a_{β}) P (a_{β}) \\ \propto \frac{1}{{(a_{β})}^{\frac{ν_{β} + 1}{2} + 1}} \exp \left(- \frac{1 / A_{β}^{2} + ν_{β} / σ_{β}^{2}}{a_{β}}\right) \\ \propto IG \left(\frac{ν_{β} + 1}{2}, 1 / A_{β}^{2} + ν_{β} / σ_{β}^{2}\right) \end{array}

(A.3)
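The two scalar updates (A.2) and (A.3) can be sketched with the Python standard library alone. The sketch assumes the shape–scale parameterization of the inverse gamma, with density proportional to $x^{- (a + 1)} \exp (- b / x)$ ; under the exponents shown in (A.2), a scalar inverse-Wishart with parameters $k$ and $B$ then corresponds to $IG (k / 2, B / 2)$ . All function and argument names are hypothetical.

```python
import random

def sample_inverse_gamma(shape, scale, rng):
    """Draw from IG(shape, scale): if G ~ Gamma(shape, rate=scale), then 1/G ~ IG."""
    # random.gammavariate takes (alpha, beta) with beta as the *scale* of the gamma,
    # so a rate of `scale` corresponds to beta = 1 / scale.
    return 1.0 / rng.gammavariate(shape, 1.0 / scale)

def update_sigma2_beta(quad_form, n_beta, nu_beta, a_beta, rng):
    """Draw sigma_beta^2 as in (A.2); quad_form = (beta - beta0)' Sigma0^{-1} (beta - beta0),
    n_beta = I * L is the length of beta."""
    k = nu_beta + n_beta
    B = quad_form + 2.0 * nu_beta / a_beta
    return sample_inverse_gamma(k / 2.0, B / 2.0, rng)

def update_a_beta(sigma2_beta, nu_beta, A_beta, rng):
    """Draw a_beta as in (A.3)."""
    shape = (nu_beta + 1) / 2.0
    scale = 1.0 / A_beta ** 2 + nu_beta / sigma2_beta
    return sample_inverse_gamma(shape, scale, rng)

# Hypothetical values, used only to exercise the functions.
rng = random.Random(1)
sigma2_beta = update_sigma2_beta(quad_form=3.0, n_beta=9, nu_beta=2, a_beta=1.0, rng=rng)
a_beta = update_a_beta(sigma2_beta, nu_beta=2, A_beta=100.0, rng=rng)

# Monte Carlo check: the mean of IG(5, 4) is 4 / (5 - 1) = 1.
draws = [sample_inverse_gamma(5.0, 4.0, rng) for _ in range(20000)]
mc_mean = sum(draws) / len(draws)
```

In a Gibbs sampler these two functions would be called in turn within each iteration, each conditioning on the current value of the other parameter.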

Full conditional for $b_{1}$

Defining $η^{1} = Xβ + Z_{2} b_{2},$ the conditional distribution of $b_{1}$ is given as

\begin{array}{l} P (b_{1} | ELSE) \propto P (b_{1} | Σ_{t}) P (Y | β, b_{1}, b_{2}, R_{e}) \\ \propto \exp \left(- \frac{1}{2} {(Y - Z_{1} b_{1} - η^{1})}^{T} R^{- 1} (Y - Z_{1} b_{1} - η^{1}) - \frac{1}{2} b_{1}^{T} G_{1}^{- 1} b_{1}\right) \\ \propto \exp \left\{- \frac{1}{2} {(b_{1} - {\tilde{b}}_{1})}^{T} F_{1}^{- 1} (b_{1} - {\tilde{b}}_{1})\right\} \propto N ({\tilde{b}}_{1}, F_{1}) \end{array}

(A.4)

where $F_{1} = {(G_{1}^{- 1} + Z_{1}^{T} R^{- 1} Z_{1})}^{- 1}$ and ${\tilde{b}}_{1} = F_{1} (Z_{1}^{T} R^{- 1} Y - Z_{1}^{T} R^{- 1} η^{1})$ . In a similar way, by defining $η^{2} = Xβ + Z_{1} b_{1}$ , we arrive at the full conditional of $b_{2}$ as

P (b_{2} | ELSE) \sim N ({\tilde{b}}_{2}, F_{2})

(A.5)

where $F_{2} = {(G_{2}^{- 1} + Z_{2}^{T} R^{- 1} Z_{2})}^{- 1}, {\tilde{b}}_{2} = F_{2} (Z_{2}^{T} R^{- 1} Y - Z_{2}^{T} R^{- 1} η^{2}),$ $G_{1}^{- 1} = G_{g}^{- 1} \otimes Σ_{t}^{- 1}$ , and $G_{2}^{- 1} = Σ_{E}^{- 1} \otimes G_{1}^{- 1}$ .

Full conditional for $Σ_{t}$

\begin{array}{l} P (Σ_{t} | ELSE) \propto P (b_{1} | Σ_{t}) P (Σ_{t} | a_{1}, \dots, a_{L}) P (b_{2} | Σ_{t}, Σ_{E}) \\ \propto {| G_{g} \otimes Σ_{t} |}^{- \frac{1}{2}} \exp \left(- \frac{1}{2} b_{1}^{T} {(G_{g} \otimes Σ_{t})}^{- 1} b_{1}\right) P (Σ_{t} | a_{1}, \dots, a_{L}) {| G_{3} \otimes Σ_{t} |}^{- \frac{1}{2}} \exp \left(- \frac{1}{2} b_{2}^{T} {(G_{3} \otimes Σ_{t})}^{- 1} b_{2}\right) \\ \propto {| Σ_{t} |}^{- \frac{J + J I}{2}} \exp \left(- \frac{1}{2} \left[b_{1}^{T} (G_{g}^{- \frac{1}{2}} \otimes Σ_{t}^{- \frac{1}{2}}) (G_{g}^{- \frac{1}{2}} \otimes Σ_{t}^{- \frac{1}{2}}) b_{1} + b_{2}^{T} (G_{3}^{- \frac{1}{2}} \otimes Σ_{t}^{- \frac{1}{2}}) (G_{3}^{- \frac{1}{2}} \otimes Σ_{t}^{- \frac{1}{2}}) b_{2}\right]\right) P (Σ_{t} | a_{1}, \dots, a_{L}) \\ \propto {| Σ_{t} |}^{- \frac{J + J I}{2}} \exp \left(- \frac{1}{2} \left[\sum_{j = 1}^{J} c_{1 j}^{T} Σ_{t}^{- \frac{1}{2}} Σ_{t}^{- \frac{1}{2}} c_{1 j} + \sum_{j^{*} = 1}^{J I} c_{2 j^{*}}^{T} Σ_{t}^{- \frac{1}{2}} Σ_{t}^{- \frac{1}{2}} c_{2 j^{*}}\right]\right) P (Σ_{t} | a_{1}, \dots, a_{L}) \\ \propto {| Σ_{t} |}^{- \frac{ν_{t} + J + L + J I - 1 + L + 1}{2}} \exp \left(- \frac{1}{2} tr \left\{[b_{1}^{*} G_{g}^{- 1} b_{1}^{* T} + b_{2}^{*} G_{3}^{- 1} b_{2}^{* T} + B_{t}] Σ_{t}^{- 1}\right\}\right) {| B_{t} |}^{\frac{ν_{t} + L - 1}{2}} \\ \propto IW \left(κ^{*} = ν_{t} + J + L + J I - 1, B_{t}^{*} = b_{1}^{*} G_{g}^{- 1} b_{1}^{* T} + b_{2}^{*} G_{3}^{- 1} b_{2}^{* T} + 2 ν_{t} diag (\frac{1}{a_{1}}, \dots, \frac{1}{a_{L}})\right) \end{array}

(A.6)

where $G_{3} = Σ_{E} \otimes G_{g}$ and $B_{t} = 2 ν_{t} diag (\frac{1}{a_{1}}, \dots, \frac{1}{a_{L}})$ . Note that $(G_{g}^{- 1 / 2} \otimes Σ_{t}^{- 1 / 2}) b_{1} = vec (Σ_{t}^{- \frac{1}{2}} b_{1}^{*} G_{g}^{- \frac{1}{2}}) = vec (Σ_{t}^{- \frac{1}{2}} C_{1})$ , with $b_{1}^{*} = [b_{11}, \dots, b_{1 J}]$ and $C_{1} = [c_{11}, \dots, c_{1 J}] = b_{1}^{*} G_{g}^{- 1 / 2}$ . From here, $(G_{g}^{- 1 / 2} \otimes Σ_{t}^{- 1 / 2}) b_{1} = vec ([Σ_{t}^{- \frac{1}{2}} c_{11}, \dots, Σ_{t}^{- \frac{1}{2}} c_{1 J}]) = \left[\begin{matrix} Σ_{t}^{- \frac{1}{2}} c_{11} \\ ⋮ \\ Σ_{t}^{- \frac{1}{2}} c_{1 J} \end{matrix}\right]$ , and so, writing $u_{j}$ for the $j$ th column of $I_{J}$ (so that $c_{1 j} = C_{1} u_{j}$ ), $b_{1}^{T} {(G_{g} \otimes Σ_{t})}^{- 1} b_{1} = \sum_{j = 1}^{J} c_{1 j}^{T} Σ_{t}^{- 1} c_{1 j} = tr [(\sum_{j = 1}^{J} C_{1} u_{j} u_{j}^{T} C_{1}^{T}) Σ_{t}^{- 1}] = tr [(C_{1} \sum_{j = 1}^{J} u_{j} u_{j}^{T} C_{1}^{T}) Σ_{t}^{- 1}] = tr [(C_{1} C_{1}^{T}) Σ_{t}^{- 1}] = tr [b_{1}^{*} G_{g}^{- 1} b_{1}^{* T} Σ_{t}^{- 1}]$ . Also note that $(G_{3}^{- 1 / 2} \otimes Σ_{t}^{- 1 / 2}) b_{2} = vec (Σ_{t}^{- \frac{1}{2}} b_{2}^{*} G_{3}^{- \frac{1}{2}}) = vec (Σ_{t}^{- \frac{1}{2}} C_{2})$ , with $b_{2}^{*} = [b_{21}, \dots, b_{2 J I}]$ and $C_{2} = [c_{21}, \dots, c_{2 J I}] = b_{2}^{*} G_{3}^{- 1 / 2}$ . These expressions are obtained using $(B^{T} \otimes A) vec (X) = vec (AXB)$ .
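The identity $(B^{T} \otimes A) vec (X) = vec (AXB)$ and the resulting trace form of the quadratic term for $b_{1}$ can be checked numerically. The sketch below assumes NumPy is available and uses small random matrices with hypothetical dimensions (3 traits, 4 lines); `vec` stacks the columns of a matrix, which matches the ordering $b_{1} = vec (b_{1}^{*})$ used in the derivation.

```python
import numpy as np

rng = np.random.default_rng(0)

def vec(M):
    """Stack the columns of M into a single vector (column-major order)."""
    return M.reshape(-1, order="F")

def random_spd(p):
    """Random symmetric positive definite p x p matrix."""
    M = rng.normal(size=(p, p))
    return M @ M.T + p * np.eye(p)

# 1) The vec/Kronecker identity for arbitrary conformable matrices.
A = rng.normal(size=(3, 3))
X = rng.normal(size=(3, 4))
B = rng.normal(size=(4, 4))
identity_holds = np.allclose(np.kron(B.T, A) @ vec(X), vec(A @ X @ B))

# 2) The trace identity: b1' (G_g x Sigma_t)^{-1} b1 = tr(b1* G_g^{-1} b1*' Sigma_t^{-1}).
n_traits, n_lines = 3, 4                         # hypothetical L and J
Sigma_t = random_spd(n_traits)
G_g = random_spd(n_lines)
b1_star = rng.normal(size=(n_traits, n_lines))   # L x J matrix of effects
b1 = vec(b1_star)

quad_form = b1 @ np.linalg.inv(np.kron(G_g, Sigma_t)) @ b1
trace_form = np.trace(
    b1_star @ np.linalg.inv(G_g) @ b1_star.T @ np.linalg.inv(Sigma_t)
)
```

The trace form only needs the inverses of the $J \times J$ and $L \times L$ factors, whereas the direct quadratic form inverts the full $J L \times J L$ Kronecker product; this is the computational advantage that the separable structure provides.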

Full conditional for $a_{l}$

\begin{array}{l} P (a_{l} | ELSE) \propto P (Σ_{t} | a_{l}) P (a_{l}) \\ \propto \frac{1}{{(a_{l})}^{\frac{ν_{t} + L}{2} + 1}} \exp (- \frac{1 / A_{l}^{2} + ν_{t} {(Σ_{t}^{- 1})}_{l l}}{a_{l}}) \\ \propto I G (\frac{ν_{t} + L}{2}, 1 / A_{l}^{2} + ν_{t} {(Σ_{t}^{- 1})}_{l l}) \end{array}

(A.7)

where $l = 1, \dots, L$ and ${(Σ_{t}^{- 1})}_{l l}$ denotes the $(l, l)$ entry of $Σ_{t}^{- 1}$ .
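As a minimal sketch of how (A.7) might be sampled, the R code below draws all $a_{l}$ at once. It assumes the shape and scale form of the inverse gamma, $IG(α, β) \propto x^{- α - 1} \exp (- β / x)$, which matches the kernel in (A.7). The values of `nu_t`, `A_l`, and `Sigma_t` and the helper `r_inv_gamma` are hypothetical.

```r
set.seed(2)

# If X ~ Gamma(shape, rate = scale), then 1 / X ~ IG(shape, scale)
r_inv_gamma <- function(n, shape, scale) 1 / rgamma(n, shape = shape, rate = scale)

L <- 3
nu_t <- 2                     # hypothetical value of the hyperparameter nu_t
A_l <- rep(1e4, L)            # hypothetical large scale hyperparameters A_l
Sigma_t <- matrix(c(1.0, 0.5, 0.2,
                    0.5, 2.0, 0.3,
                    0.2, 0.3, 1.5), L, L)   # current value of Sigma_t (illustrative)

prec_diag <- diag(solve(Sigma_t))            # (l, l) entries of Sigma_t^{-1}
a <- r_inv_gamma(L,
                 shape = (nu_t + L) / 2,
                 scale = 1 / A_l^2 + nu_t * prec_diag)
```

Because the $a_{l}$ are conditionally independent given $Σ_{t}$, a single vectorized call to `rgamma` is enough.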

Full conditional for $σ_{E i}^{2},$ with $i = 1, .., I .$

\begin{array}{l} P (σ_{E i}^{2} | ELSE) \propto P (b_{2 i} | σ_{E i}^{2}) P (σ_{E i}^{2} | a_{E i}) \\ \propto {| σ_{E i}^{2} \otimes G_{1} |}^{- \frac{1}{2}} \exp (- \frac{1}{2 σ_{E i}^{2}} b_{2 i}^{T} G_{1}^{- 1} b_{2 i}) P (σ_{E i}^{2} | a_{E i}) \\ \propto {(σ_{E i}^{2})}^{- \frac{ν_{E i} + 1 + J L - 1 + 1 + 1}{2}} \exp (- \frac{1}{2} (tr {[b_{2 i}^{T} G_{1}^{- 1} b_{2 i} + B_{E i}] σ_{E i}^{- 2}})) {| B_{E i} |}^{\frac{ν_{E i} + 1 - 1}{2}} \\ \propto I W (κ^{*} = ν_{E i} + 1 + J L - 1, B_{E}^{*} = [b_{2 i}^{T} G_{1}^{- 1} b_{2 i} + \frac{2 ν_{E i}}{a_{E i}}]) \end{array}

(A.8)

where $B_{E i} = \frac{2 ν_{E i}}{a_{E i}}$ , $i = 1, \dots, I$ , and $b_{2 i} = {[b_{2 i 1}^{T}, \dots, b_{2 i J}^{T}]}^{T}$ .
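Because $σ_{E i}^{2}$ is a scalar, the inverse Wishart in (A.8) reduces to an inverse gamma. The short derivation below is a sketch under two assumptions: the parameterization $I W (κ, B) \propto {| Σ |}^{- (κ + p + 1) / 2} \exp (- tr (B Σ^{- 1}) / 2)$ with $p = 1$ , and the shape and scale form of the inverse gamma that appears in (A.7).

```latex
\begin{aligned}
IW(x \mid κ^{*}, B^{*})
  &\propto x^{-\frac{κ^{*} + 2}{2}} \exp\left(-\frac{B^{*}}{2x}\right)
   = x^{-\left(\frac{κ^{*}}{2} + 1\right)} \exp\left(-\frac{B^{*}/2}{x}\right)
   \propto IG\left(\frac{κ^{*}}{2}, \frac{B^{*}}{2}\right), \\
σ_{E i}^{2} \mid ELSE
  &\sim IG\left(\frac{ν_{E i} + J L}{2},\;
     \frac{1}{2}\left[b_{2 i}^{T} G_{1}^{- 1} b_{2 i} + \frac{2 ν_{E i}}{a_{E i}}\right]\right).
\end{aligned}
```

Equivalently, $B^{*} / σ_{E i}^{2}$ follows a chi-square distribution with $κ^{*} = ν_{E i} + J L$ degrees of freedom, which gives a simple way to draw from this conditional.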

Full conditional for $a_{E i},$ with $i = 1, .., I .$

\begin{matrix} P (a_{E i} | E L S E) \propto P (σ_{E i}^{2} | a_{E i}) P (a_{E i}) \\ \propto \frac{1}{{(a_{E i})}^{\frac{ν_{E i} + 1}{2} + 1}} \exp (- \frac{1 / A_{E i}^{2} + ν_{E i} / σ_{E i}^{2}}{a_{E i}}) \\ \propto I G (\frac{ν_{E i} + 1}{2}, 1 / A_{E i}^{2} + ν_{E i} / σ_{E i}^{2}) \end{matrix}

(A.9)

Full conditional for $R_{e}$ with $l = 1, \dots, L .$

P (R_{e} | E L S E) \propto P (Y | β, b_{1}, b_{2}, R_{e}) P (R_{e} | a_{e 1}, \dots, a_{e L})

\begin{array}{l} \propto \frac{1}{{| R_{e} |}^{\frac{ν_{e} + L + n - 1 + L + 1}{2}}} \exp (- \frac{tr {[\sum_{i = 1}^{I} \sum_{j = 1}^{J} \sum_{k = 1}^{K} e_{i j k} e_{i j k}^{T} + 2 ν_{e} diag (\frac{1}{a_{e 1}}, \dots, \frac{1}{a_{e L}})] R_{e}^{- 1}}}{2}) {| B_{e} |}^{\frac{ν_{e} + L - 1}{2}} \\ \propto I W ({\tilde{k}}_{b h l} = ν_{e} + L + n - 1, B_{e}^{*} = \sum_{i = 1}^{I} \sum_{j = 1}^{J} \sum_{k = 1}^{K} e_{i j k} e_{i j k}^{T} + 2 ν_{e} diag (\frac{1}{a_{e 1}}, \dots, \frac{1}{a_{e L}})) \end{array}

(A.10)

with $B_{e} = 2 ν_{e} diag (\frac{1}{a_{e 1}}, \dots, \frac{1}{a_{e L}}),$ $e_{i j k} = y_{i j k} - (X_{i j k} β + Z_{1 i j k} b_{1 j} + Z_{2 i j k} b_{2 i j})$ , and $l = 1, 2, 3.$

Full conditional for $a_{e l}$

\begin{array}{l} P (a_{e l} | ELSE) \propto P (R_{e} | a_{e l}) P (a_{e l}) \\ \propto \frac{1}{{(a_{e l})}^{\frac{ν_{e} + L}{2} + 1}} \exp (- \frac{1 / A_{e l}^{2} + ν_{e} {(R_{e}^{- 1})}_{l l}}{a_{e l}}) \\ \propto I G (\frac{ν_{e} + L}{2}, 1 / A_{e l}^{2} + ν_{e} {(R_{e}^{- 1})}_{l l}) \end{array}

(A.11)

where $l = 1, 2, 3, \dots, L$ and ${(R_{e}^{- 1})}_{l l}$ denotes the $(l, l)$ entry of $R_{e}^{- 1}$ .
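To illustrate how (A.10) and (A.11) alternate within the Gibbs sampler, the R sketch below iterates the two draws for a fixed matrix of residuals. It is not the BMTME implementation: the residuals are simulated and held fixed (in the full sampler they would be recomputed after each update of $β$ , $b_{1}$ , and $b_{2}$ ), no burn-in is discarded, and `n`, `nu_e`, `A_e`, and `R_true` are hypothetical values.

```r
set.seed(3)
n <- 200; L <- 3
nu_e <- 2                      # hypothetical value of nu_e
A_e <- rep(1e4, L)             # hypothetical scale hyperparameters A_el

R_true <- matrix(c(1.0, 0.6, 0.3,
                   0.6, 1.0, 0.4,
                   0.3, 0.4, 1.0), L, L)
E <- matrix(rnorm(n * L), n, L) %*% chol(R_true)  # rows play the role of e_ijk'
S <- crossprod(E)                                 # sum over i, j, k of e_ijk e_ijk'

n_iter <- 500
a_e <- rep(1, L)
R_sum <- matrix(0, L, L)
for (it in seq_len(n_iter)) {
  # (A.10): R_e | ELSE ~ IW(nu_e + L + n - 1, S + 2 nu_e diag(1 / a_e))
  B_star <- S + 2 * nu_e * diag(1 / a_e)
  R_e <- solve(rWishart(1, df = nu_e + L + n - 1, Sigma = solve(B_star))[, , 1])
  # (A.11): a_el | ELSE ~ IG((nu_e + L) / 2, 1 / A_el^2 + nu_e (R_e^{-1})_ll)
  a_e <- 1 / rgamma(L, shape = (nu_e + L) / 2,
                    rate = 1 / A_e^2 + nu_e * diag(solve(R_e)))
  R_sum <- R_sum + R_e
}
R_mean <- R_sum / n_iter       # Monte Carlo average of the sampled R_e
```

With this many residual vectors the averaged draws should lie close to the sample covariance `S / n`, since the prior contributes only the small term $2 ν_{e} diag (1 / a_{e l})$ to the scale matrix.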

Appendix B

Derivation of full conditional distributions for the BMTME diagonal model

All full conditional distributions of the BMTME diagonal model are the same as those of the BMTME unstructured model, except those needed for the variance–covariance matrices ( $Σ_{t}$ , $R_{e}$ ), which are now diagonal. For this reason, here we provide only the full conditionals of the diagonal elements of $Σ_{t}$ ( $σ_{t}^{2 (l)}$ , with $l = 1, \dots, L$ ) and of $R_{e}$ ( $σ_{e}^{2 (l)}$ , with $l = 1, \dots, L$ ), together with those of the required elements of $a$ and $a_{e}$ .

Full conditional for $σ_{t}^{2 (l)}$

P (σ_{t}^{2 (l)} | E L S E) \propto I W (κ^{*} = ν_{t} + J + 1 + J I - 1, B_{t}^{*} = {(b_{1}^{*} G_{g}^{- 1} b_{1}^{* T})}_{l l} + {(b_{2}^{*} G_{3}^{- 1} b_{2}^{* T})}_{l l} + \frac{2 ν_{t}}{a_{l}})

where ${(b_{1}^{*} G_{g}^{- 1} b_{1}^{* T})}_{l l}$ and ${(b_{2}^{*} G_{3}^{- 1} b_{2}^{* T})}_{l l}$ denote the $(l, l)$ entries of the matrices $b_{1}^{*} G_{g}^{- 1} b_{1}^{* T}$ and $b_{2}^{*} G_{3}^{- 1} b_{2}^{* T}$ , respectively.

Full conditional for $a_{l}$

P (a_{l} | E L S E) \propto P (σ_{t}^{2 (l)} | a_{l}) P (a_{l})

\propto \frac{1}{{(a_{l})}^{\frac{ν_{t} + 1}{2} + 1}} \exp (- \frac{1 / A_{l}^{2} + ν_{t} σ_{t}^{- 2 (l)}}{a_{l}})

\propto I G (\frac{ν_{t} + 1}{2}, 1 / A_{l}^{2} + ν_{t} / σ_{t}^{2 (l)})

Full conditional for $σ_{e}^{2 (l)}$ with $l = 1, \dots, L$

P (σ_{e}^{2 (l)} | ELSE) \propto I W ({\tilde{k}}_{b h l} = ν_{e} + 1 + n - 1, B_{e}^{*} = {(\sum_{i = 1}^{I} \sum_{j = 1}^{J} \sum_{k = 1}^{K} e_{i j k} e_{i j k}^{T})}_{l l} + \frac{2 ν_{e}}{a_{e l}})

where ${(\sum_{i = 1}^{I} \sum_{j = 1}^{J} \sum_{k = 1}^{K} e_{i j k} e_{i j k}^{T})}_{l l}$ denotes the $(l, l)$ entry of the matrix $\sum_{i = 1}^{I} \sum_{j = 1}^{J} \sum_{k = 1}^{K} e_{i j k} e_{i j k}^{T}$ .
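In the diagonal model each residual variance has a scalar inverse Wishart conditional, so all $L$ variances can be drawn in one vectorized step. The R sketch below assumes that a scalar $I W (κ, B)$ variate can be generated as $B$ divided by a chi-square variate with $κ$ degrees of freedom (this follows from the scalar form of the inverse Wishart density). The residual matrix and the values of `nu_e` and `a_e` are hypothetical.

```r
set.seed(4)
n <- 150; L <- 3
nu_e <- 2                        # hypothetical value of nu_e
a_e <- rep(1, L)                 # current values of a_el (illustrative)

# Simulated residuals: column l has standard deviation 1, 2, and 0.5, respectively
E <- matrix(rnorm(n * L, sd = rep(c(1, 2, 0.5), each = n)), n, L)

ss <- colSums(E^2)               # (l, l) entries of the sum of e_ijk e_ijk'
kappa <- nu_e + 1 + n - 1        # degrees of freedom of the conditional
B_star <- ss + 2 * nu_e / a_e    # scale for each trait

sigma2_e <- B_star / rchisq(L, df = kappa)   # one draw per trait
```

Only the diagonal of the residual cross-product is needed here, so `colSums(E^2)` replaces the full `crossprod(E)` used in the unstructured model.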

Full conditional for $a_{e l}$

P (a_{e l} | E L S E) \propto P (σ_{e}^{2 (l)} | a_{e l}) P (a_{e l})

\propto \frac{1}{{(a_{e l})}^{\frac{ν_{e} + 1}{2} + 1}} \exp (- \frac{1 / A_{e l}^{2} + ν_{e} σ_{e}^{- 2 (l)}}{a_{e l}})

\propto I G (\frac{ν_{e} + 1}{2}, \frac{1}{A_{e l}^{2}} + \frac{ν_{e}}{σ_{e}^{2 (l)}})

Appendix C

Derivation of full conditional distributions for the BMTME standard model

All full conditional distributions of the BMTME standard model are the same as those of the BMTME unstructured model, except those needed for the variance–covariance matrices ( $Σ_{t}$ , $Σ_{E}$ , $R_{e}$ ), which are now each equal to an identity matrix multiplied by $σ_{t}^{2}$ , $σ_{E}^{2}$ , and $σ_{e}^{2}$ , respectively. For this reason, here we provide only the full conditionals of $σ_{t}^{2}$ , $σ_{E}^{2}$ , and $σ_{e}^{2}$ and of the required elements $a_{t}$ , $a_{E}$ , and $a_{e}$ .

Full conditional for $σ_{t}^{2}$

P (σ_{t}^{2} | E L S E) \propto I W (κ^{*} = ν_{t} + J L + 1 + I J L - 1, B_{t}^{*} = [b_{1}^{T} (G_{g}^{- 1} \otimes I_{L}) b_{1} + b_{2}^{T} (G_{3}^{- 1} \otimes I_{L}) b_{2} + 2 ν_{t} (\frac{1}{a_{t}})])

with $G_{3} = I_{I} σ_{E}^{2} \otimes G_{g}$ .

Full conditional for $a_{t}$

P (a_{t} | E L S E) \propto P (σ_{t}^{2} | a_{t}) P (a_{t}) \propto \frac{1}{{(a_{t})}^{\frac{ν_{t} + 1}{2} + 1}} \exp (- \frac{1 / A_{t}^{2} + ν_{t} σ_{t}^{- 2}}{a_{t}})

\propto I G (\frac{ν_{t} + 1}{2}, 1 / A_{t}^{2} + ν_{t} / σ_{t}^{2})

Full conditional for $σ_{E}^{2}$

P (σ_{E}^{2} | E L S E) \propto I W (κ^{*} = ν_{E} + 1 + I J L - 1, B_{E}^{*} = [b_{2}^{T} (I_{I} \otimes G_{1}^{- 1}) b_{2} + 2 ν_{E} (\frac{1}{a_{E}})])

with $G_{1} = G_{g} \otimes I_{L} σ_{t}^{2}$ .

Full conditional for $a_{E}$

P (a_{E} | E L S E) \propto P (σ_{E}^{2} | a_{E}) P (a_{E}) \propto \frac{1}{{(a_{E})}^{\frac{ν_{E} + 1}{2} + 1}} \exp (- \frac{1 / A_{E}^{2} + ν_{E} σ_{E}^{- 2}}{a_{E}})

\propto I G (\frac{ν_{E} + 1}{2}, 1 / A_{E}^{2} + ν_{E} / σ_{E}^{2})

Full conditional for $σ_{e}^{2}$

P (σ_{e}^{2} | E L S E) \propto I W ({\tilde{k}}_{b h l} = ν_{e} + 1 + n L - 1, B_{e}^{*} = e^{T} e + \frac{2 ν_{e}}{a_{e}})

where $e = Y - Xβ - Z_{1} b_{1} - Z_{2} b_{2}$ .

Full conditional for $a_{e}$

P (a_{e} | E L S E) \propto P (σ_{e}^{2} | a_{e}) P (a_{e})

\propto \frac{1}{{(a_{e})}^{\frac{ν_{e} + 1}{2} + 1}} \exp (- \frac{1 / A_{e}^{2} + ν_{e} σ_{e}^{- 2}}{a_{e}})

\propto I G (\frac{ν_{e} + 1}{2}, \frac{1}{A_{e}^{2}} + \frac{ν_{e}}{σ_{e}^{2}})

Appendix D

Table D1. Cross-validation schemes.

Line	Trait	CV1			CV2
Line	Trait	env1	env2	env3	env1	env2	env3
1	1	y11 (1)	y21 (1)	y31 (1)	y11 (1)	y21 (1)	M
1	2	y11 (2)	y21 (2)	y31 (2)	y11 (2)	y21 (2)	y31 (2)
1	3	y11 (3)	y21 (3)	y31 (3)	y11 (3)	y21 (3)	y31 (3)
2	1	M	y22 (1)	y32 (1)	y12 (1)	y22 (1)	M
2	2	M	y22 (2)	y32 (2)	y12 (2)	y22 (2)	y32 (2)
2	3	M	y22 (3)	y32 (3)	y12 (3)	y22 (3)	y32 (3)
3	1	y13 (1)	y23 (1)	y33 (1)	y13 (1)	y23 (1)	M
3	2	y13 (2)	y23 (2)	y33 (2)	y13 (2)	y23 (2)	y33 (2)
3	3	y13 (3)	y23 (3)	y33 (3)	y13 (3)	y23 (3)	y33 (3)
4	1	y14 (1)	y24 (1)	y34 (1)	y14 (1)	y24 (1)	M
4	2	y14 (2)	y24 (2)	y34 (2)	y14 (2)	y24 (2)	y34 (2)
4	3	y14 (3)	y24 (3)	y34 (3)	y14 (3)	y24 (3)	y34 (3)
5	1	y15 (1)	y25 (1)	y35 (1)	y15 (1)	y25 (1)	M
5	2	y15 (2)	y25 (2)	y35 (2)	y15 (2)	y25 (2)	y35 (2)
5	3	y15 (3)	y25 (3)	y35 (3)	y15 (3)	y25 (3)	y35 (3)
6	1	y16 (1)	M	M	y16 (1)	y26 (1)	M
6	2	y16 (2)	M	M	y16 (2)	y26 (2)	y36 (2)
6	3	y16 (3)	M	M	y16 (3)	y26 (3)	y36 (3)
7	1	y17 (1)	y27 (1)	y37 (1)	y17 (1)	y27 (1)	M
7	2	y17 (2)	y27 (2)	y37 (2)	y17 (2)	y27 (2)	y37 (2)
7	3	y17 (3)	y27 (3)	y37 (3)	y17 (3)	y27 (3)	y37 (3)
8	1	y18 (1)	y28 (1)	y38 (1)	y18 (1)	y28 (1)	M
8	2	y18 (2)	y28 (2)	y38 (2)	y18 (2)	y28 (2)	y38 (2)
8	3	y18 (3)	y28 (3)	y38 (3)	y18 (3)	y28 (3)	y38 (3)
9	1	y19 (1)	y29 (1)	y39 (1)	y19 (1)	y29 (1)	M
9	2	y19 (2)	y29 (2)	y39 (2)	y19 (2)	y29 (2)	y39 (2)
9	3	y19 (3)	y29 (3)	y39 (3)	y19 (3)	y29 (3)	y39 (3)
10	1	M	M	y310 (1)	y110 (1)	y210 (1)	M
10	2	M	M	y310 (2)	y110 (2)	y210 (2)	y310 (2)
10	3	M	M	y310 (3)	y110 (3)	y210 (3)	y310 (3)
…	…	…	…	…	…	…	…
J-10	1	y1(J-10) (1)	y2(J-10) (1)	y3(J-10) (1)	y1(J-10) (1)	y2(J-10) (1)	M
J-10	2	y1(J-10) (2)	y2(J-10) (2)	y3(J-10) (2)	y1(J-10) (2)	y2(J-10) (2)	y3(J-10) (2)
J-10	3	y1(J-10) (3)	y2(J-10) (3)	y3(J-10) (3)	y1(J-10) (3)	y2(J-10) (3)	y3(J-10) (3)
J-9	1	y1(J-9) (1)	y2(J-9) (1)	y3(J-9) (1)	y1(J-9) (1)	y2(J-9) (1)	M
J-9	2	y1(J-9) (2)	y2(J-9) (2)	y3(J-9) (2)	y1(J-9) (2)	y2(J-9) (2)	y3(J-9) (2)
J-9	3	y1(J-9) (3)	y2(J-9) (3)	y3(J-9) (3)	y1(J-9) (3)	y2(J-9) (3)	y3(J-9) (3)
J-8	1	y1(J-8) (1)	M	y3(J-8) (1)	y1(J-8) (1)	y2(J-8) (1)	M
J-8	2	y1(J-8) (2)	M	y3(J-8) (2)	y1(J-8) (2)	y2(J-8) (2)	y3(J-8) (2)
J-8	3	y1(J-8) (3)	M	y3(J-8) (3)	y1(J-8) (3)	y2(J-8) (3)	y3(J-8) (3)
J-7	1	y1(J-7) (1)	y2(J-7) (1)	y3(J-7) (1)	y1(J-7) (1)	y2(J-7) (1)	M
J-7	2	y1(J-7) (2)	y2(J-7) (2)	y3(J-7) (2)	y1(J-7) (2)	y2(J-7) (2)	y3(J-7) (2)
J-7	3	y1(J-7) (3)	y2(J-7) (3)	y3(J-7) (3)	y1(J-7) (3)	y2(J-7) (3)	y3(J-7) (3)
J-6	1	y1(J-6) (1)	y2(J-6) (1)	y3(J-6) (1)	y1(J-6) (1)	y2(J-6) (1)	M
J-6	2	y1(J-6) (2)	y2(J-6) (2)	y3(J-6) (2)	y1(J-6) (2)	y2(J-6) (2)	y3(J-6) (2)
J-6	3	y1(J-6) (3)	y2(J-6) (3)	y3(J-6) (3)	y1(J-6) (3)	y2(J-6) (3)	y3(J-6) (3)
J-5	1	M	M	y3(J-5) (1)	y1(J-5) (1)	y2(J-5) (1)	M
J-5	2	M	M	y3(J-5) (2)	y1(J-5) (2)	y2(J-5) (2)	y3(J-5) (2)
J-5	3	M	M	y3(J-5) (3)	y1(J-5) (3)	y2(J-5) (3)	y3(J-5) (3)
J-4	1	y1(J-4) (1)	y2(J-4) (1)	y3(J-4) (1)	y1(J-4) (1)	y2(J-4) (1)	M
J-4	2	y1(J-4) (2)	y2(J-4) (2)	y3(J-4) (2)	y1(J-4) (2)	y2(J-4) (2)	y3(J-4) (2)
J-4	3	y1(J-4) (3)	y2(J-4) (3)	y3(J-4) (3)	y1(J-4) (3)	y2(J-4) (3)	y3(J-4) (3)
J-3	1	y1(J-3) (1)	y2(J-3) (1)	y3(J-3) (1)	y1(J-3) (1)	y2(J-3) (1)	M
J-3	2	y1(J-3) (2)	y2(J-3) (2)	y3(J-3) (2)	y1(J-3) (2)	y2(J-3) (2)	y3(J-3) (2)
J-3	3	y1(J-3) (3)	y2(J-3) (3)	y3(J-3) (3)	y1(J-3) (3)	y2(J-3) (3)	y3(J-3) (3)
J-2	1	y1(J-2) (1)	y2(J-2) (1)	M	y1(J-2) (1)	y2(J-2) (1)	M
J-2	2	y1(J-2) (2)	y2(J-2) (2)	M	y1(J-2) (2)	y2(J-2) (2)	y3(J-2) (2)
J-2	3	y1(J-2) (3)	y2(J-2) (3)	M	y1(J-2) (3)	y2(J-2) (3)	y3(J-2) (3)
J-1	1	y1(J-1) (1)	y2(J-1) (1)	y3(J-1) (1)	y1(J-1) (1)	y2(J-1) (1)	M
J-1	2	y1(J-1) (2)	y2(J-1) (2)	y3(J-1) (2)	y1(J-1) (2)	y2(J-1) (2)	y3(J-1) (2)
J-1	3	y1(J-1) (3)	y2(J-1) (3)	y3(J-1) (3)	y1(J-1) (3)	y2(J-1) (3)	y3(J-1) (3)
J	1	y1J (1)	y2J (1)	y3J (1)	y1J (1)	y2J (1)	M
J	2	y1J (2)	y2J (2)	y3J (2)	y1J (2)	y2J (2)	y3J (2)
J	3	y1J (3)	y2J (3)	y3J (3)	y1J (3)	y2J (3)	y3J (3)


In cross-validation 1 (CV1), lines were evaluated in some environments for all traits but are missing (M) in other environments (for all traits). Cross-validation 2 (CV2) simulates a situation where one trait is missing in all lines in one environment but is present in the remaining environments. The table shows an example of onefold cross-validation for J lines, three environments (env1 to env3), and three traits. yij (l) represents the response variable measured in environment i, genotype j, and trait l. For simplicity, the replication subscript ( $k)$ is omitted.
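The two missing-data patterns of Table D1 can be mimicked on a long-format data frame. The R sketch below is only an illustration: the data are simulated, the 20% share of masked line-by-environment cells in CV1 is a hypothetical choice, and the masked trait and environment in CV2 follow the pattern shown in the table (trait 1 in env3).

```r
set.seed(5)
n_lines <- 10; n_env <- 3; n_trait <- 3        # hypothetical sizes

dat <- expand.grid(trait = 1:n_trait, env = 1:n_env, gid = 1:n_lines)
dat$resp <- rnorm(nrow(dat))                   # simulated responses

# CV1: choose line-by-environment cells and mask every trait in those cells
cells <- unique(dat[, c("gid", "env")])
held <- cells[sample(nrow(cells), size = round(0.2 * nrow(cells))), ]
key <- function(d) paste(d$gid, d$env)         # one label per line-by-environment cell
dat$resp_cv1 <- ifelse(key(dat) %in% key(held), NA, dat$resp)

# CV2: mask one trait for all lines in one environment
dat$resp_cv2 <- ifelse(dat$trait == 1 & dat$env == 3, NA, dat$resp)
```

In CV1 the masked cells lose all traits at once, whereas in CV2 the other traits of the same line and environment remain available to inform the prediction of the masked trait.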

Appendix E

The table below shows how the data should be arranged for an analysis with I environments, J lines, K replications, and L traits. gid denotes the unique line names, env the environments, rep the replications, and resp the response variable.

ThreeWay.

Trait	gid	env	rep	resp
1	G1	Env1	1	y111(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1	G1	Env1	K	y11K(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
L	G1	Env1	K	y11K(L)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1	GJ	Env1	1	y1J1(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1	GJ	Env1	K	y1JK(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
L	GJ	Env1	K	y1JK(L)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1	G1	EnvI	1	yI11(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1	G1	EnvI	K	yI1K(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
L	G1	EnvI	K	yI1K(L)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1	GJ	EnvI	1	yIJ1(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
1	GJ	EnvI	K	yIJK(1)
$⋮$	$⋮$	$⋮$	$⋮$	$⋮$
L	GJ	EnvI	K	yIJK(L)
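A data frame with this layout can be generated as follows. This is a minimal sketch with simulated responses and hypothetical sizes; the lowercase column names `trait`, `gid`, `env`, `rep`, and `resp` are those used in the `transform()` call and the model formula of Example 1 in Appendix F, and the object name `toy` is arbitrary.

```r
set.seed(7)
nI <- 2; nJ <- 3; nK <- 2; nL <- 3     # hypothetical I, J, K, and L

# expand.grid varies its first argument fastest: rep, then trait, then gid, then env
toy <- expand.grid(rep = 1:nK,
                   trait = 1:nL,
                   gid = paste0("G", 1:nJ),
                   env = paste0("Env", 1:nI),
                   stringsAsFactors = FALSE)
toy$resp <- rnorm(nrow(toy))           # simulated response values
toy <- toy[, c("trait", "gid", "env", "rep", "resp")]
head(toy)
```

Each combination of trait, line, environment, and replication occupies exactly one row, giving I × J × K × L rows in total.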


Appendix F

How to install and use the BMTME package

The BMTME package fits the proposed models (unstructured, diagonal, and standard).

Step 1. Install R-software version 3.2.4.

Step 2. Manually install the BMTME package that is available at the link: http://hdl.handle.net/11529/10646

Step 3. Use the package. Open R and copy and paste Example 1.

################################ Example 1 #################################
rm(list = objects()); ls()
library(BMTME)
data(ThreeWay)  # load the built-in package data file
# To run with other data, load your own data here instead.

## Transform the data into the format to be used. Nothing needs to be modified here.
ThreeWay <- transform(ThreeWay,
                      trait = factor(trait),
                      gid = factor(gid),
                      env = factor(env),
                      rep = factor(rep))

## Create the GRM matrix for this example
## (here you need to supply your own genomic relationship matrix).
K_x <- matrix(.7, ncol = 10, nrow = 10)
diag(K_x) <- 1
K <- diag(8) %x% K_x      # block-diagonal matrix: 8 blocks of 10 lines each
ISigmaG <- solve(K)       # inverse of the GRM, passed to the argument K below

## Here the model is fitted. You are only allowed to change model ("un" for
## unstructured covariance matrix, "bd" for diagonal, and "st" for standard),
## nChain, nIter, and the directory (saveAt) where you want to save your output.
## The output will be the β coefficients, the random effects b1 and b2, and the
## three variance–covariance matrices (Sigma Trait, Sigma Environments, and
## Sigma Residual of traits). The order of the call should be as follows,
## considering the corresponding names in the data frame.
fit1 <- fit(formula = resp ~ trait + gid + env + rep,
            data = ThreeWay,
            K = ISigmaG,
            model = "un",
            nChain = 1,
            nIter = 100,
            saveAt = getwd())  # or another existing directory, e.g., "C:\\Osval\\"
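Example 1 uses an artificial relationship matrix. With real marker data, one common choice is the genomic relationship matrix of VanRaden (2008). The R sketch below shows one way this might be computed and inverted. It is an assumption-laden illustration and not part of the BMTME package: the markers are simulated and coded 0/1/2, the object names are arbitrary, and the small ridge added to the diagonal is a hypothetical fix for the singularity that column centering introduces.

```r
set.seed(6)
n_lines <- 20; n_markers <- 500                # hypothetical sizes

# Simulated marker matrix (n_lines x n_markers), coded as 0, 1, or 2 copies of an allele
p_true <- runif(n_markers, 0.1, 0.9)
M <- sapply(p_true, function(p) rbinom(n_lines, 2, p))

p_hat <- colMeans(M) / 2                       # estimated allele frequencies
Z <- sweep(M, 2, 2 * p_hat)                    # center each marker column at 2 * p_hat
G <- tcrossprod(Z) / (2 * sum(p_hat * (1 - p_hat)))

# Column centering makes G singular, so a small ridge is added before inversion
G_reg <- G + diag(1e-3, n_lines)
ISigmaG_toy <- solve(G_reg)                    # would play the role of ISigmaG in Example 1
```

The rows of the marker matrix must follow the same order of lines as the gid levels in the phenotypic data, so that the inverse matrix lines up with the random effects of the lines.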

Footnotes

Communicating editor: D. J. de Koning

Literature Cited

Burgueño J., de los Campos G., Weigel K., Crossa J., 2012. Genomic prediction of breeding values when modeling genotype×environment interaction using pedigree and dense molecular markers. Crop Sci. 52(2): 707–719.
Calus M. P., Veerkamp R. F., 2011. Accuracy of multi-trait genomic selection using different methods. Genet. Sel. Evol. 43(1): 1–14.
Crossa J., de los Campos G., Pérez-Rodríguez P., Gianola D., Burgueño J., et al., 2010. Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 186(2): 713–724.
Crossa J., Pérez-Rodríguez P., de los Campos G., Mahuku G., Dreisigacker S., et al., 2011. Genomic selection and prediction in plant breeding. J. Crop Improv. 25(3): 239–261.
Crossa J., Beyene Y., Kassa S., Pérez-Rodríguez P., Hickey J. M., et al., 2013. Genomic prediction in maize breeding populations with genotyping-by-sequencing. G3 (Bethesda) 3(11): 1903–1926.
de los Campos G., Gianola D., 2007. Factor analysis models for structuring covariance matrices of additive genetic effects: a Bayesian implementation. Genet. Sel. Evol. 39(5): 481–494.
de los Campos G., Naya H., Gianola D., Crossa J., Legarra A., et al., 2009. Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics 182(1): 375–385.
de los Campos G., Gianola D., Rosa G. J. M., Weigel K., Crossa J., 2010. Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods. Genet. Res. 92(04): 295–308.
Eddelbuettel D., 2013. Seamless R and C++ Integration with Rcpp. Springer, New York.
Eddelbuettel D., Sanderson C., 2014. RcppArmadillo: Accelerating R with high-performance C++ linear algebra. Comput. Stat. Data Anal. 71: 1054–1063.
Garrick D. J., Fernando R. L., 2013. Implementing a QTL detection study (GWAS) using genomic prediction methodology, pp. 275–298 in Genome-Wide Association Studies and Genomic Prediction, edited by John M. Walker. Humana Press, Hertfordshire, UK.
Gelman A., 2006. Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian Anal. 1(3): 515–534.
Geyer C. J., 1992. Practical Markov Chain Monte Carlo. Stat. Sci. 7(4): 473–483.
Gianola D., 2013. Priors in whole-genome regression: the Bayesian alphabet returns. Genetics 194(3): 573–596.
Guo G., Zhao F., Wang Y., Zhang Y., Du L., et al., 2014. Comparison of single-trait and multiple-trait genomic prediction models. BMC Genet. 15(1): 30.
Henderson C. R., Quaas R. L., 1976. Multiple trait evaluation using relatives’ records. J. Anim. Sci. 43(6): 1188–1197.
Heslot N., Yang H. P., Sorrells M. E., Jannink J. L., 2012. Genomic selection in plant breeding: A comparison of models. Crop Sci. 52(1): 146–160.
Heslot N., Akdemir D., Sorrells M. E., Jannink J. L., 2014. Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions. Theor. Appl. Genet. 127(2): 463–480.
Huang A., Wand M. P., 2013. Simple marginally noninformative prior distributions for covariance matrices. Bayesian Anal. 8(2): 439–452.
Jarquín D., Crossa J., Lacaze X., Cheyron P. D., Daucourt J., et al., 2014. A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theor. Appl. Genet. 127(3): 595–607.
Jia Y., Jannink J. L., 2012. Multiple-trait genomic selection methods increase genetic value prediction accuracy. Genetics 192(4): 1513–1522.
Jiang J., Zhang Q., Ma L., Li J., Wang Z., et al., 2015. Joint prediction of multiple quantitative traits using a Bayesian multivariate antedependence model. Heredity 115(1): 29–36.
Johnson R. A., Wichern D. W., 1992. Applied Multivariate Statistical Analysis, Vol. 4. Prentice Hall, Englewood Cliffs, NJ.
Link W. A., Eaton M. J., 2012. On thinning of chains in MCMC. Methods Ecol. Evol. 3(1): 112–115.
López-Cruz M. A., Crossa J., Bonnet D., Dreisigacker S., Poland J., et al., 2015. Increased prediction accuracy in wheat breeding trials using a markers × environment interaction genomic selection model. G3 (Bethesda) 5(4): 569–582.
MacEachern S. N., Berliner L. M., 1994. Subsampling the Gibbs sampler. Am. Stat. 48(3): 188–190.
Meuwissen T. H., Hayes B. J., Goddard M. E., 2001. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157(4): 1819.
Montesinos-López O. A., Montesinos-López A., Pérez-Rodríguez P., de los Campos G., Eskridge K. M., et al., 2015. Threshold models for genome-enabled prediction of ordinal categorical traits in plant breeding. G3 (Bethesda) 5(1): 291–300.
Pérez-Rodríguez P., Gianola D., González-Camacho J. M., Crossa J., Manes Y., et al., 2012. A comparison between linear and non-parametric regression models for genome-enabled prediction in wheat. G3 (Bethesda) 2(12): 1595–1605.
Peters S. O., Kizilkaya K., Garrick D. J., Fernando R. L., Reecy J. M., et al., 2012. Bayesian genome-wide association analysis of growth and yearling ultrasound measures of carcass traits in Brangus heifers. J. Anim. Sci. 90(10): 3398–3409.
Pollak E. J., Van der Werf J., Quaas R. L., 1984. Selection bias and multiple trait evaluation. J. Dairy Sci. 67(7): 1590–1595.
Pszczola M., Veerkamp R. F., de Haas Y., Wall E., Strabel T., et al., 2013. Effect of predictor traits on accuracy of genomic breeding values for feed intake based on a limited cow reference population. Animal 7(11): 1759–1768.
R Core Team, 2015. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.
Roy A., Leiva R. A., 2008. Testing of a Structured Covariance Matrix for Three-level Repeated Measures Data. UTSA, College of Business; San Antonio, Texas.
Rutkoski J., Poland J., Mondal S., Autrique E., Crossa J., Reynolds M., Singh R., 2016. Predictor traits from high-throughput phenotyping improve accuracy of pedigree and genomic selection for yield in wheat. G3 (in press).
Sanderson C., 2010. Armadillo: An Open Source C++ Algebra Library for Fast Prototyping and Computationally Intensive Experiments. Tech. Rep. NICTA, QLD, Australia.
Schaeffer L. R., 1984. Sire and cow evaluation under multiple trait models. J. Dairy Sci. 67(7): 1567–1580.
Simpson S. L., Edwards L. J., Styner M. A., Muller K. E., 2014. Kronecker Product Linear Exponent AR(1) Correlation Structures for Multivariate Repeated Measures. PLoS One 9(2): e88864. doi: 10.1371/journal.pone.0088864
Stroustrup B., 2000. The C++ Programming Language, Ed. 4. Addison-Wesley, New York.
VanRaden P. M., 2008. Efficient methods to compute genomic predictions. J. Dairy Sci. 91(11): 4414–4423.

Data Availability Statement

The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article.
