Comparison of analyses of the XVth QTLMAS common dataset III: Genomic Estimations of Breeding Values

Pascale Le Roy; Olivier Filangi; Olivier Demeure; Jean-Michel Elsen

doi:10.1186/1753-6561-6-S2-S3

. 2012 May 21;6(Suppl 2):S3. doi: 10.1186/1753-6561-6-S2-S3

Comparison of analyses of the XV^thQTLMAS common dataset III: Genomic Estimations of Breeding Values

Pascale Le Roy ^1,^2,^✉, Olivier Filangi ^1,², Olivier Demeure ^1,², Jean-Michel Elsen ³

PMCID: PMC3363157 PMID: 22640599

Abstract

Background

The QTLMAS XV^thdataset consisted of pedigree, marker genotypes and quantitative trait performances of animals with a sib family structure. Pedigree and genotypes concerned 3,000 progenies among those 2,000 were phenotyped. The trait was regulated by 8 QTLs which displayed additive, imprinting or epistatic effects. The 1,000 unphenotyped progenies were considered as candidates to selection and their Genomic Estimated Breeding Values (GEBV) were evaluated by participants of the XV^thQTLMAS workshop. This paper aims at comparing the GEBV estimation results obtained by seven participants to the workshop.

Methods

From the known QTL genotypes of each candidate, two "true" genomic values (TV) were estimated by organizers: the genotypic value of the candidate (TGV) and the expectation of its progeny genotypic values (TBV). GEBV were computed by the participants following different statistical methods: random linear models (including BLUP and Ridge Regression), selection variable techniques (LASSO, Elastic Net) and Bayesian methods. Accuracy was evaluated by the correlation between TV (TGV or TBV) and GEBV presented by participants. Rank correlation of the best 10% of individuals and error in predictions were also evaluated. Bias was tested by regression of TV on GEBV.

Results

Large differences between methods were found for all criteria and type of genetic values (TGV, TBV). In general, the criteria ranked consistently methods belonging to the same family.

Conclusions

Bayesian methods - A<B<C<Cπ - were the most efficient whatever the criteria and the True Value considered (with the notable exception of the MSEP of the TBV). The selection variable procedures (LASSO, Elastic Net and some adaptations) performed similarly, probably at a much lower computing cost. The TABLUP, which combines BayesB and GBLUP, generally did well. The simplest methods, GBLUP or Ridge Regression, and even worst, the fixed linear model, were much less efficient.

Background

In 1990, Lande and Thompson [1] defined a two steps marker assisted selection procedure. Firstly, apparent effects of markers were estimated in a reference population. Secondly, during n generations, breeding values of candidates to selection were calculated from these estimated effects giving a so called Molecular Scores. These ideas, which founded the genomic selection, were more recently made operational by SNP chips which provide tens of thousands genotypes per individual. The seminal paper of Meuwissen et al. [2] presented a few statistical approaches of these Genomic Estimated Breeding Values (GEBV). A large literature followed, describing and comparing various methods.

These methods could be classified according to the assumption made concerning the variance of chromosome segments effects. The simplest assumption, assumed in BLUP methodology [2] or Ridge Regression [3], is that the variance of these effects is equal for all chromosome segments. However, this hypothesis is not consistent with classical genetic prior, and observations, that only a few chromosome segments contain QTL, with various extent of their effects, while most chromosome segments do not contain QTL.

Variable selection procedures were proposed to better fit this biological knowledge. In [2], a stepwise procedure, including a QTL detection step through single segments regression analyses, was envisaged in the least square framework. The efficiency for genomic evaluation of more advanced penalized regression approaches were evaluated, like sparse PLS [4], LASSO [5] or Elastic Net [6], which all allow the vast majority of loci to have null regression coefficients.

On the other hand, Bayesian methods were proposed to take into account the between chromosome segments variances heterogeneity. In BayesA [2], each chromosome segment is given its own variance, all segments contributing to the variability. This last hypothesis is made free with other Bayesian techniques which assume that only a fraction π of the segments carry QTL: BayesB keeps the between segments variance heterogeneity, while BayesC considers a single variance for the active segments. In BayesCπ, the proportion π is estimated from the data [7].

The QTLMAS XV^thdataset consisted of the pedigree, marker genotypes and quantitative trait performances of animals with a sib family structure [8]. Pedigree and genotypes concerned 3,000 progenies among those 2,000 were phenotyped. The trait was regulated by 8 QTLs which displayed additive, imprinting or epistatic effects. The 1,000 unphenotyped progenies were considered as candidates to selection. Participants of the XV^thQTLMAS workshop were invited to predict GEBV of these 1,000 individuals and to send to the organizers the description of their methods and results before the meeting. This paper aims at comparing the GEBV estimations obtained by participants to the workshop. Comparing the results obtained by the different groups should provide insight into determining which method is best fitted to analyze this kind of data set.

Methods

Simulated data

The simulated data set was described by Elsen et al. [8]. Briefly, the population comprised 3,000 individuals born from 20 sires and 200 dams , i.e. 10 dams per sire. Within each family, 10 progenies were assigned phenotypes and marker genotypes and 5 were assigned only marker genotypes. A total of 10,000 SNPs carried by 5 chromosomes of 1 Morgan each were simulated. Eight QTLs were simulated: one quadri-allelic additive QTL with a large effect on Chr1, two linked QTLs in phase on Chr2, two linked QTLs in repulsion on Chr3, one imprinted QTL on Chr4 and two interacting QTLs on Chr5. Random noise was added giving an heritability coefficient of 0.30. The marker density, linkage disequilibrium (LD) and minor allele frequency (MAF) were similar to real life parameters.

Computation of the true genotypic and breeding values

"True" genetic values of the candidates to selection were calculated from simulated QTL genotypes information. Two values were calculated for each candidate. Firstly, a True Genotypic Value (TGV) defined as the sum of the 5 chromosomal genotypic values corresponding to the candidate genotypes at each of these chromosomes. The TGV of candidate i, TGV_idepends on its QTL genotypes $(g_{i}^{1}, g_{i}^{2} . . . g_{i}^{5})$ and on the QTL effects (a_j) given in the Table 1 of Elsen et al. [8]:

Table 1.

Methods used by the participants to the XV^thQTLMAS workshop

First author	Label	Method	Description
Shariati	BayesS_1	2 steps (all SNP)	First step: a GBLUP giving estimation of SNP effects. Groups of size 150, 75 (SPNa) or 50 (SNPb) are made assembling SNP of similar effect. Second step: BayesA with all or a limited (1500 or 450) number of SNP and a unique SNP effect variance per group.

	BayesS_2	2 steps (1500 SNP)

	BayesS_3	2 steps-Bayes (450 SNPa)

	BayesS_4	2 steps-Bayes (450 SNPb)

Ogutu	RR	Ridge regression

	GBLUP_O	GBLUP	Qualified Ridge Regression BLUP by the authors

	LASSO_O	LASSO

	LASSO_ad	Adaptative LASSO	Following Zou [21], data-driven weights are added to the penalty to force LASSO to be consistent

	EN	Elastic net

	EN_ad	Adaptative EN	Mixture of adaptative lasso and EN

Wang	BayesA_W	BayesA

	BayesB_W	BayesB

	BayesCπ_W	BayesCπ

	TABLUP	TABLUP	In the genomic matrix, loci IBD probability estimations are weighted by their effect variance estimated from BayesB [11]

	GBLUP_W	GBLUP

Mucha	AM	Animal model	All models are estimating haplotypes effects. Haplotypes are obtained using the PHASE software [18]. RM1 and RM2 differ by the estimation of the haplotype effect variance

	FM	Fixed effect

	RM1	Random model 1

	RM2	Random model 2

Zeng	GBLUPa_Z	GBLUP1	Additive effect only

	GBLUPd_Z	GBLUP2	Additive and dominance effect

	BayesB _W	BayesB

	BayesCπ_W	BayesCπ

Usai	LASSO_Uc	LASSO-LARS classic	The penalty is describes as ∑\|β_j\|≤t. In the LASSO-LARS classic, the t parameter is the average number of active SNP in 1000 simulations. In strategy 1, the number which occurred more than 5% of the times and in strategy 2, which minimized a selection criteria

	LASSO_Uc1	LASSO-LARS strategy 1

	LASSO_Uc2	LASSO-LARS strategy 2

Schurink	BayesZ	BayesZ	Similar to BayesCπ, with a Bernoulli prior for π

Open in a new tab

T G V_{i} = \sum_{j = 1}^{5} a_{j} (g_{i}^{j})

Secondly, the expectation of the genotypic value of candidate's progenies was calculated, according to the same principle, i.e. as a sum of chromosomal genotypic values. It depends on the QTL genotypes of the candidate, on the QTL effects and on the frequencies of QTL genotypes in the population, i.e. the QTL genotypes probabilities of the mate of the candidate. This breeding value was noted TBV_i:

T B V_{i} = \sum_{j = 1}^{5} \sum_{g^{j} = 1}^{n} p r o b (g^{j} / g_{i}^{j}) * a_{j} (g^{j})

where the g^jare the n QTL possible genotypes on the chromosome j (n=(10,9,9,4,9) for the chromosomes 1 to 5 respectively), $p r o b (g^{j} / g_{i}^{j})$ is the probability of the genotype g^jfor the progeny of the candidate i given the candidate's genotype and a_j(g^j) is the genotypic value associated to that QTL genotype (Table 1 in [8]). Concerning the QTL6 on chromosome 4, which was imprinted, candidates were considered as those which give the allele expressed by their progeny.

The participants were sent the TGV and TBV only after the meeting.

Methods used by the participants

The participants estimated Genomic Estimated Breeding Values, noted GEBV in the following, and sent them, with a short description of the methods used, to organizers before the meeting. A total of 27 methods were studied by the participants (table 1). Most of them belong to the three categories which were presented in the introduction: (i) (G)BLUP methods - including Ridge regression [9], GBLUP describing dominance effect [10] and TABLUP [11] where the genomic matrix includes information about the SNP effect variance (here estimated using BayesB) [12] (ii) Selection variable procedures - LASSO and Elastic Net, including adaptative versions which aim at forcing the LASSO to be consistent, i.e. to correctly estimate the subset of zero coefficients with a probability tending to 1 [9,13] and (iii) Bayesian approaches [12,10] - including the BayesZ [14,15] and a new two-steps Bayes procedure intermediate between the BayesA or B (one variance for each SNP) and the BayesC (a single variance for the active SNP), with a grouping of SNP based on their effect estimated with a GBLUP [16]. This method will be given the "BayesS" acronym in the following. Mucha et al. [17] used simple linear models (fixed or random) with the idea of estimating haplotype rather than SNP effects, the haplotypes being inferred with the PHASE software [18].

Comparison criteria

Results (GEBV as given by the participants) were compared based on 4 criteria. For each criteria, the two True Values (TGV and TBV) were considered. Accuracy of GEBV was calculated as the Pearson's correlation between the TV and the GEBV. Ability to identify the best individuals was assessed from the Spearman's rank correlation between the TV and the GEBV in the top 10% of TV. Bias was assessed from the linear regression coefficient (named also the regression slope) of the TV on the GEBV. Finally, mean squared error of prediction was calculated on GEBV and TV centered on zero.

Results

They are presented in table 2 (TGV) and 3 (TBV). The ranking is nearly the same for those two values.

Table 2.

Comparison of True Genomic Values estimations

First author	Label	r	rank	bias	MSE
Shariati	BayesS_1	0.86	0.53	0.89	7.51

	BayesS_2	0.86	0.53	0.89	7.52

	BayesS_3	0.86	0.53	0.88	7.85

	BayesS_4	0.85	0.55	0.87	8.00

Ogutu	RR	0.85	0.54	1.19	8.44

	GBLUP_O	0.90	0.52	1.11	5.55

	LASSO_O	0.92	0.63	1.09	4.67

	LASSO_ad	0.92	0.62	1.02	4.30

	EN	0.92	0.62	1.23	4.96

	EN_ad	0.90	0.40	0.97	5.48

Wang	BayesA_W	0.92	0.65	1.06	4.15

	BayesB_W	0.93	0.70	1.05	3.66

	BayesCπ_W	0.93	0.70	1.06	3.63

	TABLUP	0.91	0.68	0.97	4.59

	GBLUP_W	0.78	0.37	1.20	11.71

Mucha	AM	0.61	0.36	1.06	17.57

	FM	0.49	0.32	0.35	43.17

	RM1	0.70	0.38	1.77	16.65

	RM2	0.71	0.38	1.69	16.30

Zeng	GBLUPa_Z	0.82	0.53	1.04	8.94

	GBLUPd_Z	0.81	0.52	1.04	9.46

	BayesB _W	0.93	0.71	1.05	3.63

	BayesCπ_W	0.94	0.72	1.07	3.41

Usai	LASSO_Uc	0.92	0.62	1.25	5.04

	LASSO_Uc1	0.90	0.64	1.02	5.30

	LASSO_Uc2	0.92	0.63	1.09	4.66

Schurink	BayesZ	0.90	0.60	1.06	5.20

Open in a new tab

r=Pearson correlation between TGV and GEBV, rank=rank correlation of the best 10% TGV, bias = regression coefficient between TGV and GEBV, MSEP= mean squared error of prediction of TGV by GEBV.

Table 3.

Comparison of True Breeding Values estimations

First author	Label	r	rank	bias	MSE

Shariati	BayesS_1	0.84	0.48	0.33	12.92

	BayesS_2	0.84	0.47	0.33	12.94

	BayesS_3	0.83	0.49	0.33	13.37

	BayesS_4	0.82	0.49	0.32	13.62
Ogutu	RR	0.83	0.52	0.45	5.55

	GBLUP_O	0.81	0.51	0.39	8.23

	LASSO_O	0.87	0.55	0.43	6.44

	LASSO_ad	0.88	0.60	0.37	9.94

	EN	0.87	0.52	0.44	5.86

	EN_ad	0.81	0.48	0.33	12.01

Wang	BayesA_W	0.86	0.61	0.38	9.07

	BayesB_W	0.89	0.66	0.38	9.12

	BayesCπ_W	0.88	0.65	0.39	9.00

	TABLUP	0.88	0.64	0.36	10.88

	GBLUP_W	0.77	0.48	0.46	4.98

Mucha	AM	0.59	0.37	0.40	5.93

	FM	0.47	0.44	0.13	43.01

	RM1	0.70	0.34	0.68	2.54

	RM2	0.70	0.34	0.65	2.68

Zeng	GBLUPa_Z	0.82	0.50	0.40	7.61

	GBLUPd_Z	0.81	0.49	0.40	7.59

	BayesB _W	0.89	0.66	0.38	9.22

	BayesCπ_W	0.89	0.66	0.39	8.84

Usai	LASSO_Uc	0.86	0.53	0.45	5.66

	LASSO_Uc1	0.86	0.62	0.37	9.54

	LASSO_Uc2	0.87	0.55	0.43	6.48

Schurink	BayesZ	0.87	0.63	0.39	5.20

Open in a new tab

(r=Pearson correlation between TBV and GEBV, rank=rank correlation of the best 10% TBV, bias = regression coefficient between TBV and GEBV, MSE= mean squared error of prediction of TBV by GEBV)

Accuracy

The Pearson correlation between GEBV and the TV were consistent within type of technique used. The range was large, from 0.49 (GEBV- TGV correlation, 0.47 for GEBV-TBV) for the Mucha et al. [15] fixed effect model to 0.94 (GEBV- TGV correlation, 0.89 for GEBV-TBV) for the Zeng et al. [10] BayesCπ. The highest correlations were obtained with the Bayesian approaches, with a very good performance of BayesCπ [10,12] which overperformed BayesZ, a similar approach based on an alternative prior. The very limited number of QTL simulated in the dataset, a situation far from the BayesZ prior, is a possible explanation for this difference. The same argument could explain the lower performance of the BayesS [16], where SNP effects are assembled in groups of similar effects. The TABLUP, which mixes BayesB estimation and GBLUP, was intermediate between the "classical" and the new Bayesian approaches of Shariati et al. [16].

The variable selection procedures can work nearly as well as the BayesB or C, in particular the LASSO and Elastic net [9,13]. However the adaptative Elastic Net did not give the expected improvement.

The GBLUP performances were more variable with a very low correlation given by the Mucha et al. [17] version based on haplotypes, and higher values for the Zeng et al. [10] and Ogutu et al. [9] proposals. Finally, the fixed effect linear model was far below all other methods.

Even if all tendencies were observed for both groups of correlations, the correlations between GEBV and the TBV were always lower than the correlations between GEBV and TGV. These last correlations were always lower than the former.

Rank correlation

As compared to the Pearson's correlation, this criteria, which illustrates how methods can capture the best individuals, shows a similar range (0.32 to 0.72, i.e. 0.4 points of correlation between extreme situations), i.e. the fixed model and the BayesCπ. Globally the classification between groups of methods is the same: Bayes methods outperformed the selection variable approaches, the GBLUP family arrived last in the classification. The only exception was the TABLUP which was positioned between the two first groups. However, within some groups, differences were exacerbated. This was particularly true for the Bayes group, where the ranking BayesCπ > BayesB > BayesA > BayesZ > BayesS was preserved, and even more for the Ogutu et al. [9] selection variable, with a very low correlation observed for the adaptative Elastic Net. Notably the random model proposed by Mucha et al. [17] fell in the worst positions with this criterion.

The rank correlation between GEBV and TBV is generally lower than the GEBV-TGV one, with some exceptions (GBLUP [12], animal and fixed models [17], adaptative EN [9]).

Regression coefficient (or regression slope)

Unbiased estimators are supposed to have a regression coefficient of 1. Most of the regression coefficients observed were in the range 0.85-1.25. The ranking of the Bayesian techniques were consistently correct, while the coefficient were more variable for the other approaches. Three of the methods proposed by Mucha et al. [15] clearly gave biased estimations (the fixed and both random models).

Mean squared error of prediction (MSEP)

The results are still very consistent with the other criteria. The Bayesian techniques (excluding BayesS) and the selection variable techniques (LASSO or EN) gave the more precise estimations of the TGV. TABLUP was in the same range. The GBLUP and BayesS performed not as well and Mucha et al. haplotypes models [17] did very badly.

The MSEP of the TBV were quite different and were above or under the TGV MSEP depending on the method. The more precise estimation was given by the Mucha et al. [17] random model. LASSO, EN and GBLUP were satisfying, while the Bayesian approaches (in particular the BayesS) provided high Mean Squared Error of Prediction.

Conclusions

The very general tendency is a better ranking of the Bayesian methods, in the alphabetic order (A<B<C<Cπ) whatever the criteria and the True Value considered (with the notable exception of the MSEP of the TBV). The Selection variable procedures (LASSO, Elastic Net and some adaptations) performed similarly, probably at a much lower computing cost. The TABLUP, which combines BayesB and GBLUP, generally did well. The simplest methods, GBLUP or Ridge Regression, and even worst, the fixed linear model, were much less efficient. The approach followed by Mucha et al. [17] to incorporate haplotype information was not efficient.

These observations are consistent with the results presented in the previous analyses of QTLMAS data [19-21], even if the genetic architecture simulated was restricted to a quite limited (8) number of QTL. It may be that this oligogenic situation did not work in favor of methods probably more suited to highly polygenic cases, such as BayesS [16] or BayesZ [15].

Competing interests

The authors declare that they have no competing interests.

List of abbreviations used

SNP: Single Nucleotide Polymorphism; QTL: Quantitative Trait Locus; MAF: Minor Allele Frequency; LD: Linkage Disequilibrium; GEBV: Genomic Estimated Breeding Value; TBV: True Breeding Value; TGV: True Genomic Value; LASSO: Least Absolute Shrinkage and Selection Operators; EN: Elastic Net; MSEP: Mean Squared Error of Prediction; GBLUP: Genomic Best Linear Unbiased Prediction.

Authors' contributions

OD and OF collected and processed the data files. PLR analyzed the data. PLR and JME wrote the manuscript. All authors contributed to the ideas and methods, and read and approved the manuscript.

Contributor Information

Pascale Le Roy, Email: Pascale.LeRoy@rennes.inra.fr.

Olivier Filangi, Email: olivier.filangi@rennes.inra.fr.

Olivier Demeure, Email: olivier.demeure@rennes.inra.fr.

Jean-Michel Elsen, Email: Jean-Michel.Elsen@toulouse.inra.fr.

Acknowledgements

This article has been published as part of BMC Proceedings Volume 6 Supplement 2, 2012: Proceedings of the 15th European workshop on QTL mapping and marker assisted selection (QTL-MAS). The full contents of the supplement are available online at http://www.biomedcentral.com/bmcproc/supplements/6/S2.

References

Lande R, Thompson R. Efficiency marker assisted selection in the improvement of quantitative traits. Genetics. 1990;124:743–756. doi: 10.1093/genetics/124.3.743. [DOI] [PMC free article] [PubMed] [Google Scholar]
Meuwissen THE, Hayes BJ, Goddard ME. Prediction of Total Genetic Value using genome-wide dense marker maps. Genetics. 2001;157:1819–1829. doi: 10.1093/genetics/157.4.1819. [DOI] [PMC free article] [PubMed] [Google Scholar]
Piepho HP. Ridge regression and extensions for genome wide selection in maize. Crop Science. 2009;49:1165–1176. doi: 10.2135/cropsci2008.10.0595. [DOI] [Google Scholar]
Colombani C, Legarra A, Croiseau P, Guillaume F, Fritz S, Ducrocq V, Robert-Granié C. In: Proceedings of the 9th World Congress of Genetics Applied to Livestock Production: 1-6 August 2010; Leipzig Germany. Gesellschaft für Tierzuchtwissenschaften, editor. 2010. Application of PLS and Sparse PLS regression in genomic selection; p. 0439. [Google Scholar]
Usai MG, Goddard ME, Hayes BJ. LASSO with cross-validation for genomic selection. Genet Res. 2009;91:427–436. doi: 10.1017/S0016672309990334. [DOI] [PubMed] [Google Scholar]
Croiseau P, Colombani C, Legarra A, Guillaume F, Fritz S, Baur A, Dassonneville R, Patry C, Robert-Granié C, Ducrocq V. In: Proceedings of the 9th World Congress of Genetics Applied to Livestock Production: 1-6 August 2010; Leipzig Germany. Gesellschaft für Tierzuchtwissenschaften, editor. 2010. Improving genomic evaluation strategies in dairy cattle through SNP selection; p. 0360. [Google Scholar]
Habier D, Fernando RL, Kizilkaya K, Garrick DJ. In: Proceedings of the 9th World Congress of Genetics Applied to Livestock Production: 1-6 August 2010; Leipzig Germany. Gesellschaft für Tierzuchtwissenschaften, editor. 2010. Extension of The Bayesian alphabet for genomic selection; p. 0468. [Google Scholar]
Elsen JM, Tesseydre S, Filangi O, Le Roy P, Demeure O. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. XVth QTLMAS: simulated dataset. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ogutu JO, Schulz-Streeck T, Piepho HP. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zeng J, Pszczola M, Wolc A, Strabel T, Fernando RL, Garrick DJ, Dekkers JCM. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Genomic Breeding Value Prediction and QTL mapping of QTLMAS2011 data using Bayesian and GBLUP methods. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang Z, Liu J, Ding X, Bijma P, de Koning DJ, Zhang Q. Best Linear Unbiased Prediction of Genomic Breeding Values Using a Trait-Specific Marker-Derived Relationship Matrix. PLoS ONE. 2010;5(9):e12648. doi: 10.1371/journal.pone.0012648. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang CL, Ma PP, Zhang Z, Ding XD, Liu JF, Fu WX, Weng ZQ, Zhang Q. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Comparison of five methods for genomic breeding value estimation for the common dataset of the 15th QTL-MAS Workshop. [DOI] [PMC free article] [PubMed] [Google Scholar]
Usai MG, Carta A, Casu S. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Alternative strategies for selecting subsets of predicting SNPs by LASSO-LARS procedure. [DOI] [PMC free article] [PubMed] [Google Scholar]
Janss LLG. Bayz manual version 2.03. Leiden, The Netherlands, Janss Biostatistics; 2010. [Google Scholar]
Schurink A, Janss LLG, Heuven HCM. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Bayesian Variable Selection to identify QTL affecting a simulated quantitative trait. [DOI] [PMC free article] [PubMed] [Google Scholar]
Shariati MM, Sorensen P, Janss L. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. A two step Bayesian approach for genomic prediction of breeding values. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mucha A, Wierzbicki H. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Linear models for breeding values prediction in haplotype-asssited selection - an analysis of QTLMAS Workshop 2011 Data. [DOI] [PMC free article] [PubMed] [Google Scholar]
Stephens M, Smith N, Donnelly P. A new statistical method for haplotype reconstruction from population data. American Journal of Human Genetics. 2001;68:978–989. doi: 10.1086/319501. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pszczola M, Strabel T, Wolc A, Mucha S, Szydlowski M. Comparison of analyses of the QTLMAS XIV common dataset. I: genomic selection. BMC Proceedings. 2011;5(Suppl 3):S1. doi: 10.1186/1753-6561-5-S3-S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bastiaansen J, Bink M, Coster A, Maliepaard C, Calus M. Comparison of analyses of the QTLMAS XIII common dataset. I: genomic selection. BMC Proceedings. 2010;4(Suppl 1):S1. doi: 10.1186/1753-6561-4-s1-s1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lund M, Sahana G, de Koning D-J, Su G, Carlborg Ö. Comparison of analyses of the QTLMAS XII common dataset. I: Genomic selection. BMC Proceedings. 2009;3(Suppl 1):S1. doi: 10.1186/1753-6561-3-s1-s1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zou H. The adaptive lasso and its oracle properties. Journal of the American Statistical Association. 2006;101:1418–1429. doi: 10.1198/016214506000000735. [DOI] [Google Scholar]

[B1] Lande R, Thompson R. Efficiency marker assisted selection in the improvement of quantitative traits. Genetics. 1990;124:743–756. doi: 10.1093/genetics/124.3.743. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B2] Meuwissen THE, Hayes BJ, Goddard ME. Prediction of Total Genetic Value using genome-wide dense marker maps. Genetics. 2001;157:1819–1829. doi: 10.1093/genetics/157.4.1819. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] Piepho HP. Ridge regression and extensions for genome wide selection in maize. Crop Science. 2009;49:1165–1176. doi: 10.2135/cropsci2008.10.0595. [DOI] [Google Scholar]

[B4] Colombani C, Legarra A, Croiseau P, Guillaume F, Fritz S, Ducrocq V, Robert-Granié C. In: Proceedings of the 9th World Congress of Genetics Applied to Livestock Production: 1-6 August 2010; Leipzig Germany. Gesellschaft für Tierzuchtwissenschaften, editor. 2010. Application of PLS and Sparse PLS regression in genomic selection; p. 0439. [Google Scholar]

[B5] Usai MG, Goddard ME, Hayes BJ. LASSO with cross-validation for genomic selection. Genet Res. 2009;91:427–436. doi: 10.1017/S0016672309990334. [DOI] [PubMed] [Google Scholar]

[B6] Croiseau P, Colombani C, Legarra A, Guillaume F, Fritz S, Baur A, Dassonneville R, Patry C, Robert-Granié C, Ducrocq V. In: Proceedings of the 9th World Congress of Genetics Applied to Livestock Production: 1-6 August 2010; Leipzig Germany. Gesellschaft für Tierzuchtwissenschaften, editor. 2010. Improving genomic evaluation strategies in dairy cattle through SNP selection; p. 0360. [Google Scholar]

[B7] Habier D, Fernando RL, Kizilkaya K, Garrick DJ. In: Proceedings of the 9th World Congress of Genetics Applied to Livestock Production: 1-6 August 2010; Leipzig Germany. Gesellschaft für Tierzuchtwissenschaften, editor. 2010. Extension of The Bayesian alphabet for genomic selection; p. 0468. [Google Scholar]

[B8] Elsen JM, Tesseydre S, Filangi O, Le Roy P, Demeure O. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. XVth QTLMAS: simulated dataset. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] Ogutu JO, Schulz-Streeck T, Piepho HP. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] Zeng J, Pszczola M, Wolc A, Strabel T, Fernando RL, Garrick DJ, Dekkers JCM. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Genomic Breeding Value Prediction and QTL mapping of QTLMAS2011 data using Bayesian and GBLUP methods. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] Zhang Z, Liu J, Ding X, Bijma P, de Koning DJ, Zhang Q. Best Linear Unbiased Prediction of Genomic Breeding Values Using a Trait-Specific Marker-Derived Relationship Matrix. PLoS ONE. 2010;5(9):e12648. doi: 10.1371/journal.pone.0012648. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12] Wang CL, Ma PP, Zhang Z, Ding XD, Liu JF, Fu WX, Weng ZQ, Zhang Q. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Comparison of five methods for genomic breeding value estimation for the common dataset of the 15th QTL-MAS Workshop. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B13] Usai MG, Carta A, Casu S. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Alternative strategies for selecting subsets of predicting SNPs by LASSO-LARS procedure. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14] Janss LLG. Bayz manual version 2.03. Leiden, The Netherlands, Janss Biostatistics; 2010. [Google Scholar]

[B15] Schurink A, Janss LLG, Heuven HCM. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Bayesian Variable Selection to identify QTL affecting a simulated quantitative trait. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] Shariati MM, Sorensen P, Janss L. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. A two step Bayesian approach for genomic prediction of breeding values. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17] Mucha A, Wierzbicki H. In: Proceedings of the XVth QTLMAS Workshop: 19-20 May 2011; Rennes France. Demeure O, Elsen JM, Filangi O, Le Roy P, editor. 2012. Linear models for breeding values prediction in haplotype-asssited selection - an analysis of QTLMAS Workshop 2011 Data. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] Stephens M, Smith N, Donnelly P. A new statistical method for haplotype reconstruction from population data. American Journal of Human Genetics. 2001;68:978–989. doi: 10.1086/319501. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B19] Pszczola M, Strabel T, Wolc A, Mucha S, Szydlowski M. Comparison of analyses of the QTLMAS XIV common dataset. I: genomic selection. BMC Proceedings. 2011;5(Suppl 3):S1. doi: 10.1186/1753-6561-5-S3-S1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B20] Bastiaansen J, Bink M, Coster A, Maliepaard C, Calus M. Comparison of analyses of the QTLMAS XIII common dataset. I: genomic selection. BMC Proceedings. 2010;4(Suppl 1):S1. doi: 10.1186/1753-6561-4-s1-s1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B21] Lund M, Sahana G, de Koning D-J, Su G, Carlborg Ö. Comparison of analyses of the QTLMAS XII common dataset. I: Genomic selection. BMC Proceedings. 2009;3(Suppl 1):S1. doi: 10.1186/1753-6561-3-s1-s1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B22] Zou H. The adaptive lasso and its oracle properties. Journal of the American Statistical Association. 2006;101:1418–1429. doi: 10.1198/016214506000000735. [DOI] [Google Scholar]

PERMALINK

Comparison of analyses of the XV^thQTLMAS common dataset III: Genomic Estimations of Breeding Values

Pascale Le Roy

Olivier Filangi

Olivier Demeure

Jean-Michel Elsen

Supplement

Conference

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Simulated data

Computation of the true genotypic and breeding values

Table 1.

Methods used by the participants

Comparison criteria

Results

Table 2.

Table 3.

Accuracy

Rank correlation

Regression coefficient (or regression slope)

Mean squared error of prediction (MSEP)

Conclusions

Competing interests

List of abbreviations used

Authors' contributions

Contributor Information

Acknowledgements

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Comparison of analyses of the XVth QTLMAS common dataset III: Genomic Estimations of Breeding Values

Pascale Le Roy

Olivier Filangi

Olivier Demeure

Jean-Michel Elsen

Supplement

Conference

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Simulated data

Computation of the true genotypic and breeding values

Table 1.

Methods used by the participants

Comparison criteria

Results

Table 2.

Table 3.

Accuracy

Rank correlation

Regression coefficient (or regression slope)

Mean squared error of prediction (MSEP)

Conclusions

Competing interests

List of abbreviations used

Authors' contributions

Contributor Information

Acknowledgements

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Comparison of analyses of the XV^thQTLMAS common dataset III: Genomic Estimations of Breeding Values