Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Mar 1.
Published in final edited form as: Nat Genet. 2012 Aug 19;44(9):1066–1071. doi: 10.1038/ng.2376

A mixed-model approach for genome-wide association studies of correlated traits in structured populations

Arthur Korte 1,*, Bjarni J Vilhjálmsson 1,2,*, Vincent Segura 1,3,*, Alexander Platt 1,2, Quan Long 1, Magnus Nordborg 1,2
PMCID: PMC3432668  NIHMSID: NIHMS392993  PMID: 22902788

Abstract

Genome-wide association studies (GWAS) are a standard approach for studying the genetics of natural variation. A major concern in GWAS is the need to account for the complicated dependence-structure of the data both between loci as well as between individuals. Mixed models have emerged as a general and flexible approach for correcting for population structure in GWAS. Here we extend this linear mixed model approach to carry out GWAS of correlated phenotypes, deriving a fully parameterized multi-trait mixed model (MTMM) that considers both the within-trait and between-trait variance components simultaneously for multiple traits. We apply this to human cohort data for correlated blood lipid traits from the Northern Finland Birth Cohort 1966, and demonstrate greatly increased power to detect pleiotropic loci that affect more than one blood lipid trait. We also apply this to an Arabidopsis dataset for flowering measurements in two different locations, identifying loci whose effect depends on the environment.

Introduction

Most GWAS to date have been conducted using the simplest possible statistical model: a single-locus test of association between a binary SNP genotype and a single phenotype. Given that most traits of interest are multifactorial, this clearly amounts to model misspecification, and the resulting danger of biased results whenever there is non-independence (linkage disequilibrium) between causal loci (for example due to population structure) is well known1,2,3. Much less attention has been devoted to the fact that phenotypes may also be correlated. Whenever multiple measurements are taken from individuals, the resulting phenotypes will be correlated because of pleiotropy, which is of direct interest, as well as shared environment and linkage disequilibrium, which are usually confounding factors. Taking these correlations into account is important not only because of the importance of understanding pleiotropy, but also because we may expect increased power compared to marginal analyses. Intuitively, correlated traits amount to a form of replication. The importance of correlated phenotypes becomes even clearer when we consider measurements across environments. The canonical example here is an agricultural field experiment using inbred lines, a setting in which no one would consider analyzing phenotypes from different environments independently of each other, because the whole point of the study is to separate genetic from environmental effects and identify genotype-environment interactions. In human genetics, disentangling genetic and environmental effect is also of obvious interest, although much more challenging as the environment usually cannot be experimentally manipulated4.

There is a long history of multi-trait models in quantitative genetics5,6,7,8,9, but these methods have rarely been applied to GWAS. In this paper we demonstrate how a standard linear mixed model from animal breeding10 may be used to model correlated traits while at the same time correcting for dependence among loci (e.g., due to population structure). As designs like cohort studies become more prevalent, the need for modeling correlated traits as well as population structure will grow2,11,12, and the same is true for the increasing number of non-human GWAS 13,14,15,16,17.

The mixed model, which handles population structure by estimating the phenotypic covariance that is due to genetic relatedness, or kinship, between individuals, has previously been shown to perform well in GWAS2,13,18,19,20,21,22. Here we extend this approach to handle correlated phenotypes by deriving a fully parameterized multi-trait mixed model (MTMM) that considers both the within-trait and between-trait variance components simultaneously for multiple traits (Online Methods), and implementing it for GWAS. The idea is not new23,24,25,26,27, but it has never been applied for association mapping on a genome-wide scale. Alternative approaches for GWAS in multiple traits exist, but they generally fail to control for population structure28,29, and often are not applicable to genome-wide data.

We validate our approach using extensive simulations based on available SNP data from A. thaliana30, demonstrating that our model increases power to detect associations while controlling the false discovery rate. We then demonstrate its usefulness by considering correlated blood lipid traits from the Northern Finland Birth Cohort 1966 (NFBC1966)31, and environmental plasticity in an A. thaliana data set that contains flowering measurements for two simulated growth seasons in two different locations32. Finally, we discuss its utility, not only in terms of increasing power to detect associations, but also in terms of understanding the basic genetic architecture of the phenotypes.

RESULTS

Simulations

Pairs of correlated phenotypes were simulated by adding phenotypic effects to genome-wide SNP data from A. thaliana30. A single randomly selected SNP was set to account for up to 2% of the phenotypic variance, but with the possibility of different effects in each of the two phenotypes (see below). In addition, 10,000 SNPs were given much smaller effects to simulate the genetic background. A randomly chosen fraction of these background SNPs was shared between the two phenotypes, allowing for variation in the degree of phenotypic correlation (Supplementary Fig. 1, Online Methods).

We compared our ability to identify the focal locus using MTMM and marginal, single-trait analyses (using the smallest p-value from the latter to ensure a fair comparison). Three different tests were used: a full test that compares the full model, including the effect of the marker genotype and its interaction, with a model that includes neither; an interaction effect test that compares the full model to one that does not include interaction, and finally; a common effect test that compares a model with a marker genotype to one without (see Online Methods for details). As expected, the results depended strongly on the effect of the focal polymorphism (Fig. 1). When it had the same effect in both phenotypes (i.e., positive pleiotropy or a common effect across environments; see Fig. 1a), MTMM performed slightly better than the single trait mixed model (MM) regardless of whether we tested for full model fit, or just for a common effect (Fig. 1e). The reason for this is the increased power that results from analyzing the traits together. Testing for an interaction effect alone is pointless as no interaction exist.

Figure 1.

Figure 1

Simulation results. (a–d) illustrate the scenarios simulated: (a) positive pleiotropy, alt. common effect across environments; (b) positive pleiotropy, alt. common effect across environments, with size of effect differing between traits/environments; (c) effect only on one trait, alt. only in one environment; (d) negative pleiotropy, alt. opposite effect across environments. (e) shows the estimated relationship between power and false discovery rate (FDR) using six different statistical tests (see text and Online Methods) for the scenario described in (a). (f) shows the estimated relationship between power and false discovery rate (FDR) for the scenario described in (b). (g) shows the estimated relationship between power and false discovery rate (FDR) for the scenario described in (c). (h) shows the estimated relationship between power and false discovery rate (FDR) for the scenario described in (d). Dots on curves denote nominal Bonferroni-corrected 5% significance thresholds. Note that both power and FDR are calculated with respect to the single focal locus, only.

When the effect of the polymorphism is slightly weaker in one trait/environment (Fig. 1b), testing for a full model fit using MTMM again outperforms single-trait analyses (Fig. 1f). Testing only for a common or interaction effect using MTMM is also less effective. Although an interaction effect now exists, it is too weak to be detected. However, as the strength of the interaction effect increases (Figs. 1c,d), it becomes possible to detect directly, and the relative advantage of using MTMM increases dramatically (Figs. 1g–h).

An alternative to carrying out two marginal single-trait analyses might be to combine the phenotypes, e.g., by fitting the traits principal components or their sum or difference. We tested the latter and as might be expected, this approach works very well when the focal SNP has exactly the same (or the opposite, when using the difference) effect on the phenotype (Fig. 1a,d). However, if the effect of the SNPs differs between the two traits, MTMM outperforms these approaches (Fig. 1b–c).

It should be noted that, because the background SNPs are correlated due to population structure, simple single-locus tests of association are strongly biased towards false positives, just like in the original data14. The mixed model effectively removes this bias, regardless of whether we analyze one phenotype at a time using a single trait MM or both simultaneously using MTMM (Supplementary Fig. 2). However, analyzing these data with methods that do not take population structure into account is clearly not a realistic option (Supplementary Fig. 3).

In addition to the model just described, we simulated an oligogenic scenario in which each phenotype was determined by 20 loci, each of which could, with equal probability: affect that phenotype only; affect both phenotypes in the same way, or; affect both phenotypes but in opposite ways. The behavior of each locus was chosen independently, and the resulting distribution of correlations between the phenotypes was thus centered around zero (Supplementary Fig. 4), which is very different from the positively correlated phenotypes generated under the first simulation scenario (Supplementary Fig. 1). MTMM is intended for correlated phenotypes, and is expected to perform less well when phenotypes are weakly correlated. The oligogenic simulation results supported this intuition. For weakly correlated pairs of phenotypes, single-trait analysis often outperformed MTMM (especially for detecting SNPs with effect in one phenotype only), however, for more strongly correlated phenotypes, the results agreed with those presented above in that MTMM always outperformed marginal analyses (Supplementary Fig. 5). Note that the correlation does not have to be positive: for negatively correlated phenotypes, MTMM has relatively higher power to detect SNPs with the same effect in both phenotypes, whereas for positively correlated phenotypes, it performs best for SNPs that have opposing effects (note that it may sometimes make sense to simply change the direction of correlation by negating one of the phenotypes when analyzing real data).

As noted in the introduction, an advantage of MTMM is that it can be used for correlated phenotypes regardless of whether the phenotypes represent different measurements (and the correlations are due to pleiotropy), or the same trait measured in different environments (cf. Fig. 1a–d). However, the simulations above assume that the phenotypic correlations are solely due to genetics, not environment, and this is only likely to be true for studies involving inbred lines in controlled environments. Certainly correlations between pleiotropic traits will reflect environment as well as genotype. To verify that MTMM is able to separate these effects, we simulated another 5,000 pairs of correlated traits using the 10,000locus model, but now with correlations reflecting environmental as well as genetic covariance (see Online Methods). Both the environmental and genetic correlations were well estimated (Supplementary Fig. 6), although it should be noted that the residuals of the genetic and environmental correlation estimates are negatively correlated (Supplementary Fig. 6d). The accuracy of these estimates does affect the performance of GWAS, but the effect appears to be relatively minor (Supplementary Fig. 7).

Pleiotropy in human data

To illustrate the utility of MTMM for traits that are correlated because they are part of the same biological system, we reanalyzed data from the Northern Finland Birth Cohort 1966 (NFBC1966)31 (see Online Methods for details). We focused on measurements of four blood metabolites that are strongly involved in cardiovascular heart disease33, namely triglycerol (TG), low-density lipoprotein (LDL), high-density lipoprotein (HDL), and C-reactive protein (CRP). These metabolites are significantly correlated, and MTMM analysis indicates that the correlations are caused by genetics as well as environment (Table 1), supporting the notion that these traits are mechanistically related and/or have linked causal loci. For HDL/CRP and TG/CRP the correlations of the genetic effects are in the opposite direction of the environmental correlations. However, in these cases, the genetic correlations are not significantly different from zero, and it is likely that the phenotypic correlations driven primarily by the shared environment.

Table 1.

MTMM estimates of correlation and heritability in the NFBC 1966 data

Genetic Environmental
Phenotypicα corr. SE pval corr. SE pval Heritabilityβ
HDL/TG -0.37 -0.42 0.14 0.024 -0.36 0.06 1:58 × 10-8 0.38/0.18
HDL/LDL -0.13 -0.19 0.11 0.085 -0.09 0.08 0.26 0.39/0.45
HDL/CRP -0.19 0.24 0.23 0.25 -0.34 0.06 1:50 × 10-7 0.39/0.14
TG/LDL 0.32 0.31 0.14 0.062 0.35 0.06 9:64 × 10-7 0.19/0.44
TG/CRP 0.21 -0.50 0.39 0.115 0.34 0.05 3:19 × 10-9 0.18/0.13
LDL/CRP 0.09 0.08 0.19 0.65 0.10 0.06 0.12 0.45/0.13
α

Direct estimates of the Pearson correlation are identical to the precision given.

β

The SE of all heritability estimates is between 0.05 and 0.06. Single-trait estimates are: 0.38 (HDL), 0.18 (TG), 0.45 (LDL) and 0.13 (CRP).

In terms of associations, the results from the joint analysis of TG and LDL suffice to illustrate how two of our main predictions were borne out. First, almost all SNPs that were found to be significantly associated in the marginal analysis of either LDL or TG were also significant in the joint analysis (Table 2). However, MTMM arguably provides greater insight into the nature of the associations, as it reveals interaction effects. Second, MTMM finds associations that the marginal analyses do not. In particular, for positively correlated phenotypes such as TG and LDL, we expect MTMM to have much greater power to detect polymorphism whose effects differ greatly between the phenotypes. A nice example of this is the FADS1-FADS2 locus, which was not significant in either marginal analysis, but becomes highly significant using MTMM thanks to a very strong interaction effect (Table 2 and Fig. 2). These genes are excellent candidates, and were mentioned in the previous analysis of the NFBC1996 data31. Strikingly, they were also identified in a massive meta-analysis involving more than 100,000 individuals34, which furthermore reported opposite effects on TG and LDL, in agreement with the strong interaction effect we observe (Fig. 2g) using a sample of only 5,000 individuals.

Table 2.

SNPs detected in the analysis of LDL and TG using a genome-wide significance of 0.05

MTMM(p-valueα)
EMMAX(p-valueα)
SNP Position full test interaction common LDL TG
CELSR2 region, chromosome 1
rs611917 109616775 6.42 × 10-8 3.19 × 10-3 7.72 × 10-7 1.80 × 10-8 0.46
rs646776 109620053 2.48 × 10-15 1.42 × 10-6 3.28 × 10-11 3.92 × 10-15 0.77
APOB region, chromosome 2
rs10198175 20997364 6.32 × 10-7 0.02 1.33 × 10-6 9.48 × 10-8 0.29
rs3923037 21011755 6.39 × 10-9 0.13 2.64 × 10-9 2.72 × 10-7 7.17 × 10-7
rs6728178 21047434 9.57 × 10-10 0.11 4.37 × 10-10 7.95 × 10-8 1.81 × 10-7
rs6754295 21059688 1.31 × 10-9 0.14 4.97 × 10-10 7.10 × 10-8 4.12 × 10-7
rs676210 21085029 2.43 × 10-9 0.04 2.56 × 10-9 7.23 × 10-7 9.21 × 10-8
rs693 21085700 1.80 × 10-10 0.19 5.00 × 10-11 2.84 × 10-11 2.79 × 10-3
rs673548 21091049 1.63 × 10-9 0.04 1.85 × 10-9 5.97 × 10-7 6.43 × 10-8
rs1429974 21154275 4.85 × 10-7 0.02 1.06 × 10-6 7.69 × 10-8 0.24
rs754524 21165046 5.30 × 10-8 0.02 1.39 × 10-7 7.83 × 10-9 0.17
rs754523 21165196 4.51 × 10-7 0.02 1.01 × 10-6 7.15 × 10-8 0.24
GCKR region, chromosome 2
rs1260326 27584444 5.33 × 10-10 2.10 × 10-8 7.73 × 10-3 0.21 1.87 × 10-10
rs780094 27594741 5.98 × 10-9 4.22 × 10-8 0.01 0.44 3.15 × 10-9
LPL region, chromosome 8
rs10096633 19875201 2.42 × 10-8 2.04 × 10-8 0.06 0.97 1.93 × 10-8
FADS1 region, chromosome 11
rs174537 61309256 1.60 × 10-9 9.02 × 10-9 0.01 6.82 × 10-6 3.81 × 10-3
rs102275 61314379 8.79 × 10-10 6.20 × 10-9 4.86 × 10-3 4.13 × 10-6 3.82 × 10-3
rs174546 61326406 5.52 × 10-10 3.83 × 10-9 4.88 × 10-3 3.69 × 10-6 3.12 × 10-3
rs174556 61337211 2.56 × 10-9 4.43 × 10-8 1.93 × 10-3 2.03 × 10-6 0.01
rs1535 61354548 2.08 × 10-9 1.35 × 10-8 0.01 6.04 × 10-6 4.96 × 10-3
rs2072114 61361791 8.77 × 10-8 7.31 × 10-7 4.77 × 10-3 1.59 × 10-5 0.03
LDLR region, chromosome 19
rs11668477 11056030 3.16 × 10-8 0.15 1.18 × 10-8 3.89 × 10-9 0.02
rs2228671 11071912 7.20 × 10-8 5.30 × 10-4 4.87 × 10-6 4.47 × 10-8 0.96
α

P-values below the Bonferroni-corrected 5% cut-off of 1.5 × 10-7 are highlighted in red.

Figure 2.

Figure 2

GWAS of LDL and TG. (a–b) Manhattan plots for the marginal, single-trait analyses of LDL(a) and TG(b), respectively. (c–e) Manhattan plots for the joint MTMM analyses: (c) full model; (d) interaction effect, and; (e) common effect. The dashed horizontal line denotes the 5%Bonferroni adjusted genome-wide significance level. (f) Closeup of the FADS1-FADS2 region on chromosome 11. The points for the single-trait analyses are shown in light (TG) and dark blue (LDL), while the point for MTMM are shown in orange (full test), light green (interaction effect), and red (common effect). The gray shading denotes the FADS1 gene region. (g) Estimated phenotypic effect of the rs174546 SNP in light (TG) and dark blue (LDL).

The other five trait combinations gave similar results (Supplementary Tables 1-5). Almost all SNPs that were identified using single-trait analysis were also detected using MTMM, either due to a strong common or strong interaction effect. In addition, MTMM also detected two more regions that were not identified using marginal analyses. First, the gene PPP1R3B was identified due to strong common effects in the joint analyses of HDL and CRP, and HDL and TG. These pairs of phenotypes are negatively correlated (Table 1), so we expect MTMM to have increased power to detect common effects. The association with PPP1R3B was not found in previous analyses of these data31, but was reported (and confirmed) in the much larger meta-analysis of blood lipids34. Second, the TOM40-APOE region was identified in the joint analysis of TG and CRP, this time due to an interaction effect (TG and CRP are positively correlated). Albeit not quite significant, this association was noted in the previous analysis of these data31, and was also found in the meta-analysis34.

G × E in A. thaliana data

The other natural application for MTMM is when phenotypes are correlated because they represent the same trait measured in different environments. In such a setting, we are often directly interested in finding genes that are involved in the differential response to the environment, i.e., genotype-by-environment (G × E) interactions. We tested this application using a data set from A. thaliana in which flowering time was measured (for a global collection of naturally occurring inbred lines) in environmental control chambers for two simulated seasons (“Spring” and “Summer”) and two simulated locations (“Spain” and “Sweden”)32. Flowering time varies in a clinal manner, and is generally thought to be important in local adaptation. It is thus both natural and interesting to try identifying genes that are responsible for the differential flowering response to different environments32.

We analyzed the A. thaliana data using a full 2 × 2 factorial model, i.e., in addition to estimating the effect of genotype, season, and location, we have two pairwise interaction terms (see Online Methods and Supplementary Note). The results are summarized in Figure 3 (for details, see Supplementary Figures 8-9 and Supplementary Tables 6-7). Perhaps surprisingly, we found very few interaction effects. Out of a total of 41 significant SNP associations, only three appeared to be due to interactions. A rare allele (MAF = 4 % ) on chromosome 5 was identified as a significant genotype-by-season effect, but it does not correspond to any obvious candidate (Supplementary Fig. 10). A more convincing example was provided by the two tightly linked and perfectly correlated SNPs on chromosome 1. These were identified by comparing the full model to one without interaction terms, although the interaction with the simulated season seems to be strongest (Fig. 4). The minor allele (MAF = 3 %) is associated with delayed flowering (Fig. 4b), but the effects depends strongly on the season, and is much more pronounced in the (simulated) summer. Interestingly, both SNPs are in the coding region of the gene FRS6, which is known to be involved in the phyA-mediated response to far-red light35. T-DNA knockout lines of this locus have an early-flowering phenotype, the magnitude of which depends on day length (one of the factors that vary between the simulated seasons).

Figure 3.

Figure 3

Venn diagrams summarizing the GWAS of A. thaliana flowering data32. (a) Classification of the 41 significant SNPs according to the test(s) in which they were significant. (b) Classification of the 41 SNPs (black) and corresponding gene regions (red) according to whether they were found using marginal (MM) or joint (MTMM) analysis.

Figure 4.

Figure 4

Summary of FRS6 results. (a) Closeup of a 50kb region on chromosome 1 showing significant G _ E associations. The gene FRS6 is highlighted in gray. The results for the four marginal analysis (using a single trait MM) are shown in blue, while the MTMM results are shown in orange (full test), light green (three-way interaction), green (genotype by location), dark green (genotype by season), and red (common effects).(b) Phenotypic distribution as a function of experimental condition and genotype. (c–f) Plots contrasting the allelic effect in different comparisons: (c) the effect of the season in ‘Spain’; (d) the effect of the season in ‘Sweden’; (e) the effect of the location in ‘Spring’; (f) the effect of the location in ‘Summer’. The effect depends strongly on the season within each location (c–d) and less strongly on location with season (e–f).

Of the remaining 38 SNPs, 28 we found by both marginal and joint analysis (as common effects), and 10 were found only by marginal analysis. While our simulations would seem to suggest that MTMM should always have higher power than marginal test, even for detecting common or unique effects, this is clearly not always the case. The phenotypes analyzed here are extremely highly correlated as well as heritable (all coefficients typically well above 0.9; see Supplementary Table 6). In such cases, the advantage of increasing the sample size through joint analysis does not necessarily outweigh the cost of a more complex model with more degrees of freedom.

DISCUSSION

We have shown how the classical mixed model from breeding may be used for GWAS of correlated phenotypes in structured populations, often providing greater statistical power than marginal analyses. However, we emphasize that our approach is much more than an ad hoc method for increasing power. The model we use effectively dates back to Fisher36, and can be derived from basic genetic principles under the assumption that heritable phenotypic variation is due to very large numbers of genes of very small effect (Online Methods). Assuming that this is a reasonable approximation (and it seems to be, for a growing number of traits), we can disentangle genetic correlations from environmental correlations, whenever these uncorrelated. This allows us to address fundamental questions about the nature of variation. When applied to traits that may be biologically related the resulting variance component estimates allow us to assess the level of pleiotropy without estimating effects of individual loci. Using data from different human blood lipids, we demonstrated how the phenotype covariance can be decomposed into genetic and environmental terms, suggesting that most of these traits are indeed correlated due to shared genetics (i.e., they are pleiotropic or due to causal sites in linkage disequilibrium). A similar approach was recently used by Price et al.37 to assess the heritability of RNA expression levels within and across human cell tissues.

Irrespective of this, we also demonstrated increased power, detecting several interesting loci affecting human blood lipid level that were not significant in the single trait analysis, but that have all been replicated in GWAS studies using much larger sample sizes. This finding alone strongly argues for routine application of our method to correlated phenotypes.

As an example of how the method can be used to detect environmental interactions, we applied our method to an A. thaliana flowering time dataset, where the plants had been phenotyped under four different environmental conditions (in a classical 2times2 factorial design). These phenotypes are highly correlated as well as highly heritable, and the estimated variance components suggest that there is in fact very little difference between the environments at the genetic level (Supplementary Table 6). Hence, it is arguably not surprising that we detected little in terms of interaction effects. While it is of course possible that we simply do not have the power to detect interactions, it is notable that analogous studies in maize have also failed to detect large G × E interaction effects38. The result from A. thaliana and maize are strikingly different from what has been reported for mouse39, yeast40, and even humans4, but the reason for these differences are far from clear given the dramatically different study designs.

Full factorial designs with replicated genotypes are of course not possible in most organisms; however, we note that MTMM does not require this. Indeed, a mixed-model approach has previously been proposed for estimating G × E variance components in humans25 (using a special case of our model in which heritabilities are assumed to be equal across environments; see Online Methods). Either approach is directly applicable to human data.

Although we have focused on relatively simple pairwise correlations in this paper, it is easy to model more than two phenotypes using MTMM. Conceptually we believe that extending this to larger multi-trait experiments should allow for greater benefits in estimating error terms and elucidating functional relationships between suites of traits. However for such complex models, the computational complexity grows fast and the results become increasingly difficult to interpret compared to sequential two-trait analyses.

This is a well-known problem in statistics and quantitative genetics, but MTMM has the additional caveat that it assumes that the increasingly complex covariance structure, which is estimated in the absence of fixed effects, remains constant as these are added. Various intermediate approaches are possible, e.g. variance components might be estimated using a full model once, followed by GWAS using sub-models: more work in this area is clearly desirable.

Finally, when the phenotypes are not correlated, or if the correlation is not due to genetics (something that can be deduced from the variance component estimates), a single trait mixed model will generally have greater power to detect causal loci that are phenotype-specific.

When, precisely, this will be the case is hard to predict, however, we suggest using the MTMM approach as a complement to rather than replacement for marginal GWAS. The advantages are clear: it allows the detection of both interactions and pleiotropic loci in a rigorous statistical framework, while simultaneously accounting for population structure.

URLs

MTMM has been implemented in a set of R scripts (MTMM) for carrying out GWAS. They rely on the software ASREML41 for the estimation of the variance components. The scripts can be obtained at https://cynin.gmi.oeaw.ac.at/home/resources/MTMM.

ONLINE METHODS

Theory

Multiple traits mixed model

Following Henderson10 we can write the mixed model for the phenotypes of n individuals as

y=Xβ+g+e, (1)

where y is a vector of the n phenotype values. In this notation, the trait mean is included, together with other fixed effects, in the design matrix X. The β are the effect sizes of the fixed effects, gN(0,σg2K) is a random effect, and eN(0,σe2I). It follows that the covariance matrix for the trait values, y, is

var(y)=σg2K+σe2I, (2)

Where K is a n×n kinship or relatedness matrix. If we consider two traits, y1,y2 measured on the same set of individuals, then under the mixed model for the k ’th phenotype follows the partitions of the variance accordingly, i.e., var(yk)=σgk2K+σek2I. However, for the covariance matrix between the two phenotypes it is not obvious what the appropriate model is. Henderson42 suggests the following covariance model:

cov(y1,y2)=σg1σg2ρgK+σe1σe2ρeI, (3)

where ρg captures the genetic correlation between two phenotypes and the term ρe captures the correlation caused by shared environment and other non-genetic sources of correlations. We can generalize this for phenotypes which have been measured for different sets of individuals (see Supplementary Note).

Estimating the variance parameters

The estimation procedure for the variance components is described in the Supplementary Note.

Application to GWAS

Similar to EMMAX2 or P3D20 we estimate the covariance matrix only once to re-estimate a scalar in front of it for every marker. This fixes five degrees of freedom out of six in total (maximum number of variance components for two traits. For a pair of traits (the i ’th and j ’th traits), the proposed approximation effectively assumes that the three variance ratios (σgi/σei,σgj/σej and σgi/σgj) and the two correlations ρgij and ρeij are fixed with and without the marker in the model.

With multiple traits we can search for causal loci with common effects (across all traits) as well as trait specific loci or loci with opposite effects in different traits. Depending on what we are interested in, a GLS F-test can be constructed to compare two models. For two traits we can write the single marker model as follows:

y=[y1y2]=s1μ1+s2μ2+xβ+(x×s1)α+υ, (4)

where x is the marker and si is a vector with 1 for all values belonging to the i’th trait and 0 otherwise. The ψ ~ N(0,cov(y)) is a random variable capturing both the error and genetic random effects. Depending on what kind of loci we are interested in, we propose three different F-tests tests:

  • The full model tested against a null model where β = 0 and α = 0. This identifies both loci with common and differing effects in one model, but suffers in power from the extra degree of freedom.

  • To identify common genetic effects we propose to test the genetic model (α = 0) against a null model where β = 0 and α = 0.

  • Finally, to identify differing genetic effects between the traits we propose to test the full model against a null model where α = 0.

As both the interaction test and the common effect test are sensitive to scaling of the phenotype values, we propose to normalize them either by the total variance, or the genetic variance (as obtained in the marginal trait analysis). To minimize multiple-testing problems, one could, for example, carry out GWAS using the full model and use the other tests in post hoc analysis of significant loci.

Extending this model for arbitrary number of traits is straight-forward (one example for the analysis of four traits is described in the Supplementary Note). However, when there are more than two traits in the model, the number of possible tests grows quickly. An interesting special case is when there are several environmental variables in a factorial study design, in which case each environmental variable can be included in the model instead of the term i=1tsiμi, and their interactions with the genotype could replace the term x(i=1tsiα). This can result in a simpler and a more tractable model than if all possible combinations of environments were treated as independent.

Genotype-environment interactions

Given two measured phenotype vectors, y1and y2,Yang et al.25 include a G × E random effect in a mixed model as follows:

y=[y1y2]=Xβ+uG+uG×E+e, (5)

where uG,uG×E are random effects and have covariance matrices

cov([y1y2])=cov(uG)+cov(uG×E)+cov(e)=σG2[K11K21K12K22]+σG×E2[K1100K22]+I. (6)

Compared to the model proposed in equation (4), this model implicitly assumes two things: that there are no environmental correlations; and that the heritabilities are the same in each environment, i.e., h12=h22. As the individuals are different in each environment, the first assumption is appropriate. However, the second assumption is not guaranteed to hold in general, and we therefore propose relaxing it.

Simulations

10,000-locus model

We simulated 2,000 pairs of correlated phenotypes using a model under which the phenotypes consisted of one randomly chosen SNP with a “large” (additive) effect, accounting for up to 2% of the total phenotypic variance, and 10,000 randomly chosen SNPs with small additive effects. The effects sizes were drawn from a normal distribution and scaled to fix the trait heritability to 0.95. To ensure variation in trait correlations, all trait pairs shared a random fraction of the 10,000 causal loci, with the fraction drawn from a uniform distribution. The four phenotypic models were distinguished by different effect-correlations at the major locus (Fig. 1).

In addition, we simulated 5,000 pairs of correlated traits with environmental correlations. We fixed the heritability to 0.5 and allowed the genetic correlation to vary from -1 to 1. Additionally, we added a shared environmental term to the model, mimicking scenarios for both negative and positive environmental correlation.

20-locus model

We also simulated 1,000 pairs of correlated phenotypes using a 20-locus model. Each phenotype was determined by 20 loci, using a multinomial distribution, we randomly assigned to three categories with equal probabilities: i) SNPs with same effect in both phenotypes; ii) SNPs with opposite effect in the two phenotypes, and; iii) SNPs with effect in one trait only. The SNPs had additive effects drawn from an exponential distribution. Finally, the effect sizes were scaled to fix the heritabilities to 0.95. To obtain a single p-value for two traits the smaller of the two p-values for each SNP from the marginal mixed model analysis was retained.

Power calculations

For the calculation of the power and FDR, any significant SNP within 50 kb of a (or the) causal SNP was classified as a true positive; otherwise it was a false positive. The results are almost independent of the window size used (Supplementary Fig. 11). More important is the effect of the causal SNP(s). The nearly two-fold increase of power observed at a FDR of 0.1 in Figure 1 depends on the effect size of the simulated SNP (Supplementary Fig. 12). Throughout this paper, we used the single-analysis Bonferroni-corrected 5% significance threshold.

Human data

We used the 1966 North-Finland Birth Cohort (NFBC1966) which consist of phenotypic and genotypic data for 5,402 individuals31. Using the exact same dataset as was used in2, after their filtering the dataset consisted of 5,326 individuals and 331,475 SNPs. To expedite the mapping, the unknown genotypes (< 1% in the dataset) were imputed by replacing missing values with the average genotypic value. Neither the marginal MM analysis nor the MTMM tests show evidence of population structure confounding (Supplementary Fig. 13).

Analysis of A. thaliana data

The genotype data for A. thaliana consisted of 1,307 individuals genotyped at 214,051 SNPs using a custom Affymetrix SNP chip30. The phenotypes used were measurements of flowering time for 459 accessions32. Flowering time was measured in plants grown in four different environments, a factorial setting with two simulated seasons (“Spring” and “Summer”) and two simulated locations (“Spain” and “Sweden”). Analyzing the four phenotype vectors together we can derive five different F-tests (see Supplementary Note). Neither of these tests shows evidence of confounding due to population structure (Supplementary Fig. 14).

Supplementary Material

1

ACKNOWLEDGEMENTS

We thank the NFBC1966 Study Investigators for allowing us to use their phenotype and genotype data in our study. The NFBC1966 Study is conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with the Broad Institute, UCLA, University of Oulu, and the National Institute for Health and Welfare in Finland. This manuscript was not prepared in collaboration with investigators of the NFBC1966 Study and does not necessarily reflect the opinions or views of the NFBC1966 Study Investigators, the Broad Institute, UCLA, University of Oulu, National Institute for Health and Welfare in Finland or the NHLBI. We furthermore thank Nelson B. Freimer and Susan K. Service for their help in pre-processing the NFBC1966 data. We would also like to thank Petar Forai for excellent IT and cluster support at the GMI, the INRA MIGALE bioinformatics platform for further computational resources, and Jack Dekkers, Peter Donnelly, Eleazar Eskin, Christelle Niango,and Alkes Price for comments on the manuscript and/or helpful discussions. This work was supported by grants to MN from the United States National Institutes of Health (P50 HG002790) and the EU Framework Programme 7 (“TransPLANT”, grant agreement number 283496), as well as by grants from the “Deutsche Forschungsgemeinschaft” (DFG) (AK, KO4184/1-1) and the “Ecologie des Forêts, Prairies et milieux Aquatiques” (EFPA) department of INRA (VS).

Footnotes

AUTHOR CONTRIBUTIONS

All authors helped design the study. AK, BJV, and VS developed the theory and implemented the simulations. AK, BJV, and MN wrote the paper with input from all authors.

COMPETING FINANCIAL INTERESTS

The authors declare no competing financial interests.

References

  • 1.Platt A, Vilhjálmsson BJ, Nordborg M. Conditions under which genome-wide association studies will be positively misleading. Genetics. 2010;186:1045–52. doi: 10.1534/genetics.110.121665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kang HM, Sul JH, Service SK, Zaitlen NA, Kong S, Freimer NB, Sabatti C, Eskin E. Variance component model to account for sample structure in genome-wide association studies. Nature Genet. 2010;42:348–354. doi: 10.1038/ng.548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Price AL, Zaitlen NA, Reich D, Patterson N. New approaches to population stratification in genome-wide association studies. Nature Rev. Genet. 2010;11:459–463. doi: 10.1038/nrg2813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hamza TH, Chen H, Hill-Burns EM, Rhodes SL, Montimurro J, Kay DM, Tenesa A, Kusel VI, Sheehan P, Eaaswarkhanth M, Yearout D, Samii A, Roberts JW, Agarwal P, Bordelon Y, Park Y, Wang L, Gao J, Vance JM, Kendler KS, Bacanu S-A, Scott WK, Ritz B, Nutt J, Factor SA, Zabetian CP, Payami H. Genome-wide gene environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee. PLoS Genetics. 2011;7:e1002237. doi: 10.1371/journal.pgen.1002237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lynch M, Walsh B. Genetics and Analysis of QuantitativeTraits. Sinauer Associates, Inc.; Sunderland, Massachusetts: 1997. [Google Scholar]
  • 6.Jiang C, Zeng ZB. Multiple trait analysis of genetic mappingfor quantitative trait loci. Genetics. 1995;140(3):1111–27. doi: 10.1093/genetics/140.3.1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ferreira MA, Purcell SM. A multivariate test of association. Bioinformatics. 2009;25:132–3. doi: 10.1093/bioinformatics/btn563. [DOI] [PubMed] [Google Scholar]
  • 8.Zhang L, Pei YF, Li J, Papasian CJ, Deng HW. Univariate/multivariate genome-wide association scans using data from families and unrelated samples. PLoS One. 2009;4:e6502. doi: 10.1371/journal.pone.0006502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Knott SA, Haley CS. Multitrait least squares for quantitative trait loci detection. Genetics. 2000;156:899–911. doi: 10.1093/genetics/156.2.899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Henderson CR. Application of Linear Models in Animal Breeding. Unversity of Guelph; 1984. [Google Scholar]
  • 11.Thomas D. Gene–environment-wide association studies: emerging approaches. Nature Rev. Genet. 2010;11:259–272. doi: 10.1038/nrg2764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ober C, Vercelli D. Gene-environment interactions in human disease: nuisance or opportunity? Trends Genet. 2011;27:107–115. doi: 10.1016/j.tig.2010.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, Kresovich S, Buckler ES. A unified mixed model method for association mapping that accounts for multiple levels of relatedness. Nature Genet. 2006;38:203–8. doi: 10.1038/ng1702. [DOI] [PubMed] [Google Scholar]
  • 14.Atwell S, Huang YS, Vilhj'almsson BJ, Willems G, Horton M, Li Y, Meng D, Platt A, Tarone AM, Hu TT, Jiang R, Muliyati NW, Zhang X, Amer MA, Baxter I, Brachi B, Chory J, Dean C, Debieu M, de Meaux J, Ecker JR, Faure N, Kniskern JM, Jones JDG, Michael T, Nemri A, Roux F, Salt DE, Tang C, Todesco M, Traw MB, Weigel D, Marjoram P, Borevitz JO, Bergelson J, Nordborg M. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 2010;465:627–31. doi: 10.1038/nature08800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Huang X, Wei X, Sang T, Zhao Q, Feng Q, Zhao Y, Li C, Zhu C, Lu T, Zhang Z, Li M, Fan D, Guo Y, Wang A, Wang L, Deng L, Li W, Lu Y, Weng Q, Liu K, Huang T, Zhou T, Jing Y, Li W, Lin Z, Buckler ES, Qian Q, Zhang Q-F, Li J, Han B. Genome-wide association studies of 14 agronomic traits in rice landraces. Nature Genet. 2010;42:961–7. doi: 10.1038/ng.695. [DOI] [PubMed] [Google Scholar]
  • 16.Olsen HG, Hayes BJ, Kent MP, Nome T, Svendsen M, Larsgard AG, Lien S. Genome-wide association mapping in Norwegian Red cattle identifies quantitative trait loci for fertility and milk production on BTA12. Anim. Genet. 2011;42:466–74. doi: 10.1111/j.1365-2052.2011.02179.x. [DOI] [PubMed] [Google Scholar]
  • 17.Tian F, Bradbury PJ, Brown PJ, Hung H, Sun Q, Flint-Garcia S, Rocheford TR, McMullen MD, Holland JB, Buckler ES. Genome-wide association study of leaf architecture in the maize nested association mapping population. Nature Genet. 2011;43:159–162. doi: 10.1038/ng.746. [DOI] [PubMed] [Google Scholar]
  • 18.Zhao K, Aranzana MJ, Kim S, Lister C, Shindo C, Tang C, Toomajian C, Zheng H, Dean C, Marjoram P, Nordborg M. An Arabidopsis example of association mapping instructured samples. PLoS Genet. 2007;3:e4. doi: 10.1371/journal.pgen.0030004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kang HM, Zaitlen NA, Wade CM, Kirby A, Heckerman D, Daly MJ, Eskin E. Efficient control of population structure in model organism association mapping. Genetics. 2008;178:1709–23. doi: 10.1534/genetics.107.080101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang Z, Ersoz E, Lai CQ, Todhunter RJ, Tiwari HK, Gore MA, Bradbury PJ, Yu J, Arnett DK, Ordovas JM, Buckler ES. Mixed linear model approach adapted for genome-wide association studies. Nature Genet. 2010;42:355–60. doi: 10.1038/ng.546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Idaghdour Y, Czika W, Shianna KV, Lee SH, Visscher PM, Martin HC, Miclaus K, Jadallah SJ, Goldstein DB, Wolfinger RD, Gibson G. Geographical genomics of human leukocyte gene expression variation in southern Morocco. Nat Genet. 2010;42:62–7. doi: 10.1038/ng.495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.International Multiple Sclerosis Genetics Consortium and WellcomeTrust Case Control Consortium 2 Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature. 2011;476(7359):214–219. doi: 10.1038/nature10251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Stich B, Piepho HP, Schulz B, Melchinger AE. Multitraitassociation mapping in sugar beet (Beta vulgaris L.). TheorAppl Genet. 2008;117:947–54. doi: 10.1007/s00122-008-0834-z. [DOI] [PubMed] [Google Scholar]
  • 24.Lee SH, Wray NR, Goddard ME, Visscher PM. Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet. 2011;88:294–305. doi: 10.1016/j.ajhg.2011.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Yang J, Lee SH, Goddard ME, Visscher PM. GCTA:A tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Lee SH, Decandia TR, Ripke S, Yang J, Sullivan PF, Goddard ME, Keller MC, Visscher PM, Wray NR. Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs. Nat Genet. 2012;44:247–250. doi: 10.1038/ng.1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Deary IJ, Yang J, Davies G, Harris SE, Tenesa A, Liewald D, Luciano M, Lopez LM, Gow AJ, Corley J, Redmond P, Fox HC, Rowe SJ, Haggarty P, McNeill G, Goddard ME, Porteous DJ, Whalley LJ, Starr JM, Visscher PM. Genetic contributions to stability and change in intelligence from childhood to old age. Nature. 2012;482:212–5. doi: 10.1038/nature10781. [DOI] [PubMed] [Google Scholar]
  • 28.Kim S, Xing EP. Statistical estimation of correlated genome associations to a quantitative trait network. PLoS Genet. 2009;5:e1000587. doi: 10.1371/journal.pgen.1000587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Manning AK, LaValley M, Liu CT, Rice K, An P, Liu Y, Miljkovic I, Rasmussen-Torvik L, Harris TB, Province MA, Borecki IB, Florez JC, Meigs JB, Cupples LA, Dupuis J. Meta-analysis of gene-environment interaction:joint estimation of SNP and SNP x environment regression coefficients. Genet Epidemiol. 2011;35:11–8. doi: 10.1002/gepi.20546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Horton MW, Hancock AM, Huang YS, Toomajian C, Atwell S, Auton A, Muliyati NW, Platt A, Sperone FG, Vilhjálmsson BJ, Nordborg M, Borevitz JO, Bergelson J. Genome-wide patterns of genetic variation in worldwide Arabidopsis thaliana accessions from the RegMap panel. NatureGenet. 2012;44:212–6. doi: 10.1038/ng.1042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Sabatti C, Service SK, Hartikainen A-L, Pouta A, Ripatti S, Brodsky J, Jones CG, Zaitlen NA, Varilo T, Kaakinen M, Sovio U, Ruokonen A, Laitinen J, Jakkula E, Coin L, Hoggart C, Collins A, Turunen H, Gabriel S, Elliot P, McCarthy MI, Daly MJ, Järvelin M-R, Freimer NB, Peltonen L. Genome-wide association analysis of metabolic traits in a birth cohort from a founder population. Nat Genet. 2009;41:35–46. doi: 10.1038/ng.271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Li Y, Huang Y, Bergelson J, Nordborg M, Borevitz JO. Association mapping of local climate-sensitive quantitative traitloci in Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA. 2010;107:21199–204. doi: 10.1073/pnas.1007431107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Kathiresan S, Manning AK, Demissie S, D'Agostino RB, Surti A, Guiducci C, Gianniny L, Burtt NP, Melander O, Orho-Melander M, Arnett DK, Peloso GM, Ordovas JM, Cupples LA. A genome-wide association study for bloodlipid phenotypes in the Framingham Heart Study. BMC MedGenet. 2007;8(Suppl 1):S17. doi: 10.1186/1471-2350-8-S1-S17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, Pirruccello JP, Ripatti S, Chasman DI, Willer CJ, Johansen CT, Fouchier SW, Isaacs A, Peloso GM, Barbalic M, Ricketts SL, Bis JC, Aulchenko YS, Thorleifsson G, Feitosa MF, Chambers J, Orho-Melander M, Melander O, Johnson T, Li X, Guo X, Li M, Shin Cho Y, Jin Go M, Jin Kim Y, Lee JY, Park T, Kim K, Sim X, Twee-Hee Ong R, Croteau-Chonka DC, Lange LA, Smith JD, Song K, Hua Zhao J, Yuan X, Luan J, Lamina C, Ziegler A, Zhang W, Zee RY, Wright AF, Witteman JC, Wilson JF, Willemsen G, Wichmann HE, Whitfield JB, Waterworth DM, Wareham NJ, Waeber G, Vollenweider P, Voight BF, Vitart V, Uitterlinden AG, Uda M, Tuomilehto J, Thompson JR, Tanaka T, Surakka I, Stringham HM, Spector TD, Soranzo N, Smit JH, Sinisalo J, Silander K, Sijbrands EJ, Scuteri A, Scott J, Schlessinger D, Sanna S, Salomaa V, Saharinen J, Sabatti C, Ruokonen A, Rudan I, Rose LM, Roberts R, Rieder M, Psaty BM, Pramstaller PP, Pichler I, Perola M, Penninx BW, Pedersen NL, Pattaro C, Parker AN, Pare G, Oostra BA, O'Donnell CJ, Nieminen MS, Nickerson DA, Montgomery GW, Meitinger T, McPherson R, McCarthy MI, et al. Biological, clinical and population relevance of 95loci for blood lipids. Nature. 2010;466:707–13. doi: 10.1038/nature09270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Lin R, Wang H. Arabidopsis FHY3/FAR1 gene family anddistinct roles of its members in light control of Arabidopsis development. Plant Physiol. 2004;136:4010–22. doi: 10.1104/pp.104.052191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Fisher R. The correlation between relatives on the supposition of Mendelian inheritance. Transactions of the Royal Society ofEdinburgh. 1918;52:399–433. [Google Scholar]
  • 37.Price AL, Helgason A, Thorleifsson G, McCarroll SA, Kong A, Stefansson K. Single-tissue and cross-tissue heritability of gene expression via identity-by-descent in related or unrelated individuals. PLoS Genet. 2011;7:e1001317. doi: 10.1371/journal.pgen.1001317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Buckler ES, Holland JB, Bradbury PJ, Acharya CB, Brown PJ, Browne C, Ersoz E, Flint-Garcia S, Garcia A, Glaubitz JC, Goodman MM, Harjes C, Guill K, Kroon DE, Larsson S, Lepak NK, Li H, Mitchell SE, Pressoir G, Peiffer JA, Rosas MO, Rocheford TR, Romay MC, Romero S, Salvo S, Sanchez Villeda H, da Silva HS, Sun Q, Tian F, Upadyayula N, Ware D, Yates H, Yu J, Zhang Z, Kresovich S, McMullen MD. The genetic architecture of maize flowering time. Science. 2009;325:714–8. doi: 10.1126/science.1174276. [DOI] [PubMed] [Google Scholar]
  • 39.Valdar W, Solberg LC, Gauguier D, Cookson WO, Rawlins JNP, Mott R, Flint J. Genetic and environmentaleffects on complex traits in mice. Genetics. 2006;174:959–984. doi: 10.1534/genetics.106.060004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Smith E, Kruglyak L. Gene-environment interaction in yeast gene expression. PLoS Biology. 2008;6:e83. doi: 10.1371/journal.pbio.0060083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Gilmour A, Gogel B, Cullis B, Welham SJ, Thompson R. ASReml User Guide Release 1.0. 2002 [Google Scholar]
  • 42.Henderson C, Quaas RL. Multiple trait evaluation using relatives’ records. Journal of Animal Science. 1976;43:1188–1197. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES