Skip to main content
Genetics logoLink to Genetics
. 2015 Jan 27;199(4):1255–1269. doi: 10.1534/genetics.114.173070

Heritable Environmental Variance Causes Nonlinear Relationships Between Traits: Application to Birth Weight and Stillbirth of Pigs

Herman A Mulder *,1, William G Hill , Egbert F Knol
PMCID: PMC4391567  PMID: 25631318

Abstract

There is recent evidence from laboratory experiments and analysis of livestock populations that not only the phenotype itself, but also its environmental variance, is under genetic control. Little is known about the relationships between the environmental variance of one trait and mean levels of other traits, however. A genetic covariance between these is expected to lead to nonlinearity between them, for example between birth weight and survival of piglets, where animals of extreme weights have lower survival. The objectives were to derive this nonlinear relationship analytically using multiple regression and apply it to data on piglet birth weight and survival. This study provides a framework to study such nonlinear relationships caused by genetic covariance of environmental variance of one trait and the mean of the other. It is shown that positions of phenotypic and genetic optima may differ and that genetic relationships are likely to be more curvilinear than phenotypic relationships, dependent mainly on the environmental correlation between these traits. Genetic correlations may change if the population means change relative to the optimal phenotypes. Data of piglet birth weight and survival show that the presence of nonlinearity can be partly explained by the genetic covariance between environmental variance of birth weight and survival. The framework developed can be used to assess effects of artificial and natural selection on means and variances of traits and the statistical method presented can be used to estimate trade-offs between environmental variance of one trait and mean levels of others.

Keywords: genetics of environmental variance, optimum trait, fitness, complex traits, mass selection


IN the classic model of quantitative genetics, the phenotype is assumed to be the sum of an effectively infinite number of genes and environmental effects; i.e., P=G+E (Falconer and Mackay 1996; Lynch and Walsh 1998). The phenotypic variance is the sum of the genetic (VG) and environmental (VE) variance. It is assumed that the environmental variance is homogeneous across genotypes, but recently there is evidence from laboratory experiments and analysis of livestock populations that not only the phenotype itself, but also its VE, is under genetic control (Hill and Mulder 2010). This might be expressed as genetic heterogeneity in the within-individual variance of a trait with repeat measurements, such as weight of individual piglets in a litter or total litter weight across parities, or as a difference in within-family variance for traits such as juvenile body weight expressed once.

The median genetic coefficient of variation for VE (GCVVE=σAv/VE, where σAv is additive genetic variance in VE) is ∼0.3 based on a review of 14 studies with estimates of genetic variance in VE primarily on livestock populations (Hill and Mulder 2010). This indicates that VE could be increased or decreased by 30% if changed by one genetic standard deviation. These analyses were mainly based on genetic analysis of pedigreed livestock populations; but with 300 isofemale lines of Drosophila melanogaster, thereby avoiding possible confounding by within-line genetic heterogeneity, Mackay and Lyman (2005) showed a significant genetic variance in CV and VE. In a selection experiment in rabbits, selection on estimated breeding value for VE of birth weight successfully changed the within-litter variability of birth weight (Garreau et al. 2008). There are more recent data on genetic control of VE at the level of individual loci from genome-wide association studies (see review by Geiler-Samerotte et al. 2013). Those in experimental species are very clear cut: for example, Jimenez-Gomez et al. (2011) and Nelson et al. (2013) located QTL for phenotypic variability in gene expression in Arabidopsis and yeast, respectively, and Shen et al. (2012) identified regions controlling the phenotypic variability of traits in Arabidopsis. In vertebrates, the FTO genotype was shown to be associated with phenotypic variability in body mass index in humans, with the two homozygotes differing 7% in standard deviation (SD) (Yang et al. 2012). In laying hens, a QTL region on chromosome 4 was identified that explained 30% and 16% of the genetic variance in mean and within-individual SD, respectively, of egg weight (Wolc et al. 2012); and in dairy cattle the phenotypic variability of somatic cell score, an indicator for mastitis, was shown to be determined by many loci, while that with the largest effect explained ∼3% of the genetic variance in VE (Mulder et al. 2013a). It has been argued that findings such as those of Yang et al. (2012) can also be explained in terms of a scale effect (Sun et al. 2013), but the extreme transformations required are not biologically plausible (Shen and Rönnegård 2013).

In agriculture, there is interest in reducing phenotypic variance to improve uniformity of animals and animal products. For instance, penalties are imposed by slaughterhouses for meat animals delivered with excessive variability because uniform products are demanded by retailers and packing efficiency of products is higher (Hennessy 2005). It has been shown both in theory (SanCristobal-Gaudy et al. 1998; Sorensen and Waagepetersen 2003; Mulder et al. 2007, 2008) and from experiment (Garreau et al. 2008) that uniformity can be increased by selection.

Before implementing selection to reduce VE in livestock breeding programs it is necessary to know what correlated changes in means of the target trait and other traits of interest would be expected. A particular concern is that animals with small VE would not have the capacity to react to unpredictable environments, i.e., have reduced plasticity. There is some evidence that plasticity and VE are positively correlated, i.e., that genotypes with higher VE have higher plasticity (Mulder et al. 2013b; Tonsor et al. 2013). Whether plastic genotypes with high VE or stable genotypes with small VE are better capable of handling unpredictable environments is under debate. In livestock little is known about trade-offs between VE and means of other traits, except that uniformity of birth weight in litter-bearing animals does increase the survival of the offspring, e.g., in pigs (Kapell et al. 2011) and rabbits (Garreau et al. 2008).

Genetic variation in VE of a trait has been shown analytically to lead to a nonlinear relationship between the breeding value of VE and the phenotypic deviation from the mean (Mulder et al. 2007). Therefore, if there is a genetic correlation between the mean of one trait and VE of another, which we term an M-V genetic correlation between traits, the phenotypic relationship between them is expected to be nonlinear. Nonlinear parent–offspring regression and nonlinear regression of one trait on another within half-sib groups have been employed to explore nonlinearity between traits by Sölkner and Fuerst-Waltl (1996) and Fuerst-Waltl et al. (1997, 1998), who assumed this was caused by nonlinearity between additive genetic effects of different traits. They also showed that phenotypic relationships were often more linear than the genetic relationships due to dilution with environmental covariance. Nonlinear relationships might also be expected when one of the traits, for example fitness, has an optimum value (Lande and Arnold 1983; Falconer and Mackay 1996; Kingsolver et al. 2001, 2012). Such nonlinear relationships are often observed or hypothesized in natural populations, typically as “fitness profiles” between trait values and fitness (e.g., Falconer and Mackay 1996) that show whether phenotypes on one side of the distribution (directional), in the middle (stabilizing), or at extremes (disruptive) have highest fitness. If there is an optimum phenotype, a maximum would imply a negative covariance between VE and fitness; i.e., families with small variance would have selective advantage if there is stabilizing selection, and correspondingly families with a large variance would have an advantage if there is disruptive or strong directional selection (Hill and Zhang 2004; Mulder et al. 2007). Examples of nonlinear trait relationships include that between birth weight and survival of infants in humans in historic data when medical care was less optimized (Karn and Penrose 1951 in Schluter 1988), survival and size in house sparrows (Schluter and Smith 1986; Schluter 1988), and between litter size and birth weight in piglets (Varona and Sorensen 2014). These nonlinear relationships might be due in part to a nonnormality of at least one of the traits such as survival (binomial). To our knowledge no studies have yet been undertaken of the degree to which nonlinearity between traits is caused by an M-V genetic correlation between them. It is possible that low VE may yield better fitness in some environments (a conservative bet-hedging strategy) or high VE in others (a coin-flipping strategy) (Olofsson et al. 2009). Therefore, while insight into the nonlinear relationships between traits should increase understanding of the existence and if so of the position of phenotypic and genetic optima, no framework is available to study the role of M-V genetic correlations between traits and the consequent effects of selection.

The objectives this article are to (1) derive mathematically nonlinear relationships between traits when one exhibits genetic variance in VE, (2) predict responses to selection in mean and variance of both traits, (3) predict genetic and phenotypic optima, (4) predict changes in genetic parameters consequent on selection shifting the population mean toward the fitness optimum, and (5) investigate the relationship between piglet birth weight and survival in pigs as a practical application of the theory. We assumed multivariate normality for deriving the expressions, whereas survival is binomially distributed. Therefore, we extended the predictions toward assuming that survival has an underlying normal distribution and predictions can be transformed to the observed binomial scale.

Quantitative Genetic Model

Consider two traits for which the phenotypic deviations from the mean are the sum of additive genetic and environmental effects, with no dominance or epistasis (Falconer and Mackay 1996). The first trait, with phenotypic value P1, is also subject to additive genetic differences in its environmental variance, so-called genetic heterogeneity of environmental variance (SanCristobal-Gaudy et al. 1998), whereas the second trait (phenotypic value P2) has homogeneous environmental variance. Here we assume the exponential model for environmental variance (SanCristobal-Gaudy et al. 1998; Hill and Mulder 2010), because of its statistical tractability, as the environmental variance cannot be negative and the model appears linear on the log scale. Different models to model genetic differences in environmental variance appear similar when genetic variance in VE is small. Further discussion and comparison of models for genetic analysis of the environmental variance are given elsewhere (Hill and Mulder 2010). The genetic model is

P1=μ1+Am,1+E1=μ1+Am,1+χexp[12(ln(σe12)+Av,1)] (1)
P2=μ2+Am,2+E2, (2)

where μi is the population mean, Am,i is the breeding value, and Ei is the environmental deviation for trait i. The environmental variance for trait 2 is assumed constant, σe22, whereas σe12 is the expected environmental variance for trait 1 when Av,1=0, but varies among individuals according to their breeding value Av,1. The standard normal deviate χ is the standardized environmental deviation for trait 1. The breeding values Am,1, Av,1, and Am,2 are assumed to be the sum of effects of an infinite number of additive loci and are multivariate normally distributed,

[Am,1Av,1Am,2]N(000,[σAm,12covAm,1,Av,1covAm,1,Am,2σAv,12covAv,1,Am,2symmetricσAm,22]A),

where A is the additive genetic or numerator relationship matrix; σAm,12, σAv,12, and σAm,22 are the additive genetic variances in Am,1, Av,1, and Am,2; and covAm,1,Av,1, covAm,1,Am,2, and covAv,1,Am,2 are the corresponding additive genetic covariances. The environmental effects E1(=χexp[(1/2)(ln(σe12)+Av,1)]) and E2 are bivariate normally distributed,

[E1E2]N(00,[σe12exp(Av,1)reσe1exp(12Av,1)σe2symmetricσe22]),

where σe22 is the environmental variance of trait 2 and re is the environmental correlation.

Methodological Framework for Nonlinear Relationships

Multiple-regression framework

We used a multiple-regression framework to predict the nonlinear relationship between traits caused by M-V genetic correlations between them and to predict responses in mean and variance due to selection, following the methodology of Mulder et al. (2007). In Mulder et al. (2007), breeding values and selection responses for mean and variance were predicted using linear, quadratic, and cubic regression on the phenotypic deviation of an individual phenotype. This methodology formally requires multivariate normality, which holds only approximately because genetic variance in VE for P1 causes small deviations from normality (Mulder et al. 2007). We also used selection index theory (Hazel 1943), which is essentially an application of multiple regression assuming that fixed effects are known without error.

Monte Carlo simulation

To check the accuracy of derived relationships and predicted responses to selection, we simulated 1000 replicates each with 1 million animals according to the genetic models in Equations 1 and 2 and calculated the expected value of trait 2 given trait 1 by calculating the average P2 given P1 within 601 successive intervals of 0.01σP,1 between −3 and 3σP,1. Selection responses to directional mass selection on P1 and P2 were calculated as the mean Am,1, Av,1, and Am,2 of all selected animals with Px in each replicate and averaged over replicates, where x is the truncation point.

Deriving the phenotypic relationship between P2 and P1

The phenotypic relationship between P2 and P1 was derived as the multiple regression of P2 on P1 and P12, because genetic variance in VE causes genetic variance in P12 (Mulder et al. 2007). The regression can be written as

E(P2|P1,P12)=b1P1+b2P12, (3)

where

[b1b2]=P1c, (4)
P=[var(P1)cov(P1,P12)cov(P1,P12)var(P12)],

and

c=[cov(P2,P1)cov(P2,P12)].

The elements of P and c were derived using the higher-order moments of the normal distribution (Stuart and Ord 1994) and using the laws of total (co)variance and standard variance–covariance rules (see Appendix A for full derivations),

var(P1)=σAm,12+σe12exp(12σAv,12)=σP12 (5)
cov(P1,P12)=3covAm,1,Av,1σe12exp(12σAv,12) (6)
var(P12)=2σP14+6covAm,1,Av,12σe12exp(12σAv,12)+3σe14exp(σAv,12)(exp(σAv,12)1) (7)
cov(P2,P1)=covAm,1,Am,2+cove1e2 (8)
cov(P2,P12)= covAv,1,Am,2σe12exp(12σAv,12)+cove1e2covAm,1,Av,1, (9)

where cove1e2=reσe1exp((1/4)σAv,12)σe2. Note that Equation 3 is about the relationship between single observations of P2 and P1. In Equation 7, the two rightmost terms are related to the skewness (Equation 6) and kurtosis of the distribution, respectively. All elements of P and c (equal to Equations 59) were checked using Monte Carlo simulation. The multiple-regression prediction of P2 from P1 shows very good agreement with the Monte Carlo simulation (Figure 1 and Figure 2), with R2 > 0.995 for all cases evaluated. Appendix C shows a natural extension to multiple random effects, such as permanent and common environmental effects.

Figure 1.

Figure 1

(A and B) The relationship between P2 and P1 either assuming no genetic correlation (rAv,1,Am,2=0) between environmental variance in P1 and the mean of P2 using Monte Carlo simulation (MC_lin) and multiple regression (MR_lin) or assuming a negative genetic correlation (rAv,1,Am,2=0.5) between them (MC_nonlin, MR_nonlin) when rAm,1,Am,2=0.1 (A) or when rAm,1,Am,2=0.5 (B) (σAm,12=σAm,22=0.3,σAv,12=0.05,σe12exp((1/2)σAv,12)=0.7, σe22=0.7, rAm,1,Am,2=0.1, rAm,1,Av,1=0, re=0).

Figure 2.

Figure 2

(A and B) The relationship between P2 and P1, using Monte Carlo simulation (MC) and multiple regression (MR) either with positive genetic correlation (A: rAv,1,Am,2=0.5,re=0) between environmental variance in P1 and the mean of P2 or with negative genetic correlation (B: when rAv,1,Am,2=0.5,re=0.1) (σAm,12=σAm,22=0.3,σAv,12=0.05,σe12exp((1/2)σAv,12)=0.7, σe22=0.7, rAm,1,Am,2=0.1, rAm,1,Av,1=0).

Deriving response to mass selection

Deterministic predictions:

Response to selection (R) is predicted as the mean deviation of offspring from the population mean regressed on the selection differential of the parents R=bS, which is equal to the regression of the breeding value (A) on the selection differential (S) ΔA=bS (Falconer and Mackay 1996). We showed previously that response to selection in Am,1 and Av,1 could be predicted for different kinds of mass selection, e.g., directional, stabilizing, or disruptive (Mulder et al. 2007). Here we extend this framework to the correlated responses in (1) P2 from mass selection on P1 and (2) Am,1 and Av,1 from mass selection on P2. In both cases, the general equation to predict selection responses in

Δa=[ΔAm,1ΔAv,1ΔAm,2]

in terms of selection differentials (Δx) is

Δa=P−1GΔx=BΔx, (10)

where P is the variance–covariance matrix among selection differentials in Δx, because selection differentials are random variables under the quantitative genetic model assumed; G is the covariance matrix between Δa and Δx; and B is a matrix of regression coefficients. First, if mass selection on P1 is performed, then the selection differentials on P1 and P12 are

Δx=[P1,s¯μ1P1,s2¯σP12]=[P1,s¯E(P1) (P1,sE(P1))2¯E{(P1E(P1))2}]. (11)

If it is assumed that μ1=0,

[ΔAm,1ΔAv,1ΔAm,2]=BP1[P1,s¯P1,s2¯σP12]=BP1[iσP1ixσP12], (12)

where BP1 is the transpose of BP1=P1GP1, P is the same as in Equation 3, and

GP1=[σAm,12covAm,1,Av,1covAm,1Am,2covAm,1,Av,1σe12exp(12σAv,12)σAv,12σe12exp(12σAv,12)covAv,1Am,2σe12exp(12σAv,12)],

i.e., the covariance matrix between

[P1,s¯P1,s2¯σP12]and[ΔAm,1ΔAv,1ΔAm,2],

i is the selection intensity when selecting the best p animals, and x is the truncation point corresponding to p based on a normal distribution (Mulder et al. 2007).

Second, if mass selection is on P2, Equation 10 becomes

[ΔAm,1ΔAv,1ΔAm,2]=bP2[P2,s¯]=bP2iσP2 (13)

where bP2=var(P2)1gP2 and gP2=[covAm,1Am,2covAv,1Am,2σe12exp((1/2)σAv,12)σAm,22], the covariance vector between P2,s¯ and

[ΔAm,1ΔAv,1ΔAm,2].

Evaluation of predictions:

Genetic selection differentials in Am,1, Av,1, and Am,2 with directional mass selection on either P1 (Equation 12) or P2 (Equation 13) were shown by Monte Carlo simulation to be very accurately predicted, with differences only at the third decimal (Table 1). Correlated responses in both Av,1 and Am,2 are expected following mass selection on P1, even if genetic correlations between Am,1 and Av,1 or Am,2 are zero, provided that the genetic correlation between Av,1 and Am,2 is nonzero. This is because, with strong directional selection (e.g., P ≥ 20%), animals with higher Av,1 (i.e., environmental variance in P1) have a higher probability of selection (Hill and Zhang 2004; Mulder et al. 2007). In other words, with directional selection genotypes with a higher variance are more likely to be parents of the next generation than genotypes with a low variance.

Table 1. Genetic selection differentials in Am,1, Av,1, and Am,2 when selecting on P1 or on P2 for different selected proportions (p) and when rAm,1,Av,1 and rAm,1,Am,2 are the same (0 or 0.5).
Am,1 Av,1 Am,2
Selection rAm,1,Av,1=rAm,1,Am,2 p MR MC MR MC MR MC
P1 0 0.80 0.105 0.105 −0.005 −0.005 0.006 0.006
0.50 0.239 0.242 0.000 0.000 0.000 0.000
0.20 0.420 0.421 0.020 0.020 −0.024 −0.025
0.5 0.80 0.104 0.103 0.017 0.017 0.063 0.061
0.50 0.239 0.241 0.048 0.049 0.123 0.121
0.20 0.422 0.430 0.099 0.102 0.180 0.179
P2 0 0.80 0.000 0.000 −0.021 −0.021 0.105 0.105
0.50 0.000 0.000 −0.049 −0.049 0.239 0.239
0.20 0.000 0.000 −0.086 −0.086 0.420 0.420
0.5 0.80 0.052 0.052 −0.021 −0.021 0.105 0.105
0.50 0.120 0.120 −0.049 −0.049 0.239 0.239
0.20 0.210 0.210 −0.086 −0.086 0.420 0.420

Values were obtained analytically [multiple regression (MR): Equations 12 and 13] and with Monte Carlo simulation (MC) (σAm,12=σAm,22=0.3, σAv,12=0.05,σe12exp((1/2)σAv,12)=0.7, σe22=0.7, rAv,1,Am,2=0.5, re=0).

Response to selection on estimated breeding value

In animal breeding, selection is often on predicted breeding values rather than on raw phenotypes. For instance, if the breeding goal is to increase trait 2, then the predicted breeding value A^m,2 using information of P1 and P12 is

A^m,2=ba2[P1μ1P12σP12], (14)

where ba2=P−1ga2, P is given in Equation 3, and

ga2=[covAm,1,Am,2covAv,1,Am,2σe12exp(12σAv,12)].

The expected response to selection would be equal to that using the generalized form of the breeders equation (Hazel 1943):

ΔAm,2=iba2ga2ba2P−1ba2. (15)

An alternative to using squared phenotypic deviations is to use squared residuals of P1 corrected for its leverage (l), which is the diagonal element of the “hat matrix” for each observation (Hoaglin and Welsh 1978) as used in the double-hierarchical generalized linear model (DHGLM) (Rönnegård et al. 2010):

e12(1l)=[((1bAm,1)P1)2][1l]. (16)

In the absence of any fixed effects, 1l=1bAm,1, Equation 16 reduces to e12/(1l)=(1bAm,1)P12 and

A^m,2=ba2e[P1e12/(1l)], (17)

where ba2e=Pe1ga2e,

Pe=[var(P1)cov(P1,e12/(1l))cov(P1,e12/(1l))var(e12/(1l))]=WPW,

and ga2e=Wga2, where

W=[100(1bAm,1)]

(see derivation in Appendix B). Use of either P12 or e12/(1l) would yield identical accuracy of prediction A^m,2 and of response to selection ΔAm,2 in this case where only their own phenotypes provide information and there are no fixed effects; otherwise squared residuals corrected for leverages are expected to give higher accuracy and response than using squared phenotypic deviations.

Determining whether the parabola have a maximum or a minimum

The multiple-regression Equation 3 is quadratic and therefore has a parabolic shape. Such quadratic relationships may indicate a biological optimum, for instance birth weight and survival in humans prior to modern medical intervention (Karn and Penrose 1951 in Schluter 1988). Expressions for b1 and b2 (from Equation 3) are given by

b1=[var(P12)c1cov(P1,P12)c2]det(P) (18)
b2=[cov(P1,P12)c1+var(P1)c2]det(P), (19)

where c1=cov(P2,P1) (Equation 8) and c2=cov(P2,P12) (Equation 9) and P is shown below Equation 4. Whether there is a maximum or a minimum depends on the sign of the quadratic regression coefficient (b2), i.e., P12. If b2>0, it has a minimum and if b2<0, it has a maximum. The sign of the regression coefficient depends on multiple parameters (Equations 18 and 19). As Figure 3 shows, b2 is linear in rAv,1,Am,2 because it is a linear function of c2 (Equation 19) and c2 is the genetic covariance between Av,1 and Am,2. When rAv,1,Am,2=0 and other correlations are zero, b2=0, i.e., when the straight-line relationship between P2 and P1 passes through (0, 0). If other correlations are nonzero, b2=0 shifts to a point dependent mainly on rAv,1,Am,2, with other parameters having less influence on b2. In summary, a moderate to strong negative M-V genetic correlation between traits (rAv,1,Am,2) yields a parabola with a maximum, e.g., animals with phenotypes close to the optimum have maximum fitness, a fitness optimum; while a moderate to strong positive genetic correlation yields one with a minimum; e.g., animals with phenotypes largely deviating from the minimum have the highest fitness.

Figure 3.

Figure 3

The quadratic regression coefficient b2 as a function of the genetic correlation rAv,1,Am,2 for different values of rAm,1,Av,1 and rAm,1,Am,2 (σAm,12=σAm,22=0.3,σAv,12=0.05,σe12exp((1/2)σAv,12)=0.7, σe22=0.7, re=0).

Similarly, Equation 14 shows whether the genetic parabola has a maximum or a minimum. The regression coefficients now are

ba21=[var(P12)ga21cov(P1,P12)ga22]det(P) (20)
ba22=[cov(P1,P12)ga21+σP12ga22]det(P), (21)

where ga21 and ga22 are the elements of ga2 (just below Equation 14). Equations 20 and 21 differ from Equations 18 and 19 only in ga21 and ga22. If re=0, then ga21=c1 and ga22=c2 and consequently ba21=b1 and ba22=b2.

Can phenotypic and genetic parabolas have different shapes?

The interest here is in whether there are conditions when the phenotypic parabola has a maximum and the genetic parabola a minimum or vice versa. Because re affects only the phenotypic parabola, it determines the relative shapes of the parabolas, and so it is helpful to calculate the magnitude of re when b2=0 and switches to a different sign than that of ba22. The most obvious situation is when the environmental correlation re and the genetic correlation rAm,1,Am,2 have opposite signs, but b2=0 only when covAm,1,Av,10. Calculations show that, for the parabolas to be of different shape, re has to be strong, in which case the regression is almost linear. So in practice, parabolas of opposite sign are likely to occur only rarely, if at all.

Determining the phenotypic and genetic stationary points

The maximum (minimum) phenotype P2 can be found at the stationary point of P1 (P1stat). The stationary point is mathematically the point where the first derivative is zero, e.g., a minimum or maximum. The stationary point of the quadratic relationship of P2 as a function of P1 can be calculated as

P1stat=b12b2=var(P12)c1+cov(P1,P12)c22cov(P1,P12)c1+2σP12c2. (22)

If we assume for simplicity that cov(P1,P12)=0 when covAm,1,Av,1=0, then P1stat=var(P12)c1/2σP12c2. The population mean is at a stationary point of zero when c1=rAm,1,Am,2=0. A positive (negative) stationary point implies that the current population mean is lower (higher) than it is. In general, P1stat = 0 if var(P12)c1=cov(P1,P12)c2.

The genetic stationary point can be determined similarly:

P1g_stat=ba212ba22=var(P12)ga21+cov(P1,P12)ga222cov(P1,P12)ga21+2σP12ga22. (23)

The genetic and phenotypic stationary points differ unless re=0. They are shown in Figure 4 as functions of rAv,1,Am,2 (Figure 4A) and rAm,1,Am,2 (Figure 4B), and although they differ, their shapes do not. Both are near zero when rAv,1,Am,2 is close to −1 or 1, but large in absolute terms when rAv,1,Am,20 (Figure 4A), i.e., a hyperbolic form, because rAv,1,Am,2 largely determines the curvature of the parabolas; and both increase linearly with rAm,1,Am,2 (Figure 4B). These phenotypic and genetic stationary points can be interpreted as phenotypic and genetic optima in the case of stabilizing selection.

Figure 4.

Figure 4

(A and B) Phenotypic and genetic optima as functions of rAv,1,Am,2 (A) and rAm,1,Am,2 (B) when re=0.5 (σAm,12=σAm,22=0.3,σAv,12=0.05,σe12exp((1/2)σAv,12)=0.7, σe22=0.7, rAm,1,Av,1=0; A, rAm,1,Am,2=0.5; B,rAv,1,Am,2=0.2).

Effects of selection on genetic parameters

Both natural and artificial selection may bring the population mean to a fitness optimum. As shown in Figure 4, there is a relationship between the distance between the stationary point and the current population mean and the genetic correlations, such that the genetic correlations may change as selection shifts the population toward the stationary point. The formulas underlying Figure 4 can also be expressed to show how genetic correlations depend on the distance between the current population mean and the stationary point, e.g., a fitness optimum (Ω), assuming that the parabola has fixed shape. The regression equation can be written as

P2=b2(P1Ω)2+d, (24)

where d is a constant. If we assume d=0 and cov(P1,P12)=0, then b2=c2/var(P12). From Equation 22, b1=2b2Ω, substituting b2=c2/var(P12) and equating to the expression for b1 in Equation 18 assuming cov(P1,P12)=0,

b1=2c2Ωvar(P12)=c1σP12. (25)

The change in genetic correlation rAm,1,Am,2 can be predicted as the population mean moves to the stationary point. For instance, if all other parameters stay equal,

c1=2σP12c2Ωvar(P12) (26)
rAm,1,Am,2=2σP12c2Ωvar(P12)σAm,1σAm,2=2σP12Ω(covAv,1,Am,2σe12exp((1/2)σAv,12)+cove1e2covAm,1,Av,1)var(P12)σAm,1σAm,2. (27)

Equation 27 shows that rAm,1,Am,2 and Ω are linearly related.

Similarly, for the genetic correlation between rAv,1,Am,2,

c2=c1var(P12)2σP12Ω (28)
rAv,1,Am,2=c1var(P12)2σP12ΩσAv,1σAm,2cove1e2covAm,1,Av,1σAv,1σAm,2, (29)

showing that rAv,1,Am,2 has a hyperbolic relationship with Ω (Figure 4A).

Example: index selection with changing genetic correlation rAm,1,Am,2

If index selection is applied to improve trait 2 using the phenotypic deviation of trait 1 and its square, i.e., Equations 14 and 15, the mean of trait 1 will tend to move to the optimum value (Figure 5). If only rAm,1,Am,2 changes according to Equation 27, the genetic correlation will become smaller as the population mean approaches the optimum. The response in trait 1 is nearly linear up to generation seven and becomes almost zero after generation eight, whereas the response in trait 2 is curvilinear. The response in trait 2 in the first seven generations is due mainly to bringing the population mean for trait 1 to the optimum and subsequently is due to reducing the phenotypic variance in trait 1.

Figure 5.

Figure 5

Population mean for traits 1 and 2 and the phenotypic variance of trait 1 as a function of generation when applying index selection for trait 2 using the phenotype deviation of trait 1 and the squared phenotypic deviation of trait 1 in the index (starting values are as follows: selected proportion is 5%, σAm,12=σAm,22=0.3, σAv,12=0.05, σe12exp((1/2)σAv,12)=0.7, σe22=0.7, rAm,1,Am,2=0.5, rAm,1,Av,1=0.1, re=0, rAv,1,Am,2=0.5).

Deriving the phenotypic relationship between P2 and P1 when P2 is a binary trait

Until now, we assumed multivariate normality but nonlinear relationships between traits occur when one of the traits is not normally distributed, such as survival or presence/absence of disease. Therefore we extend the prediction of P2 on P1 (Equation 3) when P2 is a 0/1 binomial trait. We assume that P2 is normally distributed on the underlying/liability scale (P2,liab) (Gianola 1982; Foulley 1992) and therefore, P2,liab^ can be predicted using Equation 3. On the observed scale (P2,obs), the proportion of animals that survive (dead = 0; alive =1) is predicted as

P2,obs=psurvival=xP2,liab^φ(t)dt, (30)

where x is the truncation point of the normal distribution corresponding to the average survival in the population (psurvival¯), xP2,liabP2,liab^ represents the predicted truncation point given P1, and φ(t) is the standard normal density function. Using Equation 30 together with Equation 3 to predict P2,obs from P1 shows very good agreement with results from Monte Carlo simulation (Figure 6), with R2 > 0.995 for all cases evaluated. (Fortran scripts for all theoretical predictions and Monte Carlo simulations are provided in Supporting Information, File S3).

Figure 6.

Figure 6

The relationship between the proportion of surviving animals P2,obs and P1 using Monte Carlo simulation (MC_lin) and multiple regression (MR_lin) either with no genetic correlation (rAv,1,Am,2=0) or with a negative genetic correlation (rAv,1,Am,2=0.5) (MC_nonlin, MR_nonlin) between environmental variance in P1 and the mean of P2,liab on the underlying scale (σAm,12=σAm,22=0.3,σAv,12=0.05,σe12exp((1/2)σAv,12)=0.7, σe22=0.7, rAm,1,Am,2=0.2, rAm,1,Av,1=0, re=0, average proportion for P2,obs = 0.9).

Application to Birth Weight and Stillbirth in Pigs

The theoretical framework developed above shows that an M-V genetic correlation between traits leads to a nonlinear relationship between their phenotypes. The subsequent analyses demonstrated that the genetic and phenotypic optima could differ in position, but that selection could bring the population mean to the genetic optimum. As a practical example, we investigate such a nonlinear relationship between birth weight and stillbirth in pigs. We used 32,450 birth weight records from 2129 litters from 813 sows. Piglets were crossbred whereas the sows were purebred Landrace pigs from TOPIGS. Piglet survival at birth was recorded for each piglet as one (alive) or zero (dead). More details on the data are given elsewhere (Sell-Kubiak et al. 2015). Data are provided in Supporting Information, File S2.

Statistical analysis

We analyzed piglet birth weight, its environmental variance (VE), and piglet survival at birth as three traits with a DHGLM in ASReml4 (Rönnegård et al. 2010; Felleki et al. 2012; Gilmour et al. 2014). For piglet survival we assumed a linear model with homogeneous residual variance because combining a logistic model for piglet survival with DHGLM for birth weight is not yet feasible and indeed outside the scope of this article. Fixed effects fitted were herd–year–season, parity of the sow, and sex of the piglet for all three traits. The random effects included were maternal genetic effects, litter effects, and residuals for all three traits, with each random effect trivariate normally distributed. The model is

[ym,1yv,1ym,2]=[X000X000X][bm,1bv,1bm,2]+[Z000Z000Z][am,1av,1am,2]+[W000W000W][cm,1cv,1cm,2]+[em,1ev,1em,2], (31)

where ym,1, yv,1, and ym,2 are the vectors of observations for piglet birth weight, the response variable for VE of birth weight, and piglet survival; and X, Z, and W are the incidence matrices relating fixed effects (b), random breeding values, and random litter effects to observations. The vectors am,1, av,1, and am,2 are maternal breeding values for birth weight, its VE, and survival

[am,1av,1am,2]N([000],[σAm,12covAm,1,Av,1covAm,1,Am,2σAv,12covAv,1,Am,2symmetricσAm,22]A);

cm,1, cv,1, and cm,2 are the corresponding vectors for random litter effects

[cm,1cv,1cm,2]N([000],[σCm,12covCm,1,Cv,1covCm,1,Cm,2σCv,12covCv,1,Cm,2symmetricσCm,22]I),

where I is the identity matrix; σCm,12, σCv,12, and σCm,22are the litter variances; and covCm,1,Cv,1, covCm,1,Cm,2, and covCv,1,Cm,2 are the corresponding covariances of cm,1, cv,1, and cm,2. Although sows had repeated observations, at the risk of biasing estimates of genetic covariance estimates, permanent sow effects were excluded from the model because their inclusion yielded estimates at the lower boundaries. Subsequently, these estimates of parameters were used to predict the regression of piglet survival on birth weight and were compared to the averages of survival rate for each of 20 ranked bins with equal numbers of piglet birth weight records. Because the statistical model included litter effects, the extension in Appendix C was used to predict the regression equation when the residual covariances between VE and birth weight and survival are ignored.

Results

The parameter estimates (Table 2 and Table 3) indicate that the genetic correlation between mean piglet birth weight and survival is negative, as is that between VE of birth weight and survival, the M-V genetic correlation between traits. This implies that lower birth weight or lower VE is genetically associated with a higher survival rate and, importantly, that uniform litters have higher mean survival rates. The multiple-regression equation based on the estimated variance components describes the nonlinear relationship between individual survival rate and individual birth weight quite well (Figure 7). Because a linear model for survival is used, the regression equation goes beyond a survival rate of 1.0. Accounting for the binomial nature of survival, using Equation 30, solves the problem of predicting a survival rate beyond 1.0, but the goodness of fit was slightly worse (R2 = 0.78) than for the regression assuming multivariate normality (R2 = 0.79). The relationship between individual birth weight and individual survival rate is, however, almost linear because the residual covariance between birth weight and survival substantially dilutes the nonlinear genetic relationship. Similarly Sölkner and Fuerst-Waltl (1996) found that environmental or residual correlations can greatly reduce the nonlinearity of the genetic relationship such that the phenotypic relationship is almost linear. From a biological point of view, a nonlinear relationship between piglet birth weight and survival rate at litter level can arise because extremely small piglets decrease survival rate, as also do very large piglets but to a lesser extent, because the very large piglets induce early farrowing.

Table 2. Estimates of genetic, litter, and residual variance for birth weight (kg) (BW), its environmental variance VE (BWv), and survival rate (SU) in pigs (standard error in parentheses).

Variance component BW (×10−3) BWv (×10−3) SU (×10−3)
Genetic variance 16.4 (1.51) 46.5 (7.55) 0.968 (0.22)
Litter variance 15.0 (0.80) 75.0 (7.53) 2.74 (0.26)
Residual variancea 74.9 (0.61) 1803 (14.7) 63.4 (0.50)
a

Residual variance for BW is based on a model with homogeneous residual variance.

Table 3. Estimates for traits as in Table 2 of genetic correlations (above diagonal) and litter correlations (below diagonal) in pigs (standard errors in parentheses).

Trait BW BWv SU
BW 0.51 (0.08) −0.24 (0.11)
BWv 0.20 (0.05) −0.19 (0.14)
SU 0.35 (0.05) 0.06 (0.07)

Note that residual correlation between BW and SU was 0.18.

Figure 7.

Figure 7

The phenotypic relationship between survival and individual piglet birth weight (kg) when averages were calculated for 20 bins after sorting on birth weight (obs), regression based on Equation 3 assuming multivariate normality (regress_MV), and regression based on Equations 3 and 30 to account for the binomial nature of survival (regress_bin).

Discussion

Methodological framework and quantitative genetic aspects of nonlinear relationships

This study shows that a genetic covariance between the environmental variance of one trait and the mean of another trait, i.e., an M-V genetic correlation between them, can lead to nonlinear relationships between these traits and presented a framework based on multiple regression to investigate the consequences of selection in the presence of such an M-V genetic correlation. Although all relationships between breeding values are assumed to be linear, the phenotypic relationship between traits becomes curvilinear because the breeding value for environmental variance for the first trait Av,1 is associated with the squared phenotype P12 and its covariance to the mean of trait 2. Most quantitative genetic machinery is based on multivariate normality and thus linear relationships (Bulmer 1980), but we show that it can be extended to incorporate nonlinear relationships using methods of Mulder et al. (2007) for predicting breeding values and selection responses when there is genetic variation in VE. Mulder et al. (2007) used the additive model for genetic heterogeneity of VE, but here the exponential model was used because of its statistical tractability and implementation in the DHGLM methodology to estimate genetic variance in VE. The additive and exponential models are similar, assuming that the genetic variance in VE is small and parameters can be converted from one model to the other (Mulder et al. 2007; Hill and Mulder 2010). Here the focus is on the principles rather than the details and therefore the choice of model does not have a major impact on the nonlinear relationships caused by M-V genetic correlations between traits.

We extended the DHGLM methodology to estimate genetic and environmental variances and the correlations between piglet birth weight, its VE, and survival. The current algorithm can be run in ASREML4 (code provided in Supporting Information, File S1). It could, for example, also be used to estimate the trade-offs between VE and other traits and thereby help further unravel the biological mechanisms and causes of genetic differences in VE. Genetic correlations may change due to selection, as predicted in this study, even under infinitesimal model assumptions, and therefore reestimation of genetic parameters seems important if there are curvilinear relationships between traits. Large family sizes are required to get accurate estimates for genetic variance in VE (Hill 2004; Hill and Mulder 2010), and consequently very large data sets are required for estimating M-V genetic correlations between traits with sufficient precision (inferred from Robertson 1959).

The regression in Figure 1 shows substantial nonlinearity, but in our examples the coefficient of determination (R2) would not be greatly reduced if a linear regression was applied even though the true relationship is quadratic because of an M-V genetic correlation between traits. The linear regression would generally explain ≥90% of the variation, except when the phenotypic correlation between the traits is close to zero (i.e., −0.15 < phenotypic correlation < 0.15) (Figure 8). Therefore, a linear regression between traits is sufficient when the population is far from the optimum, but if there is a nonzero M-V genetic correlation between them and the population mean is close to the optimum, quadratic regression should be used.

Figure 8.

Figure 8

The coefficient of determination (R2) of a linear regression when the true relationship is quadratic as a function of the phenotypic correlation (=rAm,1,Am,2=re) between trait 1 and trait 2 for different values of rAv,1,Am,2 (σAm,12=σAm,22=0.3,  σAv,12=0.05,  σe12exp((1/2))σAv,12)=0.7, σe22=0.7, rAm,1,Av,1=0).

A related question is how much of the nonlinearity observed is due to an M-V genetic correlation between the traits. In the pig example, the quadratic relationship improved R2 only from 0.787 to 0.793, whereas the best-fitting quadratic regression R2 would explain 0.794. Assuming that the true relationship is equal to the best-fitting quadratic regression, it can be concluded that the genetic covariance between VE of birth weight and survival is the main driver for the quadratic relationship, and no other factors need to be invoked. The environmental correlations seem to have a dominant influence on diluting the nonlinearity, which accords with findings of Sölkner and Fuerst-Waltl (1996) who investigated the effect of nonlinear relationships between traits due only to nonlinearity of their breeding values for trait means. If the environmental relationship between traits is approximately linear, the phenotypic relationship will also be approximately linear, especially when the heritabilities are small.

In this study, we investigated the contribution of M-V genetic correlations between traits to nonlinearity between them. Nonnormality of at least one of the traits can cause a nonlinear relationship between them (Varona and Sorensen 2014). Survival is binomially distributed, although we assumed multivariate normality of birth weight and survival, but estimates of additive genetic correlations on the linear scale are expected to be equal to those on the underlying scale (Vinson et al. 1976; Gianola 1982) and therefore not biased by the current procedure. Furthermore, we showed that the prediction assuming multivariate normality fitted the observed survival rates better than when assuming that survival is binomially distributed. Nevertheless, Equation 30 is a useful extension of the framework to predict phenotypes of traits that are not normally distributed, but have an underlying liability scale that is normally distributed. More research is needed to explore the effects of nonnormality on estimation of such M-V correlations.

The focus in the present study was on understanding the nonlinear relationship between traits caused by M-V correlations between traits and its consequences for artificial and natural selection. We focused on mass selection, but the same framework could be used to predict responses to selection when using information of relatives, e.g., sibs or offspring as shown by Mulder et al. (2007, 2008). With progeny testing, accuracy will be greatly improved, especially for the breeding value for VE. In Mulder et al. (2008) we showed that when profit has an optimum value, artificial selection will first bring the population mean to the optimum and subsequently selection will be to reduce the variance, similar to that in Figure 5. It should be noted that current calculations as well as those in Mulder et al. (2007, 2008) do not take into account reduced genetic variance due to gametic phase disequilibrium (Bulmer 1971).

The framework provides estimates of positions and heights of phenotypic and genetic optima, which are functions of genetic and nongenetic (co)variances. As standard errors, particularly on genetic (co)variances, are moderate to high, those on optima are also likely to be large, especially when the M-V genetic correlation between traits is close to zero (Figure 5). The calculated location of optima gives insight into where the current population mean is compared to the optimum. The piglet example shows that phenotypic and genetic optima may be very different: the current population mean is far from the phenotypic optimum (∼13 phenotypic SD), because higher birth weight still yields higher survival on average based on the regression, although observed survival rate flattens at ∼1.8–2.0 kg. The high prediction for the phenotypic optimum is likely due to neglecting the binomial nature of survival. In contrast, the genetic optimum is ∼6 phenotypic SD below the current population mean, with the discrepancy between the phenotypic and the genetic optimum due to the opposite signs of the genetic vs. litter and environmental correlations between birth weight and survival. Yoshimura and Shields (1995) indicate that environmental uncertainty is the driver between differences in optima. If genetic/nongenetic (co)variances differ among environments, the optima, i.e., fitness profiles, would also differ among environments (e.g., Dewitt and Yoshimura 1998).

Phenotypic variability and fitness in natural populations

Nonlinear relationships are often observed or hypothesized in natural populations, typically as “fitness profiles” between trait values and fitness (Lande and Arnold 1983; Falconer and Mackay 1996; Kingsolver et al. 2001, 2012). If the optimum phenotype is intermediate, there would be a negative covariance between VE and fitness, such that families with small variance would contribute more survivors, and correspondingly families with a large variance would have an advantage with disruptive or strong directional selection (Hill and Zhang 2004; Mulder et al. 2007). The strength of natural selection (i.e., the selection gradient) can be determined by regressing relative fitness on phenotypic value if it is directional or on the squared deviation of phenotypic value from the mean if it is stabilizing or disruptive (Lande and Arnold 1983). Thus if fitness is taken as trait 2, the regression coefficients in Equation 3 indicate the strength of selection. For instance, the strength of stabilizing selection is obtained as VS=1/2b2 (Falconer and Mackay 1996), where b2 is the regression of fitness on the squared deviation of the trait under selection, i.e., trait 1 in this study. Applying this to our study and assuming rAm,1Av,1=0, re=0, rAm,1Am,2=0, i.e., the population mean is at the optimum, the strength of stabilizing selection (expressed as the “variance” of the fitness profile standardized by VE on trait 1, so small values imply strong selection) would be VS=1/2b2=[2σP14+3σe14exp(σAv,12)(exp(σAv,12)1)]/[2covAv,1,Am,2σe14exp(σAv,12)] (Falconer and Mackay 1996). The estimate of VS (VS=1/2b2) for the example of piglet survival and piglet birth weight is 435VE.

Based on early estimates of the strength of stabilizing selection, Falconer and Mackay (1996) suggested that strong selection would be VS = 10 and weak selection VS = 100, with “typical” estimates of the order of 20–25 (Falconer and Mackay 1996; Zhang and Hill 2005). Kingsolver et al. (2001, 2012), however, reviewed selection gradients estimated in a range of natural populations and found that in only a few studies negative quadratic regression coefficients (b2) were actually obtained, indicating that strong stabilizing selection is rarely detected. The value obtained here for the piglet data implies very weak stabilizing selection. Indeed, as fitness typically has a low heritability (<0.1) and σAv,12<0.1 (GCVVEσAv,1<0.32) in most recent estimates using DHGLM or MCMC (Felleki et al. 2012; Mulder et al. 2013a; Rönnegård et al. 2013), the stabilizing selection driven by the M-V genetic correlation between VE and fitness is likely to be weak (examples in Table 4).

Table 4. The strength of stabilizing selection (Vs) standardized by VE for different combinations of the heritability of relative fitness (σP,fitness2=0.25), the genetic correlation between VE of trait 1 and fitness (rAv,1,Am,2), and the genetic variance in VE (σAv,12).

σAv,12
h2 fitness rAv,1,Am,2 0.05 0.1
0.02 −0.1 1339 983
−0.5 268 197
−1.0 134 98
0.1 −0.1 599 440
−0.5 120 88
−1.0 60 44

Implications and applications in agriculture

In agriculture, a frequent objective is to increase uniformity of animals and plants for some traits because these may have an optimum with respect to profit, for instance for slaughter traits in livestock because slaughterhouses prefer low variability in carcass weight and meat quality to optimize processing and to provide homogeneous products to retailers (Hennessy 2005). Some conformation traits such as udder depth or front teat placement may have a nonlinear relationship with respect to fitness traits, such as longevity (Larroque and Ducrocq 2001), or with milk production (Fuerst-Waltl et al. 1998). In contrast, there is also concern that animals with small VE would not have the capacity to react to unpredictable environments, i.e., lack robustness, so it is likely to be worthwhile to explore the relationship between VE and fitness traits in different environments. While very little is known about trade-offs between VE and means of traits, the current study provides insight in showing that a negative genetic covariance between VE and fitness is advantageous for piglet survival and birth weight. Furthermore, selection on VE can increase accuracy of selection provided the population is at the optimum and the genetic correlation between trait means is close to zero. Therefore, inclusion of VE in index selection is mainly of interest for traits that have a direct economic value for uniformity or where VE has a strong genetic correlation with an economically important trait that has a low heritability and is not strongly correlated to other economic traits.

In conclusion, this study provides a framework to study nonlinear relationships between traits caused by M-V genetic correlations between them and to assess the effects of artificial and natural selection on their means and variances. It is shown that phenotypic and genetic optima may differ; that genetic relationships are often more curvilinear than phenotypic relationships, especially for environmentally correlated traits; and that genetic correlations may change when population means change relative to the optimal phenotypes. The analysis of piglet birth weight and survival shows that the nonlinearity can be partly explained by such an M-V genetic correlation between VE of birth weight and survival.

Supplementary Material

Supporting Information

Acknowledgments

We thank Ian White, University of Edinburgh, for helping with some of the derivations. TOPIGS Research Center IPG is acknowledged for providing the data. This study was supported by Breed4Food, a public–private partnership in the domain of animal breeding and genomics.

Appendix A

Derivation of Elements in the Multiple-Regression Equation of P2 on P1 and P12

The elements of the P matrix and the c vector were derived as follows for the genetic model in Equation 1 (P1=μ1+Am,1+E1), in which E1=χexp[(1/2)(ln(σe12)+Av,1)] and E(χ2exp(Av,1)=exp((1/2)σAv,12), where the roman E denotes expectation.

The elements of P are

E(P1)=0
E(P12)=E(Am,12)+2E(Am,1E1)+E(E12)=σAm,12+σe12exp(12σAv,12)=σP2
E(P13)= E(Am,13)+3E(Am,12E1)+3E(Am,1E12)+E(E13)=3covAm,1,Av,1σe12exp(12σAv,12)
E(P14)=E(Am,14)+4E(Am,13E1)+6E(Am,12E12)+4E(Am,1E13)+E(E14)
=3σAm,14+6covAm,1,Av,12σe12exp(12σAv,12)+3σe14exp(2σAv,12),

where

E(Am,12E12)=2cov2(Am,1E1)=(2)(12)covAm,1,Av,12σe12exp(12σAv,12),

and 1/2 comes from the coefficient of Av,1 in E1=χexp[(1/2)(ln(σe12)+Av,1)];

var(P1)=E(P12)[E(P1)]2=σP2
cov(P1,P12)=E(P13)E(P1)E(P12)=3covAm,1,Av,1σe12exp(12σAv,12)
var(P12)=E(P14)[E(P12)]2=2σP14+6covAm,1,Av,12σe12exp(12σAv,12)+3σe14exp(σAv,12)(exp(σAv,12)1).

The elements of c are

cov(P2,P1)=E(Am,2Am,1)+E(E2E1)=covAm,1,Am,2+cove1e2
cov(P2,P12)=E(Am,2Am,12)+2E(Am,2Am,1E1)+E(Am,2E12)+E(E2Am,12)+2E(E2Am,1E1)+E(E2,E12)=covAv,1,A2σe12exp(12σAv,12)+covAm,1,Av,1cove1e2.

Appendix B

Derivation of Elements of P and g When Using Squared Residuals Corrected for Leverages

Instead of using P12 directly, squared residuals of P1 corrected for its leverage (l) can be used to predict breeding values for the second trait Am,2 (see Equation 17 in main text). The leverage is the diagonal element of the hat matrix for each observation (Hoaglin and Welsh 1978) as used in the DHGLM (Rönnegård et al. 2010).

Assuming that only information of the animal itself is used for breeding value estimation and in the absence of fixed effects, bAm,1=l and hence the elements of Pe in Equation 17 are

var(e121l)=var((1bAm,1)P12)=(1bAm,1)2var(P12)
cov(P1,e121l)=cov(P1,(1bAm,1)P12)=(1bAm,1)cov(P1,P12)

and similarly the elements of ga2e  are

cov(Av1,e121l)=(1bAm,1)cov(Av1,P12)
cov(Am,2,e121l)=(1bAm,1)cov(Am,2,P12).

If (1bAm,1) is put into a scaling matrix

W=[100(1bAm,1)],

the derived elements in Appendix A can be used together with the scaling matrix (see main text below Equation 17). It can be shown that the accuracy of the estimated breeding value Am,2^ (rAm,2,Am,2^) is the same as with using squared phenotypic deviations instead of squared residuals corrected for leverages:

rAm,2,Am,2^=(W1P1W1Wg)WgσAm,2=(P1g)gσAm,2.

Appendix C

Generalization of Equations Involved in Multiple Regression with Multiple Random Effects

Assume the statistical model for trait 1 (= y), its VE, and trait 2 contains fixed effects and a number of random effects (n1) in addition to the residual,

y=Xb+i=1n1Ziui+e,

and P2 is predicted from P1 and P12. The (co)variances in the P matrix are V11=i=1n1vci11, V12=V21=i=1n1vci12, V22=i=1n1vci22, where vci11 (vci22) is the variance for random effect i for trait 1 (VE of trait 1) and vci12 is the covariance for random effect i between trait 1 and its VE:

var(P1)=V11+σe12exp(12V22)
cov(P1,P12)=3V12σe12exp(12V22)
var(P12)=2(V11+σe12exp(12V22))2+6V122σe12exp(12V22)+3σe14exp(V22)(exp(V22)1).

The covariances in c are V13=i=1n1vci13, V23=i=1n1vci23, where vci13 is the covariance for random effect i for trait 1 and trait 2 and vci23 is the covariance for random effect i between trait 1 and its VE:

cov(P2,P1)=V13+cove1e2
cov(P2,P12)=V23σe12exp(12V22)+V12cove1e2.

Footnotes

Communicating editor: J. B. Wolf

Literature Cited

  1. Bulmer M. G., 1971.  The effect of selection on genetic variability. Am. Nat. 105: 201–211. [Google Scholar]
  2. Bulmer M. G., 1980.  The Mathematical Theory of Quantitative Genetics. Oxford University Press, Oxford. [Google Scholar]
  3. DeWitt T. J., Yoshimura J., 1998.  The fitness threshold model: random environmental change alters adaptive landscapes. Evol. Ecol. 12: 615–626. [Google Scholar]
  4. Falconer D. S., Mackay T. F. C., 1996.  Introduction to Quantitative Genetics. Pearson Education, Essex, UK. [Google Scholar]
  5. Felleki M., Lee D., Lee Y., Gilmour A. R., Rönnegård L., 2012.  Estimation of breeding values for mean and dispersion, their variance and correlation using double hierarchical generalized linear models. Genet. Res. 94: 307–317. [DOI] [PubMed] [Google Scholar]
  6. Foulley J. L., 1992.  Prediction of selection response for threshold dichotomous traits. Genetics 132: 1187–1194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Fuerst-Waltl B., Essl A., Sölkner J., 1997.  Nonlinear genetic relationships between traits and their implications on the estimation of genetic parameters. J. Anim. Sci. 75: 3119–3125. [DOI] [PubMed] [Google Scholar]
  8. Fuerst-Waltl B., Sölkner J., Essl A., Hoeschele I., Fuerst C., 1998.  Non-linearity in the genetic relationship between milk yield and type traits in Holstein cattle. Livest. Prod. Sci. 57: 41–47. [Google Scholar]
  9. Garreau H., Bolet G., Larzul C., Robert-Granie C., Saleil G., et al. , 2008.  Results of four generations of a canalising selection for rabbit birth weight. Livest. Sci. 119: 55–62. [Google Scholar]
  10. Geiler-Samerotte K. A., Bauer C. R., Li S., Ziv N., Gresham D., et al. , 2013.  The details in the distributions: why and how to study phenotypic variability. Curr. Opin. Biotechnol. 24: 752–759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Gianola D., 1982.  Theory and analysis of threshold characters. J. Anim. Sci. 54: 1079–1096. [Google Scholar]
  12. Gilmour A. R., Gogel B. J., Cullis B. R., Welham S. J., Thompson R., 2014.  ASReml User Guide Release 4.0. VSN International, Hemel Hempstead, UK. [Google Scholar]
  13. Hazel L. N., 1943.  The genetic basis for constructing selection indexes. Genetics 28: 476–490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Hennessy D. A., 2005.  Slaughterhouse rules: animal uniformity and regulating for food safety in meat packing. Am. J. Agric. Econ. 87: 600–609. [Google Scholar]
  15. Hill W. G., 2004.  Heterogeneity of genetic and environmental variance of quantitative traits. J. Ind. Soc. Agric. Stat. 57: 49–63. [Google Scholar]
  16. Hill W. G., Mulder H. A., 2010.  Genetic analysis of environmental variation. Genet. Res. 92: 381–395. [DOI] [PubMed] [Google Scholar]
  17. Hill W. G., Zhang X. S., 2004.  Effects on phenotypic variability of directional selection arising through genetic differences in residual variability. Genet. Res. 83: 121–132. [DOI] [PubMed] [Google Scholar]
  18. Hoaglin D. C., Welsh R. E., 1978.  The hat matrix in regression and ANOVA. Am. Stat. 32: 17–22. [Google Scholar]
  19. Jimenez-Gomez J. M., Corwin J. A., Joseph B., Maloof J. N., Kliebenstein D. J., 2011.  Genomic analysis of QTLs and genes altering natural variation in stochastic noise. PLoS Genet. 7: 1002295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Kapell D., Ashworth C. J., Knap P. W., Roehe R., 2011.  Genetic parameters for piglet survival, litter size and birth weight or its variation within litter in sire and dam lines using Bayesian analysis. Livest. Sci. 135: 215–224. [Google Scholar]
  21. Karn M. N., Penrose L. S., 1951.  Birth weight and gestation time in relation to maternal age, parity and infant survival. Ann. Eugen. 16: 147–164. [PubMed] [Google Scholar]
  22. Kingsolver J. G., Hoekstra H. E., Hoekstra J. M., Berrigan D., Vignieri S. N., et al. , 2001.  The strength of phenotypic selection in natural populations. Am. Nat. 157: 245–261. [DOI] [PubMed] [Google Scholar]
  23. Kingsolver J. G., Diamond S. E., Siepielski A. M., Carlson S. M., 2012.  Synthetic analyses of phenotypic selection in natural populations: lessons, limitations and future directions. Evol. Ecol. 26: 1101–1118. [Google Scholar]
  24. Lande R., Arnold S. J., 1983.  The measurement of selection on correlated characters. Evolution 37: 1210–1226. [DOI] [PubMed] [Google Scholar]
  25. Larroque H., Ducrocq V., 2001.  Relationships between type and longevity in the Holstein breed. Genet. Sel. Evol. 33: 39–59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lynch M., Walsh B., 1998.  Genetics and Analysis of Quantitative Traits. Sinauer Associates, Sunderland, MA. [Google Scholar]
  27. Mackay T. F. C., Lyman R. F., 2005.  Drosophila bristles and the nature of quantitative genetic variation. Philos. Trans. R. Soc. B Biol. Sci. 360: 1513–1527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Mulder H. A., Bijma P., Hill W. G., 2007.  Prediction of breeding values and selection responses with genetic heterogeneity of environmental variance. Genetics 175: 1895–1910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Mulder H. A., Bijma P., Hill W. G., 2008.  Selection for uniformity in livestock by exploiting genetic heterogeneity of residual variance. Genet. Sel. Evol. 40: 37–59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Mulder H. A., Crump R. E., Calus M. P. L., Veerkamp R. F., 2013a Unraveling the genetic architecture of environmental variance of somatic cell score using high-density single nucleotide polymorphism and cow data from experimental farms. J. Dairy Sci. 96: 7306–7317. [DOI] [PubMed] [Google Scholar]
  31. Mulder H. A., Ronnegard L., Fikse W. F., Veerkamp R. F., Strandberg E., 2013b Estimation of genetic variance for macro- and micro-environmental sensitivity using double hierarchical generalized linear models. Genet. Sel. Evol. 45: 23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Nelson R. M., Pettersson M. E., Li X. D., Carlborg O., 2013.  Variance heterogeneity in Saccharomyces cerevisiae expression data: trans-regulation and epistasis. PLoS ONE 8: e79507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Olofsson, H., J. Ripa, and N. Jonzen, 2009 Bet-hedging as an evolutionary game: the trade-off between egg size and number. Proc. R. Soc. B Biol. Sci. 276: 2963–2969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Robertson A., 1959.  The sampling variance of the genetic correlation coefficient. Biometrics 15: 469–485. [Google Scholar]
  35. Rönnegård L., Felleki M., Fikse F., Mulder H. A., Strandberg E., 2010.  Genetic heterogeneity of residual variance - estimation of variance components using double hierarchical generalized linear models. Genet. Sel. Evol. 42: 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Rönnegård L., Felleki M., Fikse W. F., Mulder H. A., Strandberg E., 2013.  Variance component and breeding value estimation for genetic heterogeneity of residual variance in Swedish Holstein dairy cattle. J. Dairy Sci. 96: 2627–2636. [DOI] [PubMed] [Google Scholar]
  37. SanCristobal-Gaudy M., Elsen J. M., Bodin L., Chevalet C., 1998.  Prediction of the response to a selection for canalisation of a continuous trait in animal breeding. Genet. Sel. Evol. 30: 423–451. [Google Scholar]
  38. Schluter D., 1988.  Estimating the form of natural-selection on a quantitative trait. Evolution 42: 849–861. [DOI] [PubMed] [Google Scholar]
  39. Schluter D., Smith J. N. M., 1986.  Natural-selection on beak and body size in the song sparrow. Evolution 40: 221–231. [DOI] [PubMed] [Google Scholar]
  40. Sell-Kubiak E., Bijma P., Knol E. F., Mulder H. A., 2015.  Comparison of methods to study uniformity of traits: application to birth weight in in pigs. J. Anim. Sci. DOI: 10.2527/jas2014-8313. [DOI] [PubMed] [Google Scholar]
  41. Shen X., Rönnegård L., 2013.  Issues with data transformation in genome-wide association studies for phenotypic variability. F1000 Res. 2: 200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Shen X., Pettersson M., Rönnegård L., Carlborg O., 2012.  Inheritance beyond plain heritability: variance-controlling genes in Arabidopsis thaliana. PLoS Genet. 8: 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Sölkner J., Fuerst-Waltl B., 1996.  Nonlinear heritabilities and curvilinear relationships: genetic factors complicating selection for functional traits. Interbull Bull. 12: 226–233. [Google Scholar]
  44. Sorensen D., Waagepetersen R., 2003.  Normal linear models with genetically structured residual variance heterogeneity: a case study. Genet. Res. 82: 207–222. [DOI] [PubMed] [Google Scholar]
  45. Stuart A., Ord J. K., 1994.  Kendall’s Advanced Theory of Statistics: Distribution Theory. Arnold, London. [Google Scholar]
  46. Sun X. Q., Elston R., Morris N., Zhu X. F., 2013.  What is the significance of difference in phenotypic variability across SNP genotypes? Am. J. Hum. Genet. 93: 390–397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Tonsor S. J., Elnaccash T. W., Scheiner S. M., 2013.  Developmental instability is genetically correlated with phenotypic plasticity, constraining heritability, and fitness. Evolution 67: 2923–2935. [DOI] [PubMed] [Google Scholar]
  48. Varona L., Sorensen D., 2014.  Joint analysis of binomial and continuous traits with a recursive model: a case study using mortality and litter size of pigs. Genetics 196: 643–651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Vinson W. E., White J. M., Kliewer R. H., 1976.  Overall classification as a selection criterion for improving categorically scored components of type in Holsteins. J. Dairy Sci. 59: 2104–2114. [Google Scholar]
  50. Wolc A., Arango J., Settar P., Fulton J. E., O’Sullivan N. P., et al. , 2012.  Genome-wide association analysis and genetic architecture of egg weight and egg uniformity in layer chickens. Anim. Genet. 43: 87–96. [DOI] [PubMed] [Google Scholar]
  51. Yang J., Loos R. J. F., Powell J. E., Medland S. E., Speliotes E. K., et al. , 2012.  FTO genotype is associated with phenotypic variability of body mass index. Nature 490: 267–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Yoshimura J., Shields W. M., 1995.  Probabilistic optimization of body size - a discrepancy between genetic and phenotypic optima. Evolution 49: 375–378. [DOI] [PubMed] [Google Scholar]
  53. Zhang X.-S., Hill W. G., 2005.  Evolution of the environmental component of the phenotypic variance: stabilizing selection in changing environments and the cost of homogeneity. Evolution 59: 1237–1244. [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES