Abstract
Accurate knowledge of the null distribution of hypothesis tests is important for valid application of the tests. In previous papers and software, the asymptotic null distribution of likelihood ratio tests for detecting genetic linkage in multivariate variance components models has been stated to be a mixture of chi-square distributions with binomial mixing probabilities. For variance components models under the complete pleiotropy assumption, we show by simulation and by theoretical arguments based on the geometry of the parameter space that all aspects of the previously stated asymptotic null distribution are incorrect—both the binomial mixing probabilities and the chi-square components. Correcting the null distribution gives more conservative critical values than previously stated, yielding P values that can easily be 10 times larger. The true mixing probabilities give the highest probability to the case where all variance parameters are estimated positive, and the mixing components show severe departures from chi-square distributions. Thus, the asymptotic null distribution has complex features that raise challenges for the assessment of significance of multivariate linkage findings. We propose a method to generate an asymptotic null distribution that is much faster than other empirical methods such as permutation, enabling us to obtain P values with higher precision more efficiently.
Keywords: Asymptotic null distribution, Likelihood ratio test, Mixing probabilities, Multivariate linkage, Nonstandard boundary condition, Single-factor model, Variance components
1. INTRODUCTION AND MAIN RESULTS
Variance components methods are widely used for mapping quantitative trait loci (Almasy and Blangero, 1998; Amos, 1994; Goldgar, 1990; Schork, 1993; SOLAR, 2008). Multivariate extensions of variance components methods, which simultaneously test genetic effects on multiple traits by exploiting extra information related to correlations among the traits, have attracted much attention and have been found to be more powerful than univariate methods by several studies (Almasy and others, 1997; Amos and others, 2001; Boomsma and Dolan, 1998; Schmitz and others, 1998; Williams, Begleiter, and others, 1999; Williams, Van Eerdewegh, and others, 1999).
The asymptotic null distribution of a univariate variance components test, which is a likelihood ratio test for comparing the full model to the null model where the genetic effect variance parameter is constrained to zero, is well known to be a 50:50 mixture of a point mass at zero and (the chi-squared distribution with 1 degree of freedom). This nonstandard limiting distribution arises because the parameter of interest lies on the boundary of the parameter space under the null hypothesis (Self and Liang, 1987).
An analogous mixture distribution was applied to multivariate variance components tests by a number of papers (Almasy and others, 1997; Amos and others, 2001; Amos and de Andrade, 2001; Kraft and others, 2004; Williams, Begleiter, and others, 1999; Williams, Van Eerdewegh, and others, 1999) in which the asymptotic null distribution of the tests was stated to be a mixture of a point mass at zero and several chi-squared distributions, with binomial mixing probabilities. This multivariate method has become widely available; for example, it is implemented in SOLAR (Almasy and Blangero, 1998; SOLAR, 2008), one of the most commonly used software packages for mapping quantitative loci.
However, the evaluation and verification of the asymptotic null distribution of this test have not yet been done systematically. As part of a study comparing the power of univariate to bivariate variance components methods, Amos and others (2001) conducted a simulation of the null distribution. Their bivariate simulation result was not consistent with the previously stated asymptotic null distribution that they assumed, but the source and nature of the inconsistency have remained unexplained (Amos and de Andrade, 2001).
Here we study the null distribution of multivariate variance components tests under a complete pleiotropy assumption. We demonstrate that all aspects of the previously stated asymptotic null distribution are incorrect—both the binomial mixing probabilities and the chi-squared components. We propose a method to assess pointwise significance by generating the asymptotic null distribution making use of the Fisher information estimated from given data. In an example application to an illustrative data set, we compare our method to other well-known empirical methods such as gene-dropping, permutation, and bootstrap methods.
2. BACKGROUND
2.1. Variance components linkage methods
Variance components linkage methods partition the variance of the traits into several random-effect components, one of which captures the influence of a hypothesized “major gene” that affects the traits. In univariate models, the response variable includes a single measured trait for each individual, modeled as a sum of 3 independent normally distributed random effects: an additive major gene effect, which is to be tested on each genetic marker, a polygenic effect, and an environmental effect with corresponding variance parameters , , and , respectively. For simplicity, here let us assume one sib-pair in each family. The trait vector for the ith family follows a multivariate normal distribution: with
|  | 
Here the identity by descent (IBD) sharing proportion  quantifies the allele sharing between individuals 1 and 2 at the locus being tested. The kinship coefficient  between individuals 1 and 2 is obtained from the pedigree without using any genetic marker information; for example, for a sib-pair, ϕi, 12 =  . Linkage is tested by comparing the null hypothesis  to the alternative . The likelihood ratio test (LRT) statistic is twice the difference between the log-likelihood of the full model and the model restricted according to the null hypothesis. The parameter , which must be nonnegative, lies on the boundary of the parameter space under the null hypothesis , so that a nonstandard boundary condition applies (Self and Liang, 1987), and the asymptotic null distribution of LRT is
. Linkage is tested by comparing the null hypothesis  to the alternative . The likelihood ratio test (LRT) statistic is twice the difference between the log-likelihood of the full model and the model restricted according to the null hypothesis. The parameter , which must be nonnegative, lies on the boundary of the parameter space under the null hypothesis , so that a nonstandard boundary condition applies (Self and Liang, 1987), and the asymptotic null distribution of LRT is  χ02 +
χ02 +  χ12, that is, the mixture distribution of 50% point mass at zero (equivalent to ) and 50 % .
χ12, that is, the mixture distribution of 50% point mass at zero (equivalent to ) and 50 % .
Multivariate variance components models are a natural extension of the above single-trait model, where the response variables include k measured traits for each individual. Continuing to illustrate with the case of one sib-pair per family, , where is the value of trait t measured on individual j in family i and
|  | (2.1) | 
Here and are general covariance matrices for the polygenic effect and environmental effect, respectively. The covariance matrix for the additive major gene effect, , is assumed to be an outer product of the form . Here the variances, which are the diagonal entries , are of course nonnegative, but the entries can take both positive and negative values, so that the covariances can be positive or negative. The null hypothesis to be tested, , is that all the major gene effect variances are 0. The parameters in the matrices G and E are nuisance parameters.
This model, also known as the “single-factor model” (Evans and others, 2004; Vogler and others, 1997) or the “complete pleiotropic model” (Almasy and others, 1997; Amos and others, 2001; Amos and de Andrade, 2001; Kraft and others, 2004; Williams, Begleiter, and others, 1999; Williams, Van Eerdewegh, and others, 1999), arises when the dominance components of variance for the effects of a single major gene on each trait are assumed to be 0. The idea of the assumed form for A is that the effect on a given trait of a given major gene genotype is a proportionality constant that depends on the trait times an effect size that depends on the genotype. That is, the major gene genotype is modeled as a latent factor that affects the traits in a linear pleiotropic manner. The single-factor model has been extensively used (Amos and others, 2001; Amos and de Andrade, 2001; Evans, 2002; Evans and Duffy, 2004; Evans and others, 2004; Marlow and others, 2003; Monaco, 2007; Vogler and others, 1997). Later, we will use the term “general model” for models without the complete pleiotropic constraint.
2.2. Asymptotic null distributions and binomial mixing probabilities
The null hypothesis of the LRT in the models discussed above violates the regularity conditions that imply the typical asymptotic chi-square distribution, the most obvious violation being that constraining a variance to be 0 forces the variance to lie in the boundary of the parameter space rather than its interior. In the previous work (Amos and others, 2001; SOLAR, 2008), the asymptotic null distribution of the multivariate LRT statistic for testing k traits has been stated to be a mixture distribution of point mass at zero and chi-square distributions with degrees of freedom from 1 to k, with mixing probabilities coming from the Binomial distribution B(k,  ), that is,
), that is,
|  | (2.2) | 
Table 1 shows mixing probabilities and critical values according to formula (2.2) for significance levels and 0.05 for k-trait asymptotic null distributions for , and 5.
Table 1.
Mixing probabilities and critical values of the previously stated asymptotic null distributions for k-trait multivariate tests according to the binomial mixture formula (2.2)
| χ02 | χ12 | χ22 | χ32 | χ42 | χ52 | Critical value (α = 0.01) | Critical value (α = 0.05) | |
| k = 2 | 0.2500 | 0.5000 | 0.2500 | 7.289 | 4.231 | |||
| k = 3 | 0.1250 | 0.3750 | 0.3750 | 0.1250 | 8.746 | 5.435 | ||
| k = 4 | 0.0625 | 0.2500 | 0.3750 | 0.2500 | 0.0625 | 10.019 | 6.498 | |
| k = 5 | 0.0313 | 0.1563 | 0.3125 | 0.3125 | 0.1563 | 0.0313 | 11.183 | 7.480 | 
3. SIMULATIONS
3.1. Methods
We simulated the null distributions of k-trait multivariate LRT statistics for , and 5. For each test, we generated 1000 data sets each including 2000 independent sib-pairs with traits simulated under the truth of the null hypothesis using the multivariate normal distributions specified at (2.1). Each trait had total variance 1 with (i.e. the polygenic effect explained 40% of the total variance of each trait) and (i.e. the rest of the variance was due to environmental effects). Polygenic and environmental correlation coefficients among traits were assigned the value 0.1, so that and for . The IBD sharing levels were simulated from {0, 1/2, 1} with probabilities {1/4, 1/2, 1/4}, respectively, which amounts to assuming complete linkage information and random sampling.
Optimization was done using the Mx software (Neale and others, 1999). We recorded the LRT statistic and estimates  for each replicate data set. In order to separate the mixture distribution of LRT into its components, we grouped the replicate results by the number of major gene variance parameters that were estimated positive, which we will denote by v. There were  groups, starting from 0 (where none of the k parameters was estimated positive) up to k (where all the k parameters were estimated positive), so that v took its values in . The empirical mixing probabilities were calculated simply as the fraction of each group within the total number of replicates.
 for each replicate data set. In order to separate the mixture distribution of LRT into its components, we grouped the replicate results by the number of major gene variance parameters that were estimated positive, which we will denote by v. There were  groups, starting from 0 (where none of the k parameters was estimated positive) up to k (where all the k parameters were estimated positive), so that v took its values in . The empirical mixing probabilities were calculated simply as the fraction of each group within the total number of replicates.
3.2. Results
Table 2, which shows the results of the null distribution simulations, reveals that the mixing probabilities do not agree with the probabilities anticipated in Table 1, that is, the binomial distribution previously used in the literature and software. For example for , the probabilities for v taking the values 0, 1, and 2 are 0.142, 0, and 0.858, respectively, whereas the corresponding binomial probabilities are 0.25, 0.5, and 0.25. The substantial discrepancies already seen in the case  become larger for higher k. Instead of a binomial distribution having probabilities  for , the true mixing distribution puts positive probability only on  and . Furthermore, rather than having equal probabilities for the cases  and , the true distribution puts larger probability on , and this effect becomes more extreme as k increases. Empirical null distributions of LRT statistics for k-trait multivariate tests for the cases where . The blue curve is the  density.
 for , the true mixing distribution puts positive probability only on  and . Furthermore, rather than having equal probabilities for the cases  and , the true distribution puts larger probability on , and this effect becomes more extreme as k increases. Empirical null distributions of LRT statistics for k-trait multivariate tests for the cases where . The blue curve is the  density.
Table 2.
Mixing probabilities, K-S tests, and critical values for k-trait multivariate test null distributions obtained by simulations. The first 6 columns are probabilities for various values for v, calculated as the fraction of replicates that yielded that value for v. The Kolmogorov tests were conducted for the v = k cases
| Number of major gene effect variance parameters estimated positive (v) | K-S test (P value) | Critical value | |||||||
| 0 | 1 | 2 | 3 | 4 | 5 | α = 0.01 | α = 0.05 | ||
| k = 2 | 0.142 | — | 0.858 | 3.3 × 10−12 | 8.640 | 5.485 | |||
| k = 3 | 0.020 | — | — | 0.980 | 4.4 × 10−9 | 12.221 | 7.696 | ||
| k = 4 | 0.001 | — | — | — | 0.999 | 8.2 × 10−4 | 14.820 | 10.172 | |
| k = 5 | — | — | — | — | — | 1.000 | 2.4 × 10−14 | 15.693 | 12.458 | 
Figure 1 shows empirical distributions of the LRT statistics in the cases where . We focused on these cases since, as shown in Table 2, only the component had large enough counts to be drawn as histogram. Figure 1 reveals some departures from the anticipated distributions. In and 3, there are more points near zero than anticipated, in , the distribution is shifted to the right compared to the curve, and is somewhat in between these 2 different patterns. To quantify the departure from the anticipated distributions, we conducted Kolmogorov–Smirnov (K-S) tests, and the results are shown in Table 2. From the small P values we conclude that the empirical distributions do not follow the anticipated distributions.
Fig. 1.
Empirical null distributions of LRT statistics for k-trait multivariate tests for the cases where v = k. The blue curve is the χk2 density.
The last 2 columns in Table 2 are critical values for the significance levels and 0.05. Compared to the critical values anticipated in Table 1, these critical values are larger, that is, more conservative. This implies that the previously stated null distribution can lead to false-positive findings. For example, suppose we observe an LRT statistic value 14 in a multivariate test with traits, which the previously stated null distribution assigns a P value of 0.002849. According to the simulated null distribution, however, the true P value is 0.025, which is about 10 times less significant.
4. GEOMETRIC EXPLANATIONS AND APPLICATIONS
Our simulation results contain a number of interesting features: (i) zero mixing proportion for the case where only a subset of boundary parameters are estimated positive and highest proportion on the case where all variance parameters are estimated positive and (ii) departure from chi-square distributions within the mixing components. Here we explain these results using theoretical arguments based on the geometry of the parameter space.
The LRT of the null hypothesis versus the alternative hypothesis uses the LRT statistic , where L is the likelihood function. According to Theorem 3 in Self and Liang (1987), when , under certain regularity conditions, the asymptotic distribution of LRT is the same as the distribution of the LRT for testing versus based on a single observation Y, where and and are cones approximating and at . This latter LRT takes the form so that, defining , the LRT is equivalent to
|  | (4.1) | 
where and under the null hypothesis. For the special case when is the identity matrix I,
|  | (4.2) | 
So in the case , LRT is the difference between the squared (Euclidean) distance from Z to and squared distance from Z to .
Below, we will mainly write expressions explicitly for the case of traits with the information matrix being the identity matrix, which reveals the essential ideas with minimal notational complexity.
4.1. Mixing probabilities
A way of thinking about the problem that is tempting but leads to incorrect conclusions, including the asymptotic distribution incorrectly stated in previous papers, is as follows. Denote the parameters of interest by . The null hypothesis  is , and the alternative  is the first quadrant , and the approximating cones  and  are the same as  and . Let  and  where
 where  is the vector θ that achieves the infimum in , that is, the projection of Z onto the first quadrant . The random point  falls in each of the 4 quadrants (regions
 is the vector θ that achieves the infimum in , that is, the projection of Z onto the first quadrant . The random point  falls in each of the 4 quadrants (regions  in Figure 2) with equal probability . When Z falls in the first quadrant (region
 in Figure 2) with equal probability . When Z falls in the first quadrant (region  , probability 1/4),  (because
, probability 1/4),  (because  is simply Z itself, so that ‖Z −
 is simply Z itself, so that ‖Z −  ‖2 = 0), which is distributed as , and both
‖2 = 0), which is distributed as , and both  1 and
1 and  2 are positive. When Z lies in the second quadrant, (
2 are positive. When Z lies in the second quadrant, ( 1,
1,  2) = (0, Z2), so that , and only  is estimated positive. Symmetric logic applies to the fourth quadrant, so that , and only  is estimated positive. When Z lies in the third quadrant, the projection (
2) = (0, Z2), so that , and only  is estimated positive. Symmetric logic applies to the fourth quadrant, so that , and only  is estimated positive. When Z lies in the third quadrant, the projection ( 1,
1,  ) = (0, 0), so that  and both parameters are estimated zero. In summary, LRT is distributed as the mixture
) = (0, 0), so that  and both parameters are estimated zero. In summary, LRT is distributed as the mixture  , with the different mixture components determined by the number of parameters estimated positive.
, with the different mixture components determined by the number of parameters estimated positive.
Fig. 2.
The parameter space θ ∈ {(θ1, θ2): θ1 > 0, θ2 > 0}.
The above reasoning, however, ignores the covariance parameter . For simplicity, let us focus our attention on the parameters of interest , and  and ignore the nuisance parameters in the model that describe the polygenic and environmental effects. The single-factor model is constrained according to . That is, letting , the constraint in the single-factor model is expressed by  . So for this problem, the null hypothesis is  and the alternative hypothesis is
. So for this problem, the null hypothesis is  and the alternative hypothesis is  . Here we are thinking of the parameter space as a curved surface in 3-dimensional space, not just a single quadrant in the 2-dimensional space . Figures 3(a) and (b) show this surface from different angles. To apply Theorem 3 of Self and Liang (1987) to this problem, we note that the hypotheses  and  are already cones, so that the approximating cones are  and . Suppose for illustration that the information matrix under the null hypothesis, , is the identity matrix . A general case will be discussed in Section 4.3.
. Here we are thinking of the parameter space as a curved surface in 3-dimensional space, not just a single quadrant in the 2-dimensional space . Figures 3(a) and (b) show this surface from different angles. To apply Theorem 3 of Self and Liang (1987) to this problem, we note that the hypotheses  and  are already cones, so that the approximating cones are  and . Suppose for illustration that the information matrix under the null hypothesis, , is the identity matrix . A general case will be discussed in Section 4.3.
Fig. 3.
Plots (a) and (b) are the parameter space of points θ = (θ1, θ2, θ3) such that  . Plot (c) shows the boundary of the polar cone of the parameter space in blue. Plot (d) shows the points Z that give small LRT values (? 0.02) in green. Plot (e) shows the points z giving v = 0 when z ∼ N(0, I). Plot (f) shows the points Z with v = 0 when the covariance of Z is not the identity matrix, but a general covariance matrix obtained from the Fisher information matrix of a genetic model.
. Plot (c) shows the boundary of the polar cone of the parameter space in blue. Plot (d) shows the points Z that give small LRT values (? 0.02) in green. Plot (e) shows the points z giving v = 0 when z ∼ N(0, I). Plot (f) shows the points Z with v = 0 when the covariance of Z is not the identity matrix, but a general covariance matrix obtained from the Fisher information matrix of a genetic model.
The above problem arises from the original genetic problem by using its asymptotic equivalence to a problem with a single observation from a Gaussian distribution, ignoring nuisance parameters, and considering the special case where the information matrix is the identity. Assume the null hypothesis holds. As in (4.2), the asymptotic distribution of the LRT is the same as the distribution of  , where  and
, where  and  = (
 = ( 1,
1,  2,
2,  3) is the vector θ that achieves the infimum in , that is, the projection of Z onto the alternative hypothesis surface
3) is the vector θ that achieves the infimum in , that is, the projection of Z onto the alternative hypothesis surface 
The distribution of LRT is a mixture of components that are distinguished by the number of variance parameters  1 and
1 and  2 that are estimated positive (which, by the constraint that defines , also determines whether or not
2 that are estimated positive (which, by the constraint that defines , also determines whether or not  3 is estimated nonzero). As before, let v be the number of variance parameters
3 is estimated nonzero). As before, let v be the number of variance parameters  1 and
1 and  2 that are estimated positive, that is, the number of positive components among
2 that are estimated positive, that is, the number of positive components among  1 and
1 and  2; the possible values for v are 0, 1, and 2.
2; the possible values for v are 0, 1, and 2.
The determination of v can be explained using the polar cone of the parameter space . The polar cone of is defined to be the set . Figure 3(c) depicts the boundary of the polar cone of the parameter space.
When Z is in the polar cone, we claim . To see this, assume and note that if , then by definition of the polar cone, which implies . Thus, the origin is the point in that is closest to Z, which gives .
When Z lies outside the polar cone, we claim with probability 1. First, by definition, for any point Z outside the polar cone , there exists a vector in that has positive inner product with Z. This implies that the projection of Z onto that vector is closer to Z than the origin is, proving . Second, we want to show with probability 1. Note that can occur when Z lies on the plane in the first, second, and fourth quadrants, which has probability 0. We claim that whenever is nonzero (which occurs with probability 1), v cannot be 1. This is shown in each quadrant in the following paragraphs.
Suppose  falls in the first quadrant, that is,  and . Without loss of generality, suppose  (an analogous argument will handle the case  and . Then the point in  having  that is closest to Z is . However, it is obvious that for sufficiently small , the point  , which has , is closer to Z than  is, proving that v cannot be 1. Thus, we have shown that when  falls in the first quadrant, we have  with probability 1.
, which has , is closer to Z than  is, proving that v cannot be 1. Thus, we have shown that when  falls in the first quadrant, we have  with probability 1.
Suppose  falls in the second quadrant, that is,  and . Again suppose , and observe that the point in  having  that is closest to Z is . Define  , and define  to be the squared distance
, and define  to be the squared distance  . The derivative
. The derivative  approaches  as . Therefore, for sufficiently small positive ϵ, we must have , so that the point  is closer to Z than  is, which implies that  cannot hold. The case where  lies in the fourth quadrant is analogous to the case of the second quadrant. Thus, for both the cases of the second and fourth quadrants, we have  with probability 1.
 approaches  as . Therefore, for sufficiently small positive ϵ, we must have , so that the point  is closer to Z than  is, which implies that  cannot hold. The case where  lies in the fourth quadrant is analogous to the case of the second quadrant. Thus, for both the cases of the second and fourth quadrants, we have  with probability 1.
In summary, when Z lies inside the polar cone, , and when Z lies outside the polar cone,  with probability 1. To illustrate this, we generated 10 000 vectors Z from the 3-dimensional Gaussian distribution with mean zero and identity covariance matrix. For each Z, we calculated v by estimating  with the optimization being performed under the constraints
 with the optimization being performed under the constraints  using the program Mx, which applies here because the projection
 using the program Mx, which applies here because the projection  is also the maximum likelihood estimate in this problem. The red points in Figure 3(e) are the points Z having , which form the polar cone of the parameter space.
 is also the maximum likelihood estimate in this problem. The red points in Figure 3(e) are the points Z having , which form the polar cone of the parameter space.
4.2. Departures from chi-squared distributions
In Section 3.2, one of the departures from the anticipated  distributions we observed was an excessive number of LRT values near 0. These small LRT values result from points Z that are located close to the boundary of the polar cone, where the difference between the squared distance from Z to the origin and the squared distance from Z to the parameter space is small. To visualize this, using the same method of generating Z as described in Section 4.1, we simulated 10 000 vectors Z from , and for each Z we performed the LRT for testing  versus  with the optimization being performed under the constraints , and  . Then we identified the points Z showing small LRT values () and plotted them as green in Figure 3(d). We see that they are located very close to the outer surface of the polar cone.
. Then we identified the points Z showing small LRT values () and plotted them as green in Figure 3(d). We see that they are located very close to the outer surface of the polar cone.
Another type of departure we observed from the simulation results, particularly in models with larger numbers of traits, was a pattern that the distribution was shifted to the right compared to the anticipated chi-square distribution. This can be explained by the fact that the parameter space consists of 2 surfaces joined at an angle of less than 180. The angle between the 2 surfaces (one blue and the other red; see Figures 3(a) and (b)) in our parameter space  is 70.54 in the direction . When Z lies between these 2 surfaces, the projection of Z onto the parameter space involves a choice of whether the point is closer to one surface or the other, and the availability of this choice affects the distribution of LRT. An analogous example for illustration in 2 dimensions can help explain this problem simply. Figure 4 shows 2 different cases. In the first, the parameter space is a 1-dimensional straight line, and in the second, the parameter space is 2 lines joined at the origin with an angle less than , say, . For both hypothesis-testing problems, the null hypothesis is the origin. For the first case, the LRT is exactly  which is distributed as . On the other hand, in the second plot, consider the points A and B lying between the 2 lines forming the parameter space. The point A has LRT , while the point B has LRT ; the choice of whether LRT is  or  depends on whether the point is closer to the first line or the second. Thus, in the first quadrant, LRT is max(), which in general is larger than, say, , which is distributed as . Thus, in the first quadrant, LRT does not have a  distribution, but rather a distribution that is stochastically larger than . Note that if we change the parameter space by decreasing the angle between the lines, the LRT in the region between the lines becomes closer to , which is distributed as . Returning to the 3-dimensional problem, as the angle between the surfaces becomes closer to , the distribution of LRT in the region between the surfaces would be closer to  because the difference between the squared distance from Z to the origin, which is , and the squared distance from Z to the alternative space, which is close to zero, would be close to , which is distributed as .
 is 70.54 in the direction . When Z lies between these 2 surfaces, the projection of Z onto the parameter space involves a choice of whether the point is closer to one surface or the other, and the availability of this choice affects the distribution of LRT. An analogous example for illustration in 2 dimensions can help explain this problem simply. Figure 4 shows 2 different cases. In the first, the parameter space is a 1-dimensional straight line, and in the second, the parameter space is 2 lines joined at the origin with an angle less than , say, . For both hypothesis-testing problems, the null hypothesis is the origin. For the first case, the LRT is exactly  which is distributed as . On the other hand, in the second plot, consider the points A and B lying between the 2 lines forming the parameter space. The point A has LRT , while the point B has LRT ; the choice of whether LRT is  or  depends on whether the point is closer to the first line or the second. Thus, in the first quadrant, LRT is max(), which in general is larger than, say, , which is distributed as . Thus, in the first quadrant, LRT does not have a  distribution, but rather a distribution that is stochastically larger than . Note that if we change the parameter space by decreasing the angle between the lines, the LRT in the region between the lines becomes closer to , which is distributed as . Returning to the 3-dimensional problem, as the angle between the surfaces becomes closer to , the distribution of LRT in the region between the surfaces would be closer to  because the difference between the squared distance from Z to the origin, which is , and the squared distance from Z to the alternative space, which is close to zero, would be close to , which is distributed as .
Fig. 4.
Influence on LRT of the angle joining the parts of the parameter. The bold lines form the parameter space and the origin is the null hypothesis.
4.3. General case of and genetic models
General case of .
So far, for simplicity, we have focused on the special case where the Fisher information was the identity matrix I. For the general case when , the results of the Self and Liang theorem stated in (4.1) show that the asymptotic distribution of the LRT can be analyzed as above (involving a difference between the squared distance from Z to the origin and the squared distance from Z to the parameter space), with the Euclidean distance replaced by the more general Mahalanobis distance . This distance takes into account the covariance of Z, where . For the multivariate normal model, can be obtained from the formula (Searle and others, 1992)
|  | (4.3) | 
Generating the asymptotic null distribution in a genetic model.
The results in (4.1) and (4.3) above allow us to simulate from the asymptotic null distribution in genetic models that have general Fisher information matrices. For example, we did this for the  trait genetic model specified in Section 3, that is, the model with parameter values , ,  and . After obtaining Σ by plugging in the specified values in G and E, using (4.3) we calculated the  Fisher information matrix for the full set of parameters . After extracting the  submatrix V of the inverted Fisher information matrix corresponding to the parameters of interest , we generated 10 000 vectors Z from . For each Z, we performed the LRT for testing  versus  under the constraints , and  using Mx. In this application of Mx, we assigned fixed values for the known covariances in V, and Mx optimized the likelihood only over mean parameters. We also determined v using the same technique of comparing a full model to a partially constrained model as described in Section 3.
 using Mx. In this application of Mx, we assigned fixed values for the known covariances in V, and Mx optimized the likelihood only over mean parameters. We also determined v using the same technique of comparing a full model to a partially constrained model as described in Section 3.
According to the results, the mixing proportions for , , and were 0.1477, 0, and 0.8523, and critical values for and were 8.713 and 5.405, which were quite similar to the results in Table 2. To confirm that the results in this section were consistent with the genetic simulation results for the trait null distribution in Section 3, a K-S test gave a P value of 0.2063.
Plot (f) in Figure 3 shows the points Z with colored as red, filling the polar cone. The contrast between the polar cones in plots (e) and (f) illustrates the effect of having a Fisher information matrix .
Different sets of nuisance parameter values.
To investigate whether and how the asymptotic null distribution is influenced by different choices of the nuisance parameters, we selected 10 sets of nuisance parameters in G and E for each k-trait model, using inverse Wishart distributions to choose matrices that were well spread out over the range of possible covariance matrices. We used the same method described in Section 4.3 to generate the asymptotic null distribution for each set of nuisance parameters. In order to assess whether the asymptotic null distributions generated using different sets of nuisance parameters were different, we conducted pairwise K-S tests. Most of the results did not show significant differences, but the asymptotic distributions for a few of the models were found to have highly significant departures from the rest, showing that the asymptotic distribution does depend on nuisance parameters. For example, Figure 5 shows a case having K-S P value .
Fig. 5.
Asymptotic null distributions of tests under 2 models with k = 4 traits.
However, although differences among asymptotic distributions of LRT statistics are real, their practical effect on P values for LRT tests seems to be quite minor. For example, even the pair of asymptotic distributions in Figure 5, which gave the smallest K-S P value we observed, are quite similar. Asymptotic null distributions of tests under 2 models with traits.
5. PROPOSED METHOD FOR GENERATING ASYMPTOTIC NULL DISTRIBUTIONS AND CALCULATING P VALUES
As we have shown, asymptotic null distributions of multivariate linkage tests have several complex features: the mixing probabilities are not binomial distributions and the mixture components show severe departures from chi-square distributions. The asymptotic null distributions also varied depending on nuisance parameter values to some extent. These complexities in the asymptotic null distribution raise challenges for the assessment of significance of multivariate linkage findings. Empirical methods for obtaining P values, including gene-dropping, permutation, and bootstrap methods, involve generating large numbers of replicated data sets, which can require long computation times. Here we propose a method to calculate P values by generating from an asymptotic null distribution. The new method is much faster than other empirical methods and it gives a correct asymptotic distribution. The idea was implicitly used in Section 4.3, although there we assumed knowledge of the nuisance parameter values. Here we specify a method that is applicable to the more realistic situation where we are given data and do not know nuisance parameter values.
The description of the method uses the following notation. In a k-trait model, the total parameter set consists of the parameters in , , and , where each matrix has  parameters. The full parameter set contains all  parameters in A, G, and E. The parameters in A consist of the parameters of interest , with constraints of the form  for  and
 parameters. The full parameter set contains all  parameters in A, G, and E. The parameters in A consist of the parameters of interest , with constraints of the form  for  and  for , where we write  for . For example, for the models with  traits, the parameters of interest are , and the constraints are
 for , where we write  for . For example, for the models with  traits, the parameters of interest are , and the constraints are  , and
, and  .
.
5.1. Method
Suppose we have a given data set and have calculated an LRT statistic for a multivariate linkage test for that data set, and we wish to calculate a P value.
STEP 1: Estimate the Fisher information from the given data.
STEP 1(a): Under the null hypothesis, estimate all parameters in the model— and
 and  . Use these to form the estimate
. Use these to form the estimate  .
.
STEP 1(b): Using  and the formula (4.3), calculate the  Fisher information matrix for the full parameter set and invert it.
 and the formula (4.3), calculate the  Fisher information matrix for the full parameter set and invert it.
STEP 1(c): Extract the submatrix of the inverted Fisher information matrix corresponding to, the parameters of interest, and denote it as V.
STEP 2: Generate N random vectors Z from the multivariate Normal distribution . An appropriate choice of N depends on the desired precision level.
STEP 3: For each Z, perform the LRT for the null hypothesis  in the model , under the constraints  for  and  for . Here the covariance matrix V is fixed at the value calculated in STEP 1(c), and the test is performed by optimizing the likelihood only over the mean parameters in the vector θ.
 for . Here the covariance matrix V is fixed at the value calculated in STEP 1(c), and the test is performed by optimizing the likelihood only over the mean parameters in the vector θ.
STEP 4: The P value is calculated as the fraction of LRT values that are larger than or equal to the LRT value calculated from the given data.
An example Mx script that performs the LRT in STEP 3 above is available in the web supplement at http://www.stat.yale.edu/∼sh437/mxExample.
5.2. Applied example in an illustrative data set
In this section, we apply our proposed method and other well-known empirical methods for assessing P values to an illustrative data set and compare the times for completing the tasks. To create the illustrative data set, we simulated 2000 independent sib-pairs with 2 traits generated from the bivariate model with the same parameter values used in Section 3. For each sibling, we simulated a segment of chromosome with length 80 cM, having 16 markers located 5 cM apart. Each marker had 8 equally frequent alleles, and IBD sharing proportions were estimated by Merlin (Abecasis and others, 2001) with grid size 2.5 cM. We prespecified a single test point at 22.5 cM on the chromosome where the LRT would be conducted.
We applied 3 well-known methods for obtaining empirical critical values and P values: gene dropping (Terwilliger and others, 1993), permutation (Wan and others, 1997), and the bootstrap (Marlow and others, 2003). Keeping the phenotype data fixed, the gene-dropping method simulates genotypes on founders’ chromosomes using estimated marker allele frequencies and then segregates the chromosomes to offspring using marker recombination fraction information. For the permutation method, we fixed the trait values and permuted IBD estimates among families. The (parametric) bootstrap, which is not as commonly applied in linkage analysis as the previous 2 methods, estimates parameters from a given data set, treats the estimates as if they were true parameters, and generates replicate data sets, keeping the genotypes the same as in the original data. We applied the above 3 methods to our illustrative data set, generating 10 000 replicates for each method using R and running LRTs at the prespecified test point for each replicated data set using Mx. For STEP 2 in our proposed method, we used .
In this experiment, our method took 56 min to complete the task, while the gene-dropping, bootstrap, and permutation methods took 5893, 3182, and 1611 min, respectively (see Table 3). In order to measure the accuracy of the null distribution estimations, we estimated the type 1 error rates for and 0.05 based on the critical values shown in Table 2. The results in Table 3 show reasonably accurate performances of all methods in controlling type 1 errors. The K-S tests for comparing these 4 distributions to the reference distribution obtained in Section 3.2 were conducted, and P values were 0.2063, 0.654, 0.2742, and 0.4982 for the proposed method, gene-dropping, bootstrap, and permutation, respectively, none of them showing severe departures.
Table 3.
Comparing the performances of proposed method and 3 empirical methods
| Method | Computation time | Type 1 error rates | Type 1 error rates | 
| (min) | (α = 0.01) | (α = 0.05) | |
| Proposed method | 56 | 0.0107 | 0.0483 | 
| Gene dropping | 5893 | 0.0099 | 0.0487 | 
| Parametric bootstrap | 3182 | 0.0100 | 0.0451 | 
| Permutation | 1611 | 0.0121 | 0.0547 | 
The empirical methods took longer than our method since they require generating many replicate data sets with each data set including as many observations as in the given data set (2000 in our experiment), whereas our method requires generating only one Gaussian observation in each data set. Also, each Mx optimization took longer for the empirical methods than for our proposed method, again because our proposed method applies Mx to data sets of just one Gaussian observation.
6. DISCUSSION
Evaluating the null distribution of a test statistic is of course a prerequisite for valid P values. The previously stated asymptotic null distribution, which gave strongly anticonservative P values, has been used in several studies. In fact, some researchers (Amos and de Andrade, 2001; Evans and others, 2004; Marlow and others, 2003; Monaco, 2007) have mentioned the possibility that the previously stated asymptotic null distribution might be incorrect. However, clear statements confirming and quantifying these discrepancies and explaining their nature and origin have not appeared previously, and some researchers (Evans and others, 2004; Monaco, 2007) have expressed the desirability of clarifying the asymptotic null distribution of this test. A part of the purpose of this paper is to provide such clarification. Our simulations establish that there is indeed a real problem underlying discrepancies that had been mentioned as a puzzle in these previous papers. A device that enabled our simulations to quantify the discrepancies was to separate the mixture distribution into components defined by the number of variance parameters estimated positive. Our geometric arguments help explain how and why the discrepancies occur and give insight into the true nature of the asymptotic distribution.
It is of interest to understand, for various possible derivations that would falsely lead to the previously stated asymptotic distribution, exactly where the reasoning fails. For the simple case of  traits, a belief that the asymptotic null distribution is simply  would follow from the incorrect reasoning given in Section 4.1. This incorrect reasoning views the parameter space as a quadrant in the 2-dimensional space of possible values for the variance parameters and also makes the further mistake of assuming the information matrix is the identity. Even correcting this reasoning to allow a general information matrix leads only to asymptotic distributions of the form ( − p)χ02 +
 − p)χ02 +  χ12 + pχ22; see Case 7 in Self and Liang (1987) for the derivation of this formula. However, this mixture of chi-square distributions is still evidently incorrect, as we have seen that the correct asymptotic distribution has zero mixing probability for the  component. All the above reasoning is invalidated by the observation that the parameter space is actually not isomorphic to the first quadrant in 2 dimensions; in fact, for each point  in the quadrant, there are 2 different points in the parameter space, with one point having a12 =
χ12 + pχ22; see Case 7 in Self and Liang (1987) for the derivation of this formula. However, this mixture of chi-square distributions is still evidently incorrect, as we have seen that the correct asymptotic distribution has zero mixing probability for the  component. All the above reasoning is invalidated by the observation that the parameter space is actually not isomorphic to the first quadrant in 2 dimensions; in fact, for each point  in the quadrant, there are 2 different points in the parameter space, with one point having a12 =  and the other having a12 = −
 and the other having a12 = − . Since there are 2 distinct distributions for each point in the quadrant, a reasonable attempt to correct this topological problem is to view the parameter space as 2 quadrants. For example, as in Figure 6, we could consider the parameters to be  and  where  can take any real value (including negative numbers) and  is constrained to be nonnegative. These 2 parameters determine the covariance  by the relationship . More precisely, to make the problem identifiable, we should remove the ray of points  from the parameter space since the 2 points  and  correspond to the same distribution. Although this modified half-plane gives a correct one-to-one correspondence with the set of distributions in our problem, attempting to apply the reasoning of Self and Liang (1987) to this situation (see their Case 6) would lead to a conjectured asymptotic null distribution of
. Since there are 2 distinct distributions for each point in the quadrant, a reasonable attempt to correct this topological problem is to view the parameter space as 2 quadrants. For example, as in Figure 6, we could consider the parameters to be  and  where  can take any real value (including negative numbers) and  is constrained to be nonnegative. These 2 parameters determine the covariance  by the relationship . More precisely, to make the problem identifiable, we should remove the ray of points  from the parameter space since the 2 points  and  correspond to the same distribution. Although this modified half-plane gives a correct one-to-one correspondence with the set of distributions in our problem, attempting to apply the reasoning of Self and Liang (1987) to this situation (see their Case 6) would lead to a conjectured asymptotic null distribution of  χ12 +
χ12 +  χ22, which is also incorrect. In fact, the assumptions required for the results of Self and Liang are questionable in our problem in this 2-dimensional formulation since the modified half-plane is not closed and the information matrix at the null hypothesis  is the zero matrix. The modified half-plane parameter space.
χ22, which is also incorrect. In fact, the assumptions required for the results of Self and Liang are questionable in our problem in this 2-dimensional formulation since the modified half-plane is not closed and the information matrix at the null hypothesis  is the zero matrix. The modified half-plane parameter space.
Fig. 6.
The modified half-plane parameter space.
In this paper, we focused on the model with the complete pleiotropic constraint. This model has been used in many studies instead of the general model that includes additional correlation parameters (Evans, 2002; Evans and Duffy, 2004; Evans and others, 2004; Loo and others, 2004; Marlow and others, 2003; Monaco, 2007; Vogler and others, 1997). One disadvantage of the general model is that optimization is more difficult (Loo and others, 2004). Also, due to the increased number of parameters, the general model has more degrees of freedom than the complete pleiotropic model. A recent study (Han and others, 2009) showed in a related but different context that using the general model is often less powerful than using the complete pleiotropic model due to the increased degrees of freedom.
Although the focus of the current study is on the complete pleiotropic model, our approaches and results can be extended to investigate the general model. The asymptotic null distribution of the LRT in the general model has also been stated to be a mixture of chi-square distributions with binomial mixing probabilities (e.g. Amos and de Andrade, 2001; Amos and others, 2001; SOLAR, 2008). Although the general model does not put explicit constraints on covariance parameters as in the complete pleiotropic model, covariance parameters still have implicit constraints, for example, ensuring that the correlation coefficients lie between and 1. For this reason, applying the binomial mixture of chi-square distributions, which does not take the implicit constraints into account, is incorrect. We are currently studying this issue and preparing a separate paper extending the approaches provided by our present study to the general model.
FUNDING
Yale Center for High Performance Computation in Biology and Biomedicine (National Institutes of Health [NIH] grant RR19895-02); NIH (R01 DC007665).
Acknowledgments
We are grateful to Dr Elena Grigorenko for inspiring our interest in these tests and for helpful advice and pointers to the literature and to Dr Michael Neale for help with Mx. Conflict of Interest: None declared.
References
- Abecasis GR, Cherny SS, Cookson WO, Cardon LR. Merlin—rapid analysis of dense genetic maps using sparse gene flow trees. Nature Genetics. 2001;30:97–101. doi: 10.1038/ng786. [DOI] [PubMed] [Google Scholar]
- Almasy L, Blangero J. Multipoint quantitative-trait linkage analysis in general pedigrees. American Journal of Human Genetics. 1998;62:1198–1211. doi: 10.1086/301844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Almasy L, Dyer T, Blangero J. Bivariate quantitative trait linkage analysis: pleiotropy versus co-incident linkages. Genetic Epidemiology. 1997;14:958. doi: 10.1002/(SICI)1098-2272(1997)14:6<953::AID-GEPI65>3.0.CO;2-K. [DOI] [PubMed] [Google Scholar]
- Amos C, de Andrade M, Zhu D. Comparison of multivariate tests for genetic linkage. Human Heredity. 2001;51:133–144. doi: 10.1159/000053334. [DOI] [PubMed] [Google Scholar]
- Amos CI. Robust variance-components approach for assessing genetic linkage in pedigrees. American Journal of Human Genetics. 1994;54:535. [PMC free article] [PubMed] [Google Scholar]
- Amos CI, de Andrade M. Genetic linkage methods for quantitative traits. Statistical Methods in Medical Research. 2001;10:3. doi: 10.1177/096228020101000102. [DOI] [PubMed] [Google Scholar]
- Boomsma DI, Dolan CV. A comparison of power to detect a QTL in sib-pair data using multivariate phenotypes, mean phenotypes, and factor scores. Behavior Genetics. 1998;28:329–340. doi: 10.1023/a:1021665501312. [DOI] [PubMed] [Google Scholar]
- Evans DM. The power of multivariate quantitative-trait loci linkage analysis is influenced by the correlation between variables. The American Journal of Human Genetics. 2002;70:1599–1602. doi: 10.1086/340850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Evans DM, Duffy DL. A simulation study concerning the effect of varying the residual phenotypic correlation on the power of bivariate quantitative trait loci linkage analysis. Behavior Genetics. 2004;34:135–141. doi: 10.1023/B:BEGE.0000013727.15845.f8. [DOI] [PubMed] [Google Scholar]
- Evans DM, Zhu G, Duffy DL, Montgomery GW, Frazer IH, Martin NG. Multivariate QTL linkage analysis suggests a QTL for platelet count on chromosome 19q. European Journal of Human Genetics. 2004;12:835–842. doi: 10.1038/sj.ejhg.5201248. [DOI] [PubMed] [Google Scholar]
- Goldgar DE. Multipoint analysis of human quantitative genetic variation. American Journal of Human Genetics. 1990;47:957–967. [PMC free article] [PubMed] [Google Scholar]
- Han SS, Grigorenko EL, Chang JT. Uncovering shared common genetic risk factors for various aspects of complex disorders captured in multiple traits. eprint arXiv. 2009;0904.2229 [Google Scholar]
- Kraft P, Bauman L, Yuan JY, Horvath S. Multivariate variance-components analysis of longitudinal blood pressure measurements from the Framingham Heart Study. BMC Genetics. 2004;4:S55. doi: 10.1186/1471-2156-4-S1-S55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loo SK, Fisher SE, Francks C, Ogdie MN, MacPhie IL, Yang M, McCracken JT, McGough JJ, Nelson SF, Monaco AP. Genome-wide scan of reading ability in affected sibling pairs with attention-deficit/hyperactivity disorder: unique and shared genetic effects. Molecular Psychiatry. 2004;9:485–493. doi: 10.1038/sj.mp.4001450. [DOI] [PubMed] [Google Scholar]
- Marlow AJ, Fisher SE, Francks C, MacPhie IL, Cherny SS, Richardson AJ, Talcott JB, Stein JF, Monaco AP, Cardon LR. Use of multivariate linkage analysis for dissection of a complex cognitive trait. The American Journal of Human Genetics. 2003;72:561–570. doi: 10.1086/368201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monaco AP. Multivariate linkage analysis of specific language impairment (SLI) Annals of Human Genetics. 2007;71:660–673. doi: 10.1111/j.1469-1809.2007.00361.x. [DOI] [PubMed] [Google Scholar]
- Neale MC, Boker SM, Xie G, Maes HH. Medical College of Virginia. Richmond, VA: Department of Psychiatry; 1999. Mx: Statistical Modeling. [Google Scholar]
- Schmitz S, Cherny SS, Fulker DW. Increase in power through multivariate analyses. Behavior Genetics. 1998;28:357–363. doi: 10.1023/a:1021669602220. [DOI] [PubMed] [Google Scholar]
- Schork NJ. Extended multipoint identity-by-descent analysis of human quantitative traits: efficiency, power, and modeling considerations. American Journal of Human Genetics. 1993;53:1306. [PMC free article] [PubMed] [Google Scholar]
- Searle SR, Casella G, McCulloch CE. Variance Components. New York: John Wiley; 1992. [Google Scholar]
- Self SG, Liang KY. Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. Journal of the American Statistical Association. 1987;82:605–610. [Google Scholar]
- SOLAR . Version 4.1.5. San Antonio, TX: Southwest Foundation For Biomedical Research; 2008. Sequential Oligogenic Linkage Analysis Routines. SOLAR. [Google Scholar]
- Terwilliger JD, Speer M, Ott J. Chromosome-based method for rapid computer simulation in human genetic linkage analysis. Genetic Epidemiology. 1993;10:217–224. doi: 10.1002/gepi.1370100402. [DOI] [PubMed] [Google Scholar]
- Vogler GP, Tang W, Nelson TL, Hofer SM, Grant JD, Tarantino LM, Fernandez JR. A multivariate model for the analysis of sibship covariance structure using marker information and multiple quantitative traits. Genetic Epidemiology. 1997;14:921–926. doi: 10.1002/(SICI)1098-2272(1997)14:6<921::AID-GEPI60>3.0.CO;2-N. [DOI] [PubMed] [Google Scholar]
- Wan Y, Cohen J, Guerra R. A permutation test for the robust sib-pair linkage method. Annals of Human Genetics. 1997;61:77–85. doi: 10.1046/j.1469-1809.1997.6110077.x. [DOI] [PubMed] [Google Scholar]
- Williams JT, Begleiter H, Porjesz B, Edenberg HJ, Foroud T, Reich T, Goate A, Van Eerdewegh P, Almasy L, Blangero J. Joint multipoint linkage analysis of multivariate qualitative and quantitative traits. II. Alcoholism and event-related potentials. The American Journal of Human Genetics. 1999;65:1148–1160. doi: 10.1086/302571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Williams JT, Van Eerdewegh P, Almasy L, Blangero J. Joint multipoint linkage analysis of multivariate qualitative and quantitative traits. I. Likelihood formulation and simulation results. The American Journal of Human Genetics. 1999;65:1134–1147. doi: 10.1086/302570. [DOI] [PMC free article] [PubMed] [Google Scholar]






