Abstract
It is well known that the regular likelihood ratio test of a bounded parameter is not valid if the boundary value is being tested. This is the case for testing the null value of a scalar variance component. Although an adjusted test of variance component has been suggested to account for the effect of its lower bound of zero, no adjustment of its interval estimate has ever been proposed. If left unadjusted, the confidence interval of the variance may still contain zero when the adjusted test rejects the null hypothesis of a zero variance, leading to conflicting conclusions. In this research, we propose two ways to adjust the confidence interval of a parameter subject to a lower bound, one based on the Wald test and the other on the likelihood ratio test. Both are compatible to the adjusted test and parametrization-invariant. A simulation study and two examples are given in the framework of ACDE models in twin studies.
Keywords: confidence interval, variance component, likelihood ratio test, Wald test, ACDE models
1 Introduction
1.1 Confidence Intervals
Statistical models, such as structural equation models (SEM), have been widely used in the analysis of twin and family data. Consequently, confidence intervals (CIs) are routinely presented along with point estimates of model parameters to quantify the uncertainty of parameter estimates due to sampling errors. Let ξ = (θ, ζ′)′ be the parameter vector in a model in which θ is the parameter of interest and ζ is the vector of other parameters. A 100(1 − α)% CI of θ is defined as the set of all test values θ0 such that the following statistical testing problem does not reject the null hypothesis at level α (Lehmann, 1986, section 5.6):
| (1) |
By definition, the construction of a CI is usually performed by inverting a test, or finding the true values that cannot be rejected by this test. If test (1) is performed using the Wald statistic
| (2) |
the CI can be constructed as (θ̂L, θ̂U) with θ̂L = θ̂ − σ(ξ̂)zα/2 and θ̂U = θ̂ + σ(ξ̂)zα/2, where σ(ξ̂) is a consistent estimate of the standard error (SE) of θ̂ and zα is the 1 − α quantile of a standard normal distribution. The subscripts L and U are used to denote lower and upper limits throughout this article. Usually, σ2(ξ̂) is taken as the corresponding diagonal element of the inverse Fisher information matrix. Note that a 95% CI would require 2.5% of the distribution at either side, which is why α is divided by two to find the upper and lower limits.
Although the SE-based CIs are widely used and easy to compute, they inherit some problems from the Wald test. Because the Wald test is not parametrization-invariant, the SE based CIs of different monotonic functions of the same parameter are usually incompatible with each other. For example, the upper and lower limits of a CI constructed for a variance are usually not the squares of those of a CI for the corresponding standard deviation (Neale, Heath, Hewitt, Eaves and Fulker, 1989). In addition, the performance of a SE-based CI depends on whether the Wald statistic is close to normally distributed for the given parametrization and sample size. For example, Fisher’s z transformation of a correlation usually requires smaller sample size to achieve asymptotic normality than does the original correlation parameter, and the coverage of a CI based on the z transformation is usually closer to the nominal level.
To avoid the problems above, Neale and Miller (1997) proposed a likelihood-based CI by inverting the likelihood ratio test (LRT). Because the LRT is parametrization-invariant, the resultant CIs have similar behavior. To obtain the likelihood-based CI, the function
| (3) |
is minimized with respect to parameter vector ξ = (θ, ζ′)′, where F is the negative twice log likelihood function and ξ̂ is the maximum likelihood estimate (MLE). The resultant values of θ at the minima are the upper and lower limits of the CI. It was noted that this procedure is not strictly equivalent to inverting a LRT because the value of the likelihood function at the minima differs from their desired values by a (usually slight) bias.
1.2 Confidence Intervals close to a Boundary
Because CIs are formed by inverting tests, they are no longer valid when the tests are not valid. This situation is encountered when the true parameter of interest θ0 is on or close to its boundary. When θ0 is on its boundary, test (1) becomes a one sided test because the alternative hypothesis now lies only on one side of the null hypothesis, and the χ2 asymptotic distribution in a regular LRT is no longer valid. LRTs under such situations have been the subject of a series of studies in general multivariate statistics and in ACE models (Self and Liang, 1987; Shapiro, 1988; Dominicus, Skrondal, Gjessing, Pedersen and Palmgren, 2006; Visscher, 2006; Wu and Neale, in press). In general, when the parameter of interest is the only parameter on a boundary and the Fisher information matrix has full rank, the asymptotic sampling distribution of the LRT statistic for test (1) is an equally weighted mixture of a distribution and a point mass at 0 (Self and Liang, 1987), and the p-value is half of that of a regular LRT (Dominicus, Skrondal, Gjessing, Pedersen and Palmgren, 2006).
Under such situation, the CI constructed by inverting the LRT test also needs to be corrected, or it will no longer be compatible with the test. It may happen that a (corrected) LRT test rejects the null hypothesis because the corrected p-value is smaller than the specified α level, but the uncorrected CI still contains the boundary because the p-value of a regular LRT is still larger than α. Nevertheless, modifying the CI involves more than modifying a boundary LRT because the corrected LRT can only suggest whether the boundary value is inside the CI, while the modified CI needs to determine all true values that would not be reject by the LRT, which concerns LRTs with true values close to the boundary.
When the true value θ0 is not on the boundary, the chance that the MLE θ̂ falls beyond the boundary can be made arbitrarily small by increasing sample size (which decreases the variance of θ̂). This implies that the χ2 distribution is still valid as an asymptotic distribution. Unfortunately, this asymptotic result is not helpful for the current purpose of CI construction because 1) the actual sample size required for a valid regular LRT increases unboundedly as θ0 gets closer and closer to its boundary, and 2) it cannot give a CI that is compatible with the corrected boundary LRT. The construction of a CI of θ, which concerns a range of true values θ0 in the neighborhood of the boundary, must take into account the effect of a finite sample to bridge the gap between the regular and corrected asymptotic distributions.
In this paper, we propose two methods that yield a CI compatible with the corrected LRT, one based on the inversion of a Wald test and the other based on the inversion of a LRT. For both methods, we begin with a simplified example of a bounded mean parameter of a normal distribution and then discuss models with bounded parameters in general. Algorithms for computation will be given and simulation studies and examples will also be presented.
2 Correction Based on a Wald Test
2.1 Testing a Normal Mean Close to a Bound
We begin with a brief review of the hypothesis testing procedure and CI construction of the mean parameter μ in a normal distribution N(μ, 1). If a random sample {X1, X2, · · ·, Xn} of size n is obtained, we have μ̂ = X̄. Remember that μ̂ is a random variable and has sampling distribution , where μ0 is the true value of μ in the population and is the variance of the sampling distribution. For a given sample and observed value μ̂obs and a given test value μ0, the p-value is defined as the probability to obtain a μ̂ that is more extreme than the current observation μ̂obs, or
| (4) |
where Φ() is the cumulative distribution function (cdf) of a standard normal distribution. If this p-value is smaller than or equal to α, the specified significance level, the null hypothesis of test (1) is rejected. The 100(1 − α)% CI is obtained by solving for the μ0 from the inequality p > α and is given by μ̂obs ± σ0zα/2.
When the boundary condition μ ≥ 0 is imposed, the MLE becomes μ̂+ = max{X̄, 0}. It coincides with μ̂ if μ̂ ≥ 0 but remains at zero if μ̂ < 0. As a result, it has a similar distribution to μ̂ except that a point mass on the value zero replaces the part of the distribution below zero. The size of this point mass is given by
Because this distribution has both discrete and continuous parts, neither a probability mass function (pmf) nor a probability density function (pdf) can describe it appropriately. Its cdf is shown in Figure 1.
Figure 1.

The cumulative distribution function (cdf) of the sampling distribution of μ̂+ = max{X̄, 0} for a normal model N(μ, 1) with a bounded mean μ ≥ 0. Note the jump indicates the value 0 has a positive probability. A regular normal cdf would follow the dashed curve for negative values and no single value would have positive probability.
For a given observed value of the restricted MLE, , the p-value can be calculated as
| (5) |
| (6) |
To understand this result, we first note that it is different from equation (4) because the sampling distribution μ̂+ is different from that of μ̂. In particular, when , the second term in (5) is 0 because μ̂+ cannot take negative values. As a result, the p-value in this case is just a half of what it would be when no boundary condition were imposed on parameter μ. Note the condition simplifies to in equation (6). When , or equivalently , the p-value remains the same as in a non-bounded normal mean problem.
The p-value given by equation 6 is plotted as a function of the observed MLE μ̂+ for μ0 = 1.5 and σ0 = 1 in Figure 2. We note that in the region of [0, 2μ0], the curve follows a two-sided p-value and is symmetric around the test value μ0. However, a dent occurs at 2μ0 where it switches from a two-sided p-value to a one-sided p-value. If a horizontal line of p = α intersects with the symmetric part of the curve, the α level test is the regular two sided test; if the line intersects with the trailing part of the curve, the α level test is the regular one-sided test; if the line goes through the jump, the α level test is one-sided with rejection region and its actual type I error rate is smaller than α. The last situation occurs when α lies between 2Φ (−μ0/σ0) and Φ(−μ0/σ0), or equivalently, when μ0 lies between zα/2σ0 and zασ0.
Figure 2.
The p-value of a Wald test as a function of observed MLE for μ0 = 1.5 and σ0 = 1 in a bounded normal mean problem as discussed in section 2.1. The horizontal line marks p = 0.1.
2.2 A Confidence Interval for a Bounded Normal Mean
The Wald test adjusted CI of μ can be obtained by collecting all true values μ0 that would yield a p-value greater than α. When the estimate is on the boundary, or , the CI is given by [0, σ0zα/2]. When the estimate is not on the boundary, solving for μ0 from p ≥ α using equation (6) can be quite cumbersome due to the complicated form of the p-value. Instead, we plot the p-value given by equation (6) as a function of test value μ0 for a fixed . This plot is shown in Figure 3. We can see that the upper limit of the CI is always calculated from the solid curve of a regular two-sided p-value and is always the same as the upper limit of a regular CI for a normal mean, or . The lower limit is more complicated, depending on which part of the p-value curve the horizontal line of p = α intersects. If the line p = α intersects the solid part of the p-value curve, the lower limit of CI is given by , the lower limit of a regular CI, and it must be greater than or equal to , where the p-value curve jumps. If the line p = α intersects the dashed part of the p-value curve, the lower limit of CI is given by , where the lower limit of a 100(1 − 2α)% CI, and it must be smaller than . If the line p = α goes through the jump between the two branches, the lower limit of CI is given by , the location of the jump. In practice, one may calculate and , the three potential candidates of the lower limit, and determine which one to use according to their relative locations. The above result is summarized in the following expression for the adjusted CI (μ̂U, μ̂L)1
Figure 3.
Plot of p-value in a Wald test as a function of test value μ0 given μ̂obs > 0 for a bounded normal mean problem as discussed in section 2.1. The (bold and regular) solid curve shows the two sided p-value when μ is not constrained in H1. The (bold and regular) dashed curve gives the one sided p-value. The discontinuous bold (solid and dashed) curve shows the p-value of a Wald test as given by Equation 6. Three horizontal lines denotes three different α-levels and their corresponding CIs are also marked. They correspond to the three different situations when calculating μ̂L. One should note that these different situations can happen for a single α level if μ̂obs varies. In this plot μ̂obs = 3σ0.
| (7) |
where and . It should be noted that when the true value lies between zα/2σ0 and zασ0, the Wald test adjusted CI is conservative, or has more confidence than its nominal level 1− α, because the rejection rate of the corresponding Wald test is smaller than α as discussed above.
2.3 The General Case
In light of the above analysis of the simple problem of a bounded normal mean, the general problem for a bounded parameter in a parametric model can be solved. Let f(x|ξ) be a statistical model with parameter vector ξ = (θ, ζ′)′, where θ is the parameter of interest. Let ξ̂ and θ̂ be the non-constrained maximum likelihood estimates (MLE) of ξ and θ. When a lower bound of 0 is imposed on θ, let ξ̂+ and θ̂+ be the constrained MLEs. We assume the central limit theorem (CLT) holds for both boundary and nonboundary true values ξ0’s2. Derivations in the last section lead to a CI of the same expression as in equation 7 with the μ’s replaced by θ’s and σ0 replaced by as its estimate. However, the CI in the last section was derived under the assumption that θ̂ follows a normal distribution with constant variance . In practice, normality only holds approximately through CLT with its validity depending on the parametrization, and the variance of θ̂ usually depends on the true value ξ0. To make the adjusted CI parametrization-invariant, the four critical quantities that appear in equation 7 must be based on the likelihood function. These quantities are and .
When , or is not on the boundary, and can be calculated using a likelihood-based approach as explained by Neale and Miller (1997). For example, is the upper limit of a regular 100(1 − α)% CI and it must satisfy the condition that the LRT statistic is exactly when the test (1) is tested using the LRT, or
| (8) |
where F is the negative twice log-likelihood function for the given sample. Unfortunately in Neale and Miller (1997), the limits of CI were computed using a suboptimal algorithm, which minimizes Equation 3. Minimizing Equation 3 is not equivalent to solving Equation 8 and a bias is present in . Exact solution of can be found by maximizing θ with respect to the whole parameter vector ξ subject to the nonlinear constraint3
| (9) |
which can be easily implemented in both the classic- and OpenMx for covariance structure models (e.g. the ACDE models) and in R for a general problem of a boundary parameter. Similarly, can be found by minimizing θ with respect to ξ subject to the constraint above. The third quantity , or the lower limit of a regular 100(1 − 2α)% CI can also be obtained.
The fourth quantity present in Equation 7 is , the middle point between the observed estimate and the boundary 0. In the case of a bounded normal mean, this middle point plays a significant role because when . due to the symmetry of the normal sampling distribution of θ̂. For a general parametric model, it need not be a half of because the sampling distribution of θ̂ may not be symmetric. We replace it by θ̂m defined below through the likelihood function. This desired midpoint θ̂m should satisfy
| (10) |
where ξ̂b is the MLE of ξ restricted to be on the boundary θ = 0. The above equation specifies that the “distance” between the middle point ξ̂m and the observed value ξ̂obs is just half of the distance from the observed value to the boundary. Remember that difference in F measures the squared distance. ξ̂m can be found by minimizing θ subject to the constraint F (ξ) = F (ξ̂obs) + (F (ξ̂b) − F (ξ̂obs))/4.
When , the lower limit of the CI must be θ̂L = 0. For the normal bounded mean problem discussed in section 2.1, the upper limit is given by . This is the value, if taken as the true value, that would yield a sampling distribution whose point mass on 0 is exactly q0 = α/2. In terms of the likelihood function, the LRT value should be for a data set whose non-constrained MLE ξ̂ happens to be on the boundary, or the distance between θ̂U and the boundary is zα/2. If the non-constrained MLE of the current observed data ξ̂obs with θ̂obs < 0 is available, the squared distance between θ̂obs and the boundary can be measured by . Because the desired distance from θ̂U to the boundary is zα/2, the desired distance between θ̂obs and θ̂U should be . Thus, θ̂U can be obtained by maximizing θ with respect to ξ subject to the constraint
| (11) |
The general procedure of obtaining an adjusted CI based on Wald test is illustrated in Figure 4.
Figure 4.

Flowchart of the procedure to obtain a Wald test adjusted CI.
3 Correction Based on the Likelihood Ratio Test
3.1 Testing a Normal Mean Close to a Bound
It should be noted that the proposed CI in Section 2 is based on a Wald test corrected for the boundary condition. It is in general not equivalent to inverting a (corrected) LRT unless the test value is on the boundary. This non-equivalence can be observed by revisiting the case of a bounded normal mean. The LRT for the hypothesis testing problem in (1) would use the statistic
| (12) |
which is plotted in Figure 5. When μ0 = 0, or the true value is on the boundary, T = W and the two tests are equivalent. When μ0 > 0, this test statistic coincides with the Wald test statistic in Equation 2 only if μ̂= μ̂+ ≥ 0. If the unrestricted MLE μ̂ violates the boundary condition, we have μ̂ < μ̂+ = 0 and T increases linearly in |μ̂|. As a result, as long as μ0 > 0, the LRT may reject the null hypothesis if the unrestricted MLE μ̂ is negative with a large absolute value, resulting in a rejection region for both positive and negative μ̂’s, as illustrated in Figure 5. This is in contrast to the Wald test, which uses only the information of μ̂+ and only rejects positive μ̂’s when μ0 < zα/2σ0. Figure 5 shows when the lower rejection region is beyond the boundary, it is farther away from the true value than the upper rejection region, resulting in a smaller rejection probability on the left tail than on the right tail.
Figure 5.
Plot of LRT statistic T as a function of μ̂ for a normal distribution with σ0 = μ0 = 1. If the alternative model has no boundary constraint on μ, T follows the (bold and regular) solid curve. If the alternative model imposes μ ≥ 0, T follows the bold (dashed and solid) curve. The rejection region {T > 2.5} is also marked for the second case.
For μ0 > 0, given an observed LRT statistic t, from Equation 12, the p-value can be calculated as
| (13) |
In terms of the observed MLE μ̂obs, when , we have and Equation 13 becomes
| (14) |
When , we have and Equation 13 becomes
| (15) |
Note that in this case , so only the second line in equation 13 is relevant.
The p-value is plotted in Figure 6 as a function of the unrestricted MLE μ̂obs for μ0 = 1.5 and σ0 = 1. Because this curve does not have jumps, a test with exact type I error rate of α exists for all α. In addition, the rejection region lies on both sides for all α.
Figure 6.
The p-value of a LRT as a function of observed MLE μ̂obs for μ0 = 1.5 and σ0 = 1 in a bounded normal mean problem as discussed in section 3.1. The horizontal line marks p = 0.1.
3.2 A Confidence Interval for a Bounded Normal Mean
To obtain the CI, we only need to find the true value μ0 for which the p-value is greater than α. If the p-value in Equation 14 is plotted in Figure 7 as a function of the test value μ0. We can see the upper limit of the CI, similar to that of the Wald test adjusted CI, is always given by , the upper limit of a regular 100(1 − α)% CI. The lower limit of the CI is more complicated, depending on whether the horizontal line p = α crosses the (bold) dashed part of the curve or the (bold) solid part of the curve. The two parts of the p-value curves join each other at μ̂+/2. If the horizontal line crosses the (bold) solid part of the p-value curve, the resulting lower limit is given by , the lower limit of a regular 100(1 − α)% CI, and it satisfies . If the horizontal line crosses the (bold) dashed part of the p-value curve, the resulting lower limit satisfies the equations
Figure 7.
Plot of p-value in a LRT as a function of test value μ0 given μ̂obs > 0 for a bounded normal mean problem as discussed in section 3.1. The (bold and regular) solid curve shows the two sided p-value when μ is not constrained in H1. The bold (solid and dashed) curve shows the p-value of a LRT as given by Equation 13. The one sided p-value is also included as a dotted curve. Note the dotted curve and the bold p-value curve only intersect at μ0 = 0 but are very close to each other over a range of positive μ0. Two horizontal lines denotes two different α-levels and their corresponding CIs are also marked. They correspond to the two different situations when calculating μ̂L. One should note that these different situations can happen for a single α level but for different μ̂obs’s. In this plot μ̂obs = 3σ0.
| (16) |
and . The above result is summarized below as
where ( ) is a regular 100(1 − α)% CI, and satisfies Equation 16. If , immediately we have μ̂L = 0 and μ̂U is given by the value that satisfies
| (17) |
Comparing the LRT adjusted and the Wald test adjusted CIs, we can see that when , both methods produce the regular if the lower limit is larger than the middle point between θ̂obs and the boundary 0. If this is not true, the two methods differ in how to adjust the lower limit. As can be observed from Figure 7, the Wald test adjusted CI gives a higher lower limit than LRT adjusted one, though these two lower limits may be numerically the same over a region above the boundary. When the observed data violate the boundary condition, the lower limit of the CI must be 0, and the two methods differ on how the upper limit be calculated. The Wald test adjusted CI has the same upper limit irrespective of the unrestricted MLE θ̂obs, while LRT adjusted CI considers the information from θ̂obs < 0 and has a smaller upper limit.
3.3 The General Case
To extend the above arguments to the general case of a bounded parameter and to achieve invariance under reparametrization, quantities in the CI must be likelihood-based. Section 2.3 provided the algorithms for finding and the middle point θ̂m. Below we give the procedures to calculate likelihood based and , the counterparts of and in section 3.2.
When needs to be calculated. Its counterpart in the normal model, , satisfies equation (16), where gives the distance between μ̂obs and , and is the distance from to the boundary 0. For a general parametric model with bounded parameter θ, can be replaced by , an approximation of the distance from to θ̂obs, and by rb − r*, where , an approximation of the distance from θ̂ to the boundary. Note that ξ̂b is the MLE constrained to be on the boundary (i.e., θ̂b = 0). Thus, satisfies
| (18) |
where r* as explained above is a function of . Following from this equation, one may obtain by minimizing θ with respect to ξ subject to
| (19) |
where r2 = F (ξ) − F (ξ̂). Note rb can be obtained before the optimization.
When , the upper limit of the CI is . For , its counterpart in the normal model satisfies equation 17, where is the square root of the LRT statistic t if the in the test (see equation 12), and is the distance between and μ̂obs. As a result, should satisfy
| (20) |
where and are both functions of . can be obtained by maximizing θ with respect to ξ under the constraint
| (21) |
where r12 = F (ξ) − F (ξ̂obs) and .
Both the two optimization problems above for obtaining and can be solved numerically. However, they are less stable than those for obtaining the Wald test based CI. One reason is that the nonlinear constraints in these problems are expressed in terms of the p-values, which, when small, become very insensitive to parameters. The general procedure of obtaining an adjusted CI based on LRT is illustrated in Figure 8.
Figure 8.
Flowchart of the procedure to obtain a LRT adjusted CI.
4 Simulation Study
4.1 Study I
We first present a simulation study for the simple case of a bounded normal mean μ ≥ 0. In this study, we set σ0 = 1 and vary the true value μ0, which takes four values 0, 1, 1.9 and 2.5. 10, 000 replications are used for each true value to simulate the missing probabilities4 of the two adjusted CIs and an unadjusted CI. The unadjusted CI is an naively computed CI whose upper and lower limits are determined to give an increase of in negative twice log-likelihood. The result is summarized in Table 1.
Table 1.
Missing probabilities of the unadjusted and two adjusted CIs in simulation study I. The lower and upper missing probabilities are probabilities that the true value falls below the lower limit or above the upper limit of the CI.
| μ = 0 | μ = 1 | μ = 1.9 | μ = 2.5 | ||
|---|---|---|---|---|---|
| unadjusted | lower | 2.69 | 2.18 | 2.36 | 2.36 |
| upper | 0 | 0.66 | 2.31 | 2.42 | |
| total | 2.69 | 2.84 | 4.67 | 4.78 | |
|
| |||||
| Wald test adjusted | lower | 5.07 | 4.74 | 2.71 | 2.36 |
| upper | 0 | 0 | 0 | 2.42 | |
| total | 5.07 | 4.74 | 2.71 | 4.78 | |
|
| |||||
| LRT adjusted | lower | 5.07 | 3.24 | 2.36 | 2.36 |
| upper | 0 | 1.48 | 2.31 | 2.42 | |
| total | 5.07 | 4.72 | 4.67 | 4.78 | |
Entries are percentages based on 10, 000 replications. Monte Carlo error is about ±0.43% for missing probabilities that are supposedly 5% and ±0.31% for those supposedly 2.5%.
We can see that the unadjusted CI has correct missing probabilities from below the lower limit in all cases because the upper tail of the sampling distribution is not affected by the boundary. However, the missing probability from above its upper limit is smaller than the nominal value. Especially, when the true value is on the boundary, because the upper limit of an unadjusted CI cannot be below the boundary, the missing probability from above the upper limit is 0, making the total missing rate only half of its nominal value.
The LRT adjusted CI have the correct overall missing probabilities. In particular, when μ0 = 0, the missing probability come from below its lower limit, which corresponds to samples on the upper tail of the sampling distribution. This is not surprising because in this case the test of μ = μ0 is one-sided and only samples on the upper tail of the sampling distribution should be rejected. As μ0 moves away from the boundary, the CI becomes gradually balanced. When μ0 > zα/2, the rejection region of the LRT test coincide with the traditional two sided z-test and the CI has balanced missing probabilities.
For the Wald test adjusted CI, the missing probability from above its upper limit stays at zero for μ0 < zα/2 and becomes half the nominal missing rate for μ0 > zα/2. This is consistent with the fact that a Wald test is one-sided in the former case and becomes two-sided in the latter case as explained in section 2.1 and by Figure 2. The missing probability from below its lower limit is 5% for μ0 < zα and is 2.5% for μ0 > zα/2, as expected for a one-sided test and a two-sided test, respectively. When μ0 = 1.9, which is between zα and zα/2, the CI has smaller missing probability than its nominal level.
4.2 Study II
The second simulation study is conducted with a univariate ACE model, in which the variance components a2, c2 and e2 have lower bounds of 0. In this study, we are interested in the estimation of the standardized component of common environment c2. Four different conditions are chosen with c2 = 0, 0.15, 0.2 and 0.25. In all four conditions, e2 is set at e2 = 0.1 and a2 is chosen such that the total variance is 1. The sample sizes are nMZ = nDZ = 150 for the MZ and DZ twins. 10, 000 replications are used and all replications are convergent for all procedures5. The missing probabilities from below the lower limits and above the upper limits of different types of CIs are shown in Table 2. The unconstrained CIs are CIs calculated for an ACE model with c2 allowed to become negative. The unadjusted CIs are CIs obtained from the ACE model in the traditional way without adjustment. They differ from the unconstrained CIs in two ways: first, their lower limit cannot be negative; second, when a boundary MLE is obtained, their upper limit produces an increase of in negative twice log-likelihood computed at the boundary MLE instead of at a non-constrained MLE and is therefore greater than that of the unconstrained CIs. The remaining two types of CIs are those derived in sections 2.3 and 3.3.
Table 2.
Missing probabilities of CIs in simulation study II. The lower and upper missing probabilities are probabilities that the true value falls below the lower limit or above the upper limit of the CI.
| c2 = 0 | c2 = 0.15 | c2 = 0.2 | c2 = 0.25 | ||
|---|---|---|---|---|---|
| unconstrained | lower | 2.62 | 2.67 | 2.72 | 2.86 |
| upper | 2.55 | 2.48 | 2.21 | 2.28 | |
| total | 5.17 | 5.15 | 4.93 | 5.14 | |
|
| |||||
| unadjusted | lower | 2.62 | 2.67 | 2.72 | 2.86 |
| upper | 0 | 1.76 | 2.15 | 2.28 | |
| total | 2.62 | 4.43 | 4.87 | 5.14 | |
|
| |||||
| Wald test adjusted | lower | 5.16 | 5.21 | 5.03 | 2.86 |
| upper | 0 | 0 | 0 | 2.28 | |
| total | 5.16 | 5.21 | 5.03 | 5.14 | |
|
| |||||
| LRT adjusted | lower | 5.16 | 3.25 | 2.80 | 2.86 |
| upper | 0 | 2.16 | 2.18 | 2.28 | |
| total | 5.16 | 5.41 | 4.98 | 5.14 | |
Entries are percentages based on 10, 000 replications. Monte Carlo error is about ±0.43% for missing probabilities that are supposedly 5% and ±0.31% for those supposedly 2.5%.
The unconstrained CI has correct total missing probabilities under all conditions. This is not surprising as no boundary issue arise in this case. This CI is included to show that the χ2 approximation works well for the given sample sizes when no boundary is present. The missing probabilities for the remaining three types of CIs in the table exhibit a similar pattern to simulation study I, except that a missing probability smaller than the nominal value is not observed for the Wald test adjusted CI in this study.
Table 3 compares the sizes of the adjusted CIs. When boundary MLE is obtained, the lower limits of both CIs must be zero and the LRT based CI has smaller upper limit and is therefore shorter. When the MLE is not on the boundary, both CIs have the same upper limit, which is also the upper limit of an unconstrained CI. In this case, the lower limit of the LRT adjusted CI is smaller than or equal to that of the Wald test adjusted CI. The two types of CIs have the same lower limits if both limits are close or equal to 0 or are above the middle point . The overall frequencies of relative sizes depends on the frequency of occurrence of boundary MLEs. However, the expected lengths of the two kinds of CIs are very close to each other, as suggested by Table 4.
Table 3.
Comparison of adjusted CIs in simulation study II.
| c2 = 0 | c2 = 0.15 | c2 = 0.2 | c2 = 0.25 | ||
|---|---|---|---|---|---|
| ĉ = 0 | UW > ULRT | 49.77 | 9.95 | 3.79 | 1.28 |
|
| |||||
| ĉ > 0 | 0 = LLRT = LW | 45.07 | 55.20 | 42.53 | 27.23 |
| 0 < LLRT = LW | 2.63 | 11.79 | 14.92 | 15.98 | |
| 0 < LLRT < LW | 2.53 | 23.06 | 38.76 | 55.61 | |
| subtotal | 50.23 | 90.05 | 96.21 | 98.82 | |
|
| |||||
| overall length comparison | CIW >CILRT | 49.77 | 9.95 | 3.79 | 1.28 |
| CIW =CILRT | 47.70 | 66.99 | 57.45 | 43.11 | |
| CIW <CILRT | 2.53 | 23.06 | 38.76 | 55.61 | |
Entries are percentages based on 10, 000 replications.
Table 4.
Comparison of expected lengths of the two adjusted CIs in simulation study II.
| c2 = 0 | c2 = 0.15 | c2 = 0.2 | c2 = 0.25 | ||
|---|---|---|---|---|---|
| ĉ= 0 | CIW | 0.220 | 0.221 | 0.222 | 0.224 |
| CILRT | 0.152 | 0.178 | 0.185 | 0.188 | |
|
| |||||
| ĉ > 0 | CIW | 0.296 | 0.331 | 0.341 | 0.344 |
| CILRT | 0.296 | 0.333 | 0.346 | 0.351 | |
|
| |||||
| overall | CIW | 0.258 | 0.320 | 0.336 | 0.342 |
| CILRT | 0.225 | 0.318 | 0.339 | 0.349 | |
Entries are based on 10, 000 replications.
Figure 9 plots the calculated middle point against the MLE ĉ2+. A reference line with slope 0.5 is also plotted. It can be observed that as the estimate is away from the boundary, the calculated middle point deviates from and is greater than half of the standardized estimate.
Figure 9.
Plot of middle point against ĉ2/2 in simulation study II.
5 Examples
5.1 Depression data
For illustration, we use the data published by Neale and Cardon (1992, Chapter 6) as reproduced in Table 5. These data come from a sample of adult female twin pairs drawn from birth records in the Commonwealth of Virginia, USA. An ADE model is fitted with different thresholds for the two twin types and the parameter estimates and unadjusted 95% CIs are summarized in Table 6. As the total variance is constrained to unity for the purpose of identification, all variance components are standardized components and are bounded between zero and one. For the A and D components the lower bound of zero is attainable while for the E component the upper bound of one is attainable6. As the MLE is not on any boundary, we only need to consider the lower limits of the CIs of A and D components and the upper limit of the CI of the E component. Fortunately the 90% CIs of the A and D components both have lower limit of 0, so their adjusted 95% CIs must have lower limits zero. For the E parameter, the 90% CI has an upper limit of 0.672. When e2 = 1, the negative twice ML is 2556.185, about 46.553 larger than that of the ADE model. To find the middle point between ê2 and its boundary, we set the negative twice ML at (46.553/4 + 2509.632 = 2521.270) and maximize the e2 parameter, yielding a value of 0.780. This middle point is closer to the boundary than both the upper limits of the 95% and 90% CIs, so the CI need not be adjusted as well. It should be noted that because more than one parameter in this example is close to its boundary, all CIs displayed in Table 6 may be conservative.
Table 5.
Contingency Tables of Twin Pair Diagnosis of Lifetime Major Depressive Illness.
| Twin 2 | MZ | DZ | ||
|---|---|---|---|---|
| Twin 1 | Twin 1 | |||
|
| ||||
| Normal | Depressed | Normal Depressed | ||
| Normal | 329 | 83 | 201 | 94 |
| Depressed | 95 | 83 | 82 | 63 |
Table 6.
Parameter estimates and CIs in the ADE model fitted to the depression data.
| Parameter | MLE | unadjusted and Wald/LRT adjusted CI | |
|---|---|---|---|
| a2 | 0.303 | 0.000 | 0.530 |
| d2 | 0.131 | 0.000 | 0.542 |
| e2 | 0.566 | 0.452 | 0.689 |
| tMZ | 0.549 | 0.464 | 0.634 |
| tDZ | 0.404 | 0.314 | 0.494 |
| tMZ − tDZ | 0.145 | 0.021 | 0.269 |
The negative twice ML is 2509.632. The two adjusted CIs both coincide with the unadjusted CI.
5.2 Body mass index data
Another example includes body mass index (BMI) data from a survey of volunteers from the Australian NH&MRC twin register (Neale and Cardon, 1992, Chapter 6). Table 7 contains the covariance matrices of log-transformed BMI of male MZ and DZ twins with age less than 30. An ADE model is fitted to the data and the estimates are summarized in Table 8. For the original parameters, both the A and D components have an attainable lower bound of zero and their estimates are positive, so the lower limits of their CIs may need to be adjusted. For the A component, the 90% CI also has a lower limit of zero, so the adjusted CI must have a lower limit of zero. For the D component, the 90% CI has a lower limit of 0.038 and the middle point between d̂2 and zero is 0.144, so the adjusted lower limit should be 0.038 if the Wald type adjustment is used. If the LRT type adjustment is used, the lower limit should be separately calculated and is also 0.038.
Table 7.
Covariance matrices of MZ and DZ male twin pairs of age under 30 for body mass index: 1981 Australian Survey.
| MZ twins (nMZ = 251) | DZ twins (nMZ = 184) | ||
|---|---|---|---|
| 0.597 | 0.448 | 0.719 | 0.245 |
| 0.448 | 0.569 | 0.245 | 0.818 |
The raw data was transformed as 7 ln(BMI) − 21.
Table 8.
Parameter estimates and CIs in the ADE model fitted to the BMI data.
| Parameter | MLE | unadjusted CI | Wald/LRT adjusted CIs | |||
|---|---|---|---|---|---|---|
| original | a2 | 0.248 | 0.000 | 0.571 | 0.000 | 0.571 |
| d2 | 0.295 | 0.000 | 0.592 | 0.038 | 0.592 | |
| e2 | 0.137 | 0.116 | 0.165 | – | – | |
|
| ||||||
| standardized | a2 | 0.365 | 0.000 | 0.805 | 0.000 | 0.805 |
| d2 | 0.433 | 0.000 | 0.824 | 0.055 | 0.824 | |
| e2 | 0.202 | 0.165 | 0.248 | 0.165 | 0.248 | |
The two adjusted CIs coincide to the accuracy presented.
The standardized components are proportions of the A, D and E components in the total variance and are more interesting to researchers. The standardized a2, d2 parameters have an attainable lower bound of zero and the standardized e2 have an attainable upper bound of one. Again, for the standardized A component, the 90% CI has a zero lower limit and therefore the adjusted CI should also have lower limit zero. For the standardized C component, the lower limit of a 90% CI is 0.055 and the middle point between the estimate and the boundary is 0.210, so the adjusted CI should have a lower limit of 0.055 if the Wald type adjustment is employed. If the LRT type adjustment is used, the lower limit is still 0.055. The standardized E component is estimated far away from its upper bound of one and its CI is unlikely affected. In fact, its 90% CI has an upper limit of 0.240 and the middle point between its MLE 0.202 and its upper boundary e2 = 1 is at 0.471, much larger than the upper limits of both the upper limits of the regular 90% and 95% CIs.
6 Discussion
In this article we proposed two methods of CI adjustement when the true parameter is close to its boundary. These methods originate from Wald test and LRT of the a bounded normal mean parameter and are generalized and made parametrization-invariant. The proposed CIs do not touch the boundary if the (corrected) LRT rejects the boundary null hypothesis. Several points need to be considered before concluding the article.
First, Carey (2005) suggests the removal of the lower bounds on (scalar) variance components (and the more general inequality constraints of positive-semidefiniteness on a matrix variance component). This approach does remove the technical issue in hypothesis testing and CI, but on the other hand renders the model difficult to interpret because a variance cannot be negative. In particular, neglecting the lower bounds may yield a CI of a variance that includes negative values. This approach was motivated in a time when no adjusted LRT or CI was available. With the advance in statistical methodology (e.g. Dominicus, Skrondal, Gjessing, Pedersen and Palmgren, 2006), this approach may no longer be the best strategy for testing a variance component. The research presented in this article is to further provide CIs that give consistent results with the adjusted test.
Second, CI adjustment is only needed when the boundary of concern is not the natural boundary of the statistical model, which we now define. A parameter on or beyond its natural boundary leads to an invalid or degenerate distribution of the data. For example, a binomial parameter p beyond the boundary of (0, 1) does not define a valid binomial distribution, and similarly a population correlation between observed variables cannot be beyond (−1, 1). However, a correlation between latent variables may still define a valid multivariate normal distribution even if it is beyond this range, as a large residual variance may guarantee the positive-definiteness of the covariance matrix of the observed variables. In this case, ±1 are no longer natural boundaries of the parameter. In terms of an univariate ACE model, the lower bound of zero is the natural boundary for parameter e2. Because the MLE of e2 never violates this natural boundary, the regular LRT and the likelihood-based CI need not be modified in this case. In contrast, a2, c2 or d2 can be estimated at zero, so these boundaries are attainable and their LRTs and CIs need to be corrected.
Third, both the two proposed adjusted CIs are unbalanced in the sense that the missing rates beyond the two limits of a CI are not the same. Although this seems a drawback of the current procedure, it is a necessary feature for adjusted CIs near a boundary. To see this, we note that when the true value θ0 is on the boundary, the test in Equation 1 is necessarily a one-sided test on the upper tail of the sampling distribution and θ0 can only be rejected from below the CI. As a result, the missing rate from above the upper limit of the CI must be 0. On the other hand, if θ0 is far from the boundary, the test is not affected by the boundary condition and the CI is balanced. To connect the above two extreme cases, the adjusted CI must become gradually unbalanced when the true value approaches the boundary, with the missing rate from above the upper limit decreasing from α/2 to 0 and that from below the lower limit increasing from α/2 to α.
Fourth, as we have discussed and illustrated with simulation, between the two proposed methods, the Wald-test adjusted CI is shorter when the MLE is not on the boundary and the LRT adjusted CI is shorter when the MLE is on the boundary. It seems appealing to choose the method of adjustment to produce the shorter CI. Unfortunately this is not a valid procedure as it would result in under-coverage of the CI. To see this, remember as a frequentist concept, coverage concerns not only the situation of the current observation but also observations that would have been observed. If under those hypothetical situations a different method would be chosen to produce a shorter CI, the coverage probability would be smaller than using one method consistently.
Last, the development of the two adjusted CIs assumes that the parameter of interest is only close to one boundary and is the only parameter that is close to a boundary. Because the sampling distributions of the Wald and LRT statistics are affected by boundary conditions of all parameters, the proposed CIs may not produce desired coverage if a parameter not being considered is close to its boundary or the parameter of interest is subject to additional boundary conditions.7 However, in such cases, it is still advisable to use the proposed adjustments instead of a CI that neglects all boundary conditions. This is because the current approach still includes partial boundary information of the parameter space in such cases and is expected to produce CIs whose coverage probabilities are closer to the nominal ones. For example, the adjustment of CI of the standardized e2 when e2 is close to 1 in an ACE model is invalid, because the boundary condition of e2 = 1 implies a2 = c2 = 0, which involves two parameters. Geometrically, the parameter space is given by {a2 + c2 + e2 = 1, a2 > 0, c2 > 0 and e2 > 0}, which defines an angular region near the point e2 = 1. The proposed adjustments assume the parameter space is a half plane {a2 + c2 + e2 = 1 and e2 < 1}, which is larger than the true parameter space but still more restricted than the plane {a2 + c2 + e2 = 1} assumed in the regular CI.
7 Summary and Conclusion
In this article we present two adjustments to traditional CIs when the parameter of interest is close to its boundary. The Wald type adjustment inverts a Wald test based on the inequality-constrained MLE. The LRT type adjustment inverts a LRT. Both adjustment methods are developed to be invariant under reparametrization and are consistent with the corrected LRT for a boundary null hypothesis. Both methods also give the regular CI when the MLE is far away from the boundary. The Wald type adjustment is relatively simpler but may give CIs with higher confidence than its nominal level in some cases. The LRT type adjustment is more complicated but does not suffer from this problem of being conservative. The Wald type adjustment tends to give longer CIs when the MLE is on boundary, while the LRT type adjustment tends to give longer CIs when the MLE is not on the boundary, but the expected lengths of the CIs do not differ much.
Acknowledgments
This research is supported by the National Institute of Drug Abuse training grant R25DA026119 to the second author. We thank Gregory Carey, Sophie van der Sluis and Conor Dolan for their helpful comments that have led to the improvement of the style of this article.
Footnotes
All upper and lower limits of CIs in this article are non-inclusive except for two cases: 1) a zero lower limit in both adjusted CIs and 2) a lower limit in Wald test adjusted CI that takes the value of the middle point between the observed and the boundary values.
This assumption is satisfied for most models as long as θ0 is not an natural boundary and no nuisance parameter is on or close to their boundaries. See Discussion for the issues of a natural boundary and boundary nuisance parameters.
The “=” sign in the constraint can be replaced by “≤” if the constrained optimization algorithm requires that the starting value satisfy the constraint. This is also the case for constraints in equations 10 and 11.
i.e. the probability that the true value is not covered by the CI.
Non-convergent replications were handled by manually adjusting the starting values until convergence was reached.
see Discussion section for the distinction between natural and attainable boundaries.
Similarly, a regular CI for an unbounded parameter may not be valid when another parameter is close to its boundary.
Contributor Information
Hao Wu, Email: hao.wu.5@bc.edu, Department of Psychology, Boston College, 300 McGuinn, 140 Commonwealth Avenue, Chestnut Hill, MA 02467.
Michael C. Neale, Department of Psychiatry and Virginia Institute for Psychiatric and Behavior Genetics, Virginia Commonwealth University, E. 800 Leigh St, Richmond, VA 23219
References
- Boker S, Neale M, Maes H, Wilde M, Spiegel M, Brick T, Spies J, Estabrook R, Kenny S, Bates T, Mehta P, Fox J. OpenMx: An Open Source Extended Structural Equation Modeling Framework. Psychometrika. 2011;76:306–317. doi: 10.1007/s11336-010-9200-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carey G. Cholesky problems. Behavioral Genetics. 2005;35:653–665. doi: 10.1007/s10519-005-5355-9. [DOI] [PubMed] [Google Scholar]
- Dominicus A, Skrondal A, Gjessing HK, Pedersen NL, Palmgren J. Likelihood ratio tests in behavioral genetics: problems and solutions. Behavior Genetics. 2006;36:331–340. doi: 10.1007/s10519-005-9034-7. [DOI] [PubMed] [Google Scholar]
- Lehmann EL. Testing Statistical Hypotheses. John Wiley and Sons; 1986. [Google Scholar]
- Neale MC. Mx: Statistical Modeling. 3. Box 980126 MCV, Richmond VA 23298: 1995. [Google Scholar]
- Neale MC, Cardon LR. Methodology for Genetic Studies of Twins and Families. Kluwer Academic; Dordrecht: 1992. [Google Scholar]
- Neale MC, Heath AC, Hewitt JK, Eaves LJ, Fulker DW. Fitting genetic models with LISREL: hypothesis testing. Behavior Genetics. 1989;19:37–49. doi: 10.1007/BF01065882. [DOI] [PubMed] [Google Scholar]
- Neale MC, Miller MB. The use of likelihood based confidence intervals in genetic models. Behavior Genetics. 1997;27:113–120. doi: 10.1023/a:1025681223921. [DOI] [PubMed] [Google Scholar]
- Self SG, Liang KY. Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under non-standard conditions. Journal of the American Statistical Association. 1987;82:605–610. [Google Scholar]
- Shapiro A. Towards a unified theory of inequality constrained testing in multivariate analysis. International Statistical Review. 1988;56:49–62. [Google Scholar]
- Visscher PM. A note on the asymptotic distribution of likelihood ratio tests to test variance components. Twin research and human genetics. 2006;9:490–495. doi: 10.1375/183242706778024928. [DOI] [PubMed] [Google Scholar]
- Wu H, Neale MC. On the Likelihood Ratio Tests in Bivariate ACDE Models. Psychometrika. doi: 10.1007/s11336-012-9304-2. (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]







