Skip to main content
Translational and Clinical Pharmacology logoLink to Translational and Clinical Pharmacology
. 2025 Feb 17;33(1):10–18. doi: 10.12793/tcp.2025.33.e1

The superiority of likelihood-based confidence interval for variance estimation in a single group

Soo-Min Jung 1, Minkyu Kim 1, Kyun-Seop Bae 1,
PMCID: PMC11976149  PMID: 40206870

Abstract

The χ2 distribution is commonly used for estimating confidence interval (CI) for variance. However, the validity of the CIs from this method is highly dependent on the assumption that the population follows a normal distribution. Additionally, the Wald CI used in this method does not account for the asymmetry. To address this limitation and provide more accurate interval estimates, especially with relatively small sample sizes, a likelihood interval (LI) approach was adopted. The Likelihood-Based Interval R software package was developed to implement this approach. We conducted a simulation to compare 3 methods for interval estimation of variance in a single group, using the luteinizing hormone (Inline graphic) data available with the default R installation and random small sample sizes of 10, 20, and 30 from a standard normal distribution: the conventional χ2 interval method, the LI method, and the likelihood-based confidence interval (LBCI) method. The average width (standard deviation) of the CIs from the simulation with Inline graphic data was 0.2582 (0.0534) for LBCI, 0.2604 (0.0538) for LI, and 0.2667 (0.0551) for CI, indicating that LBCI produced the narrowest CIs. The interval coverage was 95.24% for CI, 95.38% for LBCI, and 95.45% for LI. In simulations with small sample sizes, LBCI and LI exhibited narrower widths than CI, while the coverage was similar. Therefore, LBCI or LI for variance estimation can be considered a more efficient option than the conventional method.

Keywords: Likelihood Function, Likelihood Ratio Test, Maximum Likelihood Estimates, Confidence Intervals

INTRODUCTION

Intervals in statistics include various types such as confidence interval (CI), prediction interval, credible interval, highest density interval, likelihood interval (LI), and likelihood-based confidence interval (LBCI). The current paradigm in statistics is the Null Hypothesis Significance Testing framework, which involves type I and type II errors, alpha and beta values, and null and alternative hypotheses. Within this framework, probability-based inference, particularly Wald CI estimation, is predominantly used [1]. This probability-based inference is robust in linear models such as t-tests, chi-square tests, ANOVA, and linear regression. However, it is less robust in models based on post-Fisher likelihood methods, such as generalized linear models, mixed-effects models, and survival analysis. For example, the Wald CI does not account for the asymmetry of the likelihood profile in nonlinear regression.

The LI is derived directly from the likelihood profile and reflects the shape of the likelihood profile without assuming symmetry around the parameter estimate. Therefore, the LI is well-suited for parameters obtained by Maximum Likelihood Estimation (MLE) methods [2]. However, since the current statistical paradigm primarily relies on probability-based inference, a LBCI or LI for the normal distribution has been developed to yield results similar to the conventional CI [3]. LI is an inference based solely on the likelihood function, which is not necessarily related to probability. Thus, its coverage is only approximately similar to that of the conventional CI. In contrast, LBCI is an interval that targets the probability-based conventional CI while utilizing the likelihood function, ensuring that its coverage is the same as the conventional CI.

Due to the complexity of the calculations required for LI estimation, software implementation is essential for its practical use in interval estimation. The lack of such software has hindered its widespread application [2]. In this study, we developed the Inline graphic R software (R Foundation for Statistical Computing, Vienna, Austria) package to facilitate the use of LI and LBCI for variance interval estimation in a single group, in addition to conventional methods, and added the Inline graphic and Inline graphic functions. Through simulation, we evaluated the performance of the LI and LBCI in comparison to the conventional CI for variance estimation.

METHODS

Lower limit (LL) and upper limit (UL) of conventional CI are calculated according to the following formula:

LL,ULofCI=(n-1)σ2^χ1-α22,(n-1)σ2^χα22

The concept of LI is shown in Fig. 1. CI has different heights of h1 and h2 in terms of likelihood, while LI or LBCI has the same height (h1 = h2). Therefore, Inline graphic for CI, and Inline graphic for LI or LBCI.

Figure 1. Concept of likelihood interval. The confidence interval bounds are determined by drawing a horizontal line at Inline graphic on likelihood function. Conventional confidence interval has different heights of h1 and h2 in terms of likelihood, while the likelihood interval or likelihood-based confidence interval has the same height (h1 = h2).

Figure 1

maxL, maximum likelihood.

The Inline graphic function calculates the LI for the standard deviation and variance, assuming a normal distribution in a single group. This function, as shown in Fig. 2, computes the variance of the data and then applies MLE. If the cutoff value k is not provided, it defaults to calculating the log (k) value corresponding to approximate 95% CI, and the LI is estimated accordingly. In this process, the LLs and ULs are determined using R’s basic function Inline graphic.

Figure 2. Bare R script of Inline graphic function. This function computes the variance of the data and then applies Maximum Likelihood Estimation using the log-likelihood (maxLL). If the cutoff value k is not provided, it defaults to calculating the log (k) value corresponding to an approximate 95% confidence interval, and the likelihood interval is estimated accordingly. In this process, the lower and upper limits are determined using R’s basic function, Inline graphic [2,3].

Figure 2

The formula for the MLE and the calculation of log (k) are as follows:

maxLL=n2·(log(2πσ2^+1))# maximum log-likelihood for variance estimation
logk=n2·log1+Fα1,n-2n-2#logkforapproximateCI

The degree of freedom for the numerator is n−2 because there are 2 unknown parameters (µ, σ2).

In contrast, the Inline graphic function calculates an LBCI for the standard deviation and variance, assuming a normal distribution in a single group. As shown in Fig. 3, unlike Inline graphic, this function utilizes 2 objective functions and uses both F and χ2 distributions, the former to find LL and UL corresponding to current temporary log (k) and the latter to find exact log (k) for exact CI [4].

Figure 3. Bare R script of Inline graphic function. What distinguishes Inline graphic from Inline graphic is that Inline graphic utilizes 2 objective functions and employs both F and χ2 distribution, the former objective function is used to find LL and UL corresponding to the current temporary log (k) and the latter objective function is used to determine exact log (k) for the precise confidence interval.

Figure 3

LL, lower limit; UL, upper limit.

Through simulation, the performance of the LI and LBCI methods was evaluated in comparison to the conventional CI. A total of 10,000 simulations were conducted for each method to estimate the 95% CIs using the luteinizing hormone (Inline graphic) data available with the default R installation and random small sample sizes of 10, 20, and 30 drawn from a standard normal distribution. Descriptive statistics of the intervals were computed to show the coverage percentage and width of the intervals.

These functions were developed by Kyun-Seop Bae after a thorough review of the literature that discusses the concepts of LIs and the selection of cutoff values for interval estimation [1,3,4,5,6,7]. The software implementation and data analysis presented in the results section were conducted using R version 4.4.2 (R Foundation).

RESULTS

Fig. 4 shows the application of conventional CI, LBCI, and LI to Inline graphic data, which consist of 48 levels. The interval width for variance is 0.2665 for CI, 0.2580 for LBCI, and 0.2602 for LI, indicating that LBCI has the narrowest interval. A simulation comparing the results of the 3 methods for estimating variance intervals, using data samples that follow a normal distribution with the given Inline graphic data’s mean, standard deviation, and number of data points, is presented in Fig. 5. To compare whether the intervals from each method include the actual variance and to assess the width of the intervals, the results for Inline graphic (CI), Inline graphic (LBCI), and Inline graphic (LI) were stored to indicate whether the actual variance was included. The differences between the upper and lower limits of the interval (interval width) were stored in Inline graphic (CI), Inline graphic (LBCI), and Inline graphic (LI), and the results were returned accordingly.

Figure 4. R script for comparing conventional CI, LBCI, and LI using Inline graphic data in R software. The results show that the ranges of LBCI and LI are narrower than those of the conventional CI, with LBCI being the narrowest.

Figure 4

CI, confidence interval; LBCI, likelihood-based confidence interval; LI, likelihood interval; lh, luteinizing hormone; LL, lower limit; UL, upper limit.

Figure 5. R script for simulating and comparing CI, LBCI, and LI using Inline graphic data in R software. The results for Inline graphic (CI), Inline graphic (LBCI), and Inline graphic (LI) were stored to indicate whether the actual variance was included, while the differences between the upper and lower limits of the interval (interval width) were stored in Inline graphic (CI), Inline graphic (LBCI), and Inline graphic (LI), and the results were returned accordingly.

Figure 5

CI, confidence interval; LBCI, likelihood-based confidence interval; LI, likelihood interval; lh, luteinizing hormone; LBI, Likelihood-Based Interval; LL, lower limit; UL, upper limit.

The results of the simulation using Inline graphic data are summarized in Fig. 6. The coverage was 95.45% for LI, 95.38% for LBCI, and 95.24% for CI, respectively. The average width (standard deviation) of the intervals was 0.2582 (0.0534) for LBCI, 0.2604 (0.0538) for LI, and 0.2667 (0.0551) for CI, indicating that LBCI produced the narrowest CIs.

Figure 6. Comparison results of the simulation of CI, LBCI, and LI using Inline graphic data in R software. The results show that the ranges of LBCI and LI are narrower than those of the conventional CI, with LBCI being the narrowest.

Figure 6

CI, confidence interval; LBCI, likelihood-based confidence interval; LI, likelihood interval; lh, luteinizing hormone; LBI, Likelihood-Based Interval; sd, standard deviation.

Additionally, when comparing cases where the CIs were wider than those from other methods, no instances of wider interval estimates were found for LBCI or LI compared to CI. The coverage rate for LI was the highest, while the range of the intervals was narrowest for LBCI. The results were consistent with different seed values and data. In simulations with random small sample sizes from a standard normal distribution, described in Fig. 7, the coverage was 94.81% for CI, 94.89% for LBCI, and 96.08% for LI at a sample size of 10; 94.93% for CI, 94.82% for LBCI, and 95.32% for LI at a sample size of 20; and 95.44% for CI, 95.34% for LBCI, and 95.69% for LI at a sample size of 30. LI and LBCI exhibited narrower widths than CI, with LBCI showing the narrowest width. LBCI and CI also showed similar interval coverages.

Figure 7. Comparison of the interval results for CI, LBCI, and LI at small sample sizes (10, 20, and 30). The results show that the ranges of LBCI and LI are narrower than those of the conventional CI, with LBCI being the narrowest. Additionally, the coverage percentages of LBCI and LI are similar to that of CI.

Figure 7

CI, confidence interval; LBCI, likelihood-based confidence interval; LI, likelihood interval; sd, standard deviation.

DISCUSSION

The 3 primary methodologies commonly used in statistics are ‘frequentist,’ ‘Bayesian,’ and ‘likelihoodist.’ Among these, ‘likelihoodist’ or ‘Fisherian’ is relatively less well-known compared to the other 2 schools of thought. The dominant paradigm in modern statistics is the frequentist approach, which relies on probability-based inference to calculate p-values. One of the interval estimation methods used in the frequentist approach is the Wald CI, which assumes that the population follows a normal distribution and is calculated using quadratic approximation. This method does not show significant discrepancies in interval estimation when the likelihood is symmetric. However, when the likelihood function is asymmetric, discrepancies may occur.

While the Bayesian methodology incorporates the likelihood concept, it is fundamentally a probability-based statistical technique. A drawback of the Bayesian approach is that prior probabilities are typically subjective, meaning that posterior probabilities cannot be considered fully objective. In most cases, probability-based inference is preferred, but when calculating probabilities is difficult, likelihood-based inference methods, such as MLE proposed by Fisher and the Likelihood Ratio Test (LRT) developed by Neyman, can be valuable [8]. The LI, introduced by Fisher in 1956 or earlier, is derived directly from the likelihood profile, reflecting its shape without approximation and without necessarily being symmetric around the point estimate [1]. Therefore, the LI is well-suited for parameters estimated using MLE methods, such as nonlinear regression or nonlinear mixed-effects models. Interval estimation based on the estimated likelihood rarely fails when MLE is successful [9].

In comparison to Wald CI, LBCI are generally more challenging to estimate but offer greater efficiency to model reparameterization. The accuracy of Wald CI is contingent on approaching a normal distribution and is not invariant under reparameterization or parameter transformation. For instance, the upper and lower limits of a Wald CI for variance typically do not correspond to the squares of those for the corresponding standard deviation [10]. To address these limitations of Wald CI, Neale and Miller (1997) [11] developed the LBCI by inverting the LRT. The negative of twice the logarithm of the likelihood ratio statistic has a recognizable limiting probability density function—the χ2 distribution with one degree of freedom—which can be used to construct asymptotically exact CIs [11,12]. Since the LRT does not change regardless of how the parameters are set in the model, the interval produced is also unaffected by parameter settings [13]. This extended invariance property ensures more reliable conclusions across different models.

The LI presented in this study is based on the estimated likelihood rather than the profile likelihood. When focusing on a single parameter of interest among several, the other parameters are treated as nuisance parameters and replaced with their maximum likelihood estimates [2]. Despite their advantages, LI and LBCI have not been widely used for interval estimation due to the lack of available software for their calculation. In response to this gap, this study contributes the LBCIvar and LInormVar functions to the LBI R package for variance interval estimation.

Simulation results, including those with small sample sizes, demonstrated that both LBCI and LI produced narrower intervals compared to the conventional χ2 CI. Additionally, the coverage of the LBCI was similar to that of the CI. Consequently, LBCI exhibited similar coverage while producing the narrowest interval compared to CI.

The limitations of this study include the assumption of a normal distribution in the development of the LInormVar and LBCIvar functions, which restricts their applicability to other distribution assumptions. However, authors opinion is that these can be used regardless of distribution when there is no better option. Furthermore, the simulation was conducted using in silico data rather than real-world data. While interval estimation of variance is not frequently used in clinical trial reports, it could be valuable in the descriptive statistics of continuous variables for real-world data. Given these limitations, further evaluation using real-world data is necessary, and future development of LI and LBCI methods applicable to distributions other than the normal distribution is needed. Despite these limitations, LBCI can be considered a useful method for variance interval estimation in single-group data when compared to conventional CI.

Footnotes

Conflict of Interest: - Authors: Nothing to declare

- Reviewers: Nothing to declare

- Editors: Nothing to declare

Reviewer: This article was reviewed by peer experts who are not TCP editors.

Author Contributions:
  • Conceptualization: Bae KS.
  • Formal analysis: Bae KS.
  • Investigation: Kim M.
  • Methodology: Bae KS.
  • Software: Bae KS.
  • Writing - original draft: Jung SM.
  • Writing - review & editing: Bae KS.

References

  • 1.Fisher RA. Statistical methods and scientific inference. 3rd ed. Royal Oak (MI): Haffner Press; 1973. [Google Scholar]
  • 2.Lee MH, Bae KS. Likelihood interval for nonlinear regression. Transl Clin Pharmacol. 2023;31:85–94. doi: 10.12793/tcp.2023.31.e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Pawitan Y. In all likelihood: statistical modelling and inference using likelihood. Oxford: Oxford University Press; 2001. [Google Scholar]
  • 4.Bae KS. LBI: Tool for likelihood-based inference [Internet] [Accessed January 10, 2025]. https://cran.r-project.org/package=LBI .
  • 5.Edwards AW. Likelihood: an account of the statistical concept of likelihood and its application to scientific inference. Cambridge: Cambridge University Press; 1972. [Google Scholar]
  • 6.Rohde CA. Introductory statistical inference with the likelihood function. Cham: Springer International Publishing Switzerland; 2014. [Google Scholar]
  • 7.Held L, Bové DS. Likelihood and Bayesian Inference. Berlin: Springer; 2020. [Google Scholar]
  • 8.Lehmann EL. Fisher, Nayman, and the creation of classical statistics. Berlin: Springer; 2011. [Google Scholar]
  • 9.Bae KS. wnl: Minimization tool for pharmacokinetic-pharmacodynamic data analysis [Internet] [Accessed January 10, 2025]. https://CRAN.R-project.org/package=wnl .
  • 10.Neale MC, Heath AC, Hewitt JK, Eaves LJ, Fulker DW. Fitting genetic models with LISREL: hypothesis testing. Behav Genet. 1989;19:37–49. doi: 10.1007/BF01065882. [DOI] [PubMed] [Google Scholar]
  • 11.Neale MC, Miller MB. The use of likelihood-based confidence intervals in genetic models. Behav Genet. 1997;27:113–120. doi: 10.1023/a:1025681223921. [DOI] [PubMed] [Google Scholar]
  • 12.Leemis ML. Mathematical statistics. La Vergne (TN): Lightning Source; 2020. [Google Scholar]
  • 13.Pritikin JN, Rappaport LM, Neale MC. Likelihood-based confidence intervals for a parameter with an upper or lower bound. Struct Equ Modeling. 2017;24:395–401. doi: 10.1080/10705511.2016.1275969. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Translational and Clinical Pharmacology are provided here courtesy of Korean Society for Clinical Pharmacology and Therapeutics

RESOURCES