Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2024 Oct 15;19(10):e0311850. doi: 10.1371/journal.pone.0311850

Simultaneous confidence interval construction for many-to-one of proportion ratios of bilateral correlated data

Zhaoqi Zhang 1, Chang-Xing Ma 1,*
Editor: Shovan Chowdhury2
PMCID: PMC11478910  PMID: 39405312

Abstract

In ophthalmology and otolaryngology, data collected from paired body parts are typically reformatted into categorical bilateral data structures for subsequent research. This article applies Donner’s equal correlation coefficient model and obtains nine simultaneous confidence intervals (SCI) of proportion ratios under three asymptotic statistical methods and three ways of multiplicity adjustment. The empirical coverage probability and mean interval width are evaluated through Monte Carlo simulations. A real example is used to demonstrate the proposed methods.

Introduction

In the fields of ophthalmology and otolaryngology, data collected from paired body parts are typically reformatted into categorical bilateral data structures for subsequent research. For instance, visual acuity and intraocular pressure are typically measured from both eyes of a patient. The outcome would be bilateral responses, unilateral responses or no response. In this situation, the outcomes from both eyes of each patient tend to be highly correlated. Failing to consider intraclass correlation during data analysis can result in inaccurate findings.

Multiple methodologies and approaches have been developed over the last few decades to address the issue of correlated data [1]. Rosner [2] proposed a constant R model, which assumes the probability of a response on one side of the body part, given a response on the opposite side, is proportional to the prevalence rate of the corresponding group in the research study. Tang et al. [3] examined the performance of various methods for assessing the equality of proportions, focusing on asymptotic and approximate unconditional approaches. Their results indicated that the approximate unconditional score test performs well in general scenarios. Additionally, Tang [4] and Xue [5, 6] developed multiple test statistics to assess proportion differences and proportion ratios, as well as to construct confidence intervals for these parameters. Moreover, Wang and Shan [7] developed twelve exact methods for constructing CIs for relative risk and odds ratio, which resulted in shorter interval lengths.

Dallal’s study [8] identified a limitation inherent in the constant R model, pointing out that it doesn’t fit well when a characteristic displays high variability across different groups and often occurs bilaterally. He addressed the issue by proposing that the model’s assumption of constant conditional probability is not proportional to the prevalence rate. Subsequently, Donner [9] proposed an equal correlation coefficient model, also known as the ρ model, predicated on the assumption of a common correlation coefficient among paired body ports within each group.

For the ρ model, numerous studies have been proposed. For instance, Ma and Liu [10] developed a common test to examine the equality of proportion among multiple groups, utilizing three statistical methods. They recommended the score test as the most reliable approach. Beyond statistical tests, CI approaches offer a more straightforward alternative, providing a range of values within which the true parameter value is likely to fall. Pei et al. [11] introduced five asymptotic CIs approaches for measuring the proportion differences between two groups, recommending the Wald-type CI with an assumption of dependence as the most robust option. Afterward, Li and Ma [12], Shen [13] and Zhuang [14] developed a common test for odds ratios; multiple CIs approaches to analyze odds ratios, proportion differences, and proportion ratio in a two-group scenario.

In random clinical trials, there is a trend to include multiple treatment groups alongside a control group. This design allows researchers to evaluate the collective effects of various treatments simultaneously or to evaluate a new therapy against several established alternatives. For instance, a multiple-dose study may be required to determine the bioavailability. In this context, the use of simultaneous confidence intervals (SCIs) provides a methodology for many-to-one comparisons. Yang [15] and Peng [16] introduce asymptotic SCIs for the proportion differences and odds ratio based on the constant R model. Later, Yang [17] developed asymptotic SCIs for the proportion difference based on the ρ model.

In addition to proportion differences and odds ratios, proportion ratios are also crucial for assessing the relative strength of associations between groups, especially when the corresponding proportions are small. To address the need for further research, this article extends the discussion to asymptotic SCIs for proportion ratios in settings with g (where g≥ 2) groups based on the ρ model.

The rest of the article is structured in the following manner. The Methods section introduces three methods for constructing SCIs (Wald-type SCI, profile likelihood SCI, and asymptotic score SCI) and the multiplicity adjustment methods. In the Simulation studies section, simulation experiments are conducted to evaluate the performance of the proposed methods, with comparisons made based on the empirical coverage probability and the mean interval width. The Real case example section uses a real data example to illustrate the methodology proposed in this article. Finally, the last section offers a discussion of the results.

Methods

Data structure

Suppose the object of this study is to evaluate the effectiveness and safety of multiple new treatments for eye disease against a standard treatment or placebo. Let mli be the number of patients in the ith group (i = 1,2,⋯, g) with l responses (l = 0,1,2), and mi be the total number of patients in the ith group, which is assumed to be fixed. Let Sl be the total number of patients with l responses. Let N be the total number of patients in the research study. The data structure is summarized in Table 1.

Table 1. Data structure for the correlated bilateral data.

Number of responses(l) Group Total
1 2 g
0 m 01 m 02 m 0g S 0
1 m 11 m 12 m 1g S 1
2 m 21 m 22 m 2g S 2
Total m 1 m 2 m q N

According to the constant correlation coefficient model proposed by Donner [9], the disease rates are assumed to be the same in the same group.

Let Zijk be the dummy variable of the response of the kth body part (eg. eye) (k = 1,2) of the jth individual in the ith group (i = 1,2,⋯, g). Let ρ be the common correlation coefficient.

Zijk={1,event(disease)occur.0,otherwise.
Pr(Zijk=1)=πi;i=1,,g;j=1,,mi;0πi1Corr(Zijk,Zij(3-k))=ρ,0ρ1

Each group of the data follows a multinomial distribution. The probability density function of observation frequencies mi = (m0i, m1i, m2i) is defined as follows,

f(m0i,m1i,m2i)=mi!m0i!m1i!m2i!p0im0ip1im1p2im2i

Let pli be the corresponding probability for an individual in the ith group has exactly l events (l = 0, 1, 2, i = 1, 2,⋯, g) happened,

{p0i=(1-πi)(ρπi-πi+1)p1i=2πi(1-ρ)(1-πi),p2i=πi2+ρπi(1-πi), (1)

and p0i + p1i + p2i = 1 for any fixed i.

Without loss of generality, let the 1th group be the control, denotes the ratio of proportions between any treatment groups and control group by δi = πi/π1(i = 2, …, g), and for all pairwise comparisons by δij = πi/πj (i ≠ j).

The corresponding log-likelihood function can be expressed as:

l1(π1,,πg;ρ)=i=1g[m2ilog(πi2-πiρ(πi-1))+m1ilog(2πi(πi-1)(ρ-1))+m0ilog(-(πi-1)(πiρ-πi+1))]+logC (2)

where C=i=1g(mi!m0i!m1i!m2i!) is a constant.

Substituting πi = π1 δi(i = 2, ⋯, g) into l1, the log-likelihood function can be written as

l2(δi,π1,,πg;ρ)=j=1,jig[m2jlog(πj2-πjρ(πj-1))+m1jlog(2πj(πj-1)(ρ-1))+m0jlog(-(πj-1)(πjρ-πj+1))]+m2ilog((π1δi)2-π1δiρ(π1δi-1))+m1ilog(2π1δi(π1δi-1)(ρ-1))+m0ilog(-(π1δi-1)(π1δiρ-π1δi+1)) (3)

where δi(i = 2, ⋯, g) is the parameter of interest, πj(ji) and ρ are nuisance parameters.

Multiplicity adjustment

For the construction of SCIs, without considering multiplicity adjustment, the type I error rate will increase, as well as false-positive errors. Based on the data structure mentioned previously, there are g groups in total, one control group and g-1 treatment groups. The main purpose of measuring the effectiveness of each treatment is to compare the proportion ratio between the treatment and the control group. One method used in this paper to control multiplicity adjustment is the Bonferroni correction, the quantile c = z1−α/2(g−1), where z denotes the standard normal distribution.

Another method used in this paper is the Sidak correction [18], which represents a modification of the Bonferroni correction. This correction involves the quantile c=z1(1α)1/g, where z denotes the standard normal distribution.

The other method used in this paper is based on Dunnett test. Piergorsch [19] proposed a general method for constructing SCIs for pairwise proportion differences. let oi = log πi for the = ith group, o2o1,…,ogo1 are simultaneously compared, the SCI of proportion ratio δi = πi/π1 is obtained by exponentiating the previous outcome. The critical value c=|z|g-1;Rα equals to 1-α/2 quantile of g − 1 variate normal distribution with mean equal to zero and correlation matrix R = {ρij}, ρij = ωi ωj, and

ωi=[1+m1miπi^(1-πi^)πi^(1-πi^)]-1/2

Wald-type interval

Ma and Liu [10] proposed a third-order polynomial and Fisher scoring method to derive the maximum likelihood estimator (MLE) of (π1, …, πg; ρ). After that, we can derive the MLE of proportion ratio (δ) through a simple linear transformation form log(πi) based on the invariant property of MLE. Let β = (log(π1), …, log(πg), log(ρ)) and the corresponding MLE of β is β^=(log(π1^),,log(πg^),log(ρ^)), then the MLE of log(δi) is log(δi^)=KiβT^, where

K(g-1)×(g+1)=[-110000-101000-100100-100010]

Ki means the ith row of the matrix K. The standard error of log(πi),i = 1,2,…,g, can be derived from πi^ using delta method.

Let γ = (π1, ⋯, πg, ρ), under regularity conditions, the asymptotic distribution of γ is given by

n(γ^-γ)dN(0,I-1)

where I is the Fisher information matrix of γ. See S1 Appendix for detail.

By delta method,

n(β^-β)dN(0,gI-1gT)

where g=Diag(1π1,,1πg,1ρ). The 100(1-α)% SCI for log(δi) is given by

Kiβ^T±cKi[gI-1^gT]KiT

The 100(1-α)% SCI for δi is given by

exp(Kiβ^T±cKi[gI-1^gT]KiT)

where c is the critical value. If Bonferroni method is used, c = z1−α/2(g−1), where z denotes the standard normal distribution. If Sidak method is used, α′ = 1 − (1 − α)(1/g), c = z1−α′/2, where z denotes the standard normal distribution. If Dunnett method is used, c = |z|g-1;Rα, where |z| denotes 1-α/2 quantile of g − 1 variate normal distribution described in the previous section.

Profile likelihood confidence interval

The asymptotic profile likelihood SCI for each proportion ratio (δi, i = 2, …, g), can be constructed by inverting the likelihood ratio test of hypothesis H0 : δi = δ0 vs. Hα : δiδ0, i = 2, …, g. To simplify the explanation, we first set i = 2, when constructing the SCI, we start by calculating the CI for the ratio (δ2 = π2/π1) between the 2nd group and the control group.

Let (δ˜2,π˜1,π˜3,...,π˜g,ρ^) denotes the constrained MLEs of (δ2, π1, π3, …, πg, ρ) under the null hypothesis, and (δ^2,π^1,π^3,,π^g,ρ^) denotes the unconstrained MLEs under the alternative hypothesis. The likelihood ratio test statistic is given by

TL=2[l2(δ^2,π^1,π^3,,π^g,ρ^)-l2(δ˜2,π˜1,π˜3,,π˜g,ρ˜)].

By solving the equations:

l2πiδ2=δ0=0,(i=1,3,,g)l2ρδ2=δ0=0

(π˜1,π˜3,,π˜g,ρ˜) can be computed given the constraints that δ2 = δ0. Since there is no closed-form solution, we adopt the Fisher-scoring method [14].

[π1(t+1)π3(t+1)πg(t+1)ρ(t+1)]=[π1(t)π3(t)πg(t)ρ(t)]+I-1(π1(t),π3(t),,πg(t),ρ(t))[l2π1l2π3l2πgl2ρ]|(πi=πi(t),ρ=ρ(t))

where I(π1(t),π3(t),,πg(t),ρ(t)) is the g × g Fisher information matrix estimated under the condition of (π1, π3, …, πg, ρ) = (π1(t),π3(t),...,πg(t),ρ(t)). See S1 Appendix for detail. The test statistic follows a chi-square distribution with one degree of freedom, the 100(1 − α)% profile likelihood SCI for proportion ratio(δi) satisfies

2(l2(δ^2,π^1,π^3,,π^g,ρ^)-l2(δ0,π˜1,π˜3,,π˜g,ρ˜))χ1-α/2(g-1)2,

where χ1-α/(g-1)2 is the 1 − α/2(g − 1) quantile of the chi-square distribution with one degree of freedom with Bonferroni multiplicity adjustment, since χ1-α/(g-1)2=(z1-α/2(g-1))2. Similarly, Sidak method can be applied bu substituting the critical value χ1-α/(g-1)2 with χ1−α, where α′ = 1 − (1 − α)(1/g). Dunnet’s multiplicity adjustment method can be applied by substituting the critical value z1−α/2(g−1) with |z|g-1;Rα.

To compute the confidence interval and establish the lower and upper bounds, the following algorithm could be utilized to identify two roots:

To obtain the larger root, which is the upper bound of the CI.

  1. Initiate parameters: Calculate the unconstrained MLEs (δ^2,π^1,π^3,,π^g,ρ^) as the initial value (δ2(0),π1(0),π3(0),,πg(0),ρ(0))). Set initial value flag = 1, and stepsize = 0.1.

  2. Update estimates: Updating δ2^(1)=δ2(0)+flag×stepsize. And compute the constrained MLEs (π1˜(1),π3˜(1),,πg˜(1),ρ˜(1)) with δ2=δ2^(1).

  3. Evaluate the test statistics:

    If 2×flag×[l2(δ^2,π^1,π^3,,π^g,ρ^)-l2(δ^2(1),π^1(1),π^3(1),,π^g(1),ρ^(1))]<flag×χ1-α/(g-1)2, return to step 2. Updating δ2^(t+1)=δ2^(t)+flag×stepsize. Otherwise decrease the stepsize to 0.1 × stepsize and set flag = −flag.

  4. Check for Convergence: If the stepsize is small enough (eg. 10−5), indicating convergence, return δ^2(t+1) as the upper bound of CI and stop iterating.

To obtain the smaller root, which is the lower bound of the CI, repeat steps 1–4, with the initial value flag = -1.

Asymptotic score confidence interval

Under hypothesis H0 : δi = δ0 vs. Hα : δiδ0, i = 2, …, g, the asymptotic score test statistic can be derived as

TS2=UI-1UT|H0.

To simplify the explanation, we continue use the setting i = 2, where the score vector:

U=(l2δ2,l2π1,l2π3,,l2πg,l2ρ),

and I is the Fisher information matrix for (δ2, π1, π3, …, πg, ρ)T. Since δ2 is the parameter of interest, and πi, ρ are nuisance parameters, the score test statistic can be rewritten as:

TS2=(l2δ2)2I-1(1,1)|δi=δ0,

I(1, 1) denotes the (1, 1)th element of I. See S1 Appendix for detail. TS is asymptotically distributed as a chi-square distribution with one degree of freedom. The 100(1-α)% SCI for proportion ratio (δi) satisfies

TS2=(l2δ2)2I-1(1,1)|δi=δ0χ1-α/2(g-1)2.

In a similar manner, the CI of the ratio needs to be determined using the iterative method described in the previous section. For each iteration, score test statistic is updated with a new Fisher information matrix, I=I(δ2^(t),π1^(t),π3^(t),,πg^(t),ρ^(t)). To address multiplicity, we use Bonferroni and Dunnett methods, as outlined in the previous section.

Simulation studies

The performance of the proposed methods for constructing confidence intervals is evaluated through Monte Carlo simulation studies, utilizing empirical coverage probability (ECP) and mean interval width (MIW) as evaluation metrics. Both balanced (all mi’s are equal) and unbalanced (mi’s are different) designs are considered. The study’s parameter configurations are presented in Table 2.

Table 2. Parameter configuration setting for a simulation study.

group ρ π1, π2, ⋯, πg m1, m2, ⋯, mg
g = 3 0.3,0.5,0.7 a. (0.4,0.4,0.4) I. (50,50,50)
b. (0.25,0.375,0.5) II. (30,50,80)
III. (30,100,500)
g = 4 a. (0.4,0.4,0.4,0.4); I. (50,50,50,50)
b. (0.25,0.375,0.425,0.5) II. (30,50,80,100)
III. (30,50,100,500)
g = 5 a. (0.4,0.4,0.4,0.4,0.4) I. (50,50,50,50,50)
b. (0.25,0.375,0.425,0.5,0.54) II. (30,45,60,75,90)
III. (30,50,100,200,500)

The study generate 10,000 replications for each configuration setting and construct 95% confidence intervals. All tests are conducted at a 5% significance level. The ECP is the proportion of sample replicates generated under the null hypothesis (H0) where the true value of the ratio (δ) is contained within the constructed CI. The MIW is the average of all widths of the SCIs across all replicates. A CI method is considered conservative when the ECP is significantly greater than the pre-specified nominal level of 1-α, liberal when the ECP is significantly less than 1-α, and recommended when the ECP is approximately at 1-α.

Tables 35 provide the ECPs and MIWs for group g = 3, 4, and 5, respectively. The ECP of the Score-Dunnett method closely aligns with the pre-specified nominal level (0.95) across all configurations. Although the Wald-Sidak method demonstrates a competitive advantage across all configurations, the Sidak adjustment method does not perform well when combined with the profile likelihood and the score method. The Wald-Dunnett, Profile-Bonferroni, Profile-Dunnett, and Score-Bonferroni methods have ECPs approximately equal to 0.95 in most configurations. The Wald-Bonferroni method is conservative since its ECP is greater than 0.95 across most configurations. Similarly, the Profile-Sidak method and Score-Sidak method also show conservative behavior, with their ECPs consistently above 0.95 in various settings. The Profile-Dunnett method reveals a tendency toward liberal behavior in some configurations. Additionally, in balanced cases, ECP is typically closer to the nominal level, with a shorter MIW. In general, the Dunnet method achieves better performance than the Bonferroni method in multiplicity adjustment. Therefore, SCI produced from the Score method with Dunnet multiplicity adjustment is strongly recommended.

Table 3. The empirical coverage probability (ECP) and the mean interval width (MIW) of 95% CI for proportion ratio (g = 3).

ρ,π,m Wald-Bonferroni Wald-Sidak Wald-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9569 0.9418 0.9419 0.8882 0.9524 0.9281
II 0.9583 1.0908 0.9453 1.0276 0.9525 1.0592
b I 0.9628 2.0614 0.9477 1.9439 0.9576 2.0119
II 0.9618 2.4924 0.9513 2.3563 0.9548 2.3979
III 0.9667 2.3562 0.9538 2.2607 0.9538 2.2185
0.5 a I 0.9570 1.0242 0.9416 0.9653 0.9535 1.0090
II 0.9595 1.1948 0.9472 1.1256 0.9531 1.1603
b I 0.9599 2.2388 0.9451 2.1103 0.9533 2.1840
II 0.9632 2.6767 0.9520 2.5401 0.9552 2.5843
III 0.9651 2.5449 0.9549 2.4221 0.9520 2.4009
0.7 a I 0.9605 1.1034 0.9446 1.0394 0.9569 1.0867
II 0.9612 1.2941 0.9469 1.2182 0.9552 1.2558
b I 0.9617 2.4003 0.9468 2.2706 0.9549 2.3487
II 0.9654 2.8283 0.9541 2.6911 0.9581 2.7390
III 0.9664 2.6919 0.9549 2.5712 0.9533 2.5323
ρ,π,m Profile-Bonferroni Profile-Sidak Profile-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9499 0.9608 0.9641 1.0312 0.9465 0.9462
II 0.9516 1.1539 0.9681 1.2474 0.9447 1.1169
b I 0.9552 2.1503 0.9694 2.3126 0.9493 2.0944
II 0.9595 2.6749 0.9715 2.8638 0.9485 2.5696
III 0.9621 2.5320 0.9698 2.7370 0.9492 2.3723
0.5 a I 0.9488 1.0466 0.9630 1.1246 0.9445 1.0304
II 0.9497 1.2723 0.9656 1.3768 0.9423 1.2297
b I 0.9510 2.3327 0.9659 2.5038 0.9439 2.2735
II 0.9570 2.8481 0.9697 3.0402 0.9468 2.7478
III 0.9570 2.7368 0.9677 2.9060 0.9410 2.5651
0.7 a I 0.9500 1.1291 0.9650 1.2140 0.9460 1.1110
II 0.9518 1.3884 0.9652 1.5054 0.9419 1.3400
b I 0.9510 2.5059 0.9650 2.6879 0.9448 2.4445
II 0.9540 2.9896 0.9682 3.1835 0.9431 2.8884
III 0.9491 2.8411 0.9601 3.0821 0.9361 2.6989
ρ,π,m Score-Bonferroni Score-Sidak Score-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9517 0.9456 0.9660 1.0125 0.9486 0.9317
II 0.9560 1.1183 0.9711 1.2023 0.9503 1.0846
b I 0.9583 2.0711 0.9708 2.2193 0.9516 2.0217
II 0.9618 2.5332 0.9730 2.7063 0.9536 2.4393
III 0.9692 2.4075 0.9773 2.5826 0.9540 2.2554
0.5 a I 0.9504 1.0254 0.9642 1.0983 0.9470 1.0101
II 0.9549 1.2215 0.9694 1.3126 0.9486 1.1840
b I 0.9513 2.2410 0.9687 2.3938 0.9460 2.1861
II 0.9609 2.7047 0.9724 2.8749 0.9528 2.6115
III 0.9644 2.5568 0.9790 2.7477 0.9508 2.4270
0.7 a I 0.9518 1.0993 0.9656 1.1778 0.9486 1.0827
II 0.9569 1.3185 0.9697 1.4178 0.9491 1.2777
b I 0.9508 2.3924 0.9655 2.5539 0.9448 2.3407
II 0.9585 2.8388 0.9706 3.0014 0.9475 2.7512
III 0.9644 2.7373 0.9787 2.9108 0.9504 2.5607

a: π = (0.4,0.4,0.4); b: π = (0.25,0.375,0.5).

I: m = (50,50,50); II: m = (30,50,80); III: m = (30,100,500).

Table 5. The empirical coverage probability (ECP) and the mean interval width (MIW) of 95% CI for proportion ratio (g = 5).

ρ,π,m Wald-Bonferroni Wald-Sidak Wald-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9632 1.0600 0.9388 0.9777 0.9546 1.0331
II 0.9663 1.2296 0.9510 1.1370 0.9545 1.1639
b I 0.9699 2.4150 0.9532 2.2238 0.9597 2.3024
II 0.9724 2.9981 0.9546 2.7789 0.9558 2.7808
III 0.9743 2.9189 0.9628 2.6869 0.9558 2.6206
0.5 a I 0.9656 1.1536 0.9442 1.0589 0.9588 1.1238
II 0.9666 1.3493 0.9478 1.2449 0.9543 1.2758
b I 0.9727 2.6408 0.9549 2.4382 0.9624 2.5194
II 0.9738 3.2094 0.9561 2.9882 0.9576 2.9969
III 0.9723 3.1119 0.9592 2.8945 0.9535 2.8277
0.7 a I 0.9676 1.2367 0.9409 1.1516 0.9606 1.2043
II 0.9667 1.4642 0.9445 1.3480 0.9556 1.3821
b I 0.9714 2.8503 0.9460 2.6266 0.9603 2.7193
II 0.9686 3.4006 0.9556 3.1580 0.9553 3.1733
III 0.974 3.3101 0.9619 3.0778 0.9562 3.0001
ρ,π,m Profile-Bonferroni Profile-Sidak Profile-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9526 1.0882 0.9608 1.1225 0.9448 1.0591
II 0.9528 1.3236 0.9644 1.3774 0.9402 1.2426
b I 0.9618 2.5504 0.9698 2.6284 0.9501 2.4237
II 0.9634 3.2611 0.9698 3.3636 0.9470 3.0198
III 0.9768 3.1854 0.9771 3.2860 0.9506 2.8588
0.5 a I 0.9546 1.1875 0.9647 1.2207 0.9450 1.1550
II 0.9508 1.4679 0.9599 1.5284 0.9345 1.3743
b I 0.9635 2.7837 0.9712 2.8910 0.9517 2.6500
II 0.9664 3.4799 0.9711 3.5792 0.9484 3.2241
III 0.9711 3.4271 0.9762 3.5346 0.9467 3.0511
0.7 a I 0.9530 1.2770 0.9607 1.3352 0.9436 1.2413
II 0.9520 1.6120 0.9587 1.6767 0.9330 1.5042
b I 0.9586 3.0100 0.9620 3.0943 0.9467 2.8648
II 0.9581 3.6429 0.9645 3.7517 0.9396 3.4234
III 0.9637 3.5429 0.9712 3.6957 0.9432 3.2936
ρ,π,m Score-Bonferroni Score-Sidak Score-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9542 1.0689 0.9638 1.1010 0.9467 1.0415
II 0.9608 1.2731 0.9726 1.3199 0.9508 1.2020
b I 0.9634 2.4310 0.9707 2.5053 0.9523 2.3167
II 0.9677 3.0540 0.9721 3.1466 0.9516 2.8265
III 0.9773 2.9760 0.9807 3.0692 0.9539 2.6676
0.5 a I 0.9566 1.1612 0.9671 1.1918 0.9485 1.1310
II 0.9619 1.3988 0.9682 1.4478 0.9488 1.3193
b I 0.9643 2.6418 0.9708 2.7265 0.9538 2.5198
II 0.9694 3.2419 0.9741 3.3346 0.9528 3.0321
III 0.9746 3.1572 0.9794 3.2316 0.9498 2.8805
0.7 a I 0.9545 1.2413 0.9616 1.2944 0.9462 1.2087
II 0.9588 1.5114 0.9652 1.5663 0.9464 1.4246
b I 0.9559 2.8322 0.9597 2.9083 0.9455 2.7023
II 0.9570 3.4054 0.9617 3.5069 0.9413 3.1633
III 0.9684 3.3396 0.9754 3.4821 0.9467 2.9993

a: π = (0.4,0.4,0.4,0.4,0.4); b: π = (0.25,0.375,0.425,0.5,0.54).

I: m = (50,50,50,50,50); II: m = (30,45,60,75,90); III: m = (30,50,100,200,500).

Table 4. The empirical coverage probability (ECP) and the mean interval width (MIW) of 95% CI for proportion ratio (g = 4).

ρ,π,m Wald-Bonferroni Wald-Sidak Wald-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9632 1.0088 0.9417 0.9368 0.9571 0.9875
II 0.9659 1.1546 0.9476 1.0707 0.9555 1.0992
b I 0.9654 2.2043 0.9484 2.0467 0.9571 2.1262
II 0.9724 2.6947 0.9594 2.5069 0.9623 2.5233
III 0.9712 2.6441 0.9535 2.4566 0.9576 2.4446
0.5 a I 0.9639 1.0981 0.9441 1.0189 0.9589 1.0745
II 0.9648 1.2620 0.9503 1.1691 0.9556 1.2001
b I 0.9679 2.4001 0.9486 2.2298 0.9598 2.3169
II 0.9730 2.8915 0.9583 2.7005 0.9607 2.7212
III 0.9683 2.8376 0.9557 2.6435 0.9524 2.6327
0.7 a I 0.9689 1.1838 0.9477 1.0975 0.9623 1.1578
II 0.9673 1.3687 0.9526 1.2670 0.9588 1.3004
b I 0.9702 2.5835 0.9506 2.4002 0.9617 2.4916
II 0.9713 3.0647 0.9591 2.8643 0.9606 2.8860
III 0.9693 3.0029 0.9575 2.7802 0.9540 2.7847
ρ,π,m Profile-Bonferroni Profile-Sidak Profile-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9551 1.0328 0.9646 1.0803 0.9496 1.0100
II 0.9539 1.2366 0.9606 1.2997 0.9387 1.1600
b I 0.9575 2.3173 0.9674 2.4291 0.9487 2.2249
II 0.9657 2.9145 0.9729 3.0552 0.9508 2.7300
III 0.9631 2.8617 0.9671 2.9897 0.9468 2.6504
0.5 a I 0.9543 1.1270 0.9641 1.1798 0.9491 1.1015
II 0.9482 1.3615 0.9600 1.4340 0.9336 1.2834
b I 0.9595 2.5194 0.9688 2.6395 0.9498 2.4234
II 0.9639 3.1285 0.9720 3.2662 0.9514 2.9301
III 0.9547 3.0884 0.9625 3.2011 0.9383 2.8308
0.7 a I 0.9569 1.2189 0.9669 1.2769 0.9506 1.1905
II 0.9488 1.4926 0.9628 1.5742 0.9369 1.4047
b I 0.9587 2.7223 0.9682 2.8468 0.9478 2.6213
II 0.9609 3.2946 0.9671 3.4351 0.9422 3.1173
III 0.9467 3.2144 0.9538 3.3409 0.9310 3.0552
ρ,π,m Score-Bonferroni Score-Sidak Score-Dunnet
ECP MIW ECP MIW ECP MIW
0.3 a I 0.9568 1.0155 0.9667 1.0604 0.9514 0.9938
II 0.9610 1.1928 0.9698 1.2490 0.9497 1.1329
b I 0.9588 2.2174 0.9692 2.3175 0.9500 2.1387
II 0.9714 2.7438 0.9788 2.8645 0.9572 2.5690
III 0.9705 2.6988 0.9758 2.8097 0.9541 2.4823
0.5 a I 0.9564 1.1033 0.9661 1.1525 0.9506 1.0795
II 0.9634 1.3038 0.9711 1.3658 0.9515 1.2371
b I 0.9608 2.4036 0.9702 2.5079 0.9514 2.3198
II 0.9712 2.9329 0.9771 3.0464 0.9586 2.7590
III 0.9686 2.8768 0.9764 2.9656 0.9503 2.6843
0.7 a I 0.9578 1.1863 0.9687 1.2394 0.9533 1.1603
II 0.9633 1.4088 0.9715 1.4763 0.9530 1.3373
b I 0.9594 2.5762 0.9694 2.6903 0.9498 2.4867
II 0.9676 3.0907 0.9743 3.2139 0.9518 2.9080
III 0.9660 3.0436 0.9756 3.1695 0.9492 2.8051

a: π = (0.4,0.4,0.4,0.4); b: π = (0.25,0.375,0.425,0.5).

I: m = (50,50,50,50); II: m = (30,50,80,100); III: m = (30,50,100,500).

An extensive simulation study is conducted for group g = 3, 4, 5 with balanced designs having sample size mi = 20, 40, 80, 500. In this part of the study, 1000 sets of π=(π1,π2,,πg) and ρ are randomly generated from the uniform distribution U (0,1) subject to the condition that each corresponding response probability adhered to the formula (1). The values of πi are sorted in increasing order, ensuring that the ratios between each ith group (i = 2, ⋯, g) and the 1st group are consistently greater than 1. This arrangement facilitates a straightforward comparison of the MIW.

For each configuration setting, 10,000 replications are generated, and 95% confidence intervals are constructed. The ECP and MIW are calculated for each method to assess their performance. Additionally, boxplots are created to enable visual comparisons among different methods.

Figs 1 and 2 illustrate the overall distribution of ECP and MIW for all SCIs methods. Among all the proposed methods, the Score method with Dunnett multiplicity adjustment performed best, achieving an average ECP closest to the pre-specified nominal level and the shortest MIW. As shown in Fig 1, as the sample size increases, the ECP of the Profile and Score methods does not show any obvious pattern, while the Wald method approaches the nominal level. The Wald method, when paired with Sidak adjustment method, yields results closer to the pre-specified nominal level (0.95), indicating greater reliability compared to the other two adjustment methods. However, the Sidak correction does not yield an improvement when combined with the profile likelihood and score methods. Meanwhile, Fig 2 demonstrates that the MIW becomes shorter as the sample size increases since asymptotic methods tend to perform better with larger sample sizes. The MIW exhibits a slight increase as the number of group increases, which is an expected trend. This happens because the MIW is calculated as the average of all SCIs. Given that the πi are sorted in ascending order and the ratios between group (i = 2, ⋯, g) and the 1st group are consistently greater than one and increasing, the addition of more intervals as the number of groups grows leads to a slightly larger MIW.

Fig 1. Boxplots of Empirical Coverage Probabilities (ECP).

Fig 1

Fig 2. Boxplots of Mean Intercal Width (MIW).

Fig 2

Additionally, the MIW does not exhibit an obvious difference when comparing the Bonferroni and Dunnett multiplicity adjustments. However, the ECP showes a distinct difference. As expected, the Bonferroni method is more conservative, while the Dunnett method, although computationally more demanding, is justified for its benefits. Overall, score SCI with Dunnett multiplicity adjustment is highly recommeded.

Real case example

The dataset used for this analysis is sourced from Rosner [2]. It includes information on 218 patients aged 20–39, who were diagnosed with retinitis pigmentosa (RP) and were seen at the Massachusetts Eye and Ear Infirmary from 1970 to 1979. The patients were divided into four groups based on their genetic type: autosomal dominant RP (DOM), autosomal recessive RP (AR), sex-linked RP (SL), and isolated RP (ISO). To simplify the analysis, each patient was associated with a unique family and then randomly selected for the study. The Snellen visual acuity (VA) of an eye was considered affected if it was 20/50 or worse, and normal if it was 20/40 or better. For this analysis, a subgroup of 216 individuals was selected from a total of 218, all of whom had complete VA information for both eyes. Detail information is presented in Table 6.

Table 6. Number of affected eyes per person in each group.

Number of affected eyes Genetic type
ISO DOM AR SL
0 67 15 7 3
1 24 6 5 2
2 57 7 9 14

According to Liu and Ma [20] and Tang et al. [21], following a goodness-of-fit test, the equal correlation coefficient model (ρ model) is found appropriate for analyzing this dataset. In the ρ model, the MLE πi^ values are similar to the sample proportion, indicating a strong fit to the model. The estimated values for the parameters are: ρ^=0.6416, π^ISO=0.4658, π^DOM=0.3625, π^AR=0.5455, and π^SL=0.7926. When calculating the SCI, ISO is considered the control group based on evidence suggesting that RP may occur as an isolated sporadic disorder, without genetic links [22]. The 95% SCI between DOM, AR, SL, and ISO are presented in Table 7.

Table 7. 95% SCI of the proportion ratio of affected rates among these groups.

Methods CIs
DOM/ISO AR/ISO SL/ISO
WaldBonf (0.4380, 1.3825) (0.7302, 1.8782) (1.2406, 2.3339)
WaldSidak (0.4552, 1.3304) (0.7536, 1.8199) (1.2671, 2.2851)
WaldDun (0.4389, 1.3798) (0.7314, 1.8752) (1.2420, 2.3313)
ProfileBonf (0.4016, 1.2719) (0.6696, 1.7442) (1.1507, 2.2621)
ProfileSidak (0.3892, 1.2941) (0.6516, 1.7690) (1.1281, 2.2870)
ProfileDun (0.4027, 1.2700) (0.6711, 1.7421) (1.1527, 2.2600)
ScoreBonf (0.4242, 1.2665) (0.6880, 1.7218) (1.1257, 2.2434)
ScoreSidak (0.4134, 1.2882) (0.6715, 1.7450) (1.1021, 2.2667)
ScoreDun (0.4252, 1.2647) (0.6894, 1.7199) (1.1277, 2.2414)

Confidence intervals are relatively straightforward to interpret and apply in statistical analysis. According to prior simulation results, the Score method with Dunnett adjustment has been identified as the most effective approach. Additionally, the Wald method with Sidak correction also demonstrates a competitive advantage, particularly in terms of computational efficiency. In simple terms, we examine whether the confidence interval includes the value 1, given that the statistical measure of interest in this context is a ratio. If the confidence interval does not contain 1, it indicates a statistically significant difference. On the other hand, if 1 is included within the confidence interval, it suggests that there is no statistically significant difference. The results show that the CI between DOM and ISO, AR, and ISO contained 1, indicating no significant difference. However, the affected rate in the SL group is significantly greater than in ISO, as the lower bounds of the CIs are greater than 1.

Discussions

In this study, nine asymptotic SCIs are derived for the ratio of proportions, with these methods being better suited for large sample sizes. To ensure robustness for smaller sample sizes, exact methods are planned for future research. To control the Type I error rate, this study also compared different multiplicity adjustment methods, specifically focusing on the Bonferroni, Sidak and Dunnett methods, with the latter demonstrating better performance. The Bonferroni method has been criticized for its conservative nature, leading to consideration of alternative approaches, such as the Holm method and modified Bonferroni method [23]. However, since these multiplicity adjustment methods are not well-suited to our current computational algorithm, the development of new algorithms may be necessary for future research.

As the group size increases, combining the Dunnett method with an iteration method can become quite time-consuming. However, with advancements in computing technology, a broader range of methods can be explored. Future research could consider a wider range of multiplicity adjustment methods to find the most suitable SCIs.

Furthermore, the asymptotic SCIs proposed in this article are suitable for bilateral data. A potential further research could involve combining unilateral and bilateral data to develop more appropriate SCIs.

Supporting information

S1 Appendix

(PDF)

pone.0311850.s001.pdf (92.2KB, pdf)

Data Availability

All relevant data are within the manuscript.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1. A AR. Statistical guidelines for the analysis of data obtained from one or both eyes. Ophthalmic and physiological optics: the journal of the British College of Ophthalmic Opticians (Optometrists). 2013;33(1). [DOI] [PubMed] [Google Scholar]
  • 2. Rosner B. Statistical methods in ophthalmology: an adjustment for the intraclass correlation between eyes. Biometrics. 1982;38. doi: 10.2307/2530293 [DOI] [PubMed] [Google Scholar]
  • 3. Tang NS, Tang ML, Qiu SF. Testing the equality of proportions for correlated otolaryngologic data. Computational Statistics and Data Analysis. 2008;52. [Google Scholar]
  • 4. Tang NS, Qiu SF. Homogeneity test, sample size determination and interval construction of difference of two proportions in stratified bilateral-sample designs. Journal of Statistical Planning and Inference. 2012;142(5). [Google Scholar]
  • 5.Xue Y. Likelihood-based inferential methods of effect measure parameters for correlated binary data in stratified bilateral-sample design [Doctoral Dissertation]. University at Buffalo; 2019.
  • 6. Xue Y, Ma CX. Interval estimation of proportion ratios for stratified bilateral correlated binary data. Statistical Methods in Medical Research. 2020;29(7). doi: 10.1177/0962280219882043 [DOI] [PubMed] [Google Scholar]
  • 7. Weizhen Wang GS. Exact Confidence Intervals for the Relative Risk and the Odds Ratio. Biometrics. 2015;71(4):985–995. doi: 10.1111/biom.12360 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Dallal GE. Paired Bernoulli Trials. Biometrics. 1988;44. doi: 10.2307/2531913 [DOI] [PubMed] [Google Scholar]
  • 9. Donner A. Statistical methods in ophthalmology: an adjusted chi-square approach. Biometrics. 1989;45. doi: 10.2307/2531501 [DOI] [PubMed] [Google Scholar]
  • 10. Ma CX, Liu S. Testing equality of proportions for correlated binary data in ophthalmologic studies. Journal of Biopharmaceutical Statistics. 2017;27(4). doi: 10.1080/10543406.2016.1167072 [DOI] [PubMed] [Google Scholar]
  • 11. Pei Yb, Tang ML, Wong WK, Guo JH. Confidence intervals for correlated proportion differences from paired data in a two-arm randomised clinical trial. Statistical Methods in Medical Research. 2012;21(2). [DOI] [PubMed] [Google Scholar]
  • 12. Li Z, Ma C. Statistical Inference for Odds Ratio of Two Proportions in Bilateral Correlated Data. Axioms. 2022;11(10). doi: 10.3390/axioms11100502 [DOI] [Google Scholar]
  • 13. Shen X, Ma CX, Yuen KC, Tian GL. Common risk difference test and interval estimation of risk difference for stratified bilateral correlated data. Statistical Methods in Medical Research. 2019;28(8). doi: 10.1177/0962280218781988 [DOI] [PubMed] [Google Scholar]
  • 14. Zhuang T, Tian GL, Ma CX. Confidence intervals for proportion ratios of stratified correlated bilateral data. Journal of Biopharmaceutical Statistics. 2019;29(1). doi: 10.1080/10543406.2018.1489405 [DOI] [PubMed] [Google Scholar]
  • 15. Yang ZY, Tian GL, Liu X, Ma CX. Simultaneous Confidence Interval Construction for Many-to-One Comparisons of Proportion Differences Based on Correlated Paired Data. Journal of Applied Statistics. 2020. doi: 10.1080/02664763.2020.1795815 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Peng X. Confidence Interval Methods of Association Parameters for Correlated Bilateral Data [Doctoral Dissertation]. University at Buffalo; 2019.
  • 17.Yang Z. Confidence Interval Construction for the Difference of Proportions Based on Correlated Bilateral Data [Doctoral Dissertation]. University at Buffalo; 2017.
  • 18. Šidák Z. Rectangular Confidence Regions for the Means of Multivariate Normal Distributions. Journal of the American Statistical Association. 1967;62(318):626–633. doi: 10.2307/2283989 [DOI] [Google Scholar]
  • 19. Piegorsch WW. Multiple Comparisons for Analyzing Dichotomous Response. International Biometric Society. 1991. doi: 10.2307/2532494 [DOI] [PubMed] [Google Scholar]
  • 20. Liu X, Ma CX. Goodness-of-fit tests for correlated data from multiple groups. Springer; 2020. [Google Scholar]
  • 21. Tang ML, Pei YB, Wong WK, Li JL. Goodness-of-fit tests for correlated paired binary data. Statistical Methods in Medical Research. 2012;21(4). doi: 10.1177/0962280210381176 [DOI] [PubMed] [Google Scholar]
  • 22. Hirji SH. In: Clinical Evaluation of Patients with Retinitis Pigmentosa. New York, NY: Springer US; 2023. p. 31–39. [DOI] [PubMed] [Google Scholar]
  • 23. Simes RJ. An Improved Bonferroni Procedure for Multiple Tests of Significance. Biometrika. 1986;73(3):751–754. doi: 10.1093/biomet/73.3.751 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Appendix

(PDF)

pone.0311850.s001.pdf (92.2KB, pdf)

Data Availability Statement

All relevant data are within the manuscript.


Articles from PLOS ONE are provided here courtesy of PLOS

RESOURCES