Confidence intervals for assessing equivalence of two treatments with combined unilateral and bilateral data

Shi-Fang Qiu; Ji-Ran Tao

doi:10.1080/02664763.2021.1949440

. 2021 Jul 7;49(13):3414–3435. doi: 10.1080/02664763.2021.1949440

Confidence intervals for assessing equivalence of two treatments with combined unilateral and bilateral data

Shi-Fang Qiu ^a,^CONTACT, Ji-Ran Tao ^b

PMCID: PMC9543133 PMID: 36213773

Abstract

Responses from the paired organs are generally highly correlated in bilateral studies, statistical procedures ignoring the correlation could lead to incorrect results. Note the intraclass correlation in the study of combined unilateral and bilateral outcomes; 11 confidence intervals (CIs) including 7 asymptotic CIs and 4 Bootstrap-resampling CIs for assessing the equivalence of 2 treatments are derived under Rosner's correlated binary data model. Performance is evaluated with respect to the empirical coverage probability (ECP), the empirical coverage width (ECW) and the ratio of the mesial non-coverage probability to the non-coverage probability (RMNCP) via simulation studies. Simulation results show that (i) all CIs except for the Wald CI and the bias-corrected Bootstrap percentile CI generally produce satisfactory ECPs and hence are recommended; (ii) all CIs except for the bias-corrected Bootstrap percentile CI provide preferred RMNCPs and are more symmetrical; (iii) as the measurement of the dependence increases, the ECWs of all CIs except for the score CI and the profile likelihood CI show increasing patterns that look like linear, while there is no obvious pattern on the ECPs of all CIs except for the profile likelihood CI. A data set from an otolaryngologic study is used to illustrate the proposed methods.

Keywords: Bootstrap-resampling method, combined unilateral and bilateral data, confidence interval, intra-class correlation, proportion difference

1. Introduction

In randomized clinical trials, it is frequently that we may collect data from the paired organs or body parts, and the outcomes may be either bilateral (e.g. two organs are sick) or unilateral response (only one organ or a body part is sick). Consequently, it causes the data generally highly correlated in bilateral studies, and statistical procedures ignoring the intraclass correlation of the bilateral observations may lead to incorrect inference [4,7,9,20,21]. Taking the intraclass correlation into consideration, Rosner [20] presented an intraclass correlation model for analyzing ophthalmologic data to which a person may have contributed two eyes worth of information. Le [12] considered the testing for linear trends in proportions using correlated otolaryngology or ophthalmology data. Note that asymptotic test procedures could also yield unacceptably high type I error rates for small sample studies and sparse data structures even if the intraclass correlation is taken into consideration, Tang et al. [26] proposed the exact unconditional and approximate unconditional procedures based on three test statistics (i.e. the Wald statistic, the statistics based on a dependence model and an independence model proposed by Rosner and Milton [21]). Followed by Tang et al. [26], eight test statistics for testing the equality of two treatments and the corresponding asymptotical and approximate unconditional test procedures in the bilateral study are developed by Tang et al. [25]. Tang et al. [23] investigated the goodness-of-fit and the model selection for a few popular statistical models for correlated paired binary outcomes. A variety of confidence intervals for estimating the difference between the proportions of responders in a randomized two-armed clinical trial are proposed in Pei et al. [18].

However, individuals may produce either unilateral data (e.g. data from only one organ) or bilateral data (e.g. data from two organs) in many medical comparative studies (e.g. otolaryngologic or ophthalmologic studies). For example, to evaluate the efficacy of two antibiotics (i.e. Cefaclor and Amoxicillin) for the treatment of otitis media with effusion (OME) in an otolaryngologic study, Mandel et al. [15] considered a randomized double-blinded clinical trial. In this trial, a total of 214 children (293 ears) underwent unilateral or bilateral tympanocentesis before they were randomly assigned to one of the two treatments. After a 14-day course of treatment with one of the antibiotics, the outcome of each child was recorded at the end of the treatment. In this study, only 203 evaluable children without repeat tympanocentesis, treatment change or tympanic membrane perforations have received one of the treatments. For the group with unilateral disease, two results were determined: cured and not-cured; and for the group with bilateral disease, three results, i.e. cured (both ears become OME-cured), partially cured (only one ear becomes OME-cured) and both ears are not cured were recorded. The data are reported in Table 1.

Table 1.

OME status after 14 days and 42 days of treatment (in terms of No. of children).

	Amoxicillin			Cefaclor
No. of OME-free ears	0	1	2	0	1	2
Unilateral	39	27	–	24	38	–
Bilateral	15	3	13	14	9	21

Open in a new tab

For this study, it is important to test whether the cure rates are identical between the Cefaclor and Amoxicillin groups. Under the equal correlation coefficient model, Pei et al.[17] considered the equivalence testing of two successful cure rates and developed several asymptotic test procedures under the independent and the dependent models, respectively. However, CI estimators for comparative studies with combined unilateral and bilateral binary data have not been developed. In this article, we consider the CI construction for proportion difference in comparative medical studies with combined unilateral and bilateral binary outcomes under Rosner's correlated binary data model.

This article is organized as follows. The data structure and the probability model are described in Section 2. Seven asymptotic confidence intervals and four Bootstrap-resampling CIs are proposed in Section 3. In Section 4, the performance of all CIs is evaluated via simulation studies in terms of the ECP, the ECW and the RMNCP. An illustration of our methodologies with otolaryngological data is presented in Section 5. The paper closes with a brief conclusion and discussion in Section 6.

2. Data structure

Let $m_{h^{'} i}^{(1)}$ represent the number of individuals with $h^{'}$ ear being cured in the ith treatment for the unilateral group, and $m_{h i}^{(2)}$ represent the number of individuals with h ear/ears being cured in the ith treatment for the bilateral group, $p_{h^{'} i}^{(1)}$ and $p_{h i}^{(2)}$ be the corresponding probabilities ( $h^{'} = 0, 1$ , $h = 0, 1, 2, i = 0, 1$ ). Let $m_{+ i}^{(s)} = \sum_{h = 0}^{s} m_{h i}^{(s)}$ (s = 1, 2) and $m_{+ +} = \sum_{s = 1}^{2} \sum_{i = 0}^{1} m_{+ i}^{(s)}$ .

Let $Z_{i j}^{(1)} = 1$ if the jth individual received the ith treatment in the unilateral group is OME-cured and $Z_{i j}^{(1)} = 0$ otherwise. In the bilateral group, $Z_{i j k}^{(2)} = 1$ if the kth ear of the jth individual received the ith treatment is OME-cured and $Z_{i j k}^{(2)} = 0$ otherwise ( $i = 0, 1, j = 1, 2, \dots, m_{+ i}^{(s)}, s = 1, 2, k = 1, 2$ ). Following Rosner [20], it is eligible to assume that the correlation of the responses from the two ears should be the same for the two treatments, i.e.

\begin{aligned} P r (Z_{i j}^{(1)} & = 1) = P r (Z_{i j k}^{(2)} = 1) = λ_{i}, \\ and P r (Z_{i j k}^{(2)} & = 1 ∣ Z_{i j, 3 - k}^{(2)} = 1) = R λ_{i} \end{aligned}

for $i = 0, 1, j = 1, 2, \dots, m_{+ i}^{(s)}, s = 1, 2, k = 1, 2$ . As shown in [20], the correlation coefficient between $Z_{i j 1}^{(2)}$ and $Z_{i j 2}^{(2)}$ is $ρ = λ_{i} (R - 1) / (1 - λ_{i})$ for i = 0, 1, then R represents a measurement of the dependence between two ears of the same individual in two treatments. Specially, R = 1 if two ears are completely independent, while $R λ_{i} = 1 (i = 0, 1)$ if two ears are completely dependent. By simple calculation, we can have the following probability model under Rosner's assumption:

\begin{aligned} \begin{aligned} p_{0 i}^{(1)} & = 1 - λ_{i}, p_{1 i}^{(1)} = λ_{i}, \\ p_{0 i}^{(2)} & = 1 + R λ_{i}^{2} - 2 λ_{i}, p_{1 i}^{(2)} = 2 λ_{i} (1 - R λ_{i}), p_{2 i}^{(2)} = R λ_{i}^{2} \end{aligned} \end{aligned}

(1)

for i = 0, 1. The data structure and the probability model for the combined unilateral and bilateral outcomes are given in Table 2.

Table 2.

The data structure and probability model for combined unilateral and bilateral data.

	Unilatral		Bilateral
No. of ears with OME-free	i = 0	i = 1	i = 0	i = 1	Total
0	$m_{00}^{(1)} (p_{00}^{(1)})$	$m_{01}^{(1)} (p_{01}^{(1)})$	$m_{00}^{(2)} (p_{00}^{(2)})$	$m_{01}^{(2)} (p_{01}^{(2)})$
1	$m_{10}^{(1)} (p_{10}^{(1)})$	$m_{11}^{(1)} (p_{11}^{(1)})$	$m_{10}^{(2)} (p_{10}^{(2)})$	$m_{11}^{(2)} (p_{11}^{(2)})$
2	–	–	$m_{20}^{(2)} (p_{20}^{(2)})$	$m_{21}^{(2)} (p_{21}^{(2)})$
Total	$m_{+ 0}^{(1)} (1.0)$	$m_{+ 1}^{(1)} (1.0)$	$m_{+ 0}^{(2)} (1.0)$	$m_{+ 1}^{(2)} (1.0)$	$m_{+ +}$

Open in a new tab

According to Rosner's assumption, the log-likelihood function of the parameters $λ_{0}$ , $λ_{1}$ and R for the observation data $m = {(m_{0 i}^{(1)}$ , $m_{1 i}^{(1)}$ , $m_{0 i}^{(2)}$ , $m_{1 i}^{(2)}$ , $m_{2 i}^{(2)})$ : $i = 0, 1}$ is given by

\begin{aligned} l (m; λ_{0}, λ_{1}, R) & = C + m_{00}^{(1)} \log (1 - λ_{0}) + m_{10}^{(1)} \log λ_{0} + m_{01}^{(1)} \log (1 - λ_{1}) + m_{11}^{(1)} \log λ_{1} \\ + m_{00}^{(2)} \log (1 + R λ_{0}^{2} - 2 λ_{0}) + m_{10}^{(2)} \log [2 λ_{0} (1 - R λ_{0})] + m_{20}^{(2)} \log (R λ_{0}^{2}) \\ + m_{01}^{(2)} \log (1 + R λ_{1}^{2} - 2 λ_{1}) + m_{11}^{(2)} \log [2 λ_{1} (1 - R λ_{1})] + m_{21}^{(2)} \log (R λ_{1}^{2}), \end{aligned}

(2)

where C is a constant that doesn't involve parameters.

Let $δ = λ_{1} - λ_{0}$ be the difference between two proportions for two treatments, then $λ_{1} = λ_{0} + δ$ . Therefore, the log-likelihood function given in Equation (2) becomes

\begin{aligned} ℓ (m; δ, λ_{0}, R) & = C + m_{00}^{(1)} \log (1 - λ_{0}) + (m_{10}^{(1)} + m_{10}^{(2)} + 2 m_{20}^{(2)}) \log λ_{0} \\ + m_{01}^{(1)} \log (1 - λ_{0} - δ) \\ + (m_{11}^{(1)} + m_{11}^{(2)} + 2 m_{21}^{(2)}) \log (λ_{0} + δ) + m_{00}^{(2)} \log (1 + R λ_{0}^{2} - 2 λ_{0}) \\ + (m_{20}^{(2)} + m_{21}^{(2)}) \log R + m_{10}^{(2)} \log (1 - R λ_{0}) + m_{11}^{(2)} \log [1 - R (λ_{0} + δ)] \\ + m_{01}^{(2)} \log [1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)] . \end{aligned}

(3)

It is easily shown that the parameter vector $(δ, λ_{0}, R)$ satisfies the following conditions:

\begin{array}{lll} when δ \geq 0, 0 \leq λ_{0} \leq 1 - δ and max {\frac{2 (λ_{0} + δ) - 1}{(λ_{0} + δ)^{2}}, 0.0} \leq R \leq \frac{1}{λ_{0} + δ}, \\ when δ < 0, - δ \leq λ_{0} \leq 1 and max {\frac{2 λ_{0} - 1}{λ_{0}^{2}}, 0.0} \leq R \leq \frac{1}{λ_{0}} . \end{array}

In order to investigate whether there is a significant difference between the two treatments, we are interested in the confidence interval construction for the proportion difference (i.e. δ) in this article. Seven asymptotic confidence interval estimators and four Bootstrap-resampling confidence intervals are developed and evaluated as follows.

3. Confidence interval estimators

3.1. CIs based on Wald-type statistics

It is easily shown that the sample estimates of $λ_{0}$ and $λ_{1}$ are, respectively, given by

{\hat{λ}}_{0} = \frac{m_{10}^{(1)} + m_{10}^{(2)} + 2 m_{20}^{(2)}}{m_{+ 0}^{(1)} + 2 m_{+ 0}^{(2)}}, {\hat{λ}}_{1} = \frac{m_{11}^{(1)} + m_{11}^{(2)} + 2 m_{21}^{(2)}}{m_{+ 1}^{(1)} + 2 m_{+ 1}^{(2)}} .

(4)

Therefore, the sample estimate of δ can be given by

\hat{δ} = \frac{m_{11}^{(1)} + m_{11}^{(2)} + 2 m_{21}^{(2)}}{m_{+ 1}^{(1)} + 2 m_{+ 1}^{(2)}} - \frac{m_{10}^{(1)} + m_{10}^{(2)} + 2 m_{20}^{(2)}}{m_{+ 0}^{(1)} + 2 m_{+ 0}^{(2)}},

(5)

and the maximum likelihood estimation $\hat{R}$ of the parameter R can be obtained by solving the following equation:

\frac{m_{00}^{(2)} {\hat{λ}}_{0}^{2}}{1 + R {\hat{λ}}_{0}^{2} - 2 {\hat{λ}}_{0}} - \frac{m_{10}^{(2)} {\hat{λ}}_{0}}{1 - R {\hat{λ}}_{0}} + \frac{m_{20}^{(2)} + m_{21}^{(2)}}{R} \frac{m_{01}^{(2)} {\hat{λ}}_{1}^{2}}{1 + R {\hat{λ}}_{1}^{2} - 2 {\hat{λ}}_{1}} - \frac{m_{11}^{(2)} {\hat{λ}}_{1}}{1 - R {\hat{λ}}_{1}} = 0.

(6)

It is easily shown that the variance of ${\hat{λ}}_{i}$ (i = 0, 1) can be given by

Var ({\hat{λ}}_{i}) = \frac{m_{+ i}^{(1)} λ_{i} (1 - λ_{i}) + 2 m_{+ i}^{(2)} λ_{i} [1 + (R - 2) λ_{i}]}{(m_{+ i}^{(1)} + 2 m_{+ i}^{(2)})^{2}} .

(7)

Therefore, the variance of $\hat{δ}$ is given by $σ^{2} = σ^{2} (δ, λ_{0}, R) = Var ({\hat{λ}}_{0}) + Var ({\hat{λ}}_{1}),$ and it can be estimated by

\begin{aligned} {\hat{σ}}^{2} & = σ^{2} (\hat{δ}, {\hat{λ}}_{0}, \hat{R}) = \frac{m_{+ 0}^{(1)} {\hat{λ}}_{0} (1 - {\hat{λ}}_{0}) + 2 m_{+ 0}^{(2)} {\hat{λ}}_{0} [1 + (\hat{R} - 2) {\hat{λ}}_{0}]}{(m_{+ 0}^{(1)} + 2 m_{+ 0}^{(2)})^{2}} \\ + \frac{m_{+ 1}^{(1)} ({\hat{λ}}_{0} + \hat{δ}) (1 - {\hat{λ}}_{0} - \hat{δ}) + 2 m_{+ 1}^{(2)} ({\hat{λ}}_{0} + \hat{δ}) [1 + (\hat{R} - 2) ({\hat{λ}}_{0} + \hat{δ})]}{(m_{+ 1}^{(1)} + 2 m_{+ 1}^{(2)})^{2}} \end{aligned}

(8)

and

\begin{aligned} {\tilde{σ}}^{2} (δ) & = σ^{2} (δ, {\tilde{λ}}_{0}, \tilde{R}) \\ = \frac{m_{+ 0}^{(1)} {\tilde{λ}}_{0} (1 - {\tilde{λ}}_{0}) + 2 m_{+ 0}^{(2)} {\tilde{λ}}_{0} [1 + (\tilde{R} - 2) {\tilde{λ}}_{0}]}{(m_{+ 0}^{(1)} + 2 m_{+ 0}^{(2)})^{2}} \\ + \frac{m_{+ 1}^{(1)} ({\tilde{λ}}_{0} + δ) (1 - {\tilde{λ}}_{0} - δ) + 2 m_{+ 1}^{(2)} ({\tilde{λ}}_{0} + δ) [1 + (\tilde{R} - 2) ({\tilde{λ}}_{0} + δ)]}{(m_{+ 1}^{(1)} + 2 m_{+ 1}^{(2)})^{2}}, \end{aligned}

(9)

respectively, where ${\tilde{λ}}_{0} = {\tilde{λ}}_{0} (δ)$ , $\tilde{R} = \tilde{R} (δ)$ are the constrained maximum-likelihood estimates (CMLEs) of $λ_{0}$ and R given the value of δ, and they can be obtained by solving the following equations:

\frac{\partial ℓ (m; δ, λ_{0}, R)}{\partial λ_{0}} = 0, \frac{\partial ℓ (m; δ, λ_{0}, R)}{\partial R} = 0.

(10)

No closed-form exists, an iterative algorithm (e.g. Fisher-score iterative algorithm, refer to Appendix 1 for details) can be used to find the solutions to the above equations. When the solutions are out of the parameter space, we can use the search algorithm to find the CMLEs. According to the central limit theorem, it is easily shown that $T = (\hat{δ} - δ) / \sqrt{Var (\hat{δ})}$ is asymptotically distributed as standard normal distribution as all $m_{+ i}^{(1)}$ and $m_{+ i}^{(2)}$ ( $i = 0, 1$ ) are large. Therefore, the $100 (1 - α) %$ confidence interval for δ based on the Wald method is given by

{CI}_{w_{1}} = [max {- 1, \hat{δ} - z_{α / 2} \hat{σ}}, min {1, \hat{δ} + z_{α / 2} \hat{σ}}],

(11)

where $z_{α / 2}$ is the upper $α / 2$ percentile of the standard normal distribution.

Similar to Wilson [27], the $100 (1 - α) %$ confidence lower and upper limits for δ can be obtained by solving the equations with respect to $δ : (\hat{δ} - δ) / \tilde{σ} (δ) = z_{α / 2}$ and $(\hat{δ} - δ) / \tilde{σ} (δ) = - z_{α / 2}$ , respectively. Let $f (δ) = (\hat{δ} - δ) / \tilde{σ} (δ) - z_{α / 2}$ and $g (δ) = (\hat{δ} - δ) / \tilde{σ} (δ) + z_{α / 2}$ , the following bisection method can be used to find the solutions of these equations:

Step 1: Select a suitable constant h , for example, 0.1, 0.05, etc., start from $- 1$ and search for the minimum positive integer k such that $f (- 1 + k h) \cdot f (- 1 + (k + 1) h) \leq 0$ , where $[- 1 + k h, - 1 + (k + 1) h] \subseteq [- 1, 1]$ .

Step 2: Use the bisection method to find the root of $f (δ) = 0$ on the interval $[- 1 + k h, - 1 + (k + 1) h]$ , then we obtain the lower limit of the interval, denoted as $δ_{w l}$ .

Step 3: Similarly, start from 1 and search for the minimum positive integer k such that $g (1 - k h) \cdot g (1 - (k + 1) h) \leq 0$ , where $[1 - k h, 1 - (k + 1) h] \subseteq [- 1, 1]$ . Use the bisection method to find the root of $g (δ) = 0$ on the interval $[1 - k h, 1 - (k + 1) h]$ , then we obtain the upper limit of the interval, denoted as $δ_{w u}$ .

Therefore, the $100 (1 - α) %$ confidence interval for δ based on the Wilson method is given by ${CI}_{w_{2}} = [δ_{w l}, δ_{w u}]$ .

3.2. CI based on Agresti–Coull method

As shown in many literatures, the Wald CI given by Equation (11) usually performs not well when the sample size is small in the sense that it usually provides empirical coverage probabilities lower than the pre-specified confidence level. Adding a small count to every cell before computing the interval limits is a common strategy. For example, Agresti and Coull [1] suggested a Wald CI by adding 2 for one-sample binomial problems. Similar to Agresti and Coull [1], we use the simulation approach to find the small count to be added. The simulation study shows that the Wald CI given in Equation (11) by adding 0.25 to all cell counts (i.e. $m_{h^{'} i}^{(1)}$ and $m_{h i}^{(2)}$ , $h^{'} = 0, 1$ , h = 0, 1, 2, i = 0, 1) performs well. We denote this CI as ${CI}_{m w}$ .

3.3. CI based on the inverse hyperbolic tangent transformation

Note that the Wald CI and the CI based on the Wilson method are derived from the normal approximation of $\hat{δ}$ . However, when the sample size is small or the data have a sparse structure, the asymptotical distribution of $\hat{δ}$ is usually highly skewed. In this case, the inverse hyperbolic tangent transformation (i.e. Fisher's Z transformation [11]) can be used to improve the normal approximation of $\hat{δ}$ . By using this transformation for $\hat{δ}$ , we have ${tanh}^{- 1} (\hat{δ}) = \frac{1}{2} \log [(1 + \hat{δ}) / (1 - \hat{δ})]$ . It is easily shown that the expectation of ${tanh}^{- 1} (\hat{δ})$ is given by $E [{tanh}^{- 1} (\hat{δ})] = {tanh}^{- 1} (δ)$ , and the variance of ${tanh}^{- 1} (\hat{δ})$ is given by $σ_{z}^{2} = Var ({tanh}^{- 1} (\hat{δ})) = Var (\hat{δ}) / (1 - δ^{2})^{2}$ by using the delta method. The corresponding variance estimation of ${tanh}^{- 1} (\hat{δ})$ is given by ${\hat{σ}}_{z}^{2} = {\hat{σ}}^{2} / (1 - {\hat{δ}}^{2})^{2}$ . When all $m_{+ i}^{(1)}$ and $m_{+ i}^{(2)}$ (i = 0, 1) are large, the test statistic $[{tanh}^{- 1} (\hat{δ}) - {tanh}^{- 1} (δ)] / {\hat{σ}}_{z}$ is asymptotically distributed as a standard normal distribution, so the $100 (1 - α) %$ confidence interval for ${tanh}^{- 1} (δ)$ can be given by $[δ_{l}^{(z)}, δ_{u}^{(z)}]$ , where

δ_{l}^{(z)} = {tanh}^{- 1} (\hat{δ}) - z_{α / 2} {\hat{σ}}_{z}, δ_{u}^{(z)} = {tanh}^{- 1} (\hat{δ}) + z_{α / 2} {\hat{σ}}_{z} .

Therefore, the $100 (1 - α) %$ confidence interval for δ can be obtained via the inverse transformation of ${tanh}^{- 1} (δ)$ , which is given by

{CI}_{z} = [(\exp (2 δ_{l}^{(z)}) - 1) / (\exp (2 δ_{l}^{(z)}) + 1), (\exp (2 δ_{u}^{(z)}) - 1) / (\exp (2 δ_{u}^{(z)}) + 1)] .

(12)

If $\hat{δ}$ equals $- 1$ or 1, the confidence limits for ${tanh}^{- 1} (δ)$ are not defined. In this case, we just take the CI for δ to be $[- 1, 1]$ .

3.4. CI based on the Haldane method

According to the central limit theorem, since $(\hat{δ} - δ) / \sqrt{Var (\hat{δ})}$ is asymptotically distributed as the standard normal distribution when the sample size is large enough, then we have

P (| (\hat{δ} - δ) / \sqrt{Var (\hat{δ})} | \leq z_{α / 2}) \approx 1 - α .

Let $η = λ_{1} - λ_{0}$ , then $λ_{0} = (η - δ) / 2$ and $λ_{1} = (η + δ) / 2$ . Thus, the variance $Var (\hat{δ})$ can be expressed in terms of parameters δ and η. Similar to Beal [3], we can consider the following quadratic equation of δ:

a δ^{2} - b δ + c \leq 0,

where

\begin{array}{lll} a = A B - z_{α / 2}^{2} {2 (R - 2) (B m_{+ 0}^{(2)} + A m_{+ 1}^{(2)}) - (m_{+ 0}^{(1)} B + m_{+ 1}^{(1)} A)} / 4, \\ b = 2 A B \hat{δ} + z_{α / 2}^{2} {(1 - η) (m_{+ 1}^{(1)} A - m_{+ 0}^{(1)} B) + 2 ((R - 2) η + 1) (m_{+ 1}^{(2)} A - m_{+ 0}^{(2)} B)} / 2, \\ c = A B {\hat{δ}}^{2} - z_{α / 2}^{2} η {(2 (R - 2) η + 4) (m_{+ 0}^{(2)} B + m_{+ 1}^{(2)} A) - ((η - 2) (m_{+ 0}^{(1)} B + m_{+ 1}^{(1)} A)} / 4 \end{array}

with $A = (m_{+ 0}^{(1)} + 2 m_{+ 0}^{(2)})^{2}$ and $B = (m_{+ 1}^{(1)} + 2 m_{+ 1}^{(2)})^{2}$ . If a>0 and $b^{2} - 4 a c \geq 0$ , the asymptotic $100 (1 - α) %$ confidence limits of δ are given by $[δ_{l} (η), δ_{u} (η)]$ , where $δ_{l} (η, R) = max {- 1.0, (b - \sqrt{b^{2} - 4 a c}) / 2 a}$ , $δ_{u} (η, R) = min {1.0, (b + \sqrt{b^{2} - 4 a c}) / 2 a}$ . The unknown parameters η and R can be estimated by $\hat{η} = {\hat{λ}}_{1} + {\hat{λ}}_{0}$ and $\hat{R}$ , respectively, therefore, the asymptotic $100 (1 - α) %$ confidence interval for δ is given by

{CI}_{h} = [δ_{l} (\hat{η}, \hat{R}), δ_{u} (\hat{η}, \hat{R})] .

(13)

As noted by Beal [3], this confidence interval can be regarded as the extension of ‘Haldane Interval’ to account for the intraclass correlation of the combined unilateral and bilateral data. Note that the probability that the estimation of $λ_{i}$ is 0 cannot be negligible when $λ_{i}$ are small, especially when $m_{+ i}^{(1)}$ and $m_{+ i}^{(2)}$ (i = 0, 1) are small. In this case, we can adopt the simple adjustment by adding 0.5 to each cell.

3.5. CI based on the score test statistic

Using the general theory of efficient scores proposed by Rao [19], the score statistic for testing the hypothesis $H_{0} : δ = δ_{0}$ can be given by

T_{s c} (δ_{0}) = \frac{\partial ℓ (m; δ, λ_{0}, R)}{\partial δ} \sqrt{I^{11}} |_{δ = δ_{0}, λ_{0} = {\tilde{λ}}_{0} (δ_{0}), R = \tilde{R} (δ_{0})},

where

\begin{aligned} \frac{\partial ℓ (m; δ, λ_{0}, R)}{\partial δ} & = \frac{m_{11}^{(1)} + m_{11}^{(2)} + 2 m_{21}^{(2)}}{λ_{0} + δ} - \frac{m_{01}^{(1)}}{1 - λ_{0} - δ} \\ - \frac{2 m_{01}^{(2)} [1 - R (λ_{0} + δ)]}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} - \frac{m_{11}^{(2)} R}{1 - R (λ_{0} + δ)} \end{aligned}

is the score function, and $I^{11}$ is the $(1, 1)$ th element of the inverse of the Fisher information matrix, which is given by

\begin{aligned} \begin{aligned} I^{11} & = {[I_{11} - (\begin{matrix} I_{12} & I_{13} \end{matrix}) {(\begin{matrix} I_{22} & I_{23} \\ I_{23} & I_{33} \end{matrix})}^{- 1} (\begin{matrix} I_{12} \\ I_{13} \end{matrix})]}^{- 1} \\ = \frac{I_{22} I_{33} - (I_{23})^{2}}{2 I_{12} I_{13} I_{23} + I_{11} I_{22} I_{33} - I_{11} (I_{23})^{2} - I_{22} (I_{13})^{2} - I_{33} (I_{12})^{2}}, \end{aligned} \end{aligned}

where $I_{s l}$ ( $1 \leq s \leq l \leq 3$ ) is the element of the Fisher information matrix (refer to Appendix for details).

As shown in [19], $T_{s c}$ is asymptotically distributed as a standard normal distribution under $H_{0} : δ = δ_{0}$ as all $m_{+ i}^{(1)}$ and $m_{+ i}^{(2)}$ (i = 0, 1) are large. Therefore, the $100 (1 - α) %$ confidence interval for δ based on the score test can be given by

{CI}_{s c} = [δ_{s l}, δ_{s u}],

(14)

where the lower limit $δ_{s l}$ is the solution of the equation $T_{s c} (δ_{0}) = z_{α / 2}$ with respect to $δ_{0}$ , and the upper limit $δ_{s u}$ is the solution of the equation $T_{s c} (δ_{0}) = - z_{α / 2}$ , respectively. Similarly, no closed form exists; the above bisection method given in Section 3.1 can be used to obtain the solutions.

3.6. CI based on the profile-likelihood-ratio test

For testing the hypothesis $H_{0} : δ = δ_{0}$ , the likelihood ratio test statistic is given by

T_{l} (δ_{0}) = 2 [ℓ (m; \hat{δ}, {\hat{λ}}_{0}, \hat{R}) - ℓ (m; δ_{0}, {\tilde{λ}}_{0} (δ_{0}), \tilde{R} (δ_{0}))] .

Similarly, $T_{l} (δ_{0})$ is asymptotically distributed as the Chi-square distribution with one degree of freedom under $H_{0} : δ = δ_{0}$ when all $m_{+ i}^{(1)}$ and $m_{+ i}^{(2)} \to \infty$ (i = 0, 1). Therefore, the asymptotic profile likelihood CI for δ can be obtained by inverting the likelihood ratio test $T_{l} (δ_{0})$ , i.e. the $100 (1 - α) %$ confidence lower and upper limits for δ satisfy

2 [ℓ (m; \hat{δ}, {\hat{λ}}_{0}, \hat{R}) - ℓ (m; δ_{0}, {\tilde{λ}}_{0} (δ_{0}), \tilde{R} (δ_{0}))] \leq χ_{1, α}^{2},

where $χ_{1, α}^{2}$ is the upper α percentile of the chi-square distribution with one degree of freedom. Let $f (δ_{0}) = 2 [ℓ (m; \hat{δ}, {\hat{λ}}_{0}, \hat{R}) - ℓ (m; δ_{0}, {\tilde{λ}}_{0} (δ_{0}), \tilde{R} (δ_{0}))] - χ_{1, α}^{2}$ . Similarly, the following bisection method can be used to find the upper and lower limits:

Step 1: Select a suitable constant h, for example, 0.1, 0.05, etc., starting from $- 1$ and search for the minimum positive integer k such that $f (- 1 + k h) \cdot f (- 1 + (k + 1) h) \leq 0$ , where $[- 1 + k h, - 1 + (k + 1) h] \subseteq [- 1, 1]$ .

Step 2: Use the bisection method to find the root of $f (δ_{0}) = 0$ on the interval $[- 1 + k h, - 1 + (k + 1) h]$ , then we obtain the lower limit of the interval, denoted as $δ_{l l}$ .

Step 3: Similarly, start from 1 and search for the minimum positive integer k such that $f (1 - k h) \cdot f (1 - (k + 1) h) \leq 0$ , where $[1 - k h, 1 - (k + 1) h] \subseteq [- 1, 1]$ . Use the bisection method to find the root of $f (δ_{0}) = 0$ on the interval $[1 - k h, 1 - (k + 1) h]$ , then we obtain the upper limit of the interval, denoted as $δ_{l u}$ .

Therefore, the $100 (1 - α) %$ confidence interval for δ based on the likelihood ratio test is given by

{CI}_{l} = [δ_{l l}, δ_{l u}] .

(15)

3.7. Bootstrap-resampling CIs

It is well known that the Bootstrap-resampling method has been applied extensively in many fields. For example, Efron and Tibshirani [10] and Shao and Tu [22] used the Bootstrap-resampling method to estimate the variability of complicated statistics, and Li [13] suggested the use of a Bootstrap procedure to generate the empirical distribution of the test statistic in ROC analysis. In particular, when the sample size is small, confidence intervals for proportion difference based on the large sample approximation may not be reliable. In this case, the Bootstrap-resampling method is usually recommended to construct CIs [10,22]. Therefore, we consider the following parametric Bootstrap-resampling procedure to construct the CI for δ:

Step 1: Given the observed data $m = {(m_{0 i}^{(1)}$ , $m_{1 i}^{(1)}$ , $m_{0 i}^{(2)}$ , $m_{1 i}^{(2)}$ , $m_{2 i}^{(2)})$ : $i = 0, 1}$ , we can obtain the parameter estimators ${\hat{λ}}_{0}$ , ${\hat{λ}}_{1}$ , $\hat{δ}$ and $\hat{R}$ of the parameters $λ_{0}$ , $λ_{1}$ , δ and R via Equations (4)–(6). Let ${\hat{σ}}^{2}$ be the estimate of $σ^{2}$ that is calculated from the observed data.

Step 2: Generate the Bootstrap sample $m^{*} = {(m_{0 i}^{* (1)}$ , $m_{1 i}^{* (1)}$ , $m_{0 i}^{* (2)}$ , $m_{1 i}^{* (2)}$ , $m_{2 i}^{* (2)})$ : $i = 0, 1}$ from the product of binomial and trinomial distributions based on the estimated parameters, i.e. $m_{1 i}^{* (1)}$ follows Binomial distribution $(m_{+ i}^{(1)}; {\hat{p}}_{1 i}^{(1)})$ and $(m_{0 i}^{* (2)}, m_{1 i}^{* (2)}, m_{2 i}^{* (2)})$ follows Trinomial distribution $(m_{+ i}^{(2)}; {\hat{p}}_{0 i}^{(2)}, {\hat{p}}_{1 i}^{(2)}, {\hat{p}}_{2 i}^{(2)})$ (i = 0, 1), where $p_{h^{'} i}^{(1)}$ and $p_{h i}^{(2)}$ ( $h^{'} = 0, 1$ , h = 0, 1, 2) are given by Equation (1), in which parameters $λ_{0}$ , $λ_{1}$ and R are substituted by their estimators ${\hat{λ}}_{0}$ , ${\hat{λ}}_{1}$ and $\hat{R}$ , respectively.

Step 3: For each generated sample $m^{*}$ , calculate the estimations ${\hat{λ}}_{0}^{*}$ , ${\hat{δ}}^{*}$ and ${\hat{R}}^{*}$ of $λ_{0}$ , δ and R via Equations (4)–(6).

Step 4: Independently repeating the above process (i.e. Steps 2–3) B times, we can obtain B Bootstrap estimators ${\hat{δ}}^{* (b)}$ , ${\hat{λ}}_{0}^{* (b)}$ and ${\hat{R}}^{* (b)}$ of δ, $λ_{0}$ and R ( $b = 1, 2, \dots, B$ ), respectively. The B Bootstrap estimators ${{\hat{δ}}^{* (b)}}_{b = 1}^{B}$ are then ordered from the smallest to the largest, and let ${\hat{δ}}_{(1)}^{*}$ , ${\hat{δ}}_{(2)}^{*}$ , ··· , ${\hat{δ}}_{(B)}^{*}$ be the ordered values.

(i) Bootstrap percentile CI

Following Efron and Tibshirani [10], the $100 (1 - α) %$ Bootstrap percentile CI for δ is given by

{CI}_{b_{1}} = [{\hat{δ}}_{([B α / 2])}, {\hat{δ}}_{([B (1 - α / 2)])}],

(16)

where $[a]$ denotes the maximum integer not greater than a.

(ii) Bootstrap percentile-t CI

Let ${\hat{σ}}^{* (b)} = σ ({\hat{δ}}^{* (b)}, {\hat{λ}}_{0}^{* (b)}, {\hat{R}}^{* (b)})$ be the estimated standard deviation of the bth Bootstrap estimator ${\hat{δ}}^{* (b)}$ . For each of the B bootstrap samples, we can obtain ${t^{* (b)} = ({\hat{δ}}^{* (b)} - \hat{δ}) / {\hat{σ}}^{* (b)} : b = 1, 2, \dots, B}$ . Following Efron and Tibshirani [10], the $100 (1 - α) %$ bootstrap percentile-t confidence interval for δ can be obtained by

{CI}_{b_{2}} = [max {- 1, \hat{δ} - t_{([(1 - α / 2) B])}^{*} \hat{σ}}, min {1, \hat{δ} + t_{([(1 - α / 2) B])}^{*} \hat{σ}}],

(17)

where $t_{(b)}^{*}$ 's denote the ordered values of $t^{* (b)}$ 's from the smallest to the largest and $\hat{σ}$ is the positive squared root of ${\hat{σ}}^{2}$ given in Step 1.

(iii) Bias-corrected Bootstrap percentile CI

Following DiCiccio and Efron [8], the $100 (1 - α) %$ confidence interval for δ can be given by

{CI}_{b_{3}} = [{\hat{δ}}_{([B α_{1}])}, {\hat{δ}}_{([B α_{2}])}],

(18)

where $α_{1} = Φ (2 {\hat{z}}_{0} - z_{1 - α / 2})$ , $α_{2} = Φ (2 {\hat{z}}_{0} + z_{1 - α / 2})$ with $z_{0} = Φ^{- 1} (\frac{1}{B} \sum_{b = 1}^{B} I ({\hat{δ}}^{* (b)} < \hat{δ}))$ . Here, $Φ (\cdot)$ is the standard normal distribution function, and $Φ^{- 1} (\cdot)$ is its inverse.

(iv) Bootstrap percentile-t CI combining with the inverse hyperbolic tangent transformation

Since that a variance stabilizing transformation much like the inverse hyperbolic tangent transformation can give good results when this transformation is used to construct CI for numbers lying between −1 and +1, then we consider a Bootstrap percentile-t CI combining with the inverse hyperbolic tangent transformation. First, for each of the B bootstrap samples, we can obtain ${t_{z}^{* (b)} = [{tanh}^{- 1} ({\hat{δ}}^{* (b)}) - {tanh}^{- 1} (\hat{δ})] \cdot [1 - ({\hat{δ}}^{* (b)})^{2}] / {\hat{σ}}^{* (b)} : b = 1, 2, \dots, B}$ , then the $100 (1 - α) %$ bootstrap percentile-t confidence interval for ${tanh}^{- 1} (δ)$ can be obtained by $[δ_{b z l}, δ_{b z u}] = [{tanh}^{- 1} (\hat{δ}) - t_{z ([(1 - α / 2) B])}^{*} \hat{σ} / [1 - ({\hat{δ}}^{* (b)})^{2}]$ , ${tanh}^{- 1} (\hat{δ}) + t_{z ([(1 - α / 2) B])}^{*} \hat{σ} / [1 - ({\hat{δ}}^{* (b)})^{2}]]$ , where $t_{z (b)}^{*}$ 's denote the ordered values of $t_{z}^{* (b)}$ 's from the smallest to the largest. Therefore, the $100 (1 - α) %$ bootstrap confidence interval for δ is given by

{CI}_{b_{4}} = [(\exp (2 δ_{b z l}) - 1) / (\exp (2 δ_{b z l}) + 1), (\exp (2 δ_{b z u}) - 1) / (\exp (2 δ_{b z u}) + 1)] .

(19)

Note that the asymptotical methods based on the large sample assumption do not necessarily control their actual coverage probabilities at the pre-specified confidence level for small sample sizes. In this case, some adjusted methods can be used to construct the CI for proportion difference, for example, the approximate unconditional methods proposed in Tang et al. [24], the exact binomial method and mid-P method modified from the exact binomial method proposed in Li et al. [14]. However, for two groups of combined unilateral and bilateral data, a severe computing burden will be encountered for our problem even for the very small sample size $m_{+ 0}^{(1)} = m_{+ 1}^{(1)} = m_{+ 0}^{(2)} = m_{+ 1}^{(2)} = 10$ . Therefore, we have not adopted the exact and approximate unconditional methods in this article.

4. Simulation study

In this section, we investigate the performance of the proposed confidence intervals via simulation studies in terms of the empirical coverage probability (ECP) and the empirical coverage width (ECW). In general, a method to construct a CI is better if the ECP is closer to the pre-specified confidence level and the ECW is smaller. Following Newcombe [16], the location of a CI can be characterized in terms of the balance between the mesial non-coverage probability (MNCP) and the distal non-coverage probability (DNCP). A simple index, i.e. the ratio of the MNCP to the non-coverage probability (RMNCP=MNCP/( $1 -$ ECP )) can be used to evaluate the location of a CI, where DNCP and MNCP are defined with respect to the true value of δ. These evaluation indices are defined as follows.

(i) Empirical coverage probability

ECP = \frac{1}{K} \sum_{k = 1}^{K} I (δ \in [δ_{l} (m^{(k)}), δ_{u} (m^{(k)})]),

where K is the number of replications and $I (\cdot)$ is the indicator function, $m^{(k)} = {(m_{0 i}^{(1)}$ , $m_{1 i}^{(1)}$ , $m_{0 i}^{(2)}$ , $m_{1 i}^{(2)}$ , $m_{2 i}^{(2)})^{(k)}$ : $i = 0, 1}$ is the kth replication, and $[δ_{l} (m^{(k)}), δ_{u} (m^{(k)})]$ is the CI that is constructed from the set of $m^{(k)}$ by any of the 11 methods under evaluation.

(ii) Empirical coverage width

ECW = \frac{1}{K} \sum_{k = 1}^{K} (δ_{u} (m^{(k)}) - δ_{l} (m^{(k)})) .

(iii) Ratio of the mesial non-coverage probability to the non-coverage probability

According to Newcombe [16], the mesial non-coverage probability (MNCP) is defined as

MNCP = \frac{1}{K} \sum_{k = 1}^{K} I (δ \in A (m^{(k)})),

where $A (m^{(k)})$ is given by

A (m^{(k)}) = {\begin{cases} [- 1.0, δ_{l} (m^{(k)})), & if δ > 0.0, \\ (δ_{u} (m^{(k)}), 1.0] \cup [- 1.0, δ_{l} (m^{(k)})), & if δ = 0.0, \\ (δ_{u} (m^{(k)}), 1.0], & if δ < 0.0, \end{cases}

and distal non-coverage probability (DNCP) is defined as

DNCP = \frac{1}{K} \sum_{k = 1}^{K} I (δ \in B (m^{(k)})),

where $B (m^{(k)})$ is given by

B (m^{(k)}) = {\begin{cases} [- 1.0, δ_{l} (m^{(k)})), & if δ < 0.0, \\ (δ_{u} (m^{(k)}), 1.0], & if δ > 0.0 . \end{cases}

The ratio of the mesial non-coverage probability to the non-coverage probability (RMNCP) is then defined as

RMNCP = MNCP / NCP = MNCP / (MNCP + DNCP) .

As shown in [16], a CI is classified as satisfactory if the RMNCP lies in [0.4, 0.6], and too mesially located if the ratio is smaller than 0.4, and too distal if it is greater than 0.6.

To evaluate the proposed methods, we consider three kinds of sample size designs of $(m_{+ 0}^{(1)}$ , $m_{+ 1}^{(1)})$ , $m_{+ 0}^{(2)}$ , $m_{+ 1}^{(2)})$ : (i) small sample size $(20$ , 20, 20, $20)$ ; (ii) moderate sample size $(30$ , 30, 30, $30)$ ; and (iii) large sample size $(50$ , 50, 50, $50)$ . We only report the simulation results for balanced sample sizes due to no substantial difference for the performance between balanced and unbalanced sample sizes. For parameter settings of ( $λ_{0}$ , δ, R), a total of $54 = 3 \times 3 \times 6$ combinations with $δ = 0.0$ , 0.10, 0.15, $λ_{0} = 0.1$ , 0.3, 0.5 and $R = 1.0 (0.1) 1.5$ , i.e. from 1.0 to 1.5 with step size 0.1 are considered in the simulation studies. A total of 5000 replications are conducted to the simulation studies and the bootstrap CI of δ is based on 2000 replications. According to the definition of MNCP, when $δ = 0.0$ , the RMNCP is always equal to 1.0. Therefore, we only report RMNCPs of all CIs for the situations with $δ \neq 0$ .

Figures 1(i), 2(i) and 3(i) report the change of ECPs of two-sided $95 %$ CIs for δ as the change of R. It is observed that ${CI}_{w_{1}}$ and ${CI}_{b_{3}}$ have slightly deflated ECPs when the sample size is small (e.g. $m_{+ 0}^{(1)} = m_{+ 1}^{(1)} = m_{+ 0}^{(2)} = m_{+ 1}^{(2)} = 20$ ), and ${CI}_{m w}$ is slightly conservative for $δ = 0.0$ , $λ_{0} = 0.1$ under the small sample size (i.e. $m_{+ 0}^{(1)} = m_{+ 1}^{(1)} = m_{+ 0}^{(2)} = m_{+ 1}^{(2)} = 20$ ), other CIs generally perform well in the sense that they can well control their ECPs around the pre-specified confidence level. Although the intra-class correlation is not ignorable in bilateral data analysis, it seems that the CIs except for ${CI}_{l}$ are independent of data structure, i.e. there is no obvious pattern on ECPs as R changes as shown in Figures 1–3, while the ECP of ${CI}_{l}$ increases as the increase of R for $λ_{0} = 0.5$ with $δ = 0.1$ and 0.15.

Figure 1. — (i) ECPs of two-sided 95% CIs of δ versus R; (ii) ECWs of two-sided 95% CIs of δ versus R under sample size $(m_{+ 0}^{(1)}, m_{+ 1}^{(1)}, m_{+ 0}^{(2)}, m_{+ 1}^{(2)}) = (20, 20, 20, 20)$ .

Figure 2. — (i) ECPs of two-sided 95% CIs of δ versus R; (ii) ECWs of two-sided 95% CIs of δ versus R under sample size $(m_{+ 0}^{(1)}, m_{+ 1}^{(1)}, m_{+ 0}^{(2)}, m_{+ 1}^{(2)}) = (30, 30, 30, 30)$ .

Figure 3. — (i) ECPs of two-sided 95% CIs of δ versus R; (ii) ECWs of two-sided 95% CIs of δ versus R under sample size $(m_{+ 0}^{(1)}, m_{+ 1}^{(1)}, m_{+ 0}^{(2)}, m_{+ 1}^{(2)}) = (50, 50, 50, 50)$ .

Figures 1(ii), 2(ii) and 3(ii) report the change of ECWs of two-sided $95 %$ CIs for δ as the change of R. It is interesting to find that ECWs of the CIs except for ${CI}_{s c}$ and ${CI}_{l}$ tend to be wider as R increases, and can be seen increasing patterns that look like linear, while ECWs of ${CI}_{s c}$ and ${CI}_{l}$ show the patterns that look like nonlinear. As expected, ECWs of all CIs are shorter with the increase of the sample size.

Figure 4 reports the change of RMNCPs of two-sided $95 %$ CIs for δ as the change of R. It is observed that all CIs except for ${CI}_{b_{3}}$ usually have satisfactory interval locations, as their ratios of the mesial non-coverage probability to total non-coverage probability are close to 0.5 even for small sample sizes, while ${CI}_{b_{3}}$ has too mesially located interval in some cases. The larger the sample sizes, the closer to 0.5 the RMNCPs of all CIs. As shown in Figure 4, it seems that there is no obvious pattern on RMNCPs as R changes.

To further investigate the overall performance of the CIs across a range of values for the nuisance parameters $λ_{0}$ and R for given values of δ: i.e. $δ = - 0.2 (0.1) 0.2$ , let $c_{1} = max {(2 λ_{0} - 1) / λ_{0}^{2}, 0.0}$ and $c_{2} = max {[2 (λ_{0} + δ) - 1] / (λ_{0} + δ)^{2}, 0.0}$ ; if $δ \leq 0.0$ , $λ_{0} = (- δ + \frac{1 + δ}{10}) (\frac{1 + δ}{5}) (- δ + \frac{9 (1 + δ)}{10})$ , $R = (c_{1} + \frac{1 / λ_{0} - c_{1}}{10}) (\frac{1 / λ_{0} - c_{1}}{5}) (c_{1} + \frac{9 (1 / λ_{0} - c_{1})}{10})$ ; if $δ > 0$ , $λ_{0} = \frac{1 - δ}{10} (\frac{1 - δ}{5}) \frac{9 (1 - δ)}{10}$ , $R = (c_{2} + \frac{1 / (λ_{0} + δ) - c_{2}}{10}) (\frac{1 / (λ_{0} + δ) - c_{2}}{5}) (c_{2} + \frac{9 (1 / (λ_{0} + δ) - c_{2})}{10})$ , where $a (b) c$ means that the value is from a to c with step size b. Four sample size designs of $(m_{+ 0}^{(1)}$ , $m_{+ 1}^{(1)})$ , $m_{+ 0}^{(2)}$ , $m_{+ 1}^{(2)})$ are considered in this study, i.e. (i) $(20, 20, 20, 20)$ ; (ii) $(30, 30, 30, 30)$ ; (iii) $(50, 50, 50, 50)$ ; and (iv) $(30, 50, 30, 50)$ . Boxplots of ECPs, ECWs and RMNCPs of various CIs are reported in Figure 5.

In terms of the coverage probability, the boxplots of ECP in Figure 5 show that the results of various methods are generally satisfactory. Furthermore, ${CI}_{m w}$ , ${CI}_{w 2}$ and ${CI}_{s c}$ perform better than the others, as their median empirical coverage probabilities are closer to the preassigned confidence level even under small sample sizes. When the sample size increases, the performance of other CIs show improvements. The results given in Figures 1–3 also support these findings.

With regard to the width of the various CIs, the boxplots of ECW in Figure 5 show that ${CI}_{s c}$ has wider interval width than the others under small and moderate sample sizes (e.g. $(m_{+ 0}^{(1)}$ , $m_{+ 1}^{(1)}$ , $m_{+ 0}^{(2)}$ , $m_{+ 1}^{(2)})$ = $(20, 20, 20, 20)$ and $(30, 30, 30, 30)$ ), as its median empirical coverage widths are larger than the others. However, when the sample size is large, all CIs have similar median ECWs.

For the location of the various CIs, the boxplots of RMNCP in Figure 5 show that the RMNCPs of ${CI}_{s c}$ are almost in $[0.4, 0.6]$ , suggesting that the CI has a satisfactory location. Except for ${CI}_{b 3}$ , the other CIs can also produce good results, as their median RMNCPs are closer to 0.5, while for ${CI}_{b 3}$ , the values of RMNCP suggest that the CI tends to be slightly too mesial when the sample size is not large; the results given in Figures 1–3 also support these findings.

In summary, with the increase of the sample size, ECPs of all CIs are closer to the pre-specified confidence level, and all CIs have better interval locations and smaller interval widths. Generally, all CIs except for ${CI}_{w_{1}}$ and ${CI}_{b_{3}}$ can usually achieve the nominal coverage and exhibit shorter interval widths from small to large sample sizes, and hence are recommended to practical applications.

5. Real example

To illustrate the proposed methods in this article, we re-visit the otolaryngological study in Section 1. Let i = 0 be the treatment of Amoxicillin and i = 1 be the treatment of Cefaclor, respectively. According to Equations (4)–(6), $\hat{δ} = 0.1558$ , ${\hat{λ}}_{0} = 0.4375$ and $\hat{R} = 1.4679$ . The corresponding $95 %$ confidence intervals for δ based on various methods are reported in Table 3.

Table 3.

Various $95 %$ confidence intervals of δ for real data.

Methods	CI	Width	Methods	CI	Width
${CI}_{w 1}$	$(0.0241, 0.2875)$	0.2634	${CI}_{l}$	$(- 0.0780, 0.2247)$	0.3027
${CI}_{m w}$	$(0.0228, 0.2845)$	0.2617	${CI}_{b 1}$	$(0.0191, 0.2839)$	0.2648
${CI}_{z}$	$(0.0221, 0.2840)$	0.2619	${CI}_{b 2}$	$(0.0348, 0.2768)$	0.2420
${CI}_{h}$	$(0.0233, 0.2853)$	0.2620	${CI}_{b 3}$	$(0.0255, 0.2770)$	0.2515
${CI}_{w 2}$	$(0.0193, 0.2813)$	0.2620	${CI}_{b 4}$	$(0.0320, 0.2750)$	0.2430
${CI}_{s c}$	$(- 0.0563, 0.2127)$	0.2690

Open in a new tab

It is noteworthy that the lower limits of all CIs except for ${CI}_{s c}$ and ${CI}_{l}$ are all greater than 0.0. The results indicate that there is no evidence to support rejecting the hypothesis $δ = 0$ at the $95 %$ confidence level using ${CI}_{s c}$ and ${CI}_{l}$ , although we reject it at this confidence level using the other nine methods. In this case, which result is reliable? The results of our simulation study show that ${CI}_{s c}$ usually have a little wider interval widths than other CIs, although it can well control it's ECPs around the pre-specified confidence level. Since the upper limits of all CIs are greater than 0.0. Moreover, the lower limits of ${CI}_{s c}$ and ${CI}_{l}$ are very close to 0.0. Therefore, we are more inclined to the conclusion that the treatment of Amoxicillin is more effective than the treatment of Cefaclor at the $5 %$ nominal level.

6. Conclusion and discussion

We consider the problem of CI construction for the proportion difference based on combined unilateral and bilateral data, which are commonly observed in the paired organs or two body parts studies. Seven asymptotic CIs, i.e. ${CI}_{w 1}$ , ${CI}_{w 2}$ , ${CI}_{m w}$ , ${CI}_{z}$ , ${CI}_{h}$ , ${CI}_{s c}$ and ${CI}_{l}$ based on Rosner's dependence model, are constructed. Together with 4 bootstrap resampling methods, i.e. ${CI}_{b 1}$ , ${CI}_{b 2}$ , ${CI}_{b 3}$ and ${CI}_{b 4}$ , we have developed 11 methods to construct CIs. A large-scale empirical study has been conducted to evaluate the performance of these 11 methods from different aspects. Overall, the empirical results suggest that all CIs except for ${CI}_{w_{1}}$ and ${CI}_{b_{3}}$ generally perform well from small to large sample size designs, while ${CI}_{w_{1}}$ and ${CI}_{b_{3}}$ produce a little deflated ECPs when the sample size is small (e.g. $(m_{+ 0}^{(1)}$ , $m_{+ 1}^{(1)}$ , $m_{+ 0}^{(2)}$ , $m_{+ 1}^{(2)}) = (20, 20, 20, 20)$ ), and ${CI}_{a w}$ is slightly conservative for $δ = 0.0$ , $λ_{0} = 0.1$ under small sample sizes (e.g. $(m_{+ 0}^{(1)}$ , $m_{+ 1}^{(1)}$ , $m_{+ 0}^{(2)}$ , $m_{+ 1}^{(2)}) = (20, 20, 20, 20)$ ). All CIs can well control their ECPs close to the pre-specified confidence level with preferred RMNCPs and shorter interval widths when sample sizes are large. Therefore, all CIs except for ${CI}_{w_{1}}$ and ${CI}_{b_{3}}$ are recommended for practical applications for small to large sample size designs, and when the sample size is large, ${CI}_{w_{1}}$ and ${CI}_{b_{3}}$ can also be recommended for applications.

The intra-class correlation is not ignorable in bilateral data analysis. Therefore, simulation studies are conducted to investigate the effect of the measurement of dependence (i.e. R) on ECP, ECW and RMNCP of various CIs. It is interesting to find that the ECWs of all CIs except for ${CI}_{s c}$ and ${CI}_{l}$ show like a linear increasing trend as R increases, and ${CI}_{s c}$ and ${CI}_{l}$ show like a nonlinear trend. However, except for ${CI}_{l}$ , there is no obvious pattern on ECPs of other CIs, and for all CIs, there is no obvious pattern on RMNCPs.

It is well known that confidence intervals for proportion difference based on the large sample approximation may not be reliable when the sample size is small. In this case, other CI construction methods are available in the literature, such as exact unconditional (e.g. [2,5,6]) and approximate unconditional methods [24]. However, for two groups of combined unilateral and bilateral data, a severe computing burden will be encountered for our problem even under very small sample sizes. In addition, several methods proposed in this article perform well even under small sample sizes. Therefore, we do not consider these methods in this article.

The equivalence testing of two cure rates in the paired organs or two body parts studies has been investigated in other literature, for example, [18,20,21,25,26]; however, they considered the situation in which only bilateral binary data are available. When unilateral data are also available, although Pei et al. [17] have investigated these combined data, they just considered the hypothesis testing under the equal correlation coefficient model. In this article, we investigate a number of methods to construct CIs in the presence of both bilateral and unilateral data under Rosner's correlated binary data model, and several effective methods are recommended for practical applications.

Appendices.

Appendix 1. The restrained MLEs ${\tilde{λ}}_{0} (δ)$ and $\tilde{R} (δ)$ of $λ_{0}$ and R given the value of δ

Given the value of δ, the log-likelihood function of $m$ is given by

\begin{aligned} ℓ (λ_{0}, R) & = C + m_{00}^{(1)} \log (1 - λ_{0}) + (m_{10}^{(1)} + m_{10}^{(2)} + 2 m_{20}^{(2)}) \log λ_{0} + m_{01}^{(1)} \log (1 - λ_{0} - δ) \\ + (m_{11}^{(1)} + m_{11}^{(2)} + 2 m_{21}^{(2)}) \log (λ_{0} + δ) + m_{00}^{(2)} \log (1 + R λ_{0}^{2} - 2 λ_{0}) \\ + (m_{20}^{(2)} + m_{21}^{(2)}) \log R + m_{10}^{(2)} \log (1 - R λ_{0}) + m_{11}^{(2)} \log [1 - R (λ_{0} + δ)] \\ + m_{01}^{(2)} \log [1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)] . \end{aligned}

Differentiating $ℓ (λ_{0}, R)$ with respect to $λ_{0}$ and R yields

\begin{aligned} \frac{\partial ℓ (λ_{0}, R)}{\partial λ_{0}} & = \frac{m_{10}^{(1)} + m_{10}^{(2)} + 2 m_{20}^{(2)}}{λ_{0}} - \frac{m_{00}^{(1)}}{1 - λ_{0}} - \frac{m_{10}^{(2)} R}{1 - R λ_{0}} + \frac{2 m_{00}^{(2)} (R λ_{0} - 1)}{1 + R λ_{0}^{2} - 2 λ_{0}} \\ + \frac{m_{11}^{(1)} + m_{11}^{(2)} + 2 m_{21}^{(2)}}{λ_{0} + δ} - \frac{m_{01}^{(1)}}{1 - (λ_{0} + δ)} - \frac{m_{11}^{(2)} R}{1 - R (λ_{0} + δ)} \\ + \frac{2 m_{01}^{(2)} [R (λ_{0} + δ) - 1]}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} = 0, \\ \frac{\partial ℓ (λ_{0}, R)}{\partial R} & = \frac{m_{20}^{(2)} + m_{21}^{(2)}}{R} - \frac{m_{10}^{(2)} λ_{0}}{1 - R λ_{0}} + \frac{m_{00}^{(2)} λ_{0}^{2}}{1 + R λ_{0}^{2} - 2 λ_{0}} \\ - \frac{m_{11}^{(2)} (λ_{0} + δ)}{1 - R (λ_{0} + δ)} + \frac{m_{01}^{(2)} (λ_{0} + δ)^{2}}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} = 0. \end{aligned}

Then, the restrained MLEs ${\tilde{λ}}_{0} (δ)$ and $\tilde{R} (δ)$ of $λ_{0}$ and R satisfy the following equations:

\frac{\partial ℓ (λ_{0}, R)}{\partial λ_{0}} = 0, \frac{\partial ℓ (λ_{0}, R)}{\partial R} = 0.

Differentiating $\frac{\partial ℓ (λ_{0}, R)}{\partial λ_{0}}$ and $\frac{\partial ℓ (λ_{0}, R)}{\partial R}$ with respect to $λ_{0}$ and R leads to

\begin{aligned} \frac{\partial^{2} ℓ (λ_{0}, R)}{\partial λ_{0}^{2}} & = - \frac{m_{10}^{(1)} + m_{10}^{(2)} + 2 m_{20}^{(2)}}{λ_{0}^{2}} - \frac{m_{00}^{(1)}}{(1 - λ_{0})^{2}} - \frac{m_{10}^{(2)} R^{2}}{(1 - R λ_{0})^{2}} \\ - \frac{2 m_{00}^{(2)} (R^{2} λ_{0}^{2} - 2 R λ_{0} - R + 2)}{(1 + R λ_{0}^{2} - 2 λ_{0})^{2}} - \frac{m_{11}^{(1)} + m_{11}^{(2)} + 2 m_{21}^{(2)}}{(λ_{0} + δ)^{2}} \\ - \frac{m_{01}^{(1)}}{[1 - (λ_{0} + δ)]^{2}} - \frac{m_{11}^{(2)} R^{2}}{[1 - R (λ_{0} + δ)]^{2}} \\ - \frac{2 m_{01}^{(2)} [R^{2} (λ_{0} + δ)^{2} - 2 R (λ_{0} + δ) - R + 2]}{[1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)]^{2}} \end{aligned}

\begin{aligned} \frac{\partial^{2} ℓ (λ_{0}, R)}{\partial λ_{0} \partial R} & = - \frac{m_{10}^{(2)}}{(1 - R λ_{0})^{2}} + \frac{2 m_{00}^{(2)} λ_{0} (1 - λ_{0})}{(1 + R λ_{0}^{2} - 2 λ_{0})^{2}} - \frac{m_{11}^{(2)}}{[1 - R (λ_{0} + δ)]^{2}} \\ + \frac{2 m_{01}^{(2)} (λ_{0} + δ) [1 - (λ_{0} + δ)]}{[1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)]^{2}} \\ \frac{\partial^{2} ℓ (λ_{0}, R)}{\partial R^{2}} & = - \frac{m_{20}^{(2)} + m_{21}^{(2)}}{R^{2}} - \frac{m_{10}^{(2)} λ_{0}^{2}}{(1 - R λ_{0})^{2}} - \frac{m_{00}^{(2)} λ_{0}^{4}}{(1 + R λ_{0}^{2} - 2 λ_{0})^{2}} \\ - \frac{m_{11}^{(2)} (λ_{0} + δ)^{2}}{[1 - R (λ_{0} + δ)]^{2}} - \frac{m_{01}^{(2)} (λ_{0} + δ)^{4}}{[1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)]^{2}} . \end{aligned}

From $E (m_{h i}^{(k)} / m_{+ i}) = p_{h i}^{(k)}$ (h = 0, 1, 2, i = 0, 1 and k = 1, 2), we have

\begin{aligned} I_{11} & = E (- \frac{\partial^{2} ℓ (λ_{0}, R)}{\partial λ_{0}^{2}}) = 2 m_{+ 0}^{(2)} [\frac{R^{2} λ_{0}^{2} - 2 R λ_{0} - R + 2}{1 + R λ_{0}^{2} - 2 λ_{0}} + \frac{1}{λ_{0}} + \frac{R^{2} λ_{0}}{1 - R λ_{0}}] + \frac{m_{+ 0}^{(1)}}{λ_{0} (1 - λ_{0})} \\ + 2 m_{+ 1}^{(2)} [\frac{R^{2} (λ_{0} + δ)^{2} - 2 R (λ_{0} + δ) - R + 2}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} + \frac{1}{λ_{0} + δ} + \frac{R^{2} (λ_{0} + δ)}{1 - R (λ_{0} + δ)}], \\ I_{12} & = E (- \frac{\partial^{2} ℓ (λ_{0}, R)}{\partial λ_{0} \partial R}) = 2 m_{+ 0}^{(2)} λ_{0} [\frac{λ_{0} - 1}{1 + R λ_{0}^{2} - 2 λ_{0}} + \frac{1}{1 - R λ_{0}}] \\ + 2 m_{+ 1}^{(2)} (λ_{0} + δ) [\frac{λ_{0} + δ - 1}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} + \frac{1}{1 - R (λ_{0} + δ)}], \\ I_{22} & = E (- \frac{\partial^{2} ℓ (λ_{0}, R)}{\partial R^{2}}) = m_{+ 0}^{(2)} [\frac{λ_{0}^{4}}{1 + R λ_{0}^{2} - 2 λ_{0}} + \frac{2 λ_{0}^{3}}{1 - R λ_{0}} + \frac{λ_{0}^{2}}{R}] \\ + m_{+ 1}^{(2)} [\frac{(λ_{0} + δ)^{4}}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} + \frac{2 (λ_{0} + δ)^{3}}{1 - R (λ_{0} + δ)} + \frac{(λ_{0} + δ)^{2}}{R}] . \end{aligned}

Then, the Fisher information matrix is given by

I (λ_{0}, R) = (\begin{matrix} I_{11} & I_{12} \\ I_{12} & I_{22} \end{matrix}) .

Then the restrained MLEs ${\tilde{λ}}_{0} (δ)$ and $\tilde{R} (δ)$ by iteratively solving the following equation

(\begin{matrix} λ_{0}^{(t + 1)} \\ R^{(t + 1)} \end{matrix}) = (\begin{matrix} λ_{0}^{(t)} \\ R^{(t)} \end{matrix}) + (I (λ_{0}^{(t)}, R^{(t)}))^{- 1} {(\begin{matrix} \frac{\partial ℓ (λ_{0}, R)}{\partial λ_{0}} \\ \frac{\partial ℓ (λ_{0}, R)}{\partial R} \end{matrix}) |}_{λ_{0} = λ_{0}^{(t)}, R = R^{(t)}}, t = 0, 1, 2, \dots .

Appendix 2. Derivation of score statistic $T_{s c}$

Differentiating $\frac{\partial ℓ (δ, λ_{0}, R)}{\partial δ}$ , $\frac{\partial ℓ (δ, λ_{0}, R)}{\partial λ_{0}}$ , $\frac{\partial ℓ (δ, λ_{0}, R)}{\partial R}$ with respect to δ, $λ_{0}$ and R, respectively. Thus, it is easily shown that the Fisher information matrix is given by

\begin{aligned} I (δ, λ_{0}, R) = (\begin{matrix} I_{11} & I_{12} & I_{13} \\ I_{12} & I_{22} & I_{23} \\ I_{13} & I_{23} & I_{33}, \end{matrix}), \end{aligned}

where

\begin{aligned} I_{11} & = E (- \frac{\partial^{2} ℓ (δ, λ_{0}, R)}{\partial δ^{2}}) \\ = 2 m_{+ 1}^{(2)} [\frac{R^{2} (λ_{0} + δ)^{2} - 2 R (λ_{0} + δ) - R + 2}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} + \frac{1}{λ_{0} + δ} + \frac{R^{2} (λ_{0} + δ)}{1 - R (λ_{0} + δ)}] \\ + \frac{m_{+ 1}^{(1)}}{(λ_{0} + δ) [1 - (λ_{0} + δ)]}, I_{12} = E (- \frac{\partial^{2} ℓ (δ, λ_{0}, R)}{\partial δ \partial λ_{0}}) = I_{11}, \\ I_{13} & = E (- \frac{\partial^{2} ℓ (δ, λ_{0}, R)}{\partial δ \partial R}) = 2 m_{+ 1}^{(2)} (λ_{0} + δ) [\frac{λ_{0} + δ - 1}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} + \frac{1}{1 - R (λ_{0} + δ)}], \\ I_{22} & = E (- \frac{\partial^{2} ℓ (δ, λ_{0}, R)}{\partial λ_{0}^{2}}) \\ = I_{11} + 2 m_{+ 0}^{(2)} [\frac{R^{2} λ_{0}^{2} - 2 R λ_{0} - R + 2}{1 + R λ_{0}^{2} - 2 λ_{0}} + \frac{1}{λ_{0}} + \frac{R^{2} λ_{0}}{1 - R λ_{0}}] + \frac{m_{+ 0}^{(1)}}{λ_{0} (1 - λ_{0})}, \\ I_{23} & = E (- \frac{\partial^{2} ℓ (δ, λ_{0}, R)}{\partial λ_{0} \partial R}) = I_{13} + 2 m_{+ 0}^{(2)} λ_{0} [\frac{λ_{0} - 1}{1 + R λ_{0}^{2} - 2 λ_{0}} + \frac{1}{1 - R λ_{0}}], \\ I_{33} & = E (- \frac{\partial^{2} ℓ (δ, λ_{0}, R)}{\partial R^{2}}) = m_{+ 0}^{(2)} [\frac{λ_{0}^{4}}{1 + R λ_{0}^{2} - 2 λ_{0}} + \frac{2 λ_{0}^{3}}{1 - R λ_{0}} + \frac{λ_{0}^{2}}{R}] \\ + m_{+ 1}^{(2)} [\frac{(λ_{0} + δ)^{4}}{1 + R (λ_{0} + δ)^{2} - 2 (λ_{0} + δ)} + \frac{2 (λ_{0} + δ)^{3}}{1 - R (λ_{0} + δ)} + \frac{(λ_{0} + δ)^{2}}{R}], \end{aligned}

and the first main-diagonal element of the inverse of the Fisher information matrix $I (δ, λ_{0}, R)$ is given by

\begin{aligned} I^{11} & = [I_{11} - (\begin{matrix} I_{12} & I_{13} \end{matrix}) {(\begin{matrix} I_{22} & I_{23} \\ I_{23} & I_{33} \end{matrix})}^{- 1} (\begin{matrix} I_{12} \\ I_{13} \end{matrix})]^{- 1} \\ = \frac{I_{22} I_{33} - (I_{23})^{2}}{2 I_{12} I_{13} I_{23} + I_{11} I_{22} I_{33} - I_{11} (I_{23})^{2} - I_{22} (I_{13})^{2} - I_{33} (I_{12})^{2}} . \end{aligned}

Therefore, the score statistic for testing $H_{0} : δ = δ_{0}$ can be given by

T_{s c} = {[\frac{\partial ℓ (δ, λ_{0}, R)}{\partial δ} \sqrt{I^{11}}] |}_{δ = δ_{0}, λ_{0} = {\tilde{λ}}_{0} (δ_{0}), R = \tilde{R} (δ_{0})},

where ${\tilde{λ}}_{0} (δ_{0})$ and $\tilde{R} (δ_{0})$ are the constrained MLEs of $λ_{0}$ and R under $H_{0}$ : $δ = δ_{0}$ , respectively.

Funding Statement

The work of Qiu was supported by the National Natural Science Foundation of China [Grant Nos. 11871124, 11471060] and the Natural Science Foundation of Chongqing [Grant No. cstc2018jcyjAX0241, cstc2020jcyj-msxmX0232].

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

1.Agresti A. and Coull B.A., Approximate is better than exact for interval estimation of binomial proportion, Amer. Statist. 52 (1998), pp. 119–126. [Google Scholar]
2.Agresti A. and Min Y., On small-sample confidence intervals for parameters in discrete distributions, Biometrics 57 (2001), pp. 963–971. [DOI] [PubMed] [Google Scholar]
3.Beal S.L., Asymptotic confidence intervals for the difference between two binomial parameters for use with small samples, Biometrics 43 (1987), pp. 941–950. [PubMed] [Google Scholar]
4.Bodian C.A., Intraclass correlation for two-by-two tables under three sampling designs, Biometrics 50 (1994), pp. 183–193. [PubMed] [Google Scholar]
5.Chan I.S.F and Zhang Z., Test-based exact confidence intervals for the difference of two binomial proportions, Biometrics 55 (1999), pp. 1202–1209. [DOI] [PubMed] [Google Scholar]
6.Chen X., A quasi-exact method for the confidence intervals of the difference between two independent binomial proportions in small sample cases, Stat. Med. 21 (2002), pp. 943–956. [DOI] [PubMed] [Google Scholar]
7.Dallal G.E., Paired Bernoulli trials, Biometrics 44 (1988), pp. 253–257. [PubMed] [Google Scholar]
8.DiCiccio T.J. and Efron B., Bootstrap confidence intervals, Statist. Sci. 11 (1996), pp. 189–228. [Google Scholar]
9.Donner A., Statistical methods in ophthalmology: An adjusted chi-square approach, Biometrics 45 (1989), pp. 605–611. [PubMed] [Google Scholar]
10.Efron B. and Tibshirani R.J., An Introduction to the Bootstrap, Chapman and Hall, London, 1993. [Google Scholar]
11.Fisher R.A., On the ‘probable error’ of a coefficient of correlation deduced from a small sample, Metron 1 (1921), pp. 3–32. [Google Scholar]
12.Le C.T., Testing for linear trends in proportions using correlated otolaryngology or ophthalmology data, Biometrics 44 (1988), pp. 299–303. [PubMed] [Google Scholar]
13.Li J.L., Applications of the bootstrap in ROC analysis, Commun. Statist. Simul. Comput. 41 (2012), pp. 865–877. [Google Scholar]
14.Li J.L, Tai B.C., and Nott D.J., Confidence interval for the bootstrap P-value and sample size calculation of the bootstrap test, J. Nonparametr. Stat. 21 (2009), pp. 649–661. [Google Scholar]
15.Mandel E.M., Bluestone C.D., Rockette H.E., Blatter M.M., Reisinger K.S., Wucher F.P., and Harper J., Duration of effusion after antibiotic treatment for acute otitis media: Comparison of cefaclor and amoxicillin, Pediatr. Infect. Dis. 1 (1982), pp. 310–316. [DOI] [PubMed] [Google Scholar]
16.Newcombe R.G., Measures of location for confidence intervals for proportions, Commun. Statist. Theory Methods 40 (2011), pp. 1743–1767. [Google Scholar]
17.Pei Y.B., Tang M.L., and Guo J.H., Testing the equality of two proportions for combined unilateral and bilateral data, Commun. Statist. Simul. Comput. 37 (2008), pp. 1515–1529. [Google Scholar]
18.Pei Y.B., Tang M.L., Wong W.K., and Guo J.H., Confidence intervals for correlated proportion differences from paired data in a two-arm randomised clinical trial, Statist. Methods Med. Res. 21 (2010), pp. 167–187. [DOI] [PubMed] [Google Scholar]
19.Rao C.R., Linear Statistical Inference and Its Applications, 2nd ed., Wiley, New York, 1985. [Google Scholar]
20.Rosner B., Statistical methods in ophthalmology: An adjustment for the intraclass correlation between eyes, Biometrics 38 (1982), pp. 105–114. [PubMed] [Google Scholar]
21.Rosner B. and Milton R.C., Significance testing for correlated binary outcome data, Biometrics 44 (1988), pp. 505–512. [PubMed] [Google Scholar]
22.Shao J. and Tu D.S., The Jackknife and Bootstrap, Springer-Verlag, New York, 1995. [Google Scholar]
23.Tang M.L., Pei Y.B., Wong W.K., and Li J.L., Goodness-of-fit tests for correlated paired binary data, Statist. Methods Med. Res. 21 (2010), pp. 331–345. [DOI] [PubMed] [Google Scholar]
24.Tang M.L., Tang N.S., and Chan I.S.F., Confidence interval construction for proportion difference in small-sample paired studies, Stat. Med. 24 (2005), pp. 3565–3579. [DOI] [PubMed] [Google Scholar]
25.Tang N.S., Tang M.L., and Qiu S.F., Testing the equality of proportions for correlated otolaryngologic data, Comput. Statist. Data Anal. 52 (2008), pp. 3719–3729. [Google Scholar]
26.Tang M.L., Tang N.S., and Rosner B., Statistical inference for correlated data in ophthalmologic studies, Stat. Med. 25 (2006), pp. 2771–2783. [DOI] [PubMed] [Google Scholar]
27.Wilson E.B., Probable inference, the law of succession, and statistical inference, J. Am. Stat. Assoc. 22 (1927), pp. 209–212. [Google Scholar]

[CIT0001] 1.Agresti A. and Coull B.A., Approximate is better than exact for interval estimation of binomial proportion, Amer. Statist. 52 (1998), pp. 119–126. [Google Scholar]

[CIT0002] 2.Agresti A. and Min Y., On small-sample confidence intervals for parameters in discrete distributions, Biometrics 57 (2001), pp. 963–971. [DOI] [PubMed] [Google Scholar]

[CIT0003] 3.Beal S.L., Asymptotic confidence intervals for the difference between two binomial parameters for use with small samples, Biometrics 43 (1987), pp. 941–950. [PubMed] [Google Scholar]

[CIT0004] 4.Bodian C.A., Intraclass correlation for two-by-two tables under three sampling designs, Biometrics 50 (1994), pp. 183–193. [PubMed] [Google Scholar]

[CIT0005] 5.Chan I.S.F and Zhang Z., Test-based exact confidence intervals for the difference of two binomial proportions, Biometrics 55 (1999), pp. 1202–1209. [DOI] [PubMed] [Google Scholar]

[CIT0006] 6.Chen X., A quasi-exact method for the confidence intervals of the difference between two independent binomial proportions in small sample cases, Stat. Med. 21 (2002), pp. 943–956. [DOI] [PubMed] [Google Scholar]

[CIT0007] 7.Dallal G.E., Paired Bernoulli trials, Biometrics 44 (1988), pp. 253–257. [PubMed] [Google Scholar]

[CIT0008] 8.DiCiccio T.J. and Efron B., Bootstrap confidence intervals, Statist. Sci. 11 (1996), pp. 189–228. [Google Scholar]

[CIT0009] 9.Donner A., Statistical methods in ophthalmology: An adjusted chi-square approach, Biometrics 45 (1989), pp. 605–611. [PubMed] [Google Scholar]

[CIT0010] 10.Efron B. and Tibshirani R.J., An Introduction to the Bootstrap, Chapman and Hall, London, 1993. [Google Scholar]

[CIT0011] 11.Fisher R.A., On the ‘probable error’ of a coefficient of correlation deduced from a small sample, Metron 1 (1921), pp. 3–32. [Google Scholar]

[CIT0012] 12.Le C.T., Testing for linear trends in proportions using correlated otolaryngology or ophthalmology data, Biometrics 44 (1988), pp. 299–303. [PubMed] [Google Scholar]

[CIT0013] 13.Li J.L., Applications of the bootstrap in ROC analysis, Commun. Statist. Simul. Comput. 41 (2012), pp. 865–877. [Google Scholar]

[CIT0014] 14.Li J.L, Tai B.C., and Nott D.J., Confidence interval for the bootstrap P-value and sample size calculation of the bootstrap test, J. Nonparametr. Stat. 21 (2009), pp. 649–661. [Google Scholar]

[CIT0015] 15.Mandel E.M., Bluestone C.D., Rockette H.E., Blatter M.M., Reisinger K.S., Wucher F.P., and Harper J., Duration of effusion after antibiotic treatment for acute otitis media: Comparison of cefaclor and amoxicillin, Pediatr. Infect. Dis. 1 (1982), pp. 310–316. [DOI] [PubMed] [Google Scholar]

[CIT0016] 16.Newcombe R.G., Measures of location for confidence intervals for proportions, Commun. Statist. Theory Methods 40 (2011), pp. 1743–1767. [Google Scholar]

[CIT0017] 17.Pei Y.B., Tang M.L., and Guo J.H., Testing the equality of two proportions for combined unilateral and bilateral data, Commun. Statist. Simul. Comput. 37 (2008), pp. 1515–1529. [Google Scholar]

[CIT0018] 18.Pei Y.B., Tang M.L., Wong W.K., and Guo J.H., Confidence intervals for correlated proportion differences from paired data in a two-arm randomised clinical trial, Statist. Methods Med. Res. 21 (2010), pp. 167–187. [DOI] [PubMed] [Google Scholar]

[CIT0019] 19.Rao C.R., Linear Statistical Inference and Its Applications, 2nd ed., Wiley, New York, 1985. [Google Scholar]

[CIT0020] 20.Rosner B., Statistical methods in ophthalmology: An adjustment for the intraclass correlation between eyes, Biometrics 38 (1982), pp. 105–114. [PubMed] [Google Scholar]

[CIT0021] 21.Rosner B. and Milton R.C., Significance testing for correlated binary outcome data, Biometrics 44 (1988), pp. 505–512. [PubMed] [Google Scholar]

[CIT0022] 22.Shao J. and Tu D.S., The Jackknife and Bootstrap, Springer-Verlag, New York, 1995. [Google Scholar]

[CIT0023] 23.Tang M.L., Pei Y.B., Wong W.K., and Li J.L., Goodness-of-fit tests for correlated paired binary data, Statist. Methods Med. Res. 21 (2010), pp. 331–345. [DOI] [PubMed] [Google Scholar]

[CIT0024] 24.Tang M.L., Tang N.S., and Chan I.S.F., Confidence interval construction for proportion difference in small-sample paired studies, Stat. Med. 24 (2005), pp. 3565–3579. [DOI] [PubMed] [Google Scholar]

[CIT0025] 25.Tang N.S., Tang M.L., and Qiu S.F., Testing the equality of proportions for correlated otolaryngologic data, Comput. Statist. Data Anal. 52 (2008), pp. 3719–3729. [Google Scholar]

[CIT0026] 26.Tang M.L., Tang N.S., and Rosner B., Statistical inference for correlated data in ophthalmologic studies, Stat. Med. 25 (2006), pp. 2771–2783. [DOI] [PubMed] [Google Scholar]

[CIT0027] 27.Wilson E.B., Probable inference, the law of succession, and statistical inference, J. Am. Stat. Assoc. 22 (1927), pp. 209–212. [Google Scholar]

PERMALINK

Confidence intervals for assessing equivalence of two treatments with combined unilateral and bilateral data

Shi-Fang Qiu

Ji-Ran Tao

Abstract

1. Introduction

Table 1.

2. Data structure

Table 2.

3. Confidence interval estimators

3.1. CIs based on Wald-type statistics

3.2. CI based on Agresti–Coull method

3.3. CI based on the inverse hyperbolic tangent transformation

3.4. CI based on the Haldane method

3.5. CI based on the score test statistic

3.6. CI based on the profile-likelihood-ratio test

3.7. Bootstrap-resampling CIs

4. Simulation study

Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

5. Real example

Table 3.

6. Conclusion and discussion

Appendices.

Appendix 1. The restrained MLEs ${\tilde{λ}}_{0} (δ)$ and $\tilde{R} (δ)$ of $λ_{0}$ and R given the value of δ

Appendix 2. Derivation of score statistic $T_{s c}$

Funding Statement

Disclosure statement

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Confidence intervals for assessing equivalence of two treatments with combined unilateral and bilateral data

Shi-Fang Qiu

Ji-Ran Tao

Abstract

1. Introduction

Table 1.

2. Data structure

Table 2.

3. Confidence interval estimators

3.1. CIs based on Wald-type statistics

3.2. CI based on Agresti–Coull method

3.3. CI based on the inverse hyperbolic tangent transformation

3.4. CI based on the Haldane method

3.5. CI based on the score test statistic

3.6. CI based on the profile-likelihood-ratio test

3.7. Bootstrap-resampling CIs

4. Simulation study

Figure 1.

Figure 2.

Figure 3.

Figure 4.

Figure 5.

5. Real example

Table 3.

6. Conclusion and discussion

Appendices.

Appendix 1. The restrained MLEs λ~0(δ) and R~(δ) of λ0 and R given the value of δ

Appendix 2. Derivation of score statistic Tsc

Funding Statement

Disclosure statement

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Appendix 1. The restrained MLEs ${\tilde{λ}}_{0} (δ)$ and $\tilde{R} (δ)$ of $λ_{0}$ and R given the value of δ

Appendix 2. Derivation of score statistic $T_{s c}$