Abstract
Responses from the paired organs are generally highly correlated in bilateral studies, statistical procedures ignoring the correlation could lead to incorrect results. Note the intraclass correlation in the study of combined unilateral and bilateral outcomes; 11 confidence intervals (CIs) including 7 asymptotic CIs and 4 Bootstrap-resampling CIs for assessing the equivalence of 2 treatments are derived under Rosner's correlated binary data model. Performance is evaluated with respect to the empirical coverage probability (ECP), the empirical coverage width (ECW) and the ratio of the mesial non-coverage probability to the non-coverage probability (RMNCP) via simulation studies. Simulation results show that (i) all CIs except for the Wald CI and the bias-corrected Bootstrap percentile CI generally produce satisfactory ECPs and hence are recommended; (ii) all CIs except for the bias-corrected Bootstrap percentile CI provide preferred RMNCPs and are more symmetrical; (iii) as the measurement of the dependence increases, the ECWs of all CIs except for the score CI and the profile likelihood CI show increasing patterns that look like linear, while there is no obvious pattern on the ECPs of all CIs except for the profile likelihood CI. A data set from an otolaryngologic study is used to illustrate the proposed methods.
Keywords: Bootstrap-resampling method, combined unilateral and bilateral data, confidence interval, intra-class correlation, proportion difference
1. Introduction
In randomized clinical trials, it is frequently that we may collect data from the paired organs or body parts, and the outcomes may be either bilateral (e.g. two organs are sick) or unilateral response (only one organ or a body part is sick). Consequently, it causes the data generally highly correlated in bilateral studies, and statistical procedures ignoring the intraclass correlation of the bilateral observations may lead to incorrect inference [4,7,9,20,21]. Taking the intraclass correlation into consideration, Rosner [20] presented an intraclass correlation model for analyzing ophthalmologic data to which a person may have contributed two eyes worth of information. Le [12] considered the testing for linear trends in proportions using correlated otolaryngology or ophthalmology data. Note that asymptotic test procedures could also yield unacceptably high type I error rates for small sample studies and sparse data structures even if the intraclass correlation is taken into consideration, Tang et al. [26] proposed the exact unconditional and approximate unconditional procedures based on three test statistics (i.e. the Wald statistic, the statistics based on a dependence model and an independence model proposed by Rosner and Milton [21]). Followed by Tang et al. [26], eight test statistics for testing the equality of two treatments and the corresponding asymptotical and approximate unconditional test procedures in the bilateral study are developed by Tang et al. [25]. Tang et al. [23] investigated the goodness-of-fit and the model selection for a few popular statistical models for correlated paired binary outcomes. A variety of confidence intervals for estimating the difference between the proportions of responders in a randomized two-armed clinical trial are proposed in Pei et al. [18].
However, individuals may produce either unilateral data (e.g. data from only one organ) or bilateral data (e.g. data from two organs) in many medical comparative studies (e.g. otolaryngologic or ophthalmologic studies). For example, to evaluate the efficacy of two antibiotics (i.e. Cefaclor and Amoxicillin) for the treatment of otitis media with effusion (OME) in an otolaryngologic study, Mandel et al. [15] considered a randomized double-blinded clinical trial. In this trial, a total of 214 children (293 ears) underwent unilateral or bilateral tympanocentesis before they were randomly assigned to one of the two treatments. After a 14-day course of treatment with one of the antibiotics, the outcome of each child was recorded at the end of the treatment. In this study, only 203 evaluable children without repeat tympanocentesis, treatment change or tympanic membrane perforations have received one of the treatments. For the group with unilateral disease, two results were determined: cured and not-cured; and for the group with bilateral disease, three results, i.e. cured (both ears become OME-cured), partially cured (only one ear becomes OME-cured) and both ears are not cured were recorded. The data are reported in Table 1.
Table 1.
OME status after 14 days and 42 days of treatment (in terms of No. of children).
| Amoxicillin | Cefaclor | |||||
|---|---|---|---|---|---|---|
| No. of OME-free ears | 0 | 1 | 2 | 0 | 1 | 2 |
| Unilateral | 39 | 27 | – | 24 | 38 | – |
| Bilateral | 15 | 3 | 13 | 14 | 9 | 21 |
For this study, it is important to test whether the cure rates are identical between the Cefaclor and Amoxicillin groups. Under the equal correlation coefficient model, Pei et al.[17] considered the equivalence testing of two successful cure rates and developed several asymptotic test procedures under the independent and the dependent models, respectively. However, CI estimators for comparative studies with combined unilateral and bilateral binary data have not been developed. In this article, we consider the CI construction for proportion difference in comparative medical studies with combined unilateral and bilateral binary outcomes under Rosner's correlated binary data model.
This article is organized as follows. The data structure and the probability model are described in Section 2. Seven asymptotic confidence intervals and four Bootstrap-resampling CIs are proposed in Section 3. In Section 4, the performance of all CIs is evaluated via simulation studies in terms of the ECP, the ECW and the RMNCP. An illustration of our methodologies with otolaryngological data is presented in Section 5. The paper closes with a brief conclusion and discussion in Section 6.
2. Data structure
Let represent the number of individuals with ear being cured in the ith treatment for the unilateral group, and represent the number of individuals with h ear/ears being cured in the ith treatment for the bilateral group, and be the corresponding probabilities ( , ). Let (s = 1, 2) and .
Let if the jth individual received the ith treatment in the unilateral group is OME-cured and otherwise. In the bilateral group, if the kth ear of the jth individual received the ith treatment is OME-cured and otherwise ( ). Following Rosner [20], it is eligible to assume that the correlation of the responses from the two ears should be the same for the two treatments, i.e.
for . As shown in [20], the correlation coefficient between and is for i = 0, 1, then R represents a measurement of the dependence between two ears of the same individual in two treatments. Specially, R = 1 if two ears are completely independent, while if two ears are completely dependent. By simple calculation, we can have the following probability model under Rosner's assumption:
| (1) |
for i = 0, 1. The data structure and the probability model for the combined unilateral and bilateral outcomes are given in Table 2.
Table 2.
The data structure and probability model for combined unilateral and bilateral data.
| Unilatral | Bilateral | ||||
|---|---|---|---|---|---|
| No. of ears with OME-free | i = 0 | i = 1 | i = 0 | i = 1 | Total |
| 0 | |||||
| 1 | |||||
| 2 | – | – | |||
| Total | |||||
According to Rosner's assumption, the log-likelihood function of the parameters , and R for the observation data , , , , : is given by
| (2) |
where C is a constant that doesn't involve parameters.
Let be the difference between two proportions for two treatments, then . Therefore, the log-likelihood function given in Equation (2) becomes
| (3) |
It is easily shown that the parameter vector satisfies the following conditions:
In order to investigate whether there is a significant difference between the two treatments, we are interested in the confidence interval construction for the proportion difference (i.e. δ) in this article. Seven asymptotic confidence interval estimators and four Bootstrap-resampling confidence intervals are developed and evaluated as follows.
3. Confidence interval estimators
3.1. CIs based on Wald-type statistics
It is easily shown that the sample estimates of and are, respectively, given by
| (4) |
Therefore, the sample estimate of δ can be given by
| (5) |
and the maximum likelihood estimation of the parameter R can be obtained by solving the following equation:
| (6) |
It is easily shown that the variance of (i = 0, 1) can be given by
| (7) |
Therefore, the variance of is given by and it can be estimated by
| (8) |
and
| (9) |
respectively, where , are the constrained maximum-likelihood estimates (CMLEs) of and R given the value of δ, and they can be obtained by solving the following equations:
| (10) |
No closed-form exists, an iterative algorithm (e.g. Fisher-score iterative algorithm, refer to Appendix 1 for details) can be used to find the solutions to the above equations. When the solutions are out of the parameter space, we can use the search algorithm to find the CMLEs. According to the central limit theorem, it is easily shown that is asymptotically distributed as standard normal distribution as all and ( ) are large. Therefore, the confidence interval for δ based on the Wald method is given by
| (11) |
where is the upper percentile of the standard normal distribution.
Similar to Wilson [27], the confidence lower and upper limits for δ can be obtained by solving the equations with respect to and , respectively. Let and , the following bisection method can be used to find the solutions of these equations:
Step 1: Select a suitable constant h , for example, 0.1, 0.05, etc., start from and search for the minimum positive integer k such that , where .
Step 2: Use the bisection method to find the root of on the interval , then we obtain the lower limit of the interval, denoted as .
Step 3: Similarly, start from 1 and search for the minimum positive integer k such that , where . Use the bisection method to find the root of on the interval , then we obtain the upper limit of the interval, denoted as .
Therefore, the confidence interval for δ based on the Wilson method is given by .
3.2. CI based on Agresti–Coull method
As shown in many literatures, the Wald CI given by Equation (11) usually performs not well when the sample size is small in the sense that it usually provides empirical coverage probabilities lower than the pre-specified confidence level. Adding a small count to every cell before computing the interval limits is a common strategy. For example, Agresti and Coull [1] suggested a Wald CI by adding 2 for one-sample binomial problems. Similar to Agresti and Coull [1], we use the simulation approach to find the small count to be added. The simulation study shows that the Wald CI given in Equation (11) by adding 0.25 to all cell counts (i.e. and , , h = 0, 1, 2, i = 0, 1) performs well. We denote this CI as .
3.3. CI based on the inverse hyperbolic tangent transformation
Note that the Wald CI and the CI based on the Wilson method are derived from the normal approximation of . However, when the sample size is small or the data have a sparse structure, the asymptotical distribution of is usually highly skewed. In this case, the inverse hyperbolic tangent transformation (i.e. Fisher's Z transformation [11]) can be used to improve the normal approximation of . By using this transformation for , we have . It is easily shown that the expectation of is given by , and the variance of is given by by using the delta method. The corresponding variance estimation of is given by . When all and (i = 0, 1) are large, the test statistic is asymptotically distributed as a standard normal distribution, so the confidence interval for can be given by , where
Therefore, the confidence interval for δ can be obtained via the inverse transformation of , which is given by
| (12) |
If equals or 1, the confidence limits for are not defined. In this case, we just take the CI for δ to be .
3.4. CI based on the Haldane method
According to the central limit theorem, since is asymptotically distributed as the standard normal distribution when the sample size is large enough, then we have
Let , then and . Thus, the variance can be expressed in terms of parameters δ and η. Similar to Beal [3], we can consider the following quadratic equation of δ:
where
with and . If a>0 and , the asymptotic confidence limits of δ are given by , where , . The unknown parameters η and R can be estimated by and , respectively, therefore, the asymptotic confidence interval for δ is given by
| (13) |
As noted by Beal [3], this confidence interval can be regarded as the extension of ‘Haldane Interval’ to account for the intraclass correlation of the combined unilateral and bilateral data. Note that the probability that the estimation of is 0 cannot be negligible when are small, especially when and (i = 0, 1) are small. In this case, we can adopt the simple adjustment by adding 0.5 to each cell.
3.5. CI based on the score test statistic
Using the general theory of efficient scores proposed by Rao [19], the score statistic for testing the hypothesis can be given by
where
is the score function, and is the th element of the inverse of the Fisher information matrix, which is given by
where ( ) is the element of the Fisher information matrix (refer to Appendix for details).
As shown in [19], is asymptotically distributed as a standard normal distribution under as all and (i = 0, 1) are large. Therefore, the confidence interval for δ based on the score test can be given by
| (14) |
where the lower limit is the solution of the equation with respect to , and the upper limit is the solution of the equation , respectively. Similarly, no closed form exists; the above bisection method given in Section 3.1 can be used to obtain the solutions.
3.6. CI based on the profile-likelihood-ratio test
For testing the hypothesis , the likelihood ratio test statistic is given by
Similarly, is asymptotically distributed as the Chi-square distribution with one degree of freedom under when all and (i = 0, 1). Therefore, the asymptotic profile likelihood CI for δ can be obtained by inverting the likelihood ratio test , i.e. the confidence lower and upper limits for δ satisfy
where is the upper α percentile of the chi-square distribution with one degree of freedom. Let . Similarly, the following bisection method can be used to find the upper and lower limits:
Step 1: Select a suitable constant h, for example, 0.1, 0.05, etc., starting from and search for the minimum positive integer k such that , where .
Step 2: Use the bisection method to find the root of on the interval , then we obtain the lower limit of the interval, denoted as .
Step 3: Similarly, start from 1 and search for the minimum positive integer k such that , where . Use the bisection method to find the root of on the interval , then we obtain the upper limit of the interval, denoted as .
Therefore, the confidence interval for δ based on the likelihood ratio test is given by
| (15) |
3.7. Bootstrap-resampling CIs
It is well known that the Bootstrap-resampling method has been applied extensively in many fields. For example, Efron and Tibshirani [10] and Shao and Tu [22] used the Bootstrap-resampling method to estimate the variability of complicated statistics, and Li [13] suggested the use of a Bootstrap procedure to generate the empirical distribution of the test statistic in ROC analysis. In particular, when the sample size is small, confidence intervals for proportion difference based on the large sample approximation may not be reliable. In this case, the Bootstrap-resampling method is usually recommended to construct CIs [10,22]. Therefore, we consider the following parametric Bootstrap-resampling procedure to construct the CI for δ:
Step 1: Given the observed data , , , , : , we can obtain the parameter estimators , , and of the parameters , , δ and R via Equations (4)–(6). Let be the estimate of that is calculated from the observed data.
Step 2: Generate the Bootstrap sample , , , , : from the product of binomial and trinomial distributions based on the estimated parameters, i.e. follows Binomial distribution and follows Trinomial distribution (i = 0, 1), where and ( , h = 0, 1, 2) are given by Equation (1), in which parameters , and R are substituted by their estimators , and , respectively.
Step 3: For each generated sample , calculate the estimations , and of , δ and R via Equations (4)–(6).
Step 4: Independently repeating the above process (i.e. Steps 2–3) B times, we can obtain B Bootstrap estimators , and of δ, and R ( ), respectively. The B Bootstrap estimators are then ordered from the smallest to the largest, and let , , ··· , be the ordered values.
(i) Bootstrap percentile CI
Following Efron and Tibshirani [10], the Bootstrap percentile CI for δ is given by
| (16) |
where denotes the maximum integer not greater than a.
(ii) Bootstrap percentile-t CI
Let be the estimated standard deviation of the bth Bootstrap estimator . For each of the B bootstrap samples, we can obtain . Following Efron and Tibshirani [10], the bootstrap percentile-t confidence interval for δ can be obtained by
| (17) |
where 's denote the ordered values of 's from the smallest to the largest and is the positive squared root of given in Step 1.
(iii) Bias-corrected Bootstrap percentile CI
Following DiCiccio and Efron [8], the confidence interval for δ can be given by
| (18) |
where , with . Here, is the standard normal distribution function, and is its inverse.
(iv) Bootstrap percentile-t CI combining with the inverse hyperbolic tangent transformation
Since that a variance stabilizing transformation much like the inverse hyperbolic tangent transformation can give good results when this transformation is used to construct CI for numbers lying between −1 and +1, then we consider a Bootstrap percentile-t CI combining with the inverse hyperbolic tangent transformation. First, for each of the B bootstrap samples, we can obtain , then the bootstrap percentile-t confidence interval for can be obtained by , , where 's denote the ordered values of 's from the smallest to the largest. Therefore, the bootstrap confidence interval for δ is given by
| (19) |
Note that the asymptotical methods based on the large sample assumption do not necessarily control their actual coverage probabilities at the pre-specified confidence level for small sample sizes. In this case, some adjusted methods can be used to construct the CI for proportion difference, for example, the approximate unconditional methods proposed in Tang et al. [24], the exact binomial method and mid-P method modified from the exact binomial method proposed in Li et al. [14]. However, for two groups of combined unilateral and bilateral data, a severe computing burden will be encountered for our problem even for the very small sample size . Therefore, we have not adopted the exact and approximate unconditional methods in this article.
4. Simulation study
In this section, we investigate the performance of the proposed confidence intervals via simulation studies in terms of the empirical coverage probability (ECP) and the empirical coverage width (ECW). In general, a method to construct a CI is better if the ECP is closer to the pre-specified confidence level and the ECW is smaller. Following Newcombe [16], the location of a CI can be characterized in terms of the balance between the mesial non-coverage probability (MNCP) and the distal non-coverage probability (DNCP). A simple index, i.e. the ratio of the MNCP to the non-coverage probability (RMNCP=MNCP/( ECP )) can be used to evaluate the location of a CI, where DNCP and MNCP are defined with respect to the true value of δ. These evaluation indices are defined as follows.
(i) Empirical coverage probability
where K is the number of replications and is the indicator function, , , , , : is the kth replication, and is the CI that is constructed from the set of by any of the 11 methods under evaluation.
(ii) Empirical coverage width
(iii) Ratio of the mesial non-coverage probability to the non-coverage probability
According to Newcombe [16], the mesial non-coverage probability (MNCP) is defined as
where is given by
and distal non-coverage probability (DNCP) is defined as
where is given by
The ratio of the mesial non-coverage probability to the non-coverage probability (RMNCP) is then defined as
As shown in [16], a CI is classified as satisfactory if the RMNCP lies in [0.4, 0.6], and too mesially located if the ratio is smaller than 0.4, and too distal if it is greater than 0.6.
To evaluate the proposed methods, we consider three kinds of sample size designs of , , , : (i) small sample size , 20, 20, ; (ii) moderate sample size , 30, 30, ; and (iii) large sample size , 50, 50, . We only report the simulation results for balanced sample sizes due to no substantial difference for the performance between balanced and unbalanced sample sizes. For parameter settings of ( , δ, R), a total of combinations with , 0.10, 0.15, , 0.3, 0.5 and , i.e. from 1.0 to 1.5 with step size 0.1 are considered in the simulation studies. A total of 5000 replications are conducted to the simulation studies and the bootstrap CI of δ is based on 2000 replications. According to the definition of MNCP, when , the RMNCP is always equal to 1.0. Therefore, we only report RMNCPs of all CIs for the situations with .
Figures 1(i), 2(i) and 3(i) report the change of ECPs of two-sided CIs for δ as the change of R. It is observed that and have slightly deflated ECPs when the sample size is small (e.g. ), and is slightly conservative for , under the small sample size (i.e. ), other CIs generally perform well in the sense that they can well control their ECPs around the pre-specified confidence level. Although the intra-class correlation is not ignorable in bilateral data analysis, it seems that the CIs except for are independent of data structure, i.e. there is no obvious pattern on ECPs as R changes as shown in Figures 1–3, while the ECP of increases as the increase of R for with and 0.15.
Figure 1.
(i) ECPs of two-sided 95% CIs of δ versus R; (ii) ECWs of two-sided 95% CIs of δ versus R under sample size .
Figure 2.
(i) ECPs of two-sided 95% CIs of δ versus R; (ii) ECWs of two-sided 95% CIs of δ versus R under sample size .
Figure 3.
(i) ECPs of two-sided 95% CIs of δ versus R; (ii) ECWs of two-sided 95% CIs of δ versus R under sample size .
Figures 1(ii), 2(ii) and 3(ii) report the change of ECWs of two-sided CIs for δ as the change of R. It is interesting to find that ECWs of the CIs except for and tend to be wider as R increases, and can be seen increasing patterns that look like linear, while ECWs of and show the patterns that look like nonlinear. As expected, ECWs of all CIs are shorter with the increase of the sample size.
Figure 4 reports the change of RMNCPs of two-sided CIs for δ as the change of R. It is observed that all CIs except for usually have satisfactory interval locations, as their ratios of the mesial non-coverage probability to total non-coverage probability are close to 0.5 even for small sample sizes, while has too mesially located interval in some cases. The larger the sample sizes, the closer to 0.5 the RMNCPs of all CIs. As shown in Figure 4, it seems that there is no obvious pattern on RMNCPs as R changes.
Figure 4.
RMNCPs of two-sided 95% CIs of δ versus R under different sample sizes of : (i) ; (ii) ; (iii) .
To further investigate the overall performance of the CIs across a range of values for the nuisance parameters and R for given values of δ: i.e. , let and ; if , , ; if , , , where means that the value is from a to c with step size b. Four sample size designs of , , , are considered in this study, i.e. (i) ; (ii) ; (iii) ; and (iv) . Boxplots of ECPs, ECWs and RMNCPs of various CIs are reported in Figure 5.
Figure 5.
Boxplots of ECPs, ECWs, RMNCPs of two-sided 95% CIs for δ under different sample sizes of : (i) ; (ii) ; (iii) and (iv) .
In terms of the coverage probability, the boxplots of ECP in Figure 5 show that the results of various methods are generally satisfactory. Furthermore, , and perform better than the others, as their median empirical coverage probabilities are closer to the preassigned confidence level even under small sample sizes. When the sample size increases, the performance of other CIs show improvements. The results given in Figures 1–3 also support these findings.
With regard to the width of the various CIs, the boxplots of ECW in Figure 5 show that has wider interval width than the others under small and moderate sample sizes (e.g. , , , = and ), as its median empirical coverage widths are larger than the others. However, when the sample size is large, all CIs have similar median ECWs.
For the location of the various CIs, the boxplots of RMNCP in Figure 5 show that the RMNCPs of are almost in , suggesting that the CI has a satisfactory location. Except for , the other CIs can also produce good results, as their median RMNCPs are closer to 0.5, while for , the values of RMNCP suggest that the CI tends to be slightly too mesial when the sample size is not large; the results given in Figures 1–3 also support these findings.
In summary, with the increase of the sample size, ECPs of all CIs are closer to the pre-specified confidence level, and all CIs have better interval locations and smaller interval widths. Generally, all CIs except for and can usually achieve the nominal coverage and exhibit shorter interval widths from small to large sample sizes, and hence are recommended to practical applications.
5. Real example
To illustrate the proposed methods in this article, we re-visit the otolaryngological study in Section 1. Let i = 0 be the treatment of Amoxicillin and i = 1 be the treatment of Cefaclor, respectively. According to Equations (4)–(6), , and . The corresponding confidence intervals for δ based on various methods are reported in Table 3.
Table 3.
Various confidence intervals of δ for real data.
| Methods | CI | Width | Methods | CI | Width |
|---|---|---|---|---|---|
| 0.2634 | 0.3027 | ||||
| 0.2617 | 0.2648 | ||||
| 0.2619 | 0.2420 | ||||
| 0.2620 | 0.2515 | ||||
| 0.2620 | 0.2430 | ||||
| 0.2690 |
It is noteworthy that the lower limits of all CIs except for and are all greater than 0.0. The results indicate that there is no evidence to support rejecting the hypothesis at the confidence level using and , although we reject it at this confidence level using the other nine methods. In this case, which result is reliable? The results of our simulation study show that usually have a little wider interval widths than other CIs, although it can well control it's ECPs around the pre-specified confidence level. Since the upper limits of all CIs are greater than 0.0. Moreover, the lower limits of and are very close to 0.0. Therefore, we are more inclined to the conclusion that the treatment of Amoxicillin is more effective than the treatment of Cefaclor at the nominal level.
6. Conclusion and discussion
We consider the problem of CI construction for the proportion difference based on combined unilateral and bilateral data, which are commonly observed in the paired organs or two body parts studies. Seven asymptotic CIs, i.e. , , , , , and based on Rosner's dependence model, are constructed. Together with 4 bootstrap resampling methods, i.e. , , and , we have developed 11 methods to construct CIs. A large-scale empirical study has been conducted to evaluate the performance of these 11 methods from different aspects. Overall, the empirical results suggest that all CIs except for and generally perform well from small to large sample size designs, while and produce a little deflated ECPs when the sample size is small (e.g. , , , ), and is slightly conservative for , under small sample sizes (e.g. , , , ). All CIs can well control their ECPs close to the pre-specified confidence level with preferred RMNCPs and shorter interval widths when sample sizes are large. Therefore, all CIs except for and are recommended for practical applications for small to large sample size designs, and when the sample size is large, and can also be recommended for applications.
The intra-class correlation is not ignorable in bilateral data analysis. Therefore, simulation studies are conducted to investigate the effect of the measurement of dependence (i.e. R) on ECP, ECW and RMNCP of various CIs. It is interesting to find that the ECWs of all CIs except for and show like a linear increasing trend as R increases, and and show like a nonlinear trend. However, except for , there is no obvious pattern on ECPs of other CIs, and for all CIs, there is no obvious pattern on RMNCPs.
It is well known that confidence intervals for proportion difference based on the large sample approximation may not be reliable when the sample size is small. In this case, other CI construction methods are available in the literature, such as exact unconditional (e.g. [2,5,6]) and approximate unconditional methods [24]. However, for two groups of combined unilateral and bilateral data, a severe computing burden will be encountered for our problem even under very small sample sizes. In addition, several methods proposed in this article perform well even under small sample sizes. Therefore, we do not consider these methods in this article.
The equivalence testing of two cure rates in the paired organs or two body parts studies has been investigated in other literature, for example, [18,20,21,25,26]; however, they considered the situation in which only bilateral binary data are available. When unilateral data are also available, although Pei et al. [17] have investigated these combined data, they just considered the hypothesis testing under the equal correlation coefficient model. In this article, we investigate a number of methods to construct CIs in the presence of both bilateral and unilateral data under Rosner's correlated binary data model, and several effective methods are recommended for practical applications.
Appendices.
Appendix 1. The restrained MLEs and of and R given the value of δ
Given the value of δ, the log-likelihood function of is given by
Differentiating with respect to and R yields
Then, the restrained MLEs and of and R satisfy the following equations:
Differentiating and with respect to and R leads to
From (h = 0, 1, 2, i = 0, 1 and k = 1, 2), we have
Then, the Fisher information matrix is given by
Then the restrained MLEs and by iteratively solving the following equation
Appendix 2. Derivation of score statistic
Differentiating , , with respect to δ, and R, respectively. Thus, it is easily shown that the Fisher information matrix is given by
where
and the first main-diagonal element of the inverse of the Fisher information matrix is given by
Therefore, the score statistic for testing can be given by
where and are the constrained MLEs of and R under : , respectively.
Funding Statement
The work of Qiu was supported by the National Natural Science Foundation of China [Grant Nos. 11871124, 11471060] and the Natural Science Foundation of Chongqing [Grant No. cstc2018jcyjAX0241, cstc2020jcyj-msxmX0232].
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
- 1.Agresti A. and Coull B.A., Approximate is better than exact for interval estimation of binomial proportion, Amer. Statist. 52 (1998), pp. 119–126. [Google Scholar]
- 2.Agresti A. and Min Y., On small-sample confidence intervals for parameters in discrete distributions, Biometrics 57 (2001), pp. 963–971. [DOI] [PubMed] [Google Scholar]
- 3.Beal S.L., Asymptotic confidence intervals for the difference between two binomial parameters for use with small samples, Biometrics 43 (1987), pp. 941–950. [PubMed] [Google Scholar]
- 4.Bodian C.A., Intraclass correlation for two-by-two tables under three sampling designs, Biometrics 50 (1994), pp. 183–193. [PubMed] [Google Scholar]
- 5.Chan I.S.F and Zhang Z., Test-based exact confidence intervals for the difference of two binomial proportions, Biometrics 55 (1999), pp. 1202–1209. [DOI] [PubMed] [Google Scholar]
- 6.Chen X., A quasi-exact method for the confidence intervals of the difference between two independent binomial proportions in small sample cases, Stat. Med. 21 (2002), pp. 943–956. [DOI] [PubMed] [Google Scholar]
- 7.Dallal G.E., Paired Bernoulli trials, Biometrics 44 (1988), pp. 253–257. [PubMed] [Google Scholar]
- 8.DiCiccio T.J. and Efron B., Bootstrap confidence intervals, Statist. Sci. 11 (1996), pp. 189–228. [Google Scholar]
- 9.Donner A., Statistical methods in ophthalmology: An adjusted chi-square approach, Biometrics 45 (1989), pp. 605–611. [PubMed] [Google Scholar]
- 10.Efron B. and Tibshirani R.J., An Introduction to the Bootstrap, Chapman and Hall, London, 1993. [Google Scholar]
- 11.Fisher R.A., On the ‘probable error’ of a coefficient of correlation deduced from a small sample, Metron 1 (1921), pp. 3–32. [Google Scholar]
- 12.Le C.T., Testing for linear trends in proportions using correlated otolaryngology or ophthalmology data, Biometrics 44 (1988), pp. 299–303. [PubMed] [Google Scholar]
- 13.Li J.L., Applications of the bootstrap in ROC analysis, Commun. Statist. Simul. Comput. 41 (2012), pp. 865–877. [Google Scholar]
- 14.Li J.L, Tai B.C., and Nott D.J., Confidence interval for the bootstrap P-value and sample size calculation of the bootstrap test, J. Nonparametr. Stat. 21 (2009), pp. 649–661. [Google Scholar]
- 15.Mandel E.M., Bluestone C.D., Rockette H.E., Blatter M.M., Reisinger K.S., Wucher F.P., and Harper J., Duration of effusion after antibiotic treatment for acute otitis media: Comparison of cefaclor and amoxicillin, Pediatr. Infect. Dis. 1 (1982), pp. 310–316. [DOI] [PubMed] [Google Scholar]
- 16.Newcombe R.G., Measures of location for confidence intervals for proportions, Commun. Statist. Theory Methods 40 (2011), pp. 1743–1767. [Google Scholar]
- 17.Pei Y.B., Tang M.L., and Guo J.H., Testing the equality of two proportions for combined unilateral and bilateral data, Commun. Statist. Simul. Comput. 37 (2008), pp. 1515–1529. [Google Scholar]
- 18.Pei Y.B., Tang M.L., Wong W.K., and Guo J.H., Confidence intervals for correlated proportion differences from paired data in a two-arm randomised clinical trial, Statist. Methods Med. Res. 21 (2010), pp. 167–187. [DOI] [PubMed] [Google Scholar]
- 19.Rao C.R., Linear Statistical Inference and Its Applications, 2nd ed., Wiley, New York, 1985. [Google Scholar]
- 20.Rosner B., Statistical methods in ophthalmology: An adjustment for the intraclass correlation between eyes, Biometrics 38 (1982), pp. 105–114. [PubMed] [Google Scholar]
- 21.Rosner B. and Milton R.C., Significance testing for correlated binary outcome data, Biometrics 44 (1988), pp. 505–512. [PubMed] [Google Scholar]
- 22.Shao J. and Tu D.S., The Jackknife and Bootstrap, Springer-Verlag, New York, 1995. [Google Scholar]
- 23.Tang M.L., Pei Y.B., Wong W.K., and Li J.L., Goodness-of-fit tests for correlated paired binary data, Statist. Methods Med. Res. 21 (2010), pp. 331–345. [DOI] [PubMed] [Google Scholar]
- 24.Tang M.L., Tang N.S., and Chan I.S.F., Confidence interval construction for proportion difference in small-sample paired studies, Stat. Med. 24 (2005), pp. 3565–3579. [DOI] [PubMed] [Google Scholar]
- 25.Tang N.S., Tang M.L., and Qiu S.F., Testing the equality of proportions for correlated otolaryngologic data, Comput. Statist. Data Anal. 52 (2008), pp. 3719–3729. [Google Scholar]
- 26.Tang M.L., Tang N.S., and Rosner B., Statistical inference for correlated data in ophthalmologic studies, Stat. Med. 25 (2006), pp. 2771–2783. [DOI] [PubMed] [Google Scholar]
- 27.Wilson E.B., Probable inference, the law of succession, and statistical inference, J. Am. Stat. Assoc. 22 (1927), pp. 209–212. [Google Scholar]





