Exact Confidence Intervals for the Relative Risk and the Odds Ratio

Weizhen Wang; Guogen Shan

doi:10.1111/biom.12360

. Author manuscript; available in PMC: 2016 Dec 1.

Published in final edited form as: Biometrics. 2015 Jul 30;71(4):985–995. doi: 10.1111/biom.12360

Exact Confidence Intervals for the Relative Risk and the Odds Ratio

Weizhen Wang ^1,^2,^*, Guogen Shan ³

PMCID: PMC4715482 NIHMSID: NIHMS700003 PMID: 26228945

Summary

For comparison of proportions there are three commonly used measurements: the difference, the relative risk and the odds ratio. Significant effort has been spent on exact confidence intervals for the difference. In this paper, we focus on the relative risk and the odds ratio when data are collected from a matched-pairs design or a two-arm independent binomial experiment. Exact one-sided and two-sided confidence intervals are proposed for each configuration of two measurements and two types of data. The one-sided intervals are constructed using an inductive order, they are the smallest under the order, and are admissible under the set inclusion criterion. The two-sided intervals are the intersection of two one-sided intervals. R codes are developed to implement the intervals. Supplementary materials for this article are available online.

Keywords: Binomial distribution, Coverage probability, Multinomial distribution, Set inclusion

1. INTRODUCTION

In medical research we often need to compare two treatments using binary data, and three parameters are commonly used: the risk difference (the difference of two proportions), the relative risk (the ratio of two proportions) and the odds ratio. The risk difference is an absolute measurement of effect, while the relative risk and the odds ratio are relative measurements for comparing outcomes. In retrospective case-control studies, the odds ratio is used because the other two parameters cannot be estimated. It is also well known that the odds ratio has a direct relationship with the regression coefficient in logistic regression. The relative risk is used in randomized controlled trials and cohort studies especially when the two relevant proportions are both small. In such a case, the risk difference is not as informative as the relative risk (see McCullagh 1980, Goodman 1985, and Agresti 2002, p.44). The relative risk and the odds ratio are comparable when the disease is rare with very low probability. For some common diseases (e.g., hypertension), the value of the odds ratio could be overestimated, and the relative risk should be used instead. It should be noted that the relative risk and the odds ratio are defined differently in Eq (3) and Eq (23) for a matched-pairs design and a two-arm independent binomial experiment. In this paper we focus on interval estimation of the relative risk and the odds ratio using data that are collected in a matched-pairs design or a two-arm independent binomial experiment, and are organized in a 2 × 2 contingency table. Results on the risk difference can be found, for example, in Wang (2010, 2012).

When no pivot quantity can be found for a discrete distribution, the coverage probability of a confidence interval is typically not a constant. Therefore, for a secure capture of the parameter of interest the coverage probability function of a desired confidence interval must be always at least the nominal level 1 − α for all parameter configurations, and we call it an exact confidence interval of level 1 − α (see Casella and Berger, 1990, p. 404). Such an interval guarantees predetermined coverage for a fixed sample size no matter where the true parameter vector is located in the parameter space. The implementation, however, is typically challenging as compared with asymptotic intervals. Some studies have shown that asymptotic intervals for proportions may have an infimum coverage probability lower than the nominal level by a fixed positive amount regardless of sample size, see Huwang (1995), Agresti and Coull (1998), Brown, Cai and DasGupta (2001), and Wang and Zhang (2014). In particular, the Wald interval, the Wilson interval (1927), and the Agresti-Coull interval (1998) for a binomial proportion are proven to have an incorrect infimum coverage even for very large sample sizes. One may turn to bootstrap intervals, especially when the parameter of interest is complicated. Such efforts can be found in, for example, Li, Taylor and Nan (2010), Lin et al. (2009) and Parzen et al. (2002). However, Wang (2013) proved that all bootstrap intervals for any function of proportions, including the relative risk and the odds ratio, always have an infimum coverage probability of zero. Therefore, practitioners are at their own risk to use these intervals, since the intervals may have a very small chance of capturing the parameter of interest, and the usage of exact intervals is justified.

Exact intervals for the relative risk and the odds ratio may be obtained by inverting exact tests in the case of two independent binomials, see for example, Gart (1971), Santner and Snell (1980), and Chan and Zhang (1999). This indirect construction may result in wider intervals. Wang (2010, 2012) proposed optimal intervals for the risk difference based on a direct analysis of coverage probability and an inductive order of the sample space. Shan and Wang (2013) developed an R-package, ExactCIdiff, to implement his intervals. This approach is now adapted to more complicated cases: the relative risk and the odds ratio. In Section 2 we describe preliminary results for the smallest one-sided interval construction. Section 3 discusses how to derive exact intervals for the relative risk and the odds ratio using a matched-pairs design and how to implement the computation. Section 4 deals with the case of a two-arm independent binomial experiment. The proposed intervals tend to be shorter than the ones from SAS (Version 9.3). Section 5 is a summary. The proofs and some figures are given in Supplementary Materials online.

2. PRELIMINARY RESULTS

Following the work by Buehler (1957), Bol’shev (1965), Chen (1993), Lloyd and Kabaila (2003), and Wang (2010) the construction of an exact one-sided confidence interval becomes automatic provided that an order (or equivalently, a rank function on a finite sample space) is specified in advance. We describe this result here, as it will be applied four times in the paper. Suppose a random vector X is observed from a finite sample space S, i.e., $S = {{\underline{x}}_{i}}_{i = 1}^{n}$ . A rank function R(.), assuming positive integer values, is defined on S. The probability mass function of X is given by p(x; ξ), where ξ is the parameter vector belonging to a parameter space H, a subset of R^k. Suppose ξ = (θ, η) and

H = {\underline{ξ} : \underline{η} \in D (θ) for each θ \in [A, B]},

where θ is the parameter of interest and η is the nuisance parameter vector, [A, B] is a given interval in R¹ (A and B may be ±∞, and the interval is open when the corresponding end is infinity) and D(θ) is a subset of R^k−1 depending on θ. We are interested in searching for the smallest exact lower one-sided confidence interval [L_S(X), B] among all 1 − α intervals for θ with form [L(X), B] that satisfy

1) L (\underline{x}) = L ({\underline{x}}^{'}) if R (\underline{x}) = R ({\underline{x}}^{'}); 2) L (\underline{x}) \leq L ({\underline{x}}^{'}) if R ({\underline{x}}^{'}) \leq R (\underline{x}) .

So a sample point x with a smaller rank has a larger confidence limit.

Lemma 1: Assume α ∈ (0, 1). For a given rank function R(.) on S and any x ∈ S, consider

f_{\underline{x}} {(θ)}^{\underset{=}{def}} \inf_{\underline{η} \in D (θ)} [1 - \sum_{{{\underline{x}}^{'} \in S : R ({\underline{x}}^{'}) \leq R (\underline{x})}} p ({\underline{x}}^{'}; \underline{ξ})] = 1 - α .

(1)

If f_x(θ) is a continuous function in θ, define

L_{s} (\underline{x}) = {\begin{matrix} the smallest solution of (1), & if (1) has a solution; \\ A, & otherwise, \end{matrix}

(2)

then

i)
[L_S(X), B] is of level 1 − α and satisfies 1) and 2);
ii)
for any 1 − α interval [L(X), B] satisfying 1) and 2), L(X) ≤ L_S(X).

This lemma follows Theorem 4 in Wang (2010). Due to ii), [L_S(X), B] is the smallest interval since it is a subset of any other interval. Then it is the best under the given rank function R(.). In order to derive the smallest exact interval for parameters including the relative risk and the odds ratio with A = 0 and B = +∞, we have to resolve the following two problems for implementation:

A)
provide a reasonable rank function R(.) for each of four cases;
B)
find the infimum in Eq (1) over D(θ) and the smallest solution of Eq (1) efficiently.

For a sample space S with n sample points, there are about 2ⁿ possible rank functions on S. Some are clearly bad for interval construction. For example, the rank function R_c(.) that assumes a constant value over S (i.e., all sample points are tied) produces an interval with a constant confidence limit over S. Lemma 1 still shows that it is the smallest under R_c(.). However, such an interval is useless in practice, and can be uniformly improved by simply giving up those ties. Therefore, identifying a reasonable rank function is extremely important and also challenging, but Lemma 1 does not discuss it. One may use, for example, the maximum likelihood estimator for θ as the rank function. This, however, generates too many ties and results in a wide confidence interval. Wang (2010) proposed an inductive construction on rank function that yields an admissible interval. We apply this idea to two interesting parameters: the relative risk and the odds ratio.

Searching for the infimum of a given function is a classic, but difficult problem in numerical computation, especially for a multivariate function. A more challenging issue is that the infimum must be computed a large number of times. Programs exist for optimization, however, none is able to provide a solution precisely and quickly. Our study suggests that a two-stage grid search (explained later) for the infimum on D(θ) is an effective solution.

3. CASE I: A MATCHED-PAIRS DESIGN

In a 2 × 2 table with a matched-pairs design, suppose there are n independent and identical trials, and each trial is inspected by two criteria 1 and 2. By criterion i, each trial is classified as S_i or F_i for i = 1, 2. The numbers of trials with outcomes (S₁, S₂), (S₁, F₂), (F₁, S₂) and (F₁, F₂) are the observations, and are denoted by N₁₁, N₁₂, N₂₁ and N₂₂, respectively. Thus X_p = (N₁₁, N₁₂, N₂₁) follows a multinomial distribution with probabilities p₁₁, p₁₂, and p₂₁, respectively, denoted by Multinomial(n, p₁₁, p₁₂, p₂₁). Let p_1* = P (S₁) and p_*1 = P(S₂) be the two paired proportions. The involved items are displayed below.

	S₂	F₂
S₁	(S₁, S₂), N₁₁, p₁₁	(S₁, F₂), N₁₂, p₁₂	p_1* = p₁₁ + p₁₂
F₁	(F₁, S₂), N₂₁, p₂₁	(F₁, F₂), N₂₂, p₂₂
	p_*1 = p₁₁ + p₂₁		∑_i,j N_ij = n,∑_i,j p_ij = 1

Open in a new tab

The relative risk θ_{p_r} and the odds ratio θ_po are given by:

θ_{p_{r}} \overset{def}{=} \frac{p_{1 *}}{p_{* 1}} = \frac{p_{11} + p_{12}}{p_{11} + p_{21}} and θ_{p_{o}} \overset{def}{=} \frac{p_{1 *} (1 - p_{* 1})}{p_{* 1} (1 - p_{1 *})} = \frac{(p_{11} + p_{12}) (1 - p_{11} - p_{21})}{(p_{11} + p_{21}) (1 - p_{11} - p_{12})} .

(3)

Here, the subscripts p, r and o stand for “paired proportions”, “relative risk” and “odds ratio”, respectively. Two one-sided 1 − α confidence intervals [L(X_p), +∞) and [0, U(X_p)] and a two-sided 1 − α confidence interval [L(X_p), U(X_p)] are to be constructed for θ_{p_r} and θ_po. To the best of our knowledge, no exact confidence intervals have been proposed for these parameters. StatXact 10 (2013) claims to compute an interval for so called “the odds ratio” (see Equations 13.17 and 13.7 in the user manual). It is indeed θ_{p_r}, but we find that their computed interval is for p₁₂/p₂₁ not θpr. SAS (Version 9.3) does not have any discussion on exact intervals for the parameters. The sample space and the parameter space are

S_{p} = {{\underline{x}}_{p} = (n_{11}, n_{12}, n_{21}) : n_{i j} \geq 0 is an integer, 0 \leq n_{11} + n_{12} + n_{21} \leq n}

(4)

with (n + 1)(n + 2)(n + 3)/6 sample points and

H_{p} = {(p_{11}, p_{1 *}, p_{* 1}) : 0 \leq p_{11} \leq \min (p_{1 *}, p_{* 1}), 0 \leq p_{1 *} + p_{* 1} - p_{11} \leq 1},

(5)

respectively. The random vector X_p has a joint probability mass function

p_{p} (n_{11}, n_{12}, n_{21}; p_{11}, p_{1 *}, p_{* 1}) = \frac{n!}{n_{11}! n_{12}! n_{21}! n_{22}!} p_{11}^{n_{11}} {(p_{1 *} - p_{11})}^{n_{12}} {(p_{* 1} - p_{11})}^{n_{21}} p_{22}^{n_{22}} .

(6)

with n₂₂ = n−n₁₁ −n₁₂ −n₂₁ and p₂₂ = 1−(p_1* + p_*1−p₁₁). We illustrate the setting below.

Example 1. Bentur et al. (2009, p.847) conducted a study on airway hyper-responsiveness (AHR) status before and after stem cell transplantation (SCT) on 21 patients. The AHR status for each patient is assessed by a methacholine challenge test (MCT) twice, before and after SCT. The data summary is given as follows.

		Before SCT
		AHR yes	AHR no	total
After SCT	AHR yes	1(=n₁₁), p₁₁	7(=n₁₂), p₁₂	8, p_1*
	AHR no	1(=n₂₁), p₂₁	12(=n₂₂), p₂₂	13

	total	2, p_*1	19	21

Open in a new tab

For example, one (= n₁₁) patient has AHR before and after SCT, and p₁₁ is the probability that a patient has AHR before and after SCT. Exact confidence intervals for θ_{p_r} and θ_{p_o} will be derived to study the effect of SCT on AHR status especially in this small sample.

3.1 INTERVALS FOR θ_{p_r}

Three intervals (lower one-sided, upper one-sided and two-sided) are to be constructed for θ_{p_r}. The two-sided 1 − α interval can be obtained by using the intersection of the two one-sided 1 − α/2 intervals. The next lemma discusses how to obtain an upper one-sided interval from a lower one-sided interval. Therefore, we focus on the construction of a lower one-sided 1 − α interval. Let [a, b] = [a, +∞) if b = +∞.

Lemma 2: Suppose [L(N₁₁, N₁₂, N₂₁), +∞) is a lower one-sided 1− α confidence interval for θ_{p_r}. Then

[0, U (N_{11}, N_{12}, N_{21})] \overset{def}{=} [0, \frac{1}{L (N_{11}, N_{21}, N_{12})}]

(7)

is an upper one-sided 1 − α confidence interval for θ_{p_r}. Furthermore, [L(N₁₁, N₁₂, N₂₁), U(N₁₁, N₁₂, N₂₁)] is a two-sided 1 − 2α interval for θ_{p_r}.

The parameter space H_p can be expressed in terms of (θ_{p_r}, p₁₁, p₂₁) with

p_{12} = (θ_{p_{r}} - 1) p_{11} + θ_{p_{r}} p_{21}

(8)

as follows: H_{p_r} = {(θ_{p_r}, p₁₁, p₂₁) : (p₁₁, p₂₁) ∈ D_{p_r}(θ_{p_r}), ∀ θ_{p_r} ∈ [0, +∞)}, where

D_{p_{r}} (θ_{p_{r}}) = {\begin{matrix} {(p_{11}, p_{12}) : θ_{p_{r}} p_{11} + (θ_{p_{r}} + 1) p_{21} \in [0, 1]}, if θ_{p_{r}} \geq 1; \\ {(p_{11}, p_{21}) : θ_{p_{r}} p_{11} + (θ_{p_{r}} + 1) p_{21} \in [0, 1], p_{11} + p_{21} \in [0, 1]}, otherwise . \end{matrix}

(9)

The first line in (9) is a triangle with three vertices, (0, 0), (1/θ_{p_r}, 0) and (0, 1/(θ_{p_r} + 1)), in the p₁₁-p₂₁ plane, and the second is the intersection of two triangles, one just mentioned and the other with three vertices, (0, 0), (1, 0) and (0, 1). See both cases in Figure S1 in Supplementary Materials. The joint probability mass function p_p is rewritten as

p_{p_{r}} (n_{11}, n_{12}, n_{21}; θ_{p_{r}}, p_{11}, p_{21}) = p_{p} (n_{11}, n_{12}, n_{21}; p_{11}, p_{12}, p_{21}),

(10)

where p_p and p₁₂ are given in (6) and (8), respectively.

As in Lemma 1, the construction of a one-sided interval [L(X_p), +∞) depends on a rank function R_{p_r}(.) on S_p. Point x_p is large if R_{p_r}(x_p) is small. Here are three natural rules on R_{p_r}.

a)
R_{p_r}(0, n, 0) = 1. i.e., point (0, n, 0) is the largest.
b)
R_{p_r}(n₁₁, n₁₂, n₂₁) ≤ R_{p_r}(n₁₁, n₁₂ − 1, n₂₁) for any n₁₂ ∈ [1, n − n₁₁ − n₂₁].
c)
R_{p_r}(n₁₁, n₁₂, n₂₁ − 1) ≤ R_{p_r}(n₁₁, n₁₂, n₂₁) for any n₂₁ ∈ [1, n − n₁₁ − n₁₂].

Rule a) follows the intuition that (0, n, 0) provides the largest estimate for θ_{p_r}. Rules b) and c) follow the monotonicity of function θ_{p_r} = (p₁₁ + p₁₂)/(p₁₁ + p₂₁). One would expect a similar rule for the case that n₁₂ and n₂₁ are fixed and n₁₁ varies. Both the numerator and the denominator of θ_{p_r} include p₁₁, so it is not appropriate to propose a simple rule for n₁₁.

The rank function R_{p_r}(.) is determined by combining Rules a), b), c) and numerical evaluation sequentially on all sample points. Again, R_{p_r}(0, n, 0) = 1. We next describe how to assign a value for the rank function from small to large (i.e., determine sample points from large to small). Suppose that R_{p_r}(.) has been assigned values to sets E_{p_r,1} through E_{p_r,k} with values 1 through k (e.g., E_{p_r,1} = {(0, n, 0)}) for some k ≥ 1. i.e., set E_{p_r,i} contains the ith largest point(s) for i ≤ k. Let $S_{p_{r}, k} = \cup_{i = 1}^{k} E_{p_{r}, i}$ . Now we identify a nonempty set E_{p_r,k+1} in S_p that contains the (k + 1)th largest point(s). If such a set is found, then by induction, the function R_{p_r}(.) is defined on S_p because S_{p_r,k} is strictly increasing and S_p is finite.

For each point x_p = (n₁₁, n₁₂, n₂₁), we introduce five points: A = (n₁₁ + 1, n₁₂, n₂₁), B = (n₁₁, n₁₂ − 1, n₂₁), C = (n₁₁, n₁₂, n₂₁ + 1), D = (n₁₁ − 1, n₁₂, n₂₁), E = (n₁₁ + 1, n₁₂ − 1, n₂₁), that are next to but less than x_p. Let N_{x_p} be the neighbor set that consists up to four points:

N_{{\underline{x}}_{p}} = {\begin{matrix} {A, B, C, D} \cap S_{p}, & if A \in S_{p} \\ {B, C, D, E} \cap S_{p}, & if A \notin S_{p} . \end{matrix}

(11)

See N_{x_p} in Figure S2 in Supplementary Materials for x_p = (3, 3, 2).

The neighbor set for S_{p_r,k}, denoted by N_{p_r,k}, consists of points in N_{x_p} for any x_p ∈ S_{p_r,k} but not in S_{p_r,k}. i.e.,

N_{p_{r}, k} = (\cup_{{\underline{x}}_{p} \in S_{p_{r}, k}} N_{{\underline{x}}_{p}}) \cap S_{p_{r}, k}^{c} .

(12)

However, some points in N_{p_r,k} are impossible to be the (k + 1)th largest due to Rules b) and c). To eliminate them from the selection, consider a subset of N_{p_r,k}, called the candidate set C_{p_r,k}, given by

C_{p_{r}, k} = {(n_{11}, n_{12}, n_{21}) \in N_{p_{r}, x} : (n_{11}, n_{12} + 1, n_{21}) \notin N_{p_{r}, k}, (n_{11}, n_{12}, n_{21} - 1) \notin N_{p_{r}, k}} .

(13)

Set E_{p_r,k+1} is to be selected from C_{p_r,k}, not N_{p_r,k}. For each ${\underline{x}}_{p}^{'} = (n_{11}^{'}, n_{12}^{'}, n_{21}^{'}) \in C_{p_{r}, k}$ , consider an equation similar to (1)

f_{{\underline{x}}_{p}^{'}} (θ_{p_{r}}) = \inf_{(p_{11}, p_{12}) \in D_{p_{r}} (θ_{p_{r}})} [1 - \sum_{{\underline{x}}_{p} \in S_{p_{r}, k} \cup {\underline{x}}_{p}^{'}} p_{p_{r}} ({\underline{x}}_{p}; θ_{p_{r}}, p_{11}, p_{21})] = 1 - α,

(14)

where p_{p_r} is given in (10). Let $L^{*} ({\underline{x}}_{p}^{'})$ be the smallest solution to the above equation if a solution exists, and let $L^{*} ({\underline{x}}_{p}^{'})$ be 0 otherwise. Then define

E_{p_{r}, k + 1} = {{\underline{x}}_{p} \in C_{p_{r}, k} : L^{*} ({\underline{x}}_{p}) = \max {L^{*} ({\underline{x}}_{p}^{'}) : {\underline{x}}_{p}^{'} \in C_{p_{r}, k}}},

(15)

R_{p_{r}} ({\underline{x}}_{p}) = k + 1, \forall {\underline{x}}_{p} \in E_{p_{r}, k + 1}, and S_{p_{r}, k + 1} = \cup_{i = 1}^{k + 1} E_{p_{r}, i} .

(16)

Eq (15) assures that the rank function R_{p_r}(.) yields the smallest (best) interval (with the largest lower confidence limit) in each step. Since E_{p_r,k+1} is not empty, and S_p is finite, there always exists a positive integer k_{p_r} such that S_{p_r,k_p_r} = S_p. Thus, the rank function R_{p_r}(.) is defined on the entire S_p, and the construction for R_{p_r}(.) is complete. Then the smallest lower one-sided 1 − α confidence interval [L_{p_r}, +∞) for θ_{p_r}, under the rank function R_{p_r}(.), is derived following Lemma 1, the smallest upper one-sided 1 − α interval [0, U_{p_r}] follows Lemma 2, and [L_{p_r}, U_{p_r}] is a two-sided 1 − 2α interval. If we use an existing function, e.g., the maximum likelihood estimator of θ_{p_r}, to define an order, then many sample points, for example, the points (n₁₁, n₁₂, n₂₁) with n₁₁ + n₁₂ = n₁₁ + n₂₁, are tied since they have the same estimate, 1, for θ_{p_r}. So, the corresponding confidence limits are equal to each other at these sample points following Lemma 1. In particular, the confidence limits at points (n₁₁, n₁₂, n₂₁) = (i, 0, 0), for i = 1, …, n, remain unchanged, indicating that the order by the maximum likelihood estimator is unreasonable.

Three facts are worth mentioning for the computation in (14) and (15). i) Find E_{p_r,k+1} from C_{p_r,k} in (13) instead of N_{p_r,k} in (12). ii) Using a two-stage grid search for the infimum in (14). i.e., for each D_{p_r}(θ_{p_r}) a partition is given first. We pick a point (θ_{p_r}, p₁₁, p₁₂) in each set of the partition and identify the point that yields the minimum value of the function in (14). Then on the set of the partition that contains this point, we have another finer partition and search for the minimum again. iii) Suppose two points x_p1 and x_p2 belong to C_{p_r,k} and we already compute L* (x_p1). If we find f_{x_p2}(L* (x_p1)) < 1 − α, then x_p2 does not belong to E_{p_r,k+1} and the computation of L*(x_p2) is not needed. These three facts make the computation more efficient, and are also used for the other three cases in this paper. Next, we provide a closed form for L_{p_r}(0, n, 0), which is useful for checking the precision of the numerical calculation.

Lemma 3: For any rank function R(.) with R(0, n, 0) = 1, let [L(N₁₁, N₁₂, N₂₁), +∞) be the smallest one-sided 1 − α interval for θ_{p_r} under R. Then

L (0, n, 0) = \frac{α^{1 ∕ n}}{1 - α^{1 ∕ n}} .

(17)

Example 2. For illustration purpose, we show the construction of the largest four L_{p_r}(x_p)’s on four sample points with ranks 1 through 4, when 1 − α = 0.95 and n = 3.

Due to a) R_{p_r}(0, 3, 0) = 1. So L_{p_r}(0, 3, 0) = 0.5832 following (17), or one can obtain the same result by solving a special case of (1):

f_{(0, 3, 0)} (θ_{p_{r}}) = \inf_{(p_{11}, p_{12}) \in D_{p_{r}} (θ_{p_{r}})} [1 - p_{p_{r}} (0, 3, 0; θ_{p_{r}}, p_{11}, p_{12})] = 0.95 .

To find the sample point with rank 2, we have

N_{(0, 3, 0)} = {(0, 2, 0), (1, 2, 0)}, N_{p_{r}, 1} = N_{(0, 3, 0)}, and C_{p_{r}, 1} = N_{p_{r}, 1}

following (11), (12) and (13). Then solve (14) twice by using x′ = (0, 2, 0) and x′ = (1, 2, 0), respectively, with S_{p_r,1} = {(0, 3, 0)}, and obtain L*(0, 2, 0) = 0.4320 and L*(1, 2, 0) = 0.5504. Since L*(1, 2, 0) is larger than L*(0, 2, 0), set E_{p_r,1} = {(1, 2, 0)} and the rank function R_{p_r}(1, 2, 0) = 2.

To find the sample points with ranks 3 and 4, repeat a similar step to the above paragraph. Then R_{p_r}(1, 1, 0) = 3 and R_{p_r}(2, 1, 0) = 4. The details are given in Table 1. Note that C_{p_r,3} is a proper subset of N_{p_r,3} and set E_{p_r,3} is found within C_{p_r,3} instead of E_{p_r,3}. This would save a lot of computing time especially when n is large.

Table 1.

The details of the construction of L_{p_r} at the four largest sample points in Example 2.

k	E_{p_r,k}	N_{p_r,k}	C_{p_r,k}	$L^{*} ({\underline{x}}_{p}^{'})$	$\max {L^{*} ({\underline{x}}_{p}^{'})}$	x_p	R_{p_r}(x_p)	L_{p_r}(x_p)
						(0,3,0)	1	0.5832
1	(0,3,0)	(0,2,0)	(0,2,0)	0.4320
		(1,2,0)	(1,2,0)	0.5504	0.5504	(1,2,0)	2	0.5504
2	(1,2,0)	(0,2,0)	(0,2,0)	0.4320
		(1,1,0)	(1,1,0)	0.5151	0.5151	(1,1,0)	3	0.5151
		(2,1,0)	(2,1,0)	0.4750
3	(1,1,0)	(0,2,0)	(0,2,0)	0.4104
		(1,0,0)	(1,0,0)	0.1127
		(1,1,1)	(1,1,1)	0.2169
		(0,1,0)	(2,1,0)	0.4521	0.4521	(2,1,0)	4	0.4521
		(2,1,0)
		(2,0,0)

Open in a new tab

The lower confidence limits on these four points, (0,3,0), (1,2,0), (1,1,0) and (2,1,0), are also given in Table 1 following Lemma 1 with the rank function R_{p_r}(.) at the four points. For example, L_{p_r}(1, 1, 0) = 0.5151 is the smallest solution of the following function of θ_{p_r}, $\inf_{(p_{11}, p_{12}) \in D_{p_{r}} (θ_{p_{r}})} [1 - p_{p_{r}} (0, 3, 0; θ_{p_{r}}, p_{11}, p_{12}) - p_{p_{r}} (1, 2, 0; θ_{p_{r}}, p_{11}, p_{12}) - p_{p_{r}} (1, 1, 0; θ_{p_{r}}, p_{11}, p_{12})] = 0.95$ which is also a special case of (1).

Example 1 (continued). Confidence intervals for θ_{p_r} are reported in Table 2. For example, the 95% intervals [L_{p_r}, +∞), [0, U_{p_r}] and the 90% interval [L_{p_r}, U_{p_r}] for θ_{p_r} are equal to [1.2906, +∞), [0, 15.9291] and [1.2906, 15.9291], respectively. It is clear that SCT increases the chance of having AHR because the lower-sided and two-sided intervals are inside (1, +∞). We obtain L_{p_r}(1, 7, 1) = 1.2906 and L_{p_r}(1, 1, 7) following Lemma 1 under the rank function R_{p_r}(.), then U_{p_r}(1, 7, 1) = 1/L_{p_r}(1, 1, 7) = 15.9291 by Lemma 2. The computation takes time as the infimum in (14) is found over a two-dimensional region D_{p_r}(θ_{p_r}) many times.

Table 2.

Exact one-sided and two-sided intervals for θ_{p_r} and θ_{p_o} in Example 1 when n₁₁ = 1, n₁₂ = 7, n₂₁ = 1, n₂₂ = 12.

	Two-sided 90%		Two-sided 95%
	Lower 95%	Upper 95%	Lower 97.5%	Upper 97.5%
θ _{p_r}	1.2906	15.9291	1.0448	23.0365

θ _{p_o}	1.4149	26.9615	1.1956	37.4813

Open in a new tab

3.2 INTERVALS FOR θ_{p_o}

Similar to Lemma 2, we provide a one-to-one relationship between lower and upper confidence intervals for θ_{p_o}. Therefore, we only derive a lower confidence interval as upper one-sided and two-sided intervals follow Lemma 4.

Lemma 4: Suppose [L(N₁₁, N₁₂, N₂₁), +∞) is a lower one-sided 1 − α confidence interval for θ_{p_o}. Then

[0, U (N_{11}, N_{12}, N_{21})] \overset{def}{=} [0, \frac{1}{L (N_{11}, N_{21}, N_{12})}]

(18)

is an upper one-sided 1 − α confidence interval for θ_{p_o}. Furthermore, [L(N₁₁, N₁₂, N₂₁), U(N₁₁, N₁₂, N₂₁)] is a two-sided 1 − 2α interval for θ_{p_o}.

The parameter space H_p is expressed in terms of (θ_{p_o}, p₁₁, p₂₁) with

p_{12} = \frac{θ_{p_{o}} (p_{11} + p_{21}) (1 - p_{11}) - p_{11} (1 - p_{11} - p_{21})}{θ_{p_{o}} (p_{11} + p_{21}) + (1 - p_{11} - p_{21})}

(19)

as follows: H_{p_o} = {(θ_{p_o}, p₁₁, p₂₁) : (p₁₁, p₂₁) ∈ D_po(θ_{p_o}), ∀ θ_{p_o} ∈ [0, +∞)}, where

D_{p_{o}} (θ_{p_{o}}) = {(p_{11}, p_{21}) : p_{11} \leq \frac{{(1 - p_{21})}^{2} - θ_{p_{o}} p_{21}^{2}}{1 + (θ_{p_{o}} - 1) p_{21}}, p_{21} \leq \frac{1}{\sqrt{θ_{p_{o}}} + 1}},

which is a right curved triangle. See Figure S3 in Supplementary Materials for this set with different values of θ_{p_o}. The joint probability mass function p_p in (6) is rewritten as p_{p_o}(n₁₁, n₁₂, n₂₁; θ_{p_o}, p₁₁, p₂₁) = p_p(n₁₁, n₁₂, n₂₁; p₁₁, p₁₂, p₂₁), where p₁₂ is in (19).

The construction of an interval [L(X_p), +∞) depends on a rank function R_p_o(.) on S_p. Since θ_{p_o} and θ_{p_r} have the same monotonicity in p₁₁, p₁₂ and p₂₁, Rules a), b) and c) for R_{p_r}(.) in Section 3.1 are also valid for R_p_o(.). However, we add one more rule for R_p_o(.): d) R_p_o(n₁₁, n₁₂, n₂₁) = R_p_o(n₂₂, n₁₂, n₂₁), which follows that θ_{p_o} is invariant if p₁₁ and p₂₂ are exchanged. This rule in fact makes the sample space simpler. For a point x_p = (n₁₁, n₁₂, n₂₁) let ${\overset{‒}{x}}_{p}$ be a set in S_p:

{\overset{‒}{x}}_{p} = {\begin{matrix} {\underline{x}}_{p}, if n_{11} = n_{22}; \\ {\underline{x}}_{p} \cup {(n_{22}, n_{12}, n_{21})}, otherwise . \end{matrix}

(20)

By Rule d), the rank function R_p_o(.) assumes a constant value on set ${\overset{‒}{x}}_{p}$ . Thus R_p_o(.) generates ties, and the confidence interval assumes a constant value on ${\overset{‒}{x}}_{p}$ that coincides with the nature of θ_{p_o}. When computing probability, each ${\overset{‒}{x}}_{p}$ is one sample point in a new sample space

S_{p}^{n} = {{\overset{‒}{x}}_{p} = (n_{11}, n_{12}, n_{21}) : n_{11} + n_{12} + n_{21} \in [0, n], n_{11} \leq (n - n_{12} - n_{21}) ∕ 2},

(21)

and the associated probability mass function is

p_{p_{o}}^{n} ({\overset{‒}{x}}_{p}; θ_{p_{o}}, p_{11}, p_{21}) = \sum_{{\underline{x}}_{p} \in {\overset{‒}{x}}_{p}} p_{p_{o}} ({\underline{x}}_{p}; θ_{p_{o}}, p_{11}, p_{21}) .

(22)

Each (n₁₁, n₁₂, n₂₁) that satisfies (21) is called the representation of ${\overset{‒}{x}}_{p}$ . In this section, ${\overset{‒}{x}}_{p}$ can be the representation or the set in (20). The advantage of $S_{p}^{n}$ in (21) over S_p in (4) is that the former contains fewer elements. For example, when n = 3, there are twenty points in S_p but only thirteen points in $S_{p}^{n}$ . Each set ${\overset{‒}{x}}_{p}$ and its representation are listed below:

${\overset{‒}{x}}_{p}$ in terms of (20)	the representation of ${\overset{‒}{x}}_{p}$	${\overset{‒}{x}}_{p}$ in terms of (20)	the representation of ${\overset{‒}{x}}_{p}$
{(0,0,0), (3,0,0)}	(0,0,0)	{(0,0,1), (2,0,1)}	(0,0,1)
{(0,0,2), (1,0,2)}	(0,0,2)	{(0,0,3)}	(0,0,3)
{(0,1,0),(2,1,0)}	(0,1,0)	{(0,1,1),(1,1,1)}	(0,1,1)
{(0,1,2)}	(0,1,2)	{(0,2,0),(1,2,0)}	(0,2,0)
{(0,2,1)}	(0,2,1)	{(0,3,0)}	(0,3,0)
{(1,0,0),(2,0,0)}	(1,0,0)	{(1,0,1)}	(1,0,1)
{(1,1,0)}	(1,1,0)

Open in a new tab

In the list above, for example, R_{p_o}(0, 3, 0) = 1 and R_{p_o}(0, 0, 0) = R_{p_o}(3, 0, 0).

The construction of the rank function R_{p_o}(.) is the same as R_{p_r}(.) except that x_p and S_p in (4) and p_{p_r} in (10) are replaced by ${\overset{‒}{x}}_{p}$ in (20), $S_{p}^{n}$ in (21) and $p_{p_{o}}^{n}$ in (22), respectively. In particular, we need to follow (12) through (16) to build up R_{p_o}(.). Once R_{p_o}(.) is defined on $S_{p}^{n}$ , the smallest lower one-sided 1 − α confidence interval [L_{p_o}, +∞) for θ_{p_o} under R_{p_o}(.) is derived following Lemma 1, the smallest upper one-sided 1 − α confidence interval [0, U_{p_o}] follows Lemma 4, and [L_{p_o}, U_{p_o}] is a 1 − 2α interval.

Example 1 (continued). The three intervals above are also reported in Table 2. SCT does increase the odds of having AHR because the lower one-sided and the two-sided confidence intervals are inside (1, +∞). Again, we compute the lower one-sided interval first, then the upper one-sided and two-sided intervals follow Lemma 4.

As a closing remark for this section, the associate editor pointed out that the interval construction just developed can also be applied to the multinomial sampling in a 2 × 2 table to infer another odds ratio

OR = \frac{p_{11} p_{22}}{p_{12} p_{21}},

see Agresti (2002, p. 44). However, the technical details are quite different due to the structure of this odds ratio (OR), and will not be discussed in this paper.

4. CASE II: A TWO-ARM INDEPENDENT BINOMIAL EXPERIMENT

We also have a 2 × 2 table, but each row contains a binomial experiment as follows:

	S	F
experiment 1	S₁, X, p₁	F₁, n₁ − X, 1 − p₁	n₁
experiment 2	S₂, Y, p₂	F₂, n₂ − Y, 1 − p₂	n₂

Open in a new tab

where X ~ Bin(n₁, p₁) is a binomial observation with n₁ trials and a success probability p₁ and Y ~ Bin(n₂, p₂) is independent of X. The relative risk θ_{i_r} and the odds ratio θ_{i_o},

θ_{i_{r}} \overset{def}{=} \frac{p_{1}}{p_{2}} and θ_{i_{o}} \overset{def}{=} \frac{p_{1} (1 - p_{2})}{p_{2} (1 - p_{1})},

(23)

are of interest. The subscript i stands for “independent proportions”. Compared with X_p = (N₁₁, N₁₂, N₂₁) that has three parameters p₁₁, p₁₂ and p₂₁ in Section 3, we now observe a simpler random vector X_i = (X, Y) with two parameters p₁ and p₂. In consequence, the interval construction is easier. The sample space and the parameter space are given below:

S_{i} = {{\underline{x}}_{i} = (x, y) : 0 \leq x \leq n_{1}, 0 \leq y \leq n_{2}, x and y are integers},

(24)

and

H_{i} = {(p_{1}, p_{2}) : 0 \leq p_{1}, p_{2} \leq 1} .

(25)

The joint probability mass function is

p_{i} (x, y; p_{1}, p_{2}) = \frac{n_{1}!}{x! (n_{1} - x)!} p_{1}^{x} {(1 - p_{1})}^{n_{1} - x} \frac{n_{2}!}{y! (n_{2} - y)!} p_{2}^{y} {(1 - p_{2})}^{n_{2} - y} .

(26)

Example 3. Consider a study in Essenberg (1952), where a two-arm randomized clinical trial was conducted for testing the effect of tobacco smoking on tumor development in mice. In the treatment (smoking) group, there were 23(= n₁) mice, and tumors were observed on 21(= x) mice; in the control group, n₂ = 32 and y = 19. Let p₁ and p₂ be the tumor rates for the treatment and control groups, respectively. A comparison between p₁ and p₂ using θ_{i_r} and θ_{i_o} will be discussed for the smoking effect on tumor development.

4.1 INTERVALS FOR θ_{i_r}

Similar to Lemmas 2 and 4, there exists a one-to-one relationship between lower and upper one-sided intervals.

Lemma 5: Suppose [L_n2,n1(Y, X), +∞) is a lower one-sided 1 − α confidence interval for 1/θ_{i_r} = p₂/p₁. Then

[0, U (X, Y)] \overset{def}{=} [0, \frac{1}{L_{n_{2}, n_{1}} (Y, X)}]

(27)

is an upper one-sided 1 − α interval for θ_{i_r}. Suppose [L_n1,n2(X, Y ), +∞) is a lower one-sided 1 − α interval for θ_{i_r}, then [L_n1,n2(X, Y), U(X, Y)] is a two-sided 1 − 2α interval for θ_{i_r}.

Following Lemma 5, only the construction of L_n1,n2(X, Y) for all possible n₁ and n₂ is needed. We drop the subscript and use L(X, Y) for future discussion. The interval construction depends on a rank function R_{i_r}(.) on S_i. Here R_{i_r}(.) should satisfy several rules that are from the monotonicity of θ_{i_r} = p₁/p₂ as a function of p₁ and p₂.

a)
R_{i_r}(n₁, 0) = 1. i.e., point (n₁, 0) is the largest.
b)
R_{i_r}(x, y) ≤ R_{i_r}(x − 1, y) for any x ∈ [1, n₁] and y ∈ [0, n₂].
c)
R_{i_r}(x, y − 1) ≤ R_{i_r}(x, y) for any x ∈ [0, n₁] and y ∈ [1, n₂].

These rules are shown in

\begin{matrix} (x, y + 1) \\ ↑ Rule b) \\ (x, y - 1) & \overset{Rule c)}{\leftarrow} & (x, y) \end{matrix}

where “←” means “larger than or equal to”.

The parameter space H_i is rewritten in terms of (θ_{i_r}, p₂) with p₁ = θ_{i_r}p₂ as follows:

H_{i_{r}} = {(θ_{i_{r}}, p_{2}) : p_{2} \in D_{i_{r}} (θ_{i_{r}}) \overset{def}{=} [0, \min {1, \frac{1}{θ_{i_{r}}}}], \forall θ_{i_{r}} \in [0, + \infty)},

where D_{i_r}(θ_{i_r}) is a line segment. See Figure S4 in Supplementary Materials. The joint probability mass function p_i for (X, Y) is rewritten as p_{i_r}(x, y; θ_{i_r}, p₂) = p_i(x, y; θ_{i_r}p₂, p₂), where p_i is given in (26). The construction for the rank function R_{i_r}(.) is similar to Wang (2010). By induction, we start with R_{i_r}(n₁, 0) = 1. Let E_{p_r,1} = {(n₁, 0)}. Suppose R_{i_r} has been defined on k(≥ 1) sets E_{i_r,1}, , , E_{i_r,k} with values 1, 2, …, k. Let

N_{i_{r}, k} = {(x, y) \in S_{i} : (x, y) \notin \cup_{i = 1}^{k} E_{i_{r}, i}; (x + 1, y) \in \cup_{i = 1}^{k} E_{i_{r}, i} or (x, y - 1) \in \cup_{i = 1}^{k} E_{i_{r}, i}}

be the neighbor set of $\cup_{i = 1}^{k} E_{i_{r}, i}$ . Then, from

C_{i_{r}, k} = {(x, y) \in N_{i_{r}, k} : (x + 1, y) \notin N_{i_{r}, k} and (x, y - 1) \notin N_{i_{r}, k}},

a subset of N_{i_r,k}, we pick the point(s) (x, y) that has the largest possible lower confidence limit to form a set E_{i_r,k+1} and assign a rank of R_{i_r}(x, y) = k + 1 to any point (x, y) ∈ E_{i_r,k+1}. This is similar to (15) and (16). Following induction, the construction of the rank function R_{i_r}(.) is complete. Then the smallest lower one-sided 1 − α confidence interval [L_{i_r}, +∞) under the rank function R_{i_r}(.) is derived following Lemma 1, the smallest upper one-sided 1 − α confidence interval [0, U_{i_r}] follows Lemma 5, and [L_{i_r}, U_{i_r}] is a 1 − 2α interval.

Example 3 (continued). We apply the three intervals above to Example 3 and then compare to exact intervals from SAS (Version 9.3). The intervals are reported in Table 3. For example, the 95% intervals [L_{i_r}, +∞), [0, U_{i_r}] and the 90% interval [L_{i_r}, U_{i_r}] for θ_{i_r} are equal to [1.1671, +∞), [0, 2.0859] and [1.1671, 2.0859], respectively. The lower one-sided and two-sided intervals are subsets of (1, +∞), so the smoking group has a higher tumor rate than the control group. Regarding interval construction, we first compute L_{i_r} = 1.1671 with n₁ = 23, n₂ = 32, x = 21 and y = 19 (using the rank function R_{i_r}(.) and Lemma 1), then compute L_{i_r} = 0.4794 using n₁ = 32, n₂ = 23, x = 19 and y = 21, and then U_{i_r} = 1/0.4794 = 2.0859 (use Lemma 5). The calculation takes about 4 minutes on an HP-2760 laptop with Intel(R) Core(TM) i5=2520M CPU@2.50 GHz and 8 GB RAM using an R-code from the authors. SAS (Version 9.3) provides two exact intervals for θ_{i_r} using “proc freq; exact relrisk;”. The first interval (default in SAS, Santner and Snell, 1980) is computed by inverting two separate one-sided exact tests that use the unstandardized relative risk as the test statistic; it is clearly too wide. The second interval (method=fmscore) also inverts tests, but uses the Farrington-Manning relative risk score statistic (Chan and Zhang, 1999), which is a less discrete statistic than the raw relative risk, and produces much sharper confidence limits (Agresti and Min, 2001) than the default. However, our two-sided intervals are shorter. See Section 4.3 for another comparison.

Table 3.

Exact one-sided and two-sided intervals for θ_{i_r}, θ_{i_o} and p₁ − p₂ in Example 3 when n₁ = 23, x = 21, n₂ = 32, y = 19.

		Two-sided 90%		Two-sided 95%
		Lower 95%	Upper 95%	Lower 97.5%	Upper 97.5%
θ _{i_r}	Our method	1.1671	2.0859	1.1259	2.2289
	SAS(default)	0.1919	123356	0.0960	152092
	SAS(fmscore)	1.1755	2.1519	1.1204	2.2301

θ _{i_o}	Our method	1.9534	33.0987	1.5832	48.5190
	SAS	1.6022	48.2034	1.3114	71.3653

p₁ − p₂	Wang (2010)	0.1330	0.4860	0.0947	0.5126

Open in a new tab

4.2 INTERVALS FOR θ_{i_o}

The construction of intervals [L_i_o, +∞), [0, U_{i_o}] and [L_{i_o}, U_{i_o}] for θ_{i_o} is similar to that for θ_{i_r} since the monotonicity of θ_{i_o} as a function of p₁ and p₂ is the same for θ_{i_r}.

First, we have the following and skip the proof.

Lemma 6: Suppose [L_n2,n1(Y, X), +∞) is a lower one-sided 1 − α confidence interval for 1/θ_{i_o} = p₂(1 − p₁)/[p₁(1 − p₂)]. Then

[0, U (X, Y)] \overset{def}{=} [0, \frac{1}{L_{n_{2}, n_{1}} (Y, X)}]

(28)

is an upper one-sided 1 − α interval for θ_{i_o}. Suppose [L_n1,n2(X, Y), +∞) is a lower one-sided 1 − α interval for θ_{i_o}, then [L_n1,n2(X, Y), U(X, Y)] is a two-sided 1 − 2α interval for θ_{i_o}.

Secondly, the rank function R_{i_o}(X, Y) needed for the construction of L(X, Y) satisfies the same three rules for R_{i_r}(X, Y). However, there is a new computing issue. The parameter space H_i is rewritten in terms of (θ_{i_o}, p₂) with

p_{1} = \frac{θ_{i_{o}} p_{2}}{1 + (θ_{i_{o}} - 1) p_{2}}

(29)

as follows: $H_{i_{o}} = {(θ_{i_{o}}, p_{2}) : p_{2} \in D_{i_{o}} (θ_{i_{o}}) \overset{def}{=} [0, 1], \forall θ_{i_{o}} \in [0, + \infty)}$ , where D_{i_o}(θ_{i_o}) is an interval independent of θ_{i_o}. See Figure 1.

The parameter space *H_{i_o}* (the unit square), the sets of *θ_{i_o}* = 0.05 (the solid curve), *θ_{i_o}* = 20 with the equal p₂-spacing (the circle curve), *θ_{i_o}* = 10 with the equal u-spacing (the circle-line curve), two lines p₁ + p₂ = u for u = 0.5 (the dashed line, short) and u = 1 (the dashed line, long). Note *θ_{i_o}* ∈ [0, +∞), p₂ ∈ [0, 1] and u ∈ [0, 2].

The joint probability mass function p_i for (X, Y) is rewritten as p_{i_o}(x, y; θ_{i_o}, p₂) = p_i(x, y; p₁, p₂), where p_i and p₁ are given in (26) and (29), respectively. We need to compute probabilities on the curve of a fixed value for θ_{i_o} to find the infimum in (1) by a grid search on the curve. However, as shown in the circle curve of θ_{i_o} = 20 (Figure 1) that is obtained by partitioning [0, 1] with equal spacing in p₂, the partitioned points are clearly not evenly distributed on the curve. This is very different from Figure S4 (in Supplementary Materials), where the circle points are evenly distributed on the line of θ_{i_r} = constant. Had we used the circle points in Figure 1 for a grid search for the infimum in (1), it would have led to an inaccurate numerical solution. So we introduce a new parameter u so that both p₁ and p₂ are functions of u ∈ [0, 2] at each fixed value of θ_{i_o}:

(p_{2}, p_{1}) = {\begin{matrix} (g_{2} (u), g_{1} (u)), if θ_{i_{o}} \neq 1; \\ (\frac{u}{2}, \frac{u}{2}), otherwise, \end{matrix}

(30)

where

g_{2} (u) = \frac{u}{2} - \frac{1 + θ_{i_{o}} - \sqrt{{(θ_{i_{o}} - 1)}^{2} {(u - 1)}^{2} + 4 θ_{i_{o}}}}{2 (θ_{i_{o}} - 1)}, g_{1} (u) = \frac{θ_{i_{o}} g_{2} (u)}{1 + (θ_{i_{o}} - 1) g_{2} (u)} .

This makes the selected points on the curve much more evenly distributed, as shown in the circle-line curve of θ_{i_o} = 10. More importantly, these points are symmetric about the line p₁+p₂ = 1, which does not occur for the circle points. In fact, point (p₂, p₁) is the intersection of the curve with a fixed value θ_{i_o} and the line p₁ + p₂ = u for any u ∈ [0, 2]. Therefore, we partition u on interval [0, 2] with an equal spacing instead of p₂ on interval [0, 1]. This change is clearly justified by a comparison of the two spacings in Figure 1 (θ_{i_o} = 10 vs θ_{i_o} = 20).

Lastly, we use the rank function R_{i_o}(.) to derive the smallest 1 − α interval [L_{i_o}, +∞) following Lemma 1, and obtain [0, U_{i_o}] of level 1 − α and [L_{i_o}, U_{i_o}] of level 1 − 2α by Lemma 6.

Example 3 (continued). Three proposed exact intervals and their correspondents from SAS based on Thomas (1971) (see also Gart, 1971) are reported in Table 3. Lower one-sided and two-sided intervals are inside (1, +∞) indicating that the odds of developing a tumor for the smoking group was higher than the control group. The proposed intervals are much smaller subsets of those from SAS. An R code for the proposed intervals is available. Exact intervals for p₁ − p₂ in Wang (2010) are also included in Table 3.

4.3 A SMALL COMPARISON

The comparison between exact and approximate intervals is not valid since they do not have the same confidence level 1 − α, even though the approximate interval claims to be of level 1 − α. Here we present a limited comparison between the proposed exact 90% confidence intervals [L_{i_j}, U_{i_j}] for j = r and j = o in Case II and the corresponding exact 90% intervals from SAS using “proc freq; exact relrisk(method=fmscore); exact or;”, denoted by $[L_{i_{j}}^{SAS}, U_{i_{j}}^{SAS}]$ . When n₁ = n₂ = 10, there are 121 intervals on all sample points. We only compare the interval lengths that are finite, and they are given in Figure 2. Each point in the plot has coordinate ( $U_{i_{j}}^{SAS} - L_{i_{j}}^{SAS}$ , U_{i_j} − L_{i_j}) at a sample point (x, y) ∈ S_i. Most points are in the lower triangle; also the average of the length ratio, $(U_{i_{j}} - L_{i_{j}}) ∕ (U_{i_{j}}^{SAS} - L_{i_{j}}^{SAS})$ , over these sample points is equal to 0.9490992 and 0.723343, respectively, for j = r and j = o. Both indicate a shorter length for the proposed intervals. Figure 3 gives the coverage probability comparison of these intervals. The left one in Figure 3, for example, is the plot of the infimum coverage probabilities of two intervals, [L_{i_r}, U_{i_r}] and $[L_{i_{r}}^{SAS}, U_{i_{r}}^{SAS}]$ , over set D_{i_r}(θ_{i_r}) versus θ_{i_r}. The coverage probability of proposed intervals is closer to the nominal level than that from SAS. A similar result is expected for other sample sizes.

The length comparison for the proposed two-sided 90% intervals and the 90% intervals from SAS in Case II when n₁ = n₂ = 10. The horizontal axis is the length of the SAS interval and the vertical axis is for the proposed interval. Each circle is the length of two intervals at a sample point.

The coverage comparison for the proposed two-sided 90% intervals and the 90% intervals from SAS in Case II when n₁ = n₂ = 10.

5. SUMMARY

The relative risk and the odds ratio are commonly used in medical research to compare two treatments. Estimating them both with accuracy and precision is important for practitioners. In this paper, we propose twelve intervals in four sets, each set contains two one-sided intervals and one two-sided interval for each of four parameters θ_{p_r}, θ_{p_o}, θ_{i_r}, and θ_{i_o}. They are all of level 1 − α. The one-sided intervals are smallest under the rank functions, and are also admissible by the set inclusion criterion (see Wang, 2006). This indicates that a uniform improvement is impossible. An inductive construction is employed and in each step of the process, the shortest (best) interval is picked as shown, for example, in Eq (15). This, similar to Wang (2010, Proposition 3), indeed justifies the admissibility of the one-sided intervals. The computation time on intervals is affected by the number of nuisance parameters.

Supplementary Material

Supp MaterialS1

NIHMS700003-supplement-Supp_MaterialS1.pdf^{(255.3KB, pdf)}

Acknowledgements

The authors thank Dr. Yi-Hau Chen, an associate editor, and two referees for their valuable comments and suggestions that improved the manuscript significantly. Shan’s research is partially supported by NIH Grant 5U54GM104944.

Footnotes

6. SUPPLEMENTARY MATERIALS

Proofs of Lemmas 2 through 5 and Figures S1 through S4 referenced in Sections 3 and 4 are available with this paper at the Biometrics website on Wiley Online Library.

References

Agresti A. Categorical Data Analysis. 2nd ed John Wiley & Sons, Inc; New York: 2002. [Google Scholar]
Agresti A, Coull BA. Approximate is better than “exact” for interval estimation of binomial proportions. The American Statistician. 1998;52:119–126. [Google Scholar]
Agresti A, Min Y. On small-sample confidence intervals for parameters in discrete distributions. Biometrics. 2001;57:963–971. doi: 10.1111/j.0006-341x.2001.00963.x. [DOI] [PubMed] [Google Scholar]
Bentur L, Lapidot M, Livnat G, Hakim F, Lidroneta-Katz C, Porat I, Vilozni D, Elhasid R. Airway reactivity in children before and after stem cell transplantation. Pediatric Pulmonology. 2009;44:845–850. doi: 10.1002/ppul.20964. [DOI] [PubMed] [Google Scholar]
Bol’shev LN. On the construction of confidence limits. Theory of Probability and its Applications. 1965;10:173–177. (English translation) [Google Scholar]
Brown LD, Cai TT, DasGupta A. Interval estimation for a binomial proportion. Statistical Sciences. 2001;16:101–133. [Google Scholar]
Buehler RJ. Confidence intervals for the product of two binomial parameters. Journal of the American Statistical Association. 1957;52:482–493. [Google Scholar]
Chan ISF, Zhang Z. Test-based exact confidence intervals for the difference of two binomial proportions. Biometrics. 1999;55:1202–1209. doi: 10.1111/j.0006-341x.1999.01202.x. [DOI] [PubMed] [Google Scholar]
Casella G, Berger RL. Statistical Inference. Duxbury Press; Belmont, CA: 1990. [Google Scholar]
Chen J. The order relations in the sample spaces and the confidence limits for parameters. Advances in Mathematics. 1993;22:542–552. [Google Scholar]
Essenberg JM. Cigarette smoke and the incidence of primary neoplasm of the lung in albino mice. Science. 1952;116:561–562. doi: 10.1126/science.116.3021.561. [DOI] [PubMed] [Google Scholar]
Gart JJ. The comparison of proportions: a review of significance tests, confidence intervals, and adjustments for stratification. Review of the International Statistical Institute. 1971;39(2):148–169. [Google Scholar]
Goodman LA. The Analysis of Cross-Classified Data Having Ordered and/or Unordered Categories: Association Models, Correlation Models, and Asymmetry Models for Contingency Tables With or Without Missing Entries. The Annals of Statistics. 1985;13:10–69. [Google Scholar]
Huwang L. A note on the accuracy of an approximate interval for the binomial parameter. Statistics & Probability Letters. 1995;24:177–180. [Google Scholar]
Li Z, Taylor JMG, Nan B. Construction of confidence intervals and regions for ordered binomial probabilities. The American Statistician. 2010;64:291–298. [Google Scholar]
Lin Y, Newcombe RG, Lipsitz S, Carter RE. Fully specified bootstrap confidence intervals for the difference of two independent binomial proportions based on the median unbiased estimator. Statistics in Medicine. 2009;28:2876–2890. doi: 10.1002/sim.3670. [DOI] [PubMed] [Google Scholar]
Lloyd CJ, Kabaila P. On the optimality and limitations of Buehler bounds. Australian and New Zealand Journal of Statistics. 2003;45:167–174. [Google Scholar]
McCullagh P. Regression Models for Ordinal Data. Journal of the Royal Statistical Society. Series B. 1980;42:109–142. [Google Scholar]
Parzen M, Lipsitz S, Ibrahim J, Klar N. An estimate of the odds ratio that always exists. Journal of Computational and Graphical Statistics. 2002;11:420–436. [Google Scholar]
Santner TJ, Snell MK. Small-sample confidence intervals for p1−p2 and p1/p2 in contingency tables. Journal of the American Statistical Association. 1980;75:386–394. [Google Scholar]
Shan G, Wang W. ExactCIdiff: an R package for computing exact confidence intervals for the difference of two proportions. The R Journal. 2013;5(2):62–70. [Google Scholar]
StatXact 10th release of the most popular exact statistics analysis software. Cytel Inc. 675 Massachusetts Avenue; Cambridge, MA: 2013. p. 02139. [Google Scholar]
Thomas DG. Algorithm AS-36: exact confidence limits for the odds ratio in a 2 × 2 table. Applied Statistics. 1971;20:105–110. [Google Scholar]
Wang W. Smallest confidence intervals for one binomial proportion. Journal of Statistical Planning and Inference. 2006;136:4293–4306. [Google Scholar]
Wang W. On construction of the smallest one-sided confidence interval for the difference of two proportions. The Annals of Statistics. 2010;38:1227–1243. [Google Scholar]
Wang W. An inductive order construction for the difference of two dependent proportions. Statistics & Probability Letters. 2012;82:1623–1628. [Google Scholar]
Wang W. A note on bootstrap confidence intervals for proportions. Statistics & Probability Letters. 2013;83:2699–2702. [Google Scholar]
Wang W, Zhang Z. Asymptotic infimum coverage probability for interval estimation of proportions. Metrika. 2014;77:635–646. [Google Scholar]
Wilson EB. Probable inference, the law of succession, and statistical inference. Journal of the American Statistical Association. 1927;22:209–212. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp MaterialS1

NIHMS700003-supplement-Supp_MaterialS1.pdf^{(255.3KB, pdf)}

[R1] Agresti A. Categorical Data Analysis. 2nd ed John Wiley & Sons, Inc; New York: 2002. [Google Scholar]

[R2] Agresti A, Coull BA. Approximate is better than “exact” for interval estimation of binomial proportions. The American Statistician. 1998;52:119–126. [Google Scholar]

[R3] Agresti A, Min Y. On small-sample confidence intervals for parameters in discrete distributions. Biometrics. 2001;57:963–971. doi: 10.1111/j.0006-341x.2001.00963.x. [DOI] [PubMed] [Google Scholar]

[R4] Bentur L, Lapidot M, Livnat G, Hakim F, Lidroneta-Katz C, Porat I, Vilozni D, Elhasid R. Airway reactivity in children before and after stem cell transplantation. Pediatric Pulmonology. 2009;44:845–850. doi: 10.1002/ppul.20964. [DOI] [PubMed] [Google Scholar]

[R5] Bol’shev LN. On the construction of confidence limits. Theory of Probability and its Applications. 1965;10:173–177. (English translation) [Google Scholar]

[R6] Brown LD, Cai TT, DasGupta A. Interval estimation for a binomial proportion. Statistical Sciences. 2001;16:101–133. [Google Scholar]

[R7] Buehler RJ. Confidence intervals for the product of two binomial parameters. Journal of the American Statistical Association. 1957;52:482–493. [Google Scholar]

[R8] Chan ISF, Zhang Z. Test-based exact confidence intervals for the difference of two binomial proportions. Biometrics. 1999;55:1202–1209. doi: 10.1111/j.0006-341x.1999.01202.x. [DOI] [PubMed] [Google Scholar]

[R9] Casella G, Berger RL. Statistical Inference. Duxbury Press; Belmont, CA: 1990. [Google Scholar]

[R10] Chen J. The order relations in the sample spaces and the confidence limits for parameters. Advances in Mathematics. 1993;22:542–552. [Google Scholar]

[R11] Essenberg JM. Cigarette smoke and the incidence of primary neoplasm of the lung in albino mice. Science. 1952;116:561–562. doi: 10.1126/science.116.3021.561. [DOI] [PubMed] [Google Scholar]

[R12] Gart JJ. The comparison of proportions: a review of significance tests, confidence intervals, and adjustments for stratification. Review of the International Statistical Institute. 1971;39(2):148–169. [Google Scholar]

[R13] Goodman LA. The Analysis of Cross-Classified Data Having Ordered and/or Unordered Categories: Association Models, Correlation Models, and Asymmetry Models for Contingency Tables With or Without Missing Entries. The Annals of Statistics. 1985;13:10–69. [Google Scholar]

[R14] Huwang L. A note on the accuracy of an approximate interval for the binomial parameter. Statistics & Probability Letters. 1995;24:177–180. [Google Scholar]

[R15] Li Z, Taylor JMG, Nan B. Construction of confidence intervals and regions for ordered binomial probabilities. The American Statistician. 2010;64:291–298. [Google Scholar]

[R16] Lin Y, Newcombe RG, Lipsitz S, Carter RE. Fully specified bootstrap confidence intervals for the difference of two independent binomial proportions based on the median unbiased estimator. Statistics in Medicine. 2009;28:2876–2890. doi: 10.1002/sim.3670. [DOI] [PubMed] [Google Scholar]

[R17] Lloyd CJ, Kabaila P. On the optimality and limitations of Buehler bounds. Australian and New Zealand Journal of Statistics. 2003;45:167–174. [Google Scholar]

[R18] McCullagh P. Regression Models for Ordinal Data. Journal of the Royal Statistical Society. Series B. 1980;42:109–142. [Google Scholar]

[R19] Parzen M, Lipsitz S, Ibrahim J, Klar N. An estimate of the odds ratio that always exists. Journal of Computational and Graphical Statistics. 2002;11:420–436. [Google Scholar]

[R20] Santner TJ, Snell MK. Small-sample confidence intervals for p1−p2 and p1/p2 in contingency tables. Journal of the American Statistical Association. 1980;75:386–394. [Google Scholar]

[R21] Shan G, Wang W. ExactCIdiff: an R package for computing exact confidence intervals for the difference of two proportions. The R Journal. 2013;5(2):62–70. [Google Scholar]

[R22] StatXact 10th release of the most popular exact statistics analysis software. Cytel Inc. 675 Massachusetts Avenue; Cambridge, MA: 2013. p. 02139. [Google Scholar]

[R23] Thomas DG. Algorithm AS-36: exact confidence limits for the odds ratio in a 2 × 2 table. Applied Statistics. 1971;20:105–110. [Google Scholar]

[R24] Wang W. Smallest confidence intervals for one binomial proportion. Journal of Statistical Planning and Inference. 2006;136:4293–4306. [Google Scholar]

[R25] Wang W. On construction of the smallest one-sided confidence interval for the difference of two proportions. The Annals of Statistics. 2010;38:1227–1243. [Google Scholar]

[R26] Wang W. An inductive order construction for the difference of two dependent proportions. Statistics & Probability Letters. 2012;82:1623–1628. [Google Scholar]

[R27] Wang W. A note on bootstrap confidence intervals for proportions. Statistics & Probability Letters. 2013;83:2699–2702. [Google Scholar]

[R28] Wang W, Zhang Z. Asymptotic infimum coverage probability for interval estimation of proportions. Metrika. 2014;77:635–646. [Google Scholar]

[R29] Wilson EB. Probable inference, the law of succession, and statistical inference. Journal of the American Statistical Association. 1927;22:209–212. [Google Scholar]

PERMALINK

Exact Confidence Intervals for the Relative Risk and the Odds Ratio

Weizhen Wang

Guogen Shan

Summary

1. INTRODUCTION

2. PRELIMINARY RESULTS

3. CASE I: A MATCHED-PAIRS DESIGN

3.1 INTERVALS FOR θ_{p_r}

Table 1.

Table 2.

3.2 INTERVALS FOR θ_{p_o}

4. CASE II: A TWO-ARM INDEPENDENT BINOMIAL EXPERIMENT

4.1 INTERVALS FOR θ_{i_r}

Table 3.

4.2 INTERVALS FOR θ_{i_o}

Figure 1.

4.3 A SMALL COMPARISON

Figure 2.

Figure 3.

5. SUMMARY

Supplementary Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Exact Confidence Intervals for the Relative Risk and the Odds Ratio

Weizhen Wang

Guogen Shan

Summary

1. INTRODUCTION

2. PRELIMINARY RESULTS

3. CASE I: A MATCHED-PAIRS DESIGN

3.1 INTERVALS FOR θpr

Table 1.

Table 2.

3.2 INTERVALS FOR θpo

4. CASE II: A TWO-ARM INDEPENDENT BINOMIAL EXPERIMENT

4.1 INTERVALS FOR θir

Table 3.

4.2 INTERVALS FOR θio

Figure 1.

4.3 A SMALL COMPARISON

Figure 2.

Figure 3.

5. SUMMARY

Supplementary Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

3.1 INTERVALS FOR θ_{p_r}

3.2 INTERVALS FOR θ_{p_o}

4.1 INTERVALS FOR θ_{i_r}

4.2 INTERVALS FOR θ_{i_o}