Response to comment on “Randomization inference for treatment effects on a binary outcome”

Joseph Rigdon; Wen Wei Loh; Michael G Hudgens

doi:10.1002/sim.7192

. Author manuscript; available in PMC: 2018 Feb 28.

Published in final edited form as: Stat Med. 2017 Feb 28;36(5):876–880. doi: 10.1002/sim.7192

Response to comment on “Randomization inference for treatment effects on a binary outcome”

Joseph Rigdon ^a, Wen Wei Loh ^b, Michael G Hudgens ^b

PMCID: PMC5358813 NIHMSID: NIHMS831922 PMID: 28093845

Abstract

We thank Professor Yasutaka Chiba [1] for commenting on Rigdon and Hudgens (RH) [2]. Chiba [1] described a certain exact confidence interval reported in RH as “somewhat unnatural.” Chiba also presented an alternative approach to constructing confidence intervals [3]. In this response, we (i) provide a simple explanation why the confidence interval in RH appeared “unnatural,” and (ii) explain the relationship between the RH [2] and Chiba [3] confidence intervals. Essentially the two approaches are equivalent, except RH entails inverting one two-sided test whereas Chiba inverts two one-sided tests. We present a more computationally efficient method (RLH) for computing the RH intervals based on Chiba’s principal stratification formulation of the problem. We also propose a third method based on Blaker [4] which inverts a single two-sided test but forms a confidence interval that is at least as narrow as inverting two one-sided tests. Simulation results show the RLH intervals tend to be as narrow or narrower than the Chiba and Blaker intervals on average.

Keywords: additivity, causal inference, exact confidence interval, permutation tests, randomization inference

1. Introduction

We would like to thank Professor Yasutaka Chiba [1] for his comments on Rigdon and Hudgens (RH) [2]. In this response, we compare Chiba’s approach to constructing confidence intervals (CIs), as described in [3], with the approach in RH. First, we explain that despite the different parameterizations, both approaches test the same null hypotheses for the average causal effect of treatment, using the same randomization distributions to carry out the permutation tests. The difference between the two methods lies in the choice of whether to invert and combine two one-sided tests as Chiba proposed, or to invert a single two-sided test as in RH.

Next, we present a more computationally efficient method (RLH) for computing the RH intervals based on Chiba’s principal stratification formulation of the problem. We also propose a third method based on Blaker [4] which inverts a single two-sided test but forms a CI that is at least as narrow as Chiba’s method. Our simulation results show the RLH intervals tend to be narrowest on average among the three methods, but the differences between the methods are reduced either as the magnitude of the average causal effect increases or as the population size increases.

Finally, we provide a simple explanation why a certain CI reported in RH was described by Chiba as “somewhat unnatural” [1]. As the original formulation of the RH approach could be computationally intensive, Monte Carlo sampling was used to approximate p-values when carrying out the hypothesis tests. The interval reported in RH appeared “unnatural” because of the Monte Carlo approximation. Using the RLH approach without Monte Carlo sampling, we obtain the same interval as Chiba [1, 3].

2. Methods

Suppose m of n individuals are randomized to either treatment or control. Assume m is fixed by experimental design. Let Z_j = 1 if individual j is assigned treatment and Z_j = 0 otherwise. Let the binary outcome of interest be coded as either Y_j = 1 or Y_j = 0. Assume prior to randomization, each individual has two potential outcomes: y_j(1) if treatment is assigned and y_j(0) if control is assigned. Consequently, the treatment assignment reveals the observed outcome Y_j = Z_jy_j(1) + (1 − Z_j)y_j(0). The parameter of interest is the average causal effect of treatment $τ = \sum_{j = 1}^{n} δ_{j} / n$ , where δ_j = y_j(1) − y_j(0). The data observed from the experiment can be summarized using the notation in Table 1. The observed difference in proportions T = a/(a + b) − c/(c + d) is an unbiased estimator of τ [5].

Table 1.

Observed data from experiment

	Outcome
Randomization Assignment	Y = 1	Y = 0	Total
Z = 1	a	b	a + b = m
Z = 0	c	d	c + d = n − m

Total	a + c	b + d	n

Open in a new tab

2.1. Exact Confidence Interval of Rigdon and Hudgens (2015)

Two exact CIs for τ were proposed in [2]. The first combines two prediction intervals for attributable effects [6], while the second entails inverting a permutation test. Simulations presented in [2] showed the permutation-based CIs tend to be narrower than the attributable effects-based CIs. Only the permutation-based CI will be considered below.

The observed data reveal one of the two potential outcomes for each individual. Since the missing potential outcome is either 0 or 1 for a binary outcome, there are 2ⁿ possible unique values for the vector δ = (δ₁, …, δ_n) given the observed data. Furthermore, there are n + 1 unique values of τ that are compatible with the observed data and comprise a set of width one, where the width of a set is defined as the difference between the maximum and minimum elements of the set [2, §1]. A confidence set for τ can be constructed by conducting hypothesis tests about δ as follows.

To test the null hypothesis $H_{0} : δ = δ^{0} = (δ_{1}^{0}, \dots, δ_{n}^{0})$ , a test statistic can be chosen, its distribution under H₀ computed, and a measure of extremeness of the observed data defined [7, §4.1]. A natural choice for the test statistic is T. The sampling distribution of T under H₀ can be determined exactly by computing T for each of the $C = (\begin{matrix} n \\ m \end{matrix})$ possible randomizations for {Z_j : j = 1, …, n} because all potential outcomes are known under the sharp null H₀. For randomizations c = 1, …, C, let t^c denote the value of T under H₀. Each randomization occurs with probability 1/C, so a two-sided p-value can be defined as $\sum_{c = 1}^{C} 1 {| t^{c} - τ^{0} | \geq | t^{obs} - τ^{0} |} / C$ where t^obs is the value of T for the observed data and $τ^{0} = \sum_{j = 1}^{n} δ_{j}^{0} / n$ . The subset of δ⁰ vectors where the p-value is greater than or equal to α forms a 100(1 − α)% confidence set for δ. The τ⁰ values corresponding to the δ⁰ vectors in this confidence set for δ then form a 100(1 − α)% confidence set for τ.

However, one need not test all 2ⁿ hypotheses for H₀ : δ = δ⁰ to obtain an exact confidence set for τ. The set of possible δ⁰ values can be partitioned into (a + 1)(b + 1)(c + 1)(d + 1) subsets such that all δ⁰ in the same subset yield the same p-value. It is thus only necessary to test one δ⁰ in each of these subsets [2, §3]; this approach was proposed in RH. When the total number of possible treatment assignments C becomes large, RH suggested approximating the p-value for each hypothesis test with a Monte Carlo random sample of nperm assignments instead of evaluating all C possibilities.

2.2. Exact Confidence Interval of Chiba (2015)

Chiba [3] proposed an exact CI for τ using a principal stratification approach. Let n_st denote the number of individuals with {y_j(1) = s, y_j(0) = t} for s, t = 0, 1, such that τ = (n₁₀ − n₀₁)/n. Chiba considered null hypotheses about the parameter n = (n₁₁, n₁₀, n₀₁, n₀₀) that are compatible with the observed data, i.e., values of n that satisfy the conditions:

\begin{matrix} \sum_{s = 0}^{1} \sum_{t = 0}^{1} n_{st} = n, & n_{11} \leq a + c, & n_{10} \leq a + d, & n_{01} \leq b + c, & n_{00} \leq b + d, \\ n_{11} + n_{10} \leq n - b, & n_{11} + n_{01} \leq n - d, & n_{00} + n_{10} \leq n - c, & n_{00} + n_{01} \leq n - a . \end{matrix}

(1)

Let n_st,z denote the number of individuals with {y_j(1) = s, y_j(0) = t} and Z_j = z for s, t = 0, 1. For m fixed by design, the conditional probability of (n_11,1, n_10,1, n_01,1, n_00,1) given n is:

p_{n}^{*} (n_{11, 1}, n_{10, 1}, n_{01, 1}, n_{00, 1}) = \prod_{s = 0}^{1} \prod_{t = 0}^{1} (\begin{matrix} n_{st} \\ n_{st, 1} \end{matrix}) / (\begin{matrix} n \\ m \end{matrix}) .

(2)

Chiba proposed the following conditional exact p-value to test a null hypothesis for a given value of n:

P_{f} (t_{obs}; n) = \sum_{i = 0}^{n_{11}} \sum_{j = 0}^{n_{10}} \sum_{k = 0}^{n_{01}} \sum_{l = 0}^{n_{00}} f (T, t_{obs}) p_{n}^{*} (i, j, k, l) 1 {i + j + k + l = m},

(3)

where 1{B} = 1 if B is true and 0 otherwise, and f is a function of T = (i + j)/m − {(n₁₁ − i) + (n₀₁ − k)} /(n − m) and t_obs. Unlike the RH method described above, Chiba’s approach does not explicitly compute a test statistic for all $(\begin{matrix} n \\ m \end{matrix})$ possible randomization assignments, thus potentially obviating the need for Monte Carlo approximation of the p-values P_f (t_obs; n). Chiba’s 100(1 − α)% CI [3, §1] entails inverting two one-sided tests and equals [L₁, U₁] where:

L_{1} = min {\frac{n_{10} - n_{01}}{n} : P_{f_{L, 1}} (t_{obs}; n) \geq α / 2}, U_{1} = max {\frac{n_{10} - n_{01}}{n} : P_{f_{U, 1}} (t_{obs}; n) \geq α / 2},

(4)

and f_L,1 = 1{(T − t_obs) ≥ 0} and f_U,1 = 1{(T − t_obs) ≤ 0}.

2.3. Comparison of RH and Chiba

In general, the approach of inverting and combining two separate one-sided tests is termed the tail method [8]. An alternative to Chiba’s tail method CI is to invert a single two-sided test. A 100(1 − α)% CI analogous to (4) that inverts a single two-sided test is [L₂, U₂], where:

L_{2} = min {\frac{n_{10} - n_{01}}{n} : P_{f_{2}} (t_{obs}; n) \geq α}, U_{2} = max {\frac{n_{10} - n_{01}}{n} : P_{f_{2}} (t_{obs}; n) \geq α},

(5)

and f₂ = 1{|T − (n₁₀ − n₀₁)/n| ≥ |t_obs − (n₁₀ − n₀₁)/n|}. The interval [L₂, U₂] in (5) is equivalent to the permutation-based interval RH in [2]. Note in the finite population where n is fixed, one may determine an element n_st in n if given the other three, for example n₀₀ = n − n₁₁ − n₁₀ − n₀₁. The set of possible values for the parameter n that satisfy (1) thus lies in a three-dimensional space [9]. Therefore, under the formulation in (5), one need only test O(n³) hypotheses for n, as compared to the O(n⁴) hypotheses under the RH formulation. Moreover, the requirement for Monte Carlo approximations of the p-values is also potentially avoided. We will henceforth refer to this modified approach as RLH. Li and Deng [10] describe a more computationally efficient approach to construct these intervals which requires only O(n²) hypothesis tests, but the resulting CIs may potentially be wider.

2.4. Blaker Confidence Interval

An alternative approach that inverts a single two-sided test but forms an interval necessarily contained within the interval using the tail method was proposed by Blaker [4]. Let the minimum one-sided tail probability of T be denoted as γ(T, n) = min {P_{f_L,1} (T; n), P_{f_U,1} (T; n)}, and let f_γ = 1{γ(T, n) ≤ γ(t_obs, n)}. A 100(1 − α)% CI is [L_γ, U_γ], henceforth referred to as a Blaker interval, where:

L_{γ} = min {\frac{n_{10} - n_{01}}{n} : P_{f_{γ}} (t_{obs}; n) \geq α}, U_{γ} = max {\frac{n_{10} - n_{01}}{n} : P_{f_{γ}} (t_{obs}; n) \geq α} .

(6)

The Chiba, RLH and Blaker intervals can be computed using version 1.3 of the R [11] package RI2by2 [12].

3. Results

3.1. Simulation Study

We compared the Chiba interval (4), the RLH interval (5), and the Blaker interval (6) in a simulation study. The simulations were carried out as described in scenarios (i) and (ii) of [2], and the results are summarized in Tables 2 and 3 respectively. The CIs using all three methods had coverage greater than the nominal level. For the scenarios where exactly 50% were assigned to treatment, all three methods returned identical intervals. For the scenarios where either 30% or 70% were assigned to treatment, the RLH intervals were as narrow or narrower than both the Blaker and Chiba intervals on average for smaller values of n. The Blaker intervals tended to be slightly narrower than the Chiba intervals. The differences between the methods were reduced when either n or the magnitude of the true average causal effect τ increased.

Table 2.

Simulation results under scenario (i) in [2]. Table entries give the average empirical width [coverage] of 95% CIs, where τ is the true average treatment effect, % treatment is the percent of n total individuals assigned to treatment in each experiment, Chiba CIs are given by (4), RLH CIs by (5), and Blaker CIs by (6).

		30% treatment			50% treatment			70% treatment
n	Method	τ = 0.2	τ = 0.5	τ = 0.95	τ = 0.2	τ = 0.5	τ = 0.95	τ = 0.2	τ = 0.5	τ = 0.95
20	Chiba	0.76[1.00]	0.72[1.00]	0.39[1.00]	0.72[1.00]	0.69[1.00]	0.30[1.00]	0.76[1.00]	0.72[1.00]	0.39[1.00]
	Blaker	0.73[1.00]	0.68[1.00]	0.39[1.00]	0.72[1.00]	0.69[1.00]	0.30[1.00]	0.73[1.00]	0.67[1.00]	0.39[1.00]
	RLH	0.71[1.00]	0.66[1.00]	0.39[1.00]	0.72[1.00]	0.69[1.00]	0.30[1.00]	0.71[1.00]	0.66[1.00]	0.39[1.00]
40	Chiba	0.58[0.99]	0.51[0.99]	0.25[1.00]	0.54[1.00]	0.47[1.00]	0.17[1.00]	0.58[0.99]	0.51[0.99]	0.25[1.00]
	Blaker	0.57[0.99]	0.49[0.99]	0.23[1.00]	0.54[1.00]	0.47[1.00]	0.17[1.00]	0.57[0.99]	0.49[0.99]	0.23[1.00]
	RLH	0.56[0.99]	0.49[0.99]	0.24[1.00]	0.54[1.00]	0.47[1.00]	0.17[1.00]	0.56[0.99]	0.49[0.99]	0.24[1.00]
60	Chiba	0.48[0.99]	0.41[0.99]	0.18[1.00]	0.45[1.00]	0.38[1.00]	0.13[1.00]	0.48[0.99]	0.41[0.99]	0.18[1.00]
	Blaker	0.48[0.99]	0.40[0.99]	0.17[1.00]	0.45[1.00]	0.38[1.00]	0.13[1.00]	0.48[0.99]	0.41[0.99]	0.17[1.00]
	RLH	0.47[0.99]	0.40[0.99]	0.18[1.00]	0.45[1.00]	0.38[1.00]	0.13[1.00]	0.47[0.99]	0.40[0.99]	0.18[1.00]
100	Chiba	0.39[0.99]	0.32[1.00]	0.13[1.00]	0.36[1.00]	0.29[1.00]	0.10[1.00]	0.39[0.99]	0.32[1.00]	0.13[1.00]
	Blaker	0.38[0.99]	0.31[1.00]	0.12[1.00]	0.36[1.00]	0.29[1.00]	0.10[1.00]	0.38[0.99]	0.32[1.00]	0.12[0.99]
	RLH	0.38[0.99]	0.31[1.00]	0.13[1.00]	0.36[1.00]	0.29[1.00]	0.10[1.00]	0.38[0.99]	0.31[1.00]	0.13[0.99]

Open in a new tab

Table 3.

Simulation results under scenario (ii) in [2]. Table entries give the average empirical width [coverage] of 95% CIs, where γ is the degree of additivity, % treatment is the percent of n total individuals assigned to treatment in each experiment, Chiba CIs are given by (4), RLH CIs by (5), and Blaker CIs by (6).

		30% treatment			50% treatment			70% treatment
n	Method	γ = 0.2	γ = 0.8	γ = 1	γ = 0.2	γ = 0.8	γ = 1	γ = 0.2	γ = 0.8	γ = 1
20	Chiba	0.77[1.00]	0.76[0.99]	0.76[0.98]	0.72[1.00]	0.72[1.00]	0.72[0.99]	0.77[1.00]	0.76[0.99]	0.76[0.98]
	Blaker	0.74[1.00]	0.73[0.98]	0.73[0.97]	0.72[1.00]	0.72[1.00]	0.72[0.99]	0.73[1.00]	0.73[0.99]	0.73[0.97]
	RLH	0.71[1.00]	0.71[0.98]	0.71[0.97]	0.72[1.00]	0.72[1.00]	0.72[0.99]	0.71[1.00]	0.71[0.99]	0.71[0.97]
40	Chiba	0.58[1.00]	0.58[1.00]	0.58[0.98]	0.55[1.00]	0.55[0.99]	0.55[0.98]	0.58[1.00]	0.58[0.98]	0.58[0.97]
	Blaker	0.57[1.00]	0.57[0.99]	0.57[0.96]	0.55[1.00]	0.55[0.99]	0.55[0.97]	0.57[1.00]	0.57[0.97]	0.57[0.95]
	RLH	0.57[1.00]	0.56[0.99]	0.56[0.97]	0.55[1.00]	0.55[0.99]	0.55[0.98]	0.57[1.00]	0.56[0.97]	0.56[0.96]
60	Chiba	0.49[1.00]	0.49[0.98]	0.49[0.97]	0.46[1.00]	0.46[0.99]	0.46[0.97]	0.49[1.00]	0.49[0.98]	0.49[0.97]
	Blaker	0.48[1.00]	0.48[0.98]	0.48[0.96]	0.46[1.00]	0.46[0.99]	0.46[0.97]	0.48[1.00]	0.48[0.98]	0.48[0.97]
	RLH	0.48[1.00]	0.48[0.98]	0.48[0.96]	0.46[1.00]	0.46[0.99]	0.46[0.97]	0.48[1.00]	0.48[0.98]	0.48[0.97]
100	Chiba	0.39[1.00]	0.39[0.98]	0.39[0.97]	0.36[1.00]	0.36[0.98]	0.36[0.96]	0.39[1.00]	0.39[0.98]	0.39[0.97]
	Blaker	0.39[1.00]	0.39[0.97]	0.39[0.96]	0.36[1.00]	0.36[0.98]	0.36[0.96]	0.39[1.00]	0.39[0.98]	0.39[0.96]
	RLH	0.39[1.00]	0.39[0.97]	0.39[0.96]	0.36[1.00]	0.36[0.98]	0.36[0.96]	0.39[1.00]	0.39[0.98]	0.39[0.96]

Open in a new tab

3.2. Counterexamples

While the RLH intervals tend to be narrower than the Chiba intervals, it is possible that the Chiba intervals are narrower for particular data sets. For example, suppose a = 11, b = 1, c = 7, d = 21. Then both the Chiba and Blaker 95% CIs equal [0.400, 0.775] whereas the RLH interval equals [0.375, 0.775].

The Blaker CI may also be narrower than both the Chiba and RLH CIs for particular data sets. For example, suppose a = 7, b = 5, c = 1, d = 27. For these data the Blaker 95% CI equals [0.275, 0.750], whereas the RLH interval equals [0.250, 0.750] and the Chiba interval equals [0.250, 0.775].

3.3. Application

In the vaccine adherence example presented in §4.3 of [2], n = 96 injection drug users were randomized to a monetary incentive group or an outreach group. Of the m = 48 individuals in the monetary incentive group, a = 33 were adherent, and of the n − m = 48 in the outreach group, c = 11 were adherent. The estimated average causal effect was t^obs = 33/48 − 11/48 ≈ 0.46. The RH approach entails testing (33 + 1)(15 + 1)(11 + 1)(37 + 1) = 248064 hypotheses to obtain an exact CI. Given the large number of possible treatment assignments $C = (\begin{matrix} 96 \\ 48 \end{matrix}) \approx 6 \times 10^{27}$ , RH approximated the p-value for each hypothesis test with nperm=100 re-randomizations. The resulting 95% CI in [2] was [0.28, 0.64], which Chiba [1] described as “somewhat unnatural.” However, because only 100 re-randomizations were used, the resulting CI in RH was heavily dependent on the random seed, with different random seeds producing different intervals. Using RLH without Monte Carlo sampling, the interval equals [0.28125, 0.59375], which is the same as Chiba’s interval [1, 3]. The Blaker interval also equals [0.28125, 0.59375].

Acknowledgments

The authors thank the Editor for the invitation to respond to Professor Chiba’s comments. This research was supported in part by the National Institutes of Health and by a Gillings Innovation Laboratory award from the UNC Gillings School of Global Public Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

References

1.Chiba Y. A note on exact confidence interval for causal effects on a binary outcome in randomized trials. Statistics in Medicine. 2016;35(10):1739–1741. doi: 10.1002/sim.6826. [DOI] [PubMed] [Google Scholar]
2.Rigdon J, Hudgens MG. Randomization inference for treatment effects on a binary outcome. Statistics in Medicine. 2015;34(6):924–935. doi: 10.1002/sim.6384. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Chiba Y. Exact tests for the weak causal null hypothesis on a binary outcome in randomized trials. Journal of Biometrics & Biostatistics. 2015;6(244) doi: 10.1002/bimj.201600085. [DOI] [PubMed] [Google Scholar]
4.Blaker H. Confidence curves and improved exact confidence intervals for discrete distributions. Canadian Journal of Statistics. 2000;28(4):783–798. [Google Scholar]
5.Neyman J. On the application of probability theory to agricultural experiments. Essay on principles. Section 9. Annals of Agricultural Science 1923. In: Dabrowska DM, Speed TP, translators. Statistical Science. 4. Vol. 5. 1990. pp. 465–472. [Google Scholar]
6.Rosenbaum PR. Effects attributable to treatment: Inference in experiments and observational studies with a discrete pivot. Biometrika. 2001;88(1):219–231. [Google Scholar]
7.Rubin DB. Practical implications of modes of statistical inference for causal effects and the critical role of the assignment mechanism. Biometrics. 1991;47(4):1213–1234. [PubMed] [Google Scholar]
8.Agresti A. Dealing with discreteness: Making ‘exact’ confidence intervals for proportions, differences of proportions, and odds ratios more exact. Statistical Methods in Medical Research. 2003;12(1):3–21. doi: 10.1191/0962280203sm311ra. [DOI] [PubMed] [Google Scholar]
9.Copas J. Randomization models for the matched and unmatched 2×2 tables. Biometrika. 1973;60(3):467–476. [Google Scholar]
10.Li X, Ding P. Exact confidence intervals for the average causal effect on a binary outcome. Statistics in Medicine. 2016;35(6):957–960. doi: 10.1002/sim.6764. [DOI] [PubMed] [Google Scholar]
11.R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2016. URL https://www.R-project.org/ [Google Scholar]
12.Rigdon J, Loh WW, Hudgens MG. RI2by2: Randomization Inference for Treatment Effects on a Binary Outcome. R package version 1.3 [Google Scholar]

[R1] 1.Chiba Y. A note on exact confidence interval for causal effects on a binary outcome in randomized trials. Statistics in Medicine. 2016;35(10):1739–1741. doi: 10.1002/sim.6826. [DOI] [PubMed] [Google Scholar]

[R2] 2.Rigdon J, Hudgens MG. Randomization inference for treatment effects on a binary outcome. Statistics in Medicine. 2015;34(6):924–935. doi: 10.1002/sim.6384. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Chiba Y. Exact tests for the weak causal null hypothesis on a binary outcome in randomized trials. Journal of Biometrics & Biostatistics. 2015;6(244) doi: 10.1002/bimj.201600085. [DOI] [PubMed] [Google Scholar]

[R4] 4.Blaker H. Confidence curves and improved exact confidence intervals for discrete distributions. Canadian Journal of Statistics. 2000;28(4):783–798. [Google Scholar]

[R5] 5.Neyman J. On the application of probability theory to agricultural experiments. Essay on principles. Section 9. Annals of Agricultural Science 1923. In: Dabrowska DM, Speed TP, translators. Statistical Science. 4. Vol. 5. 1990. pp. 465–472. [Google Scholar]

[R6] 6.Rosenbaum PR. Effects attributable to treatment: Inference in experiments and observational studies with a discrete pivot. Biometrika. 2001;88(1):219–231. [Google Scholar]

[R7] 7.Rubin DB. Practical implications of modes of statistical inference for causal effects and the critical role of the assignment mechanism. Biometrics. 1991;47(4):1213–1234. [PubMed] [Google Scholar]

[R8] 8.Agresti A. Dealing with discreteness: Making ‘exact’ confidence intervals for proportions, differences of proportions, and odds ratios more exact. Statistical Methods in Medical Research. 2003;12(1):3–21. doi: 10.1191/0962280203sm311ra. [DOI] [PubMed] [Google Scholar]

[R9] 9.Copas J. Randomization models for the matched and unmatched 2×2 tables. Biometrika. 1973;60(3):467–476. [Google Scholar]

[R10] 10.Li X, Ding P. Exact confidence intervals for the average causal effect on a binary outcome. Statistics in Medicine. 2016;35(6):957–960. doi: 10.1002/sim.6764. [DOI] [PubMed] [Google Scholar]

[R11] 11.R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2016. URL https://www.R-project.org/ [Google Scholar]

[R12] 12.Rigdon J, Loh WW, Hudgens MG. RI2by2: Randomization Inference for Treatment Effects on a Binary Outcome. R package version 1.3 [Google Scholar]

PERMALINK

Response to comment on “Randomization inference for treatment effects on a binary outcome”

Joseph Rigdon

Wen Wei Loh

Michael G Hudgens

Abstract

1. Introduction

2. Methods

Table 1.

2.1. Exact Confidence Interval of Rigdon and Hudgens (2015)

2.2. Exact Confidence Interval of Chiba (2015)

2.3. Comparison of RH and Chiba

2.4. Blaker Confidence Interval

3. Results

3.1. Simulation Study

Table 2.

Table 3.

3.2. Counterexamples

3.3. Application

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Response to comment on “Randomization inference for treatment effects on a binary outcome”

Joseph Rigdon

Wen Wei Loh

Michael G Hudgens

Abstract

1. Introduction

2. Methods

Table 1.

2.1. Exact Confidence Interval of Rigdon and Hudgens (2015)

2.2. Exact Confidence Interval of Chiba (2015)

2.3. Comparison of RH and Chiba

2.4. Blaker Confidence Interval

3. Results

3.1. Simulation Study

Table 2.

Table 3.

3.2. Counterexamples

3.3. Application

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases