Sample Size and Power Calculations for Additive Interactions

TJ VanderWeele

doi:10.1515/2161-962X.1010

Author manuscript; available in PMC: 2014 Dec 1.

Published in final edited form as: Epidemiol Methods. 2012 Aug 29;1(1):159–188. doi: 10.1515/2161-962X.1010

PMCID: PMC4249707 NIHMSID: NIHMS632459 PMID: 25473594

Introduction

The literature on power and sample size calculations for interaction has focused on the multiplicative scale (Lubin and Gail, 1990; Hwang et al., 1994; Foppa and Spiegelman, 1997; Yang et al., 1999; Garcia-Closas and Lubin, 1999; Qiu et al., 2000; Luan et al., 2001; Gauderman, 2002a, b; Sturmer, 2002; Wang et al., 2003; Wang and Zhao, 2003; Demidenko, 2008; VanderWeele, 2011). However, interaction on the additive scale is more relevant for public health purposes (Rothman et al., 1980; Rothman et al., 2008) and is also more closely related to notions of mechanistic synergism within the sufficient cause framework (Rothman, 1976; VanderWeele and Robins, 2007, 2008; Rothman et al., 2008). Arguably, the reason interaction is most frequently assessed on the multiplicative scale is that this is what is most easily computed from the output of standard logistic regression software. In addition, in the context of case-control studies, odds ratios can be estimated but risk differences cannot, unless additional information concerning e.g. the prevalence of the outcome or exposures in the underlying population is available (Rothman et al., 2008). This again makes the multiplicative scale the default for assessing interaction. That power and sample size calculations are better developed for multiplicative interaction than for additive interaction perhaps further encourages the use of the multiplicative scale for interaction assessment. However, measures of additive interaction based on risk ratios or odds ratios using the relative excess risk due to interaction (“RERI”; Rothman, 1986) can easily be calculated from logistic regression with either cohort or case-control data (Hosmer and Lemeshow, 1992), and in this paper we will derive power and sample size formulae for interaction on the additive scale. Power and sample size calculations for additive interaction were discussed in Greenland (1983, 1985) but no closed-form expressions were provided.

In this paper, we will consider measures of additive interaction based on absolute risks and also on the relative excess risk due to interaction for both cohort and case-control data, and we will provide closed-form analytic expressions for power and sample size in each of these cases. Analytically, we will for the most part follow the development of Demidenko (2008), who considered multiplicative interaction, and adapt that approach to the additive scale. We will see that when main effects of both exposures are positive, power to detect positive interaction on the additive scale will be greater than that on the multiplicative scale, providing yet another reason, beyond public health relevance and relation to mechanistic synergism, for using the additive scale to assess interaction. The reader who is interested in only the application of the power and sample size formulae derived in this paper is referred to Appendix 2 at the end of the paper on epidemiologic practice. This appendix gives instructions on using Excel spreadsheets (included as an online supplement to this paper) to automatically carry out power and sample size calculations for additive and multiplicative interaction for cohort, case-control and case-only data.

Notation and Definitions

We will suppose we have a binary outcome Y and two binary exposures G and E. Although G and E might represent genetic and environmental exposures, respectively, nothing in the development will require this. They might be two environmental exposures, or two genetic exposures, or behavioral exposures, etc. Let p_ge = P(Y = 1|G = g, E = e) and let π_ge = P(G = g, E = e). The measure of interaction on the additive scale using risks is then

p_{11} - p_{10} - p_{01} + p_{00} .

This can be re-expressed as (p₁₁ − p₀₀) − {(p₁₀ − p₀₀) + (p₀₁ − p₀₀)} and measures the extent to which the effect of both exposures combined exceeds (or is less than) the sum of the effects of each exposure considered separately. If p₁₁ − p₁₀ − p₀₁ + p₀₀ > 0, the interaction is said to be positive or “superadditive”. If p₁₁ − p₁₀ − p₀₁ + p₀₀ < 0, the interaction is said to be negative or “subadditive”. If p₁₁ − p₁₀ − p₀₁ + p₀₀ = 0, there is said to be no interaction on the additive scale. This measure of additive interaction corresponds to the coefficient of the product term for the two exposures in a linear risk model for the outcome.

In many studies, analyses are presented using risk ratios or odds ratios rather than absolute risks. Define the risk ratio as ${R R}_{g e} = \frac{P (Y = 1 ∣ G = g, E = e)}{P (Y = 1 ∣ G = 0, E = 0)} = \frac{p_{g e}}{p_{00}}$ and the odds ratio as ${O R}_{g e} = \frac{P (Y = 1 ∣ G = g, E = e) / P (Y = 0 ∣ G = g, E = e)}{P (Y = 1 ∣ G = 0, E = 0) / P (Y = 0 ∣ G = 0, E = 0)} = \frac{p_{g e} / (1 - p_{g e})}{p_{00} / (1 - p_{00})}$ . The measure of multiplicative interaction used on the risk ratio or odds ratio scale is then generally taken as $I_{R R} = \frac{{R R}_{11}}{{R R}_{10} {R R}_{01}}$ or $I_{O R} = \frac{{O R}_{11}}{{O R}_{10} {O R}_{01}}$ respectively. These measures of multiplicative interaction correspond to the exponentiated coefficients of the product term for the two exposures in log-linear and logistic regression models for the outcome respectively.

Suppose now we were to divide our measure of additive interaction based on risks, p₁₁ − p₁₀ − p₀₁ + p₀₀, by the baseline risk p₀₀. We would then obtain what is sometimes referred to as the relative excess risk due to interaction or RERI (Rothman, 1986):

RERI = {R R}_{11} - {R R}_{10} - {R R}_{01} + 1.

This measure RERI will be greater than 0 (or respectively less than 0) if and only if the measure of additive interaction using absolute risks, p₁₁ − p₁₀ − p₀₁ + p₀₀, is greater than 0 (or less than 0 respectively). The relative excess risk due to interaction can thus be used to assess additive interaction using data on relative risks. When the outcome is rare in all exposure strata, odds ratios will approximate risk ratios, i.e. $\frac{p_{g e} / (1 - p_{g e})}{p_{00} / (1 - p_{00})} \approx \frac{p_{g e}}{p_{00}}$, and thus we can approximate RERI by

{RERI}_{O R} = {O R}_{11} - {O R}_{10} - {O R}_{01} + 1 \approx RERI .

This final measure, RERI_OR = OR₁₁ − OR₁₀ − OR₀₁ + 1, is advantageous because it is an approximate measure of additive interaction and yet can also be obtained directly from logistic regression analyses and from case-control data. We will, however, first begin with additive interaction on the absolute risk scale using cohort data.
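The measures defined in this section can be computed side by side from the four stratum-specific risks. The following is a minimal sketch; the function name `interaction_measures` and the input risks are illustrative assumptions, not part of the paper or its supplement.

```python
def interaction_measures(p00, p10, p01, p11):
    """Interaction measures from the four risks p_ge = P(Y=1 | G=g, E=e)."""
    def odds(p):
        return p / (1.0 - p)

    # Risk ratios and odds ratios relative to the doubly unexposed group.
    rr10, rr01, rr11 = p10 / p00, p01 / p00, p11 / p00
    or10, or01, or11 = (odds(p) / odds(p00) for p in (p10, p01, p11))

    return {
        "additive": p11 - p10 - p01 + p00,      # risk-difference scale
        "RERI": rr11 - rr10 - rr01 + 1.0,       # additive measure divided by p00
        "RERI_OR": or11 - or10 - or01 + 1.0,    # approximates RERI if outcome is rare
        "I_RR": rr11 / (rr10 * rr01),           # multiplicative, risk ratio scale
        "I_OR": or11 / (or10 * or01),           # multiplicative, odds ratio scale
    }


# Hypothetical risks chosen only for illustration.
m = interaction_measures(p00=0.02, p10=0.03, p01=0.03, p11=0.06)
for name, value in m.items():
    print(f"{name}: {value:.4f}")
```

With risks this small, `RERI_OR` and `RERI` should be close, which is the rare-outcome approximation described above.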

Additive Interaction in Cohort Studies Using A Linear Risk Model

Suppose data were available from a cohort study and we were to use a linear risk model to measure additive interaction:

P (Y = 1 ∣ G = g, E = e) = θ_{0} + θ_{1} g + θ_{2} e + θ_{3} g e .

(1)

In this model θ₃ = p₁₁ − p₁₀ − p₀₁ + p₀₀ is our measure of additive interaction. Suppose we plan to fit this model to the cohort data using maximum likelihood and use a Wald test for the null hypothesis θ₃ = 0. Once we have fit the model and obtained an estimate θ̂₃ of θ₃ from the data, the Wald test statistic for the null hypothesis θ₃ = 0 is given by θ̂₃/√V̂ where V̂ is the estimated variance of θ̂₃. We would reject the null at significance level α if |θ̂₃/√V̂| > Z_{1−α/2} where Z_{1−α/2} is the (1 − α/2)th quantile of the standard normal distribution. Suppose we wish to calculate the sample size required to reject the null hypothesis with significance level α and power β if the magnitude of the interaction were θ₃ = η.

By standard sample size arguments, the sample size required to detect an additive interaction of magnitude θ₃ = η with significance level α and power β is

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V}{η^{2}}

where Z_{1−α/2} and Z_β are the (1 − α/2)th and βth quantiles respectively of the standard normal distribution and where V is the variance of θ̂₃ under the alternative that θ₃ = η. The difficulty lies in calculating the variance V. In Appendix 1, we show that the variance V is given by

V = \frac{1}{L^{'}} + \frac{1}{F^{'}} + \frac{1}{J^{'}} + \frac{1}{R^{'}}

where

\begin{array}{l} L^{'} = \frac{1}{θ_{0} (1 - θ_{0})} π_{00} \\ F^{'} = \frac{1}{(θ_{0} + θ_{1}) \{1 - (θ_{0} + θ_{1})\}} π_{10} \\ J^{'} = \frac{1}{(θ_{0} + θ_{2}) \{1 - (θ_{0} + θ_{2})\}} π_{01} \\ R^{'} = \frac{1}{(θ_{0} + θ_{1} + θ_{2} + θ_{3}) \{1 - (θ_{0} + θ_{1} + θ_{2} + θ_{3})\}} π_{11} . \end{array}

Thus to calculate the sample size we would need to specify (i) the significance level α, the power β, and the magnitude of additive interaction θ₃ = η; (ii) the proportion of subjects in each exposure stratum, π₀₀, π₁₀, π₀₁, π₁₁; and (iii) the main effect of the two exposures on the additive scale θ₁ and θ₂ and the baseline risk of the doubly unexposed group θ₀ = P(Y = 1|G = 0, E = 0).
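As a rough illustration of how these inputs combine, the sketch below evaluates V and the sample size formula for the linear risk model. It assumes only the Python standard library; the function names and the numerical inputs are hypothetical choices for illustration and this is not the Excel implementation of the online supplement.

```python
from math import ceil
from statistics import NormalDist


def linear_risk_variance(theta0, theta1, theta2, theta3, props):
    """V = 1/L' + 1/F' + 1/J' + 1/R' for the linear risk model.

    props = (pi00, pi10, pi01, pi11) are the joint exposure proportions.
    Each term equals p(1 - p) / pi for the corresponding exposure stratum.
    """
    risks = (theta0,
             theta0 + theta1,
             theta0 + theta2,
             theta0 + theta1 + theta2 + theta3)
    return sum(p * (1.0 - p) / pi for p, pi in zip(risks, props))


def sample_size(eta, variance, alpha=0.05, power=0.80):
    """n = (Z_{1-alpha/2} + Z_beta)^2 V / eta^2."""
    z = NormalDist()
    return (z.inv_cdf(1.0 - alpha / 2.0) + z.inv_cdf(power)) ** 2 * variance / eta ** 2


# Illustrative inputs (assumed values, not taken from any data set).
V = linear_risk_variance(0.02, 0.01, 0.01, 0.02, props=(0.35, 0.35, 0.15, 0.15))
n = sample_size(eta=0.02, variance=V, alpha=0.05, power=0.80)
print(f"V = {V:.4f}, required n = {ceil(n)}")
```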

Instead of specifying the proportion of subjects in each joint exposure stratum π₀₀, π₁₀, π₀₁, π₁₁, we could instead specify the marginal probability of each exposure π_g = P(G = 1) and π_e = P(E = 1) along with the odds ratio relating G and E, Δ = {P(G = 1|E = 1)/P(G = 0|E = 1)}/{P(G = 1|E = 0)/P(G = 0|E = 0)}. The probabilities π₀₀, π₁₀, π₀₁, π₁₁ are then given by (Demidenko, 2008):

\begin{array}{l} π_{00} = \frac{1 - π_{e}}{1 + C} \\ π_{10} = \frac{(1 - π_{e}) C}{1 + C} \\ π_{01} = \frac{π_{e}}{1 + C Δ} \\ π_{11} = \frac{C Δ π_{e}}{1 + C Δ} \end{array}

(2)

where

C = \frac{q + \sqrt{q^{2} + 4 π_{g} (1 - π_{g}) Δ}}{2 (1 - π_{g}) Δ}

and where q = π_g(1 + Δ) + π_e(1 − Δ) − 1.

If G and E are independent then Δ = 1, so that q = 2π_g − 1 and C simplifies to C = π_g/(1 − π_g).
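The formulae in (2) translate directly into code. The sketch below is one possible transcription; the function name `joint_exposure_probs` is an assumption made here for illustration. The checks at the end confirm that the resulting joint distribution reproduces the requested marginal prevalences and exposure odds ratio.

```python
from math import sqrt


def joint_exposure_probs(pi_g, pi_e, delta=1.0):
    """Joint proportions (pi00, pi10, pi01, pi11) from the marginals and odds ratio.

    pi_g = P(G = 1), pi_e = P(E = 1), delta = odds ratio relating G and E.
    """
    q = pi_g * (1.0 + delta) + pi_e * (1.0 - delta) - 1.0
    c = (q + sqrt(q * q + 4.0 * pi_g * (1.0 - pi_g) * delta)) / (2.0 * (1.0 - pi_g) * delta)
    pi00 = (1.0 - pi_e) / (1.0 + c)
    pi10 = (1.0 - pi_e) * c / (1.0 + c)
    pi01 = pi_e / (1.0 + c * delta)
    pi11 = c * delta * pi_e / (1.0 + c * delta)
    return pi00, pi10, pi01, pi11


# Independent exposures (delta = 1).
independent = joint_exposure_probs(0.5, 0.3, delta=1.0)
print([round(p, 4) for p in independent])

# Associated exposures: the marginals and the odds ratio should be recovered.
pi00, pi10, pi01, pi11 = joint_exposure_probs(0.4, 0.25, delta=2.0)
print(round(pi10 + pi11, 6), round(pi01 + pi11, 6), round(pi11 * pi00 / (pi10 * pi01), 6))
```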

If instead of calculating the required sample size for a fixed power β, we wanted to calculate the power for a given sample size using the Wald test for the null hypothesis θ₃ = 0 based on model (1) we could proceed as follows. For a fixed sample size n the power to reject the null θ₃ = 0 at significance level α under the alternative that θ₃ = η is given by

Power = Φ \{- Z_{1 - α / 2} + η \sqrt{n / V}\}

where Φ is the cumulative distribution function of a standard normal random variable and where V can be calculated as above. In Appendix 2 we describe how to use a simple Excel spreadsheet (included with this paper as an online supplement) to carry out such sample size and power calculations automatically. The online supplement also provides Excel spreadsheets for the sample size and power calculations for additive interaction using relative excess risk due to interaction from logistic regression with cohort or case-control data described in the following sections. The use of these Excel spreadsheets is described in detail in Appendix 2. Finally, it should be noted that if the null hypothesis were rejected for extreme values of θ̂₃ on either side of zero (two-sided test) then the relevant power formula would be:

Power = Φ \{- Z_{1 - α / 2} + η \sqrt{n / V}\} + Φ \{- Z_{1 - α / 2} - η \sqrt{n / V}\} .

Before moving on, we give a brief example of the use of these formulae for additive interaction.

Example 1

Suppose we wish to calculate the power of a test at significance level α = 0.05, with n = 4000, with the prevalence of the genetic and environmental factors being π_g = 0.5 and π_e = 0.3 respectively and assuming these are independent so that Δ = 1, with the probability of the outcome in the reference category of θ₀ = P(Y = 1|G = 0, E = 0) = 0.02, with main effects on the risk difference scale of θ₁ = 0.01 and θ₂ = 0.01 and with additive interaction θ₃ = 0.02. We can use the equations in (2) to calculate π₀₀ = 0.35, π₁₀ = 0.35, π₀₁ = 0.15, π₁₁ = 0.15 and from this we can calculate L′, F′, J′, R′ and the variance V and the power $Power = Φ \{- Z_{1 - α / 2} + η \sqrt{n / V}\}$ to obtain 0.32.
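Example 1 can be checked numerically. The sketch below is a minimal standard-library calculation, not the spreadsheet from the online supplement; variable names are illustrative, and the normal cumulative distribution function is taken from `statistics.NormalDist`.

```python
from statistics import NormalDist

alpha, n = 0.05, 4000
theta0, theta1, theta2, eta = 0.02, 0.01, 0.01, 0.02
props = (0.35, 0.35, 0.15, 0.15)  # pi00, pi10, pi01, pi11 from Example 1

# Stratum risks under the linear risk model (1).
risks = (theta0,
         theta0 + theta1,
         theta0 + theta2,
         theta0 + theta1 + theta2 + eta)

# V = 1/L' + 1/F' + 1/J' + 1/R', each term being p(1 - p) / pi.
V = sum(p * (1.0 - p) / pi for p, pi in zip(risks, props))

z = NormalDist()
z_crit = z.inv_cdf(1.0 - alpha / 2.0)
shift = eta * (n / V) ** 0.5

power = z.cdf(-z_crit + shift)                    # rejection for positive interaction
power_two_sided = power + z.cdf(-z_crit - shift)  # rejection on either side of zero

print(round(power, 2))  # 0.32, matching the value reported in Example 1
```

In this setting the second term of the two-sided formula is very small, so the two power values are nearly identical.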

Additive Interaction in Cohort Studies Using Logistic Regression and RERI

In this section we consider power and sample size calculations for measures of interaction based on RERI_OR obtained from logistic regression using cohort data. We will first review the power and sample size calculations for multiplicative interaction from logistic regression using cohort data given by Demidenko (2008) since the variance calculation of Demidenko will underlie those given here for additive interaction using the relative excess risk due to interaction.

Suppose we fit a logistic regression model to cohort data:

\operatorname{logit} \{P (Y = 1 ∣ G = g, E = e)\} = γ_{0} + γ_{1} g + γ_{2} e + γ_{3} g e .

(3)

The coefficient γ₃ is generally referred to as a measure of interaction on the multiplicative scale. The exponentiated coefficient is equal to the odds ratio multiplicative interaction ratio $e^{γ_{3}} = I_{O R} = \frac{{O R}_{11}}{{O R}_{10} {O R}_{01}}$ . Suppose we wish to use a Wald test for the null hypothesis γ₃ = 0. The sample size required to detect a multiplicative interaction of magnitude γ₃ = η with significance level α and power β is

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{mult (O R)}}{η^{2}}

where Z_{1−α/2} and Z_β are the (1 − α/2)th and βth quantiles respectively of the standard normal distribution and where V_mult(OR) is the variance of γ̂₃ under the alternative that γ₃ = η. Demidenko (2008) derives the variance matrix for the maximum likelihood estimator of (γ₀, γ₁, γ₂, γ₃), given in Appendix 1, and specifically shows that

V_{mult (O R)} = \frac{1}{L} + \frac{1}{F} + \frac{1}{J} + \frac{1}{R}

where

\begin{array}{l} L = \frac{e^{γ_{0}}}{{(1 + e^{γ_{0}})}^{2}} π_{00} \\ F = \frac{e^{γ_{0} + γ_{1}}}{{(1 + e^{γ_{0} + γ_{1}})}^{2}} π_{10} \\ J = \frac{e^{γ_{0} + γ_{2}}}{{(1 + e^{γ_{0} + γ_{2}})}^{2}} π_{01} \\ R = \frac{e^{γ_{0} + γ_{1} + γ_{2} + γ_{3}}}{{(1 + e^{γ_{0} + γ_{1} + γ_{2} + γ_{3}})}^{2}} π_{11} . \end{array}

(4)

Once again, to calculate the sample size we would need to specify (i) the significance level α, the power β, and the magnitude of multiplicative interaction γ₃ = η; (ii) the proportion of subjects in each exposure stratum, π₀₀, π₁₀, π₀₁, π₁₁; and (iii) the main effect odds ratios of the two exposures on the logistic scale, γ₁ and γ₂, and the log odds of the baseline risk of the doubly unexposed group γ₀ = log{P(Y = 1|G = 0, E = 0)/P(Y = 0|G = 0, E = 0)}. As before, if instead of specifying the joint probabilities π₀₀, π₁₀, π₀₁, π₁₁, we specified the marginal probabilities of each exposure π_g = P(G = 1) and π_e = P(E = 1) and the odds ratio relating G and E, Δ = {P(G = 1|E = 1)/P(G = 0|E = 1)}/{P(G = 1|E = 0)/P(G = 0|E = 0)}, then we could obtain π₀₀, π₁₀, π₀₁, π₁₁ using the formulae in (2). Likewise, if instead of calculating the required sample size for a given power, we wanted to calculate the power for a given sample size we could use $Power = Φ \{- Z_{1 - α / 2} + η \sqrt{n / V_{mult (O R)}}\}$ .¹

Example 2

Suppose we wish to calculate the power of a test at significance level α = 0.05, with n = 5000, with the joint prevalence of the genetic and environmental factors being π₀₀ = 0.35, π₁₀ = 0.20, π₀₁ = 0.20, π₁₁ = 0.25 respectively, with the probability of the outcome in the reference category of P(Y = 1|G = 0, E = 0) = 0.015, with main effects on the odds ratio scale of e^{γ₁} = 1.3 and e^{γ₂} = 1.4 and with odds ratio multiplicative interaction e^{γ₃} = 1.6, so that η = γ₃ = log(1.6). We can calculate L, F, J, R from these values and the variance V_mult(OR) to obtain $Power = Φ \{- Z_{1 - α / 2} + η \sqrt{n / V_{mult (O R)}}\} = 0.216$ .
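The calculation in Example 2 can be sketched in a few lines. This is an illustrative standard-library transcription of equation (4) and the power formula, with hypothetical function and variable names; it is not the supplement's spreadsheet.

```python
from math import exp, log, sqrt
from statistics import NormalDist


def logistic_cell_terms(g0, g1, g2, g3, props):
    """L, F, J, R from equation (4): expit(lp) * (1 - expit(lp)) * pi per stratum."""
    linear_predictors = (g0, g0 + g1, g0 + g2, g0 + g1 + g2 + g3)
    return tuple(exp(lp) / (1.0 + exp(lp)) ** 2 * pi
                 for lp, pi in zip(linear_predictors, props))


alpha, n = 0.05, 5000
props = (0.35, 0.20, 0.20, 0.25)       # pi00, pi10, pi01, pi11
g0 = log(0.015 / (1.0 - 0.015))        # log odds of the baseline risk
g1, g2, g3 = log(1.3), log(1.4), log(1.6)

L, F, J, R = logistic_cell_terms(g0, g1, g2, g3, props)
V_mult = 1.0 / L + 1.0 / F + 1.0 / J + 1.0 / R

z = NormalDist()
power = z.cdf(-z.inv_cdf(1.0 - alpha / 2.0) + g3 * sqrt(n / V_mult))
print(round(power, 3))  # 0.216, matching the value reported in Example 2
```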

We will now use the variance matrix calculations of Demidenko (2008) to derive sample size and power formulae for the relative excess risk due to interaction (RERI). The RERI from logistic regression model (3) is given by:

{RERI}_{O R} = e^{γ_{1} + γ_{2} + γ_{3}} - e^{γ_{1}} - e^{γ_{2}} + 1.

Suppose we wish to use a Wald test for the null hypothesis RERI_OR = 0. The sample size required to detect a RERI_OR of magnitude η = e^{γ₁+γ₂+γ₃} − e^{γ₁} − e^{γ₂} + 1 with significance level α and power β is

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{RERI (O R)}}{η^{2}}

where Z_{1−α/2} and Z_β are the (1 − α/2)th and βth quantiles respectively of the standard normal distribution and where V_RERI(OR) is the variance of the estimator of RERI_OR, e^{γ̂₁+γ̂₂+γ̂₃} − e^{γ̂₁} − e^{γ̂₂} + 1, under the alternative. Using the delta method, we show in Appendix 1 that this variance is given by:

V_{RERI (O R)} = (\frac{1}{L} + \frac{1}{R}) e^{2 (γ_{1} + γ_{2} + γ_{3})} - \frac{2}{L} e^{2 γ_{1} + γ_{2} + γ_{3}} - \frac{2}{L} e^{γ_{1} + 2 γ_{2} + γ_{3}} + (\frac{1}{L} + \frac{1}{F}) e^{2 γ_{1}} + (\frac{1}{L} + \frac{1}{J}) e^{2 γ_{2}} + \frac{2}{L} e^{γ_{1} + γ_{2}}

where L, F, J, R are given as in equation (4) above.
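The closed-form expression for V_RERI(OR) is easy to mistype, so a numerical cross-check can be useful. The sketch below transcribes the formula and compares it with an equivalent cell-by-cell form. That second form is a derivation made here for checking purposes, under the assumption that the four stratum log-odds estimates are independent with variances 1/L, 1/F, 1/J, 1/R; it is not taken from the paper. All names and numerical inputs are hypothetical.

```python
from math import exp, log


def reri_or_variance(g1, g2, g3, L, F, J, R):
    """Closed-form delta-method variance of the estimated RERI_OR."""
    return ((1 / L + 1 / R) * exp(2 * (g1 + g2 + g3))
            - (2 / L) * exp(2 * g1 + g2 + g3)
            - (2 / L) * exp(g1 + 2 * g2 + g3)
            + (1 / L + 1 / F) * exp(2 * g1)
            + (1 / L + 1 / J) * exp(2 * g2)
            + (2 / L) * exp(g1 + g2))


def reri_or_variance_cellwise(g1, g2, g3, L, F, J, R):
    """Same quantity written as squared gradient times variance, summed over strata.

    Assumes independent stratum log odds with variances 1/L, 1/F, 1/J, 1/R
    (an assumption of this sketch, used only as a cross-check).
    """
    or10, or01, or11 = exp(g1), exp(g2), exp(g1 + g2 + g3)
    return ((or11 - or10 - or01) ** 2 / L   # reference stratum enters every odds ratio
            + or10 ** 2 / F
            + or01 ** 2 / J
            + or11 ** 2 / R)


# Hypothetical inputs chosen only to exercise both expressions.
args = (log(1.2), log(1.5), log(1.4), 0.006, 0.004, 0.005, 0.009)
closed = reri_or_variance(*args)
cellwise = reri_or_variance_cellwise(*args)
print(closed, cellwise)
```

The two expressions agree algebraically once the square in the first cell-wise term is expanded, which is where the cross terms with coefficient 2/L in the closed form come from.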

To calculate the sample size to reject the null of no additive interaction using RERI_OR, we would need to specify (i) the significance level α, the power β; (ii) the proportion of subjects in each exposure stratum, π₀₀, π₁₀, π₀₁, π₁₁; and (iii) the main effect odds ratios of the two exposures on the logistic scale, γ₁ and γ₂, the log odds of the baseline risk of the doubly unexposed group γ₀ = log{P(Y = 1|G = 0, E = 0)/P(Y = 0|G = 0, E = 0)}, and the magnitude of the interaction on the multiplicative scale γ₃. Instead of specifying the magnitude of the interaction on the multiplicative scale, γ₃, one could specify the magnitude of RERI_OR under the alternative RERI_OR = η and then back-calculate the magnitude of γ₃ by solving η = e^{γ₁+γ₂+γ₃} − e^{γ₁} − e^{γ₂} + 1, which gives γ₃ = log(η + e^{γ₁} + e^{γ₂} − 1) − γ₁ − γ₂.
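Back-calculating γ₃ from a target RERI_OR amounts to solving η = e^{γ₁+γ₂+γ₃} − e^{γ₁} − e^{γ₂} + 1 for γ₃. A small sketch follows; the function name `gamma3_from_reri` is hypothetical, and the guard against a non-positive logarithm argument is an addition made here.

```python
from math import exp, log


def gamma3_from_reri(eta, g1, g2):
    """Solve eta = exp(g1 + g2 + g3) - exp(g1) - exp(g2) + 1 for g3."""
    or11 = eta + exp(g1) + exp(g2) - 1.0  # implied joint odds ratio OR_11
    if or11 <= 0.0:
        raise ValueError("eta is too negative: the implied OR_11 is not positive")
    return log(or11) - g1 - g2


# Round trip with main effect odds ratios 1.3 and 1.4 and interaction ratio 1.6.
g1, g2 = log(1.3), log(1.4)
eta = 1.3 * 1.4 * 1.6 - 1.3 - 1.4 + 1.0
g3 = gamma3_from_reri(eta, g1, g2)
print(round(exp(g3), 6))
```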

As before, if instead of specifying the joint probabilities π₀₀, π₁₀, π₀₁, π₁₁, we specified the marginal probabilities of each exposure π_g = P(G = 1) and π_e = P(E = 1) and the odds ratio relating G and E, Δ = {P(G = 1|E = 1)/P(G = 0|E = 1)}/{P(G = 1|E = 0)/P(G = 0|E = 0)}, then we could obtain π₀₀, π₁₀, π₀₁, π₁₁ using the formulae in (2). Likewise, if instead of calculating the required sample size for a given power, we wanted to calculate the power for a given sample size we could use $Power = Φ \{- Z_{1 - α / 2} + η \sqrt{n / V_{RERI (O R)}}\}$ .

Example 3

Suppose again we wish to calculate the power of a test at significance level α = 0.05, with n = 5000, with the joint prevalence of the genetic and environmental factors being π₀₀ = 0.35, π₁₀ = 0.20, π₀₁ = 0.20, π₁₁ = 0.25 respectively, with the probability of the outcome in the reference category of P(Y = 1|G = 0, E = 0) = 0.015, with main effects on the odds ratio scale of e^{γ₁} = 1.3 and e^{γ₂} = 1.4 and with odds ratio multiplicative interaction e^{γ₃} = 1.6 as in Example 2, but that we now wish to calculate the power for testing RERI_OR > 0. Here the true RERI_OR is η = e^{γ₁+γ₂+γ₃} − e^{γ₁} − e^{γ₂} + 1 = (1.3)(1.4)(1.6) − (1.3) − (1.4) + 1 = 1.212 > 0. From L, F, J, R we can calculate the variance V_RERI(OR) to obtain $Power = Φ \{- Z_{1 - α / 2} + η \sqrt{n / V_{RERI (O R)}}\} = 0.482$ . In this example, the power to detect additive interaction, 0.482, is greater than that to detect multiplicative interaction, 0.216.

The reader is reminded that the tests for additive interaction using RERI_OR hold only approximately to the extent that the outcome is rare so that RERI_OR approximates RERI on the risk ratio scale. In Appendix 1 we also derive sample size and power formulae for the multiplicative interaction from a log-linear model and for additive interaction using RERI estimated from a log-linear model. However, if the measure of additive interaction is fit with cohort data, it may be preferable to fit model (1) directly for additive interaction using absolute risks rather than employing RERI.

Additive Interaction in Case-Control Studies Using Logistic Regression and RERI

Suppose instead we fit a logistic regression model to case-control data:

\operatorname{logit} \{P (Y = 1 ∣ G = g, E = e)\} = γ_{0} + γ_{1} g + γ_{2} e + γ_{3} g e .

The sample size required to detect a RERI_OR of magnitude η = e^{γ₁+γ₂+γ₃} − e^{γ₁} − e^{γ₂} + 1 with significance level α and power β is

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{RERI (O R)}^{*}}{η^{2}}

where

V_{RERI (O R)}^{*} = (\frac{1}{L^{*}} + \frac{1}{R^{*}}) e^{2 (γ_{1} + γ_{2} + γ_{3})} - \frac{2}{L^{*}} e^{2 γ_{1} + γ_{2} + γ_{3}} - \frac{2}{L^{*}} e^{γ_{1} + 2 γ_{2} + γ_{3}} + (\frac{1}{L^{*}} + \frac{1}{F^{*}}) e^{2 γ_{1}} + (\frac{1}{L^{*}} + \frac{1}{J^{*}}) e^{2 γ_{2}} + \frac{2}{L^{*}} e^{γ_{1} + γ_{2}}

with

\begin{array}{l} L^{*} = \frac{e^{γ_{0}}}{{(1 + e^{γ_{0}})}^{2}} π_{00}^{*} \\ F^{*} = \frac{e^{γ_{0} + γ_{1}}}{{(1 + e^{γ_{0} + γ_{1}})}^{2}} π_{10}^{*} \\ J^{*} = \frac{e^{γ_{0} + γ_{2}}}{{(1 + e^{γ_{0} + γ_{2}})}^{2}} π_{01}^{*} \\ R^{*} = \frac{e^{γ_{0} + γ_{1} + γ_{2} + γ_{3}}}{{(1 + e^{γ_{0} + γ_{1} + γ_{2} + γ_{3}})}^{2}} π_{11}^{*} . \end{array}

and where $π_{00}^{*}, π_{10}^{*}, π_{01}^{*}, π_{11}^{*}$ are now the proportions of subjects in each joint exposure stratum in the case-control sample.

If we know the overall outcome prevalence in the underlying population, P(Y = 1), we could also obtain the proportions $π_{00}^{*}, π_{10}^{*}, π_{01}^{*}, π_{11}^{*}$ from the proportions of subjects in each joint exposure stratum in the underlying population, π₀₀, π₁₀, π₀₁, π₁₁, though doing so requires solving a non-linear equation numerically (Demidenko, 2008). Alternatively, if the outcome is rare we can obtain $π_{00}^{*}, π_{10}^{*}, π_{01}^{*}, π_{11}^{*}$ from π₀₀, π₁₀, π₀₁, π₁₁ approximately using the following formulas (see Appendix 1 for proof):

\begin{array}{l} π_{00}^{*} \approx π_{00} P^{*} (Y = 0) + \frac{π_{00}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \\ π_{10}^{*} \approx π_{10} P^{*} (Y = 0) + \frac{e^{γ_{1}} π_{10}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \\ π_{01}^{*} \approx π_{01} P^{*} (Y = 0) + \frac{e^{γ_{2}} π_{01}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \\ π_{11}^{*} \approx π_{11} P^{*} (Y = 0) + \frac{e^{γ_{1} + γ_{2} + γ_{3}} π_{11}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \end{array}

where P^*(Y = 0) is the proportion of controls in the case-control sample and P^*(Y = 1) is the proportion of cases in the case-control sample. If we instead specify the marginal probabilities of each exposure π_g = P(G = 1) and π_e = P(E = 1) and the odds ratio, Δ, relating G and E, in the underlying population then we can calculate π₀₀, π₁₀, π₀₁, π₁₁ using the formulae in (2).
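The rare-outcome approximation for the case-control exposure proportions can be sketched as below. Under that approximation the controls carry the population exposure distribution, while the cases carry the population distribution reweighted by the stratum odds ratios. The function name and the inputs are illustrative assumptions.

```python
from math import exp, log


def case_control_props(props, g1, g2, g3, case_fraction):
    """Approximate (pi00*, pi10*, pi01*, pi11*) in a case-control sample.

    props         : population proportions (pi00, pi10, pi01, pi11)
    case_fraction : P*(Y = 1), the proportion of cases in the sample
    """
    pi00, pi10, pi01, pi11 = props
    # Exposure distribution among cases is proportional to pi_ge * OR_ge.
    weights = (pi00, pi10 * exp(g1), pi01 * exp(g2), pi11 * exp(g1 + g2 + g3))
    total = sum(weights)
    return tuple(pi * (1.0 - case_fraction) + w / total * case_fraction
                 for pi, w in zip(props, weights))


# Illustrative inputs: independent exposures, equal numbers of cases and controls.
star = case_control_props((0.35, 0.35, 0.15, 0.15),
                          log(1.1), log(1.1), log(1.5), case_fraction=0.5)
print([round(p, 4) for p in star])
```

Because the doubly exposed stratum has the largest odds ratio in this illustration, it is over-represented in the case-control sample relative to the population.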

Thus, to calculate the sample size to reject the null of no additive interaction using RERI_OR from case-control data we would need to specify (i) the significance level α, the power β; (ii) the proportion of subjects in each exposure stratum, $π_{00}^{*}, π_{10}^{*}, π_{01}^{*}, π_{11}^{*}$ in the case-control sample, or alternatively these proportions π₀₀, π₁₀, π₀₁, π₁₁ or the marginal probabilities and marginal odds ratio, π_g, π_e, Δ, in the underlying population along with a rare outcome assumption and the proportion of cases P^*(Y = 1) in the case-control sample, and finally (iii) the main effect odds ratios of the two exposures on the logistic scale, γ₁ and γ₂, the log odds of the baseline probability of the outcome in the doubly unexposed group γ₀ = log{P^*(Y = 1|G = 0, E = 0)/P^*(Y = 0|G = 0, E = 0)} in the case-control sample, and the magnitude of the interaction on the multiplicative scale γ₃ (or instead the magnitude of RERI_OR = η, from which one can back-calculate γ₃ = log(η + e^{γ₁} + e^{γ₂} − 1) − γ₁ − γ₂). Note that if the joint or marginal exposure probabilities are specified separately for the cases and controls then under an assumption of a rare outcome, the distribution of the exposures amongst the controls could be used as an approximation to π₀₀, π₁₀, π₀₁, π₁₁ or π_g, π_e, Δ.

Note also that with case-control data, γ₀ = log{P^*(Y = 1|G = 0, E = 0)/P^*(Y = 0|G = 0, E = 0)} is the log odds of the baseline probability of the outcome in the doubly unexposed group in the case-control sample, i.e. the log of the ratio of the number of cases to controls in the study for the doubly unexposed group. It is shown in the Appendix that under a rare outcome assumption γ₀ can be approximated as γ₀ ≈ logit[1/{1 + (π₀₀ + π₁₀e^{γ₁} + π₀₁e^{γ₂} + π₁₁e^{γ₁+γ₂+γ₃})P^*(Y = 0)/P^*(Y = 1)}].
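The approximation for γ₀ can be evaluated directly. In the sketch below (hypothetical function name and inputs), the logit of the bracketed probability is computed as written; since logit{1/(1 + x)} = −log(x), the result should equal the log of the case-to-control ratio minus the log of the odds-ratio-weighted sum of the population exposure proportions.

```python
from math import exp, log


def gamma0_case_control(props, g1, g2, g3, case_fraction):
    """Rare-outcome approximation to the case-control intercept gamma_0."""
    pi00, pi10, pi01, pi11 = props
    s = pi00 + pi10 * exp(g1) + pi01 * exp(g2) + pi11 * exp(g1 + g2 + g3)
    # Probability of being a case among the doubly unexposed in the sample.
    p = 1.0 / (1.0 + s * (1.0 - case_fraction) / case_fraction)
    return log(p / (1.0 - p))


# Illustrative inputs: equal numbers of cases and controls.
props = (0.35, 0.35, 0.15, 0.15)
g1, g2, g3 = log(1.1), log(1.1), log(1.5)
g0 = gamma0_case_control(props, g1, g2, g3, case_fraction=0.5)
print(round(g0, 4))
```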

Example 4

Suppose we wish to calculate the sample size required for a test at significance level α = 0.05, with power β = 0.80, with the prevalence of the genetic and environmental factors being π_g = 0.5, π_e = 0.3 respectively in the underlying population and with the factors being independent in the underlying population so that Δ = 1. Suppose that the numbers of cases and controls in the study were going to be equal, P^*(Y = 1) = P^*(Y = 0) = 0.5, with main effects on the odds ratio scale of e^{γ₁} = 1.1 and e^{γ₂} = 1.1 and with multiplicative interaction e^{γ₃} = 1.5. We can calculate that the sample size then required to detect positive multiplicative interaction would be n = 3447. We can also calculate that the sample size required to detect positive interaction using RERI_OR would be n = 2212.

It should also be noted that when multiplicative interaction is of interest and the genetic and environmental factors are independent of one another in the underlying population, a “case-only” estimator of multiplicative interaction will have greater power to detect multiplicative interaction as it exploits the independence assumption (Piegorsch et al., 1994; Yang et al., 1999). Power and sample size calculations for case-only estimators have been considered elsewhere (Yang et al., 1999; VanderWeele, 2011). Although these case-only estimators can be quite powerful, they are also fairly sensitive to the assumption that the two exposures are independent in the population and can result in considerable bias if this assumption does not hold (Albert et al., 2001).

A Power Comparison of Additive and Multiplicative Interaction

VanderWeele (2009a) noted that in a log-linear model with non-negative main effects, whenever positive multiplicative interaction is present on the risk ratio scale, positive additive interaction on the risk difference scale will be present as well; the reverse implication does not hold. Here we will explore power to detect such additive or multiplicative interaction and we will consider the odds ratio scale rather than the risk ratio scale. In this power comparison we will assume a case-control study with a rare outcome so that RERI_OR approximates a measure of additive interaction. Table 1 below reports power for a number of scenarios with varying sample sizes, main effect odds ratios and multiplicative interaction parameters on the odds ratio scale $I_{O R} = \frac{{O R}_{11}}{{O R}_{10} {O R}_{01}}$ .

Table 1.

Power to detect additive interaction and multiplicative interaction for various sample sizes, main effects, and interaction parameters (first number in each column is power to detect additive interaction; second number is power for multiplicative interaction)

I_OR	OR₁₀	OR₀₁	n = 500	n = 1000	n = 3000	n = 5000
1.1	1	1	.05, .05	.06, .06	.10, .09	.14, .13
1.1	1.3	1.3	.07, .04	.10, .05	.23, .09	.34, .12
1.1	1.5	1.8	.13, .04	.23, .05	.55, .08	.77, .11
1.3	1	1	.12, .11	.21, .17	.50, .42	.72, .62
1.3	1.3	1.3	.18, .10	.32, .15	.73, .37	.91, .56
1.3	1.5	1.8	.27, .09	.48, .14	.91, .33	.99, .50
1.5	1	1	.25, .19	.44, .34	.88, .77	.98, .93
1.5	1.3	1.3	.32, .17	.56, .30	.95, .70	1.00, .89
1.5	1.5	1.8	.40, .15	.68, .26	.99, .63	1.00, .84
2	1	1	.57, .44	.85, .73	1.00, .99	1.00, 1.00
2	1.3	1.3	.58, .39	.86, .65	1.00, .98	1.00, 1.00
2	1.5	1.8	.59, .34	.87, .59	1.00, .97	1.00, 1.00
3	1	1	.81, .77	.98, .97	1.00, 1.00	1.00, 1.00
3	1.2	1.3	.74, .70	.96, .94	1.00, 1.00	1.00, 1.00
3	1.5	1.8	.68, .62	.93, .89	1.00, 1.00	1.00, 1.00

In these examples it is assumed that the proportions of cases and controls in the case-control sample are equal and that the prevalences of the genetic and environmental factors are each π_g = π_e = 0.5 with the odds ratio relating these factors being Δ = 1.1. Note that in all scenarios considered there is positive interaction on both additive and multiplicative scales. Power for a one-sided test (rejecting only for positive interaction) is reported.

We see that for the scenarios considered here with non-negative main effects and positive interaction, power is greater to detect additive interaction than multiplicative interaction. However, as noted in Greenland (1983), when outcome probabilities are additive or sub-additive, power to detect a (negative) multiplicative interaction will often be greater.

Power and Sample Size Calculations for Sufficient Cause Interactions and Epistatic Interactions

VanderWeele and Robins (2007, 2008) discuss “causal” or “sufficient cause” interactions within the sufficient cause and counterfactual frameworks (Rothman, 1976; Rubin, 1990; Hernán, 2004) which provide a somewhat stronger notion of positive additive interaction. A sufficient cause interaction is present if there are individuals for whom the outcome would occur if both exposures are present but would not occur if just one or the other exposure is present. In counterfactual notation, if we let Y_ge denote the counterfactual outcome (or potential outcome) for each subject if, possibly contrary to fact, G had been set to g and E had been set to e, then a sufficient cause interaction is present if for some individual Y₁₁ = 1 but Y₁₀ = Y₀₁ = 0. VanderWeele and Robins (2007, 2008) showed that if the effect of the two exposures were un-confounded (in that the counterfactual outcomes Y_ge were independent of the actual exposures {G, E}) then

p_{11} - p_{10} - p_{01} > 0

would imply the presence of a sufficient cause interaction. This is a stronger condition than regular positive additive interaction which only requires p₁₁ − p₁₀ − p₀₁ + p₀₀ > 0 because with the condition p₁₁ − p₁₀ − p₀₁ > 0 we are no longer adding back in the outcome probability p₀₀ for the doubly unexposed group. The condition p₁₁ − p₁₀ − p₀₁ > 0 expressed in terms of RERI is equivalent to RERI > 1.

VanderWeele (2010a, b) discussed empirical tests for an even stronger notion of interaction. We might say that there is a “singular” or “epistatic” interaction if there are individuals in the population who will have the outcome if and only if both exposures are present; in counterfactual notation, that is, there are individuals for whom Y₁₁ = 1 but Y₁₀ = Y₀₁ = Y₀₀ = 0. In the genetics literature, when gene-gene interactions are considered, such response patterns are sometimes called instances of “compositional epistasis” (Phillips, 2008; Cordell, 2009) and constitute settings in which the effect of one genetic factor is masked unless the other is present. VanderWeele (2010a, b) noted that if the effects of the two exposures on the outcome were unconfounded then

p_{11} - p_{10} - p_{01} - p_{00} > 0

would imply the presence of such an “epistatic interaction”. Again this is an even stronger notion of interaction in that we are now subtracting p₀₀. The condition p₁₁ − p₁₀ − p₀₁ − p₀₀ > 0 expressed in terms of RERI is equivalent to RERI > 2.

It is relatively straightforward to derive sample size and power formulae for tests for such sufficient cause or epistatic interactions. The sample size for RERI given above could be used but for sufficient cause interaction, to test RERI > 1, one would replace the η in the denominator of the sample size formula by (η − 1); and for epistatic interaction, to test RERI > 2, one would replace the η in the denominator of the formula by (η − 2).

Thus, for cohort data, to detect a sufficient cause interaction (RERI > 1) at significance level α with power β when the true RERI is η = e^{γ₁+γ₂+γ₃} − e^{γ₁} − e^{γ₂} + 1, the required sample size would be

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{RERI}}{{(η - 1)}^{2}}

where V_RERI is the variance of RERI (see Appendix 1). Likewise, the power to detect a sufficient cause interaction for a given sample size is $Power = Φ {- Z_{1 - α / 2} + (η - 1) \sqrt{(n / V_{RERI})}}$ , where Φ denotes the standard normal cumulative distribution function. Similar formulae hold for odds ratios and using case-control data under a rare outcome: once again, one simply replaces η with (η − 1) in all relevant formulae.

Similarly, for cohort data, to detect an epistatic interaction (RERI > 2) at significance level α with power β when the true RERI is η = e^{γ₁+γ₂+γ₃} − e^{γ₁} − e^{γ₂} + 1, the required sample size would be

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{RERI}}{{(η - 2)}^{2}}

The power to detect an epistatic interaction for a given sample size is $Power = Φ {- Z_{1 - α / 2} + (η - 2) \sqrt{(n / V_{RERI})}}$ . Similar formulae hold for odds ratios and using case-control data under a rare outcome: one simply replaces η with (η − 2) in all relevant formulae.
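As an illustration, the sample size and power expressions for tests of RERI > 0, RERI > 1 and RERI > 2 can be sketched in a few lines of Python. This is a minimal sketch rather than the code behind the supplementary spreadsheets: the function names, the `threshold` argument and the numerical inputs are assumptions introduced here, and the power is evaluated with the standard normal cumulative distribution function, which is what solving the sample size formula for β gives.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def reri_sample_size(eta, v_reri, threshold=0.0, alpha=0.05, power=0.80):
    """Sample size for a Wald test of RERI > threshold.

    threshold = 0 (additive interaction), 1 (sufficient cause interaction)
    or 2 (epistatic interaction); v_reri is the variance of the RERI estimator.
    """
    if eta <= threshold:
        raise ValueError("the true RERI (eta) must exceed the tested threshold")
    z_alpha = STD_NORMAL.inv_cdf(1 - alpha / 2)
    z_beta = STD_NORMAL.inv_cdf(power)
    return (z_alpha + z_beta) ** 2 * v_reri / (eta - threshold) ** 2


def reri_power(eta, v_reri, n, threshold=0.0, alpha=0.05):
    """Power of the same test for a given sample size n."""
    z_alpha = STD_NORMAL.inv_cdf(1 - alpha / 2)
    return STD_NORMAL.cdf(-z_alpha + (eta - threshold) * sqrt(n / v_reri))


# Hypothetical inputs, chosen only for illustration.
eta, v_reri = 3.0, 40.0
n_sufficient_cause = reri_sample_size(eta, v_reri, threshold=1.0)
n_epistatic = reri_sample_size(eta, v_reri, threshold=2.0)
```

Because the threshold enters only through (η − threshold)² in the denominator, the stronger the notion of interaction being tested, the larger the required sample size for the same true RERI.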

Finally, it should be noted that if it can be assumed that the effects of both exposures are positive “monotonic” in the sense that the counterfactuals Y_ge are non-decreasing in g and e for all individuals (i.e. the exposures never have protective effects on the outcome for any individual), then the tests p₁₁ − p₁₀ − p₀₁ + p₀₀ > 0 and RERI > 0 can be used to test for sufficient cause interaction (VanderWeele and Robins, 2007, 2008). For epistatic interactions, if the effect of at least one of the exposures is positive monotonic (Y_ge is non-decreasing in at least one of g and e), then p₁₁ − p₁₀ − p₀₁ > 0 suffices for an epistatic interaction and tests for RERI > 1 could be used; if the effects of both exposures are positive monotonic then p₁₁ − p₁₀ − p₀₁ + p₀₀ > 0 suffices and tests for RERI > 0 could be used to test for an epistatic interaction (VanderWeele, 2010a b).

To interpret interaction estimates causally, or to draw conclusions about sufficient cause or epistatic interaction, control must be made for confounding for both exposures. If control for confounding is only made for one of the two exposures the interaction estimates can still often be interpreted as measures of effect heterogeneity (VanderWeele, 2009b; VanderWeele and Knol, 2011), i.e. of how the effect of one exposure varies across strata of the other (without commenting on the effect of the second exposure itself). Sensitivity analysis techniques for interaction and effect modification (VanderWeele and Arah, 2011; VanderWeele et al., 2012) can also be useful in assessing the impact of unmeasured confounding for interaction estimates. To interpret estimates causally, measurement error in interaction analyses should also be taken into account or corrected for (Garcia-Closas et al., 1998; Zhang et al., 2008; VanderWeele, 2012); such measurement error can often lead to bias and effect estimate attenuation, and will often decrease power.

Discussion

In this paper we have derived sample size and power formulae for additive interaction in a variety of scenarios. We have considered additive interaction for absolute risks in cohort data and also the use of the relative excess risk due to interaction from logistic regression using cohort or case-control data. We saw that when the main effects were both positive then the power to detect positive interaction on the additive scale was in general greater than on the multiplicative scale. We have also discussed how the sample size and power calculations for the relative excess risk due to interaction can be easily modified to provide sample size and power calculations for causal interactions corresponding to notions of synergism in the sufficient cause framework and to notions of compositional epistasis in genetics.

As is often the case with analytic formulae for sample size and power calculations, we have not considered the consequences of control for additional covariates. In settings in which these covariates are independent of the exposures (e.g. if the exposures were both randomized) then adjustment for additional covariates should increase the power of tests (Robinson and Jewell, 1991) and in such cases the sample size and power calculations in this paper could be considered conservative estimates.

The sample size and power formulae in this paper provide additional tools for researchers to utilize additive interaction in their analyses. It is hoped that these additional tools will further encourage the use of the additive scale for interaction analysis. Not only is additive interaction more relevant for public health purposes and more closely related to mechanistic interaction in the sufficient cause framework, but as we have seen, power will often be greater to detect additive interaction.

Supplementary Material

Supplement CaseControl: NIHMS632459-supplement-Supllement_CaseControl.xls (33.5 KB, xls)

Supplement Cohort: NIHMS632459-supplement-Supplement_Cohort.xls (58.5 KB, xls)

Appendix 1. Derivations

A.1. Derivations for additive interaction with absolute risk and cohort data

For model (1),

P (Y = 1 ∣ G = g, E = e) = θ_{0} + θ_{1} g + θ_{2} e + θ_{3} g e .

(1)

the likelihood is given by

L (θ_{0}, θ_{1}, θ_{2}, θ_{3}) = \prod_{i = 1}^{n} {(θ_{0} + θ_{1} g_{i} + θ_{2} e_{i} + θ_{3} g_{i} e_{i})}^{y_{i}} {1 - (θ_{0} + θ_{1} g_{i} + θ_{2} e_{i} + θ_{3} g_{i} e_{i})}^{1 - y_{i}}

and the log-likelihood by $l (θ_{0}, θ_{1}, θ_{2}, θ_{3}) = \sum_{i = 1}^{n} [y_{i} log (θ_{0} + θ_{1} g_{i} + θ_{2} e_{i} + θ_{3} g_{i} e_{i}) + (1 - y_{i}) log {1 - (θ_{0} + θ_{1} g_{i} + θ_{2} e_{i} + θ_{3} g_{i} e_{i})}]$ .

The second derivative is given by

\frac{\partial^{2} l (θ_{0}, θ_{1}, θ_{2}, θ_{3})}{\partial {(θ_{0}, θ_{1}, θ_{2}, θ_{3})}^{2}} = \sum_{i = 1}^{n} \frac{- y_{i} + 2 y_{i} Q_{i} - {Q_{i}}^{2}}{{Q_{i}}^{2} {(1 - Q_{i})}^{2}} (\begin{matrix} 1 & g_{i} & e_{i} & g_{i} e_{i} \\ g_{i} & g_{i} & g_{i} e_{i} & g_{i} e_{i} \\ e_{i} & g_{i} e_{i} & e_{i} & g_{i} e_{i} \\ g_{i} e_{i} & g_{i} e_{i} & g_{i} e_{i} & g_{i} e_{i} \end{matrix})

where Q_i = θ₀ + θ₁g_i + θ₂e_i + θ₃g_ie_i. Let Q = θ₀ + θ₁G + θ₂E + θ₃GE. The expected information matrix is then given by

\begin{array}{l} I = E [\frac{Y - 2 Y Q + Q^{2}}{Q^{2} {(1 - Q)}^{2}} (\begin{matrix} 1 & G & E & G E \\ G & G & G E & G E \\ E & G E & E & G E \\ G E & G E & G E & G E \end{matrix})] \\ = E [E [\frac{Y - 2 Y Q + Q^{2}}{Q^{2} {(1 - Q)}^{2}} (\begin{matrix} 1 & G & E & G E \\ G & G & G E & G E \\ E & G E & E & G E \\ G E & G E & G E & G E \end{matrix}) ∣ G, E]] \\ = E [E [\frac{1}{Q (1 - Q)} (\begin{matrix} 1 & G & E & G E \\ G & G & G E & G E \\ E & G E & E & G E \\ G E & G E & G E & G E \end{matrix}) ∣ G, E]] \end{array}

which we may write as

\begin{array}{l} \frac{1}{(θ_{0}) (1 - θ_{0})} M_{1} π_{00} + \frac{1}{(θ_{0} + θ_{1}) {1 - (θ_{0} + θ_{1})}} M_{2} π_{10} \\ + \frac{1}{(θ_{0} + θ_{2}) {1 - (θ_{0} + θ_{2})}} M_{3} π_{01} \\ + \frac{1}{(θ_{0} + θ_{1} + θ_{2} + θ_{3}) {1 - (θ_{0} + θ_{1} + θ_{2} + θ_{3})}} M_{4} π_{11} \end{array}

where

\begin{array}{l} M_{1} = (\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}), M_{2} = (\begin{matrix} 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}) \\ M_{3} = (\begin{matrix} 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}), M_{4} = (\begin{matrix} 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \end{matrix}) \end{array}

If we let $L^{'} = \frac{1}{(θ_{0}) (1 - θ_{0})} π_{00}, F^{'} = \frac{1}{(θ_{0} + θ_{1}) {1 - (θ_{0} + θ_{1})}} π_{10}, J^{'} = \frac{1}{(θ_{0} + θ_{2}) {1 - (θ_{0} + θ_{2})}} π_{01}$ , and $R^{'} = \frac{1}{(θ_{0} + θ_{1} + θ_{2} + θ_{3}) {1 - (θ_{0} + θ_{1} + θ_{2} + θ_{3})}} π_{11}$ we then have

I = (\begin{matrix} L^{'} + F^{'} + J^{'} + R^{'} & F^{'} + R^{'} & J^{'} + R^{'} & R^{'} \\ F^{'} + R^{'} & F^{'} + R^{'} & R^{'} & R^{'} \\ J^{'} + R^{'} & R^{'} & J^{'} + R^{'} & R^{'} \\ R^{'} & R^{'} & R^{'} & R^{'} \end{matrix}) .

The inverse of this matrix is

I^{- 1} = (\begin{matrix} \frac{1}{L^{'}} & - \frac{1}{L^{'}} & - \frac{1}{L^{'}} & \frac{1}{L^{'}} \\ - \frac{1}{L^{'}} & \frac{1}{L^{'}} + \frac{1}{F^{'}} & \frac{1}{L^{'}} & - \frac{1}{L^{'}} - \frac{1}{F^{'}} \\ - \frac{1}{L^{'}} & \frac{1}{L^{'}} & \frac{1}{L^{'}} + \frac{1}{J^{'}} & - \frac{1}{L^{'}} - \frac{1}{J^{'}} \\ \frac{1}{L^{'}} & - \frac{1}{L^{'}} - \frac{1}{F^{'}} & - \frac{1}{L^{'}} - \frac{1}{J^{'}} & \frac{1}{L^{'}} + \frac{1}{F^{'}} + \frac{1}{J^{'}} + \frac{1}{R^{'}} \end{matrix}),

from which it follows $V = \frac{1}{L^{'}} + \frac{1}{F^{'}} + \frac{1}{J^{'}} + \frac{1}{R^{'}}$ .
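The matrix inverse above can be checked numerically. The following sketch builds the information matrix and the stated inverse from the four cell weights, multiplies them, and reads off the variance of the interaction coefficient θ̂₃ as the bottom-right entry. The function names and the values of the risks and joint exposure prevalences are hypothetical, chosen only for illustration.

```python
def cell_weight(risk, prevalence):
    """Cell weight pi / {p (1 - p)} for one joint exposure group."""
    return prevalence / (risk * (1.0 - risk))


def information_matrix(w00, w10, w01, w11):
    """Expected information for (theta0, theta1, theta2, theta3)."""
    return [
        [w00 + w10 + w01 + w11, w10 + w11, w01 + w11, w11],
        [w10 + w11, w10 + w11, w11, w11],
        [w01 + w11, w11, w01 + w11, w11],
        [w11, w11, w11, w11],
    ]


def inverse_information(w00, w10, w01, w11):
    """Closed-form inverse of the information matrix."""
    a, b, c, d = 1 / w00, 1 / w10, 1 / w01, 1 / w11
    return [
        [a, -a, -a, a],
        [-a, a + b, a, -a - b],
        [-a, a, a + c, -a - c],
        [a, -a - b, -a - c, a + b + c + d],
    ]


def matmul(x, y):
    """Product of two square matrices given as nested lists."""
    size = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


# Hypothetical risks p00, p10, p01, p11 and prevalences pi00, pi10, pi01, pi11.
risks = (0.05, 0.10, 0.08, 0.20)
prevalences = (0.4, 0.3, 0.2, 0.1)
weights = [cell_weight(p, pi) for p, pi in zip(risks, prevalences)]

product = matmul(information_matrix(*weights), inverse_information(*weights))
v_additive = inverse_information(*weights)[3][3]  # variance of theta3-hat
```

The variance reduces to the sum over the four cells of p(1 − p)/π, so sparsely populated exposure groups dominate it.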

A.2. Derivations for relative excess risk due to interaction from logistic regression using cohort data

Demidenko (2008) showed that for the logistic regression model (3):

logit {P (Y = 1 ∣ G = g, E = e)} = γ_{0} + γ_{1} g + γ_{2} e + γ_{3} g e .

(3)

the variance-covariance matrix for the maximum likelihood estimate of (γ₀, γ₁, γ₂, γ₃) was given by

(\begin{matrix} \frac{1}{L} & - \frac{1}{L} & - \frac{1}{L} & \frac{1}{L} \\ - \frac{1}{L} & \frac{1}{L} + \frac{1}{F} & \frac{1}{L} & - \frac{1}{L} - \frac{1}{F} \\ - \frac{1}{L} & \frac{1}{L} & \frac{1}{L} + \frac{1}{J} & - \frac{1}{L} - \frac{1}{J} \\ \frac{1}{L} & - \frac{1}{L} - \frac{1}{F} & - \frac{1}{L} - \frac{1}{J} & \frac{1}{L} + \frac{1}{F} + \frac{1}{J} + \frac{1}{R} \end{matrix}),

where

\begin{array}{l} L = \frac{e^{γ_{0}}}{{(1 + e^{γ_{0}})}^{2}} π_{00} \\ F = \frac{e^{γ_{0} + γ_{1}}}{{(1 + e^{γ_{0} + γ_{1}})}^{2}} π_{10} \\ J = \frac{e^{γ_{0} + γ_{2}}}{{(1 + e^{γ_{0} + γ_{2}})}^{2}} π_{01} \\ R = \frac{e^{γ_{0} + γ_{1} + γ_{2} + γ_{3}}}{{(1 + e^{γ_{0} + γ_{1} + γ_{2} + γ_{3}})}^{2}} π_{11} . \end{array}

From the delta method, it follows that the variance of $\hat{RERI} = e^{{\hat{γ}}_{1} + {\hat{γ}}_{2} + {\hat{γ}}_{3}} - e^{{\hat{γ}}_{1}} - e^{{\hat{γ}}_{2}} + 1$ is given by

\begin{array}{l} {(\begin{matrix} 0 \\ e^{γ_{1} + γ_{2} + γ_{3}} - e^{γ_{1}} \\ e^{γ_{1} + γ_{2} + γ_{3}} - e^{γ_{2}} \\ e^{γ_{1} + γ_{2} + γ_{3}} \end{matrix})}^{'} (\begin{matrix} \frac{1}{L} & - \frac{1}{L} & - \frac{1}{L} & \frac{1}{L} \\ - \frac{1}{L} & \frac{1}{L} + \frac{1}{F} & \frac{1}{L} & - \frac{1}{L} - \frac{1}{F} \\ - \frac{1}{L} & \frac{1}{L} & \frac{1}{L} + \frac{1}{J} & - \frac{1}{L} - \frac{1}{J} \\ \frac{1}{L} & - \frac{1}{L} - \frac{1}{F} & - \frac{1}{L} - \frac{1}{J} & \frac{1}{L} + \frac{1}{F} + \frac{1}{J} + \frac{1}{R} \end{matrix}) \times (\begin{matrix} 0 \\ e^{γ_{1} + γ_{2} + γ_{3}} - e^{γ_{1}} \\ e^{γ_{1} + γ_{2} + γ_{3}} - e^{γ_{2}} \\ e^{γ_{1} + γ_{2} + γ_{3}} \end{matrix}) \\ = {(\begin{matrix} 0 \\ e^{γ_{1} + γ_{2} + γ_{3}} - e^{γ_{1}} \\ e^{γ_{1} + γ_{2} + γ_{3}} - e^{γ_{2}} \\ e^{γ_{1} + γ_{2} + γ_{3}} \end{matrix})}^{'} (\begin{matrix} - \frac{1}{L} e^{γ_{1} + γ_{2} + γ_{3}} + \frac{1}{L} (e^{γ_{1}} + e^{γ_{2}}) \\ \frac{1}{L} e^{γ_{1} + γ_{2} + γ_{3}} - (\frac{1}{L} + \frac{1}{F}) e^{γ_{1}} - \frac{1}{L} e^{γ_{2}} \\ \frac{1}{L} e^{γ_{1} + γ_{2} + γ_{3}} - \frac{1}{L} e^{γ_{1}} - (\frac{1}{L} + \frac{1}{J}) e^{γ_{2}} \\ (- \frac{1}{L} + \frac{1}{R}) e^{γ_{1} + γ_{2} + γ_{3}} + (\frac{1}{L} + \frac{1}{F}) e^{γ_{1}} + (\frac{1}{L} + \frac{1}{J}) e^{γ_{2}} \end{matrix}) \\ = e^{γ_{1} + γ_{2} + γ_{3}} {\frac{1}{L} e^{γ_{1} + γ_{2} + γ_{3}} - (\frac{1}{L} + \frac{1}{F}) e^{γ_{1}} - \frac{1}{L} e^{γ_{2}} + \frac{1}{L} e^{γ_{1} + γ_{2} + γ_{3}} - \frac{1}{L} e^{γ_{1}} \\ - (\frac{1}{L} + \frac{1}{J}) e^{γ_{2}} + (- \frac{1}{L} + \frac{1}{R}) e^{γ_{1} + γ_{2} + γ_{3}} + (\frac{1}{L} + \frac{1}{F}) e^{γ_{1}} + (\frac{1}{L} + \frac{1}{J}) e^{γ_{2}}} \\ - e^{γ_{1}} {\frac{1}{L} e^{γ_{1} + γ_{2} + γ_{3}} - (\frac{1}{L} + \frac{1}{F}) e^{γ_{1}} - \frac{1}{L} e^{γ_{2}}} \\ - e^{γ_{2}} {\frac{1}{L} e^{γ_{1} + γ_{2} + γ_{3}} - \frac{1}{L} e^{γ_{1}} - (\frac{1}{L} + \frac{1}{J}) e^{γ_{2}}} \\ = (\frac{1}{L} + \frac{1}{R}) e^{2 (γ_{1} + γ_{2} + γ_{3})} - \frac{2}{L} e^{2 γ_{1} + γ_{2} + γ_{3}} - \frac{2}{L} e^{γ_{1} + 2 γ_{2} + γ_{3}} + (\frac{1}{L} + \frac{1}{F}) e^{2 γ_{1}} + (\frac{1}{L} + \frac{1}{J}) e^{2 γ_{2}} + \frac{2}{L} e^{γ_{1} + γ_{2}} . \end{array}
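The algebra of this delta-method calculation can be cross-checked numerically by evaluating the final closed-form expression and the quadratic form (gradient' × covariance × gradient) side by side. The sketch below is illustrative only: the function names and parameter values are assumptions, and each cell weight is evaluated at that cell's own linear predictor, so the weight J for the (G = 0, E = 1) group uses γ₀ + γ₂ in both numerator and denominator.

```python
from math import exp, log


def logistic_cell_weights(g0, g1, g2, g3, pi00, pi10, pi01, pi11):
    """Weights L, F, J, R: p (1 - p) times prevalence in each exposure cell."""
    def weight(linear_predictor, prevalence):
        odds = exp(linear_predictor)
        return odds / (1.0 + odds) ** 2 * prevalence
    return (weight(g0, pi00), weight(g0 + g1, pi10),
            weight(g0 + g2, pi01), weight(g0 + g1 + g2 + g3, pi11))


def reri_variance_closed_form(g1, g2, g3, L, F, J, R):
    """Final line of the derivation."""
    a, b, c = exp(g1 + g2 + g3), exp(g1), exp(g2)
    return ((1 / L + 1 / R) * a * a - (2 / L) * a * b - (2 / L) * a * c
            + (1 / L + 1 / F) * b * b + (1 / L + 1 / J) * c * c
            + (2 / L) * b * c)


def reri_variance_quadratic_form(g1, g2, g3, L, F, J, R):
    """First line of the derivation: gradient' * covariance * gradient."""
    a, b, c = exp(g1 + g2 + g3), exp(g1), exp(g2)
    grad = [0.0, a - b, a - c, a]
    iL, iF, iJ, iR = 1 / L, 1 / F, 1 / J, 1 / R
    cov = [
        [iL, -iL, -iL, iL],
        [-iL, iL + iF, iL, -iL - iF],
        [-iL, iL, iL + iJ, -iL - iJ],
        [iL, -iL - iF, -iL - iJ, iL + iF + iJ + iR],
    ]
    return sum(grad[i] * cov[i][j] * grad[j]
               for i in range(4) for j in range(4))


# Hypothetical parameter values, for illustration only.
g0, g1, g2, g3 = log(0.05 / 0.95), log(2.0), log(1.5), log(1.5)
weights = logistic_cell_weights(g0, g1, g2, g3, 0.4, 0.3, 0.2, 0.1)
v_closed = reri_variance_closed_form(g1, g2, g3, *weights)
v_quadratic = reri_variance_quadratic_form(g1, g2, g3, *weights)
```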

A.3. Derivations for multiplicative and additive interaction for the log-linear model

For the log-linear model,

log {P (Y = 1 ∣ G = g, E = e)} = κ_{0} + κ_{1} g + κ_{2} e + κ_{3} g e .

(5)

suppose we wish to use a Wald test for the null hypothesis κ₃ = 0. The sample size required to detect a multiplicative interaction of magnitude κ₃ = η with significance level α and power β is

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{mult (R R)}}{η^{2}}

where Z_{1−α/2} and Z_β are the (1−α/2)th and βth quantiles respectively of the standard normal distribution and where V_{mult(RR)} is the variance of κ̂₃ under the alternative that κ₃ = η. Likewise, we can calculate the power for a given sample size using $Power = Φ {- Z_{1 - α / 2} + η \sqrt{(n / V_{mult (R R)})}}$ , where Φ is the standard normal cumulative distribution function. The variance V_{mult(RR)} can be derived as follows. The likelihood is given by

L (κ_{0}, κ_{1}, κ_{2}, κ_{3}) = \prod_{i = 1}^{n} e^{(κ_{0} + κ_{1} g_{i} + κ_{2} e_{i} + κ_{3} g_{i} e_{i}) y_{i}} {1 - e^{(κ_{0} + κ_{1} g_{i} + κ_{2} e_{i} + κ_{3} g_{i} e_{i})}}^{1 - y_{i}}

and the log-likelihood by

l (κ_{0}, κ_{1}, κ_{2}, κ_{3}) = \sum_{i = 1}^{n} [y_{i} (κ_{0} + κ_{1} g_{i} + κ_{2} e_{i} + κ_{3} g_{i} e_{i}) + (1 - y_{i}) log {1 - e^{(κ_{0} + κ_{1} g_{i} + κ_{2} e_{i} + κ_{3} g_{i} e_{i})}}] .

The second derivative is given by

\frac{\partial^{2} l (κ_{0}, κ_{1}, κ_{2}, κ_{3})}{\partial {(κ_{0}, κ_{1}, κ_{2}, κ_{3})}^{2}} = \sum_{i = 1}^{n} \frac{- (1 - y_{i}) Q_{i}}{{(1 - Q_{i})}^{2}} (\begin{matrix} 1 & g_{i} & e_{i} & g_{i} e_{i} \\ g_{i} & g_{i} & g_{i} e_{i} & g_{i} e_{i} \\ e_{i} & g_{i} e_{i} & e_{i} & g_{i} e_{i} \\ g_{i} e_{i} & g_{i} e_{i} & g_{i} e_{i} & g_{i} e_{i} \end{matrix})

where Q_i = e^{κ₀+κ₁g_i+κ₂e_i+κ₃g_ie_i}. Let Q = e^{κ₀+κ₁G+κ₂E+κ₃GE}. The expected information matrix is then given by

\begin{array}{l} I = E [\frac{(1 - Y) Q}{{(1 - Q)}^{2}} (\begin{matrix} 1 & G & E & G E \\ G & G & G E & G E \\ E & G E & E & G E \\ G E & G E & G E & G E \end{matrix})] \\ = E [E [\frac{(1 - Y) Q}{{(1 - Q)}^{2}} (\begin{matrix} 1 & G & E & G E \\ G & G & G E & G E \\ E & G E & E & G E \\ G E & G E & G E & G E \end{matrix}) ∣ G, E]] \\ = E [E [\frac{Q}{(1 - Q)} (\begin{matrix} 1 & G & E & G E \\ G & G & G E & G E \\ E & G E & E & G E \\ G E & G E & G E & G E \end{matrix}) ∣ G, E]] \end{array}

which we may write as

\frac{e^{κ_{0}}}{(1 - e^{κ_{0}})} M_{1} π_{00} + \frac{e^{κ_{0} + κ_{1}}}{(1 - e^{κ_{0} + κ_{1}})} M_{2} π_{10} + \frac{e^{κ_{0} + κ_{2}}}{(1 - e^{κ_{0} + κ_{2}})} M_{3} π_{01} + \frac{e^{κ_{0} + κ_{1} + κ_{2} + κ_{3}}}{(1 - e^{κ_{0} + κ_{1} + κ_{2} + κ_{3}})} M_{4} π_{11}

where

\begin{array}{l} M_{1} = (\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}), M_{2} = (\begin{matrix} 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}) \\ M_{3} = (\begin{matrix} 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}), M_{4} = (\begin{matrix} 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \end{matrix}) \end{array}

If we let $L^{†} = \frac{e^{κ_{0}}}{(1 - e^{κ_{0}})} π_{00}, F^{†} = \frac{e^{κ_{0} + κ_{1}}}{(1 - e^{κ_{0} + κ_{1}})} π_{10}, J^{†} = \frac{e^{κ_{0} + κ_{2}}}{(1 - e^{κ_{0} + κ_{2}})} π_{01}$ , and $R^{†} = \frac{e^{κ_{0} + κ_{1} + κ_{2} + κ_{3}}}{(1 - e^{κ_{0} + κ_{1} + κ_{2} + κ_{3}})} π_{11}$ we then have

I = (\begin{matrix} L^{†} + F^{†} + J^{†} + R^{†} & F^{†} + R^{†} & J^{†} + R^{†} & R^{†} \\ F^{†} + R^{†} & F^{†} + R^{†} & R^{†} & R^{†} \\ J^{†} + R^{†} & R^{†} & J^{†} + R^{†} & R^{†} \\ R^{†} & R^{†} & R^{†} & R^{†} \end{matrix}) .

The inverse of this matrix is

I^{- 1} = (\begin{matrix} \frac{1}{L^{†}} & - \frac{1}{L^{†}} & - \frac{1}{L^{†}} & \frac{1}{L^{†}} \\ - \frac{1}{L^{†}} & \frac{1}{L^{†}} + \frac{1}{F^{†}} & \frac{1}{L^{†}} & - \frac{1}{L^{†}} - \frac{1}{F^{†}} \\ - \frac{1}{L^{†}} & \frac{1}{L^{†}} & \frac{1}{L^{†}} + \frac{1}{J^{†}} & - \frac{1}{L^{†}} - \frac{1}{J^{†}} \\ \frac{1}{L^{†}} & - \frac{1}{L^{†}} - \frac{1}{F^{†}} & - \frac{1}{L^{†}} - \frac{1}{J^{†}} & \frac{1}{L^{†}} + \frac{1}{F^{†}} + \frac{1}{J^{†}} + \frac{1}{R^{†}} \end{matrix}),

from which it follows that the variance of κ̂₃ is $V_{mult (R R)} = \frac{1}{L^{†}} + \frac{1}{F^{†}} + \frac{1}{J^{†}} + \frac{1}{R^{†}}$ .

The RERI from log-linear model (5) is given by:

RERI = e^{κ_{1} + κ_{2} + κ_{3}} - e^{κ_{1}} - e^{κ_{2}} + 1.

Suppose we wish to use a Wald test for the null hypothesis RERI = 0. The sample size required to detect a RERI of magnitude η = e^{κ₁+κ₂+κ₃} − e^{κ₁} − e^{κ₂} + 1 with significance level α and power β is

n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{RERI (R R)}}{η^{2}}

where Z_{1−α/2} and Z_β are the (1−α/2)th and βth quantiles respectively of the standard normal distribution and where V_{RERI(RR)} is the variance of the estimated RERI, e^{κ̂₁+κ̂₂+κ̂₃} − e^{κ̂₁} − e^{κ̂₂} + 1, under the alternative. Likewise, to calculate the power for a given sample size we could use

Power = Φ {- Z_{1 - α / 2} + η \sqrt{(n / V_{RERI (R R)})}} .

Using an argument analogous to that in Appendix A.2 we have that

V_{RERI (R R)}^{*} = (\frac{1}{L^{†}} + \frac{1}{R^{†}}) e^{2 (κ_{1} + κ_{2} + κ_{3})} - \frac{2}{L^{†}} e^{2 κ_{1} + κ_{2} + κ_{3}} - \frac{2}{L^{†}} e^{κ_{1} + 2 κ_{2} + κ_{3}} + (\frac{1}{L^{†}} + \frac{1}{F^{†}}) e^{2 κ_{1}} + (\frac{1}{L^{†}} + \frac{1}{J^{†}}) e^{2 κ_{2}} + \frac{2}{L^{†}} e^{κ_{1} + κ_{2}} .
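For the log-linear model the same computations can be sketched as follows, with the cell weights p/(1 − p) × π replacing the logistic weights. The function names and parameter values are hypothetical. One check that follows directly from the formulae is that when κ₁ = κ₂ = κ₃ = 0 the gradient of RERI is (0, 0, 0, 1), so the variance of the estimated RERI coincides with the variance of κ̂₃.

```python
from math import exp, log
from statistics import NormalDist


def loglinear_cell_weights(k0, k1, k2, k3, pi00, pi10, pi01, pi11):
    """Weights p / (1 - p) times prevalence, with p = exp(linear predictor)."""
    def weight(linear_predictor, prevalence):
        risk = exp(linear_predictor)
        if not 0.0 < risk < 1.0:
            raise ValueError("the log-linear model requires risks below 1")
        return risk / (1.0 - risk) * prevalence
    return (weight(k0, pi00), weight(k0 + k1, pi10),
            weight(k0 + k2, pi01), weight(k0 + k1 + k2 + k3, pi11))


def v_mult_rr(L, F, J, R):
    """Variance of kappa3-hat (multiplicative interaction, risk ratio scale)."""
    return 1 / L + 1 / F + 1 / J + 1 / R


def v_reri_rr(k1, k2, k3, L, F, J, R):
    """Delta-method variance of the estimated RERI on the risk ratio scale."""
    a, b, c = exp(k1 + k2 + k3), exp(k1), exp(k2)
    return ((1 / L + 1 / R) * a * a - (2 / L) * a * b - (2 / L) * a * c
            + (1 / L + 1 / F) * b * b + (1 / L + 1 / J) * c * c
            + (2 / L) * b * c)


def wald_sample_size(effect, variance, alpha=0.05, power=0.80):
    """n = (Z_{1-alpha/2} + Z_beta)^2 V / eta^2."""
    normal = NormalDist()
    z_sum = normal.inv_cdf(1 - alpha / 2) + normal.inv_cdf(power)
    return z_sum ** 2 * variance / effect ** 2


# Hypothetical parameter values, for illustration only.
k0, k1, k2, k3 = log(0.05), log(2.0), log(1.5), log(1.5)
pi = (0.4, 0.3, 0.2, 0.1)
w = loglinear_cell_weights(k0, k1, k2, k3, *pi)
reri = exp(k1 + k2 + k3) - exp(k1) - exp(k2) + 1
n_multiplicative = wald_sample_size(k3, v_mult_rr(*w))
n_additive = wald_sample_size(reri, v_reri_rr(k1, k2, k3, *w))
```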

A.4 Derivations for Case-Control Exposure Probabilities from the Probabilities in the Underlying Population

Here we derive the proportions in each joint exposure group in the case-control sample, $π_{00}^{*}, π_{10}^{*}, π_{01}^{*}, π_{11}^{*}$ , from the proportion in each joint exposure group in the underlying population, π₀₀, π₁₀, π₀₁, π₁₁, under an assumption that the outcome is rare. We will use P^*(·) to denote probabilities in the case-control sample and P(·) to denote probabilities in the underlying population. We have that

\begin{array}{l} π_{g e}^{*} = P^{*} (G = g, E = e) = P^{*} (G = g, E = e ∣ Y = 0) P^{*} (Y = 0) + P^{*} (G = g, E = e ∣ Y = 1) P^{*} (Y = 1) \\ = P (G = g, E = e ∣ Y = 0) P^{*} (Y = 0) + P (G = g, E = e ∣ Y = 1) P^{*} (Y = 1) \\ \approx π_{g e} P^{*} (Y = 0) + P (G = g, E = e ∣ Y = 1) P^{*} (Y = 1) \end{array}

where the final approximation follows because the outcome is rare and thus the exposure distribution among the controls will approximate that in the underlying population. We then also have that

\begin{array}{l} P (G = g, E = e ∣ Y = 1) = \frac{P (Y = 1 ∣ G = g, E = e) P (G = g, E = e)}{P (Y = 1)} \\ = \frac{P (Y = 1 ∣ G = g, E = e) P (G = g, E = e)}{\sum_{i, j} P (Y = 1 ∣ G = i, E = j) P (G = i, E = j)} \\ = \frac{\frac{P (Y = 1 ∣ G = g, E = e)}{P (Y = 1 ∣ G = 0, E = 0)} π_{g e}}{\sum_{i, j} \frac{P (Y = 1 ∣ G = i, E = j)}{P (Y = 1 ∣ G = 0, E = 0)} π_{i j}} \\ \approx \frac{\frac{P (Y = 1 ∣ G = g, E = e) / {1 - P (Y = 1 ∣ G = g, E = e)}}{P (Y = 1 ∣ G = 0, E = 0) / {1 - P (Y = 1 ∣ G = 0, E = 0)}} π_{g e}}{\sum_{i, j} \frac{P (Y = 1 ∣ G = i, E = j) / {1 - P (Y = 1 ∣ G = i, E = j)}}{P (Y = 1 ∣ G = 0, E = 0) / {1 - P (Y = 1 ∣ G = 0, E = 0)}} π_{i j}} \end{array}

where the final approximation follows from the rare outcome assumption, which implies that risk ratios approximate odds ratios. The odds ratios can then be obtained from the specification of the parameters of the logistic regression model and we thus obtain:

\begin{array}{l} π_{00}^{*} \approx π_{00} P^{*} (Y = 0) + \frac{π_{00}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \\ π_{10}^{*} \approx π_{10} P^{*} (Y = 0) + \frac{e^{γ_{1}} π_{10}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \\ π_{01}^{*} \approx π_{01} P^{*} (Y = 0) + \frac{e^{γ_{2}} π_{01}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \\ π_{11}^{*} \approx π_{11} P^{*} (Y = 0) + \frac{e^{γ_{1} + γ_{2} + γ_{3}} π_{11}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1) \end{array}
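The four approximations above translate directly into a short function. The sketch below is an illustration under the rare outcome assumption; the function name, argument layout and numerical values are assumptions made here, with `case_fraction` standing for P*(Y = 1).

```python
from math import exp, log


def case_control_exposure_props(pi, gammas, case_fraction):
    """Approximate (pi00*, pi10*, pi01*, pi11*) in the case-control sample.

    pi: population proportions (pi00, pi10, pi01, pi11)
    gammas: log odds ratios (gamma1, gamma2, gamma3)
    case_fraction: proportion of cases in the sample, P*(Y = 1)
    """
    g1, g2, g3 = gammas
    odds_ratios = (1.0, exp(g1), exp(g2), exp(g1 + g2 + g3))
    # Shared denominator: pi00 + pi10 e^g1 + pi01 e^g2 + pi11 e^(g1+g2+g3)
    denom = sum(p * o for p, o in zip(pi, odds_ratios))
    return tuple(p * (1.0 - case_fraction) + p * o / denom * case_fraction
                 for p, o in zip(pi, odds_ratios))


# Hypothetical inputs, for illustration only.
pi = (0.4, 0.3, 0.2, 0.1)
gammas = (log(2.0), log(1.5), log(1.5))
pi_star = case_control_exposure_props(pi, gammas, case_fraction=0.5)
```

Since the case-control sample over-represents cases, exposure groups with odds ratios above the weighted average are over-represented relative to the underlying population.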

Under this rare outcome assumption we can also obtain γ₀ = log{P^*(Y = 1|G = 0, E = 0)/P^*(Y = 0|G = 0, E = 0)}, the log odds of the outcome in the doubly unexposed group in the case-control sample, from P^*(Y = 0) and P^*(Y = 1) because

\begin{array}{l} P^{*} (Y = 1 ∣ G = 0, E = 0) = \frac{P^{*} (G = 0, E = 0 ∣ Y = 1) P^{*} (Y = 1)}{P^{*} (G = 0, E = 0)} \\ = \frac{P (G = 0, E = 0 ∣ Y = 1) P^{*} (Y = 1)}{P^{*} (G = 0, E = 0)} \\ \approx \frac{\frac{π_{00}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1)}{π_{00} P^{*} (Y = 0) + \frac{π_{00}}{π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}} P^{*} (Y = 1)} \\ = \frac{P^{*} (Y = 1)}{(π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}) P^{*} (Y = 0) + P^{*} (Y = 1)} \\ = 1 / {1 + (π_{00} + π_{10} e^{γ_{1}} + π_{01} e^{γ_{2}} + π_{11} e^{γ_{1} + γ_{2} + γ_{3}}) P^{*} (Y = 0) / P^{*} (Y = 1)} . \end{array}
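As a small worked illustration of this last expression, the baseline log odds γ₀ in the case-control sample can be computed from the population exposure proportions, the odds ratio parameters and the case fraction. The function name and inputs below are hypothetical. Writing S for the sum π₀₀ + π₁₀e^{γ₁} + π₀₁e^{γ₂} + π₁₁e^{γ₁+γ₂+γ₃}, the expression implies γ₀ = log[P*(Y = 1)/{S P*(Y = 0)}].

```python
from math import exp, log


def baseline_log_odds(pi, gammas, case_fraction):
    """gamma0 in the case-control sample under the rare outcome approximation."""
    pi00, pi10, pi01, pi11 = pi
    g1, g2, g3 = gammas
    s = pi00 + pi10 * exp(g1) + pi01 * exp(g2) + pi11 * exp(g1 + g2 + g3)
    # P*(Y = 1 | G = 0, E = 0) = 1 / {1 + S P*(Y = 0) / P*(Y = 1)}
    p_case_unexposed = 1.0 / (1.0 + s * (1.0 - case_fraction) / case_fraction)
    return log(p_case_unexposed / (1.0 - p_case_unexposed))


# Hypothetical inputs, for illustration only.
pi = (0.4, 0.3, 0.2, 0.1)
gammas = (log(2.0), log(1.5), log(1.5))
gamma0 = baseline_log_odds(pi, gammas, case_fraction=0.5)
```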

If instead the proportions in each joint exposure group in the case-control sample are specified, $π_{00}^{*}, π_{10}^{*}, π_{01}^{*}, π_{11}^{*}$ , then we could obtain γ₀ by numerically solving

P^{*} (Y = 0) = \frac{π_{00}^{*}}{1 + e^{γ_{0}}} + \frac{π_{10}^{*}}{1 + e^{γ_{0} + γ_{1}}} + \frac{π_{01}^{*}}{1 + e^{γ_{0} + γ_{2}}} + \frac{π_{11}^{*}}{1 + e^{γ_{0} + γ_{1} + γ_{2} + γ_{3}}}

for γ₀. If the joint or marginal exposure probabilities are specified separately for the cases and controls then under an assumption of a rare outcome, the distribution of the exposures amongst the controls could be used as an approximation to π₀₀, π₁₀, π₀₁, π₁₁ or π_g, π_e, Δ.
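The equation for γ₀ has no closed-form solution in general, but its right-hand side is strictly decreasing in γ₀, so a simple bisection suffices. The sketch below is one possible way to do this and is not taken from the supplementary spreadsheets; the function names, the search interval and the inputs are assumptions.

```python
from math import exp, log


def control_fraction(gamma0, pi_star, gammas):
    """Right-hand side of the equation: implied P*(Y = 0) for a given gamma0."""
    g1, g2, g3 = gammas
    linear_terms = (0.0, g1, g2, g1 + g2 + g3)
    return sum(p / (1.0 + exp(gamma0 + lin))
               for p, lin in zip(pi_star, linear_terms))


def solve_gamma0(p_control, pi_star, gammas, lo=-30.0, hi=30.0, tol=1e-12):
    """Bisection for gamma0; control_fraction is decreasing in gamma0."""
    if not (control_fraction(hi, pi_star, gammas) < p_control
            < control_fraction(lo, pi_star, gammas)):
        raise ValueError("no solution for gamma0 inside the search interval")
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if control_fraction(mid, pi_star, gammas) > p_control:
            lo = mid  # implied control fraction too large: gamma0 must increase
        else:
            hi = mid
    return (lo + hi) / 2.0


# Hypothetical inputs, for illustration only.
pi_star = (0.3, 0.3, 0.2, 0.2)
gammas = (log(2.0), log(1.5), log(1.5))
gamma0_true = -0.4
target = control_fraction(gamma0_true, pi_star, gammas)
gamma0_solved = solve_gamma0(target, pi_star, gammas)
```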

Appendix 2. Epidemiologic Practice: Excel Spreadsheets for Sample Size and Power Calculations for Additive and Multiplicative Interaction

As part of the online supplement for this paper there are two Excel spreadsheets that will automatically perform power and sample size calculations for additive and multiplicative interaction for (i) cohort data and (ii) case-control and case-only data. All of these spreadsheets return sample size and power calculations for the Wald test statistic for additive or multiplicative interaction with variance calculated under the alternative (cf. Demidenko, 2008; VanderWeele, 2011).

The first spreadsheet performs power and sample size calculations for additive and multiplicative interaction for cohort data. For the power calculations, the user has the option of entering marginal exposure probabilities and the odds ratio relating the prevalence of both exposures (Sheet 1) or the joint exposure probabilities (Sheet 2). On Sheet 1, the user inputs the significance level of the test (alpha), the sample size (n), the probability of the outcome in the doubly unexposed reference group (p00), the main effect odds ratio for the first exposure (OR10), the main effect odds ratio for the second exposure (OR01), the odds ratio multiplicative interaction (IOR=OR₁₁/(OR₁₀*OR₀₁)), the marginal prevalence of the first exposure (P(G=1)), the marginal prevalence of the second exposure (P(E=1)) and the odds ratio relating the dependence between the two exposures (OR_GE). The Excel spreadsheet returns both one-sided power (to detect positive interaction) and two-sided power (to detect positive or negative interaction) for (i) additive interaction on the risk difference scale, (ii) multiplicative interaction on the risk ratio scale, (iii) multiplicative interaction on the odds ratio scale, (iv) additive interaction using the relative excess risk due to interaction (RERI; cf. Hosmer and Lemeshow, 1992) for risk ratios, and (v) additive interaction using the relative excess risk due to interaction for odds ratios, assuming a rare outcome. On Sheet 2, the user specifies the same inputs except that instead of the marginal probabilities and odds ratio relating the exposures (P(G=1), P(E=1), OR_GE), the user specifies the joint exposure probabilities for each of the four possible exposure combinations (in the Excel spreadsheet these are pi00, pi10, pi01, pi11). The Excel spreadsheet then again returns items (i)–(v) above.
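The two input options on Sheets 1 and 2 carry the same information: marginal exposure prevalences together with an exposure odds ratio determine the four joint exposure probabilities. The following sketch shows one standard way to make that conversion by solving the quadratic implied by OR_GE = π₁₁π₀₀/(π₁₀π₀₁). It is an assumption that the spreadsheet proceeds in this way; the function name and inputs are hypothetical.

```python
from math import sqrt


def joint_exposure_probs(p_g, p_e, or_ge):
    """Return (pi00, pi10, pi01, pi11) from P(G=1), P(E=1) and OR_GE."""
    if or_ge == 1.0:
        pi11 = p_g * p_e  # independent exposures
    else:
        # (OR - 1) x^2 - {1 + (p_g + p_e)(OR - 1)} x + OR p_g p_e = 0
        b = 1.0 + (p_g + p_e) * (or_ge - 1.0)
        disc = b * b - 4.0 * (or_ge - 1.0) * or_ge * p_g * p_e
        pi11 = (b - sqrt(disc)) / (2.0 * (or_ge - 1.0))
    pi10 = p_g - pi11
    pi01 = p_e - pi11
    pi00 = 1.0 - p_g - p_e + pi11
    return pi00, pi10, pi01, pi11


# Hypothetical inputs, for illustration only.
pi00, pi10, pi01, pi11 = joint_exposure_probs(0.3, 0.4, 2.0)
```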

For sample size calculations from cohort data, the user has the option of entering marginal exposure probabilities and the odds ratio relating the prevalence of both exposures (Sheet 3) or the joint exposure probabilities (Sheet 4). The user specifies exactly the same parameters as the spreadsheet for power calculations for cohort data except that instead of specifying the sample size, the power is specified (Power), and the Excel spreadsheet returns the required sample size for a test of the specified significance level and power to detect (i) additive interaction on the risk difference scale, (ii) multiplicative interaction on the risk ratio scale, (iii) multiplicative interaction on the odds ratio scale, (iv) additive interaction using the relative excess risk due to interaction (RERI) for risk ratios, (v) additive interaction using the relative excess risk due to interaction for odds ratios, assuming a rare outcome.

The second spreadsheet performs power and sample size calculations for additive and multiplicative interaction for case-control and case-only data. For power calculations (Sheet 1), the user inputs the significance level of the test (alpha), the number of cases (n Cases) and number of controls (n Controls), the main effect odds ratio for the first exposure (OR10), the main effect odds ratio for the second exposure (OR01), the odds ratio multiplicative interaction (IOR), the marginal prevalence of the first exposure (P(G=1)), the marginal prevalence of the second exposure (P(E=1)) and the odds ratio relating the dependence between the two exposures (OR_GE). The Excel spreadsheet returns both one-sided power (to detect positive interaction) and two-sided power (to detect positive or negative interaction) for (i) additive interaction using the relative excess risk due to interaction (RERI) for odds ratios and (ii) multiplicative interaction on the odds ratio scale. If the two exposures are specified as independent (i.e. if OR_GE is specified as 1) then the spreadsheet will also return the power for the case-only estimator of multiplicative interaction (cf. Piegorsch et al., 1994; Yang et al., 1999) based on the number of cases. If the two exposures are not specified as independent (i.e. if OR_GE is specified as any number other than 1), the spreadsheet will return “#DIV/0!” for the power for the case-only estimator, indicating that the case-only test is inapplicable in this setting because the two exposures are not independent. All power calculations for the case-control and case-only power spreadsheet make a rare outcome assumption. The power calculations are based on the variance calculated under the alternative (as in Demidenko (2008) for logistic regression multiplicative interactions and VanderWeele (2011) for case-only multiplicative interactions) rather than the variance calculated under the null, as the variance under the alternative corresponds to the test statistics that are commonly used in practice.

For sample size calculations for additive and multiplicative interaction for case-control and case-only data (Sheet 2), the user inputs the significance level of the test (alpha), the proportion of cases in the case-control sample (Cs/(Cs+Cont)), the desired power of the test (Power), the main effect odds ratio for the first exposure (OR10), the main effect odds ratio for the second exposure (OR01), the odds ratio multiplicative interaction (IOR), the marginal prevalence of the first exposure (P(G=1)), the marginal prevalence of the second exposure (P(E=1)) and the odds ratio relating the dependence between the two exposures (OR_GE). The Excel spreadsheet returns the required sample size for a test of the specified significance level and power for (i) additive interaction using the relative excess risk due to interaction (RERI) for odds ratios and (ii) multiplicative interaction on the odds ratio scale. If the two exposures are specified as independent (i.e. if OR_GE is specified as 1) then the spreadsheet will also return the required sample size, i.e. number of cases, to detect multiplicative interaction for the case-only estimator of multiplicative interaction. If the two exposures are not specified as independent (i.e. if OR_GE is specified as any number other than 1), the spreadsheet will return “#DIV/0!” for the required sample size for the case-only estimator indicating that the case-only test is inapplicable in this setting because the two exposures are not independent. All power calculations for the case-control and case-only sample size spreadsheet make a rare outcome assumption. The sample size calculations are based on the variance calculated under the alternative as this corresponds to the test statistics that are commonly used in practice (cf. Garcia-Closas and Lubin, 1999; Demidenko, 2008; VanderWeele, 2011).

Footnotes

¹

Demidenko (2008) also noted that a number of previous authors (Hwang et al., 1994; Foppa and Spiegelman, 1997) who had considered sample size and power calculations for interaction in logistic regression had relied on a different formula for their sample size calculations. These other authors had assumed that for the test statistic, the variance of γ̂₃ had been calculated under the null hypothesis of no interaction. When the variance for the test statistic is calculated under the null of no interaction then the required sample size is given by $n = \frac{{(Z_{1 - α / 2} \sqrt{V_{0}} + Z_{β} \sqrt{V_{mult (O R)}})}^{2}}{η^{2}}$ rather than by $n = \frac{{(Z_{1 - α / 2} + Z_{β})}^{2} V_{mult (O R)}}{η^{2}}$ where V₀ is the variance of γ̂₃ calculated under the null that γ₃ = 0. Demidenko (2008) points out that although the sample size calculations of Hwang et al. (1994) and Foppa and Spiegelman (1997) would be fine if, for γ̂₃, the variance were indeed calculated under the null, in practice, the variance of γ̂₃ is almost always calculated under the alternative; it is the variance under the alternative that is generally given as the default in standard logistic regression output. Thus, the sample size calculations of Hwang et al. (1994) and Foppa and Spiegelman (1997), although not technically incorrect, do not correspond to the test statistics that are generally used in practice. A similar point and criticism was made by Garcia-Closas and Lubin (1999) some years earlier. Both Garcia-Closas and Lubin (1999) and Demidenko (2008) note that when interactions are large, the sample size calculations using the “null-variance” can underestimate the required sample size if the test statistic with the variance under the alternative is in fact used. Likewise a similar point pertains to the sample size and power calculations of Yang et al. (1997) for multiplicative interaction in case-only studies (cf. VanderWeele, 2011).
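The difference between the two sample size formulae discussed in this footnote can be made concrete with a short sketch. The null-variance form is written here as n = (Z_{1−α/2}√V₀ + Z_β√V_mult(OR))²/η², which reduces to the alternative-variance form when V₀ = V_mult(OR). The function names and variance values are hypothetical; whether V₀ is smaller or larger than the variance under the alternative depends on the scenario.

```python
from math import sqrt
from statistics import NormalDist


def n_alternative_variance(eta, v_alt, alpha=0.05, power=0.80):
    """Sample size when the Wald statistic uses the variance under the alternative."""
    normal = NormalDist()
    z_alpha, z_beta = normal.inv_cdf(1 - alpha / 2), normal.inv_cdf(power)
    return (z_alpha + z_beta) ** 2 * v_alt / eta ** 2


def n_null_variance(eta, v_null, v_alt, alpha=0.05, power=0.80):
    """Sample size when the test statistic uses the variance under the null."""
    normal = NormalDist()
    z_alpha, z_beta = normal.inv_cdf(1 - alpha / 2), normal.inv_cdf(power)
    return (z_alpha * sqrt(v_null) + z_beta * sqrt(v_alt)) ** 2 / eta ** 2


# Hypothetical values, for illustration only: null variance below the
# variance under the alternative.
eta, v_null, v_alt = 0.7, 60.0, 90.0
n_null = n_null_variance(eta, v_null, v_alt)
n_alt = n_alternative_variance(eta, v_alt)
```

With these illustrative inputs the null-variance formula returns the smaller sample size, which is the direction of the underestimation noted above.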

References

Albert PS, Ratnasinghe D, Tangrea J, Wacholder S. Limitations of the case-only design for identifying gene-environment interactions. American Journal of Epidemiology. 2001;154:687–693. doi: 10.1093/aje/154.8.687.
Cordell HJ. Detecting gene-gene interaction that underlie human diseases. Nature Reviews Genetics. 2009;10:392–404. doi: 10.1038/nrg2579.
Demidenko E. Sample size and optimal design for logistic regression with binary interaction. Statistics in Medicine. 2008;27:36–46. doi: 10.1002/sim.2980.
Foppa I, Spiegelman D. Power and sample size calculations for case-control studies of gene-environment interactions with a polytomous exposure variable. American Journal of Epidemiology. 1997;146:596–604. doi: 10.1093/oxfordjournals.aje.a009320.
Garcia-Closas M, Lubin JH. Power and sample size calculations in case-control studies of gene-environment interactions: Comments on different approaches. American Journal of Epidemiology. 1999;149:689–692. doi: 10.1093/oxfordjournals.aje.a009876.
Garcia-Closas M, Thompson WD, Robins JM. Differential misclassification and the assessment of gene-environment interactions. American Journal of Epidemiology. 1998;147:426–433. doi: 10.1093/oxfordjournals.aje.a009467.
Gauderman WJ. Sample size requirements for association studies of gene-gene interaction. American Journal of Epidemiology. 2002a;155:478–484. doi: 10.1093/aje/155.5.478.
Gauderman WJ. Sample size requirements for matched case-control studies of gene-environment interaction. Statistics in Medicine. 2002b;21:35–50. doi: 10.1002/sim.973.
Greenland S. Tests for interaction in epidemiologic studies: a review and study of power. Statistics in Medicine. 1983;2:243–251. doi: 10.1002/sim.4780020219.
Greenland S. Power, sample size and smallest detectable effect determination for multivariate studies. Statistics in Medicine. 1985;4:117–127. doi: 10.1002/sim.4780040203.
Hernán MA. A definition of causal effect for epidemiological studies. Journal of Epidemiology and Community Health. 2004;58:265–271. doi: 10.1136/jech.2002.006361.
Hosmer DW, Lemeshow S. Confidence interval estimation of interaction. Epidemiology. 1992;3:452–456. doi: 10.1097/00001648-199209000-00012.
Hwang S-J, Beaty TH, Liang K-Y, Coresh J, Khoury MJ. Minimum sample size estimation to detect gene-environment interaction in case-control designs. American Journal of Epidemiology. 1994;140:1029–1037. doi: 10.1093/oxfordjournals.aje.a117193.
Luan J, Wong M, Day N, Wareham N. Sample size determination for studies of gene-environment interaction. International Journal of Epidemiology. 2001;30:1035–1040. doi: 10.1093/ije/30.5.1035.
Lubin JH, Gail MH. On power and sample size for studying features of the relative odds of disease. American Journal of Epidemiology. 1990;131:552–566. doi: 10.1093/oxfordjournals.aje.a115530.
Piegorsch WW, Weinberg CR, Taylor JA. Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. Statistics in Medicine. 1994;13:153–162. doi: 10.1002/sim.4780130206.
Phillips PC. Epistasis – the essential role of gene interactions in the structure and evolution of genetic systems. Nature Reviews Genetics. 2008;9:855–867. doi: 10.1038/nrg2452.
Qiu P, Moeschberger ML, Cooke GE, Goldschmidt-Clermont PJ. Sample size to test for interaction between a specific exposure and a second risk factor in a pair-matched case-control study. Statistics in Medicine. 2000;19:923–935. doi: 10.1002/(sici)1097-0258(20000415)19:7<923::aid-sim341>3.0.co;2-o.
Robinson L, Jewell NP. Some surprising results about covariate adjustment in logistic regression models. International Statistical Review. 1991;59:227–240.
Rothman KJ. Causes. American Journal of Epidemiology. 1976;104:587–592. doi: 10.1093/oxfordjournals.aje.a112335.
Rothman KJ, Greenland S, Walker AM. Concepts of interaction. American Journal of Epidemiology. 1980;112:467–470. doi: 10.1093/oxfordjournals.aje.a113015.
Rothman KJ. Modern Epidemiology. 1st ed. Boston, MA: Little, Brown and Company; 1986.
Rothman KJ, Greenland S, Lash TL. Concepts of interaction. In: Modern Epidemiology. 3rd ed. Philadelphia: Lippincott Williams and Wilkins; 2008: chapter 5.
Rubin DB. Formal modes of statistical inference for causal effects. Journal of Statistical Planning and Inference. 1990;25:279–292.
Sturmer T, Brenner H. Flexible matching strategies to increase power and efficiency to detect and estimate gene-environment interactions in case-control studies. American Journal of Epidemiology. 2002;155:593–602. doi: 10.1093/aje/155.7.593.
VanderWeele TJ. Sufficient cause interactions and statistical interactions. Epidemiology. 2009a;20:6–13. doi: 10.1097/EDE.0b013e31818f69e7.
VanderWeele TJ. On the distinction between interaction and effect modification. Epidemiology. 2009b;20:863–871. doi: 10.1097/EDE.0b013e3181ba333c.
VanderWeele TJ. Empirical tests for compositional epistasis. Nature Reviews Genetics. 2010a;11:166. doi: 10.1038/nrg2579-c1.
VanderWeele TJ. Epistatic interactions. Statistical Applications in Genetics and Molecular Biology. 2010b;9:Article 1, 1–22. doi: 10.2202/1544-6115.1517.
VanderWeele TJ. Sample size and power calculations for case-only interaction studies. Epidemiology. 2011;22:873–874. doi: 10.1097/EDE.0b013e31822e18e5.
VanderWeele TJ. Inference for additive interaction under exposure misclassification. Biometrika. 2012;99:502–508. doi: 10.1093/biomet/ass012.
VanderWeele TJ, Arah OA. Bias formulas for sensitivity analysis of unmeasured confounding for general outcomes, treatments and confounders. Epidemiology. 2011;22:42–52. doi: 10.1097/EDE.0b013e3181f74493.
VanderWeele TJ, Knol MJ. The interpretation of subgroup analyses in randomized trials: heterogeneity versus secondary interventions. Annals of Internal Medicine. 2011;154:680–683. doi: 10.7326/0003-4819-154-10-201105170-00008.
VanderWeele TJ, Mukherjee B, Chen J. Sensitivity analysis for interactions under unmeasured confounding. Statistics in Medicine. 2012; in press. doi: 10.1002/sim.4354.
VanderWeele TJ, Robins JM. The identification of synergism in the sufficient-component cause framework. Epidemiology. 2007;18:329–339. doi: 10.1097/01.ede.0000260218.66432.88.
VanderWeele TJ, Robins JM. Empirical and counterfactual conditions for sufficient cause interactions. Biometrika. 2008;95:49–61.
Wang S, Zhao H. Sample size needed to detect gene-gene interactions using association designs. American Journal of Epidemiology. 2003;158:899–914. doi: 10.1093/aje/kwg233.
Yang Q, Khoury MJ, Flanders WD. Sample size requirements in case-only designs to detect gene-environment interaction. American Journal of Epidemiology. 1997;146:713–719. doi: 10.1093/oxfordjournals.aje.a009346.
Yang Q, Khoury MJ, Sun F, Flanders WD. Case-only design to measure gene–gene interaction. Epidemiology. 1999;10:167–170.
Zhang L, Mukherjee B, Ghosh M, Gruber S, Moreno V. Accounting for error due to misclassification of exposures in case-control studies of gene-environment interaction. Statistics in Medicine. 2008;27:2756–2783. doi: 10.1002/sim.3044.

Supplementary Materials

Supplement Case-Control: NIHMS632459-supplement-Supllement_CaseControl.xls (33.5KB, xls)

Supplement Cohort: NIHMS632459-supplement-Supplement_Cohort.xls (58.5KB, xls)

