Skip to main content
American Journal of Epidemiology logoLink to American Journal of Epidemiology
. 2010 Jan 4;171(4):498–505. doi: 10.1093/aje/kwp398

A Comparison of Sample Size and Power in Case-Only Association Studies of Gene-Environment Interaction

Geraldine M Clarke *, Andrew P Morris
PMCID: PMC2816730  PMID: 20047976

Abstract

Assuming continuous, normally distributed environmental and categorical genotype variables, the authors compare 6 case-only designs for tests of association in gene-environment interaction. Novel tests modeling the environmental variable as either the response or the predictor and allowing a genetic variable with multiallelic variants are included. The authors show that tests imposing the same genotypic pattern of inheritance perform similarly regardless of whether genotype is the response variable or the predictor variable. The novel tests using the genetic variable as the response variable are advantageous because they are robust to non-normally distributed environmental exposures. Dominance deviance—deviation from additivity in the main or interaction effects—is key to test performance: When it is zero or modest, tests searching for a trend with increasing risk alleles are optimal; when it is large, tests for genotypic effects are optimal. However, the authors show that dominance deviance is attenuated when it is observed at a proxy locus, which is common in genome-wide association studies, so large dominance deviance is likely to be rare. The authors conclude that the trend test is the appropriate tool for large-scale association scans where the true gene-environment interaction model is unknown. The common practice of assuming a dominant pattern of inheritance can cause serious losses of power in the presence of any recessive, or modest dominant, effects.

Keywords: dominance, genome-wide association study, interaction, linkage disequilibrium, lung neoplasms, power, research design, sample size


Gene-environment (G-E) interaction refers to variation in the effect of genetic factors, measured in a suitable scale, as a function of variation in environmental exposure and vice versa. Case-only study designs have been shown to be more powerful than case-control study designs in the assessment of possible interactions between genetic factors and environmental exposures in the etiology of a disease (1, 2). The ability to test for G-E interaction using the case population only may be advantageous—for example, in studies that rely on public population control data for which information on environmental exposures of interest is not available. Assuming independence between environment and genotype in the population of controls, Begg and Zhang (3), Piegorsch et al. (2), Umbach and Weinberg (4), and Yang et al. (5) showed that efficient estimates of interaction for categorical environment and binary genotype variables can be obtained via logistic regression in a case-only analysis. Albert et al. (6), Armstrong (7), and Cheng (8) explored extensions to continuous environment and categorical genotype variables using both logistic and multinomial regression. Typically, the genetic factor is treated as the response variable, with the environmental exposure being used as the predictor variable; the converse, whereby the response variable is the environmental exposure and thus the effect of a genetic factor on the association between the environmental exposure and disease is studied, has received less attention. With the genetic factor used as the predictor variable rather than the response variable, there is greater freedom to categorize it appropriately. By definition, a successful finding in either situation represents a G-E interaction. Kraft et al. (9) compared the power and sample-size requirements of various case-control study designs for testing for G-E interaction to a single case-only design that assumed a dominant inheritance pattern at the genetic locus. Like other investigators (2, 5), Kraft et al. showed that in the presence of any interaction effect, the case-only design was most powerful.

Here we consider a categorical genetic variable G and a normally distributed environmental exposure variable E. We assume independence between G and E in the population under study and compare the power and sample-size requirements of 6 case-only tests for detecting G-E interaction. Three categorizations of G are considered: binary, ordinal, and nominal. The first 3 tests model E as the response variable using a linear regression with each category of G as the predictor. The remaining 3 tests use multinomial regression techniques to allow each categorization of G to be the response variable, with E as the predictor variable: A logistic regression (this test is the same as that examined by Kraft et al. (9)), a proportional odds regression, and a multinomial regression are performed for binary, ordinal, and nominal codings of G. Each test is considered under 3 different penetrance models that are based on logistic risk models.

MATERIALS AND METHODS

Models

Let D be an indicator of disease (D = 1 if affected and D = 0 otherwise), E a standard normally distributed environmental exposure variable, and g a genotype at a disease susceptibility locus G with alleles A and a. Let pA denote the prevalence of the minor allele, A. Let G denote the minor allele count: G = 0 when g = a/a, G = 1 when g = a/A, and G = 2 when g = A/A. Let GD equal 1 when g = a/A; otherwise, GD = 0. G is then an additive genotype coding and GD is its deviation from additivity.

We consider true penetrance models based on the log odds of disease:

graphic file with name amjepidkwp398fx1_ht.jpg (1)

bG represents an additive genetic main effect with bGD its deviation from additivity, hereafter referred to as dominance deviance; bE represents an environmental main effect; and bEG represents an additive G-E interaction effect with bEGD its dominance deviance. In practical terms, bE represents the log odds ratio for disease risk associated with a 1-unit increase in E among persons with no minor alleles; bG + bGD and 2bG are the log odds ratios associated with an increase of 1 and 2 minor alleles, respectively, in persons with zero exposure, etc. We assume that G and E are independent in the population under study.

Cases are persons affected by the disease of interest (D = 1). Below we describe the 6 case-only tests considered. Each test is based on a model that generates a log likelihood, and tests of G-E interaction are based on standard likelihood ratio statistics. The specific likelihoods associated with each model and our approach to power and sample-size estimation are described in the Web Appendix, which is posted on the Journal’s Web site (http://aje.oxfordjournals.org/).

Linear regression additive (EGT) model.

The relation between E and G is modeled as a simple linear regression assuming additive genetic effects at the genetic locus:

graphic file with name amjepidkwp398fx2_ht.jpg

The likelihood ratio test constrains βEG(1)≡0 under the null hypothesis and has 1 df. This test assumes a trend in risk with an increasing number of risk alleles and is referred to as a trend test.

Linear regression dominant (EGD) model.

The relation between E and G is modeled as a simple linear regression, assuming a dominant pattern of inheritance at the genetic locus: GDOM = 1 for carriers of the minor allele and zero otherwise.

graphic file with name amjepidkwp398fx3_ht.jpg

The likelihood ratio test constrains βEGDOM(2)≡0 under the null hypothesis and has 1 df. This test is referred to as a dominant test.

Linear regression deviation additivity (EGG) model.

The relation between E and G is modeled as a simple linear regression but with the introduction of a term to model any dominance deviance associated with the interaction:

graphic file with name amjepidkwp398fx4_ht.jpg

The likelihood ratio test constrains βEG(3)≡0 and βEGD(3)≡0 under the null hypothesis and has 2 df. This test allows independent genotype effects and is referred to as a genotypic test.

Logistic regression dominant (GDE) model.

The relation between carriers of the minor allele and E is modeled as a logistic regression:

graphic file with name amjepidkwp398fx5_ht.jpg

If the genetic factor and the environmental exposure are independent among the cases, then βEG(4) is a consistent estimator of the interaction parameter measuring the departure of the joint effects of E and GDOM from multiplicative risk ratios (8). If the disease is rare, then βEG(4) is also a good estimator of the interaction parameter measuring the departure of the joint effects of E and GDOM from multiplicative odds ratios. The likelihood ratio test constrains βEG(4)≡0 under the null hypothesis of no interaction and has 1 df. Like EGD, this test assumes a dominant pattern of inheritance at the genetic locus and is referred to as a dominant test.

Proportional odds regression (GTE) model.

We compare the probability of an equal or smaller number of high-risk alleles, Gk, to the probability of a larger number, G > k, as a function of the continuous environmental exposure E:

graphic file with name amjepidkwp398fx6_ht.jpg

If βEG(5) > 0, then increasing values of E are associated with increasing genetic risk in the case population. The likelihood ratio test constrainsβEG(5)≡0 under the null hypothesis of no interaction and has 1 df. Like EGT, this test assumes a trend in risk with an increasing number of risk alleles and is referred to as a trend test.

Multinomial regression genetic response (GGE) model.

The probability of observing 1 or 2 minor alleles is compared with the probability of observing none:

graphic file with name amjepidkwp398fx7_ht.jpg

The likelihood ratio test constrainsβEG1(6)≡0 and βEG2(6)≡0 under the null hypothesis and has 2 df. Like EGG, this test allows independent genotype risks and is referred to as a genotypic test.

Power calculations

We calculated the number of cases required to achieve 80% statistical power, as well as the power for 1,000 case participants, to detect G-E interaction at the disease susceptibility locus G. We considered a range of minor allele frequencies (pA = 0.1, 0.25), genetic main effects (bG = 0, log(1.3)), environmental main effects (bE = 0, log(1.3)), and G-E interaction effects (bEG = 0, log(1.1), log(1.2), log(1.3)). To reduce the dimensionality of the model parameter space, we assumed a common value δ for the dominance deviance associated with main and interaction genetic effects: δ = bGD = bEGD. When δ = 0, the model is additive and when δ > 0 (respectively, δ < 0), dominant (respectively, recessive) main and interaction genetic effects are present in the underlying model. We considered a range of realistic disease models by allowing the dominance deviance to vary within the range ± min(bG,bEG) to ensure no heterozygote advantage: If δ > bEG, for example, then the penetrance associated with the heterozygotic genotype may be greater than the homozygotic genotype giving a heterozygote advantage.

We also performed similar calculations assuming that the interaction was observed at a nearby noncausal locus H with minor allele frequency pB = 0.2 and the maximum value of linkage disequilibrium r2 between alleles at the disease susceptibility locus G. There are algebraic restrictions on the range of r2 (10), and the maximum value of linkage disequilibrium between alleles at G and H when pB = 0.2 depends on the value of pA: When pA = 0.1, maximum r2 = 0.44; when pA = 0.25, maximum r2 = 0.75; and when pA = 0.4, maximum r2 = 0.37. All tests assumed a 2-sided alternative hypothesis and a significance level of 0.05.

One of the underlying assumptions of the tests of interaction with E as the response, EGT, EGD, and EGG, is that E is normally distributed. In order to assess the effect of nonnormality of E, we calculated power when only cases with extreme values of E, |E| > 1, were included and when 10% of cases were allowed to have extreme values of E with 9 times the standard deviation of the remaining 90% of cases.

Example: study of G-E interaction for lung cancer

In a case-control study, Zhou et al. (11) found an interaction between polymorphisms in the excision repair cross-complementing group 2 (ERCC2) gene and cumulative cigarette smoking exposure in lung cancer. They studied 1,092 cases and 1,240 controls selected from the same Caucasian population. Using parameter values estimated from their analyses and a true penetrance model of the form given in equation 1, we calculated the number of samples that would be required to achieve 80% power in a standard case-control test of G-E interaction, as well as in our 6 case-only tests.

RESULTS

General design comparisons

The number of cases required to achieve 80% power in each of the dominant tests, GDE and EGD, hardly differs for any set of parameter values studied (Tables 1, 2, and 3). Similarly, approximately equivalent numbers of cases are also required to achieve the same power in the genotypic tests, EGG and GGE, with the largest differences being seen at the largest main interaction effect sizes. The trend tests EGT and GTE require similar numbers of cases to achieve the same power under a strictly additive model but show greater differences in the presence of any dominance deviance.

Table 1.

Sampling Units for Case-Only Tests of Gene-Environment Interaction at a Disease Susceptibility Locus G Under Zero Deviation from Additivitya

pA ebG ebE ebEG Additive δ = 0
EGT EGD EGG GDE GTE GGE
0.1 1 1 1.1 4,793 1.1 1.2 1.1 1 1.2
0.1 1 1 1.2 1,292 1.1 1.2 1.1 1 1.2
0.1 1 1 1.3 611 1.1 1.2 1.1 1 1.2
0.1 1 1.3 1.1 4,699 1.1 1.2 1.1 1 1.2
0.1 1 1.3 1.2 1,244 1.1 1.2 1.1 1 1.2
0.1 1 1.3 1.3 579 1.1 1.2 1.1 1 1.2
0.1 1.3 1 1.1 3,912 1.1 1.2 1.1 1 1.2
0.1 1.3 1 1.2 1,054 1.1 1.2 1.1 1 1.2
0.1 1.3 1 1.3 499 1.1 1.2 1.1 1 1.2
0.1 1.3 1.3 1.1 3,841 1.1 1.2 1.1 1 1.2
0.1 1.3 1.3 1.2 1,018 1.1 1.2 1.1 1 1.2
0.1 1.3 1.3 1.3 474 1.1 1.2 1.1 1 1.2
0.25 1 1 1.1 2,301 1.2 1.2 1.2 1 1.2
0.25 1 1 1.2 621 1.2 1.2 1.2 1 1.2
0.25 1 1 1.3 294 1.2 1.2 1.2 1 1.2
0.25 1 1.3 1.1 2,273 1.2 1.2 1.2 1 1.2
0.25 1 1.3 1.2 606 1.2 1.2 1.2 1 1.2
0.25 1 1.3 1.3 285 1.2 1.2 1.2 1 1.2
0.25 1.3 1 1.1 2,046 1.2 1.2 1.2 1 1.2
0.25 1.3 1 1.2 553 1.2 1.2 1.2 1 1.2
0.25 1.3 1 1.3 262 1.2 1.2 1.2 1 1.2
0.25 1.3 1.3 1.1 2,027 1.2 1.2 1.2 1 1.2
0.25 1.3 1.3 1.2 543 1.2 1.2 1.2 1 1.2
0.25 1.3 1.3 1.3 256 1.3 1.2 1.3 1 1.2
0.4 1 1 1.1 1,800 1.3 1.2 1.3 1 1.2
0.4 1 1 1.2 487 1.3 1.2 1.4 1 1.2
0.4 1 1 1.3 232 1.4 1.2 1.4 1 1.2
0.4 1 1.3 1.1 1,792 1.3 1.2 1.3 1 1.2
0.4 1 1.3 1.2 483 1.4 1.2 1.4 1 1.2
0.4 1 1.3 1.3 230 1.4 1.2 1.4 1 1.2
0.4 1.3 1 1.1 1,739 1.4 1.2 1.4 1 1.2
0.4 1.3 1 1.2 472 1.5 1.2 1.5 1 1.2
0.4 1.3 1 1.3 226 1.5 1.2 1.5 1 1.2
0.4 1.3 1.3 1.1 1,737 1.4 1.2 1.4 1 1.2
0.4 1.3 1.3 1.2 471 1.5 1.2 1.5 1 1.2
0.4 1.3 1.3 1.3 226 1.5 1.2 1.5 1 1.2
a

Number of sampling units required for case-only test of gene-environment interaction EGT to achieve 80% power for a range of environmental and genetic main effects and gene-environment interaction effects under zero dominance deviance and the ratio of sampling units required in each of the other tests to achieve the same power in comparison with the number required in EGT. Dominance deviance is defined as the deviation from additivity in the main or interaction effects: Here we assume that the dominance deviance in the main effect and the interaction effect are the same. A sampling unit is a case.

Table 2.

Sampling Units for Case-Only Tests of Gene-Environment Interaction at a Disease Susceptibility Locus G Under Positive Deviation from Additivitya

pA ebG ebE ebEG Positive Deviation from Additivity (δ > 0 [δ = log(1.1)])
EGT EGD EGG GDE GTE GGE
0.1 1 1 1.1 1,390 0.9 1.2 0.9 1 1.2
0.1 1 1 1.2 607 1 1.2 1 1 1.2
0.1 1 1 1.3 348 1 1.2 1 1 1.2
0.1 1 1.3 1.1 1,351 0.9 1.2 0.9 1 1.2
0.1 1 1.3 1.2 582 1 1.2 1 1 1.2
0.1 1 1.3 1.3 330 1 1.2 1 1 1.2
0.1 1.3 1 1.1 1,217 0.9 1.2 0.9 0.9 1.1
0.1 1.3 1 1.2 522 1 1.2 1 1 1.2
0.1 1.3 1 1.3 298 1 1.2 1 1 1.2
0.1 1.3 1.3 1.1 1,189 0.9 1.2 0.9 0.9 1.1
0.1 1.3 1.3 1.2 505 1 1.2 1 1 1.2
0.1 1.3 1.3 1.3 284 1 1.2 1 1 1.2
0.25 1 1 1.1 1,019 0.9 1.1 0.9 0.9 1.1
0.25 1 1 1.2 396 1 1.2 1 0.9 1.2
0.25 1 1 1.3 217 1 1.2 1 1 1.2
0.25 1 1.3 1.1 1,019 0.9 1.1 0.9 0.9 1
0.25 1 1.3 1.2 395 1 1.1 1 0.9 1.2
0.25 1 1.3 1.3 215 1 1.2 1 1 1.2
0.25 1.3 1 1.1 1,065 0.8 1 0.8 0.9 1
0.25 1.3 1 1.2 393 0.9 1.1 1 0.9 1.1
0.25 1.3 1 1.3 211 1 1.2 1 1 1.2
0.25 1.3 1.3 1.1 1,076 0.8 1 0.8 0.9 1
0.25 1.3 1.3 1.2 396 0.9 1.1 1 0.9 1.1
0.25 1.3 1.3 1.3 212 1 1.2 1 1 1.2
0.4 1 1 1.1 1,305 0.7 0.9 0.7 0.9 0.9
0.4 1 1 1.2 425 0.9 1.1 0.9 1 1.1
0.4 1 1 1.3 218 1 1.2 1.1 1 1.2
0.4 1 1.3 1.1 1,338 0.7 0.9 0.7 0.9 0.9
0.4 1 1.3 1.2 437 0.9 1.1 0.9 1 1.1
0.4 1 1.3 1.3 224 1 1.2 1.1 1 1.2
0.4 1.3 1 1.1 1,607 0.7 0.8 0.7 1 0.8
0.4 1.3 1 1.2 473 0.9 1.1 0.9 1 1.1
0.4 1.3 1 1.3 234 1.1 1.2 1.1 1 1.2
0.4 1.3 1.3 1.1 1,665 0.7 0.8 0.7 1 0.8
0.4 1.3 1.3 1.2 491 0.9 1.1 0.9 1 1.1
0.4 1.3 1.3 1.3 244 1.1 1.1 1.1 1 1.2
a

Number of sampling units required for case-only test of gene-environment interaction EGT to achieve 80% power for a range of environmental and genetic main effects and gene-environment interaction effects under positive dominance deviance and the ratio of sampling units required in each of the other tests to achieve the same power in comparison with the number required in EGT. Dominance deviance is defined as the deviation from additivity in the main or interaction effects: Here we assume that the dominance deviance in the main effect and the interaction effect are the same. A sampling unit is a case.

Table 3.

Sampling Units for Case-Only Tests of Gene-Environment Interaction at a Disease Susceptibility Locus G Under Negative Deviation from Additivitya

pA ebG ebE ebEG Negative Deviation from Additivity (δ < 0 [δ = − log(1.1)])
EGT EGD EGG GDE GTE GGE
0.1 1 1 1.1 106,376 4.1 0.2 4.1 2.8 0.2
0.1 1 1 1.2 3,835 1.3 1.1 1.3 1.2 1.1
0.1 1 1 1.3 1,233 1.2 1.2 1.2 1.1 1.2
0.1 1 1.3 1.1 97,376 4.1 0.3 4 2.8 0.3
0.1 1 1.3 1.2 3,638 1.3 1.1 1.3 1.2 1.1
0.1 1 1.3 1.3 1,154 1.2 1.2 1.2 1.1 1.2
0.1 1.3 1 1.1 54,648 4.1 0.3 4.1 2.6 0.3
0.1 1.3 1 1.2 2,804 1.4 1.1 1.4 1.2 1.1
0.1 1.3 1 1.3 936 1.3 1.2 1.3 1.2 1.2
0.1 1.3 1.3 1.1 50,221 4.1 0.3 4.1 2.6 0.3
0.1 1.3 1.3 1.2 2,651 1.4 1.1 1.4 1.3 1
0.1 1.3 1.3 1.3 874 1.3 1.2 1.3 1.2 1.2
0.25 1 1 1.1 8,318 4.3 0.5 4.3 1.7 0.5
0.25 1 1 1.2 1,068 1.7 1 1.7 1.2 1
0.25 1 1 1.3 412 1.5 1.1 1.5 1.2 1.2
0.25 1 1.3 1.1 7,758 4.3 0.5 4.3 1.7 0.5
0.25 1 1.3 1.2 1,005 1.8 1 1.8 1.2 1
0.25 1 1.3 1.3 385 1.5 1.2 1.5 1.2 1.2
0.25 1.3 1 1.1 5,099 4.5 0.6 4.5 1.5 0.6
0.25 1.3 1 1.2 817 1.9 1.1 1.9 1.2 1
0.25 1.3 1 1.3 331 1.6 1.1 1.6 1.1 1.2
0.25 1.3 1.3 1.1 4,791 4.4 0.6 4.4 1.5 0.6
0.25 1.3 1.3 1.2 771 1.9 1.1 1.9 1.2 1.1
0.25 1.3 1.3 1.3 311 1.7 1.2 1.7 1.1 1.2
0.4 1 1 1.1 2,607 4.8 0.7 4.8 1.2 0.7
0.4 1 1 1.2 563 2.2 1.1 2.2 1.1 1.1
0.4 1 1 1.3 248 1.9 1.2 1.9 1.1 1.2
0.4 1 1.3 1.1 2,482 4.7 0.7 4.7 1.2 0.7
0.4 1 1.3 1.2 537 2.2 1.1 2.2 1.1 1.1
0.4 1 1.3 1.3 237 1.9 1.2 1.9 1.1 1.2
0.4 1.3 1 1.1 1,894 5.1 0.8 5.1 1 0.8
0.4 1.3 1 1.2 475 2.5 1.1 2.5 1 1.1
0.4 1.3 1 1.3 220 2.1 1.2 2.1 1 1.2
0.4 1.3 1.3 1.1 1,821 5 0.8 5 1 0.8
0.4 1.3 1.3 1.2 457 2.5 1.1 2.5 1 1.1
0.4 1.3 1.3 1.3 212 2.1 1.2 2.1 1 1.2
a

Number of sampling units required for case-only test of gene-environment interaction EGT to achieve 80% power for a range of environmental and genetic main effects and gene-environment interaction effects under negative dominance deviance and the ratio of sampling units required in each of the other tests to achieve the same power in comparison with the number required in EGT. Dominance deviance is defined as the deviation from additivity in the main or interaction effects: Here we assume that the dominance deviance in the main effect and the interaction effect are the same. A sampling unit is a case.

Figure 1 shows the power of the tests to detect G-E interaction with 1,000 case participants as a function of the dominance deviance when observing genotypes at a disease susceptibility locus. Under the null hypothesis, when there is no G-E interaction present in the underlying model (bEG = 0 and δ = 0), none of the tests show any increase in the false-positive rate; that is, they all have power equal to the type I error rate. In the presence of no dominance deviance or a slight dominance deviance, δ≈0, the trend tests are the most powerful. For example, Figure 1, part D, shows that the trend tests are most powerful for dominance deviance in the range −0.06 < δ < 0.05. While the strictly additive trend test EGT appears to be marginally preferable compared with the proportional odds trend test GTE, there is little information with which to choose between the 2 tests at low values of dominance deviance.

Figure 1.

Figure 1.

Statistical power to detect gene-environment interaction for 1,000 case participants as a function of the dominance deviance δ for tests EGT (solid lines), EGD (dashed lines), EGG (dotted lines), and GTE (dashed-dotted lines) at a disease susceptibility locus G for various values of the high-risk allele frequency pA and interaction effect size bEG: A) pA = 0.1, bEG = log(1.1); B) pA = 0.1, bEG = log(1.2); C) pA = 0.1, bEG = log(1.3); D) pA = 0.25, bEG = log(1.1); E) pA = 0.25, bEG = log(1.2); F) pA = 0.25, bEG = log(1.3). Dominance deviance is defined as the deviation from additivity in the main or interaction effects: Here we assume that the dominance deviance in the main effect and the interaction effect are the same. Results are shown for the situation where there is no main environmental effect, bE = 0, and the main genetic effect is bG = log(1.3); variation in the sizes of main environmental or genetic effects resulted in only minor variations in power and not did not alter conclusions regarding the relative efficacies of the tests considered. The environmental exposure variable was normally distributed and standardized.

For larger positive dominance deviance, δ >> 0, indicating large dominant effects, the dominant tests are, as expected, most powerful. However, for any negative dominance deviance, δ < 0, suggesting a tendency towards recessive effects at the genetic locus, the dominant tests perform poorly in comparison with other tests. The genotypic tests are only optimal in the presence of large absolute values of δ: When large and dominant (δ >> 0), they equal the dominant tests, and when large and recessive (δ << 0), they outshine all of the other tests.

But how likely is a large dominance deviance? When testing at a noncausal locus with alleles that are not in complete linkage disequilibrium with alleles at the causal locus (r2 < 1), common in genome-wide association studies, the observed effects are attenuated; in particular, the apparent dominance deviance at the noncausal locus is reduced in comparison with that observed at the causal locus (see Web Appendix (http://aje.oxfordjournals.org/) for discussion). Thus, the chance of a large dominance deviance is even less when testing at a noncausal locus, suggesting that use of genotypic tests must be carefully considered and that the comparison between tests in the presence of low dominance deviance is of the greatest practical importance. Figure 2 shows the power of the tests to detect G-E interaction with 1,000 case participants as a function of the dominance deviance when observing genotypes at a noncausal locus in linkage disequilibrium with the disease susceptibility locus described in Figure 1. While power, in general, is lower when testing at a noncausal locus, a comparison of Figures 1 and 2 indicates that the same pattern of test efficacy according to variation in δ holds when testing at a noncausal locus as when testing at the disease susceptibility locus, except that the range of values of δ for which the trend tests are optimal is wider.

Figure 2.

Figure 2.

Statistical power to detect gene-environment interaction for 1,000 case participants as a function of the dominance deviance δ for tests EGT (solid lines), EGD (dashed lines), EGG (dotted lines), and GTE (dashed-dotted lines) at a noncausal locus H with minor allele frequency pB = 0.2 for various values of the minor allele frequency pA and interaction effect size bEG at the disease susceptibility locus G: A) pA = 0.1, bEG = log(1.1); B) pA = 0.1, bEG = log(1.2); C) pA = 0.1, bEG = log(1.3); D) pA = 0.25, bEG = log(1.1); E) pA = 0.25, bEG = log(1.2); F) pA = 0.25, bEG = log(1.3). The alleles at H are in maximum linkage disequilibrium (r2) with the alleles at G: when pA = 0.1, r2 = 0.44; when pA = 0.25, r2 = 0.75; and when pA = 0.4, r2 = 0.37. Dominance deviance is defined as the deviation from additivity in the main or interaction effects: Here we assume that the dominance deviance in the main effect and the interaction effect are the same. Results are shown for the situation where there is no main environmental effect, bE = 0, and the main genetic effect is bG = log(1.3); variation in the sizes of main environmental or genetic effects resulted in only minor variations in power and not did not alter conclusions regarding the relative efficacies of the tests considered. The environmental exposure variable was normally distributed and standardized.

When cases were selectively sampled so that only those with extreme values of E, |E| > 1, were included, the power of all tests was increased, but there were no changes in conclusions regarding relative test efficacies (results not shown). This increase in power under selective sampling was previously observed by Boks et al. (12). Furthermore, under this modest nonnormality, there was no increase in the false-positive error rate in any of the tests when no G-E interaction was present in the underlying model. When more extreme departure from normality was allowed, so that 10% of cases were allowed to have environmental exposure values with standard deviations 9 times that of the remaining 90% of cases, the type I error rate for the tests with E as the response variable did increase, but only marginally—for example, from 0.05 to an average of 0.0507 in EGT. Results suggest that the tests EGT, EGD, and EGG, where E is assumed to be a normally distributed response, are robust to this type of nonnormality.

Example: study of G-E interaction for lung cancer

Zhou et al. (11) considered a standard case-control test of G-E interaction between the ERCC2 polymorphism Asp312Asn and the square root of pack-years of smoking which adjusted for age, sex, smoking status, time since smoking cessation (years), and the interaction between smoking status and the square root of pack-years. The frequency of the Asp allele (Asp312Asn) was 0.3, the square root of pack-years was approximately normally distributed with mean and standard deviation 3, and the adjusted odds ratios for lung cancer risk in persons with zero pack-years were 1.5 (95% confidence interval: 1.0, 2.3) for the Asp/Asn genotype and 3.4 (95% confidence interval: 1.9, 6.0) for the Asn/Asn genotype (when each was compared with the Asp/Asp genotype). These risks decreased by exp(−0.07√k) for the heterozygous Asp/Asn genotype and by exp(−0.17√k) for the homozygous Asn/Asn genotype for k pack-years. The decrease in risk was not significant for the homozygous genotype and was only significant up to approximately 30 pack-years for the heterozygous genotype. Further analysis of Zhou et al.’s results suggests that the unadjusted odds ratio for lung cancer risk in persons with the Asp/Asp genotype associated with k pack-years of smoking was in the approximate range exp(0.3√k) to exp(0.4√k).

Assuming these parameter values, Table 4 compares the numbers of samples required to achieve 80% power in a standard case-control test of G-E interaction, as well as in our 6 case-only tests. Our results are unadjusted for any other covariates. The case-only tests require substantially fewer sampling units than the case-control tests. The most powerful test is EGT, closely followed by GTE. These tests both search for a trend in the interaction with increasing number of risk alleles. The model virtually stipulates zero dominance deviance associated with the interaction effect (bEGD = 0.01), indicating that the extra degree of freedom required by using EGG and GGE rather than EGT or GTE is not warranted. However, the estimated dominance deviance for the main genetic effect is negative (bGD = − 0.22), suggesting a slight recessive effect and making it much more difficult for the tests imposing a dominant model on the genotype, EGD and GDE, to detect effects. Notice that the marginal effect size of pack-years on lung cancer risk does not alter conclusions regarding the relative test efficacies.

Table 4.

Numbers of Sampling Units Required to Achieve 80% Power to Detect a Gene-Environment Interaction Between the ERCC2 Asp312Asn Polymorphism and Pack-Years of Smokinga

bE Case-Control Test Case-Only Tests
EGT EGD EGG GDE GTE GGE
0.25 1,059 238 325 291 329 252 295
0.30 1,151 244 331 298 335 259 303
0.35 1,271 252 340 308 344 268 312
0.40 1,424 264 354 321 358 281 326
0.45 1,614 282 380 343 384 303 348
0.60 1,847 314 428 381 432 340 387

Abbreviation: ERCC2, excision repair cross-complementing group 2.

a

The true penetrance model is of the form given in equation 1 (see text), and effect sizes are bG = 0.61, bGD = − 0.22, bEG = − 0.09, and bEGD = 0.01, as estimated by Zhou et al. (11). Results are shown for values of bE as indicated for a 2-df case-control test of G-E interaction and our 6 case-only tests of G-E interaction. A sampling unit is a pair for the case-control test and a case for the case-only tests.

DISCUSSION

We compared the power and sample-size requirements of 6 case-only tests for detecting interaction between a genetic factor and a continuous environmental exposure in association with disease. Under the null hypothesis of no G-E interaction, none of the tests showed any increase in the false-positive rate, while under the complementary hypothesis, all of the tests had power greater than the type I error rate. We conclude that these tests are valid tests of G-E interaction. More work will be required to interpret the estimated coefficients from these case-only regressions in terms of risk ratios and other measures of effect size.

In order to simplify analyses, a commonly used approach in case-only tests of G-E interaction is to assume either an autosomal dominant or recessive inheritance pattern at the genetic locus (8, 9, 13, 14). Such models are represented in the tests GDE and EGD. The novel tests based on multinomial and proportional odds models that we considered here allow for alternative patterns of genetic inheritance.

For normally distributed environmental exposures, we have shown that EGD, a test using linear regression with the environmental variable as the response variable, requires similar numbers of cases as GDE, a test using logistic regression with the genetic variable as the response variable, in order to achieve the same power. Similarly, EGG and GGE also require similar numbers of cases to achieve equivalent power. This is as expected, since these pairs of tests each model the genetic variable in the same manner; the genetic variant is modeled as dominant in GDE and EGD and as genotypic in EGG and GGE. Whereas EGT imposes an additive model on the genetic variable, GTE only imposes a trend in the form of its proportional odds assumption. As such, these 2 “trend” tests show greater differences between the numbers of cases required to achieve the same power in any given scenario.

EGT, EGD, and EGG depend on an assumption that the environmental exposure is normally distributed, and while we have shown that departure from normality need not have a significant impact on power outcomes, GDE, GTE, and GGE are clearly advantageous, since they do not require this assumption and are robust to non-normally distributed environmental exposures (15). Careful consideration of the scale of measurement is important, because changing the scale, such as transforming a quantitative environmental variable, can remove or add G-E interaction.

We have shown that the dominance deviance is key to determining the optimal test. For zero (suggesting strictly additive genetic effects) or modest (suggesting slight recessive or dominant genetic effects) dominance deviance, the trend tests, EGT and GTE, are optimal. For large positive dominance deviance (suggesting a large dominant genetic effect), the dominant tests EGD and GDE are strictly optimal, closely followed by the genotypic tests EGG and GGE. However, if the dominance deviance is actually negative (suggesting a recessive genetic effect), the choice of a dominant test can have severe implications for power, and in the presence of a large recessive genetic effect, only a genotypic test will be suitable. Hence, if the dominance deviance is large but of unknown direction, the genotypic test is the safest choice. However, we showed that when testing at a noncausal locus, the dominance deviance at the observed locus is attenuated. Thus, for the genotypic tests to outperform the trend tests, dominance deviance at the disease susceptibility locus must be very large when testing for G-E interaction at a noncausal locus with alleles in less-than-perfect linkage disequilibrium with alleles at the causal locus. We conclude that the trend test is the most robust choice for large-scale association scans where the true G-E interaction model is unknown. The common practice of assuming a dominant pattern of inheritance can cause serious losses of power in the presence of any recessive, or modest dominant, effects.

The exemplary data method used to calculate power relies on asymptotic distributions, and the results presented here are valid for the large sample sizes typically seen in current association studies. In critical applications, simulation studies would be required to determine precise power estimates (16).

The case-only tests presented here are only valid under the assumption of independence between gene and environmental variables; any dependence between these variables will be reflected in the case series even in the absence of G-E interaction and will lead to biased estimates of interaction. Independence is therefore required to protect against false-positive findings. Empirical studies have suggested that violation of gene-environment independence, when it occurs, is usually modest (17), but more work will be required to categorize the impact of departure from gene-environment independence on the relative efficacies of the tests considered here.

Supplementary Material

[Web Material]
kwp398_index.html (699B, html)

Acknowledgments

Author affiliation: Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom (Geraldine M. Clarke, Andrew P. Morris).

This work was supported by the Wellcome Trust.

Conflict of interest: none declared.

Glossary

Abbreviations

ERCC2

excision repair cross-complementing group 2

G-E

gene-environment

References

  • 1.Khoury MJ, Flanders WD. Nontraditional epidemiologic approaches in the analysis of gene-environment interaction: case-control studies with no controls! Am J Epidemiol. 1996;144(3):207–213. doi: 10.1093/oxfordjournals.aje.a008915. [DOI] [PubMed] [Google Scholar]
  • 2.Piegorsch WW, Weinberg CR, Taylor JA. Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. Stat Med. 1994;13(2):153–162. doi: 10.1002/sim.4780130206. [DOI] [PubMed] [Google Scholar]
  • 3.Begg CB, Zhang ZF. Statistical analysis of molecular epidemiology studies employing case-series. Cancer Epidemiol Biomarkers Prev. 1994;3(2):173–175. [PubMed] [Google Scholar]
  • 4.Umbach DM, Weinberg CR. Designing and analysing case-control studies to exploit independence of genotype and exposure. Stat Med. 1997;16(15):1731–1743. doi: 10.1002/(sici)1097-0258(19970815)16:15<1731::aid-sim595>3.0.co;2-s. [DOI] [PubMed] [Google Scholar]
  • 5.Yang Q, Khoury MJ, Flanders WD. Sample size requirements in case-only designs to detect gene-environment interaction. Am J Epidemiol. 1997;146(9):713–720. doi: 10.1093/oxfordjournals.aje.a009346. [DOI] [PubMed] [Google Scholar]
  • 6.Albert PS, Ratnasinghe D, Tangrea J, et al. Limitations of the case-only design for identifying gene-environment interactions. Am J Epidemiol. 2001;154(8):687–693. doi: 10.1093/aje/154.8.687. [DOI] [PubMed] [Google Scholar]
  • 7.Armstrong BG. Fixed factors that modify the effects of time-varying factors: applying the case-only approach. Epidemiology. 2003;14(4):467–472. doi: 10.1097/01.ede.0000071408.39011.99. [DOI] [PubMed] [Google Scholar]
  • 8.Cheng KF. A maximum likelihood method for studying gene-environment interactions under conditional independence of genotype and exposure. Stat Med. 2006;25(18):3093–3109. doi: 10.1002/sim.2506. [DOI] [PubMed] [Google Scholar]
  • 9.Kraft P, Yen YC, Stram DO, et al. Exploiting gene-environment interaction to detect genetic associations. Hum Hered. 2007;63(2):111–119. doi: 10.1159/000099183. [DOI] [PubMed] [Google Scholar]
  • 10.Weir BS. Inferences about linkage disequilibrium. Biometrics. 1979;35(1):235–254. [PubMed] [Google Scholar]
  • 11.Zhou W, Liu G, Miller DP, et al. Gene-environment interaction for the ERCC2 polymorphisms and cumulative cigarette smoking exposure in lung cancer. Cancer Res. 2002;62(5):1377–1381. [PubMed] [Google Scholar]
  • 12.Boks MP, Schipper M, Schubart CD, et al. Investigating gene environment interaction in complex diseases: increasing power by selective sampling for environmental exposure. Int J Epidemiol. 2007;36(6):1363–1369. doi: 10.1093/ije/dym215. [DOI] [PubMed] [Google Scholar]
  • 13.Stern MC, Johnson LR, Bell DA, et al. XPD codon 751 polymorphism, metabolism genes, smoking, and bladder cancer risk. Cancer Epidemiol Biomarkers Prev. 2002;11(10):1004–1011. [PubMed] [Google Scholar]
  • 14.Yang Q, Khoury MJ, Sun F, et al. Case-only design to measure gene-gene interaction. Epidemiology. 1999;10(2):167–170. [PubMed] [Google Scholar]
  • 15.O'Brien PC. Comparing 2 samples: extensions of the t, rank-sum, and log-rank tests. J Am Stat Assoc. 1988;83(401):52–61. [Google Scholar]
  • 16.Brown BW, Lovato J, Russell K. Asymptotic power calculations: description, examples, computer code. Stat Med. 1999;18(22):3137–3151. doi: 10.1002/(sici)1097-0258(19991130)18:22<3137::aid-sim239>3.0.co;2-o. [DOI] [PubMed] [Google Scholar]
  • 17.Liu X, Fallin MD, Kao WH. Genetic dissection methods: designs used for tests of gene-environment interaction. Curr Opin Genet Dev. 2004;14(3):241–245. doi: 10.1016/j.gde.2004.04.011. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

[Web Material]
kwp398_index.html (699B, html)
kwp398_1.pdf (70.1KB, pdf)

Articles from American Journal of Epidemiology are provided here courtesy of Oxford University Press

RESOURCES