Abstract
Detection of gene-gene interaction has become increasingly popular over the past decade in genome wide association studies (GWAS). Besides traditional logistic regression analysis for detecting interactions between two markers, new methods have been developed in recent years such as comparing linkage disequilibrium (LD) in case and control groups. All these methods form the building blocks of most screening strategies for disease susceptibility loci in GWAS. In this paper, we are interested in comparing the competing methods and providing practical guidelines for selecting appropriate testing methods for interaction in GWAS. We first review a series of existing statistical methods to detect interactions, and then examine different definitions of interactions to gain insight into the theoretical relationship between the existing testing methods. Lastly, we perform extensive simulations to compare powers of various methods to detect either interaction between two markers at two unlinked loci or the overall association allowing for both interaction and main effects. This investigation reveals informative characteristics of various methods that are helpful to GWAS investigators.
Keywords: gene-gene interaction, GWAS, linkage disequilibrium, penetrance, logistic regression
1 Introduction
Over the past two decades, many genome wide association studies (GWAS) have been conducted to understand the genetic architecture of complex diseases. These studies have revealed a repertoire of disease susceptibility loci (e.g., Kraft and Cox [2008]; Seng and Seng [2008]). However, these loci individually have small or, at best, moderate effect sizes, and account for only a small fraction of the genetic variation in complex diseases. For example, the seven susceptibility loci identified through GWAS together are estimated to explain 5.9% of the familial breast cancer risk [Seng and Seng, 2008]. For more reviews of the published GWAS, see Donnelly [2008]; Goldstein [2009]; Hardy and Singleton [2009]; Hirschhorn [2009]; Hirschhorn and Daly [2005]; Kraft and Hunter [2009]; Ku et al. [2010]; Manolio [2008, 2009]; McCarthy et al. [2008]; Teo [2008]; Wang et al. [2005]; Ziegler et al. [2008].
Where should we discover the genetic factors explaining the remaining variation? The polygenic nature of complex diseases suggests that the disease risk has to be a synergistic result of multiple genes in a complex biological network. Traditional GWAS tested one marker at a time, and thus ignored the complex genomic context [Moore and Williams, 2009]. Given the common-disease-common-variant hypothesis, which is supported by both evolution theory [Chakravarti, 1999; Reich and Lander, 2001] and studies, part of the unexplained genetic variation could very well be attributed to the interactions among genes. Endeavors aimed to detect and assess gene × gene (G × G) interactions have surged up in both the statistical and biomedical research communities recently, and a variety of methods have been developed for detecting genetic interactions [Kooperberg et al., 2009; Moore et al., 2010; Moore and Williams, 2009]. Cordell [2009] provides informative review for a handful of methods on G × G interaction. Many of these methods make specific statistical/genetic model assumptions, and are derived from different definitions of G × G interactions. Hence, they test different hypotheses. To gain better understanding, we study the theoretical relationship between various definitions of interaction between two unlinked loci regarding a binary outcome. We clarify the null spaces of several tests for interaction effects, and derive the conditions under which these tests share the common null space. The studied tests include two log odds ratio tests Tlogit, TOR, three linkage disequilibrium (LD) based tests TLD [Zhao et al., 2006], TLD* ( in Yang C. et al. [2009]), TCLD [Wu et al., 2008], and three case-only statistics TPearson, TLDc, TORc. We then conduct extensive simulation studies with settings mimicking the real scenarios with or without main effects to assess the performance of these competing methods. Since the ultimate goal of GWAS is to detect the association between diseases and genetic markers, we also evaluate tests for detecting association signals of a pair of loci allowing for both interaction and main effects. For this purpose, in addition to the tests mentioned above, we also include in our investigation three overall association tests — Tlogisticall, Tχ2, and the logistic kernel machine method Tkernel [Liu et al., 2008]—, as well as two correlation based tests — TPerlman[Larntz and Perlman, 1988] and TKullback[Kullback, 1967].
The remainder of this article is organized as follows. In section 2, we briefly review the existing statistical methods for detecting genetic interactions between two single markers and between two sets of markers, as well as methods for genome-wide screening for genetic interactions. In section 3, we evaluate the powers of tests for detecting interactions between two unlinked single markers and study their theoretical properties, since these tests serve as the building blocks of most interaction testing methods for GWAS. In section 4, we extend our research to a range of tests on association between a binary outcome and a pair of unlinked markers allowing for both interaction and main effects. Through these investigations, we will provide a practical guideline for selecting appropriate testing methods in GWAS, which is summarized in section 5.
2 Methods for testing G × G interactions in GWAS
Due to the lack of established biochemical mechanism behind G × G interaction, the form in which interaction is defined is inexplicable. This necessitates a review on the various interaction definitions behind existing tests. The concept of interaction was first introduced by Bateson [1909], in which epistasis was defined as a masking effect from the allele at one locus to the allele at another locus. Later a more quantitative definition was proposed by Fisher [1918, p.404] as the deviation from addition of superimposed effects from different factors. This makes it possible to define interaction through a mathmatical model. Since then, testing an interaction has been translated to testing a statistical hypothesis concerning a specific quantity in a model that describes the relation between a phenotype and a genotype. Over the last decade, as GWAS have become a powerful tool for genetic studies of complex diseases, a variety of new methods have been developed for detecting genetic interactions. Particularly, some special features of GWAS data can be used to increase the statistical testing power and have given rise to numerous statistical methods. In this review, we collect and organize them in three categories and synopsize them in Supplemental Table 1-3. These lists are not intended to be fully inclusive but instead to serve as a guide to understanding the features of these methods.
Supplemental Table 1 summarizes the statistical methods for detecting interactions between two single markers. They are the most basic tools for detecting G × G interaction. In its fundamental form, the logistic regression method directly relates disease risks to two genetic markers in a prospective fashion. Whereas the other methods, including LD-based and Chi-square tests, retrospectively detect genetic interactions by comparing “association” between two markers in case and control populations. Although the two categories of methods appear to arise from different motivations, they are closely related analytically, as will be revealed in section 3.
Supplemental Table 2 briefly describes the statistical methods for testing interactions between multiple markers. There are two main types of approaches. The first one includes multi-factor dimensionality reduction (MDR), the canonical correlation based test and the covariance based test that all focus on characterizing the joint, or some aspects of the joint distributions of the genotype data. The second group is principal-component analysis and least absolute shrinkage and selection operator (PCA-LASSO) approach and the latent variable logistic regression that both detect interaction via regression models coupled with dimension reduction strategies.
In order to screen for genetic interactions genome-widely among hundreds of thousands of genetic markers, methodologies that are both statistically powerful and computationally feasible are in great demand. Numerous methods have been published in the literature to tackle the challenges, and we summarize some major ones in Supplemental Table 3. These methods are primarily based on stochastic screens and searches. About one third of them pre-screen genetic markers based on, for example, their main effects to reduce the dimensionality (number of markers) of the problem (e.g., method 4, 7, 10 and 11 in Table 3), while some others genome-widely search for candidate genetic interactions. Some methods aggregates results from a collection of models/searches to reap promising interactions (e.g., method 2, 3, 7, 8, 10 and 11). The number of single-nucleotide polymorphisms (SNPs) of the real data sets described in the papers vary from a dozen to hundreds of thousands of genetic markers. While the majority of these methods directly screen for genetic interactions, some also detect the overall association in the presence of genetic interactions (e.g., method 2, 3, 4, 5, and 10). Genome-wide search for genetic interactions is bound to involve a large number of tests, and thus prohibitive computation is inevitable. What makes such testing truly feasible are recent advances in computation and strategic algorithm design. A partial list of such methods includes methods 9, 12 and 13 in Supplemental Table 3.
Table 3. Penetrance table for the general disease model.
| Genotype | AA | Aa | aa |
|---|---|---|---|
| BB | α | αλ1 | αλ2 |
| Bb | αλ3 | αλ1λ3γ1 | αλ2λ3γ2 |
| bb | αλ4 | αλ1λ4γ3 | αλ2λ4γ4 |
Since testing the interaction between two single markers forms the basis for virtually all the methods in Supplemental Table 2 and 3, in what follows, we will focus on investigating the methods in Supplemental Table 1 as well as their variants.
3 Tests on gene-gene interaction
In this section, a collection of statistical methods presented in Supplemental Table 1 and their variants with slight modifications will be investigated. These methods include two logit-based tests, three LD-based tests [Wu et al., 2008; Yang Y. et al., 2009; Zhao et al., 2006], and three case-only tests. They are designed to test for interaction between two single markers. We first study various definitions of interactions and the relations between them. Then we clarify the definition of interaction assumed by each test. Next we elaborate the scenarios under which these tests share the same null space, and lastly compare the tests in these settings. The investigation proceeds first with the dominant (recessive) disease model (section 3.1) and then extends to the general disease model (section 3.2).
3.1 Dominant (or recessive) disease model (2×2)
3.1.1 Notation and definition
When the disease model is dominant or recessive, we can reduce the commonly used nine-genotype model at two loci to the four-genotype model. Without loss of generality, we assume one allele is dominant to the other on both loci. Let G(or H) denote the genotypes AA(or BB) and Aa(or Bb) ; let g(or h) denote the genotype aa(or bb). Then the genotype information of a dataset can be summarized in a frequency table, Table 1, where p ≡ {p00, p01, p10, p11} and q ≡ {q00, q01, q10, q11} are the genotype frequencies for {GH, gH, Gh, gh} in cases and controls. Let p̂ and q̂ be the observed frequencies of the corresponding genotypes. If haplotype at two loci can be inferred from diplotype without uncertainty, the same framework applies as well. We denote the case (control) status by D = 1 (D = 0) and let n1 and n0 be the sample sizes for the case and control groups. We consider three major approaches to model interaction effects: the penetrance based definition, the logistic regression model, and the LD based measurement. We focus on the exploration of these three concepts and how they are inter-related.
Table 1. Notations for genotype frequencies in cases and controls.
| D=1 | D = 0 | ||||
|---|---|---|---|---|---|
|
|
|
||||
| G | g | G | g | ||
| H | P00 | P01 | H | q00 | q01 |
| h | P10 | P11 | h | q10 | q11 |
(a) Penetrance based definition
measures the inter-locus dependence in the penetrance table, whose elements represent the probability of developing a disease given certain genotypes [Cordell, 2002]. If we choose to measure the dependence on a multiplicative scale, we have
According to this definition, interaction is measured by the difference between the inter-locus genotype dependences in case and general populations. If we assume the two loci are unlinked in the general population, i.e., , the above definition examines only the inter-locus dependence in cases:
| (1) |
We can then apply case-only analysis and use Pearson's chi-squared statistic to test for interaction. Note that multiplicativity is only one of many possible scales we can use. For example, Risch [1990] used the deviation from additivity on penetrances as the measure of interaction.
(b) Logit based definition
stems from logistic regression, which is commonly used for testing for interaction between two factors. Let X1 = 1 if G is present and 0 otherwise; let X2 = 1 if H is present and 0 otherwise. Consider the following logistic regression model
| (2) |
We decide whether there is an interaction effect by testing if β3 = 0, where
By Bayes' rule, β3 as a function of penetrances can be replaced by genotype frequencies:
| (3) |
As shown in (3), when detecting interaction using logistic regression, we actually compare the differences of inter-locus dependences between the case and control groups on a log-odds scale and examine if these differences are equal.
(c) LD based definition
uses LD to measure inter-locus dependence, and defines interaction as
| (4) |
where δ1 = p00p11 − p10p01 = p11 − p1+p+1; δ0 = q00q11 − q10q01 = q11 − q1+q+1. [Zhao et al., 2006]. By this definition, detecting interaction is equivalent to testing if the LDs are the same in case and control groups.
3.1.2 Relations between the three definitions
Suppose linkage equilibrium (LE) holds in the general population for the two loci we consider. According to (1), Ipenetrance = 0 implies p00p11 = p10p01. When a disease is rare, we expect that almost no linkage exists between the two loci in the control population: q00q11 = q10q01. As a result, both Ilogit = 0 and ILD = 0 reduce to p00p11 = p10p01. Therefore under the circumstance that the disease is rare and LE holds in the general population, the three definitions of no interaction by (1), (3), and (4) are equivalent.
If we do not assume the disease is rare, these three definitions are still equivalent if there is at most one marker that has a main effect and LE holds in the population.
Proposition 1
Under the assumption that at least one marker has no main effects and LE holds in the population,
See the proof in Supplemental Proof A.
3.1.3 Interaction testing statistics
In this section, we describe two statistical tests for the interaction defined by Ilogit, two tests for the interaction defined by ILD and three tests for the interaction defined by Ipenetrance.
(a) Logit based testing statistics
Deviance statistic Tlogistic is a commonly used statistic for testing the interaction term in logistic regression (2). Specifically let M1 denote the full model (2) and M0 denote the reduced model:
Let LM1 and LM0 be the maximum log likelihoods of the data under models M1 and M0, respectively, then Tlogistic = − 2(LM0 − LM1). Another test statistic is the Wald statistic based on (3): where , and . Both Tlogistic and TOR are asymptotically distributed under the null Ilogit = 0. Note our TOR is the same as Ueki and Cordel [2012]'s improved statistic TAWU−cc under LE (See the supporting information Text S1 of Ueki and Cordel [2012].
(b) LD based testing statistics
According to the definition of interaction in (4),
| (5) |
Yang Y. et al. [2009] proposed another LD related statistic TLD* using a different estimator for the variance of (δ̂1 − δ̂0)2.
| (6) |
In TLD*, the variance is obtained by conditioning on the margins of the genotype frequency table and the null hypothesis of no association, pij = qij for i,j = 0,1. Both TLD and TLD* are asymptotically distributed under the null ILD = 0. In addition, TLD* was shown to be an interaction component partitioned from a two-locus total χ2-statistics [Yang Y. et al., 2009].
(c) Case-only testing statistics
When LE holds in the population, definition of interaction by (1) suggests a case-only analysis. Pearson's chi-squared statistic of independence between two loci readily serves here
| (7) |
Furthermore, TOR and TLD can be easily modified to case-only test statistics: and where and . Note vâr(ÎORc) is always smaller than vâr(ÎOR) in part (a), and thus is more efficient. Also TORc is equivalent to TAWU−co in Ueki and Cordel [2012]. Test statistics TPearson,TLDc,TORc are all asymptotically central distributed under the null hypothesis Ipenetrance = 0.
3.1.4 Simulation Study
In section 3.1.2, we show theoretically that when LE between two loci holds in the general population, if one marker has no main effect or the disease is rare, then the parameter space corresponding to “no interaction” defined by Ipenetrance = 0, Ilogit = 0 and ILD = 0 are the same. Consequently, under these circumstances, the seven test statistics Tlogistic, TOR, TLD, TLD*, TPearson, TORc, and TLDc share the same null space and are comparable. In this section, we will first illustrate this result by examining the type I errors of various statistics. Different forms of different test statistics make them target on different alternative spaces. Whether one is superior to another depends on whether the alternative space of the statistic reflects the underlying mechanism of biological interaction, of which we often do not have a good understanding. Therefore we also conduct extensive simulation studies to compare the performances of these statistics under different settings mimicking various biological interaction patterns. The simulation result provides insights on the choices of interaction tests in practice.
We simulate data according to a penetrance model described in Table 2 where each cell specifies the penetrance of the given genotype. We set the minor allele frequencies for g and h at 0.1. Then we vary γ, the strength of interaction, and λ1, λ2, the main effects of the two markers. When examining the type I errors, we set γ = 1 and change the prevalences through α, the penetrance for genotype GH. When comparing the powers of different tests, we increase γ from 1 to 3. For each simulation setting, we have 5000 replicates. For each dataset, 5000 cases and 5000 controls are generated to calcuate each test statistic, but the case-only statistics TPearson, TORc and TLDc use only the 5000 cases.
Table 2. Penetrance table for the dominant (recessive) model.
| G | g | |
|---|---|---|
| H | α | αλ1 |
| h | αλ2 | αλ1 λ2γ |
Type I error
Table 6 summarizes the type I error results. When there is no main effect (λ1 = 1, λ2 = 1) or only one marker has a main effect (λ1 = 1, λ2 = 3), type I errors of all statistics are well controlled at 0.05 across different prevalences (Supplemental Table 4). This result is consistent with Proposition 1. When both markers have main effects, the type I errors of Tlogistic, TOR, TLD, and TLD* are inflated as the disease becomes common. Specifically when λ1 = λ2 = 1.5, type I errors are not controlled after the disease prevalence becomes greater than 0.2; when λ1 = λ2 = 3, type I errors are not controlled as the prevalence is beyond 0.05. On the other hand, the type I errors of case-only statistics TPearson, TORc, TLDc remain at 0.05 across different disease prevalences. This simulation results also illustrate the range of disease risk, in which these different tests are comparable. Note the range depends on the strength of main effects.
Table 6. Type I errors for different testing statistics across disease prevalences.
| Test Statistics | Disease Prevalence | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| 0.001 | 0.005 | 0.01 | 0.05 | 0.1 | 0.2 | 0.3 | |||
|
|
|||||||||
| Tlogit | 0.0528 | 0.054 | 0.053 | 0.0516 | 0.0494 | 0.0698 | 0.1594 | ||
| TOR | 0.0518 | 0.0532 | 0.051 | 0.0502 | 0.0478 | 0.0646 | 0.1412 | ||
| TLD | 0.0576 | 0.0528 | 0.0542 | 0.0504 | 0.0482 | 0.0602 | 0.0872 | ||
|
|
0.057 | 0.0518 | 0.053 | 0.0496 | 0.0472 | 0.0574 | 0.0834 | ||
| TPearson | 0.0594 | 0.0506 | 0.0526 | 0.054 | 0.0528 | 0.0516 | 0.0464 | ||
| λ1 = 1.5, λ2 = 1.5 | TLogitc | 0.0582 | 0.0498 | 0.052 | 0.0536 | 0.0526 | 0.0512 | 0.046 | |
| TLDc | 0.0604 | 0.051 | 0.0524 | 0.058 | 0.0556 | 0.0518 | 0.049 | ||
|
| |||||||||
| TLogit | 0.056 | 0.0524 | 0.0522 | 0.1766 | 0.749 | - | -* | ||
| TOR | 0.0548 | 0.05 | 0.05 | 0.16 | 0.7154 | - | - | ||
| TLD | 0.0582 | 0.0482 | 0.0462 | 0.0706 | 0.1688 | - | - | ||
|
|
0.05 | 0.0412 | 0.0402 | 0.0602 | 0.146 | - | - | ||
| TPearson | 0.0586 | 0.0544 | 0.0504 | 0.0504 | 0.0516 | - | - | ||
| λ1 = 3, λ2 = 3 | TLogitc | 0.0586 | 0.0544 | 0.0502 | 0.0502 | 0.0516 | - | - | |
| TLDc | 0.06 | 0.055 | 0.0506 | 0.052 | 0.0514 | - | - | ||
The penetrance for genotype gh is greater than 1 when λ1 = λ2 = 3 and the disease prevalence becomes too large.
Power
We consider three interaction settings in our simulations. In the first setting (Figure 1(A)), both defective alleles g and h are required to exhibit the trait (λ1 = λ2 = 1). An example of this phenomenon would be the development of Retinitis Pigmentosa (RP) in which only double heterozygotes of both genes ROM1 and peripherin/RDS develop the disease [Kajiwara et al., 1994]. In the second setting (Figure 1(B)), we set λ1 = 1, and λ2 = 1.5. This simulation mimics the scenario that one locus acts as a modifier of the other locus. For example, it is reported that a single-nucleotide polymorphism (SNP) in RAD51 gene modified breast or ovarian cancer risk in BRCA2 carriers [Levy-Lahad et al., 2001]. The third setting, (Figure 1(C)), assumes a heterogeneity model. Predisposing markers at either locus will lead to the disease, but the presence of both predisposing markers further increases the disease risk (λ1 = λ2 = 1.5). According to Table 6 and Supplementary Table 4, we set the disease risk at 0.005, so that the type I errors of all the statistics are well controlled at the same level and we can perform the power comparison. As illustrated in Figure 1, the seven tests fall into three categories: (1) Tlogistic and TOR; (2) TLD and TLD*; (3) TPearson, TORc and TLDc. Performances of the tests within each category are close to each other. Comparing these three groups of tests, the logit based tests have the lowest power in all scenarios. LD based tests lie in between the case-only tests and the logit based tests. The powers of the LD based tests increase as main effects increase when comparing Figure 1(B) to Figure 1(A) and Figure 1(C) to Figure 1(B). Case-only statistics use less data than case-control statistics (5000 cases versus 5000 cases plus 5000 controls) but have better powers. This is due to the fact that vâr(ÎORc) < vâr(ÎOR) as we demonstrated in section 3.1.3. Our result agrees with Yang et al. [1999]'s conclusion that the case-only design requires fewer cases than case-control design, and is an efficient and valid approach to measuring gene-gene interaction, assuming the genes under study are in LE.
Figure 1.
Comparison of testing statistics on interaction effects under the dominant (recessive) disease models. 5000 cases and 5000 controls are simulated according to Table 2. γ is the strength of interaction. The main effect parameters are set at various values to simulate different scenarios: (A) λ1 = 1, λ2 = 1; (B) λ1 = 1, λ2 = 1.5; (C) λ1 = 1.5, λ2 = 1.5. A, B allele frequencies are both set at 0.1. 5000 replicates are simulated for each setting.
3.2 General disease model (3 × 3)
3.2.1 Notation and definition
In this section, we consider more general disease models with a total of nine genotypes on two loci. Let p ≡ {p00,p01,p02,p10…,p22} denote the genotype frequencies of {AABB, AaBB, aaBB, AABb, …aabb} for cases and q = {q00,q01,q02,q10…q22} for controls. In section 3.1.1 and 3.1.3, three types of interaction definitions and their motivated testing statistics are introduced. In the following, we will extend them to the general disease models.
(a) Penetrance based definition
Table 3 parameterizes different penetrance patterns we might observe. A natural interpretation of no interaction is that the penetrance for a given genotype in Table 3 is the product of the two corresponding “marginal penetrances”. We parameterize penetrance patterns such that λ1,…, λ4 represents the main effects and γ1,…, γ4 the interaction effects, and when γ1 = γ2 = γ3 = γ4 = 1, the rows and the columns of Table 3 are independent. Interaction is thus defined as .
(b) Logit based definition
Let X1 be the count of allele A on the first locus and X2 be the count of allele B on the second locus. Consider the logistic regression model
| (8) |
Interaction based on this model is defined as , such that no interaction (Ilogit = 0) means β5 = β6 = β7 = β8 = 0. Note parameters β5,…, β8 can be written as
This again suggests that testing for interaction by logistic regression is equivalent to testing the differences of inter-locus dependences between cases and controls on a log-odds scale.
(c) LD based definition
To extend the LD based definition to general disease models, we first need to find a measure for genotypic disequilibrium for two loci with nine genotypes. Weir [1996] proposed a digenic disequilibrium composite measure called composite LD (CLD). In this definition, the CLD for cases is defined as where gametic frequencies are recovered from genotypic frequencies
so that . Since pApB = 2(P0+ + p1+)(p+0 + p+1), can be calculated by p. Following the same step, can be also calculated for the controls. Wu et al. [2008] then proposed the following definition of interaction:
| (9) |
An alternative way to extend the concept of LD for two loci with nine genotypes is to consider each genotype individually. Yang Y. et al. [2009] detected interaction by comparing a vector of inter-locus disequilibrium measure from the cases to the controls , i, j = 0, 1, 2. In this approach, the definition of interaction assumed is
| (10) |
3.2.2 Relations between the three definitions
Although the above definitions come from different perspectives, correspondence among them still exists under certain circumstances as stated in Proposition 2 and 3.
Proposition 2
is a sufficient condition for ICLD = 0.
Proof is provided in Supplemental Proof B. However, ICLD = 0 does not imply . To show this, we provide a counterexample in Supplemental Proof B.
Proposition 3
Under the assumption that linkage equilibrium holds in a population,
| (11) |
Where and for i, j = 0, 1, 2.
Proof is provided in Supplemental Proof C. If we further assume that the disease is rare, the meanings of no interaction based on Ipenetrance, Ilogitcase and are almost the same. This is because LE in the population implies that the LD or log odds ratio in the control population is approximately 0. As a result, reduces to and Ilogit = 0 to Ilogitcase ≈ 0. In Proposition 3, we show the equivalence of them to Ipenetrance = 0, therefore Ilogit = 0, and Ipenetrance = 0 are equivalent when the disease is rare and LE holds in the population.
3.2.3 Testing statistics
(a) Logit based test
For H0 : Ilogit = 0, a deviance test can be used. Let M1 be the full model in (8) and M0 be the reduced model:
| (12) |
Let LM1 and LM0 be the estimated maximum likelihood of the data under model M1 and M0. Then Tlogistic = − 2(LM0 − LM1) is a deviance test statistic that is asymptotically distributed under the null: Ilogit = 0.
(b) LD based test
Following Weir [1996], Wu et al. [2008] proposed the following statistic to detect interaction defined using ICLD:
| (13) |
The derivation for the variance is provided in Supplemental Section D. TCLD is asymptotically distributed as central under the null ICLD = 0.
In addition, Yang Y. et al. [2009] proposed to detect interaction by testing whether in (10) equals 0. The proposed test statistic is in the same form as TLD* in (6) with i, j = 0,1, 2 instead. TLD* is asymptotically distributed as a under the null .
(c) Penetrance based test
We extend the case-only testing statistic TPearson in (7) using i, j = 0, 1, 2. The resulting new TPearson is asymptotically distributed as a central under the null Ipenetrance = 0.
The statistics from the same statistical model/method are annotated in the same way as in section 3.1 and section 3.2, although the detailed calculations are different. In section 3.1.4, we illustrate that the performances of TOR and Tlogistic are close to each other, and the case-only tests TORc, TLDc and TPearson show similar powers. Thus in this section we only consider Tlogistic among the logit based tests and TPearson among the penetrance based tests.
3.2.4 Simulation study
To evaluate different tests for interaction under the general disease model, we simulate data based on a multiplicative penetrance model illustrated in Table 4, where the strength of interaction is controlled by varying γ from 1 to 3. Penetrance parameter α is chosen so that the risk of the disease in the population is always kept at 0.01. Minor allele frequencies of a and b are both set at 0.3, so that the genotype frequency for aabb is close to 0.1, the value we used in section 3.1.4. Sample sizes are 5000 for both case and control groups. We run 5000 simulations for each of the three settings: (a) both λ1 and λ2 are 1, i.e., neither marker has a main effect; (b) λ1 = 1 and λ2 = 1.2, i.e., one marker has a main effect; and (c) λ1 = λ2 = 1.2, i.e., both markers have main effects. Here we set λ1, λ2 at 1.2 so that the penetrance risk for aabb in Table 4 is close to that for gh in Table 2 in section 3.1.4. The powers of the tests Tlogit, TLD*, TCLD, and TPearson under different settings are plotted in Figure 2. Firstly, when γ = 1, i.e., there is no interaction effect, the power curves of all tests sit right at the 0.05 level, suggesting proper controls of type I errors for all methods under simulation settings considered. As to the power performances, the overall trends are similar as that shown in Figure 1. Case-only test TPearson has the highest power whereas the logistic regression method has the lowest. Between TLD* and TCLD, TCLD gives more favorable results. Its performance is very close to the case-only test and outperforms Tlogit and by as large as 20% in power (Figure 2). The advantage of TCLD over TLD* is partly due to the fact that it is a 1-df test and has a larger null space than the other test (see Proposition 3).
Table 4. Simulation settings to detect interaction in the general disease model.
| Genotype | AA | Aa | aa | |||
|---|---|---|---|---|---|---|
| BB | α | αλ1 |
|
|||
| Bb | αλ2 | αλ2λ1γ1/4 |
|
|||
| bb |
|
|
|
Figure 2.
Comparison of testing statistics for interaction effects under the general disease models. 5000 cases and 5000 controls are simulated according to Table 4. γ is the strength of interaction. The main effect parameters are set at various values to simulate different scenarios: (A) λ1 = 1, λ2 = 1; (B) λ1 = 1, λ2 = 1.2; (C) λ1 = 1.2, λ2 = 1.2. A, B allele frequencies are both set at 0.3. 5000 replicates are simulated for each setting.
4 Detecting association allowing for interaction effects
Besides selecting a powerful test for interaction, researchers are often interested in detecting genetic effects while taking into account the possible interactions among markers. In GWAS, it has been reported that the power to detect association was improved by including interaction into consideration [Cordell, 2002]. This motivates us to conduct the following simulations and comparison studies for detecting the association between a binary outcome and a pair of loci with main and/or interaction effects. Again, we consider both the dominant (recessive) model and the general disease model. Besides the few testing statistics discussed in previous sections (TPearson, TLD, TLD* and TCLD), we include five additional test statistics— Tlogisticall, Tχ2, TKullback, TPearlman, and Tkernel— in our investigation. Some of these tests are for detecting the overall association, while others are for detecting interaction alone. We consider both groups of tests in the comparison, because we are interested in knowing whether interaction tests may be superior to overall association tests when interaction effects are much larger than main effects. The results from our simulation study will provide practical guidelines for selecting appropriate testing methods in GWAS.
4.1 Additional testing statistics
In this section, we describe a few additional testing statistics: deviance test Tlogisticall based on logistic regression, Pearson's chi-squared statistic Tχ2, and Tkernel based on the logistic kernel machine regression method [Liu et al., 2008; Wu et al., 2010]. The first three statistics detect both interaction effects between two single markers and their main effects. We include kernel machine regression method because, by employing the flexible kernel framework, this method is able to model complicated interaction patterns, and has been demonstrated to be effective in association studies [Liu et al., 2008; Wu et al., 2010]. The two correlation based tests TKullback and TPerlman for detecting interaction are considered here because correlation coefficient r can be calculated conveniently without estimating phase [Wellek and Ziegler, 2008]. Specifically, TPerlman are based on Fisher's z transformation of r, which relates closely to the statistics proposed by Ueki and Cordel [2012]; Wellek and Ziegler [2008] and Kam-Thong et al. [2010].
(a) Deviance test of total association through logistic regression
Consider the full model M1 in (8) and the reduced model :
| (14) |
We can reject the null that there is no association between the two markers and the disease if is large. The significance level can be decided based on Tlogisticall.
(b) Pearson's chi-squared test of association
We arrange the observed genotype frequencies in cases (p̂) and in controls (q̂) in a 2×9 table with the first row for cases, the second row for controls, and the columns representing specific genotypes. Then we apply Pearson's chi-squared test to test whether the two rows have the same distribution, which corresponds to the null hypothesis that the disease is not attributable to the two markers. The test statistics Tχ2 is asymptotically central distributed under the null.
(c) Logistic kernel machine method
According to Liu et al. [2008]; Wu et al. [2010], the logistic kernel machine regression model for two loci can be written as
in which for some γ1, …, γn, and K (xi, xi′) measures the similarities between xi and xi′. Based on the link between this logistic kernel machine framework and the generalized linear mixed model, h(xi) was treated as a subject specific random effect. Suppose h(x) follows an arbitrary distribution F with mean 0 and variance τK. Then H0 : h(x) = 0 is equivalent to H0 : τ = 0. The corresponding score statistic is
Under H0, Tkernel is asymptotically distributed. The calculation of κ and ν are described in Supplemental Section E. We use a quadratic kernel in our simulation studies but other kernel choices are also possible [Wu et al., 2010].
(d) Correlation based tests
We introduce two more statistics TPerlman and TKullback, which target on testing interaction of two markers on the disease through comparing the correlations between the two loci in cases and controls. Theses tests relate closely to LD tests in which the interaction effect is assessed by comparing the covariance between two markers in the cases and controls. Specifically,
where r1 and r0 are the correlation coefficients of two markers in the cases and controls. TPerlman is asymptotically normal distributed with a variance of [Larntz and Perlman, 1988].
where . TKullback is asymptotically central distributed [Kullback, 1967]. Note Wellek and Ziegler's statistics is the same as TPerlman but the variance was developed based on delta method and var(r̂), thus is more complicated to calculate.
4.2 Simulation study for testing joint association
We perform simulation studies to compare above statistics including Tlogisticall, Tχ2, TLD, TLD*, TCLD, TPerlman, TKullback, TPearson and Tkernel for testing the association of a phenotype and a pair of loci. We compare their powers under simple dominant (recessive) disease model (2 × 2) first and then general disease model (3 × 3). Simulation settings and results under dominant (recessive) disease models are summarized in Supplemental Figure 1. For general disease model (3 × 3), we adopt the six epistatic models (Table 5) introduced by Neuman et al. [1992] to generate data. The relative risk with predisposing genotypes versus other genotypes f*/f varies from 1 to 2. Disease prevalences are kept at 0.01 by adjusting f. Allele frequencies for a and b are both set to 0.3 for the same reason described in section 3.2.4. The sample sizes of cases and controls are 5000. For each setting, 5000 independent replicates are generated, and TLogisticall, Tχ2, TLD*, TCLD, TPearson, TPerlman, TKullback and Tkernel are compared. Note TLD for the dominant (recessive) model (2 × 2) is replaced by TCLD.
Table 5. Simulation settings to detect association in the general disease model.
|
|
|
|
|||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Genotype | AA | Aa | aa | Genotype | AA | Aa | aa | Genotype | AA | Aa | aa |
|
|
|
|
|||||||||
| BB | f | f | f | BB | f | f | f | BB | f | f | f* |
| Bb | f | f | f | Bb | f | f* | f* | Bb | f | f | f* |
| bb | f | f | f* | bb | f | f* | f* | bb | f* | f* | f |
|
|
|
|
|||||||||
| (A) | (B) | (C) | |||||||||
|
|
|
|
|||||||||
| Genotype | AA | Aa | aa | Genotype | AA | Aa | aa | Genotype | AA | Aa | aa |
|
|
|
|
|||||||||
| BB | f | f | f | BB | f | f | f | BB | f | f | f* |
| Bb | f | f | f* | Bb | f | f | f* | Bb | f | f | f* |
| bb | f | f* | f* | bb | f | f | f* | bb | f | f* | f* |
|
|
|
|
|||||||||
| (D) | (E) | (F) | |||||||||
Figure 3 shows the performance of each test. The null hypotheses of interest here is f*/f = 1, i.e., there is no association between the disease and the pair of loci. Under this null, all the tests properly control their type I errors, as all lines in Figure 3 sit around 0.05 when f*/f = 1. Two correlation based tests TKullback and TPearlman have identical results, and so do TLogisticall and Tχ2 with Tχ2 computationally much simpler. However, if we need to incorporate other covariates, the logistic regression provides a framework with such capability. For simulation settings (B)-(F), the tests for overall association TLogisticall, Tχ2 and Tkernel give favorable results compared to the two LD based tests and the two correlation based tests. Opposite results are observed in simulation setting (A), in which Tkernel has the worst power and the case-only test TPearson has the best power. When the interaction effect is relatively larger than the main effect as in simulation settings (B), (D) and (E), the advantage of case only tests such as TPearson remains. It is worth noting that, in general, TLD* has a better performance than TCLD, especially in the simulation settings (C) and (D). As Figure 3 shows, which method is superior depends on the structure of the underlying disease model, which is often unknown. Overall TLogisticall, Tχ2, TPearson perform well. When interaction effects are relatively strong TPearson surpasses TLogisticall, Tχ2; when main effects are stronger, the opposite are observed. Tkernel performs well in most scenarios unless the interaction effect is more prominent than the marginal effect. In summary, based on our simulations under LE, we would recommend using TLogisticall, Tχ2 and TPearson. If some level of interaction effects is believed to be present, TPearson is the first choice; if, on the other hand, the interaction effect is believed to be very small and there is no other factor needs to be adjusted in the analysis, Tχ2 is recommended due to its computational convenience over TLogisticall; if other covariates need to be adjusted, TLogisticall is advocated.
Figure 3.
Power comparison of tests on joint effects of two SNPs under general disease models. The data are simulated according to penetrance Table 5(A)-(F). Relative risk RR = f*/f. A, B allele frequencies are both set at 0.3 across all settings. Sample sizes for cases and controls are both 5000. 5000 replicates are simulated for each setting.
5 Discussion
The strong interest of biologists to learn the underlying disease mechanism and the growing need to incorporate interaction testing in GWAS have motivated our investigation on existing gene-gene interaction tests and association tests allowing for interaction.
Since interaction is a concept without agreed mathematical definition, it is worthy to clarify how various interactions are defined when performing the statistical tests. Interactions based on penetrance tables model the disease risk directly and provide a straightforward interpretation for biologists, whereas LD and logistic regression model based definitions are introduced from statistical perspectives. In this paper, we first derive the theoretical relationships between these definitions. We demonstrate that, under LE, when there is at most one marker that has a main effect or when the disease is rare, the three definitions share the same or similar null spaces. Therefore, under these circumstances, tests designed based on one definition shall always have properly controlled type I errors, even if the true underlying interaction mechanism is based on another definition. However, when disease is more common and both loci have main effects, the null spaces of various tests stemmed from different interaction definitions become different. And thus, if the true disease mechanism follows one interaction model, the tests based on another interaction model may very likely encounter inflated type I errors. These theoretical conclusions are well supported by empirical studies conducted in the paper. Specifically, we simulate data under LE according to the penetrance risk models (Table 2). Under this setting, case-only statistics considered in the paper all correspond to Ipenetrance, the penetrance based definition of interaction, and thus are always targeting at the right null hypothesis space. Other statistics using both Case and Control data, referred to as CC statistics, are either testing Ilogit = 0 or ILD = 0. According to Proposition 1, when disease is more common and both markers have main effects, Ilogit = 0 or ILD = 0 are not equivalent to Ipenetrance = 0. Therefore, as the disease prevalence increases, type I errors of CC statistics are subjected to type I error inflation, as their null hypothesis space is mis-specified (see Table 6).
We then evaluate the power performance of a collection of interaction tests through extensive simulation experiments. We again generate data based on penetrance risk models, and focus on settings where these tests share the same null space and thus all have properly controlled type I errors. The results suggest that case-only tests have the most favorable power for testing gene-gene interaction under LE. The advantage of case-only tests comes from their use of the LE assumption. For example, in section 3.1.3, we show the variance of the case-only statistics that is based on log odds ratio is always smaller than the one that uses both case and control data. These results are consistent with Yang et al. [1999] on case-only design for measuring gene-gene interaction. We also performed extensive simulation studies to compare the power of several association tests targeting on both main and interaction effects. Across the six different disease models evaluated, the overall performance of the logistic regression based test Tlogisticall and Pearsons chi-squared test Tχ2 are the best.
This work relates closely to a recent paper by Ueki and Cordel [2012], in which the authors also studied several interaction testing methods using case/control or case-only data. Our theories are consistent with, and provide helpful insights towards interpreting the empirical results of Ueki and Cordel [2012]: (1) when the disease is more common, the presence of main effects appears to have an impact on the type I errors of some methods; (2) some case/control statistics and their corresponding case-only statistics have different behaviors in term of the type I error inflation caused by main effects; and (3) some statistics are insensitive to main effects at both loci under a rare disease assumption or in the presence of a single main effect. It is worth noting that which statistics have well controlled type I errors depend on the “true” interaction model underlying the data. Since penetrance model and logistic model are used to generate data in our and Ueki and Cordel [2012]'s simulations respectively, case-only statistics and case-control statistics demonstrate well controlled type I errors accordingly.
All the investigation conducted in this paper is under the assumption of LE. However, in real applications, often LE may not hold. With regard to this limitation, on one hand, we want to emphasize that the neat theoretical connections between various statistics derived under LE provide deep insights on the differences of these statistics, and can help us to better understand the behaviors of these statistics under more complicated settings. On the other hand, to complete the picture, we also conduct small scale simulations under LD (data not shown due to space limitation). We find the logistic regression test Tlogit and log odds ratio test TOR are robust to LD, while the two LD based tests (TLD and TLD*), as well as the case-only tests are not. The inflation in type I errors of LD or penetrance based statistics caused by main effects can be substantial even when a disease is rare. Similar behavior for LD based case/control statistics were also observed in Ueki and Cordel [2012]. Thus, the LD based statistics and penetrance based statistics are not recommended if LD is present in the data. These results also imply that the relationship among various definitions of interaction is more complicated under LD, which warrants future research.
The development of gene-gene interaction tests helps to detect genetic effects in association studies, especially for complex diseases. However the inference of biological mechanism through statistical interaction is sometimes limited. Biological interaction is more of a result of physical interaction between molecules within a biochemical pathway. The two interactions are closely related if the statistical model reflects the biological model. This poses a dilemma since often scientists are interested in making inferences about biological interaction from statistical results. Without one-to-one correspondence between the two, our ability to make such inference is limited [Cordell, 2009]. However, the advancement in new technologies might provide us more information on genome, proteome and metabolome. Combined with this information, it is not infeasible to elucidate biological interactions through sensible statistical models in the future [Moore and Williams, 2005].
Supplementary Material
Acknowledgments
Support for this work was provided by the National Institute of Health grant R01CA138215 and R01GM082802.
References
- Bateson W. Mendel's principles of heredity. Cambridge University Press; 1909. [Google Scholar]
- Chakravarti A. Population genetics making sense out of sequence. Nat Genet. 1999;21(1 Suppl):56–60. doi: 10.1038/4482. [DOI] [PubMed] [Google Scholar]
- Cook N, Zee R, Ridker P. Tree and spline based association analysis of gene-gene interaction models for ischemic stroke. Stat Med. 2004;23:1439–1453. doi: 10.1002/sim.1749. [DOI] [PubMed] [Google Scholar]
- Cordell HJ. Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. Hum Mol Genet. 2002;11:2463–2468. doi: 10.1093/hmg/11.20.2463. [DOI] [PubMed] [Google Scholar]
- Cordell HJ. Detecting gene-gene interactions that underlie human diseases. Nat Rev Genet. 2009;10:392–404. doi: 10.1038/nrg2579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- D'Angelo G, Rao D, Gu C. Combining least absolute shrinkage and selection operator (LASSO) and principal-components analysis for detection of gene-gene interactions in genome-wide association studies. BMC Proceedings. 2009;3(Suppl 7):S62. doi: 10.1186/1753-6561-3-s7-s62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Donnelly P. Progress and challenges in genome-wide association studies in humans. Nature. 2008;456:728–731. doi: 10.1038/nature07631. [DOI] [PubMed] [Google Scholar]
- Fisher R. The correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edin. 1918;52:399–433. [Google Scholar]
- Goldstein D. Common genetic variation and human traits. N Engl J Med. 2009;360:1696–8. doi: 10.1056/NEJMp0806284. [DOI] [PubMed] [Google Scholar]
- Hardy J, Singleton A. Genomewide Association Studies and Human Disease. N Engl J Med. 2009;360:1759–68. doi: 10.1056/NEJMra0808700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hirschhorn J. Genomewide association studies illuminating biologic pathways. N Engl J Med. 2009;360:1699–701. doi: 10.1056/NEJMp0808934. [DOI] [PubMed] [Google Scholar]
- Hirschhorn J, Daly M. Genome-wide association studies for common diseases and complex traits. Nature Reviews Genetics. 2005;6:95–108. doi: 10.1038/nrg1521. [DOI] [PubMed] [Google Scholar]
- Jiang R, Tang W, Wu X, Fu W. A random forest approach to the detection of epistatic interactions in case-control studies. BMC bioinformatics. 2009;10:S65. doi: 10.1186/1471-2105-10-S1-S65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kajiwara K, Berson EL, Dryja TP. Digenic retinitis pigmentosa due to mutations at the unlinked peripherin/RDS and ROM1 loci. Science. 1994;264:1604–1608. doi: 10.1126/science.8202715. [DOI] [PubMed] [Google Scholar]
- Kam-Thong T, Czamara D, Tsuda K, Borgwardt K, Lewis CM, et al. EPIBLASTER-fast exhaustive two-locus epistasis detection strategy using graphical processing units. European Journal of Human Genetics. 2010;19:465–471. doi: 10.1038/ejhg.2010.196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kooperberg C, LeBlanc M, YJ D, Rajapakse I. Structures and Assumptions: Strategies to Harness Gene × Gene and Gene × Environment Interactions in GWAS. Statistical Science. 2009;24:472–488. doi: 10.1214/09-sts287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kooperberg C, Ruczinski I. Identifying interaction SNPs using Monte Carlo logic regression. Genet Epidemiol. 2005;28:157–170. doi: 10.1002/gepi.20042. [DOI] [PubMed] [Google Scholar]
- Kraft P, Cox G. Study designs for genome-wide association studies. Advances in Genetics. 2008;60:465–504. doi: 10.1016/S0065-2660(07)00417-8. [DOI] [PubMed] [Google Scholar]
- Kraft P, Hunter D. Genetic risk prediction are we there yet? N Engl J Med. 2009;360:1701–3. doi: 10.1056/NEJMp0810107. [DOI] [PubMed] [Google Scholar]
- Ku C, Loy E, Pawitan Y, Chia K. The pursuit of genome-wide association studies: where are we now? J Hum Genet. 2010;55:195–206. doi: 10.1038/jhg.2010.19. [DOI] [PubMed] [Google Scholar]
- Kullback S. On testing correlation matrices. Journal of the Royal Statistical Society Series C (Applied Statistics) 1967;16(1):80–85. [Google Scholar]
- Larntz K, Perlman MD. A Simple Test for the Equality of Correlation Matrices IV. Vol. 2. New York: Springer-Verlag; 1988. pp. 289–298. [Google Scholar]
- Levy-Lahad E, Lahad A, Eisenberg S, Dagan E, Paperna T, et al. A single nucleotide polymorphism in the RAD51 gene modifies cancer risk in BRCA2 but not BRCA1 carriers. Proc Natl Acad Sci U S A. 2001;98:3232–3236. doi: 10.1073/pnas.051624098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu D, Ghosh D, Lin X. Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC Bioinformatics. 2008;9:292. doi: 10.1186/1471-2105-9-292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Manolio Tea. A HapMap harvest of insights into the genetics of common disease. J Clin Invest. 2008;118:1590–1605. doi: 10.1172/JCI34772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Manolio Tea. Finding the missing heritability of complex diseases. Nature. 2009;461:747–753. doi: 10.1038/nature08494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marchini J, Donnelly P, Cardon L. Genome-wide strategies for detecting multiple loci that influence complex diseases. Nature Genetics. 2005;37(4):413–417. doi: 10.1038/ng1537. [DOI] [PubMed] [Google Scholar]
- McCarthy M, Abecasis G, Cardon L, Goldstein D, Little J, et al. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet. 2008;9:356–369. doi: 10.1038/nrg2344. [DOI] [PubMed] [Google Scholar]
- Millstein J, Conti D, Gilliland F, Gauderman W. A Testing Framework for Identifying Susceptibility Genes in the Presence of Epistasis. Am J Hum Genet. 2006;78:1527. doi: 10.1086/498850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore J, Asselbergs F, Williams S. Bioinformatics Challenges for Genome-Wide Association Studies. Bioinformatics. 2010;26(4):445–55. doi: 10.1093/bioinformatics/btp713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore J, Williams S. Epistasis and Its Implications for Personal Genetics. American Journal of Human Genetics. 2009;85(3):309–20. doi: 10.1016/j.ajhg.2009.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore JH, Williams SM. Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis. BioEssays. 2005;27:637–646. doi: 10.1002/bies.20236. [DOI] [PubMed] [Google Scholar]
- Neuman RJ, Rice JP, Chakravarti A. Two-Locus models of disease. Genetic Epidemiology. 1992;9:347–365. doi: 10.1002/gepi.1370090506. [DOI] [PubMed] [Google Scholar]
- Peng Q, Zhao J, Xue F. A gene-based method for detecting genegene co-association in a casecontrol association study. European Journal of Human Genetics. 2010;18:582–587. doi: 10.1038/ejhg.2009.223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rajapakse I, Perlman MD, Martin PJ, Hansen JA, Kooperberg C. Multivariate Detection of Gene-Gene Interactions. Genetic Epidemiology. 2012;36:622–630. doi: 10.1002/gepi.21656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reich D, Lander E. On the allelic spectrum of human disease. Trends Genet. 2001;17:502510. doi: 10.1016/s0168-9525(01)02410-6. [DOI] [PubMed] [Google Scholar]
- Risch N. Linkage strategies for genetically complex traits. I. Multilocus models. Am J Hum Genet. 1990;46:222–228. [PMC free article] [PubMed] [Google Scholar]
- Ritchie M, Hahn L, Roodi N, Bailey L, Dupont W, et al. Multifactor-dimensionality reduction reveals highorder interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet. 2001;69(1):138–147. doi: 10.1086/321276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seng K, Seng C. The success of the genome-wide association approach: a brief story of a long struggle. Eur J Hum Genet. 2008;16:554–564. doi: 10.1038/ejhg.2008.12. [DOI] [PubMed] [Google Scholar]
- Teo Y. Common statistical issues in genome-wide association studies: a review on power, data quality control, genotype calling and population structure. Curr Opin Lipidol. 2008;19:133–143. doi: 10.1097/MOL.0b013e3282f5dd77. [DOI] [PubMed] [Google Scholar]
- Ueki M, Cordel HJ. Improved Statistics for Genome-Wide Interaction Analysis. PLoS Genetics. 2012;8 doi: 10.1371/journal.pgen.1002625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wan X, Yang C, Yang Q, Xue H, Fan X, et al. BOOST: A Fast Approach to Detecting Gene-Gene Interactions in Genome-wide Case-Control Studies. The American Journal of Human Genetics. 2010a;87:325–340. doi: 10.1016/j.ajhg.2010.07.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wan X, Yang C, Yang Q, Xue H, Tang NL, et al. Predictive rule inference for epistatic interaction detection in genome-wide association studies. Bioinformatics. 2010b;26:30–37. doi: 10.1093/bioinformatics/btp622. [DOI] [PubMed] [Google Scholar]
- Wang Y, Barratt J, Clayton G, Todd A. Genome-wide association studies: theoretical and practical concerns. Nature Reviews Genetics. 2005;6:109–118. doi: 10.1038/nrg1522. [DOI] [PubMed] [Google Scholar]
- Weir B. Genetic data analysis II. II. Sinauer Associates; 1996. [Google Scholar]
- Wellek S, Ziegler A. A genotype-based approach to assessing the association between single nucleotide polymorphisms. Human Heredity. 2008;67:128–139. doi: 10.1159/000179560. [DOI] [PubMed] [Google Scholar]
- Wu J, Devlin B, Ringquist S, Trucco M, Roeder K. Screen and clean: a tool for identifying interactions in genome-wide association studies. Genetic epidemiology. 2010;34:275–285. doi: 10.1002/gepi.20459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu MC, Kraft P, Epstein MP, Taylor DM, Chanock SJ, et al. Powerful SNP-set analysis for case-control genome-wide association studies. Am J Hum Genet. 2010;86:929–942. doi: 10.1016/j.ajhg.2010.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu X, Jin L, Xiong M. Composite measure of linkage disequilibrium for testing interaction between unlinked loci. Eur J Hum Genet. 2008;16:644–651. doi: 10.1038/sj.ejhg.5202004. [DOI] [PubMed] [Google Scholar]
- Yang C, He Z, Wan X, Yang Q, Xue H, et al. SNPHarvester: a filtering-based approach for detecting epistatic interactions in genome-wide association studies. Bioinformatics. 2009;25:504–511. doi: 10.1093/bioinformatics/btn652. [DOI] [PubMed] [Google Scholar]
- Yang Q, Khoury MJ, Sun F, Flanders WD. Case-Only Design to Measure Gene-Gene Interaction. Epidemiology. 1999;10:167–170. [PubMed] [Google Scholar]
- Yang Y, He C, Ott J. Testing association with interactions by partitioning chi-squares. Annals of human genetics. 2009;73:109–117. doi: 10.1111/j.1469-1809.2008.00480.x. [DOI] [PubMed] [Google Scholar]
- Zhang X, Pan F, Xie Y, F Z, Wang W. FastChi: An Efficient Algorithm for Analyzing Gene-Gene Interactions. Pacific Symposium on Biocomputing'09. 2009:528–539. [PMC free article] [PubMed] [Google Scholar]
- Zhang X, Pan F, Xie Y, Zou F, Wang W. COE: A General Approach for Efficient Genome-Wide Two-Locus Epistasis Test in Disease Association Study. Journal of Computational Biology. 2010;17(3) doi: 10.1089/cmb.2009.0155. [DOI] [PubMed] [Google Scholar]
- Zhang Y, Liu J. Bayesian inference of epistatic interactions in case-control studies. Nat Genet. 2007;39:1167–1173. doi: 10.1038/ng2110. [DOI] [PubMed] [Google Scholar]
- Zhao J, Jin L, Xiong M. Test for interaction between two unlinked loci. American journal of human genetics. 2006;79:831–845. doi: 10.1086/508571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng T, Wang H, Lo S. Backward genotype-trait association (BGTA) - based dissection of complex traits in case-control design. Hum Hered. 2006;62:196–212. doi: 10.1159/000096995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ziegler A, Knig I, Thompson J. Biostatistical aspects of genome-wide association studies. Biom J. 2008;50(1):8–28. doi: 10.1002/bimj.200710398. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



