Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Jan 1.
Published in final edited form as: Genet Epidemiol. 2010 Jan;34(1):67–77. doi: 10.1002/gepi.20436

Single-Marker and Two-Marker Association Tests for Unphased Case-Control Genotype Data, with a Power Comparison

Sulgi Kim 1, Nathan J Morris 1, Sungho Won 2, Robert C Elston 1,*
PMCID: PMC2796706  NIHMSID: NIHMS141665  PMID: 19557751

Abstract

In case-control Single Nucleotide Polymorphism (SNP) data, the Allele frequency, Hardy Weinberg Disequilibrium (HWD) and Linkage Disequilibrium (LD) contrast tests are three distinct sources of information about genetic association. While all three tests are typically developed in a retrospective context, we show that prospective logistic regression models may be developed that correspond conceptually to the retrospective tests. This approach provides a flexible framework for conducting a systematic series of association analyses using unphased genotype data and any number of covariates. For a single stage study, two single-marker tests and four two-marker tests are discussed. The true association models are derived and they allow us to understand why a model with only a linear term will generally fit well for a SNP in weak LD with a causal SNP, whatever the disease model, but not for a SNP in high LD with a non-additive disease SNP. We investigate the power of the association tests using real LD parameters from chromosome 11 in the HapMap CEU population data. Among the single-marker tests, the allelic test has on average the most power in the case of an additive disease; but, for dominant, recessive and heterozygote disadvantage diseases, the genotypic test has the most power. Among the six two-marker tests, the Allelic-LD contrast test, which incorporates linear terms for two markers and their interaction term, provides the most reliable power overall for the cases studied. Therefore, our result supports incorporating an interaction term as well as linear terms in multi-marker tests.

Keywords: Allele frequency contrast test, LD contrast test, HWD contrast test, Genome-wide Association

INTRODUCTION

A genome-wide association study with case-control data aims to localize disease susceptibility regions in the genome. Single Nucleotide Polymorphism (SNP) markers, which are usually diallelic, have been used to cover the whole genome. Two categories of tests have been applied to these data: single-marker association tests, which examine association between affection status and the data one SNP at a time, and multi-marker association tests, which examine association between affection status and multiple SNP data simultaneously.

For single-marker association tests, we can consider the allelic frequency contrast test (allelic test) [Sasieni 1997], and the Hardy Weinberg Disequilibrium (HWD) contrast test [Song and Elston 2006]. In genome-wide association studies, the allelic test has been predominantly used (e.g. The Wellcome Trust Case Control Consortium [2007]). However, this test often fails on account of a relatively strict correction for multiple comparisons, because it does not take advantage of marker linkage disequilibrium (LD) structure efficiently, and/or it is not suitable for detecting rare disease variants. For these reasons, multi-marker association tests may have more power than single-marker association tests. Such tests include the haplotype-based test [Schaid 2004; Schaid, et al. 2002], Hotelling's T2 test [Chapman, et al. 2003; Clayton, et al. 2004; Xiong, et al. 2002] and the LD contrast test [Nielsen, et al. 2004; Wang, et al. 2007; Zaykin, et al. 2006]. The original LD-contrast test requires phased genotype data, but Zaykin et al. [2006] proposed the composite-LD contrast test that does not require phased genotype data. From now on in this paper, when it applies to unphased data, we use the term “LD contrast test” to denote the composite-LD contrast test.

Several authors have proposed that either of the HWD and LD contrasts be jointly tested with the allele frequency contrast [Song and Elston 2006; Zheng, et al. 2008; Zheng, et al. 2007]. Recently, Won and Elston [2008] described the allele frequency, HWD and LD contrasts as three distinct sources of information about case-control differences and suggested performing these tests in a joint or multi-stage manner. While these three sources of information are often close to being independent, they are only strictly independent under limiting conditions [Won and Elston 2008]. This fact has restricted a systematic use of the three tests, because extra work is required to adjust for their correlations.

The allele frequency, HWD and LD contrast tests are typically developed in what has been termed a retrospective context; i.e. case-control status is considered fixed and the genotypes are considered random. However, for case-control data, epidemiologists typically take advantage of the properties of the odds ratio and use the prospective logistic regression model, making the case-control status the random variable dependent on the predictors (i.e. genotypes and covariates) which are considered fixed [Prentice and Pyke 1979]. This prospective modeling tends to allow for greater flexibility, especially when adjusting for covariates. It also provides a natural way to adjust for any correlations between the tests or other covariates and can be extended to quantitative traits. The allele frequency contrast test has been performed in a prospective logistic regression model [Longmate 2001]. However, there is little discussion in the literature concerning the prospective modeling of the other two tests.

Here, for unphased case-control data, we discuss how the allelic, HWD and LD contrasts tests may be combined, either pairwise or all three together, in the retrospective context. We then show that these joint tests correspond very closely to analogous tests based on certain prospective models. Using these general models, various specific models and their corresponding tests are presented. After deriving “true” models in terms of the penetrances of a single disease SNP and the LD structure, we look for the best test in terms of power. Lastly, under the assumption that LD among two markers and the disease locus is similar to that among three markers, we compare the power of each test using the SNP data on chromosome 11 of the HapMap CEU (Utah residents with ancestry from northern and western Europe) population data.

METHODS

Notation and Assumptions

We assume that there is a single diallelic disease SNP in the genomic region being considered, but we allow multiple disease SNPs to exist that are not in LD with any SNP in the region. We suppose there are two diallelic SNP markers, A and B, having alleles {A1,A2} and {B1,B2}, respectively, where A1 and B1 are the minor alleles. Let X and Y denote random variable for genotypes of markers A and B coded as follows:

X={1forA1A10forA1A21forA2A2},Y={1forB1B10forB1B21forB2B2}.

The random variables X and Y for the i-th individual are denoted by Xi and Yi. Icase and Ictrl denote the sets of cases and controls. We assume a multinomial distribution for unphased genotype data in the general population and denote their probabilities as in TABLE I. Note that we make minimal assumptions about the general population sampled; in particular, we do not assume HWE in the population. The allele frequencies of A1 and B1 are given by pA=pA1A2+12pA1A2 and pB=pB1B2+12pB1B2. We use μX,σX2 and σX,Y to denote the expected value of X, the variance of X and the covariance of X and Y, respectively. Note that μX = 2PA − 1 and μY = 2PB − 1. We similarly assume a multinomial distribution for cases and controls, denoting any parameters associated with these populations respectively by the subscripts “case” and “ctrl”. The HWD parameter for marker A and the composite LD parameter for alleles A1 and B1 of markers A and B are respectively given by [Weir 1996; Zaykin 2004]

dA=pA1A1pA2DAB=(g(1,1)+12g(1,0)+12g(0,1)+14g(0,0))pApB=14σX,Y.

The HWD parameter dAcan also be expressed in a different form. It can be shown that σX2=μX2(μX)2=2(pA+pA1A1)4pA2. Under HWE (i.e. pA2=pA1A1), this become σXHWE2=2pA(1pA). Thus the the HWD parameter can be expressed by dA=12(σX2σXHWE2) This means that the HWD parameter, dA, is half the the deviation of the variance from the variance expected under HWE.

TABLE I.

Probabilities for unphased genotypes

A1A1 A1A2 A2A2
B 1 B 2 g (1,1) g (0,1) g (−1,1) p B 1 B 1
B 1 B 2 g (1,0) g (0,0) g (−1,0) p B 1 B 2
B 2 B 2 g (1,−1) g (0,−1) g (−1,−1) p B 2 B 2

p A 1 A 1 p A 1 A 2 p A 2 A 2 1

Tests in the Retrospective Context

The allele frequency and HWD contrast tests for marker A and the LD contrast test for markers A and B test the equalities, between cases and controls, of the parameters, PA, dA and Δ, respectively. Suppose we have ncase cases and nctrl controls. Since p^A=12(X+1), the test statistic for the allele frequency contrast can be written TAF2=(XcaseXctrl)2SXcase2+Sxctrl2, where SXcase2=1ncaseΣiIcase(XiXcase)2 and SXctrl2=1nctrlΣiIctrl(XiXctrl)2. This corresponds to the univariate version of Hotelling's [1931] T2 test. Estimates for the HWD and LD parameters dA and Δ of a population from a sample with size n are obtained by d^A=12(1nΣi(Xiμx)2μx(1μx)2) and Δ^=12nΣi(XiμX)(YiμY). Then the T2 statistics for the HWD and LD contrast tests are given by THWD2=(d^Acased^Actrl)2Var^(d^Acase)+Var^(d^Actrl) and TLD2=(d^cased^ctrl)2Var^(d^case)+Var^(d^ctrl). It can be easily verified that, under the assumption of known and μX and μY,

Var(d^A)=14nσX22andVar(Δ^)=14n(σXY2+μXσY2+μYσX2). (1)

In practice, when μX and μY are unknown, we may replace them by the consistent estimates X and Y. Other tests exist that utilize allele frequency, HWD and LD contrasts. However, these tests are in form and perform similarly.

The joint test of allele frequency and HWD contrasts between cases and controls tests the null hypothesis H0: (PA|case dA|case) = (PA|ctrl dA|ctrl). Note the important point that the parameter vector (PA dA) determines the genotype distribution and therefore this test is equivalent to the genotypic test. We denote this test the Allelic-HWD contrast test. In the following, M′ denote the transpose of a column vector, M. Letting Zi ≡ (Xi Xi2)′, the sample mean Z is a sufficient statistic for (PA dA)′. Thus the Allelic-HWD contrast test can be performed by comparing Zcase and Zctrl. The T2 statistic for this test is given by

T2=ncasenctrlncase+nctrl(ZcaseZctrl)ST2(ZcaseZctrl), (2)

where ST2=1ncase+nctrl2(ΣiIcase(ZiZcase)(ZiZcase)+ΣiIctrl(ZiZctrl)(ZiZctrl)) and ⊖ denotes a generalized inverse. Under the null hypothesis, T2 asymptotically follows the chi-square distribution with degrees of freedom equal to the rank of ST2. Similarly, we can build a joint test of the allele frequency contrasts of two markers and their LD contrast, which tests the null hypothesis H0: (PA|case PB|case Δcase) = (PA|ctrl PB|ctrl Δctrl). We denote this test the Allelic-LD contrast test. Letting Zi = (Xi Yi XiYi)′, Z is a sufficient statistic for (PA PB Δ)′. Thus, the Allelic-LD contrast test can be performed using a version of Hotelling's T2.

Therefore, it can be seen that the additional case-control differences that can be captured by the HWD and LD contrast tests, given the allele frequency contrast(s), are equivalent to the differences in quadratic and interaction terms, respectively. The joint test for the case-control difference in allele frequency and HWD of two markers and their LD (the Allelic-HWD-LD contrast test) can be constructed, in a similar manner, by contrasting the mean vector of Zi=(XiYiXiYiXi2Yi2) between cases and controls.

Prospective Single-marker and Two-marker Association Models and Tests

Tests of regression coefficients in a logistic model with case control data, possibly with covariates, are known to be valid and widely used when covariates need to be included in the model as we mentioned earlier. In fact, the score test statistic is of the same form as (2), but with ST2 replaced by the similar matrix

1ncase+nctrli=1ncase+nctrl(ZiZ)(ZiZ)=ncase+nctrl2ncase+nctrlST2+ncasenctrlncase+nctrl)2(ZcaseZctrl)(ZcaseZctrl).

This implies that the T2 and score test statistics are asymptotically equivalent as the null and the alternative hypothesis approach each other (i.e. they become a Pitman sequence [Davidson and MacKinnon 1987]). Because the likelihood ratio test (LRT) for a logistic model is also very close to the score test for the same model, we expect all three tests, the T2 test, score test and LRT, to behave similarly.

A single marker model for the Allelic-HWD contrast test incorporating covariates is

log(μ1μ)=β0+βXX+β0+βX2X2+(covariates), (3)

where we suppress the index i and μ denotes the probability that an individual is affected. Similarly, the two marker model for the Allelic-HWD-LD contrast test incorporating covariates is

log(μ1μ)=β0+βXX+βYY++βXYXY++βX2X2+βY2Y2+(covariates). (4)

Based on these two models or a reduced form of them, we can set up various types of association tests that examine the significance of all or a subset of the regression coefficients. Six models and their global hypotheses that may be tested in a single stage analysis are numbered and presented in TABLE II and, for each of these hypotheses, we can set up the corresponding score test or LRT. For genome-wide association analysis, we can perform a single marker test with every single SNP and/or a two-marker test with every consecutive pair of SNPs.

TABLE II.

Single-marker and two-marker association tests with corresponding models and hypotheses

Test Model Null hypothesis (H0) Test Description

Single-marker association
Test 1–2 log(μ1μ)=β0+βXX+βX2X2 X βX2 = 0 Allelic-HWD contrast test (Genotypic test)
Test 1–1 log(μ1μ)=β0+βXX X) = 0 Allele frequency contrast test (Allelic test)

Two-marker association
Test 2–5 log(μ1μ)=β0+βXX+βYY+βX2X2+βY2Y2βXYXY X βY βXY βX2 βY2)= 0 Joint Allelic-HWD-LD contrast test
Test 2–4 log(μ1μ)=β0+βXX+βYY+βX2X2+βY2Y2 X βY βX2 βY2) = 0 Joint Allelic-HWD contrast test
Test 2–3 log(μ1μ)=β0+βXX+βYY+βXYXY X βY βXY) = 0 Joint Allelic-LD contrast test
Test 2–2 log(μ1μ)=β0+βXX+βYY X βY) = 0 Joint Allelic contrast test

The models in TABLE II are specific genotype-based models and it may be helpful to review the relationship between these genotype-based models and haplotype-based models. Let Hij be the number of (Ai,Bj) haplotypes an individual has. Then the haplotype-based model for two markers, taking H22 as a baseline, is written as

log(μ1μ)=β0+β12H12+β21H21+β11H11+(covariates), (5)

and the haplotype-based test is set up as a global test of their effects, that is a test of H0: (β12 β21 β11)′ = O. Ignoring covariates, the saturated haplotype-based model has four parameters for main effects (i.e. we rewrite model (5) to include β22H22 instead of β0) and six parameters for their first-order interaction effects. This model, with a total of ten parameters, is equivalent to the genotype-based model that includes the six terms in (4), together with the higher order terms X2Y,XY2,X2Y2 and an extra term for phase [Schaid 2004]. Therefore, each test in TABLE II and the haplotype-based test examine the case-control differences summarized in a different way. In particular, for two-marker data, Test 2–3 is comparable to the haplotype-based test because both of them are generally 3-degree-of-freedom tests [Conti and Gauderman 2004].

Multi-stage analysis, which utilizes the allele frequency, HWD and LD contrasts in sequential stages of the analysis, can be created by using a sequence of the tests in TABLE II in a prospective model. Suppose, for example, that SNPs have been selected for genotyping by an allele frequency contrast test based on pooled DNA samples. Then the HWD contrast test adjusted for the information which is used in the first stage can be performed by the test of H0:(βx2x)=0 in the model of Test 1–2, whether the second test is to be applied to the same or a new sample of persons. Therefore, based on these marker association models, using this framework in a multi-stage study we can perform the allele frequency, HWD and LD contrast tests and their joint tests in a more systematic and convenient way.

Penetrance Model and True Marker Association Model

We can obtain true marker association models from knowing the genotype penetrances and the LD structure among the disease and marker alleles. The tests in TABLE II do not require HWE in the general population, but we will consider true marker association models under the assumption of HWE in the general population. Let the disease SNP have alleles {D1,D2}, where D1 is the minor allele. Let D denote the disease genotype variable coded as

D={1forD1D10forD1D21forD2D2}.

We write the penetrance model as:

μ=P(affectedD)=γ0+γDD+γD2D2. (6)

Note that for simplicity of exposition this is not written on a logit scale. Consider the four disease models: additive, dominant, recessive and heterozygote (dis)advantage. These can be obtained by constraining the coefficients of the penetrance model (6) as indicated in TABLE III. In the following, we assume that the homozygote with the major allele D2 has the lowest risk. This implies that the minor allele is the disease susceptibility allele for the additive, dominant and recessive diseases, and that we are considering a heterozygote disadvantage disease. Nevertheless, the same test statistics are appropriate, and their power will be similar, for the diseases in which the homozygote with the major allele D2 has the highest risk.

TABLE III.

Constraints for disease models

Disease Model Constraint
Additive γD2 = 0
Dominant or Recessive γD2 = ±γD
Heterozygote (Dis)advantage γD = 0

Now consider the LD structure between a single marker locus and a disease locus in the general population. Let pD denote the disease allele frequency and DXD denote the LD of the marker allele A1 and the disease allele. The LD structure implies that E(D|X) = aX + b and E(D2|X) = −a2X2 + abX + c, where a=(1pA+11pA)DXD,b=(1pA11pA)DXD(12pD) and c=12pD(1pA11pA)DXD+2(pD+DXDpA)(pDDXD1pA) (see Appendix A). Given the true disease model and the LD structure, we can set up the true single-marker association model between the phenotype and single-marker data X, as follows:

μ=P(affectedX)=D=1,0,1P(affectedD)P(DX)=γ0+γDE(DX)+γD2E(D2X)=(abγD2+aγD)Xa2γD2X2+(γD2c+γDb+γ0). (7)

This true association model, which has the same form as the penetrance model (6), allows us to understand how the disease model and the LD structure affect the SNP association pattern. It clearly implies that a marker allele that is in LD with the disease allele would be observed in the cases and controls as if it were a disease SNP with a particular disease model. Now, in the model (7), as DXD approaches 0, the coefficients of both the linear and quadratic terms go to 0. However, the ratio of the coefficient of the linear term to that of the quadratic term in model (7) is (12pA)+((12pD)γDγD2)1DXD. When (12pD)γDγD20, as DXD approaches 0, the absolute value of the ratio approaches infinity. Thus as the LD decreases, the coefficient of the quadratic terms generally approaches 0 faster than does that of the linear term, and the association model becomes similar to that of an additive disease. However, if (12pD)γDγD2=0 holds for the disease model, it can be shown that μX|case = μX|ctrl = μX (Appendix B). This implies that a test based on a model with only a linear term, such as the allelic test, cannot identify the association at all when it is applied to any SNP correlated with the disease SNP, even the disease SNP itself. This condition holds only in over- or under-dominant disease models.

For given LD parameters and allele frequencies, the true two-marker association model can be obtained in a similar way to the single-marker case as follows:

μ=P(affectedX,Y)=D=1,0,1P(affectedD)P(DX,Y)=γ0+γDE(DX,Y)+γD2E(D2X,Y), (8)

which is a full model with 9 polynomials of X and Y, or a reduced form of this model. While the regression coefficients cannot be expressed simply in general, we may easily write out E(D|X, Y) and E(D2|X, Y) for computational purposes (see Appendix A).

Consider a disease allele on a multi-dimensional LD structure, by which we mean that E(D|X, Y) deviates from a linear combination of the variables X and Y. This often happens when there is high three-locus LD among the three SNPs. The case E(D|X, Y) = βXX + βYY + βXYXY + β0 with a relatively large absolute value of βXY is a “simple” multi-dimensional LD structure. In this case, the true marker association model for an additive disease is written as μ = P(affected|X, Y) = γDXX + βYY + βXYXY) + γ0. This implies that tests that include the contrast of an interaction term (i.e. Tests 2–5, Test 2–3 and the LD contrast test) may gain power by taking into account the multi-dimensional LD structure. On the other hand, tests that do not take into account the multi-dimensional LD structure (any single marker tests, Test 2–2 and Test 2–4) might have less power. The haplotype-based test also takes multi-dimensional LD structure into account, and its gain in power in a multi-dimensional LD structure has been observed by Nielsen et al. [2004].

We can consider the models in TABLE II as reduced, full, or extended in comparison to the true model. However, the models in TABLE II and the true association models are written with different link functions (the logit and identity functions, respectively); whereas the identity link function simplifies exposition for the relationship between the disease model and the association model, the logit link function is more convenient for data analysis. For small effect sizes the two link functions should yield similar models. Therefore, our true marker association models are sufficient to provide an intuition about which predictors will be important components of the logistic model.

Finding the most powerful test among the tests in TABLE II is not straightforward because the test under the full true model, which examines contrasts of all the predictor variables in the full model, is not always the most powerful. When a reduced model explains the data parsimoniously, its corresponding test becomes more powerful than the corresponding tests under the full model because of the smaller number of degrees of freedom. Therefore, we compared various association tests with different penetrance models and LD structures - which together determine the true association model.

Power Computation

As mentioned earlier, the T2 test in a retrospective model and the score test and LRT in a prospective logistic model are expected to perform similarly. We first derive the theoretical power calculated from the noncentrality parameter of the T2 test and compare this with the empirical power of the T2 test, score test and LRT. The noncentrality parameter of the T2 test for Test 2–5 is

λ=ncasenctrlncase+nctrl(μcaseμctrl)Σ(μcaseμctrl),

where μcase = (μX|case μY|case μXY|case μX2|case

Σ=ncasencase+nctrlΣcasenctrlncase+nctrlΣctrl,Σcase=(σXcase2σX,YcaseσX,XYcaseσX,X2caseσX,Y2caseσYcase2σY,XYcaseσY,X2caseσY,Y2caseσXYcase2σXY,X2caseσXY,Y2caseσX2case2σX2,Y2caseσY2case2)andΣctrl=(σXctrl2σX,YctrlσX,XYctrlσX,X2ctrlσX,Y2ctrlσYctrl2σY,XYctrlσY,X2ctrlσY,Y2ctrlσXYctrl2σXY,X2ctrlσXY,Y2ctrlσX2ctrl2σX2,Y2ctrlσY2ctrl2).

Here, and Σcase Σctrl are symmetric, so we only indicate the diagonal and above diagonal elements. The number of degrees of freedom,r is the rank of Σ. Given the penetrance model and LD structure, we can derive the entries of μcasectrlcase and Σctrl (see Appendix C). The noncentrality parameters for the other tests in TABLE II can be obtained by using the corresponding sub-matrices of (μcase − μctrl) and (Σcase + Σ). Then the power of the significance level test with noncentrality parameter λis given by

(POWER)=1FXλ2(X1α2),

where FXλ2 is the cumulative density function of a chi-square distribution with noncentrality parameter λ and degrees of freedom r, and X1α2 is the 1 − α quantile of a central chi-square distribution with r degrees of freedom.

We compared this theoretical power of the T2 test with the empirical power of the T2 test, score test and LRT. For each of the four disease models, we generated 100,000 replicate datasets, performed Test 1–2 on each dataset with each of the three test statistics and obtained their empirical power from the 100,000 replicate datasets. Specifically, the parameters were set as follows: pD =0.2, pA = 0.3, DXD = 0.048 (D′ = 0.8) K = 0.05, (5%), and μbase = 0.04 (4%), where K is the disease prevalence and μbase is the baseline risk. By fixing K and μbase instead of the effect size, for each region and each disease model we can condition on a constant attributable risk calculated as K − μbase. The coefficients γD2, γD and γ0 are determined by the disease model, the prevalence K and baseline risk μbase using the constraints shown in TABLE III and the following equations:

K=γD2{pD2+(1pD)2}+γD{pD2+(1pD)2}+γ0μbase=γ0γD+γD2.

The significance level was set toα = 0.05/500,000 for a genome-wide association study with 500K independent SNPs. Each dataset compromised 2,000 cases and 2,000 controls (n = ncase = nctrl =2,000) for the additive, dominant and heterozygote disadvantage diseases, and 500 cases and 500 controls (n = 500) for a recessive disease so that its power would not be too high. Empirical power was obtained by the ratio of the number of rejected replicates to the total number of replicates.

The theoretical power of the T2 test was close to the empirical power of the score, LRT and T2 tests (TABLE IV) while the three test statistics led to almost identical (but very small), departure from nominal Type I error [Data not shown]. The T2 test is slightly more powerful than the other two, while the LRT is slightly more powerful than the score test. The power under a recessive disease showed relatively greater inconsistencies because of the smaller sample size. Therefore the theoretical power of the T2 test can be a good estimate for any of the three tests. For the purpose of comparing the power of the tests in TABLE II and any other association tests, it is sufficient to compare the theoretical power of the corresponding T2 test.

TABLE IV.

Comparisons of Theoretical and Empirical Power of Test 1–2

Theoretical Power Empirical Power
T2 test T2 test LRT Score test
Additive 0.532 0.533 0.527 0.523
Dominant 0.366 0.366 0.361 0.359
Recessive 0.734 0.741 0.736 0.708
Heterozygote Disadvantage 0.284 0.283 0.277 0.275

For each of the four disease models, parameters are set as follows: pD = 0.2,pA = 0.3, K = 0.05, DXD = 0.048(D = 0.8),n = 2,000 (500 for recessive), α = 0.05/500,000. Empirical power is obtained by the ratio of the number of rejected replicates to the total 100,000 replicates.

RESULTS

Power Comparisons

The theoretical power of the HWD contrast test, LD contrast test and haplotype-based test can be computed from their noncentrality parameters. The noncentrality parameters for the HWD contrast test and the LD contrast test are given by λHWD=(dAcasedActrl)2Var(d^Acase)+Var(d^Actrl) and λLD=(ΔcaseΔctrl)2Var(Δ^case)+Var(Δ^ctrl), where the denominators are given in (1). The noncentrality parameter of the haplotype-based test,λHAP, can be also computed (see Appendix D). These three noncentrality parameters are given under the assumption that the true minor allele frequencies of markers A and B (pA,pB) are known, or that the haplotype frequencies are determined with certainty, in both cases and controls. The theoretical power from these noncentrality parameters somewhat overestimate the power in a real situation because the variances in the denominators of the noncentrality parameters would be greater. Therefore, our theoretical power comparisons of these three tests and the tests in TABLE II may give results which are a little favorable to these three tests.

We compared power among the single-marker tests and among the two-marker tests. For the single-marker tests, we present the power as a function of the LD (Lewontin's D′) between marker and disease allele (Fig. 1). Test 1–1 (the allelic test) always had more power than Test 1–2 (the genotypic test) or the HWD contrast test in the case of an additive disease. But in the other disease models, Test 1–2 had more power than the allelic test when LD is high. The HWD contrast test had less power for all four disease models. However, although the HWD contrast test performed poorly by itself, when it was combined with the allele frequency test to be Test 1–2, power was maximized.

For the two-marker tests, two cases of LD structure were considered for each disease model: a case with low multi-dimensional LD (LD structure 1) and one with high multi-dimensional LD (LD structure 2), as defined in the legend to TABLE IV. In LD structure 1, Test 2–2, which examines the allele frequencies of the two markers, had the most power except in the case of a recessive disease (TABLE V). However, in LD structure 2, the haplotype-based test, Test 2–5 and Test 2–3 had more power than Test 2–2 by taking into account the multi-dimensional LD. The LD contrast test, like the HWD contrast test, performed poorly alone but, when it was combined with the allele frequency contrast test, power was maximized. Our findings were not materially affected by different values of the prevalence or sample size [Data not shown].

TABLE V.

Power comparisons of two-marker tests

Test 2–5 Test 2–4 Test 2–3 Haplotype-based Test 2–2 LD contrast
(LD structure 1)
Additive 0.775 0.813 0.851 0.842 0.890 0.000
Dominant 0.695 0.736 0.774 0.749 0.819 0.000
Recessive 0.823 0.845 0.746 0.784 0.717 0.001
Heterozygote Disadvantage 0.617 0.653 0.673 0.621 0.711 0.000

(LD structure 2)
Additive 0.962 0.758 0.970 0.948 0.850 0.007
Dominant 0.921 0.673 0.926 0.887 0.769 0.003
Recessive 0.851 0.647 0.910 0.945 0.618 0.206
Heterozygote Disadvantage 0.845 0.584 0.831 0.773 0.656 0.001

For both LD structure 1 and LD structure 2 the parameters are set as follows: n = 2,000 (500 for recessive), K = 0.05, μbase = 0.04, α = 0.05/500,000, pA = 0.2, pD = 0.1, pB = 0.2, DXD = 0.045 (DXD = 0.56), DYD = 0.047(DYD = 0.59), DXY = 0.028(DXY = 0.18). For LD structure 1, the three locus LD, DXdy, was 0.01, and for LD structure 2, DXDY was 0.035. Bold numbers in each row indicate the maximum power over all two-marker tests in that row.

For a given disease model, none of the tests were found to be the most powerful in both cases of LD structure. Therefore, specifying the LD structure that the markers and a real disease SNP take on would suggest the best test for a particular association study, were it known. If we assume that the LD structure among the marker and disease SNPs is similar to that among the marker SNPs, we can compute and compare the power of each test given that LD structure by estimating the necessary parameters from the marker LD structure.

Power Comparisons Based on Real Data

We estimated LD parameters and marker allele frequencies from the HapMap CEU population data [The International HapMap Project 2005]. These data consist of 120 haplotypes estimated from 30 parent-offspring trios. We split chromosome 11 into mutually exclusive consecutive regions containing 3 SNPs each. For each region we estimated the LD and allele frequency parameters. We excluded regions where the minor allele frequencies of three consecutive markers were less than 0.1, leaving 4,648 regions. Following the method in Nielsen et al.'s [2004] simulation, we chose the disease SNP to be the one with the smallest allele frequency, assuming the disease allele would have a smaller allele frequency than the ascertained marker SNPs. Parameters other than the LD parameters were set to be the same as before, i.e. n = 2,000 (500 for recessive),K=0.05, μbase =0.04,α=0.05/500,000. For each region, we computed the power of the single marker tests based on the first of the non-disease SNPs, and we computed the power of the two marker tests based on the two non-disease SNPs. Then the mean power over all regions was computed (TABLE VI).

TABLE VI.

Mean of Power over Chromosome 11 of CEU HapMap Data

Single-marker Test Two-marker Test
Disease Model Test 1–2 Test 1–1 HWD contrast Test 2–5 Test 2–4 Test 2–3 Test 2–2 Haplotype-based LD contrast
Additive 0.423 0.457 0.000 0.575 0.586 0.604 0.632 0.625 0.019
Dominant 0.361 0.347 0.001 0.505 0.513 0.518 0.505 0.488 0.003
Recessive 0.519 0.415 0.255 0.687 0.677 0.672 0.572 0.624 0.278
Heterozygote Disadvantage 0.423 0.241 0.163 0.587 0.580 0.546 0.367 0.344 0.058

For both single-marker and two-marker tests, the parameters are set as follows: n = 2,000 (500 for recessive), K = 0.05, μbase = 0.04, α = 0.05/500,000. Bold numbers in each row indicate the maximum power in that row.

In a comparison of the single marker-tests, while Test 1–1 was the most powerful on average for an additive disease, Test 1–2 was the most powerful on average for the other three disease models. Comparing among the two-marker tests, the most powerful tests on average were: Test 2–2 for an additive disease, Test 2–3 for a dominant disease, and Test 2–5 for a recessive or heterozygote disadvantage disease. However, among these tests, Test 2–3 had relatively consistent power. It had greater overall average power than the haplotype-based test except for an additive disease, but it nevertheless had comparable power to the haplotype-based test. As seen before, the purely HWD contrast test and LD contrast test had the lowest power.

DISCUSSION

In this paper, we have represented the case-control HWD and LD contrasts, given the allele frequency contrast(s), as a quadratic term and an interaction term, respectively. Accordingly, we have written single-marker and two-marker association models with the predictors expressed as simple polynomials. The joint tests of the allele frequency, HWD and LD contrasts may be performed by testing the regression coefficients in a prospective marker association model. By considering these models in the prospective context, the allele frequency, HWD and LD contrasts can be utilized and understood better as follows. First, we gain some intuition as to how these contrasts are related to the disease model and LD structure. For example, the observation that the HWD contrast test has power only in the case of a non-additive disease is obvious in a prospective model because the HWD contrast corresponds to a quadratic term. Second, we can test the three contrasts more systematically in a multi-stage analysis by adopting a sequential test or any other testing procedures developed in regression modeling. Third, these contrasts can easily be tested together with other covariates. Therefore, under this framework population stratification can be modeled by any method that uses covariates for ancestry, as was developed in the context of the allele frequency contrast test [Price, et al. 2006; Pritchard, et al. 2000]. Last, the tests can be easily extended to quantitative traits.

By using the LD structures actually observed in HapMap data, we concluded that Test 2–3, which has an extra interaction term as well as linear terms, provides reliable power in any disease model. This suggests that, in tests involving more than two markers, it may be reasonable to include interaction terms as well as linear terms. Of course, it is difficult to speculate as to which of the many possible interaction terms should be included. In fact, multi-marker models with interaction terms have been presented by several authors [Conti and Gauderman 2004; Cordell and Clayton 2002; Devlin, et al. 2003]. Conti and Gauderman [2004] compared various two-marker tests, some of which are equivalent to the tests in TABLE II. They introduced a modified interaction term and gained comparable power to the haplotype-based test. In this paper, we compared a different set of two-marker tests under a more comprehensive set of disease models. We also show how the interaction term may be interpreted as an LD contrast. Therefore, our results provide further support for the inclusion of interaction terms in multi-marker tests.

Interaction variables have been also included in association models for the purpose of evaluating a biological interaction, or epistasis [Cordell 2002; Wu, et al. 2008; Zhao, et al. 2006]. They are built for unlinked markers in two regions each of which has been shown to be associated with a disease. As opposed to this, our model concerns relatively close markers that may be in pairwise-LD or multi-dimensional LD. Of course, close markers may also have biological interaction, and distinguishing between these two possibilities to identify the true disease pattern is an area for future research. However, localization of disease SNPs is the first step in a genome-wide study, and here we have been concerned with the inclusion of interaction terms to increase power at this screening step.

We showed theoretically that the model with a linear term will fit well for a SNP in weak LD with a causal SNP, whatever the disease model, except a special case. However, for non-additive diseases, as the LD between a disease and marker SNP increases, a test that includes an extra quadratic term has more power than a test that does not. Therefore, as the markers become denser, a test that includes the quadratic term may be preferable. Although the power of the HWD and LD contrast tests were by themselves very low, when combined with the allele frequency contrast(s), the joint tests gained power. Therefore we conclude that it is not advisable for HWD or LD contrast tests to be used alone when conducting a whole genome-wide association study, but rather they should be used in conjunction with the allele frequency contrast test.

Our results showed that in most cases our genotype-based tests have greater power than a haplotype based test. The practical benefit of genotype-based tests is that they do not require phase inference. Estimating haplotypes for a genome-wide association study not only introduces another source of variation in the test, but it also entails significantly more analysis time. However, it should be noted that we only considered two-locus haplotypes and our power comparisons using HapMap data are valid under the assumption that a disease SNP would be part of the same LD pattern as neighboring marker SNPs. Specifying a more realistic distribution for a disease allele and its LD structure with marker alleles – perhaps, for example, using coalescent theory [Zöllner and Pritchard 2005] - could provide a fairer comparison of the tests in real data analysis.

ACKNOWLEDGEMENTS

We thank Sung-Gon Yi for helpful advice. This work was supported in part by a U.S. Public Health Service Resource grant (RR03655) from the National Center for Research Resources, Research grant (GM28356) from the National Institute of General Medical Sciences, Cancer Center Support Grant P30CAD43703 from the National Cancer Institute, and Training grants (HL07567) from the National Heart, Lung and Blood Institute.

REFERENCES

  1. Chapman JM, Cooper JD, Todd JA, Clayton DG. Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power. Hum Hered. 2003;56(1–3):18–31. doi: 10.1159/000073729. [DOI] [PubMed] [Google Scholar]
  2. Clayton D, Chapman J, Cooper J. Use of unphased multilocus genotype data in indirect association studies. Genet Epidemiol. 2004;27(4):415–28. doi: 10.1002/gepi.20032. [DOI] [PubMed] [Google Scholar]
  3. Conti DV, Gauderman WJ. SNPs, haplotypes, and model selection in a candidate gene region: the SIMPle analysis for multilocus data. Genet Epidemiol. 2004;27(4):429–41. doi: 10.1002/gepi.20039. [DOI] [PubMed] [Google Scholar]
  4. Cordell HJ. Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. Hum Mol Genet. 2002;11(20):2463–8. doi: 10.1093/hmg/11.20.2463. [DOI] [PubMed] [Google Scholar]
  5. Cordell HJ, Clayton DG. A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data: application to HLA in type 1 diabetes. Am J Hum Genet. 2002;70(1):124–41. doi: 10.1086/338007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Davidson R, MacKinnon JG. Implicit Alternatives and the Local Power of Test Statistics. Econometrica. 1987;55(6):1305–29. [Google Scholar]
  7. Devlin B, Roeder K, Wasserman L. Analysis of multilocus models of association. Genet Epidemiol. 2003;25(1):36–47. doi: 10.1002/gepi.10237. [DOI] [PubMed] [Google Scholar]
  8. Hotelling H. The Generalization of Student's Ratio. The Annals of Mathematical Statistics. 1931:360–378. [Google Scholar]
  9. Longmate JA. Complexity and power in case-control association studies. Am J Hum Genet. 2001;68(5):1229–37. doi: 10.1086/320106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Nielsen DM, Ehm MG, Zaykin DV, Weir BS. Effect of two- and three-locus linkage disequilibrium on the power to detect marker/phenotype associations. Genetics. 2004;168(2):1029–40. doi: 10.1534/genetics.103.022335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Prentice RL, Pyke R. Logistic disease incidence models and case-control studies. Biometrika. 1979;66:403–411. [Google Scholar]
  12. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38(8):904–9. doi: 10.1038/ng1847. [DOI] [PubMed] [Google Scholar]
  13. Pritchard JK, Stephens M, Rosenberg NA, Donnelly P. Association mapping in structured populations. Am J Hum Genet. 2000;67(1):170–81. doi: 10.1086/302959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Sasieni PD. From genotypes to genes: doubling the sample size. Biometrics. 1997;53(4):1253–61. [PubMed] [Google Scholar]
  15. Schaid DJ. Evaluating associations of haplotypes with traits. Genet Epidemiol. 2004;27(4):348–64. doi: 10.1002/gepi.20037. [DOI] [PubMed] [Google Scholar]
  16. Schaid DJ, Rowland CM, Tines DE, Jacobson RM, Poland GA. Score tests for association between traits and haplotypes when linkage phase is ambiguous. Am J Hum Genet. 2002;70(2):425–34. doi: 10.1086/338688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Song K, Elston RC. A powerful method of combining measures of association and Hardy-Weinberg disequilibrium for fine-mapping in case-control studies. Stat Med. 2006;25(1):105–26. doi: 10.1002/sim.2350. [DOI] [PubMed] [Google Scholar]
  18. The International HapMap Project A haplotype map of the human genome. Nature. 2005;437(7063):1299–1320. doi: 10.1038/nature04226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. The Wellcome Trust Case Control Consortium Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447(7145):661–78. doi: 10.1038/nature05911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Wang T, Zhu X, Elston RC. Improving power in contrasting linkage-disequilibrium patterns between cases and controls. Am J Hum Genet. 2007;80(5):911–20. doi: 10.1086/516794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Weir BS. Genetic data analysis II: methods for discrete population genetic data. Sinauer Associates, Inc.; Sunderland, Mass.: 1996. [Google Scholar]
  22. Won S, Elston RC. The power of independent types of genetic information to detect association in a case-control study design. Genet Epidemiol. 2008;32(8):731–56. doi: 10.1002/gepi.20341. [DOI] [PubMed] [Google Scholar]
  23. Wu X, Jin L, Xiong M. Composite measure of linkage disequilibrium for testing interaction between unlinked loci. Eur J Hum Genet. 2008;16(5):644–51. doi: 10.1038/sj.ejhg.5202004. [DOI] [PubMed] [Google Scholar]
  24. Xiong M, Zhao J, Boerwinkle E. Generalized T2 test for genome association studies. Am J Hum Genet. 2002;70(5):1257–68. doi: 10.1086/340392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Zaykin DV. Bounds and normalization of the composite linkage disequilibrium coefficient. Genet Epidemiol. 2004;27(3):252–7. doi: 10.1002/gepi.20015. [DOI] [PubMed] [Google Scholar]
  26. Zaykin DV, Meng Z, Ehm MG. Contrasting linkage-disequilibrium patterns between cases and controls as a novel association-mapping method. Am J Hum Genet. 2006;78(5):737–46. doi: 10.1086/503710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Zhao J, Jin L, Xiong M. Test for interaction between two unlinked loci. Am J Hum Genet. 2006;79(5):831–45. doi: 10.1086/508571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Zheng G, Meyer M, Li W, Yang Y. Comparison of two-phase analyses for case-control genetic association studies. Stat Med. 2008;27(24):5054–75. doi: 10.1002/sim.3336. [DOI] [PubMed] [Google Scholar]
  29. Zheng G, Song K, Elston RC. Adaptive two-stage analysis of genetic association in case-control designs. Hum Hered. 2007;63(3–4):175–86. doi: 10.1159/000099830. [DOI] [PubMed] [Google Scholar]
  30. Zöllner S, Pritchard JK. Coalescent-based association mapping and fine mapping of complex trait loci. Genetics. 2005;169(2):1071–92. doi: 10.1534/genetics.104.031799. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES