Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Jul 11.
Published in final edited form as: Genet Epidemiol. 2012 May 30;36(5):480–487. doi: 10.1002/gepi.21642

Power of Single- vs. Multi-Marker Tests of Association

Xuefeng Wang 1, Nathan J Morris 1, Daniel J Schaid 2, Robert C Elston 1,*
PMCID: PMC3708310  NIHMSID: NIHMS487643  PMID: 22648939

Abstract

Current genome-wide association studies still heavily rely on a single-marker strategy, in which each single nucleotide polymorphism (SNP) is tested individually for association with a phenotype. Although methods and software packages that consider multimarker models have become available, they have been slow to become widely adopted and their efficacy in real data analysis is often questioned. Based on conducting extensive simulations, here we endeavor to provide more insights into the performance of simple multimarker association tests as compared to single-marker tests. The results reveal the power advantage as well as disadvantage of the two- vs. the single-marker test. Power differentials depend on the correlation structure among tag SNPs, as well as that between tag SNPs and causal variants. A two-marker test has relatively better performance than single-marker tests when the correlation of the two adjacent markers is high. However, using HapMap data, two-marker tests tended to have a greater chance of being less powerful than single-marker tests, due to constraints on the number of actual possible haplotypes in the HapMap data. Yet, the average power difference was small whenever the one-marker test is more powerful, while there were many situations where the two-marker test can be much more powerful. These findings can be useful to guide analyses of future studies.

Keywords: Asymptotic power, single-marker test, two-marker test, genome-wide association

INTRODUCTION

In a typical genome-wide association studies (GWAS) analysis pipeline, a single-marker test is used first, such as the allelic frequency contrast test, the Cochran-Armitage trend test, and the Hardy-Weinberg Disequilibrium (HWD) contrast test [Li et al., 2009; Sasieni, 1997; Song and Elston, 2006]. The SNPs are then ranked based on their P-values, and a threshold is set (e.g., the genome-wide significance level 5 × 10–8) such that SNPs with a P-value below that threshold receive highest priority for replication or other downstream analyses. Single-marker tests examine association between a trait and one SNP at a time, while the linkage disequilibrium (LD) structure of the markers is ignored. On the other hand, multimarker tests that examine association between a trait and multiple SNPs simultaneously have the potential to yield more robust, powerful, and informative results [Kim et al., 2010; Slavin et al., 2011; Wang et al., 2007; Wu and Lin, 2008].

A number of multimarker-based approaches have been suggested. One major strategy is based on haplotype information, which preserves the LD structure across a set of markers [Akey et al., 2001; Schaid, 2004; Schaid et al., 2002]. Evidence of association can also be evaluated by contrasting the extent of LD across multiple markers between case and control groups [Nielsen et al., 2004; Wang et al., 2007]. For unphased genotype data, which is what we focus on here, we can consider a natural multivariate generalization—contrasting a profile/vector of allele frequencies for several markers between cases and controls—by using Hotelling's T-squared test or related procedures [Clayton et al., 2004; Fan and Knapp, 2003; Xiong et al., 2002], or we can consider the composite LD contrast test [Zaykin et al., 2006]. Nevertheless, these multimarker methods have been slow to be adopted widely. In a literature survey of all GWAS published in high impact journals, the initial step in most studies was restricted to single-marker analysis [Becker and Herold, 2009]. In addition to implementation difficulties, there is a major concern about the performance of these multimarker methods in real data analysis. Indeed, it is known that the power of an association test depends on a complex balance among many factors, such as degrees of freedom (df), multiplicity of tests, and the correlation pattern among the risk and tagging SNPs [Cordell and Clayton, 2005; Li et al., 2011; Morris and Elston, 2011; Pan et al., 2010]. Owing to their increased degrees of freedom (df), multi-marker tests may have similar or reduced power relative to single-marker tests. Difficulties also arise in determining an optimal multimarker test. Some researchers have found that analyses using multivariate marker data that do not require resolution of gametic phase were often more powerful than analyses based on haplotypes, while others reached different conclusions [Becker and Herold, 2009; Clayton et al., 2004]. In fact, a uniformly most powerful test does not exist for the analysis of either unphased or haplotype data, albeit some methods are frequent winners under certain conditions or simulations [Pan et al., 2010].

Recently, Kim et al. [2010] derived single- and two-marker statistics to compare cases and controls according to allele frequencies, HWD, and composite LD scores. The three distinct sources of information about case-control differences may be tested jointly in a framework where the retrospective contrast test and the prospective logistic regression model are unified. Simulation studies carried out by Kim et al. found that power can differ for single- versus vs. two-marker tests, and that this difference in power can depend on the LD pattern among the measured markers. We might expect 2-df tests to be less powerful than 1-df tests, but that paper suggested otherwise for markers in high LD. In this study, we explore the power differences of a single- vs. a multimarker test numerically by computing the asymptotic power for both approaches. Our strategy was to focus directly on the correlation patterns of the measured SNPs and the unmeasured causal SNP, rather than attempt to simulate different LD patterns based on population genetic assumptions. We chose this strategy because the power depends directly on the correlation patterns—different simulation parameters could give the same correlation pattern, and hence the same power. Although our multimarker calculations focus only on two SNPs, our methods are general enough to be applied to a larger number of SNPs. Through extensive simulations, we examine whether it is possible, by prescreening the correlation patterns of the markers, to predict which will be the best test to perform.

METHODS

To explain the asymptotic power calculations, we first give our notation. Let pc be the population minor allele frequency of the causal SNP, and assume that the causal allele acts multiplicatively on the genotype relative risks (RRs), such that R2 = γ2 and R1 = γ, where Ri is the risk of disease for subjects carrying i copies of the causal allele relative to homozygous noncarriers. We also assume Hardy-Weinberg equilibrium in the population. Under these assumptions, the frequency of the causal allele among cases is pcD=γpc(γpc+1pc) [Clayton, 1999]. If the disease is rare, the frequency of the causal allele among controls, pcC, is approximately the same as the population allele frequency, pcCpc. If the causal SNP were measured, the test statistic to compare allele frequencies between cases and controls (assuming equal numbers of cases and controls) would be zc=N(pcDpcC)2pc(1pc), where pc=(pcD+pcC)2. This z-statistic has an asymptotic normal distribution, N(Nλc,1), where λc=(pcDpcC)2pc(1pc).

When m marker SNPs are measured, but not the causal SNP, the corresponding vector statistic Z = (Z1, Z2, ..., Zm) asymptotically follows a multivariate normal distribution: ZMVN(Nλ,Σ), where Nλ=Nλc(ρ1c,ρ2c,,ρmc); the statistic for the ith marker (coded as the number of susceptibility alleles in the genotype) is zi=N(piDpiC)2pi(1pi), its asymptotic mean is λi = ρicλc, Σ is the correlation matrix of the measured markers, and ρic is the correlation of the ith marker with the causal SNP. These asymptotic results are further discussed elsewhere [Zaitlen et al., 2010].

Based on the asymptotic multivariate normal distribution of the vector Z, the power of using either the maximum of the single-marker tests or the multivariate T-test can be evaluated as follows. For the maximum of the single-marker tests, power is the probability that at least one of the marker z-statistics exceeds the critical value threshold, which can be expressed as 1 – P(–c < z1 < c, ..., –c < zm < c). The critical value c gives the desired type I error and P(–c < z1 < c, ..., –c < zm < c) is evaluated by numerical integration of the multivariate normal distribution. For the multivariate T-test, asymptotic power can be calculated from the noncentral chi-square distribution with m df. The noncentrality parameter is ncp = Nλ′Σ–1λ, where λ = (λcρ1c, λcρ2c, ..., λcρmc).

We conducted simulation studies to compare two- and single-marker analyses. In the first simulation, we set the type I error rate to 0.05, the causal allele frequency to 0.2, the causal allele RR to 1.2, and we simulated 1,000 cases and 1,000 controls. We then let each of the values of ρ12, ρ1c, and ρ2c range over –0.995 to 0.995, with a fixed increment of 0.05 (40 × 40 × 40 combinations) and 0.01 (200 × 200 × 200 combinations), respectively. The generated correlations that did not satisfy the constraint that the correlation matrix be positive definite were removed, i.e., the correlation combinations had to satisfy the constraint: 1ρ1c2ρ2c2ρ122+2ρ1cρ2cρ12>0. We further evaluated more scenarios, setting the type I error at 0.001 and a genome-wide significance level 5 × 10–8, increasing the causal allele RR to 1.25 and 1.3, and increasing the sample sizes to 4,000 and 8,000 (equal numbers of cases and controls), respectively. Since the power (and noncentrality parameter) depends only on the standardized effect size zc=N(pcDpcC)2pc(1pc) and type I error rate α, in the second simulation, we performed tests varying only zc (2–8) and α (0.05, 0.005, . . . ,5 × 10–8).

Lastly, instead of assuming a uniform correlation distribution, we generated the correlation space by using the SNP data on chromosome 11 of the HapMap CEU (Phase 3) population data. We split the chromosome into mutually exclusive consecutive regions containing three SNPs each. For each region, we chose the disease SNP to be the one in the middle—to simulate the ideal case that a causal SNP is tagged by two flanking marker SNPs. We excluded regions where the minor allele frequency of either of the two flanking markers was less than 0.1, leaving 14,022 regions. For each region, we computed the power of single- and two-marker tests based on the correlation coefficients among three SNPs. For those regions with ρ12 = 1 or –1 (where the noncentrality parameter for the two-marker test becomes invalid), we set the power of the two tests to be equal to the value based on the univariate normal distribution, i.e., power = 1 – P(–c < z < c), where c controls the desired type I error.

RESULTS

As shown in the scatter plot of the power (Supplementary Fig. S1) and Figure 1D, the two-marker test had greater power for most of the correlation space in the first simulation. Because we are most interested in determining whether the known correlations between marker SNPs (ρ12) can guide us in choosing an optimal test, we depicted the contrast in power of the two tests as a function of ρ12 (Fig. 1). As shown in Figure 1A and B, the mean values of the power, i.e., averaged over sets of correlations corresponding to a particular value of ρ12, for single-marker tests increases when |ρ12| decreases, while the mean power of the two-marker tests is almost invariant with respect to change in ρ12 and is always greater than that of the single-marker tests. Figure 1C shows a trend that differs from the mean results: the proportion of tests where the two-marker test is more powerful reaches its highest point when ρ12 is close to zero but is lowest when |ρ12| is around 0.23. As shown in Supplementary Figure S2, this turning point depends on the sample size, causal allele RR, and significance level. From the results of the second simulation (Supplementary Fig. S3), we see that the distance between the two symmetrically placed turning points increases with increased standardized effect size and decreased significance level. Additional information from the correlations between the marker SNPs and the causal SNP must be considered to explain these findings. Figure 2 depicts the correlation space of ρ1c and ρ2c when ρ12 is fixed. It shows that two-marker tests always have greater power than single-marker tests when ρ1c and ρ2c are extreme and in opposite directions if ρ12 > 0, and when ρ1c and ρ2c are extreme and in the same directions if ρ12 < 0. This can be understood graphically by considering the difference in the shape of the acceptance regions of the two tests (an elliptical region for the two-marker test and a square region for the single-marker test). However, the power of the two-marker test will be less sensitive to the directions of the correlations when ρ12 is small (the elliptical acceptance region will approach a circle). When ρ12 is between –0.23 and +0.23, the two-marker test can also be more powerful even when ρ1c and ρ2c are extreme and in the same direction when ρ12 > 0, or extreme and in opposite directions when ρ12 < 0. Figure 2 shows that the region where the two-marker test is more powerful tends to dominate the correlation space when ρ12 is close to zero, which explains the trend we observed in Figure 1C.

Fig. 1.

Fig. 1

Summary of power from the first simulation (with type I error 0.05, causal RR 1.2, and sample size 2,000). The x-axis is the correlation between the two markers (ρ12). (A) The mean of the test power for each set of correlations corresponding to a small bin of ρ12 values using single-marker tests. (B) The mean power using two-marker tests. (C) The proportion of tests where two-marker tests have greater power than single-marker tests for each ρ12 bin. (D) Box plots showing the distribution of the difference in power between the two-marker and single-marker tests for each ρ12.

Fig. 2.

Fig. 2

Power comparison of two-marker vs. single-marker tests and their relationship with the correlation space in the first simulation. Each plot has a fixed ρ12 (correlation between markers). The gray regions are the possible space of ρ1c and ρ2c (correlations between markers and causal SNPs) for a specific value of ρ12. The dark/light area indicates those correlation combinations where the two-marker test is more/less powerful than the single-marker test.

Figure 3 shows the results obtained using the HapMap correlation pattern, in which the effect size and type I error rate were set identical to the first simulation (α = 0.05, causal allele RR = 1.2, and sample size = 1,000 + 1,000). Similar to what we found for uniformly distributed correlations, there are many cases where the two-marker test is much more powerful than the single-marker test. However, a finding that seems to contradict the previous results is that there are actually more cases in total where using a single-marker test is more powerful than using a two-marker one, although the power difference is then small (Fig. 3C). More surprisingly, the proportion of cases where the two-marker test is more powerful—when ρ12 is fixed in a small range—always decreases as |ρ12| increases, and comes down to zero when is close to 1 or –1 (Fig. 3B). To understand how the special correlation structure of the HapMap data can cause this phenomenon, we again plotted the empirical two-dimensional (2D) correlation space of ρ1c and ρ2c while fixing ρ12 (Fig. 4A). This shows that most of the correlation points group around an empty bow-tie shaped region (or lines) and, more interestingly, this region is in the correlation space where the two-marker test is the more powerful (Fig. 4C). There is also a proportion of points scattered in another, smaller bow-tie shaped region. This pattern remained consistent when we tried using different ethnic populations, releases, SNP panels, or chromosomes in the HapMap data. We surmised this pattern to be caused by the limited number of actual possible haplotypes. This was confirmed by enumerating the possible haplotype structures, which reproduced a similar correlation pattern (Supplementary Fig. S4).

Fig 3.

Fig 3

Results of a power comparison using the correlations calculated from HapMap data. (A) Scatter plot of power of two-marker tests vs. single-marker tests (each point represents an individual simulated case). (B) The proportion of tests where the two-marker test has greater power than the single-marker test for each fixed range of ρ12. (C) Histogram of power difference between the two-marker test and the single-marker test over all simulated cases. (D) Box plots showing the distribution of the difference in power between the two tests for each small fixed range of ρ12.

Fig. 4.

Fig. 4

The empirical (HapMap data) and theoretical correlation space with fixed ρ12. (A) Hexagon binning plot showing the distribution of the correlations ρ1c and ρ2c12 is fixed around 0.5) based on the SNPs chosen from HapMap data. (B) The theoretical two-dimensional correlation space when we fix ρ12 at 0.5. The dark/light gray area indicates the two-marker test is more/less powerful than the single-marker test.

DISCUSSION

It is widely acknowledged that the success and cost-effectiveness of a GWAS depends principally on the appropriate choice of tagging markers and sample size. Comparatively little attention appears to have been given to the determination of a sensible analysis strategy. To reduce the computational burden, a multistage testing is often applied in a typical analysis pipeline. Single-marker methods have become the “gold standard” in GWAS because they are easy and fast to implement, straightforward to interpret, and involve a simple (but often too stringent) multiple testing adjustment. On the other hand, multimarker association testing methods, though widely available, have had very limited use in the initial stage of the analysis. One major concern that statisticians may have had is that the increased number of df involved might negate the benefits achieved from the joint modeling of multiple markers. In this study, we have endeavored to provide more insights into the performance of the two types of methods in practical analysis—particularly for the first stage. Specifically, we have conducted extensive simulations to investigate the power of a two-marker association test vs. a single-marker test. The simulation scenarios were created by varying effect size related parameters (sample size, causal allele RR), type I error rate and, most importantly, the correlation pattern among the markers. Earlier studies based on simulated data have shown that the power of an indirect association test—and also results from comparing between different testing methods—relies on the correlation structure among tag SNPs, as well as that between tag SNPs and causal variants [Kim et al. 2010; Roeder et al., 2005]. In real data analysis, however, we have no way of obtaining information about the correlations between tag SNPs and causal SNPs, as we do in simulation studies. This is why it becomes important to learn—and we have aimed to find out—whether one can in any way determine or narrow down the optimal tests to perform by simply prescreening the correlation pattern of the markers.

Four major conclusions can be drawn from the results obtained, assuming a uniform distribution over the correlation space. First, for the major part of the correlation space, the two-marker test has greater power and the power difference is small in situations where the single-marker test is the more powerful. Second, the mean power of the single-marker test, for a fixed value of the correlation between adjacent marker SNPs (ρ12), is always smaller than that of the two-marker test—a result that is invariant to change of ρ12. Third, two interesting turning points show up when we plot the proportion of tests where the two-marker test is more powerful as a function of values of ρ12. These points move toward –1 and +1 as the effect size increases or the significance level decreases. We have given a graphic explanation of this phenomenon by examining the profiles of the power change in a 2D correlation space. Last, the distribution of the power difference at each value of ρ12 suggests that the two-marker test has relatively better performance when the correlation of the two adjacent markers is high.

However, some different, even opposite, conclusions were revealed by simulations based on the correlation patterns abstracted from HapMap CEU population data. A major difference is that, using HapMap data, overall two-marker tests have higher probability of being less powerful than single-marker tests. We have found that this difference is caused by a special bow-tie shaped region for the correlation between two markers, which is invariant to different ethnic groups and SNP panels. We have further shown that such a pattern is due to constraints on the number of actual possible haplotypes. This result seems to suggest that in general the one-marker test has the better performance for real HapMap data. But we have further discovered that, as demonstrated in Figure 3A and C, the average power difference is small whenever the one-marker test is more powerful, while there are many situations where the two-marker test can be much more powerful. For example, Figure 3D shows that, when the correlation between marker SNPs ρ12 is very high, there are situations where two-marker tests are 50% more powerful than one-marker tests. In summary, our results indicate that overall—though not on average—the two-marker test provides a valuable supplement to the one-marker test, and in fact the two-marker test should be preferred when ρ12 is known to be small (Fig. 3B). A meaningful implication of our findings is that they provide a plausible explanation for why some methods are frequent winners in some studies, and why researchers have often reached different conclusions with different simulation designs.

As pointed out by Kim et al. [2010], an advantage of the two-marker test framework in comparison with single-marker tests is that it enables the potential use of the joint Allelic-LD contrast test, which is equivalent to including a first-order interaction term as well as linear terms in the (prospective) regression model. Their results suggested that this method provides robust power under various disease models. Nevertheless, caution must be taken in using this model for the purpose of detecting a biological interaction and one may refer to Wang et al. [2010a] for a more detailed discussion of this point. What has so far been given less recognition is that the two-marker test can capture additional haplotype information even when an interaction term is not included in the model. A recent real data example of the two-marker association test discussed here is given in Slavin et al. [2011], who detected multiple new putative associated variants for coronary artery disease and hypertension by using by using it. Surprisingly, many of the significant SNP pairs they found are in high LD, which means that their interaction terms can hardly be significant and that these SNPs should, intuitively, have already been detected in a single-marker analysis. After excluding factors such as multicollinearity, they concluded that the extra power is provided by the inclusion of haplotype information when the markers are both heterozygous and in high enough LD.

In this work, we have focused on the case of an additive disease model. For dominant/recessive models, it is known that assuming additive when recessive is true results in large loss in power, but there is not much loss if the true model is dominant. This has been demonstrated by results in Kim et al. [2010]. Therefore, results of power comparisons may change under different disease models. However, by using the true penetrance model, Kim et al. [2010] showed that any difference becomes very minor as the LD between the disease and marker SNP decreases. In the case of marker SNPs that are extremely close and hence in perfect LD with the causal SNP, we would not want to consider haplotypes at all as a single-marker test would then always give the best power.

Collectively, this study provides evidence that the two-marker association test can be superior to the single-marker test and, in particular, suitable for the initial stage of a GWAS analysis to prioritize markers. It is easy to implement and yet offers reliable power to detect risk variants across various circumstances. Moreover, it is less computationally demanding because our aim is not to search for statistical interaction, and thus there is no need to conduct an exhaustive search of all pairwise combinations of markers across the genome [Evans et al., 2006; Marchini et al., 2005; Wang et al., 2010b]. For the first-stage scan, we propose to search over only all the consecutive marker pairs, for which the total number of tests will be the number of markers minus the number of chromosomes. Consequently, the adjustment for multiple testing can be based on a level similar to that used in single-marker analysis. A possible extension, to be investigated in the future, is to test three consecutive markers at a time, in which case we need to examine issues such as its reliability, as well as whether the power increase, if any, is worth the additional complexity.

ACKNOWLEDGMENTS

This work was supported in part by the following U.S. Public Health Service grants: Resource grant P41 RR03655 from the National Center for Research Resources; Cancer Center Support grant P30 CAD43703 from the National Cancer Institute; Research grants HL074166 and HL086718 from the National Heart, Lung, Blood Institute; and Research grant U01HG006382 from the National Human Genome Research Institute. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. In addition, the National Research Foundation of Korea Grant NRF-2011-220-C00004, funded by the Korean Government, supported RCE and a grant from the Merck Foundation supported X.W.

Footnotes

Supporting Information is available in the online issue at wileyonlinelibrary.com.

REFERENCES

  1. Akey J, Jin L, Xiong M. Haplotypes vs single marker linkage disequilibrium tests: What do we gain? Eur J Hum Genet. 2001;9:291–300. doi: 10.1038/sj.ejhg.5200619. [DOI] [PubMed] [Google Scholar]
  2. Becker T, Herold C. Joint analysis of tightly linked SNPs in screening step of genome-wide association studies leads to increased power. Eur J Hum Genet. 2009;17:1043–1049. doi: 10.1038/ejhg.2009.7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Clayton D. A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission. Am J Hum Genet. 1999;65:1170–1177. doi: 10.1086/302577. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Clayton D, Chapman J, Cooper J. Use of unphased multilocus genotype data in indirect association studies. Genet Epidemiol. 2004;27:415–428. doi: 10.1002/gepi.20032. [DOI] [PubMed] [Google Scholar]
  5. Cordell HJ, Clayton DG. Genetic association studies. Lancet. 2005;366:1121–1131. doi: 10.1016/S0140-6736(05)67424-7. [DOI] [PubMed] [Google Scholar]
  6. Evans DM, Marchini J, Morris AP, Cardon LR. Two-stage two-locus models in genome-wide association. PLoS Genet. 2006;2:e157. doi: 10.1371/journal.pgen.0020157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Fan R, Knapp M. Genome association studies of complex diseases by case-control designs. Am J Hum Genet. 2003;72:850–868. doi: 10.1086/373966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Kim S, Morris NJ, Won S, Elston RC. Single marker and two marker association tests for unphased case control genotype data, with a power comparison. Genet Epidemiol. 2010;34:67–77. doi: 10.1002/gepi.20436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Li J, Das K, Fu G, Li R, Wu R. The Bayesian lasso for genome-wide association studies. Bioinformatics. 2011;27:516–523. doi: 10.1093/bioinformatics/btq688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Li Q, Zheng G, Liang X, Yu K. Robust tests for single marker analysis in case control genetic association studies. Ann Hum Genet. 2009;73:245–252. doi: 10.1111/j.1469-1809.2009.00506.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Marchini J, Donnelly P, Cardon LR. Genome-wide strategies for detecting multiple loci that influence complex diseases. Nat Genet. 2005;37:413–417. doi: 10.1038/ng1537. [DOI] [PubMed] [Google Scholar]
  12. Morris NJ, Elston RC. A note on comparing the power of test statistics at low significance levels. Am Stat. 2011;65:164–166. doi: 10.1198/tast.2011.10117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Nielsen DM, Ehm MG, Zaykin DV, Weir BS. Effect of twoand three-locus linkage disequilibrium on the power to detect marker/phenotype associations. Genetics. 2004;168:1029–1040. doi: 10.1534/genetics.103.022335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Pan W, Han F, Shen X. Test selection with application to detecting disease association with multiple SNPs. Hum Hered. 2010;69:120–130. doi: 10.1159/000264449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Roeder K, Bacanu SA, Sonpar V, Zhang X, Devlin B. Analysis of single locus tests to detect gene/disease associations. Genet Epidemiol. 2005;28:207–219. doi: 10.1002/gepi.20050. [DOI] [PubMed] [Google Scholar]
  16. Sasieni PD. From genotypes to genes: Doubling the sample size. Biometrics. 1997;53:1253–1261. [PubMed] [Google Scholar]
  17. Schaid DJ. Evaluating associations of haplotypes with traits. Genet Epidemiol. 2004;27:348–364. doi: 10.1002/gepi.20037. [DOI] [PubMed] [Google Scholar]
  18. Schaid DJ, Rowland CM, Tines DE, Jacobson RM, Poland GA. Score tests for association between traits and haplotypes when linkage phase is ambiguous. Am J Hum Genet. 2002;70:425–434. doi: 10.1086/338688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Slavin TP, Feng T, Schnell A, Zhu X, Elston RC. Two-marker association tests yield new disease associations for coronary artery disease and hypertension. Hum Genet. 2011;130:725–733. doi: 10.1007/s00439-011-1009-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Song K, Elston RC. A powerful method of combining measures of association and Hardy–Weinberg disequilibrium for fine mapping in case control studies. Stat Med. 2006;25:105–126. doi: 10.1002/sim.2350. [DOI] [PubMed] [Google Scholar]
  21. Wang T, Zhu X, Elston RC. Improving power in contrasting linkage-disequilibrium patterns between cases and controls. Am J Hum Genet. 2007;80:911–920. doi: 10.1086/516794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Wang X, Elston RC, Zhu X. Statistical interaction in human genetics: How should we model it if we are looking for biological interaction? Nat Rev Genet. 2010a;12:74–74. doi: 10.1038/nrg2579-c2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Wang Z, Liu T, Lin Z, Hegarty J, Koltun WA, Wu R. A general model for multilocus epistatic interactions in case-control studies. PloS one. 2010b;5:e11384. doi: 10.1371/journal.pone.0011384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Wu R, Lin M. Statistical and computational pharmacogenomics. Chapman & Hall; London: 2008. [Google Scholar]
  25. Xiong M, Zhao J, Boerwinkle E. Generalized T2 test for genome association studies. Am J Hum Genet. 2002;70:1257–1268. doi: 10.1086/340392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Zaitlen N, Pasaniuc B, Gur T, Ziv E, Halperin E. Leveraging genetic variability across populations for the identification of causal variants. Am J Hum Genet. 2010;86:23–33. doi: 10.1016/j.ajhg.2009.11.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Zaykin DV, Meng Z, Ehm MG. Contrasting linkage-disequilibrium patterns between cases and controls as a novel association-mapping method. Am J Hum Genet. 2006;78:737–746. doi: 10.1086/503710. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES