To the Editor:
Although the entropy-based method described by Zhao et al. (2005) provides a sensitive way to detect kinds of departure from a random distribution of haplotype counts between two data sets, we cannot see how it can be applied in practice to case-control samples. This is because tests that treat haplotype counts estimated from unphased data as if they were actually observed haplotypes are inherently anticonservative.
To illustrate in principle why this is, let us consider a sample in which all subjects happen to be doubly heterozygous at two loci, with genotype Aa/Bb. The maximum-likelihood estimates for the haplotype frequencies do not consist, as one might think intuitively, of each possible haplotype having frequency 0.25. Instead, there are two equally likely solutions: that haplotypes AB and ab each occur with frequency 0.5 or that, conversely, haplotypes Ab and aB both have frequency 0.5. (With N subjects, the solution for four haplotypes has likelihood 0.252N, whereas the solution for just two haplotypes has likelihood 0.52N.) If a few other genotypes are added to the data set, they will push the solution one way or the other. For example, if the sample consists of a mixture of cases and controls, and one case has genotype AA/Bb and one control has genotype aa/Bb, then the estimated haplotype frequencies will suggest that almost all cases have haplotypes AB and ab, whereas almost all controls have haplotypes Ab and aB. Although such an extreme example would not occur in practice, it is important to understand that maximum-likelihood estimation of haplotype frequencies favors solutions containing a small number of different haplotypes. This implies that, when frequencies of multilocus haplotypes are estimated separately in cases and controls, small random effects can produce quite large, apparently notable differences. In a real situation, one might estimate, for instance, that a particular haplotype occurred in a small percentage of cases but never in controls, leading to the possibly erroneous deduction that this indicates the presence of a pathogenic mutation.
To determine whether haplotype frequencies differ significantly between cases and controls, the correct approach is to perform a heterogeneity test, in which one calculates whether the overall likelihood is significantly higher if different frequencies are allowed than if the same frequencies apply to both groups. An incorrect approach is to estimate haplotype counts by multiplying the frequencies by twice the sample size and then to treat these counts as if they were actually observed. The counts may be compared using a Pearson χ2 test on a contingency table, by a permutation test as implemented in the CLUMP program (Sham and Curtis 1995) or by the newly described entropy method (Zhao et al. 2005). In every case, the test based on estimated counts will be anticonservative.
To illustrate that this is the case, we randomly generated case-control samples genotyped for two markers, assuming that the population frequencies of the haplotypes were the same for all subjects, under the assumption of random mating. For each data set, we applied a Pearson χ2 test and the entropy test to the counts of the simulated haplotypes. We then combined pairs of haplotypes into two-locus genotypes, and we used the GENECOUNTING program (Zhao et al. 2002) to obtain estimated haplotype frequencies in the cases, controls, and combined sample, along with the associated likelihoods. We applied a heterogeneity test to these likelihoods and again applied the Pearson χ2 and entropy tests, this time to the estimated counts. Illustrative results are given in table 1, for which the population frequencies of the four haplotypes were set at 0.5, 0.2, 0.2, and 0.1, and a sample size of 500 cases and 500 controls was used. The Pearson χ2 and entropy tests perform appropriately when applied to the actual haplotype counts, as does the heterogeneity test using likelihoods based on estimated frequencies. However, both of the tests that use estimated counts are markedly anticonservative.
Table 1.
Real Counts |
EstimatedCounts |
||||
P | χ2 | Entropy Test |
Heterogeneity Test | χ2 | Entropy Test |
.05 | 5,013 | 4,971 | 5,072 | 11,115 | 11,089 |
.01 | 970 | 968 | 1,034 | 3,678 | 3,678 |
.001 | 103 | 107 | 114 | 797 | 813 |
.0001 | 10 | 7 | 11 | 206 | 209 |
.00001 | 0 | 0 | 1 | 51 | 54 |
.000001 | 0 | 0 | 1 | 16 | 17 |
It is not appropriate to treat estimated haplotypes as if they were observed, and tests that do so will produce unacceptably high type I error rates. As we have said, this will apply even if a permutation test is performed on the estimated haplotypes—for example, by inputting them into the CLUMP program (Sham and Curtis 1995). However, a valid test can be devised if, instead, the original data are repeatedly permuted and then, for each permuted data set, haplotypes are estimated and a test statistic is derived. The rank of the test statistic obtained from the original data set can then be used to obtain an empirical significance level (North et al. 2003), and such an approach could be used for the entropy-based statistic. Without such a permutation procedure, we do not see how the entropy test can be applied to case-control data.
References
- North BV, Curtis D, Sham PC (2003) A note on the calculation of empirical P values from Monte Carlo procedures. Am J Hum Genet 72:498–499 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sham PC, Curtis D (1995) Monte Carlo tests for associations between disease and alleles at highly polymorphic loci. Ann Hum Genet 59:97–105 [DOI] [PubMed] [Google Scholar]
- Zhao J, Boerwinkle E, Xiong M (2005) An entropy-based statistic for genomewide association studies. Am J Hum Genet 77:27–40 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao JH, Sham PC (2002) Faster haplotype frequency estimation using unrelated subjects. Hum Hered 53:36–41 10.1159/000048602 [DOI] [PubMed] [Google Scholar]