Skip to main content
Gut logoLink to Gut
letter
. 2004 Apr;53(4):614–616. doi: 10.1136/gut.2003.31856

Biallelic genotype distributions in papers published in Gut between 1998 and 2003: altered conclusions after recalculating the Hardy-Weinberg equilibrium

B Györffy 1, I Kocsis 2, B Vásárhelyi 3
PMCID: PMC1773995  PMID: 15016761

The Hardy-Weinberg law1–4 presents a mathematical statement that describes the relationship between gene frequencies and genotype frequencies: gene frequencies at a locus in a randomly interbreeding diploid population and population genotype frequencies remain constant from generation to generation if mating is random and mutation, selection, and migration do not occur. The law states a fundamental principle of population genetics that is approximately true for small, and holds with increasing exactness for large populations. Should the frequencies be perturbed for any reason, they will come to the expected equilibrium frequencies after one generation of random mating.

The Hardy-Weinberg law can be used for analytical purposes. It is suitable to test the hypothesis of panmixia and evolutionary stasis. Moreover, it represents a null hypothesis to test in genetic studies. However, according to our personal experience, data for Hardy-Weinberg equilibrium (HWE) calculations in studied populations are not always presented in articles with data on the genotype distributions of biallelic polymorphisms of Mendelian inheritance. In this retrospective survey, we tested in papers published in Gut, if this important and qualifying law was checked in studies investigating genetic polymorphisms between 1998 and April 2003.

We collected genotype distributions published in papers in Gut from 1998 (volume 42) to 2003 (volume 52). Of 2389 total publications, we found 69 where genetic polymorphisms were part of the study. Of these, those articles that fulfilled the following criteria were selected: investigation of biallelic genetic polymorphism with Mendelian inheritance; use of healthy reference population in the study; and availability of genotype distribution data.

We recalculated HWE in each paper and in each study group. For this purpose, we used Arlequin software (http://anthropologie.unige.ch/arlequin/).5,6 The level of statistical significance was set at p<0.05. Deviations from HWE were further confirmed by manual recalculation.

Twenty publications in Gut fulfilled the enrolment criteria; these publications presented data on 166 genotypes. However, only four papers (20%) reported that HWE calculations were performed and genotype distribution fulfilled HW criteria. Genotype distributions in healthy reference populations did not fulfil HWE in two publications (10%).7,8 In two reports, HWE was fulfilled in controls but failed in the diseased population and this fact was not reported.9,10 In one publication, HWE was not fulfilled either in the healthy reference or in the diseased populations.11 In summary, we found 11 genotype distributions in five publications of the studied 166 genotypes and 20 papers where calculation of the HW law (in control or in the investigated populations) would provide additional information. We present the results of HW calculations of these 11 polymorphisms in table 1.

Table 1 .

Genotype distributions of gene polymorphisms missing the Hardy-Weinberg equilibrium (HWE) in studies published in Gut between 1998 (volume 42) and 2003 (volume 52)

Publication [reference] Gene polymorphism Subjects Genotype distribution WW/WM/MM HWE p value
1998;43:187–9.7 CTLA-4 exon 1 position 49 Controls 62/47/21 0.034
HLA DR3 negative controls 54/36/16 0.027
2001;48:836–42.9 HFE H63D polymorphism haplotypes Iron overload patients 178/85/60 0.001
2002;50:520–4.11 Methylene tetrahydrofolate reductase Controls 533/560/114 0.053
Control, 60 y< 204/229/34 0.005
Female colorectal cancer patients 134/101/35 0.031
Colorectal cancer stage: Dukes’ B 94/64/28 0.003
Colorectal cancer and p53 mutation− 148/116/39 0.036
2003;52:547–51.8 CYPA1A1 exon7 Controls 122/22/5 0.012
EPXH exon 3 Controls 59/58/32 0.018
2003;52:558–62.10 NOD2 gene 1007fs variant Crohn’s disease patients 248/20/3 0.017

WW, number of patients with homozygosity for wild allele; WM, number of patients with heterozygosity, MM, number of patients with homozygosity for mutant allele.

There are several explanations why the observed genotype frequencies may deviate significantly from those expected by HW law and why genotypes have different likelihoods of being included, even when determination of genotype was methodologically correct. One or more of the assumptions of the model might be incorrect, non-random mating (inbreeding or an allele effect on the mating) or gene flow may have occurred, or selection operated. There could also be an error at sampling: the studied population was not well defined or the sample size may be too small.12 (Interestingly, the occurrence of HWE error was independent of the number of patients enrolled; the size of study populations in the affected papers ranged between 24 and 1207.)

If the genotype distribution in the control population misses the HWE, the results should be treated cautiously because the observed genotype distribution does not represent the genotype distribution in healthy (non-diseased) people and, therefore, conclusions cannot be drawn for the significance of the investigated polymorphism.

If the genotype distribution in the investigated (diseased) population does not fulfil the HWE law (while the healthy reference population fulfils it), it might be supporting evidence for the correlation between genotype and disease. Unreported or weak associations can be detected by calculating the HWE, even when statistically significant differences between genotype distributions is not present.

In conclusion, we suggest that providing genotype distribution data together with detailed results of HWE calculations should be a must when results of population genetic studies are published.

References

  • 1.Hardy GH. Mendelian proportions in a mixed population. section: Discussion and correspondence. Science 1908;28:49–50 (Reprinted in Peters JH. Classic papers in genetics. Englewood Cliffs, NJ: Prentice-Hall Inc, 1959.). [DOI] [PubMed] [Google Scholar]
  • 2.Weinberg W. Über den Nachweis der Vererbung beim Menchen. Jahresh. Verein f. vaterl. Naturk Wüttemberg 1908;64:368–82. [Google Scholar]
  • 3.Roughgarden J. Theory of population genetics and evolutionary ecology. NJ: Prentice Hall Inc, 1996.
  • 4.Stern C. The Hardy-Weinberg law. Science 1943;97:137–8. [DOI] [PubMed] [Google Scholar]
  • 5.Schneider S, Roessli D, Excoffier L. Arlequin ver. 2.000. Software for population genetics data analysis. Switzerland: Genetics and Biometry Laboratory, University of Geneva, 2000.
  • 6.Guo SW, Thompson EA. Performing the exact test of Hardy-Weinberg proportion for multiple alleles. Biometrics 1992;48:361–72. [PubMed] [Google Scholar]
  • 7.Djilali-Saiah I, Schmitz J, Harfouch-Hammoud E, et al. CTLA-4 gene polymorphism is associated with predisposition to coeliac disease. Gut 1998;43:187–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.de Jong DJ, van der Logt EM, van Schaik A, et al. Genetic polymorphisms in biotransformation enzymes in Crohn’s disease: association with microsomal epoxide hydrolase. Gut 2003;52:547–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Aguilar-Martinez P, Bismuth M, Picot MC, et al. Variable phenotypic presentation of iron overload in H63D homozygotes: are genetic modifiers the cause? Gut 2001;48:836–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Helio T, Halme L, Lappalainen M, et al. CARD15/NOD2 gene variants are associated with familially occurring and complicated forms of Crohn’s disease. Gut 2003;52:558–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Shannon B, Gnanasampanthan S, Beilby J, et al. A polymorphism in the methylenetetrahydrofolate reductase gene predisposes to colorectal cancers with microsatellite instability. Gut 2002;50:520–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hedrick PW. Genetics of populations. New York: Van Nostrand Reinhold Co, 1983.
Gut. 2004 Apr;53(4):614–616.

Author’s reply

T Heliö 4, M Lappalainen 4, H Fodstad 4, P Paavola-Sakki 4, U Turunen 4, M Färkkilä 4, K Kontula 4, L Halme 5, T Krusius 6

Györffy et al addressed the relevant issue of deviation of biallelic genotype distributions from Hardy-Weinberg equilibrium and the interpretation of case control studies. They identified several papers that had been published in Gut in which the genotype distributions of cases, controls, or both groups deviated from Hardy-Weinberg equilibrium. They criticised the lack of reporting these observed deviations and they even suggested that the conclusions drawn from these studies should be reformulated.

We completely agree with Györffy et al that if the genotype distribution of the control population deviates from Hardy-Weinberg equilibrium, it should raise the suspicion of selection bias or genotyping error, and the suitability of such material for case control studies should be rechecked.

We also share the same view with these authors that if the genotype distribution in the diseased population does not fulfil the Hardy-Weinberg equilibrium, it may in fact constitute supporting evidence for the correlation between genotype and disease. Actually, in recent literature, it has been discussed that disease susceptibility loci could be searched for by testing for Hardy-Weinberg disequilibrium in a gene bank of affected individuals.1–3

In our paper, there was a deviation of the genotype distribution from Hardy-Weinberg equilibrium in the diseased subjects but not in controls. The probands were not related and the genotyping results were controlled for. In our patient group, homozygosity for the NOD2 1007fs mutation was slightly overrepresented in statistical terms, which could serve as an additional argument for the pathophysiological significance of this mutation or may represent a simple chance finding. In our opinion, the fact that the genotype frequencies of only the patients deviate from Hardy-Weinberg equilibrium does not alter the conclusions previously made.

References

  • 1.Lee WC. Searching for disease-susceptibility loci by testing for Hardy-Weinberg disequilibrium in gene bank of affected individuals. Am J Epidemiol 2003;158:397–400. [DOI] [PubMed] [Google Scholar]
  • 2.Weinberg CR, Morris RW. Invited commentary: testing for Hardy-Weinberg disequilibrium using a genome single-nucleotide polymorphism scan based on cases only. Am J Epidemiol 2003;158:401–3. [DOI] [PubMed] [Google Scholar]
  • 3.Lee WC. Lee responds to “Testing for Hardy-Weinberg disequilibrium”. Am J Epidemiol 2003;158:404–5. [DOI] [PubMed] [Google Scholar]
Gut. 2004 Apr;53(4):614–616.

Author’s reply

P Aguilar-Martinez 7, C Thelcide 7, J F Schved 7, M Bismuth 8, G-P Pageaux 8, P Blanc 8, D Larrey 8, M C Picot 9, F Blanc 10

Györffy et al calculated Hardy-Weinberg equilibrium (HWE) using data from our paper on H63D homozygotes.1 They stated that HWE was fulfilled in the control but not in the patient population. We do not think that our paper was eligible for the study of Györffy et al because haemochromatosis genotypes reported in our paper do not have a biallelic distribution. Data calculated in their report were based on results given in table 1 of our paper. In fact, in table 1 we listed the genotypes obtained with two different alleles of the HFE gene implicated in the pathogenesis of haemochromatosis (C282Y and H63D mutations) which have been shown to be exclusive of each other,2 along with the normal (wild-type) allele. Thus a total of three alleles have to be taken into account, C282Y, H63D, and wild-type. Therefore, HWE should have been calculated using a multiallelic system and not a biallelic system.

Table 1 .

HFE genotype of the whole population referred for a personal or family history of iron overload (patients) compared with a control group (controls) from our area (1276 unselected newborns)4

HFE genotype Patients Controls
n % n %
m1m1 178 18.4 2 0.2
m1N 141 14.6 58 4.6
m1m2 85 8.8 15 1.2
m2m2 60 6.2 36 2.8
m2N 170 17.6 345 27.0
NN 334 34.5 820 64.3
Total 968 100 1276 100

m1, C282Y; m2, H63D; N, wild-type.

Furthermore, in table 1 of their paper, Györffy et al reported values used to compare the wild-type genotype (WW), homozygotes for the mutant allele (MM), and heterozygotes (WM) in each paper. For the wild-type allelic distribution of our paper, the authors took into account data relative to the C282Y mutation (homozygote (n = 178) or heterozygote (n = 85)) instead of the wild-type allele (homozygote (n = 334) or heterozygotes (n = 170)). Thus the values used to calculate the wild-type counterpart of the H63D mutation are not correct. This incorrect use of our results may obviously lead to erroneous conclusions.

Currently, it is well known that the various genotypes generated by both mutations (C282Y homozygosity, C282Y and H63D compound heterozygosity, and H63D homozygosity) and leading to iron overload have an incomplete penetrance.3 Our patient population was not a random population but a sample of patients referred for HFE analysis because of a diagnosis of iron overload. It would not have been surprising to found that, in this population, HWE was not fulfilled; abnormal genotypes were obviously overrepresented, probably according to the degree of penetrance of each.

In our paper, we did not verify if the HWE was applicable but we compared statistically the patient and control populations, which was a random well representative sample.4 We showed that there was a significant difference between the presence of H63D homozygotes among patients and controls. The statistical difference was much higher for C282Y homozygotes and compound heterozygotes, but this result was not reported as it was beyond the scope of the paper.

In conclusion, we do not think that HWE would have contributed to the analysis of the patient population in our paper except to check if the control population had been correctly chosen.

References

  • 1.Aguilar-Martinez P, Bismuth M, Picot MC, et al. Variable phenotypic presentation of iron overload in H63D homozygotes: are genetic modifiers the cause? Gut 2001;48:836–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Beutler E. Genetic irony beyond haemochromatosis: clinical effects of HLA-H mutations. Lancet 1997;349:296–7. [DOI] [PubMed] [Google Scholar]
  • 3.Feder JN, Gnirke A, Thomas W, et al. A novel MHC class I-like gene is mutated in patients with hereditary haemochromatosis. Nat Genet 1996;13:399–408. [DOI] [PubMed] [Google Scholar]
  • 4.Aguilar-Martinez P, Picot MC, Becker F, et al. Prevalence of HFE mutations in people from North Africa living in southern France. Br J Haematol 2001;114:914–16. [DOI] [PubMed] [Google Scholar]

Articles from Gut are provided here courtesy of BMJ Publishing Group

RESOURCES