Skip to main content
Genetics logoLink to Genetics
. 2006 Nov;174(3):1695–1697. doi: 10.1534/genetics.106.057042

A Clarification of the Hardy–Weinberg Law

Alan E Stark 1,1
PMCID: PMC1667064  PMID: 16702429

Abstract

C. C. Li showed that Hardy–Weinberg proportions (HWP) can be maintained in a large population by nonrandom mating as well as random mating. In particular he gave the mating matrix for the symmetric case in the most general form possible. Thus Li showed that, once HWP are attained, the same proportions can be maintained by what he called pseudorandom mating. This article shows that, starting from any genotypic distribution at a single locus with two alleles, the same in each sex, HWP can be reached in one round of nonrandom mating with no change in allele frequency. In the model that demonstrates this fact, random mating is represented by a single point in a continuum of nonrandom possibilities.


LI (1988) made a valuable contribution when he showed that Hardy–Weinberg (HW) frequencies can be maintained in large populations with nonrandom mating (Hardy 1908; Weinberg 1908, 1963). However, textbooks that invoke random mating as a justification for use of HW frequencies in population models and, in addition, maintain that random mating is a necessary condition, continue to be published. For example, Holsinger (2001, p. 914) states “… the Hardy-Weinberg Law provides a way to estimate allele frequencies … provided we are willing to assume that all of the assumptions apply to the population in which we are interested.” The important point to note here is that one of Holsinger's assumptions is that individuals “choose mates at random.”

Random mating is sometimes described as pairing in ignorance of the genotypes of potential mates. But even lack of such knowledge does not ensure pairings with frequencies satisfying the formal criterion of randomness. Moreover, it is claimed frequently that assortative mating is incompatible with HW frequencies. But this is not generally true. Random mating is not a realistic assumption for human population models and therefore it should be invoked only with careful qualification based on correct mathematical foundations. Because it is simple to describe, resort to the assumption of random mating has been too facile. But this does not justify presentations of the most basic concept of population genetics that contain dubious and even formally incorrect notions.

Li (1988, p. 736) wrote that “An infinite number of patterns of deviations from random mating exists for autosomal loci that would make a population pseudo-random mating. This could be a contributing factor to the robustness of the Hardy-Weinberg law … makes the study of the mating pattern a worthwhile subject.”

The next section gives the notation together with a brief outline of Li's model, for the reader's convenience. The final section gives the model that yields HWP from an arbitrary genotypic distribution in one round of nonrandom mating.

LI'S MODEL

Consider a population with respect to a single locus having alleles A and B with respective frequencies q and p, the same in males and females. Denote frequencies of genotypes AA, AB, and BB by f0, f1, and f2. Table 1 gives Li's symmetrical mating model, which he introduces with the remark: “…When reciprocal crosses have the same frequency, the general pattern will be symmetrical” (Li 1988, p. 733). Thus the roles of males and females can be reversed without changing the model. This case is simpler than Li's more general model but is suitable for the present purpose. The 3 × 3 matrix of cell frequencies is denoted by [fij], i = 0, 1, 2; j = 0, 1, 2.

TABLE 1.

Li's symmetric nonrandom mating model

♀ × ♂ AA AB BB
AA q4 + a 2pq3ab p2q2 + b
AB 2pq3ab 4p2q2 + 4b 2p3q + a − 3b
BB p2q2 + b 2p3q + a − 3b p4a + 2b

Both row totals and column totals give Hardy–Weinberg proportions: f0 = q2, f1 = 2pq, f2 = p2. Thus the parental population is in HW form and it is simple to show, under the usual assumptions, that the distribution of genotypes among offspring is the same. Note that f11 = 4f02.

There are four parameters or constraints in Li's model, taking up the 4 d.f. in [fij]: these are q, Sewall Wright's fixation index F, here taking the value zero, a, and b. Parameters a and b are constrained by the requirement that the elements of [fij] be nonnegative. In Li's model, random mating is defined by the pair of conditions a = 0 and b = 0.

A compact formula giving a range of nonrandom mating tables that reproduce HWP is given by Stark (2005). This was adapted from a more general formula that was used by Stark (1980) to classify systems of partial inbreeding. Other relevant sources are Stark (1977) and A. E. Stark (unpublished results).

HWP FROM AN ARBITRARY DISTRIBUTION WITH NONRANDOM MATING

Here it is demonstrated that HW proportions can be obtained in one round of nonrandom, as well as random, mating from any genotypic distribution shared by males and females. Suppose that, in generation t, the population has gene frequencies qt and pt and genotypic frequencies f0(t) = qt2 + Ftptqt, f1(t) = 2ptqt(1 − Ft), and f2(t) = pt2 + Ftptqt. The possible values of Ft are constrained by qt to ensure that {f0(t), f1(t), f2(t)} is a valid set of genotypic frequencies, but generally Ft is in the interval (−1, 1). Without loss of generality take qt in the interval 0 < qtInline graphic.

Consider a mating system in which the frequency of i × j couples in generation t giving rise to offspring in generation t + 1 is

graphic file with name M2.gif (1)

where the entries in (1) are defined by

graphic file with name M3.gif

Equation 1 was derived from Fisher's identity whose properties and background references are set out fully in Lancaster (1969, p. 90). The set of values {d0(t), d1(t), d2(t)}, denoted by d(t), was constructed by first assigning values 0, 1, and 2, respectively, to genotypes AA, AB, and BB and then correcting by deducting the mean with respect to the distribution of genotypic frequencies which is 2pt. Thus the mean of d(t), defined by Inline graphic, is zero and the variance St of d(t) is calculated from Inline graphic. The set of values {e0(t), e1(t), e2(t)} is denoted by e(t). Since the mean of e(t), defined by the expression Inline graphic, is zero, the variance of e(t), denoted Tt, is calculated from Inline graphic. In (1) the quantity ei(t)ej(t)/Tt is the product of a pair of standardized elements, that is, elements whose mean is zero and standard deviation is unity.

This article requires a special case of Equation 1, namely the case when μ = 0. Then (1) reduces to

graphic file with name M8.gif (2)

Although Equation 1 is not used in its entirety in what follows, (1) is included because otherwise (2) gives little insight to the reader as to its origins. Fisher's identity was conceived originally as a means of expressing a bivariate distribution in canonical form to explore its properties. For a 3 × 3 matrix such as that in Table 1 this entails finding two sets of variable values each with mean zero and standard deviation unity. In this article the procedure is reversed by starting from a canonical form, such as (1), but giving it desired properties. It turns out that d(t) as defined above, combined with setting the correlation coefficient μ to be zero, generates HWP in the offspring. Other values of μ produce, in time, non-HWP. Straightforward calculation shows that the distribution of genotypes in offspring of generation t + 1 obtained from (2) is

graphic file with name M9.gif

that is, the offspring are distributed in HWP.

The admissible values of ν in (2) are obtained by setting various elements of the mating matrix to zero. Given qt and −qt/pt < Ft < 1, the upper limit is obtained as

graphic file with name M10.gif

when f01(t) = 0.

The lower limit is set in two intervals: given qt and −qt/pt < FtInline graphic(ptqt)/pt, the left lower limit is obtained as

graphic file with name M12.gif

when f00(t) = 0; and given qt and Inline graphic(ptqt)/ptFt < 1, the right lower limit is obtained as

graphic file with name M14.gif

when f11(t) = 0.

As an example of (2), when the lower limits meet, that is, when f00(t) = 0 and f11(t) = 0, Ft = Inline graphic(ptqt)/pt, e0(t) = −1, e1(t) = 1, e2(t) = −qt/(2 − 3qt), Tt = qt(3 − 4qt)/(2 − 3qt), and νlr = −Tt, the mating matrix is as given in Table 2.

TABLE 2.

Mating scheme with f00(t) = 0 and f11(t) = 0

♀ × ♂ AA AB BB
AA 0 qt2 Inline graphicqt(1 2qt)
AB qt2 0 ptqt
BB Inline graphicqt(1 2qt) ptqt 1 − 3qt + 2qt2

In summary, given any gene frequency q and any admissible F, a value of ν can be chosen from an interval governed by q and F, according to the limits given above, to produce HWP in one generation. Random mating is defined by the single value ν = 0 contained in this interval. Thus Li's contribution and this one completely remove the necessity of random mating as a requirement for the establishment and maintenance of HWP. Hardy and Weinberg showed that random mating was a sufficient condition and this is the concept that has taken hold in the genetics community for almost 100 years.

Acknowledgments

I thank Francisco M. Salzano and Paulo A. Otto for their encouragement.

References

  1. Hardy, G. H., 1908. Mendelian proportions in a mixed population. Science 28: 49–50. [DOI] [PubMed] [Google Scholar]
  2. Holsinger, K. E., 2001. Hardy-Weinberg law, pp. 912–914 in Encyclopedia of Genetics, edited by S. Brenner and J. H. Miller. Academic Press, San Diego.
  3. Lancaster, H. O., 1969. The Chi-Squared Distribution. John Wiley & Sons, New York.
  4. Li, C. C., 1988. Pseudo-random mating populations. In celebration of the 80th anniversary of the Hardy-Weinberg law. Genetics 119: 731–737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Stark, A. E., 1977. Dwa uogólnienia prawa Hardy'ego-Weinberga w genetyce populacji (the Hardy-Weinberg law of population genetics and two generalizations). Matematyka Stosowana IX: 123–137. [Google Scholar]
  6. Stark, A. E., 1980. Inbreeding systems: classification by a canonical form. J. Math. Biol. 10: 305. [DOI] [PubMed] [Google Scholar]
  7. Stark, A. E., 2005. The Hardy-Weinberg principle. Genet. Mol. Biol. 28: 485. [Google Scholar]
  8. Weinberg, W., 1908. Über den nachweis der vererbung beim menschen. Jahresh. Verein f. vaterl. Naturk. Württem. 64: 368–382. [Google Scholar]
  9. Weinberg, W., 1963. On the demonstration of heredity in man, pp. 4–15 in Papers on Human Genetics, edited by S. H. Boyer. Prentice–Hall, Englewood Cliffs, NJ.

Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES