Joint estimation of selection intensity and mutation rate under balancing selection with applications to HLA

Montgomery Slatkin

doi:10.1093/genetics/iyac058

. 2022 Apr 18;221(2):iyac058. doi: 10.1093/genetics/iyac058

Joint estimation of selection intensity and mutation rate under balancing selection with applications to HLA

Montgomery Slatkin ^1,^✉

Editor: A F Agrawal

PMCID: PMC9157114 PMID: 35435218

Abstract

A composite likelihood method is introduced for jointly estimating the intensity of selection and the rate of mutation, both scaled by the effective population size, when there is balancing selection at a single multi-allelic locus in an isolated population at demographic equilibrium. The performance of the method is tested using simulated data. Average estimated mutation rates and selection intensities are close to the true values but there is considerable variation about the averages. Allowing for both population growth and population subdivision does not result in qualitative differences but the estimated mutation rates and selection intensities do not in general reflect the current effective population size. The method is applied to 3 class I (HLA-A, HLA-B, and HLA-C) and 2 class II loci (HLA-DRB1 and HLA-DQA1) in the 1000 Genomes populations. Allowing for asymmetric balancing selection has only a slight effect on the results from the symmetric model. Mutations that restore symmetry of the selection model are preferentially retained because of the tendency of natural selection to maximize average fitness. However, slight differences in selective effects result in much longer persistence time of some alleles. Trans-species polymorphism, which is characteristic of major-histocompatibility loci in vertebrates, is more likely when there are small differences in allelic fitness than when complete symmetry is assumed. Therefore, variation in allelic fitness expands the range of parameter values consistent with observations of trans-species polymorphism.

Keywords: MHC, divergent allele advantage, Ewens–Watterson test, negative frequency-dependent selection, trans-specific polymorphism

The major-histocompatibility complex (MHC) in humans is called the human leucocyte antigen (HLA) system. The HLA system is found on the short arm of chromosome 6. Many HLA genes are involved in the adaptive immune response. Two groups of HLA loci have been intensively studied, class I and class II, which are similar but differ somewhat in function. They both code for glycoproteins that bind with peptides and present them at the cell surface to T cells and natural killer cells. They are key components in the body’s ability to defend itself against pathogens. They are important clinically because many HLA alleles are associated with higher risk of auto-immune and other heritable diseases and with susceptibility and resistance to various pathogens (Trowsdale and Knight 2013). For example, the B27 allele at HLA-B is associated with a higher risk of ankylosing spondylitis (Evans et al. 2011) and alleles at the HLA-C locus are associated with different rates of HIV disease progression (Kulpa and Collins 2011). HLA loci are also critically important to the success of organ transplants.

Class I and class II HLA loci are unusually diverse, with dozens and even hundreds of alleles at each locus distinguishable both by serotyping and DNA sequencing (Radwan et al. 2020). Most of the variation among alleles is concentrated in the peptide-binding regions, the second and third exons of class I loci and the second exon of class II loci. There is clear evidence of trans-species polymorphism (TSP): some alleles at both class I and class II loci in humans are more similar to alleles in chimpanzees than they are to other alleles at the same loci in humans (Klein et al. 2007; Radwan et al. 2020). Both of these features have long been taken as evidence of strong balancing selection, meaning that on average individuals heterozygous at HLA class I and II loci have a higher fitness than homozygous individuals.

Two types of balancing selection have been commonly invoked, heterozygote advantage, and rare allele advantage (also called negative frequency-dependent selection). Both heterozygote advantage and rare allele advantage are assumed to result from the role class I and II loci play in the immune system. In general, alleles differ in their peptide-binding regions and hence can present different antigens for inspection by T cells and also regulate natural killer cell activity. Consequently, an individual that can present more kinds of antigens efficiently because it is more heterozygous will likely be able to mount an effective immune response to a larger variety of pathogens. The difference between the 2 hypotheses is, from a population genetics perspective, not very important. In the heterozygote advantage model, heterozygosity at each locus is itself favored by selection because it provides defense against a pathogen pool that is regarded as unchanging. In the rare allele advantage model, an allele in low frequency is favored because pathogens are not yet well adapted to it. As an allele increases in frequency, pathogens adapt and the allele’s contribution to fitness decreases. In the rare allele advantage model, HLA loci and pathogens coevolve in a way that results in higher average fitness of heterozygous individuals. In the heterozygote advantage model, coevolution plays no role. The reason the difference between the models is not important for population genetic analysis is that Takahata and Nei (1990) showed the equations that govern the change in allele frequency to be the same in the 2 models when the allelic fitness in the rare allele advantage model is a linearly decreasing function of frequency. Spurgin and Richardson (2010) provided further support for Takahata and Nei’s conclusion.

There is extensive theory of balancing selection. I will review here only developments directly related to this paper. Kimura and Crow (1964) introduced the infinite-alleles mutation model, in which each mutation at a locus creates an allele new to the population. They also introduced the symmetric model of heterozygote advantage in which every homozygous individual has a fitness of 1 – s relative to every heterozygous individual. Their model assumed a population of constant effective size, N. They showed that, for large N, the model’s behavior depends on only 2 parameters denoted here by $θ = 4 N µ$ , where µ is the mutation rate, and $S = 2 N s$ . I will call this model the symmetric overdominance model. It will serve as a baseline model with which other models will be compared.

Ewens (1972) showed that, for the infinite-alleles model with s = 0, the number of alleles, k, in a sample is a sufficient statistic for estimating $θ$ . Furthermore, he derived the sampling distribution of neutral alleles given k and the sample size. Watterson (1977) extended Ewens’s theory and derived the sampling distribution for overdominant alleles (s > 0). Watterson’s sampling distribution is calculable only for small S and small sample sizes, however. Watterson also showed that the computed homozygosity, $F = \sum_{i} p_{i}^{2}$ where $p_{i}$ is the frequency of the ith allele, is a powerful statistic for testing neutrality against the alternative of symmetric overdominance. The resulting test of neutrality is called the Ewens–Watterson test, which has been extensively applied to HLA data (Solberg et al. 2008). Samples of class I and II HLA loci in many populations lead to rejection of neutrality, reinforcing the idea that diversity at HLA loci is maintained by balancing selection.

Takahata (1990) developed an analytic approximation for the symmetric overdominance model valid if $θ ≪ S$ . Takahata showed that under this assumption, gene genealogies in the symmetric overdominance model are similar to neutral gene genealogies but with a much larger effective population size. Takahata showed that for some parameter values, the symmetric overdominance model predicted large numbers of alleles and TSP.

Satta et al. (1994) used Takahata’s (1990) results to develop a method for estimating the intensity of selection in the symmetric model. Satta et al. (1994) estimated the mutation rate in the peptide-binding region of class I and class II loci to be $1.7 \times 10^{- 6}$ and $7.5 \times 10^{- 7}$ , respectively, per generation. They then used Takahata’s (1990) analytic approximation combined with estimates of the nonsynonymous substitution rate to estimate the selection intensity (s) of the major class I and class II loci to be between 0.0007 (for HLA-DPB1) and 0.042 (for HLA-B). Yasukochi and Satta (2013) revisited the problem using more extensive sequence data and confirmed the previous results. Yasukochi and Satta (2013) (Table 2) estimated s for HLA-A (0.0225), HLA-B (0.044), and HLA-DRB1 (0.0194) with smaller values for other loci. Assuming N = 100,000, they concluded that values of S for these 3 loci were very large (4,500, 8,825, and 3,890, respectively). Yasukochi and Satta (2013) estimated the scaled mutation rates to be much smaller than 1, with θ between 0.04 and 0.92.

Table 2.

Numbers of alleles that had ages of 100,000 generations or longer in the Vhet and Vpli models.

Model	θ	u_i uniform on (0,1)		u_i = 1
		S = 500	S = 1,000	S = 500	S = 1,000
Vhet	0.1	26/49,856	23/49,640	42/50,157	51/50,462
Vhet	0.5	40/250,193	44/249,522	1/250,218	1/249,550
Vhet	1	38/499,395	52/499,647	0/500,532	0/499,474
Vhet	3	17/1,498,229	29/1,501,249	0/1,498,887	0/1,500,634
Vhet	5	8/2,497,340	9/2,495,891	0/2,500,584	0/2,501,916

		r_i uniform on (0,0.05)		r_i $= 0.0$
		S = 500	S = 1,000	S = 500	S = 1,000
Vpli	0.1	9/55,036	20/54,855	41/55,432	51/55,116
Vpli	0.5	24/274,206	28/273,898	0/274,283	2/274,966
Vpli	1	27/550,682	38/549,395	0/550,662	0/550,021
Vpli	3	35/1,650,974	42/1,649,599	0/1,647,165	0/1,649,150
Vpli	5	34/2,750,869	38/2,752,222	0/2,751,412	0/2,749,658

Open in a new tab

The total numbers of mutations are in the denominator. Alleles were counted only if they arose after the end of the 100,000 generation burn-in period and were lost or still segregating by the end of 1,000,000 generations after the end of the burn-in period.

In this article, I introduce a method for jointly estimating $θ$ and S from the allele frequency spectrum under the assumption of symmetric overdominance in an equilibrium population. I then examine by simulation the dependence of the estimated parameter values when the assumptions of the symmetric model are relaxed in various ways. I will show that small variations in the fitness across heterozygous genotypes can make the long persistence of some alleles much more likely, which implies that TSP is facilitated by subtle variation in allelic contributions to fitness.

Allele frequency spectrum

The basis for the method presented here is the allele frequency spectrum for the model of symmetric balancing selection. Muirhead and Wakeley (2009) derived the frequency spectrum for a population of 2 N haploid individuals under frequency-dependent selection. They extended the Moran model to allow for multiple alleles and linear dependence of the death rate on allele frequency (their selection-at-death model). In this model, the probability that an allele decreases by 1 copy is proportional to $1 - s x$ , where s is the selection intensity and x is the allele frequency. They relied on Takahata and Nei’s (1990) result that the linear dependence of fitness on allele frequency is formally equivalent to symmetric balancing selection. Muirhead and Wakeley (2009) derived the expectation of the allele frequency spectrum, which is the number of alleles in each frequency class, given the scaled mutation rate $θ = 2 N u$ and the scaled selection intensity $S = N s$ . Because the effective size of a Moran model is smaller by a factor of 2 than the effective size of a Wright–Fisher model with the same number of individuals (Muirhead and Wakeley 2009), their results are applicable to a diploid population containing N individuals.

In Muirhead and Wakeley’s (2009) derivation, they represented the spectrum as a vector of random variables, $X = \{X_{1}, X_{2}, \dots, X_{2 N}\}$ where $X_{i}$ is the number of alleles present in i copies. Necessarily $\sum_{i = 1}^{2 N} {i X}_{i} = 2 N$ . The number of distinct alleles is $k = \sum_{i = 1}^{2 N} X_{i}$ . For example, if $X = \{2,1, 0,0, 0,1, 0,0, 0,0\}$ , $N = 5$ and the population contains 2 alleles present in 1 copy each, 1 allele present in 2 copies, and 1 allele present in 6 copies. They derived a system of linear equations for the expectations of $X_{i}$ , $E (X_{i})$ ( $i = 1, \dots, 2 N$ ).

E (X_{i}) = \sum_{j = 1}^{\min (i, 2 N - i)} \frac{i + j}{i (2 N - i)} \frac{2 N u}{1 - u} (\prod_{r = 1}^{j} \frac{µ_{i + r}}{µ_{j - r + 1}}) E (X_{i + j}) + \sum_{j = \min (i, 2 N - i) + 1}^{2 N - i} \frac{i + j}{i (2 N - i)} \frac{2 N u}{1 - u} (\prod_{r = 1}^{i} \frac{µ_{j + r}}{µ_{i - r + 1}}) E (X_{i + j})

(1)

( $i = 1,2, \dots, 2 N$ ), where $µ_{j} = 1 + s j / (2 N)$ . These equations can be solved iteratively by choosing an arbitrary value of $E (X_{2 N})$ and then computing $E (X_{i})$ for successively smaller values of i until $i = 1$ is reached.

To compute the expected frequency spectrum, Muirhead and Wakeley used the normalization condition $\sum_{i = 1}^{2 N} i E (X_{i}) = 2 N$ . Here, I want the probability that a randomly chosen allele is present in i copies. Therefore, I use the normalization condition $\sum_{i = 1}^{2 N - 1} E (X_{i}) = 1$ and define

ϕ_{i} = E (X_{i}) / \sum_{j = 1}^{2 N - 1} E (X_{j})

$i = 1, \dots, 2 N - 1.$ The upper limit of i is $2 N - 1$ instead of 2N because only polymorphic loci will be considered. Therefore, $ϕ_{i}$ is the probability that an allele at a polymorphic locus is present in i copies.

The $ϕ_{i}$ are the probabilities for the entire population. In a sample of n individuals, the probabilities can be obtained by assuming that 2n copies are randomly chosen from the population with replacement. Therefore, the probability that there are j copies in the sample given i copies in the population is the binomial distribution

\Pr (j| i, n) = (\begin{matrix} 2 n \\ j \end{matrix}) {(\frac{i}{2 N})}^{j} {(1 - \frac{i}{2 N})}^{2 n - j}

(2)

By summing over i,

ϕ_{j, n} = \sum_{i = 1}^{2 N - 1} ϕ_{i} (\begin{matrix} 2 n \\ j \end{matrix}) {(\frac{i}{2 N})}^{j} {(1 - \frac{i}{2 N})}^{2 n - j}

(3)

for $j = 1, \dots, 2 n - 1$ . The subscript n is added to indicate the sample size. The normalization condition for $ϕ_{i}$ implies $\sum_{j = 1}^{2 n - 1} ϕ_{j, n} = 1$ for any n.

Assuming sampling with replacement is an approximation made under the assumption that there is a diffusion limit to the Moran model analyzed by Muirhead and Wakeley (2009) if the scaled parameters S and $θ$ are fixed and N becomes large. If that is true, then the $ϕ_{i}$ based on Equation (1) will approximate the normalized spectrum for a population with much larger N. I did not attempt to prove there is a diffusion limit to the Muirhead and Wakeley model but it is a reasonable assumption because Watterson (1977) proved there is a diffusion limit to the corresponding Wright–Fisher model. In the numerical solution to Equation (1), N was chosen to be 2,500. Larger values led to computational difficulties.

Composite likelihood method

The data for a single locus is the allele frequency spectrum in a sample of n individuals, $D = {x_{1}, x_{2}, \dots, x_{2 n - 1}}$ where $x_{j}$ is the number of alleles found in j copies ( $\sum_{j = 1}^{2 n - 1} j x_{j} = 2 n$ ). The number of alleles, k, is $\sum_{j = 1}^{2 n - 1} x_{j}$ . The probability of D can be approximated by assuming the $x_{j}$ are a random draw from a multinomial distribution with probability vector $ϕ_{j, n}$ and sample size $2 n$ :

L (D | θ, S) \propto \prod_{j = 1}^{2 n - 1} {(ϕ_{j, n})}^{x_{j}}

(4)

where the multinomial coefficient is omitted because it does not affect the later analysis. The result is a composite likelihood because Equation (4) assumes independence of alleles when in fact they are not independent: their frequencies have to sum to 1. In the assumed diffusion limit, the $ϕ_{j, n}$ depend only on S and $θ$ . The maximum likelihood values of the 2 parameters, $\hat{S}$ and $\hat{θ}$ , can then be found numerically. In the results presented below, a grid search with spacing 1 for $θ$ (with a range 1–40) and 20 for S (with a range 0–1,000) was used.

Simulation tests

The first test of this method assumes the symmetric equilibrium model. A Wright–Fisher model with N = 10,000 diploid individuals was subject to balancing selection. Every heterozygote had a relative fitness of 1 and every homozyote had a relative fitness of 1 – s. Mutations occurred with probability µ per generation. Each mutant allele was new to the population. The population was initially fixed for a single allele. After a burn-in period of 100,000 generations, each replicate simulation continued for 1,000,000 generations with samples taken with replacement every 10,000 generations. Ten replicates were run for each set of parameter value. Averages were taken over the 1,010 samples.

For each sample, $\hat{S}$ and $\hat{θ}$ were obtained by using a grid search described above. In addition to estimating S and $θ$ , k and F were computed. From k, F, and n, the 1-tailed Ewens–Watterson test (Watterson 1977) yielded the probability P that the observed value of F is smaller than a random value obtained by simulations with S = 0.

Figure 1 shows the averages of $\hat{θ}$ and $\hat{S}$ for a range of values of S and $θ$ . The pattern in the results shown in Fig. 1 are representative of other values of $θ$ . Averages of $\hat{θ}$ are close to the true values for a wide range of selection intensities, except when $θ \leq 2$ and S is relatively small. The average of $\hat{S}$ is close to the true value except for the largest value of S. The downward bias for large S results from the fact that the upper limit of the search interval for $\hat{S}$ was 1,000. Figure 2 shows that the range of variation in $\hat{θ}$ across samples is not large both for relatively weak ( $S = 50$ ) and relatively strong ( $S = 500$ ) selection. In contrast, Fig. 3 shows that the range of variation in $\hat{S}$ across samples is quite large. The extensive variation in $\hat{S}$ reflects the stochastic variation in allelic configurations across samples even when selection is strong. We can conclude from these results that if the equilibrium symmetric model is valid, $θ$ can be estimated with some confidence but individual estimates of S are probably not accurate.

Fig. 2. — Distribution of $\hat{θ}$ in samples from a population of size 10,000. The simulations are the same as those presented in Fig. 1. The distribution of the 1,010 sample values are shown for S = 50 (red) and S = 500 (blue).

Fig. 3. — Distribution of $\hat{S}$ in from a population of size 10,000 The simulation results are the same as those presented in Fig. 1. Distributions of the 1,010 sample values are shown for S = 50 (red) and S = 500 (blue).

Figure 4 shows that the power of the Ewens–Watterson test (Watterson 1977) to reject neutrality decreases as $θ$ increases. A comparison of the results in Figs. 3 and 4 indicates that when values of $\hat{S}$ exceed 100, the Ewens–Watterson test has some power to reject neutrality. But unless θ is small and both S and n are large, the test does not in general have much statistical power. That conclusion is consistent with the results of Solberg et al. (2008) and others. For HLA loci in many populations, neutrality is rejected in some but not all cases.

Fig. 4. — Power of Ewens–Watterson test to reject the hypothesis of neutrality at the 5% level (one-sided test) in the simulated data sets for a stable population with samples of n = 50 and 500 individuals. The simulations are the same as those presented in Figs. 1–3. Both parts show results for $θ =$ 2 (red), 3 (blue), 5 (green) and 10 (black).

Application of the composite likelihood method

The composite likelihood method was applied to the data of Gourraud et al. (2014). They used Sanger sequencing to determine the HLA genotypes at 5 loci (HLA-A, HLA-B, HLA-C, HLA-DQB1, and HLA-DRB1) of 930 individuals in the 1,000 Genomes panel. They distinguished alleles by both the serotype and the sequence of the peptide-binding region. Thus, for example, HLA-A*0101 was treated as a different allele than HLA-A*0102; 01 and 02 represent different amino acid sequences of the peptide-binding region of the HLA-A*1 allele. The data were extracted from Supplementary Table 2 of the Gourraud et al. paper. Data were available for 14 populations, 2 from Africa (LWK, YRI), 4 from the Americas (ASW, CLM, MXL PUR), 4 from East Asia (CHB, CHD, CHS, JPT), and 4 from Europe (CEU, FIN, GBR, TSI).

Table 1 shows the result of applying the composite likelihood method to the Gourraud et al. data set. Also indicated are the samples for which the Ewens–Watterson test rejected neutrality at a 5% significance level. Estimates of $θ$ for each locus are roughly consistent across populations. Estimates for HLA-B are generally the largest and estimates for HLA-DQB1 are generally the smallest. Estimates for HLA-A, HLA-C, and HLA-DRB1 are intermediate.

Table 1.

Analysis of the 1000 Genomes data from Gourraud et al. (2014).

	Africa		America				East Asia				Europe
	LWK	YRI	ASW	CLM	MXL	PUR	CHB	CHD	CHS	JPT	CEU	FIN	GBR	TSI
A	90	60	53	70	59	70	90	90	100	91	75	100	96	90
	4	5	5	10	5	7	4	5	4	3	4	3	4	7
	260^a	140	260^a	20	100	80	20	20	20	20	20	20	20	20
	*1.77*	*1.81*	*1.67*	*2.6*	*1.87*	*2.06*	*2.36*	*2.86*	*2.52*	*3.04*	*3.37*	*3.24*	*2.65*	*2.71*
B	90	60	53	70	59	70	90	90	100	91	75	90	96	90
	6	9	18	10	20	13	11	13	10	4	10	4	5	8
	220^a	80	20	700^a	500	260	140	40	20	340^a	20	140^a	260^a	300^a
	*1.89*	*2.02*	*2.24*	*1.66*	*1.67*	*1.88*	*2.11*	*2.40*	*3.23*	*1.71*	*2.53*	*1.91*	*1.87*	*1.89*
C	90	60	53	70	59	70	90	90	100	91	75	90	96	90
	4	4	6	3	4	5	2	6	4	2	3	1	2	4
	80	20	40	140^a	100	100	160^a	20	20	120^a	120^a	220^a	240^a	120
	*2.01*	*2.15*	*2.20*	*1.77*	*1.92*	*1.96*	*1.77*	*2.55*	*2.41*	*1.67*	*1.82*	*1.58*	*1.62*	*1.99*
DRB1	90	60	53	70	59	70	90	90	100	91	75	100	96	90
	3	4	4	9	6	11	7	7	6	4	7	4	4	6
	140^a	260^a	780^a	420^a	360^a	120	140	20	40	160	20	60	100	140
	*1.82*	*1.64*	*1.44*	*1.75*	*1.68*	*2.02*	*2.04*	*2.65*	*2.40*	*1.89*	*2.73*	*2.19*	*2.04*	*1.99*
DQB1	90	60	53	70	59	70	90	90	100	91	75	100	96	90
	2	1	2	2	2	2	1	2	2	2	2	1	1	2
	20	80	60	80^a	40	60	180^a	40	60	120^a	40	140^a	120^a	80^a
	*2.29*	*1.69*	*1.69*	*1.67*	*1.84*	*1.88*	*1.61*	*1.88*	*1.86*	*1.79*	*1.82*	*1.40*	*1.56*	*1.84*

Open in a new tab

Each cell contains the sample size (in italics), $\hat{θ}$ (in plain text), $\hat{S}$ (in bold), and the product kF (in bold italics).

The P value from the Ewens–Watterson is less than 0.05 (based on 1,000 replicate simulations of the neutral distribution of F).

Estimates of S are much more variable than estimates of $θ$ , as would be expected from Fig. 3. Estimates for HLA-B tend to be the largest. With the exception of the 2 African populations, estimates for HLA-A tend to be the smallest. For each locus, neutrality can be rejected in some populations. Because of the dependence of the power of the Ewens–Watterson test on the mutation rate (see Fig. 3), neutrality is rejected for HLA-B only for large $\hat{S}$ .

Deviations from the symmetric equilibrium model

The symmetric equilibrium model is relatively easy to analyze but it is not appropriate for human HLA data for several reasons. The demographic assumptions that human populations have been of constant size for a long time and isolated from one another are obviously false. Human populations have grown rapidly in the recent past and have experienced bottlenecks in size in the more distant past (Li and Durbin 2011; Spence et al. 2018). Human populations show a pattern of isolation-by-distance both within and among geographic regions (Auton et al. 2009). The symmetry of the selection model is also unrealistic. Complete equivalence of alleles that are part of such a complex biological process seems implausible a priori. Models of interactions with pathogens do not result in symmetry (Borghans et al. 2004; De Boer et al. 2004; Siljestam and Rueffler 2019; Stefan et al. 2019). And empirical evidence of various kinds argues against symmetry (Paterson et al. 1998; Hedrick 2002; Ilmonen et al. 2007; Stoffels and Spencer 2008; Bronson et al. 2013; Radwan et al. 2020).

Here, I consider the effects of deviations from the symmetric equilibrium model, first relaxing the demographic assumptions and then relaxing the assumption of symmetric balancing selection.

Population growth

I simulated a model of exponential population growth by having a burn-in period of 100,000 generations (as in the simulations of a stable population). Then, exponential growth occurred for 10,000 generations with a specified doubling time. Five samples were drawn, at 0, 2,500, 5,000, 7,500, and 10,000 generations after the end of the burn-in period. One hundred replicates for each set of parameter values were run.

Figure 5 shows some results. Values of $\hat{θ}$ increase with the population size but not as rapidly as $4 N (t) µ$ . The results are different for $\hat{S}$ . There is little systematic increase with time. The intuitive reason is that $\hat{S}$ depends mainly on alleles in relatively high frequency. Those alleles were already in high frequency at the onset of population growth and growth did not change their frequencies by much. In contrast, $\hat{θ}$ depends primarily on the numbers of low-frequency alleles. In fact, the average number of singletons is approximately $θ$ , as shown in Fig. 6. Low-frequency alleles tend to be young and reflect the recent influx of mutations which increases with the population size.

Fig. 5. — Estimates of θ and S for model of exponential population growth with the true value of θ = $4 N_{0} µ$ = 2 and 3 doubling times, 2,000 (blue), 3,000 (black), and 4,000 (red) generations, where $N_{0} = 10,000$ is the population size during the 100,000 generation burn-in period. The dashed lines show $4 N (t) µ$ for the 3 growth rates in colors corresponding to the solid lines.

Fig. 6. — Average number of singletons found in samples of 500 individuals under the assumption of symmetric balancing selection of different strengths. As in Figs. 1–5, the averages are taken over 10 replicates each containing 101 samples taken every 10,000 generations after a burn-in period of 100,000 generations. The solid lines show the averages for $θ = 1$ (red), $θ = 3$ (blue) and $θ = 5$ (black). The dashed lines show the true values of $θ$ in each of the cases.

Population subdivision

As in the other simulations, there was a 100,000 generation burn-in period in which there was a single population of size 10,000. That population gave rise to an island model: 5 descendant populations, each of size N = 10,000, exchanged migrants with one another each generation at a rate m/4. In the results, the migration rate is scaled by 4 N, $M = 4 N m$ . Samples of 500 individuals were drawn with replacement from each population every 10,000 generations and analyzed as if they were independent replicates. In addition, a sample of 500 individuals was drawn with replacement from all 5 populations combined.

Some results are shown in Fig. 7 for M = 19, for which the expected value of Wright’s $F_{S T}$ is $1 / (1 + α M)$ , where $α = n^{2} / {(n - 1)}^{2}$ and n is the number of populations (Crow and Aoki 1984). In this case, n = 5 and the expectation of $F_{S T}$ is 0.032 which is a typical value for human populations within the same continent (Elhaik 2012). Estimates of θ for each subpopulation are larger than the true value. The reason is that immigration introduces low-frequency alleles that are equivalent to new mutations. Estimates of θ for the pooled sample are appropriately larger and of the same order of magnitude as the net mutation rate in the 5 populations together ( $10 N µ)$ .

For smaller values of S, $\hat{S}$ is biased upwards in each population and in the pooled population. Population subdivision tends to make the distribution of allele frequencies more even than in a single population, a pattern indicative of balancing selection that is reflected in larger $\hat{S}$ .

Models of asymmetric balancing selection

To examine the effects of nonequivalence of HLA alleles, I simulated 3 models of asymmetric balancing selection. They do not exhaust the range of possibilities but they illustrate some general properties of such models.

The 3 models are (1) the variable homozygous fitness model (Vhom), (2) the variable heterozygous fitness model (Vhet), and (3) the variable pleiotropy (Vpli) model. In the Vhom model, each heterozygous individual has a relative fitness of 1 (as in the symmetric model) but each mutant allele is assigned a homozygous fitness of $1 - {s (1 + v}_{i})$ where $v_{i}$ is drawn from a uniform distribution on $(0, v_{\max})$ . In the simulation results presented here, $v_{\max} = 1.$ In the Vhet model, every homozygote has a relative fitness of $1 - s$ (as in the symmetric model) and every heterozygote has a relative fitness of $1 + s (u_{i} + u_{j}) / 2$ , where $u_{i}$ is attached to the ith allele when it appears as a new mutation: $u_{i}$ is drawn from a uniform distribution on $(0, u_{\max})$ . In the results presented here, $u_{\max} = 1$ . In the Vpli model, symmetric balancing selection of intensity s is assumed. In addition, each mutation has a pleiotropic effect that reduces relative fitness by a factor $1 - r_{i}$ , where $r_{i}$ is drawn from a uniform distribution on $(0, r_{\max})$ . The pleiotropic effect is multiplicative: an $A_{i} A_{i}$ individual has relative fitness $(1 - s) {(1 - r_{i})}^{2}$ and an $A_{i} A_{j}$ individual has relative fitness $(1 - r_{i}) (1 - r_{j})$ . In the simulation results $r_{\max} = 0.05$ .

It will be useful to see the dependence of the average fitness on the selection parameters in each model. Assume in a population that there are k alleles and the frequency of allele $A_{i}$ is $p_{i}$ . It is straightforward to show that if the genotypes are in their Hardy-Weinberg frequencies, the mean relative fitnesses for the 3 models are

\bar{w} = 1 - s \sum_{i = 1}^{k} p_{i}^{2} - s \sum_{i = 1}^{k} v_{i} p_{i}^{2}

(5)

for the Vhom model,

\bar{w} = 1 - s \sum_{i = 1}^{k} p_{i}^{2} + s \sum_{i = 1}^{k} p_{i} {(1 - p_{i}) u}_{i}

(6)

for the Vhet model, and

\bar{w} = {(1 - \sum_{i = 1}^{k} p_{i} r_{i})}^{2} - s \sum_{i = 1}^{k} p_{i}^{2} {(1 - r_{i})}^{2}

(7)

for the Vpli model. For a given set of allele frequencies, $\bar{w}$ in Equation (5) is a decreasing function of the $v_{i}$ . In Equation (6), $\bar{w}$ is an increasing function of the $u_{i}$ and in Equation (7), $\bar{w}$ is a decreasing function of the $r_{i}$ .

Figure 8 shows the results from simulations of the 3 models and a comparison with the comparable symmetric model. Results from the Vhom model (black curves) are very similar to those from the symmetric model (red curves). The Vhet model (blue curves) reduces k and $\hat{θ}$ but has little effect on F or $\hat{S}$ . The Vpli model has a substantial effect on all statistics. These results are representative of those from simulations with other values of θ.

Fig. 8. — Comparison of models of asymmetric balancing selection. In all cases θ = 3. Each model was run for a 100,000 generation burn-in period and then the simulation was run for an additional 1,000,000 generations. Samples of 500 individuals were collected every 10,000 generations after the burn-in period beginning with generation 0. Each point shown is the average of the results for 101 samples. The curves in red show the results for the symmetric model. The green curves show the results for the variable pleiotropy model (Vpli) with $r_{\max} = 0.05 .$ The blue curves show the results for the variable heterozygosity model (Vhet) with $u_{\max} = 1$ . The black curves show the results for the variable homozygosity model (Vhom) with $v_{\max} = 1$ .

We can understand these results more easily by plotting the allele frequencies against values of the variable parameter in each model, $v_{i}$ , $u_{i}$ or $r_{i}$ . Figure 9 shows snapshots of the simulated populations for samples of 500 individuals taken 400,000 generations after the end of the burn-in period. For the Vhom model, there is a slight tendency for alleles with smaller $v_{i}$ to be in higher frequency, especially for stronger selection. For the Vhet and Vpli models, there is a strong relationship between allelic effect and allele frequency. For the Vhet model, alleles with large $u_{i}$ are in higher frequency, while for the Vpli model, alleles with small $r_{i}$ are in higher frequency.

Fig. 9. — Allele frequency plotted against variable selection parameters for 3 models of asymmetric selection. A sample of 500 individuals was taken 400,000 generations after the end of the burn-in period in each replicate simulation of the variable homozygosity (Vhom), variable heterozygosity (Vhet), and variable pleiotropy (Vpli) models described in the text. In all cases θ = 3. In the Vhom model $v_{\max} = 1$ ; in the Vhet model $u_{\max} = 1$ ; and in the Vpli model $r_{\max} = 0.05 .$ Note the different scales used for the horizontal axis of the Vpli model results.

These patterns are found because of the tendency for selection to increase mean fitness in a single-locus model when relative fitnesses are constant, a general principle in population genetics (Kingman 1961). In all 3 models, alleles differ in their effect on fitness when they arise by mutation. But those alleles that tend to increase mean fitness by more will be retained longer than those that have a weaker effect on mean fitness. The result is a balance between the introduction of alleles by mutation and their stochastic loss at a rate that depends on their effect on mean fitness.

The difference between the Vhom model and the other 2 can be explained by the relatively weak dependence of $\bar{w}$ on the $v_{i}$ . In Equation (5), $\bar{w}$ is reduced by $v_{i} p_{i}^{2}$ for allele $A_{i}$ . Because allele frequencies in these models tend to be small, the effect of each allele on mean fitness in the Vhom model is quite small. In contrast, the increase in $\bar{w}$ in the Vhet model for allele $A_{i}$ is $p_{i} {(1 - p_{i}) u}_{i}$ and hence is proportional to $p_{i}$ instead of $p_{i}^{2}$ . Although less obvious in Equation (7), the dependence of $\bar{w}$ on $r_{i}$ also decreases linearly $p_{i}$ in the Vpli model. These patterns are consistent with the intuition that variation in allelic effects are least important in the Vhom model because homozygous individuals are relatively infrequent. Variation in allelic effects is more important in the Vhet model because all heterozygous individuals are affected and they are much more frequent. The effect is larger still in the Vpli model because both homozygotes and heterozygotes are affected.

Although there is a tendency to increase $\bar{w}$ in these models, the maximum is never reached. Alleles that contribute most to increasing $\bar{w}$ will still be lost because of genetic drift. The balance between the introduction of alleles by mutation and their eventual stochastic loss is eventually reached. One way to characterize this balance is with the weighted average of the random parameter: $\bar{v} = \sum_{i = 1}^{k} p_{i} v_{i}$ , $\bar{u} = \sum_{i = 1}^{k} p_{i} u_{i}$ , and $\bar{r} = \sum_{i = 1}^{k} p_{i} r_{i}$ . In simulations with no burn-in period, these quantities stabilize by roughly 20,000 generations, as shown in Fig. 10. There is no tendency to increase after that time.

Fig. 10. — Weighted average values of $u_{i}$ and $r_{i}$ in simulations of the Vhet (with $u_{\max} = 1$ ) and Vpli (with $r_{\max} = 0.05$ ) models no burn-in period. In both parts, θ = 3 and N = 10,000. Initially, each replicate was fixed for a single allele. Results for S = 1,000 (black), S = 500 (red), and S = 100 (blue) are shown.

An additional result from simulations of the Vhet and Vpli models is that they can both lead to more long-lasting alleles than do comparable symmetric models for moderate mutation rates, θ=0.5 or larger. For θ=0.1, the reverse is true. Figure 11 compares histograms of allele ages with and without variation in allelic fitness. For the Vhet model with $u_{i}$ distributed uniformly on (0,1), the appropriate model for comparison is one in which $u_{i} = 1$ because $u_{i}$ is nearly 1 for alleles in moderate frequency (Fig. 9). For the Vpli model, the model comparable to one in which $r_{i}$ is uniform on (0,0.05) is one in which $r_{i}$ = 0 (Fig. 9). Table 2 quantifies the general patterns found. In Fig. 11, there are fewer alleles overall when the selection coefficients are random. The reason is that some mutations reduce mean fitness sufficiently that they are quickly lost, in effect reducing the mutation rate of alleles that will remain for longer times. Nevertheless, some of the surviving mutations persist for longer times than do alleles in a comparable symmetric model.

Fig. 11. — Paired histograms of alleles whose ages exceeded 100,000 generations. In all cases, a population of 10,000 individuals was simulated for 1,000,000 generations after a 100,000 generation burn-in period. The ages for all alleles that arose by mutation after the end of the burn-in period and were lost before the end of the 1,000,000 generations were recorded. In each of the paired histograms, the ones on the left are for $u_{i} = 1$ (Vhet) or $r_{i} = 0$ (Vpli) and the ones on the right are for randomly generated values. For the Vhet model, $u_{i}$ was uniformly distributed on (0,1) and for the Vpli model $r_{i}$ was uniformly distributes on (0,0,05).

Trans-species polymorphism

Models of balancing selection can easily be made to generate high heterozygosity and large numbers of alleles by increasing the mutation rate and selection intensity sufficiently. However, a relatively high mutation rate, even when combined with strong balancing selection, results in a high turnover of alleles. Many alleles are maintained at each time but they do not stay in the population long enough to account for trans-species polymorphism (TSP). For humans and chimps to share alleles, those alleles had to have persisted in both populations for 6 or more million years, more than 300,000 generations assuming an average generation time of 20 years.

Population geneticists have long been aware of this problem (Radwan et al. 2020). One solution is to assume that parameter values lie within a restricted range for which both a large number of alleles and long allelic persistence times are possible. Takahata (1990) used his analytic theory to show very low mutation rates and very strong selection together are needed. He estimated that the mutation rate in the peptide-binding regions of HLA loci to be roughly 10⁻⁷ per generation implying $θ = 0.004$ if the effective population size is 10,000. With s = 0.1 (S = 2,000), the expected time to loss of a selected allele is roughly 600,000 generations, more than enough to account for TSP between humans and chimps (Takahata 1990). For those parameter values, the effective number of alleles ( $n_{e}$ =1/F) exceeds 20, which is also consistent with observations.

Although Takahata’s results can account for TSP, they can do so only for a small range of parameter values: θ must be very small, much less than 1, and S must be very large (1,000 or greater) and that has to be true for every MHC locus that exhibits TSP. The parameter values that predict TSP also predict that allele frequencies are quite evenly distributed. We can quantify the evenness of the distribution of allele frequencies by computing the product kF. If frequencies are equal, this product will be one because $p_{i} = 1 / k$ and hence F = 1/k. This product exceeds 1 by an amount that reflects the unevenness of the allele frequencies. Figure 12 shows some values from the simulations presented earlier. If θ=0.1, kF approaches 1 for even moderate selection intensities. For larger θ, however, kF remains substantially larger than 1 even for S = 1,000. In the 1000 Genomes data, Table 1 shows that observed values of kF are greater than 1 and often greater than 2.

Fig. 12. — Values of the product of k (the number of alleles) and F (the homozygosity) for the symmetric model of balancing selection. The results are obtained from the same simulations as shown in Figs. 1–3. Results are shown for 3 different mutation rates, θ = 0.1 (red), 1 (blue), and 3 (black).

The simulation results presented above suggest another possible explanation for TSP, variation among alleles in their contribution to fitness. For the Vhet and Vpli models, variation among alleles greatly increases the chance that some alleles will remain in the population for a long time, much longer than is found for comparable symmetric models (Fig. 11 and Table 2).

Discussion and conclusion

This paper generalizes the Kimura and Crow (1964) symmetric infinite-alleles model of balancing selection in an equilibrium population. First, it uses the analytic results of Muirhead and Wakeley (2009) as the basis of a composite likelihood method for jointly estimating the scaled mutation rate ( $θ = 4 N µ$ ) and the scaled selection coefficient ( $S = 2 N s)$ . The method applied to data simulated under the symmetric equilibrium model results in estimates that are on average close to the true values. There is considerable variation of the estimates about the true values however. Roughly speaking, estimates of θ are within a factor of 2 while estimates of S are only within a factor of 5 or 10. There is limited information in the allele frequency spectrum in a population sampled at a single time.

The application of this method to HLA data presented by Gourraud et al. (2014) yielded a range of parameter estimates. As is predicted by the simulation results, the range of variation across populations in $\hat{θ}$ is smaller than the range of variation in $\hat{S}$ for each locus. There is some consistency in $\hat{θ}$ within loci. For example, $\hat{θ}$ values for HLA-DQA1 are substantially smaller than for HLA-B. There is less consistency in values of $\hat{S}$ for each locus, but values for HLA-B tend to be larger than for other loci. We can tentatively conclude that the variability at HLA-B reflects both stronger selection and a higher mutation rate than at the other loci examined.

There is a weak relationship between $\hat{S}$ and the outcome of the Ewens–Watterson test of neutrality. Samples for which $\hat{S}$ exceeds 100 generally reject neutrality at the 5% level if $\hat{θ}$ is small. For larger $\hat{θ}$ , some cases with large $\hat{S}$ do not reject neutrality. That pattern is also predicted by the simulations. They showed that higher mutation rates reduce the power of the Ewens–Watterson test for a given S. The failure to reject neutrality in a sample does not necessarily indicate that selection is weak, only that the Ewens–Watterson test has relatively low statistical power.

Simulations of models with population growth and population subdivision showed that $\hat{θ}$ and $\hat{S}$ are affected by deviations from the equilibrium model. Even in these relatively simple models, the estimates do not reflect the local effective population size at the time the samples were taken. There is no reason to think that more complex and realistic demographic assumptions would alter that conclusion. Instead, $\hat{θ}$ and $\hat{S}$ obtained from a single sample have to be interpreted as indicating that the allele frequency spectrum is comparable to one expected in a stable isolated population with those parameter values. When several loci from the same population are analyzed, the values of $\hat{θ}$ indicate the relative mutation rates because all loci experienced the same demographic history. For example, the results in Table 1 indicate that the mutation rate at HLA-B is probably 5–10 times higher than the mutation rate at HLA-DQB1.

The estimates of θ and S in Table 1 differ from those obtained by Yasukochi and Satta (2013). They estimated θ (4M in their notation) from inferred rates of replacement substitutions ( $K_{B})$ , the selective advantage of replacement to silent substitutions ( $γ)$ , and the numbers of sites in the peptide-binding regions of different HLA class I and II loci. Their estimates of θ ranged from 0.04 for HLA-B, HLA-DRB1, and DPB1 to 0.56 for HLA-DQB1 and 0.6 for HLA-C. These estimates are different from those shown in Table 1 which are largely reflecting the numbers of rare alleles. The methods are so different that it is difficult to know why the estimates of θ are different. The rate of gene conversion is not accounted for in the estimate of $K_{B}$ . Even the relative rates are dissimilar. In Table 1, HLA-B has the largest estimates of θ while Yasukochi and Satta (2013) inferred that it had one of the smallest. The reverse is true for HLA-DQB1.

Yasukochi and Satta’s (2013) estimates of S are larger than those in Table 1. They range from roughly 500 for HLA-DQB1 to well over 1,000 for the other loci. The difference in those estimates is probably attributable to the fact that Yasukochi and Satta’s estimates of θ are smaller than those in Table 1. Yasukochi and Satta assumed that Takahata’s (1990) analytic approximation was valid, which it would be if the mutation rates are as small as their estimated values. One thing that argues against the estimates of θ and S obtained by Yasukochi and Satta is that, if accurate, the Ewens–Watterson test should always reject the hypothesis of neutrality. Both here (Table 1) and in the previous analysis by Solberg et al. (2008), neutrality is sometimes rejected but often is not. Furthermore, values of the product kF shown in Table 1 are generally larger than is expected when selection is very strong and mutation is very weak. Both my analysis and that of Yasukochi and Satta (2013) reflect different aspects of the data and hence are not easily compared. Further analysis is needed to reconcile these differences.

The models of asymmetric selection analyzed here suggest that variation in allelic effects on fitness can have 2 consequences. First, differences in the allelic contribution to fitness result in a preferential retention of alleles that increase mean fitness. Consequently, asymmetry in the mutation process does not necessarily lead to substantial deviations from the predictions of the symmetric model. Here, the asymmetry is imposed at the mutation stage, but models of host-pathogen interaction show how such asymmetry can arise. For example, De Boer et al. (2004) assume allelic fitness depends on the relationship between the sequence of the peptide-binding region and the range of antigen sequences. In fact, Fig. 2 of that paper shows results similar to those in Fig. 9 above. Stefan et al. (2019) generalized the De Boer et al. model and emphasized the advantages of divergent alleles that would bind with a wider range of antigens. Stefan et al. (2019) also found that more fit alleles would tend to accumulate at higher frequencies. Lighten et al. (2017) make similar assumptions. Siljestam and Rueffler (2019) assume there is a tradeoff between an allele’s ability to confer resistance to specific pathogens and the number of pathogens that can be resisted. These models and similar ones represent hypotheses about how fitness differences among alleles arise and are maintained by pathogen interaction and coevolution. Once the alleles are present in the population and their fitnesses are determined by interactions with pathogens, alleles are still governed by standard population genetic processes.

One class of models not analyzed here are models of divergent-allele advantage (DAA) first proposed by Wakeland et al. (1990). There is empirical support for the DAA model (Pierini and Lenz 2018). Some analyses of DAA models concluded that they do not augment variation or readily lead to trans-species polymorphism (Lau et al. 2015; Ejsmond et al. 2018). Uyenoyama (2003) reached a similar conclusion about the evolution of self-incompatibility alleles, which experience fertility selection comparable to very strong balancing selection on viability (Vekemans and Slatkin 1994). Others analyses, including those of Lighten et al. (2017) and Stefan et al. (2019), reach the opposite conclusion that DAA can greatly increase allelic diversity. These differences can be reconciled by considering the effect of DAA on mean fitness. A simple generalization of the Vhet model that allows for DAA is one in which the relative fitness of a homozygote is $1 - s$ (as in the symmetric model) and of a heterozygote is $1 + s u_{i j}$ , where the $u_{i j}$ are chosen to allow for arbitrary interactions among alleles. In the DAA model the $u_{i j}$ would be smaller for pairs of alleles that are more similar to each other or are more recently diverged. Values of the $u_{i j}$ for other pairs of alleles would be larger because they are more dissimilar and can bind to a wider range of antigens. For this model, the average relative fitness is $\bar{w} = 1 - s \sum_{i} p_{i}^{2} + s \sum_{i} p_{i} \sum_{j \neq i} p_{j} u_{i j}$ , which is an increasing function of the $u_{i j}$ (cf, Equation 6). Alleles that tend to increase $\bar{w}$ will be preferentially retained. That is what is seen in the simulation results of Stefan et al. (2019).

Models of coevolution with pathogens are somewhat different from the asymmetric models analyzed here because each allele’s contribution to fitness changes with time as pathogens coevolve in response to the presence of new high-frequency alleles. Ejsmond and Radwan (2015) noted that coevolutionary models are similar to the Red-Queen process in evolutionary biology because MHC alleles continually need to keep up with new challenges presented by coevolving pathogens.

The results from the analysis of asymmetric models have implications for trans-species polymorphisms. The 3 models examined here are not intended to be realistic. Instead they are intended to explore the consequences of variation in different types of allelic contributions to fitness. One of the consequences is that long term persistence of a few alleles is possible even with very small differences in allelic fitness. Stochastic loss of an allele is a sensitive function of selection intensity (Takahata 1990) so slight differences in allelic contributions to fitness affect persistence times disproportionately. Any biological constraint on allelic fitness arising from their role in pathogen defense could maintain larger differences among alleles and hence lead to much larger differences in persistence times. Trans-species polymorphisms are then possible under a wider range of parameter values than is suggested by the symmetric equilibrium model.

Although I have tried to relax the assumptions of the standard model of balancing selection in several ways, the models explored in this paper are still quite simple. They are intended to indicate the ways in which alterations to the symmetric model affect predictions about the maintenance of variation in MHC loci in humans and other vertebrates.

Data availability

Copies of the C program used for data analysis will be made freely available at GitHub.com (DOI: 10.5281/zenodo.6521179).

Acknowledgments

The author thanks J. Felsenstein, S. J. Mack, D. Meywr, and C. Poux for helpful comments on an earlier version of this article and H. Erlich, M. Fernandez-Vina, J. Hollenbeck, M. Maiers, K. Osoegawa, R. Single, and G. J. Thomson for helpful discussions during the early stages of the work presented here.

Funding

This research was not supported by any grants.

Conflicts of interest statement

The author declares no conflict of interest.

Literature cited

Auton A, Bryc K, Boyko AR, Lohmueller KE, Novembre J, Reynolds A, Indap A, Wright MH, Degenhardt JD, Gutenkunst RN, et al. Global distribution of genomic diversity underscores rich complex history of continental human populations. Genome Res. 2009;19(5):795–803. [DOI] [PMC free article] [PubMed] [Google Scholar]
Borghans JAM, Beltman JB, De Boer RJ. MHC polymorphism under host-pathogen coevolution. Immunogenetics. 2004;55(11):732–739. [DOI] [PubMed] [Google Scholar]
Bronson PG, Mack SJ, Erlich HA, Slatkin M. A sequence-based approach demonstrates that balancing selection in classical human leukocyte antigen (HLA) loci is asymmetric. Hum Mol Genet. 2013;22(2):252–261. [DOI] [PMC free article] [PubMed] [Google Scholar]
Crow JF, Aoki K. Group selection for a polygenic behavioral trait: estimating the degree of population subdivision. Proc Natl Acad Sci U S A. 1984;81(19):6073–6077. [DOI] [PMC free article] [PubMed] [Google Scholar]
De Boer RJ, Borghans JAM, van Boven M, Keşmir C, Weissing FJ. Heterozygote advantage fails to explain the high degree of polymorphism of the MHC. Immunogenetics. 2004;55(11):725–731. [DOI] [PubMed] [Google Scholar]
Ejsmond MJ, Phillips KP, Babik W, Radwan J. The role of MHC supertypes in promoting trans-species polymorphism remains an open question. Nat Commun. 2018;9(1):4262. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ejsmond MJ, Radwan J. Red queen processes drive positive selection on major histocompatibility complex (MHC) genes. PLoS Comput Biol. 2015;11(11):e1004627. [DOI] [PMC free article] [PubMed] [Google Scholar]
Elhaik E. Empirical distributions of FST from large-scale human polymorphism data. PLoS One. 2012;7(11):e49837. [DOI] [PMC free article] [PubMed] [Google Scholar]
Evans DM, Spencer CCA, Pointon JJ, Su Z, Harvey D, Kochan G, Oppermann U, Dilthey A, Pirinen M, Stone MA, et al. ; The Australo-Anglo-American Spondyloarthritis Consortium (TASC). Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat Genet. 2011;43(8):761–767. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ewens WJ. The sampling theory of selectively neutral alleles. Theor Popul Biol. 1972;3(1):87–112. [DOI] [PubMed] [Google Scholar]
Gourraud P-A, Khankhanian P, Cereb N, Yang SY, Feolo M, Maiers M, Rioux JD, Hauser S, Oksenberg J. HLA diversity in the 1000 Genomes dataset. PLoS One. 2014;9(7):e97282. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hedrick PW. Pathogen resistance and genetic variation at MHC loci. Evolution. 2002;56(10):1902–1908. [DOI] [PubMed] [Google Scholar]
Ilmonen P, Penn DJ, Damjanovich K, Morrison L, Ghotbi L, Potts WK. and., Major histocompatibility complex heterozygosity reduces fitness in experimentally infected mice. Genetics. 2007;176(4):2501–2508. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kimura M, Crow JF. The number of alleles that can be maintained in a finite population. Genetics. 1964;49:725–738. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kingman JFC. A mathematical problem in population genetics. Math Proc Camb Phil Soc. 1961;57(3):574–582. [Google Scholar]
Klein J, Sato A, Nikolaidis N. MHC, TSP, and the origin of species: from immunogenetics to evolutionary genetics. Annu Rev Genet. 2007;41:281–304. [DOI] [PubMed] [Google Scholar]
Kulpa DA, Collins KL. The emerging role of HLA-C in HIV-1 infection. Immunology. 2011;134(2):116–122. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lau Q, Yasukochi Y, Satta Y. A limit to the divergent allele advantage model supported by variable pathogen recognition across HLA-DRB1 allele lineages. Tissue Antigens. 2015;86(5):343–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li H, Durbin R. Inference of human population history from individual whole-genome sequences. Nature. 2011;475(7357):493–496. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lighten J, Papadopulos AST, Mohammed RS, Ward BJ, G Paterson I, Baillie L, Bradbury IR, Hendry AP, Bentzen P, van Oosterhout C, et al Evolutionary genetics of immunological supertypes reveals two faces of the Red Queen. Nat Commun. 2017;8(1):1294. [DOI] [PMC free article] [PubMed] [Google Scholar]
Muirhead CA, Wakeley J. Modeling multiallelic selection using a Moran model. Genetics. 2009;182(4):1141–1157. [DOI] [PMC free article] [PubMed] [Google Scholar]
Paterson S, Wilson K, Pemberton JM. Major histocompatibility complex variation associated with juvenile survival and parasite resistance in a large unmanaged ungulate population (Ovis aries). Proc Natl Acad Sci U S A. 1998;95(7):3714–3719. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pierini F, Lenz TL. Divergent allele advantage at human MHC genes: signatures of past and ongoing selection. Mol Biol Evol. 2018;35(9):2145–2158. [DOI] [PMC free article] [PubMed] [Google Scholar]
Radwan J, Babik W, Kaufman J, Lenz TL, Winternitz J. Advances in the evolutionary understanding of MHC polymorphism. Trends Genet. 2020;36(4):298–311. [DOI] [PubMed] [Google Scholar]
Satta Y, O'Huigin C, Takahata N, Klein J. Intensity of natural selection at the major histocompatibility complex loci. Proc Natl Acad Sci U S A. 1994;91(15):7184–7188. [DOI] [PMC free article] [PubMed] [Google Scholar]
Siljestam M, Rueffler C. Heterozygote advantage can explain the extraordinary diversity of immune genes. bioRxiv. 2019;347344. [Google Scholar]
Solberg OD, Mack SJ, Lancaster AK, Single RM, Tsai Y, Sanchez-Mazas A, Thomson G. Balancing selection and heterogeneity across the classical human leukocyte antigen loci: a meta-analytic review of 497 population studies. Hum Immunol. 2008;69(7):443–464. [DOI] [PMC free article] [PubMed] [Google Scholar]
Spence JP, Steinrücken M, Terhorst J, Song YS. Inference of population history using coalescent HMMs: review and outlook. Curr Opin Genet Dev. 2018;53:70–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
Spurgin LG, Richardson DS. How pathogens drive genetic diversity: HC, mechanisms and misunderstandings. Proc Biol Sci. 2010;277(1684):979–988. [DOI] [PMC free article] [PubMed] [Google Scholar]
Stefan T, Matthews L, Prada JM, Mair C, Reeve R, Stear MJ. Divergent allele advantage provides a quantitative model for maintaining alleles with a wide range of intrinsic merits. Genetics. 2019;212(2):553–564. [DOI] [PMC free article] [PubMed] [Google Scholar]
Stoffels RJ, Spencer HG. An asymmetric model of heterozygote advantage at major histocompatibility complex genes: degenerate pathogen recognition and intersection advantage. Genetics. 2008;178(3):1473–1489. [DOI] [PMC free article] [PubMed] [Google Scholar]
Takahata N. A simple genealogical structure of strongly balanced allelic lines and trans-species evolution of polymorphism. Proc Natl Acad Sci U S A. 1990;87(7):2419–2423. [DOI] [PMC free article] [PubMed] [Google Scholar]
Takahata N, Nei M. Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics. 1990;124(4):967–978. [DOI] [PMC free article] [PubMed] [Google Scholar]
Trowsdale J, Knight JC. Major histocompatibility complex genomics and human disease. Annu Rev Genomics Hum Genet. 2013;14:301–323. [DOI] [PMC free article] [PubMed] [Google Scholar]
Uyenoyama MK. Genealogy-dependent variation in viability among self-incompatibility genotypes. Theor Popul Biol. 2003;63(4):281–293. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vekemans X, Slatkin M. Gene and allelic genealogies at a gametophytic self-incompatibility locus. Genetics. 1994;137(4):1157–1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wakeland EK, Boehme S, She JX, Lu CC, McIndoe RA, Cheng I, Ye Y, Potts WK. Ancestral polymorphisms of MHC Class-II genes—divergent allele advantage. Immunol Res. 1990;9(2):115–122. [DOI] [PubMed] [Google Scholar]
Watterson GA. Heterosis or neutrality. Genetics. 1977;85(4):789–814. [DOI] [PMC free article] [PubMed] [Google Scholar]
Yasukochi Y, Satta Y. Current perspectives on the intensity of natural selection of MHC loci. Immunogenetics. 2013;65(6):479–483. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Copies of the C program used for data analysis will be made freely available at GitHub.com (DOI: 10.5281/zenodo.6521179).

[iyac058-B1] Auton A, Bryc K, Boyko AR, Lohmueller KE, Novembre J, Reynolds A, Indap A, Wright MH, Degenhardt JD, Gutenkunst RN, et al. Global distribution of genomic diversity underscores rich complex history of continental human populations. Genome Res. 2009;19(5):795–803. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B2] Borghans JAM, Beltman JB, De Boer RJ. MHC polymorphism under host-pathogen coevolution. Immunogenetics. 2004;55(11):732–739. [DOI] [PubMed] [Google Scholar]

[iyac058-B3] Bronson PG, Mack SJ, Erlich HA, Slatkin M. A sequence-based approach demonstrates that balancing selection in classical human leukocyte antigen (HLA) loci is asymmetric. Hum Mol Genet. 2013;22(2):252–261. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B4] Crow JF, Aoki K. Group selection for a polygenic behavioral trait: estimating the degree of population subdivision. Proc Natl Acad Sci U S A. 1984;81(19):6073–6077. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B5] De Boer RJ, Borghans JAM, van Boven M, Keşmir C, Weissing FJ. Heterozygote advantage fails to explain the high degree of polymorphism of the MHC. Immunogenetics. 2004;55(11):725–731. [DOI] [PubMed] [Google Scholar]

[iyac058-B6] Ejsmond MJ, Phillips KP, Babik W, Radwan J. The role of MHC supertypes in promoting trans-species polymorphism remains an open question. Nat Commun. 2018;9(1):4262. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B7] Ejsmond MJ, Radwan J. Red queen processes drive positive selection on major histocompatibility complex (MHC) genes. PLoS Comput Biol. 2015;11(11):e1004627. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B8] Elhaik E. Empirical distributions of FST from large-scale human polymorphism data. PLoS One. 2012;7(11):e49837. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B9] Evans DM, Spencer CCA, Pointon JJ, Su Z, Harvey D, Kochan G, Oppermann U, Dilthey A, Pirinen M, Stone MA, et al. ; The Australo-Anglo-American Spondyloarthritis Consortium (TASC). Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat Genet. 2011;43(8):761–767. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B10] Ewens WJ. The sampling theory of selectively neutral alleles. Theor Popul Biol. 1972;3(1):87–112. [DOI] [PubMed] [Google Scholar]

[iyac058-B11] Gourraud P-A, Khankhanian P, Cereb N, Yang SY, Feolo M, Maiers M, Rioux JD, Hauser S, Oksenberg J. HLA diversity in the 1000 Genomes dataset. PLoS One. 2014;9(7):e97282. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B12] Hedrick PW. Pathogen resistance and genetic variation at MHC loci. Evolution. 2002;56(10):1902–1908. [DOI] [PubMed] [Google Scholar]

[iyac058-B13] Ilmonen P, Penn DJ, Damjanovich K, Morrison L, Ghotbi L, Potts WK. and., Major histocompatibility complex heterozygosity reduces fitness in experimentally infected mice. Genetics. 2007;176(4):2501–2508. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B14] Kimura M, Crow JF. The number of alleles that can be maintained in a finite population. Genetics. 1964;49:725–738. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B15] Kingman JFC. A mathematical problem in population genetics. Math Proc Camb Phil Soc. 1961;57(3):574–582. [Google Scholar]

[iyac058-B16] Klein J, Sato A, Nikolaidis N. MHC, TSP, and the origin of species: from immunogenetics to evolutionary genetics. Annu Rev Genet. 2007;41:281–304. [DOI] [PubMed] [Google Scholar]

[iyac058-B17] Kulpa DA, Collins KL. The emerging role of HLA-C in HIV-1 infection. Immunology. 2011;134(2):116–122. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B18] Lau Q, Yasukochi Y, Satta Y. A limit to the divergent allele advantage model supported by variable pathogen recognition across HLA-DRB1 allele lineages. Tissue Antigens. 2015;86(5):343–352. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B19] Li H, Durbin R. Inference of human population history from individual whole-genome sequences. Nature. 2011;475(7357):493–496. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B20] Lighten J, Papadopulos AST, Mohammed RS, Ward BJ, G Paterson I, Baillie L, Bradbury IR, Hendry AP, Bentzen P, van Oosterhout C, et al Evolutionary genetics of immunological supertypes reveals two faces of the Red Queen. Nat Commun. 2017;8(1):1294. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B21] Muirhead CA, Wakeley J. Modeling multiallelic selection using a Moran model. Genetics. 2009;182(4):1141–1157. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B22] Paterson S, Wilson K, Pemberton JM. Major histocompatibility complex variation associated with juvenile survival and parasite resistance in a large unmanaged ungulate population (Ovis aries). Proc Natl Acad Sci U S A. 1998;95(7):3714–3719. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B23] Pierini F, Lenz TL. Divergent allele advantage at human MHC genes: signatures of past and ongoing selection. Mol Biol Evol. 2018;35(9):2145–2158. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B24] Radwan J, Babik W, Kaufman J, Lenz TL, Winternitz J. Advances in the evolutionary understanding of MHC polymorphism. Trends Genet. 2020;36(4):298–311. [DOI] [PubMed] [Google Scholar]

[iyac058-B25] Satta Y, O'Huigin C, Takahata N, Klein J. Intensity of natural selection at the major histocompatibility complex loci. Proc Natl Acad Sci U S A. 1994;91(15):7184–7188. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B26] Siljestam M, Rueffler C. Heterozygote advantage can explain the extraordinary diversity of immune genes. bioRxiv. 2019;347344. [Google Scholar]

[iyac058-B27] Solberg OD, Mack SJ, Lancaster AK, Single RM, Tsai Y, Sanchez-Mazas A, Thomson G. Balancing selection and heterogeneity across the classical human leukocyte antigen loci: a meta-analytic review of 497 population studies. Hum Immunol. 2008;69(7):443–464. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B28] Spence JP, Steinrücken M, Terhorst J, Song YS. Inference of population history using coalescent HMMs: review and outlook. Curr Opin Genet Dev. 2018;53:70–76. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B29] Spurgin LG, Richardson DS. How pathogens drive genetic diversity: HC, mechanisms and misunderstandings. Proc Biol Sci. 2010;277(1684):979–988. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B30] Stefan T, Matthews L, Prada JM, Mair C, Reeve R, Stear MJ. Divergent allele advantage provides a quantitative model for maintaining alleles with a wide range of intrinsic merits. Genetics. 2019;212(2):553–564. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B31] Stoffels RJ, Spencer HG. An asymmetric model of heterozygote advantage at major histocompatibility complex genes: degenerate pathogen recognition and intersection advantage. Genetics. 2008;178(3):1473–1489. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B32] Takahata N. A simple genealogical structure of strongly balanced allelic lines and trans-species evolution of polymorphism. Proc Natl Acad Sci U S A. 1990;87(7):2419–2423. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B33] Takahata N, Nei M. Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics. 1990;124(4):967–978. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B34] Trowsdale J, Knight JC. Major histocompatibility complex genomics and human disease. Annu Rev Genomics Hum Genet. 2013;14:301–323. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B35] Uyenoyama MK. Genealogy-dependent variation in viability among self-incompatibility genotypes. Theor Popul Biol. 2003;63(4):281–293. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B36] Vekemans X, Slatkin M. Gene and allelic genealogies at a gametophytic self-incompatibility locus. Genetics. 1994;137(4):1157–1165. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B37] Wakeland EK, Boehme S, She JX, Lu CC, McIndoe RA, Cheng I, Ye Y, Potts WK. Ancestral polymorphisms of MHC Class-II genes—divergent allele advantage. Immunol Res. 1990;9(2):115–122. [DOI] [PubMed] [Google Scholar]

[iyac058-B38] Watterson GA. Heterosis or neutrality. Genetics. 1977;85(4):789–814. [DOI] [PMC free article] [PubMed] [Google Scholar]

[iyac058-B39] Yasukochi Y, Satta Y. Current perspectives on the intensity of natural selection of MHC loci. Immunogenetics. 2013;65(6):479–483. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Joint estimation of selection intensity and mutation rate under balancing selection with applications to HLA

Montgomery Slatkin

Roles

Abstract

Table 2.

Allele frequency spectrum

Composite likelihood method

Simulation tests

Fig. 1.

Fig. 2.

Fig. 3.

Fig. 4.

Application of the composite likelihood method

Table 1.

Deviations from the symmetric equilibrium model

Population growth

Fig. 5.

Fig. 6.

Population subdivision

Fig. 7.

Models of asymmetric balancing selection

Fig. 8.

Fig. 9.

Fig. 10.

Fig. 11.

Trans-species polymorphism

Fig. 12.

Discussion and conclusion

Data availability

Acknowledgments

Funding

Conflicts of interest statement

Literature cited

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases