Abstract
GC-biased gene conversion (gBGC) is a recombination-associated process mimicking selection in favor of G and C alleles. It is increasingly recognized as a widespread force in shaping the genomic nucleotide landscape. In recombination hotspots, gBGC can lead to bursts of fixation of GC nucleotides and to accelerated nucleotide substitution rates. It was recently shown that these episodes of strong gBGC could give spurious signatures of adaptation and/or relaxed selection. There is also evidence that gBGC could drive the fixation of deleterious amino acid mutations in some primate genes. This raises the question of the potential fitness effects of gBGC. While gBGC has been metaphorically termed the “Achilles' heel” of our genome, we do not know whether interference between gBGC and selection merely has practical consequences for the analysis of sequence data or whether it has broader fundamental implications for individuals and populations. I developed a population genetics model to predict the consequences of gBGC on the mutation load and inbreeding depression. I also used estimates available for humans to quantitatively evaluate the fitness impact of gBGC. Surprising features emerged from this model: (i) Contrary to classical mutation load models, gBGC generates a fixation load independent of population size and could contribute to a significant part of the load; (ii) gBGC can maintain recessive deleterious mutations for a long time at intermediate frequency, in a similar way to overdominance, and these mutations generate high inbreeding depression, even if they are slightly deleterious; (iii) since mating systems affect both the selection efficacy and gBGC intensity, gBGC challenges classical predictions concerning the interaction between mating systems and deleterious mutations, and gBGC could constitute an additional cost of outcrossing; and (iv) if mutations are biased toward A and T alleles, very low gBGC levels can reduce the load. A robust prediction is that the gBGC level minimizing the load depends only on the mutational bias and population size. These surprising results suggest that gBGC may have nonnegligible fitness consequences and could play a significant role in the evolution of genetic systems. They also shed light on the evolution of gBGC itself.
GC-BIASED gene conversion (gBGC) is increasingly recognized as a widespread force in shaping genome evolution. In different species, gene conversion occurring during double-strand break recombination repair is thought to be biased toward G and C alleles. In heterozygotes, GC alleles undergo a kind of molecular meiotic drive that mimics selection (reviewed in Marais 2003). This process can rapidly increase the GC content, especially around recombination hotspots (Spencer et al. 2006), and, more broadly, can affect genome-wide nucleotide landscapes (Duret and Galtier 2009a). For instance, it is thought to play a role in shaping isochore structure evolution in mammals (Galtier et al. 2001; Meunier and Duret 2004; Duret et al. 2006) and birds (Webster et al. 2006). Direct experimental evidence of gBGC mainly comes from studies in yeast (Birdsell 2002; Mancera et al. 2008; but see Marsolier-Kergoat and Yeramian 2009) and humans (Brown and Jiricny 1987). However, associations between recombination and the nucleotide landscape and frequency spectra biased toward GC alleles provide indirect evidence in very diverse organisms (Table 1).
TABLE 1.
Direct and indirect evidence of gBGC in several organisms
Organisms | Direct evidence | Indirect evidence | Achille's heel evidence | References |
---|---|---|---|---|
Yeast | Meiotic segregation bias | Mancera et al. (2008) | ||
Mitotic and mitotic heteromismatch correction bias | Correlation between GC and recombination | Birdsell (2002) | ||
Mammals | Mitotic heteromismatch correction bias | Brown and Jiricny (1987) | ||
Correlation between GC*/GC and recombination | Duret and Arndt (2008); Meunier and Duret (2004) | |||
Biased frequency spectrum toward GC alleles | Galtier et al. (2001); Spencer et al. (2006) | |||
GC bias associated with high dN/dS near recombination hotspot | Berglund et al. (2009; Galtier et al. (2009) | |||
Birds | Correlation between GC and recombination | International Chicken Genome Sequencing Consortium (2004) | ||
Turtles | Correlation between GC and chromosome size | Kuraku et al. (2006) | ||
Drosophila | Correlation between GC and recombination | Marais et al. (2003) | ||
Biased frequency spectrum toward GC alleles | Galtier et al. (2006) | |||
Nematodes | Correlation between GC and recombination | Marais et al. (2001) | ||
Grasses | Correlation between GC and outcrossing/selfing | Glémin et al. (2006) | ||
Correlation between GC* and recombination and outcrossing/selfing | Outcrossing increases dN/dS for genes with high GC* | Haudry et al. (2008) | ||
Green algae | Correlation between GC and recombination | Jancek et al. (2008) | ||
Paramecium | Correlation between GC and chromosome size | Duret et al. (2008) |
The impact of gBGC on noncoding sequences and synonymous sites has been studied in depth, especially because of confounding effects with selection on codon usage (Marais et al. 2001). More recently, Galtier and Duret (2007) pointed out that gBGC may also interfere with selection when affecting functional sequences. They argued that gBGC could leave spurious signatures of adaptive selection and proposed to extend the null hypothesis of molecular evolution. Indeed, gBGC can lead to a ratio of nonsynonymous (dN) over synonymous (dS) substitutions above one (Berglund et al. 2009; Galtier et al. 2009), i.e., a typical signature of positive selection (Nielsen 2005). This hypothesis has been widely debated for human-accelerated regions (HARs). These regions are extremely conserved across mammals but show evidence of accelerated evolution along the human lineage, which has been interpreted as evidence of positive selection (Pollard et al. 2006a,b; Prabhakar et al. 2006, 2008). On the contrary, other authors argued that patterns observed in HARs, such as the AT → GC substitution bias, the absence of a selective sweep signature, or the propensity to occur within or close to recombination hotspots, are more likely explained by gBGC rather than positive selection (Galtier and Duret 2007; Berglund et al. 2009; Duret and Galtier 2009b; but see also Pollard et al. 2006a who also suggested that gBGC might play a role in HARs evolution). It is thus crucial to take gBGC into account when interpreting genomic data.
Moreover, Galtier and Duret (2007) initially suggested that gBGC hotspots could contribute to the fixation of slightly deleterious AT → GC mutations and could represent the Achilles' heel of our genome. This hypothesis was reinforced later in primates, with evidence of gBGC-driven fixation of deleterious mutations in proteins (Galtier et al. 2009). A similar result was also found in some grass species, whose genomes are also supposed to be affected by gBGC (Table 1; Glémin et al. 2006). Haudry et al. (2008) compared two outcrossing and two selfing grass species and showed that GC-biased genes exhibit higher dN/dS ratio in outcrossing than in selfing lineages. The reverse pattern would be expected under pure selective models because of the reduced selection efficacy in selfers (Charlesworth 1992; Glémin 2007). This pattern is in agreement with a genomic Achilles' heel associated with outcrossing, while gBGC is inefficient in selfing species because they are mainly homozygous.
Twenty years ago, Bengtsson (1990) already pointed out that biased conversion can generally affect the mutation load. The mutation load is the reduction in the mean fitness of a population due to mutation accumulation, which could lead to population extinction if it is too high (Lynch et al. 1995). At this time, Bengtsson concluded that “it is impossible to know if biased conversion plays a major role in determining the magnitude of the mutation load in organisms such as ourselves, but the possibility must be considered and further investigated (Bengtsson 1990, p. 186).” Now, one can propose gBGC could be such a widespread biased conversion process. It thus appears timely to thoroughly investigate the fitness consequences of gBGC through its potential effects on the dynamics of deleterious mutations. The fitness consequences of gBGC were also pointed out as a major future issue to be addressed by Duret and Galtier (2009a). In addition to the load, deleterious mutations have many other evolutionary consequences (for review see Charlesworth and Charlesworth 1998). They are thought to be the main determinant of inbreeding depression, i.e., the reduction in fitness of inbred individuals compared to outbred ones. They also play a key role in the evolution of genetic systems (sexual reproduction and recombination, inbreeding avoidance mechanisms, ploidy cycles), of senescence, or in the degeneration of nonrecombining regions, such as Y chromosomes. So far, we know little, if anything, about how gBGC might affect these processes.
In his seminal work, Bengtsson (1990) did not address several important points. First, he did not include genetic drift in his model. Nearly neutral mutations, for which drift and selection are of similar intensities, are the most damaging ones because they can drift to fixation, unlike strongly deleterious mutations that are maintained at low frequency (Crow 1993; Lande 1994, 1998). While gBGC intensities are rather weak (Birdsell 2002; Spencer et al. 2006), they could markedly affect the fate of nearly neutral mutations (see also Galtier et al. 2009). Second, Bengtsson did not study the effect of gene conversion on inbreeding depression, while he showed that recessive mutations, mostly involved in inbreeding depression, are the most affected by gene conversion. Third, he did not envisage systematic GC bias with its opposite effects on A/T and G/C deleterious alleles. Fourth, while he noted that selfing affects both the efficacy of selection and that of conversion, he did not fully investigate the effect of mating systems. On one hand, selfing is efficient in purging strongly deleterious mutations causing inbreeding depression. However, since selfing is expected to increase drift, weakly deleterious mutations can fix in selfing species, contributing to the so-called “drift load” (Charlesworth 1992; Glémin 2007). Self-fertilizing populations are thus expected to exhibit low inbreeding depression and high drift load. On the other hand, gBGC, and thus its cost, vanishes as the selfing rate and homozygosity increase (Marais et al. 2004). gBGC could thus challenge classical views on mating systems and it was even speculated that gBGC could affect their evolution (Haudry et al. 2008).
Here I present a population genetics model that includes mutation, selection, drift, and gBGC, which extends previous studies (Gutz and Leslie 1976; Lamb and Helmi 1982; Nagylaki 1983a,b; Bengtsson 1990). I specifically examine how gBGC can affect inbreeding depression and the mutation load. I also focus on the effect of mating system, which is especially interesting with regard to the interaction between biased conversion and selection. Finally, I discuss how these results could give insight into how gBGC evolved.
MODEL AND RESULTS
General formulation:
Consider a single biallelic locus with weak (W = A or T) and strong (S = G or C) alleles. The life cycle is as follows: N diploid adults produce gametes after mutation and conversion events. Heterozygote individuals produce S alleles with probability 1/2(1 + b) and W alleles with probability 1/2(1 – b), where b is the gBGC coefficient (disparity coefficient sensu Nagylaki 1983a). Fertilization occurs and zygotes experience selection to give adults, which experience regulation to N individuals. If the S (resp. W) allele is deleterious, the relative fitnesses of WW (resp. SS), WS, and SS (resp. WW) genotypes are 1, 1 − hs, and 1 − s, respectively, where s is the selection coefficient and h the dominance coefficient. The W allele mutates at rate u to the S allele. The reverse mutation occurs at rate v = λu. λ is the mutation bias from S to W alleles, which ranges from 2 to 4.5 in many different organisms (Lynch 2007). I assume that u and v are much smaller than the selection and conversion coefficients (u, v ≪ s, hs, and b). For simplicity, I also assume that the gBGC intensity is constant. However, strong gBGC events are thought to be associated with short-lived recombination hotspots, at least in humans (McVean et al. 2004; Myers et al. 2005). I thus implicitly assume that gBGC/selection dynamics are shorter than the recombination hotspot lifespan. The validity of this assumption is discussed below. All parameters are summarized in Table 2.
TABLE 2.
Notations
N: Actual population size equals effective population size in panmictic populations |
F: Wright's fixation index |
α: Scaling factor summarizing the reduction in effective population size due to hitchhiking and bottleneck effects |
u: Mutation rate from AT toward GC |
v: Mutation rate from GC toward AT |
λ = v/u: Mutation bias toward AT |
h: Dominance coefficient of the deleterious allele |
s: Selection coefficient against the deleterious allele |
b: Intensity of biased gene conversion |
q: Fraction of sites where GC is the advantageous allele (![]() ![]() |
smax: Selection coefficient that maximizes the load |
![]() |
![]() |
blim: gBGC level above which the load increases |
Deterministic equilibrium—three selection regimes:
Analysis of the deterministic case (N = ∞) highlights the different behaviors of the model. Bengtsson (1990) gave the full equation of the deterministic change in allele frequency. Under the assumption of weak selection and weak gBGC (s ≪ 1 and b ≪ 1), changes due to mutation, selection, and gBGC can be computed separately. Note that a full treatment of the system, without approximations, is given by A. Popa, S. Glemin, E. Popa, D. Mouchiroud, and C. Gautier (unpublished data). The deterministic change in frequency of the deleterious allele (x) can thus be approximated by
![]() |
(1) |
with
![]() |
and
![]() |
where is the mean fitness of the population and F is Wright's fixation index, FIS.
Note that in Δxselection, the approximation holds either for s ≪ 1 or x ≪ 1. So it is still valid for lethal alleles because x ≪ 1. For weak selection, the F coefficient can be used for neutral loci, that is, F = σ/(2 − σ), where σ is the selfing rate (Caballero and Hill 1992). For strong selection, F depends on selfing rate and selection parameters (Glémin 2003). Given the approximation for Δxselection, the effect of gBGC is equivalent to modifying the selection and dominance coefficients as follows:
![]() |
(2) |
![]() |
Conventional population genetics results thus hold using this new parameterization. In panmixia (F = 0), gBGC is thus fully equivalent to modifying selection patterns. However, under inbreeding, the analogy is not complete because selection and dominance coefficients depend on F.
Equilibrium frequencies are then obtained by solving Δx=0 (see appendix a). The case for which W is the deleterious allele is straightforward because both gBGC and selection favor the S allele. The deleterious allele is maintained at a lower frequency than under mutation/selection equilibrium (see also Bengtsson 1990):
![]() |
(3) |
The first case (with S being the deleterious allele), the one underlying the Achilles' heel hypothesis, is much less trivial. It leads to three selection regimes (Figures 1 and 2). If gBGC is weak relative to selection (b ≪ s), the equilibrium frequency is
![]() |
(4a) |
The deleterious S allele is maintained at low frequency, but slightly higher than at mutation/selection equilibrium. On the contrary, if gBGC is strong (b ≫ s), the deleterious allele is close to fixation:
![]() |
(4b) |
Finally, if gBGC and selection are of similar intensity, the two forces can maintain the deleterious allele at intermediate frequency:
![]() |
(4c) |
The condition that ensures the existence of this last equilibrium point is (h + F − hF)s/(1 − F) ≤ b ≤ (1 − h + hF)s/(1 − F), and analysis of the derivative
shows that this equilibrium is stable between the two limits (Figure 2). In panmixia, this case is fully equivalent to overdominance. Below a given selfing threshold, it is well known that the equilibrium is not stable if overdominance is asymmetrical (Kimura and Ohta 1971). However, in the present case, the equilibrium is always stable because equivalent selection parameters depend on F and the range of the overdominant-like domain narrows as inbreeding increases (Figure 2). In the special case of fully recessive alleles in panmixia (F = 0),
![]() |
(4d) |
which vanishes to the well-known equilibrium without gBGC (Haldane 1937). As long as gBGC is small (b < 0.1), Equation 4d also holds for lethal alleles and gives results very close to those of Bengtsson (1990); also A. Popa, S. Glemin, E. Popa, D. Mouchiroud, and C. Gautier, unpublished results.
Figure 1.—
Deterministic equilibrium frequency of the deleterious GC allele as a function of the conversion bias for different dominance levels: h = 0.5 (thick line), h = 0.2 (thin line), h = 0.05 (dashed line), and h = 0 (dotted line). s = 0.001, u = 10−6, and λ = 2.
Figure 2.—
The three selection regimes as a function of b and s for different combinations of dominance levels (a, c, and e, h = 0.1; b, d, and f, h =0.3) and mating systems (a and b, F = 0; c and d, F = 0.5; e and f, F = 0.95). 1: gBGC overwhelms selection and the GC deleterious allele goes to fixation. 2: gBGC and selection are of similar intensities and an overdominance-like equilibrium is reached. 3: Selection overwhelms gBGC and the deleterious allele becomes extinct.
Dynamics in finite populations and time properties:
In finite populations, the first two moments of the stationary probability distribution of allele frequency are required to compute the mean load and mean inbreeding depression (e.g., Glémin et al. 2003). The work of Nagylaki (1983a) can be extended to incorporate the effect of dominance and inbreeding using the classical diffusion theory and the reparameterization given by (2). The Wright (1937) stationary distribution is given by
![]() |
(5a) |
if S is the deleterious allele and
![]() |
(5b) |
if W is the deleterious allele, where K is an integration constant, ensuring that and
, where N is the effective population size of the equivalent population without inbreeding (Pollak 1987; Nordborg 1997), and α (α = 1 in panmixia; 0 < α ≤ 1 under inbreeding) is a coefficient introduced to summarize the additional reduction in effective population size due to hitchhiking and bottleneck effects associated with inbreeding (Glémin 2007).
There is no general analytical solution for K. However, for the special case of genic selection in panmixia (F = 0 and ), the moments of Φ are
![]() |
(6a) |
if S is the deleterious allele and
![]() |
(6b) |
if W is the deleterious allele, where Γ is the gamma function and 1F1 is the regularized confluent hypergeometric function (Abramowitz and Stegun 1970). Analytical expression cannot be given in the general case, and numerical integrations are used (see appendix b).
The probability of fixation of a new mutation is also useful (see appendix a). General analytical expressions can be obtained but they are formidable and not given here. However, the probability of fixation of a new mutation under genic selection in panmixia simply reduces to
![]() |
(7a) |
if S is the deleterious allele and
![]() |
(7b) |
if W is the deleterious allele. Approximations hold for |s − b| ≪ 1 and |s + b| ≪ 1, respectively (see also Nagylaki 1983a).
The last quantities of interest are sojourn and fixation times. Time properties are needed to evaluate whether, on average, the allele life span is much shorter than the recombination hotspot life span. This is implicitly assumed by modeling a constant gBGC intensity. In the panmictic codominant case, the absorption time has been already derived by Nagylaki (1983a). Conversion only modifies the selection intensity (genic selection), and S deleterious alleles quickly lead to either extinction or fixation. The mean absorption time of a new mutant is always shorter than that under pure genetic drift (see Equations 28–33 in Nagylaki 1983a). On the contrary, as the deterministic analysis showed, gBGC can maintain recessive deleterious alleles at intermediate frequencies for a long time by mimicking overdominance. Conventional diffusion results can also be applied (e.g., Ewens 2004) to compute the mean sojourn and fixation times of a new mutation (see appendix a). In the general case, these expressions have no analytical expression and numerical integrations are performed (see appendix b). However, under the “overdominance-like” regime, deleterious alleles rarely reach fixation. It is thus more interesting to compute the mean time until the mutant allele reaches the deterministic equilibrium frequency (xeq) for the first time. I used the expression of Kimura and Ohta (1973) that gives the mean time to reach a given frequency for the first time, starting from frequency p0 < xeq. For fully recessive mutations (h = 0) in panmixia, the equilibrium frequency is well approximated by xeq = b/(b + s) (Equation 4d with s > b ≫ u). Once again, no analytical expression is available and numerical integrations are performed. Finally, under the overdominance-like regime, the allele can persist for a long time if it is not initially lost (e.g., Takahata 1990). To evaluate this conditional persistence, I also computed the mean sojourn time of an allele starting from frequency xeq. The numerical results are presented in Tables 3 and 4].
TABLE 3.
Time properties under the “overdominant-like” regime: effect of s and b (N = 10,000, h = 0)
xeq | Mean sojourn time | Fixation time | Mean time to reach xeq | Mean sojourn time from xeq | |
---|---|---|---|---|---|
s = 0.0005 | |||||
b = 0 | — | 53 | 26,846 | — | — |
b = 0.0001 | 0.2 | 67 | 40,864 | 4,383 | 1 × E + 08 |
b = 0.0002 | 0.4 | 105 | 73,314 | 9,795 | 7 × E + 08 |
b = 0.0003 | 0.6 | 95 | 73,314 | 15,219 | 1 × E + 08 |
b = 0.0004 | 0.8 | 72 | 40,864 | 19,003 | 390,270 |
s = 0.001 | |||||
b = 0 | — | 50 | 18,200 | — | — |
b = 0.0002 | 0.2 | 74 | 34,008 | 4,447 | 1 × E + 08 |
b = 0.0004 | 0.4 | 313 | 145,154 | 9,322 | 4 × E + 09 |
b = 0.0006 | 0.6 | 208 | 145,154 | 11,355 | 1 × E + 08 |
b = 0.0008 | 0.8 | 88 | 34,008 | 12,286 | 34,008 |
s = 0.002 | |||||
b = 0 | — | 47 | 12,205 | — | — |
b = 0.0004 | 0.2 | 101 | 33,633 | 4,494 | 2 × E + 08 |
b = 0.0008 | 0.4 | 4,593 | 1 × E + 06 | 7,220 | 3 × E + 11 |
b = 0.0012 | 0.6 | 2,472 | 1 × E + 06 | 7,069 | 1 × E + 08 |
b = 0.0016 | 0.8 | 137 | 33,633 | 7,451 | 26,182 |
TABLE 4.
Time properties under the “overdominant-like” regime: effect of N (h = 0, s = 0.001, b = 0.0002, xeq = 0.2)
Mean sojourn time without gBGC | Mean sojourn time | Fixation time | Mean time to reach xeq | Mean sojourn time from xeq | |
---|---|---|---|---|---|
N = 1,000 | 43 | 44 | 4,385 | 431 | 706,085 |
N = 5,000 | 49 | 63 | 20,431 | 2,191 | 3 × E + 07 |
N = 10,000 | 50 | 74 | 34,007 | 4,447 | 1 × E + 08 |
N = 50,000 | 54 | 466 | 419,015 | 20,230 | 4 × E + 10 |
N = 100,000 | 56 | 14,977 | 1 × E + 07 | 28,752 | 4 × E + 13 |
N = 500,000 | 61 | 5 × E + 17 | 4 × E + 20 | 42,339 | 4 × E + 40 |
N = 1,000,000 | 63 | 8 × E + 34 | 7 × E + 37 | 47,925 | 1 × E + 75 |
On average, gBGC makes S deleterious alleles persist longer in populations. However, the time to reach the deterministic equilibrium frequency for the first time is still short. Therefore, computing the equilibrium frequency on the basis of constant b is reliable despite the short life span of recombination hotspots. On the contrary, once the allele reaches the equilibrium frequency it will persist for a very long time. If it is not initially lost, a deleterious S allele will thus persist in the population over the hotspot life span.
Impacts of gBGC on the mutation load:
I now consider how gBGC affects the mutation load. The load is defined as
![]() |
(8) |
Here we need to consider both sites for which either S or W alleles are deleterious, in proportions q and 1 − q, respectively, so that
![]() |
(9) |
For nonsynonymous positions and noncoding functional regions, it seems reasonable to assume that . However, for synonymous positions under selection for codon usage, q will depend on the distribution of preferred codons. As GC ending codons are often more preferred than AT ending ones (Duret and Mouchiroud 1999; Wang and Roossinck 2006), q can be close to 0.
gBGC and the load structure in panmictic populations:
In infinite panmictic populations, without gBGC, the mutation load is equal to twice the mutation rate (Haldane 1937). In finite populations, deterministic results hold for strongly deleterious mutations (s ≫ 1/Ne). Slightly deleterious alleles (∼s ≤ 1/Ne) can drift to fixation, causing a much higher load (“fixation load”) than that caused by strongly deleterious mutations segregating at low frequency (“segregating load”), i.e., ∼L ≈ s (Figure 3a and see Kimura et al. 1963). Consequently, the relative part of the fixation load (or “drift load,” see the distinction below) and the segregating load depends on both the effective population size and the distribution of selection coefficients.
Figure 3.—
The load (×106) as a function of s for different effective population sizes (thick lines, N = 5000; thin lines, N = 10,000; dashed lines, N = 50,000; dotted lines, N = 100,000), without (a) or with (b–d) gBGC (b = 0.0002). The straight lines correspond to equilibrium expectations. (b) Load due to GC deleterious alleles. (c) Load averaged over half GC and half AT deleterious alleles. (d) Load averaged over 10% of GC deleterious alleles and 90% of AT deleterious alleles with a bias in favor of AT alleles. This last case would correspond to the load caused by third codon position mutations under codon usage selection, mainly in favor of GC alleles. h = 0.5, u = 10−6, and λ = 2.
gBGC affects both the magnitude and the structure of the mutation load. In infinite populations, and more generally for strongly deleterious alleles (Nes ≫ 1), the deterministic load is obtained by replacing x by xeq, given by Equations 4 in Equations 8 and 9:
![]() |
(10a) |
![]() |
(10b) |
![]() |
(10c) |
Under the first condition (b ≪ hs), gBGC increases the load if b > hs(1−2q/(q + λ − qλ)). Under the other conditions, the load due to W deleterious mutations is negligible and gBGC always increases the load. Using λ = 2 and , gBGC increases the load due to mutations for which hs < 3b. Given that b is quite low (Spencer et al. 2006), gBGC increases the load only due to rather weakly deleterious mutations.
In finite populations, the load must be integrated over the Φ distribution, which leads to
![]() |
(11) |
EΦ[x] and EΦ[x2] can be computed numerically (appendix b) and introduced in (11).
For genic selection () in panmixia (F = 0), the approximations
![]() |
(12a) |
and
![]() |
(12b) |
can be obtained (see appendix c), where Ldet is the deterministic load given by Equations 10. Interestingly, these approximations are very robust and hold for a wide range of parameters (see appendix d for numerical validation). The analysis of Equations 12 and numerical explorations show that (i) if Neb < 1, the effect of gBGC is negligible (but see below) and the mutations maximizing the load (either W or S) are of the order of s ≈ 1/Ne, and (ii) if Neb > 1, the mutations maximizing the load are S mutations with s ≈ 2b (see Figure 3 and appendix c). gBGC increases the fraction of the fixation load beyond the drift load sensu stricto, i.e., the load due to deleterious mutations fixed by drift only.
Minimum load:
As explained above, if mutations are biased toward W alleles, under certain circumstances, gBGC can decrease the load (see appendix c). A minimum is thus reached for low gBGC levels. In panmictic populations, the gBGC level that minimizes the segregating load is
![]() |
(13) |
While gBGC can greatly increase the fixation load, a very low gBGC level can also reduce it, and the gBGC level that minimizes the fixation load is
![]() |
(14) |
Both expressions clearly show that gBGC can minimize the total load only if λ > 1; i.e., the mutation is biased toward W alleles. Interestingly, the gBGC level that minimizes the fixation load does not depend on s but instead only on Ne and λ. Since (13) depends on h and s, it is hard to predict which gBGC level will minimize the load for a given distribution of h and s. However, gBGC will greatly increase the fixation load as far as b > ln(λ)/2Ne (see appendix c). Since in most cases,
is thus a very good approximation of the gBGC level that minimizes the total load, whatever the distribution of mutational effects is. This rationale was confirmed by numerical analyses (appendix d).
Quantitative effects of gBGC on the load:
To quantitatively assess the theoretical effects predicted above, I computed the total load in a genome experiencing a 3% gBGC hotspot with b = 0.0002 on average [according to estimates in humans (Spencer et al. 2006), and see also Galtier et al. 2009] and exposed to a flow of deleterious mutations with a gamma-shaped distribution of selection coefficients with mean smean = 0.0325 and shape parameter = 0.23 [according to estimates in humans (Eyre-Walker et al. 2006), and see also Galtier et al. 2009]. For simplicity, fitness is assumed to be multiplicative across loci. However, recombination and conversion are not distributed similarly in all organisms. For instance, recombination is probably not localized in hotspots in Caenorhabditis elegans (Rockman and Kruglyak 2009). I thus also tested more homogenous and more heterogeneous gBGC distributions. To make the different cases comparable, I based my computations on a constant total amount of gBGC (btotal = 0.0002 × 0.03 = 6 × 10−6). In addition to the first case (3% hotspots with b = 0.0002), I assumed that gBGC was homogeneously distributed over the whole genome, distributed over half of the genome, and in 0.3% hotspots with b = 0.002. For simplicity, all mutations are assumed to be codominant (), and the genomic mutation rate is set at U = 0.01 (see appendix b). The results are presented in Table 5. They show that in rather large populations, gBGC hotspots as low as 3% can increase the load two- or threefold. Even in small populations (N = 10,000, similar to effective size in humans), rare gBGC hotspots can increase the load by almost 10%. However, this analysis also shows that weak gBGC throughout the genome has a weak effect or can even reduce the load by compensating for the mutational bias. This occurs when b < ln(λ)/2Ne, as predicted above.
TABLE 5.
Genomic load at mutation–selection–drift equilibrium contributed by 106 loci with u = 10−8 (genomic mutation rate U = 0.01) and λ = 2, and s follows a gamma distribution with mean 0.0325 and shape 0.23
Load |
||||
---|---|---|---|---|
Without gBGC | Homogenous gBGC | Hotspots 1 | Hotspots 2 | |
N = 10,000 | ||||
F = 0 | 0.634 | 0.615 | 0.689 | 0.742 |
F = 0.2 | 0.637 | 0.624 | 0.666 | 0.706 |
F = 0.5 | 0.649 | 0.642 | 0.655 | 0.678 |
F = 0.7 | 0.670 | 0.667 | 0.670 | 0.682 |
F = 0.95 | 0.854 | 0.854 | 0.854 | 0.854 |
F = 0.99 | 0.993 | 0.993 | 0.993 | 0.993 |
N = 50,000 | ||||
F = 0 | 0.143 | 0.138 | 0.301 | 0.399 |
F = 0.2 | 0.142 | 0.131 | 0.240 | 0.309 |
F = 0.5 | 0.144 | 0.135 | 0.183 | 0.219 |
F = 0.7 | 0.150 | 0.144 | 0.165 | 0.185 |
F = 0.95 | 0.240 | 0.239 | 0.240 | 0.242 |
F = 0.99 | 0.496 | 0.496 | 0.496 | 0.496 |
N = 100,000 | ||||
F = 0 | 0.074 | 0.096 | 0.248 | 0.351 |
F = 0.2 | 0.071 | 0.074 | 0.181 | 0.253 |
F = 0.5 | 0.071 | 0.065 | 0.117 | 0.153 |
F = 0.7 | 0.073 | 0.068 | 0.093 | 0.111 |
F = 0.95 | 0.115 | 0.114 | 0.115 | 0.118 |
F = 0.99 | 0.257 | 0.257 | 0.257 | 0.257 |
Homogenous gBGC, the entire genome is affected by gBGC with b = 6 × 10−6; with hotpsots 1, 3% of the genome belongs to conversion hotspots with b = 2 × 10−4 (similar to human data); with hotspots 2, 0.3% of the genome belongs to conversion hotspots with b = 2 × 10−3.
Impacts of gBGC on inbreeding depression:
Inbreeding depression is defined as the reduction in fitness of selfed (and more generally inbred) individuals compared to outcrossed individuals,
![]() |
(15) |
where and
are the mean fitness of outcrosses and selfcrosses, respectively (Charlesworth and Charlesworth 1987; Charlesworth and Willis 2009). The approximation is very good in most conditions, because
under weak (s ≪ 1) and strong selection (x ≪ 1) (see Glémin et al. 2003). Similar to the load, considering both sites for which either S or W alleles are deleterious, in proportion q and 1 – q, respectively, we get
![]() |
(16) |
gBGC and the genetic basis of inbreeding depression in panmictic populations:
In infinite panmictic populations without gBGC, inbreeding depression depends only on mutation rates and dominance levels. Partially recessive mutations () contribute only to inbreeding depression, and the more recessive they are, the higher the inbreeding depression (Charlesworth and Charlesworth 1987). In finite populations, deterministic results hold for strongly deleterious mutations (s ≫ 1/Ne), which contribute mostly to inbreeding depression. Contrary to the load, weakly deleterious mutations (∼s ≤ 1/Ne) contribute little to inbreeding depression (Figure 4, a and c, and see Bataillon and Kirkpatrick 2000).
Figure 4.—
Inbreeding depression (×106) as a function of s without (a and c) or with (b and d) gBGC (b = 0.0002). (a and b) h = 0.2: thick lines, N = 5000; thin lines, N = 10,000; dashed lines, N = 50,000; dotted lines, N = 100,000. (c and d) N = 10,000: thick lines, h = 0.4; thin lines, h = 0.2; dashed lines, h = 0.1; dotted lines, h = 0.05. u = 10−6, λ = 2.
Like the load, gBGC affects both the magnitude and the structure of inbreeding depression. In infinite populations, and more generally for strongly deleterious alleles (Nes ≫ 1), replacing x by xeq given by Equations 4 in Equations 15 and 16 leads to
![]() |
(17a) |
![]() |
(17b) |
![]() |
(17c) |
The effect of gBGC on inbreeding depression is not monotonic. Like the load, gBGC increases inbreeding depression if b > hs(1 − 2q/(q + λ − qλ)). However, contrary to the load, a strong gBGC decreases inbreeding depression, which tends to 0 as b increases, while the load tends to qs (Equation 10c). An analysis of Equation 17b shows that mutations that maximize inbreeding depression are those that also maximize the load, i.e., S deleterious mutations with s ≈ 2b.
In finite populations, inbreeding depression must be integrated over the Φ distribution, which leads to
![]() |
(18) |
(see also Glémin et al. 2003). While it is not possible to get an analytical expression of (18), numerical computations (see appendix b) show that S deleterious mutations with s ≈ 2b also maximize inbreeding depression in finite populations (Figure 4). More broadly, inbreeding depression is maximal under the overdominant-like selection regime (gray area in Figure 2). Once again, even low to moderate gBGC markedly affects the genetic structure of inbreeding depression. First, mutations of intermediate effects contribute the most to inbreeding depression, i.e., up to one order of magnitude higher than strongly deleterious mutations (compare Figure 4a with 4b). Second, even nearly additive mutations can have a substantial effect (compare Figure 4c with 4d).
Since little is known about the distribution of dominance coefficients, especially the dominance of mildly deleterious mutations (of the order of b), it is difficult to quantitatively predict the full impact of gBGC on inbreeding depression. We can conclude that, on average, gBGC should increase inbreeding depression. However, further insight into mutational parameters is crucial to assess the quantitative impact of gBGC.
Joint effect of gBGC and mating system on the load and inbreeding depression:
Selfing, or more generally inbreeding, slightly reduces the segregating load through the purging of recessive mutations (Ohta and Cockerham 1974), but can substantially increase the fixation load because of the effective population size reduction under inbreeding: (see above and Pollak 1987; Nordborg 1997; Glémin 2007). In numerical examples, I assumed that α decreases with F according to the background selection model (Charlesworth et al. 1993; Nordborg et al. 1996), as in Glémin (2007). With gBGC, selfing thus has two opposite effects on the fixation load. Selfing increases the drift load sensu stricto but decreases the fixation load due to gBGC. A surprising consequence is that the load can be higher in outcrossing than in selfing populations (Figure 5). Quantitatively this is also expected, even with a gBGC hotspot affecting just 3% of the genome (Table 5). Another consequence is that a minimum load can be reached for intermediate selfing rates (Figure 5 and Table 5).
Figure 5.—
Effective population size (a and b) and the load (×106) (c–f) as a function of F for different gBGC intensities (thick lines, b = 0; thin lines, b = 0.0001; dashed lines, b = 0.0002; dotted lines, b = 0.0005). The effective population size depends on F under the background selection (BS) model (Charlesworth et al. 1993), using Equations 16 and 17 in Glémin (2007): , where U is the genomic deleterious mutation rate, R is the genomic recombination rate, sd is the mean selection coefficient against strongly deleterious mutations, and hd is their dominance coefficient. N = 10,000, U = 0.2, hd = 0.1, and sd = 0.05. (a, c, and e) R = 5, “weak” BS; (b, d, and f) R = 0.5, “strong” BS. (c and d) Load averaged over half GC and half AT deleterious alleles, with a bias in favor of AT alleles. (e and f) Load averaged over 10% of GC deleterious alleles and 90% of AT deleterious alleles with a bias in favor of AT alleles; see Figure 3. h = 0.5, u = 10−6, and λ = 2.
Generally, the effect of selfing is simpler for inbreeding depression. Purging, Ne reduction, and suppression of gBGC contribute to decreasing inbreeding depression in selfing populations (Figure 6a). However, there are special cases in which maximum inbreeding depression is reached for intermediate selfing rates (Figure 6b). In such cases, in outcrossing populations, gBGC is strong enough to sweep polymorphism out and reduce inbreeding depression (b > s, regime 1 in Figure 2). As the selfing rate increases, gBGC declines, and the selection dynamics become overdominant-like (regime 2, Figure 2), thus maximizing inbreeding depression. For high selfing rates, gBGC vanishes (regime 3 in Figure 2) and deleterious alleles are either purged or fixed if there is substantial drift. This is similar to the effect of selfing on inbreeding depression caused by asymmetrical overdominance, where inbreeding depression also peaks for intermediate selfing rates (Ziehe and Roberds 1989; Charlesworth and Charlesworth 1990). In the present case, the range of parameters leading to this peculiar behavior is narrow because the overdominant-like region depends on the selfing rates and can vanish either for low or for high selfing rates (Figure 2).
Figure 6.—
Inbreeding depression (×106) as a function of F for different gBGC intensities (thick lines, b = 0; thin lines, b = 0.0001; dashed lines, b = 0.0002; dotted lines, b = 0.0005). Inbreeding depression is averaged over half GC and half AT deleterious alleles. The effective population size depends on F as in Figure 5 (same parameters). (a) s = 0.002; (b) s = 0.0005; (c) s = 0.0002. h = 0.2, u = 10−6, and λ = 2.
DISCUSSION
I analyzed the effect of gBGC on the fate and consequences of deleterious mutations by extending previous theoretical studies to also include population processes, such as drift and inbreeding, and the whole range of mutations, including direction (W → S and S → W), strength, and dominance. gBGC affects S and W deleterious alleles in opposite ways. However, except in special cases, these opposite effects do not cancel out because S alleles are much more strongly affected than W alleles. Moreover, while significant gBGC levels are likely limited to recombination hotspots [only 3% of the genome in humans (Myers et al. 2005)], I showed that both the quantitative and the qualitative effects of gBGC on the load and inbreeding depression are far from being negligible.
Deleterious impacts of gBGC:
Segregation of deleterious alleles and genetic diseases:
Depending on selection and dominance levels, gBGC can slightly increase the frequency of deleterious S alleles before their elimination, drive them to fixation, or maintain them at intermediate frequencies for long periods of time in an overdominant-like way (Figure 2). In particular, the frequency of recessive alleles, even strongly deleterious ones, can be markedly increased by gBGC. Even moderate gBGC intensities can thus significantly affect lethal alleles. For instance, taking u = 10−8 and b = 0.0002 as estimated in human recombination hotspots (Myers et al. 2005), the frequency of a lethal allele can be more than doubled (see Equation 4d). On the contrary, gBGC helps in purging W deleterious alleles. We thus expect that (i) genes near recombination hotspots are more likely involved in genetic diseases and (ii) S alleles contribute disproportionately to genetic diseases, especially those found in appreciable frequencies. These predictions were recently confirmed through a genome-wide survey of disease-related mutations (A. Necsulea, A. Popa, D. N. Cooper, P. D. Stenson, D. Mouchiroud, C. Gautier, and L. Duret, unpublished results).
However, this kind of disease allele will be probably hard to identify individually through population genomic approaches. While formally equivalent in a one-locus model, recessive deleterious alleles balanced by gBGC and “true” balancing selection should leave different signatures at the molecular level. Because conversion tracts are likely only a few hundred base pairs long and mainly occur in recombination hotspots, only very short haplotypes are expected to be dragged up at intermediate frequency. It thus should be difficult to detect such alleles maintained in the population for a long time, contrary to other balancing selection mechanisms (Charlesworth 2006a).
Fraction of the load due to gBGC:
Through its effect on the fate of deleterious alleles, gBGC affects both the magnitude and the architecture of the genetic load (Figure 7). Under realistic conditions of mutation bias toward AT (λ > 1), gBGC slightly decreases the segregating load due to strongly deleterious mutations, but markedly increases the fixation load due to weakly deleterious mutations. For S mutations, the limit between the two mutation categories is mainly independent of the effective population size but depends on the intensity of gBGC. On the contrary, W mutations behave mainly as without gBGC and the limit between the two mutation categories is still ∼ s ≈ 1/2Ne. Quantitatively, the effect of gBGC on the segregating load is rather weak. The main effect of gBGC is thus to increase the fixation load by the proportion of mutations spanning ∼1/2Ne < s < 2b. The paradoxical consequences are that (i) the fraction of the fixation load induced by gBGC becomes roughly independent of population size and (ii) the proportion of the load due to gBGC increases with effective population size.
Figure 7.—
Effect of gBGC on the load structure. The gamma-shaped curve represents the distribution of selection coefficients, s. Deleterious mutations can either segregate in low frequency (open part), go to fixation because of gBGC (shaded part), or go to fixation because of drift (solid part). Compared to classical predictions without gBGC, gBGC adds the shaded part. Drift load sensu stricto and segregating load are also affected by gBGC but very weakly. In small populations (a), the drift load is higher and the gBGC fixation load lower than in large populations (b).
Quantitatively, gBGC can have a significant effect. Assuming only 3% the genome experiencing gBGC hotspots of average intensity b = 0.0002, gBGC can increase the load by two- to threefold in large populations (Table 5). In humans (using N = 10,000), gBGC could contribute to 10% of the load. However, this is only a rough estimate based on selection parameters estimated from coding regions. On one hand, recombination seems to be depressed within genes (Myers et al. 2005; Coop et al. 2008), which should reduce the load due to gBGC. For instance, if gBGC is twofold lower within genes than around them (b = 0.0001), only 5% of the load would be due to gBGC (data not shown). On the other hand, noncoding functional regions can also contribute to the load, especially in gene control regions (Keightley et al. 2005; Eory et al. 2010), where recombination can be high (Coop et al. 2008). Despite these uncertainties, gBGC may have fundamental fitness implications, not merely practical consequences for the detection of selection at the molecular level (Berglund et al. 2009; Galtier et al. 2009). These results confirm the pertinence of the “genomic Achilles' heel” metaphor proposed by Galtier and Duret (2007).
Fixation load and heterosis:
The load structure is also crucial for predicting heterosis, the increase in fitness of crosses between populations or lines compared to parents. Heterosis is created and proportional to among-population variance in deleterious allele frequencies (Whitlock et al. 2000; Glémin et al. 2003). Without gBGC, mutations contributing to heterosis also contribute to the drift load, and heterosis is mainly expected in small and isolated populations (Whitlock et al. 2000; Glémin et al. 2003). As discussed above, gBGC generates the fixation load in large populations, especially for s ∼ 2b (see above). In addition, under these conditions, the variance in allele frequencies is also high. gBGC compensates for selection and deleterious alleles behave almost neutrally. The variance in allele frequency is thus roughly that expected under mutation–drift equilibrium, VΦ[x] = λ /(1 + λ)2(1 + 4Nu(1 + λ)) (Wright 1937), which can be high, even in large populations. This is in sharp contrast with the low variance expected without gBGC, VΦ[x] ≈ u/hs(1 + 4Nhs) (Bataillon and Kirkpatrick 2000). While this rationale should be confirmed by detailed analyses of the interaction of selection and gBGC in subdivided populations, it leads to the surprising conclusion that gBGC could create heterosis in large populations.
Minimum load and the evolution of gBGC and recombination landscapes:
Although gBGC may have deleterious fitness consequences, it is surprising that it evolved in many taxa (Table 1 and reviewed in Duret and Galtier 2009a). Birdsell (2002) initially suggested that gBGC may have evolved as a response to mutational bias toward AT (λ > 1, here). Indeed, I show that a minimum load is reached for weak gBGC (b ≈ ln(λ)/4N, Equation 14). This result is very general whatever the distribution of fitness effects of mutations (appendix d). However, the range of optimal gBGC is narrow, and gBGC increases the load as far as b > ln(λ)/2N (appendix c). In humans, using N = 10,000 and λ = 2, gBGC levels that minimize the load are ∼1.17 × 10−5, i.e., one order of magnitude lower than the average bias observed in recombination hotspots (Myers et al. 2005). However, selection on conversion modifiers will not necessarily minimize the load because of gametic disequilibrium generated between modifiers and fitness loci (Bengtsson and Uyenoyama 1990). Selection for limitation of somatic AT-biased mutations could also have selected for GC-biased mismatch repair machinery (Brown and Jiricny 1987). If the bias level that would be selected for somatic reasons is >ln(λ)/2N, a side effect would be the generation of a substantial load at the population level. Finally, it is interesting to note that when synonymous codon positions are under selection for translation accuracy, optimal gBGC levels can be higher than gBGC levels that minimize the protein load, especially when most optimal codons end in G or C ().
Conversely, gBGC could also affect the evolution of recombination landscapes, which could evolve to reduce the gBGC load. Surprisingly, for a given recombination/conversion level, the hotspot distribution does not appear to be optimal (Table 5). Homogenous but lower gBGC has little effect on the load and could even reduce it by offsetting the mutational bias, provided that b < ln(λ)/2N. However, in species where recombination mainly occurs at hotspots, such as in humans, mice, and yeasts (Nishant and Rao 2005), one can speculate that the hotspot localization outside genes could be a response to avoid the deleterious effects of gBGC.
Up to now, these verbal arguments have not been assessed theoretically (but see Bengtsson and Uyenoyama 1990 for a different kind of conversion bias). Population genetics models are necessary to test these hypotheses concerning the evolution of gBGC and recombination landscapes and to pinpoint the key parameters that might govern their evolution.
gBGC and the evolution of mating systems:
Deleterious mutations also play a crucial role in the evolution of mating systems. They are the main source of inbreeding depression, which balances the automatic advantage of selfing. The drift load is also thought to contribute to the extinction of selfing species. Since they are mainly homozygous, selfing species are mostly free from gBGC and its deleterious impacts. I discuss below how this might affect the evolution of mating systems.
Inbreeding depression and the shift in mating systems:
Inbreeding depression plays a key role in the evolution of mating systems (Charlesworth and Charlesworth 1987; Charlesworth 2006b). Since it balances the automatic advantage of selfing, high inbreeding depression favors outcrossing, while selfing can evolve when it is low. Moreover, selfing helps to purge strongly deleterious mutations, thus decreasing inbreeding depression. This positive feedback reinforces the disruptive selection on the selfing rate and prevents the transition from selfing to outcrossing (Lande and Schemske 1985).
Theoretical results suggest that, in most conditions, gBGC would reinforce inbreeding depression in outcrossing populations (Figure 6), which would prevent the evolution of selfing. In reverse, if selfing is initially selected for, recurrent selfing would reduce the load through both purging and avoidance of gBGC. Under this scenario, gBGC would reinforce disruptive selection on mating systems. However, under some conditions (see Figure 6), inbreeding depression peaks at intermediate selfing rates, as observed for asymmetrical overdominance (Ziehe and Roberds 1989; Charlesworth and Charlesworth 1990). In theory, this could prevent the shift toward complete selfing and maintain stable mixed mating systems (Charlesworth and Charlesworth 1990; Uyenoyama and Waller 1991). However, this pattern is observed under restrictive conditions and it is very unlikely on the whole-genome scale. Dominance patterns are crucial for predicting inbreeding depression, especially with gBGC. Contrary to the load, it is thus difficult to evaluate the quantitative impact of gBGC on inbreeding depression. However, increased inbreeding depression in outcrossing species subject to gBGC seems to be the most likely scenario.
gBGC and the long-term evolution of mating systems:
In the long term, the gBGC-induced load also challenges the “dead-end hypothesis,” which posits that, because of the reduction of selection efficacy, self-fertilizing species would accumulate weakly deleterious mutations in the long term, eventually leading to extinction (Takebayashi and Morrell 2001). Because of gBGC, not drift, outcrossing species could also accumulate a load of weakly deleterious mutations (Figure 7), and they could suffer from a higher load than highly self-fertilizing species (Table 4). In line with this hypothesis, Haudry et al. (2008) found that in two outcrossing grass species, but not in two self-fertilizing ones, the dN/dS ratio is significantly higher for genes exhibiting GC enrichment. They speculated that substitutions in these genes might contribute to increasing the load in these two outcrossing grass species. Such results are still very sparse. In plants, evidence of strong gBGC is mainly restricted to grasses (but see Wright et al. 2007). It will be necessary to conduct more in-depth studies to assess the phylogenetic distribution of gBGC in plants and other hermaphrodite organisms and to further test the genomic Achilles' heel hypothesis in relation to mating systems. While theoretically possible, the quantitative effect of gBGC on the evolution of mating systems remains a new, open, and challenging question.
Conclusion:
I showed that the interaction between gBGC and selection might have surprising qualitative consequences on load and inbreeding depression patterns. Given the few quantitative data available on gBGC levels and selection intensities (mainly in humans), it turns out that even weak genome-wide gBGC can have significant fitness impacts. gBGC should be taken into account not only for sequence analyses (Berglund et al. 2009; Galtier et al. 2009), but also for its potential fitness consequences, for instance concerning genetic diseases. Interferences between gBGC and selection also give rise to new questions on the evolution of mating systems. However, most of the challenging conclusions given here have yet to be quantitatively evaluated. Quantification of gBGC and its interaction with selection in various organisms will be crucial in the future.
Acknowledgments
I thank N. Galtier, L. Duret, A. Poppa, and C. Gautier for discussions and two anonymous reviewers for helpful insights and suggestions to improve the manuscript. I also thank L. Duret and A. Necsulea for sharing a manuscript before publication. This is manuscript Institut des Sciences de l'Evolution de Montpellier ISEM 2010-029. This work was supported by the French Centre National de la Recherche Scientifique and Agence Nationale de la Recherche (ANR-08-GENM-036-01).
APPENDIX A: GENERAL DERIVATIONS
Analysis of the deterministic model:
Equation 1 gives the change in allele frequency due to mutation, selection, and gBGC, as follows:
![]() |
(A1a) |
if S is deleterious and
![]() |
(A1b) |
if W is deleterious. If W is deleterious, both selection and gBGC maintain the deleterious alleles at low frequency, so that x ≪ 1. Linearizing Δx around 0 and neglecting back mutation gives Δx ≈ λu − b(1 − F)x − sx(h + F − hF), which leads to (3). If S is the deleterious allele, this leads to three selection regimes. If gBGC is weak compared to selection (b ≪ s), x ≪ 1, and the linear approximation gives Δx ≈ u + b(1 − F)x − sx(h + F − hF), which leads to (4a). If gBGC is strong (b ≫ s), x is very close to 1. Linearizing Δx around 1 and neglecting direct mutation gives Δx ≈ λu + b(1 − F)(1 − x) − s(1 − x)(1 − h + hF), which leads to (4b). Finally, if gBGC and selection are of similar intensity, the two forces can maintain the deleterious allele at intermediate frequency and mutations can be neglected, Δx ≈ x(1 − x)(b(1 − F) − s)h + F − hF + (1 − F)(1 − 2h)x)), which leads to (4c). In the special case of fully recessive alleles in panmixia (F = 0), using x ≪1, Δx can be approximated by Δx ≈ u + λux + bx − sx2, which leads to (4d).
Analysis of the deterministic model:
In finite populations, the Wright (1937) stationary distribution is given by
![]() |
(A2) |
where Mx = Δx, given by Equation 1, and Vx = x(1 − x)/2Ne are the infinitesimal mean and variance of changes in allele frequency. K is an integration constant, ensuring that and Ne = α(N/(1 + F)). Integration of (A2) leads to (6a) and (6b).
To compute the probability of fixation, let us note
![]() |
(A3) |
where Mx = Δxconversion + Δxselection and Vx is the same as in (A2), and the probability of fixation starting from frequency p0 is
![]() |
(A4) |
(Kimura 1962).
For a new mutation (p0 = 1/2N) under genic selection in panmixia, (A4) simply reduces to (7a) and (7b). General analytical expressions for Pfix can be obtained but they are formidable and not given here.
The mean sojourn and fixation times of a new mutation can be obtained using classical diffusion results (e.g., Ewens 2004),
![]() |
(A5) |
and
![]() |
(A6) |
where with G defined in (A3), and p0 = 1/2N.
The mean time until the mutant allele reaches the deterministic equilibrium frequency, xeq, for the first time is given by the Kimura and Ohta (1973) expression, which gives the mean time to reach a given frequency for the first time, starting from frequency p0 < xeq,
![]() |
(A7) |
where
![]() |
.
For fully recessive mutations in panmixia, the equilibrium frequency is well approximated by xeq = b/(b +s) (Equation 4c). Under the overdominance-like regime, if the allele is not initially lost, it can persist for a long time (e.g., Takahata 1990). To evaluate this conditional persistence, I also computed the mean sojourn time of an allele starting from frequency xeq, using p0 = xeq in (A5).
In the general case, these expressions have no analytical expression and numerical integrations were performed (see appendix b).
APPENDIX B: NUMERICAL INTEGRATIONS
Stationary distribution and time properties:
Because Φ(x) given by Equations 5a and Equations 5b can diverge in 0, 1, or both, I adapted Kimura et al.'s (1963) quadrature method to facilitate the numerical integration of EΦ[xi]. The principle is given for S deleterious alleles.
Let qi(x) = x4(αN/(1 + F))u − 1 + i and r(x) = (1 − x)4(αN/(1 + F))λu − 1e−2(αN/(1 + F))(s(1 + F) − 2b(1 − F)), which verifies
![]() |
Let
![]() |
and
![]() |
Noting φ(x) = Φ(x)/K, we thus have
![]() |
and
![]() |
Due to this manipulation, the functions that are numerically integrated do not diverge. Numerical integrations were done using the NIntegrate function of the Mathematica software (Wolfram 1996), setting the maximum recursion at 50, working precision at 15, and other options at default values.
The other integrals converge so I performed simple numerical integrations using the NIntegrate function of the Mathematica software (Wolfram 1996), setting maximum recursion at 50, working precision at 15, and other options at default values.
Total genomic load:
To compute the load and on the whole-genome scale, I integrated Equation 11 over a distribution of mutational effects. I assumed a gamma distribution of selection coefficients, ψ(s), with mean smean = 0.0325 and shape parameter = 0.23, according to Eyre-Walker et al. (2006). Since the load weakly depends on the dominance levels, I assumed . For simplicity, I assumed that the fitness is multiplicative, such that
![]() |
where Li is the load contributed by the locus i, and n is the number of loci contributing to the load. I used u = 10−8 and n = 106 so that the genomic deleterious mutation rate is U = 10−2. This is much lower than current estimates in humans (U > 1, Keightley and Eyre-Walker 2000; Keightley et al. 2005, 2006). However, using such values leads to unbearable loads under the multiplicative fitness model. I performed numerical integration using the quadrature method (see above) and the function NIntegrate of Mathematica software, setting maximum recursion at 50 and working precision at 15.
APPENDIX C: LOAD APPROXIMATIONS
Genic selection in panmixia:
For genic selection in panmixia, an interesting approximation can be obtained for the load as follows. Consider that the deleterious allele can be either fixed with probability pF, causing the load s, or segregating at deterministic frequency with probability 1 − pF, causing the load Ldet. The probability of being fixed for the deleterious allele, pF, is approximately given by
![]() |
(C1) |
for S deleterious alleles and
![]() |
(C2) |
for W deleterious alleles (e.g., Bulmer 1991; Lande 1998).
Under genic selection and panmixia, this reduces to
![]() |
(C3) |
if S is the deleterious allele and
![]() |
(C4) |
if W is the deleterious allele (see also Kimura et al. 1963; Bulmer 1991; Lande 1998).
This leads to Equations 12a and Equations 12b.
Maximum load:
The deleterious effect of mutations that maximizes the load can be obtained by solving in s: ∂L/∂s = 0. Approximate solutions can be obtained through the following rationale (for the case without gBGC, see also Kimura et al. 1963). Mutations that cause the highest load are the most deleterious ones among those that can reach fixation. Fixation can be reached because of drift or gBGC. We can thus simply maximize the first part of Equations 12, that is, spF. The selection coefficients that maximize the load, smax, are thus solutions in s of
![]() |
(C5) |
for W deleterious alleles and
![]() |
(C6) |
for S deleterious alleles.
This leads to
![]() |
(C7) |
if W is the deleterious allele and
![]() |
(C8) |
if S is the deleterious allele, where PL is the product log function (Abramowitz and Stegun 1970).
For the W deleterious allele, 1/2N < smax < (1/2N)(1 + PL(λe−1)), which is ∼1/2N < smax < 1/N for realistic λ values. For the S deleterious allele, (1/2N)(1 + PL(e−1/λ)) < smax < (1/2N)(PL(e4Nb/λ)). For realistic λ-values, the minimum value is composed between 1/2N and 1/N and the maximum one between b and 2b if Nb ≫ 1. So we have ∼min(1/2N, b) < smax < max(1/N,2b).
Minimum load:
Similarly, one can compute the gBGC level that minimizes the total load by solving in b: ∂L/∂b = 0. Here, I compute separately the gBGC level that minimizes the segregating load and the fixation load. gBGC weakly decreases the load due to W deleterious mutations. To minimize the total segregating load, the gBGC level must weakly increase the load due to S deleterious alleles, so b < hs. Using Equation 10a, we thus need to minimize q(2uhs/(hs − b)) + (1 − q)(2λuhs/(hs + b)). The solution is
![]() |
(C9) |
which reduces to
![]() |
(C10) |
for .
As expected, gBGC can minimize the load only when λ > 1, that is, when the mutation is biased toward AT. The gBGC level that minimizes the drift load is given by the value that minimizes
![]() |
The solution is
![]() |
(C11) |
which reduces to
![]() |
(C12) |
for . Interestingly, (C12) is independent of s. In most cases,
;
is thus a very good approximation of the gBGC level that minimizes the total load, even when assuming a distribution of selection coefficients (appendix d).
Finally, we can compute gBGC levels above which it increases the fixation load by solving in b,
![]() |
which leads to
![]() |
(C14) |
In most cases, we also have , so gBGC levels that decrease the segregating load should increase the fixation load.
APPENDIX D: NUMERICAL VALIDATION OF EQUATIONS 12 AND 14
Equations 12:
Equations 12 are approximations for the exact Equation 10 that gives the load due to codominant mutations () in panmixia. Figure D1 and Figure D2 show that, for S deleterious alleles, (12a) is a very accurate approximation for (11) for a wide range of parameters. This is also true for W deleterious alleles (not shown):
(a) N = 10,000; b = 0, b = 2 × 10−3, b = 2 × 10−4, b = 5 × 10−5 (from left to right).
(b) b = 2 × 10−4 ; s = 10−2, s = 10−3, s = 10−4, s = 10−5 (from top left to bottom left).
Figure D1.—
N = 10,000; b = 0, b = 2 × 10−3, b = 2 × 10−4, b = 5 × 10−5 (from left to right).
Figure D2.—
b = 2 × 10−4 ; s = 10−2, s = 10−3, s = 10−4, s = 10−5 (from top left to bottom left).
Equation 14:
The total genomic load is computed as explained in appendix b. Then the value of b that minimizes the load was numerically computed through a simple iterative algorithm. Numerical results were compared to Equation 14. Results are given (×10−3) for various mutational biases (λ), population sizes (N), and shape of the gamma distribution of deleterious mutations (β). (See Table 6).
TABLE 6.
gBGC levels minimizing the total genomic load
Population size | Equation 14 | Numeric β = 0.01 | Numeric β = 0.23 | Numeric β = 1 | |
---|---|---|---|---|---|
λ = 2 | N = 1,000 | 0.173 | 0.174 | 0.174 | 0.174 |
N = 10,000 | 0.0173 | 0.0174 | 0.0174 | 0.0174 | |
N = 100,000 | 0.00173 | 0.00178 | 0.00179 | 0.00181 | |
λ = 5 | N = 1,000 | 0.402 | 0.403 | 0.403 | 0.403 |
N = 10,000 | 0.0402 | 0.0404 | 0.0406 | 0.0410 | |
N = 100,000 | 0.00402 | 0.00411 | 0.00412 | 0.00434 | |
λ = 10 | N = 1,000 | 0.576 | 0.577 | 0.577 | 0.577 |
N = 10,000 | 0.0576 | 0.0579 | 0.0579 | 0.0581 | |
N = 100,000 | 0.00576 | 0.00596 | 0.00597 | 0.00646 |
References
- Abramowitz, M., and I. A. Stegun, 1970. Handbook of Mathematical Functions. Dover, New York.
- Bataillon, T., and M. Kirkpatrick, 2000. Inbreeding depression due to mildly deleterious mutations in finite populations: size does matter. Genet. Res. 75 75–81. [DOI] [PubMed] [Google Scholar]
- Bengtsson, B. O., 1990. The effect of biased conversion on the mutation load. Genet. Res. 55 183–187. [DOI] [PubMed] [Google Scholar]
- Bengtsson, B. O., and M. K. Uyenoyama, 1990. Evolution of the segregation ratio: modification of gene conversion and meiotic drive. Theor. Popul. Biol. 38 192–218. [DOI] [PubMed] [Google Scholar]
- Berglund, J., K. S. Pollard and M. T. Webster, 2009. Hotspots of biased nucleotide substitutions in human genes. PLoS Biol. 7 e26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Birdsell, J. A., 2002. Integrating genomics, bioinformatics, and classical genetics to study the effects of recombination on genome evolution. Mol. Biol. Evol. 19 1181–1197. [DOI] [PubMed] [Google Scholar]
- Brown, T. C., and J. Jiricny, 1987. A specific mismatch repair event protects mammalian cells from loss of 5-methylcytosine. Cell 50 945–950. [DOI] [PubMed] [Google Scholar]
- Bulmer, M., 1991. The selection-mutation-drift theory of synonymous codon usage. Genetics 129 897–907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caballero, A., and W. G. Hill, 1992. Effects of partial inbreeding on fixation rates and variation of mutant genes. Genetics 131 493–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charlesworth, B., 1992. Evolutionary rates in partially self-fertilizing species. Am. Nat. 140 126–148. [DOI] [PubMed] [Google Scholar]
- Charlesworth, B., and D. Charlesworth, 1998. Some evolutionary consequences of deleterious mutations. Genetica 102/103 3–19. [PubMed] [Google Scholar]
- Charlesworth, B., M. T. Morgan and D. Charlesworth, 1993. The effect of deleterious mutations on neutral molecular variation. Genetics 134 1289–1303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charlesworth, D., 2006. a Balancing selection and its effects on sequences in nearby genome regions. PLoS Genet. 2 e64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charlesworth, D., 2006. b Evolution of plant breeding systems. Curr. Biol. 16 R726–R735. [DOI] [PubMed] [Google Scholar]
- Charlesworth, D., and B. Charlesworth, 1987. Inbreeding depression and its evolutionary consequences. Annu. Rev. Ecol. Syst. 18 237–268. [Google Scholar]
- Charlesworth, D., and B. Charlesworth, 1990. Inbreeding depression with heterozygote advantage and its effect on selection for modifiers changing the outcrossing rate. Evolution 44 1469–1489. [DOI] [PubMed] [Google Scholar]
- Charlesworth, D., and J. H. Willis, 2009. The genetics of inbreeding depression. Nat. Rev. Genet. 10 783–796. [DOI] [PubMed] [Google Scholar]
- Coop, G., X. Wen, C. Ober, J. K. Pritchard and M. Przeworski, 2008. High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans. Science 319 1395–1398. [DOI] [PubMed] [Google Scholar]
- Crow, J. F., 1993. Mutation, mean fitness, and genetic load, pp. 3–42 in Oxford Surveys in Evolutionary Biology, Vol. 9, edited by D. Futuyma and J. Antonovics. Oxford University Press, Oxford.
- Duret, L., and P. F. Arndt, 2008. The impact of recombination on nucleotide substitutions in the human genome. PLoS Genet. 4 e1000071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duret, L., and N. Galtier, 2009. a Biased gene conversion and the evolution of mammalian genomic landscapes. Annu. Rev. Genomics Hum. Genet. 10 285–311. [DOI] [PubMed] [Google Scholar]
- Duret, L., and N. Galtier, 2009. b Comment on “Human-specific gain of function in a developmental enhancer”. Science 323 714 (author reply: 714). [DOI] [PubMed] [Google Scholar]
- Duret, L., and D. Mouchiroud, 1999. Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, Arabidopsis. Proc. Natl. Acad. Sci. USA 96 4482–4487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duret, L., A. Eyre-Walker and N. Galtier, 2006. A new perspective on isochore evolution. Gene 385 71–74. [DOI] [PubMed] [Google Scholar]
- Duret, L., J. Cohen, C. Jubin, P. Dessen, J. F. Gout et al., 2008. Analysis of sequence variability in the macronuclear DNA of Paramecium tetraurelia: a somatic view of the germline. Genome Res. 18 585–596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eory, L., D. L. Halligan and P. D. Keightley, 2010. Distributions of selectively constrained sites and deleterious mutation rates in the hominid and murid genomes. Mol. Biol. Evol. 27 177–192. [DOI] [PubMed]
- Ewens, W. J., 2004. Mathematical Population Genetics. I. Theoretical Introduction. Springer, New York.
- Eyre-Walker, A., M. Woolfit and T. Phelps, 2006. The distribution of fitness effects of new deleterious amino acid mutations in humans. Genetics 173 891–900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galtier, N., and L. Duret, 2007. Adaptation or biased gene conversion? Extending the null hypothesis of molecular evolution. Trends Genet. 23 273–277. [DOI] [PubMed] [Google Scholar]
- Galtier, N., G. Piganeau, D. Mouchiroud and L. Duret, 2001. GC-content evolution in mammalian genomes: the biased gene conversion hypothesis. Genetics 159 907–911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galtier, N., E. Bazin and N. Bierne, 2006. GC-biased segregation of noncoding polymorphisms in Drosophila. Genetics 172 221–228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galtier, N., L. Duret, S. Glémin and V. Ranwez, 2009. GC-biased gene conversion promotes the fixation of deleterious amino acid changes in primates. Trends Genet. 25 1–5. [DOI] [PubMed] [Google Scholar]
- Glémin, S., 2003. How are deleterious mutations purged? Drift versus nonrandom mating. Evolution 57 2678–2687. [DOI] [PubMed] [Google Scholar]
- Glémin, S., 2007. Mating systems and the efficacy of selection at the molecular level. Genetics 177 905–916. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glémin, S., J. Ronfort and T. Bataillon, 2003. Patterns of inbreeding depression and architecture of the load in subdivided populations. Genetics 165 2193–2212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glémin, S., E. Bazin and D. Charlesworth, 2006. Impact of mating systems on patterns of sequence polymorphism in flowering plants. Proc. Biol. Sci. 273 3011–3019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gutz, H., and J. F. Leslie, 1976. Gene conversion: a hitherto overlooked parameter in population genetics. Genetics 83 861–866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haldane, J. B. S., 1937. The effect of variation on fitness. Am. Nat. 71 337–349. [Google Scholar]
- Haudry, A., A. Cenci, C. Guilhaumon, E. Paux, S. Poirier et al., 2008. Mating system and recombination affect molecular evolution in four Triticeae species. Genet. Res. 90 97–109. [DOI] [PubMed] [Google Scholar]
- International Chicken Genome Sequencing Consortium, 2004. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature 432 695–716. [DOI] [PubMed] [Google Scholar]
- Jancek, S., S. Gourbiere, H. Moreau and G. Piganeau, 2008. Clues about the genetic basis of adaptation emerge from comparing the proteomes of two Ostreococcus ecotypes (Chlorophyta, Prasinophyceae). Mol. Biol. Evol. 25 2293–2300. [DOI] [PubMed] [Google Scholar]
- Keightley, P., and A. Eyre-Walker, 2000. Deleterious mutations and the evolution of sex. Science 290 331–333. [DOI] [PubMed] [Google Scholar]
- Keightley, P. D., M. J. Lercher and A. Eyre-Walker, 2005. Evidence for widespread degradation of gene control regions in hominid genomes. PLoS Biol. 3 e42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keightley, P. D., M. J. Lercher and A. Eyre-Walker, 2006. Understanding the degradation of hominid gene control. PLoS Comput. Biol. 2 e19 (author reply: e26). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimura, M., 1962. On the probability of fixation of mutant genes in a population. Genetics 47 713–719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimura, M., and T. Ohta, 1971. Theoretical Aspects of Population Genetics. Princeton University Press, Princeton, NJ.
- Kimura, M., and T. Ohta, 1973. The age of a neutral mutant persisting in a finite population. Genetics 75 199–212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimura, M., T. Maruyama and J. F. Crow, 1963. The mutation load in small populations. Genetics 48 1303–1312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuraku, S., J. Ishijima, C. Nishida-Umehara, K. Agata, S. Kuratani et al., 2006. cDNA-based gene mapping and GC3 profiling in the soft-shelled turtle suggest a chromosomal size-dependent GC bias shared by sauropsids. Chromosome Res. 14 187–202. [DOI] [PubMed] [Google Scholar]
- Lamb, B. C., and S. Helmi, 1982. The extent to which gene conversion can change allele frequencies in populations. Genet. Res. 39 199–217. [Google Scholar]
- Lande, R., 1994. Risk of population extinction from fixation of new deleterious mutations. Evolution 48 1460–1469. [DOI] [PubMed] [Google Scholar]
- Lande, R., 1998. Risk of population extinction from fixation of deleterious and reverse mutations. Genetica 102/103 21–27. [PubMed] [Google Scholar]
- Lande, R., and D. W. Schemske, 1985. The evolution of self-fertilization and inbreeding depression in plants. I. Genetic models. Evolution 39 24–40. [DOI] [PubMed] [Google Scholar]
- Lynch, M., 2007. The Origin of Genome Architecture. Sinauer Associates, Sunderland, MA.
- Lynch, M., J. Conery and R. Burger, 1995. Mutation accumulation and the extinction of small populations. Am. Nat. 146 489–518. [Google Scholar]
- Mancera, E., R. Bourgon, A. Brozzi, W. Huber and L. M. Steinmetz, 2008. High-resolution mapping of meiotic crossovers and non-crossovers in yeast. Nature 454 479–485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marais, G., 2003. Biased gene conversion: implications for genome and sex evolution. Trends Genet. 19 330–338. [DOI] [PubMed] [Google Scholar]
- Marais, G., D. Mouchiroud and L. Duret, 2001. Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes. Proc. Natl. Acad. Sci. USA 98 5688–5692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marais, G., D. Mouchiroud and L. Duret, 2003. Neutral effect of recombination on base composition in Drosophila. Genet. Res. 81 79–87. [DOI] [PubMed] [Google Scholar]
- Marais, G., B. Charlesworth and S. I. Wright, 2004. Recombination and base composition: the case of the highly self-fertilizing plant Arabidopsis thaliana. Genome Biol. 5 R45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marsolier-Kergoat, M. C., and E. Yeramian, 2009. GC content and recombination: reassessing the causal effects for the Saccharomyces cerevisiae genome. Genetics 183 31–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McVean, G. A., S. R. Myers, S. Hunt, P. Deloukas, D. R. Bentley et al., 2004. The fine-scale structure of recombination rate variation in the human genome. Science 304 581–584. [DOI] [PubMed] [Google Scholar]
- Meunier, J., and L. Duret, 2004. Recombination drives the evolution of GC-content in the human genome. Mol. Biol. Evol. 21 984–990. [DOI] [PubMed] [Google Scholar]
- Myers, S., L. Bottolo, C. Freeman, G. McVean and P. Donnelly, 2005. A fine-scale map of recombination rates and hotspots across the human genome. Science 310 321–324. [DOI] [PubMed] [Google Scholar]
- Nagylaki, T., 1983. a Evolution of a finite population under gene conversion. Proc. Natl. Acad. Sci. USA 80 6278–6281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nagylaki, T., 1983. b Evolution of a large population under gene conversion. Proc. Natl. Acad. Sci. USA 80 5941–5945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nielsen, R., 2005. Molecular signatures of natural selection. Annu. Rev. Genet. 39 197–218. [DOI] [PubMed] [Google Scholar]
- Nishant, K. T., and M. R. S. Rao, 2005. Molecular features of meiotic recombination hotspots. BioEssays 28 45–56. [DOI] [PubMed] [Google Scholar]
- Nordborg, M., and P. Donnelly, 1997. The coalescent process with selfing. Genetics 146 1185–1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nordborg, M., B. Charlesworth and D. Charlesworth, 1996. The effect of recombination on background selection. Genet. Res. 67 159–174. [DOI] [PubMed] [Google Scholar]
- Ohta, T., and C. C. Cockerham, 1974. Detrimental genes with partial selfing and effects on a neutral locus. Genet. Res. 23 191–200. [DOI] [PubMed] [Google Scholar]
- Pollak, E., 1987. On the theory of partially inbreeding finite populations. I. Partial selfing. Genetics 117 353–360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pollard, K. S., S. R. Salama, B. King, A. D. Kern, T. Dreszer et al., 2006. a Forces shaping the fastest evolving regions in the human genome. PLoS Genet. 2 e168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pollard, K. S., S. R. Salama, N. Lambert, M. A. Lambot, S. Coppens et al., 2006. b An RNA gene expressed during cortical development evolved rapidly in humans. Nature 443 167–172. [DOI] [PubMed] [Google Scholar]
- Prabhakar, S., J. P. Noonan, S. Paabo and E. M. Rubin, 2006. Accelerated evolution of conserved noncoding sequences in humans. Science 314 786. [DOI] [PubMed] [Google Scholar]
- Prabhakar, S., A. Visel, J. A. Akiyama, M. Shoukry, K. D. Lewis et al., 2008. Human-specific gain of function in a developmental enhancer. Science 321 1346–1350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rockman, M. V., and L. Kruglyak, 2009. Recombinational landscape and population genomics of Caenorhabditis elegans. PLoS Genet. 5 e1000419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spencer, C. C., P. Deloukas, S. Hunt, J. Mullikin, S. Myers et al., 2006. The influence of recombination on human genetic diversity. PLoS Genet. 2 e148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takahata, N., 1990. A simple genealogical structure of strongly balanced allelic lines and trans-species evolution of polymorphism. Proc. Natl. Acad. Sci. USA 87 2419–2423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takebayashi, N., and P. L. Morrell, 2001. Is self-fertilization an evolutionary dead end? Revisiting an old hypothesis with genetic theories and a macroevolutionary approach. Am. J. Bot. 88 1143–1150. [PubMed] [Google Scholar]
- Uyenoyama, M. K., and D. M. Waller, 1991. Coevolution of self-fertilization and inbreeding depression. II. Symmetric overdominance in viability. Theor. Popul. Biol. 40 47–77. [DOI] [PubMed] [Google Scholar]
- Wang, L. J., and M. J. Roossinck, 2006. Comparative analysis of expressed sequences reveals a conserved pattern of optimal codon usage in plants. Plant Mol. Biol. 61 699–710. [DOI] [PubMed] [Google Scholar]
- Webster, M. T., E. Axelsson and H. Ellegren, 2006. Strong regional biases in nucleotide substitution in the chicken genome. Mol. Biol. Evol. 23 1203–1216. [DOI] [PubMed] [Google Scholar]
- Whitlock, M. C., P. K. Ingvarsson and T. Hatfield, 2000. Local drift load and the heterosis of interconnected populations. Heredity 84 452–457. [DOI] [PubMed] [Google Scholar]
- Wolfram, S., 1996. The Mathematica Book. Cambridge University Press, Cambridge, UK.
- Wright, S., 1937. The distribution of gene frequencies in populations. Proc. Natl. Acad. Sci. USA 23 307–320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wright, S. I., G. Iorgovan, S. Misra and M. Mokhtari, 2007. Neutral evolution of synonymous base composition in the Brassicaceae. J. Mol. Evol. 64 136–141. [DOI] [PubMed] [Google Scholar]
- Ziehe, M., and J. H. Roberds, 1989. Inbreeding depression due to overdominance in partially self-fertilizing plant populations. Genetics 121 861–868. [DOI] [PMC free article] [PubMed] [Google Scholar]