Skip to main content
Genetics logoLink to Genetics
. 2007 Oct;177(2):905–916. doi: 10.1534/genetics.107.073601

Mating Systems and the Efficacy of Selection at the Molecular Level

Sylvain Glémin 1,1
PMCID: PMC2034653  PMID: 17954924

Abstract

Mating systems are thought to play a key role in molecular evolution through their effects on effective population size (Ne) and effective recombination rate. Because of reduced Ne, selection in self-fertilizing species is supposed to be less efficient, allowing fixation of weakly deleterious alleles or lowering adaptation, which may jeopardize their long-term evolution. Relaxed selection pressures in selfers should be detectable at the molecular level through the analyses of the ratio of nonsynonymous and synonymous divergence, Dn/Ds, or the ratio of nonsynonymous and synonymous polymorphism, πns. On the other hand, selfing reveals recessive alleles to selection (homozygosity effect), which may counterbalance the reduction in Ne. Through population genetics models, this study investigates which process may prevail in natural populations and which conditions are necessary to detect evidence for relaxed selection signature at the molecular level in selfers. Under a wide range of plausible population and mutation parameters, relaxed selection against deleterious mutations should be detectable, but the differences between the two mating systems can be weak. At equilibrium, differences between outcrossers and selfers should be more pronounced using divergence measures (Dn/Ds ratio) than using polymorphism data (πns ratio). The difference in adaptive substitution rates between outcrossers and selfers is much less predictable because it critically depends on the dominance levels of new advantageous mutations, which are poorly known. Different ways of testing these predictions are suggested, and implications of these results for the evolution of self-fertilizing species are also discussed.


MATING systems are thought to have a deep impact on molecular evolution. They affect the effective population size, Ne, which plays a crucial role in molecular evolution because it controls the amount of polymorphism and the efficacy of selection (Charlesworth and Wright 2001). Inbreeding is expected to directly lower Ne by reducing the number of independent gametes sampled for reproduction. Because inbreeding also reduces the efficacy of recombination, Ne can be reduced further through hitchhiking effects: the elimination of deleterious alleles or the spread of advantageous mutations at linked loci increases the local level of genetic drift (Maynard-Smith and Haig 1974; Charlesworth et al. 1993). Finally, bottlenecks should be more severe in selfers than in outcrossers (Schoen and Brown 1991) because in selfers a single seed can found a new population (Baker 1955). The efficacy of selection depends on the product Nes, where s is the selection coefficient, such that selection should be less effective in selfers than in outcrossers. At the molecular level, selection intensity can be estimated due to the comparison of (presumably neutral) synonymous and (presumably selected) nonsynonymous polymorphisms or substitutions.

While the effect of selfing on polymorphism levels is well documented (Hamrick and Godt 1996; Nybom 2004; Glémin et al. 2006), its impact on selection efficacy has been assessed only in a few studies, without strong evidence. Bustamante et al. (2002) suggested that the self-fertilizing species Arabidopsis thaliana has mainly fixed slightly detrimental mutations, whereas Drosophila melanogaster has preferentially fixed beneficial mutations. However, Wright et al. (2002) did not find any difference in the rate of protein evolution in codon bias between A. thaliana and its outcrossing close relative A. lyrata. More recently, a survey of sequence polymorphism data in flowering plants revealed a weak increase in the πns ratio in selfers compared to outcrossers, suggesting relaxation of selective constraints in selfers (Glémin et al. 2006).

Despite that only few studies are available, several hypotheses have been suggested to explain the weak signal of reduced selection efficacy in selfers. First, the strength of selection obviously depends on the (poorly known) genomic distribution of selection coefficients and recombination rates. For instance, under the pure neutral model with just neutral and strongly deleterious mutations, virtually no effect is expected. Second, if selfers are of recent origin, such as suggested for A. thaliana (Charlesworth and Vekemans 2005; Bechsgaard et al. 2006), the change in the selective regime could be too recent to have significantly affected the Dn/Ds ratio. According to this hypothesis, a stronger signal should be detectable in intraspecific polymorphism analyses, which may explain the results of Glémin et al. (2006). Finally, theoretical reasons can also be invoked. Indeed, selfing also affects the efficacy of selection by increasing homozygosity and thus exposing recessive alleles to selection, as in the well-known purging process of deleterious mutations through selfing (Ohta and Cockerham 1974). This effect can counteract the effect of reducing Ne. Charlesworth (1992) studied the rate of evolution in selfers through the fixation probability of mutations with any dominance level. Considering the sole reduction in Ne due to nonindependent gamete sampling under inbreeding (that is, a 50% reduction under complete selfing), he showed that selection is less efficient under partial selfing for dominant mutations but more efficient for recessive ones. Quantitatively he found a minor effect on the fixation of deleterious mutation but a significant one on the fixation of recessive advantageous alleles. However, Charlesworth (1992) did not take into account the reduction in Ne due to bottlenecks or hitchhiking effects. It is thus not clear how much Ne must be reduced beyond the twofold level to overwhelm the homozygosity effect.

To address this issue previous population genetic models are expanded by focusing on the impact of partial selfing and dominance level on several measures commonly used in molecular evolution. Genes under purifying selection are considered first, assuming constant selection against deleterious mutations or a distribution of selection and dominance coefficients. Then, the case of advantageous mutations is presented. For each case, the additional reduction in Ne that is necessary to allow detection of relaxed selection in selfers at the molecular level is determined. The effect of population subdivision is also addressed. Different ways of testing these predictions are suggested, and the implications of these results for the evolution of self-fertilizing species are also discussed.

MODEL AND RESULTS

General formulation:

The aim of this article is to develop predictions for two commonly used measures of selection in molecular evolution, namely the Dn/Ds ratio (ω hereafter) and the πns ratio (f0 hereafter). Dn and Ds are the mean substitution rate, per generation per nonsynonymous and per synonymous site, respectively. πn and πs are nonsynonymous and synonymous nucleotide diversity (Tajima 1983). As in previous studies (Piganeau and Eyre-Walker 2003; Loewe et al. 2006), the model assumes an infinite-site mutation model, autosomal inheritance, and independently evolving sites (free recombination). Mutations occur at rate μ and synonymous sites are supposed to evolve neutrally so that

graphic file with name M1.gif (1a)

and

graphic file with name M2.gif (1b)

Nonsynonymous mutations can be neutral, deleterious, or advantageous, in proportion cn, cd, and ca, respectively (cn + cd + ca = 1). Deleterious mutations may contribute both to polymorphism and to divergence, while new advantageous mutations (distinct from back mutations from a deleterious allele) can spread rapidly to fixation, contributing to substitutions but not to polymorphism (because they are likely to be rare, ca ≪ 1, and fixation occurs rapidly). We thus have

graphic file with name M3.gif (2a)

and

graphic file with name M4.gif (2b)

where Hd is the mean equilibrium diversity at sites under purifying selection, and Dd and Da are the mean substitution rates at sites under purifying and positive selection, respectively. It follows that

graphic file with name M5.gif (3a)

and

graphic file with name M6.gif (3b)

where indexes d and a stand for the contribution of deleterious and advantageous mutations to f0 and ω. Because mating systems do not affect the neutral part of f0 and ω, cn is set to 0 for simplicity. Results remain unchanged since only relative values for different mating systems are considered. For simplicity, negative and positive selections are also analyzed separately.

First, consider single populations with effective population size Ne = αN/(1 + F), where N is the actual population size, and F = σ/(2 − σ) is the equilibrium Wright's fixation index with selfing rate σ. Compared with classical models, the parameter α is introduced as a simple way to summarize the reduction in effective population size due to hitchhiking and bottleneck effects. This parameter allows exploring Ne reduction in selfers without modeling explicitly bottleneck or hitchhiking effects, which would greatly increase the model complexity. To compare allogamous and autogamous populations, α = 1 in the former and α < 1 in the latter. The main aim of the model is then to seek Ne reduction that is necessary to overwhelm the homozygosity effect, that is, to derive the limit α-value for which Inline graphic or Inline graphic.

Selection against deleterious mutations:

Strong purifying selection:

Consider mutation from a wild-type allele, A, toward a deleterious allele, a. Let 1, 1 + hs, and 1 + s be the fitnesses of the three genotype AA, Aa, and aa, respectively (s < 0 for deleterious alleles, s > 0 for advantageous ones). Under strong selection (N|s| ≫ 1), deleterious alleles never reach fixation, so that Dn = 0. Mcvean and Charlesworth (1999) showed that the contribution to polymorphism of codominant (Inline graphic) deleterious mutations is well approximated by πn ≈ 2q, where q is their deterministic equilibrium frequency. This approximation still holds for any dominance level (see numerical results below), using q ≈ μ / (h + FhF)|s| (Caballero and Hill 1992; Bataillon and Kirkpatrick 2000) or Inline graphic for fully recessive alleles (h = 0) in a panmictic population (F = 0). This second approximation holds for 2Nμ ≪ 1 (Wright 1937; Nei 1968), which is mostly true at the nucleotide level. From these arguments we get

graphic file with name M11.gif (4)

and

graphic file with name M12.gif (5a)
graphic file with name M13.gif (5b)

According to (5) f0 is a decreasing function of F for Inline graphic and an increasing one for Inline graphic. Recessive alleles are more efficiently purged in self-fertilizing than in outcrossing populations and contribute less to polymorphism. Under strong selection, the f0 ratio does depend on N, while the frequency of deleterious alleles remains constant (except for fully recessive ones). In such conditions, the interpretation of the f0 ratio can be misleading because it does not directly estimate the strength of selection (which remains unchanged) but does estimate the fraction of deleterious mutations contributing to polymorphism. Despite this limitation, we can easily find α limit:

graphic file with name M16.gif (6a)
graphic file with name M17.gif (6b)

For complete selfing, (6a) reduces to 2h and (6b) to Inline graphic, which is very low for recessive and nearly recessive alleles.

Arbitrary purifying selection:

If selection is weak, deleterious alleles can become fixed and then mutate back to the optimal allele. This will contribute both to polymorphism and to divergence. As far as nucleotide sequences are concerned, we have to consider four states at each site. To simplify, only two equivalence classes of selected sites are considered, namely the deleterious and optimal classes, each containing one to three variants (for instance, see Figure 2 in Loewe et al. 2006). Within each class, variants are thus neutral. This can be taken into account as in Equations 5a and 5b in Loewe et al. (2006). Provided that all mutation rates from each nucleotide are equal (which is implicitly assumed in Loewe et al. 2006), the following results do not depend on the number of variants per class because relative results between selfers and outcrossers (not absolute ones) are concerned (numerical results not shown). For clarity only the two-allele case with symmetrical mutation rates is discussed below. Mutation biases induce more complex patterns and are not treated here (for instance, see McVean and Charlesworth 1999).

Figure 2.—

Figure 2.—

f0 (A) and ω (B) for weak deleterious mutations as a function of Ns. Deleterious alleles are recessive (h = 0, thick line), codominant (Inline graphic, thin line), or dominant (h = 1, dashed line).

First, we need to compute the proportion of sites fixed either for the deleterious or for the optimal allele. Following Bulmer (1991) or Piganeau and Eyre-Walker (2003), the proportion of sites fixed for the deleterious allele is

graphic file with name M19.gif (7)

where u(s, h) is the fixation probability of an allele with initial frequency 1/2N, selective advantage s > 0, and dominance level h. It is worth noting that both selection coefficients and dominance levels are reversed between the two allelic states. If the deleterious allele (−s) is recessive (h = 0), the advantageous one (s) is dominant (h = 1). According to Kimura (1962),

graphic file with name M20.gif (8)

where Inline graphic. Mδx and Vδx are the infinitesimal mean and variance of the change in allele frequency of the diffusion process. In this model with partial selfing and an arbitrary level of dominance these terms are

graphic file with name M22.gif (9a)
graphic file with name M23.gif (9b)

(see, for instance, Charlesworth 1992). We thus have

graphic file with name M24.gif (10)

πn is then given by

graphic file with name M25.gif (11)

and

graphic file with name M26.gif (12)

where Inline graphic is the expected contribution to heterozygosity of an allele with initial frequency 1/2N, selective advantage s > 0, and dominance level h:

graphic file with name M28.gif (13)

(for instance, see Equation A6 in McVean and Charlesworth 1999). Similarly,

graphic file with name M29.gif (14)

and

graphic file with name M30.gif (15)

Expressions (12) and (15) have been computed using the NIntegrate function of the Mathematica software (Wolfram 1996).

As shown in Figure 1, f0 and ω are decreasing functions of F for Inline graphic and increasing ones for Inline graphic. When Inline graphic, Inline graphic; f0 and ω are thus independent of F (see also Charlesworth 1992). Unless mating system also affects α, we do not expect any consequences of mating systems on f0 and ω for codominant mutations. Figure 1 also shows a surprising result, which is emphasized in Figure 2. For recessive deleterious alleles (Inline graphic), f0 and ω reach a maximum, >1, under (very) weak selection (Ns < 1), while they monotonically decrease with Ns when deleterious alleles are codominant or dominant. This effect is stronger for f0 than for ω. This effect also occurs with partial selfing but the maximum is reached for very low Ns values. Under complete selfing, there is no maximum. For very weak selection, about half of the sites are fixed for the deleterious allele and the other half for the advantageous one. If the deleterious allele is recessive, its contribution to polymorphism or divergence is almost the same as the contribution of a neutral allele, while the reverse dominant advantageous mutation contributes more to polymorphism and divergence than neutral alleles. For stronger selection, most sites are fixed for the optimal allele and deleterious mutations contribute very little to both polymorphism and divergence. This explains the maximum in f0 and ω reached for low Ns values. Similar patterns occur with mutation bias, without dominance (see Mcvean and Charlesworth 1999).

Figure 1.—

Figure 1.—

f0 (A) and ω (B) for deleterious mutations as a function of F for different dominance levels. Ns = 1 and α = 1 is constant. Thick lines correspond to Inline graphic. Other lines from bottom to top: h = 0, h = 0.1, h = 0.3, h = 0.7, h = 0.9, h = 1.

αlimit for weak selection:

General analytical expressions for f0 and ω are tremendous or not available so that only numerical results are presented. Table 1 gives numerical αlimit-values obtained when allogamous populations (F = 0) and fully autogamous ones (F = 1) are compared. As expected, αlimit is lower for recessive mutations because they are easily purged by selfing so that strong reductions in Ne are necessary to overwhelm this effect. Under strong selection, numerical results are close to analytical deterministic predictions. Under weaker selection, αlimit is higher, which means that purging of recessive deleterious alleles by selfing is less efficient in small than in large populations, as already pointed out (Glémin 2003). However, for very weak selection (Ns < 1), αlimit can be low, even negative for highly recessive mutations. This result comes from the peculiar pattern of f0 described above. If we do not assume back mutation, αlimit monotonically increases as Ns decreases. This result implies that it could be difficult to detect relaxed selection through f0 measure in selfers because weakly deleterious mutations contribute much to polymorphism. However, such mutations are supposed to be close to the codominance level for which αlimit is high and close to one (see below).

TABLE 1.

αlimit for f0 for various combinations of h and Ns

Ns
Strong selection
h 0.5 1 2 5 10 50 100
0 <0 <0 0.346 0.332 0.249 0.113 0.080
0.252 0.113 0.080
0.05 <0 <0 0.425 0.389 0.305 0.178 0.149
0.1
0.1 <0 0.210 0.496 0.446 0.366 0.254 0.231
0.2
0.2 <0 0.505 0.627 0.568 0.503 0.427 0.414
0.4
0.3 0.394 0.696 0.752 0.701 0.657 0.613 0.607
0.6
0.4 0.754 0.855 0.875 0.845 0.824 0.805 0.803
0.8
0.45 0.884 0.929 0.937 0.921 0.911 0.902 0.901
0.9

αlimit was computed by numerically solving for equation Inline graphic. Under strong selection, analytical approximations (Equations 6a and 6b) are given in italics.

Table 2 shows αlimit-values for ω. The pattern is similar to that of f0 but absolute values are much higher and close to one, even for recessive alleles. Without additional reduction in Ne (α = 1), recessive deleterious mutations are sheltered in outcrossing populations and reach much higher frequency than in self-fertilizing ones where they are unmasked in homozygotes. Higher f0 values are thus expected under panmixia so that αlimit needs to be rather small to compensate the homozygosity effect. However, such deleterious mutations are eventually selected against with similar intensity in outcrossing as in self-fertilizing populations, and thus rarely reach fixation in both populations. ω-values are thus similar in both populations so that αlimit-values close to one are sufficient to compensate for homozygosity. In other words, the dominance level of deleterious mutations, as well as the homozygosity effect, influences polymorphism more strongly than divergence.

TABLE 2.

αlimit for ω under purifying selection for various combinations of h and Ns

Ns
h 0.5 1 2 5 10
0 0.055 0.684 0.833 0.930 0.965
0.05 0.314 0.719 0.848 0.935 0.968
0.1 0.441 0.753 0.864 0.941 0.971
0.2 0.625 0.818 0.897 0.954 0.977
0.3 0.768 0.881 0.930 0.968 0.984
0.4 0.890 0.941 0.965 0.983 0.991
0.45 0.947 0.971 0.982 0.991 0.995

αlimit was computed by numerically solving for equation Inline graphic.

Variation of α with selfing rate:

It is not easy to predict how α should vary with the selfing rate in general. For instance, no model predicts how the frequency of bottlenecks should increase with selfing. However, under the background selection (BS) hypothesis we can predict the relationship between α and F. Here, this model is used as a quantitative example of how α may decrease with F. According to Charlesworth et al. (1993), the reduction in Ne due to BS is

graphic file with name M40.gif (16)

where U is the genomic deleterious mutation rate, R is the genomic recombination rate, sd is the mean selection coefficient against strongly deleterious mutations, and hd is their dominance coefficient. We thus have

graphic file with name M41.gif (17)

If the recombination rate is high, only highly self-fertilizing species suffer from a reduction in Ne due to BS. If the recombination rate is lower, α decreases more linearly with F. Examples are given in Figure 3. Figures 4 and 5 show how f0 and ω vary as a function of F under the BS model.

Figure 3.—

Figure 3.—

Variation of α with F under the BS model (Equation 17) with U = 0.2, hd = 0.1, and sd = 0.05 and R = 5 (thin solid line), R = 1 (dotted line), or R = 0.5 (dashed line). Thick solid line: constant α.

Figure 4.—

Figure 4.—

f0 for deleterious mutations as a function of F for constant α = 1 (thick solid line) or α decreasing with F under the BS model (same values as in Figure 3). (A) h = 0 and Ns = 2; (B) h = 0.3 and Ns = 2; (C) h = 0 and Ns = 10; (D) h = 0.3 and Ns = 10.

Figure 5.—

Figure 5.—

ω for deleterious mutations as a function of F for constant α (thick solid line) or α decreasing with F under the BS model (same values as in Figure 3). (A) h = 0 and Ns = 2; (B) h = 0.3 and Ns = 2; (C) h = 0 and Ns = 10; (D) h = 0.3 and Ns = 10.

For partially recessive alleles (h = 0.3), provided that BS is sufficiently strong (U rather high and R not too low) f0 mainly increases with F. For fully recessive alleles, f0 reaches a minimum at intermediate selfing rates. The decrease in ω for intermediate selfing rates is less pronounced, even for fully recessive alleles (Figure 5). In every case, only strong selfers experience significant relaxation of selection (a strong increase in both f0 and ω).

Distribution of mutational effects:

Previous analyses considered only the fate of alleles with fixed deleterious effect and dominance level while it is much more realistic to assume variation in selective effects among mutations. Several studies showed that selection coefficients against deleterious mutations are well modeled by a gamma distribution

graphic file with name M42.gif (18)

(Keightley 1994; Piganeau and Eyre-Walker 2003; Loewe et al. 2006), where a is the shape parameter and b the scale parameter, and Γ is the gamma function (Abramowitz and Stegun 1970). The mean and the variance of this distribution are ab and ab2, respectively. Variation in dominance levels is less documented but both theoretical (Bourguet 1999) and experimental arguments (Phadnis and Fry 2005) suggest that strongly deleterious mutations (lethals, semilethals) are recessive while weakly detrimental ones are nearly codominant. One can thus assume that h and s are inversely related:

graphic file with name M43.gif (19)

(for a similar treatment see Higgins and Lynch 2001). This expression is such that Inline graphic for s = 0, while h = 0.02 for lethals, roughly reflecting values estimated in Drosophila (Mukai et al. 1972). Equations 12 and 15 become, respectively,

graphic file with name M45.gif (20)

and

graphic file with name M46.gif (21)

In this treatment, deleterious mutations and back advantageous ones follow the same gamma distribution. Integration spans from 0 (neutral alleles) to 1 (lethals), so strictly speaking, the distribution is a truncated gamma but the probability mass between 1 and ∞ is very low.

Table 3 gives some examples of the variation in ω with F assuming either that α is constant or that α decreases as F increases, as predicted under the BS hypothesis. Under the above assumptions, weakly deleterious mutations, which mostly contribute to πn and Dn, are nearly codominant. With constant α, f0 and ω are thus nearly invariant with F. With BS, f0 and ω increase with F, revealing less efficient selection at the molecular level in selfers than in outcrossers. This is true even under strong recombination and therefore moderate BS effect. As in the previous case, only strong selfers show clear signals of relaxed selection. Results are very similar for f0 but numerical integration of (20) is somewhat problematic and results may be imprecise and are not given here.

TABLE 3.

Comparison of ω for different levels of selfing (F) assuming a distribution of deleterious mutation effects (s and h)

Exponential distribution of s (mean Ns = 2)
Skewed gamma distribution of s (mean Ns = 10)
F Constant α BS (high rec) BS (medium rec) BS (low rec) Constant α BS (high rec) BS (medium rec) BS (low rec)
0 0.427 0.427 0.427 0.427 0.665 0.665 0.665 0.665
0.2 0.427 0.429 0.439 0.449 0.664 0.665 0.667 0.669
0.5 0.426 0.437 0.474 0.504 0.664 0.667 0.675 0.682
0.7 0.426 0.451 0.527 0.578 0.664 0.670 0.687 0.698
1 0.425 0.921 0.900 0.867 0.664 0.802 0.791 0.776

α is independent of F (first column) or decreases under the BS model under high, medium, or low recombination (rec) rates (same values as in Figure 3). N = 1000; exponential distribution with mean Ns = 2 (a = 1 and b = 0.002) or strongly skewed gamma distribution with mean Ns = 10 (a = 0.1 and b = 0.1) is shown.

Positive selection:

αlimit for weak and strong positive selection:

Consider now rare advantageous mutations (s > 0). Such mutations either rapidly spread to fixation or become initially lost. They contribute to divergence, increasing Dn, but not to polymorphism. The ω-ratio is >1. The following results are mainly similar to those of Charlesworth (1992) with slight modifications. Without back mutation, we simply have

graphic file with name M47.gif (22)

As for deleterious mutations, selection for recessive mutations Inline graphic is more efficient in selfers than in outcrossers while the reverse is true for dominant ones (Inline graphic) (see Charlesworth 1992).

Under weak selection (Ns ≪ 1), according to Equation 9 in Charlesworth (1992),

graphic file with name M50.gif (23)

from which we get

graphic file with name M51.gif (24)

This reduces to Inline graphic for h = 0 and F = 1.

Under strong selection (Ns ≫ 1) we can get approximations for h = 0 and either F = 0 or F = 1. From Equation 15 in Kimura (1962) when F = 0, we have

graphic file with name M53.gif (25)

When F = 1 Equation 22 becomes

graphic file with name M54.gif (26)

We then obtain

graphic file with name M55.gif (27)

αlimit is lower than for weak selection. For example, for Ns = 10 αlimit ≈ 0.25. Outcrossers hardly fix fully recessive advantageous mutations while self-fertilizing species do, even with lower Ne (see also Charlesworth 1992).

Dominance of advantageous mutations:

While both theory and experimental data support the recessivity of deleterious mutations over wild-type alleles, we know little about the distribution of dominance coefficients of newly arisen advantageous mutations. A review of pesticide resistance alleles showed that they are often dominant or codominant (Bourguet and Raymond 1997) but such alleles have been screened after selection, which is expected to skew the distribution of mutations toward dominant ones, the so-called “Haldane's sieve.” Charlesworth (1992) used data in selfers where Haldane's sieve is strongly reduced, such that the dominance spectrum of fixed mutations is very close to the one of newly arisen mutations before selection. He found a bias toward recessive mutations but he pointed out that dominant mutations should be more common among favorable mutations than deleterious ones.

To explore the effect of the dominance spectrum of mutations, one can assume that half of the mutations are recessive and the other half are dominant. Table 4 shows the evolution of ω with F for this symmetrical distribution. For constant α, ω is higher in outcrossers than in selfers if selection is weak. The reverse is true under strong selection. However, quantitatively the values are very similar. Empirically the limit is ∼2Ns2 ≈ 1 but we did not find any demonstration of this result. Under weak selection, taking the average of (22), with h = 0 and h = 1 we get

graphic file with name M56.gif (28)

which is independent of F so αlimit = 1. Under strong selection, we can show that

graphic file with name M57.gif (29)

(see appendix), which is always less than but rather close to 1. If Ne is reduced beyond the 50% threshold, we thus expect to detect a relaxed selection of advantageous mutations in selfers if dominance levels are distributed equally. However, if advantageous mutations are mainly recessive, adaptive evolution could go faster in self-fertilizing than in outcrossing species.

TABLE 4.

ω for advantageous mutations as a function of F and selection strength (Ns)

F Ns = 1 Ns = 5 Ns = 10 Ns = 30
0 2.38 10.76 20.17 49.12
0.2 2.34 10.20 19.44 50.03
0.5 2.31 9.86 19.12 51.32
0.7 2.30 9.79 19.06 51.69
1 2.30 9.75 19.03 51.84

Half of the mutations are supposed to be recessive and the other half dominant.

Subdivided populations:

Most natural populations are subdivided, and geographical structure may affect population evolution. When comparing outcrossing and self-fertilizing species, population subdivision can be crucial because selfers are usually more spatially structured than outcrossers (Hamrick and Godt 1996). How does population subdivision affect the above theoretical results? As an example, consider the case of the island model with no extinction and zygotic migration. According to Equation 44 from Roze and Rousset (2003), the G function becomes

graphic file with name M58.gif (30)

where k is the number of demes and m the migration rate. Here, N stands for the local population size. First, it is worth noting that for complete selfing, Inline graphic, which is independent on m and h. In panmictic populations, G(x) depends both on m and on h. As shown by Roze and Rousset (2003), subdivision increases the probability of fixation of recessive advantageous mutations but decreases it for dominant ones (the reverse is expected for deleterious mutations).

Using (30) in (12), (15) and (22) allow us to explore the effect of subdivision on f0 and ω. Figure 6 shows that for recessive deleterious mutations, f0 and ω increase with migration rate, except under complete selfing when they are independent on m as predicted above. For advantageous recessive mutations, migration reduces ω. More importantly, Figure 6 shows that subdivision lessens the differences between self-fertilizing and outcrossing species. Subdivision has a strong effect in outcrossers by increasing local homozygosity, but a limited one in selfers, and even a null effect under complete selfing (see also Roze and Rousset 2003). Consequently αlimit should be higher in subdivided populations than in single ones. As an example, we give the case of weakly advantageous mutations:

graphic file with name M60.gif (31)

(see appendix). As Nm tends toward 0, αlimit tends toward 1, while it tends toward Inline graphic, the single-population value, when Nm tends toward infinity. If both self-fertilizing and outcrossing species are similarly subdivided (same Nm) it would be easier to detect relaxed selection than in single large populations (because αlimit is higher and close to 1 in the former case). If self-fertilizing species are more subdivided than outcrossing ones, intermediate results are expected (αlimit taking intermediate values between the subdivided and the single-population cases). Anyway, the conclusions based on single population models appear robust.

Figure 6.—

Figure 6.—

f0 (A) and ω (B) for deleterious mutations and ω for advantageous mutations (C) as a function of the migration rate, m, under the k-island model. N = 100, k = 10, h = 0, s = 0.002. From bottom to top (A and B) or from top to bottom (C): F = 0, F = 0.2, F = 0.5, F = 0.7, and F = 1.

DISCUSSION

Detecting reduced selection efficacy in selfers at the molecular level:

It has been claimed that reduced Ne in selfers should leave a signature of relaxed selection against deleterious mutations at the molecular level (Charlesworth and Wright 2001). This intuitive expectation comes from the idea that slightly deleterious mutations, which contribute more to polymorphism and divergence than strongly deleterious ones, should be nearly codominant and weakly affected by increased homozygosity in selfers (see also Charlesworth 1992). Here, we give a theoretical and quantitative basis to this hypothesis.

For realistic mutation parameters, f0 and ω very slightly decrease as F increases for constant α. Thus only low Ne reduction is necessary to make f0 and ω increase with F, as expected under the commonly accepted hypothesis. This study also shows that a stronger effect of mating system should be detectable using divergence rather than polymorphism statistics. The dominance level of deleterious mutations has a strong impact on polymorphism, somewhat compensating for genetic drift, but a weaker one on substitution rates, which are mainly governed by Ne. However, this is true at steady state. If mating systems evolve quickly and if self-fertilizing species are of recent origin (for example, A. thaliana; Charlesworth and Vekemans 2005; Bechsgaard et al. 2006), differences in ω between selfers and outcrossers should be weak because measures of ω along phylogenies correspond to averages over the mating system evolutionary path. Comparing f0 and ω to test the previous predictions should be done on species or groups of species for which evidence for the relative stability of mating systems exists.

Both polymorphism and divergence data can be summarized by the so-called McDonald–Kreitman table (Mcdonald and Kreitman 1991) or by the neutrality index (NI) (Rand and Kann 1996). For instance, Bustamante et al. (2002) found an excess of nonsynonymous polymorphism compared with divergence, which can also be summarized by a NI ≫ 1. However, assuming only deleterious mutations we expect NI closer to 1 in selfers (with low Ne) than in outcrossers (with large Ne). But it is worth noting that this prediction does not hold if we assume a fraction of purely neutral mutations. Using (3a) and (3b) without adaptive mutations (ca = 0), an equivalent to the NI index is given by Inline graphic. As selection against deleterious alleles increases (outcrossers with large Ne) this ratio tends toward 1 because fd and ωd tend toward 0, while it is >1 with weak or moderate selection (selfers with small Ne). If selection against deleterious alleles is relaxed in selfers, we thus expect to detect higher NI in selfers than in outcrossers.

Finally, a way to disentangle the effect of reduced Ne from that of increased homozygosity would be to compare nuclear and cytoplasmic genes. The second category is influenced only by hitchhiking and bottleneck effects affecting Ne. A related result has been reported in Daphnia where average mitochondrial ω is significantly higher in asexual lineages than in sexual ones (Paland and Lynch 2006). However, such a signature of relaxed selection on an organelle genome due to Ne reduction through a hitchhiking effect has not been reported yet in self-fertilizing species.

Adaptive evolution in selfers:

The difference in substitution rates of advantageous mutations between outcrossers and selfers critically depends on the dominance coefficients, which are poorly known. However, assuming that advantageous new mutations are either recessive or dominant with equal probability, no or low reduction in Ne is necessary to detect a signature of relaxed selection in selfers. Because adaptive mutations are supposed to be much less frequent than deleterious mutations, the average pattern at the gene level can be misleading. For example, the combination of strong selection against deleterious mutations with few adaptive substitutions in outcrossing species can lead to ω-values higher than those in self-fertilizing species where selection is weak both against deleterious mutations and in favor of advantageous ones. Depending on ω being higher or <1, opposite conclusions could be reached.

To test previous predictions, sites evolving under positive and purifying selection should be analyzed separately. Phylogenetic methods taking account of heterogeneous selection pressure among sites (Yang et al. 2000; Guindon et al. 2004) can help partition sites into different categories. Removing sites evolving under positive selection can reinforce the detection of relaxed selection against deleterious mutations in selfers. On the contrary, to test whether adaptive evolution is more or less frequent in selfers than in outcrossers, only sites detected as evolving under positive selection should be used. However, because such sites are supposed to be rare, specific genes like resistance genes should be used.

How small is α in natural populations?

The main results of this study critically depend on α-values. It is difficult to predict these values from biological data, as already mentioned. Under the BS hypothesis, however, predictions can be made provided genomic deleterious mutation parameters are known. Assuming complete selfing (17) becomes

graphic file with name M63.gif (32)

Using data available in self-fertilizing species, αBS spans between 0.48 and 0.98 but most values are close to one (Table 5). At least in those species, BS seems not sufficient to cause a strong decrease in Ne. However, strong reductions in Ne have been documented in the self-fertilizing nematodes Caenorhabditis elegans and C. briggsae, suggesting recurrent bottlenecks and/or frequent selective sweeps (Graustein et al. 2002). Irrespective of the underlying causes, a gross estimation of α can be inferred through πs comparison between self-fertilizing and outcrossing related species,

graphic file with name M64.gif (33)

with k = 2 for nuclear genes, k = 1 for cytoplasmic genes in hermaphrodite species, and k = Inline graphic for cytoplasmic genes in gonochoric species. Applying (33) to Graustein et al. (2002) data gives α-values between 0.04 and 0.28 for nuclear genes and ∼0.4 for the mitochondrial gene they used. On the contrary, in A. thaliana, α is close to or even >1 (Wright et al. 2003). This could explain why no effect of selfing was detected on the Dn/Ds ratio in this species (Wright et al. 2002).

TABLE 5.

Estimation of α in self-fertilizing species under the background selection model using Equation 32

Species U s αBS References
Amsinckia gloriosa 0.128 0.078 0.440 Schoen (2005)
0.416 0.026 0.0003
Arabidopsis thaliana 0.05 0.23 0.897 Schultz et al. (1999)
Caenorhabditis briggsae 0.074 0.051 0.484 Baer et al. (2005)
0.025 0.099 0.881
C. elegans 0.0083 0.126 0.968 Baer et al. (2005)
0.0066 0.182 0.982
Oscheius myriophila 0.0055 0.219 0.988 Baer et al. (2005)
Triticum turgidum ssp. Durum 0.04 0.2 0.905 Bataillon (2000)

Evolutionary rate and long-term evolution in selfers:

On short timescales, selfers can efficiently purge strongly deleterious mutations causing inbreeding depression (Ohta and Cockerham 1974; Lande and Schemske 1985), preventing reverse transition to allogamy (Lande and Schemske 1985). On longer timescales, reduction in Ne may increase drift load and preclude adaptation, suggesting that self-fertilization could be an evolutionary dead end (Takebayashi and Morrell 2001). The results presented here confirm that reduction in Ne in natural populations of self-fertilizing species should be large enough to allow higher rates of fixation of weakly deleterious mutations than in outcrossing species, even for fully recessive mutations. Phylogenetic analyses offer tools to quantify drift load in self-fertilizing species and investigate further the evolutionary dead-end hypothesis. However, the claim of reduced adaptation in selfers is less clear from a theoretical point of view and crucially depends on dominance levels of new advantageous mutations. A few adaptive events can prevent an extinction vortex (Whitlock 2000) and could help maintain selfing lineages over a long timescale as observed in the Medicago genus (Bena et al. 1998).

Acknowledgments

I thank Nicolas Galtier and two anonymous reviewers for their comments and suggestions to improve the manuscript. This is publication ISEM 2007-103 of the Institut des Sciences de l'Evolution de Montpellier.

APPENDIX: COMPUTATION OF αLIMIT

The details of the approximation for αlimit that are not fully derived in the main text are given here.

Case i—ω, strong positive selection with recessive and dominant mutations:

Assume that half of the advantageous mutations are recessive and half are dominant. In fully self-fertilizing populations (F = 1), ω is independent of dominance levels. For strong selection, according to (26)

graphic file with name M66.gif (A1)

In outcrossing populations (F = 0 and α = 1), for recessive mutations ω is given by (26) (and see Kimura 1962). For dominant ones, (22) becomes

graphic file with name M67.gif (A2)

where Erfi is the imaginary error function (Abramowitz and Stegun 1970).

Using the approximation Inline graphic for large x and assuming N ≫ 1, we get Inline graphic. The average ω over half of the mutations being dominant and the other half being recessive is thus given by

graphic file with name M70.gif (A3)

Solving for ωself = ωout in α gives

graphic file with name M71.gif (A4)

Case ii—ω, weak positive selection in subdivided populations:

For new advantageous mutations,

graphic file with name M72.gif (A5)

with

graphic file with name M73.gif

where

graphic file with name M74.gif

in the k-island model (Roze and Rousset 2003). For α = 1 and F = 0,

graphic file with name M75.gif (A6)

where Erf is the error function (Abramowitz and Stegun 1970).

For any α and F = 1,

graphic file with name M76.gif (A7)

Assuming weak selection Ns < 1, Taylor expansions of (A6) and (A7) in Ns ∼ 0, and further simplification assuming N ≫ 1 give

graphic file with name M77.gif (A8)
graphic file with name M78.gif (A9)

which leads to

graphic file with name M79.gif (A10)

References

  1. Abramowitz, M., and I. A. Stegun, 1970. Handbook of Mathematical Functions. Dover, New York.
  2. Baer, C. F., F. Shaw, C. Steding, M. Baumgartner, A. Hawkins et al., 2005. Comparative evolutionary genetics of spontaneous mutations affecting fitness in rhabditid nematodes. Proc. Natl. Acad. Sci. USA 102: 5785–5790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Baker, H. G., 1955. Self-compatibility and establishment after “long-distance” dispersal. Evolution 9: 347–348. [Google Scholar]
  4. Bataillon, T., 2000. Estimation of spontaneous genome-wide mutation rate parameters: Wither beneficial mutations? Heredity 84: 497–501. [DOI] [PubMed] [Google Scholar]
  5. Bataillon, T., and M. Kirkpatrick, 2000. Inbreeding depression due to mildly deleterious mutations in finite populations: size does matter. Genet. Res. 75: 75–81. [DOI] [PubMed] [Google Scholar]
  6. Bechsgaard, J. S., V. Castric, D. Charlesworth, X. Vekemans and M. H. Schierup, 2006. The transition to self-compatibility in Arabidopsis thaliana and evolution within S-haplotypes over 10 Myr. Mol. Biol. Evol. 23: 1741–1750. [DOI] [PubMed] [Google Scholar]
  7. Bena, G., B. Lejeune, J. M. Prosperi and I. Olivieri, 1998. Molecular phylogenetic approach for studying life-history evolution: the ambiguous example of the genus Medicago L. Proc. Biol. Sci. 265: 1141–1151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bourguet, D., 1999. The evolution of dominance. Heredity 83(1): 1–4. [DOI] [PubMed] [Google Scholar]
  9. Bourguet, D., and M. Raymond, 1997. The molecular basis of dominance relationships: the case of some recent adaptive genes. J. Evol. Biol. 11: 103–122. [Google Scholar]
  10. Bulmer, M., 1991. The selection-mutation-drift theory of synonymous codon usage. Genetics 129: 897–907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bustamante, C. D., R. Nielsen, S. A. Sawyer, K. M. Olsen, M. D. Purugganan et al., 2002. The cost of inbreeding in Arabidopsis. Nature 416: 531–534. [DOI] [PubMed] [Google Scholar]
  12. Caballero, A., and W. G. Hill, 1992. Effects of partial inbreeding on fixation rates and variation of mutant genes. Genetics 131: 493–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Charlesworth, B., 1992. Evolutionary rates in partially self-fertilizing species. Am. Nat. 140: 126–148. [DOI] [PubMed] [Google Scholar]
  14. Charlesworth, B., M. T. Morgan and D. Charlesworth, 1993. The effect of deleterious mutations on neutral molecular variation. Genetics 134: 1289–1303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Charlesworth, D., and X. Vekemans, 2005. How and when did Arabidopsis thaliana become highly self-fertilizing? BioEssays 27: 472–476. [DOI] [PubMed] [Google Scholar]
  16. Charlesworth, D., and S. I. Wright, 2001. Breeding systems and genome evolution. Curr. Opin. Genet. Dev. 11: 685–690. [DOI] [PubMed] [Google Scholar]
  17. Glémin, S., 2003. How are deleterious mutations purged? Drift versus nonrandom mating. Evolution 57: 2678–2687. [DOI] [PubMed] [Google Scholar]
  18. Glémin, S., E. Bazin and D. Charlesworth, 2006. Impact of mating systems on patterns of sequence polymorphism in flowering plants. Proc. Biol. Sci. 273: 3011–3019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Graustein, A., J. M. Gaspar, J. M. Walters and M. F. Palopoli, 2002. Levels of DNA polymorphism vary with mating system in the nematode genus Caenorhabditis. Genetics 161: 99–107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Guindon, S., A. G. Rodrigo, K. A. Dyer and J. P. Huelsenbeck, 2004. Modeling the site-specific variation of selection patterns along lineages. Proc. Natl. Acad. Sci. USA 101: 12957–12962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hamrick, J. L., and M. J. W. Godt, 1996. Effects of life history traits on genetic diversity in plants species. Philos. Trans. R. Soc. Lond. B 351: 1291–1298. [Google Scholar]
  22. Higgins, K., and M. Lynch, 2001. Metapopulation extinction caused by mutation accumulation. Proc. Natl. Acad. Sci. USA 98: 2928–2933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Keightley, P. D., 1994. The distribution of mutation effects on viability in Drosophila melanogaster. Genetics 138: 1315–1322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kimura, M., 1962. On the probability of fixation of mutant genes in a population. Genetics 47: 713–719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Lande, R., and D. W. Schemske, 1985. The evolution of self-fertilization and inbreeding depression in plants. I. Genetic models. Evolution 39: 24–40. [DOI] [PubMed] [Google Scholar]
  26. Loewe, L., B. Charlesworth, C. Bartolome and V. Noel, 2006. Estimating selection on nonsynonymous mutations. Genetics 172: 1079–1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Maynard-Smith, J., and D. Haig, 1974. The hitch-hiking effect of a favourable gene. Genet. Res. 23: 22–35. [PubMed] [Google Scholar]
  28. McDonald, J. H., and M. Kreitman, 1991. Adaptative protein evolution at the ADH locus in Drosophila. Nature 351: 652–654. [DOI] [PubMed] [Google Scholar]
  29. McVean, G. A., and B. Charlesworth, 1999. A population genetic model for the evolution of synonymous codon usage: patterns and predictions. Genet. Res. 74: 145–158. [Google Scholar]
  30. Mukai, T., S. I. Chigusa, L. E. Mettler and J. F. Crow, 1972. Mutation rate and dominance of genes affecting viability in Drosophila melanogaster. Genetics 72: 335–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Nei, M., 1968. The frequency distribution of lethal chromosomes in finite populations. Proc. Natl. Acad. Sci. USA 60: 517–524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Nybom, H., 2004. Comparison of different nuclear DNA markers for estimating intraspecific genetic diversity in plants. Mol. Ecol. 13: 1143–1155. [DOI] [PubMed] [Google Scholar]
  33. Ohta, T., and C. C. Cockerham, 1974. Detrimental genes with partial selfing and effects on a neutral locus. Genet. Res. 23: 191–200. [DOI] [PubMed] [Google Scholar]
  34. Paland, S., and M. Lynch, 2006. Transitions to asexuality result in excess amino acid substitutions. Science 311: 990–992. [DOI] [PubMed] [Google Scholar]
  35. Phadnis, N., and J. D. Fry, 2005. Widespread correlations between dominance and homozygous effects of mutations: implications for theories of dominance. Genetics 171: 385–392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Piganeau, G., and A. Eyre-Walker, 2003. Estimating the distribution of fitness effects from DNA sequence data: implications for the molecular clock. Proc. Natl. Acad. Sci. USA 100: 10335–10340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Rand, D. M., and L. M. Kann, 1996. Excess amino acid polymorphism in mitochondrial DNA: contrasts among genes from Drosophila, mice, and humans. Mol. Biol. Evol. 13: 735–748. [DOI] [PubMed] [Google Scholar]
  38. Roze, D., and F. Rousset, 2003. Selection and drift in subdivided populations: a straightforward method for deriving diffusion approximations and applications involving dominance, selfing and local extinctions. Genetics 165: 2153–2166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Schoen, D. J., 2005. Deleterious mutation in related species of the plant genus Amsinckia with contrasting mating systems. Evolution 59: 2370–2377. [PubMed] [Google Scholar]
  40. Schoen, D. J., and A. H. D. Brown, 1991. Intraspecific variation in population gene diversity and effective population size correlates with the mating system in plants. Proc. Natl. Acad. Sci USA 88: 4494–4497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Schultz, S. T., M. Lynch and J. H. Willis, 1999. Spontaneous deleterious mutation in Arabidopsis thaliana. Proc. Natl. Acad. Sci. USA 96: 11393–11398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Tajima, F., 1983. Evolutionary relationship of DNA sequences in finite populations. Genetics 105: 437–460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Takebayashi, N., and P. L. Morrell, 2001. Is self-fertilization an evolutionary dead end? Revisiting an old hypothesis with genetic theories and a macroevolutionary approach. Am. J. Bot. 88: 1143–1150. [PubMed] [Google Scholar]
  44. Whitlock, M. C., 2000. Fixation of new alleles and the extinction of small populations: drift load, beneficial alleles, and sexual selection. Evolution 54: 1855–1861. [DOI] [PubMed] [Google Scholar]
  45. Wolfram, S., 1996. The Mathematica Book. Cambridge University Press, Cambridge.
  46. Wright, S., 1937. The distribution of gene frequencies in populations. Proc. Natl. Acad. Sci. USA 23: 307–320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Wright, S. I., B. Lauga and D. Charlesworth, 2002. Rates and patterns of molecular evolution in inbred and outbred Arabidopsis. Mol. Biol. Evol. 19: 1407–1420. [DOI] [PubMed] [Google Scholar]
  48. Wright, S. I., B. Lauga and D. Charlesworth, 2003. Subdivision and haplotype structure in natural populations of Arabidopsis lyrata. Mol. Ecol. 12: 1247–1263. [DOI] [PubMed] [Google Scholar]
  49. Yang, Z., R. Nielsen, N. Goldman and A. M. Pedersen, 2000. Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155: 431–449. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES