Skip to main content
Genetics logoLink to Genetics
. 2016 Apr 12;203(2):937–957. doi: 10.1534/genetics.116.187955

Background Selection in Partially Selfing Populations

Denis Roze 1,2,1
PMCID: PMC4896204  PMID: 27075726

Abstract

Self-fertilizing species often present lower levels of neutral polymorphism than their outcrossing relatives. Indeed, selfing automatically increases the rate of coalescence per generation, but also enhances the effects of background selection and genetic hitchhiking by reducing the efficiency of recombination. Approximations for the effect of background selection in partially selfing populations have been derived previously, assuming tight linkage between deleterious alleles and neutral loci. However, loosely linked deleterious mutations may have important effects on neutral diversity in highly selfing populations. In this article, I use a general method based on multilocus population genetics theory to express the effect of a deleterious allele on diversity at a linked neutral locus in terms of moments of genetic associations between loci. Expressions for these genetic moments at equilibrium are then computed for arbitrary rates of selfing and recombination. An extrapolation of the results to the case where deleterious alleles segregate at multiple loci is checked using individual-based simulations. At high selfing rates, the tight linkage approximation underestimates the effect of background selection in genomes with moderate to high map length; however, another simple approximation can be obtained for this situation and provides accurate predictions as long as the deleterious mutation rate is not too high.

Keywords: deleterious mutation, genetic drift, effective population size, multilocus population genetics, self-fertilization


UNDERSTANDING the evolutionary consequences of transitions between reproductive systems has been the focus of an important number of theoretical and empirical studies. In particular, the shift from biparental sexual reproduction to self-fertilization has occurred frequently in plants and animals (Goodwillie et al. 2005; Jarne and Auld 2006), but the phylogenetic distribution of selfing lineages suggests that these are often relatively short-lived and may thus correspond to an “evolutionary dead end” or “blind alley” (e.g., Stebbins 1957; Williams 1992; Takebayashi and Morrell 2001; Goldberg et al. 2010; Igic and Busch 2013). A possible reason for the lack of macroevolutionary success of selfing species may be their reduced capacity to produce novel genotypes (in particular, genotypes adapted to new environmental conditions), due to a reduced efficiency of recombination. Furthermore, self-fertilization lowers the effective size of populations and should thereby decrease the efficiency of natural selection against deleterious alleles, which may lead to mutation accumulation and population extinction (Lynch et al. 1995; Schultz and Lynch 1997). Analyses based on molecular data show little evidence for increased ratios of nonsynonymous to synonymous substitutions (dN/dS) in selfing lineages that would indicate a reduced efficiency of purifying selection (Glémin and Muyle 2014; Hartfield 2015 and references therein): this may be due to the recent origin of those lineages or to the low rates of outcrossing maintained by most predominantly selfing species (Wright et al. 2013). However, several recent studies showed elevated ratios of nonsynonymous to synonymous polymorphism (πN/πS) in various selfing species (compared with their outcrossing relatives), suggesting that deleterious alleles may reach higher frequencies in selfers (e.g., Brandvain et al. 2013; Burgarella et al. 2015; and other references listed in table 1 of Hartfield 2015).

The lower effective size of selfing populations has been demonstrated empirically using neutral diversity data from a variety of species (e.g., Charlesworth 2003; Glémin et al. 2006) and is thought to result from two types of effects. The first type is an automatic increase in the rate of coalescence per generation, since the ancestral lineages of the two homologous copies of a gene in an individual may coalesce in a single generation (with probability 1/2) if this individual has been produced by selfing. Due to this effect, the effective population size is given by Ne=N/(1+F) (Pollak 1987; Nordborg 2000), where N is the census size and where the inbreeding coefficient F equals α/(2α) in a population in which a proportion α of individuals are produced by selfing. Therefore, Ne is expected to decline linearly from N to N/2 as α increases from 0 to 1. However, Ne may be further decreased (second effect) by selective sweeps or by selection against deleterious alleles (background selection) (Charlesworth et al. 1993; Charlesworth 2012), whose effects are amplified by the lower effective recombination rates of selfing populations.

Several models have computed the effect of background selection on neutral diversity in randomly mating populations, using different approaches (Hudson and Kaplan 1995; Nordborg et al. 1996; Santiago and Caballero 1998; Charlesworth 2012). These showed in particular that a deleterious allele at mutation–selection balance reduces the expected diversity at a linked neutral locus by a factor 1(ush)/(r+sh)2, where u is the mutation rate toward the deleterious allele, sh the heterozygous fitness effect of this allele (assumed different from zero), and r the recombination rate between the two loci. The case of partially selfing populations has been addressed by Nordborg (1997), using a structured coalescent model and a separation of timescales argument. Indeed, assuming that recombination and coalescence of lineages present in different individuals occur at a much lower rate than the coalescence of lineages present in the same individual due to selfing, the population can be described in terms of haplotypes instead of diploid genotypes, which considerably simplifies the analysis. Under this assumption, the effect of a deleterious allele on linked neutral diversity is given by a similar expression as in the panmictic case, replacing sh by s[h(1F)+F] (measuring the strength of selection against the deleterious allele in a partially selfing population) and r by the effective recombination rate r(1F) (see also Nordborg 2000). Extrapolating this result to the case of deleterious alleles segregating at many loci, Glémin (2007) and Glémin and Ronfort (2012) showed that the effective size of highly selfing populations may be strongly reduced by background selection effects.

Strictly, Nordborg’s (1997) result holds for tightly linked loci, since the separation of timescales argument supposes a low recombination rate. While the effective population size at a given locus should be little affected by loosely linked loci as long as the selfing rate remains moderate, this may be less so when the selfing rate is high, so that linkage disequilibria may extend over relatively large genetic distances (e.g., Nordborg et al. 2002). Using multilocus individual-based simulations of partially selfing populations, Kamran-Disfani and Agrawal (2014) observed discrepancies between the estimated Ne and predictions obtained by extrapolating Nordborg’s (1997) result over a whole genetic map. These may be caused by the fact that the effects of loosely linked loci are not sufficiently well predicted by the separation of timescales approximation and become important at high selfing rates.

In this article, I construct a model of background selection in partially selfing populations by extending the multilocus population genetics framework previously developed by Barton and Turelli (1991) and Kirkpatrick et al. (2002). As we will see, a strength of this approach is that it allows one to decompose evolutionary processes (here the background selection effect) into different terms involving linkage disequilibria and other forms of genetic associations, for which intuitive interpretation can be given. Expressions valid for any value of the recombination rate are derived and shown to converge to Nordborg’s (1997) result when linkage is tight. However, this tight linkage approximation may significantly underestimate the strength of background selection when the selfing rate is high (but <1); we will see that another approximation yielding better predictions at high selfing rates can be obtained from the general model. A good match between the analytical predictions and multilocus simulation results is observed as long as the genomic deleterious mutation rate U is not too high (0.1 per haploid genome), while discrepancies appear at higher values of U: those are likely due to genetic associations between deleterious alleles at different loci, which are neglected in the analysis.

Model

General method

As in previous models of background selection (e.g., Hudson and Kaplan 1995; Nordborg et al. 1996), I first consider the effect of a single deleterious allele maintained at mutation–selection balance at a given locus on the dynamics of genetic diversity at a linked neutral locus. This effect can be quantified by computing the expected change in neutral diversity over one generation, which is affected by various moments of genetic associations between the two loci (e.g., the variance in linkage disequilibrium and other moments of associations between genes present either on the same haplotype or on different haplotypes of a diploid individual). Assuming recurrent mutation at the neutral locus, the effective population size Ne at the neutral locus can be deduced from the expected neutral diversity at equilibrium (Nordborg et al. 1996). Alternatively, we may ignore mutation at the neutral locus and calculate Ne by equating the expected rate of loss in diversity per generation to 1/(2Ne), since diversity is eroded at a rate 1/(2N) per generation in a Wright–Fisher population. This is the approach that is used here. Strictly, it relies on a quasi-equilibrium approximation, since moments of genetic associations (for example, the variance in linkage disequilibrium) are expressed in terms of diversity at the neutral locus and of the frequency of the deleterious allele, implying that these moments of genetic associations equilibrate fast relative to changes in allele frequencies. However, this approximation is justified when population size is sufficiently large (so that changes in allele frequencies due to drift remain small) and yields the same expression for Ne as what would be obtained by calculating the equilibrium neutral diversity under recurrent mutation.

I consider the following life cycle: N individuals are present at the start of each generation and produce a very large (effectively infinite) number of juveniles in proportion to their fitness. A proportion α of offspring is produced by selfing, while the remaining 1α is produced by random fusion of gametes. Finally, N individuals are sampled randomly among all juveniles produced, to form the next generation (drift). Fitness depends on genotype at the selected locus, where two alleles (denoted 0 and 1) are segregating: allele 1 is deleterious, reducing fitness by a factor 1sh in the heterozygous state and 1s in the homozygous state. The deleterious mutation rate (from allele 0 to allele 1) is denoted u. As in previous treatments (Hudson and Kaplan 1995; Nordborg et al. 1996) I assume that sN1 and us, so that the frequency of the deleterious allele remains small and can be approximated by the deterministic mutation–selection balance frequency (strictly, this also assumes Npdel1, where pdel is the frequency of the deleterious allele). Finally, r measures the recombination rate between the two loci.

In the following I use a general method to compute the effects of selection, reproduction, and drift on moments of genetic associations. This method is based on a previous formalism for the analysis of multilocus models (Barton and Turelli 1991; Kirkpatrick et al. 2002), extended to include genetic drift. It was used previously to study selection for sex in finite diploid populations undergoing both sexual and asexual reproduction (Roze and Michod 2010) and is described in Appendix A for the case of a partially selfing population. To simplify the notation, the examples shown in Appendix A concern the case of a biallelic neutral locus; however, the method extends to multiple alleles, yielding the same expression for the decay of neutral diversity per generation. The analysis of the two-locus model then proceeds in three steps. First, I express the expected decay of genetic diversity per generation in terms of various genetic moments involving both loci. Then, recurrence equations describing the dynamics of these two-locus moments are derived (to the first order in 1/N and assuming that the deleterious allele stays at low frequency). Finally, these equations are solved to obtain expressions for two-locus moments at (quasi-)equilibrium, in terms of diversity at the neutral locus and of the different parameters of the model. Injecting these solutions into the equation describing the decay of neutral diversity yields an expression for the effect of the deleterious allele on Ne at the neutral locus.

The results of this two-locus model are then extrapolated to a situation where deleterious alleles segregate at a large number of loci, located at various genetic distances from the neutral locus. For this, I assume that the effects of the different selected loci on diversity at the neutral locus are multiplicative, thereby neglecting genetic associations between selected loci. In the absence of epistasis between deleterious alleles, this approximation is expected to yield correct results under random mating in the regime considered here (where selection against deleterious alleles is stronger than drift, e.g., Hudson and Kaplan 1995) but may be less accurate under partial selfing, as inbreeding generates different forms of associations between loci (correlations in homozygosity in particular, e.g., Roze 2015). Nonetheless, we will see that this assumption of multiplicative effects often generates accurate predictions, as long as the genomic deleterious mutation rate is not too high.

Defining genetic associations

The parameters and variables of the model are summarized in Table 1. Throughout the following, the neutral locus is denoted A, while the selected locus is denoted B. Two alleles denoted 0 and 1 segregate at each locus (we will see below how the notation can be extended to deal with multiple neutral alleles), allele 1 at locus B being the deleterious allele. Indicator variables XiM and XiP describe the genotype of an individual at locus i: these variables equal 1 if allele 1 is present on the maternally or paternally (respectively) inherited chromosome of this individual at locus i and 0 otherwise. The frequency of allele 1 at locus i (denoted pi) is thus

pi=E[XiM+XiP2], (1)

where E stands for the average over all individuals in the population. Neglecting drift, the frequency of the deleterious allele at mutation–selection balance (denoted p˜B) is given by

p˜Bus[h(1F)+F] (2)

with F=α/(2α) (e.g., Glémin 2003).

Table 1. Parameters and variables.

Symbol Definition
N Population size
α Selfing rate
s Strength of selection against the deleterious allele
h Dominance coefficient of the deleterious allele
u Mutation rate toward the deleterious allele
r Recombination rate between the neutral locus and the selected locus
U Genomic deleterious mutation rate (per haploid genome)
R Genome map length
pA Frequency of allele 1 at locus A (neutral locus)
pB Frequency of allele 1 (deleterious allele) at locus B
p˜B Deleterious allele frequency at mutation–selection balance
DS,T Genetic association between the sets S and T of loci present on different haplotypes of an individual (see Equation 4)
DAA Genetic diversity at the neutral locus
F Inbreeding coefficient (probability of identity-by-descent between the maternal and paternal copies of a gene, due to selfing)
φAB Joint probability of identity-by-descent at loci A and B
GAB=φABF2 Identity disequilibrium between loci A and B
E[X] Average of the quantity X over all individuals
Expected value of the moment over the stochastic process

For each locus i, centered variables ζiM and ζiP are defined as

ζiM=XiMpi,ζiP=XiPpi. (3)

Following Kirkpatrick et al. (2002), the association between the sets S and T of loci present on the two haplotypes of the same individual is given by

DS,T=E[ζS,T], (4)

where

ζS,T=ζSMζTP+ζSPζTM2,ζSM=iSζiM,ζTP=iTζiP (5)

(note that DS,T=DT,S), and where sets S and T may be the empty set Ø, A, B, or AB. Associations between genes present on the same haplotype of an individual (DS,Ø) are simply denoted DS. For example, DA,A=E[(XAMpA)(XAPpA)] is a measure of the departure from Hardy–Weinberg equilibrium at locus A, while DAB=(1/2)E[(XAMpA)(XBMpB)+(XAPpA)(XBPpB)] represents the linkage disequilibrium between loci A and B (genetic association between alleles present on the same haplotype, maternal or paternal). Similarly, DA,B=(1/2)E[(XAMpA)(XBPpB)+(XAPpA)(XBMpB)] measures the association between alleles at loci A and B present on different haplotypes of the same individual.

Because population size is finite, allele frequencies and genetic associations are random variables. Throughout this article, I use the notation for the expected value of the genetic moment (a product of allele frequencies, genetic associations, or both) at a given generation: for example, DAB2 is the expected squared linkage disequilibrium between the two loci. In the following, the moment DAA=(1/2)E[(XAMpA)2+(XAPpA)2] will be of particular importance: indeed, using the fact that (XAM)2=XAM and (XAP)2=XAP (since these variables equal 0 or 1), we obtain that DAA=pAqA (where qi=1pi), thus representing the expected genetic diversity at locus A. Therefore, the effective population size at the neutral locus can be quantified by computing the rate of decay of DAA per generation.

As mentioned above, the model can be extended to an arbitrary number n of alleles segregating at the neutral locus. In this case, we can define indicator variables XA,kM and XA,kP that equal 1 if the maternally (respectively, paternally) inherited chromosome of a given individual carries allele k at locus A (and 0 otherwise), with k=1,,n. Vectors XAM and XAP are defined as XAM=(XA,1M,XA,2M,,XA,nM) and XAP=(XA,1P,XA,2P,,XA,nP); in each of these vectors (and for a given individual) a single element equals 1 while all other elements equal zero. Finally, the vector pA=E[(XAM+XAP)/2] holds the frequencies of the different alleles at locus A in the population. Defining ζAM=XAMpA and ζAP=XAPpA, genetic associations may be defined in the same way as above, associations with two “A” subscripts involving a dot product between the corresponding ζA vectors. In particular,

DAA=12E[ζAM.ζAM+ζAP.ζAP]=1k=1npA,k2, (6)

where pA,k is the frequency of allele k at locus A in the population. Similarly, DA,A=E[ζAM.ζAP], while DAB,A=(1/2)E[(ζAM.ζAP)(ζBM+ζBP)]. Equation 6 shows that, as in the biallelic case, DAA represents the expected genetic diversity at the neutral locus. As mentioned earlier, the method for computing multilocus moments is explained in Appendix A for the case of a biallelic neutral locus, but the results shown below are valid for any number of alleles segregating at this locus.

Multilocus simulations

Analytical predictions are tested using individual-based, multilocus simulations. The simulation program (written in C++ and available from Dryad) is very similar to the program used in Roze (2015), representing a population of N diploid individuals whose genome consists of a linear chromosome with total map length R. Relatively large values of R (usually 10 M) are used in most simulations to mimic a whole genome with multiple chromosomes. Each generation, deleterious alleles occur at rate U per haploid genome; all deleterious alleles have the same selection and dominance coefficient and have multiplicative effects on fitness (no epistasis). As we will see, variable selection coefficients have been implemented in a different version of the program. Offspring are formed by selfing with probability α and by random fusion of gametes with probability 1α. A neutral locus with an infinite number of possible alleles is located at the midpoint of the chromosome, mutating at a rate μ=103 per generation. The program runs for 2×106 generations, genetic diversity at the neutral locus being recorded every 50 generations and measured as D=1ipi2 (where pi is the frequency of neutral allele i). The effective population size at the neutral locus is then estimated from D¯=4Neμ/(1+4Neμ), where D¯ is the average neutral diversity at equilibrium, yielding

Ne=D¯4μ(1D¯) (7)

(the exact expression including terms in μ2 yields undistinguishable results for the parameter values used here). In the simulations, D¯ is obtained by averaging after a burn-in period of 15,000 generations, which was sufficient for diversity to reach equilibrium with μ=103. As we will see, a different version of the program including a neutral sequence with an infinite number of sites was also used, in which case diversity takes longer to equilibrate.

Data availability

The author states that all data necessary for confirming the conclusions presented in the article are represented fully within the article. The Dryad DOI is doi:10.5061/dryad.p3r01.

Results

All results are obtained using the method presented in Appendix A for computing recursions for multilocus moments, implemented in a Mathematica notebook (Supplemental Material, File S1). All terms are derived to the first order in 1/N and to the first order in p˜B (the frequency of the deleterious allele at mutation–selection balance, given by Equation 2).

General results

The expected change in genetic diversity at locus A per generation can be written as

ΔDAA=ΔsDAA+ΔdDAA, (8)

where ΔsDAA is the change in diversity due to selection and ΔdDAA the change in diversity due to drift, given by

ΔdDAA=12N(DAA+DA,A) (9)

to leading order. In the absence of selection, ΔsDAA = 0 while DA,A at quasi-equilibrium is given by (from Equation A10)

DA,A=FDAA (10)

with F=α/(2α), and the expected change in neutral diversity per generation thus becomes

ΔDAA=ΔdDAA=12N(1+F)DAA. (11)

This corresponds to the classical result that Ne = N/(1+F) under partial selfing (Pollak 1987): the increased homozygosity caused by selfing amplifies the effect of drift, since the same allele is sampled twice every time a homozygote is sampled. When selection acts at locus B, we obtain the following expression for ΔsDAA to the first order in s:

ΔsDAA=sh(DAAB+DAA,B)s(12h)(DAAB,BDAADB,B). (12)

An expression to the second order in s is provided in Appendix B; however, both expressions generally yield very similar quantitative results, although adding the terms in s2 may slightly improve the predictions under loose linkage. The different terms that appear in Equation 12 may be interpreted as follows. From the definitions given in the previous section, the term DAAB+DAA,B may also be written as E[(1/2)(ζAAM+ζAAP)(ζBM+ζBP)] (where again E stands for the average over all individuals), thus measuring a covariance between (1/2)(ζAAM+ζAAP) and ζBM+ζBP (note that E[ζBM+ζBP]=0). The quantity (1/2)(ζAAM+ζAAP) is higher in individuals carrying rarer alleles at the neutral locus: for example, in the case of a biallelic neutral locus, it equals pA2, (pA2+qA2)/2, and qA2 in 00, 01, and 11 individuals, where pA is the frequency of allele 1. Furthermore, the quantity ζBM+ζBP is higher in individuals carrying more deleterious alleles at the selected locus (it is nearly 0, 1, and 2 in individuals carrying 0, 1, and 2 deleterious alleles at locus B, assuming pB is small). Therefore, a positive value of DAAB+DAA,B indicates that rarer alleles at the neutral locus tend to be found in individuals carrying higher numbers of deleterious alleles at the selected locus, while a negative value of DAAB+DAA,B indicates the opposite. Recursions for the moments DAAB and DAA,B over one generation are given in Appendix B, to the first order in s, p˜B, and 1/N. In the absence of selfing (α=0), we obtain that DAA,B=0 at quasi-equilibrium, while DAAB is generated by the variance in linkage disequilibrium DAB2 and by the effect of selection and is positive when sh>0. Indeed, when a given neutral allele becomes associated (by chance) to the deleterious allele at locus B, this neutral allele tends to decrease in frequency, generating a positive DAAB (the deleterious allele at locus B tends to become associated with rarer alleles at locus A). This in turns reduces genetic diversity at the neutral locus (as shown by Equation 12), since these rarer alleles will further decrease in frequency due to their association with the deleterious allele. Because partial selfing generates cross-haplotype associations (between genes present on different haplotypes of a diploid), DAAB and DAA,B are given by more complicated expressions when α>0, involving moments such as DABDA,B, DABDAB,B, or DAB,B2 (see Appendix B).

Similarly, the quantity DAAB,BDAADB,B that appears on the second line of Equation 12 measures a covariance between (1/2)(ζAAM+ζAAP) and ζB,B, the quantity ζB,B being higher in homozygotes at locus B than in heterozygotes. A positive value of DAAB,BDAADB,B thus indicates that rarer alleles at locus A tend to be found more often in homozygotes at locus B than in heterozygotes. As shown by Equation 12, this would reduce neutral diversity when the deleterious allele is partially recessive (h<0.5), since in this case homozygotes at the selected locus have a lower fitness than heterozygotes. A recursion for DAAB,BDAADB,B is given in Appendix B (Equation B14); remarkably, it shows that identity disequilibrium (covariance in homozygosity) between the two loci generates negative DAAB,BDAADB,B (that is, rarer alleles at locus A tend to be found more often in heterozygotes at locus B) even in the absence of selection. Indeed, setting s=0 in Equation B14 yields at quasi-equilibrium

DAAB,BDAADB,B=2N(2α)GABp˜BDAA, (13)

where GAB is the identity disequilibrium between loci A and B (Weir and Cockerham 1969). It is given by GAB=φABF2, where

φAB=α2α2α2(23α)r(1r)2α[12r(1r)] (14)

is the probability of joint identity-by-descent at the two loci. This result may be interpreted as follows. Due to identity disequilibrium, the frequency of heterozygotes at locus A is higher among heterozygotes at locus B than among homozygotes. Furthermore, the frequency of rarer neutral alleles is higher among heterozygotes than among homozygotes at locus A (this is easily seen in the case of a biallelic locus): therefore, the frequency of rarer alleles at locus A should be higher among heterozygotes at locus B. As shown by Equation 12, this effect tends to increase neutral diversity, as long as the deleterious allele is partially recessive (h<0.5), so that heterozygotes at locus B have a higher fitness than homozygotes. As shown in the next subsection (high effective recombination), the effect of identity disequilibrium dominates over all other effects when the effective recombination rate r(1F) is sufficiently high [more precisely, r(1F)s], in which case partially recessive deleterious alleles tend to increase neutral diversity (this effect usually stays rather small). However, weaker effective recombination increases the relative importance of the terms on the first line of Equation 12 that tend to decrease neutral diversity. Furthermore, weak effective recombination changes the sign of DAAB,BDAADB,B (through the terms in s in Equation B14), due to the fact that the association between rarer alleles at locus A and the deleterious allele at locus B (represented by moments DAAB, DAA,B) generates an association between those rarer alleles and homozygosity for the deleterious allele at locus B.

As shown in Appendix B, calculating the different terms of Equation 12 at quasi-equilibrium requires computing six two-locus moments that are generated by finite population size: DAB2, DABDA,B, DA,B2, DABDAB,B, DA,BDAB,B, and DAB,B2. Recursions for these moments are also given in Appendix B. Although the solutions obtained are rather complicated, they are readily computed numerically using Mathematica (see File S2); furthermore, we will see that they can be approximated by simpler expressions in several cases. Importantly, all expressions obtained are in 1/N and thus vanish when N tends to infinity.

Besides its effect on ΔsDAA, selection at locus B also affects the average excess homozygosity at locus A DA,A and thus the term ΔdDAA in Equation 8. Because DA,A is multiplied by 1/N in Equation 9, it is sufficient to compute this moment in the limit as N tends to infinity, to obtain an expression for ΔDAA to the first order in 1/N. Using the results in Roze (2015) (also derived in File S1), we have at quasi-equilibrium, to the first order in s and p˜B,

DA,A=F[1s(12h)GABp˜B]DAA, (15)

where again GAB is the identity disequilibrium between loci A and B. Indeed, homozygosity at locus A is reduced when the deleterious allele at locus B is partially recessive (h<1/2), due to the fact that homozygotes at locus A tend to be also homozygous at locus B, while homozygotes at locus B have a lower fitness than heterozygotes.

Putting everything together, we obtain an expression for the change in diversity at locus A over one generation of the form

ΔDAA=12N(1+F)(1+Tp˜B)DAA, (16)

where T is a function of s, h, r, and α. To the first order in p˜B, we thus have

Ne=N1+FBsel, (17)

where Bsel=1Tp˜B represents the effect of background selection. Although the expression obtained for T from the equations given in Appendix B is complicated, we will now see that simple approximations can be obtained in several regimes (in particular, high effective recombination, tight linkage, and high selfing).

High effective recombination (with partial selfing)

Under partial selfing and when the effective recombination rate r(1F) is high (and assuming that the dominance coefficient h of the deleterious allele is significantly different from 0.5), Equation 12 is dominated by the term on the second line, since DAAB,BDAADB,B is generated by drift even in the absence of selection and is thus proportional to 1/N (see Equation 13), while the terms DAAB and DAA,B on the first line are generated by selection and drift and are thus proportional to s/N. Neglecting the term on the first line of Equation 12 and using Equations 9 and 15 yields the following expression for the change in neutral diversity, to the first order in s, p˜B, and 1/N:

ΔDAA=12N(1+F)[1s(12h)4+α2GABp˜B]DAA. (18)

Using Equation 2, we obtain for the effective population size at the neutral locus

NeN1+F[1+u(12h)h(1F)+F4+α2GAB], (19)

independent of s. Equation 19 shows that identity disequilibrium between the neutral and the selected locus (GAB) increases the effective population size at the neutral locus when the deleterious allele is partially recessive. This is caused by two effects: (i) identity disequilibrium reduces the excess homozygosity at locus A caused by selfing (Equation 15), since homozygotes at locus A tend to be also homozygous at locus B, while homozygotes at locus B have a lower fitness than heterozygotes when the deleterious allele is partially recessive; and (ii) the higher fitness of heterozygotes at locus B increases diversity at locus A since heterozygotes at locus B tend to be also heterozygous at locus A (Equation 13).

This increase in effective population size caused by identity disequilibrium usually stays modest, however (since it is expected to occur only for high effective recombination), and is thus difficult to observe in simulations. Background selection has stronger effects when the effective recombination rate becomes low, in which case the term DAAB+DAA,B in the first line of Equation 12 becomes of the same order of magnitude as the term on the second line [indeed, we can show that the denominator of DAAB and DAA,B is proportional to ε when both r(1F) and s are of order ε], while the sign of DAAB,BDAADB,B changes due to the effect of selection. Two approximations for this regime are given below (tight linkage, high selfing).

Random mating

In the absence of selfing (α=0), Equations B4–B9 yield the following expressions for DAB2 and DA,B2 at quasi-equilibrium,

DAB2=1+r2(12sh)2N[1(1r)2(12sh)]p˜BDAA (20)
DA,B2=12Np˜BDAA, (21)

while DABDA,B, DABDAB,B, DA,BDAB,B, and DAB,B2 = 0. Furthermore, Equations B10–B14 yield

DAAB=2sh[(1r)DAB2+rDA,B2]1(1r)(1sh), (22)

while DAA,B and DAAB,BDAADB,B = 0. Finally, the rate of decay of neutral diversity is given by

ΔDAA=shDAAB(sh)2(DAB2+DA,B2)12NDAA. (23)

Under tight linkage (i.e., both r and s are of order ε, where ε is a small term), we obtain from Equations 20–22

DAB2p˜BDAA4N(r+sh)DA,B2, (24)
DAABshp˜BDAA2N(r+sh)2, (25)

giving

ΔDAA12N[1+(shr+sh)2p˜B]DAA (26)

in agreement with the results obtained by Hudson and Kaplan (1995) and Nordborg et al. (1996). Under loose linkage (rs), Equations 20–23 yield

ΔDAA12N[1+(sh)2(1+4r2)r2p˜B]DAA. (27)

Setting r=1/2 in Equation 27 and replacing p˜B by u/(sh) yields NeN(18shu), which is equivalent to Robertson’s (1961) heuristic result that selection at unlinked loci decreases the effective population size by four times the additive variance in fitness—indeed, the variance in fitness caused by selection against the deleterious allele is 2shu (see also Charlesworth 2012).

Tight linkage

When r is small, Nordborg’s (1997) separation of timescales argument can be used to express associations between genes present on different haplotypes of a diploid in terms of associations between genes present on the same haplotype (see also Nordborg 2000; Padhukasahasram et al. 2008). Consider for example the association DA,B between two genes sampled at loci A and B from different haplotypes of an individual. Going backward in time, two different events may happen to the ancestral lineages of these genes: they may find themselves on the same haplotype (which may take only a few generations if both lineages stay in the same individual due to selfing) or move to different individuals due to an outcrossing event, in which case it will take a long time before they find themselves again in the same individual (assuming N is large). To leading order, the probability that these lineages join on the same haplotype before moving to different individuals is F. Considering now all possible pairs of genes at loci A and B on different haplotypes of the same individual (in all individuals of the population) and going backward in time, we may assume that a proportion F of such pairs find themselves on the same haplotype after a small number of generations, while the remaining 1F have moved to different individual lineages (and thus become independent): therefore, DA,BFDAB—note that this approximation assumes tight linkage, as it neglects recombination events separating genes that have joined on the same haplotype, over the small number of generations considered. Using this approximation, we have

DABDA,BFDAB2,DA,B2F2DAB2. (28)

Similarly, DAB,BFDABB=F(12pB)DAB (from Equation A9), which yields (assuming that pB is small)

DABDAB,BFDAB2,DA,BDAB,BDAB,B2F2DAB2. (29)

Finally, DAAB,BFDAABB=F(DAADBB+(12pB)DAAB), from which we obtain (using the fact that the moment pBDAAB is negligible, as shown in File S1)

DAAB,BDAADB,BDAA,BFDAAB. (30)

Equations 28–30 can also be obtained from Equations B4–B14, under the assumption that r is small (see File S1). Plugging Equations 28–30 into Equation 12 yields

ΔsDAAs[h(1F)+F]DAAB. (31)

Furthermore, plugging Equations 29 and 30 into Equations B4 and B10 and assuming that r is small yields the same expressions as Equations 24 and 25 (obtained for random mating) for DAB2 and DAAB, replacing h by h(1F)+F, r by r(1F), and N by N/(1+F). Therefore, the present model converges to Nordborg’s (1997) result when loci are tightly linked.

Figure 1A shows the decrease in effective population size at the neutral locus caused by the deleterious allele (Bsel, see Equation 17), as a function of the recombination rate between the two loci (on a log scale). When r tends to zero, Bsel tends to 1p˜B, and selfing increases Ne (as it decreases the frequency of the deleterious allele). When r is sufficiently high, however (∼ >0.01 for the parameter values used in Figure 1), increased selfing causes stronger background selection. Under complete selfing (α=1), Bsel becomes independent of r and is 1u/s. As can be seen in Figure 1A, the tight linkage approximation accurately predicts the solution obtained from Equations B4–B14, except when r and α are high (in which case it underestimates the strength of background selection). As we will see, this discrepancy for high values of r may lead to substantial differences when integrating over a large genetic map, as the majority of deleterious alleles are only loosely linked to the neutral locus, yet significantly affect Ne at this locus when selfing is high (this is more visible in Figure 1C, showing the strength of background selection as a function of the position of the deleterious allele along the genetic map). Note, however, that the tight linkage approximation yields the same prediction as the more general model when α=1 (Bsel1u/s).

Figure 1.

Figure 1

Background selection effect generated by a single deleterious allele (Bsel, see Equation 17) on Ne at the neutral locus, as a function of the recombination rate r between the two loci (A and B) and of the position of the deleterious allele along a linear chromosome of total map length 10 M (C and D) (the neutral locus is located at position 0). Different colors correspond to different values of the selfing rate as indicated in A. Solid curves correspond to the results obtained from the equations in Appendix B, dashed curves in A and C correspond to the tight linkage approximation [Equation 26, replacing h by h(1F)+F and r by r(1F)], and dotted curves in B and D correspond to the high selfing approximation (Equation 34). Parameter values: s=0.05, h=0.3, u=105.

High selfing

Simple approximations can also be obtained when the selfing rate is high, for any value of the recombination rate r. From Equations B4–B9 and assuming that α is close to 1, we obtain

DAB2DABDA,BDA,B2DABDAB,BDA,BDAB,BDAB,B21+2r2N[s+2r(1α+s)]p˜BDAA. (32)

Furthermore, Equations B10–B14 yield

DAABDAA,BDAAB,BDAADB,B(1+2r)2sN[s+2r(1α+s)]2p˜BDAA, (33)

which, using p˜Bu/s, generates the following approximation for the rate of decay of neutral diversity:

ΔDAA12N(1+F)[1+((1+2r)ss+2r(1α+s))2p˜B]DAA. (34)

Note that Equation 34 again yields Bsel1u/s when α=1. As shown in Figure 1D, this approximation better matches the general model than the tight linkage approximation when the selfing rate is high (but <1) and when linkage is loose. However, Figure 1B shows that it differs substantially from the general model under tight linkage and is thus expected to perform poorly when the selfing rate is not high (since in this case background selection is mainly caused by deleterious alleles that are tightly linked to the neutral locus).

Multilocus extrapolation and simulations

Following previous work (e.g., Hudson and Kaplan 1995; Nordborg 1997; Glémin 2007; Kamran-Disfani and Agrawal 2014), results from the two-locus model may be extrapolated to the case of deleterious alleles segregating at multiple loci by assuming multiplicative effects of the different deleterious alleles on neutral diversity. When the neutral locus is located at the midpoint of a linear genome with total map length R, and when all deleterious alleles have the same selection and dominance coefficients (as in the simulation program), this yields (using Equation 2)

NeN1+Fexp[Us[h(1F)+F]×2R0R/2T(x)dx], (35)

where U is the deleterious mutation rate per haploid genome and T(x) corresponds to the term T in Equation 16, in which the recombination rate r is expressed in terms of the genetic distance x (in morgans) between the neutral locus and the deleterious allele, using Haldane’s mapping function r=(1/2)[1e2x] (Haldane 1919). This integral can be computed numerically as shown in File S2.

Figure 2 shows the effective size of a population of census size 20,000 as a function of the selfing rate, for a haploid genomic deleterious mutation rate U=0.1 and genome map length R=10 M. As can be seen in Figure 2, the analytical model (solid curves) fits well with the simulation results for all values of α. Note that in Figure 2, Figure 3, Figure 4, and Figure 5, solid curves have been obtained by calculating numerically the integral in Equation 35, using a system of recursions expressed to the second order in s for the different moments shown in Appendix B (see File S1 and File S2); however, the results obtained using the recursions given in Appendix B (expressed to the first order in s) are undistinguishable in most cases (results not shown). Explicit forms can be obtained for the integral in Equation 35 (as a function of R and the different parameters of the model), when using either the tight linkage approximation or the high selfing approximation, and are given in File S2. The results shown in Figure 2 confirm that the tight linkage approximation underestimates the effect of background selection when the selfing rate is high, as loosely linked deleterious alleles significantly affect neutral diversity. In this case, the high selfing approximation is more accurate, closely matching the simulation results when α>0.85.

Figure 2.

Figure 2

Effective population size Ne as a function of the selfing rate α, for different values of the dominance coefficient of deleterious alleles h. Solid circles show simulation results (in this and subsequent figures, error bars are smaller than the size of the solid circles), solid curves show analytical predictions from the complete model (see File S2), dashed curves show the tight linkage approximation [Equation 26, replacing h by h(1F)+F and r by r(1F)], and dotted curves show the high selfing approximation (Equation 34). Parameter values: N=20,000, s=0.05, U=0.1, R=10.

Figure 3.

Figure 3

Effective population size Ne as a function of the selfing rate α, for different values of genome map length R: 0.1 (red), 1 (orange), 10 (green), and 100 (blue). Solid circles show simulation results, solid curves show analytical predictions from the complete model, dashed curves (left) show the tight linkage approximation, and dotted curves (right) show the high selfing approximation. Parameter values: N=20,000, s=0.05, h=0.3, U=0.1.

Figure 4.

Figure 4

Effective population size Ne as a function of the selfing rate α, for different values of the strength of selection against deleterious alleles s: 0.005 (blue), 0.05 (green), and 0.5 (red). Solid circles show simulation results, solid curves show analytical predictions from the complete model, dashed curves (left) show the tight linkage approximation, and dotted curves (right) show the high selfing approximation. Parameter values: N=20,000, h=0.3, U=0.1, R=10.

Figure 5.

Figure 5

Effective population size Ne as a function of the selfing rate α, for different values of the dominance coefficient of deleterious alleles h. Circles and curves are the same as in Figure 2 with U=0.5.

Figure 3 and Figure 4 show that the full model provides accurate predictions for Ne for different values of map length R (from 0.1 to 100) and strength of selection against deleterious alleles s (from 0.005 to 0.5). As expected, the tight linkage approximation works better at lower values of R. When the selfing rate is low, increasing s magnifies the strength of background selection, while it has the opposite effect under high selfing (due to the fact that mutation-free individuals are more abundant when selection against deleterious alleles is stronger). Finally, Figure 5 shows that increasing the deleterious mutation rate U (to 0.5 per haploid genome) increases the strength of background selection, leading to very low values of Ne at high selfing rates (Ne estimated from the simulations is close to 45 when α=1). When U=0.5, discrepancies between the full model and the simulations appear at low values of h (h=0.1 in particular) and intermediate values of α: these are probably caused by genetic associations between selected loci (identity disequilibrium in particular), which are neglected in the analysis. Using an expression for the frequency of deleterious allele p˜B that takes into account the effect of identity disequilibria (Roze 2015) does not significantly improve the results (not shown). Discrepancies also appear at higher values of h and high selfing rate (α 0.8–0.9), the analytical model underestimating the strength of background selection. Finally, the model overestimates the effect of background selection when the selfing rate is very high (α>0.95), for all values of h: although this is not visible in Figure 5 (as Ne is very small when α is high), it becomes apparent when Ne is plotted on a log scale, as shown in Figure S1. For these parameter values (α=0.98,1), selection against deleterious alleles is no longer efficient and these accumulate over time in the population (results not shown).

Equation 35 can be extended to the more realistic case where deleterious alleles at different loci have different fitness effects (assuming that selection remains sufficiently strong at most loci so that deleterious alleles stay near mutation–selection balance), by replacing s and h by functions of map position x (e.g., Charlesworth 2012). If the number of deleterious loci is large and if the distribution of fitness effects of mutations does not depend on genomic location, the integral in Equation 35 can be replaced by a double integral, over map position x and over the joint distribution of s and h. To test this, the simulation program was modified to include a log-normal distribution of selection coefficients s across loci, assuming a constant heterozygous effect of mutations (sh), generating a negative covariance between s and h (see figure 5 in Roze 2015). Figure 6 shows results for α=0.9 and for different values of the variance of deleterious effects of mutations across loci, setting the average of the log-normal distribution and the heterozygous effect of mutations so that s¯=0.05 and h¯=0.25 in all cases. As can be seen in Figure 6, increasing the variance of s slightly increases Ne, and this effect is captured by integrating Equation 34 (high selfing approximation) over the distribution of s.

Figure 6.

Figure 6

Left: effective population size Ne as a function of the standard deviation σ of lns across loci, assuming a log-normal distribution of s and setting the average of lns so that s¯=0.05 for all values of σ. Solid curve shows prediction obtained by integrating Equation 34 over the distribution of s and over the genetic map; solid circles show simulation results. Parameter values: N=20,000, α=0.9, U=0.1, R=10. In the simulations the heterozygous fitness effect hs is the same for all mutations and adjusted so that h¯=0.25 for all values of σ. Right: the distribution of s for σ=0.1 (solid curve), 0.5 (dashed curve), and 1 (dotted curve).

Temporal change in population size

It is worth emphasizing that the moment-based method presented in this article yields an expression for the “inbreeding effective size” (as it is based on the rate of decay of neutral diversity or the instantaneous rate of coalescence), while coalescent-based methods (e.g., Hudson and Kaplan 1995; Nordborg 1997) yield an expression for the “coalescence effective size,” corresponding to half the average coalescence time between two sequences randomly sampled from the population. After a single change in population size, and if selection and recombination rate are sufficiently large relative to 1/Ne, the different moments computed in Appendix B should equilibrate quickly relative to the change in neutral diversity, and the inbreeding effective population size should thus rapidly converge to its equilibrium value for the new census population size. By contrast, expected coalescence times will take longer to equilibrate. Assuming that the inbreeding effective size instantaneously reaches its new equilibrium value, the expected coalescence time t generations after the change is given by

E[T]t=2Ne+(112Ne)t(E[T]02Ne), (36)

where Ne is the new inbreeding effective size and E[T]0 the average coalescence time at the time of the change in population size. To test this prediction, the simulation program was modified to include a neutral sequence with an infinite number of sites at the midpoint of the chromosome. Indeed, the expected coalescence time between two sequences is simply given by D/(2μ), where μ is the mutation rate within the sequence and D the average number of differences between two sequences, given by D=2ipiqi (where the sum is over all segregating neutral sites within the sequence). Figure 7 shows an example in which N=20,000 during the first 105 generations (starting from a monomorphic population, which is equivalent to the coalescence of all lineages at generation zero), while N=40,000 during the last 105 generations. As can be seen in Figure 7, the quasi-equilibrium argument leading to Equation 36 correctly predicts the dynamics of E[T] both during the initial phase and after the change in population size. The quasi-equilibrium argument could also possibly be used in situations where population size changes continuously over time (i.e., exponential population growth), although this was not explored here.

Figure 7.

Figure 7

Average coalescence time between two randomly sampled sequences at a neutral locus located at the midpoint of the chromosome, as a function of time. The population is monomorphic at time zero (all lineages coalesce), N=20,000 during the first 105 generations, and N=40,000 during the last 105 generations. Parameter values: α=0.9, s=0.05, h=0.3, U=0.1, R=10. Dashed curve shows prediction from Equation 36, where the inbreeding effective size is obtained from Equation 35 and assumed to reach instantaneously its equilibrium value for a given N. Solid curve shows simulation results (average over 4000 replicate simulations), where E[T] is estimated from the diversity of a neutral sequence with an infinite number of sites (see text) and mutation rate μ=104 per generation.

Discussion

We have seen how multilocus population genetics theory can be used to express the effect of a deleterious allele at mutation–selection balance on the dynamics of diversity at a linked neutral locus in terms of moments of linkage disequilibrium and other genetic associations between these two loci. This provides an alternative to methods based on computing expected coalescence times of pairs of genes present on different types of genetic backgrounds (e.g., Nordborg 1997; Agrawal and Hartfield 2016) and allows one to decompose the background selection effect into different terms for which intuitive interpretation may be given. In a panmictic population, background selection is driven by the variance in linkage disequilibrium DAB2 between the two loci: neutral alleles that become associated to the deleterious allele tend to decrease in frequency, eventually causing their loss from the population. Under partial selfing, background selection is reinforced by random associations between a given neutral allele and selected alleles present on the other haplotype of the same individual (DA,B) and associations between neutral alleles and homozygosity at the selected locus (DAB,B). Furthermore, we have seen that identity disequilibrium (correlation in heterozygosity between loci) has the opposite effect and enhances neutral diversity when the deleterious allele is partially recessive, but this effect usually stays modest.

Under tight linkage, the results converge to the approximation derived by Nordborg (1997), using a separation of timescales argument. In that case, the population behaves approximately as a panmictic population of size N/(1+F), in which the dominance coefficient of the deleterious allele and the recombination rate are replaced by the effective parameters h(1F)+F and r(1F). Similarly, Agrawal and Hartfield (2016) showed that in a population reproducing sexually at rate σ and asexually (by mitosis) at rate 1σ, the effect of a deleterious allele located at a small recombination distance r from the neutral locus takes the same form as in a panmictic (fully sexual) population, replacing r by the effective recombination rate rσ (as shown in Appendix C, the effect of partial asexuality can also be derived using the methods of the present article). However, Agrawal and Hartfield (2016) also showed that this tight linkage approximation underestimates the strength of background selection caused by loosely linked loci, which becomes important when sex is rare. We have seen that a similar result holds under partial selfing: at high selfing rates, neutral diversity may be significantly affected by loosely linked deleterious alleles, whose effect is underestimated by the tight linkage approximation. In that case, a more precise approximation is provided by Equation 34 above.

As mentioned previously (Nordborg 1997), assuming multiplicative effects of deleterious alleles at different loci on neutral diversity (Equation 35) should be less accurate under partial selfing than under random mating, since inbreeding generates different forms of genetic associations between those loci (e.g., Kamran-Disfani and Agrawal 2014; Roze 2015). Nevertheless, we have seen that Equation 35 yields accurate predictions as long as the genomic deleterious mutation rate stays moderate (U=0.1, Figure 2, Figure 3, and Figure 4), so that the average number of deleterious alleles per genome is not too large. Another important assumption of the model is that selection against deleterious alleles is sufficiently strong relative to drift, so that these alleles are maintained near their deterministic mutation–selection equilibrium frequency. In the regime where Nes1 or lower, sometimes called “interference selection” or a “weak Hill–Robertson interference” regime (e.g., McVean and Charlesworth 2000; Comeron and Kreitman 2002; Good et al. 2014), background selection models assuming Nes1 may overestimate the effect of deleterious alleles on neutral diversity by several orders of magnitude (Kaiser and Charlesworth 2008; Good et al. 2014). This may possibly be due to negative linkage disequilibria between deleterious alleles generated by the Hill–Robertson effect (Hill and Robertson 1966), reducing the variance in fitness in the population and thus increasing coalescence times. As illustrated in Figure S1, this regime may be particularly important in highly selfing populations and when the deleterious mutation rate U is high, since Ne may be sufficiently reduced to affect the efficiency of selection at an important proportion of selected sites. This is confirmed by empirical observations of higher ratios of nonsynonymous to synonymous polymorphism (πN/πS) in selfing lineages than in their outcrossing relatives (Glémin and Muyle 2014; Hartfield 2015 and references therein), indicating that a significant fraction of deleterious alleles may increase in frequency due to drift (in which case the assumption that Nes1 at most loci is not valid). It is thus important to keep in mind that background selection models such as the one presented here overestimate the reduction in Ne in such situations, and it would be interesting (although probably challenging) to obtain analytical predictions for neutral diversity in populations undergoing low rates of sex or high selfing rates and in which selection against a high proportion of deleterious mutations is rendered ineffective.

More generally, the strong reduction in effective population size of highly selfing (or asexual) populations caused by background selection may have an important influence on stochastic processes that would play a more marginal role in populations with larger Ne. For example, identity disequilibrium between selected loci in a partially selfing population generates positive linkage disequilibrium between these loci, but as shown by Kamran-Disfani and Agrawal (2014), stochastic forces generating negative linkage disequilibrium become stronger than this deterministic effect when the selfing rate is high. In a similar way, deterministic forces acting on the evolution of recombination rates (e.g., Roze and Lenormand 2005) or mutation rates (e.g., Lynch 2010) may be overwhelmed by stochastic forces under strong inbreeding. Developing analytical models that could scale these different types of effects may thus help us to better understand how mating systems affect the evolution of genetic architecture. The methods presented in this article could possibly be used to explore such questions.

Acknowledgments

I thank the bioinformatics and computing service of the Roscoff Biological Station for computing time and two anonymous reviewers for helpful comments. This work was supported by the French Agence Nationale de la Recherche (project TRANS, ANR-11-BSV7-013 and project Clonix, ANR-11-BSV7-007).

Appendices

Appendix A: Deriving Recursions for Multilocus Moments

In the following, I use the notation DS,T for genetic associations measured at the next generation, while DS,Tjuv and DS,Tpar represent associations measured among juveniles (after reproduction, before drift) and among parents after selection (that is, weighting each parent by its relative fitness). Therefore, selection changes associations DS,T to DS,Tpar, recombination and fertilization (with partial selfing) change associations DS,Tpar to DS,Tjuv, and drift changes associations DS,Tjuv to DS,T. Similarly, pi denotes allele frequencies at the next generation, while pijuv represents allele frequencies among juveniles (which are the same as among selected parents, as recombination and fertilization do not change allele frequencies). In the following I show how to derive equations representing these different steps. For this, I focus on examples rather than presenting general (and necessarily cumbersome) equations; however, general expressions are implemented in a Mathematica notebook (File S1). Throughout this article, I assume that selection is weak (s small) and that population size is sufficiently large so that 1/Ns,r; however, no assumption is done on the relative orders of magnitude of r and s. Finally, I assume that us so that the frequency of the deleterious allele at mutation–selection balance (p˜B, given by Equation 2) is small.

Drift

To illustrate the effect of drift on genetic moments, consider the expected diversity at the neutral locus, DAA. By definition, DAA=(1/2)E[(XAMpA)2+(XAPpA)2], where E is the average over all individuals of the next generation, while pA is the frequency of allele 1 at locus A among these individuals. Writing ΔdpA=pApAjuv the change in allele frequency due to drift, we have

DAA=12E[(XAMpAjuvΔdpA)2+(XAPpAjuvΔdpA)2]. (A1)

Expanding and grouping terms in ΔdpA, this is

DAA=12E[(XAMpAjuv)2+(XAPpAjuv)2]2ΔdpA12E[(XAMpAjuv)+(XAPpAjuv)]+(ΔdpA)2. (A2)

In the following, I use the notation DS,Tdft for moments measured among individuals of the next generation (after drift), but using the values of allele frequencies (called “reference values” in Kirkpatrick et al. 2002) before drift (among juveniles): in particular, DAAdft=12E[(XAMpAjuv)2+(XAPpAjuv)2], while we have DAdft=12E[(XAMpAjuv)+(XAPpAjuv)]. Noting that DAdft=ΔdpA, Equation A2 can be written as

DAA=DAAdft(DAdft)2. (A3)

Next, we can note that products of genetic associations may also be viewed as associations between genes present in two individuals, sampled with replacement from the whole population: for example, (DAdft)2 is the association between one gene at locus A from a first individual and one gene at locus A from a second individual sampled with replacement from the population at the next generation (and using allele frequencies among juveniles as reference values). Following previous works (Roze and Rousset 2008; Roze 2009; Roze and Michod 2010), I thus write such products as single associations, using the symbol / to separate sets of genes present in different individuals sampled with replacement from the population. Using this notation, (DAdft)2 is written as DA/Adft, and Equation A3 becomes

DAA=DAAdftDA/Adft. (A4)

Equation A4 corresponds to the first step of the computation of the effect of drift on genetic moments (i.e., taking into account changes in allele frequencies due to drift). The second (and last) step consists of expressing associations between genes present in different individuals sampled with replacement from the population (at the next generation) in terms of associations between genes in individuals sampled without replacement. For example, we have

DA/Adft=12N(DAAdft+DA,Adft)+(11N)DA/Adft, (A5)

where DA/Adft is the association between two genes at locus A from two individuals sampled without replacement, at the next generation (and using allele frequencies before drift as reference values). Indeed, two genes sampled with replacement from the population may be the same gene with probability 1/(2N), be the two homologous copies of the same individual with probability 1/(2N), or come from two different individuals with probability 11/N. Finally, because the different individuals of the next generation have been sampled independently from an infinite population of juveniles, associations between genes present in different individuals (and using as reference values allele frequencies among juveniles) are the same as associations measured among juveniles: in particular, DAAdft=DAAjuv, DA,Adft=DA,Ajuv, and DA/Adft=DA/Ajuv. Noting that DA/Ajuv=0 [indeed, because the population of juveniles is infinite, we have DA/Ajuv=DA/Ajuv=(DAjuv)2, while DAjuv=0 from the definition of genetic associations], we finally obtain

DAA=DAAjuv12N(DAAjuv+DA,Ajuv), (A6)

representing the effect of drift on the moment DAA. Because all results are computed to the first order in 1/N, it is sufficient to express the moments that are multiplied by 1/(2N) in Equation A6 in the limit as N tends to infinity.

The same method can be used to compute the effect of drift on other genetic moments. For example, using the same reasoning as for deriving Equation A4 above, we obtain for the expected squared linkage disequilibrium

DAB2=DAB/ABdft2DAB/A/Bdft+DA/A/B/Bdft, (A7)

while computing these moments in terms of associations among juveniles yields

DAB2=DABjuv2+12N(DAABBjuv+DAB,ABjuv2DABjuvDA,Bjuv4DABjuv2)+ο(1N). (A8)

Again, because all results are computed to the first order in 1/N, it is sufficient to compute the four moments within parentheses in Equation A8 in the limit as population size tends to infinity [we will see below that the moments DABjuvDA,Bjuv and DABjuv2 become negligible in this limit, while the moment DAB,ABjuv is generated by selfing].

Equation A8 shows that genetic associations with repeated “B” indices (such as DAABB) may appear within recursions (see also Equation A12). Because locus B is biallelic, these repeated indices can be eliminated using the relation (e.g., equation 5 in Kirkpatrick et al. 2002)

DSii=piqiDS+(12pi)DSi, (A9)

where S is any set of loci, and i is a biallelic locus. However, repeated “A” indices will not be eliminated when computing recursions, so that the equations obtained still hold for the case where more than two alleles segregate at the neutral locus.

Recombination and fertilization

Computing moments measured among juveniles in terms of moments measured among selected parents is somewhat simpler, as recombination and fertilization do not change allele frequencies: therefore, we only have to consider the different possible modes of transmission of genes between generations. In particular, DAAjuv=DAApar, while

DA,Ajuv=α2(DAApar+DA,Apar). (A10)

Indeed, the two homologous genes of a juvenile at locus A come from the same parent if this juvenile has been produced by selfing (probability α), in which case they are copies of the same parental gene with probability 1/2, while they come from the two homologous genes of the parent with probability 1/2. With probability 1α the juvenile has been produced by outcrossing, in which case its two homologous genes are sampled with replacement from the parental population: the association thus becomes DA/Apar=(DApar)2, which equals zero (since DApar=0). Similarly, we have

DABjuv2=DAB/ABjuv=(1r)2DAB/ABpar+2r(1r)DAB/A,Bpar+r2DA,B/A,Bpar, (A11)

while

DAB,ABjuv=α2[(1r)2(DAABBpar+DAB,ABpar)+2r(1r)(DAAB,Bpar+DABB,Apar)+r2(DAABBpar+DAB,ABpar)]+(1α)DAB/ABjuv, (A12)

where DAB/ABjuv is given by Equation A11.

Selection

As for the effect of drift, computing the effect of selection on genetic moments can be decomposed into two steps (see also Barton and Turelli 1991; Kirkpatrick et al. 2002). The first step consists in taking into account the change in reference values (i.e., allele frequencies that appear within associations) due to selection: denoting DS,Tsel moments measured among selected parents, but using allele frequencies before selection (pi) as reference values (instead of allele frequencies after selection pijuv), we have (following the same reasoning as for the derivation of Equations A4 and A7)

DAApar=DAAselDA/Asel, (A13)

while

DAB/ABpar=DAB/ABsel2DAB/A/Bsel+DA/A/B/Bsel. (A14)

Finally, computing associations measured after selection (using as reference values allele frequencies before selection) DS,Tsel, in terms of associations measured before selection DS,T, is done by weighting each individual by its relative fitness

DS,Tsel=E[WW¯ζS,T], (A15)

where E is the average over all individuals before selection, W is the fitness of the individual, and W¯=E[W] is the mean fitness of the population. To express the right-hand side of Equation A15 in terms of genetic associations, it is useful to write W/W¯ in terms of ζS,T variables (e.g., Barton and Turelli 1991; Kirkpatrick et al. 2002). The fitness of an individual can be written as

W=1sh(XBM+XBP)s(12h)XBMXBP, (A16)

which, after rearranging and averaging over all individuals, yields

WW¯=1Csh(ζBM+ζBP)s(12h)[ζB,B+pB(ζBM+ζBP)]1Cs(12h)DB,B (A17)

with C=2shpB+s(12h)pB2. Equation A17 thus takes the form of a polynomial of ζS,T variables. However, because terms in pB and DB,B appear in the denominator, obtaining expressions in terms of moments of genetic associations and allele frequencies requires developing W/W¯ as a Taylor series in s. To the first order in s, we have

WW¯=1sh(ζBM+ζBP)s(12h)[ζB,BDB,B+pB(ζBM+ζBP)]. (A18)

From this, we obtain, to the first order in s,

DAAsel=E[WW¯ζAA]=DAAsh(DAAB+DAA,B)s(12h)(DAAB,BDAADB,B+pBDAAB+pBDAA,B). (A19)

Given that moments pBDAAB and pBDAA,B equal zero at quasi-equilibrium to the first order in 1/N and p˜B (see File S1), Equation A19 yields Equation 12 in the main text. Associations between genes present in different individuals are obtained similarly. For example, DA/Asel equals zero to the first order in s (because DA=0), while Equation A18 yields the following expression to the second order in s:

DA/Asel=E[WW¯ζA]E[WW¯ζA]=(sh)2(DAB/AB+2DAB/A,B+DA,B/A,B)+2s2h(12h)(DAB/AB,B+DA,B/AB,B+pBDAB/AB+2pBDAB/A,B+pBDA,B/A,B)+s2(12h)2(DAB,B/AB,B+2pBDAB/AB,B+2pBDA,B/AB,B+pB2DAB/AB+2pB2DAB/A,B+pB2DA,B/A,B). (A20)

Equations A19 and A20 illustrate the fact that computing the effect of selection on a given genetic moment introduces more complicated moments, with a higher number of B indices. This may lead to an infinite system of recursions, as the effect of selection on these moments will introduce yet other moments with even more B indices. However, assuming Ns1 and su (so that the frequency of the deleterious allele at mutation–selection balance, p˜B, remains small), we may neglect moments that are proportional to p˜B2—more precisely, moments that are o(p˜B)—to obtain expressions to the first order in p˜B. As we will see, using this “rare allele approximation” yields closed systems of recursions for genetic moments.

Rare allele approximation

Deriving expressions to the first order in p˜B can be done using the following general rule. Because all genetic associations involving at least one B index are proportional to pB, moments involving two elements with a B index (where an element is either an allele frequency pi or an association DS,T among genes present in the same individual) are of order p˜B2 in the limit as population size tends to infinity. For example, pBDAB, DAB2, and DABDAB,B are all of order p˜B2 as N tends to infinity, while DB,B and DAB,AB are of order p˜B in the same limit. Now, taking the effect of drift into account when computing recursions for genetic moments generally introduces moments with one less element carrying a B index, multiplied by 1/N. This is illustrated by Equation A8 showing the effect of drift on the moment DAB2 (two elements with a B index): drift introduces two terms involving moments carrying a single element with a B index (DAABB and DAB,AB). Because these moments are multiplied by 1/N in Equation A8, it is sufficient to express them in the limit as N tends to infinity: in this limit, these terms are thus proportional to p˜B (by contrast, the terms in DAB2/N and DABDA,B/N in Equation A8 are proportional to p˜B2 and may thus be neglected). More generally, moments carrying x elements with a B index are thus of order p˜Bx in the limit as N tends to infinity and of order p˜Bx1 to the first order in 1/N (for x>1). To compute expressions to the first order in p˜B, we may thus neglect all moments that must be expressed to the first order in 1/N and that carry more than two elements with a B index (such as pBDABDAB,B) and all moments expressed in the limit as N tends to infinity carrying more than one element with a B index (such as DAB2). Using this approximation, Equation A20 simplifies to

DA/Asel=(sh)2(DAB2+2DABDA,B+DA,B2)+2s2h(12h)(DABDAB,B+DA,BDAB,B)+s2(12h)2DAB,B2. (A21)

Furthermore, we obtain for the moments DAB/ABpar, DAB/A,Bpar, and DA,B/A,Bpar (that are needed to compute a recursion for DAB2, as shown by Equation A11), to the first order in s

DAB/ABpar=(12sh)DAB22s(1h)DABDAB,B (A22)
DAB/A,Bpar=(12sh)DABDA,Bs(1h)(DABDAB,B+DA,BDAB,B) (A23)
DA,B/A,Bpar=(12sh)DA,B22s(1h)DA,BDAB,B. (A24)

Appendix B: Recursions for Two-Locus Moments

Using the method shown in Appendix A, we obtain for the change in neutral diversity during selection, to the second order in s,

ΔsDAA=sh(DAAB+DAA,B)s(12h)(DAAB,BDAADB,B)(sh)2(DAB2+2DABDA,B+DA,B2)2s2h(12h)(DABDAB,B+DA,BDAB,B)s2(12h)2DAB,B2, (B1)

while the change in neutral diversity during drift is given by Equation 9 in the main text. The same method can be used to compute recursions for the different moments that appear in Equation B1, from which solutions at quasi-equilibrium can be obtained. For example, Equations A8, A11, and A22–A24 yield the following recursion for DAB2:

DAB2=(12sh)[(1r)2DAB2+2r(1r)DABDA,B+r2DA,B2]2s(1h)[(1r)DABDAB,B+rDA,BDAB,B]+12N(DAABB+DAB,AB). (B2)

To obtain an expression to the first order in 1/N, it is sufficient to express DAABB and DAB,AB in Equation B2 in the limit when N tends to infinity. Using Equation A9, we have DAABB=pBqBDAA+(12pB)DAAB. Furthermore, DAAB equals zero at quasi-equilibrium in an infinite population (indeed, the solution obtained for DAAB from the equations below is in 1/N): therefore, DAABBp˜BDAA when population size tends to infinity. From Equation A12, a recursion for DAB,AB when N tends to infinity and s=0 is given by

DAB,AB=α2[[12r(1r)](DAABB+DAB,AB)+2r(1r)(DAAB,B+DABB,A)]. (B3)

Using DAABBp˜BDAA and DAAB,B=DABB,AFp˜BDAA yields DAB,AB=φABp˜BDAA, where φAB is given by Equation 14, and corresponds to the probability of joint coalescence of two pairs of genes sampled at loci A and B due to selfing, in an infinite population. It is possible to compute φAB to the first order in s, but the term in s is always negligible when selection is weak and is thus ignored here. Using the expressions just derived for DAABB and DAB,AB, Equation B2 becomes

DAB2=(12sh)[(1r)2DAB2+2r(1r)DABDA,B+r2DA,B2]2s(1h)[(1r)DABDAB,B+rDA,BDAB,B]+12N(1+φAB)p˜BDAA. (B4)

Similarly, we obtain the following recursions for moments DABDA,B, DA,B2, DABDAB,B, DA,BDAB,B, and DAB,B2 (which are also generated by finite population size), to the first order in s, p˜B, and 1/N (recursions to the second order in s are derived in File S1, but yield very similar quantitative results in most cases):

DABDA,B=α2[(12sh)[(1r)DAB2+DABDA,B+rDA,B2]s(1h)[(2(1r)+1)DABDAB,B+(2r+1)DA,BDAB,B]]+1NFp˜BDAA (B5)
DA,B2=(α2)2[(12sh)(DAB2+2DABDA,B+DA,B2)4s(1h)(DABDAB,B+DA,BDAB,B)]+12N(1+φAB)p˜BDAA (B6)
DABDAB,B=α2[(12sh)[(1r)2DAB2+2r(1r)DABDA,B+r2DA,B2]+[1s(3h)][(1r)DABDAB,B+rDA,BDAB,B]s(1h)DAB,B2]+12N(F+φAB)p˜BDAA (B7)
DA,BDAB,B=(α2)2[(12sh)[(1r)DAB2+DABDA,B+rDA,B2]+(12s)(DABDAB,B+DA,BDAB,B)2s(1h)[(1r)DABDAB,B+rDA,BDAB,B]2s(1h)DAB,B2]+12N(F+φAB)p˜BDAA (B8)
DAB,B2=(α2)2[(12sh)[(1r)2DAB2+2r(1r)DABDA,B+r2DA,B2]+2(12s)[(1r)DABDAB,B+rDA,BDAB,B]+[12s(2h)]DAB,B2]+12N(F+φAB)p˜BDAA. (B9)

Equations B4–B9 can be solved at quasi-equilibrium (setting = for each moment) to obtain expressions for the six moments of the form Zp˜BDAA/N, where Z is a function (that differs for each moment) of s, h, r, and α. Although these expressions are complicated, they are easily computed numerically using Mathematica (see File S2).

Recursions for the moments DAAB and DAA,B that appear on the first line of Equation 12 are given by (to the first order in s, p˜B, and 1/N)

DAAB=(1sh)[(1r)DAAB+rDAA,B]+2sh[(1r)DAB2+DABDA,B+rDA,B2]+2s(12h)[(1r)DABDAB,B+rDA,BDAB,B]s(1h)(DAAB,BDAADB,B)1NDAB,A (B10)
DAA,B=α2[(1sh)(DAAB+DAA,B)+2sh(DAB2+2DABDA,B+DA,B2)+2s(12h)(DABDAB,B+DA,BDAB,B)2s(1h)(DAAB,BDAADB,B)]1NDAB,A. (B11)

It is sufficient to express the moment DAB,A that appears in Equations B10 and B11 in the limit as N tends to infinity. To the first order in s, we have

DAB,A=α2[DAB,As(1h)(DAB,ABDA,ADB,B)], (B12)

giving at quasi-equilibrium

DAB,A=s(1h)FGABp˜BDAA (B13)

(see also Roze 2015), where GAB=φABF2 is the identity disequilibrium between loci A and B. Finally, a recursion for DAAB,BDAADB,B (second line of Equation 12) to the first order in s, p˜B, and 1/N is given by

DAAB,BDAADB,B=α2[[1s(2h)](DAAB,BDAADB,B)+(1sh)[(1r)DAAB+rDAA,B]+2sh[(1r)DAB2+DABDA,B+rDA,B2]+2s(12h)[(1r)DABDAB,B+rDA,BDAB,B]+2sh(DABDAB,B+DA,BDAB,B)+2s(12h)DAB,B2]12NGABp˜BDAA. (B14)

Equations B10, B11, and B14 can be solved to obtain expressions for DAAB, DAA,B, and DAAB,BDAADB,B at quasi-equilibrium (using the expressions for DAB2, DABDA,B, DA,B2, DABDAB,B, DA,BDAB,B, and DAB,B2 obtained from Equations B4–B9); again, Mathematica commands to obtain numerical solutions can be found in File S2. Approximations for high selfing are obtained by assuming that s and o=1α are of order ε and expressing Equations B4–B14 to the first order in ε. Similarly, tight linkage approximations are obtained by assuming that s and r are of order ε (see File S1).

Appendix C: Partial Asexuality

Background selection under partial asexuality in diploids has been explored recently by Agrawal and Hartfield (2016), using coalescence models. The method presented in Appendix A can also be used for the case of a population in which a proportion σ of offspring are produced by sexual reproduction each generation (with random gamete fusion) and a proportion 1σ by asexual reproduction (mitosis), as shown in Roze and Michod (2010). In particular, we obtain for the change in neutral diversity over one generation (to the second order in s and assuming that h is significantly different from zero)

ΔDAA=DAA2Nsh(DAAB+DAA,B)(sh)2(DAB2+2DABDA,B+DA,B2). (C1)

Recursions for the different moments that appear in Equation C1 are given by [to the first order in 1/N and p˜Bu/(sh)]

DAB2=(1sh)2[(1rσ)2DAB2+2rσ(1rσ)DABDA,B+(rσ)2DA,B2]+12Np˜BDAA (C2)
DABDA,B=(1sh)2(1σ)[(1rσ)DABDA,B+rσDA,B2] (C3)
DA,B2=(1sh)2(1σ)2DA,B2+12Np˜BDAA (C4)
DAAB=(1sh)[(1rσ)DAAB+rσDAA,B]+2sh[(1rσ)DAB2+2DABDA,B+rσDA,B2] (C5)
DAA,B=[(1σ)(1sh)DAA,B+2sh(DABDA,B+DA,B2]]. (C6)

Expressions for DAB2, DABDA,B, DA,B2, DAAB, and DAA,B at quasi-equilibrium can be obtained from Equations C2–C6 and plugged into Equation C1. When the rate of sex σ is small, this yields equations 5 and 6 in Agrawal and Hartfield (2016).

Footnotes

Communicating editor: N. A. Rosenberg

Supplemental material is available online at www.genetics.org/lookup/suppl/doi:10.1534/genetics.116.187955/-/DC1.

Literature Cited

  1. Agrawal A. F., Hartfield M., 2016.  Coalescence with background and balancing selection in systems with bi- and uniparental reproduction: contrasting partial asexuality and selfing. Genetics 202: 313–326. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barton N. H., Turelli M., 1991.  Natural and sexual selection on many loci. Genetics 127: 229–255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Brandvain Y., Slotte T., Hazzouri K. M., Wright S. I., Coop G., 2013.  Genomic identification of founding haplotypes reveals the history of the selfing species Capsella rubella. PLoS Genet. 9: e1003754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Burgarella C., Gayral P., Ballenghien M., Bernard A., David P., et al. , 2015.  Molecular evolution of freshwater snails with contrasting mating systems. Mol. Biol. Evol. 32: 2403–2416. [DOI] [PubMed] [Google Scholar]
  5. Charlesworth B., 2012.  The effects of deleterious mutations on evolution at linked sites. Genetics 190: 5–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Charlesworth B., Morgan M. T., Charlesworth D., 1993.  The effect of deleterious mutations on neutral molecular variation. Genetics 134: 1289–1303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Charlesworth D., 2003.  Effects of inbreeding on the genetic diversity of populations. Philos. Trans. R. Soc. Lond. B 358: 1051–1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Comeron J. M., Kreitman M., 2002.  Population, evolutionary and genomic consequences of interference selection. Genetics 161: 389–410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Glémin S., 2003.  How are deleterious mutations purged? Drift vs. nonrandom mating. Evolution 57: 2678–2687. [DOI] [PubMed] [Google Scholar]
  10. Glémin S., 2007.  Mating systems and the efficacy of selection at the molecular level. Genetics 177: 905–916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Glémin S., Muyle A., 2014.  Mating systems and selection efficacy: a test using chloroplastic sequence data in angiosperms. J. Evol. Biol. 27: 1386–1399. [DOI] [PubMed] [Google Scholar]
  12. Glémin S., Ronfort J., 2012.  Adaptation and maladaptation in selfing in outcrossing species: new mutations vs. standing variation. Evolution 67: 225–240. [DOI] [PubMed] [Google Scholar]
  13. Glémin S., Bazin E., Charlesworth D., 2006.  Impact of mating systems on patterns of sequence polymorphism on flowering plants. Proc. Biol. Sci. 273: 3011–3019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Goldberg E. E., Kohn J. R., Lande R., Robertson K. A., Smith S. A., et al. , 2010.  Species selection maintains self-incompatibility. Science 330: 493–495. [DOI] [PubMed] [Google Scholar]
  15. Good B. H., Walczak A. M., Neher R. A., Desai M. M., 2014.  Genetic diversity in the interference selection limit. PLoS Genet. 10: e1004222. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Goodwillie C., Kalisz S., Eckert C. G., 2005.  The evolutionary enigma of mixed mating systems in plants: occurrence, theoretical explanations, and empirical evidence. Annu. Rev. Ecol. Evol. Syst. 36: 47–79. [Google Scholar]
  17. Haldane J. B. S., 1919.  The combination of linkage values and the calculation of distances between the loci of linked factors. J. Genet. 8: 299–309. [Google Scholar]
  18. Hartfield M., 2015.  Evolutionary genetic consequences of facultative sex and outcrossing. J. Evol. Biol. 29: 5–22. [DOI] [PubMed] [Google Scholar]
  19. Hill W. G., Robertson A., 1966.  The effect of linkage on limits to artificial selection. Genet. Res. 8: 269–294. [PubMed] [Google Scholar]
  20. Hudson R. R., Kaplan N. L., 1995.  Deleterious background selection with recombination. Genetics 141: 1605–1617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Igic B., Busch J. W., 2013.  Is self-fertilization an evolutionary dead end? New Phytol. 198: 386–397. [DOI] [PubMed] [Google Scholar]
  22. Jarne P., Auld J. R., 2006.  Animals mix it up too: the distribution of self-fertilization among hermaphroditic animals. Evolution 60: 1816–1824. [DOI] [PubMed] [Google Scholar]
  23. Kaiser V. B., Charlesworth B., 2008.  The effects of deleterious mutations on evolution in non-recombining genomes. Trends Genet. 25: 9–12. [DOI] [PubMed] [Google Scholar]
  24. Kamran-Disfani A., Agrawal A. F., 2014.  Selfing, adaptation and background selection in finite populations. J. Evol. Biol. 27: 1360–1371. [DOI] [PubMed] [Google Scholar]
  25. Kirkpatrick M., Johnson T., Barton N. H., 2002.  General models of multilocus evolution. Genetics 161: 1727–1750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lynch M., 2010.  Evolution of the mutation rate. Trends Genet. 26: 345–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lynch M., Conery J., Bürger R., 1995.  Mutational meltdowns in sexual populations. Evolution 49: 1067–1080. [DOI] [PubMed] [Google Scholar]
  28. McVean G. A., Charlesworth B., 2000.  The effects of Hill-Robertson interference between weakly selected mutations on patterns of molecular evolution and variation. Genetics 155: 929–944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Nordborg M., 1997.  Structured coalescent processes on different time scales. Genetics 146: 1501–1514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Nordborg M., 2000.  Linkage disequilibrium, gene trees and selfing: an ancestral recombination graph with partial self-fertilization. Genetics 154: 923–929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Nordborg M., Charlesworth B., Charlesworth D., 1996.  The effect of recombination on background selection. Genet. Res. 67: 159–174. [DOI] [PubMed] [Google Scholar]
  32. Nordborg M., Borevitz J. O., Bergelson J., Berry C. C., Chory J., et al. , 2002.  The extent of linkage disequilibrium in Arabidopsis thaliana. Nat. Genet. 30: 190–193. [DOI] [PubMed] [Google Scholar]
  33. Padhukasahasram B., Marjoram P., Wall J. D., Bustamante C. D., Nordborg M., 2008.  Exploring population genetics models with recombination using efficient forward-time simulations. Genetics 178: 2417–2427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Pollak E., 1987.  On the theory of partially inbreeding finite populations. I. Partial selfing. Genetics 117: 353–360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Robertson A., 1961.  Inbreeding in artificial selection programmes. Genet. Res. 2: 189–194. [DOI] [PubMed] [Google Scholar]
  36. Roze D., 2009.  Diploidy, population structure and the evolution of recombination. Am. Nat. 174: S79–S94. [DOI] [PubMed] [Google Scholar]
  37. Roze D., 2015.  Effects of interference between selected loci on the mutation load, inbreeding depression and heterosis. Genetics 201: 745–757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Roze D., Rousset F., 2008.  Multilocus models in the infinite island model of population structure. Theor. Popul. Biol. 73: 529–542. [DOI] [PubMed] [Google Scholar]
  39. Roze D., Michod R. E., 2010.  Deleterious mutations and selection for sex in finite, diploid populations. Genetics 184: 1095–1112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Roze D., Lenormand T., 2005.  Self-fertilization and the evolution of recombination. Genetics 170: 841–857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Santiago E., Caballero A., 1998.  Effective size and polymorphism of linked neutral loci in populations under selection. Genetics 149: 2105–2117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Schultz S. T., Lynch M., 1997.  Mutation and extinction: the role of variable mutational effects, synergistic epistasis, beneficial mutations and degree of outcrossing. Evolution 51: 1363–1371. [DOI] [PubMed] [Google Scholar]
  43. Stebbins G. L., 1957.  Self fertilization and population variability in higher plants. Am. Nat. 91: 337–354. [Google Scholar]
  44. Takebayashi N., Morrell P. L., 2001.  Is self-fertilization an evolutionary dead end? Revisiting an old hypothesis with genetic theories and a macroevolutionary approach. Am. J. Bot. 88: 1143–1150. [PubMed] [Google Scholar]
  45. Weir B. S., Cockerham C. C., 1969.  Group inbreeding with two linked loci. Genetics 63: 711–742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Williams G. C., 1992.  Natural Selection: Domains, Levels, and Challenges. Oxford University Press, New York. [Google Scholar]
  47. Wright S. I., Kalisz S., Slotte T., 2013.  Evolutionary consequences of self-fertilization in plants. Proc. Biol. Sci. 280: 20130133. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The author states that all data necessary for confirming the conclusions presented in the article are represented fully within the article. The Dryad DOI is doi:10.5061/dryad.p3r01.


Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES