Soft Sweeps: Molecular Population Genetics of Adaptation From Standing Genetic Variation

Joachim Hermisson; Pleuni S Pennings

doi:10.1534/genetics.104.036947

. 2005 Apr;169(4):2335–2352. doi: 10.1534/genetics.104.036947

Soft Sweeps

Molecular Population Genetics of Adaptation From Standing Genetic Variation

Joachim Hermisson ^a,^b,¹, Pleuni S Pennings ^a,^b

PMCID: PMC1449620 PMID: 15716498

Abstract

A population can adapt to a rapid environmental change or habitat expansion in two ways. It may adapt either through new beneficial mutations that subsequently sweep through the population or by using alleles from the standing genetic variation. We use diffusion theory to calculate the probabilities for selective adaptations and find a large increase in the fixation probability for weak substitutions, if alleles originate from the standing genetic variation. We then determine the parameter regions where each scenario—standing variation vs. new mutations—is more likely. Adaptations from the standing genetic variation are favored if either the selective advantage is weak or the selection coefficient and the mutation rate are both high. Finally, we analyze the probability of “soft sweeps,” where multiple copies of the selected allele contribute to a substitution, and discuss the consequences for the footprint of selection on linked neutral variation. We find that soft sweeps with weaker selective footprints are likely under both scenarios if the mutation rate and/or the selection coefficient is high.

EVOLUTIONARY biologists envisage the adaptive process following a rapid environmental change or the colonization of a new niche in two contrasting ways. On the one hand, it is well known from breeding experiments and artificial selection that most quantitative traits respond quickly and strongly to artificial selection (see, e.g., Falconer and Mackay 1996). In these experiments, there is almost no time for new mutations to occur. Evolutionists who work with phenotypes therefore tend to hold the view that also in natural processes a large part of the adaptive material is not new, but already contained in the population. In other words, it is taken from the standing genetic variation. Consequently, standard predictors of evolvability, such as the heritability, the coefficient of additive variation, or the G matrix, are derived from the additive genetic variance of a trait; cf., e.g., Lande and Arnold (1983), Houle (1992), Lynch and Walsh (1998), and Hansen et al. (2003); see Steppan et al. (2002) for review. On the other hand, in the molecular literature on the adaptive process and on selective sweeps adaptation from a single new mutation is clearly the ruling paradigm (e.g., Maynard Smith and Haigh 1974; Kaplan et al. 1989; Barton 1998; Kim and Stephan 2002). In conspicuous neglect of the quantitative genetic view, the standing genetic variation as a source for adaptive substitutions is generally ignored, with only few recent exceptions (Orr and Betancourt 2001; Innan and Kim 2004).

The difference that is expressed in these two views could have important evolutionary consequences. If adaptations start out as new mutations the rate of the adaptive process is limited by the rates and effects of beneficial mutations. In contrast, if a large part of adaptive substitutions derives from standing genetic variation, the adaptive course is modulated by the quality and amount of the available genetic variation. Because this variation is shaped by previous selection, the future course of evolution will depend not only on current selection pressures, but also on the history of selection pressures and environmental conditions that the population has encountered. Clearly, quite different sets of parameters could be important under the two scenarios if we want to estimate past and future rates of evolution. To assess which alternative is more prevalent in nature, population genetic theory can be informative in two ways. First, it allows us to determine the probabilities for selective adaptations in both scenarios. Second, theory can be used to predict whether and how these different modes of adaptation can be detected from population data. In this article, we address these issues in a model of a single locus.

We study the fixation process of an allele that is beneficial after an environmental change, but neutral or deleterious under the previous conditions. The population may experience a bottleneck following the shift of the environment. Assuming that the allele initially segregates in the population at an equilibrium of mutation, selection, and drift, we calculate the probability that it spreads to fixation after positive selection begins. We compare this probability with the fixation rate of the same allele, given that it appears after the environmental change only as a new mutation. This allows us to determine the parameter space, in terms of mutation rates, selection coefficients, and the demographic structure, where a substitution that is observed some time after an environmental change is most likely from the standing genetic variation. We also analyze how the distribution of the effects of adaptive substitutions changes if the standing genetic variation is a source of adaptive material. Our main finding is that adaptations with a small effect in this case are much more frequent than predicted in a model that considers only adaptations from new mutations.

We then ask whether adaptations from standing genetic variation can be detected from the sweep pattern on linked neutral variation. If a selective sweep originates from a single new mutation, all ancestral neutral variation that is tightly linked to the selected allele will be eliminated by hitchhiking. We call this scenario a hard sweep in contrast to a soft sweep where more than a single copy of the allele contributes to an adaptive substitution. The latter may occur if the selected allele is taken from the standing genetic variation, where more than one copy is available at the start of the selective phase, or if new beneficial alleles occur during the spread to fixation. With a soft sweep, part of the linked neutral variation is retained in the population even close to the locus of selection. We calculate the probability for soft sweeps under both scenarios of the adaptive process and discuss the impact on the sweep pattern. We find that soft sweeps are likely for alleles with a high fixation probability from the standing variation, in particular for alleles that are under strong positive selection. Already for moderately high mutation rates, however, fixation of multiple independent copies is also likely if the selected allele enters the population only as a recurrent new mutation. We therefore predict that unusual sweep patterns compatible with soft sweeps may be frequent under biologically realistic conditions, but they cannot be used as a clear indicator of adaptation from standing genetic variation.

MODEL AND METHODS

Assume that a diploid population of effective size N_e experiences a rapid environmental shift at some time T that changes the selection regime at a given locus. We consider two alleles (or classes of physiologically equivalent alleles) at this locus, a and A. a is the ancestral “wild-type” allele and A is derived, in the sense that the population was never fixed for A prior to T. A is favorable in the new environment with homozygous fitness advantage s_b. The dominance coefficient is h; i.e., the heterozygous fitness is 1 + hs_b. Assuming that the population was well adapted in the old environment, A was either effectively neutral or deleterious before T, with selection coefficient s_d measuring its homozygous disadvantage and dominance coefficient h′. A is generated from a by recurrent mutations at rate u. In the following, it is convenient to work with scaled variables for selection and mutation, defined as α_b = 2N_es_b, α_d = 2N_es_d, and Θ_u = 4N_eu. We initially assume that the population size N_e stays constant over the time period under consideration, but relax this condition later. We restrict our analysis to a single adaptive substitution, which is studied in isolation. This assumption means that different adaptive events do not interfere with each other due to either physical linkage or epistasis.

Simulations:

We check all our analytical approximations by full-forward computer simulations. For this, a Wright-Fisher model with 2N_e haploid individuals is simulated. Every generation is generated by binomial or multinomial sampling, where the probability of choosing each type is weighted by its respective fitness. No dominance is assumed (h = h′ = 0.5) and 2N_e is 50,000. Data points are averaged over at least 12,000 runs for Θ_u = 0.4 and all data points in Figure 6, 20,000 runs for Θ_u = 0.04, and 40,000 runs for Θ_u = 0.004.

Figure 6.— — The probability that multiple copies with independent origin contribute to a substitution, P_ind. Lines correspond to Equation 20; symbols represent simulation data. Circles represent fixations from the standing genetic variation without new mutational input after time T; squares include new mutations. Triangles represent fixations from recurrent new mutations only.

Each simulation is started 6N_e = 150,000 generations before time T to let the population reach mutation-selection-drift equilibrium. Longer initial times did not change the results in trial runs. At the start, the population consists of only ancestral alleles “0”; the derived allele “1” is created by mutation. Whenever the derived allele reaches fixation by drift, it is itself declared “ancestral”; i.e., the population is set back to the initial state.

After 6N_e generations, the selection coefficient of the derived allele changes from neutral or deleterious (s_d) to beneficial (s_b). Mutations now convert ancestral alleles into new derived alleles (using a different symbol, “2”) with the same selection coefficient s_b. Simulations continue until eventual loss or fixation of the ancestral allele, where new mutational input is stopped G = 0.1N_e = 2500 generations after the environmental change. Each run has four possible outcomes: Fixation of 0, 1, or 2 or of 1 and 2 together.

Bottleneck:

In the bottleneck scenario, the population is reduced to 1% at time T (N_T = 250). After time T, the population is allowed to recover logistically following N_t₊₁ = N_t + rN_t(1 − N_t/K), where r = 5.092 × 10⁻² and the carrying capacity is K = 2546. This results in an average population size of N_av = 2500 (10% of the original size) after the environmental change until new mutational input is stopped at G = 0.1N_e generations. For Θ_u = 0.004 only realizations with >10 fixation events in 40,000 runs are included in the numbers.

Number of (independent) copies:

To determine the number of independent copies that contribute to a fixation, each mutation is given a different name and followed separately. Runs are done with and without new mutational input after the environmental shift and continued until fixation of the selected allele or all copies from the standing variation are lost. Additionally, also runs with only new mutations are done. When fixation of the selected allele occurs, we count the number of descendants from different origins in the population. A similar procedure is followed to determine the number of copies from the standing variation that contribute to a substitution. For this, all copies of the selected allele that are present at the time of the environmental change are given a different name. In the case of fixation, the number of different copies in the population is counted. Only realizations with >10 fixations are included in the numbers.

RESULTS

Fixation probability from the standing genetic variation:

The fixation probability of an allele A with selective advantage s_b that segregates in a population at frequency x is given by Kimura's diffusion approximation result:

(Kimura 1957). In the following, we assume that selection on the heterozygote is sufficiently strong (formally, we need 2hα_b ≫ (1 − 2h)/2h). We can then ignore the term proportional to y² in Equation 1 and Π_x is approximately

If A enters the population as a single new copy, x = 1/2N_e, and if 2N_e ≫ 2hα_b ≫ 1, we recover Haldane's classic result that the fixation probability is twice the heterozygote advantage, Π_{1/2N_e} ≈ 2hs_b (Haldane 1927). This relation underlines the importance of genetic drift: It is not sufficient for an advantageous allele to arrive in a population, it also needs to escape stochastic loss. Due to the strong linear dependence of the fixation probability on the selection coefficient, alleles with a small beneficial effect are less likely to escape such loss. The fixation process thus acts like a stochastic sieve that favors adaptations with large effects. This was stressed in particular by Kimura (1983). According to Equation 2, an approximately linear dependence of Π_x on hα_b holds more generally as long as either the initial frequency x or the heterozygote advantage hα_b is small, such that 2hα_bx < 1.

Let us now compare this view of the fixation process with the alternative scenario of adaptation from the standing genetic variation. In the most simple case, the allele A again originates from a single mutation, but before the environmental change, and already segregates in the population under neutrality when positive selection sets in. Standard results (e.g., Ewens 2004) show that under these conditions the probability for an allele to segregate at a given frequency is proportional to the inverse of the frequency, Inline graphic where x_k = k/2 N_e and . The average fixation probability is then . We derive an exact result for Π_seg in terms of a hypergeometric function in the appendix; for 2N_e ≫ 2hα_b ≫ 1 we obtain the approximation

We can make two interesting observations from this result. First, as is seen in Figure 1, there is a large increase in the (average) fixation probability if an allele does not arise as a single new copy, but already segregates in the population. This increase is particularly large for small adaptations, which points to the second observation: For alleles from the standing genetic variation, the fixation probability depends only weakly (logarithmically) on the selection coefficient. Indeed, Π_seg, unlike Π_x, does not show a linear dependence on hα_b even if hα_b is very small. The reason is that, conditioned on later fixation, the average frequency of the allele at the time of the environmental change, Inline graphic _k, increases with decreasing hα_b, such that 2hα_b_k > 1 for all hα_b [a simple calculation in the appendix reveals that _k ≈ 1/ln2hα_b]. The usual linear approximation of Π_x is therefore never appropriate.

Consider, now, an allele A that segregates in the population at an equilibrium of mutation, (negative) selection, and drift when the environment changes at time T. For t > T, positive selection sets in. We are interested in the net probability P_sgv that the allele is available in the population at time T and subsequently goes to fixation. In the continuum limit for the allele frequencies, P_sgv is given by the integral

where Π_x is the fixation probability (Equation 2) and ρ(x) is the density function for the frequency of a derived allele in mutation-selection-drift balance. Approximations for ρ(x) can be obtained from standard diffusion theory; all derivations are given in the appendix. In the neutral case (α_d = 0) the distribution of derived alleles is approximately

For a previously deleterious allele and 2h′α_d ≫ (1 − 2h′)/2h′, we obtain

C₀ and C_α are normalization constants. ρ(x) includes a probability Pr₀ that A is not present in the population at time T. For Θ_u < 1, this probability is approximately

For the probability that the population successfully adapts from the standing variation we derive the simple approximation

where R_α := 2hα_b/(2h′α_d + 1) is the relative selective advantage. R_α measures the selective advantage of A in the new environment relative to the forces that cause allele frequency changes in the ancestral environment, deleterious selection and drift (represented by the 1). We refer to R_α < 1 and R_α > 1 as cases of small and large relative advantage, respectively. If the allele A is completely recessive in the old environment (h′ = 0), similar approximations hold here and below if 2h′α_d + 1 in R_α is formally replaced by Inline graphic + 1 (see again the appendix for details). To relate Equation 8 to Equation 3, we need to calculate the fixation probability for a segregating allele that is derived from a single mutation prior to the environmental change. This probability is obtained from (8) and (7) by conditioning on segregation of the allele in the limit Θ_u → 0. We find

For α_d = 0 and hα_b ≫ 1 this reduces to Equation 3.

All further results of our study depend on Equation 8. Computer simulations show that this simple analytical expression is quite accurate over a large parameter range (assuming Θ_u < 1 and hα_b, h′α_d ≪ 2N_e; see Figure 2). Slightly better approximations (which coincide with 95% confidence intervals of all our simulation runs) can be obtained by numerical integration of Equation 4, using the allele frequency distributions Equations 5 and 6. It is instructive to compare the stochastic result, Equation 8, with the deterministic approximation used by Orr and Betancourt (2001). If we set x ≡ Θ_u/2h′α_d in Equation 2 (the equilibrium value at mutation-selection balance), the fixation probability from the standing variation becomes

Equation 8 reduces to Equation 10 if and only if there is relatively strong past deleterious selection such that R_α ≪ 1. In this limit, the initial frequency of the selected allele is sufficiently reduced that the fixation probability Π_x (Equation 2) is approximately linear in x over the range of ρ(x), Π_x ≈ 2hα_bx. In the integral (4) then only the average allele frequency Inline graphic enters, which (almost) coincides with the deterministic approximation. For R_α ≥ 1, the distribution ρ(x) feels the concavity of Π_x and the true value of P_sgv drops below the deterministic estimate. This is captured by Equation 8; see Figure 2. For R_α ≤ 1 the fixation probability does not approach the “deterministic” approximation even if N_e and thus α_d, α_b, and Θ_u get large. The reason is that it is the variance of 2hα_bx that matters, which does not go to zero even if the variance of the allele frequency Var[x] → 0 for large Θ_u and α_d.

Figure 2.— — The probability of fixation from mutation-selection-drift balance, P_sgv, for a range of mutation and selection parameters. Solid lines show approximation Equation 8 and dotted lines show the deterministic approximation Equation 10. Solid circles are simulation results. Ninety-five percent confidence intervals are contained in the circles.

Equations 8 and 9 confirm a weak dependence of the fixation probability on α_b. For fixed α_d, the fixation probability depends logarithmically on α_b (and on R_α) as long as R_α > 1. In the “deterministic limit” R_α ≪ 1, this dependence goes back to linear. However, this is true only if α_b varies independently of α_d. If stronger selected alleles have larger trade-offs, i.e., α_b and α_d are positively correlated, R_α and thus P_sgv and Π_seg will increase less than linearly with α_b even if R_α ≪ 1. Using the deterministic aproximation, Orr and Betancourt (2001) previously found that the dominance coefficient drops out of P_sgv if dominance does not change upon the environmental shift, h = h′. The stochastic result Equation 8 confirms this finding and extends it beyond the limits of validity of the deterministic approximation as long as hα_b and h′α_d are both large.

Standing variation vs. new mutations:

We want to compare the fixation probability from the standing variation with the probability that an adaptive substitution occurs from new mutation. The probability for a new allele to occur in the population that is destined for fixation is ∼p_new = 2N_eu2hs_b per generation. Using a Poisson approximation, the probability that such a mutation arrives within G generations is

where G is measured in units of 2N_e. We can now determine the number of generations G_sgv that it takes for P_new(G_sgv) = P_sgv. This value serves as a measure of the relative adaptive potential of the standing variation. Using Equation 8 we obtain

This value is independent of Θ_u and depends only on the selection parameters of the allele. One can relate G_sgv to the average fixation time t_fix of an allele with selective advantage hα_b. In the appendix, we derive t_fix in units of 2N_e,

The approximation is very accurate for h = 0.5 and hα_b ≳ 2. For h ≠ 0.5 it defines a lower bound. We see that G_sgv < t_fix for arbitrary R_α. This holds even if we account for the fact that the average fixation time from the standing variation may be shorter (but ≥ t_fix/2), since the allele starts at a higher frequency. This result means that in a time span that an allele from the standing variation needs to reach fixation, it is at least as likely that the allele alternatively appears as a new mutation destined for fixation only after the environmental change.

Next, we consider the case that a derived beneficial mutation A is found in a population some time after the environmental change. There are three possibilities: A derives from the standing genetic variation at time T, or from new mutation(s) that occurred after the environmental change, or both. Computer simulations that include new mutations after time T show that hybrid fixations that use material from both sources are quite frequent for high Θ_u, but also that the contribution of the standing variation generally dominates in this case (for Θ_u = 0.4 on average 67–97%, depending on α_b and α_d). In the following, we combine hybrid fixations with fixations that use only alleles from the standing variation and define P_sgv more broadly as the probability that an adaptive substitution uses material from the standing genetic variation. With this definition, simulation results are closely matched by the theoretical prediction in Equation 8.

We can now ask for the probability that a derived allele A, which is found in the population some time G after T, and either fixed or destined to go to fixation at this time, originated (at least partially) from alleles in the standing genetic variation. Measuring G in units of 2N_e generations, this probability may be expressed as Pr_sgv = P_sgv/(P_sgv + (1 − P_sgv)P_new). With Equation 8,

In Figure 3, this is shown for G = 0.05, i.e., for a time of 0.1N_e generations after the environmental change. This time should be sufficiently long for significant adaptive change, but still short enough for a selective sweep to be detected in DNA sequence data (Kim and Stephan 2000; Przeworski 2002). For Drosophila melanogaster, 0.1N_e generations approximately correspond to the time since it expanded its range out of Africa into Europe after the last glaciation (i.e., ∼10,000–15,000 years ago).

There are two advantages of the standing variation over adaptations purely from new mutations. First, the standing genetic variation may already contain multiple copies of the later-beneficial allele, reducing the probability of a stochastic loss relative to a single copy. This advantage is measured in the relative adaptive potential G_sgv above. A second, independent advantage is that alleles from the standing variation are immediately available and may outcompete new mutations due to this head start. Consequently, we see that substitutions from the standing variation dominate in two parameter regions. First, they dominate for small hα_b as long as selection before the environmental change was also weak because P_sgv > P_new in this range. (P_sgv > P_new for hα_b < ln[1 + R_α]/G; for small hα_b this needs h′α_d < 1/G, i.e., α_d < 40 for h′ = 0.5 and G = 0.1N_e.) The second parameter region is if hα_b and the mutation rate Θ_u are both high. In this case, the crucial advantage of the alleles from the standing genetic variation is their immediate availability: The probability for fixation from the standing variation is already sufficiently high that there is no need to wait for a new mutation to occur.

For practical application of this result, remember that Pr_sgv does not count only alleles that are fixed at time T + G, but also alleles that are destined to go to fixation. Consequently, simulations in Figure 3 are continued until loss or fixation of the allele even beyond T + G. This makes almost no difference as long as the average fixation time t_fix of an allele is much smaller than G. However, if t_fix ≥ G, Equation 14 can no longer be used to predict full substitutions. For G = 0.1N_e, t_fix > G if hα_b ≲ 275. If we count only substitutions that are completed at time T + G, P_new is more strongly reduced than P_sgv. For alleles with t_fix ≈ G, predominance of the standing genetic variation is larger than that predicted by Equation 14 (confirmed by simulations, results not shown). For alleles with t_fix ≫ G practically all substitutions that are completed at time T + G contain material from the standing variation; however, there are then only very few fixations at all.

Population bottlenecks:

So far, we have assumed that the effective population size before, during, and after the environmental change is constant. For many evolutionary scenarios, however, it may be more realistic to assume that the shift of the environmental conditions is accompanied by a population bottleneck. Examples include colonization events and human domestication, but also the (temporary) reduction of the carrying capacity of a maladapted population in a changed environment.

Suppose that a population of ancestral size N₀ goes through a bottleneck directly after the environmental change and recovers afterward until it reaches its carrying capacity in the new environment. We want to know how these demographic events change the probability Pr_sgv that a substitution is derived from the standing genetic variation. We expect two factors to play a role. On the one hand, a deep and long-lasting bottleneck may significantly reduce the standing variation and the potential of the population to adapt from it. On the other hand, a slow or incomplete recovery reduces the opportunity for new mutations to arrive in the population and thus the probability of adaptation from new mutations.

It is therefore instructive to distinguish two elements of a bottleneck, population size reduction and subsequent recovery, and discuss their effects separately. The simplest case is a pure reduction of N₀ by a factor B > 1 at time T, with no recovery. For matters of comparison, we continue to use the ancestral population size N₀ in the definitions of Θ_u, α_b, α_d, and G. In our formulas for the fixation probabilities from new or standing variation (Equations 8, 11, and 14) population size reduction is then simply included by a rescaling of the selection parameter α_b to α_b/B. (For adaptations from the standing genetic variation note that a sampling step to generate a bottleneck does not change the frequency distribution of the later-beneficial allele, leaving α_b in Equation 2 the only parameter subject to change. For adaptation from new mutations the rescaling argument follows if we express the probability for a new mutation destined for fixation per generation as p_new = (2N_e/B)u2hs_b = 2uhα_b/B.) Consequently, the graphs in Figure 3 are simply shifted to the right. A pure reduction of the population size at time T thus reduces the relative advantage of the standing genetic variation for strongly selected alleles with a large mutation rate, but enhances its advantage for weakly selected alleles. Note that the adaptive potential G_sgv increases by a factor of B relative to t_fix and can now be much larger than the fixation time.

Relative to a simple reduction in population size, recovery increases the adaptation probability from the standing variation, P_sgv, and from new mutations, P_new, in different ways. First, recovery increases P_new (but not P_sgv) simply due to the fact that the opportunity for new mutations increases with increasing population size. Second, the fixation probability of beneficial alleles is increased due to population growth. For further progress, we use results on the fixation probability in populations of changing size by Otto and Whitlock (1997). We assume that the population experiences logistic growth according to dN/dt = λ(1 − N/K)N after an initial reduction to N_T. Here, λ is the intrinsic growth rate (for t in units of 2N₀), and K the carrying capacity. There are two things to note. First, the effect of recovery on the fixation probability is significant only if it is sufficiently fast on a scale set by the selection strength. For logistic recovery, this is the case if λ ≳ hα_b. Second, the increase of the fixation probability due to recovery is much more important for P_sgv than for P_new. The reason is that only alleles that are already present during the bottleneck will be affected. While this is the case for all alleles from the standing variation that survive population size reduction, only relatively few new mutations will occur in the small bottleneck population (at least if recovery is sufficiently fast to matter). More formally, one can show that the increase in the fixation probability due to recovery can be neglected in P_new if λG ≫ 1. This leaves only a very restricted parameter space of hα_b ≲ λ ≲ 1/G, where an increase in fixation probability plays a role for P_new (confirmed by simulations, not shown).

In the following, we concentrate on fast recovery on a scale of G, i.e., λ ≫ 1/G (results for slow recovery are intermediate between fast and no recovery). As a measure for the opportunity for new beneficial mutations to arrive in the population, let N_av be the average population size from time T to time T + G, where the substitutions are censused. We then define a bottleneck parameter for new mutations B_new := N₀/N_av and rescale α_b to α_b/B_new in P_new (Equation 11). For fixations from the standing genetic variation, we define the bottleneck strength as B_sgv(hα_b) = N₀/N_fix(hα_b) and rescale the relative selection strength R_α → R_α/B_sgv in Equations 8 and 14. Here, N_fix is an average “fixation effective population size” that is felt by a beneficial allele on its way to fixation or loss. Since the sojourn time of a strongly selected allele is shorter than that of a weakly selected allele, N_fix and B_sgv depend on the selection coefficient of the allele. For logistic growth, Equation 19 in Otto and Whitlock (1997) leads to

Figure 4 shows the percentage of fixations from the standing variation for a bottleneck with N_T = N₀/100 and logistic recovery with ∼5% initial growth per generation and carrying capacity K = 2546. More precisely, we choose λ = 0.05092 · 2N₀ = 2546 for the growth rate per 2N₀ = 50,000 generations, such that the average size after the environmental change until 0.1N₀ generations (i.e., G = 0.05) is N_av = N₀/10 = 2500.

From Equation 15 and Figure 4, we can distinguish three parameter regions for the effect of a bottleneck. First, for hα_b > λ, the fixation probability of individual alleles is not substantially increased by population growth as compared to the case without recovery. However, population growth increases the opportunity for new mutations and thus B_new < B_sgv. For large Θ_u, there is nevertheless almost no change in Pr_sgv relative to no recovery. The reason is that fixation is then almost certain, with P_new ≈ 1 and thus Pr_sgv ≈ P_sgv (see the definition of Pr_sgv above Equation 14). Second, for very small selection coefficients, hα_b < λN_T/K, all alleles feel the new carrying capacity K as their fixation effective population size. If λ ≫ 1/G, the bottleneck then acts like a single change in the population size from N₀ to K. Finally, for intermediate selection coefficients, P_new generally profits more from the recovery than P_sgv, leading to a reduction in Pr_sgv if compared to no recovery.

Compared with the results of the previous section, we can summarize the effect of a bottleneck as follows. There is a tendency to further increase the predominance of the standing variation for weakly selected alleles and to decrease its advantage for high hα_b and Θ_u. However, unless the bottleneck is very strong, there is no qualitative change in the overall pattern.

Footprints of soft sweeps:

Since adaptations from the standing genetic variation start out with a higher copy number of the selected allele, more than one of these copies may escape stochastic loss and eventually contribute to fixation. Depending on whether one or multiple copies are involved in the substitution, one may expect differences in the footprint of the adaptation on linked neutral variation. To derive the probability that n copies of the allele A that segregate in the population at time T contribute to its fixation, we follow Orr and Betancourt (2001) and assume that individual copies enjoy an independent probability to escape stochastic loss. We may then apply a Poisson approximation. If the frequency of A at the time of the environmental change is x, the probability that k = n copies survive and contribute to fixation is approximately

This approximation is consistent with Equation 3 if 2hα_b ≫ 1. The probability that more than one copy contributes to the substitution (i.e., the probability for a “soft sweep”) is then Pr(k > 1; x) = 1 − (1 + 2hα_bx)exp[−2hα_bx]. Averaging over the allele frequency distribution at time T, ρ(x), and conditioning on the case that fixation did occur, we obtain the probability for a soft sweep for adaptations from the standing genetic variation,

Using the approximation Equations 5 and 6 for the allele distribution and Equation 8 for P_sgv, this gives

which reduces to P_mult ≈ 1 − R_α/((1 + R_α)ln[1 + R_α]) in the limit Θ_u → 0. This limit is essentially reached for Θ_u ≲ 0.004. We can again compare the stochastic result with the deterministic approximation that is obtained from Equation 17 assuming x ≡ Θ_u/2h′α_d,

Both approximations, Equations 18 and 19, are compared to simulation data in Figure 5. The deterministic approximation reproduces the stochastic result only for very large mutation rates, Θ_u ≫ 1, outside the parameter space in the figure. For low mutation rates, where Equation 19 predicts a zero limit for Θ_u → 0 it severely underestimates P_mult. The stochastic approximation produces a reasonable fit unless h′α_d and hα_b are both small. In this parameter range with relatively high initial allele frequency and weak positive selection, the Poisson approximation is no longer valid.

To estimate the impact of a soft sweep on linked neutral variation we are also interested in the number of independent copies that contribute to the fixation of the allele, i.e., copies that are not identical by descent. Concentrating on copies that segregate in the population at the time T of the environmental change, we can again use a Poisson approximation, P̃r(k = n) = exp(−λ)λⁿ/n!. With this conjecture, 1 − exp(−λ) is the fixation probability from the standing genetic variation. Equating with P_sgv as given in Equation 8, we obtain λ = Θ_u ln[1 + R_α]. The probability of fixation of multiple independent copies, conditioned on the cases where fixation occurs then is

Alternatively, we obtain Equation 20 from Equation 18 using the relation 1 − P_mult(Θ_u) = (1 − P_ind(Θ_u))(1 − P_mult(Θ_u = 0)). This equation expresses the probability for fixation of a single copy (“no multiple fixation given fixation”) as the probability of fixation from a single origin times the probability of fixation of a single copy given that all successful copies are from a single origin (a single origin is enforced in P_mult by Θ_u → 0). This alternative derivation shows that Equations 18 and 20 follow from the same assumption: independent fixation probability for different copies. To the order of our approximation, P_mult and P_ind depend on selection only through the relative selective advantage R_α = 2hs_b/(2h′s_d + 1/(2N_e)). This parameter combines two effects. The denominator of R_α takes into account that multiple fixations are less likely if the initial frequency of the allele at time T is low. This frequency decreases with deleterious selection h′s_d and drift, represented by the 1/2N_e term. Second, the numerator of R_α accounts for the fixation probability of the allele: The probability that the allele is maintained during the adaptive phase increases with hs_b. For hα_d ≫ 1, the result depends only on the ratio of the selection coefficients as also predicted by the deterministic approximation (Orr and Betancourt 2001). If the environmental change is followed by a bottleneck, Equations 18 and 20 can be used with R_α → R_α/B_sgv with the bottleneck factor introduced above. In contrast to P_mult, the fixation probability of multiple independent copies depends strongly on the mutation rate Θ_u and vanishes for Θ_u → 0. In Figure 6, Equation 20 is compared with simulation data. The approximation produces a good fit for α_d ≥ 10 where the Poisson approximation is valid.

By construction, both approximations (18) and (20) account only for the fixation of copies of the allele that were already in the population at time T. It is, however, also possible that a successful copy first arises for t > T as a new mutation during the adaptive phase. Since the origin of these new copies is necessarily independent, this effect contributes to P_ind. The size of this contribution depends on the population-level mutation rate Θ_u_,_t_>_T directly after the environmental change. Θ_u_,_t_>_T can be smaller than the original Θ_u that appears in Equations 18 and 20 if there is a bottleneck at T. For Θ_u_,_t_>_T = Θ_u our simulation results show that the contribution of new mutations to P_ind is substantial (Figure 6, squares). One consequence of mutational input after T is that P_ind becomes almost independent of α_d. Even more importantly, we see that the fixation of multiple independent copies is not particular to adaptations from the standing genetic variation. It occurs with basically the same probability if the selected allele enters the population after the environmental change as a recurrent new mutation (see Figure 6, triangles).

For recurrent new mutations, the simulation data show that the total fixation rate of multiple independent copies, r_ind = −ln[1 − P_ind], increases logarithmically with α_b and linearly with Θ_u. For a heuristic understanding of this dependence, assume h = 0.5 and let x(t) be the frequency of a first copy of the selected allele on its way to fixation in the absence of further mutation. For small u, the probability for a second copy of the beneficial mutation to arise while a first copy spreads to fixation is then Inline graphic . Here, t_fix is the average fixation time in 2N_e generations and we have used that the first copy spends on average equal times in frequency classes x and (1 − x). By far the largest contribution to p₂ comes from the early phase of the sweep where the frequency x of the first copy is very low. The probability of the second copy to survive until fixation of the allele depends on x, but to leading order only the survival probability for x → 0 matters, which is approximately s_b. With t_fix from Equation A17 we then obtain Inline graphic . A more detailed account will be given elsewhere.

P_ind is the probability that descendants of multiple independent copies of the selected allele segregate in the population at the time when this allele reaches fixation. Consequently, the number of copies in our simulation runs was counted at the time of fixation (same for P_mult). In practical applications, however, one is often interested in the probability of observing descendants from independent origins a fixed time G after an environmental change. This probability will decrease with G, since copies get lost by drift until, eventually (in the absence of back mutation), all copies derive from a single mutation as their common ancestor. The drift phase from the time of fixation to the time of observation G depends on the selection coefficient and will be longer for strongly selected alleles with short fixation times. In principle, this could affect the dependence of the probability of observing multiple fixed copies in a population on hα_b. To test this, we ran additional simulations to measure the probability for the survival of multiple (independent) copies G = 0.1N_e generations after the environmental change (results not shown). For alleles with fixation time t_fix < 0.1N_e, we did not detect any difference from the data displayed in Figures 5 and 6, meaning that fixation of a single copy in the neutral drift phase after initial fixation of multiple copies is rare. This is not surprising, considering that the average fixation time under neutral drift exceeds 0.1N_e generations even if the frequency of the major copy is initially at 99%.

Another question is whether multiple copies of the selected allele are likely to be found in a small experimental sample, even if they exist in the population. We tested this by arbitrarily drawing 12 chromosomes in each case of a soft sweep. Multiple copies in the sample were found in 70–80% of all cases (for Θ_u = 0.4). Summarizing our results for the fixation probabilities of multiple copies and of multiple independent copies, we can distinguish three parameter regions:

Low mutation rate, relatively strong past selection: If the mutation rate is low (Θ_u ≪ 0.1) fixation of multiple independent copies of the selected allele is unlikely. If multiple copies fix, they are most likely identical by descent. If past deleterious selection is strong, however, also the fixation of multiple homologous copies is rare. For Θ_u = 0, Equation 18 indicates that <5% and <30% of fixations originate from multiple copies for R_α ≤ 0.1 and R_α = 1, respectively (Figure 5).
Low mutation rate, relatively weak past selection: With increasing relative advantage R_α the fixation of multiple homologous copies increases. For Θ_u → 0, fixation of multiple copies occurs in >50% of the cases (P_mult > 0.5) if R_α ≳ 4 (Figure 5).
High mutation rate: For mutation rates Θ_u ≳ 0.1 fixations from independent origins are much more frequent and become more likely than the fixation of single copies. This holds true for whether the origin of the selected allele is from the standing variation or from recurrent new mutations. The fixation probability for multiple independent copies increases logarithmically with hα_b. For Θ_u = 0.4, 50–90% of substitutions involve multiple independent copies (Figure 6).

Imagine that we observe a DNA region where an adaptive substitution has happened following an environmental change at time T. Suppose that we observe this region G generations after the environmental change, and 2 ≫ G ≫ t_fix, such that the advantageous allele has reached fixation, but G (in units of 2N_e) is much shorter than the average neutral coalescent time. We want to analyze whether and how the contribution of multiple copies to an adaptive substitution affects the signature of selection on linked neutral variation. For this, it is helpful to distinguish two aspects of a selective footprint, its width in base pairs along the sequence and its maximum depth in terms of the extent of variation lost in a region close to the locus of selection.

For a hard sweep, the coalescent at the selected site itself does not extend beyond time T. Ancestral variation that has existed prior to T can be maintained only if there is recombination between the selected site and the site studied. In a core region around the selected site, where no recombination has happened, all ancestral variation is lost. Recombination therefore modulates the width of the sweep region, but in general does not affect its maximum depth. Since only recombination in the selective phase matters, and since the adaptive phase is much shorter for a strongly selected allele, the width of a selective footprint decreases with larger α_b.

For a soft sweep, the coalescent at the selected site itself extends into the ancestral environment. As compared with a hard sweep, a soft sweep therefore has a reduced maximum depth. Our results show that soft sweeps with shallower footprints are more likely for large α_b. This does not contradict that selective footprints get weaker and eventually vanish as α_b → 0, for two reasons. First, even if it is more likely for lower α_b that all ancestral variation is eliminated close to the selection center, the width of the window where this holds true gets smaller at the same time. If this width drops below the average distance of polymorphic sites, the footprint of selection becomes undetectable. Second, if we observe the sweep region G generations after positive selection begins, we can compare only selective footprints of alleles that have reached fixation by this time. If we want to study very weakly selected alleles, G needs to be so large that any footprint of selection will be washed out by new mutations that arise after time T.

The impact of a soft sweep on the molecular signature depends on whether the surviving copies are independent by descent or not. Copies from different origins are related by a neutral coalescent and represent independent ancestral haplotypes. If these haplotypes are sampled close to the locus of selection, this should mark a clearly visible difference from the classic pattern of a hard sweep. A detailed quantitative analysis with estimates of the impact on summary statistics for nucleotide variability exceeds the aims of this study and will be given elsewhere.

If multiple surviving copies are identical by descent, the expected change in the molecular footprint relative to a hard sweep depends on the strength of deleterious selection that the allele has experienced prior to the environmental change. We expect a shallower footprint (and larger deviation from the hard sweep) for weaker deleterious selection. The reason is that it is more likely for a weakly deleterious allele to segregate in a population for a long time; i.e., the average time to the most recent common ancestor in the core region of the sweep is larger for smaller α_d. Indeed, this intuition can be made more precise.

A remarkable property of the Markov process that underlies the Wright-Fisher model is that, conditional on an allele A having reached some frequency x in a population, this process is independent of the sign of the selection coefficient of A (cf. Ewens 2004, Chaps. 4.6 and 5.4; for simplicity, we assume Θ_u = 0 and h = h′ = 0.5). This has interesting consequences for adaptations from mutation-selection-drift balance. Assume that an allele A with selective disadvantage s_d that is derived from a single mutation segregates in the population at frequency x at the time T of the environmental change. Then the mean age of this allele and, more generally, the average time that it spent in each frequency class in the past are the same as if it had a selective advantage of the same absolute size prior to T. Assume that A spreads to fixation under positive selection with selection coefficient s_b after the environmental change and compare this with a sweep of an (imaginary) allele A′ with the same frequency x at time T, but selective advantage s_b throughout. For s_d = s_b, the total fixation time of the alleles and their sojourn times in every frequency class are the same; for s_d < s_b (resp. s_d > s_b) they are longer (shorter) for A.

The above argument shows that the footprint of a sweep from the standing genetic variation is identical to a “usual” sweep pattern if the selection coefficient changes its sign, but not its absolute value upon the environmental change. If we observe the sweep region at time G, the only difference from a sweep that has originated from a new mutation after time T is the somewhat older age of the sweep from the standing variation. For s_d ≠ s_b, the change in the selection regime leads to differences in the expected footprint of alleles A and A′. Clearly, this difference is due to the cases where the coalescent of A (and A′) extends into the old environment, i.e., where the sweep is “soft.” For s_d > s_b, the expected coalescence in the ancestral environment is faster for A than for A′, leading to a stronger footprint of selection. However, since soft sweeps are very rare for s_d > s_b, this will hardly lead to a detectable difference in the average footprint.

Let us now concentrate on the case s_b > s_d, or R_α > 1, where soft sweeps are frequent. In this case, the coalescence in the ancestral environment is slower and the selective signature for A is reduced in depth and width relative to A′ (due to the increased opportunity for mutation and recombination until the allele is fully coalesced). If the frequency x of the allele at time T is large, the sweep pattern of A will look more like a sweep of an advantageous allele with a selection coefficient of size s_d < s_b. We therefore also expect to find a larger difference between the footprints of soft sweeps and hard sweeps from a new mutation in this case. For a rough estimate of when this difference should be detectable, we compare the total fixation times of the allele A in the case of a soft sweep, t_fix,soft(s_d, s_b), with the average duration of a sweep from a new mutation t_fix(s_b) (cf. Equation 13). For an optimal (that is, minimal) time of observation G ≈ t_fix(s_b), we expect a clear difference in the selective signatures if the increase in coalescence time is of the same order of magnitude as the original coalescence time. Estimating the relative change in coalescence time by the change in fixation time, this means t_Δ = t_fix,soft(s_d, s_b) − t_fix(s_b) ≳ t_fix(s_b). We derive t_Δ from the frequency distribution of the allele at the time T conditional on multiple fixation and results from diffusion theory on the expected age of an allele given its frequency; details are given in the appendix. The results (not shown) predict visible changes in the sweep pattern for a minimum of R_α between 20 and 100.

DISCUSSION

The adaptive process is the genetic response of a population to external challenges. In nature, these challenges may be due to changes in climate or food resources or arise with the advent of a new predator or parasite. They either affect the original habitat of the population or are a consequence of the colonization of a new niche or of human artificial selection. In this article, we are interested in the adaptive response of a previously well-adapted population to a sudden and permanent change. We concentrate on a single locus with two (classes of) alleles, one, a, ancestral, and the other, A, derived. Allele A is either neutral or deleterious under the original conditions, but selectively advantageous after the change in the selection regime at some time T. We compare two scenarios: either A already segregates in the population at time T and fixes from the standing genetic variation or the population adapts from a new copy of the allele that enters the population only after the environmental shift.

Our results rely on two main assumptions. First, and most importantly, we assume that adaptation of the target allele does not interfere with positive or negative selection on other alleles, through either linkage or epistasis. This assumption is usually made in population genetic studies of selective sweeps. It is satisfied if the rate of selective substitutions is low and the time to fixation for each individual substitution is short, but is less plausible for weakly selected alleles with long average fixation times. In general, interference reduces fixation probabilities, with a stronger influence on weak substitutions (Barton 1995), although this does not translate into a large effect on the reduction of heterozygosity due to a selective sweep (Kim and Stephan 2003). In their study of fixation probabilities of alleles from the standing variation, Orr and Betancourt (2001) did not find a large effect of interference. This, however, may be a consequence of the neglect of new mutations and the restriction to a low initial frequency of the selected allele in their simulations. These assumptions make it unlikely that two or more beneficial alleles escape early stochastic loss and compete on their way to fixation. We therefore emphasize that our results are conditional on noninterference. Second, we assume that the variation at the locus under consideration is maintained in mutation-selection-drift balance prior to the environmental change. If selected alleles are maintained as a balanced polymorphism or are not in equilibrium at all, this may clearly affect our conclusions.

Our results pertain to three main issues: the dependence of fixation probabilities on selection coefficients if alleles are taken from the standing genetic variation, the relative importance of the standing variation and new mutations as the origin of adaptive substitutions, and the expected impact of a selective sweep from the standing genetic variation on linked nucleotide variation. We discuss them in turn.

Fixation probability from the standing variation:

In a famous argument that helped to found the micro-mutationist view of the adaptive process, Fisher (1930) showed that mutations with a small effect are much more likely to be beneficial than mutations with a large effect. Kimura (1983), however, pointed out a flaw in this argument: Even if a large majority of new beneficial mutations has a small effect, as Fisher argues, this may be offset by a much smaller fixation probability of weakly selected alleles. An allele with (constant) heterozygote advantage hs_b that enters the population as a single new copy will escape stochastic loss and spread to fixation with probability 2hs_b. One can think of stochastic loss as a sieve where small-effect alleles pass through the holes—and vanish from the population—much more often than alleles with a large selective advantage. A variant of this picture is known as Haldane's sieve and pertains to different levels of dominance: Substitutions are likely to be dominant since dominant alleles enjoy higher fixation rates.

This latter scenario is the subject of Orr and Betancourt (2001), who study Haldane's sieve if selected alleles are taken from the standing genetic variation. They conclude that the sieve is not active in this case. If the selected allele is deleterious under the original conditions (with heterozygote disadvantage h′s_d), and if the level of dominance is maintained upon the environmental shift, h = h′, the net fixation probability is approximately independent of dominance. It is easy to understand why: The advantage of a higher fixation rate with larger h is compensated by the lower frequency of the initially deleterious allele in mutation-selection balance. Orr and Betancourt (2001) focus on a limited parameter range, where the selected allele is definitely deleterious under the original conditions and thus starts at a low frequency. In their calculations, they also assume that the original deleterious effect is larger than the subsequent beneficial effect of the allele, meaning that the relative selective advantage R_α = 2hα_b/(2h′α_d + 1) < 1. Our study extends their analysis to arbitrary values of R_α. The simple analytical approximation for the probability of a substitution from the standing variation (Equation 10 above, resp. Equation 3 in Orr and Betancourt 2001), which uses the deterministic value for the initial frequency of A in mutation-selection balance, is no longer valid in the general case. Nevertheless, there is an equally simple expression, Equation 8, which serves as an approximation for the entire parameter range.

Our results corroborate and extend the findings of Orr and Betancourt (2001). To the order of our approximation, the fixation probability from the standing genetic variation depends on selection only through R_α. If selection is strong in both environments, and h′ = h, it is independent of dominance. More generally, if beneficial and deleterious effects of alleles in different environments were strictly proportional, the distribution of the effects of adaptations from the standing variation would coincide with the distribution of the effects of new beneficial mutations, as implicitly assumed in Fisher's (1930) argument. The reason is the same as in the case of dominance: Advantages in the fixation probability due to a larger α_b are compensated by disadvantages due to a smaller initial frequency with higher α_d.

Remarkably, we find that the stochastic sieve is substantially weakened even if alleles with a larger selective advantage do not have a larger disadvantage to compensate for it. If alleles are originally neutral or under relatively weak deleterious selection, such that R_α > 1, there is only a very weak logarithmic dependence of the fixation probability on all parameters for selection or dominance. The reason is the high initial frequency of the successful alleles in this case, which may be much higher than the average frequency of all segregating alleles. At these high frequencies, the fixation probability is only weakly dependent on the selection coefficient of the allele. There is, however, a sieve acting against alleles under disproportionately large past selection, R_α < 1. If the selected physiological function (with fixed hα_b) is met by several alleles with different h′α_d, alleles with a relatively mild deleterious effect in the past, h′α_d < hα_b, will be preferred. Note that this should confer a certain level of resilience to the population if the environmental conditions change back.

Empirical estimates of R_α, the relative selection strength, are difficult to obtain and generally not available. There is no a priori reason to assume that s_b is either larger or smaller than s_d (s_b < s_d was assumed by Orr and Betancourt 2001). To see this, note that the roles of the alleles A and a and the selection coefficients s_b and s_d are exchanged if the environment changes back to the old conditions at some later time. This argument does not pertain to the average selection coefficient of any deleterious allele (which is plausibly larger than the average beneficial effect), but only to the selection coefficients of deleterious alleles that are beneficial in the new environment. Several factors can cause an upward or downward bias of R_α. R_α is downward biased if there is a bottleneck at the time of the environmental change. In this case, the effective population size that enters α_b is reduced relative to the original N_e that enters α_d. An upward bias in R_α could result from a change in dominance following the environmental shift. To see this, assume that alleles a and A serve different functions that are only (or mostly) used in the old and new environments, respectively. The physiological theory of dominance claims that the common observation of dominant wild-type alleles is a natural consequence of multienzyme biochemistry (e.g., Kacser and Burns 1981; Orr 1991; Keightley 1996). If this holds true, it is natural to expect that there is at least partial dominance of the respective advantageous (wild-type) allele, hence of a (A) in the old (new) environment, and thus h > h′. Finally, if R_α is measured among successful substitutions from the standing genetic variation, a further upward bias results from the stochastic sieve against alleles with large h′α_d.

Relative importance of adaptations from the standing variation and from new mutations:

To estimate the importance of the standing genetic variation as a reservoir for adaptations, we compare a polymorphic population, in mutation-selection-drift balance, with a monomorphic one. We can measure the additional adaptive potential of the polymorphic population in the number of generations G_sgv that a monomorphic population must wait for sufficiently many new mutations to arrive to match the fixation probability from the standing variation. G_sgv can be very large for mutations with small effect (of the order 1/hs_b generations). However, for a population of constant size it is always smaller than the average fixation time of the allele. This means that there is no clear separation of adaptive phases: By the time most alleles from the standing genetic variation with a given selective advantage hα_b have reached fixation, substitutions from new mutations (with the same hα_b) will also be found. Only if the environmental change is followed by a strong reduction in population size is the reservoir of the standing variation exploited well before new mutations start to play a role.

We have also determined the probability that the standing variation contributes to an adaptive substitution that is observed some time G after an environmental change. Clearly, this probability generally declines with G. For fixed G there are two distinct parameter regions where the standing variation is most important.

Adaptations from the standing variation are favored for alleles with small effect that are under relatively weak past selection, R_α ≥ 1. This is a direct consequence of the stochastic sieve that eliminates weak alleles in a new mutation scenario. The effect is especially pronounced if the environmental shift is followed by a bottleneck with incomplete recovery. The percentage of substitutions that use alleles from the standing variation is then almost independent of the mutation rate since Θ_u affects the fixation probabilities from standing and new variation in the same way.
The standing variation is also important for alleles with a large relative selective advantage (R_α ≫ 1) if the mutation rate Θ_u is also high. In this case, fixation probabilities are high under both scenarios, new mutations and standing genetic variation. Since the standing variation other then new mutations is immediately available, it will usually contribute a major share to the substitution. Note that R_α ≫ 1 is plausible in particular for “important” adaptations with large effect, such as insecticide-resistance alleles. Whether such an adaptation likely originated from the standing genetic variation then depends mainly on Θ_u.

Selective footprints of soft sweeps:

For a classical sweep from a single new mutation, which we call a hard sweep, ancestral variation can be preserved only if there is recombination between the polymorphic locus and the selection target during the selective phase. In a “core” region around the selection center all ancestral variation is erased. In contrast, with a soft sweep, multiple copies of the selected allele contribute to the substitution. Depending on the history of these copies, part of the ancestral variation may then be maintained and appear as haplotype structure in the population. There are two types of soft sweeps. For the first type, multiple copies that contribute to the substitution derive from independent mutations. For the second type, multiple copies that existed at the time of the environmental change contribute to the substitutions, but these copies are identical by descent.

Soft sweeps of the first type (independent origins) are frequent if the mutation rate on the population level is sufficiently high (Θ_u ≳ 0.1); see Figure 6. Their probability relative to a sweep from a single origin also increases with the selection strength hα_b, i.e., altogether for alleles with high adaptive rates. Suprisingly, soft sweeps of this type are not exclusive to adaptations from the standing genetic variation, but occur with the same probability for adaptations that originate only from new mutations, which have entered the population after the environmental change. Even if material from the standing variation is used, most soft sweeps with copies from independent origins also involve new mutations. Since surviving copies represent independent ancestral haplotypes, we expect characteristic differences in the selective footprint relative to the classic pattern of a hard sweep, where only a single ancestral haplotype survives in the core region close to the selection site. A discussion of the effect of soft sweeps on the summary statistics for nucleotide variation will be given elsewhere.

Soft sweeps of the second type (copies with a common origin prior to the environmental change) can occur only for adaptations from the standing genetic variation. They are frequent even for a very low mutation rate Θ_u → 0 if the allele has a high relative selective advantage R_α ≳ 4; see Figure 5. The sweep pattern depends on the strength of deleterious selection that the allele has experienced in the old environment. For R_α > 1, we expect a weaker footprint with a narrower sweep region than predicted for a hard sweep with the same selective advantage hα_b. We predict, however, that differences in the sweep patterns are visible only for a minimum R_α of 20–100. For α_d = 0, where the probability of multiple fixations and the resulting effect on the sweep pattern are strongest, this has been studied in a recent publication by Innan and Kim (2004). Using computer simulations, these authors indeed find much weaker selective footprints if the alleles are taken from the standing genetic variation. Since their minimum value of R_α is 1000, their results fit our predictions.

We can summarize our results on soft sweeps in three observations. First, evidence of a soft sweep does not result in an easy criterion to distinguish adaptive substitutions from the standing variation and recurrent new mutations. For a large parameter space we will not be able to detect any difference between these adaptive scenarios. This confirms the conclusion of Orr and Betancourt (2001), although partly for different reasons. For high Θ_u ≳ 0.1, soft sweeps are frequent in both cases; for low Θ_u and R_α ≲ 20 they either are rare in both cases or do not lead to significant differences in the selective footprints. For a range of “interesting” substitutions, namely alleles with a large effect but a low mutation rate, however, the linked nucleotide pattern could be informative.

Second, soft sweeps are frequent in a limited but relevant parameter space. We expect soft sweeps with characteristic patterns on the selective footprints for high Θ_u, i.e., either if the population size is large or if the allelic mutation rate is high, such as at mutational hotspots or if the adaptation corresponds to a loss-of-function mutation of the gene. We also expect soft sweeps for large adaptations with hα_b ≫ h′α_d (thus R_α ≫ 1) from the standing variation, even if the mutation rate is small. The effect of a soft sweep in this last case is a reduction in the width of the sweep region relative to a hard sweep. A possible candidate for a soft sweep of this type is the evolution of DDT resistance in non-African populations of D. melanogaster. In recent studies of nucleotide and microsatellite variability in the region around an Accord insertion that is associated with DDT resistance, Schlenke and Begun (2004) and Catania et al. (2004) found evidence for a selective sweep. The width of the sweep region, however, was much narrower in D. melanogaster than expected under putatively very strong selection (Catania et al. 2004) and, as observed, for the “same” adaptation (with a Doc insertion) in D. simulans (Schlenke and Begun 2004).

Third, while hard sweeps from single mutations produce the strongest footprint for strongly selected alleles with short fixation times, the possibility of fixation of multiple alleles leads to an opposite trend: Soft sweeps with weaker footprints are more frequent for high α_b. Since the increase is only logarithmic, this trend is not very strong. Nevertheless, it could be visible for nucleotides that are tightly linked to the selected allele in regions of low recombination or in sufficiently small windows around the selection target. A genome-wide study of the small-scale reduction of heterozygosity in narrow windows of 200 bp around replacement or silent fixations has recently been performed for D. simulans by Kern et al. (2002). We note that their counterintuitive finding of a sweep signature for preferred codon substitutions, but not for replacement substitutions, matches our prediction of a stronger sweep signal for weakly selected alleles close to the selection center. However, a quantitative analysis of soft sweeps that also accounts for other factors like population substructure is needed before any conclusions can be drawn.

Acknowledgments

We thank Sylvain Mousset and Wolfgang Stephan for fruitful discussions and John Parsch for helpful comments on the manuscript. The careful comments by Sally Otto and an anonymous reviewer led to many clarifying changes. We also thank Pieter van Beek for help with the computer simulations. This work was supported by an Emmy Noether grant from the Deutsche Forschungsgemeinschaft to J.H.

APPENDIX

Fixation probability for a mutation segregating at neutrality:

We calculate the average fixation probability of an allele that is derived from a single mutation and segregates in the population under neutrality at the time T of the environmental change. The probability that there are exactly k copies at time T is distributed as ρ(k) = a_Nk⁻¹, where Inline graphic . Assuming a selection coefficient s_b for t > T and no dominance (h = 0.5), the average fixation probability is given by

We derive the sum in (A1) as

where ₂F₁ denotes the hypergeometric function. For N_es_b ≫ 1, this second term can be neglected and we obtain

In the limit of small s_b and large N_e this reduces to

where γ = 0.577 … is Euler's constant. For weak recessivity, this result holds if we replace s_b by 2hs_b.

Fixation probability for allele in mutation-selection-drift balance:

To calculate the frequency distribution of a derived allele, we start out with the Kolmogorov forward equation that describes the Wright-Fisher model in the diffusion limit (Ewens 2004),

where

are the drift and diffusion terms. Forward mutations are measured by Θ_u; back mutations are measured by Θ_v. Since the diffusion process is ergodic, the probability that the frequency of an allele falls into a certain interval [x₁, x₂] is proportional to the average time T that an allele that starts out as a single copy spends in this frequency range before it is either lost or fixed. The frequency distribution therefore directly follows from the well-known transient behavior of the process, e.g., Ewens (2004)(Chap. 4). From Equations 4.23 and 4.16 in Ewens (2004), we obtain

where C is a normalization constant. Note that this expression deviates from Wright's stationary distribution of an allele in mutation-selection-drift balance since we condition on the case that A is derived.

Simple approximate relations for Equation A7 are readily obtained in various limiting cases. First, direct numerical integration shows that back mutations can safely be ignored even in the neutral case α_d = 0 because most alleles segregate at low frequencies (this is a consequence of conditioning on derived alleles). In the neutral case, this approximation directly leads to Equation 5. If there is deleterious selection, we need to distinguish cases of weak and strong recessivity of the allele A. We concentrate mostly on the case where deleterious selection on the heterozygote is sufficiently strong, 2h′α_d ≫ (1 − 2h′)/2h′ (i.e., weak recessivity). Under these conditions, we can ignore the quadratic terms in the exponentials and express ρ(x) in terms of incomplete Gamma functions,

with normalization constant C′. For definitely deleterious A (2h′α_d ≥ 10 is sufficient), the integrand in Equation A7 is concentrated near y = 1. We can then expand y^Θu in the denominator to leading order around y = 1 (i.e., y^Θu ≈ 1) and obtain ρ(x) in terms of simple functions, which leads to Equation 6.

To obtain an analytical expression for the probability of fixation P_sgv or multiple fixation P_mult, we need to approximate ρ(x) further. If the allele A is neutral prior to the environmental change, and Θ_u ≪ 1, ρx in Equation 5 is ∼ρx ≈ Θ_ux^Θu−1. Using this in Equation 4,

where we extend the integral over exp(−2hα_bx) to ∞ after increasing 2hα_b by 1 to avoid a singularity near α_b = 0. We also use Γ(Θ_u + 1) ≈ 1 for 0 ≤ Θ_u ≤ 1.

For the deleterious case (2h′α_d ≫ 1), note that the allele frequency distribution is significantly larger than zero only for x ≤ 1/2h′α_d. Expanding around x = 0 we can approximate ρ(x) in Equation 6 as ρx ≈ C″x^Θu−1exp−2h′α_dx and obtain

A10

which gives Equation 8. In Equation A10, we have again extended integral limits after adding 1 to 2h′α_d, respectively 2hα_b + 2h′α_d. We now see that the approximation for 2h′α_d ≫ 1 reproduces the approximation for α_d = 0 in the limit α_d → 0. We can therefore use it in the entire parameter range. For Θ_u < 1, the probability that the allele A is not contained in the standing variation at time T can be approximated by the integral over ρ(x) from 0 to 1/2N_e (confirmed by simulations; see also Ewens 2004, Chap. 5.7). With the above approximations for ρ(x) this results in Equation 7. Finally, also P_mult is obtained by an analogous calculation.

If the allele A is completely recessive prior to the environmental change, h′ = 0, we again obtain an expression in incomplete Gamma functions for ρ(x) similar to Equation A8. For large α_d, this reduces to

A11

Using this expression in Equation 4, we see that the term exp[−α_dx²] can be ignored as long as 2hα_b > Inline graphic since the integral is cut off by exp[−2hα_bx]. For 2hα_b < , both selection coefficients are important. We can obtain a simple, yet compared to simulation data (not shown) reasonable, analytic approximation that captures this crossover behavior by formally replacing 2h′α_d + 1 by Inline graphic + 1 in Equations 8, 7, and 18 if h′ = 0.

The average frequency of the allele A at time T conditioned on later fixation, Inline graphic _fix, is calculated from the distribution Pr(x|fix) = Cρ(x)Π_x(hα_b). With the above approximations for ρ(x), we obtain

A12

For Θ_u → 0, this gives

A13

Finally, if also α_d = 0 and 2hα_b ≫ 1,

A14

For the calculation of the average increase in the age of a selected allele for a soft sweep with a weak trade-off, we use the frequency distribution of the allele at time T conditioned on multiple fixation, Pr(x|mfix) ≈ Cρ(x)(Π_x(hα_b))². [We use the Poisson approximation Equation 16 and 2hα_bx ≈ 1 − exp(−2hα_bx) for small x, where ρ(x) is large.] We consider only the case Θ_u → 0 and h = h′ = 0.5. For a given allele frequency x at time T, we determine the average age t_a(α_d, x) of the allele using Equation 5.113 in Ewens (2004) (see also Kimura and Ohta 1969),

A15

The increase in the age of the allele due to the change of the selection regime then is obtained by numerical integration as t_Δ = ∫(t_a(α_d, x) − t_a(α_b, x))Pr(x|mfix)dx. Choosing x = 1, Equation A15 allows for a simple approximation for the fixation time of a new allele with selective advantage α_b. We derive

A16

For α_b ≥ 3, this may be approximated as

A17

where γ ≈ 0.577 is Euler's Gamma. The error term is of order α⁻³_b. To the best of our knowledge, this simple result has not yet been used in the literature. Simulation results of our own (not included) and in Kimura and Ohta (1969) show that the estimate is very accurate. For h ≠ 0.5, we can replace α_b by 2hα_b in Equation A17. The approximation then holds as a lower bound for t_fix, since the fixation time increases if h deviates from 0.5 in either direction.

References

Barton, N. H., 1995. Linkage and the limits to natural selection. Genetics 140: 821–841. [DOI] [PMC free article] [PubMed] [Google Scholar]
Barton, N. H., 1998. The effect of hitch-hiking on neutral genealogies. Genet. Res. 72: 123–133. [Google Scholar]
Catania, F., M. O. Kauer, P. J. Daborn, J. L. Yen, R. H. Ffrench-Constant et al., 2004. World-wide survey of an Accord insertion and its association with DDT resistance in Drosophila melanogaster. Mol. Ecol. 13: 2491–2504. [DOI] [PubMed] [Google Scholar]
Ewens, W. J., 2004 Mathematical Population Genetics, Ed. 2. Springer, Berlin.
Falconer, D. S., and T. F. C. Mackay, 1996 Introduction to Quantitative Genetics. Addison Wesley Longman, Harlow, Essex, UK.
Fisher, R. A., 1930 The Genetical Theory of Natural Selection. Oxford University Press, Oxford.
Haldane, J. B. S., 1927. A mathematical theory of natural and artificial selection. Part V: selection and mutation. Proc. Camb. Philos. Soc. 23: 838–844. [Google Scholar]
Hansen, T. F., C. Pelabon, W. S. Armbruster and M. L. Carlson, 2003. Evolvability and genetic constraint in Dalechampia blossoms: components of variance and measures of evolvability. J. Evol. Biol. 16: 754–765. [DOI] [PubMed] [Google Scholar]
Houle, D., 1992. Comparing evolvability and variability of quantitative traits. Genetics 130: 195–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
Innan, H., and Y. Kim, 2004. Pattern of polymorphism after strong artificial selection in a domestication event. Proc. Natl. Acad. Sci. USA 101: 10667–10672. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kacser, H., and J. A. Burns, 1981. The molecular basis of dominance. Genetics 97: 6639–6666. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kaplan, N. L., R. R. Hudson and C. H. Langley, 1989. The “hitchhiking effect” revisited. Genetics 123: 887–899. [DOI] [PMC free article] [PubMed] [Google Scholar]
Keightley, P. D., 1996. A metabolic basis for dominance and recessivity. Genetics 143: 621–625. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kern, A. D., C. D. Jones and D. J. Begun, 2002. Genomic effects of nucleotide substitutions in Drosophila simulans. Genetics 162: 1753–1761. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim, Y., and W. Stephan, 2000. Joint effects of genetic hitchhiking and background selection on neutral variation. Genetics 155: 1415–1427. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim, Y., and W. Stephan, 2002. Detecting a local signature of genetic hitchhiking along a recombining chromosome. Genetics 160: 765–777. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim, Y., and W. Stephan, 2003. Selective sweeps in the presence of interference among partially linked loci. Genetics 164: 389–398. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kimura, M., 1957. Some problems of stochastic processes in genetics. Ann. Math. Stat. 28: 882–901. [Google Scholar]
Kimura, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.
Kimura, M., and T. Ohta, 1969. The average number of generations until fixation of a mutant gene in a finite population. Genetics 61: 763–771. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lande, R., and S. J. Arnold, 1983. The measurement of selection on correlated characters. Evolution 37: 1210–1226. [DOI] [PubMed] [Google Scholar]
Lynch, M., and J. B. Walsh, 1998 Genetics and Analysis of Quantitative Traits. Sinauer, Sunderland, MA.
Maynard Smith, J., and J. Haigh, 1974. The hitch-hiking effect of a favourable gene. Genet. Res. 23: 23–35. [PubMed] [Google Scholar]
Orr, H. A., 1991. A test of Fisher's theory of dominance. Proc. Natl. Acad. Sci. USA 88: 11413–11415. [DOI] [PMC free article] [PubMed] [Google Scholar]
Orr, H. A., and A. J. Betancourt, 2001. Haldane's sieve and adaptation from the standing genetic variation. Genetics 157: 875–884. [DOI] [PMC free article] [PubMed] [Google Scholar]
Otto, S., and M. C. Whitlock, 1997. The probability of fixation in populations of changing size. Genetics 146: 723–733. [DOI] [PMC free article] [PubMed] [Google Scholar]
Przeworski, M., 2002. The signature of positive selection at randomly chosen loci. Genetics 160: 1179–1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schlenke, T. B., and D. J. Begun, 2004. Strong selective sweep associated with transposon insertion in Drosophila simulans. Proc. Natl. Acad. Sci. USA 101: 1626–1631. [DOI] [PMC free article] [PubMed] [Google Scholar]
Steppan, S. J., P. C. Phillips and D. Houle, 2002. Comparative quantitative genetics: evolution of the G matrix. TREE 17: 320–327. [Google Scholar]

[bib1] Barton, N. H., 1995. Linkage and the limits to natural selection. Genetics 140: 821–841. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] Barton, N. H., 1998. The effect of hitch-hiking on neutral genealogies. Genet. Res. 72: 123–133. [Google Scholar]

[bib3] Catania, F., M. O. Kauer, P. J. Daborn, J. L. Yen, R. H. Ffrench-Constant et al., 2004. World-wide survey of an Accord insertion and its association with DDT resistance in Drosophila melanogaster. Mol. Ecol. 13: 2491–2504. [DOI] [PubMed] [Google Scholar]

[bib4] Ewens, W. J., 2004 Mathematical Population Genetics, Ed. 2. Springer, Berlin.

[bib5] Falconer, D. S., and T. F. C. Mackay, 1996 Introduction to Quantitative Genetics. Addison Wesley Longman, Harlow, Essex, UK.

[bib6] Fisher, R. A., 1930 The Genetical Theory of Natural Selection. Oxford University Press, Oxford.

[bib7] Haldane, J. B. S., 1927. A mathematical theory of natural and artificial selection. Part V: selection and mutation. Proc. Camb. Philos. Soc. 23: 838–844. [Google Scholar]

[bib8] Hansen, T. F., C. Pelabon, W. S. Armbruster and M. L. Carlson, 2003. Evolvability and genetic constraint in Dalechampia blossoms: components of variance and measures of evolvability. J. Evol. Biol. 16: 754–765. [DOI] [PubMed] [Google Scholar]

[bib9] Houle, D., 1992. Comparing evolvability and variability of quantitative traits. Genetics 130: 195–204. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] Innan, H., and Y. Kim, 2004. Pattern of polymorphism after strong artificial selection in a domestication event. Proc. Natl. Acad. Sci. USA 101: 10667–10672. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] Kacser, H., and J. A. Burns, 1981. The molecular basis of dominance. Genetics 97: 6639–6666. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] Kaplan, N. L., R. R. Hudson and C. H. Langley, 1989. The “hitchhiking effect” revisited. Genetics 123: 887–899. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Keightley, P. D., 1996. A metabolic basis for dominance and recessivity. Genetics 143: 621–625. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] Kern, A. D., C. D. Jones and D. J. Begun, 2002. Genomic effects of nucleotide substitutions in Drosophila simulans. Genetics 162: 1753–1761. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] Kim, Y., and W. Stephan, 2000. Joint effects of genetic hitchhiking and background selection on neutral variation. Genetics 155: 1415–1427. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Kim, Y., and W. Stephan, 2002. Detecting a local signature of genetic hitchhiking along a recombining chromosome. Genetics 160: 765–777. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] Kim, Y., and W. Stephan, 2003. Selective sweeps in the presence of interference among partially linked loci. Genetics 164: 389–398. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib18] Kimura, M., 1957. Some problems of stochastic processes in genetics. Ann. Math. Stat. 28: 882–901. [Google Scholar]

[bib19] Kimura, M., 1983 The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.

[bib20] Kimura, M., and T. Ohta, 1969. The average number of generations until fixation of a mutant gene in a finite population. Genetics 61: 763–771. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Lande, R., and S. J. Arnold, 1983. The measurement of selection on correlated characters. Evolution 37: 1210–1226. [DOI] [PubMed] [Google Scholar]

[bib22] Lynch, M., and J. B. Walsh, 1998 Genetics and Analysis of Quantitative Traits. Sinauer, Sunderland, MA.

[bib23] Maynard Smith, J., and J. Haigh, 1974. The hitch-hiking effect of a favourable gene. Genet. Res. 23: 23–35. [PubMed] [Google Scholar]

[bib24] Orr, H. A., 1991. A test of Fisher's theory of dominance. Proc. Natl. Acad. Sci. USA 88: 11413–11415. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] Orr, H. A., and A. J. Betancourt, 2001. Haldane's sieve and adaptation from the standing genetic variation. Genetics 157: 875–884. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Otto, S., and M. C. Whitlock, 1997. The probability of fixation in populations of changing size. Genetics 146: 723–733. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] Przeworski, M., 2002. The signature of positive selection at randomly chosen loci. Genetics 160: 1179–1189. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Schlenke, T. B., and D. J. Begun, 2004. Strong selective sweep associated with transposon insertion in Drosophila simulans. Proc. Natl. Acad. Sci. USA 101: 1626–1631. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] Steppan, S. J., P. C. Phillips and D. Houle, 2002. Comparative quantitative genetics: evolution of the G matrix. TREE 17: 320–327. [Google Scholar]

PERMALINK

Soft Sweeps

Joachim Hermisson

Pleuni S Pennings

Abstract

MODEL AND METHODS

Simulations:

Figure 6.—

Bottleneck:

Number of (independent) copies:

RESULTS

Fixation probability from the standing genetic variation:

Figure 1.—

Figure 2.—

Standing variation vs. new mutations:

Figure 3.—

Population bottlenecks:

Figure 4.—

Footprints of soft sweeps:

Figure 5.—

DISCUSSION

Fixation probability from the standing variation:

Relative importance of adaptations from the standing variation and from new mutations:

Selective footprints of soft sweeps:

Acknowledgments

APPENDIX

Fixation probability for a mutation segregating at neutrality:

Fixation probability for allele in mutation-selection-drift balance:

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Soft Sweeps

Joachim Hermisson

Pleuni S Pennings

Abstract

MODEL AND METHODS

Simulations:

Figure 6.—

Bottleneck:

Number of (independent) copies:

RESULTS

Fixation probability from the standing genetic variation:

Figure 1.—

Figure 2.—

Standing variation vs. new mutations:

Figure 3.—

Population bottlenecks:

Figure 4.—

Footprints of soft sweeps:

Figure 5.—

DISCUSSION

Fixation probability from the standing variation:

Relative importance of adaptations from the standing variation and from new mutations:

Selective footprints of soft sweeps:

Acknowledgments

APPENDIX

Fixation probability for a mutation segregating at neutrality:

Fixation probability for allele in mutation-selection-drift balance:

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases