Skip to main content
Genetics logoLink to Genetics
. 2012 Aug;191(4):1283–1293. doi: 10.1534/genetics.112.141325

Fluctuations of Fitness Distributions and the Rate of Muller’s Ratchet

Richard A Neher *,1, Boris I Shraiman
PMCID: PMC3416007  PMID: 22649084

Abstract

The accumulation of deleterious mutations is driven by rare fluctuations that lead to the loss of all mutation free individuals, a process known as Muller’s ratchet. Even though Muller’s ratchet is a paradigmatic process in population genetics, a quantitative understanding of its rate is still lacking. The difficulty lies in the nontrivial nature of fluctuations in the fitness distribution, which control the rate of extinction of the fittest genotype. We address this problem using the simple but classic model of mutation selection balance with deleterious mutations all having the same effect on fitness. We show analytically how fluctuations among the fittest individuals propagate to individuals of lower fitness and have dramatically amplified effects on the bulk of the population at a later time. If a reduction in the size of the fittest class reduces the mean fitness only after a delay, selection opposing this reduction is also delayed. This delayed restoring force speeds up Muller’s ratchet. We show how the delayed response can be accounted for using a path-integral formulation of the stochastic dynamics and provide an expression for the rate of the ratchet that is accurate across a broad range of parameters.


BY weeding out deleterious mutations, purifying selection acts to preserve a functional genome. In sufficiently small populations, however, weakly deleterious mutations can by chance fix. This phenomenon, termed Muller’s ratchet (Muller 1964; Felsenstein 1974), is especially important in the absence of recombination and is thought to account for the degeneration of Y chromosomes (Rice 1987) and for the absence of long-lived asexual lineages (Lynch et al. 1993).

A click of Muller’s ratchet refers to the loss of the class of individuals with the smallest number of deleterious mutations. To understand the processes responsible for such a click, it is useful to consider a simple model of accumulation of deleterious mutations with identical effect sizes illustrated in Figure 1. Because of mutations, the population spreads out along the fitness axis, which in this model is equivalent to the number of deleterious mutations in a genome. The population can hence be grouped into discrete classes, each characterized by the number of deleterious mutations. Mutation carries individuals from classes with fewer to classes with more mutations, hence shifting the population to the left. This tendency is opposed by selection, which amplifies fit individuals on the right, while decreasing the number of unfit individuals on the left. These opposing trends lead to a steady balance, at least in sufficiently large populations. However, in addition to selection and mutation, the distribution of individuals among fitness classes is affected by fluctuations in the number of offspring produced by individuals of different classes, i.e., by genetic drift. Such fluctuations are stronger (in relative terms) in smaller populations and in particular in classes that carry only a small number of individuals. When mutation rate is high and selection is weak, the class of individuals with the smallest number of mutations (k = 0 in Figure 1) contains only few individuals and is therefore susceptible to accidental extinction—an event that corresponds to the “click” of Muller’s ratchet.

Figure 1 .

Figure 1 

Deleterious mutation–selection balance. The population is distributed among classes of individuals carrying k deleterious mutations. Classes with few mutations grow due to selection (red arrows), but lose individuals through mutations (green arrows), while classes with many mutations are selected against but replenished by mutations.

Despite the simplicity of the classic model described above, understanding the rate of the ratchet has been a challenge and remains incomplete (Stephan et al. 1993; Gessler 1995; Higgs and Woodcock 1995; Gordo and Charlesworth 2000; Stephan and Kim 2002; Etheridge et al. 2007; Jain 2008; Waxman and Loewe 2010). Here, we revisit this problem starting with the systematic analysis of fluctuations in the distribution of the population among different fitness classes. We show that fitness classes do not fluctuate independently. Instead, there are collective modes affecting the entire distribution, which relax on different time scales. Having identified these modes, we calculate the fluctuations of the number of individuals in the fittest class and show how these fluctuations affect the mean fitness. Fluctuations in mean fitness feed back on the fittest class with a delay and thereby control the probability of extinction. These insights allow us to arrive at a better approximation to the rate of the ratchet. In particular, we show that the parameter introduced in the earlier work (Haigh 1978; Stephan et al. 1993; Gordo and Charlesworth 2000) to parameterize the effective strength of selection in the least loaded class is not a constant but depends on the ratio of the mutation rate and the effect size of mutations. We use the path-integral representation of stochastic processes borrowed from physics (Feynman and Hibbs 1965) to describe the dynamics of the fittest class and arrive at an approximation of the rate of Muller’s ratchet that is accurate across a large parameter range.

Understanding the rate of the ratchet is important, for example, to estimate the number of beneficial mutations required to halt the ratchet and prevent the mutational meltdown of a population (Lynch et al. 1993; Goyal et al. 2012; Pfaffelhuber et al. 2012) (for an in-depth and up-to-date discussion of the importance of deleterious mutations we refer the reader to Charlesworth 2012). Furthermore, fluctuations of fitness distributions are a general phenomenon with profound implications for the dynamics of adaptation and genetic diversity of populations. Below we place our approach into the context of the recent studies of the dynamics of adaptation in populations with extensive nonneutral genetic diversity (Tsimring et al. 1996; Rouzine et al. 2003; Desai and Fisher 2007; Neher et al. 2010). The study of fluctuations in the approximately stationary state of mutation selection balance that we present here is a step toward more general quantitative theory of fitness fluctuations in adapting populations.

Model and Methods

We assume that mutations happen at rate u and that each mutation reduces the growth rate of the genotype by s ≪ 1. Within this model, proposed and formalized by Haigh, individuals can be categorized by the number of deleterious mutations they carry. The equation describing the fitness distribution in the population, i.e., what part nk of the population carries k deleterious mutations, is given by

ddtnk=s(kk)nkunk+unk1+nkηk, (1)

where k=N1kknk (knk=N) and the last term accounts for fluctuations due to finite populations, i.e., genetic drift, and has the properties of uncorrelated Gaussian white noise with 〈ηk(t)ηl(t′)〉 = δklδ(tt′). In the infinite population limit, this equation has the well-known steady-state solution nk=Neλλk/k!, where λ = u/s. A time-dependent analytic solution of the deterministic model has been described in Etheridge et al. (2007).

Note that we have deviated slightly from the standard model, which assumes that genetic drift amounts to a binomial resampling of the distribution with the current frequencies N−1nk. This choice would result in off-diagonal correlations between noise terms that stem from the constraint that the total population size is strictly constant. This exact population size constraint is an arbitrary model choice that we have relaxed to simplify the algebra. Instead, we control the population size by a soft constraint that keeps the population constant on average but allows small fluctuations of N. The implementation of this constraint is described explicitly below. We confirmed the equivalence of the two models by simulation.

Computer simulations

We implemented the model as a computer simulation with discrete generations, where each generation is produced by a weighted resampling of the previous generation. Specifically,

nk(t+1)Poisson(1Wi=0keuuii!(1s)kinki(t)), (2)

where W is the mean fitness W=CN1k(1s)knk and C=exp(NN011) is an adjustment made to the overall growth rate to keep the population size approximately at N0. This specific discretization is chosen because it has exactly the same stationary solution as the continuous-time version above (Haigh 1978). The simulation was implemented in Python using the scientific computing environment SciPy (Oliphant 2007). If the parameter of the Poisson distribution was larger than 104, a Gaussian approximation to the Poisson distribution was used to avoid integer overflows.

To determine the ratchet rate, the population was initialized with its steady-state expectation nk, allowed to equilibrate for 104 generations, and then run for further T = 108 generations. Over these 108 generations, the number of clicks of the ratchet were recorded and the rate was estimated as clicks per generation.

The source code of the programs, along with a short documentation, is available as supporting information (File S2). In addition, we also provide some of the raw data and the analysis scripts producing the figures as they appear in the manuscript.

Numerical determination of the most likely path

The central quantities in our path-integral formulation of the rate of Muller’s ratchet are (i) the most likely path to extinction and (ii) the associated minimal action Sλ* (see Table 1). To determine the most likely path to extinction of the fittest class, we discretize the trajectory into m equidistant time points ρi between 0 and τ, where x0(0)=x0 and x0(ρm) = 0. For a given set of x0(ρi), a continuous path x0(ρ) is generated by linear interpolation. For a given trajectory x0(ρ), we determine the mean fitness by solving the deterministic equations for xk(ρ), k ≥ 1. Note that in this scheme, the only independent variable is the path x0(ρ), and all other degrees of freedom are slaved to x0(ρ). From x0(ρ) and the resulting k(ρ), we calculate the action Sλ({x0(ρ)}) as defined in Equation 28. Sλ({x0(ρ)}) is then minimized by changing the values of x0(ρi), 0 < i < m, using the simplex minimization algorithm implemented in SciPy (Oliphant 2007). To speed up convergence, the minimization is first done with a small number of pivot points (m = 4), which is increased in steps of 2 to m = 24. The total time τ = 20 (in units of s−1) was used, which is sufficiently large to make the result independent of τ. The code used for the minimization is provided in File S2.

Table 1 . List of symbols.

Symbol Description
N, u, s Population size, mutation rate, and mutation effect
nk(t),nk Number of individuals in class k at time t, steady-state value
λ = u/s, τ = ts Rescaled mutation rate and time
xk(τ),xk Population frequency in class k at time τ, steady-state value
zτ, z Abbreviation for x0(τ) and x0
k Mean fitness: k=kkxk
δxk, δk Deviations from steady state: δxk=xkxk, δk=λk
Sλ({zρ}) Path integral action depending on the path {zρ} with 0 ≤ ρτ.
Sλ*(zτ,z0), zρ* Extremal action and the associated path depending on the endpoints zτ, z0
Sλ*(z) Long time limit of Sλ*(zτ,z0) with z = zτ
Pτ(zτ|z0) Propagator from z0 to zτ in time τ
P(z) Steady-state distribution of z
γ Rate of the ratchet in units of s
σ2 Variance of x0 depending on Ns and λ
ζ2 Rescaled variance of x0 depending on λ only: ζ2 = Nseλσ2
ψk(i), φk(i) kth component of right and left eigenvectors with eigenvalue −i
ai(τ) Projection of δxk on φk(i)
α Parameter of the effective potential confining x0 (traditionally α = 0.5−0.6)

Results and Discussion

Fluctuations of the size n0 of the least loaded class can lead to its extinction. In the absence of beneficial mutations this class is lost forever (Muller 1964), and the resulting accumulation of deleterious mutations could have dramatic evolutionary consequences. Considerable effort has been devoted to understanding this process, and it has been noted that the rate at which the fittest class is lost depends strongly on the average number of individuals in the top class n0 (Haigh 1978). Later studies have shown that the rate is exponentially small in n0s if n0s1 (Jain 2008). If n0s is small, the ratchet clicks frequently and a traveling wave approach is more appropriate (Rouzine et al. 2008). However, a quantitative understanding of the n0s1 regime is still lacking.

Here, we present a systematic analysis of the problem by first analyzing how selection stabilizes the population against the destabilizing influences of mutation and genetic drift, and later we use this insight to derive an approximation to the rate of Muller’s ratchet. Before analyzing Equation 1, it is useful to realize that it implies a common unit of time for the time derivative, the mutation rate, and the selection coefficient that is of our choosing (days, months, generations, etc.). We can use this freedom to simplify the equation and reveal what the important parameters are that govern the behavior of the equation. In this case, it is useful to use s−1 as the unit of time and work with the rescaled time τ = ts. Furthermore, we formulate the problem in terms of frequencies xk = N−1 nk rather than numbers of individuals, and obtain

ddτxk=(kkλ)xk+λxk1+xkNsηk, (3)

where λ = u/s is the dimensionless ratio of mutation rate and selection strength. In other words, λ is the average number of mutations that happen over a time s−1 (our unit of time). Note that λ uniquely specifies the deterministic part of this equation and its steady-state solution xk=λkeλ/k!. The stochastic forces are proportional to 1/Ns. Again, the parameter combination (Ns)−1 has a simple interpretation as the variance of the stochastic effects accumulated over time s−1. Other than through a prefactor determining the unit of time, any quantity governed by Equation 3 can depend only on λ and Ns. Hence it is immediately obvious that the ratchet rate cannot depend on n0=Neλ alone, but must depend on n0s instead (Jain 2008). All times and rates in rescaled time units are denoted by Greek letters, while we use Arabic letters for times and rates in units of generations.

Before turning to the ratchet rate, we analyze in greater detail the interplay of deterministic and stochastic forces in Equation 3. A full time-dependent analytic solution of the deterministic model was found in Etheridge et al. (2007). Below, we present an analytic characterization of the stochastic properties of the system in a limit where stochastic perturbations are small.

Linear stability analysis

In the limit of large populations, the fluctuations of xk around the deterministic steady state xk can be analyzed in linear perturbation theory. In other words, we express deviations from the steady state as δxk=xkxk and expand the deterministic part of Equation 3 to order δxk2. This expansion

ddτδxk=kδxk+λδxk1+xkm=0(mλ)δxm=mLkmδxm (4)

defines a linear operator Lkm. A quick calculation shows that Lkm has eigenvalues κi = −i with i = 0, 1, 2.... The right eigenvector corresponding to κ0 = 0 is simply ψk(0)=xk, while the right eigenvectors for i > 0 are given by

ψk(i)=xkixk, (5)

where k numbers the coordinate of the vector. This is readily verified by direct substitution (note that xi=0 for i < 0).

The eigenvector ψk(0) corresponds to population size fluctuations that in our implementation are controlled by a carrying capacity. The eigenvalue associated with this mode in the computer simulation is large and negative and need not be considered here (see Model and Methods). All other eigenvalues are negative, which is to say that xk is a stable solution.

The eigenvectors for i > 0 have an intuitive interpretation: Eigenvector i corresponds to a shift of a fraction of the population by i fitness classes downward. Since such a shift reduces mean fitness, the fittest classes start growing and undo the shift. More generally, any small perturbation of the population distribution can be expanded into eigenvectors δxk(τ)=jψk(j)aj(τ) and the associated amplitudes aj(τ) will decay exponentially in time with rate j (remember that the unit of time is s−1). Since the amplitudes are projections of δxk onto the left eigenvectors of Lmk, we need to know those as well. For κ0 = 0, the left eigenvector is simply φk(0)=1, while the other left eigenvectors are given by

φk(i)=(1)kieλλik(ik)!,0ki, (6)

and φk(i)=0 for k > i. With the left and right eigenvectors and the eigenvalue spectrum of the deterministic system on hand, we now reinstantiate the stochastic part of the dynamics:

ddτδxk=mLkmδxm+xkNsηk. (7)

Note that we approximated the strength of noise by its value at equilibrium. This approximation is justified as long as we consider only small deviations from the equilibrium. The full xk-dependent noise term is reintroduced later when we turn to Muller’s ratchet. Substituting the representation of δxk(τ)=iψk(i)ai(τ) and projecting onto the left eigenvector φk(j), we obtain the stochastic equations for the amplitudes

ddτaj(τ)=jaj(τ)+kφk(j)xkNsηk(τ). (8)

Each noise term ηk contributes to every aj and induces correlations between the aj, but each amplitude can be integrated explicitly:

aj(τ)=τdτej(ττ)kφk(j)xkNsηk(τ). (9)

The covariances of different amplitudes are evaluated in File S1 and found to be

ai(τ)aj(τ+Δτ)=ejΔτi+jkφk(i)φk(j)xkNs. (10)

However, we are not primarily interested in the covariance properties of the amplitudes of eigenvectors, but expect that the fluctuations of the fittest class and fluctuations of the mean fitness are important for the rate of Muller’s ratchet and other properties of the dynamics of the population. To this end we express δx0(τ) and k(τ) as

δx0(τ)=j>0ψ0(j)aj(τ)=eλj>0aj(τ) (11)
δk(τ)=j>0,kkψk(j)aj(τ)=j>0jaj(τ). (12)

With Equation 10, we can now calculate the desired quantities. The calculations required to break down the multiple sums to interpretable expressions are lengthy but straightforward and detailed in File S1. Below, we present and discuss the results obtained in File S1.

Fluctuations of x0 and the mean fitness

For the variance of the fittest class δx02 and more generally its autocorrelation, we find

δx0(0)δx0(τ)=eλNs01dθθGλ(θ,τ)Gλ(θ,τ)=eλθ2eτλ(1+eτ)θeλθeλθeτ+1. (13)

The variance of the fittest class (τ = 0 in the above expression) is therefore σ2=(x0/Ns)ζ2(λ), where ζ2(λ)=01dθθ1Gλ(θ,0) is the standardized variance of the top bin, which depends only on λ. For small λ, it simplifies to ζ2(λ)12λ+O(λ2). This limit corresponds to x0 close to 1 with only a small fraction of the population carrying deleterious mutations x1λ = u/s. The opposite limit of large λ corresponds to a broad fitness distribution where the top class represents only a very small fraction of the entire population. In this limit, the leading behavior of the variance σ2 is x0logλ/Ns. The full autocorrelation function is shown in Figure 2A for different values of λ and compared to simulation results, which agree within measurement error. In our rescaled units, the correlation functions decay over a time of order 1, corresponding to a time of order 1/s in real time. More precisely, the decay time (in scaled units) increases with increasing λ as log λ.

Figure 2 .

Figure 2 

(A) The covariance of the size of the fittest class x0(0) with x0(τ) a time τ later. The normalized autocorrelation of x0 increases with λ. (B) The covariance of x0(0) with the mean fitness at time τ in the past or future. One observes a pronounced asymmetry, showing that fluctuations of the fittest class propagate toward the bulk of the fitness distribution and results in delayed fluctuations. Simulation results are shown as dashed lines; theory curves are solid. In all cases, s = 0.01 and Nx0s = 100. Note that time is measured in units of 1/s, which is the natural time scale of the dynamics.

In a similar manner, we can calculate the autocorrelation of the mean fitness

δk(0)δk(τ)=λeλNs01dθIλ(θ,τ)Iλ(θ,τ)=eτeλθ2eτλ(1+eτ)θ(θ+λθ(θeτ1)(θ1)), (14)

which asymptotes to (4Nsx0)1 for large λ at τ = 0. It is hence inversely proportional to the size of the fittest class x0. For large λ, x0 represents only a tiny fraction of the population and fluctuations of the mean can be substantial even for very large N. This emphasizes the importance of fluctuations of the size of the fittest class for properties of the distribution.

If fluctuations of the mean fitness k are driven by fluctuations of the fittest class x0, we expect a strong correlation between those fluctuations (Etheridge et al. 2007). Furthermore, fluctuations of x0 should precede fluctuations of the mean. These expectations are confirmed by the analytic result

δx0(0)δk(τ)=λNs01dθHλ(θ,τ), (15)

where

Hλ(θ,τ)={(θ1)eτ+eτλθ2λ(1+eτ)θ+eτeτλθ,τ>0(eτθ1)eeτλθ2λ(1+eτ)θ+eλθ,τ<0. (16)

This expression is shown in Figure 2B for different values of λ. The cross correlation δx0(0)δk(τ) is asymmetric in time: With larger λ, the peak of the correlation function moves slowly (logarithmically) to larger delays. This result is intuitive, since we expect that fluctuations in the fittest class will propagate to less and less fit classes and that the dynamics of the entire distribution is, at least partly, slaved to the dynamics of the top class.

In all of these three cases, the magnitude of the fluctuations is governed by the parameter Ns, while the shape of the correlation function depends on the parameter λ. Only the unit in which time is measured has to be compared to the strength of selection directly.

The rate of Muller’s Ratchet

The ratchet clicks when the size of the fittest class hits 0, and the rate of the ratchet is given by the inverse of the mean time between successive clicks of the ratchet. Depending on the average size x0 of the fittest class, the model displays very different behavior. If Nsx0 is comparable to or smaller than 1, the ratchet clicks often without settling to a quasi-equilibrium in between clicks. This limit has been studied in Rouzine et al. (2008). Conversely, if Nsx01, ratchet clicks are rare and the system stays a long time close to its quasi-equilibrium state xk. Such a scenario, taken from simulations, is illustrated in Figure 3. Figure 3A shows the distribution of x0 prior to the click, while Figure 3B shows the realized trajectory that ends at x0 = 0. Prior to extinction, x0(τ) fluctuates around its equilibrium value and large excursions are rare and short. The final fluctuation that results in the click of the ratchet is zoomed in on in Figure 3C. Compared to the time the trajectory spends near x0, the final large excursion away from the steady is short and happens in a few units of rescaled time. Translated back to generations, the final excursion took a few hundred generations (s = 0.01 in this example).

Figure 3 .

Figure 3 

An example of a click of the ratchet with N = 5 × 107, s = 0.01, and λ = 10, corresponding to an average size of the fittest class n02269. (A) The distribution of x0 averaged over the time prior to extinction. (B and C) The trajectory of x0(τ), with the part of the trajectory that ultimately leads to extinction magnified in C. The final run toward x0 = 0 takes a few time units, as expected from the results for the correlation functions, which suggest a (rescaled) correlation time of ∼log λ. Note this time corresponds to a few hundred generations since s = 0.01.

In rescaled time, the equation governing the frequency of the top class is

ddτx0(τ)=δk(τ)x0(τ)+x0(τ)Nsη0(τ), (17)

where δk=λk. The restoring force δk depends on x0, as well as on the size of the other classes xk. For sufficiently large λ, x0 is much smaller than xk with k ≥ 1, such that the stochastic force is most important for x0. The dynamics of xk, k ≥ 1, is approximately slaved to the stochastic trajectory of x0(τ). We can therefore try to find an approximation of δk(τ) in terms of x0(τ) only. The linear stability analysis of the mutation–election balance has taught us that the restoring force exerted by the mean fitness on fluctuations in x0 is delayed with the delay increasing ∝ log λ. The latter observation implies that the restoring force on x0 will depend mainly on the values of x0 some time of order log λ in the past. Such history dependence complicates the analysis, and this delay has been ignored in previous analysis, which assumed that δk(τ) depends on the instantaneous value of x0(τ) via δk(τ)=α(1x0(τ)/x0) (Stephan et al. 1993; Gordo and Charlesworth 2000; Jain 2008). The parameter α was chosen ad hoc between 0.5 and 0.6. This restoring force is akin to a harmonic potential centered around x0 and the stochastic dynamics is equivalently described by a diffusion equation for the probability distribution P(x0, τ),

τP(z,τ)=12Ns2z2zP(z,τ)αz(1z/z)zP(z,τ), (18)

where we have denoted x0 by z for simplicity. The fact that the fittest class is lost whenever its size hits 0 corresponds to an absorbing boundary condition for P(z, τ) at z = 0. For such a one-dimensional diffusion problem, the mean first passage time can be computed in closed form (Gardiner 2004) and this formula has been used in Stephan et al. (1993) and Gordo and Charlesworth (2000) to estimate the rate of the ratchet. An accurate analytic approximation to that formula has been presented by Jain (2008). For completeness, we present an alternative derivation of these results that helps interpret the more general results presented below. In the limit of interest, Nsz1, clicks of the ratchet occur on much longer time scales than the local equilibration of z. We can therefore approximate the distribution as P(z, τ) ≈ eγτ p(z), where γ is the rate of the ratchet. In this factorization, p(z) is the quasi-steady distribution shown in Figure 3A, while γ is the small rate at which P(z, τ) loses mass due to events like the one shown in Figure 3C. Inserting this ansatz and integrating Equation 18 from z to ∞, we obtain

γP(X>z)=12Nszzp(z)αz(1z/z)p(z), (19)

where P(X>z)=zdzp(z), which is ≈1 for z<z and rapidly falls to 0 for z>z. To obtain the rate γ, we solve this equation in a regime of small zz, where the term on the left is important but constant, and in a regime z ≫ (Ns)−1, where the term on the left can be neglected. For the general discussion below, it is useful to solve this equation for a general diffusion constant D(z) (here equal to z/2Ns), force field A(z) (here equal to αz(1z/z)), and a constant C

C=zD(z)p(z)+A(z)p(z) (20)

with solution

p(z)=1D(z)e0zdyA(y)D(y)[βC0zdye0ydyA(y)D(y)]. (21)

Note that this solution is inversely proportional to the diffusion constant, while the dependence on selection is accounted for by the exponential factors. For zz, C = γ and 2αNs0zdy(1z/z)2αNsz, such that

p(z)γe2Nsαz1αz,zz, (22)

where β is fixed by the boundary condition that p(z) is finite at z = 0. Note that γ = p(0)/2Ns relates the rate of extinction to p(0). To determine the latter we need to match the zz regime to the bulk of the distribution zz. As can be seen from Equation 22, the constant term C is unimportant in this regime (e2Nsαz ≫ 1). Setting C = 0 in Equation 21, we find

p(z)zzNsαzπexp[αNs(zz)2z],zNsα1. (23)

The integration constant β in Equation 21 is fixed by the normalization. Since p(z) is concentrated around z=z and has a Gaussian shape around z, the normalization factor is simply 1/2πσ2, where σ2=z/2αNs is the variance of the Gaussian. The factor z/z corresponds to the 1/D(z) term, scaled so that it equals 1 in the vicinity of z. Note that we have already calculated the variance of p(z) earlier, Equation 13, and that consistency with this result would require that α is determined by Equation 13.

The two approximate solutions, Equation 22 and Equation 23, are both accurate in the intermediate regime (Nsα)1zz, which allows us to determine the rate γ in Equation 22 by matching the two solutions. This matching implies that

γ=zNsα3πeαNseλ, (24)

which agrees with result obtained previously (Jain 2008). Note that this rate depends only on the parameters λ and Ns of the rescaled model. Since rates have units of inverse time, this expression has to be multiplied by s to obtain the rate in units of inverse generations.

However, Equation 24 does not describe the rate accurately, as is obvious from the comparison with simulation results shown in Figure 4A. The plot shows the rescaled ratchet rate γ×π/zNsα3, which according to Equation 24 should be simply exp[−αNse−λ], indicated by the black line. The plot shows clearly that the simulation results often differ from the prediction of Equation 24 by a large factor. It seems as if α needs to depend on λ, as we already noted above when comparing the variance of p(z) to Equation 13. In fact, fixing α via Equation 13 improves the agreement substantially, but still does not describe the simulations quantitatively.

Figure 4 .

Figure 4 

The ratchet rate from simulation vs. prediction. Both A and B show the ratchet rate γ, rescaled with a prefactor to isolate the exponential dependence predicted by analytic approximations; λ is color-coded. (A) Comparison of simulation results with the prediction of Equation 24, which is shown as a straight line. The approximation works only for a particular value of λ, for otherwise the exponential dependence on Nx0s is not predicted correctly. (B) Comparison of simulation results with the prediction of Equation 32, again indicated by the straight line. The exponential dependence of rate on NsSλ*(0) is well confirmed by simulation results.

The reason for the discrepancy is the time delay between δk and z, which we quantified by calculating the correlation between δk(τ) and z(τ + Δτ). Hence we cannot use an approximation where δk depends on the instantaneous value of z, but must calculate δk from the past trajectory of z. If the fittest class is the only one that is strongly stochastic, we can calulate δk(τ) for a given trajectory z(ρ), ρτ, by integrating the deterministic evolution equations for xk with k ≥ 1 with z(ρ) as an external forcing.

Equation 17 now depends not only on z(τ), but on all z(ρ) with ρτ and cannot be mapped to a diffusion equation. Nevertheless, it corresponds to a well-defined stochastic integral, known as a path integral in physics (Feynman and Hibbs 1965), which is amenable to systematic numerical approximation. To introduce path integrals, it is useful to discretize Equation 17 in time and express z(ρi) in terms of the state at time ρi−1 = ρi − Δτ and the earlier time points. For simplicity, we use the notation zi for z(ρi),

zizi1=Δτδki1zi1+zi1ΔτNsηi1, (25)

where δki1 depends on all previous time points ρj with j < i. In the limit Δτ → 0, this difference equation converges against Equation 17 interpreted in the Itô sense since the z-dependent prefactor of the noise term is evaluated at ρi−1 rather than at an intermediate time point between ρi −1 and ρi. We can express this transition probability Pτ(z|z0) between the initial state z0 and the final state z = zm as a series of integrals over all intermediate states zi for 0<i<m,

Pτ(z|z0)=i=1m1dziPΔτ(z|{zj}j<m)PΔτ(zm1|{zj}j<m1)PΔτ(z1|z0). (26)

Each of these infinitesimal transitions corresponds to solutions of Equation 25 with ηi drawn from a standard Gaussian (Lau and Lubensky 2007). Hence

PΔτ(zi|{zj}j<i)=Ns2πΔtzi1exp[Ns(zizi1Δτzi1δki1)22Δτzi1]. (27)

In the limit of many intermediate steps and small Δτ, the transition probability can therefore be written as

Pτ(z|z0)=Dzρexp[Ns0τdρ[z˙ρzρδkρ]22zρ]=DzρeNsSλ({zρ}), (28)

where Dzρ is the limit of i=1mdzi(2πΔτzi1/Ns)1/2 known as the path-integral measure, and we have replaced the discrete time index by its continuous analog. The path integral extends over all continuous path connecting the endpoints z0 and zτ = z. The functional Sλ({zρ}) in the exponent closely corresponds to the “action” in physics (Feynman and Hibbs 1965), which is minimized by classical dynamics. Here minimization of the “action” defines the most likely trajectory. Note that Sλ({zρ}) depends on the entire path {zρ} with 0 ≤ ρτ, while the functional itself depends only on λ. The strength of genetic drift appears as a prefactor of Sλ({zρ}) in the exponent.

The most likely path zρ* connecting the endpoints points z0 and zτ in time τ can be determined either by solving the Euler–Lagrange equations or by numerical minimization, see below. Along with the functional, zρ* depends only on λ. Given this extremal path, we can parameterize every other path connecting z0 and zτ as zρ=zρ*+δzρ, where δzρ vanishes at both endpoints (δz0 = δzτ = 0). Denoting the minimal action associated with zρ* by Sλ*(zτ,z0), we have

Pτ(zτ|z0)=eNsSλ*(zτ,z0)DδzρeNsδSλ({δzρ},zτ,z0)=N1eNsSλ*(zτ,z0), (29)

where N−1 factor is equal to the integral over the fluctuations, which in general depends on zρ*. The prefactor Ns in eNsδS({δzρ},zτ,z0) implies that deviations from the optimal path are suppressed in large populations. If δS({δzρ}, zτ, z0) is independent of the final point zτ, N can be determined by the normalizing Pτ(zτ|z0) with respect to zτ. In the general case, calculating the fluctuation integral is difficult, and we determine it here by analogy to the history-independent solution presented above Equations 18–24.

If the stochastic dynamics admits an (approximately) stationary distribution, Pτ(zτ|z0) becomes independent of τ and z0 and coincides with the steady-state probability distribution p(z). It therefore becomes the analog of Equation 23, which for arbitrary diffusion equations is given by the inverse diffusion constant (the prefactor z−1), multiplied by an exponential quantifying the trade-off between deterministic and stochastic forces. In this path-integral representation, the exponential part is played by eNsSλ*(z), where Sλ*(z). is a function of the final point z and λ only. The prefactor is independent of the selection term and can hence be determined through the analogy to the Markovian case discussed above

p(z)zz12πσ2eNsSλ*(z),Nsz1, (30)

where the normalization is obtained by assuming an approximately Gaussian distribution around the steady-state value z and the variance σ2 is given by Equation 13. Note that this solution is not valid very close to the absorbing boundary since this boundary is not accounted for by the path integral, at least not without some special care. As in the history-independent case discussed above, this approximate distribution should be thought of as the time-independent “bulk” distribution in P(z, τ) = e−γτp(z). To determine the rate extinction rate γ, we again need to understand how probable it is that a trajectory actually hits z = 0, given that it has come pretty close.

To this end, we need a local solution of Equation 17 in the boundary layer zz as already obtained for the history independent case in Equation 22. Once a trajectory comes close to z = 0, its fate, i.e., whether it goes extinct or returns to zz, is decided quickly. Hence we can make an instantaneous approximation for δk, which does depend on the past trajectory, but for the time window under consideration it is simply a constant, α, yet to be determined. Having reduced the problem to Equation 22 we can determine α, and hence γ, by matching of the boundary solution to Equation 30 in the regime (Ns)1zz, where both are accurate. The matching condition is

zz12πσ2eNs(Sλ*(0)+zzSλ*(z)|z=0)=γαze2αNsz, (31)

which determines α and γ by the matching requirement 2α=zSλ*(z)|z=0 and

γ=|zSλ*(z)|zeNsSλ*(0)22πσ2Sλ*(0)8πσ2eNsSλ*(0), (32)

where we approximated |zSλ*(z)|z=0Sλ*(0)/z. The variance σ2 is given by Equation 13 and depends on λ and Ns as σ2=zζ2(λ)/Ns. Note that γ is in units of s and needs to be multiplied by s for conversion to units of inverse generations. In contrast to Markovian case above, the variance of the “bulk” is no longer simply related to strength of selection near the z = 0 “boundary.”

Since we don’t know how to calculate Sλ*(0) or the most likely path zρ* analytically, we determined discrete approximations to zρ* numerically as described in Model and Methods. Examples of numerically determined most likely path and the corresponding trajectory of the mean fitness are shown in Figure 5 for different values of λ. Generically, we find a rapid reduction of the size z of the size of the fittest class such that the mean fitness has only partially responded. The inset of Figure 5A shows how changes in mean fitness δk are related to z for different λ. For large λ, the mean fitness changes only very slowly with z, which increases the probability of large excursions and hence the rate of the ratchet.

Figure 5 .

Figure 5 

(A) The most likely path x0*(τ) to extinction of the fittest class and the concomitant reduction of the mean fitness for different λ are plotted against time. Times are shifted such that x0*(0)=x0/2. The inset shows the mean fitness δk(τ) plotted against x0*(τ) for different values of λ. (B) Haigh’s factor α(λ)=Sλ*(0)eλ as a function of λ determined numerically.

This numerically determined minimal action Sλ*(0) together with the approximation Equation 32 describes the rate of the ratchet, as determined in simulations, extremely well. Figure 4B shows the same simulation data as Figure 4A, but this time rescaled by 8πσ2(λ)/Sλ*(0) as a function of NsSλ*(0). After this rescaling, we expect all data points to lie on the same curve given by exp[NsSλ*(0)], as is indeed found for many different values of u, s, and N with λ = u/s ranging from 1 to 30. Note that the vertical shift of the black line relative to the data points depends on the prefactor, which we have approximated. Hence we should not expect agreement better than to a factor of ∼2. The important point is that the exponential dependence of the rate on NsSλ*(0) is correctly captured by Equation 32.

Previous studies of Muller’s ratchet suggested that the rate depends exponentially on αNseλ (Jain 2008). To relate this to our results, we determined “Haigh’s factor” α numerically from Sλ*(0) and plotted it in Figure 5B. We find that α(λ) drops from around 0.8 to 0.3 as λ increases from 1 to 30. The previously used values 0.5–0.6 for α correspond to λ ≈ 6. Using α(λ)=Sλ*(0)eλ as shown in Figure 5B, we can recast Equation 32 into its traditional form and undo the scaling with s. In units of generations, the mean time between clicks is given by

Tclick2.5ζ(λ)α(λ)sNseλeNsα(λ)eλ, (33)

where ζ(λ) is determined by Equation 13 and the factor 2.5 is introduced to approximate the part of the prefactor that is independent of N, s, or λ. A direct comparison of this expression with simulation results is shown in File S1.

Conclusion

The main difficulty impeding better understanding of even simple models of evolution is the fact that rare events involving a few or even single individuals determine the fate of the entire population. The important individuals are those in the high fitness tail of the distribution. Fluctuations in the high fitness tail propagate toward more mediocre individuals, which dominate a typical population sample.

We have analyzed the magnitude, decay, and propagation of fluctuations of the fitness distribution in a simple model of the balance between deleterious mutations and selection. In this model, individuals in the fittest class evolve approximately neutrally. Fluctuations in the size of this class propagate to the mean, which in turn generates a delayed restoring force opposing the fluctuation. We have shown that the variance of the fluctuations in the population n0 of the top bin is proportional to n0/s and increases as log λ with the ratio λ of the mutation rate u and the mutational effect s. Fluctuations of n0 perturb the mean after a time ∼s−1log λ. These two observations have a straightforward connection: Sampling fluctuations can accumulate without a restoring force for a time s−1log λ. During this time, the typical perturbation of the top bin by drift is n0s1logλ and hence the variance is ≈ n0s−1log λ. We have used these insights in the coupling between n0, the mean fitness, and the resulting delayed restoring force on fluctuations of n0 to approximate the rate of Muller’s ratchet.

The history dependence of the restoring force has not been accounted for in previous analysis of the rate of Muller’s ratchet (Haigh 1978; Stephan et al. 1993; Gordo and Charlesworth 2000; Jain 2008) who introduced a constant factor to parameterize the effective strength of the selection opposing fluctuations in the top bin, or Waxman and Loewe (2010), who replaced all mutant classes by one effective class and thereby mapped the problem to the fixation of a deleterious allele. We have shown that to achieve agreement between theory and numerical simulation one must account for the delayed nature of selection acting on fluctuations. Comparing our final expression for the ratchet rate with that given previously (Stephan et al. 1993; Gordo and Charlesworth 2000; Jain, 2008), the history dependence manifests itself as a decreasing effective strength of selection with increasing λ = u/s. This decrease is due to a larger temporal delay of the response of the mean fitness to fluctuations of the least loaded class. History dependence is a general consequence of projecting a multidimensional stochastic dynamics onto a lower dimensional space (here, the size n0 of the fittest class). Such memory effects can be accounted for by the path-integral formulation of stochastic processes, which we used to approximate the rate of Muller’s ratchet.

Even though the model is extremely simplistic and the sensitive dependence of the ratchet rate on poorly known parameters such as the effect size of mutations, population size, and mutation rate precludes quantitative comparison with the real world, we believe that some general lessons can be learned from our analysis. The propagation of fluctuations from the fittest to less fit individuals is expected to be a generic feature of many models and natural populations. In particular, very similar phenomena arise in the dynamics of adapting populations driven by the accumulation of beneficial mutations (Tsimring et al. 1996; Rouzine et al. 2003, 2008; Cohen et al. 2005; Desai and Fisher 2007; Neher et al. 2010; Hallatschek 2011). The speed of these traveling waves is typically determined by stochastic effects at the high fitness edges. We expect that the fluctuations of the speed of adaptation can be understood and quantified with the concepts and tools that we introduced above.

Populations spread out in fitness have rather different coalescence properties than neutral populations, which are described by Kingman’s coalescent (Kingman 1982). These differences go beyond the familiar reduction in effective population size and distortions of genealogies due to background selection (Charlesworth et al. 1993; Higgs and Woodcock 1995; Walczak et al. 2011). The most recent common ancestor of such populations most likely derives from this high fitness tail and fluctuations of this tail determine the rate at which lineages merge and thereby the genetic diversity of the population (Brunet et al. 2007; Rouzine and Coffin 2007; Neher and Shraiman 2011). Thus, quantitative understanding of fluctuations of fitness distributions is also essential for understanding nonneutral coalescent processes.

Generalizing the analysis of fluctuations of fitness distributions to adapting “traveling waves” and the study of their implications for the coalescent properties of the population are interesting avenues for future research.

Supplementary Material

Supporting Information

Acknowledgments

We are grateful for stimulating discussions with Michael Desai, Dan Balick, and Sid Goyal. R.A.N. is supported by the European Research Council through Stg-260686, and B.I.S. acknowledges support from National Institutes of Health under grant GM-086793. This research was also supported in part by the National Science Foundation under grant no. NSF PHY11-25915.

Footnotes

Communicating editor: W. Stephan

Literature Cited

  1. Brunet E., Derrida B., Mueller A. H., Munier S., 2007.  Effect of selection on ancestry: an exactly soluble case and its phenomenological generalization. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 76: 041104. [DOI] [PubMed] [Google Scholar]
  2. Charlesworth B., 2012.  The effects of deleterious mutations on evolution at linked sites. Genetics 190: 5–22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Charlesworth B., Morgan M. T., Charlesworth D., 1993.  The effect of deleterious mutations on neutral molecular variation. Genetics 134: 1289–1303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Cohen E., Kessler D. A., Levine H., 2005.  Front propagation up a reaction rate gradient. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 72: 066126. [DOI] [PubMed] [Google Scholar]
  5. Desai M. M., Fisher D. S., 2007.  Beneficial mutation selection balance and the effect of linkage on positive selection. Genetics 176: 1759–1798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Etheridge A., Pfaffelhuber P., Wakolbinger A., 2007.  How often does the ratchet click? Facts, heuristics, asymptotics, pp. 365–390 in Trends in Stochastic Analysis, edited by Jochen Blath P. M., Scheutzow M. Cambridge University Press, Cambridge, UK [Google Scholar]
  7. Felsenstein J., 1974.  The evolutionary advantage of recombination. Genetics 78: 737–756 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Feynman R. P., Hibbs A. R., 1965.  Quantum Mechanics and Path Integrals. McGraw-Hill, New York [Google Scholar]
  9. Gardiner C. W., 2004.  Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences. Springer, New York [Google Scholar]
  10. Gessler D. D., 1995.  The constraints of finite size in asexual populations and the rate of the ratchet. Genet. Res. 66: 241–253 [DOI] [PubMed] [Google Scholar]
  11. Gordo I., Charlesworth B., 2000.  The degeneration of asexual haploid populations and the speed of Muller’s ratchet. Genetics 154: 1379–1387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Goyal S., Balick D. J., Jerison E. R., Neher R. A., Shraiman B. I., et al. 2012.  Dynamic mutation-selection balance as an evolutionary attractor. Genetics 191: 1309–1319 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Haigh J., 1978.  The accumulation of deleterious genes in a population: Muller’s ratchet. Theor. Popul. Biol. 14: 251–267 [DOI] [PubMed] [Google Scholar]
  14. Hallatschek O., 2011.  The noisy edge of traveling waves. Proc. Natl. Acad. Sci. USA 108: 1783–1787 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Higgs P., Woodcock G., 1995.  The accumulation of mutations in asexual populations and the structure of genealogical trees in the presence of selection. J. Math. Biol. 33: 677–702 [Google Scholar]
  16. Jain K., 2008.  Loss of least-loaded class in asexual populations due to drift and epistasis. Genetics 179: 2125–2134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Kingman J., 1982.  On the genealogy of large populations. J. Appl. Probab. 19: 27–43. [Google Scholar]
  18. Lau A. W. C., Lubensky T. C., 2007.  State-dependent diffusion: thermodynamic consistency and its path integral formulation. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 76: 011123. [DOI] [PubMed] [Google Scholar]
  19. Lynch M., Bürger R., Butcher D., Gabriel W., 1993.  The mutational meltdown in asexual populations. J. Hered. 84: 339–344 [DOI] [PubMed] [Google Scholar]
  20. Muller H. J., 1964.  The relation of recombination to mutational advance. Mutat. Res. 106: 2–9 [DOI] [PubMed] [Google Scholar]
  21. Neher R. A., Shraiman B. I., 2011.  Genetic draft and quasi-neutrality in large facultatively sexual populations. Genetics 188: 975–996 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Neher R. A., Shraiman B. I., Fisher D. S., 2010.  Rate of adaptation in large sexual populations. Genetics 184: 467–481 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Oliphant T., 2007.  Python for scientific computing. Comput. Sci. Eng. 9: 10–20 [Google Scholar]
  24. Pfaffelhuber P., Staab P. R., Wakolbinger A., 2012.  Muller’s ratchet with compensatory mutations. Ann. Appl. Probab. Available at: http://www.imstat.org/aap/future_papers.html Accessed: June 2012. [Google Scholar]
  25. Rice W. R., 1987.  Genetic hitchhiking and the evolution of reduced genetic activity of the Y sex chromosome. Genetics 116: 161–167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Rouzine I. M., Coffin J. M., 2007.  Highly fit ancestors of a partly sexual haploid population. Theor. Popul. Biol. 71: 239–250 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Rouzine I. M., Wakeley J., Coffin J. M., 2003.  The solitary wave of asexual evolution. Proc. Natl. Acad. Sci. USA 100: 587–592 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Rouzine I. M., Brunet E., Wilke C. O., 2008.  The traveling-wave approach to asexual evolution: Muller’s ratchet and speed of adaptation. Theor. Popul. Biol. 73: 24–46 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Stephan W., Kim Y., 2002.  Recent applications of diffusion theory to population genetics, pp. 72–93 in Modern Developments in Theoretical Population Genetics, edited by Slatkin M., Veuille M. Oxford University Press, Oxford [Google Scholar]
  30. Stephan W., Chao L., Smale J. G., 1993.  The advance of Muller’s ratchet in a haploid asexual population: approximate solutions based on diffusion theory. Genet. Res. 61: 225–231 [DOI] [PubMed] [Google Scholar]
  31. Tsimring L., Levine H., Kessler D., 1996.  RNA virus evolution via a fitness-space model. Phys. Rev. Lett. 76: 4440–4443 [DOI] [PubMed] [Google Scholar]
  32. Walczak A. M., Nicolaisen L. E., Plotkin J. B., Desai M. M., 2011.  The structure of genealogies in the presence of purifying selection: a ”fitness-class coalescent. Genetics 190: 753–779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Waxman D., Loewe L., 2010.  A stochastic model for a single click of muller’s ratchet. J. Theor. Biol. 264: 1120–1132 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES