Selection Limits to Adaptive Walks on Correlated Landscapes

Jorge Pérez Heredia; Barbora Trubenová; Dirk Sudholt; Tiago Paixão

doi:10.1534/genetics.116.189340

. 2016 Nov 22;205(2):803–825. doi: 10.1534/genetics.116.189340

Selection Limits to Adaptive Walks on Correlated Landscapes

Jorge Pérez Heredia ^*,¹, Barbora Trubenová ^†,¹, Dirk Sudholt ^*,², Tiago Paixão ^†,^2,³

PMCID: PMC5289853 PMID: 27881471

Abstract

Adaptation depends critically on the effects of new mutations and their dependency on the genetic background in which they occur. These two factors can be summarized by the fitness landscape. However, it would require testing all mutations in all backgrounds, making the definition and analysis of fitness landscapes mostly inaccessible. Instead of postulating a particular fitness landscape, we address this problem by considering general classes of landscapes and calculating an upper limit for the time it takes for a population to reach a fitness peak, circumventing the need to have full knowledge about the fitness landscape. We analyze populations in the weak-mutation regime and characterize the conditions that enable them to quickly reach the fitness peak as a function of the number of sites under selection. We show that for additive landscapes there is a critical selection strength enabling populations to reach high-fitness genotypes, regardless of the distribution of effects. This threshold scales with the number of sites under selection, effectively setting a limit to adaptation, and results from the inevitable increase in deleterious mutational pressure as the population adapts in a space of discrete genotypes. Furthermore, we show that for the class of all unimodal landscapes this condition is sufficient but not necessary for rapid adaptation, as in some highly epistatic landscapes the critical strength does not depend on the number of sites under selection; effectively removing this barrier to adaptation.

Keywords: speed of adaptation, correlated landscapes, weak selection regime, cost of complexity

THE question of how long it takes for a natural population to evolve complex adaptations has fascinated researchers for decades (Haldane 1957; Kimura 1961; Grant and Flake 1974; Valiant 2013). The evolution of populations can be seen as an adaptive walk across the “mutational landscape” (Gillespie 1984), the space of all possible genotypes. The speed of adaptation critically depends on how the fitness values of all genotypes are organized in this space. In particular, it depends on the number and shape of the paths leading to the optimum on this landscape. This raises both empirical and theoretical difficulties for the study of the speed of adaptation. Empirically, measuring the fitness of every possible genotype is virtually impossible. For this reason, most empirical studies focused on distributions of effects of single mutants (Eyre-Walker and Keightley 2007). However, organisms are not just the sum of their genes: gene interactions (epistasis) are pervasive and the effects of mutations will change depending on the background in which they occur (Phillips 2008). The difficulty of measuring mutational effects across multiple backgrounds grows combinatorially with the length of the genotype, and most studies are restricted to studying the effects of interactions in a local neighborhood of some genotype. In part because of this lack of knowledge about the structure of the fitness landscape, and in part due to the added difficulty of analyzing correlated landscapes, most theoretical studies have focused on landscapes in which either the fitness of genotypes (Gillespie 1983, 1984; Kauffman and Levin 1987; Orr 2002) or the effects of new mutations (Wilke 2004; Desai et al. 2007; Fogle et al. 2008) are drawn from a random distribution. The first case, adaptation on random landscapes, leads to extremely short adaptive walks and may be realistic only when the population is very close to a fitness peak (Orr 2006). In the second case, adaptation in linear landscapes, such as when the effects of mutations are drawn from a random distribution, ignores potential correlations between mutational neighborhoods and any kind of interaction between mutations.

Most studies on the speed of adaptation have focused on the limits imposed by competition between multiple beneficial mutations (Gerrish and Lenski 1998). Because of this, most models assume that populations evolve in a continuous space under a never-ending supply of beneficial mutations (Orr 2000; Wilke 2004; but see Kim and Orr 2005 for a model of a finite genome), when in reality the stage in which evolution proceeds is comprised of discrete genotypes. This fact results in a number of new and important features for the dynamics of adaptation. First, in a discrete space of genotypes, the supply of new beneficial mutations naturally decreases as adaptation occurs as a consequence of the finite size of the genome. Second, and consequently, as the population becomes more adapted, the potential for deleterious mutations increases as more and more sites become adapted. Models analyzing adaptive walks typically assume that the population or selection strength are large enough such that the probability of fixation of deleterious mutations is zero, effectively disregarding the growing difficulty of maintaining the acquired adaptations. Finally, fitness landscapes can display strong correlations between mutational neighborhoods, making the effects of new mutations not necessarily constant across the fitness landscape nor simply drawn from a random distribution. Previous attempts at analyzing the speed of adaptation in correlated neighborhoods (Kryazhimskiy et al. 2009) assumed an infinite supply of beneficial mutations and strong selection, disregarding the growing difficulty of finding new beneficial mutations and maintaining previously acquired ones. As we will show, these effects impose strong constraints to adaptation.

Other studies have focused on the properties of adaptive walks, which explicitly consider the discrete nature of the genotype space (Kauffman and Levin 1987; Orr 2002; Park et al. 2016). Many of these studies have focused on models of fitness landscapes that can display high levels of ruggedness, such as the house-of-cards model (Kingman 1978), in which fitness values are drawn randomly from some distribution; the rough Mount Fuji (Aita et al. 2000), in which fitness effects, combined with a deterministic part of fitness, are drawn randomly; or the NK model, in which the fitness effect of a locus depends, in some randomly prescribed way, on the state of K other loci (Kauffman and Weinberger 1989). Both of these classes of models lead to landscapes exhibiting multiple peaks. For this reason, these studies have focused mainly on the length of the adaptive walk, the number of substitutions that occurs before the process reaches a local peak, and how this depends on the number of local peaks in the landscape. Even though this is an empirically measurable quantity, it does not directly address the question of how long a population takes to reach this peak and how this depends on the shape of the paths leading up to it. Note that the number of substitutions is not equivalent to the time it takes to reach a peak: new mutations, even if beneficial, can be lost, and deleterious mutations can be fixed. Here, we directly address this question by asking how much time a population requires to reach a fitness peak.

To do this, instead of considering the rate of adaptation in specific fitness landscapes, which may not be informative of real trajectories since their details are unknowable; we consider classes of fitness landscapes, including many patterns of gene interactions, and focus on upper bounds for the time to reach a fitness peak. We focus on traits encoded by many genes and study how this time depends on the number of sites under selection. We argue that the scaling of this time with the length of the target sequence quantifies the complexity or “hardness” for a natural population to perform an adaptive walk on a class of landscapes. Similar to previous approaches (Gillespie 1983, 1984; Orr 2002, 2005, 2006) we consider a monomorphic population in the weak-mutation regime. However, to address the difficulties outlined above, we consider that this population evolves in a sequence space and under the combined action of mutation, selection, and drift; allowing for the possibility that deleterious mutations are fixed.

To analyze the dynamic properties of the adaptive trajectory, we take advantage of tools commonly used in the theory of randomized and evolutionary algorithms (Paixão et al. 2015b). Using these tools, we first calculate an upper bound for the time to reach an adaptive peak in a simple landscape with equal, additive contributions of all sites (loci) as a function of the number of such sites contributing to the trait. We focus on the crucial distinction between a polynomial and an exponential scaling of this time with the number of sites under selection, and argue that these two qualitatively distinct regimes correspond to situations in which adaptation is “efficient” or “inefficient,” respectively. We find conditions on selection strength that separate these two regimes, and show that populations in the weak-mutation regime (WM) can adapt efficiently, but the critical selection strength grows with the number of sites under selection, effectively setting a limit to adaptation. We generalize these results to a large family of fitness landscapes that includes very general forms of interactions between the sites under selection, only excluding forms of interactions that create multiple fitness peaks. We derive an upper limit to the time to reach a fitness peak, setting a speed limit to adaptation in these landscapes. Finally, we analyze in detail one instance of this class, an extreme form of epistasis in which mutations need to be accumulated in a particular order. We show that in this case, despite a slower speed of adaptation, the critical selection strength enabling efficient adaptation does not depend on the number of sites under selection; eliminating the limits to adaptation previously identified for simpler landscapes.

Methods

Transition probabilities

To investigate the speed of adaptation we assume the weak-mutation regime. In this regime, a new mutation is either lost or fixed in the population, replacing the previous genotype before any other mutation arises in the population. We assume that the genotype x is composed of n biallelic loci or sites $x_{i},$ and consider a trait $f (x),$ which is a function of the genotypic sequence $x,$ under constant selection gradient β such that fitness is $W (x) = 1 + β f (x) .$ The number of adapted sides in each genotype is denoted x.

In our model, at each iteration exactly one mutation occurs, which can be either beneficial with probability $p_{m}^{+} (x),$ harmful with probability $p_{m}^{-} (x),$ or neutral with the remaining probability $p_{m}^{0} (x) = 1 - p_{m}^{+} (x) - p_{m}^{-} (x) .$ These probabilities depend on the current genotype and the number of adapted sites x, and thus may change during the course of adaptation. Note that one iteration in our model does not correspond to a biological generation, but rather represents one mutation event (which takes on the order of $1 / N U$ generations to occur, where U is the genomic mutation rate).

A mutation is fixed or lost according to Kimura’s probability of fixation (Kimura 1962),

p_{fix} (Δ f) = \frac{1 - e^{- 2 β Δ f}}{1 - e^{- 2 N β Δ f}},

(1)

which depends on both the population size (N) and the fitness difference to the resident genotype (the selection coefficient in the traditional formulation, $β Δ f),$ and allows for the fixation of deleterious mutations. This model is obtained as a limit of many other models, such as the Wright–Fisher model or the Moran model, and was previously introduced in other contexts (Berg et al. 2004; Sella and Hirsh 2005; Tuğrul et al. 2015). This model is valid as long as the time for a mutation to be either fixed or lost is short compared to the time between mutations $(\approx 1 / N U) .$ This will always depend on the population size (N) and on the minimum absolute selection coefficient in the landscape.

Fitness landscapes

We start our analysis with a simple additive fitness landscape, in which all mutations have the same effect on the trait (and consequently on fitness). Fitness is formalized by the function $f_{eq} (x) = \sum_{i = 1}^{n} x_{i},$ which counts the number of correct matches (x) in a genome of length n.

We then generalize to all additive fitness landscapes by relaxing the condition of equal contributions. Fitness is defined as $f_{add} (x) : = \sum_{i = 1}^{n} x_{i} w_{i},$ where each site contributes a weight $w_{i} > 0$ to the trait, such that $\sum_{i = 1}^{n} w_{i} = W .$

Finally, we generalize our analysis even further and include all functions with a single maximum: unimodal fitness function. These functions allow arbitrary forms of epistasis, only excluding some types of reciprocal-sign epistasis which may lead to multiple peaks (Weinreich et al. 2005; Poelwijk et al. 2007). In particular, it excludes reciprocal-sign epistasis that occurs when the sign of the effect of a substitution depends on the background in which it occurs, and may lead to multiple peaks (see Poelwijk et al. 2011 for the necessary conditions and Crona et al. 2013 for the sufficient conditions for multiple peaks). We analyze in detail one instance of this class exhibiting an extreme form of epistasis, defined as $f_{ridge} (x) : = \sum_{i = 1}^{n} \prod_{j = 1}^{i} x_{j} .$ This function requires mutations to be accumulated in a particular order. See Figure 1 for illustration of used fitness functions.

Used fitness functions applied to the same genotype. Values of contributing loci are highlighted in red.

Drift analysis

To estimate the time that a population needs to find the fitness peak and its dependence on the number of genes n, we employ tools from theoretical computer science, in particular the so-called drift analysis (He and Yao 2001; Lehre and Witt 2013). In this context, drift refers to the expected progress of a population toward the fitness peak and is not to be confused with genetic drift, as traditionally used in population genetics.

Drift—the expected progress of a population toward the fitness peak in one time step—is usually denoted by $Δ (x)$ and can be calculated as the sum of the expected forward progress $Δ^{+}$ (forward drift: the product of the probability of occurrence and fixation of beneficial mutations with their effect) and the expected negative progress $Δ^{-}$ (negative drift: the same but for deleterious mutations). In our analysis, we express drift in terms of number of mutations (or states) that the population has to accumulate on its path toward the optimum.

The intuitive idea behind drift analysis is simple: it starts by underestimating (i.e., obtaining a lower bound for) the minimum expected progress toward some target state at every genotype. Then, given an initial distance to the target state, which can be pessimistically estimated as the maximum distance, one calculates an overestimation (i.e., an upper bound) of the expected time to reach this state. This is analogous to integrating a differential equation to obtain the time to reach a particular state. However, these methods are tailored to stochastic processes and can be used even for non-Markovian processes (although here we do not make use of this fact). The main advantage of these methods over more traditional Markov-chain techniques is that these allow for simple expressions for the expected time to reach some state. Traditional Markov-chain techniques can be used to this end, but they typically produce unwieldy expressions which allow for little analytical insight into the parameters that affect the earliest time to reach some state. The techniques we use here make use of controlled simplifications to the expectation of progress of the stochastic process to produce simple, but rigorous, bounds on this time (Appendix C).

Drift theorems use upper or lower bounds on the net expectation of progress, $Δ (x) = Δ^{+} (x) + Δ^{-} (x),$ to obtain bounds on the time to reach particular genotypes (Appendix C). In our analysis we use its two specific instances: variable and negative drift theorems. The variable drift theorem (Johannsen 2010) can be applied when, for any state of the system x, the expected change between two consecutive states ${E [Δ (x)]}$ is at least some positive nonincreasing function of the current state $h (x),$

E [Δ (x)] \geq h (x) > 0.

In such a case, the variable drift theorem (generalized from Johannsen 2010; see Appendix C) states that the expected time until the state with distance less than a from the target sequence is reached, starting at an initial distance of $X_{0},$ is

E [T_{\max}] \leq \frac{a}{h (a)} + \int_{a}^{X_{0}} \frac{1}{h (x)} d x .

(2)

Note that the variable drift theorem is applied to the decreasing distance to the optimum and has to be expressed accordingly in terms of decreasing number of states that have to be crossed (i.e., number of mutations necessary to reach the fitness peak). The upper integral boundary $X_{0}$ is pessimistically given by the longest path of strictly increasing fitness leading to the optimum, i.e., the maximum number of mutations that the population has to accumulate to reach the fitness peak. Using this theorem, we can calculate an upper bound on the time to reach any distance a to the optimum (lower integral boundary). By setting $a = 1,$ we can calculate an upper bound for reaching the optimum.

Conversely, the negative drift theorem (Oliveto and Witt 2011; Rowe and Sudholt 2014) can be applied when the expected change between two consecutive states is negative for all states within a given interval, i.e., the population is expected to move away from the fitness peak in some region of the state space. The negative drift theorem (Oliveto and Witt 2011; Rowe and Sudholt 2014) states the conditions on the size of this interval and on the transition probabilities that lead to an exponential time to reach the optimum. Specifically, if the transition probabilities show an exponential decay in the jump length, the time for crossing this interval is exponential in the length of the interval, with overwhelming probability. The exact statement is given in Appendix C. To express these scalings, we use asymptotic notation as explained in Cormen et al. (2009).

Simulations

All simulations were initialized from the $(0, \dots, 0)$ genotypic sequence, and parameters N and β were kept constant throughout the run, unless stated otherwise. At every iteration of a run, one site was chosen uniformly at random to mutate, changing its value $x_{i}$ to $1 - x_{i} .$ The fitness difference of the resulting genotype to the resident genotype is evaluated and Equation 1 is used to compute the probability that it replaces the resident genotype. We ran this cycle until either the fittest genotype is fixed, some fraction of the maximum fitness is reached, or some threshold number of iterations is reached (6 × 10⁴, Figure 2B; or 10⁸, Figure 4).

(A) Time required to reach the fitness peak in function $f_{eq}$ as a function of genome size. Solid black line represents the mean of 100 runs for given n and shaded area their SDs. Dashed line represents the theoretical upper bound on this expectation: $(1 + \frac{1}{2 β}) n \ln (n) + n .$ $N β$ was set to 100. (B) A sharp threshold on the strength of selection for the speed of adaptation. Black line represents the mean time to reach the fitness peak for a constant genome size $(n = 500)$ and selection strength $(β = 0.1),$ with increasing population size N, and shaded areas represent the SD. Dashed line represents the critical value of selection strength $[2 (N - 1) β = \ln n]$ separating the polynomial and exponential regimes for the time to reach the fitness peak. Simulations were stopped if they took longer than $6 \times 10^{4}$ iterations.

Time to reach the fitness peak of $f_{ridge},$ a member of the unimodal class of functions. (A) A visualization of the landscape induced by this function for $n = 8.$ z-coordinate represents trait values (bottom cluster z = 0, top genotype z = n). Links between genotypes (•) represent mutations with the only path of strictly increasing fitness from $0^{n}$ to the peak highlighted in black. (B) Symbols represent averages (of 100 runs) of the time to reach the peak (•) or to reach 50% of the maximum fitness (▪). Shaded areas represent their SDs. Dashed line represents the bound $O (n^{2}) .$ Parameters were set to $N = 100$ and $β = 0.1.$

Data availability

The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article. Code to perform simulations is available upon request.

Results

In this manuscript, we investigate how the time to adaptation varies with the number of sites under selection for several classes of fitness landscapes corresponding to different choices of the trait function $f (x) .$ It should be noted that the time we refer to here is measured in number of mutations that are “tried” before the target genotype is reached, and so it is measured in units of mutation rate $1 / N U,$ where U is the genomic mutation rate.

We start by showing that on a simple landscape, in which all mutations have the same effect on the trait (and subsequently on fitness), there is a critical selection strength that allows populations to efficiently reach or approach the fitness peak. This threshold grows with the number of sites under selection, effectively setting a limit to the number of sites that can be adapted under constant selection. We then generalize our results to general additive landscapes, independent of the distribution of mutational effects. Next, we show that for the class of all landscapes with a single peak, which includes very general forms of gene interactions, this critical threshold is sufficient, but not necessary, to obtain an upper bound on the time to reach the fitness peak. We demonstrate that there are landscapes for which a constant selection strength allows efficient adaptation of arbitrary numbers of sites.

Adaptation time in simple additive landscapes

One of the simplest scenarios for adaptation is when all sites—genes or loci—contribute equally to fitness. This leads to a fitness landscape where the fitness of a genotype depends only on the number of correct matches to a target sequence. We formalize this scenario by the function $f_{eq} (x) = \sum_{i = 1}^{n} x_{i},$ which counts the number of correct matches (x) in a genome of length n (Figure D1). This function induces a structure in sequence space in which the fraction of beneficial mutations decreases linearly as a function of the distance to the optimum. We use this function to determine under which conditions populations can efficiently climb simple fitness peaks.

For each new mutation, the probability that it is beneficial depends only on the number of beneficial mutations already fixed (x), and therefore the expectation of increase in fitness (forward drift) is $Δ^{+} (x) = (n - x) / n \cdot p_{fix} (1) .$ The probability of occurrence of a deleterious mutation grows with the number of beneficial mutations already fixed, $x / n,$ and thus the negative drift is $Δ^{-} (x) = [(- x / n) \cdot p_{fix} (- 1)] .$ Therefore, the net expectation of progress $Δ (x)$ is:

\begin{matrix} Δ (x) = p_{fix} (1) \cdot [\frac{n - x}{n} - \frac{x}{n} \cdot \frac{p_{fix} (- 1)}{p_{fix} (1)}] \\ = p_{fix} (1) \cdot [\frac{n - x}{n} - \frac{x}{n} \cdot e^{- 2 (N - 1) β}] \end{matrix}

(3)

(see Appendix B, Lemma 2). This expectation is always positive as long as $2 (N - 1) β \geq \ln (c n),$ for some constant $c > 1.$ This condition states that, for the expectation of progress to be always positive, the selection differential ${[p_{fix} (1) / p_{fix} (- 1)] = e^{2 (N - 1) β}}$ needs to be large enough to counteract the deleterious mutation pressure in the worst possible case [which occurs at a genotype which is one mutation away from the optimum, when $x = n - 1$ and so $p_{m}^{+} = 1 / n$ and $p_{m}^{-} = 1 - (1 / n) < 1] .$ If this condition is met we can write:

Δ (x) \geq p_{fix} (1) \cdot (\frac{n - x}{n} - \frac{1}{c n}) = h (x) \geq 0.

(4)

We can now apply Equation 2 (see Methods; Johannsen 2010) to the decreasing number of zeros $z = n - x$ (number of remaining mutations that need to be accumulated), to obtain an upper bound on the expected time $T_{\max}$ to reach the fitness peak:

\begin{matrix} T_{\max} \leq \frac{1}{p_{fix} (1)} \cdot (\frac{c n}{c - 1} + \int_{1}^{n} \frac{c n}{z c - 1} d z) \\ \leq \frac{1}{p_{fix} (1)} \cdot [\frac{c n}{c - 1} + n \cdot \ln (\frac{c n - 1}{c - 1})] \\ \leq \frac{1}{p_{fix} (1)} \cdot [n \ln (n) + O (n)] \end{matrix}

where the maximum number of mutations that are required to reach the fitness peak is n.

This expression quantifies the impact of the length of the target sequence on the time (in units of mutation rate) to attain it. It shows that the time required to evolve adaptations involving larger numbers of sites will simply require a polynomial number of extra mutational “trials” (Figure 2A).

A critical threshold for efficient adaptation

Our analysis above shows that for a population in the WM to be able to reach the fitness peak efficiently, it is sufficient that selection strength grows logarithmically with the number of sites under selection $[2 (N - 1) β \geq \ln (c n)] .$ We next show that if selection strength is below this threshold, these populations cannot efficiently find the optimum, as the time required to reach the optimum on $f_{eq}$ becomes exponential in n with overwhelming probability. Populations in the WM therefore exhibit a phase transition behavior: changing $2 N β$ by a constant factor leads to a difference between polynomial and exponential expected time to reach the optimum on $f_{eq} .$

To show this, we consider a genotype some distance away from the optimum, $x = n - n^{ϵ / 2},$ for some small positive ε. At this point, the fraction of mutations that are beneficial becomes $(n - x) / n \leq n^{ϵ / 2 - 1} .$ Correspondingly, the fraction of deleterious mutations is $(x / n) \geq 1 - n^{ϵ / 2 - 1} .$ Now, if selection strength is between $1 \leq N β \leq (1 - ϵ) / 2 \ln n,$ we can bound $e^{\pm 2 N β}$ to obtain the probabilities of fixation of beneficial or deleterious mutations $p_{fix} (1) \leq 2 β / (1 - e^{- 2})$ and $p_{fix} (- 1) \geq 2 β n^{ϵ - 1},$ respectively. Substituting in the net expectation of progress (Equation 3) we obtain:

Δ (x) \leq \frac{2 β}{1 - e^{- 2}} \cdot n^{ϵ / 2 - 1} - \frac{2 β \cdot n^{ϵ}}{n} \cdot (1 - n^{ϵ / 2 - 1}) \leq - c \cdot β \cdot n^{ϵ - 1},

where c is a positive constant. This means that, if selection strength $N β$ is between $1 \leq N β \leq (1 - ϵ) / 2 \ln (n)$ then, as the population approaches the optimum, there will be a region $(x \geq n - n^{ϵ / 2})$ where the expectation of progress is negative. This happens because selection is not strong enough to counteract the deleterious mutation pressure that has built up. We can then apply the negative drift theorem to the number of zeros on an interval of $[0, n^{ϵ / 2}]$ and show that the expected time to reach the peak is exponential in the number of loci (see Appendix D for details). This shows that if selection strength $N β$ is below $(1 - ϵ) / 2 \ln (n),$ more complex adaptations, involving a larger number of sites, will take exponentially longer to evolve (Figure 2B).

This result sets a limit to the complexity that can be evolved: for a fixed selection strength, there is a maximum number of sites that can be efficiently adapted. Typically, selection is deemed efficient when $N s > 1$ (corresponding to Nβ in our framework). Our result defines the conditions for which selection is efficient in a multilocus setting, taking mutational pressure into account. It shows that even if $N β > 1$ at every locus, for selection to be able to drive a population to the fitness peak, $N β$ needs to scale nonlinearly with the length of the target sequence $[2 N β > \ln (c n)] .$

Efficient approach to the optimum

The results above show that the time required to reach the optimum scales nonlinearly with the number of sites under selection. However, it can be argued that populations do not have to reach the optimum, they only need to get sufficiently close.

Using Equation 4 together with Equation 2, we show that the population can reach a genotype in which at least $n - a$ sites are well adapted, where $a > 1$ is the number of mutations to the optimum. The population reaches such a genotype in:

\begin{matrix} T_{\max} \leq \frac{a}{h (a)} + \int_{a}^{n} \frac{1}{h (x)} d x \\ \leq \frac{1}{p_{fix} (1)} \cdot (\frac{a c n}{a c - 1} + \int_{a}^{n} \frac{n c}{z c - 1} d z) \\ \leq \frac{1}{p_{fix} (1)} \cdot [n \ln (n / a) + O (n)] . \end{matrix}

(5)

This means that the time to reach a genotype with a constant fraction of well-adapted sites (for example, at which 99% of sites are adapted, a = 0.01n) is linear on the length of the target sequence. This is a significant improvement over the time to actually reach the fitness peak, showing that this time is dominated by the last few steps. It should be noted that the time to reach any constant distance from the optimum (say n − a, with a constant) is of the form $n \ln n .$

General additive fitness landscapes

We now generalize the previous results to linear landscapes regardless of their distribution of mutational effects. When all mutations contribute equally to the trait, it is sufficient that selection strength is such that $2 (N - 1) β \geq \ln (c n)$ for the population to be able to reach the fitness peak in polynomial time. More generally, if each site contributes a weight $w_{i} > 0$ to the trait, such that $f_{add} (x) : = \sum_{i = 1}^{n} x_{i} w_{i}$ and $\sum_{i = 1}^{n} w_{i} = W$ for a certain selection strength, there will be a critical weight $w *$ such that all $n - n *$ sites of weight $w_{i} > w *$ will be able to be reached in polynomial time, reaching a fitness of at least $W * = W - n * w * .$

Analogously to the equal-effects case (Equation 3), we can write the net expectation of progress on these $n - n *$ “large effect” sites:

Δ (x) \geq p_{fix} (w *) [\frac{n - n * - x}{n} - e^{- 2 (N - 1) β w *}] .

This expression is positive on $x \in [0, n - n *]$ as long as $2 (N - 1) β w * > \ln (c n)$ for some constant $c > 1,$ which determines the critical threshold: $w * > \ln (c n) / [2 (N - 1) β] .$ This leads to the lower bound on the expectation of progress:

Δ (x) \geq p_{fix} (w *) \cdot \frac{c (n - n * - x) - 1}{c n} .

As before, we can use Equation 2 to obtain an upper bound for the expected time to reach fitness at least $W * \leq W - n * w *$ (see Appendix E, Equation E2):

\begin{matrix} T_{W *} \leq \frac{1}{p_{fix} (w *)} {\frac{c n}{c (n - n *) - 1} + n \ln [\frac{c (n - n *) - 1}{c - 1}]} \\ = \frac{n \ln (n) + O (n)}{p_{fix} (w *)} . \end{matrix}

Since the $n - n *$ sites of large effect behave essentially like the equal-effects case, for a constant selection strength there is a maximum fitness that can be reached in $O (n \ln n) .$ Reaching a fraction of this fitness takes linear time (Equation 5); while adapting further requires exponential time, which we confirmed with simulations (Figure 3). Without knowledge of the actual distribution of effects, it is impossible to determine $n *$ and hence the fitness level that is guaranteed to be reached in polynomial time. However, since all effects are drawn from the same distribution, $n *$ will always be a constant fraction of n [since n* is simply the fraction of weights below w*, $n * = CDF (w *) \cdot n] .$ These scalings are valid for any distribution of effects and represent hard limits on this class of fitness functions.

Time to reach different fractions of the total fitness for an exponential distribution of effects. For a fixed selection strength, there is a maximum fraction of the fitness that can be reached in $O (n \ln n)$ mutational trials. The time to reach lower fractions of this fitness scales linearly, while the time to adapt further scales exponentially. Data points correspond to means of 1000 runs, and lines correspond to the indicated scalings. N was set to 20, $β = 0.1,$ and the effects were distributed as $w_{i} \sim Exp (1) .$

Adaptation in a general class of landscapes

We now turn to a general class of fitness landscapes: unimodal functions. This class includes all functions that have only one maximum; meaning that it includes functions displaying arbitrary forms of epistasis, only excluding some types of sign epistasis which may lead to multiple peaks (Weinreich et al. 2005; Poelwijk et al. 2007), as mentioned before.

The defining feature of the members of the unimodal class is that any genotype other than the peak has at least one mutational neighbor (a genotype that differs exactly by one mutation) of higher fitness value. We denote the minimum of these trait increases (or decreases) in the landscape by δ. Because each genotype necessarily has at least one neighbor that increases the trait value by at least δ we can bound the expectation of improvement by $Δ^{+} \geq [(δ / n) p_{fix} (δ)] .$ In this class of functions, there are potentially $n - 1$ deleterious mutations, each contributing $[(Δ f_{i} / n) \cdot p_{fix} (- Δ f_{i})]$ to the total backward expectation. If the population size is $N \geq 3$ we can bound $p_{fix} (- Δ f_{i}) \leq e^{- 2 β (Δ f_{i} - δ)} \cdot p_{fix} (- δ)$ (see Appendix B, Lemma 3; and Appendix F), which implies that $p_{fix}$ decreases exponentially for deleterious mutations, and the worst case of these mutations is actually when $Δ f_{i} = \max {1, δ},$ yielding a backward expectation of $Δ^{-} \geq - (1 + δ) p_{fix} (- δ)$ and a total expectation of improvement:

\begin{matrix} Δ (x) \geq δ \cdot p_{fix} (δ) [\frac{1}{n} - \frac{1 + δ}{δ} \cdot \frac{p_{fix} (- δ)}{p_{fix} (δ)}] \\ \geq δ \cdot p_{fix} (δ) [\frac{1}{n} - (1 + 1 / δ) \cdot e^{- 2 (N - 1) β δ}] . \end{matrix}

(6)

This net expectation of progress is positive as long as $2 (N - 1) β δ \geq \ln [(1 + 1 / δ) c n],$ for some constant $c > 1.$ Therefore, for some constant $γ > 0$ this expectation then becomes simply:

Δ (x) \geq γ \cdot \frac{δ \cdot p_{fix} (δ)}{n} .

(7)

We can then use Equation 2 with the maximum and minimum fitness differences (d and δ, respectively) as integral limits to calculate an upper bound for all functions in this class. Note that in this case we are applying the drift analysis with respect to the fitness rather than to the number of one-bits in the trait:

\begin{matrix} T_{unimodal} \leq O [\frac{n δ}{δ \cdot p_{fix} (δ)}] + O [\int_{δ}^{d} \frac{n}{δ \cdot p_{fix} (δ)} d x] \\ = O [\frac{n}{p_{fix} (δ)} \cdot \frac{d}{δ}] . \end{matrix}

(8)

This bound depends on the length d, and as such is not independent of the instance of the function class we are considering. It should be noted that the upper bound of Equation 8 can be loose, as can be seen by comparing to the previous results for linear function (which are part of the unimodal function class): the fitness range d is of size n, entailing a bound for the time to adaptation of $O (n^{2})$ when, in reality, the time on the linear function class grows slower $O (n \ln n) .$ Moreover, this bound does not guarantee that the time to reach the peak is polynomial: there could exist members of the unimodal function class for which $d / δ$ is exponential; e.g., when d is constant but the Hamming path leading to the optimum is exponential, then δ will be exponentially small (Rudolph 1997b; Droste et al. 2002), making the bound exponential. Next, we focus on one particular member of this function class for which this bound is tight.

One extreme form of epistatic landscape is when mutations need to be accumulated in a particular order, having no effect outside of this order (Kondrashov and Kondrashov 2001). This creates a landscape in sequence space characterized by a fitness ridge and vast neutral plateaus leading to the optimum (Figure 4A). We formalize this landscape by the function $f_{ridge} (x) = \sum_{i = 1}^{n} \prod_{j = 1}^{i} x_{j},$ which counts the number of leading ones in a bit string (Rudolph 1997a). To increase its current fitness, it is necessary to flip the first zero in the genome to one. Flipping any other zero to one will result in a mutant offspring with the same fitness as its parent, while flipping any of the leading ones into zero can result in a drastic fitness loss. In this landscape, the fitness range d has size n (see Figure 4A), which, according to the bound from Equation 6, leads to a time of $O (n^{2}) .$ We now show that this bound is tight.

In this landscape, the probability of a beneficial mutation is $1 / n,$ as only flipping the first zero in the genome will result in a fitness increase. However, as more ones can follow this locus (neutral mutations that may have fixed neutrally), the increase in trait value can be higher than one. This means that we can bound the expectation of forward progress by $Δ^{+} (x) \geq [(1 / n) \cdot p_{fix} (1)] .$ Mutating the j-th position of the x already well-adapted sites will result in a fitness decrease of size $k = x - j + 1$ yielding: $Δ^{-} \geq - \frac{1}{n} \sum_{k = 1}^{n - 1} k \cdot p_{fix} (- k) .$ However, as long as $N \geq 3$ the fixation probability decreases exponentially for deleterious mutations and can overcome the linear impact k of mutation. Specifically, we can bound each $p_{fix} (- k) \leq e^{- 2 β (k - 1)} p_{fix} (- 1)$ (see Appendix B, Lemma 3; and Appendix F) and, using $β \geq 1 / 2$ and the fact that $\sum_{k = 1}^{∞} k \cdot e^{- (k - 1)} = [e^{2} / {(e - 1)}^{2}] \leq 3,$ we can write for the net expectation of progress:

\begin{matrix} Δ \geq \frac{1}{n} p_{fix} (1) [1 - 3 \cdot \frac{p_{fix} (- 1)}{p_{fix} (1)}] \\ \geq \frac{1}{n} p_{fix} (1) [1 - 3 \cdot e^{- 2 (N - 1) β}] . \end{matrix}

Since $N \geq 3$ and $β \geq 1 / 2$ then $2 (N - 1) β \geq 2,$ the expectation of progress is always positive and reduces to

Δ \geq \frac{6}{8} \cdot \frac{p_{fix} (1)}{n} .

We can use Equation 2 to obtain an upper bound on the expected time to reach the fitness peak:

T_{\max} \leq \frac{8 n}{6 p_{fix} (1)} + \frac{8 n}{6 p_{fix} (1)} \int_{1}^{n} 1 d x = O [\frac{n^{2}}{p_{fix} (1)}] .

This shows that even if the path to the optimum is narrow and mutations have to occur in a specific order, populations in the WM are able to climb the fitness peak relatively fast in polynomial time (Figure 4B). Remarkably, this result holds for any selection strength above a constant value; indicating that, for landscapes of this type, there are no limits to the number of loci that can be adapted in polynomial time, as long as selection strength is above this constant value. The main reason for this is that even though the number of deleterious mutations still increases as the population approaches the optimum, most of them are much less likely to be fixed due to their strong deleterious effects. This leads to a much less pronounced slowdown of the speed of adaptation as the population approaches the optimum. Notice that in this family of landscapes, the time to reach a fraction of the maximum fitness is also $O (n^{2}) .$

Discussion

There are at least two ways in which a trait can be considered “complex”: in the number of sites contributing to it, by analogy with complex traits as defined in quantitative genetics; and in the way that it is coded for by the sites that contribute to it, i.e., the complexity of the landscape in which it exists. In this manuscript we address the limits imposed by both of these factors.

We have shown that for a large class of fitness landscapes, it is sufficient that selection strength $N β$ is above a threshold $(\ln n)$ for populations to be able to climb to the fitness peak efficiently. We proved that in the class of additive landscapes, this condition is both sufficient and necessary; implying a limit to the number of sites that can be efficiently adapted at a constant selection strength. Nevertheless, this critical threshold does not seem severe: selection strength should increase logarithmically with the number of sites under selection, indicating that a small increase in selection gradient or population size translates to an exponential increase in the length of the sequences that can be evolved efficiently. Moreover, this condition is not always necessary: when considering a class of epistatic landscapes characterized by a single mutational path of strictly increasing fitness, we found that this limit no longer applies. A constant selection strength will enable a population to climb to the optimum, albeit at a slower rate than in an additive landscape, regardless of the number of sites contributing to the trait. These results quantify the complexity of adaptive walks beyond linear landscapes or uncorrelated mutational neighborhoods. They illustrate how the structure of the fitness landscape can impose limits to adaptation and how these stem directly from how the landscape conditions the distribution of effects of single mutants, in particular of deleterious mutations. Furthermore, they reveal how the buildup of mutational pressure that necessarily counteracts selection imposes a limit on the selection strength required for populations to overcome the entropic effects of mutation and make progress toward fitter genotypes.

Sewall Wright (1932) introduced the concept of fitness landscape mostly as a metaphor for the adaptation of populations, since at the time there was no hope of measuring the fitness associated with each individual genotype. Even then, this metaphor was incredibly successful at shaping evolutionary thought (Provine 2001). It is not surprising then that more recently, with the increased availability of genetic manipulation techniques, this metaphor has been taken seriously and is now the subject of experimental study (see de Visser and Krug 2014 for a recent review). The fitness landscapes of several experimental systems have now been at least partially mapped. Most of these landscape reconstructions have been performed for a small number of genes. For example, Khan et al. (2011) have reconstructed the fitness landscape defined by five beneficial mutations that fixed in a long-term evolution experiment. However, new techniques are allowing for the reconstruction of much larger fitness landscapes. For example, Kinney et al. (2010) constructed and determined the phenotype of 100s of 1000s of mutants of the lac-operon, enabling them to partially reconstruct its expression landscape. Our results inform about the consequences these fitness landscapes can have for the adaptation of populations. They speak not just about the time to reach a fitness peak, but also informs about how quickly mutations are accumulated on the way (by using Equation 2 to calculate the time required to get to a fixed distance to the optimum). If the structure of the landscape is such that many paths lead to the optimum, then the time to fix the next beneficial mutation should increase with $n \log n / (n - x)$ (Equation 5), where x is the current number of fixed beneficial mutations. On the other hand, if relatively few paths leading to an optimum exist, our results suggest that the time until the next beneficial mutation is fixed is best described by a power law (Appendix F, Figure F2). These results suggest that long-term evolution experiments could be used to identify the class of landscapes on which the population is evolving. In fact, it is interesting to note that the fitness dynamics of a long-term evolution experiment is actually best described by a power law, rather than by a hyperbolic curve (Lenski et al. 2015). Even though other explanations are possible, such as a combination of diminishing returns epistasis with clonal interference (Wiser et al. 2013); our results show that this could also be explained by a fitness landscape in which fitness effects are highly conditional on the background, such that most deleterious mutations are of large effect and very few paths uphill exist. The existence of extensive diminishing returns epistasis (Khan et al. 2011) in this landscape would not be enough to explain this pattern of fitness increase. Whether clonal interference or the existence of many highly deleterious mutations is responsible for this specific pattern of fitness increase could be tested experimentally by closer inspection of the population dynamics and mutational assays of the evolved populations.

The results we show here are related to Fisher’s “cost of complexity” (Fisher 1930). Fisher defined the cost of complexity as the slowdown of adaptation due to the diminishing probability of generating beneficial mutations as the number of traits under selection increases. This has been attributed to the pleiotropic nature of mutations in the geometric model (Wagner et al. 2008), since mutations simultaneously affect all traits under selection. Our approach is similar in the sense that we study the dependency of the speed of adaptation on the number of sites under selection. One could think of each site of a genetic sequence as a trait under selection, albeit taking only discrete values, and mutations acting on one trait only (since we consider single mutations only). Our results show that pleiotropy is not the only source for this cost of complexity since, even when mutations act on single “traits,” there is a penalty for having longer sequences. Instead, our results highlight that mutational pressure and the structure of the fitness landscape play an important role on this cost of complexity. This is a direct consequence of the fact that we deal with discrete sequence spaces and not with a continuous trait space, as in traditional formulations of Fisher’s geometric model.

The distinction between polynomial and exponential time is crucial to the question of the evolution of complexity: if only a few mutations need to fix to reach a fitness peak, this distinction is less relevant since the times would be short. This distinction becomes relevant when dealing with complex adaptations involving many sites or genes. Chatterjee et al. (2014) investigated the adaptation time in a landscape in which genotypes are assigned one of two possible fitness values (high and low) and no smooth fitness gradients exist. They show that even when the fraction of high-fitness genotypes is large, populations will take at least exponential time to reach one of them. Their results relate directly to the infeasibility of evolving complex innovations, adaptations that cannot be reached by gradual steps. However, for many such apparent innovations, paths of gradually increasing fitness actually do exist, such as in the case of de novo gene evolution, where duplications and insertions or deletions are believed to pave the way for new genes (Tautz and Domazet-Lošo 2011). Instead, we focused on adaptations for which at least one (potentially tortuous) path exists and show that for these natural selection can efficiently evolve them. For this reason, we have only dealt with constant unimodal landscapes, that is, landscapes in which a single peak exists and that remain constant over evolutionary time (in fact we have provided upper bounds for the adaptation time for all such landscapes). The results we present here, however, allow for insight on how populations climb any peak, regardless of which peak is approached.

Many measures have been proposed to characterize the structure of fitness landscapes (Szendro et al. 2013), ranging from the roughness to slope ratio $(r / s)$ (Aita et al. 2001) or correlations of fitness effects (Ferretti et al. 2016), to Fourier decompositions of the landscape (Stadler 1996). These are often seen as measures of hardness of a landscape or problem, the intuition being that it will be more difficult for a hill climber to reach a fitness peak in more epistatic landscapes. Under this reasoning, the “easiest” landscape would be one where each locus contributes equally and independently to fitness, such as $f_{eq}$ in the present manuscript. Indeed, this is the case for hill climbers in which mutations are deterministically accepted or rejected based on whether they increase or decrease fitness (Droste 2002), regardless of the magnitude of their effect (random adaptive walks), or in which the best available mutation is always accepted (greedy hill climbers) (Macken and Perelsont 1989; Park et al. 2016). However, such adaptive walks may not be adequate models of natural adaptation, except perhaps when the fitness effects of all mutations are extremely large. In fact, we have previously shown that the time to reach a fitness peak can differ significantly between a deterministic hill climber and the model for the evolution of natural populations that we use here (Paixão et al. 2015b). This shows that the dynamics of adaptation depend not only on the structure of the fitness landscape but also on the mode of evolution, and suggests that perhaps a more meaningful classification of landscapes should include information about the dynamics of the populations evolving on it, in addition to the landscape’s geometric properties. The results we present here open the door to such classification, at least for the weak-mutation regime.

Crucially, we assume the WM in which mutations are fixed or lost sequentially. This assumes that population sizes or mutation rates are low enough that no new mutations appear before the previous one has either been fixed or lost. When populations are large enough so that several segregating mutations coexist, the time it takes for a single beneficial mutation to fix increases since it necessarily competes with other beneficial mutations (Gerrish and Lenski 1998). However, the rate of adaptation will continue to increase with $N U,$ at least until the infinite population regime is reached, since the time between mutations will decrease faster than the time to fixation of beneficial mutations (Park et al. 2010). Thus, the upper bounds for the time to reach a fitness peak should hold, albeit being less tight than for the weak-mutation regime, even for larger populations.

There has been a renewed interest in computational approaches to the theory of evolution (Valiant 2013; Chastain et al. 2014). In this manuscript, we have introduced methods developed and commonly used in evolutionary computation for the analysis of randomized algorithms to the evolutionary biology community and show that these can be successfully applied to problems in this field. These methods facilitate the study of adaptive walks on complex fitness landscapes. Such a collaboration between both fields, enabled by the recent development of a unifying framework for evolutionary processes (Paixão et al. 2015a), has the potential to shed light on more complex evolutionary processes. For example, similar mathematical tools exist that allow for the analysis of polymorphic populations which could allow for the exploration of the adaptive process beyond the WM in arbitrary fitness landscapes (Corus et al. 2014). These results have the potential to illuminate a number of other fundamental limits to adaptation by natural populations.

Acknowledgments

The authors thank two anonymous reviewers for their insightful comments on a previous version of this manuscript. This project received funding from the European Union’s Seventh Framework Programme for research, technological development, and demonstration under grant agreement 618091 Speed of Adaptation in Population Genetics and Evolutionary Computation.

Appendix A: Weak-Mutation Regime as an Algorithm

To analyze the rate of adaptation in WM using drift analysis, we apply techniques from the analysis of stochastic processes and randomized algorithms. To this end, we cast WM as a randomized algorithm as follows:

Algorithm 1. WM. Choose $x \in {0, 1}^{n}$ uniformly at random.

repeat

y←mutate(x)

$Δ f = f (y) - f (x)$ Choose $r \in [0, 1]$ uniformly at random

if $r < p_{fix} (Δ f)$ then

x←y

end if

until stop

Appendix B: Probability of Fixation Inequalities

Here we derive the upper and lower bounds for $p_{fix} (Δ f)$ that are used throughout the manuscript. The bounds for $Δ f > 0$ show that $p_{fix}$ is roughly proportional to the fitness difference between solutions $β Δ f .$

Lemma 1: Probability of fixation. For every $β \in ℝ^{+}$ and $N \in ℕ^{+}$ the following inequalities hold. If $Δ f > 0$ then

\frac{2 β Δ f}{1 + 2 β Δ f} \leq p_{fix} (Δ f) \leq \frac{2 β Δ f}{1 - e^{- 2 N β Δ f}} .

(B1)

If $Δ f < 0$ then

\frac{- 2 β Δ f}{e^{- 2 N β Δ f}} \leq p_{fix} (Δ f) \leq \frac{e^{- 2 β Δ f}}{e^{- 2 N β Δ f} - 1} .

(B2)

Proof. In the following, we frequently use $1 + x \leq e^{x}$ and $1 - e^{- x} \leq 1$ for all $x \in ℝ$ as well as $e^{x} \leq 1 / (1 - x)$ for $x < 1.$

If $Δ f > 0,$

p_{fix} (Δ f) = \frac{1 - e^{- 2 β Δ f}}{1 - e^{- 2 N β Δ f}} \geq 1 - e^{- 2 β Δ f} \geq 1 - \frac{1}{1 + 2 β Δ f} = \frac{2 β Δ f}{1 + 2 β Δ f}

as well as

p_{fix} (Δ f) = \frac{1 - e^{- 2 β Δ f}}{1 - e^{- 2 N β Δ f}} \leq \frac{2 β Δ f}{1 - e^{- 2 N β Δ f}} .

If $Δ f < 0,$ using the fact that $e^{- x} - 1 \leq e^{- x} :$

p_{fix} (Δ f) = \frac{e^{- 2 β Δ f} - 1}{e^{- 2 N β Δ f} - 1} \leq \frac{e^{- 2 β Δ f}}{e^{- 2 N β Δ f} - 1} .

Similarly:

p_{fix} (Δ f) = \frac{e^{- 2 β Δ f} - 1}{e^{- 2 N β Δ f} - 1} \geq \frac{e^{- 2 β Δ f} - 1}{e^{- 2 N β Δ f}} \geq \frac{- 2 β Δ f}{e^{- 2 N β Δ f}} .

The next lemma shows that the probability of accepting an improvement of $Δ f$ is exponentially bigger $(in N β Δ f)$ than accepting its symmetric fitness variation $- Δ f .$

Lemma 2: Probability of fixation ratio. For every $β \in ℝ^{+},$ $Δ f \in ℝ,$ and $N \in ℕ^{+}$

\frac{p_{fix} (- Δ f)}{p_{fix} (+ Δ f)} = e^{- 2 (N - 1) β Δ f} .

(B3)

Proof.

\begin{matrix} \frac{p_{fix} (- Δ f)}{p_{fix} (+ Δ f)} = \frac{e^{2 β Δ f} - 1}{e^{2 N β Δ f} - 1} \cdot \frac{1 - e^{- 2 N β Δ f}}{1 - e^{- 2 β Δ f}} \\ = \frac{e^{2 β Δ f}}{e^{2 N β Δ f}} = e^{- 2 (N - 1) β Δ f}, \end{matrix}

where we have applied the relation $(e^{x} - 1) / (1 - e^{- x}) = e^{x} .$

Lemma 3: Exponential decrease of the probability of fixation. Let $N \in ℕ$ \{0,1, 2}, $δ \in ℝ^{+},$ $β \in ℝ^{+},$ and $Δ f > 0,$ then

p_{fix} (- Δ f) \geq e^{2 βδ} \cdot p_{fix} (- Δ f - δ)

Proof. Using the definition of $p_{fix}$ we can rewrite the statement as:

\frac{e^{2 β Δ f} - 1}{e^{2 β N Δ f} - 1} \geq e^{2 β δ} \cdot \frac{e^{2 β (Δ f + δ)} - 1}{e^{2 β N (Δ f + δ)} - 1},

defining $x : = e^{2 β Δ f}$ and $y : = e^{2 β δ}$ we can simplify the expression to

\Leftrightarrow \frac{x - 1}{x^{N} - 1} \geq y \cdot \frac{x y - 1}{{(x y)}^{N} - 1}

using the result for the sum of a geometric series $\sum_{k = 0}^{N - 1} x^{k} = (x^{N} - 1) / (x - 1)$ yielding:

\Leftrightarrow \frac{1}{\sum_{k = 0}^{N - 1} x^{k}} \geq y \cdot \frac{1}{\sum_{k = 0}^{N - 1} {(x y)}^{k}} \Leftrightarrow \sum_{k = 0}^{N - 1} {(x y)}^{k} \geq \sum_{k = 0}^{N - 1} x^{k} y .

If we extract the first and last term of both sums we obtain

\Leftrightarrow 1 + {(x y)}^{N - 1} + \sum_{k = 1}^{N - 2} {(x y)}^{k} \geq y + x^{N - 1} y + \sum_{k = 1}^{N - 2} x^{k} y .

(B4)

Let us focus now on the left-hand side of the previous equation. Since x,y > 1, we have that (xy)^k > x^ky for k ≥ 1 and therefore

\begin{array}{r} 1 + {(x y)}^{N - 1} + \sum_{k = 1}^{N - 2} {(x y)}^{k} \geq 1 + x^{N - 1} y^{N - 1} + \sum_{k = 1}^{N - 2} x^{k} y & (y + x^{N - 1} y - y - x^{N - 1} y = 0) \\ = y + x^{N - 1} y + 1 + x^{N - 1} y^{N - 1} - y - x^{N - 1} y + \sum_{k = 1}^{N - 2} x^{k} y \\ = y + x^{N - 1} y + (1 - y) + (y^{N - 2} - 1) x^{N - 1} y + \sum_{k = 1}^{N - 2} x^{k} y & (N \geq 3 \Rightarrow y^{N - 2} \geq y) \\ \geq y + x^{N - 1} y + (1 - y) + (y - 1) x^{N - 1} y + \sum_{k = 1}^{N - 2} x^{k} y \\ = y + x^{N - 1} y + (y - 1) \cdot (x^{N - 1} y - 1) + \sum_{k = 1}^{N - 2} x^{k} y & (y > 1 and x^{N - 1} > 1) \\ \geq y + x^{N - 1} y + \sum_{k = 1}^{N - 2} x^{k} y . \end{array}

Then, the claim (B4) is proven and so is the lemma’s statement.

Appendix C: Drift Theorems

The additive drift theorem was first introduced by He and Yao (2001). For simplicity we show the formulation from Lehre and Witt (2013).

Theorem 4: Additive drift, theorem 1 in Lehre and Witt 2013. Let ${(X_{t})}_{t \geq 0}$ be a stochastic process over some bounded state space $S \subseteq ℝ_{0}^{+},$ and $T_{0} : = \min {t | X_{t} \leq 0}$ the first hitting time of state 0. Assume that $E (T_{0} | X_{0}) < ∞ .$ Then:

If $E (X_{t} - X_{t + 1} | X_{0}, …, X_{t}; X_{t} > 0) \geq δ_{u}$ then $E (T_{0} | X_{0}) \leq X_{0} / δ_{u} .$
If $E (X_{t} - X_{t + 1} | X_{0}, …, X_{t}) \leq δ_{l}$ then $E (T_{0} | X_{0}) \geq X_{0} / δ_{l} .$

Both results are conditioned to a starting point $X_{0},$ but by applying the law of total expectation we can avoid the starting condition obtaining $E (T_{0}) \leq E (X_{0}) / δ_{u}$ and $E (T_{0}) \geq E (X_{0}) / δ_{l}$ for the first and second result, respectively. The proof (in Lehre and Witt 2013) mainly makes use of Doob’s optional-stopping theorem that can be found in standard textbooks on martingales (for example in Williams 1991, theorem 10.10).

Theorem 5: Generalized variable drift theorem. Consider a stochastic process $X_{t}$ on $ℕ_{0} .$ Suppose there is a monotonic increasing function h: $ℝ^{+} \to ℝ^{+}$ such that the function $1 / h (x)$ is integrable on [1, m], and with expected progress toward the optimum $Δ_{k}$ such that

Δ_{k} \geq h (k)

for all $k \in {a, \dots, m} .$ Then the expected first hitting time of any state from ${0, \dots, a - 1}$ for a $a \in ℕ$ is at most:

\frac{a}{h (a)} + \int_{a}^{m} \frac{1}{h (x)} d x .

Proof. The following proof is adapted from the proof of Rowe and Sudholt (2014), theorem 1.

Let

g (x) = {\begin{array}{l} \frac{x}{h (a)} & if x < a \\ \frac{a}{h (a)} + \int_{a}^{x} \frac{1}{h (z)} d z & if x \geq a . \end{array}

Note that g is strictly monotone increasing and hence invertible. Whenever the random sequence g(X_t) hits state 0, this implies that X_t has hit a state in ${0, \dots, a - 1} .$ Hence, the hitting time of any state ${0, \dots, a - 1}$ is no larger than the first hitting time of the random sequence $g (X_{t})$ of the state 0.

If $x \geq a$ and $y \geq a$ then

g (x) - g (y) = \int_{y}^{x} \frac{1}{h (z)} d z \geq \frac{x - y}{h (x)}

(since 1/h(z) is positive and monotone decreasing) and if $x \geq a$ and $y < a$ then

g (x) - g (y) = \frac{a}{h (a)} + \int_{a}^{x} \frac{1}{h (z)} d z - \frac{y}{h (a)} \geq \frac{a - y}{h (a)} + \frac{x - a}{h (x)} \geq \frac{x - y}{h (x)} .

So, for any $k \in {1, \dots, m},$

\begin{array}{l} E [g (X_{t}) - g (X_{t + 1}) | g (X_{t}) = g (k)] \\ = E [g (X_{t}) - g (X_{t + 1}) | X_{t} = k] \\ \geq E [(X_{t} - X_{t + 1}) / h (X_{t}) | X_{t} = k] \\ = \frac{Δ_{k}}{h (k)} \geq 1. \end{array}

So by the additive drift theorem (Theorem 4), the first hitting time of 0 by the sequence $g (X_{t})$ is bounded above by g(m). The result follows.

In the manuscript, we use the negative drift theorem with self-loops presented in Rowe and Sudholt (2014) (an extension of the negative drift theorem by Oliveto and Witt 2011, 2012, to stochastic processes with large self-loop probabilities). It is stated here for the sake of completeness.

Theorem 6: Negative drift with self-loops. Consider a Markov process $X_{0}, X_{1},…$ on ${0, \dots, m}$ and suppose there exists integers a, b with $0 < a < b \leq m$ and $ε > 0$ such that for all $a \leq k \leq b$ the expected drift toward 0 is

E (k - X_{t + 1} | X_{t} = k) < - ε \cdot (1 - p_{k, k})

where $p_{k, k}$ is the self-loop probability at state k. Further assume there exists constants $r, δ > 0$ (i.e., they are independent of m) such that for all $k \geq 1$ and all $d \geq 1$

p_{k, k - d}, p_{k, k + d} \leq \frac{r (1 - p_{k, k})}{{(1 + δ)}^{d}}

where $p_{k, l}$ is the transition probability from state k to state l. Let T be the first hitting time of a state at most a, starting from $X_{1} \geq b .$ Let $l = b - a .$ Then there is a constant c > 0 such that

\Pr (T \leq 2^{c l / r}) = 2^{- Ω (l / r)} .

Appendix D: Adaptation Time in Simple Additive Landscapes

In our manuscript, simple peaks are represented by function $f_{eq} (x) : = \sum_{i = 1}^{n} x_{i}$ that assumes that all alleles (bits) contribute to the fitness with weight equal to 1 (Figure D1). Each mutation therefore increases or decreases the fitness by 1.

Theorem 7: Efficiently climbing simple peaks. If $2 (N - 1) β \geq ln (cn)$ with $β \in ℝ^{+}$ and $c > 1,$ then the expected optimization time of WM on $f_{eq}$ with local mutations is

\frac{n \ln (n) + O (n)}{p_{fix} (1)} \leq (1 + \frac{1}{2 β}) \cdot [n \ln (n) + O (n)]

for every initial search point.

Proof. Let us denote by x the number of one-bits. The drift can be expressed as a combination of a forward and a backward drift

Δ (x) = Δ^{+} (x) - Δ^{-} (x),

where the forward drift is the probability of mutation flipping a zero-bit $(n - x / n)$ multiplied by the probability of accepting such a mutation $[p_{fix} (1)] .$ Note that all mutations in this fitness landscape will change the state x by ±1. Analogously, the backward drift is given by the probability of a negative mutation occurring $(x / n)$ and fixing in the population with probability $[p_{fix} (- 1)] .$ Therefore, the total expected progress is

\begin{matrix} Δ (x) = \frac{n - x}{n} \cdot p_{fix} (1) - \frac{x}{n} \cdot p_{fix} (- 1) \\ = p_{fix} (1) \cdot [\frac{n - x}{n} - \frac{x}{n} \cdot \frac{p_{fix} (- 1)}{p_{fix} (1)}] . \end{matrix}

Using Lemma 2 we get

Δ (x) = p_{fix} (1) \cdot [\frac{n - x}{n} - \frac{x}{n} \cdot e^{- 2 (N - 1) β}]

and since $2 (N - 1) β \geq \ln (c n)$ with $c > 1,$ we can bound $Δ (x)$ from below by

Δ (x) \geq p_{fix} (1) \cdot (\frac{n - x}{n} - \frac{1}{c n}) > 0.

To find the upper bound on the expected time that WM needs to find the fitness peak, we apply the variable drift theorem to the decreasing number of zeros $z = n - x :$

Δ (x) \geq p_{fix} (1) \cdot \frac{z c - 1}{c n} = h (z) .

The number of zeros changes from n (in the worst case scenario) to 1 (the last state that is not optimum), defining the boundaries of the integral

\begin{matrix} E (T | X_{0}) \leq \frac{1}{h (1)} + \int_{1}^{n} \frac{1}{h (z)} d z \\ = \frac{1}{p_{fix} (1)} \cdot \frac{c n}{c - 1} + \int_{1}^{n} \frac{1}{p_{fix} (1)} \cdot \frac{c n}{z c - 1} d z \\ = \frac{1}{p_{fix} (1)} \cdot [\frac{c n}{c - 1} + n \cdot \ln (\frac{c n - 1}{c - 1})] \\ \leq \frac{1}{p_{fix} (1)} \cdot [O (n) + n \cdot \ln (n \cdot \frac{c}{c - 1})] \\ = \frac{1}{p_{fix} (1)} \cdot [O (n) + n \cdot \ln (n) + n \cdot \ln (\frac{c}{c - 1})] \\ = \frac{n \ln (n) + O (n)}{p_{fix} (1)} . \end{matrix}

(D1)

Alternatively, we can use $p_{fix}$ bounds (B1) to obtain

E (T | X_{0}) \leq (1 + \frac{1}{2 β}) \cdot [n \ln (n) + O (n)] .

(D2)

Theorem 8: Efficient approach to the optimum. If $2 (N - 1) β \geq \ln (c n)$ with $c > 1$ and $β \in ℝ *,$ then the expected time of WM on $f_{eq}$ to first reach a solution quality of at least $n - a$ is

\frac{n \ln (n / a) + O (n)}{p_{fix} (1)} \leq (1+ \frac{1}{2 β}) \cdot [n \ln (n / a) + O (n)]

for every initial search point.

Proof. The proof is as before, showing that the drift with regards to the number of zeros is at least $h (z) = p_{fix} (1) \cdot \frac{z c - 1}{c n}$ for search points with z zeros, for a positive constant c. Then, by applying Theorem 5 to the number of zeros, we get an upper bound of

\begin{matrix} E (T | X_{0}) = \frac{a}{h (a)} + \int_{a}^{n} \frac{1}{h (z)} d z \\ = \frac{1}{p_{fix} (1)} \cdot \frac{a c n}{a c - 1} + \frac{n}{p_{fix} (1)} \cdot \int_{a}^{n} \frac{c}{z c - 1} d z \\ = \frac{1}{p_{fix} (1)} \cdot [\frac{a c n}{a c - 1} + n \cdot \ln (\frac{c n - 1}{c a - 1})] \\ = \frac{1}{p_{fix} (1)} \cdot {O (n) + n \cdot \ln [O (n / a)]} \\ = \frac{1}{p_{fix} (1)} \cdot [O (n) + n \cdot \ln (n / a)] \end{matrix}

using $p_{fix}$ bounds (B1)

E (T | X_{0}) \leq (1 + \frac{1}{2 β}) \cdot [n \ln (n / a) + O (n)] .

(D3)

Corollary 9. For $a \leq n^{1 - ε}$ for $ε > 0,$ the upper bound from Theorem 8 is $O [(n \log n) / p_{fix} (1)] .$ For $a = Ω (n),$ e. g., a = 0.001n, we get $O [n / p_{fix} (1)] .$ For $a = Θ (n / \log^{k} n)$ for any constant k > 0 we get $O [(n \log \log n) / p_{fix} (1)] .$

Theorem 10: A critical threshold for hill climbing. If $1 \leq N β \leq \frac{1 - ε}{2} \ln n$ for some $0 < ε < 1,$ then the optimization time of WM with local mutations on $f_{eq}$ is at least $2^{c n^{ε / 2}}$ with probability $1 - 2^{- Ω (n^{ε / 2})},$ for some constant $c > 0.$

Proof. To prove this theorem, the negative drift theorem (Theorem 6) will be applied, taking the number of zeros as distance function to the optimum. Our notation refers to numbers of ones for simplicity. Let $p_{x, x \pm 1}$ be the probability that WM will make a transition from a search point with x ones, to one with x ± 1 ones, and assuming $x \geq n - n^{ε / 2},$ then the expected drift toward the optimum is bounded as follows

p_{x, x + 1} = \frac{n - x}{n} \cdot p_{fix} (1) \leq n^{ε / 2 - 1} \cdot p_{fix} (1)

since $p_{fix} (1) \leq \frac{2 β}{1 - e^{- 2 N β}}$

\leq n^{ε / 2 - 1} \cdot \frac{2 β}{1 - e^{- 2 N β}} \leq n^{ε / 2 - 1} \cdot \frac{2 β}{1 - e^{- 2}} .

On the other hand,

\begin{matrix} p_{x, x - 1} \geq \frac{x}{n} \cdot p_{fix} (- 1) \geq \frac{n - n^{ε / 2}}{n} \cdot p_{fix} (- 1) \\ = p_{fix} (- 1) \cdot (1 - n^{ε / 2 - 1}) \end{matrix}

using $e^{2 N β} \leq e^{(1 - ε) \ln n} = n^{1 - ε}$

\geq \frac{2 β \cdot n^{ε}}{n} \cdot (1 - n^{ε / 2 - 1}) .

The expected drift $Δ (x)$ is hence at most

\begin{matrix} Δ (x) \leq \frac{2 β}{1 - e^{- 2}} \cdot n^{ε / 2 - 1} - \frac{2 β \cdot n^{ε}}{n} \cdot (1 - n^{ε / 2 - 1}) \\ = 2 β \cdot n^{ε / 2 - 1} \cdot [\frac{1}{1 - e^{- 2}} - n^{ε / 2} \cdot (1 - n^{ε / 2 - 1})] \\ = - Ω (β \cdot n^{ε - 1}) . \end{matrix}

Now, the self-loop probability is at least $p_{x, x} = 1 - p_{x, x + 1} - p_{x, x - 1} = 1 - O (β n^{ε - 1}),$ hence the first condition of the drift theorem is satisfied. Since there are only local mutations, the second condition on exponentially decreasing transition probabilities follows immediately. The negative drift theorem, applied to the number of zeros on an interval of $[0, n^{ε / 2}],$ proves the claimed result.

Appendix E: General Additive Fitness Landscapes

General additive fitness landscape is defined by the function $f_{add} (x) : = \sum_{i = 1}^{n} x_{i} w_{i},$ where $w_{i} > 0$ is a weight with which each site contributes to the trait, such that $\sum_{i = 1}^{n} w_{i} = W .$

For $f_{eq},$ we showed that $2 (N - 1) β \geq \ln (c n)$ for c > 1 is sufficient to get a positive drift. In a more general sense, for a bit of weight w we get a positive drift on that bit if $2 (N - 1) β w \geq \ln (c n) .$ Call all such bits large effect sites or heavy, then, by the same arguments as for $f_{eq},$ WM optimizes all heavy bits in the same time bound as for $f_{eq} .$ The only sites we cannot guarantee to fix in polynomial time are those with effect smaller than $w *,$ where $w *$ defines a threshold on the distribution of effects separating the loci “easily” adapted from the “small effect” ones. The total contribution of these sites is at most $n w * .$

Theorem 11: General additive fitness landscapes. Let $w_{1}, \dots, w_{n}$ and $W : = \sum_{i = 1}^{n} w_{i} .$ Then, WM with $2 (N - 1) β w * \geq \ln (c n)$ and $c > 1$ finds a solution of fitness at least

\sum_{i = 1}^{n - n *} w_{i} \geq W - n * w * \geq W - n w * = W (1 - w * \frac{n}{W})

in expected time at most

\frac{n \ln (n) + O (n)}{p_{fix} (w *)} \leq (1 + \frac{1}{2 β w *}) \cdot [n \ln (n) + O (n)]

where $w *$ is the minimum weight we want to optimize and $n *$ the number of weights with value less than $w * .$

Proof. If $w * \geq (W / n),$ the statement is trivial as then the lower bound on the fitness is nonpositive.

Without loss of generality, we assume that the weights are ordered in ascending order: $w_{1} \leq w_{2} \leq \dots \leq w_{n} .$ Now, when $w * < (W / n)$ and ignoring the $n *$ weights such that $w_{i} < w *, i = 1, \dots, n *,$ ${note that w * \in [\frac{\ln (c n)}{2 (N - 1) β}, \frac{W}{n})},$ we lower bound the positive drift by the probability of flipping one of the zero-bits with a weight bigger than $w *$ times the fixation probability, underestimated for the case where that bit has exactly a weight of $w * :$

Δ^{+} (x) \geq \frac{n - n * - x}{n} \cdot p_{fix} (w*) .

Where x is the number of one-bits. For the backward drift, we look to the worst expected impact of one single bit $(1 / n) \cdot p_{fix} (- w *),$ then by applying linearity of expectations we obtain

Δ^{-} (x) \geq - p_{fix} (−w*) .

The total expectation of the progress toward the optimum is therefore

Δ (x) \geq p_{fix} (w*) [\frac{n - n * - x}{n} - \frac{p_{fix} (−w*)}{p_{fix} (w*)}] .

Using Lemma 2 and then introducing $2 (N - 1) β w * \geq \ln (c n),$ we get

\begin{matrix} Δ (x) \geq p_{fix} (w*) (\frac{n - n * - x}{n} - e^{- 2 (N - 1) β w *}) \\ \geq p_{fix} (w*) \cdot (\frac{n - n * - x}{n} - \frac{1}{c n}) \\ = p_{fix} (w*) \cdot \frac{c (n - n * - x) - 1}{c n} > 0. \end{matrix}

(E1)

Now we apply the variable drift theorem to the number of zeros z in the $n - n *$ bits that we want to optimize, i.e., $z = n - n * - x :$

Δ (x) \geq p_{fix} (w*) \cdot \frac{c z - 1}{c n} = h (z),

which is always positive if $c > 1$ . The integral range will go from the farthest point to the optimum (all of the n − n* heaviest weights being zero) to the closest (only one bit of the n − n* heaviest weights being zero)

\begin{matrix} E (T | X_{0}) \leq \frac{1}{h (1)} + \int_{1}^{n - n *} \frac{1}{h (z)} d z \leq \frac{1}{h (1)} + \int_{1}^{n} \frac{1}{h (z)} d z \\ = \frac{1}{p_{fix} (w *)} \cdot \frac{c n}{c - 1} + \int_{1}^{n} \frac{1}{p_{fix} (w *)} \cdot \frac{c n}{c z - 1} d z \\ = \frac{1}{p_{fix} (w *)} \cdot [\frac{c n}{c - 1} + n \cdot \ln (\frac{c n - 1}{c - 1})] \\ \leq \frac{1}{p_{fix} (w *)} \cdot [O (n) + n \cdot \ln (\frac{c n}{c - 1})] \\ = \frac{1}{p_{fix} (w *)} \cdot [O (n) + n \cdot \ln (\frac{c}{c - 1}) + n \ln (n)] \\ = \frac{n \ln (n) + O (n)}{p_{fix} (w *)} . \end{matrix}

Alternatively, using $p_{fix}$ bounds (B1) we can get an alternative expression

E (T | X_{0}) \leq (1 + \frac{1}{2 β w *}) \cdot \frac{n \ln (n) + O (n)}{w *} .

(E2)

Appendix F: Adaptation in Unimodal Landscapes

Theorem 12. WM with $β \geq 1 / 2,$ $N \in ℕ$ \ ${0, 1, 2},$ and $2 (N - 1) β δ \geq \ln [(1 + 1 / δ) c n]$ with $c > 1$ can optimize every unimodal function in at most

O [\frac{n}{p_{fix} (δ)} \cdot \frac{d}{δ}] = O [n \cdot \frac{d}{δ} \cdot (1 + \frac{1}{2 β δ})]

where $d, δ \in ℝ^{+}$ are respectively the maximum and minimum fitness difference between any two search points.

Proof. Usually the variable drift theorem is applied over the genotype space, i.e., the Boolean hypercube (or some characteristic of it like the number of ones), however in this proof we will apply it on the phenotypic level, i.e., the fitness function.

Let us denote by x any nonoptimal search point. Mutation can only produce points in the Hamming neighborhood, we pessimistically assume that only flipping one of these n points leads to an improvement (the remaining n − 1 Hamming neighbors will have a worse fitness) and that its size is the minimum possible value $δ,$ then the forward drift can be bounded by

Δ^{+} (x) \geq \frac{δ}{n} \cdot p_{fix} (δ) .

For the backward drift, we consider all the remaining $n - 1$ Hamming neighbors and denoting by g(k) > 0 the absolute fitness difference between the new and the old search point when flipping bit k we obtain:

Δ^{-} (x) \geq - \sum_{k = 1}^{n - 1} \frac{1}{n} \cdot g (k) \cdot p_{fix} [- g (k)] .

Since $N \geq 3,$ we can apply Lemma 3 which means that p_fix decreases exponentially for deleterious mutations. Specifically, we can bound $p_{fix} [- g (k)] \leq e^{- 2 β [g (k) - δ]} \cdot p_{fix} (- δ)$ obtaining

Δ^{-} (x) \geq - \frac{1}{n} \cdot \sum_{k = 1}^{n - 1} g (k) \cdot e^{- 2 β [g (k) - δ]} \cdot p_{fix} (- δ)

since $g (k) \geq δ,$ we can introduce $β \geq 1 / 2$ yielding

Δ^{-} (x) \geq - \frac{1}{n} \cdot \sum_{k = 1}^{n - 1} g (k) \cdot e^{- [g (k) - δ]} \cdot p_{fix} (- δ) .

(F1)

The value of $g (k) > 0$ that maximizes $g (k) \cdot e^{- [g (k) - δ]}$ is $g (k)$ = 1, however, when $δ > 1$ is not a feasible solution [note that $g (k) \geq δ]$ and the maximum will be at g(k) = δ. Therefore,

g (k) \cdot e^{- [g (k) - δ]} \leq {\begin{array}{l} e^{δ −1} \leq 1 & if 0 < δ \leq 1 \\ δ & if δ > 1. \end{array}

We can upper bound these two cases by its sum $(1 + δ) .$ Introducing this back in (F1) yields

Δ^{-} (x) \geq - \frac{1}{n} \cdot \sum_{k = 1}^{n - 1} (1 + δ) \cdot p_{fix} (- δ) \geq - (1 + δ) \cdot p_{fix} (- δ) .

Now we can compute the total drift

\begin{matrix} Δ (x) \geq \frac{δ}{n} \cdot p_{fix} (δ) - (1 + δ) \cdot p_{fix} (- δ) \\ \geq δ \cdot p_{fix} (δ) [\frac{1}{n} - (1 + 1 / δ) \cdot \frac{p_{fix} (- δ)}{p_{fix} (δ)}] . \end{matrix}

And following the usual steps, applying Lemma 2 and introducing $2 (N - 1) β δ \geq \ln [(1 + 1 / δ) c n],$ we obtain

\begin{array}{l} \geq δ \cdot p_{fix} (δ) \cdot [\frac{1}{n} - (1 + 1 / δ) \cdot e^{- 2 (N - 1) β δ}] \\ \geq δ \cdot p_{fix} (δ) \cdot (\frac{1}{n} - \frac{1}{c n}) \end{array}

since $c > 1,$ the previous expression is positive and we can state

Δ (x) = Ω [\frac{δ \cdot p_{fix} (δ)}{n}] .

Finally, we apply the variable drift theorem with integral limits for the biggest fitness difference (d) and the minimum (δ)

\begin{matrix} E (T | X_{0}) \leq O [\frac{n δ}{δ \cdot p_{fix} (δ)}] + O [\int_{δ}^{d} \frac{n}{δ \cdot p_{fix} (δ)} d z] \\ = O [\frac{n d}{δ \cdot p_{fix} (δ)}] \end{matrix}

using $p_{fix}$ bounds (B1) we obtain an alternative formula

E (T | X_{0}) = O [\frac{n d}{δ} \cdot (1 + \frac{1}{2 β δ})] .

Climbing Fitness Ridges

The function $f_{ridge}$ that counts the number of leading ones in a bit string is defined as $f_{ridge} (x) = \sum_{i = 1}^{n} \prod_{j = 1}^{i} x_{j} .$ To increase current fitness by mutation, it is necessary to flip the leftmost zero-bit to one. Flipping any other zeros to one will result in a mutant offspring with the same fitness as its parent, while flipping any of the leading ones can result in a drastic fitness loss (Figure F1).

Theorem 13: Expected optimization time for $f_{ridge}$ . The expected optimization time (Figure F2) of WM with local mutations, $β \geq 1 / 2,$ and $N \in ℕ$ \ ${0, 1, 2}$ on $f_{ridge}$ is

O [\frac{n^{2}}{p_{fix} (1)}] = O [n^{2} \cdot (1 + \frac{1}{2 β})] .

Proof. For this problem, x will denote the number of leading ones in the bit string. We lower bound the forward drift by the probability of mutation choosing the first nonzero bit $(1 / n)$ and its acceptance probability of being flipped $[p_{fix} (1)] .$ For the backward drift, notice that flipping the j-th leading one will imply a fitness decrease of $k = x - j + 1$ $(note that 1 \leq k \leq n - 1)$ but, as we will show, the exponential decrease of $p_{fix}$ for deleterious mutations will overcome this effect, yielding a total positive drift toward the optimum:

Δ^{+} (x) \geq \frac{1}{n} \cdot p_{fix} (1)

Δ^{-} (x) \geq - \frac{1}{n} \cdot \sum_{k = 1}^{n = 1} k \cdot p_{fix} (- k) .

Since $N \geq 3,$ we can call Lemma 3 to simplify the backward drift by using $p_{fix} (- k) \leq e^{- 2 β (k - 1)} \cdot p_{fix} (- 1)$ yielding

Δ^{-} (x) \geq - \frac{1}{n} \cdot \sum_{k = 1}^{n - 1} k \cdot e^{- 2 β (k - 1)} \cdot p_{fix} (−1) .

Introducing $β \geq 1 / 2,$ we obtain

\begin{matrix} Δ^{-} (x) \geq - \frac{1}{n} \cdot p_{fix} (- 1) \cdot \sum_{k = 1}^{∞} k \cdot e^{- (k - 1)} \\ = - \frac{1}{n} \cdot p_{fix} (- 1) \cdot \frac{e}{{(e - 1)}^{2}} \\ \geq - \frac{3}{n} \cdot p_{fix} (- 1) . \end{matrix}

Now we compute the total drift,

\begin{matrix} Δ (x) \geq \frac{1}{n} \cdot p_{fix} (1) - \frac{3}{n} \cdot p_{fix} (- 1) \\ = \frac{p_{fix} (1)}{n} \cdot [1 - 3 \cdot \frac{p_{fix} (- 1)}{p_{fix} (1)}], \end{matrix}

calling Lemma 2 yields

Δ (x) \geq \frac{p_{fix} (1)}{n} \cdot [1 - 3 \cdot e^{- 2 (N - 1) β}],

using $N \geq 3$ and $β \geq 1 / 2$ we can lower bound $2 (N - 1) β$ by 2 obtaining

\begin{matrix} Δ (x) \geq \frac{p_{fix} (1)}{n} \cdot (1 - \frac{3}{e^{2}}) \\ \geq \frac{6 p_{fix} (1)}{8 n} . \end{matrix}

Finally, we apply the variable drift theorem to the number of bits after the x leading ones $z = n - x$

\begin{matrix} Δ (x) \geq \frac{6}{8 n} \cdot p_{fix} (1) = h (z) \\ E (T | X_{0}) \leq \frac{8 n}{6 p_{fix} (1)} + \int_{1}^{n} \frac{8 n}{6 p_{fix} (1)} d z \\ = O [\frac{n^{2}}{p_{fix} (1)}] . \end{matrix}

Using the bounds on $p_{fix}$ (B1) one gets

E (T | X_{0}) \leq O [n^{2} \cdot (1 + \frac{1}{2 β})] .

Figure D1 — f_eq function. Fitness increases linearly with increasing number of ones.

Figure F1 — f_ridge function, various mutations, n = 8. For each genotype, only one mutation is positive, while many are either neutral or negative. Red color represents new mutations, green color represents *free riders*, which are loci adding to fitness that had no fitness effect before a suitable mutation occurred.

Figure F2 — Fitness as a function of time for different genome sizes for f_ridge. Solid gray lines represent the mean of 100 simulations for n = 500, 1000, and 5000, and dashed black lines represent best-fit power laws of the form $a \cdot t^{b} .$ Fitness is scaled by the maximum fitness (n) and time scaled by $n^{2} .$ This shows that the time to reach the peak is well estimated by $O (n^{2}),$ and that the rate of approach is well approximated by a power law. Parameters were set to $N = 100$ and $β = 0.1.$

Footnotes

Communicating editor: J. Hermisson

Literature Cited

Aita T., Uchiyama H., Inaoka T., Nakajima M., Kokubo T., et al. , 2000. Analysis of a local fitness landscape with a model of the rough Mt. Fuji-type landscape: application to prolyl endopeptidase and thermolysin. Biopolymers 54: 64–79. [DOI] [PubMed] [Google Scholar]
Aita T., Iwakura M., Husimi Y., 2001. A cross-section of the fitness landscape of dihydrofolate reductase. Protein Eng. 14: 633–638. [DOI] [PubMed] [Google Scholar]
Berg J., Willmann S., Lässig M., 2004. Adaptive evolution of transcription factor binding sites. BMC Evol. Biol. 4: 42. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chastain E., Livnat A., Papadimitriou C., Vazirani U., 2014. Algorithms, games, and evolution. Proc. Natl. Acad. Sci. USA 111: 10620–10623. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chatterjee K., Pavlogiannis A., Adlam B., Nowak M. A., 2014. The time scale of evolutionary innovation. PLOS Comput. Biol. 10: e1003818. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cormen T. H., Leiserson C. E., Rivest R. L., Stein C., 2009. Introduction to Algorithms, Ed. 3 The MIT Press, Cambridge, MA. [Google Scholar]
Corus, D., D.-C. Dang, A. V. Eremeev, and P. K. Lehre, 2014 Level-based analysis of genetic algorithms and other search processes, pp. 912–921 in Parallel Problem Solving from Nature—PPSN XIII (Lecture Notes in Computer Science, Vol. 8672), edited by T. Bartz-Beielstein, J. Branke, B. Filipič, and J. Smith. Springer, New York. [Google Scholar]
Crona K., Greene D., Barlow M., 2013. The peaks and geometry of fitness landscapes. J. Theor. Biol. 317: 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
Desai M. M., Fisher D. S., Murray A. W., 2007. The speed of evolution and maintenance of variation in asexual populations. Curr. Biol. 17: 385–394. [DOI] [PMC free article] [PubMed] [Google Scholar]
de Visser J. A. G. M., Krug J., 2014. Empirical fitness landscapes and the predictability of evolution. Nat. Rev. Genet. 15: 480–490. [DOI] [PubMed] [Google Scholar]
Droste S., 2002. Analysis of the (1+1) EA for a dynamically changing onemax-variant, pp. 55–60 in Proceedings of Congress on Evolutionary Computation 2002 IEEE Press, Hoboken, NJ. [Google Scholar]
Droste S., Jansen T., Wegener I., 2002. On the analysis of the (1+1) evolutionary algorithm. Theor. Comput. Sci. 276: 51–81. [DOI] [PubMed] [Google Scholar]
Eyre-Walker A., Keightley P. D., 2007. The distribution of fitness effects of new mutations. Nat. Rev. Genet. 8: 610–618. [DOI] [PubMed] [Google Scholar]
Ferretti L., Schmiegelt B., Weinreich D., Yamauchi A., Kobayashi Y., et al. , 2016. Measuring epistasis in fitness landscapes: the correlation of fitness effects of mutations. J. Theor. Biol. 396: 132–143. [DOI] [PubMed] [Google Scholar]
Fisher R. A., 1930. The Genetical Theory of Natural Selection. Clarendon Press, Oxford. [Google Scholar]
Fogle C. A., Nagle J. L., Desai M. M., 2008. Clonal interference, multiple mutations and adaptation in large asexual populations. Genetics 180: 2163–2173. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gerrish P., Lenski R., 1998. The fate of competing beneficial mutations in an asexual population. Genetica 102–103: 127–144. [PubMed] [Google Scholar]
Gillespie J. H., 1983. Some properties of finite populations experiencing strong selection and weak mutation. Am. Nat. 121: 691–708. [Google Scholar]
Gillespie J. H., 1984. Molecular evolution over the mutational landscape. Evolution 38: 1116–1129. [DOI] [PubMed] [Google Scholar]
Grant V., Flake R. H., 1974. Solutions to the cost-of-selection dilemma. Proc. Natl. Acad. Sci. USA 71: 3863–3865. [DOI] [PMC free article] [PubMed] [Google Scholar]
Haldane J. B. S., 1957. The cost of natural selection. J. Genet. 55: 511–524. [Google Scholar]
He J., Yao X., 2001. Drift analysis and average time complexity of evolutionary algorithms. Artif. Intell. 127: 57–85. [Google Scholar]
Johannsen, D., 2010 Random combinatorial structures and randomized search heuristics. Ph.D. Thesis, Universität des Saarlandes and the Max-Planck-Institut für Informatik, Saarbrücken, Germany. [Google Scholar]
Kauffman S., Levin S., 1987. Towards a general theory of adaptive walks on rugged landscapes. J. Theor. Biol. 128: 11–45. [DOI] [PubMed] [Google Scholar]
Kauffman S. A., Weinberger E. D., 1989. The NK model of rugged fitness landscapes and its application to maturation of the immune response. J. Theor. Biol. 141: 211–245. [DOI] [PubMed] [Google Scholar]
Khan A. I., Dinh D. M., Schneider D., Lenski R. E., Cooper T. F., 2011. Negative epistasis between beneficial mutations in an evolving bacterial population. Science 332: 1193–1196. [DOI] [PubMed] [Google Scholar]
Kim Y., Orr H. A., 2005. Adaptation in sexuals vs. asexuals: clonal interference and the Fisher-Muller model. Genetics 171: 1377–1386. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kimura M., 1961. Natural selection as the process of accumulating genetic information in adaptive evolution. Genet. Res. 2: 127–140. [Google Scholar]
Kimura M., 1962. On the probability of fixation of mutant genes in a population. Genetics 47: 713–719. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kingman J. F. C., 1978. A simple model for the balance between selection and mutation. J. Appl. Probab. 15: 1–12. [Google Scholar]
Kinney J. B., Murugan A., Callan C. G., Cox E. C., 2010. Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence. Proc. Natl. Acad. Sci. USA 107: 9158–9163. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kondrashov F. A., Kondrashov A. S., 2001. Multidimensional epistasis and the disadvantage of sex. Proc. Natl. Acad. Sci. USA 98: 12089–12092. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kryazhimskiy S., Tkacik G., Plotkin J. B., 2009. The dynamics of adaptation on correlated fitness landscapes. Proc. Natl. Acad. Sci. USA 106: 18638–18643. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lehre, P. K. and C. Witt, 2013 General drift analysis with tail bounds. arXiv Available at: https://arxiv.org/abs/1307.2559.
Lenski R. E., Wiser M. J., Ribeck N., Blount Z. D., Nahum J. R., et al. , 2015. Sustained fitness gains and variability in fitness trajectories in the long-term evolution experiment with Escherichia coli. Proc. Biol. Sci. 282: 20152292. [DOI] [PMC free article] [PubMed] [Google Scholar]
Macken C. A., Perelsont A. S., 1989. Protein evolution on rugged landscapes. Proc. Natl. Acad. Sci. USA 86: 6191–6195. [DOI] [PMC free article] [PubMed] [Google Scholar]
Oliveto P. S., Witt C., 2011. Simplified drift analysis for proving lower bounds in evolutionary computation. Algorithmica 59: 369–386. [Google Scholar]
Oliveto, P. S. and C. Witt, 2012 Erratum: Simplified drift analysis for proving lower bounds in evolutionary computation. arXiv Available at: https://arxiv.org/abs/1211.7184.
Orr H. A., 2000. The rate of adaptation in asexuals. Genetics 155: 961–968. [DOI] [PMC free article] [PubMed] [Google Scholar]
Orr H. A., 2002. The population genetics of adaptation: the adaptation of DNA sequences. Evolution 56: 1317–1330. [DOI] [PubMed] [Google Scholar]
Orr H. A., 2005. The genetic theory of adaptation: a brief history. Nat. Rev. Genet. 6: 119–127. [DOI] [PubMed] [Google Scholar]
Orr H. A., 2006. The population genetics of adaptation on correlated fitness landscapes: the block model. Evolution 60: 1113–1124. [PubMed] [Google Scholar]
Paixão T., Badkobeh G., Barton N., Corus D., Dang D.-C., et al. , 2015a Toward a unifying framework for evolutionary processes. J. Theor. Biol. 383: 28–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
Paixão T., Pérez Heredia J., Sudholt D., Trubenová B., 2015b First steps towards a runtime comparison of natural and artificial evolution, pp. 1455–1462 in Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation. Association for Computing Machinery, New York. [Google Scholar]
Park S.-C., Simon D., Krug J., 2010. The speed of evolution in large asexual populations. J. Stat. Phys. 138: 381–410. [Google Scholar]
Park S.-C., Neidhart J., Krug J., 2016. Greedy adaptive walks on a correlated fitness landscape. J. Theor. Biol. 397: 89–102. [DOI] [PubMed] [Google Scholar]
Phillips P. C., 2008. Epistasis—the essential role of gene interactions in the structure and evolution of genetic systems. Nat. Rev. Genet. 9: 855–867. [DOI] [PMC free article] [PubMed] [Google Scholar]
Poelwijk F. J., Kiviet D. J., Weinreich D. M., Tans S. J., 2007. Empirical fitness landscapes reveal accessible evolutionary paths. Nature 445: 383–386. [DOI] [PubMed] [Google Scholar]
Poelwijk F. J., Tănase-Nicola S., Kiviet D. J., Tans S. J., 2011. Reciprocal sign epistasis is a necessary condition for multi-peaked fitness landscapes. J. Theor. Biol. 272: 141–144. [DOI] [PubMed] [Google Scholar]
Provine W. B., 2001. The Origins of Theoretical Population Genetics, Ed. 1 University of Chicago Press, Chicago. [Google Scholar]
Rowe J. E., Sudholt D., 2014. The choice of the offspring population size in the (1,λ) evolutionary algorithm. Theor. Comput. Sci. 545: 20–38. [Google Scholar]
Rudolph G., 1997a Convergence Properties of Evolutionary Algorithms. Verlag Dr. Kovač, Altona, Germany. [Google Scholar]
Rudolph G., 1997b How mutation and selection solve long-path problems in polynomial expected time. Evol. Comput. 4: 195–205. [Google Scholar]
Sella G., Hirsh A. E., 2005. The application of statistical physics to evolutionary biology. Proc. Natl. Acad. Sci. USA 102: 9541–9546. [DOI] [PMC free article] [PubMed] [Google Scholar]
Stadler P. F., 1996. Landscapes and their correlation functions. J. Math. Chem. 20: 1–45. [Google Scholar]
Szendro I. G., Schenk M. F., Franke J., Krug J., de Visser J. A. G. M., 2013. Quantitative analyses of empirical fitness landscapes. J. Stat. Mech. 2013: P01005. [Google Scholar]
Tautz D., Domazet-Lošo T., 2011. The evolutionary origin of orphan genes. Nat. Rev. Genet. 12: 692–702. [DOI] [PubMed] [Google Scholar]
Tuğrul M., Paixão T., Barton N. H., Tkačik G., 2015. Dynamics of transcription factor binding site evolution. PLoS Genet. 11: e1005639. [DOI] [PMC free article] [PubMed] [Google Scholar]
Valiant L., 2013. Probably Approximately Correct: Nature’s Algorithms for Learning and Prospering in a Complex World, Ed. 1 Basic Books, New York. [Google Scholar]
Wagner G. P., Kenney-Hunt J. P., Pavlicev M., Peck J. R., Waxman D., et al. , 2008. Pleiotropic scaling of gene effects and the “cost of complexity.” Nature 452: 470–472. [DOI] [PubMed] [Google Scholar]
Weinreich D. M., Watson R. A., Chao L., 2005. Perspective: sign epistasis and genetic constraint on evolutionary trajectories. Evolution 59: 1165–1174. [PubMed] [Google Scholar]
Wilke C. O., 2004. The speed of adaptation in large asexual populations. Genetics 167: 2045–2053. [DOI] [PMC free article] [PubMed] [Google Scholar]
Williams D., 1991. Probability with Martingales, Ed. 1 Cambridge University Press, Cambridge, United Kingdom. [Google Scholar]
Wiser M. J., Ribeck N., Lenski R. E., 2013. Long-term dynamics of adaptation in asexual populations. Science 342: 1364–1367. [DOI] [PubMed] [Google Scholar]
Wright S., 1932. The roles of mutation, inbreeding, crossbreeding and selection in evolution. Proceedings of the 6th International Congress of Genetics, Vol. 1, Ithaca, New York, pp. 356–366. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article. Code to perform simulations is available upon request.

[bib1] Aita T., Uchiyama H., Inaoka T., Nakajima M., Kokubo T., et al. , 2000. Analysis of a local fitness landscape with a model of the rough Mt. Fuji-type landscape: application to prolyl endopeptidase and thermolysin. Biopolymers 54: 64–79. [DOI] [PubMed] [Google Scholar]

[bib2] Aita T., Iwakura M., Husimi Y., 2001. A cross-section of the fitness landscape of dihydrofolate reductase. Protein Eng. 14: 633–638. [DOI] [PubMed] [Google Scholar]

[bib3] Berg J., Willmann S., Lässig M., 2004. Adaptive evolution of transcription factor binding sites. BMC Evol. Biol. 4: 42. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Chastain E., Livnat A., Papadimitriou C., Vazirani U., 2014. Algorithms, games, and evolution. Proc. Natl. Acad. Sci. USA 111: 10620–10623. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] Chatterjee K., Pavlogiannis A., Adlam B., Nowak M. A., 2014. The time scale of evolutionary innovation. PLOS Comput. Biol. 10: e1003818. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Cormen T. H., Leiserson C. E., Rivest R. L., Stein C., 2009. Introduction to Algorithms, Ed. 3 The MIT Press, Cambridge, MA. [Google Scholar]

[bib7] Corus, D., D.-C. Dang, A. V. Eremeev, and P. K. Lehre, 2014 Level-based analysis of genetic algorithms and other search processes, pp. 912–921 in Parallel Problem Solving from Nature—PPSN XIII (Lecture Notes in Computer Science, Vol. 8672), edited by T. Bartz-Beielstein, J. Branke, B. Filipič, and J. Smith. Springer, New York. [Google Scholar]

[bib8] Crona K., Greene D., Barlow M., 2013. The peaks and geometry of fitness landscapes. J. Theor. Biol. 317: 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] Desai M. M., Fisher D. S., Murray A. W., 2007. The speed of evolution and maintenance of variation in asexual populations. Curr. Biol. 17: 385–394. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] de Visser J. A. G. M., Krug J., 2014. Empirical fitness landscapes and the predictability of evolution. Nat. Rev. Genet. 15: 480–490. [DOI] [PubMed] [Google Scholar]

[bib11] Droste S., 2002. Analysis of the (1+1) EA for a dynamically changing onemax-variant, pp. 55–60 in Proceedings of Congress on Evolutionary Computation 2002 IEEE Press, Hoboken, NJ. [Google Scholar]

[bib12] Droste S., Jansen T., Wegener I., 2002. On the analysis of the (1+1) evolutionary algorithm. Theor. Comput. Sci. 276: 51–81. [DOI] [PubMed] [Google Scholar]

[bib13] Eyre-Walker A., Keightley P. D., 2007. The distribution of fitness effects of new mutations. Nat. Rev. Genet. 8: 610–618. [DOI] [PubMed] [Google Scholar]

[bib14] Ferretti L., Schmiegelt B., Weinreich D., Yamauchi A., Kobayashi Y., et al. , 2016. Measuring epistasis in fitness landscapes: the correlation of fitness effects of mutations. J. Theor. Biol. 396: 132–143. [DOI] [PubMed] [Google Scholar]

[bib15] Fisher R. A., 1930. The Genetical Theory of Natural Selection. Clarendon Press, Oxford. [Google Scholar]

[bib16] Fogle C. A., Nagle J. L., Desai M. M., 2008. Clonal interference, multiple mutations and adaptation in large asexual populations. Genetics 180: 2163–2173. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] Gerrish P., Lenski R., 1998. The fate of competing beneficial mutations in an asexual population. Genetica 102–103: 127–144. [PubMed] [Google Scholar]

[bib18] Gillespie J. H., 1983. Some properties of finite populations experiencing strong selection and weak mutation. Am. Nat. 121: 691–708. [Google Scholar]

[bib19] Gillespie J. H., 1984. Molecular evolution over the mutational landscape. Evolution 38: 1116–1129. [DOI] [PubMed] [Google Scholar]

[bib20] Grant V., Flake R. H., 1974. Solutions to the cost-of-selection dilemma. Proc. Natl. Acad. Sci. USA 71: 3863–3865. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Haldane J. B. S., 1957. The cost of natural selection. J. Genet. 55: 511–524. [Google Scholar]

[bib22] He J., Yao X., 2001. Drift analysis and average time complexity of evolutionary algorithms. Artif. Intell. 127: 57–85. [Google Scholar]

[bib23] Johannsen, D., 2010 Random combinatorial structures and randomized search heuristics. Ph.D. Thesis, Universität des Saarlandes and the Max-Planck-Institut für Informatik, Saarbrücken, Germany. [Google Scholar]

[bib24] Kauffman S., Levin S., 1987. Towards a general theory of adaptive walks on rugged landscapes. J. Theor. Biol. 128: 11–45. [DOI] [PubMed] [Google Scholar]

[bib25] Kauffman S. A., Weinberger E. D., 1989. The NK model of rugged fitness landscapes and its application to maturation of the immune response. J. Theor. Biol. 141: 211–245. [DOI] [PubMed] [Google Scholar]

[bib26] Khan A. I., Dinh D. M., Schneider D., Lenski R. E., Cooper T. F., 2011. Negative epistasis between beneficial mutations in an evolving bacterial population. Science 332: 1193–1196. [DOI] [PubMed] [Google Scholar]

[bib27] Kim Y., Orr H. A., 2005. Adaptation in sexuals vs. asexuals: clonal interference and the Fisher-Muller model. Genetics 171: 1377–1386. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Kimura M., 1961. Natural selection as the process of accumulating genetic information in adaptive evolution. Genet. Res. 2: 127–140. [Google Scholar]

[bib29] Kimura M., 1962. On the probability of fixation of mutant genes in a population. Genetics 47: 713–719. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib30] Kingman J. F. C., 1978. A simple model for the balance between selection and mutation. J. Appl. Probab. 15: 1–12. [Google Scholar]

[bib31] Kinney J. B., Murugan A., Callan C. G., Cox E. C., 2010. Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence. Proc. Natl. Acad. Sci. USA 107: 9158–9163. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] Kondrashov F. A., Kondrashov A. S., 2001. Multidimensional epistasis and the disadvantage of sex. Proc. Natl. Acad. Sci. USA 98: 12089–12092. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] Kryazhimskiy S., Tkacik G., Plotkin J. B., 2009. The dynamics of adaptation on correlated fitness landscapes. Proc. Natl. Acad. Sci. USA 106: 18638–18643. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] Lehre, P. K. and C. Witt, 2013 General drift analysis with tail bounds. arXiv Available at: https://arxiv.org/abs/1307.2559.

[bib35] Lenski R. E., Wiser M. J., Ribeck N., Blount Z. D., Nahum J. R., et al. , 2015. Sustained fitness gains and variability in fitness trajectories in the long-term evolution experiment with Escherichia coli. Proc. Biol. Sci. 282: 20152292. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] Macken C. A., Perelsont A. S., 1989. Protein evolution on rugged landscapes. Proc. Natl. Acad. Sci. USA 86: 6191–6195. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] Oliveto P. S., Witt C., 2011. Simplified drift analysis for proving lower bounds in evolutionary computation. Algorithmica 59: 369–386. [Google Scholar]

[bib38] Oliveto, P. S. and C. Witt, 2012 Erratum: Simplified drift analysis for proving lower bounds in evolutionary computation. arXiv Available at: https://arxiv.org/abs/1211.7184.

[bib39] Orr H. A., 2000. The rate of adaptation in asexuals. Genetics 155: 961–968. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] Orr H. A., 2002. The population genetics of adaptation: the adaptation of DNA sequences. Evolution 56: 1317–1330. [DOI] [PubMed] [Google Scholar]

[bib41] Orr H. A., 2005. The genetic theory of adaptation: a brief history. Nat. Rev. Genet. 6: 119–127. [DOI] [PubMed] [Google Scholar]

[bib42] Orr H. A., 2006. The population genetics of adaptation on correlated fitness landscapes: the block model. Evolution 60: 1113–1124. [PubMed] [Google Scholar]

[bib43] Paixão T., Badkobeh G., Barton N., Corus D., Dang D.-C., et al. , 2015a Toward a unifying framework for evolutionary processes. J. Theor. Biol. 383: 28–43. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] Paixão T., Pérez Heredia J., Sudholt D., Trubenová B., 2015b First steps towards a runtime comparison of natural and artificial evolution, pp. 1455–1462 in Proceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation. Association for Computing Machinery, New York. [Google Scholar]

[bib45] Park S.-C., Simon D., Krug J., 2010. The speed of evolution in large asexual populations. J. Stat. Phys. 138: 381–410. [Google Scholar]

[bib46] Park S.-C., Neidhart J., Krug J., 2016. Greedy adaptive walks on a correlated fitness landscape. J. Theor. Biol. 397: 89–102. [DOI] [PubMed] [Google Scholar]

[bib47] Phillips P. C., 2008. Epistasis—the essential role of gene interactions in the structure and evolution of genetic systems. Nat. Rev. Genet. 9: 855–867. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib48] Poelwijk F. J., Kiviet D. J., Weinreich D. M., Tans S. J., 2007. Empirical fitness landscapes reveal accessible evolutionary paths. Nature 445: 383–386. [DOI] [PubMed] [Google Scholar]

[bib49] Poelwijk F. J., Tănase-Nicola S., Kiviet D. J., Tans S. J., 2011. Reciprocal sign epistasis is a necessary condition for multi-peaked fitness landscapes. J. Theor. Biol. 272: 141–144. [DOI] [PubMed] [Google Scholar]

[bib50] Provine W. B., 2001. The Origins of Theoretical Population Genetics, Ed. 1 University of Chicago Press, Chicago. [Google Scholar]

[bib51] Rowe J. E., Sudholt D., 2014. The choice of the offspring population size in the (1,λ) evolutionary algorithm. Theor. Comput. Sci. 545: 20–38. [Google Scholar]

[bib52] Rudolph G., 1997a Convergence Properties of Evolutionary Algorithms. Verlag Dr. Kovač, Altona, Germany. [Google Scholar]

[bib53] Rudolph G., 1997b How mutation and selection solve long-path problems in polynomial expected time. Evol. Comput. 4: 195–205. [Google Scholar]

[bib54] Sella G., Hirsh A. E., 2005. The application of statistical physics to evolutionary biology. Proc. Natl. Acad. Sci. USA 102: 9541–9546. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib55] Stadler P. F., 1996. Landscapes and their correlation functions. J. Math. Chem. 20: 1–45. [Google Scholar]

[bib56] Szendro I. G., Schenk M. F., Franke J., Krug J., de Visser J. A. G. M., 2013. Quantitative analyses of empirical fitness landscapes. J. Stat. Mech. 2013: P01005. [Google Scholar]

[bib57] Tautz D., Domazet-Lošo T., 2011. The evolutionary origin of orphan genes. Nat. Rev. Genet. 12: 692–702. [DOI] [PubMed] [Google Scholar]

[bib58] Tuğrul M., Paixão T., Barton N. H., Tkačik G., 2015. Dynamics of transcription factor binding site evolution. PLoS Genet. 11: e1005639. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib59] Valiant L., 2013. Probably Approximately Correct: Nature’s Algorithms for Learning and Prospering in a Complex World, Ed. 1 Basic Books, New York. [Google Scholar]

[bib60] Wagner G. P., Kenney-Hunt J. P., Pavlicev M., Peck J. R., Waxman D., et al. , 2008. Pleiotropic scaling of gene effects and the “cost of complexity.” Nature 452: 470–472. [DOI] [PubMed] [Google Scholar]

[bib61] Weinreich D. M., Watson R. A., Chao L., 2005. Perspective: sign epistasis and genetic constraint on evolutionary trajectories. Evolution 59: 1165–1174. [PubMed] [Google Scholar]

[bib62] Wilke C. O., 2004. The speed of adaptation in large asexual populations. Genetics 167: 2045–2053. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib63] Williams D., 1991. Probability with Martingales, Ed. 1 Cambridge University Press, Cambridge, United Kingdom. [Google Scholar]

[bib64] Wiser M. J., Ribeck N., Lenski R. E., 2013. Long-term dynamics of adaptation in asexual populations. Science 342: 1364–1367. [DOI] [PubMed] [Google Scholar]

[bib65] Wright S., 1932. The roles of mutation, inbreeding, crossbreeding and selection in evolution. Proceedings of the 6th International Congress of Genetics, Vol. 1, Ithaca, New York, pp. 356–366. [Google Scholar]

PERMALINK

Selection Limits to Adaptive Walks on Correlated Landscapes

Jorge Pérez Heredia

Barbora Trubenová

Dirk Sudholt

Tiago Paixão

Abstract

Methods

Transition probabilities

Fitness landscapes

Figure 1.

Drift analysis

Simulations

Figure 2.

Figure 4.

Data availability

Results

Adaptation time in simple additive landscapes

A critical threshold for efficient adaptation

Efficient approach to the optimum

General additive fitness landscapes

Figure 3.

Adaptation in a general class of landscapes

Discussion

Acknowledgments

Appendix A: Weak-Mutation Regime as an Algorithm

Appendix B: Probability of Fixation Inequalities

Appendix C: Drift Theorems

Appendix D: Adaptation Time in Simple Additive Landscapes

Appendix E: General Additive Fitness Landscapes

Appendix F: Adaptation in Unimodal Landscapes

Climbing Fitness Ridges

Figure D1.

Figure F1.

Figure F2.

Footnotes

Literature Cited

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases