Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2019 May 6;15(5):e1008125. doi: 10.1371/journal.pgen.1008125

The evolution of hybrid fitness during speciation

Andrius J Dagilis 1,*, Mark Kirkpatrick 1, Daniel I Bolnick 1,2
Editor: Alex Buerkle3
PMCID: PMC6502311  PMID: 31059513

Abstract

The evolution of postzygotic reproductive isolation is an important component of speciation. But before isolation is complete there is sometimes a phase of heterosis in which hybrid fitness exceeds that of the two parental species. The genetics and evolution of heterosis and postzygotic isolation have typically been studied in isolation, precluding the development of a unified theory of speciation. Here, we develop a model that incorporates both positive and negative gene interactions, and accounts for the evolution of both heterosis and postzygotic isolation. We parameterize the model with recent data on the fitness effects of 10,000 mutations in yeast, singly and in pairwise epistatic combinations. The model makes novel predictions about the types of interactions that contribute to declining hybrid fitness. We reproduce patterns familiar from earlier models of speciation (e.g. Haldane’s Rule and Darwin’s Corollary) and identify new mechanisms that may underlie these patterns. Our approach provides a general framework for integrating experimental data from gene interaction networks into speciation theory and makes new predictions about the genetic mechanisms of speciation.

Author summary

As populations diverge and ultimately evolve into distinct species, hybrids between them gradually become inviable and infertile. Before this happens, there can be a brief period in which hybrids are actually more fit than their parents, a condition called heterosis or hybrid vigor. This paper describes a model of how hybrid fitness changes as two populations diverge that can explain both hybrid vigor and hybrid inviability in a single framework. These results show that interactions between alleles within a population can often be more important to hybrid fitness than new interactions first seen in hybrids. Simulations of the model, using state-of-the-art experimental data on gene interactions, show evolutionary trajectories that mirror patterns seen in nature. The model suggests how the process by which populations diverge affects the rate of speciation.

Introduction

Isolated populations accrue genetic differences. These differences can reduce the fitness of hybrids, constituting a major step the origin of new species [1, 2]. However, important exceptions exist: there are many examples of recently diverged populations whose hybrids are actually more fit than either parent [38]. Existing theories of speciation genetics do not provide a unified explanation of the seemingly contradictory observations of heterosis and hybrid incompatibility.

The decline in hybrid fitness is generally attributed to Dobzhansky-Muller incompatibilities, or “DMIs” [912]. These are negative epistatic interactions that occur between alleles that are brought together for the first time in hybrids. DMIs have been a major focus of empirical and theoretical studies of speciation genetics [1315]. Mathematical models of DMIs [16, 17] have generated testable predictions about the evolution of hybrid inviability [16, 18], the asymmetric inviability of reciprocal crosses [19], and sex differences in hybrid fitness [20] (reviewed in [21]). Although these predictions have generally been supported by empirical studies [2], the majority of these models share four limitations. First, they consider only deleterious gene interactions, and so are not able to explain heterosis. Second, many previous speciation models do not account for the mechanism by which mutations become fixed as two species diverge. Substitutions are typically assumed to occur randomly through time, with no explicit consideration of the roles played by selection and drift. How these mutations become fixed could have important consequences for how they affect hybrid fitness. Third, although models of DMI evolution consider the effects of new interactions among genes brought together for the first time in hybrids, they neglect the effects of disrupting beneficial gene interactions that evolved within the parental species. Last, previous models assume interactions between all pairs of genes in hybrids are equally likely, while real gene interaction networks are highly structured [22]. A new wave of models has begun to challenge some of these assumptions individually [2327]. Here, we develop a more general model that relaxes all four assumptions.

Some recent models have set aside the DMI framework in favor of explaining hybrid fitness using extensions of Fisher’s Geometric Model [2730]. In such models, mutations have direct effects on phenotypes, and fitness depends on distance to one (or a few) optimal combinations of traits. Appeals of this approach are that hybrid fitness is easily predictable given assumptions about parental fitnesses and it is able to fit empirical data surprisingly well [28]. This framework can explain heterosis as the result of uniparentally inherited elements [27] or low fitness in the parents [28]. Nonetheless, the approach has limitations. Predictions about heterosis are only consistent with speciation under limited assumptions about dominance [28]. More importantly, these models make highly restrictive assumptions about the relationship of genotype to phenotype and the shape of the fitness landscape.

Here we develop a framework to model how hybrid fitness evolves as the result of individual and epistatic fitness effects of mutations. Unlike previous DMI models, we allow for positive as well as negative epistatic fitness effects, as shown schematically in Fig 1. We also explicitly model the substitution process. These advances lead to several new conceptual insights. Depending on how mutations become fixed and their spectrum of fitness interactions, hybrid fitness can be higher, lower, or equal to that of the parental species. We find that hybrid fitness is contingent on two classes of epistatic interactions. The first, which has been the focus of previous DMI models, are the new interactions created by hybridization (upper half of Fig 1). The second, which has generally been ignored, are gene interactions present in the two parental species that are disrupted in the hybrids (bottom half of Fig 1).

Fig 1. Gene interactions determine parental and hybrid fitness.

Fig 1

Parents P1 and P2 cross to produce a hybrid F1 offspring. Each individual carries two loci (circle and square). The ancestral allele is white and derived mutations that have fixed in the two parental species are shaded. If one mutation is fixed in each species (top panels), negative epistasis causes decreased hybrid fitness (top left), while positive epistasis causes heterosis (top right). If two mutations are fixed in one species and none in the other (bottom panels), both negative epistasis and positive epistasis cause the hybrid to have intermediate fitness. Depending on the dominance of epistasis, hybrid fitness may be higher or lower than the average of the parents.

Under simplified conditions, this framework leads to several analytic results that can explain both heterosis and hybrid incompatibility. We extend the framework with a stochastic model that allows for variation in the effects of mutations and accounts for the probability that a mutation will become fixed in a parental species given its fitness effects. We then parameterize the model with data from recent studies in yeast systems biology. We use the fitness measurements of 10,277 unique single mutants and 20,712,321 double mutants measured in refs. [22, 31, 32] to calculate fixation probabilities and hybrid fitness. The result is the first model for the evolution of postzygotic isolation that is grounded in empirical estimates of fitness effects.

We find that heterosis sometimes occurs and sometimes not. When it does occur, it is always transient and happens early in the speciation process. Heterosis may appear when epistatic interactions within a parental species are deleterious, and their disruption causes F1 hybrids to have higher fitness than its parents (Fig 1, bottom left). This case is possible when drift is strong, and mutations with deleterious interactions are fixed in the same population. On the other hand, if epistatic interactions within a parent are beneficial, then F1 hybrids can have lower fitness, even in the absence of deleterious DMIs (Fig 1, bottom right). Simulation results show that that the decline of hybrid fitness mainly results from many gene interactions of small effect, which suggests that mapping of genetic incompatibilities will often be difficult. We find that the disruption of co-adapted sets of alleles in a parental species are often more important to determining F1 fitness than are novel negative interactions that occur in hybrids. Our study suggests that understanding the structure of epistatic networks may be a crucial prerequisite to predicting hybrid fitness.

Results

Our model considers two populations that are fixing mutations while diverging in allopatry. Isolation is complete: there is no gene flow between the populations. For simplicity, we assume no single mutation becomes fixed in both populations, and that a negligible number of loci are polymorphic for mutations at any time.

The fitness of an individual, either from a parental population or a hybrid, is a function of the derived alleles (that is, mutations) that it carries. A mutation’s independent and epistatic fitness effects, which can be either positive or negative, are assumed to be multiplicative. Individuals homozygous for mutations at loci i and j have fitness (1 + si)(1 + sj)(1 + εij), where si and sj are the independent fitness effects of the mutations, and εij is the epistatic fitness effect between them when they are both homozygous. An individual that is heterozygous for mutations in set G1 and homozygous for mutations in G2 has fitness

WG1,G2=iG1(1+hisi)iG2(1+si)×iG1,jG2(1+α2εij)i,jG1(1+α1εij)i,jG2(1+εij). (1)

Here, hi is the dominance of a mutation at locus i. There can be one, two, or four interactions between mutations at a pair of loci, depending on whether the loci are heterozygous or homozygous (S1 Fig). An interaction between two heterozygous derived alleles may be of the same strength as that between two homozygous derived alleles–a form of epistatic dominance. We parameterize epistatic dominance as α1 and α2 for interactions between two heterozygous loci and between a heterozygous and a homozygous locus, respectively. This approach is similar to the H0, H1 and H2 parameters described by Turelli and Orr [17], but allows us to parameterize our model more easily. The fitness of the common ancestor of the diverging species is defined to be 1, and epistatic interactions in the ancestor are defined to be zero. All fitness effects caused by interactions between a derived mutation at locus i and ancestral alleles throughout the genome are accounted for by the independent selection coefficient si. The square root appears in Eq (1) because each combination of mutations i and j occurs twice in the products over sets G1 and G2.

To make our results comparable to empirical studies, we measure the fitness of hybrids relative to the mean fitnesses of the two parental species, and denote relative hybrid fitness as wH. The expression for wH is given in the Methods section.

Analytical results

We first studied the model analytically by making several simplifying assumptions. Intuitively, we expect mutations that become fixed within a population will tend to have positive interactions with mutations that have previously fixed. If not, then the new mutations would be deleterious and unlikely to fix. We denote the mean of these “within-population epistasis” effects as ε¯w, which for simplicity we assume is constant in time. In contrast, the interactions between mutations fixed in different populations will not have experienced this selective sieve, so we expect them to be less positive (and perhaps even negative). We denote the mean of the “between-population epistasis” effects as ε¯b, and again assume that it is constant in time. The final simplifying assumptions are that the means and the coefficients of variation of the fitness effects of mutations are much smaller than one, and that independent fitness effects have additive dominance (h = 1/2).

Consider the situation when a total of n substitutions have fixed in the two populations, a fraction F of which fixed in the first population. The relative fitness of hybrids (which is derived in File S1) is then

wHexp{ε¯bα1vn2ε¯w(12α1)[(12v)n2n2]}. (2)

The parameter v = F(1 –F) measures the symmetry of divergence: v = 1/4 when substitutions occur at equal rates in the two populations, and v = 0 when substitutions only occur in one population. (See the Supplementary Text for derivations.) Examples of the change of hybrid fitness in time are shown in Fig 2 and S2 Fig.

Fig 2. Relative hybrid fitness through time in the analytic model.

Fig 2

Results from Eq (2) are shown for three sets of parameters. Heterosis occurs during the first four fixed substitutions with set (A). Parameters values in (A) are ε¯w = 0.02 and ε¯b = 0.01; in (B) are ε¯w = 0.01 and ε¯b = -0.01; and in (C) are ε¯w = 0.05 and ε¯b = -0.01. In all cases the populations are diverging symmetrically (v = 1/4) and epistatic dominance is additive (α1 = 1/4). See S2 Fig for an exploration of a wider range of parameters.

Eq 2 reveals that hybrid fitness is determined by two factors. First, there are the new gene interactions, represented by the first of the two terms inside the curly brackets (and shown in Fig 1, top panels). These interactions may on average be deleterious (ε¯b<0), as assumed by previous Dobzhansky-Muller incompatibility (DMI) models, or they may be beneficial (ε¯b>0). Second, epistatic interactions between mutations that were fixed in each parental population are disrupted in the hybrids, represented by the second product in the brackets (Fig 1, bottom panels). This term has been overlooked in most prior DMI models. The strength of each of these two effects depends on both the dominance of epistasis and the number of mutations fixed in the two populations. When only one population evolves, or when epistatic effects are completely recessive (v = 0 or α1 = 0), then hybrid fitness is affected solely by the loss of interactions within the parental populations. The epistatic dominance parameter α2 does not appear in Eq (2) because all mutations carried by F1 hybrids are heterozygous (due to our assumption that polymorphic loci are vanishingly rare). The independent fitness effects of mutations (si) are also absent because the number of mutations carried by hybrids is equal to the mean number carried by their parents.

Heterosis occurs whenever wH is greater than 1. Of particular interest are cases in which heterosis is transient, as seen in empirical studies of some emerging pairs of species [3, 8]. The results from Eq (2) are particularly simple when the two populations evolve at the same rate, and epistatic dominance is additive (v = ¼, α1 = ¼). Heterosis then occurs whenever both within- and between-population epistasis are on average positive and within-population epistasis is stronger than between-population epistasis (0 < ε¯b<ε¯w). Hybrid fitness ultimately declines relative to parents, and heterosis is replaced by postzygotic isolation, after a total of (ε¯wε¯b)-1/2 + ε¯w/(ε¯wε¯b) substitutions have occurred.

The potential for heterosis thus depends on the relative effects of within-population and between-population epistasis. A mutation is more likely to fix in one population if it has positive interactions with mutations that are already fixed in that same population, while its potential interactions with mutations fixed in another population are irrelevant to its fixation probability. Between-population effects will therefore generally represent a random draw from the distribution of mutant effects. The within- and between-population effects are in turn expected to be quite different. The interactions between mutations fixed within a population can be influenced by the strength of drift: mutations with negative interactions are more likely to fix when drift is strong, which can cause ε¯w to be negative. Hybrids are rescued from these negative effects, and heterosis can result. Finally, if mutations that fix in one population happen to interact badly with those fixed in the other (so ε¯b is large and negative), then no heterosis will occur, and hybrid fitness can decline rapidly with the fixation of only a small number of mutations.

Classic patterns in speciation

In a classic model of speciation caused by Dobzhansky-Muller incompatibilities, the decline in hybrid fitness accelerates, or “snowballs”, because the number of possible DMIs grows quadratically with the number of mutations fixed in the diverging populations [33]. This prediction has not been strongly supported by empirical studies, however, which has led to the suggestion that the strength of interactions changes over the course of speciation [34, 35].

Our model suggests a different explanation for the missing snowball. When the strengths of within- and between-population epistasis are equal (ε¯b=ε¯w) and the populations evolve at the same rate (v = ¼), hybrid fitness changes in a linear manner as mutations are successively fixed, rather than accelerating. The reason for this discrepancy with the snowball model is quite simple. While the number of new gene interactions in hybrids grows combinatorically, so does the number of beneficial interactions that have evolved in the parental species and that are disrupted in the hybrids. Even when both populations are evolving at equal rates, the number of between-population interactions is only greater than the number of within-population interactions by a small linear factor (n/2). Hybrid fitness is therefore determined by this small number of between population interactions. These conditions occur when epistasis is very weak or drift is very strong. In that case, within-population epistasis will not be much affected by selection and will strongly resemble epistasis between populations. Earlier conclusions about the pattern of decline in hybrid fitness failed to separate out the effect of within-population interactions and therefore need reevaluation (although see [36] for a model that only includes within population interactions).

Haldane’s Rule is the observation that when only one sex of hybrids suffers low fitness, it is the heterogametic sex [20, 37, 38]. This pattern is frequently attributed to dominance [39], but a large body of theoretical work has identified recessive interactions between sex chromosomes and autosomes, and faster evolution of the X chromosome, as important drivers (see [40] for a recent review). To learn how this pattern might emerge from our model, we extended it to include sex-linked loci (Section 2d of Supplementary Text). Hybrid female fitness now depends on interactions between mutations that fixed on X chromosomes evolving independently in the two populations, while hybrid male fitness depends on interactions between mutations fixed on the X in one population and mutations fixed on the Y in the other. The relevant comparison for Haldane’s Rule is then to ask whether a diverged X would have more deleterious interactions with an X or Y from a different species (considering only mutations fixed on non-recombining regions of the X and Y). If X mutations tend to compensate for those fixed on the Y (which is highly susceptible to drift), Haldane’s Rule may result due to largely negative epistasis between a Y and an X from different populations. In an important departure from previous theory [17], our model shows that Haldane’s Rule can result if epistasis between diverged Xs is positive even when no mutations are fixed on the Y. This result makes the testable prediction that Haldane’s Rule in XO systems could be explained by positive epistasis between diverged Xs [20]. Our model thus compliments prior explanations for Haldane’s Rule that depend on dominance.

Darwin’s Corollary is the observation that hybrid fitness sometimes depends on the direction of the cross [19, 41, 42]. It is thought this asymmetry results from genetic interactions between one set of mutations on autosomes and another set on the sex chromosomes and/or mitochondria [41, 42]. Extending our model further to include mutations that fix on mitochondria (Supplementary Text, section 2f), we find that the pattern of Darwin’s Corollary emerges whenever one population evolves faster than the other, and is especially prominent if the X or mitochondria evolve faster than autosomes (consistent with the “faster-X effect” [43, 44]).

Parameterizing the model with data

We next explored the implications of our model when the fitness effects are fit to data from laboratory populations of yeast. We estimated the selection coefficients that appear in Eq (1) using data on the 10,277 knockout mutations studied by Costanzo et al. [22], who measured the independent effects and the epistatic effects between 20,712,321 pairs of mutations. These measurements were taken during the haploid phase of the life cycle, so they provide no information about dominance effects. We therefore simply assumed mutations have no dominance (hi = ½). Further details about the data are given in Materials and Methods.

As observed previously [31], in these yeast there is a significant positive correlation between a mutation’s independent effect and its interactions with other mutations (S3 Fig). Mutations that are highly deleterious individually also tend have highly negative interactions with other mutations. While the biological cause of this correlation remains unclear, it does have an important implication for speciation: pairs of mutations that have strongly negative interactions, and are therefore more capable of killing hybrids, are unlikely to fix in the first place.

This pattern leads to a novel prediction: interactions between mutations fixed early in divergence may contribute less to reproductive isolation than those fixed later. Mutations that appear early will interact with only a small number of mutations that have previously fixed, and so the fate of these mutations is determined mainly by their independent effects. Only those with positive independent effects are likely to fix, and because of the correlation noted above, these early mutations will also tend to have beneficial interactions in hybrids. Later in divergence, the probability that a mutation becomes fixed is mainly determined by how well it interacts with mutations that were previously fixed in that population rather than its own independent effect. It will therefore not be more likely than random to have beneficial interactions with mutations fixed in the other population. Thus within-population epistasis is expected to become increasingly strong and positive, while between population epistasis will tend to become more negative. The result is that the loss of hybrid fitness accelerates. This acceleration resembles the snowball effect discussed earlier [33], but occurs for a very different reason: changes in the average epistatic effects of mutations over time, rather than changes in their numbers.

A stochastic model

To determine the full evolutionary trajectories of hybrid fitness, we turned to stochastic simulations based on the parameterized model. In the simulations, mutations fix sequentially as the result of selection and drift (details are given in the Methods). As the populations diverge, the fitness effects of mutations that fix are altered by their interactions with mutations that were fixed previously. This change allows the fixation of mutations later in the process of divergence that previously had been highly deleterious (S4 Fig). As a result, each population becomes enriched for co-adapted sets of mutations. In contrast, between-population effects of mutations change little through time because those effects are not tested by selection. These contrasting patterns are shown in Fig 3.

Fig 3. Average epistatic effects of substitutions.

Fig 3

Stochastic simulations of the full model show that the average within-population epistasis is positive at all times. It increases during the first few generations of divergence and then stabilizes. The average between-population epistasis is negative and nearly constant, and has a much smaller absolute size than within-population epistasis. The shaded regions represent 95% confidence intervals.

Fig 4 shows how hybrid fitness evolves when we parameterize our general model with fitness data from yeast. Heterosis is quite common early in the process of speciation: it occurs in fully 10% of the simulations after 10 mutations are fixed in each population. But even in those cases, hybrid fitness inevitably declines. On average, relative hybrid fitness decreases by only 1% with each additional mutation that becomes fixed, and in only 2% of simulations does a single mutation causes a large change (> 10%) at any time during divergence. These results suggest that the genetic mapping of incompatibilities will often be difficult because hybrid fitness is typically determined by many interactions of small effect.

Fig 4. Relative hybrid fitness through time from the stochastic simulations.

Fig 4

The solid black curve shows the mean hybrid fitness over 10,000 simulations, the grey lines show a sample of 100 simulations, and the shaded area gives the 95% confidence interval. Heterosis (wH > 1) occurs in some simulations early in divergence, but then rapidly disappears as populations continue to diverge.

The two dominance parameters for epistatic effects modify our conclusions in several ways. The dominance of epistasis between two heterozygous loci, α1, acts to modify the effect of within and between population epistasis to hybrid fitness (S8 Fig). On the other hand, the dominance of epistasis between a heterozygous and a homozygous locus, α2, modifies which mutations fix within populations. This greatly affects how hybrid fitness evolves (S9S12 Figs). Most notably, when α2 is very small (epistatic effects are highly recessive), the probability that a mutation becomes fixed is almost entirely based on its independent fitness effect. Mutations that do fix will also have little effect on hybrid fitness (because they are heterozygous in F1s). As a result, the average epistatic effects within and between populations are very close to the average effect of all mutations, and to each other (ε¯bε¯w). As seen in our analytical results, this leads to linear change in hybrid fitness as the number of substitutions increases.

Lastly, we find that even late in divergence, when few hybrids survive, a large proportion of mutations that have fixed in one population are also beneficial in the genomic background of the other. This fraction declines from roughly 75% when only 2 mutations are fixed in each population to 50% after 50 substitutions have fixed (S5 Fig). Thus, even when hybrids are far less fit than their parents, many of the genes they carry may be able to introgress from one population into the other. Hybridization may thus involve both reduced hybrid fitness overall, but also deterministic introgression for particular alleles.

Discussion

Although observations of heterosis and postzygotic isolation may appear to be incompatible, the results of this model show they can neatly be explained by the same factors. Hybrid fitness is affected by two kinds of epistatic interactions: those between mutations fixed within a population, and those between mutations fixed in different populations. Each of the two can be beneficial or deleterious. Previous models of speciation have focused on the negative consequences of between-population epistasis, and neglected the consequences of disrupting the interactions between mutations fixed within each parental population.

Our model reveals the importance that within-population epistasis may have in speciation. Data from yeast suggest that interactions between many mutations are positive [22] (although these data are biased for mutations of positive effect), and our results show that these can contribute to heterosis. Positive epistasis also plays a key role in the evolution of reproductive incompatibility because mutations fixed in a parental population are likely to interact positively with each other, and hybrids can miss out on those fitness benefits (Fig 1). Our simulations show that early in divergence, interactions between mutations fixed in a population are typically weak, and disrupting them has little effect on hybrid fitness. But as a population continues to evolve in isolation, it builds up sets of co-adapted alleles, and their positive interactions are disrupted in hybrids. The loss in hybrid fitness caused by this disruption can be an order of magnitude larger than that caused by new deleterious interactions between mutations that first encounter each other in hybrids (Fig 3). In the most extreme cases, the disruption of beneficial interactions within the parent populations could cause hybrid lethality.

The average strength of within-population epistasis (ε¯w) depends strongly on the roles that selection and drift play in the divergence of the two populations. If epistasis is strong and drift is weak, mutations that fix will tend to have positive interactions, and (as we have seen) their loss can be the major cause of hybrid incompatibility. But when epistatic effects are recessive or very weak and drift is strong, then ε¯w will be similar to the network average. In that case, the disruption of gene interactions that occur in the parental population will play a small role in determining hybrid fitness.

The second type of gene interaction that appears in our model is between-population epistasis. In contrast to within-population epistasis, the average strength of between-population epistasis (ε¯b) is not determined by adaptation during divergence. Instead, it is expected to represent the effects of random samples from the epistatic interaction network. Our simulations confirm this intuition (Figs 4, S12 and S14).

Another factor that contributes to the evolution of isolation is the relative rates at which the two populations are evolving. Our results suggest that isolation develops more quickly when one population is evolving much more rapidly than the other. In the extreme case when only one population is evolving (and therefore v is zero), Eq (2) shows hybrid fitness is determined entirely by the loss of beneficial epistatic effects enjoyed within that population. A follow-up prediction is that backcrossing an F1 hybrid with the faster adapting population should restore fitness, while the reciprocal backcross should not.

A final factor that influences the evolution of postzygotic isolation is dominance. Eq (2) shows that interactions between pairs of heterozygous mutations (measured by α1) play a direct role in determining hybrid fitness, but interactions between a heterozygous and a homozygous mutation (measured by α2) do not. This second parameter does, however, influence which mutations fix, as a rare mutant allele will occur in a genetic background where it is heterozygous but other derived alleles are homozygous. This intuition is confirmed by the simulations (S13 and S15 Figs). To our knowledge, there has been no experimental study of the dominance of epistatic effects, and so these parameters, crucial to determining the trajectory of hybrid fitness, remain unknown. We suspect, however, that deleterious gene interactions are often recessive, because that is the basis of the highly successful theories constructed to explain Haldane’s Rule [11, 40] and the large X effect [17].

Relations with existing models

Heterosis has previously been explained by overdominance and by rescue from recessive deleterious alleles that have fixed by strong drift [4, 5, 45, 46]. The role of epistasis has received less attention. Studies of natural populations generally do not have sufficient power to detect epistatic effects. Studies in crop plants have found that epistasis may be important to hybrid fitness [4750], but it is not known whether these findings generalize to natural populations. In nature, heterosis could well result from a combination of the epistatic effects that are the focus of our model and the effects of overdominance and recessive deleterious alleles that have been modeled previously. On the other hand, epistasis seems to be the only hypothesis that can explain why heterosis is frequently seen between closely related species, but not between ones that are more highly diverged. Our model suggests two explanations for this pattern. First, heterosis is transient under simple limiting conditions laid out by Eq 2. Second, in our simulations the variance in epistasis within and between populations is large early in divergence and becomes small as populations continue to diverge (Fig 3). This suggests that heterosis may often be a result of the stochastic sampling of epistatic interactions within and between populations.

Adaptive introgression between hybridizing species has been uncovered in several taxa including Heliconius butterflies [51], Anopheles mosquitoes [52], and (most famously) in humans [53]. Previous work has shown that DMIs, even of strong effect, may be weak barriers to gene flow across the rest of the genome [5456], and so even when hybrid fitness is low, introgression may be possible. The genetic consequences of hybridization depend not only on the individual effects that genes have on fitness, but also on how they interact: introgression is aided by positive epistasis and hampered by negative epistasis. In our simulations, fully half of the substitutions in one population can be positively selected in the other even when the populations are nearly completely reproductively isolated (S4 Fig). These results confirm that there is widespread potential for adaptive introgression.

The classic models for the evolution of postzygotic isolation focus on Dobzhansky-Muller incompatibilities (DMIs). A core prediction from these models is the so-called “snowball effect” [16, 18, 57]. This is the pattern in which the loss of hybrid fitness accelerates because the number of new deleterious interactions that occur in hybrids increases quadratically with the number of substitutions that have occurred. The snowball pattern is also predicted by our model under some conditions (Fig 2B and 2C).

There is mixed support for the snowball pattern in natural populations [18, 57], leading to the development of theories to explain the so-called “missing snowball” [24, 34, 35]. Our model is also compatible with the missing snowball. When the populations evolve at the same rate (v = ¼) and the average strength of epistasis is equal within and between populations (ε¯b=ε¯w), hybrid fitness declines linearly (rather than accelerating) with the number of mutations that have fixed (n). That is because the number of new gene interactions that appear in hybrids is largely offset by the number of co-adapted interactions that occur in the parents that are lost in the hybrids. Thus, even when the total number of incompatibilities increases quadratically, hybrid fitness may change linearly.

Another family of models for the evolution of postzygotic isolation is based on Fisher’s Geometric Model (FGM) [27, 28, 30, 58]. They assume there is a single phenotypic optimum, and that mutations have direct phenotypic effects. In contrast, our model (as well as the models of DMIs) does not make any assumptions about the underlying phenotypes. The price for this generality is that our model has many more parameters. Under some conditions, the two types of models can produce similar patterns of the change in postzygotic with time. There are, however, differences in some predictions. The core difference is the number of incompatibilities. In our model and the DMI models, the number of potential incompatibilities increases at least quadratically as populations diverge. FGM models, on the other hand, predict only a linear increase in the total number of incompatibilities. In practice it is difficult to determine the number of incompatibilities, making this difference hard to test. Our results suggest another contrast between the models: we expect more rapid speciation when one population is evolving much faster than the other, while FGM models make no such prediction.

Caveats and future extensions

Our model assumes the two populations diverge in complete isolation, and its conclusions might change substantially if there is gene flow between them. If gene flow causes mutations to be tested in the genetic backgrounds of both populations, they may have more positive effects when brought together in hybrids than they do in the absence of gene flow. Conversely, epistatic effects within populations may be less beneficial because a mutation’s survival is affected by its interactions with substitutions in the alternate population. Both effects will tend to cause hybrid fitness to be greater in the presence of gene flow than in its absence, even when the number of mutations that differ between the populations is controlled for.

Our model does not consider interactions between alleles at three or more loci. Recent work on yeast has shown that these higher order interactions may be common, but weaker than pairwise interactions [59]. As populations diverge, the number of higher order interactions increases even more rapidly than the number of pairwise interactions. Consequently, higher order interactions could be the main determinant of hybrid fitness breakdown. Future empirical and theoretical investigation should consider these higher order effects.

We did not study the fitness of F2, backcrosses, or later generations of hybrids. It is plausible that later generations may experience both effects of further hybrid breakdown and of transgressive segregation [6062]. Fitness in later generations depends strongly on the linkage relationships between interacting genes. (For example, if there are sets of interacting alleles that are tightly linked, their contribution to hybrid fitness will break down very slowly as recombination breaks them apart.) Thus, extending this framework to later generations of hybrids requires reformulating the model to account for recombination.

We parameterized the model with data from mutations in yeast that may not be representative. Their fitness effects were measured in the lab, and based on only one fitness component (growth rate). Measurements were made in a single benign environment, so they neglect the potential effects of environmental heterogeneity that may be important to speciation in yeast [63, 64]. Most importantly, our model is of a diploid organism, but the yeast were measured in their haploid state. The dominance of these mutations is unknown, and the distribution of their fitness effects as haploids may not reflect those in diploids. The actual proportion of positive epistasis is difficult to measure and is likely to be different in different species and for different measures of epistasis [65]. Speciation studies in yeast have shown a variety of patterns for both the tempo and cause of speciation in laboratory settings, suggesting the particular rate of speciation observed in our simulations is not universal [4, 64, 6673]. But while these caveats may challenge the quantitative results, the qualitative results should be robust if the distributions of epistatic effects in nature are similar to those in the data we used.

Here, we have shown that a model of speciation can be parameterized using data from a real biological system. The resulting model can recapitulate basic patterns observed in speciation, including both hybrid incompatibility and heterosis. The model also provides new insights into the genetic basis of postzygotic isolation. Refinements of this modeling framework will be possible when additional data about gene interactions become available, leading to further understanding of the kinetics of speciation.

Materials and methods

Data and simulations

To parameterize a stochastic model of evolutionary divergence and speciation, we used the fitnesses in haploids for all 10,858 single knockouts and all 20,712,321 double knockouts in each strain reported by Costanzo et al. [22]. The fitness proxy is the colony growth rate of the knockouts. See Costanzo et al. supplemental materials for details of fitness measurements and experimental design [22, 32]. Data on the distribution of single-mutation and pairwise epistatic fitness effects in yeast were obtained from the online repository at http://www.thecellmap.org/costanzo2016/.

Our goal in using these data was to estimate the complete distribution of epistatic effects, including weak and absent effects. These weak effects, which were disregarded in the paper reporting the data, may nonetheless be evolutionarily important. To obtain this distribution, we downloaded and concatenated all data files containing fitness measurements including nonessential-by-nonessential genes, nonessential-by-essential, essential-by-essential, and the DAmP experiments. To avoid biasing the dataset against small selection coefficients, we used all measured interactions regardless of their p-value.

We write the haploid fitnesses of a single knockout at locus i and a double knockout at loci i and j (respectively) as:

Wi=(1+si),Wi,j=(1+si)(1+sj)(1+εij). (3)

Using these relations, we fit the independent selection coefficients and the pairwise epistatic selection coefficients to the yeast dataset. Since some measurements did not report either double knockout fitness or the single knockout fitness of one of the mutations, we were left with 18,743,950 interactions.

To determine which mutation will be the next to become fixed, we calculated the fitness of mutations at all loci in the genome not already fixed for mutations. The fitness of a mutation depends on both its individual effect and its interactions with mutations already fixed (as described by Eq 1). Thus, the fitness of a given mutation changes in time as the genetic background of its population evolves with the fixation of mutations at other loci. The fitnesses of all new mutations were used to calculate their probability of fixation using a standard diffusion approximation, and assuming an effective population size of 106. Finally, a mutation was randomly chosen for fixation with probability proportional to the chance that it would become fixed. This process was iterated until each population was fixed for 50 mutations. The entire process was replicated in 10,000 stochastic simulations.

Analyses and simulations were performed in R, and figures were generated using ggplot2 and cowplot packages [7476]. The scripts used to generate the simulation data and perform the following analyses are available at https://github.com/adagilis/SpeciationSimulation2018.

Analytical results

We consider two populations that are diverging in isolation and independently become fixed for different sets of mutations. To assess hybrid fitness as the populations, diverge, we assume that the number of loci at which mutations can fix is so large that there is a negligible chance the same mutation will become fixed independently in both populations. Denote as A and B the sets of loci at which mutations have fixed in populations A and B, respectively, and assume that neither population is polymorphic for derived mutations. We measure hybrid fitness relative to the mean of the two parental species:

wH=W(AB),12(W,A+W,B), (4)

where WX,Y is the absolute fitness of an individual that is heterozygous for mutations at loci in set X and homozygous for loci in set Y, ∪ represents the union operator, and ∅ is the empty set. The numerator is the absolute fitness of an F1 hybrid, which is heterozygous for mutations at loci in the set (AB), and homozygous for no mutations. The denominator represents the mean of the absolute fitnesses of the parental species. Individuals from population A are heterozygous for mutations at no loci, but homozygous for mutations at loci in set A, and likewise for individuals from population B.

Divergence in allopatry

To model allopatric population divergence, we calculated the probability of fixation for each mutation in the yeast genome using Kimura’s diffusion approximation [77]. This probability depends on the mutation’s fitness in heterozygous and homozygous states. Since the strength of selection on each mutation will depend on the genetic background in which it occurs, we use the effective selection coefficient and the effective dominance of each mutation. Let σi|A and ωi|A be the effective selection coefficient and dominance for a mutation at locus i, which accounts for the mutation’s independent fitness effect and its epistatic interactions with mutations at loci in the set A that have already fixed. Then:

σi|A=(1+si)jA(1+εij)1,ωi|A=[(1+hisi)jA(1+α2εij)]1σi|A. (5)

We assume that polymorphism is sufficiently rare that a population can be assumed fixed at all other loci when considering the fixation probability of a new mutation. (An interesting extension for future work would relax the assumption of no polymorphism [78].) When no other mutations have yet been fixed, σi|A is equal to si and ωi|A = hi.

Supporting information

S1 Text. Supplementary Text.

This file contains details about the simulation results, additional simulations under varying assumptions about epistasis and analytical derivations for the model.

(DOCX)

S1 Fig. Epistatic dominance parameters.

Schematic of the 3 possible strengths of epistatic interactions when (from left to right) both loci are heterozygous for derived (shaded) allele, one is heterozygous and one is homozygous, and both are homozygous. The red lines show the epistatic interactions. These values correspond to Turelli and Orr’s notation of H0, H1, H2 from left to right, respectively.

(TIF)

S2 Fig. Analytical solutions for relative hybrid fitness.

The effect of varying the strength of epistasis, asymmetry in divergence and dominance of epistasis to hybrid fitness. All heat maps are numerical evaluations of hybrid fitness given by Eq (2) of the main text, with default parameter values ε¯b = -0.005, ε¯w = 0.01, v = 1/4, α1 = 1/4. Varying ε¯b (top left) causes hybrid fitness to decline more rapidly when between population epistasis becomes more deleterious, and heterotic when they are positive. The inverse pattern is observed for ε¯w (top right). Increasing asymmetry (lower v values, bottom left) decreases hybrid fitness more rapidly, while increasing epistatic dominance (bottom right) retains within population epistasis causing heterosis (since |ε¯b|<|ε¯b|), while lower dominance causes more rapid speciation.

(TIF)

S3 Fig. Average epistatic effects of mutations in the yeast data set.

In yeast knockouts, the mean epistatic effect of mutations correlates with the mutations’ direct fitness effects. Each point is one of the mutations in the yeast epistatic interaction data-set. The X-axis is the direct selection coefficient observed in lab experiments, while the Y-axis is the average of all epistatic interactions involving that mutation with all other genes. The red line is the linear fit to the data (adjusted R-squared of 0.103, p<10−16).

(TIF)

S4 Fig. Fixation rates of mutations in simulations.

Observed simulation fixation rates for the 10588 mutations in the yeast epistatic network dataset. Points in red are mutations with on average positive epistasis, while points in blue are epistatically neutral or deleterious. Mutations with even highly negative direct selection coefficients are fixed frequently when their epistatic effects are largely positive.

(TIF)

S5 Fig. The fraction of substitutions that may adaptively introgress.

The fraction of substitutions in one population that are positively selected in the other population over the course of the simulation. Blue line is the mean of 10000 simulations, grey area is the 95% confidence interval. Nearly all substitutions fixed early are beneficial in the ancestral background and so will fix in both populations if gene flow allows the derived allele to cross between populations. As divergence continues, fewer and fewer of the substitutions in one population are positively selected in the genomic background of the other population. The proportion of positively selected mutations remains high even when hybrid fitness is expected to be very low (<0.2, Fig 4).

(TIF)

S6 Fig. The distribution of measures of relative contributions of interactions to hybrid fitness early and late in divergence.

The distribution of γ2 (A) and γ100 (B) values, representing the potential of each individual interaction to contribute to speciation after a total of 2 and 100 mutations have been fixed in two independently evolving populations. Since γ is a cross product of the frequency an interaction is seen in a hybrid and its fitness effects in that hybrid, positive values represent potential for heterosis, while negative ones represent reproductive isolation. The vast majority of interactions are unlikely to contribute to either process.

(TIF)

S7 Fig. The importance of interactions early vs late in divergence.

The relationship between γ2 and γ100 for all interactions. Most interactions have no effect towards hybrid speciation either early or late in the process of speciation. The interactions that are most important early in the process (large absolute γ2) do not remain important as the populations continue to diverge. This suggest that different interactions may be contributing to hybrid fitness at different times during the process of speciation.

(TIF)

S8 Fig. Summary of epistasis in the yeast data assuming additive epistasis.

Summary of network using additive epistasis measures. A) The distribution of epistatic effects. B) The proportion of positive epistatic interactions as a function of the cutoff of the absolute value of the interaction. Additive measures of epistasis lead to far fewer positive interactions of large effect, reducing the proportion of positive interactions in the network to negligible amounts if strong cutoffs are considered. C) The highly linear relationship between additive epistatic effect and direct fitness effects of mutations. D) The relationship in C is derived from the strong pattern of deleterious mutations having strongly deleterious interactions, while positively selected interactions tend to occur between positively selected mutations. E) Since positively selected mutations largely have positive interactions, the interactions likely to occur early in hybrids are on average positive.

(TIF)

S9 Fig. Simulations of the yeast data set assuming additive epistasis.

The simulation results for the additive epistatic model. (A) Under the additive model, only interactions of positive direct fitness effect tend to be fixed. (B) Since positively selected mutations share largely positive epistatic interactions, within and between population interactions slowly converge to the same values. (C) Hybrid fitness asymptotes under the additive model, since novel interactions first seen in hybrids are of roughly the same positive strength as the co-adapted interactions seen within populations (as seen in B).

(TIF)

S10 Fig. Epistasis in the yeast data set using a modified version of multiplicative epistasis.

Results using the measures of epistasis presented in Costanzo et al. (2016). A) The distribution of epistatic effects. B) The proportion of positive epistatic interactions as a function of the cutoff of the absolute value of the interaction. C) The relationship between additive epistatic effect and direct fitness effects of mutations. The two are weakly but significantly correlated (r2 = ,p<10). D)The largest epistatic effects are seen between mutations of most extreme effects (both positive and negative). E) Interactions likely to occur early in speciation are of weaker and more positive effect than interactions that are unlikely to occur early on.

(TIF)

S11 Fig. Simulation results using a modified model of multiplicative epistasis.

The simulation results for the Costanzo epistatic model. Results are qualitatively similar to the results presented in the main paper, with several exceptions. (A) The modified epistatic model results in mutations with extremely strong direct fitness effects rarely being fixed. Mutations with positive or negative epistatic effect are colored blue and red, respectively. (B) Epistatic effects between populations (red) converge to the average epistatic effect between populations as substitutions are fixed, while interactions within each population (blue) quickly become positive in the majority of the simulations. The curves capture the range of 95% of the simulations. (C) Hybrid fitness decreases more slowly than in the multiplicative case, but much faster than in the additive model. Dashed line represents simulation mean; faint grey lines give representative simulation contours.

(TIF)

S12 Fig. Relative hybrid fitness varies with dominance of epistasis in hybrids.

Relative hybrid fitness when assumptions about dominance of epistasis are modified. We consider our simulations in which α2 = 0.5. Since hybrids will only experience epistasis of strength α1ε, we reanalyze their fitness assuming epistasis seen in hybrids is recessive (blue, α1 = 0), additive (yellow, α1 = 1/2 α2) and dominant (red, α1 = α2). Solid lines represent means of simulations, shaded areas are 95% confidence intervals. As suggested by our model, highly dominant interactions lead to slower decline in fitness. Simulations demonstrate that a secondary effect is increased variance in observed hybrid fitness.

(TIF)

S13 Fig. Average epistasis in simulations assuming recessive epistatic interactions.

Average epistatic effects within (purple) and between (tan) populations. While variance of within population effects is lower than that of between populations, the two rapidly converge to the same slightly negative value. Shaded areas represent the 95% confidence interval of 2000 simulations, solid lines are means.

(TIF)

S14 Fig. Relative hybrid fitness in simulations assuming recessive epistatic interactions.

Relative hybrid fitness in simulations when epistatic effects are nearly completely recessive as mutations are fixed (α2 = 0.01). Hybrid fitness depends very little on whether epistasis seen in hybrids is recessive (blue, α1 = 0), additive (yellow, α1 = 1/2 α2), or dominant (red, α1 = α2). Hybrid fitness therefore increases linearly with respect to fixed substitutions, proportional to the average epistatic effect.

(TIF)

S15 Fig. Average epistatic effect in simulations assuming dominant epistatic interactions.

The average epistatic effect found among mutations fixed in the same population (purple) or in different populations (tan), when epistatic effects are largely dominant. Few deleterious epistatic effects can be fixed within the same population, and so epistasis within rises rapidly and continues to increase as populations diverge. Between population epistasis behaves the same as in simulations where epistasis additively dominant. Shaded areas represent the 95% confidence interval of 2000 simulations, solid lines are means.

(TIF)

S16 Fig. Relative hybrid fitness in simulations assuming dominant epistasis.

Relative hybrid fitness in simulations when epistatic effects are nearly completely dominant as they are fixed (α2 = 0.95). Because within population epistasis is large, hybrid fitness depends heavily on whether epistasis seen in hybrids is recessive (blue, α1 = 0), additive (yellow, α1 = 1/2 α2), or dominant (red, α1 = α2). When effects are recessive, hybrids lose out on co-adapted blocks and rapidly lose fitness, while interactions that display nearly the same strength as seen in parents (red) lead to extreme heterosis due to combinations of positive epistatic effects from both parents.

(TIF)

Acknowledgments

We thank our reviewers and editor for helpful comments and suggestions.

Data Availability

All novel data are within the manuscript and its Supporting Information files. The yeast epistatic interaction data were obtained from http://thecellmap.org/. The scripts used to perform the analyses are available at https://github.com/adagilis/SpeciationSimulation2018.

Funding Statement

MK was supported by National Institutes of Health grant R01-GM116853, url: www.nih.gov. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Mayr E. Systematics and the origin of species New York: Columbia Univ. Press; 1942. [Google Scholar]
  • 2.Coyne JA, Orr HA. Speciation Sunderland, MA: Sinauer Associates; 2004. [Google Scholar]
  • 3.Bolnick DI, Near TJ. Tempo of hybrid inviability in centrarchid fishes (Teleostei: Centrarchidae). Evolution. 2005;59:1754–67. [PubMed] [Google Scholar]
  • 4.Bernardes JP, Stelkens RB, Greig D. Heterosis in hybrids within and between yeast species. J Evol Biol. 2017;30(3):538–48. 10.1111/jeb.13023 . [DOI] [PubMed] [Google Scholar]
  • 5.Birchler JA, Yao H, Chudalayandi S, Vaiman D, Veitia RA. Heterosis. Plant Cell. 2010;22:2105–12. 10.1105/tpc.110.076133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Edmands S. Does parental divergence predict reproductive compatibility? Trends Ecol Evol. 2002;17(11):520–7. [Google Scholar]
  • 7.Edmands S. Heterosis and outbreeding depression in interpopulation crosses spanning a wide range of divergence. Evolution. 1999;53(6):1757–68. 10.1111/j.1558-5646.1999.tb04560.x [DOI] [PubMed] [Google Scholar]
  • 8.Schnable PS, Springer NM. Progress toward understanding heterosis in crop plants. Annu Rev Plant Biol. 2013;64:71–88. 10.1146/annurev-arplant-042110-103827 [DOI] [PubMed] [Google Scholar]
  • 9.Dobzhansky T. Genetics and the origin of species New York: Columbia University Press; 1937. [Google Scholar]
  • 10.Muller HJ. Bearing of the Drosophila work on systematics In: Huxley J, editor. The New Systematics. Oxford, UK: Oxford University Press; 1940. p. 185–268. [Google Scholar]
  • 11.Coyne JA, Orr HA. Patterns of speciation in Drosophila. Evolution. 1989;43(2):362–81. Epub 1989/03/01. 10.1111/j.1558-5646.1989.tb04233.x . [DOI] [PubMed] [Google Scholar]
  • 12.Coyne JA, Orr HA. "Patterns of speciation in Drosophila" revisited. Evolution. 1997;51:295–303. 10.1111/j.1558-5646.1997.tb02412.x [DOI] [PubMed] [Google Scholar]
  • 13.Barbash DA, Siino DF, Tarone AM, Roote J. A rapidly evolving MYB-related protein causes species isolation in Drosophila. Proc Natl Acad Sci U S A. 2003;100:5302–7. 10.1073/pnas.0836927100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Presgraves DC. A fine-scale genetic analysis of hybrid incompatibilities in Drosophila. Genetics. 2003;163:955–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Rawson PD, Burton RS. Functional coadaptation between cytochrome-c and cytochrome-c oxidase within allopatric populations of a marine copepod. Proc Natl Acad Sci U S A. 2002;99(20):12955–8. PREV200200598834. 10.1073/pnas.202335899 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Orr HA, Turelli M. The evolution of postzygotic isolation: accumulating Dobzhansky-Muller incompatibilities. Evolution. 2001;55:1085–94. [DOI] [PubMed] [Google Scholar]
  • 17.Turelli M, Orr HA. Dominance, epistasis, and the genetics of postzygotic isolation. Genetics. 2000;154:1663–79. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Matute DR, Butler IA, Turissini DA, Coyne JA. A test of the snowball theory for the rate of evolution of hybrid incompatibilities. Science. 2010;329(5998):1518–21. 10.1126/science.1193440 [DOI] [PubMed] [Google Scholar]
  • 19.Bolnick DI, Turelli M, Lopez-Fernandez H, Wainwright PC, Near TJ. Accelerated mitochondrial evolution and "Darwin's corollary": asymmetric viability of reciprocal F1 hybrids in centrarchid fishes. Genetics. 2008;178(2):1037–48. 10.1534/genetics.107.081364 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Laurie CC. The weaker sex is heterogametic: 75 years of Haldane's rule. Genetics. 1997;147(3):937–51. 90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Gavrilets S. Perspective: models of speciation: what have we learned in 40 years? Evolution. 2003;57(10):2197–215. [DOI] [PubMed] [Google Scholar]
  • 22.Costanzo M, VanderSluis B, Koch EN, Baryshnikova A, Pons C, Tan G, et al. A global genetic interaction network maps a wiring diagram of cellular function. Science. 2016;353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Guerrero RF, Muir CD, Josway S, Moyle LC. Pervasive antagonistic interactions among hybrid incompatibility loci. PLoS Genet. 2017;13(6):e1006817 10.1371/journal.pgen.1006817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kalirad A, Azevedo RB. Spiraling complexity: a test of the snowball effect in a computational model of RNA folding. Genetics. 2017;206(1):377–88. 10.1534/genetics.116.196030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Olofsson P, Livingstone K, Humphreys J, Steinman D. The probability of speciation on an interaction network with unequal substitution rates. Math Biosci. 2016;278:1–4. 10.1016/j.mbs.2016.04.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Livingstone K, Olofsson P, Cochran G, Dagilis A, MacPherson K, Seitz KA. A stochastic model for the development of Bateson–Dobzhansky–Muller incompatibilities that incorporates protein interaction networks. Math Biosci. 2012;238(1):49–53. 10.1016/j.mbs.2012.03.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Fraïsse C, Gunnarsson PA, Roze D, Bierne N, Welch JJ. The genetics of speciation: insights from Fisher's geometric model. Evolution. 2016;70(7):1450–64. 10.1111/evo.12968 [DOI] [PubMed] [Google Scholar]
  • 28.Simon A, Bierne N, Welch JJ. Coadapted genomes and selection on hybrids: Fisher's geometric model explains a variety of empirical patterns. Evol Letters. 2018;2(5):472–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Barton NH. The role of hybridization in evolution. Mol Ecol. 2001;10(3):551–68. [DOI] [PubMed] [Google Scholar]
  • 30.Martin G, Lenormand T. The fitness effect of mutations across environments: Fisher's geometrical model with multiple optima. Evolution. 2015;69(6):1433–47. 10.1111/evo.12671 [DOI] [PubMed] [Google Scholar]
  • 31.Costanzo M, Baryshnikova A, Bellay J, Kim Y, Spear ED, Sevier CS, et al. The genetic landscape of a cell. Science. 2010;327(5964):425–31. 10.1126/science.1180823 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kuzmin E, Sharifpoor S, Baryshnikova A, Costanzo M, Myers CL, Andrews BJ, et al. Synthetic genetic array analysis for global mapping of genetic networks in yeast. In: Smith JS, Burke DJ, editors. Yeast Genetics: Methods and Protocols. 360. New York, NY: Springer; 2014. p. 143–68. [DOI] [PubMed] [Google Scholar]
  • 33.Orr HA. The population genetics of speciation: the evolution of hybrid incompatibilities. Genetics. 1995;139(4):1805–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Presgraves DC. Speciation genetics: search for the missing snowball. Curr Biol. 2010;20(24):R1073–R4. 10.1016/j.cub.2010.10.056 [DOI] [PubMed] [Google Scholar]
  • 35.Gourbiere S, Mallet J. Are species real? The shape of the species boundary with exponential failure, reinforcement, and the “missing snowball”. Evolution. 2010;64(1):1–24. 10.1111/j.1558-5646.2009.00844.x [DOI] [PubMed] [Google Scholar]
  • 36.Welch JJ. Accumulating Dobzhansky-Muller incompatibilities: reconciling theory and data. Evolution. 2004;58:1145–56. [DOI] [PubMed] [Google Scholar]
  • 37.Haldane JB. Sex ratio and unisexual sterility in hybrid animals. Genetics. 1922;12(2):101–9. [Google Scholar]
  • 38.Schilthuizen M, Giesbers M, Beukeboom L. Haldane's rule in the 21st century. Heredity (Edinb). 2011;107(2):95–102. 10.1038/hdy.2010.170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Muller HJ. Isolating mechanisms, evolution and temperature. Biological Symposia. 1942;6:71–125. [Google Scholar]
  • 40.Delph LF, Demuth JP. Haldane’s rule: genetic bases and their empirical support. J Hered. 2016;107(5):383–91. 10.1093/jhered/esw026 [DOI] [PubMed] [Google Scholar]
  • 41.Brandvain Y, Pauly GB, May MR, Turelli M. Explaining Darwin’s corollary to Haldane’s rule: the role of mitonuclear interactions in asymmetric postzygotic isolation among toads. Genetics. 2014;197(2):743–7. 10.1534/genetics.113.161133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Turelli M, Moyle LC. Asymmetric postmating isolation: Darwin's corollary to Haldane's rule. Genetics. 2007;176:1059–88. 10.1534/genetics.106.065979 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Meisel RP, Connallon T. The faster-X effect: integrating theory and data. Trends Genet. 2013;29(9):537–44. 10.1016/j.tig.2013.05.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Vicoso B, Charlesworth B. Effective population size and the Faster‐X effect: an extended model. Evolution. 2009;63(9):2413–26. 10.1111/j.1558-5646.2009.00719.x [DOI] [PubMed] [Google Scholar]
  • 45.Yang J, Mezmouk S, Baumgarten A, Buckler ES, Guill KE, McMullen MD, et al. Incomplete dominance of deleterious alleles contributes substantially to trait variation and heterosis in maize. PLoS Genet. 2017;13(9):e1007019 10.1371/journal.pgen.1007019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Mezmouk S, Ross-Ibarra J. The pattern and distribution of deleterious mutations in maize. G3 (Bethesda). 2014;4(1):163–71. 10.1534/g3.113.008870 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Li L, Lu K, Chen Z, Mu T, Hu Z, Li X. Dominance, overdominance and epistasis condition the heterosis in two heterotic rice hybrids. Genetics. 2008;180:1725–42. 10.1534/genetics.108.091942 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Liang Q, Shang L, Wang Y, Hua J. Partial dominance, overdominance and epistasis as the genetic basis of heterosis in upland cotton (Gossypium hirsutum L.). PLoS One. 2015;10(11):e0143548 10.1371/journal.pone.0143548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Rhode JM, Cruzan MB. Contributions of heterosis and epistasis to hybrid fitness. American Naturalist. 2005;166:E124–E39. 10.1086/491798 [DOI] [PubMed] [Google Scholar]
  • 50.Shang L, Liang Q, Wang Y, Zhao Y, Wang K, Hua J. Epistasis together with partial dominance, over-dominance and QTL by environment interactions contribute to yield heterosis in upland cotton. Theor Appl Gen. 2016;129(7):1429–46. [DOI] [PubMed] [Google Scholar]
  • 51.Pardo-Diaz C, Salazar C, Baxter SW, Merot C, Figueiredo-Ready W, Joron M, et al. Adaptive introgression across species boundaries in Heliconius butterflies. PLoS Genet. 2012;8(6):e1002752 10.1371/journal.pgen.1002752 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Clarkson CS, Weetman D, Essandoh J, Yawson AE, Maslen G, Manske M, et al. Adaptive introgression between Anopheles sibling species eliminates a major genomic island but not reproductive isolation. Nat Comm. 2014;5:4248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Racimo F, Sankararaman S, Nielsen R, Huerta-Sánchez E. Evidence for archaic adaptive introgression in humans. Nat Rev Gen. 2015;16(6):359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Bank C, Bürger R, Hermisson J. The limits to parapatric speciation: Dobzhansky–Muller Incompatibilities in a continent–island model. Genetics. 2012;191(3):845–63. 10.1534/genetics.111.137513 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Lindtke D, Buerkle CA. The genetic architecture of hybrid incompatibilities and their effect on barriers to introgression in secondary contact. Evolution. 2015;69(8):1987–2004. 10.1111/evo.12725 [DOI] [PubMed] [Google Scholar]
  • 56.Barton N, Bengtsson BO. The barrier to genetic exchange between hybridising populations. Heredity (Edinb). 1986;57(3):357–76. [DOI] [PubMed] [Google Scholar]
  • 57.Moyle LC, Nakazato T. Hybrid incompatibility "snowballs" between Solanum species. Science. 2010;329:1521–3. 10.1126/science.1193063 [DOI] [PubMed] [Google Scholar]
  • 58.Blanquart F, Bataillon T. Epistasis and the structure of fitness landscapes: are experimental fitness landscapes compatible with Fisher’s geometric model? Genetics. 2016;203(2):847–62. 10.1534/genetics.115.182691 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kuzmin E, VanderSluis B, Wang W, Tan G, Deshpande R, Chen Y, et al. Systematic analysis of complex genetic interactions. Science. 2018;360(6386):eaao1729 10.1126/science.aao1729 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Dittrich-Reed DR, Fitzpatrick BM. Transgressive hybrids as hopeful monsters. Evol Biol. 2013;40(2):310–5. 10.1007/s11692-012-9209-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Rieseberg LH, Widmer A, Arntz AM, Burke B. The genetic architecture necessary for transgressive segregation is common in both natural and domesticated populations. Philos Trans R Soc Lond B Biol Sci. 2003;358(1434):1141–7. 10.1098/rstb.2003.1283 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Rieseberg LH, Archer MA, Wayne RK. Transgressive segregation, adaptation and speciation. Heredity (Edinb). 1999;83(4):363. [DOI] [PubMed] [Google Scholar]
  • 63.Ono J, Gerstein AC, Otto SP. Widespread genetic incompatibilities between first-step mutations during parallel adaptation of Saccharomyces cerevisiae to a common environment. PLoS Biol. 2017;15(1):e1002591 10.1371/journal.pbio.1002591 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Dettman JR, Sirjusingh C, Kohn LM, Anderson JB. Incipient speciation by divergent adaptation and antagonistic epistasis in yeast. Nature. 2007;447(7144):585–8. 10.1038/nature05856 . [DOI] [PubMed] [Google Scholar]
  • 65.Fragata I, Blanckaert A, Dias Louro MA, Liberles DA, Bank C. Evolution in the light of fitness landscape theory. Trends Ecol Evol. 2019;34(1):69–82. 10.1016/j.tree.2018.10.009 [DOI] [PubMed] [Google Scholar]
  • 66.Anderson JB, Funt J, Thompson DA, Prabhu S, Socha A, Sirjusingh C, et al. Determinants of divergent adaptation and Dobzhansky-Muller interaction in experimental yeast populations. Curr Biol. 2010;20(15):1383–8. 10.1016/j.cub.2010.06.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Eberlein C, Hénault M, Fijarczyk A, Charron G, Bouvier M, Kohn LM, et al. Hybridization is a recurrent evolutionary stimulus in wild yeast speciation. Nat Comm. 2019;10(1):923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lee HY, Chou JY, Cheong L, Chang NH, Yang SY, Leu JY. Incompatibility of nuclear and mitochondrial genomes causes hybrid sterility between two yeast species. Cell. 2008;135(6):1065–73. 10.1016/j.cell.2008.10.047 . [DOI] [PubMed] [Google Scholar]
  • 69.Scannell DR, Byrne KP, Gordon JL, Wong S, Wolfe KH. Multiple rounds of speciation associated with reciprocal gene loss in polyploid yeasts. Nature. 2006;440(7082):341–5. 10.1038/nature04562 . [DOI] [PubMed] [Google Scholar]
  • 70.Greig D. A screen for recessive speciation genes expressed in the gametes of F1 hybrid yeast. PLoS Genet. 2007;3(2):e21 10.1371/journal.pgen.0030021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Lachance M-A, Collens JD, Peng XF, Wardlaw AM, Bishop L, Hou LY, et al. Spatial Scale, Genetic Structure, and Speciation of Hawaiian Endemic Yeasts. Pacific Science. 2016;70(4):389–408. 10.2984/70.4.1 [DOI] [Google Scholar]
  • 72.Leducq J-B, Nielly-Thibault L, Charron G, Eberlein C, Verta J-P, Samani P, et al. Speciation driven by hybridization and chromosomal plasticity in a wild yeast. Nat Microbio. 2016;1(1):15003. [DOI] [PubMed] [Google Scholar]
  • 73.Hou J, Fournier T, Schacherer J. Species-wide survey reveals the various flavors of intraspecific reproductive isolation in yeast. FEMS Yeast Ressearch. 2016;16(5):fow048 10.1093/femsyr/fow048 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Wickham H. ggplot2: elegant graphics for data analysis: Springer; 2016. [Google Scholar]
  • 75.Wilke CO. Cowplot: streamlined plot theme and plot annotations for ggplot2. CRAN Repos. 2016. [Google Scholar]
  • 76.Core Team R. R: A language and environment for statistical computing. R Foundation for Statistical Computing; 2014. [Google Scholar]
  • 77.Kimura M. On the probability of fixation of mutant genes in a population. Genetics. 1962;47(6):713–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Cutter AD. The polymorphic prelude to Bateson–Dobzhansky–Muller incompatibilities. Trends Ecol Evol. 2012;27(4):209–18. 10.1016/j.tree.2011.11.004 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Text. Supplementary Text.

This file contains details about the simulation results, additional simulations under varying assumptions about epistasis and analytical derivations for the model.

(DOCX)

S1 Fig. Epistatic dominance parameters.

Schematic of the 3 possible strengths of epistatic interactions when (from left to right) both loci are heterozygous for derived (shaded) allele, one is heterozygous and one is homozygous, and both are homozygous. The red lines show the epistatic interactions. These values correspond to Turelli and Orr’s notation of H0, H1, H2 from left to right, respectively.

(TIF)

S2 Fig. Analytical solutions for relative hybrid fitness.

The effect of varying the strength of epistasis, asymmetry in divergence and dominance of epistasis to hybrid fitness. All heat maps are numerical evaluations of hybrid fitness given by Eq (2) of the main text, with default parameter values ε¯b = -0.005, ε¯w = 0.01, v = 1/4, α1 = 1/4. Varying ε¯b (top left) causes hybrid fitness to decline more rapidly when between population epistasis becomes more deleterious, and heterotic when they are positive. The inverse pattern is observed for ε¯w (top right). Increasing asymmetry (lower v values, bottom left) decreases hybrid fitness more rapidly, while increasing epistatic dominance (bottom right) retains within population epistasis causing heterosis (since |ε¯b|<|ε¯b|), while lower dominance causes more rapid speciation.

(TIF)

S3 Fig. Average epistatic effects of mutations in the yeast data set.

In yeast knockouts, the mean epistatic effect of mutations correlates with the mutations’ direct fitness effects. Each point is one of the mutations in the yeast epistatic interaction data-set. The X-axis is the direct selection coefficient observed in lab experiments, while the Y-axis is the average of all epistatic interactions involving that mutation with all other genes. The red line is the linear fit to the data (adjusted R-squared of 0.103, p<10−16).

(TIF)

S4 Fig. Fixation rates of mutations in simulations.

Observed simulation fixation rates for the 10588 mutations in the yeast epistatic network dataset. Points in red are mutations with on average positive epistasis, while points in blue are epistatically neutral or deleterious. Mutations with even highly negative direct selection coefficients are fixed frequently when their epistatic effects are largely positive.

(TIF)

S5 Fig. The fraction of substitutions that may adaptively introgress.

The fraction of substitutions in one population that are positively selected in the other population over the course of the simulation. Blue line is the mean of 10000 simulations, grey area is the 95% confidence interval. Nearly all substitutions fixed early are beneficial in the ancestral background and so will fix in both populations if gene flow allows the derived allele to cross between populations. As divergence continues, fewer and fewer of the substitutions in one population are positively selected in the genomic background of the other population. The proportion of positively selected mutations remains high even when hybrid fitness is expected to be very low (<0.2, Fig 4).

(TIF)

S6 Fig. The distribution of measures of relative contributions of interactions to hybrid fitness early and late in divergence.

The distribution of γ2 (A) and γ100 (B) values, representing the potential of each individual interaction to contribute to speciation after a total of 2 and 100 mutations have been fixed in two independently evolving populations. Since γ is a cross product of the frequency an interaction is seen in a hybrid and its fitness effects in that hybrid, positive values represent potential for heterosis, while negative ones represent reproductive isolation. The vast majority of interactions are unlikely to contribute to either process.

(TIF)

S7 Fig. The importance of interactions early vs late in divergence.

The relationship between γ2 and γ100 for all interactions. Most interactions have no effect towards hybrid speciation either early or late in the process of speciation. The interactions that are most important early in the process (large absolute γ2) do not remain important as the populations continue to diverge. This suggest that different interactions may be contributing to hybrid fitness at different times during the process of speciation.

(TIF)

S8 Fig. Summary of epistasis in the yeast data assuming additive epistasis.

Summary of network using additive epistasis measures. A) The distribution of epistatic effects. B) The proportion of positive epistatic interactions as a function of the cutoff of the absolute value of the interaction. Additive measures of epistasis lead to far fewer positive interactions of large effect, reducing the proportion of positive interactions in the network to negligible amounts if strong cutoffs are considered. C) The highly linear relationship between additive epistatic effect and direct fitness effects of mutations. D) The relationship in C is derived from the strong pattern of deleterious mutations having strongly deleterious interactions, while positively selected interactions tend to occur between positively selected mutations. E) Since positively selected mutations largely have positive interactions, the interactions likely to occur early in hybrids are on average positive.

(TIF)

S9 Fig. Simulations of the yeast data set assuming additive epistasis.

The simulation results for the additive epistatic model. (A) Under the additive model, only interactions of positive direct fitness effect tend to be fixed. (B) Since positively selected mutations share largely positive epistatic interactions, within and between population interactions slowly converge to the same values. (C) Hybrid fitness asymptotes under the additive model, since novel interactions first seen in hybrids are of roughly the same positive strength as the co-adapted interactions seen within populations (as seen in B).

(TIF)

S10 Fig. Epistasis in the yeast data set using a modified version of multiplicative epistasis.

Results using the measures of epistasis presented in Costanzo et al. (2016). A) The distribution of epistatic effects. B) The proportion of positive epistatic interactions as a function of the cutoff of the absolute value of the interaction. C) The relationship between additive epistatic effect and direct fitness effects of mutations. The two are weakly but significantly correlated (r2 = ,p<10). D)The largest epistatic effects are seen between mutations of most extreme effects (both positive and negative). E) Interactions likely to occur early in speciation are of weaker and more positive effect than interactions that are unlikely to occur early on.

(TIF)

S11 Fig. Simulation results using a modified model of multiplicative epistasis.

The simulation results for the Costanzo epistatic model. Results are qualitatively similar to the results presented in the main paper, with several exceptions. (A) The modified epistatic model results in mutations with extremely strong direct fitness effects rarely being fixed. Mutations with positive or negative epistatic effect are colored blue and red, respectively. (B) Epistatic effects between populations (red) converge to the average epistatic effect between populations as substitutions are fixed, while interactions within each population (blue) quickly become positive in the majority of the simulations. The curves capture the range of 95% of the simulations. (C) Hybrid fitness decreases more slowly than in the multiplicative case, but much faster than in the additive model. Dashed line represents simulation mean; faint grey lines give representative simulation contours.

(TIF)

S12 Fig. Relative hybrid fitness varies with dominance of epistasis in hybrids.

Relative hybrid fitness when assumptions about dominance of epistasis are modified. We consider our simulations in which α2 = 0.5. Since hybrids will only experience epistasis of strength α1ε, we reanalyze their fitness assuming epistasis seen in hybrids is recessive (blue, α1 = 0), additive (yellow, α1 = 1/2 α2) and dominant (red, α1 = α2). Solid lines represent means of simulations, shaded areas are 95% confidence intervals. As suggested by our model, highly dominant interactions lead to slower decline in fitness. Simulations demonstrate that a secondary effect is increased variance in observed hybrid fitness.

(TIF)

S13 Fig. Average epistasis in simulations assuming recessive epistatic interactions.

Average epistatic effects within (purple) and between (tan) populations. While variance of within population effects is lower than that of between populations, the two rapidly converge to the same slightly negative value. Shaded areas represent the 95% confidence interval of 2000 simulations, solid lines are means.

(TIF)

S14 Fig. Relative hybrid fitness in simulations assuming recessive epistatic interactions.

Relative hybrid fitness in simulations when epistatic effects are nearly completely recessive as mutations are fixed (α2 = 0.01). Hybrid fitness depends very little on whether epistasis seen in hybrids is recessive (blue, α1 = 0), additive (yellow, α1 = 1/2 α2), or dominant (red, α1 = α2). Hybrid fitness therefore increases linearly with respect to fixed substitutions, proportional to the average epistatic effect.

(TIF)

S15 Fig. Average epistatic effect in simulations assuming dominant epistatic interactions.

The average epistatic effect found among mutations fixed in the same population (purple) or in different populations (tan), when epistatic effects are largely dominant. Few deleterious epistatic effects can be fixed within the same population, and so epistasis within rises rapidly and continues to increase as populations diverge. Between population epistasis behaves the same as in simulations where epistasis additively dominant. Shaded areas represent the 95% confidence interval of 2000 simulations, solid lines are means.

(TIF)

S16 Fig. Relative hybrid fitness in simulations assuming dominant epistasis.

Relative hybrid fitness in simulations when epistatic effects are nearly completely dominant as they are fixed (α2 = 0.95). Because within population epistasis is large, hybrid fitness depends heavily on whether epistasis seen in hybrids is recessive (blue, α1 = 0), additive (yellow, α1 = 1/2 α2), or dominant (red, α1 = α2). When effects are recessive, hybrids lose out on co-adapted blocks and rapidly lose fitness, while interactions that display nearly the same strength as seen in parents (red) lead to extreme heterosis due to combinations of positive epistatic effects from both parents.

(TIF)

Data Availability Statement

All novel data are within the manuscript and its Supporting Information files. The yeast epistatic interaction data were obtained from http://thecellmap.org/. The scripts used to perform the analyses are available at https://github.com/adagilis/SpeciationSimulation2018.


Articles from PLoS Genetics are provided here courtesy of PLOS

RESOURCES