Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2020 Nov 23;74(12):2575–2590. doi: 10.1111/evo.14116

The geometry and genetics of hybridization

Hilde Schneemann 1,2,, Bianca De Sanctis 1,3, Denis Roze 4,5, Nicolas Bierne 6, John J Welch 1
PMCID: PMC7839769  PMID: 33150956

Abstract

When divergent populations form hybrids, hybrid fitness can vary with genome composition, current environmental conditions, and the divergence history of the populations. We develop analytical predictions for hybrid fitness, which incorporate all three factors. The predictions are based on Fisher's geometric model, and apply to a wide range of population genetic parameter regimes and divergence conditions, including allopatry and parapatry, local adaptation, and drift. Results show that hybrid fitness can be decomposed into intrinsic effects of admixture and heterozygosity, and extrinsic effects of the (local) adaptedness of the parental lines. Effect sizes are determined by a handful of geometric distances, which have a simple biological interpretation. These distances also reflect the mode and amount of divergence, such that there is convergence toward a characteristic pattern of intrinsic isolation. We next connect our results to the quantitative genetics of line crosses in variable or patchy environments. This means that the geometrical distances can be estimated from cross data, and provides a simple interpretation of the “composite effects.” Finally, we develop extensions to the model, involving selectively induced disequilibria, and variable phenotypic dominance. The geometry of fitness landscapes provides a unifying framework for understanding speciation, and wider patterns of hybrid fitness.

Keywords: Fisher's geometric model, hybrid fitness, line crosses, quantitative genetics, speciation


When genetically distinct populations meet and mate, their divergent alleles are brought together in new combinations. The fitness of these novel genotypes will influence the outcome of the hybridization, which might be as various as population fusion, hybrid speciation, or reinforcement selection for new prezygotic barriers.

Broadly speaking, the fitness of hybrids might depend on three factors. The first factor is the type of hybrid genotype. For example, the initial F1 cross is often fitter than subsequent crosses, and even than the parental lines (Frankel 1983; Price and Bouvier 2002; Escobar et al. 2008; Fraïsse et al. 2016). The second factor is the environmental conditions in which the hybrid is formed (Bordenstein and Drapeau 2001). For example, when parental lines are adapted to different habitats, hybrids might be selected against in those habitats, and yet enjoy a selective advantage in novel environments, due to transgressive variation (Moore 1977; Yakimowski and Rieseberg 2014). The third factor is the divergence history of the parental lines. This includes not only the amount of divergence (Bateson 1978; Waser 1993; Edmands 2002), but also how it was accrued. For example, beneficial heterosis is often observed when parental lines have been subject to severe inbreeding, such that the divergence comprises partially recessive deleterious mutations (Wright 1922; Neal 1935). This third factor implies that the outcome of hybridization might be used to make inferences about the divergence history (Dobzhansky 1937; Lynch 1991; Rundle and Whitlock 2001; Gavrilets 2004; Welch 2004; Demuth and Wade 2005; Rosas et al. 2010; Fraïsse et al. 2016; Yamaguchi and Otto 2020).

To study all of these factors, and the interactions between them, one approach uses the quantitative genetics of line crosses (Cockerham 1980; Hill 1982; Lynch 1991; Lynch and Walsh 1998, chs. 9‐10; Rundle and Whitlock 2001; Demuth and Wade 2005). This approach is fully general, and widely applied, but it does have limitations. With the quantitative genetics of single populations, a large body of theory can help us to interpret the variance components (Hill et al. 2008; Mäki‐Tanila and Hill 2014; Barton 2017; Walsh and Lynch 2018), but with line crosses, the “composite effects” are more difficult to interpret. For this reason, a lot of research on hybridization uses fitness landscapes (Dobzhansky 1937; Hill 1982; Orr 1995; Gavrilets 2004). With this alternative approach, clear understanding can come from simple models, with a few, biologically meaningful parameters. But such models can be difficult to fit to data, and often apply to a limited range of cases (such as the evolution of intrinsic incompatibilities).

Here, following previous authors, we combine these two approaches (Lynch 1991; Demuth and Wade 2005; Yamaguchi and Otto 2020), drawing an explicit connection between the quantitative genetics of line crosses (Hill 1982; Rundle and Whitlock 2001), and a class of fitness landscapes based on Fisher's geometric model (Fisher 1930, ch. 2). Fisher's model is a well‐studied model of both divergence (Hartl and Taubes 1998; Orr 1998b; Walsh and Lynch 2018, ch. 27) and hybridization, and it can account for a large number of empirical patterns (Mani and Clarke 1990; Barton 2001; Rosas et al. 2010; Chevin et al. 2014; Fraïsse et al. 2016; Simon et al. 2018; Thompson et al. 2019; Yamaguchi and Otto 2020). As such, it allows us to explore the effects of hybrid genome composition, environmental heterogeneity, and parental divergence history, all in a common framework.

This article is in three parts. In Part 1, we rederive analytical predictions from Fisher's model (Simon et al. 2018), and show that they apply to very wide range of divergence conditions and population genetic parameter regimes. We also present the results in a new way, to clarify the geometrical and biological meaning of the key quantities. In Part 2, we connect Fisher's model to quantitative genetics, providing a simple interpretation of the composite effects. We then express results for standard line crosses in different environments, unifying results from previous studies (Wright 1922; Lynch 1991; Hatfield and Schluter 1999; Rundle and Whitlock 2001; Chevin et al. 2014; Simon et al. 2018; Yamaguchi and Otto 2020). Finally, in Part 3, we introduce two extensions to the model, involving selectively induced associations between heterospecific alleles, and phenotypic dominance. These extensions address cases where the simplest model gives misleading or implausible predictions. We end by discussing some implications of our results for understanding the process of speciation.

Fisher's Geometric Model and Hybridization

MODEL DESCRIPTION AND NOTATION

Description of hybrid genotypes

We consider hybrids between two diploid parental lines P1 and P2, which differ by d substitutions. For simplicity, we ignore genetic variation within the parental lines at the time of hybridization, so that d is equal to the genetic distance between hybridizing individuals. This will be a reasonable approximation if within‐line variation is much smaller than between‐line divergence.

Hybrids will contain some combination of alleles from the two parental lines. We characterize hybrid genotypes in terms of their heterozygosity, p12, and hybrid index, h. The heterozygosity is the proportion of the d divergent sites where the hybrid carries one allele from each line; the hybrid index is the total proportion of the divergent alleles that come from line P2. As such, h ranges from 0 for a pure P1 genotype, to 1 for a pure P2 genotype. We will also use the notation p1 and p2 to refer to the proportion of divergent sites that are homozygous for alleles from P1 and P2, such that p1+p2+p12=1, and

hp2+12p12 (1)

(Turelli and Orr 2000; Simon et al. 2018). Our overall aim is to show how hybrid fitness varies with h and p12, and how this variation might be determined by environmental conditions and the history of parental divergence.

Fisher's model as a fitness landscape

Under Fisher's geometric model, each genotype is associated with the values of n continuously varying phenotypic traits, and so it can be represented as a point in an n‐dimensional trait space: z=(z1,z2,,zn). The fitness of the genotype depends on the distance of its phenotype from an optimal phenotype: o=(o1,o2,,on). We use a weighted Euclidean distance:

zoλi=1nλi(zioi)2, (2)

where the λi determine the strength of selection on each trait. Fitness is a decreasing function of distance (Turelli and Moyle 2007; Simon et al. 2018), such as

lnw=αzoλk, (3)

where α is a constant, and k denotes the curvature of the fitness landscape, that is, how quickly fitness declines with the distance from the optimum (Peck et al. 1997; Tenaillon et al. 2007; Fraïsse et al. 2016; Fraïsse and Welch 2019). This model assumes a single phenotypic optimum at any given time and location, but the position of the optimum can vary in space and time, so that we can investigate divergence and hybridization under different environmental conditions.

How seriously should we view this simple phenotypic model? Although one or a few of the traits might be identified with real‐world quantitative traits, which might be measured in the field (Barton 1990; Thompson 2019), it is usually a mistake to treat all n of the traits in this way. Instead, Fisher's model is best viewed as an approximation to phenotypic models that are more realistic but less tractable (such as models of gene‐regulatory networks, as studied in systems biology; Martin 2014; Schiffman and Ralph 2017; Fraïsse and Welch 2019), or simply as a mapping between genotype and fitness. Under this interpretation, n is a parameter of the distribution of fitness effects, with no explicit phenotypic interpretation (and no effect on our major results—see below). Nevertheless, as we show below, thinking about the trait values is a useful way of deriving and interpreting testable predictions about hybrid fitness.

Hybrids under Fisher's model

Figure 1 shows how Fisher's model can be used to study hybridization. Under the model, the parental lines, P1 and P2, are represented by points in n‐dimensional trait space. Each of the d substitutions, which differentiate the lines, are represented as n‐dimensional vectors of change, notated mj for j=1d. In any given genome, each of these substitutions can appear in either homozygous or heterozygous form. These substitutions accrued during the evolutionary divergence of P1 and P2 from their most recent common ancestor (MRCA), and so they can be represented as a chain, passing through the ancestral phenotype (Fig. 1). However, there is no important distinction between ancestral and derived alleles in this model, and so we define the direction of each substitution such that the chain starts at P1.

Figure 1.

Figure 1

Under Fisher's geometric model, each genotype is associated with the values of n quantitative traits (illustrated with n=2), and its fitness depends on the distance of this phenotype from an optimum. The position of the optimum can change over time and space. Shown are two parental lines, P1 and P2, which differ by d=7 substitutions, each represented by a vector, denoted mj for j=1..d. These vectors represent the effects of the P2 allele, whether this is derived or ancestral. But they are ordered so that the chain passes through the most recent common ancestor (MRCA) of P1 and P2. Also shown are cartoons of the parental genomes, with the derived alleles as colored boxes. Hybrid genomes contain a mix of the parental alleles. In the hybrid shown, 1/7 of the divergent sites contains an allele from each line, so that p12=1/7; and two further P2 alleles are present as homozygotes (one ancestral and one derived), yielding a hybrid index of h=2.5/7. The illustration shows that fixed differences can be physically linked. Such linkage reduces the variance in the hybrid indexes within a given cross, but plays no other role in our analyses.

With this definition, the P2 genotype contains all of the substitutions in homozygous state, and so its phenotype on trait i=1n can be written as

zP2,i=zP1,i+j=1dmij (4)

where zP1,i denotes the value of trait i in parental line P1, and the mij describe the effects on trait i of introducing substitution j (i.e., the P2 allele, whether derived or ancestral). The phenotype of any given genotype can be written in the same way:

zi=zP1,i+jJhommij+jJhet12mij (5)

where Jhom denotes the subset of d×p2 loci that are homozygous for the P2 allele, and Jhet denotes the nonoverlapping subset of d×p12 loci that are heterozygous.

The key assumption of equations (4) and (5) is that all substitutions act additively on all traits, both within and between loci. This means that the phenotypic effects of a substitution will not depend on the genomic background (although its fitness effects can vary). To make equation (5) useful, we will make a second key assumption: that Jhom and Jhet can be treated as randomly chosen subsets of loci. Both of these assumptions—phenotypic additivity, and random choice of loci in hybrids—play a major role in the results below, but both are relaxed in the final section.

A useful distance measure

Results below will concern fitness, but in most cases, we will not work with fitness directly (Turelli and Orr 2000; Demuth and Wade 2005; Turelli and Moyle 2007). Instead, following Simon et al. (2018), we express most results in terms of a scaled squared distance to the optimum, which for brevity we will call “distance”. In particular, for a hybrid H, with phenotype zH (as defined via eq. 5), this distance is defined as

rHO24zHoλ2d1λ2 (6)

where 1 is the unit vector, so that the scaling factor is

1λ2=i=1nλi (7)

The major purpose of this scaling is to remove any dependence on parameters such as n and the λi, so that our results depend solely on distances. The most important distances are between the optimum and the two parental phenotypes; these are illustrated in Figure 2A.

r1O24zP1oλ2d1λ2r2O24zP2oλ2d1λ2 (8)

Each of these distances is a scaled transformed fitness (eq. 3), or, equivalently, a measure of maladaptation to the current environmental conditions, such that larger distances correspond to lower fitness. We return below to the interpretation of the scaling factor.

Figure 2.

Figure 2

Predictions for hybrid fitness depend on a small number of geometric quantities. The distances are defined in some multi‐dimensional trait space, but are estimable, in principle. (A) With additive phenotypes, and a single environmental optimum, predictions depend on just three distances: the distances of the two parental phenotypes from the optimum (r1O2 and r2O2), and the distance between the parental phenotypes (r122). Results can also be written in terms of ρ, which measures the extent to which the parental populations are maladapted to the current environment in similar ways. Also shown is the midparental phenotype, P. This is the expected phenotype of balanced hybrids, and so hybrid advantage is maximized when P coincides with the optimum. (B) With two environments, A and B, characterized by different optima, one measure of local adaptation is ρ: the cosine similarity between the vectors linking the optima, and the parental phenotypes. When the two parental phenotypes are close to the two optima, results depend on r122 alone. (C) With variable phenotypic dominance, results depend on the phenotype of the global heterozygote, G, which is equivalent to the initial F1 cross under strictly biparental inheritance and expression, and which may differ from the midparental phenotype, P. In the example shown, G is closer to the P2 phenotype than the P1 phenotype, this implies directional dominance, with P2 alleles being dominant on average.

THE BROWNIAN BRIDGE APPROXIMATION

Simon et al. (2018) presented an approximation for the expected distance from the optimum of a hybrid with a given hybrid index and heterozygosity. This quantity is notated as E(rHO2|h,p12), where the expectation is defined over all hybrid genotypes with the same values of h and p12. In this section, and Appendix 1 in the Supporting Information, we rederive this Brownian bridge approximation, and clarify its assumptions. We also present the key results in a new form, which clarifies their biological meaning (see next section).

The first assumption of the approximation is that the fixed effects on each trait, the mij, can be treated as realizations of normally distributed random variables with unit variances and no correlations between traits. This is justified by a transformation called simultaneous diagonalization, introduced in a very similar context by Martin and Lenormand (2006), and described in detail in Appendix 1 section A1.1 in the Supporting Information. The transformation assumes that the original distribution of effects is multivariate Gaussian (Waxman and Welch 2005; Martin and Lenormand 2006). However, if the divergence, d, is sufficiently large, then the summations in equation (5) lead to central‐limit‐type behavior, and so to approximate normality in a wider range of cases (Barton et al. 2017, section 3.2). After the transformation is applied, the λi capture between‐trait differences in both the strength of selection, and the typical sizes of fixed factors.

Given statistical independence of the trait values, we have

ErHO2h,p12=4λiVarzidλi+4λiE2zioidλi (9)
V+M (10)

To derive the quantities V and M, we will next treat the mij on each trait as the increments of a Brownian bridge, that is, a random walk or Brownian motion, constrained at each end by the parental phenotypes, and split into d equal steps (Revuz and Yor 1999; Simon et al. 2018). For the term V, which captures the variation in hybrid phenotypes, Appendix 1 section A1.2 in the Supporting Information shows that

V=4p2(1p2)+p12(1p12)4p2p12 (11)
=4h(1h)p12 (12)

where the three terms in equation (11), come from variation in the effects of homozygous P2 alleles, variation in the effects of heterozygous alleles, and a negative covariance term (because a given allele cannot appear in both homozygous and heterozygous state in the same genome).

For the term M, we note that the expected hybrid phenotype will lie on the line connecting the parental phenotypes (Figure 2A). Appendix 1 section A1.2 in the Supporting Information shows that

M(1h)2r1O2+h2r2O2+2h(1h)r1Or2Oρ (13)
=r.O2+h12r2O2r1O2h(1h)r122 (14)

where we have used equation (8), and the notation r.O212(r1O2+r2O2). We also use two new quantities that are illustrated in Figure 2A. The first quantity, ρ, which appears in equation (13), is the “cosine similarity” of the vectors connecting the parental phenotypes to the optimum. ρ can vary between 1, when these vectors point in the same direction, and −1, when they point in opposite directions (see also eq. 41 in the Supporting Information). The second quantity, r122, which appears in equation (14), is the distance between the parental phenotypes:

r1224zP1zP2λ2d1λ2 (15)

ρ and r122 are related to each other via the cosine rule

r122=r1O2+r2O22r1Or2Oρwhere1ρ1 (16)

We now have an expectation for the expected fitness of any type of hybrid, using only its genomic composition (p12 and h), and the three distances, r1O2, r2O2, and r122. Notably, we have assumed almost nothing about the history of divergence between the parental lines, nor their common ancestral state. One way to understand this is to imagine all the possible paths between P1 and P2 that could be obtained by shuffling the order of the d substitution steps. One of these paths is the true history of divergence, and passes through the MRCA, but hybrids can lie on any of the paths. As a result, the Brownian bridge approximation is based on the notion of a random walk, but it does not require that the true process of divergence resembled a random walk.

To verify that the Brownian Bridge approximation works well for a wide range of divergence histories, Appendix 2 in the Supporting Information presents an individual‐based simulation study. We simulated the divergence between diploid populations under a full population genetic model, varying the parameter regime, and the patterns of demographic and environmental change. Our simulations included divergence in allopatry, and in parapatry, with ongoing gene flow (Endler 1977). We also simulated different levels of drift, selection, standing variation, and recombination. Results show that the Brownian bridge approximation is robust in all cases.

BIOLOGICAL INTERPRETATION

Let us now consider a group of hybrids that might vary in their values of h and p12. Combining results above, the expected distance of these hybrids from the optimum is

ErHO2=r.O2+h¯12r2O2r1O2p¯12+h¯1h¯Varh4r122 (17)

where p¯12 is the mean level of heterozygosity in the hybrids of interest, and h¯ and Var(h) are the mean and variance of their hybrid indexes. All four of the terms in equation (17) have a clear biological interpretation. First, and simplest, r.O2 is the mean distance to the optimum of the parental lines, as measured in the environment where the hybrids were scored; it tells us that hybrids will be fitter, on average, if their parents are fitter, on average, in the current environment. The second term depends on the difference in the parental distances in the current environment: (h¯12)(r2O2r1O2); it tells us that hybrids will be fitter if they contain more alleles from the fitter parent. The third term, p¯12, is an intrinsic benefit of heterozygosity; it states that, for any given value of h, hybrids are fitter when they are more heterozygous. This is a form of heterosis (Frankel 1983). The effect is “intrinsic” because, unlike the two previous terms, it does not depend on the current position of the environmental optimum.

The final term in equation (17) is the intrinsic effect of admixture. The level of admixture is measured by h¯(1h¯)Var(h). This means that admixture is low when most alleles come from one of the parental lines (such that h¯ is close to 0 or 1), or when there is a mix of alleles in the population, but most individual genomes are close to one or other parental type, such that Var(h) is close to its maximal value: h¯(1h¯). The admixture level is highest for a collection of balanced hybrids, where h=1/2 for all individuals. The effect of admixture is “intrinsic,” because it depends on the distance r122, which depends on the parental phenotypes, but not on the current position of the optimum (Fig. 2A; eq. 15). The effect changes qualitatively with the size of r122. When r122<4, admixture brings a net fitness cost. This reflects the breaking up of co‐adapted gene complexes in the parental lines (Lynch 1991; Wallace 1991; Simon et al. 2018). When r122>4, admixture brings a net fitness benefit. This reflects the potential benefits of transgressive variation in hybrids (Yakimowski and Rieseberg 2014), and this benefit increases with r122.

Although each of the four terms of equation (17) has a different interpretation, variation among them is constrained. This is because the distances r1O2, r2O2, and r122 are connected to each other by the geometry (Fig. 2A; eq. 16). Most importantly, parental lines cannot be, at once, highly divergent phenotypically (such that r1221), and close to the same environmental optimum (such that r.O20). This implies that large benefits of admixture are impossible without large costs due to parental maladaptation. However, the relative sizes of these terms can vary, according to the current position of the optimum. For a given value of r122, E(rHO2) is minimized when the optimum matches the midparental phenotype, denoted P in Figure 2A. (In this case, r1O2=r2O2 and ρ=1, such that, from eq. 16, r.O2=14r122). This restates a result of Yamaguchi and Otto (2020), and also follows intuitively: hybrids will be fittest when the optimal phenotype is exactly intermediate between the parental phenotypes (Moore 1977).

HYBRID FITNESS AND THE PROCESS OF DIVERGENCE

The results above show that the outcomes of hybridization depend strongly on the value of r122. So what exactly does this quantity measure? The numerator of equation (15) is zP1zP2λ2: the amount of phenotypic divergence that resulted from the genomic divergence between the parental lines. In Appendix 1 section A1.5 in the Supporting Information, we show that the denominator, d1λ2, is equivalent to the expected phenotypic divergence under an unconstrained random walk in phenotypic space, conditional on the walk having d steps, and a similar distribution of effect sizes to the observed data (as parameterized by the λi). For this reason, r122 can be thought of as the observed amount of phenotypic divergence, divided by the expected amount of phenotypic evolution under a random walk.

As shown in Figure 3A–D, this implies that the value of r122 contains some information about the mode of divergence between the populations.

Figure 3.

Figure 3

(A)–(D) the distance r122 can vary systematically with the mode of divergence between the parents. The variation depends on the chain of d substitutions that differentiate the parental lines, and compares their trajectory to a random walk with the same number of steps, and distribution of effect sizes. (A) When substitutions form a more‐or‐less direct path between the parental phenotypes, the observed phenotypic difference is greater than would be predicted under a comparable random walk; this implies that r122>4 with a maximum at max(r122)=4d. (B) When the true path of divergence really did resemble a random walk, then r1224 is expected. This might happen if stabilizing selection on the phenotype was ineffective, or if the optimum value wandered erratically. Systematically smaller values of r122 are predicted under two conditions. Either (C) genomic divergence continued, despite effective stabilizing selection on the phenotype, leading to “system drift.” Or (D) populations successfully tracked environmental optima, but without leading to a straight path of substitutions. (E)–(G) r122 also plays a key role in determining patterns of hybrid fitness, especially with locally adapted parents. Results are shown for the standard crosses, in three different environments, where the optimal phenotype coincides with: (E) the P1 phenotype, (F) the P2 phenotype, and (G) the midparental phenotype. Color shows variation in r122, including the inflection points at r122=4/3 (equal fitnesses for the F1 and fitter backcross), and r122=4/7 (equal fitnesses for the less fit parent and the less fit backcross). All results use equation (17), with Var(h)=0, and the appropriate values of h¯ and p¯12 for each cross; but results are shown on an arbitrary scale, such that fitter hybrids are higher on the plots.

For example, if r122>4, then the parents show more phenotypic divergence than expected under a random walk. This implies that the d substitutions, which connect the parental phenotypes, form a chain with relatively little meandering or changing of direction. In terms of fitness effects, this implies that the P2 alleles are largely exchangeable, with all having similar effects on fitness in any given background. Indeed, we show in Appendix 1 section A1.5 in the Supporting Information that r122 approaches a maximum of max(r122)=4d, when the P2 alleles are completely exchangeable, causing changes of the same size and direction. This sort of pattern is unlikely to arise without positive selection, and so an observation of r1224 suggests that the parental lines diverged via positive selection, either acting in one population alone, or in both populations, but in opposite directions in phenotypic space. This is illustrated in Figure 3A.

Similarly, an observation of r1224 is expected if the parental phenotypes really did diverge by random‐walk‐like evolution (Fig. 3B). One way this might occur is if parental lines fixed mutations regardless of their fitness costs, for example, under severe inbreeding. It is notable that the intrinsic effects of admixture vanish when r1224 (eq. 17). This agrees with the empirical observation that heterozygosity, rather than admixture level, is the major determinant of fitness in crosses between inbred lines (Wright 1922; Neal 1935; Simon et al. 2018).

Finally, if r122<4, then populations have accrued less phenotypic divergence than would be expected under a random walk. This could occur in two quite distinct ways. First, stabilizing selection might maintain the phenotype at a (more‐or‐less) stationary optimum, while still allowing for divergence at the genomic level, perhaps by nearly neutral evolution (Barton 1989; Hartl and Taubes 1998). This process is closely related to “system drift” (Rosas et al. 2010; Schiffman and Ralph 2017), and is illustrated in Figure 3C. Alternatively, divergence could involve adaptation to a moving optimum, but without leading to a straight path of substitutions connecting P1 and P2. In the simplest case, this could arise if the two populations adapted, independently, to identical environmental change (Mani and Clarke 1990), because the chain of substitutions would then change direction as it passed through the common ancestor. A more complex example, involving an oscillating optimum, is illustrated in Figure 3D. In both cases, the result is a chain of substitutions whose start and end points are closer together than would be expected under a random walk, such that r122<4.

Importantly, over large periods of time, at least one of these two processes—system drift, or environmental change that does not lead to a straight path—is very likely to occur. It is unlikely that populations will diverge for large periods by fixing exchangeable alleles. As such, at very large divergences, it becomes increasingly likely that r1220 will hold. In fact, it follows directly from equations (8) and (15) that all of the key distances shown in Figure 2A will tend to vanish at large divergences, and we have the limit:

limdErHO2=4h¯1h¯4Varhp¯12 (18)

where the scaling of equation (6) ensures that equation (18) is bounded at 0 and 1. Biologically, equation (18) implies that, as populations diverge genetically, both extrinsic fitness effects (as determined by r1O2 and r2O2), and any intrinsic benefits of admixture (as determined by r122), will tend to become less and less important. The model predicts convergence to a characteristic pattern of intrinsic isolation between the parental lines, where a fixed cost of admixture is mitigated by a fixed benefit of heterozygosity.

In Appendix 2 in the Supporting Information, we present simulations supporting all of the arguments in this section, namely the relationship between r122 and the mode of divergence, and the convergence over time to equation (18).

FISHER'S MODEL AND THE QUANTITATIVE GENETICS OF LINE CROSSES

In this second part of the article, we show how the distances in phenotypic space, which govern the fitness of hybrids, relate to measurable quantities. To do this, we consider fitness measures from controlled crosses, which differ in their values of h¯ and p¯12. These crosses include the initial F1 (P1×P2: h¯=1/2, p¯12=1), the F2 (F1×F1: h¯=p¯12=1/2), and the reciprocal backcrosses (BC(P1)=F1×P1: h¯=1/4, p¯12=1/2; and BC(P2)=F1×P2: h¯=3/4, p¯12=1/2). The variance in the hybrid index, Var(h), depends on both the cross type, and the level of segregation and recombination (Lynch and Walsh 1998, ch. 9). In particular, if c¯ is the mean rate of recombination among pairs of loci, then Var(h)(12c¯)/4 among F1 gametes (Zeng et al. 1990; Lynch and Walsh 1998); and so, with random union of gametes, Var(h) will be half of this value for the F2, and a quarter of this value for backcrosses. However, c¯0.5 for many species (Lynch and Walsh 1998, ch. 9), and in those cases, Var(h) can often be neglected.

THE COMPOSITE EFFECTS UNDER FISHER'S MODEL

Let us begin by following Hill (1982; see also Lynch 1991; Lynch and Walsh 1998), and writing the expected value of an arbitrary trait in a cross as

μ=μ0+θSα1+θS2α2+θHδ1+θH2δ2+θSθHα1δ1+ (19)

where θS12h¯ and θH2p¯121. The curly brackets contain the “composite effects,” which are defined in Table 1 (and noting that, in this standard notation, {α1δ1} describes an interaction term, and not a product). Equation (19) shows only pairwise effects, but the model also includes higher order terms.

Table 1.

Composite effects under Fisher's geometric model

Composite effect Geometric model prediction
Single environment Two environments Local adaptation Phenotypic dominance
{α1}
additive
12(r1O2r2O2)
12(r1.2r2.2)
0
12(r1O2r2O2)14(r1G2r2G2)
{δ1}
dominance
12
12
12
12(1+rPO2rGO2)
{α2}
additive‐by‐additive epistasis
14r1221
14r1221
14r1221
14r1221
{δ2}
dominance‐by‐dominance 0 0 0
14rGP2v
{α1δ1}
additive‐by‐dominance 0 0 0
14(r1G2r2G2)
{ε}
environmental
12(r.B2r.A2)
0
{α1ε}
additive‐by‐environmental
14[(r1B2r1A2)(r2B2r2A2)]
12r122
{δ1ε}
dominance‐by‐environmental 0 0
{μ0}
(intercept)
12+r.O214r122
12+r..214r122
12+14r122
12+v+r.O214r12214rGP2+12(rGO2rPO2)

If we neglect Var(h), then both equations (17) and (19) are polynomials in h¯ and p¯12. We can therefore choose “distance from the optimum” as the trait in equation (19). If we set μ=E(rHO2), and solve for the composite effects, the results are found in Table 1 (column “single environment”). Table 1 shows that Fisher's model predicts three nonzero composite effects. Their values reflect the biological distinctions discussed above. In particular, the additive effect, {α1}=12(r1O2r2O2), captures the benefits of carrying alleles from the fitter parent, whereas the dominance effect, {δ1}=12, captures the intrinsic benefits of heterozygosity. The pairwise epistatic effect, {α2}=14r1221, balances the intrinsic costs and benefits of admixture.

Rundle and Whitlock (2001) presented a useful extension of equation (19) for traits scored in two environments. Introducing an indicator variable, I, which is 0 for individuals scored in “environment A,” and 1 for individuals scored in “environment B,” and defining θE2I1, their model contains the additional terms:

μAB=μ+θEε+θEθSα1ε+θEθHδ1ε+ (20)

Fisher's geometric model is trivially extended in the same way, by adding a second environment, with a distinct optimum. This is illustrated in Figure 2B. Again, we can solve for the composite effects, and these are shown in Table 1 (column “two environments”). Results show that adding a second environment leaves the dominance and epistatic effects unchanged, confirming that they represent the intrinsic effects of heterozygosity and admixture. Of the remaining quantities, the additive effect, {α1}, is now averaged across environments (r1.2r2.212(r1A2r2A2+r1B2r2B2)), whereas the main environmental effect, {ε}, is simply the difference in fitness between environments, averaged across the parental lines (r.B2r.A212(r1B2r1A2+r2B2r2A2)).

Finally, the additive‐by‐environment interaction is

α1ε=14r1B2r1A2r2B2r2A2 (21)
=12r12rABρ* (22)

Here, 1ρ1 is the cosine similarity of the vector linking the parental phenotypes, and the vector linking the two optima (Fig. 2B): {α1ε} will be large when the difference between the phenotypes of P1 and P2 resembles the difference between optima A and B. Indeed, the predicted values of {α1}, {ε}, and {α1ε} are equivalent to the quantities described by Blanquart et al. (2013) for measuring local adaptation (see their eqs. 1 and 2), but applied to fitness values that have been transformed and scaled (eq. 6). As shown in Appendix 1 section A1.6 in the Supporting Information, the same framework is easily extendable to other sorts of environmental heterogeneity, namely patchy ecotones, and environmental gradients.

LOCAL ADAPTATION AND ECOLOGICAL ISOLATION

Although {α1ε} is a possible measure of local adaptation, it does not describe the extent of ecological isolation between the parental lines. For example, {α1ε} might be large, even if P1 were fitter than P2 in both habitats (Kawecki and Ebert 2004; Blanquart et al. 2013). However, we do have a measure of isolation in an important special case. If the two parental lines are well adapted to different local optima, then rAB2r122, and predictions depend on r122 alone. The results with local adaptation are shown in Table 1 (column “local adaptation”).

The effects of varying r122 are illustrated in Figure 3E–G. Each panel shows the expected distance from the optimum, comparing the two parental lines, the initial F1 cross, and the reciprocal backcrosses. The position of the optimum varies across the three panels, and matches (E): the phenotype of P1, (F): the phenotype of P2 and (G): the midparent (i.e., the mean of the P1 and P2 phenotypes). Colors show the effects of varying r122. When r122 is large (red lines), isolation in the parental habitats (Fig. 3E–F) is purely ecological, with P2 and P1 kept distinct solely by environment‐dependent selection against their divergent alleles. In the “intermediate habitat” (Fig. 3G), large r122 leads to hybrid advantage, with all crosses fitter than the parental types (Moore 1977; Yamaguchi and Otto 2020). When r122 becomes small (blue lines), results in all three habitats approach the same W‐shaped pattern of intrinsic isolation, where hybrids beyond the F1 are unfit in all environments (Rundle and Whitlock 2001). By changing the value of r122, we can interpolate between these two extremes.

Taken together, results in Figure 3 show that r122 is both a measure of the amount of meandering in the chain of fixed differences (Fig. 3A–D), and a measure of the relative strengths of ecological isolation versus intrinsic isolation (Fig. 3E–G).

HYBRID FITNESS ACROSS TIME AND SPACE

Figure 4 summarizes many of the results above, showing how hybrid fitness varies with genomic composition, environmental conditions, and the amount of divergence. Figure 4 compares analytical results to a single simulation run, which is described in full in Appendix 2 in the Supporting Information. In the simulation, two parental populations adapted to distinct optima in allopatry (shown in cartoon in Fig. 4A). This local adaptation involved fixing ∽50 substitutions, and after this, the populations continued to diverge via system drift. Hybridization between these populations was simulated in five distinct environments, whose optima are also shown in Figure 4A. Figure 4B–E shows the composite effects changing over the complete course of the divergence. In the environments to which the parents are adapted (Fig. 4B), the additive‐by‐environment interaction, {α1ε} is high in the initial stages of divergence, before declining over time. The same pattern appears, in all environments, with the epistatic effect, {α2} (Fig. 4B–E). This reflects their common dependence on r122 under local adaptation (Table 1). Initially, r122 is high, due to the adaptive phenotypic divergence of the parental lines; then it declines steadily, as stabilizing selection at the new optima is accompanied by genomic evolution via system drift.

Figure 4.

Figure 4

The outcomes of hybridization over time and space. (A) A cartoon of the divergence process that was simulated, with two populations adapting in allopatry to abruptly shifting optima, and then continuing to accumulate divergence via system drift. (B)–(E) change in the composite effects with increasing divergence (Table 1), as measured with respect to (B) both parental environments, or (C)–(E) other single environments. (F)–(O) results for simulated hybrids, plotting rHO2 on a reversed axis, such that fitter genotypes are higher. Points with error bars show the mean and 95% quantiles for 10,000 recombinant hybrids, generated for the reciprocal backcrosses, and the F2. The dark central point (labeled Fn) shows the mean of 10,000 homozygous hybrids, derived from automictic selfing among F1 gametes. Red and blue lines show analytical predictions. These use equation (17), with the measured values of r1O2 and r2O2, and the assumption that ρ=1 (for the “intermediate” environment) or ρ=0 (all other cases). Hybrids were scored in the (F)–(J) early stages of divergence (d=100; red lines), and (K)–(O) at later stages (d=2000; blue lines). Simulation procedure is described in Supporting Information Appendix 2, and used the following parameters: N=1000, Ns¯=10, 2NU=1, n=2, k=2, free recombination, and “bottom‐up” mutations.

The changes in composite effects are reflected in the results for simulated hybrids (Fig. 4F–O). Crosses were formed in the initial stages of divergence (Fig. 4F–J), and at a later stage (Fig. 4K–O). In addition to the standard crosses (also shown in Fig. 3E–G), results are reported for the F2, and an “Fn” cross, derived from automictic selfing among F1 gametes. As such, the three central points in each panel all show balanced hybrids (with h¯=1/2), but with maximally different levels of heterozygosity (F1: p¯12=1; F2: p¯12=12; and Fn: p¯12=0).

Results after 100 substitutions show the clear signature of ecological isolation in the parental habitats (Fig. 4F–G). In the environment to which P1 is adapted, hybrids tended to be fitter when they carry more P1 alleles, and vice versa (Rundle and Whitlock 2001). For the same reason, in an ecologically intermediate habitat, there was a clear signal of bounded hybrid advantage (Fig. 4H), as hybrids tended to have the favored intermediate phenotype (Moore 1977; Yamaguchi and Otto 2020). Hybrid advantage, at a lower level, also occurred in the ancestral habitat (Fig. 4I); but this had nothing to do with the habitat being ancestral, and the same patterns are observed in an entirely novel habitat, as long as it leads to similar values of r1O2 and r2O2 (Fig. 4J).

As the genomic divergence increased, and r122 decreased, the outcomes became more and more similar across the environments (Fig. 4K–O). After 2000 substitutions, hybrid fitnesses were already converging toward the characteristic pattern of intrinsic isolation (Fig. 4K–O), with the fixed cost of admixture and the fixed benefit of heterozygosity (eq. 18).

TWO EXTENSIONS

In this third and final part of the article, we highlight two ways in which the model gives misleading predictions, each resulting from a key simplifying assumption. We then show how these limitations might be overcome.

LATER CROSSES, AND THE BULMER EFFECT

The project of predicting hybrid fitness solely from the hybrid index and heterozygosity depends on heterospecific alleles appearing in random combinations. This assumption appears to be fairly robust to low recombination (see Supporting Information Appendix 2, Figure A2.9), but nonrandomness can arise for other reasons. Most importantly, selection on the earlier generation hybrids can induce nonrandom associations between alleles in their gametes. This can increase the fitness of later generation hybrids, but without changing allele frequencies in the population as a whole (Bulmer 1971; Walsh and Lynch 2018, chs. 16 & 24). For example, with random union of gametes, the distributions of h and p12 will often remain unchanged between the F2 and F3 generations, and so equations (17) and (19) make the same predictions for both. However, selection on the F2 parents can lead to very different levels of fitness.

To see this, let us consider the case of free recombination (c¯=1/2), and optimal parental lines (r.O2=0). With these assumptions, the variance in trait values (the zi) among F3 offspring is

Varzi,F3=12Varzi,F2+Varzi,F2selc¯=1/2,r.O2=0 (23)

where zi,F2 and zi,F2(sel) are the trait values for the total F2 population, and for the subset of selected parents (see Walsh and Lynch 2018, ch. 16, assuming complete heritability). In the extreme case, if only optimal F2 reproduce, then Var(zi,F2(sel))=0 and Var(zi,F3)=12Var(zi,F2). In general, the expected distance to the optimum of later generated hybrids can be written as

ErHOsel2=ErHO21βc¯=1/2,r.O2=0 (24)

where E(rHO2) is the prediction of equation (18), which neglects the effects of selection on earlier‐generation hybrids. The parameter 0β1/2 captures the effects of this selection, reaching its upper bound when only optimal individuals reproduce. This is illustrated in Figure 5A and B. We chose two simulation runs where populations diverged despite a fixed optimum, via system drift. If we generated an F3 cross using a random selection of F2 parents, then equation (18) applies well (see blue lines in Fig. 5). If we selected parents with a probability proportional to their fitness (eq. 3), the same results continued to apply for later crosses, but only when the fitness function was quadratic, that is, when we set k=2 in equation (3). In this case, there was very little inter‐individual variation in fitness. The results, shown in Figure 5A, imply that β0 with quadratic selection. Results with k=6 are shown in Figure 5B. Setting k=6 in equation (3) generates a “table‐like” fitness function (Fraïsse et al. 2016), equivalent to strong truncation selection, and this generates high variation in parental fitness. In this case, results were close to the lower bound of equation (24), such that β1/2 (see red lines in Fig. 5B). The same patterns continued unchanged for other late generation crosses, including the F4 and F5, and also applied to repeated backcrosses to the P1 line (Fig. 5A–B).

Figure 5.

Figure 5

Two extensions to the model, incorporating selection on early‐generation hybrids, and variable phenotypic dominance. Plots show the distance from the optimum, rHO2, on a reverse axis, so that higher points are fitter. After the initial F1 cross, we simulated either random union of gametes among the hybrids (F2–F5), or repeated backcrossing to parental line P1 (BC1–BC4). For the later crosses, we chose parents either wholly at random (black points and lines), or with a probability proportional to their fitness (asterisks and red points and lines). In each case, results for 10,000 simulated hybrids (mean and 95% quantiles), are compared to analytical predictions. Blue lines show predictions that ignore the effects of selection on earlier hybrids (eq. 24 with β=0); red lines show predictions assuming that only optimally‐fit parents reproduce (eq. 24 with β=1/2). (A) results with quadratic selection (eq. 3 with k=2) such that there is limited variation in parental fitness. (B) results with truncation‐like selection (eq. 3 with k=6), and high variance in parental fitness. (C) and (D) equivalent results, when populations were simulated with variable phenotypic dominance (such that heterozygous effect of each new mutation was the homozygous effect, multiplied by a uniformly distributed random variable). The clearest consequence is that the F1 are suboptimal, even when the parental lines are optimal. Here, predictions use equations (27) and (28) with the observed rGO2, and v=1/12, from the variance of a uniform distribution. All predictions assumed optimally‐fit parents (r.O2=0). Simulations are described in Supporting Information Appendix 2, and used the following parameters: N=1000, Ns¯=0.1, 2NU=1, n=2, c¯=0.5, “bottom‐up” mutations, stationary optima matching the ancestral state; hybrids were formed as soon as one of the diverging populations had fixed 1000 substitutions.

PHENOTYPIC DOMINANCE

The most implausible prediction of Fisher's model is embodied in equation (18). This equation predicts that, at very high levels of divergence, fully heterozygous hybrids will always be as fit as their parents (this is because, when p12=1 and h=1/2, the benefits of heterozygosity exactly cancel the costs of admixture; Barton 2001; Fraïsse et al. 2016; Schiffman and Ralph 2017).

This prediction is implausible because—with strictly biparental inheritance and expression—the initial F1 cross will be globally heterozygous (i.e., will carry one allele from each parent at all d divergent loci). Although many intrinsically isolated species do produce fit F1 (Wallace 1991; Price and Bouvier 2002; Fraïsse et al. 2016), F1 fitness tends to decline as the parents become very genetically divergent, even in environments where both parents are well adapted (Endler 1977; Bateson 1978; Waser 1993; Price and Bouvier 2002; Edmands 2002; Fraïsse et al. 2016). The model also struggles to explain a second widespread pattern in the F1: when some loci have uniparental inheritance or expression, the reciprocal F1 often have very different fitnesses, even if the parents are both well adapted (Bolnick and Near 2005; Turelli and Moyle 2007; Escobar et al. 2008; Brandvain et al. 2014; Sato et al. 2014; Fraïsse et al. 2016; Bouchemousse et al. 2016). Fisher's model can only account for these asymmetries if the globally heterozygous genotype is suboptimal (see Fraïsse et al. 2016 for details).

In this section, we show that these features of Fisher's model result from phenotypic additivity, and are improved by adding phenotypic dominance (Manna et al. 2011). To see this, let us replace equation (5) with

zi=zP1,i+jJhommij+jJhet12mij+δij (25)

where δij is the deviation from semi‐dominance on trait i, caused by introducing substitution j in heterozygous form. We now assume that the δij can be treated as the increments of a new, and independent Brownian bridge, linking the midparental value of trait i, to the trait value of the global heterozygote (see the phenotypes labeled P and G in Fig. 2C). (We note that this strong assumption does not allow for differences in the typical dominance relations of large‐ and small‐effect changes; Wright 1929; Manna et al. 2011; Fraïsse et al. 2016). In Appendix 1 section A1.4 in the Supporting Information, we show that these assumptions lead to

ErHO2h,p12V+M+Vδ+Mδ (26)

where V and M are the additive results from equations (11)–(14), and Vδ and Mδ are the new contributions from variable dominance. These new contributions are

Vδ=4vp12(1p12) (27)
Mδ=p12rGO2rPO2p121p12rPG2p12h12r2G2r1G2 (28)

(Supporting Information Appendix 1 section A1.4). In equation (27), a new parameter, v, describes the scaled variance of the δij. Equation (28) depends on several new distances, and these are illustrated in Figure 2C. The corresponding changes in the composite effects are listed in Table 1 (column “phenotypic dominance”). Table 1 shows that phenotypic dominance adds two new composite effects: {δ2} and {α1δ1}, and alters the value of a third, {δ1}, so that it is no longer a constant.

Equations (27) and (28) are both proportional to p12 and so they alter predictions only for heterozygous hybrids. The predictions are altered in two major ways. First, a nonzero value of {α1δ1} (which corresponds to the third term in eq. 28), now allows for “directional dominance.” For example, in Figure 2C, the global heterozygote, G, is much closer to the P2 phenotype than to the P1 phenotype (r2G2<r1G2), which implies that P2 alleles are dominant on average. This sort of asymmetry allows Fisher's model to account for “dominance drive” in hybrid zones, where alleles can spread due to their dominance relations alone (Mallet and Barton 1989; Barton 1992). Second, when the phenotype of the global heterozygote, G, differs from the midparental phenotype, P, the effects of heterozygosity are qualitatively altered. Under the additive model, these effects are intrinsic and always beneficial (see eq. 17). By contrast, with phenotypic dominance, the effects of heterozygosity become extrinsic, and so they can vary over time and space. Furthermore, at large divergences, heterozygosity will tend to become deleterious. This is because the global heterozygote—unlike the parental genotypes—may never be exposed to natural selection, and cannot, in any case, breed true. As such, its phenotype can continue to wander away from the optimum as divergence increases, even if effective stabilizing selection acts on the parental phenotypes. The result is that the distance rGO2, unlike rPO2, has no tendency to vanish as divergence increases.

In Appendix 2 in the Supporting Information, we report a full set of individual‐based simulations, incorporating variable dominance, which support the analytical results above. The most important consequence of variable dominance is illustrated in Figure 5C and D. Here, most results closely match those of the additive model, as shown in Figure 5A and B. The exception is the F1, which with dominance, is noticeably less fit than the parental lines (Fig. 5C and D).

Discussion

FISHER'S MODEL AS A FITNESS LANDSCAPE

Using fitness landscapes based on Fisher's geometric model, we have developed analytical predictions for the fitness of hybrids between divergent lines. These predictions allowed us explore several factors that can affect hybrid fitness. These factors are (i) the genotypic composition of the hybrids—parameterized by the hybrid index, h, and heterozygosity p12; (ii) the environmental conditions—parameterized by r1O2 and r2O2: the distance to the current optimum of the parental lines; and (iii) the divergence history of the populations—parameterized by r122: the distance between the parental phenotypes (Fig. 2A). We have also shown how these distances tend to change over the course of evolutionary divergence. Because our results apply to wide range of evolutionary and ecological scenarios, we can classify scenarios according to the values of a small number of geometric distances.

Of course, the simplicity of the results stems from the simplicity of the fitness landscape, and so we have to ask whether the model is overly simple. This question is not settled by pointing to the toy nature of the phenotypic model (optimizing selection on n quantitative traits). This model is best viewed as approximating more complex and realistic phenotypic models (Martin 2014; Schiffman and Ralph 2017; Fraïsse and Welch 2019); and these approximations can involve many‐to‐one mappings, so that, in principle, any n‐dimensional phenotype under Fisher's model, could correspond to multiple real‐world phenotypes. As such, the model has to be judged by its successes and failures in accounting for observed patterns in hybrid fitness (Fraïsse et al. 2016; Simon et al. 2018).

In cases where Fisher's model is inadequate, we have also shown how it can be extended. In particular, adding variable phenotypic dominance allows for a low fitness F1 between highly divergent, but equally fit parental lines (Fraïsse et al. 2016; Fig. 5C and D). This extension further supports previous claims that Fisher's model can incorporate other modeling approaches as special cases (Simon et al. 2018). For example, when the parental lines have high fitness, equation (26) gives identical predictions to a model of Dobzhansky‐Muller incompatibilities, with variable dominance relations (Turelli and Orr (2000); see eq. A37 of Simon et al. 2018).

ESTIMATING THE KEY QUANTITIES

By connecting Fisher's model to the quantitative genetics of line crosses (Hill 1982; Lynch 1991; Rundle and Whitlock 2001; Demuth and Wade 2005; Yamaguchi and Otto 2020), we have shown that the geometric distances are closely related to the composite effects. This implies that the distances can be estimated using measurements of fitness, or some component of fitness, taken from controlled crosses (Lynch and Walsh 1998; Lynch 1991; Rundle and Whitlock 2001; Simon et al. 2018). This claim comes with an important caveat. The results in Table 1 apply not to raw fitness values, but to values that have been suitably transformed and scaled. In our notation, they apply to rHO2 and not to w (see eqs. (3), (4), (5), (6)). Data transforms are an inherent part of quantitative genetics (Lynch and Walsh 1998, ch. 11), but there is also the need to estimate the scaling factor in equation (6). This extra parameter is relatively easy to estimate from a diverse collection of hybrids (see Simon et al. 2018), but not from a limited number of controlled crosses (Yamaguchi and Otto 2020). We ducked this issue in Figure 4, by estimating equation (7) directly from the simulated fixed effects (see Supporting Information Appendix 1 section A1.1). This is a real limitation, but there are many special cases where the distances can be estimated from fitness values that are transformed but unscaled (i.e., from the numerator of eq. 6). For example, with two locally adapted populations (Table 1), the distance r122 can be estimated from a ratio of composite effects:

r122^=4α1εα1ε2α2 (29)

so that the scaling factor cancels. Simon et al. (2018) give other, similar examples.

These methods of estimation all assume that hybrid fitness can be meaningfully predicted from the hybrid index and heterozygosity alone. However, we have also shown that this assumption breaks down when there are strong disequilibria between heterospecific alleles, as generated by selection on early generation hybrids (as opposed to selection during the divergence). These effects are weak in some parameter regimes (Fig. 5A), but observations of strong incompatibilities, involving small genomic regions, suggests that they might be important in nature (Barton 2001; Coyne and Orr 2004, ch. 8; Fraïsse et al. 2016). We have provided a simple solution, which applies with strong, truncation‐like selection (eq. 24; Fig. 5B). However, this approach might be difficult to apply to entire hybrid swarms, when some, but not all hybrids have strong selectively induced disequilibria (Jiggins and Mallet 2000; Allendorf et al. 2001; Vernesi et al. 2003; Simon et al. 2018). Even greater challenges will arise when selection changes allele frequencies (Walsh and Lynch 2018, chs. 16 & 24). In both cases, the distributions of h and p12 will not be sufficient to predict hybrid fitnesses.

THE PROCESS OF DIVERGENCE AND THE OUTCOME OF HYBRIDIZATION

Because it is a well‐studied model of evolutionary divergence, Fisher's model is especially useful for investigating the connections between the mode of divergence between the parental lines, and the outcome of hybridization between them.

One set of connections has been explored extensively in previous work. Compared to drift, positive selection will lead to divergence that is more rapid and more resistant to the swamping effects of gene flow, and tend to fix effects that are larger and more variable in size (Orr 1998b; Griswold 2006; Yeaman and Whitlock 2011; Rockman 2012; Matuszewski et al. 2014; Débarre et al. 2015; Matuszewski et al. 2015; Dittmar et al. 2016; Thompson et al. 2019; Yamaguchi and Otto 2020). And larger changes will often have a greater influence on hybrid fitness (Chevin et al. 2014; Fraïsse et al. 2016; Yamaguchi and Otto 2020). Together, these facts will tend to implicate natural selection, rather than drift, in any hybrid problems that appear early in the divergence process (Jiggins and Mallet 2000; Coyne and Orr 2004, ch. 11; Yamaguchi and Otto 2020). In the results presented here, these effects of selection are all incorporated into the scaling factor (eqs. 6 and 7), with positive selection tending to lead to larger values of d and larger values of the λi.

We have focused on a different set of connections between divergence and hybridization, and these are captured by the distance r122. This distance can be called “intrinsic,” because it is a property of the parental lines, which does not depend on the current position of the optimum. For this reason, r122 describes the possible outcomes of hybridization in a variety of environmental conditions (Figs. 3E–G and 4). For example, when parental lines are well adapted to different habitats, a high value of r122 implies that the isolation between the lines will be purely ecological. In an intermediate habitat, or wherever the parents are poorly adapted, lines with a high r122 are more likely to generate hybrid advantage beyond the F1. In this way, the value of r122 is closely related to the notion of “coadaptation” among the parental alleles (Wallace 1991). This is also why r122 determines {α2}: the additive‐by‐additive epistatic effect (Table 1; Lynch 1991).

As well as describing the outcomes of hybridization, r122 contains some information about the mode of divergence. This information is not about epistasis: the value of {α2} tells us nothing at all about the role of epistatic genetic variance during divergence (Lynch 1991; Welch 2004; Demuth and Wade 2005; Barton 2017; see Supporting Information Appendix 2). Instead, we have shown that r122 measures the exchangeability of divergent alleles, or—equivalently—the consistency in their “directions” in trait space, and compares this consistency to expectations under a random walk. As such, high values of r122 imply that line‐specific alleles are more similar to each other than expected (Fig. 3A–D). This definition shows that r122 is closely connected to standard tests for natural selection on quantitative traits, such as the QTL sign test (Orr 1998a), or the Qst‐Fst comparison (Spitze 1993; Whitlock and Guillaume 2009). Indeed, simulations confirm that adaptive divergence, especially in parapatry, is most likely to lead to high values of r122 (see Supporting Information Appendix 2, section A2.2).

Together, these results clarify what hybrid fitness can and cannot tell us about the mode of parental divergence. On one hand, some patterns of hybrid fitness—those associated with high r122—are reliable indicators of selectively driven divergence, and especially of local adaptation maintained in the face of gene flow. On the other hand, patterns associated with low r122 can arise in a variety of ways, including via adaptive divergence, especially in allopatry (e.g., Fig. 3D). (These limitations are closely related to the low power of the QTL sign test: Rice and Townsend 2012; Walsh and Lynch 2018, ch. 12). Furthermore, unless there is substantial gene flow, any signature of selection will be transient. Over time, the model predicts convergence to an identical pattern of intrinsic reproductive isolation, whatever the mode of divergence (Figs. 3 and 4).

AUTHOR CONTRIBUTIONS

JW, NB, and HS designed the project. JW, BDS, and HS performed the analysis. HS, DR, and JW wrote the simulation code and ran the simulations. JW, HS, and BDS wrote the manuscript, with contributions from all authors. HS and BDS equally contributed to this study.

DATA ARCHIVING

Simulation code and simulated data are deposited on Dryad (https://doi.org/10.5061/dryad.hmgqnk9f9).

Associate Editor: C. Bank

Handling Editor: D. Hall

Supporting information

Table A2.1: Population genetic parameter values used in the simulations.

Figure A2.1: Simulated evolution of a single population with a stationary phenotypic optimum, and a stable population size of N = 1000.

Figure A2.2: Simulated evolution of single populations under demographic and environmental change.

Figure A2.3: Illustrations of the 15 divergence scenarios that were simulated.

Figure A2.4: Simulated divergence between pairs of populations.

Figure A2.5: The robustness of the Brownian bridge approximation.

Figure A2.6: Evidence for deviations from multivariate normality in the fixed effects, that differentiate the simulated populations.

Figure A2.7: Simulated crosses in different environments.

Figure A2.8: Simulated crosses when parental populations diverged via drift, using divergence scenarios 11‐12 in Figure A2.4.

Figure A2.9: Simulated crosses after local adaptation, in populations with low recombination.

Figure A2.10: The Brownian bridge approximation with and without variable phenotypic dominance.

Figure A2.11: The Brownian bridge approximation for heterozygous hybrids, with variable phenotypic dominance.

Figure A2.12: The second term of the Brownian bridge approximation, with and without phenotypic dominance.

Figure A2.13: With variable phenotypic dominance, the phenotype of the global heterozygote, G, is usually further from the optimum than the midparental phenotype, P.

Figure A2.14: Simulated hybridization after local adaptation, with variable phenotypic dominance.

ACKNOWLEDGMENTS

We are very grateful to Jean‐Baptiste Grodwohl, Sally Otto, Aylwyn Scally, and Alexis Simon. We would also like to thank the editors Claudia Bank and David Hall, and two anonymous reviewers for helpful comments. Some of the simulations used the Montpellier Bioinformatics Biodiversity platform supported by the LabEx CeMEB, an ANR “Investissements d'avenir” program (ANR‐10‐LABX‐04‐01). BDS and HS are funded by the Wellcome Trust programme in Mathematical Genomics and Medicine (codes PFHZ/157 and PCGG.GAAB, respectively). HS also acknowledges financial support from the MEME programme in Evolutionary Biology.

LITERATURE CITED

  1. Allendorf, F. W. , Leary R. F., Spruell P., and Wenburg J. K.. 2001. The problems with hybrids: setting conservation guidelines. Trends Ecol. Evol. 16:613–622. [Google Scholar]
  2. Barton, N. H. 1990. Pleiotropic models of quantitative variation. Genetics 124:773–782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barton, N. H. 1992. On the spread of new gene combinations in the third phase of wright's shifting‐balance. Evolution 46:551–557. [DOI] [PubMed] [Google Scholar]
  4. Barton, N. H. 2017. How does epistasis influence the response to selection? Heredity 118:96–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Barton, N. H. 1989. Founder effect speciation Pp. 229–256 in Otte Daniel, and Endler John A., eds. Speciation and its consequences. Sinauer Associates, Sunderland, MA. [Google Scholar]
  6. Barton, N. H . 2001. The role of hybridization in evolution. Mol. Ecol. 10:551–568. [DOI] [PubMed] [Google Scholar]
  7. Barton, N. H. , Etheridge A. M., and Véber A.. 2017. The infinitesimal model: definition, derivation, and implications. Theor. Popul. Biol. 118:50–73. [DOI] [PubMed] [Google Scholar]
  8. Bateson, P . 1978. Sexual imprinting and optimal outbreeding. Nature 273:659–660. [DOI] [PubMed] [Google Scholar]
  9. Blanquart, F. , Kaltz O., Nuismer S. L., and Gandon S.. 2013. A practical guide to measuring local adaptation. Ecol. Lett. 16:1195–1205. [DOI] [PubMed] [Google Scholar]
  10. Bolnick, D. I. , and Near T. J.. 2005. Tempro of hybrid inviability in centrarchid fishes (Teleostei: Centrarchidae). Evolution 59:1754–1767. [PubMed] [Google Scholar]
  11. Bordenstein, S. , and Drapeau M.. 2001. Genotype‐by‐environment interaction and the Dobzhansky‐Muller model of postzygotic isolation. J. Evol. Biol. 14:490–501. [Google Scholar]
  12. Bouchemousse, S. , Lévêque L., Dubois G., and Viard F.. 2016. Co‐occurrence and reproductive synchrony do not ensure hybridization between an alien tunicate and its interfertile native congener. Evol. Ecol. 30:69–87. [Google Scholar]
  13. Brandvain, Y. , Pauly G. B., May M. R., and Turelli M.. 2014. Explaining Darwin's corollary to Haldane's rule: the role of mitonuclear interactions in asymmetric postzygotic isolation among toads. Genetics 197:743–747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bulmer, M. G. 1971. The effect of selection on genetic variability. Am. Nat. 105:201–211. [Google Scholar]
  15. Chevin, L.‐M. , Decorzent G., and Lenormand T.. 2014. Niche dimensionality and the genetics of ecological speciation. Evolution 68:1244–1256. [DOI] [PubMed] [Google Scholar]
  16. Cockerham, C. C. 1980. Random and fixed effects in plant genetics. Theor. Appl. Genet. 56:119–131. [DOI] [PubMed] [Google Scholar]
  17. Coyne, J. , and Orr H. A.. 2004. Speciation. Sinauer Associates, Sunderland, MA. [Google Scholar]
  18. Débarre, F. , Yeaman S., and Guillaume F.. 2015. Evolution of quantitative traits under a migration‐selection balance: when does skew matter? Am. Nat. 186:S37–S47. [DOI] [PubMed] [Google Scholar]
  19. Demuth, J. P. , and Wade M. J.. 2005. On the theoretical and empirical framework for studying genetic interactions within and among species. Am. Nat. 165:524–536. [DOI] [PubMed] [Google Scholar]
  20. Dittmar, E. L. , Oakley C. G., Conner J. K., Gould B. A., and Schemske D. W.. 2016. Factors influencing the effect size distribution of adaptive substitutions. Proc. Royal Soc. B 283:20153065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Dobzhansky, T. 1937. Genetics and the origin of species. Colombia University, New York, NY. [Google Scholar]
  22. Edmands, S . 2002. Does parental divergence predict reproductive compatibility? Trends Ecol. Evol. 17:520–527. [Google Scholar]
  23. Endler, J. 1977. Geographic variation, speciation, and clines. Princeton Univ. Press, Princeton, NJ. [PubMed] [Google Scholar]
  24. Escobar, J. S. , Nicot A., and David P.. 2008. The different sources of variation in inbreeding depression, heterosis and outbreeding depression in a metapopulation of Physa acuta. Genetics 180:1593–1608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Fisher, R. 1930. The genetical theory of natural selection. Oxford Univ. Press, Oxford, U.K. [Google Scholar]
  26. Fraïsse, C. , Gunnarsson P. A., Roze D., Bierne N., and Welch J. J.. 2016. The genetics of speciation: insights from Fisher's geometric model. Evolution 70:1450–1464. [DOI] [PubMed] [Google Scholar]
  27. Fraïsse, C. , and Welch J. J.. 2019. The distribution of epistasis on simple fitness landscapes. Biol. Lett. 15(20180881). [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Frankel, R. 1983. Heterosis: reappraisal of theory and practice. Springer‐Verlag, Berlin. [Google Scholar]
  29. Gavrilets, S. 2004. Fitness landscapes and the origin of species. Princeton Univ. Press, Princeton, NJ. [Google Scholar]
  30. Griswold, C. K. 2006. Gene flow's effect on the genetic architecture of a local adaptation and its consequences for QTL analyses. Heredity 96:445–453. [DOI] [PubMed] [Google Scholar]
  31. Hartl, D. L. , and Taubes C. H.. 1998. Towards a theory of evolutionary adaptation. Genetica 102/103:525–533. [PubMed] [Google Scholar]
  32. Hatfield, T. , and Schluter D.. 1999. Ecological speciation in sticklebacks: environment‐dependent hybrid fitness. Evolution 53:866–873. [DOI] [PubMed] [Google Scholar]
  33. Hill, W. G. 1982. Dominance and epistasis as components of heterosis. Zeitschrift für Tierzüchtung und Züchtungsbiologie 99:161–168. [Google Scholar]
  34. Hill, W. G. , Goddard M. E., and Visscher P. M.. 2008. Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet. 4:e1000008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Jiggins, C. D. , and Mallet J.. 2000. Bimodal hybrid zones and speciation. Trends Ecol. Evol. 15:250–255. [DOI] [PubMed] [Google Scholar]
  36. Kawecki, T. J. , and Ebert D.. 2004. Conceptual issues in local adaptation. Ecol. Lett. 7:1225–1241. [Google Scholar]
  37. Lynch, M . 1991. The genetic interpretation of inbreeding depression and outbreeding depression. Evolution 45:622–629. [DOI] [PubMed] [Google Scholar]
  38. Lynch, M. , and Walsh B.. 1998. Genetics and the analysis of quantitative traits. Sinauer Associates, Sunderland, MA. [Google Scholar]
  39. Mäki‐Tanila, A. , and Hill W. G.. 2014. Influence of gene interaction on complex trait variation with multilocus models. Genetics 198:355–367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Mallet, J. , and Barton N. H.. 1989. Inference from clines stabilized by frequency‐dependent selection. Genetics 122:967–976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Mani, G. , and Clarke B.. 1990. Mutational order: a major stochastic process in evolution. Proc. Royal Soc. London B 240:29–37. [DOI] [PubMed] [Google Scholar]
  42. Manna, F. , Martin G., and Lenormand T.. 2011. Fitness landscapes: an alternative theory for the dominance of mutation. Genetics 189:923–937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Martin, G . 2014. Fisher 's geometrical model emerges as a property of complex integrated phenotypic networks. Genetics 197:237–255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Martin, G. , and Lenormand T.. 2006. A general multivariate extension of Fisher's geometrical model and the distribution of mutation fitness effects across species. Evolution 60:893–907. [PubMed] [Google Scholar]
  45. Matuszewski, S. , Hermisson J., and Kopp M.. 2014. Fisher's geometric model with a moving optimum. Evolution 68:2571–2588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Matuszewski, S. , Hermisson J., and Kopp M. 2015. Catch me if you can: adaptation from standing genetic variation to a moving phenotypic optimum. Genetics 200:1255–1274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Moore, W. S. 1977. An evaluation of narrow hybrid zones in vertebrates. Quart. Rev. Biol. 52:263–277. [Google Scholar]
  48. Neal, N. P. 1935. The decrease in yielding capacity in advanced generations of hybrid corn. J. Am. Soc. Agron. 27:666–670. [Google Scholar]
  49. Orr, H. A. 1995. The population genetics of speciation: the evolution of hybrid incompatibilities. Genetics 139:180–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Orr, H. A. 1998a. Testing natural selection vs. genetic drift in phenotypic evolution using quantitative trait locus data. Genetics 149:2099–2104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Orr, H. A. 1998b. The population genetics of adaptation: the distribution of factors fixed during adaptive evolution. Evolution 52:935–949. [DOI] [PubMed] [Google Scholar]
  52. Peck, J. R. , Barreau G., and Heath S. C.. 1997. Imperfect genes, Fisherian mutation and the evolution of sex. Genetics 145:1171–1199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Price, T. D. , and Bouvier M. M.. 2002. The evolution of F1 postzygotic incompatibilities in birds. Evolution 56:2083–2089. [PubMed] [Google Scholar]
  54. Revuz, D. , and Yor M.. 1999. Continuous martingales and Brownian motion. Springer, Berlin, Heidelberg. [Google Scholar]
  55. Rice, D. P. , and Townsend J. P.. 2012. Resampling QTL effects in the QTL sign test leads to incongruous sensitivity to variance in effect size. G3: Genes, Genomes, Genetics 2:905–911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Rockman, M. V. 2012. The QTN program and the alleles that matter for evolution: all that's gold does not glitter. Evolution 66:1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Rosas, U. , Barton N. H., Copsey L., Reuille P. B. D., and Coen E.. 2010. Cryptic variation between species and the basis of hybrid performance. PLoS Biol. 8:e1000429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Rundle, H. D. , and Whitlock M. C.. 2001. A genetic interpretation of ecologically dependent isolation. Evolution 55:198–201. [DOI] [PubMed] [Google Scholar]
  59. Sato, A. , Shimeld S. M., and Bishop J. D. D.. 2014. Symmetrical reproductive compatibility of two species in the Ciona intestinalis (ascidiacea) species complex, a model for marine genomics and developmental biology. Zool. Sci. 31:369–374. [DOI] [PubMed] [Google Scholar]
  60. Schiffman, J. S. and Ralph P. L.. 2017. System drift and speciation. bioRxiv. 10.1101/231209. [DOI] [PMC free article] [PubMed]
  61. Simon, A. , Bierne N., and Welch J. J.. 2018. Coadapted genomes and selection on hybrids: Fisher's geometric model explains a variety of empirical patterns. Evol. Lett. 2:472–498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Spitze, K . 1993. Population structure in daphnia obtusa: quantitative genetic and allozymic variation. Genetics 135:367–374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Tenaillon, O. , Silander O. K., Uzan J.‐P., and Chao L.. 2007. Quantifying organismal complexity using a population genetic approach. PLoS ONE 2:217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Thompson, K. A. 2019. Evidence that pleiotropic alleles underlie adaptive divergence between natural populations. bioRxiv. 10.1101/718916. [DOI] [PubMed]
  65. Thompson, K. A. , Osmond M. M., and Schluter D.. 2019. Parallel genetic evolution and speciation from standing variation. Evol. Lett. 3:129–141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Turelli, M. , and Moyle L. C.. 2007. Asymmetric postmating isolation: Darwin's corollary to Haldane's rule. Genetics 176:1059–1088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Turelli, M. , and Orr H. A.. 2000. Dominance, epistasis and the genetics of postzygotic isolation. Genetics 154:1663–1679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Vernesi, C. , Crestanello B., Pecchioli E., Tartari D., Caramelli D., Hauffe H., and Bertorelle G.. 2003. The genetic impact of demographic decline and reintroduction in the wild boar (Sus scrofa): a microsatellite analysis. Mol. Ecol. 12:585–595. [DOI] [PubMed] [Google Scholar]
  69. Wallace, B . 1991. Coadaptation revisited. J. Hered. 82:89–95. [DOI] [PubMed] [Google Scholar]
  70. Walsh, B. , and Lynch M.. 2018. Evolution and selection of quantitative traits. Oxford Univ. Press, Oxford, U.K. [Google Scholar]
  71. Waser, N. M. 1993. Population structure, optimal outbreeding, and assortative mating in angiosperms Pp. 173–199, in Thornhill N. W., ed. The natural history of inbreeding and outbreeding. University of Chicago Press, Chicago, IL. [Google Scholar]
  72. Waxman, D. , and Welch J. J.. 2005. Fisher's microscope and Haldane's ellipse. Am. Nat. 166:447–457. [DOI] [PubMed] [Google Scholar]
  73. Whitlock, M. C. , and Guillaume F.. 2009. Testing for spatially divergent selection: comparing QST to FST. Genetics 183:1055–1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Wright, S. 1922. The effects of inbreeding and crossbreeding on guinea pigs: III. Crosses between highly inbred families. U.S. Department of Agriculture Bulletin 1121.
  75. Wright, S. 1929. The evolution of dominance. Am. Nat. 63:556–561. [Google Scholar]
  76. Yakimowski, S. B. , and Rieseberg L. H.. 2014. The role of homoploid hybridization in evolution: a century of studies in synthesizing genetics and ecology. Am. J. Bot. 101:1247–1258. [DOI] [PubMed] [Google Scholar]
  77. Yamaguchi, R. , and Otto S. P.. 2020. Insights from Fisher's geometric model on the likelihood of speciation under different histories of environmental change. Evolution 74:1603–1619. [DOI] [PubMed] [Google Scholar]
  78. Yeaman, S. , and Whitlock M. C.. 2011. The genetic architecture of adaptation under migration‐selection balance. Evolution 65:1897–1911. [DOI] [PubMed] [Google Scholar]
  79. Zeng, Z.‐B. , Houle D., and Cockerham C. C.. 1990. How informative is Wright's estimator of the number of genes affecting a quantitative character? Genetics 126:235–247. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table A2.1: Population genetic parameter values used in the simulations.

Figure A2.1: Simulated evolution of a single population with a stationary phenotypic optimum, and a stable population size of N = 1000.

Figure A2.2: Simulated evolution of single populations under demographic and environmental change.

Figure A2.3: Illustrations of the 15 divergence scenarios that were simulated.

Figure A2.4: Simulated divergence between pairs of populations.

Figure A2.5: The robustness of the Brownian bridge approximation.

Figure A2.6: Evidence for deviations from multivariate normality in the fixed effects, that differentiate the simulated populations.

Figure A2.7: Simulated crosses in different environments.

Figure A2.8: Simulated crosses when parental populations diverged via drift, using divergence scenarios 11‐12 in Figure A2.4.

Figure A2.9: Simulated crosses after local adaptation, in populations with low recombination.

Figure A2.10: The Brownian bridge approximation with and without variable phenotypic dominance.

Figure A2.11: The Brownian bridge approximation for heterozygous hybrids, with variable phenotypic dominance.

Figure A2.12: The second term of the Brownian bridge approximation, with and without phenotypic dominance.

Figure A2.13: With variable phenotypic dominance, the phenotype of the global heterozygote, G, is usually further from the optimum than the midparental phenotype, P.

Figure A2.14: Simulated hybridization after local adaptation, with variable phenotypic dominance.

Data Availability Statement

Simulation code and simulated data are deposited on Dryad (https://doi.org/10.5061/dryad.hmgqnk9f9).


Articles from Evolution; International Journal of Organic Evolution are provided here courtesy of Wiley

RESOURCES