Skip to main content
Evolution Letters logoLink to Evolution Letters
letter
. 2018 Aug 14;2(5):472–498. doi: 10.1002/evl3.66

Coadapted genomes and selection on hybrids: Fisher's geometric model explains a variety of empirical patterns

Alexis Simon 1,2, Nicolas Bierne 1,2, John J Welch 2
PMCID: PMC6145440  PMID: 30283696

Abstract

Natural selection plays a variety of roles in hybridization, speciation, and admixture. Most research has focused on two extreme cases: crosses between closely related inbred lines, where hybrids are fitter than their parents, or crosses between effectively isolated species, where hybrids suffer severe breakdown. But many natural populations must fall into intermediate regimes, with multiple types of gene interaction, and these are more difficult to study. Here, we develop a simple fitness landscape model, and show that it naturally interpolates between previous modeling approaches, which were designed for the extreme cases, and invoke either mildly deleterious recessives, or discrete hybrid incompatibilities. Our model yields several new predictions, which we test with genomic data from Mytilus mussels, and published data from plants (Zea, Populus, and Senecio) and animals (Mus, Teleogryllus, and Drosophila). The predictions are generally supported, and the model explains a number of surprising empirical patterns. Our approach enables novel and complementary uses of genome‐wide datasets, which do not depend on identifying outlier loci, or “speciation genes” with anomalous effects. Given its simplicity and flexibility, and its predictive successes with a wide range of data, the approach should be readily extendable to other outstanding questions in the study of hybridization.

Keywords: Dobzhansky–Muller incompatibilities, Haldane's Rule, heterozygosity, inbreeding, speciation genetics, sterility


Impact summary.

When individuals of different populations mate, the offspring will carry new combinations of alleles. Sometimes the new combinations bring fitness benefits (heterosis). This is often true, for example, when the parental lines are closely related and highly inbred: a fact that can be exploited in artificial breeding programs. Sometimes, the hybrids are much less fit than their parents (hybrid breakdown), suggesting that the populations may be distinct species. These different outcomes depend on the ways in which the alleles interact, and so comparing the outcomes of different types of hybridization can tell us a lot about gene interactions. We developed a general mathematical multigenic model that makes simple predictions for the fitness of hybrids of any type. We show that our model can account for a large number of empirical patterns, including some that were not well explained by alternative theories, developed for specific cases. We tested our predictions with new data from mussels, and published data from plants and animals, and obtained a good fit. Our framework suggests a new and complementary approach to analyzing genomic data from hybrids, which does not rely on identifying small regions of the genome with anomalous effects.

Hybridization and admixture lead to natural selection acting on alleles in different genetic backgrounds. Most classical studies of hybridization can be placed into one of two classes. The first, involves crosses between closely related inbred lines, where there is no coadaptation between the deleterious alleles that differentiate the lines, such that most hybrids are fitter than their parents. Wright's single‐locus theory of inbreeding was developed to interpret data of this kind (Wright 1922, 1977; Crow 1952; Hallauer et al. 2010). The second, involves crosses between effectively isolated species, where viable and fertile hybrids are very rare. Data of this kind are often analyzed by focusing on a small number of “speciation genes,” and interpreted using models of genetic incompatibilities (Dobzhansky 1937; Coyne and Orr 1989; Orr 1995; Gavrilets 2004; Welch 2004; Kalirad and Azevedo 2017).

The differences between these types of hybridization are clear, but it is equally clear that they are extremes of a continuum. Furthermore, the intermediate stages of this continuum are of particular interest, because they include phenomena such as incipient speciation, and occasional introgression between partially isolated populations (Waser 1993; Rosas et al. 2010; Mendez et al. 2012; Fraïsse et al. 2016a; Duranton et al. 2017). However, it can be difficult to model natural selection in this intermediate regime, not least because models require a large number of parameters when they include epistatic effects between many loci. The empirical study of hybrid genotypes in this regime is also difficult. The analysis of lab crosses often focuses on segregation distortions of large effect, and pairwise incompatibilities (Coyne and Orr 2004; Abbott et al. 2013). This QTL‐mapping framework can miss small effect mutations (Noor et al. 2001; Rockman 2012), which are difficult to identify individually, but whose cumulative effect can be substantial (Boyle et al. 2017).

One promising approach is to use Fisher's geometric model, which assigns fitness values to genotypes using a model of optimizing selection on quantitative traits (Fisher 1930; Orr 1998; Welch and Waxman 2003; Martin and Lenormand 2006). The tools of quantitative genetics have often been used to study hybridization (e.g., Melchinger 1987; Lynch 1991; Demuth and Wade 2005; Fitzpatrick 2008), but Fisher's model is fully additive at the level of phenotype, and the “traits” need not correspond in any simple way to standard quantitative traits (Rosas et al. 2010; Martin 2014; Schiffman and Ralph 2017). Instead, the goal is to generate a rugged fitness landscape, which includes a wide variety of mutational effect sizes and epistatic interactions on fitness, with a minimum of free parameters (Orr 1998; Blanquart et al. 2014; Barton 2017; Hwang et al. 2017).

Here, we build on previous studies (Mani and Clarke 1990; Barton 2001; Rosas et al. 2010; Chevin et al. 2014; Fraïsse et al. 2016b; Schiffman and Ralph 2017), and use Fisher's geometric model to study hybridization. We develop a simple Brownian bridge approximation, and show that it can naturally interpolate between previous modeling approaches (Wright 1922, 1977; Orr 1995; Gavrilets 2004), which are appropriate for the two extreme types of hybridization. We then show how the model can account for surprising empirical patterns that have been observed in both regimes (Wright 1977; Moehring 2011; Moran et al. 2017). Finally, we show that the model yields novel predictions for selection on heterozygosity, and test these predictions with a wide range of new and existing datasets (Table 1).

Table 1.

Datasets

Hybridization N Sex #Markers Cross
gX
Fitness measure Reference
Zea mays inbred lines Various Excess yield See Fig. 3
Mytilus edulis/galloprovincialis 132 ♀/♂ 43 F2 This study
144 ♀/♂ 43 BC1
Drosophila pseudoobscura/persimilis 1141/1036 2 X; 11 A BC1 0.37 Motile sperm: present/absent Noor et al. (2001)
Drosophila sechellia/simulans 200/200 8 X; 31 A BC1 0.17 Sperm quantity: 3‐pt. scale Macdonald and Goldstein (1999)
Drosophila santomea/yakuba 550/549 10 X; 22 A BC1 0.17 Motile sperm: 9‐pt. scale Moehring et al. (2006a, 2006b)
Teleogryllus oceanicus/commodus 248 Various 0.30 Egg and offspring number Moran et al. (2017)
Populus alba/tremula 137 ♀♂ ∼12,000 WH Survival after 4 years Christe et al. (2016)
Senecio aethnensis/chrysanthemifolius 64 966 F2 Necrotic/Healthy Chapman et al. (2016)
Mus musculus musculus/domesticus 185 14,220 WH 0.039 Testes weight Turner and Harr (2014)
305 202 (16 X; 182 A) F2 0.039 Prop. abnormal sperm White et al. (2011)

N, The number of individual hybrids, divided by backcross direction where appropriate; #Markers, The number of genetic markers used to estimate p 12 and h, sometimes divided into X‐linked and Autosomal. BC1, First backcross; WH, Wild hybrids. gX, weight given to X‐linked markers in the calculation of overall genome composition, calculated from the length of annotated coding sequence.

Models and Results

THE MODELS

Notation and basics

We will consider hybrids between two diploid populations, labeled P1 and P2, each of which is genetically uniform, but which differ from each other by d substitutions. Considering all possible combinations of the two homozygotes and the heterozygote, the populations could generate 3d distinct hybrid genotypes, and each might differ in their overall fitness, or in some component of fitness, such as fertility or viability. Below, we will focus on rank order differences between different classes of hybrid (e.g., high vs. low heterozygosity, males vs. females, F1 vs. F2 etc.). As such, following Turelli and Orr (2000), we describe hybrids using a “breakdown score,” S, which is larger for hybrids that have lower values of the fitness component of interest. Breakdown might relate to fitness via,

lnwSα/2 (1)

in which case, the parameter α adjusts the rate at which fitness declines with breakdown. This is related to the overall levels of fitness dominance and epistasis (Hinze and Lamkey 2003; Tenaillon et al. 2007; Fraïsse et al. 2016b), and so these can vary independently of other results. We now define the key quantity f, as the expected value of S for a given class of hybrid, scaled by the expected value for an unfit reference class.

fE(S)E(S) (2)

Here, E(S), is the expected breakdown score for the class of hybrid with the lowest fitness, on the condition that the parental genotypes are optimally fit. In this case, f can vary between zero, for the best possible class of hybrid, and one, for the worst possible class. We will also consider the case where the parental types are themselves suboptimal, with their own level of “breakdown,” denoted f P1 and f P2. In this case, when fP1,fP2>0, then the value of f for the hybrids can be greater than one (see below).

To define classes of hybrid, we also follow Turelli and Orr (2000). We pay particular attention to interpopulation heterozygosity, and define p 12 as the proportion of the divergent sites that carry one allele from each of the parental types. We also define p 1 and p 2 as the proportion of divergent sites that carry only alleles originating from P1 or P2, respectively. Since p1+p2+p12=1, it is convenient to introduce the hybrid index, h, which we define as the total proportion of divergent sites originating from P2 (Barton and Gale 1993).

hp2+12p12 (3)

This quantity is closely related to measures of ancestry (e.g., Falush et al. 2003; Fitzpatrick 2012; Christe et al. 2016), although it considers only divergent sites. We can now describe each individual genotype via its heterozygosity, p 12, and hybrid index, h. This means that all hybrids can be represented as points in a triangular space, as shown in Figure 1 A (Gompert and Buerkle 2010; Fitzpatrick 2012). Our goal in this article is to represent this space as a kind of fitness landscape, and show how f can vary with p 12 and h.

Figure 1.

Figure 1

Diploid hybrid genomes can be represented as points in a triangular space, indicating their hybrid index, h (the total proportion of divergent alleles that originate from parental line P2), and their interpopulation heterozygosity, p 12 (the proportion of divergent sites that carry one allele from each parental line). Panel (A) shows how the standard crosses are placed in this space, with biparental inheritance. The two parental lines P1 and P2 are at the lower corners. The initial F1 cross (P1 × P2) will be heterozygous at all divergent sites, and so be found at the top corner. Individuals from later crosses will vary due to segregation and recombination, but the F2 (F1 × F1) will tend to be found toward the center, while backcrosses (such as BC 1=F1×P1 or F1 × P2) will be found on the upper edges (see Fig. 3 for more details of these backcrosses). Panel (B) illustrates the selection on heterozygosity predicted by Fisher's geometric model, when the parental lines are well adapted (eq. 9). In backcrosses, heterozygosity will be under diversifying selection, favoring both extreme values. By contrast, in the F2, we predict directional selection toward higher heterozygosity. Panel (C) illustrates some complications introduced by heteromorphic sex chromosomes (see eq. 16 and Appendix 4). With XO sex determination, male F1 carry no heterozygosity on the X, which will tend to reduce their fitness, consistent with Haldane's Rule. For male backcrosses (F1 × P1), the selection acting on (autosomal) heterozygosity, will depend on the alleles carried on the X. When the X carries mostly P2 alleles, fitter individuals will be more heterozygous (darker gray arrow). When the X carries mostly P1 alleles, the fittest individuals will carry no heterozygosity (lighter gray arrow). gX is the proportion of the divergent sites found on the X, and is set at 37%, as we have estimated for Drosophila pseudoobscura.

Fisher's geometric model

Fisher's model is defined by n quantitative traits under selection toward an intermediate optimum (Fisher 1930). We will assume that fitness is always measured in a fixed environment, but we make no assumptions about how this optimum might have moved during the parental divergence (see Appendix 1). If the selection function is multivariate normal, including correlated selection, then we can rotate the axes and scale the trait values, to specify n new traits that are under independent selection of different strengths (Waxman and Welch 2005; Martin and Lenormand 2006; Martin 2014). Examples with n=2 traits are shown in Figure 2. We now define the breakdown score of a phenotype as

Si=1nλizi2 (4)

where, for trait i, zi is its deviation from the optimum and λi is the strength of selection. By assumption, all mutational changes act additively on each trait, but their effects on fitness can vary with the phenotype of the individual in which they appear, and this yields fitness dominance and epistasis (Appendix 1).

Figure 2.

Figure 2

Fisher's geometric model generates a flexible series of fitness landscapes for hybrid genotypes. The left‐hand panels show a schematic representation of the model, with n=2 traits, each under optimizing selection of differing strengths. (The orientation of the axes shows that the model allows for correlated selection, although this is ignored in the text, by rotating the axes). The right‐hand panels show how the expected breakdown score (eqs. 2 and 6), varies for hybrids of different types. Panels (A)–(C) show different assumptions about levels of parental maladaptation. Panel (A) shows a scenario where all of the parental divergence is maladaptive, with no tendency for their alleles to be coadapted. In this case, hybrid fitness increases with their heterozygosity, as predicted by Wright's single‐locus theory of inbreeding (eq. 8; Wright 1922, 1977). Panel (B) shows a scenario where the parental lines are optimal (or, at least, very well adapted compared to the worst class of hybrid that can be formed between them). In this case, the hybrid index is under symmetrical diversifying selection, and the form of selection on heterozygosity will vary for different cross types (eqs. 9, 13, and 15). This landscape can also be generated by a general model of genetic incompatibilities (see Appendix 2). Panel (C) shows a situation where only one of the parental lines is maladapted (eq. 10 with fP2=0.5).

The d substitutions that differentiate P1 and P2 can be considered as a chain of vectors, which connect the two parental phenotypes (Fig. S1). While the sizes and directions of these vectors will generally be unknown, in Appendix 1, we show that the chain can be approximated as a constrained random walk, or Brownian bridge (Revuz and Yor 1999, Ch. 1). This approximation relies on the fact that hybrid genomes contain the fixed alleles in effectively random combinations, and it gives accurate results for a wide range of assumptions about the divergence process (Figs. S2–S3), including adaptation of the parental lines to different environments (Appendix 1; Fig. S4).

Under the Brownian bridge approximation, the quantity E(S), that appears in equation (2), is given by

E(S)=di=1nλivi (5)

where vi is the expected variance contributed to trait i by a fixed mutation in heterozygous state and we have assumed that any maladaptation has accrued independently in the two parental lines (see Appendix 1 and Table S1). The key quantity f is given by

f=fP1+β1h(1β2h)p12 (6)

where

β142fP1β24fP1fP242fP1 (7)

(see Appendix 1 for full details). Two features of equation (6) are immediately notable. First, it does not depend on any of the model parameters. For example, the number of traits, n, could affect the accuracy of our approximation (since S will tend to approach normality as n increases); but it does not appear in equation (6), which depends solely on p 12, h and the levels of parental maladaptation. Second, for a given value of h, the fitness of hybrids will tend to increase with their heterozygosity, p 12. This prediction agrees with the widespread finding of heterosis (Crow 1952; Table S1 of Fraïsse et al. 2016b). Indeed, by rearranging equation (6) it is clear that hybrids will tend to be fitter than parental line P1 (such that f<fP1) on the condition that p12>β1h(1β2h).

It is also useful to consider equation (6) in a few special cases. First, let us assume that all of the divergence between the parents is maladaptive, without any tendency for coadaptation between their alleles. In this case, the parental phenotypes can be treated as unconstrained random walks away from the optimum. Each fixed mutation, in homozygous state, will contribute an expected 4vi to the variance on each trait, and so, from equations (4) and (5), we have E(fP1)=E(fP2)=2 (see also Appendix 1). With this high level of parental maladaptation, the expected breakdown score of the hybrids is

f=2p12,fP1,fP2=2 (8)

The fitness landscape implied by equation (8) is illustrated in Figure 2 A. With these extreme levels of parental maladaptation, hybrid breakdown depends only on the heterozygosity. Indeed, this result is closely related to Wright's (1922) single‐locus theory of inbreeding, which was developed to analyze crosses between closely related inbred lines, where all divergence between the parental lines comprises deleterious alleles. Wright's theory therefore appears as a special case of Fisher's geometric model, when parental alleles show no coadaptation (see below).

Now let us consider the other extreme case, where the parental alleles are completely coadapted, such that P1 and P2 both have optimal fitness, but realized by different combinations of alleles. In this case, we find

f=4h(1h)p12,fP1,fP2=0 (9)

The fitness landscape implied by equation (9) is illustrated in Figure 2 B. With well‐adapted parents, the hybrid index is under symmetrical diversifying selection.

As with equation (8), equation (9) can also be derived via an alternative route, using a model of speciation genetics. In particular, we show in Appendix 2 that equation (9) can be obtained from a general model of “Dobzhansky–Muller incompatibilities,” each involving a small number of loci (Orr 1995; Turelli and Orr 2000; Gavrilets 2004; Welch 2004; Fraïsse et al. 2016b). The agreement between the two models depends on a particular set of assumptions about the dominance of incompatibilities, namely (i) partial recessivity on average, and (ii) greater reduction in fitness when they contain homozygous alleles from both parental lines. However, we show in Appendix 2 that these assumptions are biologically realistic, and implied by a number of well‐established empirical patterns (Turelli and Orr 2000). The result is that equation (9) can be interpreted as a model of genetic incompatibilities, but without the large number of free parameters that these models can require.

Equation (9) is expected to apply to many cases of wild hybridization, because it should provide a good approximation even if the parental populations are not truly optimal. The only requirement is that they be much better adapted than the worst possible class of hybrid, such that fP1,fP21.

Nevertheless, the general form of equation (6) also allows us to explore intermediate cases. For example, if only P2 is maladapted, then we find

f=4h1114fP2hp12,fP1=0 (10)

In this case, as illustrated in Figure 2 C, selection on hybrid index is skewed toward the fitter parent. Below, we will show how all three of these special cases (Fig. 2 A–C) can be used in data analysis.

TESTING THE PREDICTIONS WITH BIPARENTAL INHERITANCE

Fitness differences between crosses

The simplest predictions from equation (6) assume standard biparental inheritance at all loci. In this case, the standard cross types can be easily located on our fitness landscapes. This is shown in Figure 1 A.

With well‐adapted parental lines (Fig. 2 B) hybrids of high fitness are expected only for the initial F1 cross (P1 × P2) and breakdown is predicted for later crosses, such as the first backcross (BC1: F1 × P1 or F1 × P2) or the F2 (F1 × F1). This pattern of F1 hybrid vigor (high F1 fitness) followed by breakdown in later crosses, has widespread empirical support (see references in Table S1 of Fraïsse et al. 2016b and Rosas et al. 2010).

With highly maladapted parents, by contrast, hybrids of all types can be fitter than their parents (see Fig. 2 A). Plentiful data of this kind come from highly inbred lines of Zea mays (Neal 1935; Wright 1977; Melchinger 1987; Hinze and Lamkey 2003; Hallauer et al. 2010). To analyze these data, a widely used proxy for fitness is the excess yield of a cross (i.e., its increase in yield relative to the parental lines), scaled by the excess yield of the F1. From equations (1)(2), and using a Taylor expansion, we find

wwPwF1wPfPα/2fα/2fPα/2fF1α/2 (11)
=p12,fP=2,α=2 (12)

where the subscript P refers to both parental lines, which are assumed to have similar yields. For later crosses, these values will vary between individuals within a cross, due to segregation and recombination, but in this section we ignore this variation, and assume that p 12 and h take their expected values for a given cross type (Fig. 1 A). A fuller treatment is outlined in Appendix 3.

Equation (12) confirms that Fisher's model can produce Wright's (1922) single‐locus predictions for inbreeding, but only when all divergence between the parental lines is deleterious (fP=2), and when increases in breakdown score act independently on log fitness (α=2). These single‐locus predictions have strong support in Zea mays (Neal 1935; Wright 1977; Melchinger 1987; Hinze and Lamkey 2003; Hallauer et al. 2010). For example, as shown in Figure 3 A, the excess yield of the F2 is often around 50%, which is equal to its expected heterozygosity (Wright 1977; Hallauer et al. 2010). It is also notable that the two outlying points (from Shehata and Dhawan 1975), are variety crosses, and not inbred lines in the strict sense.

Figure 3.

Figure 3

Data on hybrid vigor (i.e., increased yield), from crosses of inbred Zea mays. The original data were collated by Wright (1977; see his Table 2.23), and Hallauer et al. (2010; see their Table 9.13), including only data from single crosses, where there was hybrid vigor in the F2, and yield measured in quintals per hectare. Panel (A) plots the excess yield of the F2 (eq. 11). Results are shown for variety crosses (black triangles), as well as crosses of inbred lines in the strict sense (all other points). The dashed line shows the prediction of 0.5 from single‐locus theory (eq. 12). Panel (B) shows the four datasets collated by Wright (1977), which allow us to compare the F2 and various backcrosses. These crosses, chosen to yield different levels of heterozygosity, are the parental type (P1), the second backcross ( BC 2=(F1×P1)×P1); the first backcross ( BC 1=F1×P1), the F2 (F1 × F1), second backcross to the other parent ( BC 2=(F1×P1)×P2), and the F1 (P1 × P2) (The data of Stringfield (1950), shown as gray triangles, replace BC2* with an F2 between two distinct F1, involving three distinct strains, but the predictions are unchanged). The gray symbols for the four datasets correspond to those used in panel (A). The dotted line in panel (B) shows predictions from Fisher's model, assuming that the between‐strain divergence contains limited coadaptation. The prediction uses equations (13), (14), and (11), with fP=1.5, and α=3. The model predicts both the roughly linear increase in vigor with heterozygosity, and the systematic difference between BC1 and F2.

Despite this predictive success, Wright (1977) noted a pattern that single‐locus theory could not explain. In Wright's words: “the most consistent deviation from expectation [...] is the low yield of F2 in comparison with the first backcrosses” (Wright 1977, p. 39). Because E(p12)=12 for both crosses, this cannot be explained without epistasis, or coadaptation between the alleles in the parental lines. In fact, the pattern is predicted by Fisher's model, when there is a small amount of coadaptation, such that 1<fP<2. This yields a fitness surface with a small amount of curvature, which is intermediate between those shown in Figure 2 A and 2 B.

Figure 3 B plots the four relevant datasets collated by Wright, and compares the results to predictions from equations (6) and (11), with fP=1.5 and α=3. The model predicts the roughly linear increase in yield with mean heterozygosity, as with single locus theory, but also predicts the consistent difference between BC1 and the F2.

Selection on heterozygosity within crosses

In the previous section, we ignored between‐individual variation in heterozygosity within a given cross type. In this section, we show how natural selection is predicted to act on this heterozygosity.

First, let us consider the F2. In this case, we have 4h(1h)1 with relatively little variation between individuals (see Appendix 3 for details). Therefore, if both parents have similar levels of maladaptation, equation (6) is well approximated by

fF21+12fPp12 (13)

The prediction is that p 12 will be under directional selection in the F2, favoring individuals with higher heterozygosity. This is illustrated in Figure 1 B.

Now let us consider a backcross: F1 × P1. In this case, all of the homozygous alleles must come from P1, such that 4h(1h)=p12(2p12), and so equation (6) becomes

f BC =fP+(1fP)p12112fPp122 (14)
=p12(1p12),fP=0 (15)

So backcrosses will tend to be least fit when they have intermediate levels of heterozygosity. When parents are well adapted, heterozygosity is under symmetrical disruptive selection, favoring heterozygosities that are either higher or lower than p12=0.5 (eq. 15). This is illustrated in Figure 1 B .

To test these predictions, we used a new set of genetic data from hybrids of the mussel species: Mytilus edulis and Mytilus galloprovincialis (Bierne et al. 2002, 2006). These species fall at the high end of the continuum of divergence during which introgression persists among incipient species (Roux et al. 2016). We used experimentally bred F2 and BC1, with selection imposed implicitly, by the method of fertilization, and by our genotyping only individuals who survived to reproductive age (Bierne et al. 2002, 2006; see Methods and Fig. S6 for full details).

To estimate heterozygosity in each hybrid individual, we used the 43 markers that were heterozygous in all of the F1 hybrids used to make the subsequent crosses (see Fig. S6). We then asked whether the distribution of p 12 values in recombinant hybrids was symmetrically distributed around its Mendelian expectation of p12=0.5, or whether it was upwardly biased, as would be expected if directional selection were acting on heterozygosity. As shown in the first column of Table 2, Wilcoxon tests found that heterozygosities in surviving hybrids were significantly higher than expected, in both the F2 and backcross. These results may have been biased by the inclusion of individuals with missing data, because they showed higher heterozygosity (see Table S2). We therefore repeated the test with these individuals excluded. As shown in the second column of Table 2, results were little changed, although the bias toward high heterozygosities was now weaker in the backcross.

Table 2.

Tests for selection on heterozygosity in F2 and Backcrosses of Mytilus mussels

Markers: 43 43 33 33
Dataset: All No missing data No missing data Subsampled
Cross p^12 (N) P‐value p^12 (N) P‐value p^12 (N) P‐value p^12 (N) P‐value
F2 0.57 (132) 1.5×106 *** 0.56 (88) 6.4×104 *** 0.55 (91) 0.0033** 0.56 (56) 0.0020**
BC 0.57 (144) 1.3×105 *** 0.53 (94) 0.0282* 0.53 (105) 0.0569 0.52 (56) 0.5815

p^12, the estimated median heterozyosity; N, the number of hybrid individuals sampled; P‐value, result of a Wilcoxon test of the null hypothesis median p12=0.5 (* P<0.05; ** P<0.01; *** P<0.001). F2, random mating of F1 between M. galloprovincialis and M. edulis; BC, Backcross of the F1 to M. galloprovincialis. No missing data, all individuals with missing data for any of the markers were excluded; Subsampled, for the BC, any individual carrying a marker that was homozygous for the major allele carried by wild M. edulis populations was excluded; for the F2, we downsampled by sequencing order to equalize sample sizes.

Interpreting these results is complicated by the ongoing gene flow between M. edulis and M. galloprovincialis in nature (Fraïsse et al. 2016a). To test for this, we genotyped 129 pure‐species individuals, and repeated our analyses with a subset of 33 markers that were strongly differentiated between the pure species (see Methods, Fig. S6 and Table S3 for details). With these markers, there was evidence of elevated heterozygosity in the F2, but not the backcross (Table 2 third column). We also noticed that many of our backcross hybrids, though backcrossed to M. galloprovincialis, carried homozygous alleles that were typical of M. edulis. We therefore repeated our analysis after excluding these “F2‐like” backcrosses. Results, shown in the fourth column of Table 2, showed that the reduced BC dataset showed no tendency for elevated heterozygosity. However, the bias toward higher heterozygosities remained in the F2, even when we subsampled to equalize the sample sizes.

Despite the problems of interpretation due to introgression and shared variants, the results support the prediction of equations (13)(15): that directional selection on heterozygosity should act in the F2, but weakly or not at all in the backcross.

PREDICTIONS OF FISHER'S GEOMETRIC MODEL WITH SEX‐SPECIFIC INHERITANCE

Results above assumed exclusively biparental inheritance. But the predictions of Fisher's model are easily extended to include heteromorphic sex chromosomes, or loci with strictly uniparental inheritance, such as organelles (Coyne and Orr 1989; Turelli and Orr 2000; Turelli and Moyle 2007; Fraïsse et al. 2016b). In this section, we introduce the approach, and demonstrate its flexibility, while the full derivations are collected in Appendix 4. The basic method of introducing sex‐specific inheritance is to write p 12 and h as weighted sums of contributions from different types of locus. For example, to focus on the contribution of the X chromosome versus the autosomes, we can write

p12=gXp12,X+gA,p12,Ah=gXhX+gAhA (16)

where the subscripts denote the chromosome type (so that p12,A is the proportion of divergent sites on the autosomes that are heterozygous), and gX and gA are weightings, which sum to one (Turelli and Orr 2000).

We can now predict differences in hybrids of different sexes. For example, previous authors have shown that Fisher's model predicts Haldane's Rule: the tendency of sex‐specific breakdown to appear in the heterogametic sex (Haldane 1922; Turelli and Orr 2000; Barton 2001; Fraïsse et al. 2016b; Schiffman and Ralph 2017; see Fig. 1 C). Using equations (16) and (6), this basic insight can easily be extended to give formal conditions for Haldane's Rule, with different levels of parental maladaptation, and uniparental inheritance (see Appendix 4).

Sex chromosomes also complicate predictions about selection on heterozygosity within crosses. Consider, for example, male backcrosses in an XY species, where the X chromosome is large, the Y chromosome is small or degenerate, and the parental lines are reasonably fit. These backcross males will vary in their autosomal heterozygosity (p12,A), but the selection on this heterozygosity will vary with the alleles carried on the X. This is illustrated in Figure 1 C. If the backcross is made to P1, but hX is large (i.e. divergent X‐linked alleles come mainly from P2), then heterozygosity will tend to be under positive selection (see darker arrow in Fig. 1 C); but if hX is small, then heterozygosity will tend to be under negative selection (see lighter arrow in Fig. 1 C, Appendix 4 and Fig. S8 for details).

To test this prediction, we used data from Noor et al. (2001) (Fig. S7). These authors generated male backcrosses between Drosophila pseudoobscura and Drosophila persimilis. These species are ideal for our test, because the X carries 37% of the coding sequence, the Y is degenerate, and the hybrids can be fully sterile (see Table 1, Methods, and Noor et al. 2001). Each backcross male was scored for sterility, and multiple genetic markers, including two markers on the X (see Table 1). We characterized the X as low‐hX if both markers carried the P1 allele, and as high‐hX if both markers carried the P2 allele; we excluded intermediate individuals, for which we have no clear prediction. We then regressed sterility on heterozygosity, p12,A, but allowed the intercept and the slope to vary with hX. Results, shown in Table 3, show that both hX and p12,A are important predictors of sterility in these backcrosses, and that their effects interact. With low hX, sterility is generally rarer, but positively correlated with p12,A. With high hX, by contrast, sterility is more common, and negatively correlated with p12,A. The predictions of Fisher's model are therefore supported, in both cross directions.

Table 3.

Regressions of male sterility on autosomal heterospecificity in Drosophila backcrosses

Backcross to N Model intercept (low hX) intercept (high hX) Best‐fit coefficients for p12,A AIC
D. persimilis 577 two intercepts −1.071 1.797 596.79
two intercepts + one slope −2.165 0.764 2.147 580.41
two intercepts + two slopes −3.033 2.871 3.746 (low‐hX); −1.973 (high‐hX) 558.91
D. pseudoobscura 610 two intercepts −0.197 2.962 603.53
two intercepts + one slope −2.031 1.219 3.620 558.18
two intercepts + two slopes −2.485 4.034 4.505 (low‐hX); −1.876 (high‐hX) 545.87

AIC, Akaike Information Criterion; preferred model shown in bold.

The interaction between hX and p12,A may also help to explain a broader set of patterns observed by Moehring (2011) in multiple datasets of Drosophila backcrosses (Macdonald and Goldstein 1999; Noor et al. 2001; Moehring et al. 2006a, 2006b and see Table 1). Moehring found that male fertility correlated strongly and negatively with hX, while correlations with p12,A were weak and inconsistent. This pattern was not consistent with predictions of a simple model of X‐autosome incompatibilities (Moehring 2011), but it is consistent with Fisher's geometric model (see Fig. 1 C, Fig. 2 B, and Appendix 4).

Female backcrosses in Teleogryllus

Fisher's model can help to explain more complex patterns. To illustrate this, we will consider the data of Moran et al. (2017) from the field crickets Teleogryllus oceanicus and T. commodus. These species have XO sex determination, and a large X chromosome (gX0.3, from Moran et al. 2017). They are also a rare exception to Haldane's Rule, with F1 sterility appearing solely in XX females (Hogan and Fontana 1973). Moran et al. (2017) hypothesized that female sterility might be caused by negative interactions between heterospecific alleles on the X, which appear together in F1, but not in F1.

To test this hypothesis, they generated backcross females with identical recombinant autosomes, but different levels of heterozygosity on the X, p12,X. The hypothesis of X–X incompatibilities predicts that fertility will decrease with p12,X, but this was not observed. This is shown in Figure 4 A.

Figure 4.

Figure 4

Data on female fertility, from crosses of the field crickets Teleogryllus oceanicus (P1), and T. commodus (P2), from Moran et al. (2017), compared to theoretical predictions from Fisher's geometric model. Plotting styles denote the level of X‐linked heterozygosity: high (circles and darker shading); intermediate (triangles and lighter shading) or low (squares and no shading). To plot the data (column A), we used the mean number of offspring per pair (upper panel), or mean number of eggs per pair (lower panel), each normalized by the value for P1 (see the supplementary information of Moran et al. 2017 for full details). The theoretical predictions (column B) are listed in Table S4. In the upper panel, we assume gX=0.3, as estimated from the chromosome sizes, and complete silencing of the paternal X (such that π=1 in Table S4). In the lower panel, we assume gX=0.7, and incomplete silencing of the paternal X (π=0.8), to improve fit to the egg data. While predictions apply to the rank order of fitnesses, to aid visualization, we plot w=e6f2 (see eq. 1), and set the parameter f P2 via 0.5=e6fP2, to reflect the lower fertility of this species under lab conditions.

To try and explain the patterns that were observed, let us note two clear asymmetries in the data. First, there are strong differences between the fitness levels of the two parental species in lab conditions, with T. commodus (labeled P2 in Fig. 4) producing around half the eggs and offspring of T. oceanicus (P1 in Fig. 4). This suggests that equation 10 might apply to these data. Figure 4 shows a second asymmetry in the data: the reciprocal F1 (female–male vs. male–female) have very different fitness (Turelli and Moyle 2007; Fraïsse et al. 2016b). Because the X and autosomes will be identical for both cross directions, this implies some sort of parent‐of‐origin effect on the phenotype. This could be either uniparental inheritance, or selective silencing (Turelli and Moyle 2007; Fraïsse et al. 2016b). One possibility is the speculation of Hoy and collaborators (see e.g., Hoy et al. 1977; Butlin and Ritchie 1989; Dr. Peter Moran pers. comm.), that dosage compensation in Teleogryllus involves silencing of the paternal X.

We can now use the foregoing assumptions to predict the levels of breakdown for each of the crosses produced by Moran et al. (2017). The predictions for each cross are listed in Table S4, and plotted in the upper panel of Figure 4 B. This simple model explains several striking aspects of the Teleogryllus data.

Adjusting the parameter values can further improve the fit. For example, if we increase gX (as would be case if divergent sites affecting female fecundity were clustered on the X), and assume that paternal X silencing is incomplete (affecting 80% of the divergent sites), then the results, shown in the lower panel of Figure 4 B, agree well with the data on Teleogryllus egg number, as shown in the lower‐left‐hand panel (only the high fitness of P2 × F112 is poorly predicted).

Further adjustments are possible, but these soon become ad hoc, at least without further knowledge of the true nature of parent‐of‐origin effects in Teleogryllus. The important point here is that Fisher's geometric model explains several key features of these hybrid data, while using only a single parameter derived from the data themselves; and even this parameter (f P2), was estimated from the parental control lines.

ESTIMATING THE FITNESS SURFACE

Across a diverse collection of hybrids, equation (6) predicts that the hybrid index will be under disruptive selection, and heterozygosity under directional selection. This prediction can be tested with datasets containing estimates of fitness, h and p 12 for many hybrid individuals. Exactly such an analysis was presented by Christe et al. (2016), for families of wild hybrids from the forest trees, Populus alba and P. tremula (Lindtke et al. 2012, 2014; Christe et al. 2016). These authors scored survival over four years in a common‐garden environment, and fit a generalized linear model to their binary data (binary logistic regression, with “family” as a random effect), and predictors including linear and quadratic terms in p 12 and h. Model selection favored a four‐term model, with terms in p 12, h, and h 2 (see Table S6, and Supplementary information of Christe et al. 2016 for full details). For comparison with our theoretical predictions, we can write their best fit model in the following form:

y=const+β0β1h1β2hp12 (17)

where y is the fitted value for hybrid breakdown. In its general form, equation (6) should hold and Fisher's model predicts that β0>0, 0β14, and 12β22. Additionally if parental maladaptation is not highly asymmetrical then it predicts β21.

The best‐fit model of Christe et al. (2016) corresponds to β^0=2.963, β^1=2.777 and β^2=0.934, which supports the predictions of directional selection toward higher heterozygosity, and near‐symmetrical diversifying selection on the hybrid index.

To obtain confidence intervals on these parameters, we fit equation (17) to the raw data of Christe et al. (2016). We also searched for other data sets, from which we could estimate the hybrid fitness surface. After applying some quality controls (see Methods and Table S2), we identified one other dataset of wild hybrids, from the mouse subspecies Mus musculus musculus/domesticus, where male testes size was the proxy for fertility (Turner and Harr 2014). We also found four datasets of controlled crosses: F2 from the same mouse subspecies (White et al. 2011), and the ragworts Senecio aethnensis and S. chrysanthemifolius (Chapman et al. 2016); and the Drosophila backcrosses discussed above (Macdonald and Goldstein 1999; Moehring et al. 2006a, 2006b). Unlike the data from wild hybrids, these single‐cross datasets leave large regions of the fitness surface unsampled (see Fig. 1); nevertheless, they each contain enough variation in h and p 12 for a meaningful estimation. Details of all six datasets are shown in Table 1, and they are plotted in Figures S9–S11.

Figure 5 shows a summary of the estimated parameters, and full results are reported in Tables S5 and S6, and Figures S9–S11. Taken together, the results show good support for the predictions of equation (6). For all six datasets there was evidence of significant positive selection on heterozygosity (β^0>0 was preferred in all cases). Furthermore, for all six datasets, we inferred diversifying selection acting on the hybrid index. Estimates of β2, shown in the upper panel of Figure 5, show that this selection was near‐symmetrical in all cases, such that β^21. The poorest fit to the predictions was found for the Drosophila backcrosses, where estimates of β1 were significantly greater than the predicted upper bound of β1=4 (Fig. 5 lower panel). But these datasets were least suited to our purpose, because estimates of h and p 12 depend strongly on our rough estimate of the relative contributions of the X and autosomes (see Methods), and because they lack F2‐like genotypes, from the center of the fitness surface (Fig. 1 A; Fig. S11). By contrast, results for the Mus musculus F2 (White et al. 2011), are remarkably close to the predictions of equation (9) (Fig. 5; Fig. S10).

Figure 5.

Figure 5

Best fit parameter estimates for the GLM of equation (17), with fitness and genomic data from six sets of hybrids (see Table 1 for details). The upper panel shows estimates of the coefficient β2 that determines the form of selection acting on the hybrid index, h. Estimates of β2=1 are consistent with symmetrical diversifying selection. The lower panel show estimates of the coefficient β1 that determine the relative strength of selection acting on the hybrid index. Estimates of β1=4 are predicted when the parental types are well adapted (eq. 9), while estimates 0<β1<4 are predicted when the parental types are maladapted (eq. 6). Confidence intervals are defined as values that reduce the AIC by 2 units. These measures of uncertainty were not obtained for the F2 data, where variation in the hybrid index contributed little to the model fit, as predicted by equation (13). Full details of the model fitting are found in the Methods and Tables S5 and S6.

Two other features of the results deserve comment. First, for the two F2 datasets, it was not possible to provide meaningful confidence intervals for β1 and β2. This is because, for these two datasets, the terms in h and h 2 did not make a significant contribution to model fit, and so the preferred model contained only selection on p 12 (see Table S6). This is consistent with our earlier prediction of equation (13), and stems from the low variation in 4h(1h) among F2 hybrids (see Appendix 3 and Figs. S9 and   S10).

Second, for two of the datasets, Populus and Senecio, the estimates of β1 are substantially lower than 4 (Fig. 5; Fig. S9). This is suggestive of parental maladaptation, creating true heterosis in the hybrids (see eq. 6). Consistent with this inference, there is independent evidence of parental load and F1 hybrid vigor in both species pairs (Populus: Caseys et al. 2015; Christe et al. 2017; Senecio: Abbott and Brennan 2014).

Discussion

In this article, we have used Fisher's geometric model to develop predictions for the relative fitness of any class of hybrid. The modeling approach is simple, with few free parameters, and it generates a wide range of testable predictions. We have tested some of these predictions with new and published datasets (Table 1), and the major predictions of the model are well supported.

We emphasize that our approach is designed for coarse‐grained patterns in the data, and typical outcomes of hybridization, without considering the particular set of substitutions that differentiate the parental lines, or the particular combination of alleles in an individual hybrid. The limitations of such an approach are seen in the low r 2 values associated with our model fitting (Table S5). Nevertheless, our approach should enable novel and complementary uses of genomic datasets, which do not depend on identifying individual loci with anomalous effects. In this approach, all alleles, and not just a handful of “speciation genes” might contribute to hybrid fitness.

A second goal of the present work was to show how Fisher's model can interpolate between previous modeling approaches, namely the classical theory of inbreeding (Wright 1922; Crow 1952), and models of genetic incompatibilities, each involving a small number of loci (Dobzhansky 1937; Orr 1995; Gavrilets 2004; Welch 2004). We have also shown that Fisher's model can account for empirical patterns that each approach has struggled to explain (Wright 1977; Moehring 2011; Moran et al. 2017).

Though we have stressed their similarities, we should also stress that Fisher's model and the model of incompatibilities remain different in two ways. First, as shown in Appendix 2, the two models agree only when the dominance relations of incompatibilities take a particular set of values (eq. A36)—albeit a biologically realistic set (Appendix 2; Turelli and Orr 2000). Second, even when predictions are identical for the quantity f (eq. 2), the two approaches still make different predictions for other kinds of data, which were not considered in the present work. The most important difference is the dependency of log fitness on d, the genomic divergence between the species. Under Fisher's geometric model, the log fitness of hybrids declines with dα/2 (eqs. 12 and 56). By contrast, with the simplest models of incompatibilities, log fitness declines with dα/2 where ℓ is the number of loci involved in each incompatibility (Orr 1995; Welch 2004; Appendix 2, eq. A22). This is due to a snowball effect, where the number of incompatibilities grows explosively with d. This is a genuine difference between the modeling approaches, although truly discriminatory tests may be difficult. For example, it may not always be possible to distinguish between a snowballing model with a low value of α (equivalent to strong positive epistasis between incompatibilities), or a model where α is higher, but where the number of “incompatibilities” does not snowball, because they appear and disappear as the genetic background changes (Welch 2004; Fraïsse et al. 2016b; Guerrero et al. 2017; Kalirad and Azevedo 2017). Furthermore, Fisher's model can generate an “apparent snowball effect,” when introgressed segments contain multiple divergent sites (Fraïsse et al. 2016b), and this is true of the strongest empirical demonstrations of the effect (Matute et al. 2010; Moyle and Nakazato 2010; Wang et al. 2015).

Finally, given the simplicity and flexibility of the modeling approach explored here, and its predictive successes with a range of data, it should be readily extendable to address other outstanding questions in the study of hybridization. These include the putative role of hybridization in adaptive evolution (e.g., Mendez et al. 2012; Fraïsse et al. 2016b, 2016a; Duranton et al. 2017), the effects of recombination in shaping patterns of divergence (Aeschbacher et al. 2017; Schumer et al. 2018), the analysis of clines (Barton and Gale 1993; Macholán et al. 2011; Baird et al. 2012), and the roles of intrinsic versus extrinsic isolation (Chevin et al. 2014, Fig. S4). Given its ability to interpolate between models of different and extreme kinds, it should also be particularly useful for understanding hybridization in intermediate regimes, where parental genomes are characterized by both maladaptation and allelic coadaptation, or where the architecture of isolation involves many genes of small or moderate effect. Data—including those analyzed here—suggest that such architectures might be quite common (Davis and Wu 1996; Maside and Naveira 1996; Rosas et al. 2010; Morán and Fontdevila 2014; Baird 2017; Buerkle 2017; Boyle et al. 2017).

Methods

Mytilus DATA

Conserved tissues from the mussel species, Mytilus edulis and Mytilus galloprovincialis, and their hybrids, were retained from the work of Bierne et al. (2002, 2006). As reported in those studies, M. edulis from the North of France were crossed with M. galloprovincialis from the French Mediterranean coast to produce F1 hybrids (five males and one female; Bierne et al. 2002). The F1 were then used to produce an F2, and sex‐reciprocal backcrosses to M. galloprovincialis (which we denote here as BC12 and BC21; Bierne et al. 2006). In particular, oocytes from the F1 female were fertilized by the pooled sperm of the five F1 males producing F2 individuals, from which 132 individuals were sampled; oocytes from the F1 female were fertilized by pooled sperm of five M. galloprovincialis males to produce BC12, from which 72 individuals were sampled; and five M. galloprovincialis females were fertilized by pooled sperm from the five F1 males, producing BC21, from which 72 individuals were sampled. In addition to these hybrids, we also genotyped 129 individuals from “reference” populations of the two species, found in regions with relatively little contemporary introgression. In particular, we sampled M. galloprovincialis from Thau in the Mediterranean Sea; and sampled M. edulis from four locations in the North Sea and English Channel (The Netherlands, Saint‐Jouin, Villerville and Réville). Full details of these reference populations are found in Table S3.

In each case, gill tissues were conserved in ethanol at −20°C. DNA was extracted using a NucleaMag 96 Tissue kit (Macherey‐Nagel) and a KingFisher™ Flex (ThermoFisher Scientific). We then genotyped all samples for 98 Mytilus markers that were designed from the data of Fraïsse et al. (2016a). The flanking sequences of the 98 SNPs are provided in Table S7. Genotyping was subcontracted to LGC‐genomics and performed with the KASP™ chemistry (Kompetitive Allele Specific PCR, Semagn et al. 2014). Results are shown in Figure S6. Many of the 98 markers are not diagnostic for M. edulis and M. galloprovincialis, and so we retained only the 43 that were scored as heterozygous in all six of the F1 hybrids. To obtain a reduced set of strongly diagnostic markers, we measured sample allele frequencies in our pure species M. edulis and M. galloprovincialis samples (pooling M. edulis individuals across the four sampling locations; Table S3), and retained only markers for which the absolute difference in allele frequencies between species was > 90%. This yielded the set of 33 markers used for the right‐hand columns in Table 2. The “subsampled” data shown in the fourth column of Table 2, excluded any BC hybrid who carried the major allele typical of M. edulis in homozygous form. This yielded 56 BC hybrids. Because sequencing order was haphazard, to equalize the sample sizes, we retained the first 56 F2 to be sequenced.

COLLATION OF PUBLISHED DATA

We searched the literature for published datasets combining measures of individual hybrid fitness, with genomic data that could be used to estimate p 12 and h. We rejected any datasets where the measure of fecundity or fertility took an extreme low value for one of the pure species, suggesting that it is not a good proxy for fitness (e.g., Orgogozo et al. 2006), data where the fitness proxy correlated strongly with a measure of genetic abnormality such as aneuploidy (Xu and He 2011), or data where the states of many markers could not be unambiguously assigned, for example, due to shared variation. Before estimating the fitness surface, we also excluded any dataset where there was a highly significant rank correlation between the proportion of missing data in an individual, and either their heterozygosity, or fitness. For this reason, we did not proceed with reanalyses of the excellent datasets of Li et al. (2011), or Routtu et al. (2014) (see Table S2 for full details). For our reanalysis of the Teleogryllus data of Moran et al. (2017), we did not consider data from the second backcross, to avoid complications due to selection on heterozygosity during the first backcross, and because of an anomalous hatching rate in the T. oceanicus controls. For our reanalysis of the Mus musculus F2 (White et al. 2011), we used a conservative subset of these data; we excluded any individual where any X‐linked marker was scored as heterozygous (indicative of sequencing errors in heterogametic males; White et al. 2011), and controlled for variation in the uniparentally inherited markers, by retaining only individuals carrying M. m. domesticus mitochondria, and the M. m. musculus Y. However, results were little changed when we used all 304 individuals with fertility data (Table S6). Results were also unaffected when we used alternative proxies for fitness (Table S6). For the wild Mus musculus hybrids data (Turner and Harr 2014), fixed markers and their orientation between species were determined using previously published data of Staubach et al. (2012) and Yang et al. (2009). See details in the Dryad data package.

ESTIMATION OF gX FROM ANNOTATED GENOMES

To fit the generalized linear models (GLM) to data from species with XY sex determination (Table 1), we needed to account for the different marker densities on the X and autosomes. Accordingly, we estimated the overall values of p 12 and h from equation (16). We estimated the weightings, gX and gA, from the total length of coding sequences associated with each chromosome type, ignoring the small contributions from the Y and mitochondria. In each case, we obtained the longest protein product for each unique gene, and then summed their lengths, using a custom R script. The gX values, shown in Table 1, were calculated as the total length of X‐linked CDS divided by the total CDS length. For Mus musculus, we used the reference genome assembly “GRCm38.p5.” For Drosophila simulans, we used the assembly “GCA_000754195.3 ASM75419v2,” and for Drosophila yakuba “GCA_000005975.1 dyak_caf1.” For Drosophila pseudoobscura, the current annotation was downloaded from FlyBase release 3.04 (Gramates et al. 2017). The .gtf file was then sorted and merged (combining overlapping coding sequences on each chromosome) using BEDTools (Quinlan and Hall 2010). Coding sequence lengths were calculated and summed over each chromosome, using custom awk commands.

GLM METHODS

The linear model results shown in Table 3, Figure 5, Tables S5 and S6, and Figures S9–S11, were all fit in R v. 3.3.2 (R Core Team 2016). For datasets with quantitative fitness measures (White et al. 2011; Turner and Harr 2014; Fig. S10) we used the standard general linear models, with Gaussian errors, and chose data transformations to reduce heteroscedasticity. For binary fitness data (Noor et al. 2001; Christe et al. 2016; Chapman et al. 2016; Table 3; Fig. S9), we used binomial regression with a logit link implemented in the glm function; and with ordinal fitness data (Macdonald and Goldstein 1999; Moehring et al. 2006b; Fig. S11) we used proportional odds logistic regression (Agresti 2003), implemented in the polr function. In these cases, the P‐values shown in Table S6 were calculated by comparing the t‐value to the upper tail of normal distribution, as in a Wald test. For the non‐Gaussian models, we also report McFadden's pseudo‐r 2, defined as one minus the ratio of log likelihoods for the fitted and null models (McFadden 1974).

DATA ACCESSIBILITY

The newly generated data on Mytilus and collated data are available as a Dryad package (doi:xxx).

Associate Editor: Z. Gompert

Supporting information

Figure S1: A schematic representation of a phenotype evolving over time in two populations, labeled P1 and P2, starting from their most recent common ancestor (MRCA).

Figure S2: The breakdown associated with haploid hybrid genotypes, under simple models of phenotypic divergence.

Figure S3: The breakdown associated with haploid hybrid genotypes, under explicit population genetic simulations of phenotypic divergence.

Figure S4: The breakdown associated with haploid hybrid genotypes, under explicit population genetic simulations of phenotypic divergence, in scenarios involving discrete jumps in the optimal value for one of n = 2 traits.

Figure S5: The effects of an incompatibility on hybrid breakdown score, sijk (eq. 41), when the incompatibility appears in a genotype comprising i loci that are homozygous for alleles from one parental species, j loci that are homozygous for alleles from the other parental species, and k loci that are heterozygous.

Figure S6: Genotype plot for the raw Mytilus data.

Figure S7: Plots of the Drosophila male backcross data reanalyzed here (see Table 1 and Moehring 2011).

Figure S8: Predictions of Fisher's geometric model for heterogametic male hybrids.

Figure S9: Estimation of the fitness surface for interspecific hybrids from plants (Table 1), namely wild hybrids of the genus Populus (row (a); Christe et al. 2016), and an F2 cross of the genus Senecio (row (b); Chapman et al. 2016).

Figure S10: Estimation of the fitness surface for subspecific hybrids from Mus musculus (Table 1, White et al. 2011; Turner and Harr 2014).

Figure S11: Estimation of the fitness surface for backcross male hybrids from Drosophila species pairs (Table 1, Macdonald and Goldstein 1999; Moehring et al. 2006a, b).

Table S1: The contribution of parental maladaptation to hybrid breakdown.

Table S2: Checks for appropriateness for genomic data sets.

Table S3: The reference populations for Mytilus crosses.

Table S4: Expected breakdown scores for homogametic female hybrids with paternal X silencing.

Table S5: Inferring the hybrid fitness surface from genomic data sets.

Table S6: The significance of individual regression coefficients.

Table S7: Information on the 98 markers used for the Mytilus genotyping.

DATA ARCHIVING

All data and analysis scripts have been deposited in the Dryad repository (DOI: https://doi.org/10.5061/dryad.2hb2br7). Source code used for the simulations is available here https://github.com/alxsimon/Hybrid_fitness_Fisher_geometric_model.

AUTHOR CONTRIBUTIONS

A.S. and J.W. worked on the mathematical models and derivations. All authors designed the study, collected the data and wrote the manuscript.

ACKNOWLEDGMENTS

We are very grateful to all of the authors of the original data reanalyzed here and especially to the following, who provided clarifications or reformatting: Luisa Bresadola, Charlie Brummer, Mark A. Chapman, Camille Christe, Eric Hagg, Xionglei He, Christian Lexer, Xuehui Li, Amanda Moehring, Bret Payseur, Michael White, and Gavin Woodruff. We are also grateful to Andrea Betancourt for advice on processing the genome annotations, and to Peter Moran, who spotted a serious mistake in an earlier draft. Stuart Baird, Jon Slate, David Waxman, and anonymous reviewers, provided helpful comments that led to substantial improvements. This work was partly funded by the Agence Nationale de la Recherche (HYSEA project, ANR‐12‐BSV7‐0011). NB acknowledges a fellowship of the French Embassy in the United Kingdom, with Churchill College Cambridge. A.S. acknowledges a fellowship of the Labex CeMEB and the doctoral school GAIA. J.W. acknowledges Joanna Weinert.

The authors declare no conflicts of interests.

Appendix 1. The Brownian Bridge Approximation

In this Appendix, we derive the Brownian bridge approximation for the breakdown score of hybrid genotypes, under Fisher's geometric model. The Appendix is in three parts. First, we state our approximation without much justification. We will also assume haploid genomics, which makes the approximation simpler to explain, and also helps to draw connections with a previous model of epistasis introduced by Barton and Gale (1993). In the second section, we justify the approximation, by showing how it relates to explicit models of genetic and phenotypic divergence. We also show that the approximation is quite robust to varying assumptions about the divergence process. Third, and finally, we introduce the full, diploid version of the approximation, which is used for the results in the main text.

Results with haploid genetics

In this section, and the following section, we will assume that the parental species P1 and P2 are haploid. In this case, of course, there is no heterozygosity (p12=0) and the hybrid index is h=p2=1p1.

We can now consider the d substitutions that differentiate P1 and P2 as a chain of vectors, in the n‐dimensional phenotypic space, which connect one of the parental phenotypes to the other. Our approximation is to treat this path, on each of the n traits, as describing a Brownian bridge, that is, a random walk constrained at its start and end points (Revuz and Yor 1999, Ch. 1 for details). This approximation will be justified by appealing to the random combinations of alleles that appear together in hybrid genotypes (see below).

To derive our approximation, let B(t) denote a Brownian bridge, taking place over a single unit of time, and with a rate σB. B(t) is normally distributed, with the following mean:

E(B(t))=(1t)B(0)+tB(1),0t1 (A1)

where B(0) and B(1) are the fixed points at the beginning and end of the walk. The covariance at two time points is given by

Cov (B(t1),B(t2))=σB2(1t2)t1,0t1t21 (A2)

(Revuz and Yor 1999, Ch. 1). For our purposes, the start and end points of the walk are given by the trait values of the parental lines, so that, for trait i, B(0)=zP1,i and B(1)=zP2,i. The total length of the walk represents the complete path of d substitutions, each with typical effect vi, so that σB2=dvi. Finally, to determine the trait value for a hybrid carrying dh alleles that come from P2, we just “stop” the random walk at the intermediate “time” h. As such, using equations (A1) and (A2), the trait value zi will be a normal random variable with the following properties:

ziN(μi,σi2)μiE(zi)=(1h)zP1,i+hzP2,i (A3)
σi2 Var (zi)=dvih(1h) (A4)

The expectations and variances in these expressions involve two sorts of averaging. First, there is averaging over different realizations of evolutionary process, that is, the different sets of substitutions that might differentiate P1 and P2. Second, there is averaging over the subset of substitutions that might appear in a hybrid with a given value of h. Only this second averaging process will apply to real datasets, and it helps to explain the connection between our approximation and explicit models of divergence (see below).

To complete the results for haploidy, we note that the breakdown score, S, depends on the squared trait value (see eq. 4 of the main text). From normal theory, we then have:

Ezi2=σi2+μi2 (A5)
Var zi2=2σi2σi2+2μi2 (A6)

and so S will be approximately gamma distributed, with a mean and variance given by the weighted sum of these quantities (see also Rosas et al. 2010). The key quantity f, as defined in equation (2), is then found to be

fE(S)ES=dinλivih(1h)+inλi(1h)zP1,i+hzP2,i2dinλivi=h(1h)+fmal (A7)

where

fmal=iλi(1h)zP1,i+hzP2,i2diλivi (A8)

The term fmal comes from the μi2 in equation (A5), and describes the contribution to hybrid breakdown from parental maladaptation. The value of fmal depends on the relative positions in phenotypic space of the two parental types. It is useful to consider a few simple cases, which are summarized in Table S1, and which we also use in the main text. First, let us consider the case where maladaptation accrues independently in the two parental lines. In this case, shown in Table S1b, we expect no covariance between their phenotypic deviations, and so fmal is just a weighted sum of the relative breakdown scores for the two parental lines

fmal=(1h)2fP1+h2fP2,zP1,izP2,i¯=0 (A9)

where fP1=iλizP12/iλidvi. The assumption of independent maladaptation should apply to many real datasets, and so equation (A9) is used in the main text, as the basis of equations (6)(9) (see below for details). Independent maladaptation need not apply in every case, however, and it might be the case that P1 and P2 are identically maladapted, perhaps because of a shared unsuitability to a lab environment. This case, however, turns out to be comparable to the case with no parental adaptation. As shown in Table S1c, when both parental phenotypes are suboptimal, but identical, all genotypes suffer an additional constant contribution to their maladaptation. Finally, we can imagine a case where the two populations tend to become maladapted in opposite directions, such that the midparental phenotype is optimal. Results for this case are shown in Table S1d, but it is probably the least realistic of the three scenarios.

When parental lines are well adapted, such that fmal0, then our haploid model is closely connected to a simple model of epistasis introduced by Barton and Gale (1993). They provide results for cline shape and linkage disequilibrium under this model (Barton and Gale 1993; p. 18).

Comparison to simple models of divergence

The approximation introduced above includes only two fixed points—the parental phenotypes—and ignores their common ancestor, and all of the intermediate steps in their divergence. Nevertheless, we can be fairly confident that all of these intermediates had reasonably high fitness. The reason it is possible to ignore these intermediate steps is that mutations fix in a particular order during divergence, but in hybrids, they appear in random combinations, due to recombination and segregation. This makes the Brownian bridge approximation robust to a wide range of different assumptions about population divergence.

In this section of the Appendix, we illustrate this point, and explore the robustness of the approximation. For simplicity, we continue to assume haploid genetics, and consider only one phenotypic trait (hence, we write zi=z throughout). We will also limit ourselves to models with a low mutation rate, such that fixations occur one at a time. Given this last assumption, we can order the d fixations that differentiate P1 and P2 in a single chain, starting at P1, and then going back in time to the most recent common ancestor, and then forwards in time again toward P2. This is illustrated in Figure S1. We will denote as z(j), the trait value in the jth step of this chain, such that z(0) is the trait value for P1, and z(d) is the trait value for P2. Each genotype will contain a different number of alleles from P2, and the hybrid index of phenotype z(j) is simply h=j/d. Finally, the phenotypic effect of the jth substitution in the chain is defined by

mjz(j)z(j1) (A10)

Figure S1 illustrates these quantities.

Now, let us first consider the case where both populations remain well adapted, or close to the optimum, throughout the divergence process. We can then model the phenotype at step j as a normal random variable.

z(j)N(0,v/2) (A11)

From equations (A10) and (A11), it follows that the effects of neighboring substitutions in the chain will negatively covary, implying that maladapted populations will tend to fix mutations that return them closer to the optimum:

Cov (mi,mj)=v,i=jv/2,i=j±10, otherwise (A12)

Now let us consider the phenotype that contains dh alleles from P2, namely z(dh). This phenotype can be written as the sum of substitutional changes, starting from the initial P1 phenotype:

z(dh)=z(0)+i=1dhmi (A13)

We define the relative breakdown of this genotype as fhz2(dh)/(dv), and the expected value of this quantity is

E(fh)E(z2(dh))dv=1dv Var (z(0))+2 Cov z(0),i=1dhmi+ Var i=1dhmi=1dv Var (z(0))+2 Cov z(0),i=1dhmi+dh Var mi+2i<jdh Cov mi,mj=12d+h(2ρz(0),mi+1+(dh1)ρmimj) (A14)

where ρz(0),mi is the average correlation between the initial P1 phenotype, and the effect of each substitution; and ρmimj is the average correlation between each distinct pair of substitutions. Let us first evaluate equation (A14) when the dh mutations are added to the hybrid in the same fixed order that they appear in the chain of fit intermediates (Fig. S1). In this case, from equation (A12), only the first of the dh substitutions will covary with the initial phenotype, and so we have ρz(0),mi=1/(2dh). Similarly, each substitution will only covary with its preceding and subsequent substitutions, such that ρmimj=122(dh1)dh(dh1)=1/(dh). As such, when mutations are added in this fixed order, equation (A14) yields:

E(fh)=12d=E(fP) (A15)

So the expected breakdown of all genotypes, regardless of their hybrid index, is equal to that of the parental lines. Equation (A15) reflects the fact that each of these “hybrid” genotypes is one of the fit intermediates, and all will be very fit compared to the worst class of hybrid, unless the divergence, d, is very small. This result can be contrasted with the result for real hybrids, where segregation and recombination will bring together collections of alleles that fixed at different times. If the dh substitutions in equation (A13) are chosen at random, without replacement, from the complete set of d substitutions, then the correlation terms in equation (A14) are both reduced by a factor h, reflecting the fact that consecutive substitutions in the chain will only appear together with probability h. With ρz(0),mi=1/(2d), and ρmi,mj=1/d, equation (A14) becomes

E(fh)=h(1h)+E(fP) (A16)

Even without conditioning on the parental phenotypes, equation (A16) is now very close to the Brownian bridge approximation of equation (A7). This is illustrated in Figure S2a. In this simulation run, the diverging populations remained close to the optimum throughout (as indicated by the jagged black line). By contrast, hybrids, which contain a randomized collection of alleles, can be very unfit on average (as shown by the smoother black curves). Their expected deviation from the optimum is well described by the Brownian bridge approximation (see colored dotted lines, which show eq. A7). Because the maladaptation of the parental lines is small, the analytical predictions are little affected if we explicitly account for the observed maladaptation, by calculating fmal from equation (A8) (see red dotted lines in Fig. S2a), or if we ignore parental maladaptation altogether, by setting fmal to zero (see blue dotted lines in Fig. S2a).

The model discussed above assumed that the effects of fixed mutations were normally distributed (see eqs. A10 and A11 and Fig. S2a), but the effect sizes of real substitutions might show substantial deviations from normality. To investigate the effects of nonnormal effect sizes, Figure S2b shows results when divergence included a few substitutions of very large effect (simulated by drawing the trait values, z(i), from a Cauchy distribution). The results show that the Brownian bridge continues to give a very good fit. This is because the approximation follows from robust, central‐limit‐type behavior, and so it does not depend strongly on the normality of the allelic effects.

The approximation is also robust if we assume more complicated patterns of divergence. For example, Figure S2c shows results where we have assumed that both populations adapted independently to a shifting optimum. This was modeled by generating the z(i) from an Ornstein–Uhlenbeck process, where the ancestral population (characterized by an intermediate value of h), was strongly maladapted.

All of the examples above assume well‐adapted parental lines. By contrast, under strong inbreeding, or mutation accumulation, divergence might be entirely maladaptive, with no tendency for populations to revert to the optimum. To model this, we can generate the allelic effects, mi, as uncorrelated normal variables, starting from an optimal ancestral state. In this case, if we assume that the parental lines each contain d/2 substitutions, then their phenotypes are distributed as z(0),z(d)N(0,vd/2), such that E(fP)=1/2. As shown in Figure S2d, the Brownian bridge approximation remains accurate in this case. For a single trait, the accuracy does depend strongly on knowing the true parental trait values (dotted red lines), and the approximation that uses their expected squared values (dotted blue lines) can be quite inaccurate. Nevertheless, over multiple traits, we can expect the errors of this approximation to cancel out. A clear feature of this approximation, is that the expected levels of breakdown are constant for all haploid hybrids (the blue line in Fig. S2d is flat). This reflects the fact that, without natural selection, both the parental lines, and all possible hybrid genotypes are simply random assemblies of alleles, without any coadaptation. This echoes the result for diploids with maximal parental maladaptation, where breakdown levels depend solely on heterozygosity, and not on the hybrid index (see eq. 8 and Fig. 2A).

Examples illustrated in Figure S2 use simple heuristic simulations, where the phenotypes z(j) were drawn from a known distribution. In Figures S3 and S4 we show results when these phenotypes were generated by explicit population genetic simulations, including multiple phenotypic traits, and the appearance, fixation, or loss of new mutations. The full simulation procedure is described in Fraïsse et al. (2016b), but all simulations assumed allopatric divergence between P1 and P2, haploid genomics, and a low mutation rate, such that fixation or loss events could be tracked one mutation at a time. In the simulations shown in Figure S3, we began with a core set of parameters (Fig. S3a), and then varied these one at a time. Specifically, we varied the curvature of the fitness function, α, with a core set of parameters (eq. 1; Fig. S3b), the size of the parental populations (Fig. S3c), the total number of traits, n (eq. 4; Fig. S3d), and the shape of the distribution of mutant effects (Fig. S3e). These different parameter regimes led to very different levels of genetic divergence between the parents (i.e., different values of d), and different distributions of fixed phenotypic effects. Nevertheless, the Brownian bridge approximation remained very accurate in all cases. Finally, we moved the phenotypic optima during the simulation, so that much of the divergence took place by positive selection (see Fraïsse et al. 2016b; Fig. S3f). This sometimes led to an increase in the variance in breakdown score (Fig. S3f), but otherwise, the Brownian bridge approximation remained accurate.

Figure S4 shows further simulations, where the optima can move in discrete jumps (as in Orr 1998; Barton 2001). For n=2 traits, we compare divergence with a stable, fixed optimum (Fig. S4a), optima that vary identically for the two diverging populations (Fig. S4b), and   optima that move in different directions in the environments experienced by P1 and P2 (Fig. S4c). These scenarios correspond, roughly, to mutation‐order speciation via drift (Fig. S4a) or selection (Fig. S4b), and to ecological speciation (Fig. S4c), where reproductive isolation evolves as a byproduct of adaptation to different environments (Mani and Clarke 1990; Coyne and Orr 2004; Gavrilets 2004; Chevin et al. 2014).

When the optima can vary, we can also distinguish between intrinsic reproductive isolation, which involves maladaptation in traits that are under identical selection in all environments, and extrinsic isolation, which involves maladaptation in traits whose optima vary (Chevin et al. 2014). To illustrate this, the right‐hand columns in Figure S4 compare the breakdown caused by selection on both traits, measured in the environment to which P1 has adapted, and the breakdown due to one trait only, corresponding to the vertical axis on the fitness landscapes, for which the optimum never varies. This shows how the approach can be used to model breakdown that is caused by the correlated effects of adaptation to distinct environments (due to pleiotropy or linkage), even when hybrid fitness is measured in a benign environment, where these divergent selection pressures do not apply.

Extension to diploidy

In this final section of the Appendix, we extend the results above to diploid genetics, as described in the main text. In this case, we must explicitly consider heterozygosity, p 12, and the hybrid index must include contributions from both homozygous and heterozygous alleles: h=p2+p12/2.

To apply the Brownian bridge approximation to diploidy, we will assume that the chain of random vectors applies to alleles in their heterozygous state. Because all mutations are assumed to be additive on the phenotype, this means that the random walk, which begins at the phenotype of P1, will now end at the midparental phenotype, instead of the phenotype of P2. It also means that, to deal with alleles that appear in homozygous state, some sections of the walk must be counted twice. In particular, we need to study the following quantity:

Bdip(t1,t2)B(t1+t2)+B(t1)B(0) (A17)

where B(·) is a Brownian bridge, as described in equations (A1) and (A2) above. From these equations, we find:

E(Bdip(t1,t2))=(12t1t2)B(0)+(2t1t2)B(1) (A18)
Var (Bdip(t1,t2))= Var (B(t1+t2))+ Var (B(t1))+2 Cov (B(t1),B(t1+t2))=σB2{(t1+t2)(1t1t2)+t1(1t1)+2(1t1t2)t1}=σB2{t2(1t2)+4t1(1t1t2)} (A19)

For our diploid problem, we start the walk at the P1 phenotype: B(0)=zP1,i, but end it at the midparent B(1)=(zP1,i+zP2,i)/2. The total walk length is σB2=dvi, just as with haploidy, but now vi is understood as the typical effect of an allele in heterozygous state. Finally, we take the intermediate “times” as t1=p2 (this section of the walk is counted twice, to account for homozygous alleles), and t2=p12 (this section is counted once, to account for heterozygous alleles). This results in

ziN(μi,σi2)μiEzi=(12p2p12)zP1,i+(2p2+p12)zP1,i+zP2,i2=(1h)zP1,i+hzP2,i (A20)
σi2 Var zi=dvip12(1p12)+4p1p2=dvi4h(1h)p12 (A21)

We note that equation (A20) is identical to the haploid result of equation (A3). This shows that, once the hybrid index is correctly defined, all of the results concerning parental maladaptation (as shown in eqs. A8 and A9 and Table S1) apply equally to diploids and haploids. Combining equations (A20)(A21) with equations (A5)(A6), and equation (A9), leads directly to the diploid results shown in the main text.

The only other difference to note is that maladaptive diploid substitutions now contribute a possible 22vi=4vi to the trait variance. This implies that highly maladapted parents, who have fixed d/2 such mutations, will have a breakdown score of E(SP)d24inλivi, such that E(fP)=2 under diploidy (as opposed to E(fP)=1/2 under haploidy). The diploid result is used in the main text, to explore data from highly inbred lines of Zea mays.

Appendix 2. Fisher's Model and Dobzhansky‐Muller Incompatibilities

In this Appendix, we show that a major prediction of Fisher's geometric model, assuming well‐adapted parents (eq. 9), can also be derived from a model of genetic incompatibilities, each involving a small number of loci. Such models stem from the classic work of Dobzhansky 1937 and have been important in the study of speciation genetics (Fraïsse et al. 2014; Orr 1995; Turelli and Orr 2000; Gavrilets 2004; Welch 2004; Demuth and Wade 2005; Fraïsse et al. 2016b). We begin by introducing the model in general form, and then explore different ways of assigning the parameters, which determine the contribution of individual incompatibilities to the overall level of breakdown.

A general model of incompatibilities

Following Orr (1995), let us assume that certain combinations of alleles, at d of the divergent loci, can be intrinsically incompatible, while all other combinations confer high fitness. By assumption, the pure species genotypes, and their ancestral states, must be fit, but all other combinations have a fixed probability ε of being incompatible.

The worst class of hybrid will contain all of the possible incompatibilities, and so its expected breakdown score will be proportional to the expected number of incompatibilities. This was calculated by Welch (2004, eqs. 1–2):

E(S,I)εd(21) (A22)

Here, and below, we use the subscript I to indicate a model of incompatibilities. To derive fI (eq. 2), we note that hybrids will have higher fitness when some of the incompatibilities are absent from their genomes (Turelli and Orr 2000). The probability that an incompatibility is present depends on how many of the ℓ loci are heterozygous. For a genotype comprising i loci that are homozygous for the P1 allele, j loci homozygous for the P2 allele, and k loci that are heterozygous, the probability required is:

πijk=2ku(i)u(j)22, with u(x)=0 if x>01ifx=0 (A23)

which is the proportion of the possible combinations of heterospecific alleles that are present in an “ijk” genotype.

Incompatibilities may also have reduced effects due to recessivity, when their negative effects are masked by the presence of alternative, compatible alleles (Turelli and Orr 2000). To model this, we introduce the free parameter sijk, which is the expected increase in breakdown when an incompatibility appears in an ijk genotype. Finally, in a hybrid genome characterized by p 1, p 2, and p 12, the trinomial expansion of (p1+p2+p12), tells us how many ℓ‐locus genotypes of each type it is expected to contain. Putting these together, we have

fI=i+j+k=i,j,kp1ip2jp12kπijksijk (A24)

where

i,j,k!i!j!k!

Equations (A23) and (A24) extend results with =2 and =3 from Turelli and Orr (2000), and represent a general model of breakdown caused by incompatibilities. A notable feature of these equations is their large number of free parameters. Even with symmetry between P1 and P2 (such that sijk=sjik), we will still require a total of (1+/4) different sijk values to specify the model (i.e., three extra parameters for two‐locus incompatibilities, five parameters for three‐locus incompatibilities etc.). There is good empirical evidence for, at least, two‐, three‐, and four‐locus incompatibilities (Fraïsse et al. 2014), and so with the sijk and the ε, the full model would depend on at least 17 free parameters. By contrast, equation (9), from Fisher's geometric model, has no free parameters. The incompatibility‐based model is therefore much more flexible, but also much more difficult to explore.

In the following section, we examine different ways of assigning the sijk. We will show that a particular set of values make the model identical to Fisher's geometric model (eq. 9). We will also follow Turelli and Orr (2000), and show that these values are biologically plausible, given well‐established empirical patterns, and especially Haldane's Rule (see Haldane 1922 and Appendix 4).

The dominance relations of incompatibilities

Let us begin by assigning the following functional form for the sijk:

sijk12δk (A25)

where below, we will use a constant of proportionality such that sijk=(1/2)δk·2(22), to simplify the final results. In equation (A25), the parameter δ allows us to tune the dominance of incompatibilities, measured in terms of breakdown score, rather than fitness.

When δ=1, then each heterozygous locus halves the effects of incompatibility. This is equivalent to assuming that incompatibilities act multiplicatively, since each heterozygous locus halves the number of times that the incompatible combination of alleles is present in the genome. The sijk under multiplicative selection (δ=1) are illustrated by the green points in Figure S5.

To determine the predictions of this model, let us substitute equation (A25) into equation (A24), and set δ=1. After some algebra, we find:

fI=21p2+12p12p1+12p12,δ=1 (A26)
21h(1h),δ=1 (A27)
=4h(1h),=2,δ=1 (A28)

where h is the hybrid index (eq. 3). As such, when incompatibilities act multiplicatively, breakdown will depend solely on the hybrid index, and not on the heterozygosity. When pairwise incompatibilities are considered (eq. A28), the model becomes equivalent to the haploid version of Fisher's geometric model (see eq. A7), and also to the model of Barton and Gale (1993).

More importantly, equation (A28) predicts that breakdown should not change between the F1 and F2 crosses (both of which have h=12), and that homogametic F1 will have the highest possible breakdown score. As such, this multiplicative model cannot predict hybrid breakdown between the F1 and F2, nor Haldane's Rule, and both patterns have widespread empirical support (see Table A1 of Fraïsse et al. 2016b).

Now let us consider another extreme assumption. We assume that incompatibilities are fully recessive, such that no breakdown appears unless all incompatible alleles appear in homozygous or hemizygous form. We model this by making δ very large, such that sijk=0 unless k=0. These values are illustrated by the red points in Figure S5. With the assumption of complete recessivity, we find:

fI=2(p1+p2)p1p2,δ (A29)
=4h(1h)2p12+p122,=2,δ (A30)

Equation (A29) also fails to predict Haldane's Rule, unless there is substantial uniparental inheritance from both the male and female parents (see Appendix 4). This is because fI=0 if p1p2=0, and so both male and female F1 will have identical and optimal fitness. For similar reasons, equation (A29) predicts that the fitness of heterogametic backcrosses will decrease with p12,A, and this prediction is not supported by the relevant data (Moehring 2011).

We have shown that both extreme regimes (no recessivity, and complete recessivity) yield unsupported predictions. But what values of δ are biologically plausible? To answer this question, let us consider Haldane's Rule under an incompatibility‐based model, and ignoring uniparental inheritance. Assuming that males are heterogametic, Haldane's Rule will hold when

fI,F1>fI,F1 (A31)

(see Appendix 4). If we use gX and 1gX to denote the proportions of the parental divergence that is found on the X and autosomes (as in eq. 16 of the main text), then using equations (A24) and (A25), equation (A31) is found to be equivalent to:

2(1gX)+2δgX(1gX)1gX+2δgX>22 (A32)

This condition is most difficult to satisfy when incompatibilities involve two loci (=2), and in this case, we find the solution:

δ>ln2gX1gX/ln(2). (A33)

The value of δ that is required to yield Haldane's Rule will therefore increase with gX. Toward the limit of the biologically plausible range, when two‐thirds of the between‐species divergence is X‐linked (gX=2/3) Haldane's Rule will hold only if δ>2. As such, setting δ=2, such that each heterozygous locus reduces the breakdown score by a factor of four, will yield Haldane's Rule in most cases. The sijk values from equation (A25) with δ=2 are shown as yellow points in Figure S5. Another feature of the model with δ=2 is that it produces parameter dependencies that are very close to those predicted by Fisher's geometrical model (see also Manna et al. 2011). The similarity is clearest with two‐locus incompatibilities, where we find

fI=12δ2p121p12112δ+4p1p2,=2 (A34)
=4h(1h)p12+14p122,=2,δ=2 (A35)

Comparing equation (A35) to equation (9), shows that fIf when we use equation (A25) with δ=2.

We can then go further, and find a set of sijk values that yield exactly the same dependencies as Fisher's model. To do this, we set fI=f, using equations (9) and (A24), and then solve for the sijk. After some algebra, we find

sijk=(k)(ij)2(1)πijk (A36)

These values are shown as blue points in Figure S5. Equation (A36) looks unwieldy, and it was derived solely to make the models agree. Nevertheless, it embodies biologically plausible assumptions about incompatibilities. First, the similarities between the blue and yellow points in Figure S5 show that Fisher's model corresponds to the assumption of partial recessivity, at levels sufficient to produce Haldane's Rule. Second, equation (A36) states that incompatibilities will have stronger effects when alleles from both parental species appear in homozygous state. For example, if the three alleles ABc form an incompatibility (with upper and lower case letters distinguishing alleles from P1 and P2), then equation (A36) predicts that the genotype Aa/BB/cc (with ijk=111) will tend to have lower fitness than the genotype AA/BB/Cc (with ijk=201) even though both genotypes contain the incompatibility, and both comprise two homozygous loci and one heterozygous locus.

This section has shown that Fisher's geometric model implies assumptions about “incompatibilities,” that yield well supported predictions. However, it remains true that the full incompatibility‐based model (eqs. A22A24) is much more flexible. While that model is very parameter rich, the results above also suggest a family of models that might combine the flexibility and simplicity. When we assume only pairwise incompatibilities (=2), then equations (A28), (A30), (A35), and the result from Fisher's model (eq. 9), are all special cases of

f=4h(1h)+(2f het r2)p12+(1+rf het )p122 (A37)

where f het is the breakdown experienced by the global heterozygote (the genotype with p12=1), and r is the rate of change in breakdown with p 12.

Appendix 3. Segregation and Recombination

In this Appendix, we consider the effects of segregation and recombination on the expected levels of breakdown. For a recombinant cross, such as the F2 or backcross, p 12 and h might vary between individuals, and so we must treat f as a random variable. To see this, let us write equation (9) as

f=4h(1h)p12=p12(1p12)+4p1p2=p12p122+4p14p124p1p12 (A38)

and so its expected value is:

E(f)=p¯12(1p¯12)+4p¯1p¯2 Var (p12)4( Var (p1)+ Cov (p12,p1)) (A39)

where overbars represent expected values. The variances and covariances in equation (A39) will depend on the distribution of the divergence across the genome, and on patterns of segregation and recombination. However, we can derive simple and useful predictions if we assume that the divergence is equally distributed among m freely recombining regions. In this case, we can use results from a multinomial distribution.

Var (p12)=p¯12(1p¯12)m (A40)
Var (p1)=p¯1(1p¯1)m (A41)
Cov (p12,p1)=p¯1p¯12m (A42)

These results also apply to estimators from m independently segregating markers. As an example, let us compare the first backcross and the F2, in a population with strictly biparental inheritance. For the first backcross, we have p¯12=12 and p1p2=0, and so

E(f BC 1)=1411m (A43)

For the F2, we have p¯12=12 and p¯1=p¯2=14, and so

E(fF2)=1211m=2E(f BC 1) (A44)

and so the predicted breakdown in the F2 is always double that of the first backcross. By the same method, we can also calculate the variance of f, but this requires higher order moments of the multinomial distribution (Newcomer et al. 2008), and this can yield lengthy expressions. However, the following results for the F2 are required to justify the approximation of equation (13) from the main text.

E(4h(1h))=112m1,p¯12=12,p¯1=14 (A45)
Var (4h(1h))=2m14m312m2,p¯12=12,p¯1=14 (A46)

Comparing equation (A46) to equation (A40) shows that most of the variance in F2 breakdown will be due to variation in heterozygosity.

Appendix 4. Predictions of Fisher's Geometric Model with Sex‐Specific Inheritance

In this Appendix, we will develop some predictions of Fisher's geometric model, after relaxing the assumption of strictly biparental inheritance. These results justify claims in the main text, and also provide analytical support for some assertions in Fraïsse et al. (2016b).

Additional notation and basics

As described in the main text, the basic strategy for incorporating sex‐specific inheritance is to write the genome‐wide measures as weighted sums of contributions from loci of different types. For example, if we have a system with an X chromosome and autosomes, and a subset of loci that are strictly maternally or paternally inherited, then we could write:

p12=gXp12,X+gAp12,A+gp12,+gp12,p1=gXp1,X+gAp1,A+gp1,+gp1,p2=gXp2,X+gAp2,A+gp2,+gp2,h=gXhX+gAhA+gh+gh (A47)

where gX+gA+g+g=1. The weightings will vary with the proportion of the divergence found at loci of each type, and with the typical effect sizes of the fixed differences. Following equation (3), the hybrid index for the X chromosome, can also be written as

hX=p2,X+p12,X2 (A48)

and similarly for the other quantities. Here, as in the main text, we have assumed an XO system, such that females are homogametic, and males heterogametic. However, the equations are flexible enough to model a range of biological situations. For example, haplodiploidy might be modeled by setting gX=1, and the uniparentally inherited loci (included via g and g) might include the Y (or W) chromosome, organelles, selectively silenced regions of the X (or Z), or other imprinted loci (Turelli and Moyle 2007).

With sex‐specific inheritance, we also require assumptions about sex‐specific selection, and this can be incorporated in several ways (Connallon and Clark 2014; Fraïsse et al. 2016b). For example, sexual conflict can be modeled by assuming that there are differences in the optimal trait values for each sex (Connallon and Clark 2014). Alternatively, we could assume that some subset of the traits is under selection in only one sex, for example traits involved in spermatogenesis or oogenesis (Wu and Davis 1993; Coyne and Orr 2004). Finally, we could assume that sexes are under identical selection, which will usually require assumptions about dosage compensation (see below and Fraïsse et al. 2016b).

Patterns in the F1: Haldane's rule and asymmetry

This section presents results for the initial F1 cross (P1 × P2), and particularly the findings of Haldane's Rule, and parental sex asymmetry (Haldane 1922; Turelli and Orr 2000; Turelli and Moyle 2007; Fraïsse et al. 2016b).

Haldane's Rule applies to offspring of different sexes, in species with chromosomal sex determination. It states that, when F1 breakdown is stronger in offspring of one sex, it will tend to be the heterogametic sex (Haldane 1922; Turelli and Orr 2000). Previous authors have noted that Fisher's model predicts this pattern (Barton 2001; Fraïsse et al. 2016b; Schiffman and Ralph 2017), and here, we extend this insight to give formal conditions for Haldane's Rule. We will assume identical selection in both sexes, and that pure‐species individuals of both sexes have the same fitness. This implies a form of dosage compensation, such that X‐linked alleles have identical effects in homozygous or hemizygous state (Mank et al. 2011; Fraïsse et al. 2016b). Assuming an XO or XY system, we expect greater breakdown in the heterogametic (male) sex on the condition that:

fF1<fF1. (A49)

In homogametic females, all divergent sites on the X and autosomes will be heterozygous, such that p12=gA+gX, while by definition, uniparentally inherited loci will come from one parent alone, such that p1p2=gg. In heterogametic males, by contrast, only autosomal sites will be heterozygous, such that p12=gA, and X‐linked sites will be hemizygous and maternally inherited, such that p1p2=(g+gX)g. Putting these results together with equation (6), and using

Δgg (A50)

to denote the difference in the contributions of exclusively maternally and paternally inherited sites, we find:

fF1=g+gΔ2+fP(1+Δ)24+fP(1Δ)24 (A51)
=fP+fP4,g,g=0 (A52)
fF1=g+g+gX(Δ+gX)2+fP(1+Δ+gX)24+fP(1ΔgX)24 (A53)
=gX(1gX)+(1+gX2)fP+fP4+gXfPfP2,g,g=0 (A54)

where we have used fP and fP to denote the initial maladaptation of the maternal and paternal lines used to make the F1. We can now combine equations (A49)(A54) to give formal conditions for Haldane's Rule. We find that the heterogametic sex will tend to show more breakdown if

0<gX<gX (A55)

where

gX22Δ(4(fP+fP))+fPfP4(fP+fP) (A56)
=12Δ,fP,fP=0 (A57)
=21Δ(2fP)2fP,fP=fPfP (A58)
=42fP4fP,Δ,fP=0 (A59)

Several observations follow from the results above. First, the conditions for Haldane's Rule are always harder to satisfy as Δ increases, and so, from equation (A50), the presence of loci with exclusively maternal inheritance makes Haldane's Rule less likely, while paternal inheritance makes it more likely. Second, Haldane's Rule is usually more likely when the parental lines are maladapted. This is an effect of heterosis, that is an intrinsic advantage to heterozygosity, because the homogametic sex will tend to be more heterozygous. Third, the sole exception to this pattern is where the paternal line is much less fit than the maternal (i.e., if fPfP as in eq. A59). In this case, Haldane's Rule is less likely, because male F1 carry less of the unfit paternal genome.

The patterns above also clarify the broader role of uniparental effects in determining F1 fitness (Fraïsse et al. 2016b). For example, when there is no strictly uniparental inheritance, such that g,g=0, then the homogametic F1 will always be fitter than the average of the parental lines (eq. A52). By contrast, when uniparental inheritance is present (eq. A51), then the relative breakdown remains roughly constant. This implies that the absolute breakdown score will decline steadily with d, the genetic divergence between the parental lines (see eqs. 2 and 5). As such, Fisher's model requires uniparental inheritance to explain the observation of an “F1 speciation clock” (Edmands 2002; Fraïsse et al. 2016b). This also helps to determine to the conditions for rare exceptions to Haldane's Rule (as observed, e.g., in Teleogryllus Moran et al. 2017). We predict such exceptions only when uniparentally inherited loci act on traits that are subject to selection only in the homogametic sex (Fraïsse et al. 2016b).

Uniparental inheritance is also required, by definition, to explain another widespread phenomenon: the strong asymmetry in fitness between the reciprocal F1, that is male–female versus female–male crosses of the same species pair (Turelli and Moyle 2007; Fraïsse et al. 2016b). Such asymmetry is found even in species without sex chromosomes, or even separate sexes, and so it cannot be connected to Haldane's Rule in any rigid sense (Bouchemousse et al. 2016; Fraïsse et al. 2016b). The results above (eqs. A51A54) predict such asymmetry only in special circumstances: when there are uniparental effects of different sizes (Δ0), and parental lines with different levels of maladaptation (fPfP). These are the assumptions that we used to analyze the Teleogryllus data of Moran et al. (2017) (see below). However, the observations of F1 asymmetry are very widespread, and apply even to species pairs that are both well adapted (Turelli and Moyle 2007; Fraïsse et al. 2016b). The reason for this apparent discrepancy is that the present work considers the expected breakdown, conditioned only on the levels of parental maladaptation (fP and fP), and so expectations for the reciprocal F1 must be identical when fP=fP. If, by contrast, we condition on the phenotypic effects of uniparentally inherited alleles, then Fisher's geometric model can account for the asymmetries observed; this was demonstrated by Fraïsse et al. (2016b).

Patterns in backcrosses

In this section, we provide further details of the analyses of backcross data. These include the male backcrosses from Drosophila (Noor et al. 2001; Macdonald and Goldstein 1999; Moehring et al. 2006a, 2006b), that were reanalyzed by Moehring (2011), and the female backcrosses from Teleogryllus presented by Moran et al. (2017). In both cases, the data come from hybrids of a single sex, and the fitness traits are components of fertility that are plausibly sex‐specific (i.e., spermatogenesis and oogenesis; see Table 1). As such, we treat the predictions of Fisher's geometric model as applying to a single sex.

The Drosophila data (Macdonald and Goldstein 1999; Noor et al. 2001; Moehring et al. 2006a, 2006b) come from three pairs of reciprocal backcrosses. The species pairs are D. simulans/sechellia (Macdonald and Goldstein 1999); D. santomea/yakuba (Moehring et al. 2006a, 2006b); and D. pseudoobscura/persimilis (Noor et al. 2001). In all three cases, the crosses can produce sterile hybrids, and so it is safe to assume that the parental lines are well adapted, compared to the worst possible class of hybrid, so that we can derive predictions from equation (9). Our analyses will also neglect loci with exclusively uniparental inheritance, because they do not qualitatively alter the predictions in this case. With these assumptions, as noted by Moehring (2011), hybrids can be characterized by two measures of “heterospecificity,” namely, the autosomal heterozygosity, p12,A, and the hybrid index of the X, hX. All datasets scored fertility in each hybrid fly as a binary, or ordinal trait (see Table 1), and so Moehring (2011) asked whether fertility problems varied systematically with p12,A and hX. She found, in all six crosses, that backcross fertility problems correlated strongly and positively with hX, but correlations with p12,A were weak and inconsistent. This is shown in Figure S7, and Table 3 of Moehring (2011). To show how Fisher's geometric model might account for these patterns, we can write equation (9) as

f BC =4h(1h)p12=p12(1p12)+4p2p1=a(1a)+4x(1xa)=aa2+4x4x24ax (A60)

where

a(1gX)p12,AxgXhX (A61)

Figure S8a depicts the fitness surface of equation (A60) as a function of x and a. Each dataset of hybrid males could occupy a rectangular region of this surface, as determined by the its value of gX, and this is how the data are plotted in Figure S7. The regions also correspond to a region of Figure 1 C, comprising the parallelogram delimited by the dotted horizontal line and the two arrows. From annotated Drosophila genomes, we estimated that gX=0.17 might characterize the simulans/sechellia and yakuba/santomea pairs, and that gX=0.37 might characterize the pseudoobscura/persimilis pair (Table 1; see Methods for details). The regions of parameter space for these values of gX are marked on Figure S8a, while panels b‐e show slices through the fitness surface for these values. In both cases, breakdown increases steadily with hX, except in the improbable case that the recombinant autosomes were completely heterozygous (Fig. S8b–c). This is consistent with the positive correlations observed by Moehring (2011). By contrast, the dependencies on p12,A (Fig. S8d–e) vary in sign. This is consistent with the lack of consistent correlations with p12,A observed by Moehring (2011) (Fig. S7). Figure S8e also shows how, when gX is large, the correlations with p12,A reverse in sign, for extreme values of hX. Figure S7e–f shows how the data of Noor et al. (2001) divide naturally into individuals with high, medium and low values of hX, and together, this explains the rationale of the test presented in Table 3.

To analyze the female backcross data of Moran et al. (2017) for Teleogryllus, we used equation (10), which assumes that P1 is well adapted, while P2 is maladapted. We also assume that a proportion π of the paternal X chromosome is silenced. In this case, the quantities p 12 and h are calculated without considering silenced alleles, because these make no contribution to the phenotype. This rich dataset contains a wide variety of cross types, and so the full predictions for all of the relevant hybrids are listed in Table S4.

LITERATURE CITED

  1. Abbott, R. J. , and Brennan A. C.. 2014. Altitudinal gradients, plant hybrid zones and evolutionary novelty. Philos. Trans. R. Soc. B Biol. Sci. 369: 20130346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Abbott, R. J. , Albach D., Ansell S., Arntzen J. W., Baird S. J. E., Bierne N., Boughman J., Brelsford A., Buerkle C. A., Buggs R., et al. 2013. Hybridization and speciation. J. Evol. Biol. 26: 229–246. [DOI] [PubMed] [Google Scholar]
  3. Aeschbacher, S. , Selby J. P., Willis J. H., and Coop G.. 2017. Population‐genomic inference of the strength and timing of selection against gene flow. Proc. Natl. Acad. Sci. 114: 7061–7066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Agresti, A. 2003. Categorical data analysis. 2nd ed. John Wiley & Sons, Inc, Hoboken, New Jersey. [Google Scholar]
  5. Baird, S. J. E . 2017. The impact of high‐throughput sequencing technology on speciation research: maintaining perspective. J. Evol. Biol. 30: 1482–1487. [DOI] [PubMed] [Google Scholar]
  6. Baird, S. J. E. , Ribas A., Macholán M., Albrecht T., Piálek J., and de Bellocq J. Goü. 2012. Where are the wormy mice? A reexamination of hybrid parasitism in the European house mouse hybrid zone: helminth parasites in the house mouse hybrid zone. Evolution 66: 2757–2772. [DOI] [PubMed] [Google Scholar]
  7. Barton, N. H . 2001. The role of hybridization in evolution. Mol. Ecol. 10: 551–568. [DOI] [PubMed] [Google Scholar]
  8. Barton, N. H. 2017. How does epistasis influence the response to selection? Heredity 118:96–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Barton, N. H. , and Gale K. S.. 1993. Genetic analysis of hybrid zones Pp. 13–45 in Hybrid zones and the evolutionary process, Harrison R. G. ed. Oxford Univ. Press, Oxford. [Google Scholar]
  10. Bierne, N. , Bonhomme F., Boudry P., Szulkin M., and David P.. 2006. Fitness landscapes support the dominance theory of post‐zygotic isolation in the mussels Mytilus edulis and M. galloprovincialis . Proc. R. Soc. B 273: 1253–1260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bierne, N. , David P., Boudry P., and Bonhomme F.. 2002. Assortative fertilization and selection at larval stage in the mussels Mytilus edulis and M. galloprovincialis . Evolution 56: 292–298. [DOI] [PubMed] [Google Scholar]
  12. Blanquart, F. , Achaz G., Bataillon T., and Tenaillon O.. 2014. Properties of selected mutations and genotypic landscapes under Fisher's geometric model. Evolution 68: 3537–3554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bouchemousse, S. , Lévêque L., Dubois G., and Viard F.. 2016. Co‐Occurrence and Reproductive Synchrony Do Not Ensure Hybridization between an Alien Tunicate and Its Interfertile Native Congener. Evol. Ecol. 30: 69–87. [Google Scholar]
  14. Boyle, E. A. , Li Y. I., and Pritchard J. K.. 2017. An expanded view of complex traits: from polygenic to omnigenic. Cell 169: 1177–1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Buerkle, C. A . 2017. Inconvenient truths in population and speciation genetics point towards a future beyond allele frequencies. J. Evol. Biol. 30: 1498–1500. [DOI] [PubMed] [Google Scholar]
  16. Butlin, R. K. , and Ritchie M. G.. 1989. Genetic coupling in mate recognition systems: what is the evidence? Biol. J. Linn. Soc. 37: 237–246. [Google Scholar]
  17. Caseys, C. , Stritt C., Glauser G., Blanchard T., and Lexer C.. 2015. Effects of hybridization and evolutionary constraints on secondary metabolites: the genetic architecture of phenylpropanoids in European populus species. PLoS ONE 10: e0128200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Chapman, M. A. , Hiscock S. J., and Filatov D. A.. 2016. The genomic bases of morphological divergence and reproductive isolation driven by ecological speciation in Senecio (Asteraceae). J. Evol. Biol. 29: 98–113. [DOI] [PubMed] [Google Scholar]
  19. Chevin, L.‐M. , Decorzent G., and Lenormand T.. 2014. Niche dimensionality and the genetics of ecological speciation. Evolution 68: 1244–1256. [DOI] [PubMed] [Google Scholar]
  20. Christe, C. , Stölting K. N., Paris M., Frasse C., Bierne N., and Lexer C.. 2017. Adaptive evolution and segregating load contribute to the genomic landscape of divergence in two tree species connected by episodic gene flow. Mol. Ecol. 26: 59–76. [DOI] [PubMed] [Google Scholar]
  21. Christe, C. , Stölting K. N., Bresadola L., Fussi B., Heinze B., Wegmann D., and Lexer C.. 2016. Selection against recombinant hybrids maintains reproductive isolation in hybridizing Populus species despite F1 fertility and recurrent gene flow. Mol. Ecol. 25: 2482–2498. [DOI] [PubMed] [Google Scholar]
  22. Connallon, T. and Clark A. G.. 2014. Balancing Selection in Species with Separate Sexes: Insights from Fisher.s Geometric Model. Genetics 197: 991–1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Coyne, J. A. , and Orr H. A.. 1989. Two rules of speciation Pp. 180–207 in Otte D. and Endler J. A., eds. Speciation and its consequences. Sinauer Associates, Sunderland, MA. [Google Scholar]
  24. Coyne, J. A. , and Orr H. A. 2004. Speciation. Vol. 37. Sinauer Associates, Sunderland, MA. [Google Scholar]
  25. Crow, J. F. 1952. Dominance and overdominance In Heterosis, Gowen J. W. ed. Iowa State College Press, Ames, Iowa. [Google Scholar]
  26. Davis, A. W. , and Wu C.‐I.. 1996. The broom of the Sorcerer's apprentice: the fine structure of a chromosomal region causing reproductive isolation between two sibling species of Drosophila . Genetics 143: 1287–1298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Demuth, J. P. , and Wade M. J.. 2005. On the theoretical and empirical framework for studying genetic interactions within and among species. Am. Nat. 165: 524–536. [DOI] [PubMed] [Google Scholar]
  28. Dobzhansky, T. G. 1937. Genetics and the origin of species Columbia Univ. Press, New York. [Google Scholar]
  29. Duranton, M. , Allal F., Fraïsse C., Bierne N., Bonhomme F., and Gagnaire P.‐A.. 2018. The origin and remolding of genomic islands of differentiation in the European Sea Bass Nature Communications 9, no. 1. 10.1038/s41467-018-04963-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Duranton, M. , Allal F., Fraïsse C., Bierne N., Bonhomme F., and Gagnaire P.‐A.. 2017. The origin and remolding of genomic islands of differentiation in the European sea bass. bioRxiv 10.1101/223750. [DOI] [PMC free article] [PubMed]
  31. Edmands, S. 2002. Does Parental Divergence Predict Reproductive Compatibility? Trends Ecol. Evol. 17: 520–527. [Google Scholar]
  32. Falush, D. , Stephens M., and Pritchard J. K.. 2003. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164:1567–1587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Fisher, R. A. 1930. The genetical theory of natural selection. Clarendon Press, Oxford. [Google Scholar]
  34. Fitzpatrick, B. M . 2008. Hybrid dysfunction: population genetic and quantitative genetic perspectives. Am. Nat. 171: 491–498. [DOI] [PubMed] [Google Scholar]
  35. Fitzpatrick, B. M 2012. Estimating ancestry and heterozygosity of hybrids using molecular markers. BMC Evol. Biol. 12: 131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Fraïsse, C. , Elderfield J. A. D., and Welch J. J.. 2014. The Genetics of Speciation: Are Complex Incompatibilities Easier to Evolve? J. Evol. Biol. 27: 688–699. [DOI] [PubMed] [Google Scholar]
  37. Fraïsse, C. , Belkhir K., Welch J. J., and Bierne N.. 2016a. Local interspecies introgression is the main cause of extreme levels of intraspecific differentiation in mussels. Mol. Ecol. 25:269–286. [DOI] [PubMed] [Google Scholar]
  38. Fraïsse, C. , Gunnarsson P. A., Roze D., Bierne N., and Welch J. J.. 2016b. The genetics of speciation: insights from Fisher's geometric model. Evolution 70:1450–1464. [DOI] [PubMed] [Google Scholar]
  39. Gavrilets, S. 2004. Fitness landscapes and the origin of species. Princeton Univ. Press, Princeton. [Google Scholar]
  40. Gompert, Z. , and Buerkle C. A.. 2010. Introgress: a software package for mapping components of isolation in hybrids. Mol. Ecol. Resour. 10: 378–384. [DOI] [PubMed] [Google Scholar]
  41. Gramates, L. S. , Marygold S. J., Santos G. dos, Urbano J.‐M., Antonazzo G., Matthews B. B., Rey A. J., Tabone C. J., Crosby M. A., Emmert D. B., et al. 2017. FlyBase at 25: looking to the future. Nucleic Acids Res. 45: D663–D671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Guerrero, R. F. , Muir C. D., Josway S., and Moyle L. C.. 2017. Pervasive antagonistic interactions among hybrid incompatibility loci. PLoS Genet. 13: e1006817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Haldane, J. B. S . 1922. Sex ratio and unisexual sterility in hybrid animals. J. Genet. 12: 101–109. [Google Scholar]
  44. Hallauer, A. R. , Carena M. J., and Mirandad Filho J. B.. 2010. Quantitative genetics in maize breeding, Handbook of plant breeding. Vol. 6. Springer, Berlin. [Google Scholar]
  45. Hinze, L. L. , and Lamkey K. R.. 2003. Absence of epistasis for grain yield in elite maize hybrids. Crop Sci. 43: 46–56. [Google Scholar]
  46. Hogan, T. W. , and Fontana P. G.. 1973. Restoration of meiotic stability following artificial hybridisation and selection in Teleogryllus (Orth., Gryllidae). Bull. Entomol. Res. 62: 557–563. [Google Scholar]
  47. Hoy, R. , Hahn J., and Paul R.. 1977. Hybrid cricket auditory behavior: evidence for genetic coupling in animal communication. Science 195: 82. [DOI] [PubMed] [Google Scholar]
  48. Hwang, S. , Park S.‐C., and Krug J.. 2017. Genotypic complexity of Fisher's geometric model. Genetics 206: 1049–1079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kalirad, A. , and Azevedo R. B. R.. 2017. Spiraling complexity: a test of the snowball effect in a computational model of RNA folding. Genetics 206: 377–388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kiesselbach, T. (1922). Corn investigations. Neb. Agric. Exp. Stn. Bull. 20:5–151. [Google Scholar]
  51. Kiesselbach, T . 1930. The use of advanced generation hybrids as parents of double‐cross seed corn. J. Am. Soc. Agron. 22: 614–625. [Google Scholar]
  52. Kinman, M. L , and Sprague G. F.. 1945. Relation between number of parental lines and theoretical performance of synthetic varieties of corn. J. Am. Soc. Agron. 37: 341–351. [Google Scholar]
  53. Li, X. , Wang X., Wei Y., and Brummer E. C.. 2011. Prevalence of segregation distortion in diploid alfalfa and its implications for genetics and breeding applications. Theor. Appl. Genet. 123: 667–679. [DOI] [PubMed] [Google Scholar]
  54. Lindstrom, E. W. 1941. Analysis of modern maize breeding principles and methods Pp. 667–679 in Proceedings of the 7th International Genetics Congress, Edinburgh, Scotland .
  55. Lindtke, D. , Gompert Z., Lexer C., and Buerkle C. A.. 2014. Unexpected ancestry of Populus seedlings from a hybrid zone implies a large role for postzygotic selection in the maintenance of species. Mol. Ecol. 23: 4316–4330. [DOI] [PubMed] [Google Scholar]
  56. Lindtke, D. , Buerkle C. A., Barbará T., Heinze B., Castiglione S., Bartha D., and Lexer C.. 2012. Recombinant hybrids retain heterozygosity at many loci: new insights into the genomics of reproductive isolation in Populus . Mol. Ecol. 21: 5042–5058. [DOI] [PubMed] [Google Scholar]
  57. Lynch, M . 1991. The genetic interpretation of inbreeding depression and outbreeding depression. Evolution 45: 622–629. [DOI] [PubMed] [Google Scholar]
  58. Macdonald, S. J. , and Goldstein D. B.. 1999. A quantitative genetic analysis of male sexual traits distinguishing the sibling species Drosophila simulans and D. sechellia . Genetics 153: 1683–1699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Macholán, M. , Baird S. J. E., Dufková P., Munclinger P., Bímová B. V., and Piálek J.. 2011. Assessing multilocus introgression patterns: a case study on the mouse X chromosome in central Europe: heterogeneity of introgression on the mouse X chromosome. Evolution 65: 1428–1446. [DOI] [PubMed] [Google Scholar]
  60. Mank, J. E. , Hosken D. J., and Wedell N.. 2011. Some Inconvenient Truths about Sex Chromosome Dosage Compensation and the Potential Role of Sexual Conflict. Evolution 65: 2133–2144. [DOI] [PubMed] [Google Scholar]
  61. Manna, F. , Martin G., and Lenormand T.. 2011. Fitness Landscapes: An Alternative Theory for the Dominance of Mutation. Genetics 189: 923–937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Mani, G. S. , and Clarke B. C.. 1990. Mutational order: a major stochastic process in evolution. Proc. R. Soc. Lond. B Biol. Sci. 240: 29–37. [DOI] [PubMed] [Google Scholar]
  63. Martin, G . 2014. Fisher's geometrical model emerges as a property of complex integrated phenotypic networks. Genetics 197: 237–255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Martin, G. , and Lenormand T.. 2006. A general multivariate extension of Fisher's geometrical model and the distribution of mutation fitness effects across species. Evolution 60: 893–907. [PubMed] [Google Scholar]
  65. Martin, J. M. , and Hallauer A. R.. 1976. Relation between heterozygosis and yield for four types of maize inbred lines. Egypt. J. Genet. Cytol. 5: 119–135. [Google Scholar]
  66. Maside, X. R. , and Naveira H. F.. 1996. On the difficulties of discriminating between major and minor hybrid male sterility factors in Drosophila by examining the segregation ratio of sterile and fertile sons in backcrossing experiments. Heredity 77: 433–438. [DOI] [PubMed] [Google Scholar]
  67. Matute, D. R. , Butler I. A., Turissini D. A., and Coyne J. A.. 2010. A test of the snowball theory for the rate of evolution of hybrid incompatibilities. Science 329: 1518–1521. [DOI] [PubMed] [Google Scholar]
  68. McFadden, D. 1974. Conditional logit analysis of qualitative choice behavior Pp. 105–142 in Zarembka P. ed. Frontiers in econometrics. Academic Press, Cambridge, Massachusetts. [Google Scholar]
  69. Melchinger, A. E . 1987. Expectation of means and variances of testcrosses produced from F2 and backcross individuals and their selfed progenies. Heredity 59: 105–115. [Google Scholar]
  70. Mendez, F. L. , Watkins J. C., and Hammer M. F.. 2012. A haplotype at STAT2 introgressed from neanderthals and serves as a candidate of positive selection in papua New Guinea. Am. J. Hum. Genet. 91: 265–274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Moehring, A. J. , Llopart A., Elwyn S., Coyne J. A., and Mackay T. F. C.. 2006a. The genetic basis of postzygotic reproductive isolation between Drosophila santomea and D. yakuba due to hybrid male sterility. Genetics 173: 225–233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Moehring, A. J. , Llopart A., Elwyn S., Coyne J. A., and Mackay T. F. C. 2006b. The genetic basis of prezygotic reproductive isolation between Drosophila santomea and D. yakuba due to mating preference. Genetics 173: 215–223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Moehring, A. J . 2011. Heterozygosity and its unexpected correlations with hybrid sterility. Evolution 65: 2621–2630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Moll, R. H. , Lonnquist J. H., Fortuno J. V., and Johnson E. C.. 1965. The relationship of heterosis and genetic divergence in maize. Genetics 52: 139–144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Moran, P. A. , Ritchie M. G., and Bailey N. W.. 2017. A rare exception to Haldanes rule: are X chromosomes key to hybrid incompatibilities? Heredity 118:554–562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Morán, T. , and Fontdevila A.. 2014. Genome‐wide dissection of hybrid sterility in drosophila confirms a polygenic threshold architecture. J. Hered. 105: 381–396. [DOI] [PubMed] [Google Scholar]
  77. Moyle, L. C. , and Nakazato T.. 2010. Hybrid incompatibility “Snowballs” between solanum species. Science 329: 1521–1523. [DOI] [PubMed] [Google Scholar]
  78. Neal, N. P . 1935. Decrease in yielding capacity in advanced generations of hybrid corn. J. Am. Soc. Agron. 51: 666–670. [Google Scholar]
  79. Newcomer, J. T. , Neerchal N. K., and Morel J. G.. 2008. Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models. Dep. Math. Stat. Univ. Md. Baltim. USA.
  80. Noor, M. A. F. , Grams K. L., Bertucci L. A., Almendarez Y., Reiland J., and Smith K. R.. 2001. The genetics of reproductive isolation and the potential for gene exchange between Drosophila pseudoobscura and D. persimilis via backcross hybrid males. Evolution 55: 512–521. [DOI] [PubMed] [Google Scholar]
  81. Orgogozo, V. , Broman K. W., and Stern D. L.. 2006. High‐resolution quantitative trait locus mapping reveals sign epistasis controlling ovariole number between two Drosophila species. Genetics 173: 197–205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Orr, H. A . 1995. The population genetics of speciation: the evolution of hybrid incompatibilities. Genetics 139: 1805–1813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Orr, H. A 1998. The population genetics of adaptation: the distribution of factors fixed during adaptive evolution. Evolution 52: 935–949. [DOI] [PubMed] [Google Scholar]
  84. Pollak, E. , Robinson H. F., and Comstock R. E.. 1957. Inter‐population hybrids in open‐pollinated varieties of maize. Am. Nat. 91: 387–391. [Google Scholar]
  85. Quinlan, A. R. , and Hall I. M.. 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26: 841–842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. R Core Team . 2016. R: a language and environment for statistical computing. R found. Stat. Comput. Vienna Austria https://www.R-project.org.
  87. Revuz, D. , and Yor M.. 1999. Continuous martingales and Brownian motion. 3rd ed. Springer‐Verlag, New York. [Google Scholar]
  88. Richey, F. D. , and Sprague G. F.. 1931. Experiments on hybrid vigor and convergent improvement in corn. 267. U.S. Department of Agriculture, Washington, D.C., pp. 1–2 2. [Google Scholar]
  89. Rockman, M. V . 2012. The QTN program and the alleles that matter for evolution: All that's gold does not glitter. Evolution 66: 1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Rosas, U. , Barton N. H., Copsey L., de Reuille P. Barbier, and Coen E.. 2010. Cryptic variation between species and the basis of hybrid performance. PLoS Biol. 8: e1000429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Routtu, J. , Hall M. D., Albere B., Beisel C., Bergeron R., Chaturvedi A., Choi J.‐H., Colbourne J., De Meester L., Stephens M. T., et al. 2014. An SNP‐based second‐generation genetic map of Daphnia magna and its application to QTL analysis of phenotypic traits. BMC Genomics 15: 1033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Roux, C. , Fraïsse C., Romiguier J., Anciaux Y., Galtier N., and Bierne N.. 2016. Shedding light on the grey zone of speciation along a continuum of genomic divergence. PLoS Biol. 14: e2000234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Schiffman, J. S. , and Ralph P. L.. 2017. System drift and speciation. bioRxiv 10.1101/231209. [DOI] [PMC free article] [PubMed]
  94. Schumer, M. , Xu C., Powell D. L., Durvasula A., Skov L., Holland C., Blazier J. C., Sankararaman S., Andolfatto P., Rosenthal G. G., et al. 2018. Natural selection interacts with recombination to shape the evolution of hybrid genomes. Science 360: 656–660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Semagn, K. , Babu R., Hearne S., and Olsen M.. 2014. Single nucleotide polymorphism genotyping using Kompetitive Allele Specific PCR (KASP): overview of the technology and its application in crop improvement. Mol. Breed. 33: 1–14. [Google Scholar]
  96. Sentz, J. C. , Robinson H. F., and Comstock R. E.. 1954. Relation between heterozygosis and performance in maize. Agron. J. 46: 514–520. [Google Scholar]
  97. Shehata, A. H. , and Dhawan N. L.. 1975. Genetic analysis of grain yield of maize as manifested in diverse varietal populations and their crosses. Egypt. J. Genet. Cytol. 4: 90–116. [Google Scholar]
  98. Staubach, F. , Lorenc A., Messer P. W., Tang K., Petrov D. A., and D. Tautz . 2012. Genome patterns of selection and introgression of haplotypes in natural populations of the house mouse (Mus Musculus). PLoS Genetics 8: e1002891. 10.1371/journal.pgen.1002891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Stringfield, G . 1950. Heterozygosis and hybrid vigor in maize. Agron. J. 42: 45–112. [Google Scholar]
  100. Tenaillon, O. , Silander O. K., Uzan J.‐P., and Chao L.. 2007. Quantifying organismal complexity using a population genetic approach. PLoS ONE 2: e217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Turelli, M. , and Moyle L. C.. 2007. Asymmetric postmating isolation: Darwin's corollary to haldane's rule. Genetics 176: 1059–1088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Turelli, M. , and Orr H. A.. 2000. Dominance, epistasis and the genetics of postzygotic isolation. Genetics 154: 1663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Turner, L. M. , and Harr B.. 2014. Genome‐wide mapping in a house mouse hybrid zone reveals hybrid sterility loci and Dobzhansky‐Muller interactions. Elife 3: e02504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Wang, R. J. , White M. A., and Payseur B. A.. 2015. The pace of hybrid incompatibility evolution in house mice. Genetics 201: 229–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Waser, N. M. 1993. Population structure, optimal outbreeding and assortative mating in angiosperms Pp. 173–199 in Thornhill N. W., ed. The natural history of inbreeding and outbreeding: Theoretical and empirical perspectives. Chicago Univ. Press, Chicago. [Google Scholar]
  106. Waxman, D. , and Welch J. J.. 2005. Fisher's microscope and haldane's ellipse. Am. Nat. 166: 447–457. [DOI] [PubMed] [Google Scholar]
  107. Welch, J. J . 2004. Accumulating dobzhansky‐muller incompatibilities: reconciling theory and data. Evolution 58: 1145–1156. [DOI] [PubMed] [Google Scholar]
  108. Welch, J. J. , and Waxman D.. 2003. Modularity and the cost of complexity. Evolution 57: 1723–1734. [DOI] [PubMed] [Google Scholar]
  109. White, M. A. , Steffy B., Wiltshire T., and Payseur B. A.. 2011. Genetic dissection of a key reproductive barrier between nascent species of house mice. Genetics 189: 289–304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Wright, S . 1922. The effects of inbreeding and crossbreeding on guinea pigs : III. crosses between highly inbred families. U. S. Dep. Agric. Bull. 1121. [Google Scholar]
  111. Wright, S. 1977. Inbreeding depression and heterosis: plants In Evolution and the genetics of populations, volume 3: Experimental results and evolutionary deductions, vol. 3. Chicago Univ. Press, Chicago. [Google Scholar]
  112. Wu, C.‐I. and Davis A. W.. 1993. Evolution of Postmating Reproductive Isolation: The Composite Nature of Haldane's Rule and Its Genetic Bases. Am. Nat. 142: 187–212. [DOI] [PubMed] [Google Scholar]
  113. Xu, M. , and He X.. 2011. Genetic incompatibility dampens hybrid fertility more than hybrid viability: yeast as a case study. PLoS ONE 6:e18341. [DOI] [PMC free article] [PubMed]
  114. Yang, H. , Ding Y., Hutchins L. N., Szatkiewicz J., Bell T. A., Paigen B. J., Graber J. H., de Villena F. P.‐M., and Churchill G. A.. 2009. A customized and versatile high‐density genotyping array for the mouse Nature Methods 6: 663–666. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1: A schematic representation of a phenotype evolving over time in two populations, labeled P1 and P2, starting from their most recent common ancestor (MRCA).

Figure S2: The breakdown associated with haploid hybrid genotypes, under simple models of phenotypic divergence.

Figure S3: The breakdown associated with haploid hybrid genotypes, under explicit population genetic simulations of phenotypic divergence.

Figure S4: The breakdown associated with haploid hybrid genotypes, under explicit population genetic simulations of phenotypic divergence, in scenarios involving discrete jumps in the optimal value for one of n = 2 traits.

Figure S5: The effects of an incompatibility on hybrid breakdown score, sijk (eq. 41), when the incompatibility appears in a genotype comprising i loci that are homozygous for alleles from one parental species, j loci that are homozygous for alleles from the other parental species, and k loci that are heterozygous.

Figure S6: Genotype plot for the raw Mytilus data.

Figure S7: Plots of the Drosophila male backcross data reanalyzed here (see Table 1 and Moehring 2011).

Figure S8: Predictions of Fisher's geometric model for heterogametic male hybrids.

Figure S9: Estimation of the fitness surface for interspecific hybrids from plants (Table 1), namely wild hybrids of the genus Populus (row (a); Christe et al. 2016), and an F2 cross of the genus Senecio (row (b); Chapman et al. 2016).

Figure S10: Estimation of the fitness surface for subspecific hybrids from Mus musculus (Table 1, White et al. 2011; Turner and Harr 2014).

Figure S11: Estimation of the fitness surface for backcross male hybrids from Drosophila species pairs (Table 1, Macdonald and Goldstein 1999; Moehring et al. 2006a, b).

Table S1: The contribution of parental maladaptation to hybrid breakdown.

Table S2: Checks for appropriateness for genomic data sets.

Table S3: The reference populations for Mytilus crosses.

Table S4: Expected breakdown scores for homogametic female hybrids with paternal X silencing.

Table S5: Inferring the hybrid fitness surface from genomic data sets.

Table S6: The significance of individual regression coefficients.

Table S7: Information on the 98 markers used for the Mytilus genotyping.

Data Availability Statement

The newly generated data on Mytilus and collated data are available as a Dryad package (doi:xxx).


Articles from Evolution Letters are provided here courtesy of Oxford University Press

RESOURCES