Skip to main content
Genetics logoLink to Genetics
. 2017 Mar 20;206(1):377–388. doi: 10.1534/genetics.116.196030

Spiraling Complexity: A Test of the Snowball Effect in a Computational Model of RNA Folding

Ata Kalirad 1, Ricardo B R Azevedo 1,1
PMCID: PMC5419482  PMID: 28007889

The snowball hypothesis suggests that the number of genetic incompatibilities between two populations will accumulate faster than linearly as they diverge...

Keywords: speciation, hybrid incompatibility, missing snowball, intrinsic postzygotic isolation, high-order epistasis

Abstract

Genetic incompatibilities can emerge as a byproduct of genetic divergence. According to Dobzhansky and Muller, an allele that fixes in one population may be incompatible with an allele at a different locus in another population when the two alleles are brought together in hybrids. Orr showed that the number of Dobzhansky-Muller incompatibilities (DMIs) should accumulate faster than linearly—i.e., snowball—as two lineages diverge. Several studies have attempted to test the snowball effect using data from natural populations. One limitation of these studies is that they have focused on predictions of the Orr model, but not on its underlying assumptions. Here, we use a computational model of RNA folding to test both predictions and assumptions of the Orr model. Two populations are allowed to evolve in allopatry on a holey fitness landscape. We find that the number of inviable introgressions (an indicator for the number of DMIs) snowballs, but does so more slowly than expected. We show that this pattern is explained, in part, by the fact that DMIs can disappear after they have arisen, contrary to the assumptions of the Orr model. This occurs because DMIs become progressively more complex (i.e., involve alleles at more loci) as a result of later substitutions. We also find that most DMIs involve >2 loci, i.e., they are complex. Reproductive isolation does not snowball because DMIs do not act independently of each other. We conclude that the RNA model supports the central prediction of the Orr model that the number of DMIs snowballs, but challenges other predictions, as well as some of its underlying assumptions.


“[It is not] surprising that the facility of effecting a first cross, the fertility of the hybrids produced, and the capacity of being grafted together … should all run, to a certain extent, parallel with the systematic affinity of the forms which are subjected to experiment …” Darwin (1859)

IN the absence of gene flow, the gradual accumulation of divergent genetically based characteristics in different populations can bring new species into being. Some of these divergent characteristics, known as reproductive isolating barriers (Johnson 2006), decrease the level of interbreeding between populations. As populations diverge, isolating barriers accumulate, and the level of reproductive isolation (RI) among populations increases (Coyne and Orr 1989; Roberts and Cohan 1993; Sasa et al. 1998; Edmands 2002; Presgraves 2002; Dettman et al. 2003; Lijtmaer et al. 2003; Mendelson 2003; Moyle et al. 2004; Bolnick and Near 2005; Liti et al. 2006; Scopece et al. 2007; Stelkens et al. 2010; Giraud and Gourbière 2012; Jewell et al. 2012; Larcombe et al. 2015). Eventually, RI reaches a point where two of these populations are considered distinct species. Elucidating the precise nature of the relationship between genetic divergence and RI remains one of the central challenges in the study of speciation (Gavrilets 2004; Nosil and Feder 2012; The Marie Curie SPECIATION Network 2012; Seehausen et al. 2014).

Dobzhansky (1937) and Muller (1942) proposed a general mechanism through which genetic divergence can cause RI. They noted that, in the absence of gene flow between two populations, an allele that fixes in one population may be incompatible with an allele at a different locus in another population when the two alleles are brought together in hybrids. This negative epistasis, or genetic incompatibility, causes the two populations to become reproductively isolated. Dobzhansky-Muller incompatibilities (DMIs) have been shown to cause inviability or sterility in hybrids between closely related species (reviewed in Presgraves 2010b; Rieseberg and Blackman 2010; Maheshwari and Barbash 2011).

Orr (1995) modeled the accumulation of multiple DMIs as populations diverge. Consider two populations diverged at k loci and showing Ik simple DMIs. A simple DMI is defined as a negative epistatic interaction between an allele at one locus in one population and an allele at a different locus in the other population. Orr showed that when the next substitution takes place, the expected number of simple DMIs is

Ik+1=Ik+pk, (1)

where p is the probability that there is a simple DMI between the latest derived allele and one of the k alleles at the loci that have previously undergone substitutions (from the population that did not undergo the latest substitution). Assuming I1=0, the solution to difference Equation 1 is

Ik=pk(k1)2. (2)

Equation 2 predicts that the number of simple DMIs will accumulate faster than linearly as a function of divergence (prediction #1; Orr 1995). This prediction assumes that p remains constant as populations diverge (assumption #1).

DMIs involving n3 loci, known as complex DMIs (Cabot et al. 1994), are also expected to snowball, but following different relationships from that in Equation 2: DMIs of order n are expected to accumulate at a rate approximately proportional to kn (prediction #2; Orr 1995; Welch 2004). If DMIs have small, independent effects on RI (assumptions #2 and #3, respectively), then the postzygotic RI they generate is also expected to increase faster than linearly with k (prediction #3; Orr 1995). Orr (1995) described this pattern of quantities increasing faster than linearly as “snowballing.” We shall refer to predictions #1–3 of the Orr model collectively as the “snowball effect” (Orr and Turelli 2001).

Several studies have attempted to test the snowball effect. They have employed three different approaches. The first tests prediction #3 of the Orr model: that postzygotic RI snowballs. For example, Larcombe et al. (2015) measured the strength of hybrid incompatibility between Eucalyptus globulus and 64 species of eucalypts. They observed a faster than linear increase in RI with genetic distance, consistent with prediction #2 of the Orr model. Results from other studies using a similar approach have provided little support for a snowball effect in RI (Sasa et al. 1998; Lijtmaer et al. 2003; Mendelson et al. 2004; Bolnick and Near 2005; Gourbière and Mallet 2010; Stelkens et al. 2010; Giraud and Gourbière 2012), leading some to pronounce the snowball “missing” (Johnson 2006; Gourbière and Mallet 2010). However, this approach has several limitations. It can be applied only when postzygotic RI1. Furthermore, it only tests one prediction (#3) of the Orr model and this prediction relies on one assumption (#3) that typically goes untested. Thus, the number of DMIs might snowball (predictions #1–2) even if RI does not.

The second approach tests predictions #1–2 of the Orr model: that the number of DMIs snowballs. For example, Moyle and Nakazato (2010) used a QTL mapping approach to estimate the number of DMIs between species of Solanum directly. They introgressed one or a few genomic segments from one species to another. When an introgressed segment caused a reduction in fitness, they concluded that it participated in a DMI. They found that introgressions causing seed sterility accumulated faster than linearly. However, introgressions causing pollen sterility appeared to accumulate linearly, contrary to the snowball effect. Studies following similar approaches have tended to find support for the snowball effect (Matute et al. 2010; Moyle and Nakazato 2010; Matute and Gavin-Smyth 2014; Sherman et al. 2014; Wang et al. 2015).

One advantage of this approach over the first is that it relies on fewer assumptions (#1 compared to #1–3, respectively). However, the second approach also has limitations. The order (n) of the DMIs identified is unknown. Therefore, this approach cannot disentangle predictions #1 and #2. Another limitation of these studies is that they are likely to underestimate the true number of DMIs for two reasons. First, the introgressed genomic segments typically contain many genetic differences. For example, the individual segments introgressed in Moyle and Nakazato (2010) included ∼2–4% of the genome, and likely contained hundreds of genes. Second, individual alleles might participate in multiple DMIs, especially if complex DMIs are common (Guerrero et al. 2016).

The third approach tests prediction #1 of the Orr model: that the number of simple DMIs snowballs. Consider two species, 1 and 2, diverged at k loci. If an allele, X2, at one of these loci (X) is known to be deleterious in species 1 but is fixed in species 2, then species 2 must carry compensatory alleles at one or more loci (Y2,Z2,) that are not present in species 1 (which carries alleles Y1,Z1, at those loci). In other words, there must be a DMI involving the X2 and Y1,Z1, alleles.

Following Welch (2004), we define P1 as the proportion of the k fixed differences between the species where the allele from one species is deleterious in the other species. For example, Kachroo et al. (2015) replaced 414 essential genes of the yeast Saccharomyces cerevisiae with their human orthologs. Over half of the human genes (P1=57%) could not functionally replace their yeast counterparts.

Welch (2004) has argued that estimates of P1 can be used to test the Orr model if two additional conditions are met. If each allele participates in at most one DMI, then we have P1=Ik/k. If, in addition, P1 is entirely based on simple DMIs, then it is expected to increase linearly with genetic distance according to the Orr model (Equation 2).

P1=p(k1)2. (3)

P1 can be estimated without studying hybrids directly. Kondrashov et al. (2002) and Kulathinal et al. (2004) estimated P1 in mammals and insects, respectively. Surprisingly, both studies reported that P110% and is constant over broad ranges of genetic distances (e.g., human compared to either nonhuman primates or fishes; Kondrashov et al. 2002). These results are inconsistent with prediction #1 of the Orr model (Welch 2004; Fraïsse et al. 2016). The results of the second and third approaches give inconsistent results, a paradox first noted by Welch (2004). However, the third approach is less direct because it relies on two additional assumptions that have not been tested.

One common limitation to all approaches is that they focus on testing predictions of the Orr model, without testing its assumptions (e.g., assumption #1, constant p). Here we use a computational model of RNA folding (Schuster et al. 1994; Lorenz et al. 2011) to test both predictions and assumptions of the Orr model. The RNA folding model makes satisfactory predictions of the secondary structures of real RNA molecules (Mathews et al. 1999; Doshi et al. 2004; Lorenz et al. 2011) and has been used to study other evolutionary consequences of epistasis, including robustness (van Nimwegen et al. 1999; Ancel and Fontana 2000), evolvability (Wagner 2008; Draghi et al. 2010), and the rate of neutral substitution (Draghi et al. 2011). We model populations evolving in allopatry on a holey fitness landscape (Gavrilets 2004). In his original model, Orr (1995) made no assumptions on either the evolutionary causes of genetic divergence, or the molecular basis of the DMIs arising from that divergence. Thus, Orr’s predictions should be met in our RNA “world.” Our results provide mixed support for the Orr model.

Materials and Methods

Genotype and phenotype

The genotype is an RNA sequence. Unless otherwise stated we used sequences with a length of 100 nt. The phenotype is the minimum free-energy secondary structure of the sequence computed using the ViennaRNA package version 2.1.9 (Lorenz et al. 2011) with default parameters.

Fitness

The fitness of RNA sequence i is determined using the function

wi={1ifβi>αandδiα0otherwise (4)

where βi is the number of base pairs in the secondary structure of sequence i, δi is the base-pair distance between the structure of sequence i and the reference structure, and α is an arbitrary threshold. Unless otherwise stated, we used α=12. The fitness function in Equation 4 specifies a neutral network (Schuster et al. 1994; van Nimwegen et al. 1999) or holey fitness landscape (Gavrilets 2004) (Figure 1).

Figure 1.

Figure 1

Evolution on a holey fitness landscape. Mutational network of RNA sequences. Lines connect sequences of 20 nt that can be reached by a single nucleotide substitution. Only a tiny fraction of the entire mutational network of 1012 sequences is shown. Furthermore, only a few of the 60 mutational neighbors of each sequence are shown. A sequence is viable (yellow, blue, or gray circles) if its secondary structure both has > 2 bp and is at most 2 bp away from the reference structure (thick yellow circle); a sequence is inviable otherwise (red circles) (Equation 4). Each simulation starts with a burn-in period where a sequence with the reference structure undergoes three neutral substitutions (thick dashed blue lines). After that, the resulting sequence is used as the ancestor of two lineages that alternately accumulate neutral substitutions until they have diverged at k=8 sites (thick solid blue lines).

Evolution

Burn-in period:

We begin by picking a random viable RNA sequence, define its secondary structure as the reference, and allow the sequence to accumulate 200 random neutral substitutions sequentially, allowing multiple hits. The resulting sequence is used as the ancestor. Supplemental Material, Table S1 shows summary statistics of the ancestral sequences with α=12.

The burn-in period is necessary because the initial sequence is not representative for the fitness landscape. For example, it has the reference structure (i.e., δi=0 bp), whereas most sequences in the fitness landscape are δiα bp away from the reference structure (Table S1).

Divergence:

The ancestor is used to found two identical haploid lineages. The lineages evolve by alternately accumulating a series of neutral substitutions without gene flow (allopatry) until they differ at k=40 sites. At a given step, one of the evolving sequences is subjected to a random mutation. If the mutation is neutral, it is allowed to substitute the ancestral nucleotide; if it is deleterious, it is discarded and a new random mutation is tried. The process is repeated until a neutral mutation is found. At the next step, the other evolving lineage is subjected to the same process.

At each step, the only sites that are allowed to mutate are those that have not yet undergone a substitution in either lineage since the lineages have started to diverge from their common ancestor. This constraint implies that substitutions are irreversible and that ≤2 nt are observed at each site during the course of evolution, in agreement with the assumptions of the Orr (1995) model. Relaxing this assumption had no effect on the main results (Figure S1 and Table S2). All types of base-substitution mutations have equal probability. Insertions and deletions are not considered.

Inviable introgressions

Two viable sequences, 1 and 2, differ at k sites. To detect DMIs of increasing complexity, we conduct introgressions of one, two, or three diverged nucleotides from one sequence to another.

Single introgressions:

We introgress individual nucleotides at each of the k divergent sites from sequence 1 to sequence 2 and count the number of inviable introgressions, Jk(1). We repeat the procedure in the opposite direction (sequence 21) and calculate the average of the resulting Jk(1) values. The proportion of single introgressions (in one direction) involved in a DMI is given by P1=Jk(1)/k (Welch 2004).

Double introgressions:

We introgress the i(i1)/2 pairs of nucleotides from sequence 1 to sequence 2, where i=kJk(1) is the number of divergent sites that are not involved in inviable single introgressions in the 12 direction. We count the number of inviable double introgressions, Jk(2). We repeat the procedure in the opposite direction (21) and calculate the average of the resulting Jk(2) values.

Triple introgressions:

We introgress all triples of divergent nucleotides from sequence 1 to sequence 2 that contain neither nucleotides involved in inviable single introgressions in the 12 direction, nor pairs of nucleotides involved in inviable double introgressions in the 12 direction. We count the number of inviable triple introgressions, Jk(3). We repeat the procedure in the opposite direction (21) and calculate the average of the resulting Jk(3) values.

Potential DMIs

The simple DMIs that might, potentially, affect a sequence can be computed exhaustively by measuring the fitness of all possible single and double mutants derived from the sequence. A potential simple DMI is defined as an inviable double mutant between mutations that are individually neutral.

Direct estimation of p:

The number of potential simple DMIs, I, for a sequence allows us to estimate the value of the parameter in the Orr model (Equation 1) as

p=2Ij(j1), (5)

where j is the number of neutral mutations tested to estimate I.

DMI network:

We summarize the pattern of interactions between sites using an undirected network where the vertices are sites and the edges represent the existence of at least one potential simple DMI between them (for every pair of sites, there are nine combinations of double mutants). The resulting network is an example of the networks of interactions described by Orr and Turelli (2001) and Livingstone et al. (2012).

Reproductive isolation:

The degree of RI between two sequences is defined as

RI=1w¯R,

where w¯R is the mean fitness (Equation 4) of all possible 198 recombinants resulting from a single crossover between the sequences.

“Holeyness” of the fitness landscape:

For each simulation, we took the ancestor and each of the k=40 genotypes generated during the course of evolution, and measured the proportion of their single mutant neighbors (300 per sequence) that are inviable, excluding the 41 original sequences. This proportion estimates the local holeyness of the fitness landscape traversed by the diverging lineages.

Statistical analyses

All statistical analyses were conducted with R version 3.3.0 (R Core Team 2016).

Data availability

The software used to run all simulations was written in Python 2.7 and is available at https://github.com/Kalirad/spiraling_complexity (doi: http://doi.org/10.5281/zenodo.400020).

Results

Inviable introgressions snowball in the RNA model, but more slowly than expected

The Orr model predicts that DMIs of order n should accumulate at a rate approximately proportional to kn, where k is the number of substitutions (prediction #2: Orr 1995; Welch 2004). We tested this prediction using 103 evolutionary simulations with the RNA model. For each simulation, we estimated the numbers of inviable single, double, and triple introgressions. We fitted the following model to the average numbers of inviable introgressions at each k

Jk(i)=ai(ki)bi, (6)

where i=1,2 or 3 is the number of introgressed alleles, and ai and bi are parameters; the ki term ensures that Ji(i)=0.

An inviable introgression of i alleles indicates the existence of a DMI of order ni+1. Furthermore, if each inviable introgression is caused by a single DMI, then predictions #1 and #2 of the Orr model lead to the prediction that the exponent bii+1 (e.g., b12 for inviable single introgressions).

Figure 2 and Table 1 show that inviable introgressions accumulated faster than linearly (bi>1) in the RNA model. In addition, the bi exponent increased with the number of introgressed alleles, i, in broad qualitative agreement with predictions #1 and #2 of the Orr model. However, inviable introgressions snowballed much more slowly than expected under the Orr model (bii+1).

Figure 2.

Figure 2

Inviable introgressions snowball in the RNA model, but more slowly than expected. Evolution of the numbers of inviable single, double, and triple introgressions. Values are means of 103 simulations with α=12. Shaded regions indicate 95% CIs.

Table 1. Estimates of the parameters in Equation 6.

Introgressed alleles, i DMI order, n ai (95% CI) bi (95% CI)
1 2 0.098 (0.002) 1.35 (0.005)
2 3 0.023 (0.001) 1.82 (0.016)
3 4 0.0036 (0.0004) 2.44 (0.033)

The model was fitted by nonlinear least-squares regression to the average numbers of inviable single, double, and triple introgressions shown in Figure 2.

Some inviable introgressions are caused by multiple DMIs in the RNA model

What explains this mismatch between the RNA data and the Orr model? One possibility is that some inviable introgressions are caused by multiple DMIs (Guerrero et al. 2016). For example, imagine two genotypes, 1 and 2, divergent at multiple loci A,B,C, Genotype i has alleles Ai,Bi,Ci, If there are two simple DMIs between the two genotypes, A1/B2 and A1/C2, then single introgressions from genotype 1 to genotype 2 would only detect one inviable single introgression.

If we assume that all inviable single introgressions are caused by simple DMIs, then following the Orr (1995) model, the expected number of derived inviable introgressions after k+1 substitutions is given by

Jk+1=Jk+p(kJk). (7)

For simplicity, we refer to Jk(1) as Jk (see File S1 for more details). Assuming J1=0, the solution to difference Equation 7 is

Jk=k1(1p)kp. (8)

If p is low then most inviable single introgressions are caused by single DMIs: JkIk (Figure S2). As a result, inviable single introgressions snowball, with b12 (Equation 6). As p increases, the number of inviable introgressions caused by multiple DMIs increases and Jk begins to underestimate Ik. In addition, the value of the b1 parameter in Equation 6 declines, slowing down the snowball as it were. For example, b11.92 when p=0.005, and b11.61 when p=0.05 (Figure S2B).

Fitting Equation 8 by nonlinear least-squares regression to the evolutionary response in inviable single introgressions shown in Figure 2 yields an estimate of p=0.0264. If Jk evolved according to Equation 8 with this value of p, then b11.75 in Equation 6 (Figure 3), confirming that some inviable single introgressions are caused by multiple DMIs in the RNA model.

Figure 3.

Figure 3

Inviable single introgressions snowball in the RNA model, but more slowly than expected under the Orr model. The blue line shows the evolution of the number of inviable single introgressions in the RNA model simulations (same data as in Figure 2). The yellow line shows the prediction assuming that all inviable single introgressions are based on simple DMIs evolving according to the Orr model, with constant P=0.0264, obtained by fitting Equation 8 to the RNA data by nonlinear least-squares regression. The red dashed line shows the prediction assuming that all inviable single introgressions are based on simple DMIs accumulating according to the Orr model, with p evolving according to Equation 9 (with p0=0.0431 and c=0.000819 estimated by nonlinear least-squares regression).

However, the lack of agreement between the RNA data and the Orr model is not completely explained by the fact that some inviable introgressions are caused by multiple DMIs for two reasons. First, fitting Equation 6 directly to the same data yields an estimate of b1=1.35 ≪1.75 (Figure 3 and Table 1). Second, the prediction of b11.75 is conservative because it assumes that all inviable single introgressions are caused by simple DMIs. In fact, after k=40 substitutions in the RNA model there were 13.85±0.23 inviable single introgressions, but only 0.600±0.052 simple DMIs (mean and 95% confidence intervals, CIs) (see File S2 for details on how simple DMIs were detected).

The probability that a simple DMI appears is approximately constant in the RNA model

Another possible explanation the lack of agreement between the RNA data and the Orr model is that p itself has evolved, contrary to assumption #1 of the Orr model. To evaluate this possibility, we allowed p in Equation 7 to change linearly with divergence according to the relationship

pk=p0+ck, (9)

and estimated the p0 and c parameters by nonlinear least-squares. We obtained p0=0.0431 and c=0.000819, indicating that the lack of agreement between the RNA data and the Orr model might be explained by a declining p (Figure 3 and Figure S3, red dashed lines). If Jk evolved according to the red dashed line in Figure 3, then b11.39 in Equation 6.

These results provide indirect evidence that assumption #1 of the Orr model is violated by the RNA data. To test the assumption more directly, we measured p in each simulation using Equation 5. We found that p did decline with divergence, but only slightly (Figure S3, black line). The observed decline in p was not sufficient to explain the lack of agreement between the RNA data and the Orr model. If Jk evolves according to Equation 7 with p declining at the same rate, c, as observed in the RNA data (Figure S3, black line) and p0=0.0283 (estimated by nonlinear least-squares regression), then b11.711.35.

Simple DMIs do not persist indefinitely in the RNA model

The analyses summarized in Figure S3 reveal two additional problems. First, fitting the Orr model to the RNA data underestimated actual value of p by approximately threefold. Second, even if p0.0264 (Figure 3), then Equation 2 predicts an accumulation of I4021 simple DMIs. However, we found that in 57% of runs there were no simple DMIs at all after k=40 substitutions. These discrepancies indicate that a more fundamental assumption of the Orr model may be violated in the RNA model: that simple DMIs, once they have arisen, persist indefinitely (assumption #4). This assumption was not stated explicitly by Orr (1995) and has never, to our knowledge, been called into question.

To test assumption #4, we estimated the DMI networks of sequences as they evolved in our RNA model. Figure 4A shows an example of an RNA sequence evolving on a holey fitness landscape. Initially the sequence displays potential simple DMIs between 21 pairs of sites (Figure 4C). Figure 4B illustrates a potential simple DMI between positions 5 and 12. We refer to these simple DMIs as potential because if two diverging lineages each accumulate one of the substitutions underlying one of these DMIs, a simple DMI between the lineages will appear.

Figure 4.

Figure 4

Simple DMIs do not persist indefinitely in the RNA model. (A) The 20 nt long RNA sequence on the left acquires a neutral UA substitution at position 18 (blue). The holey fitness landscape is defined by α=2 (Equation 4). The secondary structure of the sequence on the left is the reference (δi=0 bp). The structure on the right is δi=2 bp away from the reference. (B) There is a potential simple DMI between positions 5 and 12 for the sequence on the left. A double mutant at those positions (5: AG, 12: CG, red) makes the structure inviable (δi=11 bp), even though the single mutations are neutral (data not shown). However, a single substitution causes the potential simple DMI to disappear in the sequence on the right, although the single mutations remain neutral in the new background (data not shown). In other words, the substitution causes the simple DMI to become complex. (C) A single substitution can dramatically rearrange the network of potential DMIs. DMI networks of the sequences in (A). Vertices correspond to positions in the sequences. An edge in the network on the left indicates that there is at least one potential simple DMI between the two sites (positions 4, 13, and 15–17 have no potential DMIs in either network and are not shown). Black edges in the network on the right are shared between the two networks. Blue edges exist only in the network on the right and indicate the appearance of new potential simple DMIs between sites caused by the substitution. Gray and red edges indicate disappearance of potential simple DMIs in the network on the right. Gray edges indicate disappearances due to the constituent alleles no longer being neutral in the new background. Red edges indicate disappearances caused by complexification; the DMI discussed in (B) is an example (5–12 edge).

The Orr model assumes that the DMI network is static: as populations evolve they actualize potential DMIs (for an alternative, but equivalent, interpretation of DMI networks see Livingstone et al. 2012). However, DMI networks are not static in the RNA model. After a single neutral substitution, 13 pairs of sites (62%) lost all potential simple DMIs, and potential DMIs appeared between 18 new pairs of sites (Figure 4C).

The “disappearance” of a potential DMI can occur in one of two ways. First, the substitution may cause the mutations involved in the simple DMIs to become deleterious so that they can no longer participate in potential simple DMIs. A disappearance of this kind means that a potential simple DMI is no longer accessible through independent substitution in two lineages because one of the substitutions cannot take place. Thus, such disappearances do not contradict assumption #4. The majority of disappearances in Figure 4C (gray lines) are of this kind.

The second kind of disappearance occurs when the substitution modifies the interaction between previously incompatible alleles (red lines in Figure 4C). In other words, the simple DMIs become complex. The potential simple DMI between positions 5 and 12 shown in Figure 4B disappears in this way. This kind of disappearance—complexification—implies that some simple DMIs may not persist indefinitely. In other words, assumption #4 of the Orr model is violated in the RNA model. In the next section we explore the consequences of the complexification of simple DMIs for the snowball effect.

The modified Orr model

We incorporate the dynamic nature of simple DMIs by extending the Orr (1995) model in Equation 1

Ik+1=(1q)Ik+pk, (10)

where q is the probability that a simple DMI present after k substitutions becomes complex after the next substitution. Assuming I1=0, the solution to Equation 10 is

Ik=pq2[kq1+(1q)k]. (11)

This prediction assumes that both p and q remain constant as populations diverge.

The original Orr model is a special case of the modified model when q=0. When q>0, Equation 11 has two consequences. First, the accumulation of simple DMIs eventually becomes linear with a slope of ∼p/q when k is sufficiently large. Second, if q is larger, the “linearization” of Equation 11 occurs for lower values of k. Both patterns are illustrated in Figure 5A, which compares the accumulation of simple DMIs under the Orr model with p=0.04 (q=0), and that under the modified Orr model with the same value of p and increasing values of q.

Figure 5.

Figure 5

The modified Orr model. Evolution of the number of simple DMIs according to Equation 11. Responses for p=0.04 and different values of q. The dashed line shows a slope of p/q for q=0.3.

The modified Orr model can resolve the problems stated at the top of the previous section. For example, if p0.0264 (Figure 3) and q=1, the expected number of simple DMIs after k=40 substitutions is 1.0, which is close to the observed value. Thus, the modified Orr model predicts that most simple DMIs become complex within a single substitution.

DMI complexification is pervasive in the RNA model

Is there evidence for rampant DMI complexification in the RNA model as predicted under the modified Orr model? Figure 2 shows that inviable double and triple introgressions, which must be based on complex DMIs of order n3 and n4, respectively, are quite common, as might be expected if more complex inviable introgressions are generated continuously from simpler ones. However, this is not conclusive evidence of complexification (Figure 6B): inviable introgressions of different complexity might simply arise de novo, with more complex ones appearing at a higher rate, as proposed by Orr (1995) (Figure 6A).

Figure 6.

Figure 6

Origins of DMIs of different complexity according to Orr (1995) and this paper. Higher order, n (i.e., number of loci involved in a DMI) corresponds to higher complexity. The numbers show i, the number of alleles introgressed. An inviable introgression of i alleles indicates one or more DMIs of order ni+1. (A) Orr proposed that DMIs of different complexity arise de novo. He also speculated that more complex DMIs arise at higher rates, as indicated by the thickness of the arrows. (B) We propose a “spiraling complexity” model where some DMIs arise de novo, but others arise through simplification and complexification of previously existing DMIs. We also propose that complexification is the dominant force, as indicated by the thickness of the 123 arrows. The total thickness of the incoming arrows into each value of i in (B) is twice that in (A).

To distinguish between the two scenarios, we tracked every inviable single, double, and triple introgression in each simulation and classified it according to whether it originated de novo or through the simplification or complexification of another inviable introgression present after the previous substitution. For example, imagine that after k substitutions there are two inviable introgressions from one genotype to another: A (single) and B/C/D (triple). After k+1 substitutions there are three inviable introgressions: A/E (double, resulting from complexification of A), B/C (double, resulting from simplification of B/C/D), and F (single, de novo). Figure 7 shows that initially, most inviable introgressions arise de novo. However, after k=40 substitutions, the majority of inviable double and triple introgressions arise through complexification.

Figure 7.

Figure 7

DMI complexification is pervasive in the RNA model. Origin of inviable single (A), double (B), and triple (C) introgressions. The width of each sector indicates the proportion of inviable introgressions arising either de novo (blue) or through the modification of another inviable introgression present after the previous substitution. (A) Origin of single introgressions. Incoming arrows into i=1 in Figure 6B: double, 21; triple, 31. (B) Origin of double introgressions. Incoming arrows into i=2 in Figure 6B: single, 12; triple, 32. (C) Origin of triple introgressions. Incoming arrows into i=3 in Figure 6B: single, 13; double, 23.

When we compared the final numbers of inviable single and double introgressions we found that they were strongly negatively correlated (Spearman’s rank correlation coefficient: ρ=0.469, P<106; Figure S4). This result is difficult to explain under the Orr model, because it envisages that the rates of origination of DMIs of different complexity are independent. However, a negative correlation would be expected if complexification is important and the rates of complexification vary across simulation runs.

Reproductive isolation does not snowball in the RNA model

Since inviable introgressions snowball in the RNA model, RI would be expected to snowball as well (prediction #3). However, we found that RI showed a kind of inverse snowball effect—a “slowdown” with divergence (Figure 8A). This pattern has been found in many organisms (e.g., Gourbière and Mallet 2010; Giraud and Gourbière 2012). The slowdown was caused by the fact that RI increased slower than linearly with the number of DMIs (Figure 8B). Thus, DMIs did not act independently of each other on RI. One reason for this nonindependence is that the total number of inviable introgressions among highly diverged sequences is high enough (Figure 2) that a substantial fraction of individual sites must participate in multiple DMIs (Figure S2).

Figure 8.

Figure 8

RI does not snowball in the RNA model. (A) Evolution of RI. Values are means of 103 simulations with α=12. Shaded regions indicate 95% CIs. (B) Mean numbers of inviable single introgressions and RI at k=1,4,7,,40. Error bars are 95% C.I.

The structure of the fitness landscape influences the accumulation of DMIs

Figure 9 shows that the accumulation of inviable single introgressions varied extensively between simulations (see Figure S4 for inviable double introgressions). What caused this variation? All simulations took place on the same sequence space, but with different fitness landscapes. Since all fitness landscapes were “holey” (Gavrilets 2004), it follows that the exact pattern of “holeyness” might have had an effect on the evolutionary dynamics. One component of the holeyness of a fitness landscape is the proportion of inviable single mutant neighbors of all the sequences generated during the course of evolution. This measure of the local holeyness of the fitness landscape was strongly positively correlated with the final number of inviable single introgressions (ρ=0.308, P<106; Figure 9).

Figure 9.

Figure 9

The structure of the fitness landscape influences the accumulation of DMIs in the RNA model. The number of inviable single introgressions accumulated after 40 substitutions is positively correlated with the holeyness of the fitness landscape. One- and two-dimensional kernel density estimates based on 103 simulations with α=12.

What determines holeyness? The fitness landscapes in our RNA folding model have two determinants: the reference structure and the value of α (Equation 4). RNA secondary structures can differ in many ways, such as the number and size of base pair stacks, interior loops, and hairpin loops (Schuster et al. 1994). The relationship between these structural features and holeyness is difficult to predict a priori. For a given reference structure, lower values of α are expected to specify fitness landscapes with more inviable sequences (i.e., holes) in them. To evaluate the extent to which these determinants of the fitness landscape influence holeyness, we ran 103 independent evolutionary simulations at each of another four values of α. We found that holeyness was influenced by both determinants of the fitness landscape: it was negatively correlated with α (ρ=0.476; P<106; Figure S5A) and positively correlated with the number of base pairs in the reference sequence, β (ρ=0.134; P<106; Figure S5B). Interestingly, changing the value of α had only small effects on the patterns of accumulation of inviable introgressions (Figure S6 and Table S3).

Discussion

We have tested both predictions and assumptions of the Orr model using a computational model of RNA folding. Our results provide mixed support for the snowball effect (Table 2). Inviable introgressions snowballed, but did so more slowly than expected (predictions #1 and #2; Figure 2 and Table 1). To elucidate why, we tested two assumptions of the Orr model. First, that simple DMIs arise with constant probability, p (assumption #1). Although we did detect a decline in p, it was not sufficient to account for the pattern of accumulation of inviable single introgressions. Second, we tested assumption #4 that DMIs, once they have arisen, persist indefinitely. We found that this assumption was violated in the RNA model. Instead, DMIs had a tendency to become more complex as further substitutions took place.

Table 2. The RNA model provides mixed support for the Orr model.

Test Confirmed? Data
Assumption
 1. p constant with divergence Yes, approximately Figure S3
 2. DMIs have small effects on RI Yes Figure 8A
 3. DMIs have independent effects on RI No Figure 8B
 4. Simple DMIs persist indefinitely No Figure 4 and Figure 7
Prediction
 1. Simple DMIs snowball No Figure 2 and Table 1
 2. Complex DMIs snowball Yes, but more slowly than expected Figure 2 and Table 1
 3. RI snowballs No Figure 8A

Despite the snowballing of inviable introgressions, RI did not snowball (prediction #3 of the Orr model), because DMIs did not act independently of each other on RI (assumption #3). These results indicate that RI is a poor indicator for the number of inviable introgressions or DMIs in our model. Thus, the pattern of change in RI with divergence is unsuitable to test the Orr model (Mendelson et al. 2004; Johnson 2006; Gourbière and Mallet 2010; Presgraves 2010a). In conclusion, the RNA model provided qualitative support for the central prediction of the Orr model that the total number of DMIs snowballs. However, our results failed to confirm certain predictions of the Orr model, as well as some of its assumptions.

An earlier test of the Orr model using a computational model of gene networks also found no evidence for a snowball effect in RI (prediction #3), and concluded that some assumptions of the Orr model were not met (Palmer and Feldman 2009). However, the extent to which the complexification of DMIs influenced their results is unclear because they did not attempt to investigate the dynamics of the DMIs underlying RI.

In one direct empirical test of the snowball effect, inviable single introgressions affecting pollen sterility were found to accumulate linearly, whereas those affecting seed sterility were found to accumulate faster than linearly (Moyle and Nakazato 2010). Our results suggest a possible explanation for the discrepancy: faster complexification (i.e., higher q) of pollen sterility DMIs. Sherman et al. (2014) found evidence of greater complexity of the DMIs involved in pollen sterility.

If all DMIs are simple and individual loci are at most involved in one DMI, then the proportion of the fixed differences between species where an allele from one species is deleterious in another species, P1, is expected to increase linearly with genetic distance (Equation 3; Welch 2004). This prediction is contradicted by the observation that P1 is approximately constant over large genetic distances (Kondrashov et al. 2002; Kulathinal et al. 2004)—a result we call Welch’s paradox. Our results contradict both assumptions behind the prediction that P1 should increase linearly with genetic distance (Welch 2004): most DMIs are complex and individual loci are involved in multiple DMIs. These effects are expected to act in opposite directions: the former would cause P1 to snowball faster, whereas the latter would cause P1 to snowball more slowly. In the RNA simulations, P1 increased with divergence but did so slower than linearly (Figure S7), indicating that the lack of independence between DMIs dominates the evolution of P1. These results suggest a possible resolution for Welch’s paradox: P1 can be constant even if DMIs snowball if individual loci participate in multiple DMIs. Alternative resolutions of Welch’s paradox have been proposed (e.g., Fraïsse et al. 2016).

We found that the vast majority of DMIs in the RNA model were complex. Complex DMIs have been discovered in many introgression studies (reviewed in Wu and Palopoli 1994, Fraïsse et al. 2014). For example, Orr and Irving (2001) investigated the sterility of male F1 hybrids between the USA and Bogota subspecies of Drosophila pseudoobscura and found that it is caused by a DMI between loci in both chromosomes 2 and 3 of USA, and loci in at least three different regions of the X chromosome of Bogota—a DMI of order n5. More generally, high-order epistasis appears to be common (Weinreich et al. 2013; Kondrashov and Kondrashov 2015; Taylor and Ehrenreich 2015). However, the relative prevalence of simple and complex DMIs in nature is unclear, because complex DMIs are more difficult to detect.

Two explanations for the abundance of complex DMIs have been proposed. First, that more complex DMIs evolve more easily than simpler DMIs because they allow a greater proportion of the possible evolutionary paths between the common ancestor and the evolved genotypes containing the DMI (Cabot et al. 1994; Orr 1995). Fraïsse et al. (2014) tested this mechanism using simulations and concluded that it is unlikely to be effective. Second, that the number of combinations of n loci increases with n (Orr 1995). This explanation is difficult to evaluate in the absence of more information on the probability of origination of complex DMIs. Our results indicate that that probability could be higher than previously thought because complex DMIs are continuously generated from simpler DMIs. Indeed, our results suggest a new explanation for the abundance of complex DMIs: that DMIs have a tendency to become increasingly complex with divergence.

Our study has identified one determinant of the origination and complexification of DMIs: the holeyness of the fitness landscape. In a holey fitness landscape, our measure of holeyness is inversely related to the mutational robustness of the genotypes assayed (van Nimwegen et al. 1999; Ancel and Fontana 2000). In our model (as in Orr’s) “populations” are assumed to contain a single genotype; periodically, a mutant genotype arises and either goes to fixation or disappears. In such a model, mutational robustness is not expected to evolve (van Nimwegen et al. 1999). Individual-based simulations would allow us to investigate the intriguing possibility that factors that influence the evolution of mutational robustness (e.g., mutation rate, recombination rate, environmental fluctuations: Ancel and Fontana 2000; Wilke et al. 2001; Azevedo et al. 2006; Gardner and Kalinka 2006) may influence the accumulation of DMIs.

Perhaps the central insight from our study is that DMIs have a tendency to become more complex. At first glance this claim might seem baffling. Can a DMI really be simple one moment and complex the next? The solution to this puzzle rests, we believe, on the difference between a DMI having a certain order n and our ability to infer that it has order n through genetic crosses. Imagine that one lineage has accumulated three consecutive substitutions at loci A, B, and C. Derived alleles are indicated by uppercase letters and ancestral alleles by lowercase letters. Now, imagine that there is a complex DMI of order n=3 between the alleles a, B, and c, and that there are no simple DMIs between any pairs of the three alleles (i.e., a/B, a/c, and B/c). For simplicity, we also assume that none of the other alleles at the A, B, and C loci are involved in DMIs. The existence of a DMI is defined in the strict sense that any conceivable genotype containing all alleles involved in the DMI is inviable (conversely, the absence of a DMI indicates that at least one of the genotypes containing all alleles involved in the DMI are viable). Despite the a/B/c DMI being complex, after two substitutions, our introgression and rescue tests (File S2) would detect a nonexistent simple DMI between alleles a and B. The actual complex DMI can only be inferred after the third substitution has taken place. In the language we have been using so far, the simple DMI would appear to become more complex.

We believe that our central finding that DMIs have a tendency to become more complex is independent of the details of our model. Other results, such as the precise rate of accumulation of DMIs or inviable introgressions, are likely to be influenced by the details of our model. The extent to which the RNA folding model is representative of other types of epistatic interactions (e.g., in gene networks) is unclear. One possible criticism is that we used very short sequences and that these are likely to experience unusually strong epistatic interactions. Orr and Turelli (2001) estimated p107 in Drosophila, and Moyle and Nakazato (2010) estimated p109 in Solanum, much lower values than found in our simulations. However, an evolution experiment in S. cerevisiae detected a simple DMI between two lineages that had only accumulated six unique mutations each (k=12) (Anderson et al. 2010). This indicates a value of p0.015, close to what we estimated in the RNA model (Figures 3 and Figure S3). Our approach to testing the Orr model can be applied to other computational models of biological systems, such as transcription-factor binding (Tulchinsky et al. 2014; Khatri and Goldstein 2015), gene networks (Palmer and Feldman 2009; ten Tusscher and Hogeweg 2009), and metabolic networks (Barve and Wagner 2013).

Our results were robust to a broad range of holey fitness landscapes defined in the RNA folding model. However, the holey landscape model makes two strong assumptions about the fitness landscape: all viable genotypes had the same fitness, and all low fitness genotypes were completely inviable. Neither assumption is met universally: many alleles involved in DMIs appear to have experienced positive selection during their evolutionary history (Presgraves 2010b; Rieseberg and Blackman 2010; Maheshwari and Barbash 2011), and some DMIs are only mildly deleterious rather than lethal (Presgraves 2003; Schumer et al. 2014). These assumptions can be relaxed in the RNA folding model (e.g., Cowperthwaite et al. 2005; Draghi et al. 2011), and in other models (e.g., Palmer and Feldman 2009; Tulchinsky et al. 2014; Khatri and Goldstein 2015).

Studies like ours can test whether the snowball effect occurs under well-defined circumstances. However, they cannot test the reality of the snowball effect; introgression studies with real organisms remain the only way to do so (Matute et al. 2010; Moyle and Nakazato 2010; Matute and Gavin-Smyth 2014; Sherman et al. 2014; Wang et al. 2015).

Supplementary Material

Supplemental material is available online at www.genetics.org/lookup/suppl/doi:10.1534/genetics.116.196030/-/DC1.

Acknowledgments

Tim Cooper, Tiago Paixão, Leonie Moyle, and two anonymous reviewers provided useful comments on the manuscript. We had helpful discussions with Rafael Guerrero, Peter Olofsson, and Jeff Tabor. We used the Maxwell and Opuntia clusters from the Center of Advanced Computing and Data Systems (CACDS) at the University of Houston. CACDS staff provided technical support. The National Science Foundation (grant DEB-1354952 awarded to R.B.R.A.) funded this work.

Footnotes

Communicating editor: L. C. Moyle

Literature Cited

  1. Ancel L. W., Fontana W., 2000.  Plasticity, evolvability, and modularity in RNA. J. Exp. Zool. 288: 242–283. [DOI] [PubMed] [Google Scholar]
  2. Anderson J. B., Funt J., Thompson D. A., Prabhu S., Socha A., et al. , 2010.  Determinants of divergent adaptation and Dobzhansky-Muller interaction in experimental yeast populations. Curr. Biol. 20: 1383–1388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Azevedo R. B. R., Lohaus R., Srinivasan S., Dang K. K., Burch C. L., 2006.  Sexual reproduction selects for robustness and negative epistasis in artificial gene networks. Nature 440: 87–90. [DOI] [PubMed] [Google Scholar]
  4. Barve A., Wagner A., 2013.  A latent capacity for evolutionary innovation through exaptation in metabolic systems. Nature 500: 203–206. [DOI] [PubMed] [Google Scholar]
  5. Bolnick D. I., Near T. J., 2005.  Tempo of hybrid inviability in centrarchid fishes (Teleostei: Centrarchidae). Evolution 59: 1754–1767. [PubMed] [Google Scholar]
  6. Cabot E. L., Davis A. W., Johnson N. A., Wu C. I., 1994.  Genetics of reproductive isolation in the Drosophila simulans clade: complex epistasis underlying hybrid male sterility. Genetics 137: 175–189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Cowperthwaite M. C., Bull J. J., Meyers L. A., 2005.  Distributions of beneficial fitness effects in RNA. Genetics 170: 1449–1457. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Coyne J. A., Orr H. A., 1989.  Patterns of speciation in Drosophila. Evolution 43: 362–381. [DOI] [PubMed] [Google Scholar]
  9. Darwin C., 1859.  On the Origin of Species by Means of Natural Selection. J. Murray, London. [Google Scholar]
  10. Dettman J. R., Jacobson D. J., Turner E., Pringle A., Taylor J. W., 2003.  Reproductive isolation and phylogenetic divergence in Neurospora: comparing methods of species recognition in a model eukaryote. Evolution 57: 2721–2741. [DOI] [PubMed] [Google Scholar]
  11. Dobzhansky T., 1937.  Genetics and the Origin of Species. Columbia University Press, New York. [Google Scholar]
  12. Doshi K. J., Cannone J. J., Cobaugh C. W., Gutell R. R., 2004.  Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction. BMC Bioinformatics 5: 105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Draghi J. A., Parsons T. L., Wagner G. P., Plotkin J. B., 2010.  Mutational robustness can facilitate adaptation. Nature 463: 353–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Draghi J. A., Parsons T. L., Plotkin J. B., 2011.  Epistasis increases the rate of conditionally neutral substitution in an adapting population. Genetics 187: 1139–1152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Edmands S., 2002.  Does parental divergence predict reproductive compatibility? Trends Ecol. Evol. 17: 520–527. [Google Scholar]
  16. Fraïsse C., Elderfield J. A. D., Welch J. J., 2014.  The genetics of speciation: are complex incompatibilities easier to evolve? J. Evol. Biol. 27: 688–699. [DOI] [PubMed] [Google Scholar]
  17. Fraïsse C., Gunnarsson P. A., Roze D., Bierne N., Welch J. J., 2016.  The genetics of speciation: insights from Fisher’s geometric model. Evolution 70: 1450–1464. [DOI] [PubMed] [Google Scholar]
  18. Gardner A., Kalinka A. T., 2006.  Recombination and the evolution of mutational robustness. J. Theor. Biol. 241: 707–715. [DOI] [PubMed] [Google Scholar]
  19. Gavrilets S., 2004.  Fitness Landscapes and the Origin of Species. Princeton University Press, Princeton, NJ. [Google Scholar]
  20. Giraud T., Gourbière S., 2012.  The tempo and modes of evolution of reproductive isolation in fungi. Heredity 109: 204–214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gourbière S., Mallet J., 2010.  Are species real? The shape of the species boundary with exponential failure, reinforcement, and the “missing snowball”. Evolution 64: 1–24. [DOI] [PubMed] [Google Scholar]
  22. Guerrero R. F., Nakazato T., Muir C. D., Josway S., Moyle L. C., 2016.  Pervasive antagonistic interactions among hybrid incompatibility loci. bioRxiv DOI: 10.1101/090886. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Jewell C., Papineau A. D., Freyre R., Moyle L. C., 2012.  Patterns of reproductive isolation in Nolana (Chilean bellflower). Evolution 66: 2628–2636. [DOI] [PubMed] [Google Scholar]
  24. Johnson N. A., 2006.  The evolution of reproductive isolating barriers, pp. 374–398 in Evolutionary Genetics: Concepts and Case Studies, edited by Fox C. W., Wolf J. B. Oxford University Press, Oxford, UK. [Google Scholar]
  25. Kachroo A. H., Laurent J. M., Yellman C. M., Meyer A. G., Wilke C. O., et al. , 2015.  Systematic humanization of yeast genes reveals conserved functions and genetic modularity. Science 348: 921–925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Khatri B. S., Goldstein R. A., 2015.  Simple biophysical model predicts faster accumulation of hybrid incompatibilities in small populations under stabilizing selection. Genetics 201: 1525–1537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Kondrashov A. S., Sunyaev S., Kondrashov F. A., 2002.  Dobzhansky-Muller incompatibilities in protein evolution. Proc. Natl. Acad. Sci. USA 99: 14878–14883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kondrashov D. A., Kondrashov F. A., 2015.  Topological features of rugged fitness landscapes in sequence space. Trends Genet. 31: 24–33. [DOI] [PubMed] [Google Scholar]
  29. Kulathinal R. J., Bettencourt B. R., Hartl D. L., 2004.  Compensated deleterious mutations in insect genomes. Science 306: 1553–1554. [DOI] [PubMed] [Google Scholar]
  30. Larcombe M. J., Holland B., Steane D. a., Jones R. C., Nicolle D., et al. , 2015.  Patterns of reproductive isolation in Eucalyptus—a phylogenetic perspective. Mol. Biol. Evol. 32: 1833–1846. [DOI] [PubMed] [Google Scholar]
  31. Lijtmaer D. A., Mahler B., Tubaro P. L., 2003.  Hybridization and postzygotic isolation patterns in pigeons and doves. Evolution 57: 1411–1418. [DOI] [PubMed] [Google Scholar]
  32. Liti G., Barton D. B. H., Louis E. J., 2006.  Sequence diversity, reproductive isolation and species concepts in Saccharomyces. Genetics 174: 839–850. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Livingstone K., Olofsson P., Cochran G., Dagilis A., MacPherson K., et al. , 2012.  A stochastic model for the development of Bateson–Dobzhansky–Muller incompatibilities that incorporates protein interaction networks. Math. Biosci. 238: 49–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lorenz R., Bernhart S. H., Höner Zu Siederdissen C., Tafer H., Flamm C., et al. , 2011.  ViennaRNA Package 2.0. Algorithms Mol. Biol. 6: 26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Maheshwari S., Barbash D. A., 2011.  The genetics of hybrid incompatibilities. Annu. Rev. Genet. 45: 331–355. [DOI] [PubMed] [Google Scholar]
  36. Mathews D. H., Sabina J., Zuker M., Turner D. H., 1999.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J. Mol. Biol. 288: 911–940. [DOI] [PubMed] [Google Scholar]
  37. Matute D. R., Gavin-Smyth J., 2014.  Fine mapping of dominant X-linked incompatibility alleles in Drosophila hybrids. PLoS Genet. 10: e1004270 (erratum: PLoS Genet. 11:e1005558). [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Matute D. R., Butler I. A., Turissini D. A., Coyne J. A., 2010.  A test of the snowball theory for the rate of evolution of hybrid incompatibilities. Science 329: 1518–1521. [DOI] [PubMed] [Google Scholar]
  39. Mendelson T. C., 2003.  Sexual isolation evolves faster than hybrid inviability in a diverse and sexually dimorphic genus of fish (Percidae: Etheostoma). Evolution 57: 317–327. [DOI] [PubMed] [Google Scholar]
  40. Mendelson T. C., Inouye B. D., Rausher M. D., 2004.  Quantifying patterns in the evolution of reproductive isolation. Evolution 58: 1424–1433. [DOI] [PubMed] [Google Scholar]
  41. Moyle L. C., Nakazato T., 2010.  Hybrid incompatibility ”snowballs” between Solanum species. Science 329: 1521–1523. [DOI] [PubMed] [Google Scholar]
  42. Moyle L. C., Olson M. S., Tiffin P., 2004.  Patterns of reproductive isolation in three angiosperm genera. Evolution 58: 1195–1208. [DOI] [PubMed] [Google Scholar]
  43. Muller H. J., 1942.  Isolating mechanisms, evolution and temperature. Biol. Symp. 6: 71–125. [Google Scholar]
  44. Nosil P., Feder J. L., 2012.  Genomic divergence during speciation: causes and consequences. Phil. Trans. R. Soc. B 367: 332–342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Orr H. A., 1995.  The population genetics of speciation: the evolution of hybrid incompatibilities. Genetics 139: 1805–1813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Orr H. A., Irving S., 2001.  Complex epistasis and the genetic basis of hybrid sterility in the Drosophila pseudoobscura Bogota-USA hybridization. Genetics 158: 1089–1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Orr H. A., Turelli M., 2001.  The evolution of postzygotic isolation: accumulating Dobzhansky-Muller incompatibilities. Evolution 55: 1085–1094. [DOI] [PubMed] [Google Scholar]
  48. Palmer M. E., Feldman M. W., 2009.  Dynamics of hybrid incompatibility in gene networks in a constant environment. Evolution 63: 418–431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Presgraves D. C., 2002.  Patterns of postzygotic isolation in Lepidoptera. Evolution 56: 1168–1183. [DOI] [PubMed] [Google Scholar]
  50. Presgraves D. C., 2003.  A fine-scale genetic analysis of hybrid incompatibilities in Drosophila. Genetics 163: 955–972. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Presgraves D. C., 2010a Speciation genetics: search for the missing snowball. Curr. Biol. 20: R1073–R1074. [DOI] [PubMed] [Google Scholar]
  52. Presgraves D. C., 2010b The molecular evolutionary basis of species formation. Nat. Rev. Genet. 11: 175–180. [DOI] [PubMed] [Google Scholar]
  53. R Core Team , 2016.  R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
  54. Rieseberg L. H., Blackman B. K., 2010.  Speciation genes in plants. Ann. Bot. (Lond.) 106: 439–455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Roberts M. S., Cohan F. M., 1993.  The effect of DNA sequence divergence on sexual isolation in Bacillus. Genetics 134: 401–408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Sasa M. M., Chippindale P. T., Johnson N. A., 1998.  Patterns of postzygotic isolation in frogs. Evolution 52: 1811–1820. [DOI] [PubMed] [Google Scholar]
  57. Schumer M., Cui R., Powell D., Dresner R., Rosenthal G. G., et al. , 2014.  High-resolution mapping reveals hundreds of genetic incompatibilities in hybridizing fish species. Elife 3: e02535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Schuster P., Fontana W., Stadler P. F., Hofacker I. L., 1994.  From sequences to shapes and back: a case study in RNA secondary structures. Proc. Biol. Sci. 255: 279–284. [DOI] [PubMed] [Google Scholar]
  59. Scopece G., Musacchio A., Widmer A., Cozzolino S., 2007.  Patterns of reproductive isolation in Mediterranean deceptive orchids. Evolution 61: 2623–2642. [DOI] [PubMed] [Google Scholar]
  60. Seehausen O., Butlin R. K., Keller I., Wagner C. E., Boughman J. W., et al. , 2014.  Genomics and the origin of species. Nat. Rev. Genet. 15: 176–192. [DOI] [PubMed] [Google Scholar]
  61. Sherman N. A., Victorine A., Wang R. J., Moyle L. C., 2014.  Interspecific tests of allelism reveal the evolutionary timing and pattern of accumulation of reproductive isolation mutations. PLoS Genet. 10: e1004623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Stelkens R. B., Young K. A., Seehausen O., 2010.  The accumulation of reproductive incompatibilities in African cichlid fish. Evolution 64: 617–633. [DOI] [PubMed] [Google Scholar]
  63. Taylor M. B., Ehrenreich I. M., 2015.  Higher-order genetic interactions and their contribution to complex traits. Trends Genet. 31: 34–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. ten Tusscher K. H. W. J., Hogeweg P., 2009.  The role of genome and gene regulatory network canalization in the evolution of multi-trait polymorphisms and sympatric speciation. BMC Evol. Biol. 9: 159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. The Marie Curie SPECIATION Network , 2012.  What do we need to know about speciation? Trends Ecol. Evol. 27: 27–39. [DOI] [PubMed] [Google Scholar]
  66. Tulchinsky A. Y., Johnson N. A., Watt W. B., Porter A. H., 2014.  Hybrid incompatibility arises in a sequence-based bioenergetic model of transcription factor binding. Genetics 198: 1155–1166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. van Nimwegen E., Crutchfield J. P., Huynen M., 1999.  Neutral evolution of mutational robustness. Proc. Natl. Acad. Sci. USA 96: 9716–9720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Wagner A., 2008.  Robustness and evolvability: a paradox resolved. Proc. Biol. Sci. 275: 91–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Wang R. J., White M. A., Payseur B. A., 2015.  The pace of hybrid incompatibility evolution in house mice. Genetics 201: 229–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Weinreich D. M., Lan Y., Wylie C. S., Heckendorn R. B., 2013.  Should evolutionary geneticists worry about higher-order epistasis? Curr. Opin. Genet. Dev. 23: 700–707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Welch J. J., 2004.  Accumulating Dobzhansky-Muller incompatibilities: reconciling theory and data. Evolution 58: 1145–1156. [DOI] [PubMed] [Google Scholar]
  72. Wilke C. O., Wang J. L., Ofria C., Lenski R. E., Adami C., 2001.  Evolution of digital organisms at high mutation rates leads to survival of the flattest. Nature 412: 331–333. [DOI] [PubMed] [Google Scholar]
  73. Wu C.-I., Palopoli M. F., 1994.  Genetics of postmating reproductive isolation in animals. Annu. Rev. Genet. 28: 283–308. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The software used to run all simulations was written in Python 2.7 and is available at https://github.com/Kalirad/spiraling_complexity (doi: http://doi.org/10.5281/zenodo.400020).


Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES