Abstract
In hybrid zones, whether barrier loci experience selection mostly independently or as a unit depends on the ratio of selection to recombination as captured by the coupling coefficient. Theory predicts a sharper transition between an uncoupled and coupled system when more loci affect hybrid fitness. However, the extent of coupling in hybrid zones has rarely been quantified. Here, we use simulations to characterize the relationship between the coupling coefficient and variance in clines across genetic loci. We then reanalyze 25 hybrid zone data sets and find that cline variances and estimated coupling coefficients form a smooth continuum from high variance and weak coupling to low variance and strong coupling. Our results are consistent with low rates of hybridization and a strong genome-wide barrier to gene flow when the coupling coefficient is much greater than 1, but also suggest that this boundary might be approached gradually and at a near constant rate over time.
Species are often separated by genetic and phenotypic discontinuities that evolved and are maintained by natural selection. Whether such discontinuities generally arise from selection on one, a few, or many genes is largely unknown (Nosil and Schluter 2011; Nosil et al. 2021). For example, theory suggests that speciation with gene flow occurs more readily when selection is concentrated on a few large effect genes (Yeaman and Whitlock 2011), and there is some empirical evidence consistent with this prediction (Kozak et al. 2019; Unbehend et al. 2021). However, empirical studies also show that speciation can involve divergence at many genes (Michel et al. 2010; Martin et al. 2019; Kautt et al. 2020), although this does not necessarily imply that many genes causally drive speciation. When multiple genes contribute to speciation, linkage disequilibrium (LD) among these genes and other regions of the genome increases the total selection experienced by each locus (Barton 1983; Barton and de Cara 2009). In essence, LD causes selection on causal loci (i.e., barrier loci) to spill over leading to indirect selection at other loci. Barton (1983) referred to this process as “coupling,” the usage of which is consistent with the definition of coupling described in perspective 2 of Dopman et al. (2023), but is somewhat different from the definition from Ritchie and Butlin (2023) that emphasizes associations between genes affecting signals and preference. Whether selection affects individual genes or larger genomic regions (e.g., chromosomes or the entire genome) depends on coupling and this in turn influences whether genes or genomes diverge during speciation (Barton and Bengtsson 1986; Wu 2001). The transition to a coupled system where LD causes a genome-wide barrier to gene flow might be important for completing the speciation process (Barton and de Cara 2009; Nosil et al. 2021).
Hybrid zones are powerful systems for studying speciation in general (Peñalba et al. 2023), and specifically for assessing whether selection occurs on barrier loci independently or as a unit creating a genome-wide barrier to gene flow (Abbott et al. 2013; Harrison and Larson 2014; Gompert et al. 2017). In hybrid zones, spatial structure and selection (s) against hybrids maintains LD among barrier loci, whereas recombination (r) in hybrids breaks down these associations. The balance between these processes determines the extent to which barrier loci operate independently or as a unit and this is captured by Barton's coupling coefficient θ = s/r (Barton 1983). Specifically, Barton (1983) considered a model of simple underdominance, such that an individual heterozygous for n barrier loci has a fitness of (1−s)n; r denotes the average recombination rate between neighboring underdominant loci, such that the total map length with L loci is R = (L−1)/r. When θ is greater than 1, the system is coupled and the total selection experienced by each locus (denoted s*) approaches sL (at least up to sL = 1, at which point hybrid fitness is 0), that is, the combined effects of all barrier loci (Fig. 1A; Barton 1983). Whenever coupling occurs, it is s* (direct and indirect selection combined) rather than s (direct selection due to the causal effect of a locus on fitness) that determines the shape of a cline, particularly the slope near the center of the cline (Barton 1983).
Figure 1.
Coupling and its consequences. (A) The coupling coefficient θ = s/r determines the extent to which barrier loci operate independently or as a unit, depicted here with a few example loci. As coupling (θ) increases the total selection (s*, represented by the size of the vertical arrows) experienced by barrier loci (red bars) and neutral loci (gray bars) increases because of increased linkage disequilibrium. With low coupling, geographic (B) and genomic (C) clines vary across the genome, whereas with high coupling geographic clines steepen and exhibit similar cline centers and widths and genomic clines converge to the genome-average admixture gradient (hybrid index). Geographic and genomic clines at barrier loci and neutral loci are shown in red and gray, respectively, in B and C. Dashed lines in C denote the genome-average admixture gradient.
Barton (1983) noted a sharp transition between uncoupled and coupled systems in analytical models at θ = 1 when holding R and the total selection constant but increasing the number of barrier loci L. More recently, results from individual-based simulations have documented a similarly sharp transition in time when populations diverge with gene flow (Flaxman et al. 2014; Nosil et al. 2017). Specifically, divergence remains low until a sufficient number of adaptive mutations (i.e., barrier loci) build-up at which point the system enters a positive feedback loop where divergent selection increases genetic differentiation. This increases LD for the system as a whole, in turn further increasing the selection experienced by each locus. These sharp transitions in space and time suggest that LD among barrier loci can lead to rapid transitions from early to late stages of speciation. However, other theoretical results suggest a smoother transition from uncoupled to coupled systems (i.e., Kruuk et al. 1999) and provide limited evidence for the critical transition in terms of LD among barrier loci at θ = 1 perhaps because of a slow approach to equilibrium expectations (Baird 1995).
Recent work has placed an increasing emphasis on coupling (Bierne et al. 2011; Abbott et al. 2013; Kunerth et al. 2022), and several hybrid zone studies have tested for or analyzed coupling (Vines et al. 2016; Ryan et al. 2017; Cruzan et al. 2021). Nonetheless, we know relatively little about coupling in most hybrid zones. One reason for our limited knowledge is that coupling is not directly measured in the most commonly estimated cline summaries, such as cline concordance and coincidence, and these estimates provide only a binary classification of clines. We aim to overcome these limitations by first using simulations to determine the relationship between the coupling coefficient (θ) and the variance in clines across loci. Specifically, we test the predictions that (1) cline variances decline with increased values of θ, and (2) that the decline in cline variances is nonlinear with a notable transition to lower variances at θ = 1. We then reanalyze genomic data from 25 hybrid zones to infer cline variances and convert these into quantitative estimates of coupling. This allows us to ask whether the hybrid zones we consider exhibit only strong or weak coupling, as predicted by tipping point models, or form more of a connected continuum with a notable zone of intermediate systems (i.e., a gray zone of speciation; Roux et al. 2016). We conclude by assessing the consistency of coupling across replicate hybrid zone transects and differential coupling of autosomes versus sex chromosomes based on a subset of these data sets.
MEASURING CLINE VARIANCE
We used two complementary approaches to quantify the variance in clines across a set of loci and evaluate whether the variances relate to the coupling coefficient (θ) in a linear or nonlinear manner (Fig. 1). First, we used geographic clines in allele frequencies. One difficulty with the geographic approach is that expected cline shape differs for cases with and without coupling. Uncoupled, single locus clines are expected to be sigmoidal, whereas coupled, multilocus clines can be better approximated by a steeper sigmoidal function in the center of the cline and shallower exponential decay in the tails (Barton 1983; Szymura and Barton 1986). This is because the shape of the central portion of the cline reflects the total selection (s*) on a locus, including that caused by LD with barrier loci, whereas the shape of the tails of the cline should mostly reflect direct selection on the locus (or on tightly linked loci in LD outside of the hybrid zone). Thus, to provide a common framework for quantifying cline variance using a single model, we focus on the center of each cline—the region governed by total selection (s*), which is equivalent to s in the uncoupled, single locus model. We fit a linear model for the logit allele frequency, log(p/(1–p)). The slope in this model is expected to be four times the allele frequency gradient (i.e., the reciprocal of cline width) at the center of the cline (i.e., where p = 0.5) (Barton and Hewitt 1989; Kruuk et al. 1999). The variance (or rather the standard deviation [SD]) in slope among loci then serves as our metric of cline variance in this model.
Our second approach involved fitting clines in the probability of locus-specific ancestry along a genome-wide admixture gradient (i.e., as a function of a hybrid index; Szymura and Barton 1986; Gompert and Buerkle 2009, 2011). Here, we use the logit–logistic genomic cline function proposed by Fitzpatrick (2013), where the probability a gene copy at locus i for individual j was inherited from species 2 is ϕij = . In this equation, hj denotes the hybrid index (proportion of the genome inherited from a species 2), vi measures the cline slope (gradient) for locus i relative to the average (v = 1) and ui defines the cline center for locus i relative to the genome average and to vi (Fitzpatrick 2013). Consequently, information on ancestry at a locus depends on the genotype, an individual's hybrid index (based on all loci), and the genomic cline parameters (based on all of the analyzed hybrids) (Gompert and Buerkle 2011). We use a reparameterization following Bailey (2022) with logit(ci) = ui/vi to define the more intuitive cline center parameter (ci) that indicates the hybrid index value at which ϕij = 0.5 (i.e., the probability of ancestry from both species is equal). We measure the variance in clines here as the SD in log(vi) and logit(ci), both of which have expected means of 0 (i.e., the values averaged across all loci used to define the hybrid index should be 0).
Each of these models has benefits and drawbacks. Geographic clines can only capture hybrid zone dynamics when there is a geographic gradient of gene flow and the scale of cline parameters are dependent on the geographic scale of dispersal, which varies among organisms and is often poorly known. Genomic clines are always relative to genome-average admixture (hybrid index) and thus are not in geographic units, but instead measure the change in the ancestry probability per unit change in hybrid index. This makes the variance in genomic clines easier to compare across hybrid zones, however genomic clines cannot capture the absolute extent of introgression and fail when hybrids are rare (i.e., when the hybrid zone lacks an admixture gradient in h). For this reason, we consider both geographic and genomic clines in simulations (where the scale of dispersal is set), but consider only genomic clines in our analyses of empirical data sets.
We take a hierarchical Bayesian approach to fitting geographic and genomic clines. This is important because it allows us to explicitly estimate the cline variances as parameters in our models. Thus, we assume that the slopes (β) for geographic clines and log(v) and logit(c) for genomic clines at each locus represent independent draws from a higher level (normal) distribution with a mean and SD. We are most interested in the SD that describes the variability in clines across the genome. For the geographic cline analysis, we estimate both the mean and SD. We placed a weakly informative normal prior (mean = 0, SD = 5) on the mean (μβ) and a weakly informative gamma prior (shape = 0.1, rate = 0.01) on the SD (σβ). In the case of the genomic clines, we expect means of 0 and set them as such (i.e., we use soft centering but do not impose a sum-to-zero constraint), but we estimate the SDs (σv and σc) by setting weakly informative normal priors (mean = 0, SD = 1) on both. We fit these models using Hamiltonian Monte Carlo (HMC), which is an algorithm that allows for efficient exploration of and sampling from complex posterior probability distributions (Neal 2011). This allows us to better estimate these SD parameters, which can exhibit poorer mixing using alternative Markov chain Monte Carlo methods (Z Gompert, pers. observation). Model fitting was done using Stan via the R (versions 4.1 and 4.2) interface, rstan (version 2.21.7).
SIMULATIONS CONNECTING CLINE VARIANCE TO COUPLING
We simulated hybrid zones with known coupling coefficients and analyzed the simulated data using the models described in the preceding section to measure the association between coupling and the variance in cline parameters. We were especially interested in whether there was a sharp decrease in cline variance at a coupling coefficient of θ = 1. Hybrid zones were simulated using the dfuse software described by Lindtke and Buerkle (2015). This software runs individual-based simulations of secondary contact using a stepping stone model and tracks ancestry junctions (Baird 1995; Buerkle and Rieseberg 2008). We modified the existing software to include a model of multiplicative underdominance equivalent to that considered by Barton (1983), that is, where the fitness of a hybrid heterozygous at n barrier loci is (1−s)n. All simulations included 110 demes, each with an adult carrying capacity of 50, arrayed in a one-dimensional stepping-stone model. We set the migration rate between neighboring demes to either 0.1 or 0.2. This set of conditions was chosen to approximate a pair of hybridizing species distributed continuously in space using a large number of small, well-connected demes. We simulated hermaphroditic, diploid organisms with a single 1 Morgan chromosome (thus, there was one expected recombination per meiosis). The simulations assumed random mating within demes with viability selection on progeny. Each generation began with a reproduction phase, which involved creating offspring until either the progeny carrying capacity was reached (100 individuals or twice the adult carrying capacity) or until all of maternal gametes from the adults from the previous generation were exhausted (five per individual). This was followed by progeny dispersal and mortality selection where the probability of survival was given by hybrid fitness, that is (1−s)n. Surviving progeny (now adults) were then culled randomly to the adult carrying capacity if more than 50 progeny survived (see Lindtke and Buerkle 2015 for a full description of the simulation model).
We conducted simulations with coupling coefficients, θ = 0.05, 0.1, 0.3, 0.5, 0.7, 0.9, 1, 1.1, 1.5, and 2, and with the number of underdominant barrier loci set to L = 2, 10, 100, 200, 500, or 1000. These were spaced equally across the chromosome, with an average recombination rate between neighboring loci of r = 1/(L−1). We then used the relationship θ = s/r to calculate the appropriate per locus selection coefficient to achieve the desired coupling coefficient (a few combinations of L and θ would have required s > 1 and fitness < 0; these combinations were dropped). We ran the simulations for 2000 generations but examined output every 500 generations to verify that the clines appeared stable (a lack of change could reflect an approximate equilibrium outcome or very gradual approach to a not-yet-reached equilibrium; see Baird 1995). In total, 1140 hybrid zone data sets were simulated. In each case, genotypes at 51 diagnostic (fixed differences between species) biallelic markers spaced evenly along the chromosome (every 2 cM) were output for analysis.
To run the geographic cline analyses, we defined the center of each simulated hybrid zone as the deme where the mean allele frequency was closest to 0.5 (demes near the outer edges of the simulated hybrid zone [i.e., demes 1 to 20 and 90 to 110] were excluded from consideration). We then focused our analyses on the 11 demes centered on this central deme (i.e., the central deme plus five on each side). We fit our hierarchical Bayesian geographic cline models to the logit allele frequencies from these 11 sites. Our focus on 11 sites represents a compromise between avoiding the exponential decay portion of multilocus clines, retaining a sufficient number of demes for analysis, and not unduly constraining the possible values for the slope. This differs from the threshold of logit(p) between −2 and 2 used by Kruuk et al. (1999), which would not have been as practical across the range of simulated hybrid zones. We fit each cline model using HMC with 20 chains each comprising 1200 iterations and a 1000 iteration warmup. We computed the Gelman–Rubin convergence diagnostic to verify adequate HMC mixing and likely convergence of the HMC algorithm to the posterior distribution.
Across the 1140 simulated hybrid zone data sets, the mean cline slope (μβ, measured in units of change in logit allele frequency per deme) ranged from −1.65 to −0.11 (mean = −0.88) and the SD in slopes (σβ) ranged from 0.0027 to 0.67 (mean 0.26). Together, the mean and SD in cline slopes explained 85.2% of the variation in the simulated coupling coefficients (standardized regression coefficients from simple linear regression with an interaction term: |μβ| = 0.47, s.e. = 0.008; σβ = −0.027, s.e. = 0.008; |μβ|:σβ = −0.084, s.e. = 0.010; model P-value < 0.0001) (Fig. 2A). Because the scale of variability in clines (σβ) depends on the average slope, we also considered the coefficient of variation (CV), that is, σβ/|μβ|. We found that the CV was negatively associated with θ and by itself explained 74.0% of the variation in this parameter for the simulated data sets. Importantly, we found that μβ and the CV varied smoothly with θ, without an obvious, abrupt transition in these high-level cline parameters at θ = 1. With that said, there was evidence that μβ and CV approached an asymptote around θ = 1 (Fig. 2B,D; supplemental Table S1). A similarly nonlinear relationship was detected between σβ and θ, such that the SD in clines increased and then decreased as a function of the coupling coefficient with the transition occurring around θ = 1 (Fig. 2C; supplemental Table S1). This transition was especially pronounced when viewed on a log–log scale, where there was evidence of a bimodal distribution of σβ and possible bistability around θ = 1 as is sometimes observed at tipping points (supplemental Fig. S1; e.g., Nosil et al. 2017).
Figure 2.
Relationship between geographic clines and coupling in simulated hybrid zones. Panel A shows estimates of mean cline slope (μβ) and the standard deviation (SD) in slopes (σβ) from the simulated hybrid zones. Points are colored by the known coupling coefficient. Panels B–D show the relationship between the coupling coefficient (θ) and μβ (B), σβ (C), and the coefficient of variation (CV) for cline slopes (D). Points denote results from individual simulations and are colored based on the migration rate (m) between neighboring demes. The best fit line from polynomial regression is shown along with the corresponding coefficient of determination (r2).
To fit genomic clines, we first designated the subset of demes with mean allele frequencies between 0.1 and 0.9 to constitute the hybrid zone (as opposed to nonhybrid parental populations). Such demes need not be contiguous and do not necessarily include only or even mostly hybrid individuals, but intermediate allele frequencies at the diagnostics marker loci suggest at least the possibility for hybrids. We then sampled up to 300 individuals as putative hybrids from these demes. We only fit models when one or more demes met this criterion. We used the known hybrid indexes and parental allele frequencies from dfuse and fit the hierarchical Bayesian genomic cline models described above using HMC. We ran four independent HMC chains per data set with 2000 total iterations and a 1000 iteration burn-in for each chain. As with the geographic cline analysis, we used the Gelman–Rubin convergence diagnostic to verify good HMC performance.
The total number of hybrids, defined here as individuals with hybrid indexes between 0.1 and 0.9, in the simulated hybrid zones declined with increasing coupling coefficients (linear regression r2 = 0.71, P < 0.0001) (supplemental Fig. S2). The SDs for genomic cline center (σc) and slope (σv) together explained 52.3% of the variation in the simulated coupling coefficients (standardized regression coefficients from linear regression: σc = −0.0167, s.e. = 0.013; σv = −0.24, s.e. = 0.013; σc:σv = 0.097, s.e. = 0.0090; model P-value < 0.0001) (Fig. 3A). These results consider only simulated data sets with at least 10 hybrids (defined as individuals with hybrid indexes between 0.1 and 0.9); similar results were obtained when analyzing data sets that included 50 or more hybrids (see supplemental Fig. S3). Using polynomial regression, we found some evidence of a nonlinear relationship between the coupling coefficient (θ) and these cline hyperparameters (σc and σv) (Fig. 3; supplemental Table S2), but we did not detect a sharp transition for either parameter around θ = 1. However, we were only able to analyze a modest proportion of data sets with θ > 1 (e.g., 34% with θ = 1.5 and 2% with θ = 2, compared to 83% with θ = 1 and all simulations with θ ≤ 0.5) as these produced few actual hybrids, often too few to have a reasonable hybrid index axis for cline fitting (see supplemental Fig. S2). Thus, the lack of hybrids when coupling coefficients notably exceed 1 makes detecting such a sharp transition difficult but at the same time represents a transition to strong reproductive isolation. In other words, while we cannot estimate the variance in clines without hybrids, we can conceptualize such hybrid zones as having a cline variance of 0.
Figure 3.
Relationship between genomic clines and coupling in simulated hybrid zones. Panel A shows estimates of the standard deviation (SD) for genomic cline center (σc) and slope (σv) from the simulated hybrid zones. Points are colored by the known coupling coefficient. Panels B and C show the relationship between the coupling coefficient (θ) and σc (B) or σv (C). Points denote results from individual simulations and are colored based on the migration rate (m) between neighboring demes (see Fig. 2). The best fit line from polynomial regression is shown along with the corresponding coefficient of determination (r2).
Overall, our geographic and genomic analyses of simulated hybrid zones revealed a strong relationship between the coupling coefficients (θ) and the cline hyperparameters (σβ and μβ for geographic clines and σc and σv for genomic clines), with mixed evidence of a smooth versus more abrupt transition in these parameters as a function of θ. Importantly, even if patterns for clines vary continuously with θ, feedbacks in nature could cause most hybrid zones to fall into a low coupling state with high cline variance (and wide mean geographic clines) or a high coupling state with low cline variance (and steep mean geographic clines). It is to this topic of natural hybrid zones that we now turn. Although our analyses of simulations suggested a stronger relationship between geographic cline parameters and coupling than between genomic cline parameters and coupling, this was for a set of simulations conducted on a similar scale of dispersal. This will not be true for diverse natural systems making geographic comparisons difficult. We thus focus exclusively on genomic clines for natural hybrid zones. This also allows us to analyze mosaic hybrid zones (e.g., Harrison 1986; Bierne et al. 2003) and other hybrid zones lacking a major geographic axis.
TESTING ROBUSTNESS OF OUR ESTIMATES OF COUPLING FROM SIMULATED DATA
Our goal was to use the relationship between cline parameters and θ documented above to estimate θ in empirical data sets. Before doing so, we conducted several tests of the robustness of our results based on simulations. These are detailed in the online supplemental material, but we highlight the key results here. First, we compared the performance of the linear regression models used to estimate θ to a nonlinear regression approach, specifically the random forest machine learning model. The out-of-bag (i.e., for simulations not used to fit the model) percent variance explained from a random forest model relating σc and σv to θ was similar to the explanatory power of a linear regression model (random forest out-of-bag variance explained = 58.0% compared to linear model r2 = 0.523), suggesting robustness of our analytical approach. We also found that estimates of the cline parameter SDs were reasonably robust to alternative sampling schemes within the simulated hybrid zones (see the supplemental material; supplemental Fig. S4). Likewise, our results were qualitatively consistent for different genetic map sizes. Specifically, across a set of simulations that included 0.5, 1, or 2 Morgan chromosomes, σc and σv explained 37% of the variation in the simulated values of θ (linear regression, model P < 0.0001) (see the supplemental material; supplemental Fig. S5 for details). Nonetheless, differences in map size quantitatively affected patterns of hybridization and introgression such that the number of hybrids and the SD in clines declined more rapidly with increased θ when recombination occurred less frequently (supplemental Fig. S5A). Consequently, a regression model incorporating an interaction between map size and the cline SDs increased our ability to explain variation in θ from r2 = 0.371 to r2 = 0.534.
Last, we conducted a full additional set of simulations with a lower rate of gene flow (m = 0.05). We found that the relationship between cline parameters and θ was weaker in these simulations (linear model r2 = 0.415, random forest out-of-bag variance explained = 50.6%) (supplemental Fig. S6), and that estimates of coupling coefficients for these simulations based on models fit from the original simulations with m = 0.1 or 0.2 were less accurate (estimated values of θ based on the linear regression model fit for m = 0.1 and 0.2 explained 22.5% of the variation in the true, simulated values of θ for m = 0.05). Importantly, random forest regression models fit for m = 0.1 and 0.2 failed to predict θ values for m = 0.05 (out-of-bag variance explained ∼0%) (in contrast both linear regression and random forest performed well in terms of predicting m = 0.1 results from models fit based on m = 0.2 and vice versa; see supplemental Fig. S7). Because of this, we focus mostly on the linear regression model for predicting θ with empirical hybrid zones. We also caution that our estimates of θ for the empirical data sets should be taken as estimates given the specific simulation conditions and linear model used here (we discuss ways that this could be improved in the Concluding Remarks section). However, our estimates of cline SDs for the empirical hybrid zones are themselves informative about the variability of clines across the genome and thus the extent that evolutionary processes are operating on all loci similarly, which is central to understanding hybrid zone dynamics and speciation regardless of the connection to a theoretical coupling coefficient.
Empirical Data Sets
We fit genomic cline models for 25 empirical hybrid zones (see Table 1 for details) to determine whether these data sets smoothly spanned the continuum from weak to strong coupling (or likewise, high to low cline variances). We chose these data sets to span taxonomic diversity and only included hybrid zones with genome-wide single nucleotide polymorphism (SNP) data. This is not meant to be an exhaustive set of hybrid zones, but we hope it is representative of the genomic hybrid zone data sets that exist in the literature. Our use of these data sets to estimate θ assumes that each is a hybrid zone maintained by selection against hybrids (e.g., a tension zone) or at least that hybrids are not favored within the hybrid zone (as they would be in models of bounded hybrid superiority) (Moore 1977). Past theory and reviews suggest that most hybrid zones are tension zones or at least exhibit clines similar to expectations from tension zones (Barton and Hewitt 1985). Moreover, by using genomic rather than geographic clines, our analyses can extend to cases where hybridization does not occur in a simple geographic context (e.g., to mosaic hybrid zones). Thus, we expect this assumption to be at least reasonable for most of the data sets, and even if a few do not fit the tension zone model very well, this should not qualitatively alter our core conclusions. Details of the data sets and data processing are provided in the online supplemental material.
Table 1.
Empirical hybrid zone data sets
| Organism | Taxonomic group | Divergence time (MYA) | Number of loci | SD cline center (σc) | SD cline slope (σv) | θ | References |
|---|---|---|---|---|---|---|---|
| Agalychnis | Amphibian | – | 25 | 1.42 | 0.56 | 0.12 | Akopyan et al. 2020 |
| Alouatta | Mammal | ∼3.0 | 1000 | 1.02 | 0.23 | 0.65 | Baiz et al. 2019 |
| Coenonympha | Insect | <0.02 | 81 | 1.11 | 0.31 | 0.44 | Capblancq et al. 2020 |
| Corvus | Bird | ∼0.443 | 588 | 0.86 | 0.40 | 0.14 | Slager et al. 2020 |
| Crotalus | Reptile | 3.0–5.2 | 1000 | 0.91 | 0.33 | 0.37 | Nikolakis et al. 2022 |
| Encelia | Plant | ∼1.4 | 1000 | 0.38 | 0.31 | 0.41 | DiVittorio et al. 2020 |
| Fundulus | Fish | – | 1000 | 0.45/0.33 | 0.12/0.08 | 1.30/1.58a | Schaefer et al. 2016 |
| Gryllus | Insect | ∼0.2 | 110 | 0.81/0.48 | 0.24/0.15 | 0.67/1.15a | Larson et al. 2014 |
| Hirundo | Bird | <0.1 | 54 | 0.53 | 0.31 | 0.42 | Scordato et al. 2017 |
| Iris | Plant | – | 1000 | 1.16 | 0.19 | 0.71 | Sung et al. 2018 |
| Lissotriton | Amphibian | ∼1.0 | 737/730 | 0.35/0.97 | 0.14/0.14 | 1.26/0.92a | Zieliński et al. 2019 |
| Lycaeides | Insect | ∼2.4 | 500 | 0.94 | 0.35 | 0.31 | Chaturvedi et al. 2020 |
| Mus | Mammal | ∼0.5 | 1000 | 0.69/0.70/1.01 | 0.16/0.14/0.23 | 1.00/1.07/0.66a | Janoušek et al. 2012 |
| Mytilus | Bivalve | ∼3.5 | 419 | 0.29 | 0.27 | 0.62 | Saarman and Pogson 2015 |
| Nematocharax | Fish | 0.27–0.67 | 52 | 0.80 | 0.41 | 0.08 | Barreto et al. 2020 |
| Neotoma | Mammal | ∼1.6 | 623 | 0.52 | 0.26 | 0.64 | Jahner et al. 2021 |
| Oleria | Insect | – | 1000 | 1.73 | 0.50 | 0.42 | Gauthier et al. 2020 |
| Papilio | Insect | 0.5–0.6 | 164 | 0.29 | 0.21 | 0.93 | Ryan et al. 2017 |
| Papio | Mammal | 1.0–2.2 | 501 | 0.84 | 0.41 | 0.10 | Chiou et al. 2021 |
| Picea | Plant | 12.5–15.0 | 221 | 0.67 | 0.22 | 0.77 | Hamilton et al. 2013 |
| Pinus | Plant | ∼18.04 | 670 | 1.10 | 0.37 | 0.29 | Menon et al. 2021) |
| Poecile | Bird | ∼1.5 | 1000 | 0.94/0.82 | 0.40/0.38 | 0.17/0.19a | Unpublished |
| Sceloporus | Reptile | 0.045–1.9 | 38 | 1.09 | 0.21 | 0.69 | Leaché et al. 2017 |
| Sternotherus | Reptile | <4.0 | 798 | 0.57 | 0.28 | 0.55 | Scott et al. 2019 |
| Yucca | Plant | 0.1–0.2 | 1000 | 0.64 | 0.15 | 1.06 | Royer et al. 2016 |
(SD) Standard deviation, (Θ) coupling coefficient.
aDenotes that there were multiple transects that had coupling coefficients calculated for these organisms.
For each data set, we analyzed only ancestry informative loci (here defined as SNPs with an allele frequency difference between the parental taxa > 0.3), and we limited our analysis to 1000 randomly sampled loci (mean number of loci = 583, SD = 389, minimum = 25, maximum = 1000). We chose a minimum allele frequency difference cutoff of 0.3 to ensure loci carried sufficient information about genetic ancestry to provide meaningful estimates of genomic clines (clines in ancestry), while also not excluding data sets with lower levels of genetic differentiation (see, e.g., Gompert et al. 2012). Still, this does not ensure an identical level of ancestry informativeness of loci across data sets. We limited the number of loci to 1000 as our goal was to efficiently estimate the variance in clines (even a sample size of 25 provides a relatively precise estimate of a variance) not to describe detailed patterns of introgression across the genome. Additionally, using no more than 1000 loci minimizes the effect of LD caused by tight physical linkage on our results (as opposed to LD more generally, which is a signal of interest). Cline variances were inferred using the Bayesian genomic cline model described above with eight HMC chains each comprising a 1000-iteration warmup and 1500 iterations for sampling. We then estimated coupling coefficients based on the values of σc and σv and the parameterized linear model or random forest regression model from the simulated data sets (the parameterized models were based on the core simulations with m = 0.1 or 0.2).
For the 25 empirical hybrid zones, estimated SDs in genomic cline center (σc) and slope (σv) ranged from 0.29 to 1.73 (mean = 0.83) and 0.12 to 0.56 (mean = 0.29), respectively, and thus broadly overlapped with estimates from our simulated hybrid zones (Figs. 4 and 5; see supplemental Fig. S8 for the distribution of hybrid indexes in each hybrid zone). Using the linear model fit on the simulated data sets, estimates of the coupling coefficient (θ) for the empirical hybrid zone data sets ranged from 0.08 to 1.30 (mean = 0.55). We found no evidence of a gap in inferred coupling coefficients, rather we documented a smooth continuum from low to high coupling across these 25 data sets (Fig. 5). Moreover, similar results were obtained when estimating θ from our fit random forest regression model; this was true both with respect to the distribution of coupling coefficients across the data sets (supplemental Fig. S9) and the specific estimates of θ for each data set (Pearson correlation = 0.82, 95% confidence intervals (CIs) = 0.63–0.92) (supplemental Fig. S10). As noted in the preceding section, both the linear regression and random forest models were parameterized from a specific set of simulations and the relationship between the cline variances and θ could (and likely does) vary under different conditions. However, we also documented a smooth continuum in the cline SDs (σc and σv) across the 25 hybrid zone data sets. This suggests that even if these metrics do not relate to the theoretical parameter θ in the manner suggested here, we still find a smooth continuum in the degree to which loci introgress independently (uncoupled) or not (coupled) across these hybrid zones, and it is this that is most relevant for our understanding of speciation.
Figure 4.
Summary of genomic clines empirical hybrid zones. The plots show estimated genomic clines (gray lines) for each of 25 hybrid zones. Each cline denotes the probability of ancestry for species 2 as a function of hybrid index (the total proportion of the genome inherited from species 2). Clines for 100 randomly chosen loci (or all loci if there were fewer than 100) are shown. The dashed one-to-one line denotes an ancestry probability equal to hybrid index. Our estimate of the coupling coefficient based on the variability among clines is reported.
Figure 5.
Cline variances and coupling in empirical hybrid zones. Panel A shows estimates of the standard deviation (SD) for genomic cline center (σc) and slope (σv) from the simulated (light gray points) and empirical (colored points) hybrid zones. Coupling coefficients for the 25 empirical hybrid zones were estimated from a linear regression model fit with the simulated hybrid zone data. Panel B plots the estimated coupling coefficients (θ) for the empirical hybrid zones sorted from smallest to largest. The horizontal line denotes θ = 1. Panels C and D show estimates of the SD for genomic cline center (σc) and slope (σv) for each of the 25 empirical hybrid zone data sets, here sorted from the largest (high variability among clines) to smallest (low variability among clines). These data suggest a continuum of estimated coupling coefficients (B) and cline parameters SDs (C,D).
Our analyses above consider a single hybrid zone for each species, but the evolutionary outcomes of secondary contact can vary among populations due to genomic and ecological context (Gompert et al. 2017; Mandeville et al. 2017). Thus, to assess this possibility with respect to coupling and thereby further evaluate the robustness of our general results to details of the systems considered, we compared estimates of coupling across transects for a subset of species. Specifically, the data sets we obtained included five species with two hybrid zones or transects, and one species (Mus) with three transects, resulting in eight pairs of transects (only one transect from each was included in the core analyses above). Using these additional transects, we found moderate consistency in estimates of θ for the pairs (Pearson correlation = 0.67, 95% CI = −0.17–0.95, P = 0.101, similar results were obtained for estimates based on random forest regression, r = 0.75, 95% CI = −0.01–0.96, P = 0.053) (supplemental Fig. S10; supplemental Table S3).
Last, given the widespread interest in the role of sex chromosomes in speciation (Haldane 1922; Coyne and Orr 1989) and observation that sex chromosomes often have steeper clines across hybrid zones (e.g., Tucker et al. 1992; Carling and Brumfield 2008; Hooper et al. 2019), we asked whether coupling was stronger (higher estimates of θ) for sex chromosomes than autosomes. For this, we ran additional analyses separately estimating cline SDs and θ for eight hybrid zone data sets (including replicate transects) that we were especially familiar with and where there was clear information about which SNPs were on the X or Z sex chromosome versus autosomes (see supplemental Table S4). Importantly, differences in recombination rates or effective population size for sex chromosomes relative to autosomes, in addition to differences in the number of barrier loci, could contribute to differences in estimates of θ. Across these data sets, estimates of θ for autosomes and sex chromosomes were positively correlated (Pearson correlation = 0.65, 95% CI = −0.09–0.93, P = 0.078). Moreover, in seven of the eight data sets θ was higher for the X/Z chromosome than for the autosomes (mean difference in θ across data sets = 0.12) (supplemental Table S4). The one exception was for the Gryllus Connecticut transect where θ was notably higher for the autosomes (0.86) than for the X chromosome (0.39).
CONCLUDING REMARKS
We documented a smooth continuum of coupling across 25 natural hybrid zones, ranging from very weak coupling to near-complete coupling suggestive of a strong genome-wide barrier to gene flow (θ > 1) (Fig. 5B). As such, we found no evidence of a tipping point or positive feedback loop near θ = 1 that would rapidly drive systems to higher levels of coupling resulting in a dearth of systems with θ∼1 (analogous to the gap in genetic differentiation documented in sympatric Timema stick insects, see Riesch et al. 2017). Several factors likely contributed to our finding of a smooth continuum of inferred coupling coefficients. First, if species mostly diverge in allopatry with limited gene flow, there would be limited opportunity for the feedback of increased coupling to reduce effective migration and further increase LD among barrier loci. This is consistent with findings from theoretical work by Flaxman et al. (2014) that found evidence of a sudden transition to genome-wide congealing (analogous to coupling) when gene flow was strong relative to selection, but more gradual divergence otherwise (also see Sinitambirivoutin et al. 2023). However, this feedback could still operate upon secondary contact and thus in hybrid zones if species come into contact before speciation is complete (see, e.g., Flaxman et al. 2014). Second, different organisms vary in ecology and demographic histories, including the time since secondary contact, and these differences could act to smooth out the empirical distributions of cline SDs and estimates of coupling across systems (this is part of a general difficulty in treating pairs of species as a reconstructed “chrono-sequence”; see, e.g., Nosil et al. 2017; Bolnick et al. 2023). Third, different sets of loci, such as barrier loci versus neutral loci, can become coupled at different rates (Barton 1983; Barton and Bengtsson 1986; Schilling et al. 2018). Our analyses necessarily average over such variation, which could contribute to the continuum of coupling documented here. And finally, some predictions for a sudden transition in coupling in terms of parameter space may not reflect the temporal dynamics by which reproductive isolation evolves. For example, we may not see the expected transition from uncoupled to coupled dynamics at θ = 1 when holding R and total selection constant but increasing L (Barton 1983) if speciation does not progress with total selection held constant.
Despite the smooth continuum we detected in terms of cline SDs and estimated coupling coefficients, we did observe patterns that suggest a transition in hybrid zone dynamics around θ = 1. Specifically, in simulations, hybrids become increasingly rare around θ = 1 and few of our empirical hybrid zones had estimates of θ notably larger than 1. Thus, consistent with past work (e.g., Barton 1983), our results suggest that by θ = 1 the overall barrier to gene flow across much of the genome is quite strong. The small number of empirical hybrid zones with θ larger than 1 suggests that species pairs with such high levels of coupling rarely have patterns that evolutionary biologists would classify as hybrid zones. In other words, θ = 1 might roughly approximate a genome-wide species boundary even if the approach to this boundary occurs gradually and at a near constant rate with increases in θ (e.g., with an increase in the number of barrier loci and thus a decrease in r).
Our results are relevant for at least two other classic issues in the study of hybrid zones and speciation. First, considerable work has been done on the consistency or variability of overall and locus-specific patterns of introgression across replicate hybrid zones or hybrid zone transects (Buerkle and Rieseberg 2001; Nolte et al. 2009; Janoušek et al. 2012; Larson et al. 2014; Schaefer et al. 2016; Mandeville et al. 2017). This is relevant as it bears on the degree to which reproductive isolation and hybridization outcomes are contingent on the ecological context of secondary contact. Such analyses have rarely, if ever, considered cline SDs and coupling explicitly when comparing hybrid zones. We found relatively high levels of consistency even in systems where locus-specific patterns of introgression have been shown to be less consistent, that is, where patterns of introgression for individual loci have varied substantially across transects (e.g., Mus; Teeter et al. 2010). Thus, our results hint at a greater consistency in the overall barrier to gene flow, as captured by cline SDs and estimates of θ, than in locus-specific patterns. Second, there is considerable evidence that sex chromosomes (X or Z) contribute disproportionately to reproductive isolation and exhibit reduced introgression in hybrid zones (Tucker et al. 1992; Carling and Brumfield 2008; Janoušek et al. 2012; Hooper et al. 2019). Our results suggest that, consistent with these other patterns, sex chromosomes show higher levels of coupling (lower cline SDs) than autosomes, which could be at least partially responsible for the lower overall rates of introgression (Muirhead and Presgraves 2016).
Our simulations and analyses documented a link between genome-wide variation in clines and a theoretical quantity relevant for understanding the extent to which genes or the entire genome experience a barrier to gene flow (the coupling coefficient, θ). Using simulations, we showed that this relationship is somewhat consistent under different conditions, but does vary to an extent and is likely to vary even more under conditions that diverge more from those we considered, such as in mosaic hybrid zones lacking a strong spatial axis. Our focus here was on cline SDs, but additional information, such as the prevalence of hybrids or the distribution of hybrids and introgression in space, could provide additional information about coupling. Using such additional metrics perhaps combined with more tailored simulations based on individual hybrid zones could provide a powerful framework to infer coupling coefficients using, for example, approximate Bayesian computation or neural networks.
More generally, our results demonstrate the relevance of quantifying cline variances rather than focusing solely on patterns of introgression for individual loci in analyses of hybrid zones. Under some conditions, especially strong coupling in the later stages of speciation, cline variances could be more informative about the process of speciation than patterns of introgression for individual loci. In contrast, when coupling is weak, loci resistant to introgression would be more likely to reside in genomic regions causally connected to reproductive isolation (Gompert et al. 2012). Additional understanding could come from quantifying cline variances and coupling at different genomic scales. For example, here we documented differences in variances and coupling for autosomes versus sex chromosomes. Finer scale analyses looking at coupling along chromosomes (e.g., in megabase windows) or within versus outside of large structural variants could provide additional insights on how the ratio of selection to recombination varies across the genome and thus on the genetics of speciation. In light of our findings, we think further empirical analyses of cline variances and coupling focused on the transition between weak and strong coupling, ideally within specific taxonomic groups, are critical for advancing understanding of the dynamics of speciation.
DATA AND CODE AVAILABILITY
Simulated hybrid zone data sets and our formatted input files for the empirical hybrid zones are available on Zenodo (doi: 10.5281/zenodo.8231089). Computer scripts used for simulations, data processing, and analyses are available from GitHub, github.com/zgompert/ClineCoupling.
AUTHOR CONTRIBUTIONS
All authors conceived of and designed the study. T.J.F. compiled the hybrid zone data, with help from G.S., S.A.T., E.L.L., and Z.G. Z.G. conducted the hybrid zone simulations and analyzed the data. T.J.F. and Z.G. wrote an initial draft. All authors contributed substantially to editing and revising the material.
Supplementary Material
ACKNOWLEDGMENTS
The support and resources from the Center for High Performance Computing at the University of Utah are gratefully acknowledged. The National Science Foundation provided support for this project to Z.G. (NSF DEB 1844941), T.J.F. (NSF DBI 2208825), E.L.L. (NSF DEB 2012041), and S.A.T. (NSF IOS-1754909; DEB-192889). Silhouettes for Figure 4 were obtained from phylopic.org. We thank our colleagues who generated each of the 25 empirical hybrid zone data sets for making these data publicly available or providing them to us upon our request.
Footnotes
Editors: Catherine L. Peichel, Daniel I. Bolnick, Åke Brännström, Ulf Dieckmann, and Rebecca J. Safran
Additional Perspectives on Speciation available at www.cshperspectives.org
REFERENCES
*Reference is also in this subject collection.
- Abbott R, Albach D, Ansell S, Arntzen JW, Baird SJ, Bierne N, Boughman J, Brelsford A, Buerkle CA, Buggs R, et al. 2013. Hybridization and speciation. J Evol Biol 26: 229–246. 10.1111/j.1420-9101.2012.02599.x [DOI] [PubMed] [Google Scholar]
- Akopyan M, Gompert Z, Klonoski K, Vega A, Kaiser K, Mackelprang R, Rosenblum EB, Robertson JM. 2020. Genetic and phenotypic evidence of a contact zone between divergent colour morphs of the iconic red-eyed treefrog. Mol Ecol 29: 4442–4456. 10.1111/mec.15639 [DOI] [PubMed] [Google Scholar]
- Bailey R. 2022. Bayesian hybrid index and genomic cline estimation with the R package gghybrid. Authorea, March 28, 2022. 10.22541/au.164848698.82546348/v1 [DOI] [PubMed] [Google Scholar]
- Baird SJE. 1995. A simulation study of multilocus clines. Evolution (N Y) 49: 1038–1045. 10.2307/2410429 [DOI] [PubMed] [Google Scholar]
- Baiz MD, Tucker PK, Cortés-Ortiz L. 2019. Multiple forms of selection shape reproductive isolation in a primate hybrid zone. Mol Ecol 28: 1056–1069. 10.1111/mec.14966 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barreto SB, Knowles LL, de Affonso MPRA, Batalha-Filho H. 2020. Riverscape properties contribute to the origin and structure of a hybrid zone in a neotropical freshwater fish. J Evol Biol 33: 1530–1542. 10.1111/jeb.13689 [DOI] [PubMed] [Google Scholar]
- Barton NH. 1983. Multilocus clines. Evolution (N Y) 37: 454–471. 10.2307/2408260 [DOI] [PubMed] [Google Scholar]
- Barton N, Bengtsson BO. 1986. The barrier to genetic exchange between hybridising populations. Heredity (Edinb) 57: 357–376. 10.1038/hdy.1986.135 [DOI] [PubMed] [Google Scholar]
- Barton NH, de Cara MAR. 2009. The evolution of strong reproductive isolation. Evolution (N Y) 63: 1171–1190. 10.1111/j.1558-5646.2009.00622.x [DOI] [PubMed] [Google Scholar]
- Barton NH, Hewitt GM. 1985. Analysis of hybrid zones. Annu Rev Ecol Syst 16: 113–148. 10.1146/annurev.es.16.110185.000553 [DOI] [Google Scholar]
- Barton NH, Hewitt GM. 1989. Adaptation, speciation and hybrid zones. Nature 341: 497–503. 10.1038/341497a0 [DOI] [PubMed] [Google Scholar]
- Bierne N, Borsa P, Daguin C, Jollivet D, Viard F, Bonhomme F, David P. 2003. Introgression patterns in the mosaic hybrid zone between Mytilus edulis and M. galloprovincialis. Mol Ecol 12: 447–461. 10.1046/j.1365-294X.2003.01730.x [DOI] [PubMed] [Google Scholar]
- Bierne N, Welch J, Loire E, Bonhomme F, David P. 2011. The coupling hypothesis: why genome scans may fail to map local adaptation genes. Mol Ecol 20: 2044–2072. 10.1111/j.1365-294X.2011.05080.x [DOI] [PubMed] [Google Scholar]
- Bolnick DI, Hund AK, Nosil P, Peng F, Ravinet M, Stankowski S, Subramanian S, Wolf JBW, Yukilevich R. 2023. A multivariate view of the speciation continuum. Evolution (N Y) 77: 318–328. 10.1093/evolut/qpac004 [DOI] [PubMed] [Google Scholar]
- Buerkle CA, Rieseberg LJ. 2001. Low intraspecific variation for genomic isolation between hybridizing sunflower species. Evolution (N Y) 55: 684–691. 10.1554/0014-3820(2001)055[0684:LIVFGI]2.0.CO;2 [DOI] [PubMed] [Google Scholar]
- Buerkle CA, Rieseberg LJ. 2008. The rate of genome stabilization in homoploid hybrid species. Evolution (N Y) 62: 266–275. 10.1111/j.1558-5646.2007.00267.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Capblancq T, Després L, Mavárez J. 2020. Genetic, morphological and ecological variation across a sharp hybrid zone between two alpine butterfly species. Evol Appl 13: 1435–1450. 10.1111/eva.12925 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carling MD, Brumfield RT. 2008. Haldane's rule in an avian system: using cline theory and divergence population genetics to test for differential introgression of mitochondrial, autosomal, and sex-linked loci across the Passerina bunting hybrid zone. Evolution (N Y) 62: 2600–2615. 10.1111/j.1558-5646.2008.00477.x [DOI] [PubMed] [Google Scholar]
- Chaturvedi S, Lucas LK, Buerkle CA, Fordyce JF, Forister ML, Nice CC, Gompert Z. 2020. Recent hybrids recapitulate ancient hybrid outcomes. Nat Commun 11: 2179. 10.1038/s41467-020-15641-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chiou KL, Bergey CM, Burrell AS, Disotell TR, Rogers J, Jolly CJ, Phillips-Conroy JE. 2021. Genome-wide ancestry and introgression in a Zambian baboon hybrid zone. Mol Ecol 30: 1907–1920. 10.1111/mec.15858 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coyne JA, Orr AH. 1989. Two rules of speciation. In Speciation and its consequences (ed. Otte D, Endler JA), pp. 180–207. Sinauer Associates, Sunderland, MA. [Google Scholar]
- Cruzan MB, Thompson PG, Diaz NA, Hendrickson EC, Gerloff KR, Kline KA, Machiorlete HM, Persinger JM. 2021. Weak coupling among barrier loci and waves of neutral and adaptive introgression across an expanding hybrid zone. Evolution (N Y) 75: 3098–3114. 10.1111/evo.14381 [DOI] [PMC free article] [PubMed] [Google Scholar]
- *.Dopman EB, Shaw KL, Servedio MR, Butlin R, Smadja CM. 2023. Coupling barriers to gene exchange: causes and consequences. Cold Spring Harb Perspect Biol 10.1101/cshperspect.a041432 [DOI] [PMC free article] [PubMed] [Google Scholar]
- DiVittorio CT, Singhal S, Roddy AB, Zapata F, Ackerly DD, Baldwin BG, Brodersen CR, Búrquez A, Fine PVA, Padilla Flores M, et al. 2020. Natural selection maintains species despite frequent hybridization in the desert shrub Encelia. Proc Natl Acad Sci 117: 33373–33383. 10.1073/pnas.2001337117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fitzpatrick BM. 2013. Alternative forms for genomic clines. Ecol Evol 3: 1951–1966. 10.1002/ece3.609 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flaxman SM, Wacholder AC, Feder JL, Nosil P. 2014. Theoretical models of the influence of genomic architecture on the dynamics of speciation. Mol Ecol 23: 4074–4088. 10.1111/mec.12750 [DOI] [PubMed] [Google Scholar]
- Gauthier J, de Silva DL, Gompert Z, Whibley A, Houssin C, Le Poul Y, McClure M, Lemaitre C, Legeai F, Mallet J, et al. 2020. Contrasting genomic and phenotypic outcomes of hybridization between pairs of mimetic butterfly taxa across a suture zone. Mol Ecol 29: 1328–1343. 10.1111/mec.15403 [DOI] [PubMed] [Google Scholar]
- Gompert Z, Buerkle CA. 2009. A powerful regression-based method for admixture mapping of isolation across the genome of hybrids. Mol Ecol 18: 1207–1224. 10.1111/j.1365-294X.2009.04098.x [DOI] [PubMed] [Google Scholar]
- Gompert Z, Buerkle CA. 2011. Bayesian estimation of genomic clines. Mol Ecol 20: 2111–2127. 10.1111/j.1365-294X.2011.05074.x [DOI] [PubMed] [Google Scholar]
- Gompert Z, Parchman TL, Buerkle CA. 2012. Genomics of isolation in hybrids. Philos Trans R Soc Lond B Biol Sci 367: 439–450. 10.1098/rstb.2011.0196 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gompert Z, Mandeville EG, Buerkle CA. 2017. Analysis of population genomic data from hybrid zones. Annu Rev Ecol Evol Syst 48: 207–229. 10.1146/annurev-ecolsys-110316-022652 [DOI] [Google Scholar]
- Haldane JBS. 1922. Sex ratio and unisexual sterility in hybrid animals. J Genet 12: 101–109. 10.1007/BF02983075 [DOI] [Google Scholar]
- Hamilton JA, Lexer C, Aitken SN. 2013. Genomic and phenotypic architecture of a spruce hybrid zone (Picea sitchensis × P. glauca). Mol Ecol 22: 827–841. 10.1111/mec.12007 [DOI] [PubMed] [Google Scholar]
- Harrison RG. 1986. Pattern and process in a narrow hybrid zone. Heredity (Edinb) 56: 337–349. 10.1038/hdy.1986.55 [DOI] [Google Scholar]
- Harrison RG, Larson EL. 2014. Hybridization, introgression, and the nature of species boundaries. J Hered 105: 795–809. 10.1093/jhered/esu033 [DOI] [PubMed] [Google Scholar]
- Hooper DM, Griffith SC, Price TD. 2019. Sex chromosome inversions enforce reproductive isolation across an avian hybrid zone. Mol Ecol 28: 1246–1262. 10.1111/mec.14874 [DOI] [PubMed] [Google Scholar]
- Jahner JP, Parchman FL, Matocq MD. 2021. Multigenerational backcrossing and introgression between two woodrat species at an abrupt ecological transition. Mol Ecol 30: 4245–4258. 10.1111/mec.16056 [DOI] [PubMed] [Google Scholar]
- Janoušek V, Wang L, Luzynski K, Dufková P, Vyskočilová MM, Nachman MW, Munclinger P, Macholán M, Piálek J, Tucker PK. 2012. Genome-wide architecture of reproductive isolation in a naturally occurring hybrid zone between Mus musculus musculus and M. m. domesticus. Mol Ecol 21: 3032–3047. 10.1111/j.1365-294X.2012.05583.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kautt AF, Kratochwil CF, Nater A, Machado-Schiaffino G, Olave M, Henning F, Torres-Dowdall J, Härer A, Hulsey CD, Franchini P, et al. 2020. Contrasting signatures of genomic divergence during sympatric speciation. Nature 588: 106–111. 10.1038/s41586-020-2845-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozak GM, Wadsworth CB, Kahne SC, Bogdanowicz SM, Harrison RG, Coates BS, Dopman EB. 2019. Genomic basis of circannual rhythm in the European corn borer moth. Curr Biol 29: 3501–3509.e5. 10.1016/j.cub.2019.08.053 [DOI] [PubMed] [Google Scholar]
- Kruuk LE, Baird SJ, Gale KS, Barton NH. 1999. A comparison of multilocus clines maintained by environmental adaptation or by selection against hybrids. Genetics 153: 1959–1971. 10.1093/genetics/153.4.1959 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kunerth HD, Bogdanowicz SM, Searle JB, Harrison RG, Coates BS, Koak GM, Dopman EB. 2022. Consequences of coupled barriers to gene flow for the build-up of genomic differentiation. Evolution (N Y) 76: 985–1002. 10.1111/evo.14466 [DOI] [PubMed] [Google Scholar]
- Larson EL, White TA, Ross CL, Harrison RG. 2014. Gene flow and the maintenance of species boundaries. Mol Ecol 23: 1668–1678. 10.1111/mec.12601 [DOI] [PubMed] [Google Scholar]
- Leaché AD, Grummer JA, Harris RB, Breckheimer IK. 2017. Evidence for concerted movement of nuclear and mitochondrial clines in a lizard hybrid zone. Mol Ecol 26: 2306–2316. 10.1111/mec.14033 [DOI] [PubMed] [Google Scholar]
- Lindtke D, Buerkle CA. 2015. The genetic architecture of hybrid incompatibilities and their effect on barriers to introgression in secondary contact. Evolution (N Y) 69: 1987–2004. 10.1111/evo.12725 [DOI] [PubMed] [Google Scholar]
- Mandeville EG, Parchman TL, Thompson KG, Compton RI, Gelwicks KR, Song SJ, Buerkle CA. 2017. Inconsistent reproductive isolation revealed by interactions between Catostomus fish species. Evol Lett 1: 255–268. 10.1002/evl3.29 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin SH, Davey JW, Salazar C, Jiggins CD. 2019. Recombination rate variation shapes barriers to introgression across butterfly genomes. PLoS Biol 17: e2006288. 10.1371/journal.pbio.2006288 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Menon M, Bagley JC, Page GFM, Whipple AV, Schoettle AW, Still CJ, Wehenkel C, Waring KM, Flores-Renteria L, Cushman SA, et al. 2021. Adaptive evolution in a conifer hybrid zone is driven by a mosaic of recently introgressed and background genetic variants. Commun Biol 4: 160. 10.1038/s42003-020-01632-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Michel AP, Sim S, Powell THQ, Taylor MS, Nosil P, Feder JL. 2010. Widespread genomic divergence during sympatric speciation. Proc Natl Acad Sci 107: 9724–9729. 10.1073/pnas.1000939107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore WS. 1977. An evaluation of narrow hybrid zones in vertebrates. Q Rev Biol 52: 263–277. 10.1086/409995 [DOI] [Google Scholar]
- Muirhead CA, Presgraves DC. 2016. Hybrid incompatibilities, local adaptation, and the genomic distribution of natural introgression between species. Am Nat 187: 249–261. 10.1086/684583 [DOI] [PubMed] [Google Scholar]
- Neal RM. 2011. Handbook of Markov chain Monte Carlo, p. 50 in handbooks of modern statistical methods. Chapman and Hall/CRC, Boca Raton, FL. [Google Scholar]
- Nikolakis ZL, Schield DR, Westfall AK, Perry BW, Ivey KN, Orton RW, Hales NR, Adams RH, Meik JM, Parker JM, et al. 2022. Evidence that genomic incompatibilities and other multilocus processes impact hybrid fitness in a rattlesnake hybrid zone. Evolution (N Y) 76: 2513–2530. 10.1111/evo.14612 [DOI] [PubMed] [Google Scholar]
- Nolte AW, Gompert Z, Buerkle CA. 2009. Variable patterns of introgression in two sculpin hybrid zones suggest that genomic isolation differs among populations. Mol Ecol 18: 2615–2627. 10.1111/j.1365-294X.2009.04208.x [DOI] [PubMed] [Google Scholar]
- Nosil P, Schluter D. 2011. The genes underlying the process of speciation. Trends Ecol Evol 26: 160–167. 10.1016/j.tree.2011.01.001 [DOI] [PubMed] [Google Scholar]
- Nosil P, Feder JL, Flaxman SM, Gompert Z. 2017. Tipping points in the dynamics of speciation. Nat Ecol Evol 1: 1. 10.1038/s41559-016-0001 [DOI] [PubMed] [Google Scholar]
- Nosil P, Feder JL, Gompert Z. 2021. How many genetic changes create new species? Science 371: 777–779. 10.1126/science.abf6671 [DOI] [PubMed] [Google Scholar]
- *.Peñalba JV, Runemark A, Meier J, Singh P, Wogan G, Sánchez-Guillén R, Mallet J, Rometsch SJ, Menon M, Seehausen O, et al. 2023. The role of hybridization in species formation. Cold Spring Harb Perspect Biol 10.1101/cshperspect.a041445 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riesch R, Muschick M, Lindtke D, Villoutreix R, Comeault AA, Farkas TE, Lucek K, Hellen E, Soria-Carrasco V, Dennis SR, et al. 2017. Transitions between phases of genomic differentiation during stick-insect speciation. Nat Ecol Evol 1: 82. 10.1038/s41559-017-0082 [DOI] [PubMed] [Google Scholar]
- *.Ritchie MG, Butlin RK. 2023. Genetic coupling in the genomic era. Cold Spring Harb Perspect Biol 10.1101/cshperspect.a041437 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roux C, Fraïsse C, Romiguier J, Anciaux Y, Galtier N, Bierne N. 2016. Shedding light on the grey zone of speciation along a continuum of genomic divergence. PLoS Biol 14: e2000234. 10.1371/journal.pbio.2000234 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Royer AM, Streisfeld MA, Smith CI. 2016. Population genomics of divergence within an obligate pollination mutualism: selection maintains differences between Joshua tree species. Am J Bot 103: 1730–1741. 10.3732/ajb.1600069 [DOI] [PubMed] [Google Scholar]
- Ryan SF, Fontaine MC, Scriber JM, Pfrender ME, O'Neil ST, Hellmann JJ. 2017. Patterns of divergence across the geographic and genomic landscape of a butterfly hybrid zone associated with a climatic gradient. Mol Ecol 26: 4725–4742. 10.1111/mec.14236 [DOI] [PubMed] [Google Scholar]
- Saarman NP, Pogson GH. 2015. Introgression between invasive and native blue mussels (genus Mytilus) in the central California hybrid zone. Mol Ecol 24: 4723–4738. 10.1111/mec.13340 [DOI] [PubMed] [Google Scholar]
- Schaefer J, Duvernell D, Campbell DC. 2016. Hybridization and introgression in two ecologically dissimilar Fundulus hybrid zones. Evolution (N Y) 70: 1051–1063. 10.1111/evo.12920 [DOI] [PubMed] [Google Scholar]
- Schilling MP, Mullen SP, Kronforst M, Safran RJ, Nosil P, Feder JL, Gompert Z, Flaxma SM. 2018. Transitions from single- to multi-locus processes during speciation with gene flow. Genes (Basel) 9: 274. 10.3390/genes9060274 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scordato ESC, Wilkins MR, Semenov G, Rubtsov AS, Kane NC, Safran RJ. 2017. Genomic variation across two barn swallow hybrid zones reveals traits associated with divergence in sympatry and allopatry. Mol Ecol 26: 5676–5691. 10.1111/mec.14276 [DOI] [PubMed] [Google Scholar]
- Scott PA, Glenn TC, Rissler LJ. 2019. Formation of a recent hybrid zone offers insight into the geographic puzzle and maintenance of species boundaries in musk turtles. Mol Ecol 28: 761–771. 10.1111/mec.14983 [DOI] [PubMed] [Google Scholar]
- Sinitambirivoutin M, Nosil P, Flaxman S, Feder J, Gompert Z, Dakos V. 2023. Early-warning signals of impending speciation. Evolution (N Y) 77: 1444–1457. 10.1093/evolut/qpad054 [DOI] [PubMed] [Google Scholar]
- Slager DL, Epperly KL, Ha RR, Rohwer S, Wood C, Van Hemert C, Klicka J. 2020. Cryptic and extensive hybridization between ancient lineages of American crows. Mol Ecol 29: 956–969. 10.1111/mec.15377 [DOI] [PubMed] [Google Scholar]
- Sung CJ, Bell KL, Nice CC, Martin NH. 2018. Integrating Bayesian genomic cline analyses and association mapping of morphological and ecological traits to dissect reproductive isolation and introgression in a Louisiana Iris hybrid zone. Mol Ecol 27: 959–978. 10.1111/mec.14481 [DOI] [PubMed] [Google Scholar]
- Szymura JM, Barton NH. 1986. Genetic analysis of a hybrid zone between the fire-bellied toads, Bombia bombina and B. variegata, near Cracow in southern Poland. Evolution (N Y) 40: 1141–1159. [DOI] [PubMed] [Google Scholar]
- Teeter KC, Thibodeau LM, Gompert Z, Buerkle CA, Nachman MW, Tucker PK. 2010. The variable genomic architecture of isolation between hybridizing species of house mice. Evolution (N Y) 64: 472–485. 10.1111/j.1558-5646.2009.00846.x [DOI] [PubMed] [Google Scholar]
- Tucker PK, Sage RD, Warner J, Wilson AC, Eicher EM. 1992. Abrupt cline for sex chromosomes in a hybrid zone between two species of mice. Evolution (N Y) 46: 1146–1163. 10.2307/2409762 [DOI] [PubMed] [Google Scholar]
- Unbehend M, Kozak GM, Koutroumpa F, Coates BS, Dekker T, Groot AT, Heckel DG, Dopman EB. 2021. Bric à brac controls sex pheromone choice by male European corn borer moths. Nat Commun 12: 2818. 10.1038/s41467-021-23026-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vines TH, Dalziel AC, Albert AYK, Veen T, Schulte PM, Schluter D. 2016. Cline coupling and uncoupling in a stickleback hybrid zone. Evolution (N Y) 70: 1023–1038. 10.1111/evo.12917 [DOI] [PubMed] [Google Scholar]
- Wu CI. 2001. The genic view of the process of speciation. J Evol Biol 14: 851–865. 10.1046/j.1420-9101.2001.00335.x [DOI] [Google Scholar]
- Yeaman S, Whitlock MC. 2011. The genetic architecture of adaptation under migration-selection balance. Evolution (N Y) 65: 1897–1911. 10.1111/j.1558-5646.2011.01269.x [DOI] [PubMed] [Google Scholar]
- Zieliński P, Dudek K, Arntzen JW, Palomar G, Niedzicka M, Fijarczyk A, Liana M, Cogălniceanu D, Babik W. 2019. Differential introgression across newt hybrid zones: evidence from replicated transects. Mol Ecol 28: 4811–4824. 10.1111/mec.15251 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Simulated hybrid zone data sets and our formatted input files for the empirical hybrid zones are available on Zenodo (doi: 10.5281/zenodo.8231089). Computer scripts used for simulations, data processing, and analyses are available from GitHub, github.com/zgompert/ClineCoupling.





