Abstract
Apparent cases of sympatric speciation may actually be due to micro-allopatric or micro-parapatric speciation. One way to distinguish between these models is to examine the existence and nature of genomic islands of divergence, wherein divergent DNA segments are interspersed with low-divergence segments. Such islands should be rare or absent under micro-allopatric speciation but common in cases of speciation with gene flow. Sympatric divergence of endemic fishes is known from isolated saline, crater, postglacial, and ancient lakes. Two morphologically distinct cyprinid fishes, Gymnocypris eckloni scoliostomus (GS) and G. eckloni eckloni (GE), in a small glacial lake on the Qinghai–Tibet Plateau, Lake Sunmcuo, match the biogeographic criteria of sympatric speciation. In this study, we examined genome-wide variation in 46 individuals from these two groups. The divergence time between the GS and GE lineages was estimated to be 20–60 Kya. We identified 54 large genomic islands (≥100 kb) of speciation, which accounted for 89.4% of the total length of all genomic islands. These islands harboured divergent genes related to olfactory receptors and olfaction signals that may play important roles in food selection and assortative mating in fishes. Although the genomic islands clearly indicated speciation with gene flow and rejected micro-allopatric speciation, they were too large to support the hypothesis of sympatric speciation. Theoretical and recent empirical studies suggested that continual gene flow in sympatry should give rise to many small genomic islands (as small as a few kilobases in size). Thus, the observed pattern is consistent with the extensive evidence on parapatric speciation, in which adjacent habitats facilitate divergent selection but also permit gene flow during speciation. We suggest that many, if not most, of the reported cases of sympatric speciation are likely to be micro-parapatric speciation.
Keywords: sympatric speciation, gene flow, genomic islands, micro-parapatric speciation, selection, olfaction
INTRODUCTION
Allopatric speciation requires geographic barriers that completely prevent gene flow and allow the populations to evolve independently, which eventually leads to reproductive isolation (RI) [1,2]. Sympatric speciation, proposed by Darwin [3], is the evolution of RI without geographic barriers, in which new species arise from a single ancestral population [4,5]. Some biogeographic criteria suggested that sympatric speciation can occur if the two species overlap in their cruising range [6]. However, this is merely a case of sympatric coexistence From a population genetic perspective, any case in which the spatial structure of progenitor populations affects habitat selection or mating is not considered pure sympatric speciation [7–9]. In this study, pure sympatric speciation was considered divergence within a single geographical region where the range of one nascent species completely overlaps the other. These diverging groups are not separated by any spatial structure barrier in the ancestral range. The initial gene exchange rate between This mode of speciation has been controversial for over a century, partly because antagonism between selection and recombination makes this mode of speciation theoretically difficult [8,10–12].
The key aspect of sympatric speciation is that incipient species can potentially exchange genes (genetic recombination) by interbreeding, which breaks up the correlation between co-adapted gene groups necessary for species formation [13]. Therefore, the potential for sympatric speciation may be greatly restricted unless there is strong disruptive selection [12,14–18]. This selection will cause the population to divide into two subpopulations, each specialized on a different resource. The hybrids between the subpopulations are poorly adapted to either resource and eventually suffer from reduced fitness [14]. Disruptive selection contributes to the fixation of locally adaptive alleles that are beneficial in their ecological niches or mate choice [19,20]. In this case, sympatric speciation can occur as a result of habitat isolation or sexual isolation [13].
With the advent of high-throughput sequencing, and the development and application of population genomic approaches, increasing numbers of empirical case studies on sympatric speciation in both plants and animals have been reassessed [21–38]. However, these approaches have limited ability to distinguish spatiotemporal overlap because almost all empirical case studies of sympatric speciation have some degree of spatiotemporal differentiation between sister taxa [21–38]. As a result, the most inclusive definition of sympatric will skew our understanding of the truth about the speciation process. For example, consider two sister species that diverged from a single population and occupied the same biogeographic range. They are considered sympatric in terms of their broad-scale biogeography. However, if speciation that results from specialization to different habitats within their range and genetic exchange is inherently non-random, it can be also argued that speciation was not truly sympatric but rather micro-allopatric [39] or micro-parapatric by fine-scale spatial partitioning. From a population genetic perspective, it is dubbed micro-allopatric [40] or micro-parapatric, if no gene flow or restricted gene flow occurs during some episodes before the completion of speciation. Consequently, it is difficult to distinguish sympatric, micro-allopatric, and micro-parapatric speciation based solely on geographical considerations.
The main distinctions among the three modes are uninterrupted gene flow and the intensity of gene flow during speciation (Fig. 1A–C). In this genic view of speciation, speciation with gene flow has been shown to leave footprints in the form of genomic islands that are non-introgressable [27,41–45] (Fig. 1A–C). These genomic islands are frequently composed of highly divergent DNA regions interspersed with less divergent portions. In theory, the sizes of the genomic islands should be negatively correlated with gene flow between diverging populations. Therefore, sympatric speciation is expected to have small genomic islands owing to frequent bidirectional gene flow, whereas micro-parapatric speciation should leave footprints of large genomic islands, as in parapatric speciation, due to restricted gene flow. Micro-allopatric speciation will exhibit genome-wide differentiation, as in allopatric speciation, because geographic barriers prevented gene flow. In addition, these genomic islands may contain a small set of ‘speciation genes’ that govern ecological specialization or generate intrinsic genomic incompatibilities [46].
It has been suggested that the presence of endemic sister species in small circumscribed areas (e.g. isolated lakes or islands [13] or phytophagous insects) might indicate that these species originated sympatrically, such as Howea forsteriana and H. belmoreana on Lord Howe Island [47,48], cichlid fishes in crater lakes [22,27], and apple and hawthorn maggot flies [49–51]. These species are likely to undergo species differentiation in such habitats without apparent physical isolation. A previous study indicated that Lake Sunmcuo was a hydrographically isolated inland lake that did not flow into the Yellow River until approximately the Holocene period, and it is now connected by a torrential mountain stream [52]. This glacial lake is small (3.8 km2) [53], oligotrophic, and homogenous in habitat, all of which provide an ideal location for testing sympatric speciation [54].
Gymnocypris eckloni scoliostomus (GS) and G. eckloni eckloni (GE), two sister subspecies of G. eckloni (Cyprinidae: Schizothoracinae: Gymnocypris), are sympatrically distributed in Lake Sunmcuo in the Qinghai–Tibet Plateau, China at an altitude of 4100 m [55,56]. Despite their close relationship, these two subspecies show significant differences in morphology (Fig. 1D) and reproductive characteristics. GS has a terminal or superior mouth with a more oblique and deeply arched mouth cleft. It prefers to feed on plankton and spawns in July with salmon-pink eggs. By contrast, GE has a subterminal mouth without a horny ridge at the inner margin of the lower jaw. It feeds on plankton, zoobenthos, algae, hydrophytes, and small fish, and spawns in April and May with yellow eggs [52,54,55,57,58]. It is known that the shape of the fish's mouth and lower jaw are tightly linked to the different nutritional types and river depths of food. GS prefers to feed on the upper part of the water column or on shoal rich in plankton, whereas GE usually has a wider niche that corresponds to a variety of food items [46,59]. All these characteristics demonstrate that GS and GE satisfy the biogeographic criteria for sympatric speciation.
In this study, we constructed a highly contiguous genome assembly of GS, and surveyed genomic variation of the two species by whole-genome resequencing of 46 samples from Lake Sunmcuo to investigate the genomic patterns of divergence. We explored gene flow and identified highly divergent genome regions. Unexpectedly, we discovered many large genomic islands (≥100 kb) between GS and GE, even though continual gene flow in sympatry should give rise to many small genomic islands (a few kilobases in size). This pattern of genomic divergence is consistent with the extensive evidence on parapatric speciation, in which adjacent habitats facilitate divergent selection but also permit gene flow during speciation. This study will provide more accurate insights into sympatric speciation.
RESULTS
Chromosome-level assembly and annotation of GS
Using a combination of Illumina HiSeq X-Ten reads (90.56×), Nanopore long reads (145.38×), and Hi-C sequencing (229.16×) technologies (Supplementary Figs. S1, S2; and Supplementary Table S1), we generated a chromosome-level genome assembly of GS (Supplementary Fig. S3). The assembly was estimated to be 948 Mb, which was close to the genome size estimated by 17-mer analysis (Supplementary Fig. S4 and Supplementary Table S2). In total, 97.60% of contigs were anchored to 25 chromosomes (Supplementary Fig. S3) with a contig N50 of 2 Mb and scaffold N50 of 37 Mb (Supplementary Table S3). Overall, 52.49% of the genome was identified as repeat elements (Supplementary Table S4). Based on the high-quality genome assemblies, we identified a total of 24194 protein-coding genes in GS (Supplementary Table S5). These characteristics of gene structure, including gene length, exon number/length, coding sequence length, and intron number/length, were compared with those of five other fish species (Supplementary Fig. S5). Using BUSCO v3.0.2 [60], we estimated the coverage of 4584 highly conserved single-copy Actinopterygii genes to be 90% in the assembly (Supplementary Table S6). Ultimately, we generated a high-quality reference genome of GS for subsequent population genomic analysis.
Genetic diversity and divergence of the two populations
To further investigate the genetic diversity and divergence of these two fish populations, we conducted whole genome resequencing of 46 Gymnocypris species, including 23 GS and 23 GE individuals. The mean sequencing coverage was approximately 19.14× per individual (range: 15.75–24.08×, Supplementary Table S7). Using the GATK method [61], we identified approximately 12.33 million single-nucleotide polymorphisms (SNPs) (Supplementary Fig. S6). The GS lineage showed slightly higher genetic diversity than the GE lineage (nucleotide diversity [π]: 3.14 × 10−3 vs. 2.87 × 10−3; Tajima's D: 1.097 vs. 1.056; linkage disequilibrium [LD] mean r2: 0.079 vs. 0.086) (Supplementary Fig. S7 and Supplementary Table S8). The genome-wide mean population fixation statistics (Fst) and absolute divergence (Dxy) were 0.03 (95% CI: 0.0297–0.0303) and 0.26 (95% CI: 0.2634–0.2639), respectively, which indicated that these two populations were only weakly structured. The species pairs displayed typical Fst distributions, with a single large peak centred close to the median score and a tail that represented relatively few regions with heightened divergence (Supplementary Fig. S8).
GS and GE were identified as two distinct Gymnocypris species population clusters based on analysis of all SNPs using the neighbour-joining method (Fig. 2A), principal component analysis (Fig. 2B), and the maximum likelihood method in Admixture v1.3 [62] (Fig. 2C and Supplementary Table S9).
The evolutionary divergence of the two lineages was investigated using the multiple sequentially Markovian coalescent (MSMC2) model [63] and SMC++ v1.11 [64]. The GS population showed a higher Ne than the GE population, and the two subspecies underwent two rounds of population decline during or following three intense uplift phases, including the Qingzang, Kunhuang, and Gonghe movements in the third tectonic uplift of the Qinghai–Tibet Plateau (Fig. 3A, and C). The split analyses showed that the divergence time of the two species was approximately 57 Kya (Fig. 3B) or 20 Kya (Fig. 3D).
Gene flow
Previous evidence showed that gene flow commonly occurs between recently diverged species despite the existence of barriers to gene flow in their genomes [65]. We used Fastsimcoal v2.6 [66] to simulate and compare five different hypothetical models (Supplementary Fig. S9). Using the Akaike information criterion (AIC) and maximum likelihood algorithm, our dataset supported a scenario of different gene flow matrices in which there was a large amount of gene flow in the early stage of interspecific differentiation, but gradually decreased with the progress of speciation (Fig. 3E; Supplementary Figs. S10, and S11; and Supplementary Table S10). In the early stage of interspecific differentiation, gene flow was estimated to be 1.22 × 10−3 per generation (95% CI: 0.0008–0.0011) from the GS lineage to the GE lineage, and 2.75 × 10−3 per generation (95% CI: 0.0021–0.0026) from the GE lineage to the GS lineage. In the late stage of speciation, however, gene flow was approximately 4.38 × 10−4 per generation (95% CI: 0.0003–0.0004) from the GS lineage to the GE lineage and 2.04 × 10−6 per generation (95% CI: 2.05 × 10−6–5.54 × 10−6) from the GE lineage to the GS lineage (Fig. 3E).
The pure sympatric speciation model predicts small islands
To determine the characteristics of genomic islands formed in pure sympatric speciation, we carried out computer simulations based on the recurrent selection and backcross (RSB) model [67]. The RSB method, which is based on recurrent selection, and backcross and intercross schemes, was initially proposed for identifying genes in quantitative trait loci. This is accomplished by continuously selecting for breed A traits while backcrossing to breed B. This strategy was also adapted to the speciation-with-gene-flow model by adjusting parameter values (strength of selection, migration rate, recombination rate, and relative fitness of a sequence; see Materials and Methods).
Introgression was simulated in 10 Mb diploid genomes under the conditions of strong selection (s = −0.5) and high migration rate (m = 0.1 for each generation). The simulation results for 1000, 5000 and 10000 generations revealed different degrees of negative selection (Supplementary Fig. S12). More than 99% of the 10 Mb sequence was negatively selected after 10000 generations (Supplementary Fig. S12). The simulation results revealed that, except for the middle region, the sequences on both sides were eventually replaced by more frequent bidirectional migrations in diverging sympatric populations under strong selection. These results indicated that large genomic islands (≥100 kb) between sympatric species are rare, whereas small genomic islands (<100 kb) are widespread. Moreover, the LD decay distance of both species was approximately 10 kb (Fig. 3F), which was consistent with the results of the above model.
Genomic islands between species
Genomic islands between species pairs were detected by relative divergence (ZFst) values. In total, 744 (merged into 107 nonoverlapping windows) genomic islands were identified between the two species; 54 of these 107 genomic islands were ≥100 kb and accounted for 89.4% of total genomic island length (Table 1). The most significant genomic islands were on chromosomes 1, 6, 8, 10, 16, and 25, regardless of the calculated ZFst in 10- or 20-kb nonoverlapping windows (Fig. 4A and Supplementary Figs. S13, and S14). The two lineages showed significantly elevated Dxy and reduced population-scaled recombination rate (ρ) in these genomic islands compared with the rest of the genome (Mann–Whitney U P < 2.2 × 10−16) (Fig. 4B, E, and F; Supplementary Figs. S15A, and 15B; and Supplementary Table S11). Intriguingly, we found strongly increased π in the GS lineage and strongly reduced π in the GE lineage in these genomic islands compared with the rest of the genome (Mann–Whitney U P < 2.2 × 10−16) (Fig. 4C, and D; Supplementary Fig. S15C–S15F; and Supplementary Table S11).
Table 1.
Length of genomic islands (kb) | Numbers of genomic islands (ZFst ≥4) | Numbers of genomic islands (ZFst ≥3.5) | Numbers of genomic islands (ZFst ≥3) |
---|---|---|---|
20–80 | 53 | 72 | 101 |
100–200 | 28 | 27 | 26 |
220–620 | 19 | 21 | 19 |
640–1000 | 4 | 5 | 1 |
1160–2020 | 3 | 3 | 8 |
Total | 107 | 128 | 155 |
Number of genomic islands (≥100 kb) | 54 (50%) | 56 (44%) | 54 (35%) |
Total length of genomic islands (≥100 kb) | 17.91 Mb (89.4%) | 19.67 Mb (87.5%) | 23.79 Mb (85.9%) |
Number of genomic islands (≥220 kb) | 26 (24.2%) | 29 (22.66%) | 28 (18.1%) |
Total length of genomic islands (≥220 kb) | 13.79 Mb (68.89%) | 15.86 Mb (70.58%) | 20.25 (73.1%) |
In total, we identified 226 genes in these genomic islands. Gene Ontology (GO) analysis showed that these genes were significantly enriched for olfactory receptor activity, voltage-gated potassium channel related, G-protein coupled receptor signalling pathway, response to stimulus, cation channel complex, signal transducer activity, GTPase activity, and protein phosphorylation and dephosphorylation process (Supplementary Tables S12, and S13). Strikingly, these genes were significantly concentrated in olfactory transduction and its related signalling pathway (Supplementary Table S14). In addition, we found significant selection signals in these genomic islands (see Selection signals in GS and GE lineages), which indicated that these islands are potential selection regions.
The micro-parapatric speciation model predicts large genomic islands
The RSB method [67] was also used to determine the characteristics of genomic islands formed in micro-parapatric speciation. Introgression was simulated in 10 Mb diploid genomes under the conditions of strong selection (s = −0.5) and low migration rate (m = 0.01 for each generation). The simulation results for 1000, 5000 and 10000 generations revealed different degrees of negative selection (Supplementary Fig. S16). These results suggest that large genomic islands (≥100 kb) occur disproportionately in micro-parapatric speciation (Supplementary Fig. S16).
Selection signals in GS and GE lineages
Despite being closely related, GS and GE are morphologically distinct, especially regarding the shape of the mouth and lower jaw [52]. We used a combination of Fst and π between the two lineages to explore the selection signals of this differentiation (see Materials and Methods) (Fig. 5A). In total, we identified 187 genes from 1006 putatively selected regions in the GE lineage. The selected genes of the GE lineage were on chromosomes 6, 9, 10, 16 and 25 (Supplementary Fig. S17). Interestingly, no selection signal was found in the GS lineage (Fig. 5A).
Because the selected regions of the top 1% of Fst values may be too strict, we also identified the windows with the top 5% of Fst values (Fst ≥ 0.137) as outlier windows. Surprisingly, only 63 windows including 45 genes were identified in the GS lineage (Supplementary Fig. S18). Compared with GS, these findings indicated that the GE lineage was likely subjected to stronger divergent selection. This was also consistent with the strongly reduced π in the genomic islands of the GE lineage (Fig. 4C, and D; Supplementary Fig. S15C–S15F; and Supplementary Table S11). Importantly, 151 selective sweep genes of the GE lineage (80.2% of selected genes) were located inside identified genomic islands (Supplementary Fig. S19).
GO analysis indicated that the 187 genes were significantly enriched for olfactory receptor activity, protein phosphorylation and dephosphorylation process, voltage-gated potassium channel activity, and ion channel activity (Supplementary Tables S15, and S16). Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis also revealed that these genes were highly enriched in the signalling pathway associated with olfactory transduction (Fig. 5B and Supplementary Table S17). Additionally, π was assessed around the selected olfactory receptor genes or52k2 and or52k1 on chromosome 6 (Fig. 5C, and D). The or52k2 and or52k1 loci showed strong reduction in π compared with the 40–60 kb genomic region adjacent to this site. This indicates that there is substantial selective advantage of or52k2 and or52k1 and that they are not genetic hitchhikers.
In addition, cross population composite likelihood ratio (XP-CLR) test [68] was also used to detect selective sweeps. Using the upper 5% of normalized xpclr values as a cut-off, we identified 1480 genes and 1485 candidate selected genes in the GS and GE lineages, respectively. Enrichment analysis also showed that the selected genes were significantly enriched in the olfactory signalling pathway in GE (Supplementary Tables S18, and S19). However, little enrichment of the olfactory signalling pathway was found in GS. This result is similar to that of the above analysis based on Fst and π.
Those shared selected genes in GE from the above two approaches were significantly enriched in biological processes, such as response to chemical, response to stimulus, signal transduction, and olfactory receptor activity (Supplementary Table S20). This highlights the selective role of olfaction in GE.
DISCUSSION
GS and GE are sister species that are sympatrically distributed in Lake Sunmcuo, and they have large differences in morphology (Fig. 1D) and reproductive characteristics. The divergence time between the GS and GE lineages was estimated to be 20–60 Kya (Fig. 3B, and D), which was before Lake Sunmcuo opened into the Yellow River [52]; this indicated that speciation occurred entirely within the lake. It is therefore considered a classic case of sympatric speciation from a biogeographic perspective.
Because of the absence of obvious geographical barriers, bidirectional gene flow was frequent in the early stages of speciation, and the selection intensity had to be sufficiently high to continue differentiation without reversing speciation. The RSB method was used to simulate the pure sympatric model of speciation-with-gene-flow, and the results revealed that small genomic islands (<100 kb) were widespread in sympatric species and large genomic islands were rare (Supplementary Fig. S12). Theoretically, the genomic island size will also be close to the LD decay distance, which was approximately 10 kb (Fig. 3F) for GS and GE. Moreover, recent strong empirical evidence on sympatric mangroves revealed that significant post-speciation gene flow resulted in a large number of introgression blocks averaging only about 3–4 kb in size and non-introgressable genomic islands averaging 1.4 kb in size [69]. Both theoretical and empirical results indicated that large genomic islands are less likely to form in sympatric species (Fig. 1A). Thus, we expected to see numerous different small genomic islands between GS and GE.
In this study, we discovered that genomic islands were concentrated on chromosomes 1, 6, 8, 10, 16 and 25. The genomic islands showed restricted gene flow (high Dxy) and low recombination (low ρ) (Fig. 4B, E, and F). In addition, half of the total number of genomic islands were ≥100 kb, and these islands accounted for 89.4% of the total length of all genomic islands (Table 1). This is the opposite of what we expected. The presence of a high proportion of large genomic islands (≥100 kb) indicated that gene flow was largely restricted during speciation. Some scholars have suspected that sympatric speciation may be micro-allopatric speciation in disguise [39,40,70,71]. However, in our study, it is easy to reject micro-allopatric speciation (Fig. 1B) because both genomic islands and non-islands were observed between the two populations, which is different from allopatric speciation, in which geographical barriers completely prevent gene flow. Although the pattern clearly invalidates micro-allopatry, the large genomic islands (≥100 kb) are not compatible with the sympatric model either.
Inversions may also promote the formation of genomic islands [72]. Chromosomal inversions can reduce gene flow through the suppression of recombination, making the accumulation of genetic differences more probable within such inversions [73,74]. Inversion may play a pre-existing role in speciation. If an allele causing significant reproductive isolation is associated with an inversion, the gene flow near that locus will be restricted [75]. However, if the inversions are neutral, their probability of fixation or loss depends purely on population size and migration [76].
Species differentiation is usually related to ecological adaptation, which is the result of natural selection. For example, the apple and hawthorn fly taxa that we previously discussed have resulted, in part, from the sorting of pre-existing ancestral variation, followed by the rapid evolution and substitution of entirely novel host-choice adaptations [73,77]. Thus, this switch onto a novel host was the trigger for speciation [73]. The speciation process of apple and hawthorn flies [78] may not be pure sympatry. In addition, if sympatric speciation's many large genomic islands were the result of inversions, it would require numerous inversions to produce them, yet generating these numerous pre-existing large inversions is extremely difficult.
A lake is typically an exceedingly complex ecosystem with distinct ecological landscapes depending on the lake's vertical water depth. This pattern is consistent with the extensive evidence on parapatric speciation (Fig. 1C): the two subspecies are distributed and carry out essential life activities in different parts of the water column, and the adjacent habitats facilitate divergent selection but also permit gene flow. This micro-parapatry facilitates the formation of large genomic islands (≥100 kb) because habitat preference reduces gene exchange between species, which may favour continued response to selection and thus promote species adaptation. As a result, all evidence showed that GE and GS do not represent a simple case of sympatric speciation but rather micro-parapatric speciation. The RSB simulation also revealed that large genomic islands (≥100 kb) occur disproportionately in micro-parapatric speciation (Supplementary Fig. S16). Recent empirical studies on sympatric speciation [21–38] have not carried out detailed assessments of the relationship between gene flow and genomic islands, even though speciation processes all involved the selection of different habitats and restricted gene flow. Similar to our case, many of those presumed sympatric speciation events may actually be micro-parapatric speciation events because most of those studies examined genomic islands and restricted gene flow [21–38].
Speciation always involves sexual isolation [11,14,79]. Ecological differences may impose barriers to gene flow, and sexual isolation may occur when barriers are strong enough to prevent recent gene flow. Therefore, when assortative mating depends on an ecological character, speciation is not hindered by recombination between mating and ecological loci [11]. Consequently, subpopulations in different habitats may have sufficient intrinsic premating isolation to promote speciation [80]. However, how does RI arise as a correlate of the genetic divergence? The most direct way to answer this question is to identify the differentiated genes. In this study, GE underwent stronger selection than GS, which led to a high level of genomic differentiation (Fig. 4C, and D, and 5A).
Genes related to olfactory receptor activity and olfactory transduction pathways were also found on the genomic islands. Olfactory genes may act as pleiotropic genes that influence habitat and sexual selection. Animals rely strongly on olfaction to locate and identify food sources [81]. Compared with the GS lineage, the GE lineage consumes a wider variety of foods. The GE lineage was discovered to have extensive olfactory signals, and a particularly strong selective effect on three olfactory receptors on chromosome 6 (Fig. 5C, and D). This allowed the GE lineage to occupy more niches in an oligotrophic lake in a short period of time by increasing the variety of food available. However, no significant selected olfactory signal was detected in GS. For fish, olfaction is also crucial in chemosensory communication, which has been related to speciation, particularly in terms of sexual isolation [82–84]. For example, female preference for conspecific males was shown to rely predominantly, if not exclusively, on olfactory cues in Lake Malawi cichlids [85]. Evolution of assortative mating may be the most powerful isolating barrier between ecologically diverging subpopulations [11,86,87].
CONCLUSION
This study combined theoretical and empirical evidence to provide a new perspective on sympatric speciation. Large genomic islands occurred between the examined subspecies, which revealed reliable evidence that this case of presumed sympatric speciation is actually micro-parapatric speciation.
MATERIALS AND METHODS
Sampling and genome sequencing, assembly and variant calling
Fish were collected from Lake Sunmcuo in Jiuzhi, Qinghai, China (101°11′E, 33°38′N) using gill nets and cast nets. GS and GE individuals were identified based on the taxonomic description by Chen and Cao [56]. Genomic DNA was extracted from muscle and liver tissues (frozen in liquid nitrogen) using a DNeasy Blood & Tissue Kit (Qiagen, 69506) in accordance with the manufacturer's protocol. Illumina, Nanopore, and Hi-C libraries were generated using the HiSeq X-Ten platform and GridION X5 DNA sequencers. To assist with genome annotation, RNA was extracted from the heart, liver, kidney, muscle, gill, brain, and gonad tissues (frozen in liquid nitrogen) using an HP Total RNA Kit (Omega Bio-Tek, R6812-00). We annotated the genomes using a combination of ab initio, homologous-based gene predictions and RNA-seq [88]. For comparative population genomics analysis, we re-sequenced 46 individuals, including 23 GS and 23 GE individuals, using the HiSeq X-Ten platform (Illumina). The detailed methods of sequencing, assembly, annotation, and SNP calling are provided in the Supplementary Methods.
Population structure analysis and demographic history estimation
Population structures were investigated using three approaches, including a neighbour-joining phylogenetic tree, a nonparametric principal component analysis, and a full maximum likelihood approach. Species demographic history and divergence time were estimated by MSMC2 [63] and SMC++ [64]. The detailed methods are provided in the Supplementary Methods.
Gene flow model estimate and genome patterns of genetic divergence
We tested five gene flow models (Supplementary Fig. S8) and used Fastsimcoal v2.6 [66] to infer the dynamic history of gene flow. The AIC was used to identify the best of the five models. The detailed simulation methods are provided in the Supplementary Methods.
Population genomic differentiation was measured by ZFst and Dxy. Genomic regions with ZFst ≥4 were considered genomic islands. Significance of Dxy, π, and ρ were assessed for genomic islands. The detailed estimation methods of ZFst, Dxy, π, and ρ are provided in the Supplementary Methods.
Population genetic simulations under migration, selection, and recombination
To investigate the influences of migration, selection, and recombination on genomic sequences, we carried out computer simulations based on the RSB model [67]. Pure sympatric speciation and micro-parapatric speciation models were simulated. The difference between the two models is the migration rate in speciation. We set a high level of migration (m = 0.1 per generation) in the pure sympatric speciation model because there was frequent bidirectional gene flow between species. A low level of migration (m = 0.01 per generation) was set in micro-parapatric speciation model. Population size was set to 5000 and the recombination rate (r) was set to 10−8 per generation between adjacent base pairs. The length of simulated sequences was 10 Mb (100 kb is the basic unit that cannot be separated by recombination). The recombination probability for a 10 Mb sequence was 0.1. Because population size was 5000, there was an average of 500 individuals with recombination in each generation. The relative fitness of the sequence of a hybrid is 0.5. At the beginning of the simulations, the sequences were in their original state (Supplementary Fig. S12A). After several generations of migration, selection, and recombination, the sequences were shuffled.
Identification of selected genomic regions
For SNPs, we performed a test for selective sweeps in the GS and GE lineages to identify candidate regions using a cross of π and Fst approaches; π and Fst were calculated using VCFtools v1.13 [89] in a 100-kb sliding window with a step size of 10 kb. We identified the window with the top 1% or 5% of Fst values (Fst >0.26 and Fst >0.137, respectively) as the outlier windows. On this basis, π ratio values were used (ratio of π values: π [GS]/π [GE] >2.07, top 5%; π [GS]/π [GE] <0.88, bottom 5%) to identify selected genomic regions in the GE and GS lineages. XP-CLR test [66] was also used to detect selective sweeps using the upper 5% of normalized xpclr values as a cut-off. These protein-coding genes were annotated with GO [90] using the InterPro [91] and eggNOG [92] databases. KEGG annotation [93] used KASS [94] to identify the function of selected genes.
DATA AVAILABILITY
The raw genome data for Gymnocypris eckloni scoliostomus and the genome-resequencing data for G. eckloni scoliostomus and G. eckloni eckloni have been deposited in the NCBI database under BioProject accession number PRJNA771798. The genome assembly file of G. eckloni scoliostomus is available under accession number JAJHNS000000000.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Chung-I Wu, Suhua Shi and Xinfeng Wang for their insightful comments on the manuscript, all members of Kai Zhao's lab for assistance with sampling, and Kun Wang and Kang Du for their advice on data analysis.
Contributor Information
Ning Sun, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China; University of Chinese Academy of Sciences, Beijing 100049, China.
Liandong Yang, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China.
Fei Tian, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, China.
Honghui Zeng, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China.
Ziwen He, State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou 510275, China.
Kai Zhao, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, China.
Cheng Wang, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China; University of Chinese Academy of Sciences, Beijing 100049, China.
Minghui Meng, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China.
Chenguang Feng, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China; School of Ecology and Environment, Northwestern Polytechnical University, Xi’an 710129, China.
Chengchi Fang, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China.
Wenqi Lv, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China; University of Chinese Academy of Sciences, Beijing 100049, China.
Jing Bo, Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China; University of Chinese Academy of Sciences, Beijing 100049, China.
Yongtao Tang, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, China.
Xiaoni Gan, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China.
Zuogang Peng, Key Laboratory of Freshwater Fish Reproduction and Development (Ministry of Education), Southwest University School of Life Sciences, Chongqing 400700, China.
Yiyu Chen, National Natural Science Foundation of China, Beijing 100085, China.
Shunping He, State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China; Institute of Deep-Sea Science and Engineering, Chinese Academy of Sciences, Sanya 572000, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China.
FUNDING
This work was supported by the National Natural Science Foundation of China (91731301, 32170480, 31972866, 31601858 and 32170438), the Strategic Priority Research Program of Chinese Academy of Sciences (XDB31000000), the Youth Innovation Promotion Association, Chinese Academy of Sciences and the Young Top-notch Talent Cultivation Program of Hubei Province.
AUTHOR CONTRIBUTIONS
S.H. led the project. Y.C. provided consultation on both practical and theoretical plateau research. N.S., L.Y. and K.Z. designed the project. N.S., M.M. and Z.H. performed all computational analyses. F.T., C.W., C.F., C.F., W.L., J.B. and X.G. provided data analysis advice. Y.T., H.Z., F.T. and Z.P. collected the samples. N.S. wrote the paper. S.H. revised the paper.
REFERENCES
- 1. Dobzhansky T. Genetics and the Origins of Species. New York: Columbia University Press, 1937. [Google Scholar]
- 2. Via S. Natural selection in action during speciation. Proc Natl Acad Sci USA 2009; 106: 9939–46. 10.1073/pnas.0901397106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Darwin C. On the Origins of Species. London: John Murray, 1859. [Google Scholar]
- 4. Mayr E. Systematics and Origin of Species. New York: Columbia University Press, 1942. [Google Scholar]
- 5. Coyne JA, Orr HA. Speciation. Sunderland, MA: Sinauer Associates, 2004. [Google Scholar]
- 6. Mallet J, Meyer A, Nosil Pet al. Space, sympatry and speciation. J Evol Biol 2009; 22: 2332–41. 10.1111/j.1420-9101.2009.01816.x [DOI] [PubMed] [Google Scholar]
- 7. Schluter D. Ecology and the origin of species. Trends Ecol Evol 2001; 16: 372–80. 10.1016/S0169-5347(01)02198-X [DOI] [PubMed] [Google Scholar]
- 8. Fitzpatrick BM, Fordyce JA, Gavrilets S. What, if anything, is sympatric speciation? J Evolution Bio 2008;21: 1452–9. 10.1111/j.1420-9101.2008.01611.x [DOI] [PubMed] [Google Scholar]
- 9. Schluter D. Evidence for ecological speciation and its alternative. Science 2009; 323: 737–41. 10.1126/science.1160006 [DOI] [PubMed] [Google Scholar]
- 10. Felsenstein J. Skepticism towards Santa Rosalia, or why are there so few kinds of animals? Evolution 1981; 35: 124–38. [DOI] [PubMed] [Google Scholar]
- 11. Dieckmann U, Doebeli M. On the origin of species by sympatric speciation. Nature 1999; 400: 354–7. 10.1038/22521 [DOI] [PubMed] [Google Scholar]
- 12. Bolnick DI, Fitzpatrick BM. Sympatric speciation: models and empirical evidence. Annu Rev Ecol Evol Syst 2007; 38: 459–87. 10.1146/annurev.ecolsys.38.091206.095804 [DOI] [Google Scholar]
- 13. Coyne JA. Sympatric speciation. Curr Biol 2007; 17: 787–8. 10.1016/j.cub.2007.06.056 [DOI] [PubMed] [Google Scholar]
- 14. Thoday JM, Gibson JB. Isolation by disruptive selection. Nature 1962; 193: 1164–6. 10.1038/1931164a0 [DOI] [PubMed] [Google Scholar]
- 15. Smith JM. Sympatric speciation. Am Nat 1966; 100: 637–50. 10.1086/282457 [DOI] [Google Scholar]
- 16. Kondrashov AS, Kondrashov FA. Interactions among quantitative traits in the course of sympatric speciation. Nature 1999; 400: 351–4. 10.1038/22514 [DOI] [PubMed] [Google Scholar]
- 17. Higashi M, Takimoto G, Yamamura AN. Sympatric speciation by sexual selection. Nature 1999; 402: 523–6. 10.1038/990087 [DOI] [PubMed] [Google Scholar]
- 18. Gavrilets S. Rapid evolution of reproductive barriers driven by sexual conflict. Nature 2000; 403: 886–9. 10.1038/35002564 [DOI] [PubMed] [Google Scholar]
- 19. Matute DR, Butler IA, Turissini DAet al. A test of the snowball theory for the rate of evolution of hybrid incompatibilities. Science 2010; 329: 1518–21. 10.1126/science.1193440 [DOI] [PubMed] [Google Scholar]
- 20. Via S. Divergence hitchhiking and the spread of genomic isolation during ecological speciation-with-gene-flow. Phil Trans R Soc B 2012; 367: 451–60. 10.1098/rstb.2011.0260 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Barluenga M, Meyer A. The Midas cichlid species complex: incipient sympatric speciation in Nicaraguan cichlid fishes? Mol Ecol 2004; 13: 2061–76. 10.1111/j.1365-294X.2004.02211.x [DOI] [PubMed] [Google Scholar]
- 22. Barluenga M, Stölting KN, Salzburger Wet al. Sympatric speciation in Nicaraguan crater lake cichlid fish. Nature 2006; 439: 719–23. 10.1038/nature04325 [DOI] [PubMed] [Google Scholar]
- 23. Martin CH. Strong assortative mating by diet, color, size, and morphology but limited progress toward sympatric speciation in a classic example: Cameroon crater lake cichlids. Evolution 2013; 67: 2114–23. 10.1111/evo.12090 [DOI] [PubMed] [Google Scholar]
- 24. Papadopulos AS, Kaye M, Devaux Cet al. Evaluation of genetic isolation within an island flora reveals unusually widespread local adaptation and supports sympatric speciation. Phil Trans R Soc B 2014; 369: 20130342. 10.1098/rstb.2013.0342 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Martin CH, Cutler JS, Friel JPet al. Complex histories of repeated gene flow in Cameroon crater lake cichlids cast doubt on one of the clearest examples of sympatric speciation. Evolution 2015; 69: 1406–22. 10.1111/evo.12674 [DOI] [PubMed] [Google Scholar]
- 26. Li K, Hong W, Jiao Het al. Sympatric speciation revealed by genome-wide divergence in the blind mole rat Spalax. Proc Natl Acad Sci USA 2015; 112: 11905–10. 10.1073/pnas.1514896112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Malinsky M, Challis RJ, Tyers AMet al. Genomic islands of speciation separate cichlid ecomorphs in an East African crater lake. Science 2015; 350: 1493–8. 10.1126/science.aac9927 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Li K, Wang H, Cai Zet al. Sympatric speciation of spiny mice, Acomys, unfolded transcriptomically at Evolution Canyon, Israel. Proc Natl Acad Sci USA 2016; 113: 8254–9. 10.1073/pnas.1608743113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Richards EJ, Poelstra JW, Martin CH. Don’t throw out the sympatric speciation with the crater lake water: fine-scale investigation of introgression provides equivocal support for causal role of secondary gene flow in one of the clearest examples of sympatric speciation. Evol Lett 2018; 2: 524–40. 10.1002/evl3.78 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Ravinet M, Yoshida K, Shigenobu Set al. The genomic landscape at a late stage of stickleback speciation: high genomic divergence interspersed by small localized regions of introgression. PLoS Genet 2018; 14: e1007358. 10.1371/journal.pgen.1007358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Dean LL, Magalhaes IS, Foote Aet al. Admixture between ancient lineages, selection, and the formation of sympatric stickleback species-pairs. Mol Bio Evol 2019; 36: 2481–97. 10.1093/molbev/msz161 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Osborne OG, Ciezarek A, Wilson Tet al. Speciation in Howea palms occurred in sympatry, was preceded by ancestral admixture, and was associated with edaphic and phenological adaptation. Mol Bio Evol 2019; 36: 2682–97. 10.1093/molbev/msz166 [DOI] [PubMed] [Google Scholar]
- 33. Choi JY, Purugganan M, Stacy EA. Divergent selection and primary gene flow shape incipient speciation of a riparian tree on Hawaii Island. Mol Bio Evol 2020; 37: 695–710. 10.1093/molbev/msz259 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Li K, Zhang S, Song Xet al. Genome evolution of blind subterranean mole rats: adaptive peripatric versus sympatric speciation. Proc Natl Acad Sci USA 2020; 117: 32499–508. 10.1073/pnas.2018123117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Kautt AF, Kratochwil CF, Nater Aet al. Contrasting signatures of genomic divergence during sympatric speciation. Nature 2020; 588: 106–11. 10.1038/s41586-020-2845-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Wang H, Yin H, Jiao Cet al. Sympatric speciation of wild emmer wheat driven by ecology and chromosomal rearrangements. Proc Natl Acad Sci USA 2020; 117: 5955–63. 10.1073/pnas.1920415117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Yamasaki YY, Kakioka R, Takahashi Het al. Genome-wide patterns of divergence and introgression after secondary contact between Pungitius sticklebacks. Phil Trans R Soc B 2020; 375: 20190548. 10.1098/rstb.2019.0548 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Christmas MJ, Jones JC, Olsson Aet al. Genetic barriers to historical gene flow between cryptic species of alpine bumblebees revealed by comparative population genomics. Mol Bio Evol 2021; 38: 3126–43. 10.1093/molbev/msab086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Getz WM, Kaitala V. Ecogenetic models, competition, and heteropatry. Theor Popul Biol 1989; 36: 34–58. 10.1016/0040-5809(89)90022-1 [DOI] [Google Scholar]
- 40. Smith HM. The perspective of species. Turtox News 1955; 33: 74–7. [Google Scholar]
- 41. Wu CI. The genic view of the process of speciation. J Evol Biol 2001; 14: 851–65. 10.1046/j.1420-9101.2001.00335.x [DOI] [Google Scholar]
- 42. Wu CI, Ting CT. Genes and speciation. Nat Rev Genet 2004; 5: 114–22. 10.1038/nrg1269 [DOI] [PubMed] [Google Scholar]
- 43. Renaut S, Grassa CJ, Yeaman Set al. Genomic islands of divergence are not affected by geography of speciation in sunflowers. Nat Commun 2013; 4: 1–8. 10.1038/ncomms2833 [DOI] [PubMed] [Google Scholar]
- 44. Carneiro M, Albert FW, Afonso Set al. The genomic architecture of population divergence between subspecies of the European rabbit. PLoS Genet 2014; 10: e1003519. 10.1371/journal.pgen.1003519 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Han F, Lamichhaney S, Grant BR. Gene flow, ancient polymorphism, and ecological adaptation shape the genomic landscape of divergence among Darwin's finches. Genome Res 2017; 27: 1004–15. 10.1101/gr.212522.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Ravinet M, Faria R, Butlin RKet al. Interpreting the genomic landscape of speciation: a road map for finding barriers to gene flow. J Evol Biol 2017; 30: 1450–77. 10.1111/jeb.13047 [DOI] [PubMed] [Google Scholar]
- 47. Savolainen V, Anstett MC, Lexer Cet al. Sympatric speciation in palms on an oceanic island. Nature 2006; 441: 210–3. 10.1038/nature04566 [DOI] [PubMed] [Google Scholar]
- 48. Papadopulos AST, Igea J, Dunning LTet al. Ecological speciation in sympatric palms: 3. Genetic map reveals genomic islands underlying species divergence in Howea. Evolution 2019; 73: 1986–95. 10.1111/evo.13796 [DOI] [PubMed] [Google Scholar]
- 49. Feder JL, Chilcote CA, Bush GL. Genetic differentiation between sympatric host races of the apple maggot fly Rhagoletis pomonella. Nature 1988; 336: 61–4. 10.1038/336061a0 [DOI] [Google Scholar]
- 50. Feder JL, Opp SB, Wlazlo Bet al. Host fidelity is an effective premating barrier between sympatric races of the apple maggot fly. Proc Natl Acad Sci USA 1994; 91: 7990–4. 10.1073/pnas.91.17.7990 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Filchak KE, Roethele JB, Feder JL. Natural selection and sympatric divergence in the apple maggot Rhagoletis pomonella. Nature 2000; 407: 739–42. 10.1038/35037578 [DOI] [PubMed] [Google Scholar]
- 52. Zhao K, Duan ZY, Peng ZGet al. The youngest split in sympatric schizothoracine fish (Cyprinidae) is shaped by ecological adaptations in a Tibetan Plateau glacier lake. Mol Ecol 2009; 18: 3616–28. 10.1111/j.1365-294X.2009.04274.x [DOI] [PubMed] [Google Scholar]
- 53. Lei YB, Zhang HC, Shang HMet al. Lake evolution and glaciation of the Nianbaoyuze Mountain in the Tibetan Plateau since the middle of the last glacial period. Quat Sci 2008; 28: 132–9. [Google Scholar]
- 54. Tsao WS, Wu HW. An investigation of the fish biology and fishery problems in Ganze-Apa region of western Szechwan Province. Acta Hydrobiol Sin 1962; 2: 79–111. [Google Scholar]
- 55. Zhao TQ. On the taxonomic problem of Gymnocypris eckloni Herzenstein. La Anim Mondo 1986; 3: 49–55. [Google Scholar]
- 56. Chen YY, Cao WX. Fauna Sinica, Osteichthyes, Cypriniformes II. Beijing: Science Press, 1998. [Google Scholar]
- 57. Li SC, Chang SY. Two new species and one new subspecies of fishes from the northern part of Kansu Province, China. Acta Zool Sin 1974; 20: 414–9. [Google Scholar]
- 58. Zhu SQ, Wu YF. Studies on the ichthyofauna of Lake Qinghai. In: Qinghai Institute of Biology (ed.). Ichthyofauna of Lake Qinghai and Biology of Gymnoypris przewalskii (Cyprinidae). Beijing: Science Press, 1975, 9–26. [Google Scholar]
- 59. Wu YF, Chen Y. The fishes in Guoluo and Yushu region of Qinghai Province. Acta Zool Sin 1979; 4: 287–96. [Google Scholar]
- 60. Simao FA, Waterhouse RM, Ioannidis Pet al. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015; 31: 3210–2. 10.1093/bioinformatics/btv351 [DOI] [PubMed] [Google Scholar]
- 61. McKenna A, Hanna M, Banks Eet al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010; 20: 1297–303. 10.1101/gr.107524.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res 2009; 19: 1655–64. 10.1101/gr.094052.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Schiffels S, Durbin R. Inferring human population size and separation history from multiple genome sequences. Nat Genet 2014; 46: 919–25. 10.1038/ng.3015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Terhorst J, Kamm JA, Song YS. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat Genet 2017; 49: 303–9. 10.1038/ng.3748 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Christmas MJ, Jones JC, Olsson Aet al. Genetic barriers to historical gene flow between cryptic species of alpine bumblebees revealed by comparative population genomics. Mol Bio Evol 2021; 38: 3126–43. 10.1093/molbev/msab086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Excoffier L, Dupanloup I, Huerta-Sanchez E. Robust demographic inference from genomic and SNP data. PLoS Genet 2013; 9: e1003905. 10.1371/journal.pgen.1003905 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Luo ZW, Wu CI, Kearsey MJ. Precision and high-resolution mapping of quantitative trait loci by use of recurrent selection, backcross or intercross schemes. Genetics 2002; 161: 915–29. 10.1093/genetics/161.2.915 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Meyer RS, Choi JY, Sanches Met al. Domestication history and geographical adaptation inferred from a SNP map of African rice. Nat Genet 2016; 48: 1083–8. 10.1038/ng.3633 [DOI] [PubMed] [Google Scholar]
- 69. Wang X, Guo Z, He Zet al. In the absence of reproductive isolation–Extensive gene flow after speciation. bioRxiv; doi: 10.1101/622019. [DOI]
- 70. Bird CE, Fernandez-Silva I, Skillings DJet al. Sympatric speciation in the post “modern synthesis” era of evolutionary biology. Evol Biol 2012; 39: 158–80. 10.1007/s11692-012-9183-6 [DOI] [Google Scholar]
- 71. Smith HM. More evolutionary terms. Syst Zool 1965; 14: 57–8. 10.2307/2411904 [DOI] [Google Scholar]
- 72. Schluter D, Rieseberg LH. Three problems in the genetics of speciation by selection. Proc Natl Acad Sci USA 2022; 119: e2122153119. 10.1073/pnas.2122153119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Rieseberg LH. Chromosomal rearrangements and speciation. Trends Ecol Evol 2001; 16: 351–8. 10.1016/S0169-5347(01)02187-5 [DOI] [PubMed] [Google Scholar]
- 74. Jiggins CD, Bridle JR. Speciation in the apple maggot fly: a blend of vintages? Trends Ecol Evol 2004; 19: 111–4. 10.1016/j.tree.2003.12.008 [DOI] [PubMed] [Google Scholar]
- 75. Barton NH. Gene flow past a cline. Heredity 1979; 43: 333–9. 10.1038/hdy.1979.86 [DOI] [Google Scholar]
- 76. Hoffmann AA, Rieseberg LH. Revisiting the impact of inversions in evolution: from population genetic markers to drivers of adaptive shifts and speciation? Annu Rev Ecol Evol Syst 2008; 39: 21–42. 10.1146/annurev.ecolsys.39.110707.173532 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Linn C Jr, Feder JL, Nojima Set al. Fruit odor discrimination and sympatric host race formation in Rhagoletis. Proc Natl Acad Sci USA 2003; 100: 11490–3. 10.1073/pnas.1635049100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Feder JL, Roethele JB, Filchak Ket al. Evidence for inversion polymorphism related to sympatric host race formation in the apple maggot fly, Rhagoletis pomonella. Genetics 2003; 163: 939–53. 10.1093/genetics/163.3.939 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Wu CI. A stochastic simulation study on speciation by sexual selection. Evolution 1985; 39: 66–82. [DOI] [PubMed] [Google Scholar]
- 80. Bush GL. Sympatric speciation in animals: new wine in old bottles. Trends Ecol Evol 1994; 9: 285–8. 10.1016/0169-5347(94)90031-0 [DOI] [PubMed] [Google Scholar]
- 81. Wyatt TD. Pheromones and Animal Behavior: Chemical Signals and Signatures. Cambridge: Cambridge University Press, 2014. [Google Scholar]
- 82. Smadja C, Butlin RK. On the scent of speciation: the chemosensory system and its role in premating isolation. Heredity 2009; 102: 77–97. 10.1038/hdy.2008.55 [DOI] [PubMed] [Google Scholar]
- 83. Heuschele J, Mannerla M, Gienapp P. Environment-dependent use of mate choice cues in sticklebacks. Behav Ecol 2009; 20: 1223–7. 10.1093/beheco/arp123 [DOI] [Google Scholar]
- 84. Poelstra JW, Richards EJ, Martin CH. Speciation in sympatry with ongoing secondary gene flow and a potential olfactory trigger in a radiation of Cameroon cichlids. Mol Ecol 2018; 27: 4270–88. 10.1111/mec.14784 [DOI] [PubMed] [Google Scholar]
- 85. Plenderleith M, Oosterhout CV, Robinson RLet al. Female preference for conspecific males based on olfactory cues in a Lake Malawi cichlid fish. Biol Lett 2005; 1: 411–4. 10.1098/rsbl.2005.0355 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86. Martin CH. Weak disruptive selection and incomplete phenotypic divergence in two classic examples of sympatric speciation: Cameroon crater lake cichlids. Am Nat 2012; 180: E90–109. 10.1086/667586 [DOI] [PubMed] [Google Scholar]
- 87. Martin CH. Strong assortative mating by diet, color, size, and morphology but limited progress toward sympatric speciation in a classic example: Cameroon crater lake cichlids. Evolution 2013; 67: 2114–23. 10.1111/evo.12090 [DOI] [PubMed] [Google Scholar]
- 88. Wang K, Shen Y, Yang Yet al. Morphology and genome of a snailfish from the Mariana Trench provide insights into deep-sea adaptation. Nat Ecol Evol 2019; 3: 823–33. 10.1038/s41559-019-0864-8 [DOI] [PubMed] [Google Scholar]
- 89. Danecek P, Auton A, Abecasis Get al. The variant call format and VCF tools. Bioinformatics 2011; 27: 2156–8. 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Ashburner M, Ball CA, Blake JAet al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000; 5: 25–9. 10.1038/75556 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91. Mitchell AL, Attwood TK, Babbitt PCet al. InterPro in 2019: improving coverage, classification and access to protein sequence annotations. Nucleic Acids Res 2019; 47: D351–60. 10.1093/nar/gky1100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92. Huerta-Cepas J, Szklarczyk D, Heller Det al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 2019; 47: D309–14. 10.1093/nar/gky1085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93. Kanehisa M, Goto S.. KEGG: KyotoEncyclopedia of Genes and Genomes. Nucleic Acids Res 2000; 28: 27–30. 10.1093/nar/28.1.27 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94. Moriya Y, Itoh M, Okuda Set al. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res, 2007; 35: 182–5. 10.1093/nar/gkm321 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw genome data for Gymnocypris eckloni scoliostomus and the genome-resequencing data for G. eckloni scoliostomus and G. eckloni eckloni have been deposited in the NCBI database under BioProject accession number PRJNA771798. The genome assembly file of G. eckloni scoliostomus is available under accession number JAJHNS000000000.