Abstract
In the conventional view, species are separate gene pools delineated by reproductive isolation (RI). In an alternative view, species may also be delineated by a small set of ‘speciation genes’ without full RI, a view that has gained broad acceptance. A recent survey, however, suggested that the extensive literature on ‘speciation with gene flow’ is mostly (if not all) about exchanges in the early stages of speciation. There is no definitive evidence that the observed gene flow actually happened after speciation is completed. Here, we wish to know whether ‘good species’ (defined by the ‘secondary sympatry’ test) do continue to exchange genes and, importantly, under what conditions such exchanges can be observed. De novo whole-genome assembly and re-sequencing of individuals across the range of two closely related mangrove species (Rhizophora mucronata and R. stylosa) reveal the genomes to be well delineated in allopatry. They became sympatric in northeastern Australia but remain distinct species. Nevertheless, their genomes harbor ∼4000–10 000 introgression blocks averaging only about 3–4 Kb. These fine-grained introgressions indicate continual gene flow long after speciation as non-introgressable ‘genomic islets,’ ∼1.4 Kb in size, often harbor diverging genes of flower or gamete development. The fine-grained introgression in secondary sympatry may help settle the debate about sympatric vs. micro-allopatric speciation. In conclusion, true ‘good species’ may often continue to exchange genes but the opportunity for detection is highly constrained.
Keywords: mangroves, population genomics, speciation, introgression, species hybridization
INTRODUCTION
Biological species are generally defined as taxa that do not exchange genes due to various forms of reproductive isolation (RI). RI mechanisms include ecological, behavioral, and reproductive incompatibilities [1,2]. These mechanisms are the foundation of the Biological Species Concept (BSC) [3–5] and have been accepted as both necessary and sufficient for species to evolve along diverging paths.
Strictly speaking, two true species should be separated by a combination of RI mechanisms such that they cease to exchange genes anywhere in their genomes. RI needs to be complete to avoid the logical quagmire of defining how much isolation is enough. In addition, RI assumes a very high degree of genetic cohesiveness within each species such that any exchange would be harmful [3,5,6]. Complete RI, however, is difficult to ascertain. For example, many good species appear to be only partially isolated as they produce sterile hybrids in one sex but the other sex is highly fertile [7–11]. In many other cases, there is no evidence even for partial RI [12–15]. An equally thorny question is the many seemingly convincing cases of sympatric speciation, say, between fish species in crater lakes [16–18]. A common defense of BSC in such cases is that it is often micro-allopatric speciation imposing restricted gene flow in a small spatial scale.
Against this backdrop, the basic assumptions of BSC in asserting full RI across the whole genome has been questioned. In reality, only a small fraction of the genome may be responsible for differentiation between species in morphology, behavior, reproduction, and ecology (i.e. ‘speciation genes’). In this genic view of speciation (GVS) [5], genome regions not germane to speciation should be easily interchangeable between species. In short, BSC asserts that true species should be fully reproductively isolated. The alternative GVS, by allowing species to extensively share genetic material, would abandon RI as the defining concept of species. The central question is, therefore, ‘do true species extensively exchange genes?’
Conditions for detecting post-speciation gene flow and the literature on speciation with gene flow
While there is an extensive literature on gene flow during, or even after, speciation (reviewed in [6]), a series of recent debates raise the possibility that the evidence is almost entirely about gene flow in the early phase of speciation [6,19–23]. In other words, there is still no definitive evidence that gene flow continues through the entire process of speciation and extends to ‘good species’.
We may use the model outlined in Fig. 1A–D to evaluate the conditions for detecting post-speciation gene flow. To begin with, the species status needs to be defined by unambiguous criteria. The most stringent of them are: First, two populations have evolved into putatively full species in allopatry. Second, the two species subsequently come into sympatry but maintain their biological distinction over an extended period, even with ample opportunities for genetic exchanges to merge into a hybrid swarm (Fig. 1A). These criteria are the classical and most stringent definition of species [1,4,5,11] which shall be referred to simply as the ‘secondary sympatry’ test in this study. The sympatry test is important as seen in previous debates on this issue [6,24–26]. In many cases, the species may not pass the sympatry test as shown in Fig. 1B (see the legends). Furthermore, while the extensive genomic literature has convincingly demonstrated gene flow during speciation [6], the inferred gene flow most likely happened in the early stages. Note that the focus here is on post-speciation gene flow.
Figure 1.
Models of post-speciation gene flow and the genomic structure in such models. (A) Two populations undergo allopatric speciation, resulting in two good species depicted in orange and green, respectively. In an area away from the center of each species’ distribution, the two putative species re-establish sympatry. In sympatry, the two species retain their biological characters (shown in their distinctive colors), passing the ‘secondary sympatry’ test. (B) In many cases, the two putative species hybridize and fuse (indicated by the olive color) in sympatry. They are not considered true species. (C and D) When the two species pass the sympatry test, there are two possibilities. In (C), the two species do not exchange genes at all. This is the most common scenario as observed in many secondary sympatry cases. (In such cases, the hybrid zones may show some admixture although the hybrids are usually quickly eliminated by natural selection.) (D) The true case of post-speciation gene flow. The genomic sequences of the two species show signs of genetic exchanges in the genomic segments of olive color. The small genomic segments flanked by dotted lines are non-introgressable and may harbor ‘speciation genes.’ It should be noted that the patterns of introgression are transient. As the two species continue to diverge, gene flow within the same species will homogenize the genomic sequences of each species, thus reversing the genomic patterns back to that shown in (C). (E) The genomes of R. mucronata and R. stylosa. Circular tracks represent, from outer to inner, top 18 longest scaffolds of R. mucronata (chr1–18 in orange) and R. stylosa (chr1–18 in green), percentage of repeats (3.97–99.92%, the darker the higher), gene density (0–47, the darker the higher), GC content (29.73–51.97%) and the spectrum of inter-specific collinear analysis (each line connects one pair of homologous genes and a cluster of such lines represents one collinear block). All statistics are calculated for windows of 200 Kb.
Once good species are confirmed by the test, it would be possible to check for past genetic exchanges discernible in their genomes. Importantly, the extensive survey by Wang et al. finds no convincing evidence for post-speciation gene flow [6]. Such evidence may not be readily accessible as the window of time for convincing observations is quite narrow. If it is too early, the species may not pass the ‘secondary sympatry’ test (Fig. 1B). On the other hand, the footprint of post-speciation gene flow may not last long as subsequent divergence and gene flow would erase it (Fig. 1C).
The true signature of post-speciation gene flow requires two signals. First, allopatric populations of these species should show the pattern depicted in Fig. 1C. Second, sympatric populations would show the Fig. 1D pattern. The contrast between the two signals ensures that the gene flow in sympatry has indeed happened after speciation has completed. As stated above, gene flow during speciation, as concluded in many studies [6,27,28], most likely happens in the early stages. The model illustrated in Fig. 1A–D shows the challenges of proving post-speciation gene flow. A convincing proof, requiring careful experimental design to identify species at the right stage and in the right place of evolution, would be highly conceptually significant. This study provides such a proof by sequencing a large number of genomes of two closely-related mangrove species Rhizophora mucronata and R. stylosa. These samples are collected from 11 populations across their range of distribution over a large geographical area.
Mangroves are woody plants that have colonized intertidal zones of tropical coasts [29–32]. R. mucronata and R. stylosa are distributed on the Indo-Western Pacific (IWP) coasts [31]. They have distinct but overlapping geographical distributions and have diverged consistently in morphological and ecological characteristics such as the style length and saline tolerance [31,33]. Because of the narrow band of suitable habitats along the coasts (or near river mouths), global mangrove distributions are essentially one-dimensional, making them ideal for biogeographical studies of speciation.
RESULTS
De novo assembling R. Mucronata and R. Stylosa genomes
We sequenced the genome of one R. mucronata and one R. stylosa individual, respectively (Supplementary Table S1; see Supplementary Table S2 for details). Although the genomes of these species have been re-sequenced before [30], the aims of this study demand accurate assembly of their own genomes in order to assess possible fine-grained introgressions. We therefore carried out de novo sequencing and attained the chromosome-level assembly in this study. These two genomes were sequenced as a part of the Genomics of World Mangroves Project as described in several of our previous publications [34,35], in particular, the phylogenomic analysis [36]. The full description of genome sequencing, annotation and analyses are provided here, upon which the re-sequencing of individuals from multiple populations is based.
We obtained 33.84 Gb (gigabases) and 36.13 Gb of raw data, corresponding to 142X–164X coverage of the assembled genomes (Supplementary Table S2). We assembled 237.85 Mb (megabases) of R. mucronata sequences into 14 496 scaffolds with N50 at 12.03 Mb and 18 scaffolds (chr1–18) >= 5 Mb. The assembled chr1–18 account for 84.38% (200.70 Mb) of the genome and correspond to the diploid chromosome number (2n = 36, Fig. 1E, Supplementary Table S1 and Fig. S1). The assembled R. stylosa genome is 219.67 Mb, containing 319 scaffolds with N50 at 12.64 Mb. The top 18 scaffolds account for 218.34 Mb, or 99.39%, of the genome (Fig. 1E, Supplementary Table S1 and Fig. S2). The sizes of our assemblies were smaller than the estimated genome sizes by flow cytometry and k-mer analysis, which range from 352.03 to 369 Mb in R. mucronata and from 299.71 to 307 in R. stylosa [36]. We were able to map 95.2% and 95.9% of the 2326 BUSCOs, as well as 94.52% and 92.53% of the corresponding Illumina sequencing reads, to the genome assemblies of R. mucronata and R. stylosa, respectively (Supplementary Tables S1 and S2). These results indicate the high completeness of our de novo assembled genomes, approaching a chromosomal level. A total of 22 574 gene models were annotated and classified into 17 897 families in the R. mucronata genome, while 23 545 genes from 18 134 families were identified in R. stylosa (Supplementary Table S1). This gene prediction comes also with high completeness, indicated by the fact that 89.5% and 91.4% of the 2326 BUSCOs were successfully mapped to our predicted gene models of R. mucronata and R. stylosa, respectively (Supplementary Table S1).
To examine the evolution of R. mucronata and R. stylosa genome structure in the larger phylogenetic context, we compare their genomes to those of Carallia pectinifolia [36], Bruguiera gymnorrhiza [32] and Rhizophora apiculata [30]. C. pectinifolia is from one of the closest non-mangrove genera in the Rhizophoraceae family; 29 653 gene families were identified among the five species, with 6890 single-copy. Using these single-copy orthologs, we reconstructed the species’ phylogeny and estimated divergence times (Supplementary Fig. S3). The phylogeny shows that R. mucronata and R. stylosa are sisters to each other diverging ∼2.70 Mya (with 95% confidence interval [2.05, 3.44] Mya; Supplementary Table S3 and Fig. S3). R. mucronata and R. stylosa are likely to be most closely related to each other as shown previously [36]. Most important, they meet the requirements for detection of post-speciation gene flow within a fairly small window of time after speciation (see Discussion). We identify 837 collinear blocks between the two species that harbor 18 716 genes in R. mucronata and 18 663 in R. stylosa (Fig. 1E). The inter-specific Ks (∼0.0031) and the genomic divergence (Dxy = 0.0031) are relatively low (Supplementary Table S2 and Fig. S4).
R. Mucronata and R. Stylosa genomic diversity
Rhizophora mucronata is widely distributed in the Indo-Western Pacific (IWP) region, particularly to the west of the Strait of Malacca and all the way to East Africa. In contrast, R. stylosa extends eastward from the Strait of Malacca to western Pacific islands (Fig. 2A). The two species have been reported to overlap in scattered locales along several western Pacific coastlines. However, in our own field trips, their relative abundance is often skewed in favor of one species and their co-occurrence has rarely been found. The sole exception in our collection is in the Daintree River (DR) area of northeastern Australia, where both species are quite abundant (Fig. 2A).
Figure 2.
Phylogenetic relationships and biogeographical observations. (A) The genealogy of R. mucronata and R. stylosa. The maximum-likelihood (ML) tree, generated by RAxML with 100 bootstraps. The numbers on the nodes indicate support values. R. mucronata is colored in orange while R. stylosa is in green. Orange boxes below the phylogeny indicate geographical regions of the sampled populations (see Supplementary Table S4 for details). The sympatric R. mucronata (m1) and R. stylosa (s1) in Daintree River, Australia are highlighted with red background. (B) The same phylogeny excluding the sympatric m1 and s1 samples from the Daintree area. We denote the allopatric populations as Mallo (m2–m7) and Sallo (s2–s4). (C) The spectrum of the FST statistic between the Mallo and Sallo samples.
We collected 21 R. stylosa individuals from four locations (labeled s1–s4) and 31 R. mucronata samples from seven locations (m1–m7) for population genomic studies (Fig. 2A and Supplementary Table S4). Note that m1 and s1 refer to the sympatric DR samples. Whole genomes of all samples were sequenced on the Illumina Hiseq 2000 platform, yielding a mean depth of 16X (ranging from 12X to 22X) (Supplementary Tables S5 and S6). Short reads from each individual were mapped to the de novo R. mucronata genome, with average genomic coverage of 81% (80%–83%, Supplementary Table S4). The level of genetic diversity shows two patterns. Low genetic diversity is found in all allopatric populations (average θπ at 0.62 and 0.60 per Kb for R. mucronata m2–m7 and R. stylosa s2–s4, respectively, see also Supplementary Tables S2 and S4). The level is much higher in the sympatric DR populations (θπ = 1.37/Kb and 2.09/Kb, respectively). Watterson's estimates (θw) are similar (Supplementary Table S4, see Supplementary Methods).
Divergence between the two species in allopatry
Genomic divergence (Dxy) between the two species is 4.14 × 10−3 per site (Supplementary Table S7). We first constructed a Maximum Likelihood (ML) tree using RAXML [37] on 31 R. mucronata and 21 R. stylosa individual sequences from the 11 populations. The ML tree bifurcates with a clear delineation between species across all allopatric populations. However, the m1 and s1 (i.e. DR) samples show strong signs of admixture as they are ‘in the middle’ of the bifurcated tree (Fig. 2A). When the DR samples are removed, the phylogeny shows clear delineation (Fig. 2B). These two trees are robust when rebuilt using the ML method in IQTREE [38] or the Neighbor-Joining (NJ) method in MEGA7 [39] (Supplementary Figs. S5–S6). The monophyletic delineation of R. mucronata and R. stylosa in allopatry is also supported by a principal component analysis (PCA, Supplementary Fig. S7a) [40].
We detected 1.7 million variable sites across all populations of the two species (Supplementary Table S8). We first partition these sites by excluding the DR samples (see Supplementary Methods). Each site is then represented by an FST value with FST = 0 indicating no differentiation between the two species in allopatry and FST = 1 indicating complete differentiation. Figure 2C shows the U-shaped distribution where the abundance of sites at the far right reveals the extensive differentiation between species. At the other end, the sites with low differentiation may indicate gene flow between species. Such a U-shape distribution is typical of species diverging with little gene flow [41]. In contrast, the distribution between R. mucronata and R. stylosa samples in the DR region (m1 vs. s1) is the typical L shape, suggesting extensive introgression within this region (Supplementary Fig. S7b).
The two species can be easily distinguished in the field. R. mucronata tends to settle in the less saline and further upstream habitats in comparison with R. stylosa found in saline habitats closer to river mouths (see Fig. 3E). The two species also differ substantially in overall tree morphology (see Fig. 3D). They are most readily distinguished by the reproductive characters of the flower, in particular the style length [31,33] as pictured in Fig. 3A. The morphological differences between R. mucronata and R. stylosa across populations are shown in Fig. 3C. R. mucronata is readily distinguished by its short style, between 0.9 and 1.6 mm (Fig. 3A). In contrast, the R. stylosa style is long, 2.4–5.3 mm (Fig. 3A and C) [31,33]. While the style length varies from locale to locale in both species, this trait is species-diagnostic across locales. Additional, albeit less stable, diagnostic morphological characters are listed in the Supplement (Table S9). In short, we show that R. mucronata and R. stylosa have diverged in their genomes, geographical distribution, habitat choice, and various morphological characters.
Figure 3.
Key diagnostic characters between R. mucronata and R. stylosa. (A) R. mucronata (Pulau Burung, Malaysia, 101°50'14.6''E, 2°29'33.7'N) and R. stylosa (Zhanjiang, China, 109°45'46.50''E, 21°34'7.32''N) styles from allopatric sites. (B) R. mucronata and R. stylosa styles from the Daintree River, Australia, sympatric site. (C) Variation of style lengths of R. stylosa (green oval) and R. mucronata (orange oval) throughout the Indo-West Pacific region. Solid ellipses contrast sympatric samples. (D) General tree morphology of R. mucronata and R. stylosa from the Daintree River, Australia, sympatric site. (E) Diagrams showing the habitat preferences of R. mucronata and R. stylosa in a typical estuary (adapted from mangrove ID [75]).
Characterizations of R. Mucronata and R. Stylosa in sympatry
We next apply the ‘secondary sympatry’ test to these two species found at the DR site of northern Australia. DR is at the periphery of the distribution of either species (Fig. 2) with R. mucronata to the west and R. stylosa to the north. It appears that speciation between them had been completed in allopatry and the post-speciation contact happened in DR (see also Supplementary Notes). Importantly, the two species have remained distinct in their ecology and morphology for a substantial period without intermingling. The two extant species rarely produce F1 hybrids and morphological intermediates are uncommon in our field work. In particular, as we observed in the field trips, the style lengths of DR samples are concordant with that of the allopatric populations of the same species (Figure 3B and C). In the DR area, these two species are parapatric-sympatric with distributions up- or down-river and extensive overlap in the middle (Fig. 3E). This difference in habitat preference is a hallmark of the speciation between R. mucronata and R. stylosa.
To elaborate on the phylogenetic positions of the DR samples in Fig. 2A, we used Bayesian clustering analysis implemented in ADMIXTURE [42]. We identified two genetic components that make up DR sample genomes (Fig. 4A). PCA results also indicate significant admixture in m1 and s1 individuals (Supplementary Fig. S7a). Furthermore, because species divergence is monophyletic in all allopatric comparisons, incomplete lineage sorting is an unlikely cause of the observed admixture in the DR samples (see also Supplementary Notes). In short, we interpret the high FST sites as representing divergence after speciation (Fig. 2C) with subsequent admixture in the DR area. Additional tests of introgression (LD analysis, Patterson's D statistic, and the modified fd statistic) are presented in the Supplement (Tables S8 and S10 and Figs. S8 and S9).
Figure 4.
Interspersion between introgressions and non-introgressions. (A) Genetic clustering of all 52 individuals of the two species by ADMIXTURE (K = 2). Orange and green colors denote the R. mucronata (m) and R. stylosa (s) components, respectively. Individuals from each population are grouped between adjacent black lines. (B) Distribution of i-allele occurrences in m1 (orange) and s1 (green) populations. Given the five individuals (or 10 haploid genomes) from each population, the occurrence ranges from 0 to 10. Exact numbers are given in the inset table (see also Supplementary Fig. S10). (C) A schematic diagram for delineating i-blocks, each harboring consecutive i-sites without being interrupted by d-sites. The length of an i-block is determined by the midpoints of the flanking (d and i) intervals. Three i-blocks are shown. (D) The genome-wide landscape of i-blocks. Top 18 longest scaffolds (chr1–18) and the rest of the genome (others) are marked and sibling scaffolds are distinguished by gray rectangles. Each row indicates an individual with each vertical line indicating a site. All 10 individuals from the sympatric s1 and m1 populations are shown. For comparison, one individual is randomly selected from each of other populations (see Fig. S12 for the full display). Each site is color-coded for its genotype: MM (orange), MS (light green) and SS (green) type, where M stands for R. mucronata and S for R. stylosa. The percentages are summarized on the right. Note that extensive interspersions are observed only in the sympatric samples. (E) A close-up view of i-blocks in the longest scaffold (chr1). Only 10 individuals from the sympatric s1 and m1 populations are shown.
Extensive introgression in sympatry
Using the sympatric samples, we ask the following questions: (1) How many introgressed segments can be found in each species? (2) Is the introgression symmetric? (3) What is the introgressed segment size distribution? A few large blocks are expected if hybridization was recent but many fine-grained blocks should result from old introgressions that have been eroded by recombination. (4) How many genomic segments fail to introgress and what is their genic content? Question 4 will be the subject of the next section.
To identify DR area introgressions, we first define divergent sites (or d-sites) between R. stylosa and R. mucronata in allopatry. Among the d-sites, we can then define introgressed sites (i-sites) between the m1 and s1 samples in sympatry. There are 305 418 d-sites, defined as sites with FST >0.8 between the two species in allopatry. Note that the bulk of d-sites (212 626) are fully divergent with FST = 1.0 (Fig. 2C). An i-site is a locus where the introgressed allele (or i-allele) is found in >= n of the 10 genomes in m1 or s1 sample sets, where n is usually equal to 8. (Note that both m1 and s1 samples have five diploid individuals, or 10 genomes.) In general, if introgression is observed in one direction, say from R. stylosa to R. mucronata in the DR area, the same site usually does not show introgression (n <= 1) in the reciprocal direction from R. mucronata to R. stylosa. (In this context, a non-introgressable site, or j-site, is defined as the d-site which contains zero or only one i-allele in both m1 and s1 samples; see Materials and Methods, and Supplementary Methods. Note that a small fraction of d-sites are statistically undefinable between i- and j-sites.)
To call i-sites, we first define n (the number of genomes carrying the introgressed allele). It is obviously better to set n close to the maximum of 10 for strongly penetrant introgressions. Figure 4B shows the number of introgressions in the two directions. We set n = 8 for the m1 samples where the i-allele is usually found >= 8 times (orange bars in Fig. 4B). Hence, the results with n = 2 and n = 8 would not be very different. Furthermore, to avoid the confounding presence of remnant ancient polymorphisms, we require introgressions at an i-site to be strongly asymmetric: >= n one way (say, from R. stylosa to R. mucronata) and <= 1 in the reciprocal direction (Supplementary Fig. S10). I-allele counts are close to uniformly distributed between 2 and 10 (green bars in Fig. 4B) among R. stylosa (s1) samples. The asymmetry is probably due to the geography of the DR area, which is at the fringe of the R. mucronata distribution. Consequently, gene flow from R. mucronata into R. stylosa may be more limited here, resulting in the lower frequency of introgressions in the s1 samples. In this regard, setting n = 8 would miss many introgressions in R. stylosa leading to a much lower introgression rate than in R. mucronata. Nevertheless, the final estimates appear robust even when n is set as low as 2 (see below). Simulations of these scenarios are presented in the Supplement.
Introgression necessarily starts with the exchange of large genomic segments, but subsequent recombinations would gradually erode the introgressed segments into smaller fragments, resulting in short segments of DNA consisting of consecutive i-sites. We call these segments ‘introgression blocks’ (or i-blocks; Supplementary Fig. S11). Figure 4C shows a segment of the genome that comprises a string of d-sites and i-sites as defined above. These d- and i-sites are embedded in a background of low-FST or invariant sites shown as dots. This figure shows three i-blocks, each consisting of one, two, or three i-sites. The length of each block is defined by the distance between the two breakpoints flanking it. Unless otherwise specified, we remove the singleton i-blocks that harbor only a single i-site when presenting i-block length distributions.
The analysis of i-blocks is summarized in Table 1 (see also Supplementary Tables S11–S13 and Supplementary file2). We focus on the results with n = 8 but the results with n = 2 and n = 10 are given for comparison. In the DR area, R. mucronata (m1) samples harbor far more introgressions than R. stylosa (s1). The bottom of Table 1 at n = 8 shows that 16.09 or 23.09% of the R. mucronata genomes are introgressions from R. stylosa, the two values depending on whether singleton i-blocks are counted. In the opposite direction, 7.97–12.06% of the R. stylosa genomes are introgressions. The introgressions in Table 1 are visualized in Fig. 4. The salient observation is the highly fine-grained nature of the introgressions (Fig. 4 and Supplementary Figs. S12–S13). In R. mucronata, the introgressions are distributed over 9963 i-blocks with an average length of 3.24 Kb (Table 1 and Supplementary file2). In R. stylosa, there are 3874 i-blocks with an average size of 4.13 Kb (Table 1 and Supplementary file2). Thus, there likely were numerous recombination events that broke introgressions into thousands of tiny i-blocks. It should be noted that Table 1 and Fig. 4 present only the extreme cases of introgressions that rise to very high frequencies.
Table 1.
Summary of high-penetrance introgressed i-blocks between sympatric species.
| >= 2 occurrences of i-allele | >= 8 occurrences of i-allele | =10 occurrences of i-allele | ||||
|---|---|---|---|---|---|---|
| m1 | s1 | m1 | s1 | m1 | s1 | |
| No. of i-blocks (No. of scaffolds with i-blocks) | 9742 (18) | 8228 (18) | 9963 (18) | 3874 (18) | 9741 (18) | 2219 (18) |
| Length of i-block—Range (mean) | 5 bp −944.95 Kb (3.14 Kb) | 8 bp–938.88 Kb (3.49 Kb) | 5 bp–944.95 Kb (3.24 Kb) | 8 bp–938.88 Kb (4.13 Kb) | 5 bp–944.95 Kb (3.02 Kb) | 8 bp–137.48 Kb (4.15 Kb) |
| No. of i-sites in a block—Range (total i-sites) | 2–356 (40 780) | 2–116 (42 862) | 2–349 (42 572) | 2–116 (19 958) | 2–132 (40 727) | 2–100 (14 868) |
| Total length of i-blocks (% of the genome) | 30.63 Mb (15.26%) | 28.73 Mb (14.31%) | 32.29 Mb (16.09%) | 16.00 Mb (7.97%) | 29.54 Mb (14.72%) | 9.22 Mb (4.59%) |
| Total length of i-blocks a (% of the genome) | 45.33 Mb (22.58%) | 39.61 Mb (19.74%) | 46.33 Mb (23.09%) | 24.21 Mb (12.06%) | 43.14 Mb (21.49%) | 15.57 Mb (7.76%) |
Note that the species origin of introgressed alleles (i-alleles) is first defined in the allopatric populations. Hence, i-alleles in the DR area can be identified even when they are bi-directional. All i-alleles in this table are uni-directional with, for example, > = 8 in one direction, while the reciprocal direction has < = 1 i-allele. An introgressed block (i-block), unless explicitly stated, has > = 2 introgressed sites (i-sites). The 18 scaffolds collectively account for 84.38% (200.70 Mb) of the whole genome (237.85 Mb). aThese include singleton i-blocks.
The distributions of i-blocks are shown at the large genomic scale in Fig. 4D, at the scaffold scale in Fig. 4E and as individual sites in Supplementary Fig. S13A–S13C. Note that only d-sites and i-sites are displayed in these figures. As shown in Fig. 2C, the d- and i-sites are the 305 418 sites with FST > 0.8. The rest are invariant or close to invariant sites. The i-blocks are dispersed across the whole genome (Fig. 4D and Supplementary Fig. S12). Indeed, all top 18 scaffolds harbor numerous transitions between i- and d-blocks both in m1 and s1 genomes (Fig. 4D and Table 1). Figure 4E shows that transitions between i- and d-blocks can occur in a few to tens of Kb. At the site level, i-blocks and d-blocks can switch within a small distance (Supplementary Fig. S13A–S13C). An i-block (or d-block) may harbor only one i-site (or d-site), referred to as a singleton block (Table 1, Supplementary Table S11 and Fig. S13). Singleton blocks, not uncommon but less reliable, are not used in the tally.
The extensive fine-grained introgressions convey two messages. First, hybridizations may happen continually over a long period of time. Each hybridization event would initially bring in whole-chromosome introgressions that are subsequently broken down by recombination. Small DNA fragments may have been continually introgressed in this piecemeal manner. Second, loci underlying differential adaptation between species may be very common such that introgressions tend to be small, and thus free of the introgressed alleles that are deleterious in the genetic background of another species [43]. In the next section, we will direct our attention toward non-introgressions, which are blocks of native alleles flanked by introgressed DNA segments.
Very fine-grained interspersion between ‘introgressable’ and ‘non-introgressable’ blocks
Some DNA segments may not be introgressable due to the presence of adaptively significant genes. Such loci, by definition, contribute to reproductive isolation or ecological speciation [5,44] and have sometimes been referred to as ‘speciation genes’ [11,45–50]. The number, size, and direction of introgressions, therefore, depend on several parameters: (1) hybridization rate; (2) the strength of selection against the speciation genes when introgressed; (3) the number and location of speciation loci; (4) recombination rate; and (5) the length of time since initial hybridization.
To test the evolutionary forces shaping the genomic pattern of introgression, we carried out computer simulations based on the Recurrent Selection and Backcross (RSB) model [51] in a companion study [6] (see also Materials and Methods, and Supplementary Methods). The RSB model has been proposed for identifying genes’ underlying complex traits [51]. It involves repeated dilution of the genome of breed A (say, the bulldog) with that of breed B (e.g. the Border Collie) while retaining the desired phenotypic traits of the former. This is done by continually selecting for the traits of breed A while backcrossing to breed B. The scheme is almost identical to the process of ‘speciation with gene flow.’ They differ only in the parameter values; for example, the length of time in speciation is far greater and gene flow is much smaller, and often bidirectional. The differences necessitate separate simulations for speciation with gene flow. As shown in Fig. 2 of Wang et al. [6] and Supplementary Figs. S13 and S14 in this study (see also Supplementary Methods and Fig. S14), introgressions are fine-grained around almost all speciation genes. These patterns resemble the observations reported in this study.
Consistent with the observation from computer simulations, we indeed observed a number of non-introgressable genome blocks in R. mucronata and R. stylosa. With the j-site defined above, a j-block (i.e. non-introgressable block) is defined as a DNA segment containing at least one j-site (Table 2). Using these stringent criteria, we see ∼1200 j-blocks which together account for <1% of the genome and harbor 328 coding genes of which 171 contain j-sites (Table 2, Supplementary Table S14 and Fig. S15). For higher confidence, we also show j-blocks with at least two j-sites (Table 2). While only 19 genes containing j-sites are found in these j-blocks (Supplementary Table S15), it is remarkable that six of the 19 genes function in flower development and/or gamete production and development as shown in Table 2 (see the WEGO gene ontology in Supplementary Fig. S15, where a larger set of genes is presented under less stringent criteria). One (RM_77078.7) of the six genes, known as EMF1 (embryonic flower 1), regulates reproductive development and is involved in controlling flowering development [52,53]. RM_76773.10, RM_76979.9 (NAC2) and RM_77530.24 are involved in regulating the stamen development, pollen germination and tube growth [54–56]. RM_76929.10, RM_76979 and RM_77333.68 all play a role in embryonic development [57–59]. Since all six genes contain highly differentiated amino acids and non-introgressable sites (j-sites) (Table 2, Supplementary Table S15 and Fig. S16), their involvement in speciation between R. mucronata and R. stylosa seems plausible.
Table 2.
High-confidence non-introgressable j-blocks for the identification of genes involved in speciation.
| > = 1 j-sites | > = 2 j-sites | ||||
|---|---|---|---|---|---|
| No. of j-blocks (No. scaffolds with j-blocks) | 1189 (184) | 168 (44) | |||
| Length of j-blocks—Range (mean) |
3 bp–43.51 Kb (1 010 bp) |
23 bp–35.75 Kb (1 062 bp) |
|||
| No. of j-sites in a block—Range (total j-sites) | 1–6 (1443) | 2–6 (422) | |||
| Total length of j-blocks (% of the genome) | 1 201 823 bp (0.51%) | 178 520 bp (0.075%) | |||
| No. of genes with j-sites | 171 | 19 | |||
| No. of genes of flower development with j-sites | – | 6 (see below) | |||
| Gene name | j-sites | L(aa) | Site a | Function in Arabidopsis thaliana | |
|
RM_76773.10
(AT2G14110) |
3 | 255 | 4 | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein. Participating in pollen germination and tube growth [54]. | |
|
RM_76929.10
(AT1G55490) |
2 | 294 | 3 | CPN60B. Mutants in this gene develops lesions on its leaves and accelerated cell death due to heat shock stress. Participating in embryo and seed development [57]. | |
|
RM_76979.9
(AT3G15510) |
2 | 199 | 1 | NAC2 (NAC domain containing protein 2). Involved in the regulation of stamen development and embryonic development [55,58]. | |
|
RM_77078.7
(AT5G11530) |
3 | 1415 | 1 | EMF1 (embryonic flower 1). Involved in regulating reproductive development [52,53]. | |
|
RM_77333.68
(AT1G08520) |
2 | 755 | 1 | ALB1. Encoding the CHLD subunit of the Mg-chelatase enzyme which is involved in chlorophyll biosynthesis. Participating in embryo and seed development [59]. | |
|
RM_77530.24
(AT3G05420) |
2 | 671 | 2 | ACBP4 (acyl-CoA binding protein 4). Expressed and function in floral lipid metabolism. Playing combinatory roles in pollen development [56]. | |
A j-block, unless explicitly stated, has > = 2 non-introgressable sites (j-sites). j-sites: the number of non-introgressable sites within the gene. L(aa): amino acid sequence length of the gene. aSite: No. of highly differentiated amino acids between R. mucronata and R. stylosa are given (see Supplementary Fig. S16).
DISCUSSION
R. mucronata and R. stylosa in the DR area affirm their species status as they retain their biological characteristics in sympatry. However, we found numerous introgressions where the two species coexist. Since these exchanged segments are small on average, we infer that post-speciation gene flow may have lasted for a long time. There appear to be few exchanges at present with rare F1 hybrids found in DR, as well as at other sampling sites. For example, the m2/s2 collections from Singapore show the expected phylogenetic relationship of their species designation (Fig. 2A and B). Low hybridizations at present have also been reported in Brandan, Indonesia [60]; Panay Island, Philippines; Kosrae, Micronesia; Yap, Micronesia; and North Sulawesi, Indonesia [61].
R. mucronata and R. stylosa appear to have come into sympatry in the DR area at the right time for extensive post-speciation gene flow to occur. Northern Australia has been suggested as the place where the two species first came into contact [31]. In this interpretation, the two diverging taxa moved eastward either along the northern coasts of the Indian Ocean to Southeast Asia or by crossing the Indian Ocean to Australia [31]. R. mucronata in Southeast Asia then dispersed south and eastward to Australia, while R. stylosa in Australia migrated further east and north into SE Asia (see also Supplementary Notes) [31].
The fine-grained introgressions we find in R. mucronata and R. stylosa suggest that several conditions during these species’ evolution. First, the diverging populations must have been at the right evolutionary stage when they first came into contact. Had the secondary contact happened earlier, the process of speciation could have been arrested or reversed. Conversely, if the contact would have happened too late, there would be too little gene flow to achieve extensive introgression. The second condition may be even more difficult to satisfy: the two species had to remain in secondary contact for a long time [4]. This is because numerous recombination events, accumulated over a long period, are necessary to achieve the fine-grained introgression. The third condition is ecological. Sympatric species without niche separation would face competitive exclusion [62]. R. mucronata and R. stylosa had evolved a degree of niche separation that results in incomplete overlap in habitat preference (Fig. 3E).
The fine-grained pattern of introgression between R. mucronata and R. stylosa we find is exceptional in the literature. Nevertheless, the rarity of such observations does not necessarily mean that post-speciation gene flow is unusual. It only means that the opportunities for such observations may be rare. Indeed, R. mucronata and R. stylosa in the DR area represent the confluence of the three conditions presented above. Post-speciation gene flow could not be easily detected without careful planning.
The extensive gene flow in secondary sympatry also appears to remove the main doubts about sympatric speciation which permits nearly unimpeded gene flow. However, in the genic view of speciation, the homogenizing effect of gene flow is thwarted by diverging selective pressures between two adjacent environments. This force opposing the homogenization is not operative in sympatry as there is only one shared environment. Hence, the fine-grained introgressions observed in secondary sympatry here can shed light on the nature of sympatric speciation. A recent study addresses this issue of micro-parapatry vs. sympatry between two sub-species of fishes that evolved divergently in a fully isolated crater lake.
In this study, we interpret BSC to require complete RI as proposed in the original literature [3] (see [5] for further analyses). Although many have argued that BSC should tolerate ‘a little’ gene flow (textbooks like Futuyma's) [1], the argument is conceptually inconsistent with BSC. It also makes RI an operationally undefinable quantity. After all, genomic studies have suggested that more than a third of the genome may be exchangeable between species [5,6,11,25,63]. In conclusion, BSC and the full RI should be abandoned as a key criterion for species delineation given the many recent genomic studies [11,29,43,64–68] and the observations reported here.
MATERIALS AND METHODS
Plant material, genome sequencing and assembly
R. mucronata and R. stylosa individuals were collected for whole-genome sequencing from Dongzhai Harbor, Hainan, China (110°35'5.79'' E, 19°56'39.67'' N, Supplementary Tables S1 and S2), although the R. mucronata individual was originally introduced from Australia [69]. Genomic DNA extraction from leaves was performed following the CTAB method [70]. Total RNA was extracted from leaves, flowers, and fruits using the modified CTAB method [71]. Short-read libraries were constructed and sequenced using the BGISEQ-500 platform; 50 Kb long-read libraries were prepared using the 10X Genomics (Illumina Hiseq X Ten) platform. We also re-sequenced 31 R. mucronata individuals from seven populations and 21 R. stylosa individuals from four populations using the Illumina Hiseq 2000 platform. The detailed methods of sequencing, assembling, annotation, collinearity analysis, SNP calling and divergent time estimation are given in Supplementary Methods.
Detecting gene flow
We applied Patterson's D statistic and a modified fd statistic to quantify gene flow [72,73]. A positive D or fd value is an indicator of introgression. The basic model has three ingroups (P1, P2, and P3) and the outgroup (O) in the genealogical relationship (((P1, P2), P3), O). In our analysis, P1 and P2 are different populations from the same species R. mucronata (or R. stylosa), while P3 corresponds to the other species. The outgroup is R. apiculata [30]. Positive D values imply that P2 and P3 have more shared alleles than P1 and P3. The plink-1.07 [74] software package was used to estimate linkage disequilibrium (LD), represented by the r2 statistic within each population or group. LD decay was used to test for the presence of admixture events. We also calculated LD decay in sympatric populations in Singapore (s2 and m2) and allopatric R. mucronata and R. stylosa populations as controls.
Genomic scan for introgressed and non-introgressable blocks
We then identified introgressed sites (i-sites) and non-introgressable sites (j-sites) by using the population sequencing data. To probe the influences of hybridization, selection, and recombination on genomic sequences, we further carried out computer simulations. The detailed methods of genomic scanning for introgressed (i-blocks) and non-introgressable blocks (j-blocks) and simulations are given in Supplementary Methods.
DATA AVAILABILITY
The sequences of this study have been deposited in National Genomics Data Center (NGDC), China National Center for Bioinformation. The whole genome sequence of R. mucronata and R. stylosa have been deposited in the Genome Warehouse (https://ngdc.cncb.ac.cn/gwh) in NGDC, under accession numbers GWHBGBM00000000 and GWHBGBK00000000 with BioProject ID PRJCA001504. Genomic raw reads of R. mucronata and R. stylosa individuals have been deposited in the Genome Sequence Archive (https://ngdc.cncb.ac.cn/gsa) in NGDC, under accession number CRA002289 and CRA001688 with the same BioProject ID PRJCA001504.
Supplementary Material
Acknowledgements
We thank Loren Rieseberg, James Mallet, Matthew Hahn, Richard Abbott, Patrik Nosil, Jeffrey Feder, Trevor Price, Nicholas Barton and Nolan Kane for insightful comments. We thank Wei Lun Ng for the photo of Malaysia R. mucronata style in Fig. 3A.
Contributor Information
Xinfeng Wang, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China; Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China.
Ziwen He, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China.
Zixiao Guo, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China.
Ming Yang, Department of Laboratory Medicine & Pathology, University of Washington, Seattle, WA, 98195, USA.
Shaohua Xu, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China.
Qipian Chen, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China.
Shao Shao, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China.
Sen Li, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China.
Cairong Zhong, Hainan Academy of Forestry (Hainan Academy of Mangrove), Haikou, 571100, China.
Norman C Duke, Centre for Tropical Water and Aquatic Ecosystem Research, James Cook University, Townsville, QLD 4811, Australia.
Suhua Shi, State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, 510275, China.
FUNDING
This work was supported by the National Natural Science Foundation of China (31830005 and 31971540), the National Key Research and Development Plan of China (2017FY100705), the Guangdong Basic and Applied Basic Research Foundation (2019A1515010752), the Science and Technology Project of Guangzhou (202102020483), and the Innovation Group Project of Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai) (311021006).
AUTHOR CONTRIBUTIONS
S. Shi and Z.H. conceived the study. X.W., Z.H., Z.G., M.Y, S.X., Q.C. and S. Shao analyzed the data. X.W. collected the sequencing data. S. Shi, Z.G., S.L., C.Z. and N. C.D. contributed to sample collection. X.W., Z.H., Z.G. and S. Shi prepared the manuscript. All authors read and approved the final manuscript.
Conflict of interest statement
None declared.
REFERENCES
- 1. Futuyma DJ, Kirkpatrick M. Evolution. Sunderland, MA: Sinauer Associates, Inc., 2017. [Google Scholar]
- 2. Dobzhansky T. Genetics and the Origin of Species. New York: Columbia University Press, 1937. [Google Scholar]
- 3. Mayr E. Animal Species and Evolution. Cambridge, MA: Harvard University Press, 1963. [Google Scholar]
- 4. Coyne JA, Orr HA. Speciation. Sunderland, MA: Sinauer Associates, Inc., 2004. [Google Scholar]
- 5. Wu C-I. The genic view of the process of speciation. J Evol Biol 2001; 14: 851–65. 10.1046/j.1420-9101.2001.00335.x [DOI] [Google Scholar]
- 6. Wang X, He Z, Shi S, et al. Genes and speciation: is it time to abandon the biological species concept? Natl Sci Rev 2020; 7: 1387–97. 10.1093/nsr/nwz220 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Wu C-I, Davis AW. Evolution of postmating reproductive isolation: the composite nature of Haldane's rule and its genetic bases. Am Nat 1993; 142: 187–212. [DOI] [PubMed] [Google Scholar]
- 8. Sawamura K, Davis AW, Wu C-I. Genetic analysis of speciation by means of introgression into Drosophila melanogaster. Proc Natl Acad Sci USA 2000; 97: 2652–5. 10.1073/pnas.050558597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Palopoli MF, Wu C-I. Genetics of hybrid male sterility between Drosophila sibling species: a complex web of epistasis is revealed in interspecific studies. Genetics 1994; 138: 329–41. 10.1093/genetics/138.2.329 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Cabot EL, Davis AW, Johnson NA, et al. Genetics of reproductive isolation in the Drosophila simulans clade: complex epistasis underlying hybrid male sterility. Genetics 1994; 137: 175–89. 10.1093/genetics/137.1.175 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Wu C-I, Ting CT. Genes and speciation. Nat Rev Genet 2004; 5: 114–22. 10.1038/nrg1269 [DOI] [PubMed] [Google Scholar]
- 12. Meier JI, Marques DA, Mwaiko S, et al. Ancient hybridization fuels rapid cichlid fish adaptive radiations. Nat Commun 2017; 8: 14363. 10.1038/ncomms14363 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Seehausen O. Patterns in fish radiation are compatible with Pleistocene desiccation of Lake Victoria and 14 600 year history for its cichlid species flock. Proc R Soc B Biol Sci 2002; 269: 491–7. 10.1098/rspb.2001.1906 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Ryan MJ. Food, song and speciation. Nature 2001; 409: 139–40. 10.1038/35051671 [DOI] [PubMed] [Google Scholar]
- 15. Lamichhaney S, Berglund J, Almén MS, et al. Evolution of Darwin's finches and their beaks revealed by genome sequencing. Nature 2015; 518: 371–5. 10.1038/nature14181 [DOI] [PubMed] [Google Scholar]
- 16. Barluenga M, Stölting KN, Salzburger W, et al. Sympatric speciation in Nicaraguan crater lake cichlid fish. Nature 2006; 439: 719–23. 10.1038/nature04325 [DOI] [PubMed] [Google Scholar]
- 17. Martin CH. Strong assortative mating by diet, color, size, and morphology but limited progress toward sympatric speciation in a classic example: Cameroon crater lake cichlids. Evolution 2013; 67: 2114–23. [DOI] [PubMed] [Google Scholar]
- 18. Richards EJ, Poelstra JW, Martin CH. Don’t throw out the sympatric speciation with the crater lake water: fine-scale investigation of introgression provides equivocal support for causal role of secondary gene flow in one of the clearest examples of sympatric speciation. Evol Lett 2018; 2: 524–40. 10.1002/evl3.78 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Yang M, He Z, Shi S, et al. Can genomic data alone tell us whether speciation happened with gene flow? Mol Ecol 2017; 26: 2845–9. 10.1111/mec.14117 [DOI] [PubMed] [Google Scholar]
- 20. Wu C-I, Wang X, He Z, et al. Replies to the commentaries on the question of “Is it time to abandon the biological species concept?” Natl Sci Rev 2020; 7: 1407–9. 10.1093/nsr/nwaa117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Mallet J. Alternative views of biological species: reproductively isolated units or genotypic clusters? Natl Sci Rev 2020; 7: 1401–7. 10.1093/nsr/nwaa116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Gao L, Rieseberg LH. While neither universally applicable nor practical operationally, the biological species concept continues to offer a compelling framework for studying species and speciation. Natl Sci Rev 2020; 7: 1398–400. 10.1093/nsr/nwaa108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Butlin RK, Stankowski S. Is it time to abandon the biological species concept? No. Natl Sci Rev 2020; 7: 1400–1. 10.1093/nsr/nwaa109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Turner TL, Hahn MW, Nuzhdin SV. Genomic islands of speciation in Anopheles gambiae. PLoS Biol 2005; 3: 1572–8. 10.1371/journal.pbio.0030285 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Martin SH, Dasmahapatra KK, Nadeau NJ, et al. Genome-wide evidence for speciation with gene flow in Heliconius butterflies. Genome Res 2013; 23: 1817–28. 10.1101/gr.159426.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Abbott R, Albach D, Ansell S, et al. Hybridization and speciation. J Evol Biol 2013; 26: 229–46. 10.1111/j.1420-9101.2012.02599.x [DOI] [PubMed] [Google Scholar]
- 27. Poelstra JW, Vijay N, Bossu CM, et al. The genomic landscape underlying phenotypic integrity in the face of gene flow in crows. Science 2014; 344: 1410–4. 10.1126/science.1253226 [DOI] [PubMed] [Google Scholar]
- 28. Vijay N, Bossu CM, Poelstra JW, et al. Evolution of heterogeneous genome differentiation across multiple contact zones in a crow species complex. Nat Commun 2016; 7: 13195. 10.1038/ncomms13195 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. He Z, Li X, Yang M, et al. Speciation with gene flow via cycles of isolation and migration: insights from multiple mangrove taxa. Natl Sci Rev 2019; 6: 275–88. 10.1093/nsr/nwy078 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Xu S, He Z, Zhang Z, et al. The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole-genome sequencing. Natl Sci Rev 2017; 4: 721–34. 10.1093/nsr/nwx065 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Duke NC, Lo E, Sun M. Global distribution and genetic discontinuities of mangroves – emerging patterns in the evolution of Rhizophora. Trees 2002; 16: 65–79. 10.1007/s00468-001-0141-7 [DOI] [Google Scholar]
- 32. He Z, Xu S, Zhang Z, et al. Convergent adaptation of the genomes of woody plants at the land-sea interface. Natl Sci Rev 2020; 7: 978–93. 10.1093/nsr/nwaa027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Duke NC. Indo-West Pacific stilt mangroves: Rhizophora apiculata, R. mucronata, R. stylosa, R. x annamalai, R. x lamarckii. In: Elevitch CR (ed.) Traditional Trees of Pacific Islands: Their Culture, Environment, and Use. Holualoa, Hawaii: Permanent Agriculture Resources, 2006, 641–60. [Google Scholar]
- 34. Xie W, Guo Z, Wang J, et al. Evolution of woody plants to the land-sea interface – The atypical genomic features of mangroves with atypical phenotypic adaptation. Mol Ecol 2022; 1–15. 10.1111/mec.16587 [DOI] [PubMed] [Google Scholar]
- 35. Guo Z, Xu S, Xie W, et al. Adaptation to a new environment with pre-adaptive genomic features – Evidence from woody plants colonizing the land–sea interface. Plant J 2022; 111: 1411–24. 10.1111/tpj.15899 [DOI] [PubMed] [Google Scholar]
- 36. He Z, Feng X, Chen Q, et al. Evolution of coastal forests based on a full set of mangrove genomes. Nat Ecol Evol 2022; 6: 738–49. 10.1038/s41559-022-01744-9 [DOI] [PubMed] [Google Scholar]
- 37. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014; 30: 1312–3. 10.1093/bioinformatics/btu033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Nguyen LT, Schmidt HA, Von Haeseler A, et al. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 2015; 32: 268–74. 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol 2016; 33: 1870–4. 10.1093/molbev/msw054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Galinsky KJ, Bhatia G, Loh PR, et al. Fast principal-component analysis reveals convergent evolution of ADH1B in Europe and East Asia. Am J Hum Genet 2016; 98: 456–72. 10.1016/j.ajhg.2015.12.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Futuyma DJ. Evolution. Sunderland, MA: Sinauer Associates, Inc., 2005. [Google Scholar]
- 42. Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res 2009; 19: 1655–64. 10.1101/gr.094052.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Fang S, Yukilevich R, Chen Y, et al. Incompatibility and competitive exclusion of genomic segments between sibling Drosophila species. PLoS Genet 2012; 8: e1002795. 10.1371/journal.pgen.1002795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Schluter D. Evidence for ecological speciation and its alternative. Science 2009; 323: 737–41. 10.1126/science.1160006 [DOI] [PubMed] [Google Scholar]
- 45. Presgraves DC. The molecular evolutionary basis of species formation. Nat Rev Genet 2010; 11: 175–80. 10.1038/nrg2718 [DOI] [PubMed] [Google Scholar]
- 46. Mallet J. What does Drosophila genetics tell us about speciation? Trends Ecol Evol 2006; 21: 386–93. 10.1016/j.tree.2006.05.004 [DOI] [PubMed] [Google Scholar]
- 47. Sun S, Ting CT, Wu C-I. The normal function of a speciation gene, Odysseus, and its hybrid sterility effect. Science 2004; 305: 81–3. 10.1126/science.1093904 [DOI] [PubMed] [Google Scholar]
- 48. Ting CT, Tsaur SC, Wu ML, et al. A rapidly evolving homeobox at the site of a hybrid sterility gene. Science 1998; 282: 1501–4. 10.1126/science.282.5393.1501 [DOI] [PubMed] [Google Scholar]
- 49. Nosil P, Schluter D. The genes underlying the process of speciation. Trends Ecol Evol 2011; 26: 160–7. 10.1016/j.tree.2011.01.001 [DOI] [PubMed] [Google Scholar]
- 50. Wittbrodt J, Adam D, Malitschek B, et al. Novel putative receptor tyrosine kinase encoded by the melanoma-inducing Tu locus in Xiphophorus. Nature 1989; 341: 415–21. 10.1038/341415a0 [DOI] [PubMed] [Google Scholar]
- 51. Luo ZW, Wu C-I, Kearsey MJ. Precision and high-resolution mapping of quantitative trait loci by use of recurrent selection, backcross or intercross schemes. Genetics 2002; 161: 915–29. 10.1093/genetics/161.2.915 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Sánchez R, Kim MY, Calonje M, et al. Temporal and spatial requirement of EMF1 activity for arabidopsis vegetative and reproductive development. Mol Plant 2009; 2: 643–53. 10.1093/mp/ssp004 [DOI] [PubMed] [Google Scholar]
- 53. Park HY, Lee SY, Seok HY, et al. EMF1 interacts with EIP1, EIP6 or EIP9 involved in the regulation of flowering time in Arabidopsis. Plant Cell Physiol 2011; 52: 1376–88. 10.1093/pcp/pcr084 [DOI] [PubMed] [Google Scholar]
- 54. Wang Y, Zhang W-Z, Song L-F, et al. Transcriptome analyses show changes in gene expression to accompany Pollen germination and tube growth in Arabidopsis. Plant Physiol 2008; 148: 1201–11. 10.1104/pp.108.126375 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Mandaokar A, Thines B, Shin B, et al. Transcriptional regulators of stamen development in Arabidopsis identified by transcriptional profiling. Plant J 2006; 46: 984–1008. 10.1111/j.1365-313X.2006.02756.x [DOI] [PubMed] [Google Scholar]
- 56. Hsiao AS, Yeung EC, Ye ZW, et al. The Arabidopsis cytosolic Acyl-CoA-binding proteins play combinatory roles in pollen development. Plant Cell Physiol 2015; 56: 322–33. 10.1093/pcp/pcu163 [DOI] [PubMed] [Google Scholar]
- 57. Ke X, Zou W, Ren Y, et al. Functional divergence of chloroplast Cpn60α subunits during Arabidopsis embryo development. PLoS Genet 2017; 13: e1007036. 10.1371/journal.pgen.1007036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Kunieda T, Mitsuda N, Ohme-Takagi M, et al. NAC family proteins NARS1/NAC2 and NARS2/NAM in the outer integument regulate embryogenesis in arabidopsis. Plant Cell 2008; 20: 2631–42. 10.1105/tpc.108.060160 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Bryant N, Lloyd J, Sweeney C, et al. Identification of nuclear genes encoding chloroplast-localized proteins required for embryo development in Arabidopsis. Plant Physiol 2011; 155: 1678–89. 10.1104/pp.110.168120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Wee AKS, Takayama K, Chua JL, et al. Genetic differentiation and phylogeography of partially sympatric species complex Rhizophora mucronata Lam. and R. stylosa Griff. using SSR markers. BMC Evol Biol 2015; 15: 1–13. 10.1186/s12862-015-0331-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Yan Y-B, Duke NC, Sun M. Comparative analysis of the pattern of population genetic diversity in three Indo-West Pacific Rhizophora mangrove species. Front Plant Sci 2016; 7: 1434. 10.3389/fpls.2016.01434 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Hardin G. The competitive exclusion principle. Science 1960; 131: 1292–7. 10.1126/science.131.3409.1292 [DOI] [PubMed] [Google Scholar]
- 63. Ellegren H, Smeds L, Burri R, et al. The genomic landscape of species divergence in Ficedula flycatchers. Nature 2012; 491: 756–60. 10.1038/nature11584 [DOI] [PubMed] [Google Scholar]
- 64. Seehausen O. Hybridization and adaptive radiation. Trends Ecol Evol 2004; 19: 198–207. 10.1016/j.tree.2004.01.003 [DOI] [PubMed] [Google Scholar]
- 65. Rieseberg LH. A new model of speciation. Natl Sci Rev 2019; 6: 289–90. 10.1093/nsr/nwy111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Abbott RJ. A mixing-isolation-mixing model of speciation can potentially explain hotspots of species diversity. Natl Sci Rev 2019; 6: 290–1. 10.1093/nsr/nwy112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Price T. Allo-parapatric speciation goes offshore. Natl Sci Rev 2019; 6: 289. 10.1093/nsr/nwy110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Barton NH. Is speciation driven by cycles of mixing and isolation? Natl Sci Rev 2019; 6: 291–2. 10.1093/nsr/nwy113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Liao B, Zheng S, Chen Y, et al. Preliminary report on introduction of several Alien Mangrove plants in Dongzhai Harbor of Hainan province. J Cent SOUTH For Univ 2006; 26: 63–7. [Google Scholar]
- 70. Doyle JJ. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull 1987; 19: 11–5. [Google Scholar]
- 71. Yang G, Zhou R, Tang T, et al. Simple and efficient isolation of high-quality total RNA from Hibiscus tiliaceus, a mangrove associate and its relatives. Prep Biochem Biotechnol 2008; 38: 257–64. 10.1080/10826060802164991 [DOI] [PubMed] [Google Scholar]
- 72. Durand EY, Patterson N, Reich D, et al. Testing for ancient admixture between closely related populations. Mol Biol Evol 2011; 28: 2239–52. 10.1093/molbev/msr048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Green RE, Krause J, Briggs AW, et al. A draft sequence of the neandertal genome. Science 2010; 328: 710–22. 10.1126/science.1188021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007; 81: 559–75. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Duke NC. “World Mangrove iD: expert information at your fingertips” App Store Version 1.2, July 2017. MangroveWatch Publication, Australia - E-Book, 2017. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The sequences of this study have been deposited in National Genomics Data Center (NGDC), China National Center for Bioinformation. The whole genome sequence of R. mucronata and R. stylosa have been deposited in the Genome Warehouse (https://ngdc.cncb.ac.cn/gwh) in NGDC, under accession numbers GWHBGBM00000000 and GWHBGBK00000000 with BioProject ID PRJCA001504. Genomic raw reads of R. mucronata and R. stylosa individuals have been deposited in the Genome Sequence Archive (https://ngdc.cncb.ac.cn/gsa) in NGDC, under accession number CRA002289 and CRA001688 with the same BioProject ID PRJCA001504.




