Abstract
Allopatric speciation requiring an unbroken period of geographical isolation has been the standard model of neo-Darwinism. While doubts have been repeatedly raised, strict allopatry without any gene flow remains a plausible mechanism in most cases. To rigorously reject strict allopatry, genomic sequences superimposed on the geological records of a well-delineated geographical barrier are necessary. The Strait of Malacca, narrowly connecting the Pacific and Indian Ocean coasts, serves at different times either as a geographical barrier or a conduit of gene flow for coastal/marine species. We surveyed 1700 plants from 29 populations of 5 common mangrove species by large-scale DNA sequencing and added several whole-genome assemblies. Speciation between the two oceans is driven by cycles of isolation and gene flow due to the fluctuations in sea level leading to the opening/closing of the Strait to ocean currents. Because the time required for speciation in mangroves is longer than the isolation phases, speciation in these mangroves has proceeded through many cycles of mixing-isolation-mixing, or MIM, cycles. The MIM mechanism, by relaxing the condition of no gene flow, can promote speciation in many more geographical features than strict allopatry can. Finally, the MIM mechanism of speciation is also efficient, potentially yielding mn (m > 1) species after n cycles.
Keywords: speciation, gene flow, allopatry, geographical isolation, mangrove
SIGNIFICANCE STATEMENT
Mechanisms of species formation have always been a conundrum. Speciation between populations that are fully geographically isolated, or allopatric speciation, has been the standard solution in the last 50 years. Complete geographical isolation with no possibility of gene flow, however, is often untenable and is inefficient in generating the enormous biodiversity. By studying mangroves on the Indo-Malayan coasts, a global hotspot of coastal biodiversity, we were able to combine genomic data with geographical records on the Indo-Pacific Barrier that separates Pacific and Indian Ocean coasts. We discovered a novel mechanism of speciation that we call mixing-isolation-mixing (MIM) cycles. By permitting intermittent gene flow during speciation, MIM cycles can potentially generate species at an exponential rate, thus combining speciation and biodiversity in a unified framework.
INTRODUCTION
Speciation driven by geographical isolation with no possibility of gene flow, or strict allopatric speciation, has been the standard model of neo-Darwinism [1,2]. Although occasional exceptions are acceptable to this view [3–5], extensive violations of strict allopatry would contradict many of its central tenets. One of these tenets is the nature of species as defined by the Biological Species Concept [2]. The argument for strict allopatry has usually been that gene flow would homogenize diverging populations and retard speciation [2]. After the completion of speciation, secondary contact may lead to a hybrid zone but the process of speciation should have become irreversible by then [6–8].
The stringent requirement for complete geographical isolation, however, is not without difficulties. Chief among them is the paucity of geographical features that can fully stop gene flow to sustain long-term isolation. As a result, the observed biodiversity seems too extensive to rely solely on the limited opportunities for strict allopatric speciation [9]. Fisher outlined a verbal theoretical model of clinal speciation [10] and Endler suggested that parapatric speciation, arising between adjacent populations that continue to exchange genes at a reduced level, may be far more common than allopatry [11]. Divergent selection in parapatry can be sufficient to overcome the homogenizing effects of migration if individual genic effects are examined (see [6]). In this genic view, the level of divergence at the completion of speciation would be uneven across the genome. In particular, there may exist ‘genomic islands of speciation’ (GIS) that are involved in divergent adaptation or reproductive isolation [5,12–14].
Evidence for locus-dependent gene flow leading to the formation of GIS has been widely reported [5,12–14], although Cruickshank and Hahn rejected many reported examples of GIS as products of processes unrelated to speciation [15]. More generally, it has been pointed out that genomic data alone could not have the power to reject the allopatric model, even when GIS can be properly identified [16]. In particular, if geographical isolation arises between subdivided populations, allopatry would likely be falsely rejected. Other sources of data are therefore necessary.
The resolution of the issue of speciation with gene flow may be possible if historical data on geographical barriers, offering a temporal perspective, are available. Fauna and flora of the two ocean coasts delineated by the Indo-Pacific Barrier (IPB) are particularly suited to such inquiries (Fig. 1a). The Strait of Malacca, a main feature of the IPB, can impose large-scale geographical isolation for taxa with ocean-current-dependent migration. Unlike the isolation at the Isthmus of Panama, the IPB isolation is not permanent. When the sea level rose and fell periodically during the Pleistocene [17], the Strait of Malacca, which is much shallower than the two oceans, closed and opened intermittently to ocean currents and gene flow (Fig. 1b) [18]. The timing of the alternation of the phases can be inferred from geological records (Fig. 1b). Hence, the DNA divergence pattern can be superimposed on the geographical records of the physical barrier itself.
A larger issue raised by speciation mechanisms concerns biodiversity. There are a number of biodiversity centers globally. Among them, an exceptionally biota-rich region is found along the Indo-Malayan coasts [1,19]. Major groups of flora and fauna display unequalled species richness on these coasts [20–22]. Mechanisms of speciation have been proposed, and rejected, as an explanation for exceptionally high diversity in such hot-spots [23], for two reasons. First, these centers often do not have geographical features that can facilitate allopatric speciation by imposing long-term geographical isolation [19]. Second, speciation by strict allopatry (e.g. at the Isthmus of Panama) is not an efficient mechanism to generate the multitude of species because species would simply double in number. This study will attempt to connect speciation mechanisms and species richness in a single framework. Finally, given the breadth of the subject matter, necessary backgrounds and potential criticisms cannot be fully addressed in the main text. These additional topics are therefore presented in the section ‘Replies to objections to the MIM model’ of the Supplementary Note, available as Supplementary Data at NSR online.
RESULTS
In this study, we analyse the divergence in five distantly related taxa of mangroves, which are woody plants that independently invaded the intersection between land and sea within the last 100 million years (Myrs) [21,24–26]. Because mangroves occupy a narrow band along tropical coasts, their distributions are essentially one-dimensional, making it easier to identify geographical barriers between species. For mangroves along the two ocean coasts (referred to as the East vs. West regions in Fig. 1a), the barrier is often the Strait of Malacca, which opens and closes periodically to ocean currents, and serves as a conduit for mangrove seed dispersal when open (see ‘Introduction’). Globally, there are 70 or so mangrove species and >80% of them can be found along the Indo-Malayan coasts [27]. Many of these mangrove taxa have existed and undergone diversification only in this region. In contrast, only eight species exist in the New World tropics [21,24,28]. Since other taxa are also highly diverse on the Indo-Malayan coasts [29,30], the geographical mechanism of speciation in mangroves may be broadly applicable to other fauna and flora in this region.
In this study, we approach mangrove speciation from both ends: divergence between good species and differentiation between geographical populations. By doing so, we resolve the dilemma in studying speciation. The dilemma is that good species may be too divergent to inform about speciation events [12,15,17,31], but subspecies and geographical populations are not, and may not become, true species.
We have generated high-quality whole-genome sequences from multiple species of mangroves [26]. For the analysis of speciation history, many more samples but fewer genes per species are necessary to study population differentiation. The geographical populations of five common mangrove species on the two coasts to the East and West of the Strait are shown in Fig. 1a. Sampling was done in 14 locations of the three areas: Hainan Island in China (H), the East coast of Thailand facing the gulf (G) and the west coast of Thailand (W). In total, approximately 1700 individuals from the five species were collected (Supplementary Table 1, available as Supplementary Data at NSR online) and subjected to sequencing. Following a published method [32], we obtained an average of 70 Kb of sequence across 80 genes per individual (Supplementary Table 2, available as Supplementary Data at NSR online).
Speciation history in the Indo-Western Pacific (IWP)
The Indo-Malayan coasts, as part of the IWP and depicted in Fig. 1a, represent an important biodiversity hotspot. Near the tip of the Malay Peninsula, more than 20 mangrove species can be found in a local population ([33,34] and our field observations). At least nine mangrove genera had formed relatively recently originated species along these coasts and the recent speciation events (<4% divergence) are shown in Table 1. The five genera most actively involved in recent speciation are analysed in detail here. Documented hybridization is not uncommon in areas of sympatry (Table 1). Molecular typing has shown that all hybrids belong to first-filial generation [35–40] and exhibt poor pollen maturation or seed germination in the hybrids in planting experiments [41,42].
Table 1.
% Divergencea | |||||
---|---|---|---|---|---|
Genus | No. of species | No. of hybrids | Minimum | Maximum | Estimated divergence time (Myrs)b |
Ceriops | 5 | 1 | 0.28 | 2.10 | 0.97 ∼ 7.23 |
Rhizophora | 3 | 3 | 0.28 | 1.50 | 1.20 ∼ 6.44 |
Sonneratia | 6 | 4 | 0.70 | 3.75 | 1.25 ∼ 6.67 |
Avicennia | 5 (7)c | 1 | 0.80 | 3.70 | 1.31 ∼ 6.07 |
Aegiceras | 2 | 0 | 2.14 | 1.66 | |
Xylocarpus | 3 | 0 | 1.88d | 2.00 | |
Lumnitzera | 2 | 1 | 2.72 | 2.85 | |
Kandelia | 2 | 0 | 0.78 | 2.23 | |
Bruguiera | 6 | 1 | 0.99d | 4.63 |
aThe divergence level was estimated from 3–60 loci in each genus. Sequences were obtained from published studies or by ourselves and deposited in GenBank (see Supplementary Table 5, available as Supplementary Data at NSR online).
bIn millions of years (Myrs). Divergence times were calculated as genetic divergence divided by the corresponding nucleotide substitution rate. The rate for Lumnitzera and Xylocarpus was estimated as 4.78 × 10−9/site/year based on internal transcribed spacers of the nuclear ribosomal DNA [77]. We obtained whole-genome and/or whole-transcriptome data for the other genera from a separate study ([26] and He et al., unpublished data). The substitution rate for each genus was then inferred for the specific branches using PhyML [91] and PAML [92] in conjunction with fossil dating. A further adjustment was made to compensate for the different substitution rates between coding and non-coding regions (see Supplementary Table 6 and Supplementary Note, available as Supplementary Data at NSR online).
cIf the three subspecies of Av. marina are counted separately, the number would be 7.
dDivergence information is available for only one pair of species in this genus.
The Strait of Malacca is a major geographical barrier for mangroves in the IWP (Fig. 1b and c). One example concerns the species pair Ceriops decandra and C. zippeliana (Fig. 1c), between which the extant boundary is right along the Strait. The species boundary between Sonneratia ovata and S. griffithii is broader but also falls along the Strait (Fig. 1d), whereas the ranges of Rhizophora mucronata and R. stylosa (Fig. 1e) overlap broadly along the two ocean coasts adjacent to the Strait of Malacca, although their general distributions suggest post-speciation dispersion across the Strait, which is in keeping with the better dispersal ability of Rhizophora compared to either Ceriops or Sonneratia [43]. Three other genera are likely to have experienced post-speciation migration through the Strait of Malacca, much like Rhizophora. They concern the species pairs: Avicennia rumphiana and Av. alba; Lumnitzera littorea and L. racemosa; and Bruguiera sexangula and B. gymnorhiza (Supplementary Fig. 1, available as Supplementary Data at NSR online). The geology of the region and the sea-level records are shown in Fig. 1a and b, and indicate that the East and West regions would be strongly isolated when the sea level drops below –25 meters, which is the historical norm.
It is important to point out that the Strait of Malacca connecting/separating the Pacific and Indian Ocean coasts is only one of many barriers in the IWP. Other geographical barriers can also be inferred. For example, the Torres Strait may have restricted the distributions of the sibling species Sonneratia caseolaris and S. lanceolata in northern Australia (Supplementary Fig. 1a, available as Supplementary Data at NSR online; reviewed in [44]). The biodiversity in the IWP in relation to these barriers will be discussed below.
Speciation with gene flow between the two ocean coasts
The time of species divergence in the nine genera listed in Table 1 was estimated for each node of the phylogenetic tree based on DNA-sequence data and the estimated species-specific nucleotide substitution rates (see Supplementary Note, available as Supplementary Data at NSR online). In eight of these nine genera, the most recent species divergence time is within the last 3 Myrs. The oldest divergence time in Table 1 is about 6.5 Myrs ago. The most recent events within each genus generally fall in the time frame depicted in Fig. 1b, which shows the possible periods of gene flow (above –25 meters, indicated by the red broken line).
A history of gene flow should be reflected in the genomic data because genomic segments involved in differential adaptation (in physiology, morphology, reproduction, etc.) should be more divergent than the rest of the genome [6,45]. Many statistical tests have been developed to test this hypothesis by asking whether the level of divergence is ‘over-dispersed’ across the genome. Here, we employed two methods (Supplementary Fig. 2, available as Supplementary Data at NSR online) on R. mucronata and R. stylosa (Fig. 1e), using the complete genome sequences published recently [26]. In the first method, the divergence level in genic regions is compared with that of intergenic regions (see Supplementary Note, available as Supplementary Data at NSR online, for definition) on the assumption that the former are more likely to be involved in differential adaptation than the latter [46]. The second method [47] relies on the variance in divergence across the genome. Both methods implement likelihood-ratio tests to compare the allopatric (H0) and speciation with gene flow (H1) models. In both methods, the null model is rejected with high confidence (P ∼ 0; Supplementary Table 3, available as Supplementary Data at NSR online), thus supporting the model of gene flow during speciation (see details for Supplementary Note, available as Supplementary Data at NSR online). In order to identify the genomic segments most likely involved in speciation, we conducted a sliding-window analysis. Very large GIS regions between R. mucronata and R. stylosa that are unusually divergent are shown in Fig. 1f (see legend). Four of them, marked by red arrows, are more stringently called. In total, 40 GIS segments are identified for a total of 4775 Kb, or 2.33% of the sequenced genome.
Figure 1f follows the standard procedure in testing ‘speciation with gene flow’ and rejects the null hypothesis. However, Yang et al. recently suggested that the statistical rejection is valid only for the simplest form of allopatry. For example, if gene flow occurs between geographical populations before, but not during, speciation, the null model would still be rejected, hence leading to the false rejection of allopatry [16,48]. In other words, the tests are done because the failure to reject would be biologically informative while the rejection is much less so. In cases of rejection, other types of data (e.g. geographical distributions of species and the nature of the putative barrier prior to the completion of speciation) need to be superimposed on the genomic analyses. In the remaining sections, such data will be used on geographical populations located along the two ocean coasts. The objective is to estimate the minimal time required for speciation, which will then be compared with the geological records of the geographical barrier itself.
Differentiation between geographical populations on the two ocean coasts
The Strait of Malacca has served as a geographical barrier leading to speciation in the past. We asked whether it continues to serve as a barrier for geographical differentiation at present. Morphological observations support the inference of East–West differentiation (see Fig. 2a and b) and DNA-sequence divergence provides the time depth of geographical differentiation. The latter is usually expressed by partitioning the diversity within and between regions. Both πR, the genetic diversity within each area (H, G or W), and πT, the total diversity of the species, are listed in Table 2 and legends, as well as Supplementary Table 4 and Supplementary Fig. 3, available as Supplementary Data at NSR online. Population divergence between regions, denoted by FST = (πT – πR)/πT [49], generally follows the speciation pattern.
Table 2.
C. tagal | R. apiculata | S. alba | Av. marina | Ae. corniculatum | |
---|---|---|---|---|---|
Samples and sequences | |||||
No. of stands (H, G, W)a | 2, 1, 1 | 3, 2, 1 | 2, 2, 3 | 2, 3, 1 | 3, 1, 2 |
Sample size (H, G, W) | 200, 100, 100 | 90, 68, 34 | 185, 100, 150 | 200, 89, 35 | 174, 50, 87 |
No. segments (Total Kb) | 102 (76.6) | 124 (65.2) | 101 (59.6) | 150 (85.2) | 115 (57.3) |
Differentiation between East and West regions | |||||
F ST (πT, πR)b (/Kb) | 0.267 (0.343, 0.251) | 0.48 (1.241, 0.645) | 0.67 (2.151, 0.701) | 0.72 (2.519, 0.705) | 0.75 (10.746, 2.640) |
Estimation of mutation rate and divergence | |||||
Mutation rate, μ (/Kb/generation)c | 1.80 × 10−5 | 1.63 × 10−5 | 2.84 × 10−5 | 3.26 × 10−5 | 4.06 × 10−5 |
N eμ in SIM (MIM) | 0.009 (0.007) | 0.029 (0.035) | 0.035 (0.030) | 0.020 (0.019) | 0.225 (0.069) |
N em in SIM (MIM)d | 0.348 (0.420) | 0.796 (0.282) | 0.541 (0.183) | 0.265 (0.091) | 0.878 (0.135) |
T min (Myrs) | 0.18 | 1.38 | 1.14 | 1.10 | 1.56 |
P e | 0.67 | >0.999 | >0.999 | >0.999 | >0.999 |
aH, Hainan; G, Gulf of Thailand; W, West coast of Thailand (see Fig. 1a).
bπR is the average within-region diversity, πT is the total diversity and FST = (πT – πR)/πT (see text). Nucleotide diversities for population stands (πS) are given in Supplementary Table 4, available as Supplementary Data at NSR online.
cMutation rate estimation is based on fossil record divergence time (see legends in Table 1).
d N eμ and Nem are estimated separately for the SIM and MIM models (see text and Supplementary Note, available as Supplementary Data at NSR online).
eMaximum likelihood probability that the MIM cycles model is better than the SIM model, based on 2000 replicate simulations.
One of the five species, Ceriops tagal has unusually low diversity (πT = 0.343/Kb, less than ¼ of that of the next lowest species) and hence little differentiation among all populations. Table 2 shows that all other species exhibit a larger πT than πR and strong population differentiation. Figure 2c shows pairwise differentiation patterns between the three geographical areas. The divergence is relatively low in the H–G comparison in the three species with intermediate diversity (Rhizophora apiculata, Sonneratia alba and Avicennia marina), despite substantial geographical distance between the two areas. Differentiation is mainly observed between coasts of the East (combining H and G areas) and West regions (see Fig. 2c and Supplementary Fig. 4, available as Supplementary Data at NSR online). Thus, these three species suggest a key role of the Strait of Malacca in the geographical isolation between the two ocean coasts. In the most diverse species, Aegiceras corniculatum, the East–West divergence is even stronger and an additional barrier (likely due to distance) also causes the divergence between the G and H populations (Supplementary Fig. 5c, available as Supplementary Data at NSR online).
Geographical differentiation can be analysed in greater detail by analysing haplotype structures. Three examples of haplotype networks are shown in Fig. 2d–f (see more cases in Supplementary Fig. 5, available as Supplementary Data at NSR online). The haplotypes can be clearly divided into two clades, referred to as the Eastern or Western haplotype, depending on where they are more commonly found. The existence of distinct haplotypes without intermediates usually indicates strong population differentiation [50]. Both the FST statistics and haplotype structures hence suggest strong differentiation between the East and West regions demarcated by the Strait of Malacca.
DNA-sequence divergence vs. geological record: how much time is needed for mangrove speciation?
Under the past sea-level changes [17], the East and West regions have experienced cycles of isolation and admixture due to the repeated opening and closing of the Strait (see Fig. 1b). To compare the geological records of barrier duration with the divergence history inferred from genomic sequences, it is necessary to estimate the time required for speciation to be completed (Tspp, or speciation time). This can then be compared to the isolation time (Tiso), the length of the periods when physical barriers to gene flow were recorded in historical data.
If we assume strict allopatry (Fig. 3a), speciation needs to be completed during geographical isolation, or Tspp < Tiso. From Table 1, species divergence takes 1.2–6.7 Myrs, with a mid-point Tspp of ∼4 Myrs. (The lowest estimate of 0.84 Myrs in Ceriops is less reliable due to its very low mutation rate; see Table 2). From Fig. 1b, Tiso is always smaller than 0.5 Myrs and rarely larger than 0.2 Myrs. Obviously, the allopatric condition of Tspp < Tiso is not met. Nevertheless, since the divergence time between good species given in Table 1 represents over-estimation of Tspp, the rejection of Tspp < Tiso is not informative.
We shall now use the lower bound estimate of Tspp against Tiso. This lower bound is the divergence time between geographical populations. A new population genetic framework is developed for the purpose of estimating Tspp between two randomly chosen genes from the same or different populations. This new framework is presented in detail in Supplementary Note, available as Supplementary Data at NSR online. It is distinct from previous models because it will be needed to compare the allopatric model (Fig. 3a) with our new MIM model (Fig. 3c) with multiple cycles of isolation and migration. The likelihood of observing various distributions of divergence is formulated as the function of Tspp, Ne and m, where Ne is the effective population size and m is the migration rate (Table 2). We then use the maximum-likelihood estimates (MLE) to obtain parameters (Table 2). Note that the null model here is strict allopatry, portrayed by the single isolation-mixing (SIM) cycle (Fig. 3a). If gene flow occurred during isolation, we would under-estimate Tspp and the rejection of allopatry would be conservative.
Figure 3b presents the estimated Tspp for the five species of mangroves under the allopatric SIM model. For a comparison, the temporal sequence of migration and isolation phases at the Strait of Malacca is also shown. With the exception of C. tagal, the estimated Tspp values exceed 1.2 Myrs in the four other species. As the null model of Tspp < Tiso is rejected, Tspp must span several cycles of isolation-mixing (see Fig. 3b).
Speciation through MIM cycles
Speciation in mangroves along the Pacific vs. Indian Ocean coasts had to go through cycles of isolation interspersed by episodes of gene flow, as recorded in the geological data (Fig. 3b). This mode of speciation will be referred to as the MIM model. The likelihood-ratio test (last row of Table 2) shows that the MIM model agrees with the observations better than the SIM model (Supplementary Note, available as Supplementary Data at NSR online), except in C. tagal, which has a very low species-wide polymorphism.
Under the MIM model, the distribution of neutral divergence among loci should be broader than under SIM, if everything else is equal. We use Dmax (differences between the two most divergent haplotypes at any locus) as the measure. The distribution of Dmax is given in Fig. 4a–c. The vertical red lines represent the average level of divergence between species or subspecies. All three species have many loci where Dmax is larger than the level of (sub-)species divergence (upper panels). These loci may reflect aspects of the East–West divergence due to geographical isolation. The standard deviations of Dmax are simulated and plotted (insets in Fig. 4a–c). The observations are indeed much larger than the predictions of the SIM model and fall within the simulated distributions under the MIM cycles. Thus, the divergence of mangroves on these coasts may have been influenced by periodic gene flow increasing among-locus variation.
Because isolation increases genetic variation, it also increases the effective population size. Hence, MIM and SIM models would show distinct patterns. As the genomes of three of the five species have been sequenced ([26] and He et al., unpublished data), we re-sequenced two additional individuals for each of the three species. The pairwise sequentially Markovian coalescent (PSMC) method [51] infers effective population sizes at different time points in the past by comparing haploid genomes. Periods of isolation are reflected in non-coalescence and can be defined as changes in effective population size.
The PSMC results on R. apiculata, S. alba and Av. marina are given in Fig. 4d. While PSMC is usually used to model the changes in population sizes, we use it here in the context of the timing of population differentiation on the Pacific vs. Indian Ocean coasts (see Supplementary Note, available as Supplementary Data at NSR online). All three species show very small effective population sizes in the last 20 000 years, corresponding to the retreat of the last global glaciation. Going back in time, the effective population sizes increase gradually, suggesting isolated populations that have had low or intermittent gene flow during the preceding 2 Myrs. The overall PSMC patterns indicate historical gene flow spread over a long span of time, in accordance with the geological records. Had the gene flow been concentrated in a short period, the simulated SIM model would yield a steep increase in effective population size during a very short window of time (Fig. 4d).
DISCUSSION
Gene flow is conventionally perceived as a homogenizing force that can reverse population divergence and block speciation (black line in Fig. 4e). This has been the principal consideration of the strict allopatric model of speciation. The absence of gene flow due to geographical isolation is eventually superseded by the evolution of reproductive isolation that underpins the Biological Species Concept [2,52]. In recent years, the genic perspective suggests that gene flow during speciation would not necessarily impede divergence, as long as selection is not swamped by migration (red line in Fig. 4e) [6,12,53,54]. By superimposing the genomic information on the geological records, this study demonstrates that speciation on the Indo-Malayan coasts must have progressed in alternate phases of gene flow and isolation.
The MIM model therefore bridges two large sets of speciation literature. In one set, the main concern has been about the geological and phylogeographical records of speciation, which have been expertly reviewed by Hewitt [55]. It is, however, not clear whether the phylogeographical literature has rejected the model of strict allopatry or has reinforced it. For example, depending on when a hybrid zone is formed, geographical records may either suggest ‘speciation with gene flow’ or reinforce the view that a hybrid zone would reverse divergence until after speciation is fait accompli [56,57]. In this backdrop, earlier cyclic hybridization models linking climatic cycles with speciation [58–60] are extensions of the allopatric model. In these extensions, speciation is completed in one cycle with full isolation followed by migration. The process would continue through cycles of geographical speciation and post-speciation range expansion.
A second set of the literature concerns the genomic divergence that can reveal the speciation history [6,12,14,61,62]. Nevertheless, as shown by several analyses [16,48], genomic data can inform about the occurrence of gene flow but not about when it happened. Gene flow prior to the onset of speciation might be misinterpreted to be gene flow during speciation. No less important, gene flow could be a continuous trickle or might be concentrated in short episodes of geographical panmixia, interspersed with periods of strict isolation. These isolation phases are important for the evolution of postmating reproductive isolation because incompatibility cannot evolve easily under gene flow [63,64]. In this sense, the MIM model has features of both allopatry and ‘speciation with gene flow’.
Interestingly, it has been posited that gene flow may even speed up speciation (the blue dotted line in Fig. 4e). This could happen if and when adaptive gene complexes, built up during isolation, are shuffled to generate many new combinations. Hybrid speciation [65–67] and adaptive radiation by hybrid swarms are such examples [68]. Furthermore, many domesticated breeds were indeed created by hybridization between existing varieties [69–71]. Thus, both plant and animal domestication resembles the MIM cycles, whereby breeds were separately domesticated with occasional exchange of genes. Although the idea of well-timed gene flow speeding up speciation is attractive, there is currently no evidence that it applies to mangrove speciation.
Finally, the MIM model may also bridge the gap between biodiversity and speciation studies. Many explanations have been proposed for the existence of biodiversity hotpots. Strangely, speciation has often been ruled out [72] as a mechanism of biodiversity, mainly for want of geographical features that can impose long-term isolation. With MIM cycles, the stringent requirement is relaxed and many geographical features could conceivably drive speciation. In the IWP, because the sea floor in the Indo-Malayan region has been relatively high, many shallow barriers have existed throughout the region [73]. When the global sea level began to decrease and fluctuate around that lower level 25 Myrs ago [17], many parts of the Indo-Malayan coasts may have experienced cycles of isolation and admixture. Indeed, as Renema et al. have pointed out, species diversity in the Indo-Malesia started to increase during the Miocene [23,74].
The MIM model may be applicable to other high-diversity spots as well. In the same time frame as mangrove speciation on the Indo-Malayan coasts, islands of the Aegean Archipelago in the Mediterranean may have been repeatedly connected and disconnected due to sea-level changes. Thus, the radiation of annual plants in the genus Nigella across this archipelago [75] could have also been driven by a mechanism like the MIM model. Similarly, MIM cycles may have driven: the extraordinary diversity of cichlid fish in Lake Victoria, which has experienced repeated rises and falls of water level [76]; diverse flora in neo-tropical rain forests subject to periods of cooler and drier climates driven by cyclical glacial events [77]; and avian diversification in the neo-tropics where fragmentation and reconnection of high-elevation habitats occurred during the late Pleistocene [78].
When diverging populations become fully fledged species, migration in the next M phase would be equivalent to range expansion. If speciation occurs after each isolation phase, there can be as many as 2n species after n cycles [58]. In that sense, the migration phase in the MIM cycles would play an added role in the evolution of biodiversity. More generally, isolation may create i fragmented populations. If speciation is achieved after j cycles, then the number of species after n cycles would be [i]n/j. In other words, the number of species after n cycles can potentially be mn where m = i1/j > 1. In the special case of i = 2 and j = 1, m = 2n. Centers of high biodiversity are fascinating phenomena with many possible causes [20–22,79,80]. We suggest that efficient speciation mechanisms like MIM cycles may play a role.
MATERIALS AND METHODS
Geographical distribution of mangrove species in IWP
The geographical distribution of each of the nine mangrove genera in the IWP (Kandelia, Aegiceras, Lumnitzera, Ceriops, Xylocarpus, Bruguiera, Avicennia, Rhizophora and Sonneratia) was compiled from World Mangrove ID [33]. Species distribution ranges of Ceriops were updated according to Tsai et al. [81]. The distributions of Rhizophora and Sonneratia in China were updated from the field survey data of Wang and Chen [82].
Scanning the genome for speciation islands
To identify genomic regions highly divergent between R. mucronata and R. stylosa, we performed a genome-wide divergence scan using absolute measures of differentiation. Re-sequencing data of one R. mucronata individual from Ranong, Thailand, and one R. stylosa individual from Hainan, China, were generated using Illumina HiSeq 2000 platform. Reads were mapped to the R. apiculata reference genome using the BWA software [83]. Heterozygous sites were called using the GATK pipeline [84]. We used sliding windows to scan divergence levels between the two species. We set the window size to 50 Kb and step size to 25 Kb. Windows with fewer than 10 000 sequenced sites were discarded. The divergence level of each retained window was calculated as the number of differentiated sites divided by the number of sequenced sites. Divergent sites were defined as loci homozygous within each species but different between taxa.
Sampling, sequencing and mapping
We collected leaf material from populations of five mangrove species from 15 stands in the three regions as shown in Fig. 1a and Supplementary Table 1, available as Supplementary Data at NSR online. For each species, at least one stand was sampled in each region and 19–100 individuals were collected from each stand. Intervals between sampled individuals were at least 5 meters. Sequencing protocols were as described in our earlier work [32]. An equal amount of leaf material from each individual in every stand was mixed before DNA extraction. Based on sequences from cDNA libraries of the species, we designed primers for over 150 loci for each species. We succeeded in amplifying approximately 70 genes per species (Supplementary Table 2, available as Supplementary Data at NSR online) and sequenced them using the Illumina GA-II/HiSeq 2000 platform. The short reads sequence data were deposited in NCBI, BioProject: PRJNA303892. We mapped short reads to references using MAQ [85] with main parameters -m 0.002 and -e 200 and the parameter -q 30 to filter low-quality reads. To reduce sequencing errors, we ignored bases that were (i) located in the first 11 bp or the last 7 bp of the mapped reads, (ii) with base quality less than 22 and (iii) with minimum coverage less than 100. Putative single nucleotide polymorphisms (SNPs) were called if the minor allele frequency was >0.01.
Haplotype inference
Using the linkage information for SNPs in each pair of short reads, we estimated haplotypes and their frequencies using an expectation-maximization algorithm [86,87]. We divided the genes into two or more segments if the distance between two SNPs was longer than the length covered by paired reads. To validate the estimated haplotype phases, we sequenced 360 alleles in eight populations using the Sanger method (Supplementary Dataset 1, available as Supplementary Data at NSR online). Haplotypes and their frequencies estimated using these two approaches were very similar. Short reads were informative for our haplotype analyses thanks to large sample sizes. We constructed haplotype networks for each gene segment based on the inferred haplotypes.
Estimating nucleotide diversity and population structure
Using the obtained haplotype profiles, we estimated nucleotide diversity (π, the average number of nucleotide mismatches per site between two sequences [88]) within a stand/population (πs), region (πR, with all areas weighted equally) and species (πT, with all three regions weighted equally) for each haplotype segment. We employed F-statistics at different levels to measure population differentiation () [49].
After carefully reviewing haplotype networks for the five species, we observed that haplotypes of many genes could be clustered into two distinct clades corresponding to the samples from the East and West Indo-Malayan regions. We therefore clustered the haplotypes into two clades using complete linkage method hierarchical clustering analysis [86]. We included segments with more than two SNPs, or two SNPs and two haplotypes. We calculated frequencies of haplotype clades in both regions.
Demographic models and parameter estimation
We used a maximum-likelihood method to estimate effective population size (Ne) and migration rate (m) for the SIM and MIM models. The SIM model requires an additional parameter: the isolation time imposed by the geographical isolation, as depicted in Fig. 3a. The time elements in the MIM model were defined by the geological records of sea-level changes. The mutation rate was inferred from exome/transcriptome data with the fossil records as described in the Table 1 legend.
The number of divergent nucleotides between two sequences sampled from the same populations was denoted as Dw, while differentiation between populations was denoted as Db (w stands for within population and b for between populations). The log-likelihood function can be constructed as follows:
(1) |
is the observed number of sequence pairs between populations where Db is equal to x and is the observed number of sequence pairs within a population where Dw is equal to y. The probability P of Db and Dw could be deduced using the transition probability matrix during the M phase and I phase, according to the coalescent process. The detail equations are given in the Equations S2–S9 in Supplementary Note, available as Supplementary Data at NSR online.
We wrote Mathematica scripts to obtain MLE of effective population size and migration rate for the MIM and SIM model using numerical methods. Given a generation time equal to 20 years, the MIM model parameters j and k were set to 5000 and 500 generations for each I and M phase according to geographical evidence of the recent cycles. In SIM, j is the additional parameter to be estimated.
To validate method accuracy, we carried out a series of simulations. We used the ms [89] to simulate sequences under the MIM and SIM models for 1000 replicates for each set of parameters (Supplementary Fig. 6, available as Supplementary Data at NSR online). When isolation time was set to 1 Myrs, the standard deviation for 1000 simulation results was no more than 0.1 Myrs. The estimation under the MIM model was also comparably accurate (Supplementary Fig. 7, available as Supplementary Data at NSR online).
Simulations of DNA-sequence evolution
We used ms [89] to simulate sequence evolution under the MIM scheme for 2000 replicates in each species. Parameter values used in simulations are listed in Table 2. Six statistics were obtained from the simulated sequences for each species: Dwithin (average divergence within region), Dbetween (average divergence between regions), Dclade (differentiation among the most recent common ancestors of each clade), Dmax (differences between two most divergent haplotypes), Ptotal (total number of SNPs) and FST among regions. The simulated distributions of the six statistics are comparable to values observed from data (Supplementary Figs 8–12, available as Supplementary Data at NSR online).
We also simulated 2000 replicates under the SIM evolution scheme using the parameters listed in Table 2 for calculating Dmax. We calculated the standard deviation of Dmax among genes in each relicate derived from the SIM or MIM model. The distributions of the 2000 standard deviations from the two models are depicted in the insets of Fig. 4a–c.
To test whether the MIM model fits observed data better than the SIM model, we obtained MLE of the two models for sequences simulated under the SIM model. We calculated the differences in the likelihood values (Diff = log-likelihood of the MIM model – log-likelihood of SIM model) for each of the 2000 repetitions. For R. apiculata, S. alba, Av. marina and Ae. corniculatum, the Diff value of the real data is larger than all the Diff values of the simulated sequences. Hence, the probability that the SIM model fits data better than the MIM model is less than 0.001. For C. tagal, the probability is 0.33. As discussed in the main text, the unusually low genetic diversity of C. tagal makes it powerless to compare models.
Estimating effective population-size change using whole-genome sequence data
To estimate past effective population size, we used the PSMC [51]. We used the whole-genome sequence data from six individuals (data deposited in NCBI, BioProject: PRJNA298659). Avicennia marina and S. alba individuals were from the Gulf of Thailand and the West Coast; R. apiculata samples were from Sanya and Wenchang. We mapped the re-sequencing data generated by the Illumina HiSeq 2000 platform to the corresponding draft genomes ([26] and He et al., unpublished data) using BWA [83]. The parameters of PSMC estimation were: -N25 -t15 -r5 -p “4+25*2+4+6”. Generation time was set to 20 years. The mutation rate for each species is given in Table 2.
We also produced simulated sequence data for PSMC analysis (see Fig. 4d and Supplementary Figs 13–15, available as Supplementary Data at NSR online). The simulated sequences were generated by msHOT [90] with the following parameters: mutation rate (μ) set as 0.5, 1.0, 2.0 × 10−9/site/year, migration rate (m) as 1 × 10−4, 5 × 10−4, 10 × 10−4 per generation, population size (N) as 100, 500, 1000 and 5000. Each simulated genome contained 500 loci and the length of each gene was set to 200 Kb. The recombination rate was set to 1 × 10−9/site/generation.
Supplementary Material
Acknowledgements
We thank David Jablonski, Roger Butlin, Richard Abbott, Trevor Price, Loren Rieseberg, Nick Barton, Patrik Nosil, James Mallet, Daven Presgraves, Dolph Schluter, Fangliang He, Chuck Canon and Xionglei He for insightful comments that improved the manuscript.
FUNDING
This study was supported by the National Natural Science Foundation of China (91731301 and 31600182); the National Key Research and Development Plan (2017FY100705); the 985 Project (33000–18831107) and the Fundamental Research Funds for the Central Universities (17lgpy99).
AUTHOR CONTRIBUTIONS
C.-I.W. and S.S. conceived the project and wrote the manuscript; Z.H., X.L. and M.Y. designed the experiments, analysed data and prepared the manuscript; Z.H. and C.-I.W. constructed demographic models; S.S., X.L., C.Z. and N.D. contributed to sample collections. X.L. and X.W. performed experiments; N.D. contributed to manuscript modifications.
ACCESSION CODES
Illumina reads are available at the Short Read Archive under the NCBI BioProject PRJNA303892 and PRJNA298659.
REFERENCES
- 1. Coyne JA, Orr HA. Speciation. Sunderland:Sinauer Associates, 2004. [Google Scholar]
- 2. Mayr E. Animal Species and Evolution. Cambridge:Belknap, 1963. [Google Scholar]
- 3. Fontaine MC, Pease JB, Steele Aet al.. Extensive introgression in a malaria vector species complex revealed by phylogenomics. Science 2015; 347: 1258524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Martin SH, Dasmahapatra KK, Nadeau NJet al.. Genome-wide evidence for speciation with gene flow in Heliconius butterflies. Genome Res 2013; 23: 1817–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Toews DP, Taylor SA, Vallender Ret al.. Plumage genes and little else distinguish the genomes of hybridizing warblers. Curr Biol 2016; 26: 2313–8. [DOI] [PubMed] [Google Scholar]
- 6. Wu CI. The genic view of the process of speciation. J Evol Biol 2001; 14: 851–65. [Google Scholar]
- 7. Sambatti JB, Strasburg JL, Ortiz-Barrientos Det al.. Reconciling extremely strong barriers with high levels of gene exchange in annual sunflowers. Evolution 2012; 66: 1459–73. [DOI] [PubMed] [Google Scholar]
- 8. Leroy T, Roux C, Villate Let al.. Extensive recent secondary contacts between four European white oak species. New Phytol 2017; 214: 865–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Schilthuizen M. Dualism and conflicts in understanding speciation. Bioessays 2000; 22: 1134–41. [DOI] [PubMed] [Google Scholar]
- 10. Fisher RA Sir. The Genetical Theory of Natural Selection. Oxford: The Clarendon Press, 1930. [Google Scholar]
- 11. Endler JA. Geographic Variation, Speciation and Clines. Princeton:Princeton University Press, 1977. [PubMed] [Google Scholar]
- 12. Feder JL, Egan SP, Nosil P. The genomics of speciation-with-gene-flow. Trends Genet 2012; 28: 342–50. [DOI] [PubMed] [Google Scholar]
- 13. Malinsky M, Challis RJ, Tyers AMet al.. Genomic islands of speciation separate cichlid ecomorphs in an East African crater lake. Science 2015; 350: 1493–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Poelstra JW, Vijay N, Bossu CMet al.. The genomic landscape underlying phenotypic integrity in the face of gene flow in crows. Science 2014; 344: 1410–4. [DOI] [PubMed] [Google Scholar]
- 15. Cruickshank TE, Hahn MW. Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow. Mol Ecol 2014; 23: 3133–57. [DOI] [PubMed] [Google Scholar]
- 16. Yang M, He Z, Shi Set al.. Can genomic data alone tell us whether speciation happened with gene flow? Mol Ecol 2017; 26: 2845–9. [DOI] [PubMed] [Google Scholar]
- 17. Miller KG, Kominz MA, Browning JVet al.. The Phanerozoic record of global sea-level change. Science 2005; 310: 1293–8. [DOI] [PubMed] [Google Scholar]
- 18. Ludt WB, Rocha LA. Shifting seas: the impacts of Pleistocene sea-level fluctuations on the evolution of tropical marine taxa. J Biogeogr 2015; 42: 25–38. [Google Scholar]
- 19. Fukami H, Budd AF, Paulay Get al.. Conventional taxonomy obscures deep divergence between Pacific and Atlantic corals. Nature 2004; 427: 832–5. [DOI] [PubMed] [Google Scholar]
- 20. Bellwood DR, Hughes TP. Regional-scale assembly rules and biodiversity of coral reefs. Science 2001; 292: 1532–5. [DOI] [PubMed] [Google Scholar]
- 21. Ricklefs RE, Schwarzbach AE, Renner SS. Rate of lineage origin explains the diversity anomaly in the world's mangrove vegetation. Am Nat 2006; 168: 805–10. [DOI] [PubMed] [Google Scholar]
- 22. Tittensor DP, Mora C, Jetz Wet al.. Global patterns and predictors of marine biodiversity across taxa. Nature 2010; 466: 1098–101. [DOI] [PubMed] [Google Scholar]
- 23. Hoeksema BW. Delineation of the Indo-Malayan Centre of Maximum Marine Biodiversity: The Coral Triangle. Dordrecht:Springer, 2007, 117–78. [Google Scholar]
- 24. Ellison AM, Farnsworth EJ, Merkt RE. Origins of mangrove ecosystems and the mangrove biodiversity anomaly. Glob Ecol Biogeogr 1999; 8: 95–115. [Google Scholar]
- 25. Plaziat J-C, Cavagnetto C, Koeniguer J-Cet al.. History and biogeography of the mangrove ecosystem, based on a critical reassessment of the paleontological record. Wetlands Ecol Manage 2001; 9: 161–80. [Google Scholar]
- 26. Xu S, He Z, Zhang Zet al.. The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole genome sequencing. Natl Sci Rev 2017; 4: 721–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Duke NC, Ball MC, Ellison JC. Factors influencing biodiversity and distributional gradients in mangroves. Global Ecol Biogeogr Lett 1998; 7: 27–47. [Google Scholar]
- 28. Tomlinson P. The Botany of Mangrove. London: Cambridge University Press, 1986. [Google Scholar]
- 29. Molinos JG, Halpern BS, Schoeman DSet al.. Climate velocity and the future global redistribution of marine biodiversity. Nat Clim Change 2015; 6: 83–8. [Google Scholar]
- 30. Stuart-Smith RD, Bates AE, Lefcheck JSet al.. Integrating abundance and functional traits reveals new global hotspots of fish diversity. Nature 2013; 501: 539–42. [DOI] [PubMed] [Google Scholar]
- 31. Nosil P, Schluter D. The genes underlying the process of speciation. Trends Ecol Evol 2011; 26: 160–7. [DOI] [PubMed] [Google Scholar]
- 32. Zhou R, Ling S, Zhao Wet al.. Population genetics in nonmodel organisms: II. Natural selection in marginal habitats revealed by deep sequencing on dual platforms. Mol Biol Evol 2011; 28: 2833–42. [DOI] [PubMed] [Google Scholar]
- 33. Duke NC. World Mangrove ID. Brisbane & Townsville: : MangroveWatch, 2013. [Google Scholar]
- 34. Polidoro BA, Carpenter KE, Collins Let al.. The loss of species: mangrove extinction risk and geographic areas of global concern. Plos One 2010; 5: e10095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Guo M, Zhou R, Huang Yet al.. Molecular confirmation of natural hybridization between Lumnitzera racemosa and Lumnitzera littorea. Aquat Bot 2011; 95: 59–64. [Google Scholar]
- 36. Huang L, Li X, Huang Yet al.. Molecular evidence for natural hybridization in the mangrove genus Avicennia. Pak J Bot 2014; 46: 1577–84. [Google Scholar]
- 37. Lo EYY. Testing hybridization hypotheses and evaluating the evolutionary potential of hybrids in mangrove plant species. J Evol Biol 2010; 23: 2249–61. [DOI] [PubMed] [Google Scholar]
- 38. Qiu S, Zhou R-C, Li Y-Qet al.. Molecular evidence for natural hybridization between Sonneratia alba and S.griffithii. J Syst Evol 2008; 46: 391–5. [Google Scholar]
- 39. Zhang R, Liu T, Wu Wet al.. Molecular evidence for natural hybridization in the mangrove fern genus Acrostichum. BMC Plant Biol 2013; 13: 74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Zhou R, Gong X, Boufford Det al.. Testing a hypothesis of unidirectional hybridization in plants: observations on Sonneratia, Bruguiera and Ligularia. BMC Evol Biol 2008; 8: 149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Tyagi PA, Singh VV. Pollen fertility and intraspecific and interspecific compatibility in mangroves of Fiji. Sex Plant Reprod 1998; 11: 60–3. [Google Scholar]
- 42. Wang R, Chen Z, Chen Eet al.. Two hybrids of the genus Sonneratia (Sonneratiaceae) from China. Guihaia 1998; 19: 199–204. [Google Scholar]
- 43. Duke NC, Lo E, Sun M. Global distribution and genetic discontinuities of mangroves—emerging patterns in the evolution of Rhizophora. Trees 2002; 16: 65–79. [Google Scholar]
- 44. Duke NC. Mangrove Floristics and Biogeography. Washington, DC: American Geophysical Union, 1992, 63–100. [Google Scholar]
- 45. Seehausen O, Butlin RK, Keller Iet al.. Genomics and the origin of species. Nat Rev Genet 2014; 15: 176–92. [DOI] [PubMed] [Google Scholar]
- 46. Osada N, Wu C-I. Inferring the mode of speciation from genomic data: a study of the great apes. Genetics 2005; 169: 259–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Yang Z. A likelihood ratio test of speciation with gene flow using genomic sequence data. Genome Biol Evol 2010; 2: 200–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Strasburg JL, Rieseberg LH. Interpreting the estimated timing of migration events between hybridizing species. Mol Ecol 2011; 20: 2353–66. [DOI] [PubMed] [Google Scholar]
- 49. Hartl DL, Clark AG. Principles of Population Genetics. Sunderland:Sinauer Associates, 2007. [Google Scholar]
- 50. Hewitt GM. Some genetic consequences of ice ages, and their role in divergence and speciation. Biol J Linn Soc 1996; 58: 247–76. [Google Scholar]
- 51. Li H, Durbin R. Inference of human population history from individual whole-genome sequences. Nature 2011; 475: 493–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Dobzhansky T. Genetics and the Origin of Species. New York:Columbia University Press, 1937. [Google Scholar]
- 53. Schluter D. Evidence for ecological speciation and its alternative. Science 2009; 323: 737–41. [DOI] [PubMed] [Google Scholar]
- 54. Schluter D, Conte GL. Genetics and ecological speciation. Proc Natl Acad Sci USA 2009; 106: 9955–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Hewitt GM. Quaternary phylogeography: the roots of hybrid zones. Genetica 2011; 139: 617–38. [DOI] [PubMed] [Google Scholar]
- 56. Harrison RG. Hybrid Zones: Windows on Evolutionary Process. Oxford: Oxford University Press, 1990. [Google Scholar]
- 57. Butlin RK. Population genomics and speciation. Genetica 2010; 138: 409–18. [DOI] [PubMed] [Google Scholar]
- 58. Haffer J. Speciation in Amazonian forest birds. Science 1969; 165: 131–7. [DOI] [PubMed] [Google Scholar]
- 59. Ehrendorfer F. Differentiation-hybridization cycles and polyploidy in Achillea. Cold Spring Harb Symp Quant Biol 1959; 24: 141–52. [DOI] [PubMed] [Google Scholar]
- 60. Rattenbury JA. Cyclic hybridization as a survival mechanism in the New Zealand forest flora. Evolution 1962; 16: 348–63. [Google Scholar]
- 61. Payseur BA, Rieseberg LH. A genomic perspective on hybridization and speciation. Mol Ecol 2016; 25: 2337–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Pinho C, Hey J. Divergence with gene flow: models and data. Annu Rev Ecol Evol Syst 2010; 41: 215–30. [Google Scholar]
- 63. Bank C, Bürger R, Hermisson J. The limits to parapatric speciation: Dobzhansky–Muller incompatibilities in a continent–island model. Genetics 2012; 191: 845–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Nosil P, Flaxman SM. Conditions for mutation-order speciation. Proc R Soc B 2011; 278: 399–407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Abbott R, Albach D, Ansell Set al.. Hybridization and speciation. J Evol Biol 2013; 26: 229–46. [DOI] [PubMed] [Google Scholar]
- 66. Mallet J. Hybrid speciation. Nature 2007; 446: 279–83. [DOI] [PubMed] [Google Scholar]
- 67. Rieseberg LH, Willis JH. Plant speciation. Science 2007; 317: 910–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Seehausen O. Hybridization and adaptive radiation. Trends Ecol Evol 2004; 19: 198–207. [DOI] [PubMed] [Google Scholar]
- 69. Cornille A, Giraud T, Smulders MJMet al.. The domestication and evolutionary ecology of apples. Trends Genet 2014; 30: 57–65. [DOI] [PubMed] [Google Scholar]
- 70. He Z, Zhai W, Wen Het al.. Two evolutionary histories in the genome of rice: the roles of domestication genes. PLos Genet 2011; 7: e1002100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Larson G, Burger J. A population genetics view of animal domestication. Trends Genet 2013; 29: 197–205. [DOI] [PubMed] [Google Scholar]
- 72. Barber PH, Bellwood DR. Biodiversity hotspots: evolutionary origins of biodiversity in wrasses (Halichoeres: Labridae) in the Indo-Pacific and new world tropics. Mol Phylogen Evol 2005; 35: 235–53. [DOI] [PubMed] [Google Scholar]
- 73. Hall R. Southeast Asia's changing palaeogeography. Blumea 2009; 54: 148–61. [Google Scholar]
- 74. Renema W, Bellwood DR, Braga JCet al.. Hopping hotspots: global shifts in marine biodiversity. Science 2008; 321: 654–7. [DOI] [PubMed] [Google Scholar]
- 75. Comes HP, Tribsch A, Bittkau C. Plant speciation in continental island floras as exemplified by Nigella in the Aegean Archipelago. Philos Trans Royal Soc B 2008; 363: 3083–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Verheyen E, Salzburger W, Snoeks Jet al.. Origin of the superflock of cichlid fishes from Lake Victoria, East Africa. Science 2003; 300: 325–9. [DOI] [PubMed] [Google Scholar]
- 77. Richardson JE, Pennington RT, Pennington TDet al.. Rapid diversification of a species-rich genus of neotropical rain forest trees. Science 2001; 293: 2242–5. [DOI] [PubMed] [Google Scholar]
- 78. Weir JT. Divergent timing and patterns of species accumulation in lowland and highland neotropical birds. Evolution 2006; 60: 842–55. [PubMed] [Google Scholar]
- 79. Bowen BW, Rocha LA, Toonen RJet al.. The origins of tropical marine biodiversity. Trends Ecol Evol 2013; 28: 359–66. [DOI] [PubMed] [Google Scholar]
- 80. Roberts CM, McClean CJ, Veron JEet al.. Marine biodiversity hotspots and conservation priorities for tropical reefs. Science 2002; 295: 1280–4. [DOI] [PubMed] [Google Scholar]
- 81. Tsai CC, Li S-J, Su Y-Yet al.. Molecular phylogeny and evidence for natural hybridization and historical introgression between Ceriops species (Rhizophoraceae). Biochem Syst Ecol 2012; 43: 178–91. [Google Scholar]
- 82. Wang W, Chen Q. Salt-Tolerant Plant Resources from Coastal Areas of South China. Xiamen: Xiamen University Press, 2013. [Google Scholar]
- 83. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009; 25: 1754–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. McKenna A, Hanna M, Banks Eet al.. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010; 20: 1297–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85. Li H, Ruan J, Durbin R. Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res 2008; 18: 1851–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86. Bilmes JA. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models. In: Proceedings of International Computer Science Institute. International Computer Science Institute, 1998, 126. [Google Scholar]
- 87. Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J Roy Stat Soc Ser B (Stat Method) 1977; 39: 1–38. [Google Scholar]
- 88. Nei M, Li W-H. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc Natl Acad Sci USA 1979; 76: 5269–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89. Hudson RR. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics 2002; 18: 337–8. [DOI] [PubMed] [Google Scholar]
- 90. Hellenthal G, Stephens M. msHOT: modifying Hudson's ms simulator to incorporate crossover and gene conversion hotspots. Bioinformatics 2007; 23: 520–1. [DOI] [PubMed] [Google Scholar]
- 91. Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 2003; 52: 696–704. [DOI] [PubMed] [Google Scholar]
- 92. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 2007; 24: 1586–91. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.