Abstract
Despite a long-standing interest since Darwin’s time, the genomic implications of obligate self‐fertilization remain elusive. Complete cleistogamy—the obligate production of closed, self‐pollinating flowers—represents an extreme reproductive strategy. Here, we present the genomic profiles and evolutionary history of two lineages of the mycoheterotrophic orchid Gastrodia, both of which independently acquired complete cleistogamy, based on detailed sampling and a combination of simple sequence repeat (SSR), multiplexed ISSR genotyping by sequencing (MIG-seq) and RNA‐seq data. Our analysis reveals clear species delimitation, with no evidence of introgression between the completely cleistogamous species and their co‐occurring allogamous sisters. Intriguingly, all analyses indicate that both the completely cleistogamous Gastrodia species and their allogamous sisters exhibit genetic profiles typical of self‐pollinating plants. This pattern suggests that their ancestors, probably bearing allogamous flowers, had already evolved mechanisms to mitigate the deleterious effects of selfing, potentially facilitating the emergence of complete cleistogamy through benefits such as reproductive assurance, enhanced colonization ability and species reinforcement. Meanwhile, further analyses suggest that complete cleistogamy evolved very recently (possibly within the last 1000–2000 years) in these two Gastrodia lineages. Combined with the scant evidence of complete cleistogamy outside Gastrodia, our findings imply a limited and ephemeral role for complete cleistogamy in plant speciation.
Keywords: complete cleistogamy, deleterious mutations, genomic selfing syndrome, speciation, mycoheterotrophy
1. Introduction
Charles Darwin once remarked, ‘It is hardly an exaggeration to say that Nature tells us, in the most emphatic manner, that she abhors perpetual self-fertilisation’ [1, p. 359]. Yet, self-fertilization (selfing) has evolved repeatedly across the angiosperms, often providing crucial reproductive assurance in environments where pollinators or conspecific mates are scarce [2,3]. Despite these apparent benefits, prolonged reliance on selfing can exacerbate genetic drift, promote the accumulation of deleterious mutations and reduce the effective population size (Ne) [4,5]. This duality, where selfing emerges frequently yet carries inherent genetic risks, remains a central enigma in plant evolutionary biology [3,6].
Many studies on the evolution of self-fertilization highlight its immediate advantages: selfers can reproduce even in the absence of pollinators or within sparsely distributed populations [3,7–9]. Meanwhile, transitions to selfing are often accompanied by a genomic ‘selfing syndrome’, characterized by reduced heterozygosity and elevated burdens of deleterious alleles [6,10–13]. Notably, predominantly selfing species such as Arabidopsis thaliana and Capsella rubella still permit occasional outcrossing, which introduces heterozygosity and may buffer against genomic decay [14–17]. Consequently, it remains unclear how lineages that are entirely isolated from outcrossing (i.e. obligate selfers) surmount these genetic constraints and over what timescale obligate selfing can persist as an evolutionarily stable strategy [9].
Although autonomous selfing probably evolved primarily to ensure reproductive assurance, it may also function as an effective barrier to heterospecific mating, thereby enhancing reproductive isolation [3,17–19]. While a single reproductive barrier is often insufficient to prevent interspecific hybridization [20,21], complete cleistogamy—the obligate formation of closed, self-pollinating flowers—constitutes an extreme form of reproductive isolation, probably eliminating gene flow with closely related taxa [22,23]. Therefore, complete cleistogamy may be selected not only in response to colonization challenges but also as a reinforcement mechanism during speciation, provided that the deleterious effects of inbreeding are mitigated. However, owing to the extreme rarity of complete cleistogamy [9,24], such possibilities have remained largely unexplored.
The mycoheterotrophic orchid genus Gastrodia offers an exceptional system for examining the dynamics of obligate selfing and complete cleistogamy, as it includes several species that produce exclusively cleistogamous flowers, with no apparent potential for outcrossing [25–29]. While cleistogamy has independently evolved across diverse lineages of angiosperms, most cleistogamous species also produce open, chasmogamous flowers [9,24]. To date, Gastrodia appears to be the only genus comprising multiple species that exhibit complete cleistogamy [22]. Unlike commonly studied models of selfing, such as Mimulus nasutus and C. rubella, which retain some capacity for outcrossing [12,19], the completely cleistogamous species of Gastrodia (e.g. Gastrodia takeshimensis and Gastrodia kuroshimensis) probably rely entirely on obligate self-fertilization [22,23]. Intriguingly, these lineages are typically restricted to islands where closely related chasmogamous and allogamous Gastrodia species are also present. Thus, beyond the classical rationales for selfing, such as providing reproductive assurance and facilitating colonization of unoccupied habitats [7–9], this distributional pattern suggests that obligate selfing may function as a mechanism of reinforcement, effectively preventing interspecific gene flow and maintaining species integrity [30].
To elucidate the origins and timing of obligate selfing and to determine whether these species exhibit any genomic signals of introgression, we employed an integrative genomic framework. First, we used hypervariable simple sequence repeat (SSR) markers, which evolve rapidly through de novo mutations and effectively capture recent demographic events [31,32]. Second, we applied multiplexed ISSR genotyping by sequencing (MIG-seq), a reduced-representation genomic technique well-suited for detecting single-nucleotide polymorphisms (SNPs) [33], providing robust insights into historical genome-wide variation even in non-model organisms [34,35]. Finally, we analysed low-copy transcript data from RNA-seq to characterize heterozygous mutations and the accumulation of deleterious variants [36], enabling evaluation of typical features associated with the selfing syndrome in Gastrodia. By integrating these complementary approaches, we characterized genomic traits linked to obligate selfing, focusing on two probably independently evolved cleistogamous species, G. kuroshimensis and G. takeshimensis, in comparison with their respective allogamous relatives, G. foetida and G. fontinalis [23].
Our study addresses three key questions: (i) how recently did G. takeshimensis and G. kuroshimensis diverge from their allogamous relatives, and can we estimate the timing of the evolution of complete cleistogamy?; (ii) is there evidence of introgression between cleistogamous and chasmogamous species in sympatric regions?; and (iii) under what conditions does Gastrodia probably favour complete cleistogamy? For instance, are chasmogamous congeners already partially inbred, thereby predisposing them to a shift toward obligate selfing? Through genomic analyses, this study offers novel insights into the evolutionary trajectory of complete cleistogamy, shedding light on plant speciation, mating system transitions and the long-term sustainability of obligate selfing.
2. Material and methods
(a). Study species
Gastrodia species are fully mycoheterotrophic orchids that depend on saprotrophic fungi [37,38]. Their flowers, like those of many orchids, are zygomorphic, comprising an outer whorl of three sepals and an inner whorl of three petals, including a specialized labellum. In all Gastrodia species, the sepals and petals other than the labellum are partially fused, forming a five-lobed perianth tube. In the chasmogamous species G. foetida and G. fontinalis, the dehiscence zone at the free part of the sepals consistently opens, whereas, in the cleistogamous sister species G. takeshimensis and G. kuroshimensis, it remains fused throughout the reproductive period (figure 1a). Consequently, the mature flowers of the cleistogamous species retain a more juvenile, bud-like appearance [23]. In G. foetida and G. fontinalis, a pronounced rostellum prevents autonomous self-pollination, requiring pollination by drosophilid flies for fruit set [40]. By contrast, the absence of a well-developed rostellum in G. takeshimensis and G. kuroshimensis enables autonomous self-pollination [26,28].
Figure 1.
Four Gastrodia species were investigated, and their genomic profiles were based on SSR genotype data. (a) Flower morphology (front and lateral views). Scale bar: 5 mm. (b) Map showing the sampling localities. (c) Unrooted neighbour-joining phylogenetic tree based on DA distances [39] calculated from SSR genotypes. Two well-supported clades were recovered: one comprising G. foetida and G. takeshimensis and the other comprising G. fontinalis and G. kuroshimensis, each with 99% bootstrap support. Numbers in parentheses denote the number of individuals sharing an identical multilocus genotype. Nodes with bootstrap support below 50% are not shown.
Although G. foetida was long considered synonymous with Gastrodia nipponica, a recent taxonomic study has shown that G. foetida is morphologically and phylogenetically distinct from G. nipponica [41]. These species also exhibit geographical segregation, with G. nipponica occurring north of the Kyushu mainisland and G. foetida inhabiting the Ryukyu Islands and possibly Taiwan [41]. While our previous work on the development of completely cleistogamous flowers used the name G. nipponica as the sister species of G. takeshimensis [23], G. nipponica sensu [23] is now recognized as G. foetida.
The distributions of G. takeshimensis and G. kuroshimensis, both endemic to the northern Ryukyu, generally overlap with those of their allogamous sister species [26,28,42,43]. Although the cleistogamous taxa and their allogamous sisters occasionally occur in close proximity, no individuals exhibiting intermediate morphological traits indicative of hybridization have been observed even in sympatric sites [26,28]. Given that these four Gastrodia species can reproduce clonally through tuber production, with clonal propagation documented up to approximately 60 cm [44,45], individuals located at least 1 m apart were collected to minimize the possibility of sampling multiple clonal ramets. Consequently, 91 individuals of G. foetida, 119 of G. takeshimensis, 81 of G. fontinalis and 41 of G. kuroshimensis were sampled, covering nearly all known populations (and all existing populations in the cleistogamous species) (figure 1b; electronic supplementary material, table S1). Previous morphological and phylogenetic evidence strongly supports the sister-species relationships between G. fontinalis–G. kuroshimensis and G. foetida–G. takeshimensis [23,26,28,41,46]. To further substantiate these relationships, we conducted an expanded MIG-seq-based phylogenetic analysis incorporating additional morphologically similar species (see the electronic supplementary material, note S1).
(b). Simple sequence repeat marker analysis
Genomic DNA was extracted from fresh or silica‐dried samples using the DNeasy Plant Mini Kit (Qiagen, Venlo, Netherlands). A total of 18 SSR markers previously developed (Gtake009, Gtake011, Gtake012, Gtake015, Gtake020, Gtake021, Gtake022, Gtake029, Gtake032, Gtake037, Gtake042, Gtake043, Gfont027, Gfont028, Gfont034, Gfont035, Gfont043 and Gfont048; [43,46]) were used to genotype the four Gastrodia species (electronic supplementary material, table S1). Polymerase chain reaction (PCR) amplification was performed in 5.0 μl reaction volumes using the QIAGEN Multiplex PCR Kit. Each reaction comprised 10 ng of genomic DNA, 2.5 μl of Multiplex PCR Master Mix and 0.2 μM each of a fluorescently labelled forward primer and a reverse primer. The amplification protocol included an initial denaturation at 95°C for 15 min, followed by 33 cycles at 94°C for 30 s, 57°C for 1.5 min, and 72°C for 1 min, with a final extension at 60°C for 30 min. An ABI PRISM 3130 Genetic Analyzer (Applied Biosystems, MA, USA) and GeneMarker software (Softgenetics, PA, USA) were used to determine product sizes.
Only samples that successfully amplified all 18 loci (68 G. foetida, 82 G. takeshimensis, 54 G. fontinalis and 31 G. kuroshimensis individuals) were used for subsequent analyses. Genetic variation within each population was assessed by calculating the average number of alleles per locus (A), allelic richness (RS), observed heterozygosity (HO) and expected heterozygosity (HE) using GenAlEx v. 6.5 [47], with allelic richness computed using FSTAT v. 2.9.3 [48]. Genetic relationships among multilocus genotypes were assessed by constructing an unrooted neighbour-joining tree based on Nei’s genetic distance DA [39], using Populations v. 1.2.31 [49].
To estimate the number of generations over which fixed alleles at SSR markers could be maintained despite their high mutation rate, we used the mutation rate directly estimated in A. thaliana and considered three scenarios for effective population sizes. In an obligate self-fertilizing lineage, the probability (Pt) that no new alleles arise through mutation at all l SSR loci—with a haploid mutation rate μ—after t generations is given by:
Each line is expected to produce new alleles with a 2µ probability per locus per generation, but only half of these new alleles will be fixed in the subsequent generations of selfed propagation. Furthermore, within a population with an effective population size Ne, the probability Pi that only an identical genotype is observed across all lines can be expressed as:
The mutation rate of SSR markers was estimated by genotyping A. thaliana mutation accumulation lines targeting dinucleotide repeat loci [50]. Because the mutation rate varies with dinucleotide motif types and the number of repeats (ranging from 2.0 × 10⁻³ to 5.0 × 10⁻⁵) [50], we used the lowest rate (4.96 × 10⁻⁵) to conservatively estimate the number of generations during which the same genotype is maintained. In our study, the number of loci (l) considered is six (those with dinucleotide repeats among the SSR markers). Based on current population sizes (approx. 50 individuals for G. kuroshimensis, 250 for G. fontinalis and G. foetida and 500−1000 for G. takeshimensis), Nₑ was evaluated for three scenarios (50, 100 and 500) to estimate the probability that a new allele will not become fixed in the population.
(c). Multiplexed ISSR genotyping by sequencing analysis
Genomic DNA was extracted from silica‐dried samples using the cetyltrimethylammonium bromide method [51]. The MIG‐seq library was prepared following Suyama et al. [33]. Paired‐end sequencing (2 × 80 bases) was performed on an Illumina MiSeq sequencer using a MiSeq Reagent Kit v. 3 (150 cycles) (Illumina, San Diego, CA, USA). The raw sequencing data have been deposited in the DDBJ Sequence Read Archive under accession number PRJDB18852. As with the SSR data, only samples yielding robust amplification and reliable SNP calls were used; consequently, 78 individuals of G. foetida, 107 of G. takeshimensis, 60 of G. fontinalis and 34 of G. kuroshimensis were used for subsequent analyses (electronic supplementary material, table S1).
The primer sequences and low-quality reads were removed using Trimmomatic v. 0.39 [52]. The quality‐controlled reads were aligned to the genome of Gastrodia menghaiensis, which is the most closely related species with an available genome [53], using BWA v. 0.7.17 [54] with default parameters. Although the G. menghaiensis genome was used as a reference for mapping MIG‐seq reads from our focal species, G. menghaiensis itself was not included in the analysis owing to the unavailability of MIG‐seq data.
SNP calling was performed using the ‘gstacks’ and ‘populations’ modules of the Stacks pipeline v. 2.65 [55]. Stacks use the alpha value in gstacks as a measure of evidence in genotyping instead of read depth [56]. In this study, both alpha values—var‐alpha and gt‐alpha—were set to 0.001 (reduced from their default values of 0.01 and 0.05, respectively). Using the Populations program, SNPs with high heterozygosity (HO ≥ 0.6) were removed, because excess heterozygosity may result from artefactual loci arising from multiple paralogous genomic regions [57]. Additionally, SNPs with a minor allele detected in fewer than three samples throughout the entire dataset were excluded.
SNPs were further selected using minimum proportion thresholds of 0.1 and 0.7 for samples retaining each SNP (the Populations parameter, R, was set to 0.1 and 0.7, respectively). Based on SNP data with a threshold of R = 0.1, the maximum likelihood phylogeny was reconstructed with RAxML v. 8.2.10 [58], employing a GTR substitution model with Lewis’ ascertainment bias correction and 1000 bootstrap iterations. A neighbour-net network was generated using SplitsTree v. 4.14 [59], based on an uncorrected p-distance matrix while excluding ambiguous sites. In addition to SNP detection across all four species, we conducted SNP detection hierarchically within sister species [60] to achieve higher resolution using lineage-specific SNPs (electronic supplementary material, figure S2). With a threshold of R = 0.7, subsequent analyses were conducted separately for three datasets: all four species and the sister pairs (G. foetida and G. takeshimensis and G. fontinalis and G. kuroshimensis). For this ‘SNP dataset for species pairwise comparisons’, only the first SNP from each locus was considered to avoid linked SNPs. The population structure for both pairs was analysed using Structure v. 2.3.4 [61] under the admixed and correlated allele frequency model to examine the extent of admixture between chasmogamous and cleistogamous species. The number of clusters (K) was set from 1 to 10 based on 30 independent runs for each K and 100 000 Markov chain Monte Carlo (MCMC) steps after 100 000 burn-in steps. Log-likelihoods for each cluster were calculated, and optimal K values were identified using the Delta K method [62] via Structure Harvester v. 0.7 [63], with outcomes visualized using Clumpak v. 1.1 [64]. It should be noted that, because our analyses were performed separately by sister species pairs, we could not directly assess potential introgression between the cleistogamous lineages. Consequently, one might be concerned that such introgression could account for the emergence of cleistogamy in G. kuroshimensis and G. takeshimensis. However, this possibility was previously ruled out using the ABBA-BABA test with D statistics based on RNA-seq data [23]. Therefore, genetic convergence, where independent mutations result in similar phenotypes, is the most likely explanation for the independent evolution of cleistogamy, making the separate analysis of species pairs appropriate.
Population statistics, including the number of private alleles, observed heterozygosity (HO), expected heterozygosity (HE), the population’s average inbreeding coefficient (FIS) and nucleotide diversity (π), were calculated using the SNP dataset for species pairwise comparisons with the Populations program in Stacks. To visualize shared alleles between sister species, histograms of ancestral allele frequencies were constructed, considering the allele predominant in the chasmogamous species as ancestral.
Finally, we estimated the demographic history of the sister pairs (G. foetida–G. takeshimensis and G. fontinalis–G. kuroshimensis), including the number of generations since divergence, using the approximate Bayesian computation (ABC) algorithm implemented in DIYABC v. 2.1.0 [65]. Given that (i) complete cleistogamy leads to strong reproductive isolation, and (ii) our molecular analyses detected no signs of introgression, we adopted a simple divergence model that assumes no hybridization occurred after speciation.
(d). Analysis of heterozygous single-nucleotide variants in low-copy messenger RNA sequences
Gastrodia foetida and G. takeshimensis individuals (n = 9 each) were collected on Yakushima Island, Kagoshima Prefecture, Japan, in early to mid-April 2017. Concurrently, individuals of G. fontinalis and G. kuroshimensis (n = 9 each) were obtained from Kuroshima Island, Kagoshima Prefecture, Japan, during mid- to late-April 2017 (electronic supplementary material, table S1). Owing to the higher cost per RNA‐seq library and the stringent preservation requirements for sampling on remote islands, fewer samples were analysed for RNA‐seq than for SSR or MIG‐seq. Samples were minced into fragments of less than 3 mm in length and immediately submerged in RNAlater stabilization solution (Sigma-Aldrich, St Louis, MO, USA). They were stored in a cool box during transport to the laboratory and subsequently preserved at −80°C until RNA extraction.
Total RNA was prepared following Suetsugu et al. [41]. For library preparation, only samples with RNA integrity numbers of 8.0 or higher were selected. Libraries were constructed using the MGI Easy RNA Directional Library Prep Set (MGI, Shenzhen, China), and paired-end sequencing (2 × 150 bases) was performed on a DNBSEQ-G400RS instrument (MGI). The RNA‐seq data accession numbers are PRJDB10993 for G. foetida, PRJDB10965 for G. takeshimensis, PRJDB10966 for G. fontinalis and PRJDB10992 for G. kuroshimensis.
To accurately detect heterozygous loci within individuals, each RNA‐seq sample was assembled and analysed separately. De novo transcriptome assembly was performed using Trinity v. 2.11.0 [66] after pre-processing reads with fastp v. 0.20.1 [67]. Open reading frames were predicted using TransDecoder v. 5.5.0 (https://github.com/TransDecoder/TransDecoder). To focus on primary transcripts of well-conserved genes, we extracted splicing isoforms with the longest coding sequences using CDSKIT v. 0.9.2 (https://github.com/kfuku52/cdskit) and selected low-copy genes via BUSCO v. 5.3.2 [68] with the embryophyta_odb10 dataset. Primary transcripts identified as single-copy, duplicated or fragmented were included in downstream analyses.
To identify heterozygous single-nucleotide variants (SNVs), RNA‐seq reads were mapped to the low-copy reference transcript set using BWA v. 0.7.13 [69], and SNVs were called using SAMtools v. 1.3 [70]. Heterozygous SNVs, indicative of genetic diversity, were quantified per kilobase (kb) of total transcript length. The impact of non-synonymous SNVs on protein function was assessed using PROVEAN v. 1.1.5 with the NCBI NR database (downloaded 13 October 2013), as described previously [36,71]. Typically, a PROVEAN score of −2.5 or lower predicts deleterious variation, whereas scores above −2.5 are considered neutral. In this study, non-synonymous variations with an absolute PROVEAN score greater than 2.5 were classified as deleterious, given the difficulty in differentiating derived from ancestral variants at heterozygous loci. We then calculated the ratio of deleterious amino acid variations among heterozygous SNVs and identified transcripts harbouring non-synonymous SNVs. These SNV-based metrics were statistically compared between sister species pairs using a Student’s t‐test.
To compare heterozygous SNV metrics between Gastrodia and other self-pollinating lineages, we retrieved angiosperm RNA‐seq data from GenBank using AMALGKIT v. 0.6.5.6 (https://github.com/kfuku52/amalgkit) and applied the same analyses to individual samples. These database-derived samples included model lineages for studying genomic selfing syndrome, such as Capsella [10,11,17,72,73]. Because mycoheterotrophy may represent an adaptation to low-light environments that hinders pollinator attraction (potentially increasing selfing rates) [22,74,75], we included other mycoheterotrophic plants as controls. To avoid biases related to sequencing data quality, only paired-end RNA‐seq datasets with transcriptome assemblies achieving BUSCO completeness scores above 50% were used. After screening, the methodologies described above were applied to obtain SNV-based metrics across 301 RNA-seq samples from 37 angiosperm species, including 33 samples from four Gastrodia species sequenced in this study (electronic supplementary material, table S2).
(e). Phylogenetic generalized least squares
For each species, one sample from the low-copy transcript set with the highest BUSCO completeness was selected for species tree inference. This inference was performed using the maximum-likelihood method in IQ-Tree v. 2.2.5 [76] with the LG+R model. The concatenated multiple alignments of amino acid sequences, encompassing a total of 790 412 sites, were generated using MAFFT v. 7.508 [77] and trimmed with trimAl v. 1.4.1 [78], treating duplicated and missing genes as missing data [79]. Divergence times were estimated using MCMCtree in the PAML package v. 4.10.7 [80]. Time constraints were prepared using data from timetree.org and processed with NWKIT v. 0.14.2 using the ‘--timetree ci’ option (https://github.com/kfuku52/nwkit). We conducted a phylogenetic generalized least squares (PGLS) analysis using Rphylopars v. 0.3.9 [81] to detect associations between SNV-based metrics and the selfing syndrome across species. The selfing syndrome was encoded as a categorical variable and SNV‐based metrics as numerical variables. For each species, the mean value of these metrics across RNA‐seq samples was calculated and used as input to the phylopars.lm function. The evolutionary dynamics of these metrics were modelled using a Brownian motion process (see the electronic supplementary material, Dataset for detailed code).
3. Results
(a). Uniform simple sequence repeat genotypes indicate recent transitions to complete cleistogamy
To detect de novo mutations over short timescales, we used SSR marker analysis. Despite the high mutation rate of SSR markers, which would typically allow the accumulation of new alleles even in the absence of outcrossing, our analysis of 18 loci revealed that the cleistogamous species G. takeshimensis and G. kuroshimensis exhibited identical multilocus genotypes across individuals from different populations (electronic supplementary material, table S3–S4). Our estimates indicate that the number of generations over which this identical multilocus genotype can be maintained is very limited. For an effective population size (Nₑ) of 500, the probability (Pᵢ) that no new mutation becomes fixed falls below 0.05 by the 21st generation; for Nₑ = 100, Pᵢ drops below 0.05 by the 101st generation; and for Nₑ = 50, by the 202nd generation (electronic supplementary material, figure S1). These findings indicate that the extant populations of the cleistogamous species were established very recently.
By contrast, the closely related chasmogamous species G. foetida and G. fontinalis exhibited some allele variation in SSR markers despite overall low diversity (electronic supplementary material, tables S3 and S4). Phylogenetic analysis based on SSR genotypes revealed that G. takeshimensis forms a clade within a well-supported monophyletic group largely consisting of G. foetida, whereas G. kuroshimensis showed a noticeable genetic distance from G. fontinalis (figure 1c). Allele frequency comparisons indicated that all SSR alleles in G. takeshimensis were shared with G. foetida, while in G. kuroshimensis, 15 of the 18 loci were fixed for unique alleles not shared with G. fontinalis.
(b). Multiplexed ISSR genotyping by sequencing analysis confirms very recent divergence of a completely cleistogamous species
Despite the generally low substitution rate of SNPs, genome-wide SNP data provide a robust dataset for estimating historical genomic changes compared with SSR marker analysis. To achieve more accurate estimates of historical genomic shifts, we employed MIG‐seq analysis. MIG‐seq, a genome-wide genotyping technique that targets inter-simple sequence repeat (ISSR) regions, enables efficient SNP detection with relatively low sequencing effort, even in species with large genome sizes [33,34].
A total of 48 684 524 raw reads, averaging 174 497 ± 2026 reads per sample (mean ± s.d.), were obtained from MIG-seq, and 43 122 384 reads (154 561 ± 1793 reads per sample) were used for further analysis. By creating MIG-seq loci using gstacks, the mean effective per-sample coverage was 31.4× ± 13.4× (mean ± s.d.). After reference-based SNP detection and filtering for four species, 5084 SNPs from 279 samples were selected with R = 0.1, and 372 unlinked SNPs from 279 samples were selected with R = 0.7. Hierarchical SNP detection in sister species revealed 375 SNPs from 94 samples for G. foetida and G. takeshimensis and 247 SNPs from 185 samples for G. fontinalis and G. kuroshimensis.
The maximum-likelihood phylogenetic analysis, including morphologically similar Gastrodia species, confirmed that G. kuroshimensis is most closely related to G. fontinalis, whereas G. takeshimensis is most closely related to G. foetida, though their separation is incomplete (electronic supplementary material, figure S2). This broader analysis supports focusing on these four species to investigate the origins of complete cleistogamy.
Both phylogenetic and Structure analyses, whether examining these four species alone or with other morphologically similar taxa, indicate minimal divergence between G. foetida and G. takeshimensis (electronic supplementary material, figures S2–S4). Given this, hierarchical SNP detection within sister species [60] is preferable for achieving higher resolution through lineage-specific SNPs within each of the two highly differentiated lineages, providing more robust results. Consequently, combined evidence from maximum-likelihood and neighbour-net analyses, along with Structure analyses of sister pairs, identified G. takeshimensis and G. kuroshimensis as distinct phylogenetic groups, clearly separated from their chasmogamous relatives (figure 2a,b; electronic supplementary material, figure S5), with no sign of introgression.
Figure 2.
Genomic profiles of the four Gastrodia species based on MIG-seq data. (a) Neighbour-net network for sister species pairs (G. foetida–G. takeshimensis and G. fontinalis–G. kuroshimensis), based on uncorrected p distances. Branch lengths represent the average number of substitutions per site. (b) Population structure analysis, with taxonomic divisions marked by thick black vertical lines and population divisions by thin black vertical lines. (c) Allele frequency based on SNP genotypes, if alleles predominant in chasmogamous species represent the ancestral state. Sampling locations, abbreviated in (a) and (b), are shown in figure 1b. See the electronic supplementary material, figures S3 and S4, for the phylogenetic tree and population structure encompassing all four species.
Analysis of population statistics revealed consistently low levels of genetic diversity across all four Gastrodia species (electronic supplementary material, figures S5–S8). Notably, the genetic diversity metrics for G. takeshimensis were consistently lower than those for G. foetida, while G. kuroshimensis exhibited higher diversity than G. fontinalis. Furthermore, G. foetida populations in the southern Ryukyu Islands (OTM and OMT) showed relatively high genetic diversity indices, whereas populations from other regions displayed low diversity similar to that of G. takeshimensis. Allele frequency comparisons revealed that most SNP sites in G. takeshimensis were dominated by the same alleles as in G. foetida, with only four SNPs fixed for opposite alleles, whereas 59% of SNP sites (223 out of 375) in G. kuroshimensis were fixed for unique alleles (figure 2c), supporting greater genetic divergence between G. kuroshimensis and G. fontinalis.
Demographic history analysis using ABC methods estimated the divergence time between G. foetida and G. takeshimensis to be approximately 35 generations ago, with a 95% credible interval of 15 to 100 generations (electronic supplementary material, table S9). However, the divergence time estimation for G. kuroshimensis and G. fontinalis was inconclusive because the principal component analysis (PCA) of simulated data did not overlap with the observed data point, indicating model non-convergence (electronic supplementary material, figure S6 and table S10).
(c). RNA-seq analysis reveals a genomic selfing syndrome in both chasmogamous and completely cleistogamous Gastrodia species
To define the signature of a genomic selfing syndrome, we analysed heterozygous SNV metrics from publicly available RNA‐seq data for representative self‐pollinating lineages (figure 3a). Because the genomic selfing syndrome is expected to affect both the frequency and fitness effects of SNVs, we focused specifically on SNVs with potentially significant impacts, including non-synonymous and computationally predicted deleterious variants. PGLS analysis revealed significantly lower numbers of SNVs per unit transcript length in self-pollinating species (p < 0.001; electronic supplementary material, table S8), along with a higher proportion of non-synonymous SNVs (p < 0.001) and deleterious SNVs (p < 0.001) across the angiosperms analysed (figure 3b; electronic supplementary material, table S11). Since de novo mutations tend to be more deleterious than variants maintained in outcrossing populations [14,15], the higher rates of deleterious mutations in selfing lineages probably result from the rapid generation of homozygous alleles via selfing. Although selfing lineages had significantly lower numbers of deleterious SNVs per kb (p < 0.001), this pattern probably reflects an overall reduction in heterozygous variation.
Figure 3.
Heterozygous SNV profiles in Gastrodia and other angiosperm lineages. (a) Phylogenetic relationships among analysed species. All branches in the maximum-likelihood tree are supported by 100% bootstrap values. (b) Genetic diversity and predicted deleterious variation in Gastrodia and other autogamous and allogamous angiosperms, inferred from BUSCO transcripts based on RNA-seq data. Variant frequencies are presented either normalized per kilobase of transcript or as percentages of total SNVs. Each point represents an individual RNA-seq sample. In the box plots, the centre line denotes the median, box boundaries correspond to the upper and lower quartiles, and whiskers indicate 1.5 times the interquartile range.
Interestingly, the genetic characteristics of Gastrodia deviated from the typical distinctions between autogamous and allogamous species. Except for SNVs per kb and the proportion of transcripts with SNVs, where G. takeshimensis showed significantly lower values than its chasmogamous sister species (p < 0.05 for both metrics), the metrics in all other analysed categories were similar, with no significant differences between the cleistogamous species and their chasmogamous relatives (figure 3; electronic supplementary material, table S12). This suggests that the genomic selfing syndrome may have been established before the emergence of complete cleistogamy (figure 4). Moreover, the signature of the genomic selfing syndrome was consistently detected across species with various nutritional modes, including non-photosynthetic species outside the genus Gastrodia (electronic supplementary material, figures S7 and S8), indicating that heterotrophy itself has minimal impact on these genomic profiles.
Figure 4.
Conceptual diagram illustrating the evolution of complete cleistogamy. (a) General scenario: in most angiosperms, plants producing both cleistogamous and chasmogamous flowers may evolve from ancestrally chasmogamous forms; however, the evolution of complete cleistogamy is probably constrained genetically. (b) Gastrodia scenario: drosophilid pollination and reduced genetic diversity owing to founder effects probably promoted the shift to complete cleistogamy. Although chasmogamous Gastrodia species lack autonomous selfing, the limited mobility of drosophilid pollinators leads to high geitonogamy, functionally equivalent to self-pollination. These species also exhibit reduced genetic diversity at range margins. Combined with benefits such as reproductive assurance, colonization ability and reinforcement, these factors probably drove the evolution of complete cleistogamy. Flower colours symbolically represent genetic diversity. Designed by Shun Anzai, Kenji Suetsugu and Shingo Kaneko.
4. Discussion
(a). Genetic profiles and evolutionary history of completely cleistogamous selfers
We used integrative genomic approaches to elucidate the evolutionary history of Gastrodia lineages exhibiting complete cleistogamy. Although these lineages (i.e. G. takeshimensis and G. kuroshimensis) are distributed across multiple isolated islands, each appears to have independently evolved obligate selfing only once. Notably, these species exhibit no evidence of introgression with their chasmogamous relatives, even in cases of close sympatry [22,23]. In contrast to other selfing lineages that produce some cleistogamous flowers, where introgression may persist [11,19], the strict floral closure in Gastrodia probably impedes interspecific gene flow, indicating that complete cleistogamy serves as a robust reproductive barrier. Their distribution patterns further suggest that cleistogamy strengthens species boundaries in regions where closely related chasmogamous lineages coexist.
Our SSR-based analysis suggests that obligate selfing arose very recently in both G. takeshimensis and G. kuroshimensis. Despite the high mutation rates typically associated with SSR markers, we detected remarkable genetic uniformity among all sampled individuals. Even under conservative estimates involving low mutation rates and small effective population sizes, the likelihood of preserving identical SSR genotypes beyond approximately 200 generations remains exceedingly low. Given a generation time of roughly 5−10 years [45,82], it is probable that G. takeshimensis and G. kuroshimensis adopted complete cleistogamy within the past 2000 years. Supporting this, MIG‐seq data indicate that G. takeshimensis diverged from its chasmogamous sister species only about 35 generations ago, suggesting that the transition to complete cleistogamy initiated speciation. By contrast, G. kuroshimensis exhibits pronounced allelic divergence from G. fontinalis, indicating that G. fontinalis is unlikely to be its direct ancestor; instead, its true sister species may be an unexamined chasmogamous lineage that diverged early from G. fontinalis and subsequently became extinct.
While the genomic signatures of G. takeshimensis and G. kuroshimensis strongly support their obligate selfing nature, the chasmogamous species G. foetida and G. fontinalis also display genomic profiles indicative of predominant selfing, including reduced genetic diversity and low heterozygosity. Similar patterns were observed in nonsynonymous substitutions and predicted deleterious mutations—features expected to accumulate under sustained self-pollination [3,13]. These trends suggest that self-fertilization or mating among genetically similar individuals is widespread not only in the two cleistogamous species but also in their chasmogamous relatives. Accordingly, the chasmogamous ancestors of the current cleistogamous lineages were probably already predisposed to selfing-like reproductive modes, facilitating the subsequent evolution of complete cleistogamy.
(b). Driving factors of the evolution of complete cleistogamy
Although G. foetida and G. fontinalis exhibit genomic profiles typically observed in predominantly selfing plants, they retain floral structures (e.g. a prominent rostellum) that prevent autonomous self-pollination [23]. This seemingly paradoxical combination of ‘outcrossing flowers’ and inbreeding-like genetic features can be attributed to geitonogamy and mating among genetically similar neighbours mediated by Drosophila fruit flies, the principal pollinators in these species [40]. The drosophilid pollination system, while adapted to low-light environments, probably results in restricted pollen dispersal and high geitonogamy owing to the limited movement of these flies [40,83]. Given that complete cleistogamy ensures a nearly 100% fruit set, in contrast to the relatively low success of Drosophila pollination [22,84], selfing is probably favoured as a form of reproductive assurance [85].
Small island populations represent another likely driver. Island endemics often exhibit reduced genetic diversity owing to founder effects, geographical isolation and small effective population sizes [86]. Interestingly, G. takeshimensis, G. kuroshimensis and other cleistogamous taxa (e.g. Gastrodia amamiana, Gastrodia clausa and Gastrodia flexistyloides) predominantly inhabit small islands in the Ryukyu Islands [25–29]. Moreover, while chasmogamous Gastrodia species (notably G. foetida) show moderate genetic diversity on certain islands (e.g. Iriomote Island for G. foetida and Takeshima Island for G. fontinalis), populations that co-occur with cleistogamous sister species exhibit reduced diversity (e.g. Yakushima Island for G. foetida and Kuroshima Island for G. fontinalis). The diminished genetic variation in these populations may have rendered mating among different individuals functionally nearly equivalent to self-fertilization, thereby facilitating the transition to complete cleistogamy.
Completely cleistogamous species also appear well-adapted for colonizing new islands, as selfing allows a single individual to establish a population without reliance on mates or pollinators [7,8,87]. Notably, while G. takeshimensis and G. kuroshimensis often co-occur with chasmogamous sister species, they are also distributed on islands lacking such relatives (e.g. G. takeshimensis on Takeshima, Kuroshima and Suwanosejima Islands and G. kuroshimensis on Yakushima Island) [26,28,42,43].
Finally, autonomous selfing mitigates reproductive interference, such as heterospecific pollen deposition that can clog stigmas, inhibit conspecific pollen germination or usurp ovules, thereby facilitating the coexistence of closely related species in some cases [88,89]. Among selfing strategies, complete cleistogamy is particularly effective at precluding heterospecific pollen receipt [22]. Given that the divergence between the ancestors of G. fontinalis and G. kuroshimensis probably predates the origin of complete cleistogamy in G. kuroshimensis, it is plausible that the lineages initially diverged allopatrically for reasons unrelated to cleistogamy. Subsequent secondary contact may have selected for complete cleistogamy in the direct ancestor of G. kuroshimensis (now probably extinct) as a reinforcement mechanism.
Taken together, the recurrent emergence of completely cleistogamous taxa in Gastrodia probably reflects a trade-off between immediate and long-term fitness benefits [22]. Once a lineage becomes genetically uniform and pollinator-mediated outcrossing is inherently restricted, the evolution of complete cleistogamy may confer advantages such as reproductive assurance, enhanced colonization ability and strong reproductive isolation in multispecies communities. However, the extremely recent origin (<200 generations) of complete cleistogamy in G. takeshimensis and G. kuroshimensis suggests that obligate selfing may be evolutionarily ephemeral. As even minimal levels of outcrossing can reduce the accumulation of deleterious mutations [14,15], complete cleistogamy might entail long-term genetic vulnerabilities.
Considering the rarity of complete cleistogamy and the historical scepticism about its existence since Darwin [9], comparative studies, including analyses of other lineages (e.g. Gastrodia indica) [90], will be essential to clarify the ecological and genetic conditions that promote its evolution (e.g. whether reduced genetic diversity precedes its emergence) and to determine the timescales over which such lineages can persist without outcrossing. Moreover, while this study sheds light on the ultimate (evolutionary) drivers of complete cleistogamy, complementary approaches such as interspecific crossing and quantitative trait locus (QTL) mapping [91] will be instrumental in uncovering the proximate mechanisms and genomic architecture underlying this reproductive strategy.
Acknowledgements
The authors thank Hiroaki Yamashita, Hisashi Oiwane, Katsutoshi Maruno, Kazui Hiroshi, Kenshi Tetsuka, Koichi Ueda, Mikio Kashima, Nobuhiko Mizoguchi, Shinsaku Yamazaki, Shuichiro Tagane, Hironobu Kanemitsu, Tetsuro Ikeda, Hidehito Okada, Kohei Yamana, Yukito Asano, Yoshiaki Kitada, and Toshihiro Saito for assistance with fieldwork or for providing specimens and photographs. The authors also thank Keiju Kishikawa, Kenji Ogaki, Yuki Odagiri, Rin Aoki, Kazuma Takizawa, Hidehito Okada, Tomoaki Sakamoto, Seisuke Kimura, Toshihito Takagi and Takako Shizuka for technical support. We further thank Drs Shota Sakaguchi and Yuji Isagi for valuable discussions. The illustration was prepared by Shun Anzai. Computations were partly conducted using the supercomputer at the National Institute of Genetics.
Contributor Information
Kenji Suetsugu, Email: kenji.suetsugu@gmail.com.
Shun K. Hirota, Email: shun.hirota@gmail.com.
Takashi Makino, Email: tamakino@tohoku.ac.jp.
Yoshihisa Suyama, Email: yoshihisa.suyama.e2@tohoku.ac.jp.
Shingo Kaneko, Email: kaneko.shingo@gmail.com.
Kenji Fukushima, Email: kenji.fukushima@nig.ac.jp.
Ethics
This work did not require ethical approval from a human subject or animal welfare committee.
Data accessibility
The raw sequencing data have been deposited in the DDBJ Sequence Read Archive under BioProject accession numbers PRJDB18852 for MIG-seq, and PRJDB10965, PRJDB10966, PRJDB10992 and PRJDB10993 for RNA-seq. Additional data from this study are available on Figshare [92]. Supplementary material is available online [93].
Declaration of AI use
We have not used AI-assisted technologies in creating this article.
Authors’ contributions
K.S.: conceptualization, data curation, formal analysis, funding acquisition, investigation, methodology, project administration, resources, validation, visualization, writing—original draft, writing—review and editing; H.K.S.: formal analysis, investigation, validation, visualization, writing—review and editing; T.M.: investigation, methodology; Y.S.: investigation; S.K.: funding acquisition, investigation, validation, visualization, writing—review and editing; K.F.: formal analysis, investigation, methodology, validation, visualization, writing—review and editing.
All authors gave final approval for publication and agreed to be held accountable for the work performed therein.
Conflict of interest declaration
We declare we have no competing interests.
Funding
This study was financially supported by the JSPS KAKENHI (grant numbers JP15K18470 and JP17H05016 to K.S., and JP18K06408 to S.K. and K.S.), the PRESTO (grant number JPMJPR21D6 to K.S.) from the Japan Science and Technology Agency, the Environment Research and Technology Development Fund (grant number 4-2001 to K.S. and Y.S.) from the Ministry of the Environment, Japan, and the Sofja Kovalevskaja programme of the Alexander von Humboldt Foundation (to K.F.).
References
- 1. Darwin C. 1862. On the various contrivances by which British and foreign orchids are fertilized by insects. London, UK: John Murray. [PMC free article] [PubMed] [Google Scholar]
- 2. Stebbins GL. 1957. Self fertilization and population variability in the higher plants. Am. Nat. 91, 337–354. ( 10.1086/281999) [DOI] [Google Scholar]
- 3. Wright SI, Kalisz S, Slotte T. 2013. Evolutionary consequences of self-fertilization in plants. Proc. R. Soc. B 280, 20130133. ( 10.1098/rspb.2013.0133) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Nordborg M. 2000. Linkage disequilibrium, gene trees and selfing: an ancestral recombination graph with partial self-fertilization. Genetics 154. ( 10.1093/genetics/154.2.923) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Pollak E. 1987. On the theory of partially inbreeding finite populations. I. Partial selfing. Genetics 117, 353–360. ( 10.1093/genetics/117.2.353) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Cutter AD. 2019. Reproductive transitions in plants and animals: selfing syndrome, sexual selection and speciation. New Phytol. 224, 1080–1094. ( 10.1111/nph.16075) [DOI] [PubMed] [Google Scholar]
- 7. Baker HG. 1955. Self-compatibility and establishment after ‘long-distance’ dispersal. Evolution 9, 347–349. ( 10.1111/j.1558-5646.1955.tb01544.x) [DOI] [Google Scholar]
- 8. Baker HG. 1967. Support for Baker’s law-as a rule. Evolution 21, 853–856. ( 10.1111/j.1558-5646.1967.tb03440.x) [DOI] [PubMed] [Google Scholar]
- 9. Pannell JR. 2009. On the problems of a closed marriage: celebrating Darwin 200. Biol. Lett. 5, 332–335. ( 10.1098/rsbl.2009.0142) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Slotte T, et al. 2013. The Capsella rubella genome and the genomic consequences of rapid mating system evolution. Nat. Genet. 45, 831–835. ( 10.1038/ng.2669) [DOI] [PubMed] [Google Scholar]
- 11. Brandvain Y, Kenney AM, Flagel L, Coop G, Sweigart AL. 2014. Speciation and introgression between Mimulus nasutus and Mimulus guttatus. PLoS Genet. 10, e1004410. ( 10.1371/journal.pgen.1004410) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Shimizu KK, Tsuchimatsu T. 2015. Evolution of selfing: recurrent patterns in molecular adaptation. Annu. Rev. Ecol. Evol. Syst 46. ( 10.1146/annurev-ecolsys-112414-054249) [DOI] [Google Scholar]
- 13. Wang XJ, Barrett SCH, Zhong L, Wu ZK, Li DZ, Wang H, Zhou W. 2021. The genomic selfing syndrome accompanies the evolutionary breakdown of heterostyly. Mol. Biol. Evol. 38, 168–180. ( 10.1093/molbev/msaa199) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Pamilo P, Nei M, Li WH. 1987. Accumulation of mutations in sexual and asexual populations. Genet. Res. 49, 135–146. [DOI] [PubMed] [Google Scholar]
- 15. Wright SI, Ness RW, Foxe JP, Barrett SCH. 2008. Genomic consequences of outcrossing and selfing in plants. Int. J. Plant Sci. 169, 105–118. ( 10.1086/523366) [DOI] [Google Scholar]
- 16. Shimizu KK, Kudoh H, Kobayashi MJ. 2011. Plant sexual reproduction during climate change: gene function in natura studied by ecological and evolutionary systems biology. Ann. Bot. 108, 777–787. ( 10.1093/aob/mcr180) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Tsuchimatsu T, Fujii S. 2022. The selfing syndrome and beyond: diverse evolutionary consequences of mating system transitions in plants. Phil. Trans. R. Soc. B 377, 20200510. ( 10.1098/rstb.2020.0510) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Brys R, van Cauwenberghe J, Jacquemyn H. 2016. The importance of autonomous selfing in preventing hybridization in three closely related plant species. J. Ecol. 104, 601–610. ( 10.1111/1365-2745.12524) [DOI] [Google Scholar]
- 19. Kenney AM, Sweigart AL. 2016. Reproductive isolation and introgression between sympatric Mimulus species. Mol. Ecol. 25, 2499–2517. ( 10.1111/mec.13630) [DOI] [PubMed] [Google Scholar]
- 20. Lowry DB, Rockwood RC, Willis JH. 2008. Ecological reproductive isolation of coast and inland races of Mimulus guttatus. Evolution 62. ( 10.1111/j.1558-5646.2008.00457.x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Martin NH, Willis JH. 2007. Ecological divergence associated with mating system causes nearly complete reproductive isolation between sympatric Mimulus species. Evolution 61, 68–82. ( 10.1111/j.1558-5646.2007.00006.x) [DOI] [PubMed] [Google Scholar]
- 22. Suetsugu K. 2022. Living in the shadows: Gastrodia orchids lack functional leaves and open flowers. Plants People Planet 4, 418–422. ( 10.1002/ppp3.10281) [DOI] [Google Scholar]
- 23. Suetsugu K, Fukushima K, Makino T, Ikematsu S, Sakamoto T, Kimura S. 2023. Transcriptomic heterochrony and completely cleistogamous flower development in the mycoheterotrophic orchid Gastrodia. New Phytol. 237, 323–338. ( 10.1111/nph.18495) [DOI] [PubMed] [Google Scholar]
- 24. Culley TM, Klooster MR. 2007. The cleistogamous breeding system: a review of its frequency, evolution, and ecology in angiosperms. Bot. Rev. 73, 1–30. ( 10.1663/0006-8101(2007)73[1:tcbsar]2.0.co;2) [DOI] [Google Scholar]
- 25. Hsu TC, Chung SW, Kuo CM. 2012. Supplements to the orchid flora of Taiwan (vi). Taiwania 57, 271–277. ( 10.6165/tai.2012.57(3).271) [DOI] [Google Scholar]
- 26. Suetsugu K. 2013. Gastrodia takeshimensis (Orchidaceae), a new mycoheterotrophic species from Japan. Ann. Bot. Fenn. 50, 375–378. ( 10.5735/085.050.0613) [DOI] [Google Scholar]
- 27. Suetsugu K. 2014. Gastrodia flexistyloides (Orchidaceae), a new mycoheterotrophic plant with complete cleistogamy from Japan. Phytotaxa 175. ( 10.11646/phytotaxa.175.5.5) [DOI] [Google Scholar]
- 28. Suetsugu K. 2016. Gastrodia kuroshimensis (Orchidaceae), a new mycoheterotrophic and complete cleistogamous plant from Japan. Phytotaxa 278, 265–272. ( 10.11646/phytotaxa.278.3.6) [DOI] [Google Scholar]
- 29. Suetsugu K. 2019. Gastrodia amamiana (Orchidaceae; Epidendroideae; Gastrodieae), a new completely cleistogamous species from Japan. Phytotaxa 413, 225–230. ( 10.11646/phytotaxa.413.3.3) [DOI] [Google Scholar]
- 30. Servedio MR, Noor MAF. 2003. The role of reinforcement in speciation: theory and data. Annu. Rev. Ecol. Evol. Syst. 34, 339–364. ( 10.1146/annurev.ecolsys.34.011802.132412) [DOI] [Google Scholar]
- 31. Ossowski S, Schneeberger K, Lucas-Lledó JI, Warthmann N, Clark RM, Shaw RG, Weigel D, Lynch M. 2010. The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 327, 92–94. ( 10.1126/science.1180677) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Hirao AS, Watanabe Y, Hasegawa Y, Takagi T, Ueno S, Kaneko S. 2022. Mutational effects of chronic gamma radiation throughout the life cycle of Arabidopsis thaliana: insight into radiosensitivity in the reproductive stage. Sci. Total Environ. 838, 156224. ( 10.1016/j.scitotenv.2022.156224) [DOI] [PubMed] [Google Scholar]
- 33. Suyama Y, Hirota SK, Matsuo A, Tsunamoto Y, Mitsuyuki C, Shimura A, Okano K. 2022. Complementary combination of multiplex high‐throughput DNA sequencing for molecular phylogeny. Ecol. Res. 37, 171–181. ( 10.1111/1440-1703.12270) [DOI] [Google Scholar]
- 34. Nota K, et al. 2022. Norway spruce postglacial recolonization of Fennoscandia. Nat. Commun. 13, 1333. ( 10.1038/s41467-022-28976-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Suetsugu K, Hirota SK, Shitara T, Ishida K, Nakato N, Hayakawa H, Suyama Y. 2024. The absence of bumblebees on an oceanic island blurs the species boundary of two closely related orchids. New Phytol. 241, 1321–1333. ( 10.1111/nph.19325) [DOI] [PubMed] [Google Scholar]
- 36. Hamabata T, Kinoshita G, Kurita K, Cao PL, Ito M, Murata J, Komaki Y, Isagi Y, Makino T. 2019. Endangered island endemic plants have vulnerable genomes. Commun. Biol. 2, 1–10. ( 10.1038/s42003-019-0490-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Ogura-Tsujita Y, Gebauer G, Hashimoto T, Umata H, Yukawa T. 2009. Evidence for novel and specialized mycorrhizal parasitism: the orchid Gastrodia confusa gains carbon from saprotrophic Mycena. Proc. R. Soc. B 276, 761–767. ( 10.1098/rspb.2008.1225) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Martos F, Dulormne M, Pailler T, Bonfante P, Faccio A, Fournel J, Dubois MP, Selosse MA. 2009. Independent recruitment of saprotrophic fungi as mycorrhizal partners by tropical achlorophyllous orchids. New Phytol. 184, 668–681. ( 10.1111/j.1469-8137.2009.02987.x) [DOI] [PubMed] [Google Scholar]
- 39. Nei M, Tajima F, Tateno Y. 1983. Accuracy of estimated phylogenetic trees from molecular data. J. Mol. Evol. 19, 153–170. ( 10.1007/bf02300753) [DOI] [PubMed] [Google Scholar]
- 40. Suetsugu K. 2023. A novel nursery pollination system between a mycoheterotrophic orchid and mushroom‐feeding flies. Ecology 104, e4152. ( 10.1002/ecy.4152) [DOI] [PubMed] [Google Scholar]
- 41. Suetsugu K, Aoki R, Kaneko S. 2023. Resurrection and emended description of Gastrodia foetida (Orchidaceae). Phytotaxa 583, 251–259. 3. ( 10.11646/phytotaxa.583.3.3) [DOI] [Google Scholar]
- 42. Suetsugu K. 2017. Range extensions for two mycoheterotrophic orchids, Gastrodia takeshimensis and G. flexistyloides (Orchidaceae), outside their type locality. Acta Phytotax. Geobot 68, 53–57. ( 10.18942/apg.201618) [DOI] [Google Scholar]
- 43. Kishikawa K, Suetsugu K, Kyogoku D, Ogaki K, Iga D, Shutoh K, Isagi Y, Kaneko S. 2019. Development of microsatellite markers for the completely cleistogamous species Gastrodia takeshimensis (Orchidaceae) that are transferable to its chasmogamous sister G. nipponica Genes Genet. Syst. 94, 95–98. ( 10.1266/ggs.18-00057) [DOI] [PubMed] [Google Scholar]
- 44. Suetsugu K, Umata H, Yokota M. 2014. First record of the mycoheterotrophic orchid Gastrodia fontinalis (Orchidaceae) from Takeshima Island, the Ryukyu Islands, Japan. Taiwania 59, 383–386. [Google Scholar]
- 45. Ogura-Tsujita Y, Tetsuka K, Tagane S, Kubota M, Anan S, Yamashita Y, Tone K, Yukawa T. 2021. Differing life-history strategies of two mycoheterotrophic orchid species associated with leaf litter- and wood-decaying fungi. Diversity 13, 161. ( 10.3390/d13040161) [DOI] [Google Scholar]
- 46. Ogaki K, Suetsugu K, Kishikawa K, Kyogoku D, Shutoh K, Isagi Y, Kaneko S. 2020. New microsatellite markers recognize differences in tandem repeats among four related Gastrodia species (Orchidaceae). Genes Genet. Syst. 95, 52–53. ( 10.1266/ggs.95.52) [DOI] [PubMed] [Google Scholar]
- 47. Peakall R, Smouse PE. 2012. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—an update. Bioinformatics 28, 2537–2539. ( 10.1093/bioinformatics/bts460) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Goudet J. 1995. FSTAT (Version 1.2): a computer program to calculate F-statistics. J. Hered. 86, 485–486. ( 10.1093/oxfordjournals.jhered.a111627) [DOI] [Google Scholar]
- 49. Langella O. 2007. Populations ver. 1.2.30: population genetic software (individuals or populations distances, phylogenetic trees). See http://bioinformatics.org/∼tryphon/populations/.
- 50. Marriage TN, Hudman S, Mort ME, Orive ME, Shaw RG, Kelly JK. 2009. Direct estimation of the mutation rate at dinucleotide microsatellite loci in Arabidopsis thaliana (Brassicaceae). Heredity 103, 310–317. ( 10.1038/hdy.2009.67) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Doyle JJ, Doyle JL. 1990. Isolation of plant DNA from fresh tissue. Focus 12, 13–15. [Google Scholar]
- 52. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. ( 10.1093/bioinformatics/btu170) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Jiang Y, et al. 2022. The Gastrodia menghaiensis (Orchidaceae) genome provides new insights of orchid mycorrhizal interactions. BMC Plant Biol. 22, 179. ( 10.1186/s12870-022-03573-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Li H. 2013. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv. ( 10.48550/ARXIV.1303.3997) [DOI]
- 55. Rochette NC, Rivera‐Colón AG, Catchen JM. 2019. Stacks 2: analytical methods for paired‐end sequencing improve RADseq‐based population genomics. Mol. Ecol. 28, 4737–4754. ( 10.1111/mec.15253) [DOI] [PubMed] [Google Scholar]
- 56. Rivera-Colón AG, Catchen J. 2022. Population genomics analysis with RAD, reprised: stacks 2. In Marine genomics: methods and protocols, vol. 2498 (eds Verde C, Giordano D), pp. 99–149. New York, NY: Springer US. ( 10.1007/978-1-0716-2313-8_7) [DOI] [PubMed] [Google Scholar]
- 57. Paris JR, Stevens JR, Catchen JM. 2017. Lost in parameter space: a road map for Stacks. Methods Ecol. Evol. 8, 1360–1373. ( 10.1111/2041-210x.12775) [DOI] [Google Scholar]
- 58. Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313. ( 10.1093/bioinformatics/btu033) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Huson DH, Bryant D. 2006. Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23, 254–267. ( 10.1093/molbev/msj030) [DOI] [PubMed] [Google Scholar]
- 60. Yahara T, Hirota SK, Fujii S, Kokami Y, Fuse K, Sato H, Tagane S, Suyama Y. 2023. Molecular phylogeny and taxonomy of Hosta (Asparagaceae) on Shikoku Island, Japan, including five new species, one new subspecies, and two new status assignments. PhytoKeys 235, 137–187. ( 10.3897/phytokeys.235.99140) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Pritchard JK, Stephens M, Donnelly P. 2000. Inference of population structure using multilocus genotype data. Genetics 155, 945–959. ( 10.1093/genetics/155.2.945) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Evanno G, Regnaut S, Goudet J. 2005. Detecting the number of clusters of individuals using the software structure: a simulation study. Mol. Ecol. 14, 2611–2620. ( 10.1111/j.1365-294x.2005.02553.x) [DOI] [PubMed] [Google Scholar]
- 63. Earl DA, vonHoldt BM. 2012. Structure harvester: a website and program for visualizing Structure output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361. ( 10.1007/s12686-011-9548-7) [DOI] [Google Scholar]
- 64. Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I. 2015. Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol. Ecol. Resour. 15, 1179–1191. ( 10.1111/1755-0998.12387) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Cornuet JM, Pudlo P, Veyssier J, Dehne-Garcia A, Gautier M, Leblois R, Marin JM, Estoup A. 2014. DIYABC v2.0: a software to make approximate Bayesian computation inferences about population history using single nucleotide polymorphism, DNA sequence and microsatellite data. Bioinformatics 30, 1187–1189. ( 10.1093/bioinformatics/btt763) [DOI] [PubMed] [Google Scholar]
- 66. Grabherr MG, et al. 2011. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652. ( 10.1038/nbt.1883) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Chen S, Zhou Y, Chen Y, Gu J. 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890. ( 10.1093/bioinformatics/bty560) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Seppey M, Manni M, Zdobnov EM. 2019. BUSCO: assessing genome assembly and annotation completeness. In Gene prediction: methods and protocols (ed. Kollmar M), pp. 227–245. New York, NY: Springer. ( 10.1007/978-1-4939-9173-0_14) [DOI] [PubMed] [Google Scholar]
- 69. Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760. ( 10.1093/bioinformatics/btp324) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Li H, et al. 2009. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. ( 10.1093/bioinformatics/btp352) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Hirota T, Izumi M, Wada S, Makino A, Ishida H. 2018. Vacuolar protein degradation via autophagy provides substrates to amino acid catabolic pathways as an adaptive response to sugar starvation in Arabidopsis thaliana. Plant Cell Physiol. 59, 1363–1376. ( 10.1093/pcp/pcy005) [DOI] [PubMed] [Google Scholar]
- 72. Fishman L, Beardsley PM, Stathos A, Williams CF, Hill JP. 2015. The genetic architecture of traits associated with the evolution of self‐pollination in Mimulus. New Phytol. 205, 907–917. ( 10.1111/nph.13091) [DOI] [PubMed] [Google Scholar]
- 73. Zhang Z, Kryvokhyzha D, Orsucci M, Glémin S, Milesi P, Lascoux M. 2022. How broad is the selfing syndrome? Insights from convergent evolution of gene expression across species and tissues in the Capsella genus. New Phytol. 236, 2344–2357. ( 10.1111/nph.18477) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Suetsugu K. 2015. Autonomous self-pollination and insect visitors in partially and fully mycoheterotrophic species of Cymbidium (Orchidaceae). J. Plant Res. 128, 115–125. ( 10.1007/s10265-014-0669-4) [DOI] [PubMed] [Google Scholar]
- 75. Shutoh K, Izuno A, Isagi Y, Kurosawa T, Kaneko S. 2017. Development of microsatellite markers for partially and putative fully mycoheterotrophic varieties of Pyrola japonica sensu lato (Ericaceae). Genes Genet. Syst. 92, 99–103. ( 10.1266/ggs.16-00048) [DOI] [PubMed] [Google Scholar]
- 76. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. 2020. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (doi. ( 10.1093/molbev/msaa015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780. ( 10.1093/molbev/mst010) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973. ( 10.1093/bioinformatics/btp348) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Saul F, et al. 2023. Subgenome dominance shapes novel gene evolution in the decaploid pitcher plant Nepenthes gracilis. Nat. Plants 9. ( 10.1038/s41477-023-01562-2) [DOI] [PubMed] [Google Scholar]
- 80. Yang Z. 2007. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591. ( 10.1093/molbev/msm088) [DOI] [PubMed] [Google Scholar]
- 81. Goolsby EW, Bruggeman J, Ané C. 2017. Rphylopars: fast multivariate phylogenetic comparative methods for missing data and within‐species variation. Methods Ecol. Evol. 8, 22–27. ( 10.1111/2041-210x.12612) [DOI] [Google Scholar]
- 82. Shimaoka C, Fukunaga H, Inagaki S, Sawa S. 2017. Artificial cultivation system for Gastrodia spp. and identification of associated mycorrhizal fungi. Int. J. Biol 9, 27–34. ( 10.5539/ijb.v9n4p27) [DOI] [Google Scholar]
- 83. Martos F, Cariou ML, Pailler T, Fournel J, Bytebier B, Johnson SD. 2015. Chemical and morphological filters in a specialized floral mimicry system. New Phytol. 207, 225–234. ( 10.1111/nph.13350) [DOI] [PubMed] [Google Scholar]
- 84. Suetsugu K. 2018. Achlorophyllous orchid can utilize fungi not only for nutritional demands but also pollinator attraction. Ecology 99, 1498–1500. ( 10.1002/ecy.2170) [DOI] [PubMed] [Google Scholar]
- 85. Acoca‐Pidolle S, Gauthier P, Devresse L, Deverge Merdrignac A, Pons V, Cheptou PO. 2024. Ongoing convergent evolution of a selfing syndrome threatens plant–pollinator interactions. New Phytol. 242, 717–726. ( 10.1111/nph.19422) [DOI] [PubMed] [Google Scholar]
- 86. Jamieson IG. 2011. Founder effects, inbreeding, and loss of genetic diversity in four avian reintroduction programs. Conserv. Biol 25. ( 10.1111/j.1523-1739.2010.01574.x) [DOI] [PubMed] [Google Scholar]
- 87. Barrett SCH. 1996. The reproductive biology and genetics of island plants. Phil. Trans. R. Soc. B 351, 725–733. [Google Scholar]
- 88. Randle AM, Spigler RB, Kalisz S. 2018. Shifts to earlier selfing in sympatry may reduce costs of pollinator sharing. Evolution 72, 1587–1599. ( 10.1111/evo.13522) [DOI] [PubMed] [Google Scholar]
- 89. Katsuhara KR, Ushimaru A. 2019. Prior selfing can mitigate the negative effects of mutual reproductive interference between coexisting congeners. Funct. Ecol. 33, 1504–1513. ( 10.1111/1365-2435.13344) [DOI] [Google Scholar]
- 90. Khanal M, Sarkar S, Sarkar K, Chhetri P, Rai SK, Kumar D, Rawat S. 2024. Gastrodia indica (Orchidaceae: Gastrodieae), a new completely cleistogamous holomycotrophic species from India. Phytotaxa 663, 26–32. ( 10.11646/phytotaxa.663.1.3) [DOI] [Google Scholar]
- 91. Sicard A, Stacey N, Hermann K, Dessoly J, Neuffer B, Bäurle I, Lenhard M. 2011. Genetics, evolution, and adaptive significance of the selfing syndrome in the genus Capsella. Plant Cell 23, 3156–3171. ( 10.1105/tpc.111.088237) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92. Suetsugu K, Hirota KS, Makino T, Suyama Y, Kaneko S, Fukushima K. 2024. Data from: Genomic signature and evolutionary history of completely cleistogamous lineages in the non-photosynthetic orchid Gastrodia. Figshare. ( 10.6084/m9.figshare.25756938) [DOI] [PMC free article] [PubMed]
- 93. Suetsugu K, Shun HK, Makino T, Suyama Y, Kaneko S, Fukushima K. 2025. Supplementary material from: Genomic signature and evolutionary history of completely cleistogamous lineages in the non-photosynthetic orchid Gastrodia. Figshare. ( 10.6084/m9.figshare.c.7811530) [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The raw sequencing data have been deposited in the DDBJ Sequence Read Archive under BioProject accession numbers PRJDB18852 for MIG-seq, and PRJDB10965, PRJDB10966, PRJDB10992 and PRJDB10993 for RNA-seq. Additional data from this study are available on Figshare [92]. Supplementary material is available online [93].




