Skip to main content
Communications Biology logoLink to Communications Biology
. 2024 May 16;7:581. doi: 10.1038/s42003-024-06266-7

Population genomics and epigenomics of Spirodela polyrhiza provide insights into the evolution of facultative asexuality

Yangzi Wang 1,2,#, Pablo Duchen 1,2,#, Alexandra Chávez 1,2,3, K Sowjanya Sree 4, Klaus J Appenroth 5, Hai Zhao 6, Martin Höfer 1,2, Meret Huber 1,3, Shuqing Xu 1,2,7,
PMCID: PMC11099151  PMID: 38755313

Abstract

Many plants are facultatively asexual, balancing short-term benefits with long-term costs of asexuality. During range expansion, natural selection likely influences the genetic controls of asexuality in these organisms. However, evidence of natural selection driving asexuality is limited, and the evolutionary consequences of asexuality on the genomic and epigenomic diversity remain controversial. We analyzed population genomes and epigenomes of Spirodela polyrhiza, (L.) Schleid., a facultatively asexual plant that flowers rarely, revealing remarkably low genomic diversity and DNA methylation levels. Within species, demographic history and the frequency of asexual reproduction jointly determined intra-specific variations of genomic diversity and DNA methylation levels. Genome-wide scans revealed that genes associated with stress adaptations, flowering and embryogenesis were under positive selection. These data are consistent with the hypothesize that natural selection can shape the evolution of asexuality during habitat expansions, which alters genomic and epigenomic diversity levels.

Subject terms: Evolutionary genetics, Genetic variation, Epigenomics


Population genomic and epigenomic study in a facultatively asexual plant indicates that natural selection can act on the controls of asexual reproduction during range expansion, which in turn might reduce genetic and epigenetic diversity in the population.

Introduction

Understanding the evolution of sexual reproduction has long been at the center of evolutionary biology. Theories suggest that asexual reproduction is beneficial for the short term but costly for the long term, mainly due to accumulations of deleterious mutations and low effective population size15. Facultative asexuality, where organisms can reproduce both sexually and asexually depending on environmental conditions, should be optimal for one individual’s lifespan6,7. While rather few animals such as aphids (Aphidoidea)8, water fleas (Cladocerans)9, and rotifers10 reproduce facultatively asexually, up to ~80% of the flowering plants, including important crops and keystone species, can reproduce both sexually and asexually11. Asexual reproduction in plants involves different types of vegetative reproduction (e.g. runners, tubers, bulbs, corms, suckers, plantlets), as well as apomixis, the formation of seeds without fertilization12. Because changes between sexual and asexual reproduction affect the ability to persist in the short and long term, natural selection might act on the genetic controls of sexual and asexual reproduction in facultative asexual organisms, which in turn can alter the levels of genomic diversity, heterozygosity and effectiveness of selection in the population2,5,1315. However, direct evidence supporting this prediction remains scarce, mainly due to the lack of a suitable facultative asexually reproducing system in which the signature of selection can be detected at genomic levels.

Evolutionary changes in sexual and asexual reproduction might also affect the maintenance and dynamics of chromatin marks, e.g., epigenetic markers such as DNA methylations. In plants, cytosine methylation can occur in three sequence contexts: CpG, CHG, and CHH (H = A, T, or C), which are controlled by different mechanisms and have different dynamics during reproduction16. Typically, CpG and CHG methylation are maintained by methyltransferases1 (MET1) and CHROMOMETHYLASE3 (CMT3), respectively, whereas CHH methylation is mostly maintained by CMT217. During sexual reproduction, DNA methylations are highly dynamic18. In both male and female gametogenesis, the megaspore mother cell and microspore mother cell experience dramatic chromatin changes during cell specification, such as heterochromatin decondensation and an enlarged nuclear volume19,20. During male gametogenesis, sperm DNA is highly methylated in the CpG and CHG context but has low CHH methylation in retrotransposons18,21,22. During female gametogenesis, CpG and CHH methylation remains largely steady23. After fertilization, CHH methylation increases during embryogenesis and can approach 100% at individual cytosines, which then decreases likely through a passive mechanism after germination2426. In contrast, during vegetative reproduction, DNA methylation is likely steady since meiosis and embryogenesis are lacking2729. Although Niederhuth, C. E. et al.30. comparing DNA methylations among 34 angiosperm species suggested that clonally propagated species often have low CHH methylation, the extent to which asexual reproduction affects genome-wide methylation levels remains unclear.

Here, we investigated the population genome and epigenome of a facultatively asexual plant, Spirodela polyrhiza (the giant duckweed; Lemnaceae), using samples from a global collection. This species, like other duckweeds from the genera Spirodela, Landoltia and Lemna, is characterized by leaf-like fronds derived from fused stems31 and, with multiple roots on each frond32 and with a highly reduced vascular system33. Spirodela polyrhiza reproduces vegetatively via budding under normal conditions but very rarely switches to sexual reproduction under unfavorable conditions34,35. Recent studies showed that despite its global distribution in diverse habitats, the genomic diversity, spontaneous mutation rates and DNA methylation levels in S. polyrhiza are very low3641, which might be associated with its overall low frequency of sexual reproduction. DNA methylation profiling of two genotypes suggests that DNA methylation in S. polyrhiza, which is substantially lower than in other plants, varied between genotypes41. Further insights into the evolutionary origin and consequences of asexuality on genomic and epigenomic variation in S. polyrhiza are required to understand the demographic history and to identify the footprint of selection on the genome.

Results

Extremely low genomic variations in S. polyrhiza

We sequenced the genomes of 131 globally distributed S. polyrhiza genotypes with an average of ~25 X coverage. Together with previously published samples36,37, we analyzed the genomic diversity of 228 S. polyrhiza individuals across five continents (Supplementary Data 1). We identified 1,241,981 high-quality biallelic single-nucleotide polymorphisms (SNPs) and 166,075 short insertions and deletions (INDELs, less than 50 bp of length). Based on an updated genome annotation of S. polyrhiza (see Supplementary Results Methods 1.1 and Supplementary Results Section 2.1), we found that most of the SNPs (70.3%) are in the intergenic regions (Supplementary Fig. 1). Of all the SNPs located in the protein-coding regions, 61,039 were identified as nonsynonymous and 44,287 as synonymous (Supplementary Data 2). Consistent with our previous study36, the genome-wide nucleotide diversity is 0.0016 (Supplementary Table 1), which falls within the lower range of genome-wide nucleotide diversity of other tested multicellular eukaryotes (Supplementary Table 2 and Supplementary Fig. 2). The species-wide efficacy of selection (πNS ratio) is 0.37, the highest among studied organisms42, indicating a relatively relaxed purifying selection in S. polyrhiza, despite its large effective population size36,37.

In addition to SNPs and small INDELs, we also characterize the genome-wide structural variations (SVs, ≥50 bp in length) in S. polyrhiza (see Supplementary Methods Section 1.2 and Supplementary Results Section 2.2). We identified 3,205 high-quality SVs, including 2,089 deletions, 291 duplications, and 825 insertions. Among all identified SVs, 155 duplications and 169 deletions affected protein-coding sequences (Supplementary Table 3 and Supplementary Data 3). Using a permutation approach at a genome-wide level, we identified gene families that are significantly enriched with SVs and small INDELs (see Supplementary Methods Section 1.3, 1.4, and 1.5, Supplementary Results Section 2.3, and Supplementary Data 4 and 5), respectively. We found several gene families related to defences, such as RPP843 and the glycoside hydrolase44, are enriched with both SVs and small INDELs. This is consistent with findings from Arabidopsis and other plant species, which show that SVs are enriched in stress and pathogen resistance45,46 (Supplementary Data 5). Interestingly, we also found SVs and small INDELs are also enriched in gene families that are involved in organ development and reproduction, such as the receptor-like protein kinases gene family47 and MADS-box gene family that has been shown to have substantial gene losses and copy number variations in duckweeds4850.

Population structure and demographic history of S. polyrhiza

Because S. polyrhiza is facultatively asexual, genotypes collected from the geographic proximity can be derived from the same clonal family. Using a previously established grouping threshold that was developed in S. polyrhiza2, we identified 159 likely clonal families in the sampled population (Supplementary Data 6).

Population structure and principal component analyses revealed four populations in the sampled S. polyrhiza (Fig. 1a and b). Consistent with our previous study, the four populations are largely concordant with their geographic origins, namely America, Southeast Asia (SE-Asia), Europe, and India (Supplementary Fig. 3), with a few exceptions that can be due to recent migration events or artifacts during long-term duckweed maintenance.

Fig. 1. Phylogeny, population structure and demographic model of 228 S. polyrhiza.

Fig. 1

a The population structure of 228 S. polyrhiza genotypes. b The principal component (PC) analysis of the SNPs from 228 S. polyrhiza genotypes. The three coordinates indicate the first three PCs. c The Maximum Likelihood phylogenetic tree of 228 S. polyrhiza genotypes. The gray branch represents the outgroup - C. esculenta. Dashed branches represent internal nodes with supporting values lower than 0.75 (the max is 1). d The demographic model of S. polyrhiza populations. “Tpopulation” indicates the estimated divergence time in generations.

We inferred the population history with a Maximum Likelihood (ML) phylogeny and Approximate Bayesian Computation (ABC). For ML, we used Colocasia esculenta (from the Araceae family) as an outgroup. The maximum likelihood phylogeny of all 228 genotypes indicated an early split of the American population from the other populations and subsequent splits of the Indian and European populations from SE-Asia (Fig. 1c). The European population constitutes the most recent split (Fig. 1c and d). Here, genotypes collected from the transcontinental region (e.g. Russia) showed intermediate features of SE-Asian and European populations, suggesting this as a likely migration route. Furthermore, the Indian population possibly originated via Thailand and Vietnam, as genotypes from these countries show intermediate features between Indian and SE-Asian populations.

We modeled the demographic history using an ABC modeling approach to further validate the evolutionary history of the four populations in S. polyrhiza (see Supplementary Methods Section 1.6 and Supplementary Table 4). Based on the phylogenetic analysis, we simulated three plausible demographic scenarios, allowing for either the SE-Asian, American or an additional putative population to function as the ancestral population (Supplementary Fig. 4). We found that the scenario, in which the American population and Asian population were derived from an additional putative ancestral population, constituted the most supported model (Fig. 1d). While the American population was separated from other populations around one million generations ago, the European population was derived from the SE-Asian population only 12,000 generations ago (see Supplementary Results Section 2.4 and Supplementary Table 5).

Determinants of genomic diversity among populations

Among the four populations, nucleotide diversity (π) and the efficacy of selection (πNS ratio) varied among populations (Fig. 2b). While the SE-Asian population has the highest π and lowest πNS ratio, the American population has the lowest π and highest πN/πS ratio. Interestingly, while the European population has a much smaller π compared to the SE-Asian population, the πN/πS ratio of the European population remains similar to the latter, likely due to its recent split from the SE-Asian population.

Fig. 2. Genomic diversity variation among four populations might result from the switching between sexual and asexual propagation in S. polyrhiza.

Fig. 2

a Scheme of asexual and sexual propagation cycles in S. polyrhiza. (A) Vegetative stage of S. polyrhiza; (B) Budding; (C) Offspring from clonal propagation; (D) S. polyrhiza flowering; (E) Putative schematic of ovule at endosperm cellularization stage; (F) Putative schematic of seeds. b Bar plots show the differences among the four populations in terms of “π”: genome-wide nucleotide diversity; “LD”: the physical extent (in kb) of pairwise SNPs at r2 of 0.2 (Europe does not yet reach r2 = 0.2 at 100 kb, Supplementary Fig. 5); “πNS”: the efficacy of linked selection; “r”: genome-wide recombination rate; “H”: the median of per population genome-wide heterozygosity rate. c Two panels of pie charts indicate the allele frequencies of the functional allele of SOC1-like and AGL62-like genes among populations. Gray: functional allele frequency; Black: SVs allele frequency. d The distribution and migration world map of the four S. polyrhiza populations. “+” suggests the increased functional allele frequency, while  “-”  suggests the decreased functional allele frequency.

Using genome-wide SNPs, we found that linkage disequilibrium (LD) is comparable to Arabidopsis thaliana51, suggesting considerable historical sexual reproduction in S. polyrhiza. However, the extent of LD decay varied substantially among populations (Fig. 2b and Supplementary Fig. 5). While the Asian population showed the most rapid LD decay (about 12 kb at r2 = 0.2), the European population had very long LD blocks (>100 kb). The Indian and American populations had intermediate LD decay. Consistently, the Asian population had the highest recombination rate compared to the other three (Fig. 2b). Different LDs and recombination rates found among populations indicate that the frequencies of sexual reproduction varied among populations. In addition, we found that the variations of heterozygosity in S. polyrhiza showed a similar pattern with the genomic diversity and recombination rate among four populations (Fig. 2b and Supplementary Fig. 6).

Interestingly, the changes in genomic diversity and levels of heterozygosity are associated with two SVs involving MADS-box genes that are involved in sexual reproduction. One SV is an 84 bp insertion at the last coding sequence (CDS) of gene SpGA2022_005278, a homolog of AGL62 from the Mα subclade of MADS-box genes (Supplementary Fig. 7). In A. thaliana, AGL62 is a transcription factor that suppresses endosperm cellularization by activating the expression of a putative invertase inhibitor, InvINH1, in the micropylar region of the endosperm52,53 (Supplementary Fig. 8). The insertion may potentially disrupt the function of the AGL62-like gene, suggesting a possible reduction in the suppression of endosperm development, which might be required for sexual reproduction (Fig. 2a). Consistently, we found the insertion was at a higher abundance in the SE-Asian population (87.5%) than in other populations (Fig. 2c, d). In addition, the insertion positively correlates with heterozygosity within the European population (Supplementary Table 6 and Supplementary Fig. 9).

Another SV is a 69 bp deletion at 1.8 kb upstream of SpGA2022_007306, (Supplementary Fig. 7), a gene that show homology to SOC1 (but shorter than SOC1, Supplementary Data 7), which is a positive regulator of the flowering process in A. thaliana54 (Fig. 2a). Conserved protein domain analyses suggested that SpGA2022_007306 has SRF-like MADS domain but lacks the K-box region (Supplementary Fig. 10), which is similar to Os03g03100 (OsMADS50), a SOC1 homology that are involved in regulating flowering time in rice5558. The deletion was exclusively found in the Indian population with the alternate allele frequency of 73% (Fig. 2c). It is plausible that the deletion, due to its disruption potential at the cis-regulatory region, reduces the ability of this SOC1-like gene to respond to the upstream floral activators (e.g. CO) in S. polyrhiza, thus reducing the frequency of sexual reproduction in the Indian population (Fig. 2d). Consistently, this deletion negatively correlates with heterozygosity in the Indian population (Supplementary Table 6, Supplementary Fig. 11). However, future functional validations on SV of the two MADS-box genes are needed to provide further mechanistic insights into the observed patterns.

Population epigenomic diversity in S. polyrhiza

As changes in sexual reproduction can also alter epigenomic dynamics, we further investigated the patterns of population epigenomic diversity in S. polyrhiza. We selected five individuals from each population and quantified their shoot DNA methylation levels at single-base resolution using whole genome bisulfite sequencing (Supplementary Table 7). Similar to a recent study39, we found that only 1.6% of cytosines are methylated in S. polyrhiza (7.6% of CpG, 2.3% of CHG, and 0.1% of CHH; Supplementary Table 8), and the average species-wide methylation level is the lowest among all studied angiosperms (Supplementary Fig. 12)30,59. The hierarchical clustering of 20 methylomes in CHG and CHH contexts in gene bodies show overall consistency with their genetic similarity (Supplementary Fig. 13 and 14) with few discrepancies were mostly found within the same population or between the recently diverged SE-Asian and European populations. While in the CpG context, we did not observe clear correlations between genetic and methylation distances (Supplementary Fig. 15).

We then compared the genome-wide weighted methylation level (wML) among populations. For CpG methylation, no differences were found among four populations at genome-wide, gene body, or TE levels (Fig. 3a, d and g). For CHG, the Indian population had the lowest genome-wide methylation level among all four populations (Fig. 3b, e, and h). Interestingly, for CHH, the SE-Asia and Europe populations had the higher genome-wide methylation levels compared to American and India populations (P < 0.05, pairwise Wilcoxon test; Fig. 3c), while the European and Indian populations showed a gradual reduction of methylation in comparison to the SE-Asian population. The pattern was the same for both gene bodies and TEs (P < 0.05, pairwise Wilcoxon test; Fig. 3f, i). The genome-wide reduction of CHH methylation is consistent with the hypothesis that clonal reproduction reduces CHH methylation, and the effects gradually accumulate over clonal generations60.

Fig. 3. Weighted methylation level (wML) among four populations.

Fig. 3

ac Box plots of genome-wide weighted methylation level (wML) in (a) CpG, (b) CHG, and (c) CHH context (N = 5). Uppercase letters indicate statistical differences among populations using Wilcoxon test (with bonferroni method for multiple tests corrections). df wML in the gene body and its flanking 2 kb regions in (d) CpG, (e) CHG, and (f) CHH context. gi wML in TE and its flanking 2 kb regions in (g) CpG, (h) CHG, and (i) CHH context. The asterisks indicate significant differences between populations (P < 0.05; Wilcoxon test), while “n.s.” indicates no significant difference.

The footprint of selection on the genome

To identify the genomic signature of selection at the species level, we performed genome-wide scans. To reduce false positives, we used the μ-statistics from RAiSD61, the composite likelihood ratio CLR statistic from SweeD62, and the T statistic from LASSI63. We found 69 genes showed strong signatures of selection using all three methods (Supplementary Fig. 16 and Supplementary Data 8). Manual inspection indicated that several orthologs of these genes are related to gametogenesis (e.g., NOTCHLESS) and embryogenesis (e.g., NUP214, CPSF, CDK, AGP, and ACR4) in Arabidopsis thaliana6468. Further enrichment analysis indeed showed that embryo lethal genes were enriched in these 69 genes (P = 0.016, χ2 test). In addition, the A. thaliana orthologs of several genes under selection are also associated with controlling sexual reproduction, including floral development (DRMY1 and ACR4)64,69, flowering time (NF-Y AT2G27470, NF-YAT1G72830, and CPSF), pollen development (EFOP3, ELMOD, and CLC)66,7074, seed development (NUP214, NF-Y AT2G27470 and NF-YAT1G72830, and Transducin/WD40)65,70,75. Furthermore, among these 69 genes, we also found several genes involved in leaf development and vascularity (SECA2, RbgA, PHABULOSA/PHAVOLUTA)7678, light signaling (NF-Y, CCR4-NOT, and PPP)70,79,80, root development (GEND1, WAVY, and ACR4)64,81,82, DNA damage repair (ATM and Xrcc3)83,84, and stress tolerance (phospholipase D, histone superfamily protein, RabGAP, FC1, NUDX2) (Supplementary Data 8).

To further understand the selection that drove the evolution within individual populations, we identified the signature of positive selection in a three-population tree using patterns of linked allele frequency differentiation and calculating the corresponding composite-likelihood ratio (CLR, see Methods). In total, we found 1,883 genes on the SE-Asian branch, 593 genes on the Indian branch and 401 genes on the European branch (Fig. 4a; see Supplementary Results Section 2.6, and Supplementary Data 9) which showed strong signatures of selection (top 1% of CLR values). We did not find evidence supporting the hypothesis that differentially methylated genes were under positive selection (see Supplementary Methods Section 1.7, Supplementary Results Section 2.5, and Supplementary Data 10).

Fig. 4. Branch-specific selection signature scans.

Fig. 4

a Population tree is used for the population-specific selection analyses. Input consists of an “outgroup” population (America) and two “ingroup” populations (India and SE-Asia, or Europe and SE-Asia). We present the number of genes in the ingroup populations belonging to the top 1% CLR scores reported by 3P-CLR. b Venn diagram showing the genes under selection that are common between the populations of SE-Asia, India, and Europe. c Branch-specific genome-wide scan of selection: each dot represents a chromosomal locus picked by 3P-CLR for which a CLR score is calculated; loci on the top 1% CLR are shown in red. The dashed purple lines between European and Indian panels indicate common peaks of selection detected at the same genomic regions in both populations. d Genomic diversity as depicted by π (solid black lines) and Tajima’s D (solid red lines) for two selected loci: the FLK (left) and the CYP78A9 (right). FLK shows a signature of selection (low π and negative Tajima’s D) in the European population, while CYP78A9 shows a signature of selection in both Europe and India. Dashed lines indicate genome-wide averages for π (black) and Tajima’s D (red).

We found that genes under positive selection in the European branch are enriched with reproduction and development-related GO terms (Supplementary Fig. 17). Among these, SpGA2022_013448, in chromosome 9, is an ortholog of FLOWERING LOCUS KH DOMAIN (FLK) that delays flowering by up-regulating FLC family members in A. thaliana85. This gene showed a strong signature of selection in the European branch but not in other branches (Fig. 4c, d). Similarly, SpGA2022_006111, on chromosome 3, is an ortholog of the A. thaliana BIG BROTHER (BB) that negatively regulates floral organ size and is also under selection in Europe86 (Fig. 4c).

In the SE-Asian population, we found that gene SpGA2022_051517, a CHROMOMETHYLASE3 (CMT3) ortholog in A. thaliana that is likely associated with maintaining CHG methylation17, was under positive selection. This is consistent with the higher CHG methylation levels observed in the SE-Asian population when compared to the European and Indian populations (Figs. 3a, b). Within the Indian population, we found that five MADS-box genes have been under selection exclusively along this branch. Given that there are 43 MADS-box genes in the genome, the fact that five of them have been targeted by selection, constitutes a significant enrichment of such genes under selection (P = 0.0075, Fisher’s Exact test). For example, SpGA2022_013078 is an homolog of AGAMOUS-LIKE6 (AGL6), which is involved in flower and meristem identity specification in rice87; SpGA2022_052274, a homolog of APETALA3 (AP3), is involved in the petal and stamen specification in A. thaliana88; and SpGA2022_006905 belongs to the SHORT VEGETATIVE PHASE (SVP-group) which controls the time of flowering and meristem identity89.

We found 77 genes under positive selection (top 1% CLR values) in both the European and Indian populations (Fig. 4b), significantly more genes than expected by chance (P < 2.2e-16, Fisher’s Exact Test). Among these, gene SpGA2022_055195, an ortholog to CYP78A9 of cytochrome P450 monooxygenases in A. thaliana, belongs to a highly conserved gene family CYP78A. Previous studies in A. thaliana and other species found that CYP78A9 plays a critical role in promoting cell proliferation during flower development and further impacts seed size9092. In addition, the RNA-seq data indicates that CYP78A9 is differentially expressed between India and Europe populations (see Supplementary Methods Section 1.8, Supplementary Results Section 2.7, and Supplementary Data 11). Overall, these data consistently suggest that genes involved in reproduction and development were under selection in Indian and European populations, which might have led to reduced sexual reproduction in these two populations.

Discussion

Here, we characterized the genomic and epigenomic diversity, as well as the demographic history of a facultative asexual flowering plant, S. polyrhiza. We found that among populations of S. polyrhiza, demographic history and reproductive system jointly determine the population’s genomic and epigenomic diversity. Analyses on the footprint of selection suggest that natural selection drove the reduced vascular system and increased asexuality in S. polyrhiza.

Theory predicts that asexual reproduction reduces genomic diversity and the efficiency of purifying selection93. Consistent with this prediction, at the species level, we found that S. polyrhiza has very low genomic diversity and reduced purifying selection (seen as an increased πN/πS ratio), when compared to a wide range of spermatophyte plants42. Within species, the SE-Asian population, which has the highest frequency of sexual reproduction based on the estimated recombination rate, has the highest genomic diversity, the lowest πN/πS ratio and the highest heterozygosity (Fig. 2b), supporting the theoretical prediction2,5,1315. The low πN/πS ratio found in the European population, which has the lowest sexual reproduction and genomic diversity, is most likely due to its migration history. The demographic model suggested that the European population derived from the SE-Asian population very recently (Fig. 1d). It is likely that the πN/πS ratio in the European population remained the same as its ancestral population and has not reached an equilibrium level yet.

While there are fewer genome-wide SVs in S. polyrhiza compared to other species94,95, we found these variants and small INDELs are in tendency enriched in stress responses and reproduction, such as MADS-box genes. This indicates that the loss-of-function of genes involved in flower development and sexual reproduction, is under natural selection. The results are consistent with the observation that the number of functional MADS-box genes was dramatically reduced in S. polyrhiza49.

Single-base resolution methylomes of 20 individuals showed that the overall CpG, CHG and CHH methylation levels in S. polyrhiza shoots are very low, consistent with previous studies39,41. The low levels of DNA methylation might be associated with reduced sexual reproduction: while CpG and CHG methylations in plants are important for controlling cross-overs during meiosis96 and are increased during male gametogenesis, CHH methylation is highly accumulated during embryogenesis18,2426. In facultative asexual plants, due to reduced sexual reproduction and meiosis, the selection of genetic mechanisms maintaining or increasing the CpG, CHG and CHH methylation is reduced or absent, which might have led to the reduced CpG, CHG and CHH methylation levels. Consistently, a recent study suggests that S. polyrhiza has lost several genes in the RdDM pathway41. Interestingly, within species, the CHG and CHH methylation profile of the 20 individuals largely correlates with their genetic distance (Supplementary Fig. 13 and 14), indicating a gradual neutral evolution of DNA methylomes in S. polyrhiza. For example, the Indian and European populations, which diverged from SE-Asian populations around 51,000 and 12,000 generations ago, gradually decreased their CHH methylations (Fig. 3a–c).

At the species level, using a genome-wide scan approach, we found a strong signature of natural selection on genes involved in flower and seed development, indicating that the evolution of reproduction, likely, an increased clonal propagation in S. polyrhiza, was driven by natural selection. This is consistent with the pattern that many aquatic organisms reproduce clonally97. In addition, several genes related to vascularity, root development and DNA damage repair were also under strong selection, suggesting the reduced root and vascular development and low mutation rate in S. polyrhiza were likely also driven by natural selection.

Among populations, we found strong positive selection on genes involved in sexual reproduction and development in India and Europe populations, two recently evolved populations that showed reduced genomic recombination. These results are consistent with the hypothesis that natural selection favors clonal reproduction in S. polyrhiza during the recent colonization process, a pattern that was frequently found in many invasive species98,99. However, despite strong selection favoring clonal reproduction, substantial recombination in the S. polyrhiza genome, mostly in the SE-Asian population, remained, reflecting that sexual reproduction is essential to overcome the costs involved in clonal reproduction in the long term.

Taken together, the structure of population genomes and epigenomes of S. polyrhiza suggest that demography and natural selection acting on the reproduction system and organ development can shape genome-wide genomic and epigenomic variations.

Materials and Methods

DNA sample preparation and sequencing

We sequenced 131 genotypes that were primarily collected from Asia and Europe (Supplementary Data 1). These samples were cultivated in N-medium100 until DNA isolation using a CTAB method. Library preparations were carried out following the protocol described in Xu et al.36. All libraries were sequenced either on Illumina HiSeq X Ten or Illumina Hiseq 4000 platforms for paired-end sequencing with a read size of 150 bp. Low-quality reads and adapter sequences were trimmed with AdapterRemoval (v2.033)101. On average, 33.8 million reads per genotype were obtained. The clean reads were aligned to the S. polyrhiza reference genome48,102 using BWA-MEM (https://github.com/lh3/bwa) with default parameters. Reads without alignment hits or with multiple alignment positions were removed. SAMtools “rmdup” function was used to remove PCR duplicates103.

Genetic variant identification and gene family annotation

After filtered out low-quality SNPs using GATK104 (v4.1.4.1, Java 11) with options: “QD < 2.0 | QUAL < 30.0 | SOR > 3.0 | FS > 60.0 | MQ < 40.0 | MQRankSum < -12.5 | ReadPosRankSum < −8.0”, we identified 8,363,387 SNPs. Then, VCFtools (v0.1.13)105 and GATK were used to remove SNPs that have the following features: (1) SNPs from organelle genomes (9,278 SNPs); (2) missing genotypes >20% (85,645 SNPs); (3) mean sequencing depth <8 or >41 (179,920 SNPs); (4) non-biallelic (448,404 SNPs); (5) minor allele frequency (MAF) <1% (6,102,027 SNPs); and finally, (6) located in small SNP clusters (3 SNPs in a ten base-pair window, accounted for 296,132 SNPs). We updated the protein-coding gene annotation of S. polyrhiza based on recently published transcriptomes and Iso-seq data (see Supplementary Methods Section 1.1, Supplementary Results Section 2.1, Supplementary Table 9, and Supplementary Figs. 1820). We used SnpEff (version 5.0c)106 to annotate SNPs and INDELs. To exam whether SNP cluster filtering criterion affects the estimation of genomic diversity and selection, we performed additional analyses based on a more relaxed filtering parameter (≥200 SNPs in 1 Kb region). Although the second SNP cluster filtering criterion resulted in 18.7% more SNPs, which are mostly (>88%) located in TE regions or nearby the SV or INDELs, the patterns of genomic diversity and selection did not change. In addition to SNPs and INDELs, We identified SVs using a joint genotyping pipeline and stringent quality filtration processes (see Supplementary Methods Section 1.2, Supplementary Results Section 2.2, and Supplementary Fig. 21-24).

We estimated genome-wide nucleotide diversity (π) and genome-wide πNS ratios using SNPgenie (v2019.10.31)107. The SNPs overlapping with the structure variations were excluded from the calculation to minimize potential interference caused by misalignments, ensuring a more accurate and reliable analysis.

The genome-wide heterozygosity for each individual was calculated using VCFtools (v0.1.13)105. We estimated the genetic associations between heterozygosity and the SVs of AGL62 and SOC1 using RVTESTS108 with the single variant Wald test.

To study the potential genetic factors related to the variation of sexual reproduction frequency in S. polyrhiza, we annotated the MADS-box gene family (see Supplementary Methods Section 1.3 and Supplementary Results Section 2.3, Supplementary Fig. 25-27, Supplementary Table 10, and Supplementary Data 12). Other gene families that were annotated in Arabidopsis were also identified in S. polyrhiza using an orthology-based method (see Supplementary Methods Section 1.4 and Supplementary Data 4).

Population structure and linkage disequilibrium (LD)

We grouped genetically similar genotypes by defining clonal genotype pairs that have no more than 0.01% different homozygous sites and no more than 2% different heterozygous sites. These thresholds were previously adopted by Ho et al.37.

Prior to the population structure analysis, we removed SNPs that (1) deviated from Hardy-Weinberg Equilibrium (Fisher exact test, P < 0.01) or (2) linked loci (each pair of SNP have correlation coefficient r2 > 0.33 in a sliding window with a size of 50 SNPs and step of 5 SNPs), using VCFtools (v0.1.13)105 and Plink (v1.9)109.

Principal component analysis (PCA) and population structure analysis were carried out using Plink (v1.9)109 and fastStructure (v1.0)110, respectively. The simple mode (as default) from fastStructure was used for the population structure analysis. The K value was estimated using a heuristic function in fastStructure.

For each of the 159 clonal families, we selected the least missingness genotype (i.e. the genotype with the highest sequencing coverage of that clonal family) as the representative genotype. SNP information from all 159 representative genotypes was used to estimate the linkage disequilibrium decay for each of the four populations. PopLDdecay (v3.41)111 was used to measure LD decay. For each population, we used the following filters: SNP of missing allele > 20% and MAF < 0.05. The allele frequency correlation (denoted as r2) of pairwise SNPs within 100 kb physical distance was calculated.

Phylogenetic tree reconstruction

We used BLAST+ version 2.9.0112 to identify orthologous fragments between the genomes of S. polyrhiza and Colocasia esculenta (Araceae). For each SNP from the core set, the reference allele and its flanking 300 bp (upstream 150 bp and downstream 150 bp, respectively) sequences were extracted from the S. polyrhiza genome and then aligned to the C. esculenta reference genome113. The hit thresholds were set as (1) alignment identity >70%; (2) e-value > 1e − 6; (3) minimum aligned sequence length ≥50 (the aligned sequence must cover SNP position); (4) keep the best hit; and (5) ignore short deletions from C. esculenta. The orthologous alleles from C. esculenta were used as the outgroup genotype. We identified only 13,120 SNPs that have orthologous fragments in the C. esculenta genome. Those data were further used to infer the maximum-likelihood (ML) phylogenetic tree using RAxML-ng (v1.0.1)114. The best hit model was estimated to be ‘TVM + G4’ using Modeltest-ng (0.1.6)115,116. The bootstrapping converged after 700 iterations of the ML tree search. ITOL v5117 and the Python package ETE2118 were used for tree visualization.

Selection analysis

Genome-wide scans of selection were performed on all 20 chromosomes of all sampled populations. Selective sweeps were inferred by three programs: RAiSD61, SweeD62 and LASSI63. RAiSD uses the μ statistic, which provides information on the SFS, LD, and genomic diversity to evaluate the presence of positive selection61. SweeD calculates the traditional composite likelihood ratio (CLR) to infer loci under selection62. LASSI employs the T statistic, which uses a likelihood model based on the haplotype frequency spectrum to detect hard and soft sweeps63. As recommended by the authors of LASSI, we selected the top 5% T scores as candidates for selection. For RAiSD and SweeD we selected the top 1% scores. After finding the common genes under selection according to all three programs, we reported the genes that have orthologs in A. thaliana. The embryo lethal genes from A. thaliana119 were used for the enrichment analysis.

To test for population/branch-specific signals of selection, we ran a composite likelihood ratio (CLR) approach as implemented in 3P-CLR120. Briefly, this method uses three-population trees coupled with genomic data as input, from which patterns of linked allele frequency differentiation are calculated. By doing this, this algorithm can tell apart signals of selection that happened in either branch of the tree or in the ancestral lineage, as well as outputting the loci with the highest CLR120. In our case, we used either a North America-Asia-Europe, or a North America-Asia-India population tree as input, and 3P-CLR output the CLR across windows along each chromosome in the S. polyrhiza genome. We then selected the top 1% windows for each branch of the input tree and reported the genes that are present in each window. To further validate the evidence of positive selection on the regions with the highest CLR, we ran scans of Tajima’s D and genomic diversity along the same windows and contrasted them with the same signal along the other population branches. We expect negative Tajima’s D and low genomic diversity values along the populations with high CLR values. For authenticity validation of genes under selection, we used RT-qPCR to check the expression of eight genes (see Supplementary Methods 1.9, Supplementary Results 2.8, Supplementary Fig. 28, and Supplementary Table 11 and 12). Another expanded list that includes 37 candidate genes was also created, and these genes’ expression (RNA-seq) and orthology alignments against their Arabidopsis counterparts were examined (Supplementary Data 7).

DNA methylation in S. polyrhiza

We selected five genotypes from each of the four populations (America, India, SE-Asia, Europe) for single-base whole-genome bisulfite sequencing (WGBS). The genotypes originated from distinct clonal families, except for two European genotypes that came from the same clonal family (Supplementary Table 7).

FastQC (v0.11.5, https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was used to summarize statistics of the sequencing reads. Trimmomatic (v 0.36)121 was used to filter out low-quality reads with the parameters “SLIDINGWINDOW: 4:15, LEADING:3, TRAILING:3, ILLUMINACLIP: adapter.fa: 2: 30: 10, MINLEN:36”. To account for the genetic variations among genotypes, we generated pseudo-reference genome for each genotype by substituting SNP from the S. polyrhiza reference genome using GATK, using a similar strategy to previous studies122,123. Bismark (v 0.16.3)124 was used to align bisulfite-treated reads to pseudo-reference genomes. Identical reads aligned to the same genomic regions were deemed as duplicated reads and thus were removed. Cytosines covered by less than five sequencing reads were excluded from the study. Only after applying these filters the sequencing depth and coverage were then summarized. The sodium bisulfite non-conversion rate was calculated as the percentage of non-converted cytosines to all cytosines in the reads that mapped to the chloroplast genome125 (GenBank: JN160603.2). For each cytosine site, a binomial test was performed to determine if the cytosine was methylated. If the methylation frequency at the site was lower than the background, which was estimated as the non-conversion rate, then the site was considered unmethylated, and the reads supporting methylation at this site were excluded126.

We calculated two different methylation parameters: the proportion of methylated cytosines (mC methylation) and weighted methylation level (wML)126. For both parameters, only cytosines covered by more than four sequencing reads were involved in the calculation. Those cytosines with low reads supporting methylation but not passing the binomial test were considered as un-methylated cytosines. The mC proportion was calculated by dividing the number of methylated cytosines by the total number of cytosines. Genomic regional wML was calculated using the methylKit (v1.17.5)127 and the regioneR (v1.28.0)128, with input based on the cytosine report generated with the Bismark pipeline. Line plots that show the wML patterns across the gene body and transposable elements, as well as their 2 kb flanking regions, were generated using ViewBS (v0.1.11)129. The hierarchical clustering, based on the methylation profiles’ similarity, was done using methylKit. The comparison between the genetic phylogenetic tree and hierarchical clustering based on the methylome was made using the R packages ggtree (v3.4.4)130, treeio (v1.20.2)131, ape (v5.6.2)132, and phytools (v1.2.0)133.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Supplementary information

Peer Review File (4.2MB, pdf)
42003_2024_6266_MOESM3_ESM.pdf (148.7KB, pdf)

Description of Additional Supplementary Files

Supplementary Data 1-12 (13.4MB, zip)
Reporting Summary (2.1MB, pdf)

Acknowledgements

We thank Martin Schäfer, Marie Sárazová, and Laura Böttner for supporting plant sample maintenance and DNA isolations. We thank Arturo Mari-Ordonez and Pavlos Pavlidis for valuable comments and Alex Widmer for contributing resources at the early stage of this project. This project is supported by the German Research Foundation (427577435 and 438887884 to S. X, and 422213951 to M. Hu), the Center for Adaptation to a Changing Environment (ACE) at ETH Zurich (to S. X.), the Swiss National Science Foundation (P400PB_186770 to M. Hu.), the Volkswagen Foundation (97236 to M. Hu.) and through career development measures of the University of Münster (to M. Hu.) The project was inspired by discussions with the members of the CRC TRR 212 (NC3) – Project number 316099922, and Research Training Group 2526 (GenEvo) – Project number 407023052. Parts of this research were conducted using the supercomputer Mogon and/or advisory services offered by the Johannes Gutenberg University Mainz (hpc.uni-mainz.de), which is a member of the AHRP (Alliance for High-Performance Computing in Rhineland Palatinate, www.ahrp.info) and the Gauss Alliance e.V. The authors gratefully acknowledge the computing time granted on the supercomputer Mogon at the Johannes Gutenberg University Mainz (hpc.uni-mainz.de) and PALMA-II at the University of Münster.

Author contributions

Y. W., P. D. and S. X. performed data analysis. A. C. and M. Ho. performed the experiments. K. J. A., H. Z., K. S. S., and S.X. contributed to the giant duckweed collections and resources. S. X. and M. Hu. conceived and supervised the project. S. X., Y. W., P. D., A. C. and M. Hu. wrote the manuscript. All authors contributed to the final version of the manuscript.

Peer review

Peer review information

Communications Biology thanks Yang Jae Kang, Kent Holsinger and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: George Inglis. A peer review file is available.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Data availability

The raw genomic and bisulfite sequencing reads involved in this study can be retrieved from NCBI under accession numbers Bioproject PRJNA701543 and Bioproject PRJNA934173. The scripts for the data analyses are deposited in https://github.com/Xu-lab-Evolution/Great_duckweed_popg. The authors declare that the data and corresponding computational codes supporting the conclusions of this study are available within the article and its supplementary information file.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Yangzi Wang, Pablo Duchen.

Supplementary information

The online version contains supplementary material available at 10.1038/s42003-024-06266-7.

References

  • 1.Kondrashov AS. Deleterious mutations and the evolution of sexual reproduction. Nature. 1988;336:435–440. doi: 10.1038/336435a0. [DOI] [PubMed] [Google Scholar]
  • 2.Muller HJ. The relation of recombination to mutational advance. Mutat. Res. 1964;1:2–9. doi: 10.1016/0027-5107(64)90047-8. [DOI] [PubMed] [Google Scholar]
  • 3.Case TJ, Taper ML. On the coexistence and coevolution of asexual and sexual competitors. Evolution. 1986;40:366–387. doi: 10.2307/2408816. [DOI] [PubMed] [Google Scholar]
  • 4.Doncaster CP, Pound GE, Cox SJ. The ecological cost of sex. Nature. 2000;404:281–285. doi: 10.1038/35005078. [DOI] [PubMed] [Google Scholar]
  • 5.Hartfield M. Evolutionary genetic consequences of facultative sex and outcrossing. J. Evol. Biol. 2016;29:5–22. doi: 10.1111/jeb.12770. [DOI] [PubMed] [Google Scholar]
  • 6.Green RF, Noakes DLG. Is a little bit of sex as good as a lot. J. Theor. Biol. 1995;174:87–96. doi: 10.1006/jtbi.1995.0081. [DOI] [Google Scholar]
  • 7.Lynch M, Gabriel W. Phenotypic evolution and parthenogenesis. Am. Nat. 1983;122:745–764. doi: 10.1086/284169. [DOI] [Google Scholar]
  • 8.Simon JC, Rispe C, Sunnucks P. Ecology and evolution of sex in aphids. Trends Ecol. Evol. 2002;17:34–39. doi: 10.1016/S0169-5347(01)02331-X. [DOI] [Google Scholar]
  • 9.Hebert PDN. Population biology of Daphnia (Crustacea, Daphnidae) Biol. Rev. 1978;53:387–426. doi: 10.1111/j.1469-185X.1978.tb00860.x. [DOI] [Google Scholar]
  • 10.Wallace RL. Rotifers: Exquisite metazoans. Integr. Comp. Biol. 2002;42:660–667. doi: 10.1093/icb/42.3.660. [DOI] [PubMed] [Google Scholar]
  • 11.Klimeš, L., Klimešová, J., Hendriks, R. & van Groenendael, J. in The Ecology and Evolution of Clonal Plants (eds H. de Kroon & J. van Groenendael) 1–29 (Backhuys Publishers, 1997).
  • 12.de Meeus T, Prugnolle F, Agnew P. Asexual reproduction: genetics and evolutionary aspects. Cell Mol. Life Sci. 2007;64:1355–1372. doi: 10.1007/s00018-007-6515-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Keightley PD, Otto SP. Interference among deleterious mutations favours sex and recombination in finite populations. Nature. 2006;443:89–92. doi: 10.1038/nature05049. [DOI] [PubMed] [Google Scholar]
  • 14.Jaron KS, et al. Convergent consequences of parthenogenesis on stick insect genomes. Sci. Adv. 2022;8:eabg3842. doi: 10.1126/sciadv.abg3842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Tucker AE, Ackerman MS, Eads BD, Xu S, Lynch M. Population-genomic insights into the evolutionary origin and fate of obligately asexual Daphnia pulex. Proc. Natl Acad. Sci. USA. 2013;110:15740–15745. doi: 10.1073/pnas.1313388110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Niederhuth CE, Schmitz RJ. Covering your bases: inheritance of DNA methylation in plant genomes. Mol. Plant. 2014;7:472–480. doi: 10.1093/mp/sst165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Matzke MA, Mosher RA. RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat. Rev. Genet. 2014;15:394–408. doi: 10.1038/nrg3683. [DOI] [PubMed] [Google Scholar]
  • 18.Gehring M. Epigenetic dynamics during flowering plant reproduction: evidence for reprogramming? N. Phytol. 2019;224:91–96. doi: 10.1111/nph.15856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.She W, et al. Chromatin reprogramming during the somatic-to-reproductive cell fate transition in plants. Development. 2013;140:4008–4019. doi: 10.1242/dev.095034. [DOI] [PubMed] [Google Scholar]
  • 20.She, W. J. & Baroux, C. Chromatin dynamics in pollen mother cells underpin a common scenario at the somatic-to-reproductive fate transition of both the male and female lineages in Arabidopsis. Front. Plant Sci. 6, 294 (2015). [DOI] [PMC free article] [PubMed]
  • 21.Slotkin RK, et al. Epigenetic reprogramming and small RNA silencing of transposable elements in pollen. Cell. 2009;136:461–472. doi: 10.1016/j.cell.2008.12.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Calarco JP, et al. Reprogramming of DNA methylation in pollen guides epigenetic inheritance via small RNA. Cell. 2012;151:194–205. doi: 10.1016/j.cell.2012.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ingouff M, et al. Live-cell analysis of DNA methylation during sexual reproduction in Arabidopsis reveals context and sex-specific dynamics controlled by noncanonical RdDM. Genes Dev. 2017;31:72–83. doi: 10.1101/gad.289397.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Bouyer D, et al. DNA methylation dynamics during early plant life. Genome Biol. 2017;18:179. doi: 10.1186/s13059-017-1313-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kawakatsu T, Nery JR, Castanon R, Ecker JR. Dynamic DNA methylation reconfiguration during seed development and germination. Genome. Biol. 2017;18:171. doi: 10.1186/s13059-017-1251-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Narsai R, et al. Extensive transcriptomic and epigenomic remodelling occurs during Arabidopsis thaliana germination. Genome. Biol. 2017;18:172. doi: 10.1186/s13059-017-1302-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Verhoeven KJF, Jansen JJ, van Dijk PJ, Biere A. Stress-induced DNA methylation changes and their heritability in asexual dandelions. N. Phytol. 2010;185:1108–1118. doi: 10.1111/j.1469-8137.2009.03121.x. [DOI] [PubMed] [Google Scholar]
  • 28.Verhoeven KJ, Preite V. Epigenetic variation in asexually reproducing organisms. Evolution. 2014;68:644–655. doi: 10.1111/evo.12320. [DOI] [PubMed] [Google Scholar]
  • 29.Van Antro M, et al. DNA methylation in clonal duckweed (Lemna minor L.) lineages reflects current and historical environmental exposures. Mol. Ecol. 2023;32:428–443. doi: 10.1111/mec.16757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Niederhuth, C. E. et al. Widespread natural variation of DNA methylation within angiosperms. Genome Biol.17, 194 (2016). [DOI] [PMC free article] [PubMed]
  • 31.Landolt, E., Jäger-Zürn, I. & Schnell, R. Extreme Adaptations in Angiospermous Hydrophytes, 290 (Gebrüder Borntraeger, 1998).
  • 32.Bog, M., Appenroth, K. J. & Sree, K. S. Key to the determination of taxa of lemnaceae: an update. Nordic. J. Botany38, e02658 (2020).
  • 33.Kim I. Structural differentiation of the connective stalk in Spirodela polyrhiza (L.) schleiden. Appl. Microsc. 2016;46:83–88. doi: 10.9729/AM.2016.46.2.83. [DOI] [Google Scholar]
  • 34.Hicks LE. Flower production in the lemnaceae. Ohio J. Sci. 1932;32:115–132. [Google Scholar]
  • 35.Fourounjian P, Slovin J, Messing J. Flowering and seed production across the lemnaceae. Int J. Mol. Sci. 2021;22:2733. doi: 10.3390/ijms22052733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Xu S, et al. Low genetic variation is associated with low mutation rate in the giant duckweed. Nat. Commun. 2019;10:1243. doi: 10.1038/s41467-019-09235-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ho EKH, Bartkowska M, Wright SI, Agrawal AF. Population genomics of the facultatively asexual duckweed Spirodela polyrhiza. N. Phytol. 2019;224:1361–1371. doi: 10.1111/nph.16056. [DOI] [PubMed] [Google Scholar]
  • 38.Sandler G, Bartkowska M, Agrawal AF, Wright SI. Estimation of the SNP mutation rate in two vegetatively propagating species of duckweed. G3-Genes Genom. Genet. 2020;10:4191–4200. doi: 10.1534/g3.120.401704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Michael TP, et al. Comprehensive definition of genome features in Spirodela polyrhiza by high-depth physical mapping and short-read DNA sequencing strategies. Plant J. 2017;89:617–635. doi: 10.1111/tpj.13400. [DOI] [PubMed] [Google Scholar]
  • 40.Bog M, et al. Strategies for intraspecific genotyping of duckweed: comparison of five orthogonal methods applied to the giant duckweed Spirodela polyrhiza. Plants (Basel) 2022;11:3033. doi: 10.3390/plants11223033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Harkess A, et al. The unusual predominance of maintenance DNA methylation in spirodela polyrhiza. G3 Genes Genomes Genet. 2024;14:jkae004. doi: 10.1093/g3journal/jkae004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Chen J, Glemin S, Lascoux M. Genetic diversity and the efficacy of purifying selection across plant and animal species. Mol. Biol. Evol. 2017;34:1417–1428. doi: 10.1093/molbev/msx088. [DOI] [PubMed] [Google Scholar]
  • 43.McDowell JM, et al. Intragenic recombination and diversifying selection contribute to the evolution of downy mildew resistance at the RPP8 locus of Arabidopsis. Plant Cell. 1998;10:1861–1874. doi: 10.1105/tpc.10.11.1861. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Xu ZW, et al. Functional genomic analysis of glycoside hydrolase family 1. Plant Mol. Biol. 2004;55:343–367. doi: 10.1007/s11103-004-0790-1. [DOI] [PubMed] [Google Scholar]
  • 45.Pinosio S, et al. Characterization of the poplar pan-genome by genome-wide identification of structural variation. Mol. Biol. Evol. 2016;33:2706–2719. doi: 10.1093/molbev/msw161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Zmienko A, et al. Athcnv: A map of DNA copy number variations in the Arabidopsis genome. Plant Cell. 2020;32:1797–1819. doi: 10.1105/tpc.19.00640. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Cui Y, Lu X, Gou X. Receptor-like protein kinases in plant reproduction: current understanding and future perspectives. Plant Commun. 2022;3:100273. doi: 10.1016/j.xplc.2021.100273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Wang, W. et al. The Spirodela polyrhiza genome reveals insights into its neotenous reduction fast growth and aquatic lifestyle. Nat. Commun.5, 3311 (2014). [DOI] [PMC free article] [PubMed]
  • 49.Gramzow L., Theissen G. Stranger than fiction: Loss of MADS-box genes during evolutionary miniaturization of the duckweed body plan. Loss of MADS-box genes in duckweeds. In: The Duckweed Genomes, Compendium of Plant Genomes. (eds. Cao X.H., Fourounjian, P. & Wang, W.) (Springer Nature; Cham, Switzerland, 2020).
  • 50.Yoshida A, et al. Characterization of frond and flower development and identification of ft and fd genes from duckweed Lemna aequinoctialis Nd. Front. Plant Sci. 2021;12:697206. doi: 10.3389/fpls.2021.697206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Cao J, et al. Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat. Genet. 2011;43:956–963. doi: 10.1038/ng.911. [DOI] [PubMed] [Google Scholar]
  • 52.Kang IH, Steffen JG, Portereiko MF, Lloyd A, Drews GN. The AGL62 MADS domain protein regulates cellularization during endosperm development in Arabidopsis. Plant Cell. 2008;20:635–647. doi: 10.1105/tpc.107.055137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hoffmann T, et al. The identification of type I MADS box genes as the upstream activators of an endosperm-specific invertase inhibitor in Arabidopsis. BMC Plant Biol. 2022;22:18. doi: 10.1186/s12870-021-03399-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Lee J, Lee I. Regulation and function of SOC1, a flowering pathway integrator. J. Exp. Bot. 2010;61:2247–2254. doi: 10.1093/jxb/erq098. [DOI] [PubMed] [Google Scholar]
  • 55.Norton GJ, et al. Genome wide association mapping of grain and straw biomass traits in the rice Bengal and Assam Aus panel (baap) grown under alternate wetting and drying and permanently flooded irrigation. Front. Plant Sci. 2018;9:1223. doi: 10.3389/fpls.2018.01223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Ryu CH, et al. OsMADS50 and OsMADS56 function antagonistically in regulating long day (LD)-dependent flowering in rice. Plant Cell Environ. 2009;32:1412–1427. doi: 10.1111/j.1365-3040.2009.02008.x. [DOI] [PubMed] [Google Scholar]
  • 57.Lee S, Kim J, Han JJ, Han MJ, An G. Functional analyses of the flowering time gene OsMADS50, the putative suppressor of overexpression of CO 1/AGAMOUS-LIKE 20 (SOC1/AGL20) ortholog in rice. Plant J. 2004;38:754–764. doi: 10.1111/j.1365-313X.2004.02082.x. [DOI] [PubMed] [Google Scholar]
  • 58.Lee S, An G. Diversified mechanisms for regulating flowering time in a short-day plant rice. J. Plant Biol. 2007;50:241–248. doi: 10.1007/BF03030651. [DOI] [Google Scholar]
  • 59.Cokus SJ, et al. Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature. 2008;452:215–219. doi: 10.1038/nature06745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Ibanez VN, Quadrana L. Shaping inheritance: how distinct reproductive strategies influence DNA methylation memory in plants. Curr. Opin. Genet Dev. 2023;78:102018. doi: 10.1016/j.gde.2022.102018. [DOI] [PubMed] [Google Scholar]
  • 61.Alachiotis N, Pavlidis P. RAiSD detects positive selection based on multiple signatures of a selective sweep and SNP vectors. Commun. Biol. 2018;1:79. doi: 10.1038/s42003-018-0085-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Pavlidis P, Zivkovic D, Stamatakis A, Alachiotis N. SweeD: likelihood-based detection of selective sweeps in thousands of genomes. Mol. Biol. Evol. 2013;30:2224–2234. doi: 10.1093/molbev/mst112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Harris AM, DeGiorgio M. A likelihood approach for uncovering selective sweep signatures from haplotype data. Mol. Biol. Evol. 2020;37:3023–3046. doi: 10.1093/molbev/msaa115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Demko V, Ako E, Perroud PF, Quatrano R, Olsen OA. The phenotype of the CRINKLY4 deletion mutant of Physcomitrella patens suggests a broad role in developmental regulation in early land plants. Planta. 2016;244:275–284. doi: 10.1007/s00425-016-2526-2. [DOI] [PubMed] [Google Scholar]
  • 65.Braud C, Zheng W, Xiao W. Identification and analysis of LNO1-like and AtGLE1-like nucleoporins in plants. Plant Signal Behav. 2013;8:e27376. doi: 10.4161/psb.27376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Zhao H, Xing D, Li QQ. Unique features of plant cleavage and polyadenylation specificity factor revealed by proteomic studies. Plant Physiol. 2009;151:1546–1556. doi: 10.1104/pp.109.142729. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Takatsuka H, Umeda-Hara C, Umeda M. Cyclin-dependent kinase-activating kinases CDKD;1 and CDKD;3 are essential for preserving mitotic activity in Arabidopsis thaliana. Plant J. 2015;82:1004–1017. doi: 10.1111/tpj.12872. [DOI] [PubMed] [Google Scholar]
  • 68.Johnson KL, Kibble NA, Bacic A, Schultz CJ. A fasciclin-like arabinogalactan-protein (FLA) mutant of Arabidopsis thaliana, fla1, shows defects in shoot regeneration. PLoS One. 2011;6:e25154. doi: 10.1371/journal.pone.0025154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Zhu M, et al. Robust organ size requires robust timing of initiation orchestrated by focused auxin and cytokinin signalling. Nat. Plants. 2020;6:686–698. doi: 10.1038/s41477-020-0666-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Zhao H, et al. The Arabidopsis thaliana nuclear factor Y transcription factors. Front. Plant Sci. 2016;7:2045. doi: 10.3389/fpls.2016.02045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Chantha SC, Gray-Mitsumune M, Houde J, Matton DP. The MIDASIN and NOTCHLESS genes are essential for female gametophyte development in Arabidopsis thaliana. Physiol. Mol. Biol. Plants. 2010;16:3–18. doi: 10.1007/s12298-010-0005-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Chen, X. et al. Full-length EFOP3 and EFOP4 proteins are essential for pollen intine development in Arabidopsis thaliana. Plant J. 115, 37–51 (2023). [DOI] [PubMed]
  • 73.Zhou Y, et al. Members of the ELMOD protein family specify formation of distinct aperture domains on the Arabidopsis pollen surface. eLife. 2021;10:e71061. doi: 10.7554/eLife.71061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Jossier M, et al. The Arabidopsis vacuolar anion transporter, AtCLCc, is involved in the regulation of stomatal movements and contributes to salt tolerance. Plant J. 2010;64:563–576. doi: 10.1111/j.1365-313X.2010.04352.x. [DOI] [PubMed] [Google Scholar]
  • 75.Gachomo EW, Jimenez-Lopez JC, Baptiste LJ, Kotchoni SO. GIGANTUS1 (GTS1), a member of Transducin/WD40 protein superfamily, controls seed germination, growth and biomass accumulation through ribosome-biogenesis protein interactions in Arabidopsis thaliana. BMC Plant Biol. 2014;14:37. doi: 10.1186/1471-2229-14-37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Skalitzky CA, et al. Plastids contain a second sec translocase system with essential functions. Plant Physiol. 2011;155:354–369. doi: 10.1104/pp.110.166546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Jeon Y, Ahn HK, Kang YW, Pai HS. Functional characterization of chloroplast-targeted RbgA GTPase in higher plants. Plant Mol. Biol. 2017;95:463–479. doi: 10.1007/s11103-017-0664-y. [DOI] [PubMed] [Google Scholar]
  • 78.McConnell JR, et al. Role of PHABULOSA and PHAVOLUTA in determining radial patterning in shoots. Nature. 2001;411:709–713. doi: 10.1038/35079635. [DOI] [PubMed] [Google Scholar]
  • 79.Schwenk P, et al. Uncovering a novel function of the CCR4-NOT complex in phytochrome A-mediated light signalling in plants. eLife. 2021;10:e63697. doi: 10.7554/eLife.63697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Farkas I, Dombradi V, Miskei M, Szabados L, Koncz C. Arabidopsis PPP family of serine/threonine phosphatases. Trends Plant Sci. 2007;12:169–176. doi: 10.1016/j.tplants.2007.03.003. [DOI] [PubMed] [Google Scholar]
  • 81.Guo ZF, Wang XY, Hu ZB, Wu CY, Shen ZG. The pentatricopeptide repeat protein GEND1 is required for root development and high temperature tolerance in Arabidopsis thaliana. Biochem. Biophys. Res. Commun. 2021;578:63–69. doi: 10.1016/j.bbrc.2021.09.022. [DOI] [PubMed] [Google Scholar]
  • 82.Mochizuki S, et al. The Arabidopsis WAVY GROWTH 2 protein modulates root bending in response to environmental stimuli. Plant Cell. 2005;17:537–547. doi: 10.1105/tpc.104.028530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Liu CH, et al. Repair of dna damage induced by the cytidine analog zebularine requires atr and atm in Arabidopsis. Plant Cell. 2015;27:1788–1800. doi: 10.1105/tpc.114.135467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Bleuyard JY, White CI. The Arabidopsis homologue of Xrcc3 plays an essential role in meiosis. EMBO J. 2004;23:439–449. doi: 10.1038/sj.emboj.7600055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Lim MH, et al. A new Arabidopsis gene, FLK, encodes an RNA binding protein with K homology motifs and regulates flowering time via FLOWERING LOCUS C. Plant Cell. 2004;16:731–740. doi: 10.1105/tpc.019331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Disch S, et al. The E3 ubiquitin ligase BIG BROTHER controls Arabidopsis organ size in a dosage-dependent manner. Curr. Biol. 2006;16:272–279. doi: 10.1016/j.cub.2005.12.026. [DOI] [PubMed] [Google Scholar]
  • 87.Li HF, et al. The AGL6-like gene OsMADS6 regulates floral organ and meristem identities in rice. Cell Res. 2010;20:299–313. doi: 10.1038/cr.2009.143. [DOI] [PubMed] [Google Scholar]
  • 88.Krizek BA, Meyerowitz EM. The Arabidopsis homeotic genes APETALA3 and PISTILLATA are sufficient to provide the B class organ identity function. Development. 1996;122:11–22. doi: 10.1242/dev.122.1.11. [DOI] [PubMed] [Google Scholar]
  • 89.Lee S, Choi SC, An G. Rice SVP-group MADS-box proteins, OsMADS22 and OsMADS55, are negative regulators of brassinosteroid responses. Plant J. 2008;54:93–105. doi: 10.1111/j.1365-313X.2008.03406.x. [DOI] [PubMed] [Google Scholar]
  • 90.Fang WJ, Wang ZB, Cui RF, Li J, Li YH. Maternal control of seed size by EOD3/CYP78A6 in Arabidopsis thaliana. Plant J. 2012;70:929–939. doi: 10.1111/j.1365-313X.2012.04907.x. [DOI] [PubMed] [Google Scholar]
  • 91.Sotelo-Silveira M, et al. Cytochrome P450 CYP78A9 is involved in Arabidopsis reproductive development. Plant Physiol. 2013;162:779–799. doi: 10.1104/pp.113.218214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Qi XL, Liu CL, Song LL, Li YH, Li M. Pacyp78a9, a cytochrome P450, regulates fruit size in sweet cherry (Prunus avium L.) Front Plant Sci. 2017;8:2076. doi: 10.3389/fpls.2017.02076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Ellegren H, Galtier N. Determinants of genetic diversity. Nat. Rev. Genet. 2016;17:422–433. doi: 10.1038/nrg.2016.58. [DOI] [PubMed] [Google Scholar]
  • 94.Zhou YF, et al. The population genetics of structural variants in grapevine domestication. Nat. Plants. 2019;5:965–979. doi: 10.1038/s41477-019-0507-8. [DOI] [PubMed] [Google Scholar]
  • 95.Guan J, et al. Genome structure variation analyses of peach reveal population dynamics and a 1.67 Mb causal inversion for fruit shape. Genome Biol. 2021;22:13. doi: 10.1186/s13059-020-02239-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Underwood CJ, et al. Epigenetic activation of meiotic recombination near Arabidopsis thaliana centromeres via loss of H3K9me2 and non-CG DNA methylation. Genome Res. 2018;28:519–531. doi: 10.1101/gr.227116.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Santamaria L. Why are most aquatic plants widely distributed? dispersal, clonal growth and small-scale heterogeneity in a stressful environment. Acta Oecol. 2002;23:137–154. doi: 10.1016/S1146-609X(02)01146-3. [DOI] [Google Scholar]
  • 98.Wang YJ, et al. Invasive alien plants benefit more from clonal integration in heterogeneous environments than natives. N. Phytol. 2017;216:1072–1078. doi: 10.1111/nph.14820. [DOI] [PubMed] [Google Scholar]
  • 99.Gutekunst J, et al. Clonal genome evolution and rapid invasive spread of the marbled crayfish. Nat. Ecol. Evol. 2018;2:567–573. doi: 10.1038/s41559-018-0467-9. [DOI] [PubMed] [Google Scholar]
  • 100.Appenroth, K.J.; et al. Photophysiology of turion formation and germination in Spirodela polyrhiza. Biol. Plant.38, 95–106 (1996)
  • 101.Schubert M, Lindgreen S, Orlando L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res. Notes. 2016;9:88. doi: 10.1186/s13104-016-1900-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Cao HX, et al. The map-based genome sequence of Spirodela polyrhiza aligned with its chromosomes, a reference for karyotype evolution. N. Phytol. 2016;209:354–363. doi: 10.1111/nph.13592. [DOI] [PubMed] [Google Scholar]
  • 103.Li H, et al. The sequence alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.McKenna A, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Danecek P, et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–2158. doi: 10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Cingolani P, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Nelson CW, Moncla LH, Hughes AL. SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data. Bioinformatics. 2015;31:3709–3711. doi: 10.1093/bioinformatics/btv449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Zhan X, Hu Y, Li B, Abecasis GR, Liu DJ. RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data. Bioinformatics. 2016;32:1423–1426. doi: 10.1093/bioinformatics/btw079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Purcell S, et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Raj A, Stephens M, Pritchard JK. fastStructure: variational inference of population structure in large SNP data sets. Genetics. 2014;197:573–U207. doi: 10.1534/genetics.114.164350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35:1786–1788. doi: 10.1093/bioinformatics/bty875. [DOI] [PubMed] [Google Scholar]
  • 112.Camacho, C. et al. BLAST +: architecture and applications. BMC Bioinform.10, 421 (2009). [DOI] [PMC free article] [PubMed]
  • 113.Yin JM, et al. A high-quality genome of taro (Colocasia esculenta(L.) Schott), one of the world’s oldest crops. Mol. Ecol. Resour. 2021;21:68–77. doi: 10.1111/1755-0998.13239. [DOI] [PubMed] [Google Scholar]
  • 114.Kozlov AM, Darriba D, Flouri T, Morel B, Stamatakis A. RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics. 2019;35:4453–4455. doi: 10.1093/bioinformatics/btz305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Flouri T, et al. The phylogenetic likelihood library. Syst. Biol. 2015;64:356–362. doi: 10.1093/sysbio/syu084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Darriba D, et al. ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol. Biol. Evol. 2020;37:291–294. doi: 10.1093/molbev/msz189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Letunic I, Bork P. Interactive tree Of life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47:W256–W259. doi: 10.1093/nar/gkz239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Huerta-Cepas J, Dopazo J, Gabaldon T. ETE: a python environment for tree exploration. BMC Bioinforma. 2010;11:24. doi: 10.1186/1471-2105-11-24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Meinke DW. Genome-wide identification of EMBRYO-DEFECTIVE (EMB) genes required for growth and development in Arabidopsis. N. Phytol. 2020;226:306–325. doi: 10.1111/nph.16071. [DOI] [PubMed] [Google Scholar]
  • 120.Racimo F. Testing for ancient selection using cross-population allele frequency differentiation. Genetics. 2016;202:733–750. doi: 10.1534/genetics.115.178095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Schmitz RJ, et al. Patterns of population epigenomic diversity. Nature. 2013;495:193–198. doi: 10.1038/nature11968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Kawakatsu T, et al. Epigenomic diversity in a global collection of arabidopsis thaliana accessions. Cell. 2016;166:492–505. doi: 10.1016/j.cell.2016.06.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Krueger F, Andrews SR. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics. 2011;27:1571–1572. doi: 10.1093/bioinformatics/btr167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Wang WQ, Messing J. High-throughput sequencing of three lemnoideae (duckweeds) chloroplast genomes from total DNA. PLoS One. 2011;6:e24670. doi: 10.1371/journal.pone.0024670. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Schultz MD, Schmitz RJ, Ecker JR. Leveling’ the playing field for analyses of single-base resolution DNA methylomes. Trends Genet. 2012;28:583–585. doi: 10.1016/j.tig.2012.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Akalin, A. et al. methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles. Genome. Biol.13, R87 (2012). [DOI] [PMC free article] [PubMed]
  • 128.Gel B, et al. regioneR: an R/Bioconductor package for the association analysis of genomic regions based on permutation tests. Bioinformatics. 2016;32:289–291. doi: 10.1093/bioinformatics/btv562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Huang XS, Zhang SL, Li KQ, Thimmapuram J, Xie SJ. ViewBS: a powerful toolkit for visualization of high-throughput bisulfite sequencing data. Bioinformatics. 2018;34:708–709. doi: 10.1093/bioinformatics/btx633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Yu GC, Lam TTY, Zhu HC, Guan Y. Two methods for mapping and visualizing associated data on phylogeny using ggtree. Mol. Biol. Evol. 2018;35:3041–3043. doi: 10.1093/molbev/msy194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Wang LG, et al. Treeio: An R package for phylogenetic tree input and output with richly annotated and associated data. Mol. Biol. Evol. 2020;37:599–603. doi: 10.1093/molbev/msz240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Paradis E, Claude J, Strimmer K. APE: Analyses of phylogenetics and evolution in R language. Bioinformatics. 2004;20:289–290. doi: 10.1093/bioinformatics/btg412. [DOI] [PubMed] [Google Scholar]
  • 133.Revell LJ. phytools: an R package for phylogenetic comparative biology (and other things) Methods Ecol. Evol. 2012;3:217–223. doi: 10.1111/j.2041-210X.2011.00169.x. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Peer Review File (4.2MB, pdf)
42003_2024_6266_MOESM3_ESM.pdf (148.7KB, pdf)

Description of Additional Supplementary Files

Supplementary Data 1-12 (13.4MB, zip)
Reporting Summary (2.1MB, pdf)

Data Availability Statement

The raw genomic and bisulfite sequencing reads involved in this study can be retrieved from NCBI under accession numbers Bioproject PRJNA701543 and Bioproject PRJNA934173. The scripts for the data analyses are deposited in https://github.com/Xu-lab-Evolution/Great_duckweed_popg. The authors declare that the data and corresponding computational codes supporting the conclusions of this study are available within the article and its supplementary information file.


Articles from Communications Biology are provided here courtesy of Nature Publishing Group

RESOURCES