Abstract
To provide insights into the fate of transposable elements (TEs) across timescales in a post-polyploidization context, we comparatively investigate five sibling Dactylorhiza allotetraploids (Orchidaceae) formed independently and sequentially between 500 and 100K generations ago by unidirectional hybridization between diploids D. fuchsii and D. incarnata. Our results first reveal that the paternal D. incarnata genome shows a marked increased content of LTR retrotransposons compared to the maternal species, reflected in its larger genome size and consistent with a previously hypothesized bottleneck. With regard to the allopolyploids, in the youngest D. purpurella both genome size and TE composition appear to be largely additive with respect to parents, whereas for polyploids of intermediate ages we uncover rampant genome expansion on a magnitude of multiple entire genomes of some plants such as Arabidopsis. The oldest allopolyploids in the series are not larger than the intermediate ones. A putative tandem repeat, potentially derived from a non-autonomous miniature inverted-repeat TE (MITE) drives much of the genome dynamics in the allopolyploids. The highly dynamic MITE-like element is found in higher proportions in the maternal diploid, D. fuchsii, but is observed to increase in copy number in both subgenomes of the allopolyploids. Altogether, the fate of repeats appears strongly regulated and therefore predictable across multiple independent allopolyploidization events in this system. Apart from the MITE-like element, we consistently document a mild genomic shock following the allopolyploidizations investigated here, which may be linked to their relatively large genome sizes, possibly associated with strong selection against further genome expansions.
Keywords: allopolyploidy, genomic shock, genome size, marsh orchids, transposable elements
Introduction
An important evolutionary insight of recent years was the realization that whole-genome duplication (WGD) has contributed significantly to the angiosperm dominance and diversity (Soltis and Soltis 2016). Moreover, WGDs profoundly shaped the structure and function of most modern eukaryotic genomes, including many crops (Dehal and Boore 2005; Jiao et al. 2011; Van de Peer et al. 2017; Li et al. 2018). Despite their omnipresence in evolution, neopolyploids will often fail to establish (Mayrose et al. 2011) due, among other effects, to negative aspects regularly associated with their origins, including expression redundancies, regulatory incompatibilities, and epigenetic imbalances that can activate transposable elements (TEs) (Ramsey and Schemske 2002; Chen 2007; Paun et al. 2007; Doyle et al. 2008; Parisod et al. 2010). Such challenges have been together referred to as a “genomic shock” (McClintock 1984), which is likely to be especially profound in early-generation allopolyploids that combine two divergent parental genomes. Allopolyploidization is therefore expected to initiate a plethora of stochastic, cascading molecular responses with immediate implications for the adaptive success of neoallopolyploids.
In the long term, WGD is followed by diploidization, a repeating process that results in polyploidization–diploidization cycles (Van de Peer et al. 2009; Jiao et al. 2011; Wendel 2015; Chase et al. 2022). When distinct genomes are combined together, one of them can dominate the other during the diploidization process to ultimately result in “biased fractionation” (Sankoff et al. 2012). This model explains allopolyploid genome evolution starting from epigenetic differences linked to an imbalance between parental insertion load of TEs: the smaller subgenome with a lower TE load becoming dominant in the resulting polyploid and therefore leading to different levels of heterochromatinization between homoeologs (Garsmeur et al. 2014; Woodhouse et al. 2014). These differences trigger unequal expression patterns between parental subgenomes, eventually resulting in preferential degradation of one of the homoeologs (Cheng et al. 2016; Edger et al. 2017; Wendel et al. 2018; Gaebelein et al. 2019; Mhiri et al. 2019; Eriksson et al. 2020). Such a process has been reported to be initiated for example in the neoallopolyploid Mimulus peregrinus, strongly associated with subgenome-wide methylation levels and homeolog expression bias (Edger et al. 2017).
On the other hand, the “nuclear-cytoplasmic interaction” hypothesis (Song et al. 1995; Leitch et al. 2006; Dodsworth et al. 2020) proposes that the maternal subgenome of an allopolyploid will become dominant over the paternal subgenome due to potential incompatibilities between the maternally inherited cytoplasmic environment and the paternal subgenome. In turn, this incompatibility would lead to asymmetric expression, retention and degradation between the subgenomes as a function of the degree of divergence between the diploid parental species (Song et al. 1995). With regard to mobile elements, the maternal subgenome of allopolyploids is also initially expected to be more efficient in controlling its TEs based on maternally inherited cytoplasmic TE-repressing factors, such as small interfering RNAs (Vicient and Casacuberta 2017). For example, a bias towards paternal subgenome degradation has been reported for Nicotiana tabacum (Leitch et al. 2006; Renny-Byfield et al. 2011). However, in several other study systems this pattern was clearly not detected (e.g., Parisod et al. 2009; Mhri et al. 2019; Dodsworth et al. 2020). Nonetheless, altogether post-WGD TE activation is often (Kashkush et al. 2003; Parisod et al. 2010; Renny-Byfield et al. 2011; Oliver et al. 2013; Vicient and Casacuberta 2017; Chase et al. 2022), but not always (IWGSC et al. 2018; Burns et al. 2021) regarded as a key driver of adaptation to the polyploid state and of further evolution.
In addition to relatively continuous low-frequency transposition, TEs proliferate in bursts driven by genomic (Parisod et al. 2010; Wendel et al. 2018) or environmental disturbances (Chuong et al. 2017; Dubin et al. 2018), often coupled with demographic factors that impair efficient selection. As each burst is expected to be a largely independent, erratic event, TE mobilization may vary widely across populations and evolutionary stages. It has been shown that the composition and copy number of TEs can differ significantly between closely related species, which may otherwise display general synteny (Devos 2005; Willing et al. 2015), even between and within conspecific populations (Springer et al. 2016; Carpentier et al. 2019). This variation can translate into phenotypic diversity (Parisod et al. 2009; Sigman and Slotkin 2016; Springer et al. 2016; Chuong et al. 2017; Vicient and Casacuberta 2017; Dubin et al. 2018; Weissensteiner et al. 2020) when TEs affect in cis the activity of adjacent genes, for example by disrupting open reading frames when inserting in exons, acting as regulatory elements when inserting in promoters or introns, triggering transcript truncation and other novel splicing variants, or attracting epigenetic marks to the respective genomic region, thereby down-regulating or silencing genes. Such phenotypic effects will be immediately visible to selection. In recurrently formed neoallopolyploids, the content, positions and activity of TEs can therefore influence divergent trait expression, potentially leading to distinct phenotypes and environmental trajectories that may aid establishment as viable species, independent of diploid and polyploid relatives (Lynch and Force 2000; Soltis et al. 2004; Comai 2005; Jackson and Chen 2010). However, it is still unclear in which conditions repeated allopolyploidization will ultimately result in species with different evolutionary trajectories (Soltis et al. 2010) and how such sibling species maintain distinctiveness despite sharing the same ploidy and genetic background.
In this paper, we aim to provide insights into the fate of TEs across early stages of allopolyploid evolution by focusing on a naturally occurring series of independently established sibling allopolyploid marsh orchids (Dactylorhiza majalis s.l., Orchidaceae). We study five sequentially produced allotetraploids, Dactylorhiza baltica, D. majalis s.str., D. praetermissa, D. purpurella and D. traunsteineri s.l. (fig. 1) (Devos et al. 2006; Pillon et al. 2007; Brandrud et al. 2020), and their diploid parents, D. fuchsii and D. incarnata. Numerous previous studies have established that diploids in the Dactylorhiza genus have 20 chromosome pairs (2n = 2x = 40), whereas allotetraploids have 40 chromosome pairs (2n = 4x = 80) (Hagerup 1938; Vermeulen 1938; Heslop-Harrison 1953; Holmen and Kaad 1956; Lord and Richards 1977; Jonsell 1982; Lövkvist and Hultgård 1999). Since the divergence of the two diploid parental species around 5.5 MYA (Inda et al. 2012; Brandrud et al. 2020), their genomes have diverged under different demographic dynamics. In contrast to D. fuchsii, D. incarnata apparently suffered a dramatic bottleneck, likely related to a long-distance dispersal event from Asia to Europe roughly 1.5 MYA (Hedrén 1996; Pillon et al. 2007; Balao et al. 2016, 2017; Brandrud et al. 2020). Related to such a relatively recent bottleneck, Balao et al. (2017) have provided evidence for an overexpression in D. incarnata relative to D. fuchsii of the RNA-dependent DNA polymerase (RdDp) pathway and DNA integration, likely associated with transposition of mobile elements. These results, together with the divergent genome sizes of the diploids (1C-value estimates of 2.89 pg for D. fuchsii, and 3.55 pg for D. incarnata; Aagaard et al. 2005) suggest a distinct repeat load in the diploids, potentially driving significant genomic conflicts and dramatic genomic shock in their hybrids. Indeed, despite hybridizing on several occasions, these diploids have not produced any homoploid hybrid species (Pillon et al. 2007; Paun et al. 2009; Brandrud et al. 2020). However, the TE landscape of the two diploid parents and allopolyploids has not been previously studied.
Fig. 1.
The Dactylorhiza species investigated here and their genome sizes as estimated for this study, given as 1C values in picograms (pg) plus error bars as standard error of the mean. Note the difference genome-size scale on the Y-axes in (a) and (b). (a) Diploids: fuc—D. fuchsii, inc—D. incarnata. (b) Allotetraploids: bal—D. baltica, maj—D. majalis, pra—D. praetermissa, pur—D. purpurella, sil—in silico allopolyploid, tra—D. traunsteineri. The dashed line in (b) represents the mean of the additive genome size of the diploid parents (D. fuchsii and D. incarnata), whereas the grey area shows the 95% confidence interval. Superscript letters (a/b/c) are used to display significant differences in genome size. Absence of shared letters (e.g., ab vs. c) indicates a significant difference in genome size (supplementary table S2, Supplementary Material online). Ordering of the allotetraploids on the X-axis follows the relative age estimates according to Brandrud et al. (2020), with D. purpurella further estimated to be ca 530 generations old and D. majalis ca 104,000 generations (maximum estimates from Hawranek 2021). Plant illustrations modified from Nelson (1976).
Recent genomic work based on RAD-seq across the orchid genus Dactylorhiza (Brandrud et al. 2020) illustrated repeated, unidirectional, independent allopolyploidization involving the two diploid parents, resulting in an array of ecologically divergent, sibling allotetraploids (Hedrén 1996; Dijk and Grootjans 1998; Pillon et al. 2007; Hedrén et al. 2011; Paun et al. 2011; Wolfe et al. 2021). Based on the number of private alleles accumulated in each allopolyploid, Brandrud et al. (2020) inferred relative ages for these allopolyploidizations, later confirmed by Hawranek (2021) that estimated with coalescent methods maximum ages for the allopolyploids ranging from ca. 530 generations for D. purpurella, to ca. 74,000 generations for D. traunsteineri and ca. 104,000 generations for D. majalis s.str. Making use of this series of sibling allopolyploid marsh orchids of different ages, we specifically ask: 1) does WGD trigger monoploid genome size changes in Dactylorhiza majalis s.l. and what are the temporal dynamics of these changes, 2) which TE families experience changes post-polyploidization and does this relate to imbalances between parental TE loads, and 3) are the observed large-scale changes repeated among the sibling allopolyploids?
Results
Genome Size of Early Allopolyploids Is Consistent with Parental Additivity, But Those of Older Allotetraploids Are Larger Than Expected
To allow comparative analyses of repeat composition in the sibling Dactylorhiza allopolyploids and their diploid progenitors, we first estimated genome sizes for multiple accessions of each species (supplementary table S1, Supplementary Material online) using flow cytometry. For the diploid parental species, we obtained 1C genome sizes consistent with previous estimates (Aagaard et al. 2005): 2.93 pg versus 2.89 pg for D. fuchsii, and 3.60 pg versus 3.55 pg for D. incarnata (fig. 1a, table 1; supplementary table S1, Supplementary Material online).
Table 1.
Flow Cytometry Estimates of Genome Size.
| Species | Ploidy | Na | 1C (pg)b | 1C, SEMc | 2C (Gbp)b |
|---|---|---|---|---|---|
| Dactylorhiza fuchsii | 2x | 4 | 2.93 | ±0.18 | 5.73 |
| D. incarnata | 2x | 7 | 3.60 | ±0.14 | 7.04 |
| D. purpurella | 4x | 5 | 6.69 | ±0.05 | 13.09 |
| D. baltica | 4x | 7 | 6.83 | ±0.03 | 13.37 |
| D. praetermissa | 4x | 5 | 7.40 | ±0.29 | 14.47 |
| D. traunsteineri | 4x | 5 | 7.24 | ±0.25 | 14.16 |
| D. majalis | 4x | 8 | 7.28 | ±0.16 | 14.24 |
Individual genome-size measurements are reported in supplementary table S1, Supplementary material online.
N, sample size.
The mean genome size, given as picograms (1C) or giga base pair (2C).
SEM, standard error of the mean.
Genome sizes for the sibling allopolyploids varied between 6.69 pg/1C for D. purpurella and 7.40 pg/1C for D. praetermissa (fig. 1b and table 1). Comparison of the observed and expected (i.e., additive relative to the parents) genome sizes of allotetraploids revealed that the youngest allopolyploid, D. purpurella, previously estimated to be maximum 530 generations old (Hawranek 2021) fell closest to but slightly outside the expected value (fig. 1b). The significance of differences in genome sizes between pairs of species and between each allotetraploid and additive expectations is indicated in figure 1 and in supplementary table S2, Supplementary Material online. Taking into account relative (Brandrud et al. 2020) and absolute age estimates (Hawranek 2021) to order the allotetraploid formations through time, our results (fig. 1b, and table 1) therefore indicate that the allotetraploid genome size is initially additive, but appears enlarged by ca. 13.5% in comparison to expectations in intermediate-aged and older allopolyploids (Hawranek 2021).
The Diploid Parental Species have a Distinct Repeat Composition
To assess the relative repeat composition of the two parental diploids and their allotetraploids, we used Illumina whole-genome skimming for 35 accessions representing seven species (supplementary table S4, Supplementary Material online) and the RepeatExplorer pipeline (Novak et al. 2013; Novak et al. 2020b). When comparing the abundances (supplementary table S3, Supplementary Material online) and relative proportions (table 2) of tandem and dispersed repeat proportions between the two parental diploids, our results confirmed that D. incarnata has overall higher repeat content corresponding to its larger genome (total repeats estimated at 2.6 Gb for D. incarnata vs. 2.03 Gb for D. fuchsii, calculated on haploid genome sizes; supplementary table S3, Supplementary Material online). When looking at the different repeat types, we found that Ty1-copia LTR-retrotransposons are in general more abundant in the D. incarnata genome in comparison to D. fuchsii (1.75 Gb vs. 1.28 Gb, calculated on haploid genome sizes; supplementary table S2, Supplementary Material online; fig. 2a–c and table 2) and appear to drive most of the genome size difference between genomes of the two diploid parents (fig. 1). Ty3-gypsy LTR-retrotransposons exhibit a more heterogeneous pattern for the two diploid genomes (fig. 2d–f, table 2; supplementary table S3, Supplementary Material online) with chromoviruses being more abundant in the genome of D. fuchsii and non-chromoviruses in the D. incarnata genome. Lastly, we found one element, a putative tandem repeat, potentially derived from a non-autonomous miniature inverted-repeat TE (MITE), that makes up a difference of more than 170 Mbp between the two diploids and is more abundant in the smaller genome of D. fuchsii (i.e., the MITE-like element makes up 206.3 Mb in the diploid genome size of D. fuchsii and 35.2 Mb for D. incarnata; see fig. 2g, table 2; supplementary table S2, Supplementary Material online). Finally, we identified two potentially differentiated satDNAs in each diploid (fig. 2h), but little difference in the proportion of clusters annotated as rDNAs (fig. 2i).
Table 2.
Genome Proportion Estimates (%) of Repeats for Each Species Analysed Here.
| Repeat type/family | fuc | inc | pur | bal | pra | tra | maj |
|---|---|---|---|---|---|---|---|
| LTR retrotransposons | 44.8 | 49.6 | 48.2 | 47.0 | 45.7 | 45.9 | 46.0 |
| Ty1-copia | 22.9 | 27.7 | 26.2 | 26.0 | 24.8 | 25.0 | 24.8 |
| SIRE/maximus | 17.0 | 21.2 | 19.7 | 19.8 | 18.8 | 18.7 | 18.4 |
| Other | 5.9 | 6.6 | 6.5 | 6.2 | 6.0 | 6.3 | 6.5 |
| Ty3-gypsy | 17.5 | 16.5 | 16.8 | 16.1 | 16.0 | 16.1 | 16.2 |
| Chromovirus | 8.1 | 5.5 | 6.6 | 6.3 | 6.4 | 6.3 | 6.4 |
| CRM | 1.4 | 0.7 | 1.0 | 1.0 | 1.0 | 0.9 | 1.0 |
| Tekay | 6.3 | 4.5 | 5.2 | 5.0 | 5.1 | 5.0 | 5.0 |
| Other | 0.4 | 0.3 | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 |
| Non-chromovirus | 9.4 | 11.0 | 10.3 | 9.8 | 9.6 | 9.8 | 9.8 |
| Athila | 3.2 | 4.6 | 4.0 | 3.9 | 3.8 | 3.8 | 3.7 |
| Retand | 6.2 | 6.4 | 6.3 | 5.9 | 5.8 | 6.0 | 6.1 |
| LTR unclassified | 4.4 | 5.4 | 5.2 | 5.0 | 4.9 | 4.9 | 5.0 |
| Non-LTR retrotransposons | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 |
| LINE | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 |
| DNA transposons | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 | 0.4 |
| hAT | 0.3 | 0.3 | 0.3 | 0.3 | 0.2 | 0.3 | 0.3 |
| MITE | 0.06 | 0.06 | 0.06 | 0.05 | 0.05 | 0.05 | 0.06 |
| Other | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 |
| Tandem repeats | 5.2 | 1.9 | 2.5 | 4.6 | 7.9 | 5.8 | 5.8 |
| rDNA | 0.4 | 0.4 | 0.3 | 0.3 | 0.2 | 0.2 | 0.3 |
| satDNA | 4.8 | 1.5 | 2.2 | 4.3 | 7.7 | 5.6 | 5.5 |
| MITE-like | 3.6 | 0.5 | 1.0 | 3.2 | 6.6 | 4.5 | 4.5 |
| Unclassified | 7.3 | 8.6 | 8.3 | 8.0 | 7.8 | 7.9 | 8.1 |
| Total TEs | 45.6 | 50.4 | 48.9 | 47.8 | 46.5 | 46.7 | 46.9 |
| Total repeats | 71.0 | 73.8 | 72.1 | 73.3 | 73.8 | 73.0 | 72.6 |
Species abbreviations are explained in the legend of figure 1.
Fig. 2.
Repeat comparison for the parental diploid species. Cluster proportion for different repeat classes in paternal D. incarnata (“inc”, X-axis) against the maternal D. fuchsii (“fuc”, Y-axis). Panels (a–c) show LTR-retrotransposons Ty3-gypsy elements in grey, with specific families highlighted in yellow: chromovirus, subfamily CRM (a); chromovirus, subfamily Tekay (b); non-chromovirus (c). Panels (d–f) show LTR-retrotransposons Ty1-copia elements in grey, with specific families highlighted in orange: Alesia (d); SIRE/Maximus (e); other Ty1-copia families (f). Panels (g–i) show tandem repeat clusters in grey, with specific classes highlighted in blue: MITE-like element (g); satellite DNA (h); 5S and 35S rDNA (I).
Net Repeat Content Increases with Age among the Sibling Allopolyploids up to a Certain Level
The proportion estimates of total repeats for the allotetraploids were found to range from 72.1% of the genome for D. purpurella up to 73.8% for D. praetermissa (table 2 and fig. 3e). These proportions were similar to the additive expectation of 72.5%. The pattern of total repeats (fig. 3a; supplementary table S3, Supplementary Material online) showed a maximum net difference of almost 1.25 Gbp between D. purpurella and D. praetermissa, corresponding well to the maximum observed difference in genome size estimated by flow cytometry (i.e., 1.38 Gbp between the same species; table 1). With respect to the previously estimated age variation between the allotetraploids (Brandrud et al. 2020; Hawranek 2021), the lowest estimate for total repeats was recovered in the youngest allotetraploid, D. purpurella—an estimate slightly higher than expected (fig. 3a; supplementary table S3, Supplementary Material online). The genome length representing the repeats is markedly increased compared to expectations for the intermediate aged (i.e., D. praetermissa and D. traunstenieri) and the oldest allotetraploids (i.e., D, majalis), but it is not remarkably different between them.
Fig. 3.
Repeat composition in Gbp (a–d) and relative proportions of repeats classifications (e–h) for the sibling allotetraploid Dactylorhiza analysed here. Species abbreviations follow figure 1. The lines indicate the expected additivity of the parental diploids (D. fuchsii and D. incarnata) for the respective type of elements. Ordering of the allotetraploids on the X-axis follows the relative age estimates according to Brandrud et al. (2020), with D. purpurella further estimated to be ca. 530 generations old and D. majalis ca. 104,000 generations (maximum estimates from Hawranek 2021). Note the magnified scale of the Y-axes.
Over 45% of the allotetraploid genomes was found to be represented by LTR retrotransposons (table 2), mainly Ty1-copia (ranging among allotetraploids between 24.8 and 26.2%) and Ty3-gypsy (between 16 and 16.8%; supplementary fig. S1bandc, Supplementary Material online). In both Ty1-copia and Ty3-gypsy families an increase in net amounts is estimated, in particular in the older allotetraploids (fig. 3bandc; supplementary table S3, Supplementary Material online). However, in genomic proportions we observe slightly less than parental additivity for LTRs in most allotetraploids (fig. 3fandgandtable 2). Both Ty1-copia and Ty3-gypsy elements are abundant in these polyploid genomes (fig. 3bandc; supplementary table S3, Supplementary Material online), and the differences in LTR elements between the young and old allotetraploids contribute altogether ca. 300 Mb to the maximum observed variation in genome size.
More than 800 Mbp of the maximum difference observed among allotetraploid genome sizes appears to be driven by tandem repeats (supplementary table S3, Supplementary Material online), which are found in genomic proportions ranging between 2.5% for D. purpurella and 7.9% for D. praetermissa (table 2). In particular, a specific element, potentially derived from a MITE, which shows variation between the diploid parental representatives (with larger amounts in the maternal species, D. fuchsii; fig. 2g), is found to be highly variable among the allotetraploid species (fig. 3d, fig. 4a, table 2; supplementary table S2, Supplementary Material online), but also between individuals within all allotetraploids, except the youngest D. purpurella (fig. 4b). This represents the largest cluster in the analysis (fig. 5), as well as the element contributing the most to the observed genome size difference among the allotetraploids. The cluster localizes to subterminal chromosomal positions (see below and fig. 6) and may represent a putative tandem repeat, although it still carries genomic signatures of MITEs, specifically inverted repeats that may represent original TIRs. Therefore, we refer to it here as a MITE-like element.
Fig. 4.
Repeat comparison for sibling allotetraploids. (a) The relative proportion of repeat elements of each allotetraploid (Y-axis) relative to an in silico allopolyploid (X-axis), representing the expected additivity. The repeat clusters for each allotetraploid are shown in colors corresponding to those used in panel (b). A MITE-like element is indicated with arrows. (b) Within-species variation of the MITE-like element. Ordering of the allopolyploids on the X-axis follows age estimates following Brandrud et al. (2020) and Hawranek (2021). Species abbreviations follow figure 1.
Fig. 5.
Repeat elements with at least 20% size difference between the observed value of at least one allopolyploid and the expected (in silico). The clusters are sorted into groups originally larger in a parental species or similar in size. Each column of the square plot is normalized by the expected value (in silico), with smaller squares indicating a decrease whereas larger squares indicate an increase in size according to the legend. The color of the squares represents repeat types. The bar graph on top shows the size of a cluster in the analysis, that is, the proportion of reads out of all analysed reads (note the break in the Y scale going from 0.50 and 3.25%).
Fig. 6.
GISH in D. majalis labelled with genomic DNA of D. incarnata and D. fuchsii as probes, followed by FISH localization of the MITE-like repeat. Chromosomes were counterstained by DAPI; GISH and FISH signals are shown in color as indicated. Scale bar, 10 µm.
To investigate a potential subgenome dominance bias in the allotetraploids, we compared the observed estimates in the sibling allopolyploids to an expected value of an in silico allopolyploid to extract clusters different in size from the expected. We further classified the respective clusters as originally larger (i.e., by at least 20%) in a parental species or similar in size between the parents. The largest observed difference was the change in the MITE-like element (fig. 5). We also found that among the clusters originally larger in the paternal D. incarnata genome mainly Ty1-copia elements were different among the allopolyploids, whereas among the clusters originally larger in the maternal D. fuchsii mainly Ty3-gypsy elements were different (fig. 5).
We then looked into specific results for different TE families. We normalized each cluster relative to the in silico allopolyploid with values between −1 and 1, where a value of 0 means that the cluster equals the in silico allopolyploid and by extension the expectation from the parental species. We found a few elements with large effects scattered among families (supplementary fig. S2, Supplementary Material online).
The Highly Dynamic MITE-Like Element Localizes in Subterminal Positions on Most Allotetraploid Chromosomes
Due to the highly dynamic amounts recovered for this MITE-like element, we further used a combination of genomic in situ hybridization (GISH) and fluorescence in situ hybridization (FISH) in accessions of D. majalis and D. traunsteineri to investigate its location in the allotetraploid genomes and establish if it is interspersed or clustered. This MITE-like element was localized in subterminal chromosomal positions on nearly all chromosomes of the allotetraploids (fig. 6; supplementary fig. S3, Supplementary Material online). Because we estimated with RepeatExplorer a significant difference in the amount of the MITE-like element copies in the parental diploids (fig. 2g, fig. 4b, table 2; supplementary Table S3, Supplementary Material online), we confirmed with FISH its relatively high copy number in the maternal diploid, D. fuchsii (supplementary fig. S3b, Supplementary Material online), in contrast to D. incarnata that showed no detectable signal (supplementary fig. S3a, Supplementary Material online). We were further interested if the MITE-like element has spread among the two subgenomes of the allotetraploids. Therefore we stained the two subgenomes and the MITE-like element with fluorescent dyes and observed that this element is found in both, but it provides comparatively larger signals on the D. fuchsii allotetraploid subgenomes (fig. 6).
Discussion
Rampant TE Amplification Is Observed Only in Older Allopolyploids
Genome reorganizations after allopolyploidization have been investigated in different systems, from recent neoallopolyploids (e.g., 140 years old Mimulus peregrinus, Edger et al. 2017) to old, well-established paleopolyploids (e.g., ∼2–4 Ma tetraploid and hexaploid Spartina, Giraud et al. 2020, and ∼6 Ma Nicotiana sect. Suaveolentes, Dodsworth et al. 2020; Chase et al. 2022), and from synthetic allopolyploids (e.g., Nicotiana, Mhiri et al. 2019) to natural populations (e.g., Spartina anglica, Parisod et al. 2009). However, to our knowledge, mobile element turnover after allopolyploidization has not been evaluated thus far in multiple allopolyploid species formed independently and unidirectionally by the same parental diploids. The sibling allotetraploid marsh orchids studied here fill in an important knowledge gap because they allow testing if these replicated the events following genome merger. Allotetraploids in Dactylorhiza majalis s.l. are still in the early phases of their evolution, but they are established species (Pillon et al. 2007; Brandrud et al. 2020; Hawranek 2021), with large distributions and distinct ecologies (Hedrén et al. 2011; Paun et al. 2011; Wolfe et al. 2021), and therefore represent an excellent model for studying post-WGD processes through time.
In the youngest of the sibling marsh orchids studied here, D. purpurella (maximum 530 generations old, Hawranek 2021; see also Hedrén et al. 2011, Brandrud et al. 2020) we observe nearly the expected (additive) genome size (fig. 1) and repeat content (fig. 3). Previous studies of rDNA ITS sequences have also shown that D. purpurella individuals often exhibit an equal proportion of the parental alleles (Pillon et al. 2007). These observations are consistent with findings in other systems, for example in synthetic allopolyploid wheat (Kashkush et al. 2003), Brassica napus (Sarilar et al. 2013) and Nicotiana (Mhri et al. 2019), where no or only minimal transposition and gene conversion have been reported after recent genome merger. The situation is different in Tragopogon neoallotetraploids, where homogenization of rDNA repeats happens quickly and in alternate directions in different populations (Kovarik et al. 2005; Matyášek et al. 2007). In resynthesized Mimulus perregrinus, Edger et al. (2017) reported reduced levels of CHH methylation within TEs that should promote their activation, and Kashkush et al. (2003) documented in synthetic wheat transcriptional upregulation of some retrotransposons despite a lack of net TE amplification. However, repressive control of retroelements seems often to be maintained in early stages of allopolyploid evolution, perhaps at the level of post-transcriptional regulation via heterochromatic small interfering RNAs (Wendel et al. 2016). We find this phase in which allopolyploids maintain a largely additive genome size to extend at least for hundreds of generations after whole genome doubling in marsh orchids, but in other systems it may cover tens to hundreds of thousand generations, as reported for example for Arabidopsis suecica (Burns et al. 2021); Melampodium strigosum (McCann et al. 2018) and Nicotiana rustica (Dodsworth et al. 2020).
However, in the case of Dactylorhiza, in species between hundreds and thousands of generations post-allopolyploidization (i.e., for D. baltica and D. praetermissa), we observe a genome upsizing close to 1 Gbp (1C) (fig. 1) and a corresponding level of repeat amplification (fig. 3). The allopolyploids could have experienced a phase when drift rather than selection affected their genomes as they likely expanded the sizes of their populations, niches and distribution. Repeat amplification in marsh orchids seems to involve largely proportional changes in copy numbers across the majority of repeat families, both tandem and dispersed, but it is particularly driven by changes in a MITE-like element. A correlation between genome size and number of TEs in a genome is widely accepted (Kidwell 2002; Touchon and Rocha 2007; Elliott and Gregory 2015). However, a limit to this correlation appears in the context of allopolyploid evolution in Dactylorhiza, as due to amplification of the MITE-like element, the proportion of other types of repeats in the genome appears to slightly decrease (fig. 3fandg and table 2) even though the net number of repeats increases (fig. 3bandc; supplementary table S3, Supplementary Material online) (Kelly et al. 2015; Novak et al. 2020a). In particular, in D. praetermissa, which must have had fewer than 74,000 generations (Brandrud et al. 2020; Hawranek 2021), total repeat number appears to have expanded with 14.5% relative to expectations (fig. 3a). Such an extensive TE mobilization may be associated with significant chromosomal rearrangements and impact evolution of genes and functions (Wendel et al. 2018). Similar but even more extensive TE amplification of around 29% was also observed in Nicotiana repanda, estimated to be roughly 4 M generations old (Dodsworth et al. 2020), much older than these allopolyploid marsh orchids.
Finally, no significant large scale differences are obvious between Dactylorhiza traunsteineri and D. majalis (fig. 3), which are each likely tens of thousands generations old (their origins predate the last glacial maximum; Hawranek 2021), and occur widely across Europe (Wolfe et al. 2021). Concerted evolution of rDNA alleles is well underway in these allopolyploids (Pillon et al. 2007), with D. majalis typically exhibiting only maternal copies. In multiple other established species a decrease of TE content rather than an increase compared to expected was evident over the long term (e.g., Ozkan et al. 2003; Leitch and Bennett 2004; Eilam et al. 2009; Chase et al. 2022). Indeed, it is generally accepted that over broad evolutionary time scales polyploidy is not a major factor in genome size increase (Wang et al. 2021; Chase et al. 2022). Our results confirm that the mechanisms controlling genome expansion and downsizing post-WGD obviously act on different time horizons. A similar age effect has been observed previously in Nicotiana, albeit across species that did not fully share parentage (Dodsworth et al. 2020). Our findings follow a similar pattern across timescales as in Nicotiana, although at a smaller scale and without a significant decrease compared to additivity; possibly our oldest allotetraploid, D. majalis, is too young (∼100k generations; Hawranek 2021) to show significant genome downsizing.
Element-Specific, But No Genome-Wide Parental Bias in TE Dynamics
Allopolyploidization can trigger genomic shock (McClintock 1984), imbalancing epigenetic control of TEs that are thus prone to undergo an acute proliferation (Wendel et al. 2016). In early stages of allopolyploid evolution, selection is also likely reduced, whereas the highly duplicated genomic landscape will mask deleterious effects of individual TE insertion in or around genes (Comai 2005). Our results document divergent genomic compositions in the parental diploid species, D. incarnata and D. fuchsii (fig. 2), and confirm a more than 20% difference between genome sizes (fig. 1a), suggesting a high potential for genomic shock in the resulting allotetraploids. In stark contrast to this expectation, we consistently observe that in the sibling Dactylorhiza allotetraploids nearly all TEs conform to the additive expectations (fig. 4a; supplementary fig. S1, Supplementary Material online), which has also been reported in other allopolyploids (e.g., McCann et al. 2018; Mhiri et al. 2019; Chen ZJ et al. 2020; Dodsworth et al. 2020; Burns et al. 2021). An overall genomic increase of TEs compared to expectation was also not found in Capsella bursa-pastoris (Ågren et al. 2016), but an increase in TE abundance in gene-rich regions was documented, consistent with the hypothesis that relaxed selection rather than an epigenetic imbalance explains the TE patterns. Chen et al. (2020) also found concordant TE changes in five Gossypium allopolyploid species and, most strikingly, they reported consistent movement of TE copies from the larger parental subgenome into the smaller parental subgenome, suggesting a homogenization of the initial genome-size difference.
The observed departure from expectation in genome sizes of Dactylorhiza allotetraploids is associated with moderate, apparently unstructured TE dynamics, in particular retroelements (table 2, fig. 4a, fig. 5; supplementary fig. S2, Supplementary Material online), but is mostly driven by amplification of a MITE-like element. It is noteworthy that this putative tandem repeat is found at such low copy-number in the larger paternal genome of D. incarnata (fig. 4b and table 2) that there is no detectable FISH signal in contrast to the maternal species (supplementary fig. S3aandb, Supplementary Material online). In the allopolyploids, clear amplification (and thus likely spread) of the MITE-like element occurs in both subgenomes (fig. 6; supplementary fig. S3candd, Supplementary Material online), which, in addition to overexpression of the RdDp pathway and DNA integration in D. incarnata in comparison to D. fuchsii (Balao et al. 2017), could suggest that the silencing mechanisms of the smaller maternal genome may control activity of a large portion of TEs in the allopolyploids. However, it does not control that of the MITE-like element, which shows a high copy number already in the maternal genome. Tandem repeats are known to quickly evolve and in general not under the same constraints as other types of repeats. When it comes to tandem repeats and especially satellites, the “library hypothesis” (Ruiz-Ruano et al. 2016) proposes that a variety of satDNA families in an ancestral species can diverge and expand/retract rapidly between lineages (Koukalova et al. 2010; Garrido-Ramos 2017). This rapid rate of evolution can result in considerable differences in the satDNA landscape among closely related species (Koukalova et al. 2010; Ambrožová et al. 2011; Emadzade et al. 2014; Samoluk et al. 2017; Belyayev et al. 2020; Palacios-Gimenez et al. 2020) and among populations within species (Garrido-Ramos 2017).
Evolutionary Dynamics of TEs Are Remarkably Consistent among Independently Formed Sibling Allopolyploids
It is widely accepted that most polyploids have multiple origins in time and place, creating an array of populations with distinct genetic, ecological, morphological, and physiological properties (Soltis and Soltis 1999, Paun et al. 2007). The sibling marsh orchids studied here show distinct ecological preferences and largely specific distributions, a legacy of their individual evolutionary histories (Hedrén 1996; Dijk and Grootjans 1998; Pillon et al. 2007; Paun et al. 2011, Wolfe et al. 2021). Here, we evaluated stochasticity of TE amplification after allopolyploidization and found little variation between sibling allopolyploids suggesting no significant “genomic shock” following their formation. In contrast, species-specific DNA methylation patterns have been previously documented in natural populations of D. majalis and D. traunsteineri, with almost 10% of the investigated cytosines found to be fully methylated in one polyploid and unmethylated in the other (Paun et al. 2010; Trucchi et al. 2016). This suggests that although the TE types and proportions over the entire genomes appear to be tightly regulated and consistent among sibling allopolyploids, there may be high variation among individual TE insertion sites. Individual TE insertions can trigger a shift in phenotype (Chuong et al. 2017; Dubin et al. 2018), potentially contributing molecular and ecological individuality to each of the sibling allopolyploids. Such locus-specific effects of TE insertions and their potential phenotypic effects remain to be investigated in future studies.
Materials and Methods
Plant Material, DNA, and Illumina Sequencing
Thirty-eight samples were included in the genomic analyses (supplementary table S3, Supplementary Material online) in addition to 46 other samples that have been used for genome-size estimations (supplementary table S1, Supplementary Material online). The sampling aimed to represent as much as possible the variation in each species, for example, by covering much of the distribution of each species. Total DNA was isolated from silica-dried leaves using a cetyl trimethylammonium bromide (CTAB) procedure (Doyle and Doyle 1990) or the DNeasy Plant Mini Kit (Qiagen, Venlo, Netherlands). DNA was purified with the Nucleospin gDNA clean-up kit (Macherey-Nagel, Düren, Germany) following the manufacturer’s protocol. The DNA was further sheared to an average 450 bp size using a Bioruptor Pico (Diagenode). Individually indexed high-throughput sequencing libraries were prepared using the TruSeq DNA PCR-Free Library Kit (Illumina Inc.). The Illumina HiSeq platform was used to generate paired-end 126 or 150 bp reads for each library (VBCF, NGS Unit, Vienna, Austria).
Genome Size Estimation
Genomes size was estimated following Temsch et al. 2010. Young fruits and leaves were collected and stored at 4 °C for up to a week until used for flow cytometry. The material was co-chopped with a suitable internal standard (Galbraith et al. 1983) in cold Otto’s isolation buffer (Otto et al. 1981). Solanum pseudocapsicum (1.295 pg/1C; Temsch et al. 2010) or Pisum sativum (4.42 pg/1C; Greilhuber and Ebert 1994) were used as internal standards. The suspension was first filtered through a 30 μm nylon mesh and incubated together with RNase at 37 °C for 30 min. Then the suspension was stained with propidium iodide dissolved in Otto’s buffer II for about one hour at 4 °C. Genome size measurements were done on a CyFlow ML flow cytometer (Partec, Münster, Germany) equipped with a green laser (100 mW, 532 nm, Cobolt Samba; Cobolt AB, Stockholm). For each sample, an average of three runs with 3,333 particles was performed (supplementary table S1, Supplementary Material online); the results from all runs were averaged to estimate the final genome size. Multiple measurements were conducted to overcome potential biases in the genome size estimation (cf. Greilhuber et al. 2007).
As a reference point for what to expect by additivity of the parental genome sizes in silico allopolyploid genome size were computed by pairwise combining samples of the diploid parental species. To determine species pairs with significantly different genome sizes a pairwise t-test was performed with a Benjamini and Hochberg (1995) multiple testing correction.
Data Preparation and Repeat Content Estimation
Raw Illumina sequence pairs were pre-processed for quality using a custom Python script (qualityFilterPairEnd.py available at https://github.com/mc-er/dact-TEs/), which removes read pairs with quality scores lower than 20 for a maximum of 5% of the total read length. Adapters were removed using the program cutadapt v.2.10 (Martin 2011) for Illumina TruSeq adapters, discarding any pairs with trimmed reads or those containing indeterminate bases (N). Following pre-processing, read pairs were trimmed to a length of 125 bp and mapped against the plastid genome of D. fuchsii (GenBank Accession number MK908418) using BWA (Li and Durbin 2009). Read pairs with high mapping scores towards the plastid genome were discarded to unambiguously focus the repeat content estimation solely on the nuclear genome.
RepeatExplorer v.2.3.7 (Novak et al. 2013; Novak et al. 2020b) was used to estimate repeat content of each parental species and the five sibling allopolyploids. To capture potential intraspecies variation, sequences from five individuals across the range of each species (supplementary table S3, Supplementary Material online) were used in the RepeatExplorer analyses (comparative analysis mode). First an analysis was run for each species (analyses 1–7; supplementary table S4, Supplementary Material online) using as many reads as possible with the maximum memory allowed (TAREAN_MAX_MEM = 110 Gb) to provide the basis of cluster annotation of the comparative analysis run with all species together. RepeatExplorer optimized memory usage and therefore the number of reads used per individual depending on the exactly-matching “repetitiveness” identified for each individual (Novak et al. 2020b). Lastly, a comparative analysis was conducted for parental diploids and allopolyploids together using a number of reads proportional to their genome sizes (analysis 8; supplementary table S4, Supplementary Material online). Prior to pooling species samples into a species representative for the comparative analyses, pairwise comparisons of cluster sizes were made between all samples within species in order to only include samples with similar repeat dynamics (supplementary figs. S5–S11, Supplementary Material online). All analyses used the Viridiplantae v.3.0 database (Neumann et al. 2019), automatic filtering for abundant satellites, keeping the read name and an “extra-long” analysis.
Repeat Annotation
The RepeatExplorer clusters were first annotated based on superclusters. Each cluster within a supercluster was annotated according to the majority of matches found across all clusters to a TE protein-domain database (Neumann et al. 2019). Contigs from the remaining clusters for which no domains were present were inspected using clview and dotter v.4.44.1 (Sonnhamme and Durbin 1995) to detect insertion sites and inverted repeat signatures. Satellite DNAs were recovered using TAREAN, a module of the RepeatExplorer (Novak et al. 2017) and manually annotated using dotter v.4.44.1 (Sonnhammer and Durbin 1995). Clusters exclusively exhibiting insertion sites with no other characteristics typical of known repeats were left unclassified. The clusters returned by RepeatExplorer comparative analysis were annotated by cross referenced annotations from the species analysis with custom Python scripts (comparative_annotation.py and annotateCOMP.py available at https://github.com/mc-er/dact-TEs/). Clusters with ≥ 20% of reads with an annotation from the species analysis and the second major annotation being unclassified (> 50%) were given the annotation based on the species analyses.
Parental Bias
The clusters identified in the comparative analysis (analysis 8; supplementary table S4, Supplementary Material online) were first filtered to only keep clusters with a proportion ≥ 0.001 out of analysed reads. The remaining clusters were then classified in two further steps. First, the clusters were regarded either as originally larger in one of the parental genomes or similar in size in the parents. For a cluster to be classified as originally larger in one of the parental genomes, the size difference between the parental clusters had to be ≥ 20%; if the difference was < 20% the cluster was classified as “similar” in size. The second classification compared the observed cluster size in the sibling allopolyploids to the expected value of an in silico allopolyploid. Clusters were retained and reported in fig. 5 if at least one of the sibling allopolyploids exceeded a threshold difference of ±20% to the expected in silico value.
Chromosome Preparations
Root tips were harvested from cultivated plants, pre-treated with ice-cold water for 16 h, fixed in 3:1 ethanol:acetic acid fixative for 24 h at 4 °C and stored at −20 °C until further use. Selected root tips were rinsed twice for 5 min in distilled water, and twice for 5 min in citrate buffer (10 mM sodium citrate, pH 4.8). Then the root tips were digested in 0.3% cellulase, cytohelicase and pectolyase (all Sigma Aldrich) in citrate buffer at 37 °C for 3 h. After digestion, individual root tips were dissected on a microscope slide in 20 μL acetic acid and spread on the slide placed on a metal hot plate (50 °C) for ca. 30 s. This preparation was fixed in freshly prepared 3:1 ethanol:acetic acid fixative by dropping the fixative around the drop of acetic acid and into it. The preparation was dried using a hair dryer and staged using a phase contrast microscope. Chromosome preparations were treated with 100 μg·mL−1. RNase in 2 × sodium saline citrate (SSC; 20 × SSC: 3 M sodium chloride, 300 mM trisodium citrate, pH 7.0) for 60 min and with 0.1 mg·ml−1 pepsin in 0.01 M HCl at 37 °C for 5 min; then postfixed in 4% formaldehyde in 2 × SSC for 10 min, washed in 2 × SSC twice for 5 min, and dehydrated in an ethanol series (70, 90, and 100%, 2 min each).
DNA Probes
A synthetic oligonucleotide probe was designed for the MITE-like repeats. A target sequence of 60 nt (AATTGCGAGTCGCATAGTTTAGGTAATATACGCAGAACACGCGCCCTTTGAAAATAGACG) was selected from DNA alignments using Geneious v.11.1.5 software package v.2 to minimize self-annealing and formation of hairpin structures. The DNA probe preparation and labelling followed published protocols (Mandáková and Lysak 2016). For GISH, total genomic DNA (gDNA) was extracted from young leaves of D. incarnata and D. fuchsii according to Dellaporta et al. (1983) followed by RNase treatment (50 μg·mL−1). Extracted gDNA was checked for protein, starch and RNA contamination using a Beckmann photospectrometer and ran on a 1% (w/v) agarose gel in 1 × Tris-acetate-EDTA (TAE) buffer. All DNA probes were labelled with biotin-dUTP or digoxigenin-dUTP by nick translation as described in Mandáková and Lysak (2016).
In Situ Hybridization
Selected labelled DNA probes were pooled together, ethanol precipitated, dissolved in a 20 μL mixture containing 50% formamide, 10% dextran sulphate and 2 × SSC, and pipetted onto each of the microscopic slides. The slides were heated at 80 °C for 2 min and incubated at 37 °C overnight. The hybridized probes were visualized through fluorescently labeled antibodies against biotin-dUTP (red) and digoxigenin-dUTP (green) as in Mandáková and Lysak (2016). Chromosomes were counterstained with 4′,6-diamidino-2-phenylindole (DAPI, 2 μg·mL−1) in Vectashield antifade. Fluorescence signals were analysed and photographed using a Zeiss Axioimager epifluorescence microscope and a CoolCube camera (MetaSys- tems, Altlussheim, Germany). Individual images were merged and processed using the Photoshop CS software (Adobe Systems).
Supplementary Material
Acknowledgments
This work was financially supported by the Austrian Science Fund (FWF) by grant Y661-B16 to O.P. and by a doctoral programme (D.K.) grant W1225-B20 to a faculty team, including O.P. Faculty members, in particular Andreas Futschik, Robert Kofler and Christian Schlötterer, the students and the scientific advisory board of the Vienna Graduate School of Population Genetics (www.popgen-vienna.at/) are acknowledged for numerous discussions and feedback on this work. We thank Richard Bateman, Kadri Tali, David Ståhlberg, Erik Westberg, Henrik Ærenlund Pedersen, Sofie Nordström and Sebastian Sczepanski for collecting some of the samples, and Juliane Baar, Marie Huber, and Daniela Paun for their support during laboratory work. Sequencing was performed at the Vienna BioCenter Core Facilities (https://www.viennabiocenter.org/). Computational resources were provided by the Vienna Scientific Cluster and the Life Science Compute Cluster of the University of Vienna. Natural England and the Forestry Commission (UK), and regional county administrations in Austria, France, and Sweden are acknowledged for issuing necessary collecting permits.
Contributor Information
Mimmi C Eriksson, Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria; Vienna Graduate School of Population Genetics, Veterinärplatz 1, A-1210 Vienna, Austria.
Terezie Mandáková, Plant Cytogenomics Research Group, CEITEC−Central−European Institute of Technology, Masaryk University, Brno 62500, Czech Republic; Central European Institute of Technology, Masaryk University, Brno 62500, Czech Republic; Institute of Experimental Biology, Faculty of Science, Masaryk University, Brno 62500, Czech Republic.
Jamie McCann, Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria.
Eva M Temsch, Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria.
Mark W Chase, Royal Botanic Gardens Kew, London TW9 3AE, United Kingdom; Department of Environment and Agriculture, Curtin University, Perth, Western Australia, Australia.
Mikael Hedrén, Department of Biology, University of Lund, Sölvegatan 37, SE-223 62 Lund, Sweden.
Hanna Weiss-Schneeweiss, Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria.
Ovidiu Paun, Department of Botany and Biodiversity Research, University of Vienna, Rennweg 14, A-1030 Vienna, Austria.
Supplementary Material
Supplementary data are available at Molecular Biology and Evolution online.
Author Contributions
Study conceived and designed by O.P. Sampling and laboratory work conducted by M.C.E., T.M., E.T., M.W.C., M.H., O.P. Data analyses conducted by M.C.E., with feedback from J.M., H.W.S. and O.P. Interpretation of the results was undertaken by M.C.E., T.M., J.M., M.W.C., H.W.S., O.P. The manuscript was drafted by M.C.E., with feedback from M.W.C. and O.P., and was revised and approved by all authors.
Data Availability
The raw Illumina sequencing data underlying this article are available in NCBI SRA database at https://www.ncbi.nlm.nih.gov/sra under the BioProject PRJNA845973 (accession numbers SRX15595094-SRX15595128).
References
- International Wheat Genome Sequencing Consortium (IWGSC) . 2018. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science 361:eaar7191. [DOI] [PubMed] [Google Scholar]
- Aagaard SMD, Såstad SM, Greilhuber J, Moen A. 2005. A secondary hybrid zone between diploid Dactylorhiza incarnata ssp. cruenta and allotetraploid D. lapponica (Orchidaceae). Heredity (Edinb) 94:488–496. [DOI] [PubMed] [Google Scholar]
- Ågren JA, Huang H-R, Wright SI. 2016. Transposable element evolution in the allotetraploid Capsella bursa-pastoris. Am J Bot. 103:1197–1202. [DOI] [PubMed] [Google Scholar]
- Ambrožová K, Mandáková T, Bureš P, Neumann P, Leitch IJ, Koblížková A, Macas J, Lysak MA. 2011. Diverse retrotransposon families and an AT-rich satellite DNA revealed in giant genomes of Fritillaria lilies. Ann Bot. 107:255–268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Balao F, Tannhäuser M, Lorenzo MT, Hedrén M, Paun O. 2016. Genetic differentiation and admixture between sibling allopolyploids in the Dactylorhiza majalis complex. Heredity (Edinb) 116:351–361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Balao F, Trucchi E, Wolfe T, Hao B-H, Lorenzo MT, Baar J, Sedman L, Kosiol C, Amman F, Chase MW, et al. 2017. Adaptive sequence evolution is driven by biotic stress in a pair of orchid species (Dactylorhiza) with distinct ecological optima. Mol Ecol. 26:3649–3662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Belyayev A, Jandová M, Josefiová J, Kalendar R, Mahelka V, Mandák B, Krak K. 2020. The major satellite DNA families of the diploid Chenopodium album aggregate species: arguments for and against the “library hypothesis”. PLoS One 15:e0241206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benjamini Y, Hochberg Y. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol. 57:289–300. [Google Scholar]
- Brandrud MK, Baar J, Lorenzo MT, Athanasiadis A, Bateman RM, Chase MW, Hedrén M, Paun O. 2020. Phylogenomic relationships of diploids and the origins of allotetraploids in Dactylorhiza (Orchidaceae). Syst Biol. 69:91–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burns R, Mandáková T, Gunis J, Soto-Jiménez LM, Liu C, Lysak MA, Novikova PY, Nordborg M. 2021. Gradual evolution of allopolyploidy in Arabidopsis suecica. Nat Ecol Evol. 5:1367–1381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carpentier MC, Manfroi E, Wei FJ, Wu HP, Lasserre E, Llauro C, Debladis E, Akakpo R, Hsing YI, Panaud O. 2019. Retrotranspositional landscape of Asian rice revealed by 3000 genomes. Nat Commun. 10:24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chase MW, Samuel RM, Leitch AR, Guignard MS, Conran JG, Nollet F, Fletcher P, Jakob A, Cauz-Santos LA, Vignolle G, et al. 2022. Down, then up: non-parallel genome size changes and a descending chromosome series in a recent radiation of Australian allotetraploid plant species in Nicotiana section Suaveolentes (Solanaceae). Ann Bot. Advance Access published January 13, 2022, doi: 10.1093/aob/mcac006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen ZJ. 2007. Genetic and epigenetic mechanisms for gene expression and phenotypic variation in plant polyploids. Annu Rev Plant Biol. 58:377–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen ZJ, Sreedasyam A, Ando A, Song Q, De Santiago LM, Hulse-Kemp AM, Ding M, Ye W, Kirkbride RC, Jenkins J, et al. 2020. Genomic diversifications of five Gossypium allopolyploid species and their impact on cotton improvement. Nat Genet. 52:525–533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng F, Sun C, Wu J, Schnable J, Woodhouse MR, Liang J, Cai C, Freeling M, Wang X. 2016. Epigenetic regulation of subgenome dominance following whole genome triplication in Brassica rapa. New Phytol. 211:288–299. [DOI] [PubMed] [Google Scholar]
- Chuong EB, Elde NC, Feschotte C. 2017. Regulatory activities of transposable elements: from conflicts to benefits. Nat Rev Genet. 18:71–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Comai L. 2005. The advantages and disadvantages of being polyploid. Nature 6:836–846. [DOI] [PubMed] [Google Scholar]
- Dehal P, Boore JL. 2005. Two rounds of whole genome duplication in the ancestral vertebrate. PLoS Biol. 3:e314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dellaporta SL, Wood J, Hicks JB. 1983. A plant DNA, minipreparation: version II. Plant Mol Biol Rep. 1:19–21. [Google Scholar]
- Devos KM. 2005. Updating the “crop circle”. Curr Opin Plant Biol. 8:155–162. [DOI] [PubMed] [Google Scholar]
- Devos N, Raspé O, Oh S-H, Tyteca D, Jacquemart A-L. 2006. The evolution of Dactylorhiza (Orchidaceae) allotetraploid complex: insights from nrDNA sequences and cpDNA PCR-RFLP data. Mol Phylogenet Evol. 38:767–778. [DOI] [PubMed] [Google Scholar]
- Dijk E, Grootjans AB. 1998. Performance of four Dactylorhiza species over a complex trophic gradient. Acta Bot Neerl. 47:351–368. [Google Scholar]
- Dodsworth S, Guignard MS, Pérez-Escobar OA, Struebig M, Chase MW, Leitch AR. 2020. Repetitive DNA restructuring across multiple Nicotiana allopolyploidisation events shows a lack of strong cytoplasmic bias in influencing repeat turnover. Genes 11:216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doyle JJ, Doyle JL. 1990. Isolation of plant DNA from fresh tissue. Focus 12:13–15. [Google Scholar]
- Doyle JJ, Flagel LE, Paterson AH, Rapp RA, Soltis DE, Soltis PS, Wendel JF. 2008. Evolutionary genetics of genome merger and doubling in plants. Annu Rev Genet 42:443–461. [DOI] [PubMed] [Google Scholar]
- Dubin MJ, Mittelsten Scheid O, Becker C. 2018. Transposons: a blessing curse. Curr Opin Plant Biol. 42:23–29. [DOI] [PubMed] [Google Scholar]
- Edger PP, Smith R, McKain MR, Cooley AM, Vallejo-Marin M, Yuan Y, Bewick AJ, Ji L, Platts AE, Bowman MJ, et al. 2017. Subgenome dominance in an interspecific hybrid, synthetic allopolyploid, and a 140-year-old naturally established neo-allopolyploid monkeyflower. Plant Cell. 29:2150–2167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eilam T, Anikster YA, Millet EM, Manisterski J, Feldman MF. 2009. Genome size in natural and synthetic autopolyploids and in a natural segmental allopolyploid of several Triticeae species. Genome 52:275–285. [DOI] [PubMed] [Google Scholar]
- Elliott TA, Gregory TR. 2015. What’s in a genome? The C-value enigma and the evolution of eukaryotic genome content. Philos Trans R Soc Lond B Biol Sci. 370:20140331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Emadzade K, Jang TS, Macas J, Kovařík A, Novák P, Parker J, Weiss-Schneeweiss H. 2014. Differential amplification of satellite PaB6 in chromosomally hypervariable Prospero autumnale complex (Hyacinthaceae). Ann Bot. 114:1597–1608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eriksson MC, Szukala A, Tian B, Paun O. 2020. Current research frontiers in plant epigenetics: an introduction to a virtual Issue. New Phytol. 226:285–288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gaebelein R, Schiessl SV, Samans B, Batley J, Mason AS. 2019. Inherited allelic variants and novel karyotype changes influence fertility and genome stability in Brassica allohexaploids. New Phytol. 223:965–978. [DOI] [PubMed] [Google Scholar]
- Galbraith DW, Harkins KR, Maddox JM, Ayres NM, Sharma DP, Firoozabady E. 1983. Rapid flow cytometric analysis of the cell cycle in intact plant tissues. Science 220:1049–1051. [DOI] [PubMed] [Google Scholar]
- Garrido-Ramos MA. 2017. Satellite DNA: an evolving topic. Genes 8:230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garsmeur O, Schnable JC, Almeida A, Jourda C, D’Hont A, Freeling M. 2014. Two evolutionarily distinct classes of paleopolyploidy. Mol Biol Evol. 31:448–454. [DOI] [PubMed] [Google Scholar]
- Giraud D, Lima O, Huteau V, Coriton O, Boutte J, Kovarik A, Leitch AR, Leitch IJ, Ainouche M, Salmon A. 2020. Evolutionary dynamics of transposable elements and satellite DNAs in polyploid Spartina species. Plant Sci. 302:110671. [DOI] [PubMed] [Google Scholar]
- Greilhuber J, Ebert I. 1994. Genome size variation in Pisum sativum. Genome 37:646–655. [DOI] [PubMed] [Google Scholar]
- Greilhuber J, Temsch EM, Loureiro JCM. 2007. Nuclear DNA content measurement. In: Doležel J, Greilhuber J, Suda J, editors. Flow cytometry with plant cells, analysis of genes, chromosomes and genomes. Weinheim: Wiley. p. 67–101. [Google Scholar]
- Hagerup O. 1938. Studies on the significance of polyploidy. II. Orchis. Hereditas 24:258–264. [Google Scholar]
- Hawranek AS. 2021. The impact of recurrent origins and gene flow on the genetic structure of allopolyploid marsh orchids (Dactylorhiza, Orchidaceae) [MSc thesis]. [Vienna (Austria)]:University of Vienna. [Google Scholar]
- Hedrén M. 1996. Genetic differentiation, polyploidization and hybridization in northern European Dactylorhiza (Orchidaceae): evidence from allozyme markers. Plant Syst Evol. 201:31–55. [Google Scholar]
- Hedrén M, Nordström S, Bateman RM. 2011. Plastid and nuclear DNA marker data support the recognition of four tetraploid marsh orchids (Dactylorhiza majalis s.l. Orchidaceae) in Britain and Ireland, but require their recircumscription. Biol J Linn Soc. 104:107–128. [Google Scholar]
- Heslop-Harrison J. 1953. Microsporogenesis in some triploid Dactylorchid hybrids. Ann Bot. 17:539–549. [Google Scholar]
- Holmen K, Kaad P. 1956. Ober Dactylorchis traunsteineri auf der Insel Läsö. Botanisk tidsskrift. 53:35–48. [Google Scholar]
- Inda LA, Pimentel M, Chase MW. 2012. Phylogenetics of tribe Orchideae (Orchidaceae: Orchidoideae) based on combined DNA matrices: inferences regarding timing of diversification and evolution of pollination syndromes. Ann Bot. 110:71–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jackson SA, Chen ZJ. 2010. Genomic and expression plasticity of polyploidy. Curr Opin Plant Biol. 13:153–159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiao Y, Wickett NJ, Ayyampalayam S, Chanderbali AS, Landherr L, Ralph PE, Tomsho LP, Hu Y, Liang H, Soltis PS, et al. 2011. Ancestral polyploidy in seed plants and angiosperms. Nature 473:97–100. [DOI] [PubMed] [Google Scholar]
- Jonsell B. 1982. Ängsnycklar och sumpnycklar i nordligaste Uppland. Svensk Botanisk Tidskrift. 76:103–111. [Google Scholar]
- Kashkush K, Feldman M, Levy AA. 2003. Transcriptional activation of retrotransposons alters the expression of adjacent genes in wheat. Nat Genet. 33:102–106. [DOI] [PubMed] [Google Scholar]
- Kelly LJ, Renny-Byfield S, Pellicer J, Macas J, Novák P, Neumann P, Lysak MA, Day PD, Berger M, Fay MF, et al. 2015. Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size. New Phytol. 208:596–607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kidwell MG. 2002. Transposable elements and the evolution of genome size in eukaryotes. Genetica 115:49–63. [DOI] [PubMed] [Google Scholar]
- Koukalova B, Moraes AP, Renny-Byfield S, Matyasek R, Leitch AR, Kovarik A. 2010. Fall and rise of satellite repeats in allopolyploids of Nicotiana over c. 5 million years. New Phytol. 186:148–160. [DOI] [PubMed] [Google Scholar]
- Kovarik A, Pires JC, Leitch AR, Lim KY, Sherwood AM, Matyasek R, Rocca J, Soltis DE, Soltis PS. 2005. Rapid concerted evolution of nuclear ribosomal DNA in two Tragopogon allopolyploids of recent and recurrent origin. Genetics 169:931–944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leitch IJ, Bennett MD. 2004. Genome downsizing in polyploid plants. Biol J Linn Soc 82:651–663. [Google Scholar]
- Leitch AR, Lim KY, Skalicka K, Kovarik A. 2006. Nuclear cytoplasmic interaction hypothesis and the role of translocations In Nicotiana allopolyploids. In: Cigna AA, Durante M, editors. Radiation risk estimates in normal and emergency situations. Dordrecht: Springer. p. 319–326. [Google Scholar]
- Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Z, Tiley GP, Galuska SR, Reardon CR, Kidder TI, Rundell RJ, Barker MS. 2018. Multiple large-scale gene and genome duplications during the evolution of hexapods. Proc Natl Acad Sci U S A. 115:4713–4718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lord RM, Richards AJ. 1977. A hybrid swarm between the diploid Dactylorhiza fuchsii (Druce) Soó and the tetraploid D. purpurella (T. & T.A.Steph.) Soó in Durham. Watsonia 11:205–210. [Google Scholar]
- Lövkvist B, Hulrgård UM. 1999. Chromosome numbers in South Swedish vascular plants. Opera Bot. 137:1–42. [Google Scholar]
- Lynch M, Force A. 2000. The probability of duplicate gene preservation by subfunctionalization. Genetics 154:459–473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mandáková T, Lysak MA. 2016. Painting of Arabidopsis chromosomes with chromosome-specific BAC clones. Curr Protoc Plant Biol. 1:359–371. [DOI] [PubMed] [Google Scholar]
- Martin M. 2011. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 17:11–12. [Google Scholar]
- Matyášek R, Tate JA, Lim YK, Šrubařová H, Koh J, Leitch AR, Soltis DE, Soltis PS, Kovařík A. 2007. Concerted evolution of rDNA in recently formed Tragopogon allotetraploids is typically associated with an inverse correlation between gene copy number and expression. Genetics 176:2509–2519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mayrose I, Zhan SH, Rothfels CJ, Magnuson-Ford K, Barker MS, Rieseberg LH, Otto SP. 2011. Recently formed polyploid plants diversify at lower rates. Science 333:1257. [DOI] [PubMed] [Google Scholar]
- McCann J, Jang T-S, Macas J, Schneeweiss GM, Matzke NJ, Novák P, Stuessy TF, Villaseñor JL, Weiss-Schneeweiss H. 2018. Dating the species network: allopolyploidy and repetitive DNA evolution in American daisies (Melampodium sect. Melampodium, Asteraceae). Syst Biol. 67:1010–1024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McClintock B. 1984. The significance of responses of the genome to challenge. Science 226:792–801. [DOI] [PubMed] [Google Scholar]
- Mhiri C, Parisod C, Daniel J, Petit M, Lim KY, Dorlhac de Borne F, Kovařík A, Leitch AR, Grandbastien M-A.. 2019. Parental transposable element loads influence their dynamics in young Nicotiana hybrids and allotetraploids. New Phytol. 221:1619–1633. [DOI] [PubMed] [Google Scholar]
- Nelson E. 1976. Monographie und Ikonographie der Orchidaceen-Gattung Dactylorhiza. Zürich: Speich. [Google Scholar]
- Neumann P, Novák P, Hoštáková N, Macas J. 2019. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mob DNA. 10:1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Novák P, Guignard MS, Neumann P, Kelly LJ, Mlinarec J, Koblížková A, Dodsworth S, Kovařík A, Pellicer J, Wang W, et al. 2020a. Repeat-sequence turnover shifts fundamentally in species with large genomes. Nat Plants. 6:1325–1329. [DOI] [PubMed] [Google Scholar]
- Novák P, Neumann P, Macas J. 2020b. Global analysis of repetitive DNA from unassembled sequence reads using RepeatExplorer2. Nat Prot. 15:3745–3776. [DOI] [PubMed] [Google Scholar]
- Novák P, Neumann P, Pech J, Steinhaisl J, Macas J. 2013. RepeatExplorer: a galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads. Bioinformatics 29:792–793. [DOI] [PubMed] [Google Scholar]
- Novák P, Robledillo LÁ, Koblížková A, Vrbová I, Neumann P, Macas J. 2017. TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads. Nucleic Acids Res. 45:e111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oliver KR, McComb JA, Greene WK. 2013. Transposable elements: powerful contributors to angiosperm evolution and diversity. Genome Biol Evol. 5:1886–1901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Otto FJ, Oldiges H, Göhde W, Jain VK. 1981. Flow cytometric measurement of nuclear DNA content variations as a potential in vivo mutagenicity test. Cytometry 2:189–191. [DOI] [PubMed] [Google Scholar]
- Ozkan H, Tuna M, Arumuganathan K. 2003. Nonadditive changes in genome size during allopolyploidization in the wheat (Aegilops-Triticum) group. J Hered. 94:260–264. [DOI] [PubMed] [Google Scholar]
- Palacios-Gimenez OM, Milani D, Song H, Marti DA, López-León MD, Ruiz-Ruano FJ, Camacho JPM, Cabral-De-Mello DC, O’Neill R. 2020. Eight million years of satellite DNA evolution in grasshoppers of the genus Schistocerca illuminate the ins and outs of the library hypothesis. Genome Biol Evol. 12:88–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parisod C, Alix K, Just J, Petit M, Sarilar V, Mhiri C, Ainouche M, Chalhoub B, Grandbastien M-A. 2010. Impact of transposable elements on the organization and function of allopolyploid genomes. New Phytol 186:37–45. [DOI] [PubMed] [Google Scholar]
- Parisod C, Salmon A, Zerjal T, Tenaillon M, Grandbastien M-A, Ainouche M. 2009. Rapid structural and epigenetic reorganization near transposable elements in hybrid and allopolyploid genomes in Spartina. New Phytol. 184:1003–1015. [DOI] [PubMed] [Google Scholar]
- Paun O, Bateman RM, Fay MF, Hedrén M, Civeyrel L, Chase MW. 2010. Stable epigenetic effects and adaptation in allopolyploid orchids (Dactylorhiza: Orchidaceae). Mol Biol Evol. 27:2465–2473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paun O, Bateman RM, Fay MF, Luna JA, Moat J, Hedrén M, Chase MW. 2011. Altered gene expression and ecological divergence in sibling allopolyploids of Dactylorhiza (Orchidaceae). BMC Evol Biol. 11:113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paun O, Fay MF, Soltis DE, Chase MW. 2007. Genetic and epigenetic alterations after hybridization and genome doubling. Taxon 56:649–656. [PMC free article] [PubMed] [Google Scholar]
- Paun O, Forest F, Fay MF, Chase MW. 2009. Hybrid speciation in angiosperms: parental divergence drives ploidy. New Phytol. 182:507–518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pillon Y, Fay MF, Hedrén M, Bateman RM, Devey DS, Shipunov AB, van der Bank M, Chase MW. 2007. Evolution and temporal diversification of western European polyploid species complexes in Dactylorhiza (Orchidaceae). Taxon 56:1185–1208. [Google Scholar]
- Ramsey J, Schemske DW. 2002. Neopolyploidy in flowering plants. Annu Rev Ecol Syst. 33:589–639. [Google Scholar]
- Renny-Byfield S, Chester M, Kovařík A, Le Comber SC, Grandbastien M-A, Deloger M, Nichols RA, Macas J, Novák P, Chase MW, et al. 2011. Next generation sequencing reveals genome downsizing in allotetraploid Nicotiana tabacum, predominantly through the elimination of paternally derived repetitive DNAs. Mol Biol Evol. 28:2843–2854. [DOI] [PubMed] [Google Scholar]
- Ruiz-Ruano FJ, López-León MD, Cabrero J, Camacho JPM. 2016. High-throughput analysis of the satellitome illuminates satellite DNA evolution. Sci Rep. 6:28333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Samoluk SS, Robledo G, Bertioli D, Seijo JG. 2017. Evolutionary dynamics of an AT-rich satellite DNA and its contribution to karyotype differentiation in wild diploid Arachis species. Mol Genet Genomics. 292:283–296. [DOI] [PubMed] [Google Scholar]
- Sankoff D, Zheng C, Wang B. 2012. A model for biased fractionation after whole genome duplication. BMC Genomics. 13:S8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sarilar V, Palacios PM, Rousselet A, Ridel C, Falque M, Eber F, Chèvre A-M, Joets J, Brabant P, Alix K. 2013. Allopolyploidy has a moderate impact on restructuring at three contrasting transposable element insertion sites in resynthesized Brassica napus allotetraploids. New Phytol. 198:593–604. [DOI] [PubMed] [Google Scholar]
- Sigman MJ, Slotkin RK. 2016. The first rule of plant transposable element silencing: location, location, location. Plant Cell. 28:304–313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soltis DE, Buggs RJA, Doyle JJ, Soltis PS. 2010. What we still don’t know about polyploidy. Taxon 59:1387–1403. [Google Scholar]
- Soltis DE, Soltis PS. 1999. Polyploidy: recurrent formation and genome evolution. Trends Ecol Evol. 14:348–352. [DOI] [PubMed] [Google Scholar]
- Soltis PS, Soltis DE. 2016. Ancient WGD events as drivers of key innovations in angiosperms. Curr Opin Plant Biol. 30:159–165. [DOI] [PubMed] [Google Scholar]
- Soltis DE, Soltis PS, Pires JC, Kovařík A, Tate JA, Mavrodiev E. 2004. Recent and recurrent polyploidy in Tragopogon (Asteraceae): cytogenetic, genomic and genetic comparisons. Biol J Linn Soc. 82:485–501. [Google Scholar]
- Song K, Lu P, Tang K, Osborn TC. 1995. Rapid genome change in synthetic polyploids of Brassica and its implications for polyploid evolution. Proc Natl Acad Sci U S A. 92:7719–7723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sonnhammer ELL, Durbin R. 1995. A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene 167:GC1–GC10. [DOI] [PubMed] [Google Scholar]
- Springer NM, Lisch D, Li Q. 2016. Creating order from chaos: epigenome dynamics in plants with complex genomes. Plant Cell. 28:314–325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Temsch EM, Greilhuber J, Krisai R. 2010. Genome size in liverworts. Preslia 82:63–80. [Google Scholar]
- Touchon M, Rocha EPC. 2007. Causes of insertion sequences abundance in prokaryotic genomes. Mol Biol Evol. 24:969–981. [DOI] [PubMed] [Google Scholar]
- Trucchi E, Mazzarella AB, Gilfillan GD, Lorenzo Romero M, Schönswetter P, Paun O. 2016. BsRADseq: screening DNA methylation in natural populations of non-model species. Mol Ecol. 25:1697–1713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van de Peer Y, Maere S, Meyer A. 2009. The evolutionary significance of ancient genome duplications. Nat Rev Genet. 10:725–732. [DOI] [PubMed] [Google Scholar]
- Van de Peer Y, Mizrachi E, Marchal K. 2017. The evolutionary significance of polyploidy. Nat Rev Genet. 18:411–424. [DOI] [PubMed] [Google Scholar]
- Vermeulen R. 1938. Chromosomes in Orchis. Chron Bot. 4:107–108. [Google Scholar]
- Vicient CM, Casacuberta JM. 2017. Impact of transposable elements on polyploid plant genomes. Ann Bot. 120:195–207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang D, Zheng Z, Li Y, Hu H, Wang Z, Du X, Zhang S, Zhu M, Dong L, Ren G, et al. 2021. Which factors contribute most to genome size variation within angiosperms? Ecol Evol. 11:2660–2668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weissensteiner MH, Bunikis I, Catalan A, Francoijs K-J, Knief U, Heim W, Peona V, Pophaly SD, Sedlazeck FJ, Suh A, et al. 2020. Discovery and population genomics of structural variation in a songbird genus. Nat Commun. 11:3403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wendel JF. 2015. The wondrous cycles of polyploidy in plants. Am J Bot. 102:1753–1756. [DOI] [PubMed] [Google Scholar]
- Wendel JF, Jackson SA, Meyers BC, Wing RA. 2016. Evolution of plant genome architecture. Genome Biol. 17:37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wendel JF, Lisch D, Hu G, Mason AS. 2018. The long and short of doubling down, polyploidy, epigenetics, and the temporal dynamics of genome fractionation. Curr Opin Genet Dev. 49:1–7. [DOI] [PubMed] [Google Scholar]
- Willing EM, Rawat V, Mandáková T, Maumus F, James GV, Nordström KJV, Becker C, Warthmann N, Chica C, Szarzynska B, et al. 2015. Genome expansion of Arabis alpina linked with retrotransposition and reduced symmetric DNA methylation. Nat Plants. 1:14023. [DOI] [PubMed] [Google Scholar]
- Wolfe T, Balao F, Trucchi E, Bachmann G, Gu W, Baar J, Hedrén M, Weckwerth W, Leitch AR, Paun O. 2021. Recurrent allopolyploidization events diversify eco-physiological traits in marsh orchids. bioRxiv, 2021 Aug 28. doi: 10.1101/2021.08.28.458039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Woodhouse MR, Cheng F, Pires JC, Lisch D, Freeling M, Wang X. 2014. Origin, inheritance, and gene regulatory consequences of genome dominance in polyploids. Proc Natl Acad Sci U S A. 111:5283–5288. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw Illumina sequencing data underlying this article are available in NCBI SRA database at https://www.ncbi.nlm.nih.gov/sra under the BioProject PRJNA845973 (accession numbers SRX15595094-SRX15595128).






