Skip to main content
eLife logoLink to eLife
. 2024 Jan 8;12:RP88398. doi: 10.7554/eLife.88398

Separating phases of allopolyploid evolution with resynthesized and natural Capsella bursa-pastoris

Tianlin Duan 1,, Adrien Sicard 2, Sylvain Glémin 1,3, Martin Lascoux 1,
Editors: Regina S Baucom4, Christian R Landry5
PMCID: PMC10945474  PMID: 38189348

Abstract

Allopolyploidization is a frequent evolutionary transition in plants that combines whole-genome duplication (WGD) and interspecific hybridization. The genome of an allopolyploid species results from initial interactions between parental genomes and long-term evolution. Distinguishing the contributions of these two phases is essential to understanding the evolutionary trajectory of allopolyploid species. Here, we compared phenotypic and transcriptomic changes in natural and resynthesized Capsella allotetraploids with their diploid parental species. We focused on phenotypic traits associated with the selfing syndrome and on transcription-level phenomena such as expression-level dominance (ELD), transgressive expression (TRE), and homoeolog expression bias (HEB). We found that selfing syndrome, high pollen, and seed quality in natural allotetraploids likely resulted from long-term evolution. Similarly, TRE and most down-regulated ELD were only found in natural allopolyploids. Natural allotetraploids also had more ELD toward the self-fertilizing parental species than resynthesized allotetraploids, mirroring the establishment of the selfing syndrome. However, short-term changes mattered, and 40% of the cases of ELD in natural allotetraploids were already observed in resynthesized allotetraploids. Resynthesized allotetraploids showed striking variation of HEB among chromosomes and individuals. Homoeologous synapsis was its primary source and may still be a source of genetic variation in natural allotetraploids. In conclusion, both short- and long-term mechanisms contributed to transcriptomic and phenotypic changes in natural allotetraploids. However, the initial gene expression changes were largely reshaped during long-term evolution leading to further morphological changes.

Research organism: Other

Introduction

Allopolyploidization is the coupling of whole genome duplication and interspecific hybridization, resulting in organisms possessing two or more diverged genomes. This intriguing evolutionary transition is widespread in nature (Albertin and Marullo, 2012; Barker et al., 2016) and is of agricultural importance (Behling et al., 2020). Allopolyploidization is expected to have both short-term and long-term consequences: not only can the merging of divergent genomes itself be seen as a macromutation but also it triggers subsequent genomic changes over distinct time scales.

Right after allopolyploidization or within a few generations, various genomic and transcriptomic changes can be caused by a series of mechanisms, including DNA methylation repatterning (Edger et al., 2017; Li et al., 2019), reactivation of transposable elements (TE, reviewed in Vicient and Casacuberta, 2017), chromosome rearrangements, including homoeologous exchanges (Parisod et al., 2009; Szadkowski et al., 2010; Lashermes et al., 2014; Xiong et al., 2021) and intergenomic interactions between regulatory elements (Shi et al., 2012; Hu and Wendel, 2019). These multifaceted effects were initially proposed to be dramatic but are likely smoother and more subtle than initially thought. The fact remains that these short-term mechanisms add further complexity to the genetic variation gathered from parental lineages. Genetic changes can reinforce some initial epigenetic changes, leading to long-term heritable consequences in established allopolyploids. For instance, an epigenetically downregulated/silenced gene copy is more likely to degenerate than the other copy due to weaker purifying selection.

Apart from instant genomic changes, allopolyploidization also alters multiple genetic attributes, impacting the long-term evolution of allopolyploid genomes. First, as a minority cytotype, newly formed allopolyploid populations often experience a bottleneck (Levin, 1975; Novikova et al., 2017; Griffiths et al., 2019). This bottleneck reduces genetic variation within allopolyploid species and favors the fixation of neutral or slightly deleterious mutations (Novikova et al., 2017). Second, with an extra genome, allotetraploid species could undergo a period of relaxed purifying selection (Lynch and Conery, 2000; Douglas et al., 2015; Paape et al., 2018). Relaxed selection also accelerates the accumulation of deleterious mutations on allopolyploid genomes. At the same time, it facilitates neofunctionalization by allowing functional mutations to accumulate in one paralog while maintaining the ancestral function through the second (Ohno, 1970). Third, allopolyploidization immediately distorts both the relative and absolute dosage of gene product, which further alters physiological balance and efficiency (Anneberg and Segraves, 2020; Yu et al., 2021; Domínguez-Delgado et al., 2021). In the long term, both relative and absolute dosages of gene expression of allopolyploid genomes are expected to be under selection (Bekaert et al., 2011), and gradually adapt to a polyploid or hybrid state (Bomblies, 2020). Under the joint action of these forces, allopolyploid subgenomes are further co-adapted and degenerated, and subgenomes are often biasedly retained, termed biased fractionation (Schnable et al., 2011; Tang et al., 2012; Renny-Byfield et al., 2015; Wendel et al., 2018).

Both short-term reactions and long-term evolution can generate novel evolutionary opportunities and potentially allow allopolyploid lineages to have advantages in adaptation to novel environments (Baniaga et al., 2020). In established allopolyploids, phenomena caused by long-term evolutionary forces can be confounded by traces of short-term genomic changes. The relative contributions of short- and long-term mechanisms to genomic changes in allopolyploids can be assessed by comparing established natural allopolyploids with resynthesized allopolyploids (e.g. Wang et al., 2006; Wang et al., 2016; Buggs et al., 2011; Yoo et al., 2013; Zhang et al., 2016b).

Variation and novelties in gene expression caused by allopolyploidization are often assessed by homoeolog expression bias (HEB) and non-additive gene expression (Grover et al., 2012; Yoo et al., 2013; Zhang et al., 2016a; Wu et al., 2018; Shan et al., 2020). HEB measures the separate contributions of gene copies from different parental species (homoeologs) and non-additive gene expression measures the deviation of the total expression of both homoeologs from an intermediate value between parental species. Non-additive patterns of gene expressions are further classified as expression level dominance (ELD) and transgressive expression (TRE). ELD means that the total expression of both homoeologs is similar to the expression level of only one parental species (Grover et al., 2012), but differs from the expression level of the other. TRE means that gene expression in allopolyploids is higher or lower than in both parental species. Variation of HEB and non-additive gene expression in allopolyploids can be triggered by several mechanisms in the early generations of the new allopolyploid; or alternatively, they may arise during long-term evolution due to either neutral or selective processes.

Capsella bursa-pastoris is a natural allotetraploid plant species which originated about 100,000 years ago (Douglas et al., 2015). Two diploid species, C. orientalis and C. grandiflora, are extant relatives of the maternal and paternal progenitors (hereinafter referred to as parental species) of C. bursa-pastoris, respectively (Hurka et al., 2012; Douglas et al., 2015). C. grandiflora is self-incompatible (SI), but C. orientalis was already self-compatible (SC) before the formation of C. bursa-pastoris (Bachmann et al., 2019). C. bursa-pastoris is also a self-compatible species, with typical selfing-syndrome characteristics. In particular, it has smaller petals, fewer pollen grains, and shorter styles than the outcrossing C. grandiflora (Neuffer and Paetsch, 2013). Yet, it remains unclear whether the inconspicuous flower phenotypes of C. bursa-pastoris only reflect the dominance relationship of the parental alleles or if these traits have also evolved post-allopolyploidization.

Natural C. bursa-pastoris exhibits disomic inheritance (Hurka et al., 1989; Roux and Pannell, 2015), with which chromosomes only recombine and segregate with their homologs during meiosis, but not with homoeologs. However, the strictness of disomic inheritance in C. bursa-pastoris has not been tested. In general, the two subgenomes of C. bursa-pastoris are still well-retained and functional. There is no sign of large-scale gene loss or silencing, although purifying selection has been weaker genome-wide (Douglas et al., 2015), and the C. orientalis-derived subgenome (Cbp_co) has accumulated more putatively deleterious mutations than the C. grandiflora-derived subgenome (Cbp_cg), both before and after the formation of C. bursa-pastoris (Douglas et al., 2015; Kryvokhyzha et al., 2019a). The majority of genes are expressed from both homoeologs, and on average there is only a slight HEB toward Cbp_cg homoeologs (Douglas et al., 2015; Kryvokhyzha et al., 2019a). Most genes are additively expressed in natural C. bursa-pastoris, but ELD and transgressive gene expression have also been observed (Kryvokhyzha et al., 2019a). Despite the moderate HEB and non-additive expression, gene expression in C. bursa-pastoris showed some striking tissue-specific features (Kryvokhyzha et al., 2019a). In flowers, gene expression levels in C. bursa-pastoris resembled those in C. orientalis, while in leaves and roots, gene expression levels were more similar to those in C. grandiflora.

In contrast to the drastic genomic or transcriptomic changes observed in allopolyploid wheat (Zhang et al., 2016a), Brassica (Szadkowski et al., 2010; Lloyd et al., 2018), and Tragopogon (Chester et al., 2012), natural C. bursa-pastoris represents another paradigm where established allopolyploid species show only mild genomic changes and expression bias. This contrast raises questions. Was the genome of natural C. bursa-pastoris less affected by putative short-term mechanisms, or was it the result of 100,000 years’ evolution, which filtered out or compensated for the initial drastic changes? What are the relative strengths of short-term mechanisms and long-term evolution in shaping genomic and phenotypic variation in allopolyploids?

Resynthesized allopolyploids are the closest approximation to the early stage of natural allopolyploids. They provide a reference point for separating the short-term effects of allopolyploidization from long-term evolutionary changes. The present study builds upon Duan et al., 2023, which showed that hybridization played a much larger role than whole genome doubling during the creation of resynthesized polyploids in the Capsella genus. Here, we compared transcriptomes and phenotypes of resynthesized C. bursa-pastoris-like allotetraploids with natural C. bursa-pastoris and its two diploid progenitors. We focused on teasing apart the contributions from short- and long-term processes to (1) phenotypes, (2) non-additive gene expression, and (3) HEB in Capsella allotetraploids.

Results

The selfing syndrome was observed in natural C. bursa-pastoris but not in resynthesized allotetraploids

The breakdown of self-incompatibility in allotetraploid Capsella can directly result from hybridizing with the self-fertilizing species (Bachmann et al., 2021; Duan et al., 2023). We explored to what extent the development of a selfing syndrome was instantly achieved after allopolyploidization or, instead, developed later on by comparing phenotypes of resynthesized allotetraploids (groups Sd and Sh), natural C. bursa-pastoris (Cbp) and the diploid parental species, C. grandiflora (Cg2) and C. orientalis (Co2). The resynthesized allotetraploids were generated with individuals from one population of each diploid parental species (Figure 1), and the Sd (‘WGD-first‘) and Sh (‘hybridization-first’) groups only differed in the order of WGD and hybridization (Duan et al., 2023). There were six ‘lines’ in each of the five plant groups. For the Sd and Sh groups, each line represented an independent allopolyploidization event, while the six lines of natural C. bursa-pastoris were from six different populations (Figure 1c and Figure 1—source data 1), representing three major genetic clusters of the wild C. bursa-pastoris (Kryvokhyzha et al., 2016). For diploid parental groups, a line referred to a full-sibling family resulting from self-fertilization (Co2) or one controlled cross (Cg2). Phenotypes of the five groups were measured in a growth chamber on about 36 individuals per plant group (6 individuals × 6 lines).

Figure 1. Plant material used in the present study.

Figure 1.

(a) Five groups of Capsella plants. Diploid species (Co2 and Cg2 groups) and the second generation of resynthesized allotetraploids (Sd and Sh groups) were from Duan et al., 2023. Samples of natural allotetraploids, C. bursa-pastoris, were added to the present study. (b) Phylogenetic relationship of the three natural species used in the present study, modified from Douglas et al., 2015; C. bursa-pastoris originated from the hybridization between the ancestral population of C. orientalis and the (C. grandiflora + C. rubella) lineage, and C. rubella were omitted from the figure; kya: thousand years ago. (c) Geographic origin of the Capsella samples.

Figure 1—source data 1. Capsella plants used in the present study.

Resynthesized and natural allotetraploids had distinct floral morphologies (Figure 2a–g). Indeed, natural allotetraploids had significantly shorter and narrower petals, sepals and pistils and shorter stamina than resynthesized allotetraploids (one-way ANOVA, F4,160 > 78 and p<0.001 in all seven tests; Tukey’s HSD test, α=0.01). Pollen and seed production was also affected. Natural allotetraploids had fewer pollen grains per flower (Figure 2i). While the number of pollen grains in resynthesized allotetraploids was intermediate between the two parental species, the number of pollen grains of the natural allotetraploid group was now similar to that of the Co2 group (one-way ANOVA, F4,137 = 164.6, p<0.001; Tukey’s HSD test, α=0.01). Moreover, the number of seeds per fruit in natural allotetraploids was much larger than in resynthesized allotetraploids. The resynthesized allotetraploid groups had a similar number of seeds in 10 fruits to that of the Cg2 group, whereas the number of seeds in 10 fruits in the natural allotetraploid group was even higher than that of the Co2 group (Figure 2j; one-way ANOVA, F4,146 = 152.5, p<0.001; Tukey’s HSD test, α=0.01).

Figure 2. Phenotypic traits of the five Capsella groups.

Figure 2.

Co2: diploid C. orientalis; Cg2: diploid C. grandiflora; Sd: whole-genome-duplication-first resynthesized allotetraploids; Sh: hybridization-first resynthesized allotetraploids; Cbp: natural allotetraploid C. bursa-pastoris. The measured traits were (a) petal length, (b) sepal length, (c) pistil length, (d) stamen length, (e) petal width, (f) sepal length, (g) pistil width, (h) length of the longest stem, (i) number of pollen grains per flower, (j) number of seeds in ten fruits, (k) number of days from germination to the opening of the first flower, (l) proportion of viable pollen grains, and (m) proportion of normal seeds in ten fruits. Sample sizes are shown above the groups.

Figure 2—source data 1. Effects of plant group and positions (tray ID) on phenotypes.

The architecture and phenology of the whole plant were affected too. The stem length of natural Cbp was shorter than in resynthesized allotetraploids but was similar to the stem length of the Co2 group (Figure 2h; one-way ANOVA, F4,166 = 84.5, p<0.001; Tukey’s HSD test, α=0.01). Finally, plants of the Cbp group flowered earlier than those of the Sd group, but at a similar time as those of the Sh group (Figure 2k; one-way ANOVA with hc3 White’s correction, F4,165 = 49.2, p<0.001; Tukey’s HSD test, α=0.01).

Pollen viability and seed quality improved in natural Capsella allotetraploids

Pollen viability and the proportion of normal seeds were compared between resynthesized and natural allotetraploids. For both traits, we observed a decrease in pollen viability in resynthesized allotetraploids followed by recovery in natural Cbp. Both Sd and Sh groups had lower proportions of viable pollen than the diploid parental species, but the proportion of viable pollen in natural Cbp was similar to that of the diploid parental species (Figure 2l; GLM, quasi binomial, F4,137 = 24.4, p<0.001; Tukey’s HSD test, α=0.01). The resynthesized allotetraploids generated a higher proportion of abnormal seeds than the three natural species (Figure 2m; GLM, quasi binomial, F4,146 = 59.2, p<0.001; Tukey’s HSD test, α=0.01). The average percentage of normal seeds was 69.6±4.3% in the Sd group and 77.5±2.2% in the Sh group. In contrast, the natural Cbp had almost no abnormal seeds, with a percentage of normal seeds of 99.6±0.8%.

A two-step evolution of the global expression pattern of natural allopolyploid Cbp

To compare the gene expression pattern of the five plant groups, RNA-sequencing was conducted for one individual per line and six lines per group, using young inflorescences (flowers) and leaves. Expression levels were determined for 21,937 genes in flower samples and 18,999 genes in leaf samples after excluding genes with CPM >1 in less than two samples. The overall gene expression pattern was visualized with multi-dimensional scaling (MDS) analysis (Figure 3a and b). For unphased gene expression, the resynthesized allotetraploids lay between the diploid parental species in both flowers and leaves. Natural Cbp samples were also intermediate between parental species in the first dimension but were far from the resynthesized allotetraploids in the other dimension, showing the effect of long-term evolution in Cbp and possibly also the divergence between extant diploid species and the real progenitors.

Figure 3. Multi-dimensional scaling (MDS) analyses of gene or homoeolog expression in two tissues.

Down-sampled gene expressions were used to compare gene expression patterns of the five plant groups in either flowers (a) or leaves (b). The five groups of Capsella plants are: diploid parental species C. grandiflora (Cg2) and C. orientalis (Co2), hybridization-first (Sh) and whole-genome-duplication-first (Sd) resynthesized allotetraploids, and natural allotetraploid C. bursa-pastoris (Cbp). Separated homoeolog expressions in allotetraploids were then compared with rescaled gene expression of diploid groups in both flowers (c) and leaves (d). All the MDS analyses used genes with count-per-million (CPM) >1 in at least two samples, and expression levels were normalized with the trimmed mean of M-values (TMM) method.

Figure 3.

Figure 3—figure supplement 1. Differentially expressed genes (DEGs) in pair-wise contrasts among the five Capsella plant groups in flowers and leaves.

Figure 3—figure supplement 1.

Genes with expression (CPM >1) in at least two samples were used for the differential expression (DE) analysis. DE analysis was conducted with the R package edgeR (version 3.28.1; Robinson et al., 2010), using TMM normalized unphased gene expression levels. Genes with fold-change >2 and false discovery rate <0.05 were considered significant DEGs.
Figure 3—figure supplement 2. Summary of differential expression analyses of allotetraploid groups.

Figure 3—figure supplement 2.

The number of up-regulated and down-regulated differentially expressed genes (DEGs) in each group pair were annotated on plots. DE analysis was performed on 21,937 genes in flowers and 18,999 genes in leaves, with a threshold of FC >2 and FDR <0.05. Gene expression levels were normalized with the trimmed mean of M-values (TMM) method.

The expression levels of separate homoeologs in allotetraploids were determined with the diagnostic SNPs between the two diploid species. For the Sd, Sh, and Cbp groups, 52.8%, 53.7%, and 44.7% of the mapped reads could be assigned to one of the homoeologs, respectively. The expression pattern of each allotetraploid subgenome was more similar to the corresponding diploid progenitor (Figure 3c and d). The pattern of expression of resynthesized allotetraploids was intermediate between those of diploid progenitors and natural Cbp.

Differential expression analysis was performed among the five plant groups, using the down-sampled unphased gene expression data. With a threshold of FC >2 and FDR <0.05, no significant differentially expressed genes (DEGs) were found between the two resynthesized allotetraploid groups, while 311–2888 DEGs were revealed in other group contrasts (Figure 3—figure supplements 1 and 2). There are two salient features. First, compared to either diploid progenitor, most DEGs in both resynthesized and natural allotetraploid groups were up-regulated (Figure 3—figure supplement 2). The proportion of down-regulated DEGs increased, nevertheless, in natural allotetraploids. Second, both resynthesized allotetraploids, Sd and Sh, have much more DEG with Co2 than with Cg2. However, this is no longer the case in Cbp where the two comparisons yielded similar results.

Although we could not make a clear expectation for gene ontology (GO) terms that would be overrepresented in DEGs between resynthesized and natural allopolyploids, we are not the only study that compared newly resynthesized and established allopolyploids, and GO terms that were repeatedly revealed by exploratory analysis may give a hint for future studies. For this reason, we further performed a GO enrichment analysis, using genes that were differentially expressed in both Cbp-Sd and Cbp-Sh contrasts as the test set, and all expressed genes with GO annotations as the background set. In flowers or leaves, the top 10 most-enriched GO terms for biological processes were related to proteolysis, xenobiotic transport, regulation of translational fidelity, telomere maintenance and organization, DNA geometric change and duplex unwinding, cellular response to toxic substance, aminoacylation, peptidyl-lysine methylation, and heterochromatin formation and organization (Supplementary file 3). But after adjusting p-values with the Benjamini-Hochberg procedure, only GO term proteolysis (GO:0006508) was significantly overrepresented in DEGs between resynthesized and natural allotetraploids in leaves (Fisher’s exact test, adjusted p-value = 0.0175).

Both short- and long-term mechanisms contributed to expression level dominance in natural Cbp, but transgressive expressions were mainly from long-term evolution

Non-additive gene expression shared by natural allotetraploids may be triggered right after allopolyploidization by short-term deterministic mechanisms, such as intergenomic interactions of regulatory elements. Alternatively, non-additive expression may have been caused by mechanisms with stochastic effects or arose later during long-term evolution. We explored to what extent the non-additive expression shared by natural allotetraploids could reflect short-term deterministic mechanisms. By comparing gene expression levels in allotetraploids and diploid species, 21,647 genes in flowers and 18,758 genes in leaves were classified into one of the 10 expression categories, using the results of DE analysis on unphased gene expression (FC >2 and FDR <0.05). We focused on complete ELD and TREs: complete ELD is obtained when the gene expression level in an allopolyploid group is similar to that in one diploid group but not to the expression in the other diploid group, and TRE is detected when the gene expression level in an allopolyploid group is either higher or lower than in both diploid groups.

The percentage of genes showing complete ELD was altogether limited but doubled in natural allotetraploids relative to resynthesized allotetraploids (5.5% of genes in resynthesized allotetraploids and 10.2% in natural allotetraploids. Figure 4a and b and Figure 4—source data 1). Genes with ELD and the directions of ELD were highly shared between the two resynthesized allotetraploid groups (Figure 4c, Figure 4—figure supplement 1). The majority of these shared ELD were retained in natural allotetraploids (63.3% in flowers and 72.2% in leaves), suggesting that short-term deterministic mechanism contributed to ELD in natural allotetraploids. However, Cbp-specific ELD was also abundant, comprising more than half of the cases found in natural allotetraploids (56.6% in flowers and 60.8% in leaves), thereby showing the effects of long-term evolution.

Figure 4. Additive and non-additive expression in allotetraploid groups.

Sd: whole-genome-duplication-first resynthesized allotetraploids; Sh: hybridization-first resynthesized allotetraploids; Cbp: natural allotetraploid C. bursa-pastoris. (a) Number of genes that showed additive expression (ADD, including partial expression level dominance [ELD]), complete ELD, and transgressive expression (TRE) in each allotetraploid group. (b) Genes with complete ELD or TRE were further classified by whether they were up- or down-regulated in allotetraploids, and whether the expression level in allotetraploids was similar to C. grandiflora (Cg-ELD) or C. orientalis (Co-ELD). (c) Venn diagram of genes with complete ELD of the three allotetraploid groups in flowers, separated by directions of ELD.

Figure 4—source data 1. Additive and non-additive gene expression in allotetraploid groups.

Figure 4.

Figure 4—figure supplement 1. Genes showed expression level dominance (ELD) of the three allotetraploid groups in leaves.

Figure 4—figure supplement 1.

Venn diagram of genes showed ELD of the three allotetraploid groups in leaves, separated by directions of ELD.

The direction of ELD shifted between the resynthesized and natural allotetraploids (Figure 4a and b and Figure 4—source data 1). In resynthesized allotetraploids, most cases of ELD were up-regulated, and the number of genes with ELD toward C. grandiflora (Cg-ELD) was about twice of that toward C. orientalis (Co-ELD). Natural allotetraploids still had more up-regulated ELD than down-regulated ones, but the proportion of down-regulated cases increased. The proportion of Co-ELD had also increased in natural allotetraploids. In flowers, natural allotetraploids had more Co-ELD (1101) than Cg-ELD (938), and the number of Cg- and Co-ELDs were similar in leaves (Cg-ELD: 854, Co-ELD: 839).

Almost no TRE was found in resynthesized allotetraploids (less than five genes in either Sd or Sh group and in either tissue, Figure 4a and b, and Figure 4—source data 1). In contrast, about 1.3% of genes in Cbp showed TRE in both flowers and leaves.

Segregation and recombination of homoeologous chromosomes were a major source of homoeolog expression bias variation in resynthesized Capsella allotetraploids

HEB of genes in an allotetraploid individual was measured by the ratio of expression of the C. grandiflora-origin homoeolog (cg) to the total expression of both homoeologs (cg/(cg +co)). To obtain a reliable gene expression ratio, lowly expressed genes (CPM(cg +co)<1 in any allotetraploid individual) were excluded from this analysis. Eventually, the homoeolog expression ratio was calculated for 18,255 genes in flowers, and 15,581 genes in leaves.

Overall, none of the three allotetraploid groups showed a strong average HEB. In flowers, the average HEB was 0.499±0.001, 0.531±0.001, and 0.475±0.001 for the Sd, Sh, and Cbp groups, respectively. In leaves, the average HEB was 0.504±0.001, 0.532±0.001, and 0.477 for the Sd, Sh, and Cbp groups. When averaged among individuals, the HEB of resynthesized allotetraploids had smaller gene-wise variation than that of the Cbp group (Levene’s test, p-value <0.001).

Among resynthesized allotetraploids, although the average homoeolog expression ratio was not systematically biased toward Cg or Co, HEB showed great variation among chromosomes and individuals (Figure 5a and b, Figure 5—figure supplements 1 and 2). The distribution of HEB in some chromosomes had peaks around 0, 0.25, 0.75, or 1, but the shape of the distribution was almost identical between flower and leaf samples. When HEB of genes was plotted along chromosomal positions, we found that the extra peaks in the distribution of HEB can be further explained by large genomic segments separated by a sudden change of average HEB (Figure 5c, Figure 5—figure supplements 36). Altogether, the pattern suggested that some chromosomes or chromosomal regions in resynthesized allotetraploids had an unbalanced number of cg- and co-homoeologs (not 2:2), which were likely caused by the segregation and recombination of homoeologous chromosomes. Both the segregation and recombination of homoeologous chromosomes are outcomes of homoeologous synapsis (synapsis between homoeologous chromosomes during meiosis), which reflects polysomic or mixed inheritance in resynthesized allopolyploids. For short, we refer to both segregation and recombination of homoeologous chromosomes as homoeologous synapsis.

Figure 5. Variation of homoeolog expression bias (HEB) of the three allotetraploid groups in flowers.

Sd: whole-genome-duplication-first resynthesized allotetraploids; Sh: hybridization-first resynthesized allotetraploids; Cbp: natural allotetraploid C. bursa-pastoris. Gene-wise HEB was calculated as the expression level of cg-homoeolog divided by the total expression level of both cg- and co- homoeologs (cg/(cg +co)). For each individual, HEB was calculated for 18,255 genes, which had count-per-million>1 in all flower samples. The distribution of HEB was shown by (a) individuals and (b) chromosomes. (c) HEB was also plotted along chromosome positions to show the sudden change of average HEB between genomic blocks, taking individual Sd-6–4 as an example. The number of cg-homoeologs at each gene estimated by the five-state Hidden Markov Model (HMM) was indicated by five colors. Dark green, light green, grey, light purple, and dark purple represent (0, 1, 2, 3, 4) cg-homoeologs and (4, 3, 2, 1, 0) co-homoeologs, respectively.

Figure 5.

Figure 5—figure supplement 1. Distribution of gene homoeolog expression bias (HEB) by individuals.

Figure 5—figure supplement 1.

Distribution of gene HEB of each allotetraploid individual in flowers (upper panel) and leaves (lower panel). For each gene, HEB was estimated as the expression of C. grandiflora-derived homoeolog divided by the total expression of both C. grandiflora- and C. orientalis-derived homoeolog (cg/(cg +co)). HEBs were calculated for 18,255 genes in flowers and 15,581 genes in leaves.
Figure 5—figure supplement 2. Distribution of gene homoeolog expression bias (HEB) by chromosomes in leaves.

Figure 5—figure supplement 2.

The distribution of gene HEB of resynthesized and natural Capsella allotetraploids was showed by eight major chromosomes in leaves.
Figure 5—figure supplement 3. Homoeolog expression bias along chromosome positions in the inflorescence sample of the Sd group (‘Whole-genome-duplication-first’” resynthesized Capsella allotetraploids).

Figure 5—figure supplement 3.

The number of cg-homoeologs at each gene estimated by the five-state Hidden Markov Model (HMM) was indicated by five colors. Dark green, light green, grey, light purple, and dark purple represent (0, 1, 2, 3, 4) cg-homoeologs and (4, 3, 2, 1, 0) co-homoeologs, respectively.
Figure 5—figure supplement 4. Homoeolog expression bias along chromosome positions in the leaf sample of the Sd group (‘Whole-genome-duplication-first’ resynthesized Capsella allotetraploids).

Figure 5—figure supplement 4.

The number of cg-homoeologs at each gene estimated by the five-state Hidden Markov Model (HMM) was indicated by five colors. Dark green, light green, grey, light purple, and dark purple represent (0, 1, 2, 3, 4) cg-homoeologs and (4, 3, 2, 1, 0) co-homoeologs, respectively.
Figure 5—figure supplement 5. Homoeolog expression bias along chromosome positions in the inflorescence sample of the Sh group (‘hybridization-first’ resynthesized Capsella allotetraploids).

Figure 5—figure supplement 5.

The number of cg-homoeologs at each gene estimated by the five-state Hidden Markov Model (HMM) was indicated by five colors. Dark green, light green, grey, light purple, and dark purple represent (0, 1, 2, 3, 4) cg-homoeologs and (4, 3, 2, 1, 0) co-homoeologs, respectively.
Figure 5—figure supplement 6. Homoeolog expression bias along chromosome positions in the leaf sample of the Sh group (‘hybridization-first’ resynthesized Capsella allotetraploids).

Figure 5—figure supplement 6.

The number of cg-homoeologs at each gene estimated by the five-state Hidden Markov Model (HMM) was indicated by five colors. Dark green, light green, grey, light purple, and dark purple represent (0, 1, 2, 3, 4) cg-homoeologs and (4, 3, 2, 1, 0) co-homoeologs, respectively.
Figure 5—figure supplement 7. Estimated number of breakpoints per chromosome quartet in resynthesized allotetraploids.

Figure 5—figure supplement 7.

The number of breakpoints between chromosome segments with distinct average homoeologous expression bias (HEB) per chromosome was estimated with a five-state Hidden Markov Model for inflorescence and leaf sample of the 12 resynthesized allotetraploid individuals (groups Sd and Sh). Each dot represents the estimation of one chromosome in one individual. Quartiles and median were shown by boxes and thicker bars.

The effect of possessing an unbalanced number of homoeologs largely increased the variation of HEB in resynthesized allotetraploids. The breakpoint between segments with distinct average HEB and the copy number of cg-homoeolog on each segment were estimated with a five-state Hidden Markov Model (HMM), using HEB along chromosomes (Figure 5c, Figure 5—figure supplement 7). Among the 96 chromosome quartets (two pairs of homologous chromosomes) from the 12 resynthesized allotetraploid individuals, only 39 chromosome quartets showed no sign of homoeologous synapsis, that is no breakpoint was identified and the estimated number of cg-homoeolog across the chromosome was two. On average 0.833±0.097 (mean ± se) breakpoint was identified for each chromosome quartet, and 31.0% of genes were estimated to have different numbers of cg- and co-homoeologs. Finally, for resynthesized allotetraploids, the estimated copy number of homoeologs was able to explain 48.4% and 46.8% of the variance of HEB in flowers and leaves, respectively (GLM with quasi-binomial error distribution, p<0.001 in both tissues).

In contrast to resynthesized allotetraploids, the distribution of HEB of natural Cbp was similar among individuals and chromosomes (Figure 5a and b, Figure 5—figure supplement 2), although the distribution of HEB of some chromosomes of Cbp also showed weak bumps around 0, 0.25, 0.75, or 1 (Figure 5b, Figure 5—figure supplement 2). We could not confidently estimate the number of homoeologs or the breakpoint of segments for Cbp with only RNA-sequencing data, as segments resulting from homoeologous exchanges could be shorter in natural Cbp, and the signals of copy number could be blurred by the variance of regulatory divergence. Nevertheless, the HMM segmentation algorithm also identified some candidate segments of which the average HEB strongly deviated from 0.5. Some candidate segments were only shared by individuals from the same population (Figure 6, Figure 6—figure supplements 1 and 2).

Figure 6. Homoeolog expression bias (cg/(cg +co)) along chromosomes of natural allotetraploid C. bursa-pastoris in flowers, taking four pairs of chromosome quartets with typical patterns as an example.

The number of cg-homoeologs estimated by the five-state Hidden Markov Model was indicated by five colors. The two chromosome quartets in the same row are from the two individuals of the same major genetic cluster of natural C. bursa-pastoris (Kryvokhyzha et al., 2016), showing that some estimated segments with an unbalanced number of cg- and co-homoeologs were shared between the individuals from the same genetic cluster.

Figure 6.

Figure 6—figure supplement 1. Homoeolog expression bias along chromosome positions in the inflorescence sample of the Cbp group (natural Capsella bursa-pastoris).

Figure 6—figure supplement 1.

The number of cg-homoeologs at each gene estimated by the five-state Hidden Markov Model (HMM) was indicated by five colors. Dark green, light green, grey, light purple, and dark purple represent (0, 1, 2, 3, 4) cg-homoeologs and (4, 3, 2, 1, 0) co-homoeologs, respectively.
Figure 6—figure supplement 2. Homoeolog expression bias along chromosome positions in the leaf samples of the Cbp group (natural Capsella bursa-pastoris).

Figure 6—figure supplement 2.

The number of cg-homoeologs at each gene estimated by the five-state Hidden Markov Model (HMM) was indicated by five colors. Dark green, light green, grey, light purple, and dark purple represent (0, 1, 2, 3, 4) cg-homoeologs and (4, 3, 2, 1, 0) co-homoeologs, respectively.

Most resynthesized allotetraploids had less homoeolog expression loss than natural Cbp, but with extreme outliers

Loss of homoeolog expression is a common phenomenon in allopolyploid species, which can be caused by homoeolog loss or silencing (Buggs et al., 2009; Cox et al., 2014; Lashermes et al., 2016). Loss of homoeolog expression may quickly arise after allopolyploidy, or alternatively, reflect a gradual biased gene fractionation during diploidization. We compared the extent of loss of homoeolog expression between resynthesized and natural Capsella allotetraploids, and among allotetraploid individuals.

The loss of homoeolog expression was identified from genes with medium to high expression levels in all individuals of the corresponding diploid species (Figure 7) to reduce noise from RNA-sequencing and phasing. On average, only 1.0% of these genes showed homoeolog-specific expression loss in natural Cbp. Most resynthesized allotetraploids have a lower level of homoeolog-specific expression loss than natural Cbp, but three individuals (Sd-6–4, Sd-8–5, and Sh-5–5) showed an extremely high level of homoeolog expression loss. The striking homoeolog expression loss in these three resynthesized allotetraploids was most likely caused by the segregation and recombination of homoeologous chromosomes, as the extreme HEB in the three outliers was restricted to chromosome 3, where the entire chromosome or a large chunk of the chromosome has only expression from one homoeolog (Figure 5b, Figure 5—figure supplements 36).

Figure 7. Loss of homoeolog expression in resynthesized (Sd and Sh) and natural allotetraploids (Cbp).

Figure 7.

The number of genes with the expression loss of C. orientalis-homoeolog (a,c) or C. grandiflora-homoeolog (b,d) per individual was compared among the three groups of allotetraploids in flowers (a,b) or leaves (c,d). Homoeologous genes that had obvious expression (count per million >5) in all individuals of the corresponding diploid species but almost no expression (count per million <0.5) in one allotetraploid individual were considered cases of homoeolog expression loss.

Expression level dominance is caused by different mechanisms in resynthesized and natural allotetraploids

As homoeologous synapsis seemed to be a major cause of HEB and homoeolog-specific expression loss in resynthesized allotetraploids, we assessed whether it could have also played a role in the evolution of ELD. To do so, we explored the mechanism of ELD in resynthesized allotetraploids by comparing the gene expression change of separate homoeologs relative to the corresponding gene in diploid groups (log2FC(cg/Cg2) and log2FC(co/Co2)) among non-additive gene expression categories (Figure 8).

Figure 8. Relationships between homoeolog expression change and non-additive gene expression in flowers.

(a) Expected patterns of homoeolog expression change in each scenario that may explain the cause of expression level dominance (ELD) in resynthesized allotetraploids. (b) Observed homoeolog expression change among genes with ELD in resynthesized allotetraploids (Sd and Sh). (c) Observed homoeolog expression change among genes with Cbp-specific non-additive expression.

Figure 8—source data 1. Expression level fold-change (log2FC) of homoeologs relative to the corresponding gene in diploid groups among genes with expression level dominance (ELD) in flowers or leaves.

Figure 8.

Figure 8—figure supplement 1. Relationships between homoeolog expression change and non-additive gene expression in leaves.

Figure 8—figure supplement 1.

Gene (homoeolog) expression levels were normalized with the trimmed mean of M-values (TMM) method. (a) Homoeolog expression change among genes with ELD in resynthesized allotetraploids. (b) Homoeolog expression change among genes with Cbp-specific non-additive expression.

For ELD in resynthesized allotetraploids, different non-exclusive short-term mechanisms would produce different patterns of average expression change of EL-dominant (homoeolog derived from the diploid progenitor to which the total expression of both homoeologs was similar) and EL-recessive homoeologs (the opposite homoeolog), among genes with significant ELD: (i) If ELD was mainly caused by possessing more than two copies of the EL-dominant homoeolog (due to the segregation or recombination of homoeologous chromosomes), we would expect on average the expression of EL-dominant homoeolog to increase, and EL-recessive homoeolog to decrease, in both up- and down-regulated cases of ELD. (ii) If ELD was mainly caused by mechanisms with random effects, such as TE transposition, on average, there should be no large difference between the expression changes of EL-dominant and EL-recessive homoeologs. Because the occurrence and regulatory effects of new TE transpositions do not depend on the original relative expression level of the two homoeologs, that is the highly and lowly expressed homoeologs are equally likely to be up- or down-regulated by new TE transpositions. (iii) Predictions for new intergenomic interactions of regulatory elements can be complex, but under a simple scenario (Hu and Wendel, 2019), ELD may be caused by divergent trans-acting factors. In allopolyploids, stronger trans-acting factors act on the cis-regulatory elements of the opposite homoeolog, causing a similar regulatory effect if transcription rate were not limited by the concentration of these trans-regulators. If this mechanism were the main cause of ELD, on average the EL-recessive homoeolog should have the larger expression change in all categories of ELD, while the expression of EL-dominant homoeolog is not expected to change. In all other cases, we would not have direct inference on the exact mechanism of ELD, but at least the three mechanisms listed above could not be the predominant cause of ELD.

Concerning the ELD found in resynthesized allotetraploids, the change of homoeolog expression fits the third scenario. In all four categories of ELD, the EL-recessive homoeolog had a larger average expression change in the same direction as ELD, while the average expression change of EL-dominant homoeolog was closer to zero (Figure 8, Figure 8—figure supplement 1, and Figure 8—source data 1).

The Cbp-specific ELD showed a different pattern. Although the EL-recessive homoeolog still had a larger expression change, the EL-dominant also showed non-zero average expression change toward the direction of ELD, especially in Cg-ELD genes. For genes with Cbp-specific TRE, both homoeologs had expression change in the same direction of TRE (Figure 8, Figure 8—figure supplement 1, and Figure 8—source data 1).

Discussion

Distinguishing parental legacy from the effects of evolutionary forces is essential for interpreting the outcome of allopolyploidization. The short-term genomic interactions in allopolyploids reflect the divergence of parental genomes (Johnson, 2010). In this sense, short-term transcriptomic changes in new allopolyploids are also part of parental legacy but are not observable with only information from diploid parental species. In this study, we used resynthesized Capsella allotetraploids as an approximation of the early stage of the natural allotetraploid species to separate and compare the short- and long-term transcriptomic and phenotypic changes. The timing and pattern of the variation also provided hints for locating the exact mechanism.

The extant diploid species were not the exact parental populations of natural C. bursa-pastoris, and the sampled diploid individuals were genetically closer to the resynthesized allotetraploids than to natural C. bursa-pastoris, potentially leading to an overestimation of the contribution of long-term mechanisms. However, the divergence between C. grandiflora and C. orientalis was much more ancient than the formation of C. bursa-pastoris (Douglas et al., 2015). Therefore, the molecular divergence between C. grandiflora and C. orientalis after the formation of C. bursa-pastoris is only a small fraction of the total divergence.

Besides, the mating system of the real parental populations of C. bursa-pastoris was likely the same as today: A nonfunctional self-incompatibility haplotype that was fixed in C. orientalis was also found in C. bursa-pastoris, suggesting that C.orientalis became self-compatible before the formation of C. bursa-pastoris (Bachmann et al., 2019); On the other hand, restoring the great polymorphism of functional self-incompatibility haplotypes from a self-compatible ancestral population is very unlikely, therefore self-incompatibility should be the ancestral state of the (C. grandiflora +C. rubella) lineage. Although we cannot exclude the alternative hypothesis that the progenitor of Cbp_cg is a self-fertilizing ghost population, the most parsimonious hypothesis is that Cbp_cg subgenome originated from outcrossing individuals. For all these reasons, the resynthesized Capsella allotetraploids may still provide a realistic approximation to the early stages of natural C. bursa-pastoris.

Another limitation of using resynthesized allotetraploids is that we could not completely exclude the effect of colchicine treatment, even though we used second-generation allotetraploids (Münzbergová, 2017). Colchicine treatment could affect pollen and seed quality and the rate of homoeologous synapsis in resynthesized allotetraploids. Spontaneous Capsella allotetraploids have been repeatedly found among diploid hybrids that were not treated with colchicine solution (Bachmann et al., 2021 own unpublished results). For future studies, these spontaneous allotetraploids would be excellent materials for accurately estimating the rate of homoeologous synapsis in newly formed Capsella allotetraploids. Nevertheless, the reported influence of colchicine treatments on the second generation of synthetic polyploids was either trivial (Husband et al., 2016) or not in the same direction as our results (Münzbergová, 2017). Hence, the observed pollen and seed quality reduction and rampant homoeologous synapsis were unlikely pure artifacts from colchicine treatment.

Resynthesized and natural Capsella allotetraploids had distinct phenotypes

The most noticeable morphological difference between resynthesized and natural Capsella allotetraploids was related to the selfing syndrome. Compared to second-generation resynthesized allotetraploids, natural C. bursa-pastoris had smaller floral organs, more pollen per flower, and a shorter stem length (Figure 1). In particular, trait values of petal size and stamen length of the resynthesized allotetraploids had almost no overlap with natural C. bursa-pastoris but were similar to the outcrossing progenitor C. grandiflora. The shorter stem length in natural C. bursa-pastoris may reflect a shorter lifespan, which is also a feature of self-fertilizing species (Duminil et al., 2009; Lesaffre and Billiard, 2020). If natural C. bursa-pastoris indeed originated from the hybridization between C. grandiflora-like outcrossing plants and C. orientalis-like self-fertilizing plants, the selfing syndrome in C. bursa-pastoris does not reflect the instant dominance effect of the C. orientalis alleles, but evolved afterward. A remaining question is whether the genetic basis of the selfing syndrome in C. bursa-pastoris is the same as in C. orientalis. Did the pre-existing selfing syndrome-related alleles from C. orientalis facilitate the evolution of selfing syndrome in C. bursa-pastoris? Was the selfing syndrome of C. bursa-pastoris established by silencing/replacing the C. grandiflora alleles or new regulations on both C. orientalis and C. grandiflora homoeologs?

Although the selfing syndrome in natural C. bursa-pastoris was most likely an adaptation to the change in mating system, these morphological changes may be accelerated by the compensation or adaptation to a polyploid state (Hollister, 2015). WGD directly increases the size of various types of cells (Beaulieu et al., 2008; Katagiri et al., 2016; Snodgrass et al., 2017; Wilson et al., 2021) and disturbs the efficiency of multiple physiological processes (Drake et al., 2013; Bomblies, 2020). Compared to newly resynthesized autopolyploids, natural autopolyploid plants often have smaller cell or organ sizes (Maherali et al., 2009; Münzbergová, 2017; Landis et al., 2020), possibly driven by the demand for optimizing physiological processes or resource allocation (Roddy et al., 2020; Domínguez-Delgado et al., 2021). In the case of allotetraploid Capsella, the selection of selfing-syndrome-related traits and the adaptation to a polyploid state (e.g. decreasing the size of cell or organ for physiological efficiency or better energy allocation) may work synergistically and can be difficult to separate.

Apart from the selfing-syndrome-related traits, newly resynthesized Capsella allotetraploids had lower proportions of viable pollen (Figure 1l) and normal seeds (Figure 1m). In contrast, pollen and seed quality in natural C. bursa-pastoris were much higher and as good as in diploid species. The higher pollen and seed quality in natural C. bursa-pastoris was possibly achieved by improving meiotic behaviors. Meiotic stabilization is another important aspect of adaptation to an allopolyploid state (Blasio et al., 2022). Newly resynthesized allopolyploids suffer more often than natural allopolyploids, from frequent and severe meiotic abnormalities, which are associated with lower pollen viability and fertility in resynthesized allopolyploids (Szadkowski et al., 2010; Zhang et al., 2013; Henry et al., 2014). The molecular basis of improved meiotic synapsis in natural allopolyploids is not completely clear, but several loci that suppress homoeologous synapsis or recombination are essential for the process (Jenczewski et al., 2003; Nicolas et al., 2009; Greer et al., 2012; Martín et al., 2017). The exact mechanism of meiotic stabilization in natural C. bursa-pastoris needs further investigation.

The emergence of non-additive gene expression in allotetraploids was a two-stage process

Non-additive gene expression in allotetraploid Capsella was altogether limited and neither a complete relict of short-term genomic interactions nor entirely due to gradual divergence. We found that about 40% of the cases of ELD in natural C. bursa-pastoris could already be found in the second generation of resynthesized allotetraploids (relict ELDs). Most of these relict ELDs and their directions were shared by the two resynthesized allotetraploid groups (Figure 4c, Figure 4—figure supplement 1), suggesting that most relict ELDs were repeatable alterations. On the other hand, about 60% of the cases of ELD were specific to natural C. bursa-pastoris (Cbp-specific ELDs, Figure 4c, and Figure 4—figure supplement 1), revealing the contributions from long-term evolution.

The relict ELDs and Cbp-specific ELDs differed in several features. While the vast majority of relict ELDs were up-regulated (97% in flowers and 88% in leaves), Cbp-specific ELDs had a more balanced number of up- and down-regulated ELDs (61% and 54% were up-regulated in flowers and leaves, respectively), suggesting the short and long-term ELD had different molecular basis. In diploid or polyploid interspecific hybrids, overexpression is often more common than underexpression. The trend has been observed in a wide range of organisms, including Brassica (Wu et al., 2018; Li et al., 2020; Wei et al., 2021) and Raphanobrassica (Ye et al., 2016), cotton (Yoo et al., 2013), brown algae (Sousa et al., 2019), and copepod (Barreto et al., 2015). Results in Capsella further showed that short-term mechanisms mainly caused the excess of up-regulated ELDs. Among the short-term mechanisms, intergenomic interaction of regulatory elements is the most likely candidate for generating the excess of up-regulated ELDs, considering that these up-regulated ELDs were highly shared between the two resynthesized allotetraploid groups, and between resynthesized and natural allotetraploids. A global DNA methylation change may also contribute to the excess of up-regulation in resynthesized allotetraploids, if methylation levels were systematically lower in Capsella allotetraploids, as observed in Mimulus (Edger et al., 2017). However, methylation change alone fails to explain why the majority of these up-regulated ELDs in resynthesized allotetraploids were retained in natural allotetraploids, especially in leaves (Figure 4c, Figure 4—figure supplement 1).

Besides, the relict ELDs contained more Cg-ELDs than Co-ELD, but Cbp-specific ELDs had more Co-ELDs, especially in flowers (Figure 4c, Figure 4—figure supplement 1). The increase of Co-ELDs in natural C. bursa-pastoris mirrored the morphological difference: The floral organ size of resynthesized allotetraploids was similar to that of C. grandiflora, whereas natural C. bursa-pastoris was more similar to C. orientalis (Figure 1a–g). However, it is worth noting that the reversed trend of ELD may not be the fundamental genetic basis of selfing syndrome, but reflect the different composition of tissue/cells in RNA samples. Both morphological changes and the direction of ELD could result from upstream regulatory changes.

In addition, although unbalanced homoeolog content (not 2:2) caused by homoeologous synapsis was common in our resynthesized allotetraploids, they were still not the main cause of ELD in resynthesized allotetraploids. If ELD were mainly caused by possessing more than two copies of the EL-dominant homoeolog, we would expect the relative expression from the EL-dominant homoeolog to increase and the EL-recessive homoeolog to decrease in genes with significant ELD. In contrast to this expectation, we found that, on average, the expression of EL-dominant homoeologs (relative to the transcriptome of the subgenome) was similar to that in diploid parental species, while the expression of EL-recessive homoeologs changed toward the EL-dominant homoeolog (Figure 8). This suggests that ELD is mainly achieved by altering the expression of EL-recessive homoeologs. This result is consistent with studies in a wide range of allopolyploid organisms (Yoo et al., 2013; Cox et al., 2014; Combes et al., 2015; Sousa et al., 2019), although not in resynthesized Brasssica napus (Wu et al., 2018). This conservative pattern can be explained by intergenomic interaction between divergent regulatory elements (Hu and Wendel, 2019), but direct evidence is still lacking.

As for transgressive gene expression, we found almost no TRE genes in resynthesized allotetraploids, but a mere 1.3% TRE genes in natural C. bursa-pastoris, with a threshold of FC >2 (Figure 4—source data 1). In agreement with several previous observations (Flagel and Wendel, 2010; Yoo et al., 2013; Zhang et al., 2016b), the results in Capsella suggest that transcriptional novelties in allopolyploids were not an instant outcome of allopolyploidization but mainly arose during long-term evolution and remained altogether rather limited.

Homoeologous synapsis was common in resynthesized Capsella allotetraploids, and may still be a source of variation in natural Cbp

Disomic inheritance in allopolyploid species is not established all at once (Henry et al., 2014), and strict disomic inheritance may have never been achieved in some allopolyploid species. In contrast to the disomic inheritance and the low level of homoeolog expression loss in natural C. bursa-pastoris, we found abundant traces of homoeologous segregation or recombination in all 12 lines of resynthesized Capsella allotetraploids, after only one meiosis. The observation is in line with many recent studies in which abundant homoeologous exchanges were found in allopolyploids (Lloyd et al., 2018; Pelé et al., 2018). The contrast between resynthesized and natural allotetraploids suggested that preferential synapsis was rapidly improved in natural C. bursa-pastoris, and/or a balanced number of homoeologs was strongly preferred by selection, otherwise, we would expect a much higher proportion of homoeolog replacement (having four copies of the same homoeolog) after 100,000 year recurrent self-fertilization with tetrasomic/heterosomic inheritance.

Homoeologous synapsis was the major mechanism for the variation of HEB and homoeolog expression loss in resynthesized Capsella allopolyploids (Figures 5 and 7). Several models have been proposed to explain HEB and biased genome fractionation in allopolyploids, including different TE contents (Woodhouse et al., 2014; Cheng et al., 2016; Wendel et al., 2018), the epigenetic difference of subgenomes (Li et al., 2014), different strength of regulatory elements, as a result of enhancer runaway (Fyon et al., 2015; Bottani et al., 2018). It was also suggested that the initial HEB may be reinforced in long-term evolution, as the homoeolog with a lower initial expression level is subject to weaker purifying selection, and has a larger chance of degeneration (Woodhouse et al., 2014). However, in resynthesized Capsella allotetraploids, homoeologous synapsis played an important role in generating HEB variation, possibly overshadowing the influence of other mechanisms. This result does not conflict with the observed association between TE content in parental species and genome dominance (Woodhouse et al., 2014). While TE contents may have only a minor effect in directly generating HEB variation, they could still be informative in predicting HEB in established allopolyploids, as the presence of TEs in the flanking regions may affect the fitness effect of HEB variation (Hollister and Gaut, 2009). In other words, TEs may not function as a strong mutagenic mechanism of HEB variation, but affect the selection on HEB variation, as a form of genetic load.

For established natural allotetraploids, occasional homoeologous synapsis may still be an important source of genetic variation, even long after the allopolyploidization event. Although an earlier allozymic study (Hurka et al., 1989) and an approximate Bayesian computation (ABC) with high throughput sequencing data (Roux and Pannell, 2015) suggest that natural C. bursa-pastoris exhibits disomic inheritance, neither analysis could reject homoeologous synapsis at a lower rate. A very small proportion of homoeologous synapsis may be negligible for inferring the dominant mode of inheritance, but in terms of causing homoeologous gene loss and creating genetic variation, homoeologous synapsis can still be more influential than point mutations. Due to the inevitable technical variation of RNA-seq and expression variation across genes, we were unable to confidently resolve smaller blocks of unbalanced homoeolog content. Despite the small sample size and the low resolution of RNA-seq data, we noticed that some small genomic blocks with homoeolog replacement were shared by the individuals of the same population but varied among populations of natural C. bursa-pastoris (Figure 6, Figure 6—figure supplements 1 and 2), suggesting that homoeologous synapsis still contribute to expressional variation in natural C. bursa-pastoris. Apart from homoeolog synapsis in a single-origin allopolyploid species, unbalanced content of homoeologs could also arise from secondary introgression from diploid parental species. Detailed demographic modeling would be needed for distinguishing the two scenarios.

Conclusion

In conclusion, together with Duan et al., 2023, the present study shows that both short- and long-term mechanisms contributed to transcriptomic and phenotypic changes in natural allotetraploids. However, the initial gene expression changes were largely reshaped during long-term evolution leading to more pronounced morphological changes. Resource limitations and/or adaptation to self-fertilization also, drive flowers' evolution after polyploidization.

Materials and methods

Plant material

The present study used five Capsella plant groups (Figure 1, Figure 1—source data 1), including diploid C. orientalis (Co2), diploid C. grandiflora (Cg2), two types of resynthesized allotetraploids (Sd and Sh), and natural allotetraploids, C. bursa-pastoris (Cbp). RNA-sequencing data and most phenotypic data (except floral morphologic traits) of Co2, Cg2, Sd, and Sh groups were from Duan et al., 2023. The Sd and Sh allotetraploids only differed in the order of hybridization and whole genome duplication. Allotetraploids of the Sd group were generated by crossing synthetic autotetraploid C. orientalis with autotetraploid C. grandiflora, whereas the Sh group was generated by inducing WGD in the first generation of diploid hybrids of C. orientalis×C. grandiflora. In all interspecific crosses, C. orientalis served as maternal plants, mimicking the formation of the natural allotetraploid species, C. bursa-pastoris (Hurka et al., 2012). All C. orientalis plants used in the experiment are descendants of one wild C. orientalis individual, and all the C. grandiflora plants are descendants of three individuals from one C. grandiflora population (Figure 1c and Figure 1—source data 1).

To compare the resynthesized allotetraploids with natural allotetraploids, natural C. bursa-pastoris was added to the present study. Seeds of wild C. bursa-pastoris plants were grown in a growth chamber for one generation. Then the second generation of C. bursa-pastoris plants was grown in the same experiment together with the other four plant groups used in Duan et al., 2023. All five plant groups were grown in a growth chamber under long-day conditions (16 hr light at 22 °C and 8 hr dark at 20 °C, light intensity = 137 uE·m-2·s-1).

Each of the five plant groups was represented by six ‘lines’, and each line had six individuals, which were full siblings from either self-fertilization (Co2, Cbp, Sh, and Sd groups) or brother-sister mating (Cg2 group). The six lines of C. bursa-pastoris were from six populations (Figure 1c and Figure 1—source data 1), representing three major genetic clusters of the wild C. bursa-pastoris (Kryvokhyzha et al., 2019b). Each line originated from an independent allopolyploidization event for the Sh and Sd groups. For Co2 and Cg2 groups, ‘line’ only referred to the offspring of one parental plant (Co2) or a pair of parental plants (Cg2) in the previous generation.

Plants used in the present study and a previously published work (Duan et al., 2023) were different subsets of a single experiment. The entire experiment had eight plant groups, including the five plant groups used in the present study (Groups Cg2, Co2, Sh, Sd, and Cbp; 5 groups x 6 lines x 6 individuals = 180 plants), and other three groups that were only used in Duan et al., 2023, Groups F, Co4 and Cg4; 3 groups x 6 lines x 6 individuals = 108 plants. All these plants were grown in 36 trays placed on six shelves inside the same growth chamber. Each tray had exactly one plant from each of the eight groups, and the positions of the eight plants within each tray (A-H) were randomized with random.shuffle() method in Python (Supplementary file 1). The position of the 36 trays inside the growth room (1-36) was also random and the positions of all trays were shuffled once again 28 days after germination (randomized with RAND() and sorting in Microsoft Excel Spreadsheet).

Phenotyping

Floral morphological traits were measured for all five groups on 165 plants 7–14 days after the first flower opened. The rest 15 plants were not measured due to technical errors. Two fully opened young flowers were dissected for each plant, and the floral organs were scanned with a photo scanner (Epson Perfection V370) at 3200 dpi. Seven floral morphological traits were measured on the digital images with Fiji, an open-source platform for biological-image analysis (Schindelin et al., 2012), including sepal width, sepal length, petal width, petal length, pistil width, pistil length, and stamen length. For each plant, two flowers were examined. Three sepals, petals and stamina, and one pistil were measured for each flower.

Stem length (total sample size n=171), flowering time (n=170), pollen traits (n=142), and seed traits (n=151) were measured for the five plant groups. Measurements of the Cg2, Co2, Sh, and Sd groups were from Duan et al., 2023, and measurements of the Cbp group were added by the present study, which were obtained in the same way as the other four groups. The length of the longest stem (stem length) was measured on dry plants. The number of days from germination start to the opening of the first flower was recorded as flowering time. The number of pollen grains per flower (pollen counts) was calculated by counting 1/60 (Co2, Sd, Sh, and Cbp groups) or 1/120 (Cg2 group) of the total pollen grains of a flower using a hemocytometer. Pollen viability was measured with an aceto-carmine staining method (Duan et al., 2023), by examining at least 300 pollen grains per flower. Pollen counts and pollen viability were measured on two flowers of each individual. Seeds from the 11th to 20th fruits were counted and were used to measure the proportion of normal seeds. In the case of the Cg2 and Cg4 groups, seeds were obtained through hand pollination. The first ten fruits were used when not all of these flowers set fruits. Individuals with less than ten fruits were excluded from the analysis. Seeds that were flat or dark and small were considered abnormal.

General linear models (for non-ratio data) or generalized linear models with quasi-binomial error distribution and a logit link function (for pollen viability and seed quality) were fit to each phenotypic trait. The difference in phenotypic traits among the five plant groups was tested with one-way analyses of variance (ANOVA, trait value ~plant group, type-III tests), using R package ‘car’ version 3.1–2 (Fox and Weisberg, 2019). A two-way ANOVA (trait value ~plant group +tray ID, type-III tests) was also tried to test the positional effect. When plant group had a significant effect on a trait, groups with different means were identified by Tukey’s HSD test using R packages ‘agricolae’ version 1.3–6 (de Mendiburu and Yaseen, 2020) and ‘multcomp’ version 1.4–25 (Hothorn et al., 2008).

RNA-sequencing

RNA-sequencing was conducted for the five plant groups (Co2, Cg2, Sd, Sh, and Cbp). For each line, leaf and young inflorescence (flower) samples from one randomly chosen individual were sequenced, resulting in 60 RNA-sequencing samples (5 groups × 6 lines × 2 tissues). RNA-sequencing data of the Co2, Cg2, Sd, and Sh groups were from Duan et al., 2023. Data from the Cbp group was added to the present study, but the Cbp samples were collected and sequenced simultaneously with the other four plant groups in 2019. The 8th and 9th leaves were harvested at the emergence of the 11th leaf, and 2–4 inflorescences with only unopened flower buds were collected 7–14 days after the first flower opened. The collected tissue was frozen in liquid nitrogen and stored at –80 °C.

Total RNA was extracted from leaf and flower samples with a cetyl-trimethyl-ammonium-bromide (CTAB)-based method (Duan et al., 2023). DNA contamination was further removed by the RNase-Free DNase Set (QIAGEN). RNA libraries were prepared with Illumina TruSeq Stranded mRNA (poly-A selection) kits and sequenced with pair-end reads of 150 bp on three NovaSeq 6000 S4 lanes, by the SNP&SEQ Technology Platform in Uppsala, Sweden. One sequencing library was generated for each diploid sample, and two libraries were generated for each allotetraploid sample.

Gene and homoeolog expression

Raw RNA-seq reads were mapped to the reference genome of Capsella rubella (Slotte et al., 2013) using Stampy v.1.0.32 (Lunter and Goodson, 2011). The expected divergence between reference and query sequences was set to 0.02, 0.04, and 0.025 for C. grandiflora, C. orientalis, and allotetraploids, respectively. Mapping quality was inspected by Qualimap v. 2.2.1 (Okonechnikov et al., 2016). The number of reads mapped to each gene was counted by HTSeq v.0.12.4 (Anders et al., 2015), using the mode ‘union’ (hereinafter referred to as ‘unphased gene expression’). The average number of mapped reads was 38.4±2.4 and 70.0±3.5 for the diploid and tetraploid samples, respectively (Supplementary file 2). For all analyses on unphased gene expression, the mapped reads were downsampled with a custom Python script (Duan et al., 2023), so that all five groups had a similar average number of mapped reads.

The expression level of separate homoeologs in allotetraploids was determined by the program HyLiTE v.2.0.2 (Duchemin et al., 2015). Alignment results from the software Stampy v.1.0.32 (Lunter and Goodson, 2011) of all five groups and the C. rubella reference genome (Slotte et al., 2013) were used as the input for HyliTE. HyLiTE performed SNP calling, classified RNA-seq reads of allotetraploids to parental types according to the identified diagnostic variation between the two diploid parental species and generated a table of homoeolog read counts for allotetraploid individuals (hereinafter referred to as ‘phased gene expression’).

After partitioning the homoeolog expression, the average library size of allotetraploid subgenomes was similar to the library size of diploid groups (one-way ANOVA, F4,91=1.28, p=0.29). To reduce bias between phased and unphased expression datasets, when the homoeolog expression of allotetraploids was compared with gene expression in diploid parental species, the expression level of each gene in diploid individuals was rescaled by the proportion of reads that can be phased for the same gene in allotetraploid individuals.

The overall gene expression pattern of five plant groups in each tissue was visualized by MDS analysis, using the R package ‘edgeR’ (version 3.28.1; Robinson et al., 2010) in the R software environment version 3.6.3 (R Core Team, 2020). For both phased and unphased gene expression data, genes with count-per-million (CPM) over one in at least two samples were used for the MDS analysis, and expression levels were normalized with the trimmed mean of M-values (TMM) method.

Differential expression (DE) analysis

DE analysis was conducted on both unphased and phased gene expression data with the R package ‘edgeR’ (version 3.28.1; Robinson et al., 2010), using TMM normalized gene expression levels. A negative binomial generalized linear model (GLM) was fitted to each dataset. Pairwise group contrasts were then made for each GLM model, and gene-wise quasi-likelihood F-tests were conducted to detect expression changes in each contrast. For unphased data, pairwise contrasts were made among the original five groups (Co2, Cg2, Sd, Sh, and Cbp). For phased data, allopolyploid subgenomes were treated as separate groups (Sd_co, Sd_cg, Sh_co, Sh_cg, Cbp_co, and Cbp_cg) and were compared with the two diploid groups (Co2, Cg2). Genes with an expression fold-change (FC) larger than two and a false discovery rate (FDR) less than 0.05 were considered significant DEGs.

Expression level dominance and transgressive expression

To measure the extent of non-additive expression in allotetraploids, genes were classified into ten expression categories (Table 1), by comparing the total expression of both homoeologs in an allotetraploid group to the gene expression level in a diploid group. The 10 categories were modified from the classification by Zhang et al., 2016a. Results of DE analysis on unphased gene expression (FC >2 and FDR <0.05) were used for the classification.

Table 1. Classification of additive and non-additive gene expression pattern in allotetraploids.

Group Description Classification criteria*
a Additive expression with no parental differentiation Cgi = xij = Coi
b Partial ELD or additive expression with parental differentiation (Cgi <xij < Coi) or
(Coi <xij < Cgi) or
(Cgi ≠ Coi and xij = Cgi and xij = Coi)
c Up-regulated ELD toward Cg2 xij = Cgi and xij >Coi
d Down-regulated ELD toward Cg2 xij = Cgi and xij <Coi
e Up-regulated ELD toward Co2 xij = Coi and xij >Cgi
f Down-regulated ELD toward Co2 xij = Coi and xij <Cgi
g Up-regulated TRE with no parental differentiation Cgi = Coi and xij >Cgi and xij >Coi
h Up-regulated TRE with parental differentiation Cgi ≠ Coi and xij >Cgi and xij >Coi
i Down-regulated TRE with no parental differentiation Cgi = Coi and xij <Cgi and xij <Coi
j Down-regulated TRE with parental differentiation Cgi ≠ Coi and xij <Cgi and xij <Coi
*

Cgi: expression level of gene i in the Cg2 group; Coi: expression level of gene i in the Co2 group; xij: expression level of gene i in allotetraploid group j, and j∈(Sd, Sh, Cbp); ELD: expression level dominance; TRE: transgressive expression; The significance of differential expression between groups were determined by results of differential expression analysis on unphased gene expression, with a threshold of fold-change >2 and false discovery rate <0.05.

Homoeolog expression bias

HEB of gene i in allotetraploid individual j was measured by the proportion of cg-homoeolog expression (cgij) in the total expression of both homoeologs (cgij/(cgij +coij)). The distribution of HEB was then viewed by individuals, chromosomes, or along genomic coordinates. Signs of the segregation or recombination of homoeologous chromosomes were revealed in resynthesized allopolyploids by the distribution of HEB along genomic coordinates.

The copy number of cg- and co-homoeologs of each gene and the breakpoints between chromosomal segments resulting from homoeologous recombination were estimated with a five-state HMM of gene-wise HEB, using a modified version of the R package ‘HMMcopy’ version 1.40.0 (Lai et al., 2022). The five states corresponded to (0, 1, 2, 3, 4) gene copies from Cg and (4, 3, 2, 1, 0) copies from Co, respectively. The expected optimal values of median HEB (m) were set to 0.01, 0.25, 0.5, 0.75, and 0.99 for the five states. The length of segments was controlled by arguments e and strength. e is the initial value of the transition probability that the state (copy number of cg homoeolog) does not change between two adjacent genes, and strength is the strength of this initial e. Smaller values of strength increase the flexibility of transition probability, and an extremely large value leads to almost fixed transition probabilities. For natural C. bursa-pastoris, e and strength were set to (1 – 1e-7) and 1e+7, respectively. For resynthesized allotetraploids, e was increased to (1 – 1e-10), and strength was increased to 1e+12, as homoeologous exchanges were expected to be rare for resynthesized allotetraploids which had only experienced one round of meiosis.

The effect of homoeologous segregation and recombination on HEB in resynthesized allotetraploids was analyzed by a generalized linear model with quasi-binomial error distribution and logit link function. Gene-wise HEB was reshaped as binomial data (read counts from cg homeolog and total counts from both homoeologs). The copy number of cg homoeologs (0, 1, 2, 3, and 4) estimated by HMM segmentation was used as the explanatory variable. The variance of HEB explained by the estimated copy number of cg homoeolog was assessed by the coefficient of determination (R2), calculated with R package ‘rsq’ version 2.5 (Zhang, 2022).

The relationship between non-additive gene expression and homoeolog expression was explored by comparing the estimated expression fold change of each homoeolog among additive and different non-additive expression categories. The expression fold change of homoeologs was measured by homoeolog expression of gene i in allotetraploid group k divided by expression of gene i in the corresponding diploid group, that is log2FC(cgik/Cg2i) or log2FC(coik/Co2i), using the estimated FC from DE analysis on phased data. The difference of expression fold change between EL-dominant and EL-recessive homoeologs was tested by Welch’s two-sample t-tests.

Loss of homoeolog expression

The most extreme HEB occurs when one homoeolog is silenced or lost in allotetraploids. To explore the timing and mechanism of the loss of homoeolog expression in Capsella, the number of genes with homoeolog expression loss was counted for each resynthesized or natural allotetraploid. Lowly or occasionally expressed genes were excluded from the analysis to reduce noise from sequencing and phasing. Specifically, if one homoeologous gene had obvious expression (CPM >5) in all six individuals of the corresponding diploid species, but had almost no expression in one allotetraploid individual (CPM <0.5), the case was considered a loss of homoeologous expression.

Gene ontology (GO) enrichment analysis

GO enrichment analysis was performed with R package ‘topGO’ version 2.52.0 (Alexa and Maintainer, 2023), using results of DE analysis on unphased gene expression and GO annotations of C. rubella downloaded from PlantRegMap (Tian et al., 2020). Genes that were differentially expressed in both Cbp-Sd and Cbp-Sh comparisons (FC >1.5 and FDR < 0.05; 578 genes in flowers and 345 in leaves) were used as the test set, and all genes with GO annotation and CPM > 1 in at least two samples were used as the background set (17,718 genes in flowers and 15,631 genes in leaves). GO terms were scored with the ‘classic’ algorithm and Fisher’s exact tests, and p-values were adjusted with the Benjamini-Hochberg procedure in R. GO terms with less than 10 annotated genes were excluded from the analysis.

Acknowledgements

Computation and data handling were provided by the Swedish National Infrastructure for Computing (SNIC) at Uppmax. We would like to thank Barbara Mable for her critical comments on the manuscript, and Elsa Sundkvist and Mattias Vass for their help with seed collecting and counting. Swedish Research Council (grant 2019–00806 and 2018–05973) Martin Lascoux, Nilsson-Ehle research grant from the Royal Physiographic Society in Lund Tianlin Duan, Erik Philip-Sörensen Foundation Martin Lascoux, Sven and Lily Lawski Foundation Tianlin Duan.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Tianlin Duan, Email: tianlin.duan42@gmail.com.

Martin Lascoux, Email: martin.lascoux@ebc.uu.se.

Regina S Baucom, University of Michigan, United States.

Christian R Landry, Université Laval, Canada.

Funding Information

This paper was supported by the following grants:

  • Swedish Research Council grant 2019-00806 to Martin Lascoux.

  • Erik Philip-Sorensen Foundation to Martin Lascoux.

  • Royal Physiographic Society in Lund Nilsson-Ehle research grant to Tianlin Duan.

  • Sven och Lily Lawskis fond to Tianlin Duan.

  • Swedish Research Council grant 2018-05973 to Martin Lascoux.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Resources, Data curation, Software, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Resources, Investigation, Methodology, Writing - review and editing.

Conceptualization, Methodology, Writing - review and editing.

Conceptualization, Resources, Supervision, Funding acquisition, Methodology, Project administration, Writing - review and editing.

Additional files

Supplementary file 1. Tray ID of each Capsella plant and the position of eight plants within each tray.
elife-88398-supp1.csv (2.3KB, csv)
Supplementary file 2. RNA-sequencing information.
elife-88398-supp2.csv (8.2KB, csv)
Supplementary file 3. Gene ontology (GO) terms that were overrepresented in differentially expressed genes (FC >1.5 and FDR <0.05) between resynthesized and natural allopolyploids.
elife-88398-supp3.csv (1.5KB, csv)
MDAR checklist

Data availability

The RNA-sequencing data of natural C. bursa-pastoris generated by this article are available in NCBI Sequence Read Archive (SRA) and can be accessed with BioProject number PRJNA848625. BioSample accessions are listed in Supplementary file 2. Unphased and phased expression datasets, phenotypic measurements, and R scripts used for statistical tests and visualization are available on GitHub (https://github.com/disporum/RNA104_paper2, copy archived at Disporum, 2023).

The following dataset was generated:

Duan T, Sicard A, Glémin S, Lascoux M. 2022. Resynthesized Capsella allopolyploids. NCBI BioProject. PRJNA848625

References

  1. Albertin W, Marullo P. Polyploidy in fungi: evolution after whole-genome duplication. Proceedings. Biological Sciences. 2012;279:2497–2509. doi: 10.1098/rspb.2012.0434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alexa A, Maintainer JR. topGO: enrichment analysis for gene Ontology. 2.52.0R Package. 2023 https://bioconductor.org/packages/release/bioc/html/topGO.html
  3. Anders S, Pyl PT, Huber W. HTSeq--a python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Anneberg TJ, Segraves KA. Nutrient enrichment and neopolyploidy interact to increase lifetime fitness of Arabidopsis thaliana. Plant and Soil. 2020;456:439–453. doi: 10.1007/s11104-020-04727-6. [DOI] [Google Scholar]
  5. Bachmann JA, Tedder A, Laenen B, Fracassetti M, Désamoré A, Lafon-Placette C, Steige KA, Callot C, Marande W, Neuffer B, Bergès H, Köhler C, Castric V, Slotte T. Genetic basis and timing of a major mating system shift in Capsella. The New Phytologist. 2019;224:505–517. doi: 10.1111/nph.16035. [DOI] [PubMed] [Google Scholar]
  6. Bachmann JA, Tedder A, Fracassetti M, Steige KA, Lafon-Placette C, Köhler C, Slotte T. On the origin of the widespread self-compatible allotetraploid Capsella bursa-pastoris (Brassicaceae) Heredity. 2021;127:124–134. doi: 10.1038/s41437-021-00434-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Baniaga AE, Marx HE, Arrigo N, Barker MS. Polyploid plants have faster rates of multivariate niche differentiation than their diploid relatives. Ecology Letters. 2020;23:68–78. doi: 10.1111/ele.13402. [DOI] [PubMed] [Google Scholar]
  8. Barker MS, Arrigo N, Baniaga AE, Li Z, Levin DA. On the relative abundance of autopolyploids and allopolyploids. The New Phytologist. 2016;210:391–398. doi: 10.1111/nph.13698. [DOI] [PubMed] [Google Scholar]
  9. Barreto FS, Pereira RJ, Burton RS. Hybrid dysfunction and physiological compensation in gene expression. Molecular Biology and Evolution. 2015;32:613–622. doi: 10.1093/molbev/msu321. [DOI] [PubMed] [Google Scholar]
  10. Beaulieu JM, Leitch IJ, Patel S, Pendharkar A, Knight CA. Genome size is a strong predictor of cell size and stomatal density in angiosperms. The New Phytologist. 2008;179:975–986. doi: 10.1111/j.1469-8137.2008.02528.x. [DOI] [PubMed] [Google Scholar]
  11. Behling AH, Shepherd LD, Cox MP. The importance and prevalence of allopolyploidy in Aotearoa New Zealand. Journal of the Royal Society of New Zealand. 2020;50:189–210. doi: 10.1080/03036758.2019.1676797. [DOI] [Google Scholar]
  12. Bekaert M, Edger PP, Pires JC, Conant GC. Two-phase resolution of polyploidy in the Arabidopsis metabolic network gives rise to relative and absolute dosage constraints. The Plant Cell. 2011;23:1719–1728. doi: 10.1105/tpc.110.081281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Blasio F, Prieto P, Pradillo M, Naranjo T. Genomic and meiotic changes accompanying polyploidization. Plants. 2022;11:125. doi: 10.3390/plants11010125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bomblies K. When everything changes at once: finding a new normal after genome duplication. Proceedings. Biological Sciences. 2020;287:20202154. doi: 10.1098/rspb.2020.2154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Bottani S, Zabet NR, Wendel JF, Veitia RA. Gene expression dominance in allopolyploids. Hypotheses and Models. 2018;23:393–402. doi: 10.1016/j.tplants.2018.01.002. [DOI] [PubMed] [Google Scholar]
  16. Buggs RJA, Doust AN, Tate JA, Koh J, Soltis K, Feltus FA, Paterson AH, Soltis PS, Soltis DE. Gene loss and silencing in Tragopogon miscellus (Asteraceae): comparison of natural and synthetic allotetraploids. Heredity. 2009;103:73–81. doi: 10.1038/hdy.2009.24. [DOI] [PubMed] [Google Scholar]
  17. Buggs RJA, Zhang L, Miles N, Tate JA, Gao L, Wei W, Schnable PS, Barbazuk WB, Soltis PS, Soltis DE. Transcriptomic shock generates evolutionary novelty in a newly formed, natural allopolyploid plant. Current Biology. 2011;21:551–556. doi: 10.1016/j.cub.2011.02.016. [DOI] [PubMed] [Google Scholar]
  18. Cheng F, Sun C, Wu J, Schnable J, Woodhouse MR, Liang J, Cai C, Freeling M, Wang X. Epigenetic regulation of subgenome dominance following whole genome triplication in Brassica rapa. The New Phytologist. 2016;211:288–299. doi: 10.1111/nph.13884. [DOI] [PubMed] [Google Scholar]
  19. Chester M, Gallagher JP, Symonds VV, Cruz da Silva AV, Mavrodiev EV, Leitch AR, Soltis PS, Soltis DE. Extensive chromosomal variation in a recently formed natural allopolyploid species, Tragopogon miscellus (Asteraceae) PNAS. 2012;109:1176–1181. doi: 10.1073/pnas.1112041109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Combes MC, Hueber Y, Dereeper A, Rialle S, Herrera JC, Lashermes P. Regulatory divergence between parental alleles determines gene expression patterns in hybrids. Genome Biology and Evolution. 2015;7:1110–1121. doi: 10.1093/gbe/evv057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Cox MP, Dong T, Shen GG, Dalvi Y, Scott DB, Ganley ARD. An interspecific fungal hybrid reveals cross-kingdom rules for allopolyploid gene expression patterns. PLOS Genetics. 2014;10:e1004180. doi: 10.1371/journal.pgen.1004180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. de Mendiburu F, Yaseen M. Agricolae: statistical procedures for agricultural research. 1.3-7R Package. 2020 https://cran.r-project.org/web/packages/agricolae/index.html
  23. Disporum Rna104_Paper2. swh:1:rev:f7f29482ea71be2b7f195b55d91787ffba5d57aaSoftware Heritage. 2023 https://archive.softwareheritage.org/swh:1:dir:699e174748f2be727b2d50627af6747f7d38b9cb;origin=https://github.com/disporum/RNA104_paper2;visit=swh:1:snp:bfc0474d9066e8db6b2b3e543d5af4dcf43715b5;anchor=swh:1:rev:f7f29482ea71be2b7f195b55d91787ffba5d57aa
  24. Domínguez-Delgado JJ, López-Jurado J, Mateos-Naranjo E, Balao F. Phenotypic diploidization in plant functional traits uncovered by synthetic neopolyploids in Dianthus broteri. Journal of Experimental Botany. 2021;72:5522–5533. doi: 10.1093/jxb/erab179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Douglas GM, Gos G, Steige KA, Salcedo A, Holm K, Josephs EB, Arunkumar R, Ågren JA, Hazzouri KM, Wang W, Platts AE, Williamson RJ, Neuffer B, Lascoux M, Slotte T, Wright SI. Hybrid origins and the earliest stages of diploidization in the highly successful recent polyploid Capsella bursa-pastoris. PNAS. 2015;112:2806–2811. doi: 10.1073/pnas.1412277112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Drake PL, Froend RH, Franks PJ. Smaller, faster stomata: scaling of stomatal size, rate of response, and stomatal conductance. Journal of Experimental Botany. 2013;64:495–505. doi: 10.1093/jxb/ers347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Duan T, Sicard A, Glémin S, Lascoux M. Expression pattern of resynthesized allotetraploid Capsella is determined by hybridization, not whole-genome duplication. The New Phytologist. 2023;237:339–353. doi: 10.1111/nph.18542. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Duchemin W, Dupont PY, Campbell MA, Ganley ARD, Cox MP. HyLiTE: accurate and flexible analysis of gene expression in hybrid and allopolyploid species. BMC Bioinformatics. 2015;16:8. doi: 10.1186/s12859-014-0433-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Duminil J, Hardy OJ, Petit RJ. Plant traits correlated with generation time directly affect inbreeding depression and mating system and indirectly genetic structure. BMC Evolutionary Biology. 2009;9:1–14. doi: 10.1186/1471-2148-9-177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Edger PP, Smith R, McKain MR, Cooley AM, Vallejo-Marin M, Yuan Y, Bewick AJ, Ji L, Platts AE, Bowman MJ, Childs KL, Washburn JD, Schmitz RJ, Smith GD, Pires JC, Puzey JR. Subgenome dominance in an interspecific hybrid, synthetic allopolyploid, and a 140-year-old naturally established neo-allopolyploid monkeyflower. The Plant Cell. 2017;29:2150–2167. doi: 10.1105/tpc.17.00010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Flagel LE, Wendel JF. Evolutionary rate variation, genomic dominance and duplicate gene expression evolution during allotetraploid cotton speciation. The New Phytologist. 2010;186:184–193. doi: 10.1111/j.1469-8137.2009.03107.x. [DOI] [PubMed] [Google Scholar]
  32. Fox J, Weisberg S. An R Companion to Applied Regression. Thousand Oaks CA: Sage; 2019. [Google Scholar]
  33. Fyon F, Cailleau A, Lenormand T. Enhancer runaway and the evolution of diploid gene expression. PLOS Genetics. 2015;11:e1005665. doi: 10.1371/journal.pgen.1005665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Greer E, Martín AC, Pendle A, Colas I, Jones AME, Moore G, Shaw P. The Ph1 locus suppresses Cdk2-type activity during premeiosis and meiosis in wheat. The Plant Cell. 2012;24:152–162. doi: 10.1105/tpc.111.094771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Griffiths AG, Moraga R, Tausen M, Gupta V, Bilton TP, Campbell MA, Ashby R, Nagy I, Khan A, Larking A, Anderson C, Franzmayr B, Hancock K, Scott A, Ellison NW, Cox MP, Asp T, Mailund T, Schierup MH, Andersen SU. Breaking free: the genomics of allopolyploidy-facilitated niche expansion in white clover. The Plant Cell. 2019;31:1466–1487. doi: 10.1105/tpc.18.00606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Grover CE, Gallagher JP, Szadkowski EP, Yoo MJ, Flagel LE, Wendel JF. Homoeolog expression bias and expression level dominance in allopolyploids. The New Phytologist. 2012;196:966–971. doi: 10.1111/j.1469-8137.2012.04365.x. [DOI] [PubMed] [Google Scholar]
  37. Henry IM, Dilkes BP, Tyagi A, Gao J, Christensen B, Comai L. The BOY NAMED SUE quantitative trait locus confers increased meiotic stability to an adapted natural allopolyploid of Arabidopsis. The Plant Cell. 2014;26:181–194. doi: 10.1105/tpc.113.120626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Hollister JD, Gaut BS. Epigenetic silencing of transposable elements: A trade-off between reduced transposition and deleterious effects on neighboring gene expression. Genome Research. 2009;19:1419–1428. doi: 10.1101/gr.091678.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hollister JD. Polyploidy: adaptation to the genomic environment. The New Phytologist. 2015;205:1034–1039. doi: 10.1111/nph.12939. [DOI] [PubMed] [Google Scholar]
  40. Hothorn T, Bretz F, Westfall P. Simultaneous inference in general parametric models. Biometrical Journal. Biometrische Zeitschrift. 2008;50:346–363. doi: 10.1002/bimj.200810425. [DOI] [PubMed] [Google Scholar]
  41. Hu G, Wendel JF. Cis-trans controls and regulatory novelty accompanying allopolyploidization. The New Phytologist. 2019;221:1691–1700. doi: 10.1111/nph.15515. [DOI] [PubMed] [Google Scholar]
  42. Hurka H, Freundner S, Brown AHD, Plantholt U. Aspartate aminotransferase isozymes in the genus Capsella (Brassicaceae): subcellular location, gene duplication, and polymorphism. Biochemical Genetics. 1989;27:77–90. doi: 10.1007/BF00563019. [DOI] [PubMed] [Google Scholar]
  43. Hurka H, Friesen N, German DA, Franzke A, Neuffer B. “Missing link” species Capsella orientalis and Capsella thracica elucidate evolution of model plant genus Capsella (Brassicaceae) Molecular Ecology. 2012;21:1223–1238. doi: 10.1111/j.1365-294X.2012.05460.x. [DOI] [PubMed] [Google Scholar]
  44. Husband BC, Baldwin SJ, Sabara HA. Direct vs. indirect effects of whole-genome duplication on prezygotic isolation in Chamerion angustifolium: Implications for rapid speciation. American Journal of Botany. 2016;103:1259–1271. doi: 10.3732/ajb.1600097. [DOI] [PubMed] [Google Scholar]
  45. Jenczewski E, Eber F, Grimaud A, Huet S, Lucas MO, Monod H, Chèvre AM. PrBn, a major gene controlling homeologous pairing in oilseed rape (Brassica napus) haploids. Genetics. 2003;164:645–653. doi: 10.1093/genetics/164.2.645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Johnson NA. Hybrid incompatibility genes: remnants of a genomic battlefield? Trends in Genetics. 2010;26:317–325. doi: 10.1016/j.tig.2010.04.005. [DOI] [PubMed] [Google Scholar]
  47. Katagiri Y, Hasegawa J, Fujikura U, Hoshino R, Matsunaga S, Tsukaya H. The coordination of ploidy and cell size differs between cell layers in leaves. Development. 2016;143:1120–1125. doi: 10.1242/dev.130021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Kryvokhyzha D, Holm K, Chen J, Cornille A, Glémin S, Wright SI, Lagercrantz U, Lascoux M. The influence of population structure on gene expression and flowering time variation in the ubiquitous weed C apsella bursa‐pastoris (Brassicaceae) Molecular Ecology. 2016;25:1106–1121. doi: 10.1111/mec.13537. [DOI] [PubMed] [Google Scholar]
  49. Kryvokhyzha D, Milesi P, Duan T, Orsucci M, Wright SI, Glémin S, Lascoux M, Ross-Ibarra J. Towards the new normal: transcriptomic convergence and genomic legacy of the two subgenomes of an allopolyploid weed (Capsella bursa-pastoris) PLOS Genetics. 2019a;15:e1008131. doi: 10.1371/journal.pgen.1008131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kryvokhyzha D, Salcedo A, Eriksson MC, Duan T, Tawari N, Chen J, Guerrina M, Kreiner JM, Kent TV, Lagercrantz U, Stinchcombe JR, Glémin S, Wright SI, Lascoux M, de Meaux J. Parental legacy, demography, and admixture influenced the evolution of the two subgenomes of the tetraploid Capsella bursa-pastoris (Brassicaceae) PLOS Genetics. 2019b;15:e1007949. doi: 10.1371/journal.pgen.1007949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Lai D, Ha G, Shah S. Hmmcopy: copy number prediction with correction for GC and Mappability bias for HTS data. 1.38.0R Package Version. 2022 https://bioconductor.org/packages/release/bioc/html/HMMcopy
  52. Landis JB, Kurti A, Lawhorn AJ, Litt A, McCarthy EW. Differential gene expression with an emphasis on floral organ size differences in natural and synthetic polyploids of Nicotiana tabacum (Solanaceae) Genes. 2020;11:1097. doi: 10.3390/genes11091097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Lashermes P, Combes MC, Hueber Y, Severac D, Dereeper A. Genome rearrangements derived from homoeologous recombination following allopolyploidy speciation in coffee. The Plant Journal. 2014;78:674–685. doi: 10.1111/tpj.12505. [DOI] [PubMed] [Google Scholar]
  54. Lashermes P, Hueber Y, Combes MC, Severac D, Dereeper A. Inter-genomic DNA exchanges and homeologous gene silencing shaped the nascent allopolyploid coffee genome (Coffea arabica L.) G3: Genes, Genomes, Genetics. 2016;6:2937–2948. doi: 10.1534/g3.116.030858. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Lesaffre T, Billiard S. The joint evolution of lifespan and self-fertilization. Journal of Evolutionary Biology. 2020;33:41–56. doi: 10.1111/jeb.13543. [DOI] [PubMed] [Google Scholar]
  56. Levin DA. Minority cytotype exclusion in local plant populations. TAXON. 1975;24:35–43. doi: 10.2307/1218997. [DOI] [Google Scholar]
  57. Li A, Liu D, Wu J, Zhao X, Hao M, Geng S, Yan J, Jiang X, Zhang L, Wu J, Yin L, Zhang R, Wu L, Zheng Y, Mao L. mRNA and Small RNA transcriptomes reveal insights into dynamic homoeolog regulation of allopolyploid heterosis in nascent hexaploid wheat. The Plant Cell. 2014;26:1878–1900. doi: 10.1105/tpc.114.124388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Li N, Xu C, Zhang A, Lv R, Meng X, Lin X, Gong L, Wendel JF, Liu B. DNA methylation repatterning accompanying hybridization, whole genome doubling and homoeolog exchange in nascent segmental rice allotetraploids. The New Phytologist. 2019;223:979–992. doi: 10.1111/nph.15820. [DOI] [PubMed] [Google Scholar]
  59. Li M, Wang R, Wu X, Wang J. Homoeolog expression bias and expression level dominance (ELD) in four tissues of natural allotetraploid Brassica napus. BMC Genomics. 2020;21:330. doi: 10.1186/s12864-020-6747-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Lloyd A, Blary A, Charif D, Charpentier C, Tran J, Balzergue S, Delannoy E, Rigaill G, Jenczewski E. Homoeologous exchanges cause extensive dosage-dependent gene expression changes in an allopolyploid crop. The New Phytologist. 2018;217:367–377. doi: 10.1111/nph.14836. [DOI] [PubMed] [Google Scholar]
  61. Lunter G, Goodson M. Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads. Genome Research. 2011;21:936–939. doi: 10.1101/gr.111120.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. 2000;290:1151–1155. doi: 10.1126/science.290.5494.1151. [DOI] [PubMed] [Google Scholar]
  63. Maherali H, Walden AE, Husband BC. Genome duplication and the evolution of physiological responses to water stress. The New Phytologist. 2009;184:721–731. doi: 10.1111/j.1469-8137.2009.02997.x. [DOI] [PubMed] [Google Scholar]
  64. Martín AC, Rey MD, Shaw P, Moore G. Dual effect of the wheat Ph1 locus on chromosome synapsis and crossover. Chromosoma. 2017;126:669–680. doi: 10.1007/s00412-017-0630-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Münzbergová Z. Colchicine application significantly affects plant performance in the second generation of synthetic polyploids and its effects vary between populations. Annals of Botany. 2017;120:329–339. doi: 10.1093/aob/mcx070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Neuffer B, Paetsch M. Flower morphology and pollen germination in the genus Capsella (Brassicaceae) Flora - Morphology, Distribution, Functional Ecology of Plants. 2013;208:626–640. doi: 10.1016/j.flora.2013.09.007. [DOI] [Google Scholar]
  67. Nicolas SD, Leflon M, Monod H, Eber F, Coriton O, Huteau V, Chèvre AM, Jenczewski E. Genetic regulation of meiotic cross-overs between related genomes in Brassica napus haploids and hybrids. The Plant Cell. 2009;21:373–385. doi: 10.1105/tpc.108.062273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Novikova PY, Tsuchimatsu T, Simon S, Nizhynska V, Voronin V, Burns R, Fedorenko OM, Holm S, Säll T, Prat E, Marande W, Castric V, Nordborg M. Genome sequencing reveals the origin of the allotetraploid Arabidopsis suecica. Molecular Biology and Evolution. 2017;34:957–968. doi: 10.1093/molbev/msw299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Ohno S. Evolution by Gene Duplication. Springer; 1970. [DOI] [Google Scholar]
  70. Okonechnikov K, Conesa A, García-Alcalde F. Qualimap 2: advanced multi-sample quality control for high-throughput sequencing data. Bioinformatics. 2016;32:292–294. doi: 10.1093/bioinformatics/btv566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Paape T, Briskine RV, Halstead-Nussloch G, Lischer HEL, Shimizu-Inatsugi R, Hatakeyama M, Tanaka K, Nishiyama T, Sabirov R, Sese J, Shimizu KK. Patterns of polymorphism and selection in the subgenomes of the allopolyploid Arabidopsis kamchatica. Nature Communications. 2018;9:3909. doi: 10.1038/s41467-018-06108-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Parisod C, Salmon A, Zerjal T, Tenaillon M, Grandbastien MA, Ainouche M. Rapid structural and epigenetic reorganization near transposable elements in hybrid and allopolyploid genomes in Spartina. The New Phytologist. 2009;184:1003–1015. doi: 10.1111/j.1469-8137.2009.03029.x. [DOI] [PubMed] [Google Scholar]
  73. Pelé A, Rousseau-Gueutin M, Chèvre AM. Speciation success of polyploid plants closely relates to the regulation of meiotic recombination. Frontiers in Plant Science. 2018;9:907. doi: 10.3389/fpls.2018.00907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. R Core Team . R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2020. [Google Scholar]
  75. Renny-Byfield S, Gong L, Gallagher JP, Wendel JF. Persistence of subgenomes in paleopolyploid cotton after 60 my of evolution. Molecular Biology and Evolution. 2015;32:1063–1071. doi: 10.1093/molbev/msv001. [DOI] [PubMed] [Google Scholar]
  76. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Roddy AB, Théroux-Rancourt G, Abbo T, Benedetti JW, Brodersen CR, Castro M, Castro S, Gilbride AB, Jensen B, Jiang GF, Perkins JA, Perkins SD, Loureiro J, Syed Z, Thompson RA, Kuebbing SE, Simonin KA. The scaling of genome size and cell size limits maximum rates of photosynthesis with implications for ecological strategies. International Journal of Plant Sciences. 2020;181:75–87. doi: 10.1086/706186. [DOI] [Google Scholar]
  78. Roux C, Pannell JR. Inferring the mode of origin of polyploid species from next-generation sequence data. Molecular Ecology. 2015;24:1047–1059. doi: 10.1111/mec.13078. [DOI] [PubMed] [Google Scholar]
  79. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez J-Y, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Schnable JC, Springer NM, Freeling M. Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss. PNAS. 2011;108:4069–4074. doi: 10.1073/pnas.1101368108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Shan S, Boatwright JL, Liu X, Chanderbali AS, Fu C, Soltis PS, Soltis DE. Transcriptome dynamics of the inflorescence in reciprocally formed Allopolyploid Tragopogon miscellus (Asteraceae) Frontiers in Genetics. 2020;11:888. doi: 10.3389/fgene.2020.00888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Shi X, Ng DW-K, Zhang C, Comai L, Ye W, Chen ZJ. Cis- and trans-regulatory divergence between progenitor species determines gene-expression novelty in Arabidopsis allopolyploids. Nature Communications. 2012;3:950. doi: 10.1038/ncomms1954. [DOI] [PubMed] [Google Scholar]
  83. Slotte T, Hazzouri KM, Ågren JA, Koenig D, Maumus F, Guo YL, Steige K, Platts AE, Escobar JS, Newman LK, Wang W, Mandáková T, Vello E, Smith LM, Henz SR, Steffen J, Takuno S, Brandvain Y, Coop G, Andolfatto P, Hu TT, Blanchette M, Clark RM, Quesneville H, Nordborg M, Gaut BS, Lysak MA, Jenkins J, Grimwood J, Chapman J, Prochnik S, Shu S, Rokhsar D, Schmutz J, Weigel D, Wright SI. The Capsella rubella genome and the genomic consequences of rapid mating system evolution. Nature Genetics. 2013;45:831–835. doi: 10.1038/ng.2669. [DOI] [PubMed] [Google Scholar]
  84. Snodgrass SJ, Jareczek J, Wendel JF. An examination of nucleotypic effects in diploid and polyploid cotton. AOB PLANTS. 2017;9:plw082. doi: 10.1093/aobpla/plw082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Sousa F, Neiva J, Martins N, Jacinto R, Anderson L, Raimondi PT, Serrão EA, Pearson GA. Increased evolutionary rates and conserved transcriptional response following allopolyploidization in brown algae. Evolution; International Journal of Organic Evolution. 2019;73:59–72. doi: 10.1111/evo.13645. [DOI] [PubMed] [Google Scholar]
  86. Szadkowski E, Eber F, Huteau V, Lodé M, Huneau C, Belcram H, Coriton O, Manzanares-Dauleux MJ, Delourme R, King GJ, Chalhoub B, Jenczewski E, Chèvre A-M. The first meiosis of resynthesized Brassica napus, a genome blender. The New Phytologist. 2010;186:102–112. doi: 10.1111/j.1469-8137.2010.03182.x. [DOI] [PubMed] [Google Scholar]
  87. Tang H, Woodhouse MR, Cheng F, Schnable JC, Pedersen BS, Conant G, Wang X, Freeling M, Pires JC. Altered patterns of fractionation and exon deletions in Brassica rapa support a two-step model of paleohexaploidy. Genetics. 2012;190:1563–1574. doi: 10.1534/genetics.111.137349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Tian F, Yang DC, Meng YQ, Jin J, Gao G. PlantRegMap: charting functional regulatory maps in plants. Nucleic Acids Research. 2020;48:D1104–D1113. doi: 10.1093/nar/gkz1020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Vicient CM, Casacuberta JM. Impact of transposable elements on polyploid plant genomes. Annals of Botany. 2017;120:195–207. doi: 10.1093/aob/mcx078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Wang J, Tian L, Lee H-S, Wei NE, Jiang H, Watson B, Madlung A, Osborn TC, Doerge RW, Comai L, Chen ZJ. Genomewide nonadditive gene regulation in Arabidopsis allotetraploids. Genetics. 2006;172:507–517. doi: 10.1534/genetics.105.047894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Wang X, Zhang H, Li Y, Zhang Z, Li L, Liu B. Transcriptome asymmetry in synthetic and natural allotetraploid wheats, revealed by RNA-sequencing. The New Phytologist. 2016;209:1264–1277. doi: 10.1111/nph.13678. [DOI] [PubMed] [Google Scholar]
  92. Wei Y, Li G, Zhang S, Zhang S, Zhang H, Sun R, Zhang R, Li F. Analysis of transcriptional changes in different Brassica napus synthetic allopolyploids. Genes. 2021;12:82. doi: 10.3390/genes12010082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Wendel JF, Lisch D, Hu G, Mason AS. The long and short of doubling down: polyploidy, epigenetics, and the temporal dynamics of genome fractionation. Current Opinion in Genetics & Development. 2018;49:1–7. doi: 10.1016/j.gde.2018.01.004. [DOI] [PubMed] [Google Scholar]
  94. Wilson MJ, Fradera-Soler M, Summers R, Sturrock CJ, Fleming AJ. Ploidy influences wheat mesophyll cell geometry, packing and leaf function. Plant Direct. 2021;5:e00314. doi: 10.1002/pld3.314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Woodhouse MR, Cheng F, Pires JC, Lisch D, Freeling M, Wang X. Origin, inheritance, and gene regulatory consequences of genome dominance in polyploids. PNAS. 2014;111:5283–5288. doi: 10.1073/pnas.1402475111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Wu J, Lin L, Xu M, Chen P, Liu D, Sun Q, Ran L, Wang Y. Homoeolog expression bias and expression level dominance in resynthesized allopolyploid Brassica napus. BMC Genomics. 2018;19:586. doi: 10.1186/s12864-018-4966-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Xiong Z, Gaeta RT, Edger PP, Cao Y, Zhao K, Zhang S, Pires JC, Parkin I. Chromosome inheritance and meiotic stability in allopolyploid Brassica napus. G3. 2021;11:jkaa011. doi: 10.1093/g3journal/jkaa011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Ye B, Wang R, Wang J. Correlation analysis of the mRNA and miRNA expression profiles in the nascent synthetic allotetraploid Raphanobrassica. Scientific Reports. 2016;6:37416. doi: 10.1038/srep37416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Yoo MJ, Szadkowski E, Wendel JF. Homoeolog expression bias and expression level dominance in allopolyploid cotton. Heredity. 2013;110:171–180. doi: 10.1038/hdy.2012.94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Yu X, Zhai Y, Wang P, Cheng C, Li J, Lou Q, Chen J. Morphological, anatomical and photosynthetic consequences of artificial allopolyploidization in Cucumis. Euphytica. 2021;217:1–13. doi: 10.1007/s10681-020-02735-2. [DOI] [Google Scholar]
  101. Zhang H, Bian Y, Gou X, Zhu B, Xu C, Qi B, Li N, Rustgi S, Zhou H, Han F, Jiang J, von Wettstein D, Liu B. Persistent whole-chromosome aneuploidy is generally associated with nascent allohexaploid wheat. PNAS. 2013;110:3447–3452. doi: 10.1073/pnas.1300153110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Zhang H, Gou X, Zhang A, Wang X, Zhao N, Dong Y, Li L, Liu B. Transcriptome shock invokes disruption of parental expression-conserved genes in tetraploid wheat. Scientific Reports. 2016a;6:26363. doi: 10.1038/srep26363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Zhang D, Pan Q, Tan C, Zhu B, Ge X, Shao Y, Li Z. Genome-wide gene expressions respond differently to A-subgenome origins in Brassica napus synthetic hybrids and natural allotetraploid. Frontiers in Plant Science. 2016b;7:1508. doi: 10.3389/fpls.2016.01508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Zhang D. Rsq: R-squared and related measures. version 2.5R Package. 2022 https://rdrr.io/cran/rsq/#:~:text=Calculate%20generalized%20R%2Dsquared%2C%20partial,with%20well%20defined%20variance%20functions)

eLife assessment

Regina S Baucom 1

This important study offers new insight into how floral and reproductive phenotypes and gene expression evolve in allopolyploids. The authors marshal compelling evidence, using well-constructed genetic lines, RNA sequencing, and phenotypic analyses to distinguish the roles of hybridization, whole genome duplication, and subsequent evolution in phenotypes associated with the selfing syndrome and in gene expression. The work will be of interest to researchers working in plant speciation and genomics, as well as those more broadly interested in the effects of genome copy number on phenotypic and expression evolution.

Reviewer #1 (Public Review):

Anonymous

This study aims to determine the relative importance of the immediate effects of allopolyploidization from subsequent evolution in phenotypic traits associated with the selfing syndrome and in gene expression traits in the selfing allopolyploid Capsella bursa-pastoris and its diploid progenitors Capsella grandiflora, which is outcrossing, and Capsella orientalis, which is selfing. To do this, they compared five categories of plant: the two progenitors of the allopolyploid, hybrids resynthesized from the progenitors with a whole-genome duplication either before or after the hybridization event, and the naturally occurring allopolyploid.

Two lines of evidence were used: phenotypic data from the plants grown in a common environment, and RNAseq data from a subset of the plants.

The phenotypic data indicate that the selfing syndrome of C. bursa-pastoris likely evolved after the initial allopolyploidization event, and that pollen and seed viability recovered following the allopolyploidization event. The results are compelling but would benefit from small clarifications to the methods and statistics to account for possible positional effects in the growth chamber. Using a linear mixed model rather than a simple ANOVA would solve this problem.

The RNAseq data are used to explore overall expression patterns (using multi-dimensional scaling), patterns of differential expression (additive, dominant, or transgressive), and homeolog expression bias, and to determine the relative contributions of the original allopolyploidization event and subsequent evolution. Statistical cutoffs were used to categorize gene expression patterns, but the description and categorization of these patterns appears to have been largely qualitative, and might be strengthened by including more statistical detail in questions like whether homeologous expression bias did indeed show more variation in resynthesized and evolved allopolyploids.

The study includes evidence that homeolog expression bias (overrepresentation of an allele from one species) results in part from homeologous synapsis (uneven inheritance of chromosome segments). These deviations from patterns consistent with 2:2 inheritance of genomic regions are highly variable between individuals in resynthesized allopolyploids but appeared to be mostly consistent within (but not between) populations in natural C. bursa-pastoris. This is intriguing evidence that segregation can be an important source of variation in allopolyploids. However, it was limited by the difficulty of inferring homeologous recombination breakpoints with RNAseq data because of the scale of recombination in wild populations (rather than resynthesized allopolyploids). In future identifying such breakpoints will be an interesting direction for this and other allopolyploid systems.

This research suggests many follow-up questions. In particular, it may be possible to identify evidence about the mechanism of the original hybridization event. How frequently do unreduced gametes occur in these species, and is it likely that C. bursa-pastoris evolved via a triploid bridge? Exploring the viability, fertility, and phenotypes of triploids produced in both directions could be a valuable future direction.

Future research, or the current study, could also valuably explore what kinds of genes experienced what forms of expression evolution. A brief description of GO terms frequently represented in genes which showed strong patterns of expression evolution might be suggestive of which selective pressures led to the changes in expression in the C. bursa-pastoris lineage, and to what extent they related to adaptation to polyploidization (e.g. cell-cycle regulators), compensating for the initial pollen and seed inviability or adapting to selfing (endosperm- or pollen-specific genes), or adaptation to abiotic conditions.

Overall, this is an interesting and valuable contribution to the field's understanding of how expression evolves in interaction with hybridization and polyploidy. Particularly in combination with the team's previous study on these lines, this experimental design is effective for separating the contributions of hybridization, WGD, and evolution over time.

Update: the authors have thoughtfully and thoroughly updated the manuscript to address all the questions I raised. I appreciate the chance to review this valuable contribution to the scientific literature.

Reviewer #2 (Public Review):

Anonymous

The flowering plant Capsella bursa-pastoris is an allotetraploid formed from the genomes of Capsella orientalis and Capsella grandiflora. An outstanding question in the evolution of allotetraploids is the relative contribution of immediate consequences of allopolyploidization vs. long-term evolution after the event. The authors address this question by re-synthesizing the allotetraploid in the lab using the two progenitor species, and comparing its phenotypic and gene expression variation to naturally occurring C. bursa-pastoris. They find evidence primarily for long-term phenotypic evolution towards a selfing syndrome in C. bursa-pastoris, and a combination of short and long-term changes to gene expression.

The manuscript is thorough and provides lots of new insights into the mechanisms driving evolution in allopolyploids. I especially appreciated the detailed examination of different mechanisms driving gene expression variation. There are some important limitations of the experimental design related to independent evolution of the progenitor species and effects of the colchicine treatment used to induce polyploidy, but these are well-addressed in the Discussion.

eLife. 2024 Jan 8;12:RP88398. doi: 10.7554/eLife.88398.3.sa3

Author Response

Tianlin Duan 1, Adrien Sicard 2, Sylvain Glémin 3, Martin Lascoux 4

The following is the authors’ response to the original reviews.

Public Reviews:

The study could also valuably explore what kinds of genes experienced what forms of expression evolution. A brief description of GO terms frequently represented in genes which showed strong patterns of expression evolution might be suggestive of which selective pressures led to the changes in expression in the C. bursa-pastoris lineage, and to what extent they related to adaptation to polyploidization (e.g. cell-cycle regulators), compensating for the initial pollen and seed inviability or adapting to selfing (endosperm- or pollen-specific genes), or adaptation to abiotic conditions. ”

We did not include a gene ontology (GO) analysis in the first place as we did not have a clear expectation on the GO terms that would be enriched in the genes that are differentially expressed between resynthesized and natural allotetraploids. Even if we only consider adaptive changes, the modifications could occur in various aspects, such as stabilizing meiosis, adapting to the new cell size, reducing hybrid incompatibility and adapting to self-fertilization. And each of these modifications involves numerous biological processes and molecular functions. As we could make post-hoc stories for too many GO terms, extrapolating at this stage have limited implications and could be misleading.

Nonetheless, we are not the only study that compared newly resynthesized and established allopolyploids. GO terms that were repeatedly revealed by this type of exploratory analysis may give a hint for future studies. For this reason, now we have reported the results of a simple GO analysis.

Recommendations for the authors: please note that you control which, if any, revisions, to undertake

The majority of concerns from reviewers and the reviewing editor are in regards to the presentation of the manuscript; that the framing of the manuscript does not help the general reader understand how this work advances our knowledge of allopolyploid evolution in the broad sense. The manuscript may be challenging to read for those who aren't familiar with the study system or the genetic basis of polyploidy/gene expression regulation. Further, it is difficult to understand from the introduction how this work is novel compared to the recently published work from Duan et al and compared to other systems. Because eLife is a journal that caters to a broad readership, re-writing the introduction to bring home the novelty for the reader will be key.

Additionally, the writing is quite technical and contains many short-hands and acronyms that can be difficult to keep straight. Revising the full text for clarity (and additionally not using acronyms) would help highlight the findings for a larger audience.

Reviewer #1 (Recommendations For The Authors):

Most of my suggestions on this interesting and well-written study are minor changes to clarify the writing and the statistical approaches.

The use of abbreviations throughout for both transcriptional phenomena and lines is logical because of word limits, but for me as a reader, it really added to the cognitive burden. Even though writing out "homoeolog expression bias" or "hybridization-first" every time would add length, I would find it easier to follow and suspect others would too.

Thank you for this suggestion. Indeed, using less uncommon acronyms or short-hands should increase the readability of the text for broader audience. Now in most places, we refer to “Sd/Sh” and “Cbp” as “resynthesized allotetraploids” and “natural allotetraploids”, respectively. We have also replaced the most occurrences of the acronyms for transcriptional phenomena (ELD, HEB and TRE) with full phrases, unless there are extra attributes before them (such as “Cg-/Co-ELD” and “relic/Cbp-specific ELD”).

It would be helpful to include complete sample sizes to either a slightly modified Figure 1 or the beginning of the methods, just to reduce mental arithmetic ("Each of the five groups was represented by six "lines", and each line had six individuals" so there were 180 total plants, of which 167 were phenotyped - presumably the other 13 died? - and 30 were sequenced).

The number 167 only applied to floral morphorlogical traits (“Floral morphological traits were measured for all five groups on 167 plants…”), but the exact total sample size for other traits differed. Now the total sample sizes of other traits have also been added to beginning of the second paragraph of the methods.

For this study 180 seedings have been transplanted from Petri dishes to soil, but 8 seedlings died right after transplanting, seemingly caused by mechanical damage and insufficient moistening. Later phenotyping (2020.02-2020.05) was also disrupted by the COVID-19 pandemic, and some individuals were not measured as we missed the right life stages. Specifically, 5 individuals were missing for floral morphological traits (sepal width, sepal length, petal width, petal length, pistil width, pistil length, and stamen length), 30 for pollen traits, 1 for stem length, and 2 for flowering time. As for seed traits, we only measured individuals with more than ten fruits, so apart from the reasons mentioned above, individuals that were self-incompatible and had insufficient hand-pollination were also excluded. We spotted another mistake during the revision: two individuals with floral morphological measurements had no positional information (tray ID). These measurements were likely mis-sampled or mislabeled, and were therefore excluded from analysis. We assumed most of these missing values resulted from random technical mistakes and were not directly related to the measured traits.

In general, the methods did a thorough job of describing the genomics approaches but could have used more detail for the plant growth (were plants randomized in the growth chamber, can you rule out block/position effects) and basic statistics (what statistical software was used to perform which tests comparing groups in each section, after the categories were identified).

When describing the methods, mention whether the plants; this should be straightforward as a linear model with position as a covariate.

Data used in the present study and a previously published work (Duan et al., 2023) were different subsets of a single experiment. For this reason, we spent fewer words in describing shared methods in this manuscript but tried to summarize some methods that were essential for understanding the current paper. But as you have pointed out, we did miss many important details that should have been kept. Now we have added some description and a table (Supplementary file 1) in the “Plant material” section for explaining randomization, and added more information of the software used for performing statistic tests in the “Phenotyping” section.

Although we did not mention in the present manuscript, we used a randomized block design for the experiment (Author response image 1).

Author response image 1.

Author response image 1. Plant positions inside the growth chamber.

Author response image 1.

Plants used in the present study and Duan et al. (2023) were different subsets of a single experiment. The entire experiment had eight plant groups, including the five plant groups used in the present study diploid C. orientalis (Co2), diploid C. grandiflora (Cg2), “whole-genome-duplication-first” (Sd) and “hybridization-first”(Sh) resynthesized allotetraploids, and natural allotetraploids, C. bursa pastoris (Cbp), as well as three plant groups that were only used in Duan et al. (2023; tetraploid C. orientalis (Co4), tetraploid C. grandiflora (Cg4) and diploid hybrids (F)). Each of the eight plant groups had six lines and each line represented by six plants, resulting in 288 plants (8 groups x 6 lines x 6 individuals = 288 plants). The 288 plants were grown in 36 trays placed on six shelves inside the same growth chamber. Each tray had exactly one plant from each of the eight groups, and the position of the eight plants within each tray (A-H) were randomized with random.shuffle() method in Python (Supplementary file 1). The position of the 36 trays inside the growth room (1-36) was also random and the positions of all trays were shuffled once again 28 days after germination (randomized with RAND() and sorting in Microsoft Excel Spreadsheet). (a) Plant distribution; (b) An example of one tray; (c) A view inside the growth chamber, showing the six benches.

With the randomized block design and one round of shuffling, positional effect is very unlikely to bias the comparison among the five plant groups. The main risk of not adding positions to the statistical model is increasing error variance and decreasing the statistical power for detecting group effect. As we had already observed significant among-group variation in all phenotypic traits (p-value <2.2e-16 for group effect in most tests), further increasing statistical power is not our primary concern. In addition, during the experiment we did not notice obvious difference in plant growth related to positions. Although we could have added more variables to account for potential positional effects (tray ID, shelf ID, positions in a tray etc.), adding variables with little effect may reduce statistical power due to the loss of degree of freedom.

Due to one round of random shuffling, positions cannot be easily added as a single continuous variable. Now we have redone all the statistical tests on phenotypic traits and included tray ID as a categorical factor (Figure 2-Source Data 1). In general, the results were similar to the models without tray ID. The F-values of group effect was only slightly changed, and p-values were almost unchanged in most cases (still < 2.2e-16). The tray effect (df=35) was not significant in most tests and was only significant in petal length (p-value=0.0111), sepal length (p-value=0.0242) and the number of seeds in ten fruits (p-value=0.0367). As expected, positions (tray ID) had limited effect on phenotypic traits.

Figure 2 - I assume the numbers at the top indicate sample sizes but perhaps add this to the figure caption.

Statistical power depends on both the total sample size and the sample size of each group, especially the group with the fewest observations. We lost different number of measurements in each phenotypic trait, and for pollen traits we did have a notable loss, so we chose to show sample sizes above each group to increase transparency. Since we had five different sets of sample sizes (for floral morphological traits, stem length, days to flowering, pollen traits and seed traits, respectively), it would be cumbersome to introduce all 25 numbers in figure caption and could be hard for readers to match the sample sizes with results. For this reason, we would like to keep the sample sizes in the figure, and now we have modified the legend to clarify that the numbers above groups are sample sizes.

’The trend has been observed in a wide range of organisms, including ...’ - perhaps group Brassica and Raphanobrassica into one clause in the sentence, since separating them out undermines the diversity somewhat.

Indeed, it is very strange to put “cotton” between two representatives from Brassicaceae. Now the sentence is changed to “… including Brassica (Wu et al., 2018; Li et al., 2020; Wei et al., 2021) and Raphanobrassica (Ye et al., 2016), cotton (Yoo et al., 2013)…”

The diagrams under the graph in Figure 4B are particularly helpful for understanding the expression patterns under consideration! I appreciated them a lot!

Thank you for the comment. We also feel the direction of expression level dominance is convoluted and hard to remember, so we adopted the convention of showing the directions with diagrams.

Reviewer #2 (Recommendations For The Authors):

The science is very interesting and thorough, so my comments are mostly meant to improve the clarity of the manuscript text:

  • I found it challenging to remember the acronyms for the different gene expression phenomena and had to consistently cross-reference different parts of the manuscript to remind myself. I think using the full phrase once or twice at the start of a paragraph to remind readers what the acronym stands for could improve readability.

Thank you for this reasonable suggestion. Now we have replaced the most occurrence of acronyms with the full phrases.

  • There are some technical terms, such as "homoeologous synapsis" and "disomic inheritance", which I think are under-defined in the current text.

Indeed these terms were not well-defined before using in the manuscript. Now we have added a brief explanation for each term.

  • Under the joint action of these forces, allopolyploid subgenomes are further coordinated and degenerated, and subgenomes are often biasedly fractionated" This sentence has some unclear terminology. Does "coordinated" mean co-adapted, co-inherited, or something else? Is "biasedly fractionated" referring to biased inheritance or evolution of one of the parental subgenomes?

We apologize for not using accurate terms. With “coordinated” we emphasized the evolution of both homoeologs depends on the selection on total expression of both homoeologs, and on both relative and absolute dosages, which may have shifted away from optima after allopolyploidization. “Co-evolved” or “co-adapted” might be a better word.

But the term "biasedly fractionation" has been commonly used for referring to the phenomenon that genes from one subgenome of polyploids are preferentially retained during diploidization (Woodhouse et al., 2014; Wendel, 2015). Instead of inventing a new term, we prefer to keep the same term for consistency, so readers could link our findings with numerous studies in this field. Now the sentence is changed to “Under the joint action of these forces, allopolyploid subgenomes are further co-adapted and degenerated, and subgenomes are often biasedly retained, termed biased fractionation”.

  • There are a series of paragraphs in the results, starting with "Resynthesized allotetraploids and the natural Cbp had distinct floral morphologies", which consistently reference Figure 1 where they should be referencing Figure 2.

Thank you for spotting this mistake! Now the numbers have been corrected.

  • ‘The number of pollen grains per flower decreased in natural Cbp’ this wording implies it's the effect of some experimental treatment on Cbp, rather than just measured natural variation.

Yes, it is not scientifically precise to say this in the Results section, especially when describing details of results. We meant that assuming resynthesized allopolyploids are good approximation of the initial state of natural allotetraploid C. bursa-pastoris, our results indicate that the number of pollen grains had decreased in natural C. bursa-pastoris. But this is an implication, rather than an observation, so the sentence is better rewritten as “Natural allotetraploids had less pollen grains per flower.”

  • ‘The percentage of genes showing complete ELD was altogether limited but doubled between resynthesized allotetraploid groups and natural allotetraploids’ for clarity, I would suggest revising this to something like "doubled in natural allotetraploids relative to resynthesized allotetraploids

Thank you for the suggestion. The sentence has been revised as suggested.

  • I'm not sure I understand what the difference is between expression-level dominance and homeolog expression bias. It seems to me like the former falls under the umbrella of the latter.

Expression-level dominance and homeolog expression bias are easily confused, but they are conceptually independent. One gene could have expression-level dominance without any homeolog expression bias, or strong homeolog expression bias without any expression-level dominance. The concepts were well explained in Grover et al., (2012) with nice figures.

Expression level dominance compares the total expression level of both homoeologs in allopolyploids with the expression of the same gene in parental species, and judges whether the total expression level in allopolyploids is only similar to one of the parental species. The contributions from different homoeologs are not distinguished.

While homoeolog expression bias compares the relative expression level of each homoeologs in allopolyploids, with no implication on the total expression of both homoeologs.

Let the expression level of one gene in parental species X and Y be e(X) and e(Y), respectively. And let the expression level of x homoeolog (from species X) and y homoeolog (from species Y) in allopolyploids be e(x) and e(y), respectively.

Then a (complete) expression level dominance toward species X means:e(x)+e(y)=e(X) and e(x)+e(y)≠e(Y);

While a homoeolog expression bias toward species X means:e(x) > e(y), or e(x)/e(y) > e(X)/e(Y), depending on the definition of studies.

Both expression-level dominance and homeolog expression bias have been widely studied in allopolyploids (Combes et al., 2013; Li et al., 2014; Yoo et al., 2014; Hu & Wendel, 2019). As the two phenomena could be in opposite directions, and may be caused by different mechanisms, we think adopting the definitions in Grover et al., (2012) and distinguishing the two concepts would facilitate communication.

  • Is it possible to split up the results in Figure 7 to show which of the two homeologs was lost (i.e. orientalis vs. grandiflora)? Or at least clarify in the legend that these scenarios are pooled together in the figure?

Maybe using acronyms without explanation made the figure titles hard to understand, but in the original Figure 7 the loss of two homoeologs were shown separately. Figure 7a,c showed the loss of C. orientalis-homoeolog (“co-expession loss”), and Figure 7b,d showed the loss of C. grandiflora-homoeolog (“cg-expession loss”). Now the legends have been modified to explain the Figure.

  • The paragraph starting with "The extant diploid species" is too long, should probably be split into two paragraphs and edited for clarity.

The whole paragraph was used to explain why the resynthesized allotetraploids could be a realistic approximation of the early stage of C. bursa-pastoris with two arguments:

1. The further divergence between C. grandiflora and C. orientalis after the formation of C. bursa-pastoris should be small compared to the total divergence between the two parental species;

2. The mating systems of real parental populations were most likely the same as today.Now the two arguments were separated as two paragraphs, and the second paragraph has been shortened.

  • On the other hand, the number of seeds per fruit" implies this is evidence for an alternative hypothesis, when I think it's really just more support for the same idea.

“On the other hand” was used to contrast the reduced number of pollen grains and the increased number of seeds in natural allotetraploids. As both changes are typical selfing syndrome, indeed the two support the same idea. We replaced the “On the other hand” with “Moreover”.

  • ‘has become self-compatible before the formation" "has become" should be "became".

The tense of the word has been changed.

  • If natural C. bursa-pastoris indeed originated from the hybridization between C. grandiflora-like outcrossing plants and C. orientalis-like self-fertilizing plants, the selfing syndrome in C. bursa-pastoris does not reflect the instant dominance effect of the C. orientalis alleles, but evolved afterward.’ This sentence should be closer to the end of the paragraph, after the main morphological results are summarized.

Thank you for the suggestion. The paragraph is indeed more coherent after moving the conclusion sentence.

References

Combes, M.C., Dereeper, A., Severac, D., Bertrand, B. & Lashermes, P. (2013) Contribution of subgenomes to the transcriptome and their intertwined regulation in the allopolyploid Coffea arabica grown at contrasted temperatures. New Phytologist, 200, 251–260.

Grover, C.E., Gallagher, J.P., Szadkowski, E.P., Yoo, M.J., Flagel, L.E. & Wendel, J.F. (2012) Homoeolog expression bias and expression level dominance in allopolyploids. New Phytologist, 196, 966–971.

Hu, G. & Wendel, J.F. (2019) Cis – trans controls and regulatory novelty accompanying allopolyploidization. New Phytologist, 221, 1691–1700.

Li, A., Liu, D., Wu, J., Zhao, X., Hao, M., Geng, S., et al. (2014) mRNA and Small RNA Transcriptomes Reveal Insights into Dynamic Homoeolog Regulation of Allopolyploid Heterosis in

Nascent Hexaploid Wheat. The Plant Cell, 26, 1878–1900.Wendel, J.F. (2015) The wondrous cycles of polyploidy in plants. American Journal of Botany, 102, 1753–1756.

Woodhouse, M.R., Cheng, F., Pires, J.C., Lisch, D., Freeling, M. & Wang, X. (2014) Origin, inheritance, and gene regulatory consequences of genome dominance in polyploids. Proceedings of the National Academy of Sciences of the United States of America, 111, 5283–5288.

Yoo, M.J., Liu, X., Pires, J.C., Soltis, P.S. & Soltis, D.E. (2014) Nonadditive Gene Expression in Polyploids. https://doi.org/10.1146/annurev-genet-120213-092159, 48, 485–517.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Duan T, Sicard A, Glémin S, Lascoux M. 2022. Resynthesized Capsella allopolyploids. NCBI BioProject. PRJNA848625

    Supplementary Materials

    Figure 1—source data 1. Capsella plants used in the present study.
    Figure 2—source data 1. Effects of plant group and positions (tray ID) on phenotypes.
    Figure 4—source data 1. Additive and non-additive gene expression in allotetraploid groups.
    Figure 8—source data 1. Expression level fold-change (log2FC) of homoeologs relative to the corresponding gene in diploid groups among genes with expression level dominance (ELD) in flowers or leaves.
    Supplementary file 1. Tray ID of each Capsella plant and the position of eight plants within each tray.
    elife-88398-supp1.csv (2.3KB, csv)
    Supplementary file 2. RNA-sequencing information.
    elife-88398-supp2.csv (8.2KB, csv)
    Supplementary file 3. Gene ontology (GO) terms that were overrepresented in differentially expressed genes (FC >1.5 and FDR <0.05) between resynthesized and natural allopolyploids.
    elife-88398-supp3.csv (1.5KB, csv)
    MDAR checklist

    Data Availability Statement

    The RNA-sequencing data of natural C. bursa-pastoris generated by this article are available in NCBI Sequence Read Archive (SRA) and can be accessed with BioProject number PRJNA848625. BioSample accessions are listed in Supplementary file 2. Unphased and phased expression datasets, phenotypic measurements, and R scripts used for statistical tests and visualization are available on GitHub (https://github.com/disporum/RNA104_paper2, copy archived at Disporum, 2023).

    The following dataset was generated:

    Duan T, Sicard A, Glémin S, Lascoux M. 2022. Resynthesized Capsella allopolyploids. NCBI BioProject. PRJNA848625


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES